common, xe: consolidate serialization API #2802

echeresh · 2025-03-03T23:38:17Z

This PR consolidates the serialization API in src/gpu/intel and src/common.

List of changes:

- Moved serialized_t/serialized_data_t/deserializer_t functionality from src/gpu/intel to src/common
  - Merged serialized_data_t, serialized_t and serialization_stream_t functionality into serialization_stream_t
- Renamed src/common/{serialization_stream.hpp => serialization.hpp} as it now includes the broad serialization API (not just the stream)
- Renamed src/common/{serialization.hpp => primitive_serialization.hpp}

I'm not 100% sure about the names, so please share suggestions if you have any.

I'm also listing possible API changes that would simplify/unify the serialization API, for review:

- Combine serialized_t and serialized_data_t into a single abstraction (implemented in this PR)
- Unify the serialize/deserialize member API and stick to obj.serialize(stream). There are currently two versions: see (1) below
- Introduce non-member functions for serialization/deserialization:
  - serialize(sstream, obj)
  - obj = deserialize<Object>(deserializer) or deserialize(deserializer, obj)
- Rename sstream.append() to sstream.serialize() to have a single name
- Rename deserializer.pop() to deserializer.deserialize() to have a single name
- Introduce a non-member deserialize(sstream, obj) that constructs a temporary deserializer_t on the fly and forwards the call to deserialize(deserializer, obj)
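The proposed non-member API can be sketched roughly as follows. This is only an illustration under assumptions: `serialization_stream_t` and `deserializer_t` here are toy stand-ins that reuse the names from the list above and are not the oneDNN classes, the `read()` helper and `point_t` are hypothetical, and only trivially copyable types are handled.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <type_traits>
#include <vector>

// Toy stand-in for the stream (assumption, not the oneDNN class).
struct serialization_stream_t {
    std::vector<uint8_t> data;
};

// Toy stand-in for the deserializer: a read cursor over a stream.
class deserializer_t {
public:
    explicit deserializer_t(const serialization_stream_t &s) : s_(s) {}
    // Hypothetical helper: copy `size` bytes out and advance the cursor.
    void read(void *dst, size_t size) {
        assert(idx_ + size <= s_.data.size());
        std::memcpy(dst, s_.data.data() + idx_, size);
        idx_ += size;
    }

private:
    size_t idx_ = 0;
    const serialization_stream_t &s_;
};

// serialize(sstream, obj) for trivially copyable types.
template <typename T>
typename std::enable_if<std::is_trivially_copyable<T>::value>::type serialize(
        serialization_stream_t &s, const T &obj) {
    const auto *p = reinterpret_cast<const uint8_t *>(&obj);
    s.data.insert(s.data.end(), p, p + sizeof(T));
}

// deserialize(deserializer, obj)
template <typename T>
typename std::enable_if<std::is_trivially_copyable<T>::value>::type
deserialize(deserializer_t &d, T &obj) {
    d.read(&obj, sizeof(T));
}

// obj = deserialize<Object>(deserializer)
template <typename T>
T deserialize(deserializer_t &d) {
    T obj {};
    deserialize(d, obj);
    return obj;
}

// deserialize(sstream, obj): builds a temporary deserializer and forwards.
template <typename T>
void deserialize(const serialization_stream_t &s, T &obj) {
    deserializer_t d(s);
    deserialize(d, obj);
}

// Hypothetical payload type used only for the round trip below.
struct point_t {
    int32_t x;
    int32_t y;
};

inline bool round_trip_example() {
    serialization_stream_t s;
    serialize(s, point_t {3, 4});
    serialize(s, uint64_t {42});
    deserializer_t d(s);
    const auto p = deserialize<point_t>(d);
    const auto v = deserialize<uint64_t>(d);
    return p.x == 3 && p.y == 4 && v == 42;
}
```

With this shape, call sites use a single spelling (`serialize(s, obj)`) whether or not the type also has a member `serialize`; member-based types would need an extra overload that is omitted here.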

(1)

$ grep 'void serialize(serialized_data' src -rIn
src/gpu/intel/gpu_post_ops.hpp:267:    void serialize(serialized_data_t &s) const {
src/gpu/intel/gpu_post_ops.hpp:309:    void serialize(serialized_data_t &s) const {
src/gpu/intel/gpu_post_ops.hpp:536:        void serialize(serialized_data_t &s) const {
src/gpu/intel/gpu_post_ops.hpp:597:    void serialize(serialized_data_t &s) const { s.append(ops_); }
src/gpu/intel/jit/conv/model.hpp:602:    void serialize(serialized_data_t &s) const { s.append(buckets_); }
src/gpu/intel/jit/conv/model.hpp:735:    void serialize(serialized_data_t &s) const {
src/gpu/intel/jit/conv/model.hpp:1083:    void serialize(serialized_data_t &s) const {
src/gpu/intel/jit/gemm/include/strategy.hpp:408:    void serialize(serialized_data_t &s) const
src/gpu/intel/jit/gemm/include/problem.hpp:268:    void serialize(serialized_data_t &s) const
src/gpu/intel/serialization.hpp:78:    // void serialize(serialized_data_t &) const
$ grep 'serialized_t serialize' src -rIn
src/gpu/intel/ocl/bnorm/reusable_bnorm.hpp:60:    serialized_t serialize() const {
src/gpu/intel/ocl/bnorm/nhwc_reusable.hpp:71:    serialized_t serialize() const {
src/gpu/intel/ocl/rnn/rnn_utils.hpp:175:    serialized_t serialize() const {
src/gpu/intel/ocl/gemm/xe_systolic_gemm_copy_kernel.hpp:126:    serialized_t serialize() const { return serialized_t(*this); }
src/gpu/intel/ocl/reusable_softmax.hpp:60:    serialized_t serialize() const {
src/gpu/intel/jit/v2/conv/kernel_desc.hpp:367:    serialized_t serialize() const override;
src/gpu/intel/jit/ir/kernel_desc.hpp:56:    virtual serialized_t serialize() const = 0;
src/gpu/intel/jit/conv/zero_out.hpp:52:    serialized_t serialize() const override;
src/gpu/intel/jit/gemm/gen_gemm_kernel.hpp:74:    serialized_t serialize() const { return serialized_t(problem_, strategy_); }
src/gpu/intel/serialization.hpp:252:    serialized_t serialize() const {

rjoursler · 2025-03-05T01:09:46Z

src/common/primitive_serialization.hpp

+namespace primitive_serialization {
+
+void serialize_post_ops(
+        serialization_stream_t &sstream, const post_ops_t &post_ops);


This code seems excessively verbose and the style is unaligned with how the GPU serialization tends to work. What if we just use a templated function instead?

namespace primitive { template <typename T> void serialize(serialization_stream_t &sstream, const T &t); }

And then use extern template in this file. After that, usage is primitive::serialize(stream, data), which seems more readable. Debatably, we could also drop the namespace: it doesn't seem to serve much of a purpose, as implementing multiple serializers for these core types seems like it would be an antipattern.
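For reference, a minimal single-file sketch of the "template plus extern template" idea might look like the following. Everything here is an assumption for illustration: the stream is a toy stand-in, `toy_scales_t`, `toy_post_ops_t`, `append_bytes` and `serialize_impl` are hypothetical names, and the "header" and "source" parts would normally live in separate files.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Toy stand-ins (assumptions, not the oneDNN types).
struct serialization_stream_t {
    std::vector<uint8_t> data;
    void append_bytes(const void *p, size_t n) {
        const auto *b = static_cast<const uint8_t *>(p);
        data.insert(data.end(), b, b + n);
    }
};
struct toy_scales_t {
    int32_t mask;
};
struct toy_post_ops_t {
    std::vector<int32_t> entries;
};

// ---- "Header" part: declaration only, plus extern template ----
namespace primitive {
template <typename T>
void serialize(serialization_stream_t &sstream, const T &t);

// Explicit instantiation declarations: including translation units do not
// instantiate these; they link against the definitions further down.
extern template void serialize<toy_scales_t>(
        serialization_stream_t &, const toy_scales_t &);
extern template void serialize<toy_post_ops_t>(
        serialization_stream_t &, const toy_post_ops_t &);
} // namespace primitive

// ---- "Source" part: per-type logic and explicit instantiations ----
namespace primitive {
namespace {
void serialize_impl(serialization_stream_t &s, const toy_scales_t &v) {
    s.append_bytes(&v.mask, sizeof(v.mask));
}
void serialize_impl(serialization_stream_t &s, const toy_post_ops_t &v) {
    // Store the element count first so a reader knows how much follows.
    const uint64_t n = v.entries.size();
    s.append_bytes(&n, sizeof(n));
    for (int32_t e : v.entries)
        s.append_bytes(&e, sizeof(e));
}
} // namespace

template <typename T>
void serialize(serialization_stream_t &sstream, const T &t) {
    serialize_impl(sstream, t);
}

// Explicit instantiation definitions matching the declarations above.
template void serialize<toy_scales_t>(
        serialization_stream_t &, const toy_scales_t &);
template void serialize<toy_post_ops_t>(
        serialization_stream_t &, const toy_post_ops_t &);
} // namespace primitive

inline size_t serialized_size_example() {
    serialization_stream_t s;
    primitive::serialize(s, toy_scales_t {1});
    primitive::serialize(s, toy_post_ops_t {{1, 2, 3}});
    return s.data.size();
}
```

Plain overloads (`serialize(stream, post_ops)`) reach the same call-site syntax without the template machinery, which is the trade-off being weighed in this thread.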

I agree, switching to overloading + dropping the namespace looks reasonable to me.

@densamoilov please confirm if these changes are fine with you as I see you implemented this originally.

echeresh · 2025-03-05T22:09:09Z

make test
set test_scope=NIGHTLY
disable benchdnn_all
enable benchdnn_conv
enable benchdnn_deconv
enable arch_gpu_xe-hpc

dzarukin · 2025-03-05T22:14:02Z

src/common/primitive_serialization.hpp

+#include "common/primitive_attr.hpp"
+#include "common/serialization.hpp"
+#include "common/type_helpers.hpp"
+#include "oneapi/dnnl/dnnl.h"


Minor: this include can probably be skipped in favor of c_types_map.

Removed, thanks.

dzarukin · 2025-03-05T22:29:37Z

src/common/serialization.hpp

+    size_t idx;
+    const serialization_stream_t &s;


Make them private?

Thanks, updated.

dzarukin · 2025-03-05T22:42:55Z

src/common/serialization.hpp

+        static const bool value = (sizeof(test<T>(0)) == sizeof(yes_t));
+    };
+
+    // Append helper function for structures with the member function


QQ: is append supposed to be public? Or should the ctor be the only user-facing entry point?

Yes, it's supposed to be public as it's used as sstream.append(...) in many places.
I put some suggestions for unifying the API in the description (e.g. switching to uniform serialize(sstream, obj) usage), but I'd leave that for the future.
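For context, the `sizeof(test<T>(0)) == sizeof(yes_t)` line in the hunk above is the classic SFINAE member-detection idiom, which lets a public `append` dispatch on whether a type provides its own `serialize` member. A rough sketch of how such a trait and dispatch could fit together is below; `stream_t`, `has_serialize` and `pair_t` are hypothetical names for this illustration, not the code in the PR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <type_traits>
#include <vector>

// Detects a (non-overloaded) member named `serialize` on T.
template <typename T>
struct has_serialize {
    using yes_t = char;
    using no_t = long;
    // Viable only if &U::serialize is well-formed.
    template <typename U>
    static yes_t test(decltype(&U::serialize));
    // Fallback for every other type.
    template <typename U>
    static no_t test(...);
    static const bool value = (sizeof(test<T>(0)) == sizeof(yes_t));
};

// Toy stream (assumption, not the oneDNN class).
struct stream_t {
    std::vector<uint8_t> data;

    // Types with a serialize member: delegate to it.
    template <typename T>
    typename std::enable_if<has_serialize<T>::value>::type append(
            const T &t) {
        t.serialize(*this);
    }

    // Trivially copyable types without one: copy the raw bytes.
    template <typename T>
    typename std::enable_if<!has_serialize<T>::value
            && std::is_trivially_copyable<T>::value>::type
    append(const T &t) {
        const auto *p = reinterpret_cast<const uint8_t *>(&t);
        data.insert(data.end(), p, p + sizeof(T));
    }
};

// Hypothetical user type that serializes field by field.
struct pair_t {
    int32_t a;
    int32_t b;
    void serialize(stream_t &s) const {
        s.append(a);
        s.append(b);
    }
};

inline size_t append_example() {
    stream_t s;
    s.append(pair_t {1, 2}); // goes through pair_t::serialize
    s.append(uint8_t {9}); // raw byte copy
    return s.data.size();
}
```

One known limitation of this form of the idiom: if `serialize` is overloaded, `&U::serialize` is ambiguous and the trait reports `false`.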

echeresh · 2025-03-08T02:20:41Z

make test
set test_scope=NIGHTLY
disable benchdnn_all
enable benchdnn_conv
enable benchdnn_deconv
enable arch_gpu_xe-hpc

common, xe: consolidate serialization API

009310b

echeresh added the platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel label Mar 3, 2025

echeresh requested review from a team as code owners March 3, 2025 23:38

echeresh mentioned this pull request Mar 4, 2025

[GPU] Shapeless conv: add scales support #2790

Open

rjoursler reviewed Mar 5, 2025


dzarukin approved these changes Mar 5, 2025

