You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Traceback (most recent call last):
File "nvt.py", line 60, in <module>
transformed_workflow.to_parquet(os.path.join("/tmp", "train/"))
File "/usr/local/lib/python3.8/dist-packages/merlin/io/dataset.py", line 902, in to_parquet
tf_metadata.to_proto_text_file(output_path)
File "/usr/local/lib/python3.8/dist-packages/merlin/schema/io/tensorflow_metadata.py", line 153, in to_proto_text_file
_write_file(self.to_proto_text(), path, file_name)
File "/usr/local/lib/python3.8/dist-packages/merlin/schema/io/tensorflow_metadata.py", line 139, in to_proto_text
return proto_utils.better_proto_to_proto_text(self.proto_schema, schema_pb2.Schema())
File "/usr/local/lib/python3.8/dist-packages/merlin/schema/io/proto_utils.py", line 84, in better_proto_to_proto_text
message.ParseFromString(bytes(better_proto_message))
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/message.py", line 199, in ParseFromString
return self.MergeFromString(serialized)
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/internal/python_message.py", line 1128, in MergeFromString
if self._InternalParse(serialized, 0, length) != length:
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/internal/python_message.py", line 1195, in InternalParse
pos = field_decoder(buffer, new_pos, end, self, field_dict)
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/internal/decoder.py", line 705, in DecodeRepeatedField
if value.add()._InternalParse(buffer, pos, new_pos) != new_pos:
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/internal/python_message.py", line 1195, in InternalParse
pos = field_decoder(buffer, new_pos, end, self, field_dict)
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/internal/decoder.py", line 726, in DecodeField
if value._InternalParse(buffer, pos, new_pos) != new_pos:
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/internal/python_message.py", line 1195, in InternalParse
pos = field_decoder(buffer, new_pos, end, self, field_dict)
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/internal/decoder.py", line 705, in DecodeRepeatedField
if value.add()._InternalParse(buffer, pos, new_pos) != new_pos:
File "/usr/local/lib/python3.8/dist-packages/google/protobuf/internal/python_message.py", line 1178, in InternalParse
raise message_mod.DecodeError('Field number 0 is illegal.')
google.protobuf.message.DecodeError: Field number 0 is illegal.
Expected behavior
Dataset is written to parquet files with padded lists and value counts.
Environment details (please complete the following information):
Describe the bug
to_parquet
throws an error whenListSlice(..., pad=True)
andValueCount()
are combined.Steps/Code to reproduce bug
Container version: nvcr.io/nvidia/merlin/merlin-tensorflow:22.10
Add
ValueCount
to Cell 12:Error:
Expected behavior
Dataset is written to parquet files with padded lists and value counts.
Environment details (please complete the following information):
nvcr.io/nvidia/merlin/merlin-tensorflow:22.10
docker run --rm -it --net host --gpus 0 -v /home/edwardk/data:/root/data nvcr.io/nvidia/merlin/merlin-tensorflow:22.10 bash
The text was updated successfully, but these errors were encountered: