Loading then saving a VCF imported dataset fails with `TypeError: expected unicode string, found 130` #991

benjeffery · 2023-01-09T15:46:12Z

vcf_to_zarr("example.vcf.gz", "vcf.zarr")
ds = sg.load_dataset("vcf.zarr")
sg.save_dataset(ds, "vcf2.zarr")

Currently fails with
TypeError: expected unicode string, found 130
I believe this is due to pydata/xarray#3476 as the workaround suggested there:

    for v in list(ds.coords.keys()):
        if ds.coords[v].dtype == object:
            ds[v].encoding.clear()

    for v in list(ds.variables.keys()):
        if ds[v].dtype == object:
            ds[v].encoding.clear()

prevents this error. I'm not sure there is anything sgkit can do about this - looking into it.

The text was updated successfully, but these errors were encountered:

tomwhite · 2023-01-09T15:48:43Z

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loading then saving a VCF imported dataset fails with `TypeError: expected unicode string, found 130` #991

Loading then saving a VCF imported dataset fails with `TypeError: expected unicode string, found 130` #991

benjeffery commented Jan 9, 2023

tomwhite commented Jan 9, 2023

benjeffery commented Jan 9, 2023

benjeffery commented Jan 9, 2023

Loading then saving a VCF imported dataset fails with TypeError: expected unicode string, found 130 #991

Loading then saving a VCF imported dataset fails with TypeError: expected unicode string, found 130 #991

Comments

benjeffery commented Jan 9, 2023

tomwhite commented Jan 9, 2023

benjeffery commented Jan 9, 2023

benjeffery commented Jan 9, 2023

Loading then saving a VCF imported dataset fails with `TypeError: expected unicode string, found 130` #991

Loading then saving a VCF imported dataset fails with `TypeError: expected unicode string, found 130` #991