If you're pickling that much data you should definitely consider using a more appropriate data format. Maybe a database or HDF5?
You mean numpy arrays? I think the most efficient way to store them is via np.save. You could try creating a new directory and store all of your arrays there with clever file naming to retrieve the dictionary structure later.
Alternatively if you are up to trying to use pytorch you can convert the arrays to tensors and use torch.save to save the entire dictionary in one file. Installing pytorch just for this might be a bit overkill as it is a >1GB installation tho.
It's a tuple of dicts of tuples of array.array, no numpy or torch :(
So python standard library lists?
this post was submitted on 09 Jun 2023
0 points (50.0% liked)
Python
3244 readers
1 users here now
News and discussions about the programming language Python
founded 5 years ago
MODERATORS