Skip to content

Commit

Permalink
Merge pull request #68 from vkt1414/fix-parquet-nulls
Browse files Browse the repository at this point in the history
Handling null values
  • Loading branch information
fedorov authored May 1, 2024
2 parents 1f8e2c6 + 7059786 commit b5129e9
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion idc_index/index.py
Original file line number Diff line number Diff line change
Expand Up @@ -27,7 +27,7 @@ def __init__(self):
# Read index file
logger.debug(f"Reading index file v{idc_index_data.__version__}")
self.index = pd.read_parquet(file_path)
self.index = self.index.astype(str).replace("nan", "")
# self.index = self.index.astype(str).replace("nan", "")
self.index["series_size_MB"] = self.index["series_size_MB"].astype(float)
self.collection_summary = self.index.groupby("collection_id").agg(
{"Modality": pd.Series.unique, "series_size_MB": "sum"}
Expand Down

0 comments on commit b5129e9

Please sign in to comment.