Overview#

BandHiC is a Python package for efficient storage, manipulation, and analysis of Hi-C matrices using a banded matrix representation.

BandHiC adopts a banded storage scheme that stores only a configurable diagonal bandwidth of the dense Hi-C contact matrices. This design can reduce memory usage by up to 99% compared to dense matrices, while still supporting fast random access and user-friendly indexing operations.

Main features include:

  • Efficient storage of Hi-C matrices using a banded representation.

  • Fast random access and indexing operations.

  • Support for flexible masking of missing values, outliers, and unmappable regions.

  • Vectorized operations optimized with NumPy for high performance.

  • Scalable for ultra-high-resolution Hi-C data analysis.