10.1101/2024.08.14.607850
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling
2024-08-17