10.1101/2024.08.14.607850

The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling

2024-08-17