Omni-DNA 1B
Omni-DNA 1B is a transformer-based genomic language model that encodes unambiguous DNA sequences into fixed-length embeddings and estimates sequence log probabilities via next-token prediction. The API supports batches of up to 2 sequences, each up to 2048 bp, with GPU-accelerated inference on 1B parameters. Omni-DNA 1B is pretrained on 300B nucleotides and achieves strong performance on NT and Genomic Benchmark tasks, making it suitable for regulatory element modeling, variant prioritization pipelines, and downstream classifier or generative model conditioning.
