Back
OmniGene-4HGNC bioRxivLLM appliedNew tool

OmniGene-4: A Unified Bio-Language MoE Model with Router-Level Interpretability.

Wang LbioRxiv 2026 · June 2026
Relevance score
7/10
Disease / domain
Multimodal bio-language model
Source
bioRxiv
DOI 10.64898/2026.05.12.724542
Share on LinkedIn

Tool / method

MoE (Mixture-of-Experts) model jointly processing natural language, DNA sequences, and protein sequences with router-level interpretability

Summary

OmniGene-4 is a multimodal Mixture-of-Experts (MoE) bio-language model jointly processing natural language, DNA sequences, and protein sequences to answer sequence-grounded biological questions. Router-level interpretability analysis reveals that each expert specializes on distinct question types (structure, function, annotation), providing a window into the model's internal mechanisms. Performance on mixed genomic benchmarks surpasses that of specialized unimodal models. The model is available open-source via Hugging Face.

Synthesis written by Geno'X. For the full original abstract, please refer to the source publication.

Analysis

Multimodal MoE models are a promising path for genomic AI: they allow unifying heterogeneous representations (sequence, annotation, phenotype) in a common space. Router-level interpretability is an original methodological contribution, but performance on real clinical benchmarks (pathogenic variants, diagnosis) remains to be demonstrated.

Why this score?

Clinical impact: 2/3 · Evidence strength: 2/3 · Novelty: 2/2 · Sample size: 1/1 · Publication status: 0/1 → Total: 7/10

Keywords

genomic LLMfoundation modelMixture-of-ExpertsDNAproteininterpretabilityAI
Weekly report in your inbox

Every Wednesday · Annotated selection · Free · Unsubscribe anytime