LAYRA v2 (Large Academic Visual RAG Agent)
LAYRA v2 is a specialized Visual RAG system designed for the ethnopharmacology of Sceletium tortuosum. It processes full PDF pages as images, preserving layout and visual information, and uses a hybrid retrieval stack.
Architecture
- Visual Encoder: ColQwen 2.5 (ColBERT + Qwen-VL)
- Retrieval: Hybrid (Sparse BM25 + Dense ColBERT)
- Reranking: LLM-based Reranking (Generative Listwise)
- Vector DB: Milvus 2.5.5
- Infrastructure: Docker Compose (Lean Stack)
Performance
Evaluated on SAINTHALF/kanna-rag-gold-standard:
| Metric | Score |
|---|---|
| MRR@20 | 0.7403 |
| Recall@20 | 1.0000 |
| Latency (P50) | 1.2s |
Usage
This model represents the deployed system configuration described in the thesis.
Supersedes SAINTHALF/layra-v1-hybrid.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Evaluation results
- MRR@20 on Kanna RAG Gold Standardself-reported0.740
- Recall@20 on Kanna RAG Gold Standardself-reported1.000