Answer questions using web searches and citations
Generate radar plots for model performance metrics
View and submit LLM evaluations