3 8 15

Praveen Kaushik

PKaushik

AI & ML interests

Generative AI, SLMs, LLMs, Deep Learning, NLP, CV

Recent Activity

upvoted a collection 10 days ago

FaceLLM

liked a Space 8 months ago

davanstrien/ocr-time-capsule

upvoted a paper 8 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

View all activity

Organizations

upvoted a collection 10 days ago

FaceLLM

Collection

A multimodal large language model trained specifically for facial image understanding. Project page: https://www.idiap.ch/paper/facellm • 3 items • Updated Jul 23, 2025 • 4

liked a Space 8 months ago

OCR Time Capsule

📦

Compare original and improved OCR text from historical documents

upvoted a paper 8 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 217

liked a Space 8 months ago

Chat.gradio.app with HFIPs

🌖

gradio chat app MCP and gpt-oss powered

upvoted an article 8 months ago

Article

Consilium: When Multiple LLMs Collaborate

Jul 17, 2025

•

liked a Space 10 months ago

Multimodal OCR

🍍

405

Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR

liked 2 Spaces 11 months ago

Flux Quantized or Original?

🔎

Generate and compare quantized images from prompts

MedGemma - Radiology Explainer Demo

🩺

240

Radiology Image & Report Explainer Demo. Built with MedGemma

upvoted a changelog 11 months ago

Hugging Face Changelog

Filter by MCP compatibility available in HF Spaces

May 21, 2025

• 79

liked 2 Spaces about 1 year ago

OctoTools

🚀

128

An Agentic Framework with Tools for Complex Reasoning

Vision Papers

💻

114

All paper summaries read by Merve

upvoted a collection about 1 year ago

AIMO Progress Prize

Collection

Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19, 2024 • 14

updated 4 Spaces over 1 year ago

LegalLLM

🌖

LegalLLM using SaulLLM

Human Activity Recognition

⚡

Questionnanswer Document

🏆

Upload document and ask question about it – Groq will answer

Imageanalyzer

👀

Multimodal Image Analyzer using Groq

liked a model over 1 year ago

vidore/colpali

Visual Document Retrieval • Updated Nov 24, 2025 • 5.09k • 476

New activity in cvachet/pdf-chatbot over 1 year ago

🚩 Report: Not working

#15 opened over 1 year ago by

be4zad

New activity in HuggingFaceTB/inspect_web_clusters over 1 year ago

Web Cluster visualization error

#2 opened over 1 year ago by

palaashag

updated a collection over 1 year ago

WebGpu

Collection

2 items • Updated Sep 24, 2024

Praveen Kaushik

AI & ML interests

Recent Activity

Organizations

PKaushik's activity

OCR Time Capsule

Chat.gradio.app with HFIPs

Consilium: When Multiple LLMs Collaborate

Multimodal OCR

Flux Quantized or Original?

MedGemma - Radiology Explainer Demo

Filter by MCP compatibility available in HF Spaces

OctoTools

Vision Papers

LegalLLM

Human Activity Recognition

Questionnanswer Document

Imageanalyzer

🚩 Report: Not working

Web Cluster visualization error