🏠
Working from home
M.S. student in Multimodal LLMs | Vision-Language Modeling, Multimodal RAG, Efficient VLM Evaluation
-
Carnegie Mellon University
- Pittsburgh, PA, USA
-
10:38
(UTC -10:00)
Highlights
Popular repositories Loading
-
multimodal-rag-lab
multimodal-rag-lab PublicCompact multimodal RAG baseline with chunking, BM25 retrieval and prompt assembly.
Python
-
vlm-eval-mini
vlm-eval-mini PublicFast sanity-check evaluator for vision-language model outputs.
Python
-
vision-token-bridge
vision-token-bridge PublicTiny PyTorch adapter bridging visual features into language-token embeddings.
Python
-
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.