vision-ai

Here are 76 public repositories matching this topic...

AI-powered video understanding: extract key frames from YouTube, Bilibili, and any video page, then get structured summaries via vision AI. Supports yt-dlp, Playwright, and some common cloud browsers.

  • Updated Mar 9, 2026
  • JavaScript

This repository demonstrates YOLOv8-based license plate recognition with GCP Vision AI integration, enabling versatile real-world applications like vehicle identification, traffic monitoring, and geospatial analysis while capturing vital media metadata for enhanced insights.

  • Updated Feb 1, 2024
  • Jupyter Notebook

MCQ_Grading_Bot is an AI-powered tool that grades solved MCQ exam sheets from images using Gemini Vision. It extracts student info, checks answers, calculates score, and displays detailed results—all through a simple Gradio interface in Colab.

  • Updated Jun 19, 2025
  • Jupyter Notebook
