MinerU 2.5 - Local OCR VLM | Text and Table Extraction Test
Dec 16, 2025•Channel
AI Analysis
Data from YouTube Data API v3•Updated Just now
Video Overview
Video Details
Published6 months ago
Duration12:26
Video IDNlptCXNsurM
Languageen
CategoryEducation
PrivacyPublic
Made for KidsNo
Video TypeRegular Video
Performance Metrics
Views149
Likes17
Comments2
Engagement Rate12.75%
Likes per 100 views11.41
Comments per 1K views13.42
Description
MinerU 2.5 is 1.2B vision-language model that does two-stage OCR. Support text, tables and formula recognition. Is it better than classical OCR approaches? Let's test it!
Technical report: https://arxiv.org/pdf/2509.22186
mineru vl utils: https://github.com/opendatalab/mineru-vl-utils
Weights: https://huggingface.co/opendatalab/MinerU2.5-2509-1.2B
AI Academy: https://www.mlexpert.io/
LinkedIn: https://www.linkedin.com/in/venelin-valkov/
Follow me on X: https://twitter.com/venelin_valkov
Discord: https://discord.gg/UaNPxVD6tv
Subscribe: http://bit.ly/venelin-subscribe
GitHub repository: https://github.com/curiousily/AI-Bootcamp
👍 Don't Forget to Like, Comment, and Subscribe for More Tutorials!
Join this channel to get access to the perks and support my work:
https://www.youtube.com/channel/UCoW_WzQNJVAjxo4osNAxd_g/join