MinerU 2.5 - Local OCR VLM | Text and Table Extraction Test

Dec 16, 2025
Data from YouTube Data API v3

Video Overview

Video Details

Published: 6 months ago
Duration: 12:26
Video ID: NlptCXNsurM
Language: en
Category: Education
Privacy: Public
Made for Kids: No
Video Type: Regular Video

Performance Metrics

Views: 149
Likes: 17
Comments: 2
Engagement Rate: 12.75%
Likes per 100 views: 11.41
Comments per 1K views: 13.42
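
The three derived figures above can be reproduced from the raw counts. The sketch below assumes that the engagement rate is (likes + comments) / views, which is not stated on the page but matches the listed 12.75%. The function name `engagement_metrics` and the dictionary keys are illustrative only.

```python
def engagement_metrics(views: int, likes: int, comments: int) -> dict:
    """Derive per-view ratios from raw counts.

    Assumption: engagement rate = (likes + comments) / views.
    """
    if views <= 0:
        raise ValueError("views must be positive")
    return {
        # Share of views that produced a like or a comment, in percent
        "engagement_rate_pct": round((likes + comments) / views * 100, 2),
        # Likes normalised to 100 views
        "likes_per_100_views": round(likes / views * 100, 2),
        # Comments normalised to 1,000 views
        "comments_per_1k_views": round(comments / views * 1000, 2),
    }


# Raw counts taken from the metrics listed above
metrics = engagement_metrics(views=149, likes=17, comments=2)
print(metrics)
```

With only 149 views, each additional like or comment shifts these ratios noticeably, so they are best read as rough indicators.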

Description

MinerU 2.5 is a 1.2B-parameter vision-language model that performs two-stage OCR. It supports text, table, and formula recognition. Is it better than classical OCR approaches? Let's test it!

Technical report: https://arxiv.org/pdf/2509.22186
mineru-vl-utils: https://github.com/opendatalab/mineru-vl-utils
Weights: https://huggingface.co/opendatalab/MinerU2.5-2509-1.2B

AI Academy: https://www.mlexpert.io/
LinkedIn: https://www.linkedin.com/in/venelin-valkov/
Follow me on X: https://twitter.com/venelin_valkov
Discord: https://discord.gg/UaNPxVD6tv
Subscribe: http://bit.ly/venelin-subscribe
GitHub repository: https://github.com/curiousily/AI-Bootcamp

👍 Don't forget to like, comment, and subscribe for more tutorials!

Join this channel to get access to the perks and support my work: https://www.youtube.com/channel/UCoW_WzQNJVAjxo4osNAxd_g/join
