Friday, August 21, 2009

Robust OCR in video

I used the "Robust OCR dataset" below to make a system for reading runner bibs in video. Standard ML techniques give fairly good results without much tweaking -- AdaBoost with stumps to go through all connected components (in thresholded image) and generate potential candidates, SVM/Gaussian kernel to classify those candidates into digits. Here's a screenshot and a video of this system in action.

Video

17 comments:

hr0nix said...

Are black rectangles false positives from AdaBoost?

Yaroslav Bulatov said...

Kind of, rectangles are initial candidates generated by AdaBoost. I found it to be more robust to consider top 50 candidates regardless of their score, and then use some heuristics to filter out ones not related to bibs

Alex said...

Yaroslav, I was wondering if the OCR was happening in realtime or in post processing of the video? It appears the video is a little slow motion, was that because OCR wasn't as accurate at normal speeds?

Jai Pillai said...

Impressive results. I am wondering whether you are using all the frames in the video, or just one good frame? In the former case, how are you integrating the results from each frame?

Yaroslav Bulatov said...

Alex: it's not real time
Jai: you mean how do I determine which bibs occur in the video? I just return bib numbers which occur in more than k frames (k hand-tuned, 40 seems to be OK)

M P Divecha said...

Hi,
I am working on a similar problem (here: http://stackoverflow.com/questions/6794372/localization-of-numbers-within-a-complex-scene-image). Can you give me some tips on how you did this?
I have never used boosting before, so I would like to know how did you train the classifiers? Is there any dataset availble?
Your help in this regard will be highly appreciated :-)

Thanks.

Unknown said...

Ярослав, здравствуйте,

А выкладывали ли вы код или более подробное описание вашего алгоритма? Очень интересный подход, хотелось бы почитать подробнее. Спасибо!

DomFilk said...

I find another free online ocr, it's using tesseract ocr 3.02.

ben said...

It is important to reach more recipients.
phone girls London

draj said...

thanks for sharing this blog,try this blog too...
Seo Internship in Bangalore
Smo Internship in Bangalore
Digital Marketing Internship Program in Bangalore

Unknown said...

Useful article, thank you for sharing the article!!!

Website bloggiaidap247.com và website blogcothebanchuabiet.com giúp bạn giải đáp mọi thắc mắc.

john said...

Great Article
IEEE final year projects on machine learning


JavaScript Training in Chennai

Final Year Project Centers in Chennai



JavaScript Training in Chennai

pareena said...

Thank for sharing
ufabet
ทางเข้า ufa88
ufa slot

aiviviu said...

Thanks for Sharing This Article.It is very so much valuable content.
combo di da lat
combo di da nang

combo di nha trang

combo di phu quoc

combo di quy nhon

Nursing Thesis Writing Services said...

I want to always read your blogs. I love them Are you also searching for Nursing thesis writing services? we are the best solution for you. We are best known for delivering Nursing thesis writing services to students without having to break the bank

nursing writing help said...

Your blogs are great.Are you also searching for Nursing Writing Help? we are the best solution for you. We are best known for delivering nursing writing services to students without having to break the bank.

Geek Info said...

How to take money from credit card without charges

Kisan card how to apply

How to Activate Kotak Credit card

Fastag Kotak Mahindra Bank

Baroda Rajasthan Kshetriya Gramin Bank Net Banking

Sbi Online balance enquiry

Bank of Baroda credit card

Andhra bank balance enquiry

How to generate OTP for HDFC credit card

CIF number SBI Cheque book