comp.dsp | Text recognition in motion video| page 2

Reply by Jerry Avins ●August 4, 20042004-08-04

Jerry Avins wrote:

> Rob Vermeulen wrote:
> 
>   ...
> 
>> The problem with OCR is that the algoritm only works accurately when 
>> text is
>> placed on a solid background, which isn't the case in video material 
>> most of
>> the time. I want to detect subtitles and other "overlayed text" but also
>> "scene text" such as license places on cars and company logos on 
>> buildings.
>> The text in the last category can also appear in every angle, rotated in
>> every direction and even in perspective.
>> What I need is more than just OCR. It must first do text-detection,
>> classification, de-blurr filters and other preprocessing things before it
>> recognizes characters.
> 
> 
> There may be image enhancement processes that you could use to isolate
> the letter outlines. Once you have that, you can put it on any
> background you like. Some OCR programs may be able to use that
> information directly. I can't begin to e   stimatetheresourcesneeded.
> They could well be excessive.
> 
> Jerry
???

I can't begin to estimate the resources needed.

What happened?

Jerry
-- 
Engineering is the art of making what you want from things you can get.
&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;

Reply by Jerry Avins ●August 4, 20042004-08-04

Jerry Avins wrote:

> Jerry Avins wrote:
> 
>> Rob Vermeulen wrote:
>>
>>   ...
>>
>>> The problem with OCR is that the algoritm only works accurately when 
>>> text is
>>> placed on a solid background, which isn't the case in video material 
>>> most of
>>> the time. I want to detect subtitles and other "overlayed text" but also
>>> "scene text" such as license places on cars and company logos on 
>>> buildings.
>>> The text in the last category can also appear in every angle, rotated in
>>> every direction and even in perspective.
>>> What I need is more than just OCR. It must first do text-detection,
>>> classification, de-blurr filters and other preprocessing things 
>>> before it
>>> recognizes characters.
>>
>>
>>
>> There may be image enhancement processes that you could use to isolate
>> the letter outlines. Once you have that, you can put it on any
>> background you like. Some OCR programs may be able to use that
>> information directly. I can't begin to e   stimatetheresourcesneeded.
>> They could well be excessive.
>>
>> Jerry
> 
> ???
> 
> I can't begin to e   stimatetheresourcesneeded.
> 
> What happened?

Again!

I can't begin to estimate the resources needed.

I can't begin to
estimate
the
resources
needed.

Jerry
-- 
Engineering is the art of making what you want from things you can get.
&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;&#4294967295;

Reply by Stephan M. Bernsee ●August 4, 20042004-08-04

On 2004-08-04 20:30:05 +0200, Jerry Avins <jya@ieee.org> said:

> Jerry Avins wrote:
> 
>> Jerry Avins wrote:
>> 
>>> There may be image enhancement processes that you could use to isolate
>>> the letter outlines. Once you have that, you can put it on any
>>> background you like. Some OCR programs may be able to use that
>>> information directly. I can't begin to e   stimatetheresourcesneeded.
>>> They could well be excessive.
>>> 
>>> Jerry
>> 
>> ???
>> 
>> I can't begin to e   stimatetheresourcesneeded.
>> 
>> What happened?
> 
> Again!
> 
> I can't begin to estimate the resources needed.
> 
> I can't begin to
> estimate
> the
> resources
> needed.
> 
> Jerry


Maybe you have some weird text enhancement processes messing up your 
posting... :-)

Btw: only the quoted text appears messed up, the original looks good on 
my end (individual.net).

Cheers,
Stephan

-- 
Stephan M. Bernsee
http://www.dspdimension.com

Reply by Rob Vermeulen ●August 6, 20042004-08-06

"Jerry Avins" <jya@ieee.org> wrote in message
news:4110fdd0$0$2825$61fed72c@news.rcn.com...

> There may be image enhancement processes that you could use to isolate
> the letter outlines. Once you have that, you can put it on any
> background you like. Some OCR programs may be able to use that
> information directly. I can't begin to estimate the resources needed.
> They could well be excessive.

I'm familiar with image processing / filtering methods and I think I could
figure out a way to separate text from the rest of the image.
(Probably a combination of edge detection & classification).
Perhaps this will make gocr work better, because I only got garbage out when
feeding it with frames from my video database.

But I think there's a different approach necessary. Some approach that I'm
not familiar with.

And there's also the 'speed' component. I'd prefer if a routine worked
realtime and even faster. I've already got an archive of 10000 hours of
video which need to be searched. I'd hate to see that it takes 20000 hours
to process 10000 hours of material.

Anyway, its worth sorting this out and I like the matter, so I'll be doing
research on this for quite some time, I think :-)

If someone else can come up with good suggestions, I'd appreciate it.

Best regards,

Rob

Previous 12Next

Text recognition in motion video

Sign in

You might also like...

Search forums

Free PDF Downloads

Blogs - Hall of Fame

Discussion Groups

Quick Links

About DSPRelated.com

Social Networks

The Related Media Group