DSPRelated.com
Forums

PDF Management and electronic bibliographies

Started by Unknown November 5, 2006
Randy Yates wrote:

[Google Desktop]

> Unfortunately it's only available for Windoze OSs - I run linux (FC4).
htdig is also able to index PDF-files. htdig though is a search engine for web sites, so I don't know if t is possible to configure it so that it runs through your lokal harddisk. URL: http://www.htdig.org bye Andreas -- Andreas H�nnebeck | email: acmh@gmx.de ----- privat ---- | www : http://www.huennebeck-online.de Fax/Anrufbeantworter: 0721/151-284301 GPG-Key: http://www.huennebeck-online.de/public_keys/andreas.asc PGP-Key: http://www.huennebeck-online.de/public_keys/pgp_andreas.asc
porterboy76@yahoo.com wrote:

> Thanks for all your replies... > > I hadn't thought of using Google Desktop or equivalent, so thats a step > forward for me anyway. There was talk of this not working for scanned > documents, unless OCR (optical character recognition?) was used. > Problem is that eg IEEE have a lot of very old, scanned PDFs on > IEEEXplore, not all of which were scanned with OCR and there is not > much I can do about that. I presume Google Desktop would have major > problems with that right? > > I like the way mp3 files are handled in iTunes, there is just a big > folder with all the mp3s, and the artist, album, genre, date, title > etc. are all embedded tags in the mp3 file. Something like that would > be great, but if there are no meta tags in the original PDF file, is it > possible to edit the file to insert them? >
May I suggest posting to comp.lang.postscript I'm a neophyte postscript wannabe user, so I lurk there a lot. I've noticed many posts there that cover a murky area between postscript and PDF. If that is not best forum, I'm sure someone there could refer you to appropriate forum. YMMV
porterboy76@yahoo.com wrote:
> Just wondering how people manage their collections of PDF files and > downloaded electronic documents on their computers? Up to now I have > been maintaining a nice filestructure and naming files well, and I have > folders like iir_equalisation, wireless_channel_modelling and so on. > > I also have a TO_BE_FILED folder in which I have tons of PDFs I just > havent gotten round to filing, and its starting to look like a > mountainous task. I was thinking of just dumping all my PDFs in a > single folder, and trying to organise them by keywords or metatags, but > I dont know if this is easy... > > Does anybody have any recommendations about how I should go about this? > Software I should look at?
After trying out lots of things, the thing that works best for me is: save everything you download per -time-. That is, keep folders per year / month / etc. Same with email. That's also a lot easier with maintaining backups. Most of the time, i don't even need the search software; i usually remember approximately when i downloaded it, and once i get 'close', i recognize some of the files i see, and know if it was before or after that. -- Cheers, Herman Jurjus
Richard Owlett wrote:
> porterboy76@yahoo.com wrote: > > > Thanks for all your replies... > > > > I hadn't thought of using Google Desktop or equivalent, so thats a step > > forward for me anyway. There was talk of this not working for scanned > > documents, unless OCR (optical character recognition?) was used. > > Problem is that eg IEEE have a lot of very old, scanned PDFs on > > IEEEXplore, not all of which were scanned with OCR and there is not > > much I can do about that. I presume Google Desktop would have major > > problems with that right? > > > > I like the way mp3 files are handled in iTunes, there is just a big > > folder with all the mp3s, and the artist, album, genre, date, title > > etc. are all embedded tags in the mp3 file. Something like that would > > be great, but if there are no meta tags in the original PDF file, is it > > possible to edit the file to insert them? > > > > May I suggest posting to comp.lang.postscript > > I'm a neophyte postscript wannabe user, so I lurk there a lot. > I've noticed many posts there that cover a murky area between postscript > and PDF. > > If that is not best forum, I'm sure someone there could refer you to > appropriate forum. > > YMMV
I got another recommendation from a colleague... http://sourceforge.net/projects/nlada-library This is web-based and can run on Linux, however, i am not sure it is well maintained...
Richard Owlett wrote:

(snip on PDFs)

> May I suggest posting to comp.lang.postscript
There is also comp.text.pdf -- glen
> glen herrmannsfeldt wrote: > Richard Owlett wrote: > > (snip on PDFs) > > > May I suggest posting to comp.lang.postscript > > There is also comp.text.pdf > > -- glen
Thank you sir... suggestions so far.... Beagle, Nlada library, Google Desktop, Acrobat MetaTagging, Windows desktop search, yahoo seach, copernic, htdig, Jabref
> Problem is that eg IEEE have a lot of very old, scanned PDFs on > IEEEXplore, not all of which were scanned with OCR and there is not > much I can do about that. I presume Google Desktop would have major > problems with that right?
Actually, I'm making an assumption there... I found that all recent IEEE PDFs have OCR, old IEEE PDFs seem to have it to, and my question is, do all scanned IEEE PDFs have OCR with the resulting text embedded in the PDF? Anybody there from the IEEE who might know, before I give them a call. If they dont have OCR on all scanned PDFs, with meta text embedded, then they SHOULD!
porterboy wrote:

> > Problem is that eg IEEE have a lot of very old, scanned PDFs on > > IEEEXplore, not all of which were scanned with OCR and there is not > > much I can do about that. I presume Google Desktop would have major > > problems with that right? > > Actually, I'm making an assumption there... I found that all recent > IEEE PDFs have OCR, old IEEE PDFs seem to have it to, and my question > is, do all scanned IEEE PDFs have OCR with the resulting text embedded > in the PDF? > > Anybody there from the IEEE who might know, before I give them a call. > If they dont have OCR on all scanned PDFs, with meta text embedded, > then they SHOULD!
Where can I sign that?