Searching trough multiple pdf files
Thread poster: Pablo Bouvier
Pablo Bouvier
Pablo Bouvier  Identity Verified
Local time: 13:16
German to Spanish
+ ...
Sep 15, 2010

I wrote myself :

I have a lot of information in a pdf format (dictionnaries, glossaries, technical books, etc,) I am using as reference for my translations.

However, to search trough all this stuff is very time consuming. Someone can tell me if it exist some tool that can read multiple pdf files and open them highlighting the information I am looking for in one go?

Something like to search with Google trough the Googlebooks bibliotheques...




Addendum: I would like to give all you a big thank you to give me some insight about this issue. I will try now all this tools and take a decision on wich is the one that best suits my needs.



[Edited at 2010-09-16 08:53 GMT]


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 13:16
Member (2006)
English to Afrikaans
+ ...
Wilbur Sep 15, 2010

Pablo Bouvier wrote:
Someone can tell me if it exist some tool that can read multiple pdf files and open them highlighting the information I am looking for in one go?


The desktop search tool "Wilbur" indexes PDF files after converting it (in the background) to plain text.

http://s3.amazonaws.com/redtree/wilbur/index.html


 
Claudio Porcellana (X)
Claudio Porcellana (X)  Identity Verified
Italy
Searching trough multiple pdf files Sep 15, 2010

Logiterm and you'll have a powerful aligning tool too!
tha uses even PDFs for aligning

as an example, I was able to index ICD-9 that is a mission impossible otherwise, and have it almost bilingual (with number codes as a Rosetta stone)

a lot of huge EU PDFs
DOD Dictionary 2009
all REACH docs
and many, many others

and you can manage 3 kind of docs
bilingual, even Trados TM converted
monolingual
multilingual
... See more
Logiterm and you'll have a powerful aligning tool too!
tha uses even PDFs for aligning

as an example, I was able to index ICD-9 that is a mission impossible otherwise, and have it almost bilingual (with number codes as a Rosetta stone)

a lot of huge EU PDFs
DOD Dictionary 2009
all REACH docs
and many, many others

and you can manage 3 kind of docs
bilingual, even Trados TM converted
monolingual
multilingual

furthermore, Logiterm manages almost every kind of docs, not only PDFs

Claudio

[Modificato alle 2010-09-15 22:06 GMT]
Collapse


 
Hynek Palatin
Hynek Palatin  Identity Verified
Czech Republic
Local time: 13:16
English to Czech
+ ...
Desktop Search Applications Sep 15, 2010

Pablo,

I think you need a "desktop search" application. You can try the following:

Google Desktop

Copernic Desktop Search

Windows Search for Windows XP/Vista
(I am not sure how good is
... See more
Pablo,

I think you need a "desktop search" application. You can try the following:

Google Desktop

Copernic Desktop Search

Windows Search for Windows XP/Vista
(I am not sure how good is the PDF support.)
There is a built-in indexing service in Windows 7.

There is also an older article on lifehacker.

You could also consider converting your glossaries to a text format, which is much easier to search using many tools.

Hynek
Collapse


 
Anna Villegas
Anna Villegas
Mexico
Local time: 05:16
English to Spanish
Acrobat Reader Pro... Sep 16, 2010

...can do the job in a simple way, provided the PDFs are on your PC.



 
Tomás Cano Binder, BA, CT
Tomás Cano Binder, BA, CT  Identity Verified
Spain
Local time: 13:16
Member (2005)
English to Spanish
+ ...
Effective File Search Sep 16, 2010

In Spain we say "Cada maestrillo tiene su librillo", "Each little teacher has his little book". I very much like Effective File Search. It searches really fast in big directory structures and any kind of files.

 
Pablo Bouvier
Pablo Bouvier  Identity Verified
Local time: 13:16
German to Spanish
+ ...
TOPIC STARTER
Searching trough multiple pdf files Sep 16, 2010

Tomás Cano Binder, CT wrote:

In Spain we say "Cada maestrillo tiene su librillo", "Each little teacher has his little book". I very much like Effective File Search. It searches really fast in big directory structures and any kind of files.


Indeed, The name of the application is promising.
Thanks for the info, Tomás!


 
tectranslate ITS GmbH
tectranslate ITS GmbH
Local time: 13:16
German
+ ...
So THAT's why it didn't work for me! Sep 16, 2010

Tadzio Carvallo wrote:

...can do the job in a simple way, provided the PDFs are on your PC.


Just the other day I tried searching a whole network directory full of PDFs that I KNEW had the search term in them, with zero results. This was quite baffling to me, but next time I'll copy them over to my HDD first (which is of course a major nuisance and shouldn't be necessary, but if it works...)

Best regards,
Benjamin


 
FarkasAndras
FarkasAndras  Identity Verified
Local time: 13:16
English to Hungarian
+ ...
EU pdfs? Sep 16, 2010

The question has pretty much been answered, get a good desktop search tool.
A somewhat knowledgeable friend says DTSearch is really great for this. Customizable search profiles, in-context hits in dtsearch's own window, search in many formats including pdf etc. I haven't tried it though.

I'm not sure it makes much sense to collect many large EU pdfs though. In my experience, almost everything is also published in HTML, which is infinitely better for our purposes. You can just
... See more
The question has pretty much been answered, get a good desktop search tool.
A somewhat knowledgeable friend says DTSearch is really great for this. Customizable search profiles, in-context hits in dtsearch's own window, search in many formats including pdf etc. I haven't tried it though.

I'm not sure it makes much sense to collect many large EU pdfs though. In my experience, almost everything is also published in HTML, which is infinitely better for our purposes. You can just grab two HTML files, autoalign them and go on with life. They match so closely you can just use autoaligned TMs without checking them.
For some reason, the treaties aren't easy to find in html, and the Council insists on producing its awful pdfs but everything else is out there in html or at least doc.
Collapse


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Searching trough multiple pdf files







Trados Studio 2022 Freelance
The leading translation software used by over 270,000 translators.

Designed with your feedback in mind, Trados Studio 2022 delivers an unrivalled, powerful desktop and cloud solution, empowering you to work in the most efficient and cost-effective way.

More info »
Trados Business Manager Lite
Create customer quotes and invoices from within Trados Studio

Trados Business Manager Lite helps to simplify and speed up some of the daily tasks, such as invoicing and reporting, associated with running your freelance translation business.

More info »