Search Appliance - 4D Style! Looking for fast and efficient search/indexing of documents without the Google appliance price? We have the answer!
PDF Meta Processor
A renowned medical facility turned to NowData for help with improving the consistency, accuracy, and search engine readiness of thousands of PDF documents.
How can we improve Google's search results?
Relying on individual document authors to correctly and consistently add meta information each time they created or modified a document proved to be a difficult challenge. The desired consistency and accuracy were elusive. Additionally, it was discovered that the original document authoring tool was applying unrelated meta information to documents, which caused unwanted results in search tools such as Google.
NowData quickly determined that the most reliable solution to the problem would be the addition of a centralized database that would be used to catalog critical information about each individual PDF document.
System Features:
- Extract keyword and meta information from PDF documents dropped onto a shared 'watched' folder.
- Create a database record for each document storing relevant information including:
- Total word count.
- Repeated word count.
- All document meta information including title, author, description, keywords, copyright information, etc.
- Modify PDF documents with the latest meta information.
- Copy completed PDF documents to designated web directories.
- Maintain a revision history of all previous documents.
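The word statistics listed above can be illustrated with a short Python sketch. It is not NowData's 4D implementation: the function name `word_statistics` is hypothetical, the text is assumed to have already been extracted from the PDF, and "repeated word count" is interpreted here as the number of occurrences of each word that appears more than once.

```python
import re
from collections import Counter


def word_statistics(text):
    """Return the total word count and the words that occur more than once.

    Assumes `text` is plain text already extracted from a PDF document.
    """
    # Lowercase the text and split it into simple alphanumeric words.
    words = re.findall(r"[a-z0-9']+", text.lower())
    counts = Counter(words)
    # Keep only the words that appear at least twice, with their counts.
    repeated = {word: n for word, n in counts.items() if n > 1}
    return {"total_words": len(words), "repeated_words": repeated}


stats = word_statistics("Patient care and patient safety guide all care decisions.")
```

A record like `stats` could then be stored alongside the title, author, description, and keywords for each document.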
The power and flexibility of a 4D database:
Each PDF document's relevant meta information is stored as a database record. Authorized users accessing the database not only see a listing of the words contained in the PDF document, but can also edit and save new meta information. Best of all, once a database record is linked to a PDF document, the database will automatically apply the saved meta information each time that PDF document is changed.
Now, when a PDF document is modified and placed into one of the 'watched' folders maintained by the database, it will be automatically updated to always contain the correct meta information. As a finishing touch, the database will copy the revised PDF file to the designated web or shared file directory.
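The watched-folder workflow described here might look roughly like the following Python sketch. It is an illustration under stated assumptions, not the actual 4D database logic: `process_watched_folder`, the `records` dictionary (a stand-in for database records keyed by file name), and the `apply_meta` callback (a placeholder for whatever PDF library writes the metadata) are all hypothetical names.

```python
import shutil
import time
from pathlib import Path


def process_watched_folder(watched, web_dir, history_dir, records, apply_meta):
    """Apply saved meta information to PDFs in `watched` and publish them.

    records    -- dict mapping file name to saved meta information
                  (hypothetical stand-in for the database records)
    apply_meta -- callable(path, meta) that writes the metadata into the PDF
                  (hypothetical; a real version would use a PDF library)
    Returns the list of file names that were processed.
    """
    web_dir.mkdir(parents=True, exist_ok=True)
    history_dir.mkdir(parents=True, exist_ok=True)
    processed = []
    for pdf in sorted(watched.glob("*.pdf")):
        meta = records.get(pdf.name)
        if meta is None:
            continue  # no linked record yet, so leave the file alone
        apply_meta(pdf, meta)
        target = web_dir / pdf.name
        if target.exists():
            # Keep the previously published copy as a revision.
            stamp = time.strftime("%Y%m%d-%H%M%S")
            archived = history_dir / f"{pdf.stem}-{stamp}{pdf.suffix}"
            shutil.move(str(target), str(archived))
        shutil.copy2(pdf, target)
        # Assumption: remove the source so the next scan does not reprocess it.
        pdf.unlink()
        processed.append(pdf.name)
    return processed
```

Calling this function on a timer, or from a file-system notification, would approximate the "watched" behavior; files without a linked record stay in the folder until a record is created for them.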
The result? Dramatically improved file property consistency and better search engine ranking results.






