The Textractor plugin extracts plain text from non-plain-text files, such as PDFs and Microsoft Word documents, as well as EXIF data from JPEG (.jpg) images. It is an extremely simple plugin to use: pass the local path or URL of a file, and it returns the text in that file.
The currently supported file formats (and we’ll be adding more) are:
- PDF (.pdf)
- Microsoft Word Documents (.doc)
- Microsoft Excel (.xls)
- Rich Text Format (.rtf)
- Plain text (.txt)
- HTML (.html, .htm)
- JPEG EXIF data (.jpg)
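Before handing a file to the plugin, it can be useful to check whether its extension is on the list above. The following Python sketch shows one way to do that for either a local path or a URL. It is not part of Textractor: the `supported_format` helper and the `SUPPORTED_FORMATS` table are hypothetical names introduced here for illustration, and the check looks only at the file extension, not at the file contents.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions copied from the supported-format list above.
# This table is illustrative and not read from the plugin itself.
SUPPORTED_FORMATS = {
    ".pdf": "PDF",
    ".doc": "Microsoft Word",
    ".xls": "Microsoft Excel",
    ".rtf": "Rich Text Format",
    ".txt": "Plain text",
    ".html": "HTML",
    ".htm": "HTML",
    ".jpg": "JPEG EXIF data",
}


def supported_format(path_or_url):
    """Return the format name for a local path or URL, or None if unsupported."""
    parsed = urlparse(path_or_url)
    # For URLs, use only the path component so query strings are ignored.
    if parsed.scheme in ("http", "https", "file"):
        target = parsed.path
    else:
        target = path_or_url
    # Normalize Windows separators, then compare the extension case-insensitively.
    suffix = PurePosixPath(target.replace("\\", "/")).suffix.lower()
    return SUPPORTED_FORMATS.get(suffix)
```

A caller could use the result to skip unsupported files, for example files ending in `.jpeg` or `.docx`, which do not appear in the list above.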
If you have a database of Word documents and you want to search the contents of those documents, you need Textractor! Just store the extracted text in an indexed field in FileMaker, and run your searches against that field. Since Textractor can read from URLs, it also works great with SuperContainer.
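The store-then-search workflow can be sketched outside FileMaker as well. The Python example below uses an in-memory SQLite table as a stand-in for a FileMaker table with an indexed text field. It assumes the text has already been extracted, and the `build_index` and `search` functions and the `documents` table are hypothetical names for this illustration, not anything provided by Textractor or FileMaker.

```python
import sqlite3


def build_index(extracted):
    """Store already-extracted text, keyed by document path, in an in-memory table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE documents (path TEXT PRIMARY KEY, extracted_text TEXT)"
    )
    conn.executemany("INSERT INTO documents VALUES (?, ?)", extracted.items())
    conn.commit()
    return conn


def search(conn, term):
    """Return the paths of documents whose extracted text contains the term."""
    # Note: '%' and '_' inside the term act as LIKE wildcards in this simple sketch.
    rows = conn.execute(
        "SELECT path FROM documents WHERE extracted_text LIKE ? ORDER BY path",
        (f"%{term}%",),
    )
    return [row[0] for row in rows]
```

The point of the design is that the search never touches the original files: only the stored text is queried, so the documents themselves can stay in container fields or on a remote server.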