I have a requirement to read a .pdf file which we cant convert into .txt or any other format due to insufficient privileges. Can Pentaho read .pdf files ? if yes then how can you please suggest ?
There is a plugin "Load text from file" which according to the documentation is aible to read pdf files. I never used it so I don't know how it works. You can download the plugin in the marketplace
Yes you can, but I do not know of any pdf to txt converter out-of-the box.
In the last case, please publish your work.
Indeed it can! check out my blog:
Unstructured data, Apache Tika and Beer | Codeks Blog
Retrieving data ...