From the tags of your question, I assume that you are using Drupal for the Apache Solr interface. Tika can work from Solr when you send its binary documents, or you can use it before sending documents to Solr. The Drupal Solr Attachments module has a setting for this "Tika (local Java application)." In the second link, you indicated that they fixed the Solr Attachments module to use the PDFBox instead of Tika to parse binary files before submitting to Solr. If you are not using Drupal, you should try a similar approach.
source share