Apache POI vs Python xlrd

I am planning to use either Apache POI or Python's xlrd to read spreadsheets in a program running on Linux. I am leaning toward xlrd, but Apache POI looks like the more complete project (perhaps because it is backed by the Apache Software Foundation). Which would be the better choice? I prefer Python, but I would like to be able to read most versions of Excel files and to read macros.

+4
2 answers

Depending on how complex your needs are, you may be able to use Apache Tika. Tika calls POI (and other libraries) and gives you an XHTML or plain-text version of the document. If you just need a simple Linux command, that might be enough.
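As a rough sketch of the "simple command" route, driven from Python: this assumes you have downloaded the standalone tika-app jar and that `java` is on your PATH. The jar location `tika-app.jar` and the helper names `tika_command` and `extract` are placeholders I made up for illustration, not anything Tika ships.

```python
import subprocess

# Assumption: path to a downloaded tika-app jar; adjust for your system.
TIKA_JAR = "tika-app.jar"


def tika_command(spreadsheet, jar=TIKA_JAR, xhtml=False):
    """Build the command line that asks Tika for text or XHTML output."""
    mode = "--xhtml" if xhtml else "--text"
    return ["java", "-jar", jar, mode, spreadsheet]


def extract(spreadsheet, jar=TIKA_JAR, xhtml=False):
    """Run Tika and return whatever it writes to standard output."""
    result = subprocess.run(
        tika_command(spreadsheet, jar, xhtml),
        check=True,           # raise if Tika exits with an error
        capture_output=True,  # collect stdout/stderr instead of printing
        text=True,            # decode the output to str
    )
    return result.stdout
```

Calling `extract("report.xls")` (a hypothetical file) should then hand back the text of the workbook, which may be all you need if you only want the cell contents and not the structure.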

Otherwise, it may depend on your Java skills. POI should do whatever you need, but you will have to write some code to pull out the bits you are interested in. Check out the quick start guide to get started.

(I don't know about xlrd, sorry)

+2

One option that could give you the best of both worlds is to use Apache POI via Jython (Python syntax with Java interoperability).

Here's a dated Jython example ... I believe it will need some changes to work with newer versions of POI.
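Separately from that older example, here is a minimal sketch of what the Jython side might look like against POI's `org.apache.poi.ss.usermodel` package. It assumes you run it under Jython with the POI jars on the classpath; the function names `open_workbook` and `first_sheet_rows` are my own, and I have not checked this against every POI release.

```python
def open_workbook(path):
    """Open an .xls or .xlsx file through POI.

    Assumption: running under Jython with the POI jars on the classpath.
    The Java imports are kept inside the function so the module itself
    still loads where those classes are unavailable.
    """
    from java.io import File
    from org.apache.poi.ss.usermodel import WorkbookFactory
    return WorkbookFactory.create(File(path))


def first_sheet_rows(workbook):
    """Return the first sheet as a list of rows of cell strings."""
    rows = []
    # Sheet and Row are Java Iterables, which Jython lets you loop over.
    # Iterating a Row visits only the cells that actually exist in it.
    for row in workbook.getSheetAt(0):
        rows.append([cell.toString() for cell in row])
    return rows
```

Something like `first_sheet_rows(open_workbook("report.xls"))` (a hypothetical file) would then give you plain Python lists to work with, while POI does the actual parsing.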

0