With due respect to Colt and Parallel Colt, they are not very fast. If you insist on using Java and expect fast numerical computations, use JBLAS. JBLAS uses ATLAS. I compiled JBLAS to use multi-threaded ATLAS; it does not do this by default, and you will need to change several configuration parameters. However, even single-threaded JBLAS is faster than multi-threaded Colt and Parallel Colt. I tested Colt, Parallel Colt, JAMA and JBLAS. JBLAS is the best by a country mile.
Colt and Parallel Colt are very slow. So is JAMA. The best Java library for such things is JBLAS.
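If you want to check this kind of comparison on your own machine, a rough timing harness might look like the sketch below. It is not the benchmark I ran: the class and method names (`MatMulBench`, `naiveMultiply`, `bestMillis`) are made up for illustration, and the only thing it times out of the box is a plain-Java triple loop as a baseline. The comment in `main` shows how JBLAS could be plugged in, assuming the jblas jar is on the classpath.

```java
import java.util.Random;
import java.util.function.BinaryOperator;

// Hypothetical harness: names here are illustrative, not part of any library.
class MatMulBench {

    // Naive triple-loop multiply, used as a pure-Java baseline.
    // Assumes a has as many columns as b has rows.
    static double[][] naiveMultiply(double[][] a, double[][] b) {
        int n = a.length, k = b.length, m = b[0].length;
        double[][] c = new double[n][m];
        for (int i = 0; i < n; i++) {
            for (int p = 0; p < k; p++) {
                double aip = a[i][p];
                for (int j = 0; j < m; j++) {
                    c[i][j] += aip * b[p][j];
                }
            }
        }
        return c;
    }

    // Matrix of uniform random values in [0, 1), reproducible via the seed.
    static double[][] randomMatrix(int rows, int cols, long seed) {
        Random rng = new Random(seed);
        double[][] m = new double[rows][cols];
        for (int i = 0; i < rows; i++) {
            for (int j = 0; j < cols; j++) {
                m[i][j] = rng.nextDouble();
            }
        }
        return m;
    }

    // Best wall-clock time in milliseconds over `runs` timed calls,
    // after `warmup` untimed calls that give the JIT a chance to compile.
    static double bestMillis(BinaryOperator<double[][]> multiply,
                             double[][] a, double[][] b, int warmup, int runs) {
        for (int i = 0; i < warmup; i++) {
            multiply.apply(a, b);
        }
        long best = Long.MAX_VALUE;
        for (int i = 0; i < runs; i++) {
            long t0 = System.nanoTime();
            multiply.apply(a, b);
            best = Math.min(best, System.nanoTime() - t0);
        }
        return best / 1e6;
    }

    public static void main(String[] args) {
        int n = 300;
        double[][] a = randomMatrix(n, n, 1L);
        double[][] b = randomMatrix(n, n, 2L);
        System.out.printf("naive: %.2f ms%n",
                bestMillis(MatMulBench::naiveMultiply, a, b, 3, 5));
        // With jblas on the classpath, the same harness could time JBLAS via:
        //   (x, y) -> new org.jblas.DoubleMatrix(x)
        //                 .mmul(new org.jblas.DoubleMatrix(y)).toArray2()
    }
}
```

Note that the JBLAS lambda in the comment converts to and from `double[][]` on every call, so it slightly overstates JBLAS's cost; for a fairer number, build the `DoubleMatrix` objects once outside the timed region.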
Hamaad shah