Could you suggest the steps that should be followed for lucene. especially with big data (about 1 TB of PDF files for indexing)
Please see Lucene Performance Optimization Tips. Since you are working with a lot of data, you also need to monitor the performance of creating the index. Some tips on improving indexing performance and search efficiency are available on the Lucene Wiki.