Google drive, what is the limit for indexing large files?

I am using the go go api to store and extract PDF files. I would like to query these files using the search options.

But before I begin to implement this. I would like to know how Google handles indexing large PDF files. (600 + pages 25Mb +) I would like to know for text pdf files (they do not need ocr)

I tried some searches on the disk website and it does not always work.

I would like to know if there are any restrictions and what they are.

+8
java google-drive-sdk
source share
1 answer

According to this page for PDF files with OCR:

The maximum size of images (.jpg, .gif, .png) and PDF files (.pdf) is 2 MB. For PDF files, we only look at the first 10 pages when searching for text to extract.

And this page is for PDF files with text:

You can search for text in PDFs and images:

  • Type your query in the search box on Google Drive online.
  • Open the Google Drive viewer and use the search box in the upper right corner.

In theory, you should be able to search the first 100 pages of any text documents or text PDF files that you have uploaded. You can also search for text found on the first ten pages of any PDF image files on your Drive.

+3
source share

All Articles