I have used three languages for web scraping (Ruby, PHP, and Python), and frankly none of them is a good fit for the task.
Ruby has an excellent Mechanize library and an excellent XML parsing library, but its spreadsheet support is very poor.
PHP has excellent spreadsheet and HTML parsing libraries, but it has no equivalent of WWW::Mechanize.
Python has a very poor Mechanize library; I ran into a lot of problems with it and still have not been able to solve them. Its spreadsheet library is also only more or less decent, since it cannot create XLSX files.
Is there anything that is just perfect for web scraping?
PS: I work on the Windows platform.
python ruby php web-scraping
Shubham