I am trying to find a way in Python to read data from the web browser that is currently in use. Specifically, I am trying to pull a massive table of data from a locally controlled company web page into a pandas dataframe. The problem is that the website has a rather complicated authentication process that I could not get past using Selenium with a number of web drivers, requests, urllib, and cookielib, with various user parameters. I have completely given up on this front, as I am fairly sure the authentication process involves more than can easily be achieved with these libraries.
However, I did manage to get around the required tokenization process when I quickly tested opening a new tab in the current browser, which was already logged in, using the webbrowser module. As expected, webbrowser does not offer a read function, which means that even though the page can be opened, the data on it cannot be read into a pandas dataframe. This made me think I could use win32com to open a browser, log in, and then run the rest of the script, but again, there is no general way to read the page from Internet Explorer, which means I cannot get the information I want into pandas. I'm at a dead end. Any ideas?
I could request the necessary authentication token scripts, but I am sure it will take a week or two before anything happens on that front. Obviously, I would prefer to get something working in the meantime while I wait for the actual authorization scripts from the company.
Update: I received the authentication tokens from the company. However, using them requires a Python package on another server, which I also do not have access to, mainly because it is unusual for anyone in my department to use Python. So the above still applies: I need a method for reading and controlling an already-open browser.
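To make the goal concrete, here is a minimal sketch of the kind of thing being asked for, not a confirmed working solution. It assumes Windows, the pywin32 package, and that the already-authenticated page is open in Internet Explorer. The function names and the URL fragment are hypothetical. The first part parses table rows with the standard library only. The second part tries to attach to an open tab through the Shell.Application COM object and is untested against the company site.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the cell text of every <tr> in an HTML string.

    Deliberately simple: nested tables and rowspan/colspan are not handled.
    """

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows, each a list of cell strings
        self._row = None    # row currently being read, or None
        self._cell = None   # text pieces of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def rows_from_html(html):
    """Return all table rows found in html as lists of strings."""
    parser = TableRows()
    parser.feed(html)
    parser.close()
    return parser.rows


def html_of_open_ie_tab(url_fragment):
    """Windows only: return the HTML of an open IE tab whose URL contains url_fragment."""
    import win32com.client  # pywin32; imported here so the parser above works without it

    for window in win32com.client.Dispatch("Shell.Application").Windows():
        # Windows() also lists File Explorer windows, hence the URL filter.
        if url_fragment in (window.LocationURL or ""):
            return window.Document.body.outerHTML
    raise LookupError("no open tab matches %r" % url_fragment)
```

With these in place, something like rows = rows_from_html(html_of_open_ie_tab("intranet.example")) followed by pandas.DataFrame(rows[1:], columns=rows[0]) would give a dataframe, assuming the first row holds the column headers. If lxml or BeautifulSoup with html5lib is installed, pandas.read_html could replace the hand-written parser.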
python authentication web-scraping
Wolves