I have a text file with approximately 8.5 million data points in the form:
Company 87178481 Company 893489 Company 2345788 [...]
I want to use Python to create a connection diagram to see how the network looks between companies. From the above example, two companies would share an edge if the value in the second column were the same (clarification from / for Hooked ).
I used the NetworkX package and was able to create a network with several thousand points, but this did not lead to a full text file with 8.5 million node. I started it and left about 15 hours, and when I returned, the cursor in the shell was still blinking, but there was no graph of output.
Can it be assumed that it is still working? Is there a better / faster / easier approach for a graph of millions of points?
source share