So, I have a program for analyzing words in Excel, with which I hope that I can import more than 30 million words.
First I created a separate object for each of these words so that each word has ...
.value '(string), the actual word itself .bool1 '(boolean) .bool2 '(boolean) .bool3 '(boolean) .isUsed '(boolean) .cancel '(boolean)
When I found out that I can have 30 million of these objects (all of them are stored in one collection), I thought it could be a monster for compilation. And so I decided that all my words would be strings, and that I would bind them to an array.
So my idea of ββthe array is to add each of the 30 million lines by adding 5 spaces (for my 5 bools) at the beginning of each line, with each empty space representing a false bool val value. eg,
If instr(3, arr(n), " ") = 1 then 'my 3rd bool val is false. Elseif instr(3, arr(n), "*") = 1 then '(I'll insert a '*' to denote true) 'my third bool val is true. End If
Anyway, what do you guys think? How (collection or array) should I talk about this (especially for optimization)?
source share