After successfully writing this small piece of code, I tested it on a small set of 3 documents. It ran perfectly fine: about 30 seconds of run time.
Then... I thought I'd jump straight to testing on 163657 documents. Kind of a suicidal decision. I was beginning to think the program would run the whole day o.o; it didn't look like it would stop anytime soon.
Oh well, after 29 minutes it printed out my results. The sad part is...
I forgot to save them to a file, and I forgot to keep count of the non-zero entries and the number of columns :(!
Damn it.
Another rerun?
Oh well, then I recalled something during the endless run [only 29 minutes, pshhhhh, kicks myself for exaggerating]: KMP. Let's try something?
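
For anyone wondering what KMP looks like: assuming it means Knuth-Morris-Pratt string matching here, below is a minimal Python sketch of the idea. It is not the code from my program; the function names `build_failure_table` and `kmp_find_all` are made up for illustration. The point is that the text is scanned once, and a small precomputed table says how far to fall back in the pattern after a mismatch, so no text character is ever re-read.

```python
def build_failure_table(pattern):
    """For each prefix of pattern, length of its longest proper prefix
    that is also a suffix. (Hypothetical helper name.)"""
    table = [0] * len(pattern)
    k = 0  # length of the current matched prefix
    for i in range(1, len(pattern)):
        # Fall back to shorter prefixes until one can be extended.
        while k > 0 and pattern[i] != pattern[k]:
            k = table[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        table[i] = k
    return table


def kmp_find_all(text, pattern):
    """Return the start index of every (possibly overlapping) match."""
    if not pattern:
        return []
    table = build_failure_table(pattern)
    matches = []
    k = 0  # number of pattern characters matched so far
    for i, ch in enumerate(text):
        # On a mismatch, reuse the table instead of rewinding the text.
        while k > 0 and ch != pattern[k]:
            k = table[k - 1]
        if ch == pattern[k]:
            k += 1
        if k == len(pattern):
            matches.append(i - k + 1)
            k = table[k - 1]  # keep going to catch overlapping matches
    return matches
```

Calling `kmp_find_all(document_text, word)` gives every position where `word` occurs, in time proportional to the length of the text plus the length of the pattern. Whether that actually beats the 29 minutes depends on where the time is going, which I have not measured.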