I'm referring to your interesting blog post here.
Did you pass a fixed random seed, e.g. 42, to allow reproducible results?
Otherwise, the difference might be explained by chance.
Note, however, that passing 0 uses the system clock, so such a run is not reproducible.
command = (MalletPath +" train-topics" +
           " --input "+ CorpusFile +
           " --num-topics "+ str(Topics) +
           " --optimize-interval "+ str(Interval) +
           " --num-iterations " + str(Iterations) +
           " --num-top-words " + str(NumTopWords) +
           " --word-topic-counts-file "+ word_topics_counts_file +
           " --topic-word-weights-file "+ topic_word_weights_file +
           " --output-topic-keys "+ output_topic_keys +
           " --output-doc-topics "+ output_doc_topics +
           " --doc-topics-max "+ str(DocTopicsMax) +
           " --output-state " + output_topic_state +
           " --diagnostics-file "+DiagnosticsFile +
           " --num-threads " + str(NumThreads))
#print(command)
subprocess.call(command, shell=True)
projects/2015/gddh/code/model.py
Lines 79 to 94 in 44752a7
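For illustration, here is a minimal sketch of how the call could take a fixed seed. It assumes MALLET's `--random-seed` option for `train-topics`. The function name `build_train_topics_command`, the file name `corpus.mallet`, and the topic count of 50 are hypothetical placeholders, and only a few of the options from the snippet are shown.

```python
def build_train_topics_command(mallet_path, corpus_file, num_topics, random_seed=42):
    """Return a MALLET train-topics call as an argument list with a fixed seed.

    Hypothetical helper for illustration; it is not part of model.py.
    """
    if random_seed == 0:
        # A seed of 0 makes MALLET fall back to the system clock,
        # so it would not give reproducible results.
        raise ValueError("random_seed must be non-zero for reproducible runs")
    return [
        mallet_path, "train-topics",
        "--input", corpus_file,
        "--num-topics", str(num_topics),
        "--random-seed", str(random_seed),
    ]


# Placeholder values; substitute the real MALLET path and corpus file.
command = build_train_topics_command("mallet", "corpus.mallet", 50)
print(" ".join(command))

# To actually run it (requires MALLET and an imported corpus):
# import subprocess; subprocess.call(command)
```

Running the model twice with the same non-zero seed and otherwise identical settings should then show whether the difference persists or was just sampling noise.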