Lab: Tomotopy

Lab: Tomotopy#

Package tomotopy [Lee, 2022]

What is tomotopy?#

tomotopy is a Python extension of tomoto (Topic Modeling Tool) which is a Gibbs-sampling based topic model library written in C++. The current version of tomoto supports several major topic models including

Latent Dirichlet Allocation (tomotopy.LDAModel)
Labeled LDA (tomotopy.LLDAModel)
Partially Labeled LDA (tomotopy.PLDAModel)
Supervised LDA (tomotopy.SLDAModel)
Dirichlet Multinomial Regression (tomotopy.DMRModel)
Generalized Dirichlet Multinomial Regression (tomotopy.GDMRModel)
Hierarchical Dirichlet Process (tomotopy.HDPModel)
Hierarchical LDA (tomotopy.HLDAModel)
Multi Grain LDA (tomotopy.MGLDAModel)
Pachinko Allocation (tomotopy.PAModel)
Hierarchical PA (tomotopy.HPAModel)
Correlated Topic Model (tomotopy.CTModel)
Dynamic Topic Model (tomotopy.DTModel)
Pseudo-document based Topic Model (tomotopy.PTModel).

Getting Started#

You can install tomotopy easily using pip. (https://pypi.org/project/tomotopy/)

pip install --upgrade pip
pip install tomotopy

After installing, you can start tomotopy by just importing.

import tomotopy as tp
print(tp.isa) # prints 'avx2', 'avx', 'sse2' or 'none'

Here is a sample code for simple LDA training of texts from ‘sample.txt’ file.

import tomotopy as tp
mdl = tp.LDAModel(k=20)
for line in open('sample.txt'):
    mdl.add_doc(line.strip().split())

for i in range(0, 100, 10):
    mdl.train(10)
    print('Iteration: {}\tLog-likelihood: {}'.format(i, mdl.ll_per_word))

for k in range(mdl.k):
    print('Top 10 words of topic #{}'.format(k))
    print(mdl.get_topic_words(k, top_n=10))

mdl.summary()

Model Save and Load#

tomotopy provides save and load method for each topic model class, so you can save the model into the file whenever you want, and re-load it from the file.

import tomotopy as tp

mdl = tp.HDPModel()
for line in open('sample.txt'):
    mdl.add_doc(line.strip().split())

for i in range(0, 100, 10):
    mdl.train(10)
    print('Iteration: {}\tLog-likelihood: {}'.format(i, mdl.ll_per_word))

# save into file
mdl.save('sample_hdp_model.bin')

# load from file
mdl = tp.HDPModel.load('sample_hdp_model.bin')
for k in range(mdl.k):
    if not mdl.is_live_topic(k): continue
    print('Top 10 words of topic #{}'.format(k))
    print(mdl.get_topic_words(k, top_n=10))

# the saved model is HDP model,
# so when you load it by LDA model, it will raise an exception
mdl = tp.LDAModel.load('sample_hdp_model.bin')

When you load the model from a file, a model type in the file should match the class of methods.

Documents in the Model and out of the Model#

We can use Topic Model for two major purposes. The basic one is to discover topics from a set of documents as a result of trained model, and the more advanced one is to infer topic distributions for unseen documents by using trained model.

We named the document in the former purpose (used for model training) as document in the model, and the document in the later purpose (unseen document during training) as document out of the model.

In tomotopy, these two different kinds of document are generated differently. A document in the model can be created by tomotopy.LDAModel.add_doc method. add_doc can be called before tomotopy.LDAModel.train starts. In other words, after train called, add_doc cannot add a document into the model because the set of document used for training has become fixed.

To acquire the instance of the created document, you should use tomotopy.LDAModel.docs like:

mdl = tp.LDAModel(k=20)
idx = mdl.add_doc(words)
if idx < 0: raise RuntimeError("Failed to add doc")
doc_inst = mdl.docs[idx]
# doc_inst is an instance of the added document

A document out of the model is generated by tomotopy.LDAModel.make_doc method. make_doc can be called only after train starts. If you use make_doc before the set of document used for training has become fixed, you may get wrong results. Since make_doc returns the instance directly, you can use its return value for other manipulations.

mdl = tp.LDAModel(k=20)
# add_doc ...
mdl.train(100)
doc_inst = mdl.make_doc(unseen_doc) # doc_inst is an instance of the unseen document

Inference for Unseen Documents#

If a new document is created by tomotopy.LDAModel.make_doc, its topic distribution can be inferred by the model. Inference for unseen document should be performed using tomotopy.LDAModel.infer method.

mdl = tp.LDAModel(k=20)
# add_doc ...
mdl.train(100)
doc_inst = mdl.make_doc(unseen_doc)
topic_dist, ll = mdl.infer(doc_inst)
print("Topic Distribution for Unseen Docs: ", topic_dist)
print("Log-likelihood of inference: ", ll)

The infer method can infer only one instance of tomotopy.Document or a list of instances of tomotopy.Document. See more at tomotopy.LDAModel.infer.

Corpus and transform#

Every topic model in tomotopy has its own internal document type. A document can be created and added into suitable for each model through each model’s add_doc method. However, trying to add the same list of documents to different models becomes quite inconvenient, because add_doc should be called for the same list of documents to each different model. Thus, tomotopy provides tomotopy.utils.Corpus class that holds a list of documents. tomotopy.utils.Corpus can be inserted into any model by passing as argument corpus to __init__ or add_corpus method of each model. So, inserting tomotopy.utils.Corpus just has the same effect to inserting documents the corpus holds.

Some topic models requires different data for its documents. For example, tomotopy.DMRModel requires argument metadata in str type, but tomotopy.PLDAModel requires argument labels in List[str] type. Since tomotopy.utils.Corpus holds an independent set of documents rather than being tied to a specific topic model, data types required by a topic model may be inconsistent when a corpus is added into that topic model. In this case, miscellaneous data can be transformed to be fitted target topic model using argument transform.

See more details in the following code:

from tomotopy import DMRModel
from tomotopy.utils import Corpus

corpus = Corpus()
corpus.add_doc("a b c d e".split(), a_data=1)
corpus.add_doc("e f g h i".split(), a_data=2)
corpus.add_doc("i j k l m".split(), a_data=3)

model = DMRModel(k=10)
model.add_corpus(corpus)
# You lose `a_data` field in `corpus`,
# and `metadata` that `DMRModel` requires is filled with the default value, empty str.

assert model.docs[0].metadata == ''
assert model.docs[1].metadata == ''
assert model.docs[2].metadata == ''

def transform_a_data_to_metadata(misc: dict):
    return {'metadata': str(misc['a_data'])}
# this function transforms `a_data` to `metadata`

model = DMRModel(k=10)
model.add_corpus(corpus, transform=transform_a_data_to_metadata)
# Now docs in `model` has non-default `metadata`, that generated from `a_data` field.

assert model.docs[0].metadata == '1'
assert model.docs[1].metadata == '2'
assert model.docs[2].metadata == '3'

Pining Topics using Word Priors#

Since version 0.6.0, a new method tomotopy.LDAModel.set_word_prior has been added. It allows you to control word prior for each topic. For example, we can set the weight of the word ‘church’ to 1.0 in topic 0, and the weight to 0.1 in the rest of the topics by following codes. This means that the probability that the word ‘church’ is assigned to topic 0 is 10 times higher than the probability of being assigned to another topic. Therefore, most of ‘church’ is assigned to topic 0, so topic 0 contains many words related to ‘church’. This allows to manipulate some topics to be placed at a specific topic number.

import tomotopy as tp
mdl = tp.LDAModel(k=20)

# add documents into `mdl`

# setting word prior
mdl.set_word_prior('church', [1.0 if k == 0 else 0.1 for k in range(20)])

Examples#

Install or upgrade of ekorpkit#

Note

Install ekorpkit package first.

Set logging level to Warning, if you don’t want to see verbose logging.

If you run this notebook in Colab, set Hardware accelerator to GPU.

!pip install -U –pre ekorpkit[topic]

exit()

from ekorpkit import eKonf

eKonf.setLogger("WARNING")
print("version:", eKonf.__version__)
print("is notebook?", eKonf.is_notebook())
print("is colab?", eKonf.is_colab())
print("environment variables:")
eKonf.print(eKonf.env().dict())

data_dir = "../data/topic_models"

version: 0.1.39.post0.dev7
is notebook? True
is colab? False
environment variables:
{'CUDA_DEVICE_ORDER': None,
 'CUDA_VISIBLE_DEVICES': None,
 'EKORPKIT_CONFIG_DIR': '/workspace/projects/ekorpkit-book/config',
 'EKORPKIT_DATA_DIR': None,
 'EKORPKIT_LOG_LEVEL': 'WARNING',
 'EKORPKIT_PROJECT': 'ekorpkit-book',
 'EKORPKIT_WORKSPACE_ROOT': '/workspace',
 'KMP_DUPLICATE_LIB_OK': 'TRUE',
 'NUM_WORKERS': 230}

Load a dataset#

cfg = eKonf.compose("path")
cfg.cache.uri = "https://github.com/entelecheia/ekorpkit-book/raw/main/assets/data/us_equities_news_sampled.zip"
data = eKonf.load_data("us_equities_news_sampled.parquet", cfg.cached_path)
data.text[0]

'Investing com Asian stock markets were broadly lower for a second day on Thursday as weak U S data on durable goods orders added to concerns over the global growth outlook while concerns over declining corporate profits also weighed During late Asian trade Hong Kong s Hang Seng Index tumbled 1 55 Australia s ASX 200 Index dipped 0 1 while Japan s Nikkei 225 Index shed 0 7 The Nikkei came further off a one year closing high hit earlier in the week as investors cashed in ahead of the Japanese fiscal year end March is the final month of Japan s fiscal year and market participants have expected many funds to lock in profits from a meteoric 19 rally in the January to March period after shedding more than 13 in April to December Exporters which have gained sharply in the first quarter on the back a weakening yen declined Automakers Toyota and Nissan slumped 1 65 and 1 8 respectively while consumer electronics giant Sony retreated 1 5 On the upside Sharp saw shares jump 6 7 extending the previous day s 15 rally following reports that Taiwan s Hon Hai Precision Industry is buying 10 of the Japanese electronics manufacturer for JPY66 91 billion with the two to form a tie up in liquid crystal display production Elsewhere shares in Hong Kong came under pressure amid lingering fears over a hard landing in China and worries over declining corporate profits PICC Property Casualty China s biggest non life insurer saw shares drop 4 after reporting 2011 net income rose to CNY8 03 billion missing expectations for income of CNY8 8 billion China Shipping Container Lines fell 1 55 after the nation s second largest container carrier reported a loss of CNY2 74 billion last year wider than the average estimate of CNY2 63 billion amid rising fuel costs and declining demand Port operator China Merchants Holdings declined 1 7 after saying annual profit dropped 5 2 due to costs from a harbor deal Hong Kong traded Chinese banks were also broadly weaker with Industrial and Commercial Bank of China and Bank of China down 2 4 and 2 3 respectively ahead of their 2011 earnings reports later on Thursday Raw material producers also contributed to losses amid concerns over the global growth outlook Copper mining giant Jiangxi Copper Company dropped 3 4 Aluminum Corporation of China or CHALCO slumped 2 4 while shares in oil majors PetroChina and CNOOC fell 2 1 and 3 8 respectively But shares in Australia outperformed regional equities for a second day remaining close to a four month high The index was weighed by a 7 decline in Leighton Holdings the country s largest construction company after saying its underlying profit in the year ending December 31 will be between AUD400 million and AUD450 million below market expectations citing increased costs due to wet weather and lower than expected productivity Meanwhile European stock markets were mildly lower after the open supported by hopes that euro zone leaders will increase the size of the European debt firewall to combat the fiscal crisis The EURO STOXX 50 shed 0 3 France s CAC 40 dipped 0 1 Germany s DAX fell 0 25 while London s FTSE 100 edged 0 15 lower Later in the day Germany was to publish official data on employment change while the U S was to release official data on initial jobless claims'

LDA Basics#

LDA class provides Latent Dirichlet Allocation(LDA) topic model and its implementation is based on following papers:

Blei, D.M., Ng, A.Y., &Jordan, M.I. (2003).Latent dirichlet allocation.Journal of machine Learning research, 3(Jan), 993 - 1022.
Newman, D., Asuncion, A., Smyth, P., &Welling, M. (2009).Distributed algorithms for topic models.Journal of Machine Learning Research, 10(Aug), 1801 - 1828.

import tomotopy as tp

save_path = eKonf.join_path(data_dir, "lda_basic.mdl")
mdl = tp.LDAModel(tw=tp.TermWeight.ONE, min_cf=10, rm_top=10, k=20)
for n, line in enumerate(data["text"][:100]):
    ch = line.strip().split()
    mdl.add_doc(ch)
mdl.burn_in = 100
mdl.train(0)
print(
    "Num docs:",
    len(mdl.docs),
    ", Vocab size:",
    len(mdl.used_vocabs),
    ", Num words:",
    mdl.num_words,
)
print("Removed top words:", mdl.removed_top_words)
for i in range(0, 100, 10):
    mdl.train(10)
    print("Iteration: {}\tLog-likelihood: {}".format(i, mdl.ll_per_word))

mdl.save(save_path, full=True)

mdl.summary()

Show code cell output Hide code cell output

<Basic Info>
| LDAModel (current version: 0.12.3)
| 100 docs, 33695 words
| Total Vocabs: 8855, Used Vocabs: 972
| Entropy of words: 6.35945
| Entropy of term-weighted words: 6.35945
| Removed Vocabs: the to of and in a is for s on
|
<Training Info>
| Iterations: 100, Burn-in steps: 100
| Optimization Interval: 10
| Log-likelihood per word: -6.82328
|
<Initial Parameters>
| tw: TermWeight.ONE
| min_cf: 10 (minimum collection frequency of words)
| min_df: 0 (minimum document frequency of words)
| rm_top: 10 (the number of top words to be removed)
| k: 20 (the number of topics between 1 ~ 32767)
| alpha: [0.1] (hyperparameter of Dirichlet distribution for document-topic, given as a single `float` in case of symmetric prior and as a list with length `k` of `float` in case of asymmetric prior.)
| eta: 0.01 (hyperparameter of Dirichlet distribution for topic-word)
| seed: 3080440490 (random seed)
| trained in version 0.12.3
|
<Parameters>
| alpha (Dirichlet prior on the per-document topic distributions)
|  [0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1
|   0.1 0.1]
| eta (Dirichlet prior on the per-topic word distribution)
|  0.01
|
<Topics>
| #0 (4616) : has have be an it
| #1 (979) : trade market that from growth
| #2 (1177) : he He Trump his said
| #3 (1842) : as that The oil from
| #4 (1204) : It that The by some
| #5 (2673) : quarter 1 year 3 company
| #6 (783) : 1 ratio market margin value
| #7 (2400) : with this that from their
| #8 (1257) : its high as this while
| #9 (2030) : percent 1 0 by 2
| #10 (1354) : I gold t you silver
| #11 (1154) : revenue PepsiCo In billion that
| #12 (2895) : The earnings year P S
| #13 (1123) : Goldman by Reuters up last
| #14 (1125) : companies are NYSE they more
| #15 (1115) : with 7 by sales The
| #16 (2150) : said it not that company
| #17 (1204) : S at U said investors
| #18 (1736) : Zacks Rank growth has expected
| #19 (878) : down demand China it level
|

for k in range(mdl.k):
    print("Topic #{}".format(k))
    for word, prob in mdl.get_topic_words(k):
        print("\t", word, prob, sep="\t")

Show code cell output Hide code cell output

Topic #0
		has	0.043887220323085785
		have	0.041293028742074966
		be	0.040644481778144836
		an	0.03415900468826294
		it	0.03286190703511238
		are	0.03156481310725212
		will	0.031132448464632034
		at	0.02745734713971615
		or	0.027241162955760956
		than	0.023782242089509964
Topic #1
		trade	0.047546323388814926
		market	0.04248927906155586
		that	0.03844364359974861
		from	0.03439800813794136
		growth	0.03439800813794136
		8	0.030352376401424408
		4	0.029340967535972595
		quarter	0.02731814980506897
		25	0.02428392320871353
		results	0.022261105477809906
Topic #2
		he	0.05730922520160675
		He	0.04045604541897774
		Trump	0.037085410207509995
		his	0.03624275326728821
		said	0.03455743566155434
		but	0.031186800450086594
		was	0.028658824041485786
		by	0.026973506435751915
		with	0.026973506435751915
		will	0.025288190692663193
Topic #3
		as	0.05886959284543991
		that	0.05130905658006668
		The	0.046988748013973236
		oil	0.03240770846605301
		from	0.03186766803264618
		two	0.028627438470721245
		out	0.025927245616912842
		last	0.02484716847538948
		production	0.022687014192342758
		over	0.019986823201179504
Topic #4
		It	0.05109086260199547
		that	0.04944303259253502
		The	0.04861912131309509
		by	0.04037998989224434
		some	0.037908248603343964
		also	0.03296476975083351
		global	0.02966911531984806
		stock	0.028845202177762985
		price	0.024725638329982758
		world	0.023077810183167458
Topic #5
		quarter	0.04585271701216698
		1	0.044734448194503784
		year	0.03914311155676842
		3	0.03839759901165962
		company	0.038024842739105225
		2	0.03280625864863396
		million	0.03131523355841637
		from	0.02907869778573513
		billion	0.02460562437772751
		4	0.023860113695263863
Topic #6
		1	0.0706554651260376
		ratio	0.044164396822452545
		market	0.044164396822452545
		margin	0.04037995636463165
		value	0.03659551963210106
		company	0.029026644304394722
		debt	0.02524220570921898
		currency	0.021457767114043236
		13	0.02019628696143627
		total	0.02019628696143627
Topic #7
		with	0.05478229746222496
		this	0.05395232513546944
		that	0.05270737037062645
		from	0.047727540135383606
		their	0.02905316837131977
		just	0.023243363946676254
		see	0.022828377783298492
		as	0.02241339161992073
		investment	0.018678518012166023
		time	0.017433559522032738
Topic #8
		its	0.05605816841125488
		high	0.05132152512669563
		as	0.045795440673828125
		this	0.03553271293640137
		while	0.0292171910405159
		NASDAQ	0.0292171910405159
		week	0.02842774987220764
		day	0.026059428229928017
		new	0.02526998706161976
		markets	0.0244805496186018
Topic #9
		percent	0.08923283219337463
		1	0.05589492991566658
		0	0.05491440370678902
		by	0.04020649939775467
		2	0.0367746576666832
		with	0.024027807638049126
		yen	0.02255701646208763
		after	0.02255701646208763
		expected	0.020595964044332504
		than	0.018634909763932228
Topic #10
		I	0.09166838228702545
		gold	0.06160355731844902
		t	0.040338192135095596
		you	0.03740504011511803
		silver	0.029338866472244263
		Gold	0.027872290462255478
		out	0.02493913657963276
		think	0.020539406687021255
		do	0.019806118682026863
		money	0.019806118682026863
Topic #11
		revenue	0.05758258327841759
		PepsiCo	0.03695906326174736
		In	0.03524043411016464
		billion	0.03524043411016464
		that	0.03352180868387222
		The	0.029225243255496025
		EPS	0.026647305116057396
		are	0.024928677827119827
		growth	0.024928677827119827
		investment	0.02149142511188984
Topic #12
		The	0.057496074587106705
		earnings	0.048889391124248505
		year	0.03615150600671768
		P	0.02582348883152008
		S	0.02444641850888729
		3	0.02375788427889347
		2	0.02169227972626686
		up	0.02134801261126995
		average	0.02100374549627304
		stock	0.01824960671365261
Topic #13
		Goldman	0.03708771616220474
		by	0.03532205522060394
		Reuters	0.030025072395801544
		up	0.029142240062355995
		last	0.025610920041799545
		security	0.023845259100198746
		were	0.022962426766753197
		July	0.022079596295952797
		further	0.020313935354351997
		Inc	0.020313935354351997
Topic #14
		companies	0.08020481467247009
		are	0.06522314250469208
		NYSE	0.05464784428477287
		they	0.04142872244119644
		more	0.03702234849333763
		000	0.03525979816913605
		about	0.029972150921821594
		like	0.02820960246026516
		some	0.0238032303750515
		index	0.022921955212950706
Topic #15
		with	0.05157728120684624
		7	0.05157728120684624
		by	0.05068816989660263
		sales	0.0471317321062088
		The	0.046242620795965195
		year	0.03824062645435333
		Inc	0.024903977289795876
		Amazon	0.024903977289795876
		5	0.02401486597955227
		at	0.022236647084355354
Topic #16
		said	0.051863204687833786
		it	0.04815902188420296
		not	0.03704646974802017
		that	0.03519437834620476
		company	0.0342683307826519
		was	0.032879263162612915
		who	0.030564147979021072
		at	0.0277860090136528
		its	0.02454484812915325
		would	0.024081824347376823
Topic #17
		S	0.08322348445653915
		at	0.0683930441737175
		U	0.06674521416425705
		said	0.03131694346666336
		investors	0.02966911531984806
		dollar	0.027197375893592834
		The	0.023077810183167458
		seen	0.022253897041082382
		rate	0.02060607075691223
		up	0.019782157614827156
Topic #18
		Zacks	0.10655201971530914
		Rank	0.06072566285729408
		growth	0.049841899424791336
		has	0.034375499933958054
		expected	0.025783058255910873
		company	0.025783058255910873
		over	0.022918909788131714
		currently	0.019481932744383812
		Estimate	0.019481932744383812
		you	0.01890910230576992
Topic #19
		down	0.04619699716567993
		demand	0.041691072285175323
		China	0.037185147404670715
		it	0.030426261946558952
		level	0.028173299506306648
		after	0.028173299506306648
		inflation	0.027046818286180496
		consumer	0.025920337066054344
		Google	0.020287929102778435
		from	0.020287929102778435

LDA Visualization#

This example shows how to perform a Latent Dirichlet Allocation using tomotopy and visualize the result.

import tomotopy as tp
import nltk
import re
import numpy as np
import pyLDAvis
from nltk.corpus import stopwords

porter_stemmer = nltk.PorterStemmer().stem
english_stops = set(porter_stemmer(w) for w in stopwords.words("english"))
pat = re.compile("^[a-z]{2,}$")
corpus = tp.utils.Corpus(
    tokenizer=tp.utils.SimpleTokenizer(porter_stemmer),
    stopwords=lambda x: x in english_stops or not pat.match(x),
)

corpus.process(d.lower() for d in data["text"][:100])
# save preprocessed corpus for reuse
save_path = eKonf.join_path(data_dir, "preprocessed_corpus.cps")
corpus.save(save_path)

mdl = tp.LDAModel(min_df=5, rm_top=20, k=30, corpus=corpus)
mdl.train(0)

print(
    "Num docs:{}, Num Vocabs:{}, Total Words:{}".format(
        len(mdl.docs), len(mdl.used_vocabs), mdl.num_words
    )
)
print("Removed Top words: ", *mdl.removed_top_words)

Num docs:100, Num Vocabs:1074, Total Words:22386
Removed Top words:  year compani stock market earn said zack price percent share quarter expect growth report billion revenu trade investor nyse million

# Let's train the model
for i in range(0, 100, 10):
    print("Iteration: {:04}, LL per word: {:.4}".format(i, mdl.ll_per_word))
    mdl.train(10)
print("Iteration: {:04}, LL per word: {:.4}".format(1000, mdl.ll_per_word))

mdl.summary()

Show code cell output Hide code cell output

Iteration: 0000, LL per word: -12.02
Iteration: 0010, LL per word: -7.564
Iteration: 0020, LL per word: -7.357
Iteration: 0030, LL per word: -7.263
Iteration: 0040, LL per word: -7.193
Iteration: 0050, LL per word: -7.167
Iteration: 0060, LL per word: -7.159
Iteration: 0070, LL per word: -7.144
Iteration: 0080, LL per word: -7.138
Iteration: 0090, LL per word: -7.128
Iteration: 1000, LL per word: -7.122
<Basic Info>
| LDAModel (current version: 0.12.3)
| 100 docs, 22386 words
| Total Vocabs: 5139, Used Vocabs: 1074
| Entropy of words: 6.62188
| Entropy of term-weighted words: 6.62188
| Removed Vocabs: year compani stock market earn said zack price percent share quarter expect growth report billion revenu trade investor nyse million
|
<Training Info>
| Iterations: 100, Burn-in steps: 0
| Optimization Interval: 10
| Log-likelihood per word: -7.12162
|
<Initial Parameters>
| tw: TermWeight.ONE
| min_cf: 0 (minimum collection frequency of words)
| min_df: 5 (minimum document frequency of words)
| rm_top: 20 (the number of top words to be removed)
| k: 30 (the number of topics between 1 ~ 32767)
| alpha: [0.1] (hyperparameter of Dirichlet distribution for document-topic, given as a single `float` in case of symmetric prior and as a list with length `k` of `float` in case of asymmetric prior.)
| eta: 0.01 (hyperparameter of Dirichlet distribution for topic-word)
| seed: 3743484366 (random seed)
| trained in version 0.12.3
|
<Parameters>
| alpha (Dirichlet prior on the per-document topic distributions)
|  [0.32200417 0.60077    0.21077627 0.06970649 0.4447674  0.5434947
|   0.5469532  0.45518324 0.6338666  0.5719722  0.8117618  0.47851452
|   0.5830207  0.5280389  0.46882564 0.58426094 0.5307975  0.526721
|   0.66027516 0.20483024 0.817219   0.5032047  0.20158361 0.4396458
|   0.43283358 0.6222955  0.43815643 0.29211614 0.32323366 0.4832791 ]
| eta (Dirichlet prior on the per-topic word distribution)
|  0.01
|
<Topics>
| #0 (486) : currenc high forecast chang way
| #1 (1118) : make think would come anoth
| #2 (525) : yen trump japan presid reuter
| #3 (403) : gold silver metal mine see
| #4 (692) : fund yield larg etf countri
| #5 (753) : high manag move new last
| #6 (877) : week rose fall ahead juli
| #7 (765) : rank consensu posit cloud per
| #8 (788) : invest may time import major
| #9 (926) : buy estim current see sell
| #10 (1130) : industri past hold rank term
| #11 (686) : level valu move follow risk
| #12 (817) : increas rate new demand govern
| #13 (890) : dollar bank futur sinc like
| #14 (733) : secur firm accord activist group
| #15 (780) : estim per ep best beat
| #16 (888) : sector goldman invest us season
| #17 (702) : day averag drop show corpor
| #18 (969) : strong increas result continu cent
| #19 (279) : use system organ center network
| #20 (1220) : also busi one last grow
| #21 (566) : month data china gain rise
| #22 (483) : oil product mexico energi unit
| #23 (703) : index higher close nasdaq fell
| #24 (529) : plan car would state media
| #25 (1070) : inc product nasdaq technolog top
| #26 (739) : back could call pay comment
| #27 (623) : oper ratio total margin dividend
| #28 (571) : sale retail store amazon consum
| #29 (675) : peopl financi look equiti seen
|

topic_term_dists = np.stack([mdl.get_topic_word_dist(k) for k in range(mdl.k)])
doc_topic_dists = np.stack([doc.get_topic_dist() for doc in mdl.docs])
doc_topic_dists /= doc_topic_dists.sum(axis=1, keepdims=True)
doc_lengths = np.array([len(doc.words) for doc in mdl.docs])
vocab = list(mdl.used_vocabs)
term_frequency = mdl.used_vocab_freq

prepared_data = pyLDAvis.prepare(
    topic_term_dists,
    doc_topic_dists,
    doc_lengths,
    vocab,
    term_frequency,
    start_index=0,  # tomotopy starts topic ids with 0, pyLDAvis with 1
    sort_topics=False,  # IMPORTANT: otherwise the topic_ids between pyLDAvis and tomotopy are not matching!
)

pyLDAvis.display(prepared_data)

save_dir = "../../../assets/extra"
filename = "lda_basic.html"
save_path = eKonf.join_path(save_dir, filename)
pyLDAvis.save_html(prepared_data, save_path)

from IPython.display import display, HTML

display(HTML(f"<a href={save_path} target='_blank'>{filename}</a>"))

lda_basic.html

LDA coherence#

This example shows how to perform a Latent Dirichlet Allocation and calculate coherence of the results.

# calculate coherence using preset
for preset in ("u_mass", "c_uci", "c_npmi", "c_v"):
    coh = tp.coherence.Coherence(mdl, coherence=preset)
    average_coherence = coh.get_score()
    coherence_per_topic = [coh.get_score(topic_id=k) for k in range(mdl.k)]
    print("==== Coherence : {} ====".format(preset))
    print("Average:", average_coherence, "\nPer Topic:", coherence_per_topic)
    print()

Show code cell output Hide code cell output

==== Coherence : u_mass ====
Average: -1.2919665389931534 
Per Topic: [-1.17386418557316, -0.8680202687611864, -1.9011274512865044, -1.047667736559903, -0.8366067935289703, -1.0824157815953828, -1.0848464209397464, -1.042256352533198, -2.040907684969454, -0.9648597265277666, -0.6370809895484615, -1.0602086759498055, -0.9300058656023976, -0.8755223027370869, -3.8756595693460896, -0.9561534903362163, -1.0182683509407169, -1.0452619935102012, -0.8683999688876635, -1.6542543949045063, -0.9976812325252717, -1.3517324017228247, -1.4340397599847348, -1.1336003900525302, -3.3037643601664413, -0.9452589069885053, -1.3000326329631926, -1.1188280272579738, -1.1003419572424118, -1.1103284968522937]

==== Coherence : c_uci ====
Average: -4.878004424663513 
Per Topic: [-1.4440664765834577, -2.9780619010239433, -8.708752137868833, -3.0050339171614286, -3.9756801762639395, -5.2244161333617765, -6.687473406288814, -4.228681125865242, -7.003535776093569, -3.3012711354385487, -0.9000521475110517, -3.7900076427042646, -3.7450666628395886, -3.05733166371, -7.122763804824756, -4.240481845103053, -5.210568908088681, -3.822197030576038, -4.042555006747717, -8.432800473204122, -4.129114331246837, -4.856754672889186, -4.192296480531112, -5.375999981840818, -10.244442932288125, -1.8611091058190008, -7.7178064083191105, -2.0541820205768704, -5.6412571244580745, -9.346372310677454]

==== Coherence : c_npmi ====
Average: -0.1325481235208962 
Per Topic: [0.017405643466831665, -0.06713514820217745, -0.250626658245578, -0.04635542135098983, -0.12268777150029593, -0.1647925543989872, -0.19153348461822214, -0.11479310813952832, -0.20325291559000797, -0.05933672775478992, -0.01312994262040823, -0.10370859429506546, -0.12495710347546096, -0.08055725886583803, -0.18367815044046804, -0.1056187907799229, -0.16950989984821221, -0.1013101627858306, -0.12513122278003186, -0.23018273615527657, -0.13565862607035908, -0.14425460236335147, -0.029011488535021748, -0.16785973124959508, -0.3279976915096982, -0.03667616244336371, -0.25424020821444454, 0.04555992891405496, -0.15709031459299314, -0.32832280118185425]

==== Coherence : c_v ====
Average: 0.5564370689820499 
Per Topic: [0.5011142507195473, 0.5537001609802246, 0.6300975859165192, 0.6528063505887985, 0.5177711397409439, 0.4293242484331131, 0.593023756146431, 0.687737911939621, 0.48875993937253953, 0.6143026351928711, 0.4760036665014923, 0.4910168096423149, 0.4035642951726913, 0.5804461449384689, 0.6122414320707321, 0.5381382368505001, 0.5088414877653122, 0.5503150254487992, 0.49591032415628433, 0.6861237555742263, 0.4793075716122985, 0.5599398672580719, 0.6356056213378907, 0.5512014642357826, 0.6266018450260162, 0.5769483402371407, 0.5394160777330399, 0.6937022149562836, 0.5420901507139206, 0.4770597591996193]

import itertools

# calculate coherence using custom combination
for seg, cm, im in itertools.product(
    tp.coherence.Segmentation, tp.coherence.ConfirmMeasure, tp.coherence.IndirectMeasure
):
    coh = tp.coherence.Coherence(
        mdl, coherence=(tp.coherence.ProbEstimation.DOCUMENT, seg, cm, im)
    )
    average_coherence = coh.get_score()
    coherence_per_topic = [coh.get_score(topic_id=k) for k in range(mdl.k)]
    print("==== Coherence : {}, {}, {} =S===".format(repr(seg), repr(cm), repr(im)))
    print("Average:", average_coherence, "\nPer Topic:", coherence_per_topic)
    print()

Show code cell output Hide code cell output

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.NONE: 0> =S===
Average: 0.1351874610643829 
Per Topic: [0.11101613159001103, 0.14175311097344043, 0.16218523657111322, 0.18403090427713625, 0.13375353018989322, 0.11100999661663936, 0.11403467499327492, 0.19470537423940926, 0.09631062974767758, 0.16263582197968346, 0.13661738585142094, 0.11390028382623481, 0.11258613327854435, 0.1902236282755808, 0.1258461538435896, 0.1319174081261988, 0.11790130382631658, 0.101183821270789, 0.13940305217600987, 0.16222103513433575, 0.11832280151689759, 0.09788931234235808, 0.1759731325287265, 0.1419625983771921, 0.12283456255415204, 0.11983944773489327, 0.1397846799755621, 0.17328171317695196, 0.12248752611724914, 0.10001244082020401]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.351438074447317 
Per Topic: [0.33813833362526363, 0.430223838157124, 0.2877488086620967, 0.459748269058764, 0.37483677003118726, 0.3659698486328125, 0.3136667526430554, 0.4313059644566642, 0.3343541645341449, 0.451196633776029, 0.3850690140078465, 0.40851979123221505, 0.29808357678767705, 0.5845632533232371, 0.1891543977169527, 0.32126699541178017, 0.3908944098485841, 0.3035161207119624, 0.34076554398569797, 0.3225314312924941, 0.3825223781996303, 0.29911605285273657, 0.3335316630287303, 0.3914314246426026, 0.2338264191316234, 0.35029977937018786, 0.3607129294011328, 0.31518845558166503, 0.2355661774882012, 0.30939303582741157]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.DICE: 2> =S===
Average: -0.8293424091943761 
Per Topic: [0.007356486366026932, 0.09002556798255278, -19.3119921805544, 0.0981799298690425, 0.041860342708726725, 0.058977754372689456, -0.42121727665782804, -1.2304922645497653, 0.02389199184771213, 0.030123389098379347, 0.04062372690273656, 0.0743200851811303, -0.056963336705747575, 0.18184349636236827, -1.326329428785377, -0.430772896633587, 0.013559697485632367, -0.07364538814872504, -0.016037708872722253, -0.8799245460269352, 0.06188205188243753, -0.0832902544281549, -0.006324527770306708, 0.02566142504931324, -0.44272010181254395, -0.1809079928504717, -0.06870913652399192, -0.8275042325607501, -0.256899498630729, -0.014847449427987967]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.JACCARD: 3> =S===
Average: -0.05052750938504081 
Per Topic: [0.028326774704166585, 0.1131498378701508, -0.3143009778112173, 0.11495007860163847, 0.05273476496545805, 0.07745150006893609, -0.19890510874489944, -0.4506096941108505, 0.03240897646173835, 0.05276763825159934, 0.07799616557442479, 0.10300765294167731, 0.029785138120253882, 0.22646935797399945, -0.30842970315780904, -0.21479199138056074, 0.04015165230052339, -0.06290167171715035, 0.03329444641454352, -0.3777039244822744, 0.0837298682696807, -0.048015411863000027, 0.0031752510759462086, 0.06049309560718636, -0.2972231205811517, -0.10548304048522066, -0.04337195058696024, -0.14943984682775205, -0.09097979786909288, 0.016438758864791857]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.NONE: 0> =S===
Average: 1.7045623505630207 
Per Topic: [1.5266369490971852, 1.5463879113428056, 2.5005275414844204, 2.6211654329045535, 1.615207291260249, 1.4244162026935911, 1.524343961603905, 1.8431163771644157, 1.4003549336881114, 1.482846216572118, 1.3435245777160014, 1.4048138054696506, 1.3353415497716667, 1.7875376955323294, 1.9259544157948747, 1.4914559665787048, 1.5159353629304715, 1.3652218150352202, 1.4493477040020373, 2.7839088670144436, 1.3712513309499346, 1.4650723061795412, 2.5894590816212744, 1.6079433553160218, 1.9996923130945299, 1.4098756572162499, 1.7141295931091425, 2.025746019068962, 1.6136944791990455, 1.451961803479181]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.8551825504170523 
Per Topic: [0.8568254854944017, 0.906697412331899, 0.7271665891011556, 0.8156261616282993, 0.8707838190926446, 0.8924511830012004, 0.8620413064956665, 0.8509526835547553, 0.8582547346750895, 0.9164952185418871, 0.937135538789961, 0.9030599276224772, 0.9020455718040467, 0.9209719644652472, 0.7183065354824066, 0.8972178141276042, 0.8740266468789842, 0.8944442338413663, 0.9104682432280646, 0.6893860750728183, 0.926681931813558, 0.8576761537128025, 0.7724587930573358, 0.8919851157400344, 0.6979990694257948, 0.9129396239916484, 0.8691527287165324, 0.8243044892946879, 0.8304114023844401, 0.8675100591447619]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.DICE: 2> =S===
Average: 0.3953467755295613 
Per Topic: [0.3952705932988061, 0.4116775261031257, 0.3457771937052409, 0.3738812698258294, 0.3972695622179243, 0.4121724492973752, 0.3927939838833279, 0.39261747068829006, 0.3986912210782369, 0.4234406484497918, 0.4311494575606452, 0.4138374083571964, 0.41399666402075025, 0.4178920825322469, 0.35133889416853586, 0.4098752948972914, 0.4021126025252872, 0.41012402772903445, 0.41826743086179097, 0.3370403753386603, 0.4250685731569926, 0.3981595748000675, 0.35959198474884035, 0.4010718471474118, 0.3426904241243998, 0.41851362850930957, 0.3994131624698639, 0.3785300427012973, 0.3875945382648044, 0.4005433334244622]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.6596688153346381 
Per Topic: [0.6563803288671706, 0.7020963006549411, 0.5323186298211415, 0.6005045566293928, 0.6619628310203552, 0.7054342985153198, 0.6484619776407877, 0.6504347576035394, 0.6657542082998488, 0.736786593331231, 0.759814174969991, 0.7082306557231479, 0.710816662841373, 0.7194733553462558, 0.5525827613141802, 0.6960519154866537, 0.675995961825053, 0.6966831141048008, 0.7229824145634969, 0.5109996477762858, 0.7403956360287136, 0.6643118752373589, 0.5646324700779385, 0.6727087252669864, 0.5272161457273695, 0.720808560318417, 0.6659841272566054, 0.612063941028383, 0.6380366259151035, 0.6701412068472968]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.NONE: 0> =S===
Average: 833248193.1610017 
Per Topic: [2.0082930761066575, 2.0649914529938123, 5.536606056036479, 4074074078.1978583, 2.210945653270672, 1.931628999164661, 2.125470572564088, 2.950515135257185, 1.8025983892937552, 2.373465908815218, 1.8405752408315796, 1.8027322625961777, 1367816093.6748478, 2.8848589893698096, 19555555557.625587, 2.0319740312759507, 1.9730925319655515, 1.6820690483526008, 1.9618860918242997, 4.003495981109249, 1.859194891827816, 1.9271989339005193, 3.968215875053943, 2.4390567261732707, 2.937843376827051, 1.7903929410806272, 2.227756137638969, 2.9174473414104023, 2.1420887517385103, 1.937361064931399]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.7402534365717239 
Per Topic: [0.7901608188947041, 0.843464023537106, 0.4551342276028461, 0.3149481895897124, 0.7948817253112793, 0.8571844696998596, 0.8122045384513007, 0.7541576802730561, 0.8069455133544075, 0.8641168276468912, 0.8905021548271179, 0.8615469124582079, 0.7176932009588028, 0.8644522918595208, 0.27564658059061264, 0.831760479344262, 0.8138297478357951, 0.8520880222320557, 0.8684459182951185, 0.42272013458940716, 0.8856943421893649, 0.8053802145851983, 0.63527436653773, 0.8174244178666009, 0.4379139787207047, 0.8728516340255738, 0.8224291814698114, 0.6472298730578687, 0.7715571337276035, 0.8199644976192051]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.DICE: 2> =S===
Average: 0.33317976239575325 
Per Topic: [0.3563797414302826, 0.3753472553359138, 0.2330093636364418, 0.1232157862403148, 0.35636063549253677, 0.3826446420616574, 0.3597119006845686, 0.3548059927092658, 0.36521188616752626, 0.394514090485043, 0.40643646253479854, 0.38578488296932645, 0.3188276247833031, 0.3827680640750461, 0.10762037602072624, 0.3783985793590546, 0.3688957737551795, 0.3822804285420312, 0.39233422146903146, 0.18592197238293587, 0.39835565818680657, 0.36646339164839853, 0.29996280868848163, 0.35952309105131364, 0.1976712567250935, 0.3922528200679355, 0.3683128794034322, 0.2780546369890838, 0.3555277672078874, 0.3687988817691803]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.5316045413682643 
Per Topic: [0.5580196784602272, 0.6070075160927243, 0.3320080794477474, 0.19004649233998394, 0.5593037558926477, 0.625640649927987, 0.5640980223814647, 0.5553434246116215, 0.5791655408011542, 0.6549664517243703, 0.6875664088461134, 0.6306196133295695, 0.5326852523605549, 0.6239841527409024, 0.16353459524120556, 0.6113076812691158, 0.5893126898341708, 0.6207689881324768, 0.6512066079510583, 0.2668755257019132, 0.6637576699256897, 0.5820300175084008, 0.4329821487267812, 0.5680274128913879, 0.29369887570285214, 0.6470509966214498, 0.5842362410492368, 0.4285337212423949, 0.557279497385025, 0.5870785329076978]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.NONE: 0> =S===
Average: 0.5405859769669961 
Per Topic: [0.5826810441083398, 0.6660489351224613, 0.359891652097845, 1.321096407409169, 0.6701883879830325, 0.5319959935158121, 0.5658087696202627, 0.9456190446093845, -0.11162484937707866, 0.7352332243367302, 0.5597643057419147, 0.522965590300311, 0.7758243685876923, 0.8836533972493942, -1.1621559621911957, 0.6014894707978905, 0.5812047616908983, 0.4615187886617018, 0.6076949115341027, 1.0887713264147822, 0.5399529510350565, 0.4875531804415069, 1.1033524638471561, 0.6779175245241125, -1.0429912454491495, 0.5440375487491753, 0.6974779167686911, 0.904638576448713, 0.6177343167893081, 0.5002365076418617]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.11935449063389972 
Per Topic: [0.09104066548170522, 0.06104483656365321, 0.3093050652080112, 0.28175778368280996, 0.08212855363470023, 0.008025267011382514, 0.12080522253006873, 0.1865225060661841, 0.030552612939532587, 0.1300905320340664, 0.059253980701517626, 0.041128822967099646, -0.013953080879420869, 0.09569137090713614, 0.17172210053023365, 0.13141210740238118, 0.09200627624264193, 0.09485439706542012, 0.03239099182925808, 0.3284517904122671, 0.029139023562897312, 0.09320585189982214, 0.20915542462219794, 0.14141440639117112, 0.19086722037431578, 0.07778486348171201, 0.1352714304941603, 0.24153346777929982, 0.08869124026872062, 0.039339987812046374]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.DICE: 2> =S===
Average: 9.764055218641406 
Per Topic: [1.4785268584887186, 8.029365785916646, 0.9702214002609253, 1.8041651871469286, 6.375461806191338, -0.7050536228550806, 1.4947402609719171, 2.4129498998324075, 3.9956230812602573, 5.715696952078077, -3.1508254329363505, -3.0207321961720783, -0.4067401882674959, 1.2127181026670668, 1.1320184853341844, 211.9972156604131, 1.2562444779607984, 1.7112665004200405, -0.5146351165241665, 0.9359890752368503, -4.61125920481152, 5.525151567988925, 39.07899330457052, 1.171814935737186, 0.9255641990237766, 2.0271761390897964, 1.5980050881703696, 1.2008336848682828, 1.1204772697554695, 2.160682597425249]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.JACCARD: 3> =S===
Average: -20.825088196727965 
Per Topic: [-2.7075881520907084, -2.272716283135944, 3.2274680429034763, 8.821884269184537, -4.040207725101047, -2.5036019414663313, -0.882104041841295, 18.635549034012687, -1.0613703462812636, -3.0882180836465625, -1.289473260111279, -2.3885018050670626, -2.2382670289940303, -2.2146502597464455, -0.25286320050557454, -6.070304331514571, -3.884779371155633, -2.4781847609413994, -2.379622534248564, -604.0762025992076, -1.6511220229996575, -0.15254030227661133, -5.448412187894186, -3.880208917458852, 4.612057495117187, -3.180569440788693, -3.9969701210657758, -0.6741435077455309, 6.395788433154424, -3.6327709509266746]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.NONE: 0> =S===
Average: 0.2978416590213351 
Per Topic: [0.3780543494180079, 0.41296956919561634, 0.18980654629698537, 0.7691373178155088, 0.43306236075249366, 0.309001147774482, 0.3558471752030721, 0.5755005634088165, -0.2125246165440904, 0.3698390002038444, 0.28614394288966083, 0.3204577164651875, 0.2715209622206862, 0.5262686189508873, -1.423538163168902, 0.3624002845095718, 0.3701383246947222, 0.2877505226455023, 0.3516867899079114, 0.8617703727076176, 0.29600534443070325, 0.3136588599661505, 0.839832037893, 0.4166274363483794, -0.9223808370094703, 0.3299575706066555, 0.48694471889678254, 0.6346445910731934, 0.4248759551503945, 0.31979130793668337]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.17994302767120876 
Per Topic: [0.12805647044959995, 0.11670115184194098, 0.32550658281478617, 0.3205361928480367, 0.1428330622962676, 0.17555796836192408, 0.16637061934840555, 0.2242692696241041, 0.06496669764682236, 0.2734027441797985, 0.23776455167163577, 0.1406386292203226, 0.10335524227573639, 0.2027769971690658, 0.1911078766376401, 0.19030399062256848, 0.15196374676500757, 0.13059370080526506, 0.12559629707700676, 0.3441504937079218, 0.1384310316397912, 0.1333530227472592, 0.21875141391323671, 0.17642224971204995, 0.20602956772264508, 0.14699623237570955, 0.16957532687520144, 0.2709827545409401, 0.10145181030060889, 0.07984513494496544]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.DICE: 2> =S===
Average: 2.7841405712333676 
Per Topic: [1.666050275001261, 0.6319933338297739, 0.9500268008973863, 1.0386896597014532, 2.250735392794013, 0.2756188680003915, 0.17857569058736164, 0.9083827204174466, -2.425627151255806, 0.8707619605792893, -0.2592729784548283, -0.19907247605216172, 0.802290514152911, 0.8496030672556825, 1.077572684817844, 1.2845817714929582, 3.02141981518103, 1.3063109305169847, -0.03761112979716725, 0.9118721087773641, -0.33338644554217656, 1.1887123200628493, 1.0431044512324863, 1.445079586075412, 0.9786576999558343, 61.388298433356816, 1.0510625998179117, 1.1705422348446317, 1.6549388404418197, -1.1656944416877297]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.JACCARD: 3> =S===
Average: 7.0628127242821375 
Per Topic: [-9.166540891263221, -2.342065739466084, -6.15212869114346, -43.87733449406094, 56.44401636720738, -2.6669036716222765, 1.8036339667108323, 0.8181499573919508, -3.2101934199945794, -3.841389800608158, -1.0703579567372798, -2.519921213326355, -4.007283565733168, -2.817425860464573, -3.0735806226730347, -1.983945485121674, -5.066192412128051, -8.380187131961186, -2.8874885131087566, -4.33076155450609, -1.4770257814062966, -2.8912580755021837, 289.9129321826829, -3.833444407582283, -8.179676280419033, -4.677306281195746, -2.820743430985345, 2.5057655387454565, -4.367011500419014, -3.959949502845605]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.NONE: 0> =S===
Average: 0.1766937747568545 
Per Topic: [0.14824682586212107, 0.18708438146379341, 0.20281956968249248, 0.24425282842966542, 0.17419189085101458, 0.14265440707643529, 0.14944404510495848, 0.2521999987572534, 0.10834079577106827, 0.2163301201716118, 0.19242175789761948, 0.15242108698649745, 0.15412284717444524, 0.24714972156952805, 0.11141293530537302, 0.17405880215656602, 0.15648258888424169, 0.13498831971804393, 0.1852731795046233, 0.2408658489817811, 0.15769056713778093, 0.12939673533270415, 0.25091810534198355, 0.18260028518696703, 0.12661959749425444, 0.15883896199371866, 0.18870800407669353, 0.2339521853385075, 0.16486529111142129, 0.1324615583424696]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.30799426429340176 
Per Topic: [0.2577278188119332, 0.36455040506811603, 0.27655373497141733, 0.37316074293080925, 0.30523637334505715, 0.31315051799433097, 0.25819794668091667, 0.3904047669635879, 0.226201621008416, 0.4634987852639622, 0.43473703985412915, 0.3618507874508699, 0.286240578111675, 0.5096372005012301, 0.16657824232760404, 0.31665071715186865, 0.29665989984447755, 0.26487721796664926, 0.34197703417804504, 0.3128788981690175, 0.3643739732810193, 0.22974988263514307, 0.2334528881435593, 0.34796334778269133, 0.1884039598180809, 0.3390489612188604, 0.2937325588117043, 0.29894404822132653, 0.18890943971152108, 0.2344785405840311]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.DICE: 2> =S===
Average: -6.696977050953278 
Per Topic: [-0.501081607511474, -0.058775322056478924, -5.950174129340383, -1.0146991021827692, -0.19650192408718997, -0.5972039008306133, 68.61222103497842, -0.5676436835382548, -0.43249382376670836, -0.2505337163909442, 0.004097158461809159, -0.45448160244462393, 2.5815661487686965, 0.10029054445525011, 3.3434767007827757, -12.810671669674209, -0.8501870264609654, -0.34551202019469607, 13.165061561514934, 0.3700122098128001, -0.013202836861213048, 0.8229714737584194, -0.018339197296235295, -0.1298573174048215, -0.5295765637523598, -270.0585608153294, -0.7114471904726491, 5.680398739212089, 0.04282868843939569, -0.14129233918696021]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.JACCARD: 3> =S===
Average: 2.1071468107418276 
Per Topic: [-0.2321656532999542, -0.009382950059241718, -1.7773478090763093, -0.3373492727159626, -0.12773477082244225, -0.14153379789657064, -0.58121748691968, -0.5889425377361477, -0.20039150988062224, -0.03989723877360423, 0.0805715151131153, -0.027872106507937942, -0.122680844925344, 0.12404205043696695, -1.1835544238487878, -0.5044889359683212, -0.24931146192053955, -0.21346006609706414, -0.1409227941185236, 9.27308953884575, 0.0325390866647164, -0.3664802716837989, -0.6178307306435373, -0.08279434137253297, 63.20406588969959, -0.19907079886438117, -0.30779310458650194, -0.6767237674030993, -0.5674934085458517, -0.2034636748385512]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.NONE: 0> =S===
Average: -1.2272521961226583 
Per Topic: [-1.1493610416512177, -0.9092812064780683, -1.970326895536801, -1.209571909913137, -1.0353222293274311, -0.9238462153413554, -1.1178646906898124, -0.9071476677119651, -1.7523576293806506, -0.7866509199609911, -0.6362178372900401, -0.9782684757568221, -0.8257068116299219, -0.7863431660657688, -3.6965884747288964, -0.9244933878967375, -1.0690041290183774, -0.9893611453220097, -0.795524288275124, -1.504613733026627, -0.7923858074358401, -1.1512319143641574, -1.2850092927096146, -0.9458022725617946, -3.164209136644416, -0.8555574890591362, -1.1133245636868838, -1.171646059376813, -1.200905406979172, -1.1696420858601588]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.4489587392133696 
Per Topic: [0.4172645108567344, 0.3680855777528551, 0.47551360925038655, 0.46617349055078294, 0.42326092951827576, 0.6414405763149261, 0.38452149331569674, 0.4236492501364814, 0.45179317759142984, 0.4734817637337579, 0.513688063621521, 0.4954831686284807, 0.5364161756303575, 0.4674039426777098, 0.3921919425328573, 0.45504247943560283, 0.40978858404689367, 0.34588676674498453, 0.4799913730886247, 0.5026300244861179, 0.48717684911357034, 0.3698685281806522, 0.45938059820069205, 0.48731302552753025, 0.452699175145891, 0.40655995971626707, 0.4119377543528875, 0.42373811536365086, 0.4009397059679031, 0.44544156491756437]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.DICE: 2> =S===
Average: 0.7170875298093867 
Per Topic: [0.7426078796386719, 0.7015290180842082, 0.7337837298711141, 0.7339953780174255, 0.7182757245169745, 0.7243074602550931, 0.7232087903552585, 0.7609699527422588, 0.7437359650929769, 0.704906948407491, 0.6405402686860826, 0.6788614100880093, 0.7135660039054023, 0.6997373965051439, 0.7442686306105719, 0.7021779139836629, 0.7368649337026808, 0.714368814892239, 0.682016827000512, 0.7392514573203193, 0.7062499708599514, 0.7357237325774298, 0.7286302831437853, 0.7034077525138855, 0.7649813916948106, 0.6894622564315795, 0.6908912089135911, 0.7162305646472507, 0.7261502756012811, 0.7119239542219374]

==== Coherence : <Segmentation.ONE_ONE: 1>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.JACCARD: 3> =S===
Average: 2.9189544194716004 
Per Topic: [3.515099125438266, 2.5720635890960692, 3.0092657830980087, 3.2807204246521, 2.874766953786214, 3.7289518859651354, 2.8617791864607067, 3.7924955818388195, 3.4049933248096043, 2.7408284081353083, 1.896372185813056, 2.4040002981821695, 3.1123066107432047, 2.618385820918613, 3.2927805900573732, 2.509185202916463, 3.3346821069717407, 2.619035180409749, 2.6232512129677668, 3.124668386247423, 2.6490831957923042, 3.215196074379815, 2.9361532661649914, 2.5999829795625473, 3.845041184955173, 2.3929948170979816, 2.3027957253985933, 2.6045406103134154, 2.966953155729506, 2.740259716245863]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.NONE: 0> =S===
Average: 0.1259989069056579 
Per Topic: [0.1061245305061154, 0.14334851156873857, 0.16778857356312876, 0.20327777777305223, 0.1617922072094629, 0.08897302425204513, 0.11153553343618199, 0.1664350771921802, 0.06624612605006143, 0.13548263442620365, 0.1378050723416717, 0.10646782095923937, 0.10722655288680988, 0.16845434930938016, 0.10646723646540263, 0.12095094119498716, 0.1218401308467869, 0.09246942280151692, 0.12911488191024367, 0.14767602767328883, 0.09694586040870846, 0.08211303852897013, 0.15206043846975117, 0.11745118766041761, 0.10556810678833571, 0.10622589196313859, 0.11827787994978789, 0.17570182084653996, 0.12563702444193925, 0.11050952574565061]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.351438074447317 
Per Topic: [0.33813833362526363, 0.430223838157124, 0.2877488086620967, 0.459748269058764, 0.37483677003118726, 0.3659698486328125, 0.3136667526430554, 0.4313059644566642, 0.3343541645341449, 0.451196633776029, 0.3850690140078465, 0.40851979123221505, 0.29808357678767705, 0.5845632533232371, 0.1891543977169527, 0.32126699541178017, 0.3908944098485841, 0.3035161207119624, 0.34076554398569797, 0.3225314312924941, 0.3825223781996303, 0.29911605285273657, 0.3335316630287303, 0.3914314246426026, 0.2338264191316234, 0.35029977937018786, 0.3607129294011328, 0.31518845558166503, 0.2355661774882012, 0.30939303582741157]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.DICE: 2> =S===
Average: -0.8293424091943761 
Per Topic: [0.007356486366026932, 0.09002556798255278, -19.3119921805544, 0.0981799298690425, 0.041860342708726725, 0.058977754372689456, -0.42121727665782804, -1.2304922645497653, 0.02389199184771213, 0.030123389098379347, 0.04062372690273656, 0.0743200851811303, -0.056963336705747575, 0.18184349636236827, -1.326329428785377, -0.430772896633587, 0.013559697485632367, -0.07364538814872504, -0.016037708872722253, -0.8799245460269352, 0.06188205188243753, -0.0832902544281549, -0.006324527770306708, 0.02566142504931324, -0.44272010181254395, -0.1809079928504717, -0.06870913652399192, -0.8275042325607501, -0.256899498630729, -0.014847449427987967]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.JACCARD: 3> =S===
Average: -0.05052750938504081 
Per Topic: [0.028326774704166585, 0.1131498378701508, -0.3143009778112173, 0.11495007860163847, 0.05273476496545805, 0.07745150006893609, -0.19890510874489944, -0.4506096941108505, 0.03240897646173835, 0.05276763825159934, 0.07799616557442479, 0.10300765294167731, 0.029785138120253882, 0.22646935797399945, -0.30842970315780904, -0.21479199138056074, 0.04015165230052339, -0.06290167171715035, 0.03329444641454352, -0.3777039244822744, 0.0837298682696807, -0.048015411863000027, 0.0031752510759462086, 0.06049309560718636, -0.2972231205811517, -0.10548304048522066, -0.04337195058696024, -0.14943984682775205, -0.09097979786909288, 0.016438758864791857]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.NONE: 0> =S===
Average: 1.7045623505630207 
Per Topic: [1.5266369490971856, 1.5463879113428052, 2.500527541484421, 2.6211654329045517, 1.6152072912602486, 1.4244162026935918, 1.5243439616039047, 1.8431163771644155, 1.4003549336881116, 1.4828462165721186, 1.3435245777160023, 1.4048138054696502, 1.3353415497716672, 1.7875376955323292, 1.9259544157948747, 1.4914559665787055, 1.5159353629304713, 1.3652218150352202, 1.4493477040020368, 2.7839088670144436, 1.3712513309499348, 1.4650723061795419, 2.5894590816212744, 1.607943355316022, 1.9996923130945297, 1.4098756572162503, 1.7141295931091431, 2.025746019068962, 1.6136944791990455, 1.451961803479182]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.8551825504170523 
Per Topic: [0.8568254854944017, 0.906697412331899, 0.7271665891011556, 0.8156261616282993, 0.8707838190926446, 0.8924511830012004, 0.8620413064956665, 0.8509526835547553, 0.8582547346750895, 0.9164952185418871, 0.937135538789961, 0.9030599276224772, 0.9020455718040467, 0.9209719644652472, 0.7183065354824066, 0.8972178141276042, 0.8740266468789842, 0.8944442338413663, 0.9104682432280646, 0.6893860750728183, 0.926681931813558, 0.8576761537128025, 0.7724587930573358, 0.8919851157400344, 0.6979990694257948, 0.9129396239916484, 0.8691527287165324, 0.8243044892946879, 0.8304114023844401, 0.8675100591447619]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.DICE: 2> =S===
Average: 0.3953467755295613 
Per Topic: [0.3952705932988061, 0.4116775261031257, 0.3457771937052409, 0.3738812698258294, 0.3972695622179243, 0.4121724492973752, 0.3927939838833279, 0.39261747068829006, 0.3986912210782369, 0.4234406484497918, 0.4311494575606452, 0.4138374083571964, 0.41399666402075025, 0.4178920825322469, 0.35133889416853586, 0.4098752948972914, 0.4021126025252872, 0.41012402772903445, 0.41826743086179097, 0.3370403753386603, 0.4250685731569926, 0.3981595748000675, 0.35959198474884035, 0.4010718471474118, 0.3426904241243998, 0.41851362850930957, 0.3994131624698639, 0.3785300427012973, 0.3875945382648044, 0.4005433334244622]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.6596688153346381 
Per Topic: [0.6563803288671706, 0.7020963006549411, 0.5323186298211415, 0.6005045566293928, 0.6619628310203552, 0.7054342985153198, 0.6484619776407877, 0.6504347576035394, 0.6657542082998488, 0.736786593331231, 0.759814174969991, 0.7082306557231479, 0.710816662841373, 0.7194733553462558, 0.5525827613141802, 0.6960519154866537, 0.675995961825053, 0.6966831141048008, 0.7229824145634969, 0.5109996477762858, 0.7403956360287136, 0.6643118752373589, 0.5646324700779385, 0.6727087252669864, 0.5272161457273695, 0.720808560318417, 0.6659841272566054, 0.612063941028383, 0.6380366259151035, 0.6701412068472968]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.NONE: 0> =S===
Average: 1014644532.1307074 
Per Topic: [2.0716905148356126, 2.045420831647486, 6.179741581835933, 8148148151.5435505, 1.96388427810584, 2.1567399942720877, 2.1689504559035084, 3.295981720050152, 2.031448664028519, 2.761785608553607, 1.817477707696492, 1.8154245281843358, 2735632185.589626, 3.2561513797552215, 19555555557.634724, 2.1183098679418264, 1.9320395422644892, 1.742234440291489, 2.0395844394243032, 4.133499290822854, 2.070774345681312, 2.1017326507876506, 4.527260013181598, 2.805270916317296, 3.061376765367417, 1.8785028352814224, 2.348135626366958, 2.8691604139030074, 2.207386240153904, 1.7533559920550403]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.7402534365717239 
Per Topic: [0.7901608188947041, 0.843464023537106, 0.4551342276028461, 0.3149481895897124, 0.7948817253112793, 0.8571844696998596, 0.8122045384513007, 0.7541576802730561, 0.8069455133544075, 0.8641168276468912, 0.8905021548271179, 0.8615469124582079, 0.7176932009588028, 0.8644522918595208, 0.27564658059061264, 0.831760479344262, 0.8138297478357951, 0.8520880222320557, 0.8684459182951185, 0.42272013458940716, 0.8856943421893649, 0.8053802145851983, 0.63527436653773, 0.8174244178666009, 0.4379139787207047, 0.8728516340255738, 0.8224291814698114, 0.6472298730578687, 0.7715571337276035, 0.8199644976192051]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.DICE: 2> =S===
Average: 0.33317976239575325 
Per Topic: [0.3563797414302826, 0.3753472553359138, 0.2330093636364418, 0.1232157862403148, 0.35636063549253677, 0.3826446420616574, 0.3597119006845686, 0.3548059927092658, 0.36521188616752626, 0.394514090485043, 0.40643646253479854, 0.38578488296932645, 0.31882762478330307, 0.3827680640750461, 0.10762037602072623, 0.3783985793590546, 0.3688957737551795, 0.3822804285420312, 0.39233422146903146, 0.18592197238293587, 0.39835565818680657, 0.36646339164839853, 0.29996280868848163, 0.35952309105131364, 0.19767125672509347, 0.3922528200679355, 0.3683128794034322, 0.2780546369890838, 0.3555277672078874, 0.3687988817691803]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.5316045413682643 
Per Topic: [0.5580196784602272, 0.6070075160927243, 0.3320080794477474, 0.19004649233998394, 0.5593037558926477, 0.625640649927987, 0.5640980223814647, 0.5553434246116215, 0.5791655408011542, 0.6549664517243703, 0.6875664088461134, 0.6306196133295695, 0.532685252360555, 0.6239841527409024, 0.1635345952412056, 0.6113076812691158, 0.5893126898341708, 0.6207689881324768, 0.6512066079510583, 0.26687552570191325, 0.6637576699256897, 0.5820300175084008, 0.4329821487267812, 0.5680274128913879, 0.29369887570285214, 0.6470509966214498, 0.5842362410492368, 0.4285337212423949, 0.557279497385025, 0.5870785329076978]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.NONE: 0> =S===
Average: 0.5844497149251596 
Per Topic: [0.5936046262814404, 0.6503789352872539, 0.35685782028959667, 1.5368331864793847, 0.5886256409357892, 0.5971336365555293, 0.5579301101135489, 1.0312153663442274, -0.015801263126963696, 0.8471407045467768, 0.5532432510658585, 0.5334630190756529, 1.0494534604589918, 0.9489622076240566, -1.1309658679251207, 0.6300692686554132, 0.5668640611007268, 0.48795462530517447, 0.6444204216480794, 1.1198775447601903, 0.6320095574554566, 0.5382207289094766, 1.1684190336359297, 0.7448662141188256, -1.003586996406117, 0.5806891532275694, 0.74467025670524, 0.9061688786978715, 0.6163566360309041, 0.458417229904026]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.11935449063389972 
Per Topic: [0.09104066548170522, 0.06104483656365321, 0.3093050652080112, 0.28175778368280996, 0.08212855363470023, 0.008025267011382514, 0.12080522253006873, 0.1865225060661841, 0.030552612939532587, 0.1300905320340664, 0.059253980701517626, 0.041128822967099646, -0.013953080879420869, 0.09569137090713614, 0.17172210053023365, 0.13141210740238118, 0.09200627624264193, 0.09485439706542012, 0.03239099182925808, 0.3284517904122671, 0.029139023562897312, 0.09320585189982214, 0.20915542462219794, 0.14141440639117112, 0.19086722037431578, 0.07778486348171201, 0.1352714304941603, 0.24153346777929982, 0.08869124026872062, 0.039339987812046374]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.DICE: 2> =S===
Average: 9.764055218641406 
Per Topic: [1.4785268584887186, 8.029365785916646, 0.9702214002609253, 1.8041651871469286, 6.375461806191338, -0.7050536228550806, 1.4947402609719171, 2.4129498998324075, 3.9956230812602573, 5.715696952078077, -3.1508254329363505, -3.0207321961720783, -0.4067401882674959, 1.2127181026670668, 1.1320184853341844, 211.9972156604131, 1.2562444779607984, 1.7112665004200405, -0.5146351165241665, 0.9359890752368503, -4.61125920481152, 5.525151567988925, 39.07899330457052, 1.171814935737186, 0.9255641990237766, 2.0271761390897964, 1.5980050881703696, 1.2008336848682828, 1.1204772697554695, 2.160682597425249]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.JACCARD: 3> =S===
Average: -20.825088196727965 
Per Topic: [-2.7075881520907084, -2.272716283135944, 3.2274680429034763, 8.821884269184537, -4.040207725101047, -2.5036019414663313, -0.882104041841295, 18.635549034012687, -1.0613703462812636, -3.0882180836465625, -1.289473260111279, -2.3885018050670626, -2.2382670289940303, -2.2146502597464455, -0.25286320050557454, -6.070304331514571, -3.884779371155633, -2.4781847609413994, -2.379622534248564, -604.0762025992076, -1.6511220229996575, -0.15254030227661133, -5.448412187894186, -3.880208917458852, 4.612057495117187, -3.180569440788693, -3.9969701210657758, -0.6741435077455309, 6.395788433154424, -3.6327709509266746]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.NONE: 0> =S===
Average: 0.2978416590213352 
Per Topic: [0.3780543494180079, 0.4129695691956162, 0.18980654629698515, 0.7691373178155086, 0.4330623607524941, 0.30900114777448207, 0.355847175203072, 0.5755005634088165, -0.2125246165440904, 0.36983900020384447, 0.28614394288966066, 0.3204577164651876, 0.2715209622206863, 0.5262686189508876, -1.4235381631689017, 0.362400284509572, 0.37013832469472235, 0.28775052264550244, 0.35168678990791136, 0.8617703727076175, 0.2960053444307032, 0.31365885996615045, 0.8398320378929996, 0.4166274363483794, -0.9223808370094702, 0.3299575706066556, 0.48694471889678254, 0.6346445910731939, 0.42487595515039467, 0.3197913079366835]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.17994302767120876 
Per Topic: [0.12805647044959995, 0.11670115184194098, 0.32550658281478617, 0.3205361928480367, 0.1428330622962676, 0.17555796836192408, 0.16637061934840555, 0.2242692696241041, 0.06496669764682236, 0.2734027441797985, 0.23776455167163577, 0.1406386292203226, 0.10335524227573639, 0.2027769971690658, 0.1911078766376401, 0.19030399062256848, 0.15196374676500757, 0.13059370080526506, 0.12559629707700676, 0.3441504937079218, 0.1384310316397912, 0.1333530227472592, 0.21875141391323671, 0.17642224971204995, 0.20602956772264508, 0.14699623237570955, 0.16957532687520144, 0.2709827545409401, 0.10145181030060889, 0.07984513494496544]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.DICE: 2> =S===
Average: 2.7841405712333676 
Per Topic: [1.666050275001261, 0.6319933338297739, 0.9500268008973863, 1.0386896597014532, 2.250735392794013, 0.2756188680003915, 0.17857569058736164, 0.9083827204174466, -2.425627151255806, 0.8707619605792893, -0.2592729784548283, -0.19907247605216172, 0.802290514152911, 0.8496030672556825, 1.077572684817844, 1.2845817714929582, 3.02141981518103, 1.3063109305169847, -0.03761112979716725, 0.9118721087773641, -0.33338644554217656, 1.1887123200628493, 1.0431044512324863, 1.445079586075412, 0.9786576999558343, 61.388298433356816, 1.0510625998179117, 1.1705422348446317, 1.6549388404418197, -1.1656944416877297]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.JACCARD: 3> =S===
Average: 7.0628127242821375 
Per Topic: [-9.166540891263221, -2.342065739466084, -6.15212869114346, -43.87733449406094, 56.44401636720738, -2.6669036716222765, 1.8036339667108323, 0.8181499573919508, -3.2101934199945794, -3.841389800608158, -1.0703579567372798, -2.519921213326355, -4.007283565733168, -2.817425860464573, -3.0735806226730347, -1.983945485121674, -5.066192412128051, -8.380187131961186, -2.8874885131087566, -4.33076155450609, -1.4770257814062966, -2.8912580755021837, 289.9129321826829, -3.833444407582283, -8.179676280419033, -4.677306281195746, -2.820743430985345, 2.5057655387454565, -4.367011500419014, -3.959949502845605]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.NONE: 0> =S===
Average: 0.1766937747568545 
Per Topic: [0.14824682586212107, 0.18708438146379353, 0.20281956968249237, 0.2442528284296655, 0.1741918908510146, 0.14265440707643534, 0.14944404510495854, 0.25219999875725335, 0.10834079577106827, 0.2163301201716119, 0.19242175789761945, 0.1524210869864975, 0.15412284717444538, 0.247149721569528, 0.111412935305373, 0.17405880215656605, 0.15648258888424163, 0.1349883197180439, 0.18527317950462321, 0.24086584898178115, 0.15769056713778107, 0.1293967353327041, 0.25091810534198355, 0.18260028518696708, 0.12661959749425447, 0.1588389619937186, 0.1887080040766936, 0.2339521853385075, 0.16486529111142117, 0.13246155834246956]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.30799426429340176 
Per Topic: [0.2577278188119332, 0.36455040506811603, 0.27655373497141733, 0.37316074293080925, 0.30523637334505715, 0.31315051799433097, 0.25819794668091667, 0.3904047669635879, 0.226201621008416, 0.4634987852639622, 0.43473703985412915, 0.3618507874508699, 0.286240578111675, 0.5096372005012301, 0.16657824232760404, 0.31665071715186865, 0.29665989984447755, 0.26487721796664926, 0.34197703417804504, 0.3128788981690175, 0.3643739732810193, 0.22974988263514307, 0.2334528881435593, 0.34796334778269133, 0.1884039598180809, 0.3390489612188604, 0.2937325588117043, 0.29894404822132653, 0.18890943971152108, 0.2344785405840311]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.DICE: 2> =S===
Average: -6.696977050953278 
Per Topic: [-0.501081607511474, -0.058775322056478924, -5.950174129340383, -1.0146991021827692, -0.19650192408718997, -0.5972039008306133, 68.61222103497842, -0.5676436835382548, -0.43249382376670836, -0.2505337163909442, 0.004097158461809159, -0.45448160244462393, 2.5815661487686965, 0.10029054445525011, 3.3434767007827757, -12.810671669674209, -0.8501870264609654, -0.34551202019469607, 13.165061561514934, 0.3700122098128001, -0.013202836861213048, 0.8229714737584194, -0.018339197296235295, -0.1298573174048215, -0.5295765637523598, -270.0585608153294, -0.7114471904726491, 5.680398739212089, 0.04282868843939569, -0.14129233918696021]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.JACCARD: 3> =S===
Average: 2.1071468107418276 
Per Topic: [-0.2321656532999542, -0.009382950059241718, -1.7773478090763093, -0.3373492727159626, -0.12773477082244225, -0.14153379789657064, -0.58121748691968, -0.5889425377361477, -0.20039150988062224, -0.03989723877360423, 0.0805715151131153, -0.027872106507937942, -0.122680844925344, 0.12404205043696695, -1.1835544238487878, -0.5044889359683212, -0.24931146192053955, -0.21346006609706414, -0.1409227941185236, 9.27308953884575, 0.0325390866647164, -0.3664802716837989, -0.6178307306435373, -0.08279434137253297, 63.20406588969959, -0.19907079886438117, -0.30779310458650194, -0.6767237674030993, -0.5674934085458517, -0.2034636748385512]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.NONE: 0> =S===
Average: -1.2919665389931534 
Per Topic: [-1.17386418557316, -0.8680202687611864, -1.9011274512865044, -1.047667736559903, -0.8366067935289703, -1.0824157815953828, -1.0848464209397464, -1.042256352533198, -2.040907684969454, -0.9648597265277666, -0.6370809895484615, -1.0602086759498055, -0.9300058656023976, -0.8755223027370869, -3.8756595693460896, -0.9561534903362163, -1.0182683509407169, -1.0452619935102012, -0.8683999688876635, -1.6542543949045063, -0.9976812325252717, -1.3517324017228247, -1.4340397599847348, -1.1336003900525302, -3.3037643601664413, -0.9452589069885053, -1.3000326329631926, -1.1188280272579738, -1.1003419572424118, -1.1103284968522937]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.4489587392133696 
Per Topic: [0.4172645108567344, 0.3680855777528551, 0.47551360925038655, 0.46617349055078294, 0.42326092951827576, 0.6414405763149261, 0.38452149331569674, 0.4236492501364814, 0.45179317759142984, 0.4734817637337579, 0.513688063621521, 0.4954831686284807, 0.5364161756303575, 0.4674039426777098, 0.3921919425328573, 0.45504247943560283, 0.40978858404689367, 0.34588676674498453, 0.4799913730886247, 0.5026300244861179, 0.48717684911357034, 0.3698685281806522, 0.45938059820069205, 0.48731302552753025, 0.452699175145891, 0.40655995971626707, 0.4119377543528875, 0.42373811536365086, 0.4009397059679031, 0.44544156491756437]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.DICE: 2> =S===
Average: 0.7170875298093867 
Per Topic: [0.7426078796386719, 0.7015290180842082, 0.7337837298711141, 0.7339953780174255, 0.7182757245169745, 0.7243074602550931, 0.7232087903552585, 0.7609699527422588, 0.7437359650929769, 0.704906948407491, 0.6405402686860826, 0.6788614100880093, 0.7135660039054023, 0.6997373965051439, 0.7442686306105719, 0.7021779139836629, 0.7368649337026808, 0.714368814892239, 0.682016827000512, 0.7392514573203193, 0.7062499708599514, 0.7357237325774298, 0.7286302831437853, 0.7034077525138855, 0.7649813916948106, 0.6894622564315795, 0.6908912089135911, 0.7162305646472507, 0.7261502756012811, 0.7119239542219374]

==== Coherence : <Segmentation.ONE_PRE: 2>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.JACCARD: 3> =S===
Average: 2.9189544194716004 
Per Topic: [3.515099125438266, 2.5720635890960692, 3.0092657830980087, 3.2807204246521, 2.874766953786214, 3.7289518859651354, 2.8617791864607067, 3.7924955818388195, 3.4049933248096043, 2.7408284081353083, 1.896372185813056, 2.4040002981821695, 3.1123066107432047, 2.618385820918613, 3.2927805900573732, 2.509185202916463, 3.3346821069717407, 2.619035180409749, 2.6232512129677668, 3.124668386247423, 2.6490831957923042, 3.215196074379815, 2.9361532661649914, 2.5999829795625473, 3.845041184955173, 2.3929948170979816, 2.3027957253985933, 2.6045406103134154, 2.966953155729506, 2.740259716245863]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.NONE: 0> =S===
Average: 0.041999999999236134 
Per Topic: [0.04666666666620373, 0.12999999999861117, 0.03666666666620372, 0.03999999999930559, 0.028888888887808717, 0.19888888888711426, -0.007777777778395056, 0.13777777777700623, 0.0011111111108024763, 0.1388888888878087, 0.028888888887808717, -0.06666666666689813, 0.014444444443209914, -0.06888888888919752, 0.018888888888503125, 0.15333333333240745, 0.028888888887808717, 0.05999999999930558, 0.10666666666550928, 0.02111111111080248, 0.0755555555547068, -0.013333333333796271, 0.05666666666620372, -0.08222222222229938, 0.0011111111108024763, 0.05555555555401248, 0.016666666666203728, 0.15333333333240745, -0.026666666667592542, -0.0244444444452932]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.15673434886460502 
Per Topic: [0.28987571597099304, 0.320284903049469, 0.15148071944713593, 0.20374327898025513, 0.08384250849485397, 0.34609341621398926, 0.01871933601796627, 0.4937671720981598, 0.026877814903855324, 0.40721428394317627, 0.08384250849485397, -0.19972720742225647, 0.2210555225610733, -0.22205013036727905, 0.008636128157377243, 0.40278589725494385, 0.3336375951766968, 0.23292849957942963, 0.35347050428390503, 0.26747798919677734, 0.3050496578216553, -0.13561280071735382, 0.14727157354354858, -0.10859449952840805, 0.014918585307896137, 0.25913405418395996, 0.05369899049401283, 0.5455716252326965, 0.013203505426645279, -0.21656668186187744]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.DICE: 2> =S===
Average: -0.3440898985369131 
Per Topic: [-0.44132310152053833, -0.1838964819908142, -0.7713922262191772, -0.15017744898796082, -0.13535699248313904, -0.0145052969455719, -0.5916098952293396, -0.9177692532539368, -0.38130316138267517, -0.014815865084528923, -0.13535699248313904, -1.1045511960983276, -0.03331481292843819, -0.3744139075279236, -0.5134019255638123, -0.002610116498544812, -0.0871577113866806, -0.32402369379997253, -0.04791494831442833, -0.4453229010105133, -0.577428936958313, -0.6193571090698242, -0.283419132232666, -0.317392498254776, -0.6215577125549316, -0.00806919950991869, -0.57356196641922, -0.04842308908700943, -0.2178465873003006, -0.38542279601097107]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.JACCARD: 3> =S===
Average: -0.22468823567808915 
Per Topic: [-0.3061930537223816, -0.155331552028656, -0.4354722201824188, -0.1305689513683319, -0.11921977251768112, -0.014297901652753353, -0.3717052936553955, -0.47856080532073975, -0.2760459780693054, -0.014599558897316456, -0.11921977251768112, -0.5248393416404724, -0.032240718603134155, -0.2724171280860901, -0.3392370343208313, -0.0026033215690404177, -0.08017025142908096, -0.2447265088558197, -0.04572408273816109, -0.30811312794685364, -0.36605697870254517, -0.3824710249900818, -0.22083134949207306, -0.24092480540275574, -0.383309006690979, -0.008004610426723957, -0.36449912190437317, -0.04618659242987633, -0.178878515958786, -0.2781986892223358]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.NONE: 0> =S===
Average: 1.1829920572622323 
Per Topic: [1.3888888888567388, 1.351351351341206, 1.2820512820238879, 1.1904761904604435, 1.0802469135719117, 1.4520202020110353, 0.9661835748675581, 1.9841269840876163, 1.0101010100755026, 1.5555555555382716, 1.0802469135719117, 0.5555555555452675, 1.033591731259473, 0.617283950607758, 1.1574074073806158, 1.8518518518232738, 1.0802469135719117, 1.315789473664974, 1.3440860214933326, 1.234567901196464, 1.3285024154428924, 0.9259259259116369, 1.5151515151132535, 0.25252525251887564, 1.0101010100755026, 1.1111111111049383, 1.111111111090535, 1.8518518518232738, 0.9259259259187814, 0.9259259259181319]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.8453894654909769 
Per Topic: [0.8197667598724365, 0.9256532788276672, 0.8035576939582825, 0.8818578124046326, 0.8918190002441406, 0.9353445768356323, 0.8346521258354187, 0.873748779296875, 0.7586309909820557, 0.9246693849563599, 0.8918190002441406, 0.7219855189323425, 0.9121947288513184, 0.7525374889373779, 0.7535561919212341, 0.8993445038795471, 0.9197915196418762, 0.8803811073303223, 0.9235726594924927, 0.7482755780220032, 0.8938221335411072, 0.8005030751228333, 0.8128131628036499, 0.7002859115600586, 0.7354151606559753, 0.9233746528625488, 0.805790901184082, 0.9214084148406982, 0.8755013942718506, 0.8396104574203491]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.DICE: 2> =S===
Average: 0.38887887398401894 
Per Topic: [0.3785381019115448, 0.42045146226882935, 0.3751106262207031, 0.3976234495639801, 0.4064190685749054, 0.42867404222488403, 0.3846781253814697, 0.3993554711341858, 0.35036173462867737, 0.42043137550354004, 0.4064190685749054, 0.34244710206985474, 0.415459543466568, 0.34844744205474854, 0.36161744594573975, 0.40465158224105835, 0.41982099413871765, 0.39895099401474, 0.4213864803314209, 0.3619428873062134, 0.4089191257953644, 0.37124642729759216, 0.37123751640319824, 0.32440507411956787, 0.3589421510696411, 0.4214225709438324, 0.3721025586128235, 0.41501474380493164, 0.3991887867450714, 0.38110026717185974]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.6396074990431467 
Per Topic: [0.609109103679657, 0.7254809141159058, 0.6002833247184753, 0.6600911617279053, 0.6846902370452881, 0.7503141760826111, 0.6251657009124756, 0.6648780703544617, 0.5393182039260864, 0.7254211902618408, 0.6846902370452881, 0.520790159702301, 0.7107454538345337, 0.5347955226898193, 0.5664587616920471, 0.6796886920928955, 0.7236059308052063, 0.6637577414512634, 0.7282691597938538, 0.5672577619552612, 0.6918159127235413, 0.5904481410980225, 0.5904256105422974, 0.4801769256591797, 0.5599215030670166, 0.7283771634101868, 0.5926167368888855, 0.7094447016716003, 0.6644162535667419, 0.6157705187797546]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.NONE: 0> =S===
Average: 1.513354111031174 
Per Topic: [1.7777777777432102, 1.6842105263022469, 1.5238095237835223, 1.3333333333185187, 1.1313131313048466, 1.947089947075267, 0.9481481481391936, 4.444444444320989, 1.0158730158556815, 2.2626262625994085, 1.1313131313048466, 0.44444444443950626, 1.053497942380003, 0.5079365079314689, 1.269841269819602, 3.5555555554864204, 1.1313131313048466, 1.5999999999795558, 1.6666666666516203, 1.4222222221898273, 1.6296296296115227, 0.8888888888790125, 2.133333333284741, 0.17777777777550618, 1.0158730158556815, 1.1851851851779427, 1.1851851851687243, 3.5555555554864204, 0.8888888888827162, 0.8888888888823795]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.756882642582059 
Per Topic: [0.7650192379951477, 0.8613271713256836, 0.7436153888702393, 0.8194134831428528, 0.828204870223999, 0.8931920528411865, 0.7570669054985046, 0.84662264585495, 0.6689490675926208, 0.8754242658615112, 0.828204870223999, 0.639968752861023, 0.862912118434906, 0.6384710669517517, 0.05105343088507652, 0.7766585350036621, 0.8765435218811035, 0.8180068135261536, 0.879450798034668, 0.7161286473274231, 0.8418636918067932, 0.7145481109619141, 0.7389515042304993, 0.5211626291275024, 0.689755380153656, 0.8764177560806274, 0.7430278062820435, 0.870980441570282, 0.8075026869773865, 0.7560356259346008]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.DICE: 2> =S===
Average: 0.3428099542940226 
Per Topic: [0.34372544288635254, 0.38872024416923523, 0.34047603607177734, 0.3650163412094116, 0.37595123052597046, 0.4036736488342285, 0.34979259967803955, 0.3731122314929962, 0.3054954707622528, 0.3928699791431427, 0.37595123052597046, 0.30285680294036865, 0.386983186006546, 0.30006176233291626, 2.5203056308598093e-10, 0.3609868586063385, 0.3934319317340851, 0.3656192421913147, 0.3954794108867645, 0.32332706451416016, 0.3795117437839508, 0.3331964910030365, 0.3284977972507477, 0.24949350953102112, 0.32146450877189636, 0.39533209800720215, 0.33701446652412415, 0.38405901193618774, 0.3673328459262848, 0.34486544132232666]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.535612565287408 
Per Topic: [0.5237525105476379, 0.635912299156189, 0.5162451863288879, 0.57484370470047, 0.6024388074874878, 0.6769340634346008, 0.5379707217216492, 0.5951818227767944, 0.43987542390823364, 0.6470937132835388, 0.6024388074874878, 0.4344255328178406, 0.6312766671180725, 0.4286973774433136, 2.5203056308598093e-10, 0.5649130344390869, 0.6486195921897888, 0.57634037733078, 0.654203474521637, 0.4778187870979309, 0.6116340756416321, 0.4996921420097351, 0.48919835686683655, 0.33243343234062195, 0.47376230359077454, 0.6538003087043762, 0.5083285570144653, 0.6235321164131165, 0.5806098580360413, 0.5264039039611816]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.NONE: 0> =S===
Average: 0.24354935751654835 
Per Topic: [0.57536414488468, 0.521296923625839, 0.4212134650598962, 0.2876820724414198, 0.1233790211540655, 0.6663359231022626, -0.0532445145272019, 1.4916548767501645, 0.01574835695205985, 0.8165262017090233, 0.1233790211540655, -0.8109302162251897, 0.05211600113348184, -0.677398823599758, 0.23889190826607307, 1.268511325444344, 0.1233790211540655, 0.47000362923358285, 0.5108256237575629, 0.3522205935672775, 0.4883527679034347, -0.1177830356663694, 0.7576857016752074, -1.7272209480976366, 0.01574835695205985, 0.16989903679013044, 0.1698990367823524, 1.268511325444344, -0.11778303566220273, -0.11778303566258155]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.10306088653936361 
Per Topic: [0.04774315282702446, 0.00027956522535532713, 0.1759595423936844, 0.044969458132982254, 0.010081577114760876, -0.02322317101061344, 0.25080054998397827, 0.16333380341529846, 0.155797079205513, 0.02492894046008587, 0.010081577114760876, 0.14169538021087646, 0.031085219234228134, 0.2477717101573944, 0.16646547615528107, 0.28673261404037476, 0.030864188447594643, 0.012386607937514782, -0.016495274379849434, 0.16911812126636505, 0.25782379508018494, 0.22052617371082306, 0.20629671216011047, 0.17674943804740906, 0.002177574671804905, 0.022487830370664597, 0.025386424735188484, 0.2607489824295044, 0.019804658368229866, -0.030551111325621605]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.DICE: 2> =S===
Average: 1.6876316348711649 
Per Topic: [1.1808829307556152, 3.1761984825134277, 1.11403226852417, 1.6925550699234009, 7.197402000427246, -44.1919059753418, 1.2981277704238892, 1.0437235832214355, 1.3059754371643066, 3.1600587368011475, 7.197402000427246, 1.1293100118637085, -6.238952159881592, 1.9343199729919434, 1.3272883892059326, 2.058518886566162, 2.8631575107574463, 1.5730520486831665, -8.661764144897461, 1.0858306884765625, 1.2325001955032349, 1.4062464237213135, 1.4607523679733276, 1.4230997562408447, 1.191633939743042, 6.938048362731934, 1.2817714214324951, 1.389438509941101, 48.58103942871094, 4.479205131530762]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.JACCARD: 3> =S===
Average: -4.202050316333771 
Per Topic: [-6.528441429138184, -1.4595166444778442, -9.76945686340332, -2.4439284801483154, -1.1613579988479614, -0.9778721332550049, -4.3542680740356445, -23.871009826660156, -4.2682366371154785, -1.4629502296447754, -1.1613579988479614, -8.733351707458496, -0.8618583679199219, -2.0702972412109375, -4.055410385131836, -1.9447168111801147, -1.5367234945297241, -2.7450411319732666, -0.8964992761611938, -12.650808334350586, -5.301071643829346, -3.4615578651428223, -3.170363187789917, -3.3635103702545166, -6.2182841300964355, -1.168405532836914, -4.548975944519043, -3.5677988529205322, -1.0210167169570923, -1.2874221801757812]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.NONE: 0> =S===
Average: 0.10650250803827695 
Per Topic: [0.3285040669655546, 0.30110509278196973, 0.2484613592937987, 0.17435338714266152, 0.07718963869055703, 0.37295582954090467, -0.03440142671690969, 0.6851790109009271, 0.010050335853248927, 0.44183275227507113, 0.07718963869055703, -0.5877866648873042, 0.033039854077990205, -0.48242614923472477, 0.14618251017493344, 0.6161861394167182, 0.07718963869055703, 0.2744368456982516, 0.2957142441467513, 0.21072103130978853, 0.2840523043982159, -0.07696104113489391, 0.41551544395307993, -1.3762440251916417, 0.010050335853248927, 0.1053605156572708, 0.10536051565597443, 0.6161861394167182, -0.07696104113551105, -0.07696104113545482]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.16347259391720095 
Per Topic: [0.07990583777427673, 0.036092985421419144, 0.2575892210006714, 0.07601101696491241, 0.024034013971686363, 0.03873555362224579, 0.36558184027671814, 0.2416311651468277, 0.21751579642295837, 0.065586157143116, 0.024034013971686363, 0.18903659284114838, 0.1921483874320984, 0.34645339846611023, 0.24603603780269623, 0.4383622705936432, 0.05610727518796921, 0.03076649084687233, -0.01406999584287405, 0.23927003145217896, 0.3885394036769867, 0.318610280752182, 0.3091985285282135, 0.22767148911952972, 0.004244009032845497, 0.05279702693223953, 0.04347703978419304, 0.4040156900882721, 0.07553057372570038, -0.07073431462049484]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.DICE: 2> =S===
Average: 1.2288841644922892 
Per Topic: [1.1499894857406616, 8.694544792175293, 1.0880317687988281, 1.6410962343215942, -7.84513521194458, -1.5090053081512451, 1.2635118961334229, 1.0226049423217773, 1.2583255767822266, 3.92311429977417, -7.84513521194458, 1.1067636013031006, -1.1698986291885376, 1.8490943908691406, 1.211580514907837, 1.8340578079223633, 4.200595378875732, 1.5275579690933228, -1.2552475929260254, 1.0609850883483887, 1.174046516418457, 1.382623314857483, 1.387900710105896, 1.328208327293396, 1.1547659635543823, -7.241245746612549, 1.2499561309814453, 1.2993130683898926, -2.234434127807617, 24.157958984375]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.JACCARD: 3> =S===
Average: -5.430280049641927 
Per Topic: [-7.667129993438721, -1.1299620866775513, -12.359515190124512, -2.559828281402588, -0.8869431614875793, -0.6014356017112732, -4.794895172119141, -45.238197326660156, -4.871084690093994, -1.3421005010604858, -0.8869431614875793, -10.366501808166504, -0.5391488671302795, -2.177724838256836, -5.726335525512695, -2.1989564895629883, -1.3124418258666992, -2.8955254554748535, -0.556589663028717, -17.397472381591797, -6.74558687210083, -3.6135365962982178, -3.5779788494110107, -4.046844959259033, -7.461368560791016, -0.8786590695381165, -5.00070333480835, -4.340982913970947, -0.6908267736434937, -1.0431815385818481]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.NONE: 0> =S===
Average: 0.06624470394148856 
Per Topic: [0.11676363675255017, 0.17559212082902823, 0.08831321989894252, 0.07240752719538407, 0.03926002438804054, 0.2537673862402816, -0.013620396947472472, 0.29756946355149305, 0.0031223123848645212, 0.2247240034286678, 0.03926002438804054, -0.16762497072759863, 0.01802913252790635, -0.14987410993346564, 0.048796920694947676, 0.2906177127978444, 0.03926002438804054, 0.11397136410118747, 0.15587535132395897, 0.06546416910010913, 0.12868913731679366, -0.02735506788144294, 0.14769100063500679, -0.2988475929530144, 0.0031223123848645212, 0.06546416910128068, 0.03744939017575551, 0.2906177127978444, -0.03629786572368363, -0.034866993991498695]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.1795986601151526 
Per Topic: [0.2393397092819214, 0.2939641773700714, 0.1676352471113205, 0.23427294194698334, 0.1200600191950798, 0.45229148864746094, 0.08748222142457962, 0.39698004722595215, 0.004797083325684071, 0.4234808087348938, 0.1200600191950798, -0.13449855148792267, 0.2900974750518799, -0.16396880149841309, 0.033560678362846375, 0.5256941318511963, 0.32555586099624634, 0.1784573495388031, 0.3900874853134155, 0.31562480330467224, 0.3369654417037964, -0.0987323746085167, 0.1808038353919983, -0.1088065505027771, 0.01916022039949894, 0.2978954613208771, 0.07305517047643661, 0.5543202757835388, 0.03412267193198204, -0.20179854333400726]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.DICE: 2> =S===
Average: -35.873382854368536 
Per Topic: [-2.413581609725952, -0.19152449071407318, -7.951733589172363, -0.2445598989725113, -0.12968221306800842, 0.028639836236834526, -1.0611329078674316, 3.067133665084839, -0.9410110116004944, -0.00830205250531435, -0.12968221306800842, -1076.655029296875, -0.011695269495248795, -0.49732834100723267, -1.262964129447937, 0.006568645592778921, -0.09923006594181061, -0.47071945667266846, -0.006059555336833, 22.188892364501953, -0.884681761264801, -1.017217755317688, -0.44068923592567444, -0.7247352600097656, -4.022817134857178, -0.005134203936904669, -1.5642764568328857, -0.15523123741149902, -0.20906424522399902, -0.39463675022125244]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.JACCARD: 3> =S===
Average: -0.39027324233514565 
Per Topic: [-0.7070525884628296, -0.1607390195131302, -0.8882899284362793, -0.19650308787822723, -0.11479531228542328, 0.029484260827302933, -0.5148299932479858, -1.4837617874145508, -0.48480454087257385, -0.008233696222305298, -0.11479531228542328, -0.999072253704071, -0.011560071259737015, -0.33214378356933594, -0.5581017136573792, 0.006612077821046114, -0.09027235209941864, -0.32006070017814636, -0.006023058667778969, -1.0471946001052856, -0.46940648555755615, -0.5042677521705627, -0.30588778853416443, -0.4202008843421936, -0.8009085059165955, -0.005107979290187359, -0.6100264191627502, -0.13437245786190033, -0.17291411757469177, -0.2829674184322357]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.NONE: 0> =S===
Average: -1.4736030129460684 
Per Topic: [-1.7917594692248326, -0.693147180560723, -1.7917594692248326, -1.3862943611186682, -0.9444616088410577, -0.4480247225281728, -1.5040773967745518, -1.280933845461242, -2.197224577329997, -0.9444616088410577, -0.9444616088410577, -2.484906649778778, -0.8109302162168566, -2.197224577329997, -1.9740810260175872, -1.0986122886678875, -0.9444616088410577, -1.3862943611186682, -0.8754687373542778, -2.197224577329997, -1.1856236656572445, -1.7917594692248326, -1.7917594692248326, -3.5835189384228876, -2.197224577329997, -0.5877866649030966, -1.7917594692248326, -1.0986122886678875, -1.0986122886678875, -1.1856236656572445]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.4051099667946498 
Per Topic: [0.3128114342689514, 0.29022714495658875, 0.456012099981308, 0.27219223976135254, 0.27618899941444397, 0.5187976956367493, 0.5151218771934509, 0.45409396290779114, 0.4310186803340912, 0.2844853401184082, 0.27618899941444397, 0.406873881816864, 0.5763556361198425, 0.5211324095726013, 0.44940024614334106, 0.5777320265769958, 0.25635379552841187, 0.26241374015808105, 0.5060144066810608, 0.4632687270641327, 0.5369794368743896, 0.5030347108840942, 0.4982015788555145, 0.4322194457054138, 0.26809704303741455, 0.2195909172296524, 0.2773457169532776, 0.5402519106864929, 0.5302295088768005, 0.24066539108753204]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.DICE: 2> =S===
Average: 0.7169607857863108 
Per Topic: [0.8160318732261658, 0.6320065855979919, 0.7978934049606323, 0.7204073071479797, 0.6506289839744568, 0.6404455900192261, 0.7013998627662659, 0.8310660719871521, 0.7743260264396667, 0.6533805131912231, 0.6506289839744568, 0.800041675567627, 0.6519516706466675, 0.7004050612449646, 0.7702807784080505, 0.6710249781608582, 0.6511701941490173, 0.714690089225769, 0.6047012209892273, 0.8539513349533081, 0.680045485496521, 0.7045339941978455, 0.741414487361908, 0.7823833227157593, 0.8255860805511475, 0.7111220359802246, 0.7788773775100708, 0.6950854659080505, 0.6371696591377258, 0.6661734580993652]

==== Coherence : <Segmentation.ONE_SUC: 3>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.JACCARD: 3> =S===
Average: 2.79475519657135 
Per Topic: [4.435728073120117, 1.7174402475357056, 3.947885513305664, 2.5766310691833496, 1.8622870445251465, 1.7812200784683228, 2.3489603996276855, 4.919475555419922, 3.431169271469116, 1.885008454322815, 1.8622870445251465, 4.001043796539307, 1.8731648921966553, 2.3378403186798096, 3.353139877319336, 2.0397439002990723, 1.866727590560913, 2.5049610137939453, 1.5297322273254395, 5.847029685974121, 2.125443935394287, 2.3844850063323975, 2.8671932220458984, 3.595236301422119, 4.7334885597229, 2.461669683456421, 3.522379159927368, 2.279606819152832, 1.7561087608337402, 1.9955683946609497]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.NONE: 0> =S===
Average: 0.05055555552748148 
Per Topic: [-0.253, 0.6079999999150001, -0.13799999999999998, 0.7759999999074999, -0.25999999999999995, -0.33999999999999997, -0.24899999999999997, -0.26, -0.26599999999999996, -0.343, 0.4843333332497222, -0.29000000000000004, -0.367, 0.50299999993, -0.14900000000000002, 0.7029999999000001, 0.6749999999074999, -0.296, -0.332, -0.10500000000000002, 0.6379999999500001, 0.7369999999000001, -0.13699999999999998, 0.613999999915, -0.145, 0.5583333332497221, -0.21400000000000002, -0.177, -0.21000000000000002, -0.24900000000000003]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.5415724128422639 
Per Topic: [0.5373274594545364, 0.6175571918487549, 0.4770723730325699, 0.6316824167966842, 0.5644107758998871, 0.5595276564359665, 0.5154653616249562, 0.6185740262269974, 0.5269137233495712, 0.6371875941753388, 0.5716313332319259, 0.5969623416662216, 0.5010495139285922, 0.73782097697258, 0.3656056709587574, 0.5169796407222748, 0.5813835948705673, 0.4918918490409851, 0.5391562134027481, 0.5122612968087197, 0.5736709266901017, 0.4945665955543518, 0.5182306855916977, 0.5902392476797104, 0.42227031141519544, 0.54073486328125, 0.5509150475263596, 0.510292598605156, 0.43703860640525816, 0.508752492070198]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.DICE: 2> =S===
Average: -0.05027374318626243 
Per Topic: [0.03927813754417002, 0.04180443454533815, -0.2468165174126625, 0.027459732769057155, 0.025011882465332747, 0.0523207476362586, -0.15124760381877422, -0.7090898901224136, 0.04631739230826497, -0.04348974861204624, 0.033568563591688874, 0.05140705183148384, 0.07163909580558539, 0.03393064513802528, -0.17329260855913162, -0.15716274566948413, 0.031568868551403284, -0.023677445948123932, 0.03465194166637957, -0.35570975244045255, 0.04182005319744349, 0.024973296327516437, 0.0360838046297431, 0.023332816548645496, -0.1388972859829664, -0.06800140934064984, -0.02806585220969282, -0.09312676191329956, 0.019143888726830482, 0.04605297315865755]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.JACCARD: 3> =S===
Average: -0.029941071485324454 
Per Topic: [0.04308585688704625, 0.0446071389131248, -0.1967119961977005, 0.02873232145793736, 0.02626017779111862, 0.056498335860669616, -0.12840020135045052, -0.41183939278125764, 0.04950031088665128, -0.040606487262994054, 0.03527016192674637, 0.05498626958578825, 0.08003168953582644, 0.03558255136013031, -0.14648936986923217, -0.13402051515877247, 0.03433660753071308, -0.02260943448636681, 0.037681630812585354, -0.2607411935925484, 0.04467113250866532, 0.027608619537204504, 0.037783720856532456, 0.026072914712131023, -0.11964958906173706, -0.06285469895228743, -0.02633865411626175, -0.08349631540477276, 0.022510839439928532, 0.05030542407184839]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.NONE: 0> =S===
Average: 1.3552114872225542 
Per Topic: [0.0, 3.754637752793726, 0.0, 8.385281375197048, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.2441218301929142, 0.0, 0.0, 2.9333105725176685, 0.0, 3.8905305203853535, 4.541123934836883, 0.0, 0.0, 0.0, 3.17222144025044, 4.939386063134093, 0.0, 3.8182960730456528, 0.0, 2.9774350543228523, 0.0, 0.0, 0.0, 0.0]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.9146278921763101 
Per Topic: [0.9166006147861481, 0.9460621774196625, 0.8346863150596618, 0.8896758079528808, 0.9241811692714691, 0.937481838464737, 0.9202969610691071, 0.9123888492584229, 0.9169389188289643, 0.9522718071937561, 0.9645551145076752, 0.9446840107440948, 0.9442847371101379, 0.9547281682491302, 0.8270976185798645, 0.9409907221794128, 0.9267397880554199, 0.9392213821411133, 0.9492060959339141, 0.8096889436244965, 0.9583303928375244, 0.9172113299369812, 0.8633925139904022, 0.9372662842273712, 0.8148745715618133, 0.9502478659152984, 0.9243027687072753, 0.8960786700248718, 0.9017829835414887, 0.9235683441162109]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.DICE: 2> =S===
Average: 0.09997679015000661 
Per Topic: [0.09999999850988388, 0.10000000223517418, 0.09983752146363259, 0.09997299909591675, 0.09999999776482582, 0.09999999850988388, 0.1, 0.09998270869255066, 0.10000000074505806, 0.10000000298023223, 0.09999999701976776, 0.09999999403953552, 0.10000000670552253, 0.1, 0.09982802346348763, 0.10000000149011612, 0.10000000521540642, 0.10000000223517418, 0.10000000223517418, 0.09972630068659782, 0.1000000037252903, 0.09999999627470971, 0.09997882321476936, 0.10000000223517418, 0.09999999850988388, 0.10000000149011612, 0.10000000447034836, 0.09997731372714043, 0.09999999776482582, 0.1]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.1111955966303746 
Per Topic: [0.11120680719614029, 0.11128147840499877, 0.11113375052809715, 0.11124621704220772, 0.1112453892827034, 0.11116116121411324, 0.11123602688312531, 0.1111360877752304, 0.11117456331849099, 0.11113841906189918, 0.11113927066326142, 0.11117525771260262, 0.11122014597058297, 0.11120561137795448, 0.11129894480109215, 0.1112654909491539, 0.11122472137212754, 0.11114718317985535, 0.11117018386721611, 0.11086780652403831, 0.1111651249229908, 0.11118221059441566, 0.11119996532797813, 0.11132487207651139, 0.11122125610709191, 0.11115459278225899, 0.111176498234272, 0.11129455417394638, 0.11122925654053688, 0.1112450510263443]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.NONE: 0> =S===
Average: 1.4393458787279667 
Per Topic: [0.0, 3.91612019971099, 0.0, 9.500013296804193, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.2771944399682726, 0.0, 0.0, 3.023488366399701, 0.0, 4.031166594645543, 4.80990493381229, 0.0, 0.0, 0.0, 3.3473098207303265, 5.222771860531334, 0.0, 3.992185453036204, 0.0, 3.0602213962001406, 0.0, 0.0, 0.0, 0.0]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.703916822627378 
Per Topic: [0.872867226600647, 0.9013971149921417, 0.0466287637129426, 0.11647273153066635, 0.871793395280838, 0.914996188879013, 0.888783973455429, 0.8472237527370453, 0.8838164329528808, 0.92010298371315, 0.9369237124919891, 0.9197418034076691, 0.05875986196674117, 0.9177050948143005, 0.07719137948006391, 0.8989042401313782, 0.8865294039249421, 0.9135325968265533, 0.9234955966472626, 0.10785636231303215, 0.9338362574577331, 0.8839708745479584, 0.7614807307720184, 0.8858444094657898, 0.0600609116256237, 0.9261069238185883, 0.8955683052539826, 0.10792532227933407, 0.8649173259735108, 0.8930710017681122]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.DICE: 2> =S===
Average: 0.07714912438359542 
Per Topic: [0.10000000596046447, 0.09977788031101227, 3.2780742120763763e-09, 0.01643266061982747, 0.09973965883255005, 0.1, 0.09999999627470971, 0.09973392114043236, 0.09999999776482582, 0.09999999701976776, 0.09999999850988388, 0.09999999776482582, 2.743425109308362e-09, 0.09995957762002945, 1.5768337677246613e-10, 0.09989505112171174, 0.09971871227025986, 0.09999999776482582, 0.09999999925494193, 7.87408849234339e-10, 0.09999999701976776, 0.10000000074505806, 0.09936678409576416, 0.09984949454665185, 2.908799512368887e-09, 0.09999999478459358, 0.09999999701976776, 1.9358580072292853e-09, 0.1, 0.09999999925494193]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.08589823905593837 
Per Topic: [0.11150938868522645, 0.11153265833854675, 3.2780742120763763e-09, 0.01723609397849675, 0.11136268749833107, 0.11126035824418068, 0.11148354113101959, 0.11087922155857086, 0.11130479648709297, 0.11118842512369156, 0.11112882122397423, 0.11124122738838196, 2.743425109308362e-09, 0.11147894263267517, 1.5768337538468736e-10, 0.11138427332043647, 0.11109846010804177, 0.11119341179728508, 0.11115589812397957, 7.87408849234339e-10, 0.11124403774738312, 0.11130044609308243, 0.11093106865882874, 0.1117858462035656, 2.908799512368887e-09, 0.11121246218681335, 0.11125476285815239, 1.9358580072292853e-09, 0.11129547506570817, 0.11148485541343689]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.NONE: 0> =S===
Average: -17.998075495427734 
Per Topic: [-27.631021115928547, 1.2140641365107059, -27.631021115928547, 1.99268501437722, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, 0.7556502297443957, -27.631021115928547, -27.631021115928547, 1.0629795470195091, -27.631021115928547, 1.316640100991759, 1.4100873610433413, -27.631021115928547, -27.631021115928547, -27.631021115928547, 1.1340019846351324, 1.5058705645390194, -27.631021115928547, 1.2574427025423358, -27.631021115928547, 1.0287358143354586, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.2581733147753403 
Per Topic: [0.24508911790326238, 0.17374768014997244, 0.5105727970600128, 0.48035172671079635, 0.2207817789982073, 0.058474055491387844, 0.2861169584095478, 0.377871535718441, 0.1149206195026636, 0.27650429494678974, 0.1526490034069866, 0.12525155174080282, -0.02420790456235409, 0.2585714322398417, 0.34213744625449183, 0.3058826974593103, 0.23692641034722328, 0.2380435297265649, 0.11151184756308793, 0.530694967508316, 0.09301305944100022, 0.23980206837877632, 0.3962100613862276, 0.33828441947698595, 0.38168871533125637, 0.19976201755926012, 0.3081607738509774, 0.43378524780273436, 0.21452752351760865, 0.11807400994002819]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.DICE: 2> =S===
Average: 1.4955023588736858 
Per Topic: [1.2561096668243408, 1.8136528134346008, 0.9747179567813873, 1.2463554620742798, 1.4402464628219604, 1.3493381023406983, 1.1479509949684144, 1.0095133185386658, 1.325226652622223, 1.5603938579559327, 4.509990119934082, 2.0536433339118956, 1.7466441154479981, 2.458484387397766, 0.9938374519348144, 1.266217577457428, 1.2540597200393677, 1.3152145147323608, 1.4414571046829223, 0.9607468128204346, 2.6165847301483156, 1.1989105820655823, 1.0787838399410248, 1.52461838722229, 0.9657190799713135, 1.5096396565437318, 1.245335590839386, 1.0928990066051483, 1.0921587586402892, 1.416620707511902]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.JACCARD: 3> =S===
Average: -7.092051823536552 
Per Topic: [-6.871484327316284, -2.254611086845398, -49.44212474822998, -7.2994259119033815, -3.3631428241729737, -4.071650528907776, -11.374973249435424, -13.510870265960694, -4.225879836082458, -2.798841881752014, -1.2853970527648926, -1.9529073238372803, -2.342919707298279, -1.6996152639389037, -45.50078015327453, -5.234318804740906, -5.3679927587509155, -4.312158226966858, -3.308485984802246, 1.496824312210083, -1.6213171839714051, -6.712603521347046, -0.5913944244384766, -3.05693256855011, 7.437892961502075, -3.0004308223724365, -5.7403112888336185, -6.814890384674072, -14.517410898208619, -3.423400950431824]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.NONE: 0> =S===
Average: -0.7321758026569418 
Per Topic: [-2.1193269466817144, 1.1836213392910842, 0.0, 1.9093945089752844, -2.05001222868822, 0.0, 0.0, -6.282835324932625, -2.09055873945737, -2.159873457471698, 0.7431858331334158, -2.1821878125947856, -6.468966903707212, 1.035352912605204, 0.0, 1.2868936721207613, 1.3698277353097963, 0.0, -2.272474583729167, 0.0, 1.0883911517599014, 1.4648907739423103, -2.151172319776523, 1.2238002725221477, 0.0, 1.0063391125479737, -2.1716517610327064, -2.163955656922057, -2.163955656922057, 0.0]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.3125091246325367 
Per Topic: [0.2762430664151907, 0.25201512277126314, 0.5245177417993545, 0.5173569798469544, 0.28970395103096963, 0.09718930795788765, 0.34388906210660936, 0.4112602941691875, 0.12733350574271754, 0.4111373096704483, 0.3453836627304554, 0.22875212784856558, 0.017550497874617578, 0.36196450125426055, 0.36375135481357573, 0.37084027417004106, 0.287821763753891, 0.2903820521198213, 0.1693682083627209, 0.5441355347633362, 0.21100983675569296, 0.27975747734308243, 0.4058368131518364, 0.3833363808691502, 0.39377264669165013, 0.29187180995941164, 0.3566843113861978, 0.4650069460272789, 0.2167186163365841, 0.1406825812533498]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.DICE: 2> =S===
Average: 3.8570949314037963 
Per Topic: [1.2249209702014923, 1.882071053981781, 0.9637438952922821, 1.1076842308044434, 1.405919623374939, 1.3376509308815003, 1.1359732747077942, 0.9947576940059661, 1.3089823722839355, 1.5321281790733337, 74.32208251953125, 2.260452628135681, 1.8589930772781371, 2.706422758102417, 0.9735818028450012, 1.2448540449142456, 1.2417418599128722, 1.2851081848144532, 1.444409430027008, 0.944037276506424, 3.541761589050293, 1.1833403348922729, 1.0524930655956268, 1.4987923622131347, 0.9549902737140655, 1.5251638889312744, 1.2416791915893555, 1.0531478464603423, 1.0766732692718506, 1.4092903137207031]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.JACCARD: 3> =S===
Average: 20.507366928259525 
Per Topic: [7.635204005241394, -2.1603544473648073, 742.0231639385223, -2.4880732536315917, -3.6107279777526857, -4.309712243080139, -49.40712885856628, -30.79481234550476, -4.448127388954163, -2.899846577644348, -1.013667368888855, -1.797537386417389, -2.1667372226715087, -1.601905071735382, 27.565062618255617, -6.282110285758972, -5.812558126449585, -4.776027750968933, -3.317580962181091, -22.963042879104613, -1.3958378553390502, -7.697141504287719, -9.76264271736145, -3.2256082773208616, 44.59204630851745, -2.959061336517334, -6.136319494247436, -0.9429188728332519, -21.148436403274538, -3.476552414894104]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.NONE: 0> =S===
Average: 0.04916006557067793 
Per Topic: [-0.07670099985772799, 0.2570201081641083, 0.0, 0.41461974952117087, -0.0741924165627901, 0.0, 0.0, -0.22738339269375524, -0.07565984372007947, -0.07816842701577136, 0.16138075318276104, -0.07897601045720357, -0.23411971915790059, 0.22482402838833196, 0.0, 0.27944541030519776, 0.2974543133079955, 0.0, -0.08224359766491388, 0.0, 0.27821696095948567, 0.318096989863972, -0.07785352234183011, 0.26574485265980285, 0.0, 0.21852376175624544, -0.07859469803599864, -0.0783161667403813, -0.0783161667403813, 0.0]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.4864306248476107 
Per Topic: [0.43763422667980195, 0.5497331604361534, 0.47610455825924874, 0.5632155805826187, 0.4909617304801941, 0.46946394741535186, 0.45332501977682116, 0.5710992306470871, 0.38939291536808013, 0.6348734751343728, 0.6118693023920059, 0.541075611114502, 0.45656153485178946, 0.6805806457996368, 0.3075068242847919, 0.5102350264787674, 0.4688612878322601, 0.4481358230113983, 0.5128358319401741, 0.5088331699371338, 0.5522102773189544, 0.40120257437229156, 0.41351083368062974, 0.5533533722162247, 0.35195610448718073, 0.5260767102241516, 0.48553305864334106, 0.48568335771560667, 0.3347507417201996, 0.40634281262755395]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.DICE: 2> =S===
Average: 0.13213661321516457 
Per Topic: [-0.2333679087460041, -0.07421420821920037, 2.4051308393478394, -0.6659291565418244, -0.1743314877152443, -0.128420652449131, -1.008475089073181, 4.273856472969055, -0.18330565243959426, -0.17203156054019927, 0.0013614120427519083, -0.04500359818339348, -0.015463637164793908, -0.06803527064621448, 2.363974618911743, -0.5117599874734878, -0.26517586410045624, -0.24317131489515303, -0.11499179378151894, 1.4837201237678528, -0.02400804781354964, -0.3645357459783554, -1.1503982663154602, -0.1648890905082226, 1.8324528217315674, -0.242745341360569, -0.40612800121307374, -1.3859741806983947, -0.6302479207515717, -0.12379411570727825]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.JACCARD: 3> =S===
Average: -0.5107786150136963 
Per Topic: [-0.18751066848635672, -0.06765787499025464, -1.7275664448738097, -0.3980284184217453, -0.14733463600277902, -0.1131181314587593, -0.49960836470127107, -1.3078935027122498, -0.15389785915613174, -0.146271950006485, 0.0017750745406374334, -0.0421593233011663, -0.012756952526979148, -0.06273228777572512, -1.7361850976943969, -0.3371551722288132, -0.20854200422763824, -0.1951710104942322, -0.10176142491400242, -3.1382298469543457, -0.022615865571424365, -0.26592340022325517, -0.5338767528533935, -0.13852907791733743, -2.2275139331817626, -0.1945817530155182, -0.28789310455322265, -0.580046021938324, -0.3826148182153702, -0.10795782655477523]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.NONE: 0> =S===
Average: -18.455730713128055 
Per Topic: [-27.631021115928547, -0.1386294362007891, -27.631021115928547, -0.06931471814989455, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -0.17917594700983888, -27.631021115928547, -27.631021115928547, -0.2772588723025781, -27.631021115928547, -9.900003040065772e-11, -0.06931471814989455, -27.631021115928547, -27.631021115928547, -27.631021115928547, -4.8999915237636707e-11, -9.900003040065772e-11, -27.631021115928547, -0.13862943620078907, -27.631021115928547, -0.17917594700983888, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.6004041439294816 
Per Topic: [0.578860592842102, 0.5383680820465088, 0.6549134254455566, 0.6475066512823104, 0.5820991456508636, 0.6632340461015701, 0.5645034700632096, 0.6035295695066452, 0.5770901381969452, 0.6192097634077072, 0.630240973830223, 0.6226463794708252, 0.6046907782554627, 0.6189156651496888, 0.5785669237375259, 0.6159706115722656, 0.5710591435432434, 0.5234188318252564, 0.591490375995636, 0.6765992105007171, 0.6013480097055435, 0.5419433057308197, 0.6310879856348037, 0.6410084575414657, 0.6261020600795746, 0.5683780044317246, 0.591088530421257, 0.6096499741077424, 0.5589282512664795, 0.5796759605407715]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.DICE: 2> =S===
Average: 0.9081074988842013 
Per Topic: [0.9051498711109162, 0.9094338595867157, 0.9018328309059143, 0.9027936637401581, 0.9086619794368744, 0.9168468117713928, 0.9072303771972656, 0.9084529221057892, 0.9122909307479858, 0.9121033608913421, 0.9074244916439056, 0.9111976206302643, 0.9152511358261108, 0.905062073469162, 0.9079863071441651, 0.9058874368667602, 0.9098460018634796, 0.9077514350414276, 0.9143188178539277, 0.9023345887660981, 0.9079844713211059, 0.9084756791591644, 0.903959184885025, 0.9047776937484742, 0.9054386794567109, 0.9087440133094787, 0.9028993308544159, 0.9061625897884369, 0.9105643033981323, 0.9123625040054322]

==== Coherence : <Segmentation.ONE_ALL: 4>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.JACCARD: 3> =S===
Average: 13.783973473707835 
Per Topic: [16.25237317085266, 12.821036100387573, 12.807693099975586, 14.896856880187988, 13.771287155151366, 18.443945384025575, 13.258090591430664, 17.12075710296631, 16.368441152572633, 13.711332082748413, 10.349943590164184, 12.753678321838379, 15.601322603225707, 13.092324829101562, 13.680219697952271, 12.417581510543823, 15.442327070236207, 12.039814376831055, 13.387025356292725, 13.295765495300293, 13.442171192169189, 14.691886854171752, 12.82756757736206, 12.966483163833619, 16.276408004760743, 12.039440584182739, 11.163235473632813, 11.66354718208313, 13.309474611282349, 13.62717399597168]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.NONE: 0> =S===
Average: 0.07499999996833334 
Per Topic: [-0.253, 0.7079999999, -0.13799999999999998, 0.8259999998999998, -0.25999999999999995, -0.33999999999999997, -0.24899999999999997, -0.26, -0.26599999999999996, -0.343, 0.6009999999, -0.29000000000000004, -0.367, 0.7029999999000001, -0.14900000000000002, 0.7029999999000001, 0.7249999999, -0.296, -0.332, -0.10500000000000002, 0.6379999999500001, 0.7369999999000001, -0.13699999999999998, 0.7139999999, -0.145, 0.6749999999, -0.21400000000000002, -0.177, -0.21000000000000002, -0.24900000000000003]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.6410729912131234 
Per Topic: [0.6333910822868347, 0.6973669797182083, 0.5970727115869522, 0.7139965176582337, 0.6602881044149399, 0.6524575412273407, 0.6169759377837181, 0.6953143775463104, 0.6318665146827698, 0.7100220680236816, 0.6676555216312409, 0.6833051443099976, 0.6037357442080975, 0.7908996045589447, 0.5168040946125985, 0.6219531387090683, 0.6696984320878983, 0.6087498277425766, 0.6363498419523239, 0.6194979637861252, 0.6657921642065048, 0.6044882446527481, 0.6306056916713715, 0.6719021423435378, 0.5522374004125595, 0.6434971481561661, 0.6501025140285492, 0.6179495543241501, 0.555858987569809, 0.6123547405004501]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.DICE: 2> =S===
Average: -0.04950055712210693 
Per Topic: [0.03739694920368493, 0.03908383771777153, -0.23679817765951156, 0.026073369197547436, 0.025373382633551954, 0.050565505400300026, -0.1477519493550062, -0.7227683544158936, 0.047584916590858485, -0.04714265819638967, 0.03233479708433151, 0.04880514014512301, 0.0674881803803146, 0.03012196607887745, -0.1613907128572464, -0.15321893766522407, 0.029766265489161014, -0.02233187196252402, 0.031501518096774817, -0.34623068273067475, 0.038427666574716565, 0.029923668410629035, 0.04161144886165857, 0.02365320026874542, -0.13267237772233784, -0.06120393336750567, -0.029163892567157745, -0.09083370342850686, 0.023013164289295673, 0.04376556184142828]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.DIFFERENCE: 1>, <IndirectMeasure.JACCARD: 3> =S===
Average: -0.029246959519878154 
Per Topic: [0.040554215712472794, 0.041367638669908044, -0.18859166279435158, 0.027406617300584912, 0.026466488419100643, 0.05443309042602777, -0.12601786963641642, -0.4160135120153427, 0.0507568078530312, -0.0440474362578243, 0.03395622838288546, 0.05208738949149847, 0.07485087784007191, 0.031389358546584846, -0.13628447167575358, -0.13091368600726128, 0.03229295266792178, -0.02133885601942893, 0.03413750594481826, -0.25280910134315493, 0.04088243441656232, 0.0326758299022913, 0.043880258593708275, 0.025354486890137196, -0.11289855823852121, -0.056442486308515075, -0.027269362611696123, -0.08109336085617543, 0.026242190320044757, 0.04757720679044723]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.NONE: 0> =S===
Average: 1.4820239801682076 
Per Topic: [0.0, 4.143526641437244, 0.0, 8.839826829122668, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.5244248604016497, 0.0, 0.0, 4.073038898727946, 0.0, 3.8905305203853535, 4.957790600982717, 0.0, 0.0, 0.0, 3.17222144025044, 4.939386063134093, 0.0, 4.439508193554606, 0.0, 3.4804653570495008, 0.0, 0.0, 0.0, 0.0]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.9317025850216545 
Per Topic: [0.9332154810428619, 0.9568761706352233, 0.8678891599178314, 0.9118741154670715, 0.939766138792038, 0.9502161026000977, 0.9357312977313995, 0.9302503347396851, 0.9338732779026031, 0.9616746664047241, 0.9712950348854065, 0.9553639352321625, 0.9548657298088074, 0.96359983086586, 0.8633557140827179, 0.9525385618209838, 0.9414067029953003, 0.9512531757354736, 0.9588610708713532, 0.8481564521789551, 0.9664158940315246, 0.933635026216507, 0.8913208246231079, 0.9496804177761078, 0.8528951346874237, 0.9599951684474946, 0.9392126202583313, 0.916973489522934, 0.9204763114452362, 0.9384097099304199]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.DICE: 2> =S===
Average: 0.09084771491587165 
Per Topic: [0.090857744961977, 0.09081744477152824, 0.09078982919454574, 0.09081626906991005, 0.09083664193749427, 0.09088162183761597, 0.09084171429276466, 0.09088353589177131, 0.09087421372532845, 0.09089420288801194, 0.0908934824168682, 0.090873484313488, 0.0908483773469925, 0.09085730984807014, 0.09069855883717537, 0.09082573726773262, 0.09084807336330414, 0.09088974744081497, 0.09087632820010186, 0.09085912853479386, 0.09087948873639107, 0.09087031185626984, 0.09084642007946968, 0.0907941222190857, 0.0908469520509243, 0.09088497683405876, 0.090873122215271, 0.09079249650239944, 0.09084264189004898, 0.09083746895194053]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.RATIO: 2>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.10000000116725759 
Per Topic: [0.09999999925494193, 0.10000000521540642, 0.09999999329447747, 0.1000000037252903, 0.09999999850988388, 0.1000000074505806, 0.10000000819563866, 0.09999999925494193, 0.09999999329447747, 0.10000000521540642, 0.10000000521540642, 0.10000000670552253, 0.10000001266598701, 0.09999999478459358, 0.09999999403953552, 0.10000000223517418, 0.10000000298023223, 0.1000000037252903, 0.09999999925494193, 0.10000000447034836, 0.10000001266598701, 0.09999999552965164, 0.09999999105930328, 0.09999999478459358, 0.09999998658895493, 0.10000000447034836, 0.10000000447034836, 0.10000000447034836, 0.10000000074505806, 0.10000000074505806]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.NONE: 0> =S===
Average: 1.574614330098895 
Per Topic: [0.0, 4.322135237242685, 0.0, 10.000013296724692, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 2.56441368063523, 0.0, 0.0, 4.235554731218934, 0.0, 4.031166594645543, 5.264450388285885, 0.0, 0.0, 0.0, 3.3473098207303265, 5.222771860531334, 0.0, 4.664599246033868, 0.0, 3.58601504691836, 0.0, 0.0, 0.0, 0.0]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.7202866797702516 
Per Topic: [0.8995069801807404, 0.9238621056079864, 0.03974424544721842, 0.11647273153066635, 0.9006467580795288, 0.9329968214035034, 0.9107627511024475, 0.8805878221988678, 0.9084170997142792, 0.9366410970687866, 0.9493901491165161, 0.9355605125427247, 0.05246623436214254, 0.9351759076118469, 0.07719137948006391, 0.9200849056243896, 0.9109231293201446, 0.9309146285057068, 0.9388108253479004, 0.10785636231303215, 0.94701087474823, 0.9077460765838623, 0.8153495192527771, 0.9106474757194519, 0.0600609116256237, 0.9409192383289338, 0.9164608538150787, 0.09641246907413006, 0.8911628544330596, 0.9148176729679107]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.DICE: 2> =S===
Average: 0.08373983587039549 
Per Topic: [0.09070029854774475, 0.090534558147192, 0.05000000283627202, 0.068759411375146, 0.09059121534228325, 0.09082677140831948, 0.09071093946695327, 0.09085014089941978, 0.09080255255103112, 0.09086695313453674, 0.09089943617582322, 0.09083719700574874, 0.050000002527070084, 0.09068491533398629, 0.07616382919974851, 0.09069322496652603, 0.09071683958172798, 0.09086524546146393, 0.0908846378326416, 0.06374088437782353, 0.09083639308810235, 0.0908079169690609, 0.09055597260594368, 0.09045128077268601, 0.06621104702854534, 0.09085262939333916, 0.09082993492484093, 0.050000001652027934, 0.0908067338168621, 0.09071410968899726]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LIKELIHOOD: 3>, <IndirectMeasure.JACCARD: 3> =S===
Average: 0.09999999938123459 
Per Topic: [0.10000000894069672, 0.10000000149011612, 0.10000000283627201, 0.09999999831161985, 0.10000000223517418, 0.09999999403953552, 0.09999999105930328, 0.09999998137354851, 0.09999998882412911, 0.10000000298023223, 0.10000000223517418, 0.10000000596046447, 0.10000000252707009, 0.09999999925494193, 0.09999999635644484, 0.10000000521540642, 0.09999999403953552, 0.09999999180436134, 0.10000000074505806, 0.10000000359045096, 0.09999999701976776, 0.10000000968575477, 0.10000001043081283, 0.09999999403953552, 0.10000000820064922, 0.09999999925494193, 0.09999998435378074, 0.10000000165202796, 0.10000000074505806, 0.10000000223517418]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.NONE: 0> =S===
Average: -17.962483372280577 
Per Topic: [-27.631021115928547, 1.3547240469054427, -27.631021115928547, 2.063014969574606, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, 0.9378823009649313, -27.631021115928547, -27.631021115928547, 1.3442993678090223, -27.631021115928547, 1.316640100991759, 1.4804173162407261, -27.631021115928547, -27.631021115928547, -27.631021115928547, 1.1340019846351324, 1.5058705645390194, -27.631021115928547, 1.3981026129370882, -27.631021115928547, 1.2109678855560513, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.4232210506374636 
Per Topic: [0.40471704192459584, 0.3748432844877243, 0.6054804116487503, 0.577781380712986, 0.39561948031187055, 0.2525869905948639, 0.4440085932612419, 0.49328284449875354, 0.3226379733532667, 0.44289947897195814, 0.3710936218500137, 0.3367249697446823, 0.23194885086268185, 0.4129734568297863, 0.4842275597155094, 0.450000374764204, 0.3989551968872547, 0.4163795210421085, 0.3081002600491047, 0.6187533557415008, 0.3349212914705276, 0.4033030778169632, 0.5193201501853764, 0.4590431611053646, 0.4998456843197346, 0.39013231322169306, 0.4592802584171295, 0.5574072420597076, 0.39356791619211434, 0.33679577708244324]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.DICE: 2> =S===
Average: 1.4709807678063713 
Per Topic: [1.2335412502288818, 1.7493083596229553, 0.9758998453617096, 1.242317545413971, 1.3921151995658874, 1.305328941345215, 1.1145889222621919, 0.9970690608024597, 1.267557430267334, 1.5003574252128602, 4.813571095466614, 2.008640003204346, 1.6858244955539703, 2.4432090878486634, 0.975768095254898, 1.230519461631775, 1.2183526396751403, 1.2709275364875794, 1.3993108928203584, 0.9570511102676391, 2.5273105978965758, 1.1639196932315827, 1.0658064484596252, 1.497915780544281, 0.9561408758163452, 1.4438908696174622, 1.2263261437416078, 1.0679799139499664, 1.0561725080013276, 1.342701804637909]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LOGLIKELIHOOD: 4>, <IndirectMeasure.JACCARD: 3> =S===
Average: -8.774326799710591 
Per Topic: [-180.41927905082701, -2.694600486755371, -8.551981115341187, 91.20622534751892, -5.142442607879639, -1.0554541110992433, -2.1000454425811768, -15.980204629898072, -6.960401701927185, -28.277136898040773, -1.3747661471366883, -18.568219244480133, -1.4437849283218385, -1.7968761801719666, -127.09476642608642, -12.824231719970703, 5.664210915565491, -6.707182431221009, -2.5311867952346803, -0.9237553596496582, -1.7390028595924378, -7.049778604507447, -6.394920063018799, -3.4542298316955566, -3.7749561309814452, -22.10694603919983, -6.989784550666809, -18.917453050613403, 144.33578805923463, -9.562641906738282]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.NONE: 0> =S===
Average: 0.4454398795410999 
Per Topic: [0.0, 1.3222507753641843, 0.0, 1.9787092269858242, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.9223617800281911, 0.0, 0.0, 1.3126117847152092, 0.0, 1.2868936721207613, 1.439142453324124, 0.0, 0.0, 0.0, 1.0883911517599014, 1.4648907739423103, 0.0, 1.3624297085720154, 0.0, 1.1855150594204762, 0.0, 0.0, 0.0, 0.0]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.44621886669153665 
Per Topic: [0.4124165214598179, 0.41815333664417265, 0.6163956195116043, 0.6051172554492951, 0.4346828691661358, 0.23162247641012074, 0.4828745499253273, 0.5160080606117845, 0.3062569569796324, 0.5360476672649384, 0.47579583078622817, 0.3852004334330559, 0.1935215424746275, 0.4705633774399757, 0.49999310076236725, 0.4825298238545656, 0.4240686010569334, 0.4432110205292702, 0.31062971577048304, 0.629476198554039, 0.38552218191325666, 0.4223848611116409, 0.5205207884311676, 0.4717846617102623, 0.506538450345397, 0.4375795140862465, 0.4865640806034207, 0.580752494931221, 0.37728322446346285, 0.32307078506564724]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.DICE: 2> =S===
Average: 1.0435703825950624 
Per Topic: [1.2180174231529235, 1.8631192445755005, 0.9675707817077637, 1.1141510665416718, 1.378911566734314, 1.3160255074501037, 1.1139546632766724, 0.9873750627040863, 1.2584400296211242, 1.502124309539795, -10.115685820579529, 2.321642255783081, 1.88246488571167, 2.8269158482551573, 0.9587222039699554, 1.229571372270584, 1.2208441019058227, 1.2584603667259215, 1.4352289259433746, 0.9435745537281036, 3.802091729640961, 1.1612120151519776, 1.045456701517105, 1.5008124589920044, 0.946215808391571, 1.4917181432247162, 1.2405412077903748, 1.0362650513648988, 1.0487590491771699, 1.3526109635829926]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.PMI: 5>, <IndirectMeasure.JACCARD: 3> =S===
Average: -11.746772848367693 
Per Topic: [5.979167127609253, -2.6410277843475343, 26.23333306312561, -1.7838737487792968, -8.457412910461425, -1.4535639047622682, -3.577716016769409, -27.5216344833374, -8.907718539237976, -0.33805098533630373, -1.0935552358627318, 0.4839033007621765, -1.4398733735084535, -1.685006892681122, 49.72143075466156, -0.885801887512207, 0.6011057376861573, -27.695952582359315, -2.4876408100128176, -20.633373832702638, -1.4742779970169066, -9.314956903457642, -11.319663143157959, -3.6779964208602904, 28.433544445037843, -0.4355814456939697, -7.089989948272705, -313.0866086959839, -11.304293155670166, 4.449900817871094]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.NONE: 0> =S===
Average: 0.09812190169672579 
Per Topic: [0.0, 0.28712310772271554, 0.0, 0.4296712492948265, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.2002883156967014, 0.0, 0.0, 0.2850300274976867, 0.0, 0.27944541030519776, 0.3125058130824736, 0.0, 0.0, 0.0, 0.27821696095948567, 0.318096989863972, 0.0, 0.29584785221336524, 0.0, 0.25743132426534937, 0.0, 0.0, 0.0, 0.0]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.6040872625385721 
Per Topic: [0.5692125529050827, 0.65080326795578, 0.5849519714713096, 0.6516749501228333, 0.6060932964086533, 0.5997911095619202, 0.5725445315241814, 0.6604177474975585, 0.5421510398387909, 0.7167915105819702, 0.6984683573246002, 0.6483241111040116, 0.5904821053147316, 0.744459581375122, 0.48266232311725615, 0.6158387660980225, 0.5940763026475906, 0.5786906272172928, 0.6278219133615494, 0.6103616148233414, 0.6522184759378433, 0.5438662767410278, 0.5459191679954529, 0.6375426169484854, 0.5009278669953346, 0.6335693955421448, 0.6010507076978684, 0.6048533856868744, 0.5055979818105698, 0.5514543205499649]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.DICE: 2> =S===
Average: 0.16895263698810598 
Per Topic: [-0.22467442005872726, -0.0717879543080926, 2.4869715571403503, -0.6741549015045166, -0.16681005880236627, -0.11651808135211468, -1.0089258432388306, 5.113375616073609, -0.16409189701080323, -0.17005307972431183, 0.0017191542850923725, -0.04115924602374434, -0.01008585398667492, -0.07271983586251736, 2.4648455142974854, -0.5109133243560791, -0.25942555218935015, -0.2349082961678505, -0.11019136048853398, 1.4878202319145202, -0.023804608383215964, -0.35086887776851655, -1.1440933644771576, -0.16545448899269105, 1.8452358961105346, -0.234650719165802, -0.4044821739196777, -1.3970118880271911, -0.6600303232669831, -0.11457271110266447]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.NPMI: 6>, <IndirectMeasure.JACCARD: 3> =S===
Average: -0.53091334684587 
Per Topic: [-0.18150741010904312, -0.06580013204365968, -1.7469874143600463, -0.4021384835243225, -0.1423875130712986, -0.10274352580308914, -0.49714879393577577, -1.3276653647422791, -0.14003336653113366, -0.14451706111431123, 0.0022407311684219168, -0.03821370052173734, -0.006127732509048656, -0.0674062106758356, -2.0379212737083434, -0.3348137766122818, -0.20347139984369278, -0.19002137333154678, -0.0963026087731123, -3.2521924495697023, -0.022489816357847303, -0.2565449684858322, -0.5277079284191132, -0.14061422869563103, -2.4841044306755067, -0.1890276476740837, -0.28705792129039764, -0.5744902908802032, -0.37045379281044005, -0.09975052047520876]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.NONE: 0> =S===
Average: -18.420680743983702 
Per Topic: [-27.631021115928547, -9.900003040065772e-11, -27.631021115928547, -9.900003040065772e-11, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547, -9.900003040065772e-11, -27.631021115928547, -27.631021115928547, -9.900003040065772e-11, -27.631021115928547, -9.900003040065772e-11, -9.900003040065772e-11, -27.631021115928547, -27.631021115928547, -27.631021115928547, -4.8999915237636707e-11, -9.900003040065772e-11, -27.631021115928547, -9.900003040065772e-11, -27.631021115928547, -9.900003040065772e-11, -27.631021115928547, -27.631021115928547, -27.631021115928547, -27.631021115928547]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.COSINE: 1> =S===
Average: 0.6858331813414893 
Per Topic: [0.6675231665372848, 0.6407616078853607, 0.7199262201786041, 0.7087128102779389, 0.674959808588028, 0.7404359757900238, 0.658565005660057, 0.6793989837169647, 0.6806197047233582, 0.7043274402618408, 0.717370119690895, 0.7084757924079895, 0.6948964387178421, 0.6933852344751358, 0.6634315013885498, 0.6951278984546662, 0.6594241172075271, 0.6288093000650405, 0.6775986671447753, 0.7365705847740174, 0.7033872485160828, 0.634419396519661, 0.706954562664032, 0.7053548276424408, 0.7015772461891174, 0.662407511472702, 0.6772064924240112, 0.6904528617858887, 0.6603913962841034, 0.6825235188007355]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.DICE: 2> =S===
Average: 0.9118037780125937 
Per Topic: [0.9116515040397644, 0.9112907469272613, 0.9106871962547303, 0.9116369605064392, 0.9115502536296844, 0.9159986495971679, 0.9106798052787781, 0.9134948909282684, 0.911925446987152, 0.9125157237052918, 0.9103051900863648, 0.9119442880153656, 0.9142145872116089, 0.9110994279384613, 0.9133953988552094, 0.9107547461986542, 0.9126778662204742, 0.9105921447277069, 0.9141834139823913, 0.9106616199016571, 0.9110627174377441, 0.9120863258838654, 0.91106196641922, 0.9114029586315155, 0.9120813608169556, 0.9111285269260406, 0.9101901948451996, 0.9104019999504089, 0.9119841039180756, 0.9114533245563508]

==== Coherence : <Segmentation.ONE_SET: 5>, <ConfirmMeasure.LOGCOND: 7>, <IndirectMeasure.JACCARD: 3> =S===
Average: 14.449915270805356 
Per Topic: [17.00997748374939, 13.411004638671875, 13.621732091903686, 15.789669513702393, 14.465864181518555, 19.18271050453186, 13.796858215332032, 17.785425448417662, 16.787820911407472, 14.353698372840881, 11.057847213745116, 13.338881254196167, 16.170499539375307, 13.692289924621582, 14.495761942863464, 13.061396408081055, 16.125968885421752, 12.430436897277833, 14.109033560752868, 14.151474952697754, 14.06671895980835, 15.34122953414917, 13.646188926696777, 13.780387353897094, 16.990155601501463, 12.628229808807372, 11.969531393051147, 12.254098796844483, 13.900067400932311, 14.082498407363891]

Pining Topics using Word Priors#

# make LDA model and train
mdl = tp.LDAModel(k=20, min_cf=10, min_df=5, corpus=corpus)
# The word 'church' is assigned to Topic 0 with a weight of 1.0 and to the remaining topics with a weight of 0.1.
# Therefore, a topic related to 'nasdaq' can be fixed at Topic 0 .
mdl.set_word_prior("nasdaq", [1.0 if k == 0 else 0.0001 for k in range(20)])
# Topic 1 for a topic related to 'bank'
mdl.set_word_prior("bank", [1.0 if k == 1 else 0.0001 for k in range(20)])
# Topic 2 for a topic related to 'car'
mdl.set_word_prior("oil", [1.0 if k == 2 else 0.0001 for k in range(20)])
mdl.train(0)
print(
    "Num docs:",
    len(mdl.docs),
    ", Vocab size:",
    len(mdl.used_vocabs),
    ", Num words:",
    mdl.num_words,
)
print("Removed top words:", mdl.removed_top_words)
for i in range(0, 100, 10):
    mdl.train(10)
    print("Iteration: {}\tLog-likelihood: {}".format(i, mdl.ll_per_word))

mdl.summary()

Show code cell output Hide code cell output

<Basic Info>
| LDAModel (current version: 0.12.3)
| 100 docs, 23802 words
| Total Vocabs: 5139, Used Vocabs: 726
| Entropy of words: 6.20784
| Entropy of term-weighted words: 6.20784
| Removed Vocabs: <NA>
|
<Training Info>
| Iterations: 100, Burn-in steps: 0
| Optimization Interval: 10
| Log-likelihood per word: -6.60404
|
<Initial Parameters>
| tw: TermWeight.ONE
| min_cf: 10 (minimum collection frequency of words)
| min_df: 5 (minimum document frequency of words)
| rm_top: 0 (the number of top words to be removed)
| k: 20 (the number of topics between 1 ~ 32767)
| alpha: [0.1] (hyperparameter of Dirichlet distribution for document-topic, given as a single `float` in case of symmetric prior and as a list with length `k` of `float` in case of asymmetric prior.)
| eta: 0.01 (hyperparameter of Dirichlet distribution for topic-word)
| seed: 2936119906 (random seed)
| trained in version 0.12.3
|
<Parameters>
| alpha (Dirichlet prior on the per-document topic distributions)
|  [0.91635215 0.40094903 0.46934685 0.31341073 0.8245844  0.24786447
|   0.41051129 0.42268345 0.59450996 0.26436833 0.7942999  0.9315055
|   0.74502105 0.36342382 0.6116812  0.4277962  0.97071016 0.26491374
|   1.0327024  0.6145224 ]
| eta (Dirichlet prior on the per-topic word distribution)
|  0.01
|
<Topics>
| #0 (1420) : nasdaq inc gain global demand
| #1 (870) : bank financi goldman could increas
| #2 (935) : oil data countri product sinc
| #3 (615) : investor fund cash manag activist
| #4 (2113) : zack stock rank compani buy
| #5 (609) : revenu billion report cloud share
| #6 (835) : market invest valu product also
| #7 (1250) : percent index dollar retail yen
| #8 (929) : year sector risk move amazon
| #9 (930) : gold price silver metal see
| #10 (1461) : compani nyse unit time number
| #11 (1492) : price high market last new
| #12 (1752) : year million sale revenu expect
| #13 (714) : dollar market store today rate
| #14 (1257) : quarter compani growth oper earn
| #15 (1027) : said reuter trump govern billion
| #16 (1967) : earn estim stock expect posit
| #17 (467) : peopl googl say media ask
| #18 (1975) : trade week us still forecast
| #19 (1184) : one secur make would presid
|

for k in range(mdl.k):
    print("== Topic #{} ==".format(k))
    for word, prob in mdl.get_topic_words(k, top_n=10):
        print(word, prob, sep="\t")
    print()

Show code cell output Hide code cell output

== Topic #0 ==
nasdaq	0.07707776129245758
inc	0.0574597492814064
gain	0.04695010185241699
global	0.04624945670366287
demand	0.04134495183825493
recent	0.039243023842573166
world	0.03223659098148346
develop	0.028032733127474785
industri	0.028032733127474785
system	0.025230159983038902

== Topic #1 ==
bank	0.0991838201880455
financi	0.06384652107954025
goldman	0.047887738794088364
could	0.04332808777689934
increas	0.04218817502260208
invest	0.041048262268304825
fed	0.03876843675971031
firm	0.0364886112511158
rise	0.034208785742521286
interest	0.03306887298822403

== Topic #2 ==
oil	0.07005497813224792
data	0.05201324075460434
countri	0.043523017317056656
product	0.03821662440896034
sinc	0.03821662440896034
low	0.03715534880757332
well	0.03715534880757332
declin	0.029726402834057808
energi	0.027603846043348312
rose	0.02654256857931614

== Topic #3 ==
investor	0.12054447084665298
fund	0.08358242362737656
cash	0.05304856225848198
manag	0.046620383858680725
activist	0.04019220173358917
score	0.03537106513977051
flow	0.03537106513977051
even	0.028942884877324104
valu	0.027335839346051216
hedg	0.027335839346051216

== Topic #4 ==
zack	0.08725816011428833
stock	0.08112683892250061
rank	0.0603746697306633
compani	0.037264298647642136
buy	0.036792658269405365
share	0.03396281599998474
current	0.02641656994819641
technolog	0.02594492956995964
hold	0.02547328919172287
nyse	0.024058369919657707

== Topic #5 ==
revenu	0.14605848491191864
billion	0.13632233440876007
report	0.07952811568975449
cloud	0.05681043490767479
share	0.04869697988033295
top	0.042206212878227234
ep	0.042206212878227234
busi	0.032470062375068665
consensu	0.032470062375068665
bp	0.032470062375068665

== Topic #6 ==
market	0.09380713850259781
invest	0.07362334430217743
valu	0.05343955382704735
product	0.047503143548965454
also	0.034443046897649765
money	0.034443046897649765
consum	0.03206848353147507
cost	0.03088119998574257
measur	0.027319354936480522
move	0.027319354936480522

== Topic #7 ==
percent	0.1447671800851822
index	0.05091230198740959
dollar	0.04534463584423065
retail	0.04136773571372032
yen	0.04136773571372032
yield	0.0381862111389637
rate	0.029437027871608734
share	0.027050886303186417
currenc	0.025460125878453255
consum	0.02227860502898693

== Topic #8 ==
year	0.0726400837302208
sector	0.04059769585728645
risk	0.03952961415052414
move	0.03312114253640175
amazon	0.029916903004050255
come	0.02884882315993309
chang	0.027780745178461075
option	0.025644585490226746
good	0.02457650564610958
take	0.023508427664637566

== Topic #9 ==
gold	0.13657896220684052
price	0.05335765704512596
silver	0.050156839191913605
metal	0.03948744013905525
see	0.03842049837112427
season	0.034152742475271225
come	0.032018862664699554
report	0.029884984716773033
mine	0.028818044811487198
investor	0.026684165000915527

== Topic #10 ==
compani	0.10489286482334137
nyse	0.044276900589466095
unit	0.025206705555319786
time	0.02384454943239689
number	0.02316347137093544
work	0.021120237186551094
grow	0.019077003002166748
back	0.019077003002166748
last	0.0183959249407053
deal	0.01771484687924385

== Topic #11 ==
price	0.07671117782592773
high	0.07337620109319687
market	0.06070328131318092
last	0.054033324122428894
new	0.05336632952094078
term	0.05003134906291962
long	0.03935941681265831
see	0.033356454223394394
top	0.02802048809826374
remain	0.026686497032642365

== Topic #12 ==
year	0.14438456296920776
million	0.0716267079114914
sale	0.04491093009710312
revenu	0.03524777293205261
expect	0.03297409042716026
result	0.030131986364722252
increas	0.028995145112276077
billion	0.02672146074473858
per	0.026153041049838066
share	0.02217409573495388

== Topic #13 ==
dollar	0.06379114091396332
market	0.05131297558546066
store	0.04299420490860939
today	0.04160774126648903
rate	0.03467543050646782
buy	0.03328896686434746
open	0.0319025032222271
higher	0.030516041442751884
close	0.030516041442751884
part	0.029129577800631523

== Topic #14 ==
quarter	0.1036258339881897
compani	0.0972980260848999
growth	0.08622435480356216
oper	0.05537626892328262
earn	0.0545852929353714
margin	0.03560185059905052
total	0.03560185059905052
cent	0.03322892263531685
strong	0.0316469706594944
corpor	0.029274040833115578

== Topic #15 ==
said	0.18564963340759277
reuter	0.0435190349817276
trump	0.0425521619617939
govern	0.0328834131360054
billion	0.024181541055440903
york	0.023214666172862053
offici	0.021280916407704353
comment	0.021280916407704353
need	0.019347166642546654
tax	0.019347166642546654

== Topic #16 ==
earn	0.07902201265096664
estim	0.06129385158419609
stock	0.04913739860057831
expect	0.042552653700113297
posit	0.041033096611499786
averag	0.036980945616960526
report	0.033435314893722534
market	0.027357084676623344
also	0.026850566267967224
per	0.023304933682084084

== Topic #17 ==
peopl	0.0717117115855217
googl	0.0506262369453907
say	0.04219204559922218
media	0.040083497762680054
ask	0.037974949926137924
center	0.037974949926137924
would	0.035866402089595795
court	0.035866402089595795
face	0.035866402089595795
new	0.035866402089595795

== Topic #18 ==
trade	0.07214492559432983
week	0.048939090222120285
us	0.027751151472330093
still	0.025228777900338173
forecast	0.022201931104063988
point	0.022201931104063988
time	0.021697456017136574
china	0.021697456017136574
two	0.02119298093020916
one	0.020184030756354332

== Topic #19 ==
one	0.04198075830936432
secur	0.040301863104104996
make	0.03862296789884567
would	0.034425731748342514
presid	0.033586286008358
help	0.030228495597839355
also	0.028549600392580032
gener	0.021834025159478188
two	0.020155129954218864
inc	0.020155129954218864

Lab: Tomotopy

Contents

Lab: Tomotopy#

What is tomotopy?#

Getting Started#

Model Save and Load#

Documents in the Model and out of the Model#

Inference for Unseen Documents#

Corpus and transform#

Pining Topics using Word Priors#

Examples#

Install or upgrade of ekorpkit#

Load a dataset#

LDA Basics#

LDA Visualization#

LDA coherence#

Pining Topics using Word Priors#