TOP ACTIVATIONS
MAX = 8.124

igraph from the Greek historian Herod ot us begins the Australian
, is something of a vind ication of Democratic strategic thinking
one element of a tri pty ch called Echo that Indigenous
problems how to address sab er - r att ling
actively making a fool of Gohan by not only making him
narrow passage . According to Herod ot us , a squ
with justice that history has vind icated her .
I had been a Sab rina subscriber for a few
Should a homeowner confront a gy psy , they will give
the president of having a vind ictive streak of deliberately
are you really down with Gohan ? <|endoftext|> R ents still
. Sure , Gohan did get his chance back
: Great full Taco , Sab etta , Café Z ol
the wheel of a Ford Fiesta R 5 , but the
GOP officials are trying to ram through a ridiculous and unpopular
other providers of care , deleg ating the task to states
conflicts of interest pose a vex ing challenge . All the
was recalling what Grand pa Pett ib one had said .
adviser to Inter pol and fut ur ist - in -
Ohio Gr ange Cafe , Sab ine , Chef G .'

MIN ACTIVATIONS
MIN = -8.230

actic acid that reduces the pH of the vagina and restricts
, Uber disrupted taxis and Airbnb is disrupting hospitality . https
<|endoftext|> How to install Netflix Kodi 17 Krypt on addon
urge to binge on streaming Netflix television and movies by some
8 million due to Carlos Quentin . But fact oring in
, Uber disrupted taxis and Airbnb is disrupting hospitality .
When I joined Netflix in 2014 , we had
he should be serving his sentence in Canada as a youth
knocked out to install the Netflix addon in play a role
98 s )} 62 . Ng 7 { ( 67 s
mel bourne - 3000 , vic First posted <|endoftext|>
Add ons -> Netflix . Finally , sign in
Heritage Foundation , told the Ou ach ita Citizen that the
already Next , download the Netflix plugin zip file from here
and making it work at Netflix , and posted some of
Alberta . His sentence is to expire in 2018
Check out Che apest Netflix VPN s !
present - could face jail sentences if found guilty of any
17 Krypt on addon .? Netflix is probably the best streaming
." <|endoftext|> Anonymous asked : AU idea : slightly cl iche

INTERVAL 4.036 - 8.124
CONTAINS 0.366%

Maryland for many years . Parents who complain they were unaware
t one of the bureau s many tech
o , who lives in Lomb ard , was by chance
Fle w , Edward Air Force Base . http
or Academy in Garden Valley , Tex .

INTERVAL -0.053 - 4.036
CONTAINS 51.984%

historic town . Like maybe it is not a good idea
where a game can go from fun to legend ," U
scanners highlight items that are especially dense , and items that
Develop ed by : Relic Entertainment - Sierra Studios
commercials for the beer company , traditionally during the Super Bowl

INTERVAL -4.141 - -0.053
CONTAINS 47.266%

my team to achieve a goal , and that 's to
2002 Matt Le Bl anc 2002 Bernie
of society . Yet mayors , as the top political and
comes to women s rights , they are two
Co ax ) under the list of brid ged connections .

INTERVAL -8.230 - -4.141
CONTAINS 0.384%

devastating for local wildlife and Indigenous communities . 5
ai and Wh ist ling Woods to vac ate 14 .
said people who sought the same benefits for free by "
that . It allows the same app to be installed on
, sign in using your Netflix account and enjoy watching Netflix
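
NOTE ON INTERVALS
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX (each about 4.09 wide). The sketch below reproduces the bin edges under that assumption. The function name equal_width_edges is hypothetical and not part of the tool that produced this dump, and the MIN/MAX inputs are the rounded values shown above, so the last printed digit may differ from the listed edges.

```python
# Minimal sketch: rebuild the interval edges, assuming equal-width bins
# between the reported MIN and MAX activations (an assumption inferred
# from the listed ranges, not confirmed by the source tool).

def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

MIN_ACT = -8.230  # rounded value from the MIN ACTIVATIONS header
MAX_ACT = 8.124   # rounded value from the TOP ACTIVATIONS header

edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Print the bins from highest to lowest, matching the order used above.
for left, right in reversed(list(zip(edges[:-1], edges[1:]))):
    print(f"INTERVAL {left:.3f} - {right:.3f}")
```

The CONTAINS percentages listed for the four bins (0.366, 51.984, 47.266, 0.384) sum to 100, which is consistent with the bins covering the whole activation range.
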
Tokenigraph
Token ID45920
Feature activation+9.03e-01

Pos logit contributions
ovan+0.271
ould+0.266
 rapport+0.241
thinkable+0.241
 Conversation+0.239
Neg logit contributions
rag-0.341
-0.333
clerosis-0.305
hib-0.299
hod-0.296
Token from
Token ID422
Feature activation+8.54e-01

Pos logit contributions
rag+0.677
+0.661
clerosis+0.599
hib+0.587
hod+0.584
Neg logit contributions
ovan-0.639
ould-0.629
 rapport-0.579
thinkable-0.577
 Conversation-0.575
Token the
Token ID262
Feature activation+3.47e-01

Pos logit contributions
rag+0.685
+0.668
clerosis+0.610
hib+0.599
hod+0.595
Neg logit contributions
ovan-0.564
ould-0.553
thinkable-0.504
 rapport-0.503
 Conversation-0.498
Token Greek
Token ID8312
Feature activation-1.20e+00

Pos logit contributions
rag+0.279
+0.272
clerosis+0.249
hib+0.245
hod+0.243
Neg logit contributions
ovan-0.229
ould-0.224
thinkable-0.204
 rapport-0.204
 Conversation-0.202
Token historian
Token ID18026
Feature activation+2.61e+00

Pos logit contributions
ovan+0.756
ould+0.740
 rapport+0.672
thinkable+0.668
 Conversation+0.668
Neg logit contributions
rag-1.001
-0.978
clerosis-0.899
hib-0.880
hod-0.875
Token Herod
Token ID48084
Feature activation+8.12e+00

Pos logit contributions
rag+1.979
+1.924
clerosis+1.750
hib+1.725
hod+1.705
Neg logit contributions
ovan-1.825
ould-1.800
thinkable-1.657
 rapport-1.648
 Conversation-1.622
Tokenot
Token ID313
Feature activation+2.26e+00

Pos logit contributions
rag+4.631
+4.192
clerosis+3.821
alos+3.791
hib+3.773
Neg logit contributions
ovan-6.775
ould-6.738
 rapport-6.607
thinkable-6.531
fleet-6.404
Tokenus
Token ID385
Feature activation+2.01e+00

Pos logit contributions
rag+1.361
+1.264
clerosis+1.137
hib+1.100
fare+1.094
Neg logit contributions
ovan-1.958
ould-1.943
 rapport-1.850
 Conversation-1.834
thinkable-1.833
Token begins
Token ID6140
Feature activation+4.85e-01

Pos logit contributions
rag+1.491
+1.454
clerosis+1.310
hib+1.305
hod+1.280
Neg logit contributions
ovan-1.429
ould-1.413
thinkable-1.297
 rapport-1.293
 Conversation-1.287
Token the
Token ID262
Feature activation-1.51e-01

Pos logit contributions
rag+0.386
+0.377
clerosis+0.344
hib+0.337
hod+0.335
Neg logit contributions
ovan-0.323
ould-0.317
 rapport-0.290
thinkable-0.288
 Conversation-0.288
Token Australian
Token ID6638
Feature activation-5.07e+00

Pos logit contributions
ovan+0.092
ould+0.090
thinkable+0.081
 rapport+0.080
 Conversation+0.079
Neg logit contributions
rag-0.128
-0.125
clerosis-0.115
 duplication-0.113
hib-0.113
Token,
Token ID11
Feature activation-3.27e-01

Pos logit contributions
ovan+1.135
ould+1.112
 rapport+1.026
thinkable+1.015
 Conversation+1.007
Neg logit contributions
rag-1.358
-1.333
clerosis-1.217
hib-1.180
hod-1.179
Token is
Token ID318
Feature activation-3.38e-01

Pos logit contributions
ovan+0.232
ould+0.227
 rapport+0.208
thinkable+0.208
 Conversation+0.207
Neg logit contributions
rag-0.247
-0.240
clerosis-0.219
hib-0.214
hod-0.212
Token something
Token ID1223
Feature activation-1.69e+00

Pos logit contributions
ovan+0.227
ould+0.223
 rapport+0.204
thinkable+0.203
 Conversation+0.202
Neg logit contributions
rag-0.266
-0.260
clerosis-0.237
hib-0.232
hod-0.231
Token of
Token ID286
Feature activation+1.32e+00

Pos logit contributions
ovan+1.181
ould+1.157
 rapport+1.065
thinkable+1.059
 Conversation+1.057
Neg logit contributions
rag-1.294
-1.265
clerosis-1.151
hib-1.126
hod-1.116
Token a
Token ID257
Feature activation+1.27e+00

Pos logit contributions
rag+1.026
+1.004
clerosis+0.918
hib+0.899
 duplication+0.896
Neg logit contributions
ovan-0.904
ould-0.892
thinkable-0.810
 Conversation-0.805
 rapport-0.803
Token vind
Token ID29178
Feature activation+8.09e+00

Pos logit contributions
rag+0.943
+0.920
clerosis+0.834
 duplication+0.820
hib+0.818
Neg logit contributions
ovan-0.912
ould-0.902
thinkable-0.825
 Conversation-0.814
 rapport-0.811
Tokenication
Token ID3299
Feature activation+1.79e+00

Pos logit contributions
rag+3.028
+2.374
hib+2.091
alon+2.069
clerosis+2.050
Neg logit contributions
ovan-8.792
ould-8.676
 rapport-8.273
 Conversation-8.203
thinkable-8.098
Token of
Token ID286
Feature activation+5.29e-02

Pos logit contributions
rag+1.331
+1.282
clerosis+1.160
hib+1.145
fare+1.138
Neg logit contributions
ovan-1.283
ould-1.273
thinkable-1.160
 Conversation-1.151
 rapport-1.146
Token Democratic
Token ID4390
Feature activation+3.77e-01

Pos logit contributions
rag+0.043
+0.042
clerosis+0.039
hib+0.038
hod+0.038
Neg logit contributions
ovan-0.034
ould-0.033
thinkable-0.030
 rapport-0.030
 Conversation-0.030
Token strategic
Token ID10039
Feature activation+8.73e-02

Pos logit contributions
rag+0.302
+0.292
clerosis+0.269
hib+0.263
 duplication+0.263
Neg logit contributions
ovan-0.250
ould-0.246
thinkable-0.223
 Conversation-0.221
fleet-0.220
Token thinking
Token ID3612
Feature activation+3.27e-01

Pos logit contributions
rag+0.066
+0.065
clerosis+0.059
 duplication+0.058
hib+0.057
Neg logit contributions
ovan-0.061
ould-0.060
thinkable-0.055
 Conversation-0.055
 rapport-0.054
Token one
Token ID530
Feature activation-1.10e+00

Pos logit contributions
ovan+0.184
ould+0.181
 rapport+0.165
thinkable+0.165
 Conversation+0.164
Neg logit contributions
rag-0.218
-0.212
clerosis-0.194
hib-0.190
hod-0.189
Token element
Token ID5002
Feature activation-1.06e+00

Pos logit contributions
ovan+0.575
ould+0.560
 rapport+0.512
thinkable+0.499
 Conversation+0.485
Neg logit contributions
rag-1.021
-1.012
clerosis-0.945
hib-0.914
hod-0.908
Token of
Token ID286
Feature activation+2.35e-01

Pos logit contributions
ovan+0.652
ould+0.641
 rapport+0.582
 Conversation+0.577
thinkable+0.576
Neg logit contributions
rag-0.900
-0.880
clerosis-0.809
hib-0.790
hod-0.786
Token a
Token ID257
Feature activation+8.19e-01

Pos logit contributions
rag+0.189
+0.184
clerosis+0.169
hib+0.165
hod+0.164
Neg logit contributions
ovan-0.155
ould-0.152
 rapport-0.138
thinkable-0.138
 Conversation-0.137
Token tri
Token ID1333
Feature activation+3.40e+00

Pos logit contributions
rag+0.604
+0.587
clerosis+0.534
hib+0.526
hod+0.522
Neg logit contributions
ovan-0.592
ould-0.580
thinkable-0.534
 rapport-0.529
 Conversation-0.526
Tokenpty
Token ID5835
Feature activation+7.89e+00

Pos logit contributions
rag+2.520
+2.418
clerosis+2.206
hod+2.165
hib+2.151
Neg logit contributions
ovan-2.435
ould-2.397
 rapport-2.218
 Conversation-2.216
thinkable-2.172
Tokench
Token ID354
Feature activation+1.15e+00

Pos logit contributions
rag+3.951
+3.639
clerosis+3.236
hod+3.145
fare+3.087
Neg logit contributions
ovan-7.631
ould-7.406
 rapport-6.929
 Subtle-6.900
thinkable-6.897
Token called
Token ID1444
Feature activation+4.28e-01

Pos logit contributions
rag+0.926
+0.902
clerosis+0.826
hib+0.810
 duplication+0.806
Neg logit contributions
ovan-0.748
ould-0.736
thinkable-0.668
 rapport-0.666
 Conversation-0.664
Token Echo
Token ID21455
Feature activation-1.35e+00

Pos logit contributions
rag+0.353
+0.345
clerosis+0.316
hib+0.310
hod+0.308
Neg logit contributions
ovan-0.273
ould-0.267
 rapport-0.243
thinkable-0.242
 Conversation-0.241
Token that
Token ID326
Feature activation-2.56e-01

Pos logit contributions
ovan+0.914
ould+0.899
 rapport+0.821
 Conversation+0.821
thinkable+0.820
Neg logit contributions
rag-1.051
-1.024
clerosis-0.937
hib-0.915
 duplication-0.907
Token Indigenous
Token ID24033
Feature activation-4.81e+00

Pos logit contributions
ovan+0.179
ould+0.175
 rapport+0.161
thinkable+0.161
 Conversation+0.160
Neg logit contributions
rag-0.195
-0.190
clerosis-0.173
hib-0.169
hod-0.168
Token problems
Token ID2761
Feature activation+3.01e+00

Pos logit contributions
rag+0.462
+0.449
clerosis+0.409
hib+0.400
hod+0.398
Neg logit contributions
ovan-0.436
ould-0.428
 rapport-0.391
thinkable-0.391
 Conversation-0.391
Token
Token ID851
Feature activation-2.34e-01

Pos logit contributions
rag+2.331
+2.235
clerosis+2.087
hib+2.039
 duplication+2.038
Neg logit contributions
ovan-2.029
ould-1.976
 Conversation-1.830
thinkable-1.824
fleet-1.808
Token how
Token ID703
Feature activation-5.21e-01

Pos logit contributions
ovan+0.161
ould+0.159
 rapport+0.145
thinkable+0.145
 Conversation+0.144
Neg logit contributions
rag-0.180
-0.175
clerosis-0.160
hib-0.157
hod-0.156
Token to
Token ID284
Feature activation+3.86e-01

Pos logit contributions
ovan+0.364
ould+0.357
 rapport+0.328
thinkable+0.326
 Conversation+0.325
Neg logit contributions
rag-0.398
-0.389
clerosis-0.354
hib-0.346
hod-0.343
Token address
Token ID2209
Feature activation+2.24e+00

Pos logit contributions
rag+0.298
+0.289
clerosis+0.262
hib+0.260
hod+0.259
Neg logit contributions
ovan-0.265
ould-0.260
thinkable-0.238
 Conversation-0.238
 rapport-0.235
Token sab
Token ID17463
Feature activation+7.87e+00

Pos logit contributions
rag+1.843
+1.793
clerosis+1.642
 duplication+1.624
hib+1.616
Neg logit contributions
ovan-1.428
ould-1.403
thinkable-1.265
 Conversation-1.264
 rapport-1.257
Tokener
Token ID263
Feature activation+2.25e+00

Pos logit contributions
rag+4.868
+4.611
clerosis+4.035
alon+3.996
hod+3.961
Neg logit contributions
ovan-6.382
ould-6.287
 Conversation-6.089
 rapport-6.037
thinkable-5.971
Token-
Token ID12
Feature activation+3.79e-01

Pos logit contributions
rag+1.683
+1.642
clerosis+1.492
hib+1.460
hod+1.446
Neg logit contributions
ovan-1.606
ould-1.576
 rapport-1.448
thinkable-1.442
 Conversation-1.437
Tokenr
Token ID81
Feature activation+1.92e+00

Pos logit contributions
rag+0.177
+0.169
clerosis+0.144
hib+0.139
hod+0.137
Neg logit contributions
ovan-0.377
ould-0.372
 rapport-0.351
thinkable-0.350
 Conversation-0.349
Tokenatt
Token ID1078
Feature activation+2.65e+00

Pos logit contributions
rag+1.148
+1.075
hod+1.009
hib+0.990
clerosis+0.984
Neg logit contributions
ovan-1.620
ould-1.582
 rapport-1.540
thinkable-1.527
 Conversation-1.525
Tokenling
Token ID1359
Feature activation-7.57e-01

Pos logit contributions
rag+2.150
+2.075
clerosis+1.901
hib+1.871
hod+1.867
Neg logit contributions
ovan-1.725
ould-1.691
 rapport-1.554
thinkable-1.551
 Conversation-1.541
Token actively
Token ID10630
Feature activation-6.78e-01

Pos logit contributions
rag+0.448
+0.436
clerosis+0.398
hib+0.393
hod+0.388
Neg logit contributions
ovan-0.378
ould-0.372
thinkable-0.338
 rapport-0.336
 Conversation-0.336
Token making
Token ID1642
Feature activation-1.80e+00

Pos logit contributions
ovan+0.468
ould+0.459
 rapport+0.422
thinkable+0.418
 Conversation+0.418
Neg logit contributions
rag-0.523
-0.511
clerosis-0.465
hib-0.454
hod-0.452
Token a
Token ID257
Feature activation+5.24e-02

Pos logit contributions
ovan+1.187
ould+1.151
 rapport+1.069
 Conversation+1.050
thinkable+1.043
Neg logit contributions
rag-1.445
-1.411
clerosis-1.295
hib-1.264
hod-1.260
Token fool
Token ID9192
Feature activation+3.70e+00

Pos logit contributions
rag+0.043
+0.042
clerosis+0.038
hib+0.037
hod+0.037
Neg logit contributions
ovan-0.034
ould-0.033
thinkable-0.030
 rapport-0.030
 Conversation-0.030
Token of
Token ID286
Feature activation+7.44e-01

Pos logit contributions
rag+3.016
+2.918
clerosis+2.664
fare+2.616
 duplication+2.602
Neg logit contributions
ould-2.342
ovan-2.321
 Conversation-2.141
thinkable-2.113
 rapport-2.101
Token Gohan
Token ID45281
Feature activation+7.86e+00

Pos logit contributions
rag+0.678
+0.663
clerosis+0.614
hib+0.604
hod+0.600
Neg logit contributions
ovan-0.408
ould-0.401
thinkable-0.357
 rapport-0.356
 Conversation-0.354
Token by
Token ID416
Feature activation-2.86e-01

Pos logit contributions
rag+5.446
+5.314
hib+4.737
clerosis+4.665
 duplication+4.635
Neg logit contributions
ould-5.781
ovan-5.667
thinkable-5.258
 Conversation-5.186
 rapport-5.175
Token not
Token ID407
Feature activation+1.93e+00

Pos logit contributions
ovan+0.192
ould+0.188
 rapport+0.172
thinkable+0.171
 Conversation+0.171
Neg logit contributions
rag-0.226
-0.220
clerosis-0.201
hib-0.198
hod-0.196
Token only
Token ID691
Feature activation+9.26e-01

Pos logit contributions
rag+1.491
+1.453
clerosis+1.317
hib+1.309
 duplication+1.298
Neg logit contributions
ovan-1.324
ould-1.302
thinkable-1.195
 Conversation-1.184
 rapport-1.180
Token making
Token ID1642
Feature activation-7.39e-01

Pos logit contributions
rag+0.704
+0.686
clerosis+0.620
hib+0.615
 duplication+0.612
Neg logit contributions
ovan-0.646
ould-0.637
thinkable-0.583
 Conversation-0.579
 rapport-0.579
Token him
Token ID683
Feature activation+1.36e+00

Pos logit contributions
ovan+0.488
ould+0.477
 rapport+0.439
 Conversation+0.435
thinkable+0.433
Neg logit contributions
rag-0.591
-0.575
clerosis-0.530
hib-0.517
hod-0.514
Token narrow
Token ID7135
Feature activation+1.26e+00

Pos logit contributions
rag+0.651
+0.632
clerosis+0.583
hib+0.571
 duplication+0.570
Neg logit contributions
ovan-0.511
ould-0.504
thinkable-0.456
 rapport-0.455
 Conversation-0.451
Token passage
Token ID10066
Feature activation-1.55e+00

Pos logit contributions
rag+1.033
+1.000
clerosis+0.922
hod+0.902
hib+0.901
Neg logit contributions
ovan-0.797
ould-0.790
thinkable-0.715
 rapport-0.712
 Conversation-0.711
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
ovan+1.064
ould+1.042
 rapport+0.956
thinkable+0.953
 Conversation+0.948
Neg logit contributions
rag-1.196
-1.169
clerosis-1.065
hib-1.042
hod-1.033
Token According
Token ID4784
Feature activation+4.60e-01

Pos logit contributions
rag+0.160
+0.157
clerosis+0.142
hib+0.140
hod+0.139
Neg logit contributions
ovan-0.140
ould-0.137
thinkable-0.126
 rapport-0.126
 Conversation-0.122
Token to
Token ID284
Feature activation+1.99e-01

Pos logit contributions
rag+0.318
+0.309
clerosis+0.278
hib+0.275
 duplication+0.272
Neg logit contributions
ovan-0.352
ould-0.348
thinkable-0.324
 rapport-0.321
 Conversation-0.316
Token Herod
Token ID48084
Feature activation+7.79e+00

Pos logit contributions
rag+0.165
+0.161
clerosis+0.148
hib+0.145
hod+0.144
Neg logit contributions
ovan-0.126
ould-0.124
thinkable-0.112
 rapport-0.112
 Conversation-0.111
Tokenot
Token ID313
Feature activation+2.42e+00

Pos logit contributions
rag+4.102
+3.552
alon+3.200
hod+3.169
hib+3.168
Neg logit contributions
ould-6.761
ovan-6.600
thinkable-6.593
 rapport-6.592
 Conversation-6.511
Tokenus
Token ID385
Feature activation+1.97e+00

Pos logit contributions
rag+1.420
+1.304
clerosis+1.183
hod+1.154
hib+1.145
Neg logit contributions
ovan-2.108
ould-2.093
 rapport-2.021
 Conversation-2.004
thinkable-2.002
Token,
Token ID11
Feature activation+4.28e-01

Pos logit contributions
rag+1.542
+1.504
clerosis+1.373
hib+1.344
hod+1.334
Neg logit contributions
ovan-1.343
ould-1.317
 rapport-1.205
thinkable-1.199
 Conversation-1.198
Token a
Token ID257
Feature activation+8.75e-01

Pos logit contributions
rag+0.340
+0.331
clerosis+0.303
hib+0.297
hod+0.295
Neg logit contributions
ovan-0.286
ould-0.281
 rapport-0.256
thinkable-0.255
 Conversation-0.254
Token squ
Token ID2809
Feature activation+4.76e+00

Pos logit contributions
rag+0.684
+0.665
clerosis+0.608
hib+0.596
 duplication+0.596
Neg logit contributions
ovan-0.593
ould-0.584
thinkable-0.534
 rapport-0.528
 Conversation-0.527
Token with
Token ID351
Feature activation+4.90e-01

Pos logit contributions
rag+1.213
+1.182
clerosis+1.072
hib+1.049
hod+1.041
Neg logit contributions
ovan-1.176
ould-1.156
 rapport-1.060
thinkable-1.058
 Conversation-1.055
Token justice
Token ID5316
Feature activation+2.02e-01

Pos logit contributions
rag+0.406
+0.397
clerosis+0.363
hib+0.357
hod+0.355
Neg logit contributions
ovan-0.310
ould-0.304
 rapport-0.275
 Conversation-0.274
thinkable-0.274
Token that
Token ID326
Feature activation+8.46e-01

Pos logit contributions
rag+0.152
+0.148
clerosis+0.135
hib+0.132
hod+0.131
Neg logit contributions
ovan-0.143
ould-0.141
 rapport-0.129
thinkable-0.129
 Conversation-0.128
Token history
Token ID2106
Feature activation-7.21e-02

Pos logit contributions
rag+0.638
+0.620
clerosis+0.562
hib+0.553
hod+0.549
Neg logit contributions
ovan-0.599
ould-0.589
thinkable-0.538
 Conversation-0.535
 rapport-0.535
Token has
Token ID468
Feature activation+1.22e-01

Pos logit contributions
ovan+0.051
ould+0.050
 rapport+0.046
thinkable+0.046
 Conversation+0.046
Neg logit contributions
rag-0.054
-0.053
clerosis-0.048
hib-0.047
hod-0.047
Token vind
Token ID29178
Feature activation+7.67e+00

Pos logit contributions
rag+0.095
+0.093
clerosis+0.084
hib+0.083
 duplication+0.082
Neg logit contributions
ovan-0.083
ould-0.082
thinkable-0.074
 Conversation-0.074
 rapport-0.074
Tokenicated
Token ID3474
Feature activation+1.50e+00

Pos logit contributions
rag+2.801
+2.196
hib+1.857
alon+1.853
hod+1.763
Neg logit contributions
ovan-8.472
ould-8.143
 rapport-8.100
 Conversation-8.093
 Presence-8.023
Token her
Token ID607
Feature activation+7.11e-01

Pos logit contributions
rag+1.233
+1.204
clerosis+1.096
hib+1.082
 duplication+1.074
Neg logit contributions
ovan-0.955
ould-0.938
thinkable-0.847
 Conversation-0.842
 rapport-0.838
Token.
Token ID13
Feature activation-2.76e-01

Pos logit contributions
rag+0.574
+0.556
clerosis+0.507
hib+0.499
hod+0.498
Neg logit contributions
ovan-0.465
ould-0.456
thinkable-0.415
 Conversation-0.413
fleet-0.410
Token
Token ID198
Feature activation-9.55e-01

Pos logit contributions
ovan+0.175
ould+0.171
thinkable+0.156
 rapport+0.156
 Conversation+0.153
Neg logit contributions
rag-0.228
-0.223
clerosis-0.204
hib-0.200
hod-0.199
Token
Token ID198
Feature activation-5.24e-01

Pos logit contributions
ovan+0.589
ould+0.580
 rapport+0.529
thinkable+0.525
 Conversation+0.521
Neg logit contributions
rag-0.805
-0.785
clerosis-0.720
hib-0.712
hod-0.708
Token
Token ID784
Feature activation+7.92e-01

Pos logit contributions
rag+0.419
+0.408
clerosis+0.372
hib+0.365
hod+0.363
Neg logit contributions
ovan-0.369
ould-0.362
 rapport-0.331
thinkable-0.331
 Conversation-0.328
Token I
Token ID314
Feature activation+7.93e-01

Pos logit contributions
rag+0.621
+0.606
clerosis+0.553
hib+0.542
hod+0.538
Neg logit contributions
ovan-0.537
ould-0.527
 rapport-0.481
thinkable-0.480
 Conversation-0.478
Token had
Token ID550
Feature activation-6.57e-01

Pos logit contributions
rag+0.618
+0.599
clerosis+0.547
hib+0.539
hod+0.534
Neg logit contributions
ovan-0.537
ould-0.528
thinkable-0.488
 rapport-0.486
 Conversation-0.484
Token been
Token ID587
Feature activation+2.70e-01

Pos logit contributions
ovan+0.429
ould+0.420
 rapport+0.385
 Conversation+0.382
thinkable+0.381
Neg logit contributions
rag-0.531
-0.519
clerosis-0.475
hib-0.465
hod-0.462
Token a
Token ID257
Feature activation+5.18e-01

Pos logit contributions
rag+0.214
+0.209
clerosis+0.191
hib+0.188
hod+0.186
Neg logit contributions
ovan-0.181
ould-0.178
thinkable-0.161
 rapport-0.161
 Conversation-0.161
Token Sab
Token ID9910
Feature activation+7.63e+00

Pos logit contributions
rag+0.399
+0.389
clerosis+0.354
hib+0.347
hod+0.345
Neg logit contributions
ovan-0.358
ould-0.352
 rapport-0.321
thinkable-0.321
 Conversation-0.319
Tokenrina
Token ID22267
Feature activation+2.31e+00

Pos logit contributions
rag+5.198
+4.853
alon+4.327
clerosis+4.319
hib+4.308
Neg logit contributions
ovan-5.839
ould-5.789
 rapport-5.414
 Conversation-5.413
thinkable-5.404
Token subscriber
Token ID32944
Feature activation+1.22e+00

Pos logit contributions
rag+1.797
+1.751
clerosis+1.601
hib+1.569
hod+1.565
Neg logit contributions
ovan-1.570
ould-1.548
thinkable-1.411
 rapport-1.404
 Conversation-1.395
Token for
Token ID329
Feature activation+3.62e-01

Pos logit contributions
rag+0.873
+0.852
clerosis+0.772
hib+0.757
hod+0.747
Neg logit contributions
ovan-0.897
ould-0.885
thinkable-0.812
 rapport-0.811
 Conversation-0.809
Token a
Token ID257
Feature activation+4.23e-01

Pos logit contributions
rag+0.284
+0.276
clerosis+0.251
hib+0.248
hod+0.245
Neg logit contributions
ovan-0.245
ould-0.241
thinkable-0.220
 rapport-0.220
 Conversation-0.218
Token few
Token ID1178
Feature activation+5.44e-01

Pos logit contributions
rag+0.323
+0.315
clerosis+0.285
hib+0.281
hod+0.278
Neg logit contributions
ovan-0.296
ould-0.290
thinkable-0.265
 rapport-0.264
 Conversation-0.263
Token Should
Token ID10358
Feature activation-2.76e-02

Pos logit contributions
rag+0.544
+0.532
clerosis+0.494
 duplication+0.481
hod+0.481
Neg logit contributions
ovan-0.323
ould-0.314
 Conversation-0.282
 rapport-0.276
thinkable-0.275
Token a
Token ID257
Feature activation+6.90e-02

Pos logit contributions
ovan+0.018
ould+0.017
 rapport+0.016
thinkable+0.016
 Conversation+0.016
Neg logit contributions
rag-0.023
-0.022
clerosis-0.021
hib-0.020
hod-0.020
Token homeowner
Token ID40130
Feature activation+1.71e-01

Pos logit contributions
rag+0.052
+0.050
clerosis+0.046
hib+0.045
hod+0.045
Neg logit contributions
ovan-0.049
ould-0.048
 rapport-0.044
thinkable-0.044
 Conversation-0.044
Token confront
Token ID7239
Feature activation+9.77e-01

Pos logit contributions
rag+0.132
+0.128
clerosis+0.117
hib+0.115
hod+0.114
Neg logit contributions
ovan-0.118
ould-0.116
 rapport-0.106
thinkable-0.106
 Conversation-0.106
Token a
Token ID257
Feature activation-1.88e-02

Pos logit contributions
rag+0.804
+0.784
clerosis+0.719
hib+0.707
hod+0.701
Neg logit contributions
ovan-0.623
ould-0.611
thinkable-0.554
 rapport-0.553
 Conversation-0.551
Token gy
Token ID21486
Feature activation+7.48e+00

Pos logit contributions
ovan+0.013
ould+0.013
 rapport+0.012
thinkable+0.012
 Conversation+0.012
Neg logit contributions
rag-0.014
-0.014
clerosis-0.013
hib-0.012
hod-0.012
Tokenpsy
Token ID13764
Feature activation+1.78e+00

Pos logit contributions
rag+3.561
+3.274
clerosis+2.911
hib+2.880
cler+2.837
Neg logit contributions
ovan-7.164
ould-7.053
 Conversation-6.772
thinkable-6.749
 rapport-6.733
Token,
Token ID11
Feature activation+2.54e-02

Pos logit contributions
rag+1.281
+1.242
clerosis+1.125
hib+1.109
hod+1.098
Neg logit contributions
ovan-1.313
ould-1.299
thinkable-1.192
 Conversation-1.189
 rapport-1.185
Token they
Token ID484
Feature activation-1.01e+00

Pos logit contributions
rag+0.020
+0.019
clerosis+0.017
hib+0.017
hod+0.017
Neg logit contributions
ovan-0.017
ould-0.017
 rapport-0.016
thinkable-0.016
 Conversation-0.016
Token will
Token ID481
Feature activation+4.65e-01

Pos logit contributions
ovan+0.671
ould+0.658
 rapport+0.601
thinkable+0.598
 Conversation+0.596
Neg logit contributions
rag-0.803
-0.784
clerosis-0.716
hib-0.702
hod-0.697
Token give
Token ID1577
Feature activation-4.60e-01

Pos logit contributions
rag+0.343
+0.334
clerosis+0.302
hib+0.299
hod+0.295
Neg logit contributions
ovan-0.334
ould-0.329
thinkable-0.303
 Conversation-0.302
 rapport-0.301
Token the
Token ID262
Feature activation-9.79e-02

Pos logit contributions
ovan+0.303
ould+0.296
 rapport+0.271
 Conversation+0.269
thinkable+0.268
Neg logit contributions
rag-0.376
-0.367
clerosis-0.337
hib-0.329
hod-0.327
Token president
Token ID1893
Feature activation-1.31e+00

Pos logit contributions
ovan+0.070
ould+0.069
 rapport+0.063
thinkable+0.063
 Conversation+0.063
Neg logit contributions
rag-0.073
-0.071
clerosis-0.065
hib-0.063
hod-0.063
Token of
Token ID286
Feature activation-2.02e-01

Pos logit contributions
ovan+0.701
ould+0.670
 rapport+0.612
thinkable+0.607
 Conversation+0.579
Neg logit contributions
rag-1.204
-1.171
clerosis-1.112
hib-1.082
hod-1.064
Token having
Token ID1719
Feature activation-1.40e+00

Pos logit contributions
ovan+0.136
ould+0.133
 rapport+0.122
thinkable+0.121
 Conversation+0.121
Neg logit contributions
rag-0.159
-0.155
clerosis-0.142
hib-0.139
hod-0.137
Token a
Token ID257
Feature activation+3.08e-01

Pos logit contributions
ovan+0.928
ould+0.903
 rapport+0.839
thinkable+0.825
 Conversation+0.822
Neg logit contributions
rag-1.114
-1.083
clerosis-0.995
hib-0.974
hod-0.963
Token vind
Token ID29178
Feature activation+7.33e+00

Pos logit contributions
rag+0.234
+0.228
clerosis+0.207
hib+0.203
hod+0.201
Neg logit contributions
ovan-0.217
ould-0.213
 rapport-0.196
thinkable-0.195
 Conversation-0.194
Tokenictive
Token ID45279
Feature activation-2.08e-02

Pos logit contributions
rag+4.953
+4.409
hib+4.094
clerosis+4.072
hod+4.071
Neg logit contributions
ovan-5.808
ould-5.686
 Conversation-5.362
 rapport-5.344
thinkable-5.254
Token streak
Token ID15113
Feature activation-6.28e-02

Pos logit contributions
ovan+0.014
ould+0.014
thinkable+0.012
 Conversation+0.012
 rapport+0.012
Neg logit contributions
rag-0.017
-0.016
clerosis-0.015
hib-0.015
hod-0.015
Token
Token ID851
Feature activation-8.06e-01

Pos logit contributions
ovan+0.045
ould+0.044
 rapport+0.040
thinkable+0.040
 Conversation+0.040
Neg logit contributions
rag-0.047
-0.046
clerosis-0.042
hib-0.041
hod-0.041
Token of
Token ID286
Feature activation+6.01e-01

Pos logit contributions
ovan+0.553
ould+0.542
 rapport+0.496
thinkable+0.494
 Conversation+0.493
Neg logit contributions
rag-0.626
-0.610
clerosis-0.556
hib-0.545
hod-0.541
Token deliberately
Token ID14593
Feature activation+2.13e+00

Pos logit contributions
rag+0.462
+0.449
clerosis+0.408
hib+0.402
 duplication+0.400
Neg logit contributions
ovan-0.415
ould-0.410
 Conversation-0.373
thinkable-0.372
 rapport-0.371
Token are
Token ID389
Feature activation-6.04e-01

Pos logit contributions
ovan+0.085
ould+0.083
 rapport+0.076
thinkable+0.076
 Conversation+0.076
Neg logit contributions
rag-0.098
-0.096
clerosis-0.087
hib-0.086
hod-0.085
Token you
Token ID345
Feature activation+8.48e-01

Pos logit contributions
ovan+0.383
ould+0.375
 rapport+0.340
thinkable+0.339
 Conversation+0.338
Neg logit contributions
rag-0.500
-0.488
clerosis-0.448
hib-0.440
hod-0.437
Token really
Token ID1107
Feature activation-6.74e-01

Pos logit contributions
rag+0.605
+0.584
clerosis+0.525
hib+0.520
hod+0.515
Neg logit contributions
ovan-0.636
ould-0.625
thinkable-0.578
 rapport-0.574
 Conversation-0.572
Token down
Token ID866
Feature activation+1.18e+00

Pos logit contributions
ovan+0.455
ould+0.446
 rapport+0.407
thinkable+0.406
 Conversation+0.405
Neg logit contributions
rag-0.531
-0.518
clerosis-0.472
hib-0.464
hod-0.460
Token with
Token ID351
Feature activation+4.38e-01

Pos logit contributions
rag+0.875
+0.847
clerosis+0.770
hib+0.757
hod+0.748
Neg logit contributions
ovan-0.859
ould-0.841
thinkable-0.773
 rapport-0.773
 Conversation-0.769
Token Gohan
Token ID45281
Feature activation+7.33e+00

Pos logit contributions
rag+0.338
+0.330
clerosis+0.301
hib+0.294
hod+0.292
Neg logit contributions
ovan-0.302
ould-0.296
 rapport-0.271
thinkable-0.270
 Conversation-0.269
Token?
Token ID30
Feature activation+1.70e+00

Pos logit contributions
rag+5.298
+5.065
hib+4.531
 duplication+4.524
clerosis+4.524
Neg logit contributions
ovan-5.172
ould-5.129
thinkable-4.713
 rapport-4.625
 Conversation-4.603
Token<|endoftext|>
Token ID50256
Feature activation+2.01e+00

Pos logit contributions
rag+1.317
+1.290
clerosis+1.161
hib+1.157
hod+1.141
Neg logit contributions
ovan-1.155
ould-1.129
thinkable-1.050
 rapport-1.034
fleet-1.012
TokenR
Token ID49
Feature activation+1.78e+00

Pos logit contributions
rag+1.399
+1.359
clerosis+1.219
hib+1.198
hod+1.189
Neg logit contributions
ovan-1.530
ould-1.508
 rapport-1.402
 Conversation-1.388
thinkable-1.387
Tokenents
Token ID658
Feature activation+5.94e-01

Pos logit contributions
rag+1.387
+1.345
clerosis+1.234
hod+1.213
hib+1.213
Neg logit contributions
ovan-1.212
ould-1.192
 rapport-1.105
thinkable-1.095
 Conversation-1.091
Token still
Token ID991
Feature activation-9.94e-01

Pos logit contributions
rag+0.464
+0.450
clerosis+0.416
hib+0.406
hod+0.402
Neg logit contributions
ovan-0.405
ould-0.397
 rapport-0.363
thinkable-0.362
 Conversation-0.359
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
ovan+1.305
ould+1.277
 rapport+1.181
thinkable+1.171
 Conversation+1.166
Neg logit contributions
rag-1.360
-1.337
clerosis-1.215
hib-1.180
hod-1.175
Token
Token ID198
Feature activation-6.10e-01

Pos logit contributions
ovan+0.232
ould+0.228
 rapport+0.208
thinkable+0.208
 Conversation+0.205
Neg logit contributions
rag-0.276
-0.270
clerosis-0.246
hib-0.242
hod-0.240
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
ovan+0.348
ould+0.339
 Conversation+0.304
 rapport+0.304
thinkable+0.304
Neg logit contributions
rag-0.542
-0.529
clerosis-0.492
hib-0.480
hod-0.479
TokenSure
Token ID19457
Feature activation+8.88e-01

Pos logit contributions
ovan+0.087
ould+0.086
 rapport+0.080
thinkable+0.079
 Conversation+0.079
Neg logit contributions
rag-0.074
-0.072
clerosis-0.065
hib-0.063
hod-0.063
Token,
Token ID11
Feature activation-5.83e-01

Pos logit contributions
rag+0.698
+0.682
clerosis+0.623
hib+0.609
hod+0.605
Neg logit contributions
ovan-0.599
ould-0.588
 rapport-0.538
thinkable-0.535
 Conversation-0.534
Token Gohan
Token ID45281
Feature activation+7.30e+00

Pos logit contributions
ovan+0.432
ould+0.425
 rapport+0.395
 Conversation+0.392
thinkable+0.389
Neg logit contributions
rag-0.416
-0.406
clerosis-0.370
hib-0.358
hod-0.356
Token did
Token ID750
Feature activation+1.21e+00

Pos logit contributions
rag+5.093
+5.018
clerosis+4.460
 duplication+4.423
hod+4.416
Neg logit contributions
ovan-5.217
ould-5.147
thinkable-4.791
 Conversation-4.732
 rapport-4.667
Token get
Token ID651
Feature activation+8.64e-01

Pos logit contributions
rag+0.953
+0.928
clerosis+0.845
hib+0.831
hod+0.824
Neg logit contributions
ovan-0.819
ould-0.803
thinkable-0.735
 rapport-0.730
 Conversation-0.728
Token his
Token ID465
Feature activation-9.70e-02

Pos logit contributions
rag+0.689
+0.671
clerosis+0.613
hib+0.601
hod+0.597
Neg logit contributions
ovan-0.576
ould-0.565
thinkable-0.514
 rapport-0.513
 Conversation-0.511
Token chance
Token ID2863
Feature activation+6.38e-01

Pos logit contributions
ovan+0.067
ould+0.066
 rapport+0.061
 Conversation+0.060
thinkable+0.060
Neg logit contributions
rag-0.074
-0.073
clerosis-0.066
hib-0.065
hod-0.064
Token back
Token ID736
Feature activation+6.48e-01

Pos logit contributions
rag+0.482
+0.469
clerosis+0.427
hib+0.418
hod+0.415
Neg logit contributions
ovan-0.451
ould-0.443
 rapport-0.406
thinkable-0.405
 Conversation-0.404
Token:
Token ID25
Feature activation+2.11e+00

Pos logit contributions
rag+0.328
+0.321
clerosis+0.288
hib+0.284
fare+0.280
Neg logit contributions
ovan-0.354
ould-0.349
 rapport-0.323
thinkable-0.322
 Conversation-0.316
Token Great
Token ID3878
Feature activation-2.17e-01

Pos logit contributions
rag+1.649
+1.612
clerosis+1.467
hib+1.443
hod+1.432
Neg logit contributions
ovan-1.415
ould-1.395
thinkable-1.274
 rapport-1.271
 Conversation-1.246
Tokenfull
Token ID12853
Feature activation+1.72e+00

Pos logit contributions
ovan+0.147
ould+0.145
thinkable+0.133
 rapport+0.132
 Conversation+0.131
Neg logit contributions
rag-0.169
-0.164
clerosis-0.150
hib-0.148
hod-0.146
Token Taco
Token ID40129
Feature activation+7.15e-01

Pos logit contributions
rag+1.311
+1.272
clerosis+1.160
hib+1.138
hod+1.129
Neg logit contributions
ovan-1.201
ould-1.182
 rapport-1.083
thinkable-1.080
 Conversation-1.071
Token,
Token ID11
Feature activation+1.06e-01

Pos logit contributions
rag+0.549
+0.536
clerosis+0.489
hib+0.477
hod+0.474
Neg logit contributions
ovan-0.497
ould-0.486
 rapport-0.446
 Conversation-0.444
thinkable-0.444
Token Sab
Token ID9910
Feature activation+7.29e+00

Pos logit contributions
rag+0.082
+0.080
clerosis+0.073
hib+0.072
hod+0.071
Neg logit contributions
ovan-0.072
ould-0.071
 rapport-0.065
 Conversation-0.065
thinkable-0.065
Tokenetta
Token ID15253
Feature activation+2.87e+00

Pos logit contributions
rag+5.482
+5.256
clerosis+4.774
hib+4.715
hod+4.679
Neg logit contributions
ovan-5.057
ould-4.996
 rapport-4.631
thinkable-4.611
 Conversation-4.604
Token,
Token ID11
Feature activation-4.50e-01

Pos logit contributions
rag+2.185
+2.126
clerosis+1.934
hib+1.901
hod+1.886
Neg logit contributions
ovan-2.004
ould-1.976
 rapport-1.807
thinkable-1.806
 Conversation-1.787
Token Café
Token ID42151
Feature activation+2.82e+00

Pos logit contributions
ovan+0.310
ould+0.303
 Conversation+0.279
 rapport+0.278
thinkable+0.276
Neg logit contributions
rag-0.349
-0.340
clerosis-0.311
hib-0.304
hod-0.301
Token Z
Token ID1168
Feature activation+5.34e-02

Pos logit contributions
rag+2.198
+2.139
clerosis+1.946
hib+1.916
hod+1.900
Neg logit contributions
ovan-1.907
ould-1.883
thinkable-1.716
 rapport-1.707
 Conversation-1.680
Tokenol
Token ID349
Feature activation-8.36e-02

Pos logit contributions
rag+0.039
+0.038
clerosis+0.034
hib+0.033
hod+0.033
Neg logit contributions
ovan-0.039
ould-0.039
 rapport-0.036
thinkable-0.036
 Conversation-0.035
Token the
Token ID262
Feature activation-5.03e-01

Pos logit contributions
ovan+1.463
ould+1.435
 rapport+1.309
 Conversation+1.295
thinkable+1.283
Neg logit contributions
rag-2.019
-1.977
clerosis-1.824
hib-1.774
hod-1.767
Token wheel
Token ID7825
Feature activation+1.69e+00

Pos logit contributions
ovan+0.418
ould+0.412
thinkable+0.383
 rapport+0.382
 Conversation+0.381
Neg logit contributions
rag-0.316
-0.307
clerosis-0.272
hib-0.267
hod-0.263
Token of
Token ID286
Feature activation+3.08e-01

Pos logit contributions
rag+1.377
+1.343
clerosis+1.231
hib+1.210
hod+1.199
Neg logit contributions
ovan-1.090
ould-1.068
 rapport-0.972
thinkable-0.970
 Conversation-0.965
Token a
Token ID257
Feature activation+3.93e-01

Pos logit contributions
rag+0.256
+0.252
clerosis+0.231
hod+0.225
hib+0.224
Neg logit contributions
ovan-0.194
ould-0.190
 rapport-0.173
 Conversation-0.173
thinkable-0.170
Token Ford
Token ID8092
Feature activation+9.39e-01

Pos logit contributions
rag+0.289
+0.283
clerosis+0.256
hib+0.250
hod+0.248
Neg logit contributions
ovan-0.285
ould-0.280
 rapport-0.258
 Conversation-0.257
thinkable-0.256
Token Fiesta
Token ID49839
Feature activation+7.25e+00

Pos logit contributions
rag+0.682
+0.664
clerosis+0.602
hib+0.588
hod+0.583
Neg logit contributions
ovan-0.690
ould-0.678
 rapport-0.625
thinkable-0.622
 Conversation-0.621
Token R
Token ID371
Feature activation+1.92e+00

Pos logit contributions
rag+5.413
+5.092
clerosis+4.701
hib+4.616
fare+4.594
Neg logit contributions
ovan-5.056
ould-4.999
thinkable-4.590
 rapport-4.530
 Conversation-4.447
Token5
Token ID20
Feature activation+1.83e+00

Pos logit contributions
rag+1.631
+1.607
clerosis+1.471
hib+1.441
hod+1.432
Neg logit contributions
ovan-1.176
ould-1.151
 Conversation-1.030
 rapport-1.028
thinkable-1.028
Token,
Token ID11
Feature activation-7.98e-02

Pos logit contributions
rag+1.352
+1.306
clerosis+1.192
hib+1.171
hod+1.156
Neg logit contributions
ovan-1.324
ould-1.299
thinkable-1.195
 rapport-1.194
 Conversation-1.182
Token but
Token ID475
Feature activation+1.47e-01

Pos logit contributions
ovan+0.056
ould+0.055
thinkable+0.051
 rapport+0.051
 Conversation+0.050
Neg logit contributions
rag-0.060
-0.059
clerosis-0.053
hib-0.052
hod-0.052
Token the
Token ID262
Feature activation-7.69e-02

Pos logit contributions
rag+0.115
+0.111
clerosis+0.102
hib+0.100
hod+0.099
Neg logit contributions
ovan-0.100
ould-0.098
thinkable-0.090
 rapport-0.089
 Conversation-0.089
Token GOP
Token ID6796
Feature activation+3.55e-01

Pos logit contributions
rag+0.005
+0.005
clerosis+0.005
hib+0.005
hod+0.005
Neg logit contributions
ovan-0.005
ould-0.005
 rapport-0.004
thinkable-0.004
 Conversation-0.004
Token officials
Token ID2828
Feature activation+2.71e+00

Pos logit contributions
rag+0.256
+0.249
clerosis+0.226
hib+0.221
hod+0.219
Neg logit contributions
ovan-0.263
ould-0.258
 rapport-0.238
thinkable-0.237
 Conversation-0.237
Token are
Token ID389
Feature activation-2.59e-01

Pos logit contributions
rag+2.086
+2.023
clerosis+1.852
hib+1.826
 duplication+1.817
Neg logit contributions
ovan-1.842
ould-1.811
thinkable-1.672
 Conversation-1.663
fleet-1.655
Token trying
Token ID2111
Feature activation+1.43e+00

Pos logit contributions
ovan+0.184
ould+0.181
thinkable+0.166
 Conversation+0.165
 rapport+0.165
Neg logit contributions
rag-0.195
-0.189
clerosis-0.172
hib-0.170
hod-0.168
Token to
Token ID284
Feature activation+2.12e-02

Pos logit contributions
rag+1.123
+1.103
clerosis+1.008
hod+0.980
hib+0.977
Neg logit contributions
ovan-0.967
ould-0.946
thinkable-0.863
 rapport-0.860
 Conversation-0.853
Token ram
Token ID15770
Feature activation+7.24e+00

Pos logit contributions
rag+0.016
+0.015
clerosis+0.014
hib+0.014
 duplication+0.013
Neg logit contributions
ovan-0.015
ould-0.015
thinkable-0.014
 Conversation-0.014
 rapport-0.014
Token through
Token ID832
Feature activation+1.61e+00

Pos logit contributions
rag+5.855
+5.803
hib+5.189
hod+5.176
clerosis+5.150
Neg logit contributions
ovan-4.519
ould-4.432
thinkable-4.023
 Conversation-4.004
 rapport-3.997
Token a
Token ID257
Feature activation+2.29e-01

Pos logit contributions
rag+1.278
+1.248
clerosis+1.141
hib+1.120
hod+1.110
Neg logit contributions
ovan-1.069
ould-1.049
 rapport-0.952
thinkable-0.952
 Conversation-0.947
Token ridiculous
Token ID11441
Feature activation-6.41e-01

Pos logit contributions
rag+0.160
+0.156
clerosis+0.141
hib+0.137
hod+0.136
Neg logit contributions
ovan-0.175
ould-0.172
 rapport-0.159
thinkable-0.158
 Conversation-0.158
Token and
Token ID290
Feature activation+4.56e-01

Pos logit contributions
ovan+0.477
ould+0.468
 rapport+0.433
thinkable+0.430
 Conversation+0.430
Neg logit contributions
rag-0.459
-0.447
clerosis-0.405
hib-0.395
hod-0.392
Token unpopular
Token ID24166
Feature activation-1.44e+00

Pos logit contributions
rag+0.386
+0.376
clerosis+0.346
hib+0.340
 duplication+0.338
Neg logit contributions
ovan-0.282
ould-0.276
 rapport-0.249
thinkable-0.249
 Conversation-0.248
Token other
Token ID584
Feature activation-9.06e-02

Pos logit contributions
rag+0.619
+0.598
clerosis+0.548
hib+0.538
 duplication+0.534
Neg logit contributions
ovan-0.609
ould-0.600
thinkable-0.549
 Conversation-0.547
 rapport-0.545
Token providers
Token ID9549
Feature activation+1.38e+00

Pos logit contributions
ovan+0.064
ould+0.063
thinkable+0.057
 rapport+0.057
 Conversation+0.057
Neg logit contributions
rag-0.068
-0.067
clerosis-0.061
hib-0.059
hod-0.059
Token of
Token ID286
Feature activation+2.75e-01

Pos logit contributions
rag+1.025
+0.998
clerosis+0.907
hib+0.887
hod+0.880
Neg logit contributions
ovan-0.988
ould-0.971
 rapport-0.892
thinkable-0.889
 Conversation-0.887
Token care
Token ID1337
Feature activation+1.89e+00

Pos logit contributions
rag+0.219
+0.213
clerosis+0.195
hib+0.191
hod+0.190
Neg logit contributions
ovan-0.184
ould-0.180
thinkable-0.164
 rapport-0.164
 Conversation-0.163
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
rag+1.399
+1.359
clerosis+1.237
hib+1.211
hod+1.199
Neg logit contributions
ovan-1.364
ould-1.343
 rapport-1.232
thinkable-1.228
 Conversation-1.226
Token deleg
Token ID8570
Feature activation+7.17e+00

Pos logit contributions
rag+0.170
+0.165
clerosis+0.151
hib+0.149
 duplication+0.148
Neg logit contributions
ovan-0.149
ould-0.147
thinkable-0.134
 rapport-0.134
 Conversation-0.133
Tokenating
Token ID803
Feature activation-3.26e-01

Pos logit contributions
rag+4.141
+3.683
hib+3.484
hod+3.346
clerosis+3.314
Neg logit contributions
ovan-6.054
ould-5.943
 Conversation-5.870
thinkable-5.810
 rapport-5.798
Token the
Token ID262
Feature activation-1.58e-01

Pos logit contributions
ovan+0.211
ould+0.206
 rapport+0.190
 Conversation+0.188
thinkable+0.187
Neg logit contributions
rag-0.265
-0.259
clerosis-0.236
hib-0.232
hod-0.230
Token task
Token ID4876
Feature activation+1.82e-01

Pos logit contributions
ovan+0.100
ould+0.098
 rapport+0.089
thinkable+0.089
 Conversation+0.088
Neg logit contributions
rag-0.131
-0.128
clerosis-0.118
hib-0.115
hod-0.115
Token to
Token ID284
Feature activation-4.38e-02

Pos logit contributions
rag+0.152
+0.150
clerosis+0.138
hib+0.135
hod+0.134
Neg logit contributions
ovan-0.113
ould-0.111
thinkable-0.100
 rapport-0.100
 Conversation-0.099
Token states
Token ID2585
Feature activation+1.53e+00

Pos logit contributions
ovan+0.032
ould+0.031
 rapport+0.029
thinkable+0.028
 Conversation+0.028
Neg logit contributions
rag-0.032
-0.032
clerosis-0.029
hib-0.028
hod-0.028
Token conflicts
Token ID12333
Feature activation+2.13e+00

Pos logit contributions
rag+1.014
+0.983
clerosis+0.901
 duplication+0.888
hib+0.881
Neg logit contributions
ovan-0.904
ould-0.896
thinkable-0.815
 rapport-0.804
 Conversation-0.803
Token of
Token ID286
Feature activation-2.76e-01

Pos logit contributions
rag+1.746
+1.702
clerosis+1.572
hib+1.535
 duplication+1.524
Neg logit contributions
ovan-1.367
ould-1.337
 Conversation-1.216
 rapport-1.214
thinkable-1.211
Token interest
Token ID1393
Feature activation+2.68e+00

Pos logit contributions
ovan+0.263
ould+0.259
thinkable+0.243
 Conversation+0.241
 rapport+0.240
Neg logit contributions
rag-0.141
-0.134
clerosis-0.118
 duplication-0.115
hib-0.114
Token pose
Token ID12705
Feature activation+3.97e+00

Pos logit contributions
rag+2.058
+1.995
clerosis+1.844
hib+1.799
 duplication+1.796
Neg logit contributions
ovan-1.847
ould-1.812
thinkable-1.649
 Conversation-1.649
 rapport-1.639
Token a
Token ID257
Feature activation+4.31e-01

Pos logit contributions
rag+3.015
+2.903
 duplication+2.653
clerosis+2.650
hib+2.603
Neg logit contributions
ovan-2.751
ould-2.709
fleet-2.471
thinkable-2.469
 Conversation-2.468
Token vex
Token ID41548
Feature activation+7.09e+00

Pos logit contributions
rag+0.328
+0.320
clerosis+0.291
hib+0.285
hod+0.283
Neg logit contributions
ovan-0.301
ould-0.296
 rapport-0.271
thinkable-0.270
 Conversation-0.270
Tokening
Token ID278
Feature activation+1.92e-01

Pos logit contributions
rag+4.952
+4.593
hib+4.164
clerosis+4.136
 duplication+4.131
Neg logit contributions
ould-5.142
ovan-5.094
 Conversation-4.934
 rapport-4.848
thinkable-4.745
Token challenge
Token ID4427
Feature activation+2.37e+00

Pos logit contributions
rag+0.134
+0.130
clerosis+0.117
hib+0.114
hod+0.113
Neg logit contributions
ovan-0.147
ould-0.144
 rapport-0.133
thinkable-0.133
 Conversation-0.133
Token.
Token ID13
Feature activation+6.19e-03

Pos logit contributions
rag+1.752
+1.703
clerosis+1.546
hib+1.514
hod+1.499
Neg logit contributions
ovan-1.705
ould-1.676
 rapport-1.539
thinkable-1.535
 Conversation-1.531
Token All
Token ID1439
Feature activation-4.72e-01

Pos logit contributions
rag+0.005
+0.005
clerosis+0.004
hib+0.004
hod+0.004
Neg logit contributions
ovan-0.004
ould-0.004
 rapport-0.004
thinkable-0.004
 Conversation-0.004
Token the
Token ID262
Feature activation+1.61e+00

Pos logit contributions
ovan+0.323
ould+0.317
 rapport+0.290
thinkable+0.289
 Conversation+0.288
Neg logit contributions
rag-0.366
-0.358
clerosis-0.326
hib-0.319
hod-0.317
Token was
Token ID373
Feature activation-1.21e+00

Pos logit contributions
rag+0.089
+0.086
clerosis+0.079
hib+0.077
hod+0.077
Neg logit contributions
ovan-0.074
ould-0.073
 rapport-0.066
thinkable-0.066
 Conversation-0.066
Token recalling
Token ID38203
Feature activation-1.80e+00

Pos logit contributions
ovan+0.829
ould+0.813
 rapport+0.745
thinkable+0.741
 Conversation+0.740
Neg logit contributions
rag-0.941
-0.918
clerosis-0.838
hib-0.819
hod-0.813
Token what
Token ID644
Feature activation-3.06e+00

Pos logit contributions
ovan+1.149
ould+1.114
 rapport+1.036
 Conversation+1.030
thinkable+1.015
Neg logit contributions
rag-1.479
-1.447
clerosis-1.332
hib-1.300
hod-1.291
Token Grand
Token ID5675
Feature activation+6.30e-01

Pos logit contributions
ovan+1.917
ould+1.890
 rapport+1.739
 Conversation+1.730
thinkable+1.697
Neg logit contributions
rag-2.505
-2.476
clerosis-2.285
hib-2.222
hod-2.200
Tokenpa
Token ID8957
Feature activation+1.42e+00

Pos logit contributions
rag+0.519
+0.506
clerosis+0.464
hib+0.456
hod+0.452
Neg logit contributions
ovan-0.400
ould-0.393
 rapport-0.358
thinkable-0.356
 Conversation-0.354
Token Pett
Token ID35043
Feature activation+7.07e+00

Pos logit contributions
rag+0.951
+0.924
clerosis+0.828
hib+0.810
hod+0.804
Neg logit contributions
ovan-1.118
ould-1.098
 rapport-1.018
thinkable-1.016
 Conversation-1.012
Tokenib
Token ID571
Feature activation+2.57e+00

Pos logit contributions
rag+5.387
hib+4.784
+4.703
hod+4.691
clerosis+4.545
Neg logit contributions
ould-4.753
ovan-4.748
 Conversation-4.690
 rapport-4.505
 Presence-4.453
Tokenone
Token ID505
Feature activation-8.87e-01

Pos logit contributions
rag+1.829
+1.705
clerosis+1.576
hod+1.575
hib+1.551
Neg logit contributions
ovan-1.890
ould-1.850
 rapport-1.800
 Conversation-1.793
thinkable-1.751
Token had
Token ID550
Feature activation-6.17e-01

Pos logit contributions
ovan+0.567
ould+0.557
 rapport+0.507
thinkable+0.504
 Conversation+0.502
Neg logit contributions
rag-0.729
-0.712
clerosis-0.654
hib-0.640
hod-0.636
Token said
Token ID531
Feature activation+1.67e+00

Pos logit contributions
ovan+0.374
ould+0.367
 rapport+0.332
thinkable+0.330
 Conversation+0.329
Neg logit contributions
rag-0.527
-0.516
clerosis-0.475
hib-0.465
hod-0.462
Token.
Token ID13
Feature activation+1.77e-01

Pos logit contributions
rag+1.257
+1.219
clerosis+1.108
hib+1.088
 duplication+1.082
Neg logit contributions
ovan-1.184
ould-1.161
thinkable-1.059
 rapport-1.056
 Conversation-1.054
Token adviser
Token ID12534
Feature activation-1.11e+00

Pos logit contributions
ovan+0.797
ould+0.780
 rapport+0.714
thinkable+0.709
 Conversation+0.707
Neg logit contributions
rag-0.988
-0.965
clerosis-0.883
hib-0.866
hod-0.860
Token to
Token ID284
Feature activation-3.57e-01

Pos logit contributions
ovan+0.758
ould+0.733
 rapport+0.680
thinkable+0.669
 Conversation+0.666
Neg logit contributions
rag-0.879
-0.857
clerosis-0.778
hib-0.760
hod-0.757
Token Inter
Token ID4225
Feature activation+6.13e-02

Pos logit contributions
ovan+0.225
ould+0.221
 rapport+0.200
thinkable+0.199
 Conversation+0.198
Neg logit contributions
rag-0.297
-0.290
clerosis-0.266
hib-0.261
hod-0.260
Tokenpol
Token ID16104
Feature activation-1.67e+00

Pos logit contributions
rag+0.049
+0.048
clerosis+0.043
hib+0.043
hod+0.042
Neg logit contributions
ovan-0.041
ould-0.040
 rapport-0.037
thinkable-0.037
 Conversation-0.036
Token and
Token ID290
Feature activation-9.29e-02

Pos logit contributions
ovan+1.110
ould+1.073
 rapport+0.999
 Conversation+0.976
thinkable+0.971
Neg logit contributions
rag-1.336
-1.300
clerosis-1.194
hib-1.165
hod-1.156
Token fut
Token ID13294
Feature activation+7.03e+00

Pos logit contributions
ovan+0.060
ould+0.059
thinkable+0.053
 rapport+0.053
 Conversation+0.053
Neg logit contributions
rag-0.076
-0.074
clerosis-0.068
hib-0.066
hod-0.066
Tokenur
Token ID333
Feature activation+2.83e+00

Pos logit contributions
rag+3.678
+3.499
clerosis+2.926
fare+2.854
hod+2.849
Neg logit contributions
ould-6.470
ovan-6.406
 Conversation-6.103
 rapport-6.093
thinkable-5.940
Tokenist
Token ID396
Feature activation+1.09e+00

Pos logit contributions
rag+1.935
+1.878
clerosis+1.686
hib+1.642
hod+1.631
Neg logit contributions
ovan-2.189
ould-2.162
thinkable-1.993
 Conversation-1.992
 rapport-1.992
Token-
Token ID12
Feature activation-4.44e-01

Pos logit contributions
rag+0.822
+0.802
clerosis+0.727
hib+0.713
hod+0.708
Neg logit contributions
ovan-0.770
ould-0.761
thinkable-0.695
 Conversation-0.691
 rapport-0.690
Tokenin
Token ID259
Feature activation-7.90e-01

Pos logit contributions
ovan+0.318
ould+0.313
thinkable+0.287
 rapport+0.287
 Conversation+0.284
Neg logit contributions
rag-0.329
-0.322
clerosis-0.294
hib-0.286
 duplication-0.286
Token-
Token ID12
Feature activation-7.34e-01

Pos logit contributions
ovan+0.520
ould+0.511
 rapport+0.466
 Conversation+0.464
thinkable+0.463
Neg logit contributions
rag-0.634
-0.619
clerosis-0.568
 duplication-0.552
hod-0.549
Token Ohio
Token ID6835
Feature activation-1.99e-01

Pos logit contributions
ovan+0.181
ould+0.177
 rapport+0.163
 Conversation+0.163
thinkable+0.162
Neg logit contributions
rag-0.204
-0.199
clerosis-0.182
hib-0.178
hod-0.176
Token Gr
Token ID1902
Feature activation-1.72e+00

Pos logit contributions
ovan+0.142
ould+0.140
 rapport+0.128
thinkable+0.128
 Conversation+0.128
Neg logit contributions
rag-0.149
-0.145
clerosis-0.132
hib-0.129
hod-0.128
Tokenange
Token ID858
Feature activation-6.74e-01

Pos logit contributions
ovan+1.190
ould+1.163
thinkable+1.058
 rapport+1.058
 Conversation+1.052
Neg logit contributions
rag-1.324
-1.306
clerosis-1.187
 duplication-1.158
hib-1.156
Token Cafe
Token ID26965
Feature activation+1.99e+00

Pos logit contributions
ovan+0.409
ould+0.395
 Conversation+0.364
 rapport+0.358
thinkable+0.355
Neg logit contributions
rag-0.580
-0.569
clerosis-0.525
hib-0.509
 duplication-0.507
Token,
Token ID11
Feature activation-2.45e-01

Pos logit contributions
rag+1.658
+1.621
clerosis+1.489
hib+1.458
hod+1.448
Neg logit contributions
ovan-1.249
ould-1.221
 rapport-1.108
thinkable-1.105
 Conversation-1.104
Token Sab
Token ID9910
Feature activation+7.02e+00

Pos logit contributions
ovan+0.170
ould+0.167
 rapport+0.153
 Conversation+0.153
thinkable+0.152
Neg logit contributions
rag-0.188
-0.183
clerosis-0.167
hib-0.163
hod-0.162
Tokenine
Token ID500
Feature activation-3.94e-01

Pos logit contributions
rag+4.853
+4.565
clerosis+4.117
hib+4.080
hod+4.064
Neg logit contributions
ovan-5.281
ould-5.223
 rapport-4.918
 Conversation-4.897
thinkable-4.872
Token,
Token ID11
Feature activation-6.59e-01

Pos logit contributions
ovan+0.280
ould+0.273
 Conversation+0.253
 rapport+0.252
thinkable+0.250
Neg logit contributions
rag-0.295
-0.289
clerosis-0.263
hib-0.256
 duplication-0.254
Token Chef
Token ID26383
Feature activation-1.15e+00

Pos logit contributions
ovan+0.455
ould+0.446
 Conversation+0.410
 rapport+0.409
thinkable+0.406
Neg logit contributions
rag-0.509
-0.496
clerosis-0.454
hib-0.442
hod-0.438
Token G
Token ID402
Feature activation+8.08e-01

Pos logit contributions
ovan+0.802
ould+0.783
 Conversation+0.719
 rapport+0.718
thinkable+0.714
Neg logit contributions
rag-0.885
-0.865
clerosis-0.791
hib-0.771
hod-0.765
Token.'
Token ID2637
Feature activation+2.27e+00

Pos logit contributions
rag+0.582
+0.565
clerosis+0.511
hib+0.504
hod+0.499
Neg logit contributions
ovan-0.596
ould-0.586
 rapport-0.541
thinkable-0.541
 Conversation-0.539
Tokenactic
Token ID12009
Feature activation+1.23e+00

Pos logit contributions
ovan+0.546
ould+0.533
 rapport+0.487
thinkable+0.485
 Conversation+0.482
Neg logit contributions
rag-0.615
-0.603
clerosis-0.548
hib-0.535
 duplication-0.533
Token acid
Token ID7408
Feature activation-3.66e+00

Pos logit contributions
rag+0.893
+0.859
clerosis+0.787
hib+0.774
 duplication+0.766
Neg logit contributions
ovan-0.892
ould-0.877
thinkable-0.812
 rapport-0.811
 Conversation-0.808
Token that
Token ID326
Feature activation-8.05e-01

Pos logit contributions
ovan+2.404
ould+2.348
 rapport+2.140
thinkable+2.128
 Conversation+2.110
Neg logit contributions
rag-2.932
-2.888
clerosis-2.605
hod-2.554
hib-2.548
Token reduces
Token ID12850
Feature activation+1.19e+00

Pos logit contributions
ovan+0.550
ould+0.539
 rapport+0.493
thinkable+0.492
 Conversation+0.491
Neg logit contributions
rag-0.625
-0.610
clerosis-0.557
hib-0.547
hod-0.542
Token the
Token ID262
Feature activation-5.92e-01

Pos logit contributions
rag+0.929
+0.902
clerosis+0.830
hib+0.816
 duplication+0.814
Neg logit contributions
ovan-0.798
ould-0.786
thinkable-0.721
 rapport-0.713
 Conversation-0.712
Token pH
Token ID22918
Feature activation-8.23e+00

Pos logit contributions
ovan+0.381
ould+0.374
thinkable+0.340
 rapport+0.339
 Conversation+0.338
Neg logit contributions
rag-0.483
-0.470
clerosis-0.432
hib-0.424
 duplication-0.422
Token of
Token ID286
Feature activation-4.51e-01

Pos logit contributions
ovan+4.487
ould+4.440
 rapport+4.020
 Conversation+3.959
thinkable+3.952
Neg logit contributions
rag-7.158
-7.067
clerosis-6.535
hod-6.365
 duplication-6.279
Token the
Token ID262
Feature activation+7.76e-02

Pos logit contributions
ovan+0.286
ould+0.281
thinkable+0.254
 rapport+0.254
 Conversation+0.253
Neg logit contributions
rag-0.373
-0.363
clerosis-0.334
hib-0.329
hod-0.325
Token vagina
Token ID26150
Feature activation+1.07e+00

Pos logit contributions
rag+0.061
+0.059
clerosis+0.054
hib+0.054
 duplication+0.053
Neg logit contributions
ovan-0.052
ould-0.051
thinkable-0.047
 Conversation-0.046
 rapport-0.046
Token and
Token ID290
Feature activation+1.97e-01

Pos logit contributions
rag+0.802
+0.780
clerosis+0.710
hib+0.697
hod+0.689
Neg logit contributions
ovan-0.754
ould-0.741
 rapport-0.679
thinkable-0.677
 Conversation-0.677
Token restricts
Token ID47075
Feature activation+2.32e-01

Pos logit contributions
rag+0.153
+0.149
clerosis+0.137
hib+0.135
 duplication+0.133
Neg logit contributions
ovan-0.135
ould-0.132
thinkable-0.121
 Conversation-0.120
 rapport-0.120
Token,
Token ID11
Feature activation-4.13e-01

Pos logit contributions
ovan+1.245
ould+1.222
 rapport+1.122
thinkable+1.115
 Conversation+1.114
Neg logit contributions
rag-1.381
-1.350
clerosis-1.228
hib-1.203
hod-1.194
Token Uber
Token ID12024
Feature activation-2.46e+00

Pos logit contributions
ovan+0.279
ould+0.274
 rapport+0.250
thinkable+0.249
 Conversation+0.248
Neg logit contributions
rag-0.325
-0.317
clerosis-0.290
hib-0.284
hod-0.282
Token disrupted
Token ID30067
Feature activation+1.02e+00

Pos logit contributions
ovan+1.641
ould+1.620
 rapport+1.481
thinkable+1.478
 Conversation+1.466
Neg logit contributions
rag-1.925
-1.888
clerosis-1.718
hib-1.689
hod-1.668
Token taxis
Token ID45049
Feature activation+9.74e-01

Pos logit contributions
rag+0.761
+0.735
clerosis+0.672
 duplication+0.658
hib+0.654
Neg logit contributions
ovan-0.738
ould-0.726
thinkable-0.664
 rapport-0.657
 Conversation-0.656
Token and
Token ID290
Feature activation-2.18e-01

Pos logit contributions
rag+0.737
+0.715
clerosis+0.653
hib+0.638
hod+0.634
Neg logit contributions
ovan-0.688
ould-0.674
 rapport-0.619
thinkable-0.617
 Conversation-0.614
Token Airbnb
Token ID35079
Feature activation-7.93e+00

Pos logit contributions
ovan+0.152
ould+0.149
 rapport+0.137
thinkable+0.137
 Conversation+0.136
Neg logit contributions
rag-0.166
-0.161
clerosis-0.147
hib-0.144
hod-0.143
Token is
Token ID318
Feature activation-1.18e+00

Pos logit contributions
ould+4.479
ovan+4.399
thinkable+3.997
 rapport+3.961
fleet+3.943
Neg logit contributions
rag-6.634
-6.579
clerosis-6.079
hib-6.000
 duplication-5.892
Token disrupting
Token ID37915
Feature activation-2.93e-01

Pos logit contributions
ovan+0.771
ould+0.755
 rapport+0.688
thinkable+0.685
 Conversation+0.683
Neg logit contributions
rag-0.953
-0.930
clerosis-0.851
hib-0.835
hod-0.829
Token hospitality
Token ID33362
Feature activation-2.01e+00

Pos logit contributions
ovan+0.203
ould+0.199
 rapport+0.182
thinkable+0.181
 Conversation+0.181
Neg logit contributions
rag-0.226
-0.220
clerosis-0.200
hib-0.196
hod-0.195
Token.
Token ID13
Feature activation-7.18e-01

Pos logit contributions
ovan+1.404
ould+1.379
 rapport+1.266
thinkable+1.259
 Conversation+1.258
Neg logit contributions
rag-1.530
-1.494
clerosis-1.359
hib-1.332
hod-1.320
Token https
Token ID3740
Feature activation-2.53e+00

Pos logit contributions
ovan+0.465
ould+0.450
 rapport+0.420
thinkable+0.418
 Conversation+0.399
Neg logit contributions
rag-0.586
-0.572
clerosis-0.519
hod-0.514
hib-0.511
Token
Token ID251
Feature activation+8.71e-01

Pos logit contributions
rag+0.360
+0.351
clerosis+0.328
hib+0.320
 duplication+0.319
Neg logit contributions
ovan-0.212
ould-0.208
 Conversation-0.184
thinkable-0.184
 rapport-0.184
Token<|endoftext|>
Token ID50256
Feature activation+1.66e+00

Pos logit contributions
rag+0.760
+0.742
clerosis+0.685
hib+0.672
hod+0.668
Neg logit contributions
ovan-0.513
ould-0.502
 rapport-0.452
 Conversation-0.450
thinkable-0.450
TokenHow
Token ID2437
Feature activation+8.84e-02

Pos logit contributions
rag+1.155
+1.122
clerosis+1.008
hib+0.989
hod+0.981
Neg logit contributions
ovan-1.267
ould-1.248
 rapport-1.159
 Conversation-1.148
thinkable-1.148
Token to
Token ID284
Feature activation-3.38e-01

Pos logit contributions
rag+0.069
+0.068
clerosis+0.062
hib+0.061
hod+0.060
Neg logit contributions
ovan-0.060
ould-0.059
 rapport-0.054
thinkable-0.053
 Conversation-0.053
Token install
Token ID2721
Feature activation-1.84e+00

Pos logit contributions
ovan+0.226
ould+0.222
thinkable+0.202
 rapport+0.202
 Conversation+0.200
Neg logit contributions
rag-0.269
-0.262
clerosis-0.239
hib-0.235
hod-0.233
Token Netflix
Token ID12074
Feature activation-7.93e+00

Pos logit contributions
ovan+1.230
ould+1.205
 rapport+1.112
 Conversation+1.105
thinkable+1.092
Neg logit contributions
rag-1.449
-1.418
clerosis-1.297
hib-1.266
hod-1.259
Token Kodi
Token ID44129
Feature activation-5.52e+00

Pos logit contributions
ovan+5.206
ould+4.943
 Conversation+4.662
 rapport+4.624
 Advertisement+4.545
Neg logit contributions
rag-6.125
-6.109
clerosis-5.513
hod-5.385
hib-5.376
Token 17
Token ID1596
Feature activation-5.44e-01

Pos logit contributions
ovan+3.566
ould+3.442
 Conversation+3.151
 rapport+3.136
thinkable+3.095
Neg logit contributions
rag-4.487
-4.448
clerosis-4.071
hib-3.905
hod-3.889
Token Krypt
Token ID44679
Feature activation+6.13e-02

Pos logit contributions
ovan+0.383
ould+0.376
 rapport+0.345
thinkable+0.344
 Conversation+0.342
Neg logit contributions
rag-0.413
-0.402
clerosis-0.366
hib-0.359
hod-0.355
Tokenon
Token ID261
Feature activation-7.33e-01

Pos logit contributions
rag+0.041
+0.040
clerosis+0.036
hib+0.035
hod+0.035
Neg logit contributions
ovan-0.049
ould-0.048
 rapport-0.044
thinkable-0.044
 Conversation-0.044
Token addon
Token ID48557
Feature activation-5.29e+00

Pos logit contributions
ovan+0.488
ould+0.482
thinkable+0.442
 rapport+0.442
 Conversation+0.436
Neg logit contributions
rag-0.578
-0.560
clerosis-0.512
hib-0.506
hod-0.502
Token urge
Token ID14960
Feature activation-3.57e-01

Pos logit contributions
rag+0.601
+0.586
clerosis+0.536
hib+0.525
hod+0.522
Neg logit contributions
ovan-0.511
ould-0.501
 rapport-0.457
thinkable-0.457
 Conversation-0.453
Token to
Token ID284
Feature activation-9.71e-01

Pos logit contributions
ovan+0.247
ould+0.242
 rapport+0.223
thinkable+0.221
 Conversation+0.220
Neg logit contributions
rag-0.275
-0.269
clerosis-0.245
hib-0.239
hod-0.239
Token binge
Token ID34735
Feature activation-4.42e+00

Pos logit contributions
ovan+0.663
ould+0.651
 rapport+0.596
thinkable+0.593
 Conversation+0.592
Neg logit contributions
rag-0.755
-0.737
clerosis-0.672
hib-0.658
hod-0.653
Token on
Token ID319
Feature activation+5.06e-01

Pos logit contributions
ovan+2.787
ould+2.732
 rapport+2.504
 Conversation+2.477
thinkable+2.468
Neg logit contributions
rag-3.617
-3.531
clerosis-3.220
hod-3.173
alon-3.109
Token streaming
Token ID11305
Feature activation-2.46e+00

Pos logit contributions
rag+0.409
+0.399
clerosis+0.365
hib+0.358
hod+0.355
Neg logit contributions
ovan-0.331
ould-0.325
 rapport-0.295
thinkable-0.295
 Conversation-0.293
Token Netflix
Token ID12074
Feature activation-7.51e+00

Pos logit contributions
ovan+1.576
ould+1.536
 rapport+1.410
 Conversation+1.394
thinkable+1.389
Neg logit contributions
rag-2.013
-1.974
clerosis-1.809
hib-1.758
hod-1.754
Token television
Token ID5581
Feature activation-1.49e+00

Pos logit contributions
ovan+4.754
ould+4.643
 rapport+4.278
 Conversation+4.250
thinkable+4.207
Neg logit contributions
rag-5.985
-5.865
clerosis-5.282
hod-5.255
alon-5.168
Token and
Token ID290
Feature activation+3.79e-01

Pos logit contributions
ovan+1.025
ould+1.005
 rapport+0.924
thinkable+0.918
 Conversation+0.915
Neg logit contributions
rag-1.145
-1.118
clerosis-1.017
hib-0.994
hod-0.990
Token movies
Token ID6918
Feature activation-4.60e+00

Pos logit contributions
rag+0.306
+0.298
clerosis+0.273
hib+0.268
hod+0.265
Neg logit contributions
ovan-0.249
ould-0.244
 rapport-0.222
thinkable-0.222
 Conversation-0.220
Token by
Token ID416
Feature activation-1.55e+00

Pos logit contributions
ovan+3.024
ould+2.953
 rapport+2.719
thinkable+2.705
 Conversation+2.663
Neg logit contributions
rag-3.644
-3.594
clerosis-3.232
hod-3.173
hib-3.160
Token some
Token ID617
Feature activation-1.12e+00

Pos logit contributions
ovan+0.994
ould+0.975
 rapport+0.886
thinkable+0.883
 Conversation+0.879
Neg logit contributions
rag-1.273
-1.243
clerosis-1.140
hib-1.118
hod-1.109
Token8
Token ID23
Feature activation+5.22e-01

Pos logit contributions
ovan+2.526
ould+2.477
thinkable+2.281
 rapport+2.275
 Conversation+2.272
Neg logit contributions
rag-2.204
-2.135
clerosis-1.936
hib-1.877
 duplication-1.864
Token million
Token ID1510
Feature activation-2.21e+00

Pos logit contributions
rag+0.341
+0.329
clerosis+0.299
hod+0.286
hib+0.285
Neg logit contributions
ovan-0.429
ould-0.418
 rapport-0.389
thinkable-0.389
 Conversation-0.386
Token due
Token ID2233
Feature activation-1.05e-01

Pos logit contributions
ovan+1.513
ould+1.480
 rapport+1.364
thinkable+1.360
 Conversation+1.349
Neg logit contributions
rag-1.702
-1.665
clerosis-1.519
hib-1.488
hod-1.472
Token to
Token ID284
Feature activation-8.48e-01

Pos logit contributions
ovan+0.072
ould+0.071
 rapport+0.065
thinkable+0.064
 Conversation+0.064
Neg logit contributions
rag-0.081
-0.079
clerosis-0.072
hib-0.071
hod-0.070
Token Carlos
Token ID17409
Feature activation-1.13e+00

Pos logit contributions
ovan+0.535
ould+0.524
 rapport+0.478
 Conversation+0.474
thinkable+0.472
Neg logit contributions
rag-0.706
-0.689
clerosis-0.631
hib-0.620
hod-0.615
Token Quentin
Token ID42447
Feature activation-7.49e+00

Pos logit contributions
ovan+0.806
ould+0.786
 rapport+0.738
 Conversation+0.731
thinkable+0.709
Neg logit contributions
rag-0.845
-0.831
clerosis-0.741
hod-0.730
hib-0.729
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
ovan+4.596
ould+4.594
thinkable+4.126
 rapport+4.115
 Conversation+4.035
Neg logit contributions
rag-6.014
-5.955
clerosis-5.342
 duplication-5.234
hod-5.233
Token But
Token ID887
Feature activation-8.00e-01

Pos logit contributions
ovan+0.138
ould+0.136
thinkable+0.125
 rapport+0.124
 Conversation+0.121
Neg logit contributions
rag-0.160
-0.157
clerosis-0.143
hib-0.140
hod-0.139
Token fact
Token ID1109
Feature activation-3.10e+00

Pos logit contributions
ovan+0.549
ould+0.539
 rapport+0.492
thinkable+0.491
 Conversation+0.490
Neg logit contributions
rag-0.621
-0.605
clerosis-0.552
hib-0.541
hod-0.537
Tokenoring
Token ID3255
Feature activation-2.03e+00

Pos logit contributions
ovan+1.893
ould+1.855
 rapport+1.650
thinkable+1.637
 Conversation+1.631
Neg logit contributions
rag-2.622
-2.569
clerosis-2.389
hod-2.323
hib-2.303
Token in
Token ID287
Feature activation-1.42e+00

Pos logit contributions
ovan+1.425
ould+1.396
 rapport+1.290
 Conversation+1.277
thinkable+1.274
Neg logit contributions
rag-1.542
-1.506
clerosis-1.376
hib-1.346
hod-1.331
Token,
Token ID11
Feature activation-6.97e-01

Pos logit contributions
ovan+1.300
ould+1.278
 rapport+1.170
thinkable+1.166
 Conversation+1.160
Neg logit contributions
rag-1.464
-1.428
clerosis-1.301
hib-1.275
hod-1.266
Token Uber
Token ID12024
Feature activation-2.57e+00

Pos logit contributions
ould+0.554
ovan+0.551
 rapport+0.506
thinkable+0.505
 Conversation+0.499
Neg logit contributions
rag-0.465
-0.450
clerosis-0.402
hod-0.392
 duplication-0.392
Token disrupted
Token ID30067
Feature activation+7.28e-01

Pos logit contributions
ovan+1.569
ould+1.537
thinkable+1.407
 rapport+1.394
 Conversation+1.377
Neg logit contributions
rag-2.161
-2.117
clerosis-1.957
hib-1.910
 duplication-1.882
Token taxis
Token ID45049
Feature activation+9.82e-01

Pos logit contributions
rag+0.242
+0.230
clerosis+0.185
 duplication+0.177
hib+0.176
Neg logit contributions
ovan-0.822
ould-0.809
thinkable-0.769
 Conversation-0.764
 rapport-0.758
Token and
Token ID290
Feature activation-3.73e-01

Pos logit contributions
rag+0.692
+0.667
clerosis+0.606
hod+0.594
fare+0.592
Neg logit contributions
ovan-0.744
ould-0.733
thinkable-0.682
 rapport-0.680
 Conversation-0.674
Token Airbnb
Token ID35079
Feature activation-7.48e+00

Pos logit contributions
ovan+0.185
ould+0.178
 rapport+0.158
thinkable+0.157
 Conversation+0.152
Neg logit contributions
rag-0.359
-0.353
clerosis-0.330
fare-0.323
hod-0.323
Token is
Token ID318
Feature activation-1.36e+00

Pos logit contributions
ovan+4.473
ould+4.371
 rapport+3.958
thinkable+3.941
fleet+3.866
Neg logit contributions
rag-6.181
-6.138
clerosis-5.588
hib-5.540
hner-5.403
Token disrupting
Token ID37915
Feature activation-6.00e-01

Pos logit contributions
ovan+0.840
ould+0.823
 rapport+0.747
thinkable+0.743
 Conversation+0.741
Neg logit contributions
rag-1.142
-1.112
clerosis-1.022
hib-1.008
hod-1.000
Token hospitality
Token ID33362
Feature activation-1.81e+00

Pos logit contributions
ovan+0.242
ould+0.231
thinkable+0.201
 Conversation+0.194
fleet+0.189
Neg logit contributions
rag-0.629
-0.621
clerosis-0.585
hib-0.581
fare-0.579
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
ovan+1.315
ould+1.292
 rapport+1.188
thinkable+1.184
 Conversation+1.181
Neg logit contributions
rag-1.334
-1.299
clerosis-1.179
hib-1.153
hod-1.144
Token
Token ID198
Feature activation-6.88e-01

Pos logit contributions
ovan+0.617
ould+0.605
 rapport+0.552
thinkable+0.550
 Conversation+0.540
Neg logit contributions
rag-0.860
-0.840
clerosis-0.772
hib-0.758
hod-0.756
Token
Token ID198
Feature activation-1.36e-01

Pos logit contributions
rag+0.043
+0.042
clerosis+0.038
hib+0.038
hod+0.038
Neg logit contributions
ovan-0.035
ould-0.034
thinkable-0.031
 rapport-0.031
 Conversation-0.030
Token
Token ID198
Feature activation+1.37e-02

Pos logit contributions
ovan+0.085
ould+0.084
 rapport+0.077
thinkable+0.076
 Conversation+0.075
Neg logit contributions
rag-0.113
-0.110
clerosis-0.100
hib-0.100
hod-0.099
TokenWhen
Token ID2215
Feature activation-1.53e-01

Pos logit contributions
rag+0.010
+0.010
clerosis+0.009
hib+0.009
hod+0.009
Neg logit contributions
ovan-0.010
ould-0.010
 rapport-0.009
thinkable-0.009
 Conversation-0.009
Token I
Token ID314
Feature activation+2.62e-01

Pos logit contributions
ovan+0.099
ould+0.097
thinkable+0.088
 rapport+0.088
 Conversation+0.087
Neg logit contributions
rag-0.125
-0.122
clerosis-0.112
hib-0.110
 duplication-0.109
Token joined
Token ID5399
Feature activation-8.18e-01

Pos logit contributions
rag+0.208
+0.201
clerosis+0.185
 duplication+0.180
hib+0.179
Neg logit contributions
ovan-0.174
ould-0.172
thinkable-0.158
 Conversation-0.157
 rapport-0.157
Token Netflix
Token ID12074
Feature activation-7.46e+00

Pos logit contributions
ovan+0.540
ould+0.530
 rapport+0.483
thinkable+0.481
 Conversation+0.480
Neg logit contributions
rag-0.655
-0.639
clerosis-0.586
hib-0.574
hod-0.569
Token in
Token ID287
Feature activation-1.18e+00

Pos logit contributions
ovan+4.802
ould+4.621
 Conversation+4.285
 rapport+4.246
fleet+4.196
Neg logit contributions
rag-5.853
-5.806
clerosis-5.303
hib-5.210
hod-5.193
Token 2014
Token ID1946
Feature activation-1.37e+00

Pos logit contributions
ovan+0.810
ould+0.796
thinkable+0.727
 rapport+0.726
 Conversation+0.720
Neg logit contributions
rag-0.912
-0.888
clerosis-0.810
hib-0.794
hod-0.788
Token,
Token ID11
Feature activation-2.35e-02

Pos logit contributions
ovan+0.897
ould+0.876
 rapport+0.795
 Conversation+0.793
thinkable+0.791
Neg logit contributions
rag-1.099
-1.076
clerosis-0.988
hib-0.963
hod-0.957
Token we
Token ID356
Feature activation-1.26e+00

Pos logit contributions
ovan+0.015
ould+0.015
 rapport+0.014
thinkable+0.014
 Conversation+0.014
Neg logit contributions
rag-0.019
-0.019
clerosis-0.017
hib-0.017
hod-0.017
Token had
Token ID550
Feature activation-1.25e+00

Pos logit contributions
ovan+0.813
ould+0.801
thinkable+0.730
 rapport+0.730
 Conversation+0.730
Neg logit contributions
rag-1.027
-0.999
clerosis-0.916
hib-0.899
hod-0.895
Token he
Token ID339
Feature activation-2.51e-01

Pos logit contributions
rag+1.462
+1.419
clerosis+1.290
hib+1.268
hod+1.259
Neg logit contributions
ovan-1.381
ould-1.353
thinkable-1.238
 Conversation-1.234
 rapport-1.234
Token should
Token ID815
Feature activation+2.21e-01

Pos logit contributions
ovan+0.171
ould+0.168
 rapport+0.154
thinkable+0.153
 Conversation+0.153
Neg logit contributions
rag-0.196
-0.191
clerosis-0.174
hib-0.170
hod-0.169
Token be
Token ID307
Feature activation-4.73e-02

Pos logit contributions
rag+0.169
+0.165
clerosis+0.150
hib+0.148
hod+0.146
Neg logit contributions
ovan-0.153
ould-0.150
thinkable-0.138
 Conversation-0.138
 rapport-0.138
Token serving
Token ID7351
Feature activation-3.46e+00

Pos logit contributions
ovan+0.031
ould+0.031
 Conversation+0.028
thinkable+0.028
 rapport+0.028
Neg logit contributions
rag-0.038
-0.037
clerosis-0.034
hib-0.033
hod-0.033
Token his
Token ID465
Feature activation-2.86e-01

Pos logit contributions
ovan+2.294
ould+2.241
 rapport+2.074
 Conversation+2.045
thinkable+2.041
Neg logit contributions
rag-2.756
-2.674
clerosis-2.466
hib-2.388
hod-2.380
Token sentence
Token ID6827
Feature activation-7.34e+00

Pos logit contributions
ovan+0.147
ould+0.143
thinkable+0.127
 Conversation+0.126
fleet+0.125
Neg logit contributions
rag-0.271
-0.266
clerosis-0.246
hib-0.243
 duplication-0.242
Token in
Token ID287
Feature activation-9.73e-01

Pos logit contributions
ovan+4.768
ould+4.652
 rapport+4.347
thinkable+4.237
 Conversation+4.203
Neg logit contributions
rag-5.764
-5.674
clerosis-5.207
hib-5.038
hod-5.034
Token Canada
Token ID3340
Feature activation-8.92e-01

Pos logit contributions
ovan+0.627
ould+0.615
thinkable+0.555
 rapport+0.555
 Conversation+0.554
Neg logit contributions
rag-0.794
-0.776
clerosis-0.711
hib-0.699
hod-0.693
Token as
Token ID355
Feature activation+1.34e+00

Pos logit contributions
ovan+0.641
ould+0.630
 rapport+0.579
thinkable+0.577
 Conversation+0.576
Neg logit contributions
rag-0.662
-0.645
clerosis-0.585
hib-0.573
hod-0.569
Token a
Token ID257
Feature activation+2.89e-01

Pos logit contributions
rag+0.987
+0.961
clerosis+0.873
hib+0.856
hod+0.850
Neg logit contributions
ovan-0.966
ould-0.951
 Conversation-0.867
thinkable-0.866
 rapport-0.865
Token youth
Token ID6205
Feature activation-1.60e+00

Pos logit contributions
rag+0.233
+0.228
clerosis+0.209
 duplication+0.205
hib+0.205
Neg logit contributions
ovan-0.188
ould-0.184
thinkable-0.165
fleet-0.165
 Conversation-0.164
Token knocked
Token ID13642
Feature activation-1.88e-01

Pos logit contributions
rag+0.108
+0.105
clerosis+0.096
hib+0.094
hod+0.094
Neg logit contributions
ovan-0.098
ould-0.096
thinkable-0.088
 rapport-0.087
 Conversation-0.087
Token out
Token ID503
Feature activation+1.80e+00

Pos logit contributions
ovan+0.113
ould+0.110
 rapport+0.100
thinkable+0.100
 Conversation+0.100
Neg logit contributions
rag-0.162
-0.158
clerosis-0.144
hib-0.143
hod-0.142
Token to
Token ID284
Feature activation-5.44e-01

Pos logit contributions
rag+1.395
+1.355
clerosis+1.222
hib+1.211
 duplication+1.202
Neg logit contributions
ovan-1.239
ould-1.207
thinkable-1.111
 rapport-1.108
 Conversation-1.106
Token install
Token ID2721
Feature activation-2.20e+00

Pos logit contributions
ovan+0.367
ould+0.359
thinkable+0.330
 rapport+0.327
 Conversation+0.327
Neg logit contributions
rag-0.429
-0.418
clerosis-0.381
hib-0.373
 duplication-0.372
Token the
Token ID262
Feature activation-6.64e-01

Pos logit contributions
ovan+1.334
ould+1.305
 rapport+1.186
 Conversation+1.176
thinkable+1.169
Neg logit contributions
rag-1.883
-1.842
clerosis-1.705
hib-1.665
hod-1.649
Token Netflix
Token ID12074
Feature activation-7.24e+00

Pos logit contributions
ovan+0.392
ould+0.384
 rapport+0.346
thinkable+0.345
 Conversation+0.343
Neg logit contributions
rag-0.578
-0.565
clerosis-0.520
hib-0.511
hod-0.508
Token addon
Token ID48557
Feature activation-5.13e+00

Pos logit contributions
ovan+3.324
ould+3.127
 Conversation+2.809
 rapport+2.765
thinkable+2.691
Neg logit contributions
rag-7.119
-7.003
clerosis-6.607
hib-6.370
fare-6.348
Token in
Token ID287
Feature activation-8.57e-01

Pos logit contributions
ovan+3.404
ould+3.345
 rapport+3.074
thinkable+3.034
 Conversation+3.030
Neg logit contributions
rag-4.016
-3.948
clerosis-3.630
hib-3.528
hod-3.476
Token play
Token ID711
Feature activation-9.83e-02

Pos logit contributions
ovan+0.460
ould+0.449
 rapport+0.399
thinkable+0.399
 Conversation+0.395
Neg logit contributions
rag-0.793
-0.777
clerosis-0.719
hib-0.708
hod-0.704
Token a
Token ID257
Feature activation+1.82e-01

Pos logit contributions
ovan+0.071
ould+0.070
 rapport+0.065
thinkable+0.065
 Conversation+0.064
Neg logit contributions
rag-0.072
-0.070
clerosis-0.063
hib-0.062
hod-0.062
Token role
Token ID2597
Feature activation+9.32e-02

Pos logit contributions
rag+0.140
+0.136
clerosis+0.124
hib+0.122
hod+0.121
Neg logit contributions
ovan-0.126
ould-0.124
thinkable-0.113
 rapport-0.113
 Conversation-0.112
Token98
Token ID4089
Feature activation+6.19e-01

Pos logit contributions
rag+0.532
+0.517
clerosis+0.456
hib+0.452
hod+0.449
Neg logit contributions
ovan-0.644
ould-0.633
 rapport-0.593
thinkable-0.590
 Conversation-0.586
Tokens
Token ID82
Feature activation-6.33e-01

Pos logit contributions
rag+0.440
+0.420
clerosis+0.374
hib+0.373
hod+0.371
Neg logit contributions
ovan-0.468
ould-0.462
thinkable-0.433
 rapport-0.431
 Conversation-0.427
Token)}
Token ID38165
Feature activation-1.01e+00

Pos logit contributions
ovan+0.526
ould+0.517
 rapport+0.484
thinkable+0.482
 Conversation+0.480
Neg logit contributions
rag-0.399
-0.385
clerosis-0.345
hib-0.337
hod-0.333
Token 62
Token ID8190
Feature activation-1.33e-02

Pos logit contributions
ovan+0.749
ould+0.720
 Conversation+0.668
 rapport+0.659
fleet+0.648
Neg logit contributions
rag-0.734
-0.729
clerosis-0.677
hib-0.651
fare-0.640
Token.
Token ID13
Feature activation+9.39e-02

Pos logit contributions
ovan+0.010
ould+0.010
 rapport+0.009
thinkable+0.009
 Conversation+0.009
Neg logit contributions
rag-0.009
-0.009
clerosis-0.008
hib-0.008
hod-0.008
Token Ng
Token ID34786
Feature activation-7.06e+00

Pos logit contributions
rag+0.074
+0.072
clerosis+0.066
hib+0.065
hod+0.064
Neg logit contributions
ovan-0.063
ould-0.062
 rapport-0.057
thinkable-0.057
 Conversation-0.056
Token7
Token ID22
Feature activation-2.68e-01

Pos logit contributions
ovan+4.568
ould+4.411
thinkable+3.976
 rapport+3.960
 Conversation+3.886
Neg logit contributions
rag-5.517
-5.499
clerosis-5.093
 duplication-4.902
hib-4.875
Token {
Token ID1391
Feature activation+2.09e-01

Pos logit contributions
ovan+0.187
ould+0.184
 rapport+0.168
 Conversation+0.167
thinkable+0.167
Neg logit contributions
rag-0.204
-0.200
clerosis-0.184
hib-0.178
 duplication-0.176
Token(
Token ID7
Feature activation-1.70e-01

Pos logit contributions
rag+0.152
+0.149
clerosis+0.131
hib+0.131
hod+0.131
Neg logit contributions
ovan-0.153
ould-0.151
 rapport-0.141
 Conversation-0.140
thinkable-0.140
Token67
Token ID3134
Feature activation+1.75e+00

Pos logit contributions
ovan+0.136
ould+0.134
 rapport+0.125
thinkable+0.125
 Conversation+0.124
Neg logit contributions
rag-0.112
-0.109
clerosis-0.096
hib-0.095
hod-0.094
Tokens
Token ID82
Feature activation-2.94e-01

Pos logit contributions
rag+1.208
+1.146
hib+1.030
hod+1.020
clerosis+1.014
Neg logit contributions
ovan-1.359
ould-1.331
 rapport-1.269
thinkable-1.258
 Conversation-1.252
Token mel
Token ID7758
Feature activation-2.58e+00

Pos logit contributions
ovan+1.029
ould+1.005
 rapport+0.913
thinkable+0.901
 Conversation+0.895
Neg logit contributions
rag-1.509
-1.479
clerosis-1.361
hib-1.335
hod-1.325
Tokenbourne
Token ID12544
Feature activation-4.76e+00

Pos logit contributions
ovan+0.610
ould+0.580
 rapport+0.436
 Conversation+0.428
thinkable+0.428
Neg logit contributions
rag-3.166
-3.112
clerosis-2.944
hib-2.907
hod-2.894
Token-
Token ID12
Feature activation-1.25e+00

Pos logit contributions
ovan+2.993
ould+2.923
 rapport+2.651
thinkable+2.631
fleet+2.607
Neg logit contributions
rag-3.943
-3.841
clerosis-3.533
hib-3.456
 duplication-3.416
Token3000
Token ID23924
Feature activation-3.26e-01

Pos logit contributions
ovan+0.978
ould+0.962
 rapport+0.891
thinkable+0.888
 Conversation+0.886
Neg logit contributions
rag-0.842
-0.818
clerosis-0.735
hib-0.717
hod-0.711
Token,
Token ID11
Feature activation-1.70e+00

Pos logit contributions
ovan+0.192
ould+0.186
thinkable+0.166
 Conversation+0.165
 rapport+0.165
Neg logit contributions
rag-0.287
-0.284
clerosis-0.259
 duplication-0.253
hib-0.253
Token vic
Token ID20429
Feature activation-7.05e+00

Pos logit contributions
ovan+0.878
ould+0.851
 rapport+0.760
thinkable+0.747
 Conversation+0.742
Neg logit contributions
rag-1.605
-1.580
clerosis-1.464
hib-1.439
hod-1.427
Token
Token ID198
Feature activation-1.62e+00

Pos logit contributions
ovan+3.634
ould+3.525
thinkable+3.113
 rapport+3.031
 Conversation+2.964
Neg logit contributions
-6.506
rag-6.420
clerosis-5.872
 duplication-5.759
hod-5.698
Token
Token ID198
Feature activation-1.32e+00

Pos logit contributions
ovan+0.987
ould+0.969
 rapport+0.880
thinkable+0.874
 Conversation+0.869
Neg logit contributions
rag-1.386
-1.352
clerosis-1.244
hib-1.226
hod-1.216
TokenFirst
Token ID5962
Feature activation-3.05e-01

Pos logit contributions
ovan+1.226
ould+1.209
 rapport+1.138
 Conversation+1.134
thinkable+1.133
Neg logit contributions
rag-0.696
-0.670
clerosis-0.586
hib-0.566
hod-0.557
Token posted
Token ID4481
Feature activation-3.42e+00

Pos logit contributions
ovan+0.189
ould+0.176
thinkable+0.160
fleet+0.153
 Conversation+0.153
Neg logit contributions
rag-0.266
-0.261
clerosis-0.240
 duplication-0.237
alon-0.236
Token<|endoftext|>
Token ID50256
Feature activation+1.61e+00

Pos logit contributions
ovan+1.859
ould+1.829
 rapport+1.596
 Conversation+1.588
thinkable+1.586
Neg logit contributions
rag-3.129
-3.084
clerosis-2.848
hib-2.777
hod-2.768
Token Add
Token ID3060
Feature activation-3.03e-01

Pos logit contributions
ovan+1.276
ould+1.251
 rapport+1.146
thinkable+1.142
 Conversation+1.140
Neg logit contributions
rag-1.422
-1.385
clerosis-1.264
hib-1.236
hod-1.227
Tokenons
Token ID684
Feature activation-1.92e+00

Pos logit contributions
ovan+0.200
ould+0.197
 Conversation+0.178
 rapport+0.178
thinkable+0.177
Neg logit contributions
rag-0.244
-0.239
clerosis-0.219
 duplication-0.212
fare-0.211
Token
Token ID447
Feature activation+1.21e-01

Pos logit contributions
ovan+1.362
ould+1.339
 Conversation+1.229
 rapport+1.228
thinkable+1.223
Neg logit contributions
rag-1.433
-1.397
clerosis-1.272
hib-1.240
hod-1.233
Token
Token ID247
Feature activation-2.36e-01

Pos logit contributions
rag+0.120
+0.117
clerosis+0.108
hib+0.107
hod+0.107
Neg logit contributions
ovan-0.057
ould-0.055
thinkable-0.049
 rapport-0.049
 Conversation-0.048
Token ->
Token ID4613
Feature activation+4.30e-01

Pos logit contributions
ovan+0.167
ould+0.164
 rapport+0.150
thinkable+0.150
 Conversation+0.150
Neg logit contributions
rag-0.178
-0.173
clerosis-0.158
hib-0.154
hod-0.153
Token Netflix
Token ID12074
Feature activation-6.97e+00

Pos logit contributions
rag+0.342
+0.335
clerosis+0.304
hib+0.299
hod+0.297
Neg logit contributions
ovan-0.285
ould-0.280
 rapport-0.255
thinkable-0.255
 Conversation-0.252
Token.
Token ID13
Feature activation-7.86e-01

Pos logit contributions
ovan+4.718
ould+4.524
 Conversation+4.202
 rapport+4.127
thinkable+4.122
Neg logit contributions
rag-5.380
-5.239
clerosis-4.846
hib-4.674
hod-4.659
Token Finally
Token ID9461
Feature activation-8.96e-01

Pos logit contributions
ovan+0.537
ould+0.527
thinkable+0.488
 rapport+0.487
 Conversation+0.478
Neg logit contributions
rag-0.614
-0.592
clerosis-0.539
hib-0.531
hod-0.529
Token,
Token ID11
Feature activation+4.47e-02

Pos logit contributions
ovan+0.589
ould+0.577
 rapport+0.526
 Conversation+0.521
thinkable+0.521
Neg logit contributions
rag-0.720
-0.705
clerosis-0.646
hib-0.632
hod-0.628
Token sign
Token ID1051
Feature activation+3.28e-01

Pos logit contributions
rag+0.037
+0.036
clerosis+0.033
 duplication+0.032
hib+0.032
Neg logit contributions
ovan-0.028
ould-0.028
thinkable-0.025
 rapport-0.025
 Conversation-0.025
Token in
Token ID287
Feature activation-1.12e+00

Pos logit contributions
rag+0.257
+0.251
clerosis+0.229
hib+0.224
hod+0.223
Neg logit contributions
ovan-0.223
ould-0.218
 rapport-0.200
thinkable-0.199
 Conversation-0.198
Token Heritage
Token ID18518
Feature activation+3.03e+00

Pos logit contributions
rag+0.287
+0.281
clerosis+0.257
hib+0.251
hod+0.249
Neg logit contributions
ovan-0.234
ould-0.230
 rapport-0.211
 Conversation-0.209
thinkable-0.208
Token Foundation
Token ID5693
Feature activation+1.04e+00

Pos logit contributions
rag+2.171
+2.115
clerosis+1.912
hib+1.868
hod+1.854
Neg logit contributions
ovan-2.252
ould-2.215
 rapport-2.039
thinkable-2.034
 Conversation-2.025
Token,
Token ID11
Feature activation+1.73e-01

Pos logit contributions
rag+0.817
+0.797
clerosis+0.728
hib+0.713
hod+0.708
Neg logit contributions
ovan-0.701
ould-0.688
 rapport-0.629
thinkable-0.626
 Conversation-0.625
Token told
Token ID1297
Feature activation-1.97e+00

Pos logit contributions
rag+0.150
+0.147
clerosis+0.135
hib+0.133
hod+0.132
Neg logit contributions
ovan-0.103
ould-0.101
 rapport-0.091
thinkable-0.091
 Conversation-0.090
Token the
Token ID262
Feature activation-1.37e-01

Pos logit contributions
ovan+1.323
ould+1.303
 Conversation+1.213
 rapport+1.201
thinkable+1.177
Neg logit contributions
rag-1.533
-1.503
clerosis-1.360
hib-1.329
 duplication-1.321
Token Ou
Token ID42201
Feature activation-6.96e+00

Pos logit contributions
ovan+0.103
ould+0.101
 Conversation+0.094
 rapport+0.094
thinkable+0.093
Neg logit contributions
rag-0.098
-0.095
clerosis-0.086
hib-0.084
hod-0.083
Tokenach
Token ID620
Feature activation+9.38e-01

Pos logit contributions
ovan+4.834
ould+4.755
 rapport+4.441
 Conversation+4.332
thinkable+4.266
Neg logit contributions
-5.200
rag-5.112
clerosis-4.656
 duplication-4.493
hib-4.480
Tokenita
Token ID5350
Feature activation-2.59e-01

Pos logit contributions
rag+0.831
+0.766
alon+0.732
hib+0.728
alos+0.722
Neg logit contributions
ovan-0.517
ould-0.501
thinkable-0.492
 Conversation-0.492
 rapport-0.475
Token Citizen
Token ID22307
Feature activation+1.77e+00

Pos logit contributions
ovan+0.205
ould+0.201
 rapport+0.187
 Conversation+0.186
thinkable+0.186
Neg logit contributions
rag-0.174
-0.169
clerosis-0.152
hib-0.148
hod-0.146
Token that
Token ID326
Feature activation+1.41e-01

Pos logit contributions
rag+1.254
+1.220
clerosis+1.103
hib+1.077
hod+1.068
Neg logit contributions
ovan-1.327
ould-1.304
 rapport-1.204
thinkable-1.200
 Conversation-1.198
Token the
Token ID262
Feature activation+2.24e-01

Pos logit contributions
rag+0.108
+0.105
clerosis+0.096
hib+0.093
hod+0.093
Neg logit contributions
ovan-0.099
ould-0.097
 rapport-0.089
 Conversation-0.089
thinkable-0.088
Token already
Token ID1541
Feature activation-8.61e-02

Pos logit contributions
rag+0.174
+0.169
clerosis+0.155
hib+0.152
hod+0.151
Neg logit contributions
ovan-0.141
ould-0.138
 rapport-0.125
thinkable-0.125
 Conversation-0.125
Token Next
Token ID7406
Feature activation+1.54e-01

Pos logit contributions
ovan+0.061
ould+0.060
 rapport+0.055
thinkable+0.055
 Conversation+0.055
Neg logit contributions
rag-0.064
-0.063
clerosis-0.057
hib-0.056
hod-0.055
Token,
Token ID11
Feature activation+1.97e-01

Pos logit contributions
rag+0.120
+0.116
clerosis+0.106
hib+0.104
 duplication+0.104
Neg logit contributions
ovan-0.105
ould-0.103
thinkable-0.094
 rapport-0.094
 Conversation-0.093
Token download
Token ID4321
Feature activation-1.54e+00

Pos logit contributions
rag+0.156
+0.151
clerosis+0.138
 duplication+0.136
hib+0.136
Neg logit contributions
ovan-0.132
ould-0.129
thinkable-0.119
 rapport-0.118
 Conversation-0.117
Token the
Token ID262
Feature activation-1.99e-01

Pos logit contributions
ovan+0.971
ould+0.951
 rapport+0.865
 Conversation+0.859
thinkable+0.858
Neg logit contributions
rag-1.285
-1.254
clerosis-1.156
hib-1.131
hod-1.123
Token Netflix
Token ID12074
Feature activation-6.92e+00

Pos logit contributions
ovan+0.127
ould+0.125
thinkable+0.113
 rapport+0.113
 Conversation+0.112
Neg logit contributions
rag-0.164
-0.159
clerosis-0.146
hib-0.143
hod-0.143
Token plugin
Token ID13877
Feature activation-3.11e+00

Pos logit contributions
ovan+4.203
ould+4.033
 Conversation+3.694
 rapport+3.669
thinkable+3.595
Neg logit contributions
rag-5.802
-5.677
clerosis-5.315
hib-5.098
hod-5.074
Token zip
Token ID19974
Feature activation-1.82e+00

Pos logit contributions
ovan+2.170
ould+2.129
 rapport+1.954
thinkable+1.946
 Conversation+1.941
Neg logit contributions
rag-2.369
-2.311
clerosis-2.107
hib-2.061
hod-2.043
Token file
Token ID2393
Feature activation+3.00e+00

Pos logit contributions
ovan+1.409
ould+1.387
 rapport+1.281
thinkable+1.280
 Conversation+1.275
Neg logit contributions
rag-1.251
-1.213
clerosis-1.088
hib-1.066
hod-1.059
Token from
Token ID422
Feature activation+5.17e-01

Pos logit contributions
rag+2.178
+2.082
 duplication+1.878
clerosis+1.862
hod+1.849
Neg logit contributions
ovan-2.183
ould-2.147
thinkable-1.978
 rapport-1.950
 Conversation-1.942
Token here
Token ID994
Feature activation+6.83e-01

Pos logit contributions
rag+0.404
+0.393
clerosis+0.356
hib+0.351
hod+0.349
Neg logit contributions
ovan-0.352
ould-0.344
thinkable-0.316
 rapport-0.314
 Conversation-0.311
Token and
Token ID290
Feature activation+2.56e-01

Pos logit contributions
ovan+0.251
ould+0.248
 Conversation+0.226
thinkable+0.225
 rapport+0.225
Neg logit contributions
rag-0.287
-0.279
clerosis-0.255
hib-0.250
hod-0.248
Token making
Token ID1642
Feature activation-2.34e+00

Pos logit contributions
rag+0.203
+0.198
clerosis+0.181
 duplication+0.178
hib+0.178
Neg logit contributions
ovan-0.170
ould-0.167
thinkable-0.154
 Conversation-0.151
 rapport-0.149
Token it
Token ID340
Feature activation-3.49e-01

Pos logit contributions
ovan+1.756
ould+1.720
 rapport+1.598
 Conversation+1.581
thinkable+1.578
Neg logit contributions
rag-1.653
-1.605
clerosis-1.461
hib-1.420
hod-1.407
Token work
Token ID670
Feature activation+2.20e+00

Pos logit contributions
ovan+0.262
ould+0.258
 rapport+0.237
thinkable+0.237
 Conversation+0.237
Neg logit contributions
rag-0.249
-0.241
clerosis-0.218
hib-0.213
hod-0.211
Token at
Token ID379
Feature activation-2.02e-01

Pos logit contributions
rag+1.617
+1.546
clerosis+1.403
hib+1.391
hod+1.380
Neg logit contributions
ovan-1.592
ould-1.576
 Conversation-1.444
thinkable-1.442
 rapport-1.433
Token Netflix
Token ID12074
Feature activation-6.85e+00

Pos logit contributions
ovan+0.131
ould+0.129
 rapport+0.117
thinkable+0.117
 Conversation+0.116
Neg logit contributions
rag-0.165
-0.161
clerosis-0.147
hib-0.144
hod-0.144
Token,
Token ID11
Feature activation-4.48e-01

Pos logit contributions
ovan+4.446
ould+4.243
 rapport+3.939
 Conversation+3.937
thinkable+3.924
Neg logit contributions
rag-5.399
-5.335
clerosis-4.876
hod-4.769
hib-4.764
Token and
Token ID290
Feature activation+3.62e-01

Pos logit contributions
ovan+0.315
ould+0.310
thinkable+0.284
 rapport+0.284
 Conversation+0.282
Neg logit contributions
rag-0.339
-0.330
clerosis-0.300
hib-0.294
hod-0.292
Token posted
Token ID4481
Feature activation-3.52e+00

Pos logit contributions
rag+0.277
+0.269
clerosis+0.244
hib+0.239
 duplication+0.239
Neg logit contributions
ovan-0.253
ould-0.249
thinkable-0.229
 rapport-0.225
 Conversation-0.225
Token some
Token ID617
Feature activation-1.68e+00

Pos logit contributions
ovan+2.158
ould+2.107
 rapport+1.964
 Conversation+1.940
fleet+1.895
Neg logit contributions
rag-2.936
-2.870
clerosis-2.666
hib-2.582
hod-2.568
Token of
Token ID286
Feature activation+4.94e-01

Pos logit contributions
ovan+1.096
ould+1.072
 rapport+0.980
 Conversation+0.972
thinkable+0.972
Neg logit contributions
rag-1.360
-1.329
clerosis-1.218
hib-1.193
hod-1.185
Token Alberta
Token ID15555
Feature activation-6.81e-01

Pos logit contributions
rag+1.500
+1.461
clerosis+1.341
 duplication+1.327
hib+1.317
Neg logit contributions
ovan-1.109
ould-1.100
 Conversation-0.981
thinkable-0.974
 rapport-0.970
Token.
Token ID13
Feature activation+1.09e-01

Pos logit contributions
ovan+0.482
ould+0.474
 rapport+0.434
thinkable+0.433
 Conversation+0.432
Neg logit contributions
rag-0.514
-0.500
clerosis-0.455
hib-0.445
hod-0.442
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
rag+0.096
+0.094
clerosis+0.087
hib+0.085
 duplication+0.085
Neg logit contributions
ovan-0.064
ould-0.062
 Conversation-0.056
 rapport-0.056
thinkable-0.056
Token
Token ID198
Feature activation-2.50e-01

Pos logit contributions
ovan+0.336
ould+0.330
 rapport+0.302
thinkable+0.299
 Conversation+0.298
Neg logit contributions
rag-0.451
-0.440
clerosis-0.403
hib-0.398
hod-0.395
TokenHis
Token ID6653
Feature activation+5.28e-01

Pos logit contributions
ovan+0.195
ould+0.191
 rapport+0.177
thinkable+0.177
 Conversation+0.176
Neg logit contributions
rag-0.170
-0.166
clerosis-0.149
hib-0.145
hod-0.144
Token sentence
Token ID6827
Feature activation-6.83e+00

Pos logit contributions
rag+0.392
+0.381
clerosis+0.350
 duplication+0.342
hib+0.340
Neg logit contributions
ovan-0.376
ould-0.369
thinkable-0.338
fleet-0.336
 Conversation-0.334
Token is
Token ID318
Feature activation-6.62e-01

Pos logit contributions
ovan+4.342
ould+4.212
 rapport+3.986
 Conversation+3.812
thinkable+3.789
Neg logit contributions
rag-5.473
-5.292
clerosis-4.859
hod-4.793
hib-4.789
Token to
Token ID284
Feature activation-5.39e-01

Pos logit contributions
ovan+0.456
ould+0.450
thinkable+0.410
 Conversation+0.407
 rapport+0.406
Neg logit contributions
rag-0.509
-0.494
clerosis-0.450
hib-0.444
 duplication-0.443
Token expire
Token ID24264
Feature activation-1.35e-01

Pos logit contributions
ovan+0.307
ould+0.299
thinkable+0.272
 Conversation+0.270
 rapport+0.266
Neg logit contributions
rag-0.478
-0.469
clerosis-0.431
hib-0.427
fare-0.426
Token in
Token ID287
Feature activation-8.12e-01

Pos logit contributions
ovan+0.101
ould+0.099
thinkable+0.092
 rapport+0.091
 Conversation+0.091
Neg logit contributions
rag-0.097
-0.094
clerosis-0.085
hib-0.083
hod-0.082
Token 2018
Token ID2864
Feature activation-1.50e+00

Pos logit contributions
ovan+0.633
ould+0.624
thinkable+0.577
 rapport+0.572
fleet+0.570
Neg logit contributions
rag-0.551
-0.536
clerosis-0.481
hib-0.473
 duplication-0.470
Token
Token ID198
Feature activation-1.94e-01

Pos logit contributions
ovan+0.386
ould+0.378
 rapport+0.345
thinkable+0.339
 Conversation+0.339
Neg logit contributions
rag-0.542
-0.531
clerosis-0.485
hib-0.480
hod-0.476
TokenCheck
Token ID9787
Feature activation+5.98e-01

Pos logit contributions
ovan+0.145
ould+0.143
 rapport+0.133
 Conversation+0.132
thinkable+0.132
Neg logit contributions
rag-0.139
-0.135
clerosis-0.120
hib-0.119
hod-0.118
Tokenout
Token ID448
Feature activation-2.56e-01

Pos logit contributions
rag+0.501
+0.489
clerosis+0.446
hib+0.441
hod+0.439
Neg logit contributions
ovan-0.375
ould-0.365
thinkable-0.331
 rapport-0.330
 Conversation-0.326
Token Che
Token ID2580
Feature activation+6.20e-01

Pos logit contributions
ovan+0.161
ould+0.158
 rapport+0.143
thinkable+0.143
 Conversation+0.141
Neg logit contributions
rag-0.213
-0.208
clerosis-0.190
hib-0.187
hod-0.186
Tokenapest
Token ID35746
Feature activation-1.27e-01

Pos logit contributions
rag+0.420
+0.405
clerosis+0.364
hib+0.358
hod+0.353
Neg logit contributions
ovan-0.487
ould-0.478
 rapport-0.445
thinkable-0.444
 Conversation-0.443
Token Netflix
Token ID12074
Feature activation-6.72e+00

Pos logit contributions
ovan+0.070
ould+0.068
 rapport+0.061
thinkable+0.061
 Conversation+0.060
Neg logit contributions
rag-0.117
-0.114
clerosis-0.105
hib-0.104
hod-0.103
Token VPN
Token ID21669
Feature activation-2.80e+00

Pos logit contributions
ovan+2.985
ould+2.831
 Conversation+2.573
 rapport+2.477
thinkable+2.447
Neg logit contributions
rag-6.719
-6.646
clerosis-6.218
hib-6.002
hod-6.002
Tokens
Token ID82
Feature activation-1.90e+00

Pos logit contributions
ovan+1.912
ould+1.875
 rapport+1.715
thinkable+1.709
 Conversation+1.707
Neg logit contributions
rag-2.174
-2.122
clerosis-1.938
hib-1.894
hod-1.880
Token!
Token ID0
Feature activation+6.81e-01

Pos logit contributions
ovan+1.299
ould+1.275
 rapport+1.167
thinkable+1.162
 Conversation+1.159
Neg logit contributions
rag-1.475
-1.439
clerosis-1.313
hib-1.286
hod-1.276
Token
Token ID198
Feature activation+1.89e-03

Pos logit contributions
rag+0.568
+0.554
clerosis+0.507
hib+0.499
hod+0.496
Neg logit contributions
ovan-0.428
ould-0.421
 rapport-0.381
thinkable-0.380
 Conversation-0.375
Token
Token ID198
Feature activation+2.83e-01

Pos logit contributions
rag+0.002
+0.002
clerosis+0.001
hib+0.001
hod+0.001
Neg logit contributions
ovan-0.001
ould-0.001
 rapport-0.001
thinkable-0.001
 Conversation-0.001
Token present
Token ID1944
Feature activation+9.85e-01

Pos logit contributions
rag+0.121
+0.118
clerosis+0.105
hib+0.103
hod+0.102
Neg logit contributions
ovan-0.152
ould-0.149
 rapport-0.139
thinkable-0.138
 Conversation-0.138
Token -
Token ID532
Feature activation-1.57e+00

Pos logit contributions
rag+0.781
+0.778
clerosis+0.707
hib+0.689
hod+0.684
Neg logit contributions
ovan-0.662
ould-0.648
 rapport-0.594
 Conversation-0.582
thinkable-0.580
Token could
Token ID714
Feature activation-4.35e-01

Pos logit contributions
ovan+1.052
ould+1.033
 rapport+0.943
thinkable+0.939
 Conversation+0.936
Neg logit contributions
rag-1.243
-1.213
clerosis-1.109
hib-1.086
hod-1.078
Token face
Token ID1986
Feature activation+2.90e-02

Pos logit contributions
ovan+0.313
ould+0.307
 rapport+0.282
thinkable+0.281
 Conversation+0.281
Neg logit contributions
rag-0.323
-0.315
clerosis-0.286
hib-0.280
hod-0.277
Token jail
Token ID7356
Feature activation-1.99e+00

Pos logit contributions
rag+0.024
+0.023
clerosis+0.022
hib+0.021
 duplication+0.021
Neg logit contributions
ovan-0.018
ould-0.018
thinkable-0.016
 rapport-0.016
 Conversation-0.016
Token sentences
Token ID13439
Feature activation-6.66e+00

Pos logit contributions
ovan+1.449
ould+1.424
 rapport+1.310
thinkable+1.305
 Conversation+1.301
Neg logit contributions
rag-1.463
-1.426
clerosis-1.294
hib-1.264
hod-1.254
Token if
Token ID611
Feature activation+7.52e-01

Pos logit contributions
ovan+4.162
ould+4.113
 rapport+3.776
thinkable+3.727
 Conversation+3.689
Neg logit contributions
-5.346
rag-5.294
clerosis-4.834
hib-4.751
hod-4.640
Token found
Token ID1043
Feature activation-2.10e-01

Pos logit contributions
rag+0.619
+0.599
clerosis+0.550
 duplication+0.540
hod+0.540
Neg logit contributions
ovan-0.479
ould-0.473
thinkable-0.428
 Conversation-0.426
 rapport-0.422
Token guilty
Token ID6717
Feature activation-1.43e+00

Pos logit contributions
ovan+0.155
ould+0.152
 rapport+0.140
thinkable+0.139
 Conversation+0.139
Neg logit contributions
rag-0.153
-0.149
clerosis-0.135
hib-0.132
hod-0.130
Token of
Token ID286
Feature activation-9.32e-01

Pos logit contributions
ovan+0.960
ould+0.945
 rapport+0.868
thinkable+0.854
 Conversation+0.850
Neg logit contributions
rag-1.119
-1.109
clerosis-1.005
hib-0.983
hod-0.972
Token any
Token ID597
Feature activation-5.93e-01

Pos logit contributions
ovan+0.632
ould+0.618
 rapport+0.570
 Conversation+0.561
thinkable+0.561
Neg logit contributions
rag-0.731
-0.717
clerosis-0.653
hib-0.638
hod-0.632
Token 17
Token ID1596
Feature activation-5.44e-01

Pos logit contributions
ovan+3.566
ould+3.442
 Conversation+3.151
 rapport+3.136
thinkable+3.095
Neg logit contributions
rag-4.487
-4.448
clerosis-4.071
hib-3.905
hod-3.889
Token Krypt
Token ID44679
Feature activation+6.13e-02

Pos logit contributions
ovan+0.383
ould+0.376
 rapport+0.345
thinkable+0.344
 Conversation+0.342
Neg logit contributions
rag-0.413
-0.402
clerosis-0.366
hib-0.359
hod-0.355
Tokenon
Token ID261
Feature activation-7.33e-01

Pos logit contributions
rag+0.041
+0.040
clerosis+0.036
hib+0.035
hod+0.035
Neg logit contributions
ovan-0.049
ould-0.048
 rapport-0.044
thinkable-0.044
 Conversation-0.044
Token addon
Token ID48557
Feature activation-5.29e+00

Pos logit contributions
ovan+0.488
ould+0.482
thinkable+0.442
 rapport+0.442
 Conversation+0.436
Neg logit contributions
rag-0.578
-0.560
clerosis-0.512
hib-0.506
hod-0.502
Token.?
Token ID40791
Feature activation-1.02e+00

Pos logit contributions
ovan+3.354
ould+3.254
 Conversation+2.994
 rapport+2.983
thinkable+2.943
Neg logit contributions
rag-4.302
-4.264
clerosis-3.900
hib-3.794
hod-3.771
Token Netflix
Token ID12074
Feature activation-6.65e+00

Pos logit contributions
ovan+0.626
ould+0.612
 rapport+0.555
thinkable+0.553
 Conversation+0.547
Neg logit contributions
rag-0.863
-0.843
clerosis-0.774
hib-0.761
hod-0.756
Token is
Token ID318
Feature activation-6.17e-01

Pos logit contributions
ovan+3.680
ould+3.571
 Conversation+3.229
 rapport+3.172
thinkable+3.123
Neg logit contributions
rag-5.914
-5.811
clerosis-5.414
hib-5.254
hod-5.213
Token probably
Token ID2192
Feature activation-2.88e-01

Pos logit contributions
ovan+0.411
ould+0.405
thinkable+0.371
 rapport+0.370
 Conversation+0.366
Neg logit contributions
rag-0.489
-0.476
clerosis-0.433
hib-0.429
hod-0.425
Token the
Token ID262
Feature activation-4.26e-01

Pos logit contributions
ovan+0.188
ould+0.184
thinkable+0.169
 rapport+0.168
 Conversation+0.167
Neg logit contributions
rag-0.233
-0.226
clerosis-0.206
hib-0.204
hod-0.202
Token best
Token ID1266
Feature activation+3.38e-01

Pos logit contributions
ovan+0.302
ould+0.296
thinkable+0.272
 rapport+0.271
 Conversation+0.269
Neg logit contributions
rag-0.321
-0.313
clerosis-0.283
hib-0.279
hod-0.277
Token streaming
Token ID11305
Feature activation-2.39e+00

Pos logit contributions
rag+0.309
+0.302
clerosis+0.276
hib+0.275
 duplication+0.273
Neg logit contributions
ovan-0.185
ould-0.182
thinkable-0.163
 rapport-0.161
 Conversation-0.160
Token."
Token ID526
Feature activation-4.66e-02

Pos logit contributions
ovan+0.318
ould+0.313
thinkable+0.287
 rapport+0.286
 Conversation+0.286
Neg logit contributions
rag-0.329
-0.319
clerosis-0.289
hib-0.285
hod-0.283
Token<|endoftext|>
Token ID50256
Feature activation+1.80e+00

Pos logit contributions
ovan+0.027
ould+0.027
 Conversation+0.024
 rapport+0.024
thinkable+0.024
Neg logit contributions
rag-0.041
-0.040
clerosis-0.037
hib-0.036
 duplication-0.036
TokenAnonymous
Token ID20660
Feature activation+1.09e+00

Pos logit contributions
rag+1.252
+1.217
clerosis+1.093
hib+1.072
hod+1.064
Neg logit contributions
ovan-1.373
ould-1.353
 rapport-1.256
 Conversation-1.245
thinkable-1.244
Token asked
Token ID1965
Feature activation+7.03e-02

Pos logit contributions
rag+0.847
+0.825
clerosis+0.755
hib+0.739
hod+0.734
Neg logit contributions
ovan-0.749
ould-0.736
thinkable-0.671
 rapport-0.670
 Conversation-0.663
Token:
Token ID25
Feature activation+1.65e+00

Pos logit contributions
rag+0.053
+0.052
clerosis+0.047
hib+0.046
hod+0.046
Neg logit contributions
ovan-0.050
ould-0.049
 rapport-0.045
 Conversation-0.045
thinkable-0.045
Token AU
Token ID27548
Feature activation-6.64e+00

Pos logit contributions
rag+1.324
+1.288
clerosis+1.180
hib+1.161
hod+1.152
Neg logit contributions
ovan-1.090
ould-1.065
thinkable-0.975
 rapport-0.973
 Conversation-0.959
Token idea
Token ID2126
Feature activation-4.41e-01

Pos logit contributions
ovan+4.744
ould+4.732
 Conversation+4.334
 rapport+4.290
fleet+4.252
Neg logit contributions
rag-4.750
-4.675
clerosis-4.231
hib-4.062
hod-4.027
Token:
Token ID25
Feature activation+1.28e+00

Pos logit contributions
ovan+0.303
ould+0.297
 rapport+0.272
thinkable+0.272
 Conversation+0.270
Neg logit contributions
rag-0.341
-0.332
clerosis-0.303
hib-0.297
hod-0.295
Token slightly
Token ID4622
Feature activation+1.27e-01

Pos logit contributions
rag+1.003
+0.975
clerosis+0.892
hib+0.878
 duplication+0.871
Neg logit contributions
ovan-0.860
ould-0.840
thinkable-0.773
 rapport-0.768
 Conversation-0.759
Token cl
Token ID537
Feature activation-1.73e-02

Pos logit contributions
rag+0.098
+0.095
clerosis+0.087
hib+0.085
hod+0.084
Neg logit contributions
ovan-0.088
ould-0.086
 rapport-0.079
thinkable-0.079
 Conversation-0.078
Tokeniche
Token ID14234
Feature activation+2.11e+00

Pos logit contributions
ovan+0.013
ould+0.013
 rapport+0.012
thinkable+0.012
 Conversation+0.012
Neg logit contributions
rag-0.012
-0.012
clerosis-0.011
hib-0.010
hod-0.010
Token Maryland
Token ID10769
Feature activation+1.53e+00

Pos logit contributions
rag+1.276
+1.245
clerosis+1.143
hib+1.117
hod+1.110
Neg logit contributions
ovan-0.906
ould-0.887
 rapport-0.808
 Conversation-0.798
thinkable-0.796
Token for
Token ID329
Feature activation+3.09e-01

Pos logit contributions
rag+1.121
+1.091
clerosis+0.990
hib+0.968
hod+0.960
Neg logit contributions
ovan-1.115
ould-1.099
 rapport-1.010
thinkable-1.007
 Conversation-1.005
Token many
Token ID867
Feature activation+6.90e-01

Pos logit contributions
rag+0.208
+0.202
clerosis+0.181
hib+0.176
hod+0.175
Neg logit contributions
ovan-0.245
ould-0.240
 rapport-0.223
thinkable-0.222
 Conversation-0.222
Token years
Token ID812
Feature activation-9.35e-01

Pos logit contributions
rag+0.381
+0.368
clerosis+0.324
hib+0.309
hod+0.309
Neg logit contributions
ovan-0.627
ould-0.617
 rapport-0.583
thinkable-0.576
 Conversation-0.575
Token.
Token ID13
Feature activation+2.64e-01

Pos logit contributions
ovan+0.625
ould+0.609
 rapport+0.560
thinkable+0.555
 Conversation+0.548
Neg logit contributions
rag-0.743
-0.729
clerosis-0.663
hod-0.647
hib-0.646
Token Parents
Token ID28231
Feature activation+4.32e+00

Pos logit contributions
rag+0.213
+0.210
clerosis+0.191
hib+0.188
hod+0.186
Neg logit contributions
ovan-0.171
ould-0.169
thinkable-0.154
 rapport-0.154
 Conversation-0.149
Token who
Token ID508
Feature activation+3.93e-01

Pos logit contributions
rag+3.162
+3.011
clerosis+2.794
hib+2.732
fare+2.722
Neg logit contributions
ovan-3.098
ould-3.067
thinkable-2.817
 Conversation-2.786
fleet-2.772
Token complain
Token ID13121
Feature activation+3.22e+00

Pos logit contributions
rag+0.297
+0.287
clerosis+0.262
hib+0.257
 duplication+0.255
Neg logit contributions
ovan-0.276
ould-0.273
thinkable-0.249
 Conversation-0.249
 rapport-0.248
Token they
Token ID484
Feature activation-1.17e+00

Pos logit contributions
rag+2.517
+2.415
clerosis+2.233
hib+2.193
fare+2.171
Neg logit contributions
ovan-2.178
ould-2.128
 Conversation-1.950
thinkable-1.947
 rapport-1.938
Token were
Token ID547
Feature activation-1.31e+00

Pos logit contributions
ovan+0.759
ould+0.744
 rapport+0.677
thinkable+0.675
 Conversation+0.673
Neg logit contributions
rag-0.952
-0.929
clerosis-0.851
hib-0.835
hod-0.829
Token unaware
Token ID17261
Feature activation-2.70e+00

Pos logit contributions
ovan+0.920
ould+0.904
 rapport+0.828
thinkable+0.826
 Conversation+0.824
Neg logit contributions
rag-0.990
-0.964
clerosis-0.877
hib-0.859
hod-0.853
Token
Token ID247
Feature activation+4.11e-01

Pos logit contributions
rag+0.095
+0.093
clerosis+0.084
hib+0.083
hod+0.083
Neg logit contributions
ovan-0.064
ould-0.062
 rapport-0.058
thinkable-0.057
 Conversation-0.057
Tokent
Token ID83
Feature activation+8.21e-01

Pos logit contributions
rag+0.240
+0.229
clerosis+0.202
hib+0.199
hod+0.196
Neg logit contributions
ovan-0.360
ould-0.353
thinkable-0.335
 rapport-0.335
 Conversation-0.333
Token one
Token ID530
Feature activation-1.32e+00

Pos logit contributions
rag+0.671
+0.655
clerosis+0.601
hib+0.589
hod+0.584
Neg logit contributions
ovan-0.529
ould-0.519
 rapport-0.472
thinkable-0.470
 Conversation-0.469
Token of
Token ID286
Feature activation+1.71e+00

Pos logit contributions
ovan+0.767
ould+0.746
 rapport+0.675
 Conversation+0.671
thinkable+0.664
Neg logit contributions
rag-1.159
-1.136
clerosis-1.056
hib-1.031
hod-1.020
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
rag+1.458
+1.425
clerosis+1.308
hib+1.288
hod+1.280
Neg logit contributions
ovan-1.035
ould-1.012
 rapport-0.914
thinkable-0.913
 Conversation-0.908
Token bureau
Token ID10583
Feature activation+4.04e+00

Pos logit contributions
rag+1.036
+1.009
clerosis+0.920
hib+0.905
hod+0.899
Neg logit contributions
ovan-0.876
ould-0.860
thinkable-0.784
 rapport-0.781
 Conversation-0.777
Token
Token ID447
Feature activation+7.89e-01

Pos logit contributions
rag+2.792
+2.698
hib+2.393
clerosis+2.389
hod+2.385
Neg logit contributions
ovan-3.084
ould-3.008
thinkable-2.810
 Conversation-2.795
 rapport-2.788
Token
Token ID247
Feature activation+9.03e-01

Pos logit contributions
rag+0.760
+0.749
hib+0.677
clerosis+0.672
hod+0.672
Neg logit contributions
ovan-0.397
ould-0.377
thinkable-0.354
 rapport-0.351
 Conversation-0.342
Tokens
Token ID82
Feature activation-2.11e-01

Pos logit contributions
rag+0.687
+0.672
clerosis+0.613
hib+0.597
hod+0.592
Neg logit contributions
ovan-0.632
ould-0.623
 rapport-0.568
 Conversation-0.564
thinkable-0.564
Token many
Token ID867
Feature activation+1.88e+00

Pos logit contributions
ovan+0.151
ould+0.148
 rapport+0.136
 Conversation+0.135
thinkable+0.135
Neg logit contributions
rag-0.158
-0.154
clerosis-0.140
hib-0.137
hod-0.135
Token tech
Token ID7261
Feature activation-1.51e-01

Pos logit contributions
rag+1.425
+1.389
clerosis+1.260
hib+1.239
hod+1.234
Neg logit contributions
ovan-1.318
ould-1.290
thinkable-1.188
 Conversation-1.179
 rapport-1.178
Tokeno
Token ID78
Feature activation-3.53e-01

Pos logit contributions
rag+0.918
+0.879
clerosis+0.788
hod+0.769
hib+0.768
Neg logit contributions
ovan-1.206
ould-1.193
 rapport-1.112
thinkable-1.110
 Conversation-1.107
Token,
Token ID11
Feature activation-6.24e-01

Pos logit contributions
ovan+0.236
ould+0.232
 rapport+0.211
thinkable+0.210
 Conversation+0.210
Neg logit contributions
rag-0.279
-0.273
clerosis-0.249
hib-0.244
hod-0.242
Token who
Token ID508
Feature activation-4.58e-01

Pos logit contributions
ovan+0.441
ould+0.431
 rapport+0.399
 Conversation+0.395
thinkable+0.394
Neg logit contributions
rag-0.471
-0.460
clerosis-0.418
hib-0.408
hod-0.405
Token lives
Token ID3160
Feature activation-1.02e+00

Pos logit contributions
ovan+0.287
ould+0.281
 rapport+0.254
thinkable+0.253
 Conversation+0.253
Neg logit contributions
rag-0.384
-0.375
clerosis-0.344
hib-0.338
hod-0.336
Token in
Token ID287
Feature activation-7.21e-01

Pos logit contributions
ovan+0.666
ould+0.655
 rapport+0.601
thinkable+0.598
 Conversation+0.595
Neg logit contributions
rag-0.816
-0.797
clerosis-0.735
hib-0.710
hod-0.706
Token Lomb
Token ID28503
Feature activation+5.43e+00

Pos logit contributions
ovan+0.472
ould+0.462
 Conversation+0.425
 rapport+0.423
thinkable+0.421
Neg logit contributions
rag-0.580
-0.570
clerosis-0.523
hib-0.509
hod-0.504
Tokenard
Token ID446
Feature activation+1.95e+00

Pos logit contributions
rag+4.127
+3.901
clerosis+3.645
hib+3.567
hod+3.537
Neg logit contributions
ovan-3.757
ould-3.641
thinkable-3.422
 rapport-3.409
 Conversation-3.371
Token,
Token ID11
Feature activation+3.52e-03

Pos logit contributions
rag+1.464
+1.418
clerosis+1.291
hib+1.267
hod+1.257
Neg logit contributions
ovan-1.375
ould-1.348
 rapport-1.245
thinkable-1.241
 Conversation-1.231
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
rag+0.003
+0.003
clerosis+0.003
hib+0.003
hod+0.003
Neg logit contributions
ovan-0.002
ould-0.002
thinkable-0.002
 rapport-0.002
 Conversation-0.002
Token by
Token ID416
Feature activation-8.86e-01

Pos logit contributions
ovan+0.722
ould+0.708
 rapport+0.648
thinkable+0.645
 Conversation+0.643
Neg logit contributions
rag-0.822
-0.801
clerosis-0.731
hib-0.716
hod-0.711
Token chance
Token ID2863
Feature activation+1.52e+00

Pos logit contributions
ovan+0.477
ould+0.465
 rapport+0.415
thinkable+0.413
 Conversation+0.412
Neg logit contributions
rag-0.817
-0.801
clerosis-0.743
hib-0.729
hod-0.725
Token Fle
Token ID12005
Feature activation+2.11e+00

Pos logit contributions
rag+1.957
+1.908
clerosis+1.734
hib+1.707
hod+1.704
Neg logit contributions
ovan-1.606
ould-1.569
thinkable-1.440
 rapport-1.436
 Conversation-1.408
Tokenw
Token ID86
Feature activation+1.59e+00

Pos logit contributions
rag+1.297
+1.244
clerosis+1.123
hib+1.101
hod+1.089
Neg logit contributions
ovan-1.781
ould-1.739
 Conversation-1.631
 rapport-1.631
thinkable-1.627
Token,
Token ID11
Feature activation-7.38e-01

Pos logit contributions
rag+1.269
+1.239
clerosis+1.133
hib+1.115
hod+1.104
Neg logit contributions
ovan-1.053
ould-1.032
 rapport-0.940
thinkable-0.938
 Conversation-0.925
Token
Token ID447
Feature activation+2.07e-01

Pos logit contributions
ovan+0.538
ould+0.530
 Conversation+0.489
 rapport+0.488
thinkable+0.485
Neg logit contributions
rag-0.538
-0.523
clerosis-0.476
hib-0.463
hod-0.462
Token
Token ID251
Feature activation+9.31e-01

Pos logit contributions
rag+0.192
+0.188
clerosis+0.176
hib+0.172
 duplication+0.171
Neg logit contributions
ovan-0.109
ould-0.107
 rapport-0.095
thinkable-0.095
 Conversation-0.094
Token Edward
Token ID10443
Feature activation+4.26e+00

Pos logit contributions
rag+0.716
+0.699
clerosis+0.637
hib+0.624
hod+0.619
Neg logit contributions
ovan-0.644
ould-0.632
 rapport-0.578
thinkable-0.577
 Conversation-0.573
Token Air
Token ID3701
Feature activation-1.68e+00

Pos logit contributions
rag+3.319
+3.236
clerosis+2.960
hib+2.900
hod+2.875
Neg logit contributions
ovan-2.874
ould-2.815
 rapport-2.591
thinkable-2.577
 Conversation-2.528
Token Force
Token ID5221
Feature activation-3.16e+00

Pos logit contributions
ovan+0.653
ould+0.630
 Conversation+0.538
 rapport+0.535
fleet+0.534
Neg logit contributions
rag-1.786
-1.763
clerosis-1.644
hod-1.622
hib-1.611
Token Base
Token ID7308
Feature activation-5.46e-01

Pos logit contributions
ovan+1.981
ould+1.934
 Conversation+1.803
 rapport+1.779
thinkable+1.746
Neg logit contributions
rag-2.612
-2.548
clerosis-2.339
hod-2.274
hib-2.266
Token.
Token ID13
Feature activation+3.64e-01

Pos logit contributions
ovan+0.368
ould+0.361
 Conversation+0.333
 rapport+0.332
thinkable+0.330
Neg logit contributions
rag-0.428
-0.418
clerosis-0.383
hod-0.372
hib-0.372
Token http
Token ID2638
Feature activation+1.19e-01

Pos logit contributions
rag+0.271
+0.263
clerosis+0.240
hib+0.234
hod+0.232
Neg logit contributions
ovan-0.262
ould-0.257
 rapport-0.236
 Conversation-0.236
thinkable-0.235
Tokenor
Token ID273
Feature activation-2.01e+00

Pos logit contributions
ovan+0.657
ould+0.644
 rapport+0.591
thinkable+0.589
 Conversation+0.588
Neg logit contributions
rag-0.673
-0.656
clerosis-0.596
hib-0.582
hod-0.578
Token Academy
Token ID8581
Feature activation+6.99e-01

Pos logit contributions
ovan+1.399
ould+1.365
 rapport+1.253
 Conversation+1.251
thinkable+1.243
Neg logit contributions
rag-1.534
-1.499
clerosis-1.362
hib-1.332
hod-1.328
Token
Token ID447
Feature activation-1.76e-05

Pos logit contributions
rag+0.526
+0.512
clerosis+0.466
hib+0.457
hod+0.452
Neg logit contributions
ovan-0.497
ould-0.487
 rapport-0.447
thinkable-0.446
 Conversation-0.444
Token
Token ID251
Feature activation-5.78e-02

Pos logit contributions
ovan+0.000
ould+0.000
 rapport+0.000
 Conversation+0.000
thinkable+0.000
Neg logit contributions
rag-0.000
-0.000
clerosis-0.000
hib-0.000
hod-0.000
Token in
Token ID287
Feature activation-6.71e-01

Pos logit contributions
ovan+0.041
ould+0.040
 rapport+0.037
thinkable+0.037
 Conversation+0.037
Neg logit contributions
rag-0.044
-0.042
clerosis-0.039
hib-0.038
hod-0.037
Token Garden
Token ID11652
Feature activation+4.05e+00

Pos logit contributions
ovan+0.498
ould+0.489
 rapport+0.451
thinkable+0.449
 Conversation+0.448
Neg logit contributions
rag-0.484
-0.471
clerosis-0.426
hib-0.416
hod-0.413
Token Valley
Token ID6916
Feature activation+9.35e-01

Pos logit contributions
rag+2.333
+2.225
clerosis+1.948
hib+1.927
hod+1.882
Neg logit contributions
ovan-3.584
ould-3.522
 rapport-3.308
thinkable-3.305
 Conversation-3.279
Token,
Token ID11
Feature activation+1.27e-02

Pos logit contributions
rag+0.711
+0.693
clerosis+0.631
hib+0.618
hod+0.613
Neg logit contributions
ovan-0.656
ould-0.643
 rapport-0.590
thinkable-0.588
 Conversation-0.586
Token Tex
Token ID3567
Feature activation+2.30e+00

Pos logit contributions
rag+0.009
+0.008
clerosis+0.008
hib+0.007
hod+0.007
Neg logit contributions
ovan-0.010
ould-0.010
thinkable-0.009
 rapport-0.009
 Conversation-0.009
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
rag+1.884
+1.858
clerosis+1.695
 duplication+1.659
hib+1.654
Neg logit contributions
ovan-1.489
ould-1.466
 rapport-1.309
 Conversation-1.303
thinkable-1.301
Token
Token ID198
Feature activation-2.97e-01

Pos logit contributions
rag+0.252
+0.246
clerosis+0.225
hib+0.221
hod+0.219
Neg logit contributions
ovan-0.201
ould-0.197
 rapport-0.179
thinkable-0.179
 Conversation-0.177
Token historic
Token ID9566
Feature activation+2.13e-01

Pos logit contributions
rag+0.100
+0.097
clerosis+0.088
hib+0.087
hod+0.086
Neg logit contributions
ovan-0.090
ould-0.088
 rapport-0.081
thinkable-0.081
 Conversation-0.080
Token town
Token ID3240
Feature activation+9.33e-01

Pos logit contributions
rag+0.163
+0.159
clerosis+0.145
hib+0.143
 duplication+0.141
Neg logit contributions
ovan-0.147
ould-0.144
thinkable-0.132
 Conversation-0.131
 rapport-0.131
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
rag+0.692
+0.673
clerosis+0.610
hib+0.600
hod+0.595
Neg logit contributions
ovan-0.671
ould-0.659
 rapport-0.605
thinkable-0.603
 Conversation-0.602
Token Like
Token ID4525
Feature activation-3.41e-02

Pos logit contributions
rag+0.115
+0.113
clerosis+0.102
hib+0.102
hod+0.100
Neg logit contributions
ovan-0.098
ould-0.097
thinkable-0.089
 rapport-0.088
fleet-0.086
Token maybe
Token ID3863
Feature activation-7.60e-01

Pos logit contributions
ovan+0.023
ould+0.023
thinkable+0.021
 rapport+0.021
 Conversation+0.021
Neg logit contributions
rag-0.027
-0.026
clerosis-0.024
hib-0.023
hod-0.023
Token it
Token ID340
Feature activation+2.67e-01

Pos logit contributions
ovan+0.504
ould+0.494
 rapport+0.450
thinkable+0.449
 Conversation+0.447
Neg logit contributions
rag-0.607
-0.592
clerosis-0.542
hib-0.531
hod-0.527
Token is
Token ID318
Feature activation+6.62e-02

Pos logit contributions
rag+0.197
+0.192
clerosis+0.174
hib+0.171
hod+0.169
Neg logit contributions
ovan-0.194
ould-0.190
 rapport-0.175
thinkable-0.174
 Conversation-0.174
Token not
Token ID407
Feature activation+1.76e+00

Pos logit contributions
rag+0.052
+0.050
clerosis+0.046
hib+0.045
hod+0.045
Neg logit contributions
ovan-0.045
ould-0.044
thinkable-0.040
 rapport-0.040
 Conversation-0.040
Token a
Token ID257
Feature activation+4.86e-01

Pos logit contributions
rag+1.392
+1.344
clerosis+1.226
hib+1.214
hod+1.210
Neg logit contributions
ovan-1.184
ould-1.164
thinkable-1.051
 Conversation-1.049
fleet-1.043
Token good
Token ID922
Feature activation-1.53e+00

Pos logit contributions
rag+0.377
+0.367
clerosis+0.335
hib+0.329
hod+0.327
Neg logit contributions
ovan-0.333
ould-0.326
thinkable-0.298
 rapport-0.296
 Conversation-0.295
Token idea
Token ID2126
Feature activation-8.07e-01

Pos logit contributions
ovan+0.900
ould+0.880
 rapport+0.794
thinkable+0.788
 Conversation+0.786
Neg logit contributions
rag-1.344
-1.315
clerosis-1.213
hib-1.190
hod-1.183
Token where
Token ID810
Feature activation-1.10e+00

Pos logit contributions
rag+0.064
+0.062
clerosis+0.057
hib+0.056
hod+0.056
Neg logit contributions
ovan-0.055
ould-0.054
thinkable-0.049
 rapport-0.049
 Conversation-0.049
Token a
Token ID257
Feature activation+2.56e-01

Pos logit contributions
ovan+0.725
ould+0.711
 rapport+0.649
thinkable+0.645
 Conversation+0.644
Neg logit contributions
rag-0.883
-0.862
clerosis-0.789
hib-0.772
hod-0.767
Token game
Token ID983
Feature activation+4.46e-01

Pos logit contributions
rag+0.212
+0.206
clerosis+0.189
hib+0.187
hod+0.185
Neg logit contributions
ovan-0.163
ould-0.160
thinkable-0.146
 rapport-0.143
 Conversation-0.143
Token can
Token ID460
Feature activation+4.97e-02

Pos logit contributions
rag+0.332
+0.322
clerosis+0.292
hib+0.287
hod+0.286
Neg logit contributions
ovan-0.320
ould-0.316
thinkable-0.289
 rapport-0.288
 Conversation-0.287
Token go
Token ID467
Feature activation+2.50e-01

Pos logit contributions
rag+0.038
+0.037
clerosis+0.034
hib+0.034
hod+0.033
Neg logit contributions
ovan-0.034
ould-0.033
thinkable-0.031
 Conversation-0.031
 rapport-0.030
Token from
Token ID422
Feature activation+1.12e+00

Pos logit contributions
rag+0.214
+0.208
clerosis+0.191
hib+0.189
hod+0.188
Neg logit contributions
ovan-0.152
ould-0.149
thinkable-0.134
 rapport-0.134
 Conversation-0.133
Token fun
Token ID1257
Feature activation+1.27e+00

Pos logit contributions
rag+0.890
+0.867
clerosis+0.792
hib+0.780
 duplication+0.774
Neg logit contributions
ovan-0.751
ould-0.735
thinkable-0.671
 rapport-0.667
 Conversation-0.664
Token to
Token ID284
Feature activation+2.57e-01

Pos logit contributions
rag+0.959
+0.935
clerosis+0.850
hib+0.833
hod+0.826
Neg logit contributions
ovan-0.900
ould-0.884
 rapport-0.811
thinkable-0.808
 Conversation-0.806
Token legend
Token ID8177
Feature activation+8.93e-03

Pos logit contributions
rag+0.204
+0.199
clerosis+0.182
hib+0.178
hod+0.177
Neg logit contributions
ovan-0.172
ould-0.169
 rapport-0.154
thinkable-0.153
 Conversation-0.153
Token,"
Token ID553
Feature activation+4.49e-01

Pos logit contributions
rag+0.007
+0.006
clerosis+0.006
hib+0.006
hod+0.006
Neg logit contributions
ovan-0.006
ould-0.006
 rapport-0.006
thinkable-0.006
 Conversation-0.006
Token U
Token ID471
Feature activation+1.93e-01

Pos logit contributions
rag+0.338
+0.329
clerosis+0.299
hib+0.293
hod+0.291
Neg logit contributions
ovan-0.318
ould-0.313
 rapport-0.287
thinkable-0.286
 Conversation-0.285
Token scanners
Token ID47059
Feature activation+5.34e-01

Pos logit contributions
rag+0.489
+0.477
clerosis+0.435
hib+0.429
 duplication+0.426
Neg logit contributions
ovan-0.428
ould-0.419
thinkable-0.382
 rapport-0.381
 Conversation-0.379
Token highlight
Token ID7238
Feature activation+4.95e-01

Pos logit contributions
rag+0.428
+0.417
clerosis+0.382
hib+0.376
hod+0.372
Neg logit contributions
ovan-0.353
ould-0.346
thinkable-0.315
 rapport-0.315
 Conversation-0.313
Token items
Token ID3709
Feature activation+5.34e-02

Pos logit contributions
rag+0.395
+0.385
clerosis+0.351
hib+0.348
 duplication+0.344
Neg logit contributions
ovan-0.329
ould-0.323
 rapport-0.293
thinkable-0.293
 Conversation-0.292
Token that
Token ID326
Feature activation+2.75e-01

Pos logit contributions
rag+0.039
+0.038
clerosis+0.035
hib+0.034
 duplication+0.034
Neg logit contributions
ovan-0.039
ould-0.038
 rapport-0.035
 Conversation-0.035
thinkable-0.035
Token are
Token ID389
Feature activation-1.41e-02

Pos logit contributions
rag+0.224
+0.218
clerosis+0.200
hib+0.197
hod+0.194
Neg logit contributions
ovan-0.179
ould-0.175
 rapport-0.159
thinkable-0.159
 Conversation-0.158
Token especially
Token ID2592
Feature activation+5.37e-01

Pos logit contributions
ovan+0.010
ould+0.010
 rapport+0.009
 Conversation+0.009
thinkable+0.009
Neg logit contributions
rag-0.011
-0.010
clerosis-0.010
hib-0.009
 duplication-0.009
Token dense
Token ID15715
Feature activation+1.78e+00

Pos logit contributions
rag+0.405
+0.392
clerosis+0.357
hib+0.354
hod+0.349
Neg logit contributions
ovan-0.381
ould-0.374
 rapport-0.341
 Conversation-0.341
thinkable-0.340
Token,
Token ID11
Feature activation+2.01e-01

Pos logit contributions
rag+1.320
+1.281
clerosis+1.162
hib+1.145
hod+1.131
Neg logit contributions
ovan-1.277
ould-1.254
 rapport-1.150
thinkable-1.147
 Conversation-1.145
Token and
Token ID290
Feature activation+6.18e-01

Pos logit contributions
rag+0.150
+0.146
clerosis+0.133
hib+0.131
hod+0.129
Neg logit contributions
ovan-0.144
ould-0.141
 rapport-0.129
thinkable-0.129
 Conversation-0.129
Token items
Token ID3709
Feature activation-2.78e-01

Pos logit contributions
rag+0.472
+0.457
clerosis+0.416
hib+0.411
 duplication+0.406
Neg logit contributions
ovan-0.435
ould-0.426
thinkable-0.389
 rapport-0.389
 Conversation-0.388
Token that
Token ID326
Feature activation-8.85e-02

Pos logit contributions
ovan+0.205
ould+0.202
 rapport+0.185
 Conversation+0.184
thinkable+0.184
Neg logit contributions
rag-0.202
-0.196
clerosis-0.178
hib-0.175
hod-0.173
Token
Token ID198
Feature activation+3.34e-01

Pos logit contributions
ovan+0.023
ould+0.022
 rapport+0.020
thinkable+0.020
 Conversation+0.020
Neg logit contributions
rag-0.027
-0.027
clerosis-0.024
hib-0.024
hod-0.024
Token
Token ID198
Feature activation+6.68e-01

Pos logit contributions
rag+0.278
+0.271
clerosis+0.248
hib+0.246
hod+0.244
Neg logit contributions
ovan-0.208
ould-0.206
 rapport-0.189
thinkable-0.187
 Conversation-0.185
TokenDevelop
Token ID19246
Feature activation+3.11e-02

Pos logit contributions
rag+0.462
+0.449
clerosis+0.405
hib+0.395
hod+0.392
Neg logit contributions
ovan-0.515
ould-0.506
 rapport-0.468
thinkable-0.467
 Conversation-0.465
Tokened
Token ID276
Feature activation-1.30e+00

Pos logit contributions
rag+0.018
+0.018
clerosis+0.015
hib+0.015
hod+0.015
Neg logit contributions
ovan-0.027
ould-0.027
 rapport-0.025
thinkable-0.025
 Conversation-0.025
Token by
Token ID416
Feature activation-1.46e+00

Pos logit contributions
ovan+0.823
ould+0.805
 rapport+0.729
 Conversation+0.728
thinkable+0.726
Neg logit contributions
rag-1.078
-1.051
clerosis-0.972
hib-0.953
hod-0.945
Token:
Token ID25
Feature activation+1.97e+00

Pos logit contributions
ovan+0.962
ould+0.941
 rapport+0.859
 Conversation+0.857
thinkable+0.846
Neg logit contributions
rag-1.176
-1.147
clerosis-1.053
hib-1.026
hod-1.020
Token Relic
Token ID43437
Feature activation+6.79e-01

Pos logit contributions
rag+1.602
+1.569
clerosis+1.432
hib+1.409
hod+1.402
Neg logit contributions
ovan-1.264
ould-1.242
thinkable-1.134
 rapport-1.129
 Conversation-1.113
Token Entertainment
Token ID11058
Feature activation+2.57e+00

Pos logit contributions
rag+0.438
+0.423
clerosis+0.377
hib+0.366
 duplication+0.363
Neg logit contributions
ovan-0.557
ould-0.546
 rapport-0.508
 Conversation-0.507
thinkable-0.506
Token -
Token ID532
Feature activation-1.44e+00

Pos logit contributions
rag+2.108
+2.060
clerosis+1.886
hib+1.855
hod+1.844
Neg logit contributions
ovan-1.629
ould-1.605
thinkable-1.457
 rapport-1.454
 Conversation-1.445
Token Sierra
Token ID17559
Feature activation+7.69e-02

Pos logit contributions
ovan+0.987
ould+0.969
 rapport+0.887
 Conversation+0.884
thinkable+0.881
Neg logit contributions
rag-1.115
-1.084
clerosis-0.991
hib-0.969
hod-0.961
Token Studios
Token ID13799
Feature activation+1.73e+00

Pos logit contributions
rag+0.050
+0.048
clerosis+0.043
hib+0.042
hod+0.041
Neg logit contributions
ovan-0.063
ould-0.062
 rapport-0.058
thinkable-0.057
 Conversation-0.057
Token commercials
Token ID30503
Feature activation-2.11e+00

Pos logit contributions
rag+0.084
+0.082
clerosis+0.075
hib+0.073
hod+0.073
Neg logit contributions
ovan-0.070
ould-0.069
 rapport-0.062
thinkable-0.062
 Conversation-0.062
Token for
Token ID329
Feature activation+5.83e-01

Pos logit contributions
ovan+1.455
ould+1.426
 rapport+1.311
thinkable+1.304
 Conversation+1.300
Neg logit contributions
rag-1.622
-1.588
clerosis-1.441
hib-1.407
hod-1.403
Token the
Token ID262
Feature activation+3.68e-01

Pos logit contributions
rag+0.467
+0.455
clerosis+0.416
hib+0.410
hod+0.406
Neg logit contributions
ovan-0.385
ould-0.378
thinkable-0.344
 rapport-0.343
 Conversation-0.341
Token beer
Token ID6099
Feature activation-3.20e+00

Pos logit contributions
rag+0.288
+0.280
clerosis+0.256
hib+0.252
hod+0.249
Neg logit contributions
ovan-0.250
ould-0.246
thinkable-0.225
 rapport-0.224
 Conversation-0.222
Token company
Token ID1664
Feature activation-7.49e-01

Pos logit contributions
ovan+2.153
ould+2.101
 rapport+1.928
thinkable+1.921
 Conversation+1.920
Neg logit contributions
rag-2.514
-2.456
clerosis-2.251
hod-2.175
hib-2.175
Token,
Token ID11
Feature activation+4.19e-01

Pos logit contributions
ovan+0.518
ould+0.507
 rapport+0.466
 Conversation+0.465
thinkable+0.463
Neg logit contributions
rag-0.577
-0.564
clerosis-0.514
hib-0.503
hod-0.498
Token traditionally
Token ID16083
Feature activation+1.12e+00

Pos logit contributions
rag+0.337
+0.328
clerosis+0.300
hib+0.295
hod+0.293
Neg logit contributions
ovan-0.276
ould-0.271
thinkable-0.246
 rapport-0.246
 Conversation-0.244
Token during
Token ID1141
Feature activation+3.18e+00

Pos logit contributions
rag+0.889
+0.864
clerosis+0.792
hib+0.778
 duplication+0.774
Neg logit contributions
ovan-0.745
ould-0.735
thinkable-0.663
 Conversation-0.659
 rapport-0.659
Token the
Token ID262
Feature activation+8.42e-01

Pos logit contributions
rag+2.411
+2.341
clerosis+2.147
hib+2.117
 duplication+2.117
Neg logit contributions
ovan-2.202
ould-2.166
thinkable-1.979
fleet-1.949
 rapport-1.946
Token Super
Token ID3115
Feature activation+9.21e-01

Pos logit contributions
rag+0.585
+0.566
clerosis+0.512
hib+0.500
hod+0.500
Neg logit contributions
ovan-0.644
ould-0.636
thinkable-0.586
 rapport-0.580
 Conversation-0.578
Token Bowl
Token ID8693
Feature activation+1.42e+00

Pos logit contributions
rag+0.572
+0.554
clerosis+0.493
hib+0.481
hod+0.476
Neg logit contributions
ovan-0.774
ould-0.762
 rapport-0.710
thinkable-0.708
 Conversation-0.705
Token my
Token ID616
Feature activation+6.62e-03

Pos logit contributions
rag+0.175
+0.171
clerosis+0.157
hib+0.154
hod+0.153
Neg logit contributions
ovan-0.135
ould-0.133
 rapport-0.121
thinkable-0.120
 Conversation-0.120
Token team
Token ID1074
Feature activation-5.68e-01

Pos logit contributions
rag+0.006
+0.006
clerosis+0.006
hib+0.006
 duplication+0.005
Neg logit contributions
ovan-0.003
ould-0.003
thinkable-0.003
 Conversation-0.003
 rapport-0.003
Token to
Token ID284
Feature activation-1.48e-01

Pos logit contributions
ovan+0.398
ould+0.391
 rapport+0.359
thinkable+0.357
 Conversation+0.356
Neg logit contributions
rag-0.432
-0.422
clerosis-0.384
hib-0.376
hod-0.373
Token achieve
Token ID4620
Feature activation+1.97e+00

Pos logit contributions
ovan+0.099
ould+0.097
thinkable+0.089
 Conversation+0.088
 rapport+0.088
Neg logit contributions
rag-0.118
-0.115
clerosis-0.105
hib-0.103
hod-0.103
Token a
Token ID257
Feature activation-4.29e-01

Pos logit contributions
rag+1.524
+1.473
clerosis+1.350
hib+1.326
 duplication+1.325
Neg logit contributions
ovan-1.361
ould-1.336
thinkable-1.216
 Conversation-1.211
 rapport-1.202
Token goal
Token ID3061
Feature activation-1.15e-01

Pos logit contributions
ovan+0.283
ould+0.277
thinkable+0.251
 Conversation+0.250
 rapport+0.248
Neg logit contributions
rag-0.346
-0.335
clerosis-0.308
hib-0.302
hod-0.302
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
ovan+0.081
ould+0.079
 rapport+0.073
thinkable+0.072
 Conversation+0.072
Neg logit contributions
rag-0.088
-0.086
clerosis-0.078
hib-0.077
hod-0.076
Token and
Token ID290
Feature activation+3.21e-01

Pos logit contributions
ovan+0.135
ould+0.132
 rapport+0.121
 Conversation+0.120
thinkable+0.120
Neg logit contributions
rag-0.161
-0.158
clerosis-0.143
hib-0.140
hod-0.139
Token that
Token ID326
Feature activation-1.15e+00

Pos logit contributions
rag+0.254
+0.247
clerosis+0.226
hib+0.222
hod+0.220
Neg logit contributions
ovan-0.215
ould-0.211
thinkable-0.193
 rapport-0.192
 Conversation-0.191
Token's
Token ID338
Feature activation-1.51e-01

Pos logit contributions
ovan+0.779
ould+0.766
 rapport+0.719
thinkable+0.700
 Conversation+0.696
Neg logit contributions
rag-0.893
-0.883
clerosis-0.799
hib-0.779
hod-0.772
Token to
Token ID284
Feature activation-7.93e-02

Pos logit contributions
ovan+0.102
ould+0.099
 rapport+0.092
 Conversation+0.091
thinkable+0.091
Neg logit contributions
rag-0.119
-0.117
clerosis-0.106
hib-0.104
hod-0.103
Token
Token ID198
Feature activation-4.45e-01

Pos logit contributions
rag+0.593
+0.576
clerosis+0.526
hib+0.521
hod+0.521
Neg logit contributions
ovan-0.457
ould-0.451
thinkable-0.413
 rapport-0.411
 Conversation-0.404
Token
Token ID198
Feature activation-1.06e-01

Pos logit contributions
ovan+0.272
ould+0.267
 rapport+0.243
thinkable+0.241
 Conversation+0.240
Neg logit contributions
rag-0.378
-0.369
clerosis-0.338
hib-0.334
hod-0.332
Token2002
Token ID16942
Feature activation+8.27e-01

Pos logit contributions
ovan+0.088
ould+0.087
 rapport+0.082
thinkable+0.082
 Conversation+0.082
Neg logit contributions
rag-0.066
-0.064
clerosis-0.056
hod-0.056
hib-0.055
Token Matt
Token ID4705
Feature activation-1.28e+00

Pos logit contributions
rag+0.656
+0.641
clerosis+0.586
hib+0.574
hod+0.570
Neg logit contributions
ovan-0.552
ould-0.542
 rapport-0.495
thinkable-0.493
 Conversation-0.491
Token Le
Token ID1004
Feature activation+1.23e+00

Pos logit contributions
ovan+0.855
ould+0.827
 rapport+0.754
 Conversation+0.752
thinkable+0.744
Neg logit contributions
rag-1.016
-1.003
clerosis-0.911
hib-0.886
 duplication-0.883
TokenBl
Token ID3629
Feature activation-1.89e+00

Pos logit contributions
rag+0.694
+0.667
clerosis+0.586
hib+0.573
hod+0.565
Neg logit contributions
ovan-1.107
ould-1.092
 rapport-1.024
thinkable-1.022
 Conversation-1.019
Tokenanc
Token ID1192
Feature activation-3.50e-01

Pos logit contributions
ovan+1.645
ould+1.616
thinkable+1.492
 rapport+1.491
fleet+1.475
Neg logit contributions
rag-1.123
-1.104
clerosis-0.962
hib-0.940
 duplication-0.935
Token
Token ID198
Feature activation-5.19e-01

Pos logit contributions
ovan+0.194
ould+0.189
 rapport+0.169
 Conversation+0.167
thinkable+0.167
Neg logit contributions
rag-0.319
-0.315
clerosis-0.292
hib-0.284
hod-0.282
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
ovan+0.317
ould+0.312
 rapport+0.283
thinkable+0.281
 Conversation+0.280
Neg logit contributions
rag-0.442
-0.431
clerosis-0.395
hib-0.391
hod-0.388
Token2002
Token ID16942
Feature activation+9.35e-01

Pos logit contributions
ovan+0.103
ould+0.102
 rapport+0.096
 Conversation+0.095
thinkable+0.095
Neg logit contributions
rag-0.082
-0.080
clerosis-0.070
hib-0.069
hod-0.069
Token Bernie
Token ID10477
Feature activation-1.81e+00

Pos logit contributions
rag+0.762
+0.745
clerosis+0.683
hib+0.670
hod+0.667
Neg logit contributions
ovan-0.601
ould-0.590
thinkable-0.538
 rapport-0.536
 Conversation-0.529
Token of
Token ID286
Feature activation-6.64e-02

Pos logit contributions
ovan+0.087
ould+0.085
 rapport+0.078
thinkable+0.077
 Conversation+0.077
Neg logit contributions
rag-0.112
-0.110
clerosis-0.100
hib-0.098
hod-0.098
Token society
Token ID3592
Feature activation+2.09e-01

Pos logit contributions
ovan+0.043
ould+0.042
 rapport+0.038
thinkable+0.038
 Conversation+0.038
Neg logit contributions
rag-0.054
-0.053
clerosis-0.049
hib-0.048
hod-0.047
Token.
Token ID13
Feature activation-5.33e-01

Pos logit contributions
rag+0.156
+0.153
clerosis+0.139
hib+0.136
hod+0.135
Neg logit contributions
ovan-0.149
ould-0.146
 rapport-0.134
thinkable-0.134
 Conversation-0.133
Token Yet
Token ID6430
Feature activation-1.72e+00

Pos logit contributions
ovan+0.348
ould+0.341
 rapport+0.310
thinkable+0.310
 Conversation+0.306
Neg logit contributions
rag-0.431
-0.422
clerosis-0.386
hib-0.379
hod-0.376
Token mayors
Token ID44660
Feature activation-1.23e+00

Pos logit contributions
ovan+1.164
ould+1.142
 rapport+1.053
 Conversation+1.044
thinkable+1.039
Neg logit contributions
rag-1.341
-1.317
clerosis-1.196
hib-1.169
hod-1.163
Token,
Token ID11
Feature activation-1.36e-01

Pos logit contributions
ovan+0.825
ould+0.809
 rapport+0.739
thinkable+0.734
 Conversation+0.731
Neg logit contributions
rag-0.979
-0.960
clerosis-0.874
hib-0.857
hod-0.850
Token as
Token ID355
Feature activation+1.78e+00

Pos logit contributions
ovan+0.099
ould+0.097
 rapport+0.089
thinkable+0.089
 Conversation+0.089
Neg logit contributions
rag-0.100
-0.097
clerosis-0.088
hib-0.086
hod-0.085
Token the
Token ID262
Feature activation-1.41e-01

Pos logit contributions
rag+1.340
+1.295
clerosis+1.184
hib+1.161
hod+1.154
Neg logit contributions
ovan-1.252
ould-1.230
thinkable-1.126
 Conversation-1.116
 rapport-1.114
Token top
Token ID1353
Feature activation-1.97e+00

Pos logit contributions
ovan+0.098
ould+0.096
 rapport+0.088
thinkable+0.088
 Conversation+0.087
Neg logit contributions
rag-0.109
-0.106
clerosis-0.097
hib-0.095
hod-0.094
Token political
Token ID1964
Feature activation-9.62e-01

Pos logit contributions
ovan+1.370
ould+1.342
 rapport+1.236
thinkable+1.225
 Conversation+1.224
Neg logit contributions
rag-1.505
-1.468
clerosis-1.340
hib-1.311
hod-1.300
Token and
Token ID290
Feature activation+2.34e-01

Pos logit contributions
ovan+0.630
ould+0.617
thinkable+0.562
 Conversation+0.557
 rapport+0.555
Neg logit contributions
rag-0.780
-0.754
clerosis-0.690
hib-0.679
 duplication-0.677
Token comes
Token ID2058
Feature activation+1.44e+00

Pos logit contributions
rag+0.738
+0.721
clerosis+0.659
hib+0.649
hod+0.643
Neg logit contributions
ovan-0.559
ould-0.546
 rapport-0.497
 Conversation-0.495
thinkable-0.493
Token to
Token ID284
Feature activation+5.78e-01

Pos logit contributions
rag+1.135
+1.116
clerosis+1.019
hod+0.996
hib+0.995
Neg logit contributions
ovan-0.974
ould-0.947
 rapport-0.875
 Conversation-0.866
thinkable-0.862
Token women
Token ID1466
Feature activation+1.87e+00

Pos logit contributions
rag+0.461
+0.448
clerosis+0.410
hib+0.402
hod+0.399
Neg logit contributions
ovan-0.384
ould-0.378
thinkable-0.344
 Conversation-0.341
 rapport-0.341
Token
Token ID447
Feature activation+5.97e-01

Pos logit contributions
rag+1.403
+1.361
clerosis+1.238
hib+1.213
hod+1.200
Neg logit contributions
ovan-1.331
ould-1.309
thinkable-1.198
 rapport-1.198
 Conversation-1.194
Token
Token ID247
Feature activation+5.78e-01

Pos logit contributions
rag+0.573
+0.562
hib+0.511
clerosis+0.509
hod+0.508
Neg logit contributions
ovan-0.305
ould-0.288
 rapport-0.267
thinkable-0.267
 Conversation-0.259
Tokens
Token ID82
Feature activation-1.60e-01

Pos logit contributions
rag+0.434
+0.425
clerosis+0.386
hib+0.377
hod+0.375
Neg logit contributions
ovan-0.410
ould-0.403
 rapport-0.369
thinkable-0.368
 Conversation-0.367
Token rights
Token ID2489
Feature activation+3.69e+00

Pos logit contributions
ovan+0.107
ould+0.105
thinkable+0.095
 rapport+0.095
 Conversation+0.095
Neg logit contributions
rag-0.128
-0.124
clerosis-0.114
hib-0.111
hod-0.111
Token,
Token ID11
Feature activation+4.71e-01

Pos logit contributions
rag+2.777
+2.691
clerosis+2.456
hib+2.410
hod+2.382
Neg logit contributions
ovan-2.601
ould-2.548
thinkable-2.335
 rapport-2.327
 Conversation-2.321
Token they
Token ID484
Feature activation-6.93e-01

Pos logit contributions
rag+0.366
+0.357
clerosis+0.326
hib+0.319
hod+0.317
Neg logit contributions
ovan-0.322
ould-0.316
thinkable-0.289
 rapport-0.288
 Conversation-0.286
Token are
Token ID389
Feature activation-3.36e-01

Pos logit contributions
ovan+0.456
ould+0.446
 rapport+0.407
thinkable+0.406
 Conversation+0.406
Neg logit contributions
rag-0.557
-0.543
clerosis-0.497
hib-0.488
hod-0.484
Token two
Token ID734
Feature activation+5.23e-01

Pos logit contributions
ovan+0.231
ould+0.226
 rapport+0.206
thinkable+0.206
 Conversation+0.206
Neg logit contributions
rag-0.261
-0.253
clerosis-0.230
hib-0.227
hod-0.225
TokenCo
Token ID7222
Feature activation+2.50e-01

Pos logit contributions
ovan+0.258
ould+0.253
thinkable+0.233
 rapport+0.233
 Conversation+0.233
Neg logit contributions
rag-0.233
-0.227
clerosis-0.206
hib-0.200
hod-0.198
Tokenax
Token ID897
Feature activation+4.99e+00

Pos logit contributions
rag+0.145
+0.141
clerosis+0.125
hib+0.121
hod+0.119
Neg logit contributions
ovan-0.220
ould-0.216
 rapport-0.201
 Conversation-0.200
thinkable-0.200
Token)
Token ID8
Feature activation+1.39e+00

Pos logit contributions
rag+4.060
+3.965
clerosis+3.630
hib+3.580
 duplication+3.576
Neg logit contributions
ovan-3.119
ould-3.065
thinkable-2.821
 rapport-2.814
 Conversation-2.763
Token under
Token ID739
Feature activation-2.42e+00

Pos logit contributions
rag+1.053
+1.022
clerosis+0.931
hib+0.919
 duplication+0.914
Neg logit contributions
ovan-0.976
ould-0.962
thinkable-0.882
 rapport-0.874
fleet-0.864
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
ovan+1.643
ould+1.615
 Conversation+1.497
 rapport+1.493
thinkable+1.481
Neg logit contributions
rag-1.873
-1.835
clerosis-1.679
hib-1.632
hod-1.620
Token list
Token ID1351
Feature activation-1.26e+00

Pos logit contributions
ovan+0.850
ould+0.832
 rapport+0.766
 Conversation+0.764
thinkable+0.761
Neg logit contributions
rag-0.969
-0.946
clerosis-0.865
hib-0.844
hod-0.837
Token of
Token ID286
Feature activation-6.24e-01

Pos logit contributions
ovan+0.821
ould+0.803
 rapport+0.733
thinkable+0.732
 Conversation+0.731
Neg logit contributions
rag-1.019
-0.997
clerosis-0.912
hib-0.896
hod-0.887
Token brid
Token ID38265
Feature activation-1.68e-01

Pos logit contributions
ovan+0.420
ould+0.412
 rapport+0.378
 Conversation+0.376
thinkable+0.375
Neg logit contributions
rag-0.491
-0.479
clerosis-0.438
hib-0.428
hod-0.425
Tokenged
Token ID2004
Feature activation-1.99e+00

Pos logit contributions
ovan+0.159
ould+0.157
 rapport+0.148
thinkable+0.147
 Conversation+0.147
Neg logit contributions
rag-0.086
-0.083
clerosis-0.072
hib-0.069
hod-0.068
Token connections
Token ID8787
Feature activation-2.63e+00

Pos logit contributions
ovan+1.358
ould+1.331
 rapport+1.223
 Conversation+1.216
thinkable+1.214
Neg logit contributions
rag-1.553
-1.515
clerosis-1.386
hib-1.351
hod-1.340
Token.
Token ID13
Feature activation-8.30e-01

Pos logit contributions
ovan+1.794
ould+1.754
 rapport+1.623
 Conversation+1.617
thinkable+1.605
Neg logit contributions
rag-2.016
-1.988
clerosis-1.800
hib-1.760
hod-1.746
Token devastating
Token ID14101
Feature activation-9.66e-02

Pos logit contributions
rag+0.877
+0.846
clerosis+0.768
hib+0.759
 duplication+0.756
Neg logit contributions
ovan-0.814
ould-0.797
 rapport-0.728
 Conversation-0.727
thinkable-0.719
Token for
Token ID329
Feature activation+2.03e-01

Pos logit contributions
ovan+0.068
ould+0.066
 rapport+0.061
thinkable+0.061
 Conversation+0.061
Neg logit contributions
rag-0.073
-0.072
clerosis-0.065
hib-0.064
hod-0.063
Token local
Token ID1957
Feature activation-1.36e+00

Pos logit contributions
rag+0.165
+0.161
clerosis+0.147
hib+0.145
hod+0.143
Neg logit contributions
ovan-0.133
ould-0.130
 rapport-0.118
thinkable-0.118
 Conversation-0.118
Token wildlife
Token ID15599
Feature activation-9.15e-01

Pos logit contributions
ovan+0.882
ould+0.864
 rapport+0.785
thinkable+0.784
 Conversation+0.781
Neg logit contributions
rag-1.113
-1.085
clerosis-0.995
hib-0.977
hod-0.970
Token and
Token ID290
Feature activation+6.32e-01

Pos logit contributions
ovan+0.626
ould+0.614
 rapport+0.564
thinkable+0.562
 Conversation+0.558
Neg logit contributions
rag-0.708
-0.693
clerosis-0.630
hib-0.616
hod-0.613
Token Indigenous
Token ID24033
Feature activation-4.92e+00

Pos logit contributions
rag+0.494
+0.479
clerosis+0.442
hib+0.433
 duplication+0.430
Neg logit contributions
ovan-0.429
ould-0.419
thinkable-0.383
 rapport-0.380
 Conversation-0.380
Token communities
Token ID5348
Feature activation-1.85e+00

Pos logit contributions
ovan+3.186
ould+3.123
 rapport+2.906
 Conversation+2.865
thinkable+2.820
Neg logit contributions
rag-3.920
-3.906
clerosis-3.523
hod-3.439
hib-3.432
Token.
Token ID13
Feature activation-3.80e-01

Pos logit contributions
ovan+1.249
ould+1.230
thinkable+1.129
 rapport+1.127
 Conversation+1.112
Neg logit contributions
rag-1.437
-1.415
clerosis-1.283
hib-1.254
hod-1.248
Token
Token ID198
Feature activation-5.21e-01

Pos logit contributions
ovan+0.255
ould+0.250
 rapport+0.229
thinkable+0.229
 Conversation+0.223
Neg logit contributions
rag-0.299
-0.294
clerosis-0.267
hib-0.262
hod-0.260
Token
Token ID198
Feature activation-2.55e-01

Pos logit contributions
ovan+0.326
ould+0.321
 rapport+0.295
thinkable+0.291
 Conversation+0.290
Neg logit contributions
rag-0.434
-0.423
clerosis-0.386
hib-0.384
hod-0.380
Token5
Token ID20
Feature activation+8.95e-01

Pos logit contributions
ovan+0.166
ould+0.163
thinkable+0.147
 rapport+0.147
 Conversation+0.147
Neg logit contributions
rag-0.207
-0.202
clerosis-0.185
hib-0.181
hod-0.180
Tokenai
Token ID1872
Feature activation+1.05e+00

Pos logit contributions
ovan+1.078
ould+1.057
 rapport+0.981
 Conversation+0.978
thinkable+0.978
Neg logit contributions
rag-1.068
-1.028
clerosis-0.936
hib-0.920
hod-0.914
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
rag+0.825
+0.811
clerosis+0.746
hib+0.717
 duplication+0.710
Neg logit contributions
ovan-0.708
ould-0.695
 rapport-0.640
thinkable-0.626
 Conversation-0.624
Token Wh
Token ID854
Feature activation-3.56e+00

Pos logit contributions
ovan+0.667
ould+0.653
 rapport+0.600
 Conversation+0.595
thinkable+0.589
Neg logit contributions
rag-0.853
-0.834
clerosis-0.766
hib-0.747
hod-0.742
Tokenist
Token ID396
Feature activation+1.47e+00

Pos logit contributions
ovan+2.634
ould+2.588
 rapport+2.342
thinkable+2.338
fleet+2.322
Neg logit contributions
-2.558
rag-2.540
clerosis-2.271
 duplication-2.248
hod-2.188
Tokenling
Token ID1359
Feature activation-5.51e-01

Pos logit contributions
rag+1.142
+1.081
hib+1.012
clerosis+1.005
hod+0.989
Neg logit contributions
ovan-0.991
ould-0.969
 rapport-0.913
thinkable-0.903
 Conversation-0.899
Token Woods
Token ID16483
Feature activation-4.15e+00

Pos logit contributions
ovan+0.436
ould+0.427
thinkable+0.397
 rapport+0.396
 Conversation+0.392
Neg logit contributions
rag-0.370
-0.358
clerosis-0.324
hib-0.318
hod-0.317
Token to
Token ID284
Feature activation-1.65e+00

Pos logit contributions
ovan+2.693
ould+2.628
 rapport+2.461
 Conversation+2.392
 sentiments+2.341
Neg logit contributions
rag-3.273
-3.266
clerosis-2.965
fare-2.837
alon-2.831
Token vac
Token ID6658
Feature activation-4.41e-01

Pos logit contributions
ovan+1.226
ould+1.205
 rapport+1.117
thinkable+1.103
 Conversation+1.103
Neg logit contributions
rag-1.184
-1.153
clerosis-1.046
hib-1.016
hod-1.007
Tokenate
Token ID378
Feature activation-1.02e+00

Pos logit contributions
ovan+0.380
ould+0.374
 rapport+0.350
thinkable+0.349
 Conversation+0.349
Neg logit contributions
rag-0.265
-0.255
clerosis-0.225
hib-0.220
hod-0.218
Token 14
Token ID1478
Feature activation-1.53e+00

Pos logit contributions
ovan+0.607
ould+0.596
 rapport+0.544
 Conversation+0.536
thinkable+0.532
Neg logit contributions
rag-0.884
-0.864
clerosis-0.804
hib-0.777
hod-0.771
Token.
Token ID13
Feature activation-7.47e-01

Pos logit contributions
ovan+1.105
ould+1.084
 rapport+0.997
thinkable+0.991
 Conversation+0.987
Neg logit contributions
rag-1.127
-1.096
clerosis-1.002
hib-0.972
hod-0.962
Token said
Token ID531
Feature activation+1.63e+00

Pos logit contributions
rag+0.341
+0.332
clerosis+0.302
hib+0.296
hod+0.295
Neg logit contributions
ovan-0.312
ould-0.307
 Conversation-0.282
 rapport-0.281
thinkable-0.280
Token people
Token ID661
Feature activation-4.00e-01

Pos logit contributions
rag+1.224
+1.190
clerosis+1.078
hib+1.062
hod+1.051
Neg logit contributions
ovan-1.156
ould-1.135
thinkable-1.041
 rapport-1.036
 Conversation-1.033
Token who
Token ID508
Feature activation-3.84e-01

Pos logit contributions
ovan+0.277
ould+0.272
 rapport+0.249
thinkable+0.249
 Conversation+0.248
Neg logit contributions
rag-0.308
-0.300
clerosis-0.274
hib-0.268
hod-0.266
Token sought
Token ID7194
Feature activation+1.91e+00

Pos logit contributions
ovan+0.256
ould+0.252
thinkable+0.230
 Conversation+0.230
 rapport+0.230
Neg logit contributions
rag-0.304
-0.295
clerosis-0.270
hib-0.265
 duplication-0.263
Token the
Token ID262
Feature activation-4.73e-01

Pos logit contributions
rag+1.477
+1.446
clerosis+1.309
hib+1.292
 duplication+1.288
Neg logit contributions
ovan-1.306
ould-1.288
thinkable-1.174
 rapport-1.161
 Conversation-1.161
Token same
Token ID976
Feature activation-4.51e+00

Pos logit contributions
ovan+0.326
ould+0.322
thinkable+0.294
 rapport+0.292
 Conversation+0.291
Neg logit contributions
rag-0.363
-0.355
clerosis-0.322
hib-0.317
 duplication-0.315
Token benefits
Token ID4034
Feature activation-1.36e+00

Pos logit contributions
ovan+3.006
ould+2.926
 rapport+2.719
 Conversation+2.669
thinkable+2.666
Neg logit contributions
rag-3.583
-3.499
clerosis-3.210
hib-3.125
hod-3.105
Token for
Token ID329
Feature activation-4.85e-01

Pos logit contributions
ovan+0.905
ould+0.887
 rapport+0.809
thinkable+0.806
 Conversation+0.804
Neg logit contributions
rag-1.083
-1.057
clerosis-0.967
hib-0.947
hod-0.940
Token free
Token ID1479
Feature activation+3.23e+00

Pos logit contributions
ovan+0.314
ould+0.308
 rapport+0.280
thinkable+0.279
 Conversation+0.278
Neg logit contributions
rag-0.395
-0.385
clerosis-0.353
hib-0.346
hod-0.344
Token by
Token ID416
Feature activation-7.89e-01

Pos logit contributions
rag+2.404
+2.355
clerosis+2.144
 duplication+2.121
fare+2.113
Neg logit contributions
ovan-2.253
ould-2.203
thinkable-2.026
 rapport-2.017
 Conversation-2.010
Token "
Token ID366
Feature activation+2.88e+00

Pos logit contributions
ovan+0.530
ould+0.521
thinkable+0.474
 rapport+0.473
 Conversation+0.471
Neg logit contributions
rag-0.623
-0.608
clerosis-0.555
hib-0.544
 duplication-0.541
Token that
Token ID326
Feature activation+5.04e-01

Pos logit contributions
ovan+0.874
ould+0.857
 rapport+0.785
thinkable+0.782
 Conversation+0.781
Neg logit contributions
rag-0.994
-0.969
clerosis-0.885
hib-0.867
hod-0.862
Token.
Token ID13
Feature activation-1.87e-03

Pos logit contributions
rag+0.370
+0.361
clerosis+0.327
 duplication+0.321
hib+0.320
Neg logit contributions
ovan-0.364
ould-0.359
 rapport-0.329
thinkable-0.328
 Conversation-0.327
Token It
Token ID632
Feature activation+6.74e-01

Pos logit contributions
ovan+0.001
ould+0.001
thinkable+0.001
 rapport+0.001
 Conversation+0.001
Neg logit contributions
rag-0.002
-0.001
clerosis-0.001
hib-0.001
hod-0.001
Token allows
Token ID3578
Feature activation-5.51e-01

Pos logit contributions
rag+0.505
+0.492
clerosis+0.445
hib+0.437
hod+0.435
Neg logit contributions
ovan-0.480
ould-0.470
 rapport-0.434
 Conversation-0.433
thinkable-0.432
Token the
Token ID262
Feature activation-5.75e-01

Pos logit contributions
ovan+0.349
ould+0.342
 rapport+0.311
thinkable+0.309
 Conversation+0.308
Neg logit contributions
rag-0.456
-0.446
clerosis-0.409
hib-0.401
hod-0.398
Token same
Token ID976
Feature activation-4.19e+00

Pos logit contributions
ovan+0.394
ould+0.387
 rapport+0.354
thinkable+0.353
 Conversation+0.351
Neg logit contributions
rag-0.445
-0.434
clerosis-0.395
hib-0.388
 duplication-0.387
Token app
Token ID598
Feature activation-2.43e+00

Pos logit contributions
ovan+2.872
ould+2.820
 rapport+2.586
 Conversation+2.571
thinkable+2.564
Neg logit contributions
rag-3.236
-3.174
clerosis-2.890
hib-2.819
hod-2.802
Token to
Token ID284
Feature activation-3.81e-01

Pos logit contributions
ovan+1.689
ould+1.656
 rapport+1.516
thinkable+1.514
 Conversation+1.509
Neg logit contributions
rag-1.852
-1.812
clerosis-1.648
hib-1.613
hod-1.598
Token be
Token ID307
Feature activation-2.16e-02

Pos logit contributions
ovan+0.236
ould+0.231
 rapport+0.209
thinkable+0.209
 Conversation+0.208
Neg logit contributions
rag-0.321
-0.313
clerosis-0.288
hib-0.282
hod-0.280
Token installed
Token ID6589
Feature activation-1.43e+00

Pos logit contributions
ovan+0.015
ould+0.015
 rapport+0.013
 Conversation+0.013
thinkable+0.013
Neg logit contributions
rag-0.017
-0.016
clerosis-0.015
hib-0.015
 duplication-0.014
Token on
Token ID319
Feature activation+9.95e-01

Pos logit contributions
ovan+0.966
ould+0.947
 rapport+0.866
thinkable+0.863
 Conversation+0.860
Neg logit contributions
rag-1.126
-1.100
clerosis-1.004
hib-0.983
hod-0.976
Token,
Token ID11
Feature activation+4.47e-02

Pos logit contributions
ovan+0.589
ould+0.577
 rapport+0.526
 Conversation+0.521
thinkable+0.521
Neg logit contributions
rag-0.720
-0.705
clerosis-0.646
hib-0.632
hod-0.628
Token sign
Token ID1051
Feature activation+3.28e-01

Pos logit contributions
rag+0.037
+0.036
clerosis+0.033
 duplication+0.032
hib+0.032
Neg logit contributions
ovan-0.028
ould-0.028
thinkable-0.025
 rapport-0.025
 Conversation-0.025
Token in
Token ID287
Feature activation-1.12e+00

Pos logit contributions
rag+0.257
+0.251
clerosis+0.229
hib+0.224
hod+0.223
Neg logit contributions
ovan-0.223
ould-0.218
 rapport-0.200
thinkable-0.199
 Conversation-0.198
Token using
Token ID1262
Feature activation-1.88e+00

Pos logit contributions
ovan+0.787
ould+0.773
thinkable+0.709
 rapport+0.708
 Conversation+0.704
Neg logit contributions
rag-0.846
-0.825
clerosis-0.748
hib-0.734
hod-0.728
Token your
Token ID534
Feature activation+8.02e-02

Pos logit contributions
ovan+1.194
ould+1.170
 rapport+1.062
thinkable+1.059
 Conversation+1.054
Neg logit contributions
rag-1.551
-1.515
clerosis-1.390
hib-1.363
hod-1.354
Token Netflix
Token ID12074
Feature activation-6.62e+00

Pos logit contributions
rag+0.065
+0.064
clerosis+0.058
hod+0.057
hib+0.057
Neg logit contributions
ovan-0.052
ould-0.051
thinkable-0.047
 rapport-0.046
 Conversation-0.046
Token account
Token ID1848
Feature activation-1.34e+00

Pos logit contributions
ovan+4.507
ould+4.347
 Conversation+4.038
 rapport+4.030
fleet+3.920
Neg logit contributions
rag-5.131
-4.947
clerosis-4.580
hib-4.433
fare-4.353
Token and
Token ID290
Feature activation-5.32e-02

Pos logit contributions
ovan+0.948
ould+0.930
 rapport+0.852
thinkable+0.852
 Conversation+0.846
Neg logit contributions
rag-1.018
-0.992
clerosis-0.901
hib-0.883
hod-0.877
Token enjoy
Token ID2883
Feature activation-1.00e+00

Pos logit contributions
ovan+0.035
ould+0.035
thinkable+0.032
 rapport+0.031
 Conversation+0.031
Neg logit contributions
rag-0.043
-0.041
clerosis-0.038
 duplication-0.037
hib-0.037
Token watching
Token ID4964
Feature activation-5.01e+00

Pos logit contributions
ovan+0.660
ould+0.647
 rapport+0.589
thinkable+0.587
 Conversation+0.585
Neg logit contributions
rag-0.808
-0.789
clerosis-0.722
hib-0.708
hod-0.703
Token Netflix
Token ID12074
Feature activation-6.36e+00

Pos logit contributions
ovan+3.112
ould+3.010
 rapport+2.748
 Conversation+2.725
thinkable+2.696
Neg logit contributions
rag-4.166
-4.084
clerosis-3.816
hib-3.677
hod-3.671