TOP ACTIVATIONS
MAX = 0.625

has cut Mr . Gian fort e s lead
About Bh alk a T irth Here 4 .
me to continue reviewing audio gear , I d
6 a 443 f 925 b 3 198 14 34 88
a nice Public Service Announce ment ? And what better way
ado obliged spor ad ically , considering the heat .
territory 's chief executive Donald Ts ang said . It follows
Is Richard Nixon in a Pants uit appeared first on Money
: C aj un Bo il | C aj un Kitchen
Type 2 : Sp iny ker at oder ma ( also
47 ] < en 7 _> lol Mang 0 king [
23 ] Di agn osis [ edit ]
manner isms or movement , dressing and grooming , and possibly
as well as Customer Prem ise Equipment ( CP E )
a late - ons et ker at oder ma , inherited
until we got to the dressing room . Sir Alex Ferguson
woman crosses the road while texting on a smart phone in
Is Richard Nixon in a Pants uit was originally published at
in public forums . Says Neh ring , All
a Curiosity co - invest igator . " With only one

MIN ACTIVATIONS
MIN = -0.661

oom in World ( M uum ima ail ma in Finnish
M DB ) has not complied with the bank 's directions
Brazilian prosecutor that Rolls - Roy ce is co - oper
S . The Incarnation . The Life of Christ
the Italian Army had been intern ed by German forces in
based he i ress Beat rice Joyce Ke an set up
, much revised by other screen writers , helped the bottom
ont f uit Mil ne Orbital em port ant a vec
li the Dwarf , Leg olas the Elf , and the
Mush rooms are like snow fl akes . No two are
PNG larger image larg er image T IFF original
has mostly been turned into cond ominium s but has some
action , grabbing her M amiya 7 camera from her bag
that point , and S ibur 's annual report for 2012
. own a series of cond ominium properties , some of
so - called travel ban on certain
also pretty incredible with st ews , grains , sou ps
à cher cher le conf lit a vec nos an ci
mail and requiring a body guard for some of her speaking
, even though they all passed . An all

INTERVAL 0.304 to 0.625
CONTAINS 0.558%

, Munich design student Mor itz Bre it en hu ber
. How could this go wrong ?! Well ,
s che c kered history at the Congress Theater
are also regarded as a special threat or whether they are
here . <|endoftext|> Mike Bloomberg may well be , as consensus

INTERVAL -0.018 to 0.304
CONTAINS 58.924%

a chance to make it in the NFL . He always
monster ! You won t be held back by
saw a crocod ile kill a water buffalo this summer .
s important is that more and more of our country
Tony Ferguson versus Kh ab ib Nur mag omed ov

INTERVAL -0.339 to -0.018
CONTAINS 40.257%

, the arrest of a foreigner is never done casually .
this light before . <|endoftext|> The history of batteries can be
different workforce , one that values efficacy , flexibility , and
he said . " You have to pay attention to all
Metroid speed run race , from Awesome Games Done Quick 2014

INTERVAL -0.661 to -0.339
CONTAINS 0.261%

That is a very dangerous precedent .
www . ed wards . af . mil / history /
status on each reboot , unlike most of other OD IN
Finance Minister filed with the federal Ethics Commissioner 's office say
my kitchen essentials ! <|endoftext|> Posted on by | in
Token has
Token ID468
Feature activation-9.27e-03

Pos logit contributions
 affair+0.298
angering+0.292
cedented+0.289
Access+0.285
addon+0.284
Neg logit contributions
 Sund-0.374
hest-0.366
 [&-0.350
 deterrent-0.329
 guarantees-0.324
Token cut
Token ID2005
Feature activation-1.49e-01

Pos logit contributions
 Sund+0.024
hest+0.024
 [&+0.023
 deterrent+0.021
 guarantees+0.021
Neg logit contributions
 affair-0.018
cedented-0.018
angering-0.018
addon-0.018
Access-0.017
Token Mr
Token ID1770
Feature activation-7.90e-02

Pos logit contributions
 Sund+0.390
hest+0.380
 [&+0.368
 deterrent+0.345
+0.344
Neg logit contributions
 affair-0.293
angering-0.289
cedented-0.288
addon-0.282
Access-0.281
Token.
Token ID13
Feature activation-2.81e-02

Pos logit contributions
 Sund+0.194
hest+0.190
 [&+0.182
 deterrent+0.176
 guarantees+0.171
Neg logit contributions
 affair-0.164
Access-0.161
addon-0.160
angering-0.159
issy-0.159
Token Gian
Token ID30851
Feature activation+1.96e-01

Pos logit contributions
 Sund+0.067
hest+0.066
 [&+0.063
 deterrent+0.059
 guarantees+0.059
Neg logit contributions
 affair-0.061
cedented-0.060
angering-0.060
addon-0.060
issy-0.059
Tokenfort
Token ID3319
Feature activation+6.25e-01

Pos logit contributions
 affair+0.512
cedented+0.508
angering+0.507
addon+0.503
issy+0.499
Neg logit contributions
 Sund-0.384
hest-0.371
 [&-0.350
 deterrent-0.323
 guarantees-0.322
Tokene
Token ID68
Feature activation+1.15e-01

Pos logit contributions
 affair+1.263
addon+1.231
cedented+1.229
issy+1.228
angering+1.227
Neg logit contributions
 Sund-1.590
hest-1.526
 [&-1.474
 deterrent-1.420
 guarantees-1.388
Token
Token ID447
Feature activation+1.37e-02

Pos logit contributions
 affair+0.244
cedented+0.239
angering+0.238
Access+0.235
addon+0.234
Neg logit contributions
 Sund-0.281
hest-0.273
 [&-0.262
 deterrent-0.248
 guarantees-0.244
Token
Token ID247
Feature activation-2.76e-03

Pos logit contributions
 affair+0.028
cedented+0.028
angering+0.028
addon+0.028
issy+0.027
Neg logit contributions
 Sund-0.034
hest-0.034
 [&-0.032
 deterrent-0.030
 guarantees-0.030
Tokens
Token ID82
Feature activation-4.81e-03

Pos logit contributions
 Sund+0.007
hest+0.007
 [&+0.006
 deterrent+0.006
 guarantees+0.006
Neg logit contributions
 affair-0.006
cedented-0.006
angering-0.006
addon-0.006
Access-0.006
Token lead
Token ID1085
Feature activation+4.44e-02

Pos logit contributions
 Sund+0.012
hest+0.011
 [&+0.011
 Canaver+0.010
eks+0.010
Neg logit contributions
 affair-0.010
angering-0.010
cedented-0.010
Access-0.010
issy-0.010
Token About
Token ID7994
Feature activation-3.49e-04

Pos logit contributions
 Sund+0.015
hest+0.014
 [&+0.014
 deterrent+0.013
 guarantees+0.013
Neg logit contributions
 affair-0.010
cedented-0.010
angering-0.010
addon-0.010
Access-0.010
Token Bh
Token ID16581
Feature activation+3.94e-01

Pos logit contributions
 Sund+0.001
hest+0.001
 [&+0.001
 deterrent+0.001
 guarantees+0.001
Neg logit contributions
 affair-0.001
cedented-0.001
angering-0.001
addon-0.001
Access-0.001
Tokenalk
Token ID971
Feature activation-2.41e-02

Pos logit contributions
addon+0.799
 affair+0.799
angering+0.792
issy+0.788
cedented+0.784
Neg logit contributions
 Sund-0.996
hest-0.961
 [&-0.928
 deterrent-0.883
 guarantees-0.880
Tokena
Token ID64
Feature activation+1.11e-02

Pos logit contributions
 Sund+0.073
hest+0.071
 [&+0.069
 deterrent+0.066
 guarantees+0.066
Neg logit contributions
 affair-0.037
angering-0.036
cedented-0.036
addon-0.036
issy-0.036
Token T
Token ID309
Feature activation+1.58e-01

Pos logit contributions
 affair+0.021
cedented+0.021
angering+0.021
addon+0.021
Access+0.020
Neg logit contributions
 Sund-0.030
hest-0.029
 [&-0.028
 deterrent-0.026
 guarantees-0.026
Tokenirth
Token ID3333
Feature activation+6.17e-01

Pos logit contributions
 affair+0.334
angering+0.331
addon+0.329
issy+0.326
cedented+0.325
Neg logit contributions
 Sund-0.384
hest-0.376
 [&-0.361
 guarantees-0.342
 deterrent-0.341
Token Here
Token ID3423
Feature activation-1.59e-01

Pos logit contributions
 affair+1.167
cedented+1.146
angering+1.144
addon+1.134
Access+1.125
Neg logit contributions
 Sund-1.632
hest-1.614
 [&-1.548
 deterrent-1.471
 guarantees-1.460
Token
Token ID198
Feature activation+7.14e-02

Pos logit contributions
 Sund+0.429
hest+0.421
 [&+0.401
 deterrent+0.382
 guarantees+0.380
Neg logit contributions
 affair-0.298
cedented-0.294
angering-0.292
addon-0.289
Access-0.288
Token
Token ID198
Feature activation+7.91e-02

Pos logit contributions
 affair+0.125
Access+0.122
cedented+0.122
addon+0.121
angering+0.121
Neg logit contributions
 Sund-0.200
hest-0.196
 [&-0.186
 deterrent-0.182
 guarantees-0.178
Token4
Token ID19
Feature activation-4.47e-02

Pos logit contributions
 affair+0.148
cedented+0.146
angering+0.145
Access+0.145
addon+0.145
Neg logit contributions
 Sund-0.214
hest-0.208
 [&-0.199
 deterrent-0.190
 guarantees-0.189
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
 Sund+0.108
hest+0.106
 [&+0.101
 deterrent+0.095
 guarantees+0.094
Neg logit contributions
 affair-0.096
angering-0.095
cedented-0.095
addon-0.094
Access-0.093
Token me
Token ID502
Feature activation-3.33e-02

Pos logit contributions
 Sund+0.077
hest+0.076
 [&+0.072
 deterrent+0.068
 guarantees+0.068
Neg logit contributions
 affair-0.064
angering-0.063
cedented-0.063
addon-0.062
Access-0.062
Token to
Token ID284
Feature activation-5.59e-03

Pos logit contributions
 Sund+0.069
hest+0.067
 [&+0.063
 deterrent+0.059
 guarantees+0.059
Neg logit contributions
 affair-0.083
cedented-0.083
angering-0.082
addon-0.082
issy-0.081
Token continue
Token ID2555
Feature activation+7.27e-02

Pos logit contributions
 Sund+0.015
hest+0.014
 [&+0.014
 deterrent+0.013
 guarantees+0.013
Neg logit contributions
 affair-0.011
angering-0.011
addon-0.011
cedented-0.011
issy-0.011
Token reviewing
Token ID17217
Feature activation-4.23e-02

Pos logit contributions
 affair+0.136
angering+0.133
cedented+0.132
addon+0.132
Access+0.130
Neg logit contributions
 Sund-0.197
hest-0.194
 [&-0.185
 deterrent-0.175
 guarantees-0.175
Token audio
Token ID6597
Feature activation+3.76e-02

Pos logit contributions
 Sund+0.108
hest+0.106
 [&+0.102
 deterrent+0.096
 guarantees+0.095
Neg logit contributions
 affair-0.085
cedented-0.083
angering-0.083
addon-0.083
Access-0.082
Token gear
Token ID7733
Feature activation+5.96e-01

Pos logit contributions
 affair+0.082
angering+0.081
cedented+0.080
addon+0.080
Access+0.079
Neg logit contributions
 Sund-0.090
hest-0.088
 [&-0.084
 deterrent-0.078
-0.078
Token,
Token ID11
Feature activation+4.16e-02

Pos logit contributions
 affair+1.174
angering+1.158
cedented+1.157
addon+1.145
issy+1.132
Neg logit contributions
 Sund-1.540
hest-1.505
 [&-1.439
 deterrent-1.365
 guarantees-1.347
Token I
Token ID314
Feature activation-1.42e-03

Pos logit contributions
 affair+0.087
angering+0.085
cedented+0.085
addon+0.084
Access+0.084
Neg logit contributions
 Sund-0.103
hest-0.101
 [&-0.096
 deterrent-0.091
 guarantees-0.090
Token
Token ID447
Feature activation+2.03e-02

Pos logit contributions
 Sund+0.003
hest+0.003
 [&+0.003
 deterrent+0.003
 guarantees+0.003
Neg logit contributions
 affair-0.003
cedented-0.003
angering-0.003
addon-0.003
Access-0.003
Token
Token ID247
Feature activation+6.98e-02

Pos logit contributions
 affair+0.044
cedented+0.044
angering+0.043
addon+0.043
Access+0.043
Neg logit contributions
 Sund-0.049
hest-0.048
 [&-0.045
 deterrent-0.043
 guarantees-0.042
Tokend
Token ID67
Feature activation+1.92e-01

Pos logit contributions
 affair+0.139
cedented+0.137
angering+0.136
addon+0.135
issy+0.134
Neg logit contributions
 Sund-0.181
hest-0.176
 [&-0.170
 deterrent-0.160
 guarantees-0.159
Token6
Token ID21
Feature activation+3.11e-02

Pos logit contributions
 affair+0.360
angering+0.356
cedented+0.354
addon+0.352
Access+0.350
Neg logit contributions
 Sund-0.409
hest-0.399
 [&-0.373
 deterrent-0.359
 guarantees-0.353
Tokena
Token ID64
Feature activation+1.46e-01

Pos logit contributions
 affair+0.067
cedented+0.066
angering+0.066
addon+0.065
Access+0.065
Neg logit contributions
 Sund-0.076
hest-0.074
 [&-0.069
 deterrent-0.066
 guarantees-0.066
Token443
Token ID34938
Feature activation-1.44e-01

Pos logit contributions
 affair+0.310
angering+0.306
cedented+0.305
addon+0.304
Access+0.302
Neg logit contributions
 Sund-0.357
hest-0.347
 [&-0.325
 deterrent-0.313
 guarantees-0.309
Tokenf
Token ID69
Feature activation+2.26e-01

Pos logit contributions
 Sund+0.353
hest+0.344
 [&+0.325
 deterrent+0.309
 guarantees+0.307
Neg logit contributions
 affair-0.306
angering-0.302
cedented-0.302
addon-0.300
Access-0.297
Token925
Token ID46351
Feature activation+3.51e-02

Pos logit contributions
 affair+0.482
angering+0.480
cedented+0.476
addon+0.476
Access+0.470
Neg logit contributions
 Sund-0.552
hest-0.536
 [&-0.502
 deterrent-0.486
 guarantees-0.477
Tokenb
Token ID65
Feature activation+5.73e-01

Pos logit contributions
 affair+0.076
angering+0.076
cedented+0.076
addon+0.075
Access+0.074
Neg logit contributions
 Sund-0.084
hest-0.082
 [&-0.077
 deterrent-0.074
 guarantees-0.073
Token3
Token ID18
Feature activation+1.14e-01

Pos logit contributions
angering+1.206
 affair+1.203
addon+1.200
cedented+1.188
issy+1.181
Neg logit contributions
 Sund-1.407
hest-1.360
 [&-1.265
 deterrent-1.244
 guarantees-1.213
Token198
Token ID22337
Feature activation+5.10e-02

Pos logit contributions
 affair+0.247
cedented+0.244
angering+0.243
addon+0.242
Access+0.239
Neg logit contributions
 Sund-0.277
hest-0.269
 [&-0.254
 deterrent-0.242
 guarantees-0.240
Token14
Token ID1415
Feature activation+1.62e-02

Pos logit contributions
 affair+0.111
angering+0.110
cedented+0.109
addon+0.109
Access+0.108
Neg logit contributions
 Sund-0.123
hest-0.119
 [&-0.112
 deterrent-0.107
 guarantees-0.106
Token34
Token ID2682
Feature activation+1.24e-01

Pos logit contributions
 affair+0.035
angering+0.035
cedented+0.035
addon+0.035
Access+0.035
Neg logit contributions
 Sund-0.039
hest-0.038
 [&-0.035
 deterrent-0.034
 guarantees-0.033
Token88
Token ID3459
Feature activation-5.79e-02

Pos logit contributions
 affair+0.271
angering+0.268
cedented+0.268
addon+0.267
Access+0.264
Neg logit contributions
 Sund-0.295
hest-0.286
 [&-0.268
 deterrent-0.258
 guarantees-0.255
Token a
Token ID257
Feature activation-4.34e-02

Pos logit contributions
 affair+0.208
angering+0.203
cedented+0.201
addon+0.199
Access+0.198
Neg logit contributions
 Sund-0.244
hest-0.243
 [&-0.230
-0.215
 deterrent-0.214
Token nice
Token ID3621
Feature activation-5.43e-02

Pos logit contributions
 Sund+0.109
hest+0.107
 [&+0.102
 guarantees+0.096
 deterrent+0.096
Neg logit contributions
 affair-0.089
angering-0.087
cedented-0.087
addon-0.086
issy-0.086
Token Public
Token ID5094
Feature activation-5.31e-02

Pos logit contributions
 Sund+0.143
hest+0.141
 [&+0.134
 guarantees+0.127
 deterrent+0.127
Neg logit contributions
 affair-0.105
cedented-0.102
angering-0.102
addon-0.101
issy-0.101
Token Service
Token ID4809
Feature activation+2.34e-01

Pos logit contributions
 Sund+0.152
hest+0.149
 [&+0.142
 deterrent+0.136
 guarantees+0.135
Neg logit contributions
 affair-0.092
angering-0.090
cedented-0.089
addon-0.088
Access-0.088
Token Announce
Token ID43470
Feature activation+1.53e-01

Pos logit contributions
 affair+0.304
cedented+0.297
angering+0.296
addon+0.291
Access+0.287
Neg logit contributions
 Sund-0.767
hest-0.753
 [&-0.727
 deterrent-0.695
 guarantees-0.694
Tokenment
Token ID434
Feature activation+5.71e-01

Pos logit contributions
 affair+0.269
cedented+0.263
angering+0.262
addon+0.259
Access+0.257
Neg logit contributions
 Sund-0.431
hest-0.423
 [&-0.404
 deterrent-0.385
 guarantees-0.384
Token?
Token ID30
Feature activation+1.52e-01

Pos logit contributions
 affair+1.223
angering+1.167
Access+1.161
cedented+1.159
addon+1.158
Neg logit contributions
hest-1.389
 Sund-1.384
 [&-1.287
 deterrent-1.224
wal-1.222
Token And
Token ID843
Feature activation+5.76e-02

Pos logit contributions
 affair+0.276
angering+0.269
cedented+0.269
Access+0.267
addon+0.266
Neg logit contributions
 Sund-0.416
hest-0.411
 [&-0.391
 deterrent-0.375
 guarantees-0.372
Token what
Token ID644
Feature activation-5.24e-03

Pos logit contributions
 affair+0.117
angering+0.114
cedented+0.113
addon+0.113
issy+0.112
Neg logit contributions
 Sund-0.146
hest-0.143
 [&-0.137
 deterrent-0.129
 guarantees-0.128
Token better
Token ID1365
Feature activation+1.24e-01

Pos logit contributions
 Sund+0.013
hest+0.013
 [&+0.012
+0.011
 deterrent+0.011
Neg logit contributions
 affair-0.011
angering-0.011
cedented-0.011
addon-0.011
Access-0.011
Token way
Token ID835
Feature activation+4.18e-02

Pos logit contributions
 affair+0.333
angering+0.322
cedented+0.321
addon+0.321
Access+0.320
Neg logit contributions
 Sund-0.236
hest-0.233
 [&-0.218
-0.198
 guarantees-0.197
Tokenado
Token ID4533
Feature activation+2.78e-02

Pos logit contributions
 affair+0.148
cedented+0.144
angering+0.144
addon+0.144
issy+0.142
Neg logit contributions
 Sund-0.236
hest-0.231
 [&-0.221
 deterrent-0.211
 guarantees-0.209
Token obliged
Token ID23278
Feature activation-1.84e-01

Pos logit contributions
 affair+0.052
cedented+0.051
angering+0.051
addon+0.050
Access+0.049
Neg logit contributions
 Sund-0.076
hest-0.074
 [&-0.071
 deterrent-0.067
 guarantees-0.067
Token
Token ID784
Feature activation+1.45e-02

Pos logit contributions
 Sund+0.474
hest+0.459
 [&+0.439
 guarantees+0.416
 deterrent+0.415
Neg logit contributions
 affair-0.369
cedented-0.368
angering-0.366
addon-0.361
quished-0.359
Token spor
Token ID35810
Feature activation-9.87e-02

Pos logit contributions
 affair+0.030
cedented+0.029
angering+0.029
addon+0.029
quished+0.029
Neg logit contributions
 Sund-0.037
hest-0.036
 [&-0.034
 deterrent-0.032
 guarantees-0.032
Tokenad
Token ID324
Feature activation+7.79e-02

Pos logit contributions
 Sund+0.286
hest+0.279
 [&+0.268
 deterrent+0.255
 guarantees+0.254
Neg logit contributions
 affair-0.166
cedented-0.164
angering-0.164
addon-0.161
Access-0.160
Tokenically
Token ID1146
Feature activation+5.66e-01

Pos logit contributions
 affair+0.151
angering+0.150
cedented+0.150
addon+0.147
issy+0.146
Neg logit contributions
 Sund-0.206
hest-0.201
 [&-0.192
 deterrent-0.182
 guarantees-0.181
Token,
Token ID11
Feature activation+1.92e-02

Pos logit contributions
 affair+1.113
angering+1.090
addon+1.063
cedented+1.062
Access+1.058
Neg logit contributions
 Sund-1.475
hest-1.462
 [&-1.401
 deterrent-1.308
 guarantees-1.298
Token considering
Token ID6402
Feature activation+1.63e-01

Pos logit contributions
 affair+0.039
angering+0.039
cedented+0.038
addon+0.038
Access+0.038
Neg logit contributions
 Sund-0.049
hest-0.048
 [&-0.046
 deterrent-0.043
 guarantees-0.042
Token the
Token ID262
Feature activation+9.11e-03

Pos logit contributions
 affair+0.325
angering+0.319
cedented+0.316
addon+0.315
Access+0.311
Neg logit contributions
 Sund-0.420
hest-0.412
 [&-0.397
 deterrent-0.372
-0.369
Token heat
Token ID4894
Feature activation+1.53e-02

Pos logit contributions
 affair+0.019
angering+0.019
cedented+0.018
addon+0.018
Access+0.018
Neg logit contributions
 Sund-0.023
hest-0.022
 [&-0.021
-0.020
 deterrent-0.020
Token.
Token ID13
Feature activation+3.27e-02

Pos logit contributions
 affair+0.031
angering+0.031
cedented+0.031
addon+0.030
Access+0.030
Neg logit contributions
 Sund-0.039
hest-0.038
 [&-0.036
 deterrent-0.034
 guarantees-0.034
Token territory
Token ID7674
Feature activation-8.27e-02

Pos logit contributions
 affair+0.077
angering+0.076
cedented+0.075
addon+0.074
issy+0.074
Neg logit contributions
 Sund-0.102
hest-0.101
 [&-0.097
-0.090
 guarantees-0.090
Token's
Token ID338
Feature activation+1.05e-02

Pos logit contributions
 Sund+0.202
hest+0.198
 [&+0.189
 deterrent+0.176
+0.176
Neg logit contributions
 affair-0.176
angering-0.173
cedented-0.173
addon-0.172
Access-0.170
Token chief
Token ID4039
Feature activation-2.70e-02

Pos logit contributions
 affair+0.024
angering+0.023
cedented+0.023
addon+0.023
Access+0.023
Neg logit contributions
 Sund-0.024
hest-0.024
 [&-0.023
-0.021
 guarantees-0.021
Token executive
Token ID4640
Feature activation+9.35e-02

Pos logit contributions
 Sund+0.060
hest+0.059
 [&+0.056
 guarantees+0.051
+0.051
Neg logit contributions
 affair-0.064
angering-0.063
cedented-0.063
addon-0.062
issy-0.062
Token Donald
Token ID3759
Feature activation-1.18e-01

Pos logit contributions
 affair+0.164
angering+0.160
cedented+0.159
addon+0.158
issy+0.157
Neg logit contributions
 Sund-0.260
hest-0.260
 [&-0.250
 deterrent-0.234
-0.234
Token Ts
Token ID13146
Feature activation+5.64e-01

Pos logit contributions
 Sund+0.308
hest+0.305
 [&+0.292
 deterrent+0.277
 guarantees+0.276
Neg logit contributions
 affair-0.226
angering-0.221
cedented-0.221
addon-0.220
issy-0.219
Tokenang
Token ID648
Feature activation-4.94e-02

Pos logit contributions
 affair+0.902
cedented+0.895
angering+0.892
addon+0.884
issy+0.869
Neg logit contributions
 Sund-1.660
hest-1.623
 [&-1.579
 deterrent-1.500
 guarantees-1.494
Token said
Token ID531
Feature activation-3.75e-02

Pos logit contributions
 Sund+0.138
hest+0.136
 [&+0.130
 deterrent+0.123
 guarantees+0.123
Neg logit contributions
 affair-0.088
cedented-0.087
angering-0.086
addon-0.086
issy-0.084
Token.
Token ID13
Feature activation-9.82e-03

Pos logit contributions
 Sund+0.092
hest+0.090
 [&+0.086
 deterrent+0.080
 guarantees+0.080
Neg logit contributions
 affair-0.080
angering-0.079
cedented-0.079
addon-0.078
issy-0.077
Token It
Token ID632
Feature activation+1.03e-02

Pos logit contributions
 Sund+0.027
hest+0.026
 [&+0.025
 deterrent+0.024
 guarantees+0.024
Neg logit contributions
 affair-0.018
cedented-0.018
angering-0.018
addon-0.017
Access-0.017
Token follows
Token ID5679
Feature activation-1.69e-01

Pos logit contributions
 affair+0.020
angering+0.019
cedented+0.019
addon+0.019
issy+0.019
Neg logit contributions
 Sund-0.027
hest-0.026
 [&-0.025
 deterrent-0.024
-0.024
Token Is
Token ID1148
Feature activation+5.08e-03

Pos logit contributions
 affair+0.160
angering+0.156
cedented+0.155
Access+0.153
addon+0.153
Neg logit contributions
 Sund-0.217
hest-0.215
 [&-0.205
 deterrent-0.194
 guarantees-0.191
Token Richard
Token ID6219
Feature activation+3.98e-02

Pos logit contributions
 affair+0.009
cedented+0.009
angering+0.009
addon+0.009
Access+0.009
Neg logit contributions
 Sund-0.014
hest-0.014
 [&-0.013
 deterrent-0.013
 guarantees-0.013
Token Nixon
Token ID16642
Feature activation+1.61e-01

Pos logit contributions
 affair+0.113
cedented+0.112
angering+0.112
addon+0.111
issy+0.111
Neg logit contributions
 Sund-0.069
hest-0.066
 [&-0.062
 deterrent-0.056
 guarantees-0.056
Token in
Token ID287
Feature activation+1.13e-01

Pos logit contributions
 affair+0.347
angering+0.340
cedented+0.339
quished+0.336
addon+0.335
Neg logit contributions
 Sund-0.388
hest-0.381
 [&-0.364
 deterrent-0.341
eks-0.338
Token a
Token ID257
Feature activation+5.45e-02

Pos logit contributions
 affair+0.229
angering+0.226
cedented+0.226
addon+0.223
Access+0.222
Neg logit contributions
 Sund-0.288
hest-0.282
 [&-0.270
 deterrent-0.254
 guarantees-0.253
Token Pants
Token ID41689
Feature activation+5.56e-01

Pos logit contributions
 affair+0.106
cedented+0.105
angering+0.104
addon+0.103
issy+0.102
Neg logit contributions
 Sund-0.144
hest-0.140
 [&-0.134
 deterrent-0.127
 guarantees-0.127
Tokenuit
Token ID5013
Feature activation-8.63e-03

Pos logit contributions
 affair+1.000
cedented+0.948
addon+0.945
angering+0.943
Access+0.942
Neg logit contributions
 Sund-1.556
hest-1.490
 [&-1.454
 deterrent-1.399
 guarantees-1.391
Token appeared
Token ID4120
Feature activation+3.87e-01

Pos logit contributions
 Sund+0.022
hest+0.022
 [&+0.021
 deterrent+0.020
 guarantees+0.020
Neg logit contributions
 affair-0.017
cedented-0.017
angering-0.017
addon-0.017
Access-0.017
Token first
Token ID717
Feature activation+9.18e-02

Pos logit contributions
 affair+0.757
angering+0.742
cedented+0.739
issy+0.730
addon+0.730
Neg logit contributions
 Sund-1.007
hest-0.990
 [&-0.947
 deterrent-0.895
 guarantees-0.888
Token on
Token ID319
Feature activation+7.91e-02

Pos logit contributions
 affair+0.174
angering+0.169
cedented+0.169
addon+0.169
Access+0.167
Neg logit contributions
 Sund-0.246
hest-0.242
 [&-0.232
 deterrent-0.218
 guarantees-0.217
Token Money
Token ID12911
Feature activation+9.06e-02

Pos logit contributions
 affair+0.123
cedented+0.122
angering+0.122
addon+0.120
quished+0.119
Neg logit contributions
 Sund-0.239
hest-0.233
 [&-0.224
 guarantees-0.214
 deterrent-0.214
Token:
Token ID25
Feature activation-3.62e-02

Pos logit contributions
 affair+0.300
angering+0.293
cedented+0.293
addon+0.290
issy+0.288
Neg logit contributions
 Sund-0.398
hest-0.391
 [&-0.375
 deterrent-0.353
 guarantees-0.350
Token C
Token ID327
Feature activation-5.00e-03

Pos logit contributions
 Sund+0.094
hest+0.092
 [&+0.088
 deterrent+0.083
 guarantees+0.083
Neg logit contributions
 affair-0.072
angering-0.071
cedented-0.071
addon-0.070
issy-0.069
Tokenaj
Token ID1228
Feature activation-7.32e-02

Pos logit contributions
 Sund+0.012
hest+0.012
 [&+0.011
 deterrent+0.011
 guarantees+0.011
Neg logit contributions
 affair-0.011
cedented-0.011
angering-0.011
addon-0.010
issy-0.010
Tokenun
Token ID403
Feature activation-7.67e-02

Pos logit contributions
 Sund+0.200
hest+0.195
 [&+0.187
 deterrent+0.177
 guarantees+0.177
Neg logit contributions
 affair-0.135
cedented-0.134
angering-0.134
addon-0.132
issy-0.131
Token Bo
Token ID3248
Feature activation+5.21e-02

Pos logit contributions
 Sund+0.194
hest+0.190
 [&+0.181
 deterrent+0.171
 guarantees+0.170
Neg logit contributions
 affair-0.157
cedented-0.154
angering-0.154
addon-0.152
issy-0.151
Tokenil
Token ID346
Feature activation+5.41e-01

Pos logit contributions
 affair+0.132
angering+0.131
cedented+0.131
addon+0.130
issy+0.130
Neg logit contributions
 Sund-0.106
hest-0.102
 [&-0.096
 deterrent-0.090
 guarantees-0.090
Token |
Token ID930
Feature activation+1.94e-01

Pos logit contributions
 affair+1.048
angering+1.019
cedented+1.018
issy+1.009
addon+1.005
Neg logit contributions
 Sund-1.413
hest-1.397
 [&-1.331
 deterrent-1.262
 guarantees-1.254
Token C
Token ID327
Feature activation+9.75e-02

Pos logit contributions
 affair+0.404
angering+0.396
cedented+0.395
issy+0.391
quished+0.390
Neg logit contributions
 Sund-0.478
hest-0.473
 [&-0.450
 deterrent-0.425
 guarantees-0.423
Tokenaj
Token ID1228
Feature activation-1.45e-02

Pos logit contributions
 affair+0.204
cedented+0.203
angering+0.202
issy+0.202
addon+0.201
Neg logit contributions
 Sund-0.240
hest-0.233
 [&-0.223
 deterrent-0.213
 guarantees-0.210
Tokenun
Token ID403
Feature activation-2.03e-02

Pos logit contributions
 Sund+0.039
hest+0.038
 [&+0.037
 deterrent+0.035
 guarantees+0.035
Neg logit contributions
 affair-0.027
cedented-0.027
angering-0.026
addon-0.026
issy-0.026
Token Kitchen
Token ID22541
Feature activation-9.02e-02

Pos logit contributions
 Sund+0.043
hest+0.041
 [&+0.039
 deterrent+0.036
 guarantees+0.036
Neg logit contributions
 affair-0.050
cedented-0.050
angering-0.050
addon-0.049
Access-0.049
Token Type
Token ID5994
Feature activation-1.44e-02

Pos logit contributions
 affair+0.172
cedented+0.170
angering+0.170
addon+0.168
Access+0.166
Neg logit contributions
 Sund-0.243
hest-0.237
 [&-0.227
 deterrent-0.215
 guarantees-0.215
Token 2
Token ID362
Feature activation+1.09e-01

Pos logit contributions
 Sund+0.038
hest+0.037
 [&+0.036
 deterrent+0.034
 guarantees+0.034
Neg logit contributions
 affair-0.028
cedented-0.027
angering-0.027
addon-0.027
issy-0.027
Token:
Token ID25
Feature activation-4.68e-05

Pos logit contributions
 affair+0.210
cedented+0.208
angering+0.208
addon+0.205
quished+0.203
Neg logit contributions
 Sund-0.287
hest-0.279
 [&-0.268
 deterrent-0.252
 guarantees-0.252
Token Sp
Token ID1338
Feature activation+1.10e-02

Pos logit contributions
 Sund+0.000
hest+0.000
 [&+0.000
 deterrent+0.000
 guarantees+0.000
Neg logit contributions
 affair-0.000
cedented-0.000
angering-0.000
addon-0.000
issy-0.000
Tokeniny
Token ID3541
Feature activation-4.09e-02

Pos logit contributions
 affair+0.022
cedented+0.022
angering+0.021
addon+0.021
Access+0.021
Neg logit contributions
 Sund-0.029
hest-0.028
 [&-0.027
 deterrent-0.025
 guarantees-0.025
Token ker
Token ID41927
Feature activation+5.40e-01

Pos logit contributions
 Sund+0.102
hest+0.100
 [&+0.095
 deterrent+0.090
 guarantees+0.089
Neg logit contributions
 affair-0.085
angering-0.084
cedented-0.084
addon-0.083
Access-0.082
Tokenat
Token ID265
Feature activation+2.49e-01

Pos logit contributions
 affair+0.845
cedented+0.833
angering+0.831
addon+0.820
issy+0.808
Neg logit contributions
 Sund-1.628
hest-1.592
 [&-1.533
 deterrent-1.460
 guarantees-1.457
Tokenoder
Token ID12342
Feature activation+2.99e-01

Pos logit contributions
 affair+0.545
angering+0.539
cedented+0.538
addon+0.536
issy+0.529
Neg logit contributions
 Sund-0.595
hest-0.576
 [&-0.550
 deterrent-0.519
 guarantees-0.516
Tokenma
Token ID2611
Feature activation+1.52e-01

Pos logit contributions
 affair+0.558
cedented+0.550
angering+0.549
addon+0.546
issy+0.539
Neg logit contributions
 Sund-0.810
hest-0.791
 [&-0.757
 deterrent-0.717
 guarantees-0.715
Token (
Token ID357
Feature activation-2.25e-03

Pos logit contributions
 affair+0.306
angering+0.300
cedented+0.299
addon+0.297
Access+0.294
Neg logit contributions
 Sund-0.387
hest-0.381
 [&-0.363
 deterrent-0.343
 guarantees-0.342
Tokenalso
Token ID14508
Feature activation+2.53e-02

Pos logit contributions
 Sund+0.006
hest+0.006
 [&+0.006
 deterrent+0.005
 guarantees+0.005
Neg logit contributions
 affair-0.004
cedented-0.004
angering-0.004
addon-0.004
Access-0.004
Token47
Token ID2857
Feature activation+1.29e-01

Pos logit contributions
 affair+0.450
cedented+0.442
addon+0.440
angering+0.440
Access+0.439
Neg logit contributions
 Sund-0.579
hest-0.564
 [&-0.535
 deterrent-0.513
 guarantees-0.509
Token]
Token ID60
Feature activation+3.87e-01

Pos logit contributions
 affair+0.208
cedented+0.203
addon+0.202
Access+0.201
angering+0.201
Neg logit contributions
 Sund-0.382
hest-0.374
 [&-0.358
 deterrent-0.348
 guarantees-0.343
Token <
Token ID1279
Feature activation+3.51e-01

Pos logit contributions
 affair+0.605
cedented+0.598
angering+0.597
addon+0.587
Access+0.579
Neg logit contributions
 Sund-1.169
hest-1.140
 [&-1.100
 deterrent-1.046
 guarantees-1.044
Tokenen
Token ID268
Feature activation+2.77e-01

Pos logit contributions
 affair+0.719
cedented+0.718
addon+0.716
angering+0.715
issy+0.707
Neg logit contributions
 Sund-0.881
hest-0.850
 [&-0.822
 deterrent-0.778
 guarantees-0.776
Token7
Token ID22
Feature activation+1.31e-01

Pos logit contributions
 affair+0.599
cedented+0.597
angering+0.595
addon+0.590
issy+0.587
Neg logit contributions
 Sund-0.667
hest-0.646
 [&-0.619
 deterrent-0.584
 guarantees-0.580
Token_>
Token ID49029
Feature activation+5.40e-01

Pos logit contributions
 affair+0.280
cedented+0.279
angering+0.277
addon+0.276
issy+0.274
Neg logit contributions
 Sund-0.321
hest-0.310
 [&-0.297
 deterrent-0.281
 guarantees-0.280
Token lol
Token ID19462
Feature activation+3.37e-01

Pos logit contributions
 affair+1.045
angering+1.010
cedented+1.008
addon+1.003
issy+1.001
Neg logit contributions
 Sund-1.416
hest-1.393
 [&-1.328
 deterrent-1.260
 guarantees-1.256
Token Mang
Token ID27609
Feature activation+1.78e-01

Pos logit contributions
 affair+0.657
cedented+0.644
angering+0.643
addon+0.635
issy+0.633
Neg logit contributions
 Sund-0.886
hest-0.866
 [&-0.827
 deterrent-0.782
 guarantees-0.782
Token0
Token ID15
Feature activation+3.72e-01

Pos logit contributions
 affair+0.355
angering+0.351
cedented+0.351
addon+0.348
issy+0.346
Neg logit contributions
 Sund-0.455
hest-0.444
 [&-0.424
 deterrent-0.403
 guarantees-0.401
Tokenking
Token ID3364
Feature activation-9.35e-02

Pos logit contributions
 affair+0.873
cedented+0.863
angering+0.862
addon+0.857
issy+0.850
Neg logit contributions
 Sund-0.825
hest-0.803
 [&-0.759
 deterrent-0.712
 guarantees-0.711
Token [
Token ID685
Feature activation+3.10e-01

Pos logit contributions
 Sund+0.210
hest+0.204
 [&+0.194
 deterrent+0.181
 guarantees+0.180
Neg logit contributions
 affair-0.218
cedented-0.216
angering-0.215
addon-0.213
issy-0.211
Token23
Token ID1954
Feature activation+1.97e-02

Pos logit contributions
 affair+0.092
cedented+0.091
angering+0.091
addon+0.091
Access+0.090
Neg logit contributions
 Sund-0.114
hest-0.110
 [&-0.105
 deterrent-0.100
 guarantees-0.099
Token]
Token ID60
Feature activation+1.73e-01

Pos logit contributions
 affair+0.037
angering+0.036
cedented+0.036
addon+0.036
quished+0.035
Neg logit contributions
 Sund-0.053
hest-0.052
 [&-0.050
-0.047
 deterrent-0.047
Token
Token ID198
Feature activation+9.84e-03

Pos logit contributions
 affair+0.320
cedented+0.315
angering+0.314
addon+0.311
Access+0.309
Neg logit contributions
 Sund-0.468
hest-0.459
 [&-0.439
 deterrent-0.418
 guarantees-0.416
Token
Token ID198
Feature activation+2.45e-02

Pos logit contributions
 affair+0.017
Access+0.017
addon+0.017
cedented+0.017
angering+0.017
Neg logit contributions
 Sund-0.027
hest-0.027
 [&-0.026
 deterrent-0.025
 guarantees-0.024
TokenDi
Token ID18683
Feature activation+1.52e-02

Pos logit contributions
 affair+0.045
cedented+0.045
angering+0.045
addon+0.045
Access+0.044
Neg logit contributions
 Sund-0.066
hest-0.065
 [&-0.062
 deterrent-0.059
 guarantees-0.059
Tokenagn
Token ID4660
Feature activation+5.40e-01

Pos logit contributions
 affair+0.030
angering+0.030
cedented+0.030
addon+0.029
quished+0.029
Neg logit contributions
 Sund-0.040
hest-0.038
 [&-0.037
 deterrent-0.035
 guarantees-0.035
Tokenosis
Token ID5958
Feature activation+5.29e-02

Pos logit contributions
 affair+0.735
angering+0.722
cedented+0.722
addon+0.712
quished+0.697
Neg logit contributions
 Sund-1.739
hest-1.694
 [&-1.640
 deterrent-1.573
 guarantees-1.565
Token [
Token ID685
Feature activation+2.58e-02

Pos logit contributions
 affair+0.144
cedented+0.144
angering+0.143
addon+0.142
issy+0.141
Neg logit contributions
 Sund-0.098
hest-0.094
 [&-0.089
 deterrent-0.081
 guarantees-0.081
Token edit
Token ID4370
Feature activation+1.24e-01

Pos logit contributions
 affair+0.052
cedented+0.051
addon+0.051
angering+0.051
Access+0.051
Neg logit contributions
 Sund-0.066
hest-0.064
 [&-0.061
 deterrent-0.059
 guarantees-0.058
Token ]
Token ID2361
Feature activation-1.69e-02

Pos logit contributions
 affair+0.160
cedented+0.155
angering+0.154
addon+0.153
Access+0.152
Neg logit contributions
 Sund-0.408
hest-0.400
 [&-0.385
 deterrent-0.373
 guarantees-0.370
Token
Token ID198
Feature activation-7.33e-03

Pos logit contributions
 Sund+0.046
hest+0.045
 [&+0.043
 deterrent+0.041
 guarantees+0.041
Neg logit contributions
 affair-0.031
cedented-0.030
angering-0.030
addon-0.030
Access-0.030
Token manner
Token ID5642
Feature activation+2.40e-01

Pos logit contributions
 affair+0.352
angering+0.345
cedented+0.342
addon+0.339
issy+0.337
Neg logit contributions
 Sund-0.480
hest-0.469
 [&-0.449
 deterrent-0.422
 guarantees-0.421
Tokenisms
Token ID6583
Feature activation-1.65e-01

Pos logit contributions
 affair+0.471
cedented+0.465
angering+0.464
addon+0.459
issy+0.455
Neg logit contributions
 Sund-0.630
hest-0.614
 [&-0.588
 deterrent-0.555
 guarantees-0.553
Token or
Token ID393
Feature activation-3.59e-02

Pos logit contributions
 Sund+0.430
hest+0.419
 [&+0.401
 deterrent+0.378
 guarantees+0.378
Neg logit contributions
 affair-0.327
cedented-0.323
angering-0.322
addon-0.319
issy-0.315
Token movement
Token ID3356
Feature activation+1.76e-01

Pos logit contributions
 Sund+0.084
hest+0.082
 [&+0.078
 deterrent+0.072
+0.072
Neg logit contributions
 affair-0.081
angering-0.080
cedented-0.079
addon-0.078
issy-0.078
Token,
Token ID11
Feature activation+2.27e-02

Pos logit contributions
 affair+0.362
angering+0.353
cedented+0.351
addon+0.349
issy+0.348
Neg logit contributions
 Sund-0.447
hest-0.438
 [&-0.418
 deterrent-0.391
 guarantees-0.388
Token dressing
Token ID18544
Feature activation+5.33e-01

Pos logit contributions
 affair+0.046
angering+0.045
cedented+0.045
addon+0.044
issy+0.044
Neg logit contributions
 Sund-0.058
hest-0.057
 [&-0.055
 deterrent-0.051
 guarantees-0.051
Token and
Token ID290
Feature activation+2.59e-02

Pos logit contributions
 affair+1.087
angering+1.061
cedented+1.051
addon+1.046
issy+1.040
Neg logit contributions
 Sund-1.358
hest-1.323
 [&-1.267
 deterrent-1.187
 guarantees-1.182
Token grooming
Token ID41836
Feature activation-9.74e-02

Pos logit contributions
 affair+0.054
angering+0.053
cedented+0.053
addon+0.052
Access+0.052
Neg logit contributions
 Sund-0.065
hest-0.064
 [&-0.061
 deterrent-0.057
 guarantees-0.057
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
 Sund+0.255
hest+0.248
 [&+0.237
 deterrent+0.224
 guarantees+0.224
Neg logit contributions
 affair-0.191
cedented-0.189
angering-0.189
addon-0.187
issy-0.185
Token and
Token ID290
Feature activation+3.49e-02

Pos logit contributions
 affair+0.040
angering+0.039
cedented+0.039
addon+0.039
Access+0.038
Neg logit contributions
 Sund-0.054
hest-0.052
 [&-0.050
 deterrent-0.047
 guarantees-0.047
Token possibly
Token ID5457
Feature activation-2.09e-02

Pos logit contributions
 affair+0.070
angering+0.068
cedented+0.068
addon+0.067
Access+0.067
Neg logit contributions
 Sund-0.091
hest-0.089
 [&-0.085
 deterrent-0.079
 guarantees-0.079
Token as
Token ID355
Feature activation-6.78e-02

Pos logit contributions
 Sund+0.361
hest+0.352
 [&+0.336
 deterrent+0.315
 guarantees+0.314
Neg logit contributions
 affair-0.312
cedented-0.308
angering-0.308
addon-0.304
Access-0.302
Token well
Token ID880
Feature activation+5.51e-03

Pos logit contributions
 Sund+0.189
hest+0.186
 [&+0.178
 guarantees+0.168
 deterrent+0.168
Neg logit contributions
 affair-0.120
cedented-0.118
angering-0.118
addon-0.116
Access-0.116
Token as
Token ID355
Feature activation-1.36e-01

Pos logit contributions
 affair+0.012
cedented+0.012
angering+0.011
addon+0.011
Access+0.011
Neg logit contributions
 Sund-0.014
hest-0.013
 [&-0.013
 deterrent-0.012
 guarantees-0.012
Token Customer
Token ID22092
Feature activation+9.56e-02

Pos logit contributions
 Sund+0.341
hest+0.336
 [&+0.319
 deterrent+0.299
 guarantees+0.298
Neg logit contributions
 affair-0.280
angering-0.275
cedented-0.274
addon-0.271
Access-0.271
Token Prem
Token ID6929
Feature activation-2.16e-01

Pos logit contributions
 affair+0.175
angering+0.173
cedented+0.172
addon+0.170
Access+0.169
Neg logit contributions
 Sund-0.262
hest-0.257
 [&-0.246
 deterrent-0.233
 guarantees-0.232
Tokenise
Token ID786
Feature activation+5.26e-01

Pos logit contributions
 Sund+0.373
hest+0.359
 [&+0.336
 deterrent+0.306
 guarantees+0.305
Neg logit contributions
 affair-0.615
cedented-0.610
angering-0.609
addon-0.604
issy-0.600
Token Equipment
Token ID22412
Feature activation+8.39e-02

Pos logit contributions
 affair+1.000
angering+0.984
Access+0.970
cedented+0.967
addon+0.955
Neg logit contributions
 Sund-1.391
hest-1.370
 [&-1.315
 deterrent-1.230
 Canaver-1.228
Token (
Token ID357
Feature activation-8.36e-02

Pos logit contributions
 affair+0.174
angering+0.171
cedented+0.170
addon+0.169
Access+0.169
Neg logit contributions
 Sund-0.208
hest-0.205
 [&-0.195
 deterrent-0.183
 guarantees-0.182
TokenCP
Token ID8697
Feature activation+8.64e-02

Pos logit contributions
 Sund+0.207
hest+0.202
 [&+0.191
 deterrent+0.181
 guarantees+0.180
Neg logit contributions
 affair-0.175
cedented-0.174
angering-0.174
addon-0.173
Access-0.172
TokenE
Token ID36
Feature activation+3.42e-01

Pos logit contributions
 affair+0.161
cedented+0.158
angering+0.158
addon+0.157
Access+0.155
Neg logit contributions
 Sund-0.235
hest-0.229
 [&-0.220
 deterrent-0.208
 guarantees-0.207
Token)
Token ID8
Feature activation-2.91e-02

Pos logit contributions
 affair+0.739
angering+0.728
cedented+0.725
addon+0.722
Access+0.716
Neg logit contributions
 Sund-0.824
hest-0.806
 [&-0.768
 deterrent-0.719
 guarantees-0.715
Token a
Token ID257
Feature activation+1.29e-01

Pos logit contributions
 affair+0.203
cedented+0.201
angering+0.200
addon+0.198
issy+0.195
Neg logit contributions
 Sund-0.255
hest-0.249
 [&-0.239
 deterrent-0.225
 guarantees-0.225
Token late
Token ID2739
Feature activation-6.96e-02

Pos logit contributions
 affair+0.254
cedented+0.250
angering+0.250
addon+0.246
issy+0.244
Neg logit contributions
 Sund-0.333
hest-0.326
 [&-0.313
 deterrent-0.294
 guarantees-0.294
Token-
Token ID12
Feature activation+7.63e-02

Pos logit contributions
 Sund+0.159
hest+0.155
 [&+0.148
 deterrent+0.138
 guarantees+0.138
Neg logit contributions
 affair-0.160
cedented-0.158
angering-0.157
addon-0.156
issy-0.154
Tokenons
Token ID684
Feature activation-3.86e-02

Pos logit contributions
 affair+0.149
cedented+0.148
angering+0.148
addon+0.146
Access+0.144
Neg logit contributions
 Sund-0.200
hest-0.195
 [&-0.187
 deterrent-0.177
 guarantees-0.177
Tokenet
Token ID316
Feature activation+1.76e-01

Pos logit contributions
 Sund+0.084
hest+0.080
 [&+0.077
 deterrent+0.072
 guarantees+0.072
Neg logit contributions
 affair-0.093
cedented-0.092
angering-0.092
addon-0.091
issy-0.091
Token ker
Token ID41927
Feature activation+5.26e-01

Pos logit contributions
 affair+0.371
angering+0.362
cedented+0.361
addon+0.356
issy+0.353
Neg logit contributions
 Sund-0.436
hest-0.428
 [&-0.408
 deterrent-0.385
 guarantees-0.383
Tokenat
Token ID265
Feature activation+1.74e-01

Pos logit contributions
 affair+0.819
cedented+0.806
angering+0.803
addon+0.796
issy+0.783
Neg logit contributions
 Sund-1.589
hest-1.552
 [&-1.497
 deterrent-1.427
 guarantees-1.423
Tokenoder
Token ID12342
Feature activation+3.14e-01

Pos logit contributions
 affair+0.315
cedented+0.311
angering+0.311
addon+0.307
issy+0.303
Neg logit contributions
 Sund-0.482
hest-0.470
 [&-0.452
 deterrent-0.428
 guarantees-0.428
Tokenma
Token ID2611
Feature activation+1.72e-01

Pos logit contributions
 affair+0.600
cedented+0.593
angering+0.591
addon+0.585
issy+0.578
Neg logit contributions
 Sund-0.838
hest-0.818
 [&-0.784
 deterrent-0.741
 guarantees-0.739
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
 affair+0.316
angering+0.309
cedented+0.308
addon+0.304
issy+0.299
Neg logit contributions
 Sund-0.473
hest-0.464
 [&-0.446
 deterrent-0.422
 guarantees-0.420
Token inherited
Token ID19552
Feature activation-6.23e-02

Pos logit contributions
 affair+0.148
angering+0.144
cedented+0.143
addon+0.141
Access+0.138
Neg logit contributions
 Sund-0.324
hest-0.320
 [&-0.308
 deterrent-0.294
 guarantees-0.293
Token until
Token ID1566
Feature activation-2.76e-01

Pos logit contributions
 affair+0.158
angering+0.155
cedented+0.154
addon+0.152
issy+0.151
Neg logit contributions
 Sund-0.212
hest-0.208
 [&-0.201
-0.188
 deterrent-0.187
Token we
Token ID356
Feature activation+8.84e-02

Pos logit contributions
 Sund+0.702
hest+0.683
 [&+0.652
 deterrent+0.615
 guarantees+0.614
Neg logit contributions
 affair-0.561
cedented-0.556
angering-0.555
addon-0.548
quished-0.543
Token got
Token ID1392
Feature activation+9.83e-02

Pos logit contributions
 affair+0.167
angering+0.164
cedented+0.163
addon+0.162
Access+0.160
Neg logit contributions
 Sund-0.238
hest-0.233
 [&-0.224
 deterrent-0.211
 guarantees-0.211
Token to
Token ID284
Feature activation-5.72e-03

Pos logit contributions
 affair+0.207
cedented+0.203
angering+0.203
addon+0.201
Access+0.199
Neg logit contributions
 Sund-0.243
hest-0.238
 [&-0.227
 deterrent-0.213
 guarantees-0.212
Token the
Token ID262
Feature activation-3.21e-02

Pos logit contributions
 Sund+0.015
hest+0.015
 [&+0.014
 deterrent+0.014
 guarantees+0.013
Neg logit contributions
 affair-0.011
angering-0.011
cedented-0.011
addon-0.011
issy-0.011
Token dressing
Token ID18544
Feature activation+5.25e-01

Pos logit contributions
 Sund+0.079
hest+0.078
 [&+0.074
 deterrent+0.069
 guarantees+0.069
Neg logit contributions
 affair-0.068
angering-0.067
cedented-0.066
addon-0.066
Access-0.065
Token room
Token ID2119
Feature activation+6.67e-02

Pos logit contributions
 affair+1.075
angering+1.044
cedented+1.039
addon+1.030
Access+1.021
Neg logit contributions
 Sund-1.333
hest-1.301
 [&-1.248
 guarantees-1.169
 deterrent-1.169
Token.
Token ID13
Feature activation-2.53e-03

Pos logit contributions
 affair+0.139
angering+0.137
cedented+0.137
addon+0.135
issy+0.134
Neg logit contributions
 Sund-0.167
hest-0.162
 [&-0.155
 deterrent-0.146
 guarantees-0.146
Token Sir
Token ID7361
Feature activation+5.70e-02

Pos logit contributions
 Sund+0.007
hest+0.007
 [&+0.006
 deterrent+0.006
 guarantees+0.006
Neg logit contributions
 affair-0.005
cedented-0.005
angering-0.005
addon-0.005
Access-0.005
Token Alex
Token ID4422
Feature activation+9.52e-02

Pos logit contributions
 affair+0.084
cedented+0.083
angering+0.082
addon+0.081
issy+0.081
Neg logit contributions
 Sund-0.176
hest-0.173
 [&-0.167
 deterrent-0.159
 guarantees-0.159
Token Ferguson
Token ID15031
Feature activation-2.50e-01

Pos logit contributions
 affair+0.210
cedented+0.207
angering+0.205
addon+0.204
issy+0.203
Neg logit contributions
 Sund-0.224
hest-0.220
 [&-0.211
 deterrent-0.196
-0.195
Token woman
Token ID2415
Feature activation-3.98e-02

Pos logit contributions
 Sund+0.015
hest+0.014
 [&+0.014
 guarantees+0.013
 deterrent+0.013
Neg logit contributions
 affair-0.011
angering-0.011
cedented-0.011
addon-0.011
issy-0.011
Token crosses
Token ID24234
Feature activation+1.43e-01

Pos logit contributions
 Sund+0.108
hest+0.106
 [&+0.102
 deterrent+0.095
 guarantees+0.095
Neg logit contributions
 affair-0.074
angering-0.073
cedented-0.072
addon-0.072
Access-0.071
Token the
Token ID262
Feature activation+3.89e-02

Pos logit contributions
 affair+0.267
angering+0.265
cedented+0.261
addon+0.259
issy+0.258
Neg logit contributions
 Sund-0.382
hest-0.376
 [&-0.361
 deterrent-0.340
 guarantees-0.339
Token road
Token ID2975
Feature activation-2.23e-01

Pos logit contributions
 affair+0.084
angering+0.083
cedented+0.083
addon+0.082
issy+0.081
Neg logit contributions
 Sund-0.093
hest-0.091
 [&-0.088
 guarantees-0.082
 deterrent-0.082
Token while
Token ID981
Feature activation+1.11e-01

Pos logit contributions
 Sund+0.569
hest+0.556
 [&+0.532
 deterrent+0.501
 guarantees+0.500
Neg logit contributions
 affair-0.451
angering-0.446
cedented-0.443
addon-0.438
issy-0.435
Token texting
Token ID36634
Feature activation+5.24e-01

Pos logit contributions
 affair+0.203
angering+0.201
addon+0.195
cedented+0.195
issy+0.195
Neg logit contributions
 Sund-0.300
hest-0.297
 [&-0.286
 guarantees-0.267
 deterrent-0.267
Token on
Token ID319
Feature activation+3.37e-02

Pos logit contributions
 affair+1.010
angering+0.991
addon+0.970
issy+0.967
cedented+0.965
Neg logit contributions
 Sund-1.375
hest-1.362
 [&-1.301
 guarantees-1.216
 deterrent-1.215
Token a
Token ID257
Feature activation+2.47e-02

Pos logit contributions
 affair+0.062
angering+0.062
cedented+0.061
addon+0.060
Access+0.060
Neg logit contributions
 Sund-0.092
hest-0.090
 [&-0.087
 deterrent-0.082
 guarantees-0.081
Token smart
Token ID4451
Feature activation-4.67e-02

Pos logit contributions
 affair+0.056
angering+0.055
cedented+0.055
addon+0.055
Access+0.054
Neg logit contributions
 Sund-0.057
hest-0.056
 [&-0.053
 guarantees-0.049
 deterrent-0.049
Token phone
Token ID3072
Feature activation-4.26e-02

Pos logit contributions
 Sund+0.126
hest+0.123
 [&+0.118
 guarantees+0.112
 deterrent+0.111
Neg logit contributions
 affair-0.088
angering-0.086
cedented-0.086
addon-0.085
Access-0.085
Token in
Token ID287
Feature activation-2.16e-02

Pos logit contributions
 Sund+0.107
hest+0.105
 [&+0.100
 guarantees+0.094
 deterrent+0.094
Neg logit contributions
 affair-0.088
angering-0.086
cedented-0.086
addon-0.085
Access-0.085
Token Is
Token ID1148
Feature activation+5.20e-02

Pos logit contributions
 affair+0.154
angering+0.151
cedented+0.150
addon+0.149
Access+0.148
Neg logit contributions
 Sund-0.191
hest-0.188
 [&-0.180
 deterrent-0.170
 guarantees-0.168
Token Richard
Token ID6219
Feature activation+9.34e-02

Pos logit contributions
 affair+0.088
cedented+0.087
angering+0.087
addon+0.086
issy+0.085
Neg logit contributions
 Sund-0.150
hest-0.147
 [&-0.141
 deterrent-0.134
 guarantees-0.134
Token Nixon
Token ID16642
Feature activation+1.48e-01

Pos logit contributions
 affair+0.257
addon+0.251
angering+0.251
issy+0.251
Access+0.250
Neg logit contributions
 Sund-0.167
hest-0.164
 [&-0.157
 deterrent-0.144
 guarantees-0.141
Token in
Token ID287
Feature activation+1.10e-01

Pos logit contributions
 affair+0.304
cedented+0.299
angering+0.298
addon+0.295
quished+0.294
Neg logit contributions
 Sund-0.371
hest-0.364
 [&-0.347
 deterrent-0.327
 guarantees-0.323
Token a
Token ID257
Feature activation+4.94e-02

Pos logit contributions
 affair+0.236
cedented+0.234
angering+0.234
addon+0.231
issy+0.229
Neg logit contributions
 Sund-0.268
hest-0.260
 [&-0.248
 deterrent-0.233
 guarantees-0.233
Token Pants
Token ID41689
Feature activation+5.22e-01

Pos logit contributions
 affair+0.106
cedented+0.105
angering+0.105
addon+0.104
quished+0.103
Neg logit contributions
 Sund-0.120
hest-0.117
 [&-0.111
 deterrent-0.104
 guarantees-0.104
Tokenuit
Token ID5013
Feature activation+1.32e-02

Pos logit contributions
 affair+0.897
cedented+0.879
addon+0.877
issy+0.873
angering+0.873
Neg logit contributions
 Sund-1.491
hest-1.414
 [&-1.384
 deterrent-1.348
 guarantees-1.324
Token was
Token ID373
Feature activation+1.01e-01

Pos logit contributions
 affair+0.021
cedented+0.021
angering+0.021
addon+0.020
quished+0.020
Neg logit contributions
 Sund-0.039
hest-0.038
 [&-0.037
 deterrent-0.035
 guarantees-0.035
Token originally
Token ID6198
Feature activation-9.03e-02

Pos logit contributions
 affair+0.218
cedented+0.215
angering+0.214
addon+0.213
Access+0.211
Neg logit contributions
 Sund-0.241
hest-0.236
 [&-0.225
 deterrent-0.210
 guarantees-0.210
Token published
Token ID3199
Feature activation-4.54e-02

Pos logit contributions
 Sund+0.244
hest+0.239
 [&+0.229
 deterrent+0.216
 guarantees+0.216
Neg logit contributions
 affair-0.169
cedented-0.167
angering-0.167
addon-0.165
issy-0.163
Token at
Token ID379
Feature activation+2.21e-02

Pos logit contributions
 Sund+0.111
hest+0.109
 [&+0.103
 deterrent+0.097
 guarantees+0.097
Neg logit contributions
 affair-0.096
angering-0.095
cedented-0.095
addon-0.094
Access-0.093
Token in
Token ID287
Feature activation+5.85e-02

Pos logit contributions
 Sund+0.193
hest+0.187
 [&+0.179
 deterrent+0.168
 guarantees+0.168
Neg logit contributions
 affair-0.174
cedented-0.172
angering-0.172
addon-0.170
issy-0.168
Token public
Token ID1171
Feature activation-9.91e-02

Pos logit contributions
 affair+0.119
cedented+0.118
angering+0.118
addon+0.117
issy+0.115
Neg logit contributions
 Sund-0.148
hest-0.145
 [&-0.138
 deterrent-0.130
 guarantees-0.130
Token forums
Token ID14216
Feature activation-1.88e-01

Pos logit contributions
 Sund+0.250
hest+0.244
 [&+0.233
 deterrent+0.219
 guarantees+0.219
Neg logit contributions
 affair-0.204
angering-0.201
cedented-0.201
addon-0.198
issy-0.196
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
 Sund+0.479
hest+0.466
 [&+0.446
 deterrent+0.420
 guarantees+0.420
Neg logit contributions
 affair-0.383
cedented-0.379
angering-0.378
addon-0.375
quished-0.371
Token Says
Token ID28628
Feature activation-6.42e-02

Pos logit contributions
 affair+0.179
angering+0.176
cedented+0.176
addon+0.173
Access+0.173
Neg logit contributions
 Sund-0.283
hest-0.279
 [&-0.267
 deterrent-0.254
 guarantees-0.253
Token Neh
Token ID44470
Feature activation+5.21e-01

Pos logit contributions
 Sund+0.144
hest+0.140
 [&+0.133
 deterrent+0.124
 guarantees+0.124
Neg logit contributions
 affair-0.150
cedented-0.148
angering-0.148
addon-0.146
issy-0.145
Tokenring
Token ID1806
Feature activation-2.68e-01

Pos logit contributions
angering+0.672
addon+0.670
 affair+0.667
issy+0.661
cedented+0.658
Neg logit contributions
 Sund-1.690
hest-1.622
 [&-1.601
 deterrent-1.543
 guarantees-1.522
Token,
Token ID11
Feature activation+3.63e-02

Pos logit contributions
 Sund+0.701
hest+0.682
 [&+0.653
 guarantees+0.616
 deterrent+0.616
Neg logit contributions
 affair-0.523
cedented-0.520
angering-0.520
addon-0.512
quished-0.507
Token
Token ID198
Feature activation-4.81e-03

Pos logit contributions
 affair+0.054
cedented+0.053
angering+0.053
addon+0.052
quished+0.051
Neg logit contributions
 Sund-0.113
hest-0.110
 [&-0.106
 deterrent-0.101
 guarantees-0.101
Token
Token ID198
Feature activation+9.16e-03

Pos logit contributions
 Sund+0.013
hest+0.013
 [&+0.013
 deterrent+0.012
 guarantees+0.012
Neg logit contributions
 affair-0.009
Access-0.008
cedented-0.008
addon-0.008
angering-0.008
TokenAll
Token ID3237
Feature activation+5.26e-03

Pos logit contributions
 affair+0.017
cedented+0.017
angering+0.017
addon+0.017
Access+0.017
Neg logit contributions
 Sund-0.025
hest-0.024
 [&-0.023
 deterrent-0.022
 guarantees-0.022
Token a
Token ID257
Feature activation+4.80e-03

Pos logit contributions
 Sund+0.072
hest+0.070
 [&+0.067
 deterrent+0.063
 guarantees+0.063
Neg logit contributions
 affair-0.056
cedented-0.055
angering-0.055
addon-0.055
issy-0.054
Token Curiosity
Token ID39736
Feature activation-1.75e-01

Pos logit contributions
 affair+0.010
cedented+0.010
angering+0.010
addon+0.010
issy+0.010
Neg logit contributions
 Sund-0.012
hest-0.012
 [&-0.011
 deterrent-0.010
 guarantees-0.010
Token co
Token ID763
Feature activation-1.69e-01

Pos logit contributions
 Sund+0.417
hest+0.406
 [&+0.387
 deterrent+0.363
 guarantees+0.362
Neg logit contributions
 affair-0.382
cedented-0.378
angering-0.378
addon-0.374
issy-0.370
Token-
Token ID12
Feature activation+3.57e-02

Pos logit contributions
 Sund+0.321
hest+0.311
 [&+0.290
 deterrent+0.268
 guarantees+0.267
Neg logit contributions
 affair-0.455
angering-0.451
cedented-0.450
addon-0.446
issy-0.443
Tokeninvest
Token ID24859
Feature activation+1.45e-01

Pos logit contributions
 affair+0.057
cedented+0.056
angering+0.056
addon+0.056
Access+0.055
Neg logit contributions
 Sund-0.106
hest-0.104
 [&-0.100
 deterrent-0.095
 guarantees-0.095
Tokenigator
Token ID23823
Feature activation+5.14e-01

Pos logit contributions
 affair+0.245
cedented+0.242
angering+0.241
addon+0.238
issy+0.235
Neg logit contributions
 Sund-0.420
hest-0.410
 [&-0.395
 deterrent-0.375
 guarantees-0.374
Token.
Token ID13
Feature activation-2.98e-02

Pos logit contributions
 affair+1.071
cedented+1.054
angering+1.053
addon+1.042
issy+1.031
Neg logit contributions
 Sund-1.280
hest-1.253
 [&-1.196
 deterrent-1.127
 guarantees-1.120
Token "
Token ID366
Feature activation-1.13e-01

Pos logit contributions
 Sund+0.089
hest+0.086
 [&+0.083
+0.079
 guarantees+0.078
Neg logit contributions
 affair-0.048
angering-0.048
cedented-0.047
addon-0.047
quished-0.046
TokenWith
Token ID3152
Feature activation-2.99e-02

Pos logit contributions
 Sund+0.306
hest+0.299
 [&+0.286
 deterrent+0.271
 guarantees+0.270
Neg logit contributions
 affair-0.212
cedented-0.210
angering-0.209
addon-0.207
Access-0.205
Token only
Token ID691
Feature activation-5.03e-02

Pos logit contributions
 Sund+0.076
hest+0.074
 [&+0.071
 deterrent+0.067
 guarantees+0.067
Neg logit contributions
 affair-0.061
cedented-0.060
angering-0.060
addon-0.059
issy-0.059
Token one
Token ID530
Feature activation-2.87e-02

Pos logit contributions
 Sund+0.128
hest+0.125
 [&+0.120
 deterrent+0.112
+0.111
Neg logit contributions
 affair-0.103
angering-0.101
cedented-0.101
addon-0.099
Access-0.099
Tokenoom
Token ID4207
Feature activation-2.18e-01

Pos logit contributions
 affair+0.201
cedented+0.200
angering+0.199
addon+0.195
quished+0.194
Neg logit contributions
 Sund-0.286
hest-0.278
 [&-0.267
 deterrent-0.251
-0.251
Tokenin
Token ID259
Feature activation-5.72e-02

Pos logit contributions
 Sund+0.585
hest+0.575
 [&+0.544
 deterrent+0.515
+0.514
Neg logit contributions
 affair-0.410
cedented-0.408
angering-0.405
quished-0.397
addon-0.396
Token World
Token ID2159
Feature activation-2.09e-01

Pos logit contributions
 Sund+0.169
hest+0.165
 [&+0.159
 deterrent+0.151
 guarantees+0.151
Neg logit contributions
 affair-0.093
cedented-0.091
angering-0.091
addon-0.090
issy-0.089
Token (
Token ID357
Feature activation-7.67e-02

Pos logit contributions
 Sund+0.538
hest+0.524
 [&+0.501
 deterrent+0.472
 guarantees+0.471
Neg logit contributions
 affair-0.422
cedented-0.417
angering-0.416
addon-0.411
issy-0.407
TokenM
Token ID44
Feature activation+7.94e-02

Pos logit contributions
 Sund+0.204
hest+0.199
 [&+0.191
 deterrent+0.181
 guarantees+0.180
Neg logit contributions
 affair-0.147
cedented-0.145
angering-0.145
addon-0.143
Access-0.142
Tokenuum
Token ID13814
Feature activation-6.57e-01

Pos logit contributions
 affair+0.150
cedented+0.149
angering+0.148
addon+0.147
issy+0.145
Neg logit contributions
 Sund-0.213
hest-0.208
 [&-0.199
 deterrent-0.188
 guarantees-0.188
Tokenima
Token ID8083
Feature activation-1.07e-01

Pos logit contributions
 Sund+1.754
hest+1.710
 [&+1.626
 guarantees+1.529
 deterrent+1.528
Neg logit contributions
cedented-1.253
 affair-1.251
angering-1.241
addon-1.218
quished-1.218
Tokenail
Token ID603
Feature activation-3.24e-02

Pos logit contributions
 Sund+0.279
hest+0.273
 [&+0.261
 deterrent+0.248
 guarantees+0.247
Neg logit contributions
 affair-0.208
angering-0.204
cedented-0.204
addon-0.202
issy-0.200
Tokenma
Token ID2611
Feature activation-1.46e-02

Pos logit contributions
 Sund+0.084
hest+0.082
 [&+0.079
 deterrent+0.075
 guarantees+0.075
Neg logit contributions
 affair-0.064
angering-0.063
cedented-0.063
addon-0.062
issy-0.062
Token in
Token ID287
Feature activation-3.03e-02

Pos logit contributions
 Sund+0.037
hest+0.036
 [&+0.035
 deterrent+0.033
 guarantees+0.033
Neg logit contributions
 affair-0.029
angering-0.029
cedented-0.029
addon-0.028
issy-0.028
Token Finnish
Token ID26838
Feature activation-1.13e-01

Pos logit contributions
 Sund+0.081
hest+0.079
 [&+0.075
 deterrent+0.071
 guarantees+0.071
Neg logit contributions
 affair-0.058
cedented-0.057
angering-0.057
addon-0.057
issy-0.056
TokenM
Token ID44
Feature activation+1.35e-01

Pos logit contributions
 affair+0.023
cedented+0.022
angering+0.022
addon+0.022
issy+0.022
Neg logit contributions
 Sund-0.038
hest-0.037
 [&-0.036
 deterrent-0.034
-0.034
TokenDB
Token ID11012
Feature activation-5.24e-02

Pos logit contributions
 affair+0.357
cedented+0.353
angering+0.353
addon+0.353
Access+0.352
Neg logit contributions
 Sund-0.257
hest-0.249
 [&-0.232
 deterrent-0.217
 guarantees-0.214
Token)
Token ID8
Feature activation-4.45e-02

Pos logit contributions
 Sund+0.126
hest+0.122
 [&+0.116
 guarantees+0.110
 deterrent+0.109
Neg logit contributions
cedented-0.114
 affair-0.113
angering-0.113
addon-0.112
quished-0.111
Token has
Token ID468
Feature activation-6.88e-02

Pos logit contributions
 Sund+0.119
hest+0.116
 [&+0.111
 deterrent+0.105
 guarantees+0.105
Neg logit contributions
 affair-0.085
cedented-0.084
angering-0.084
addon-0.083
Access-0.082
Token not
Token ID407
Feature activation-1.49e-02

Pos logit contributions
 Sund+0.181
hest+0.177
 [&+0.169
 deterrent+0.160
 guarantees+0.160
Neg logit contributions
 affair-0.134
cedented-0.132
angering-0.132
addon-0.130
Access-0.129
Token complied
Token ID35413
Feature activation-6.27e-01

Pos logit contributions
 Sund+0.040
hest+0.039
 [&+0.037
 deterrent+0.035
 guarantees+0.035
Neg logit contributions
 affair-0.028
cedented-0.028
angering-0.028
addon-0.028
Access-0.027
Token with
Token ID351
Feature activation-9.66e-03

Pos logit contributions
 Sund+1.508
hest+1.460
 [&+1.387
 guarantees+1.319
 deterrent+1.314
Neg logit contributions
 affair-1.358
cedented-1.348
angering-1.344
addon-1.343
issy-1.327
Token the
Token ID262
Feature activation-2.50e-02

Pos logit contributions
 Sund+0.025
hest+0.024
 [&+0.023
 deterrent+0.022
+0.022
Neg logit contributions
 affair-0.019
angering-0.019
cedented-0.019
addon-0.019
Access-0.019
Token bank
Token ID3331
Feature activation+9.85e-02

Pos logit contributions
 Sund+0.060
hest+0.059
 [&+0.056
+0.052
 deterrent+0.052
Neg logit contributions
 affair-0.054
angering-0.053
cedented-0.053
addon-0.052
Access-0.052
Token's
Token ID338
Feature activation-6.63e-02

Pos logit contributions
 affair+0.212
angering+0.210
cedented+0.209
addon+0.207
Access+0.205
Neg logit contributions
 Sund-0.239
hest-0.232
 [&-0.223
 deterrent-0.208
-0.207
Token directions
Token ID11678
Feature activation+1.80e-02

Pos logit contributions
 Sund+0.150
hest+0.146
 [&+0.139
+0.129
 deterrent+0.129
Neg logit contributions
 affair-0.154
angering-0.152
cedented-0.151
addon-0.149
Access-0.149
Token Brazilian
Token ID17036
Feature activation+1.41e-02

Pos logit contributions
 Sund+0.566
hest+0.551
 [&+0.526
 deterrent+0.495
 guarantees+0.494
Neg logit contributions
 affair-0.479
cedented-0.474
angering-0.473
addon-0.468
issy-0.463
Token prosecutor
Token ID13683
Feature activation-1.21e-01

Pos logit contributions
 affair+0.029
cedented+0.029
angering+0.029
addon+0.028
Access+0.028
Neg logit contributions
 Sund-0.035
hest-0.035
 [&-0.033
 guarantees-0.031
 deterrent-0.031
Token that
Token ID326
Feature activation-1.30e-01

Pos logit contributions
 Sund+0.325
hest+0.318
 [&+0.305
 deterrent+0.287
 guarantees+0.286
Neg logit contributions
 affair-0.231
cedented-0.227
angering-0.227
addon-0.225
Access-0.222
Token Rolls
Token ID40074
Feature activation-1.33e-01

Pos logit contributions
 Sund+0.338
hest+0.329
 [&+0.314
 deterrent+0.297
 guarantees+0.297
Neg logit contributions
 affair-0.259
cedented-0.258
angering-0.257
addon-0.254
quished-0.251
Token-
Token ID12
Feature activation-2.30e-01

Pos logit contributions
 Sund+0.300
hest+0.290
 [&+0.275
 deterrent+0.258
 guarantees+0.257
Neg logit contributions
 affair-0.311
cedented-0.309
angering-0.308
addon-0.305
quished-0.303
TokenRoy
Token ID32027
Feature activation-5.95e-01

Pos logit contributions
 Sund+0.567
hest+0.552
 [&+0.526
 deterrent+0.495
 guarantees+0.494
Neg logit contributions
 affair-0.485
cedented-0.480
angering-0.480
addon-0.474
issy-0.470
Tokence
Token ID344
Feature activation-5.29e-02

Pos logit contributions
 Sund+1.164
hest+1.119
 [&+1.056
 guarantees+0.987
+0.975
Neg logit contributions
angering-1.559
 affair-1.555
cedented-1.553
addon-1.533
quished-1.529
Token is
Token ID318
Feature activation-9.60e-02

Pos logit contributions
 Sund+0.140
hest+0.137
 [&+0.131
 guarantees+0.124
 deterrent+0.124
Neg logit contributions
 affair-0.102
cedented-0.101
angering-0.101
addon-0.099
quished-0.098
Token co
Token ID763
Feature activation-2.91e-01

Pos logit contributions
 Sund+0.264
hest+0.258
 [&+0.247
 deterrent+0.234
 guarantees+0.233
Neg logit contributions
 affair-0.176
angering-0.174
cedented-0.173
addon-0.171
Access-0.169
Token-
Token ID12
Feature activation-1.03e-01

Pos logit contributions
 Sund+0.578
hest+0.562
 [&+0.526
 guarantees+0.488
 deterrent+0.487
Neg logit contributions
 affair-0.749
cedented-0.744
angering-0.740
addon-0.736
issy-0.733
Tokenoper
Token ID3575
Feature activation-1.77e-01

Pos logit contributions
 Sund+0.261
hest+0.255
 [&+0.243
 deterrent+0.229
 guarantees+0.229
Neg logit contributions
 affair-0.211
cedented-0.208
angering-0.208
addon-0.205
issy-0.203
TokenS
Token ID50
Feature activation+9.67e-02

Pos logit contributions
 affair+0.129
cedented+0.127
angering+0.127
addon+0.125
issy+0.124
Neg logit contributions
 Sund-0.187
hest-0.183
 [&-0.175
 deterrent-0.166
 guarantees-0.165
Token.
Token ID764
Feature activation-2.64e-03

Pos logit contributions
 affair+0.231
cedented+0.228
angering+0.227
addon+0.226
issy+0.224
Neg logit contributions
 Sund-0.211
hest-0.205
 [&-0.195
 deterrent-0.182
 guarantees-0.181
Token
Token ID198
Feature activation-4.21e-03

Pos logit contributions
 Sund+0.007
hest+0.007
 [&+0.007
 guarantees+0.006
+0.006
Neg logit contributions
 affair-0.005
angering-0.005
cedented-0.005
addon-0.005
issy-0.005
Token
Token ID198
Feature activation+2.95e-02

Pos logit contributions
 Sund+0.012
hest+0.011
 [&+0.011
 deterrent+0.011
 guarantees+0.010
Neg logit contributions
 affair-0.008
cedented-0.007
Access-0.007
addon-0.007
angering-0.007
TokenThe
Token ID464
Feature activation+4.79e-02

Pos logit contributions
 affair+0.054
cedented+0.053
angering+0.053
Access+0.052
addon+0.052
Neg logit contributions
 Sund-0.081
hest-0.079
 [&-0.076
 deterrent-0.073
 guarantees-0.072
Token Incarnation
Token ID47881
Feature activation-5.87e-01

Pos logit contributions
 affair+0.090
angering+0.089
cedented+0.089
addon+0.088
issy+0.087
Neg logit contributions
 Sund-0.129
hest-0.126
 [&-0.121
 deterrent-0.114
 guarantees-0.114
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
 Sund+1.386
hest+1.345
 [&+1.280
 guarantees+1.200
+1.198
Neg logit contributions
 affair-1.295
cedented-1.293
angering-1.288
addon-1.277
issy-1.266
Token The
Token ID383
Feature activation+9.19e-02

Pos logit contributions
 affair+0.257
angering+0.252
cedented+0.251
addon+0.248
Access+0.247
Neg logit contributions
 Sund-0.413
hest-0.406
 [&-0.389
 deterrent-0.371
 guarantees-0.368
Token Life
Token ID5155
Feature activation+1.36e-01

Pos logit contributions
 affair+0.169
cedented+0.165
angering+0.165
addon+0.163
issy+0.162
Neg logit contributions
 Sund-0.252
hest-0.247
 [&-0.237
 deterrent-0.224
 guarantees-0.223
Token of
Token ID286
Feature activation+7.85e-02

Pos logit contributions
 affair+0.274
angering+0.270
cedented+0.269
addon+0.265
quished+0.264
Neg logit contributions
 Sund-0.350
hest-0.341
 [&-0.326
 deterrent-0.309
 guarantees-0.307
Token Christ
Token ID1951
Feature activation-7.37e-02

Pos logit contributions
 affair+0.122
angering+0.118
cedented+0.118
addon+0.117
issy+0.116
Neg logit contributions
 Sund-0.236
hest-0.233
 [&-0.225
 deterrent-0.215
 guarantees-0.213
Token the
Token ID262
Feature activation-6.47e-02

Pos logit contributions
 Sund+0.204
hest+0.198
 [&+0.189
 deterrent+0.178
 guarantees+0.177
Neg logit contributions
 affair-0.174
cedented-0.172
angering-0.172
addon-0.170
Access-0.168
Token Italian
Token ID8200
Feature activation-1.47e-01

Pos logit contributions
 Sund+0.163
hest+0.159
 [&+0.152
 deterrent+0.142
+0.142
Neg logit contributions
 affair-0.134
angering-0.131
cedented-0.130
addon-0.129
issy-0.128
Token Army
Token ID5407
Feature activation+2.50e-02

Pos logit contributions
 Sund+0.374
hest+0.364
 [&+0.348
 deterrent+0.328
 guarantees+0.327
Neg logit contributions
 affair-0.298
cedented-0.295
angering-0.294
addon-0.291
issy-0.288
Token had
Token ID550
Feature activation-7.61e-02

Pos logit contributions
 affair+0.046
angering+0.045
cedented+0.045
addon+0.045
issy+0.044
Neg logit contributions
 Sund-0.068
hest-0.067
 [&-0.064
 deterrent-0.060
 guarantees-0.060
Token been
Token ID587
Feature activation-8.39e-02

Pos logit contributions
 Sund+0.206
hest+0.201
 [&+0.193
 deterrent+0.181
+0.181
Neg logit contributions
 affair-0.143
angering-0.140
cedented-0.139
addon-0.139
Access-0.137
Token intern
Token ID1788
Feature activation-5.83e-01

Pos logit contributions
 Sund+0.217
hest+0.211
 [&+0.202
 deterrent+0.190
 guarantees+0.190
Neg logit contributions
 affair-0.167
angering-0.165
cedented-0.165
addon-0.163
issy-0.161
Tokened
Token ID276
Feature activation-1.51e-01

Pos logit contributions
 Sund+1.020
hest+0.983
 [&+0.916
 guarantees+0.835
 deterrent+0.830
Neg logit contributions
 affair-1.648
cedented-1.633
angering-1.628
addon-1.619
Access-1.609
Token by
Token ID416
Feature activation-1.30e-01

Pos logit contributions
 Sund+0.373
hest+0.364
 [&+0.347
 deterrent+0.326
 guarantees+0.325
Neg logit contributions
 affair-0.320
angering-0.316
cedented-0.316
addon-0.312
issy-0.309
Token German
Token ID2679
Feature activation+6.45e-02

Pos logit contributions
 Sund+0.370
hest+0.361
 [&+0.346
 deterrent+0.329
 guarantees+0.329
Neg logit contributions
 affair-0.223
cedented-0.221
angering-0.220
addon-0.217
Access-0.215
Token forces
Token ID3386
Feature activation+1.15e-01

Pos logit contributions
 affair+0.131
cedented+0.130
angering+0.129
addon+0.128
issy+0.127
Neg logit contributions
 Sund-0.164
hest-0.160
 [&-0.153
 deterrent-0.144
 guarantees-0.144
Token in
Token ID287
Feature activation-9.52e-02

Pos logit contributions
 affair+0.232
angering+0.228
cedented+0.227
addon+0.226
issy+0.222
Neg logit contributions
 Sund-0.293
hest-0.286
 [&-0.273
 deterrent-0.257
 guarantees-0.257
Tokenbased
Token ID3106
Feature activation+1.03e-01

Pos logit contributions
 affair+0.125
cedented+0.123
angering+0.122
addon+0.122
Access+0.119
Neg logit contributions
 Sund-0.235
hest-0.229
 [&-0.221
 deterrent-0.210
 guarantees-0.210
Token he
Token ID339
Feature activation+5.49e-02

Pos logit contributions
 affair+0.208
angering+0.202
cedented+0.200
addon+0.200
issy+0.198
Neg logit contributions
 Sund-0.265
hest-0.260
 [&-0.248
 deterrent-0.234
 guarantees-0.233
Tokeni
Token ID72
Feature activation+3.56e-02

Pos logit contributions
 affair+0.104
addon+0.102
angering+0.102
cedented+0.102
issy+0.100
Neg logit contributions
 Sund-0.147
hest-0.143
 [&-0.137
 deterrent-0.130
 guarantees-0.130
Tokenress
Token ID601
Feature activation+7.63e-03

Pos logit contributions
 affair+0.035
cedented+0.034
angering+0.034
addon+0.033
Access+0.033
Neg logit contributions
 Sund-0.128
hest-0.125
 [&-0.121
 deterrent-0.117
 guarantees-0.117
Token Beat
Token ID12568
Feature activation-7.80e-02

Pos logit contributions
 affair+0.014
angering+0.014
cedented+0.014
addon+0.014
issy+0.014
Neg logit contributions
 Sund-0.021
hest-0.020
 [&-0.019
 deterrent-0.018
 guarantees-0.018
Tokenrice
Token ID20970
Feature activation-5.63e-01

Pos logit contributions
 Sund+0.133
hest+0.128
 [&+0.120
 deterrent+0.110
 guarantees+0.109
Neg logit contributions
 affair-0.224
angering-0.221
cedented-0.221
addon-0.220
issy-0.219
Token Joyce
Token ID25936
Feature activation-7.95e-02

Pos logit contributions
 Sund+1.495
hest+1.441
 [&+1.386
 guarantees+1.309
 deterrent+1.302
Neg logit contributions
cedented-1.093
angering-1.090
 affair-1.087
addon-1.067
quished-1.063
Token Ke
Token ID3873
Feature activation+1.92e-01

Pos logit contributions
 Sund+0.214
hest+0.210
 [&+0.201
 deterrent+0.190
 guarantees+0.189
Neg logit contributions
 affair-0.150
cedented-0.147
angering-0.147
addon-0.145
Access-0.143
Tokenan
Token ID272
Feature activation+5.30e-02

Pos logit contributions
 affair+0.400
angering+0.395
cedented+0.395
addon+0.393
issy+0.389
Neg logit contributions
 Sund-0.477
hest-0.463
 [&-0.444
 deterrent-0.420
 guarantees-0.418
Token set
Token ID900
Feature activation+3.79e-02

Pos logit contributions
 affair+0.102
angering+0.100
cedented+0.100
addon+0.099
issy+0.098
Neg logit contributions
 Sund-0.141
hest-0.138
 [&-0.132
 deterrent-0.125
 guarantees-0.124
Token up
Token ID510
Feature activation+8.22e-02

Pos logit contributions
 affair+0.081
angering+0.079
cedented+0.078
addon+0.078
Access+0.077
Neg logit contributions
 Sund-0.093
hest-0.091
 [&-0.087
 deterrent-0.082
-0.081
Token,
Token ID11
Feature activation+2.10e-02

Pos logit contributions
 Sund+0.648
hest+0.634
 [&+0.608
 deterrent+0.575
 guarantees+0.573
Neg logit contributions
 affair-0.441
cedented-0.434
angering-0.434
addon-0.429
issy-0.424
Token much
Token ID881
Feature activation+1.52e-01

Pos logit contributions
 affair+0.042
angering+0.042
cedented+0.041
addon+0.041
issy+0.041
Neg logit contributions
 Sund-0.054
hest-0.053
 [&-0.050
 deterrent-0.047
 guarantees-0.047
Token revised
Token ID15556
Feature activation-7.19e-02

Pos logit contributions
 affair+0.314
angering+0.309
cedented+0.308
addon+0.305
quished+0.302
Neg logit contributions
 Sund-0.383
hest-0.376
 [&-0.360
 guarantees-0.335
 deterrent-0.335
Token by
Token ID416
Feature activation-5.76e-02

Pos logit contributions
 Sund+0.184
hest+0.180
 [&+0.172
 deterrent+0.162
 guarantees+0.161
Neg logit contributions
 affair-0.146
angering-0.142
cedented-0.142
addon-0.142
issy-0.140
Token other
Token ID584
Feature activation+7.23e-02

Pos logit contributions
 Sund+0.155
hest+0.152
 [&+0.146
 deterrent+0.138
 guarantees+0.137
Neg logit contributions
 affair-0.108
angering-0.106
cedented-0.106
addon-0.105
issy-0.104
Token screen
Token ID3159
Feature activation-5.59e-01

Pos logit contributions
 affair+0.160
angering+0.156
addon+0.156
cedented+0.155
issy+0.154
Neg logit contributions
 Sund-0.170
hest-0.168
 [&-0.161
 deterrent-0.149
-0.149
Tokenwriters
Token ID34422
Feature activation-1.99e-01

Pos logit contributions
 Sund+0.804
hest+0.767
 [&+0.706
 deterrent+0.630
 guarantees+0.629
Neg logit contributions
 affair-1.755
cedented-1.747
angering-1.743
addon-1.731
Access-1.719
Token,
Token ID11
Feature activation-3.54e-02

Pos logit contributions
 Sund+0.521
hest+0.507
 [&+0.485
 deterrent+0.458
 guarantees+0.457
Neg logit contributions
 affair-0.390
cedented-0.386
angering-0.385
addon-0.380
Access-0.377
Token helped
Token ID4193
Feature activation-1.46e-02

Pos logit contributions
 Sund+0.098
hest+0.097
 [&+0.093
 deterrent+0.088
+0.087
Neg logit contributions
 affair-0.063
cedented-0.062
angering-0.062
addon-0.061
issy-0.061
Token the
Token ID262
Feature activation-7.15e-03

Pos logit contributions
 Sund+0.035
hest+0.034
 [&+0.033
 deterrent+0.030
 guarantees+0.030
Neg logit contributions
 affair-0.032
addon-0.031
angering-0.031
cedented-0.031
issy-0.031
Token bottom
Token ID4220
Feature activation+5.82e-02

Pos logit contributions
 Sund+0.019
hest+0.019
 [&+0.018
 deterrent+0.017
 guarantees+0.017
Neg logit contributions
 affair-0.014
angering-0.013
cedented-0.013
addon-0.013
issy-0.013
Token ont
Token ID39585
Feature activation-3.38e-02

Pos logit contributions
 Sund+0.286
hest+0.279
 [&+0.265
 deterrent+0.249
 guarantees+0.248
Neg logit contributions
 affair-0.258
cedented-0.254
angering-0.254
addon-0.250
issy-0.249
Token f
Token ID277
Feature activation-2.43e-01

Pos logit contributions
 Sund+0.082
hest+0.080
 [&+0.076
 deterrent+0.071
 guarantees+0.071
Neg logit contributions
 affair-0.073
angering-0.072
cedented-0.072
addon-0.071
quished-0.070
Tokenuit
Token ID5013
Feature activation-1.25e-01

Pos logit contributions
 Sund+0.598
hest+0.581
 [&+0.557
 deterrent+0.523
 guarantees+0.522
Neg logit contributions
 affair-0.515
angering-0.508
cedented-0.508
addon-0.503
issy-0.498
Token Mil
Token ID4460
Feature activation+1.23e-02

Pos logit contributions
 Sund+0.300
hest+0.292
 [&+0.278
 deterrent+0.261
 guarantees+0.260
Neg logit contributions
 affair-0.273
cedented-0.268
angering-0.268
addon-0.265
quished-0.263
Tokenne
Token ID710
Feature activation+6.59e-02

Pos logit contributions
 affair+0.026
angering+0.025
cedented+0.025
addon+0.025
issy+0.025
Neg logit contributions
 Sund-0.030
hest-0.030
 [&-0.028
 deterrent-0.027
 guarantees-0.026
Token Orbital
Token ID45453
Feature activation-5.50e-01

Pos logit contributions
 affair+0.141
cedented+0.138
angering+0.138
addon+0.136
issy+0.136
Neg logit contributions
 Sund-0.161
hest-0.157
 [&-0.150
 deterrent-0.141
-0.140
Token em
Token ID795
Feature activation+9.13e-02

Pos logit contributions
 Sund+1.392
hest+1.354
 [&+1.294
 deterrent+1.219
 guarantees+1.218
Neg logit contributions
 affair-1.126
cedented-1.118
angering-1.114
addon-1.101
issy-1.089
Tokenport
Token ID634
Feature activation+9.86e-02

Pos logit contributions
 affair+0.184
angering+0.181
cedented+0.181
addon+0.179
issy+0.178
Neg logit contributions
 Sund-0.234
hest-0.228
 [&-0.219
 deterrent-0.206
 guarantees-0.206
Tokenant
Token ID415
Feature activation-5.87e-02

Pos logit contributions
 affair+0.196
angering+0.194
cedented+0.193
addon+0.191
issy+0.189
Neg logit contributions
 Sund-0.256
hest-0.248
 [&-0.239
 deterrent-0.226
 guarantees-0.224
Token a
Token ID257
Feature activation+7.60e-02

Pos logit contributions
 Sund+0.143
hest+0.139
 [&+0.132
 deterrent+0.124
 guarantees+0.124
Neg logit contributions
 affair-0.127
angering-0.124
cedented-0.124
addon-0.123
Access-0.122
Tokenvec
Token ID35138
Feature activation-7.26e-02

Pos logit contributions
 affair+0.196
angering+0.194
cedented+0.193
addon+0.192
issy+0.191
Neg logit contributions
 Sund-0.152
hest-0.147
 [&-0.139
 deterrent-0.128
 guarantees-0.128
Tokenli
Token ID4528
Feature activation-6.36e-02

Pos logit contributions
 Sund+0.005
hest+0.005
 [&+0.005
 deterrent+0.005
 guarantees+0.005
Neg logit contributions
 affair-0.008
cedented-0.008
angering-0.008
addon-0.008
quished-0.008
Token the
Token ID262
Feature activation+5.62e-02

Pos logit contributions
 Sund+0.167
hest+0.162
 [&+0.155
 guarantees+0.146
+0.145
Neg logit contributions
 affair-0.125
cedented-0.124
angering-0.123
addon-0.122
Access-0.121
Token Dwarf
Token ID33196
Feature activation-1.38e-02

Pos logit contributions
 affair+0.107
cedented+0.106
angering+0.106
addon+0.104
issy+0.103
Neg logit contributions
 Sund-0.150
hest-0.147
 [&-0.141
 deterrent-0.133
 guarantees-0.133
Token,
Token ID11
Feature activation+3.11e-02

Pos logit contributions
 Sund+0.036
hest+0.035
 [&+0.034
 deterrent+0.032
 guarantees+0.032
Neg logit contributions
 affair-0.027
cedented-0.027
angering-0.027
addon-0.026
Access-0.026
Token Leg
Token ID3564
Feature activation+1.25e-01

Pos logit contributions
 affair+0.057
cedented+0.056
angering+0.056
addon+0.056
Access+0.055
Neg logit contributions
 Sund-0.086
hest-0.083
 [&-0.080
 guarantees-0.076
 deterrent-0.075
Tokenolas
Token ID12456
Feature activation-5.43e-01

Pos logit contributions
 affair+0.214
issy+0.214
addon+0.214
angering+0.212
cedented+0.210
Neg logit contributions
 Sund-0.352
hest-0.341
 [&-0.329
 deterrent-0.321
 guarantees-0.318
Token the
Token ID262
Feature activation+6.05e-02

Pos logit contributions
 Sund+1.403
hest+1.359
 [&+1.290
 guarantees+1.217
+1.214
Neg logit contributions
cedented-1.094
 affair-1.085
angering-1.075
Access-1.069
addon-1.063
Token Elf
Token ID19067
Feature activation-1.39e-01

Pos logit contributions
 affair+0.110
cedented+0.109
angering+0.108
addon+0.107
issy+0.106
Neg logit contributions
 Sund-0.167
hest-0.163
 [&-0.156
 deterrent-0.148
 guarantees-0.148
Token,
Token ID11
Feature activation+2.64e-02

Pos logit contributions
 Sund+0.361
hest+0.352
 [&+0.336
 deterrent+0.316
+0.315
Neg logit contributions
 affair-0.275
cedented-0.275
angering-0.272
addon-0.268
Access-0.268
Token and
Token ID290
Feature activation+9.41e-02

Pos logit contributions
 affair+0.048
cedented+0.048
angering+0.047
addon+0.047
quished+0.046
Neg logit contributions
 Sund-0.073
hest-0.071
 [&-0.068
 guarantees-0.065
 deterrent-0.064
Token the
Token ID262
Feature activation+4.22e-02

Pos logit contributions
 affair+0.168
cedented+0.166
angering+0.165
addon+0.163
issy+0.161
Neg logit contributions
 Sund-0.263
hest-0.257
 [&-0.247
 deterrent-0.234
 guarantees-0.233
Token Mush
Token ID24257
Feature activation+2.13e-01

Pos logit contributions
 affair+0.010
cedented+0.010
angering+0.010
addon+0.009
quished+0.009
Neg logit contributions
 Sund-0.014
hest-0.014
 [&-0.013
 deterrent-0.013
 guarantees-0.013
Tokenrooms
Token ID9649
Feature activation+1.85e-01

Pos logit contributions
 affair+0.530
cedented+0.524
angering+0.523
addon+0.519
Access+0.515
Neg logit contributions
 Sund-0.448
hest-0.434
 [&-0.411
 deterrent-0.381
 guarantees-0.380
Token are
Token ID389
Feature activation-1.74e-03

Pos logit contributions
 affair+0.373
cedented+0.367
angering+0.366
addon+0.363
issy+0.359
Neg logit contributions
 Sund-0.474
hest-0.463
 [&-0.443
 deterrent-0.417
 guarantees-0.416
Token like
Token ID588
Feature activation+2.51e-02

Pos logit contributions
 Sund+0.004
hest+0.004
 [&+0.004
 deterrent+0.004
 guarantees+0.004
Neg logit contributions
 affair-0.004
cedented-0.003
angering-0.003
addon-0.003
issy-0.003
Token snow
Token ID6729
Feature activation-1.20e-01

Pos logit contributions
 affair+0.051
cedented+0.050
angering+0.050
addon+0.050
issy+0.049
Neg logit contributions
 Sund-0.064
hest-0.063
 [&-0.060
 deterrent-0.056
 guarantees-0.056
Tokenfl
Token ID2704
Feature activation-5.40e-01

Pos logit contributions
 Sund+0.269
hest+0.261
 [&+0.248
 deterrent+0.231
 guarantees+0.231
Neg logit contributions
 affair-0.279
cedented-0.276
angering-0.276
addon-0.273
issy-0.270
Tokenakes
Token ID1124
Feature activation-1.84e-01

Pos logit contributions
 Sund+1.080
hest+1.054
 [&+0.985
eks+0.911
+0.911
Neg logit contributions
 affair-1.394
cedented-1.379
angering-1.377
addon-1.360
Access-1.349
Token.
Token ID13
Feature activation-7.58e-02

Pos logit contributions
 Sund+0.453
hest+0.440
 [&+0.420
 deterrent+0.395
 guarantees+0.394
Neg logit contributions
 affair-0.389
cedented-0.386
angering-0.385
addon-0.381
Access-0.377
Token No
Token ID1400
Feature activation-2.62e-02

Pos logit contributions
 Sund+0.199
hest+0.193
 [&+0.185
 deterrent+0.174
 guarantees+0.174
Neg logit contributions
 affair-0.149
cedented-0.148
angering-0.148
addon-0.146
quished-0.144
Token two
Token ID734
Feature activation+1.98e-03

Pos logit contributions
 Sund+0.067
hest+0.065
 [&+0.062
 deterrent+0.058
 guarantees+0.058
Neg logit contributions
 affair-0.053
cedented-0.053
angering-0.053
addon-0.052
issy-0.051
Token are
Token ID389
Feature activation-4.14e-02

Pos logit contributions
 affair+0.004
cedented+0.004
angering+0.004
addon+0.004
issy+0.004
Neg logit contributions
 Sund-0.005
hest-0.005
 [&-0.005
 deterrent-0.005
 guarantees-0.005
Token PNG
Token ID36182
Feature activation-1.06e-01

Pos logit contributions
angering+0.346
cedented+0.346
 affair+0.345
addon+0.343
quished+0.337
Neg logit contributions
 Sund-0.325
hest-0.309
 [&-0.292
 guarantees-0.276
eks-0.271
Token larger
Token ID4025
Feature activation-2.34e-01

Pos logit contributions
 Sund+0.246
hest+0.239
 [&+0.228
 deterrent+0.213
 guarantees+0.213
Neg logit contributions
 affair-0.241
angering-0.238
cedented-0.238
addon-0.236
quished-0.234
Token image
Token ID2939
Feature activation-7.10e-02

Pos logit contributions
 Sund+0.506
hest+0.491
 [&+0.465
 deterrent+0.433
 guarantees+0.432
Neg logit contributions
 affair-0.566
cedented-0.560
angering-0.560
addon-0.555
Access-0.550
Token\n
Token ID198
Feature activation+1.41e-02

Pos logit contributions
 Sund+0.192
hest+0.188
 [&+0.180
 deterrent+0.171
 guarantees+0.170
Neg logit contributions
 affair-0.132
cedented-0.130
angering-0.130
addon-0.129
Access-0.128
Token\n
Token ID198
Feature activation+5.04e-02

Pos logit contributions
 affair+0.025
Access+0.024
addon+0.024
cedented+0.024
angering+0.024
Neg logit contributions
 Sund-0.039
hest-0.039
 [&-0.037
 deterrent-0.036
 guarantees-0.035
Tokenlarg
Token ID15521
Feature activation-5.39e-01

Pos logit contributions
Access+0.132
cedented+0.130
angering+0.129
addon+0.128
 affair+0.128
Neg logit contributions
 Sund-0.099
hest-0.097
 [&-0.091
 deterrent-0.087
 guarantees-0.084
Tokener
Token ID263
Feature activation-7.56e-02

Pos logit contributions
 Sund+1.431
hest+1.399
 [&+1.338
 guarantees+1.261
 deterrent+1.261
Neg logit contributions
 affair-1.033
angering-1.025
cedented-1.024
addon-1.010
issy-0.999
Token image
Token ID2939
Feature activation-4.68e-02

Pos logit contributions
 Sund+0.170
hest+0.167
 [&+0.157
 deterrent+0.148
 guarantees+0.147
Neg logit contributions
 affair-0.177
cedented-0.171
addon-0.171
angering-0.170
Access-0.169
Token T
Token ID309
Feature activation-4.21e-02

Pos logit contributions
 Sund+0.127
hest+0.124
 [&+0.119
 deterrent+0.112
 guarantees+0.112
Neg logit contributions
 affair-0.087
cedented-0.086
angering-0.086
addon-0.085
issy-0.084
TokenIFF
Token ID29267
Feature activation-9.46e-02

Pos logit contributions
 Sund+0.097
hest+0.094
 [&+0.092
 guarantees+0.085
 deterrent+0.084
Neg logit contributions
 affair-0.095
angering-0.094
addon-0.093
cedented-0.093
Access-0.093
Token original
Token ID2656
Feature activation+4.06e-02

Pos logit contributions
hest+0.184
 Sund+0.180
 [&+0.167
eks+0.161
 Canaver+0.160
Neg logit contributions
 affair-0.247
addon-0.242
Access-0.240
cedented-0.238
issy-0.236
Token has
Token ID468
Feature activation-8.96e-02

Pos logit contributions
 Sund+0.201
hest+0.197
 [&+0.188
 deterrent+0.177
 guarantees+0.175
Neg logit contributions
 affair-0.168
angering-0.165
cedented-0.165
addon-0.164
Access-0.162
Token mostly
Token ID4632
Feature activation-5.39e-02

Pos logit contributions
 Sund+0.241
hest+0.237
 [&+0.227
 deterrent+0.214
 guarantees+0.213
Neg logit contributions
 affair-0.169
cedented-0.165
angering-0.165
addon-0.164
Access-0.162
Token been
Token ID587
Feature activation-1.02e-01

Pos logit contributions
 Sund+0.153
hest+0.150
 [&+0.145
 deterrent+0.136
 guarantees+0.136
Neg logit contributions
 affair-0.093
cedented-0.091
angering-0.091
addon-0.090
Access-0.089
Token turned
Token ID2900
Feature activation+3.77e-02

Pos logit contributions
 Sund+0.293
hest+0.287
 [&+0.275
 deterrent+0.262
 guarantees+0.261
Neg logit contributions
 affair-0.172
cedented-0.170
angering-0.169
addon-0.167
issy-0.165
Token into
Token ID656
Feature activation-8.51e-02

Pos logit contributions
 affair+0.097
angering+0.095
cedented+0.094
addon+0.094
issy+0.094
Neg logit contributions
 Sund-0.076
hest-0.075
 [&-0.070
 deterrent-0.064
eks-0.064
Token cond
Token ID1779
Feature activation-5.27e-01

Pos logit contributions
 Sund+0.185
hest+0.182
 [&+0.172
 deterrent+0.159
eks+0.158
Neg logit contributions
 affair-0.204
angering-0.201
addon-0.199
cedented-0.199
Access-0.197
Tokenominium
Token ID46134
Feature activation-2.83e-01

Pos logit contributions
 Sund+1.004
hest+0.972
 [&+0.907
+0.832
eks+0.831
Neg logit contributions
 affair-1.403
cedented-1.397
angering-1.389
quished-1.374
addon-1.371
Tokens
Token ID82
Feature activation-9.58e-02

Pos logit contributions
 Sund+0.679
hest+0.659
 [&+0.628
 deterrent+0.590
 guarantees+0.590
Neg logit contributions
 affair-0.616
cedented-0.612
angering-0.609
addon-0.603
quished-0.598
Token but
Token ID475
Feature activation-1.29e-01

Pos logit contributions
 Sund+0.241
hest+0.236
 [&+0.225
 deterrent+0.212
 guarantees+0.211
Neg logit contributions
 affair-0.197
angering-0.194
cedented-0.194
addon-0.192
issy-0.190
Token has
Token ID468
Feature activation-4.49e-02

Pos logit contributions
 Sund+0.338
hest+0.333
 [&+0.318
 deterrent+0.300
+0.296
Neg logit contributions
 affair-0.251
angering-0.245
addon-0.244
cedented-0.244
Access-0.241
Token some
Token ID617
Feature activation-1.15e-01

Pos logit contributions
 Sund+0.123
hest+0.121
 [&+0.116
 deterrent+0.110
+0.109
Neg logit contributions
 affair-0.083
angering-0.080
cedented-0.080
addon-0.080
Access-0.079
Token action
Token ID2223
Feature activation+2.68e-02

Pos logit contributions
 affair+0.054
angering+0.054
cedented+0.053
addon+0.053
issy+0.052
Neg logit contributions
 Sund-0.068
hest-0.066
 [&-0.063
 deterrent-0.059
 guarantees-0.059
Token,
Token ID11
Feature activation-3.77e-02

Pos logit contributions
 affair+0.055
angering+0.054
cedented+0.054
addon+0.054
issy+0.053
Neg logit contributions
 Sund-0.068
hest-0.066
 [&-0.063
 deterrent-0.059
 guarantees-0.059
Token grabbing
Token ID23256
Feature activation+1.41e-01

Pos logit contributions
 Sund+0.099
hest+0.097
 [&+0.092
 deterrent+0.087
 guarantees+0.087
Neg logit contributions
 affair-0.073
cedented-0.073
angering-0.072
addon-0.072
issy-0.071
Token her
Token ID607
Feature activation-2.18e-02

Pos logit contributions
 affair+0.260
angering+0.254
addon+0.252
cedented+0.252
Access+0.249
Neg logit contributions
 Sund-0.384
hest-0.379
 [&-0.365
 deterrent-0.343
 guarantees-0.341
Token M
Token ID337
Feature activation+5.01e-02

Pos logit contributions
 Sund+0.056
hest+0.055
 [&+0.052
 deterrent+0.049
 guarantees+0.049
Neg logit contributions
 affair-0.044
angering-0.043
cedented-0.043
addon-0.043
Access-0.042
Tokenamiya
Token ID38241
Feature activation-5.27e-01

Pos logit contributions
 affair+0.108
angering+0.106
addon+0.106
issy+0.105
Access+0.105
Neg logit contributions
 Sund-0.121
hest-0.118
 [&-0.112
 deterrent-0.107
 guarantees-0.106
Token 7
Token ID767
Feature activation+1.51e-02

Pos logit contributions
 Sund+1.376
hest+1.335
 [&+1.279
 deterrent+1.208
 guarantees+1.204
Neg logit contributions
cedented-1.035
 affair-1.034
angering-1.029
addon-1.019
quished-1.009
Token camera
Token ID4676
Feature activation+7.21e-02

Pos logit contributions
 affair+0.035
angering+0.035
cedented+0.035
addon+0.034
Access+0.034
Neg logit contributions
 Sund-0.034
hest-0.033
 [&-0.031
 guarantees-0.029
 deterrent-0.029
Token from
Token ID422
Feature activation+2.71e-02

Pos logit contributions
 affair+0.142
angering+0.139
cedented+0.138
addon+0.137
Access+0.136
Neg logit contributions
 Sund-0.189
hest-0.185
 [&-0.177
 deterrent-0.166
 guarantees-0.166
Token her
Token ID607
Feature activation-4.24e-02

Pos logit contributions
 affair+0.050
angering+0.049
cedented+0.049
addon+0.048
Access+0.048
Neg logit contributions
 Sund-0.074
hest-0.073
 [&-0.070
 deterrent-0.066
 guarantees-0.066
Token bag
Token ID6131
Feature activation-1.81e-01

Pos logit contributions
 Sund+0.111
hest+0.109
 [&+0.104
 guarantees+0.098
 deterrent+0.098
Neg logit contributions
 affair-0.083
angering-0.081
cedented-0.080
addon-0.080
Access-0.080
Token that
Token ID326
Feature activation-3.32e-02

Pos logit contributions
 Sund+0.211
hest+0.208
 [&+0.199
 deterrent+0.189
 guarantees+0.188
Neg logit contributions
 affair-0.136
angering-0.133
cedented-0.133
addon-0.131
Access-0.129
Token point
Token ID966
Feature activation-3.73e-02

Pos logit contributions
 Sund+0.073
hest+0.071
 [&+0.068
 deterrent+0.063
 guarantees+0.063
Neg logit contributions
 affair-0.079
angering-0.078
cedented-0.077
addon-0.077
Access-0.076
Token,
Token ID11
Feature activation-5.17e-02

Pos logit contributions
 Sund+0.095
hest+0.093
 [&+0.089
 deterrent+0.084
 guarantees+0.084
Neg logit contributions
 affair-0.075
angering-0.074
cedented-0.074
addon-0.073
Access-0.073
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
 Sund+0.136
hest+0.133
 [&+0.127
 deterrent+0.120
 guarantees+0.120
Neg logit contributions
 affair-0.101
cedented-0.100
angering-0.100
addon-0.099
quished-0.098
Token S
Token ID311
Feature activation-1.61e-01

Pos logit contributions
 Sund+0.038
hest+0.037
 [&+0.036
 guarantees+0.034
 deterrent+0.034
Neg logit contributions
 affair-0.026
cedented-0.026
angering-0.026
addon-0.026
quished-0.026
Tokenibur
Token ID38616
Feature activation-5.15e-01

Pos logit contributions
 Sund+0.414
hest+0.404
 [&+0.386
 deterrent+0.364
 guarantees+0.363
Neg logit contributions
 affair-0.322
cedented-0.318
angering-0.318
addon-0.314
issy-0.311
Token's
Token ID338
Feature activation-6.86e-02

Pos logit contributions
 Sund+1.447
hest+1.410
 [&+1.346
 guarantees+1.290
+1.280
Neg logit contributions
cedented-0.908
 affair-0.908
angering-0.905
addon-0.883
quished-0.881
Token annual
Token ID5079
Feature activation+7.94e-02

Pos logit contributions
 Sund+0.172
hest+0.169
 [&+0.161
 deterrent+0.151
+0.150
Neg logit contributions
 affair-0.142
angering-0.139
cedented-0.138
addon-0.137
Access-0.137
Token report
Token ID989
Feature activation+1.74e-01

Pos logit contributions
 affair+0.172
angering+0.168
cedented+0.167
addon+0.166
Access+0.166
Neg logit contributions
 Sund-0.191
hest-0.188
 [&-0.179
 deterrent-0.167
-0.167
Token for
Token ID329
Feature activation-7.42e-03

Pos logit contributions
 affair+0.323
angering+0.314
cedented+0.311
addon+0.309
issy+0.307
Neg logit contributions
 Sund-0.474
hest-0.467
 [&-0.445
 deterrent-0.423
-0.416
Token 2012
Token ID2321
Feature activation-1.30e-02

Pos logit contributions
 Sund+0.017
hest+0.017
 [&+0.016
 deterrent+0.015
+0.015
Neg logit contributions
 affair-0.017
angering-0.017
addon-0.016
cedented-0.016
Access-0.016
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
 affair+0.414
cedented+0.410
angering+0.409
addon+0.405
issy+0.401
Neg logit contributions
 Sund-0.461
hest-0.448
 [&-0.427
 deterrent-0.401
 guarantees-0.400
Token own
Token ID898
Feature activation-1.16e-01

Pos logit contributions
 Sund+0.385
hest+0.375
 [&+0.358
 deterrent+0.338
 guarantees+0.338
Neg logit contributions
 affair-0.292
cedented-0.289
angering-0.288
addon-0.285
issy-0.282
Token a
Token ID257
Feature activation-4.11e-02

Pos logit contributions
 Sund+0.292
hest+0.288
 [&+0.275
 deterrent+0.258
+0.256
Neg logit contributions
 affair-0.237
angering-0.232
cedented-0.231
addon-0.230
Access-0.228
Token series
Token ID2168
Feature activation-7.67e-02

Pos logit contributions
 Sund+0.097
hest+0.096
 [&+0.091
 guarantees+0.085
 deterrent+0.085
Neg logit contributions
 affair-0.091
angering-0.088
cedented-0.088
addon-0.087
Access-0.087
Token of
Token ID286
Feature activation-3.94e-02

Pos logit contributions
 Sund+0.200
hest+0.195
 [&+0.187
 deterrent+0.176
 guarantees+0.175
Neg logit contributions
 affair-0.151
angering-0.148
cedented-0.148
addon-0.147
Access-0.145
Token cond
Token ID1779
Feature activation-5.10e-01

Pos logit contributions
 Sund+0.096
hest+0.095
 [&+0.090
 deterrent+0.085
+0.084
Neg logit contributions
 affair-0.084
angering-0.082
cedented-0.082
addon-0.081
Access-0.081
Tokenominium
Token ID46134
Feature activation-2.73e-01

Pos logit contributions
 Sund+0.992
hest+0.960
 [&+0.900
+0.825
 deterrent+0.823
Neg logit contributions
 affair-1.339
cedented-1.328
angering-1.325
addon-1.308
quished-1.306
Token properties
Token ID6608
Feature activation-2.15e-01

Pos logit contributions
 Sund+0.646
hest+0.626
 [&+0.596
 guarantees+0.561
 deterrent+0.560
Neg logit contributions
 affair-0.604
cedented-0.601
angering-0.598
addon-0.593
quished-0.588
Token,
Token ID11
Feature activation-3.58e-02

Pos logit contributions
 Sund+0.553
hest+0.538
 [&+0.514
 deterrent+0.485
 guarantees+0.485
Neg logit contributions
 affair-0.430
cedented-0.426
angering-0.425
addon-0.420
quished-0.416
Token some
Token ID617
Feature activation-1.37e-01

Pos logit contributions
 Sund+0.090
hest+0.088
 [&+0.084
 deterrent+0.079
 guarantees+0.078
Neg logit contributions
 affair-0.074
angering-0.073
cedented-0.073
addon-0.072
Access-0.071
Token of
Token ID286
Feature activation-1.01e-01

Pos logit contributions
 Sund+0.360
hest+0.352
 [&+0.337
 deterrent+0.318
 guarantees+0.316
Neg logit contributions
 affair-0.269
angering-0.265
cedented-0.264
addon-0.262
Access-0.259
Token so
Token ID523
Feature activation-1.04e-02

Pos logit contributions
 affair+0.021
cedented+0.021
angering+0.020
addon+0.020
issy+0.020
Neg logit contributions
 Sund-0.031
hest-0.030
 [&-0.029
 guarantees-0.027
-0.027
Token-
Token ID12
Feature activation-2.32e-02

Pos logit contributions
 Sund+0.019
hest+0.018
 [&+0.017
 deterrent+0.016
 guarantees+0.016
Neg logit contributions
 affair-0.029
angering-0.028
cedented-0.028
addon-0.028
quished-0.028
Tokencalled
Token ID7174
Feature activation+1.86e-01

Pos logit contributions
 Sund+0.079
hest+0.077
 [&+0.074
 deterrent+0.071
 guarantees+0.071
Neg logit contributions
 affair-0.028
cedented-0.027
angering-0.027
addon-0.027
issy-0.026
Token
Token ID564
Feature activation-8.03e-02

Pos logit contributions
 affair+0.301
cedented+0.291
angering+0.289
Access+0.288
issy+0.286
Neg logit contributions
 Sund-0.550
hest-0.541
 [&-0.519
eks-0.490
-0.489
Token
Token ID250
Feature activation+2.26e-02

Pos logit contributions
 Sund+0.219
hest+0.214
 [&+0.205
 deterrent+0.195
 guarantees+0.194
Neg logit contributions
 affair-0.148
angering-0.147
cedented-0.146
addon-0.144
quished-0.143
Tokentravel
Token ID35927
Feature activation-4.99e-01

Pos logit contributions
 affair+0.035
cedented+0.034
angering+0.034
addon+0.034
Access+0.034
Neg logit contributions
 Sund-0.069
hest-0.067
 [&-0.065
 deterrent-0.062
 guarantees-0.061
Token ban
Token ID3958
Feature activation-3.68e-01

Pos logit contributions
 Sund+1.310
hest+1.275
 [&+1.221
 deterrent+1.160
 guarantees+1.157
Neg logit contributions
 affair-0.969
cedented-0.961
angering-0.960
addon-0.948
issy-0.938
Token
Token ID447
Feature activation-7.88e-02

Pos logit contributions
 Sund+0.872
hest+0.847
 [&+0.807
 deterrent+0.759
 guarantees+0.757
Neg logit contributions
 affair-0.811
cedented-0.803
angering-0.802
addon-0.794
quished-0.786
Token
Token ID251
Feature activation-5.49e-02

Pos logit contributions
 Sund+0.243
hest+0.238
 [&+0.229
 deterrent+0.219
 guarantees+0.218
Neg logit contributions
 affair-0.118
cedented-0.116
angering-0.116
addon-0.114
issy-0.112
Token on
Token ID319
Feature activation-2.98e-02

Pos logit contributions
 Sund+0.138
hest+0.134
 [&+0.128
 deterrent+0.121
 guarantees+0.120
Neg logit contributions
 affair-0.113
cedented-0.112
angering-0.112
addon-0.111
Access-0.110
Token certain
Token ID1728
Feature activation+1.21e-01

Pos logit contributions
 Sund+0.084
hest+0.082
 [&+0.079
 deterrent+0.074
 guarantees+0.074
Neg logit contributions
 affair-0.053
cedented-0.052
angering-0.052
addon-0.051
Access-0.051
Token also
Token ID635
Feature activation+1.65e-03

Pos logit contributions
 Sund+0.169
hest+0.164
 [&+0.157
 deterrent+0.148
 guarantees+0.148
Neg logit contributions
 affair-0.134
cedented-0.132
angering-0.132
addon-0.131
quished-0.129
Token pretty
Token ID2495
Feature activation-3.16e-01

Pos logit contributions
 affair+0.003
cedented+0.003
angering+0.003
addon+0.003
issy+0.003
Neg logit contributions
 Sund-0.004
hest-0.004
 [&-0.004
 deterrent-0.004
 guarantees-0.004
Token incredible
Token ID8082
Feature activation-1.77e-01

Pos logit contributions
 Sund+0.793
hest+0.773
 [&+0.738
 deterrent+0.695
 guarantees+0.693
Neg logit contributions
 affair-0.652
cedented-0.646
angering-0.645
addon-0.638
quished-0.631
Token with
Token ID351
Feature activation-1.53e-01

Pos logit contributions
 Sund+0.460
hest+0.449
 [&+0.430
 deterrent+0.405
 guarantees+0.404
Neg logit contributions
 affair-0.351
cedented-0.345
angering-0.345
addon-0.340
issy-0.338
Token st
Token ID336
Feature activation-1.48e-01

Pos logit contributions
 Sund+0.402
hest+0.392
 [&+0.375
 deterrent+0.354
 guarantees+0.353
Neg logit contributions
 affair-0.299
cedented-0.295
angering-0.294
addon-0.291
issy-0.288
Tokenews
Token ID15515
Feature activation-4.98e-01

Pos logit contributions
 Sund+0.370
hest+0.359
 [&+0.345
 deterrent+0.325
 guarantees+0.324
Neg logit contributions
 affair-0.309
angering-0.305
cedented-0.304
addon-0.303
issy-0.299
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
 Sund+1.219
hest+1.186
 [&+1.131
 deterrent+1.063
 guarantees+1.062
Neg logit contributions
 affair-1.058
cedented-1.050
angering-1.047
addon-1.034
quished-1.025
Token grains
Token ID21824
Feature activation-2.36e-01

Pos logit contributions
 Sund+0.377
hest+0.368
 [&+0.351
 deterrent+0.330
 guarantees+0.329
Neg logit contributions
 affair-0.320
cedented-0.316
angering-0.316
addon-0.313
issy-0.309
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 Sund+0.616
hest+0.599
 [&+0.571
+0.540
 guarantees+0.540
Neg logit contributions
 affair-0.465
cedented-0.463
angering-0.461
addon-0.455
quished-0.451
Token sou
Token ID24049
Feature activation-6.89e-02

Pos logit contributions
 Sund+0.383
hest+0.372
 [&+0.355
 deterrent+0.334
 guarantees+0.333
Neg logit contributions
 affair-0.334
cedented-0.332
angering-0.331
addon-0.327
quished-0.324
Tokenps
Token ID862
Feature activation-2.02e-01

Pos logit contributions
 Sund+0.184
hest+0.178
 [&+0.172
 deterrent+0.163
 guarantees+0.161
Neg logit contributions
 affair-0.132
angering-0.131
cedented-0.131
addon-0.128
issy-0.128
Token à
Token ID28141
Feature activation-9.20e-02

Pos logit contributions
 Sund+0.236
hest+0.231
 [&+0.221
 deterrent+0.209
 guarantees+0.209
Neg logit contributions
 affair-0.158
cedented-0.156
angering-0.155
addon-0.153
issy-0.152
Token cher
Token ID21382
Feature activation-1.25e-01

Pos logit contributions
 Sund+0.223
hest+0.218
 [&+0.208
 deterrent+0.195
 guarantees+0.194
Neg logit contributions
 affair-0.199
angering-0.195
cedented-0.194
addon-0.192
Access-0.191
Tokencher
Token ID2044
Feature activation-2.92e-01

Pos logit contributions
 Sund+0.241
hest+0.233
 [&+0.219
 deterrent+0.202
 guarantees+0.201
Neg logit contributions
 affair-0.330
cedented-0.327
angering-0.326
addon-0.324
issy-0.321
Token le
Token ID443
Feature activation-7.73e-02

Pos logit contributions
 Sund+0.718
hest+0.699
 [&+0.666
 deterrent+0.627
 guarantees+0.625
Neg logit contributions
 affair-0.619
cedented-0.614
angering-0.612
addon-0.606
issy-0.600
Token conf
Token ID1013
Feature activation-1.88e-01

Pos logit contributions
 Sund+0.192
hest+0.187
 [&+0.178
 deterrent+0.168
 guarantees+0.167
Neg logit contributions
 affair-0.163
angering-0.160
cedented-0.160
addon-0.158
Access-0.157
Tokenlit
Token ID18250
Feature activation-4.96e-01

Pos logit contributions
 Sund+0.488
hest+0.474
 [&+0.455
 deterrent+0.430
 guarantees+0.429
Neg logit contributions
 affair-0.374
angering-0.369
cedented-0.369
addon-0.365
issy-0.362
Token a
Token ID257
Feature activation+2.70e-03

Pos logit contributions
 Sund+1.252
hest+1.221
 [&+1.164
 deterrent+1.097
 guarantees+1.096
Neg logit contributions
 affair-1.016
cedented-1.010
angering-1.006
addon-0.997
issy-0.986
Tokenvec
Token ID35138
Feature activation-1.62e-01

Pos logit contributions
 affair+0.007
angering+0.007
cedented+0.007
addon+0.007
issy+0.007
Neg logit contributions
 Sund-0.005
hest-0.005
 [&-0.005
 deterrent-0.004
 guarantees-0.004
Token nos
Token ID43630
Feature activation-1.13e-02

Pos logit contributions
 Sund+0.388
hest+0.379
 [&+0.361
 deterrent+0.338
 guarantees+0.336
Neg logit contributions
 affair-0.355
angering-0.347
cedented-0.346
addon-0.342
Access-0.341
Token an
Token ID281
Feature activation+3.56e-02

Pos logit contributions
 Sund+0.027
hest+0.027
 [&+0.025
 deterrent+0.024
+0.024
Neg logit contributions
 affair-0.024
angering-0.024
cedented-0.024
addon-0.024
quished-0.023
Tokenci
Token ID979
Feature activation-2.32e-02

Pos logit contributions
 affair+0.079
cedented+0.078
angering+0.077
addon+0.076
quished+0.076
Neg logit contributions
 Sund-0.085
hest-0.082
 [&-0.079
 deterrent-0.074
 guarantees-0.073
Token mail
Token ID6920
Feature activation-1.28e-02

Pos logit contributions
 Sund+0.454
hest+0.441
 [&+0.422
 deterrent+0.395
 guarantees+0.394
Neg logit contributions
 affair-0.398
cedented-0.392
angering-0.391
addon-0.387
issy-0.383
Token and
Token ID290
Feature activation-8.07e-03

Pos logit contributions
 Sund+0.033
hest+0.032
 [&+0.030
 deterrent+0.029
 guarantees+0.029
Neg logit contributions
 affair-0.026
cedented-0.026
angering-0.026
addon-0.025
issy-0.025
Token requiring
Token ID10616
Feature activation-7.61e-02

Pos logit contributions
 Sund+0.021
hest+0.020
 [&+0.019
 deterrent+0.018
 guarantees+0.018
Neg logit contributions
 affair-0.016
angering-0.016
cedented-0.016
addon-0.016
Access-0.016
Token a
Token ID257
Feature activation+5.27e-03

Pos logit contributions
 Sund+0.211
hest+0.208
 [&+0.200
 deterrent+0.188
+0.187
Neg logit contributions
 affair-0.137
angering-0.134
cedented-0.132
addon-0.131
Access-0.131
Token body
Token ID1767
Feature activation+3.31e-02

Pos logit contributions
 affair+0.011
angering+0.011
cedented+0.011
addon+0.010
Access+0.010
Neg logit contributions
 Sund-0.013
hest-0.013
 [&-0.012
 guarantees-0.012
 deterrent-0.012
Tokenguard
Token ID14864
Feature activation-4.94e-01

Pos logit contributions
 affair+0.071
angering+0.069
cedented+0.068
Access+0.068
addon+0.068
Neg logit contributions
 Sund-0.081
hest-0.080
 [&-0.075
eks-0.071
 guarantees-0.070
Token for
Token ID329
Feature activation-2.74e-02

Pos logit contributions
 Sund+1.157
hest+1.124
 [&+1.070
 deterrent+1.002
 guarantees+1.000
Neg logit contributions
 affair-1.106
cedented-1.095
angering-1.093
addon-1.082
issy-1.071
Token some
Token ID617
Feature activation-2.83e-02

Pos logit contributions
 Sund+0.069
hest+0.068
 [&+0.065
 deterrent+0.061
 guarantees+0.061
Neg logit contributions
 affair-0.056
angering-0.055
cedented-0.055
addon-0.055
Access-0.054
Token of
Token ID286
Feature activation-3.44e-02

Pos logit contributions
 Sund+0.071
hest+0.069
 [&+0.066
 deterrent+0.062
 guarantees+0.062
Neg logit contributions
 affair-0.059
angering-0.058
cedented-0.058
addon-0.057
Access-0.057
Token her
Token ID607
Feature activation-5.12e-02

Pos logit contributions
 Sund+0.089
hest+0.087
 [&+0.083
 deterrent+0.079
 guarantees+0.078
Neg logit contributions
 affair-0.068
angering-0.067
cedented-0.067
addon-0.066
Access-0.066
Token speaking
Token ID5486
Feature activation+7.73e-02

Pos logit contributions
 Sund+0.122
hest+0.119
 [&+0.114
 deterrent+0.106
eks+0.105
Neg logit contributions
 affair-0.114
angering-0.111
cedented-0.109
addon-0.109
Access-0.109
Token,
Token ID11
Feature activation-7.27e-02

Pos logit contributions
 affair+0.108
angering+0.106
cedented+0.105
addon+0.105
issy+0.104
Neg logit contributions
 Sund-0.124
hest-0.122
 [&-0.116
 deterrent-0.108
 guarantees-0.108
Token even
Token ID772
Feature activation+1.66e-02

Pos logit contributions
 Sund+0.191
hest+0.187
 [&+0.179
 deterrent+0.169
+0.168
Neg logit contributions
 affair-0.142
angering-0.140
cedented-0.138
addon-0.137
issy-0.136
Token though
Token ID996
Feature activation+2.26e-02

Pos logit contributions
 affair+0.033
angering+0.033
addon+0.032
cedented+0.032
issy+0.032
Neg logit contributions
 Sund-0.043
hest-0.042
 [&-0.040
 deterrent-0.037
-0.037
Token they
Token ID484
Feature activation-1.47e-01

Pos logit contributions
 affair+0.047
angering+0.047
cedented+0.046
addon+0.046
issy+0.046
Neg logit contributions
 Sund-0.056
hest-0.054
 [&-0.052
 deterrent-0.049
 guarantees-0.049
Token all
Token ID477
Feature activation-1.03e-01

Pos logit contributions
 Sund+0.382
hest+0.372
 [&+0.356
 deterrent+0.336
 guarantees+0.335
Neg logit contributions
 affair-0.291
cedented-0.287
angering-0.287
addon-0.283
issy-0.280
Token passed
Token ID3804
Feature activation-4.91e-01

Pos logit contributions
 Sund+0.272
hest+0.265
 [&+0.254
 deterrent+0.239
 guarantees+0.238
Neg logit contributions
 affair-0.199
angering-0.195
cedented-0.194
addon-0.193
issy-0.191
Token.
Token ID13
Feature activation-8.66e-02

Pos logit contributions
 Sund+1.223
hest+1.187
 [&+1.127
 guarantees+1.072
 deterrent+1.070
Neg logit contributions
cedented-1.032
 affair-1.024
angering-1.015
addon-1.009
quished-1.001
Token
Token ID198
Feature activation-7.66e-02

Pos logit contributions
 Sund+0.234
hest+0.227
 [&+0.218
 guarantees+0.205
 deterrent+0.205
Neg logit contributions
 affair-0.164
cedented-0.162
angering-0.162
addon-0.160
quished-0.158
Token
Token ID198
Feature activation-8.03e-02

Pos logit contributions
 Sund+0.211
hest+0.206
 [&+0.197
 deterrent+0.190
 guarantees+0.187
Neg logit contributions
 affair-0.139
cedented-0.136
angering-0.135
addon-0.135
Access-0.135
TokenAn
Token ID2025
Feature activation-1.10e-01

Pos logit contributions
 Sund+0.216
hest+0.211
 [&+0.202
 deterrent+0.191
 guarantees+0.191
Neg logit contributions
 affair-0.152
cedented-0.150
angering-0.149
addon-0.148
Access-0.146
Token all
Token ID477
Feature activation-1.33e-01

Pos logit contributions
 Sund+0.272
hest+0.266
 [&+0.254
 deterrent+0.239
 guarantees+0.238
Neg logit contributions
 affair-0.234
cedented-0.230
angering-0.228
addon-0.227
Access-0.226
Token,
Token ID11
Feature activation+4.12e-02

Pos logit contributions
 Sund+0.560
hest+0.542
 [&+0.518
 guarantees+0.489
 deterrent+0.488
Neg logit contributions
 affair-0.466
cedented-0.464
angering-0.463
addon-0.457
issy-0.453
Token Munich
Token ID22418
Feature activation-5.97e-02

Pos logit contributions
 affair+0.079
cedented+0.078
angering+0.078
addon+0.077
issy+0.076
Neg logit contributions
 Sund-0.110
hest-0.107
 [&-0.102
 deterrent-0.097
 guarantees-0.097
Token design
Token ID1486
Feature activation+1.52e-01

Pos logit contributions
 Sund+0.153
hest+0.148
 [&+0.141
 guarantees+0.134
 deterrent+0.133
Neg logit contributions
 affair-0.121
cedented-0.121
angering-0.120
addon-0.119
issy-0.118
Token student
Token ID3710
Feature activation-9.04e-02

Pos logit contributions
 affair+0.347
angering+0.340
cedented+0.340
addon+0.337
issy+0.334
Neg logit contributions
 Sund-0.352
hest-0.345
 [&-0.325
 deterrent-0.305
-0.303
Token Mor
Token ID3461
Feature activation+1.15e-01

Pos logit contributions
 Sund+0.241
hest+0.235
 [&+0.225
 deterrent+0.213
 guarantees+0.212
Neg logit contributions
 affair-0.173
cedented-0.171
angering-0.171
addon-0.168
issy-0.167
Tokenitz
Token ID4224
Feature activation+3.94e-01

Pos logit contributions
 affair+0.112
cedented+0.109
angering+0.109
addon+0.106
issy+0.104
Neg logit contributions
 Sund-0.414
hest-0.407
 [&-0.394
 deterrent-0.379
 guarantees-0.378
Token Bre
Token ID3719
Feature activation-4.77e-02

Pos logit contributions
 affair+0.758
angering+0.745
cedented+0.744
addon+0.740
issy+0.734
Neg logit contributions
 Sund-1.029
hest-1.014
 [&-0.971
 deterrent-0.924
 guarantees-0.918
Tokenit
Token ID270
Feature activation+1.77e-01

Pos logit contributions
 Sund+0.116
hest+0.112
 [&+0.108
 deterrent+0.102
 guarantees+0.101
Neg logit contributions
 affair-0.102
cedented-0.101
angering-0.101
addon-0.100
issy-0.100
Tokenen
Token ID268
Feature activation+2.72e-01

Pos logit contributions
 affair+0.367
cedented+0.363
angering+0.363
addon+0.360
issy+0.355
Neg logit contributions
 Sund-0.441
hest-0.429
 [&-0.410
 deterrent-0.388
 guarantees-0.385
Tokenhu
Token ID13415
Feature activation+5.85e-03

Pos logit contributions
 affair+0.464
cedented+0.458
angering+0.456
addon+0.450
issy+0.445
Neg logit contributions
 Sund-0.782
hest-0.764
 [&-0.734
 deterrent-0.698
 guarantees-0.695
Tokenber
Token ID527
Feature activation+3.30e-02

Pos logit contributions
 affair+0.010
cedented+0.009
angering+0.009
addon+0.009
Access+0.009
Neg logit contributions
 Sund-0.017
hest-0.017
 [&-0.016
 deterrent-0.015
 guarantees-0.015
Token.
Token ID13
Feature activation+7.50e-02

Pos logit contributions
 affair+0.455
cedented+0.449
angering+0.448
addon+0.447
Access+0.441
Neg logit contributions
 Sund-0.471
hest-0.461
 [&-0.439
 deterrent-0.412
 guarantees-0.408
Token How
Token ID1374
Feature activation-3.29e-02

Pos logit contributions
 affair+0.140
cedented+0.138
angering+0.137
addon+0.136
Access+0.135
Neg logit contributions
 Sund-0.203
hest-0.199
 [&-0.190
 deterrent-0.181
 guarantees-0.180
Token could
Token ID714
Feature activation-5.25e-02

Pos logit contributions
 Sund+0.083
hest+0.081
 [&+0.078
 deterrent+0.073
 guarantees+0.073
Neg logit contributions
 affair-0.067
cedented-0.066
angering-0.066
addon-0.066
issy-0.065
Token this
Token ID428
Feature activation+3.57e-02

Pos logit contributions
 Sund+0.133
hest+0.130
 [&+0.124
 deterrent+0.117
 guarantees+0.117
Neg logit contributions
 affair-0.107
cedented-0.106
angering-0.105
addon-0.104
issy-0.103
Token go
Token ID467
Feature activation+3.57e-02

Pos logit contributions
 affair+0.073
cedented+0.072
angering+0.072
addon+0.071
issy+0.070
Neg logit contributions
 Sund-0.091
hest-0.088
 [&-0.085
 deterrent-0.080
 guarantees-0.080
Token wrong
Token ID2642
Feature activation+3.22e-01

Pos logit contributions
 affair+0.056
cedented+0.055
angering+0.055
addon+0.054
issy+0.054
Neg logit contributions
 Sund-0.107
hest-0.105
 [&-0.102
 deterrent-0.096
 guarantees-0.096
Token?!
Token ID12248
Feature activation+1.29e-01

Pos logit contributions
 affair+0.719
angering+0.702
cedented+0.702
addon+0.697
quished+0.693
Neg logit contributions
 Sund-0.750
hest-0.736
 [&-0.701
 deterrent-0.655
 guarantees-0.652
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
 affair+0.241
cedented+0.236
angering+0.236
addon+0.234
Access+0.232
Neg logit contributions
 Sund-0.346
hest-0.340
 [&-0.324
 deterrent-0.309
 guarantees-0.308
Token
Token ID198
Feature activation+2.02e-02

Pos logit contributions
 affair+0.038
cedented+0.037
Access+0.037
addon+0.037
angering+0.037
Neg logit contributions
 Sund-0.060
hest-0.059
 [&-0.056
 deterrent-0.054
 guarantees-0.053
TokenWell
Token ID5779
Feature activation+2.31e-01

Pos logit contributions
 affair+0.036
cedented+0.036
addon+0.035
angering+0.035
Access+0.035
Neg logit contributions
 Sund-0.056
hest-0.055
 [&-0.053
 deterrent-0.051
 guarantees-0.050
Token,
Token ID11
Feature activation+3.19e-02

Pos logit contributions
 affair+0.452
cedented+0.444
angering+0.443
addon+0.440
quished+0.434
Neg logit contributions
 Sund-0.603
hest-0.590
 [&-0.566
 deterrent-0.534
 guarantees-0.532
Token
Token ID447
Feature activation-1.57e-02

Pos logit contributions
 affair+0.030
cedented+0.030
angering+0.030
addon+0.030
Access+0.029
Neg logit contributions
 Sund-0.045
hest-0.044
 [&-0.042
 deterrent-0.040
 guarantees-0.040
Token
Token ID247
Feature activation+5.03e-02

Pos logit contributions
 Sund+0.039
hest+0.038
 [&+0.036
 deterrent+0.034
 guarantees+0.034
Neg logit contributions
 affair-0.033
cedented-0.032
angering-0.032
addon-0.032
issy-0.032
Tokens
Token ID82
Feature activation+9.22e-02

Pos logit contributions
 affair+0.102
cedented+0.101
angering+0.101
addon+0.100
issy+0.099
Neg logit contributions
 Sund-0.128
hest-0.125
 [&-0.120
 deterrent-0.113
 guarantees-0.112
Token che
Token ID1125
Feature activation-4.01e-02

Pos logit contributions
 affair+0.208
angering+0.203
cedented+0.203
addon+0.202
Access+0.200
Neg logit contributions
 Sund-0.215
hest-0.211
 [&-0.200
 deterrent-0.187
 guarantees-0.187
Tokenc
Token ID66
Feature activation+1.59e-01

Pos logit contributions
 Sund+0.093
hest+0.090
 [&+0.086
 deterrent+0.081
 guarantees+0.081
Neg logit contributions
 affair-0.090
cedented-0.089
angering-0.089
addon-0.088
issy-0.088
Tokenkered
Token ID28970
Feature activation+3.22e-01

Pos logit contributions
 affair+0.366
cedented+0.358
addon+0.357
angering+0.355
issy+0.355
Neg logit contributions
 Sund-0.364
hest-0.351
 [&-0.337
 deterrent-0.318
 guarantees-0.316
Token history
Token ID2106
Feature activation+5.96e-02

Pos logit contributions
 affair+0.635
angering+0.595
cedented+0.594
Access+0.593
addon+0.590
Neg logit contributions
 Sund-0.846
hest-0.835
 [&-0.799
eks-0.752
 deterrent-0.750
Token at
Token ID379
Feature activation+2.60e-03

Pos logit contributions
 affair+0.119
angering+0.116
cedented+0.115
addon+0.115
quished+0.114
Neg logit contributions
 Sund-0.155
hest-0.152
 [&-0.145
 deterrent-0.136
 guarantees-0.135
Token the
Token ID262
Feature activation-2.54e-02

Pos logit contributions
 affair+0.005
cedented+0.005
angering+0.005
addon+0.005
issy+0.005
Neg logit contributions
 Sund-0.007
hest-0.007
 [&-0.007
 deterrent-0.006
 guarantees-0.006
Token Congress
Token ID3162
Feature activation+2.08e-01

Pos logit contributions
 Sund+0.069
hest+0.067
 [&+0.064
 deterrent+0.061
 guarantees+0.061
Neg logit contributions
 affair-0.048
angering-0.047
cedented-0.047
addon-0.046
issy-0.046
Token Theater
Token ID22207
Feature activation+1.15e-02

Pos logit contributions
 affair+0.402
angering+0.393
cedented+0.392
addon+0.389
Access+0.386
Neg logit contributions
 Sund-0.551
hest-0.540
 [&-0.515
 deterrent-0.489
 guarantees-0.486
Token are
Token ID389
Feature activation+2.45e-02

Pos logit contributions
 affair+0.446
angering+0.442
cedented+0.440
addon+0.434
issy+0.430
Neg logit contributions
 Sund-0.565
hest-0.550
 [&-0.528
 deterrent-0.495
 guarantees-0.494
Token also
Token ID635
Feature activation+9.03e-02

Pos logit contributions
 affair+0.046
angering+0.045
cedented+0.044
addon+0.044
Access+0.044
Neg logit contributions
 Sund-0.067
hest-0.065
 [&-0.063
-0.059
 guarantees-0.059
Token regarded
Token ID11987
Feature activation-1.70e-01

Pos logit contributions
 affair+0.161
angering+0.159
cedented+0.158
addon+0.155
Access+0.155
Neg logit contributions
 Sund-0.252
hest-0.247
 [&-0.237
 guarantees-0.223
 deterrent-0.223
Token as
Token ID355
Feature activation-5.94e-03

Pos logit contributions
 Sund+0.412
hest+0.400
 [&+0.381
 deterrent+0.360
 guarantees+0.358
Neg logit contributions
 affair-0.367
cedented-0.364
angering-0.363
addon-0.360
issy-0.356
Token a
Token ID257
Feature activation+2.82e-02

Pos logit contributions
 Sund+0.015
hest+0.014
 [&+0.014
 guarantees+0.013
 deterrent+0.013
Neg logit contributions
 affair-0.012
angering-0.012
cedented-0.012
addon-0.012
Access-0.012
Token special
Token ID2041
Feature activation+3.10e-01

Pos logit contributions
 affair+0.074
angering+0.074
cedented+0.074
addon+0.073
issy+0.072
Neg logit contributions
 Sund-0.055
hest-0.053
 [&-0.050
 guarantees-0.046
 deterrent-0.045
Token threat
Token ID2372
Feature activation-3.51e-03

Pos logit contributions
 affair+0.649
angering+0.636
cedented+0.627
Access+0.620
addon+0.616
Neg logit contributions
 Sund-0.778
hest-0.765
 [&-0.729
 guarantees-0.677
-0.675
Token or
Token ID393
Feature activation-2.83e-02

Pos logit contributions
 Sund+0.009
hest+0.009
 [&+0.008
 guarantees+0.008
 deterrent+0.008
Neg logit contributions
 affair-0.007
angering-0.007
cedented-0.007
addon-0.007
issy-0.007
Token whether
Token ID1771
Feature activation+9.41e-02

Pos logit contributions
 Sund+0.072
hest+0.071
 [&+0.067
 guarantees+0.063
+0.063
Neg logit contributions
 affair-0.058
angering-0.057
cedented-0.057
addon-0.056
Access-0.055
Token they
Token ID484
Feature activation+2.73e-02

Pos logit contributions
 affair+0.190
angering+0.187
cedented+0.186
addon+0.184
Access+0.183
Neg logit contributions
 Sund-0.241
hest-0.236
 [&-0.225
 deterrent-0.211
 guarantees-0.211
Token are
Token ID389
Feature activation+1.17e-02

Pos logit contributions
 affair+0.052
angering+0.051
cedented+0.050
addon+0.050
issy+0.049
Neg logit contributions
 Sund-0.073
hest-0.072
 [&-0.069
 deterrent-0.065
 guarantees-0.065
Token here
Token ID994
Feature activation-2.78e-02

Pos logit contributions
 affair+0.225
angering+0.222
cedented+0.222
addon+0.216
quished+0.215
Neg logit contributions
 Sund-0.708
hest-0.692
 [&-0.673
 guarantees-0.646
-0.644
Token.
Token ID13
Feature activation+3.38e-02

Pos logit contributions
 Sund+0.066
hest+0.064
 [&+0.061
 deterrent+0.057
 guarantees+0.057
Neg logit contributions
 affair-0.062
cedented-0.061
angering-0.061
addon-0.060
issy-0.060
Token<|endoftext|>
Token ID50256
Feature activation+1.17e-02

Pos logit contributions
 affair+0.059
cedented+0.059
angering+0.059
quished+0.057
addon+0.057
Neg logit contributions
 Sund-0.096
hest-0.093
 [&-0.090
-0.086
 guarantees-0.086
TokenMike
Token ID16073
Feature activation-3.72e-02

Pos logit contributions
 affair+0.022
cedented+0.022
angering+0.022
addon+0.021
Access+0.021
Neg logit contributions
 Sund-0.032
hest-0.031
 [&-0.029
 deterrent-0.028
 guarantees-0.028
Token Bloomberg
Token ID15689
Feature activation+1.98e-01

Pos logit contributions
 Sund+0.094
hest+0.092
 [&+0.088
 deterrent+0.083
 guarantees+0.082
Neg logit contributions
 affair-0.076
angering-0.075
cedented-0.074
addon-0.074
issy-0.073
Token may
Token ID743
Feature activation+3.16e-01

Pos logit contributions
 affair+0.380
angering+0.369
cedented+0.368
addon+0.367
Access+0.365
Neg logit contributions
 Sund-0.525
hest-0.516
 [&-0.494
 deterrent-0.467
 guarantees-0.462
Token well
Token ID880
Feature activation+6.30e-02

Pos logit contributions
 affair+0.618
angering+0.606
addon+0.599
cedented+0.597
issy+0.591
Neg logit contributions
 Sund-0.828
hest-0.808
 [&-0.776
 guarantees-0.731
 deterrent-0.731
Token be
Token ID307
Feature activation+6.07e-02

Pos logit contributions
 affair+0.131
angering+0.129
cedented+0.128
addon+0.127
issy+0.126
Neg logit contributions
 Sund-0.158
hest-0.154
 [&-0.147
 deterrent-0.138
 guarantees-0.138
Token,
Token ID11
Feature activation+1.12e-01

Pos logit contributions
 affair+0.113
cedented+0.112
angering+0.111
addon+0.110
Access+0.109
Neg logit contributions
 Sund-0.165
hest-0.161
 [&-0.154
 deterrent-0.146
 guarantees-0.146
Token as
Token ID355
Feature activation+1.56e-02

Pos logit contributions
 affair+0.232
cedented+0.229
angering+0.229
addon+0.226
issy+0.224
Neg logit contributions
 Sund-0.283
hest-0.276
 [&-0.263
 deterrent-0.248
 guarantees-0.247
Token consensus
Token ID11529
Feature activation+5.20e-02

Pos logit contributions
 affair+0.032
cedented+0.032
angering+0.032
addon+0.031
Access+0.031
Neg logit contributions
 Sund-0.039
hest-0.038
 [&-0.036
 deterrent-0.034
 guarantees-0.034
Token a
Token ID257
Feature activation-3.84e-02

Pos logit contributions
 Sund+0.186
hest+0.182
 [&+0.175
 deterrent+0.164
+0.163
Neg logit contributions
 affair-0.146
angering-0.142
cedented-0.142
addon-0.142
Access-0.140
Token chance
Token ID2863
Feature activation-9.10e-02

Pos logit contributions
 Sund+0.099
hest+0.096
 [&+0.092
 deterrent+0.087
 guarantees+0.087
Neg logit contributions
 affair-0.077
cedented-0.076
angering-0.076
addon-0.075
issy-0.075
Token to
Token ID284
Feature activation+3.93e-03

Pos logit contributions
 Sund+0.215
hest+0.209
 [&+0.199
 deterrent+0.186
 guarantees+0.186
Neg logit contributions
 affair-0.202
cedented-0.200
angering-0.200
addon-0.198
issy-0.196
Token make
Token ID787
Feature activation+5.70e-04

Pos logit contributions
 affair+0.008
angering+0.008
addon+0.008
issy+0.008
cedented+0.008
Neg logit contributions
 Sund-0.010
hest-0.010
 [&-0.009
 deterrent-0.009
 guarantees-0.009
Token it
Token ID340
Feature activation-4.36e-02

Pos logit contributions
 affair+0.001
angering+0.001
cedented+0.001
addon+0.001
issy+0.001
Neg logit contributions
 Sund-0.001
hest-0.001
 [&-0.001
 deterrent-0.001
-0.001
Token in
Token ID287
Feature activation-1.49e-02

Pos logit contributions
 Sund+0.108
hest+0.105
 [&+0.101
 deterrent+0.094
 guarantees+0.094
Neg logit contributions
 affair-0.091
angering-0.090
cedented-0.090
addon-0.089
issy-0.088
Token the
Token ID262
Feature activation-5.18e-02

Pos logit contributions
 Sund+0.041
hest+0.040
 [&+0.038
 deterrent+0.036
 guarantees+0.036
Neg logit contributions
 affair-0.028
cedented-0.027
angering-0.027
addon-0.027
issy-0.027
Token NFL
Token ID5134
Feature activation-5.34e-02

Pos logit contributions
 Sund+0.128
hest+0.125
 [&+0.119
 deterrent+0.112
 guarantees+0.111
Neg logit contributions
 affair-0.110
angering-0.108
cedented-0.107
addon-0.106
issy-0.106
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
 Sund+0.132
hest+0.129
 [&+0.123
 deterrent+0.116
 guarantees+0.116
Neg logit contributions
 affair-0.112
cedented-0.111
angering-0.111
addon-0.109
issy-0.108
Token He
Token ID679
Feature activation-6.17e-02

Pos logit contributions
 Sund+0.037
hest+0.037
 [&+0.035
 deterrent+0.033
 guarantees+0.033
Neg logit contributions
 affair-0.029
cedented-0.028
angering-0.028
addon-0.028
Access-0.028
Token always
Token ID1464
Feature activation+1.63e-01

Pos logit contributions
 Sund+0.175
hest+0.171
 [&+0.164
 deterrent+0.156
+0.155
Neg logit contributions
 affair-0.108
cedented-0.106
angering-0.106
addon-0.105
issy-0.103
Token monster
Token ID9234
Feature activation+1.26e-01

Pos logit contributions
 affair+0.068
angering+0.067
cedented+0.067
addon+0.066
issy+0.065
Neg logit contributions
 Sund-0.105
hest-0.103
 [&-0.098
 deterrent-0.093
 guarantees-0.093
Token!
Token ID0
Feature activation+2.08e-02

Pos logit contributions
 affair+0.275
angering+0.270
addon+0.268
cedented+0.268
issy+0.266
Neg logit contributions
 Sund-0.300
hest-0.295
 [&-0.279
 deterrent-0.264
 guarantees-0.261
Token You
Token ID921
Feature activation-1.05e-02

Pos logit contributions
 affair+0.037
cedented+0.036
angering+0.036
addon+0.036
Access+0.036
Neg logit contributions
 Sund-0.058
hest-0.057
 [&-0.055
 deterrent-0.052
 guarantees-0.052
Token won
Token ID1839
Feature activation+1.06e-02

Pos logit contributions
 Sund+0.029
hest+0.028
 [&+0.027
 deterrent+0.025
 guarantees+0.025
Neg logit contributions
 affair-0.020
cedented-0.019
angering-0.019
addon-0.019
Access-0.019
Token
Token ID447
Feature activation+2.84e-02

Pos logit contributions
 affair+0.029
Access+0.029
cedented+0.029
addon+0.029
issy+0.029
Neg logit contributions
 Sund-0.019
hest-0.018
 [&-0.017
 deterrent-0.016
 guarantees-0.016
Token
Token ID247
Feature activation+4.62e-02

Pos logit contributions
 affair+0.061
cedented+0.060
addon+0.059
angering+0.059
Access+0.059
Neg logit contributions
 Sund-0.069
hest-0.067
 [&-0.064
 deterrent-0.060
 guarantees-0.060
Tokent
Token ID83
Feature activation+1.58e-01

Pos logit contributions
 affair+0.093
cedented+0.092
angering+0.092
addon+0.091
issy+0.090
Neg logit contributions
 Sund-0.118
hest-0.115
 [&-0.110
 deterrent-0.104
 guarantees-0.103
Token be
Token ID307
Feature activation+6.00e-02

Pos logit contributions
 affair+0.315
angering+0.310
cedented+0.309
addon+0.306
Access+0.304
Neg logit contributions
 Sund-0.407
hest-0.398
 [&-0.381
 deterrent-0.359
 guarantees-0.358
Token held
Token ID2714
Feature activation-7.70e-02

Pos logit contributions
 affair+0.075
cedented+0.074
angering+0.074
addon+0.073
quished+0.072
Neg logit contributions
 Sund-0.199
hest-0.195
 [&-0.189
 deterrent-0.181
 guarantees-0.180
Token back
Token ID736
Feature activation+2.39e-03

Pos logit contributions
 Sund+0.174
hest+0.170
 [&+0.162
 deterrent+0.151
 guarantees+0.150
Neg logit contributions
 affair-0.178
angering-0.176
cedented-0.175
addon-0.173
Access-0.172
Token by
Token ID416
Feature activation-1.66e-02

Pos logit contributions
 affair+0.005
angering+0.005
cedented+0.005
addon+0.005
issy+0.005
Neg logit contributions
 Sund-0.006
hest-0.005
 [&-0.005
 deterrent-0.005
 guarantees-0.005
Token saw
Token ID2497
Feature activation-1.47e-01

Pos logit contributions
 Sund+0.057
hest+0.055
 [&+0.053
 deterrent+0.050
 guarantees+0.050
Neg logit contributions
 affair-0.040
cedented-0.039
angering-0.039
addon-0.039
issy-0.038
Token a
Token ID257
Feature activation+2.50e-02

Pos logit contributions
 Sund+0.375
hest+0.365
 [&+0.349
 deterrent+0.329
 guarantees+0.328
Neg logit contributions
 affair-0.298
cedented-0.295
angering-0.294
addon-0.291
issy-0.288
Token crocod
Token ID37565
Feature activation+2.17e-01

Pos logit contributions
 affair+0.044
cedented+0.043
angering+0.043
addon+0.043
issy+0.042
Neg logit contributions
 Sund-0.071
hest-0.069
 [&-0.066
 deterrent-0.063
 guarantees-0.063
Tokenile
Token ID576
Feature activation+1.38e-01

Pos logit contributions
 affair+0.588
cedented+0.582
angering+0.581
addon+0.578
issy+0.575
Neg logit contributions
 Sund-0.406
hest-0.389
 [&-0.369
 guarantees-0.338
 deterrent-0.337
Token kill
Token ID1494
Feature activation+2.12e-01

Pos logit contributions
 affair+0.265
angering+0.260
cedented+0.259
addon+0.256
issy+0.253
Neg logit contributions
 Sund-0.366
hest-0.358
 [&-0.342
 guarantees-0.323
 deterrent-0.322
Token a
Token ID257
Feature activation+2.16e-02

Pos logit contributions
 affair+0.443
angering+0.438
cedented+0.435
addon+0.429
issy+0.427
Neg logit contributions
 Sund-0.527
hest-0.517
 [&-0.493
 deterrent-0.462
 guarantees-0.461
Token water
Token ID1660
Feature activation+1.24e-01

Pos logit contributions
 affair+0.039
cedented+0.038
angering+0.038
addon+0.038
quished+0.037
Neg logit contributions
 Sund-0.060
hest-0.058
 [&-0.056
 deterrent-0.053
 guarantees-0.053
Token buffalo
Token ID45019
Feature activation+5.66e-02

Pos logit contributions
 affair+0.255
angering+0.252
cedented+0.251
addon+0.248
issy+0.247
Neg logit contributions
 Sund-0.314
hest-0.306
 [&-0.292
 deterrent-0.274
 guarantees-0.274
Token this
Token ID428
Feature activation-2.17e-02

Pos logit contributions
 affair+0.116
angering+0.114
cedented+0.114
addon+0.113
issy+0.112
Neg logit contributions
 Sund-0.143
hest-0.140
 [&-0.134
 deterrent-0.126
 guarantees-0.125
Token summer
Token ID3931
Feature activation+5.63e-02

Pos logit contributions
 Sund+0.050
hest+0.049
 [&+0.047
 guarantees+0.044
 deterrent+0.044
Neg logit contributions
 affair-0.049
angering-0.048
cedented-0.048
addon-0.048
issy-0.047
Token.
Token ID13
Feature activation-7.60e-02

Pos logit contributions
 affair+0.120
cedented+0.119
angering+0.118
addon+0.117
Access+0.116
Neg logit contributions
 Sund-0.138
hest-0.134
 [&-0.128
 deterrent-0.120
 guarantees-0.120
Token
Token ID247
Feature activation+5.88e-02

Pos logit contributions
 affair+0.018
cedented+0.017
angering+0.017
addon+0.017
issy+0.017
Neg logit contributions
 Sund-0.022
hest-0.022
 [&-0.021
 deterrent-0.020
 guarantees-0.020
Tokens
Token ID82
Feature activation+2.59e-02

Pos logit contributions
 affair+0.124
cedented+0.122
angering+0.122
addon+0.121
Access+0.120
Neg logit contributions
 Sund-0.145
hest-0.141
 [&-0.135
 deterrent-0.127
 guarantees-0.126
Token important
Token ID1593
Feature activation+1.00e-01

Pos logit contributions
 affair+0.050
cedented+0.049
angering+0.049
addon+0.049
Access+0.048
Neg logit contributions
 Sund-0.068
hest-0.067
 [&-0.064
 deterrent-0.060
 guarantees-0.060
Token is
Token ID318
Feature activation-3.39e-02

Pos logit contributions
 affair+0.207
cedented+0.203
angering+0.203
addon+0.200
issy+0.200
Neg logit contributions
 Sund-0.252
hest-0.247
 [&-0.236
 deterrent-0.221
 guarantees-0.220
Token that
Token ID326
Feature activation-5.78e-02

Pos logit contributions
 Sund+0.087
hest+0.085
 [&+0.082
 deterrent+0.077
 guarantees+0.077
Neg logit contributions
 affair-0.068
cedented-0.067
angering-0.067
addon-0.066
Access-0.065
Token more
Token ID517
Feature activation-2.31e-03

Pos logit contributions
 Sund+0.142
hest+0.139
 [&+0.132
 deterrent+0.124
 guarantees+0.123
Neg logit contributions
 affair-0.123
angering-0.121
cedented-0.120
addon-0.119
issy-0.118
Token and
Token ID290
Feature activation-4.00e-02

Pos logit contributions
 Sund+0.006
hest+0.005
 [&+0.005
 deterrent+0.005
 guarantees+0.005
Neg logit contributions
 affair-0.005
cedented-0.005
angering-0.005
addon-0.005
Access-0.005
Token more
Token ID517
Feature activation-3.94e-02

Pos logit contributions
 Sund+0.095
hest+0.093
 [&+0.088
 deterrent+0.083
 guarantees+0.082
Neg logit contributions
 affair-0.088
angering-0.087
cedented-0.086
addon-0.085
Access-0.085
Token of
Token ID286
Feature activation-5.91e-02

Pos logit contributions
 Sund+0.096
hest+0.093
 [&+0.089
 deterrent+0.083
 guarantees+0.083
Neg logit contributions
 affair-0.085
cedented-0.083
angering-0.083
addon-0.082
issy-0.082
Token our
Token ID674
Feature activation-1.53e-02

Pos logit contributions
 Sund+0.151
hest+0.147
 [&+0.141
 deterrent+0.133
 guarantees+0.132
Neg logit contributions
 affair-0.120
cedented-0.119
angering-0.118
addon-0.117
issy-0.116
Token country
Token ID1499
Feature activation-1.86e-02

Pos logit contributions
 Sund+0.037
hest+0.036
 [&+0.034
 deterrent+0.032
+0.032
Neg logit contributions
 affair-0.033
angering-0.032
cedented-0.032
addon-0.032
Access-0.032
Token Tony
Token ID8832
Feature activation-2.21e-02

Pos logit contributions
 affair+0.449
cedented+0.447
angering+0.445
addon+0.440
quished+0.435
Neg logit contributions
 Sund-0.625
hest-0.608
 [&-0.582
 deterrent-0.550
 guarantees-0.550
Token Ferguson
Token ID15031
Feature activation-7.81e-03

Pos logit contributions
 Sund+0.047
hest+0.046
 [&+0.044
 deterrent+0.040
 guarantees+0.040
Neg logit contributions
 affair-0.054
cedented-0.053
angering-0.053
addon-0.053
issy-0.052
Token versus
Token ID9051
Feature activation-4.76e-02

Pos logit contributions
 Sund+0.020
hest+0.019
 [&+0.018
 deterrent+0.017
 guarantees+0.017
Neg logit contributions
 affair-0.016
cedented-0.016
angering-0.016
addon-0.016
quished-0.015
Token Kh
Token ID5311
Feature activation+2.62e-01

Pos logit contributions
 Sund+0.133
hest+0.130
 [&+0.125
 deterrent+0.118
 guarantees+0.118
Neg logit contributions
 affair-0.085
cedented-0.084
angering-0.084
addon-0.083
Access-0.082
Tokenab
Token ID397
Feature activation+6.74e-02

Pos logit contributions
 affair+0.480
addon+0.479
issy+0.474
angering+0.470
cedented+0.470
Neg logit contributions
 Sund-0.708
hest-0.694
 [&-0.669
 guarantees-0.638
 deterrent-0.631
Tokenib
Token ID571
Feature activation+1.92e-01

Pos logit contributions
 affair+0.149
addon+0.149
angering+0.147
issy+0.146
cedented+0.145
Neg logit contributions
 Sund-0.158
hest-0.154
 [&-0.148
 deterrent-0.138
 guarantees-0.138
Token Nur
Token ID49281
Feature activation+2.21e-01

Pos logit contributions
 affair+0.338
cedented+0.334
angering+0.333
addon+0.329
quished+0.325
Neg logit contributions
 Sund-0.540
hest-0.527
 [&-0.506
 deterrent-0.479
 guarantees-0.479
Tokenmag
Token ID19726
Feature activation+7.22e-02

Pos logit contributions
 affair+0.255
angering+0.246
issy+0.243
addon+0.242
cedented+0.242
Neg logit contributions
 Sund-0.757
hest-0.746
 [&-0.723
 deterrent-0.693
eks-0.686
Tokenomed
Token ID12657
Feature activation-9.46e-02

Pos logit contributions
 affair+0.212
angering+0.211
cedented+0.210
addon+0.209
issy+0.208
Neg logit contributions
 Sund-0.118
hest-0.113
 [&-0.106
 deterrent-0.096
 guarantees-0.095
Tokenov
Token ID709
Feature activation-1.63e-01

Pos logit contributions
 Sund+0.331
hest+0.325
 [&+0.315
 deterrent+0.302
 guarantees+0.301
Neg logit contributions
 affair-0.102
cedented-0.100
angering-0.100
addon-0.098
issy-0.096
Token
Token ID784
Feature activation-2.79e-02

Pos logit contributions
 Sund+0.410
hest+0.400
 [&+0.382
 deterrent+0.359
 guarantees+0.359
Neg logit contributions
 affair-0.337
cedented-0.333
angering-0.333
addon-0.329
issy-0.326
Token,
Token ID11
Feature activation-5.48e-02

Pos logit contributions
 Sund+0.599
hest+0.581
 [&+0.554
 guarantees+0.528
 deterrent+0.527
Neg logit contributions
 affair-0.456
cedented-0.454
angering-0.451
addon-0.450
issy-0.446
Token the
Token ID262
Feature activation+2.15e-02

Pos logit contributions
 Sund+0.142
hest+0.139
 [&+0.133
 deterrent+0.125
 guarantees+0.124
Neg logit contributions
 affair-0.109
angering-0.107
cedented-0.107
addon-0.106
Access-0.105
Token arrest
Token ID3251
Feature activation-2.84e-02

Pos logit contributions
 affair+0.042
angering+0.041
cedented+0.041
addon+0.041
Access+0.040
Neg logit contributions
 Sund-0.056
hest-0.055
 [&-0.053
-0.050
 deterrent-0.050
Token of
Token ID286
Feature activation-6.64e-03

Pos logit contributions
 Sund+0.074
hest+0.072
 [&+0.069
 deterrent+0.065
 guarantees+0.064
Neg logit contributions
 affair-0.057
angering-0.056
cedented-0.055
addon-0.055
quished-0.054
Token a
Token ID257
Feature activation+2.18e-02

Pos logit contributions
 Sund+0.017
hest+0.017
 [&+0.016
 deterrent+0.015
 guarantees+0.015
Neg logit contributions
 affair-0.013
angering-0.013
cedented-0.013
addon-0.013
issy-0.012
Token foreigner
Token ID44897
Feature activation-5.27e-02

Pos logit contributions
 affair+0.033
cedented+0.033
angering+0.033
addon+0.032
issy+0.032
Neg logit contributions
 Sund-0.067
hest-0.065
 [&-0.063
 deterrent-0.060
 guarantees-0.060
Token is
Token ID318
Feature activation+2.65e-02

Pos logit contributions
 Sund+0.136
hest+0.135
 [&+0.128
 deterrent+0.120
wal+0.119
Neg logit contributions
 affair-0.104
angering-0.103
cedented-0.102
addon-0.100
issy-0.099
Token never
Token ID1239
Feature activation+1.31e-01

Pos logit contributions
 affair+0.052
angering+0.051
cedented+0.051
addon+0.050
Access+0.049
Neg logit contributions
 Sund-0.070
hest-0.069
 [&-0.066
-0.062
 guarantees-0.062
Token done
Token ID1760
Feature activation+5.43e-02

Pos logit contributions
 affair+0.240
angering+0.234
cedented+0.233
addon+0.229
Access+0.228
Neg logit contributions
 Sund-0.360
hest-0.353
 [&-0.340
-0.319
eks-0.317
Token casually
Token ID27793
Feature activation+6.95e-02

Pos logit contributions
 affair+0.112
angering+0.109
cedented+0.109
addon+0.107
issy+0.107
Neg logit contributions
 Sund-0.137
hest-0.134
 [&-0.129
 guarantees-0.119
 deterrent-0.119
Token.
Token ID13
Feature activation+1.56e-03

Pos logit contributions
 affair+0.141
angering+0.139
cedented+0.137
addon+0.135
issy+0.134
Neg logit contributions
 Sund-0.178
hest-0.174
 [&-0.167
 deterrent-0.156
 guarantees-0.156
Token this
Token ID428
Feature activation+2.33e-02

Pos logit contributions
 affair+0.079
cedented+0.078
angering+0.077
addon+0.077
Access+0.076
Neg logit contributions
 Sund-0.137
hest-0.135
 [&-0.130
 deterrent-0.123
 guarantees-0.123
Token light
Token ID1657
Feature activation+8.99e-02

Pos logit contributions
 affair+0.058
cedented+0.057
angering+0.056
addon+0.056
Access+0.056
Neg logit contributions
 Sund-0.049
hest-0.048
 [&-0.045
wal-0.041
 guarantees-0.041
Token before
Token ID878
Feature activation+9.07e-02

Pos logit contributions
 affair+0.189
cedented+0.186
angering+0.186
addon+0.184
quished+0.182
Neg logit contributions
 Sund-0.222
hest-0.217
 [&-0.207
 deterrent-0.194
 guarantees-0.194
Token.
Token ID13
Feature activation+2.98e-02

Pos logit contributions
 affair+0.192
cedented+0.189
angering+0.189
addon+0.188
Access+0.186
Neg logit contributions
 Sund-0.222
hest-0.217
 [&-0.207
 deterrent-0.195
 guarantees-0.194
Token<|endoftext|>
Token ID50256
Feature activation+3.71e-02

Pos logit contributions
 affair+0.055
angering+0.054
cedented+0.054
addon+0.054
Access+0.053
Neg logit contributions
 Sund-0.081
hest-0.079
 [&-0.075
 deterrent-0.072
 guarantees-0.072
TokenThe
Token ID464
Feature activation-2.58e-02

Pos logit contributions
 affair+0.069
cedented+0.069
angering+0.069
addon+0.068
Access+0.068
Neg logit contributions
 Sund-0.100
hest-0.098
 [&-0.093
 deterrent-0.089
 guarantees-0.089
Token history
Token ID2106
Feature activation+1.16e-01

Pos logit contributions
 Sund+0.067
hest+0.066
 [&+0.063
 deterrent+0.059
 guarantees+0.059
Neg logit contributions
 affair-0.051
cedented-0.050
angering-0.050
addon-0.050
issy-0.049
Token of
Token ID286
Feature activation+1.79e-02

Pos logit contributions
 affair+0.236
cedented+0.235
angering+0.234
addon+0.232
issy+0.229
Neg logit contributions
 Sund-0.294
hest-0.286
 [&-0.273
 guarantees-0.258
 deterrent-0.258
Token batteries
Token ID13591
Feature activation+1.56e-01

Pos logit contributions
 affair+0.034
cedented+0.034
angering+0.034
addon+0.033
issy+0.033
Neg logit contributions
 Sund-0.048
hest-0.046
 [&-0.044
 deterrent-0.042
 guarantees-0.042
Token can
Token ID460
Feature activation-8.64e-03

Pos logit contributions
 affair+0.311
angering+0.303
cedented+0.302
addon+0.300
issy+0.297
Neg logit contributions
 Sund-0.402
hest-0.394
 [&-0.374
 deterrent-0.353
-0.350
Token be
Token ID307
Feature activation+2.91e-02

Pos logit contributions
 Sund+0.021
hest+0.020
 [&+0.019
 deterrent+0.018
 guarantees+0.018
Neg logit contributions
 affair-0.019
angering-0.019
cedented-0.018
addon-0.018
issy-0.018
Token different
Token ID1180
Feature activation+1.21e-01

Pos logit contributions
 Sund+0.051
hest+0.050
 [&+0.047
 guarantees+0.044
+0.044
Neg logit contributions
 affair-0.043
angering-0.042
cedented-0.042
addon-0.042
issy-0.041
Token workforce
Token ID16433
Feature activation-2.37e-01

Pos logit contributions
 affair+0.262
angering+0.251
cedented+0.249
addon+0.248
issy+0.246
Neg logit contributions
 Sund-0.294
hest-0.293
 [&-0.278
eks-0.258
 guarantees-0.258
Token,
Token ID11
Feature activation-6.37e-03

Pos logit contributions
 Sund+0.586
hest+0.576
 [&+0.546
 guarantees+0.511
 deterrent+0.508
Neg logit contributions
 affair-0.504
angering-0.492
cedented-0.488
addon-0.485
issy-0.480
Token one
Token ID530
Feature activation-7.51e-03

Pos logit contributions
 Sund+0.016
hest+0.016
 [&+0.015
 deterrent+0.014
 guarantees+0.014
Neg logit contributions
 affair-0.013
angering-0.013
cedented-0.013
addon-0.013
Access-0.013
Token that
Token ID326
Feature activation+2.18e-02

Pos logit contributions
 Sund+0.019
hest+0.019
 [&+0.018
 deterrent+0.017
 guarantees+0.017
Neg logit contributions
 affair-0.015
angering-0.015
cedented-0.015
addon-0.014
Access-0.014
Token values
Token ID3815
Feature activation-2.01e-02

Pos logit contributions
 affair+0.046
angering+0.045
cedented+0.045
addon+0.044
Access+0.044
Neg logit contributions
 Sund-0.053
hest-0.053
 [&-0.050
 deterrent-0.046
-0.046
Token efficacy
Token ID20179
Feature activation+9.37e-02

Pos logit contributions
 Sund+0.047
hest+0.046
 [&+0.044
+0.041
eks+0.041
Neg logit contributions
 affair-0.045
angering-0.044
cedented-0.044
addon-0.043
issy-0.043
Token,
Token ID11
Feature activation+9.69e-03

Pos logit contributions
 affair+0.192
angering+0.189
cedented+0.188
addon+0.186
Access+0.185
Neg logit contributions
 Sund-0.237
hest-0.232
 [&-0.221
 deterrent-0.206
eks-0.206
Token flexibility
Token ID13688
Feature activation+2.41e-01

Pos logit contributions
 affair+0.021
angering+0.021
cedented+0.020
addon+0.020
Access+0.020
Neg logit contributions
 Sund-0.023
hest-0.023
 [&-0.022
-0.020
wal-0.020
Token,
Token ID11
Feature activation+3.79e-02

Pos logit contributions
 affair+0.453
cedented+0.446
angering+0.446
addon+0.439
Access+0.435
Neg logit contributions
 Sund-0.649
hest-0.634
 [&-0.607
 deterrent-0.573
 guarantees-0.571
Token and
Token ID290
Feature activation+6.06e-02

Pos logit contributions
 affair+0.075
angering+0.074
cedented+0.074
addon+0.073
Access+0.072
Neg logit contributions
 Sund-0.098
hest-0.096
 [&-0.092
 deterrent-0.086
 guarantees-0.086
Token he
Token ID339
Feature activation-6.38e-02

Pos logit contributions
 Sund+0.043
hest+0.042
 [&+0.040
 deterrent+0.038
 guarantees+0.038
Neg logit contributions
 affair-0.032
cedented-0.031
angering-0.031
addon-0.031
Access-0.031
Token said
Token ID531
Feature activation-1.24e-01

Pos logit contributions
 Sund+0.189
hest+0.185
 [&+0.178
 guarantees+0.170
 deterrent+0.169
Neg logit contributions
 affair-0.102
cedented-0.101
angering-0.101
addon-0.099
Access-0.098
Token.
Token ID13
Feature activation-5.49e-02

Pos logit contributions
 Sund+0.284
hest+0.275
 [&+0.260
 guarantees+0.246
 deterrent+0.244
Neg logit contributions
cedented-0.282
 affair-0.282
angering-0.281
addon-0.278
quished-0.275
Token "
Token ID366
Feature activation-1.13e-01

Pos logit contributions
 Sund+0.168
hest+0.163
 [&+0.157
 guarantees+0.149
 deterrent+0.149
Neg logit contributions
 affair-0.084
cedented-0.083
angering-0.083
addon-0.082
Access-0.081
TokenYou
Token ID1639
Feature activation-1.74e-01

Pos logit contributions
 Sund+0.311
hest+0.303
 [&+0.290
 deterrent+0.280
 guarantees+0.277
Neg logit contributions
 affair-0.205
Access-0.204
angering-0.202
addon-0.201
cedented-0.201
Token have
Token ID423
Feature activation-2.08e-02

Pos logit contributions
 Sund+0.448
hest+0.436
 [&+0.418
 deterrent+0.393
 guarantees+0.392
Neg logit contributions
 affair-0.350
angering-0.344
cedented-0.343
addon-0.340
issy-0.337
Token to
Token ID284
Feature activation-2.17e-02

Pos logit contributions
 Sund+0.047
hest+0.047
 [&+0.044
+0.042
wal+0.041
Neg logit contributions
 affair-0.047
angering-0.046
addon-0.046
Access-0.046
cedented-0.046
Token pay
Token ID1414
Feature activation+1.08e-01

Pos logit contributions
 Sund+0.056
hest+0.055
 [&+0.052
 deterrent+0.049
+0.049
Neg logit contributions
 affair-0.044
angering-0.043
addon-0.042
issy-0.042
cedented-0.042
Token attention
Token ID3241
Feature activation+1.89e-01

Pos logit contributions
 affair+0.225
angering+0.220
addon+0.217
Access+0.216
cedented+0.216
Neg logit contributions
 Sund-0.266
hest-0.261
 [&-0.254
-0.237
 deterrent-0.235
Token to
Token ID284
Feature activation-7.98e-02

Pos logit contributions
 affair+0.440
angering+0.433
cedented+0.431
addon+0.429
issy+0.427
Neg logit contributions
 Sund-0.423
hest-0.414
 [&-0.394
 deterrent-0.365
 guarantees-0.364
Token all
Token ID477
Feature activation-1.07e-01

Pos logit contributions
 Sund+0.190
hest+0.188
 [&+0.178
+0.167
 deterrent+0.167
Neg logit contributions
 affair-0.173
angering-0.171
addon-0.169
cedented-0.168
issy-0.167
Token Metroid
Token ID45430
Feature activation+2.30e-01

Pos logit contributions
 affair+0.112
cedented+0.111
angering+0.111
addon+0.110
issy+0.110
Neg logit contributions
 Sund-0.086
hest-0.083
 [&-0.079
 deterrent-0.073
 guarantees-0.073
Token speed
Token ID2866
Feature activation-1.62e-01

Pos logit contributions
 affair+0.461
cedented+0.451
angering+0.450
addon+0.446
Access+0.442
Neg logit contributions
 Sund-0.594
hest-0.582
 [&-0.557
 deterrent-0.523
 guarantees-0.522
Tokenrun
Token ID5143
Feature activation-1.00e-01

Pos logit contributions
 Sund+0.443
hest+0.433
 [&+0.415
 deterrent+0.393
 guarantees+0.392
Neg logit contributions
 affair-0.299
cedented-0.296
angering-0.296
addon-0.292
Access-0.289
Token race
Token ID3234
Feature activation-1.12e-01

Pos logit contributions
 Sund+0.255
hest+0.250
 [&+0.238
 deterrent+0.224
+0.223
Neg logit contributions
 affair-0.205
cedented-0.199
angering-0.199
addon-0.197
quished-0.196
Token,
Token ID11
Feature activation-5.52e-02

Pos logit contributions
 Sund+0.283
hest+0.277
 [&+0.264
 deterrent+0.248
+0.246
Neg logit contributions
 affair-0.230
cedented-0.223
angering-0.223
addon-0.221
Access-0.220
Token from
Token ID422
Feature activation-6.85e-02

Pos logit contributions
 Sund+0.139
hest+0.138
 [&+0.131
 deterrent+0.123
+0.123
Neg logit contributions
 affair-0.113
angering-0.110
cedented-0.109
addon-0.109
Access-0.108
Token Awesome
Token ID25020
Feature activation+1.51e-02

Pos logit contributions
 Sund+0.186
hest+0.183
 [&+0.175
 deterrent+0.166
 guarantees+0.165
Neg logit contributions
 affair-0.127
angering-0.125
cedented-0.124
addon-0.123
issy-0.122
Token Games
Token ID5776
Feature activation+2.08e-02

Pos logit contributions
 affair+0.029
cedented+0.028
angering+0.028
addon+0.028
Access+0.028
Neg logit contributions
 Sund-0.040
hest-0.039
 [&-0.038
 deterrent-0.036
 guarantees-0.035
Token Done
Token ID24429
Feature activation-7.45e-02

Pos logit contributions
 affair+0.043
cedented+0.042
angering+0.042
addon+0.042
Access+0.042
Neg logit contributions
 Sund-0.052
hest-0.051
 [&-0.049
 deterrent-0.046
 guarantees-0.046
Token Quick
Token ID12029
Feature activation-2.81e-02

Pos logit contributions
 Sund+0.193
hest+0.188
 [&+0.181
 deterrent+0.171
 guarantees+0.170
Neg logit contributions
 affair-0.148
cedented-0.146
angering-0.145
addon-0.144
Access-0.143
Token 2014
Token ID1946
Feature activation-6.14e-02

Pos logit contributions
 Sund+0.070
hest+0.069
 [&+0.066
 deterrent+0.062
 guarantees+0.062
Neg logit contributions
 affair-0.058
angering-0.056
cedented-0.056
addon-0.056
Access-0.056
Token That
Token ID1320
Feature activation-1.08e-01

Pos logit contributions
 Sund+0.286
hest+0.278
 [&+0.261
 deterrent+0.242
 guarantees+0.241
Neg logit contributions
 affair-0.389
cedented-0.385
angering-0.385
addon-0.381
Access-0.380
Token is
Token ID318
Feature activation-6.91e-02

Pos logit contributions
 Sund+0.266
hest+0.262
 [&+0.248
 deterrent+0.231
 guarantees+0.229
Neg logit contributions
 affair-0.227
angering-0.223
cedented-0.222
addon-0.220
Access-0.219
Token a
Token ID257
Feature activation-3.91e-02

Pos logit contributions
 Sund+0.178
hest+0.174
 [&+0.166
 guarantees+0.155
 deterrent+0.155
Neg logit contributions
 affair-0.138
angering-0.137
cedented-0.136
addon-0.135
Access-0.134
Token very
Token ID845
Feature activation+4.68e-02

Pos logit contributions
 Sund+0.100
hest+0.097
 [&+0.093
 guarantees+0.087
 deterrent+0.087
Neg logit contributions
 affair-0.079
angering-0.078
cedented-0.078
addon-0.077
issy-0.077
Token dangerous
Token ID4923
Feature activation-8.39e-02

Pos logit contributions
 affair+0.086
angering+0.084
cedented+0.084
addon+0.083
issy+0.082
Neg logit contributions
 Sund-0.128
hest-0.126
 [&-0.121
 guarantees-0.114
 deterrent-0.113
Token precedent
Token ID19719
Feature activation-3.44e-01

Pos logit contributions
 Sund+0.207
hest+0.203
 [&+0.194
 guarantees+0.181
 deterrent+0.179
Neg logit contributions
 affair-0.178
angering-0.174
cedented-0.173
addon-0.172
issy-0.170
Token.
Token ID13
Feature activation-6.49e-02

Pos logit contributions
 Sund+0.849
hest+0.825
 [&+0.787
 deterrent+0.741
 guarantees+0.740
Neg logit contributions
 affair-0.728
cedented-0.721
angering-0.719
addon-0.712
quished-0.705
Token
Token ID198
Feature activation-3.87e-02

Pos logit contributions
 Sund+0.149
hest+0.146
 [&+0.138
 deterrent+0.131
 guarantees+0.130
Neg logit contributions
 affair-0.147
angering-0.145
cedented-0.144
Access-0.143
addon-0.143
Token
Token ID198
Feature activation-3.29e-02

Pos logit contributions
 Sund+0.107
hest+0.105
 [&+0.100
 deterrent+0.097
 guarantees+0.095
Neg logit contributions
 affair-0.069
cedented-0.068
Access-0.067
angering-0.067
addon-0.067
Token
Token ID447
Feature activation+1.42e-02

Pos logit contributions
 Sund+0.087
hest+0.085
 [&+0.081
 deterrent+0.077
 guarantees+0.077
Neg logit contributions
 affair-0.063
cedented-0.063
angering-0.062
Access-0.062
addon-0.062
Token
Token ID250
Feature activation+2.34e-02

Pos logit contributions
 affair+0.027
cedented+0.027
addon+0.027
angering+0.027
Access+0.027
Neg logit contributions
 Sund-0.038
hest-0.037
 [&-0.035
 deterrent-0.033
 guarantees-0.033
Tokenwww
Token ID2503
Feature activation-1.57e-01

Pos logit contributions
 Sund+0.182
hest+0.176
 [&+0.168
 deterrent+0.158
 guarantees+0.157
Neg logit contributions
 affair-0.170
cedented-0.169
angering-0.169
addon-0.167
Access-0.166
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
 Sund+0.387
hest+0.374
 [&+0.356
 deterrent+0.345
 guarantees+0.337
Neg logit contributions
 affair-0.327
Access-0.323
cedented-0.323
addon-0.322
angering-0.321
Tokened
Token ID276
Feature activation-2.38e-01

Pos logit contributions
 Sund+0.369
hest+0.356
 [&+0.341
 deterrent+0.320
 guarantees+0.319
Neg logit contributions
 affair-0.352
cedented-0.349
angering-0.349
addon-0.346
issy-0.342
Tokenwards
Token ID2017
Feature activation-1.29e-01

Pos logit contributions
 Sund+0.427
hest+0.409
 [&+0.386
 deterrent+0.353
 guarantees+0.351
Neg logit contributions
 affair-0.665
angering-0.661
cedented-0.660
addon-0.655
issy-0.650
Token.
Token ID13
Feature activation-1.67e-01

Pos logit contributions
 Sund+0.376
hest+0.367
 [&+0.353
 deterrent+0.335
 guarantees+0.334
Neg logit contributions
 affair-0.217
cedented-0.214
angering-0.213
addon-0.211
issy-0.208
Tokenaf
Token ID1878
Feature activation-4.04e-01

Pos logit contributions
 Sund+0.374
hest+0.363
 [&+0.345
 deterrent+0.322
 guarantees+0.321
Neg logit contributions
 affair-0.391
cedented-0.388
angering-0.387
addon-0.384
issy-0.380
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
 Sund+0.978
hest+0.950
 [&+0.907
 deterrent+0.850
 guarantees+0.849
Neg logit contributions
 affair-0.873
cedented-0.864
angering-0.864
addon-0.854
issy-0.846
Tokenmil
Token ID25433
Feature activation-2.49e-01

Pos logit contributions
 Sund+0.279
hest+0.270
 [&+0.255
 deterrent+0.236
 guarantees+0.235
Neg logit contributions
 affair-0.357
cedented-0.354
angering-0.354
addon-0.351
Access-0.347
Token/
Token ID14
Feature activation+6.82e-03

Pos logit contributions
 Sund+0.555
hest+0.539
 [&+0.511
 deterrent+0.477
 guarantees+0.476
Neg logit contributions
 affair-0.586
cedented-0.580
angering-0.579
addon-0.573
Access-0.569
Tokenhistory
Token ID23569
Feature activation-1.70e-01

Pos logit contributions
 affair+0.014
cedented+0.014
angering+0.014
addon+0.013
Access+0.013
Neg logit contributions
 Sund-0.017
hest-0.017
 [&-0.016
 deterrent-0.015
 guarantees-0.015
Token/
Token ID14
Feature activation+4.61e-02

Pos logit contributions
 Sund+0.380
hest+0.369
 [&+0.351
 deterrent+0.327
 guarantees+0.326
Neg logit contributions
 affair-0.398
cedented-0.395
angering-0.393
addon-0.389
issy-0.386
Token status
Token ID3722
Feature activation-7.03e-02

Pos logit contributions
 affair+0.005
angering+0.005
cedented+0.005
addon+0.005
Access+0.005
Neg logit contributions
 Sund-0.006
hest-0.006
 [&-0.005
 deterrent-0.005
 guarantees-0.005
Token on
Token ID319
Feature activation+1.01e-01

Pos logit contributions
 Sund+0.180
hest+0.176
 [&+0.168
 deterrent+0.158
 guarantees+0.158
Neg logit contributions
 affair-0.142
cedented-0.140
angering-0.140
addon-0.138
Access-0.137
Token each
Token ID1123
Feature activation+2.07e-02

Pos logit contributions
 affair+0.190
angering+0.186
cedented+0.185
addon+0.184
Access+0.183
Neg logit contributions
 Sund-0.271
hest-0.266
 [&-0.254
 deterrent-0.241
 guarantees-0.240
Token reboot
Token ID20149
Feature activation+1.12e-02

Pos logit contributions
 affair+0.042
angering+0.042
cedented+0.042
addon+0.041
Access+0.041
Neg logit contributions
 Sund-0.053
hest-0.051
 [&-0.049
 deterrent-0.046
 guarantees-0.046
Token,
Token ID11
Feature activation+6.32e-02

Pos logit contributions
 affair+0.021
angering+0.021
cedented+0.021
addon+0.021
Access+0.021
Neg logit contributions
 Sund-0.030
hest-0.029
 [&-0.028
 deterrent-0.026
 guarantees-0.026
Token unlike
Token ID5023
Feature activation-3.59e-01

Pos logit contributions
 affair+0.132
angering+0.130
cedented+0.129
addon+0.128
Access+0.127
Neg logit contributions
 Sund-0.157
hest-0.155
 [&-0.146
 deterrent-0.137
-0.136
Token most
Token ID749
Feature activation+1.33e-02

Pos logit contributions
 Sund+0.899
hest+0.877
 [&+0.837
 deterrent+0.788
 guarantees+0.786
Neg logit contributions
 affair-0.744
cedented-0.734
angering-0.734
addon-0.726
Access-0.718
Token of
Token ID286
Feature activation-6.48e-02

Pos logit contributions
 affair+0.031
angering+0.030
cedented+0.030
addon+0.030
Access+0.029
Neg logit contributions
 Sund-0.030
hest-0.030
 [&-0.028
 deterrent-0.026
-0.026
Token other
Token ID584
Feature activation+1.47e-01

Pos logit contributions
 Sund+0.178
hest+0.175
 [&+0.167
 deterrent+0.158
 guarantees+0.158
Neg logit contributions
 affair-0.118
cedented-0.116
angering-0.116
addon-0.115
quished-0.113
Token OD
Token ID31245
Feature activation-1.25e-01

Pos logit contributions
 affair+0.328
angering+0.321
cedented+0.320
addon+0.318
Access+0.316
Neg logit contributions
 Sund-0.345
hest-0.340
 [&-0.321
 deterrent-0.299
-0.298
TokenIN
Token ID1268
Feature activation+1.15e-01

Pos logit contributions
 Sund+0.312
hest+0.302
 [&+0.289
 deterrent+0.273
 guarantees+0.273
Neg logit contributions
 affair-0.263
cedented-0.261
angering-0.260
addon-0.257
quished-0.255
Token Finance
Token ID15007
Feature activation+6.34e-03

Pos logit contributions
 affair+0.007
cedented+0.007
angering+0.007
addon+0.007
Access+0.007
Neg logit contributions
 Sund-0.009
hest-0.009
 [&-0.008
 deterrent-0.008
 guarantees-0.008
Token Minister
Token ID4139
Feature activation-3.86e-01

Pos logit contributions
angering+0.012
 affair+0.012
cedented+0.012
addon+0.012
issy+0.012
Neg logit contributions
 Sund-0.017
hest-0.016
 [&-0.016
 guarantees-0.015
 deterrent-0.015
Token filed
Token ID5717
Feature activation+1.47e-01

Pos logit contributions
 Sund+1.071
hest+1.041
 [&+1.000
 guarantees+0.948
 deterrent+0.946
Neg logit contributions
 affair-0.696
cedented-0.692
angering-0.690
addon-0.680
quished-0.673
Token with
Token ID351
Feature activation-2.76e-02

Pos logit contributions
 affair+0.305
angering+0.299
cedented+0.298
Access+0.294
addon+0.294
Neg logit contributions
 Sund-0.367
hest-0.359
 [&-0.342
 deterrent-0.322
 guarantees-0.320
Token the
Token ID262
Feature activation-3.81e-02

Pos logit contributions
 Sund+0.077
hest+0.075
 [&+0.072
 deterrent+0.069
 guarantees+0.068
Neg logit contributions
 affair-0.049
cedented-0.049
angering-0.049
addon-0.048
issy-0.047
Token federal
Token ID2717
Feature activation-3.91e-01

Pos logit contributions
 Sund+0.100
hest+0.098
 [&+0.094
 deterrent+0.089
 guarantees+0.088
Neg logit contributions
 affair-0.074
cedented-0.073
angering-0.073
addon-0.072
Access-0.071
Token Ethics
Token ID25092
Feature activation+2.39e-03

Pos logit contributions
 Sund+0.971
hest+0.943
 [&+0.899
 deterrent+0.848
 guarantees+0.848
Neg logit contributions
 affair-0.820
cedented-0.817
angering-0.815
addon-0.807
quished-0.797
Token Commissioner
Token ID13270
Feature activation-2.06e-03

Pos logit contributions
 affair+0.005
cedented+0.005
angering+0.005
addon+0.005
issy+0.005
Neg logit contributions
 Sund-0.006
hest-0.006
 [&-0.006
 deterrent-0.006
 guarantees-0.006
Token's
Token ID338
Feature activation+1.03e-03

Pos logit contributions
 Sund+0.005
hest+0.005
 [&+0.005
 deterrent+0.005
+0.005
Neg logit contributions
 affair-0.004
angering-0.004
cedented-0.004
Access-0.004
addon-0.004
Token office
Token ID2607
Feature activation+4.64e-02

Pos logit contributions
 affair+0.002
angering+0.002
cedented+0.002
addon+0.002
Access+0.002
Neg logit contributions
 Sund-0.002
hest-0.002
 [&-0.002
 deterrent-0.002
 guarantees-0.002
Token say
Token ID910
Feature activation-1.06e-01

Pos logit contributions
 affair+0.090
angering+0.088
cedented+0.088
Access+0.086
addon+0.086
Neg logit contributions
 Sund-0.123
hest-0.120
 [&-0.115
 deterrent-0.108
-0.108
Token my
Token ID616
Feature activation+3.97e-02

Pos logit contributions
 affair+0.123
cedented+0.121
angering+0.121
addon+0.119
Access+0.119
Neg logit contributions
 Sund-0.193
hest-0.190
 [&-0.182
 deterrent-0.173
 guarantees-0.172
Token kitchen
Token ID9592
Feature activation+3.64e-02

Pos logit contributions
 affair+0.085
addon+0.083
angering+0.083
cedented+0.083
issy+0.083
Neg logit contributions
 Sund-0.095
hest-0.093
 [&-0.089
 deterrent-0.084
 guarantees-0.083
Token essentials
Token ID41954
Feature activation-2.00e-01

Pos logit contributions
 affair+0.074
angering+0.072
cedented+0.072
addon+0.072
issy+0.071
Neg logit contributions
 Sund-0.093
hest-0.091
 [&-0.087
 deterrent-0.082
 guarantees-0.082
Token!
Token ID0
Feature activation-2.32e-02

Pos logit contributions
 Sund+0.525
hest+0.513
 [&+0.491
 deterrent+0.464
 guarantees+0.462
Neg logit contributions
 affair-0.392
angering-0.385
cedented-0.384
addon-0.382
issy-0.378
Token<|endoftext|>
Token ID50256
Feature activation+8.74e-02

Pos logit contributions
 Sund+0.062
hest+0.061
 [&+0.058
 deterrent+0.055
 guarantees+0.055
Neg logit contributions
 affair-0.044
cedented-0.043
angering-0.043
addon-0.043
Access-0.042
TokenPosted
Token ID14231
Feature activation-3.68e-01

Pos logit contributions
 affair+0.163
cedented+0.161
angering+0.161
addon+0.160
Access+0.160
Neg logit contributions
 Sund-0.236
hest-0.230
 [&-0.220
 deterrent-0.210
 guarantees-0.209
Token on
Token ID319
Feature activation+2.82e-03

Pos logit contributions
 Sund+0.945
hest+0.921
 [&+0.880
 deterrent+0.830
 guarantees+0.828
Neg logit contributions
 affair-0.742
cedented-0.733
angering-0.732
addon-0.724
issy-0.716
Token by
Token ID416
Feature activation-4.35e-02

Pos logit contributions
 affair+0.006
angering+0.006
cedented+0.006
Access+0.006
addon+0.006
Neg logit contributions
 Sund-0.007
hest-0.007
 [&-0.006
 deterrent-0.006
 guarantees-0.006
Token |
Token ID930
Feature activation+8.24e-02

Pos logit contributions
 Sund+0.115
hest+0.113
 [&+0.108
 deterrent+0.102
 guarantees+0.102
Neg logit contributions
 affair-0.084
cedented-0.082
angering-0.082
addon-0.081
issy-0.081
Token in
Token ID287
Feature activation-2.31e-02

Pos logit contributions
 affair+0.155
cedented+0.151
angering+0.151
addon+0.150
Access+0.149
Neg logit contributions
 Sund-0.222
hest-0.218
 [&-0.208
 deterrent-0.198
 guarantees-0.197
Token
Token ID198
Feature activation-7.99e-03

Pos logit contributions
 Sund+0.061
hest+0.059
 [&+0.057
 deterrent+0.053
 guarantees+0.053
Neg logit contributions
 affair-0.045
cedented-0.044
angering-0.044
addon-0.044
issy-0.043