TOP ACTIVATIONS
MAX = 10.329

! The town spe ople were sad and scared
. He asked the town spe ople what was wrong and
. The town spe ople were sad because they
. The town spe ople decided to ask their
So , the town spe ople looked for Old Tom
rope bouncing . The town spe ople gathered around to watch
. The town spe ople soon discovered where the
The town spe ople thought and thought until
had to help the town spe ople . He quickly formulated
. The town spe ople worked together to help
bell stopped . The town spe ople cheered and hugged each
. The town spe ople were relieved . They
! The town spe ople thanked the hero for
it sang , the town spe ople would act nice .
. The town spe ople were amazed by this
happy and the nation pro spe red . <|endoftext|> Once upon
. The town spe ople were so happy they
shine through . The town spe ople were very happy and
â <|endoftext|> The town spe ople were walking quickly down
safe again . The town spe ople thanked the bear many

MIN ACTIVATIONS
MIN = -8.338

treats !" St rip ran away with the stolen
Mama said , " St rip , you naughty k itty
Mary stopped . " St rip . Hot !"
It was an incredible peac ock ! Lily was so happy
The peac ock was now even more gorgeous
feathers . But the peac ock was very impressive ! He
dream called a " night mare ". She woke up crying
It was poison he ml ock . It was very dangerous
When the peac ock heard his partner â
to decor ate the peac ock â s tail
mess . St rip es didn 't listen .
look at in the padd ock . The farmer smiled and
tro tted around the padd ock . When it
Kn ock , knock , I 'm
the merry - go - round . They laughed and smiled
the sky , the peac ock would ring out his beautiful
stuck in a small padd ock , far away from the
a delicate cup and sa ucer . " Yes , please
the gate to the padd ock and put the tiny hole
Katie put on her sm ock and went outside . She

INTERVAL 5.662 - 10.329
CONTAINS 0.034%

about me ," he said sadly . Mom
the rain , and the de w . You can smell
box and share the ch ocol ates . They are happy
high they were just tiny spec ks in the sky .
little boy thanked the shop keeper and ran out the door

INTERVAL 0.995 - 5.662
CONTAINS 13.161%

. <|endoftext|> Once upon a time , there was a fat
The friends happily said yes , and they found a
have a big box of cars , dolls , balls and
. <|endoftext|> Once upon a time , there was a little
. They say , " Do you want to play with

INTERVAL -3.671 - 0.995
CONTAINS 86.460%

said , " No , I still don 't want to
to put on her gear , climb to the top of
. She loved to play outside with her friends and neighbors
Ros ie replied , " I like this road too ,
upset and said , " That 's not fair ! I

INTERVAL -8.338 - -3.671
CONTAINS 0.344%

, it spun a coc oon and stayed inside for a
reach . She was unhappy because she couldn 't get it
to explore and her fear turned to joy . She ran
said okay . As they were reading , Tim
but he couldn 't move because he was too scared .
Token!
Token ID0
Feature activation-1.07e+00

Pos logit contributions
vered+0.036
 aston+0.034
ement+0.034
 symp+0.033
 fascination+0.032
Neg logit contributions
oto-0.013
!"-0.011
lic-0.010
round-0.010
ogo-0.010
Token
Token ID198
Feature activation+4.83e-02

Pos logit contributions
lic+0.077
oto+0.068
ibi+0.068
EEP+0.067
round+0.062
Neg logit contributions
vered-0.121
ement-0.112
 spe-0.111
 symp-0.105
 aston-0.103
Token
Token ID198
Feature activation+7.85e-02

Pos logit contributions
vered+0.005
ement+0.005
 spe+0.005
 symp+0.005
 aston+0.005
Neg logit contributions
lic-0.004
oto-0.004
ogo-0.003
ibi-0.003
EEP-0.003
TokenThe
Token ID464
Feature activation+2.07e-01

Pos logit contributions
vered+0.008
ement+0.008
 spe+0.007
 symp+0.007
 aston+0.007
Neg logit contributions
oto-0.006
lic-0.006
ogo-0.006
ibi-0.006
EEP-0.006
Token town
Token ID3240
Feature activation+1.69e+00

Pos logit contributions
vered+0.026
ement+0.025
 symp+0.023
 spe+0.023
ingly+0.022
Neg logit contributions
oto-0.012
lic-0.011
ogo-0.010
ye-0.009
una-0.009
Tokenspe
Token ID4125
Feature activation+1.03e+01

Pos logit contributions
vered+0.186
 spe+0.182
ement+0.182
 symp+0.180
 aston+0.176
Neg logit contributions
lic-0.106
oto-0.089
round-0.086
ibi-0.083
oys-0.082
Tokenople
Token ID643
Feature activation+2.16e+00

Pos logit contributions
 spe+0.901
 symp+0.868
 aston+0.799
vered+0.774
ement+0.768
Neg logit contributions
round-0.907
EEP-0.854
oys-0.809
oto-0.802
lic-0.792
Token were
Token ID547
Feature activation+8.86e-01

Pos logit contributions
ement+0.227
vered+0.225
 spe+0.220
 symp+0.211
 aston+0.206
Neg logit contributions
lic-0.152
oto-0.132
round-0.122
ogo-0.120
EEP-0.117
Token sad
Token ID6507
Feature activation+3.13e+00

Pos logit contributions
vered+0.101
ement+0.096
 symp+0.093
+0.090
 spe+0.089
Neg logit contributions
lic-0.059
oto-0.052
ogo-0.048
ibi-0.044
round-0.042
Token and
Token ID290
Feature activation-1.20e-01

Pos logit contributions
vered+0.386
 spe+0.353
ement+0.353
 aston+0.347
 symp+0.342
Neg logit contributions
oto-0.178
lic-0.171
oys-0.154
ogo-0.152
EEP-0.146
Token scared
Token ID12008
Feature activation+3.58e-01

Pos logit contributions
lic+0.009
oto+0.009
ogo+0.008
ibi+0.008
oys+0.008
Neg logit contributions
vered-0.012
ement-0.012
 symp-0.011
 spe-0.011
 aston-0.010
Token.
Token ID13
Feature activation+4.80e-02

Pos logit contributions
vered+0.029
 spe+0.027
ement+0.027
 symp+0.026
 aston+0.026
Neg logit contributions
lic-0.014
oto-0.013
oys-0.012
ock-0.011
EEP-0.011
Token He
Token ID679
Feature activation-2.18e-01

Pos logit contributions
vered+0.006
 spe+0.006
ement+0.006
 symp+0.005
 aston+0.005
Neg logit contributions
lic-0.003
oto-0.002
EEP-0.002
ogo-0.002
ibi-0.002
Token asked
Token ID1965
Feature activation+1.45e+00

Pos logit contributions
lic+0.014
oto+0.013
ogo+0.012
ibi+0.011
EEP+0.011
Neg logit contributions
vered-0.025
ement-0.025
 spe-0.024
 symp-0.023
 aston-0.022
Token the
Token ID262
Feature activation+1.53e-01

Pos logit contributions
vered+0.175
ement+0.172
 spe+0.170
 symp+0.163
 aston+0.159
Neg logit contributions
oto-0.086
lic-0.078
ogo-0.074
ibi-0.065
oys-0.065
Token town
Token ID3240
Feature activation+1.63e+00

Pos logit contributions
vered+0.018
ement+0.017
 symp+0.017
 spe+0.016
 aston+0.016
Neg logit contributions
lic-0.010
oto-0.010
ogo-0.009
EEP-0.008
 robber-0.008
Tokenspe
Token ID4125
Feature activation+1.03e+01

Pos logit contributions
vered+0.185
ement+0.178
 spe+0.176
 symp+0.173
 aston+0.165
Neg logit contributions
oto-0.108
lic-0.106
ogo-0.096
oys-0.092
round-0.092
Tokenople
Token ID643
Feature activation+2.10e+00

Pos logit contributions
 spe+0.993
+0.907
 symp+0.892
 aston+0.849
 blonde+0.846
Neg logit contributions
round-0.874
oys-0.781
EEP-0.757
oto-0.732
ock-0.723
Token what
Token ID644
Feature activation-1.71e+00

Pos logit contributions
vered+0.252
 spe+0.248
ement+0.246
 symp+0.233
 aston+0.228
Neg logit contributions
oto-0.124
lic-0.116
ogo-0.110
round-0.099
oys-0.096
Token was
Token ID373
Feature activation+8.17e-01

Pos logit contributions
oto+0.081
lic+0.076
ogo+0.058
ibi+0.058
una+0.056
Neg logit contributions
vered-0.223
ement-0.222
 symp-0.222
ingly-0.205
 spe-0.204
Token wrong
Token ID2642
Feature activation+1.27e+00

Pos logit contributions
vered+0.082
ement+0.077
 spe+0.074
 symp+0.074
 aston+0.071
Neg logit contributions
oto-0.067
lic-0.066
ogo-0.062
ibi-0.059
EEP-0.059
Token and
Token ID290
Feature activation-3.06e-01

Pos logit contributions
vered+0.145
 spe+0.142
ement+0.139
 symp+0.136
 aston+0.131
Neg logit contributions
oto-0.081
lic-0.074
oys-0.072
ogo-0.068
EEP-0.065
Token.
Token ID13
Feature activation+2.12e-02

Pos logit contributions
vered+0.301
ement+0.289
 symp+0.282
 spe+0.280
 aston+0.276
Neg logit contributions
oto-0.204
lic-0.197
ogo-0.185
EEP-0.178
round-0.176
Token
Token ID198
Feature activation-3.99e-01

Pos logit contributions
vered+0.002
 spe+0.002
ement+0.002
 symp+0.002
 aston+0.002
Neg logit contributions
lic-0.002
oto-0.001
EEP-0.001
round-0.001
ibi-0.001
Token
Token ID198
Feature activation-3.66e-01

Pos logit contributions
oto+0.031
lic+0.031
ogo+0.029
ibi+0.027
EEP+0.027
Neg logit contributions
vered-0.041
ement-0.039
 spe-0.038
 symp-0.038
 aston-0.036
TokenThe
Token ID464
Feature activation+1.08e-01

Pos logit contributions
oto+0.033
lic+0.032
ogo+0.031
ibi+0.029
EEP+0.029
Neg logit contributions
vered-0.034
ement-0.032
 symp-0.031
 spe-0.030
 aston-0.029
Token town
Token ID3240
Feature activation+1.62e+00

Pos logit contributions
vered+0.012
ement+0.012
 symp+0.011
 spe+0.011
 aston+0.011
Neg logit contributions
oto-0.007
lic-0.007
ogo-0.006
EEP-0.006
ibi-0.006
Tokenspe
Token ID4125
Feature activation+1.03e+01

Pos logit contributions
vered+0.183
 spe+0.179
ement+0.177
 symp+0.174
 aston+0.171
Neg logit contributions
lic-0.101
oto-0.092
round-0.089
EEP-0.085
ibi-0.081
Tokenople
Token ID643
Feature activation+2.03e+00

Pos logit contributions
 spe+0.765
 symp+0.755
vered+0.738
ement+0.719
 aston+0.697
Neg logit contributions
oto-0.971
round-0.969
EEP-0.962
lic-0.949
oys-0.918
Token were
Token ID547
Feature activation+8.08e-01

Pos logit contributions
 spe+0.227
ement+0.224
vered+0.222
 symp+0.212
 aston+0.206
Neg logit contributions
lic-0.128
oto-0.118
round-0.106
EEP-0.105
ogo-0.100
Token sad
Token ID6507
Feature activation+3.20e+00

Pos logit contributions
vered+0.086
ement+0.081
 symp+0.079
 spe+0.077
ingly+0.073
Neg logit contributions
lic-0.059
oto-0.056
ogo-0.050
ibi-0.047
round-0.047
Token because
Token ID780
Feature activation-4.35e+00

Pos logit contributions
vered+0.357
ement+0.334
 spe+0.329
 symp+0.322
 aston+0.317
Neg logit contributions
oto-0.225
lic-0.218
ogo-0.203
EEP-0.195
ibi-0.191
Token they
Token ID484
Feature activation-4.68e-01

Pos logit contributions
oto+0.192
lic+0.164
oys+0.161
round+0.141
ibi+0.139
Neg logit contributions
vered-0.582
 symp-0.548
ement-0.533
 spe-0.513
pen-0.497
Token.
Token ID13
Feature activation+4.77e-02

Pos logit contributions
vered+0.036
 symp+0.035
ement+0.035
 spe+0.035
 aston+0.034
Neg logit contributions
oto-0.018
lic-0.017
ogo-0.016
oys-0.015
EEP-0.015
Token
Token ID198
Feature activation-3.81e-01

Pos logit contributions
vered+0.006
 spe+0.006
ement+0.005
 symp+0.005
ingly+0.005
Neg logit contributions
lic-0.003
EEP-0.002
oto-0.002
ibi-0.002
oys-0.002
Token
Token ID198
Feature activation-3.58e-01

Pos logit contributions
lic+0.026
oto+0.025
ogo+0.022
ibi+0.022
EEP+0.021
Neg logit contributions
vered-0.045
 spe-0.042
ement-0.042
 symp-0.041
 aston-0.039
TokenThe
Token ID464
Feature activation+1.42e-01

Pos logit contributions
lic+0.026
oto+0.025
ogo+0.023
ibi+0.022
EEP+0.022
Neg logit contributions
vered-0.039
ement-0.038
 spe-0.037
 symp-0.036
 aston-0.034
Token town
Token ID3240
Feature activation+1.62e+00

Pos logit contributions
vered+0.016
ement+0.015
 spe+0.014
 symp+0.014
 aston+0.014
Neg logit contributions
oto-0.010
lic-0.010
ogo-0.009
ibi-0.009
EEP-0.009
Tokenspe
Token ID4125
Feature activation+1.03e+01

Pos logit contributions
 spe+0.173
vered+0.170
+0.167
ement+0.165
ocol+0.164
Neg logit contributions
lic-0.106
oto-0.094
oys-0.092
round-0.089
washer-0.085
Tokenople
Token ID643
Feature activation+2.10e+00

Pos logit contributions
 spe+0.845
 symp+0.827
vered+0.765
 aston+0.753
ement+0.746
Neg logit contributions
round-0.912
oto-0.879
EEP-0.873
oys-0.847
lic-0.821
Token decided
Token ID3066
Feature activation+2.41e-01

Pos logit contributions
 spe+0.247
ement+0.246
vered+0.235
ocol+0.229
 symp+0.225
Neg logit contributions
lic-0.117
oto-0.096
 jumped-0.091
EEP-0.085
round-0.085
Token to
Token ID284
Feature activation+4.69e-02

Pos logit contributions
vered+0.030
ement+0.029
 spe+0.028
 symp+0.027
 aston+0.026
Neg logit contributions
oto-0.014
lic-0.014
ogo-0.012
EEP-0.012
ibi-0.012
Token ask
Token ID1265
Feature activation-5.79e-01

Pos logit contributions
vered+0.005
ement+0.005
 symp+0.005
 spe+0.005
 aston+0.005
Neg logit contributions
oto-0.003
lic-0.003
ogo-0.003
EEP-0.002
ibi-0.002
Token their
Token ID511
Feature activation+2.87e-01

Pos logit contributions
oto+0.040
lic+0.036
ogo+0.035
oys+0.034
EEP+0.032
Neg logit contributions
vered-0.064
ement-0.062
 spe-0.062
 symp-0.058
 aston-0.057
Token
Token ID198
Feature activation-3.36e-01

Pos logit contributions
oto+0.031
lic+0.030
ogo+0.029
round+0.028
ibi+0.028
Neg logit contributions
vered-0.033
ement-0.031
 symp-0.030
 spe-0.029
 aston-0.029
TokenSo
Token ID2396
Feature activation+1.04e+00

Pos logit contributions
oto+0.027
ogo+0.026
lic+0.025
ibi+0.023
round+0.023
Neg logit contributions
vered-0.033
ement-0.030
 symp-0.030
 aston-0.028
ingly-0.028
Token,
Token ID11
Feature activation-1.62e-01

Pos logit contributions
vered+0.110
ement+0.104
 spe+0.102
 symp+0.101
 aston+0.096
Neg logit contributions
oto-0.078
lic-0.075
EEP-0.068
ogo-0.068
oys-0.068
Token the
Token ID262
Feature activation+2.16e-01

Pos logit contributions
oto+0.010
lic+0.010
round+0.009
EEP+0.009
ibi+0.009
Neg logit contributions
vered-0.020
ement-0.018
 spe-0.018
 symp-0.018
 aston-0.017
Token town
Token ID3240
Feature activation+1.63e+00

Pos logit contributions
vered+0.022
ement+0.021
 symp+0.020
 spe+0.020
 aston+0.020
Neg logit contributions
lic-0.017
oto-0.017
ogo-0.016
EEP-0.016
round-0.015
Tokenspe
Token ID4125
Feature activation+1.03e+01

Pos logit contributions
vered+0.156
 spe+0.149
ement+0.147
 symp+0.147
 aston+0.137
Neg logit contributions
lic-0.140
oto-0.128
oys-0.123
ibi-0.116
round-0.116
Tokenople
Token ID643
Feature activation+2.09e+00

Pos logit contributions
 spe+0.870
 symp+0.843
vered+0.783
 aston+0.770
ement+0.759
Neg logit contributions
round-0.905
EEP-0.869
oys-0.869
oto-0.866
lic-0.808
Token looked
Token ID3114
Feature activation-1.42e+00

Pos logit contributions
ement+0.244
vered+0.241
 spe+0.237
 symp+0.229
 aston+0.220
Neg logit contributions
lic-0.125
oto-0.113
ogo-0.100
round-0.097
EEP-0.094
Token for
Token ID329
Feature activation-6.02e-01

Pos logit contributions
lic+0.085
oto+0.077
ogo+0.065
round+0.063
EEP+0.061
Neg logit contributions
vered-0.170
 spe-0.158
ement-0.153
-0.153
 symp-0.147
Token Old
Token ID5706
Feature activation-1.05e+00

Pos logit contributions
oto+0.045
lic+0.043
ogo+0.039
oys+0.038
ibi+0.038
Neg logit contributions
vered-0.064
ement-0.061
 symp-0.059
 spe-0.058
 aston-0.055
Token Tom
Token ID4186
Feature activation-4.32e-01

Pos logit contributions
oto+0.079
lic+0.076
ogo+0.070
ibi+0.068
oys+0.066
Neg logit contributions
vered-0.112
 spe-0.105
ement-0.105
 symp-0.104
 aston-0.100
Token rope
Token ID17182
Feature activation+6.16e-01

Pos logit contributions
vered+0.036
ement+0.034
 spe+0.033
 symp+0.032
 aston+0.032
Neg logit contributions
oto-0.020
lic-0.019
ogo-0.017
EEP-0.017
ibi-0.017
Token bouncing
Token ID30713
Feature activation+1.40e+00

Pos logit contributions
vered+0.067
 spe+0.062
ement+0.062
 aston+0.060
 symp+0.059
Neg logit contributions
oto-0.042
lic-0.042
round-0.038
ogo-0.038
EEP-0.038
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
vered+0.155
ement+0.149
 spe+0.147
 aston+0.143
 symp+0.142
Neg logit contributions
oto-0.092
lic-0.087
round-0.083
ibi-0.080
ogo-0.079
Token The
Token ID383
Feature activation+3.88e-01

Pos logit contributions
vered+0.012
ement+0.012
 spe+0.011
 symp+0.011
 aston+0.011
Neg logit contributions
lic-0.010
oto-0.010
ogo-0.009
ibi-0.009
EEP-0.009
Token town
Token ID3240
Feature activation+1.66e+00

Pos logit contributions
vered+0.040
ement+0.038
 spe+0.037
 symp+0.037
 aston+0.035
Neg logit contributions
lic-0.031
oto-0.030
ogo-0.028
round-0.026
ibi-0.026
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
vered+0.180
ement+0.171
 symp+0.169
 spe+0.167
 aston+0.162
Neg logit contributions
lic-0.119
oto-0.113
round-0.103
ibi-0.102
EEP-0.102
Tokenople
Token ID643
Feature activation+2.07e+00

Pos logit contributions
 spe+0.731
 symp+0.717
vered+0.717
ement+0.690
 aston+0.663
Neg logit contributions
oto-1.022
round-1.003
lic-0.993
EEP-0.988
ibi-0.955
Token gathered
Token ID9272
Feature activation-1.86e-01

Pos logit contributions
vered+0.193
ement+0.181
 spe+0.174
 symp+0.174
 aston+0.166
Neg logit contributions
oto-0.183
lic-0.180
ogo-0.170
ibi-0.164
EEP-0.162
Token around
Token ID1088
Feature activation+5.66e-01

Pos logit contributions
oto+0.009
 bye+0.007
 cool+0.007
 water+0.007
lic+0.007
Neg logit contributions
 spe-0.022
vered-0.022
-0.021
ement-0.021
 symp-0.021
Token to
Token ID284
Feature activation+1.23e-01

Pos logit contributions
vered+0.072
ement+0.071
 spe+0.066
ingly+0.065
 symp+0.064
Neg logit contributions
oto-0.024
oys-0.019
 listen-0.019
lic-0.019
EEP-0.017
Token watch
Token ID2342
Feature activation-2.75e+00

Pos logit contributions
vered+0.012
 spe+0.011
 aston+0.011
ement+0.011
 symp+0.011
Neg logit contributions
lic-0.011
oto-0.010
ogo-0.010
EEP-0.009
oys-0.009
Token.
Token ID13
Feature activation-3.71e-02

Pos logit contributions
oto+0.175
lic+0.172
ibi+0.158
ogo+0.158
round+0.156
Neg logit contributions
vered-0.221
ement-0.216
 spe-0.208
 symp-0.202
 aston-0.201
Token
Token ID198
Feature activation-4.64e-01

Pos logit contributions
lic+0.002
oto+0.002
EEP+0.002
round+0.002
ibi+0.002
Neg logit contributions
vered-0.005
 spe-0.004
ement-0.004
 symp-0.004
ingly-0.004
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
oto+0.038
lic+0.038
ogo+0.036
ibi+0.034
EEP+0.034
Neg logit contributions
vered-0.046
ement-0.043
 symp-0.042
 spe-0.041
 aston-0.040
TokenThe
Token ID464
Feature activation+7.86e-02

Pos logit contributions
oto+0.037
lic+0.037
ogo+0.034
ibi+0.033
EEP+0.033
Neg logit contributions
vered-0.042
ement-0.040
 spe-0.038
 symp-0.038
 aston-0.037
Token town
Token ID3240
Feature activation+1.57e+00

Pos logit contributions
vered+0.009
ement+0.008
 symp+0.008
 spe+0.008
 aston+0.008
Neg logit contributions
oto-0.005
lic-0.005
ogo-0.005
round-0.004
EEP-0.004
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
vered+0.163
 spe+0.159
 symp+0.157
ement+0.157
 aston+0.151
Neg logit contributions
lic-0.113
oto-0.103
round-0.101
ibi-0.093
oys-0.092
Tokenople
Token ID643
Feature activation+2.01e+00

Pos logit contributions
 spe+0.849
 symp+0.820
vered+0.758
 aston+0.755
ement+0.745
Neg logit contributions
round-0.931
EEP-0.889
oto-0.881
oys-0.856
lic-0.836
Token soon
Token ID2582
Feature activation+1.68e+00

Pos logit contributions
vered+0.217
ement+0.217
 spe+0.212
 symp+0.206
 aston+0.198
Neg logit contributions
lic-0.136
oto-0.131
round-0.117
ogo-0.115
ibi-0.111
Token discovered
Token ID5071
Feature activation+6.58e-01

Pos logit contributions
vered+0.170
ement+0.160
 spe+0.155
 symp+0.154
 aston+0.149
Neg logit contributions
oto-0.134
lic-0.131
ogo-0.125
ibi-0.120
EEP-0.118
Token where
Token ID810
Feature activation-2.10e+00

Pos logit contributions
vered+0.080
 symp+0.078
 spe+0.077
ement+0.075
ingly+0.073
Neg logit contributions
oto-0.036
lic-0.036
ogo-0.027
una-0.026
ibi-0.026
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
oto+0.114
lic+0.110
EEP+0.093
oys+0.092
ibi+0.091
Neg logit contributions
vered-0.260
ement-0.255
 symp-0.249
 spe-0.238
 aston-0.231
Token
Token ID220
Feature activation+1.29e+00

Pos logit contributions
lic+0.001
EEP+0.001
oto+0.001
oys+0.001
round+0.001
Neg logit contributions
vered-0.002
 spe-0.002
ement-0.002
 symp-0.002
 aston-0.002
Token
Token ID198
Feature activation-3.88e-01

Pos logit contributions
vered+0.125
ement+0.119
 symp+0.114
 spe+0.113
 aston+0.109
Neg logit contributions
oto-0.108
lic-0.106
ogo-0.100
ibi-0.096
round-0.095
Token
Token ID198
Feature activation-3.84e-01

Pos logit contributions
oto+0.031
lic+0.031
ogo+0.029
ibi+0.028
EEP+0.027
Neg logit contributions
vered-0.039
ement-0.037
 spe-0.035
 symp-0.035
 aston-0.034
TokenThe
Token ID464
Feature activation+7.49e-02

Pos logit contributions
oto+0.031
lic+0.031
ogo+0.029
ibi+0.028
EEP+0.028
Neg logit contributions
vered-0.038
ement-0.036
 spe-0.035
 symp-0.035
 aston-0.033
Token town
Token ID3240
Feature activation+1.59e+00

Pos logit contributions
vered+0.009
ement+0.008
 symp+0.008
 spe+0.008
ingly+0.007
Neg logit contributions
oto-0.005
lic-0.004
ogo-0.004
EEP-0.004
round-0.004
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
 spe+0.163
vered+0.162
 symp+0.155
ocol+0.155
ement+0.155
Neg logit contributions
lic-0.113
oto-0.102
round-0.099
oys-0.098
EEP-0.095
Tokenople
Token ID643
Feature activation+2.00e+00

Pos logit contributions
 spe+0.812
 symp+0.788
vered+0.746
ement+0.732
 aston+0.729
Neg logit contributions
round-0.928
EEP-0.917
oto-0.912
lic-0.887
oys-0.882
Token thought
Token ID1807
Feature activation-6.78e-01

Pos logit contributions
 spe+0.237
ement+0.229
vered+0.227
ocol+0.218
 symp+0.215
Neg logit contributions
lic-0.114
oto-0.099
round-0.086
EEP-0.086
ogo-0.081
Token and
Token ID290
Feature activation-3.31e-01

Pos logit contributions
oto+0.041
lic+0.039
ogo+0.036
ibi+0.033
EEP+0.033
Neg logit contributions
vered-0.083
ement-0.077
 spe-0.077
 symp-0.076
 aston-0.074
Token thought
Token ID1807
Feature activation-6.63e-01

Pos logit contributions
oto+0.025
lic+0.024
ogo+0.022
ibi+0.021
EEP+0.021
Neg logit contributions
vered-0.035
ement-0.034
 spe-0.033
 symp-0.032
 aston-0.031
Token until
Token ID1566
Feature activation+2.62e+00

Pos logit contributions
lic+0.035
oto+0.034
ogo+0.030
EEP+0.028
oys+0.026
Neg logit contributions
vered-0.084
 spe-0.079
ement-0.077
 symp-0.077
 aston-0.076
Token had
Token ID550
Feature activation+1.34e-01

Pos logit contributions
lic+0.020
oto+0.020
ogo+0.018
ibi+0.017
EEP+0.016
Neg logit contributions
vered-0.039
ement-0.038
 spe-0.037
 symp-0.037
 aston-0.036
Token to
Token ID284
Feature activation+3.96e-02

Pos logit contributions
vered+0.015
ement+0.015
 spe+0.014
 symp+0.014
 aston+0.014
Neg logit contributions
oto-0.009
lic-0.009
ogo-0.008
ibi-0.008
EEP-0.008
Token help
Token ID1037
Feature activation+4.96e-01

Pos logit contributions
vered+0.005
ement+0.004
 spe+0.004
 symp+0.004
 aston+0.004
Neg logit contributions
oto-0.003
lic-0.002
ogo-0.002
ibi-0.002
EEP-0.002
Token the
Token ID262
Feature activation+1.68e-01

Pos logit contributions
vered+0.058
ement+0.057
 spe+0.055
 symp+0.053
 aston+0.052
Neg logit contributions
oto-0.030
ogo-0.026
lic-0.026
oys-0.024
una-0.023
Token town
Token ID3240
Feature activation+1.60e+00

Pos logit contributions
vered+0.018
ement+0.018
 symp+0.017
 spe+0.017
 aston+0.016
Neg logit contributions
lic-0.012
oto-0.011
ogo-0.011
EEP-0.010
round-0.010
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
vered+0.172
ement+0.169
 spe+0.169
 symp+0.162
 aston+0.157
Neg logit contributions
lic-0.104
oto-0.096
round-0.091
oys-0.090
ogo-0.087
Tokenople
Token ID643
Feature activation+2.03e+00

Pos logit contributions
 spe+0.936
 symp+0.870
 aston+0.821
vered+0.796
 blonde+0.794
Neg logit contributions
round-0.897
EEP-0.826
oys-0.800
oto-0.788
ock-0.772
Token.
Token ID13
Feature activation+4.19e-02

Pos logit contributions
 spe+0.229
ement+0.226
vered+0.225
 symp+0.212
 aston+0.207
Neg logit contributions
oto-0.109
lic-0.108
round-0.100
ogo-0.096
 listen-0.095
Token He
Token ID679
Feature activation-1.99e-01

Pos logit contributions
vered+0.005
 spe+0.005
ement+0.005
 symp+0.005
 aston+0.005
Neg logit contributions
lic-0.002
oto-0.002
EEP-0.002
ogo-0.002
round-0.002
Token quickly
Token ID2952
Feature activation+2.76e-01

Pos logit contributions
lic+0.013
oto+0.013
ogo+0.012
ibi+0.011
round+0.011
Neg logit contributions
vered-0.022
ement-0.022
 spe-0.021
 symp-0.021
 aston-0.020
Token formulated
Token ID34391
Feature activation-4.03e-01

Pos logit contributions
vered+0.029
ement+0.027
 spe+0.026
 symp+0.026
 aston+0.025
Neg logit contributions
oto-0.022
lic-0.021
ogo-0.020
ibi-0.019
EEP-0.019
Token.
Token ID13
Feature activation+2.10e-01

Pos logit contributions
vered+0.170
 symp+0.163
 spe+0.160
ement+0.158
 aston+0.153
Neg logit contributions
lic-0.120
oto-0.111
round-0.104
EEP-0.102
oys-0.100
Token
Token ID198
Feature activation-3.50e-01

Pos logit contributions
vered+0.020
ement+0.018
 spe+0.018
 symp+0.018
 aston+0.017
Neg logit contributions
oto-0.019
lic-0.019
ogo-0.017
ibi-0.017
EEP-0.017
Token
Token ID198
Feature activation-3.42e-01

Pos logit contributions
oto+0.031
lic+0.030
ogo+0.030
ibi+0.028
EEP+0.028
Neg logit contributions
vered-0.030
ement-0.029
 symp-0.028
 spe-0.027
 aston-0.027
TokenThe
Token ID464
Feature activation+1.00e-01

Pos logit contributions
oto+0.033
ogo+0.031
lic+0.031
ibi+0.030
EEP+0.029
Neg logit contributions
vered-0.029
ement-0.026
 symp-0.026
 spe-0.025
 aston-0.025
Token town
Token ID3240
Feature activation+1.60e+00

Pos logit contributions
vered+0.010
ement+0.009
 spe+0.009
 symp+0.009
 aston+0.008
Neg logit contributions
oto-0.008
lic-0.008
ogo-0.008
ibi-0.007
round-0.007
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
vered+0.146
 spe+0.139
 symp+0.137
ement+0.134
 aston+0.129
Neg logit contributions
lic-0.143
oto-0.137
round-0.128
oys-0.123
EEP-0.122
Tokenople
Token ID643
Feature activation+2.04e+00

Pos logit contributions
 spe+0.792
 symp+0.765
vered+0.739
ement+0.700
 aston+0.699
Neg logit contributions
round-0.966
oto-0.960
EEP-0.929
lic-0.927
oys-0.913
Token worked
Token ID3111
Feature activation+1.89e+00

Pos logit contributions
vered+0.214
 spe+0.206
ement+0.206
 symp+0.198
 aston+0.189
Neg logit contributions
lic-0.150
oto-0.146
ogo-0.128
round-0.128
EEP-0.125
Token together
Token ID1978
Feature activation+1.51e+00

Pos logit contributions
vered+0.185
 spe+0.176
ement+0.169
 symp+0.167
 aston+0.163
Neg logit contributions
lic-0.158
oto-0.148
ogo-0.136
oys-0.132
ibi-0.129
Token to
Token ID284
Feature activation+1.23e-01

Pos logit contributions
vered+0.196
ement+0.188
 spe+0.187
 symp+0.185
 aston+0.178
Neg logit contributions
lic-0.074
oto-0.067
ogo-0.059
oys-0.056
round-0.052
Token help
Token ID1037
Feature activation+5.33e-01

Pos logit contributions
vered+0.014
ement+0.013
 spe+0.013
 symp+0.013
 aston+0.012
Neg logit contributions
oto-0.008
lic-0.008
ogo-0.008
ibi-0.007
EEP-0.007
Token bell
Token ID8966
Feature activation+5.46e-01

Pos logit contributions
vered+0.022
 symp+0.022
ement+0.020
ingly+0.019
 spe+0.019
Neg logit contributions
lic-0.010
oto-0.010
EEP-0.009
round-0.009
 sound-0.009
Token stopped
Token ID5025
Feature activation-2.21e+00

Pos logit contributions
vered+0.061
ement+0.058
 symp+0.058
 aston+0.056
 spe+0.056
Neg logit contributions
lic-0.037
oto-0.034
ibi-0.031
EEP-0.031
round-0.031
Token.
Token ID13
Feature activation+8.53e-02

Pos logit contributions
ruck+0.098
oto+0.096
lic+0.090
reat+0.089
EEP+0.084
Neg logit contributions
vered-0.281
ement-0.262
-0.239
 aston-0.234
 spe-0.228
Token The
Token ID383
Feature activation+2.39e-01

Pos logit contributions
vered+0.010
 spe+0.009
ement+0.009
 symp+0.009
ingly+0.008
Neg logit contributions
lic-0.006
oto-0.005
EEP-0.005
ibi-0.005
round-0.005
Token town
Token ID3240
Feature activation+1.59e+00

Pos logit contributions
vered+0.030
 symp+0.028
ement+0.026
ingly+0.026
 spe+0.025
Neg logit contributions
oto-0.011
 sound-0.011
lic-0.011
 cool-0.010
ruck-0.010
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
vered+0.171
 symp+0.165
ement+0.163
 spe+0.162
 aston+0.155
Neg logit contributions
lic-0.110
oto-0.098
round-0.095
ibi-0.090
EEP-0.089
Tokenople
Token ID643
Feature activation+1.99e+00

Pos logit contributions
 spe+0.865
 symp+0.833
 aston+0.761
vered+0.756
ement+0.745
Neg logit contributions
round-0.907
EEP-0.890
oys-0.832
lic-0.824
oto-0.816
Token cheered
Token ID34767
Feature activation+2.91e+00

Pos logit contributions
vered+0.199
ement+0.192
 spe+0.187
 symp+0.186
 aston+0.176
Neg logit contributions
lic-0.157
oto-0.150
round-0.136
ogo-0.136
ibi-0.133
Token and
Token ID290
Feature activation-3.23e-01

Pos logit contributions
vered+0.327
 spe+0.304
ement+0.303
 symp+0.287
 aston+0.284
Neg logit contributions
oto-0.205
lic-0.199
ogo-0.177
EEP-0.173
oys-0.172
Token hugged
Token ID41130
Feature activation-2.13e+00

Pos logit contributions
oto+0.028
lic+0.026
ogo+0.026
ibi+0.024
round+0.024
Neg logit contributions
vered-0.032
ement-0.029
 spe-0.029
 symp-0.028
 aston-0.028
Token each
Token ID1123
Feature activation+1.99e-01

Pos logit contributions
oto+0.119
lic+0.101
oys+0.100
 bye+0.099
round+0.098
Neg logit contributions
vered-0.242
ement-0.237
 spe-0.234
 aston-0.221
-0.220
Token.
Token ID13
Feature activation+2.92e-02

Pos logit contributions
vered+0.237
ement+0.225
 symp+0.216
 spe+0.212
 aston+0.210
Neg logit contributions
oto-0.132
lic-0.129
ogo-0.113
EEP-0.112
oys-0.111
Token
Token ID198
Feature activation-3.89e-01

Pos logit contributions
vered+0.003
ement+0.003
 spe+0.003
 symp+0.003
 aston+0.003
Neg logit contributions
lic-0.002
oto-0.002
ogo-0.002
EEP-0.002
ibi-0.002
Token
Token ID198
Feature activation-3.68e-01

Pos logit contributions
oto+0.030
lic+0.030
ogo+0.028
ibi+0.027
EEP+0.026
Neg logit contributions
vered-0.040
ement-0.038
 spe-0.037
 symp-0.037
 aston-0.035
TokenThe
Token ID464
Feature activation+1.03e-01

Pos logit contributions
lic+0.030
oto+0.030
ogo+0.027
ibi+0.026
EEP+0.026
Neg logit contributions
vered-0.037
ement-0.035
 spe-0.034
 symp-0.034
 aston-0.032
Token town
Token ID3240
Feature activation+1.58e+00

Pos logit contributions
vered+0.012
ement+0.011
 symp+0.011
 spe+0.010
ingly+0.010
Neg logit contributions
oto-0.007
lic-0.006
ogo-0.006
ye-0.006
round-0.005
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
vered+0.161
 symp+0.156
ement+0.155
 spe+0.154
 aston+0.148
Neg logit contributions
lic-0.112
oto-0.108
round-0.100
oys-0.096
ogo-0.093
Tokenople
Token ID643
Feature activation+1.99e+00

Pos logit contributions
 spe+0.828
 symp+0.812
vered+0.737
 aston+0.734
ement+0.723
Neg logit contributions
round-0.914
EEP-0.885
oys-0.869
oto-0.862
lic-0.849
Token were
Token ID547
Feature activation+7.55e-01

Pos logit contributions
vered+0.202
ement+0.199
 spe+0.190
 symp+0.189
 aston+0.179
Neg logit contributions
lic-0.147
oto-0.144
ogo-0.128
round-0.127
ibi-0.122
Token relieved
Token ID22598
Feature activation+5.63e-01

Pos logit contributions
vered+0.090
ement+0.085
 symp+0.082
 spe+0.080
 aston+0.076
Neg logit contributions
lic-0.047
oto-0.045
ogo-0.040
ibi-0.038
round-0.037
Token.
Token ID13
Feature activation+5.72e-03

Pos logit contributions
vered+0.073
 spe+0.066
ement+0.065
 aston+0.064
 symp+0.063
Neg logit contributions
oto-0.031
lic-0.028
oys-0.024
ogo-0.022
EEP-0.021
Token They
Token ID1119
Feature activation-5.77e-01

Pos logit contributions
vered+0.001
 spe+0.001
ement+0.001
 symp+0.001
 aston+0.001
Neg logit contributions
lic-0.000
oto-0.000
EEP-0.000
oys-0.000
ogo-0.000
Token!
Token ID0
Feature activation-1.38e+00

Pos logit contributions
lic+0.022
oto+0.021
round+0.021
ogo+0.020
EEP+0.020
Neg logit contributions
ement-0.039
vered-0.038
 symp-0.037
 spe-0.036
 aston-0.036
Token
Token ID198
Feature activation-2.91e-01

Pos logit contributions
lic+0.097
oto+0.094
ogo+0.086
EEP+0.086
ibi+0.085
Neg logit contributions
vered-0.153
ement-0.143
 spe-0.143
 symp-0.140
 aston-0.133
Token
Token ID198
Feature activation-2.76e-01

Pos logit contributions
oto+0.022
lic+0.022
ogo+0.020
ibi+0.019
EEP+0.018
Neg logit contributions
vered-0.031
ement-0.030
 spe-0.029
 symp-0.028
 aston-0.028
TokenThe
Token ID464
Feature activation+1.26e-01

Pos logit contributions
lic+0.022
oto+0.022
ogo+0.020
ibi+0.019
round+0.019
Neg logit contributions
vered-0.028
ement-0.027
 spe-0.026
 symp-0.026
 aston-0.025
Token town
Token ID3240
Feature activation+1.59e+00

Pos logit contributions
vered+0.012
ement+0.012
 symp+0.011
 spe+0.011
 aston+0.011
Neg logit contributions
oto-0.011
lic-0.010
ogo-0.010
EEP-0.009
round-0.009
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
vered+0.141
 symp+0.140
 spe+0.137
ement+0.137
 aston+0.130
Neg logit contributions
lic-0.136
oto-0.127
round-0.122
EEP-0.117
ogo-0.116
Tokenople
Token ID643
Feature activation+1.98e+00

Pos logit contributions
 spe+0.837
 symp+0.813
 aston+0.733
ement+0.729
vered+0.723
Neg logit contributions
round-0.911
EEP-0.875
oto-0.851
oys-0.839
lic-0.834
Token thanked
Token ID26280
Feature activation+2.33e+00

Pos logit contributions
ement+0.202
vered+0.201
 spe+0.193
 symp+0.193
 aston+0.183
Neg logit contributions
lic-0.144
oto-0.138
ogo-0.125
round-0.125
ibi-0.119
Token the
Token ID262
Feature activation+2.05e-01

Pos logit contributions
vered+0.293
ement+0.269
 spe+0.267
 symp+0.257
+0.252
Neg logit contributions
oto-0.137
lic-0.100
 listen-0.100
 bye-0.097
ock-0.097
Token hero
Token ID4293
Feature activation-4.87e-01

Pos logit contributions
vered+0.021
ement+0.020
 symp+0.019
 spe+0.019
 aston+0.018
Neg logit contributions
oto-0.016
lic-0.016
ogo-0.015
ibi-0.014
EEP-0.014
Token for
Token ID329
Feature activation-6.62e-01

Pos logit contributions
oto+0.025
lic+0.022
round+0.022
ogo+0.022
oys+0.020
Neg logit contributions
vered-0.059
 spe-0.059
ement-0.057
 aston-0.056
 symp-0.055
Token it
Token ID340
Feature activation-3.76e-01

Pos logit contributions
 symp+0.284
vered+0.282
ement+0.280
ingly+0.262
 aston+0.253
Neg logit contributions
oto-0.094
ye-0.080
lic-0.079
EEP-0.077
 bye-0.075
Token sang
Token ID25889
Feature activation-1.74e+00

Pos logit contributions
lic+0.025
oto+0.023
ogo+0.022
EEP+0.020
round+0.020
Neg logit contributions
vered-0.042
ement-0.041
 symp-0.040
 aston-0.038
 spe-0.038
Token,
Token ID11
Feature activation-3.65e-01

Pos logit contributions
lic+0.132
oto+0.126
ogo+0.109
round+0.106
ibi+0.105
Neg logit contributions
vered-0.184
ement-0.173
 symp-0.168
 spe-0.168
 aston-0.163
Token the
Token ID262
Feature activation+9.16e-02

Pos logit contributions
lic+0.022
oto+0.022
ogo+0.018
EEP+0.018
round+0.018
Neg logit contributions
vered-0.042
ement-0.040
 symp-0.039
 aston-0.037
ingly-0.037
Token town
Token ID3240
Feature activation+1.52e+00

Pos logit contributions
vered+0.011
 symp+0.011
ement+0.011
ingly+0.010
+0.010
Neg logit contributions
 sound-0.004
oto-0.004
lic-0.004
 cool-0.004
 bus-0.004
Tokenspe
Token ID4125
Feature activation+1.02e+01

Pos logit contributions
vered+0.171
 symp+0.169
ement+0.167
 aston+0.161
 spe+0.156
Neg logit contributions
lic-0.101
oto-0.087
round-0.081
EEP-0.076
ibi-0.074
Tokenople
Token ID643
Feature activation+1.94e+00

Pos logit contributions
 symp+0.830
 spe+0.830
 aston+0.782
vered+0.772
ement+0.755
Neg logit contributions
round-0.920
EEP-0.881
oto-0.837
oys-0.822
lic-0.793
Token would
Token ID561
Feature activation+6.00e-01

Pos logit contributions
vered+0.216
ement+0.212
 symp+0.205
 spe+0.203
 aston+0.197
Neg logit contributions
lic-0.128
oto-0.127
ogo-0.113
round-0.109
EEP-0.105
Token act
Token ID719
Feature activation-6.48e-01

Pos logit contributions
vered+0.072
ement+0.069
 spe+0.065
 symp+0.065
 aston+0.063
Neg logit contributions
oto-0.038
lic-0.036
ogo-0.033
ibi-0.031
round-0.031
Token nice
Token ID3621
Feature activation-2.66e+00

Pos logit contributions
lic+0.049
oto+0.046
ogo+0.040
round+0.038
ibi+0.038
Neg logit contributions
vered-0.070
ement-0.067
 symp-0.064
 spe-0.064
 aston-0.061
Token.
Token ID13
Feature activation-8.89e-02

Pos logit contributions
lic+0.189
oto+0.188
ogo+0.164
ibi+0.159
round+0.158
Neg logit contributions
vered-0.293
ement-0.277
 symp-0.272
 spe-0.271
 aston-0.260
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
oto+0.084
lic+0.084
ogo+0.077
EEP+0.074
ibi+0.072
Neg logit contributions
vered-0.118
ement-0.109
 symp-0.106
 spe-0.104
 aston-0.100
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
vered+0.001
ement+0.001
 symp+0.001
 aston+0.001
 spe+0.001
Neg logit contributions
oto-0.002
lic-0.002
ogo-0.001
ibi-0.001
round-0.001
Token
Token ID198
Feature activation-4.30e-01

Pos logit contributions
oto+0.043
ogo+0.041
lic+0.041
round+0.039
EEP+0.039
Neg logit contributions
vered-0.032
 symp-0.030
ement-0.030
 aston-0.029
 spe-0.027
TokenThe
Token ID464
Feature activation+1.19e-01

Pos logit contributions
oto+0.034
ogo+0.034
lic+0.031
ibi+0.029
round+0.028
Neg logit contributions
vered-0.041
ement-0.039
 symp-0.038
 aston-0.037
ingly-0.035
Token town
Token ID3240
Feature activation+1.52e+00

Pos logit contributions
vered+0.013
ement+0.012
 spe+0.012
 symp+0.012
 aston+0.011
Neg logit contributions
oto-0.009
lic-0.008
ogo-0.008
ibi-0.008
oys-0.007
Tokenspe
Token ID4125
Feature activation+1.01e+01

Pos logit contributions
vered+0.142
 spe+0.136
 symp+0.130
ement+0.129
 aston+0.123
Neg logit contributions
lic-0.133
oto-0.124
oys-0.116
round-0.113
ibi-0.112
Tokenople
Token ID643
Feature activation+1.95e+00

Pos logit contributions
 spe+0.848
 symp+0.820
vered+0.752
 aston+0.741
ement+0.733
Neg logit contributions
round-0.887
EEP-0.877
oys-0.862
oto-0.855
lic-0.832
Token were
Token ID547
Feature activation+7.39e-01

Pos logit contributions
vered+0.206
 spe+0.202
ement+0.200
 symp+0.186
ocol+0.181
Neg logit contributions
lic-0.134
oto-0.125
ogo-0.110
round-0.105
EEP-0.103
Token amazed
Token ID24562
Feature activation+6.25e-01

Pos logit contributions
vered+0.089
ement+0.083
 symp+0.081
 spe+0.079
+0.077
Neg logit contributions
lic-0.046
oto-0.041
ogo-0.036
ibi-0.034
round-0.031
Token by
Token ID416
Feature activation-1.40e-01

Pos logit contributions
vered+0.077
 spe+0.070
 symp+0.067
 aston+0.065
+0.064
Neg logit contributions
oto-0.041
lic-0.038
oys-0.031
ogo-0.030
ock-0.029
Token this
Token ID428
Feature activation+4.52e-02

Pos logit contributions
oto+0.011
lic+0.010
ogo+0.010
ibi+0.009
oys+0.009
Neg logit contributions
vered-0.015
ement-0.014
 spe-0.014
 symp-0.013
 aston-0.013
Token happy
Token ID3772
Feature activation+5.18e-01

Pos logit contributions
vered+0.077
ement+0.074
 symp+0.073
 spe+0.069
 aston+0.067
Neg logit contributions
lic-0.038
oto-0.036
ibi-0.030
round-0.029
ogo-0.029
Token and
Token ID290
Feature activation-4.37e-01

Pos logit contributions
vered+0.065
 symp+0.061
ement+0.060
 spe+0.060
 aston+0.058
Neg logit contributions
oto-0.027
lic-0.025
oys-0.024
EEP-0.022
ruck-0.021
Token the
Token ID262
Feature activation+5.58e-02

Pos logit contributions
oto+0.036
lic+0.035
ogo+0.034
ibi+0.032
round+0.032
Neg logit contributions
vered-0.043
ement-0.041
 spe-0.040
 symp-0.039
 aston-0.038
Token nation
Token ID3277
Feature activation+2.22e+00

Pos logit contributions
vered+0.006
 symp+0.005
ement+0.005
 spe+0.005
ingly+0.005
Neg logit contributions
lic-0.004
oto-0.004
oys-0.004
EEP-0.004
round-0.004
Token pro
Token ID386
Feature activation+2.90e+00

Pos logit contributions
vered+0.250
 symp+0.246
 spe+0.235
ement+0.234
 aston+0.228
Neg logit contributions
lic-0.134
oto-0.124
round-0.112
EEP-0.110
ogo-0.108
Tokenspe
Token ID4125
Feature activation+1.01e+01

Pos logit contributions
 symp+0.245
vered+0.244
 spe+0.237
ement+0.234
 aston+0.232
Neg logit contributions
lic-0.260
oto-0.251
EEP-0.232
ogo-0.230
round-0.226
Tokenred
Token ID445
Feature activation+6.17e-02

Pos logit contributions
 symp+0.859
 spe+0.854
 aston+0.769
vered+0.754
 distress+0.750
Neg logit contributions
round-0.894
EEP-0.884
oys-0.857
oto-0.820
lic-0.808
Token.
Token ID13
Feature activation+7.88e-02

Pos logit contributions
vered+0.006
ement+0.006
 symp+0.006
 spe+0.006
 aston+0.006
Neg logit contributions
lic-0.004
oto-0.004
ogo-0.004
EEP-0.004
oys-0.004
Token<|endoftext|>
Token ID50256
Feature activation+9.78e-01

Pos logit contributions
vered+0.009
 spe+0.008
ement+0.008
 symp+0.008
ingly+0.008
Neg logit contributions
lic-0.005
oto-0.005
ibi-0.005
EEP-0.005
round-0.005
TokenOnce
Token ID7454
Feature activation-1.10e-01

Pos logit contributions
vered+0.105
 spe+0.098
ement+0.097
 symp+0.096
 aston+0.092
Neg logit contributions
lic-0.074
oto-0.071
ogo-0.066
ibi-0.063
EEP-0.062
Token upon
Token ID2402
Feature activation+6.25e-02

Pos logit contributions
oto+0.007
lic+0.007
ogo+0.007
ibi+0.007
EEP+0.006
Neg logit contributions
vered-0.013
 spe-0.012
ement-0.012
 symp-0.011
 aston-0.011
Token.
Token ID13
Feature activation-8.89e-02

Pos logit contributions
lic+0.189
oto+0.188
ogo+0.164
ibi+0.159
round+0.158
Neg logit contributions
vered-0.293
ement-0.277
 symp-0.272
 spe-0.271
 aston-0.260
Token
Token ID198
Feature activation-5.08e-01

Pos logit contributions
lic+0.005
EEP+0.005
oto+0.005
round+0.004
ibi+0.004
Neg logit contributions
vered-0.011
 spe-0.010
 symp-0.010
ement-0.010
ingly-0.010
Token
Token ID198
Feature activation-4.91e-01

Pos logit contributions
oto+0.040
lic+0.039
ogo+0.036
ibi+0.035
EEP+0.034
Neg logit contributions
vered-0.053
ement-0.050
 spe-0.049
 symp-0.048
 aston-0.046
TokenThe
Token ID464
Feature activation+3.02e-02

Pos logit contributions
lic+0.042
oto+0.042
ogo+0.038
round+0.037
ibi+0.036
Neg logit contributions
vered-0.047
ement-0.045
 spe-0.043
 symp-0.043
 aston-0.041
Token town
Token ID3240
Feature activation+1.49e+00

Pos logit contributions
vered+0.004
 symp+0.004
ement+0.004
ingly+0.003
pen+0.003
Neg logit contributions
oto-0.001
lic-0.001
 sound-0.001
 bus-0.001
 cool-0.001
Tokenspe
Token ID4125
Feature activation+1.01e+01

Pos logit contributions
vered+0.147
 symp+0.147
ement+0.141
 spe+0.140
 aston+0.138
Neg logit contributions
lic-0.108
oto-0.099
round-0.098
oys-0.088
EEP-0.087
Tokenople
Token ID643
Feature activation+1.92e+00

Pos logit contributions
 spe+0.852
 symp+0.838
 aston+0.765
vered+0.750
ement+0.740
Neg logit contributions
round-0.892
EEP-0.847
oto-0.820
oys-0.815
lic-0.791
Token were
Token ID547
Feature activation+6.54e-01

Pos logit contributions
vered+0.212
ement+0.210
 spe+0.201
 symp+0.201
 aston+0.192
Neg logit contributions
oto-0.125
lic-0.125
ogo-0.109
round-0.108
ibi-0.102
Token so
Token ID523
Feature activation+3.05e-01

Pos logit contributions
vered+0.075
ement+0.071
 symp+0.070
 spe+0.067
 aston+0.065
Neg logit contributions
lic-0.044
oto-0.043
ogo-0.038
ibi-0.036
round-0.035
Token happy
Token ID3772
Feature activation+5.64e-01

Pos logit contributions
vered+0.030
ement+0.028
 spe+0.027
 symp+0.027
 aston+0.026
Neg logit contributions
oto-0.026
lic-0.025
ogo-0.024
ibi-0.023
EEP-0.023
Token they
Token ID484
Feature activation-6.43e-01

Pos logit contributions
vered+0.062
ement+0.058
 symp+0.057
 spe+0.057
 aston+0.055
Neg logit contributions
oto-0.041
lic-0.040
ogo-0.037
ibi-0.035
EEP-0.035
Token shine
Token ID18340
Feature activation-1.67e+00

Pos logit contributions
vered+0.029
ement+0.028
 symp+0.027
 spe+0.027
 aston+0.027
Neg logit contributions
lic-0.018
oto-0.016
EEP-0.016
ogo-0.015
round-0.015
Token through
Token ID832
Feature activation-1.05e+00

Pos logit contributions
lic+0.121
oto+0.113
ock+0.104
round+0.101
ogo+0.098
Neg logit contributions
vered-0.178
 aston-0.163
 symp-0.162
ement-0.158
 distress-0.156
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
ruck+0.032
lic+0.030
oto+0.027
 Sam+0.025
 cool+0.024
Neg logit contributions
vered-0.136
 symp-0.136
ement-0.134
-0.122
-0.122
Token The
Token ID383
Feature activation+2.25e-01

Pos logit contributions
vered+0.011
ement+0.010
 spe+0.010
 symp+0.010
 aston+0.010
Neg logit contributions
lic-0.009
oto-0.009
EEP-0.008
round-0.008
ogo-0.008
Token town
Token ID3240
Feature activation+1.51e+00

Pos logit contributions
vered+0.026
 symp+0.024
ement+0.024
 spe+0.023
ingly+0.023
Neg logit contributions
lic-0.013
oto-0.012
EEP-0.012
ogo-0.011
ruck-0.011
Tokenspe
Token ID4125
Feature activation+1.01e+01

Pos logit contributions
vered+0.159
 symp+0.157
ement+0.156
ocol+0.151
+0.150
Neg logit contributions
lic-0.102
oto-0.087
round-0.086
EEP-0.079
ibi-0.077
Tokenople
Token ID643
Feature activation+1.92e+00

Pos logit contributions
 spe+0.833
 symp+0.813
 aston+0.743
ement+0.737
vered+0.726
Neg logit contributions
round-0.878
EEP-0.872
oys-0.820
oto-0.816
lic-0.810
Token were
Token ID547
Feature activation+7.26e-01

Pos logit contributions
vered+0.192
ement+0.188
 spe+0.183
 symp+0.181
 aston+0.171
Neg logit contributions
lic-0.148
oto-0.140
ogo-0.126
round-0.125
EEP-0.121
Token very
Token ID845
Feature activation-4.00e-01

Pos logit contributions
vered+0.082
ement+0.078
 symp+0.076
 spe+0.074
 aston+0.071
Neg logit contributions
lic-0.049
oto-0.047
ogo-0.042
ibi-0.040
round-0.039
Token happy
Token ID3772
Feature activation+6.06e-01

Pos logit contributions
oto+0.033
lic+0.032
ogo+0.030
ibi+0.029
EEP+0.028
Neg logit contributions
vered-0.040
ement-0.038
 spe-0.037
 symp-0.036
 aston-0.035
Token and
Token ID290
Feature activation-3.78e-01

Pos logit contributions
vered+0.074
ement+0.069
 symp+0.069
 spe+0.068
 aston+0.066
Neg logit contributions
oto-0.035
lic-0.033
oys-0.029
ogo-0.028
EEP-0.028
Tokenâ
Token ID22940
Feature activation-2.14e-02

Pos logit contributions
oto+0.125
lic+0.122
ogo+0.117
ibi+0.110
round+0.109
Neg logit contributions
vered-0.130
ement-0.122
 spe-0.119
 symp-0.118
 aston-0.114
Token
Token ID26391
Feature activation-2.19e+00

Pos logit contributions
oto+0.002
ogo+0.002
lic+0.002
ibi+0.002
una+0.001
Neg logit contributions
vered-0.002
ement-0.002
 symp-0.002
 aston-0.002
 spe-0.002
Token<|endoftext|>
Token ID50256
Feature activation+9.71e-01

Pos logit contributions
oto+0.194
lic+0.185
ogo+0.178
ibi+0.167
round+0.165
Neg logit contributions
vered-0.205
ement-0.193
 spe-0.190
 symp-0.187
 aston-0.181
TokenThe
Token ID464
Feature activation-6.11e-02

Pos logit contributions
vered+0.106
ement+0.098
 spe+0.097
 symp+0.096
 aston+0.092
Neg logit contributions
lic-0.072
oto-0.071
ogo-0.065
ibi-0.061
EEP-0.061
Token town
Token ID3240
Feature activation+1.52e+00

Pos logit contributions
oto+0.003
lic+0.003
ibi+0.003
ogo+0.003
round+0.003
Neg logit contributions
vered-0.008
ement-0.008
 spe-0.007
 symp-0.007
 aston-0.007
Tokenspe
Token ID4125
Feature activation+1.01e+01

Pos logit contributions
vered+0.166
ement+0.159
 symp+0.156
 spe+0.155
 aston+0.148
Neg logit contributions
lic-0.106
oto-0.106
round-0.095
ogo-0.094
ibi-0.093
Tokenople
Token ID643
Feature activation+1.97e+00

Pos logit contributions
vered+0.745
 spe+0.732
 symp+0.714
ement+0.696
 aston+0.663
Neg logit contributions
oto-1.006
round-0.991
lic-0.974
EEP-0.970
ibi-0.928
Token were
Token ID547
Feature activation+6.53e-01

Pos logit contributions
vered+0.215
ement+0.207
 symp+0.200
 spe+0.198
 aston+0.190
Neg logit contributions
oto-0.139
lic-0.138
ogo-0.124
round-0.123
EEP-0.122
Token walking
Token ID6155
Feature activation+2.20e+00

Pos logit contributions
vered+0.076
 spe+0.068
ement+0.066
 symp+0.064
 aston+0.061
Neg logit contributions
lic-0.045
oto-0.044
ogo-0.040
ibi-0.039
EEP-0.038
Token quickly
Token ID2952
Feature activation+1.61e-01

Pos logit contributions
vered+0.259
ement+0.255
 spe+0.254
 symp+0.239
 aston+0.235
Neg logit contributions
oto-0.128
round-0.122
lic-0.118
ibi-0.111
oys-0.110
Token down
Token ID866
Feature activation-1.76e+00

Pos logit contributions
vered+0.018
ement+0.017
 spe+0.016
 symp+0.016
 aston+0.016
Neg logit contributions
lic-0.011
oto-0.011
round-0.010
EEP-0.010
ogo-0.010
Token safe
Token ID3338
Feature activation-2.91e+00

Pos logit contributions
vered+0.172
ement+0.164
 symp+0.156
 spe+0.154
 aston+0.147
Neg logit contributions
lic-0.106
oto-0.097
round-0.088
ibi-0.086
EEP-0.085
Token again
Token ID757
Feature activation+2.01e+00

Pos logit contributions
oto+0.179
lic+0.172
oys+0.167
ogo+0.154
round+0.154
Neg logit contributions
vered-0.315
ement-0.300
 spe-0.293
 symp-0.293
 aston-0.286
Token.
Token ID13
Feature activation+5.47e-02

Pos logit contributions
vered+0.219
ement+0.214
 symp+0.203
 aston+0.198
 spe+0.197
Neg logit contributions
lic-0.130
oto-0.130
EEP-0.116
oys-0.114
ogo-0.114
Token The
Token ID383
Feature activation+2.81e-01

Pos logit contributions
vered+0.006
ement+0.005
 spe+0.005
 symp+0.005
 aston+0.005
Neg logit contributions
lic-0.004
oto-0.004
ogo-0.004
EEP-0.004
ibi-0.004
Token town
Token ID3240
Feature activation+1.52e+00

Pos logit contributions
vered+0.032
ement+0.030
 symp+0.029
 spe+0.029
ingly+0.028
Neg logit contributions
lic-0.018
oto-0.017
ogo-0.016
EEP-0.015
 robber-0.015
Tokenspe
Token ID4125
Feature activation+1.01e+01

Pos logit contributions
vered+0.153
 symp+0.150
ement+0.150
 spe+0.144
 aston+0.140
Neg logit contributions
lic-0.112
oto-0.104
round-0.098
oys-0.096
EEP-0.095
Tokenople
Token ID643
Feature activation+1.93e+00

Pos logit contributions
 spe+0.857
 symp+0.827
 aston+0.755
ement+0.742
+0.728
Neg logit contributions
round-0.871
EEP-0.856
oys-0.826
oto-0.803
lic-0.774
Token thanked
Token ID26280
Feature activation+2.28e+00

Pos logit contributions
vered+0.198
ement+0.197
 symp+0.189
 spe+0.184
 aston+0.178
Neg logit contributions
lic-0.143
oto-0.137
ogo-0.122
round-0.121
EEP-0.118
Token the
Token ID262
Feature activation+1.81e-01

Pos logit contributions
vered+0.276
ement+0.260
 spe+0.245
 symp+0.239
 aston+0.238
Neg logit contributions
oto-0.137
ogo-0.109
lic-0.107
una-0.105
icated-0.100
Token bear
Token ID6842
Feature activation-1.30e+00

Pos logit contributions
vered+0.019
ement+0.018
 symp+0.018
 spe+0.018
 aston+0.017
Neg logit contributions
lic-0.014
oto-0.013
ogo-0.012
EEP-0.012
round-0.011
Token many
Token ID867
Feature activation-1.47e+00

Pos logit contributions
oto+0.081
lic+0.078
ogo+0.072
round+0.069
ibi+0.067
Neg logit contributions
vered-0.152
ement-0.144
 spe-0.143
 aston-0.136
 symp-0.136
Token treats
Token ID18432
Feature activation-2.69e-01

Pos logit contributions
vered+0.005
 symp+0.005
ement+0.005
 aston+0.004
 spe+0.004
Neg logit contributions
oto-0.003
ogo-0.002
lic-0.002
round-0.002
ibi-0.002
Token!"
Token ID2474
Feature activation-2.40e+00

Pos logit contributions
lic+0.019
oto+0.019
ogo+0.018
ibi+0.017
EEP+0.016
Neg logit contributions
vered-0.029
ement-0.028
 spe-0.027
 symp-0.026
 aston-0.025
Token
Token ID198
Feature activation-5.68e-01

Pos logit contributions
oto+0.182
lic+0.172
EEP+0.168
round+0.165
ogo+0.163
Neg logit contributions
vered-0.251
ement-0.240
 spe-0.228
 aston-0.223
 symp-0.218
Token
Token ID198
Feature activation-5.56e-01

Pos logit contributions
oto+0.048
lic+0.048
ogo+0.045
ibi+0.044
round+0.043
Neg logit contributions
vered-0.053
ement-0.050
 symp-0.049
 spe-0.048
 aston-0.046
TokenSt
Token ID1273
Feature activation-3.43e-01

Pos logit contributions
oto+0.045
lic+0.044
ogo+0.042
EEP+0.040
ibi+0.040
Neg logit contributions
vered-0.055
ement-0.053
 spe-0.051
 symp-0.050
 aston-0.048
Tokenrip
Token ID5528
Feature activation-7.74e+00

Pos logit contributions
oto+0.028
lic+0.027
ogo+0.026
ibi+0.025
round+0.025
Neg logit contributions
vered-0.034
ement-0.033
 spe-0.031
 symp-0.031
 aston-0.030
Token ran
Token ID4966
Feature activation-7.51e-01

Pos logit contributions
lic+0.697
oto+0.697
ogo+0.652
ibi+0.619
round+0.616
Neg logit contributions
vered-0.699
ement-0.662
 spe-0.636
 symp-0.632
 aston-0.615
Token away
Token ID1497
Feature activation+3.04e-01

Pos logit contributions
oto+0.070
lic+0.069
ogo+0.066
ibi+0.063
EEP+0.063
Neg logit contributions
vered-0.066
ement-0.060
 symp-0.059
 spe-0.058
 aston-0.056
Token with
Token ID351
Feature activation+1.50e+00

Pos logit contributions
vered+0.031
ement+0.030
 symp+0.028
 spe+0.028
 aston+0.027
Neg logit contributions
oto-0.024
lic-0.023
ogo-0.021
ibi-0.021
round-0.021
Token the
Token ID262
Feature activation+1.64e-02

Pos logit contributions
vered+0.168
ement+0.165
 symp+0.154
 aston+0.148
 spe+0.147
Neg logit contributions
oto-0.102
ogo-0.093
una-0.090
lic-0.088
haw-0.083
Token stolen
Token ID9909
Feature activation+1.37e+00

Pos logit contributions
vered+0.002
 symp+0.002
ement+0.002
 aston+0.002
 spe+0.002
Neg logit contributions
oto-0.001
ibi-0.001
lic-0.001
ogo-0.001
oys-0.001
Token Mama
Token ID40084
Feature activation-9.12e-01

Pos logit contributions
oto+0.006
lic+0.006
ogo+0.006
round+0.006
EEP+0.006
Neg logit contributions
vered-0.009
ement-0.009
 spe-0.008
 symp-0.008
 aston-0.008
Token said
Token ID531
Feature activation-8.38e-01

Pos logit contributions
oto+0.064
lic+0.063
ogo+0.058
ibi+0.055
round+0.053
Neg logit contributions
vered-0.102
ement-0.097
 spe-0.094
 symp-0.093
 aston-0.091
Token,
Token ID11
Feature activation-3.58e-01

Pos logit contributions
oto+0.052
lic+0.050
EEP+0.043
 bye+0.042
ogo+0.040
Neg logit contributions
vered-0.097
ement-0.096
 aston-0.091
 symp-0.090
 spe-0.088
Token "
Token ID366
Feature activation-1.45e+00

Pos logit contributions
lic+0.015
oto+0.015
EEP+0.012
ogo+0.011
round+0.011
Neg logit contributions
vered-0.050
ement-0.048
 spe-0.046
 aston-0.045
 symp-0.045
TokenSt
Token ID1273
Feature activation-2.99e-01

Pos logit contributions
oto+0.106
lic+0.094
round+0.092
ibi+0.091
EEP+0.089
Neg logit contributions
ement-0.166
vered-0.154
 aston-0.147
 symp-0.146
 spe-0.142
Tokenrip
Token ID5528
Feature activation-7.71e+00

Pos logit contributions
oto+0.026
lic+0.025
ogo+0.024
ibi+0.023
EEP+0.023
Neg logit contributions
vered-0.029
ement-0.027
 symp-0.026
 spe-0.026
 aston-0.025
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
lic+0.614
oto+0.611
ogo+0.568
round+0.535
ibi+0.533
Neg logit contributions
vered-0.772
ement-0.740
 symp-0.710
 spe-0.709
 aston-0.691
Token you
Token ID345
Feature activation+7.23e-01

Pos logit contributions
oto+0.029
lic+0.028
ogo+0.026
ibi+0.026
EEP+0.026
Neg logit contributions
vered-0.037
 spe-0.035
ement-0.034
 symp-0.034
 aston-0.032
Token naughty
Token ID45128
Feature activation-9.63e-01

Pos logit contributions
vered+0.066
ement+0.060
 spe+0.060
 symp+0.058
 aston+0.056
Neg logit contributions
oto-0.066
lic-0.064
ogo-0.061
ibi-0.060
round-0.059
Token k
Token ID479
Feature activation+3.48e+00

Pos logit contributions
lic+0.075
oto+0.073
ogo+0.069
ibi+0.065
EEP+0.065
Neg logit contributions
vered-0.099
ement-0.091
 spe-0.090
 symp-0.088
 aston-0.083
Tokenitty
Token ID9760
Feature activation+1.42e+00

Pos logit contributions
 symp+0.317
ement+0.316
 aston+0.315
vered+0.308
 amaz+0.303
Neg logit contributions
lic-0.301
round-0.294
oto-0.290
ogo-0.288
haw-0.272
TokenMary
Token ID24119
Feature activation-5.41e-01

Pos logit contributions
oto+0.030
ogo+0.029
lic+0.028
ibi+0.027
EEP+0.026
Neg logit contributions
vered-0.032
ement-0.029
 aston-0.029
 symp-0.029
 spe-0.028
Token stopped
Token ID5025
Feature activation-2.24e+00

Pos logit contributions
lic+0.035
oto+0.034
round+0.031
ogo+0.031
EEP+0.030
Neg logit contributions
ement-0.060
vered-0.060
 symp-0.057
 spe-0.055
 aston-0.054
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
oto+0.123
 listen+0.095
lic+0.094
ogo+0.092
ruck+0.091
Neg logit contributions
vered-0.283
ement-0.275
 spe-0.244
-0.242
-0.232
Token "
Token ID366
Feature activation-1.31e+00

Pos logit contributions
vered+0.024
ement+0.023
 spe+0.022
 symp+0.022
 aston+0.021
Neg logit contributions
lic-0.017
oto-0.016
ogo-0.015
EEP-0.015
ibi-0.015
TokenSt
Token ID1273
Feature activation-3.04e-01

Pos logit contributions
lic+0.110
oto+0.110
round+0.100
ibi+0.099
EEP+0.097
Neg logit contributions
vered-0.127
ement-0.123
 symp-0.115
 spe-0.114
 aston-0.108
Tokenrip
Token ID5528
Feature activation-7.62e+00

Pos logit contributions
oto+0.026
lic+0.026
ibi+0.023
ogo+0.023
round+0.022
Neg logit contributions
vered-0.029
ement-0.027
 symp-0.027
el-0.026
 spe-0.025
Token.
Token ID13
Feature activation+2.06e-01

Pos logit contributions
lic+0.559
oto+0.557
ogo+0.514
EEP+0.486
round+0.482
Neg logit contributions
vered-0.813
ement-0.771
 symp-0.746
 spe-0.745
 aston-0.722
Token Hot
Token ID6964
Feature activation-1.09e+00

Pos logit contributions
vered+0.019
ement+0.018
 spe+0.018
 symp+0.018
 aston+0.017
Neg logit contributions
oto-0.018
lic-0.018
ogo-0.017
ibi-0.016
EEP-0.016
Token!"
Token ID2474
Feature activation-2.22e+00

Pos logit contributions
oto+0.079
lic+0.074
ogo+0.072
ibi+0.070
round+0.066
Neg logit contributions
vered-0.120
ement-0.114
 spe-0.112
 symp-0.109
 aston-0.104
Token
Token ID198
Feature activation-3.75e-01

Pos logit contributions
oto+0.181
lic+0.180
ibi+0.163
EEP+0.161
ogo+0.160
Neg logit contributions
vered-0.221
ement-0.205
 spe-0.200
 symp-0.198
 aston-0.187
Token
Token ID198
Feature activation-3.74e-01

Pos logit contributions
oto+0.035
lic+0.034
ogo+0.033
ibi+0.031
round+0.031
Neg logit contributions
vered-0.031
ement-0.029
 symp-0.028
 aston-0.027
 spe-0.026
Token It
Token ID632
Feature activation-1.26e+00

Pos logit contributions
vered+0.005
ement+0.005
 spe+0.005
 symp+0.005
 aston+0.005
Neg logit contributions
lic-0.004
oto-0.003
round-0.003
ibi-0.003
EEP-0.003
Token was
Token ID373
Feature activation+5.06e-01

Pos logit contributions
lic+0.083
oto+0.077
ogo+0.068
ibi+0.065
round+0.065
Neg logit contributions
ement-0.144
vered-0.144
 symp-0.136
 aston-0.131
ingly-0.128
Token an
Token ID281
Feature activation+8.23e-01

Pos logit contributions
vered+0.061
ement+0.057
 symp+0.055
 aston+0.051
ingly+0.051
Neg logit contributions
oto-0.034
lic-0.033
ogo-0.029
ibi-0.027
una-0.026
Token incredible
Token ID8082
Feature activation-1.75e+00

Pos logit contributions
vered+0.093
ement+0.085
 symp+0.084
 spe+0.083
 aston+0.081
Neg logit contributions
oto-0.059
lic-0.051
ogo-0.050
round-0.049
ibi-0.048
Token peac
Token ID41961
Feature activation+4.92e-01

Pos logit contributions
oto+0.115
round+0.101
lic+0.098
ibi+0.093
ruck+0.091
Neg logit contributions
vered-0.208
 symp-0.185
ement-0.183
 aston-0.174
ingly-0.172
Tokenock
Token ID735
Feature activation-7.57e+00

Pos logit contributions
ement+0.068
vered+0.067
 symp+0.063
 spe+0.062
el+0.062
Neg logit contributions
oto-0.019
lic-0.016
ogo-0.015
ibi-0.015
EEP-0.014
Token!
Token ID0
Feature activation-1.62e+00

Pos logit contributions
lic+0.541
oto+0.532
ogo+0.483
round+0.482
ibi+0.457
Neg logit contributions
vered-0.801
ement-0.764
 symp-0.751
 aston-0.747
 spe-0.714
Token Lily
Token ID20037
Feature activation-7.72e-01

Pos logit contributions
lic+0.116
oto+0.112
ibi+0.103
round+0.103
EEP+0.101
Neg logit contributions
vered-0.180
ement-0.168
 spe-0.164
 symp-0.161
 aston-0.155
Token was
Token ID373
Feature activation+4.92e-01

Pos logit contributions
lic+0.057
oto+0.056
ogo+0.051
ibi+0.050
round+0.049
Neg logit contributions
vered-0.082
ement-0.079
 spe-0.076
 symp-0.075
 aston-0.073
Token so
Token ID523
Feature activation+1.11e-01

Pos logit contributions
vered+0.061
ement+0.057
 symp+0.056
 spe+0.054
ingly+0.053
Neg logit contributions
lic-0.030
oto-0.029
ogo-0.025
ibi-0.024
round-0.022
Token happy
Token ID3772
Feature activation+3.52e-01

Pos logit contributions
vered+0.012
ement+0.012
 symp+0.011
 spe+0.011
ingly+0.010
Neg logit contributions
lic-0.008
oto-0.008
ibi-0.007
ogo-0.007
round-0.006
Token
Token ID220
Feature activation+1.04e+00

Pos logit contributions
oto+0.014
lic+0.014
ogo+0.013
ibi+0.013
EEP+0.012
Neg logit contributions
vered-0.018
ement-0.017
 spe-0.017
 symp-0.016
 aston-0.016
Token
Token ID198
Feature activation-5.77e-01

Pos logit contributions
vered+0.105
ement+0.099
 spe+0.096
 symp+0.096
 aston+0.092
Neg logit contributions
oto-0.083
lic-0.082
ogo-0.077
ibi-0.074
EEP-0.073
Token
Token ID198
Feature activation-5.68e-01

Pos logit contributions
oto+0.048
lic+0.047
ogo+0.044
ibi+0.043
EEP+0.042
Neg logit contributions
vered-0.056
ement-0.053
 symp-0.051
 spe-0.051
 aston-0.049
TokenThe
Token ID464
Feature activation-1.57e-01

Pos logit contributions
oto+0.046
lic+0.046
ogo+0.043
ibi+0.041
round+0.040
Neg logit contributions
vered-0.057
ement-0.054
 spe-0.052
 symp-0.052
 aston-0.050
Token peac
Token ID41961
Feature activation+6.01e-01

Pos logit contributions
oto+0.012
lic+0.011
ibi+0.010
ogo+0.010
EEP+0.010
Neg logit contributions
vered-0.017
ement-0.016
 symp-0.015
 spe-0.015
ingly-0.015
Tokenock
Token ID735
Feature activation-7.49e+00

Pos logit contributions
vered+0.086
ement+0.084
 symp+0.079
 spe+0.079
 aston+0.077
Neg logit contributions
oto-0.022
lic-0.019
EEP-0.018
ogo-0.017
ibi-0.017
Token was
Token ID373
Feature activation+5.39e-01

Pos logit contributions
lic+0.560
oto+0.554
ogo+0.505
ibi+0.489
round+0.479
Neg logit contributions
vered-0.781
ement-0.746
 symp-0.721
 spe-0.718
 aston-0.701
Token now
Token ID783
Feature activation+6.12e-01

Pos logit contributions
vered+0.063
ement+0.059
 symp+0.057
 spe+0.055
 aston+0.053
Neg logit contributions
lic-0.035
oto-0.035
ogo-0.030
ibi-0.030
round-0.027
Token even
Token ID772
Feature activation+1.49e+00

Pos logit contributions
vered+0.068
ement+0.066
 symp+0.064
 spe+0.060
 aston+0.058
Neg logit contributions
oto-0.042
lic-0.042
ogo-0.036
ibi-0.036
round-0.034
Token more
Token ID517
Feature activation-3.42e-01

Pos logit contributions
vered+0.152
ement+0.147
 symp+0.139
+0.137
 spe+0.132
Neg logit contributions
lic-0.121
oto-0.110
ibi-0.096
ogo-0.095
ock-0.094
Token gorgeous
Token ID18857
Feature activation+1.22e+00

Pos logit contributions
oto+0.024
lic+0.024
ibi+0.020
ogo+0.020
round+0.019
Neg logit contributions
vered-0.038
 symp-0.036
ement-0.035
 spe-0.033
 aston-0.032
Token feathers
Token ID27400
Feature activation+2.49e+00

Pos logit contributions
 symp+0.025
vered+0.025
ement+0.024
 aston+0.024
 spe+0.021
Neg logit contributions
oto-0.018
lic-0.018
ogo-0.016
ibi-0.016
round-0.015
Token.
Token ID13
Feature activation-1.26e-01

Pos logit contributions
vered+0.264
ement+0.247
 symp+0.244
 aston+0.241
 spe+0.237
Neg logit contributions
oto-0.191
lic-0.191
ogo-0.172
ibi-0.166
round-0.162
Token But
Token ID887
Feature activation-5.76e-01

Pos logit contributions
lic+0.009
oto+0.008
ibi+0.008
round+0.008
ogo+0.008
Neg logit contributions
vered-0.014
ement-0.013
 spe-0.013
 symp-0.012
ingly-0.012
Token the
Token ID262
Feature activation-8.33e-02

Pos logit contributions
oto+0.030
 bye+0.025
 not+0.024
 listen+0.024
ogo+0.024
Neg logit contributions
vered-0.070
ement-0.070
 aston-0.062
 symp-0.061
ingly-0.061
Token peac
Token ID41961
Feature activation+6.36e-01

Pos logit contributions
oto+0.006
lic+0.005
ogo+0.004
ibi+0.004
round+0.004
Neg logit contributions
vered-0.010
ement-0.009
 symp-0.009
 aston-0.009
ingly-0.009
Tokenock
Token ID735
Feature activation-7.47e+00

Pos logit contributions
vered+0.090
ement+0.090
 symp+0.083
 spe+0.083
el+0.082
Neg logit contributions
oto-0.022
lic-0.019
EEP-0.018
oys-0.018
ibi-0.017
Token was
Token ID373
Feature activation+5.99e-01

Pos logit contributions
lic+0.474
oto+0.469
ogo+0.413
ibi+0.398
round+0.396
Neg logit contributions
vered-0.824
ement-0.814
 symp-0.795
 aston-0.794
 spe-0.767
Token very
Token ID845
Feature activation-5.66e-01

Pos logit contributions
vered+0.071
ement+0.068
 symp+0.065
 aston+0.062
 spe+0.062
Neg logit contributions
oto-0.039
lic-0.038
ogo-0.034
ibi-0.031
una-0.031
Token impressive
Token ID8036
Feature activation-1.68e+00

Pos logit contributions
lic+0.039
oto+0.039
ogo+0.033
ibi+0.033
EEP+0.032
Neg logit contributions
vered-0.065
ement-0.061
 symp-0.059
 spe-0.055
 aston-0.054
Token!
Token ID0
Feature activation-1.52e+00

Pos logit contributions
oto+0.119
lic+0.110
ogo+0.104
round+0.100
ibi+0.096
Neg logit contributions
vered-0.183
 symp-0.173
 spe-0.170
 aston-0.170
ement-0.167
Token He
Token ID679
Feature activation-4.01e-01

Pos logit contributions
lic+0.102
oto+0.095
ibi+0.089
round+0.088
EEP+0.088
Neg logit contributions
vered-0.177
ement-0.163
 spe-0.162
 symp-0.154
 aston-0.152
Token dream
Token ID4320
Feature activation+1.22e+00

Pos logit contributions
oto+0.016
lic+0.016
ogo+0.015
ibi+0.014
EEP+0.014
Neg logit contributions
vered-0.017
ement-0.016
 spe-0.015
 symp-0.015
 aston-0.014
Token called
Token ID1444
Feature activation-1.48e+00

Pos logit contributions
vered+0.095
ement+0.091
 spe+0.088
 symp+0.086
ingly+0.080
Neg logit contributions
oto-0.119
lic-0.117
ogo-0.114
ibi-0.109
round-0.106
Token a
Token ID257
Feature activation-8.32e-02

Pos logit contributions
oto+0.120
lic+0.119
ogo+0.111
ibi+0.107
round+0.105
Neg logit contributions
vered-0.147
ement-0.138
 spe-0.135
 symp-0.134
 aston-0.129
Token "
Token ID366
Feature activation-1.40e+00

Pos logit contributions
oto+0.007
lic+0.006
ogo+0.006
ibi+0.006
round+0.006
Neg logit contributions
 spe-0.008
vered-0.008
ement-0.008
 symp-0.008
 aston-0.007
Tokennight
Token ID3847
Feature activation-2.76e+00

Pos logit contributions
lic+0.118
oto+0.115
ibi+0.106
round+0.106
ogo+0.103
Neg logit contributions
vered-0.135
ement-0.134
 symp-0.126
 aston-0.125
 spe-0.123
Tokenmare
Token ID11449
Feature activation-7.46e+00

Pos logit contributions
oto+0.158
lic+0.157
ogo+0.148
ibi+0.141
EEP+0.130
Neg logit contributions
vered-0.328
 spe-0.317
ement-0.312
 symp-0.311
 aston-0.297
Token".
Token ID1911
Feature activation-2.21e+00

Pos logit contributions
oto+0.534
ogo+0.501
lic+0.481
ibi+0.472
round+0.429
Neg logit contributions
vered-0.803
ement-0.754
 spe-0.751
 symp-0.749
 aston-0.715
Token She
Token ID1375
Feature activation+3.25e-01

Pos logit contributions
oto+0.248
lic+0.241
ogo+0.239
round+0.221
ibi+0.218
Neg logit contributions
vered-0.152
ement-0.142
 symp-0.140
 spe-0.129
 aston-0.127
Token woke
Token ID19092
Feature activation-3.84e+00

Pos logit contributions
vered+0.034
ement+0.032
 symp+0.032
 spe+0.032
 aston+0.030
Neg logit contributions
oto-0.024
lic-0.024
ogo-0.022
ibi-0.022
round-0.021
Token up
Token ID510
Feature activation+1.01e-01

Pos logit contributions
lic+0.350
oto+0.348
ogo+0.325
ibi+0.316
EEP+0.312
Neg logit contributions
vered-0.343
ement-0.326
 symp-0.314
 spe-0.308
 aston-0.295
Token crying
Token ID13774
Feature activation+2.30e+00

Pos logit contributions
vered+0.011
ement+0.011
 spe+0.010
 symp+0.010
 aston+0.010
Neg logit contributions
oto-0.007
lic-0.007
ogo-0.006
EEP-0.006
round-0.006
Token It
Token ID632
Feature activation-9.65e-01

Pos logit contributions
vered+0.073
ement+0.069
 spe+0.068
 symp+0.065
 aston+0.062
Neg logit contributions
lic-0.040
oto-0.039
round-0.036
EEP-0.036
ibi-0.036
Token was
Token ID373
Feature activation+4.45e-01

Pos logit contributions
lic+0.055
oto+0.050
ogo+0.046
EEP+0.045
round+0.045
Neg logit contributions
ement-0.115
vered-0.112
 symp-0.107
joy-0.105
 aston-0.104
Token poison
Token ID8764
Feature activation+6.16e-01

Pos logit contributions
vered+0.053
ement+0.050
 symp+0.048
 spe+0.045
 aston+0.045
Neg logit contributions
oto-0.030
lic-0.029
ogo-0.026
ibi-0.025
una-0.023
Token he
Token ID339
Feature activation-6.23e-01

Pos logit contributions
vered+0.064
ement+0.061
 symp+0.060
 aston+0.059
 spe+0.058
Neg logit contributions
lic-0.046
oto-0.046
ogo-0.042
EEP-0.039
una-0.039
Tokenml
Token ID4029
Feature activation-2.72e+00

Pos logit contributions
lic+0.041
oto+0.041
ogo+0.038
EEP+0.035
ibi+0.035
Neg logit contributions
vered-0.070
ement-0.069
 symp-0.066
 spe-0.064
 aston-0.064
Tokenock
Token ID735
Feature activation-7.44e+00

Pos logit contributions
oto+0.141
lic+0.139
ogo+0.124
ibi+0.118
EEP+0.115
Neg logit contributions
vered-0.347
ement-0.334
 symp-0.323
 spe-0.321
 aston-0.311
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
oto+0.609
lic+0.601
ogo+0.567
ibi+0.536
EEP+0.534
Neg logit contributions
vered-0.729
ement-0.698
 symp-0.672
 spe-0.668
 aston-0.658
Token It
Token ID632
Feature activation-1.14e+00

Pos logit contributions
vered+0.045
ement+0.042
 spe+0.041
 symp+0.041
 aston+0.039
Neg logit contributions
lic-0.023
oto-0.022
ogo-0.020
ibi-0.020
EEP-0.019
Token was
Token ID373
Feature activation+7.44e-01

Pos logit contributions
lic+0.071
oto+0.067
ogo+0.061
una+0.057
EEP+0.055
Neg logit contributions
ement-0.135
vered-0.133
 symp-0.125
joy-0.122
 aston-0.121
Token very
Token ID845
Feature activation-5.52e-01

Pos logit contributions
vered+0.088
ement+0.086
 symp+0.078
 aston+0.076
 spe+0.073
Neg logit contributions
lic-0.048
oto-0.048
ogo-0.043
una-0.040
ibi-0.037
Token dangerous
Token ID4923
Feature activation+2.79e-01

Pos logit contributions
lic+0.039
oto+0.038
ogo+0.034
EEP+0.032
una+0.032
Neg logit contributions
vered-0.063
ement-0.060
 symp-0.055
 aston-0.054
 spe-0.053
Token
Token ID198
Feature activation-5.85e-01

Pos logit contributions
lic+0.014
oto+0.014
ogo+0.013
ibi+0.013
EEP+0.013
Neg logit contributions
vered-0.017
ement-0.016
 spe-0.016
 symp-0.015
 aston-0.015
Token
Token ID198
Feature activation-5.58e-01

Pos logit contributions
oto+0.050
lic+0.049
ogo+0.047
ibi+0.045
EEP+0.045
Neg logit contributions
vered-0.055
ement-0.052
 symp-0.051
 spe-0.050
 aston-0.048
TokenWhen
Token ID2215
Feature activation-1.66e+00

Pos logit contributions
oto+0.049
lic+0.047
ogo+0.046
ibi+0.044
EEP+0.044
Neg logit contributions
vered-0.052
ement-0.048
 symp-0.048
 spe-0.047
 aston-0.045
Token the
Token ID262
Feature activation-2.11e-02

Pos logit contributions
oto+0.122
lic+0.118
EEP+0.108
ibi+0.108
ogo+0.108
Neg logit contributions
vered-0.178
ement-0.168
 symp-0.157
 spe-0.154
 aston-0.154
Token peac
Token ID41961
Feature activation+6.82e-01

Pos logit contributions
oto+0.001
lic+0.001
ogo+0.001
ibi+0.001
EEP+0.001
Neg logit contributions
vered-0.003
ement-0.002
 symp-0.002
 spe-0.002
 aston-0.002
Tokenock
Token ID735
Feature activation-7.42e+00

Pos logit contributions
vered+0.097
ement+0.095
 symp+0.090
 spe+0.089
 aston+0.087
Neg logit contributions
oto-0.026
lic-0.022
ogo-0.020
EEP-0.020
ibi-0.020
Token heard
Token ID2982
Feature activation+4.23e-02

Pos logit contributions
lic+0.536
oto+0.494
ogo+0.464
ibi+0.439
round+0.433
Neg logit contributions
vered-0.789
ement-0.773
 symp-0.749
 aston-0.725
 spe-0.715
Token his
Token ID465
Feature activation-1.62e+00

Pos logit contributions
vered+0.004
ement+0.004
 spe+0.004
 symp+0.004
 aston+0.004
Neg logit contributions
oto-0.004
lic-0.003
ogo-0.003
ibi-0.003
round-0.003
Token partner
Token ID5212
Feature activation-2.27e-01

Pos logit contributions
oto+0.119
lic+0.109
ogo+0.102
ibi+0.101
una+0.093
Neg logit contributions
vered-0.176
ement-0.160
 spe-0.159
 symp-0.155
 aston-0.152
Tokenâ
Token ID22940
Feature activation-2.34e-01

Pos logit contributions
lic+0.017
oto+0.017
ogo+0.015
ibi+0.015
EEP+0.014
Neg logit contributions
vered-0.024
ement-0.022
 spe-0.022
 symp-0.022
 aston-0.021
Token
Token ID26391
Feature activation-2.37e+00

Pos logit contributions
oto+0.016
lic+0.016
ogo+0.015
ibi+0.014
EEP+0.014
Neg logit contributions
vered-0.026
ement-0.025
 symp-0.024
 spe-0.024
 aston-0.023
Token to
Token ID284
Feature activation-9.94e-02

Pos logit contributions
vered+0.074
ement+0.070
 symp+0.068
 spe+0.067
 aston+0.066
Neg logit contributions
oto-0.058
lic-0.057
ogo-0.054
ibi-0.052
round-0.051
Token decor
Token ID11705
Feature activation+3.11e+00

Pos logit contributions
oto+0.006
lic+0.005
ogo+0.005
round+0.005
una+0.005
Neg logit contributions
vered-0.012
ement-0.012
 symp-0.011
 aston-0.011
 spe-0.011
Tokenate
Token ID378
Feature activation-2.31e+00

Pos logit contributions
vered+0.384
 spe+0.368
 symp+0.362
 aston+0.352
ement+0.343
Neg logit contributions
oto-0.175
lic-0.170
ogo-0.157
ibi-0.130
una-0.128
Token the
Token ID262
Feature activation-1.32e-02

Pos logit contributions
oto+0.148
lic+0.129
ogo+0.126
una+0.120
ibi+0.115
Neg logit contributions
vered-0.269
 symp-0.249
ement-0.247
 aston-0.246
 spe-0.239
Token peac
Token ID41961
Feature activation+6.98e-01

Pos logit contributions
oto+0.001
lic+0.001
EEP+0.001
round+0.001
ibi+0.001
Neg logit contributions
vered-0.002
ement-0.001
 symp-0.001
ingly-0.001
 aston-0.001
Tokenock
Token ID735
Feature activation-7.39e+00

Pos logit contributions
vered+0.105
ement+0.103
 spe+0.100
 symp+0.099
 aston+0.096
Neg logit contributions
oto-0.020
lic-0.017
ogo-0.015
ibi-0.014
EEP-0.013
Tokenâ
Token ID22940
Feature activation-2.08e-01

Pos logit contributions
lic+0.556
oto+0.543
round+0.496
ogo+0.490
ibi+0.473
Neg logit contributions
vered-0.755
ement-0.706
 aston-0.685
 spe-0.683
 symp-0.675
Token
Token ID26391
Feature activation-2.35e+00

Pos logit contributions
oto+0.016
lic+0.016
ogo+0.015
ibi+0.014
EEP+0.014
Neg logit contributions
vered-0.021
ement-0.020
 spe-0.020
 symp-0.019
 aston-0.019
Token
Token ID8151
Feature activation-1.78e-01

Pos logit contributions
oto+0.192
lic+0.191
ogo+0.178
ibi+0.172
EEP+0.170
Neg logit contributions
vered-0.232
ement-0.219
 spe-0.211
 symp-0.210
 aston-0.202
Tokens
Token ID82
Feature activation-1.85e+00

Pos logit contributions
oto+0.014
lic+0.014
ogo+0.013
ibi+0.013
EEP+0.013
Neg logit contributions
vered-0.018
 spe-0.017
ement-0.016
 aston-0.016
 symp-0.016
Token tail
Token ID7894
Feature activation+9.34e-02

Pos logit contributions
oto+0.120
lic+0.111
ogo+0.108
EEP+0.102
ibi+0.102
Neg logit contributions
vered-0.191
ement-0.187
 symp-0.184
 aston-0.170
 spe-0.170
Token mess
Token ID2085
Feature activation+1.32e+00

Pos logit contributions
vered+0.027
ement+0.025
 spe+0.024
 symp+0.024
 aston+0.023
Neg logit contributions
oto-0.019
lic-0.019
ogo-0.018
ibi-0.017
EEP-0.017
Token.
Token ID13
Feature activation+4.84e-01

Pos logit contributions
vered+0.138
ement+0.127
 spe+0.125
 aston+0.125
 symp+0.125
Neg logit contributions
lic-0.102
oto-0.100
ogo-0.091
oys-0.090
round-0.088
Token
Token ID198
Feature activation-8.68e-02

Pos logit contributions
vered+0.049
ement+0.045
 spe+0.044
 symp+0.044
 aston+0.042
Neg logit contributions
oto-0.039
lic-0.039
ogo-0.036
ibi-0.036
EEP-0.035
Token
Token ID198
Feature activation-6.19e-02

Pos logit contributions
oto+0.007
lic+0.007
ogo+0.006
ibi+0.006
EEP+0.006
Neg logit contributions
vered-0.009
ement-0.008
 spe-0.008
 symp-0.008
 aston-0.008
TokenSt
Token ID1273
Feature activation-7.88e-02

Pos logit contributions
oto+0.005
lic+0.005
ogo+0.004
ibi+0.004
EEP+0.004
Neg logit contributions
vered-0.007
ement-0.006
 symp-0.006
 spe-0.006
 aston-0.006
Tokenrip
Token ID5528
Feature activation-7.37e+00

Pos logit contributions
oto+0.008
lic+0.008
ogo+0.007
ibi+0.007
EEP+0.007
Neg logit contributions
vered-0.006
ement-0.006
 spe-0.006
 symp-0.006
 aston-0.006
Tokenes
Token ID274
Feature activation-1.58e-01

Pos logit contributions
lic+0.603
oto+0.576
ogo+0.540
ibi+0.507
round+0.505
Neg logit contributions
vered-0.736
ement-0.706
 symp-0.687
 spe-0.679
 aston-0.676
Token didn
Token ID1422
Feature activation+4.74e-02

Pos logit contributions
lic+0.011
oto+0.011
ogo+0.009
ibi+0.009
round+0.009
Neg logit contributions
vered-0.017
ement-0.017
 spe-0.017
 aston-0.016
 symp-0.016
Token't
Token ID470
Feature activation-1.66e+00

Pos logit contributions
vered+0.005
ement+0.005
 symp+0.005
 spe+0.005
 aston+0.005
Neg logit contributions
oto-0.003
lic-0.003
ogo-0.003
ibi-0.003
round-0.003
Token listen
Token ID6004
Feature activation-5.31e+00

Pos logit contributions
ogo+0.157
lic+0.154
oto+0.149
EEP+0.139
ibi+0.136
Neg logit contributions
vered-0.124
 longer-0.122
 spe-0.117
 symp-0.114
 distress-0.112
Token.
Token ID13
Feature activation+4.47e-01

Pos logit contributions
oto+0.279
lic+0.276
EEP+0.224
oys+0.223
ogo+0.222
Neg logit contributions
vered-0.680
 spe-0.649
ement-0.646
 aston-0.628
 symp-0.612
Token look
Token ID804
Feature activation-2.07e+00

Pos logit contributions
vered+0.002
ement+0.002
 symp+0.001
ingly+0.001
 aston+0.001
Neg logit contributions
oto-0.001
lic-0.001
ogo-0.001
ibi-0.001
round-0.001
Token at
Token ID379
Feature activation+9.71e-01

Pos logit contributions
lic+0.133
oto+0.132
ogo+0.108
round+0.105
ibi+0.101
Neg logit contributions
vered-0.247
ement-0.223
 symp-0.220
 spe-0.219
-0.212
Token in
Token ID287
Feature activation+8.02e-01

Pos logit contributions
vered+0.112
 symp+0.111
 aston+0.108
ement+0.104
 spe+0.104
Neg logit contributions
oto-0.054
ogo-0.046
lic-0.042
una-0.039
ock-0.038
Token the
Token ID262
Feature activation+2.39e-01

Pos logit contributions
vered+0.105
 symp+0.099
ement+0.099
used+0.092
+0.092
Neg logit contributions
 cool-0.029
 water-0.025
 nice-0.022
oto-0.020
 my-0.019
Token padd
Token ID14098
Feature activation+2.37e-02

Pos logit contributions
vered+0.030
ement+0.027
 symp+0.026
ingly+0.025
 aston+0.025
Neg logit contributions
lic-0.012
 cool-0.012
oto-0.011
 sound-0.011
 water-0.011
Tokenock
Token ID735
Feature activation-7.36e+00

Pos logit contributions
vered+0.002
 spe+0.002
 aston+0.002
 symp+0.002
ement+0.002
Neg logit contributions
lic-0.002
oto-0.002
ogo-0.002
round-0.002
ock-0.002
Token.
Token ID13
Feature activation-3.64e-02

Pos logit contributions
lic+0.537
oto+0.505
ogo+0.470
round+0.455
ock+0.437
Neg logit contributions
vered-0.771
ement-0.720
 symp-0.705
 aston-0.705
 spe-0.704
Token The
Token ID383
Feature activation+2.69e-01

Pos logit contributions
oto+0.003
lic+0.003
ogo+0.003
ibi+0.003
round+0.003
Neg logit contributions
vered-0.004
ement-0.003
 spe-0.003
 symp-0.003
 aston-0.003
Token farmer
Token ID18739
Feature activation+2.00e-01

Pos logit contributions
vered+0.035
ement+0.033
 symp+0.031
 spe+0.031
 aston+0.030
Neg logit contributions
oto-0.015
lic-0.014
ogo-0.013
ibi-0.011
una-0.011
Token smiled
Token ID13541
Feature activation+7.06e-01

Pos logit contributions
vered+0.022
ement+0.022
 symp+0.021
 spe+0.021
 aston+0.020
Neg logit contributions
oto-0.013
lic-0.012
ogo-0.011
round-0.011
ibi-0.010
Token and
Token ID290
Feature activation-3.66e-01

Pos logit contributions
vered+0.086
 spe+0.079
ement+0.078
 symp+0.074
+0.073
Neg logit contributions
oto-0.042
lic-0.040
ogo-0.032
 bye-0.031
oys-0.031
Token tro
Token ID4161
Feature activation+3.44e-01

Pos logit contributions
lic+0.015
oto+0.014
ogo+0.013
round+0.013
EEP+0.013
Neg logit contributions
vered-0.026
ement-0.026
 symp-0.025
 aston-0.025
 spe-0.024
Tokentted
Token ID28734
Feature activation+1.45e+00

Pos logit contributions
vered+0.038
ement+0.036
 spe+0.034
 symp+0.034
 aston+0.033
Neg logit contributions
oto-0.025
lic-0.024
ogo-0.023
ibi-0.021
EEP-0.021
Token around
Token ID1088
Feature activation+4.13e-01

Pos logit contributions
vered+0.162
ement+0.155
 spe+0.153
 symp+0.150
 aston+0.146
Neg logit contributions
oto-0.098
lic-0.096
round-0.088
ogo-0.088
ibi-0.087
Token the
Token ID262
Feature activation+5.76e-02

Pos logit contributions
ement+0.052
vered+0.051
 aston+0.047
 symp+0.047
 spe+0.047
Neg logit contributions
lic-0.016
oto-0.015
round-0.013
 jumped-0.012
 listen-0.012
Token padd
Token ID14098
Feature activation+5.70e-02

Pos logit contributions
vered+0.006
ement+0.006
 spe+0.005
 symp+0.005
 aston+0.005
Neg logit contributions
oto-0.004
lic-0.004
round-0.004
ogo-0.004
ibi-0.004
Tokenock
Token ID735
Feature activation-7.35e+00

Pos logit contributions
vered+0.005
 spe+0.005
ement+0.005
 symp+0.005
 aston+0.005
Neg logit contributions
lic-0.005
oto-0.005
ogo-0.005
round-0.004
ibi-0.004
Token.
Token ID13
Feature activation+3.99e-02

Pos logit contributions
lic+0.536
oto+0.513
round+0.505
ogo+0.492
ibi+0.463
Neg logit contributions
vered-0.731
ement-0.703
 aston-0.692
 spe-0.692
 symp-0.676
Token
Token ID198
Feature activation-2.88e-01

Pos logit contributions
vered+0.004
ement+0.004
 spe+0.004
 symp+0.004
 aston+0.004
Neg logit contributions
lic-0.003
oto-0.003
ibi-0.003
ogo-0.003
EEP-0.003
Token
Token ID198
Feature activation-2.47e-01

Pos logit contributions
lic+0.022
oto+0.022
ogo+0.020
ibi+0.019
EEP+0.019
Neg logit contributions
vered-0.030
ement-0.029
 spe-0.028
 symp-0.028
 aston-0.027
TokenWhen
Token ID2215
Feature activation-1.54e+00

Pos logit contributions
oto+0.019
lic+0.019
ogo+0.017
ibi+0.017
EEP+0.016
Neg logit contributions
vered-0.026
ement-0.024
 spe-0.024
 symp-0.024
 aston-0.023
Token it
Token ID340
Feature activation-3.77e-01

Pos logit contributions
oto+0.095
lic+0.093
EEP+0.084
ogo+0.084
ibi+0.083
Neg logit contributions
vered-0.179
ement-0.171
 symp-0.162
 spe-0.156
 aston-0.156
Token
Token ID95
Feature activation-7.13e-01

Pos logit contributions
oto+0.145
lic+0.138
ibi+0.134
ogo+0.130
oys+0.124
Neg logit contributions
vered-0.152
ement-0.148
 symp-0.146
 spe-0.143
 aston-0.135
Token
Token ID26391
Feature activation-2.24e+00

Pos logit contributions
oto+0.068
lic+0.065
ogo+0.061
ibi+0.060
oys+0.055
Neg logit contributions
vered-0.063
ement-0.059
 symp-0.058
 aston-0.056
 spe-0.054
Token
Token ID129
Feature activation+3.77e-01

Pos logit contributions
oto+0.227
lic+0.206
ogo+0.199
round+0.199
ibi+0.195
Neg logit contributions
vered-0.181
 spe-0.168
ement-0.167
 symp-0.165
 aston-0.155
Token
Token ID241
Feature activation-1.53e+00

Pos logit contributions
vered+0.031
 symp+0.031
ement+0.029
 afar+0.028
 aston+0.027
Neg logit contributions
lic-0.036
oto-0.034
oys-0.032
ogo-0.032
EEP-0.031
TokenKn
Token ID25095
Feature activation-2.25e+00

Pos logit contributions
oto+0.127
lic+0.125
ogo+0.116
ibi+0.113
EEP+0.111
Neg logit contributions
vered-0.150
ement-0.142
 symp-0.137
 spe-0.137
 aston-0.131
Tokenock
Token ID735
Feature activation-7.34e+00

Pos logit contributions
oto+0.122
 Indians+0.111
ibi+0.103
ogo+0.103
 dads+0.101
Neg logit contributions
ement-0.286
vered-0.268
ingly-0.256
 symp-0.250
 aston-0.245
Token,
Token ID11
Feature activation-3.87e-01

Pos logit contributions
oto+0.575
lic+0.555
ogo+0.524
ibi+0.508
EEP+0.497
Neg logit contributions
vered-0.761
ement-0.715
 spe-0.694
 symp-0.686
 aston-0.657
Token knock
Token ID10643
Feature activation+1.06e+00

Pos logit contributions
lic+0.027
oto+0.027
ogo+0.025
ibi+0.024
EEP+0.023
Neg logit contributions
vered-0.043
ement-0.040
 symp-0.039
 spe-0.038
 aston-0.037
Token,
Token ID11
Feature activation-3.97e-01

Pos logit contributions
vered+0.110
ement+0.103
 spe+0.099
 symp+0.099
 aston+0.095
Neg logit contributions
oto-0.083
lic-0.081
ogo-0.076
ibi-0.073
EEP-0.071
Token I
Token ID314
Feature activation-9.05e-01

Pos logit contributions
oto+0.030
lic+0.030
ogo+0.028
ibi+0.027
EEP+0.026
Neg logit contributions
vered-0.042
ement-0.039
 spe-0.038
 symp-0.038
 aston-0.037
Token'm
Token ID1101
Feature activation-6.69e-01

Pos logit contributions
oto+0.075
lic+0.070
ogo+0.068
ibi+0.066
round+0.065
Neg logit contributions
vered-0.091
 spe-0.083
ement-0.082
 symp-0.080
 aston-0.079
Token the
Token ID262
Feature activation-4.14e-01

Pos logit contributions
oto+0.044
ibi+0.037
 not+0.037
ogo+0.036
 Sam+0.034
Neg logit contributions
vered-0.144
ement-0.138
 spe-0.136
 symp-0.133
eled-0.130
Token merry
Token ID44570
Feature activation-1.27e-01

Pos logit contributions
oto+0.027
lic+0.026
ogo+0.024
ibi+0.023
round+0.023
Neg logit contributions
vered-0.049
ement-0.047
 symp-0.045
 spe-0.045
 aston-0.043
Token-
Token ID12
Feature activation-1.72e+00

Pos logit contributions
lic+0.011
oto+0.011
ogo+0.010
EEP+0.010
ibi+0.009
Neg logit contributions
vered-0.012
ement-0.011
 spe-0.011
 symp-0.010
 aston-0.010
Tokengo
Token ID2188
Feature activation-3.79e+00

Pos logit contributions
oto+0.106
ogo+0.100
lic+0.098
EEP+0.086
ibi+0.086
Neg logit contributions
vered-0.199
ement-0.191
 spe-0.180
 symp-0.178
ingly-0.174
Token-
Token ID12
Feature activation-1.80e+00

Pos logit contributions
oto+0.317
lic+0.311
ogo+0.291
ibi+0.278
EEP+0.275
Neg logit contributions
vered-0.375
ement-0.352
 symp-0.336
 spe-0.335
 aston-0.319
Tokenround
Token ID744
Feature activation-7.31e+00

Pos logit contributions
oto+0.052
lic+0.046
ogo+0.044
EEP+0.031
ibi+0.030
Neg logit contributions
vered-0.267
ement-0.256
 spe-0.245
 symp-0.245
ingly-0.239
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
oto+0.638
lic+0.632
ogo+0.593
ibi+0.573
round+0.568
Neg logit contributions
vered-0.683
ement-0.639
 spe-0.620
 symp-0.616
 aston-0.593
Token They
Token ID1119
Feature activation-7.31e-01

Pos logit contributions
oto+0.014
lic+0.014
ogo+0.013
ibi+0.012
EEP+0.012
Neg logit contributions
vered-0.018
ement-0.017
 spe-0.017
 symp-0.017
 aston-0.016
Token laughed
Token ID13818
Feature activation-1.43e+00

Pos logit contributions
oto+0.049
lic+0.049
ibi+0.044
ogo+0.044
round+0.044
Neg logit contributions
vered-0.082
ement-0.081
 symp-0.076
 spe-0.074
 aston-0.073
Token and
Token ID290
Feature activation-4.18e-01

Pos logit contributions
oto+0.113
lic+0.112
ogo+0.105
ibi+0.100
EEP+0.099
Neg logit contributions
vered-0.145
ement-0.137
 spe-0.133
 symp-0.132
 aston-0.127
Token smiled
Token ID13541
Feature activation+6.17e-01

Pos logit contributions
oto+0.033
lic+0.032
ogo+0.031
EEP+0.028
ibi+0.028
Neg logit contributions
vered-0.043
 spe-0.040
ement-0.039
 symp-0.039
 aston-0.037
Token the
Token ID262
Feature activation+5.79e-02

Pos logit contributions
vered+0.099
ement+0.091
 symp+0.087
+0.082
ingly+0.082
Neg logit contributions
oto-0.029
ruck-0.023
 cool-0.022
 water-0.022
lic-0.021
Token sky
Token ID6766
Feature activation+1.28e-01

Pos logit contributions
vered+0.007
ement+0.006
 symp+0.006
 spe+0.006
 aston+0.006
Neg logit contributions
lic-0.004
oto-0.004
round-0.004
EEP-0.004
ibi-0.003
Token,
Token ID11
Feature activation-2.86e-01

Pos logit contributions
vered+0.015
ement+0.013
 symp+0.013
 aston+0.013
 spe+0.013
Neg logit contributions
lic-0.009
oto-0.008
ogo-0.007
round-0.007
ibi-0.007
Token the
Token ID262
Feature activation+7.11e-02

Pos logit contributions
oto+0.021
lic+0.021
ogo+0.019
ibi+0.018
round+0.018
Neg logit contributions
vered-0.031
ement-0.029
 symp-0.028
 spe-0.028
 aston-0.027
Token peac
Token ID41961
Feature activation+7.56e-01

Pos logit contributions
vered+0.009
ement+0.008
 symp+0.008
ingly+0.007
 aston+0.007
Neg logit contributions
oto-0.004
lic-0.004
ibi-0.004
EEP-0.003
ogo-0.003
Tokenock
Token ID735
Feature activation-7.30e+00

Pos logit contributions
vered+0.109
ement+0.109
 spe+0.102
 symp+0.102
el+0.099
Neg logit contributions
oto-0.025
lic-0.020
EEP-0.019
ibi-0.019
ogo-0.019
Token would
Token ID561
Feature activation+6.43e-01

Pos logit contributions
lic+0.553
oto+0.524
ogo+0.477
ibi+0.460
round+0.459
Neg logit contributions
vered-0.752
ement-0.731
 symp-0.701
 aston-0.695
 spe-0.678
Token ring
Token ID5858
Feature activation-8.12e-01

Pos logit contributions
vered+0.079
ement+0.076
 symp+0.070
 spe+0.069
 aston+0.068
Neg logit contributions
oto-0.039
lic-0.036
ogo-0.034
round-0.032
ibi-0.031
Token out
Token ID503
Feature activation-2.32e+00

Pos logit contributions
oto+0.058
lic+0.057
ogo+0.051
round+0.048
ibi+0.048
Neg logit contributions
vered-0.088
ement-0.083
 symp-0.080
 aston-0.080
 spe-0.079
Token his
Token ID465
Feature activation-1.58e+00

Pos logit contributions
lic+0.121
oto+0.098
ock+0.087
ogo+0.082
ruck+0.076
Neg logit contributions
vered-0.286
-0.273
ement-0.267
 symp-0.264
 spe-0.262
Token beautiful
Token ID4950
Feature activation+3.40e-01

Pos logit contributions
oto+0.108
EEP+0.103
ogo+0.097
ibi+0.096
lic+0.093
Neg logit contributions
vered-0.165
ement-0.155
 symp-0.149
-0.134
 spe-0.134
Token stuck
Token ID7819
Feature activation+4.28e+00

Pos logit contributions
vered+0.068
ement+0.065
 symp+0.062
 spe+0.061
 aston+0.060
Neg logit contributions
oto-0.049
lic-0.049
ogo-0.046
ibi-0.043
round-0.042
Token in
Token ID287
Feature activation+7.28e-01

Pos logit contributions
vered+0.442
ement+0.427
 spe+0.414
 symp+0.411
 aston+0.403
Neg logit contributions
oto-0.319
lic-0.309
ogo-0.302
ibi-0.273
EEP-0.271
Token a
Token ID257
Feature activation-9.57e-02

Pos logit contributions
ement+0.091
vered+0.089
 symp+0.084
 spe+0.082
+0.080
Neg logit contributions
oto-0.035
ogo-0.031
lic-0.030
una-0.028
 gear-0.027
Token small
Token ID1402
Feature activation+1.17e+00

Pos logit contributions
lic+0.005
oto+0.005
ogo+0.005
EEP+0.004
round+0.004
Neg logit contributions
vered-0.012
ement-0.012
 symp-0.011
 aston-0.010
ingly-0.010
Token padd
Token ID14098
Feature activation+1.62e-02

Pos logit contributions
vered+0.126
ement+0.124
 symp+0.114
 spe+0.108
 aston+0.107
Neg logit contributions
lic-0.079
oto-0.078
round-0.074
ogo-0.073
EEP-0.069
Tokenock
Token ID735
Feature activation-7.29e+00

Pos logit contributions
vered+0.001
ement+0.001
 symp+0.001
 spe+0.001
 aston+0.001
Neg logit contributions
oto-0.002
lic-0.002
ogo-0.002
ibi-0.001
EEP-0.001
Token,
Token ID11
Feature activation-4.11e-01

Pos logit contributions
oto+0.516
lic+0.513
ogo+0.485
round+0.479
EEP+0.451
Neg logit contributions
vered-0.759
ement-0.738
 aston-0.704
 spe-0.699
 symp-0.692
Token far
Token ID1290
Feature activation+1.36e+00

Pos logit contributions
oto+0.021
lic+0.020
round+0.018
ogo+0.018
ibi+0.017
Neg logit contributions
vered-0.051
ement-0.050
 aston-0.046
 symp-0.046
 spe-0.045
Token away
Token ID1497
Feature activation+2.79e-01

Pos logit contributions
vered+0.119
ement+0.112
 spe+0.107
 symp+0.107
 aston+0.102
Neg logit contributions
oto-0.127
lic-0.126
ogo-0.119
ibi-0.114
round-0.114
Token from
Token ID422
Feature activation-1.22e+00

Pos logit contributions
vered+0.028
ement+0.027
 symp+0.026
 spe+0.026
 aston+0.025
Neg logit contributions
oto-0.022
lic-0.021
ogo-0.020
round-0.019
EEP-0.019
Token the
Token ID262
Feature activation-5.73e-02

Pos logit contributions
oto+0.065
ogo+0.058
round+0.053
ruck+0.051
ock+0.051
Neg logit contributions
ement-0.152
vered-0.151
 symp-0.145
 aston-0.137
ingly-0.136
Token a
Token ID257
Feature activation+1.25e-01

Pos logit contributions
 spe+0.061
vered+0.061
ement+0.058
 symp+0.056
 aston+0.054
Neg logit contributions
oto-0.026
lic-0.023
una-0.022
ibi-0.021
ogo-0.019
Token delicate
Token ID19217
Feature activation-4.65e-01

Pos logit contributions
vered+0.014
ement+0.013
 symp+0.012
 spe+0.012
 aston+0.011
Neg logit contributions
lic-0.010
oto-0.009
ogo-0.008
ibi-0.008
round-0.008
Token cup
Token ID6508
Feature activation+2.31e-01

Pos logit contributions
oto+0.033
lic+0.033
ibi+0.028
round+0.028
ogo+0.028
Neg logit contributions
vered-0.052
ement-0.048
 symp-0.047
 aston-0.045
 spe-0.044
Token and
Token ID290
Feature activation-2.80e-01

Pos logit contributions
vered+0.026
ement+0.024
 spe+0.023
 symp+0.023
 aston+0.023
Neg logit contributions
oto-0.017
lic-0.017
ogo-0.015
round-0.015
ibi-0.014
Token sa
Token ID473
Feature activation+1.56e+00

Pos logit contributions
lic+0.020
oto+0.020
ogo+0.017
ibi+0.017
EEP+0.017
Neg logit contributions
vered-0.031
ement-0.030
 symp-0.028
 spe-0.027
 aston-0.027
Tokenucer
Token ID48915
Feature activation-7.25e+00

Pos logit contributions
vered+0.184
ement+0.173
ingly+0.168
 symp+0.167
el+0.167
Neg logit contributions
oto-0.094
lic-0.093
round-0.083
EEP-0.082
ibi-0.081
Token.
Token ID13
Feature activation+1.83e-01

Pos logit contributions
lic+0.456
oto+0.408
oys+0.361
round+0.354
ogo+0.351
Neg logit contributions
vered-0.840
ement-0.784
 afar-0.778
 aston-0.759
 symp-0.754
Token "
Token ID366
Feature activation-1.24e+00

Pos logit contributions
vered+0.019
ement+0.018
 spe+0.017
 symp+0.017
 aston+0.016
Neg logit contributions
oto-0.015
lic-0.014
ogo-0.013
ibi-0.013
EEP-0.013
TokenYes
Token ID5297
Feature activation+3.13e-01

Pos logit contributions
lic+0.103
oto+0.098
round+0.093
ibi+0.087
EEP+0.086
Neg logit contributions
vered-0.128
ement-0.124
 symp-0.113
 spe-0.110
 aston-0.110
Token,
Token ID11
Feature activation-1.24e-01

Pos logit contributions
vered+0.028
ement+0.027
 spe+0.026
 symp+0.026
 aston+0.025
Neg logit contributions
oto-0.028
lic-0.028
ogo-0.026
ibi-0.025
EEP-0.025
Token please
Token ID3387
Feature activation+1.34e+00

Pos logit contributions
lic+0.009
oto+0.009
ogo+0.009
ibi+0.009
EEP+0.008
Neg logit contributions
vered-0.012
ement-0.012
 spe-0.012
 symp-0.012
 aston-0.011
Token the
Token ID262
Feature activation+4.11e-01

Pos logit contributions
vered+0.025
ement+0.024
 symp+0.024
 spe+0.023
+0.022
Neg logit contributions
oto-0.009
lic-0.009
ogo-0.007
round-0.007
oys-0.006
Token gate
Token ID8946
Feature activation-7.87e-01

Pos logit contributions
vered+0.047
ement+0.045
 symp+0.044
ingly+0.041
 aston+0.041
Neg logit contributions
lic-0.027
oto-0.023
ogo-0.023
EEP-0.022
ye-0.021
Token to
Token ID284
Feature activation+1.27e-01

Pos logit contributions
lic+0.046
oto+0.044
round+0.041
ogo+0.039
EEP+0.038
Neg logit contributions
vered-0.095
 spe-0.088
 symp-0.085
ement-0.085
 aston-0.082
Token the
Token ID262
Feature activation+3.77e-01

Pos logit contributions
vered+0.013
ement+0.012
 spe+0.012
 symp+0.012
 aston+0.011
Neg logit contributions
oto-0.010
lic-0.010
ogo-0.010
ibi-0.009
EEP-0.009
Token padd
Token ID14098
Feature activation+1.52e-01

Pos logit contributions
vered+0.045
ement+0.043
 symp+0.040
 spe+0.038
 aston+0.038
Neg logit contributions
oto-0.024
lic-0.023
ogo-0.021
EEP-0.020
oys-0.020
Tokenock
Token ID735
Feature activation-7.23e+00

Pos logit contributions
vered+0.014
ement+0.014
 spe+0.014
 symp+0.013
 aston+0.013
Neg logit contributions
lic-0.013
oto-0.013
ogo-0.012
round-0.012
ibi-0.012
Token and
Token ID290
Feature activation-2.26e-01

Pos logit contributions
lic+0.515
oto+0.490
round+0.459
ogo+0.459
ibi+0.427
Neg logit contributions
vered-0.760
ement-0.730
 spe-0.699
 symp-0.697
 aston-0.684
Token put
Token ID1234
Feature activation+3.10e-01

Pos logit contributions
oto+0.019
lic+0.018
ogo+0.018
ibi+0.017
EEP+0.017
Neg logit contributions
vered-0.022
 spe-0.021
ement-0.020
 symp-0.020
 aston-0.019
Token the
Token ID262
Feature activation+3.60e-01

Pos logit contributions
vered+0.033
ement+0.032
 symp+0.031
 spe+0.031
 aston+0.030
Neg logit contributions
oto-0.022
lic-0.022
ogo-0.020
ibi-0.019
round-0.018
Token tiny
Token ID7009
Feature activation+8.90e-01

Pos logit contributions
vered+0.039
ement+0.039
 symp+0.037
 spe+0.035
 aston+0.035
Neg logit contributions
lic-0.025
oto-0.025
ogo-0.022
ibi-0.022
EEP-0.021
Token hole
Token ID7604
Feature activation+3.08e-01

Pos logit contributions
ement+0.098
 symp+0.092
vered+0.092
 aston+0.086
+0.085
Neg logit contributions
lic-0.059
oto-0.054
oys-0.052
ibi-0.050
ogo-0.049
Token Katie
Token ID24721
Feature activation+1.55e+00

Pos logit contributions
oto+0.019
lic+0.019
ogo+0.018
ibi+0.017
EEP+0.017
Neg logit contributions
vered-0.016
ement-0.015
 spe-0.015
 symp-0.015
 aston-0.014
Token put
Token ID1234
Feature activation+2.78e-01

Pos logit contributions
vered+0.175
ement+0.173
 symp+0.165
 spe+0.163
 aston+0.157
Neg logit contributions
lic-0.101
oto-0.099
EEP-0.088
ogo-0.088
ibi-0.087
Token on
Token ID319
Feature activation-2.54e-01

Pos logit contributions
vered+0.028
ement+0.026
 spe+0.026
 symp+0.025
 aston+0.025
Neg logit contributions
oto-0.022
lic-0.022
ogo-0.020
ibi-0.020
round-0.019
Token her
Token ID607
Feature activation+6.24e-01

Pos logit contributions
oto+0.018
lic+0.017
ibi+0.016
EEP+0.015
ogo+0.015
Neg logit contributions
vered-0.027
ement-0.026
 spe-0.025
 symp-0.024
ingly-0.023
Token sm
Token ID895
Feature activation-4.07e-01

Pos logit contributions
vered+0.070
ement+0.065
 spe+0.064
 symp+0.063
 aston+0.061
Neg logit contributions
oto-0.043
lic-0.042
ogo-0.039
ibi-0.038
EEP-0.037
Tokenock
Token ID735
Feature activation-7.23e+00

Pos logit contributions
lic+0.040
oto+0.037
ogo+0.036
EEP+0.035
 Indians+0.035
Neg logit contributions
ement-0.032
vered-0.032
ingly-0.030
 symp-0.029
el-0.028
Token and
Token ID290
Feature activation-3.19e-01

Pos logit contributions
oto+0.504
lic+0.489
ibi+0.454
round+0.443
ogo+0.442
Neg logit contributions
vered-0.794
ement-0.770
 spe-0.757
 symp-0.720
 aston-0.718
Token went
Token ID1816
Feature activation-2.23e-01

Pos logit contributions
oto+0.025
lic+0.025
ogo+0.023
ibi+0.022
round+0.022
Neg logit contributions
vered-0.033
ement-0.031
 spe-0.030
 symp-0.030
 aston-0.029
Token outside
Token ID2354
Feature activation-2.35e+00

Pos logit contributions
oto+0.021
lic+0.020
ogo+0.020
EEP+0.019
ibi+0.019
Neg logit contributions
vered-0.018
 spe-0.017
ement-0.017
 symp-0.017
 aston-0.016
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
lic+0.072
oto+0.064
oys+0.059
round+0.058
EEP+0.055
Neg logit contributions
vered-0.339
ement-0.333
 spe-0.305
 symp-0.300
 aston-0.299
Token She
Token ID1375
Feature activation+3.98e-01

Pos logit contributions
vered+0.009
ement+0.008
 spe+0.008
 symp+0.008
 aston+0.007
Neg logit contributions
oto-0.010
lic-0.010
ogo-0.010
ibi-0.009
round-0.009
Token about
Token ID546
Feature activation-3.51e+00

Pos logit contributions
vered+0.124
ement+0.119
 symp+0.115
 spe+0.114
 aston+0.111
Neg logit contributions
oto-0.055
lic-0.053
ogo-0.050
ibi-0.048
round-0.046
Token me
Token ID502
Feature activation+9.55e-02

Pos logit contributions
oto+0.237
lic+0.215
ogo+0.207
EEP+0.193
ibi+0.191
Neg logit contributions
vered-0.404
ement-0.386
 symp-0.375
 spe-0.372
 aston-0.360
Token,"
Token ID553
Feature activation+3.19e+00

Pos logit contributions
vered+0.011
 spe+0.010
ement+0.010
 aston+0.010
 symp+0.010
Neg logit contributions
oto-0.007
lic-0.006
ogo-0.006
round-0.006
EEP-0.005
Token he
Token ID339
Feature activation-1.60e-01

Pos logit contributions
vered+0.466
ement+0.436
 spe+0.423
 symp+0.420
 aston+0.416
Neg logit contributions
oto-0.124
lic-0.111
ibi-0.101
round-0.091
EEP-0.087
Token said
Token ID531
Feature activation-5.69e-01

Pos logit contributions
oto+0.011
lic+0.010
ogo+0.010
ibi+0.009
EEP+0.009
Neg logit contributions
vered-0.018
ement-0.018
 spe-0.017
 symp-0.017
 aston-0.016
Token sadly
Token ID21098
Feature activation+6.16e+00

Pos logit contributions
lic+0.037
oto+0.035
EEP+0.031
oys+0.030
ogo+0.029
Neg logit contributions
vered-0.063
 spe-0.061
ement-0.059
 aston-0.057
 afar-0.055
Token.
Token ID13
Feature activation+3.66e-01

Pos logit contributions
vered+0.735
ement+0.697
 spe+0.697
 aston+0.651
 spec+0.628
Neg logit contributions
oto-0.367
lic-0.355
oys-0.308
EEP-0.299
ibi-0.292
Token
Token ID220
Feature activation+1.28e+00

Pos logit contributions
vered+0.039
ement+0.037
 spe+0.036
 symp+0.035
 aston+0.034
Neg logit contributions
oto-0.027
lic-0.027
ogo-0.025
ibi-0.024
EEP-0.024
Token
Token ID198
Feature activation-2.15e-01

Pos logit contributions
vered+0.141
 spe+0.132
ement+0.131
 symp+0.127
 aston+0.124
Neg logit contributions
oto-0.093
lic-0.092
ogo-0.086
ibi-0.082
EEP-0.082
Token
Token ID198
Feature activation-1.79e-01

Pos logit contributions
oto+0.018
lic+0.018
ogo+0.017
ibi+0.016
round+0.016
Neg logit contributions
vered-0.020
ement-0.019
 symp-0.018
 spe-0.018
 aston-0.017
TokenMom
Token ID29252
Feature activation-2.96e-01

Pos logit contributions
oto+0.014
lic+0.014
ogo+0.013
ibi+0.012
EEP+0.012
Neg logit contributions
vered-0.018
ement-0.017
 symp-0.017
 aston-0.016
 spe-0.016
Token the
Token ID262
Feature activation+2.69e-01

Pos logit contributions
vered+0.033
ement+0.031
 spe+0.029
 symp+0.029
 aston+0.028
Neg logit contributions
oto-0.020
lic-0.018
ogo-0.018
ibi-0.017
round-0.017
Token rain
Token ID6290
Feature activation-3.06e-01

Pos logit contributions
vered+0.035
ement+0.032
 symp+0.032
ingly+0.030
 aston+0.030
Neg logit contributions
oto-0.015
lic-0.012
EEP-0.012
round-0.011
ibi-0.011
Token,
Token ID11
Feature activation+3.13e-01

Pos logit contributions
oto+0.023
lic+0.023
ogo+0.021
EEP+0.021
round+0.021
Neg logit contributions
vered-0.032
ement-0.030
 symp-0.029
 aston-0.028
 spe-0.028
Token and
Token ID290
Feature activation+2.33e-01

Pos logit contributions
vered+0.034
ement+0.032
 symp+0.031
 spe+0.030
 aston+0.030
Neg logit contributions
oto-0.023
lic-0.022
ogo-0.021
EEP-0.020
ibi-0.019
Token the
Token ID262
Feature activation+3.21e-01

Pos logit contributions
vered+0.027
ement+0.027
 symp+0.026
 aston+0.024
 spe+0.024
Neg logit contributions
oto-0.014
lic-0.013
ogo-0.012
EEP-0.012
round-0.012
Token de
Token ID390
Feature activation+6.00e+00

Pos logit contributions
vered+0.041
ement+0.038
 symp+0.037
ingly+0.036
 aston+0.035
Neg logit contributions
oto-0.018
lic-0.016
EEP-0.015
ibi-0.014
round-0.014
Tokenw
Token ID86
Feature activation-1.73e+00

Pos logit contributions
vered+0.708
ement+0.674
 spe+0.656
 symp+0.652
 aston+0.629
Neg logit contributions
oto-0.380
lic-0.373
ogo-0.344
ibi-0.328
EEP-0.319
Token.
Token ID13
Feature activation+6.99e-01

Pos logit contributions
oto+0.150
lic+0.149
ogo+0.138
ibi+0.136
round+0.136
Neg logit contributions
vered-0.162
ement-0.151
 spe-0.145
 symp-0.143
el-0.135
Token You
Token ID921
Feature activation+1.49e+00

Pos logit contributions
vered+0.071
ement+0.067
 spe+0.065
 symp+0.064
 aston+0.061
Neg logit contributions
oto-0.056
lic-0.055
ogo-0.051
EEP-0.049
ibi-0.049
Token can
Token ID460
Feature activation+1.18e+00

Pos logit contributions
vered+0.155
ement+0.146
 spe+0.142
 symp+0.141
 aston+0.136
Neg logit contributions
oto-0.115
lic-0.113
ogo-0.105
ibi-0.101
EEP-0.100
Token smell
Token ID8508
Feature activation-1.39e+00

Pos logit contributions
vered+0.135
ement+0.128
 symp+0.124
 spe+0.123
 aston+0.119
Neg logit contributions
oto-0.080
lic-0.077
ogo-0.073
ibi-0.069
round-0.068
Token box
Token ID3091
Feature activation-3.67e-01

Pos logit contributions
vered+0.060
ement+0.055
 symp+0.054
 spe+0.054
 aston+0.051
Neg logit contributions
lic-0.039
oto-0.038
ogo-0.036
EEP-0.034
ibi-0.033
Token and
Token ID290
Feature activation+6.43e-02

Pos logit contributions
lic+0.020
oto+0.019
oys+0.017
ogo+0.017
round+0.016
Neg logit contributions
vered-0.046
 spe-0.042
 symp-0.041
ement-0.040
 afar-0.040
Token share
Token ID2648
Feature activation-1.10e+00

Pos logit contributions
vered+0.008
ement+0.008
 symp+0.007
 spe+0.007
ingly+0.007
Neg logit contributions
oto-0.003
lic-0.003
ogo-0.003
EEP-0.003
round-0.003
Token the
Token ID262
Feature activation+5.33e-01

Pos logit contributions
oto+0.080
lic+0.075
ogo+0.069
round+0.067
ibi+0.066
Neg logit contributions
vered-0.120
ement-0.114
 symp-0.108
 spe-0.107
ingly-0.103
Token ch
Token ID442
Feature activation+9.59e-01

Pos logit contributions
vered+0.063
ement+0.060
 symp+0.058
ingly+0.056
 spe+0.056
Neg logit contributions
oto-0.033
lic-0.031
ogo-0.029
ibi-0.027
oys-0.027
Tokenocol
Token ID4668
Feature activation+7.98e+00

Pos logit contributions
vered+0.038
ement+0.033
 spe+0.029
 symp+0.028
 aston+0.025
Neg logit contributions
oto-0.136
lic-0.136
ibi-0.129
ogo-0.128
EEP-0.127
Tokenates
Token ID689
Feature activation-1.20e-01

Pos logit contributions
vered+0.650
ement+0.590
 spe+0.563
el+0.548
 symp+0.542
Neg logit contributions
lic-0.780
oto-0.767
ibi-0.729
EEP-0.712
round-0.696
Token.
Token ID13
Feature activation+5.37e-01

Pos logit contributions
oto+0.010
lic+0.009
ogo+0.009
round+0.009
ibi+0.008
Neg logit contributions
vered-0.012
ement-0.011
 spe-0.011
 symp-0.011
 aston-0.011
Token They
Token ID1119
Feature activation-2.02e-01

Pos logit contributions
vered+0.051
ement+0.048
 spe+0.048
 symp+0.047
 aston+0.046
Neg logit contributions
oto-0.046
lic-0.044
ogo-0.043
ibi-0.040
round-0.040
Token are
Token ID389
Feature activation-7.29e-01

Pos logit contributions
oto+0.013
lic+0.013
ogo+0.012
ibi+0.011
EEP+0.011
Neg logit contributions
vered-0.023
ement-0.023
 symp-0.021
 spe-0.021
ingly-0.020
Token happy
Token ID3772
Feature activation+7.71e-01

Pos logit contributions
oto+0.063
lic+0.063
ogo+0.059
EEP+0.058
round+0.058
Neg logit contributions
vered-0.067
ement-0.063
 spe-0.063
 symp-0.061
 aston-0.060
Token high
Token ID1029
Feature activation+1.89e+00

Pos logit contributions
vered+0.045
 spe+0.043
ement+0.042
 symp+0.041
 aston+0.040
Neg logit contributions
oto-0.039
lic-0.038
ogo-0.037
ibi-0.035
round-0.034
Token they
Token ID484
Feature activation-5.22e-01

Pos logit contributions
vered+0.214
 aston+0.198
 symp+0.196
ement+0.195
 spe+0.189
Neg logit contributions
oto-0.127
lic-0.122
ogo-0.113
round-0.108
EEP-0.104
Token were
Token ID547
Feature activation+7.65e-01

Pos logit contributions
oto+0.043
lic+0.041
ogo+0.040
ibi+0.038
oys+0.037
Neg logit contributions
vered-0.052
 spe-0.050
ement-0.048
 symp-0.047
 aston-0.045
Token just
Token ID655
Feature activation+5.49e-01

Pos logit contributions
vered+0.077
ement+0.072
 symp+0.070
 spe+0.069
 aston+0.067
Neg logit contributions
oto-0.062
lic-0.062
ogo-0.058
ibi-0.055
EEP-0.054
Token tiny
Token ID7009
Feature activation+8.42e-01

Pos logit contributions
vered+0.063
ement+0.060
 symp+0.057
ingly+0.053
+0.053
Neg logit contributions
oto-0.034
lic-0.034
ibi-0.030
EEP-0.029
round-0.029
Token spec
Token ID1020
Feature activation+5.69e+00

Pos logit contributions
 symp+0.101
ement+0.101
vered+0.098
 aston+0.094
+0.090
Neg logit contributions
lic-0.036
!"-0.034
ruck-0.033
EEP-0.033
oys-0.032
Tokenks
Token ID591
Feature activation+1.47e+00

Pos logit contributions
vered+0.516
ement+0.514
 symp+0.472
 aston+0.450
 spe+0.441
Neg logit contributions
lic-0.496
oto-0.484
ogo-0.466
una-0.442
round-0.435
Token in
Token ID287
Feature activation+8.65e-01

Pos logit contributions
vered+0.145
ement+0.137
 spe+0.133
 symp+0.131
 aston+0.126
Neg logit contributions
lic-0.123
oto-0.119
ogo-0.110
EEP-0.107
ibi-0.104
Token the
Token ID262
Feature activation+7.28e-02

Pos logit contributions
ement+0.111
vered+0.110
 symp+0.099
bugs+0.094
ome+0.094
Neg logit contributions
 cool-0.033
oto-0.032
 water-0.029
!"-0.026
 sound-0.026
Token sky
Token ID6766
Feature activation+2.72e-01

Pos logit contributions
vered+0.009
ement+0.008
 symp+0.008
 spe+0.007
 aston+0.007
Neg logit contributions
lic-0.005
oto-0.005
EEP-0.004
ibi-0.004
round-0.004
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
vered+0.032
ement+0.029
 symp+0.028
 aston+0.028
 spe+0.026
Neg logit contributions
lic-0.018
oto-0.017
EEP-0.015
round-0.015
ogo-0.015
Token little
Token ID1310
Feature activation-3.06e-01

Pos logit contributions
vered+0.024
ement+0.022
 symp+0.022
 spe+0.022
 aston+0.021
Neg logit contributions
oto-0.012
lic-0.011
round-0.011
oys-0.010
ogo-0.010
Token boy
Token ID2933
Feature activation+6.76e-01

Pos logit contributions
round+0.010
oto+0.009
oys+0.009
 robber+0.007
 bye+0.007
Neg logit contributions
vered-0.042
 symp-0.042
ement-0.041
 aston-0.038
 spe-0.037
Token thanked
Token ID26280
Feature activation+2.47e+00

Pos logit contributions
vered+0.075
ement+0.074
 spe+0.071
 symp+0.069
 aston+0.067
Neg logit contributions
oto-0.044
lic-0.043
round-0.039
ibi-0.038
ogo-0.038
Token the
Token ID262
Feature activation+3.07e-01

Pos logit contributions
vered+0.278
ement+0.256
 spe+0.250
 symp+0.241
 aston+0.237
Neg logit contributions
oto-0.186
lic-0.162
ogo-0.160
round-0.149
ibi-0.146
Token shop
Token ID6128
Feature activation+2.74e+00

Pos logit contributions
vered+0.025
ement+0.023
 symp+0.023
 spe+0.022
 aston+0.021
Neg logit contributions
oto-0.031
lic-0.029
round-0.028
ogo-0.028
ibi-0.027
Tokenkeeper
Token ID13884
Feature activation+6.10e+00

Pos logit contributions
vered+0.230
ement+0.200
 spe+0.199
 symp+0.189
 aston+0.188
Neg logit contributions
oto-0.277
lic-0.265
round-0.254
ogo-0.251
ibi-0.247
Token and
Token ID290
Feature activation-2.52e-01

Pos logit contributions
vered+0.786
 spe+0.754
ement+0.734
 aston+0.706
 distress+0.678
Neg logit contributions
oto-0.342
round-0.286
ogo-0.280
lic-0.278
oys-0.258
Token ran
Token ID4966
Feature activation-4.64e-01

Pos logit contributions
oto+0.019
lic+0.019
ogo+0.017
ibi+0.017
EEP+0.016
Neg logit contributions
vered-0.027
ement-0.025
 spe-0.025
 symp-0.024
 aston-0.024
Token out
Token ID503
Feature activation-2.16e+00

Pos logit contributions
oto+0.042
lic+0.041
ogo+0.040
EEP+0.037
abs+0.037
Neg logit contributions
vered-0.040
ement-0.037
 symp-0.036
 aston-0.035
ingly-0.035
Token the
Token ID262
Feature activation+2.92e-01

Pos logit contributions
lic+0.100
oto+0.087
ogo+0.079
round+0.074
oys+0.070
Neg logit contributions
vered-0.288
ement-0.275
 spe-0.264
 symp-0.259
 afar-0.256
Token door
Token ID3420
Feature activation-1.93e+00

Pos logit contributions
vered+0.030
ement+0.028
 spe+0.027
 symp+0.027
 aston+0.026
Neg logit contributions
oto-0.023
lic-0.023
ogo-0.022
ibi-0.021
EEP-0.020
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
vered+0.032
 symp+0.030
ement+0.029
 spe+0.029
 aston+0.029
Neg logit contributions
oto-0.016
lic-0.014
oys-0.014
EEP-0.014
round-0.013
Token<|endoftext|>
Token ID50256
Feature activation+9.70e-01

Pos logit contributions
vered+0.013
ement+0.012
 spe+0.012
 symp+0.011
 aston+0.011
Neg logit contributions
oto-0.008
lic-0.008
ogo-0.008
ibi-0.008
EEP-0.007
TokenOnce
Token ID7454
Feature activation-3.56e-02

Pos logit contributions
vered+0.105
ement+0.099
 spe+0.097
 symp+0.096
 aston+0.092
Neg logit contributions
oto-0.070
lic-0.070
ogo-0.065
ibi-0.061
EEP-0.061
Token upon
Token ID2402
Feature activation+2.35e-01

Pos logit contributions
oto+0.002
lic+0.002
ogo+0.002
ibi+0.002
EEP+0.002
Neg logit contributions
vered-0.004
 spe-0.004
ement-0.004
 symp-0.004
 aston-0.004
Token a
Token ID257
Feature activation-2.00e-01

Pos logit contributions
vered+0.024
ement+0.022
 spe+0.022
 symp+0.022
 aston+0.021
Neg logit contributions
oto-0.019
lic-0.019
ogo-0.017
ibi-0.017
EEP-0.016
Token time
Token ID640
Feature activation+2.06e+00

Pos logit contributions
oto+0.016
lic+0.015
ogo+0.014
EEP+0.014
ibi+0.014
Neg logit contributions
vered-0.021
ement-0.020
 spe-0.019
 symp-0.018
 aston-0.018
Token,
Token ID11
Feature activation-2.83e-01

Pos logit contributions
vered+0.178
ement+0.167
 spe+0.160
 symp+0.160
 aston+0.153
Neg logit contributions
oto-0.194
lic-0.192
ogo-0.181
ibi-0.176
round-0.174
Token there
Token ID612
Feature activation-4.70e-02

Pos logit contributions
lic+0.022
oto+0.022
ogo+0.020
EEP+0.019
ibi+0.019
Neg logit contributions
vered-0.029
ement-0.028
 spe-0.026
 symp-0.026
 aston-0.025
Token was
Token ID373
Feature activation+7.40e-01

Pos logit contributions
oto+0.003
lic+0.003
ogo+0.003
ibi+0.003
EEP+0.003
Neg logit contributions
vered-0.005
ement-0.005
 spe-0.005
 symp-0.005
 aston-0.005
Token a
Token ID257
Feature activation-1.04e-01

Pos logit contributions
vered+0.086
ement+0.082
 symp+0.078
 spe+0.077
 aston+0.074
Neg logit contributions
oto-0.051
lic-0.049
ogo-0.043
ibi-0.043
EEP-0.041
Token fat
Token ID3735
Feature activation+8.29e-01

Pos logit contributions
oto+0.007
lic+0.007
ogo+0.007
ibi+0.006
EEP+0.006
Neg logit contributions
vered-0.011
ement-0.011
 spe-0.011
 symp-0.011
 aston-0.010
Token
Token ID198
Feature activation-3.61e-01

Pos logit contributions
oto+0.029
lic+0.028
ogo+0.026
ibi+0.025
EEP+0.025
Neg logit contributions
vered-0.037
ement-0.035
 spe-0.034
 symp-0.034
 aston-0.033
TokenThe
Token ID464
Feature activation+8.11e-02

Pos logit contributions
oto+0.039
lic+0.039
ogo+0.037
ibi+0.036
EEP+0.036
Neg logit contributions
vered-0.026
ement-0.024
 spe-0.023
 symp-0.023
 aston-0.022
Token friends
Token ID2460
Feature activation-2.39e-01

Pos logit contributions
vered+0.009
ement+0.008
 spe+0.008
 symp+0.008
 aston+0.008
Neg logit contributions
oto-0.006
lic-0.006
ogo-0.006
ibi-0.005
EEP-0.005
Token happily
Token ID18177
Feature activation+4.26e+00

Pos logit contributions
oto+0.013
lic+0.012
round+0.011
EEP+0.010
ibi+0.010
Neg logit contributions
ement-0.029
vered-0.029
 spe-0.028
 symp-0.027
 aston-0.027
Token said
Token ID531
Feature activation-7.09e-01

Pos logit contributions
vered+0.432
 spe+0.395
 symp+0.389
ement+0.385
 aston+0.373
Neg logit contributions
oto-0.329
lic-0.320
ogo-0.318
EEP-0.298
ibi-0.295
Token yes
Token ID3763
Feature activation+3.34e+00

Pos logit contributions
oto+0.050
lic+0.049
EEP+0.043
oys+0.042
round+0.040
Neg logit contributions
vered-0.077
ement-0.073
 spe-0.071
 symp-0.071
 aston-0.069
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
vered+0.268
ement+0.263
 symp+0.251
 spe+0.235
ingly+0.230
Neg logit contributions
lic-0.329
oto-0.321
ogo-0.310
ibi-0.299
round-0.290
Token and
Token ID290
Feature activation-2.73e-01

Pos logit contributions
oto+0.017
lic+0.017
ogo+0.016
ibi+0.015
EEP+0.015
Neg logit contributions
vered-0.018
ement-0.018
 spe-0.017
 symp-0.017
 aston-0.016
Token they
Token ID484
Feature activation-4.86e-01

Pos logit contributions
oto+0.022
lic+0.022
ogo+0.021
ibi+0.020
EEP+0.020
Neg logit contributions
vered-0.027
 spe-0.025
ement-0.025
 symp-0.024
 aston-0.024
Token found
Token ID1043
Feature activation+3.35e-01

Pos logit contributions
oto+0.038
lic+0.038
ogo+0.036
EEP+0.035
ibi+0.034
Neg logit contributions
vered-0.049
 spe-0.046
ement-0.045
 symp-0.045
 aston-0.043
Token a
Token ID257
Feature activation+1.52e-02

Pos logit contributions
vered+0.039
ement+0.039
 symp+0.039
 spe+0.037
ingly+0.036
Neg logit contributions
oto-0.019
lic-0.019
ogo-0.016
oys-0.015
ibi-0.015
Token have
Token ID423
Feature activation+2.93e-01

Pos logit contributions
oto+0.027
lic+0.026
ogo+0.024
ibi+0.023
round+0.023
Neg logit contributions
vered-0.052
ement-0.052
 symp-0.048
 spe-0.046
 aston-0.046
Token a
Token ID257
Feature activation+1.65e-01

Pos logit contributions
vered+0.033
ement+0.032
 symp+0.030
 spe+0.029
 aston+0.029
Neg logit contributions
lic-0.021
oto-0.021
ogo-0.019
ibi-0.018
EEP-0.018
Token big
Token ID1263
Feature activation-5.95e-02

Pos logit contributions
vered+0.021
ement+0.020
 symp+0.019
ingly+0.019
 spe+0.018
Neg logit contributions
lic-0.009
oto-0.009
ogo-0.008
ibi-0.008
EEP-0.008
Token box
Token ID3091
Feature activation-3.97e-01

Pos logit contributions
oto+0.004
lic+0.003
ibi+0.003
round+0.003
ogo+0.003
Neg logit contributions
vered-0.007
ement-0.007
ingly-0.006
 symp-0.006
 spe-0.006
Token of
Token ID286
Feature activation+5.25e-01

Pos logit contributions
oto+0.029
lic+0.029
ogo+0.027
ibi+0.026
EEP+0.025
Neg logit contributions
vered-0.043
ement-0.040
 spe-0.039
 symp-0.039
 aston-0.037
Token cars
Token ID5006
Feature activation+1.39e+00

Pos logit contributions
vered+0.069
ement+0.065
 symp+0.064
 spe+0.061
ingly+0.060
Neg logit contributions
oto-0.027
lic-0.027
ibi-0.022
ogo-0.021
oys-0.021
Token,
Token ID11
Feature activation-5.08e-02

Pos logit contributions
vered+0.130
ement+0.122
 spe+0.118
 symp+0.116
 aston+0.111
Neg logit contributions
oto-0.122
lic-0.119
ogo-0.113
ibi-0.108
EEP-0.107
Token dolls
Token ID36062
Feature activation+5.80e-01

Pos logit contributions
oto+0.004
lic+0.004
ogo+0.004
ibi+0.003
round+0.003
Neg logit contributions
vered-0.005
ement-0.005
 symp-0.005
 spe-0.005
 aston-0.005
Token,
Token ID11
Feature activation-6.32e-02

Pos logit contributions
vered+0.052
ement+0.049
 spe+0.047
 symp+0.046
 aston+0.044
Neg logit contributions
oto-0.053
lic-0.052
ogo-0.049
ibi-0.048
EEP-0.047
Token balls
Token ID11333
Feature activation+7.02e-01

Pos logit contributions
oto+0.005
lic+0.005
ogo+0.004
ibi+0.004
round+0.004
Neg logit contributions
vered-0.007
ement-0.006
 symp-0.006
 spe-0.006
 aston-0.006
Token and
Token ID290
Feature activation-2.31e-01

Pos logit contributions
vered+0.065
ement+0.061
 spe+0.059
 symp+0.058
 aston+0.056
Neg logit contributions
oto-0.062
lic-0.062
ogo-0.058
ibi-0.056
EEP-0.055
Token.
Token ID13
Feature activation+3.37e-01

Pos logit contributions
vered+0.090
 spe+0.089
ement+0.088
 symp+0.084
joy+0.080
Neg logit contributions
oto-0.048
lic-0.043
ogo-0.042
ibi-0.042
oys-0.040
Token<|endoftext|>
Token ID50256
Feature activation+1.10e+00

Pos logit contributions
vered+0.033
ement+0.031
 symp+0.030
 spe+0.030
 aston+0.028
Neg logit contributions
oto-0.028
lic-0.028
ogo-0.026
ibi-0.025
round-0.025
TokenOnce
Token ID7454
Feature activation+5.07e-02

Pos logit contributions
vered+0.126
ement+0.117
 spe+0.117
 symp+0.115
 aston+0.110
Neg logit contributions
lic-0.075
oto-0.074
ogo-0.067
ibi-0.064
EEP-0.063
Token upon
Token ID2402
Feature activation+2.11e-01

Pos logit contributions
vered+0.006
ement+0.005
 spe+0.005
 symp+0.005
 aston+0.005
Neg logit contributions
oto-0.004
lic-0.004
ogo-0.003
ibi-0.003
EEP-0.003
Token a
Token ID257
Feature activation+4.78e-02

Pos logit contributions
vered+0.021
ement+0.019
 symp+0.019
 spe+0.019
 aston+0.018
Neg logit contributions
oto-0.017
lic-0.017
ogo-0.016
ibi-0.016
EEP-0.015
Token time
Token ID640
Feature activation+2.31e+00

Pos logit contributions
vered+0.005
ement+0.004
 spe+0.004
 symp+0.004
 aston+0.004
Neg logit contributions
oto-0.004
lic-0.004
ogo-0.004
ibi-0.003
EEP-0.003
Token,
Token ID11
Feature activation-1.58e-01

Pos logit contributions
vered+0.232
ement+0.216
 spe+0.211
 symp+0.208
 aston+0.198
Neg logit contributions
oto-0.185
lic-0.182
ogo-0.172
EEP-0.164
ibi-0.162
Token there
Token ID612
Feature activation+8.22e-03

Pos logit contributions
lic+0.009
oto+0.008
oys+0.007
ogo+0.007
EEP+0.007
Neg logit contributions
vered-0.019
ement-0.019
 spe-0.018
 symp-0.018
 aston-0.017
Token was
Token ID373
Feature activation+7.95e-01

Pos logit contributions
vered+0.001
ement+0.001
 spe+0.001
 symp+0.001
 aston+0.001
Neg logit contributions
oto-0.001
lic-0.001
ogo-0.000
ibi-0.000
EEP-0.000
Token a
Token ID257
Feature activation+7.55e-02

Pos logit contributions
vered+0.101
ement+0.096
 symp+0.091
 spe+0.090
ingly+0.087
Neg logit contributions
oto-0.048
lic-0.043
ibi-0.039
una-0.038
ogo-0.037
Token little
Token ID1310
Feature activation-3.03e-01

Pos logit contributions
vered+0.008
 spe+0.008
ement+0.007
 symp+0.007
 aston+0.007
Neg logit contributions
oto-0.006
lic-0.006
ogo-0.005
ibi-0.005
EEP-0.005
Token.
Token ID13
Feature activation+8.05e-01

Pos logit contributions
vered+0.265
ement+0.254
 spe+0.249
 aston+0.238
 symp+0.231
Neg logit contributions
oto-0.167
ogo-0.145
lic-0.145
ibi-0.137
EEP-0.136
Token They
Token ID1119
Feature activation-6.48e-02

Pos logit contributions
vered+0.097
ement+0.093
 symp+0.087
 spe+0.086
 aston+0.084
Neg logit contributions
lic-0.051
oto-0.050
EEP-0.045
ibi-0.043
ogo-0.043
Token say
Token ID910
Feature activation-1.35e-01

Pos logit contributions
oto+0.004
lic+0.004
ogo+0.004
ibi+0.003
EEP+0.003
Neg logit contributions
vered-0.008
ement-0.007
 spe-0.007
 symp-0.007
 aston-0.007
Token,
Token ID11
Feature activation+4.45e-01

Pos logit contributions
oto+0.011
lic+0.011
ogo+0.010
EEP+0.010
ibi+0.009
Neg logit contributions
vered-0.014
ement-0.013
 symp-0.012
 spe-0.012
 aston-0.012
Token "
Token ID366
Feature activation-9.33e-01

Pos logit contributions
vered+0.055
ement+0.052
 spe+0.051
 symp+0.051
 aston+0.049
Neg logit contributions
oto-0.026
lic-0.025
ogo-0.023
ibi-0.022
EEP-0.021
TokenDo
Token ID5211
Feature activation+3.07e+00

Pos logit contributions
EEP+0.059
oto+0.059
lic+0.053
round+0.052
ibi+0.052
Neg logit contributions
ement-0.106
vered-0.102
 aston-0.095
 symp-0.094
ocol-0.092
Token you
Token ID345
Feature activation+1.12e+00

Pos logit contributions
vered+0.320
ement+0.305
 symp+0.301
 spe+0.294
 aston+0.288
Neg logit contributions
lic-0.231
oto-0.224
ogo-0.210
round-0.210
ibi-0.204
Token want
Token ID765
Feature activation+7.70e-01

Pos logit contributions
vered+0.134
ement+0.129
 spe+0.125
 symp+0.122
 aston+0.118
Neg logit contributions
oto-0.070
lic-0.068
ogo-0.061
ibi-0.060
EEP-0.060
Token to
Token ID284
Feature activation+3.04e-01

Pos logit contributions
vered+0.089
ement+0.084
 symp+0.082
 spe+0.081
 aston+0.078
Neg logit contributions
oto-0.052
lic-0.051
ogo-0.047
ibi-0.044
EEP-0.044
Token play
Token ID711
Feature activation-5.10e-02

Pos logit contributions
vered+0.034
ement+0.032
 spe+0.031
 symp+0.031
 aston+0.030
Neg logit contributions
lic-0.021
oto-0.021
ogo-0.019
ibi-0.019
EEP-0.018
Token with
Token ID351
Feature activation+1.99e+00

Pos logit contributions
oto+0.004
lic+0.004
ogo+0.004
round+0.004
ibi+0.004
Neg logit contributions
vered-0.005
ement-0.005
 spe-0.005
 symp-0.005
 aston-0.004
Token said
Token ID531
Feature activation-4.92e-01

Pos logit contributions
oto+0.003
lic+0.003
ogo+0.003
ibi+0.003
EEP+0.003
Neg logit contributions
vered-0.005
ement-0.004
 symp-0.004
 spe-0.004
 aston-0.004
Token,
Token ID11
Feature activation+1.42e-01

Pos logit contributions
lic+0.040
oto+0.039
ogo+0.036
EEP+0.035
round+0.034
Neg logit contributions
vered-0.049
ement-0.046
 symp-0.045
 spe-0.045
 aston-0.043
Token "
Token ID366
Feature activation-1.02e+00

Pos logit contributions
vered+0.018
ement+0.017
 spe+0.016
 symp+0.016
 aston+0.016
Neg logit contributions
oto-0.008
lic-0.008
ogo-0.007
ibi-0.007
EEP-0.007
TokenNo
Token ID2949
Feature activation+1.96e+00

Pos logit contributions
oto+0.087
lic+0.085
ibi+0.077
ogo+0.077
round+0.077
Neg logit contributions
vered-0.100
ement-0.093
 spe-0.091
 symp-0.089
 aston-0.085
Token,
Token ID11
Feature activation+1.47e-01

Pos logit contributions
vered+0.180
ement+0.170
 spe+0.163
 symp+0.161
 aston+0.154
Neg logit contributions
oto-0.177
lic-0.176
ogo-0.165
ibi-0.159
round-0.157
Token I
Token ID314
Feature activation-5.78e-01

Pos logit contributions
vered+0.017
ement+0.016
 spe+0.016
 symp+0.016
 aston+0.015
Neg logit contributions
oto-0.010
lic-0.009
ogo-0.009
ibi-0.008
EEP-0.008
Token still
Token ID991
Feature activation+2.36e+00

Pos logit contributions
oto+0.044
lic+0.043
ogo+0.040
ibi+0.038
EEP+0.038
Neg logit contributions
vered-0.061
ement-0.058
 symp-0.056
 spe-0.056
 aston-0.054
Token don
Token ID836
Feature activation+2.42e+00

Pos logit contributions
vered+0.260
ement+0.250
 symp+0.240
 spe+0.237
 aston+0.228
Neg logit contributions
oto-0.166
lic-0.164
ogo-0.148
ibi-0.145
EEP-0.145
Token't
Token ID470
Feature activation-1.60e+00

Pos logit contributions
ement+0.190
vered+0.185
 symp+0.174
el+0.170
+0.168
Neg logit contributions
oto-0.238
lic-0.228
ogo-0.222
ibi-0.219
haw-0.207
Token want
Token ID765
Feature activation+7.29e-01

Pos logit contributions
lic+0.122
oto+0.121
ogo+0.115
ibi+0.106
EEP+0.105
Neg logit contributions
vered-0.167
ement-0.154
 spe-0.154
 symp-0.150
 aston-0.145
Token to
Token ID284
Feature activation+2.09e-01

Pos logit contributions
vered+0.088
 symp+0.084
ement+0.083
 spe+0.081
 aston+0.079
Neg logit contributions
oto-0.045
lic-0.042
ogo-0.039
round-0.037
EEP-0.037
Token to
Token ID284
Feature activation+9.03e-02

Pos logit contributions
vered+0.030
ement+0.028
 spe+0.028
 symp+0.027
 aston+0.027
Neg logit contributions
oto-0.017
lic-0.017
ogo-0.015
ibi-0.015
EEP-0.015
Token put
Token ID1234
Feature activation+2.33e-01

Pos logit contributions
vered+0.010
ement+0.010
 symp+0.009
 spe+0.009
 aston+0.009
Neg logit contributions
oto-0.006
lic-0.006
ogo-0.006
ibi-0.005
EEP-0.005
Token on
Token ID319
Feature activation-2.46e-01

Pos logit contributions
vered+0.025
ement+0.024
 symp+0.024
 spe+0.024
 aston+0.023
Neg logit contributions
oto-0.016
lic-0.015
ogo-0.015
ibi-0.014
round-0.014
Token her
Token ID607
Feature activation+6.48e-01

Pos logit contributions
oto+0.014
ibi+0.012
ogo+0.011
 cool+0.011
lic+0.011
Neg logit contributions
ement-0.029
vered-0.028
 spe-0.027
 symp-0.027
ingly-0.026
Token gear
Token ID7733
Feature activation-3.05e+00

Pos logit contributions
vered+0.071
ement+0.070
 symp+0.067
 spe+0.065
 aston+0.064
Neg logit contributions
lic-0.046
oto-0.044
round-0.039
EEP-0.038
ogo-0.038
Token,
Token ID11
Feature activation-1.51e-01

Pos logit contributions
oto+0.203
lic+0.192
ogo+0.187
round+0.185
ibi+0.178
Neg logit contributions
vered-0.340
 spe-0.328
ement-0.323
 aston-0.314
 symp-0.313
Token climb
Token ID12080
Feature activation+2.49e-01

Pos logit contributions
oto+0.011
lic+0.011
ogo+0.010
ibi+0.009
round+0.009
Neg logit contributions
vered-0.016
ement-0.016
 symp-0.015
 spe-0.015
 aston-0.014
Token to
Token ID284
Feature activation+8.14e-02

Pos logit contributions
vered+0.028
ement+0.027
 symp+0.026
 aston+0.026
 spe+0.026
Neg logit contributions
oto-0.016
lic-0.015
ogo-0.015
round-0.014
ibi-0.014
Token the
Token ID262
Feature activation+2.37e-01

Pos logit contributions
vered+0.009
ement+0.009
 symp+0.008
 spe+0.008
ingly+0.008
Neg logit contributions
oto-0.005
lic-0.005
ogo-0.005
ibi-0.005
round-0.005
Token top
Token ID1353
Feature activation+1.60e+00

Pos logit contributions
vered+0.025
ement+0.023
 spe+0.023
 symp+0.022
 aston+0.022
Neg logit contributions
oto-0.018
lic-0.018
ogo-0.017
ibi-0.016
EEP-0.016
Token of
Token ID286
Feature activation+3.06e-01

Pos logit contributions
vered+0.193
 symp+0.178
ement+0.177
 spe+0.173
 aston+0.173
Neg logit contributions
lic-0.099
oto-0.091
round-0.088
ogo-0.087
EEP-0.083
Token.
Token ID13
Feature activation-7.44e-02

Pos logit contributions
oto+0.068
lic+0.067
ogo+0.063
ibi+0.061
EEP+0.060
Neg logit contributions
vered-0.063
ement-0.059
 spe-0.057
 symp-0.057
 aston-0.054
Token She
Token ID1375
Feature activation+4.14e-01

Pos logit contributions
lic+0.006
oto+0.006
ibi+0.006
ogo+0.006
EEP+0.006
Neg logit contributions
vered-0.007
ement-0.007
 spe-0.006
 symp-0.006
 aston-0.006
Token loved
Token ID6151
Feature activation-1.55e+00

Pos logit contributions
 spe+0.030
vered+0.030
 symp+0.027
 aston+0.026
el+0.024
Neg logit contributions
oto-0.043
ogo-0.043
lic-0.041
round-0.040
oys-0.039
Token to
Token ID284
Feature activation+4.67e-02

Pos logit contributions
lic+0.130
oto+0.125
ogo+0.123
ibi+0.115
EEP+0.115
Neg logit contributions
 spe-0.144
vered-0.143
 symp-0.132
ement-0.132
 aston-0.127
Token play
Token ID711
Feature activation-2.95e-01

Pos logit contributions
 spe+0.004
vered+0.004
 symp+0.004
 aston+0.004
ement+0.004
Neg logit contributions
lic-0.004
ogo-0.004
oto-0.004
EEP-0.004
oys-0.004
Token outside
Token ID2354
Feature activation-2.38e+00

Pos logit contributions
oto+0.029
lic+0.028
ogo+0.027
ibi+0.027
EEP+0.027
Neg logit contributions
vered-0.024
ement-0.023
 symp-0.022
 spe-0.022
 aston-0.020
Token with
Token ID351
Feature activation+1.56e+00

Pos logit contributions
round+0.083
oto+0.082
lic+0.072
ock+0.070
!"+0.070
Neg logit contributions
vered-0.329
ement-0.311
 aston-0.292
 distress-0.284
 symp-0.283
Token her
Token ID607
Feature activation+6.06e-01

Pos logit contributions
vered+0.100
 spe+0.091
ement+0.091
 symp+0.087
 aston+0.084
Neg logit contributions
lic-0.182
oto-0.177
ogo-0.171
EEP-0.168
round-0.166
Token friends
Token ID2460
Feature activation-2.70e-01

Pos logit contributions
vered+0.061
ement+0.058
 spe+0.057
 symp+0.056
 aston+0.053
Neg logit contributions
lic-0.049
oto-0.048
ogo-0.045
round-0.043
EEP-0.042
Token and
Token ID290
Feature activation-3.33e-01

Pos logit contributions
oto+0.018
lic+0.016
round+0.015
ogo+0.015
ibi+0.015
Neg logit contributions
vered-0.031
ement-0.029
 spe-0.029
 aston-0.028
 symp-0.028
Token neighbors
Token ID12020
Feature activation+1.80e-01

Pos logit contributions
lic+0.028
oto+0.026
ogo+0.026
round+0.024
EEP+0.024
Neg logit contributions
vered-0.032
 spe-0.031
 symp-0.029
 aston-0.029
ement-0.028
TokenRos
Token ID35740
Feature activation+1.40e-01

Pos logit contributions
oto+0.022
ogo+0.020
lic+0.020
ibi+0.019
round+0.019
Neg logit contributions
vered-0.026
 symp-0.024
ement-0.023
 spe-0.023
 aston-0.022
Tokenie
Token ID494
Feature activation-4.87e-01

Pos logit contributions
vered+0.011
ement+0.010
 symp+0.010
 spe+0.009
el+0.009
Neg logit contributions
oto-0.014
lic-0.014
round-0.013
ogo-0.013
EEP-0.013
Token replied
Token ID8712
Feature activation+1.12e+00

Pos logit contributions
oto+0.039
lic+0.038
ogo+0.035
ibi+0.034
round+0.034
Neg logit contributions
vered-0.049
ement-0.047
 spe-0.045
 symp-0.045
 aston-0.043
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
vered+0.119
 spe+0.111
ement+0.111
 aston+0.105
 symp+0.105
Neg logit contributions
lic-0.084
oto-0.083
ogo-0.072
EEP-0.071
round-0.069
Token "
Token ID366
Feature activation-1.15e+00

Pos logit contributions
oto+0.007
lic+0.007
ogo+0.006
ibi+0.006
EEP+0.006
Neg logit contributions
vered-0.017
ement-0.016
 spe-0.015
 symp-0.015
 aston-0.015
TokenI
Token ID40
Feature activation+8.55e-01

Pos logit contributions
oto+0.101
lic+0.100
ogo+0.092
ibi+0.090
round+0.089
Neg logit contributions
vered-0.109
ement-0.103
 spe-0.098
 symp-0.098
 aston-0.094
Token like
Token ID588
Feature activation+7.35e-01

Pos logit contributions
vered+0.084
 spe+0.078
ement+0.077
 symp+0.076
 aston+0.074
Neg logit contributions
oto-0.071
lic-0.070
ogo-0.067
round-0.063
EEP-0.062
Token this
Token ID428
Feature activation+1.97e-01

Pos logit contributions
vered+0.069
 spe+0.067
ement+0.063
 symp+0.062
 aston+0.060
Neg logit contributions
lic-0.064
oto-0.062
ogo-0.059
ibi-0.057
EEP-0.056
Token road
Token ID2975
Feature activation-9.23e-01

Pos logit contributions
vered+0.017
 spe+0.016
ement+0.016
 symp+0.015
 aston+0.015
Neg logit contributions
lic-0.017
oto-0.017
ogo-0.016
oys-0.015
EEP-0.015
Token too
Token ID1165
Feature activation-9.76e-01

Pos logit contributions
oto+0.072
lic+0.071
ogo+0.066
round+0.064
ibi+0.063
Neg logit contributions
vered-0.096
ement-0.089
 symp-0.088
 spe-0.086
 aston-0.085
Token,
Token ID11
Feature activation-1.22e-01

Pos logit contributions
oto+0.091
lic+0.088
ogo+0.088
ibi+0.086
oys+0.079
Neg logit contributions
ement-0.080
 spe-0.079
vered-0.076
 aston-0.071
 symp-0.070
Token upset
Token ID9247
Feature activation+2.96e+00

Pos logit contributions
lic+0.220
oto+0.218
ibi+0.200
oys+0.186
 dads+0.179
Neg logit contributions
ement-0.294
vered-0.286
 spe-0.280
-0.251
-0.247
Token and
Token ID290
Feature activation-1.81e-01

Pos logit contributions
vered+0.309
ement+0.290
 spe+0.286
 symp+0.279
 aston+0.272
Neg logit contributions
oto-0.231
lic-0.227
ogo-0.211
ibi-0.203
EEP-0.201
Token said
Token ID531
Feature activation-7.15e-01

Pos logit contributions
oto+0.014
lic+0.014
ogo+0.013
ibi+0.012
round+0.012
Neg logit contributions
vered-0.019
ement-0.018
 symp-0.017
 spe-0.017
 aston-0.017
Token,
Token ID11
Feature activation-7.99e-02

Pos logit contributions
lic+0.055
oto+0.053
EEP+0.049
ogo+0.047
oys+0.046
Neg logit contributions
vered-0.074
ement-0.070
 spe-0.069
 symp-0.067
 aston-0.066
Token "
Token ID366
Feature activation-1.24e+00

Pos logit contributions
oto+0.004
lic+0.004
ogo+0.004
ibi+0.004
EEP+0.004
Neg logit contributions
vered-0.010
ement-0.010
 spe-0.009
 symp-0.009
 aston-0.009
TokenThat
Token ID2504
Feature activation-3.45e+00

Pos logit contributions
oto+0.098
lic+0.096
ibi+0.091
EEP+0.089
round+0.087
Neg logit contributions
vered-0.127
ement-0.122
 spe-0.119
 symp-0.114
 aston-0.113
Token's
Token ID338
Feature activation+1.17e-01

Pos logit contributions
lic+0.396
oto+0.386
ogo+0.378
round+0.353
ibi+0.347
Neg logit contributions
vered-0.229
ement-0.211
 spe-0.195
 symp-0.195
el-0.184
Token not
Token ID407
Feature activation-1.58e+00

Pos logit contributions
 symp+0.011
vered+0.011
ement+0.010
 spe+0.010
 aston+0.010
Neg logit contributions
lic-0.010
oto-0.009
round-0.009
ogo-0.009
ibi-0.009
Token fair
Token ID3148
Feature activation-2.74e+00

Pos logit contributions
lic+0.120
ogo+0.118
oto+0.115
ibi+0.106
round+0.104
Neg logit contributions
vered-0.164
 symp-0.152
 spe-0.148
ement-0.147
 aston-0.145
Token!
Token ID0
Feature activation-1.21e+00

Pos logit contributions
oto+0.260
lic+0.260
ogo+0.245
ibi+0.232
round+0.231
Neg logit contributions
vered-0.235
ement-0.219
 spe-0.204
 symp-0.203
ingly-0.191
Token I
Token ID314
Feature activation-6.96e-01

Pos logit contributions
oto+0.093
lic+0.091
ogo+0.085
ibi+0.081
EEP+0.080
Neg logit contributions
vered-0.127
ement-0.120
 spe-0.116
 symp-0.116
 aston-0.111
Token,
Token ID11
Feature activation-3.17e-01

Pos logit contributions
oto+0.039
lic+0.039
ogo+0.037
ibi+0.036
round+0.035
Neg logit contributions
vered-0.036
ement-0.033
 spe-0.032
 symp-0.032
 aston-0.031
Token it
Token ID340
Feature activation-3.89e-01

Pos logit contributions
oto+0.018
lic+0.018
EEP+0.016
round+0.015
ibi+0.014
Neg logit contributions
vered-0.038
ement-0.037
 symp-0.035
 aston-0.034
 spe-0.033
Token spun
Token ID26843
Feature activation-8.54e-01

Pos logit contributions
oto+0.035
lic+0.034
ogo+0.033
ibi+0.032
round+0.032
Neg logit contributions
vered-0.036
 spe-0.033
ement-0.033
 symp-0.033
 aston-0.031
Token a
Token ID257
Feature activation-5.77e-02

Pos logit contributions
oto+0.068
lic+0.068
ogo+0.063
ibi+0.060
round+0.060
Neg logit contributions
vered-0.086
ement-0.082
 spe-0.079
 symp-0.078
 aston-0.076
Token coc
Token ID8954
Feature activation-4.31e-01

Pos logit contributions
oto+0.005
lic+0.004
ogo+0.004
ibi+0.004
oys+0.004
Neg logit contributions
vered-0.006
 spe-0.006
ement-0.005
 symp-0.005
 aston-0.005
Tokenoon
Token ID2049
Feature activation-3.82e+00

Pos logit contributions
oto+0.030
lic+0.026
ogo+0.026
EEP+0.026
ibi+0.026
Neg logit contributions
vered-0.048
ement-0.047
 spe-0.044
 symp-0.044
ingly-0.042
Token and
Token ID290
Feature activation-4.28e-01

Pos logit contributions
lic+0.278
oto+0.266
round+0.240
ogo+0.239
EEP+0.234
Neg logit contributions
vered-0.406
ement-0.390
 spe-0.380
 symp-0.377
 aston-0.373
Token stayed
Token ID9658
Feature activation-1.06e+00

Pos logit contributions
oto+0.036
lic+0.035
ogo+0.033
ibi+0.032
EEP+0.031
Neg logit contributions
vered-0.042
ement-0.040
 spe-0.039
 symp-0.038
 aston-0.037
Token inside
Token ID2641
Feature activation-2.64e+00

Pos logit contributions
oto+0.075
lic+0.073
ogo+0.069
ibi+0.064
EEP+0.061
Neg logit contributions
vered-0.114
ement-0.109
 spe-0.102
 symp-0.100
 aston-0.098
Token for
Token ID329
Feature activation-7.00e-01

Pos logit contributions
oto+0.112
lic+0.099
una+0.094
ogo+0.091
EEP+0.090
Neg logit contributions
vered-0.335
ement-0.316
 spe-0.295
 symp-0.293
 aston-0.289
Token a
Token ID257
Feature activation-8.18e-02

Pos logit contributions
oto+0.051
lic+0.050
ogo+0.046
ibi+0.043
EEP+0.043
Neg logit contributions
vered-0.076
ement-0.073
 symp-0.070
 spe-0.068
 aston-0.067
Token reach
Token ID3151
Feature activation+1.51e+00

Pos logit contributions
vered+0.029
ement+0.028
 spe+0.027
 symp+0.027
 aston+0.026
Neg logit contributions
oto-0.020
lic-0.019
ogo-0.018
ibi-0.018
round-0.018
Token.
Token ID13
Feature activation+4.74e-01

Pos logit contributions
vered+0.184
ement+0.170
 aston+0.168
 spe+0.167
 symp+0.161
Neg logit contributions
oto-0.092
ogo-0.082
ibi-0.076
EEP-0.073
lic-0.072
Token She
Token ID1375
Feature activation+7.98e-01

Pos logit contributions
 spe+0.055
vered+0.054
ement+0.049
ocol+0.047
 symp+0.046
Neg logit contributions
EEP-0.032
lic-0.031
ibi-0.029
oto-0.028
oys-0.026
Token was
Token ID373
Feature activation+1.07e+00

Pos logit contributions
ement+0.095
vered+0.092
 spe+0.089
 aston+0.089
joy+0.087
Neg logit contributions
lic-0.045
oto-0.042
ogo-0.039
ibi-0.037
EEP-0.035
Token unhappy
Token ID19283
Feature activation+4.61e+00

Pos logit contributions
vered+0.113
ement+0.108
 spe+0.099
 symp+0.098
 aston+0.096
Neg logit contributions
lic-0.080
oto-0.077
ogo-0.070
ibi-0.067
una-0.065
Token because
Token ID780
Feature activation-4.14e+00

Pos logit contributions
vered+0.492
ement+0.458
 spe+0.457
 symp+0.446
 aston+0.437
Neg logit contributions
oto-0.350
lic-0.341
ogo-0.318
ibi-0.305
EEP-0.302
Token she
Token ID673
Feature activation+1.19e+00

Pos logit contributions
oto+0.221
ibi+0.184
oys+0.182
 not+0.173
lic+0.172
Neg logit contributions
vered-0.522
 spe-0.473
 symp-0.470
ement-0.461
 aston-0.437
Token couldn
Token ID3521
Feature activation+5.54e-01

Pos logit contributions
vered+0.124
ement+0.115
 spe+0.113
 symp+0.112
 aston+0.108
Neg logit contributions
oto-0.093
lic-0.089
ogo-0.085
ibi-0.081
round-0.080
Token't
Token ID470
Feature activation-1.65e+00

Pos logit contributions
vered+0.069
ement+0.066
 spe+0.065
 symp+0.064
 aston+0.063
Neg logit contributions
oto-0.030
lic-0.029
ogo-0.028
EEP-0.026
ibi-0.026
Token get
Token ID651
Feature activation-9.83e-01

Pos logit contributions
oto+0.116
lic+0.112
ogo+0.105
round+0.101
ibi+0.101
Neg logit contributions
vered-0.183
ement-0.176
 spe-0.168
 symp-0.167
 aston-0.162
Token it
Token ID340
Feature activation-7.99e-02

Pos logit contributions
oto+0.061
ogo+0.053
lic+0.053
ibi+0.050
round+0.049
Neg logit contributions
ement-0.118
vered-0.118
 symp-0.110
 aston-0.107
 spe-0.103
Token to
Token ID284
Feature activation-1.28e-01

Pos logit contributions
oto+0.099
lic+0.093
ibi+0.087
round+0.080
ogo+0.073
Neg logit contributions
vered-0.289
ement-0.288
 spe-0.263
 symp-0.262
 aston-0.253
Token explore
Token ID7301
Feature activation-1.01e+00

Pos logit contributions
oto+0.007
ibi+0.006
lic+0.006
round+0.006
ogo+0.006
Neg logit contributions
ement-0.016
vered-0.016
 spe-0.014
 symp-0.014
ingly-0.014
Token and
Token ID290
Feature activation-5.70e-01

Pos logit contributions
oto+0.048
ock+0.041
 cool+0.034
lic+0.034
 about+0.034
Neg logit contributions
vered-0.125
 spe-0.119
-0.115
 symp-0.115
 aston-0.114
Token her
Token ID607
Feature activation+4.75e-01

Pos logit contributions
oto+0.035
lic+0.034
ibi+0.032
round+0.030
ogo+0.030
Neg logit contributions
ement-0.067
vered-0.066
 symp-0.062
 spe-0.062
ingly-0.058
Token fear
Token ID3252
Feature activation+7.25e-01

Pos logit contributions
vered+0.057
 spe+0.055
 symp+0.054
ement+0.053
ocol+0.052
Neg logit contributions
oto-0.025
ibi-0.023
ogo-0.021
 bus-0.020
lic-0.019
Token turned
Token ID2900
Feature activation-4.19e+00

Pos logit contributions
vered+0.074
ement+0.072
 symp+0.069
 spe+0.068
 aston+0.067
Neg logit contributions
lic-0.056
oto-0.053
ogo-0.048
EEP-0.048
ibi-0.047
Token to
Token ID284
Feature activation-7.91e-02

Pos logit contributions
lic+0.283
oto+0.237
ock+0.212
round+0.201
ogo+0.201
Neg logit contributions
vered-0.492
ement-0.452
-0.442
 symp-0.440
 aston-0.439
Token joy
Token ID8716
Feature activation-2.69e-01

Pos logit contributions
oto+0.005
lic+0.005
ogo+0.005
ibi+0.004
EEP+0.004
Neg logit contributions
vered-0.009
ement-0.009
 spe-0.008
 symp-0.008
 aston-0.008
Token.
Token ID13
Feature activation+6.17e-02

Pos logit contributions
oto+0.024
lic+0.023
ogo+0.021
ibi+0.021
round+0.020
Neg logit contributions
vered-0.026
ement-0.024
 spe-0.023
 symp-0.023
 aston-0.022
Token She
Token ID1375
Feature activation+4.36e-01

Pos logit contributions
vered+0.006
ement+0.006
 spe+0.006
 symp+0.006
 aston+0.005
Neg logit contributions
lic-0.005
oto-0.005
ibi-0.004
EEP-0.004
round-0.004
Token ran
Token ID4966
Feature activation-6.42e-01

Pos logit contributions
ement+0.049
vered+0.048
 symp+0.045
 spe+0.045
 aston+0.043
Neg logit contributions
lic-0.028
oto-0.027
ogo-0.025
ibi-0.024
round-0.023
Token said
Token ID531
Feature activation-8.21e-01

Pos logit contributions
oto+0.044
lic+0.043
ogo+0.042
ibi+0.040
EEP+0.040
Neg logit contributions
vered-0.042
 symp-0.039
 spe-0.039
ement-0.038
 aston-0.036
Token okay
Token ID8788
Feature activation+1.11e+00

Pos logit contributions
lic+0.053
oto+0.053
EEP+0.046
ogo+0.043
ibi+0.043
Neg logit contributions
vered-0.096
ement-0.091
 spe-0.088
 symp-0.088
 aston-0.086
Token.
Token ID13
Feature activation+1.18e-01

Pos logit contributions
vered+0.095
ement+0.089
 symp+0.086
 spe+0.085
 aston+0.081
Neg logit contributions
oto-0.105
lic-0.105
ogo-0.099
ibi-0.096
EEP-0.095
Token
Token ID198
Feature activation-4.86e-01

Pos logit contributions
vered+0.010
ement+0.010
 symp+0.009
 spe+0.009
 aston+0.009
Neg logit contributions
oto-0.011
lic-0.011
ogo-0.010
ibi-0.010
round-0.009
Token
Token ID198
Feature activation-4.80e-01

Pos logit contributions
oto+0.046
lic+0.045
ogo+0.044
EEP+0.042
round+0.041
Neg logit contributions
vered-0.037
ement-0.036
 symp-0.035
 spe-0.033
 aston-0.032
TokenAs
Token ID1722
Feature activation-5.46e+00

Pos logit contributions
ogo+0.039
oto+0.039
lic+0.037
ibi+0.034
una+0.033
Neg logit contributions
vered-0.044
 symp-0.043
ement-0.040
ingly-0.039
 spe-0.039
Token they
Token ID484
Feature activation-6.26e-01

Pos logit contributions
oto+0.368
lic+0.354
ogo+0.327
ibi+0.317
round+0.311
Neg logit contributions
vered-0.639
ement-0.602
 symp-0.579
 spe-0.572
 aston-0.558
Token were
Token ID547
Feature activation+7.17e-01

Pos logit contributions
lic+0.047
oto+0.043
round+0.040
ibi+0.039
ogo+0.039
Neg logit contributions
vered-0.065
ement-0.063
 spe-0.061
 symp-0.060
 aston-0.058
Token reading
Token ID3555
Feature activation+5.45e-01

Pos logit contributions
vered+0.068
ement+0.063
 spe+0.061
 symp+0.060
 aston+0.058
Neg logit contributions
lic-0.063
oto-0.063
ogo-0.059
ibi-0.056
round-0.056
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
vered+0.065
 symp+0.061
ement+0.060
 spe+0.059
 aston+0.057
Neg logit contributions
oto-0.030
lic-0.030
 robber-0.027
round-0.027
oys-0.026
Token Tim
Token ID5045
Feature activation-2.77e-01

Pos logit contributions
lic+0.011
oto+0.011
round+0.009
ibi+0.009
EEP+0.008
Neg logit contributions
vered-0.036
ement-0.034
 symp-0.033
 aston-0.032
ingly-0.032
Token but
Token ID475
Feature activation-4.05e-01

Pos logit contributions
oto+0.018
lic+0.018
ogo+0.016
ibi+0.016
EEP+0.016
Neg logit contributions
vered-0.029
ement-0.027
 spe-0.026
 symp-0.026
 aston-0.025
Token he
Token ID339
Feature activation-4.06e-01

Pos logit contributions
oto+0.020
lic+0.018
ogo+0.016
ibi+0.015
round+0.015
Neg logit contributions
vered-0.054
ement-0.052
 symp-0.050
 spe-0.050
 aston-0.048
Token couldn
Token ID3521
Feature activation+4.35e-01

Pos logit contributions
oto+0.029
lic+0.028
ogo+0.026
ibi+0.025
EEP+0.025
Neg logit contributions
vered-0.045
ement-0.043
 spe-0.041
 symp-0.041
 aston-0.040
Token't
Token ID470
Feature activation-1.80e+00

Pos logit contributions
vered+0.051
ement+0.048
 spe+0.048
 symp+0.047
 aston+0.046
Neg logit contributions
oto-0.027
lic-0.027
ogo-0.025
EEP-0.024
ibi-0.024
Token move
Token ID1445
Feature activation-1.43e+00

Pos logit contributions
oto+0.135
lic+0.133
ogo+0.124
ibi+0.119
EEP+0.117
Neg logit contributions
vered-0.190
ement-0.180
 spe-0.175
 symp-0.174
 aston-0.167
Token because
Token ID780
Feature activation-4.37e+00

Pos logit contributions
oto+0.096
lic+0.089
ogo+0.087
ock+0.084
round+0.084
Neg logit contributions
vered-0.151
ement-0.149
 spe-0.147
 aston-0.145
 symp-0.143
Token he
Token ID339
Feature activation-4.21e-01

Pos logit contributions
oto+0.189
lic+0.174
ogo+0.155
round+0.152
ibi+0.141
Neg logit contributions
vered-0.603
 symp-0.579
ement-0.571
 spe-0.554
 aston-0.545
Token was
Token ID373
Feature activation+7.16e-01

Pos logit contributions
oto+0.032
lic+0.032
ogo+0.030
ibi+0.029
EEP+0.028
Neg logit contributions
vered-0.044
ement-0.041
 spe-0.040
 symp-0.040
 aston-0.038
Token too
Token ID1165
Feature activation-1.09e+00

Pos logit contributions
vered+0.072
ement+0.068
 symp+0.065
 spe+0.065
 aston+0.062
Neg logit contributions
oto-0.058
lic-0.057
ogo-0.054
ibi-0.051
round-0.050
Token scared
Token ID12008
Feature activation+2.27e-01

Pos logit contributions
oto+0.095
lic+0.094
ogo+0.089
ibi+0.085
EEP+0.084
Neg logit contributions
vered-0.102
ement-0.095
 spe-0.093
 symp-0.092
 aston-0.088
Token.
Token ID13
Feature activation+7.80e-03

Pos logit contributions
vered+0.028
ement+0.026
 spe+0.026
 symp+0.025
 aston+0.025
Neg logit contributions
oto-0.014
lic-0.013
ogo-0.012
ock-0.012
EEP-0.012