TOP ACTIVATIONS
MAX = 7.786

It was poison he ml ock . It was very dangerous
It was an incredible peac ock ! Lily was so happy
dream called a " night mare ". She woke up crying
idea of having a peac ock tail . She let her
to watch tick - t ock . One day , Lily
Mary stopped . " St rip . Hot !"
a cup and a sa ucer . " Be careful ,
The peac ock was now even more gorgeous
. " Kn ock knock ," Lily says .
look at in the padd ock . The farmer smiled and
treats !" St rip ran away with the stolen
stuck in a small padd ock , far away from the
. It is a peac ock . It is proud of
made a tick - t ock sound . He wore it
, I do !" S ock y says . " What
feathers . But the peac ock was very impressive ! He
Kn ock , knock , I 'm
your hair into a peac ock tail with clips and rib
Katie put on her sm ock and went outside . She
Mama said , " St rip , you naughty k itty

MIN ACTIVATIONS
MIN = -10.306

happy and the nation pro spe red . <|endoftext|> Once upon
had to help the town spe ople . He quickly formulated
. The town spe ople were sad because they
bell stopped . The town spe ople cheered and hugged each
. He asked the town spe ople what was wrong and
. The town spe ople decided to ask their
. The town spe ople soon discovered where the
So , the town spe ople looked for Old Tom
The town spe ople thought and thought until
rope bouncing . The town spe ople gathered around to watch
! The town spe ople were sad and scared
it sang , the town spe ople would act nice .
! The town spe ople thanked the hero for
. The town spe ople were so happy they
. The town spe ople worked together to help
. The town spe ople were relieved . They
shine through . The town spe ople were very happy and
. The town spe ople were amazed by this
â <|endoftext|> The town spe ople were walking quickly down
safe again . The town spe ople thanked the bear many

INTERVAL 3.263 - 7.786
CONTAINS 0.785%

Tim and Tom think about Mom 's idea . They
in the ground and the water coming out of it .
and filled her with stories about the wonder clay . After
the man said . " That handle belongs to my axe
. It sounds like " t weet , tweet , tweet

INTERVAL -1.260 - 3.263
CONTAINS 89.256%

waved goodbye to her friends and went home . She couldn
other around , laughed and built sand cast les . Soon
day , they find a big steel slide in the park
has a scratch on his hand , but he is fine
and threw it at the shark 's nose .

INTERVAL -5.783 - -1.260
CONTAINS 9.928%

and pizza . John always thanked her . At
as he dropped into the pond . And
to protect Kitty from the itch , so he decided to
on the slide first , then we can go on the
. Her mom smiles . She hugs Lily .

INTERVAL -10.306 - -5.783
CONTAINS 0.032%

shouldn 't have stolen the cherry . Next time , just
them put the cards in envelop es and write grandma 's
That 's good , my de ars . But you still
looked she saw people shouting angrily and having terrible fights .
an important lesson â pat ience pays off
Token It
Token ID632
Feature activation+1.29e+00

Pos logit contributions
lic+0.011
oto+0.011
ibi+0.010
round+0.010
EEP+0.010
Neg logit contributions
vered-0.020
ement-0.019
 spe-0.019
 symp-0.018
 aston-0.017
Token was
Token ID373
Feature activation-3.69e-01

Pos logit contributions
lic+0.073
oto+0.066
ogo+0.061
round+0.061
haw+0.061
Neg logit contributions
ement-0.151
vered-0.145
 symp-0.142
 aston-0.137
ingly-0.135
Token poison
Token ID8764
Feature activation-8.05e-01

Pos logit contributions
vered+0.043
ement+0.041
 symp+0.039
 spe+0.037
 aston+0.036
Neg logit contributions
oto-0.024
lic-0.024
ibi-0.021
ogo-0.021
oys-0.019
Token he
Token ID339
Feature activation+6.24e-01

Pos logit contributions
vered+0.080
ement+0.078
 symp+0.077
 aston+0.075
 spe+0.074
Neg logit contributions
lic-0.060
oto-0.059
ogo-0.055
ibi-0.052
round-0.051
Tokenml
Token ID4029
Feature activation+2.65e+00

Pos logit contributions
lic+0.041
oto+0.040
ogo+0.037
ibi+0.036
EEP+0.034
Neg logit contributions
ement-0.069
vered-0.068
 symp-0.066
 spe-0.064
 aston-0.063
Tokenock
Token ID735
Feature activation+7.57e+00

Pos logit contributions
oto+0.135
lic+0.134
ibi+0.120
ogo+0.120
round+0.112
Neg logit contributions
vered-0.326
ement-0.320
 symp-0.309
 spe-0.307
 aston-0.296
Token.
Token ID13
Feature activation-8.13e-02

Pos logit contributions
oto+0.613
lic+0.609
ogo+0.571
ibi+0.558
round+0.540
Neg logit contributions
vered-0.710
ement-0.696
 symp-0.672
 spe-0.668
 aston-0.653
Token It
Token ID632
Feature activation+1.08e+00

Pos logit contributions
vered+0.009
ement+0.009
 spe+0.009
 symp+0.009
 aston+0.008
Neg logit contributions
lic-0.005
oto-0.005
ibi-0.004
ogo-0.004
round-0.004
Token was
Token ID373
Feature activation-6.17e-01

Pos logit contributions
lic+0.067
oto+0.062
ogo+0.057
ibi+0.053
una+0.051
Neg logit contributions
ement-0.127
vered-0.123
 symp-0.118
 aston-0.114
joy-0.110
Token very
Token ID845
Feature activation+4.67e-01

Pos logit contributions
vered+0.070
ement+0.070
 symp+0.064
 aston+0.061
 spe+0.060
Neg logit contributions
lic-0.040
oto-0.040
ogo-0.036
una-0.032
ibi-0.032
Token dangerous
Token ID4923
Feature activation-4.36e-01

Pos logit contributions
lic+0.033
oto+0.032
ogo+0.028
ibi+0.027
EEP+0.026
Neg logit contributions
vered-0.051
ement-0.050
 symp-0.046
 aston-0.045
 spe-0.044
Token It
Token ID632
Feature activation+1.22e+00

Pos logit contributions
lic+0.002
oto+0.002
ibi+0.002
round+0.002
EEP+0.002
Neg logit contributions
vered-0.004
ement-0.004
 spe-0.003
 symp-0.003
 aston-0.003
Token was
Token ID373
Feature activation-4.90e-01

Pos logit contributions
lic+0.080
oto+0.073
ibi+0.065
ogo+0.064
round+0.063
Neg logit contributions
ement-0.137
vered-0.134
 symp-0.130
 aston-0.124
 spe-0.122
Token an
Token ID281
Feature activation-9.52e-01

Pos logit contributions
vered+0.057
ement+0.055
 symp+0.053
 aston+0.049
 spe+0.049
Neg logit contributions
oto-0.032
lic-0.031
ogo-0.028
ibi-0.027
ruck-0.024
Token incredible
Token ID8082
Feature activation+1.46e+00

Pos logit contributions
vered+0.104
ement+0.097
 symp+0.096
 spe+0.094
 aston+0.091
Neg logit contributions
oto-0.067
lic-0.058
ibi-0.057
ogo-0.057
round-0.057
Token peac
Token ID41961
Feature activation-7.61e-01

Pos logit contributions
oto+0.096
round+0.085
lic+0.083
ibi+0.081
ruck+0.081
Neg logit contributions
vered-0.166
 symp-0.151
ement-0.149
 aston-0.141
bugs-0.138
Tokenock
Token ID735
Feature activation+7.47e+00

Pos logit contributions
ement+0.104
vered+0.102
 symp+0.097
 spe+0.096
el+0.096
Neg logit contributions
oto-0.028
ibi-0.023
lic-0.023
ogo-0.022
round-0.021
Token!
Token ID0
Feature activation+1.44e+00

Pos logit contributions
lic+0.533
oto+0.520
round+0.478
ogo+0.474
ibi+0.466
Neg logit contributions
vered-0.760
ement-0.739
 symp-0.729
 aston-0.721
 spe-0.694
Token Lily
Token ID20037
Feature activation+6.17e-01

Pos logit contributions
lic+0.103
oto+0.099
ibi+0.094
round+0.092
ogo+0.089
Neg logit contributions
vered-0.154
ement-0.147
 spe-0.144
 symp-0.141
 aston-0.135
Token was
Token ID373
Feature activation-6.75e-01

Pos logit contributions
lic+0.045
oto+0.045
ibi+0.041
ogo+0.041
round+0.039
Neg logit contributions
vered-0.063
ement-0.062
 spe-0.060
 symp-0.059
 aston-0.057
Token so
Token ID523
Feature activation-3.33e-01

Pos logit contributions
vered+0.080
ement+0.077
 symp+0.075
 spe+0.072
 aston+0.070
Neg logit contributions
lic-0.040
oto-0.039
ogo-0.034
ibi-0.034
round-0.031
Token happy
Token ID3772
Feature activation-6.68e-01

Pos logit contributions
vered+0.036
ement+0.035
 symp+0.034
 spe+0.032
el+0.031
Neg logit contributions
lic-0.024
oto-0.023
ibi-0.021
ogo-0.019
round-0.019
Token dream
Token ID4320
Feature activation-1.25e+00

Pos logit contributions
lic+0.021
oto+0.021
ogo+0.019
ibi+0.019
round+0.019
Neg logit contributions
vered-0.021
ement-0.020
 spe-0.019
 symp-0.019
 aston-0.018
Token called
Token ID1444
Feature activation+1.54e+00

Pos logit contributions
vered+0.092
ement+0.091
 spe+0.088
 symp+0.087
ingly+0.078
Neg logit contributions
oto-0.121
lic-0.119
ogo-0.116
ibi-0.113
round-0.108
Token a
Token ID257
Feature activation+6.17e-02

Pos logit contributions
oto+0.123
lic+0.123
ogo+0.115
ibi+0.114
round+0.110
Neg logit contributions
vered-0.146
ement-0.140
 spe-0.138
 symp-0.137
 aston-0.130
Token "
Token ID366
Feature activation+1.17e+00

Pos logit contributions
oto+0.005
lic+0.005
ogo+0.005
ibi+0.005
round+0.004
Neg logit contributions
 spe-0.006
vered-0.005
ement-0.005
 symp-0.005
 aston-0.005
Tokennight
Token ID3847
Feature activation+2.82e+00

Pos logit contributions
lic+0.097
oto+0.095
ibi+0.090
round+0.088
ogo+0.085
Neg logit contributions
ement-0.111
vered-0.109
 symp-0.105
 aston-0.103
 spe-0.102
Tokenmare
Token ID11449
Feature activation+7.45e+00

Pos logit contributions
lic+0.159
oto+0.158
ogo+0.149
ibi+0.149
round+0.131
Neg logit contributions
vered-0.325
 spe-0.321
ement-0.315
 symp-0.314
 aston-0.298
Token".
Token ID1911
Feature activation+2.03e+00

Pos logit contributions
oto+0.527
ogo+0.494
ibi+0.484
lic+0.480
round+0.430
Neg logit contributions
vered-0.774
ement-0.741
 spe-0.740
 symp-0.738
 aston-0.701
Token She
Token ID1375
Feature activation-5.45e-01

Pos logit contributions
oto+0.226
lic+0.220
ogo+0.218
ibi+0.204
round+0.203
Neg logit contributions
vered-0.132
ement-0.127
 symp-0.127
 spe-0.116
 aston-0.113
Token woke
Token ID19092
Feature activation+3.76e+00

Pos logit contributions
vered+0.055
 symp+0.052
 spe+0.052
ement+0.052
 aston+0.049
Neg logit contributions
oto-0.041
lic-0.040
ibi-0.038
ogo-0.038
round-0.037
Token up
Token ID510
Feature activation-1.18e-01

Pos logit contributions
lic+0.344
oto+0.340
ibi+0.318
ogo+0.318
round+0.308
Neg logit contributions
vered-0.318
ement-0.309
 symp-0.299
 spe-0.294
 aston-0.278
Token crying
Token ID13774
Feature activation-2.09e+00

Pos logit contributions
vered+0.013
ement+0.012
 spe+0.012
 symp+0.011
 aston+0.011
Neg logit contributions
oto-0.008
lic-0.008
ogo-0.007
ibi-0.007
round-0.007
Token idea
Token ID2126
Feature activation-2.64e+00

Pos logit contributions
vered+0.015
ement+0.014
 symp+0.014
 spe+0.013
 aston+0.013
Neg logit contributions
oto-0.011
lic-0.011
ibi-0.010
ogo-0.010
EEP-0.009
Token of
Token ID286
Feature activation-1.01e-01

Pos logit contributions
vered+0.301
 spe+0.287
 symp+0.279
ement+0.276
 aston+0.265
Neg logit contributions
oto-0.159
lic-0.149
oys-0.143
ibi-0.140
round-0.130
Token having
Token ID1719
Feature activation+4.66e-01

Pos logit contributions
vered+0.011
ement+0.010
 symp+0.010
ingly+0.010
 aston+0.009
Neg logit contributions
oto-0.007
lic-0.007
ogo-0.007
ibi-0.007
round-0.006
Token a
Token ID257
Feature activation+9.14e-02

Pos logit contributions
oto+0.023
lic+0.023
ibi+0.021
ogo+0.017
 Indians+0.017
Neg logit contributions
vered-0.058
ement-0.057
 symp-0.057
ingly-0.054
 aston-0.052
Token peac
Token ID41961
Feature activation-8.46e-01

Pos logit contributions
lic+0.007
oto+0.006
ibi+0.006
ogo+0.006
round+0.006
Neg logit contributions
vered-0.010
ement-0.009
 symp-0.009
 aston-0.009
ingly-0.009
Tokenock
Token ID735
Feature activation+7.42e+00

Pos logit contributions
ement+0.121
vered+0.118
 spe+0.113
 symp+0.112
el+0.109
Neg logit contributions
oto-0.026
lic-0.022
ibi-0.022
ogo-0.021
EEP-0.020
Token tail
Token ID7894
Feature activation-1.87e-01

Pos logit contributions
lic+0.427
oto+0.422
ibi+0.397
round+0.387
ock+0.375
Neg logit contributions
vered-0.809
 aston-0.808
ement-0.799
 symp-0.794
 spe-0.760
Token.
Token ID13
Feature activation-1.93e-01

Pos logit contributions
vered+0.022
 symp+0.021
ement+0.021
 aston+0.021
 spe+0.020
Neg logit contributions
oto-0.010
lic-0.010
round-0.009
ogo-0.009
ibi-0.009
Token She
Token ID1375
Feature activation-5.27e-01

Pos logit contributions
vered+0.018
ement+0.017
 spe+0.017
 symp+0.016
 aston+0.016
Neg logit contributions
lic-0.017
oto-0.016
ibi-0.015
ogo-0.015
EEP-0.015
Token let
Token ID1309
Feature activation+2.33e+00

Pos logit contributions
vered+0.059
ement+0.058
 symp+0.056
 spe+0.055
 aston+0.053
Neg logit contributions
oto-0.034
lic-0.034
ibi-0.031
ogo-0.030
round-0.029
Token her
Token ID607
Feature activation-5.32e-01

Pos logit contributions
oto+0.146
lic+0.139
ibi+0.133
ogo+0.132
EEP+0.122
Neg logit contributions
vered-0.268
ement-0.263
 spe-0.253
 symp-0.249
 aston-0.243
Token to
Token ID284
Feature activation-1.26e-02

Pos logit contributions
oto+0.084
lic+0.081
ibi+0.074
ogo+0.072
round+0.071
Neg logit contributions
vered-0.177
ement-0.172
 symp-0.164
 aston-0.162
 spe-0.160
Token watch
Token ID2342
Feature activation+2.82e+00

Pos logit contributions
vered+0.001
 spe+0.001
ement+0.001
 symp+0.001
 aston+0.001
Neg logit contributions
lic-0.001
oto-0.001
ogo-0.001
ibi-0.001
oys-0.001
Token tick
Token ID4378
Feature activation-4.89e-01

Pos logit contributions
oto+0.211
lic+0.210
ibi+0.189
ogo+0.188
round+0.185
Neg logit contributions
vered-0.291
ement-0.269
 spe-0.263
 symp-0.261
 aston-0.254
Token-
Token ID12
Feature activation+1.30e+00

Pos logit contributions
vered+0.043
ement+0.041
 spe+0.040
 symp+0.040
 aston+0.038
Neg logit contributions
oto-0.043
lic-0.043
ogo-0.040
ibi-0.040
round-0.039
Tokent
Token ID83
Feature activation+3.55e+00

Pos logit contributions
oto+0.110
ogo+0.108
lic+0.101
oys+0.100
ibi+0.098
Neg logit contributions
vered-0.112
 symp-0.098
 spe-0.097
ement-0.097
ingly-0.094
Tokenock
Token ID735
Feature activation+7.41e+00

Pos logit contributions
lic+0.248
oto+0.227
EEP+0.227
ogo+0.224
ibi+0.223
Neg logit contributions
el-0.355
vered-0.345
ement-0.344
ingly-0.327
 symp-0.321
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
lic+0.574
oto+0.571
ogo+0.526
ibi+0.520
round+0.505
Neg logit contributions
vered-0.732
ement-0.703
 spe-0.686
 symp-0.677
 aston-0.660
Token One
Token ID1881
Feature activation-1.99e+00

Pos logit contributions
vered+0.016
 spe+0.014
ement+0.014
 symp+0.013
 aston+0.013
Neg logit contributions
oto-0.021
lic-0.021
ibi-0.021
ogo-0.020
round-0.020
Token day
Token ID1110
Feature activation+5.04e-01

Pos logit contributions
vered+0.151
 spe+0.151
ement+0.150
 symp+0.149
el+0.146
Neg logit contributions
lic-0.194
ogo-0.186
oto-0.177
EEP-0.164
ibi-0.162
Token,
Token ID11
Feature activation+2.07e-01

Pos logit contributions
oto+0.054
lic+0.053
ogo+0.052
ibi+0.050
round+0.049
Neg logit contributions
vered-0.036
ement-0.033
 spe-0.033
 symp-0.032
 aston-0.031
Token Lily
Token ID20037
Feature activation+6.06e-01

Pos logit contributions
lic+0.014
oto+0.014
ibi+0.013
EEP+0.012
round+0.012
Neg logit contributions
vered-0.023
ement-0.022
 spe-0.021
 symp-0.021
 aston-0.021
TokenMary
Token ID24119
Feature activation+5.92e-01

Pos logit contributions
oto+0.025
ogo+0.025
lic+0.024
ibi+0.024
round+0.022
Neg logit contributions
vered-0.027
ement-0.025
 aston-0.025
 symp-0.025
 spe-0.024
Token stopped
Token ID5025
Feature activation+2.09e+00

Pos logit contributions
lic+0.039
oto+0.037
round+0.034
ibi+0.033
ogo+0.033
Neg logit contributions
ement-0.064
vered-0.063
 symp-0.061
 spe-0.060
 aston-0.058
Token.
Token ID13
Feature activation-2.19e-01

Pos logit contributions
oto+0.114
ruck+0.091
lic+0.089
 listen+0.088
ogo+0.086
Neg logit contributions
vered-0.255
ement-0.253
 spe-0.226
-0.220
 bidder-0.213
Token "
Token ID366
Feature activation+1.17e+00

Pos logit contributions
vered+0.023
ement+0.022
 spe+0.021
 symp+0.021
 aston+0.020
Neg logit contributions
lic-0.016
oto-0.016
ibi-0.015
ogo-0.014
round-0.014
TokenSt
Token ID1273
Feature activation+2.79e-02

Pos logit contributions
lic+0.097
oto+0.096
ibi+0.090
round+0.088
ogo+0.086
Neg logit contributions
vered-0.109
ement-0.107
 symp-0.101
 spe-0.101
 aston-0.095
Tokenrip
Token ID5528
Feature activation+7.40e+00

Pos logit contributions
lic+0.002
oto+0.002
ibi+0.002
ogo+0.002
round+0.002
Neg logit contributions
vered-0.003
ement-0.002
 symp-0.002
el-0.002
 spe-0.002
Token.
Token ID13
Feature activation-2.04e-01

Pos logit contributions
lic+0.543
oto+0.536
ogo+0.496
ibi+0.483
round+0.471
Neg logit contributions
vered-0.758
ement-0.733
 symp-0.711
 spe-0.710
 aston-0.683
Token Hot
Token ID6964
Feature activation+9.25e-01

Pos logit contributions
vered+0.018
ement+0.018
 spe+0.017
 symp+0.017
 aston+0.016
Neg logit contributions
oto-0.018
lic-0.017
ogo-0.017
ibi-0.016
round-0.015
Token!"
Token ID2474
Feature activation+2.05e+00

Pos logit contributions
oto+0.066
lic+0.063
ibi+0.061
ogo+0.061
round+0.056
Neg logit contributions
vered-0.098
ement-0.094
 spe-0.093
 symp-0.090
 aston-0.086
Token
Token ID198
Feature activation+3.23e-01

Pos logit contributions
lic+0.165
oto+0.165
ibi+0.154
round+0.148
ogo+0.148
Neg logit contributions
vered-0.195
ement-0.185
 spe-0.182
 symp-0.180
 aston-0.169
Token
Token ID198
Feature activation+3.30e-01

Pos logit contributions
oto+0.030
lic+0.029
ogo+0.029
ibi+0.027
round+0.027
Neg logit contributions
vered-0.025
ement-0.024
 symp-0.024
 aston-0.023
 spe-0.022
Token a
Token ID257
Feature activation+4.30e-02

Pos logit contributions
vered+0.198
ement+0.197
 spe+0.190
 symp+0.185
 aston+0.177
Neg logit contributions
oto-0.162
lic-0.153
ogo-0.147
ibi-0.144
round-0.138
Token cup
Token ID6508
Feature activation-2.12e-01

Pos logit contributions
oto+0.004
lic+0.004
ogo+0.004
ibi+0.004
round+0.003
Neg logit contributions
vered-0.004
ement-0.004
 spe-0.004
 symp-0.003
 aston-0.003
Token and
Token ID290
Feature activation+2.35e-01

Pos logit contributions
vered+0.022
ement+0.021
 spe+0.021
 symp+0.020
 aston+0.020
Neg logit contributions
oto-0.015
lic-0.015
ogo-0.014
ibi-0.014
round-0.014
Token a
Token ID257
Feature activation+5.28e-02

Pos logit contributions
lic+0.016
oto+0.016
ibi+0.014
ogo+0.014
EEP+0.014
Neg logit contributions
vered-0.025
ement-0.025
 symp-0.024
 spe-0.023
 aston-0.022
Token sa
Token ID473
Feature activation-1.45e+00

Pos logit contributions
lic+0.004
oto+0.004
ibi+0.004
ogo+0.004
round+0.004
Neg logit contributions
vered-0.005
ement-0.005
 symp-0.005
 spe-0.005
 aston-0.004
Tokenucer
Token ID48915
Feature activation+7.40e+00

Pos logit contributions
vered+0.161
ement+0.155
 symp+0.151
 spe+0.150
el+0.145
Neg logit contributions
lic-0.093
oto-0.092
ibi-0.084
ogo-0.083
round-0.081
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
lic+0.498
oto+0.480
ogo+0.435
oys+0.434
round+0.424
Neg logit contributions
vered-0.791
ement-0.762
 symp-0.740
 afar-0.729
 aston-0.728
Token "
Token ID366
Feature activation+1.16e+00

Pos logit contributions
vered+0.024
ement+0.023
 symp+0.023
 spe+0.023
 aston+0.022
Neg logit contributions
oto-0.016
lic-0.016
ogo-0.015
ibi-0.015
round-0.014
TokenBe
Token ID3856
Feature activation+9.07e-01

Pos logit contributions
lic+0.094
oto+0.093
round+0.086
ibi+0.086
EEP+0.082
Neg logit contributions
ement-0.112
vered-0.111
 symp-0.104
 spe-0.103
 aston-0.098
Token careful
Token ID8161
Feature activation+1.13e+00

Pos logit contributions
oto+0.052
lic+0.052
ogo+0.046
ibi+0.045
round+0.042
Neg logit contributions
vered-0.108
ement-0.106
 spe-0.103
 symp-0.102
 aston-0.098
Token,
Token ID11
Feature activation+1.91e-01

Pos logit contributions
oto+0.096
lic+0.095
ogo+0.089
ibi+0.088
round+0.086
Neg logit contributions
vered-0.105
ement-0.101
 spe-0.098
 symp-0.097
 aston-0.093
Token
Token ID220
Feature activation-1.18e+00

Pos logit contributions
vered+0.023
ement+0.022
 spe+0.021
 symp+0.021
 aston+0.020
Neg logit contributions
oto-0.018
lic-0.018
ibi-0.017
ogo-0.017
round-0.016
Token
Token ID198
Feature activation+2.97e-01

Pos logit contributions
vered+0.115
ement+0.111
 spe+0.108
 symp+0.107
 aston+0.102
Neg logit contributions
oto-0.093
lic-0.093
ogo-0.086
ibi-0.085
round-0.082
Token
Token ID198
Feature activation+3.00e-01

Pos logit contributions
oto+0.025
lic+0.024
ogo+0.023
ibi+0.023
round+0.022
Neg logit contributions
vered-0.028
ement-0.026
 symp-0.026
 spe-0.026
 aston-0.025
TokenThe
Token ID464
Feature activation-1.73e-01

Pos logit contributions
oto+0.024
lic+0.024
ogo+0.022
ibi+0.022
round+0.021
Neg logit contributions
vered-0.029
ement-0.028
 spe-0.027
 symp-0.027
 aston-0.026
Token peac
Token ID41961
Feature activation-8.66e-01

Pos logit contributions
vered+0.018
ement+0.017
 symp+0.016
 spe+0.016
 aston+0.016
Neg logit contributions
oto-0.013
lic-0.012
ibi-0.012
ogo-0.011
EEP-0.011
Tokenock
Token ID735
Feature activation+7.39e+00

Pos logit contributions
ement+0.120
vered+0.120
 symp+0.113
 spe+0.112
el+0.109
Neg logit contributions
oto-0.031
lic-0.026
ibi-0.026
EEP-0.025
ogo-0.024
Token was
Token ID373
Feature activation-6.97e-01

Pos logit contributions
lic+0.549
oto+0.541
ibi+0.496
ogo+0.495
round+0.474
Neg logit contributions
vered-0.742
ement-0.723
 symp-0.701
 spe-0.699
 aston-0.676
Token now
Token ID783
Feature activation-6.12e-01

Pos logit contributions
vered+0.078
ement+0.074
 symp+0.073
 spe+0.070
 aston+0.066
Neg logit contributions
lic-0.045
oto-0.045
ibi-0.040
ogo-0.039
round-0.036
Token even
Token ID772
Feature activation-1.59e+00

Pos logit contributions
vered+0.065
ement+0.064
 symp+0.062
 spe+0.059
 aston+0.057
Neg logit contributions
lic-0.042
oto-0.042
ibi-0.037
ogo-0.037
round-0.034
Token more
Token ID517
Feature activation+3.26e-01

Pos logit contributions
vered+0.154
ement+0.152
 symp+0.144
+0.140
 spe+0.137
Neg logit contributions
lic-0.129
oto-0.117
ibi-0.106
ock-0.102
ogo-0.101
Token gorgeous
Token ID18857
Feature activation-1.36e+00

Pos logit contributions
oto+0.023
lic+0.023
ibi+0.020
ogo+0.020
round+0.019
Neg logit contributions
vered-0.035
 symp-0.033
ement-0.033
 spe-0.031
 aston-0.030
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
oto+0.102
lic+0.100
ogo+0.097
ibi+0.097
round+0.092
Neg logit contributions
vered-0.078
ement-0.076
 spe-0.071
 symp-0.071
 aston-0.065
Token
Token ID198
Feature activation+2.95e-01

Pos logit contributions
vered+0.026
ement+0.025
 spe+0.024
 symp+0.024
 aston+0.023
Neg logit contributions
lic-0.017
oto-0.016
ibi-0.015
EEP-0.014
ogo-0.014
Token
Token ID198
Feature activation+2.92e-01

Pos logit contributions
oto+0.026
lic+0.025
ogo+0.024
ibi+0.024
round+0.023
Neg logit contributions
vered-0.026
ement-0.025
 spe-0.024
 symp-0.024
 aston-0.023
Token"
Token ID1
Feature activation+1.13e+00

Pos logit contributions
oto+0.023
lic+0.021
ogo+0.021
ibi+0.020
round+0.019
Neg logit contributions
vered-0.029
ement-0.028
 spe-0.027
 symp-0.027
 aston-0.026
TokenKn
Token ID25095
Feature activation+1.96e+00

Pos logit contributions
lic+0.095
oto+0.094
ibi+0.087
round+0.084
ogo+0.083
Neg logit contributions
vered-0.107
ement-0.103
 symp-0.099
 spe-0.097
 aston-0.093
Tokenock
Token ID735
Feature activation+7.39e+00

Pos logit contributions
 Indians+0.115
oto+0.108
haw+0.101
ibi+0.100
ogo+0.098
Neg logit contributions
ement-0.231
ingly-0.206
vered-0.201
 symp-0.200
el-0.192
Token knock
Token ID10643
Feature activation-1.21e+00

Pos logit contributions
oto+0.507
lic+0.503
ogo+0.463
ibi+0.459
round+0.437
Neg logit contributions
vered-0.792
ement-0.765
 spe-0.746
 symp-0.743
 aston-0.711
Token,"
Token ID553
Feature activation-2.93e+00

Pos logit contributions
vered+0.129
ement+0.124
 spe+0.121
 symp+0.121
 aston+0.116
Neg logit contributions
oto-0.083
lic-0.083
ogo-0.076
ibi-0.075
round-0.072
Token Lily
Token ID20037
Feature activation+5.79e-01

Pos logit contributions
vered+0.393
ement+0.380
 symp+0.366
 spe+0.364
 aston+0.344
Neg logit contributions
oto-0.130
lic-0.117
ibi-0.110
oys-0.104
EEP-0.089
Token says
Token ID1139
Feature activation-2.58e-01

Pos logit contributions
oto+0.041
lic+0.040
ogo+0.037
ibi+0.037
round+0.035
Neg logit contributions
vered-0.061
ement-0.059
 spe-0.058
 symp-0.057
 aston-0.055
Token.
Token ID13
Feature activation-2.12e-01

Pos logit contributions
vered+0.032
 spe+0.030
ement+0.030
+0.028
 aston+0.028
Neg logit contributions
oto-0.013
oys-0.011
lic-0.011
EEP-0.011
 listen-0.010
Token look
Token ID804
Feature activation+2.00e+00

Pos logit contributions
vered+0.005
ement+0.005
 symp+0.004
 aston+0.004
ingly+0.004
Neg logit contributions
oto-0.002
lic-0.002
ibi-0.002
ogo-0.002
round-0.002
Token at
Token ID379
Feature activation-1.15e+00

Pos logit contributions
lic+0.128
oto+0.126
ogo+0.103
round+0.102
ibi+0.101
Neg logit contributions
vered-0.231
ement-0.212
 symp-0.210
 spe-0.208
-0.201
Token in
Token ID287
Feature activation-8.63e-01

Pos logit contributions
 symp+0.130
vered+0.128
 aston+0.125
 spe+0.121
ement+0.121
Neg logit contributions
oto-0.063
ogo-0.055
lic-0.050
ruck-0.048
ock-0.046
Token the
Token ID262
Feature activation-1.79e-01

Pos logit contributions
vered+0.109
 symp+0.104
ement+0.104
+0.096
used+0.095
Neg logit contributions
 cool-0.030
 water-0.026
 nice-0.023
oto-0.022
 my-0.020
Token padd
Token ID14098
Feature activation+1.02e-01

Pos logit contributions
vered+0.022
ement+0.020
 symp+0.019
ingly+0.018
 aston+0.018
Neg logit contributions
lic-0.009
 cool-0.008
oto-0.008
 sound-0.008
 water-0.008
Tokenock
Token ID735
Feature activation+7.38e+00

Pos logit contributions
lic+0.008
oto+0.008
round+0.007
ock+0.007
ogo+0.007
Neg logit contributions
vered-0.010
 spe-0.009
 symp-0.009
 aston-0.009
ement-0.009
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
lic+0.535
oto+0.499
ogo+0.465
round+0.457
ibi+0.445
Neg logit contributions
vered-0.743
ement-0.708
 symp-0.696
 spe-0.695
 aston-0.692
Token The
Token ID383
Feature activation-3.35e-01

Pos logit contributions
vered+0.023
ement+0.022
 spe+0.022
 symp+0.021
 aston+0.020
Neg logit contributions
oto-0.019
lic-0.019
ibi-0.018
ogo-0.017
round-0.017
Token farmer
Token ID18739
Feature activation-3.22e-01

Pos logit contributions
vered+0.042
ement+0.040
 symp+0.038
 spe+0.038
 aston+0.037
Neg logit contributions
oto-0.019
lic-0.017
ogo-0.016
ibi-0.015
oys-0.014
Token smiled
Token ID13541
Feature activation-7.79e-01

Pos logit contributions
ement+0.035
vered+0.034
 symp+0.034
 spe+0.033
 aston+0.032
Neg logit contributions
oto-0.020
lic-0.020
ogo-0.018
round-0.017
ibi-0.017
Token and
Token ID290
Feature activation+2.52e-01

Pos logit contributions
vered+0.091
 spe+0.086
ement+0.084
 symp+0.080
ocol+0.078
Neg logit contributions
oto-0.046
lic-0.044
ogo-0.036
ibi-0.035
oys-0.034
Token treats
Token ID18432
Feature activation+2.94e-01

Pos logit contributions
vered+0.023
 symp+0.022
ement+0.021
 aston+0.020
 spe+0.020
Neg logit contributions
oto-0.013
ibi-0.011
lic-0.011
ogo-0.011
round-0.011
Token!"
Token ID2474
Feature activation+2.01e+00

Pos logit contributions
lic+0.021
oto+0.020
ogo+0.019
ibi+0.019
round+0.017
Neg logit contributions
vered-0.031
ement-0.030
 spe-0.029
 symp-0.029
 aston-0.027
Token
Token ID198
Feature activation+2.81e-01

Pos logit contributions
oto+0.147
lic+0.140
EEP+0.135
round+0.135
ibi+0.133
Neg logit contributions
vered-0.204
ement-0.200
 spe-0.191
 aston-0.185
 symp-0.182
Token
Token ID198
Feature activation+2.82e-01

Pos logit contributions
oto+0.023
lic+0.023
ogo+0.022
ibi+0.022
round+0.021
Neg logit contributions
vered-0.026
ement-0.025
 symp-0.024
 spe-0.024
 aston-0.023
TokenSt
Token ID1273
Feature activation-7.85e-03

Pos logit contributions
oto+0.022
lic+0.022
ogo+0.021
ibi+0.020
round+0.020
Neg logit contributions
vered-0.027
ement-0.027
 spe-0.026
 symp-0.025
 aston-0.024
Tokenrip
Token ID5528
Feature activation+7.37e+00

Pos logit contributions
vered+0.001
ement+0.001
 spe+0.001
 symp+0.001
 aston+0.001
Neg logit contributions
oto-0.001
lic-0.001
ogo-0.001
ibi-0.001
round-0.001
Token ran
Token ID4966
Feature activation+6.76e-01

Pos logit contributions
lic+0.661
oto+0.657
ogo+0.616
ibi+0.602
round+0.587
Neg logit contributions
vered-0.637
ement-0.618
 spe-0.595
 symp-0.591
 aston-0.571
Token away
Token ID1497
Feature activation-4.33e-01

Pos logit contributions
oto+0.064
lic+0.063
ogo+0.060
ibi+0.058
EEP+0.056
Neg logit contributions
vered-0.056
ement-0.051
 symp-0.051
 spe-0.050
 aston-0.048
Token with
Token ID351
Feature activation-1.80e+00

Pos logit contributions
vered+0.042
ement+0.041
 spe+0.039
 symp+0.039
 aston+0.038
Neg logit contributions
oto-0.034
lic-0.034
ibi-0.031
ogo-0.031
round-0.030
Token the
Token ID262
Feature activation-1.70e-01

Pos logit contributions
ement+0.195
vered+0.195
 symp+0.182
 aston+0.175
 spe+0.175
Neg logit contributions
oto-0.120
ogo-0.110
lic-0.105
una-0.104
haw-0.100
Token stolen
Token ID9909
Feature activation-1.37e+00

Pos logit contributions
vered+0.020
 symp+0.019
ement+0.018
 aston+0.018
 spe+0.017
Neg logit contributions
oto-0.012
ibi-0.010
lic-0.010
ogo-0.009
oys-0.009
Token stuck
Token ID7819
Feature activation-4.28e+00

Pos logit contributions
vered+0.072
ement+0.070
 symp+0.067
 spe+0.066
 aston+0.064
Neg logit contributions
lic-0.056
oto-0.056
ogo-0.052
ibi-0.050
round-0.048
Token in
Token ID287
Feature activation-8.63e-01

Pos logit contributions
vered+0.418
ement+0.410
 spe+0.400
 symp+0.396
 aston+0.385
Neg logit contributions
oto-0.325
lic-0.318
ogo-0.308
ibi-0.291
round-0.281
Token a
Token ID257
Feature activation+4.98e-02

Pos logit contributions
ement+0.104
vered+0.101
 symp+0.097
 spe+0.095
bugs+0.093
Neg logit contributions
oto-0.042
ogo-0.038
lic-0.038
 gear-0.034
ibi-0.034
Token small
Token ID1402
Feature activation-1.21e+00

Pos logit contributions
lic+0.003
oto+0.003
ogo+0.002
ibi+0.002
round+0.002
Neg logit contributions
ement-0.006
vered-0.006
 symp-0.006
 aston-0.005
 spe-0.005
Token padd
Token ID14098
Feature activation+1.05e-01

Pos logit contributions
vered+0.124
ement+0.123
 symp+0.115
 spe+0.108
 aston+0.107
Neg logit contributions
lic-0.083
oto-0.081
round-0.078
ogo-0.076
ibi-0.074
Tokenock
Token ID735
Feature activation+7.37e+00

Pos logit contributions
oto+0.010
lic+0.010
ogo+0.010
ibi+0.010
EEP+0.009
Neg logit contributions
vered-0.008
ement-0.008
 symp-0.007
 spe-0.007
 aston-0.007
Token,
Token ID11
Feature activation+1.79e-01

Pos logit contributions
lic+0.515
oto+0.515
ogo+0.485
round+0.485
ibi+0.468
Neg logit contributions
vered-0.737
ement-0.732
 aston-0.696
 spe-0.696
 symp-0.689
Token far
Token ID1290
Feature activation-1.59e+00

Pos logit contributions
oto+0.009
lic+0.009
round+0.008
ibi+0.008
ogo+0.008
Neg logit contributions
vered-0.021
ement-0.021
 symp-0.020
 aston-0.020
 spe-0.019
Token away
Token ID1497
Feature activation-4.20e-01

Pos logit contributions
vered+0.133
ement+0.127
 spe+0.122
 symp+0.122
 aston+0.116
Neg logit contributions
oto-0.147
lic-0.147
ogo-0.138
ibi-0.137
round-0.133
Token from
Token ID422
Feature activation+1.17e+00

Pos logit contributions
vered+0.041
ement+0.040
 symp+0.038
 spe+0.038
 aston+0.036
Neg logit contributions
oto-0.033
lic-0.032
ogo-0.030
ibi-0.029
round-0.029
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
oto+0.063
ogo+0.057
ruck+0.053
round+0.053
ock+0.050
Neg logit contributions
ement-0.142
vered-0.139
 symp-0.136
 aston-0.128
ingly-0.125
Token.
Token ID13
Feature activation-2.55e-01

Pos logit contributions
vered+0.113
ement+0.108
 spe+0.105
 symp+0.102
 aston+0.099
Neg logit contributions
oto-0.118
lic-0.117
ogo-0.109
ibi-0.108
round-0.105
Token It
Token ID632
Feature activation+9.47e-01

Pos logit contributions
vered+0.024
ement+0.023
 symp+0.023
 spe+0.023
 aston+0.022
Neg logit contributions
oto-0.021
lic-0.021
ogo-0.020
ibi-0.019
round-0.018
Token is
Token ID318
Feature activation-4.98e-01

Pos logit contributions
oto+0.078
lic+0.078
ogo+0.072
ibi+0.072
round+0.069
Neg logit contributions
vered-0.089
ement-0.085
 spe-0.083
 symp-0.082
 aston-0.078
Token a
Token ID257
Feature activation+4.24e-02

Pos logit contributions
vered+0.050
ement+0.049
 spe+0.048
 symp+0.047
 aston+0.045
Neg logit contributions
oto-0.037
lic-0.037
ogo-0.034
ibi-0.034
round-0.033
Token peac
Token ID41961
Feature activation-8.90e-01

Pos logit contributions
oto+0.003
lic+0.003
ogo+0.003
ibi+0.003
round+0.003
Neg logit contributions
vered-0.004
ement-0.004
 spe-0.004
 symp-0.004
 aston-0.004
Tokenock
Token ID735
Feature activation+7.36e+00

Pos logit contributions
ement+0.122
vered+0.117
 symp+0.112
 spe+0.111
bird+0.110
Neg logit contributions
oto-0.034
ibi-0.030
lic-0.028
oys-0.026
ogo-0.026
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
lic+0.468
oto+0.458
round+0.444
ogo+0.417
ibi+0.413
Neg logit contributions
vered-0.786
ement-0.766
 aston-0.752
 symp-0.735
 spe-0.687
Token It
Token ID632
Feature activation+9.65e-01

Pos logit contributions
vered+0.023
ement+0.022
 spe+0.021
 symp+0.021
 aston+0.020
Neg logit contributions
oto-0.020
lic-0.020
ogo-0.018
ibi-0.018
round-0.017
Token is
Token ID318
Feature activation-4.84e-01

Pos logit contributions
lic+0.066
oto+0.066
ogo+0.062
ibi+0.057
round+0.056
Neg logit contributions
vered-0.101
ement-0.100
 symp-0.095
 spe-0.091
ingly-0.091
Token proud
Token ID6613
Feature activation+8.07e-01

Pos logit contributions
vered+0.052
ement+0.051
 symp+0.049
 spe+0.048
 aston+0.046
Neg logit contributions
oto-0.034
lic-0.033
ogo-0.031
ibi-0.030
round-0.028
Token of
Token ID286
Feature activation-1.25e-01

Pos logit contributions
oto+0.060
lic+0.059
ogo+0.055
ibi+0.055
round+0.052
Neg logit contributions
vered-0.083
ement-0.080
 spe-0.078
 symp-0.077
 aston-0.074
Token made
Token ID925
Feature activation+1.03e+00

Pos logit contributions
lic+0.028
oto+0.028
ogo+0.025
ibi+0.025
round+0.024
Neg logit contributions
vered-0.039
ement-0.038
 symp-0.037
 spe-0.036
 aston-0.035
Token a
Token ID257
Feature activation+2.51e-02

Pos logit contributions
oto+0.081
lic+0.081
ogo+0.074
ibi+0.074
round+0.070
Neg logit contributions
vered-0.103
ement-0.098
 spe-0.097
 symp-0.095
 aston-0.091
Token tick
Token ID4378
Feature activation-5.39e-01

Pos logit contributions
lic+0.001
oto+0.001
ibi+0.001
ogo+0.001
EEP+0.001
Neg logit contributions
vered-0.003
ement-0.003
 symp-0.003
 spe-0.003
 aston-0.003
Token-
Token ID12
Feature activation+1.25e+00

Pos logit contributions
vered+0.040
ement+0.038
 spe+0.037
 symp+0.037
 aston+0.035
Neg logit contributions
oto-0.055
lic-0.055
ogo-0.052
ibi-0.051
round-0.050
Tokent
Token ID83
Feature activation+3.50e+00

Pos logit contributions
ogo+0.091
oto+0.090
lic+0.082
ibi+0.081
 Indians+0.081
Neg logit contributions
vered-0.124
 spe-0.119
ement-0.116
 symp-0.115
el-0.108
Tokenock
Token ID735
Feature activation+7.36e+00

Pos logit contributions
lic+0.200
oto+0.196
ibi+0.190
EEP+0.189
ogo+0.186
Neg logit contributions
vered-0.394
ement-0.384
 spe-0.374
el-0.364
ingly-0.362
Token sound
Token ID2128
Feature activation+3.69e+00

Pos logit contributions
oto+0.544
lic+0.544
ogo+0.498
ibi+0.496
round+0.479
Neg logit contributions
vered-0.751
ement-0.725
 spe-0.705
 symp-0.698
 aston-0.675
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
oto+0.345
lic+0.343
ogo+0.323
ibi+0.321
round+0.310
Neg logit contributions
vered-0.304
ement-0.291
 spe-0.281
 symp-0.279
 aston-0.263
Token He
Token ID679
Feature activation+1.99e-01

Pos logit contributions
vered+0.023
ement+0.023
 symp+0.022
 spe+0.022
 aston+0.021
Neg logit contributions
lic-0.019
oto-0.019
ogo-0.018
ibi-0.017
round-0.017
Token wore
Token ID12408
Feature activation+1.84e+00

Pos logit contributions
oto+0.011
lic+0.011
ibi+0.010
ogo+0.009
round+0.009
Neg logit contributions
ement-0.024
vered-0.023
 symp-0.022
 aston-0.022
 spe-0.021
Token it
Token ID340
Feature activation+4.06e-01

Pos logit contributions
oto+0.164
lic+0.162
ogo+0.156
oys+0.155
ibi+0.155
Neg logit contributions
vered-0.158
 spe-0.151
ement-0.149
 symp-0.144
 aston-0.138
Token,
Token ID11
Feature activation+2.55e-01

Pos logit contributions
vered+0.018
ement+0.017
 spe+0.017
 symp+0.016
 aston+0.016
Neg logit contributions
oto-0.016
lic-0.015
ogo-0.014
ibi-0.014
round-0.014
Token I
Token ID314
Feature activation+7.71e-01

Pos logit contributions
lic+0.018
oto+0.018
ogo+0.017
ibi+0.017
round+0.016
Neg logit contributions
vered-0.026
ement-0.025
 spe-0.025
 symp-0.025
 aston-0.024
Token do
Token ID466
Feature activation+1.16e-01

Pos logit contributions
oto+0.067
lic+0.066
ogo+0.064
round+0.062
ibi+0.060
Neg logit contributions
vered-0.068
 spe-0.063
ement-0.063
 symp-0.062
 aston-0.060
Token!"
Token ID2474
Feature activation+2.05e+00

Pos logit contributions
oto+0.010
lic+0.010
ogo+0.009
ibi+0.009
round+0.009
Neg logit contributions
vered-0.011
ement-0.010
 spe-0.010
 symp-0.010
 aston-0.010
Token S
Token ID311
Feature activation-3.93e-02

Pos logit contributions
oto+0.114
lic+0.108
ibi+0.102
round+0.095
ogo+0.092
Neg logit contributions
vered-0.257
ement-0.249
 spe-0.238
 symp-0.234
 aston-0.225
Tokenock
Token ID735
Feature activation+7.36e+00

Pos logit contributions
ement+0.004
vered+0.004
 symp+0.004
el+0.004
bird+0.003
Neg logit contributions
lic-0.003
oto-0.002
round-0.002
ogo-0.002
ibi-0.002
Tokeny
Token ID88
Feature activation-6.01e-01

Pos logit contributions
oto+0.517
lic+0.515
ogo+0.469
ibi+0.466
round+0.445
Neg logit contributions
vered-0.782
ement-0.759
 spe-0.743
 symp-0.734
 aston-0.709
Token says
Token ID1139
Feature activation-3.03e-01

Pos logit contributions
vered+0.076
 spe+0.075
ement+0.074
 symp+0.071
 aston+0.070
Neg logit contributions
oto-0.031
lic-0.030
ibi-0.026
round-0.025
ogo-0.025
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
vered+0.035
ement+0.033
 spe+0.032
 afar+0.031
 aston+0.031
Neg logit contributions
lic-0.019
oto-0.017
oys-0.015
round-0.015
ibi-0.015
Token "
Token ID366
Feature activation+1.10e+00

Pos logit contributions
vered+0.028
ement+0.027
 spe+0.027
 symp+0.027
 aston+0.026
Neg logit contributions
oto-0.018
lic-0.018
ogo-0.016
ibi-0.016
round-0.015
TokenWhat
Token ID2061
Feature activation+2.32e-01

Pos logit contributions
oto+0.091
lic+0.090
ibi+0.083
ogo+0.083
round+0.082
Neg logit contributions
vered-0.104
ement-0.101
 spe-0.097
 symp-0.096
 aston-0.092
Token feathers
Token ID27400
Feature activation-2.83e+00

Pos logit contributions
 symp+0.053
vered+0.051
ement+0.050
 aston+0.050
 spe+0.045
Neg logit contributions
oto-0.039
lic-0.039
ogo-0.036
ibi-0.036
ruck-0.034
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
vered+0.288
ement+0.275
 symp+0.272
 aston+0.268
 spe+0.265
Neg logit contributions
lic-0.216
oto-0.216
ogo-0.194
ibi-0.194
round-0.185
Token But
Token ID887
Feature activation+4.37e-01

Pos logit contributions
vered+0.023
ement+0.021
 spe+0.021
 symp+0.020
 aston+0.019
Neg logit contributions
lic-0.014
oto-0.014
ibi-0.013
round-0.013
ogo-0.012
Token the
Token ID262
Feature activation-1.89e-01

Pos logit contributions
oto+0.023
 bye+0.019
 listen+0.018
ogo+0.018
 Indians+0.018
Neg logit contributions
ement-0.052
vered-0.051
 symp-0.046
 aston-0.046
 spe-0.046
Token peac
Token ID41961
Feature activation-8.92e-01

Pos logit contributions
vered+0.022
ement+0.021
 symp+0.020
 aston+0.019
ingly+0.019
Neg logit contributions
oto-0.012
lic-0.011
ibi-0.011
ogo-0.010
round-0.010
Tokenock
Token ID735
Feature activation+7.35e+00

Pos logit contributions
ement+0.125
vered+0.124
 symp+0.116
 spe+0.116
el+0.115
Neg logit contributions
oto-0.029
ibi-0.025
lic-0.025
oys-0.024
EEP-0.023
Token was
Token ID373
Feature activation-7.40e-01

Pos logit contributions
lic+0.462
oto+0.454
ibi+0.403
ogo+0.400
round+0.390
Neg logit contributions
ement-0.789
vered-0.784
 symp-0.773
 aston-0.769
 spe-0.747
Token very
Token ID845
Feature activation+3.35e-01

Pos logit contributions
vered+0.084
ement+0.082
 symp+0.079
 spe+0.075
 aston+0.074
Neg logit contributions
oto-0.048
lic-0.048
ogo-0.043
ibi-0.041
una-0.038
Token impressive
Token ID8036
Feature activation+1.39e+00

Pos logit contributions
lic+0.023
oto+0.023
ibi+0.020
ogo+0.019
EEP+0.018
Neg logit contributions
vered-0.037
ement-0.036
 symp-0.034
 spe-0.032
 aston-0.032
Token!
Token ID0
Feature activation+1.29e+00

Pos logit contributions
oto+0.099
lic+0.093
ogo+0.087
round+0.085
ibi+0.084
Neg logit contributions
vered-0.144
 symp-0.139
 spe-0.138
 aston-0.136
ement-0.134
Token He
Token ID679
Feature activation+1.33e-01

Pos logit contributions
lic+0.086
oto+0.080
ibi+0.078
round+0.075
ogo+0.073
Neg logit contributions
vered-0.145
ement-0.136
 spe-0.135
 symp-0.129
 aston-0.127
Token
Token ID95
Feature activation+6.87e-01

Pos logit contributions
oto+0.138
lic+0.131
ibi+0.131
ogo+0.123
oys+0.119
Neg logit contributions
ement-0.141
vered-0.141
 symp-0.140
 spe-0.137
 aston-0.129
Token
Token ID26391
Feature activation+2.46e+00

Pos logit contributions
oto+0.064
lic+0.062
ibi+0.058
ogo+0.058
 Indians+0.052
Neg logit contributions
vered-0.058
ement-0.056
 symp-0.055
 aston-0.053
 spe-0.051
Token
Token ID129
Feature activation-2.22e-01

Pos logit contributions
oto+0.247
lic+0.224
ibi+0.218
round+0.217
ogo+0.216
Neg logit contributions
vered-0.191
 spe-0.184
ement-0.181
 symp-0.180
 aston-0.167
Token
Token ID241
Feature activation+1.51e+00

Pos logit contributions
 symp+0.018
vered+0.018
ement+0.017
 afar+0.016
 aston+0.016
Neg logit contributions
lic-0.021
oto-0.020
oys-0.019
ogo-0.018
ibi-0.018
TokenKn
Token ID25095
Feature activation+1.92e+00

Pos logit contributions
oto+0.124
lic+0.123
ogo+0.115
ibi+0.113
round+0.109
Neg logit contributions
vered-0.142
ement-0.137
 spe-0.133
 symp-0.132
 aston-0.126
Tokenock
Token ID735
Feature activation+7.35e+00

Pos logit contributions
oto+0.103
 Indians+0.099
ibi+0.092
ogo+0.087
haw+0.085
Neg logit contributions
ement-0.241
vered-0.220
ingly-0.213
 symp-0.210
 aston-0.205
Token,
Token ID11
Feature activation+1.55e-01

Pos logit contributions
oto+0.570
lic+0.553
ibi+0.522
ogo+0.520
round+0.492
Neg logit contributions
vered-0.732
ement-0.702
 spe-0.684
 symp-0.675
 aston-0.642
Token knock
Token ID10643
Feature activation-1.24e+00

Pos logit contributions
lic+0.011
oto+0.011
ogo+0.010
ibi+0.010
round+0.010
Neg logit contributions
vered-0.016
ement-0.016
 symp-0.015
 spe-0.015
 aston-0.014
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
vered+0.122
ement+0.117
 spe+0.113
 symp+0.112
 aston+0.107
Neg logit contributions
oto-0.097
lic-0.095
ogo-0.089
ibi-0.089
round-0.084
Token I
Token ID314
Feature activation+7.07e-01

Pos logit contributions
oto+0.013
lic+0.012
ogo+0.012
ibi+0.012
round+0.011
Neg logit contributions
vered-0.016
ement-0.015
 spe-0.015
 symp-0.015
 aston-0.014
Token'm
Token ID1101
Feature activation+7.24e-01

Pos logit contributions
oto+0.058
lic+0.055
ogo+0.053
ibi+0.053
round+0.051
Neg logit contributions
vered-0.068
 spe-0.063
ement-0.062
 symp-0.061
 aston-0.060
Token your
Token ID534
Feature activation+9.53e-01

Pos logit contributions
oto+0.062
lic+0.058
ogo+0.055
round+0.055
ock+0.053
Neg logit contributions
vered-0.130
ement-0.127
 symp-0.125
 aston-0.120
 spe-0.118
Token hair
Token ID4190
Feature activation-2.85e+00

Pos logit contributions
oto+0.080
lic+0.079
ogo+0.075
ibi+0.075
EEP+0.072
Neg logit contributions
vered-0.088
ement-0.084
 symp-0.082
 spe-0.080
 aston-0.078
Token into
Token ID656
Feature activation+3.37e-01

Pos logit contributions
vered+0.276
ement+0.266
 spe+0.259
 symp+0.257
 aston+0.245
Neg logit contributions
oto-0.225
lic-0.224
ogo-0.208
ibi-0.206
round-0.198
Token a
Token ID257
Feature activation+2.96e-02

Pos logit contributions
oto+0.016
lic+0.014
icated+0.014
ruck+0.014
ogo+0.013
Neg logit contributions
ement-0.041
vered-0.041
 symp-0.039
-0.037
ingly-0.036
Token peac
Token ID41961
Feature activation-9.03e-01

Pos logit contributions
lic+0.002
oto+0.002
round+0.002
ibi+0.002
ogo+0.002
Neg logit contributions
vered-0.003
ement-0.003
 symp-0.003
ingly-0.003
 aston-0.003
Tokenock
Token ID735
Feature activation+7.35e+00

Pos logit contributions
ement+0.122
vered+0.115
 symp+0.113
bird+0.113
el+0.112
Neg logit contributions
oto-0.034
ibi-0.030
oys-0.028
lic-0.027
ogo-0.026
Token tail
Token ID7894
Feature activation-2.53e-01

Pos logit contributions
lic+0.455
oto+0.453
round+0.422
ogo+0.410
ibi+0.397
Neg logit contributions
vered-0.824
ement-0.802
 symp-0.779
 aston-0.767
 spe-0.756
Token with
Token ID351
Feature activation-1.82e+00

Pos logit contributions
vered+0.026
ement+0.026
 symp+0.025
 spe+0.024
 aston+0.024
Neg logit contributions
oto-0.018
lic-0.018
ogo-0.016
round-0.016
ibi-0.016
Token clips
Token ID19166
Feature activation-2.52e+00

Pos logit contributions
ement+0.185
 symp+0.181
vered+0.180
+0.158
 aston+0.156
Neg logit contributions
oto-0.128
lic-0.120
haw-0.115
icated-0.114
ogo-0.114
Token and
Token ID290
Feature activation+2.27e-01

Pos logit contributions
vered+0.235
ement+0.226
 spe+0.219
 symp+0.218
 aston+0.207
Neg logit contributions
oto-0.208
lic-0.207
ogo-0.193
ibi-0.192
round-0.184
Token rib
Token ID12183
Feature activation-4.42e+00

Pos logit contributions
oto+0.023
lic+0.022
ogo+0.021
ibi+0.021
EEP+0.020
Neg logit contributions
vered-0.017
ement-0.017
 symp-0.016
 spe-0.015
 aston-0.015
Token Katie
Token ID24721
Feature activation-1.43e+00

Pos logit contributions
oto+0.012
lic+0.012
ogo+0.011
ibi+0.011
round+0.011
Neg logit contributions
vered-0.010
ement-0.010
 spe-0.010
 symp-0.009
 aston-0.009
Token put
Token ID1234
Feature activation-1.69e-01

Pos logit contributions
ement+0.157
vered+0.156
 symp+0.150
 spe+0.149
 aston+0.142
Neg logit contributions
lic-0.092
oto-0.090
ibi-0.082
ogo-0.080
EEP-0.079
Token on
Token ID319
Feature activation+4.43e-01

Pos logit contributions
vered+0.016
ement+0.016
 spe+0.015
 symp+0.015
 aston+0.015
Neg logit contributions
oto-0.013
lic-0.013
ogo-0.012
ibi-0.012
round-0.012
Token her
Token ID607
Feature activation-6.21e-01

Pos logit contributions
oto+0.031
ibi+0.030
lic+0.030
ogo+0.027
oys+0.027
Neg logit contributions
ement-0.044
vered-0.044
 spe-0.042
 symp-0.041
 aston-0.038
Token sm
Token ID895
Feature activation+3.90e-01

Pos logit contributions
vered+0.066
ement+0.063
 spe+0.062
 symp+0.061
 aston+0.059
Neg logit contributions
oto-0.043
lic-0.043
ibi-0.040
ogo-0.039
round-0.037
Tokenock
Token ID735
Feature activation+7.35e+00

Pos logit contributions
lic+0.039
oto+0.035
ogo+0.034
 Indians+0.034
EEP+0.033
Neg logit contributions
ement-0.030
vered-0.029
ingly-0.027
el-0.027
 symp-0.027
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
oto+0.509
lic+0.498
ibi+0.476
round+0.453
ogo+0.449
Neg logit contributions
vered-0.776
ement-0.765
 spe-0.754
 symp-0.719
 aston-0.711
Token went
Token ID1816
Feature activation+1.11e-01

Pos logit contributions
oto+0.017
lic+0.017
ogo+0.016
ibi+0.015
round+0.015
Neg logit contributions
vered-0.021
ement-0.020
 spe-0.020
 symp-0.020
 aston-0.019
Token outside
Token ID2354
Feature activation+2.45e+00

Pos logit contributions
oto+0.011
lic+0.010
ogo+0.010
ibi+0.010
EEP+0.010
Neg logit contributions
vered-0.008
 spe-0.008
 symp-0.008
ement-0.008
 aston-0.007
Token.
Token ID13
Feature activation-2.61e-01

Pos logit contributions
lic+0.078
oto+0.069
oys+0.066
round+0.065
ruck+0.063
Neg logit contributions
vered-0.339
ement-0.338
 spe-0.311
 symp-0.305
 aston-0.303
Token She
Token ID1375
Feature activation-5.89e-01

Pos logit contributions
vered+0.021
ement+0.020
 spe+0.019
 symp+0.019
 aston+0.018
Neg logit contributions
oto-0.025
lic-0.025
ogo-0.024
ibi-0.023
round-0.023
Token Mama
Token ID40084
Feature activation+8.22e-01

Pos logit contributions
vered+0.029
ement+0.029
 spe+0.027
 symp+0.026
 aston+0.026
Neg logit contributions
oto-0.021
lic-0.020
ogo-0.019
round-0.019
ibi-0.019
Token said
Token ID531
Feature activation+5.86e-01

Pos logit contributions
oto+0.057
lic+0.056
ogo+0.052
ibi+0.050
round+0.048
Neg logit contributions
vered-0.089
ement-0.086
 spe-0.084
 symp-0.083
 aston-0.080
Token,
Token ID11
Feature activation+1.21e-01

Pos logit contributions
oto+0.036
lic+0.035
EEP+0.030
 bye+0.029
oys+0.029
Neg logit contributions
ement-0.065
vered-0.065
 aston-0.062
 symp-0.061
 spe-0.060
Token "
Token ID366
Feature activation+1.11e+00

Pos logit contributions
lic+0.005
oto+0.005
ibi+0.004
ogo+0.004
EEP+0.004
Neg logit contributions
vered-0.016
ement-0.016
 spe-0.015
 aston-0.015
 symp-0.015
TokenSt
Token ID1273
Feature activation-4.55e-02

Pos logit contributions
oto+0.079
lic+0.071
ibi+0.071
round+0.070
haw+0.067
Neg logit contributions
ement-0.125
vered-0.115
 symp-0.111
 aston-0.110
 spe-0.108
Tokenrip
Token ID5528
Feature activation+7.34e+00

Pos logit contributions
vered+0.004
ement+0.004
 symp+0.004
 spe+0.004
 aston+0.004
Neg logit contributions
oto-0.004
lic-0.004
ibi-0.004
ogo-0.004
round-0.003
Token,
Token ID11
Feature activation+1.31e-01

Pos logit contributions
lic+0.582
oto+0.575
ogo+0.535
ibi+0.520
round+0.510
Neg logit contributions
vered-0.706
ement-0.691
 symp-0.665
 spe-0.665
 aston-0.643
Token you
Token ID345
Feature activation-1.04e+00

Pos logit contributions
lic+0.010
oto+0.010
ibi+0.010
ogo+0.009
EEP+0.009
Neg logit contributions
vered-0.013
 spe-0.012
 symp-0.012
ement-0.012
 aston-0.011
Token naughty
Token ID45128
Feature activation+7.94e-01

Pos logit contributions
vered+0.090
ement+0.085
 spe+0.084
 symp+0.082
 aston+0.079
Neg logit contributions
oto-0.094
lic-0.092
ibi-0.088
ogo-0.087
round-0.084
Token k
Token ID479
Feature activation-3.62e+00

Pos logit contributions
lic+0.061
oto+0.059
ogo+0.056
ibi+0.055
round+0.052
Neg logit contributions
vered-0.078
ement-0.074
 spe-0.073
 symp-0.072
 aston-0.067
Tokenitty
Token ID9760
Feature activation-1.25e+00

Pos logit contributions
 symp+0.330
ement+0.327
 aston+0.326
 amaz+0.313
vered+0.312
Neg logit contributions
lic-0.305
round-0.299
oto-0.293
ogo-0.291
haw-0.278
Token happy
Token ID3772
Feature activation-8.19e-01

Pos logit contributions
vered+0.098
ement+0.095
 symp+0.094
 spe+0.089
el+0.086
Neg logit contributions
lic-0.051
oto-0.047
ibi-0.042
round-0.039
ogo-0.038
Token and
Token ID290
Feature activation+9.40e-02

Pos logit contributions
vered+0.099
 symp+0.094
ement+0.093
 spe+0.093
 aston+0.089
Neg logit contributions
oto-0.043
lic-0.040
oys-0.038
ruck-0.036
EEP-0.035
Token the
Token ID262
Feature activation-3.07e-01

Pos logit contributions
oto+0.008
lic+0.008
ogo+0.007
ibi+0.007
round+0.007
Neg logit contributions
vered-0.009
ement-0.009
 spe-0.008
 symp-0.008
 aston-0.008
Token nation
Token ID3277
Feature activation-2.78e+00

Pos logit contributions
vered+0.029
 symp+0.028
ement+0.028
 spe+0.027
ingly+0.025
Neg logit contributions
lic-0.024
oto-0.024
oys-0.022
round-0.021
EEP-0.021
Token pro
Token ID386
Feature activation-3.21e+00

Pos logit contributions
 symp+0.306
vered+0.302
 spe+0.292
ement+0.288
 aston+0.280
Neg logit contributions
lic-0.166
oto-0.153
round-0.140
ibi-0.136
EEP-0.134
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
 symp+0.268
vered+0.260
 spe+0.259
ement+0.255
 aston+0.252
Neg logit contributions
lic-0.284
oto-0.273
ogo-0.250
EEP-0.250
round-0.248
Tokenred
Token ID445
Feature activation-9.85e-02

Pos logit contributions
 symp+0.866
 spe+0.861
 aston+0.768
ement+0.738
 distress+0.733
Neg logit contributions
round-0.903
EEP-0.877
oys-0.866
oto-0.817
lic-0.809
Token.
Token ID13
Feature activation-3.85e-01

Pos logit contributions
ement+0.010
vered+0.010
 symp+0.010
 spe+0.010
 aston+0.009
Neg logit contributions
lic-0.007
oto-0.007
ogo-0.006
oys-0.006
ibi-0.006
Token<|endoftext|>
Token ID50256
Feature activation-1.26e+00

Pos logit contributions
vered+0.042
 spe+0.040
ement+0.040
 symp+0.039
 aston+0.037
Neg logit contributions
lic-0.025
oto-0.025
ibi-0.024
round-0.023
EEP-0.023
TokenOnce
Token ID7454
Feature activation-1.91e-01

Pos logit contributions
vered+0.130
 spe+0.125
ement+0.124
 symp+0.122
 aston+0.116
Neg logit contributions
lic-0.095
oto-0.091
ogo-0.084
ibi-0.083
round-0.079
Token upon
Token ID2402
Feature activation-4.17e-01

Pos logit contributions
vered+0.022
 spe+0.020
ement+0.020
 symp+0.020
 aston+0.019
Neg logit contributions
oto-0.013
lic-0.012
ogo-0.012
ibi-0.011
EEP-0.010
Token had
Token ID550
Feature activation-3.92e-02

Pos logit contributions
lic+0.014
oto+0.014
ibi+0.012
ogo+0.012
round+0.011
Neg logit contributions
vered-0.026
ement-0.025
 spe-0.025
 symp-0.025
 aston-0.024
Token to
Token ID284
Feature activation-1.65e-01

Pos logit contributions
vered+0.004
ement+0.004
 spe+0.004
 symp+0.004
 aston+0.004
Neg logit contributions
oto-0.003
lic-0.003
ogo-0.002
ibi-0.002
round-0.002
Token help
Token ID1037
Feature activation-3.30e-01

Pos logit contributions
vered+0.018
ement+0.018
 spe+0.017
 symp+0.017
 aston+0.017
Neg logit contributions
oto-0.011
lic-0.010
ogo-0.010
ibi-0.010
round-0.009
Token the
Token ID262
Feature activation-3.14e-01

Pos logit contributions
vered+0.037
ement+0.037
 spe+0.036
 symp+0.035
 aston+0.034
Neg logit contributions
oto-0.020
lic-0.017
ogo-0.017
oys-0.016
ruck-0.016
Token town
Token ID3240
Feature activation-1.66e+00

Pos logit contributions
vered+0.033
ement+0.032
 symp+0.032
 spe+0.031
 aston+0.029
Neg logit contributions
lic-0.022
oto-0.021
ogo-0.020
EEP-0.019
round-0.019
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
 spe+0.171
ement+0.171
vered+0.170
 symp+0.164
 aston+0.158
Neg logit contributions
lic-0.109
oto-0.100
round-0.096
oys-0.095
ibi-0.092
Tokenople
Token ID643
Feature activation-2.16e+00

Pos logit contributions
 spe+0.932
 symp+0.865
 aston+0.810
ement+0.780
vered+0.768
Neg logit contributions
round-0.901
EEP-0.817
oys-0.807
oto-0.785
ock-0.781
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
 spe+0.240
ement+0.236
vered+0.230
 symp+0.221
 aston+0.216
Neg logit contributions
oto-0.116
lic-0.115
round-0.108
ruck-0.106
ock-0.103
Token He
Token ID679
Feature activation+7.80e-02

Pos logit contributions
vered+0.046
 spe+0.046
ement+0.044
 symp+0.043
 aston+0.041
Neg logit contributions
lic-0.020
oto-0.018
EEP-0.017
ibi-0.017
ogo-0.017
Token quickly
Token ID2952
Feature activation-3.02e-01

Pos logit contributions
lic+0.005
oto+0.005
ogo+0.005
ibi+0.005
round+0.004
Neg logit contributions
vered-0.008
ement-0.008
 spe-0.008
 symp-0.008
 aston-0.008
Token formulated
Token ID34391
Feature activation+2.82e-01

Pos logit contributions
vered+0.031
 spe+0.028
ement+0.028
 symp+0.028
 aston+0.027
Neg logit contributions
oto-0.024
lic-0.022
ogo-0.022
ibi-0.021
round-0.020
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
vered+0.309
ement+0.303
 symp+0.297
 spe+0.295
 aston+0.288
Neg logit contributions
oto-0.219
lic-0.213
ogo-0.198
ibi-0.194
round-0.191
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
vered+0.040
 spe+0.039
ement+0.038
 symp+0.037
 aston+0.036
Neg logit contributions
lic-0.027
oto-0.024
EEP-0.024
ibi-0.023
round-0.022
Token
Token ID198
Feature activation+1.50e-01

Pos logit contributions
oto+0.012
lic+0.012
ogo+0.011
ibi+0.011
round+0.011
Neg logit contributions
vered-0.016
ement-0.015
 spe-0.015
 symp-0.015
 aston-0.014
TokenThe
Token ID464
Feature activation-3.25e-01

Pos logit contributions
oto+0.013
lic+0.013
ogo+0.012
ibi+0.012
round+0.012
Neg logit contributions
vered-0.013
ement-0.013
 symp-0.012
 spe-0.012
 aston-0.012
Token town
Token ID3240
Feature activation-1.66e+00

Pos logit contributions
vered+0.036
ement+0.034
 symp+0.033
 spe+0.033
 aston+0.031
Neg logit contributions
oto-0.021
lic-0.021
ogo-0.019
ibi-0.018
EEP-0.018
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
 spe+0.182
vered+0.182
ement+0.180
 symp+0.177
 aston+0.173
Neg logit contributions
lic-0.102
oto-0.092
round-0.091
ibi-0.085
EEP-0.084
Tokenople
Token ID643
Feature activation-2.17e+00

Pos logit contributions
 spe+0.759
 symp+0.749
ement+0.710
vered+0.707
 aston+0.685
Neg logit contributions
round-0.967
oto-0.960
EEP-0.945
lic-0.943
oys-0.917
Token were
Token ID547
Feature activation-9.76e-01

Pos logit contributions
 spe+0.240
ement+0.236
vered+0.229
 symp+0.224
 aston+0.216
Neg logit contributions
lic-0.134
oto-0.122
round-0.112
EEP-0.108
ibi-0.105
Token sad
Token ID6507
Feature activation-3.33e+00

Pos logit contributions
vered+0.100
ement+0.096
 symp+0.093
 spe+0.092
 aston+0.085
Neg logit contributions
lic-0.071
oto-0.067
ogo-0.059
ibi-0.059
round-0.057
Token because
Token ID780
Feature activation+4.35e+00

Pos logit contributions
vered+0.355
ement+0.339
 spe+0.335
 symp+0.328
 aston+0.320
Neg logit contributions
oto-0.235
lic-0.230
ogo-0.213
ibi-0.209
EEP-0.202
Token they
Token ID484
Feature activation+2.55e-01

Pos logit contributions
oto+0.198
lic+0.172
oys+0.171
ibi+0.156
ruck+0.155
Neg logit contributions
vered-0.555
 symp-0.532
ement-0.516
 spe-0.498
pen-0.479
Token bell
Token ID8966
Feature activation-6.57e-01

Pos logit contributions
vered+0.038
 symp+0.037
ement+0.035
 spe+0.033
ingly+0.033
Neg logit contributions
lic-0.017
oto-0.017
ruck-0.016
EEP-0.016
round-0.015
Token stopped
Token ID5025
Feature activation+1.95e+00

Pos logit contributions
vered+0.071
 symp+0.070
ement+0.069
 spe+0.067
 aston+0.067
Neg logit contributions
lic-0.043
oto-0.040
ibi-0.038
round-0.037
EEP-0.036
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
ruck+0.093
oto+0.085
lic+0.081
reat+0.077
EEP+0.073
Neg logit contributions
vered-0.240
ement-0.227
-0.206
 aston-0.202
 bidder-0.200
Token The
Token ID383
Feature activation-4.77e-01

Pos logit contributions
vered+0.041
 spe+0.039
ement+0.038
 symp+0.037
 aston+0.035
Neg logit contributions
lic-0.027
oto-0.023
EEP-0.023
ibi-0.023
round-0.022
Token town
Token ID3240
Feature activation-1.65e+00

Pos logit contributions
vered+0.057
 symp+0.056
ement+0.052
ingly+0.050
 spe+0.050
Neg logit contributions
oto-0.022
ruck-0.022
lic-0.022
 sound-0.021
 bus-0.021
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
vered+0.171
 symp+0.169
ement+0.166
 spe+0.166
 aston+0.158
Neg logit contributions
lic-0.114
oto-0.101
round-0.099
ibi-0.097
EEP-0.091
Tokenople
Token ID643
Feature activation-2.16e+00

Pos logit contributions
 spe+0.858
 symp+0.827
 aston+0.748
ement+0.738
vered+0.728
Neg logit contributions
round-0.915
EEP-0.884
oys-0.842
lic-0.832
ibi-0.828
Token cheered
Token ID34767
Feature activation-3.01e+00

Pos logit contributions
vered+0.206
ement+0.203
 spe+0.199
 symp+0.198
 aston+0.185
Neg logit contributions
lic-0.170
oto-0.161
ibi-0.149
round-0.149
ogo-0.146
Token and
Token ID290
Feature activation+9.40e-02

Pos logit contributions
vered+0.325
 spe+0.309
ement+0.307
 symp+0.292
 aston+0.286
Neg logit contributions
oto-0.210
lic-0.205
ibi-0.183
ogo-0.182
oys-0.179
Token hugged
Token ID41130
Feature activation+2.16e+00

Pos logit contributions
oto+0.008
lic+0.008
ogo+0.008
ibi+0.007
round+0.007
Neg logit contributions
vered-0.009
 spe-0.008
ement-0.008
 symp-0.008
 aston-0.008
Token each
Token ID1123
Feature activation-3.80e-01

Pos logit contributions
oto+0.123
lic+0.107
oys+0.107
round+0.105
ibi+0.105
Neg logit contributions
vered-0.232
ement-0.231
 spe-0.229
 aston-0.214
 symp-0.214
Token.
Token ID13
Feature activation-3.72e-01

Pos logit contributions
vered+0.038
 spe+0.036
ement+0.036
 symp+0.035
 aston+0.035
Neg logit contributions
lic-0.019
oto-0.019
oys-0.016
ock-0.016
ruck-0.016
Token He
Token ID679
Feature activation+7.44e-02

Pos logit contributions
vered+0.045
 spe+0.044
ement+0.043
 symp+0.041
 aston+0.040
Neg logit contributions
lic-0.021
oto-0.018
ibi-0.017
EEP-0.017
ogo-0.017
Token asked
Token ID1965
Feature activation-1.46e+00

Pos logit contributions
lic+0.005
oto+0.004
ogo+0.004
ibi+0.004
round+0.004
Neg logit contributions
ement-0.008
vered-0.008
 spe-0.008
 symp-0.008
 aston-0.007
Token the
Token ID262
Feature activation-3.12e-01

Pos logit contributions
vered+0.169
ement+0.169
 spe+0.168
 symp+0.161
 aston+0.156
Neg logit contributions
oto-0.087
lic-0.080
ogo-0.075
ibi-0.070
oys-0.067
Token town
Token ID3240
Feature activation-1.66e+00

Pos logit contributions
vered+0.035
ement+0.034
 symp+0.033
 spe+0.032
 aston+0.031
Neg logit contributions
lic-0.020
oto-0.020
ogo-0.018
ibi-0.017
 robber-0.017
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
vered+0.181
ement+0.178
 spe+0.177
 symp+0.173
 aston+0.164
Neg logit contributions
oto-0.110
lic-0.108
ogo-0.097
ibi-0.095
oys-0.095
Tokenople
Token ID643
Feature activation-2.17e+00

Pos logit contributions
 spe+0.979
+0.882
 symp+0.880
 aston+0.831
 blonde+0.803
Neg logit contributions
round-0.875
oys-0.785
EEP-0.746
ock-0.731
oto-0.729
Token what
Token ID644
Feature activation+1.72e+00

Pos logit contributions
 spe+0.251
vered+0.250
ement+0.248
 symp+0.236
 aston+0.230
Neg logit contributions
oto-0.128
lic-0.120
ogo-0.114
round-0.104
ibi-0.103
Token was
Token ID373
Feature activation-8.55e-01

Pos logit contributions
oto+0.080
lic+0.077
ibi+0.062
ogo+0.058
haw+0.056
Neg logit contributions
 symp-0.220
ement-0.219
vered-0.217
 spe-0.202
ingly-0.200
Token wrong
Token ID2642
Feature activation-1.43e+00

Pos logit contributions
vered+0.080
ement+0.078
 spe+0.075
 symp+0.074
 aston+0.071
Neg logit contributions
oto-0.071
lic-0.070
ogo-0.066
ibi-0.065
round-0.063
Token and
Token ID290
Feature activation+1.00e-01

Pos logit contributions
 spe+0.156
vered+0.156
ement+0.152
 symp+0.149
 aston+0.143
Neg logit contributions
oto-0.091
lic-0.085
oys-0.083
ogo-0.077
ibi-0.076
Token.
Token ID13
Feature activation-3.69e-01

Pos logit contributions
 symp+0.043
vered+0.042
 spe+0.042
ement+0.042
 aston+0.041
Neg logit contributions
oto-0.022
lic-0.021
ogo-0.020
oys-0.020
ruck-0.019
Token
Token ID198
Feature activation+1.54e-01

Pos logit contributions
vered+0.045
 spe+0.044
ement+0.042
 symp+0.040
ingly+0.039
Neg logit contributions
lic-0.022
ibi-0.018
EEP-0.018
oto-0.017
oys-0.017
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
lic+0.010
oto+0.010
ibi+0.009
ogo+0.009
EEP+0.008
Neg logit contributions
vered-0.017
 spe-0.017
ement-0.017
 symp-0.016
 aston-0.015
TokenThe
Token ID464
Feature activation-3.19e-01

Pos logit contributions
lic+0.011
oto+0.011
ibi+0.010
ogo+0.010
round+0.010
Neg logit contributions
vered-0.016
ement-0.016
 spe-0.016
 symp-0.015
 aston-0.015
Token town
Token ID3240
Feature activation-1.66e+00

Pos logit contributions
vered+0.033
ement+0.032
 spe+0.031
 symp+0.031
 aston+0.030
Neg logit contributions
oto-0.023
lic-0.023
ogo-0.021
ibi-0.021
round-0.020
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
 spe+0.176
vered+0.168
+0.168
ement+0.167
ocol+0.167
Neg logit contributions
lic-0.108
oto-0.095
oys-0.094
round-0.091
ibi-0.088
Tokenople
Token ID643
Feature activation-2.16e+00

Pos logit contributions
 spe+0.841
 symp+0.822
 aston+0.742
ement+0.737
vered+0.734
Neg logit contributions
round-0.909
oto-0.866
EEP-0.855
oys-0.845
ibi-0.818
Token decided
Token ID3066
Feature activation-3.06e-01

Pos logit contributions
 spe+0.251
ement+0.250
ocol+0.234
vered+0.233
 symp+0.228
Neg logit contributions
lic-0.119
oto-0.096
 jumped-0.093
round-0.087
EEP-0.084
Token to
Token ID284
Feature activation-1.62e-01

Pos logit contributions
vered+0.036
ement+0.035
 spe+0.034
 symp+0.034
 aston+0.032
Neg logit contributions
oto-0.018
lic-0.018
ibi-0.016
ogo-0.016
EEP-0.015
Token ask
Token ID1265
Feature activation+5.21e-01

Pos logit contributions
ement+0.018
vered+0.018
 symp+0.017
 spe+0.017
 aston+0.016
Neg logit contributions
oto-0.010
lic-0.010
ogo-0.009
ibi-0.009
EEP-0.008
Token their
Token ID511
Feature activation-4.70e-01

Pos logit contributions
oto+0.037
lic+0.033
ogo+0.032
oys+0.032
ibi+0.030
Neg logit contributions
vered-0.055
 spe-0.054
ement-0.054
 symp-0.051
 aston-0.049
Token.
Token ID13
Feature activation-3.79e-01

Pos logit contributions
oto+0.182
lic+0.181
ibi+0.171
ogo+0.165
round+0.164
Neg logit contributions
vered-0.219
ement-0.218
 spe-0.211
 symp-0.205
 aston-0.202
Token
Token ID198
Feature activation+1.61e-01

Pos logit contributions
vered+0.044
 spe+0.042
ement+0.041
 symp+0.040
ingly+0.039
Neg logit contributions
lic-0.024
ibi-0.022
oto-0.021
round-0.021
EEP-0.021
Token
Token ID198
Feature activation+1.54e-01

Pos logit contributions
oto+0.013
lic+0.013
ogo+0.012
ibi+0.012
round+0.012
Neg logit contributions
vered-0.015
ement-0.015
 symp-0.014
 spe-0.014
 aston-0.013
TokenThe
Token ID464
Feature activation-3.21e-01

Pos logit contributions
oto+0.013
lic+0.013
ogo+0.012
ibi+0.012
round+0.011
Neg logit contributions
vered-0.014
ement-0.014
 spe-0.013
 symp-0.013
 aston-0.013
Token town
Token ID3240
Feature activation-1.66e+00

Pos logit contributions
vered+0.036
ement+0.034
 symp+0.033
 spe+0.033
 aston+0.031
Neg logit contributions
oto-0.022
lic-0.021
ogo-0.019
ibi-0.019
round-0.018
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
vered+0.166
 spe+0.165
 symp+0.164
ement+0.163
 aston+0.157
Neg logit contributions
lic-0.119
oto-0.107
round-0.106
ibi-0.101
oys-0.098
Tokenople
Token ID643
Feature activation-2.17e+00

Pos logit contributions
 spe+0.840
 symp+0.811
 aston+0.740
ement+0.734
vered+0.726
Neg logit contributions
round-0.932
EEP-0.876
oto-0.876
oys-0.860
ibi-0.847
Token soon
Token ID2582
Feature activation-1.60e+00

Pos logit contributions
ement+0.231
vered+0.226
 spe+0.226
 symp+0.220
 aston+0.210
Neg logit contributions
lic-0.146
oto-0.140
round-0.127
ibi-0.124
ogo-0.123
Token discovered
Token ID5071
Feature activation-6.58e-01

Pos logit contributions
vered+0.156
ement+0.150
 spe+0.146
 symp+0.145
 aston+0.139
Neg logit contributions
oto-0.125
lic-0.123
ogo-0.117
ibi-0.115
round-0.111
Token where
Token ID810
Feature activation+1.89e+00

Pos logit contributions
 symp+0.078
vered+0.077
 spe+0.076
ement+0.074
ingly+0.071
Neg logit contributions
lic-0.035
oto-0.035
ibi-0.027
ogo-0.027
ruck-0.025
Token the
Token ID262
Feature activation-3.04e-01

Pos logit contributions
oto+0.104
lic+0.102
ibi+0.089
oys+0.086
EEP+0.085
Neg logit contributions
vered-0.223
ement-0.223
 symp-0.218
 spe-0.208
 aston-0.200
Token
Token ID198
Feature activation+1.62e-01

Pos logit contributions
oto+0.014
lic+0.014
ogo+0.013
ibi+0.013
round+0.013
Neg logit contributions
vered-0.015
ement-0.014
 symp-0.014
 spe-0.014
 aston-0.013
TokenSo
Token ID2396
Feature activation-1.06e+00

Pos logit contributions
oto+0.013
ogo+0.012
lic+0.012
ibi+0.011
round+0.011
Neg logit contributions
vered-0.015
 symp-0.014
ement-0.014
 aston-0.013
 spe-0.013
Token,
Token ID11
Feature activation+4.82e-02

Pos logit contributions
vered+0.108
ement+0.103
 spe+0.101
 symp+0.101
 aston+0.095
Neg logit contributions
oto-0.079
lic-0.077
ibi-0.071
oys-0.070
round-0.070
Token the
Token ID262
Feature activation-2.91e-01

Pos logit contributions
oto+0.003
lic+0.003
ibi+0.003
round+0.003
EEP+0.003
Neg logit contributions
vered-0.006
ement-0.005
 spe-0.005
 symp-0.005
 aston-0.005
Token town
Token ID3240
Feature activation-1.65e+00

Pos logit contributions
vered+0.028
ement+0.027
 symp+0.026
 spe+0.026
 aston+0.026
Neg logit contributions
lic-0.023
oto-0.023
ogo-0.022
ibi-0.021
round-0.021
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
vered+0.151
 spe+0.147
ement+0.145
 symp+0.145
 aston+0.134
Neg logit contributions
lic-0.141
oto-0.129
oys-0.125
ibi-0.121
round-0.118
Tokenople
Token ID643
Feature activation-2.15e+00

Pos logit contributions
 spe+0.858
 symp+0.831
 aston+0.753
vered+0.749
ement+0.746
Neg logit contributions
round-0.904
oys-0.869
oto-0.858
EEP-0.854
ibi-0.812
Token looked
Token ID3114
Feature activation+1.36e+00

Pos logit contributions
ement+0.249
 spe+0.243
vered+0.241
 symp+0.234
 aston+0.224
Neg logit contributions
lic-0.127
oto-0.114
ogo-0.101
round-0.099
ibi-0.097
Token for
Token ID329
Feature activation+6.62e-01

Pos logit contributions
lic+0.083
oto+0.076
ogo+0.065
round+0.063
ibi+0.062
Neg logit contributions
vered-0.154
 spe-0.147
ement-0.142
-0.140
 symp-0.136
Token Old
Token ID5706
Feature activation+1.21e+00

Pos logit contributions
oto+0.049
lic+0.048
ogo+0.044
ibi+0.043
oys+0.042
Neg logit contributions
vered-0.067
ement-0.065
 symp-0.063
 spe-0.063
 aston-0.059
Token Tom
Token ID4186
Feature activation+1.18e-01

Pos logit contributions
oto+0.089
lic+0.087
ibi+0.080
ogo+0.079
oys+0.075
Neg logit contributions
vered-0.126
ement-0.120
 spe-0.120
 symp-0.119
 aston-0.113
Token
Token ID220
Feature activation-1.30e+00

Pos logit contributions
vered+0.039
 spe+0.039
ement+0.037
 symp+0.036
 aston+0.035
Neg logit contributions
lic-0.023
EEP-0.021
oys-0.020
oto-0.020
ibi-0.019
Token
Token ID198
Feature activation+1.66e-01

Pos logit contributions
vered+0.123
ement+0.119
 symp+0.115
 spe+0.115
 aston+0.109
Neg logit contributions
oto-0.105
lic-0.104
ogo-0.097
ibi-0.097
round-0.094
Token
Token ID198
Feature activation+1.79e-01

Pos logit contributions
oto+0.013
lic+0.013
ogo+0.012
ibi+0.012
round+0.012
Neg logit contributions
vered-0.016
ement-0.015
 spe-0.015
 symp-0.015
 aston-0.014
TokenThe
Token ID464
Feature activation-3.01e-01

Pos logit contributions
oto+0.014
lic+0.014
ogo+0.013
ibi+0.013
round+0.013
Neg logit contributions
vered-0.017
ement-0.017
 spe-0.016
 symp-0.016
 aston-0.015
Token town
Token ID3240
Feature activation-1.64e+00

Pos logit contributions
vered+0.034
ement+0.032
 symp+0.031
 spe+0.030
ingly+0.029
Neg logit contributions
oto-0.019
lic-0.018
ogo-0.016
EEP-0.016
ibi-0.016
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
 spe+0.165
vered+0.161
 symp+0.158
ocol+0.157
ement+0.156
Neg logit contributions
lic-0.116
oto-0.104
round-0.102
oys-0.102
ibi-0.097
Tokenople
Token ID643
Feature activation-2.14e+00

Pos logit contributions
 spe+0.803
 symp+0.779
ement+0.721
vered+0.714
 aston+0.714
Neg logit contributions
round-0.929
oto-0.906
EEP-0.904
lic-0.885
oys-0.885
Token thought
Token ID1807
Feature activation+6.87e-01

Pos logit contributions
 spe+0.252
ement+0.243
vered+0.235
ocol+0.232
 symp+0.228
Neg logit contributions
lic-0.119
oto-0.102
round-0.091
EEP-0.087
 jumped-0.086
Token and
Token ID290
Feature activation+1.28e-01

Pos logit contributions
oto+0.041
lic+0.041
ogo+0.037
ibi+0.036
round+0.034
Neg logit contributions
vered-0.080
ement-0.076
 spe-0.076
 symp-0.075
 aston-0.073
Token thought
Token ID1807
Feature activation+6.95e-01

Pos logit contributions
oto+0.009
lic+0.009
ogo+0.009
ibi+0.009
round+0.008
Neg logit contributions
vered-0.013
ement-0.013
 spe-0.012
 symp-0.012
 aston-0.012
Token until
Token ID1566
Feature activation-2.65e+00

Pos logit contributions
lic+0.037
oto+0.036
ogo+0.032
EEP+0.029
ibi+0.029
Neg logit contributions
vered-0.085
 spe-0.081
ement-0.079
 symp-0.079
 aston-0.078
Token rope
Token ID17182
Feature activation-3.51e-01

Pos logit contributions
vered+0.032
ement+0.031
 spe+0.030
 symp+0.030
 aston+0.029
Neg logit contributions
oto-0.018
lic-0.018
ibi-0.016
ogo-0.016
EEP-0.016
Token bouncing
Token ID30713
Feature activation-1.28e+00

Pos logit contributions
vered+0.037
 spe+0.035
ement+0.035
 aston+0.034
 symp+0.033
Neg logit contributions
lic-0.023
oto-0.023
ibi-0.022
round-0.022
ock-0.022
Token.
Token ID13
Feature activation-3.48e-01

Pos logit contributions
vered+0.136
ement+0.134
 spe+0.132
 aston+0.128
 symp+0.128
Neg logit contributions
oto-0.082
lic-0.078
round-0.075
ibi-0.075
ogo-0.071
Token The
Token ID383
Feature activation-4.49e-01

Pos logit contributions
vered+0.034
ement+0.032
 spe+0.032
 symp+0.031
 aston+0.030
Neg logit contributions
lic-0.028
oto-0.028
ibi-0.026
ogo-0.025
round-0.025
Token town
Token ID3240
Feature activation-1.63e+00

Pos logit contributions
vered+0.045
ement+0.043
 spe+0.042
 symp+0.042
 aston+0.040
Neg logit contributions
lic-0.035
oto-0.034
ogo-0.033
ibi-0.031
round-0.031
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
vered+0.171
ement+0.167
 symp+0.165
 spe+0.164
 aston+0.158
Neg logit contributions
lic-0.114
oto-0.107
ibi-0.101
round-0.100
EEP-0.096
Tokenople
Token ID643
Feature activation-2.13e+00

Pos logit contributions
 spe+0.722
 symp+0.708
vered+0.684
ement+0.678
 aston+0.648
Neg logit contributions
oto-1.012
round-1.001
lic-0.988
EEP-0.971
ibi-0.971
Token gathered
Token ID9272
Feature activation+2.31e-01

Pos logit contributions
vered+0.190
ement+0.183
 spe+0.177
 symp+0.176
 aston+0.167
Neg logit contributions
oto-0.184
lic-0.183
ogo-0.171
ibi-0.170
round-0.164
Token around
Token ID1088
Feature activation-5.43e-01

Pos logit contributions
oto+0.011
ibi+0.009
 bye+0.009
 cool+0.009
 off+0.009
Neg logit contributions
 spe-0.027
ement-0.026
-0.026
vered-0.026
 symp-0.026
Token to
Token ID284
Feature activation-1.30e-01

Pos logit contributions
ement+0.066
vered+0.066
 spe+0.062
 symp+0.060
ingly+0.060
Neg logit contributions
oto-0.023
oys-0.019
lic-0.019
 listen-0.019
EEP-0.016
Token watch
Token ID2342
Feature activation+2.71e+00

Pos logit contributions
vered+0.012
 spe+0.012
 symp+0.011
 aston+0.011
ement+0.011
Neg logit contributions
lic-0.011
oto-0.011
ogo-0.010
oys-0.010
EEP-0.010
Token!
Token ID0
Feature activation+1.28e+00

Pos logit contributions
vered+0.026
 aston+0.025
ement+0.025
 symp+0.024
 spe+0.024
Neg logit contributions
oto-0.010
!"-0.008
lic-0.008
round-0.008
ock-0.007
Token
Token ID198
Feature activation+2.09e-01

Pos logit contributions
lic+0.093
ibi+0.085
oto+0.083
EEP+0.080
round+0.076
Neg logit contributions
vered-0.139
ement-0.132
 spe-0.130
 symp-0.124
 aston-0.121
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
lic+0.016
oto+0.015
ogo+0.014
ibi+0.014
round+0.013
Neg logit contributions
vered-0.022
ement-0.021
 spe-0.021
 symp-0.020
 aston-0.019
TokenThe
Token ID464
Feature activation-2.76e-01

Pos logit contributions
oto+0.017
lic+0.016
ogo+0.015
ibi+0.015
round+0.015
Neg logit contributions
vered-0.020
ement-0.019
 spe-0.019
 symp-0.019
 aston-0.018
Token town
Token ID3240
Feature activation-1.61e+00

Pos logit contributions
vered+0.034
ement+0.032
 symp+0.031
 spe+0.030
 aston+0.029
Neg logit contributions
oto-0.016
lic-0.015
ibi-0.013
ogo-0.013
 robber-0.013
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
vered+0.172
 spe+0.172
ement+0.171
 symp+0.170
ocol+0.165
Neg logit contributions
lic-0.100
oto-0.084
round-0.082
ibi-0.082
oys-0.079
Tokenople
Token ID643
Feature activation-2.13e+00

Pos logit contributions
 spe+0.884
 symp+0.851
 aston+0.777
ement+0.750
+0.737
Neg logit contributions
round-0.896
EEP-0.829
oys-0.802
oto-0.785
lic-0.779
Token were
Token ID547
Feature activation-9.48e-01

Pos logit contributions
ement+0.219
 spe+0.213
vered+0.212
 symp+0.204
 aston+0.198
Neg logit contributions
lic-0.150
oto-0.130
round-0.122
ibi-0.119
ogo-0.118
Token sad
Token ID6507
Feature activation-3.29e+00

Pos logit contributions
vered+0.104
ement+0.101
 symp+0.097
 spe+0.094
+0.092
Neg logit contributions
lic-0.063
oto-0.055
ogo-0.051
ibi-0.050
round-0.045
Token and
Token ID290
Feature activation+1.24e-01

Pos logit contributions
vered+0.388
 spe+0.361
ement+0.360
 aston+0.353
 symp+0.351
Neg logit contributions
oto-0.189
lic-0.184
oys-0.166
ogo-0.163
ibi-0.159
Token scared
Token ID12008
Feature activation-3.13e-01

Pos logit contributions
lic+0.010
oto+0.009
ibi+0.008
ogo+0.008
oys+0.008
Neg logit contributions
vered-0.012
ement-0.012
 symp-0.011
 spe-0.011
 aston-0.011
Token it
Token ID340
Feature activation+2.97e-01

Pos logit contributions
 symp+0.313
ement+0.308
vered+0.306
ingly+0.285
 aston+0.278
Neg logit contributions
oto-0.107
lic-0.092
ye-0.091
EEP-0.087
 listen-0.086
Token sang
Token ID25889
Feature activation+1.63e+00

Pos logit contributions
lic+0.020
oto+0.018
ogo+0.017
ibi+0.016
round+0.016
Neg logit contributions
vered-0.032
ement-0.032
 symp-0.031
 spe-0.029
 aston-0.029
Token,
Token ID11
Feature activation+6.61e-02

Pos logit contributions
lic+0.124
oto+0.118
ibi+0.102
ogo+0.102
round+0.101
Neg logit contributions
vered-0.164
ement-0.158
 symp-0.154
 spe-0.153
 aston-0.148
Token the
Token ID262
Feature activation-2.73e-01

Pos logit contributions
lic+0.004
oto+0.004
round+0.003
ibi+0.003
ogo+0.003
Neg logit contributions
vered-0.007
ement-0.007
 symp-0.007
 spe-0.006
 aston-0.006
Token town
Token ID3240
Feature activation-1.62e+00

Pos logit contributions
vered+0.033
 symp+0.033
ement+0.032
ingly+0.030
+0.029
Neg logit contributions
 sound-0.013
oto-0.012
lic-0.012
 bus-0.011
 cool-0.011
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
 symp+0.179
vered+0.177
ement+0.176
 aston+0.169
 spe+0.165
Neg logit contributions
lic-0.107
oto-0.092
round-0.087
ibi-0.082
EEP-0.079
Tokenople
Token ID643
Feature activation-2.13e+00

Pos logit contributions
 spe+0.826
 symp+0.825
 aston+0.771
ement+0.748
vered+0.743
Neg logit contributions
round-0.923
EEP-0.870
oto-0.833
oys-0.827
ibi-0.803
Token would
Token ID561
Feature activation-7.16e-01

Pos logit contributions
ement+0.229
vered+0.229
 symp+0.223
 spe+0.220
 aston+0.212
Neg logit contributions
lic-0.140
oto-0.137
ogo-0.121
round-0.119
ibi-0.118
Token act
Token ID719
Feature activation+6.06e-01

Pos logit contributions
vered+0.083
ement+0.081
 spe+0.076
 symp+0.076
 aston+0.073
Neg logit contributions
oto-0.045
lic-0.043
ogo-0.040
ibi-0.039
round-0.037
Token nice
Token ID3621
Feature activation+2.48e+00

Pos logit contributions
lic+0.046
oto+0.043
ogo+0.037
ibi+0.037
round+0.036
Neg logit contributions
vered-0.063
ement-0.061
 spe-0.059
 symp-0.059
 aston-0.056
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
lic+0.175
oto+0.174
ibi+0.153
ogo+0.152
round+0.148
Neg logit contributions
vered-0.261
ement-0.252
 symp-0.248
 spe-0.247
 aston-0.235
Token!
Token ID0
Feature activation+1.26e+00

Pos logit contributions
lic+0.032
round+0.031
oto+0.031
ogo+0.029
EEP+0.028
Neg logit contributions
ement-0.055
vered-0.054
 symp-0.052
 spe-0.052
 aston-0.051
Token
Token ID198
Feature activation+1.91e-01

Pos logit contributions
lic+0.089
oto+0.087
ibi+0.081
ogo+0.079
round+0.078
Neg logit contributions
vered-0.134
ement-0.128
 spe-0.128
 symp-0.126
 aston-0.119
Token
Token ID198
Feature activation+1.94e-01

Pos logit contributions
lic+0.014
oto+0.014
ogo+0.013
ibi+0.013
round+0.012
Neg logit contributions
vered-0.020
ement-0.019
 spe-0.019
 symp-0.018
 aston-0.018
TokenThe
Token ID464
Feature activation-2.87e-01

Pos logit contributions
lic+0.015
oto+0.015
ogo+0.014
ibi+0.014
round+0.013
Neg logit contributions
vered-0.019
ement-0.019
 spe-0.018
 symp-0.018
 aston-0.017
Token town
Token ID3240
Feature activation-1.62e+00

Pos logit contributions
vered+0.027
ement+0.026
 spe+0.025
 symp+0.025
 aston+0.024
Neg logit contributions
oto-0.024
lic-0.024
ogo-0.022
ibi-0.021
round-0.021
Tokenspe
Token ID4125
Feature activation-1.03e+01

Pos logit contributions
 symp+0.139
vered+0.137
 spe+0.136
ement+0.136
 aston+0.128
Neg logit contributions
lic-0.138
oto-0.128
round-0.125
ibi-0.119
oys-0.118
Tokenople
Token ID643
Feature activation-2.13e+00

Pos logit contributions
 spe+0.826
 symp+0.802
ement+0.718
 aston+0.717
vered+0.693
Neg logit contributions
round-0.918
EEP-0.868
oto-0.854
oys-0.848
lic-0.841
Token thanked
Token ID26280
Feature activation-2.53e+00

Pos logit contributions
ement+0.211
vered+0.206
 spe+0.203
 symp+0.202
 aston+0.191
Neg logit contributions
lic-0.155
oto-0.147
round-0.135
ogo-0.133
ibi-0.132
Token the
Token ID262
Feature activation-2.63e-01

Pos logit contributions
vered+0.307
ement+0.287
 spe+0.285
 symp+0.274
+0.266
Neg logit contributions
oto-0.147
lic-0.108
 listen-0.108
ock-0.107
ruck-0.106
Token hero
Token ID4293
Feature activation+4.92e-01

Pos logit contributions
vered+0.026
ement+0.025
 spe+0.024
 symp+0.024
 aston+0.023
Neg logit contributions
oto-0.021
lic-0.021
ogo-0.019
ibi-0.019
round-0.018
Token for
Token ID329
Feature activation+6.95e-01

Pos logit contributions
oto+0.026
round+0.023
lic+0.023
ogo+0.022
oys+0.021
Neg logit contributions
 spe-0.058
vered-0.058
ement-0.057
 aston-0.055
 symp-0.054
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
lic+0.175
oto+0.174
ibi+0.153
ogo+0.152
round+0.148
Neg logit contributions
vered-0.261
ement-0.252
 symp-0.248
 spe-0.247
 aston-0.235
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
vered+0.040
 spe+0.038
 symp+0.036
ement+0.036
ingly+0.035
Neg logit contributions
lic-0.019
EEP-0.017
oto-0.017
ibi-0.017
round-0.016
Token
Token ID198
Feature activation+1.94e-01

Pos logit contributions
lic+0.015
oto+0.015
ogo+0.014
ibi+0.014
round+0.013
Neg logit contributions
vered-0.020
ement-0.019
 spe-0.018
 symp-0.018
 aston-0.017
TokenThe
Token ID464
Feature activation-2.76e-01

Pos logit contributions
lic+0.016
oto+0.016
ogo+0.015
ibi+0.015
round+0.014
Neg logit contributions
vered-0.018
ement-0.018
 spe-0.017
 symp-0.017
 aston-0.016
Token town
Token ID3240
Feature activation-1.60e+00

Pos logit contributions
vered+0.034
 symp+0.033
ement+0.032
ingly+0.031
eter+0.030
Neg logit contributions
oto-0.012
lic-0.012
 sound-0.012
 bus-0.011
 robber-0.011
Tokenspe
Token ID4125
Feature activation-1.02e+01

Pos logit contributions
 symp+0.156
vered+0.152
ement+0.149
 spe+0.148
ocol+0.146
Neg logit contributions
lic-0.116
round-0.106
oto-0.105
oys-0.095
ibi-0.094
Tokenople
Token ID643
Feature activation-2.12e+00

Pos logit contributions
 spe+0.848
 symp+0.833
 aston+0.755
ement+0.734
+0.723
Neg logit contributions
round-0.899
EEP-0.840
oys-0.823
oto-0.820
lic-0.796
Token were
Token ID547
Feature activation-9.29e-01

Pos logit contributions
ement+0.228
vered+0.225
 spe+0.220
 symp+0.219
 aston+0.208
Neg logit contributions
lic-0.136
oto-0.135
round-0.119
ogo-0.118
ibi-0.115
Token so
Token ID523
Feature activation-4.46e-01

Pos logit contributions
vered+0.102
ement+0.099
 symp+0.098
 spe+0.094
 aston+0.090
Neg logit contributions
lic-0.063
oto-0.060
ogo-0.053
ibi-0.053
round-0.050
Token happy
Token ID3772
Feature activation-7.71e-01

Pos logit contributions
vered+0.042
ement+0.040
 spe+0.039
 symp+0.039
 aston+0.037
Neg logit contributions
lic-0.037
oto-0.037
ogo-0.034
ibi-0.034
round-0.033
Token they
Token ID484
Feature activation+3.17e-01

Pos logit contributions
vered+0.080
ement+0.077
 spe+0.076
 symp+0.075
 aston+0.072
Neg logit contributions
oto-0.056
lic-0.055
ogo-0.051
ibi-0.050
round-0.049
Token.
Token ID13
Feature activation-3.12e-01

Pos logit contributions
vered+0.164
 symp+0.161
 spe+0.157
ement+0.155
 aston+0.149
Neg logit contributions
lic-0.118
oto-0.109
round-0.104
ibi-0.101
oys-0.100
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
vered+0.028
ement+0.027
 spe+0.026
 symp+0.026
 aston+0.025
Neg logit contributions
oto-0.027
lic-0.027
ogo-0.025
ibi-0.025
round-0.024
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
oto+0.019
lic+0.019
ogo+0.018
ibi+0.018
round+0.018
Neg logit contributions
vered-0.019
ement-0.018
 symp-0.018
 spe-0.017
 aston-0.017
TokenThe
Token ID464
Feature activation-2.56e-01

Pos logit contributions
oto+0.021
lic+0.020
ogo+0.020
ibi+0.020
round+0.019
Neg logit contributions
vered-0.018
ement-0.017
 symp-0.017
 spe-0.016
 aston-0.016
Token town
Token ID3240
Feature activation-1.59e+00

Pos logit contributions
vered+0.024
ement+0.023
 spe+0.023
 symp+0.022
 aston+0.021
Neg logit contributions
oto-0.021
lic-0.021
ogo-0.019
ibi-0.019
round-0.019
Tokenspe
Token ID4125
Feature activation-1.02e+01

Pos logit contributions
vered+0.139
 spe+0.136
 symp+0.134
ement+0.130
 aston+0.125
Neg logit contributions
lic-0.141
oto-0.134
round-0.127
ibi-0.123
oys-0.122
Tokenople
Token ID643
Feature activation-2.09e+00

Pos logit contributions
 spe+0.779
 symp+0.753
vered+0.704
ement+0.686
 aston+0.681
Neg logit contributions
round-0.964
oto-0.951
lic-0.924
EEP-0.913
oys-0.913
Token worked
Token ID3111
Feature activation-1.82e+00

Pos logit contributions
vered+0.211
 spe+0.210
ement+0.208
 symp+0.201
 aston+0.190
Neg logit contributions
lic-0.151
oto-0.147
round-0.130
ibi-0.129
ogo-0.128
Token together
Token ID1978
Feature activation-1.56e+00

Pos logit contributions
vered+0.168
 spe+0.164
ement+0.157
 symp+0.155
 aston+0.150
Neg logit contributions
lic-0.154
oto-0.145
ogo-0.133
ibi-0.131
oys-0.131
Token to
Token ID284
Feature activation-8.41e-02

Pos logit contributions
vered+0.196
ement+0.191
 spe+0.191
 symp+0.189
 aston+0.181
Neg logit contributions
lic-0.074
oto-0.066
ogo-0.057
oys-0.056
ibi-0.052
Token help
Token ID1037
Feature activation-2.46e-01

Pos logit contributions
vered+0.009
ement+0.009
 spe+0.008
 symp+0.008
 aston+0.008
Neg logit contributions
oto-0.006
lic-0.006
ogo-0.005
ibi-0.005
round-0.005
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
vered+0.223
ement+0.216
 symp+0.208
 spe+0.205
 aston+0.201
Neg logit contributions
oto-0.130
lic-0.128
ibi-0.114
ogo-0.112
oys-0.111
Token
Token ID198
Feature activation+2.48e-01

Pos logit contributions
vered+0.031
ement+0.029
 spe+0.029
 symp+0.029
 aston+0.027
Neg logit contributions
lic-0.023
oto-0.022
ibi-0.020
ogo-0.020
EEP-0.020
Token
Token ID198
Feature activation+2.39e-01

Pos logit contributions
oto+0.019
lic+0.019
ogo+0.018
ibi+0.017
round+0.017
Neg logit contributions
vered-0.025
ement-0.024
 spe-0.023
 symp-0.023
 aston-0.022
TokenThe
Token ID464
Feature activation-2.33e-01

Pos logit contributions
lic+0.020
oto+0.019
ogo+0.018
ibi+0.018
round+0.017
Neg logit contributions
vered-0.023
ement-0.022
 spe-0.022
 symp-0.021
 aston-0.020
Token town
Token ID3240
Feature activation-1.57e+00

Pos logit contributions
vered+0.025
ement+0.024
 symp+0.024
 spe+0.023
ingly+0.023
Neg logit contributions
oto-0.015
lic-0.015
ogo-0.013
ye-0.013
round-0.013
Tokenspe
Token ID4125
Feature activation-1.02e+01

Pos logit contributions
vered+0.154
 symp+0.153
ement+0.152
 spe+0.151
 aston+0.143
Neg logit contributions
lic-0.111
oto-0.106
round-0.100
oys-0.096
ibi-0.095
Tokenople
Token ID643
Feature activation-2.07e+00

Pos logit contributions
 spe+0.815
 symp+0.798
 aston+0.715
ement+0.709
vered+0.702
Neg logit contributions
round-0.914
EEP-0.870
oys-0.870
oto-0.856
lic-0.848
Token were
Token ID547
Feature activation-8.80e-01

Pos logit contributions
ement+0.202
vered+0.201
 spe+0.194
 symp+0.193
 aston+0.181
Neg logit contributions
lic-0.154
oto-0.149
round-0.133
ogo-0.133
ibi-0.132
Token relieved
Token ID22598
Feature activation-4.11e-01

Pos logit contributions
vered+0.101
ement+0.097
 symp+0.094
 spe+0.092
 aston+0.087
Neg logit contributions
lic-0.055
oto-0.052
ibi-0.046
ogo-0.046
round-0.043
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
vered+0.051
 spe+0.047
ement+0.046
 symp+0.045
 aston+0.045
Neg logit contributions
oto-0.022
lic-0.020
oys-0.018
ogo-0.016
ibi-0.016
Token They
Token ID1119
Feature activation+3.31e-01

Pos logit contributions
vered+0.029
 spe+0.028
ement+0.027
 symp+0.027
 aston+0.025
Neg logit contributions
lic-0.018
oto-0.017
EEP-0.016
ibi-0.016
oys-0.016
Token shine
Token ID18340
Feature activation+1.53e+00

Pos logit contributions
vered+0.042
ement+0.041
 symp+0.040
 spe+0.040
 aston+0.039
Neg logit contributions
lic-0.028
oto-0.024
round-0.023
EEP-0.023
ibi-0.023
Token through
Token ID832
Feature activation+1.15e+00

Pos logit contributions
lic+0.109
oto+0.102
ock+0.095
round+0.092
ibi+0.088
Neg logit contributions
vered-0.156
 symp-0.146
 aston-0.145
ement-0.142
 spe-0.138
Token.
Token ID13
Feature activation-2.75e-01

Pos logit contributions
ruck+0.039
lic+0.034
oto+0.030
 Sam+0.027
oys+0.026
Neg logit contributions
 symp-0.147
vered-0.145
ement-0.145
-0.131
-0.130
Token The
Token ID383
Feature activation-3.73e-01

Pos logit contributions
vered+0.027
ement+0.026
 spe+0.025
 symp+0.025
 aston+0.023
Neg logit contributions
lic-0.022
oto-0.021
EEP-0.020
ibi-0.020
round-0.019
Token town
Token ID3240
Feature activation-1.55e+00

Pos logit contributions
vered+0.041
 symp+0.040
ement+0.039
 spe+0.037
ingly+0.036
Neg logit contributions
lic-0.022
oto-0.021
ruck-0.020
EEP-0.019
ogo-0.019
Tokenspe
Token ID4125
Feature activation-1.02e+01

Pos logit contributions
 symp+0.159
vered+0.158
ement+0.158
ocol+0.153
 spe+0.152
Neg logit contributions
lic-0.104
round-0.088
oto-0.087
ibi-0.081
EEP-0.079
Tokenople
Token ID643
Feature activation-2.05e+00

Pos logit contributions
 spe+0.821
 symp+0.802
 aston+0.726
ement+0.725
+0.699
Neg logit contributions
round-0.883
EEP-0.862
oys-0.828
oto-0.817
lic-0.815
Token were
Token ID547
Feature activation-8.71e-01

Pos logit contributions
ement+0.198
vered+0.197
 spe+0.193
 symp+0.191
 aston+0.179
Neg logit contributions
lic-0.157
oto-0.147
round-0.134
ogo-0.132
ibi-0.132
Token very
Token ID845
Feature activation+3.62e-01

Pos logit contributions
vered+0.095
ement+0.092
 symp+0.090
 spe+0.088
 aston+0.083
Neg logit contributions
lic-0.059
oto-0.056
ogo-0.050
ibi-0.050
round-0.046
Token happy
Token ID3772
Feature activation-7.06e-01

Pos logit contributions
oto+0.029
lic+0.029
ogo+0.027
ibi+0.027
round+0.025
Neg logit contributions
vered-0.035
ement-0.034
 symp-0.033
 spe-0.033
 aston-0.031
Token and
Token ID290
Feature activation+2.20e-01

Pos logit contributions
vered+0.083
ement+0.079
 symp+0.078
 spe+0.078
 aston+0.075
Neg logit contributions
oto-0.041
lic-0.038
oys-0.035
ogo-0.033
ibi-0.033
Token.
Token ID13
Feature activation-2.59e-01

Pos logit contributions
lic+0.089
oto+0.088
ogo+0.081
ibi+0.079
EEP+0.077
Neg logit contributions
vered-0.119
ement-0.112
 symp-0.109
 spe-0.107
 aston-0.102
Token
Token ID198
Feature activation+2.64e-01

Pos logit contributions
ement+0.022
vered+0.021
 symp+0.020
 spe+0.020
 aston+0.019
Neg logit contributions
oto-0.024
lic-0.023
ogo-0.022
ibi-0.021
round-0.021
Token
Token ID198
Feature activation+2.67e-01

Pos logit contributions
oto+0.025
lic+0.025
ogo+0.024
round+0.024
ibi+0.023
Neg logit contributions
vered-0.019
 symp-0.018
ement-0.018
 aston-0.017
 spe-0.016
TokenThe
Token ID464
Feature activation-2.08e-01

Pos logit contributions
oto+0.021
ogo+0.021
lic+0.019
ibi+0.019
round+0.017
Neg logit contributions
vered-0.025
ement-0.024
 symp-0.023
 aston-0.023
 spe-0.021
Token town
Token ID3240
Feature activation-1.54e+00

Pos logit contributions
vered+0.022
ement+0.021
 spe+0.020
 symp+0.020
 aston+0.019
Neg logit contributions
oto-0.015
lic-0.015
ibi-0.014
ogo-0.013
oys-0.013
Tokenspe
Token ID4125
Feature activation-1.02e+01

Pos logit contributions
vered+0.137
 spe+0.135
 symp+0.129
ement+0.127
 aston+0.121
Neg logit contributions
lic-0.134
oto-0.125
oys-0.118
ibi-0.117
round-0.115
Tokenople
Token ID643
Feature activation-2.04e+00

Pos logit contributions
 spe+0.836
 symp+0.807
 aston+0.724
ement+0.720
vered+0.718
Neg logit contributions
round-0.888
oys-0.864
EEP-0.863
oto-0.849
lic-0.831
Token were
Token ID547
Feature activation-8.53e-01

Pos logit contributions
 spe+0.208
vered+0.206
ement+0.205
 symp+0.191
ocol+0.186
Neg logit contributions
lic-0.141
oto-0.130
ogo-0.115
ibi-0.112
round-0.111
Token amazed
Token ID24562
Feature activation-7.86e-01

Pos logit contributions
vered+0.099
ement+0.095
 symp+0.092
 spe+0.090
+0.086
Neg logit contributions
lic-0.053
oto-0.046
ogo-0.041
ibi-0.040
round-0.036
Token by
Token ID416
Feature activation+4.71e-02

Pos logit contributions
vered+0.093
 spe+0.087
 symp+0.082
 aston+0.080
+0.078
Neg logit contributions
oto-0.052
lic-0.048
oys-0.039
ibi-0.039
ogo-0.038
Token this
Token ID428
Feature activation+5.74e-02

Pos logit contributions
oto+0.004
lic+0.003
ogo+0.003
ibi+0.003
oys+0.003
Neg logit contributions
vered-0.005
ement-0.005
 spe-0.004
 symp-0.004
 aston-0.004
Tokenâ
Token ID22940
Feature activation+9.66e-02

Pos logit contributions
oto+0.118
lic+0.116
ogo+0.111
ibi+0.108
round+0.104
Neg logit contributions
vered-0.118
ement-0.113
 spe-0.111
 symp-0.110
 aston-0.106
Token
Token ID26391
Feature activation+2.49e+00

Pos logit contributions
oto+0.008
ogo+0.008
lic+0.008
ibi+0.007
 Indians+0.007
Neg logit contributions
ement-0.009
vered-0.009
 symp-0.009
 aston-0.008
el-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.12e+00

Pos logit contributions
oto+0.217
lic+0.209
ogo+0.200
ibi+0.194
round+0.187
Neg logit contributions
vered-0.225
ement-0.217
 spe-0.214
 symp-0.210
 aston-0.203
TokenThe
Token ID464
Feature activation-1.89e-01

Pos logit contributions
vered+0.116
ement+0.110
 spe+0.109
 symp+0.108
 aston+0.103
Neg logit contributions
lic-0.083
oto-0.081
ogo-0.074
ibi-0.073
round-0.070
Token town
Token ID3240
Feature activation-1.52e+00

Pos logit contributions
vered+0.025
ement+0.023
 spe+0.023
 symp+0.022
 aston+0.022
Neg logit contributions
oto-0.010
lic-0.009
ibi-0.008
round-0.008
ogo-0.008
Tokenspe
Token ID4125
Feature activation-1.02e+01

Pos logit contributions
vered+0.160
ement+0.156
 symp+0.153
 spe+0.153
 aston+0.145
Neg logit contributions
lic-0.105
oto-0.104
ibi-0.096
round-0.095
ogo-0.093
Tokenople
Token ID643
Feature activation-2.01e+00

Pos logit contributions
 spe+0.721
vered+0.710
 symp+0.702
ement+0.683
 aston+0.646
Neg logit contributions
oto-1.000
round-0.992
lic-0.974
EEP-0.957
ibi-0.948
Token were
Token ID547
Feature activation-8.30e-01

Pos logit contributions
vered+0.211
ement+0.208
 symp+0.201
 spe+0.199
 aston+0.189
Neg logit contributions
lic-0.140
oto-0.140
ibi-0.127
round-0.125
ogo-0.125
Token walking
Token ID6155
Feature activation-1.87e+00

Pos logit contributions
vered+0.094
 spe+0.086
ement+0.083
 symp+0.081
 aston+0.076
Neg logit contributions
lic-0.056
oto-0.054
ibi-0.050
ogo-0.050
round-0.047
Token quickly
Token ID2952
Feature activation-1.52e-01

Pos logit contributions
ement+0.211
vered+0.210
 spe+0.210
 symp+0.198
 aston+0.194
Neg logit contributions
oto-0.110
round-0.106
lic-0.103
ibi-0.100
oys-0.096
Token down
Token ID866
Feature activation+1.69e+00

Pos logit contributions
vered+0.016
ement+0.016
 spe+0.015
 symp+0.015
 aston+0.014
Neg logit contributions
lic-0.010
oto-0.010
round-0.010
EEP-0.009
ibi-0.009
Token safe
Token ID3338
Feature activation+3.15e+00

Pos logit contributions
vered+0.165
ement+0.160
 symp+0.153
 spe+0.150
el+0.142
Neg logit contributions
lic-0.105
oto-0.095
ibi-0.088
round-0.088
oys-0.084
Token again
Token ID757
Feature activation-2.00e+00

Pos logit contributions
oto+0.192
lic+0.186
oys+0.183
ruck+0.172
round+0.168
Neg logit contributions
vered-0.327
ement-0.317
 spe-0.312
 symp-0.311
 aston-0.301
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
vered+0.208
ement+0.207
 symp+0.197
 aston+0.192
 spe+0.191
Neg logit contributions
lic-0.130
oto-0.128
oys-0.115
EEP-0.113
ogo-0.113
Token The
Token ID383
Feature activation-3.35e-01

Pos logit contributions
vered+0.023
ement+0.022
 spe+0.022
 symp+0.021
 aston+0.020
Neg logit contributions
lic-0.018
oto-0.018
ibi-0.017
ogo-0.017
EEP-0.016
Token town
Token ID3240
Feature activation-1.51e+00

Pos logit contributions
vered+0.037
ement+0.035
 symp+0.035
 spe+0.034
ingly+0.032
Neg logit contributions
lic-0.022
oto-0.021
ogo-0.019
 robber-0.018
 Indians-0.018
Tokenspe
Token ID4125
Feature activation-1.01e+01

Pos logit contributions
 symp+0.147
ement+0.146
vered+0.146
 spe+0.141
 aston+0.136
Neg logit contributions
lic-0.111
oto-0.103
round-0.098
oys-0.096
ibi-0.095
Tokenople
Token ID643
Feature activation-2.02e+00

Pos logit contributions
 spe+0.844
 symp+0.815
 aston+0.738
ement+0.730
+0.704
Neg logit contributions
round-0.877
EEP-0.848
oys-0.834
oto-0.806
ruck-0.784
Token thanked
Token ID26280
Feature activation-2.41e+00

Pos logit contributions
ement+0.201
vered+0.197
 symp+0.193
 spe+0.188
 aston+0.181
Neg logit contributions
lic-0.149
oto-0.141
round-0.128
ibi-0.127
ogo-0.127
Token the
Token ID262
Feature activation-1.60e-01

Pos logit contributions
vered+0.282
ement+0.270
 spe+0.256
 symp+0.250
 aston+0.247
Neg logit contributions
oto-0.144
ogo-0.115
lic-0.114
una-0.108
ock-0.106
Token bear
Token ID6842
Feature activation+1.35e+00

Pos logit contributions
vered+0.016
ement+0.016
 symp+0.015
 spe+0.015
 aston+0.014
Neg logit contributions
lic-0.012
oto-0.012
ogo-0.011
ibi-0.010
round-0.010
Token many
Token ID867
Feature activation+1.45e+00

Pos logit contributions
oto+0.084
lic+0.082
ogo+0.075
round+0.073
ibi+0.073
Neg logit contributions
vered-0.152
ement-0.147
 spe-0.146
 symp-0.139
 aston-0.138
Token
Token ID198
Feature activation+1.94e-01

Pos logit contributions
lic+0.016
oto+0.016
ogo+0.015
round+0.015
ibi+0.015
Neg logit contributions
vered-0.016
ement-0.016
 symp-0.015
 spe-0.015
 aston-0.014
TokenTim
Token ID14967
Feature activation+1.30e+00

Pos logit contributions
lic+0.012
oto+0.012
ogo+0.011
ibi+0.010
round+0.010
Neg logit contributions
vered-0.023
ement-0.022
 symp-0.021
 spe-0.021
 aston-0.021
Token and
Token ID290
Feature activation+1.42e-01

Pos logit contributions
oto+0.100
lic+0.098
ogo+0.091
ibi+0.091
round+0.087
Neg logit contributions
vered-0.129
ement-0.124
 spe-0.122
 symp-0.120
 aston-0.114
Token Tom
Token ID4186
Feature activation+1.32e-01

Pos logit contributions
lic+0.009
oto+0.008
ogo+0.008
ibi+0.007
round+0.007
Neg logit contributions
vered-0.017
ement-0.016
 symp-0.016
 spe-0.016
 aston-0.015
Token think
Token ID892
Feature activation-2.69e-01

Pos logit contributions
oto+0.009
lic+0.008
ibi+0.008
EEP+0.008
round+0.007
Neg logit contributions
vered-0.014
 spe-0.014
ement-0.014
 symp-0.013
 aston-0.012
Token about
Token ID546
Feature activation+3.57e+00

Pos logit contributions
vered+0.039
 spe+0.038
 symp+0.036
 aston+0.036
ement+0.035
Neg logit contributions
oto-0.010
ogo-0.007
lic-0.007
ibi-0.007
EEP-0.006
Token Mom
Token ID11254
Feature activation-2.54e-01

Pos logit contributions
oto+0.235
lic+0.193
ibi+0.190
ogo+0.184
oys+0.182
Neg logit contributions
vered-0.412
ement-0.386
 spe-0.385
 symp-0.371
 aston-0.364
Token's
Token ID338
Feature activation-1.37e-01

Pos logit contributions
 spe+0.031
vered+0.030
ement+0.028
+0.028
 aston+0.028
Neg logit contributions
oto-0.013
 about-0.011
oys-0.010
lic-0.010
 listen-0.010
Token idea
Token ID2126
Feature activation-2.77e+00

Pos logit contributions
vered+0.016
bugs+0.016
 spe+0.016
bug+0.015
 aston+0.014
Neg logit contributions
oto-0.008
oys-0.007
ibi-0.006
 robber-0.006
 gear-0.006
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
vered+0.288
 spe+0.279
 symp+0.262
ement+0.257
 aston+0.255
Neg logit contributions
oto-0.203
lic-0.192
oys-0.183
ibi-0.179
EEP-0.174
Token They
Token ID1119
Feature activation+2.72e-01

Pos logit contributions
vered+0.033
ement+0.032
 spe+0.031
 symp+0.031
 aston+0.029
Neg logit contributions
oto-0.026
lic-0.026
ogo-0.024
ibi-0.024
round-0.023
Token in
Token ID287
Feature activation-9.41e-01

Pos logit contributions
vered+0.038
 spe+0.035
ement+0.035
 symp+0.035
 aston+0.035
Neg logit contributions
lic-0.021
oto-0.019
ogo-0.017
oys-0.017
round-0.016
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
vered+0.107
ement+0.107
 symp+0.101
 spe+0.094
bugs+0.091
Neg logit contributions
oto-0.059
ibi-0.050
lic-0.049
oys-0.047
ogo-0.047
Token ground
Token ID2323
Feature activation+1.35e+00

Pos logit contributions
vered+0.028
ement+0.027
 spe+0.027
 symp+0.026
 aston+0.025
Neg logit contributions
oto-0.021
lic-0.021
ogo-0.019
ibi-0.019
round-0.018
Token and
Token ID290
Feature activation+1.33e-01

Pos logit contributions
lic+0.066
oto+0.062
round+0.054
oys+0.054
ogo+0.053
Neg logit contributions
vered-0.168
 spe-0.156
ement-0.153
 aston-0.153
 symp-0.153
Token the
Token ID262
Feature activation-2.76e-01

Pos logit contributions
lic+0.010
oto+0.010
ogo+0.010
ibi+0.010
round+0.009
Neg logit contributions
vered-0.013
ement-0.013
 symp-0.012
 spe-0.012
 aston-0.012
Token water
Token ID1660
Feature activation+3.51e+00

Pos logit contributions
vered+0.030
ement+0.030
 symp+0.028
 aston+0.027
 spe+0.027
Neg logit contributions
oto-0.018
lic-0.018
ibi-0.016
ogo-0.016
EEP-0.016
Token coming
Token ID2406
Feature activation+6.69e-01

Pos logit contributions
lic+0.236
oto+0.221
ogo+0.201
oys+0.199
round+0.194
Neg logit contributions
vered-0.376
ement-0.352
 aston-0.341
 symp-0.338
 spe-0.334
Token out
Token ID503
Feature activation+2.11e+00

Pos logit contributions
oto+0.049
lic+0.049
ogo+0.045
ibi+0.045
round+0.043
Neg logit contributions
vered-0.069
ement-0.067
 spe-0.064
 symp-0.064
 aston-0.062
Token of
Token ID286
Feature activation-2.61e-01

Pos logit contributions
lic+0.111
oto+0.090
ogo+0.084
oys+0.082
ock+0.079
Neg logit contributions
vered-0.262
ement-0.249
-0.243
 symp-0.241
 afar-0.239
Token it
Token ID340
Feature activation+2.81e-01

Pos logit contributions
vered+0.028
ement+0.027
 symp+0.026
 spe+0.024
 aston+0.024
Neg logit contributions
oto-0.019
lic-0.017
ogo-0.017
ibi-0.016
oys-0.015
Token.
Token ID13
Feature activation-3.68e-01

Pos logit contributions
lic+0.017
oto+0.015
ogo+0.015
EEP+0.014
round+0.014
Neg logit contributions
vered-0.031
ement-0.030
 spe-0.029
 aston-0.029
 symp-0.029
Token and
Token ID290
Feature activation+1.10e-01

Pos logit contributions
 symp+0.529
vered+0.525
ement+0.504
 aston+0.495
 spe+0.491
Neg logit contributions
oto-0.303
lic-0.293
oys-0.264
ogo-0.260
ibi-0.251
Token filled
Token ID5901
Feature activation-2.24e-01

Pos logit contributions
lic+0.007
oto+0.007
ibi+0.006
ruck+0.006
oys+0.006
Neg logit contributions
ement-0.012
 symp-0.012
vered-0.011
 spe-0.011
-0.010
Token her
Token ID607
Feature activation-7.17e-01

Pos logit contributions
vered+0.027
 spe+0.026
 symp+0.025
ement+0.025
+0.024
Neg logit contributions
oto-0.012
lic-0.011
 sound-0.010
una-0.009
oys-0.009
Token with
Token ID351
Feature activation-1.93e+00

Pos logit contributions
vered+0.074
ement+0.070
 symp+0.070
 spe+0.069
+0.063
Neg logit contributions
 cool-0.036
oto-0.036
lic-0.035
 sound-0.035
oys-0.034
Token stories
Token ID3923
Feature activation+5.31e-01

Pos logit contributions
 symp+0.227
vered+0.225
ement+0.225
+0.216
+0.213
Neg logit contributions
 cool-0.087
 water-0.080
ruck-0.080
 sound-0.077
oto-0.074
Token about
Token ID546
Feature activation+3.53e+00

Pos logit contributions
lic+0.039
oto+0.037
round+0.034
ibi+0.033
ogo+0.033
Neg logit contributions
vered-0.056
 symp-0.052
ement-0.051
 spe-0.051
 aston-0.050
Token the
Token ID262
Feature activation-3.17e-01

Pos logit contributions
oto+0.180
ruck+0.173
lic+0.154
icated+0.140
 cool+0.135
Neg logit contributions
vered-0.397
 symp-0.393
ement-0.385
 aston-0.357
 spe-0.344
Token wonder
Token ID4240
Feature activation-3.52e+00

Pos logit contributions
vered+0.038
 symp+0.035
ement+0.034
eled+0.032
ingly+0.031
Neg logit contributions
oto-0.017
ruck-0.016
 robber-0.015
 sound-0.015
 cool-0.015
Token clay
Token ID21558
Feature activation-4.84e+00

Pos logit contributions
vered+0.363
 symp+0.355
 spe+0.333
 aston+0.333
ement+0.332
Neg logit contributions
oto-0.248
lic-0.245
round-0.224
ibi-0.214
ogo-0.214
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
vered+0.521
 symp+0.519
ement+0.491
 aston+0.483
 spe+0.468
Neg logit contributions
lic-0.311
oto-0.308
oys-0.266
ogo-0.266
round-0.264
Token After
Token ID2293
Feature activation-9.97e-01

Pos logit contributions
 spe+0.045
vered+0.044
ement+0.041
 symp+0.040
 aston+0.039
Neg logit contributions
lic-0.023
ibi-0.020
oto-0.019
EEP-0.018
ogo-0.018
Token the
Token ID262
Feature activation-1.89e-01

Pos logit contributions
oto+0.140
ibi+0.124
lic+0.124
round+0.116
EEP+0.112
Neg logit contributions
vered-0.227
 spe-0.218
ement-0.211
 symp-0.197
 aston-0.190
Token man
Token ID582
Feature activation-1.00e-01

Pos logit contributions
vered+0.021
ement+0.020
 symp+0.020
 spe+0.020
 aston+0.019
Neg logit contributions
oto-0.013
lic-0.012
ibi-0.011
oys-0.011
ogo-0.010
Token said
Token ID531
Feature activation+6.10e-01

Pos logit contributions
vered+0.014
 spe+0.014
ement+0.014
+0.013
 aston+0.013
Neg logit contributions
lic-0.004
oto-0.002
round-0.002
ibi-0.002
 jumped-0.002
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
lic+0.036
oto+0.032
oys+0.029
round+0.028
ibi+0.027
Neg logit contributions
vered-0.071
 spe-0.067
ement-0.065
 afar-0.064
el-0.062
Token "
Token ID366
Feature activation+1.10e+00

Pos logit contributions
vered+0.030
ement+0.028
 spe+0.028
 symp+0.027
 aston+0.026
Neg logit contributions
lic-0.017
oto-0.017
ogo-0.015
ibi-0.015
round-0.015
TokenThat
Token ID2504
Feature activation+3.52e+00

Pos logit contributions
lic+0.082
oto+0.081
round+0.075
ibi+0.075
EEP+0.071
Neg logit contributions
ement-0.113
vered-0.111
 spe-0.106
 symp-0.102
 aston-0.099
Token handle
Token ID5412
Feature activation+1.37e+00

Pos logit contributions
oto+0.271
lic+0.264
ibi+0.246
EEP+0.242
round+0.240
Neg logit contributions
vered-0.341
ement-0.333
 symp-0.328
 spe-0.313
 aston-0.308
Token belongs
Token ID14448
Feature activation-1.86e-01

Pos logit contributions
lic+0.104
oto+0.104
ogo+0.096
ibi+0.096
round+0.092
Neg logit contributions
vered-0.135
ement-0.131
 symp-0.128
 spe-0.127
 aston-0.122
Token to
Token ID284
Feature activation-1.12e-01

Pos logit contributions
ement+0.022
vered+0.022
 symp+0.022
 spe+0.021
 aston+0.020
Neg logit contributions
lic-0.010
oto-0.010
ogo-0.009
oys-0.009
ibi-0.009
Token my
Token ID616
Feature activation+2.33e+00

Pos logit contributions
vered+0.013
ement+0.012
 symp+0.012
 spe+0.012
 aston+0.011
Neg logit contributions
oto-0.007
lic-0.007
ogo-0.006
ibi-0.006
EEP-0.006
Token axe
Token ID23272
Feature activation-4.16e-01

Pos logit contributions
oto+0.132
ibi+0.119
oys+0.113
EEP+0.109
round+0.104
Neg logit contributions
vered-0.271
ement-0.266
 symp-0.262
 spe-0.241
 aston-0.234
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
lic+0.106
oto+0.104
round+0.093
ogo+0.092
ibi+0.090
Neg logit contributions
vered-0.130
ement-0.122
 symp-0.117
 spe-0.116
 aston-0.112
Token It
Token ID632
Feature activation+8.68e-01

Pos logit contributions
vered+0.033
ement+0.031
 symp+0.030
 spe+0.030
 aston+0.029
Neg logit contributions
lic-0.027
oto-0.027
ibi-0.025
ogo-0.025
round-0.024
Token sounds
Token ID5238
Feature activation+1.89e+00

Pos logit contributions
lic+0.059
oto+0.059
ogo+0.054
ibi+0.050
round+0.049
Neg logit contributions
vered-0.092
ement-0.090
 symp-0.085
ingly-0.082
 spe-0.082
Token like
Token ID588
Feature activation-8.50e-01

Pos logit contributions
oto+0.157
lic+0.156
ogo+0.146
ibi+0.145
round+0.140
Neg logit contributions
vered-0.177
ement-0.170
 spe-0.165
 symp-0.164
 aston-0.156
Token "
Token ID366
Feature activation+1.06e+00

Pos logit contributions
vered+0.101
ement+0.099
 symp+0.095
ingly+0.090
 aston+0.089
Neg logit contributions
oto-0.052
lic-0.051
ibi-0.043
ogo-0.042
EEP-0.041
Tokent
Token ID83
Feature activation+3.42e+00

Pos logit contributions
lic+0.079
oto+0.076
round+0.073
ibi+0.072
EEP+0.069
Neg logit contributions
ement-0.109
vered-0.108
 symp-0.102
 aston-0.101
 spe-0.099
Tokenweet
Token ID7277
Feature activation+3.76e+00

Pos logit contributions
lic+0.273
oto+0.264
ibi+0.253
ogo+0.252
EEP+0.232
Neg logit contributions
vered-0.330
 spe-0.319
ement-0.311
 symp-0.301
el-0.298
Token,
Token ID11
Feature activation+8.67e-02

Pos logit contributions
oto+0.291
lic+0.288
ogo+0.269
ibi+0.269
round+0.255
Neg logit contributions
vered-0.373
ement-0.357
 spe-0.350
 symp-0.346
 aston-0.328
Token tweet
Token ID6126
Feature activation-1.88e+00

Pos logit contributions
oto+0.006
lic+0.006
ogo+0.006
ibi+0.006
round+0.005
Neg logit contributions
vered-0.009
ement-0.009
 symp-0.008
 spe-0.008
 aston-0.008
Token,
Token ID11
Feature activation+9.37e-02

Pos logit contributions
vered+0.213
ement+0.206
 spe+0.203
 symp+0.201
 aston+0.192
Neg logit contributions
oto-0.118
lic-0.114
ogo-0.107
ibi-0.107
round-0.099
Token tweet
Token ID6126
Feature activation-1.89e+00

Pos logit contributions
oto+0.007
lic+0.007
ogo+0.006
ibi+0.006
round+0.006
Neg logit contributions
vered-0.010
ement-0.009
 symp-0.009
 aston-0.009
 spe-0.008
Token waved
Token ID26834
Feature activation+1.36e+00

Pos logit contributions
vered+0.158
ement+0.156
 symp+0.150
 spe+0.149
 aston+0.143
Neg logit contributions
oto-0.099
lic-0.094
ogo-0.088
ibi-0.086
round-0.083
Token goodbye
Token ID24829
Feature activation-5.62e-01

Pos logit contributions
oto+0.078
 bye+0.064
lic+0.064
round+0.064
ibi+0.063
Neg logit contributions
vered-0.161
ement-0.157
 spe-0.153
 symp-0.149
-0.147
Token to
Token ID284
Feature activation-6.23e-02

Pos logit contributions
vered+0.064
ement+0.063
 spe+0.060
 symp+0.060
 aston+0.059
Neg logit contributions
oto-0.034
lic-0.033
round-0.031
ibi-0.031
EEP-0.030
Token her
Token ID607
Feature activation-6.10e-01

Pos logit contributions
vered+0.006
ement+0.006
 spe+0.006
 symp+0.006
 aston+0.006
Neg logit contributions
oto-0.005
ibi-0.004
lic-0.004
ogo-0.004
round-0.004
Token friends
Token ID2460
Feature activation+2.92e-01

Pos logit contributions
vered+0.074
 symp+0.070
 spe+0.069
ement+0.069
ingly+0.066
Neg logit contributions
oto-0.033
ogo-0.027
ibi-0.026
 bus-0.022
 gear-0.022
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
oto+0.017
round+0.016
lic+0.015
ibi+0.014
ogo+0.014
Neg logit contributions
vered-0.034
 spe-0.033
ement-0.032
 symp-0.030
 aston-0.030
Token went
Token ID1816
Feature activation+1.06e-01

Pos logit contributions
lic+0.017
oto+0.016
ogo+0.015
ibi+0.015
round+0.014
Neg logit contributions
vered-0.021
ement-0.020
 spe-0.020
 symp-0.020
 aston-0.019
Token home
Token ID1363
Feature activation+9.00e-01

Pos logit contributions
oto+0.010
lic+0.010
ibi+0.009
ogo+0.009
EEP+0.009
Neg logit contributions
vered-0.008
ement-0.008
 symp-0.008
 spe-0.007
 aston-0.007
Token.
Token ID13
Feature activation-2.55e-01

Pos logit contributions
lic+0.067
oto+0.064
ibi+0.060
ogo+0.058
round+0.057
Neg logit contributions
vered-0.091
ement-0.088
 spe-0.084
 symp-0.082
 aston-0.079
Token She
Token ID1375
Feature activation-5.88e-01

Pos logit contributions
vered+0.021
ement+0.021
 symp+0.020
 spe+0.020
 aston+0.019
Neg logit contributions
lic-0.023
oto-0.023
ogo-0.022
ibi-0.021
round-0.021
Token couldn
Token ID3521
Feature activation-3.88e-01

Pos logit contributions
vered+0.066
ement+0.065
 symp+0.062
 spe+0.061
 aston+0.059
Neg logit contributions
oto-0.037
lic-0.035
ogo-0.033
ibi-0.033
round-0.031
Token other
Token ID584
Feature activation+2.79e+00

Pos logit contributions
oto+0.006
lic+0.005
ibi+0.005
ogo+0.005
round+0.005
Neg logit contributions
vered-0.011
ement-0.011
 symp-0.010
 spe-0.010
 aston-0.010
Token around
Token ID1088
Feature activation-1.76e-01

Pos logit contributions
oto+0.165
round+0.154
ibi+0.152
ogo+0.149
EEP+0.147
Neg logit contributions
ement-0.312
 spe-0.300
vered-0.298
 symp-0.294
 aston-0.289
Token,
Token ID11
Feature activation+4.28e-01

Pos logit contributions
ement+0.021
vered+0.021
 symp+0.020
 spe+0.020
 aston+0.020
Neg logit contributions
oto-0.008
 robber-0.007
round-0.007
lic-0.007
oys-0.006
Token laughed
Token ID13818
Feature activation+1.23e+00

Pos logit contributions
oto+0.028
lic+0.027
ibi+0.027
round+0.026
ogo+0.025
Neg logit contributions
vered-0.046
ement-0.046
 spe-0.043
 symp-0.043
 aston-0.041
Token and
Token ID290
Feature activation+4.31e-01

Pos logit contributions
oto+0.096
lic+0.095
ibi+0.087
ogo+0.087
round+0.084
Neg logit contributions
vered-0.121
ement-0.117
 spe-0.114
 symp-0.112
 aston-0.108
Token built
Token ID3170
Feature activation+4.46e-01

Pos logit contributions
oto+0.033
lic+0.032
ogo+0.030
ibi+0.029
round+0.028
Neg logit contributions
vered-0.043
ement-0.042
 spe-0.041
 symp-0.040
 aston-0.039
Token sand
Token ID6450
Feature activation+1.59e+00

Pos logit contributions
lic+0.020
oto+0.019
 cool+0.016
 sound+0.015
 gear+0.015
Neg logit contributions
ement-0.052
vered-0.052
 spe-0.051
 symp-0.051
-0.050
Tokencast
Token ID2701
Feature activation-4.56e-01

Pos logit contributions
oto+0.102
lic+0.101
ogo+0.096
round+0.086
ibi+0.085
Neg logit contributions
vered-0.181
ement-0.169
 symp-0.169
 spe-0.166
 aston-0.165
Tokenles
Token ID829
Feature activation+2.13e+00

Pos logit contributions
vered+0.045
ement+0.044
 spe+0.043
 symp+0.043
 aston+0.041
Neg logit contributions
oto-0.035
lic-0.035
ogo-0.032
ibi-0.032
round-0.031
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
lic+0.141
oto+0.139
ogo+0.125
round+0.124
ibi+0.121
Neg logit contributions
vered-0.220
 symp-0.217
 spe-0.215
 aston-0.214
ement-0.211
Token Soon
Token ID15894
Feature activation-1.82e+00

Pos logit contributions
vered+0.018
ement+0.017
 spe+0.017
 symp+0.016
 aston+0.016
Neg logit contributions
lic-0.014
oto-0.014
ibi-0.013
ogo-0.013
round-0.013
Token day
Token ID1110
Feature activation+5.04e-01

Pos logit contributions
vered+0.073
ement+0.071
 spe+0.070
 symp+0.069
 aston+0.066
Neg logit contributions
lic-0.055
oto-0.055
ogo-0.052
ibi-0.051
EEP-0.048
Token,
Token ID11
Feature activation+2.05e-01

Pos logit contributions
oto+0.045
lic+0.044
ogo+0.042
ibi+0.041
round+0.040
Neg logit contributions
vered-0.044
ement-0.043
 spe-0.041
 symp-0.041
 aston-0.039
Token they
Token ID484
Feature activation+4.11e-01

Pos logit contributions
oto+0.013
lic+0.012
ibi+0.011
ogo+0.011
round+0.011
Neg logit contributions
vered-0.024
ement-0.023
 symp-0.022
 spe-0.022
 aston-0.021
Token find
Token ID1064
Feature activation-4.08e-01

Pos logit contributions
oto+0.029
lic+0.028
ogo+0.026
ibi+0.025
round+0.025
Neg logit contributions
vered-0.044
ement-0.043
 spe-0.041
 symp-0.041
 aston-0.039
Token a
Token ID257
Feature activation+4.09e-02

Pos logit contributions
vered+0.042
ement+0.041
 symp+0.040
 spe+0.040
 aston+0.037
Neg logit contributions
oto-0.030
lic-0.030
ogo-0.027
ibi-0.027
EEP-0.025
Token big
Token ID1263
Feature activation+2.36e-01

Pos logit contributions
oto+0.003
lic+0.003
ogo+0.002
ibi+0.002
round+0.002
Neg logit contributions
vered-0.005
ement-0.004
 symp-0.004
 spe-0.004
 aston-0.004
Token steel
Token ID7771
Feature activation-1.18e+00

Pos logit contributions
oto+0.015
lic+0.013
round+0.013
 robber+0.013
ibi+0.013
Neg logit contributions
vered-0.027
ement-0.025
 symp-0.025
 aston-0.023
ingly-0.023
Token slide
Token ID10649
Feature activation+1.55e+00

Pos logit contributions
vered+0.127
ement+0.121
 symp+0.119
 spe+0.117
 aston+0.115
Neg logit contributions
oto-0.081
lic-0.080
ogo-0.074
round-0.073
ibi-0.072
Token in
Token ID287
Feature activation-9.05e-01

Pos logit contributions
oto+0.122
lic+0.121
round+0.116
ibi+0.114
ogo+0.111
Neg logit contributions
vered-0.151
 spe-0.142
ement-0.142
 symp-0.140
 aston-0.137
Token the
Token ID262
Feature activation-2.44e-01

Pos logit contributions
vered+0.103
ement+0.102
 symp+0.099
 spe+0.097
 aston+0.088
Neg logit contributions
oto-0.058
ibi-0.050
oys-0.050
lic-0.049
ogo-0.048
Token park
Token ID3952
Feature activation+1.03e-01

Pos logit contributions
vered+0.023
ement+0.023
 spe+0.022
 symp+0.022
 aston+0.021
Neg logit contributions
oto-0.020
lic-0.019
ogo-0.018
ibi-0.018
round-0.017
Token has
Token ID468
Feature activation-1.21e+00

Pos logit contributions
ogo+0.004
oto+0.004
lic+0.004
haw+0.004
ibi+0.004
Neg logit contributions
vered-0.009
ement-0.009
 symp-0.009
 spe-0.009
ingly-0.009
Token a
Token ID257
Feature activation-7.73e-02

Pos logit contributions
ement+0.130
vered+0.127
 symp+0.123
 spe+0.116
 aston+0.114
Neg logit contributions
oto-0.085
lic-0.082
ibi-0.077
ogo-0.076
EEP-0.073
Token scratch
Token ID12692
Feature activation-2.64e+00

Pos logit contributions
vered+0.009
ement+0.009
 symp+0.008
el+0.008
ingly+0.008
Neg logit contributions
oto-0.005
lic-0.004
ibi-0.004
ogo-0.004
 robber-0.004
Token on
Token ID319
Feature activation+3.54e-01

Pos logit contributions
vered+0.260
ement+0.252
 symp+0.246
 spe+0.246
 aston+0.240
Neg logit contributions
oto-0.204
lic-0.201
ogo-0.186
ibi-0.185
round-0.183
Token his
Token ID465
Feature activation+1.36e+00

Pos logit contributions
oto+0.018
lic+0.017
ogo+0.016
ibi+0.016
round+0.014
Neg logit contributions
vered-0.045
ement-0.043
 spe-0.042
 symp-0.042
 aston-0.041
Token hand
Token ID1021
Feature activation-6.84e-01

Pos logit contributions
oto+0.089
ibi+0.082
ogo+0.080
lic+0.080
round+0.079
Neg logit contributions
vered-0.149
ement-0.144
 spe-0.138
 symp-0.135
 aston-0.134
Token,
Token ID11
Feature activation+5.03e-02

Pos logit contributions
vered+0.067
ement+0.063
 spe+0.063
 symp+0.061
 aston+0.061
Neg logit contributions
oto-0.054
lic-0.052
ibi-0.050
round-0.049
ogo-0.049
Token but
Token ID475
Feature activation+4.05e-01

Pos logit contributions
oto+0.003
lic+0.003
ibi+0.003
ogo+0.003
round+0.003
Neg logit contributions
vered-0.006
ement-0.006
 symp-0.005
 spe-0.005
 aston-0.005
Token he
Token ID339
Feature activation+2.43e-01

Pos logit contributions
oto+0.025
lic+0.024
ogo+0.022
ibi+0.022
round+0.020
Neg logit contributions
vered-0.047
ement-0.045
 spe-0.044
 symp-0.044
 aston-0.042
Token is
Token ID318
Feature activation-6.13e-01

Pos logit contributions
oto+0.015
ogo+0.015
lic+0.014
ibi+0.013
una+0.012
Neg logit contributions
vered-0.027
ement-0.027
 symp-0.026
 spe-0.025
 aston-0.025
Token fine
Token ID3734
Feature activation+7.01e-01

Pos logit contributions
vered+0.062
ement+0.059
 symp+0.057
 spe+0.057
 aston+0.054
Neg logit contributions
oto-0.047
lic-0.047
ogo-0.043
ibi-0.043
round-0.041
Token and
Token ID290
Feature activation+9.72e-02

Pos logit contributions
oto+0.150
lic+0.144
ogo+0.137
round+0.133
ibi+0.132
Neg logit contributions
vered-0.228
ement-0.224
 spe-0.220
 symp-0.212
 aston-0.206
Token threw
Token ID9617
Feature activation-7.74e-01

Pos logit contributions
oto+0.008
lic+0.007
ogo+0.007
ibi+0.007
round+0.007
Neg logit contributions
vered-0.010
 spe-0.009
ement-0.009
 symp-0.009
 aston-0.009
Token it
Token ID340
Feature activation+2.65e-01

Pos logit contributions
ement+0.087
vered+0.084
 spe+0.082
 symp+0.082
 aston+0.077
Neg logit contributions
oto-0.048
lic-0.043
ibi-0.041
haw-0.041
ogo-0.040
Token at
Token ID379
Feature activation-1.28e+00

Pos logit contributions
lic+0.017
oto+0.017
ogo+0.016
EEP+0.015
ibi+0.015
Neg logit contributions
ement-0.028
 spe-0.027
vered-0.027
 symp-0.026
 aston-0.025
Token the
Token ID262
Feature activation-3.18e-01

Pos logit contributions
vered+0.142
ement+0.138
 spe+0.133
 symp+0.130
bugs+0.128
Neg logit contributions
oto-0.090
ogo-0.070
una-0.068
ibi-0.064
round-0.063
Token shark
Token ID21027
Feature activation+2.61e+00

Pos logit contributions
vered+0.034
ement+0.033
 spe+0.032
 symp+0.032
 aston+0.030
Neg logit contributions
oto-0.022
lic-0.022
ogo-0.020
ibi-0.020
round-0.019
Token's
Token ID338
Feature activation-1.82e-01

Pos logit contributions
lic+0.198
oto+0.196
ogo+0.188
round+0.180
ibi+0.178
Neg logit contributions
ement-0.254
vered-0.247
 spe-0.245
 symp-0.235
 aston-0.229
Token nose
Token ID9686
Feature activation+1.73e+00

Pos logit contributions
ement+0.019
vered+0.019
 spe+0.018
 symp+0.018
 aston+0.017
Neg logit contributions
oto-0.013
ogo-0.012
ibi-0.011
EEP-0.011
oys-0.011
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
lic+0.136
oto+0.135
ogo+0.126
ibi+0.125
round+0.122
Neg logit contributions
vered-0.164
ement-0.159
 spe-0.157
 symp-0.153
 aston-0.151
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
vered+0.037
ement+0.036
 spe+0.036
 symp+0.034
 aston+0.033
Neg logit contributions
oto-0.028
lic-0.028
ogo-0.025
ibi-0.025
round-0.025
Token
Token ID198
Feature activation+1.62e-01

Pos logit contributions
oto+0.013
lic+0.013
ibi+0.013
ogo+0.013
round+0.012
Neg logit contributions
vered-0.013
ement-0.013
 symp-0.012
 spe-0.012
 aston-0.012
Token and
Token ID290
Feature activation+1.58e-01

Pos logit contributions
vered+0.232
ement+0.218
 symp+0.214
 aston+0.208
 spe+0.204
Neg logit contributions
oto-0.183
lic-0.178
ogo-0.167
round-0.158
ibi-0.156
Token pizza
Token ID14256
Feature activation-1.32e+00

Pos logit contributions
oto+0.011
lic+0.011
ibi+0.009
ogo+0.009
round+0.009
Neg logit contributions
vered-0.017
ement-0.017
 symp-0.016
 aston-0.015
 spe-0.015
Token.
Token ID13
Feature activation-3.22e-01

Pos logit contributions
vered+0.118
ement+0.112
 symp+0.110
 spe+0.108
 aston+0.104
Neg logit contributions
oto-0.116
lic-0.115
ogo-0.107
ibi-0.105
round-0.103
Token John
Token ID1757
Feature activation-1.58e+00

Pos logit contributions
vered+0.030
ement+0.029
 spe+0.028
 symp+0.028
 aston+0.026
Neg logit contributions
oto-0.027
lic-0.027
ogo-0.025
ibi-0.025
round-0.024
Token always
Token ID1464
Feature activation-6.09e-01

Pos logit contributions
ement+0.175
vered+0.173
 symp+0.166
 spe+0.163
 aston+0.161
Neg logit contributions
oto-0.103
lic-0.097
ogo-0.089
ibi-0.086
round-0.084
Token thanked
Token ID26280
Feature activation-2.51e+00

Pos logit contributions
vered+0.060
ement+0.058
 spe+0.057
 symp+0.056
 aston+0.054
Neg logit contributions
oto-0.047
lic-0.046
ogo-0.043
ibi-0.043
round-0.041
Token her
Token ID607
Feature activation-6.61e-01

Pos logit contributions
vered+0.257
ement+0.244
 spe+0.237
 symp+0.236
 aston+0.226
Neg logit contributions
oto-0.192
lic-0.186
ogo-0.174
ibi-0.172
round-0.165
Token.
Token ID13
Feature activation-3.20e-01

Pos logit contributions
vered+0.066
ement+0.064
 symp+0.063
 spe+0.062
 aston+0.059
Neg logit contributions
oto-0.049
lic-0.049
ogo-0.046
ibi-0.045
round-0.044
Token
Token ID198
Feature activation+2.16e-01

Pos logit contributions
vered+0.029
ement+0.029
 symp+0.027
 spe+0.027
 aston+0.026
Neg logit contributions
lic-0.027
oto-0.027
ogo-0.025
ibi-0.024
round-0.023
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
lic+0.021
oto+0.021
oys+0.020
ogo+0.020
EEP+0.020
Neg logit contributions
 symp-0.015
ement-0.014
vered-0.014
 aston-0.013
 spe-0.012
TokenAt
Token ID2953
Feature activation-6.95e-01

Pos logit contributions
lic+0.022
oto+0.022
ogo+0.021
ibi+0.021
EEP+0.020
Neg logit contributions
vered-0.017
ement-0.016
 symp-0.015
 spe-0.015
 aston-0.015
Token as
Token ID355
Feature activation-2.72e-01

Pos logit contributions
lic+0.116
oto+0.114
round+0.105
ibi+0.101
ogo+0.101
Neg logit contributions
vered-0.163
ement-0.157
 symp-0.156
 spe-0.155
 aston-0.154
Token he
Token ID339
Feature activation+2.29e-01

Pos logit contributions
vered+0.029
ement+0.028
 symp+0.027
 spe+0.027
 aston+0.026
Neg logit contributions
oto-0.019
lic-0.019
ogo-0.017
ibi-0.017
round-0.016
Token dropped
Token ID5710
Feature activation+7.56e-01

Pos logit contributions
lic+0.016
oto+0.016
ogo+0.015
ibi+0.014
round+0.014
Neg logit contributions
vered-0.024
ement-0.023
 symp-0.023
 spe-0.023
 aston-0.022
Token into
Token ID656
Feature activation+2.23e-01

Pos logit contributions
oto+0.044
lic+0.040
ibi+0.036
round+0.036
ogo+0.036
Neg logit contributions
ement-0.085
vered-0.085
 symp-0.081
 spe-0.079
 aston-0.078
Token the
Token ID262
Feature activation-3.13e-01

Pos logit contributions
oto+0.010
lic+0.009
ruck+0.008
ogo+0.008
ibi+0.008
Neg logit contributions
ement-0.028
vered-0.027
 symp-0.026
-0.025
ingly-0.024
Token pond
Token ID16723
Feature activation-1.99e+00

Pos logit contributions
vered+0.040
ement+0.038
 symp+0.037
 spe+0.037
 aston+0.036
Neg logit contributions
lic-0.016
oto-0.015
ibi-0.013
ogo-0.013
round-0.013
Token.
Token ID13
Feature activation-3.78e-01

Pos logit contributions
vered+0.211
ement+0.201
 spe+0.195
 symp+0.193
 aston+0.192
Neg logit contributions
lic-0.132
oto-0.130
round-0.124
ogo-0.116
ibi-0.115
Token
Token ID220
Feature activation-1.32e+00

Pos logit contributions
vered+0.037
 spe+0.036
ement+0.035
 symp+0.034
 aston+0.032
Neg logit contributions
lic-0.029
oto-0.029
ibi-0.027
ogo-0.027
round-0.027
Token
Token ID198
Feature activation+1.56e-01

Pos logit contributions
vered+0.130
ement+0.125
 spe+0.122
 symp+0.121
 aston+0.115
Neg logit contributions
oto-0.102
lic-0.102
ogo-0.095
ibi-0.094
round-0.090
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
oto+0.013
lic+0.013
ogo+0.012
ibi+0.012
round+0.012
Neg logit contributions
vered-0.014
ement-0.013
 symp-0.013
 spe-0.013
 aston-0.012
TokenAnd
Token ID1870
Feature activation+9.46e-02

Pos logit contributions
oto+0.013
lic+0.013
ogo+0.012
ibi+0.012
round+0.011
Neg logit contributions
vered-0.014
ement-0.013
 symp-0.013
 spe-0.013
 aston-0.013
Token to
Token ID284
Feature activation-1.41e-01

Pos logit contributions
lic+0.004
oto+0.004
ogo+0.004
ibi+0.004
round+0.004
Neg logit contributions
vered-0.006
 spe-0.005
ement-0.005
 symp-0.005
 aston-0.005
Token protect
Token ID1805
Feature activation+4.14e-01

Pos logit contributions
vered+0.011
 spe+0.010
 symp+0.009
 aston+0.009
ement+0.009
Neg logit contributions
lic-0.014
oto-0.014
ogo-0.013
ibi-0.013
round-0.012
Token Kitty
Token ID21168
Feature activation+1.30e+00

Pos logit contributions
oto+0.031
lic+0.030
ogo+0.028
ibi+0.028
round+0.027
Neg logit contributions
vered-0.042
ement-0.042
 symp-0.040
 spe-0.039
 aston-0.038
Token from
Token ID422
Feature activation+1.05e+00

Pos logit contributions
oto+0.080
ogo+0.070
ibi+0.070
ty+0.069
lic+0.068
Neg logit contributions
ement-0.145
vered-0.141
 spe-0.140
 aston-0.138
joy-0.128
Token the
Token ID262
Feature activation-2.76e-01

Pos logit contributions
oto+0.065
ogo+0.061
una+0.056
 robber+0.055
ibi+0.053
Neg logit contributions
ement-0.120
vered-0.114
 aston-0.112
 symp-0.111
 spe-0.107
Token itch
Token ID49700
Feature activation-2.49e+00

Pos logit contributions
vered+0.031
ement+0.030
 symp+0.029
 aston+0.028
 spe+0.028
Neg logit contributions
oto-0.018
lic-0.017
ibi-0.016
EEP-0.016
ogo-0.016
Token,
Token ID11
Feature activation+6.49e-02

Pos logit contributions
vered+0.259
ement+0.252
 aston+0.245
 spe+0.244
 symp+0.238
Neg logit contributions
oto-0.174
lic-0.170
ogo-0.160
EEP-0.160
ibi-0.160
Token so
Token ID523
Feature activation-4.71e-01

Pos logit contributions
oto+0.005
lic+0.005
ibi+0.004
ogo+0.004
round+0.004
Neg logit contributions
vered-0.007
ement-0.007
 spe-0.006
 symp-0.006
 aston-0.006
Token he
Token ID339
Feature activation+2.62e-01

Pos logit contributions
vered+0.048
 spe+0.046
ement+0.046
 symp+0.045
el+0.042
Neg logit contributions
oto-0.033
lic-0.033
ogo-0.031
ibi-0.030
round-0.030
Token decided
Token ID3066
Feature activation-2.60e-01

Pos logit contributions
ibi+0.024
oto+0.024
oys+0.023
ogo+0.023
round+0.022
Neg logit contributions
vered-0.021
 spe-0.020
 symp-0.018
 sadly-0.018
bug-0.017
Token to
Token ID284
Feature activation-1.29e-01

Pos logit contributions
vered+0.018
 sadly+0.017
 symp+0.016
 spe+0.016
ement+0.015
Neg logit contributions
lic-0.027
ogo-0.026
oto-0.025
ibi-0.025
icated-0.024
Token on
Token ID319
Feature activation+3.58e-01

Pos logit contributions
lic+0.057
oto+0.055
ibi+0.051
ogo+0.051
round+0.050
Neg logit contributions
vered-0.053
ement-0.051
 symp-0.049
 spe-0.049
 aston-0.046
Token the
Token ID262
Feature activation-2.99e-01

Pos logit contributions
oto+0.023
ogo+0.021
ibi+0.021
lic+0.021
round+0.021
Neg logit contributions
vered-0.038
ement-0.038
 symp-0.037
 spe-0.036
ingly-0.035
Token slide
Token ID10649
Feature activation+1.47e+00

Pos logit contributions
vered+0.034
ement+0.033
 symp+0.032
 spe+0.032
 aston+0.031
Neg logit contributions
lic-0.019
oto-0.019
ogo-0.017
ibi-0.016
oys-0.015
Token first
Token ID717
Feature activation-9.48e-02

Pos logit contributions
oto+0.119
lic+0.119
ogo+0.111
ibi+0.110
round+0.109
Neg logit contributions
vered-0.139
ement-0.133
 spe-0.130
 symp-0.129
 aston-0.125
Token,
Token ID11
Feature activation+4.82e-02

Pos logit contributions
vered+0.011
ement+0.011
 symp+0.011
 spe+0.011
 aston+0.010
Neg logit contributions
oto-0.006
lic-0.005
ogo-0.005
ibi-0.005
round-0.005
Token then
Token ID788
Feature activation-3.35e+00

Pos logit contributions
oto+0.004
lic+0.004
ogo+0.003
ibi+0.003
round+0.003
Neg logit contributions
vered-0.005
ement-0.005
 spe-0.004
 symp-0.004
 aston-0.004
Token we
Token ID356
Feature activation+2.06e-01

Pos logit contributions
vered+0.380
ement+0.368
 spe+0.359
 symp+0.358
 aston+0.343
Neg logit contributions
oto-0.211
lic-0.209
ogo-0.190
ibi-0.189
round-0.180
Token can
Token ID460
Feature activation-1.06e+00

Pos logit contributions
lic+0.014
oto+0.013
ogo+0.013
ibi+0.012
round+0.012
Neg logit contributions
vered-0.022
ement-0.022
 symp-0.021
 spe-0.021
 aston-0.020
Token go
Token ID467
Feature activation-3.79e-02

Pos logit contributions
vered+0.120
ement+0.116
 spe+0.113
 symp+0.113
 aston+0.108
Neg logit contributions
lic-0.067
oto-0.067
ogo-0.060
ibi-0.059
round-0.056
Token on
Token ID319
Feature activation+3.55e-01

Pos logit contributions
vered+0.003
ement+0.003
 spe+0.003
 symp+0.003
el+0.003
Neg logit contributions
oto-0.003
lic-0.003
ogo-0.003
ibi-0.003
una-0.003
Token the
Token ID262
Feature activation-2.88e-01

Pos logit contributions
oto+0.022
round+0.020
ogo+0.020
ibi+0.020
EEP+0.020
Neg logit contributions
vered-0.039
ement-0.037
 symp-0.037
 spe-0.036
ingly-0.036
Token.
Token ID13
Feature activation-3.22e-01

Pos logit contributions
vered+0.138
 spe+0.127
ement+0.126
 aston+0.124
 symp+0.122
Neg logit contributions
oto-0.074
una-0.061
ock-0.060
ogo-0.060
lic-0.056
Token
Token ID198
Feature activation+2.04e-01

Pos logit contributions
vered+0.036
ement+0.035
 spe+0.034
 symp+0.032
 aston+0.030
Neg logit contributions
oto-0.021
lic-0.021
ibi-0.020
EEP-0.020
oys-0.019
Token
Token ID198
Feature activation+2.04e-01

Pos logit contributions
oto+0.016
lic+0.016
ogo+0.015
ibi+0.015
round+0.014
Neg logit contributions
vered-0.020
ement-0.019
 spe-0.019
 symp-0.019
 aston-0.018
TokenHer
Token ID9360
Feature activation+1.31e+00

Pos logit contributions
lic+0.017
oto+0.017
ogo+0.015
ibi+0.015
round+0.015
Neg logit contributions
vered-0.019
ement-0.018
 symp-0.018
 spe-0.018
 aston-0.017
Token mom
Token ID1995
Feature activation-9.84e-02

Pos logit contributions
oto+0.078
ibi+0.070
lic+0.068
ogo+0.067
EEP+0.064
Neg logit contributions
vered-0.163
ement-0.154
 symp-0.148
 spe-0.147
 aston-0.138
Token smiles
Token ID21845
Feature activation-3.59e+00

Pos logit contributions
vered+0.010
 spe+0.010
ement+0.010
 symp+0.009
 aston+0.009
Neg logit contributions
lic-0.007
oto-0.007
ogo-0.007
ibi-0.006
round-0.006
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
vered+0.385
 spe+0.365
ement+0.359
 symp+0.334
 aston+0.329
Neg logit contributions
oto-0.249
lic-0.244
EEP-0.212
ogo-0.209
ibi-0.208
Token She
Token ID1375
Feature activation-6.85e-01

Pos logit contributions
vered+0.027
ement+0.025
 spe+0.024
 symp+0.024
 aston+0.023
Neg logit contributions
oto-0.033
lic-0.033
ogo-0.031
ibi-0.031
round-0.030
Token hugs
Token ID40657
Feature activation+1.21e+00

Pos logit contributions
vered+0.071
ement+0.069
 spe+0.067
 symp+0.067
 aston+0.064
Neg logit contributions
oto-0.049
lic-0.049
ogo-0.045
ibi-0.045
round-0.043
Token Lily
Token ID20037
Feature activation+4.39e-01

Pos logit contributions
oto+0.098
lic+0.088
ogo+0.086
ibi+0.085
una+0.082
Neg logit contributions
vered-0.125
ement-0.117
 spe-0.113
 symp-0.110
 aston-0.110
Token.
Token ID13
Feature activation-3.66e-01

Pos logit contributions
oto+0.026
ibi+0.023
round+0.022
lic+0.022
ogo+0.022
Neg logit contributions
 spe-0.051
vered-0.051
ement-0.049
 aston-0.047
ocol-0.045
Token shouldn
Token ID6584
Feature activation-1.17e+00

Pos logit contributions
vered+0.103
ement+0.095
 spe+0.094
 symp+0.092
 aston+0.090
Neg logit contributions
oto-0.101
lic-0.094
ibi-0.094
ogo-0.093
round-0.093
Token't
Token ID470
Feature activation+1.62e+00

Pos logit contributions
vered+0.109
ement+0.106
 spe+0.103
 symp+0.103
 aston+0.099
Neg logit contributions
lic-0.095
oto-0.093
ogo-0.087
round-0.086
ibi-0.086
Token have
Token ID423
Feature activation-1.32e-01

Pos logit contributions
oto+0.113
lic+0.112
ogo+0.103
ibi+0.103
round+0.098
Neg logit contributions
vered-0.172
ement-0.167
 spe-0.162
 symp-0.162
 aston-0.154
Token stolen
Token ID9909
Feature activation-1.52e+00

Pos logit contributions
vered+0.014
ement+0.014
 symp+0.014
 spe+0.014
 aston+0.013
Neg logit contributions
oto-0.009
lic-0.009
ogo-0.008
ibi-0.008
round-0.008
Token the
Token ID262
Feature activation-3.17e-01

Pos logit contributions
vered+0.175
ement+0.169
 symp+0.168
 spe+0.165
 aston+0.160
Neg logit contributions
oto-0.095
lic-0.085
ogo-0.083
ibi-0.080
round-0.079
Token cherry
Token ID23612
Feature activation-6.54e+00

Pos logit contributions
vered+0.031
 symp+0.030
ement+0.029
 aston+0.028
ingly+0.027
Neg logit contributions
oto-0.025
ibi-0.023
lic-0.023
round-0.023
EEP-0.022
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
vered+0.737
 symp+0.707
ement+0.705
 aston+0.688
 spe+0.681
Neg logit contributions
oto-0.408
lic-0.405
round-0.392
ogo-0.371
ibi-0.367
Token Next
Token ID7406
Feature activation-1.72e+00

Pos logit contributions
vered+0.036
ement+0.034
 spe+0.033
 symp+0.033
 aston+0.032
Neg logit contributions
oto-0.030
lic-0.030
ogo-0.028
ibi-0.028
round-0.026
Token time
Token ID640
Feature activation-2.33e+00

Pos logit contributions
vered+0.163
ement+0.158
 symp+0.153
 spe+0.152
 aston+0.146
Neg logit contributions
oto-0.143
lic-0.139
ogo-0.130
ibi-0.128
EEP-0.124
Token,
Token ID11
Feature activation+3.75e-02

Pos logit contributions
vered+0.218
ement+0.209
 spe+0.203
 symp+0.202
 aston+0.193
Neg logit contributions
oto-0.192
lic-0.191
ogo-0.178
ibi-0.177
round-0.169
Token just
Token ID655
Feature activation-6.63e-01

Pos logit contributions
lic+0.003
oto+0.003
ogo+0.003
ibi+0.003
round+0.002
Neg logit contributions
vered-0.004
ement-0.004
 spe-0.004
 symp-0.004
 aston-0.003
Token them
Token ID606
Feature activation-2.61e+00

Pos logit contributions
oto+0.184
oys+0.162
ibi+0.160
lic+0.156
una+0.150
Neg logit contributions
vered-0.266
ement-0.260
 spe-0.253
 aston-0.236
 symp-0.225
Token put
Token ID1234
Feature activation-1.95e-01

Pos logit contributions
vered+0.275
ement+0.273
 spe+0.260
 symp+0.260
 aston+0.245
Neg logit contributions
lic-0.186
oto-0.182
ibi-0.167
ogo-0.165
round-0.162
Token the
Token ID262
Feature activation-2.39e-01

Pos logit contributions
vered+0.023
 symp+0.021
ement+0.021
 aston+0.020
 distress+0.020
Neg logit contributions
oto-0.011
lic-0.009
una-0.009
oys-0.009
ogo-0.009
Token cards
Token ID4116
Feature activation-3.40e+00

Pos logit contributions
vered+0.025
 symp+0.025
ement+0.024
ingly+0.022
 aston+0.022
Neg logit contributions
oto-0.017
lic-0.016
oys-0.016
EEP-0.015
 robber-0.015
Token in
Token ID287
Feature activation-9.11e-01

Pos logit contributions
vered+0.386
ement+0.373
 symp+0.359
 spe+0.355
 aston+0.341
Neg logit contributions
oto-0.214
lic-0.198
ogo-0.187
round-0.187
oys-0.182
Token envelop
Token ID16441
Feature activation-6.39e+00

Pos logit contributions
vered+0.124
ement+0.122
 symp+0.112
used+0.109
ingly+0.103
Neg logit contributions
oto-0.033
oys-0.029
ibi-0.026
 my-0.025
ruck-0.025
Tokenes
Token ID274
Feature activation+5.91e-01

Pos logit contributions
vered+0.542
 spe+0.531
 symp+0.518
ement+0.506
 aston+0.495
Neg logit contributions
oto-0.567
lic-0.559
ogo-0.543
ibi-0.513
oys-0.509
Token and
Token ID290
Feature activation+1.83e-01

Pos logit contributions
oto+0.028
lic+0.027
round+0.025
oys+0.023
ogo+0.023
Neg logit contributions
vered-0.072
ement-0.071
 symp-0.070
 spe-0.069
 aston-0.068
Token write
Token ID3551
Feature activation-1.34e+00

Pos logit contributions
oto+0.013
lic+0.012
ibi+0.011
oys+0.011
ogo+0.011
Neg logit contributions
ement-0.020
vered-0.019
 symp-0.019
 spe-0.017
 aston-0.017
Token grandma
Token ID49890
Feature activation-5.39e-01

Pos logit contributions
vered+0.137
 spe+0.128
ement+0.128
 symp+0.127
 aston+0.122
Neg logit contributions
lic-0.104
oto-0.104
ogo-0.090
ibi-0.088
oys-0.086
Token's
Token ID338
Feature activation-8.77e-02

Pos logit contributions
vered+0.054
ement+0.053
 symp+0.051
 spe+0.051
 aston+0.050
Neg logit contributions
lic-0.040
oto-0.039
round-0.035
ibi-0.035
una-0.034
TokenThat
Token ID2504
Feature activation+3.43e+00

Pos logit contributions
oto+0.085
lic+0.085
ibi+0.078
ogo+0.077
round+0.075
Neg logit contributions
vered-0.092
ement-0.088
 symp-0.085
 spe-0.085
 aston-0.081
Token's
Token ID338
Feature activation-1.76e-01

Pos logit contributions
lic+0.350
oto+0.346
ogo+0.334
ibi+0.322
round+0.320
Neg logit contributions
vered-0.252
ement-0.242
 spe-0.233
 symp-0.229
 aston-0.216
Token good
Token ID922
Feature activation+1.86e+00

Pos logit contributions
 symp+0.015
ement+0.014
vered+0.014
 spe+0.014
 aston+0.014
Neg logit contributions
lic-0.016
round-0.015
oto-0.015
ogo-0.015
ibi-0.015
Token,
Token ID11
Feature activation+4.17e-02

Pos logit contributions
lic+0.176
oto+0.176
ogo+0.166
ibi+0.164
round+0.160
Neg logit contributions
vered-0.151
ement-0.146
 spe-0.141
 symp-0.139
 aston-0.131
Token my
Token ID616
Feature activation+2.27e+00

Pos logit contributions
lic+0.004
oto+0.003
ogo+0.003
ibi+0.003
round+0.003
Neg logit contributions
ement-0.004
vered-0.004
 spe-0.004
 symp-0.004
 aston-0.004
Token de
Token ID390
Feature activation-5.94e+00

Pos logit contributions
oto+0.159
lic+0.158
ogo+0.147
ibi+0.143
round+0.139
Neg logit contributions
vered-0.241
ement-0.235
 symp-0.228
 spe-0.227
 aston-0.218
Tokenars
Token ID945
Feature activation+9.77e-01

Pos logit contributions
ement+0.543
vered+0.537
 spe+0.509
 symp+0.488
 aston+0.455
Neg logit contributions
oto-0.486
ibi-0.481
lic-0.472
ogo-0.462
 Indians-0.408
Token.
Token ID13
Feature activation-3.67e-01

Pos logit contributions
oto+0.073
lic+0.073
ogo+0.068
ibi+0.065
oys+0.065
Neg logit contributions
vered-0.098
ement-0.094
 symp-0.093
 spe-0.093
 aston-0.090
Token But
Token ID887
Feature activation+2.90e-01

Pos logit contributions
vered+0.032
ement+0.031
 spe+0.031
 symp+0.030
 aston+0.029
Neg logit contributions
oto-0.033
lic-0.032
ogo-0.031
ibi-0.030
round-0.028
Token you
Token ID345
Feature activation-1.13e+00

Pos logit contributions
oto+0.020
lic+0.019
ogo+0.018
ibi+0.017
oys+0.017
Neg logit contributions
vered-0.031
ement-0.030
 symp-0.029
 spe-0.029
 aston-0.028
Token still
Token ID991
Feature activation-2.41e+00

Pos logit contributions
vered+0.104
 spe+0.101
ement+0.098
 symp+0.097
 aston+0.094
Neg logit contributions
oto-0.094
lic-0.092
ibi-0.087
ogo-0.087
round-0.084
Token looked
Token ID3114
Feature activation+1.37e+00

Pos logit contributions
ement+0.125
vered+0.124
 symp+0.119
 spe+0.112
 aston+0.112
Neg logit contributions
lic-0.062
oto-0.059
ibi-0.052
EEP-0.051
round-0.051
Token she
Token ID673
Feature activation-1.08e+00

Pos logit contributions
oto+0.092
lic+0.084
ogo+0.080
round+0.077
ibi+0.077
Neg logit contributions
vered-0.145
ement-0.141
 symp-0.141
 aston-0.134
 spe-0.133
Token saw
Token ID2497
Feature activation-7.63e-01

Pos logit contributions
ement+0.138
vered+0.135
 symp+0.130
 aston+0.129
 spe+0.125
Neg logit contributions
lic-0.040
oto-0.036
ogo-0.030
EEP-0.029
 jumped-0.029
Token people
Token ID661
Feature activation+6.42e-01

Pos logit contributions
vered+0.088
 symp+0.088
ement+0.085
 spe+0.082
 aston+0.081
Neg logit contributions
oto-0.046
lic-0.042
ibi-0.038
ock-0.036
ruck-0.035
Token shouting
Token ID19642
Feature activation+5.79e-01

Pos logit contributions
lic+0.030
oto+0.029
ock+0.028
round+0.027
ruck+0.027
Neg logit contributions
vered-0.079
ement-0.071
 symp-0.071
 spe-0.070
 aston-0.069
Token angrily
Token ID34425
Feature activation-7.09e+00

Pos logit contributions
lic+0.042
oto+0.041
ibi+0.037
round+0.036
EEP+0.035
Neg logit contributions
vered-0.060
ement-0.057
 spe-0.057
 symp-0.055
 aston-0.053
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
vered+0.706
ement+0.692
 spe+0.673
 symp+0.650
 aston+0.636
Neg logit contributions
oto-0.536
lic-0.528
ibi-0.471
ogo-0.467
round-0.463
Token having
Token ID1719
Feature activation+2.99e-01

Pos logit contributions
lic+0.007
oto+0.007
ibi+0.006
ogo+0.006
 Indians+0.006
Neg logit contributions
ement-0.011
vered-0.011
 symp-0.010
 spe-0.010
 aston-0.010
Token terrible
Token ID7818
Feature activation-4.01e-01

Pos logit contributions
oto+0.009
lic+0.009
ibi+0.007
 cool+0.006
 Indians+0.005
Neg logit contributions
ement-0.043
vered-0.042
 symp-0.041
-0.040
ingly-0.039
Token fights
Token ID11418
Feature activation-3.34e+00

Pos logit contributions
vered+0.043
 symp+0.043
 spe+0.042
ement+0.041
 aston+0.039
Neg logit contributions
oto-0.026
lic-0.025
ibi-0.023
oys-0.023
EEP-0.023
Token.
Token ID13
Feature activation-3.72e-01

Pos logit contributions
vered+0.362
ement+0.352
 symp+0.351
 spe+0.347
 aston+0.338
Neg logit contributions
lic-0.218
oto-0.207
oys-0.186
ibi-0.186
ock-0.182
Token an
Token ID281
Feature activation-1.28e+00

Pos logit contributions
ement+0.151
vered+0.149
 spe+0.148
 symp+0.146
 aston+0.138
Neg logit contributions
oto-0.114
lic-0.114
ibi-0.103
ogo-0.101
round-0.098
Token important
Token ID1593
Feature activation+9.12e-01

Pos logit contributions
vered+0.146
 spe+0.140
ement+0.139
 symp+0.136
 aston+0.131
Neg logit contributions
oto-0.080
lic-0.076
ibi-0.071
ogo-0.070
round-0.068
Token lesson
Token ID11483
Feature activation+1.88e+00

Pos logit contributions
oto+0.042
lic+0.039
ibi+0.036
round+0.033
oys+0.033
Neg logit contributions
vered-0.119
ement-0.117
 spe-0.113
 symp-0.113
 aston-0.109
Tokenâ
Token ID22940
Feature activation-2.82e-02

Pos logit contributions
oto+0.139
lic+0.137
ogo+0.127
ibi+0.126
round+0.121
Neg logit contributions
vered-0.191
ement-0.184
 spe-0.180
 symp-0.180
 aston-0.172
Token
Token ID26391
Feature activation+2.36e+00

Pos logit contributions
vered+0.003
ement+0.003
 symp+0.003
 spe+0.003
 aston+0.003
Neg logit contributions
lic-0.002
oto-0.002
ogo-0.002
ibi-0.002
round-0.002
Token
Token ID447
Feature activation-7.24e+00

Pos logit contributions
lic+0.172
oto+0.166
ibi+0.160
ogo+0.155
EEP+0.154
Neg logit contributions
vered-0.241
ement-0.234
 spe-0.230
 symp-0.227
 aston-0.216
Token
Token ID251
Feature activation-2.32e+00

Pos logit contributions
vered+0.267
ement+0.229
 spe+0.227
 symp+0.207
 aston+0.199
Neg logit contributions
oto-0.989
lic-0.983
EEP-0.950
ogo-0.947
ibi-0.945
Tokenpat
Token ID8071
Feature activation-1.30e+00

Pos logit contributions
vered+0.242
 spe+0.238
ement+0.211
 aston+0.209
 symp+0.206
Neg logit contributions
oto-0.166
lic-0.153
round-0.142
oys-0.140
EEP-0.140
Tokenience
Token ID1240
Feature activation-3.58e+00

Pos logit contributions
vered+0.119
ement+0.115
 spe+0.112
 symp+0.111
 aston+0.106
Neg logit contributions
oto-0.110
lic-0.109
ogo-0.102
ibi-0.101
round-0.098
Token pays
Token ID13831
Feature activation+6.75e-01

Pos logit contributions
vered+0.363
 spe+0.360
ement+0.356
 symp+0.350
 aston+0.342
Neg logit contributions
lic-0.252
oto-0.250
ibi-0.225
ogo-0.222
EEP-0.220
Token off
Token ID572
Feature activation+3.15e+00

Pos logit contributions
lic+0.050
oto+0.049
ogo+0.046
ibi+0.045
round+0.044
Neg logit contributions
vered-0.069
ement-0.065
 symp-0.064
 spe-0.061
 aston-0.060