TOP ACTIVATIONS
MAX = 0.208

stepped into an ant - hill and could not get out
didn 't like being called unint elligent , so he pointed
and books . They see alt ars and offerings and priests
" Error . Data corrupted . System shutting down ."
could go on a stay cation instead . They could have
, that said : B OUN CE ALL YOU WANT !
again and enjoyed her stay cation . She realized that sometimes
wood . One day a trou l be came to the
! AND YOU ARE E ATING ALL THE S UG AR
floor . " Â M ira mi
said : " Ro - bin , car - din -
L ily got m - mad and r - ran away
watched , she realized that    the ground was alive
E ATING ALL THE S UG AR ! THAT IS VERY
are you in a dust bin ?" Joe
In the stream over y onder , it 's very nice
sign that says " T oxic Waste ". "
the store and buy some bottled water . Lily was happy
So they left the dust bin , never to disappear again
him in an old dust bin . Joe was using it

MIN ACTIVATIONS
MIN = -0.238

his mom saw the ash tr ay , she laughed and
mom found an old ash tr ay . She told Lily
Tim saw a small ash tr ay outside the door .
to see the orange ash tr ay . She hugged Jack
his mom put the ash tr ay away , but Little
" It 's an ash tr ay . People put things
show his mom the ash tr ay . At
gave him a small ash tr ay as a gift .
and gave her the ash tr ay . She was very
her mom what an ash tr ay was , and her
carefully picked up the ash tr ay and put it in
wanted to keep the ash tr ay . So he tried
" That 's an ash tr ay , sweet ie .
showed his mom the ash tr ay . She asked him
. Jack put the ash tr ay on the kitchen table
She hid behind an ash tr ay . "
and accidentally dropped her ash tr ay on the ground .
it came across an ash tr ay . It wanted to
very tired . The ash tr ay watched as the little
. It was an ash tr ay , and it was

INTERVAL 0.097 - 0.208
CONTAINS 0.129%

said , Do you need to
said , Oh no ! You
. â Look , a big
with dirt . T ina observed . She watched
He was three - years - old and very curious .

INTERVAL -0.015 - 0.097
CONTAINS 75.570%

that the little girl should always remember to find a safe
was over . Mom my was proud of
him . The funny thing was that the boat was not
home . It was red and sweet . They were so
are twins . They like to play in the snow .

INTERVAL -0.126 - -0.015
CONTAINS 24.270%

was bigger and he could reach it first . He was
It was a big , happy yawn .
are friends . <|endoftext|> L ily and Ben were at the
and the calf was now grown . The cow was happy
â His friend said ,

INTERVAL -0.238 - -0.126
CONTAINS 0.030%

a bite of the water melon , he heard the farmer
You should not touch the oven . It is hot and
stealing honey from the be eh ives without permission . She
. They sne aked behind the truck and climbed
the dough out of the oven and cut it into pieces
Token stepped
Token ID10764
Feature activation-3.76e-02

Pos logit contributions
int+0.024
 extingu+0.023
aj+0.020
ccoli+0.020
ider+0.020
Neg logit contributions
 it-0.018
 the-0.013
 his-0.012
 he-0.012
 fast-0.012
Token into
Token ID656
Feature activation-1.32e-02

Pos logit contributions
int+0.105
 extingu+0.097
oke+0.090
azed+0.089
rots+0.088
Neg logit contributions
 it-0.071
 his-0.050
 the-0.050
 fast-0.049
 everyone-0.045
Token an
Token ID281
Feature activation-4.75e-03

Pos logit contributions
 extingu+0.040
int+0.039
ifying+0.035
azed+0.035
rots+0.034
Neg logit contributions
 it-0.015
 he-0.009
 fast-0.009
 everyone-0.009
 his-0.008
Token ant
Token ID1885
Feature activation-3.88e-02

Pos logit contributions
int+0.012
 extingu+0.011
azed+0.010
oke+0.009
ener+0.009
Neg logit contributions
 it-0.010
 his-0.007
 the-0.007
 fast-0.007
 everyone-0.007
Token-
Token ID12
Feature activation+1.03e-01

Pos logit contributions
int+0.106
 extingu+0.098
azed+0.090
ifying+0.089
rots+0.089
Neg logit contributions
 it-0.071
 his-0.050
 the-0.048
 fast-0.048
 everyone-0.047
Tokenhill
Token ID12639
Feature activation+2.08e-01

Pos logit contributions
 it+0.252
 everyone+0.201
 his+0.199
 fast+0.197
 the+0.196
Neg logit contributions
int-0.204
 extingu-0.183
rots-0.164
ifying-0.163
ives-0.161
Token and
Token ID290
Feature activation-1.54e-02

Pos logit contributions
 it+0.333
 his+0.221
 the+0.213
 everyone+0.211
 fast+0.211
Neg logit contributions
int-0.614
 extingu-0.569
azed-0.530
ider-0.523
oke-0.521
Token could
Token ID714
Feature activation+4.30e-02

Pos logit contributions
int+0.038
 extingu+0.035
rots+0.032
azed+0.032
ener+0.032
Neg logit contributions
 it-0.028
 fast-0.021
 his-0.020
 everyone-0.020
 the-0.019
Token not
Token ID407
Feature activation+6.10e-03

Pos logit contributions
 it+0.062
 his+0.038
 the+0.036
 fast+0.036
 everyone+0.035
Neg logit contributions
int-0.134
 extingu-0.124
azed-0.116
ider-0.114
rots-0.113
Token get
Token ID651
Feature activation+1.18e-03

Pos logit contributions
 it+0.010
 his+0.007
 the+0.007
 fast+0.007
 everyone+0.006
Neg logit contributions
int-0.017
 extingu-0.016
azed-0.015
ener-0.014
oke-0.014
Token out
Token ID503
Feature activation-3.54e-02

Pos logit contributions
 it+0.002
 everyone+0.001
 his+0.001
 fast+0.001
 he+0.001
Neg logit contributions
int-0.003
 extingu-0.003
ishes-0.003
azed-0.003
ener-0.003
Token didn
Token ID1422
Feature activation+3.90e-02

Pos logit contributions
 it+0.055
 his+0.039
 the+0.039
 fast+0.037
 everyone+0.036
Neg logit contributions
int-0.081
 extingu-0.075
ifying-0.069
azed-0.069
ider-0.068
Token't
Token ID470
Feature activation+4.91e-02

Pos logit contributions
 it+0.053
 his+0.033
 everyone+0.032
 the+0.031
 fast+0.030
Neg logit contributions
int-0.113
 extingu-0.109
oke-0.100
ifying-0.100
ives-0.099
Token like
Token ID588
Feature activation+2.30e-04

Pos logit contributions
 it+0.096
 the+0.068
 his+0.067
 fast+0.066
 everyone+0.065
Neg logit contributions
int-0.130
 extingu-0.118
oke-0.107
azed-0.107
ives-0.105
Token being
Token ID852
Feature activation-1.65e-02

Pos logit contributions
 it+0.000
 fast+0.000
 +0.000
 slow+0.000
 his+0.000
Neg logit contributions
 extingu-0.001
oke-0.001
ache-0.001
der-0.001
int-0.001
Token called
Token ID1444
Feature activation-6.11e-03

Pos logit contributions
int+0.042
 extingu+0.038
azed+0.036
oke+0.035
rots+0.035
Neg logit contributions
 it-0.033
 his-0.023
 the-0.023
 fast-0.022
 everyone-0.022
Token unint
Token ID17679
Feature activation+1.78e-01

Pos logit contributions
int+0.018
 extingu+0.017
azed+0.016
oke+0.016
ifying+0.016
Neg logit contributions
 it-0.009
 his-0.006
 fast-0.005
 the-0.005
 everyone-0.005
Tokenelligent
Token ID32940
Feature activation+1.82e-03

Pos logit contributions
 it+0.489
 his+0.392
 the+0.385
 fast+0.384
 everyone+0.383
Neg logit contributions
int-0.329
 extingu-0.285
azed-0.254
rots-0.251
ifying-0.249
Token,
Token ID11
Feature activation+1.05e-02

Pos logit contributions
 it+0.004
 his+0.003
 fast+0.003
 the+0.002
 everyone+0.002
Neg logit contributions
int-0.005
 extingu-0.004
oke-0.004
azed-0.004
rots-0.004
Token so
Token ID523
Feature activation+1.39e-02

Pos logit contributions
 it+0.014
 fast+0.010
 everyone+0.009
 his+0.009
 the+0.009
Neg logit contributions
int-0.028
 extingu-0.026
oke-0.025
der-0.025
ener-0.024
Token he
Token ID339
Feature activation+1.98e-02

Pos logit contributions
 it+0.012
 fast+0.005
 slow+0.004
 everyone+0.004
 the+0.004
Neg logit contributions
int-0.043
oke-0.040
 extingu-0.040
led-0.039
der-0.038
Token pointed
Token ID6235
Feature activation-4.72e-03

Pos logit contributions
 it+0.038
 his+0.027
 the+0.027
 fast+0.026
 everyone+0.026
Neg logit contributions
int-0.053
 extingu-0.048
azed-0.045
oke-0.044
ider-0.044
Token and
Token ID290
Feature activation+7.29e-03

Pos logit contributions
 it+0.037
 the+0.024
 his+0.024
 fast+0.023
 everyone+0.022
Neg logit contributions
int-0.068
 extingu-0.062
azed-0.058
ider-0.057
aj-0.057
Token books
Token ID3835
Feature activation+2.30e-02

Pos logit contributions
 it+0.016
 his+0.012
 the+0.012
 fast+0.011
 everyone+0.011
Neg logit contributions
int-0.018
 extingu-0.016
azed-0.015
ider-0.015
oke-0.015
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
 it+0.036
 his+0.023
 the+0.023
 fast+0.022
 everyone+0.021
Neg logit contributions
int-0.071
 extingu-0.065
azed-0.061
aj-0.060
oke-0.060
Token They
Token ID1119
Feature activation+3.58e-03

Pos logit contributions
 it+0.032
 his+0.022
 the+0.022
 fast+0.022
 everyone+0.021
Neg logit contributions
int-0.049
 extingu-0.046
azed-0.042
ifying-0.042
oke-0.041
Token see
Token ID766
Feature activation+4.12e-02

Pos logit contributions
 it+0.006
 the+0.004
 his+0.004
 fast+0.004
 everyone+0.004
Neg logit contributions
int-0.011
 extingu-0.010
azed-0.009
ider-0.009
rots-0.009
Token alt
Token ID5988
Feature activation+1.76e-01

Pos logit contributions
 it+0.073
 his+0.050
 the+0.050
 fast+0.047
 everyone+0.046
Neg logit contributions
int-0.119
 extingu-0.108
azed-0.101
ider-0.100
oke-0.099
Tokenars
Token ID945
Feature activation-3.78e-02

Pos logit contributions
 it+0.470
 fast+0.380
 the+0.370
 his+0.358
 everyone+0.353
Neg logit contributions
 extingu-0.326
int-0.306
sed-0.277
ener-0.269
azed-0.264
Token and
Token ID290
Feature activation+9.11e-03

Pos logit contributions
int+0.113
 extingu+0.103
azed+0.096
ider+0.095
aj+0.094
Neg logit contributions
 it-0.066
 his-0.045
 the-0.045
 fast-0.042
 everyone-0.040
Token offerings
Token ID18369
Feature activation+3.84e-02

Pos logit contributions
 it+0.019
 his+0.014
 the+0.014
 fast+0.013
 everyone+0.013
Neg logit contributions
int-0.024
 extingu-0.021
azed-0.020
oke-0.019
ider-0.019
Token and
Token ID290
Feature activation+8.02e-03

Pos logit contributions
 it+0.060
 the+0.040
 his+0.038
 fast+0.035
 he+0.034
Neg logit contributions
int-0.123
 extingu-0.113
ider-0.106
azed-0.106
aj-0.105
Token priests
Token ID17345
Feature activation-1.57e-02

Pos logit contributions
 it+0.016
 his+0.012
 the+0.012
 fast+0.012
 everyone+0.011
Neg logit contributions
int-0.021
 extingu-0.019
azed-0.018
ider-0.017
oke-0.017
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
 it+0.016
 his+0.011
 everyone+0.011
 he+0.010
 fast+0.010
Neg logit contributions
int-0.029
 extingu-0.027
aj-0.027
oke-0.026
rots-0.026
Token"
Token ID1
Feature activation-1.94e-02

Pos logit contributions
 it+0.022
 his+0.016
 the+0.015
 everyone+0.015
 fast+0.015
Neg logit contributions
int-0.030
 extingu-0.027
azed-0.025
oke-0.025
aj-0.025
TokenError
Token ID12331
Feature activation+2.68e-02

Pos logit contributions
int+0.051
 extingu+0.046
azed+0.043
urger+0.043
aj+0.043
Neg logit contributions
 it-0.040
 the-0.028
 his-0.028
 fast-0.028
 everyone-0.027
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
 it+0.047
 the+0.032
 his+0.031
 fast+0.028
 he+0.027
Neg logit contributions
int-0.078
 extingu-0.071
istered-0.068
ishes-0.067
ifying-0.067
Token Data
Token ID6060
Feature activation+3.24e-02

Pos logit contributions
 it+0.025
 his+0.018
 fast+0.018
 the+0.018
 everyone+0.017
Neg logit contributions
int-0.029
 extingu-0.027
azed-0.024
oke-0.024
rots-0.024
Token corrupted
Token ID26940
Feature activation+1.72e-01

Pos logit contributions
 it+0.086
 the+0.070
 his+0.070
 fast+0.069
 everyone+0.069
Neg logit contributions
int-0.056
 extingu-0.050
oke-0.046
ifying-0.046
ives-0.046
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 it+0.275
 his+0.178
 the+0.176
 fast+0.168
 everyone+0.163
Neg logit contributions
int-0.522
 extingu-0.480
azed-0.449
ifying-0.442
ider-0.441
Token System
Token ID4482
Feature activation-2.54e-02

Pos logit contributions
 it+0.028
 his+0.021
 the+0.020
 fast+0.020
 everyone+0.019
Neg logit contributions
int-0.032
 extingu-0.030
azed-0.027
oke-0.027
ifying-0.027
Token shutting
Token ID25136
Feature activation-1.12e-02

Pos logit contributions
int+0.061
 extingu+0.054
azed+0.051
rots+0.048
ishes+0.048
Neg logit contributions
 it-0.057
 fast-0.043
 the-0.043
 his-0.042
 everyone-0.040
Token down
Token ID866
Feature activation+1.83e-04

Pos logit contributions
int+0.032
 extingu+0.030
ifying+0.028
azed+0.028
oke+0.028
Neg logit contributions
 it-0.017
 his-0.011
 fast-0.011
 everyone-0.010
 he-0.010
Token."
Token ID526
Feature activation-2.97e-02

Pos logit contributions
 it+0.000
 his+0.000
 the+0.000
 fast+0.000
 everyone+0.000
Neg logit contributions
int-0.001
 extingu-0.000
azed-0.000
ider-0.000
rots-0.000
Token could
Token ID714
Feature activation+4.06e-02

Pos logit contributions
int+0.052
 extingu+0.048
azed+0.043
aj+0.043
rots+0.043
Neg logit contributions
 it-0.036
 the-0.025
 his-0.025
 fast-0.024
 everyone-0.024
Token go
Token ID467
Feature activation+2.28e-02

Pos logit contributions
 it+0.071
 his+0.050
 fast+0.049
 the+0.048
 everyone+0.047
Neg logit contributions
int-0.105
 extingu-0.100
ifying-0.089
ider-0.088
 department-0.088
Token on
Token ID319
Feature activation+4.69e-02

Pos logit contributions
 it+0.041
 his+0.028
 the+0.027
 everyone+0.027
 fast+0.027
Neg logit contributions
int-0.062
 extingu-0.057
azed-0.052
ider-0.052
aj-0.052
Token a
Token ID257
Feature activation-5.12e-03

Pos logit contributions
 it+0.062
 he+0.038
 fast+0.037
 everyone+0.037
 his+0.037
Neg logit contributions
 extingu-0.129
int-0.128
aj-0.119
ifying-0.119
uddled-0.113
Token stay
Token ID2652
Feature activation+7.54e-05

Pos logit contributions
int+0.011
 extingu+0.010
aj+0.009
ifying+0.009
urger+0.009
Neg logit contributions
 it-0.011
 his-0.008
 everyone-0.008
 the-0.008
 Tim-0.008
Tokencation
Token ID30907
Feature activation+1.71e-01

Pos logit contributions
 it+0.000
 his+0.000
 the+0.000
 fast+0.000
 everyone+0.000
Neg logit contributions
int-0.000
 extingu-0.000
oke-0.000
azed-0.000
rots-0.000
Token instead
Token ID2427
Feature activation-9.85e-03

Pos logit contributions
 it+0.281
 fast+0.185
 his+0.182
 he+0.175
 everyone+0.169
Neg logit contributions
int-0.459
 extingu-0.428
azed-0.396
ider-0.390
ifying-0.389
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
int+0.026
 extingu+0.024
azed+0.022
ider+0.022
ifying+0.022
Neg logit contributions
 it-0.018
 his-0.013
 the-0.013
 fast-0.013
 everyone-0.012
Token They
Token ID1119
Feature activation-2.27e-02

Pos logit contributions
int+0.046
 extingu+0.043
aj+0.041
oke+0.040
azed+0.040
Neg logit contributions
 it-0.022
 his-0.015
 fast-0.014
 the-0.013
 everyone-0.013
Token could
Token ID714
Feature activation+4.30e-02

Pos logit contributions
int+0.057
 extingu+0.051
azed+0.047
ifying+0.046
oke+0.046
Neg logit contributions
 it-0.047
 his-0.034
 fast-0.033
 the-0.033
 everyone-0.032
Token have
Token ID423
Feature activation+3.00e-03

Pos logit contributions
 it+0.073
 his+0.048
 fast+0.048
 everyone+0.045
 the+0.045
Neg logit contributions
int-0.111
 extingu-0.107
aj-0.094
ifying-0.094
 department-0.093
Token,
Token ID11
Feature activation+1.15e-03

Pos logit contributions
 it+0.015
 his+0.010
 the+0.010
 fast+0.009
 everyone+0.009
Neg logit contributions
int-0.028
 extingu-0.026
azed-0.024
ider-0.024
oke-0.024
Token that
Token ID326
Feature activation-5.43e-03

Pos logit contributions
 it+0.001
 his+0.001
 fast+0.001
 everyone+0.001
 the+0.001
Neg logit contributions
int-0.004
 extingu-0.003
ifying-0.003
azed-0.003
oke-0.003
Token said
Token ID531
Feature activation-3.39e-02

Pos logit contributions
int+0.015
 extingu+0.014
oke+0.013
azed+0.013
ccoli+0.013
Neg logit contributions
 it-0.007
 fast-0.005
 his-0.005
 the-0.004
 Tim-0.004
Token:
Token ID25
Feature activation-4.43e-02

Pos logit contributions
int+0.104
 extingu+0.097
oke+0.093
ider+0.092
azed+0.092
Neg logit contributions
 it-0.040
 fast-0.024
 his-0.024
 the-0.021
 everyone-0.021
Token B
Token ID347
Feature activation+5.33e-02

Pos logit contributions
int+0.114
 extingu+0.106
oke+0.101
ccoli+0.100
ifying+0.099
Neg logit contributions
 it-0.076
 his-0.054
 fast-0.054
 everyone-0.052
 the-0.052
TokenOUN
Token ID19385
Feature activation+1.70e-01

Pos logit contributions
 it+0.145
 his+0.115
 the+0.114
 fast+0.114
 everyone+0.111
Neg logit contributions
int-0.097
 extingu-0.088
azed-0.076
ifying-0.075
rots-0.075
TokenCE
Token ID5222
Feature activation+1.36e-02

Pos logit contributions
 it+0.501
 his+0.410
 he+0.408
 everyone+0.406
 the+0.406
Neg logit contributions
int-0.278
 extingu-0.227
ider-0.206
azed-0.205
rots-0.205
Token ALL
Token ID11096
Feature activation+1.33e-01

Pos logit contributions
 it+0.030
 his+0.023
 the+0.022
 fast+0.022
 everyone+0.022
Neg logit contributions
int-0.032
 extingu-0.030
ishes-0.028
ifying-0.027
ccoli-0.027
Token YOU
Token ID7013
Feature activation+9.77e-03

Pos logit contributions
 it+0.264
 his+0.192
 fast+0.188
 the+0.187
 everyone+0.182
Neg logit contributions
int-0.336
 extingu-0.306
azed-0.283
ifying-0.282
oke-0.281
Token WANT
Token ID41300
Feature activation+8.85e-02

Pos logit contributions
 it+0.023
 his+0.018
 the+0.018
 fast+0.017
 everyone+0.016
Neg logit contributions
int-0.022
 extingu-0.020
azed-0.018
rots-0.018
aj-0.018
Token!
Token ID0
Feature activation-2.84e-03

Pos logit contributions
 it+0.214
 his+0.166
 the+0.163
 fast+0.161
 everyone+0.159
Neg logit contributions
int-0.191
 extingu-0.169
azed-0.154
ifying-0.151
rots-0.151
Token again
Token ID757
Feature activation+1.87e-02

Pos logit contributions
int+0.037
 extingu+0.035
azed+0.032
ider+0.032
der+0.032
Neg logit contributions
 it-0.022
 fast-0.016
 his-0.015
 everyone-0.014
 slow-0.014
Token and
Token ID290
Feature activation-1.75e-02

Pos logit contributions
 it+0.031
 his+0.021
 the+0.020
 fast+0.020
 everyone+0.019
Neg logit contributions
int-0.056
 extingu-0.051
azed-0.048
rots-0.047
oke-0.047
Token enjoyed
Token ID8359
Feature activation+2.34e-02

Pos logit contributions
int+0.042
 extingu+0.037
uddled+0.037
azed+0.036
led+0.036
Neg logit contributions
 it-0.031
 fast-0.023
 his-0.022
 slow-0.021
 everyone-0.020
Token her
Token ID607
Feature activation+7.60e-04

Pos logit contributions
 it+0.027
 fast+0.017
 slow+0.016
 he+0.015
 everyone+0.014
Neg logit contributions
 extingu-0.062
int-0.059
aj-0.056
uddled-0.055
ifying-0.055
Token stay
Token ID2652
Feature activation-4.93e-03

Pos logit contributions
 it+0.001
 his+0.001
 the+0.001
 fast+0.001
 he+0.001
Neg logit contributions
int-0.002
 extingu-0.002
oke-0.002
rots-0.002
azed-0.002
Tokencation
Token ID30907
Feature activation+1.67e-01

Pos logit contributions
int+0.014
 extingu+0.014
oke+0.013
ives+0.013
rots+0.013
Neg logit contributions
 it-0.008
 his-0.006
 the-0.006
 fast-0.006
 everyone-0.005
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
 it+0.265
 fast+0.172
 Tim+0.165
 his+0.164
 he+0.164
Neg logit contributions
int-0.445
 extingu-0.413
azed-0.385
ider-0.384
rots-0.381
Token She
Token ID1375
Feature activation-2.81e-02

Pos logit contributions
int+0.024
ives+0.022
ifying+0.022
 extingu+0.022
azed+0.021
Neg logit contributions
 it-0.020
 his-0.016
 fast-0.015
 slow-0.014
 the-0.014
Token realized
Token ID6939
Feature activation+3.86e-02

Pos logit contributions
int+0.072
 extingu+0.064
azed+0.061
aj+0.060
oke+0.059
Neg logit contributions
 it-0.054
 his-0.038
 fast-0.037
 the-0.037
 everyone-0.036
Token that
Token ID326
Feature activation-8.35e-03

Pos logit contributions
 it+0.063
 fast+0.051
 slow+0.047
 his+0.045
 +0.043
Neg logit contributions
int-0.092
 extingu-0.086
der-0.082
uddled-0.082
rots-0.080
Token sometimes
Token ID3360
Feature activation-2.23e-03

Pos logit contributions
 extingu+0.020
int+0.019
 chicks+0.019
ccoli+0.019
der+0.019
Neg logit contributions
 it-0.009
 fast-0.008
 his-0.006
 slow-0.006
 -0.006
Token wood
Token ID4898
Feature activation+1.97e-02

Pos logit contributions
 it+0.011
 fast+0.008
 his+0.008
 everyone+0.008
 he+0.008
Neg logit contributions
 extingu-0.011
int-0.011
ishes-0.009
ifying-0.009
rots-0.009
Token.
Token ID13
Feature activation-3.25e-03

Pos logit contributions
 it+0.033
 his+0.022
 the+0.022
 fast+0.021
 everyone+0.020
Neg logit contributions
int-0.058
 extingu-0.053
azed-0.049
oke-0.049
ider-0.049
Token One
Token ID1881
Feature activation-5.89e-03

Pos logit contributions
int+0.009
 extingu+0.008
oke+0.008
ifying+0.008
azed+0.008
Neg logit contributions
 it-0.005
 fast-0.004
 his-0.004
 everyone-0.003
 the-0.003
Token day
Token ID1110
Feature activation-2.10e-02

Pos logit contributions
int+0.018
 extingu+0.016
azed+0.015
oke+0.015
ifying+0.015
Neg logit contributions
 it-0.009
 his-0.005
 fast-0.005
 everyone-0.005
 the-0.005
Token a
Token ID257
Feature activation+1.24e-03

Pos logit contributions
int+0.060
 extingu+0.055
ifying+0.051
azed+0.051
ider+0.051
Neg logit contributions
 it-0.029
 fast-0.020
 his-0.019
 everyone-0.018
 slow-0.017
Token trou
Token ID4057
Feature activation+1.63e-01

Pos logit contributions
 it+0.003
 his+0.002
 the+0.002
 fast+0.002
 everyone+0.002
Neg logit contributions
int-0.003
 extingu-0.003
azed-0.003
rots-0.003
ifying-0.003
Tokenl
Token ID75
Feature activation+7.65e-02

Pos logit contributions
 it+0.391
 his+0.326
 everyone+0.312
 the+0.307
 Tim+0.306
Neg logit contributions
int-0.313
 extingu-0.260
oke-0.259
rots-0.254
ider-0.249
Tokenbe
Token ID1350
Feature activation+5.83e-02

Pos logit contributions
 it+0.183
 the+0.150
 he+0.141
 his+0.141
 everyone+0.137
Neg logit contributions
aj-0.156
 extingu-0.155
istered-0.153
int-0.153
ccoli-0.149
Token came
Token ID1625
Feature activation-3.08e-02

Pos logit contributions
 it+0.149
 fast+0.117
 his+0.117
 the+0.116
 everyone+0.114
Neg logit contributions
int-0.117
 extingu-0.093
azed-0.090
oke-0.089
led-0.085
Token to
Token ID284
Feature activation+1.03e-02

Pos logit contributions
int+0.093
 extingu+0.086
azed+0.080
oke+0.079
ider+0.079
Neg logit contributions
 it-0.049
 his-0.032
 the-0.031
 fast-0.030
 everyone-0.029
Token the
Token ID262
Feature activation+7.05e-03

Pos logit contributions
 it+0.010
 fast+0.005
 his+0.004
 he+0.004
 everyone+0.004
Neg logit contributions
 extingu-0.033
int-0.033
ishes-0.029
ifying-0.029
 department-0.029
Token!
Token ID0
Feature activation+2.71e-02

Pos logit contributions
 it+0.199
 the+0.150
 his+0.145
 fast+0.143
 everyone+0.140
Neg logit contributions
int-0.208
 extingu-0.189
azed-0.173
ider-0.172
ifying-0.168
Token AND
Token ID5357
Feature activation+8.69e-02

Pos logit contributions
 it+0.049
 his+0.036
 the+0.034
 fast+0.033
 everyone+0.032
Neg logit contributions
int-0.074
 extingu-0.068
azed-0.063
oke-0.063
rots-0.063
Token YOU
Token ID7013
Feature activation+3.09e-02

Pos logit contributions
 it+0.166
 his+0.142
 he+0.126
 fast+0.121
 the+0.117
Neg logit contributions
int-0.210
 extingu-0.187
oke-0.186
rots-0.179
omed-0.178
Token ARE
Token ID15986
Feature activation-1.52e-02

Pos logit contributions
 it+0.075
 the+0.058
 fast+0.056
 his+0.056
 everyone+0.054
Neg logit contributions
int-0.068
 extingu-0.062
azed-0.056
ider-0.056
aj-0.055
Token E
Token ID412
Feature activation+8.92e-02

Pos logit contributions
int+0.038
 extingu+0.034
ifying+0.032
azed+0.032
oke+0.032
Neg logit contributions
 it-0.029
 his-0.022
 he-0.021
 fast-0.020
 the-0.019
TokenATING
Token ID33881
Feature activation+1.62e-01

Pos logit contributions
 it+0.227
 the+0.176
 his+0.172
 fast+0.171
 he+0.166
Neg logit contributions
int-0.181
 extingu-0.170
ifying-0.150
azed-0.150
 department-0.147
Token ALL
Token ID11096
Feature activation+1.40e-01

Pos logit contributions
 it+0.400
 the+0.308
 his+0.308
 fast+0.303
 everyone+0.297
Neg logit contributions
int-0.348
 extingu-0.313
azed-0.281
oke-0.275
ifying-0.273
Token THE
Token ID3336
Feature activation+7.20e-02

Pos logit contributions
 it+0.243
 his+0.182
 fast+0.174
 he+0.171
 the+0.162
Neg logit contributions
int-0.371
 extingu-0.338
ifying-0.321
oke-0.320
azed-0.319
Token S
Token ID311
Feature activation+8.09e-03

Pos logit contributions
 it+0.134
 his+0.093
 the+0.092
 fast+0.090
 everyone+0.087
Neg logit contributions
int-0.198
 extingu-0.181
azed-0.168
oke-0.165
ifying-0.165
TokenUG
Token ID7340
Feature activation+1.57e-01

Pos logit contributions
 it+0.021
 the+0.017
 his+0.016
 everyone+0.016
 fast+0.016
Neg logit contributions
 extingu-0.015
int-0.013
ifying-0.013
istered-0.013
 department-0.012
TokenAR
Token ID1503
Feature activation-5.15e-02

Pos logit contributions
 it+0.306
 his+0.218
 the+0.216
 fast+0.211
 everyone+0.204
Neg logit contributions
int-0.416
 extingu-0.380
azed-0.350
oke-0.343
ider-0.343
Token floor
Token ID4314
Feature activation+1.83e-02

Pos logit contributions
 it+0.032
 his+0.024
 everyone+0.024
 Tim+0.023
 he+0.023
Neg logit contributions
int-0.026
 extingu-0.026
ifying-0.022
ider-0.022
ener-0.022
Token.
Token ID13
Feature activation+6.78e-03

Pos logit contributions
 it+0.030
 his+0.020
 everyone+0.019
 the+0.019
 fast+0.018
Neg logit contributions
int-0.052
 extingu-0.048
ider-0.046
azed-0.046
ifying-0.044
Token
Token ID198
Feature activation+1.45e-02

Pos logit contributions
 it+0.010
 fast+0.006
 everyone+0.006
 his+0.006
 the+0.006
Neg logit contributions
int-0.020
 extingu-0.018
oke-0.017
azed-0.017
aj-0.017
Token
Token ID198
Feature activation+1.38e-02

Pos logit contributions
 it+0.024
 everyone+0.017
 his+0.017
 fast+0.016
 the+0.016
Neg logit contributions
int-0.035
oke-0.032
 extingu-0.030
aj-0.030
azed-0.029
Token"
Token ID1
Feature activation-2.05e-02

Pos logit contributions
 it+0.028
 his+0.020
 the+0.019
 everyone+0.019
 fast+0.019
Neg logit contributions
int-0.035
 extingu-0.031
ider-0.030
azed-0.029
aj-0.029
TokenÂ
Token ID5523
Feature activation+1.62e-01

Pos logit contributions
int+0.037
 extingu+0.035
ifying+0.034
ider+0.034
rots+0.032
Neg logit contributions
 it-0.047
 his-0.038
 the-0.036
 Tim-0.036
 everyone-0.034
Token
Token ID126
Feature activation+5.79e-02

Pos logit contributions
 he+0.048
 she+0.034
whe+0.032
 Tim+0.032
 it+0.031
Neg logit contributions
rots-0.428
ocado-0.426
ray-0.423
 envelope-0.423
 chick-0.415
Token
Token ID94
Feature activation+8.41e-02

Pos logit contributions
 it+0.155
 his+0.133
 he+0.130
 everyone+0.127
 the+0.125
Neg logit contributions
int-0.103
oke-0.081
 extingu-0.080
ives-0.075
ccoli-0.075
TokenM
Token ID44
Feature activation-4.80e-02

Pos logit contributions
 it+0.200
 his+0.156
 the+0.153
 fast+0.148
 everyone+0.148
Neg logit contributions
int-0.188
 extingu-0.162
ider-0.152
ifying-0.151
oke-0.151
Tokenira
Token ID8704
Feature activation+2.36e-02

Pos logit contributions
int+0.079
 extingu+0.077
ifying+0.065
ishes+0.062
ccoli+0.062
Neg logit contributions
 it-0.136
 his-0.109
 the-0.108
 fast-0.107
 he-0.106
Token mi
Token ID21504
Feature activation+1.05e-01

Pos logit contributions
 it+0.040
 the+0.026
 his+0.026
 fast+0.025
 everyone+0.024
Neg logit contributions
int-0.069
 extingu-0.065
azed-0.059
ifying-0.059
aj-0.059
Token said
Token ID531
Feature activation-1.98e-02

Pos logit contributions
int+0.058
 extingu+0.052
azed+0.049
rots+0.047
oke+0.047
Neg logit contributions
 it-0.046
 his-0.033
 fast-0.033
 the-0.032
 everyone-0.032
Token:
Token ID25
Feature activation-2.89e-02

Pos logit contributions
int+0.048
 extingu+0.045
azed+0.040
ifying+0.040
aj+0.040
Neg logit contributions
 it-0.045
 the-0.033
 his-0.033
 fast-0.032
 everyone-0.032
Token "
Token ID366
Feature activation-1.38e-02

Pos logit contributions
int+0.074
 extingu+0.067
ccoli+0.064
rots+0.064
oke+0.064
Neg logit contributions
 it-0.049
 his-0.036
 fast-0.034
 everyone-0.034
 the-0.033
TokenRo
Token ID15450
Feature activation+1.63e-02

Pos logit contributions
int+0.035
 extingu+0.031
azed+0.028
rots+0.028
ifying+0.028
Neg logit contributions
 it-0.029
 his-0.022
 the-0.021
 fast-0.021
 everyone-0.020
Token-
Token ID12
Feature activation+1.08e-01

Pos logit contributions
 it+0.045
 his+0.036
 the+0.036
 fast+0.035
 everyone+0.035
Neg logit contributions
int-0.030
 extingu-0.026
azed-0.023
ifying-0.023
oke-0.023
Tokenbin
Token ID8800
Feature activation+1.62e-01

Pos logit contributions
 it+0.251
 his+0.199
 the+0.189
 everyone+0.186
 fast+0.185
Neg logit contributions
int-0.231
 extingu-0.198
ives-0.194
rots-0.194
oke-0.187
Token,
Token ID11
Feature activation+2.13e-02

Pos logit contributions
 it+0.261
 his+0.179
 fast+0.169
 everyone+0.167
 the+0.167
Neg logit contributions
int-0.460
 extingu-0.418
oke-0.401
azed-0.397
ider-0.393
Token car
Token ID1097
Feature activation+2.86e-02

Pos logit contributions
 it+0.042
 his+0.032
 fast+0.031
 the+0.029
 everyone+0.029
Neg logit contributions
int-0.051
 extingu-0.048
oke-0.045
rots-0.044
azed-0.044
Token-
Token ID12
Feature activation+1.08e-01

Pos logit contributions
 it+0.049
 his+0.034
 the+0.032
 fast+0.032
 everyone+0.032
Neg logit contributions
int-0.079
 extingu-0.073
rots-0.068
oke-0.068
azed-0.068
Tokendin
Token ID25194
Feature activation+5.68e-02

Pos logit contributions
 it+0.236
 his+0.189
 fast+0.177
 the+0.175
 everyone+0.175
Neg logit contributions
int-0.232
 extingu-0.208
rots-0.204
ives-0.197
ifying-0.192
Token-
Token ID12
Feature activation+1.08e-01

Pos logit contributions
 it+0.143
 his+0.114
 the+0.113
 fast+0.111
 everyone+0.108
Neg logit contributions
int-0.105
 extingu-0.097
ifying-0.090
uddled-0.090
azed-0.089
TokenL
Token ID43
Feature activation+4.32e-02

Pos logit contributions
 it+0.255
 his+0.200
 the+0.196
 fast+0.195
 everyone+0.192
Neg logit contributions
int-0.218
 extingu-0.191
azed-0.172
ifying-0.171
oke-0.170
Tokenily
Token ID813
Feature activation-2.55e-02

Pos logit contributions
 it+0.107
 the+0.084
 his+0.083
 everyone+0.081
 fast+0.080
Neg logit contributions
int-0.086
 extingu-0.082
ifying-0.071
ccoli-0.071
rots-0.071
Token got
Token ID1392
Feature activation+4.95e-04

Pos logit contributions
int+0.074
 extingu+0.069
ider+0.063
ifying+0.063
azed+0.062
Neg logit contributions
 it-0.045
 the-0.031
 his-0.030
 fast-0.028
 he-0.028
Token m
Token ID285
Feature activation-4.33e-03

Pos logit contributions
 it+0.001
 his+0.001
 the+0.001
 fast+0.001
 everyone+0.001
Neg logit contributions
int-0.001
 extingu-0.001
azed-0.001
ifying-0.001
oke-0.001
Token-
Token ID12
Feature activation+1.09e-01

Pos logit contributions
int+0.007
 extingu+0.007
ccoli+0.006
rots+0.006
ifying+0.006
Neg logit contributions
 it-0.012
 his-0.010
 the-0.009
 fast-0.009
 he-0.009
Tokenmad
Token ID9937
Feature activation+1.60e-01

Pos logit contributions
 it+0.284
 his+0.224
 the+0.221
 fast+0.218
 everyone+0.215
Neg logit contributions
int-0.217
 extingu-0.191
azed-0.171
ider-0.167
oke-0.167
Token and
Token ID290
Feature activation-9.25e-04

Pos logit contributions
 it+0.262
 his+0.171
 the+0.171
 fast+0.159
 he+0.152
Neg logit contributions
int-0.487
 extingu-0.442
azed-0.419
ifying-0.417
rots-0.416
Token r
Token ID374
Feature activation-2.57e-02

Pos logit contributions
int+0.002
 extingu+0.002
aj+0.002
azed+0.002
rots+0.002
Neg logit contributions
 it-0.002
 his-0.001
 fast-0.001
 the-0.001
 everyone-0.001
Token-
Token ID12
Feature activation+1.07e-01

Pos logit contributions
int+0.040
 extingu+0.035
ifying+0.030
rots+0.030
azed+0.030
Neg logit contributions
 it-0.076
 his-0.063
 the-0.062
 fast-0.062
 everyone-0.061
Tokenran
Token ID2596
Feature activation-1.02e-02

Pos logit contributions
 it+0.279
 his+0.223
 the+0.218
 fast+0.215
 everyone+0.214
Neg logit contributions
int-0.208
 extingu-0.179
azed-0.159
ider-0.159
ifying-0.159
Token away
Token ID1497
Feature activation+1.99e-02

Pos logit contributions
int+0.030
 extingu+0.027
azed+0.025
ener+0.025
ifying+0.025
Neg logit contributions
 it-0.015
 his-0.010
 fast-0.010
 he-0.010
 everyone-0.009
Token watched
Token ID7342
Feature activation-7.27e-03

Pos logit contributions
 it+0.029
 the+0.020
 his+0.020
 fast+0.019
 he+0.018
Neg logit contributions
int-0.041
 extingu-0.039
oke-0.036
ifying-0.036
rots-0.036
Token,
Token ID11
Feature activation+8.08e-03

Pos logit contributions
int+0.025
 extingu+0.024
azed+0.022
ener+0.022
ifying+0.022
Neg logit contributions
 it-0.005
 fast-0.002
 his-0.002
 he-0.001
 everyone-0.001
Token she
Token ID673
Feature activation+1.24e-02

Pos logit contributions
 it+0.007
 his+0.004
 fast+0.004
 slow+0.003
 Tim+0.003
Neg logit contributions
int-0.025
 extingu-0.024
azed-0.023
ifying-0.023
rots-0.023
Token realized
Token ID6939
Feature activation+4.14e-02

Pos logit contributions
 it+0.026
 the+0.019
 his+0.019
 fast+0.018
 everyone+0.018
Neg logit contributions
int-0.032
 extingu-0.031
ifying-0.028
oke-0.027
ider-0.027
Token that
Token ID326
Feature activation-6.43e-03

Pos logit contributions
 it+0.037
 fast+0.031
 slow+0.027
 his+0.024
 +0.023
Neg logit contributions
int-0.116
 extingu-0.112
der-0.108
rots-0.108
azed-0.105
TokenÂ
Token ID5523
Feature activation+1.58e-01

Pos logit contributions
 extingu+0.019
int+0.019
 chicks+0.018
 chick+0.018
azed+0.018
Neg logit contributions
 it-0.004
 fast-0.002
 his-0.002
 slow-0.001
 Tim-0.001
Token 
Token ID1849
Feature activation+8.13e-02

Pos logit contributions
 it+0.010
 fast-0.004
 he-0.007
 Tim-0.013
 V-0.021
Neg logit contributions
int-0.525
 chick-0.525
 extingu-0.518
 chicks-0.504
ifying-0.501
Tokenthe
Token ID1169
Feature activation+3.80e-02

Pos logit contributions
 it+0.144
 his+0.101
 fast+0.098
 the+0.097
 he+0.094
Neg logit contributions
int-0.229
 extingu-0.202
azed-0.195
oke-0.191
rots-0.189
Token ground
Token ID2323
Feature activation-2.26e-02

Pos logit contributions
 it+0.072
 the+0.049
 fast+0.049
 everyone+0.045
 slow+0.043
Neg logit contributions
int-0.109
 extingu-0.105
oke-0.099
ives-0.097
ifying-0.097
Token was
Token ID373
Feature activation-7.92e-03

Pos logit contributions
int+0.069
 extingu+0.063
azed+0.059
ider+0.058
aj+0.058
Neg logit contributions
 it-0.037
 the-0.024
 his-0.023
 fast-0.022
 everyone-0.022
Token alive
Token ID6776
Feature activation-2.38e-02

Pos logit contributions
int+0.023
 extingu+0.021
azed+0.020
oke+0.019
rots+0.019
Neg logit contributions
 it-0.014
 his-0.009
 the-0.009
 fast-0.009
 everyone-0.009
Token E
Token ID412
Feature activation+8.92e-02

Pos logit contributions
int+0.038
 extingu+0.034
ifying+0.032
azed+0.032
oke+0.032
Neg logit contributions
 it-0.029
 his-0.022
 he-0.021
 fast-0.020
 the-0.019
TokenATING
Token ID33881
Feature activation+1.62e-01

Pos logit contributions
 it+0.227
 the+0.176
 his+0.172
 fast+0.171
 he+0.166
Neg logit contributions
int-0.181
 extingu-0.170
ifying-0.150
azed-0.150
 department-0.147
Token ALL
Token ID11096
Feature activation+1.40e-01

Pos logit contributions
 it+0.400
 the+0.308
 his+0.308
 fast+0.303
 everyone+0.297
Neg logit contributions
int-0.348
 extingu-0.313
azed-0.281
oke-0.275
ifying-0.273
Token THE
Token ID3336
Feature activation+7.20e-02

Pos logit contributions
 it+0.243
 his+0.182
 fast+0.174
 he+0.171
 the+0.162
Neg logit contributions
int-0.371
 extingu-0.338
ifying-0.321
oke-0.320
azed-0.319
Token S
Token ID311
Feature activation+8.09e-03

Pos logit contributions
 it+0.134
 his+0.093
 the+0.092
 fast+0.090
 everyone+0.087
Neg logit contributions
int-0.198
 extingu-0.181
azed-0.168
oke-0.165
ifying-0.165
TokenUG
Token ID7340
Feature activation+1.57e-01

Pos logit contributions
 it+0.021
 the+0.017
 his+0.016
 everyone+0.016
 fast+0.016
Neg logit contributions
 extingu-0.015
int-0.013
ifying-0.013
istered-0.013
 department-0.012
TokenAR
Token ID1503
Feature activation-5.15e-02

Pos logit contributions
 it+0.306
 his+0.218
 the+0.216
 fast+0.211
 everyone+0.204
Neg logit contributions
int-0.416
 extingu-0.380
azed-0.350
oke-0.343
ider-0.343
Token!
Token ID0
Feature activation+2.45e-02

Pos logit contributions
int+0.155
 extingu+0.145
ishes+0.132
aj+0.131
azed+0.130
Neg logit contributions
 it-0.086
 the-0.057
 fast-0.056
 his-0.053
 everyone-0.052
Token THAT
Token ID14603
Feature activation+2.55e-02

Pos logit contributions
 it+0.048
 his+0.038
 fast+0.035
 he+0.035
 the+0.034
Neg logit contributions
int-0.062
 extingu-0.057
oke-0.053
ifying-0.053
azed-0.052
Token IS
Token ID3180
Feature activation-6.69e-03

Pos logit contributions
 it+0.058
 his+0.044
 the+0.043
 fast+0.042
 everyone+0.041
Neg logit contributions
int-0.059
 extingu-0.054
azed-0.049
oke-0.048
rots-0.048
Token VERY
Token ID29550
Feature activation+6.94e-02

Pos logit contributions
int+0.017
 extingu+0.015
azed+0.014
oke+0.013
ener+0.013
Neg logit contributions
 it-0.014
 the-0.011
 fast-0.011
 his-0.011
 everyone-0.010
Token are
Token ID389
Feature activation-2.19e-02

Pos logit contributions
int+0.034
 extingu+0.030
oke+0.029
azed+0.028
rots+0.028
Neg logit contributions
 it-0.036
 fast-0.028
 his-0.027
 everyone-0.026
 the-0.026
Token you
Token ID345
Feature activation-3.41e-02

Pos logit contributions
int+0.060
 extingu+0.055
ifying+0.055
rots+0.054
azed+0.053
Neg logit contributions
 it-0.030
 fast-0.020
 -0.018
 his-0.018
 slow-0.017
Token in
Token ID287
Feature activation-1.28e-03

Pos logit contributions
int+0.097
 extingu+0.089
azed+0.082
rots+0.081
ider+0.081
Neg logit contributions
 it-0.062
 the-0.042
 his-0.042
 fast-0.041
 everyone-0.039
Token a
Token ID257
Feature activation-1.03e-03

Pos logit contributions
int+0.004
 extingu+0.004
azed+0.004
ifying+0.004
rots+0.004
Neg logit contributions
 it-0.001
 his-0.001
 fast-0.001
 everyone-0.001
 the-0.001
Token dust
Token ID8977
Feature activation+2.82e-02

Pos logit contributions
int+0.003
 extingu+0.002
azed+0.002
ifying+0.002
ider+0.002
Neg logit contributions
 it-0.002
 his-0.002
 the-0.002
 everyone-0.001
 fast-0.001
Tokenbin
Token ID8800
Feature activation+1.56e-01

Pos logit contributions
 it+0.056
 the+0.041
 his+0.040
 fast+0.038
 he+0.037
Neg logit contributions
int-0.077
 extingu-0.069
azed-0.065
oke-0.064
rots-0.064
Token?"
Token ID1701
Feature activation-3.53e-02

Pos logit contributions
 it+0.314
 the+0.226
 his+0.224
 fast+0.216
 he+0.209
Neg logit contributions
int-0.410
 extingu-0.391
azed-0.346
rots-0.344
ider-0.342
Token
Token ID220
Feature activation+2.90e-02

Pos logit contributions
int+0.097
 extingu+0.086
ray+0.077
aj+0.077
omed+0.077
Neg logit contributions
 it-0.050
 fast-0.041
 slow-0.033
 everyone-0.032
 his-0.028
Token
Token ID198
Feature activation+2.03e-03

Pos logit contributions
 it+0.042
 his+0.027
 fast+0.027
 the+0.025
 everyone+0.025
Neg logit contributions
int-0.087
 extingu-0.078
oke-0.076
azed-0.075
aj-0.075
Token
Token ID198
Feature activation+2.34e-03

Pos logit contributions
 it+0.004
 everyone+0.003
 fast+0.003
 his+0.003
 slow+0.003
Neg logit contributions
int-0.004
omed-0.004
aj-0.004
oke-0.004
apa-0.003
TokenJoe
Token ID19585
Feature activation+1.86e-02

Pos logit contributions
 it+0.005
 everyone+0.004
 the+0.004
 his+0.004
 fast+0.004
Neg logit contributions
int-0.005
 extingu-0.004
ider-0.004
omed-0.004
aj-0.004
TokenIn
Token ID818
Feature activation-2.67e-02

Pos logit contributions
int+0.047
 extingu+0.043
azed+0.040
oke+0.039
ifying+0.039
Neg logit contributions
 it-0.035
 his-0.025
 the-0.025
 fast-0.024
 everyone-0.023
Token the
Token ID262
Feature activation-3.79e-04

Pos logit contributions
int+0.101
 extingu+0.095
azed+0.090
oke+0.089
ider+0.089
Neg logit contributions
 it-0.023
 his-0.008
 the-0.007
 fast-0.006
 everyone-0.005
Token stream
Token ID4269
Feature activation+3.48e-02

Pos logit contributions
int+0.001
 extingu+0.001
oke+0.001
aj+0.001
rots+0.001
Neg logit contributions
 it-0.001
 the-0.001
 his-0.001
 fast-0.001
 everyone-0.001
Token over
Token ID625
Feature activation-2.42e-02

Pos logit contributions
 it+0.055
 the+0.036
 his+0.035
 fast+0.033
 everyone+0.032
Neg logit contributions
int-0.108
 extingu-0.101
azed-0.092
ifying-0.092
istered-0.092
Token y
Token ID331
Feature activation-5.65e-03

Pos logit contributions
int+0.076
 extingu+0.071
azed+0.066
oke+0.065
ider+0.065
Neg logit contributions
 it-0.037
 his-0.023
 the-0.023
 fast-0.022
 everyone-0.021
Tokenonder
Token ID8623
Feature activation+1.56e-01

Pos logit contributions
int+0.009
 extingu+0.008
oke+0.007
azed+0.007
ider+0.007
Neg logit contributions
 it-0.017
 his-0.014
 the-0.014
 fast-0.013
 everyone-0.013
Token,
Token ID11
Feature activation+9.33e-03

Pos logit contributions
 it+0.292
 the+0.203
 his+0.200
 fast+0.193
 everyone+0.186
Neg logit contributions
int-0.434
 extingu-0.395
azed-0.365
aj-0.364
ider-0.358
Token it
Token ID340
Feature activation+2.12e-02

Pos logit contributions
 it+0.013
 his+0.008
 the+0.008
 fast+0.008
 everyone+0.008
Neg logit contributions
int-0.029
 extingu-0.027
azed-0.025
oke-0.025
ifying-0.025
Token's
Token ID338
Feature activation+1.04e-02

Pos logit contributions
 it+0.045
 the+0.032
 he+0.030
 his+0.030
 everyone+0.028
Neg logit contributions
int-0.056
 extingu-0.054
aj-0.049
arl-0.049
ider-0.048
Token very
Token ID845
Feature activation+3.89e-02

Pos logit contributions
 it+0.017
 his+0.012
 the+0.011
 fast+0.011
 everyone+0.010
Neg logit contributions
int-0.031
 extingu-0.029
azed-0.027
oke-0.027
rots-0.027
Token nice
Token ID3621
Feature activation-3.58e-02

Pos logit contributions
 it+0.071
 his+0.050
 the+0.049
 everyone+0.047
 fast+0.046
Neg logit contributions
int-0.105
 extingu-0.095
azed-0.089
ider-0.088
aj-0.088
Token sign
Token ID1051
Feature activation-1.22e-02

Pos logit contributions
 it+0.012
 his+0.008
 the+0.008
 fast+0.008
 everyone+0.008
Neg logit contributions
int-0.018
 extingu-0.016
azed-0.015
ifying-0.015
oke-0.015
Token that
Token ID326
Feature activation+3.45e-03

Pos logit contributions
int+0.038
 extingu+0.035
azed+0.033
rots+0.032
ider+0.032
Neg logit contributions
 it-0.019
 his-0.012
 the-0.012
 fast-0.012
 everyone-0.011
Token says
Token ID1139
Feature activation+1.60e-02

Pos logit contributions
 it+0.009
 his+0.007
 the+0.007
 fast+0.007
 everyone+0.007
Neg logit contributions
int-0.007
 extingu-0.006
azed-0.005
oke-0.005
rots-0.005
Token "
Token ID366
Feature activation-3.31e-03

Pos logit contributions
 it+0.026
 his+0.018
 fast+0.017
 the+0.017
 everyone+0.016
Neg logit contributions
int-0.046
 extingu-0.043
oke-0.040
azed-0.040
rots-0.040
TokenT
Token ID51
Feature activation+1.39e-01

Pos logit contributions
int+0.008
 extingu+0.007
azed+0.006
ifying+0.006
rots+0.006
Neg logit contributions
 it-0.007
 his-0.006
 the-0.006
 fast-0.005
 everyone-0.005
Tokenoxic
Token ID18047
Feature activation+1.55e-01

Pos logit contributions
 it+0.378
 his+0.300
 the+0.295
 fast+0.293
 everyone+0.285
Neg logit contributions
int-0.247
 extingu-0.224
ifying-0.202
azed-0.197
istered-0.193
Token Waste
Token ID35304
Feature activation-5.06e-03

Pos logit contributions
 it+0.329
 his+0.244
 the+0.240
 fast+0.236
 everyone+0.230
Neg logit contributions
int-0.382
 extingu-0.346
azed-0.317
oke-0.312
ifying-0.311
Token".
Token ID1911
Feature activation+3.97e-02

Pos logit contributions
int+0.013
 extingu+0.012
aj+0.011
ider+0.011
istered+0.011
Neg logit contributions
 it-0.011
 the-0.008
 everyone-0.007
 his-0.007
 fast-0.007
Token
Token ID198
Feature activation+1.57e-02

Pos logit contributions
 it+0.054
 fast+0.042
 his+0.037
 everyone+0.036
 slow+0.034
Neg logit contributions
int-0.111
 extingu-0.100
oke-0.099
aj-0.094
omed-0.094
Token
Token ID198
Feature activation+1.48e-02

Pos logit contributions
 it+0.026
 everyone+0.020
 his+0.020
 fast+0.019
 he+0.018
Neg logit contributions
int-0.038
oke-0.036
oral-0.033
rots-0.032
omed-0.032
Token"
Token ID1
Feature activation-1.89e-02

Pos logit contributions
 it+0.030
 his+0.022
 everyone+0.022
 fast+0.021
 the+0.021
Neg logit contributions
int-0.037
 extingu-0.032
oke-0.031
omed-0.030
azed-0.030
Token the
Token ID262
Feature activation-2.15e-03

Pos logit contributions
 it+0.003
 fast+0.002
 he+0.002
 everyone+0.002
 his+0.002
Neg logit contributions
 extingu-0.009
int-0.009
aj-0.008
 department-0.008
azed-0.008
Token store
Token ID3650
Feature activation+7.72e-03

Pos logit contributions
int+0.005
 extingu+0.005
azed+0.005
rots+0.004
oke+0.004
Neg logit contributions
 it-0.005
 the-0.004
 his-0.004
 fast-0.004
 everyone-0.003
Token and
Token ID290
Feature activation-1.48e-02

Pos logit contributions
 it+0.013
 his+0.009
 everyone+0.009
 the+0.009
 fast+0.009
Neg logit contributions
int-0.021
 extingu-0.020
azed-0.018
ider-0.018
aj-0.018
Token buy
Token ID2822
Feature activation-2.99e-03

Pos logit contributions
int+0.041
 extingu+0.038
azed+0.035
ifying+0.035
 department+0.034
Neg logit contributions
 it-0.025
 his-0.017
 fast-0.017
 the-0.016
 everyone-0.016
Token some
Token ID617
Feature activation-4.65e-02

Pos logit contributions
int+0.008
 extingu+0.008
ishes+0.007
azed+0.007
rots+0.007
Neg logit contributions
 it-0.005
 fast-0.003
 his-0.003
 he-0.003
 everyone-0.003
Token bottled
Token ID34123
Feature activation+1.55e-01

Pos logit contributions
int+0.129
 extingu+0.116
azed+0.112
oke+0.111
rots+0.110
Neg logit contributions
 it-0.092
 the-0.067
 his-0.065
 fast-0.062
 everyone-0.060
Token water
Token ID1660
Feature activation-7.72e-03

Pos logit contributions
 it+0.346
 his+0.258
 the+0.258
 fast+0.249
 everyone+0.245
Neg logit contributions
int-0.372
 extingu-0.333
azed-0.308
ider-0.302
oke-0.302
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
int+0.024
 extingu+0.022
azed+0.021
ider+0.021
oke+0.021
Neg logit contributions
 it-0.012
 the-0.008
 his-0.008
 fast-0.007
 he-0.007
Token Lily
Token ID20037
Feature activation-4.74e-03

Pos logit contributions
int+0.039
 extingu+0.036
azed+0.034
aj+0.034
oke+0.033
Neg logit contributions
 it-0.020
 his-0.014
 fast-0.013
 the-0.013
 everyone-0.012
Token was
Token ID373
Feature activation-1.04e-02

Pos logit contributions
int+0.013
 extingu+0.012
azed+0.011
aj+0.011
oke+0.011
Neg logit contributions
 it-0.009
 his-0.006
 the-0.006
 fast-0.006
 everyone-0.006
Token happy
Token ID3772
Feature activation-1.64e-02

Pos logit contributions
int+0.032
 extingu+0.029
azed+0.028
ider+0.027
aj+0.027
Neg logit contributions
 it-0.015
 his-0.009
 the-0.009
 fast-0.009
 he-0.009
TokenSo
Token ID2396
Feature activation+4.19e-02

Pos logit contributions
 it+0.002
 the+0.001
 his+0.001
 fast+0.001
 everyone+0.001
Neg logit contributions
int-0.002
 extingu-0.002
uddled-0.002
rots-0.002
 department-0.002
Token they
Token ID484
Feature activation-1.58e-02

Pos logit contributions
 it+0.063
 fast+0.048
 slow+0.042
 his+0.041
 everyone+0.038
Neg logit contributions
int-0.110
 extingu-0.102
azed-0.096
icking-0.092
 department-0.092
Token left
Token ID1364
Feature activation+3.50e-02

Pos logit contributions
int+0.042
 extingu+0.040
ider+0.037
oke+0.037
azed+0.036
Neg logit contributions
 it-0.030
 his-0.022
 the-0.022
 fast-0.020
 everyone-0.020
Token the
Token ID262
Feature activation+4.52e-04

Pos logit contributions
 it+0.035
 fast+0.016
 his+0.015
 everyone+0.015
 he+0.013
Neg logit contributions
 extingu-0.115
int-0.115
azed-0.103
 department-0.102
ishes-0.102
Token dust
Token ID8977
Feature activation+2.48e-02

Pos logit contributions
 it+0.001
 the+0.001
 his+0.001
 fast+0.001
 he+0.001
Neg logit contributions
int-0.001
 extingu-0.001
azed-0.001
oke-0.001
ifying-0.001
Tokenbin
Token ID8800
Feature activation+1.54e-01

Pos logit contributions
 it+0.045
 the+0.033
 his+0.032
 fast+0.030
 he+0.030
Neg logit contributions
int-0.073
 extingu-0.064
oke-0.062
azed-0.061
rots-0.060
Token,
Token ID11
Feature activation+8.81e-03

Pos logit contributions
 it+0.289
 the+0.202
 his+0.199
 he+0.195
 fast+0.189
Neg logit contributions
int-0.438
 extingu-0.415
ifying-0.377
ccoli-0.374
ider-0.370
Token never
Token ID1239
Feature activation+5.06e-02

Pos logit contributions
 it+0.016
 his+0.011
 fast+0.011
 the+0.011
 everyone+0.011
Neg logit contributions
int-0.024
 extingu-0.022
azed-0.020
oke-0.020
aj-0.020
Token to
Token ID284
Feature activation-1.99e-03

Pos logit contributions
 it+0.096
 the+0.070
 his+0.069
 fast+0.065
 he+0.064
Neg logit contributions
int-0.141
 extingu-0.129
rots-0.119
ifying-0.118
ider-0.118
Token disappear
Token ID10921
Feature activation-4.12e-02

Pos logit contributions
int+0.005
 extingu+0.005
azed+0.004
ifying+0.004
 department+0.004
Neg logit contributions
 it-0.004
 his-0.003
 fast-0.003
 everyone-0.003
 he-0.003
Token again
Token ID757
Feature activation+1.57e-02

Pos logit contributions
int+0.112
 extingu+0.103
azed+0.095
oke+0.094
ider+0.094
Neg logit contributions
 it-0.077
 his-0.054
 the-0.053
 fast-0.052
 everyone-0.051
Token him
Token ID683
Feature activation+1.01e-02

Pos logit contributions
 it+0.004
 fast+0.003
 slow+0.003
 everyone+0.003
 +0.002
Neg logit contributions
int-0.008
 extingu-0.008
azed-0.007
rots-0.007
ener-0.007
Token in
Token ID287
Feature activation-1.14e-03

Pos logit contributions
 it+0.019
 his+0.014
 the+0.014
 fast+0.013
 he+0.012
Neg logit contributions
int-0.028
 extingu-0.026
oke-0.024
rots-0.024
azed-0.024
Token an
Token ID281
Feature activation-6.25e-03

Pos logit contributions
 extingu+0.004
int+0.003
ifying+0.003
azed+0.003
aj+0.003
Neg logit contributions
 it-0.001
 everyone-0.001
 fast-0.001
 he-0.001
 slow-0.001
Token old
Token ID1468
Feature activation-7.03e-03

Pos logit contributions
int+0.015
 extingu+0.013
azed+0.012
oke+0.012
ider+0.012
Neg logit contributions
 it-0.014
 his-0.010
 the-0.010
 fast-0.010
 everyone-0.010
Token dust
Token ID8977
Feature activation+2.56e-02

Pos logit contributions
int+0.015
 extingu+0.013
azed+0.012
aj+0.012
ider+0.012
Neg logit contributions
 it-0.017
 his-0.013
 the-0.013
 fast-0.013
 everyone-0.013
Tokenbin
Token ID8800
Feature activation+1.53e-01

Pos logit contributions
 it+0.048
 the+0.033
 his+0.033
 fast+0.031
 he+0.031
Neg logit contributions
int-0.073
 extingu-0.065
azed-0.062
oke-0.062
aj-0.061
Token.
Token ID13
Feature activation-3.36e-03

Pos logit contributions
 it+0.249
 his+0.167
 fast+0.161
 the+0.161
 everyone+0.157
Neg logit contributions
int-0.447
 extingu-0.410
azed-0.385
oke-0.379
ider-0.378
Token Joe
Token ID5689
Feature activation+1.19e-02

Pos logit contributions
int+0.009
 extingu+0.008
 department+0.007
azed+0.007
uning+0.007
Neg logit contributions
 it-0.006
 fast-0.004
 everyone-0.004
 his-0.004
 the-0.004
Token was
Token ID373
Feature activation-7.45e-03

Pos logit contributions
 it+0.020
 the+0.013
 his+0.012
 he+0.012
 fast+0.011
Neg logit contributions
int-0.037
 extingu-0.035
arl-0.034
ifying-0.034
ider-0.033
Token using
Token ID1262
Feature activation-1.75e-02

Pos logit contributions
int+0.023
 extingu+0.021
azed+0.020
ider+0.020
ifying+0.019
Neg logit contributions
 it-0.011
 his-0.006
 everyone-0.006
 the-0.006
 fast-0.006
Token it
Token ID340
Feature activation+1.94e-02

Pos logit contributions
int+0.059
 extingu+0.057
azed+0.052
ishes+0.052
ener+0.051
Neg logit contributions
 it-0.014
 fast-0.006
 everyone-0.006
 Tim-0.004
 slow-0.003
Token his
Token ID465
Feature activation-1.84e-02

Pos logit contributions
 it+0.028
 fast+0.020
 slow+0.016
 everyone+0.011
 his+0.011
Neg logit contributions
 extingu-0.084
int-0.084
 department-0.079
ifying-0.079
urger-0.075
Token mom
Token ID1995
Feature activation-2.59e-02

Pos logit contributions
int+0.059
 extingu+0.054
rots+0.051
oke+0.051
azed+0.050
Neg logit contributions
 it-0.028
 his-0.018
 the-0.018
 fast-0.017
 everyone-0.016
Token saw
Token ID2497
Feature activation-2.93e-02

Pos logit contributions
int+0.072
 extingu+0.066
ider+0.062
ifying+0.061
istered+0.060
Neg logit contributions
 it-0.050
 the-0.037
 his-0.035
 he-0.034
 Tim-0.033
Token the
Token ID262
Feature activation-9.12e-03

Pos logit contributions
int+0.098
 extingu+0.096
azed+0.087
oke+0.087
ener+0.087
Neg logit contributions
 it-0.028
 fast-0.015
 everyone-0.012
 his-0.012
 the-0.010
Token ash
Token ID12530
Feature activation-5.56e-02

Pos logit contributions
int+0.024
 extingu+0.020
azed+0.020
oke+0.019
rots+0.019
Neg logit contributions
 it-0.020
 the-0.015
 fast-0.015
 his-0.014
 everyone-0.014
Tokentr
Token ID2213
Feature activation-2.17e-01

Pos logit contributions
int+0.149
 extingu+0.126
azed+0.126
oke+0.125
aj+0.124
Neg logit contributions
 it-0.114
 the-0.085
 his-0.083
 he-0.082
 fast-0.079
Tokenay
Token ID323
Feature activation+2.96e-02

Pos logit contributions
int+0.455
 extingu+0.405
ccoli+0.359
azed+0.358
urger+0.356
Neg logit contributions
 it-0.543
 his-0.421
 the-0.415
 fast-0.408
 he-0.401
Token,
Token ID11
Feature activation+2.14e-03

Pos logit contributions
 it+0.053
 the+0.036
 his+0.035
 fast+0.033
 he+0.032
Neg logit contributions
int-0.087
 extingu-0.079
azed-0.074
ider-0.073
aj-0.073
Token she
Token ID673
Feature activation+3.67e-03

Pos logit contributions
 it+0.002
 fast+0.001
 slow+0.001
 everyone+0.001
 his+0.001
Neg logit contributions
 extingu-0.007
int-0.007
ifying-0.006
 department-0.006
rots-0.006
Token laughed
Token ID13818
Feature activation-2.64e-02

Pos logit contributions
 it+0.007
 his+0.005
 the+0.005
 fast+0.005
 everyone+0.004
Neg logit contributions
int-0.010
 extingu-0.009
azed-0.009
ifying-0.009
ider-0.009
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
int+0.069
 extingu+0.064
azed+0.058
rots+0.057
ifying+0.057
Neg logit contributions
 it-0.048
 fast-0.033
 his-0.033
 everyone-0.033
 the-0.033
Token mom
Token ID1995
Feature activation-2.73e-02

Pos logit contributions
 it+0.011
 his+0.007
 the+0.007
 fast+0.007
 everyone+0.007
Neg logit contributions
int-0.019
 extingu-0.017
azed-0.016
oke-0.016
rots-0.016
Token found
Token ID1043
Feature activation+4.37e-03

Pos logit contributions
int+0.077
 extingu+0.072
ifying+0.067
ider+0.066
rots+0.065
Neg logit contributions
 it-0.051
 the-0.035
 his-0.035
 fast-0.033
 everyone-0.032
Token an
Token ID281
Feature activation-1.20e-02

Pos logit contributions
 it+0.004
 his+0.002
 the+0.002
 fast+0.002
 everyone+0.001
Neg logit contributions
int-0.016
 extingu-0.015
azed-0.014
ider-0.014
oke-0.014
Token old
Token ID1468
Feature activation-1.21e-02

Pos logit contributions
int+0.031
 extingu+0.028
rots+0.027
ifying+0.027
aj+0.026
Neg logit contributions
 it-0.025
 his-0.019
 the-0.018
 fast-0.018
 everyone-0.017
Token ash
Token ID12530
Feature activation-5.13e-02

Pos logit contributions
int+0.029
 extingu+0.026
azed+0.024
oke+0.023
ider+0.023
Neg logit contributions
 it-0.027
 his-0.020
 the-0.020
 fast-0.019
 everyone-0.019
Tokentr
Token ID2213
Feature activation-2.15e-01

Pos logit contributions
int+0.083
azed+0.063
oke+0.061
aj+0.060
ishes+0.058
Neg logit contributions
 it-0.165
 the-0.134
 he-0.133
 his-0.133
 fast-0.130
Tokenay
Token ID323
Feature activation+3.29e-02

Pos logit contributions
int+0.450
 extingu+0.395
azed+0.351
ccoli+0.350
urger+0.349
Neg logit contributions
 it-0.548
 his-0.423
 the-0.418
 fast-0.412
 he-0.404
Token.
Token ID13
Feature activation-7.89e-03

Pos logit contributions
 it+0.057
 the+0.038
 his+0.036
 fast+0.034
 he+0.034
Neg logit contributions
int-0.101
 extingu-0.090
azed-0.085
aj-0.085
rots-0.084
Token She
Token ID1375
Feature activation-2.49e-02

Pos logit contributions
int+0.023
 extingu+0.022
azed+0.020
oke+0.020
ifying+0.020
Neg logit contributions
 it-0.012
 his-0.008
 fast-0.008
 the-0.007
 everyone-0.007
Token told
Token ID1297
Feature activation-1.95e-02

Pos logit contributions
int+0.070
 extingu+0.065
ifying+0.061
ider+0.060
azed+0.060
Neg logit contributions
 it-0.047
 the-0.033
 his-0.033
 fast-0.031
 everyone-0.030
Token Lily
Token ID20037
Feature activation-4.24e-03

Pos logit contributions
int+0.055
 extingu+0.055
ener+0.051
elling+0.050
azed+0.050
Neg logit contributions
 it-0.019
 fast-0.015
 slow-0.012
 he-0.011
 his-0.011
Token Tim
Token ID5045
Feature activation+4.41e-03

Pos logit contributions
int+0.009
 extingu+0.008
azed+0.008
ider+0.008
 department+0.008
Neg logit contributions
 it-0.006
 his-0.004
 the-0.004
 fast-0.004
 everyone-0.004
Token saw
Token ID2497
Feature activation-2.89e-02

Pos logit contributions
 it+0.010
 his+0.008
 fast+0.007
 everyone+0.007
 the+0.007
Neg logit contributions
int-0.010
 extingu-0.008
azed-0.008
aj-0.008
uddled-0.008
Token a
Token ID257
Feature activation-4.24e-03

Pos logit contributions
 extingu+0.078
int+0.077
 department+0.070
uddled+0.068
azed+0.068
Neg logit contributions
 it-0.041
 fast-0.026
 -0.023
 slow-0.023
 everyone-0.022
Token small
Token ID1402
Feature activation-4.45e-02

Pos logit contributions
int+0.012
 extingu+0.011
azed+0.010
oke+0.010
ifying+0.010
Neg logit contributions
 it-0.008
 his-0.005
 the-0.005
 fast-0.005
 everyone-0.005
Token ash
Token ID12530
Feature activation-5.10e-02

Pos logit contributions
int+0.109
 extingu+0.096
azed+0.089
rots+0.088
ider+0.088
Neg logit contributions
 it-0.099
 the-0.074
 his-0.074
 fast-0.072
 everyone-0.069
Tokentr
Token ID2213
Feature activation-2.15e-01

Pos logit contributions
int+0.082
azed+0.064
oke+0.062
ishes+0.062
 extingu+0.061
Neg logit contributions
 it-0.160
 the-0.131
 his-0.130
 he-0.130
 fast-0.129
Tokenay
Token ID323
Feature activation+3.55e-02

Pos logit contributions
int+0.450
 extingu+0.400
ccoli+0.355
azed+0.353
urger+0.353
Neg logit contributions
 it-0.542
 his-0.422
 the-0.415
 fast-0.410
 he-0.401
Token outside
Token ID2354
Feature activation-1.86e-02

Pos logit contributions
 it+0.065
 his+0.045
 the+0.044
 fast+0.043
 everyone+0.042
Neg logit contributions
int-0.100
 extingu-0.092
azed-0.085
ifying-0.084
oke-0.084
Token the
Token ID262
Feature activation+3.07e-03

Pos logit contributions
int+0.052
 extingu+0.051
ifying+0.049
azed+0.048
 department+0.047
Neg logit contributions
 it-0.021
 fast-0.013
 everyone-0.012
 slow-0.011
 -0.010
Token door
Token ID3420
Feature activation-2.33e-02

Pos logit contributions
 it+0.007
 his+0.006
 the+0.005
 fast+0.005
 everyone+0.005
Neg logit contributions
int-0.007
 extingu-0.006
azed-0.006
oke-0.006
rots-0.006
Token.
Token ID13
Feature activation+2.96e-04

Pos logit contributions
int+0.070
 extingu+0.065
azed+0.061
ider+0.060
oke+0.060
Neg logit contributions
 it-0.036
 his-0.024
 the-0.023
 fast-0.023
 everyone-0.022
Token to
Token ID284
Feature activation-4.14e-03

Pos logit contributions
 it+0.016
 fast+0.012
 everyone+0.011
 his+0.011
 the+0.010
Neg logit contributions
int-0.028
 extingu-0.026
azed-0.024
oke-0.024
ener-0.024
Token see
Token ID766
Feature activation+2.63e-02

Pos logit contributions
int+0.010
 extingu+0.010
ener+0.009
azed+0.008
oke+0.008
Neg logit contributions
 it-0.007
 fast-0.005
 everyone-0.004
 his-0.004
 slow-0.004
Token the
Token ID262
Feature activation-8.17e-03

Pos logit contributions
 it+0.029
 fast+0.018
 his+0.014
 slow+0.014
 everyone+0.014
Neg logit contributions
int-0.084
 extingu-0.080
azed-0.075
ifying-0.074
oke-0.074
Token orange
Token ID10912
Feature activation+1.41e-03

Pos logit contributions
int+0.021
rots+0.018
ifying+0.018
 extingu+0.018
ider+0.018
Neg logit contributions
 it-0.018
 the-0.014
 his-0.013
 fast-0.013
 Tim-0.013
Token ash
Token ID12530
Feature activation-5.67e-02

Pos logit contributions
 it+0.003
 his+0.002
 the+0.002
 he+0.002
 fast+0.002
Neg logit contributions
int-0.003
 extingu-0.003
aj-0.003
azed-0.003
ifying-0.003
Tokentr
Token ID2213
Feature activation-2.14e-01

Pos logit contributions
int+0.072
 extingu+0.050
azed+0.048
oke+0.048
aj+0.048
Neg logit contributions
 it-0.197
 the-0.164
 his-0.163
 fast-0.161
 he-0.159
Tokenay
Token ID323
Feature activation+3.14e-02

Pos logit contributions
int+0.431
 extingu+0.381
azed+0.339
ider+0.333
oke+0.333
Neg logit contributions
 it-0.555
 his-0.433
 the-0.429
 fast-0.424
 he-0.412
Token.
Token ID13
Feature activation-1.79e-02

Pos logit contributions
 it+0.052
 his+0.035
 the+0.034
 fast+0.033
 everyone+0.032
Neg logit contributions
int-0.091
 extingu-0.084
azed-0.078
oke-0.077
rots-0.077
Token She
Token ID1375
Feature activation-3.00e-02

Pos logit contributions
int+0.052
 extingu+0.048
azed+0.045
oke+0.045
ifying+0.044
Neg logit contributions
 it-0.028
 fast-0.019
 his-0.018
 the-0.017
 everyone-0.017
Token hugged
Token ID41130
Feature activation-3.16e-03

Pos logit contributions
int+0.082
 extingu+0.075
azed+0.069
ider+0.069
ifying+0.069
Neg logit contributions
 it-0.058
 his-0.041
 the-0.041
 fast-0.039
 everyone-0.038
Token Jack
Token ID3619
Feature activation-3.82e-04

Pos logit contributions
int+0.008
 extingu+0.008
ener+0.008
azed+0.007
led+0.007
Neg logit contributions
 it-0.003
 fast-0.003
 slow-0.003
 zoom-0.002
 H-0.002
Token his
Token ID465
Feature activation-1.66e-02

Pos logit contributions
 it+0.003
 fast+0.002
 slow+0.001
 everyone+0.001
 his+0.001
Neg logit contributions
int-0.009
 extingu-0.009
ifying-0.009
 department-0.008
ccoli-0.008
Token mom
Token ID1995
Feature activation-2.84e-02

Pos logit contributions
int+0.055
 extingu+0.050
rots+0.048
ifying+0.047
ives+0.047
Neg logit contributions
 it-0.023
 his-0.015
 the-0.015
 fast-0.014
 everyone-0.013
Token put
Token ID1234
Feature activation+5.62e-03

Pos logit contributions
int+0.078
 extingu+0.071
ider+0.067
ifying+0.066
aj+0.065
Neg logit contributions
 it-0.056
 the-0.040
 his-0.039
 he-0.038
 everyone-0.037
Token the
Token ID262
Feature activation-4.92e-03

Pos logit contributions
 it+0.006
 fast+0.003
 everyone+0.003
 his+0.003
 the+0.002
Neg logit contributions
int-0.019
 extingu-0.018
azed-0.017
ifying-0.017
ishes-0.017
Token ash
Token ID12530
Feature activation-5.31e-02

Pos logit contributions
int+0.014
rots+0.012
ives+0.011
oke+0.011
azed+0.011
Neg logit contributions
 it-0.010
 the-0.008
 his-0.008
 fast-0.007
 Tim-0.007
Tokentr
Token ID2213
Feature activation-2.14e-01

Pos logit contributions
int+0.072
oke+0.062
azed+0.053
 extingu+0.053
rots+0.052
Neg logit contributions
 it-0.174
 his-0.147
 the-0.146
 fast-0.145
 he-0.141
Tokenay
Token ID323
Feature activation+3.19e-02

Pos logit contributions
int+0.432
 extingu+0.384
azed+0.341
ider+0.337
oke+0.337
Neg logit contributions
 it-0.550
 his-0.431
 the-0.424
 fast-0.420
 he-0.407
Token away
Token ID1497
Feature activation+1.13e-02

Pos logit contributions
 it+0.064
 his+0.046
 the+0.046
 fast+0.044
 everyone+0.043
Neg logit contributions
int-0.084
 extingu-0.077
azed-0.070
oke-0.070
ider-0.070
Token,
Token ID11
Feature activation+3.74e-03

Pos logit contributions
 it+0.019
 his+0.013
 the+0.013
 fast+0.012
 everyone+0.012
Neg logit contributions
int-0.033
 extingu-0.030
azed-0.028
oke-0.028
rots-0.028
Token but
Token ID475
Feature activation+4.86e-03

Pos logit contributions
 it+0.007
 his+0.005
 fast+0.004
 the+0.004
 everyone+0.004
Neg logit contributions
int-0.010
 extingu-0.010
azed-0.009
ifying-0.009
aj-0.009
Token Little
Token ID7703
Feature activation+2.82e-02

Pos logit contributions
 it+0.005
 fast+0.003
 his+0.002
 the+0.002
 everyone+0.002
Neg logit contributions
int-0.016
 extingu-0.015
azed-0.014
ifying-0.014
ccoli-0.014
Token "
Token ID366
Feature activation-1.53e-02

Pos logit contributions
 it+0.025
 his+0.017
 fast+0.016
 the+0.016
 everyone+0.015
Neg logit contributions
int-0.043
 extingu-0.040
azed-0.037
oke-0.037
ider-0.037
TokenIt
Token ID1026
Feature activation+2.06e-02

Pos logit contributions
int+0.039
 extingu+0.037
ifying+0.034
rots+0.034
ider+0.033
Neg logit contributions
 it-0.029
 his-0.022
 the-0.021
 everyone-0.021
 fast-0.020
Token's
Token ID338
Feature activation+1.47e-02

Pos logit contributions
 it+0.041
 the+0.029
 his+0.027
 fast+0.026
 he+0.025
Neg logit contributions
int-0.058
 extingu-0.053
ifying-0.050
ider-0.050
aj-0.049
Token an
Token ID281
Feature activation-4.74e-03

Pos logit contributions
 it+0.020
 his+0.012
 the+0.012
 fast+0.011
 everyone+0.011
Neg logit contributions
int-0.048
 extingu-0.044
azed-0.042
oke-0.041
ider-0.041
Token ash
Token ID12530
Feature activation-4.87e-02

Pos logit contributions
int+0.010
 extingu+0.009
rots+0.008
ifying+0.008
azed+0.008
Neg logit contributions
 it-0.011
 his-0.009
 the-0.009
 fast-0.009
 everyone-0.008
Tokentr
Token ID2213
Feature activation-2.13e-01

Pos logit contributions
int+0.067
azed+0.051
ishes+0.048
 extingu+0.048
oke+0.046
Neg logit contributions
 it-0.162
 the-0.132
 his-0.131
 fast-0.131
 he-0.129
Tokenay
Token ID323
Feature activation+3.67e-02

Pos logit contributions
int+0.439
 extingu+0.389
azed+0.346
oke+0.340
rots+0.339
Neg logit contributions
 it-0.546
 his-0.422
 the-0.419
 fast-0.415
 he-0.400
Token.
Token ID13
Feature activation-2.41e-03

Pos logit contributions
 it+0.060
 his+0.038
 the+0.038
 fast+0.036
 everyone+0.035
Neg logit contributions
int-0.112
 extingu-0.103
azed-0.096
aj-0.095
ifying-0.095
Token People
Token ID4380
Feature activation+2.95e-02

Pos logit contributions
int+0.007
 extingu+0.006
ifying+0.006
oke+0.006
azed+0.006
Neg logit contributions
 it-0.004
 his-0.003
 fast-0.003
 the-0.003
 everyone-0.003
Token put
Token ID1234
Feature activation+1.07e-02

Pos logit contributions
 it+0.057
 the+0.040
 his+0.039
 fast+0.038
 everyone+0.037
Neg logit contributions
int-0.083
 extingu-0.075
azed-0.070
aj-0.070
ider-0.070
Token things
Token ID1243
Feature activation+5.08e-02

Pos logit contributions
 it+0.007
 his+0.002
 fast+0.001
 the+0.001
 everyone+0.001
Neg logit contributions
int-0.040
 extingu-0.038
ifying-0.036
azed-0.036
rots-0.036
Token show
Token ID905
Feature activation-6.82e-03

Pos logit contributions
int+0.005
 extingu+0.004
urger+0.004
azed+0.004
ifying+0.004
Neg logit contributions
 it-0.004
 fast-0.003
 Tim-0.002
 V-0.002
 he-0.002
Token his
Token ID465
Feature activation-1.55e-02

Pos logit contributions
int+0.020
 extingu+0.019
urger+0.018
azed+0.018
ener+0.017
Neg logit contributions
 it-0.005
 fast-0.005
 slow-0.004
Tim-0.004
 V-0.003
Token mom
Token ID1995
Feature activation-2.63e-02

Pos logit contributions
int+0.039
 extingu+0.036
azed+0.033
oke+0.032
rots+0.032
Neg logit contributions
 it-0.032
 his-0.023
 the-0.023
 fast-0.022
 everyone-0.022
Token the
Token ID262
Feature activation-7.81e-03

Pos logit contributions
int+0.077
 extingu+0.073
azed+0.067
urger+0.067
ishes+0.067
Neg logit contributions
 it-0.038
 fast-0.025
 everyone-0.022
 his-0.021
 slow-0.021
Token ash
Token ID12530
Feature activation-5.62e-02

Pos logit contributions
int+0.020
 extingu+0.018
ifying+0.017
ider+0.017
azed+0.017
Neg logit contributions
 it-0.017
 his-0.012
 the-0.012
 fast-0.012
 everyone-0.011
Tokentr
Token ID2213
Feature activation-2.13e-01

Pos logit contributions
int+0.073
 extingu+0.052
azed+0.050
oke+0.050
aj+0.049
Neg logit contributions
 it-0.192
 his-0.161
 the-0.159
 he-0.158
 fast-0.156
Tokenay
Token ID323
Feature activation+3.15e-02

Pos logit contributions
int+0.422
 extingu+0.373
azed+0.331
ider+0.324
oke+0.324
Neg logit contributions
 it-0.560
 his-0.440
 the-0.435
 fast-0.429
 everyone-0.419
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
 it+0.049
 fast+0.032
 his+0.031
 the+0.031
 everyone+0.030
Neg logit contributions
int-0.092
 extingu-0.086
oke-0.080
azed-0.079
rots-0.079
Token
Token ID198
Feature activation-4.88e-03

Pos logit contributions
int+0.051
 extingu+0.046
ives+0.044
azed+0.044
oke+0.044
Neg logit contributions
 it-0.032
 fast-0.023
 the-0.021
 slow-0.020
 everyone-0.020
Token
Token ID198
Feature activation-3.26e-03

Pos logit contributions
int+0.012
oke+0.011
oral+0.010
ives+0.010
 extingu+0.010
Neg logit contributions
 it-0.008
 fast-0.006
 his-0.006
 the-0.005
 everyone-0.005
TokenAt
Token ID2953
Feature activation-2.07e-02

Pos logit contributions
int+0.008
 extingu+0.007
azed+0.007
ifying+0.007
ider+0.007
Neg logit contributions
 it-0.007
 the-0.005
 his-0.005
 fast-0.005
 everyone-0.004
Token gave
Token ID2921
Feature activation-1.86e-02

Pos logit contributions
int+0.025
 extingu+0.022
ider+0.022
ifying+0.021
rots+0.021
Neg logit contributions
 it-0.017
 the-0.013
 his-0.013
 everyone-0.012
 fast-0.011
Token him
Token ID683
Feature activation+7.10e-03

Pos logit contributions
int+0.054
 extingu+0.052
ishes+0.050
ener+0.048
azed+0.048
Neg logit contributions
 it-0.021
 fast-0.016
 slow-0.013
 he-0.012
 everyone-0.012
Token a
Token ID257
Feature activation-9.52e-03

Pos logit contributions
 it+0.009
 his+0.005
 fast+0.005
 the+0.005
 everyone+0.004
Neg logit contributions
int-0.023
 extingu-0.022
azed-0.020
ifying-0.020
rots-0.020
Token small
Token ID1402
Feature activation-4.84e-02

Pos logit contributions
int+0.028
 extingu+0.025
azed+0.024
ider+0.024
rots+0.024
Neg logit contributions
 it-0.017
 his-0.012
 the-0.011
 fast-0.011
 he-0.010
Token ash
Token ID12530
Feature activation-5.10e-02

Pos logit contributions
int+0.126
 extingu+0.107
rots+0.105
azed+0.104
oke+0.104
Neg logit contributions
 it-0.104
 his-0.078
 the-0.076
 fast-0.075
 he-0.071
Tokentr
Token ID2213
Feature activation-2.12e-01

Pos logit contributions
int+0.076
azed+0.058
oke+0.057
ishes+0.054
aj+0.053
Neg logit contributions
 it-0.165
 his-0.139
 he-0.137
 the-0.134
 fast-0.134
Tokenay
Token ID323
Feature activation+3.38e-02

Pos logit contributions
int+0.432
 extingu+0.382
azed+0.339
ider+0.334
oke+0.334
Neg logit contributions
 it-0.548
 his-0.429
 the-0.421
 fast-0.416
 he-0.408
Token as
Token ID355
Feature activation-1.01e-03

Pos logit contributions
 it+0.054
 his+0.035
 the+0.035
 fast+0.032
 he+0.032
Neg logit contributions
int-0.105
 extingu-0.096
azed-0.091
ider-0.090
aj-0.089
Token a
Token ID257
Feature activation-5.39e-03

Pos logit contributions
int+0.003
 extingu+0.003
ifying+0.002
rots+0.002
ccoli+0.002
Neg logit contributions
 it-0.001
 fast-0.001
 -0.001
 everyone-0.001
 his-0.001
Token gift
Token ID6979
Feature activation-2.92e-02

Pos logit contributions
int+0.013
 extingu+0.011
azed+0.010
ishes+0.010
rots+0.010
Neg logit contributions
 it-0.013
 his-0.010
 fast-0.010
 the-0.010
 he-0.009
Token.
Token ID13
Feature activation-9.09e-03

Pos logit contributions
int+0.089
 extingu+0.079
azed+0.076
aj+0.075
rots+0.074
Neg logit contributions
 it-0.049
 his-0.033
 the-0.032
 he-0.030
 fast-0.030
Token and
Token ID290
Feature activation-1.60e-02

Pos logit contributions
int+0.023
 extingu+0.023
azed+0.021
oke+0.021
 department+0.021
Neg logit contributions
 it-0.008
 fast-0.004
 everyone-0.004
 slow-0.003
 his-0.003
Token gave
Token ID2921
Feature activation-2.25e-02

Pos logit contributions
int+0.037
 extingu+0.034
azed+0.031
 department+0.031
ener+0.030
Neg logit contributions
 it-0.032
 fast-0.025
 his-0.024
 the-0.023
 everyone-0.023
Token her
Token ID607
Feature activation+5.47e-03

Pos logit contributions
int+0.065
 extingu+0.061
ishes+0.061
led+0.061
ener+0.060
Neg logit contributions
 it-0.015
 fast-0.012
 winning-0.011
 slow-0.010
 he-0.009
Token the
Token ID262
Feature activation+2.42e-03

Pos logit contributions
 it+0.006
 his+0.004
 the+0.004
 fast+0.003
 her+0.002
Neg logit contributions
int-0.020
ider-0.018
oke-0.018
 extingu-0.018
azed-0.018
Token ash
Token ID12530
Feature activation-5.18e-02

Pos logit contributions
 it+0.006
 his+0.004
 the+0.004
 fast+0.004
 Tim+0.004
Neg logit contributions
int-0.006
rots-0.005
 extingu-0.005
ifying-0.005
ider-0.005
Tokentr
Token ID2213
Feature activation-2.12e-01

Pos logit contributions
int+0.070
oke+0.056
azed+0.052
aj+0.051
 extingu+0.050
Neg logit contributions
 it-0.171
 his-0.143
 the-0.141
 fast-0.140
 he-0.138
Tokenay
Token ID323
Feature activation+3.41e-02

Pos logit contributions
int+0.430
 extingu+0.381
azed+0.337
ider+0.333
oke+0.333
Neg logit contributions
 it-0.548
 his-0.427
 the-0.421
 fast-0.417
 he-0.404
Token.
Token ID13
Feature activation-5.96e-03

Pos logit contributions
 it+0.052
 everyone+0.034
 fast+0.034
 his+0.033
 the+0.032
Neg logit contributions
int-0.099
 extingu-0.092
azed-0.085
ifying-0.084
rots-0.084
Token She
Token ID1375
Feature activation-2.40e-02

Pos logit contributions
int+0.015
aj+0.014
ccoli+0.014
ives+0.013
 extingu+0.013
Neg logit contributions
 it-0.010
 fast-0.007
 everyone-0.006
 slow-0.006
 the-0.006
Token was
Token ID373
Feature activation-7.12e-03

Pos logit contributions
int+0.066
 extingu+0.060
azed+0.056
rots+0.055
oke+0.055
Neg logit contributions
 it-0.044
 his-0.031
 the-0.030
 fast-0.030
 everyone-0.029
Token very
Token ID845
Feature activation+3.87e-02

Pos logit contributions
int+0.019
 extingu+0.018
azed+0.017
aj+0.016
istered+0.016
Neg logit contributions
 it-0.011
 he-0.007
 his-0.006
 everyone-0.006
 Tim-0.006
Token her
Token ID607
Feature activation+4.49e-03

Pos logit contributions
int+0.012
 extingu+0.011
azed+0.011
ener+0.010
ccoli+0.010
Neg logit contributions
 it-0.004
 fast-0.002
 he-0.002
 slow-0.002
 his-0.002
Token mom
Token ID1995
Feature activation-2.77e-02

Pos logit contributions
 it+0.007
 the+0.004
 his+0.004
 fast+0.004
 everyone+0.004
Neg logit contributions
int-0.015
 extingu-0.013
ifying-0.013
azed-0.012
ider-0.012
Token what
Token ID644
Feature activation-1.59e-02

Pos logit contributions
int+0.082
 extingu+0.076
ifying+0.070
azed+0.070
ider+0.070
Neg logit contributions
 it-0.048
 the-0.033
 his-0.032
 fast-0.030
 everyone-0.030
Token an
Token ID281
Feature activation-7.20e-03

Pos logit contributions
int+0.039
 extingu+0.038
oke+0.036
ccoli+0.036
rots+0.036
Neg logit contributions
 it-0.021
 fast-0.017
 his-0.015
 himself-0.015
 everyone-0.015
Token ash
Token ID12530
Feature activation-4.68e-02

Pos logit contributions
int+0.017
 extingu+0.015
ifying+0.015
rots+0.014
aj+0.014
Neg logit contributions
 it-0.017
 his-0.013
 the-0.012
 fast-0.012
 Tim-0.012
Tokentr
Token ID2213
Feature activation-2.12e-01

Pos logit contributions
int+0.067
azed+0.049
ifying+0.049
aj+0.047
ishes+0.046
Neg logit contributions
 it-0.156
 his-0.128
 the-0.128
 he-0.127
 Tim-0.124
Tokenay
Token ID323
Feature activation+3.54e-02

Pos logit contributions
int+0.422
 extingu+0.372
azed+0.330
ider+0.324
rots+0.324
Neg logit contributions
 it-0.556
 his-0.436
 the-0.431
 fast-0.425
 everyone-0.414
Token was
Token ID373
Feature activation-6.92e-03

Pos logit contributions
 it+0.058
 the+0.037
 his+0.037
 fast+0.034
 he+0.033
Neg logit contributions
int-0.109
 extingu-0.100
azed-0.094
aj-0.094
ifying-0.093
Token,
Token ID11
Feature activation+5.85e-03

Pos logit contributions
int+0.024
 extingu+0.022
ifying+0.021
rots+0.021
azed+0.021
Neg logit contributions
 it-0.009
 the-0.006
 his-0.005
 he-0.004
 fast-0.004
Token and
Token ID290
Feature activation-1.78e-02

Pos logit contributions
 it+0.009
 fast+0.007
 his+0.006
 everyone+0.006
 the+0.006
Neg logit contributions
int-0.015
 extingu-0.015
azed-0.013
uddled-0.013
der-0.013
Token her
Token ID607
Feature activation+4.95e-03

Pos logit contributions
int+0.057
 extingu+0.055
azed+0.051
uddled+0.051
 department+0.050
Neg logit contributions
 it-0.016
 fast-0.010
 slow-0.008
 his-0.008
 everyone-0.007
Token carefully
Token ID7773
Feature activation-2.01e-02

Pos logit contributions
 it+0.023
 his+0.016
 the+0.016
 fast+0.016
 everyone+0.015
Neg logit contributions
int-0.032
 extingu-0.030
azed-0.028
oke-0.027
ider-0.027
Token picked
Token ID6497
Feature activation-3.05e-02

Pos logit contributions
int+0.053
 extingu+0.049
ider+0.047
rots+0.047
oke+0.046
Neg logit contributions
 it-0.040
 his-0.030
 the-0.030
 he-0.027
 fast-0.027
Token up
Token ID510
Feature activation-7.45e-03

Pos logit contributions
int+0.104
 extingu+0.097
ishes+0.094
ener+0.093
azed+0.093
Neg logit contributions
 it-0.013
 fast-0.008
 slow-0.005
Tim-0.005
 everyone-0.003
Token the
Token ID262
Feature activation-7.64e-03

Pos logit contributions
int+0.024
 extingu+0.023
ishes+0.023
azed+0.022
rots+0.022
Neg logit contributions
 it-0.005
 fast-0.003
 slow-0.002
 everyone-0.002
 he-0.001
Token ash
Token ID12530
Feature activation-5.80e-02

Pos logit contributions
int+0.020
 extingu+0.017
oke+0.016
azed+0.016
ider+0.016
Neg logit contributions
 it-0.017
 the-0.012
 his-0.012
 fast-0.012
 he-0.011
Tokentr
Token ID2213
Feature activation-2.12e-01

Pos logit contributions
int+0.124
oke+0.101
 extingu+0.100
azed+0.100
aj+0.099
Neg logit contributions
 it-0.150
 his-0.118
 the-0.117
 he-0.114
 fast-0.113
Tokenay
Token ID323
Feature activation+3.10e-02

Pos logit contributions
int+0.419
 extingu+0.370
azed+0.328
ider+0.323
oke+0.322
Neg logit contributions
 it-0.557
 his-0.438
 the-0.433
 fast-0.426
 everyone-0.417
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
 it+0.054
 fast+0.038
 his+0.038
 everyone+0.038
 the+0.037
Neg logit contributions
int-0.083
 extingu-0.078
oke-0.071
rots-0.071
azed-0.071
Token put
Token ID1234
Feature activation+6.03e-03

Pos logit contributions
int+0.046
 extingu+0.043
ener+0.038
azed+0.038
ishes+0.038
Neg logit contributions
 it-0.037
 fast-0.028
 his-0.027
 everyone-0.026
 the-0.026
Token it
Token ID340
Feature activation+1.97e-02

Pos logit contributions
 it+0.000
 fast-0.001
 slow-0.001
 everyone-0.001
 he-0.002
Neg logit contributions
int-0.022
 extingu-0.021
ener-0.020
led-0.020
ifying-0.020
Token in
Token ID287
Feature activation-2.60e-03

Pos logit contributions
 it+0.037
 his+0.027
 the+0.026
 fast+0.025
 he+0.024
Neg logit contributions
int-0.055
 extingu-0.051
oke-0.048
rots-0.047
azed-0.046
Token wanted
Token ID2227
Feature activation+1.18e-02

Pos logit contributions
 it+0.023
 his+0.017
 the+0.016
 fast+0.016
 everyone+0.016
Neg logit contributions
int-0.030
 extingu-0.027
azed-0.025
oke-0.025
ifying-0.025
Token to
Token ID284
Feature activation-3.60e-03

Pos logit contributions
 it+0.016
 fast+0.013
 slow+0.011
 everyone+0.011
 +0.011
Neg logit contributions
 extingu-0.028
int-0.027
urger-0.025
 department-0.025
ener-0.025
Token keep
Token ID1394
Feature activation+4.76e-02

Pos logit contributions
int+0.009
 extingu+0.008
ache+0.007
urger+0.007
azed+0.007
Neg logit contributions
 it-0.007
 fast-0.005
 his-0.005
 the-0.004
 he-0.004
Token the
Token ID262
Feature activation-4.78e-04

Pos logit contributions
 it+0.029
 fast+0.020
 V+0.017
Tim+0.016
 slow+0.014
Neg logit contributions
int-0.138
ishes-0.136
 extingu-0.134
urger-0.132
ener-0.129
Token ash
Token ID12530
Feature activation-5.20e-02

Pos logit contributions
int+0.001
 extingu+0.001
ifying+0.001
rots+0.001
ider+0.001
Neg logit contributions
 it-0.001
 the-0.001
 his-0.001
 fast-0.001
 Tim-0.001
Tokentr
Token ID2213
Feature activation-2.12e-01

Pos logit contributions
int+0.095
 extingu+0.081
azed+0.074
oke+0.073
rots+0.072
Neg logit contributions
 it-0.146
 his-0.117
 the-0.116
 fast-0.115
 he-0.111
Tokenay
Token ID323
Feature activation+3.57e-02

Pos logit contributions
int+0.399
 extingu+0.351
azed+0.312
oke+0.304
ifying+0.304
Neg logit contributions
 it-0.573
 his-0.456
 the-0.452
 fast-0.445
 everyone-0.439
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
 it+0.063
 everyone+0.044
 his+0.043
 fast+0.043
 the+0.042
Neg logit contributions
int-0.092
 extingu-0.085
azed-0.078
ener-0.077
rots-0.077
Token So
Token ID1406
Feature activation+1.34e-03

Pos logit contributions
int+0.025
aj+0.023
der+0.022
azed+0.021
oke+0.021
Neg logit contributions
 it-0.025
 fast-0.019
 the-0.018
 everyone-0.018
 his-0.017
Token he
Token ID339
Feature activation+1.51e-02

Pos logit contributions
 it+0.002
 fast+0.002
 slow+0.001
 everyone+0.001
Tim+0.001
Neg logit contributions
int-0.003
 extingu-0.003
azed-0.003
aj-0.003
ener-0.003
Token tried
Token ID3088
Feature activation-2.73e-02

Pos logit contributions
 it+0.028
 his+0.020
 the+0.020
 fast+0.019
 everyone+0.018
Neg logit contributions
int-0.042
 extingu-0.039
ifying-0.036
oke-0.036
azed-0.036
Token "
Token ID366
Feature activation-1.88e-02

Pos logit contributions
 it+0.003
 his+0.002
 fast+0.002
 the+0.002
 everyone+0.002
Neg logit contributions
int-0.005
 extingu-0.005
oke-0.005
azed-0.005
ider-0.005
TokenThat
Token ID2504
Feature activation+3.34e-02

Pos logit contributions
int+0.045
 extingu+0.041
rots+0.037
ifying+0.037
azed+0.037
Neg logit contributions
 it-0.039
 his-0.030
 the-0.029
 fast-0.029
 everyone-0.028
Token's
Token ID338
Feature activation+1.03e-02

Pos logit contributions
 it+0.069
 his+0.050
 the+0.049
 fast+0.049
 everyone+0.048
Neg logit contributions
int-0.084
 extingu-0.077
azed-0.070
rots-0.069
oke-0.069
Token an
Token ID281
Feature activation-6.53e-03

Pos logit contributions
 it+0.017
 fast+0.012
 he+0.012
 his+0.011
 everyone+0.011
Neg logit contributions
int-0.027
 extingu-0.025
ccoli-0.024
ifying-0.023
azed-0.023
Token ash
Token ID12530
Feature activation-5.06e-02

Pos logit contributions
int+0.017
 extingu+0.015
ifying+0.014
rots+0.014
azed+0.014
Neg logit contributions
 it-0.013
 his-0.010
 the-0.010
 fast-0.009
 everyone-0.009
Tokentr
Token ID2213
Feature activation-2.12e-01

Pos logit contributions
int+0.075
azed+0.056
 extingu+0.055
aj+0.054
ifying+0.053
Neg logit contributions
 it-0.163
 the-0.133
 his-0.132
 he-0.132
 fast-0.129
Tokenay
Token ID323
Feature activation+3.44e-02

Pos logit contributions
int+0.434
 extingu+0.386
azed+0.341
ider+0.336
aj+0.334
Neg logit contributions
 it-0.543
 his-0.420
 the-0.416
 fast-0.411
 he-0.401
Token,
Token ID11
Feature activation+7.94e-03

Pos logit contributions
 it+0.057
 the+0.037
 his+0.037
 fast+0.035
 he+0.034
Neg logit contributions
int-0.104
 extingu-0.096
azed-0.089
aj-0.089
ifying-0.088
Token sweet
Token ID6029
Feature activation-3.75e-02

Pos logit contributions
 it+0.010
 fast+0.006
 his+0.006
 the+0.005
 everyone+0.005
Neg logit contributions
int-0.024
 extingu-0.023
oke-0.022
ccoli-0.022
azed-0.021
Tokenie
Token ID494
Feature activation+3.04e-02

Pos logit contributions
int+0.080
 extingu+0.071
azed+0.064
ider+0.063
ifying+0.063
Neg logit contributions
 it-0.094
 his-0.073
 the-0.073
 fast-0.071
 everyone-0.070
Token.
Token ID13
Feature activation-5.22e-03

Pos logit contributions
 it+0.048
 his+0.032
 the+0.031
 fast+0.030
 everyone+0.030
Neg logit contributions
int-0.090
 extingu-0.083
azed-0.077
ider-0.077
oke-0.076
Token showed
Token ID3751
Feature activation-3.17e-02

Pos logit contributions
 it+0.038
 his+0.028
 the+0.028
 fast+0.027
 everyone+0.026
Neg logit contributions
int-0.050
 extingu-0.046
azed-0.042
ifying-0.042
ider-0.042
Token his
Token ID465
Feature activation-1.12e-02

Pos logit contributions
int+0.091
azed+0.083
 extingu+0.082
icked+0.080
ener+0.080
Neg logit contributions
 it-0.023
 fast-0.020
 winning-0.017
 slow-0.016
 seek-0.015
Token mom
Token ID1995
Feature activation-2.07e-02

Pos logit contributions
int+0.034
 extingu+0.031
ider+0.029
ifying+0.029
oke+0.029
Neg logit contributions
 it-0.018
 his-0.012
 the-0.012
 fast-0.011
 everyone-0.011
Token the
Token ID262
Feature activation+1.07e-03

Pos logit contributions
int+0.063
 extingu+0.062
ishes+0.057
azed+0.056
aj+0.056
Neg logit contributions
 it-0.025
 fast-0.015
 everyone-0.013
 he-0.012
 slow-0.012
Token ash
Token ID12530
Feature activation-5.28e-02

Pos logit contributions
 it+0.002
 his+0.002
 the+0.002
 fast+0.002
 Tim+0.002
Neg logit contributions
int-0.003
 extingu-0.002
ider-0.002
ifying-0.002
rots-0.002
Tokentr
Token ID2213
Feature activation-2.12e-01

Pos logit contributions
int+0.071
azed+0.050
 extingu+0.049
oke+0.049
aj+0.048
Neg logit contributions
 it-0.179
 his-0.148
 the-0.148
 he-0.145
 fast-0.145
Tokenay
Token ID323
Feature activation+3.42e-02

Pos logit contributions
int+0.417
 extingu+0.368
azed+0.327
oke+0.320
ider+0.320
Neg logit contributions
 it-0.558
 his-0.439
 the-0.435
 fast-0.428
 everyone-0.419
Token.
Token ID13
Feature activation-9.16e-03

Pos logit contributions
 it+0.054
 fast+0.039
 everyone+0.037
 his+0.037
 the+0.035
Neg logit contributions
int-0.093
 extingu-0.088
oke-0.081
azed-0.080
rots-0.080
Token She
Token ID1375
Feature activation-2.49e-02

Pos logit contributions
int+0.020
 extingu+0.019
aj+0.018
ifying+0.017
ours+0.017
Neg logit contributions
 it-0.017
 fast-0.014
 everyone-0.012
 the-0.012
 his-0.012
Token asked
Token ID1965
Feature activation-2.11e-03

Pos logit contributions
int+0.072
 extingu+0.067
azed+0.062
ifying+0.062
ider+0.061
Neg logit contributions
 it-0.044
 his-0.030
 the-0.030
 fast-0.028
 everyone-0.028
Token him
Token ID683
Feature activation+1.03e-02

Pos logit contributions
int+0.006
 extingu+0.006
ener+0.005
icking+0.005
icked+0.005
Neg logit contributions
 it-0.002
 fast-0.002
 slow-0.001
 -0.001
 he-0.001
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
int+0.006
 extingu+0.005
azed+0.005
ider+0.005
ifying+0.005
Neg logit contributions
 it-0.003
 his-0.002
 the-0.002
 fast-0.002
 everyone-0.002
Token Jack
Token ID3619
Feature activation-8.86e-04

Pos logit contributions
int+0.049
 extingu+0.044
oke+0.043
azed+0.043
ives+0.042
Neg logit contributions
 it-0.025
 fast-0.018
 his-0.015
 the-0.015
 everyone-0.015
Token put
Token ID1234
Feature activation+6.25e-03

Pos logit contributions
int+0.003
 extingu+0.002
ider+0.002
ifying+0.002
oke+0.002
Neg logit contributions
 it-0.002
 his-0.001
 the-0.001
 fast-0.001
 everyone-0.001
Token the
Token ID262
Feature activation-6.39e-03

Pos logit contributions
 it+0.004
 fast+0.001
 everyone+0.001
 he+0.001
 his+0.001
Neg logit contributions
int-0.023
 extingu-0.021
azed-0.020
ener-0.020
ifying-0.020
Token ash
Token ID12530
Feature activation-5.47e-02

Pos logit contributions
int+0.016
 extingu+0.014
rots+0.013
ider+0.013
ifying+0.013
Neg logit contributions
 it-0.014
 the-0.011
 his-0.011
 fast-0.011
 Tim-0.010
Tokentr
Token ID2213
Feature activation-2.12e-01

Pos logit contributions
int+0.073
 extingu+0.057
oke+0.056
azed+0.051
rots+0.051
Neg logit contributions
 it-0.182
 his-0.152
 the-0.151
 fast-0.151
 he-0.145
Tokenay
Token ID323
Feature activation+3.39e-02

Pos logit contributions
int+0.424
 extingu+0.377
azed+0.333
ider+0.330
oke+0.330
Neg logit contributions
 it-0.549
 his-0.430
 the-0.425
 fast-0.420
 everyone-0.408
Token on
Token ID319
Feature activation+4.15e-02

Pos logit contributions
 it+0.067
 his+0.048
 the+0.047
 fast+0.047
 everyone+0.046
Neg logit contributions
int-0.087
 extingu-0.079
azed-0.073
ifying-0.072
rots-0.072
Token the
Token ID262
Feature activation-4.75e-03

Pos logit contributions
 it+0.040
 fast+0.021
 everyone+0.019
 he+0.016
 slow+0.015
Neg logit contributions
 extingu-0.127
int-0.126
ifying-0.118
ener-0.113
azed-0.113
Token kitchen
Token ID9592
Feature activation-6.73e-02

Pos logit contributions
int+0.012
 extingu+0.010
rots+0.010
oke+0.010
azed+0.010
Neg logit contributions
 it-0.010
 the-0.008
 his-0.008
 fast-0.008
 everyone-0.007
Token table
Token ID3084
Feature activation-5.93e-03

Pos logit contributions
int+0.194
 extingu+0.174
azed+0.165
rots+0.164
oke+0.164
Neg logit contributions
 it-0.121
 his-0.084
 the-0.083
 fast-0.080
 he-0.077
Token She
Token ID1375
Feature activation-1.82e-02

Pos logit contributions
int+0.005
 extingu+0.005
oke+0.005
azed+0.005
rots+0.005
Neg logit contributions
 it-0.003
 fast-0.002
 his-0.002
 everyone-0.002
 the-0.002
Token hid
Token ID24519
Feature activation+1.01e-02

Pos logit contributions
int+0.051
 extingu+0.048
ifying+0.044
azed+0.044
ider+0.044
Neg logit contributions
 it-0.033
 his-0.023
 the-0.023
 fast-0.022
 everyone-0.021
Token behind
Token ID2157
Feature activation-3.17e-02

Pos logit contributions
 it+0.023
 his+0.018
 everyone+0.018
 fast+0.018
 he+0.018
Neg logit contributions
int-0.019
 extingu-0.019
ener-0.016
azed-0.016
ifying-0.016
Token an
Token ID281
Feature activation-5.41e-03

Pos logit contributions
int+0.088
 extingu+0.085
ifying+0.081
azed+0.080
rots+0.079
Neg logit contributions
 it-0.035
 fast-0.022
 he-0.021
 everyone-0.020
 slow-0.020
Token ash
Token ID12530
Feature activation-5.23e-02

Pos logit contributions
int+0.013
 extingu+0.012
azed+0.011
oke+0.011
ider+0.011
Neg logit contributions
 it-0.011
 his-0.008
 the-0.008
 fast-0.008
 everyone-0.008
Tokentr
Token ID2213
Feature activation-2.11e-01

Pos logit contributions
int+0.108
azed+0.087
oke+0.086
aj+0.084
 extingu+0.082
Neg logit contributions
 it-0.141
 the-0.113
 his-0.112
 fast-0.108
 he-0.107
Tokenay
Token ID323
Feature activation+3.68e-02

Pos logit contributions
int+0.406
 extingu+0.358
azed+0.318
oke+0.311
ider+0.310
Neg logit contributions
 it-0.565
 his-0.447
 the-0.443
 fast-0.437
 everyone-0.429
Token.
Token ID13
Feature activation-4.39e-03

Pos logit contributions
 it+0.069
 fast+0.055
 everyone+0.054
 his+0.052
 slow+0.050
Neg logit contributions
int-0.084
 extingu-0.079
ives-0.071
oke-0.070
der-0.069
Token
Token ID198
Feature activation+6.31e-03

Pos logit contributions
int+0.012
 extingu+0.011
azed+0.010
ives+0.010
oke+0.010
Neg logit contributions
 it-0.007
 his-0.005
 fast-0.005
 everyone-0.005
 the-0.005
Token
Token ID198
Feature activation+6.44e-03

Pos logit contributions
 it+0.011
 his+0.008
 everyone+0.008
 he+0.007
 fast+0.007
Neg logit contributions
int-0.015
oke-0.013
ives-0.013
 extingu-0.013
oral-0.012
Token"
Token ID1
Feature activation-2.45e-02

Pos logit contributions
 it+0.012
 his+0.008
 the+0.007
 everyone+0.007
 he+0.007
Neg logit contributions
int-0.017
 extingu-0.015
ider-0.014
azed-0.014
ives-0.014
Token and
Token ID290
Feature activation-1.54e-02

Pos logit contributions
int+0.176
azed+0.155
aj+0.154
 extingu+0.154
oke+0.151
Neg logit contributions
 it-0.080
 the-0.050
 his-0.049
 he-0.045
 fast-0.042
Token accidentally
Token ID14716
Feature activation+2.70e-02

Pos logit contributions
int+0.043
 extingu+0.040
azed+0.036
ifying+0.036
oke+0.036
Neg logit contributions
 it-0.027
 his-0.019
 the-0.018
 fast-0.018
 everyone-0.017
Token dropped
Token ID5710
Feature activation-2.18e-02

Pos logit contributions
 it+0.063
 his+0.048
 the+0.047
 fast+0.045
 everyone+0.044
Neg logit contributions
int-0.064
 extingu-0.056
azed-0.053
oke-0.052
ider-0.052
Token her
Token ID607
Feature activation+6.39e-03

Pos logit contributions
 extingu+0.076
int+0.074
ifying+0.071
ishes+0.068
ener+0.068
Neg logit contributions
 it-0.011
 fast-0.003
 everyone-0.002
 he-0.001
 slow-0.001
Token ash
Token ID12530
Feature activation-5.01e-02

Pos logit contributions
 it+0.014
 the+0.011
 his+0.011
 fast+0.010
 her+0.009
Neg logit contributions
int-0.016
oke-0.015
omed-0.014
rots-0.014
ives-0.014
Tokentr
Token ID2213
Feature activation-2.11e-01

Pos logit contributions
int+0.072
oke+0.061
azed+0.054
aj+0.053
rots+0.053
Neg logit contributions
 it-0.160
 his-0.133
 the-0.132
 fast-0.130
 he-0.126
Tokenay
Token ID323
Feature activation+3.65e-02

Pos logit contributions
int+0.430
 extingu+0.381
azed+0.338
oke+0.336
rots+0.334
Neg logit contributions
 it-0.541
 his-0.422
 the-0.416
 fast-0.409
 he-0.398
Token on
Token ID319
Feature activation+4.61e-02

Pos logit contributions
 it+0.058
 his+0.038
 fast+0.037
 the+0.037
 everyone+0.036
Neg logit contributions
int-0.107
 extingu-0.099
azed-0.092
ifying-0.091
oke-0.090
Token the
Token ID262
Feature activation+4.71e-03

Pos logit contributions
 it+0.035
 fast+0.020
 everyone+0.019
 he+0.017
 slow+0.017
Neg logit contributions
 extingu-0.144
ifying-0.131
int-0.126
 oven-0.123
 department-0.120
Token ground
Token ID2323
Feature activation-1.91e-02

Pos logit contributions
 it+0.010
 his+0.008
 the+0.008
 fast+0.008
 everyone+0.007
Neg logit contributions
int-0.012
 extingu-0.010
azed-0.010
oke-0.009
rots-0.009
Token.
Token ID13
Feature activation-5.77e-03

Pos logit contributions
int+0.055
 extingu+0.052
azed+0.048
ifying+0.048
ider+0.048
Neg logit contributions
 it-0.031
 his-0.020
 everyone-0.020
 the-0.019
 fast-0.019
Token it
Token ID340
Feature activation+1.45e-02

Pos logit contributions
int+0.015
 extingu+0.015
rots+0.014
ifying+0.014
ener+0.014
Neg logit contributions
 it-0.002
 fast-0.001
 his-0.001
 slow-0.000
 everyone-0.000
Token came
Token ID1625
Feature activation-4.19e-02

Pos logit contributions
 it+0.030
 his+0.022
 the+0.022
 fast+0.021
 everyone+0.021
Neg logit contributions
int-0.037
 extingu-0.035
ifying-0.031
azed-0.031
oke-0.031
Token across
Token ID1973
Feature activation+3.77e-03

Pos logit contributions
int+0.116
 extingu+0.106
azed+0.098
oke+0.097
ifying+0.097
Neg logit contributions
 it-0.077
 his-0.054
 the-0.053
 fast-0.052
 everyone-0.050
Token an
Token ID281
Feature activation-1.34e-02

Pos logit contributions
 it+0.005
 his+0.003
 the+0.003
 fast+0.003
 everyone+0.002
Neg logit contributions
int-0.013
 extingu-0.012
azed-0.011
oke-0.011
ider-0.011
Token ash
Token ID12530
Feature activation-5.71e-02

Pos logit contributions
int+0.034
 extingu+0.031
azed+0.028
rots+0.028
ifying+0.028
Neg logit contributions
 it-0.028
 his-0.020
 the-0.020
 fast-0.020
 everyone-0.019
Tokentr
Token ID2213
Feature activation-2.11e-01

Pos logit contributions
int+0.108
oke+0.089
azed+0.088
ishes+0.085
aj+0.085
Neg logit contributions
 it-0.161
 the-0.129
 he-0.129
 his-0.127
 fast-0.126
Tokenay
Token ID323
Feature activation+3.36e-02

Pos logit contributions
int+0.444
 extingu+0.399
ccoli+0.363
urger+0.359
ishes+0.355
Neg logit contributions
 it-0.523
 his-0.402
 the-0.397
 fast-0.392
 he-0.383
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
 it+0.058
 the+0.039
 his+0.039
 fast+0.036
 he+0.036
Neg logit contributions
int-0.099
 extingu-0.091
azed-0.085
rots-0.084
oke-0.084
Token It
Token ID632
Feature activation+1.08e-02

Pos logit contributions
int+0.042
 extingu+0.038
azed+0.036
ifying+0.036
rots+0.035
Neg logit contributions
 it-0.025
 his-0.019
 fast-0.018
 everyone-0.017
 the-0.016
Token wanted
Token ID2227
Feature activation+9.59e-03

Pos logit contributions
 it+0.018
 his+0.012
 the+0.012
 fast+0.012
 everyone+0.011
Neg logit contributions
int-0.032
 extingu-0.029
azed-0.027
oke-0.027
ider-0.027
Token to
Token ID284
Feature activation+2.08e-03

Pos logit contributions
 it+0.013
 fast+0.010
 everyone+0.010
 +0.009
 his+0.009
Neg logit contributions
 extingu-0.023
int-0.023
 department-0.021
azed-0.020
rots-0.020
Token very
Token ID845
Feature activation+4.05e-02

Pos logit contributions
int+0.013
 extingu+0.012
azed+0.011
ider+0.011
aj+0.011
Neg logit contributions
 it-0.008
 his-0.005
 he-0.005
 everyone-0.005
 the-0.005
Token tired
Token ID10032
Feature activation+5.54e-02

Pos logit contributions
 it+0.074
 his+0.051
 the+0.050
 everyone+0.048
 fast+0.047
Neg logit contributions
int-0.112
 extingu-0.101
azed-0.095
ider-0.094
aj-0.093
Token.
Token ID13
Feature activation-6.58e-03

Pos logit contributions
 it+0.094
 his+0.064
 everyone+0.063
 fast+0.062
 the+0.061
Neg logit contributions
int-0.143
 extingu-0.141
ifying-0.125
azed-0.124
 department-0.124
Token The
Token ID383
Feature activation-3.26e-02

Pos logit contributions
int+0.018
 extingu+0.015
azed+0.015
oke+0.015
aj+0.014
Neg logit contributions
 it-0.012
 fast-0.008
 everyone-0.008
 the-0.008
 his-0.008
Token ash
Token ID12530
Feature activation-5.31e-02

Pos logit contributions
int+0.089
 extingu+0.081
rots+0.076
ifying+0.075
azed+0.075
Neg logit contributions
 it-0.064
 the-0.047
 fast-0.047
 his-0.046
 everyone-0.044
Tokentr
Token ID2213
Feature activation-2.10e-01

Pos logit contributions
int+0.154
 extingu+0.143
oke+0.138
ifying+0.133
azed+0.132
Neg logit contributions
 it-0.095
 his-0.064
 the-0.063
 he-0.062
 fast-0.061
Tokenay
Token ID323
Feature activation+3.67e-02

Pos logit contributions
int+0.432
 extingu+0.390
urger+0.337
azed+0.337
ider+0.336
Neg logit contributions
 it-0.537
 his-0.418
 the-0.413
 fast-0.406
 everyone-0.397
Token watched
Token ID7342
Feature activation-7.84e-03

Pos logit contributions
 it+0.061
 the+0.040
 his+0.039
 he+0.036
 fast+0.035
Neg logit contributions
int-0.111
 extingu-0.105
aj-0.097
ifying-0.097
oke-0.096
Token as
Token ID355
Feature activation+9.81e-04

Pos logit contributions
int+0.023
 extingu+0.022
azed+0.020
ener+0.019
 department+0.019
Neg logit contributions
 it-0.010
 fast-0.006
 his-0.006
 he-0.005
 -0.005
Token the
Token ID262
Feature activation+2.56e-03

Pos logit contributions
 it+0.001
 +0.000
 Ellie+0.000
 everyone+0.000
 her+0.000
Neg logit contributions
int-0.003
 extingu-0.003
der-0.003
 department-0.003
azed-0.003
Token little
Token ID1310
Feature activation-3.08e-02

Pos logit contributions
 it+0.005
 the+0.004
 his+0.003
 fast+0.003
 everyone+0.003
Neg logit contributions
int-0.007
 extingu-0.007
ifying-0.006
rots-0.006
ives-0.006
Token.
Token ID13
Feature activation-9.75e-03

Pos logit contributions
int+0.075
 extingu+0.069
azed+0.065
ider+0.064
ifying+0.064
Neg logit contributions
 it-0.037
 his-0.024
 the-0.023
 fast-0.023
 everyone-0.022
Token It
Token ID632
Feature activation+1.66e-02

Pos logit contributions
int+0.028
 extingu+0.026
azed+0.024
rots+0.024
ifying+0.024
Neg logit contributions
 it-0.016
 his-0.011
 fast-0.010
 the-0.010
 everyone-0.010
Token was
Token ID373
Feature activation-5.87e-03

Pos logit contributions
 it+0.029
 his+0.021
 fast+0.020
 the+0.020
 everyone+0.020
Neg logit contributions
int-0.045
 extingu-0.040
azed-0.039
rots-0.038
oke-0.037
Token an
Token ID281
Feature activation-9.63e-03

Pos logit contributions
int+0.016
 extingu+0.016
ifying+0.015
azed+0.015
sed+0.014
Neg logit contributions
 it-0.009
 everyone-0.005
 his-0.005
 he-0.005
 fast-0.005
Token ash
Token ID12530
Feature activation-5.38e-02

Pos logit contributions
int+0.024
 extingu+0.022
azed+0.020
oke+0.020
ider+0.020
Neg logit contributions
 it-0.020
 his-0.015
 the-0.015
 fast-0.014
 everyone-0.014
Tokentr
Token ID2213
Feature activation-2.10e-01

Pos logit contributions
int+0.084
 extingu+0.064
aj+0.063
azed+0.063
oke+0.063
Neg logit contributions
 it-0.170
 the-0.137
 his-0.136
 he-0.136
 fast-0.136
Tokenay
Token ID323
Feature activation+3.65e-02

Pos logit contributions
int+0.431
 extingu+0.386
ccoli+0.338
oke+0.338
azed+0.338
Neg logit contributions
 it-0.534
 his-0.414
 the-0.409
 fast-0.405
 he-0.394
Token,
Token ID11
Feature activation+6.82e-03

Pos logit contributions
 it+0.061
 the+0.039
 his+0.039
 fast+0.036
 he+0.036
Neg logit contributions
int-0.111
 extingu-0.102
azed-0.095
aj-0.095
ishes-0.094
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
 it+0.010
 his+0.007
 everyone+0.007
 fast+0.007
 the+0.007
Neg logit contributions
int-0.019
 extingu-0.018
ifying-0.017
azed-0.016
ider-0.016
Token it
Token ID340
Feature activation+2.25e-02

Pos logit contributions
int+0.046
 extingu+0.046
ifying+0.044
azed+0.043
rots+0.043
Neg logit contributions
 it-0.005
 his-0.001
 fast-0.000
 slow+0.001
 Tim+0.001
Token was
Token ID373
Feature activation-5.57e-03

Pos logit contributions
 it+0.041
 his+0.029
 the+0.028
 fast+0.028
 everyone+0.027
Neg logit contributions
int-0.061
 extingu-0.056
azed-0.052
rots-0.051
ifying-0.051
Token said
Token ID531
Feature activation-2.59e-02

Pos logit contributions
int+0.070
 extingu+0.066
ifying+0.062
ider+0.061
rots+0.060
Neg logit contributions
 it-0.045
 the-0.031
 his-0.030
 he-0.029
 fast-0.028
Token,
Token ID11
Feature activation+9.12e-03

Pos logit contributions
int+0.085
 extingu+0.079
azed+0.074
ifying+0.073
aj+0.073
Neg logit contributions
 it-0.036
 the-0.021
 his-0.021
 fast-0.019
 everyone-0.019
Token
Token ID6184
Feature activation+6.03e-02

Pos logit contributions
 it+0.015
 his+0.011
 fast+0.011
 the+0.010
 everyone+0.010
Neg logit contributions
int-0.025
 extingu-0.023
oke-0.021
azed-0.021
ider-0.021
Token
Token ID95
Feature activation+9.64e-02

Pos logit contributions
 it+0.162
 his+0.128
 the+0.127
 fast+0.126
 everyone+0.123
Neg logit contributions
int-0.116
 extingu-0.102
azed-0.091
oke-0.089
ifying-0.089
Token
Token ID26391
Feature activation+1.71e-02

Pos logit contributions
 it+0.234
 his+0.181
 the+0.178
 fast+0.175
 everyone+0.172
Neg logit contributions
int-0.211
 extingu-0.188
azed-0.170
ider-0.168
oke-0.167
Token
Token ID129
Feature activation+1.09e-01

Pos logit contributions
 it+0.042
 his+0.035
 the+0.033
 everyone+0.032
 fast+0.032
Neg logit contributions
int-0.035
 extingu-0.030
aj-0.029
rots-0.029
azed-0.028
Token
Token ID241
Feature activation+4.90e-02

Pos logit contributions
 it+0.289
 he+0.237
 his+0.235
 everyone+0.229
 the+0.225
Neg logit contributions
int-0.201
 extingu-0.161
aj-0.152
oke-0.151
ider-0.149
TokenDo
Token ID5211
Feature activation+2.16e-02

Pos logit contributions
 it+0.091
 his+0.077
 everyone+0.073
 the+0.071
 Tim+0.066
Neg logit contributions
int-0.110
 extingu-0.098
ider-0.097
ifying-0.096
ives-0.091
Token you
Token ID345
Feature activation-3.87e-02

Pos logit contributions
 it+0.035
 his+0.025
 fast+0.023
 the+0.022
 everyone+0.021
Neg logit contributions
int-0.059
oke-0.051
omed-0.050
aj-0.050
uddled-0.050
Token need
Token ID761
Feature activation+1.06e-02

Pos logit contributions
int+0.103
 extingu+0.094
azed+0.087
oke+0.085
ifying+0.085
Neg logit contributions
 it-0.076
 his-0.054
 the-0.053
 fast-0.052
 everyone-0.051
Token to
Token ID284
Feature activation-3.45e-03

Pos logit contributions
 it+0.017
 his+0.012
 fast+0.012
 everyone+0.011
 Tim+0.011
Neg logit contributions
int-0.028
 extingu-0.025
aj-0.024
ider-0.024
azed-0.024
Token said
Token ID531
Feature activation-2.40e-02

Pos logit contributions
int+0.059
 extingu+0.055
ifying+0.052
ider+0.050
azed+0.050
Neg logit contributions
 it-0.040
 the-0.028
 his-0.027
 fast-0.026
 everyone-0.026
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
int+0.073
 extingu+0.067
azed+0.064
oke+0.063
ider+0.063
Neg logit contributions
 it-0.035
 his-0.022
 fast-0.022
 the-0.021
 everyone-0.021
Token
Token ID6184
Feature activation+5.93e-02

Pos logit contributions
 it+0.019
 his+0.014
 fast+0.013
 the+0.013
 everyone+0.013
Neg logit contributions
int-0.031
 extingu-0.029
oke-0.027
azed-0.027
ider-0.027
Token
Token ID95
Feature activation+9.57e-02

Pos logit contributions
 it+0.156
 his+0.128
 everyone+0.123
 the+0.123
 fast+0.120
Neg logit contributions
int-0.117
 extingu-0.100
ider-0.091
oke-0.089
rots-0.086
Token
Token ID26391
Feature activation+1.56e-02

Pos logit contributions
 it+0.229
 his+0.182
 everyone+0.175
 the+0.174
 fast+0.172
Neg logit contributions
int-0.212
 extingu-0.180
ider-0.168
oke-0.167
ives-0.163
Token
Token ID129
Feature activation+1.12e-01

Pos logit contributions
 it+0.038
 his+0.034
 everyone+0.032
 fast+0.031
 the+0.031
Neg logit contributions
int-0.029
aj-0.025
rots-0.024
oke-0.024
 extingu-0.023
Token
Token ID241
Feature activation+5.25e-02

Pos logit contributions
 it+0.291
 he+0.241
 everyone+0.237
 his+0.235
 the+0.228
Neg logit contributions
int-0.201
 extingu-0.149
oke-0.147
aj-0.143
ider-0.139
TokenOh
Token ID5812
Feature activation+7.23e-03

Pos logit contributions
 it+0.092
 everyone+0.076
 the+0.073
 his+0.073
 Ellie+0.069
Neg logit contributions
int-0.123
ider-0.106
 extingu-0.105
oke-0.102
ives-0.100
Token no
Token ID645
Feature activation+2.37e-02

Pos logit contributions
 it+0.010
 fast+0.008
 his+0.008
 everyone+0.007
 slow+0.007
Neg logit contributions
int-0.019
oke-0.017
azed-0.016
 extingu-0.016
omed-0.016
Token!
Token ID0
Feature activation+1.13e-02

Pos logit contributions
 it+0.041
 his+0.029
 everyone+0.028
 the+0.027
 fast+0.027
Neg logit contributions
int-0.066
 extingu-0.059
oke-0.057
ider-0.055
azed-0.055
Token You
Token ID921
Feature activation-2.49e-02

Pos logit contributions
 it+0.023
 his+0.017
 fast+0.016
 the+0.016
 everyone+0.016
Neg logit contributions
int-0.029
 extingu-0.026
oke-0.024
azed-0.024
ifying-0.024
Token.
Token ID13
Feature activation-3.12e-03

Pos logit contributions
 it+0.021
 his+0.013
 the+0.013
 fast+0.013
 everyone+0.012
Neg logit contributions
int-0.041
 extingu-0.038
azed-0.035
oke-0.035
rots-0.035
Token
Token ID198
Feature activation+9.54e-03

Pos logit contributions
int+0.009
 extingu+0.008
oke+0.008
azed+0.008
aj+0.008
Neg logit contributions
 it-0.004
 fast-0.003
 his-0.003
 everyone-0.003
 the-0.003
Token
Token ID198
Feature activation+9.21e-03

Pos logit contributions
 it+0.015
 his+0.011
 everyone+0.011
 fast+0.011
 the+0.011
Neg logit contributions
int-0.024
oke-0.021
 extingu-0.020
oral-0.020
aj-0.019
Tokenâ
Token ID22940
Feature activation+1.79e-02

Pos logit contributions
 it+0.019
 his+0.014
 the+0.013
 fast+0.013
 everyone+0.013
Neg logit contributions
int-0.023
 extingu-0.020
azed-0.019
oke-0.019
rots-0.019
Token
Token ID26391
Feature activation+1.99e-02

Pos logit contributions
 it+0.040
 his+0.030
 the+0.029
 fast+0.029
 everyone+0.028
Neg logit contributions
int-0.043
 extingu-0.038
azed-0.035
ider-0.035
oke-0.034
Token
Token ID129
Feature activation+1.14e-01

Pos logit contributions
 it+0.043
 his+0.033
 the+0.032
 fast+0.032
 everyone+0.032
Neg logit contributions
int-0.047
 extingu-0.042
aj-0.039
azed-0.039
oke-0.039
Token
Token ID241
Feature activation+5.12e-02

Pos logit contributions
 it+0.297
 he+0.234
 his+0.233
 everyone+0.233
 the+0.227
Neg logit contributions
int-0.206
 extingu-0.169
aj-0.156
oke-0.151
ider-0.150
TokenLook
Token ID8567
Feature activation-2.25e-02

Pos logit contributions
 it+0.103
 everyone+0.083
 the+0.082
 his+0.080
 Tim+0.074
Neg logit contributions
int-0.115
 extingu-0.104
ifying-0.097
ider-0.094
ocado-0.093
Token,
Token ID11
Feature activation+9.54e-03

Pos logit contributions
int+0.055
 extingu+0.049
azed+0.046
aj+0.044
rots+0.044
Neg logit contributions
 it-0.047
 his-0.036
 fast-0.035
 the-0.035
 everyone-0.034
Token a
Token ID257
Feature activation-1.56e-03

Pos logit contributions
 it+0.014
 fast+0.011
 his+0.010
 slow+0.009
 everyone+0.009
Neg logit contributions
int-0.026
 extingu-0.024
azed-0.022
ccoli-0.022
oke-0.022
Token big
Token ID1263
Feature activation-2.33e-02

Pos logit contributions
int+0.004
 extingu+0.004
azed+0.003
ifying+0.003
oke+0.003
Neg logit contributions
 it-0.003
 his-0.002
 the-0.002
 fast-0.002
 everyone-0.002
Token with
Token ID351
Feature activation+7.48e-03

Pos logit contributions
int+0.003
 extingu+0.003
azed+0.002
ifying+0.002
ider+0.002
Neg logit contributions
 it-0.001
 his-0.001
 everyone-0.001
 fast-0.001
 the-0.001
Token dirt
Token ID13647
Feature activation-1.53e-02

Pos logit contributions
 it+0.012
 his+0.007
 the+0.007
 fast+0.007
 everyone+0.007
Neg logit contributions
int-0.022
 extingu-0.021
azed-0.019
ifying-0.019
rots-0.019
Token.
Token ID13
Feature activation-9.61e-03

Pos logit contributions
int+0.051
 extingu+0.046
oke+0.044
azed+0.044
aj+0.044
Neg logit contributions
 it-0.022
 the-0.013
 his-0.013
 fast-0.012
 he-0.012
Token
Token ID198
Feature activation+4.22e-04

Pos logit contributions
int+0.028
 extingu+0.026
aj+0.024
oke+0.024
ifying+0.024
Neg logit contributions
 it-0.014
 fast-0.009
 his-0.009
 everyone-0.009
 the-0.009
Token
Token ID198
Feature activation+1.30e-03

Pos logit contributions
 it+0.001
 his+0.000
 everyone+0.000
 the+0.000
 fast+0.000
Neg logit contributions
int-0.001
 extingu-0.001
oke-0.001
aj-0.001
azed-0.001
TokenT
Token ID51
Feature activation+1.27e-01

Pos logit contributions
 it+0.003
 his+0.002
 the+0.002
 everyone+0.002
 fast+0.002
Neg logit contributions
int-0.003
 extingu-0.003
ifying-0.003
azed-0.003
ider-0.003
Tokenina
Token ID1437
Feature activation+3.31e-02

Pos logit contributions
 it+0.312
 his+0.246
 the+0.238
 fast+0.234
 everyone+0.230
Neg logit contributions
int-0.260
 extingu-0.239
ifying-0.221
ider-0.217
azed-0.211
Token observed
Token ID6515
Feature activation-4.54e-03

Pos logit contributions
 it+0.058
 the+0.039
 his+0.035
 everyone+0.034
 fast+0.034
Neg logit contributions
int-0.098
 extingu-0.095
arl-0.089
ifying-0.088
aj-0.086
Token.
Token ID13
Feature activation-7.83e-03

Pos logit contributions
int+0.016
 extingu+0.015
azed+0.015
rots+0.014
ishes+0.014
Neg logit contributions
 it-0.004
 fast-0.001
 his-0.001
 everyone-0.001
 the-0.001
Token She
Token ID1375
Feature activation-2.47e-02

Pos logit contributions
int+0.023
 extingu+0.022
azed+0.020
rots+0.020
oke+0.020
Neg logit contributions
 it-0.012
 his-0.008
 the-0.008
 fast-0.008
 everyone-0.007
Token watched
Token ID7342
Feature activation-8.18e-03

Pos logit contributions
int+0.069
 extingu+0.066
ifying+0.061
ider+0.060
oke+0.060
Neg logit contributions
 it-0.046
 the-0.032
 his-0.031
 fast-0.030
 everyone-0.029
Token He
Token ID679
Feature activation-2.13e-02

Pos logit contributions
int+0.011
 extingu+0.010
urger+0.009
ccoli+0.009
aj+0.009
Neg logit contributions
 it-0.007
 everyone-0.005
 fast-0.005
 the-0.004
 his-0.004
Token was
Token ID373
Feature activation-1.79e-03

Pos logit contributions
int+0.049
 extingu+0.042
ocado+0.040
ccoli+0.040
ider+0.039
Neg logit contributions
 it-0.045
 everyone-0.033
 the-0.032
 his-0.031
 fast-0.031
Token three
Token ID1115
Feature activation+2.61e-02

Pos logit contributions
int+0.005
 extingu+0.005
azed+0.004
sed+0.004
ider+0.004
Neg logit contributions
 it-0.003
 everyone-0.002
 his-0.002
 he-0.001
 fast-0.001
Token-
Token ID12
Feature activation+1.02e-01

Pos logit contributions
 it+0.057
 his+0.043
 the+0.042
 fast+0.041
 everyone+0.040
Neg logit contributions
int-0.063
 extingu-0.057
azed-0.052
oke-0.051
ifying-0.051
Tokenyears
Token ID19002
Feature activation+2.71e-02

Pos logit contributions
 it+0.306
 the+0.248
 his+0.246
 fast+0.241
 everyone+0.240
Neg logit contributions
int-0.165
 extingu-0.134
oke-0.119
azed-0.119
ider-0.118
Token-
Token ID12
Feature activation+1.02e-01

Pos logit contributions
 it+0.064
 his+0.050
 the+0.049
 fast+0.049
 everyone+0.047
Neg logit contributions
int-0.059
 extingu-0.054
ifying-0.049
azed-0.048
ider-0.048
Tokenold
Token ID727
Feature activation+1.97e-02

Pos logit contributions
 it+0.326
 the+0.267
 his+0.254
 everyone+0.254
 fast+0.251
Neg logit contributions
int-0.142
ives-0.106
oke-0.106
ocado-0.099
ari-0.097
Token and
Token ID290
Feature activation-1.43e-02

Pos logit contributions
 it+0.038
 everyone+0.027
 fast+0.027
 his+0.025
 the+0.025
Neg logit contributions
int-0.047
 extingu-0.045
ener-0.042
oke-0.041
urger-0.041
Token very
Token ID845
Feature activation+4.25e-02

Pos logit contributions
int+0.023
ccoli+0.021
ocado+0.021
uddled+0.020
led+0.020
Neg logit contributions
 it-0.031
 fast-0.022
 the-0.022
 everyone-0.021
 slow-0.021
Token curious
Token ID11040
Feature activation+5.07e-03

Pos logit contributions
 it+0.089
 the+0.063
 his+0.061
 everyone+0.060
 fast+0.056
Neg logit contributions
int-0.104
 extingu-0.088
ider-0.087
oke-0.087
azed-0.085
Token.
Token ID13
Feature activation-4.44e-03

Pos logit contributions
 it+0.009
 fast+0.006
 his+0.006
 the+0.006
 everyone+0.006
Neg logit contributions
int-0.014
 extingu-0.013
azed-0.012
oke-0.012
ifying-0.011
Token that
Token ID326
Feature activation-8.24e-03

Pos logit contributions
int+0.085
 extingu+0.078
azed+0.073
oke+0.072
rots+0.072
Neg logit contributions
 it-0.044
 his-0.029
 the-0.027
 fast-0.027
 everyone-0.026
Token the
Token ID262
Feature activation-7.04e-04

Pos logit contributions
int+0.026
 extingu+0.025
oke+0.024
azed+0.024
ccoli+0.024
Neg logit contributions
 it-0.007
 fast-0.004
 his-0.004
 slow-0.003
 he-0.003
Token little
Token ID1310
Feature activation-3.29e-02

Pos logit contributions
int+0.002
 extingu+0.002
ifying+0.002
ccoli+0.002
ishes+0.002
Neg logit contributions
 it-0.001
 the-0.001
 fast-0.001
 everyone-0.001
 slow-0.001
Token girl
Token ID2576
Feature activation+5.33e-03

Pos logit contributions
int+0.084
 extingu+0.075
ifying+0.074
urger+0.073
ccoli+0.072
Neg logit contributions
 it-0.073
 the-0.057
 fast-0.052
 her-0.051
 his-0.050
Token should
Token ID815
Feature activation+6.00e-02

Pos logit contributions
 it+0.010
 his+0.006
 the+0.006
 fast+0.006
 everyone+0.006
Neg logit contributions
int-0.015
 extingu-0.014
ifying-0.013
azed-0.013
aj-0.013
Token always
Token ID1464
Feature activation-7.23e-03

Pos logit contributions
 it+0.095
 his+0.064
 fast+0.063
 the+0.061
 everyone+0.058
Neg logit contributions
int-0.170
 extingu-0.156
azed-0.143
ider-0.142
aj-0.141
Token remember
Token ID3505
Feature activation-3.70e-02

Pos logit contributions
int+0.020
 extingu+0.019
azed+0.017
ider+0.017
aj+0.017
Neg logit contributions
 it-0.013
 his-0.009
 the-0.009
 fast-0.008
 everyone-0.008
Token to
Token ID284
Feature activation-4.07e-03

Pos logit contributions
int+0.121
 extingu+0.115
azed+0.107
aj+0.106
rots+0.106
Neg logit contributions
 it-0.041
 fast-0.022
 his-0.021
 he-0.018
 everyone-0.018
Token find
Token ID1064
Feature activation+1.16e-02

Pos logit contributions
int+0.011
 extingu+0.010
azed+0.009
ider+0.009
aj+0.009
Neg logit contributions
 it-0.007
 his-0.005
 fast-0.005
 the-0.005
 he-0.005
Token a
Token ID257
Feature activation-6.83e-03

Pos logit contributions
 it+0.018
 his+0.011
 the+0.011
 fast+0.011
 everyone+0.010
Neg logit contributions
int-0.036
 extingu-0.033
azed-0.031
oke-0.031
ider-0.031
Token safe
Token ID3338
Feature activation-2.92e-03

Pos logit contributions
int+0.021
 extingu+0.019
azed+0.018
oke+0.017
rots+0.017
Neg logit contributions
 it-0.011
 his-0.007
 the-0.007
 fast-0.007
 everyone-0.007
Token was
Token ID373
Feature activation-1.39e-02

Pos logit contributions
int+0.053
 extingu+0.050
oke+0.048
azed+0.048
rots+0.047
Neg logit contributions
 it-0.022
 fast-0.015
 his-0.014
 slow-0.012
 the-0.012
Token over
Token ID625
Feature activation-2.87e-02

Pos logit contributions
int+0.042
 extingu+0.039
azed+0.036
oke+0.036
ifying+0.036
Neg logit contributions
 it-0.020
 his-0.012
 fast-0.012
 the-0.011
 he-0.011
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
int+0.087
 extingu+0.080
azed+0.074
ider+0.074
oke+0.074
Neg logit contributions
 it-0.048
 the-0.032
 his-0.032
 fast-0.030
 everyone-0.029
Token
Token ID220
Feature activation+2.32e-02

Pos logit contributions
int+0.033
 extingu+0.030
ifying+0.029
azed+0.028
oke+0.028
Neg logit contributions
 it-0.021
 fast-0.015
 his-0.014
 the-0.014
 everyone-0.013
Token
Token ID198
Feature activation-3.44e-03

Pos logit contributions
 it+0.029
 his+0.016
 the+0.015
 fast+0.015
 everyone+0.014
Neg logit contributions
int-0.078
 extingu-0.073
azed-0.068
oke-0.068
rots-0.067
Token
Token ID198
Feature activation-2.47e-03

Pos logit contributions
int+0.009
 extingu+0.008
oke+0.008
aj+0.008
uddled+0.008
Neg logit contributions
 it-0.006
 fast-0.004
 his-0.004
 the-0.004
 everyone-0.003
TokenMom
Token ID29252
Feature activation-2.35e-02

Pos logit contributions
int+0.006
 extingu+0.006
ifying+0.005
oral+0.005
rots+0.005
Neg logit contributions
 it-0.005
 his-0.003
 the-0.003
 fast-0.003
 everyone-0.003
Tokenmy
Token ID1820
Feature activation-1.05e-02

Pos logit contributions
int+0.066
 extingu+0.062
ider+0.058
ifying+0.058
rots+0.056
Neg logit contributions
 it-0.042
 the-0.030
 his-0.030
 fast-0.027
 he-0.027
Token was
Token ID373
Feature activation-1.36e-02

Pos logit contributions
int+0.030
 extingu+0.028
ifying+0.027
ider+0.027
arl+0.026
Neg logit contributions
 it-0.019
 the-0.014
 his-0.013
 he-0.012
 everyone-0.012
Token proud
Token ID6613
Feature activation-1.41e-02

Pos logit contributions
int+0.043
 extingu+0.040
azed+0.037
rots+0.037
oke+0.037
Neg logit contributions
 it-0.020
 his-0.012
 the-0.012
 fast-0.012
 everyone-0.011
Token of
Token ID286
Feature activation-4.59e-04

Pos logit contributions
int+0.035
 extingu+0.032
azed+0.030
oke+0.029
ider+0.029
Neg logit contributions
 it-0.029
 his-0.021
 fast-0.021
 the-0.020
 everyone-0.020
Token him
Token ID683
Feature activation+1.35e-02

Pos logit contributions
 it+0.003
 fast+0.002
 slow+0.001
 everyone+0.001
 he+0.001
Neg logit contributions
int-0.010
 extingu-0.010
azed-0.010
ener-0.009
urger-0.009
Token.
Token ID13
Feature activation-5.18e-03

Pos logit contributions
 it+0.022
 his+0.015
 everyone+0.015
 the+0.014
 fast+0.014
Neg logit contributions
int-0.038
 extingu-0.035
azed-0.033
ifying-0.032
ider-0.032
Token The
Token ID383
Feature activation-3.14e-02

Pos logit contributions
int+0.014
 extingu+0.013
azed+0.012
oke+0.012
ives+0.012
Neg logit contributions
 it-0.009
 fast-0.006
 his-0.006
 everyone-0.006
 her-0.005
Token funny
Token ID8258
Feature activation-1.01e-02

Pos logit contributions
int+0.080
 extingu+0.072
aj+0.066
oke+0.066
azed+0.066
Neg logit contributions
 it-0.069
 the-0.051
 his-0.051
 fast-0.050
 everyone-0.049
Token thing
Token ID1517
Feature activation+8.70e-03

Pos logit contributions
int+0.024
 extingu+0.022
azed+0.020
oke+0.020
ider+0.020
Neg logit contributions
 it-0.022
 his-0.016
 the-0.016
 fast-0.016
 everyone-0.016
Token was
Token ID373
Feature activation-2.52e-03

Pos logit contributions
 it+0.015
 his+0.010
 the+0.010
 fast+0.010
 everyone+0.010
Neg logit contributions
int-0.025
 extingu-0.023
azed-0.021
aj-0.021
ider-0.021
Token that
Token ID326
Feature activation-5.66e-03

Pos logit contributions
int+0.007
 extingu+0.007
azed+0.006
ifying+0.006
oke+0.006
Neg logit contributions
 it-0.004
 his-0.003
 the-0.003
 fast-0.003
 everyone-0.003
Token the
Token ID262
Feature activation+2.64e-03

Pos logit contributions
int+0.018
 extingu+0.017
azed+0.016
 department+0.016
rots+0.016
Neg logit contributions
 it-0.005
 fast-0.003
 his-0.002
 the-0.002
 everyone-0.002
Token boat
Token ID8848
Feature activation+6.11e-03

Pos logit contributions
 it+0.006
 his+0.004
 the+0.004
 fast+0.004
 everyone+0.004
Neg logit contributions
int-0.007
 extingu-0.006
azed-0.005
oke-0.005
rots-0.005
Token was
Token ID373
Feature activation-3.11e-03

Pos logit contributions
 it+0.011
 the+0.007
 his+0.007
 fast+0.007
 he+0.007
Neg logit contributions
int-0.018
 extingu-0.016
aj-0.015
azed-0.015
ider-0.015
Token not
Token ID407
Feature activation+9.03e-03

Pos logit contributions
int+0.009
 extingu+0.008
azed+0.007
ifying+0.007
oke+0.007
Neg logit contributions
 it-0.005
 his-0.004
 the-0.003
 everyone-0.003
 fast-0.003
Token home
Token ID1363
Feature activation+4.45e-02

Pos logit contributions
 it+0.039
 he+0.022
 fast+0.020
 his+0.019
 everyone+0.017
Neg logit contributions
 extingu-0.113
int-0.110
ishes-0.101
aj-0.099
ccoli-0.098
Token.
Token ID13
Feature activation+1.19e-03

Pos logit contributions
 it+0.070
 his+0.045
 fast+0.043
 the+0.043
 everyone+0.042
Neg logit contributions
int-0.132
 extingu-0.124
azed-0.115
ider-0.114
oke-0.113
Token It
Token ID632
Feature activation+2.14e-02

Pos logit contributions
 it+0.002
 his+0.001
 fast+0.001
 he+0.001
 everyone+0.001
Neg logit contributions
int-0.003
 extingu-0.003
ifying-0.003
aj-0.003
oke-0.003
Token was
Token ID373
Feature activation+1.13e-03

Pos logit contributions
 it+0.034
 his+0.022
 the+0.021
 fast+0.020
 everyone+0.020
Neg logit contributions
int-0.066
 extingu-0.061
azed-0.057
oke-0.056
ifying-0.056
Token red
Token ID2266
Feature activation+2.08e-02

Pos logit contributions
 it+0.002
 his+0.001
 everyone+0.001
 fast+0.001
 he+0.001
Neg logit contributions
int-0.003
 extingu-0.003
ifying-0.003
aj-0.003
azed-0.003
Token and
Token ID290
Feature activation-1.66e-03

Pos logit contributions
 it+0.033
 the+0.021
 his+0.021
 fast+0.020
 he+0.019
Neg logit contributions
int-0.065
 extingu-0.058
oke-0.056
azed-0.056
rots-0.055
Token sweet
Token ID6029
Feature activation-3.11e-02

Pos logit contributions
 extingu+0.003
int+0.003
ccoli+0.003
ifying+0.003
sed+0.003
Neg logit contributions
 it-0.003
 his-0.002
 everyone-0.002
 fast-0.002
 Tim-0.002
Token.
Token ID13
Feature activation+1.55e-03

Pos logit contributions
int+0.101
 extingu+0.093
aj+0.087
azed+0.087
ifying+0.087
Neg logit contributions
 it-0.046
 the-0.028
 his-0.027
 fast-0.025
 everyone-0.024
Token They
Token ID1119
Feature activation-3.03e-03

Pos logit contributions
 it+0.002
 his+0.001
 fast+0.001
 he+0.001
 the+0.001
Neg logit contributions
int-0.005
 extingu-0.005
ifying-0.004
oke-0.004
aj-0.004
Token were
Token ID547
Feature activation+1.60e-02

Pos logit contributions
int+0.008
 extingu+0.007
azed+0.007
ider+0.007
aj+0.007
Neg logit contributions
 it-0.006
 the-0.004
 his-0.004
 fast-0.004
 everyone-0.004
Token so
Token ID523
Feature activation+1.80e-02

Pos logit contributions
 it+0.024
 his+0.015
 he+0.015
 fast+0.015
 everyone+0.014
Neg logit contributions
int-0.047
 extingu-0.043
azed-0.040
aj-0.040
ishes-0.040
Token are
Token ID389
Feature activation-4.85e-03

Pos logit contributions
int+0.013
 extingu+0.013
ider+0.012
ifying+0.012
aj+0.012
Neg logit contributions
 it-0.008
 the-0.006
 his-0.006
 her-0.005
 he-0.005
Token twins
Token ID20345
Feature activation+2.78e-02

Pos logit contributions
int+0.015
 extingu+0.014
ifying+0.013
ener+0.013
rots+0.013
Neg logit contributions
 it-0.008
 the-0.006
 his-0.005
 fast-0.005
 everyone-0.005
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
 it+0.043
 his+0.028
 the+0.027
 fast+0.027
 everyone+0.026
Neg logit contributions
int-0.083
 extingu-0.077
azed-0.072
oke-0.071
ider-0.071
Token They
Token ID1119
Feature activation-3.35e-03

Pos logit contributions
 it+0.017
 his+0.011
 the+0.011
 fast+0.011
 everyone+0.010
Neg logit contributions
int-0.030
 extingu-0.028
azed-0.026
rots-0.025
oke-0.025
Token like
Token ID588
Feature activation+9.43e-03

Pos logit contributions
int+0.010
 extingu+0.009
azed+0.008
rots+0.008
oke+0.008
Neg logit contributions
 it-0.006
 the-0.004
 his-0.004
 fast-0.004
 everyone-0.004
Token to
Token ID284
Feature activation+1.03e-02

Pos logit contributions
 it+0.015
 his+0.009
 fast+0.009
 everyone+0.009
 the+0.009
Neg logit contributions
int-0.026
 extingu-0.025
 department-0.023
ives-0.023
ccoli-0.022
Token play
Token ID711
Feature activation+7.76e-02

Pos logit contributions
 it+0.015
 everyone+0.008
 fast+0.008
 his+0.008
 the+0.008
Neg logit contributions
int-0.032
 extingu-0.030
ider-0.027
aj-0.027
ifying-0.027
Token in
Token ID287
Feature activation+5.33e-03

Pos logit contributions
 it+0.139
 his+0.098
 the+0.096
 fast+0.091
 everyone+0.087
Neg logit contributions
int-0.226
 extingu-0.208
azed-0.196
oke-0.196
ifying-0.194
Token the
Token ID262
Feature activation+1.61e-02

Pos logit contributions
 it+0.006
 he+0.004
 everyone+0.004
 hardly+0.003
 everybody+0.003
Neg logit contributions
 extingu-0.015
aj-0.012
 bowls-0.012
ifying-0.012
ishes-0.012
Token snow
Token ID6729
Feature activation+3.98e-02

Pos logit contributions
 it+0.034
 his+0.024
 the+0.024
 fast+0.024
 everyone+0.023
Neg logit contributions
int-0.040
 extingu-0.037
azed-0.033
ifying-0.033
aj-0.033
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
 it+0.065
 his+0.042
 the+0.041
 fast+0.040
 everyone+0.039
Neg logit contributions
int-0.118
 extingu-0.109
azed-0.101
ider-0.100
rots-0.100
Token was
Token ID373
Feature activation-2.63e-03

Pos logit contributions
int+0.010
 extingu+0.010
ifying+0.009
ider+0.009
aj+0.009
Neg logit contributions
 it-0.006
 the-0.004
 his-0.004
 he-0.004
 fast-0.004
Token bigger
Token ID5749
Feature activation+1.82e-02

Pos logit contributions
int+0.008
 extingu+0.007
azed+0.007
ider+0.007
ifying+0.007
Neg logit contributions
 it-0.004
 his-0.002
 the-0.002
 fast-0.002
 everyone-0.002
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
 it+0.035
 his+0.025
 the+0.025
 fast+0.024
 everyone+0.023
Neg logit contributions
int-0.049
 extingu-0.045
azed-0.041
ifying-0.041
oke-0.041
Token he
Token ID339
Feature activation+1.98e-02

Pos logit contributions
int+0.030
 extingu+0.028
 department+0.026
azed+0.026
ener+0.025
Neg logit contributions
 it-0.025
 his-0.018
 everyone-0.017
 the-0.017
 fast-0.017
Token could
Token ID714
Feature activation+5.10e-02

Pos logit contributions
 it+0.038
 his+0.027
 the+0.026
 fast+0.026
 everyone+0.025
Neg logit contributions
int-0.053
 extingu-0.048
azed-0.045
oke-0.044
rots-0.044
Token reach
Token ID3151
Feature activation-3.21e-02

Pos logit contributions
 it+0.083
 his+0.054
 fast+0.053
 the+0.053
 everyone+0.052
Neg logit contributions
int-0.146
 extingu-0.134
azed-0.123
ider-0.122
ifying-0.120
Token it
Token ID340
Feature activation+2.61e-02

Pos logit contributions
int+0.100
 extingu+0.099
ener+0.091
ishes+0.090
azed+0.089
Neg logit contributions
 it-0.025
 fast-0.012
 everyone-0.010
 slow-0.009
 -0.009
Token first
Token ID717
Feature activation-3.35e-02

Pos logit contributions
 it+0.043
 his+0.028
 everyone+0.027
 the+0.027
 fast+0.027
Neg logit contributions
int-0.075
 extingu-0.069
azed-0.065
ider-0.063
ifying-0.062
Token.
Token ID13
Feature activation-5.16e-03

Pos logit contributions
int+0.094
 extingu+0.086
azed+0.082
oke+0.079
ider+0.079
Neg logit contributions
 it-0.054
 fast-0.036
 his-0.036
 everyone-0.035
 the-0.035
Token He
Token ID679
Feature activation-1.80e-02

Pos logit contributions
int+0.014
 extingu+0.013
oke+0.012
azed+0.012
ifying+0.012
Neg logit contributions
 it-0.009
 fast-0.006
 his-0.006
 the-0.006
 everyone-0.006
Token was
Token ID373
Feature activation-2.03e-03

Pos logit contributions
int+0.049
 extingu+0.045
azed+0.041
ifying+0.041
oke+0.041
Neg logit contributions
 it-0.034
 his-0.024
 the-0.024
 fast-0.023
 everyone-0.023
Token It
Token ID632
Feature activation+1.41e-02

Pos logit contributions
int+0.006
 extingu+0.005
azed+0.005
ifying+0.005
oke+0.005
Neg logit contributions
 it-0.003
 his-0.002
 the-0.002
 fast-0.002
 everyone-0.002
Token was
Token ID373
Feature activation-6.99e-03

Pos logit contributions
 it+0.024
 his+0.016
 the+0.016
 fast+0.016
 everyone+0.015
Neg logit contributions
int-0.041
 extingu-0.038
azed-0.035
oke-0.035
ider-0.035
Token a
Token ID257
Feature activation-2.79e-03

Pos logit contributions
int+0.021
 extingu+0.020
ifying+0.019
azed+0.019
aj+0.019
Neg logit contributions
 it-0.010
 his-0.006
 fast-0.006
 everyone-0.006
 the-0.005
Token big
Token ID1263
Feature activation-2.50e-02

Pos logit contributions
int+0.008
 extingu+0.007
azed+0.007
ifying+0.007
oke+0.007
Neg logit contributions
 it-0.005
 his-0.003
 the-0.003
 fast-0.003
 everyone-0.003
Token,
Token ID11
Feature activation+1.29e-02

Pos logit contributions
int+0.069
 extingu+0.063
azed+0.058
oke+0.057
aj+0.057
Neg logit contributions
 it-0.046
 his-0.032
 the-0.031
 fast-0.030
 everyone-0.030
Token happy
Token ID3772
Feature activation-1.77e-02

Pos logit contributions
 it+0.023
 everyone+0.016
 his+0.016
 the+0.016
 fast+0.015
Neg logit contributions
int-0.034
 extingu-0.031
aj-0.028
ifying-0.028
azed-0.028
Token yawn
Token ID40908
Feature activation+3.99e-02

Pos logit contributions
int+0.048
 extingu+0.044
azed+0.041
oke+0.040
ider+0.040
Neg logit contributions
 it-0.033
 his-0.023
 the-0.023
 fast-0.022
 everyone-0.022
Token.
Token ID13
Feature activation-2.57e-03

Pos logit contributions
 it+0.083
 his+0.060
 the+0.060
 fast+0.059
 everyone+0.057
Neg logit contributions
int-0.102
 extingu-0.093
azed-0.085
aj-0.084
rots-0.084
Token
Token ID220
Feature activation+3.18e-02

Pos logit contributions
int+0.007
 extingu+0.007
ifying+0.006
aj+0.006
azed+0.006
Neg logit contributions
 it-0.004
 fast-0.003
 his-0.003
 the-0.003
 everyone-0.003
Token
Token ID198
Feature activation+9.09e-03

Pos logit contributions
 it+0.044
 his+0.028
 fast+0.027
 the+0.026
 everyone+0.025
Neg logit contributions
int-0.097
 extingu-0.090
aj-0.086
oke-0.085
azed-0.084
Token
Token ID198
Feature activation+9.14e-03

Pos logit contributions
 it+0.015
 his+0.012
 everyone+0.011
 he+0.010
 fast+0.010
Neg logit contributions
int-0.021
oke-0.019
 extingu-0.019
aj-0.018
omed-0.018
Token are
Token ID389
Feature activation-7.79e-03

Pos logit contributions
 it+0.013
 his+0.009
 the+0.009
 fast+0.009
 everyone+0.009
Neg logit contributions
int-0.016
 extingu-0.015
azed-0.013
ifying-0.013
oke-0.013
Token friends
Token ID2460
Feature activation+3.27e-02

Pos logit contributions
int+0.021
 extingu+0.019
azed+0.019
ider+0.018
aj+0.018
Neg logit contributions
 it-0.013
 his-0.008
 he-0.008
 everyone-0.008
 the-0.008
Token.
Token ID13
Feature activation+2.28e-02

Pos logit contributions
 it+0.056
 his+0.040
 everyone+0.039
 fast+0.037
 the+0.037
Neg logit contributions
int-0.085
 extingu-0.078
azed-0.077
ider-0.075
rots-0.074
Token<|endoftext|>
Token ID50256
Feature activation-6.46e-02

Pos logit contributions
 it+0.034
 his+0.022
 the+0.021
 fast+0.021
 everyone+0.020
Neg logit contributions
int-0.070
 extingu-0.065
azed-0.060
ifying-0.060
oke-0.060
TokenL
Token ID43
Feature activation+3.89e-02

Pos logit contributions
int+0.112
 extingu+0.098
azed+0.085
ider+0.083
oke+0.082
Neg logit contributions
 it-0.186
 his-0.150
 the-0.149
 fast-0.146
 everyone-0.144
Tokenily
Token ID813
Feature activation-1.49e-02

Pos logit contributions
 it+0.099
 his+0.077
 the+0.077
 fast+0.074
 everyone+0.073
Neg logit contributions
int-0.082
 extingu-0.073
azed-0.065
ifying-0.065
ider-0.064
Token and
Token ID290
Feature activation-2.26e-04

Pos logit contributions
int+0.035
 extingu+0.032
ifying+0.029
azed+0.029
ider+0.029
Neg logit contributions
 it-0.034
 the-0.026
 his-0.026
 fast-0.025
 he-0.024
Token Ben
Token ID3932
Feature activation+2.22e-02

Pos logit contributions
int+0.001
 extingu+0.001
ifying+0.001
azed+0.001
sed+0.001
Neg logit contributions
 it-0.000
 fast-0.000
 his-0.000
 everyone-0.000
 he-0.000
Token were
Token ID547
Feature activation+1.61e-02

Pos logit contributions
 it+0.043
 the+0.032
 his+0.031
 fast+0.029
 he+0.028
Neg logit contributions
int-0.061
 extingu-0.055
ider-0.052
azed-0.052
aj-0.051
Token at
Token ID379
Feature activation+3.74e-02

Pos logit contributions
 it+0.030
 the+0.022
 his+0.022
 fast+0.021
 everyone+0.020
Neg logit contributions
int-0.045
 extingu-0.043
ener-0.038
azed-0.038
ifying-0.038
Token the
Token ID262
Feature activation+1.80e-02

Pos logit contributions
 it+0.028
 fast+0.007
 his+0.006
 everyone+0.005
 he+0.005
Neg logit contributions
int-0.132
 extingu-0.131
ishes-0.119
aj-0.119
ifying-0.119
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
 it+0.029
 his+0.021
 he+0.019
 fast+0.019
 everyone+0.019
Neg logit contributions
int-0.046
 extingu-0.042
azed-0.040
rots-0.039
ishes-0.038
Token the
Token ID262
Feature activation-2.85e-03

Pos logit contributions
int+0.060
 extingu+0.057
azed+0.054
 department+0.054
oke+0.053
Neg logit contributions
 it-0.015
 his-0.006
 fast-0.005
 the-0.004
 he-0.004
Token calf
Token ID31134
Feature activation-6.41e-02

Pos logit contributions
int+0.007
 extingu+0.006
ifying+0.006
aj+0.006
ider+0.006
Neg logit contributions
 it-0.007
 the-0.006
 fast-0.005
 everyone-0.005
 her-0.005
Token was
Token ID373
Feature activation-1.27e-02

Pos logit contributions
int+0.202
 extingu+0.190
ifying+0.177
arl+0.177
ider+0.176
Neg logit contributions
 it-0.103
 the-0.070
 his-0.064
 everyone-0.062
 he-0.061
Token now
Token ID783
Feature activation-3.53e-02

Pos logit contributions
int+0.035
 extingu+0.032
azed+0.030
ifying+0.029
ider+0.029
Neg logit contributions
 it-0.023
 his-0.016
 the-0.016
 fast-0.015
 everyone-0.015
Token grown
Token ID7334
Feature activation-5.68e-02

Pos logit contributions
int+0.102
 extingu+0.093
azed+0.087
oke+0.086
rots+0.085
Neg logit contributions
 it-0.063
 his-0.043
 the-0.043
 fast-0.041
 everyone-0.040
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
int+0.140
 extingu+0.127
azed+0.117
ider+0.114
rots+0.114
Neg logit contributions
 it-0.121
 his-0.089
 the-0.088
 fast-0.086
 everyone-0.085
Token The
Token ID383
Feature activation-3.82e-02

Pos logit contributions
int+0.033
 extingu+0.030
azed+0.028
oke+0.028
ifying+0.028
Neg logit contributions
 it-0.020
 his-0.014
 fast-0.013
 the-0.013
 everyone-0.012
Token cow
Token ID9875
Feature activation-1.78e-02

Pos logit contributions
int+0.099
aj+0.088
ifying+0.088
ives+0.088
 extingu+0.087
Neg logit contributions
 it-0.082
 the-0.069
 fast-0.064
 everyone-0.063
 her-0.061
Token was
Token ID373
Feature activation-1.24e-02

Pos logit contributions
int+0.055
 extingu+0.052
ifying+0.049
arl+0.049
ider+0.048
Neg logit contributions
 it-0.029
 the-0.020
 his-0.019
 everyone-0.017
 he-0.017
Token happy
Token ID3772
Feature activation-1.65e-02

Pos logit contributions
int+0.039
 extingu+0.037
azed+0.035
ifying+0.034
aj+0.034
Neg logit contributions
 it-0.017
 his-0.010
 the-0.010
 fast-0.009
 everyone-0.009
Tokenâ
Token ID22940
Feature activation+1.47e-02

Pos logit contributions
 it+0.018
 his+0.013
 fast+0.013
 the+0.013
 everyone+0.013
Neg logit contributions
int-0.026
 extingu-0.023
ifying-0.022
oke-0.022
azed-0.022
Token
Token ID26391
Feature activation+1.37e-02

Pos logit contributions
 it+0.030
 his+0.022
 the+0.022
 fast+0.021
 everyone+0.021
Neg logit contributions
int-0.038
 extingu-0.034
azed-0.031
oke-0.031
ider-0.031
Token
Token ID220
Feature activation+2.69e-02

Pos logit contributions
 it+0.025
 his+0.021
 everyone+0.019
 fast+0.019
 the+0.018
Neg logit contributions
int-0.034
aj-0.030
oke-0.030
rots-0.030
 extingu-0.030
Token
Token ID198
Feature activation-1.31e-03

Pos logit contributions
 it+0.038
 his+0.024
 fast+0.022
 the+0.022
 everyone+0.021
Neg logit contributions
int-0.083
 extingu-0.076
oke-0.074
aj-0.073
azed-0.073
Token
Token ID198
Feature activation-1.13e-03

Pos logit contributions
int+0.003
oke+0.003
ives+0.003
 extingu+0.003
aj+0.003
Neg logit contributions
 it-0.002
 his-0.002
 everyone-0.001
 the-0.001
 fast-0.001
TokenHis
Token ID6653
Feature activation-5.08e-02

Pos logit contributions
int+0.003
 extingu+0.002
oke+0.002
ives+0.002
ifying+0.002
Neg logit contributions
 it-0.002
 the-0.002
 his-0.002
 everyone-0.002
 fast-0.002
Token friend
Token ID1545
Feature activation-3.57e-02

Pos logit contributions
int+0.118
 extingu+0.107
azed+0.097
ifying+0.096
oke+0.095
Neg logit contributions
 it-0.116
 his-0.089
 the-0.088
 fast-0.086
 everyone-0.084
Token said
Token ID531
Feature activation-3.21e-02

Pos logit contributions
int+0.096
 extingu+0.088
azed+0.081
oke+0.080
rots+0.080
Neg logit contributions
 it-0.067
 his-0.048
 the-0.047
 fast-0.046
 everyone-0.045
Token,
Token ID11
Feature activation+3.99e-03

Pos logit contributions
int+0.094
 extingu+0.088
azed+0.082
oke+0.082
rots+0.080
Neg logit contributions
 it-0.047
 fast-0.031
 his-0.030
 the-0.029
 everyone-0.029
Token
Token ID6184
Feature activation+5.94e-02

Pos logit contributions
 it+0.007
 his+0.005
 fast+0.005
 the+0.005
 everyone+0.005
Neg logit contributions
int-0.011
 extingu-0.010
oke-0.009
azed-0.009
ifying-0.009
Token
Token ID95
Feature activation+9.40e-02

Pos logit contributions
 it+0.157
 his+0.126
 the+0.123
 everyone+0.121
 fast+0.121
Neg logit contributions
int-0.118
 extingu-0.102
ider-0.093
oke-0.091
azed-0.089
Token a
Token ID257
Feature activation-4.02e-03

Pos logit contributions
 it+0.003
 fast+0.002
 he+0.002
 everyone+0.002
 his+0.002
Neg logit contributions
int-0.009
 extingu-0.008
azed-0.007
rots-0.007
ishes-0.007
Token bite
Token ID13197
Feature activation-6.25e-02

Pos logit contributions
int+0.011
 extingu+0.010
azed+0.009
rots+0.009
ider+0.009
Neg logit contributions
 it-0.008
 his-0.005
 the-0.005
 fast-0.005
 he-0.005
Token of
Token ID286
Feature activation+4.29e-03

Pos logit contributions
int+0.177
 extingu+0.165
azed+0.154
ifying+0.151
oke+0.150
Neg logit contributions
 it-0.102
 his-0.068
 fast-0.067
 the-0.066
 everyone-0.065
Token the
Token ID262
Feature activation-8.51e-04

Pos logit contributions
 it+0.003
 fast+0.001
 everyone+0.001
 his+0.001
 he+0.000
Neg logit contributions
int-0.015
 extingu-0.015
azed-0.014
ifying-0.014
ishes-0.013
Token water
Token ID1660
Feature activation-5.90e-03

Pos logit contributions
int+0.002
 extingu+0.002
aj+0.002
oke+0.002
ider+0.002
Neg logit contributions
 it-0.002
 the-0.001
 fast-0.001
 he-0.001
 his-0.001
Tokenmelon
Token ID45690
Feature activation-1.39e-01

Pos logit contributions
int+0.019
 extingu+0.018
azed+0.017
ider+0.016
rots+0.016
Neg logit contributions
 it-0.008
 his-0.005
 the-0.005
 fast-0.005
 everyone-0.004
Token,
Token ID11
Feature activation+4.74e-03

Pos logit contributions
int+0.452
 extingu+0.419
azed+0.393
ider+0.389
aj+0.388
Neg logit contributions
 it-0.194
 his-0.117
 the-0.115
 fast-0.107
 everyone-0.103
Token he
Token ID339
Feature activation+1.77e-02

Pos logit contributions
 it+0.003
 fast+0.002
 his+0.001
 slow+0.001
 everyone+0.001
Neg logit contributions
int-0.015
 extingu-0.015
ifying-0.014
azed-0.014
rots-0.014
Token heard
Token ID2982
Feature activation-2.99e-02

Pos logit contributions
 it+0.036
 the+0.025
 his+0.025
 fast+0.024
 he+0.024
Neg logit contributions
int-0.047
 extingu-0.045
oke-0.040
arl-0.040
elling-0.039
Token the
Token ID262
Feature activation-1.64e-03

Pos logit contributions
int+0.094
 extingu+0.088
azed+0.082
ifying+0.081
rots+0.081
Neg logit contributions
 it-0.039
 fast-0.023
 his-0.022
 the-0.021
 everyone-0.020
Token farmer
Token ID18739
Feature activation-3.11e-02

Pos logit contributions
int+0.005
 extingu+0.004
aj+0.004
azed+0.004
oke+0.004
Neg logit contributions
 it-0.003
 the-0.002
 fast-0.002
 his-0.002
 everyone-0.002
TokenYou
Token ID1639
Feature activation+1.19e-02

Pos logit contributions
int+0.042
 extingu+0.038
azed+0.035
oke+0.034
rots+0.034
Neg logit contributions
 it-0.040
 his-0.030
 the-0.030
 fast-0.030
 everyone-0.029
Token should
Token ID815
Feature activation+7.71e-02

Pos logit contributions
 it+0.025
 the+0.018
 his+0.018
 everyone+0.017
 fast+0.017
Neg logit contributions
int-0.031
 extingu-0.029
ider-0.027
azed-0.026
aj-0.026
Token not
Token ID407
Feature activation+1.77e-02

Pos logit contributions
 it+0.102
 his+0.065
 fast+0.065
 he+0.059
 everyone+0.058
Neg logit contributions
int-0.223
 extingu-0.216
ider-0.196
ccoli-0.195
arl-0.194
Token touch
Token ID3638
Feature activation+1.62e-02

Pos logit contributions
 it+0.031
 his+0.022
 the+0.021
 fast+0.021
 everyone+0.020
Neg logit contributions
int-0.050
 extingu-0.046
azed-0.042
oke-0.042
ifying-0.041
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
 it+0.014
 fast+0.006
 his+0.006
 he+0.005
 everyone+0.005
Neg logit contributions
int-0.057
 extingu-0.054
azed-0.051
ifying-0.051
rots-0.051
Token oven
Token ID14361
Feature activation-1.30e-01

Pos logit contributions
 it+0.031
 the+0.023
 fast+0.021
 his+0.021
 everyone+0.020
Neg logit contributions
int-0.044
azed-0.035
aj-0.034
oke-0.034
rots-0.034
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
int+0.443
 extingu+0.391
rots+0.384
azed+0.381
ifying+0.379
Neg logit contributions
 it-0.185
 the-0.121
 his-0.106
 fast-0.105
 he-0.097
Token It
Token ID632
Feature activation+3.00e-02

Pos logit contributions
 it+0.037
 his+0.027
 fast+0.026
 the+0.025
 everyone+0.025
Neg logit contributions
int-0.056
 extingu-0.052
azed-0.048
ifying-0.047
oke-0.047
Token is
Token ID318
Feature activation-1.31e-02

Pos logit contributions
 it+0.056
 the+0.040
 his+0.037
 fast+0.035
 he+0.035
Neg logit contributions
int-0.089
 extingu-0.078
aj-0.076
ider-0.075
ifying-0.075
Token hot
Token ID3024
Feature activation-1.99e-02

Pos logit contributions
int+0.045
 extingu+0.040
azed+0.039
rots+0.039
ives+0.039
Neg logit contributions
 it-0.019
 the-0.012
 his-0.011
 fast-0.011
 everyone-0.009
Token and
Token ID290
Feature activation+1.44e-02

Pos logit contributions
int+0.061
 extingu+0.055
azed+0.053
rots+0.052
ifying+0.052
Neg logit contributions
 it-0.033
 the-0.022
 his-0.021
 fast-0.021
 he-0.019
Token stealing
Token ID16568
Feature activation-3.93e-02

Pos logit contributions
int+0.025
 extingu+0.023
azed+0.022
oke+0.021
ider+0.021
Neg logit contributions
 it-0.016
 his-0.011
 the-0.011
 fast-0.011
 everyone-0.011
Token honey
Token ID12498
Feature activation-1.36e-02

Pos logit contributions
int+0.127
 extingu+0.121
azed+0.112
ifying+0.111
oke+0.111
Neg logit contributions
 it-0.047
 fast-0.026
 his-0.025
 the-0.024
 everyone-0.024
Token from
Token ID422
Feature activation-5.84e-03

Pos logit contributions
int+0.042
 extingu+0.038
azed+0.036
aj+0.035
rots+0.035
Neg logit contributions
 it-0.022
 the-0.015
 his-0.014
 fast-0.013
 he-0.013
Token the
Token ID262
Feature activation-4.40e-03

Pos logit contributions
int+0.020
 extingu+0.020
ifying+0.018
ener+0.018
azed+0.018
Neg logit contributions
 it-0.004
 fast-0.001
 he-0.001
 his-0.001
 everyone-0.001
Token be
Token ID307
Feature activation-1.90e-02

Pos logit contributions
int+0.012
 extingu+0.011
azed+0.010
oke+0.010
aj+0.010
Neg logit contributions
 it-0.009
 the-0.006
 fast-0.006
 his-0.006
 everyone-0.006
Tokeneh
Token ID17231
Feature activation-1.85e-01

Pos logit contributions
int+0.029
 extingu+0.027
ifying+0.022
rots+0.022
azed+0.022
Neg logit contributions
 it-0.059
 his-0.049
 the-0.049
 fast-0.048
 he-0.046
Tokenives
Token ID1083
Feature activation-1.45e-01

Pos logit contributions
int+0.181
 extingu+0.177
aj+0.142
led+0.129
ccoli+0.125
Neg logit contributions
 it-0.575
 he-0.501
 the-0.495
 fast-0.493
 Tim-0.479
Token without
Token ID1231
Feature activation+6.30e-03

Pos logit contributions
int+0.433
 extingu+0.401
azed+0.373
oke+0.368
ifying+0.367
Neg logit contributions
 it-0.232
 his-0.152
 the-0.149
 fast-0.145
 everyone-0.140
Token permission
Token ID7170
Feature activation-7.82e-02

Pos logit contributions
 it+0.010
 fast+0.007
 his+0.006
 he+0.006
 everyone+0.006
Neg logit contributions
int-0.017
 extingu-0.016
azed-0.015
urger-0.015
ener-0.015
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
int+0.218
 extingu+0.202
azed+0.191
oke+0.187
ider+0.187
Neg logit contributions
 it-0.128
 his-0.088
 fast-0.084
 everyone-0.083
 the-0.083
Token She
Token ID1375
Feature activation-2.08e-02

Pos logit contributions
int+0.037
 extingu+0.034
oke+0.032
azed+0.032
ifying+0.031
Neg logit contributions
 it-0.022
 fast-0.015
 his-0.015
 the-0.015
 everyone-0.014
Token.
Token ID13
Feature activation-2.94e-04

Pos logit contributions
int+0.169
 extingu+0.156
azed+0.146
aj+0.145
ifying+0.144
Neg logit contributions
 it-0.082
 the-0.051
 his-0.051
 fast-0.048
 everyone-0.047
Token
Token ID198
Feature activation+8.01e-03

Pos logit contributions
int+0.001
 extingu+0.001
oke+0.001
ives+0.001
azed+0.001
Neg logit contributions
 it-0.000
 his-0.000
 fast-0.000
 everyone-0.000
 the-0.000
Token
Token ID198
Feature activation+8.25e-03

Pos logit contributions
 it+0.013
 his+0.010
 everyone+0.010
 he+0.009
 the+0.009
Neg logit contributions
int-0.019
oke-0.018
 extingu-0.017
ives-0.017
omed-0.017
TokenThey
Token ID2990
Feature activation+1.35e-02

Pos logit contributions
 it+0.017
 his+0.013
 the+0.012
 everyone+0.012
 he+0.012
Neg logit contributions
int-0.020
 extingu-0.018
ider-0.017
ifying-0.017
oke-0.017
Token sne
Token ID10505
Feature activation+5.18e-02

Pos logit contributions
 it+0.026
 the+0.019
 his+0.018
 fast+0.017
 everyone+0.017
Neg logit contributions
int-0.037
 extingu-0.034
aj-0.032
ider-0.032
oke-0.032
Tokenaked
Token ID4335
Feature activation-1.28e-01

Pos logit contributions
 it+0.191
 his+0.164
 the+0.162
 fast+0.159
 everyone+0.157
Neg logit contributions
int-0.043
 extingu-0.036
ider-0.028
ifying-0.027
aj-0.025
Token behind
Token ID2157
Feature activation-2.34e-02

Pos logit contributions
int+0.344
 extingu+0.312
azed+0.291
oke+0.288
rots+0.287
Neg logit contributions
 it-0.249
 his-0.178
 the-0.177
 fast-0.171
 everyone-0.165
Token the
Token ID262
Feature activation+9.84e-03

Pos logit contributions
 extingu+0.076
int+0.075
ifying+0.069
azed+0.068
 department+0.068
Neg logit contributions
 it-0.019
 he-0.010
 fast-0.009
 his-0.009
 everyone-0.007
Token truck
Token ID7779
Feature activation-5.32e-02

Pos logit contributions
 it+0.022
 his+0.017
 the+0.017
 fast+0.016
 everyone+0.016
Neg logit contributions
int-0.023
 extingu-0.021
azed-0.019
oke-0.019
ifying-0.019
Token and
Token ID290
Feature activation-4.99e-03

Pos logit contributions
int+0.147
 extingu+0.139
azed+0.126
oke+0.125
ider+0.123
Neg logit contributions
 it-0.094
 his-0.064
 the-0.062
 fast-0.062
 everyone-0.062
Token climbed
Token ID19952
Feature activation-4.27e-02

Pos logit contributions
int+0.012
 extingu+0.011
azed+0.010
ifying+0.010
ener+0.010
Neg logit contributions
 it-0.010
 his-0.008
 fast-0.008
 everyone-0.007
 the-0.007
Token the
Token ID262
Feature activation-1.58e-03

Pos logit contributions
int+0.143
 extingu+0.133
azed+0.124
ifying+0.123
rots+0.122
Neg logit contributions
 it-0.060
 his-0.036
 fast-0.035
 the-0.033
 he-0.033
Token dough
Token ID15756
Feature activation-7.06e-02

Pos logit contributions
int+0.004
aj+0.003
ider+0.003
 extingu+0.003
oke+0.003
Neg logit contributions
 it-0.004
 the-0.003
 fast-0.003
 his-0.003
 everyone-0.003
Token out
Token ID503
Feature activation-2.71e-02

Pos logit contributions
int+0.214
 extingu+0.185
oke+0.183
ider+0.182
azed+0.181
Neg logit contributions
 it-0.124
 the-0.085
 his-0.082
 fast-0.078
 he-0.075
Token of
Token ID286
Feature activation+2.33e-03

Pos logit contributions
int+0.078
 extingu+0.070
ider+0.066
azed+0.066
rots+0.065
Neg logit contributions
 it-0.049
 the-0.035
 his-0.034
 fast-0.033
 he-0.031
Token the
Token ID262
Feature activation+7.40e-04

Pos logit contributions
 it+0.002
 fast+0.001
 his+0.001
 he+0.000
 everyone+0.000
Neg logit contributions
int-0.008
 extingu-0.008
ifying-0.008
azed-0.007
ishes-0.007
Token oven
Token ID14361
Feature activation-1.44e-01

Pos logit contributions
 it+0.002
 the+0.002
 fast+0.002
 his+0.002
 everyone+0.001
Neg logit contributions
int-0.002
 extingu-0.001
azed-0.001
oke-0.001
uddled-0.001
Token and
Token ID290
Feature activation-9.45e-03

Pos logit contributions
int+0.465
 extingu+0.405
azed+0.401
rots+0.399
ider+0.398
Neg logit contributions
 it-0.223
 the-0.145
 his-0.133
 fast-0.128
 he-0.125
Token cut
Token ID2005
Feature activation-2.18e-02

Pos logit contributions
int+0.027
 extingu+0.024
azed+0.023
ider+0.022
oke+0.022
Neg logit contributions
 it-0.018
 the-0.013
 his-0.012
 fast-0.012
 everyone-0.012
Token it
Token ID340
Feature activation+2.48e-02

Pos logit contributions
int+0.086
 extingu+0.082
azed+0.077
ifying+0.077
rots+0.077
Neg logit contributions
 it-0.009
 his+0.001
 fast+0.001
 he+0.002
 everyone+0.003
Token into
Token ID656
Feature activation-5.75e-03

Pos logit contributions
 it+0.045
 his+0.031
 the+0.031
 fast+0.030
 everyone+0.029
Neg logit contributions
int-0.071
 extingu-0.064
oke-0.060
azed-0.060
rots-0.060
Token pieces
Token ID5207
Feature activation+1.74e-02

Pos logit contributions
int+0.016
 extingu+0.016
ifying+0.014
azed+0.014
rots+0.014
Neg logit contributions
 it-0.009
 his-0.006
 fast-0.006
 the-0.006
 everyone-0.006