TOP ACTIVATIONS
MAX = 8.030

clown s juggling and ac rob ats doing flips . But
things aren 't always what they seem . Thank you for
What are they doing ? â
elephants , lions and ac rob ats . They are amazed
mom , " What are they doing ?" Her mom told
to give his family what they wanted . One night ,
parts and explained what each one did . Her mom was
don 't know what is up there , so it is
and appreciate them for what they are . She said that
m ama " What are they doing ?" Mama said "
and want to see what he has . They ask Mom
What do you think 's up there , cloud ?" Max
meant everybody could find what they needed easier and faster .
chocolate and nuts . Which one do you want , Tom
mom , " What are they doing ?"
have to give them what they want ." "
them to find out what they were , and was surprised
go and see what 's up there ," said Anna ,
then Jane asked what it weighed . Her mom said it
parents and always did what they asked .

MIN ACTIVATIONS
MIN = -5.696

she could and pushed Wh isk ers out of the way
while he climbed . Wh isk ers me owed and watched
. He laughed and threw Jazz on the floor . Jazz
unlock the lock . Wh isk ers was so happy !
The door broke and Wh isk ers got stuck inside .
with the brush . Wh isk ers got very angry and
a happy song and Wh isk ers listened .
happy that she hugged Wh isk ers . " Thank you
. The vet showed Wh isk ers the x - ray
and held tight . Wh isk ers pushed him gently ,
yard . Buster saw Wh isk ers and started to chase
singing and looked at Wh isk ers . "
drove home . Lily hugged Rose and said , " We
his paw again . Wh isk ers loved playing in the
of the camera . Wh isk ers wanted to be in
graceful ." Wh isk ers climbed the tree and
share ." Spot and Wh isk ers agreed , and they
, and he and Wh isk ers played with the new
just in time . Wh isk ers was safe and sound
family sh oo ed Wh isk ers away and told Max

INTERVAL 4.599 - 8.030
CONTAINS 0.057%

and wanted to choose what she did . One
it 's good for dinner ," Mama said .
made sure they had everything they needed . All of the
John felt sorry for what he did , so he said
for was something that no one could take away . And

INTERVAL 1.167 - 4.599
CONTAINS 13.355%

can 't always get what you want . Sometimes you have
in the boat . Then he lay down and started to
You are in big trouble !" He grabbed Lily
too . I help people who have problems . I read
the forest one day when she heard a very calm voice

INTERVAL -2.265 - 1.167
CONTAINS 84.854%

a big , pink bag to carry all my toys .
. The princess sang along and enjoyed the ride .
track with some books and boxes ?" Tom nods and smiles
Mom my had reminded her of the fun they could have
. She is holding a tray with two cups and a

INTERVAL -5.696 - -2.265
CONTAINS 1.733%

nose , a mouth and ears . But he also had
. " No , thank you , cat ," he said
ooter was still harmless and didn 't hurt anyone . Tim
. Anna smiled and hugged her mum back .
, she put the orange peel in the garbage . She
Token clown
Token ID25573
Feature activation+6.71e-01

Pos logit contributions
mun+0.009
 wrong+0.008
 first+0.008
 fishing+0.007
 older+0.007
Neg logit contributions
 relief-0.013
tex-0.013
olk-0.012
text-0.012
 silk-0.012
Tokens
Token ID82
Feature activation+3.85e-01

Pos logit contributions
mun+0.009
 wrong+0.008
roads+0.008
 exception+0.007
top+0.007
Neg logit contributions
tex-0.022
 relief-0.021
text-0.021
 ow-0.020
olk-0.020
Token juggling
Token ID48097
Feature activation+6.99e-01

Pos logit contributions
mun+0.008
 wrong+0.007
 first+0.007
 fishing+0.007
roads+0.007
Neg logit contributions
 relief-0.010
tex-0.010
olk-0.009
 silk-0.009
text-0.009
Token and
Token ID290
Feature activation-3.82e-01

Pos logit contributions
mun+0.016
 fishing+0.015
 first+0.015
 wrong+0.014
roads+0.014
Neg logit contributions
 relief-0.019
tex-0.018
olk-0.017
 silk-0.016
 ow-0.016
Token ac
Token ID936
Feature activation-3.41e+00

Pos logit contributions
 relief+0.010
tex+0.010
 silk+0.009
olk+0.009
 ow+0.009
Neg logit contributions
mun-0.008
 wrong-0.007
roads-0.007
 first-0.007
 exception-0.007
Tokenrob
Token ID22609
Feature activation+6.70e+00

Pos logit contributions
 relief+0.145
tex+0.141
olk+0.135
 silk+0.133
 ow+0.131
Neg logit contributions
mun-0.027
 wrong-0.016
roads-0.015
 first-0.015
 fishing-0.014
Tokenats
Token ID1381
Feature activation-1.51e+00

Pos logit contributions
mun+0.150
 first+0.149
roads+0.149
 wrong+0.146
 fishing+0.144
Neg logit contributions
tex-0.178
text-0.158
 relief-0.151
olk-0.140
 ow-0.136
Token doing
Token ID1804
Feature activation-1.28e+00

Pos logit contributions
 relief+0.043
tex+0.043
olk+0.039
 silk+0.038
text+0.038
Neg logit contributions
mun-0.030
 wrong-0.027
 first-0.026
roads-0.026
 fishing-0.026
Token flips
Token ID45971
Feature activation+7.98e-02

Pos logit contributions
 relief+0.032
tex+0.032
 silk+0.029
olk+0.028
 ow+0.028
Neg logit contributions
mun-0.030
 wrong-0.026
roads-0.026
 first-0.026
 fishing-0.025
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
mun+0.002
 wrong+0.001
 first+0.001
roads+0.001
 fishing+0.001
Neg logit contributions
 relief-0.002
tex-0.002
olk-0.002
 silk-0.002
text-0.002
Token But
Token ID887
Feature activation-7.58e-01

Pos logit contributions
tex+0.004
 relief+0.004
olk+0.004
 Kitty+0.004
text+0.003
Neg logit contributions
mun-0.005
 wrong-0.004
roads-0.004
 fishing-0.004
 first-0.004
Token things
Token ID1243
Feature activation+1.19e-02

Pos logit contributions
mun+0.005
roads+0.004
top+0.004
 exception+0.004
outs+0.004
Neg logit contributions
 relief-0.007
tex-0.007
 silk-0.006
 ow-0.006
 click-0.006
Token aren
Token ID3588
Feature activation-1.78e-01

Pos logit contributions
mun+0.000
 wrong+0.000
 first+0.000
roads+0.000
 fishing+0.000
Neg logit contributions
 relief-0.000
tex-0.000
olk-0.000
 silk-0.000
text-0.000
Token't
Token ID470
Feature activation+1.56e+00

Pos logit contributions
tex+0.005
 relief+0.005
text+0.005
 ow+0.005
 silk+0.005
Neg logit contributions
mun-0.004
 fishing-0.003
 exception-0.003
roads-0.003
 shopping-0.003
Token always
Token ID1464
Feature activation+3.50e+00

Pos logit contributions
mun+0.041
roads+0.035
top+0.034
outs+0.033
our+0.032
Neg logit contributions
 relief-0.038
tex-0.037
 ow-0.034
 silk-0.033
 Kitty-0.032
Token what
Token ID644
Feature activation+2.20e+00

Pos logit contributions
mun+0.095
roads+0.080
top+0.079
outs+0.075
 shopping+0.075
Neg logit contributions
 relief-0.085
tex-0.076
 silk-0.071
 ow-0.070
 click-0.068
Token they
Token ID484
Feature activation+6.44e+00

Pos logit contributions
mun+0.043
roads+0.037
 exception+0.035
top+0.035
 shopping+0.033
Neg logit contributions
tex-0.065
 relief-0.064
 silk-0.060
 ow-0.059
text-0.059
Token seem
Token ID1283
Feature activation+5.04e+00

Pos logit contributions
mun+0.192
roads+0.176
top+0.169
 fishing+0.167
outs+0.164
Neg logit contributions
 relief-0.124
tex-0.110
 ow-0.107
 click-0.095
 sil-0.088
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
mun+0.138
roads+0.123
top+0.114
 salon+0.112
 register+0.111
Neg logit contributions
 relief-0.110
 ow-0.099
tex-0.097
 silk-0.095
 sil-0.094
Token Thank
Token ID6952
Feature activation-6.27e-01

Pos logit contributions
mun+0.004
 first+0.004
 wrong+0.004
 fishing+0.004
roads+0.004
Neg logit contributions
tex-0.005
 relief-0.005
olk-0.005
 ow-0.004
 silk-0.004
Token you
Token ID345
Feature activation-2.63e+00

Pos logit contributions
 relief+0.022
tex+0.022
text+0.020
 silk+0.020
 ow+0.020
Neg logit contributions
mun-0.009
 exception-0.008
roads-0.007
 wrong-0.007
top-0.007
Token for
Token ID329
Feature activation+1.00e+00

Pos logit contributions
 relief+0.086
tex+0.085
olk+0.080
 silk+0.077
 ow+0.076
Neg logit contributions
mun-0.041
 first-0.038
 wrong-0.038
 fishing-0.036
 older-0.035
Token
Token ID26391
Feature activation-8.39e-01

Pos logit contributions
mun+0.049
 wrong+0.041
 exception+0.040
 older+0.040
re+0.039
Neg logit contributions
 relief-0.041
tex-0.036
icol-0.035
text-0.033
 aver-0.033
Token
Token ID129
Feature activation+3.50e-01

Pos logit contributions
tex+0.020
 relief+0.020
text+0.018
icol+0.017
 ow+0.017
Neg logit contributions
mun-0.020
 wrong-0.018
roads-0.018
 exception-0.018
 first-0.018
Token
Token ID241
Feature activation+9.55e-01

Pos logit contributions
mun+0.010
 wrong+0.008
re+0.008
 leaders+0.008
roads+0.008
Neg logit contributions
 relief-0.008
tex-0.007
icol-0.007
olk-0.006
 Kitty-0.006
TokenWhat
Token ID2061
Feature activation+3.37e-01

Pos logit contributions
mun+0.025
 wrong+0.022
 leaders+0.022
roads+0.022
 older+0.022
Neg logit contributions
 relief-0.022
tex-0.021
 silk-0.019
olk-0.018
text-0.018
Token are
Token ID389
Feature activation+4.35e+00

Pos logit contributions
mun+0.009
roads+0.009
top+0.008
 exception+0.008
 shopping+0.008
Neg logit contributions
tex-0.007
 relief-0.007
text-0.006
 ow-0.006
 silk-0.006
Token they
Token ID484
Feature activation+6.41e+00

Pos logit contributions
roads+0.091
mun+0.089
top+0.086
yard+0.084
 workshop+0.084
Neg logit contributions
tex-0.110
text-0.106
 Kitty-0.105
 silk-0.097
 Snow-0.097
Token doing
Token ID1804
Feature activation+2.60e+00

Pos logit contributions
mun+0.139
top+0.120
outs+0.118
roads+0.116
 exception+0.106
Neg logit contributions
 relief-0.162
 ow-0.156
tex-0.156
 click-0.143
 Kitty-0.140
Token?
Token ID30
Feature activation+3.43e+00

Pos logit contributions
mun+0.052
top+0.041
roads+0.039
 exception+0.038
yard+0.038
Neg logit contributions
 relief-0.071
tex-0.071
 silk-0.064
 ow-0.064
 sil-0.063
Tokenâ
Token ID22940
Feature activation+1.47e+00

Pos logit contributions
mun+0.091
 wrong+0.084
roads+0.079
 exception+0.078
 leaders+0.078
Neg logit contributions
 relief-0.071
tex-0.070
text-0.064
 Kitty-0.064
 silk-0.061
Token
Token ID26391
Feature activation-2.08e-01

Pos logit contributions
mun+0.036
 leaders+0.031
 fishing+0.031
 first+0.031
 older+0.031
Neg logit contributions
 relief-0.037
tex-0.035
olk-0.031
icol-0.030
text-0.030
Token
Token ID198
Feature activation+7.69e-01

Pos logit contributions
tex+0.005
 relief+0.005
text+0.005
icol+0.005
olk+0.004
Neg logit contributions
mun-0.005
 wrong-0.004
roads-0.004
 first-0.004
 exception-0.004
Token elephants
Token ID25368
Feature activation+5.76e-01

Pos logit contributions
mun+0.010
 wrong+0.009
 first+0.009
 fishing+0.009
roads+0.008
Neg logit contributions
 relief-0.011
tex-0.011
olk-0.010
 silk-0.010
text-0.010
Token,
Token ID11
Feature activation+2.09e-02

Pos logit contributions
mun+0.013
 wrong+0.011
roads+0.011
 first+0.011
 exception+0.011
Neg logit contributions
 relief-0.015
tex-0.015
olk-0.014
 silk-0.014
 ow-0.013
Token lions
Token ID31420
Feature activation-2.03e-01

Pos logit contributions
mun+0.000
 first+0.000
 fishing+0.000
 wrong+0.000
 older+0.000
Neg logit contributions
 relief-0.001
tex-0.001
olk-0.000
text-0.000
 silk-0.000
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
 relief+0.006
tex+0.006
olk+0.006
text+0.006
 silk+0.006
Neg logit contributions
mun-0.004
 first-0.003
 fishing-0.003
 wrong-0.003
 shopping-0.003
Token ac
Token ID936
Feature activation-3.85e+00

Pos logit contributions
 relief+0.025
tex+0.025
 ow+0.023
 silk+0.023
olk+0.022
Neg logit contributions
mun-0.025
 wrong-0.023
roads-0.022
 first-0.022
top-0.022
Tokenrob
Token ID22609
Feature activation+6.41e+00

Pos logit contributions
 relief+0.159
tex+0.155
olk+0.147
 silk+0.144
text+0.144
Neg logit contributions
mun-0.033
 wrong-0.022
 fishing-0.021
 first-0.021
roads-0.020
Tokenats
Token ID1381
Feature activation-1.25e+00

Pos logit contributions
mun+0.147
roads+0.143
 wrong+0.142
 first+0.142
 older+0.138
Neg logit contributions
tex-0.167
text-0.147
 relief-0.146
olk-0.133
 ow-0.129
Token.
Token ID13
Feature activation-4.89e-01

Pos logit contributions
 relief+0.038
tex+0.038
olk+0.035
 silk+0.034
text+0.034
Neg logit contributions
mun-0.023
 wrong-0.019
roads-0.019
 first-0.019
 fishing-0.018
Token They
Token ID1119
Feature activation-2.67e-01

Pos logit contributions
 relief+0.012
tex+0.012
olk+0.011
 silk+0.010
text+0.010
Neg logit contributions
mun-0.012
 wrong-0.011
 first-0.011
roads-0.010
 fishing-0.010
Token are
Token ID389
Feature activation+4.10e-01

Pos logit contributions
 relief+0.006
tex+0.006
 click+0.005
 ow+0.005
 silk+0.005
Neg logit contributions
mun-0.007
 wrong-0.006
roads-0.006
 fishing-0.006
 shopping-0.006
Token amazed
Token ID24562
Feature activation+3.60e+00

Pos logit contributions
mun+0.009
 wrong+0.008
roads+0.007
 first+0.007
 fishing+0.007
Neg logit contributions
 relief-0.011
tex-0.011
olk-0.010
 silk-0.010
text-0.010
Token mom
Token ID1995
Feature activation+2.91e-01

Pos logit contributions
 relief+0.011
tex+0.011
olk+0.010
 silk+0.009
text+0.009
Neg logit contributions
mun-0.013
 wrong-0.011
roads-0.011
 first-0.011
 fishing-0.011
Token,
Token ID11
Feature activation+2.90e-01

Pos logit contributions
mun+0.007
roads+0.007
rob+0.006
 fishing+0.006
 leaders+0.006
Neg logit contributions
 relief-0.006
tex-0.005
ker-0.005
 Kitty-0.005
olk-0.005
Token "
Token ID366
Feature activation+3.67e-01

Pos logit contributions
mun+0.009
roads+0.009
 exception+0.008
 leaders+0.008
 wrong+0.008
Neg logit contributions
 relief-0.005
tex-0.004
 click-0.004
 Kitty-0.004
 silk-0.004
TokenWhat
Token ID2061
Feature activation+4.39e-01

Pos logit contributions
mun+0.011
 wrong+0.010
 fishing+0.010
roads+0.010
 first+0.009
Neg logit contributions
tex-0.007
 relief-0.007
olk-0.006
 silk-0.006
text-0.006
Token are
Token ID389
Feature activation+4.46e+00

Pos logit contributions
mun+0.012
roads+0.011
 exception+0.011
top+0.011
 first+0.010
Neg logit contributions
tex-0.009
 relief-0.008
text-0.008
 silk-0.008
 ow-0.008
Token they
Token ID484
Feature activation+6.38e+00

Pos logit contributions
mun+0.112
roads+0.097
top+0.091
 workshop+0.089
smanship+0.088
Neg logit contributions
tex-0.099
 Kitty-0.099
 silk-0.097
 Snow-0.092
 ow-0.092
Token doing
Token ID1804
Feature activation+2.38e+00

Pos logit contributions
mun+0.150
roads+0.130
outs+0.126
top+0.125
 exception+0.114
Neg logit contributions
 relief-0.160
tex-0.148
 ow-0.146
 click-0.141
olk-0.133
Token?"
Token ID1701
Feature activation+3.37e+00

Pos logit contributions
mun+0.055
roads+0.043
top+0.042
outs+0.040
 exception+0.039
Neg logit contributions
 relief-0.061
tex-0.059
 silk-0.055
 ow-0.054
 click-0.054
Token Her
Token ID2332
Feature activation-1.27e+00

Pos logit contributions
mun+0.097
 wrong+0.089
 fastest+0.088
roads+0.087
 leaders+0.086
Neg logit contributions
 relief-0.068
tex-0.067
olk-0.059
 Kitty-0.057
text-0.055
Token mom
Token ID1995
Feature activation-5.15e-02

Pos logit contributions
tex+0.034
 relief+0.034
 silk+0.030
olk+0.030
text+0.030
Neg logit contributions
 first-0.029
 older-0.028
mun-0.028
 wrong-0.026
 shopping-0.025
Token told
Token ID1297
Feature activation+1.14e+00

Pos logit contributions
 relief+0.001
tex+0.001
 ow+0.001
olk+0.001
 click+0.001
Neg logit contributions
mun-0.001
 fishing-0.001
roads-0.001
 wrong-0.001
 leaders-0.001
Token to
Token ID284
Feature activation+1.11e-01

Pos logit contributions
mun+0.041
roads+0.034
 exception+0.030
 leaders+0.030
top+0.029
Neg logit contributions
 relief-0.050
tex-0.049
 silk-0.045
olk-0.044
 click-0.044
Token give
Token ID1577
Feature activation+1.88e+00

Pos logit contributions
mun+0.003
 wrong+0.002
roads+0.002
 fishing+0.002
 exception+0.002
Neg logit contributions
 relief-0.003
tex-0.003
olk-0.003
 click-0.003
 ow-0.003
Token his
Token ID465
Feature activation+1.03e+00

Pos logit contributions
mun+0.044
roads+0.038
 wrong+0.038
 first+0.037
 fishing+0.037
Neg logit contributions
 relief-0.049
tex-0.048
olk-0.044
 silk-0.043
text-0.042
Token family
Token ID1641
Feature activation+4.46e-01

Pos logit contributions
mun+0.023
 wrong+0.021
roads+0.020
 first+0.020
 fishing+0.020
Neg logit contributions
 relief-0.027
tex-0.027
olk-0.025
 silk-0.024
text-0.024
Token what
Token ID644
Feature activation+1.04e+00

Pos logit contributions
mun+0.010
roads+0.009
 leaders+0.008
 wrong+0.008
 exception+0.008
Neg logit contributions
 relief-0.012
tex-0.011
olk-0.010
text-0.010
 silk-0.010
Token they
Token ID484
Feature activation+6.33e+00

Pos logit contributions
mun+0.022
roads+0.022
 exception+0.020
our+0.019
outs+0.019
Neg logit contributions
 relief-0.027
tex-0.025
 ow-0.024
 Kitty-0.024
 sil-0.024
Token wanted
Token ID2227
Feature activation+1.69e+00

Pos logit contributions
mun+0.180
roads+0.166
 fishing+0.165
 older+0.162
 wrong+0.162
Neg logit contributions
 relief-0.128
tex-0.120
 ow-0.103
olk-0.096
 click-0.096
Token.
Token ID13
Feature activation+5.63e-01

Pos logit contributions
mun+0.045
roads+0.038
rob+0.035
 exception+0.034
 leaders+0.034
Neg logit contributions
tex-0.038
 relief-0.036
text-0.035
 silk-0.033
 itself-0.033
Token One
Token ID1881
Feature activation+1.25e+00

Pos logit contributions
mun+0.015
 wrong+0.013
 leaders+0.013
roads+0.013
 first+0.012
Neg logit contributions
tex-0.013
 relief-0.012
 Kitty-0.011
olk-0.011
text-0.011
Token night
Token ID1755
Feature activation-1.27e+00

Pos logit contributions
mun+0.025
 first+0.024
 wrong+0.023
 fishing+0.023
 older+0.022
Neg logit contributions
 relief-0.035
tex-0.035
olk-0.032
 silk-0.032
text-0.032
Token,
Token ID11
Feature activation-4.91e-01

Pos logit contributions
 relief+0.039
tex+0.039
olk+0.035
 silk+0.035
text+0.034
Neg logit contributions
mun-0.023
 wrong-0.020
 fishing-0.020
 first-0.020
 shopping-0.019
Token parts
Token ID3354
Feature activation+6.96e-01

Pos logit contributions
mun+0.035
 wrong+0.031
roads+0.030
 first+0.030
 fishing+0.030
Neg logit contributions
 relief-0.030
tex-0.030
olk-0.026
 silk-0.026
text-0.025
Token and
Token ID290
Feature activation+6.00e-01

Pos logit contributions
mun+0.016
 wrong+0.014
roads+0.014
 fishing+0.014
 first+0.014
Neg logit contributions
 relief-0.018
tex-0.018
olk-0.016
 silk-0.016
text-0.015
Token explained
Token ID4893
Feature activation+1.96e+00

Pos logit contributions
mun+0.012
 first+0.010
 wrong+0.010
 fishing+0.010
roads+0.010
Neg logit contributions
 relief-0.018
tex-0.018
olk-0.016
text-0.016
 silk-0.016
Token what
Token ID644
Feature activation+6.84e-01

Pos logit contributions
mun+0.053
 wrong+0.048
 first+0.047
roads+0.046
 fishing+0.046
Neg logit contributions
 relief-0.044
tex-0.043
olk-0.038
 silk-0.037
text-0.037
Token each
Token ID1123
Feature activation+5.70e+00

Pos logit contributions
mun+0.018
 exception+0.016
roads+0.016
 fishing+0.015
top+0.015
Neg logit contributions
 relief-0.015
tex-0.015
 silk-0.014
 ow-0.013
 sil-0.013
Token one
Token ID530
Feature activation+6.31e+00

Pos logit contributions
mun+0.177
ider+0.149
roads+0.149
yard+0.146
 fishing+0.145
Neg logit contributions
tex-0.107
 relief-0.105
 silk-0.097
olk-0.091
 click-0.091
Token did
Token ID750
Feature activation+2.91e+00

Pos logit contributions
mun+0.179
roads+0.157
 fishing+0.157
 exception+0.155
 shopping+0.155
Neg logit contributions
 relief-0.131
tex-0.126
 silk-0.111
 ow-0.109
olk-0.106
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
mun+0.080
roads+0.063
rob+0.063
 fishing+0.062
 asks+0.059
Neg logit contributions
 relief-0.066
 itself-0.061
 silk-0.061
tex-0.061
 click-0.060
Token Her
Token ID2332
Feature activation-5.84e-01

Pos logit contributions
 relief+0.003
tex+0.003
olk+0.002
 silk+0.002
text+0.002
Neg logit contributions
mun-0.002
 wrong-0.002
 first-0.002
roads-0.002
 fishing-0.002
Token mom
Token ID1995
Feature activation-2.18e-01

Pos logit contributions
tex+0.016
 relief+0.015
olk+0.015
text+0.015
icol+0.014
Neg logit contributions
 first-0.012
 older-0.012
 wrong-0.011
 fishing-0.011
mun-0.010
Token was
Token ID373
Feature activation+1.66e+00

Pos logit contributions
 relief+0.006
tex+0.006
olk+0.005
 silk+0.005
text+0.005
Neg logit contributions
mun-0.005
 wrong-0.004
roads-0.004
 first-0.004
 fishing-0.004
Token don
Token ID836
Feature activation-7.40e-01

Pos logit contributions
 relief+0.049
tex+0.049
olk+0.044
 silk+0.043
text+0.042
Neg logit contributions
mun-0.050
 wrong-0.045
 first-0.044
roads-0.043
 fishing-0.043
Token't
Token ID470
Feature activation+1.90e-01

Pos logit contributions
tex+0.024
 relief+0.024
olk+0.023
text+0.022
 ow+0.022
Neg logit contributions
mun-0.012
 wrong-0.010
roads-0.009
 fishing-0.009
 exception-0.009
Token know
Token ID760
Feature activation+1.21e-01

Pos logit contributions
mun+0.004
 first+0.004
 wrong+0.004
 fishing+0.003
roads+0.003
Neg logit contributions
tex-0.005
 relief-0.005
olk-0.005
 silk-0.005
text-0.005
Token what
Token ID644
Feature activation+1.44e+00

Pos logit contributions
mun+0.003
 wrong+0.003
 first+0.003
 older+0.003
 shopping+0.003
Neg logit contributions
 relief-0.003
tex-0.003
olk-0.003
text-0.003
icol-0.002
Token is
Token ID318
Feature activation+4.44e+00

Pos logit contributions
mun+0.035
roads+0.029
 exception+0.028
top+0.027
ios+0.027
Neg logit contributions
tex-0.036
 relief-0.035
 silk-0.034
 ow-0.033
olk-0.032
Token up
Token ID510
Feature activation+6.27e+00

Pos logit contributions
mun+0.114
roads+0.096
top+0.091
 fishing+0.091
 leaders+0.090
Neg logit contributions
 relief-0.109
tex-0.107
 silk-0.098
olk-0.098
 ow-0.095
Token there
Token ID612
Feature activation+4.07e+00

Pos logit contributions
mun+0.136
roads+0.110
 wrong+0.106
 fishing+0.105
top+0.105
Neg logit contributions
 relief-0.176
tex-0.171
olk-0.158
 silk-0.157
 ow-0.151
Token,
Token ID11
Feature activation-1.65e-02

Pos logit contributions
mun+0.103
roads+0.085
 fishing+0.084
 exception+0.080
top+0.080
Neg logit contributions
tex-0.096
 relief-0.094
olk-0.087
 ow-0.085
 silk-0.085
Token so
Token ID523
Feature activation+1.38e+00

Pos logit contributions
tex+0.001
 relief+0.000
text+0.000
olk+0.000
icol+0.000
Neg logit contributions
 first-0.000
 wrong-0.000
mun-0.000
 older-0.000
 shopping-0.000
Token it
Token ID340
Feature activation+8.69e-01

Pos logit contributions
mun+0.019
 first+0.017
 wrong+0.017
roads+0.015
 fishing+0.015
Neg logit contributions
tex-0.048
 relief-0.048
olk-0.045
text-0.044
 silk-0.043
Token is
Token ID318
Feature activation+8.13e-01

Pos logit contributions
mun+0.020
 fishing+0.019
roads+0.018
 shopping+0.017
 exception+0.017
Neg logit contributions
tex-0.022
 relief-0.022
text-0.020
 ow-0.019
olk-0.019
Token and
Token ID290
Feature activation+1.24e+00

Pos logit contributions
mun+0.017
roads+0.014
 asks+0.013
top+0.013
 wrong+0.013
Neg logit contributions
 relief-0.018
tex-0.018
 silk-0.017
 sil-0.016
text-0.016
Token appreciate
Token ID9144
Feature activation+4.09e-02

Pos logit contributions
mun+0.030
 wrong+0.026
roads+0.026
 first+0.025
 fishing+0.025
Neg logit contributions
 relief-0.032
tex-0.031
olk-0.028
 silk-0.027
 ow-0.027
Token them
Token ID606
Feature activation+1.62e+00

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
roads+0.001
 fishing+0.001
Neg logit contributions
 relief-0.001
tex-0.001
olk-0.001
 silk-0.001
text-0.001
Token for
Token ID329
Feature activation+1.49e+00

Pos logit contributions
mun+0.037
roads+0.032
 fishing+0.029
top+0.029
 exception+0.029
Neg logit contributions
 relief-0.043
tex-0.040
 silk-0.038
 sil-0.036
text-0.036
Token what
Token ID644
Feature activation+1.45e+00

Pos logit contributions
mun+0.032
 wrong+0.029
 first+0.028
 fishing+0.028
roads+0.028
Neg logit contributions
 relief-0.041
tex-0.040
olk-0.037
 silk-0.036
text-0.036
Token they
Token ID484
Feature activation+6.25e+00

Pos logit contributions
mun+0.030
roads+0.028
 exception+0.027
top+0.026
 register+0.026
Neg logit contributions
 relief-0.037
tex-0.037
 silk-0.034
text-0.034
 ow-0.034
Token are
Token ID389
Feature activation+4.32e+00

Pos logit contributions
mun+0.182
roads+0.166
 fishing+0.165
 wrong+0.161
 exception+0.159
Neg logit contributions
 relief-0.124
tex-0.110
 ow-0.097
 sil-0.092
 click-0.091
Token.
Token ID13
Feature activation+5.18e-01

Pos logit contributions
mun+0.108
roads+0.092
top+0.082
rob+0.081
 fishing+0.080
Neg logit contributions
 relief-0.106
tex-0.093
 sil-0.091
 silk-0.091
 click-0.088
Token She
Token ID1375
Feature activation-6.20e-01

Pos logit contributions
mun+0.013
 wrong+0.011
 first+0.011
roads+0.011
 fishing+0.011
Neg logit contributions
 relief-0.013
tex-0.013
olk-0.011
text-0.011
 silk-0.011
Token said
Token ID531
Feature activation+3.23e-01

Pos logit contributions
 relief+0.017
tex+0.017
olk+0.016
 silk+0.016
text+0.015
Neg logit contributions
mun-0.013
 first-0.011
 wrong-0.011
roads-0.011
 fishing-0.011
Token that
Token ID326
Feature activation-6.94e-01

Pos logit contributions
mun+0.007
 wrong+0.006
 first+0.006
 fishing+0.006
roads+0.006
Neg logit contributions
 relief-0.009
tex-0.009
olk-0.008
 silk-0.008
text-0.008
Token m
Token ID285
Feature activation+1.83e+00

Pos logit contributions
 relief+0.005
tex+0.005
olk+0.004
 silk+0.004
text+0.004
Neg logit contributions
mun-0.005
 wrong-0.004
 first-0.004
 fishing-0.004
roads-0.004
Tokenama
Token ID1689
Feature activation+2.70e-01

Pos logit contributions
mun+0.051
 wrong+0.049
roads+0.047
 fishing+0.045
 shopping+0.043
Neg logit contributions
tex-0.035
olk-0.035
 relief-0.035
ker-0.031
text-0.031
Token "
Token ID366
Feature activation+1.15e+00

Pos logit contributions
mun+0.006
 wrong+0.005
roads+0.005
 first+0.005
 fishing+0.005
Neg logit contributions
 relief-0.007
tex-0.007
olk-0.006
 silk-0.006
text-0.006
TokenWhat
Token ID2061
Feature activation+6.60e-01

Pos logit contributions
mun+0.030
 first+0.028
 wrong+0.027
roads+0.027
 fishing+0.027
Neg logit contributions
 relief-0.026
tex-0.026
olk-0.023
text-0.022
 silk-0.022
Token are
Token ID389
Feature activation+4.52e+00

Pos logit contributions
mun+0.017
roads+0.016
 exception+0.015
top+0.015
 fishing+0.015
Neg logit contributions
tex-0.015
 relief-0.014
 silk-0.013
text-0.013
 ow-0.013
Token they
Token ID484
Feature activation+6.24e+00

Pos logit contributions
mun+0.091
roads+0.082
top+0.074
 workshop+0.072
 exception+0.070
Neg logit contributions
tex-0.128
 silk-0.120
 Kitty-0.118
text-0.118
 ow-0.117
Token doing
Token ID1804
Feature activation+2.49e+00

Pos logit contributions
mun+0.138
roads+0.120
top+0.115
outs+0.111
 wrong+0.109
Neg logit contributions
 relief-0.165
tex-0.156
 ow-0.149
 click-0.143
olk-0.142
Token?"
Token ID1701
Feature activation+3.63e+00

Pos logit contributions
mun+0.055
roads+0.045
top+0.043
 wrong+0.042
 exception+0.042
Neg logit contributions
 relief-0.067
tex-0.065
olk-0.059
 silk-0.059
 ow-0.058
Token Mama
Token ID40084
Feature activation+4.66e-03

Pos logit contributions
mun+0.101
 wrong+0.097
 fastest+0.092
roads+0.091
 fishing+0.090
Neg logit contributions
 relief-0.076
tex-0.072
olk-0.066
 Kitty-0.065
 ow-0.062
Token said
Token ID531
Feature activation+1.70e+00

Pos logit contributions
mun+0.000
 wrong+0.000
 first+0.000
roads+0.000
 fishing+0.000
Neg logit contributions
 relief-0.000
tex-0.000
olk-0.000
 silk-0.000
text-0.000
Token "
Token ID366
Feature activation+7.20e-01

Pos logit contributions
mun+0.038
 wrong+0.034
 first+0.033
 fishing+0.032
roads+0.032
Neg logit contributions
tex-0.046
 relief-0.046
olk-0.042
 silk-0.040
text-0.040
Token and
Token ID290
Feature activation+2.53e+00

Pos logit contributions
mun+0.068
 wrong+0.058
roads+0.058
 fishing+0.058
 first+0.058
Neg logit contributions
 relief-0.080
tex-0.078
 ow-0.070
olk-0.070
 silk-0.069
Token want
Token ID765
Feature activation-3.55e-01

Pos logit contributions
mun+0.056
roads+0.047
 wrong+0.047
top+0.046
 fishing+0.046
Neg logit contributions
 relief-0.069
tex-0.064
 silk-0.059
 ow-0.057
 click-0.057
Token to
Token ID284
Feature activation+4.23e-01

Pos logit contributions
 relief+0.009
tex+0.008
 ow+0.008
 silk+0.008
 Kitty+0.008
Neg logit contributions
mun-0.009
our-0.007
roads-0.007
 exception-0.007
top-0.007
Token see
Token ID766
Feature activation+1.12e+00

Pos logit contributions
mun+0.011
 wrong+0.010
roads+0.009
top+0.009
outs+0.009
Neg logit contributions
 relief-0.010
tex-0.009
olk-0.009
 ow-0.009
 silk-0.008
Token what
Token ID644
Feature activation+1.51e+00

Pos logit contributions
mun+0.030
top+0.027
roads+0.027
 leaders+0.026
 wrong+0.026
Neg logit contributions
 relief-0.024
tex-0.023
 silk-0.021
 ow-0.021
olk-0.020
Token he
Token ID339
Feature activation+6.21e+00

Pos logit contributions
mun+0.044
our+0.041
 exception+0.041
 register+0.040
ider+0.039
Neg logit contributions
tex-0.028
 relief-0.027
 ow-0.026
 silk-0.026
 Kitty-0.025
Token has
Token ID468
Feature activation+3.01e+00

Pos logit contributions
mun+0.160
 wrong+0.141
roads+0.141
 fishing+0.140
 shopping+0.136
Neg logit contributions
 relief-0.143
tex-0.141
 ow-0.122
text-0.121
 silk-0.120
Token.
Token ID13
Feature activation+3.30e-01

Pos logit contributions
mun+0.073
roads+0.057
 older+0.055
 fishing+0.055
 exception+0.055
Neg logit contributions
tex-0.075
 relief-0.073
 silk-0.066
text-0.065
 ow-0.064
Token They
Token ID1119
Feature activation+2.22e-01

Pos logit contributions
mun+0.008
 wrong+0.008
 leaders+0.007
 first+0.007
roads+0.007
Neg logit contributions
 relief-0.008
tex-0.008
 Kitty-0.007
 silk-0.007
text-0.007
Token ask
Token ID1265
Feature activation+1.22e+00

Pos logit contributions
mun+0.005
 wrong+0.005
 fishing+0.005
roads+0.005
 shopping+0.004
Neg logit contributions
 relief-0.006
tex-0.005
olk-0.005
 ow-0.005
 click-0.005
Token Mom
Token ID11254
Feature activation+9.14e-01

Pos logit contributions
mun+0.029
roads+0.025
 wrong+0.025
 fishing+0.024
top+0.024
Neg logit contributions
 relief-0.031
tex-0.031
 silk-0.027
olk-0.027
text-0.027
TokenWhat
Token ID2061
Feature activation+5.01e-02

Pos logit contributions
tex+0.003
 relief+0.003
olk+0.003
text+0.003
icol+0.003
Neg logit contributions
mun-0.003
roads-0.003
 first-0.003
P-0.003
top-0.002
Token do
Token ID466
Feature activation+2.61e+00

Pos logit contributions
mun+0.001
roads+0.001
 wrong+0.001
 first+0.001
 fishing+0.001
Neg logit contributions
tex-0.001
 relief-0.001
olk-0.001
 silk-0.001
text-0.001
Token you
Token ID345
Feature activation+9.45e-01

Pos logit contributions
mun+0.053
 register+0.046
 exception+0.045
top+0.045
outs+0.043
Neg logit contributions
tex-0.076
 relief-0.076
 ow-0.075
 Kitty-0.073
 silk-0.072
Token think
Token ID892
Feature activation+2.93e+00

Pos logit contributions
mun+0.020
 wrong+0.018
 first+0.018
roads+0.018
 fishing+0.017
Neg logit contributions
 relief-0.026
tex-0.026
olk-0.024
 silk-0.023
text-0.023
Token's
Token ID338
Feature activation+3.22e+00

Pos logit contributions
mun+0.062
roads+0.050
 wrong+0.050
top+0.049
 fishing+0.049
Neg logit contributions
tex-0.082
 relief-0.082
 silk-0.075
 ow-0.075
olk-0.074
Token up
Token ID510
Feature activation+6.19e+00

Pos logit contributions
 wrong+0.055
 first+0.052
mun+0.048
 fishing+0.048
 older+0.047
Neg logit contributions
tex-0.106
 relief-0.102
text-0.098
olk-0.095
 silk-0.092
Token there
Token ID612
Feature activation+4.32e+00

Pos logit contributions
mun+0.097
 wrong+0.087
 first+0.085
 fishing+0.082
roads+0.081
Neg logit contributions
tex-0.204
 relief-0.204
olk-0.188
text-0.185
 silk-0.183
Token,
Token ID11
Feature activation+8.01e-01

Pos logit contributions
mun+0.078
roads+0.059
top+0.058
 leaders+0.058
 exception+0.057
Neg logit contributions
 relief-0.132
tex-0.131
 ow-0.125
olk-0.121
 silk-0.120
Token cloud
Token ID6279
Feature activation+1.01e+00

Pos logit contributions
 first+0.017
 older+0.017
mun+0.017
 wrong+0.017
 fishing+0.016
Neg logit contributions
tex-0.021
 relief-0.021
olk-0.020
text-0.019
icol-0.019
Token?"
Token ID1701
Feature activation+4.51e+00

Pos logit contributions
mun+0.015
 fishing+0.011
 wrong+0.011
roads+0.011
 shopping+0.011
Neg logit contributions
 relief-0.034
tex-0.034
olk-0.032
 ow-0.031
 silk-0.031
Token Max
Token ID5436
Feature activation-4.96e-01

Pos logit contributions
mun+0.096
 leaders+0.084
 wrong+0.082
 first+0.079
 fishing+0.079
Neg logit contributions
 relief-0.130
tex-0.121
 ow-0.112
 Kitty-0.112
 silk-0.109
Token meant
Token ID4001
Feature activation+3.67e-02

Pos logit contributions
mun+0.001
roads+0.001
 fishing+0.001
 exception+0.001
 leaders+0.001
Neg logit contributions
 relief-0.002
tex-0.002
 silk-0.002
olk-0.001
 ow-0.001
Token everybody
Token ID7288
Feature activation+2.75e+00

Pos logit contributions
mun+0.001
roads+0.001
 wrong+0.001
 fishing+0.001
 first+0.001
Neg logit contributions
 relief-0.001
tex-0.001
 silk-0.001
olk-0.001
text-0.001
Token could
Token ID714
Feature activation+1.67e+00

Pos logit contributions
mun+0.070
roads+0.063
 fishing+0.061
 exception+0.060
 wrong+0.060
Neg logit contributions
 relief-0.063
tex-0.060
 silk-0.052
 ow-0.052
text-0.052
Token find
Token ID1064
Feature activation+1.43e+00

Pos logit contributions
mun+0.048
rob+0.040
roads+0.040
outs+0.038
 fishing+0.038
Neg logit contributions
 relief-0.037
 click-0.032
tex-0.030
 itself-0.030
 sil-0.030
Token what
Token ID644
Feature activation+1.77e+00

Pos logit contributions
mun+0.032
roads+0.027
 wrong+0.027
 fishing+0.026
 first+0.026
Neg logit contributions
 relief-0.039
tex-0.038
 silk-0.034
olk-0.034
text-0.033
Token they
Token ID484
Feature activation+6.17e+00

Pos logit contributions
mun+0.040
roads+0.037
 exception+0.036
our+0.035
 register+0.034
Neg logit contributions
 relief-0.044
tex-0.042
 silk-0.040
 ow-0.039
 sil-0.038
Token needed
Token ID2622
Feature activation+2.46e+00

Pos logit contributions
mun+0.181
roads+0.165
 fishing+0.163
 older+0.157
 wrong+0.157
Neg logit contributions
 relief-0.121
tex-0.106
 ow-0.093
 click-0.091
 sil-0.090
Token easier
Token ID4577
Feature activation+2.30e+00

Pos logit contributions
mun+0.069
roads+0.060
rob+0.057
Isa+0.054
 exception+0.053
Neg logit contributions
 relief-0.051
tex-0.048
 itself-0.046
 silk-0.043
text-0.042
Token and
Token ID290
Feature activation+8.35e-01

Pos logit contributions
mun+0.055
roads+0.045
 exception+0.041
 older+0.041
 fishing+0.040
Neg logit contributions
 relief-0.059
tex-0.056
 silk-0.052
olk-0.050
text-0.049
Token faster
Token ID5443
Feature activation+2.09e+00

Pos logit contributions
mun+0.020
roads+0.017
 fishing+0.017
 exception+0.017
 wrong+0.017
Neg logit contributions
 relief-0.022
tex-0.019
 click-0.018
 silk-0.018
 sil-0.017
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
mun+0.038
 wrong+0.032
 first+0.031
roads+0.031
 fishing+0.031
Neg logit contributions
 relief-0.065
tex-0.064
olk-0.059
 silk-0.058
text-0.057
Token chocolate
Token ID11311
Feature activation-4.80e-01

Pos logit contributions
mun+0.005
 first+0.005
 wrong+0.005
 fishing+0.005
 shopping+0.005
Neg logit contributions
tex-0.006
 relief-0.006
olk-0.005
text-0.005
 silk-0.005
Token and
Token ID290
Feature activation+2.54e-01

Pos logit contributions
 relief+0.012
tex+0.012
olk+0.011
 silk+0.011
text+0.011
Neg logit contributions
mun-0.011
 wrong-0.010
 first-0.010
roads-0.010
 fishing-0.010
Token nuts
Token ID14380
Feature activation-1.67e-03

Pos logit contributions
mun+0.006
 first+0.006
 wrong+0.006
 fishing+0.006
 shopping+0.006
Neg logit contributions
tex-0.006
 relief-0.006
olk-0.005
text-0.005
 silk-0.005
Token.
Token ID13
Feature activation-6.03e-02

Pos logit contributions
 relief+0.000
tex+0.000
 silk+0.000
olk+0.000
 ow+0.000
Neg logit contributions
mun-0.000
roads-0.000
 wrong-0.000
 exception-0.000
 older-0.000
Token Which
Token ID9022
Feature activation+1.54e+00

Pos logit contributions
 relief+0.002
tex+0.002
olk+0.001
text+0.001
 silk+0.001
Neg logit contributions
mun-0.001
 first-0.001
 wrong-0.001
 fishing-0.001
roads-0.001
Token one
Token ID530
Feature activation+6.17e+00

Pos logit contributions
mun+0.028
 wrong+0.023
roads+0.023
 fishing+0.023
 leaders+0.022
Neg logit contributions
 relief-0.048
tex-0.047
olk-0.044
 ow-0.043
 silk-0.043
Token do
Token ID466
Feature activation+2.04e+00

Pos logit contributions
mun+0.163
roads+0.141
 leaders+0.139
top+0.138
 register+0.136
Neg logit contributions
 relief-0.141
tex-0.132
 ow-0.121
olk-0.116
 silk-0.115
Token you
Token ID345
Feature activation+1.27e-01

Pos logit contributions
mun+0.042
 exception+0.038
roads+0.038
top+0.038
 register+0.035
Neg logit contributions
 relief-0.054
 Kitty-0.052
 ow-0.051
tex-0.051
 silk-0.051
Token want
Token ID765
Feature activation+8.37e-01

Pos logit contributions
mun+0.004
roads+0.004
 leaders+0.003
outs+0.003
rob+0.003
Neg logit contributions
 relief-0.002
tex-0.002
 ow-0.002
olk-0.002
 sil-0.002
Token,
Token ID11
Feature activation+3.85e-01

Pos logit contributions
mun+0.020
roads+0.015
 leaders+0.015
 exception+0.014
minded+0.014
Neg logit contributions
tex-0.021
 relief-0.021
olk-0.020
 ow-0.020
 Kitty-0.020
Token Tom
Token ID4186
Feature activation+1.24e+00

Pos logit contributions
mun+0.010
 first+0.009
 wrong+0.009
 fishing+0.009
 older+0.009
Neg logit contributions
tex-0.009
 relief-0.009
olk-0.008
text-0.008
 silk-0.008
Token mom
Token ID1995
Feature activation+2.89e-01

Pos logit contributions
tex+0.016
 relief+0.015
olk+0.014
text+0.014
 silk+0.013
Neg logit contributions
mun-0.014
 first-0.014
 older-0.013
 wrong-0.013
 fishing-0.013
Token,
Token ID11
Feature activation+3.51e-01

Pos logit contributions
mun+0.007
roads+0.006
 wrong+0.006
 fishing+0.006
 leaders+0.006
Neg logit contributions
 relief-0.007
tex-0.007
olk-0.006
 silk-0.006
 ow-0.006
Token "
Token ID366
Feature activation+5.08e-01

Pos logit contributions
mun+0.010
roads+0.009
 wrong+0.009
 exception+0.009
 leaders+0.009
Neg logit contributions
 relief-0.007
tex-0.007
 silk-0.006
olk-0.006
 ow-0.006
TokenWhat
Token ID2061
Feature activation+3.58e-01

Pos logit contributions
mun+0.014
 first+0.013
roads+0.013
 wrong+0.013
 fishing+0.012
Neg logit contributions
 relief-0.011
tex-0.011
olk-0.010
text-0.009
 silk-0.009
Token are
Token ID389
Feature activation+4.01e+00

Pos logit contributions
mun+0.008
roads+0.007
 wrong+0.007
 first+0.007
 fishing+0.007
Neg logit contributions
tex-0.010
 relief-0.009
olk-0.008
 silk-0.008
text-0.008
Token they
Token ID484
Feature activation+6.16e+00

Pos logit contributions
mun+0.088
roads+0.077
top+0.073
 workshop+0.069
 exception+0.068
Neg logit contributions
tex-0.108
 silk-0.105
 ow-0.104
olk-0.101
 Kitty-0.101
Token doing
Token ID1804
Feature activation+2.25e+00

Pos logit contributions
mun+0.127
roads+0.107
top+0.104
outs+0.100
 exception+0.094
Neg logit contributions
 relief-0.174
tex-0.166
 ow-0.160
 click-0.152
olk-0.151
Token?"
Token ID1701
Feature activation+3.45e+00

Pos logit contributions
mun+0.048
roads+0.039
top+0.037
 wrong+0.035
outs+0.035
Neg logit contributions
 relief-0.062
tex-0.060
 silk-0.056
 ow-0.055
olk-0.055
Token
Token ID220
Feature activation+2.50e+00

Pos logit contributions
mun+0.093
 wrong+0.084
roads+0.082
 exception+0.082
 fishing+0.081
Neg logit contributions
 relief-0.076
tex-0.074
olk-0.066
 Kitty-0.064
 click-0.063
Token
Token ID198
Feature activation+5.46e-01

Pos logit contributions
mun+0.059
roads+0.051
 wrong+0.051
 fishing+0.050
 leaders+0.049
Neg logit contributions
tex-0.063
 relief-0.061
olk-0.057
text-0.056
 silk-0.054
Token
Token ID198
Feature activation+4.47e-01

Pos logit contributions
mun+0.014
 wrong+0.013
 exception+0.012
roads+0.012
 fishing+0.012
Neg logit contributions
tex-0.013
 relief-0.013
olk-0.011
text-0.011
 click-0.011
Token have
Token ID423
Feature activation-2.59e-01

Pos logit contributions
mun+0.002
roads+0.002
 fishing+0.002
 wrong+0.002
 older+0.002
Neg logit contributions
 relief-0.002
tex-0.002
olk-0.002
 ow-0.002
 silk-0.002
Token to
Token ID284
Feature activation+2.52e-01

Pos logit contributions
 relief+0.007
tex+0.007
 ow+0.007
 silk+0.007
olk+0.007
Neg logit contributions
mun-0.006
roads-0.005
 exception-0.004
our-0.004
top-0.004
Token give
Token ID1577
Feature activation+9.82e-01

Pos logit contributions
mun+0.006
 wrong+0.005
roads+0.005
 first+0.005
 fishing+0.005
Neg logit contributions
 relief-0.007
tex-0.007
olk-0.006
 silk-0.006
text-0.006
Token them
Token ID606
Feature activation-1.63e-01

Pos logit contributions
mun+0.020
 wrong+0.017
 first+0.017
 fishing+0.017
roads+0.017
Neg logit contributions
 relief-0.029
tex-0.028
olk-0.026
 silk-0.025
text-0.025
Token what
Token ID644
Feature activation+1.28e+00

Pos logit contributions
 relief+0.005
tex+0.004
olk+0.004
 silk+0.004
text+0.004
Neg logit contributions
mun-0.003
 wrong-0.003
 first-0.003
 fishing-0.003
roads-0.003
Token they
Token ID484
Feature activation+6.15e+00

Pos logit contributions
mun+0.027
roads+0.026
 exception+0.025
top+0.024
ios+0.022
Neg logit contributions
tex-0.034
 relief-0.032
 ow-0.032
 Kitty-0.031
 silk-0.031
Token want
Token ID765
Feature activation+1.59e+00

Pos logit contributions
mun+0.179
roads+0.174
 fishing+0.161
top+0.160
 older+0.157
Neg logit contributions
 relief-0.115
tex-0.100
 ow-0.094
 click-0.085
bit-0.083
Token."
Token ID526
Feature activation+2.92e+00

Pos logit contributions
mun+0.041
roads+0.036
 exception+0.032
top+0.032
 leaders+0.032
Neg logit contributions
tex-0.036
 relief-0.036
 silk-0.032
text-0.032
 ow-0.032
Token
Token ID198
Feature activation+3.80e-01

Pos logit contributions
mun+0.103
 fastest+0.095
roads+0.093
 wrong+0.093
 leaders+0.092
Neg logit contributions
tex-0.040
 relief-0.038
 Snow-0.036
ot-0.033
 Kitty-0.033
Token
Token ID198
Feature activation+2.69e-01

Pos logit contributions
mun+0.007
 wrong+0.006
 first+0.006
roads+0.006
 fishing+0.006
Neg logit contributions
 relief-0.012
tex-0.012
olk-0.011
 silk-0.010
 ow-0.010
Token"
Token ID1
Feature activation-1.41e-01

Pos logit contributions
mun+0.008
 wrong+0.007
 first+0.007
 fishing+0.007
roads+0.007
Neg logit contributions
 relief-0.005
tex-0.005
olk-0.005
 silk-0.004
text-0.004
Token them
Token ID606
Feature activation+1.75e-01

Pos logit contributions
 first+0.021
mun+0.021
 wrong+0.020
 fishing+0.018
 older+0.018
Neg logit contributions
 relief-0.039
tex-0.038
olk-0.036
icol-0.035
text-0.035
Token to
Token ID284
Feature activation+5.80e-01

Pos logit contributions
mun+0.005
 wrong+0.004
roads+0.004
 fishing+0.004
 first+0.004
Neg logit contributions
 relief-0.004
tex-0.004
olk-0.003
 silk-0.003
 ow-0.003
Token find
Token ID1064
Feature activation+8.45e-01

Pos logit contributions
mun+0.013
 wrong+0.012
 first+0.011
roads+0.011
 fishing+0.011
Neg logit contributions
 relief-0.015
tex-0.015
olk-0.014
 silk-0.013
text-0.013
Token out
Token ID503
Feature activation+3.52e+00

Pos logit contributions
mun+0.018
 wrong+0.015
 first+0.015
roads+0.015
 fishing+0.015
Neg logit contributions
 relief-0.024
tex-0.023
olk-0.021
 silk-0.021
text-0.021
Token what
Token ID644
Feature activation+1.51e+00

Pos logit contributions
mun+0.093
 wrong+0.084
 first+0.083
roads+0.082
 fishing+0.082
Neg logit contributions
 relief-0.080
tex-0.079
olk-0.071
 silk-0.069
text-0.068
Token they
Token ID484
Feature activation+6.14e+00

Pos logit contributions
mun+0.034
 exception+0.032
our+0.032
roads+0.032
top+0.030
Neg logit contributions
 relief-0.036
tex-0.036
 ow-0.034
 Kitty-0.034
 silk-0.033
Token were
Token ID547
Feature activation+4.65e+00

Pos logit contributions
mun+0.171
 fishing+0.152
roads+0.152
 wrong+0.150
 exception+0.147
Neg logit contributions
 relief-0.133
tex-0.126
 ow-0.111
olk-0.108
 silk-0.108
Token,
Token ID11
Feature activation+6.30e-01

Pos logit contributions
mun+0.122
roads+0.113
rob+0.102
Isa+0.102
itable+0.101
Neg logit contributions
 relief-0.094
 ow-0.086
 click-0.082
 silk-0.082
 th-0.082
Token and
Token ID290
Feature activation+6.41e-01

Pos logit contributions
mun+0.014
 wrong+0.012
 first+0.012
 fishing+0.012
roads+0.012
Neg logit contributions
 relief-0.017
tex-0.017
olk-0.015
 silk-0.015
text-0.015
Token was
Token ID373
Feature activation+6.27e-01

Pos logit contributions
mun+0.012
 wrong+0.010
roads+0.010
 exception+0.009
 fishing+0.009
Neg logit contributions
 relief-0.020
tex-0.019
 silk-0.017
 ow-0.017
olk-0.017
Token surprised
Token ID6655
Feature activation+2.64e+00

Pos logit contributions
mun+0.015
roads+0.013
our+0.013
re+0.012
top+0.012
Neg logit contributions
 relief-0.016
tex-0.014
 ow-0.014
 silk-0.014
 aver-0.013
Token go
Token ID467
Feature activation+9.20e-01

Pos logit contributions
mun+0.052
 wrong+0.047
 first+0.046
roads+0.046
 fishing+0.046
Neg logit contributions
 relief-0.037
tex-0.037
olk-0.033
 silk-0.031
text-0.031
Token and
Token ID290
Feature activation+3.97e-01

Pos logit contributions
mun+0.027
 wrong+0.022
outs+0.022
roads+0.021
 leaders+0.021
Neg logit contributions
tex-0.019
 relief-0.018
 click-0.017
text-0.016
 Kitty-0.016
Token see
Token ID766
Feature activation+5.85e-04

Pos logit contributions
mun+0.011
 wrong+0.010
outs+0.010
 leaders+0.009
 exception+0.009
Neg logit contributions
 relief-0.009
tex-0.008
 click-0.008
 Kitty-0.007
 silk-0.007
Token what
Token ID644
Feature activation+4.39e-01

Pos logit contributions
mun+0.000
 wrong+0.000
roads+0.000
 first+0.000
 fishing+0.000
Neg logit contributions
 relief-0.000
tex-0.000
olk-0.000
 silk-0.000
text-0.000
Token's
Token ID338
Feature activation+3.81e+00

Pos logit contributions
mun+0.009
 exception+0.008
roads+0.007
top+0.007
 first+0.007
Neg logit contributions
tex-0.013
 relief-0.012
text-0.012
 silk-0.011
 ow-0.011
Token up
Token ID510
Feature activation+6.13e+00

Pos logit contributions
mun+0.100
roads+0.085
 wrong+0.084
 first+0.084
top+0.083
Neg logit contributions
 relief-0.090
tex-0.088
olk-0.078
 silk-0.077
 ow-0.075
Token there
Token ID612
Feature activation+4.46e+00

Pos logit contributions
mun+0.161
roads+0.126
yard+0.121
top+0.119
 register+0.118
Neg logit contributions
 relief-0.149
tex-0.138
 silk-0.132
 click-0.129
 sil-0.129
Token,"
Token ID553
Feature activation+5.05e+00

Pos logit contributions
mun+0.124
roads+0.104
 leaders+0.097
outs+0.097
top+0.097
Neg logit contributions
 relief-0.090
tex-0.088
 ow-0.083
 click-0.082
 silk-0.080
Token said
Token ID531
Feature activation+2.07e+00

Pos logit contributions
mun+0.128
roads+0.122
 leaders+0.121
 fastest+0.121
 wrong+0.121
Neg logit contributions
 relief-0.114
tex-0.100
 ow-0.091
 Kitty-0.088
text-0.087
Token Anna
Token ID11735
Feature activation-2.65e-01

Pos logit contributions
mun+0.061
roads+0.057
 wrong+0.053
ll+0.053
 exception+0.053
Neg logit contributions
 relief-0.042
tex-0.038
olk-0.035
text-0.034
 ow-0.033
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
 relief+0.006
tex+0.006
 click+0.006
ker+0.006
 silk+0.006
Neg logit contributions
mun-0.006
roads-0.005
 wrong-0.005
 leaders-0.005
 contest-0.005
Token then
Token ID788
Feature activation+9.28e-01

Pos logit contributions
mun+0.005
 wrong+0.005
 first+0.005
roads+0.004
 fishing+0.004
Neg logit contributions
 relief-0.008
tex-0.008
olk-0.007
text-0.007
 silk-0.007
Token Jane
Token ID12091
Feature activation+7.73e-01

Pos logit contributions
mun+0.017
 wrong+0.015
 first+0.015
roads+0.015
 fishing+0.014
Neg logit contributions
tex-0.028
 relief-0.028
olk-0.026
text-0.025
 silk-0.025
Token asked
Token ID1965
Feature activation+2.66e+00

Pos logit contributions
mun+0.017
 wrong+0.015
 first+0.015
roads+0.015
 fishing+0.015
Neg logit contributions
 relief-0.021
tex-0.020
olk-0.019
 silk-0.018
text-0.018
Token what
Token ID644
Feature activation+6.31e-01

Pos logit contributions
mun+0.052
 first+0.049
 wrong+0.049
 shopping+0.045
 older+0.044
Neg logit contributions
tex-0.078
 relief-0.078
olk-0.072
text-0.070
 silk-0.070
Token it
Token ID340
Feature activation+4.08e+00

Pos logit contributions
mun+0.017
roads+0.015
 wrong+0.015
 first+0.015
 fishing+0.015
Neg logit contributions
 relief-0.014
tex-0.014
 silk-0.012
olk-0.012
text-0.012
Token weighed
Token ID20261
Feature activation+6.12e+00

Pos logit contributions
mun+0.102
 wrong+0.093
 first+0.092
roads+0.088
 fishing+0.087
Neg logit contributions
 relief-0.100
tex-0.098
olk-0.089
 silk-0.085
text-0.085
Token.
Token ID13
Feature activation+3.28e-01

Pos logit contributions
mun+0.147
roads+0.118
 leaders+0.114
 exception+0.109
 fishing+0.107
Neg logit contributions
 relief-0.155
tex-0.149
 silk-0.148
olk-0.140
 itself-0.136
Token Her
Token ID2332
Feature activation-5.59e-01

Pos logit contributions
mun+0.008
 wrong+0.007
 fishing+0.007
roads+0.007
 leaders+0.007
Neg logit contributions
 relief-0.008
tex-0.008
olk-0.007
text-0.007
 silk-0.007
Token mom
Token ID1995
Feature activation-1.93e-02

Pos logit contributions
tex+0.017
 relief+0.016
text+0.016
 silk+0.015
olk+0.015
Neg logit contributions
 first-0.011
 older-0.011
 wrong-0.010
mun-0.009
 shopping-0.009
Token said
Token ID531
Feature activation+1.41e+00

Pos logit contributions
 relief+0.001
tex+0.001
olk+0.001
 ow+0.001
text+0.001
Neg logit contributions
mun-0.000
 wrong-0.000
roads-0.000
 fishing-0.000
 first-0.000
Token it
Token ID340
Feature activation-2.70e-01

Pos logit contributions
mun+0.029
 wrong+0.027
 first+0.026
 fishing+0.024
 shopping+0.024
Neg logit contributions
tex-0.040
 relief-0.040
olk-0.037
text-0.036
 silk-0.035
Token parents
Token ID3397
Feature activation+1.53e+00

Pos logit contributions
mun+0.018
 wrong+0.016
 first+0.016
 fishing+0.016
roads+0.015
Neg logit contributions
 relief-0.023
tex-0.023
olk-0.021
 silk-0.020
text-0.020
Token and
Token ID290
Feature activation+1.48e+00

Pos logit contributions
mun+0.044
roads+0.042
rob+0.038
 leaders+0.038
outs+0.037
Neg logit contributions
 relief-0.025
tex-0.025
ker-0.023
 Kitty-0.023
 Snow-0.022
Token always
Token ID1464
Feature activation+1.92e+00

Pos logit contributions
mun+0.033
roads+0.031
top+0.030
outs+0.029
 wrong+0.029
Neg logit contributions
 relief-0.038
tex-0.037
 ow-0.032
 click-0.032
 silk-0.031
Token did
Token ID750
Feature activation-4.92e-02

Pos logit contributions
mun+0.044
 wrong+0.039
roads+0.038
 first+0.038
 fishing+0.038
Neg logit contributions
 relief-0.050
tex-0.049
olk-0.044
 silk-0.043
text-0.043
Token what
Token ID644
Feature activation+8.88e-01

Pos logit contributions
 relief+0.001
tex+0.001
olk+0.001
 silk+0.001
 ow+0.001
Neg logit contributions
mun-0.001
roads-0.001
 wrong-0.001
 first-0.001
 fishing-0.001
Token they
Token ID484
Feature activation+6.11e+00

Pos logit contributions
mun+0.016
our+0.014
roads+0.014
 exception+0.014
 register+0.013
Neg logit contributions
tex-0.027
 relief-0.026
 ow-0.025
 Kitty-0.024
text-0.024
Token asked
Token ID1965
Feature activation+4.02e+00

Pos logit contributions
mun+0.168
 fishing+0.157
roads+0.156
top+0.151
 wrong+0.149
Neg logit contributions
 relief-0.125
tex-0.118
 ow-0.106
 click-0.096
olk-0.094
Token.
Token ID13
Feature activation+5.41e-01

Pos logit contributions
mun+0.086
roads+0.074
 wrong+0.067
top+0.067
 fishing+0.066
Neg logit contributions
 relief-0.113
tex-0.109
 silk-0.100
 ow-0.097
text-0.096
Token
Token ID220
Feature activation+1.89e+00

Pos logit contributions
mun+0.015
roads+0.013
 register+0.013
rob+0.013
 fastest+0.012
Neg logit contributions
 relief-0.011
tex-0.010
 Kitty-0.010
 It-0.010
Wow-0.010
Token
Token ID198
Feature activation+2.35e-01

Pos logit contributions
mun+0.050
 wrong+0.045
roads+0.044
 leaders+0.044
 fishing+0.043
Neg logit contributions
tex-0.042
 relief-0.039
text-0.036
 Kitty-0.035
olk-0.034
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
mun+0.008
 register+0.007
 leaders+0.007
our+0.007
 wrong+0.007
Neg logit contributions
tex-0.003
 Kitty-0.003
ay-0.003
Wow-0.002
text-0.002
Token she
Token ID673
Feature activation+4.52e-01

Pos logit contributions
 relief+0.006
tex+0.006
olk+0.006
text+0.006
icol+0.006
Neg logit contributions
mun-0.002
 first-0.002
 wrong-0.001
 fishing-0.001
 he-0.001
Token could
Token ID714
Feature activation-4.59e-01

Pos logit contributions
mun+0.009
 first+0.009
 wrong+0.008
 fishing+0.008
re+0.008
Neg logit contributions
 relief-0.013
tex-0.013
olk-0.013
text-0.011
 silk-0.011
Token and
Token ID290
Feature activation-8.19e-01

Pos logit contributions
 relief+0.012
tex+0.012
olk+0.011
 silk+0.011
text+0.011
Neg logit contributions
mun-0.010
 wrong-0.009
roads-0.009
 first-0.009
 fishing-0.009
Token pushed
Token ID7121
Feature activation+6.95e-01

Pos logit contributions
tex+0.024
olk+0.024
 relief+0.023
text+0.022
 silk+0.021
Neg logit contributions
mun-0.017
 first-0.016
 wrong-0.016
roads-0.015
 he-0.015
Token Wh
Token ID854
Feature activation-2.20e+00

Pos logit contributions
 first+0.014
 wrong+0.012
mun+0.012
 he+0.011
 one+0.011
Neg logit contributions
tex-0.020
 relief-0.020
olk-0.019
 lull-0.019
 ow-0.018
Tokenisk
Token ID1984
Feature activation-5.70e+00

Pos logit contributions
 relief+0.040
tex+0.036
olk+0.032
 silk+0.031
 ow+0.029
Neg logit contributions
mun-0.071
 wrong-0.066
 first-0.065
re-0.063
top-0.063
Tokeners
Token ID364
Feature activation-4.67e-01

Pos logit contributions
 relief+0.181
tex+0.169
olk+0.167
 ow+0.161
 silk+0.155
Neg logit contributions
mun-0.104
 wrong-0.095
 first-0.094
roads-0.088
top-0.085
Token out
Token ID503
Feature activation+4.88e-01

Pos logit contributions
 relief+0.010
tex+0.010
olk+0.009
 silk+0.009
text+0.009
Neg logit contributions
mun-0.013
 wrong-0.011
roads-0.011
 first-0.011
 fishing-0.011
Token of
Token ID286
Feature activation-8.18e-01

Pos logit contributions
mun+0.009
 first+0.009
 wrong+0.008
 older+0.008
roads+0.008
Neg logit contributions
 relief-0.015
tex-0.015
olk-0.014
icol-0.013
 ow-0.013
Token the
Token ID262
Feature activation+4.67e-01

Pos logit contributions
tex+0.028
text+0.026
olk+0.026
 relief+0.026
kered+0.025
Neg logit contributions
 first-0.013
 shopping-0.011
 fishing-0.011
 wrong-0.010
 older-0.010
Token way
Token ID835
Feature activation+1.20e-01

Pos logit contributions
mun+0.011
 exception+0.009
roads+0.009
 wrong+0.009
 first+0.009
Neg logit contributions
 relief-0.012
tex-0.012
olk-0.011
 ow-0.011
 silk-0.010
Token while
Token ID981
Feature activation+5.29e-01

Pos logit contributions
 relief+0.044
tex+0.043
olk+0.040
text+0.039
 ow+0.037
Neg logit contributions
mun-0.034
 first-0.032
 wrong-0.032
 fishing-0.030
roads-0.030
Token he
Token ID339
Feature activation+2.12e+00

Pos logit contributions
mun+0.006
 wrong+0.005
roads+0.005
 first+0.004
 fishing+0.004
Neg logit contributions
 relief-0.020
tex-0.020
olk-0.018
 silk-0.018
 ow-0.018
Token climbed
Token ID19952
Feature activation+2.02e-01

Pos logit contributions
mun+0.048
 first+0.046
 wrong+0.042
 older+0.041
 fishing+0.040
Neg logit contributions
 relief-0.058
tex-0.054
olk-0.053
 silk-0.049
 click-0.049
Token.
Token ID13
Feature activation-8.43e-01

Pos logit contributions
mun+0.005
 wrong+0.004
roads+0.004
 first+0.004
 fishing+0.004
Neg logit contributions
 relief-0.005
tex-0.005
olk-0.005
 silk-0.005
text-0.005
Token Wh
Token ID854
Feature activation-1.94e+00

Pos logit contributions
 relief+0.023
tex+0.023
olk+0.021
 silk+0.020
text+0.020
Neg logit contributions
mun-0.018
 wrong-0.017
 first-0.017
 fishing-0.016
 shopping-0.016
Tokenisk
Token ID1984
Feature activation-5.62e+00

Pos logit contributions
 relief+0.039
tex+0.033
 silk+0.032
 click+0.031
olk+0.030
Neg logit contributions
mun-0.062
 wrong-0.056
 first-0.055
re-0.054
 exception-0.053
Tokeners
Token ID364
Feature activation+1.81e-01

Pos logit contributions
 relief+0.212
olk+0.197
icol+0.196
tex+0.190
 ow+0.188
Neg logit contributions
mun-0.077
 wrong-0.065
 first-0.063
roads-0.055
 leaders-0.054
Token me
Token ID502
Feature activation-2.26e+00

Pos logit contributions
mun+0.005
 wrong+0.004
 fishing+0.004
roads+0.004
 exception+0.004
Neg logit contributions
tex-0.004
 relief-0.004
olk-0.003
text-0.003
 ow-0.003
Tokenowed
Token ID6972
Feature activation-2.07e+00

Pos logit contributions
 relief+0.058
tex+0.056
olk+0.054
text+0.051
icol+0.051
Neg logit contributions
mun-0.055
 first-0.054
 wrong-0.051
re-0.049
top-0.048
Token and
Token ID290
Feature activation-8.72e-01

Pos logit contributions
tex+0.062
 relief+0.059
olk+0.057
text+0.056
 ow+0.053
Neg logit contributions
mun-0.039
 first-0.038
 fishing-0.037
 wrong-0.037
top-0.035
Token watched
Token ID7342
Feature activation-7.87e-02

Pos logit contributions
tex+0.028
 relief+0.028
olk+0.027
text+0.026
 silk+0.025
Neg logit contributions
mun-0.015
 first-0.013
 wrong-0.013
 fishing-0.012
roads-0.012
Token.
Token ID13
Feature activation-3.56e-01

Pos logit contributions
mun+0.036
roads+0.031
 fastest+0.029
 leaders+0.029
yard+0.028
Neg logit contributions
 relief-0.035
tex-0.033
 click-0.031
 ow-0.030
 lull-0.030
Token He
Token ID679
Feature activation+4.99e-02

Pos logit contributions
tex+0.013
 relief+0.013
olk+0.012
text+0.012
 silk+0.012
Neg logit contributions
 wrong-0.004
mun-0.004
 first-0.004
 older-0.004
 fishing-0.004
Token laughed
Token ID13818
Feature activation+1.09e-01

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
 leaders+0.001
 older+0.001
Neg logit contributions
 relief-0.001
tex-0.001
olk-0.001
 silk-0.001
text-0.001
Token and
Token ID290
Feature activation-2.88e-01

Pos logit contributions
mun+0.002
 wrong+0.002
 first+0.002
 older+0.002
 fishing+0.002
Neg logit contributions
 relief-0.003
tex-0.003
olk-0.003
 silk-0.003
text-0.003
Token threw
Token ID9617
Feature activation+4.64e-01

Pos logit contributions
tex+0.009
 relief+0.009
olk+0.009
text+0.008
 silk+0.008
Neg logit contributions
mun-0.005
 wrong-0.005
 first-0.005
 fishing-0.004
 asks-0.004
Token Jazz
Token ID21406
Feature activation-5.61e+00

Pos logit contributions
 wrong+0.009
 first+0.008
mun+0.008
 older+0.007
 fishing+0.007
Neg logit contributions
tex-0.014
 relief-0.014
olk-0.014
icol-0.013
text-0.013
Token on
Token ID319
Feature activation-3.04e+00

Pos logit contributions
 relief+0.155
tex+0.148
olk+0.143
 silk+0.132
 ow+0.132
Neg logit contributions
mun-0.120
 wrong-0.114
 first-0.112
 fishing-0.106
roads-0.104
Token the
Token ID262
Feature activation-4.32e-01

Pos logit contributions
 relief+0.093
tex+0.093
olk+0.088
text+0.084
 ow+0.081
Neg logit contributions
 first-0.053
 wrong-0.052
mun-0.050
 fishing-0.048
 older-0.047
Token floor
Token ID4314
Feature activation-1.56e-01

Pos logit contributions
 relief+0.012
tex+0.012
olk+0.011
text+0.011
icol+0.010
Neg logit contributions
 first-0.009
 wrong-0.009
mun-0.008
 older-0.008
 shopping-0.007
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 relief+0.004
tex+0.004
 silk+0.004
 ow+0.004
 click+0.004
Neg logit contributions
mun-0.003
roads-0.003
 fishing-0.003
 exception-0.003
 shopping-0.003
Token Jazz
Token ID21406
Feature activation-4.26e+00

Pos logit contributions
tex+0.035
 relief+0.034
olk+0.033
text+0.031
 silk+0.030
Neg logit contributions
 wrong-0.023
mun-0.023
 first-0.022
 older-0.021
 fishing-0.021
Token unlock
Token ID12116
Feature activation+1.19e-01

Pos logit contributions
tex+0.011
 relief+0.010
text+0.010
olk+0.010
icol+0.009
Neg logit contributions
 first-0.007
mun-0.007
 wrong-0.007
roads-0.007
 fishing-0.007
Token the
Token ID262
Feature activation+5.61e-01

Pos logit contributions
mun+0.002
 wrong+0.002
 first+0.002
 fishing+0.002
roads+0.002
Neg logit contributions
 relief-0.003
tex-0.003
olk-0.003
text-0.003
 ow-0.003
Token lock
Token ID5793
Feature activation-8.18e-01

Pos logit contributions
 wrong+0.013
 first+0.012
 older+0.011
 fishing+0.011
 shopping+0.011
Neg logit contributions
tex-0.015
text-0.015
 relief-0.014
olk-0.013
icol-0.013
Token.
Token ID13
Feature activation-7.51e-01

Pos logit contributions
 relief+0.016
tex+0.015
 silk+0.014
 click+0.014
olk+0.014
Neg logit contributions
mun-0.025
 exception-0.020
roads-0.020
 leaders-0.020
 fishing-0.020
Token Wh
Token ID854
Feature activation-1.86e+00

Pos logit contributions
 relief+0.024
tex+0.023
olk+0.023
 silk+0.022
 ow+0.022
Neg logit contributions
mun-0.012
 fishing-0.012
 first-0.011
 wrong-0.011
 older-0.010
Tokenisk
Token ID1984
Feature activation-5.59e+00

Pos logit contributions
 relief+0.041
olk+0.034
 click+0.033
 silk+0.033
tex+0.033
Neg logit contributions
mun-0.056
 wrong-0.050
 first-0.050
re-0.050
our-0.049
Tokeners
Token ID364
Feature activation+3.77e-01

Pos logit contributions
 relief+0.209
olk+0.197
icol+0.196
tex+0.190
 ow+0.189
Neg logit contributions
mun-0.075
 wrong-0.065
 first-0.064
roads-0.054
 leaders-0.051
Token was
Token ID373
Feature activation-7.20e-02

Pos logit contributions
mun+0.009
 first+0.008
top+0.007
 wrong+0.007
roads+0.007
Neg logit contributions
 relief-0.010
tex-0.010
olk-0.009
 silk-0.009
 ow-0.009
Token so
Token ID523
Feature activation+1.24e+00

Pos logit contributions
tex+0.002
olk+0.002
text+0.002
 relief+0.002
icol+0.002
Neg logit contributions
 first-0.001
 wrong-0.001
 older-0.001
 fishing-0.001
mun-0.001
Token happy
Token ID3772
Feature activation-6.02e-01

Pos logit contributions
 first+0.021
 older+0.019
 fishing+0.019
mun+0.019
 wrong+0.018
Neg logit contributions
tex-0.040
olk-0.039
 relief-0.037
icol-0.037
 silk-0.036
Token!
Token ID0
Feature activation-5.93e-01

Pos logit contributions
 relief+0.014
tex+0.014
 silk+0.013
olk+0.012
 ow+0.012
Neg logit contributions
mun-0.015
roads-0.014
 wrong-0.013
 first-0.013
 exception-0.013
Token The
Token ID383
Feature activation-4.40e-01

Pos logit contributions
 relief+0.016
tex+0.015
olk+0.014
 silk+0.014
 ow+0.013
Neg logit contributions
mun-0.014
 wrong-0.013
 first-0.013
 fishing-0.013
roads-0.012
Token door
Token ID3420
Feature activation-1.58e+00

Pos logit contributions
tex+0.014
text+0.013
 relief+0.013
olk+0.013
icol+0.013
Neg logit contributions
 wrong-0.008
 first-0.008
mun-0.008
 fishing-0.007
 older-0.007
Token broke
Token ID6265
Feature activation-1.59e+00

Pos logit contributions
tex+0.041
 relief+0.040
olk+0.037
text+0.036
 silk+0.035
Neg logit contributions
 first-0.035
 wrong-0.034
mun-0.034
roads-0.032
 older-0.032
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
 relief+0.050
tex+0.049
text+0.043
olk+0.042
 ow+0.041
Neg logit contributions
 first-0.030
 wrong-0.029
 fishing-0.028
 older-0.027
mun-0.026
Token Wh
Token ID854
Feature activation-1.59e+00

Pos logit contributions
 relief+0.027
tex+0.026
olk+0.026
text+0.024
icol+0.023
Neg logit contributions
 wrong-0.027
 first-0.027
 he-0.026
mun-0.025
 older-0.024
Tokenisk
Token ID1984
Feature activation-5.55e+00

Pos logit contributions
 relief+0.030
tex+0.025
olk+0.024
 silk+0.024
 click+0.023
Neg logit contributions
mun-0.051
 wrong-0.047
 first-0.046
re-0.045
 shopping-0.044
Tokeners
Token ID364
Feature activation+1.65e-01

Pos logit contributions
 relief+0.197
olk+0.183
icol+0.181
tex+0.179
 ow+0.175
Neg logit contributions
mun-0.083
 wrong-0.076
 first-0.070
roads-0.063
 leaders-0.062
Token got
Token ID1392
Feature activation-9.65e-01

Pos logit contributions
mun+0.004
 first+0.003
 wrong+0.003
roads+0.003
re+0.003
Neg logit contributions
 relief-0.004
tex-0.004
olk-0.004
 silk-0.004
icol-0.004
Token stuck
Token ID7819
Feature activation-8.01e-03

Pos logit contributions
tex+0.029
 relief+0.029
olk+0.026
 silk+0.026
text+0.026
Neg logit contributions
mun-0.018
 wrong-0.017
 first-0.016
 fishing-0.016
roads-0.015
Token inside
Token ID2641
Feature activation-3.88e-01

Pos logit contributions
 relief+0.000
tex+0.000
olk+0.000
text+0.000
 silk+0.000
Neg logit contributions
mun-0.000
 wrong-0.000
 first-0.000
 fishing-0.000
roads-0.000
Token.
Token ID13
Feature activation-6.64e-01

Pos logit contributions
tex+0.012
 relief+0.012
olk+0.011
text+0.011
 silk+0.011
Neg logit contributions
mun-0.007
 wrong-0.006
 first-0.006
 fishing-0.006
roads-0.006
Token with
Token ID351
Feature activation-2.64e-01

Pos logit contributions
mun+0.009
 wrong+0.008
 first+0.008
roads+0.008
 fishing+0.008
Neg logit contributions
 relief-0.012
tex-0.012
olk-0.011
 silk-0.011
text-0.011
Token the
Token ID262
Feature activation+3.96e-01

Pos logit contributions
tex+0.009
 relief+0.009
olk+0.008
icol+0.008
text+0.008
Neg logit contributions
 wrong-0.004
 first-0.003
mun-0.003
 he-0.003
 fishing-0.003
Token brush
Token ID14093
Feature activation-1.61e+00

Pos logit contributions
mun+0.011
 wrong+0.010
 first+0.010
 older+0.010
 fishing+0.010
Neg logit contributions
tex-0.009
 relief-0.008
text-0.007
olk-0.007
icol-0.007
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
 relief+0.049
tex+0.048
olk+0.044
 silk+0.043
text+0.043
Neg logit contributions
mun-0.030
 wrong-0.026
 first-0.025
roads-0.025
 fishing-0.025
Token Wh
Token ID854
Feature activation-1.78e+00

Pos logit contributions
 relief+0.006
tex+0.006
olk+0.005
 silk+0.005
 ow+0.005
Neg logit contributions
mun-0.005
 wrong-0.004
 first-0.004
 fishing-0.004
roads-0.004
Tokenisk
Token ID1984
Feature activation-5.53e+00

Pos logit contributions
 relief+0.034
tex+0.029
olk+0.027
 silk+0.027
 click+0.026
Neg logit contributions
mun-0.057
 wrong-0.052
 first-0.052
re-0.050
 shopping-0.050
Tokeners
Token ID364
Feature activation+5.39e-01

Pos logit contributions
 relief+0.201
icol+0.189
olk+0.189
tex+0.185
 ow+0.181
Neg logit contributions
mun-0.082
 wrong-0.071
 first-0.068
roads-0.060
 leaders-0.058
Token got
Token ID1392
Feature activation-5.81e-01

Pos logit contributions
mun+0.012
 first+0.011
 wrong+0.011
roads+0.010
top+0.010
Neg logit contributions
 relief-0.015
tex-0.014
olk-0.013
 silk-0.013
icol-0.013
Token very
Token ID845
Feature activation-2.50e-01

Pos logit contributions
 relief+0.017
tex+0.016
olk+0.015
 silk+0.015
text+0.015
Neg logit contributions
mun-0.012
roads-0.010
 wrong-0.010
 first-0.010
 fishing-0.010
Token angry
Token ID7954
Feature activation+3.05e-01

Pos logit contributions
 relief+0.007
tex+0.007
olk+0.007
 ow+0.006
 silk+0.006
Neg logit contributions
mun-0.005
roads-0.004
 wrong-0.004
 first-0.004
 fishing-0.004
Token and
Token ID290
Feature activation+7.31e-02

Pos logit contributions
mun+0.008
 wrong+0.007
 first+0.007
 fishing+0.007
roads+0.007
Neg logit contributions
 relief-0.007
tex-0.007
olk-0.007
 silk-0.006
text-0.006
Token a
Token ID257
Feature activation-1.50e+00

Pos logit contributions
olk+0.017
tex+0.016
 relief+0.015
 ow+0.015
text+0.015
Neg logit contributions
 first-0.007
 he-0.006
 one-0.006
 wrong-0.006
 older-0.006
Token happy
Token ID3772
Feature activation-1.26e+00

Pos logit contributions
olk+0.045
tex+0.040
adoes+0.039
keeping+0.038
 relief+0.037
Neg logit contributions
mun-0.028
 first-0.028
 he-0.025
 fishing-0.025
 guessing-0.025
Token song
Token ID3496
Feature activation-1.81e+00

Pos logit contributions
tex+0.028
olk+0.028
text+0.026
 aver+0.024
Got+0.024
Neg logit contributions
 first-0.032
 fishing-0.031
 he-0.031
 wrong-0.030
mun-0.030
Token and
Token ID290
Feature activation-3.51e-01

Pos logit contributions
 relief+0.053
tex+0.052
olk+0.048
text+0.047
 silk+0.046
Neg logit contributions
mun-0.036
 wrong-0.032
 first-0.032
 fishing-0.031
roads-0.030
Token Wh
Token ID854
Feature activation-1.52e+00

Pos logit contributions
olk+0.013
 relief+0.013
tex+0.013
text+0.012
 silk+0.011
Neg logit contributions
mun-0.005
 first-0.004
 he-0.004
 wrong-0.004
 asks-0.003
Tokenisk
Token ID1984
Feature activation-5.48e+00

Pos logit contributions
 relief+0.033
tex+0.028
olk+0.026
 click+0.026
 silk+0.025
Neg logit contributions
mun-0.046
 wrong-0.042
 first-0.041
re-0.041
ll-0.040
Tokeners
Token ID364
Feature activation+2.41e-01

Pos logit contributions
 relief+0.195
olk+0.184
icol+0.182
tex+0.179
 ow+0.174
Neg logit contributions
mun-0.082
 wrong-0.070
 first-0.066
 leaders-0.059
roads-0.058
Token listened
Token ID16399
Feature activation+1.08e+00

Pos logit contributions
mun+0.006
 first+0.005
 wrong+0.005
 leaders+0.005
 fishing+0.005
Neg logit contributions
 relief-0.006
tex-0.006
olk-0.006
 silk-0.005
text-0.005
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
mun+0.027
roads+0.024
rob+0.021
 register+0.020
 leaders+0.020
Neg logit contributions
 relief-0.027
 silk-0.026
tex-0.026
 lull-0.025
 sil-0.025
Token
Token ID198
Feature activation+5.82e-02

Pos logit contributions
 relief+0.007
olk+0.007
tex+0.007
 ow+0.006
 silk+0.006
Neg logit contributions
mun-0.003
 first-0.002
 fishing-0.002
 asks-0.002
 he-0.002
Token
Token ID198
Feature activation+1.98e-01

Pos logit contributions
mun+0.001
 first+0.001
 wrong+0.001
 fishing+0.001
roads+0.001
Neg logit contributions
 relief-0.002
tex-0.002
olk-0.002
 silk-0.002
 ow-0.002
Token happy
Token ID3772
Feature activation-2.98e-01

Pos logit contributions
mun+0.032
 first+0.029
 wrong+0.029
 older+0.029
 bored+0.029
Neg logit contributions
tex-0.048
olk-0.046
 relief-0.046
 silk-0.045
icol-0.045
Token that
Token ID326
Feature activation-1.32e+00

Pos logit contributions
 relief+0.007
tex+0.007
 silk+0.006
 ow+0.006
olk+0.006
Neg logit contributions
mun-0.007
roads-0.007
 wrong-0.006
 first-0.006
top-0.006
Token she
Token ID673
Feature activation+5.72e-01

Pos logit contributions
 relief+0.049
tex+0.048
olk+0.046
 silk+0.044
text+0.044
Neg logit contributions
mun-0.017
 first-0.014
 wrong-0.014
roads-0.012
 fishing-0.012
Token hugged
Token ID41130
Feature activation-1.37e+00

Pos logit contributions
mun+0.012
 first+0.011
 wrong+0.010
 leaders+0.010
 fishing+0.010
Neg logit contributions
 relief-0.016
olk-0.016
tex-0.016
 silk-0.015
text-0.014
Token Wh
Token ID854
Feature activation-2.22e+00

Pos logit contributions
 relief+0.038
tex+0.035
olk+0.035
 ray+0.034
text+0.033
Neg logit contributions
mun-0.030
 first-0.029
 wrong-0.028
 he-0.026
 one-0.026
Tokenisk
Token ID1984
Feature activation-5.47e+00

Pos logit contributions
 relief+0.031
tex+0.030
olk+0.025
 silk+0.023
text+0.023
Neg logit contributions
mun-0.079
 wrong-0.073
 first-0.072
roads-0.072
 fishing-0.072
Tokeners
Token ID364
Feature activation+8.56e-02

Pos logit contributions
 relief+0.164
tex+0.159
olk+0.150
 ow+0.145
 silk+0.144
Neg logit contributions
mun-0.108
 wrong-0.095
 first-0.092
roads-0.088
 fishing-0.087
Token.
Token ID13
Feature activation-6.97e-01

Pos logit contributions
mun+0.002
roads+0.002
 wrong+0.002
 exception+0.002
 leaders+0.002
Neg logit contributions
 relief-0.002
tex-0.002
 silk-0.001
olk-0.001
text-0.001
Token "
Token ID366
Feature activation-2.31e-01

Pos logit contributions
 relief+0.024
tex+0.022
olk+0.021
text+0.021
 ow+0.020
Neg logit contributions
mun-0.011
 first-0.011
 shopping-0.010
 wrong-0.010
yard-0.010
TokenThank
Token ID10449
Feature activation-1.58e+00

Pos logit contributions
 relief+0.006
olk+0.006
tex+0.006
text+0.006
 ow+0.006
Neg logit contributions
mun-0.005
P-0.004
 first-0.004
top-0.004
roads-0.004
Token you
Token ID345
Feature activation-3.28e+00

Pos logit contributions
tex+0.061
 relief+0.061
 silk+0.056
text+0.056
 ow+0.056
Neg logit contributions
mun-0.016
 wrong-0.012
roads-0.012
 exception-0.012
top-0.011
Token.
Token ID13
Feature activation+4.02e-01

Pos logit contributions
mun+0.002
 wrong+0.002
 first+0.002
top+0.002
roads+0.002
Neg logit contributions
tex-0.006
 relief-0.006
olk-0.006
text-0.005
 silk-0.005
Token The
Token ID383
Feature activation+7.06e-01

Pos logit contributions
mun+0.010
 wrong+0.009
 first+0.009
 leaders+0.009
roads+0.009
Neg logit contributions
 relief-0.010
tex-0.009
olk-0.008
text-0.008
 Kitty-0.008
Token vet
Token ID20202
Feature activation+1.72e+00

Pos logit contributions
mun+0.013
 wrong+0.011
 first+0.011
 fishing+0.011
roads+0.010
Neg logit contributions
 relief-0.022
tex-0.022
olk-0.020
text-0.020
 silk-0.020
Token showed
Token ID3751
Feature activation+1.62e+00

Pos logit contributions
mun+0.038
 wrong+0.034
 first+0.033
roads+0.033
 fishing+0.033
Neg logit contributions
 relief-0.046
tex-0.045
olk-0.041
 silk-0.040
text-0.040
Token Wh
Token ID854
Feature activation-2.13e+00

Pos logit contributions
mun+0.031
 wrong+0.029
 first+0.029
 older+0.027
top+0.025
Neg logit contributions
 relief-0.048
tex-0.046
olk-0.044
icol-0.042
 silk-0.042
Tokenisk
Token ID1984
Feature activation-5.47e+00

Pos logit contributions
 relief+0.037
tex+0.034
olk+0.030
 silk+0.029
 ow+0.028
Neg logit contributions
mun-0.070
 wrong-0.065
 first-0.065
roads-0.062
 fishing-0.062
Tokeners
Token ID364
Feature activation+4.76e-02

Pos logit contributions
 relief+0.162
tex+0.157
olk+0.148
 ow+0.142
 silk+0.141
Neg logit contributions
mun-0.109
 wrong-0.097
 first-0.095
roads-0.092
 fishing-0.090
Token the
Token ID262
Feature activation+6.99e-01

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
 fishing+0.001
roads+0.001
Neg logit contributions
 relief-0.001
tex-0.001
olk-0.001
 silk-0.001
text-0.001
Token x
Token ID2124
Feature activation-2.00e+00

Pos logit contributions
mun+0.016
 wrong+0.014
roads+0.014
 first+0.014
 fishing+0.014
Neg logit contributions
 relief-0.018
tex-0.018
olk-0.016
 silk-0.016
 ow-0.016
Token-
Token ID12
Feature activation-1.80e+00

Pos logit contributions
 relief+0.051
tex+0.051
olk+0.046
text+0.044
 silk+0.044
Neg logit contributions
mun-0.046
 wrong-0.043
 first-0.042
 fishing-0.041
roads-0.041
Tokenray
Token ID2433
Feature activation+5.75e-01

Pos logit contributions
 relief+0.044
tex+0.043
olk+0.039
 silk+0.038
text+0.037
Neg logit contributions
mun-0.044
 wrong-0.039
 fishing-0.038
roads-0.038
 first-0.038
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
tex+0.020
 relief+0.019
 silk+0.017
olk+0.017
text+0.017
Neg logit contributions
mun-0.028
 wrong-0.024
roads-0.024
 fishing-0.024
 leaders-0.024
Token held
Token ID2714
Feature activation+2.91e-01

Pos logit contributions
 relief+0.029
tex+0.028
olk+0.027
text+0.025
 silk+0.025
Neg logit contributions
mun-0.028
 first-0.025
 wrong-0.025
roads-0.024
 fishing-0.024
Token tight
Token ID5381
Feature activation-2.96e+00

Pos logit contributions
 first+0.006
 fishing+0.005
mun+0.005
 wrong+0.005
 shopping+0.005
Neg logit contributions
 relief-0.009
tex-0.008
olk-0.008
 ow-0.008
 lull-0.008
Token.
Token ID13
Feature activation-8.11e-01

Pos logit contributions
 relief+0.093
tex+0.092
olk+0.085
 silk+0.085
text+0.083
Neg logit contributions
mun-0.052
roads-0.043
 wrong-0.042
 first-0.041
 fishing-0.041
Token Wh
Token ID854
Feature activation-1.81e+00

Pos logit contributions
 relief+0.023
tex+0.022
olk+0.021
 silk+0.020
 ray+0.019
Neg logit contributions
mun-0.017
 fishing-0.016
 first-0.016
 wrong-0.016
 he-0.015
Tokenisk
Token ID1984
Feature activation-5.45e+00

Pos logit contributions
 relief+0.035
tex+0.030
 silk+0.029
olk+0.028
 click+0.028
Neg logit contributions
mun-0.058
 wrong-0.053
 first-0.052
re-0.051
 shopping-0.050
Tokeners
Token ID364
Feature activation+3.19e-01

Pos logit contributions
 relief+0.195
olk+0.182
icol+0.179
tex+0.178
 ow+0.175
Neg logit contributions
mun-0.083
 wrong-0.071
 first-0.069
roads-0.064
 leaders-0.059
Token pushed
Token ID7121
Feature activation+5.13e-01

Pos logit contributions
mun+0.008
 wrong+0.008
roads+0.008
 fishing+0.008
 exception+0.007
Neg logit contributions
tex-0.007
 relief-0.007
olk-0.006
text-0.006
 silk-0.006
Token him
Token ID683
Feature activation-2.46e+00

Pos logit contributions
 first+0.008
mun+0.007
 fishing+0.007
 wrong+0.007
 he+0.007
Neg logit contributions
olk-0.017
 relief-0.017
tex-0.016
icol-0.016
 ow-0.015
Token gently
Token ID15165
Feature activation-1.37e+00

Pos logit contributions
 relief+0.073
tex+0.071
olk+0.067
text+0.063
 ow+0.063
Neg logit contributions
 first-0.047
mun-0.047
 wrong-0.046
 fishing-0.044
 shopping-0.042
Token,
Token ID11
Feature activation-7.96e-01

Pos logit contributions
 relief+0.036
tex+0.035
olk+0.032
 silk+0.031
text+0.031
Neg logit contributions
mun-0.031
 wrong-0.028
 first-0.027
roads-0.027
 fishing-0.027
Token yard
Token ID12699
Feature activation-8.08e-01

Pos logit contributions
 relief+0.034
tex+0.033
olk+0.031
text+0.030
 silk+0.030
Neg logit contributions
mun-0.021
 wrong-0.019
 first-0.018
 fishing-0.018
roads-0.018
Token.
Token ID13
Feature activation-6.37e-01

Pos logit contributions
 relief+0.018
tex+0.016
 click+0.016
 Kitty+0.016
 silk+0.016
Neg logit contributions
mun-0.020
 asks-0.018
P-0.018
 exception-0.017
roads-0.017
Token Buster
Token ID34452
Feature activation-1.76e+00

Pos logit contributions
 relief+0.018
tex+0.017
olk+0.016
 silk+0.015
 ow+0.015
Neg logit contributions
mun-0.014
 fishing-0.012
 first-0.012
 wrong-0.012
 shopping-0.012
Token saw
Token ID2497
Feature activation-1.08e-01

Pos logit contributions
 relief+0.047
tex+0.045
olk+0.045
 silk+0.043
 ow+0.041
Neg logit contributions
mun-0.041
 first-0.037
 wrong-0.035
roads-0.034
 fishing-0.034
Token Wh
Token ID854
Feature activation-1.79e+00

Pos logit contributions
 relief+0.003
olk+0.003
tex+0.003
text+0.003
 click+0.003
Neg logit contributions
mun-0.002
 wrong-0.002
 first-0.002
 fishing-0.002
 older-0.002
Tokenisk
Token ID1984
Feature activation-5.42e+00

Pos logit contributions
 relief+0.026
tex+0.025
olk+0.021
 silk+0.020
text+0.019
Neg logit contributions
mun-0.063
 wrong-0.058
 first-0.058
roads-0.057
 fishing-0.057
Tokeners
Token ID364
Feature activation+3.62e-01

Pos logit contributions
 relief+0.168
tex+0.161
olk+0.157
 ow+0.149
 silk+0.147
Neg logit contributions
mun-0.104
 wrong-0.090
 first-0.089
roads-0.086
 fishing-0.081
Token and
Token ID290
Feature activation-2.90e-01

Pos logit contributions
mun+0.008
 wrong+0.007
 first+0.007
roads+0.007
 fishing+0.007
Neg logit contributions
 relief-0.010
tex-0.010
olk-0.009
 silk-0.009
text-0.009
Token started
Token ID2067
Feature activation+4.65e-01

Pos logit contributions
 relief+0.008
olk+0.008
tex+0.008
text+0.007
icol+0.007
Neg logit contributions
mun-0.006
 first-0.006
 he-0.006
 fishing-0.005
 older-0.005
Token to
Token ID284
Feature activation-2.89e-01

Pos logit contributions
mun+0.009
roads+0.008
 wrong+0.007
our+0.007
 exception+0.007
Neg logit contributions
 relief-0.013
tex-0.013
 silk-0.012
 Kitty-0.012
 ow-0.012
Token chase
Token ID15505
Feature activation+6.52e-01

Pos logit contributions
 relief+0.007
tex+0.007
olk+0.006
 silk+0.006
text+0.006
Neg logit contributions
mun-0.007
 wrong-0.006
 first-0.006
roads-0.006
 fishing-0.006
Token singing
Token ID13777
Feature activation-1.78e+00

Pos logit contributions
mun+0.016
roads+0.015
 exception+0.013
top+0.013
 wrong+0.013
Neg logit contributions
 relief-0.015
tex-0.013
 silk-0.013
 ow-0.013
 click-0.012
Token and
Token ID290
Feature activation-6.21e-01

Pos logit contributions
tex+0.056
olk+0.054
 relief+0.053
text+0.050
 ow+0.048
Neg logit contributions
 first-0.030
 wrong-0.030
 fishing-0.029
mun-0.029
 he-0.028
Token looked
Token ID3114
Feature activation+5.11e-01

Pos logit contributions
tex+0.018
 relief+0.017
olk+0.017
text+0.017
 silk+0.016
Neg logit contributions
mun-0.013
 first-0.011
 wrong-0.011
 fishing-0.011
 asks-0.010
Token at
Token ID379
Feature activation+2.42e-02

Pos logit contributions
mun+0.008
 wrong+0.007
 first+0.007
 fishing+0.007
roads+0.006
Neg logit contributions
tex-0.017
 relief-0.017
olk-0.016
text-0.015
 silk-0.015
Token Wh
Token ID854
Feature activation-1.67e+00

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
 fishing+0.001
 he+0.000
Neg logit contributions
 relief-0.001
olk-0.001
tex-0.001
 click-0.001
 ray-0.001
Tokenisk
Token ID1984
Feature activation-5.41e+00

Pos logit contributions
 relief+0.023
tex+0.023
olk+0.019
 silk+0.018
text+0.017
Neg logit contributions
mun-0.059
 wrong-0.055
 first-0.054
roads-0.054
 fishing-0.053
Tokeners
Token ID364
Feature activation+8.78e-01

Pos logit contributions
 relief+0.157
tex+0.154
olk+0.144
 silk+0.139
 ow+0.138
Neg logit contributions
mun-0.110
 wrong-0.097
 first-0.095
roads-0.092
 fishing-0.090
Token.
Token ID13
Feature activation-5.32e-01

Pos logit contributions
mun+0.018
roads+0.015
 wrong+0.015
 exception+0.015
 first+0.014
Neg logit contributions
 relief-0.025
tex-0.024
 click-0.022
 silk-0.022
olk-0.022
Token
Token ID198
Feature activation-1.96e-01

Pos logit contributions
 relief+0.015
tex+0.014
olk+0.013
 silk+0.013
text+0.013
Neg logit contributions
mun-0.011
 fishing-0.011
 wrong-0.010
 first-0.010
 older-0.010
Token
Token ID198
Feature activation-9.92e-02

Pos logit contributions
 relief+0.007
olk+0.007
tex+0.007
 silk+0.007
 ow+0.006
Neg logit contributions
mun-0.003
 first-0.002
 fishing-0.002
 wrong-0.002
roads-0.002
Token"
Token ID1
Feature activation-2.01e-01

Pos logit contributions
 relief+0.003
tex+0.003
olk+0.003
 silk+0.002
 ow+0.002
Neg logit contributions
mun-0.002
 first-0.002
 fishing-0.002
roads-0.002
 wrong-0.002
Token drove
Token ID10357
Feature activation+9.96e-01

Pos logit contributions
 relief+0.011
tex+0.011
olk+0.010
 silk+0.010
 ow+0.010
Neg logit contributions
mun-0.011
 first-0.010
 wrong-0.010
 fishing-0.010
roads-0.009
Token home
Token ID1363
Feature activation-1.82e+00

Pos logit contributions
mun+0.019
 wrong+0.017
roads+0.016
 first+0.016
 fishing+0.016
Neg logit contributions
 relief-0.030
tex-0.029
olk-0.027
 silk-0.027
text-0.026
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
 relief+0.047
tex+0.044
 click+0.042
 silk+0.041
 Kitty+0.040
Neg logit contributions
mun-0.041
roads-0.037
 exception-0.035
 wrong-0.034
 fishing-0.033
Token Lily
Token ID20037
Feature activation-1.94e+00

Pos logit contributions
tex+0.039
 relief+0.038
olk+0.035
 ow+0.034
 aver+0.033
Neg logit contributions
 older-0.024
 first-0.024
 wrong-0.023
 shopping-0.023
mun-0.023
Token hugged
Token ID41130
Feature activation-2.09e+00

Pos logit contributions
 relief+0.049
tex+0.048
olk+0.044
 silk+0.043
text+0.042
Neg logit contributions
mun-0.046
 wrong-0.041
 first-0.041
roads-0.040
 fishing-0.040
Token Rose
Token ID8049
Feature activation-5.41e+00

Pos logit contributions
 relief+0.066
tex+0.063
olk+0.060
 aver+0.058
text+0.058
Neg logit contributions
 first-0.036
mun-0.036
 wrong-0.034
 older-0.033
 asks-0.030
Token and
Token ID290
Feature activation-8.21e-01

Pos logit contributions
 relief+0.157
tex+0.149
olk+0.147
icol+0.141
 ow+0.140
Neg logit contributions
 first-0.107
mun-0.106
 wrong-0.100
 older-0.094
 shopping-0.094
Token said
Token ID531
Feature activation-2.83e-01

Pos logit contributions
tex+0.026
 relief+0.026
olk+0.025
 silk+0.024
text+0.024
Neg logit contributions
mun-0.014
 first-0.012
 wrong-0.011
 asks-0.011
 older-0.011
Token,
Token ID11
Feature activation-1.65e+00

Pos logit contributions
 relief+0.009
tex+0.009
olk+0.008
 silk+0.008
text+0.008
Neg logit contributions
mun-0.005
 wrong-0.005
 first-0.004
 fishing-0.004
 older-0.004
Token "
Token ID366
Feature activation-2.31e-01

Pos logit contributions
 relief+0.050
tex+0.050
olk+0.047
text+0.047
 silk+0.044
Neg logit contributions
mun-0.031
 first-0.030
 fishing-0.029
 wrong-0.028
 shopping-0.027
TokenWe
Token ID1135
Feature activation+2.43e-01

Pos logit contributions
tex+0.006
 relief+0.006
olk+0.006
 ow+0.006
text+0.006
Neg logit contributions
mun-0.005
top-0.004
P-0.004
 first-0.004
roads-0.004
Token his
Token ID465
Feature activation-6.79e-01

Pos logit contributions
 relief+0.009
tex+0.009
olk+0.008
text+0.008
 silk+0.008
Neg logit contributions
mun-0.006
 wrong-0.005
 first-0.005
 fishing-0.005
roads-0.005
Token paw
Token ID41827
Feature activation-1.17e+00

Pos logit contributions
tex+0.016
 relief+0.016
text+0.015
olk+0.015
icol+0.014
Neg logit contributions
 first-0.016
 wrong-0.016
mun-0.016
 older-0.016
 fishing-0.016
Token again
Token ID757
Feature activation+1.10e+00

Pos logit contributions
 relief+0.037
tex+0.037
olk+0.034
text+0.034
icol+0.034
Neg logit contributions
 wrong-0.020
 first-0.019
mun-0.019
 fishing-0.017
 older-0.016
Token.
Token ID13
Feature activation-5.55e-01

Pos logit contributions
mun+0.024
roads+0.019
 wrong+0.019
 leaders+0.018
 exception+0.018
Neg logit contributions
 relief-0.031
tex-0.030
 silk-0.028
 ow-0.028
olk-0.027
Token Wh
Token ID854
Feature activation-1.74e+00

Pos logit contributions
 relief+0.018
olk+0.017
tex+0.017
 silk+0.016
 ow+0.015
Neg logit contributions
mun-0.009
 first-0.009
 wrong-0.009
 fishing-0.008
 older-0.008
Tokenisk
Token ID1984
Feature activation-5.40e+00

Pos logit contributions
 relief+0.031
tex+0.027
olk+0.025
 silk+0.025
 click+0.024
Neg logit contributions
mun-0.057
 wrong-0.052
 first-0.052
roads-0.050
 shopping-0.050
Tokeners
Token ID364
Feature activation+4.91e-01

Pos logit contributions
 relief+0.184
olk+0.174
tex+0.171
icol+0.170
 ow+0.164
Neg logit contributions
mun-0.090
 wrong-0.079
 first-0.075
roads-0.071
 leaders-0.067
Token loved
Token ID6151
Feature activation-1.17e+00

Pos logit contributions
mun+0.013
 wrong+0.011
 first+0.011
roads+0.011
 fishing+0.011
Neg logit contributions
 relief-0.012
tex-0.011
olk-0.010
 silk-0.010
text-0.010
Token playing
Token ID2712
Feature activation-1.88e+00

Pos logit contributions
 relief+0.041
tex+0.040
olk+0.040
text+0.038
 click+0.036
Neg logit contributions
mun-0.016
 fishing-0.016
 wrong-0.015
 first-0.015
 shopping-0.014
Token in
Token ID287
Feature activation-1.02e+00

Pos logit contributions
 relief+0.052
tex+0.052
olk+0.047
 silk+0.046
text+0.046
Neg logit contributions
mun-0.040
 wrong-0.035
 first-0.035
roads-0.034
 fishing-0.034
Token the
Token ID262
Feature activation-7.42e-02

Pos logit contributions
 relief+0.030
tex+0.029
olk+0.027
text+0.026
 silk+0.026
Neg logit contributions
mun-0.020
 first-0.019
 wrong-0.018
 fishing-0.018
roads-0.018
Token of
Token ID286
Feature activation-1.42e+00

Pos logit contributions
mun+0.047
 wrong+0.042
 first+0.041
 fishing+0.041
roads+0.040
Neg logit contributions
 relief-0.053
tex-0.053
olk-0.047
 silk-0.046
text-0.046
Token the
Token ID262
Feature activation+3.31e-01

Pos logit contributions
tex+0.044
 relief+0.042
olk+0.041
text+0.041
icol+0.039
Neg logit contributions
mun-0.025
 first-0.024
 wrong-0.023
 fishing-0.022
 shopping-0.021
Token camera
Token ID4676
Feature activation+1.67e-01

Pos logit contributions
mun+0.009
 wrong+0.008
 first+0.008
 fishing+0.008
roads+0.008
Neg logit contributions
tex-0.008
 relief-0.008
olk-0.007
text-0.007
 silk-0.007
Token.
Token ID13
Feature activation-2.95e-01

Pos logit contributions
mun+0.003
roads+0.003
 fishing+0.003
 wrong+0.003
 leaders+0.002
Neg logit contributions
 relief-0.005
tex-0.005
olk-0.005
 silk-0.005
 ow-0.004
Token Wh
Token ID854
Feature activation-1.82e+00

Pos logit contributions
tex+0.007
 relief+0.007
olk+0.006
 silk+0.006
text+0.006
Neg logit contributions
mun-0.008
 wrong-0.007
 first-0.007
 fishing-0.007
 shopping-0.007
Tokenisk
Token ID1984
Feature activation-5.39e+00

Pos logit contributions
 relief+0.035
tex+0.031
 silk+0.030
olk+0.030
 click+0.028
Neg logit contributions
mun-0.058
 wrong-0.053
 first-0.052
re-0.050
roads-0.049
Tokeners
Token ID364
Feature activation+6.51e-01

Pos logit contributions
 relief+0.190
olk+0.180
icol+0.179
tex+0.176
 ow+0.171
Neg logit contributions
mun-0.084
 wrong-0.073
 first-0.070
roads-0.064
 leaders-0.061
Token wanted
Token ID2227
Feature activation-1.24e+00

Pos logit contributions
mun+0.016
 wrong+0.015
 first+0.014
roads+0.014
 fishing+0.014
Neg logit contributions
 relief-0.016
tex-0.016
olk-0.014
 silk-0.014
text-0.013
Token to
Token ID284
Feature activation-2.31e-01

Pos logit contributions
 relief+0.041
tex+0.040
olk+0.037
 silk+0.037
text+0.036
Neg logit contributions
mun-0.021
 wrong-0.017
roads-0.017
 first-0.017
 fishing-0.016
Token be
Token ID307
Feature activation+4.70e-02

Pos logit contributions
tex+0.006
 relief+0.006
olk+0.006
text+0.006
 silk+0.006
Neg logit contributions
mun-0.005
 first-0.005
 wrong-0.004
roads-0.004
 fishing-0.004
Token in
Token ID287
Feature activation+1.11e+00

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
 fishing+0.001
 older+0.001
Neg logit contributions
tex-0.001
 relief-0.001
olk-0.001
text-0.001
 silk-0.001
Token graceful
Token ID44363
Feature activation-2.10e-01

Pos logit contributions
mun+0.035
 wrong+0.031
 first+0.031
 fishing+0.030
roads+0.030
Neg logit contributions
tex-0.048
 relief-0.048
olk-0.044
 silk-0.042
text-0.042
Token."
Token ID526
Feature activation+1.93e+00

Pos logit contributions
 relief+0.005
tex+0.005
olk+0.005
 silk+0.005
text+0.004
Neg logit contributions
mun-0.005
 wrong-0.004
roads-0.004
 first-0.004
 fishing-0.004
Token
Token ID198
Feature activation-3.02e-02

Pos logit contributions
mun+0.033
 fishing+0.028
 first+0.028
 wrong+0.028
roads+0.026
Neg logit contributions
 relief-0.062
tex-0.060
olk-0.059
 silk-0.056
 ow-0.055
Token
Token ID198
Feature activation+9.51e-03

Pos logit contributions
 relief+0.001
olk+0.001
tex+0.001
 silk+0.001
 ow+0.001
Neg logit contributions
mun-0.000
 first-0.000
 fishing-0.000
 wrong-0.000
roads-0.000
TokenWh
Token ID1199
Feature activation-2.37e+00

Pos logit contributions
mun+0.000
 first+0.000
 fishing+0.000
roads+0.000
 wrong+0.000
Neg logit contributions
 relief-0.000
olk-0.000
tex-0.000
 ow-0.000
 silk-0.000
Tokenisk
Token ID1984
Feature activation-5.38e+00

Pos logit contributions
 relief+0.048
olk+0.043
 silk+0.041
tex+0.040
 click+0.039
Neg logit contributions
mun-0.073
 wrong-0.066
 first-0.066
roads-0.063
re-0.063
Tokeners
Token ID364
Feature activation+8.59e-01

Pos logit contributions
 relief+0.193
olk+0.187
icol+0.181
tex+0.179
 ow+0.174
Neg logit contributions
mun-0.080
 wrong-0.067
 first-0.066
 leaders-0.057
roads-0.056
Token climbed
Token ID19952
Feature activation-6.84e-01

Pos logit contributions
mun+0.020
 first+0.018
 wrong+0.018
 fishing+0.017
 leaders+0.017
Neg logit contributions
 relief-0.022
tex-0.022
olk-0.020
 silk-0.020
text-0.019
Token the
Token ID262
Feature activation-7.42e-02

Pos logit contributions
 relief+0.020
tex+0.020
olk+0.018
text+0.018
 ow+0.017
Neg logit contributions
mun-0.014
 first-0.013
 wrong-0.013
 fishing-0.012
roads-0.012
Token tree
Token ID5509
Feature activation-3.18e-01

Pos logit contributions
 relief+0.002
tex+0.002
text+0.002
 lull+0.002
icol+0.002
Neg logit contributions
 first-0.002
 wrong-0.001
 shopping-0.001
 older-0.001
 contest-0.001
Token and
Token ID290
Feature activation-9.65e-01

Pos logit contributions
 relief+0.007
tex+0.007
 click+0.006
 silk+0.006
olk+0.006
Neg logit contributions
mun-0.009
 wrong-0.008
roads-0.007
 leaders-0.007
 exception-0.007
Token share
Token ID2648
Feature activation-1.04e+00

Pos logit contributions
mun+0.013
 wrong+0.012
 first+0.012
 fishing+0.012
roads+0.011
Neg logit contributions
tex-0.016
 relief-0.016
olk-0.015
text-0.014
 silk-0.014
Token."
Token ID526
Feature activation+1.63e+00

Pos logit contributions
 relief+0.030
tex+0.030
olk+0.028
text+0.026
 silk+0.026
Neg logit contributions
mun-0.021
 first-0.019
 wrong-0.019
 fishing-0.018
roads-0.017
Token Spot
Token ID15899
Feature activation-1.50e+00

Pos logit contributions
mun+0.041
 first+0.035
 wrong+0.035
 fishing+0.035
roads+0.034
Neg logit contributions
 relief-0.041
tex-0.039
olk-0.036
 silk-0.036
 ow-0.034
Token and
Token ID290
Feature activation-2.37e-01

Pos logit contributions
 relief+0.042
olk+0.041
tex+0.041
 silk+0.039
 ow+0.037
Neg logit contributions
mun-0.033
 first-0.030
 wrong-0.028
 leaders-0.027
 older-0.027
Token Wh
Token ID854
Feature activation-1.44e+00

Pos logit contributions
olk+0.005
 relief+0.005
tex+0.005
 silk+0.005
text+0.005
Neg logit contributions
mun-0.007
 first-0.006
 he-0.006
 wrong-0.006
 fishing-0.006
Tokenisk
Token ID1984
Feature activation-5.38e+00

Pos logit contributions
 relief+0.030
tex+0.026
 silk+0.026
olk+0.025
 click+0.024
Neg logit contributions
mun-0.045
 wrong-0.040
 first-0.040
re-0.039
 leaders-0.037
Tokeners
Token ID364
Feature activation+6.99e-01

Pos logit contributions
 relief+0.195
icol+0.194
olk+0.185
 ow+0.180
 silk+0.179
Neg logit contributions
mun-0.077
 wrong-0.063
 first-0.060
 leaders-0.054
roads-0.052
Token agreed
Token ID4987
Feature activation+7.45e-01

Pos logit contributions
mun+0.017
 first+0.015
 wrong+0.015
 leaders+0.014
 fishing+0.014
Neg logit contributions
 relief-0.018
tex-0.018
olk-0.017
 silk-0.016
 ow-0.015
Token,
Token ID11
Feature activation-5.43e-01

Pos logit contributions
mun+0.014
 first+0.013
 wrong+0.013
 fishing+0.012
 shopping+0.012
Neg logit contributions
tex-0.023
 relief-0.022
olk-0.021
text-0.020
 silk-0.020
Token and
Token ID290
Feature activation-6.86e-02

Pos logit contributions
 relief+0.014
tex+0.014
olk+0.013
 silk+0.013
text+0.012
Neg logit contributions
mun-0.012
 first-0.012
 wrong-0.011
 fishing-0.011
 shopping-0.011
Token they
Token ID484
Feature activation+1.23e+00

Pos logit contributions
tex+0.003
 relief+0.003
olk+0.002
 silk+0.002
 ow+0.002
Neg logit contributions
mun-0.001
 first-0.001
 fishing-0.001
 he-0.001
 wrong-0.001
Token,
Token ID11
Feature activation-9.54e-01

Pos logit contributions
 relief+0.022
tex+0.021
 silk+0.019
olk+0.019
 ow+0.019
Neg logit contributions
mun-0.023
roads-0.021
 wrong-0.020
 first-0.020
 exception-0.020
Token and
Token ID290
Feature activation-4.69e-01

Pos logit contributions
tex+0.040
 relief+0.040
olk+0.040
text+0.038
 silk+0.037
Neg logit contributions
mun-0.006
 fishing-0.005
 first-0.004
 wrong-0.004
 he-0.004
Token he
Token ID339
Feature activation+1.41e+00

Pos logit contributions
tex+0.018
 relief+0.018
olk+0.018
 silk+0.017
text+0.016
Neg logit contributions
mun-0.005
 fishing-0.004
 first-0.004
 he-0.004
 wrong-0.004
Token and
Token ID290
Feature activation-5.92e-01

Pos logit contributions
mun+0.032
 first+0.029
 asks+0.028
 fishing+0.027
 leaders+0.027
Neg logit contributions
 relief-0.039
olk-0.037
tex-0.035
 silk-0.034
 ow-0.032
Token Wh
Token ID854
Feature activation-1.63e+00

Pos logit contributions
olk+0.016
 silk+0.015
 relief+0.015
arl+0.014
ay+0.014
Neg logit contributions
mun-0.014
 he-0.013
 first-0.012
 fishing-0.012
 asks-0.011
Tokenisk
Token ID1984
Feature activation-5.38e+00

Pos logit contributions
 relief+0.031
tex+0.027
olk+0.026
 silk+0.026
 click+0.024
Neg logit contributions
mun-0.052
 wrong-0.047
 first-0.047
re-0.046
top-0.045
Tokeners
Token ID364
Feature activation+4.77e-01

Pos logit contributions
 relief+0.180
olk+0.169
tex+0.168
icol+0.165
 ow+0.161
Neg logit contributions
mun-0.092
 wrong-0.078
 first-0.075
roads-0.071
 leaders-0.070
Token played
Token ID2826
Feature activation-2.31e+00

Pos logit contributions
mun+0.012
 wrong+0.011
 first+0.011
 fishing+0.010
roads+0.010
Neg logit contributions
 relief-0.012
tex-0.011
olk-0.010
 silk-0.010
text-0.010
Token with
Token ID351
Feature activation-1.47e+00

Pos logit contributions
tex+0.069
 relief+0.069
olk+0.065
text+0.062
 silk+0.062
Neg logit contributions
mun-0.044
 wrong-0.039
 first-0.038
 fishing-0.037
roads-0.037
Token the
Token ID262
Feature activation-8.59e-01

Pos logit contributions
tex+0.045
 relief+0.045
olk+0.044
icol+0.043
text+0.041
Neg logit contributions
mun-0.026
 wrong-0.023
 first-0.023
 fishing-0.022
 he-0.021
Token new
Token ID649
Feature activation-1.87e+00

Pos logit contributions
tex+0.021
 relief+0.020
text+0.019
olk+0.019
icol+0.018
Neg logit contributions
 wrong-0.021
mun-0.021
 first-0.021
 fishing-0.020
 older-0.020
Token just
Token ID655
Feature activation+1.99e-01

Pos logit contributions
mun+0.003
roads+0.002
 leaders+0.002
 research+0.002
 asks+0.002
Neg logit contributions
tex-0.003
 relief-0.003
 silk-0.003
olk-0.003
 click-0.003
Token in
Token ID287
Feature activation+1.26e+00

Pos logit contributions
mun+0.004
 first+0.004
 wrong+0.004
 fishing+0.004
roads+0.004
Neg logit contributions
tex-0.006
 relief-0.005
olk-0.005
text-0.005
 silk-0.005
Token time
Token ID640
Feature activation-2.86e-01

Pos logit contributions
 first+0.023
 wrong+0.021
mun+0.021
 shopping+0.020
 fishing+0.020
Neg logit contributions
tex-0.038
olk-0.037
 relief-0.036
icol-0.034
text-0.033
Token.
Token ID13
Feature activation-6.22e-01

Pos logit contributions
 relief+0.008
tex+0.008
 silk+0.008
olk+0.008
 click+0.007
Neg logit contributions
mun-0.006
roads-0.005
 wrong-0.005
 fishing-0.004
 leaders-0.004
Token Wh
Token ID854
Feature activation-1.89e+00

Pos logit contributions
 relief+0.021
tex+0.020
olk+0.020
 silk+0.019
 ow+0.019
Neg logit contributions
mun-0.010
 first-0.008
 wrong-0.008
 shopping-0.008
top-0.008
Tokenisk
Token ID1984
Feature activation-5.38e+00

Pos logit contributions
 relief+0.040
tex+0.033
 silk+0.033
 click+0.032
olk+0.032
Neg logit contributions
mun-0.058
 wrong-0.053
 first-0.052
re-0.051
our-0.050
Tokeners
Token ID364
Feature activation+5.19e-01

Pos logit contributions
 relief+0.201
olk+0.189
icol+0.187
tex+0.179
 ow+0.178
Neg logit contributions
mun-0.076
 wrong-0.064
 first-0.062
roads-0.056
 leaders-0.053
Token was
Token ID373
Feature activation-9.66e-01

Pos logit contributions
mun+0.013
 first+0.012
 wrong+0.011
roads+0.011
 fishing+0.011
Neg logit contributions
 relief-0.014
olk-0.013
tex-0.013
 silk-0.012
icol-0.012
Token safe
Token ID3338
Feature activation-2.19e+00

Pos logit contributions
tex+0.032
olk+0.031
 relief+0.030
text+0.029
 silk+0.029
Neg logit contributions
 wrong-0.016
 first-0.015
mun-0.015
 older-0.015
 fishing-0.015
Token and
Token ID290
Feature activation+1.62e-01

Pos logit contributions
 relief+0.053
tex+0.053
olk+0.048
 silk+0.046
text+0.046
Neg logit contributions
mun-0.054
 wrong-0.049
 first-0.049
 fishing-0.047
roads-0.047
Token sound
Token ID2128
Feature activation-1.38e+00

Pos logit contributions
mun+0.004
 first+0.003
 wrong+0.003
 he+0.003
 older+0.003
Neg logit contributions
tex-0.005
olk-0.005
 relief-0.004
text-0.004
 silk-0.004
Token family
Token ID1641
Feature activation+4.18e-02

Pos logit contributions
mun+0.020
 first+0.020
 wrong+0.019
 older+0.019
 fishing+0.019
Neg logit contributions
tex-0.024
 relief-0.024
olk-0.022
text-0.021
 silk-0.020
Token sh
Token ID427
Feature activation+6.88e-02

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
 fishing+0.001
roads+0.001
Neg logit contributions
 relief-0.001
tex-0.001
olk-0.001
 silk-0.001
text-0.001
Tokenoo
Token ID2238
Feature activation-2.16e+00

Pos logit contributions
mun+0.002
 wrong+0.002
 first+0.002
roads+0.002
top+0.002
Neg logit contributions
 relief-0.002
tex-0.002
 silk-0.001
olk-0.001
text-0.001
Tokened
Token ID276
Feature activation+2.67e-01

Pos logit contributions
 relief+0.057
tex+0.053
olk+0.053
 silk+0.052
icol+0.047
Neg logit contributions
mun-0.051
 wrong-0.049
 first-0.048
re-0.043
 older-0.042
Token Wh
Token ID854
Feature activation-1.76e+00

Pos logit contributions
mun+0.005
 first+0.005
 wrong+0.005
 fishing+0.005
 older+0.005
Neg logit contributions
tex-0.008
 relief-0.007
olk-0.007
text-0.007
 silk-0.007
Tokenisk
Token ID1984
Feature activation-5.36e+00

Pos logit contributions
 relief+0.026
tex+0.025
olk+0.021
 silk+0.020
text+0.019
Neg logit contributions
mun-0.061
 wrong-0.057
 first-0.056
roads-0.055
 fishing-0.055
Tokeners
Token ID364
Feature activation+4.74e-01

Pos logit contributions
 relief+0.156
tex+0.152
olk+0.142
 silk+0.138
 ow+0.136
Neg logit contributions
mun-0.110
 wrong-0.097
 first-0.095
roads-0.092
 fishing-0.090
Token away
Token ID1497
Feature activation-1.56e+00

Pos logit contributions
mun+0.012
 wrong+0.011
 first+0.010
roads+0.010
 fishing+0.010
Neg logit contributions
 relief-0.011
tex-0.011
olk-0.010
 silk-0.010
text-0.010
Token and
Token ID290
Feature activation+3.37e-01

Pos logit contributions
tex+0.047
 relief+0.046
olk+0.043
text+0.042
 silk+0.041
Neg logit contributions
mun-0.029
 first-0.029
 wrong-0.028
 fishing-0.026
roads-0.025
Token told
Token ID1297
Feature activation+3.38e-01

Pos logit contributions
mun+0.007
 wrong+0.006
 first+0.006
 he+0.006
 fishing+0.006
Neg logit contributions
tex-0.010
olk-0.010
 relief-0.009
text-0.009
 silk-0.009
Token Max
Token ID5436
Feature activation-1.64e+00

Pos logit contributions
 wrong+0.003
mun+0.003
 first+0.003
 he+0.003
 older+0.002
Neg logit contributions
 relief-0.013
tex-0.012
olk-0.012
 ray-0.012
icol-0.012
Token and
Token ID290
Feature activation+3.57e-01

Pos logit contributions
mun+0.040
 wrong+0.035
 first+0.034
roads+0.033
 fishing+0.033
Neg logit contributions
 relief-0.069
tex-0.068
olk-0.063
 silk-0.062
text-0.061
Token wanted
Token ID2227
Feature activation-6.60e-01

Pos logit contributions
mun+0.008
 wrong+0.007
roads+0.007
 fishing+0.007
 first+0.007
Neg logit contributions
 relief-0.009
tex-0.009
olk-0.008
 silk-0.008
 ow-0.008
Token to
Token ID284
Feature activation-5.41e-02

Pos logit contributions
 relief+0.019
tex+0.018
 silk+0.018
 ow+0.017
olk+0.017
Neg logit contributions
mun-0.014
roads-0.012
our-0.011
 fishing-0.010
 exception-0.010
Token choose
Token ID3853
Feature activation+2.01e+00

Pos logit contributions
 relief+0.001
tex+0.001
olk+0.001
 silk+0.001
text+0.001
Neg logit contributions
mun-0.001
 wrong-0.001
 first-0.001
roads-0.001
 fishing-0.001
Token what
Token ID644
Feature activation+1.57e+00

Pos logit contributions
mun+0.052
roads+0.044
 leaders+0.042
 fishing+0.042
top+0.042
Neg logit contributions
 relief-0.047
tex-0.045
 silk-0.043
olk-0.042
 ow-0.041
Token she
Token ID673
Feature activation+5.35e+00

Pos logit contributions
mun+0.035
roads+0.032
 exception+0.032
our+0.031
 leaders+0.030
Neg logit contributions
tex-0.038
 relief-0.037
 Kitty-0.037
 ow-0.037
 silk-0.036
Token did
Token ID750
Feature activation+3.30e+00

Pos logit contributions
mun+0.137
 fishing+0.122
roads+0.122
 wrong+0.121
 first+0.118
Neg logit contributions
 relief-0.124
tex-0.122
 ow-0.106
olk-0.106
 silk-0.104
Token.
Token ID13
Feature activation+6.05e-01

Pos logit contributions
mun+0.101
rob+0.079
 fishing+0.078
roads+0.077
 workshop+0.076
Neg logit contributions
tex-0.056
 relief-0.055
 itself-0.052
 click-0.049
 sil-0.049
Token
Token ID198
Feature activation+3.83e-01

Pos logit contributions
mun+0.014
 leaders+0.013
 fishing+0.013
 wrong+0.013
roads+0.013
Neg logit contributions
tex-0.015
 relief-0.015
olk-0.013
text-0.013
 silk-0.013
Token
Token ID198
Feature activation+3.89e-01

Pos logit contributions
mun+0.010
 leaders+0.009
 wrong+0.009
 fishing+0.009
roads+0.008
Neg logit contributions
tex-0.009
 relief-0.009
text-0.008
 Kitty-0.007
 silk-0.007
TokenOne
Token ID3198
Feature activation+1.43e+00

Pos logit contributions
mun+0.006
 fishing+0.005
 leaders+0.005
 wrong+0.005
 older+0.005
Neg logit contributions
tex-0.013
 relief-0.013
text-0.012
 silk-0.011
olk-0.011
Token it
Token ID340
Feature activation+8.72e-01

Pos logit contributions
 relief+0.016
tex+0.015
olk+0.015
 silk+0.015
 ow+0.014
Neg logit contributions
mun-0.016
roads-0.015
 wrong-0.014
 fishing-0.013
 exception-0.013
Token's
Token ID338
Feature activation+1.90e+00

Pos logit contributions
mun+0.018
roads+0.018
 fishing+0.017
 shopping+0.016
 exception+0.016
Neg logit contributions
tex-0.022
 relief-0.022
 ow-0.021
olk-0.021
 silk-0.020
Token good
Token ID922
Feature activation+5.87e-01

Pos logit contributions
mun+0.044
 wrong+0.038
roads+0.038
 first+0.037
 fishing+0.037
Neg logit contributions
 relief-0.050
tex-0.049
olk-0.045
 silk-0.044
 ow-0.043
Token for
Token ID329
Feature activation+8.35e-01

Pos logit contributions
mun+0.013
roads+0.012
 wrong+0.010
top+0.010
 fishing+0.010
Neg logit contributions
 relief-0.016
tex-0.016
olk-0.015
 silk-0.015
 ow-0.014
Token dinner
Token ID8073
Feature activation+1.80e+00

Pos logit contributions
mun+0.015
roads+0.013
 wrong+0.012
top+0.012
 exception+0.012
Neg logit contributions
 relief-0.026
tex-0.025
olk-0.024
 silk-0.024
 ow-0.024
Token,"
Token ID553
Feature activation+4.62e+00

Pos logit contributions
mun+0.047
roads+0.042
 leaders+0.038
 wrong+0.038
 older+0.038
Neg logit contributions
 relief-0.038
tex-0.036
 silk-0.034
 sil-0.034
 ow-0.034
Token Mama
Token ID40084
Feature activation+5.34e-01

Pos logit contributions
mun+0.100
 wrong+0.098
 leaders+0.094
roads+0.092
ider+0.091
Neg logit contributions
 relief-0.126
tex-0.112
 Kitty-0.111
 click-0.103
 ow-0.103
Token said
Token ID531
Feature activation+2.58e+00

Pos logit contributions
mun+0.010
roads+0.009
 wrong+0.009
 fishing+0.008
 leaders+0.008
Neg logit contributions
 relief-0.016
tex-0.015
olk-0.014
 ow-0.014
 silk-0.014
Token.
Token ID13
Feature activation+3.86e-01

Pos logit contributions
mun+0.058
roads+0.054
 leaders+0.049
 fishing+0.048
 exception+0.048
Neg logit contributions
 relief-0.070
tex-0.061
 click-0.060
olk-0.059
 ow-0.058
Token
Token ID198
Feature activation+5.88e-01

Pos logit contributions
mun+0.011
roads+0.010
 wrong+0.010
 leaders+0.010
 exception+0.009
Neg logit contributions
 relief-0.008
tex-0.007
 Kitty-0.007
olk-0.007
 ow-0.007
Token
Token ID198
Feature activation+5.80e-01

Pos logit contributions
mun+0.013
 wrong+0.012
roads+0.012
 fishing+0.012
 first+0.012
Neg logit contributions
 relief-0.015
tex-0.015
olk-0.013
text-0.013
 silk-0.013
Token made
Token ID925
Feature activation-2.11e-01

Pos logit contributions
mun+0.016
 wrong+0.014
 first+0.014
 fishing+0.014
roads+0.013
Neg logit contributions
tex-0.025
 relief-0.025
olk-0.023
text-0.023
 silk-0.022
Token sure
Token ID1654
Feature activation+1.42e-01

Pos logit contributions
 relief+0.005
tex+0.005
olk+0.005
 silk+0.005
text+0.005
Neg logit contributions
mun-0.005
 wrong-0.004
 first-0.004
 fishing-0.004
roads-0.004
Token they
Token ID484
Feature activation+2.18e+00

Pos logit contributions
mun+0.003
roads+0.003
 exception+0.003
our+0.003
 fishing+0.003
Neg logit contributions
 relief-0.004
tex-0.004
olk-0.003
 silk-0.003
 ow-0.003
Token had
Token ID550
Feature activation+7.23e-01

Pos logit contributions
mun+0.056
 wrong+0.051
roads+0.050
 first+0.050
 fishing+0.050
Neg logit contributions
 relief-0.051
tex-0.050
olk-0.044
 silk-0.043
text-0.043
Token everything
Token ID2279
Feature activation+4.62e-01

Pos logit contributions
mun+0.015
 wrong+0.013
roads+0.013
 first+0.013
 fishing+0.013
Neg logit contributions
 relief-0.020
tex-0.020
olk-0.018
 silk-0.018
text-0.018
Token they
Token ID484
Feature activation+4.84e+00

Pos logit contributions
mun+0.010
roads+0.009
 fishing+0.008
 exception+0.008
 leaders+0.008
Neg logit contributions
 relief-0.013
tex-0.012
 silk-0.011
text-0.011
 sil-0.011
Token needed
Token ID2622
Feature activation+8.09e-01

Pos logit contributions
mun+0.142
roads+0.133
 fishing+0.127
 exception+0.126
 wrong+0.125
Neg logit contributions
 relief-0.096
tex-0.090
 ow-0.076
text-0.073
 sil-0.072
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
mun+0.018
roads+0.016
 exception+0.014
 older+0.014
 leaders+0.014
Neg logit contributions
 relief-0.022
tex-0.021
 silk-0.019
text-0.019
 aver-0.018
Token All
Token ID1439
Feature activation+1.13e+00

Pos logit contributions
mun+0.006
 wrong+0.006
 first+0.006
 fishing+0.006
roads+0.006
Neg logit contributions
 relief-0.006
tex-0.006
olk-0.006
 silk-0.006
 ow-0.006
Token of
Token ID286
Feature activation-1.46e+00

Pos logit contributions
mun+0.027
 wrong+0.024
roads+0.023
 first+0.023
 fishing+0.023
Neg logit contributions
 relief-0.029
tex-0.029
olk-0.025
text-0.025
 silk-0.025
Token the
Token ID262
Feature activation+8.01e-01

Pos logit contributions
 relief+0.041
tex+0.041
olk+0.038
text+0.037
 silk+0.036
Neg logit contributions
mun-0.030
 wrong-0.027
 first-0.026
 fishing-0.026
roads-0.025
TokenJohn
Token ID7554
Feature activation+7.33e-01

Pos logit contributions
 relief+0.009
tex+0.009
olk+0.008
 silk+0.008
 ow+0.008
Neg logit contributions
mun-0.007
 wrong-0.006
 first-0.006
roads-0.006
 fishing-0.006
Token felt
Token ID2936
Feature activation+4.98e-02

Pos logit contributions
mun+0.017
 first+0.015
 wrong+0.015
 fishing+0.014
roads+0.014
Neg logit contributions
 relief-0.019
tex-0.019
olk-0.018
 silk-0.017
 ow-0.017
Token sorry
Token ID7926
Feature activation-5.98e-01

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
 fishing+0.001
roads+0.001
Neg logit contributions
tex-0.001
 relief-0.001
olk-0.001
 silk-0.001
text-0.001
Token for
Token ID329
Feature activation-3.43e-01

Pos logit contributions
 relief+0.017
tex+0.016
olk+0.015
 silk+0.015
text+0.015
Neg logit contributions
mun-0.013
 wrong-0.011
roads-0.011
 fishing-0.011
 first-0.011
Token what
Token ID644
Feature activation+2.35e-02

Pos logit contributions
 relief+0.008
tex+0.007
 ow+0.007
 click+0.006
olk+0.006
Neg logit contributions
mun-0.009
roads-0.008
 fishing-0.008
 wrong-0.008
top-0.008
Token he
Token ID339
Feature activation+4.76e+00

Pos logit contributions
mun+0.000
roads+0.000
 fishing+0.000
our+0.000
top+0.000
Neg logit contributions
 relief-0.001
tex-0.001
 ow-0.001
 Kitty-0.001
 silk-0.001
Token did
Token ID750
Feature activation+2.70e+00

Pos logit contributions
mun+0.125
 wrong+0.115
 first+0.114
roads+0.111
 fishing+0.110
Neg logit contributions
 relief-0.108
tex-0.108
olk-0.098
 silk-0.094
text-0.093
Token,
Token ID11
Feature activation+3.62e-02

Pos logit contributions
mun+0.071
roads+0.060
 fishing+0.059
 older+0.058
 exception+0.058
Neg logit contributions
 relief-0.062
tex-0.059
 click-0.051
text-0.050
 itself-0.049
Token so
Token ID523
Feature activation+1.10e+00

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
 fishing+0.001
roads+0.001
Neg logit contributions
 relief-0.001
tex-0.001
olk-0.001
 silk-0.001
text-0.001
Token he
Token ID339
Feature activation+1.86e+00

Pos logit contributions
mun+0.014
roads+0.010
 fishing+0.010
 wrong+0.010
 shopping+0.010
Neg logit contributions
 relief-0.041
tex-0.039
 ow-0.036
olk-0.036
 click-0.036
Token said
Token ID531
Feature activation+5.87e-01

Pos logit contributions
mun+0.036
 first+0.034
 wrong+0.034
 older+0.031
 leaders+0.031
Neg logit contributions
tex-0.054
 relief-0.053
olk-0.052
 silk-0.050
text-0.048
Token for
Token ID329
Feature activation-3.46e-01

Pos logit contributions
tex+0.007
 relief+0.007
olk+0.007
text+0.006
 ow+0.006
Neg logit contributions
 wrong-0.004
mun-0.004
 first-0.004
 older-0.003
 fishing-0.003
Token was
Token ID373
Feature activation-5.03e-01

Pos logit contributions
tex+0.010
 relief+0.010
olk+0.009
text+0.009
icol+0.008
Neg logit contributions
mun-0.007
 first-0.007
 wrong-0.007
 shopping-0.006
roads-0.006
Token something
Token ID1223
Feature activation+5.84e-01

Pos logit contributions
tex+0.014
 relief+0.014
olk+0.013
text+0.012
 silk+0.012
Neg logit contributions
mun-0.011
 wrong-0.010
 first-0.010
 fishing-0.009
roads-0.009
Token that
Token ID326
Feature activation+1.99e+00

Pos logit contributions
mun+0.014
roads+0.013
 fishing+0.012
 leaders+0.012
top+0.012
Neg logit contributions
 relief-0.015
tex-0.014
 silk-0.013
 ow-0.013
 sil-0.012
Token no
Token ID645
Feature activation+1.92e+00

Pos logit contributions
mun+0.047
 wrong+0.041
 fishing+0.041
roads+0.041
 first+0.041
Neg logit contributions
 relief-0.051
tex-0.050
olk-0.044
 silk-0.044
text-0.043
Token one
Token ID530
Feature activation+4.88e+00

Pos logit contributions
mun+0.056
roads+0.051
 fishing+0.051
 leaders+0.049
yard+0.049
Neg logit contributions
 relief-0.041
tex-0.035
 silk-0.033
olk-0.031
 ow-0.031
Token could
Token ID714
Feature activation+1.13e+00

Pos logit contributions
mun+0.116
roads+0.101
 fishing+0.100
 wrong+0.100
 exception+0.099
Neg logit contributions
 relief-0.123
tex-0.121
 silk-0.106
 ow-0.105
text-0.105
Token take
Token ID1011
Feature activation+1.58e+00

Pos logit contributions
mun+0.030
roads+0.026
 fishing+0.025
 wrong+0.025
 shopping+0.025
Neg logit contributions
 relief-0.027
tex-0.025
 ow-0.023
 silk-0.023
olk-0.022
Token away
Token ID1497
Feature activation-1.13e+00

Pos logit contributions
mun+0.030
 wrong+0.029
 first+0.028
 shopping+0.026
 fishing+0.026
Neg logit contributions
tex-0.046
 relief-0.046
olk-0.043
text-0.041
 silk-0.041
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
tex+0.033
 relief+0.033
olk+0.031
text+0.030
 silk+0.029
Neg logit contributions
mun-0.021
 wrong-0.019
 first-0.019
 fishing-0.018
roads-0.018
Token And
Token ID843
Feature activation-8.46e-01

Pos logit contributions
 relief+0.006
tex+0.006
olk+0.005
 silk+0.005
text+0.005
Neg logit contributions
mun-0.006
 wrong-0.005
 first-0.005
roads-0.005
 fishing-0.005
Token can
Token ID460
Feature activation-5.53e-01

Pos logit contributions
 relief+0.029
tex+0.029
olk+0.026
 silk+0.025
text+0.025
Neg logit contributions
mun-0.030
 wrong-0.027
 first-0.027
roads-0.027
 fishing-0.027
Token't
Token ID470
Feature activation+1.34e-01

Pos logit contributions
 relief+0.014
tex+0.013
 click+0.012
olk+0.012
 ow+0.012
Neg logit contributions
mun-0.013
roads-0.012
 fishing-0.011
 wrong-0.011
 shopping-0.011
Token always
Token ID1464
Feature activation+2.29e+00

Pos logit contributions
mun+0.004
 wrong+0.003
roads+0.003
 fishing+0.003
 shopping+0.003
Neg logit contributions
 relief-0.003
tex-0.003
 click-0.002
olk-0.002
 ow-0.002
Token get
Token ID651
Feature activation+1.31e+00

Pos logit contributions
mun+0.061
roads+0.053
 wrong+0.052
top+0.052
 fishing+0.052
Neg logit contributions
 relief-0.054
tex-0.049
 click-0.044
 ow-0.043
 silk-0.043
Token what
Token ID644
Feature activation+1.14e+00

Pos logit contributions
mun+0.032
roads+0.029
yard+0.027
top+0.026
 leaders+0.026
Neg logit contributions
 relief-0.032
tex-0.029
 ow-0.027
 silk-0.027
olk-0.026
Token you
Token ID345
Feature activation+1.98e+00

Pos logit contributions
mun+0.020
roads+0.019
 exception+0.018
top+0.017
ios+0.016
Neg logit contributions
tex-0.034
 relief-0.033
text-0.032
 ow-0.031
 silk-0.031
Token want
Token ID765
Feature activation+1.75e+00

Pos logit contributions
mun+0.053
roads+0.048
 wrong+0.046
 fishing+0.046
 shopping+0.045
Neg logit contributions
 relief-0.045
tex-0.042
 ow-0.036
text-0.036
olk-0.035
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
mun+0.040
roads+0.033
 exception+0.032
top+0.031
 fishing+0.031
Neg logit contributions
tex-0.046
 relief-0.045
text-0.041
 silk-0.041
 ow-0.039
Token Sometimes
Token ID8975
Feature activation-9.23e-01

Pos logit contributions
 relief+0.006
tex+0.006
olk+0.005
 silk+0.005
text+0.005
Neg logit contributions
mun-0.006
 wrong-0.005
roads-0.005
 first-0.005
 fishing-0.005
Token you
Token ID345
Feature activation-2.10e+00

Pos logit contributions
 relief+0.027
tex+0.027
 silk+0.025
 ow+0.025
olk+0.024
Neg logit contributions
mun-0.018
roads-0.015
 wrong-0.015
top-0.015
 exception-0.014
Token have
Token ID423
Feature activation+3.49e-02

Pos logit contributions
 relief+0.051
tex+0.050
olk+0.043
text+0.043
 silk+0.042
Neg logit contributions
mun-0.053
 wrong-0.047
roads-0.046
 fishing-0.046
 first-0.045
Token in
Token ID287
Feature activation+6.21e-01

Pos logit contributions
 relief+0.018
tex+0.018
olk+0.016
text+0.016
 silk+0.016
Neg logit contributions
mun-0.012
 wrong-0.011
 first-0.011
 fishing-0.011
roads-0.010
Token the
Token ID262
Feature activation+1.12e-01

Pos logit contributions
 first+0.009
 fishing+0.009
mun+0.008
 wrong+0.008
 shopping+0.008
Neg logit contributions
tex-0.021
olk-0.019
 relief-0.019
text-0.019
 ow-0.018
Token boat
Token ID8848
Feature activation+2.03e+00

Pos logit contributions
mun+0.003
 wrong+0.003
 first+0.003
 fishing+0.003
roads+0.003
Neg logit contributions
tex-0.002
 relief-0.002
olk-0.002
 silk-0.002
text-0.002
Token.
Token ID13
Feature activation-3.07e-01

Pos logit contributions
mun+0.043
 wrong+0.034
roads+0.034
 leaders+0.034
 exception+0.034
Neg logit contributions
 relief-0.056
tex-0.054
 silk-0.051
olk-0.051
 click-0.050
Token Then
Token ID3244
Feature activation-1.07e+00

Pos logit contributions
 relief+0.008
tex+0.008
olk+0.007
 silk+0.007
text+0.007
Neg logit contributions
mun-0.007
 wrong-0.006
 first-0.006
roads-0.006
 fishing-0.006
Token he
Token ID339
Feature activation+1.33e+00

Pos logit contributions
 relief+0.038
tex+0.037
olk+0.035
 silk+0.034
 ow+0.034
Neg logit contributions
mun-0.015
 wrong-0.012
roads-0.012
 first-0.011
 exception-0.011
Token lay
Token ID3830
Feature activation-3.88e-01

Pos logit contributions
mun+0.029
 first+0.026
 fishing+0.026
 wrong+0.025
 older+0.024
Neg logit contributions
 relief-0.037
tex-0.036
olk-0.034
 silk-0.033
text-0.031
Token down
Token ID866
Feature activation-4.21e-01

Pos logit contributions
tex+0.011
 relief+0.011
text+0.011
olk+0.010
 silk+0.010
Neg logit contributions
 first-0.007
mun-0.007
 wrong-0.007
 fishing-0.007
roads-0.007
Token and
Token ID290
Feature activation-3.71e-01

Pos logit contributions
 relief+0.009
tex+0.008
 silk+0.008
olk+0.007
 click+0.007
Neg logit contributions
mun-0.012
roads-0.010
 wrong-0.010
 exception-0.010
 leaders-0.010
Token started
Token ID2067
Feature activation-1.15e-01

Pos logit contributions
tex+0.009
 relief+0.009
olk+0.008
text+0.008
 silk+0.008
Neg logit contributions
mun-0.009
 first-0.008
 wrong-0.008
 fishing-0.008
roads-0.008
Token to
Token ID284
Feature activation-6.31e-01

Pos logit contributions
 relief+0.004
tex+0.003
 silk+0.003
olk+0.003
text+0.003
Neg logit contributions
mun-0.002
 wrong-0.002
roads-0.002
 first-0.002
 leaders-0.002
Token You
Token ID921
Feature activation-2.20e+00

Pos logit contributions
mun+0.007
 wrong+0.006
 first+0.006
 fishing+0.005
roads+0.005
Neg logit contributions
 relief-0.013
tex-0.013
olk-0.012
 silk-0.012
 ow-0.012
Token are
Token ID389
Feature activation+1.64e+00

Pos logit contributions
 relief+0.057
tex+0.055
olk+0.051
 silk+0.050
icol+0.049
Neg logit contributions
mun-0.051
 wrong-0.047
 first-0.046
 leaders-0.043
 fishing-0.043
Token in
Token ID287
Feature activation+1.40e+00

Pos logit contributions
mun+0.036
roads+0.032
top+0.030
 exception+0.029
 fishing+0.029
Neg logit contributions
 relief-0.046
tex-0.044
olk-0.041
 ow-0.040
 silk-0.040
Token big
Token ID1263
Feature activation+6.20e-01

Pos logit contributions
 wrong+0.023
mun+0.023
 fishing+0.022
 first+0.021
 shopping+0.020
Neg logit contributions
tex-0.044
 relief-0.043
olk-0.040
 silk-0.039
text-0.039
Token trouble
Token ID5876
Feature activation+4.48e-01

Pos logit contributions
mun+0.014
roads+0.013
top+0.013
 first+0.013
 exception+0.013
Neg logit contributions
 relief-0.017
tex-0.016
 ow-0.014
olk-0.014
text-0.014
Token!"
Token ID2474
Feature activation+3.07e+00

Pos logit contributions
mun+0.010
 wrong+0.009
 first+0.009
 fishing+0.008
roads+0.008
Neg logit contributions
tex-0.012
 relief-0.012
olk-0.011
 silk-0.011
text-0.011
Token
Token ID198
Feature activation+7.36e-02

Pos logit contributions
mun+0.092
 exception+0.081
roads+0.080
 leaders+0.079
 fastest+0.077
Neg logit contributions
 relief-0.067
tex-0.061
 click-0.054
 ow-0.053
text-0.052
Token
Token ID198
Feature activation+1.79e-02

Pos logit contributions
mun+0.001
 wrong+0.001
 first+0.001
 fishing+0.001
 older+0.001
Neg logit contributions
tex-0.002
 relief-0.002
olk-0.002
 silk-0.002
icol-0.002
TokenHe
Token ID1544
Feature activation+2.07e-01

Pos logit contributions
mun+0.000
 wrong+0.000
 first+0.000
roads+0.000
 fishing+0.000
Neg logit contributions
tex-0.000
 relief-0.000
olk-0.000
 silk-0.000
text-0.000
Token grabbed
Token ID13176
Feature activation+7.98e-01

Pos logit contributions
mun+0.005
 wrong+0.005
 first+0.004
 older+0.004
 asks+0.004
Neg logit contributions
 relief-0.005
olk-0.005
tex-0.005
 silk-0.005
 click-0.004
Token Lily
Token ID20037
Feature activation-2.05e+00

Pos logit contributions
mun+0.021
 wrong+0.020
 first+0.020
 fishing+0.019
roads+0.019
Neg logit contributions
 relief-0.017
tex-0.017
olk-0.015
 silk-0.015
text-0.015
Token too
Token ID1165
Feature activation+5.97e-01

Pos logit contributions
mun+0.018
roads+0.016
 fishing+0.015
top+0.015
 exception+0.015
Neg logit contributions
 relief-0.020
tex-0.020
 silk-0.018
 ow-0.018
text-0.018
Token.
Token ID13
Feature activation-1.93e-02

Pos logit contributions
mun+0.014
roads+0.013
top+0.012
 wrong+0.011
 fishing+0.011
Neg logit contributions
 relief-0.015
tex-0.014
 ow-0.014
olk-0.013
 silk-0.013
Token I
Token ID314
Feature activation-4.80e-01

Pos logit contributions
tex+0.001
 relief+0.001
olk+0.001
text+0.001
 silk+0.001
Neg logit contributions
mun-0.000
 first-0.000
 wrong-0.000
roads-0.000
 fishing-0.000
Token help
Token ID1037
Feature activation+1.58e+00

Pos logit contributions
 relief+0.011
tex+0.011
olk+0.010
 silk+0.010
text+0.009
Neg logit contributions
mun-0.012
roads-0.011
 wrong-0.011
 fishing-0.011
 first-0.011
Token people
Token ID661
Feature activation+1.29e+00

Pos logit contributions
mun+0.032
roads+0.028
 fishing+0.026
top+0.026
 exception+0.025
Neg logit contributions
tex-0.044
 relief-0.044
 ow-0.040
 silk-0.040
 Kitty-0.040
Token who
Token ID508
Feature activation+2.31e+00

Pos logit contributions
mun+0.034
roads+0.030
 wrong+0.027
rob+0.027
top+0.027
Neg logit contributions
 relief-0.027
tex-0.026
 click-0.025
 silk-0.024
 lull-0.023
Token have
Token ID423
Feature activation+7.31e-01

Pos logit contributions
mun+0.061
roads+0.054
 fishing+0.054
 exception+0.054
 wrong+0.053
Neg logit contributions
 relief-0.051
tex-0.049
 click-0.042
 silk-0.041
olk-0.041
Token problems
Token ID2761
Feature activation+1.57e+00

Pos logit contributions
mun+0.016
roads+0.014
 wrong+0.014
 first+0.014
 fishing+0.014
Neg logit contributions
 relief-0.020
tex-0.019
olk-0.017
 silk-0.017
text-0.017
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
mun+0.037
roads+0.030
 exception+0.029
top+0.029
 fishing+0.029
Neg logit contributions
tex-0.040
 relief-0.040
 silk-0.036
text-0.036
 click-0.035
Token I
Token ID314
Feature activation-6.20e-01

Pos logit contributions
 relief+0.004
tex+0.004
olk+0.003
 silk+0.003
text+0.003
Neg logit contributions
mun-0.003
 wrong-0.002
 first-0.002
roads-0.002
 fishing-0.002
Token read
Token ID1100
Feature activation-1.16e+00

Pos logit contributions
 relief+0.015
tex+0.014
olk+0.013
 silk+0.013
text+0.013
Neg logit contributions
mun-0.016
 wrong-0.014
roads-0.014
 fishing-0.014
 first-0.014
Token the
Token ID262
Feature activation+6.50e-01

Pos logit contributions
tex+0.006
 relief+0.006
olk+0.006
text+0.005
icol+0.005
Neg logit contributions
 first-0.003
mun-0.003
 shopping-0.003
 fishing-0.003
 older-0.003
Token forest
Token ID8222
Feature activation+3.19e-01

Pos logit contributions
mun+0.011
 wrong+0.009
 first+0.009
 fishing+0.009
roads+0.009
Neg logit contributions
 relief-0.021
tex-0.021
olk-0.019
text-0.019
 silk-0.019
Token one
Token ID530
Feature activation+3.51e+00

Pos logit contributions
mun+0.009
 register+0.009
rob+0.009
 jury+0.008
 asks+0.008
Neg logit contributions
tex-0.005
 relief-0.005
 silk-0.004
 shade-0.004
 sil-0.004
Token day
Token ID1110
Feature activation-1.89e-01

Pos logit contributions
mun+0.083
 wrong+0.074
 first+0.073
roads+0.072
 fishing+0.071
Neg logit contributions
 relief-0.090
tex-0.089
olk-0.081
 silk-0.078
text-0.078
Token when
Token ID618
Feature activation+1.04e+00

Pos logit contributions
 relief+0.004
tex+0.004
 silk+0.003
olk+0.003
 sil+0.003
Neg logit contributions
mun-0.006
roads-0.005
rob-0.005
 wrong-0.005
 leaders-0.005
Token she
Token ID673
Feature activation+1.54e+00

Pos logit contributions
mun+0.019
roads+0.016
 wrong+0.016
 exception+0.015
 fishing+0.015
Neg logit contributions
 relief-0.032
tex-0.032
olk-0.029
 silk-0.029
 ow-0.029
Token heard
Token ID2982
Feature activation+1.01e+00

Pos logit contributions
mun+0.033
 wrong+0.030
 exception+0.030
 fishing+0.030
roads+0.029
Neg logit contributions
tex-0.041
 relief-0.039
text-0.036
 ow-0.036
olk-0.035
Token a
Token ID257
Feature activation+4.48e-01

Pos logit contributions
mun+0.017
 wrong+0.016
 older+0.015
 first+0.015
roads+0.015
Neg logit contributions
tex-0.031
 relief-0.031
olk-0.029
text-0.028
icol-0.028
Token very
Token ID845
Feature activation-5.10e-01

Pos logit contributions
mun+0.012
 wrong+0.011
 first+0.011
 fishing+0.011
roads+0.011
Neg logit contributions
tex-0.010
 relief-0.010
olk-0.009
text-0.008
 silk-0.008
Token calm
Token ID9480
Feature activation+3.20e-01

Pos logit contributions
 relief+0.010
tex+0.009
 ow+0.009
olk+0.008
 silk+0.008
Neg logit contributions
mun-0.015
roads-0.013
re-0.013
top-0.013
 shopping-0.013
Token voice
Token ID3809
Feature activation+9.61e-01

Pos logit contributions
mun+0.008
 wrong+0.007
 fishing+0.007
 first+0.007
roads+0.007
Neg logit contributions
tex-0.008
 relief-0.008
text-0.007
olk-0.007
 silk-0.007
Token a
Token ID257
Feature activation+8.41e-01

Pos logit contributions
 relief+0.013
tex+0.013
text+0.012
 silk+0.012
olk+0.012
Neg logit contributions
mun-0.012
roads-0.010
 wrong-0.010
 first-0.010
 exception-0.010
Token big
Token ID1263
Feature activation-1.13e+00

Pos logit contributions
 fishing+0.017
 first+0.017
 wrong+0.016
 contest+0.016
 older+0.016
Neg logit contributions
tex-0.024
olk-0.024
text-0.022
 relief-0.022
kered-0.022
Token,
Token ID11
Feature activation-1.47e+00

Pos logit contributions
tex+0.029
olk+0.027
text+0.027
 silk+0.026
 relief+0.026
Neg logit contributions
 first-0.025
 fishing-0.025
mun-0.025
 wrong-0.024
 older-0.024
Token pink
Token ID11398
Feature activation-1.26e+00

Pos logit contributions
tex+0.040
olk+0.038
 relief+0.037
text+0.036
icol+0.033
Neg logit contributions
 first-0.032
 fishing-0.031
 wrong-0.031
mun-0.030
 older-0.030
Token bag
Token ID6131
Feature activation+2.05e-01

Pos logit contributions
tex+0.028
 relief+0.028
olk+0.025
text+0.025
 silk+0.023
Neg logit contributions
 first-0.032
mun-0.032
 fishing-0.031
 wrong-0.031
 contest-0.030
Token to
Token ID284
Feature activation-1.22e+00

Pos logit contributions
mun+0.005
 wrong+0.004
 first+0.004
roads+0.004
 fishing+0.004
Neg logit contributions
 relief-0.005
tex-0.005
olk-0.005
 silk-0.005
text-0.005
Token carry
Token ID3283
Feature activation+6.43e-01

Pos logit contributions
tex+0.032
 relief+0.032
olk+0.030
text+0.029
 silk+0.028
Neg logit contributions
mun-0.027
 first-0.024
 wrong-0.024
 fishing-0.024
roads-0.023
Token all
Token ID477
Feature activation-2.54e-02

Pos logit contributions
mun+0.013
 wrong+0.012
 first+0.012
 fishing+0.012
roads+0.012
Neg logit contributions
 relief-0.018
tex-0.018
olk-0.016
 ow-0.016
text-0.016
Token my
Token ID616
Feature activation-5.66e-01

Pos logit contributions
tex+0.001
 relief+0.001
olk+0.001
 silk+0.001
text+0.001
Neg logit contributions
mun-0.001
 wrong-0.001
our-0.001
 exception-0.001
roads-0.001
Token toys
Token ID14958
Feature activation-5.33e-01

Pos logit contributions
 relief+0.016
tex+0.015
olk+0.014
 silk+0.013
text+0.013
Neg logit contributions
mun-0.012
 wrong-0.011
 first-0.011
 fishing-0.011
 shopping-0.010
Token.
Token ID13
Feature activation-5.16e-01

Pos logit contributions
tex+0.012
 relief+0.012
 silk+0.011
olk+0.011
text+0.010
Neg logit contributions
mun-0.014
roads-0.012
 wrong-0.012
 exception-0.011
 fishing-0.011
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 relief+0.046
tex+0.045
olk+0.042
 silk+0.041
text+0.040
Neg logit contributions
mun-0.027
 wrong-0.023
 first-0.023
 fishing-0.022
roads-0.022
Token The
Token ID383
Feature activation-7.37e-02

Pos logit contributions
mun+0.000
 wrong+0.000
 first+0.000
roads+0.000
 fishing+0.000
Neg logit contributions
 relief-0.000
tex-0.000
olk-0.000
 silk-0.000
text-0.000
Token princess
Token ID21752
Feature activation-4.78e-01

Pos logit contributions
 relief+0.002
tex+0.002
olk+0.002
text+0.002
 silk+0.002
Neg logit contributions
mun-0.002
 wrong-0.002
 first-0.002
roads-0.002
 fishing-0.002
Token sang
Token ID25889
Feature activation-9.10e-01

Pos logit contributions
 relief+0.011
tex+0.011
olk+0.009
text+0.009
 silk+0.009
Neg logit contributions
mun-0.012
 wrong-0.011
 first-0.011
roads-0.011
 fishing-0.011
Token along
Token ID1863
Feature activation-8.90e-01

Pos logit contributions
olk+0.030
tex+0.028
 relief+0.028
icol+0.026
 ow+0.026
Neg logit contributions
 first-0.016
mun-0.015
 wrong-0.014
 older-0.014
 leaders-0.013
Token and
Token ID290
Feature activation-8.38e-02

Pos logit contributions
 relief+0.025
tex+0.025
olk+0.023
text+0.022
 silk+0.022
Neg logit contributions
mun-0.019
 first-0.017
 wrong-0.017
 fishing-0.016
roads-0.016
Token enjoyed
Token ID8359
Feature activation-1.32e+00

Pos logit contributions
 relief+0.002
tex+0.002
olk+0.002
text+0.002
 silk+0.002
Neg logit contributions
mun-0.002
 wrong-0.002
 first-0.002
roads-0.002
 fishing-0.002
Token the
Token ID262
Feature activation-3.79e-01

Pos logit contributions
tex+0.040
 relief+0.039
olk+0.039
text+0.037
 ow+0.035
Neg logit contributions
mun-0.025
 first-0.023
 fishing-0.022
 wrong-0.022
 shopping-0.022
Token ride
Token ID6594
Feature activation+1.01e+00

Pos logit contributions
 relief+0.009
tex+0.009
olk+0.008
 silk+0.007
text+0.007
Neg logit contributions
mun-0.010
 wrong-0.009
 first-0.009
roads-0.009
 fishing-0.009
Token.
Token ID13
Feature activation-1.69e-01

Pos logit contributions
mun+0.020
 wrong+0.017
 fishing+0.017
roads+0.017
 first+0.016
Neg logit contributions
 relief-0.029
tex-0.029
 silk-0.026
olk-0.026
text-0.026
Token
Token ID198
Feature activation-9.95e-02

Pos logit contributions
 relief+0.005
tex+0.005
 ow+0.005
olk+0.005
 silk+0.005
Neg logit contributions
mun-0.003
 first-0.003
 shopping-0.003
top-0.003
 wrong-0.003
Token track
Token ID2610
Feature activation+5.84e-01

Pos logit contributions
mun+0.037
re+0.026
 wrong+0.025
roads+0.025
top+0.025
Neg logit contributions
 relief-0.053
tex-0.052
 click-0.048
 silk-0.047
 ray-0.046
Token with
Token ID351
Feature activation+3.76e-01

Pos logit contributions
mun+0.014
 exception+0.010
roads+0.010
re+0.010
 fishing+0.010
Neg logit contributions
 relief-0.016
tex-0.015
 click-0.014
 silk-0.014
text-0.013
Token some
Token ID617
Feature activation+1.83e-01

Pos logit contributions
mun+0.007
 wrong+0.006
 first+0.006
 fishing+0.006
roads+0.006
Neg logit contributions
 relief-0.012
tex-0.011
olk-0.010
 silk-0.010
text-0.010
Token books
Token ID3835
Feature activation-1.45e+00

Pos logit contributions
mun+0.004
 wrong+0.003
roads+0.003
top+0.003
 first+0.003
Neg logit contributions
 relief-0.005
tex-0.005
 silk-0.004
olk-0.004
text-0.004
Token and
Token ID290
Feature activation+1.98e-01

Pos logit contributions
 relief+0.040
tex+0.040
olk+0.037
 click+0.037
 ow+0.036
Neg logit contributions
mun-0.032
 exception-0.023
roads-0.022
 fishing-0.022
 leaders-0.021
Token boxes
Token ID10559
Feature activation+3.46e-01

Pos logit contributions
mun+0.004
 wrong+0.004
 first+0.004
roads+0.004
 fishing+0.004
Neg logit contributions
 relief-0.005
tex-0.005
olk-0.005
 silk-0.005
text-0.005
Token?"
Token ID1701
Feature activation+2.80e+00

Pos logit contributions
mun+0.008
roads+0.006
 fishing+0.006
 wrong+0.006
 exception+0.006
Neg logit contributions
 relief-0.009
tex-0.009
olk-0.008
 silk-0.008
 click-0.008
Token Tom
Token ID4186
Feature activation+3.55e-01

Pos logit contributions
mun+0.080
 leaders+0.077
 exception+0.076
 wrong+0.075
 fastest+0.074
Neg logit contributions
 relief-0.053
tex-0.053
 Kitty-0.049
text-0.042
 ow-0.042
Token nods
Token ID34833
Feature activation+1.45e+00

Pos logit contributions
mun+0.009
roads+0.009
 fishing+0.009
 leaders+0.009
 wrong+0.009
Neg logit contributions
 relief-0.007
tex-0.007
text-0.006
 ow-0.006
 click-0.005
Token and
Token ID290
Feature activation+1.36e-01

Pos logit contributions
mun+0.032
 first+0.031
 wrong+0.030
 fishing+0.029
 older+0.028
Neg logit contributions
tex-0.038
 relief-0.038
olk-0.036
 silk-0.034
text-0.034
Token smiles
Token ID21845
Feature activation+7.41e-02

Pos logit contributions
mun+0.003
 wrong+0.002
roads+0.002
 fishing+0.002
 leaders+0.002
Neg logit contributions
 relief-0.004
tex-0.004
 silk-0.003
olk-0.003
text-0.003
Token Mom
Token ID11254
Feature activation+8.41e-01

Pos logit contributions
tex+0.031
 relief+0.027
 silk+0.027
olk+0.027
text+0.027
Neg logit contributions
 first-0.027
 older-0.027
 fishing-0.026
 wrong-0.025
 shopping-0.025
Tokenmy
Token ID1820
Feature activation-2.72e-01

Pos logit contributions
mun+0.015
 first+0.013
 wrong+0.013
 older+0.012
 fishing+0.012
Neg logit contributions
 relief-0.027
tex-0.027
olk-0.025
 silk-0.024
text-0.024
Token had
Token ID550
Feature activation-7.79e-02

Pos logit contributions
 relief+0.007
tex+0.007
olk+0.006
 silk+0.006
text+0.006
Neg logit contributions
mun-0.007
 first-0.006
 wrong-0.006
 fishing-0.006
roads-0.006
Token reminded
Token ID14516
Feature activation+6.25e-01

Pos logit contributions
 relief+0.002
tex+0.002
olk+0.002
 silk+0.002
text+0.002
Neg logit contributions
mun-0.002
 wrong-0.002
 first-0.002
roads-0.002
 fishing-0.002
Token her
Token ID607
Feature activation-2.06e-01

Pos logit contributions
mun+0.014
 wrong+0.013
 first+0.012
roads+0.012
 fishing+0.012
Neg logit contributions
tex-0.017
 relief-0.017
olk-0.015
 silk-0.015
text-0.015
Token of
Token ID286
Feature activation+4.41e-01

Pos logit contributions
tex+0.007
 relief+0.007
olk+0.006
text+0.006
 ow+0.006
Neg logit contributions
mun-0.003
 first-0.003
 wrong-0.003
 fishing-0.003
 shopping-0.003
Token the
Token ID262
Feature activation+1.53e+00

Pos logit contributions
mun+0.010
 wrong+0.009
 first+0.009
 fishing+0.008
roads+0.008
Neg logit contributions
tex-0.012
 relief-0.012
olk-0.011
text-0.011
 silk-0.011
Token fun
Token ID1257
Feature activation-2.39e-01

Pos logit contributions
mun+0.037
roads+0.032
top+0.031
our+0.031
outs+0.031
Neg logit contributions
 relief-0.040
tex-0.036
 silk-0.032
olk-0.032
 ow-0.032
Token they
Token ID484
Feature activation+2.45e+00

Pos logit contributions
 relief+0.007
tex+0.007
olk+0.006
 silk+0.006
text+0.006
Neg logit contributions
mun-0.005
 wrong-0.004
 first-0.004
roads-0.004
 fishing-0.004
Token could
Token ID714
Feature activation+4.01e-01

Pos logit contributions
mun+0.064
 wrong+0.058
 first+0.058
roads+0.056
 fishing+0.055
Neg logit contributions
 relief-0.056
tex-0.056
olk-0.051
 silk-0.049
text-0.048
Token have
Token ID423
Feature activation+8.60e-01

Pos logit contributions
mun+0.011
 wrong+0.010
roads+0.009
 fishing+0.009
 leaders+0.009
Neg logit contributions
 relief-0.009
tex-0.009
olk-0.008
 silk-0.008
 ow-0.008
Token.
Token ID13
Feature activation-3.67e-01

Pos logit contributions
mun+0.001
 wrong+0.001
roads+0.001
 fishing+0.001
 first+0.001
Neg logit contributions
 relief-0.001
tex-0.001
olk-0.001
 ow-0.001
 silk-0.001
Token She
Token ID1375
Feature activation-1.54e+00

Pos logit contributions
 relief+0.010
tex+0.010
olk+0.009
 silk+0.009
text+0.009
Neg logit contributions
mun-0.008
 wrong-0.007
 first-0.007
roads-0.007
 fishing-0.007
Token is
Token ID318
Feature activation-6.59e-02

Pos logit contributions
 relief+0.041
tex+0.041
olk+0.036
text+0.036
 silk+0.036
Neg logit contributions
mun-0.034
 wrong-0.030
roads-0.030
 first-0.030
 fishing-0.029
Token holding
Token ID4769
Feature activation-3.34e-01

Pos logit contributions
 relief+0.002
tex+0.002
olk+0.002
 silk+0.002
 ow+0.002
Neg logit contributions
mun-0.001
roads-0.001
 wrong-0.001
top-0.001
 first-0.001
Token a
Token ID257
Feature activation-6.62e-01

Pos logit contributions
tex+0.010
 relief+0.010
olk+0.009
text+0.009
 ow+0.009
Neg logit contributions
mun-0.006
 first-0.006
 wrong-0.006
 fishing-0.006
 shopping-0.005
Token tray
Token ID26473
Feature activation+3.16e-01

Pos logit contributions
tex+0.018
 relief+0.017
text+0.016
olk+0.016
 ow+0.015
Neg logit contributions
mun-0.014
 fishing-0.014
 wrong-0.014
 first-0.013
 shopping-0.013
Token with
Token ID351
Feature activation+3.40e-01

Pos logit contributions
mun+0.008
roads+0.007
 wrong+0.007
 leaders+0.007
 exception+0.007
Neg logit contributions
 relief-0.007
tex-0.007
olk-0.007
 silk-0.006
 click-0.006
Token two
Token ID734
Feature activation+4.72e-01

Pos logit contributions
mun+0.006
 first+0.006
 fishing+0.006
 wrong+0.006
 shopping+0.006
Neg logit contributions
tex-0.010
 relief-0.010
text-0.009
olk-0.008
icol-0.008
Token cups
Token ID14180
Feature activation+1.93e-01

Pos logit contributions
mun+0.011
 wrong+0.010
roads+0.010
 first+0.010
 fishing+0.010
Neg logit contributions
 relief-0.012
tex-0.012
olk-0.011
 silk-0.011
 ow-0.010
Token and
Token ID290
Feature activation-5.99e-01

Pos logit contributions
mun+0.004
 wrong+0.004
roads+0.004
 first+0.004
 exception+0.004
Neg logit contributions
 relief-0.005
tex-0.005
olk-0.005
 silk-0.004
text-0.004
Token a
Token ID257
Feature activation-2.16e-01

Pos logit contributions
 relief+0.016
tex+0.016
olk+0.014
 silk+0.014
text+0.014
Neg logit contributions
mun-0.013
 wrong-0.012
 first-0.012
 fishing-0.012
roads-0.011
Token nose
Token ID9686
Feature activation-1.56e+00

Pos logit contributions
mun+0.001
 wrong+0.001
 fishing+0.001
 first+0.001
 older+0.001
Neg logit contributions
tex-0.001
 relief-0.001
text-0.001
olk-0.001
 silk-0.001
Token,
Token ID11
Feature activation+9.64e-01

Pos logit contributions
 relief+0.056
tex+0.055
text+0.049
olk+0.048
 silk+0.048
Neg logit contributions
mun-0.021
 first-0.021
 wrong-0.021
 fishing-0.020
re-0.018
Token a
Token ID257
Feature activation+1.92e-02

Pos logit contributions
mun+0.016
 fishing+0.016
 first+0.016
 wrong+0.015
 older+0.014
Neg logit contributions
tex-0.031
 relief-0.030
olk-0.029
text-0.029
 aver-0.026
Token mouth
Token ID5422
Feature activation-6.81e-01

Pos logit contributions
mun+0.000
 fishing+0.000
 wrong+0.000
 first+0.000
 shopping+0.000
Neg logit contributions
tex-0.001
 relief-0.000
text-0.000
olk-0.000
 silk-0.000
Token and
Token ID290
Feature activation+2.53e-01

Pos logit contributions
 relief+0.023
tex+0.022
text+0.020
olk+0.020
 silk+0.020
Neg logit contributions
mun-0.011
 wrong-0.010
 first-0.010
 fishing-0.010
top-0.009
Token ears
Token ID11368
Feature activation-2.46e+00

Pos logit contributions
mun+0.005
 wrong+0.005
 first+0.005
 fishing+0.004
roads+0.004
Neg logit contributions
tex-0.007
 relief-0.007
olk-0.007
text-0.006
 silk-0.006
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
tex+0.092
 relief+0.091
text+0.084
olk+0.083
 silk+0.081
Neg logit contributions
mun-0.029
 wrong-0.027
 first-0.026
 fishing-0.025
top-0.023
Token But
Token ID887
Feature activation-6.59e-01

Pos logit contributions
 relief+0.005
tex+0.005
olk+0.004
 silk+0.004
text+0.004
Neg logit contributions
mun-0.005
roads-0.005
 exception-0.005
 wrong-0.005
 first-0.005
Token he
Token ID339
Feature activation+2.13e+00

Pos logit contributions
 relief+0.017
tex+0.016
olk+0.015
 silk+0.015
 ow+0.014
Neg logit contributions
mun-0.016
roads-0.014
 wrong-0.014
 fishing-0.013
 first-0.013
Token also
Token ID635
Feature activation+3.66e+00

Pos logit contributions
mun+0.053
 wrong+0.048
 first+0.048
 fishing+0.047
roads+0.046
Neg logit contributions
 relief-0.052
tex-0.051
olk-0.046
 silk-0.045
text-0.044
Token had
Token ID550
Feature activation+4.97e-02

Pos logit contributions
mun+0.079
 wrong+0.071
 first+0.070
 fishing+0.068
roads+0.068
Neg logit contributions
tex-0.101
 relief-0.100
olk-0.093
 silk-0.089
text-0.088
Token.
Token ID13
Feature activation-6.92e-01

Pos logit contributions
 relief+0.032
tex+0.032
olk+0.029
 silk+0.029
 ow+0.029
Neg logit contributions
mun-0.020
 wrong-0.017
roads-0.017
 fishing-0.016
 first-0.016
Token "
Token ID366
Feature activation-1.75e-01

Pos logit contributions
tex+0.022
olk+0.020
text+0.020
 relief+0.020
 silk+0.018
Neg logit contributions
 wrong-0.012
 older-0.012
 first-0.012
 fishing-0.011
 he-0.011
TokenNo
Token ID2949
Feature activation+4.19e-01

Pos logit contributions
 relief+0.003
tex+0.003
Got+0.002
 silk+0.002
O+0.002
Neg logit contributions
mun-0.006
 shopping-0.005
 older-0.005
yard-0.005
 wrong-0.005
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
mun+0.009
roads+0.008
 wrong+0.008
 first+0.008
 fishing+0.008
Neg logit contributions
 relief-0.012
tex-0.011
olk-0.010
 silk-0.010
 ow-0.010
Token thank
Token ID5875
Feature activation+1.44e+00

Pos logit contributions
tex+0.049
olk+0.048
icol+0.047
text+0.047
 relief+0.046
Neg logit contributions
 first-0.018
 wrong-0.018
 he-0.016
 you-0.015
mun-0.014
Token you
Token ID345
Feature activation-2.49e+00

Pos logit contributions
mun+0.020
roads+0.016
 exception+0.016
top+0.015
 fishing+0.015
Neg logit contributions
 relief-0.052
tex-0.050
 ow-0.047
 silk-0.047
 Kitty-0.046
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
 relief+0.069
tex+0.067
olk+0.062
 ow+0.061
text+0.061
Neg logit contributions
mun-0.054
roads-0.045
 fishing-0.043
top-0.043
 exception-0.042
Token cat
Token ID3797
Feature activation-1.26e+00

Pos logit contributions
tex+0.034
icol+0.033
olk+0.033
text+0.032
 relief+0.031
Neg logit contributions
 first-0.022
 wrong-0.021
 you-0.020
 he-0.020
 older-0.020
Token,"
Token ID553
Feature activation+3.54e+00

Pos logit contributions
 relief+0.037
tex+0.037
olk+0.033
 silk+0.033
 ow+0.033
Neg logit contributions
mun-0.025
roads-0.021
 fishing-0.021
 wrong-0.020
 first-0.020
Token he
Token ID339
Feature activation+1.68e+00

Pos logit contributions
mun+0.059
roads+0.048
 fishing+0.048
 wrong+0.048
 exception+0.047
Neg logit contributions
 relief-0.117
tex-0.112
 silk-0.105
 ow-0.103
olk-0.103
Token said
Token ID531
Feature activation+1.09e+00

Pos logit contributions
mun+0.028
 wrong+0.024
roads+0.024
 fishing+0.024
 shopping+0.023
Neg logit contributions
 relief-0.054
tex-0.053
olk-0.048
 ow-0.048
 silk-0.048
Tokenooter
Token ID25141
Feature activation-7.77e-01

Pos logit contributions
mun+0.040
 wrong+0.034
roads+0.033
 fishing+0.033
 first+0.033
Neg logit contributions
tex-0.049
 relief-0.048
text-0.044
olk-0.043
 click-0.043
Token was
Token ID373
Feature activation-2.38e-01

Pos logit contributions
 relief+0.019
tex+0.018
olk+0.017
 silk+0.016
text+0.016
Neg logit contributions
mun-0.019
 wrong-0.018
 first-0.018
roads-0.017
 fishing-0.017
Token still
Token ID991
Feature activation+1.93e-01

Pos logit contributions
tex+0.008
olk+0.008
 relief+0.008
text+0.008
icol+0.008
Neg logit contributions
 wrong-0.004
 first-0.004
 older-0.003
 fishing-0.003
 bored-0.003
Token harmless
Token ID23585
Feature activation-9.17e-01

Pos logit contributions
mun+0.004
 wrong+0.004
 first+0.003
 fishing+0.003
 older+0.003
Neg logit contributions
tex-0.006
 relief-0.006
olk-0.005
text-0.005
 silk-0.005
Token and
Token ID290
Feature activation-2.41e-01

Pos logit contributions
 relief+0.025
tex+0.025
olk+0.023
text+0.022
 silk+0.022
Neg logit contributions
mun-0.020
 wrong-0.018
 first-0.018
 fishing-0.017
roads-0.017
Token didn
Token ID1422
Feature activation-2.57e+00

Pos logit contributions
tex+0.009
 relief+0.009
olk+0.009
text+0.008
 silk+0.008
Neg logit contributions
mun-0.003
 wrong-0.003
 first-0.003
 he-0.002
 fishing-0.002
Token't
Token ID470
Feature activation-4.52e-01

Pos logit contributions
tex+0.086
 relief+0.085
olk+0.079
text+0.078
 silk+0.078
Neg logit contributions
mun-0.041
 wrong-0.032
roads-0.032
 fishing-0.031
 first-0.030
Token hurt
Token ID5938
Feature activation+4.61e-01

Pos logit contributions
 relief+0.011
tex+0.010
 ow+0.009
 silk+0.009
olk+0.009
Neg logit contributions
mun-0.012
roads-0.010
 wrong-0.010
 fishing-0.010
 shopping-0.010
Token anyone
Token ID2687
Feature activation+7.54e-01

Pos logit contributions
mun+0.009
roads+0.008
 fishing+0.007
 wrong+0.007
 first+0.007
Neg logit contributions
 relief-0.014
tex-0.014
olk-0.013
 silk-0.012
 ow-0.012
Token.
Token ID13
Feature activation-7.50e-02

Pos logit contributions
mun+0.015
roads+0.012
 leaders+0.012
top+0.012
 wrong+0.012
Neg logit contributions
 relief-0.022
tex-0.021
 silk-0.020
 ow-0.020
 click-0.019
Token Tim
Token ID5045
Feature activation+5.00e-01

Pos logit contributions
 relief+0.002
tex+0.002
olk+0.002
 silk+0.002
text+0.002
Neg logit contributions
mun-0.002
 wrong-0.002
 first-0.002
 fishing-0.002
roads-0.002
Token.
Token ID13
Feature activation-5.30e-01

Pos logit contributions
tex+0.020
 relief+0.019
olk+0.018
text+0.018
 silk+0.017
Neg logit contributions
mun-0.008
 first-0.007
 wrong-0.007
 older-0.007
 fishing-0.007
Token Anna
Token ID11735
Feature activation-1.25e+00

Pos logit contributions
 relief+0.014
tex+0.014
olk+0.013
 silk+0.013
text+0.012
Neg logit contributions
mun-0.012
 wrong-0.010
 first-0.010
 fishing-0.010
roads-0.010
Token smiled
Token ID13541
Feature activation+2.55e-01

Pos logit contributions
 relief+0.034
olk+0.034
tex+0.034
 silk+0.033
 ow+0.030
Neg logit contributions
mun-0.027
 first-0.024
 wrong-0.023
 older-0.022
 asks-0.022
Token and
Token ID290
Feature activation-5.48e-01

Pos logit contributions
 older+0.004
 first+0.004
 wrong+0.004
mun+0.004
 he+0.004
Neg logit contributions
tex-0.008
olk-0.007
 relief-0.007
text-0.007
 silk-0.007
Token hugged
Token ID41130
Feature activation-1.65e+00

Pos logit contributions
olk+0.016
tex+0.016
 silk+0.015
 relief+0.015
text+0.014
Neg logit contributions
mun-0.011
 he-0.009
 asks-0.009
 first-0.009
 wrong-0.009
Token her
Token ID607
Feature activation-3.51e+00

Pos logit contributions
tex+0.055
text+0.050
 relief+0.050
olk+0.049
Got+0.049
Neg logit contributions
mun-0.025
 first-0.025
 wrong-0.024
 older-0.022
 shopping-0.022
Token mum
Token ID25682
Feature activation-5.85e-01

Pos logit contributions
tex+0.101
 relief+0.092
text+0.090
olk+0.089
Got+0.087
Neg logit contributions
 first-0.075
 older-0.069
mun-0.068
 wrong-0.068
 fishing-0.068
Token back
Token ID736
Feature activation-6.02e-01

Pos logit contributions
tex+0.020
 relief+0.020
olk+0.019
 silk+0.019
 ow+0.019
Neg logit contributions
mun-0.008
 first-0.008
 wrong-0.007
 fishing-0.007
 shopping-0.007
Token.
Token ID13
Feature activation-7.98e-01

Pos logit contributions
 relief+0.020
tex+0.020
olk+0.019
 silk+0.018
text+0.018
Neg logit contributions
mun-0.009
 wrong-0.008
 first-0.008
 fishing-0.007
roads-0.007
Token
Token ID198
Feature activation-6.70e-01

Pos logit contributions
 relief+0.026
tex+0.025
 silk+0.024
 click+0.024
 aver+0.023
Neg logit contributions
mun-0.012
 asks-0.011
 first-0.010
 fishing-0.010
 shopping-0.010
Token
Token ID198
Feature activation-6.31e-01

Pos logit contributions
 relief+0.020
olk+0.019
 silk+0.018
tex+0.017
 ray+0.017
Neg logit contributions
mun-0.013
 first-0.012
 fishing-0.012
 wrong-0.011
roads-0.011
Token,
Token ID11
Feature activation-4.16e-01

Pos logit contributions
 relief+0.023
tex+0.023
olk+0.021
 silk+0.021
text+0.020
Neg logit contributions
mun-0.014
 first-0.013
 wrong-0.012
 fishing-0.012
 shopping-0.012
Token she
Token ID673
Feature activation+1.51e-01

Pos logit contributions
tex+0.014
 relief+0.013
olk+0.013
text+0.012
 silk+0.012
Neg logit contributions
mun-0.007
 first-0.006
 wrong-0.006
roads-0.006
 shopping-0.006
Token put
Token ID1234
Feature activation-2.47e-01

Pos logit contributions
mun+0.004
 wrong+0.003
 first+0.003
 fishing+0.003
roads+0.003
Neg logit contributions
 relief-0.004
tex-0.004
olk-0.003
 silk-0.003
text-0.003
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
tex+0.007
 relief+0.007
text+0.007
olk+0.007
 silk+0.006
Neg logit contributions
mun-0.005
 wrong-0.005
 first-0.005
 shopping-0.004
 fishing-0.004
Token orange
Token ID10912
Feature activation-2.86e+00

Pos logit contributions
tex+0.027
 relief+0.027
olk+0.024
text+0.023
 silk+0.023
Neg logit contributions
mun-0.026
 wrong-0.024
 first-0.024
 fishing-0.023
 shopping-0.022
Token peel
Token ID31738
Feature activation-2.27e+00

Pos logit contributions
 relief+0.072
tex+0.070
olk+0.062
text+0.062
 ow+0.061
Neg logit contributions
mun-0.067
 first-0.065
 wrong-0.063
 fishing-0.062
 shopping-0.060
Token in
Token ID287
Feature activation+5.92e-01

Pos logit contributions
 relief+0.062
tex+0.060
olk+0.055
 silk+0.053
 ow+0.053
Neg logit contributions
mun-0.049
 first-0.046
 wrong-0.045
 fishing-0.044
roads-0.043
Token the
Token ID262
Feature activation+5.18e-01

Pos logit contributions
 first+0.011
mun+0.011
 wrong+0.010
 shopping+0.010
 fishing+0.010
Neg logit contributions
tex-0.018
 relief-0.017
text-0.016
olk-0.016
 ow-0.015
Token garbage
Token ID15413
Feature activation-1.93e+00

Pos logit contributions
mun+0.010
 wrong+0.010
 first+0.009
 fishing+0.009
 shopping+0.009
Neg logit contributions
tex-0.015
 relief-0.015
olk-0.013
text-0.013
 silk-0.013
Token.
Token ID13
Feature activation-6.11e-01

Pos logit contributions
 relief+0.052
tex+0.051
olk+0.048
 silk+0.045
 ow+0.045
Neg logit contributions
mun-0.043
 wrong-0.036
roads-0.036
 fishing-0.036
 exception-0.036
Token She
Token ID1375
Feature activation-1.53e+00

Pos logit contributions
tex+0.020
 relief+0.020
 ow+0.018
text+0.018
 silk+0.018
Neg logit contributions
top-0.009
 first-0.009
 shopping-0.008
 older-0.008
mun-0.008