TOP ACTIVATIONS
MAX = 5.964

thinks . She remembers what Mom does when she has a
too . He remembered what Mom my said . She said
" The law is what Mom and Dad say . They
" Look , Mom , look , Dad !"
he did not know that Mom saw him . She saw
" Stop it !" says Mom . She comes and takes
Maybe tomorrow , if the sun comes out and the ground
her . He remembered what Mom said . He ran to
go to the airport , Mom my ?" Mom
" Follow me ," the sun flower says . He bends
scared . She remembers what Mom said . The rail is
Mom smiles and says : " I love
smiled . She said she loved them too . She said
to come in when the sun was low .
" Yes , Mom , you can play !"
dad . Dad said he loved them , but they had
Can we go now , Mom ?" Lily asked .
kissed them and said they loved them . They said they
She does not know that Mom and Dad know . They
they did not know that Mom had turned on the oven

MIN ACTIVATIONS
MIN = -5.820

the brav est and boun ci est bunny in the whole
best friends , Matt and Lynn . They lived next door
to cry and threw a tant rum . She didn 't
Lily . She loved l oll ip ops , especially the
sad . She threw a tant rum and cried loudly .
Lily . She loved l oll ip ops and would always
Lily . She loved l oll ip ops and always had
kept crying and throwing a tant rum . Finally
't like all of the unnecessary extravag ance .
and so she threw a tant rum when her parents said
to put on her fan ci est sunglasses .
she always wore her fan ci est dress to the museum
Lily . She loved l oll ip ops and always had
the Princess would apply tooth paste to her tooth brush and
would take out a j igsaw puzzle and try to solve
Lily . She loved l oll ip ops and always had
her feet and threw a tant rum . She grabbed the
Sara said , " C rick et , is this your
and started to throw a tant rum . Suddenly , she
One day , Matt and Lynn wanted to discuss something special

INTERVAL 3.018 - 5.964
CONTAINS 1.442%

a big gear on the grass . It is shiny and
!" His mom my said , " Don
said . Her mom my smiled and said ,
clever game Lily made . <|endoftext|> Once upon a time ,
them stayed there until the sun went down . Then Mat

INTERVAL 0.072 - 3.018
CONTAINS 43.789%

A kind squirrel heard the bird . The squirrel came to
is gone . Tom starts to cry . He
. He said he was sorry . He said he was
with those kinds of things ." The boy felt a little
ila thought for a moment . She had an unusual idea

INTERVAL -2.874 - 0.072
CONTAINS 54.245%

hug . He said goodbye to his town and
and happy people dancing . She forgot all about being tired
He would run around in the grass and make a big
. Every day , he would travel on his horse around
caught them . He took the key and ran away .

INTERVAL -5.820 - -2.874
CONTAINS 0.524%

He had a big smile on his face and felt so
! The wild pants was so excited . He
frozen , so he hopped on it and fell through the
to her , Let
family in and shared his meal with them . They were

TOKEN DETAILS FOR TOP ACTIVATIONS

Token thinks
Token ID6834
Feature activation+1.49e+00

Pos logit contributions
o+0.048
ll+0.041
ossible+0.039
rou+0.037
om+0.037
Neg logit contributions
ured-0.065
zed-0.060
zing-0.059
tered-0.058
comb-0.058
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
o+0.037
ll+0.031
ossible+0.029
rou+0.028
om+0.027
Neg logit contributions
ured-0.054
zed-0.050
tered-0.049
zing-0.049
comb-0.049
Token She
Token ID1375
Feature activation+2.23e+00

Pos logit contributions
o+0.059
ll+0.054
ossible+0.052
rou+0.052
om+0.049
Neg logit contributions
ured-0.044
tered-0.039
zed-0.039
comb-0.038
oned-0.037
Token remembers
Token ID18140
Feature activation+2.25e+00

Pos logit contributions
o+0.057
ll+0.048
ossible+0.046
om+0.043
rou+0.043
Neg logit contributions
ured-0.078
zed-0.073
zing-0.070
tered-0.070
comb-0.070
Token what
Token ID644
Feature activation+1.49e-01

Pos logit contributions
o+0.065
ll+0.056
ossible+0.054
om+0.051
rou+0.051
Neg logit contributions
ured-0.071
zed-0.066
tered-0.064
zing-0.063
comb-0.063
Token Mom
Token ID11254
Feature activation+5.89e+00

Pos logit contributions
o+0.003
ll+0.003
rou+0.002
ossible+0.002
om+0.002
Neg logit contributions
ured-0.006
zed-0.005
tered-0.005
comb-0.005
zing-0.005
Token does
Token ID857
Feature activation-4.49e-01

Pos logit contributions
o+0.130
ossible+0.121
rou+0.112
ll+0.109
om+0.108
Neg logit contributions
ured-0.194
isy-0.185
zed-0.182
comb-0.182
tered-0.179
Token when
Token ID618
Feature activation+2.00e+00

Pos logit contributions
ured+0.019
zed+0.018
comb+0.017
zing+0.017
tered+0.017
Neg logit contributions
o-0.008
ll-0.006
ossible-0.006
om-0.005
rou-0.005
Token she
Token ID673
Feature activation+2.21e+00

Pos logit contributions
o+0.065
ll+0.056
ossible+0.054
om+0.053
rou+0.052
Neg logit contributions
ured-0.057
zed-0.050
tered-0.050
zing-0.049
comb-0.049
Token has
Token ID468
Feature activation+2.03e+00

Pos logit contributions
o+0.059
ll+0.050
ossible+0.048
om+0.045
rou+0.045
Neg logit contributions
ured-0.076
zed-0.070
tered-0.068
zing-0.068
comb-0.067
Token a
Token ID257
Feature activation+1.34e+00

Pos logit contributions
o+0.051
ll+0.043
ossible+0.042
om+0.039
rou+0.039
Neg logit contributions
ured-0.072
zed-0.067
tered-0.065
zing-0.065
comb-0.065
Token too
Token ID1165
Feature activation-3.14e-01

Pos logit contributions
o+0.038
ll+0.034
ossible+0.032
rou+0.031
om+0.030
Neg logit contributions
ured-0.051
tered-0.046
zing-0.046
zed-0.046
isy-0.046
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
ured+0.010
zing+0.010
tered+0.009
zed+0.009
comb+0.009
Neg logit contributions
o-0.008
ll-0.007
ossible-0.007
rou-0.007
om-0.006
Token He
Token ID679
Feature activation+9.89e-01

Pos logit contributions
o+0.008
ll+0.007
ossible+0.007
rou+0.007
om+0.007
Neg logit contributions
ured-0.009
zed-0.008
tered-0.008
zing-0.008
comb-0.008
Token remembered
Token ID12086
Feature activation+2.12e+00

Pos logit contributions
o+0.026
ll+0.022
ossible+0.021
rou+0.020
om+0.020
Neg logit contributions
ured-0.034
comb-0.031
zed-0.031
zing-0.030
tered-0.030
Token what
Token ID644
Feature activation+3.91e-01

Pos logit contributions
o+0.060
ll+0.053
ossible+0.051
om+0.050
rou+0.048
Neg logit contributions
ured-0.066
zed-0.062
zing-0.059
comb-0.059
tered-0.059
Token Mom
Token ID11254
Feature activation+5.82e+00

Pos logit contributions
o+0.010
ll+0.009
ossible+0.008
rou+0.008
om+0.008
Neg logit contributions
ured-0.013
zed-0.013
tered-0.012
zing-0.012
comb-0.012
Tokenmy
Token ID1820
Feature activation+3.83e+00

Pos logit contributions
ossible+0.098
o+0.086
rou+0.084
om+0.083
ll+0.074
Neg logit contributions
isy-0.210
est-0.203
ered-0.201
le-0.200
ured-0.199
Token said
Token ID531
Feature activation+1.70e-01

Pos logit contributions
o+0.098
ossible+0.083
ll+0.083
rou+0.078
om+0.077
Neg logit contributions
ured-0.130
zed-0.121
tered-0.119
zing-0.118
comb-0.117
Token.
Token ID13
Feature activation+8.03e-01

Pos logit contributions
o+0.003
ll+0.003
ossible+0.002
om+0.002
rou+0.002
Neg logit contributions
ured-0.007
zed-0.006
zing-0.006
tered-0.006
comb-0.006
Token She
Token ID1375
Feature activation+2.23e+00

Pos logit contributions
o+0.025
ll+0.023
ossible+0.022
rou+0.022
om+0.020
Neg logit contributions
ured-0.022
zed-0.020
tered-0.020
zing-0.020
comb-0.019
Token said
Token ID531
Feature activation+1.75e-01

Pos logit contributions
o+0.064
ll+0.055
ossible+0.053
rou+0.050
om+0.050
Neg logit contributions
ured-0.072
zed-0.066
tered-0.064
zing-0.064
comb-0.064
Token"
Token ID1
Feature activation-9.03e-01

Pos logit contributions
o+0.020
ll+0.019
rou+0.018
ossible+0.017
om+0.017
Neg logit contributions
ured-0.020
zed-0.018
zing-0.017
tered-0.017
comb-0.017
TokenThe
Token ID464
Feature activation+2.05e+00

Pos logit contributions
ured+0.027
tered+0.024
zed+0.024
comb+0.023
ze+0.023
Neg logit contributions
o-0.027
ll-0.025
ossible-0.024
rou-0.023
om-0.022
Token law
Token ID1099
Feature activation+8.79e-01

Pos logit contributions
o+0.067
ll+0.059
ossible+0.058
om+0.056
rou+0.055
Neg logit contributions
ured-0.057
zed-0.052
comb-0.050
tered-0.050
zing-0.050
Token is
Token ID318
Feature activation+7.26e-01

Pos logit contributions
o+0.021
ll+0.019
ossible+0.017
rou+0.017
om+0.016
Neg logit contributions
ured-0.031
tered-0.028
zed-0.028
ued-0.027
zing-0.027
Token what
Token ID644
Feature activation+1.54e-02

Pos logit contributions
o+0.018
ll+0.015
ossible+0.014
om+0.013
rou+0.013
Neg logit contributions
ured-0.026
zed-0.025
tered-0.024
zing-0.024
comb-0.024
Token Mom
Token ID11254
Feature activation+5.77e+00

Pos logit contributions
o+0.000
ll+0.000
ossible+0.000
rou+0.000
om+0.000
Neg logit contributions
ured-0.000
zed-0.000
zing-0.000
tered-0.000
ered-0.000
Token and
Token ID290
Feature activation+2.81e+00

Pos logit contributions
o+0.132
ossible+0.129
rou+0.125
ipt+0.115
ll+0.113
Neg logit contributions
ured-0.173
isy-0.166
ered-0.164
est-0.163
 happen-0.160
Token Dad
Token ID17415
Feature activation+3.93e+00

Pos logit contributions
o+0.066
ll+0.060
ossible+0.053
om+0.051
rou+0.050
Neg logit contributions
ured-0.097
zed-0.092
zing-0.088
comb-0.088
bees-0.087
Token say
Token ID910
Feature activation-3.86e-01

Pos logit contributions
o+0.098
rou+0.084
ossible+0.082
ll+0.080
om+0.077
Neg logit contributions
ured-0.126
zed-0.116
isy-0.116
zing-0.116
tered-0.115
Token.
Token ID13
Feature activation+2.17e+00

Pos logit contributions
ured+0.016
zed+0.015
comb+0.015
tered+0.015
growth+0.015
Neg logit contributions
o-0.007
ll-0.005
om-0.005
ossible-0.005
rou-0.004
Token They
Token ID1119
Feature activation+1.55e+00

Pos logit contributions
o+0.064
ll+0.057
ossible+0.054
rou+0.052
om+0.051
Neg logit contributions
ured-0.067
zed-0.061
tered-0.059
zing-0.059
comb-0.058
Token
Token ID198
Feature activation+8.54e-01

Pos logit contributions
o+0.005
ll+0.005
rou+0.005
ossible+0.004
om+0.004
Neg logit contributions
ured-0.006
zed-0.006
tered-0.006
zing-0.005
ued-0.005
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
o+0.018
ll+0.017
rou+0.016
ety+0.014
etchup+0.014
Neg logit contributions
zed-0.029
ured-0.029
tered-0.027
zing-0.027
ued-0.026
Token"
Token ID1
Feature activation-2.55e+00

Pos logit contributions
ured+0.005
zed+0.004
tered+0.004
zing+0.004
comb+0.004
Neg logit contributions
o-0.004
ll-0.004
ossible-0.003
rou-0.003
om-0.003
TokenLook
Token ID8567
Feature activation-1.95e+00

Pos logit contributions
ured+0.075
zed+0.068
zing+0.066
comb+0.065
tered+0.064
Neg logit contributions
o-0.081
ll-0.070
ossible-0.068
om-0.064
rou-0.064
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
ured+0.070
zed+0.064
tered+0.063
zing+0.063
comb+0.062
Neg logit contributions
o-0.049
ll-0.041
ossible-0.039
rou-0.037
om-0.037
Token Mom
Token ID11254
Feature activation+5.66e+00

Pos logit contributions
o+0.038
ll+0.033
ossible+0.030
rou+0.030
om+0.029
Neg logit contributions
ured-0.051
zed-0.046
comb-0.046
zing-0.045
tered-0.045
Token,
Token ID11
Feature activation+2.27e+00

Pos logit contributions
ossible+0.167
o+0.166
ll+0.162
ipt+0.158
rou+0.158
Neg logit contributions
est-0.124
isy-0.113
ered-0.110
ured-0.110
comb-0.109
Token look
Token ID804
Feature activation+1.21e-01

Pos logit contributions
o+0.065
ll+0.059
rou+0.053
om+0.053
ucker+0.052
Neg logit contributions
ured-0.067
comb-0.062
zed-0.062
zing-0.060
 happen-0.060
Token,
Token ID11
Feature activation+2.47e+00

Pos logit contributions
o+0.003
ll+0.003
ossible+0.003
om+0.002
rou+0.002
Neg logit contributions
ured-0.004
zed-0.004
tered-0.004
comb-0.004
zing-0.004
Token Dad
Token ID17415
Feature activation+4.47e+00

Pos logit contributions
o+0.066
ll+0.057
ossible+0.053
rou+0.051
om+0.051
Neg logit contributions
ured-0.082
zed-0.075
comb-0.074
tered-0.073
zing-0.073
Token!"
Token ID2474
Feature activation+1.21e+00

Pos logit contributions
o+0.144
ossible+0.141
ll+0.133
rou+0.133
ipt+0.128
Neg logit contributions
ured-0.094
est-0.090
comb-0.088
isy-0.088
zing-0.088
Token he
Token ID339
Feature activation+1.28e+00

Pos logit contributions
ured+0.049
zed+0.045
zing+0.044
comb+0.043
tered+0.043
Neg logit contributions
o-0.040
ll-0.034
ossible-0.033
om-0.031
rou-0.031
Token did
Token ID750
Feature activation-1.15e-02

Pos logit contributions
o+0.035
ossible+0.030
ll+0.030
om+0.029
rou+0.028
Neg logit contributions
ured-0.041
zed-0.038
tered-0.038
comb-0.037
isy-0.036
Token not
Token ID407
Feature activation+7.46e-01

Pos logit contributions
ured+0.000
zed+0.000
comb+0.000
tered+0.000
zing+0.000
Neg logit contributions
o-0.000
ll-0.000
ossible-0.000
om-0.000
rou-0.000
Token know
Token ID760
Feature activation+1.54e+00

Pos logit contributions
o+0.024
ll+0.021
ossible+0.020
om+0.019
rou+0.018
Neg logit contributions
ured-0.022
zed-0.021
zing-0.019
comb-0.019
tered-0.019
Token that
Token ID326
Feature activation+2.15e+00

Pos logit contributions
o+0.045
ll+0.039
ossible+0.038
om+0.036
rou+0.035
Neg logit contributions
ured-0.048
zed-0.045
tered-0.043
zing-0.043
comb-0.042
Token Mom
Token ID11254
Feature activation+5.63e+00

Pos logit contributions
o+0.057
ll+0.051
ossible+0.048
rou+0.045
om+0.045
Neg logit contributions
ured-0.072
zed-0.067
tered-0.066
zing-0.065
comb-0.064
Token saw
Token ID2497
Feature activation+2.57e+00

Pos logit contributions
o+0.101
ossible+0.088
ll+0.084
rou+0.077
om+0.076
Neg logit contributions
ured-0.223
zed-0.212
tered-0.211
zing-0.207
isy-0.206
Token him
Token ID683
Feature activation+6.49e-01

Pos logit contributions
o+0.082
ll+0.075
ossible+0.072
rou+0.068
om+0.066
Neg logit contributions
ured-0.073
zed-0.067
tered-0.064
zing-0.064
ered-0.063
Token.
Token ID13
Feature activation+8.12e-02

Pos logit contributions
o+0.015
ll+0.012
ossible+0.012
rou+0.011
om+0.011
Neg logit contributions
ured-0.025
zed-0.023
zing-0.022
tered-0.022
comb-0.022
Token She
Token ID1375
Feature activation+2.24e+00

Pos logit contributions
o+0.003
ll+0.002
ossible+0.002
om+0.002
rou+0.002
Neg logit contributions
ured-0.002
zed-0.002
tered-0.002
zing-0.002
comb-0.002
Token saw
Token ID2497
Feature activation+1.66e+00

Pos logit contributions
o+0.060
ll+0.052
ossible+0.050
om+0.047
rou+0.047
Neg logit contributions
ured-0.076
zed-0.070
tered-0.068
zing-0.068
comb-0.067
Token"
Token ID1
Feature activation-2.06e+00

Pos logit contributions
o+0.008
ll+0.008
rou+0.007
ossible+0.007
om+0.007
Neg logit contributions
ured-0.009
zed-0.008
tered-0.008
zing-0.008
oned-0.008
TokenStop
Token ID19485
Feature activation-1.56e+00

Pos logit contributions
ured+0.062
zed+0.057
zing+0.055
comb+0.055
isy+0.054
Neg logit contributions
o-0.066
ll-0.055
ossible-0.055
etchup-0.052
om-0.052
Token it
Token ID340
Feature activation+5.51e-01

Pos logit contributions
ured+0.052
zed+0.048
tered+0.047
zing+0.046
comb+0.046
Neg logit contributions
o-0.043
ll-0.037
ossible-0.035
om-0.034
rou-0.033
Token!"
Token ID2474
Feature activation+3.58e-01

Pos logit contributions
o+0.013
ll+0.010
ossible+0.009
om+0.009
rou+0.008
Neg logit contributions
ured-0.022
zed-0.021
tered-0.020
comb-0.019
zing-0.019
Token says
Token ID1139
Feature activation-7.27e-01

Pos logit contributions
o+0.005
ossible+0.004
ll+0.004
om+0.003
rou+0.003
Neg logit contributions
ured-0.016
zed-0.015
comb-0.015
zing-0.015
isy-0.015
Token Mom
Token ID11254
Feature activation+5.57e+00

Pos logit contributions
ured+0.033
zed+0.032
comb+0.031
zing+0.031
isy+0.031
Neg logit contributions
o-0.010
ossible-0.007
ll-0.007
om-0.006
etchup-0.005
Token.
Token ID13
Feature activation+2.81e+00

Pos logit contributions
o+0.129
ossible+0.115
ll+0.114
rou+0.104
om+0.103
Neg logit contributions
ured-0.197
zed-0.179
tered-0.178
zing-0.177
isy-0.175
Token She
Token ID1375
Feature activation+3.92e+00

Pos logit contributions
o+0.079
ll+0.070
ossible+0.066
rou+0.064
om+0.061
Neg logit contributions
ured-0.092
zed-0.083
tered-0.083
zing-0.081
comb-0.081
Token comes
Token ID2058
Feature activation+1.39e+00

Pos logit contributions
o+0.101
ll+0.086
ossible+0.083
om+0.076
rou+0.075
Neg logit contributions
ured-0.135
zed-0.127
zing-0.123
tered-0.121
ered-0.119
Token and
Token ID290
Feature activation+3.20e+00

Pos logit contributions
o+0.035
ll+0.033
ossible+0.032
rou+0.032
ucker+0.031
Neg logit contributions
ured-0.046
oned-0.041
ered-0.040
zing-0.040
tered-0.040
Token takes
Token ID2753
Feature activation-4.31e-01

Pos logit contributions
o+0.080
ll+0.072
ossible+0.068
om+0.065
rou+0.064
Neg logit contributions
ured-0.107
zed-0.100
zing-0.097
ered-0.094
tered-0.092
TokenMaybe
Token ID13300
Feature activation+1.14e+00

Pos logit contributions
o+0.017
ll+0.015
om+0.015
ossible+0.015
rou+0.015
Neg logit contributions
ured-0.013
zed-0.012
tered-0.012
isy-0.012
zing-0.011
Token tomorrow
Token ID9439
Feature activation+1.98e+00

Pos logit contributions
o+0.030
ll+0.026
ossible+0.025
rou+0.024
om+0.024
Neg logit contributions
ured-0.039
zed-0.036
tered-0.035
zing-0.035
comb-0.035
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
o+0.054
rou+0.046
ll+0.046
ossible+0.046
om+0.043
Neg logit contributions
ured-0.063
zed-0.061
zing-0.058
tered-0.058
comb-0.058
Token if
Token ID611
Feature activation+2.09e+00

Pos logit contributions
o+0.044
ll+0.037
ossible+0.036
om+0.034
rou+0.034
Neg logit contributions
ured-0.060
zed-0.055
tered-0.053
zing-0.053
comb-0.053
Token the
Token ID262
Feature activation+2.69e+00

Pos logit contributions
o+0.052
ll+0.044
ossible+0.042
om+0.041
rou+0.040
Neg logit contributions
ured-0.074
zed-0.070
zing-0.068
tered-0.067
comb-0.066
Token sun
Token ID4252
Feature activation+5.56e+00

Pos logit contributions
o+0.075
ll+0.065
ossible+0.060
om+0.059
rou+0.058
Neg logit contributions
ured-0.086
zed-0.081
est-0.080
zing-0.080
tered-0.079
Token comes
Token ID2058
Feature activation+9.95e-01

Pos logit contributions
o+0.139
ossible+0.122
ll+0.120
rou+0.118
om+0.110
Neg logit contributions
ured-0.174
zed-0.170
est-0.166
zing-0.165
ze-0.162
Token out
Token ID503
Feature activation+1.40e+00

Pos logit contributions
o+0.031
ll+0.028
ossible+0.027
rou+0.026
om+0.025
Neg logit contributions
ured-0.028
zed-0.025
tered-0.025
zing-0.025
comb-0.024
Token and
Token ID290
Feature activation+2.91e+00

Pos logit contributions
o+0.032
ll+0.027
ossible+0.025
om+0.024
rou+0.024
Neg logit contributions
ured-0.052
zed-0.049
tered-0.048
zing-0.047
comb-0.047
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
o+0.080
ll+0.069
ossible+0.066
om+0.063
rou+0.063
Neg logit contributions
ured-0.095
zed-0.088
tered-0.085
zing-0.085
comb-0.085
Token ground
Token ID2323
Feature activation+2.09e+00

Pos logit contributions
o+0.047
ll+0.042
ossible+0.039
rou+0.038
om+0.038
Neg logit contributions
ured-0.041
zed-0.037
zing-0.037
tered-0.036
comb-0.036
Token her
Token ID607
Feature activation-6.17e-01

Pos logit contributions
ured+0.051
zed+0.047
zing+0.046
ued+0.045
comb+0.045
Neg logit contributions
o-0.046
ll-0.037
ossible-0.037
om-0.034
 dad-0.033
Token.
Token ID13
Feature activation+4.09e-01

Pos logit contributions
ured+0.025
zed+0.024
tered+0.023
zing+0.023
comb+0.022
Neg logit contributions
o-0.013
ll-0.010
ossible-0.009
om-0.009
rou-0.008
Token He
Token ID679
Feature activation+1.05e+00

Pos logit contributions
o+0.012
ll+0.011
ossible+0.010
rou+0.010
om+0.010
Neg logit contributions
ured-0.012
zed-0.011
tered-0.011
zing-0.011
comb-0.011
Token remembered
Token ID12086
Feature activation+2.13e+00

Pos logit contributions
o+0.033
ll+0.030
ossible+0.029
rou+0.028
om+0.028
Neg logit contributions
ured-0.030
zed-0.027
tered-0.027
zing-0.026
isy-0.025
Token what
Token ID644
Feature activation+3.30e-01

Pos logit contributions
o+0.064
ll+0.057
ossible+0.055
om+0.054
rou+0.052
Neg logit contributions
ured-0.062
zed-0.059
zing-0.056
comb-0.055
tered-0.055
Token Mom
Token ID11254
Feature activation+5.55e+00

Pos logit contributions
o+0.009
ll+0.008
ossible+0.007
rou+0.007
om+0.007
Neg logit contributions
ured-0.011
zed-0.010
tered-0.010
zing-0.010
comb-0.010
Token said
Token ID531
Feature activation+1.30e-01

Pos logit contributions
o+0.139
ossible+0.135
rou+0.125
om+0.121
ll+0.121
Neg logit contributions
ured-0.167
isy-0.160
tered-0.158
zed-0.157
comb-0.155
Token.
Token ID13
Feature activation+7.37e-01

Pos logit contributions
o+0.003
ll+0.002
ossible+0.002
rou+0.002
om+0.002
Neg logit contributions
ured-0.005
zed-0.005
tered-0.005
zing-0.005
comb-0.005
Token He
Token ID679
Feature activation+1.77e+00

Pos logit contributions
o+0.021
ll+0.020
rou+0.019
ossible+0.018
om+0.017
Neg logit contributions
ured-0.022
tered-0.020
zed-0.020
ger-0.019
zing-0.019
Token ran
Token ID4966
Feature activation+3.43e-01

Pos logit contributions
o+0.052
ll+0.046
ossible+0.045
om+0.042
rou+0.042
Neg logit contributions
ured-0.054
zed-0.049
tered-0.048
zing-0.048
comb-0.047
Token to
Token ID284
Feature activation-5.89e-01

Pos logit contributions
o+0.006
ll+0.004
ossible+0.004
rou+0.004
om+0.004
Neg logit contributions
ured-0.015
zed-0.014
tered-0.014
zing-0.014
comb-0.014
Token go
Token ID467
Feature activation+2.20e+00

Pos logit contributions
ured+0.045
zed+0.043
zing+0.041
tered+0.040
ued+0.039
Neg logit contributions
o-0.038
ll-0.033
ossible-0.032
om-0.028
rou-0.028
Token to
Token ID284
Feature activation-6.02e-01

Pos logit contributions
o+0.052
ll+0.043
ossible+0.041
rou+0.040
om+0.040
Neg logit contributions
ured-0.081
zed-0.074
zing-0.073
comb-0.072
tered-0.072
Token the
Token ID262
Feature activation+8.31e-01

Pos logit contributions
ured+0.019
zed+0.018
zing+0.017
comb+0.017
tered+0.017
Neg logit contributions
o-0.017
ll-0.015
rou-0.013
om-0.013
ossible-0.013
Token airport
Token ID9003
Feature activation+1.63e+00

Pos logit contributions
o+0.020
ll+0.017
ossible+0.016
om+0.015
rou+0.015
Neg logit contributions
ured-0.030
zed-0.028
tered-0.027
zing-0.027
comb-0.027
Token,
Token ID11
Feature activation+9.69e-01

Pos logit contributions
o+0.040
ll+0.034
om+0.033
rou+0.032
ossible+0.032
Neg logit contributions
ured-0.056
zed-0.052
comb-0.051
tered-0.051
zing-0.050
Token Mom
Token ID11254
Feature activation+5.54e+00

Pos logit contributions
o+0.021
ll+0.017
ossible+0.016
om+0.016
rou+0.015
Neg logit contributions
ured-0.038
zed-0.035
tered-0.034
comb-0.034
zing-0.034
Tokenmy
Token ID1820
Feature activation+3.87e+00

Pos logit contributions
o+0.087
ossible+0.086
om+0.084
rou+0.084
ipt+0.079
Neg logit contributions
est-0.191
ured-0.187
isy-0.181
le-0.180
comb-0.179
Token?"
Token ID1701
Feature activation+1.80e+00

Pos logit contributions
o+0.091
ossible+0.077
ll+0.076
rou+0.076
om+0.075
Neg logit contributions
ured-0.131
zed-0.122
zing-0.120
comb-0.119
tered-0.118
Token
Token ID198
Feature activation+6.46e-01

Pos logit contributions
o+0.037
ll+0.033
rou+0.033
om+0.028
ossible+0.027
Neg logit contributions
ured-0.066
zed-0.062
tered-0.061
oned-0.061
zing-0.060
Token
Token ID198
Feature activation-3.26e-02

Pos logit contributions
o+0.015
rou+0.014
ll+0.013
etchup+0.012
ucker+0.012
Neg logit contributions
zed-0.020
ured-0.019
zing-0.019
isy-0.019
oned-0.018
TokenMom
Token ID29252
Feature activation+4.57e+00

Pos logit contributions
ured+0.001
zed+0.001
tered+0.001
zing+0.001
isy+0.001
Neg logit contributions
o-0.001
ll-0.001
rou-0.001
om-0.001
ucker-0.001
Token"
Token ID1
Feature activation-8.66e-01

Pos logit contributions
o+0.012
ll+0.010
ossible+0.010
rou+0.009
om+0.009
Neg logit contributions
ured-0.014
zed-0.013
tered-0.013
zing-0.012
comb-0.012
TokenFollow
Token ID7155
Feature activation-2.15e-01

Pos logit contributions
ured+0.028
zed+0.025
tered+0.025
zing+0.025
comb+0.024
Neg logit contributions
o-0.025
ll-0.022
ossible-0.021
rou-0.020
om-0.020
Token me
Token ID502
Feature activation-2.88e-01

Pos logit contributions
ured+0.008
zed+0.007
tered+0.007
zing+0.007
comb+0.007
Neg logit contributions
o-0.005
ll-0.004
ossible-0.004
om-0.004
rou-0.004
Token,"
Token ID553
Feature activation+8.86e-01

Pos logit contributions
ured+0.010
zed+0.009
tered+0.009
zing+0.009
comb+0.009
Neg logit contributions
o-0.008
ll-0.006
ossible-0.006
rou-0.006
om-0.006
Token the
Token ID262
Feature activation+1.51e+00

Pos logit contributions
o+0.019
ll+0.015
ossible+0.014
rou+0.014
om+0.013
Neg logit contributions
ured-0.035
zed-0.033
tered-0.032
comb-0.032
zing-0.032
Token sun
Token ID4252
Feature activation+5.51e+00

Pos logit contributions
o+0.046
ll+0.039
ossible+0.038
om+0.036
rou+0.035
Neg logit contributions
ured-0.048
zed-0.045
tered-0.043
zing-0.043
comb-0.042
Tokenflower
Token ID25547
Feature activation+4.12e+00

Pos logit contributions
o+0.157
ll+0.136
ossible+0.132
rou+0.126
om+0.125
Neg logit contributions
ured-0.173
zed-0.159
comb-0.155
zing-0.154
tered-0.154
Token says
Token ID1139
Feature activation-6.56e-01

Pos logit contributions
o+0.090
ll+0.075
ossible+0.070
rou+0.067
om+0.066
Neg logit contributions
ured-0.158
zed-0.147
comb-0.144
tered-0.143
zing-0.143
Token.
Token ID13
Feature activation+2.54e+00

Pos logit contributions
ured+0.024
tered+0.022
zed+0.022
zing+0.021
comb+0.021
Neg logit contributions
o-0.016
ll-0.014
ossible-0.013
rou-0.012
om-0.012
Token He
Token ID679
Feature activation+2.61e+00

Pos logit contributions
o+0.063
ll+0.053
ossible+0.051
om+0.048
rou+0.047
Neg logit contributions
ured-0.092
zed-0.085
zing-0.082
tered-0.082
comb-0.082
Token bends
Token ID44370
Feature activation+1.07e+00

Pos logit contributions
o+0.079
ll+0.067
ossible+0.065
rou+0.061
om+0.061
Neg logit contributions
ured-0.080
zed-0.075
zing-0.071
tered-0.069
ered-0.069
Token scared
Token ID12008
Feature activation+1.25e+00

Pos logit contributions
o+0.016
ll+0.014
ossible+0.014
rou+0.013
om+0.013
Neg logit contributions
ured-0.017
zed-0.016
tered-0.015
zing-0.015
comb-0.015
Token.
Token ID13
Feature activation+2.82e-01

Pos logit contributions
o+0.032
ll+0.028
ossible+0.027
rou+0.026
om+0.025
Neg logit contributions
ured-0.042
tered-0.039
zing-0.038
zed-0.038
isy-0.038
Token She
Token ID1375
Feature activation+6.82e-01

Pos logit contributions
o+0.010
ll+0.009
rou+0.009
ossible+0.009
etchup+0.008
Neg logit contributions
ured-0.006
zed-0.006
tered-0.005
comb-0.005
zing-0.005
Token remembers
Token ID18140
Feature activation+1.88e+00

Pos logit contributions
o+0.017
ll+0.015
ossible+0.014
om+0.013
rou+0.013
Neg logit contributions
ured-0.024
zed-0.022
comb-0.022
tered-0.022
zing-0.022
Token what
Token ID644
Feature activation-1.66e-01

Pos logit contributions
o+0.055
ll+0.048
ossible+0.046
om+0.044
rou+0.043
Neg logit contributions
ured-0.059
zed-0.054
zing-0.052
tered-0.052
comb-0.052
Token Mom
Token ID11254
Feature activation+5.51e+00

Pos logit contributions
ured+0.006
zed+0.005
comb+0.005
zing+0.005
tered+0.005
Neg logit contributions
o-0.004
ll-0.004
ossible-0.003
rou-0.003
om-0.003
Token said
Token ID531
Feature activation+4.68e-01

Pos logit contributions
o+0.121
ossible+0.121
rou+0.114
ipt+0.111
om+0.105
Neg logit contributions
ured-0.166
isy-0.161
ered-0.158
le-0.156
 happen-0.153
Token.
Token ID13
Feature activation+6.96e-01

Pos logit contributions
o+0.010
ll+0.008
ossible+0.007
om+0.007
rou+0.007
Neg logit contributions
ured-0.019
zed-0.017
zing-0.017
tered-0.017
comb-0.017
Token The
Token ID383
Feature activation+1.76e+00

Pos logit contributions
o+0.021
ll+0.020
rou+0.018
ossible+0.018
etchup+0.017
Neg logit contributions
ured-0.019
zed-0.017
comb-0.017
tered-0.017
 Kerry-0.016
Token rail
Token ID6787
Feature activation+9.58e-01

Pos logit contributions
o+0.056
ll+0.049
ossible+0.047
rou+0.045
om+0.044
Neg logit contributions
ured-0.051
zed-0.046
tered-0.045
zing-0.045
comb-0.044
Token is
Token ID318
Feature activation+3.58e-01

Pos logit contributions
o+0.022
ll+0.018
ossible+0.017
om+0.016
rou+0.016
Neg logit contributions
ured-0.036
zed-0.034
tered-0.033
zing-0.033
comb-0.033
Token
Token ID198
Feature activation+6.58e-03

Pos logit contributions
o+0.019
ll+0.018
rou+0.018
om+0.016
ety+0.015
Neg logit contributions
zed-0.025
ured-0.025
zing-0.024
tered-0.024
isy-0.024
TokenMom
Token ID29252
Feature activation+4.45e+00

Pos logit contributions
o+0.000
ll+0.000
rou+0.000
om+0.000
ucker+0.000
Neg logit contributions
ured-0.000
tered-0.000
isy-0.000
zed-0.000
zing-0.000
Token smiles
Token ID21845
Feature activation+3.75e+00

Pos logit contributions
o+0.120
packs+0.110
om+0.109
ll+0.108
ossible+0.107
Neg logit contributions
isy-0.110
ured-0.106
ered-0.103
nob-0.101
le-0.100
Token and
Token ID290
Feature activation+2.70e+00

Pos logit contributions
o+0.101
ll+0.094
etchup+0.085
rou+0.082
om+0.081
Neg logit contributions
ured-0.115
tered-0.113
zing-0.107
est-0.107
ze-0.105
Token says
Token ID1139
Feature activation+1.07e+00

Pos logit contributions
o+0.081
ll+0.070
om+0.069
ossible+0.068
rou+0.064
Neg logit contributions
ured-0.081
zed-0.072
zing-0.070
tered-0.069
isy-0.067
Token:
Token ID25
Feature activation+5.48e+00

Pos logit contributions
o+0.034
ll+0.030
ossible+0.029
rou+0.027
etchup+0.027
Neg logit contributions
ured-0.031
zed-0.029
zing-0.028
tered-0.028
isy-0.027
Token
Token ID198
Feature activation+2.39e+00

Pos logit contributions
o+0.127
ll+0.114
ossible+0.105
rou+0.103
etchup+0.100
Neg logit contributions
ured-0.189
zed-0.183
zing-0.177
tered-0.176
oned-0.174
Token
Token ID198
Feature activation+1.55e+00

Pos logit contributions
o+0.051
ll+0.044
rou+0.044
om+0.040
ucker+0.038
Neg logit contributions
ured-0.081
zed-0.076
oned-0.076
isy-0.075
zing-0.075
Token"
Token ID1
Feature activation+6.92e-02

Pos logit contributions
o+0.048
ll+0.044
om+0.043
rou+0.043
ucker+0.041
Neg logit contributions
ured-0.039
isy-0.034
zed-0.033
comb-0.033
oned-0.033
TokenI
Token ID40
Feature activation-3.55e-01

Pos logit contributions
o+0.002
om+0.002
ossible+0.002
rou+0.002
aj+0.002
Neg logit contributions
ured-0.002
tered-0.002
isy-0.001
zed-0.001
comb-0.001
Token love
Token ID1842
Feature activation+4.31e+00

Pos logit contributions
ured+0.009
tered+0.008
zed+0.008
comb+0.008
zing+0.008
Neg logit contributions
o-0.011
ll-0.010
om-0.010
rou-0.009
etchup-0.009
Token smiled
Token ID13541
Feature activation+3.41e+00

Pos logit contributions
o+0.120
om+0.113
ossible+0.113
ll+0.112
rou+0.109
Neg logit contributions
isy-0.102
ured-0.099
tered-0.098
ered-0.094
zed-0.093
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
o+0.093
ll+0.083
ossible+0.075
om+0.074
etchup+0.074
Neg logit contributions
ured-0.111
tered-0.104
est-0.101
zing-0.098
comb-0.097
Token She
Token ID1375
Feature activation+3.00e+00

Pos logit contributions
o+0.038
ll+0.035
ossible+0.032
rou+0.032
om+0.031
Neg logit contributions
ured-0.028
tered-0.024
zed-0.024
zing-0.023
comb-0.023
Token said
Token ID531
Feature activation+1.28e+00

Pos logit contributions
o+0.077
ll+0.066
ossible+0.066
om+0.064
rou+0.061
Neg logit contributions
ured-0.099
zed-0.094
tered-0.092
isy-0.091
comb-0.090
Token she
Token ID673
Feature activation+4.34e+00

Pos logit contributions
o+0.036
ll+0.031
ossible+0.030
rou+0.028
om+0.028
Neg logit contributions
ured-0.041
zed-0.038
zing-0.037
tered-0.037
isy-0.036
Token loved
Token ID6151
Feature activation+5.47e+00

Pos logit contributions
o+0.123
ossible+0.112
ll+0.107
om+0.106
rou+0.101
Neg logit contributions
ured-0.127
zed-0.120
tered-0.119
zing-0.115
isy-0.114
Token them
Token ID606
Feature activation-1.05e+00

Pos logit contributions
o+0.193
ll+0.173
ossible+0.173
om+0.164
rou+0.161
Neg logit contributions
ured-0.128
zed-0.114
isy-0.112
tered-0.110
zing-0.106
Token too
Token ID1165
Feature activation-5.21e-01

Pos logit contributions
ured+0.042
tered+0.038
zed+0.038
comb+0.038
zing+0.038
Neg logit contributions
o-0.021
ll-0.018
ossible-0.016
om-0.015
rou-0.015
Token.
Token ID13
Feature activation+7.86e-01

Pos logit contributions
ured+0.019
tered+0.017
zed+0.017
zing+0.017
comb+0.017
Neg logit contributions
o-0.013
ll-0.011
ossible-0.010
rou-0.010
om-0.009
Token She
Token ID1375
Feature activation+3.00e+00

Pos logit contributions
o+0.027
ll+0.025
ossible+0.023
rou+0.023
om+0.023
Neg logit contributions
ured-0.020
tered-0.018
zed-0.017
zing-0.017
comb-0.017
Token said
Token ID531
Feature activation+8.31e-01

Pos logit contributions
o+0.074
ll+0.062
ossible+0.062
om+0.060
rou+0.058
Neg logit contributions
ured-0.102
zed-0.097
tered-0.095
isy-0.094
comb-0.093
Token to
Token ID284
Feature activation+2.71e-01

Pos logit contributions
o+0.008
ll+0.006
ossible+0.005
om+0.004
rou+0.004
Neg logit contributions
ured-0.024
zed-0.023
comb-0.023
tered-0.022
isy-0.022
Token come
Token ID1282
Feature activation+1.49e+00

Pos logit contributions
o+0.008
ll+0.007
ossible+0.007
om+0.006
rou+0.006
Neg logit contributions
ured-0.008
zed-0.008
zing-0.008
tered-0.007
comb-0.007
Token in
Token ID287
Feature activation+1.46e+00

Pos logit contributions
o+0.037
ll+0.031
ossible+0.030
rou+0.028
om+0.028
Neg logit contributions
ured-0.054
zed-0.048
tered-0.048
zing-0.048
comb-0.048
Token when
Token ID618
Feature activation+1.95e+00

Pos logit contributions
o+0.036
ll+0.030
rou+0.027
om+0.026
ossible+0.026
Neg logit contributions
ured-0.054
zed-0.050
tered-0.049
zing-0.049
comb-0.048
Token the
Token ID262
Feature activation+1.41e+00

Pos logit contributions
o+0.041
ll+0.034
ossible+0.032
om+0.030
rou+0.029
Neg logit contributions
ured-0.077
zed-0.071
tered-0.070
zing-0.070
comb-0.070
Token sun
Token ID4252
Feature activation+5.46e+00

Pos logit contributions
o+0.032
ossible+0.027
ll+0.026
om+0.023
rou+0.023
Neg logit contributions
ured-0.054
zed-0.051
tered-0.049
zing-0.049
comb-0.049
Token was
Token ID373
Feature activation+7.61e-02

Pos logit contributions
o+0.145
ll+0.122
ossible+0.115
om+0.111
rou+0.108
Neg logit contributions
ured-0.190
zed-0.174
zing-0.169
tered-0.169
comb-0.167
Token low
Token ID1877
Feature activation+4.67e-01

Pos logit contributions
o+0.002
ll+0.002
ossible+0.002
om+0.002
rou+0.002
Neg logit contributions
ured-0.003
zed-0.002
tered-0.002
zing-0.002
comb-0.002
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
o+0.010
ll+0.008
ossible+0.008
om+0.007
rou+0.007
Neg logit contributions
ured-0.018
zed-0.017
zing-0.017
comb-0.017
isy-0.017
Token
Token ID198
Feature activation+1.73e+00

Pos logit contributions
o+0.035
ll+0.030
rou+0.029
ossible+0.026
om+0.024
Neg logit contributions
ured-0.061
zed-0.057
tered-0.056
zing-0.055
oned-0.054
Token
Token ID198
Feature activation+7.07e-01

Pos logit contributions
o+0.036
rou+0.030
ll+0.029
etchup+0.026
ety+0.025
Neg logit contributions
zed-0.064
ured-0.063
tered-0.060
zing-0.060
ued-0.060
Token
Token ID198
Feature activation+2.11e+00

Pos logit contributions
o+0.048
ll+0.041
rou+0.041
ossible+0.038
om+0.034
Neg logit contributions
ured-0.092
tered-0.085
zed-0.085
comb-0.084
oned-0.083
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
o+0.042
rou+0.036
ll+0.035
ossible+0.032
etchup+0.031
Neg logit contributions
ured-0.076
zed-0.074
ered-0.071
zing-0.070
oned-0.070
Token"
Token ID1
Feature activation-4.65e-01

Pos logit contributions
o+0.034
ll+0.030
ossible+0.028
rou+0.027
om+0.027
Neg logit contributions
ured-0.041
zed-0.037
tered-0.036
zing-0.036
comb-0.036
TokenYes
Token ID5297
Feature activation-1.20e+00

Pos logit contributions
ured+0.014
zed+0.013
tered+0.012
zing+0.012
comb+0.012
Neg logit contributions
o-0.014
ll-0.012
ossible-0.012
om-0.011
rou-0.011
Token,
Token ID11
Feature activation+1.99e+00

Pos logit contributions
ured+0.046
zed+0.043
zing+0.042
comb+0.042
tered+0.041
Neg logit contributions
o-0.026
ll-0.022
ossible-0.022
om-0.020
rou-0.019
Token Mom
Token ID11254
Feature activation+5.45e+00

Pos logit contributions
o+0.055
ll+0.046
ossible+0.045
om+0.043
rou+0.043
Neg logit contributions
ured-0.066
zed-0.060
tered-0.059
zing-0.059
isy-0.059
Token,
Token ID11
Feature activation+2.70e+00

Pos logit contributions
o+0.143
rou+0.138
ll+0.137
ipt+0.137
ossible+0.136
Neg logit contributions
est-0.142
isy-0.132
ured-0.128
ered-0.126
comb-0.125
Token you
Token ID345
Feature activation-1.47e+00

Pos logit contributions
o+0.066
ll+0.054
ossible+0.053
om+0.050
rou+0.050
Neg logit contributions
ured-0.099
zed-0.091
tered-0.089
zing-0.089
comb-0.088
Token can
Token ID460
Feature activation+7.75e-01

Pos logit contributions
ured+0.050
tered+0.046
zed+0.046
zing+0.046
comb+0.045
Neg logit contributions
o-0.038
ll-0.032
rou-0.030
ossible-0.030
om-0.029
Token play
Token ID711
Feature activation+1.38e+00

Pos logit contributions
o+0.021
ll+0.018
ossible+0.017
rou+0.016
om+0.016
Neg logit contributions
ured-0.026
zed-0.024
tered-0.024
zing-0.023
comb-0.023
Token!"
Token ID2474
Feature activation+9.15e-01

Pos logit contributions
o+0.028
ll+0.022
ossible+0.022
om+0.020
rou+0.019
Neg logit contributions
ured-0.055
zed-0.052
comb-0.051
zing-0.051
tered-0.050
Token dad
Token ID9955
Feature activation+2.60e+00

Pos logit contributions
ured+0.012
zed+0.011
tered+0.011
 happen+0.010
growth+0.010
Neg logit contributions
o-0.012
ll-0.012
rou-0.011
aj-0.010
ucker-0.010
Token.
Token ID13
Feature activation-4.75e-03

Pos logit contributions
o+0.068
ossible+0.059
ll+0.059
rou+0.058
om+0.056
Neg logit contributions
ured-0.085
zing-0.077
tered-0.077
zed-0.076
est-0.074
Token Dad
Token ID17415
Feature activation+3.43e+00

Pos logit contributions
ured+0.000
tered+0.000
zed+0.000
ger+0.000
zing+0.000
Neg logit contributions
o-0.000
ll-0.000
rou-0.000
etchup-0.000
om-0.000
Token said
Token ID531
Feature activation+1.48e+00

Pos logit contributions
o+0.104
ossible+0.092
ll+0.091
om+0.090
rou+0.088
Neg logit contributions
ured-0.089
zed-0.087
tered-0.086
ered-0.085
zing-0.084
Token he
Token ID339
Feature activation+4.27e+00

Pos logit contributions
o+0.039
ll+0.033
ossible+0.032
rou+0.031
om+0.031
Neg logit contributions
ured-0.049
zed-0.047
zing-0.046
tered-0.045
ered-0.043
Token loved
Token ID6151
Feature activation+5.45e+00

Pos logit contributions
o+0.121
ossible+0.116
om+0.107
ll+0.105
etchup+0.103
Neg logit contributions
ured-0.117
zed-0.113
tered-0.112
zing-0.109
ze-0.106
Token them
Token ID606
Feature activation-6.16e-01

Pos logit contributions
o+0.210
ll+0.198
ossible+0.191
om+0.191
rou+0.190
Neg logit contributions
 happen-0.090
ured-0.089
zed-0.088
 herself-0.084
ered-0.078
Token,
Token ID11
Feature activation+2.20e+00

Pos logit contributions
ured+0.023
tered+0.022
zed+0.021
zing+0.021
est+0.021
Neg logit contributions
o-0.013
ll-0.011
ossible-0.011
rou-0.010
om-0.010
Token but
Token ID475
Feature activation+7.94e-01

Pos logit contributions
o+0.081
ll+0.073
ossible+0.072
om+0.071
rou+0.069
Neg logit contributions
ured-0.046
tered-0.043
zed-0.042
isy-0.040
zing-0.039
Token they
Token ID484
Feature activation+1.35e+00

Pos logit contributions
o+0.023
ossible+0.020
ll+0.019
om+0.019
rou+0.019
Neg logit contributions
ured-0.024
tered-0.022
zed-0.022
zing-0.021
isy-0.021
Token had
Token ID550
Feature activation+1.84e+00

Pos logit contributions
o+0.038
ll+0.032
ossible+0.032
om+0.031
rou+0.030
Neg logit contributions
ured-0.043
zed-0.040
tered-0.039
zing-0.038
comb-0.038
TokenCan
Token ID6090
Feature activation-3.40e-01

Pos logit contributions
ured+0.078
zed+0.071
zing+0.069
tered+0.068
comb+0.068
Neg logit contributions
o-0.083
ll-0.072
ossible-0.070
om-0.066
rou-0.066
Token we
Token ID356
Feature activation-1.09e+00

Pos logit contributions
ured+0.011
zed+0.011
tered+0.010
zing+0.010
comb+0.010
Neg logit contributions
o-0.009
rou-0.008
ll-0.008
om-0.007
ossible-0.007
Token go
Token ID467
Feature activation+2.32e+00

Pos logit contributions
ured+0.033
zed+0.030
tered+0.029
zing+0.029
comb+0.029
Neg logit contributions
o-0.033
ll-0.029
ossible-0.028
rou-0.027
om-0.027
Token now
Token ID783
Feature activation+1.02e-01

Pos logit contributions
o+0.058
rou+0.052
ll+0.051
ossible+0.050
om+0.050
Neg logit contributions
ured-0.074
zed-0.067
zing-0.066
ered-0.065
tered-0.065
Token,
Token ID11
Feature activation+1.08e+00

Pos logit contributions
o+0.002
ll+0.002
ossible+0.002
rou+0.002
om+0.002
Neg logit contributions
ured-0.004
zed-0.003
zing-0.003
tered-0.003
comb-0.003
Token Mom
Token ID11254
Feature activation+5.45e+00

Pos logit contributions
o+0.022
ll+0.018
ossible+0.017
rou+0.016
om+0.016
Neg logit contributions
ured-0.044
zed-0.040
tered-0.040
comb-0.040
zing-0.040
Token?"
Token ID1701
Feature activation+1.50e+00

Pos logit contributions
rou+0.137
o+0.133
ossible+0.131
ll+0.128
ipt+0.127
Neg logit contributions
est-0.142
ured-0.140
le-0.135
 happen-0.132
isy-0.129
Token Lily
Token ID20037
Feature activation+1.67e+00

Pos logit contributions
o+0.036
ll+0.029
ossible+0.029
om+0.027
rou+0.026
Neg logit contributions
ured-0.056
zed-0.052
zing-0.050
comb-0.050
tered-0.050
Token asked
Token ID1965
Feature activation+3.45e-01

Pos logit contributions
o+0.053
ll+0.049
rou+0.048
ossible+0.047
etchup+0.044
Neg logit contributions
ured-0.045
tered-0.041
isy-0.039
ze-0.038
comb-0.038
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
o+0.009
ll+0.008
ossible+0.008
rou+0.008
om+0.007
Neg logit contributions
ured-0.012
tered-0.011
oned-0.010
zing-0.010
ered-0.010
Token
Token ID198
Feature activation+1.19e+00

Pos logit contributions
o+0.035
rou+0.032
ll+0.032
ossible+0.028
om+0.026
Neg logit contributions
ured-0.059
tered-0.056
oned-0.055
zed-0.054
comb-0.053
Token kissed
Token ID27946
Feature activation+3.43e+00

Pos logit contributions
o+0.061
ll+0.052
om+0.051
rou+0.051
ossible+0.050
Neg logit contributions
ured-0.058
tered-0.054
zed-0.053
ered-0.052
isy-0.051
Token them
Token ID606
Feature activation+5.97e-02

Pos logit contributions
o+0.101
ll+0.090
rou+0.083
ossible+0.083
om+0.080
Neg logit contributions
ured-0.107
zed-0.098
tered-0.097
zing-0.094
comb-0.092
Token and
Token ID290
Feature activation+2.63e+00

Pos logit contributions
o+0.001
ll+0.001
rou+0.001
ossible+0.001
om+0.001
Neg logit contributions
ured-0.002
tered-0.002
zed-0.002
zing-0.002
comb-0.002
Token said
Token ID531
Feature activation+1.92e+00

Pos logit contributions
o+0.070
ll+0.062
om+0.058
rou+0.057
ossible+0.057
Neg logit contributions
ured-0.086
zed-0.080
tered-0.079
comb-0.077
zing-0.076
Token they
Token ID484
Feature activation+2.77e+00

Pos logit contributions
o+0.047
ll+0.039
ossible+0.038
rou+0.035
om+0.035
Neg logit contributions
ured-0.070
zed-0.066
zing-0.064
tered-0.064
comb-0.062
Token loved
Token ID6151
Feature activation+5.44e+00

Pos logit contributions
o+0.083
ll+0.071
ossible+0.070
om+0.068
rou+0.067
Neg logit contributions
ured-0.082
zed-0.077
tered-0.076
zing-0.074
comb-0.073
Token them
Token ID606
Feature activation-5.95e-01

Pos logit contributions
o+0.189
ll+0.174
ossible+0.173
rou+0.160
om+0.158
Neg logit contributions
ured-0.126
zed-0.113
tered-0.110
zing-0.105
 happen-0.105
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
ured+0.022
tered+0.021
zed+0.020
comb+0.020
zing+0.020
Neg logit contributions
o-0.013
ll-0.011
ossible-0.010
om-0.010
rou-0.010
Token They
Token ID1119
Feature activation+1.34e+00

Pos logit contributions
o+0.031
ll+0.028
rou+0.025
ossible+0.024
om+0.023
Neg logit contributions
ured-0.042
tered-0.038
zed-0.037
zing-0.036
ger-0.035
Token said
Token ID531
Feature activation+1.12e+00

Pos logit contributions
o+0.033
ll+0.027
ossible+0.026
om+0.026
rou+0.025
Neg logit contributions
ured-0.047
zed-0.044
tered-0.044
comb-0.043
isy-0.042
Token they
Token ID484
Feature activation+2.44e+00

Pos logit contributions
o+0.026
ll+0.021
ossible+0.020
rou+0.019
om+0.019
Neg logit contributions
ured-0.042
zed-0.040
zing-0.039
tered-0.039
comb-0.038
Token She
Token ID1375
Feature activation+9.36e-01

Pos logit contributions
ured+0.002
zed+0.001
tered+0.001
comb+0.001
zing+0.001
Neg logit contributions
o-0.002
ll-0.002
ossible-0.002
rou-0.002
om-0.002
Token does
Token ID857
Feature activation-6.24e-01

Pos logit contributions
o+0.025
ll+0.021
ossible+0.020
om+0.019
rou+0.018
Neg logit contributions
ured-0.032
zed-0.030
zing-0.029
tered-0.029
comb-0.029
Token not
Token ID407
Feature activation+4.66e-01

Pos logit contributions
ured+0.026
tered+0.024
comb+0.024
est+0.024
zed+0.024
Neg logit contributions
o-0.010
ll-0.008
ossible-0.008
om-0.007
rou-0.007
Token know
Token ID760
Feature activation+1.40e+00

Pos logit contributions
o+0.013
ossible+0.011
ll+0.010
 dad+0.009
om+0.009
Neg logit contributions
ured-0.016
zed-0.015
zing-0.014
ued-0.014
tered-0.014
Token that
Token ID326
Feature activation+2.15e+00

Pos logit contributions
o+0.037
ll+0.031
ossible+0.029
rou+0.028
om+0.028
Neg logit contributions
ured-0.049
zed-0.044
tered-0.044
zing-0.043
comb-0.043
Token Mom
Token ID11254
Feature activation+5.43e+00

Pos logit contributions
o+0.056
ll+0.045
ossible+0.044
om+0.042
rou+0.041
Neg logit contributions
ured-0.077
zed-0.070
zing-0.068
tered-0.068
ued-0.067
Token and
Token ID290
Feature activation+1.92e+00

Pos logit contributions
o+0.114
ossible+0.108
rou+0.097
ll+0.096
om+0.093
Neg logit contributions
ured-0.186
zed-0.177
tered-0.174
zing-0.173
isy-0.173
Token Dad
Token ID17415
Feature activation+3.02e+00

Pos logit contributions
o+0.035
ll+0.028
om+0.026
rou+0.025
ounce+0.023
Neg logit contributions
ured-0.077
zed-0.074
comb-0.072
tered-0.071
zing-0.070
Token know
Token ID760
Feature activation+1.51e+00

Pos logit contributions
o+0.085
ll+0.073
ossible+0.072
rou+0.069
om+0.068
Neg logit contributions
ured-0.096
zed-0.088
tered-0.086
zing-0.086
comb-0.085
Token.
Token ID13
Feature activation+9.44e-02

Pos logit contributions
o+0.036
ll+0.030
ossible+0.028
rou+0.027
om+0.026
Neg logit contributions
ured-0.055
zed-0.050
zing-0.050
ued-0.049
tered-0.049
Token They
Token ID1119
Feature activation+5.16e-01

Pos logit contributions
o+0.003
ll+0.003
ossible+0.003
rou+0.003
om+0.003
Neg logit contributions
ured-0.003
zed-0.002
tered-0.002
comb-0.002
zing-0.002
Token they
Token ID484
Feature activation-3.01e-01

Pos logit contributions
ured+0.042
zed+0.040
zing+0.040
isy+0.039
comb+0.039
Neg logit contributions
o-0.032
ossible-0.025
ll-0.025
om-0.024
 dad-0.024
Token did
Token ID750
Feature activation-4.85e-01

Pos logit contributions
ured+0.011
zed+0.011
zing+0.011
est+0.010
tered+0.010
Neg logit contributions
o-0.007
ll-0.006
ossible-0.006
om-0.005
rou-0.005
Token not
Token ID407
Feature activation+5.85e-01

Pos logit contributions
ured+0.019
zed+0.018
tered+0.017
zing+0.017
comb+0.017
Neg logit contributions
o-0.011
ll-0.009
ossible-0.008
om-0.008
rou-0.008
Token know
Token ID760
Feature activation+1.55e+00

Pos logit contributions
o+0.019
ll+0.017
ossible+0.016
om+0.015
rou+0.015
Neg logit contributions
ured-0.017
zed-0.015
tered-0.015
zing-0.015
comb-0.015
Token that
Token ID326
Feature activation+2.18e+00

Pos logit contributions
o+0.042
ll+0.036
ossible+0.034
rou+0.033
om+0.033
Neg logit contributions
ured-0.052
zed-0.048
tered-0.047
zing-0.047
comb-0.046
Token Mom
Token ID11254
Feature activation+5.43e+00

Pos logit contributions
o+0.048
ll+0.038
ossible+0.037
om+0.035
rou+0.034
Neg logit contributions
ured-0.085
zed-0.079
zing-0.078
tered-0.077
comb-0.077
Token had
Token ID550
Feature activation+2.68e+00

Pos logit contributions
o+0.130
ll+0.110
ossible+0.110
rou+0.101
om+0.100
Neg logit contributions
ured-0.194
zed-0.180
tered-0.177
zing-0.176
comb-0.175
Token turned
Token ID2900
Feature activation+2.12e+00

Pos logit contributions
o+0.076
ll+0.067
ossible+0.060
rou+0.060
om+0.057
Neg logit contributions
ured-0.084
zed-0.082
zing-0.076
ered-0.076
tered-0.076
Token on
Token ID319
Feature activation-8.99e-01

Pos logit contributions
o+0.056
ossible+0.045
ll+0.045
om+0.045
rou+0.043
Neg logit contributions
ured-0.073
zed-0.068
comb-0.067
isy-0.066
zing-0.065
Token the
Token ID262
Feature activation+6.37e-01

Pos logit contributions
ured+0.035
zed+0.032
comb+0.032
isy+0.032
zing+0.031
Neg logit contributions
o-0.019
ossible-0.015
ll-0.015
om-0.015
rou-0.014
Token oven
Token ID14361
Feature activation+9.83e-01

Pos logit contributions
o+0.016
ll+0.014
ossible+0.014
om+0.013
rou+0.013
Neg logit contributions
ured-0.022
zed-0.020
tered-0.019
comb-0.019
zing-0.019
Token the
Token ID262
Feature activation+5.64e-02

Pos logit contributions
o+0.006
ll+0.005
ossible+0.005
rou+0.005
om+0.004
Neg logit contributions
ured-0.009
zed-0.008
comb-0.008
tered-0.008
zing-0.008
Token brav
Token ID49025
Feature activation-3.02e+00

Pos logit contributions
o+0.002
ll+0.001
ossible+0.001
om+0.001
rou+0.001
Neg logit contributions
ured-0.002
comb-0.002
zed-0.002
isy-0.002
zing-0.002
Tokenest
Token ID395
Feature activation-1.15e+00

Pos logit contributions
ured+0.018
zed+0.010
tered+0.007
zing+0.007
comb+0.006
Neg logit contributions
o-0.165
ll-0.154
ossible-0.151
om-0.147
rou-0.147
Token and
Token ID290
Feature activation+2.67e-01

Pos logit contributions
ured+0.043
zed+0.040
tered+0.038
zing+0.038
comb+0.038
Neg logit contributions
o-0.028
ll-0.023
ossible-0.022
rou-0.022
om-0.021
Token boun
Token ID31283
Feature activation+1.59e-01

Pos logit contributions
o+0.008
ll+0.007
ossible+0.007
rou+0.006
om+0.006
Neg logit contributions
ured-0.008
zed-0.008
tered-0.007
zing-0.007
comb-0.007
Tokenci
Token ID979
Feature activation-5.82e+00

Pos logit contributions
o+0.003
ll+0.003
ossible+0.003
om+0.003
rou+0.003
Neg logit contributions
ured-0.006
est-0.005
ger-0.005
comb-0.005
zed-0.005
Tokenest
Token ID395
Feature activation-2.00e+00

Pos logit contributions
ured+0.073
comb+0.055
zed+0.048
tered+0.047
zing+0.044
Neg logit contributions
o-0.291
ll-0.260
ossible-0.258
rou-0.250
om-0.247
Token bunny
Token ID44915
Feature activation-1.53e+00

Pos logit contributions
ured+0.077
zed+0.072
tered+0.070
zing+0.070
comb+0.069
Neg logit contributions
o-0.045
ll-0.037
ossible-0.034
rou-0.033
om-0.033
Token in
Token ID287
Feature activation+1.22e+00

Pos logit contributions
ured+0.055
zed+0.050
tered+0.050
comb+0.049
zing+0.049
Neg logit contributions
o-0.039
ll-0.032
ossible-0.029
rou-0.028
om-0.028
Token the
Token ID262
Feature activation-3.29e-01

Pos logit contributions
o+0.032
ll+0.028
ossible+0.026
rou+0.025
om+0.025
Neg logit contributions
ured-0.042
zed-0.039
zing-0.038
tered-0.038
comb-0.037
Token whole
Token ID2187
Feature activation+1.23e+00

Pos logit contributions
ured+0.012
zed+0.011
zing+0.011
tered+0.011
comb+0.011
Neg logit contributions
o-0.008
ll-0.007
ossible-0.007
om-0.006
rou-0.006
Token best
Token ID1266
Feature activation-1.13e+00

Pos logit contributions
ured+0.070
isy+0.067
zing+0.066
est+0.065
zed+0.065
Neg logit contributions
o-0.040
ll-0.033
ossible-0.032
rou-0.029
om-0.028
Token friends
Token ID2460
Feature activation-1.29e+00

Pos logit contributions
ured+0.040
zing+0.039
isy+0.038
zed+0.038
ze+0.037
Neg logit contributions
o-0.025
ll-0.021
rou-0.018
ossible-0.017
ucker-0.017
Token,
Token ID11
Feature activation-6.59e-01

Pos logit contributions
ured+0.037
zed+0.034
zing+0.033
isy+0.033
tered+0.033
Neg logit contributions
ll-0.034
o-0.034
rou-0.033
ucker-0.031
aj-0.030
Token Matt
Token ID4705
Feature activation-1.87e+00

Pos logit contributions
ured+0.019
tered+0.018
comb+0.017
isy+0.017
zed+0.017
Neg logit contributions
o-0.018
ll-0.018
rou-0.016
ossible-0.016
ucker-0.016
Token and
Token ID290
Feature activation+1.16e-01

Pos logit contributions
isy+0.064
ured+0.060
zed+0.060
est+0.059
zing+0.059
Neg logit contributions
o-0.039
ossible-0.035
rou-0.035
ll-0.035
etchup-0.035
Token Lynn
Token ID24868
Feature activation-5.79e+00

Pos logit contributions
o+0.003
ll+0.002
ossible+0.002
om+0.002
rou+0.002
Neg logit contributions
ured-0.004
zed-0.004
tered-0.004
zing-0.004
comb-0.004
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
ured+0.234
zed+0.218
comb+0.213
zing+0.213
tered+0.211
Neg logit contributions
o-0.120
ll-0.095
ossible-0.092
om-0.084
rou-0.082
Token They
Token ID1119
Feature activation-1.37e+00

Pos logit contributions
ured+0.034
zed+0.034
tered+0.033
nob+0.031
growth+0.031
Neg logit contributions
o-0.031
rou-0.030
ll-0.030
etchup-0.026
ucker-0.025
Token lived
Token ID5615
Feature activation+8.32e-01

Pos logit contributions
ured+0.053
zed+0.050
comb+0.049
zing+0.049
isy+0.047
Neg logit contributions
o-0.033
ossible-0.026
ll-0.026
om-0.023
rou-0.022
Token next
Token ID1306
Feature activation+1.11e+00

Pos logit contributions
o+0.027
ossible+0.026
ucker+0.026
ll+0.026
rou+0.025
Neg logit contributions
tered-0.016
ured-0.016
comb-0.014
ped-0.014
zing-0.014
Token door
Token ID3420
Feature activation-9.04e-01

Pos logit contributions
o+0.024
ucker+0.023
etchup+0.022
rou+0.022
ll+0.021
Neg logit contributions
ured-0.035
isy-0.033
zing-0.033
est-0.032
tered-0.032
Token to
Token ID284
Feature activation+1.31e-01

Pos logit contributions
o+0.044
ll+0.040
ossible+0.035
rou+0.030
etchup+0.029
Neg logit contributions
ured-0.099
zed-0.096
zing-0.095
ze-0.092
tered-0.090
Token cry
Token ID3960
Feature activation-1.21e+00

Pos logit contributions
o+0.004
ll+0.004
ossible+0.004
rou+0.003
om+0.003
Neg logit contributions
ured-0.004
zed-0.003
tered-0.003
zing-0.003
comb-0.003
Token and
Token ID290
Feature activation+3.12e-01

Pos logit contributions
ured+0.051
zed+0.048
comb+0.047
zing+0.047
ued+0.046
Neg logit contributions
o-0.022
ll-0.017
ossible-0.016
om-0.015
rou-0.015
Token threw
Token ID9617
Feature activation-3.58e-01

Pos logit contributions
o+0.008
ossible+0.007
ll+0.007
rou+0.006
om+0.006
Neg logit contributions
ured-0.011
zed-0.010
comb-0.010
zing-0.009
tered-0.009
Token a
Token ID257
Feature activation-6.42e-01

Pos logit contributions
ured+0.013
zed+0.012
comb+0.012
zing+0.012
isy+0.012
Neg logit contributions
o-0.009
ossible-0.007
om-0.007
 to-0.007
 dad-0.007
Token tant
Token ID24246
Feature activation-5.57e+00

Pos logit contributions
ured+0.020
zed+0.018
tered+0.018
zing+0.018
comb+0.018
Neg logit contributions
o-0.019
ll-0.016
ossible-0.016
rou-0.015
om-0.015
Tokenrum
Token ID6582
Feature activation-2.01e+00

Pos logit contributions
ured+0.100
comb+0.082
zed+0.082
tered+0.076
zing+0.076
Neg logit contributions
o-0.245
ll-0.213
om-0.209
ossible-0.207
rou-0.201
Token.
Token ID13
Feature activation-1.60e+00

Pos logit contributions
ured+0.086
zed+0.080
isy+0.078
comb+0.078
zing+0.078
Neg logit contributions
o-0.037
ll-0.026
rou-0.024
om-0.024
ossible-0.024
Token She
Token ID1375
Feature activation-8.30e-01

Pos logit contributions
ured+0.043
zed+0.039
comb+0.039
zing+0.038
isy+0.038
Neg logit contributions
o-0.053
ossible-0.045
ll-0.045
om-0.044
rou-0.043
Token didn
Token ID1422
Feature activation-9.52e-02

Pos logit contributions
ured+0.025
zed+0.023
tered+0.023
zing+0.022
comb+0.022
Neg logit contributions
o-0.025
ll-0.022
ossible-0.021
rou-0.020
om-0.020
Token't
Token ID470
Feature activation-6.97e-01

Pos logit contributions
ured+0.003
zed+0.002
comb+0.002
tered+0.002
zing+0.002
Neg logit contributions
o-0.003
ll-0.003
ossible-0.003
om-0.003
rou-0.003
Token Lily
Token ID20037
Feature activation-7.78e-01

Pos logit contributions
o+0.035
ll+0.030
ossible+0.028
om+0.026
rou+0.026
Neg logit contributions
ured-0.053
zed-0.049
zing-0.047
tered-0.047
comb-0.047
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
ured+0.022
tered+0.021
zed+0.021
isy+0.021
nob+0.021
Neg logit contributions
o-0.020
ll-0.020
ucker-0.018
rou-0.018
ossible-0.018
Token She
Token ID1375
Feature activation-9.15e-01

Pos logit contributions
ured+0.046
zed+0.042
zing+0.040
comb+0.040
tered+0.039
Neg logit contributions
o-0.046
ll-0.040
ossible-0.039
om-0.037
rou-0.036
Token loved
Token ID6151
Feature activation+3.19e+00

Pos logit contributions
ured+0.033
zed+0.029
comb+0.029
tered+0.028
zing+0.028
Neg logit contributions
o-0.024
ll-0.020
ossible-0.020
om-0.018
rou-0.018
Token l
Token ID300
Feature activation-1.63e+00

Pos logit contributions
o+0.061
ll+0.056
ossible+0.050
rou+0.049
om+0.047
Neg logit contributions
zed-0.117
ured-0.115
zing-0.114
ze-0.110
tered-0.107
Tokenoll
Token ID692
Feature activation-5.49e+00

Pos logit contributions
ured+0.040
zed+0.036
tered+0.034
zing+0.034
comb+0.033
Neg logit contributions
o-0.060
ll-0.054
ossible-0.052
om-0.051
rou-0.050
Tokenip
Token ID541
Feature activation-5.02e+00

Pos logit contributions
ured+0.186
zed+0.180
zing+0.174
isy+0.171
tered+0.170
Neg logit contributions
o-0.136
ossible-0.113
ll-0.112
rou-0.107
om-0.105
Tokenops
Token ID2840
Feature activation-3.39e+00

Pos logit contributions
ured+0.120
zed+0.104
tered+0.101
comb+0.100
zing+0.099
Neg logit contributions
o-0.188
ll-0.170
ossible-0.162
om-0.157
rou-0.156
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
ured+0.116
isy+0.110
comb+0.109
tered+0.108
zed+0.108
Neg logit contributions
o-0.080
ossible-0.068
ll-0.068
rou-0.067
om-0.064
Token especially
Token ID2592
Feature activation+1.94e+00

Pos logit contributions
ured+0.028
zed+0.026
tered+0.026
isy+0.025
zing+0.025
Neg logit contributions
o-0.033
ll-0.030
ossible-0.029
rou-0.028
om-0.027
Token the
Token ID262
Feature activation-9.15e-02

Pos logit contributions
o+0.053
ll+0.044
ossible+0.042
om+0.041
rou+0.040
Neg logit contributions
ured-0.067
zed-0.061
comb-0.059
tered-0.059
zing-0.059
Token sad
Token ID6507
Feature activation-7.00e-01

Pos logit contributions
ured+0.002
comb+0.002
zed+0.002
isy+0.002
growth+0.002
Neg logit contributions
o-0.001
ll-0.001
ossible-0.001
rou-0.001
om-0.001
Token.
Token ID13
Feature activation-5.37e-01

Pos logit contributions
ured+0.030
zed+0.029
comb+0.028
zing+0.028
ued+0.027
Neg logit contributions
o-0.012
ll-0.010
ossible-0.009
om-0.008
rou-0.008
Token She
Token ID1375
Feature activation-1.19e+00

Pos logit contributions
ured+0.017
zed+0.016
tered+0.016
zing+0.015
oned+0.015
Neg logit contributions
o-0.015
ll-0.013
rou-0.012
ossible-0.012
etchup-0.012
Token threw
Token ID9617
Feature activation-6.38e-01

Pos logit contributions
ured+0.042
comb+0.040
zed+0.039
zing+0.038
est+0.038
Neg logit contributions
o-0.030
ll-0.025
ossible-0.024
rou-0.022
om-0.022
Token a
Token ID257
Feature activation-8.95e-01

Pos logit contributions
ured+0.025
zed+0.024
comb+0.023
zing+0.023
isy+0.022
Neg logit contributions
o-0.014
ossible-0.011
om-0.011
ll-0.010
etchup-0.010
Token tant
Token ID24246
Feature activation-5.48e+00

Pos logit contributions
ured+0.027
zed+0.024
tered+0.024
zing+0.024
comb+0.023
Neg logit contributions
o-0.027
ll-0.024
ossible-0.023
rou-0.022
om-0.022
Tokenrum
Token ID6582
Feature activation-1.93e+00

Pos logit contributions
ured+0.099
comb+0.084
zed+0.081
tered+0.076
zing+0.075
Neg logit contributions
o-0.242
ll-0.209
om-0.206
ossible-0.202
rou-0.197
Token and
Token ID290
Feature activation-9.11e-02

Pos logit contributions
ured+0.081
zed+0.076
comb+0.074
zing+0.074
isy+0.074
Neg logit contributions
o-0.037
ll-0.028
ossible-0.026
om-0.025
rou-0.025
Token cried
Token ID16896
Feature activation-1.69e+00

Pos logit contributions
ured+0.003
zed+0.003
comb+0.003
zing+0.003
isy+0.003
Neg logit contributions
o-0.003
ll-0.002
ossible-0.002
om-0.002
rou-0.002
Token loudly
Token ID23112
Feature activation-6.74e-01

Pos logit contributions
ured+0.068
zed+0.066
comb+0.065
ued+0.064
isy+0.063
Neg logit contributions
o-0.031
om-0.023
rou-0.022
ossible-0.022
ll-0.021
Token.
Token ID13
Feature activation-4.31e-01

Pos logit contributions
ured+0.029
comb+0.028
zed+0.027
isy+0.027
ered+0.026
Neg logit contributions
o-0.011
om-0.008
ll-0.007
ossible-0.007
rou-0.007
Token Lily
Token ID20037
Feature activation-1.00e+00

Pos logit contributions
o+0.036
ll+0.030
ossible+0.029
om+0.027
rou+0.027
Neg logit contributions
ured-0.052
zed-0.048
zing-0.047
tered-0.046
comb-0.046
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
ured+0.028
tered+0.028
zed+0.028
isy+0.027
nob+0.027
Neg logit contributions
o-0.026
ll-0.025
rou-0.023
ucker-0.023
ossible-0.023
Token She
Token ID1375
Feature activation-9.74e-01

Pos logit contributions
ured+0.041
zed+0.037
tered+0.036
zing+0.036
comb+0.036
Neg logit contributions
o-0.046
ll-0.040
ossible-0.039
om-0.037
rou-0.037
Token loved
Token ID6151
Feature activation+3.21e+00

Pos logit contributions
ured+0.035
comb+0.031
zed+0.031
tered+0.031
zing+0.030
Neg logit contributions
o-0.026
ll-0.021
ossible-0.020
om-0.019
rou-0.019
Token l
Token ID300
Feature activation-1.59e+00

Pos logit contributions
o+0.062
ll+0.057
ossible+0.051
rou+0.050
om+0.048
Neg logit contributions
zed-0.117
ured-0.115
zing-0.114
ze-0.110
tered-0.107
Tokenoll
Token ID692
Feature activation-5.45e+00

Pos logit contributions
ured+0.038
zed+0.035
tered+0.033
zing+0.033
comb+0.032
Neg logit contributions
o-0.059
ll-0.053
ossible-0.051
om-0.050
rou-0.049
Tokenip
Token ID541
Feature activation-5.01e+00

Pos logit contributions
ured+0.188
zed+0.180
zing+0.175
tered+0.172
isy+0.172
Neg logit contributions
o-0.135
ossible-0.110
ll-0.110
rou-0.104
om-0.102
Tokenops
Token ID2840
Feature activation-3.38e+00

Pos logit contributions
ured+0.119
zed+0.103
tered+0.100
comb+0.100
zing+0.098
Neg logit contributions
o-0.188
ll-0.170
ossible-0.162
om-0.157
rou-0.156
Token and
Token ID290
Feature activation-5.09e-01

Pos logit contributions
ured+0.115
isy+0.110
comb+0.109
tered+0.109
zed+0.108
Neg logit contributions
o-0.079
ossible-0.067
rou-0.067
ll-0.067
om-0.063
Token would
Token ID561
Feature activation-1.23e+00

Pos logit contributions
ured+0.019
ued+0.017
comb+0.017
oned+0.017
zing+0.016
Neg logit contributions
o-0.013
ossible-0.010
ll-0.010
om-0.010
rou-0.009
Token always
Token ID1464
Feature activation+1.17e+00

Pos logit contributions
ured+0.040
zed+0.035
tered+0.035
comb+0.034
zing+0.034
Neg logit contributions
o-0.035
ossible-0.031
ll-0.031
om-0.027
rou-0.027
Token Lily
Token ID20037
Feature activation-8.84e-01

Pos logit contributions
o+0.033
ll+0.028
ossible+0.027
om+0.025
rou+0.024
Neg logit contributions
ured-0.051
zed-0.047
zing-0.046
comb-0.046
tered-0.046
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
ured+0.025
tered+0.025
zed+0.024
isy+0.024
nob+0.024
Neg logit contributions
o-0.023
ll-0.023
ucker-0.021
rou-0.021
ossible-0.021
Token She
Token ID1375
Feature activation-9.58e-01

Pos logit contributions
ured+0.042
zed+0.038
zing+0.036
tered+0.036
comb+0.036
Neg logit contributions
o-0.044
ll-0.038
ossible-0.037
om-0.035
rou-0.035
Token loved
Token ID6151
Feature activation+3.05e+00

Pos logit contributions
ured+0.034
comb+0.031
zed+0.030
tered+0.030
zing+0.030
Neg logit contributions
o-0.026
ll-0.021
ossible-0.020
om-0.019
rou-0.019
Token l
Token ID300
Feature activation-1.65e+00

Pos logit contributions
o+0.058
ll+0.054
ossible+0.048
rou+0.047
om+0.045
Neg logit contributions
zed-0.112
ured-0.110
zing-0.109
ze-0.105
tered-0.103
Tokenoll
Token ID692
Feature activation-5.45e+00

Pos logit contributions
ured+0.038
zed+0.034
tered+0.032
zing+0.032
comb+0.032
Neg logit contributions
o-0.063
ll-0.056
ossible-0.055
om-0.053
rou-0.053
Tokenip
Token ID541
Feature activation-5.00e+00

Pos logit contributions
ured+0.184
zed+0.177
zing+0.171
tered+0.169
isy+0.168
Neg logit contributions
o-0.137
ll-0.114
ossible-0.113
rou-0.108
om-0.106
Tokenops
Token ID2840
Feature activation-3.34e+00

Pos logit contributions
ured+0.120
zed+0.104
comb+0.100
tered+0.100
zing+0.099
Neg logit contributions
o-0.187
ll-0.169
ossible-0.161
om-0.156
rou-0.154
Token and
Token ID290
Feature activation-5.12e-01

Pos logit contributions
ured+0.112
isy+0.107
tered+0.106
comb+0.106
zed+0.105
Neg logit contributions
o-0.079
ossible-0.068
rou-0.067
ll-0.067
om-0.063
Token always
Token ID1464
Feature activation+7.63e-01

Pos logit contributions
ured+0.019
ued+0.017
comb+0.017
oned+0.017
zing+0.017
Neg logit contributions
o-0.013
ossible-0.010
ll-0.010
om-0.009
rou-0.009
Token had
Token ID550
Feature activation+3.53e-01

Pos logit contributions
o+0.020
ll+0.016
ossible+0.016
om+0.014
rou+0.014
Neg logit contributions
ured-0.028
zed-0.025
comb-0.024
zing-0.024
tered-0.024
Token kept
Token ID4030
Feature activation+1.60e-01

Pos logit contributions
ured+0.017
zed+0.015
comb+0.015
zing+0.015
tered+0.015
Neg logit contributions
o-0.014
ll-0.012
ossible-0.012
om-0.011
rou-0.011
Token crying
Token ID13774
Feature activation-2.34e+00

Pos logit contributions
o+0.005
ll+0.004
ossible+0.004
rou+0.004
om+0.004
Neg logit contributions
ured-0.005
zed-0.005
zing-0.004
ze-0.004
tered-0.004
Token and
Token ID290
Feature activation+6.07e-01

Pos logit contributions
ured+0.109
zed+0.105
comb+0.102
ued+0.101
zing+0.100
Neg logit contributions
o-0.028
ll-0.022
ossible-0.019
rou-0.019
om-0.017
Token throwing
Token ID9644
Feature activation-7.23e-01

Pos logit contributions
o+0.017
ll+0.015
ossible+0.014
om+0.014
rou+0.014
Neg logit contributions
ured-0.020
zed-0.018
tered-0.018
zing-0.017
comb-0.017
Token a
Token ID257
Feature activation-1.24e+00

Pos logit contributions
ured+0.027
zed+0.026
comb+0.026
isy+0.025
ued+0.025
Neg logit contributions
o-0.016
om-0.013
ossible-0.012
etchup-0.012
 to-0.012
Token tant
Token ID24246
Feature activation-5.41e+00

Pos logit contributions
ured+0.041
zed+0.038
comb+0.036
tered+0.036
zing+0.036
Neg logit contributions
o-0.035
ll-0.030
ossible-0.029
om-0.029
rou-0.027
Tokenrum
Token ID6582
Feature activation-1.74e+00

Pos logit contributions
ured+0.099
comb+0.079
zed+0.078
zing+0.073
tered+0.073
Neg logit contributions
o-0.240
ll-0.208
om-0.204
ossible-0.203
rou-0.196
Token.
Token ID13
Feature activation-8.97e-01

Pos logit contributions
ured+0.073
zed+0.068
isy+0.067
comb+0.066
zing+0.066
Neg logit contributions
o-0.033
ll-0.024
rou-0.022
om-0.022
ossible-0.022
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
ured+0.032
zed+0.030
tered+0.029
zing+0.029
comb+0.029
Neg logit contributions
o-0.022
ll-0.019
ossible-0.018
rou-0.017
om-0.017
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
ured+0.010
zed+0.010
tered+0.010
zing+0.010
oned+0.010
Neg logit contributions
o-0.005
rou-0.004
ll-0.004
ossible-0.004
etchup-0.004
TokenFinally
Token ID11158
Feature activation-1.13e+00

Pos logit contributions
ured+0.039
comb+0.037
est+0.035
 odor+0.035
zed+0.034
Neg logit contributions
o-0.038
ossible-0.033
ll-0.033
om-0.030
rou-0.030
Token't
Token ID470
Feature activation-5.50e-01

Pos logit contributions
ured+0.006
zed+0.005
comb+0.005
zing+0.005
tered+0.005
Neg logit contributions
o-0.006
ll-0.005
ossible-0.005
om-0.005
rou-0.005
Token like
Token ID588
Feature activation+2.69e+00

Pos logit contributions
ured+0.021
zed+0.020
ued+0.019
atively+0.019
comb+0.019
Neg logit contributions
o-0.014
ossible-0.010
ll-0.010
ety-0.010
 to-0.009
Token all
Token ID477
Feature activation+1.88e+00

Pos logit contributions
o+0.080
ll+0.069
ossible+0.068
om+0.063
rou+0.063
Neg logit contributions
ured-0.084
zed-0.077
zing-0.075
tered-0.074
comb-0.072
Token of
Token ID286
Feature activation+9.27e-01

Pos logit contributions
o+0.044
ll+0.035
ossible+0.034
om+0.033
rou+0.033
Neg logit contributions
ured-0.071
zed-0.065
comb-0.065
tered-0.063
zing-0.063
Token the
Token ID262
Feature activation-1.56e+00

Pos logit contributions
o+0.027
ll+0.024
ossible+0.023
om+0.022
rou+0.022
Neg logit contributions
ured-0.029
zed-0.027
tered-0.026
comb-0.026
zing-0.026
Token unnecessary
Token ID13114
Feature activation-5.35e+00

Pos logit contributions
ured+0.058
comb+0.056
zed+0.055
tered+0.052
zing+0.052
Neg logit contributions
o-0.036
ll-0.031
ossible-0.029
om-0.028
rou-0.027
Token extravag
Token ID36827
Feature activation+5.97e-01

Pos logit contributions
ured+0.181
comb+0.166
zed+0.164
tered+0.159
isy+0.158
Neg logit contributions
o-0.148
ll-0.121
om-0.120
ossible-0.116
rou-0.113
Tokenance
Token ID590
Feature activation+6.57e-01

Pos logit contributions
o+0.023
ossible+0.021
ll+0.021
om+0.020
etchup+0.020
Neg logit contributions
ured-0.012
zed-0.010
comb-0.010
tered-0.009
ered-0.009
Token.
Token ID13
Feature activation-8.50e-01

Pos logit contributions
o+0.015
ll+0.012
ossible+0.011
om+0.011
rou+0.011
Neg logit contributions
ured-0.025
zed-0.023
comb-0.023
tered-0.022
zing-0.022
Token
Token ID198
Feature activation-1.83e-01

Pos logit contributions
ured+0.031
zed+0.030
tered+0.030
oned+0.029
zing+0.029
Neg logit contributions
o-0.018
ll-0.016
rou-0.015
etchup-0.013
om-0.013
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
zed+0.007
ured+0.007
zing+0.007
isy+0.007
tered+0.006
Neg logit contributions
o-0.004
rou-0.003
ll-0.003
etchup-0.003
ossible-0.003
Token and
Token ID290
Feature activation-3.70e-01

Pos logit contributions
ured+0.020
zed+0.018
tered+0.018
zing+0.018
est+0.018
Neg logit contributions
o-0.015
ll-0.012
ossible-0.012
rou-0.012
etchup-0.012
Token so
Token ID523
Feature activation-2.01e+00

Pos logit contributions
ured+0.013
comb+0.012
oned+0.012
zed+0.012
growth+0.012
Neg logit contributions
o-0.010
ll-0.008
ossible-0.008
om-0.007
rou-0.007
Token she
Token ID673
Feature activation-8.33e-01

Pos logit contributions
ured+0.063
comb+0.054
zed+0.053
oned+0.053
tered+0.052
Neg logit contributions
o-0.063
ll-0.054
ossible-0.054
om-0.051
rou-0.051
Token threw
Token ID9617
Feature activation-8.58e-01

Pos logit contributions
ured+0.027
zed+0.025
comb+0.024
zing+0.024
tered+0.024
Neg logit contributions
o-0.024
ll-0.020
ossible-0.019
om-0.018
rou-0.018
Token a
Token ID257
Feature activation-9.51e-01

Pos logit contributions
ured+0.030
zed+0.029
comb+0.028
zing+0.027
isy+0.027
Neg logit contributions
o-0.022
ossible-0.020
om-0.017
 dad-0.017
 to-0.016
Token tant
Token ID24246
Feature activation-5.35e+00

Pos logit contributions
ured+0.027
zing+0.025
 trigger+0.025
isy+0.025
tered+0.024
Neg logit contributions
o-0.029
ll-0.026
ossible-0.024
rou-0.023
ucker-0.021
Tokenrum
Token ID6582
Feature activation-2.24e+00

Pos logit contributions
ured+0.094
comb+0.077
zed+0.075
tered+0.072
zing+0.070
Neg logit contributions
o-0.238
ll-0.209
om-0.202
ossible-0.200
rou-0.195
Token when
Token ID618
Feature activation+1.14e-01

Pos logit contributions
ured+0.090
zed+0.083
isy+0.082
zing+0.081
comb+0.081
Neg logit contributions
o-0.047
ll-0.036
ossible-0.034
om-0.033
rou-0.033
Token her
Token ID607
Feature activation+6.01e-01

Pos logit contributions
o+0.004
ll+0.003
ossible+0.003
om+0.003
rou+0.003
Neg logit contributions
ured-0.003
zed-0.003
comb-0.003
tered-0.003
isy-0.003
Token parents
Token ID3397
Feature activation+2.57e+00

Pos logit contributions
o+0.014
ll+0.012
rou+0.011
ossible+0.011
om+0.010
Neg logit contributions
ured-0.022
zed-0.021
zing-0.020
ze-0.020
comb-0.020
Token said
Token ID531
Feature activation-3.01e-01

Pos logit contributions
o+0.072
ossible+0.063
ll+0.062
rou+0.061
om+0.058
Neg logit contributions
ured-0.080
zed-0.073
tered-0.073
zing-0.071
ered-0.069
Token to
Token ID284
Feature activation+3.18e-01

Pos logit contributions
ured+0.006
comb+0.006
zed+0.006
tered+0.006
isy+0.006
Neg logit contributions
o-0.001
ll-0.001
ossible-0.001
rou-0.000
om-0.000
Token put
Token ID1234
Feature activation+3.65e-01

Pos logit contributions
o+0.009
ossible+0.008
ll+0.008
om+0.007
rou+0.007
Neg logit contributions
ured-0.010
zed-0.009
zing-0.009
tered-0.009
comb-0.009
Token on
Token ID319
Feature activation-2.74e+00

Pos logit contributions
o+0.009
ossible+0.007
ll+0.007
om+0.007
etchup+0.007
Neg logit contributions
ured-0.013
zed-0.012
comb-0.012
zing-0.012
tered-0.012
Token her
Token ID607
Feature activation-1.32e+00

Pos logit contributions
ured+0.093
comb+0.084
tered+0.083
zed+0.083
isy+0.082
Neg logit contributions
o-0.073
ossible-0.062
ll-0.059
om-0.058
rou-0.056
Token fan
Token ID4336
Feature activation-3.44e-01

Pos logit contributions
ured+0.047
zed+0.042
tered+0.042
comb+0.042
zing+0.041
Neg logit contributions
o-0.034
ll-0.028
ossible-0.028
om-0.026
rou-0.025
Tokenci
Token ID979
Feature activation-5.33e+00

Pos logit contributions
ured+0.013
zed+0.012
tered+0.011
zing+0.011
comb+0.011
Neg logit contributions
o-0.008
ll-0.007
ossible-0.007
om-0.006
rou-0.006
Tokenest
Token ID395
Feature activation-1.71e+00

Pos logit contributions
ured+0.071
comb+0.056
tered+0.051
zed+0.050
zing+0.049
Neg logit contributions
o-0.258
ossible-0.237
ll-0.228
om-0.222
rou-0.219
Token sunglasses
Token ID36251
Feature activation-2.37e+00

Pos logit contributions
ured+0.061
tered+0.056
zed+0.056
comb+0.055
ued+0.053
Neg logit contributions
o-0.042
ossible-0.035
om-0.034
ll-0.032
 dad-0.032
Token.
Token ID13
Feature activation-6.06e-01

Pos logit contributions
ured+0.088
zed+0.082
zing+0.080
tered+0.078
comb+0.078
Neg logit contributions
o-0.056
ll-0.048
ossible-0.044
om-0.042
rou-0.041
Token
Token ID198
Feature activation+1.97e-01

Pos logit contributions
ured+0.023
zed+0.022
tered+0.021
zing+0.021
comb+0.020
Neg logit contributions
o-0.013
ll-0.011
rou-0.011
ossible-0.010
om-0.010
Token
Token ID198
Feature activation-9.57e-01

Pos logit contributions
o+0.004
rou+0.003
ll+0.003
etchup+0.003
ossible+0.003
Neg logit contributions
zed-0.007
ured-0.007
zing-0.007
tered-0.007
isy-0.007
Token she
Token ID673
Feature activation+2.62e-01

Pos logit contributions
ured+0.011
zed+0.010
comb+0.010
zing+0.010
ued+0.010
Neg logit contributions
o-0.008
ll-0.007
ossible-0.007
om-0.006
rou-0.006
Token always
Token ID1464
Feature activation+1.28e+00

Pos logit contributions
o+0.008
ossible+0.007
ll+0.007
om+0.006
rou+0.006
Neg logit contributions
ured-0.008
zed-0.008
zing-0.007
tered-0.007
comb-0.007
Token wore
Token ID12408
Feature activation-2.40e-01

Pos logit contributions
o+0.039
ll+0.035
ossible+0.034
om+0.033
rou+0.033
Neg logit contributions
ured-0.037
zed-0.034
zing-0.033
tered-0.033
ered-0.032
Token her
Token ID607
Feature activation-1.90e+00

Pos logit contributions
ured+0.008
comb+0.007
zed+0.007
isy+0.007
tered+0.007
Neg logit contributions
o-0.006
ossible-0.006
om-0.005
ll-0.005
 dad-0.005
Token fan
Token ID4336
Feature activation-8.08e-01

Pos logit contributions
ured+0.060
zed+0.055
zing+0.053
comb+0.052
tered+0.052
Neg logit contributions
o-0.055
ll-0.047
rou-0.044
ossible-0.043
om-0.042
Tokenci
Token ID979
Feature activation-5.29e+00

Pos logit contributions
ured+0.029
zed+0.027
tered+0.027
zing+0.027
comb+0.026
Neg logit contributions
o-0.020
ll-0.016
ossible-0.016
om-0.015
rou-0.015
Tokenest
Token ID395
Feature activation-1.55e+00

Pos logit contributions
ured+0.055
comb+0.040
zed+0.038
tered+0.035
zing+0.034
Neg logit contributions
o-0.270
ossible-0.246
ll-0.245
rou-0.236
om-0.236
Token dress
Token ID6576
Feature activation-1.23e+00

Pos logit contributions
ured+0.053
zed+0.049
tered+0.048
comb+0.048
zing+0.047
Neg logit contributions
o-0.041
ll-0.034
ossible-0.034
om-0.032
rou-0.031
Token to
Token ID284
Feature activation-6.38e-01

Pos logit contributions
ured+0.047
zed+0.044
tered+0.042
zing+0.042
comb+0.042
Neg logit contributions
o-0.028
ll-0.023
ossible-0.021
rou-0.020
om-0.020
Token the
Token ID262
Feature activation+2.93e-01

Pos logit contributions
ured+0.020
zed+0.018
comb+0.018
tered+0.018
zing+0.018
Neg logit contributions
o-0.018
ll-0.016
ossible-0.016
rou-0.014
om-0.014
Token museum
Token ID13257
Feature activation+4.09e-01

Pos logit contributions
o+0.008
ll+0.006
ossible+0.006
om+0.006
rou+0.006
Neg logit contributions
ured-0.010
zed-0.009
tered-0.009
zing-0.009
comb-0.009
Token Lily
Token ID20037
Feature activation-6.72e-01

Pos logit contributions
o+0.040
ll+0.033
ossible+0.032
om+0.029
rou+0.028
Neg logit contributions
ured-0.064
zed-0.059
zing-0.058
comb-0.057
tered-0.057
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
ured+0.019
tered+0.019
zed+0.018
nob+0.018
isy+0.018
Neg logit contributions
ll-0.017
o-0.017
rou-0.016
ucker-0.016
ossible-0.016
Token She
Token ID1375
Feature activation-8.10e-01

Pos logit contributions
ured+0.043
zed+0.039
zing+0.037
comb+0.037
tered+0.037
Neg logit contributions
o-0.046
ll-0.040
ossible-0.039
om-0.037
rou-0.036
Token loved
Token ID6151
Feature activation+3.45e+00

Pos logit contributions
ured+0.028
zed+0.026
comb+0.025
tered+0.025
zing+0.025
Neg logit contributions
o-0.022
ll-0.018
ossible-0.017
om-0.016
rou-0.016
Token l
Token ID300
Feature activation-1.49e+00

Pos logit contributions
o+0.067
ll+0.062
rou+0.055
ossible+0.055
om+0.053
Neg logit contributions
zed-0.123
ured-0.122
zing-0.120
ze-0.116
tered-0.114
Tokenoll
Token ID692
Feature activation-5.26e+00

Pos logit contributions
ured+0.035
zed+0.032
tered+0.030
zing+0.030
comb+0.030
Neg logit contributions
o-0.056
ll-0.050
ossible-0.049
om-0.047
rou-0.047
Tokenip
Token ID541
Feature activation-4.79e+00

Pos logit contributions
ured+0.180
zed+0.173
zing+0.168
tered+0.164
isy+0.164
Neg logit contributions
o-0.131
ll-0.107
ossible-0.107
rou-0.102
om-0.100
Tokenops
Token ID2840
Feature activation-3.17e+00

Pos logit contributions
ured+0.113
zed+0.098
tered+0.095
comb+0.095
zing+0.093
Neg logit contributions
o-0.181
ll-0.163
ossible-0.156
om-0.151
rou-0.150
Token and
Token ID290
Feature activation-4.92e-01

Pos logit contributions
ured+0.108
isy+0.103
comb+0.103
tered+0.102
zed+0.101
Neg logit contributions
o-0.073
ossible-0.063
ll-0.062
rou-0.062
om-0.058
Token always
Token ID1464
Feature activation+8.89e-01

Pos logit contributions
ured+0.019
ued+0.017
comb+0.017
oned+0.016
zing+0.016
Neg logit contributions
o-0.012
ossible-0.010
ll-0.010
om-0.009
rou-0.008
Token had
Token ID550
Feature activation+3.87e-01

Pos logit contributions
o+0.023
ll+0.019
ossible+0.019
rou+0.017
om+0.017
Neg logit contributions
ured-0.032
zed-0.029
comb-0.028
zing-0.028
tered-0.028
Token the
Token ID262
Feature activation+1.13e-01

Pos logit contributions
ured+0.023
comb+0.020
ered+0.020
 trigger+0.020
ued+0.020
Neg logit contributions
o-0.009
om-0.006
 to-0.006
ll-0.006
 upon-0.006
Token Princess
Token ID8449
Feature activation-1.51e+00

Pos logit contributions
o+0.004
ll+0.003
ossible+0.003
rou+0.003
 dad+0.003
Neg logit contributions
ured-0.003
zed-0.003
comb-0.003
oned-0.003
ered-0.002
Token would
Token ID561
Feature activation-9.88e-01

Pos logit contributions
ured+0.057
zed+0.052
oned+0.051
ued+0.050
zing+0.050
Neg logit contributions
o-0.036
ll-0.029
ossible-0.027
om-0.025
rou-0.024
Token apply
Token ID4174
Feature activation+2.29e-01

Pos logit contributions
ured+0.031
tered+0.028
zed+0.027
comb+0.027
ger+0.027
Neg logit contributions
o-0.027
ll-0.024
ossible-0.024
rou-0.022
om-0.022
Token tooth
Token ID16162
Feature activation-2.26e+00

Pos logit contributions
o+0.007
ossible+0.006
ll+0.006
om+0.006
rou+0.005
Neg logit contributions
ured-0.007
zed-0.007
comb-0.006
tered-0.006
zing-0.006
Tokenpaste
Token ID34274
Feature activation-5.25e+00

Pos logit contributions
ured+0.065
zed+0.059
tered+0.057
ued+0.057
oned+0.056
Neg logit contributions
o-0.073
ll-0.062
om-0.060
ossible-0.059
rou-0.056
Token to
Token ID284
Feature activation-2.12e+00

Pos logit contributions
ured+0.204
zed+0.193
zing+0.186
comb+0.184
tered+0.183
Neg logit contributions
o-0.110
ll-0.095
ossible-0.085
om-0.080
rou-0.075
Token her
Token ID607
Feature activation-1.33e+00

Pos logit contributions
ured+0.070
ued+0.065
ger+0.062
comb+0.062
zed+0.061
Neg logit contributions
o-0.051
ossible-0.046
 dad-0.045
ll-0.043
 grand-0.043
Token tooth
Token ID16162
Feature activation-4.71e-01

Pos logit contributions
ured+0.043
zed+0.039
tered+0.038
zing+0.038
comb+0.038
Neg logit contributions
o-0.038
ll-0.033
ossible-0.032
om-0.030
rou-0.030
Tokenbrush
Token ID32680
Feature activation-6.44e-01

Pos logit contributions
ured+0.013
zed+0.011
tered+0.011
ued+0.011
oned+0.011
Neg logit contributions
o-0.016
ll-0.014
ossible-0.013
om-0.013
rou-0.013
Token and
Token ID290
Feature activation+1.19e-01

Pos logit contributions
ured+0.024
zed+0.022
tered+0.022
zing+0.022
comb+0.022
Neg logit contributions
o-0.015
ll-0.012
ossible-0.012
om-0.011
rou-0.011
Token would
Token ID561
Feature activation-4.81e-01

Pos logit contributions
ured+0.024
zed+0.022
tered+0.022
zing+0.022
comb+0.022
Neg logit contributions
o-0.018
ll-0.015
ossible-0.014
om-0.013
rou-0.013
Token take
Token ID1011
Feature activation-2.61e+00

Pos logit contributions
ured+0.016
zed+0.014
isy+0.014
zing+0.014
tered+0.013
Neg logit contributions
o-0.013
ossible-0.011
ll-0.011
om-0.010
rou-0.010
Token out
Token ID503
Feature activation-2.19e-01

Pos logit contributions
ured+0.094
comb+0.088
zed+0.085
isy+0.085
tered+0.082
Neg logit contributions
o-0.065
ossible-0.052
om-0.050
ll-0.050
rou-0.045
Token a
Token ID257
Feature activation+2.22e-01

Pos logit contributions
ured+0.007
zed+0.006
zing+0.006
tered+0.006
ered+0.006
Neg logit contributions
o-0.006
ll-0.006
rou-0.005
ossible-0.005
om-0.005
Token j
Token ID474
Feature activation-1.20e+00

Pos logit contributions
o+0.006
ll+0.005
ossible+0.004
om+0.004
rou+0.004
Neg logit contributions
ured-0.008
zed-0.007
tered-0.007
zing-0.007
comb-0.007
Tokenigsaw
Token ID45636
Feature activation-5.24e+00

Pos logit contributions
ured+0.037
zed+0.034
zing+0.034
isy+0.034
ger+0.033
Neg logit contributions
o-0.035
ll-0.029
ossible-0.027
rou-0.027
om-0.026
Token puzzle
Token ID15027
Feature activation-2.80e+00

Pos logit contributions
ured+0.165
zed+0.152
tered+0.150
comb+0.149
zing+0.149
Neg logit contributions
o-0.149
ll-0.127
ossible-0.123
rou-0.120
om-0.118
Token and
Token ID290
Feature activation-7.40e-02

Pos logit contributions
ured+0.104
comb+0.096
tered+0.095
zed+0.095
zing+0.093
Neg logit contributions
o-0.064
ll-0.054
ossible-0.051
rou-0.050
om-0.048
Token try
Token ID1949
Feature activation+6.60e-02

Pos logit contributions
ured+0.002
zed+0.002
isy+0.002
zing+0.002
tered+0.002
Neg logit contributions
o-0.002
ossible-0.002
ll-0.002
om-0.001
 dad-0.001
Token to
Token ID284
Feature activation-8.56e-01

Pos logit contributions
o+0.001
ossible+0.001
ll+0.001
rou+0.001
etchup+0.001
Neg logit contributions
ured-0.003
zed-0.002
zing-0.002
tered-0.002
isy-0.002
Token solve
Token ID8494
Feature activation-9.92e-01

Pos logit contributions
ured+0.025
zed+0.024
zing+0.023
ued+0.021
comb+0.021
Neg logit contributions
o-0.025
ll-0.023
ossible-0.022
om-0.020
rou-0.020
Token Lily
Token ID20037
Feature activation-7.82e-01

Pos logit contributions
o+0.041
ll+0.033
ossible+0.032
om+0.029
rou+0.028
Neg logit contributions
ured-0.071
zed-0.066
zing-0.064
comb-0.064
ued-0.063
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
ured+0.022
tered+0.021
zed+0.021
zing+0.020
nob+0.020
Neg logit contributions
ll-0.021
o-0.020
ucker-0.019
rou-0.019
ossible-0.019
Token She
Token ID1375
Feature activation-7.27e-01

Pos logit contributions
ured+0.040
zed+0.036
zing+0.035
tered+0.035
comb+0.034
Neg logit contributions
o-0.045
ll-0.040
ossible-0.039
om-0.037
rou-0.037
Token loved
Token ID6151
Feature activation+3.55e+00

Pos logit contributions
ured+0.026
comb+0.023
zed+0.023
tered+0.023
zing+0.023
Neg logit contributions
o-0.019
ll-0.016
ossible-0.015
om-0.014
rou-0.014
Token l
Token ID300
Feature activation-1.55e+00

Pos logit contributions
o+0.070
ll+0.065
ossible+0.058
rou+0.057
om+0.056
Neg logit contributions
zed-0.127
ured-0.124
zing-0.123
ze-0.119
tered-0.116
Tokenoll
Token ID692
Feature activation-5.22e+00

Pos logit contributions
ured+0.038
zed+0.034
tered+0.033
zing+0.033
comb+0.032
Neg logit contributions
o-0.058
ll-0.051
ossible-0.050
om-0.048
rou-0.048
Tokenip
Token ID541
Feature activation-4.66e+00

Pos logit contributions
ured+0.182
zed+0.175
zing+0.171
tered+0.167
isy+0.166
Neg logit contributions
o-0.126
ll-0.104
ossible-0.103
rou-0.097
om-0.095
Tokenops
Token ID2840
Feature activation-3.08e+00

Pos logit contributions
ured+0.112
zed+0.096
comb+0.094
tered+0.093
zing+0.092
Neg logit contributions
o-0.175
ll-0.158
ossible-0.150
om-0.146
rou-0.145
Token and
Token ID290
Feature activation-5.32e-01

Pos logit contributions
ured+0.111
comb+0.104
zed+0.103
isy+0.103
tered+0.103
Neg logit contributions
o-0.069
ossible-0.059
ll-0.058
rou-0.054
om-0.053
Token always
Token ID1464
Feature activation+9.06e-01

Pos logit contributions
ured+0.021
comb+0.019
ued+0.019
oned+0.019
zing+0.018
Neg logit contributions
o-0.012
ossible-0.010
ll-0.009
om-0.009
rou-0.008
Token had
Token ID550
Feature activation+3.76e-01

Pos logit contributions
o+0.024
ll+0.020
ossible+0.019
rou+0.018
om+0.018
Neg logit contributions
ured-0.032
zed-0.029
zing-0.029
comb-0.029
tered-0.028
Token her
Token ID607
Feature activation-1.12e+00

Pos logit contributions
comb+0.030
ured+0.030
isy+0.029
zed+0.029
ued+0.027
Neg logit contributions
o-0.023
om-0.018
ossible-0.018
 to-0.018
 dad-0.017
Token feet
Token ID3625
Feature activation-7.51e-01

Pos logit contributions
ured+0.034
zed+0.030
zing+0.030
comb+0.030
tered+0.030
Neg logit contributions
o-0.036
ll-0.031
ossible-0.029
rou-0.029
etchup-0.027
Token and
Token ID290
Feature activation+4.60e-01

Pos logit contributions
ured+0.032
zed+0.030
tered+0.029
zing+0.029
comb+0.029
Neg logit contributions
o-0.014
ll-0.011
ossible-0.010
rou-0.009
om-0.009
Token threw
Token ID9617
Feature activation-1.84e-01

Pos logit contributions
o+0.012
ll+0.009
ossible+0.009
om+0.009
 dad+0.008
Neg logit contributions
ured-0.016
comb-0.015
zed-0.015
isy-0.015
zing-0.015
Token a
Token ID257
Feature activation-2.97e-01

Pos logit contributions
ured+0.007
zed+0.006
comb+0.006
zing+0.006
isy+0.006
Neg logit contributions
o-0.005
om-0.004
ossible-0.004
etchup-0.003
 dad-0.003
Token tant
Token ID24246
Feature activation-5.17e+00

Pos logit contributions
ured+0.008
zed+0.008
tered+0.007
zing+0.007
comb+0.007
Neg logit contributions
o-0.010
ll-0.008
ossible-0.008
rou-0.008
om-0.008
Tokenrum
Token ID6582
Feature activation-1.52e+00

Pos logit contributions
ured+0.092
comb+0.078
zed+0.077
zing+0.071
tered+0.071
Neg logit contributions
o-0.229
ll-0.197
om-0.195
ossible-0.190
rou-0.185
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
ured+0.063
zed+0.060
isy+0.058
comb+0.058
zing+0.058
Neg logit contributions
o-0.029
ll-0.021
om-0.020
rou-0.019
ossible-0.019
Token She
Token ID1375
Feature activation+3.41e-02

Pos logit contributions
ured+0.026
zed+0.023
tered+0.022
zing+0.022
comb+0.021
Neg logit contributions
o-0.036
ll-0.032
ossible-0.031
rou-0.030
om-0.029
Token grabbed
Token ID13176
Feature activation-1.82e+00

Pos logit contributions
o+0.001
ll+0.001
ossible+0.001
om+0.001
rou+0.001
Neg logit contributions
ured-0.001
zed-0.001
comb-0.001
zing-0.001
tered-0.001
Token the
Token ID262
Feature activation+1.21e-01

Pos logit contributions
ured+0.067
comb+0.065
zed+0.064
isy+0.062
zing+0.062
Neg logit contributions
o-0.042
om-0.034
ossible-0.033
etchup-0.031
 dad-0.030
Token Sara
Token ID24799
Feature activation-7.67e-01

Pos logit contributions
ured+0.007
zed+0.007
tered+0.007
zing+0.007
comb+0.007
Neg logit contributions
o-0.007
ll-0.006
ossible-0.006
rou-0.006
om-0.006
Token said
Token ID531
Feature activation-1.38e+00

Pos logit contributions
ured+0.025
zed+0.023
tered+0.022
zing+0.022
comb+0.022
Neg logit contributions
o-0.020
ossible-0.018
ll-0.018
rou-0.017
om-0.016
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
ured+0.057
comb+0.053
zed+0.053
nob+0.052
growth+0.051
Neg logit contributions
o-0.026
ll-0.019
ossible-0.019
om-0.019
 to-0.017
Token "
Token ID366
Feature activation-1.31e+00

Pos logit contributions
o+0.027
ll+0.021
ossible+0.020
om+0.020
rou+0.019
Neg logit contributions
ured-0.042
zed-0.039
zing-0.038
comb-0.038
tered-0.038
TokenC
Token ID34
Feature activation-1.26e+00

Pos logit contributions
ured+0.043
zed+0.041
zing+0.039
ered+0.039
ued+0.038
Neg logit contributions
o-0.039
ll-0.031
ossible-0.031
om-0.028
ety-0.028
Tokenrick
Token ID5557
Feature activation-5.17e+00

Pos logit contributions
ured+0.042
isy+0.038
zed+0.038
zing+0.038
comb+0.037
Neg logit contributions
o-0.032
ossible-0.028
ll-0.027
om-0.026
rou-0.025
Tokenet
Token ID316
Feature activation-1.05e+00

Pos logit contributions
ured+0.157
comb+0.146
growth+0.137
oned+0.135
ued+0.134
Neg logit contributions
o-0.164
om-0.134
ll-0.133
ossible-0.128
rou-0.126
Token,
Token ID11
Feature activation-8.79e-02

Pos logit contributions
ured+0.038
zed+0.035
tered+0.034
comb+0.034
zing+0.034
Neg logit contributions
o-0.027
ll-0.022
ossible-0.021
om-0.020
rou-0.020
Token is
Token ID318
Feature activation+5.20e-01

Pos logit contributions
ured+0.003
zed+0.003
tered+0.003
zing+0.003
ued+0.003
Neg logit contributions
o-0.002
ll-0.002
ossible-0.002
om-0.002
rou-0.002
Token this
Token ID428
Feature activation+4.32e-01

Pos logit contributions
o+0.012
ll+0.010
ossible+0.009
rou+0.009
om+0.009
Neg logit contributions
ured-0.020
zed-0.018
tered-0.018
comb-0.018
zing-0.018
Token your
Token ID534
Feature activation-5.05e-02

Pos logit contributions
o+0.012
ll+0.010
om+0.009
ossible+0.009
rou+0.009
Neg logit contributions
ured-0.015
tered-0.014
zed-0.014
zing-0.013
oned-0.013
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
ured+0.094
zed+0.088
tered+0.087
zing+0.086
comb+0.085
Neg logit contributions
o-0.042
ll-0.034
ossible-0.032
om-0.029
rou-0.029
Token started
Token ID2067
Feature activation+2.66e+00

Pos logit contributions
o+0.008
ll+0.007
ossible+0.007
om+0.007
rou+0.007
Neg logit contributions
ured-0.009
comb-0.008
zed-0.008
zing-0.008
tered-0.008
Token to
Token ID284
Feature activation-3.58e-01

Pos logit contributions
o+0.041
ll+0.033
ossible+0.030
rou+0.028
om+0.026
Neg logit contributions
ured-0.117
zed-0.113
zing-0.111
ze-0.107
tered-0.107
Token throw
Token ID3714
Feature activation-7.81e-01

Pos logit contributions
ured+0.010
tered+0.009
zed+0.009
comb+0.008
isy+0.008
Neg logit contributions
o-0.012
ll-0.010
ossible-0.010
om-0.010
rou-0.009
Token a
Token ID257
Feature activation-7.66e-01

Pos logit contributions
ured+0.030
zed+0.028
comb+0.027
isy+0.027
ued+0.027
Neg logit contributions
o-0.019
ossible-0.015
om-0.015
etchup-0.014
 to-0.012
Token tant
Token ID24246
Feature activation-5.16e+00

Pos logit contributions
ured+0.023
zed+0.021
zing+0.021
tered+0.021
isy+0.021
Neg logit contributions
o-0.023
ll-0.020
ossible-0.019
rou-0.018
om-0.018
Tokenrum
Token ID6582
Feature activation-2.08e+00

Pos logit contributions
ured+0.088
zed+0.071
comb+0.071
tered+0.067
zing+0.065
Neg logit contributions
o-0.232
ll-0.205
ossible-0.198
om-0.198
rou-0.192
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
ured+0.089
zed+0.083
isy+0.081
comb+0.081
zing+0.081
Neg logit contributions
o-0.037
ll-0.027
om-0.025
rou-0.024
ossible-0.024
Token Suddenly
Token ID24975
Feature activation-3.95e-01

Pos logit contributions
ured+0.036
zed+0.032
comb+0.031
zing+0.031
tered+0.031
Neg logit contributions
o-0.050
ll-0.045
ossible-0.044
om-0.042
rou-0.042
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
ured+0.019
zed+0.017
isy+0.016
ued+0.016
comb+0.016
Neg logit contributions
o-0.006
ossible-0.004
om-0.004
 dad-0.004
 to-0.004
Token she
Token ID673
Feature activation+2.74e-01

Pos logit contributions
ured+0.067
isy+0.059
zed+0.059
ued+0.059
est+0.058
Neg logit contributions
o-0.036
 dad-0.034
 grandma-0.032
ossible-0.031
 upon-0.031
TokenOne
Token ID3198
Feature activation+1.72e+00

Pos logit contributions
zed+0.004
ured+0.004
zing+0.004
tered+0.004
isy+0.004
Neg logit contributions
o-0.004
ll-0.004
rou-0.004
ucker-0.004
etchup-0.004
Token day
Token ID1110
Feature activation-2.07e+00

Pos logit contributions
o+0.051
ll+0.048
rou+0.044
ucker+0.043
ossible+0.043
Neg logit contributions
ured-0.047
zed-0.045
tered-0.045
zing-0.044
isy-0.044
Token,
Token ID11
Feature activation-8.83e-01

Pos logit contributions
ured+0.066
zed+0.062
tered+0.060
zing+0.060
comb+0.059
Neg logit contributions
o-0.056
ll-0.050
ossible-0.048
rou-0.047
om-0.044
Token Matt
Token ID4705
Feature activation-1.51e+00

Pos logit contributions
ured+0.029
tered+0.027
zed+0.027
zing+0.027
ered+0.026
Neg logit contributions
o-0.022
ll-0.020
rou-0.018
ossible-0.018
ucker-0.017
Token and
Token ID290
Feature activation+1.31e-01

Pos logit contributions
ured+0.053
zed+0.050
tered+0.049
zing+0.049
isy+0.049
Neg logit contributions
o-0.032
ossible-0.028
ll-0.028
rou-0.028
etchup-0.026
Token Lynn
Token ID24868
Feature activation-5.15e+00

Pos logit contributions
o+0.004
ll+0.003
ossible+0.003
rou+0.003
om+0.003
Neg logit contributions
ured-0.004
zed-0.004
zing-0.004
tered-0.003
comb-0.003
Token wanted
Token ID2227
Feature activation+6.05e-01

Pos logit contributions
ured+0.179
zed+0.169
zing+0.165
comb+0.162
isy+0.161
Neg logit contributions
o-0.131
ll-0.112
ossible-0.110
om-0.094
rou-0.092
Token to
Token ID284
Feature activation-4.97e-01

Pos logit contributions
o+0.012
ll+0.011
rou+0.010
ucker+0.010
ossible+0.010
Neg logit contributions
ured-0.023
zed-0.022
ered-0.021
zing-0.021
tered-0.021
Token discuss
Token ID2112
Feature activation-6.76e-01

Pos logit contributions
ured+0.017
zed+0.016
zing+0.016
tered+0.016
comb+0.016
Neg logit contributions
o-0.013
ossible-0.011
ll-0.011
om-0.010
rou-0.009
Token something
Token ID1223
Feature activation-2.37e+00

Pos logit contributions
ured+0.022
zed+0.021
tered+0.020
zing+0.020
comb+0.020
Neg logit contributions
o-0.019
ll-0.016
ossible-0.015
om-0.014
rou-0.014
Token special
Token ID2041
Feature activation-4.79e-01

Pos logit contributions
ured+0.088
zed+0.081
tered+0.079
zing+0.079
comb+0.078
Neg logit contributions
o-0.056
ll-0.047
ossible-0.046
om-0.042
rou-0.042
Token a
Token ID257
Feature activation+4.94e-01

Pos logit contributions
o+0.017
ll+0.016
ossible+0.014
rou+0.014
om+0.013
Neg logit contributions
ured-0.018
zed-0.016
tered-0.016
zing-0.016
comb-0.015
Token big
Token ID1263
Feature activation-5.17e-01

Pos logit contributions
o+0.014
ll+0.013
ossible+0.012
rou+0.011
om+0.011
Neg logit contributions
ured-0.015
isy-0.014
zing-0.014
comb-0.014
ger-0.014
Token gear
Token ID7733
Feature activation-8.11e-01

Pos logit contributions
ured+0.016
zed+0.015
comb+0.015
zing+0.015
tered+0.015
Neg logit contributions
o-0.015
ll-0.013
ossible-0.012
rou-0.012
om-0.011
Token on
Token ID319
Feature activation-1.60e+00

Pos logit contributions
ured+0.027
zed+0.024
tered+0.024
comb+0.024
zing+0.024
Neg logit contributions
o-0.022
ll-0.019
ossible-0.019
rou-0.018
om-0.017
Token the
Token ID262
Feature activation-7.44e-01

Pos logit contributions
ured+0.057
zed+0.053
tered+0.052
zing+0.052
comb+0.051
Neg logit contributions
o-0.040
ll-0.034
ossible-0.032
rou-0.030
om-0.030
Token grass
Token ID8701
Feature activation+3.25e+00

Pos logit contributions
ured+0.019
zed+0.018
zing+0.017
tered+0.017
comb+0.017
Neg logit contributions
o-0.026
ll-0.023
ossible-0.022
om-0.021
rou-0.021
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
o+0.085
ll+0.081
ossible+0.079
rou+0.077
aj+0.072
Neg logit contributions
ured-0.099
comb-0.094
tered-0.092
ger-0.089
est-0.088
Token It
Token ID632
Feature activation+4.45e-01

Pos logit contributions
ured+0.010
tered+0.009
zed+0.008
oned+0.008
ger+0.008
Neg logit contributions
o-0.010
ll-0.010
rou-0.010
ossible-0.008
om-0.008
Token is
Token ID318
Feature activation+9.67e-03

Pos logit contributions
o+0.010
ll+0.009
ossible+0.008
rou+0.008
om+0.007
Neg logit contributions
ured-0.016
zed-0.015
tered-0.015
zing-0.015
comb-0.015
Token shiny
Token ID22441
Feature activation-2.90e-01

Pos logit contributions
o+0.000
ll+0.000
ossible+0.000
om+0.000
rou+0.000
Neg logit contributions
ured-0.000
zed-0.000
tered-0.000
zing-0.000
comb-0.000
Token and
Token ID290
Feature activation+9.22e-01

Pos logit contributions
ured+0.012
zed+0.012
tered+0.011
zing+0.011
comb+0.011
Neg logit contributions
o-0.005
ll-0.004
ossible-0.004
om-0.003
rou-0.003
Token!"
Token ID2474
Feature activation-9.85e-01

Pos logit contributions
o+0.017
ll+0.015
ossible+0.014
rou+0.013
om+0.013
Neg logit contributions
ured-0.026
zed-0.024
zing-0.023
tered-0.023
comb-0.023
Token
Token ID220
Feature activation+2.00e+00

Pos logit contributions
ured+0.028
zed+0.026
tered+0.025
zing+0.025
ze+0.023
Neg logit contributions
o-0.032
ll-0.029
rou-0.028
ossible-0.027
etchup-0.027
Token
Token ID198
Feature activation-2.83e-01

Pos logit contributions
o+0.043
rou+0.037
ll+0.035
etchup+0.035
ossible+0.032
Neg logit contributions
zed-0.072
ured-0.071
tered-0.069
zing-0.068
ued-0.067
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
zed+0.010
ured+0.010
zing+0.010
tered+0.010
ued+0.010
Neg logit contributions
o-0.006
rou-0.005
etchup-0.005
ll-0.005
ucker-0.005
TokenHis
Token ID6653
Feature activation-3.46e-01

Pos logit contributions
ured+0.030
zed+0.028
tered+0.027
zing+0.027
comb+0.026
Neg logit contributions
o-0.040
ll-0.036
ossible-0.035
rou-0.034
om-0.033
Token mom
Token ID1995
Feature activation+3.28e+00

Pos logit contributions
ured+0.011
zed+0.011
zing+0.011
tered+0.010
ze+0.010
Neg logit contributions
o-0.009
ll-0.008
rou-0.007
etchup-0.007
ossible-0.007
Tokenmy
Token ID1820
Feature activation+2.59e+00

Pos logit contributions
o+0.045
ll+0.039
ossible+0.039
etchup+0.038
om+0.036
Neg logit contributions
ured-0.136
isy-0.131
tered-0.130
ered-0.130
zing-0.128
Token said
Token ID531
Feature activation+3.24e-01

Pos logit contributions
o+0.064
ll+0.055
ossible+0.053
rou+0.050
om+0.050
Neg logit contributions
ured-0.091
zed-0.085
tered-0.083
zing-0.083
isy-0.082
Token,
Token ID11
Feature activation+1.96e+00

Pos logit contributions
o+0.006
ll+0.004
om+0.004
ossible+0.004
 to+0.004
Neg logit contributions
ured-0.014
comb-0.013
growth-0.013
zed-0.012
nob-0.012
Token "
Token ID366
Feature activation-3.12e-01

Pos logit contributions
o+0.044
ll+0.035
ossible+0.033
om+0.032
rou+0.030
Neg logit contributions
ured-0.075
zed-0.070
comb-0.069
zing-0.068
isy-0.067
TokenDon
Token ID3987
Feature activation-1.45e+00

Pos logit contributions
ured+0.007
tered+0.007
zed+0.007
zing+0.006
isy+0.006
Neg logit contributions
o-0.011
ll-0.010
ossible-0.010
om-0.010
rou-0.009
Token said
Token ID531
Feature activation-1.69e+00

Pos logit contributions
ured+0.004
zed+0.004
tered+0.004
zing+0.004
comb+0.004
Neg logit contributions
o-0.004
ll-0.004
ossible-0.003
rou-0.003
om-0.003
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
ured+0.065
zed+0.061
tered+0.059
zing+0.059
comb+0.059
Neg logit contributions
o-0.038
ll-0.031
ossible-0.029
om-0.027
rou-0.027
Token
Token ID198
Feature activation+2.63e-01

Pos logit contributions
o+0.010
ll+0.009
rou+0.009
ossible+0.008
etchup+0.008
Neg logit contributions
ured-0.012
tered-0.011
zed-0.011
oned-0.010
zing-0.010
Token
Token ID198
Feature activation-9.23e-01

Pos logit contributions
o+0.005
rou+0.005
ll+0.005
etchup+0.005
ossible+0.004
Neg logit contributions
zed-0.009
isy-0.009
tered-0.009
ured-0.009
zing-0.009
TokenHer
Token ID9360
Feature activation+4.63e-02

Pos logit contributions
ured+0.027
zed+0.025
comb+0.025
zing+0.024
tered+0.023
Neg logit contributions
o-0.029
ll-0.025
ossible-0.024
om-0.023
rou-0.023
Token mom
Token ID1995
Feature activation+3.16e+00

Pos logit contributions
o+0.001
ll+0.001
rou+0.001
etchup+0.001
om+0.001
Neg logit contributions
ured-0.002
isy-0.001
zed-0.001
zing-0.001
comb-0.001
Tokenmy
Token ID1820
Feature activation+2.45e+00

Pos logit contributions
o+0.049
etchup+0.049
ossible+0.048
ll+0.046
om+0.046
Neg logit contributions
isy-0.116
ured-0.116
ered-0.114
tered-0.111
est-0.110
Token smiled
Token ID13541
Feature activation+2.77e+00

Pos logit contributions
o+0.067
ossible+0.062
ll+0.061
rou+0.057
etchup+0.057
Neg logit contributions
ured-0.075
tered-0.068
isy-0.068
zed-0.067
zing-0.067
Token and
Token ID290
Feature activation+2.53e+00

Pos logit contributions
o+0.063
ll+0.055
ossible+0.051
etchup+0.048
om+0.048
Neg logit contributions
ured-0.105
tered-0.098
est-0.094
zing-0.094
comb-0.093
Token said
Token ID531
Feature activation+1.24e-01

Pos logit contributions
o+0.067
ossible+0.060
ll+0.059
om+0.057
rou+0.055
Neg logit contributions
ured-0.081
tered-0.074
zed-0.073
isy-0.072
ered-0.070
Token,
Token ID11
Feature activation+2.26e+00

Pos logit contributions
o+0.003
ll+0.002
ossible+0.002
om+0.002
rou+0.002
Neg logit contributions
ured-0.005
zed-0.005
comb-0.004
tered-0.004
zing-0.004
Token clever
Token ID14169
Feature activation-3.55e-01

Pos logit contributions
o+0.013
ll+0.011
ossible+0.011
rou+0.010
om+0.010
Neg logit contributions
ured-0.017
zed-0.016
tered-0.016
zing-0.016
comb-0.015
Token game
Token ID983
Feature activation-1.41e+00

Pos logit contributions
ured+0.012
zed+0.011
comb+0.011
zing+0.010
tered+0.010
Neg logit contributions
o-0.010
ll-0.008
ossible-0.008
om-0.008
rou-0.008
Token Lily
Token ID20037
Feature activation+1.22e-01

Pos logit contributions
ured+0.053
zed+0.049
tered+0.048
comb+0.048
zing+0.047
Neg logit contributions
o-0.032
ll-0.027
ossible-0.026
rou-0.024
om-0.024
Token made
Token ID925
Feature activation-2.31e+00

Pos logit contributions
o+0.003
ll+0.002
ossible+0.002
rou+0.002
om+0.002
Neg logit contributions
ured-0.005
zed-0.004
tered-0.004
zing-0.004
comb-0.004
Token.
Token ID13
Feature activation-4.58e-01

Pos logit contributions
ured+0.091
zed+0.084
comb+0.082
tered+0.082
zing+0.082
Neg logit contributions
o-0.050
ll-0.040
ossible-0.038
om-0.036
rou-0.036
Token<|endoftext|>
Token ID50256
Feature activation+3.14e+00

Pos logit contributions
ured+0.018
tered+0.017
zed+0.017
zing+0.016
oned+0.016
Neg logit contributions
o-0.009
ll-0.008
rou-0.007
ossible-0.007
om-0.006
TokenOnce
Token ID7454
Feature activation+6.19e-01

Pos logit contributions
rou+0.081
ll+0.074
o+0.072
ucker+0.071
aj+0.066
Neg logit contributions
ger-0.088
zed-0.088
nob-0.086
growth-0.086
tered-0.084
Token upon
Token ID2402
Feature activation+1.50e+00

Pos logit contributions
o+0.011
rou+0.010
etchup+0.009
 licenses+0.008
ll+0.008
Neg logit contributions
zed-0.022
isy-0.021
ued-0.021
ered-0.021
tered-0.020
Token a
Token ID257
Feature activation+4.09e-01

Pos logit contributions
o+0.043
ll+0.039
ossible+0.036
rou+0.036
om+0.035
Neg logit contributions
ured-0.044
zed-0.043
tered-0.043
zing-0.042
ze-0.041
Token time
Token ID640
Feature activation+1.26e+00

Pos logit contributions
o+0.014
rou+0.013
ll+0.013
etchup+0.012
ipt+0.012
Neg logit contributions
zed-0.009
zing-0.009
isy-0.009
ured-0.008
tered-0.008
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
o+0.038
rou+0.037
ll+0.035
ucker+0.034
etchup+0.034
Neg logit contributions
zed-0.030
ured-0.030
zing-0.029
comb-0.028
ued-0.028
Token them
Token ID606
Feature activation-8.65e-01

Pos logit contributions
o+0.006
ll+0.005
ossible+0.005
om+0.005
rou+0.005
Neg logit contributions
ured-0.006
zed-0.006
tered-0.005
zing-0.005
isy-0.005
Token stayed
Token ID9658
Feature activation-2.27e-01

Pos logit contributions
ured+0.028
zed+0.025
zing+0.025
comb+0.025
tered+0.025
Neg logit contributions
o-0.025
ll-0.021
ossible-0.020
om-0.019
rou-0.019
Token there
Token ID612
Feature activation+9.67e-01

Pos logit contributions
ured+0.008
zed+0.007
zing+0.007
tered+0.007
comb+0.007
Neg logit contributions
o-0.006
ll-0.005
ossible-0.005
om-0.005
rou-0.005
Token until
Token ID1566
Feature activation-9.45e-01

Pos logit contributions
o+0.026
ll+0.022
ossible+0.021
rou+0.020
om+0.020
Neg logit contributions
ured-0.033
zed-0.030
tered-0.030
zing-0.029
comb-0.029
Token the
Token ID262
Feature activation+7.30e-01

Pos logit contributions
ured+0.032
zed+0.030
tered+0.029
zing+0.029
comb+0.028
Neg logit contributions
o-0.026
ll-0.022
ossible-0.021
rou-0.020
om-0.020
Token sun
Token ID4252
Feature activation+4.06e+00

Pos logit contributions
o+0.020
ll+0.017
ossible+0.017
rou+0.016
om+0.016
Neg logit contributions
ured-0.024
zed-0.022
tered-0.021
zing-0.021
oned-0.021
Token went
Token ID1816
Feature activation+2.13e+00

Pos logit contributions
o+0.122
ll+0.104
ossible+0.099
om+0.096
rou+0.094
Neg logit contributions
ured-0.127
zed-0.115
zing-0.112
tered-0.111
comb-0.111
Token down
Token ID866
Feature activation+4.28e-01

Pos logit contributions
o+0.070
ll+0.063
ossible+0.060
rou+0.058
om+0.057
Neg logit contributions
ured-0.059
tered-0.053
zed-0.051
zing-0.051
comb-0.049
Token.
Token ID13
Feature activation-3.55e-01

Pos logit contributions
o+0.010
ll+0.009
ossible+0.008
rou+0.008
om+0.008
Neg logit contributions
ured-0.016
zed-0.014
tered-0.014
zing-0.014
comb-0.014
Token Then
Token ID3244
Feature activation-4.63e-01

Pos logit contributions
ured+0.010
zed+0.009
tered+0.008
zing+0.008
oned+0.008
Neg logit contributions
o-0.012
ll-0.011
ossible-0.010
rou-0.010
om-0.010
Token Mat
Token ID6550
Feature activation-1.47e+00

Pos logit contributions
ured+0.016
zed+0.015
comb+0.015
isy+0.015
tered+0.015
Neg logit contributions
o-0.012
ossible-0.010
ll-0.010
om-0.009
rou-0.009
TokenA
Token ID32
Feature activation+5.90e-01

Pos logit contributions
ured+0.032
zed+0.030
tered+0.029
zing+0.029
isy+0.028
Neg logit contributions
o-0.036
ll-0.032
ossible-0.031
rou-0.030
om-0.030
Token kind
Token ID1611
Feature activation+1.74e+00

Pos logit contributions
o+0.018
ll+0.016
ossible+0.016
rou+0.015
om+0.015
Neg logit contributions
isy-0.016
ger-0.015
ured-0.015
tered-0.015
zing-0.015
Token squirrel
Token ID33039
Feature activation+5.52e-01

Pos logit contributions
o+0.053
ll+0.051
ossible+0.046
rou+0.046
etchup+0.043
Neg logit contributions
ured-0.046
isy-0.046
ger-0.045
est-0.044
tered-0.043
Token heard
Token ID2982
Feature activation-9.51e-01

Pos logit contributions
o+0.015
ll+0.013
ossible+0.013
rou+0.012
om+0.012
Neg logit contributions
ured-0.018
zed-0.016
tered-0.016
zing-0.016
isy-0.016
Token the
Token ID262
Feature activation-2.23e-02

Pos logit contributions
ured+0.034
zed+0.031
tered+0.030
zing+0.030
comb+0.030
Neg logit contributions
o-0.024
ll-0.021
ossible-0.020
rou-0.019
om-0.018
Token bird
Token ID6512
Feature activation+4.52e-01

Pos logit contributions
ured+0.001
zed+0.001
zing+0.001
isy+0.001
tered+0.001
Neg logit contributions
o-0.001
ll-0.001
ossible-0.001
rou-0.001
om-0.001
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
o+0.011
ll+0.010
ossible+0.009
rou+0.009
om+0.009
Neg logit contributions
ured-0.016
zed-0.014
zing-0.014
isy-0.014
comb-0.014
Token The
Token ID383
Feature activation+3.36e-02

Pos logit contributions
ured+0.042
zed+0.039
zing+0.037
comb+0.037
tered+0.037
Neg logit contributions
o-0.032
ll-0.026
ossible-0.026
om-0.025
rou-0.024
Token squirrel
Token ID33039
Feature activation+1.10e-01

Pos logit contributions
o+0.001
ll+0.001
ossible+0.001
om+0.001
rou+0.001
Neg logit contributions
ured-0.001
isy-0.001
zed-0.001
zing-0.001
comb-0.001
Token came
Token ID1625
Feature activation+1.12e+00

Pos logit contributions
o+0.003
ossible+0.003
rou+0.002
ll+0.002
om+0.002
Neg logit contributions
ured-0.003
zed-0.003
tered-0.003
zing-0.003
isy-0.003
Token to
Token ID284
Feature activation-6.49e-01

Pos logit contributions
ossible+0.029
ucker+0.028
etchup+0.028
aj+0.028
o+0.027
Neg logit contributions
ured-0.032
ered-0.030
zing-0.030
oned-0.029
tered-0.029
Token is
Token ID318
Feature activation-4.52e-03

Pos logit contributions
o+0.044
ll+0.036
ossible+0.034
om+0.032
rou+0.032
Neg logit contributions
ured-0.075
zed-0.070
tered-0.068
zing-0.068
comb-0.067
Token gone
Token ID3750
Feature activation+1.24e+00

Pos logit contributions
ured+0.000
zed+0.000
tered+0.000
zing+0.000
comb+0.000
Neg logit contributions
o-0.000
ll-0.000
ossible-0.000
rou-0.000
om-0.000
Token.
Token ID13
Feature activation+1.32e+00

Pos logit contributions
o+0.033
ossible+0.028
ll+0.028
rou+0.025
om+0.025
Neg logit contributions
ured-0.042
tered-0.039
comb-0.038
zing-0.038
zed-0.038
Token
Token ID198
Feature activation+1.17e+00

Pos logit contributions
o+0.035
ll+0.034
rou+0.030
ossible+0.028
ounce+0.026
Neg logit contributions
ured-0.040
tered-0.039
zed-0.038
growth-0.036
zing-0.036
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
o+0.025
ll+0.024
rou+0.023
etchup+0.019
ety+0.019
Neg logit contributions
zed-0.038
ured-0.036
tered-0.036
zing-0.035
isy-0.034
TokenTom
Token ID13787
Feature activation+6.43e-01

Pos logit contributions
o+0.004
ll+0.004
rou+0.004
ossible+0.004
om+0.003
Neg logit contributions
ured-0.005
tered-0.004
zed-0.004
zing-0.004
isy-0.004
Token starts
Token ID4940
Feature activation+2.79e+00

Pos logit contributions
o+0.016
ll+0.015
ossible+0.014
rou+0.013
etchup+0.013
Neg logit contributions
ze-0.019
zing-0.018
ured-0.018
zed-0.018
ered-0.017
Token to
Token ID284
Feature activation+2.97e-01

Pos logit contributions
o+0.063
ll+0.056
ossible+0.048
ite+0.047
rou+0.045
Neg logit contributions
ured-0.100
zed-0.098
zing-0.095
ered-0.093
ze-0.092
Token cry
Token ID3960
Feature activation-3.78e-01

Pos logit contributions
o+0.011
ll+0.009
ossible+0.009
om+0.009
rou+0.009
Neg logit contributions
ured-0.008
zed-0.007
tered-0.006
zing-0.006
comb-0.006
Token.
Token ID13
Feature activation-5.44e-02

Pos logit contributions
ured+0.015
zed+0.014
tered+0.013
zing+0.013
comb+0.013
Neg logit contributions
o-0.008
ll-0.007
ossible-0.006
om-0.006
rou-0.006
Token He
Token ID679
Feature activation+1.16e-01

Pos logit contributions
ured+0.001
zed+0.001
tered+0.001
zing+0.001
comb+0.001
Neg logit contributions
o-0.002
ll-0.002
ossible-0.002
rou-0.001
om-0.001
Token.
Token ID13
Feature activation-8.12e-01

Pos logit contributions
ured+0.041
zed+0.038
tered+0.038
comb+0.037
zing+0.037
Neg logit contributions
o-0.028
ll-0.024
ossible-0.023
rou-0.022
om-0.021
Token He
Token ID679
Feature activation+1.08e+00

Pos logit contributions
ured+0.023
zed+0.021
tered+0.021
zing+0.020
comb+0.020
Neg logit contributions
o-0.026
ll-0.023
ossible-0.022
rou-0.022
om-0.021
Token said
Token ID531
Feature activation-3.18e-01

Pos logit contributions
o+0.030
ll+0.027
ossible+0.026
rou+0.025
om+0.025
Neg logit contributions
ured-0.032
zed-0.031
tered-0.029
ered-0.029
zing-0.028
Token he
Token ID339
Feature activation+3.05e+00

Pos logit contributions
ured+0.012
zed+0.011
comb+0.011
tered+0.011
zing+0.010
Neg logit contributions
o-0.008
ll-0.006
ossible-0.006
om-0.006
rou-0.006
Token was
Token ID373
Feature activation+1.61e+00

Pos logit contributions
o+0.086
ossible+0.079
ll+0.078
om+0.073
rou+0.072
Neg logit contributions
ured-0.087
zed-0.082
tered-0.081
zing-0.079
ze-0.077
Token sorry
Token ID7926
Feature activation+1.01e+00

Pos logit contributions
o+0.048
ll+0.042
ossible+0.041
om+0.039
rou+0.038
Neg logit contributions
ured-0.050
zed-0.045
zing-0.044
tered-0.044
comb-0.043
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
o+0.026
ossible+0.022
ll+0.022
om+0.021
rou+0.021
Neg logit contributions
ured-0.034
zed-0.031
tered-0.031
zing-0.031
isy-0.030
Token He
Token ID679
Feature activation+1.48e+00

Pos logit contributions
ured+0.005
zed+0.005
tered+0.005
zing+0.005
comb+0.005
Neg logit contributions
o-0.006
ll-0.006
rou-0.005
ossible-0.005
om-0.005
Token said
Token ID531
Feature activation-2.74e-02

Pos logit contributions
o+0.037
ll+0.032
ossible+0.031
om+0.030
rou+0.029
Neg logit contributions
ured-0.050
zed-0.047
tered-0.046
isy-0.045
zing-0.045
Token he
Token ID339
Feature activation+3.38e+00

Pos logit contributions
ured+0.001
zed+0.001
tered+0.001
zing+0.001
comb+0.001
Neg logit contributions
o-0.001
ll-0.001
ossible-0.001
om-0.001
rou-0.001
Token was
Token ID373
Feature activation+1.72e+00

Pos logit contributions
o+0.095
ossible+0.088
ll+0.085
om+0.081
rou+0.079
Neg logit contributions
ured-0.096
zed-0.091
tered-0.090
zing-0.088
ze-0.086
Token with
Token ID351
Feature activation+7.16e-01

Pos logit contributions
o+0.022
om+0.014
ll+0.014
ossible+0.013
rou+0.013
Neg logit contributions
ured-0.056
zed-0.054
comb-0.053
isy-0.052
ued-0.052
Token those
Token ID883
Feature activation-1.14e+00

Pos logit contributions
o+0.018
om+0.014
ll+0.014
ossible+0.014
rou+0.013
Neg logit contributions
ured-0.025
zed-0.023
comb-0.023
tered-0.023
ued-0.022
Token kinds
Token ID6982
Feature activation-3.82e-01

Pos logit contributions
ured+0.042
zed+0.041
comb+0.039
zing+0.039
tered+0.038
Neg logit contributions
o-0.028
ll-0.023
rou-0.021
om-0.020
ossible-0.020
Token of
Token ID286
Feature activation-1.11e+00

Pos logit contributions
ured+0.014
zed+0.013
zing+0.013
comb+0.013
tered+0.013
Neg logit contributions
o-0.009
ll-0.008
ossible-0.007
om-0.007
rou-0.007
Token things
Token ID1243
Feature activation-2.35e+00

Pos logit contributions
ured+0.039
zed+0.036
comb+0.036
tered+0.034
zing+0.034
Neg logit contributions
o-0.030
ll-0.025
ossible-0.024
om-0.024
rou-0.023
Token."
Token ID526
Feature activation+5.69e-01

Pos logit contributions
ured+0.084
zed+0.078
tered+0.076
zing+0.076
comb+0.076
Neg logit contributions
o-0.058
ll-0.048
ossible-0.047
om-0.044
rou-0.044
Token The
Token ID383
Feature activation+7.71e-01

Pos logit contributions
o+0.013
ll+0.011
ossible+0.011
rou+0.010
om+0.010
Neg logit contributions
ured-0.021
zed-0.020
tered-0.019
zing-0.019
comb-0.019
Token boy
Token ID2933
Feature activation-9.49e-01

Pos logit contributions
o+0.020
ll+0.017
ossible+0.016
rou+0.015
om+0.015
Neg logit contributions
ured-0.027
zed-0.025
tered-0.024
comb-0.024
zing-0.024
Token felt
Token ID2936
Feature activation-2.07e+00

Pos logit contributions
ured+0.034
zed+0.031
comb+0.030
zing+0.030
tered+0.030
Neg logit contributions
o-0.025
ll-0.020
ossible-0.020
om-0.018
rou-0.018
Token a
Token ID257
Feature activation-6.89e-01

Pos logit contributions
comb+0.088
ured+0.083
bees+0.083
isy+0.081
nob+0.078
Neg logit contributions
o-0.032
 tired-0.028
ossible-0.028
rou-0.024
 dad-0.024
Token little
Token ID1310
Feature activation-2.39e-01

Pos logit contributions
ured+0.024
zed+0.022
comb+0.022
tered+0.021
ered+0.021
Neg logit contributions
o-0.016
ll-0.014
ossible-0.014
rou-0.013
om-0.013
Tokenila
Token ID10102
Feature activation+3.02e-01

Pos logit contributions
o+0.040
ll+0.036
ossible+0.036
om+0.034
rou+0.034
Neg logit contributions
ured-0.019
zed-0.017
tered-0.016
zing-0.015
comb-0.015
Token thought
Token ID1807
Feature activation+1.05e+00

Pos logit contributions
o+0.008
ll+0.007
ossible+0.007
rou+0.006
om+0.006
Neg logit contributions
ured-0.010
zed-0.009
tered-0.009
zing-0.009
comb-0.009
Token for
Token ID329
Feature activation+5.76e-01

Pos logit contributions
o+0.027
ll+0.023
ossible+0.022
rou+0.020
om+0.020
Neg logit contributions
ured-0.037
zed-0.035
tered-0.034
zing-0.034
comb-0.033
Token a
Token ID257
Feature activation+7.49e-01

Pos logit contributions
o+0.015
ll+0.012
ossible+0.012
om+0.011
rou+0.011
Neg logit contributions
ured-0.020
zed-0.019
tered-0.018
zing-0.018
comb-0.018
Token moment
Token ID2589
Feature activation+1.33e-01

Pos logit contributions
o+0.023
ll+0.020
ossible+0.020
om+0.019
rou+0.018
Neg logit contributions
ured-0.022
zed-0.020
tered-0.020
comb-0.019
zing-0.019
Token.
Token ID13
Feature activation+6.03e-01

Pos logit contributions
o+0.003
ll+0.003
ossible+0.002
rou+0.002
om+0.002
Neg logit contributions
ured-0.005
zed-0.005
tered-0.005
comb-0.005
zing-0.004
Token She
Token ID1375
Feature activation+1.11e+00

Pos logit contributions
o+0.021
ll+0.019
ossible+0.019
rou+0.018
om+0.018
Neg logit contributions
ured-0.015
tered-0.013
zed-0.013
comb-0.013
zing-0.013
Token had
Token ID550
Feature activation+1.49e+00

Pos logit contributions
o+0.030
ll+0.025
ossible+0.024
om+0.023
rou+0.023
Neg logit contributions
ured-0.038
zed-0.035
tered-0.034
zing-0.034
comb-0.034
Token an
Token ID281
Feature activation-3.32e-01

Pos logit contributions
o+0.046
ll+0.041
rou+0.037
ossible+0.036
om+0.036
Neg logit contributions
ured-0.043
zed-0.042
zing-0.039
tered-0.039
ered-0.038
Token unusual
Token ID8468
Feature activation+4.83e-02

Pos logit contributions
ured+0.009
zed+0.009
comb+0.008
zing+0.008
bees+0.008
Neg logit contributions
o-0.010
ossible-0.010
om-0.009
 dad-0.009
rou-0.009
Token idea
Token ID2126
Feature activation-7.69e-01

Pos logit contributions
o+0.001
ossible+0.001
om+0.001
ll+0.001
rou+0.001
Neg logit contributions
ured-0.002
zed-0.001
tered-0.001
zing-0.001
comb-0.001
Token hug
Token ID16225
Feature activation+1.14e+00

Pos logit contributions
o+0.010
ll+0.009
ossible+0.008
rou+0.008
om+0.008
Neg logit contributions
ured-0.012
zed-0.011
zing-0.011
tered-0.011
comb-0.011
Token.
Token ID13
Feature activation-4.24e-01

Pos logit contributions
o+0.023
ll+0.017
ossible+0.016
om+0.016
rou+0.015
Neg logit contributions
ured-0.046
zed-0.043
isy-0.042
zing-0.042
ued-0.042
Token
Token ID198
Feature activation-4.26e-02

Pos logit contributions
zed+0.013
ured+0.013
zing+0.012
tered+0.012
oned+0.011
Neg logit contributions
o-0.011
ll-0.011
rou-0.010
ossible-0.009
om-0.009
Token
Token ID198
Feature activation-1.08e+00

Pos logit contributions
zed+0.002
ured+0.002
zing+0.002
tered+0.001
isy+0.001
Neg logit contributions
o-0.001
ll-0.001
rou-0.001
ossible-0.001
om-0.001
TokenHe
Token ID1544
Feature activation-2.98e-01

Pos logit contributions
ured+0.032
zed+0.028
comb+0.028
tered+0.027
zing+0.027
Neg logit contributions
o-0.034
ll-0.030
ossible-0.029
rou-0.028
om-0.028
Token said
Token ID531
Feature activation-1.92e+00

Pos logit contributions
ured+0.009
zed+0.009
zing+0.008
tered+0.008
comb+0.008
Neg logit contributions
o-0.009
ll-0.007
ossible-0.007
om-0.007
rou-0.007
Token goodbye
Token ID24829
Feature activation-2.60e-01

Pos logit contributions
ured+0.057
comb+0.055
isy+0.052
 odor+0.051
nob+0.051
Neg logit contributions
o-0.055
ossible-0.049
ll-0.046
om-0.046
 to-0.045
Token to
Token ID284
Feature activation-1.86e+00

Pos logit contributions
ured+0.012
zed+0.011
tered+0.011
zing+0.011
comb+0.011
Neg logit contributions
o-0.004
ll-0.003
ossible-0.003
rou-0.002
om-0.002
Token his
Token ID465
Feature activation-1.07e+00

Pos logit contributions
ured+0.065
zed+0.060
tered+0.058
zing+0.058
comb+0.058
Neg logit contributions
o-0.049
ll-0.041
ossible-0.040
om-0.037
rou-0.037
Token town
Token ID3240
Feature activation+1.79e+00

Pos logit contributions
ured+0.040
zed+0.038
zing+0.037
comb+0.036
tered+0.036
Neg logit contributions
o-0.025
ll-0.021
ossible-0.019
om-0.018
rou-0.018
Token and
Token ID290
Feature activation-3.75e-01

Pos logit contributions
o+0.039
ll+0.033
ossible+0.031
rou+0.029
om+0.029
Neg logit contributions
ured-0.069
zed-0.064
zing-0.063
tered-0.063
comb-0.062
Token and
Token ID290
Feature activation+4.87e-01

Pos logit contributions
o+0.006
ll+0.005
ossible+0.005
rou+0.005
etchup+0.005
Neg logit contributions
ured-0.011
tered-0.010
zing-0.010
zed-0.010
est-0.010
Token happy
Token ID3772
Feature activation-8.22e-02

Pos logit contributions
o+0.012
ll+0.010
ossible+0.009
om+0.009
rou+0.009
Neg logit contributions
ured-0.018
comb-0.016
zed-0.016
zing-0.016
isy-0.016
Token people
Token ID661
Feature activation+1.15e-01

Pos logit contributions
ured+0.003
zed+0.003
comb+0.003
tered+0.003
zing+0.003
Neg logit contributions
o-0.002
ll-0.002
ossible-0.002
om-0.002
rou-0.002
Token dancing
Token ID15360
Feature activation+2.60e-02

Pos logit contributions
o+0.003
ll+0.002
ossible+0.002
rou+0.002
om+0.002
Neg logit contributions
ured-0.004
zed-0.004
tered-0.004
comb-0.004
zing-0.004
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
o+0.001
ll+0.000
ossible+0.000
om+0.000
rou+0.000
Neg logit contributions
ured-0.001
zed-0.001
tered-0.001
zing-0.001
comb-0.001
Token She
Token ID1375
Feature activation-7.79e-01

Pos logit contributions
ured+0.022
zed+0.020
tered+0.020
zing+0.018
oned+0.018
Neg logit contributions
o-0.042
ll-0.039
rou-0.036
etchup-0.036
ossible-0.035
Token forgot
Token ID16453
Feature activation-3.42e-01

Pos logit contributions
ured+0.023
zed+0.022
tered+0.021
zing+0.021
isy+0.020
Neg logit contributions
o-0.023
ll-0.020
ossible-0.020
om-0.019
rou-0.019
Token all
Token ID477
Feature activation+1.67e+00

Pos logit contributions
ured+0.012
comb+0.011
zed+0.011
tered+0.011
zing+0.011
Neg logit contributions
o-0.008
ll-0.007
ossible-0.007
rou-0.006
om-0.006
Token about
Token ID546
Feature activation+2.46e+00

Pos logit contributions
o+0.045
ll+0.037
ossible+0.037
rou+0.034
om+0.034
Neg logit contributions
ured-0.058
zed-0.052
comb-0.052
tered-0.051
zing-0.051
Token being
Token ID852
Feature activation-4.69e-02

Pos logit contributions
o+0.071
ll+0.061
ossible+0.060
om+0.056
rou+0.055
Neg logit contributions
ured-0.079
zed-0.073
tered-0.070
zing-0.070
comb-0.069
Token tired
Token ID10032
Feature activation+1.48e+00

Pos logit contributions
ured+0.002
zed+0.001
zing+0.001
tered+0.001
comb+0.001
Neg logit contributions
o-0.001
ll-0.001
ossible-0.001
om-0.001
rou-0.001
Token He
Token ID679
Feature activation-9.58e-01

Pos logit contributions
est+0.023
zed+0.023
tered+0.022
ured+0.022
nob+0.021
Neg logit contributions
ll-0.054
o-0.052
rou-0.052
etchup-0.047
ucker-0.046
Token would
Token ID561
Feature activation-1.13e+00

Pos logit contributions
ured+0.031
zed+0.029
tered+0.028
zing+0.028
comb+0.028
Neg logit contributions
o-0.026
ll-0.023
ossible-0.022
rou-0.021
om-0.021
Token run
Token ID1057
Feature activation-1.17e+00

Pos logit contributions
ured+0.034
zed+0.030
tered+0.030
zing+0.030
comb+0.029
Neg logit contributions
o-0.035
ll-0.030
ossible-0.030
om-0.028
rou-0.028
Token around
Token ID1088
Feature activation-4.85e-01

Pos logit contributions
ured+0.043
zed+0.040
comb+0.040
zing+0.039
isy+0.038
Neg logit contributions
o-0.029
ll-0.023
ossible-0.022
om-0.021
rou-0.020
Token in
Token ID287
Feature activation+4.79e-01

Pos logit contributions
ured+0.018
zed+0.017
tered+0.016
zing+0.016
comb+0.016
Neg logit contributions
o-0.011
ll-0.010
ossible-0.009
rou-0.009
ucker-0.009
Token the
Token ID262
Feature activation-9.40e-01

Pos logit contributions
o+0.015
ll+0.015
ucker+0.013
rou+0.012
om+0.012
Neg logit contributions
ured-0.013
zed-0.013
est-0.012
zing-0.012
ered-0.012
Token grass
Token ID8701
Feature activation+1.87e+00

Pos logit contributions
ured+0.030
zed+0.028
tered+0.027
zing+0.027
comb+0.027
Neg logit contributions
o-0.027
ll-0.023
ossible-0.022
om-0.021
rou-0.021
Token and
Token ID290
Feature activation+2.23e-01

Pos logit contributions
o+0.045
ossible+0.044
etchup+0.042
aj+0.041
rou+0.040
Neg logit contributions
ured-0.057
comb-0.056
est-0.055
tered-0.054
atively-0.054
Token make
Token ID787
Feature activation-3.02e+00

Pos logit contributions
o+0.006
ll+0.005
ossible+0.005
om+0.004
rou+0.004
Neg logit contributions
ured-0.008
tered-0.007
zed-0.007
isy-0.007
zing-0.007
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
ured+0.095
zed+0.087
tered+0.085
zing+0.084
comb+0.084
Neg logit contributions
o-0.088
ll-0.076
ossible-0.074
om-0.070
rou-0.070
Token big
Token ID1263
Feature activation-1.99e+00

Pos logit contributions
ured+0.033
zed+0.030
comb+0.030
zing+0.030
isy+0.030
Neg logit contributions
o-0.031
ll-0.027
ossible-0.026
om-0.024
rou-0.024
Token.
Token ID13
Feature activation-8.74e-01

Pos logit contributions
ured+0.017
zed+0.016
tered+0.015
comb+0.015
zing+0.015
Neg logit contributions
o-0.010
ll-0.009
ossible-0.008
rou-0.008
om-0.008
Token Every
Token ID3887
Feature activation+1.97e+00

Pos logit contributions
ured+0.026
zed+0.025
tered+0.025
est+0.024
zing+0.024
Neg logit contributions
o-0.023
ll-0.023
rou-0.022
ossible-0.020
ucker-0.019
Token day
Token ID1110
Feature activation-2.30e+00

Pos logit contributions
o+0.058
ll+0.055
rou+0.050
ossible+0.048
ucker+0.048
Neg logit contributions
zed-0.058
ured-0.056
zing-0.054
comb-0.053
est-0.053
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
ured+0.085
zed+0.077
comb+0.077
isy+0.075
zing+0.074
Neg logit contributions
o-0.058
ossible-0.045
ll-0.045
om-0.044
etchup-0.039
Token he
Token ID339
Feature activation-3.04e-01

Pos logit contributions
ured+0.044
zed+0.040
zing+0.038
comb+0.038
tered+0.038
Neg logit contributions
o-0.035
ll-0.029
ossible-0.029
om-0.027
rou-0.026
Token would
Token ID561
Feature activation-2.42e-01

Pos logit contributions
ured+0.010
zed+0.009
tered+0.009
zing+0.009
comb+0.009
Neg logit contributions
o-0.008
ll-0.007
ossible-0.007
rou-0.007
om-0.006
Token travel
Token ID3067
Feature activation+2.48e-01

Pos logit contributions
ured+0.008
zed+0.007
tered+0.007
zing+0.007
comb+0.007
Neg logit contributions
o-0.007
ll-0.006
ossible-0.006
om-0.005
rou-0.005
Token on
Token ID319
Feature activation-2.14e+00

Pos logit contributions
o+0.005
ll+0.004
ossible+0.004
rou+0.004
etchup+0.004
Neg logit contributions
ured-0.010
tered-0.009
zed-0.009
zing-0.009
ger-0.009
Token his
Token ID465
Feature activation-1.48e+00

Pos logit contributions
ured+0.065
zed+0.062
zing+0.060
tered+0.060
ered+0.057
Neg logit contributions
o-0.064
ll-0.059
ossible-0.053
rou-0.051
ucker-0.051
Token horse
Token ID8223
Feature activation-1.66e-01

Pos logit contributions
ured+0.043
zed+0.041
zing+0.039
isy+0.038
comb+0.038
Neg logit contributions
o-0.047
ll-0.039
rou-0.039
om-0.038
ossible-0.038
Token around
Token ID1088
Feature activation+5.26e-01

Pos logit contributions
ured+0.006
zed+0.005
comb+0.005
zing+0.005
tered+0.005
Neg logit contributions
o-0.004
ll-0.004
rou-0.004
ucker-0.004
etchup-0.003
Token caught
Token ID4978
Feature activation-5.99e-02

Pos logit contributions
o+0.018
ossible+0.016
ll+0.016
rou+0.015
om+0.015
Neg logit contributions
ured-0.018
zed-0.017
tered-0.017
zing-0.016
comb-0.016
Token them
Token ID606
Feature activation-1.24e+00

Pos logit contributions
ured+0.002
zed+0.002
comb+0.002
zing+0.002
tered+0.002
Neg logit contributions
o-0.002
ll-0.001
ossible-0.001
om-0.001
rou-0.001
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
ured+0.047
zed+0.044
tered+0.043
zing+0.043
comb+0.043
Neg logit contributions
o-0.028
ll-0.023
ossible-0.022
rou-0.020
om-0.020
Token He
Token ID679
Feature activation+1.13e-01

Pos logit contributions
ured+0.032
tered+0.029
zed+0.029
zing+0.028
ued+0.027
Neg logit contributions
o-0.031
ll-0.029
rou-0.027
ossible-0.026
om-0.025
Token took
Token ID1718
Feature activation-2.26e+00

Pos logit contributions
o+0.003
ll+0.003
ossible+0.003
om+0.002
rou+0.002
Neg logit contributions
ured-0.004
zed-0.003
comb-0.003
zing-0.003
tered-0.003
Token the
Token ID262
Feature activation-5.87e-01

Pos logit contributions
comb+0.089
ured+0.087
isy+0.085
zed+0.083
bees+0.083
Neg logit contributions
o-0.048
ossible-0.036
om-0.035
 dad-0.030
etchup-0.030
Token key
Token ID1994
Feature activation-1.67e+00

Pos logit contributions
ured+0.018
zed+0.016
zing+0.015
tered+0.015
comb+0.015
Neg logit contributions
o-0.018
ll-0.016
ossible-0.016
om-0.015
rou-0.014
Token and
Token ID290
Feature activation+5.83e-02

Pos logit contributions
ured+0.058
tered+0.053
comb+0.052
zing+0.051
zed+0.051
Neg logit contributions
o-0.042
ll-0.036
ossible-0.036
rou-0.033
om-0.032
Token ran
Token ID4966
Feature activation-7.27e-01

Pos logit contributions
o+0.002
ll+0.001
ossible+0.001
om+0.001
etchup+0.001
Neg logit contributions
ured-0.002
isy-0.002
zed-0.002
comb-0.002
zing-0.002
Token away
Token ID1497
Feature activation-1.30e+00

Pos logit contributions
ured+0.021
comb+0.021
zed+0.020
isy+0.020
bees+0.020
Neg logit contributions
o-0.022
ll-0.018
ossible-0.017
om-0.017
rou-0.017
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
ured+0.053
zed+0.050
isy+0.048
zing+0.048
comb+0.048
Neg logit contributions
o-0.026
ll-0.020
om-0.018
ossible-0.018
rou-0.018
Token He
Token ID679
Feature activation-5.73e-01

Pos logit contributions
ured+0.013
zed+0.012
tered+0.012
zing+0.012
comb+0.011
Neg logit contributions
o-0.015
ll-0.015
rou-0.013
om-0.013
ossible-0.013
Token had
Token ID550
Feature activation-1.25e-02

Pos logit contributions
ured+0.019
zed+0.017
comb+0.017
tered+0.017
zing+0.017
Neg logit contributions
o-0.016
ll-0.013
ossible-0.013
rou-0.012
om-0.012
Token a
Token ID257
Feature activation+1.45e-01

Pos logit contributions
ured+0.000
zed+0.000
zing+0.000
tered+0.000
ered+0.000
Neg logit contributions
o-0.000
ll-0.000
ossible-0.000
om-0.000
rou-0.000
Token big
Token ID1263
Feature activation-1.02e+00

Pos logit contributions
o+0.005
ll+0.004
ossible+0.004
om+0.004
rou+0.004
Neg logit contributions
ured-0.004
zed-0.004
zing-0.004
comb-0.003
isy-0.003
Token smile
Token ID8212
Feature activation+7.48e-01

Pos logit contributions
ured+0.031
zed+0.029
comb+0.028
zing+0.028
tered+0.028
Neg logit contributions
o-0.030
ll-0.026
ossible-0.024
rou-0.024
om-0.023
Token on
Token ID319
Feature activation-2.99e+00

Pos logit contributions
o+0.018
ll+0.014
ossible+0.014
om+0.013
rou+0.013
Neg logit contributions
ured-0.028
zed-0.026
tered-0.025
comb-0.025
ued-0.025
Token his
Token ID465
Feature activation-2.11e+00

Pos logit contributions
comb+0.123
ured+0.120
bees+0.117
isy+0.115
ued+0.111
Neg logit contributions
ossible-0.047
o-0.045
 dad-0.039
 to-0.033
 Dad-0.033
Token face
Token ID1986
Feature activation+2.07e-01

Pos logit contributions
ured+0.056
zed+0.050
comb+0.049
tered+0.049
ued+0.049
Neg logit contributions
o-0.069
ossible-0.061
ll-0.060
om-0.058
rou-0.057
Token and
Token ID290
Feature activation+2.77e-01

Pos logit contributions
o+0.004
ll+0.003
ossible+0.002
om+0.002
rou+0.002
Neg logit contributions
ured-0.009
zed-0.008
tered-0.008
zing-0.008
comb-0.008
Token felt
Token ID2936
Feature activation-1.81e+00

Pos logit contributions
o+0.007
ll+0.006
ossible+0.006
rou+0.006
om+0.005
Neg logit contributions
ured-0.010
comb-0.009
tered-0.009
isy-0.009
zed-0.009
Token so
Token ID523
Feature activation-1.93e+00

Pos logit contributions
comb+0.068
ured+0.068
isy+0.065
bees+0.065
oned+0.063
Neg logit contributions
o-0.036
rou-0.030
ossible-0.028
 tired-0.026
 to-0.025
Token!
Token ID0
Feature activation-9.63e-01

Pos logit contributions
ured+0.003
zed+0.003
tered+0.003
zing+0.003
comb+0.003
Neg logit contributions
o-0.002
ll-0.002
ossible-0.002
om-0.001
rou-0.001
Token\n
Token ID198
Feature activation+1.05e-01

Pos logit contributions
ured+0.032
tered+0.030
zed+0.029
ued+0.029
oned+0.029
Neg logit contributions
o-0.023
rou-0.022
ll-0.022
ossible-0.020
ucker-0.019
Token\n
Token ID198
Feature activation-1.21e+00

Pos logit contributions
o+0.002
rou+0.002
ll+0.002
ossible+0.002
etchup+0.002
Neg logit contributions
zed-0.004
ured-0.004
zing-0.004
tered-0.004
ued-0.004
TokenThe
Token ID464
Feature activation+1.35e-01

Pos logit contributions
ured+0.042
comb+0.039
zed+0.038
 odor+0.037
est+0.037
Neg logit contributions
o-0.032
ll-0.026
ossible-0.026
om-0.023
rou-0.023
Token wild
Token ID4295
Feature activation+3.60e-01

Pos logit contributions
o+0.004
ll+0.003
om+0.003
ossible+0.003
rou+0.003
Neg logit contributions
ured-0.004
zed-0.004
zing-0.004
tered-0.004
isy-0.004
Token pants
Token ID12581
Feature activation-3.30e+00

Pos logit contributions
o+0.010
ll+0.009
ossible+0.009
rou+0.008
om+0.008
Neg logit contributions
ured-0.012
zed-0.010
tered-0.010
zing-0.010
est-0.010
Token was
Token ID373
Feature activation-5.75e-01

Pos logit contributions
ured+0.106
zed+0.097
zing+0.094
comb+0.093
tered+0.093
Neg logit contributions
o-0.095
ll-0.082
ossible-0.079
om-0.075
rou-0.074
Token so
Token ID523
Feature activation-1.44e+00

Pos logit contributions
ured+0.020
zed+0.018
tered+0.018
zing+0.018
comb+0.017
Neg logit contributions
o-0.015
ll-0.013
ossible-0.013
rou-0.012
om-0.012
Token excited
Token ID6568
Feature activation+1.31e+00

Pos logit contributions
ured+0.043
tered+0.040
zed+0.040
zing+0.038
ered+0.037
Neg logit contributions
o-0.043
ll-0.037
ossible-0.036
rou-0.034
etchup-0.034
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
o+0.033
ossible+0.029
ll+0.029
rou+0.028
om+0.027
Neg logit contributions
ured-0.044
zed-0.041
tered-0.040
zing-0.040
isy-0.039
Token He
Token ID679
Feature activation-1.31e+00

Pos logit contributions
ured+0.041
zed+0.037
tered+0.037
zing+0.036
comb+0.036
Neg logit contributions
o-0.040
ll-0.035
ossible-0.034
rou-0.032
om-0.032
Token frozen
Token ID12912
Feature activation-9.78e-01

Pos logit contributions
o+0.029
ll+0.024
ossible+0.023
rou+0.022
om+0.022
Neg logit contributions
ured-0.041
zed-0.038
zing-0.037
tered-0.037
comb-0.037
Token,
Token ID11
Feature activation-3.97e-01

Pos logit contributions
ured+0.038
zed+0.036
zing+0.034
isy+0.034
tered+0.034
Neg logit contributions
o-0.021
ll-0.017
rou-0.015
ossible-0.015
om-0.015
Token so
Token ID523
Feature activation-1.26e+00

Pos logit contributions
ured+0.013
zed+0.012
comb+0.012
zing+0.012
tered+0.012
Neg logit contributions
o-0.011
ll-0.009
ossible-0.009
rou-0.008
om-0.008
Token he
Token ID339
Feature activation-6.35e-01

Pos logit contributions
ured+0.046
comb+0.042
oned+0.041
zed+0.040
ued+0.040
Neg logit contributions
o-0.032
ll-0.025
ossible-0.024
rou-0.023
om-0.022
Token hopped
Token ID45230
Feature activation-1.89e-02

Pos logit contributions
ured+0.019
zed+0.018
comb+0.017
zing+0.017
tered+0.017
Neg logit contributions
o-0.019
ll-0.017
ossible-0.016
om-0.015
rou-0.015
Token on
Token ID319
Feature activation-3.07e+00

Pos logit contributions
ured+0.001
zed+0.001
comb+0.001
zing+0.001
isy+0.001
Neg logit contributions
o-0.001
ll-0.000
ossible-0.000
rou-0.000
om-0.000
Token it
Token ID340
Feature activation-6.61e-01

Pos logit contributions
ured+0.111
zed+0.103
zing+0.100
tered+0.100
comb+0.099
Neg logit contributions
o-0.077
ll-0.065
ossible-0.062
om-0.058
rou-0.058
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
ured+0.027
zed+0.025
zing+0.024
tered+0.024
comb+0.024
Neg logit contributions
o-0.013
ll-0.011
ossible-0.010
om-0.009
rou-0.009
Token fell
Token ID3214
Feature activation+8.03e-01

Pos logit contributions
o+0.001
ll+0.000
ossible+0.000
rou+0.000
om+0.000
Neg logit contributions
ured-0.001
comb-0.001
isy-0.001
zed-0.001
est-0.001
Token through
Token ID832
Feature activation-1.03e-01

Pos logit contributions
o+0.020
ll+0.017
ossible+0.016
rou+0.015
om+0.015
Neg logit contributions
ured-0.029
zed-0.027
tered-0.026
comb-0.026
zing-0.026
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
ured+0.004
tered+0.004
comb+0.004
isy+0.004
zed+0.004
Neg logit contributions
o-0.002
ossible-0.002
ll-0.002
rou-0.002
om-0.002
Token to
Token ID284
Feature activation-8.83e-01

Pos logit contributions
ured+0.018
zed+0.017
comb+0.017
isy+0.016
tered+0.016
Neg logit contributions
o-0.009
ossible-0.007
ll-0.007
om-0.007
rou-0.006
Token her
Token ID607
Feature activation+7.53e-01

Pos logit contributions
ured+0.022
zed+0.020
tered+0.019
zing+0.019
comb+0.019
Neg logit contributions
o-0.031
ll-0.028
ossible-0.027
rou-0.026
om-0.026
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
o+0.016
ossible+0.013
ll+0.013
rou+0.012
om+0.011
Neg logit contributions
ured-0.030
zed-0.028
tered-0.027
comb-0.027
zing-0.027
Token
Token ID6184
Feature activation-1.24e+00

Pos logit contributions
o+0.032
ll+0.027
ossible+0.026
rou+0.025
om+0.025
Neg logit contributions
ured-0.043
zed-0.040
tered-0.039
zing-0.039
comb-0.038
Token
Token ID95
Feature activation-2.30e-01

Pos logit contributions
ured+0.033
zed+0.031
tered+0.030
zing+0.030
est+0.029
Neg logit contributions
o-0.040
ll-0.036
ossible-0.034
rou-0.033
om-0.033
Token
Token ID26391
Feature activation-3.15e+00

Pos logit contributions
ured+0.005
 trigger+0.004
zed+0.004
ered+0.004
comb+0.004
Neg logit contributions
o-0.009
ll-0.008
ossible-0.008
rou-0.007
om-0.007
Token
Token ID129
Feature activation-6.62e-01

Pos logit contributions
zed+0.089
ured+0.087
zing+0.086
comb+0.085
ze+0.083
Neg logit contributions
o-0.087
ll-0.078
rou-0.077
ossible-0.075
etchup-0.072
Token
Token ID241
Feature activation+5.96e-01

Pos logit contributions
ured+0.013
comb+0.012
zed+0.012
est+0.011
tered+0.011
Neg logit contributions
o-0.026
ll-0.023
ossible-0.022
om-0.022
rou-0.022
TokenLet
Token ID5756
Feature activation-2.74e+00

Pos logit contributions
o+0.017
ll+0.015
ossible+0.015
om+0.014
rou+0.014
Neg logit contributions
ured-0.018
zed-0.017
tered-0.017
zing-0.016
comb-0.016
Tokenâ
Token ID22940
Feature activation-1.91e+00

Pos logit contributions
ured+0.118
comb+0.114
zed+0.108
tered+0.106
bees+0.105
Neg logit contributions
o-0.051
ossible-0.043
ll-0.039
om-0.032
 dad-0.029
Token
Token ID26391
Feature activation-3.30e+00

Pos logit contributions
ured+0.045
comb+0.039
bees+0.038
tered+0.037
zing+0.037
Neg logit contributions
o-0.071
ossible-0.064
ll-0.061
om-0.055
etchup-0.054
Token family
Token ID1641
Feature activation-1.76e-01

Pos logit contributions
o+0.001
ll+0.001
ossible+0.001
rou+0.001
om+0.001
Neg logit contributions
ured-0.001
zed-0.001
tered-0.001
zing-0.001
comb-0.001
Token in
Token ID287
Feature activation+1.09e+00

Pos logit contributions
ured+0.006
zed+0.006
zing+0.005
tered+0.005
comb+0.005
Neg logit contributions
o-0.005
ll-0.004
ossible-0.004
om-0.004
rou-0.004
Token and
Token ID290
Feature activation+9.16e-01

Pos logit contributions
o+0.028
ll+0.024
ossible+0.022
rou+0.022
om+0.021
Neg logit contributions
ured-0.038
zed-0.035
tered-0.034
zing-0.034
comb-0.034
Token shared
Token ID4888
Feature activation-2.21e-01

Pos logit contributions
o+0.022
ll+0.017
ossible+0.016
rou+0.015
om+0.015
Neg logit contributions
ured-0.035
zed-0.032
comb-0.032
zing-0.031
tered-0.031
Token his
Token ID465
Feature activation-2.48e+00

Pos logit contributions
ured+0.008
zed+0.007
tered+0.007
zing+0.007
comb+0.007
Neg logit contributions
o-0.006
ll-0.005
ossible-0.005
rou-0.004
om-0.004
Token meal
Token ID9799
Feature activation-3.19e+00

Pos logit contributions
ured+0.079
zed+0.072
tered+0.070
comb+0.070
zing+0.069
Neg logit contributions
o-0.072
ll-0.062
ossible-0.061
om-0.057
rou-0.057
Token with
Token ID351
Feature activation-2.38e-01

Pos logit contributions
ured+0.118
zed+0.109
tered+0.107
comb+0.106
zing+0.106
Neg logit contributions
o-0.075
ll-0.062
ossible-0.060
rou-0.057
om-0.056
Token them
Token ID606
Feature activation-9.05e-01

Pos logit contributions
ured+0.008
zed+0.007
comb+0.007
tered+0.007
zing+0.007
Neg logit contributions
o-0.006
ll-0.005
ossible-0.005
om-0.005
rou-0.005
Token.
Token ID13
Feature activation+5.16e-02

Pos logit contributions
ured+0.034
zed+0.032
tered+0.031
zing+0.031
comb+0.031
Neg logit contributions
o-0.020
ll-0.017
ossible-0.016
om-0.015
rou-0.015
Token They
Token ID1119
Feature activation-1.05e+00

Pos logit contributions
o+0.001
ll+0.001
rou+0.001
ossible+0.001
om+0.001
Neg logit contributions
ured-0.002
zed-0.002
tered-0.002
zing-0.002
comb-0.002
Token were
Token ID547
Feature activation-2.11e-01

Pos logit contributions
ured+0.037
zed+0.034
zing+0.034
comb+0.033
oned+0.032
Neg logit contributions
o-0.028
ll-0.024
ossible-0.023
rou-0.021
om-0.020