TOP ACTIVATIONS
MAX = 8.406

journal back . She shouted , " Max , you 're
this ?" she sob bed , pointing at the cup .
do that ." They say , " Thank you , sir
chased us ." Mom says , " No , the ducks
, okay ?" mom says , hugging Anna .
at Max . They say , " Stop ! That is
saved me ." She says , " I 'm sorry ,
, Anna ?" he said , wiping her tears . "
, kids ," mom said , after they said goodbye to
hug Sam . They say , " Thank you , Sam
." Mom said , " Playing is good ,
and hug . They say , " Yes , we can
to the boy and shouted , " That 's my rabbit
my fault ," Mia said , as she hugged him .
do that !" she said , crying . Mom
scissors ?" Mom my said , " Yes , you can
the track ," Mia said , pointing at him .
a shelf . She whispered , " Don 't worry ,
pushed him . She yelled , " No ! Those are
L ila !" Tom said , relieved . "

MIN ACTIVATIONS
MIN = -7.811

sailed for days , exploring unch arted waters until he came
, the farmer in the de ll ". Suddenly , he
there was a ready as hr t ay . Every day
! <|endoftext|> One day , Sil via was getting ready to
's very mild , like tal cum powder . I 'm
M olly was watching the Sh ower . She watched the
. One day Sh rimp decided to go visit
. It was covered in de w from the wet rain
Jimmy was always excited for Wed nes days because it was
Wed nes days because on Wed nes days she did something
are not flowers , my de ars . Those are caul
to wait until tomorrow . Because it is time for bed
Her mom said , " Ste ak is a type of
jelly . <|endoftext|> John loved k ites . He looked at
and wet from the morning de w . She pointed at
looks wonderful on you , Sil via ! You look so
of water from the morning de w . Then
did something special . On Wed nes days she liked to
was a three - year - old girl called Joy .
two friends named Alex and Rh onda . Alex wanted to

INTERVAL 4.352 - 8.406
CONTAINS 1.003%

" S ara and Ben , what did you do ?
like it ?" Ben asks , showing his image to Anna
! He picked it up , and it seemed to be
M ia said , " No , Tim ,
stopped . Sarah took a closer look and saw that the

INTERVAL 0.298 - 4.352
CONTAINS 40.016%

beach . So Jack and his dad parted ways with the
C rows don 't like it when people come too close
! She moved around the me adow again , this time
feel better and you can go home ." But
They saw trains that were red , blue , green ,

INTERVAL -3.757 - 0.298
CONTAINS 58.303%

her toys and talk on her phone . One day ,
flowers . Soon it saw something new - a shiny stone
said . " You might damage your suitcase or someone else
let 's go ." <|endoftext|> Ben and Lily were playing in
to carefully shave . It was perfect ! He was so

INTERVAL -7.811 - -3.757
CONTAINS 0.678%

Jim . Jim loved playing with his toys , but one
L ily was sad because she really wanted to see
, and Sally became anxious because she really wanted to play
ready to pay , but Grand pa was last in line
drove . Inside was a mom , dad and a little
Token journal
Token ID3989
Feature activation+4.16e+00

Pos logit contributions
paper+0.053
ro+0.050
coat+0.050
pit+0.050
ked+0.047
Neg logit contributions
body-0.050
 of-0.047
 much-0.046
old-0.043
 important-0.042
Token back
Token ID736
Feature activation+4.55e-01

Pos logit contributions
paper+0.088
ro+0.088
pit+0.087
ked+0.087
coat+0.084
Neg logit contributions
 of-0.085
 much-0.083
 important-0.069
-0.068
 unexpected-0.060
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
paper+0.011
coat+0.010
ro+0.010
pit+0.010
ked+0.010
Neg logit contributions
body-0.008
 of-0.008
 much-0.008
old-0.007
 important-0.007
Token She
Token ID1375
Feature activation+2.94e+00

Pos logit contributions
pit+0.041
paper+0.039
coat+0.038
ro+0.036
ked+0.036
Neg logit contributions
 much-0.018
 of-0.017
 Everyone-0.015
 you-0.014
-0.014
Token shouted
Token ID17293
Feature activation+3.41e+00

Pos logit contributions
paper+0.065
pit+0.062
ro+0.061
coat+0.061
thank+0.055
Neg logit contributions
 of-0.058
body-0.055
 much-0.054
old-0.052
-0.050
Token,
Token ID11
Feature activation+8.41e+00

Pos logit contributions
paper+0.091
pit+0.090
coat+0.088
ro+0.087
thank+0.081
Neg logit contributions
 of-0.055
 much-0.053
body-0.049
 important-0.044
-0.043
Token "
Token ID366
Feature activation+3.98e+00

Pos logit contributions
paper+0.233
pit+0.228
coat+0.213
ro+0.212
thank+0.205
Neg logit contributions
 of-0.128
 much-0.122
 about-0.104
-0.100
 important-0.100
TokenMax
Token ID11518
Feature activation-6.02e-01

Pos logit contributions
pit+0.091
paper+0.090
ro+0.084
coat+0.083
cast+0.079
Neg logit contributions
body-0.082
-0.072
 of-0.071
 much-0.068
 you-0.067
Token,
Token ID11
Feature activation+4.99e+00

Pos logit contributions
 of+0.009
body+0.009
 much+0.009
+0.008
old+0.008
Neg logit contributions
paper-0.016
ro-0.015
pit-0.015
coat-0.015
ked-0.015
Token you
Token ID345
Feature activation+1.46e+00

Pos logit contributions
paper+0.132
ro+0.127
coat+0.127
pit+0.126
ked+0.122
Neg logit contributions
body-0.079
 of-0.073
 much-0.071
old-0.064
 important-0.062
Token're
Token ID821
Feature activation-2.16e+00

Pos logit contributions
paper+0.027
coat+0.025
ked+0.025
ro+0.024
pit+0.024
Neg logit contributions
body-0.035
 much-0.031
old-0.031
 of-0.031
ug-0.030
Token this
Token ID428
Feature activation+6.29e-01

Pos logit contributions
body+0.043
 of+0.041
old+0.039
 much+0.038
ug+0.037
Neg logit contributions
paper-0.070
coat-0.064
pit-0.064
ked-0.062
ro-0.062
Token?"
Token ID1701
Feature activation+1.78e+00

Pos logit contributions
paper+0.015
ro+0.015
pit+0.015
coat+0.014
ked+0.014
Neg logit contributions
body-0.011
 of-0.011
 much-0.010
 important-0.009
old-0.009
Token she
Token ID673
Feature activation+3.56e+00

Pos logit contributions
paper+0.043
pit+0.041
coat+0.041
ro+0.040
ked+0.039
Neg logit contributions
body-0.033
 of-0.032
 much-0.031
old-0.028
 important-0.027
Token sob
Token ID27355
Feature activation+1.39e+00

Pos logit contributions
paper+0.067
ro+0.064
coat+0.063
pit+0.062
ked+0.061
Neg logit contributions
body-0.083
 of-0.077
 much-0.076
old-0.072
igators-0.072
Tokenbed
Token ID3077
Feature activation+2.49e+00

Pos logit contributions
paper+0.019
ro+0.018
pit+0.017
coat+0.017
ked+0.017
Neg logit contributions
body-0.040
 of-0.038
 much-0.038
old-0.036
igators-0.036
Token,
Token ID11
Feature activation+8.34e+00

Pos logit contributions
paper+0.058
ro+0.056
coat+0.056
pit+0.055
ked+0.052
Neg logit contributions
body-0.046
 of-0.045
 much-0.044
old-0.040
 important-0.039
Token pointing
Token ID10609
Feature activation+2.49e+00

Pos logit contributions
paper+0.208
ro+0.194
pit+0.191
coat+0.188
suit+0.177
Neg logit contributions
 of-0.146
 much-0.144
body-0.133
 important-0.121
 importantly-0.121
Token at
Token ID379
Feature activation-5.51e-01

Pos logit contributions
coat+0.075
ro+0.075
paper+0.074
pit+0.073
 extract+0.068
Neg logit contributions
 of-0.037
 much-0.032
old-0.026
 important-0.023
-0.023
Token the
Token ID262
Feature activation-1.35e-01

Pos logit contributions
body+0.009
 of+0.009
 much+0.008
old+0.007
 important+0.007
Neg logit contributions
paper-0.015
coat-0.014
pit-0.014
ro-0.014
ked-0.013
Token cup
Token ID6508
Feature activation+1.51e+00

Pos logit contributions
 of+0.003
body+0.003
 much+0.003
 important+0.003
+0.003
Neg logit contributions
paper-0.003
ked-0.003
pit-0.003
ro-0.003
coat-0.002
Token.
Token ID13
Feature activation+2.31e+00

Pos logit contributions
paper+0.035
coat+0.033
ked+0.033
ro+0.033
pit+0.032
Neg logit contributions
 of-0.030
body-0.027
 much-0.026
old-0.023
igators-0.023
Token do
Token ID466
Feature activation+6.51e-01

Pos logit contributions
paper+0.044
pit+0.042
ro+0.042
coat+0.041
ked+0.041
Neg logit contributions
 of-0.037
body-0.036
 much-0.035
-0.032
old-0.031
Token that
Token ID326
Feature activation+4.71e-01

Pos logit contributions
paper+0.017
ro+0.016
coat+0.016
pit+0.016
ked+0.016
Neg logit contributions
body-0.010
 of-0.009
 much-0.009
old-0.008
igators-0.008
Token."
Token ID526
Feature activation+1.29e+00

Pos logit contributions
paper+0.012
coat+0.012
ro+0.012
pit+0.011
ked+0.011
Neg logit contributions
body-0.008
 of-0.007
 much-0.007
 important-0.007
old-0.006
Token They
Token ID1119
Feature activation+3.23e+00

Pos logit contributions
paper+0.031
pit+0.030
coat+0.030
ked+0.028
ro+0.028
Neg logit contributions
 of-0.024
 much-0.023
body-0.022
old-0.020
ug-0.019
Token say
Token ID910
Feature activation+3.91e+00

Pos logit contributions
paper+0.080
pit+0.076
coat+0.074
ro+0.073
ked+0.070
Neg logit contributions
 of-0.057
 much-0.052
-0.051
body-0.050
old-0.048
Token,
Token ID11
Feature activation+8.31e+00

Pos logit contributions
paper+0.095
pit+0.093
coat+0.093
ro+0.092
ked+0.086
Neg logit contributions
body-0.068
 of-0.068
 much-0.066
old-0.058
 important-0.058
Token "
Token ID366
Feature activation+4.66e+00

Pos logit contributions
paper+0.218
coat+0.208
pit+0.206
ro+0.191
suit+0.187
Neg logit contributions
 of-0.137
 much-0.127
-0.107
 God-0.097
 Everyone-0.095
TokenThank
Token ID10449
Feature activation-1.13e+00

Pos logit contributions
pit+0.093
paper+0.089
coat+0.086
aj+0.082
ro+0.079
Neg logit contributions
body-0.105
 much-0.099
-0.096
 of-0.094
 you-0.090
Token you
Token ID345
Feature activation+2.16e+00

Pos logit contributions
 earthquake+0.015
body+0.015
igators+0.015
 important+0.014
old+0.014
Neg logit contributions
paper-0.028
ked-0.026
ro-0.026
balls-0.025
-0.025
Token,
Token ID11
Feature activation+2.97e+00

Pos logit contributions
paper+0.039
pit+0.038
ro+0.036
coat+0.036
ked+0.035
Neg logit contributions
body-0.052
old-0.045
igators-0.045
 much-0.043
ug-0.042
Token sir
Token ID15967
Feature activation-2.55e+00

Pos logit contributions
paper+0.061
pit+0.058
ro+0.057
coat+0.055
ked+0.054
Neg logit contributions
body-0.063
 much-0.057
igators-0.056
 of-0.055
old-0.055
Token chased
Token ID26172
Feature activation-1.20e-01

Pos logit contributions
paper+0.023
pit+0.022
coat+0.022
ro+0.022
ked+0.021
Neg logit contributions
body-0.017
 of-0.016
 much-0.015
old-0.014
 important-0.014
Token us
Token ID514
Feature activation+1.08e+00

Pos logit contributions
body+0.002
 much+0.002
 importantly+0.002
 of+0.002
old+0.002
Neg logit contributions
paper-0.003
ro-0.003
ked-0.003
coat-0.003
pit-0.003
Token."
Token ID526
Feature activation+1.33e+00

Pos logit contributions
paper+0.024
pit+0.023
coat+0.022
ro+0.022
ked+0.021
Neg logit contributions
 of-0.022
body-0.021
 much-0.021
old-0.019
igators-0.018
Token Mom
Token ID11254
Feature activation+3.81e-01

Pos logit contributions
paper+0.033
coat+0.031
pit+0.031
ro+0.030
ked+0.030
Neg logit contributions
body-0.024
 of-0.023
 much-0.023
old-0.020
 important-0.020
Token says
Token ID1139
Feature activation+4.22e+00

Pos logit contributions
paper+0.008
ro+0.008
coat+0.008
pit+0.008
ked+0.007
Neg logit contributions
body-0.008
 of-0.008
 much-0.007
old-0.007
igators-0.007
Token,
Token ID11
Feature activation+8.23e+00

Pos logit contributions
paper+0.106
ro+0.104
pit+0.104
coat+0.102
ked+0.096
Neg logit contributions
 of-0.072
body-0.070
 much-0.068
old-0.060
 important-0.057
Token "
Token ID366
Feature activation+3.99e+00

Pos logit contributions
paper+0.230
pit+0.218
ro+0.209
cast+0.203
coat+0.203
Neg logit contributions
 of-0.137
 much-0.126
body-0.106
-0.105
 importantly-0.104
TokenNo
Token ID2949
Feature activation+4.44e-01

Pos logit contributions
paper+0.086
pit+0.082
coat+0.082
ro+0.079
ked+0.075
Neg logit contributions
body-0.088
 of-0.082
 much-0.079
old-0.072
 important-0.071
Token,
Token ID11
Feature activation+3.25e+00

Pos logit contributions
paper+0.007
coat+0.007
pit+0.007
ro+0.006
ked+0.006
Neg logit contributions
body-0.011
igators-0.010
old-0.010
 much-0.010
 of-0.010
Token the
Token ID262
Feature activation-1.22e-01

Pos logit contributions
paper+0.070
pit+0.067
ro+0.065
ked+0.064
coat+0.063
Neg logit contributions
body-0.062
old-0.059
 much-0.056
 of-0.056
ug-0.055
Token ducks
Token ID39694
Feature activation-1.59e+00

Pos logit contributions
body+0.002
 of+0.002
 much+0.002
 important+0.002
igators+0.002
Neg logit contributions
paper-0.003
coat-0.003
ked-0.003
ro-0.003
pit-0.003
Token,
Token ID11
Feature activation+3.15e+00

Pos logit contributions
 of+0.002
body+0.002
 much+0.002
 important+0.001
old+0.001
Neg logit contributions
paper-0.002
pit-0.002
coat-0.002
ked-0.002
ro-0.002
Token okay
Token ID8788
Feature activation-1.10e+00

Pos logit contributions
paper+0.077
coat+0.073
pit+0.072
ro+0.071
ked+0.070
Neg logit contributions
body-0.058
 of-0.056
 much-0.054
 important-0.049
old-0.047
Token?"
Token ID1701
Feature activation+1.79e+00

Pos logit contributions
body+0.016
 of+0.015
 much+0.015
old+0.013
 important+0.013
Neg logit contributions
paper-0.030
coat-0.029
ro-0.029
pit-0.029
ked-0.028
Token mom
Token ID1995
Feature activation+6.31e-01

Pos logit contributions
paper+0.037
ro+0.036
coat+0.035
pit+0.035
ked+0.034
Neg logit contributions
body-0.038
 of-0.034
 much-0.034
igators-0.032
old-0.032
Token says
Token ID1139
Feature activation+4.18e+00

Pos logit contributions
paper+0.015
pit+0.014
ro+0.014
coat+0.014
ked+0.013
Neg logit contributions
body-0.012
 of-0.011
 much-0.011
 important-0.010
old-0.010
Token,
Token ID11
Feature activation+8.14e+00

Pos logit contributions
paper+0.104
pit+0.100
coat+0.100
ro+0.100
ked+0.093
Neg logit contributions
 of-0.072
body-0.068
 much-0.067
old-0.061
ug-0.057
Token hugging
Token ID46292
Feature activation-9.94e-01

Pos logit contributions
paper+0.227
ro+0.206
coat+0.204
suit+0.201
pit+0.200
Neg logit contributions
 of-0.121
 much-0.111
-0.097
body-0.096
 importantly-0.093
Token Anna
Token ID11735
Feature activation-2.40e-01

Pos logit contributions
 of+0.016
 much+0.014
body+0.012
old+0.012
 you+0.011
Neg logit contributions
coat-0.027
paper-0.027
pit-0.027
ro-0.025
thank-0.025
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
 of+0.004
body+0.004
 much+0.004
old+0.004
 important+0.003
Neg logit contributions
paper-0.006
coat-0.006
ro-0.006
pit-0.006
ked-0.005
Token
Token ID198
Feature activation+7.57e-01

Pos logit contributions
paper+0.046
pit+0.043
coat+0.042
ked+0.042
ro+0.040
Neg logit contributions
 of-0.024
 much-0.024
body-0.020
-0.019
 importantly-0.018
Token
Token ID198
Feature activation-7.56e-01

Pos logit contributions
paper+0.015
ro+0.015
coat+0.015
pit+0.015
ked+0.014
Neg logit contributions
body-0.016
 of-0.015
 much-0.015
old-0.014
 important-0.014
Token at
Token ID379
Feature activation+8.09e-02

Pos logit contributions
paper+0.083
pit+0.082
coat+0.081
ro+0.080
thank+0.074
Neg logit contributions
 of-0.059
 much-0.057
body-0.053
 important-0.049
old-0.047
Token Max
Token ID5436
Feature activation+1.26e+00

Pos logit contributions
paper+0.002
ro+0.002
coat+0.002
pit+0.002
ked+0.002
Neg logit contributions
body-0.002
 of-0.001
 much-0.001
old-0.001
 important-0.001
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
paper+0.030
pit+0.029
coat+0.029
ro+0.029
ked+0.027
Neg logit contributions
 of-0.023
 much-0.021
body-0.020
old-0.019
-0.019
Token They
Token ID1119
Feature activation+2.66e+00

Pos logit contributions
pit+0.049
paper+0.048
coat+0.046
ro+0.043
cast+0.043
Neg logit contributions
 much-0.022
 of-0.019
 Everyone-0.017
-0.015
 important-0.013
Token say
Token ID910
Feature activation+3.98e+00

Pos logit contributions
paper+0.063
pit+0.060
ro+0.060
coat+0.059
ked+0.057
Neg logit contributions
body-0.049
 of-0.047
 much-0.046
old-0.042
igators-0.041
Token,
Token ID11
Feature activation+8.14e+00

Pos logit contributions
pit+0.113
coat+0.111
ro+0.111
paper+0.110
thank+0.102
Neg logit contributions
 of-0.058
 much-0.057
body-0.053
 important-0.046
old-0.043
Token "
Token ID366
Feature activation+4.20e+00

Pos logit contributions
paper+0.221
pit+0.221
coat+0.209
ro+0.200
thank+0.198
Neg logit contributions
 of-0.128
 much-0.125
-0.107
body-0.096
 about-0.096
TokenStop
Token ID19485
Feature activation+3.38e-01

Pos logit contributions
pit+0.090
paper+0.089
coat+0.086
ro+0.084
aj+0.075
Neg logit contributions
body-0.089
 of-0.082
 much-0.079
-0.079
 you-0.073
Token!
Token ID0
Feature activation+5.14e-01

Pos logit contributions
paper+0.009
coat+0.008
ro+0.008
pit+0.008
ked+0.008
Neg logit contributions
body-0.006
 of-0.005
 much-0.005
old-0.005
 important-0.004
Token That
Token ID1320
Feature activation+1.67e+00

Pos logit contributions
paper+0.012
coat+0.012
pit+0.012
ro+0.012
ked+0.011
Neg logit contributions
body-0.010
 of-0.009
 much-0.009
old-0.008
igators-0.008
Token is
Token ID318
Feature activation-1.87e+00

Pos logit contributions
paper+0.034
coat+0.032
pit+0.031
ked+0.030
ro+0.029
Neg logit contributions
 importantly-0.034
body-0.032
old-0.032
ug-0.032
 of-0.030
Token saved
Token ID7448
Feature activation-2.54e+00

Pos logit contributions
paper+0.024
ro+0.023
coat+0.023
pit+0.023
ked+0.022
Neg logit contributions
body-0.024
 of-0.022
 much-0.022
old-0.020
 important-0.020
Token me
Token ID502
Feature activation+5.95e-02

Pos logit contributions
body+0.045
 of+0.041
 much+0.039
old+0.038
 importantly+0.037
Neg logit contributions
paper-0.063
ro-0.060
coat-0.058
pit-0.057
ked-0.056
Token."
Token ID526
Feature activation+6.64e-01

Pos logit contributions
paper+0.001
ro+0.001
coat+0.001
pit+0.001
ked+0.001
Neg logit contributions
body-0.001
 importantly-0.001
old-0.001
igators-0.001
ime-0.001
Token She
Token ID1375
Feature activation+4.23e+00

Pos logit contributions
paper+0.015
pit+0.015
coat+0.015
ked+0.014
ro+0.014
Neg logit contributions
 of-0.013
 much-0.013
body-0.012
old-0.011
-0.010
Token says
Token ID1139
Feature activation+4.04e+00

Pos logit contributions
paper+0.090
coat+0.086
ro+0.085
pit+0.083
ked+0.079
Neg logit contributions
 of-0.088
body-0.085
 much-0.082
old-0.077
-0.076
Token,
Token ID11
Feature activation+8.12e+00

Pos logit contributions
paper+0.098
pit+0.097
coat+0.096
ro+0.096
ked+0.089
Neg logit contributions
 of-0.071
body-0.069
 much-0.068
old-0.060
 important-0.059
Token "
Token ID366
Feature activation+4.55e+00

Pos logit contributions
paper+0.231
pit+0.220
coat+0.214
ro+0.207
suit+0.203
Neg logit contributions
 of-0.121
 much-0.116
 importantly-0.092
 about-0.091
-0.083
TokenI
Token ID40
Feature activation+3.26e+00

Pos logit contributions
pit+0.116
paper+0.115
coat+0.108
ked+0.106
ro+0.105
Neg logit contributions
body-0.080
 much-0.072
 of-0.072
-0.065
 importantly-0.063
Token'm
Token ID1101
Feature activation-1.10e+00

Pos logit contributions
pit+0.084
paper+0.081
ked+0.075
ro+0.074
coat+0.073
Neg logit contributions
 much-0.063
 of-0.062
-0.057
 important-0.055
body-0.055
Token sorry
Token ID7926
Feature activation+2.41e-01

Pos logit contributions
body+0.022
igators+0.018
old+0.018
idy+0.018
 obstacle+0.018
Neg logit contributions
coat-0.024
paper-0.023
ked-0.022
pit-0.022
ro-0.021
Token,
Token ID11
Feature activation+3.02e+00

Pos logit contributions
paper+0.005
ro+0.004
coat+0.004
pit+0.004
ked+0.004
Neg logit contributions
body-0.005
igators-0.005
old-0.005
 much-0.005
 of-0.005
Token,
Token ID11
Feature activation+1.99e+00

Pos logit contributions
body+0.015
 of+0.014
 much+0.014
old+0.012
 important+0.012
Neg logit contributions
paper-0.022
ro-0.021
coat-0.021
pit-0.021
ked-0.020
Token Anna
Token ID11735
Feature activation-2.52e+00

Pos logit contributions
paper+0.038
pit+0.037
ro+0.037
coat+0.036
ked+0.034
Neg logit contributions
body-0.045
 much-0.040
 of-0.040
old-0.040
ug-0.039
Token?"
Token ID1701
Feature activation+2.25e+00

Pos logit contributions
body+0.035
 of+0.033
 much+0.032
old+0.028
 important+0.028
Neg logit contributions
paper-0.071
ro-0.069
coat-0.068
pit-0.068
ked-0.066
Token he
Token ID339
Feature activation+3.89e+00

Pos logit contributions
paper+0.048
coat+0.046
ro+0.046
pit+0.046
ked+0.044
Neg logit contributions
body-0.047
 of-0.045
 much-0.044
old-0.041
 important-0.040
Token said
Token ID531
Feature activation+3.83e+00

Pos logit contributions
paper+0.085
coat+0.081
pit+0.081
ro+0.081
ked+0.076
Neg logit contributions
body-0.078
 of-0.075
 much-0.073
old-0.067
 important-0.067
Token,
Token ID11
Feature activation+8.12e+00

Pos logit contributions
paper+0.091
ro+0.087
coat+0.087
pit+0.086
ked+0.083
Neg logit contributions
body-0.070
 of-0.067
 much-0.065
old-0.060
 important-0.058
Token wiping
Token ID36906
Feature activation+1.10e+00

Pos logit contributions
paper+0.202
ro+0.189
coat+0.186
pit+0.183
suit+0.171
Neg logit contributions
 of-0.141
 much-0.134
body-0.123
-0.115
 importantly-0.112
Token her
Token ID607
Feature activation+3.31e+00

Pos logit contributions
paper+0.027
coat+0.026
pit+0.026
ro+0.025
ked+0.025
Neg logit contributions
 of-0.019
body-0.019
 much-0.018
old-0.016
 important-0.016
Token tears
Token ID10953
Feature activation-2.33e-01

Pos logit contributions
paper+0.075
coat+0.071
pit+0.070
ro+0.069
ked+0.067
Neg logit contributions
body-0.064
 of-0.063
 much-0.061
 important-0.055
old-0.054
Token.
Token ID13
Feature activation+2.06e+00

Pos logit contributions
body+0.004
 of+0.004
 much+0.004
old+0.004
 important+0.004
Neg logit contributions
paper-0.005
ro-0.005
coat-0.005
pit-0.005
ked-0.005
Token "
Token ID366
Feature activation+4.52e+00

Pos logit contributions
paper+0.059
coat+0.057
pit+0.055
ked+0.053
thank+0.053
Neg logit contributions
 of-0.035
 much-0.034
 Everyone-0.025
 about-0.024
 importantly-0.024
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
paper+0.008
ked+0.007
coat+0.007
pit+0.007
ro+0.007
Neg logit contributions
 of-0.005
body-0.005
 much-0.005
 important-0.004
old-0.004
Token kids
Token ID3988
Feature activation-3.19e+00

Pos logit contributions
paper+0.031
coat+0.029
ro+0.029
pit+0.028
ked+0.028
Neg logit contributions
body-0.025
 of-0.024
 much-0.023
 important-0.021
old-0.021
Token,"
Token ID553
Feature activation+2.63e+00

Pos logit contributions
 of+0.051
body+0.050
 much+0.047
old+0.043
ug+0.040
Neg logit contributions
paper-0.083
pit-0.080
ro-0.079
coat-0.078
ked-0.078
Token mom
Token ID1995
Feature activation+7.02e-01

Pos logit contributions
paper+0.054
coat+0.051
ro+0.051
pit+0.051
ked+0.048
Neg logit contributions
body-0.057
 of-0.055
 much-0.053
old-0.050
 important-0.049
Token said
Token ID531
Feature activation+3.91e+00

Pos logit contributions
paper+0.015
ro+0.014
coat+0.014
pit+0.014
ked+0.013
Neg logit contributions
body-0.015
 of-0.014
 much-0.014
old-0.013
 important-0.013
Token,
Token ID11
Feature activation+8.09e+00

Pos logit contributions
paper+0.093
ro+0.089
pit+0.089
coat+0.088
ked+0.084
Neg logit contributions
body-0.070
 of-0.069
 much-0.066
old-0.061
ug-0.059
Token after
Token ID706
Feature activation-9.34e-01

Pos logit contributions
paper+0.217
ro+0.197
coat+0.189
suit+0.183
pit+0.183
Neg logit contributions
 of-0.127
 much-0.122
body-0.105
-0.105
 importantly-0.103
Token they
Token ID484
Feature activation+1.60e+00

Pos logit contributions
body+0.015
 of+0.015
 much+0.014
old+0.013
 important+0.012
Neg logit contributions
paper-0.024
coat-0.023
ro-0.023
pit-0.023
ked-0.022
Token said
Token ID531
Feature activation+2.25e+00

Pos logit contributions
paper+0.033
pit+0.031
ro+0.031
coat+0.030
ked+0.028
Neg logit contributions
 of-0.035
body-0.033
 much-0.033
old-0.030
-0.029
Token goodbye
Token ID24829
Feature activation+1.43e+00

Pos logit contributions
paper+0.055
ro+0.052
coat+0.052
pit+0.052
ked+0.049
Neg logit contributions
body-0.040
 of-0.040
 much-0.038
old-0.034
 important-0.034
Token to
Token ID284
Feature activation+8.60e-01

Pos logit contributions
pit+0.041
paper+0.040
coat+0.039
ro+0.037
 extract+0.036
Neg logit contributions
 of-0.021
 much-0.020
 important-0.015
-0.014
old-0.014
Token hug
Token ID16225
Feature activation+4.75e-01

Pos logit contributions
paper+0.045
coat+0.043
ro+0.043
pit+0.043
ked+0.041
Neg logit contributions
body-0.046
 of-0.045
 much-0.044
old-0.041
igators-0.040
Token Sam
Token ID3409
Feature activation-1.56e-01

Pos logit contributions
paper+0.012
coat+0.012
pit+0.012
ro+0.011
ked+0.011
Neg logit contributions
 of-0.008
body-0.008
 much-0.008
 important-0.007
old-0.006
Token.
Token ID13
Feature activation+4.57e-01

Pos logit contributions
body+0.003
 of+0.003
 much+0.003
old+0.002
 important+0.002
Neg logit contributions
paper-0.004
coat-0.004
ro-0.003
pit-0.003
ked-0.003
Token They
Token ID1119
Feature activation+3.08e+00

Pos logit contributions
paper+0.012
coat+0.012
pit+0.011
ro+0.011
ked+0.011
Neg logit contributions
 much-0.008
 of-0.007
body-0.007
old-0.006
 important-0.006
Token say
Token ID910
Feature activation+3.75e+00

Pos logit contributions
paper+0.079
pit+0.075
ro+0.075
ked+0.074
coat+0.074
Neg logit contributions
 of-0.050
 much-0.049
body-0.047
-0.045
old-0.043
Token,
Token ID11
Feature activation+8.08e+00

Pos logit contributions
paper+0.090
coat+0.088
ro+0.088
pit+0.088
ked+0.083
Neg logit contributions
body-0.067
 of-0.065
 much-0.064
old-0.057
 important-0.056
Token "
Token ID366
Feature activation+3.92e+00

Pos logit contributions
paper+0.208
coat+0.204
pit+0.203
ro+0.190
suit+0.179
Neg logit contributions
 of-0.140
 much-0.138
-0.114
 importantly-0.108
 Everyone-0.107
TokenThank
Token ID10449
Feature activation-1.56e+00

Pos logit contributions
paper+0.088
pit+0.085
coat+0.084
ro+0.084
ked+0.080
Neg logit contributions
body-0.078
 of-0.074
 much-0.073
old-0.065
-0.065
Token you
Token ID345
Feature activation+2.19e+00

Pos logit contributions
body+0.021
 important+0.020
 earthquake+0.020
old+0.020
igators+0.020
Neg logit contributions
paper-0.038
ked-0.036
balls-0.035
pit-0.034
-0.034
Token,
Token ID11
Feature activation+3.13e+00

Pos logit contributions
paper+0.035
pit+0.035
coat+0.032
ro+0.031
ked+0.031
Neg logit contributions
body-0.057
old-0.050
igators-0.049
ime-0.047
idy-0.047
Token Sam
Token ID3409
Feature activation-1.63e+00

Pos logit contributions
paper+0.058
pit+0.058
ro+0.054
Or+0.050
coat+0.050
Neg logit contributions
body-0.071
old-0.061
idy-0.060
ime-0.060
 much-0.059
Token."
Token ID526
Feature activation+1.94e+00

Pos logit contributions
paper+0.006
pit+0.006
coat+0.006
ro+0.006
ked+0.006
Neg logit contributions
 of-0.004
body-0.004
 much-0.004
old-0.004
 important-0.004
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
paper+0.050
pit+0.048
coat+0.047
ked+0.045
ro+0.044
Neg logit contributions
 of-0.034
 much-0.033
body-0.032
old-0.028
 important-0.027
Token
Token ID198
Feature activation+5.90e-02

Pos logit contributions
paper+0.026
ro+0.025
coat+0.025
pit+0.025
ked+0.023
Neg logit contributions
body-0.027
 of-0.025
 much-0.025
old-0.023
 important-0.023
TokenMom
Token ID29252
Feature activation+1.33e+00

Pos logit contributions
paper+0.001
ro+0.001
coat+0.001
pit+0.001
ked+0.001
Neg logit contributions
body-0.001
 of-0.001
 much-0.001
old-0.001
igators-0.001
Token said
Token ID531
Feature activation+3.99e+00

Pos logit contributions
paper+0.030
ro+0.029
pit+0.028
coat+0.028
thank+0.026
Neg logit contributions
 of-0.029
old-0.025
 much-0.025
body-0.025
-0.024
Token,
Token ID11
Feature activation+8.07e+00

Pos logit contributions
paper+0.100
ro+0.098
pit+0.098
coat+0.097
ked+0.091
Neg logit contributions
 of-0.067
body-0.067
 much-0.065
old-0.057
 important-0.055
Token "
Token ID366
Feature activation+4.10e+00

Pos logit contributions
paper+0.234
pit+0.230
ro+0.228
coat+0.221
cast+0.220
Neg logit contributions
 of-0.119
 much-0.110
 importantly-0.086
-0.085
 important-0.082
TokenPlaying
Token ID36530
Feature activation+1.25e+00

Pos logit contributions
paper+0.088
coat+0.084
pit+0.084
ro+0.082
ked+0.080
Neg logit contributions
body-0.088
 of-0.082
 much-0.081
 important-0.074
old-0.074
Token is
Token ID318
Feature activation-4.75e-01

Pos logit contributions
paper+0.028
ro+0.027
coat+0.027
pit+0.027
ked+0.025
Neg logit contributions
body-0.024
 of-0.022
 much-0.022
old-0.021
igators-0.020
Token good
Token ID922
Feature activation+7.60e-01

Pos logit contributions
body+0.008
 of+0.008
old+0.007
ug+0.007
 much+0.007
Neg logit contributions
paper-0.011
coat-0.011
ro-0.011
ked-0.010
pit-0.010
Token,
Token ID11
Feature activation+2.59e+00

Pos logit contributions
paper+0.017
ro+0.017
coat+0.017
pit+0.016
ked+0.016
Neg logit contributions
body-0.015
 of-0.014
 much-0.014
old-0.013
 important-0.012
Token and
Token ID290
Feature activation+2.24e+00

Pos logit contributions
paper+0.060
coat+0.057
pit+0.057
ro+0.054
mons+0.050
Neg logit contributions
 of-0.039
 much-0.037
-0.031
 important-0.030
 about-0.028
Token hug
Token ID16225
Feature activation+1.16e+00

Pos logit contributions
paper+0.050
coat+0.048
ro+0.048
pit+0.048
ked+0.045
Neg logit contributions
body-0.043
 of-0.043
 much-0.041
old-0.039
igators-0.037
Token.
Token ID13
Feature activation+6.87e-01

Pos logit contributions
paper+0.027
coat+0.026
ro+0.026
pit+0.026
ked+0.024
Neg logit contributions
body-0.022
 of-0.021
 much-0.021
old-0.019
 important-0.019
Token They
Token ID1119
Feature activation+2.55e+00

Pos logit contributions
paper+0.016
coat+0.015
pit+0.015
ro+0.015
ked+0.014
Neg logit contributions
 of-0.013
body-0.013
 much-0.013
old-0.011
 important-0.011
Token say
Token ID910
Feature activation+3.99e+00

Pos logit contributions
paper+0.065
ro+0.063
pit+0.063
coat+0.062
ked+0.060
Neg logit contributions
 of-0.041
body-0.040
 much-0.040
-0.037
old-0.037
Token,
Token ID11
Feature activation+8.05e+00

Pos logit contributions
paper+0.094
ro+0.091
coat+0.090
pit+0.090
ked+0.085
Neg logit contributions
body-0.074
 of-0.071
 much-0.069
old-0.063
 important-0.062
Token "
Token ID366
Feature activation+4.14e+00

Pos logit contributions
paper+0.193
pit+0.190
coat+0.185
ro+0.175
suit+0.161
Neg logit contributions
 of-0.154
 much-0.153
-0.133
body-0.127
 importantly-0.126
TokenYes
Token ID5297
Feature activation+1.10e+00

Pos logit contributions
paper+0.087
pit+0.086
ro+0.082
coat+0.081
ked+0.077
Neg logit contributions
body-0.089
 of-0.085
 much-0.083
-0.075
 important-0.075
Token,
Token ID11
Feature activation+3.34e+00

Pos logit contributions
paper+0.017
pit+0.015
ked+0.015
coat+0.015
ro+0.015
Neg logit contributions
body-0.028
igators-0.026
old-0.026
 sorts-0.025
 importantly-0.024
Token we
Token ID356
Feature activation+2.96e+00

Pos logit contributions
paper+0.071
ro+0.063
Or+0.063
ked+0.063
coat+0.063
Neg logit contributions
body-0.064
old-0.060
ime-0.057
igators-0.056
ug-0.055
Token can
Token ID460
Feature activation+2.49e+00

Pos logit contributions
ro+0.068
paper+0.067
pit+0.066
coat+0.063
ked+0.061
Neg logit contributions
 of-0.058
 much-0.058
body-0.056
 important-0.052
-0.052
Token to
Token ID284
Feature activation+2.17e+00

Pos logit contributions
paper+0.071
pit+0.070
coat+0.064
ro+0.064
ked+0.061
Neg logit contributions
 of-0.068
 much-0.065
body-0.061
old-0.057
 important-0.055
Token the
Token ID262
Feature activation+6.87e-03

Pos logit contributions
paper+0.051
pit+0.048
coat+0.048
ro+0.048
ked+0.046
Neg logit contributions
body-0.041
 of-0.039
 much-0.038
old-0.035
igators-0.034
Token boy
Token ID2933
Feature activation+5.91e-01

Pos logit contributions
paper+0.000
ked+0.000
ro+0.000
coat+0.000
pit+0.000
Neg logit contributions
body-0.000
 of-0.000
 much-0.000
 important-0.000
old-0.000
Token and
Token ID290
Feature activation+3.52e+00

Pos logit contributions
paper+0.010
ro+0.009
coat+0.009
pit+0.009
ked+0.009
Neg logit contributions
body-0.015
 much-0.014
 of-0.014
 important-0.013
igators-0.013
Token shouted
Token ID17293
Feature activation+3.14e+00

Pos logit contributions
paper+0.071
pit+0.069
ro+0.068
coat+0.067
ked+0.060
Neg logit contributions
body-0.076
 of-0.075
 much-0.072
old-0.069
-0.066
Token,
Token ID11
Feature activation+8.04e+00

Pos logit contributions
paper+0.076
pit+0.073
coat+0.073
ro+0.073
ked+0.068
Neg logit contributions
body-0.056
 of-0.054
 much-0.053
old-0.048
 important-0.047
Token "
Token ID366
Feature activation+3.87e+00

Pos logit contributions
paper+0.199
pit+0.195
ro+0.184
coat+0.183
thank+0.171
Neg logit contributions
 of-0.141
 much-0.141
-0.132
body-0.119
 important-0.115
TokenThat
Token ID2504
Feature activation+1.48e+00

Pos logit contributions
paper+0.088
pit+0.087
ro+0.084
coat+0.082
cast+0.075
Neg logit contributions
body-0.080
-0.072
 you-0.069
 of-0.067
 much-0.063
Token's
Token ID338
Feature activation-8.39e-01

Pos logit contributions
paper+0.033
coat+0.032
ro+0.032
pit+0.032
ked+0.029
Neg logit contributions
body-0.029
 of-0.026
 much-0.026
old-0.026
ug-0.025
Token my
Token ID616
Feature activation+3.56e+00

Pos logit contributions
old+0.017
body+0.017
 of+0.016
ug+0.016
igators+0.015
Neg logit contributions
paper-0.016
coat-0.016
ked-0.015
pit-0.015
ro-0.014
Token rabbit
Token ID22746
Feature activation-7.27e-01

Pos logit contributions
paper+0.071
pit+0.068
ked+0.064
ro+0.064
Or+0.062
Neg logit contributions
body-0.076
 of-0.076
 much-0.075
 important-0.072
old-0.065
Token my
Token ID616
Feature activation+1.91e+00

Pos logit contributions
body+0.036
old+0.034
 of+0.032
ug+0.031
ime+0.029
Neg logit contributions
paper-0.035
coat-0.032
pit-0.031
ro-0.031
ked-0.031
Token fault
Token ID8046
Feature activation+3.26e-01

Pos logit contributions
paper+0.037
coat+0.035
pit+0.035
ro+0.035
ked+0.033
Neg logit contributions
body-0.044
 of-0.040
 much-0.039
old-0.038
igators-0.038
Token,"
Token ID553
Feature activation+3.24e+00

Pos logit contributions
paper+0.006
pit+0.006
coat+0.006
ro+0.006
ked+0.006
Neg logit contributions
body-0.007
old-0.007
igators-0.006
 much-0.006
 unexpected-0.006
Token Mia
Token ID32189
Feature activation-5.92e-03

Pos logit contributions
paper+0.079
coat+0.076
ro+0.075
pit+0.075
ked+0.072
Neg logit contributions
body-0.059
 of-0.056
 much-0.054
old-0.049
 important-0.048
Token said
Token ID531
Feature activation+4.43e+00

Pos logit contributions
body+0.000
 of+0.000
 much+0.000
old+0.000
 important+0.000
Neg logit contributions
paper-0.000
ro-0.000
coat-0.000
pit-0.000
ked-0.000
Token,
Token ID11
Feature activation+8.03e+00

Pos logit contributions
paper+0.103
ro+0.099
coat+0.098
pit+0.098
ked+0.093
Neg logit contributions
body-0.083
 of-0.079
 much-0.077
old-0.071
 important-0.069
Token as
Token ID355
Feature activation+1.96e-01

Pos logit contributions
paper+0.204
ro+0.190
pit+0.187
coat+0.186
suit+0.174
Neg logit contributions
 of-0.135
 much-0.131
body-0.115
-0.114
 important-0.107
Token she
Token ID673
Feature activation+3.66e+00

Pos logit contributions
paper+0.004
ro+0.004
coat+0.004
pit+0.003
ked+0.003
Neg logit contributions
body-0.005
 of-0.004
ug-0.004
old-0.004
igators-0.004
Token hugged
Token ID41130
Feature activation+7.60e-01

Pos logit contributions
paper+0.076
pit+0.075
coat+0.072
ro+0.071
thank+0.065
Neg logit contributions
 of-0.078
 much-0.073
body-0.070
-0.068
old-0.065
Token him
Token ID683
Feature activation+3.19e+00

Pos logit contributions
paper+0.018
coat+0.017
pit+0.017
ro+0.017
ked+0.016
Neg logit contributions
 of-0.014
body-0.013
 much-0.013
old-0.012
ug-0.011
Token.
Token ID13
Feature activation+2.51e+00

Pos logit contributions
paper+0.070
ro+0.067
coat+0.067
pit+0.066
ked+0.064
Neg logit contributions
body-0.066
 of-0.060
 much-0.059
old-0.055
igators-0.054
Token do
Token ID466
Feature activation+1.17e+00

Pos logit contributions
paper+0.055
ro+0.052
coat+0.052
pit+0.051
ked+0.049
Neg logit contributions
body-0.055
 of-0.051
 much-0.050
old-0.047
igators-0.046
Token that
Token ID326
Feature activation+1.11e+00

Pos logit contributions
paper+0.031
ro+0.029
coat+0.029
pit+0.028
ked+0.028
Neg logit contributions
body-0.019
 of-0.017
 much-0.016
old-0.016
igators-0.015
Token!"
Token ID2474
Feature activation+2.38e+00

Pos logit contributions
paper+0.025
ro+0.024
coat+0.024
pit+0.024
ked+0.022
Neg logit contributions
body-0.022
 of-0.020
 much-0.020
old-0.019
igators-0.018
Token she
Token ID673
Feature activation+4.73e+00

Pos logit contributions
paper+0.055
pit+0.053
coat+0.053
ro+0.052
ked+0.050
Neg logit contributions
body-0.046
 of-0.045
 much-0.044
old-0.039
 important-0.038
Token said
Token ID531
Feature activation+4.04e+00

Pos logit contributions
paper+0.107
pit+0.104
ro+0.101
coat+0.100
ked+0.092
Neg logit contributions
 of-0.091
body-0.091
 much-0.087
old-0.080
 important-0.079
Token,
Token ID11
Feature activation+8.03e+00

Pos logit contributions
paper+0.094
ro+0.090
coat+0.090
pit+0.089
ked+0.085
Neg logit contributions
body-0.076
 of-0.072
 much-0.070
old-0.065
 important-0.063
Token crying
Token ID13774
Feature activation+3.32e+00

Pos logit contributions
paper+0.210
ro+0.195
pit+0.191
coat+0.191
suit+0.180
Neg logit contributions
 of-0.131
 much-0.122
body-0.114
-0.104
 importantly-0.102
Token.
Token ID13
Feature activation+2.17e+00

Pos logit contributions
coat+0.106
pit+0.103
paper+0.102
ro+0.099
 extract+0.096
Neg logit contributions
 of-0.042
 much-0.036
 important-0.026
 about-0.023
ug-0.022
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
paper+0.065
pit+0.064
coat+0.062
ro+0.059
ked+0.057
Neg logit contributions
 much-0.030
 of-0.030
body-0.024
-0.023
 importantly-0.021
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
paper+0.024
coat+0.023
pit+0.023
ro+0.023
ked+0.022
Neg logit contributions
body-0.023
 of-0.022
 much-0.022
old-0.020
 important-0.019
TokenMom
Token ID29252
Feature activation+1.13e+00

Pos logit contributions
body+0.003
 of+0.003
 much+0.003
old+0.003
+0.003
Neg logit contributions
paper-0.002
coat-0.002
pit-0.002
ro-0.002
ked-0.002
Token scissors
Token ID41786
Feature activation+3.78e-01

Pos logit contributions
paper+0.022
coat+0.021
pit+0.021
ro+0.020
ked+0.020
Neg logit contributions
 of-0.021
body-0.021
 much-0.020
 important-0.018
-0.018
Token?"
Token ID1701
Feature activation+1.57e+00

Pos logit contributions
paper+0.011
pit+0.011
coat+0.011
ro+0.010
ked+0.010
Neg logit contributions
 of-0.005
 much-0.004
body-0.004
 you-0.004
 important-0.004
Token Mom
Token ID11254
Feature activation+2.90e-01

Pos logit contributions
paper+0.043
coat+0.041
ked+0.041
pit+0.041
ro+0.039
Neg logit contributions
 of-0.026
 much-0.025
body-0.023
old-0.021
-0.020
Tokenmy
Token ID1820
Feature activation+1.56e+00

Pos logit contributions
paper+0.007
ro+0.007
pit+0.007
coat+0.007
cast+0.006
Neg logit contributions
 of-0.006
old-0.005
 much-0.005
body-0.004
-0.004
Token said
Token ID531
Feature activation+3.89e+00

Pos logit contributions
paper+0.039
ro+0.039
pit+0.039
coat+0.037
cast+0.035
Neg logit contributions
 of-0.029
body-0.025
old-0.024
 much-0.024
-0.023
Token,
Token ID11
Feature activation+8.02e+00

Pos logit contributions
paper+0.095
pit+0.093
ro+0.093
coat+0.092
ked+0.086
Neg logit contributions
 of-0.067
body-0.066
 much-0.064
old-0.057
 important-0.056
Token "
Token ID366
Feature activation+3.17e+00

Pos logit contributions
paper+0.220
pit+0.220
cast+0.208
ro+0.201
 pockets+0.197
Neg logit contributions
 of-0.124
 much-0.117
-0.101
 importantly-0.092
 you-0.091
TokenYes
Token ID5297
Feature activation+6.02e-01

Pos logit contributions
paper+0.063
pit+0.062
coat+0.060
ro+0.060
ked+0.057
Neg logit contributions
body-0.071
 of-0.070
 much-0.067
old-0.062
 important-0.061
Token,
Token ID11
Feature activation+3.13e+00

Pos logit contributions
paper+0.011
coat+0.011
pit+0.011
ro+0.010
ked+0.010
Neg logit contributions
body-0.014
igators-0.012
old-0.012
 much-0.012
 of-0.012
Token you
Token ID345
Feature activation+2.19e+00

Pos logit contributions
paper+0.082
coat+0.078
pit+0.077
ro+0.077
ked+0.074
Neg logit contributions
body-0.048
 much-0.041
ug-0.041
igators-0.040
old-0.040
Token can
Token ID460
Feature activation+2.12e+00

Pos logit contributions
paper+0.042
pit+0.041
ro+0.041
coat+0.041
ked+0.038
Neg logit contributions
body-0.050
 of-0.050
 much-0.049
old-0.045
 important-0.044
Token the
Token ID262
Feature activation+1.97e+00

Pos logit contributions
paper+0.033
coat+0.032
ro+0.032
pit+0.032
ked+0.031
Neg logit contributions
body-0.016
 of-0.016
 much-0.015
 important-0.013
igators-0.013
Token track
Token ID2610
Feature activation+1.41e+00

Pos logit contributions
paper+0.044
ro+0.042
coat+0.042
pit+0.042
ked+0.040
Neg logit contributions
body-0.039
 of-0.037
 much-0.036
old-0.034
igators-0.033
Token,"
Token ID553
Feature activation+3.60e+00

Pos logit contributions
paper+0.032
ro+0.031
coat+0.031
pit+0.031
ked+0.029
Neg logit contributions
body-0.027
 of-0.024
 much-0.024
old-0.023
 important-0.022
Token Mia
Token ID32189
Feature activation+9.48e-02

Pos logit contributions
paper+0.092
coat+0.088
ro+0.087
pit+0.087
ked+0.083
Neg logit contributions
body-0.062
 of-0.062
 much-0.059
old-0.052
 important-0.051
Token said
Token ID531
Feature activation+4.29e+00

Pos logit contributions
paper+0.002
coat+0.002
ro+0.002
pit+0.002
ked+0.002
Neg logit contributions
body-0.002
 of-0.002
 much-0.002
old-0.002
 important-0.002
Token,
Token ID11
Feature activation+7.99e+00

Pos logit contributions
paper+0.100
ro+0.095
coat+0.095
pit+0.095
ked+0.092
Neg logit contributions
body-0.081
 of-0.075
 much-0.074
old-0.068
igators-0.067
Token pointing
Token ID10609
Feature activation+2.02e+00

Pos logit contributions
paper+0.207
ro+0.192
coat+0.188
pit+0.188
suit+0.177
Neg logit contributions
 of-0.133
 much-0.127
body-0.120
-0.109
 important-0.106
Token at
Token ID379
Feature activation-6.40e-01

Pos logit contributions
coat+0.058
ro+0.057
pit+0.056
paper+0.056
thank+0.053
Neg logit contributions
 of-0.030
 much-0.028
-0.022
old-0.022
 important-0.022
Token him
Token ID683
Feature activation+1.67e+00

Pos logit contributions
body+0.011
 of+0.010
 much+0.010
old+0.009
 important+0.009
Neg logit contributions
paper-0.016
ro-0.015
pit-0.015
coat-0.015
ked-0.014
Token.
Token ID13
Feature activation+2.43e+00

Pos logit contributions
paper+0.042
coat+0.040
ro+0.040
pit+0.040
ked+0.038
Neg logit contributions
body-0.028
 of-0.028
 much-0.027
old-0.024
 important-0.023
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
paper+0.068
pit+0.066
coat+0.066
ro+0.062
ked+0.061
Neg logit contributions
 of-0.036
 much-0.035
body-0.029
-0.029
 Everyone-0.027
Token a
Token ID257
Feature activation+2.81e+00

Pos logit contributions
body+0.028
 of+0.025
 much+0.025
old+0.023
 important+0.022
Neg logit contributions
paper-0.046
ro-0.044
coat-0.044
pit-0.044
ked-0.042
Token shelf
Token ID18316
Feature activation+5.47e-01

Pos logit contributions
paper+0.051
ro+0.048
coat+0.048
pit+0.048
ked+0.044
Neg logit contributions
body-0.067
 of-0.063
 much-0.062
old-0.059
ug-0.058
Token.
Token ID13
Feature activation+7.65e-01

Pos logit contributions
paper+0.013
coat+0.013
ro+0.012
pit+0.012
ked+0.012
Neg logit contributions
 of-0.010
body-0.009
 much-0.009
old-0.008
 important-0.008
Token She
Token ID1375
Feature activation+3.03e+00

Pos logit contributions
paper+0.020
coat+0.019
pit+0.018
ked+0.018
ro+0.018
Neg logit contributions
 much-0.013
 of-0.013
body-0.012
old-0.011
 important-0.011
Token whispered
Token ID25029
Feature activation+2.48e+00

Pos logit contributions
paper+0.070
pit+0.066
ro+0.066
coat+0.065
ked+0.061
Neg logit contributions
body-0.058
 of-0.058
 much-0.056
old-0.052
 important-0.051
Token,
Token ID11
Feature activation+7.99e+00

Pos logit contributions
ro+0.061
coat+0.061
pit+0.061
paper+0.061
thank+0.058
Neg logit contributions
 of-0.041
 much-0.040
body-0.037
 important-0.034
ug-0.033
Token "
Token ID366
Feature activation+3.50e+00

Pos logit contributions
paper+0.207
pit+0.198
coat+0.194
ro+0.183
 sandbox+0.172
Neg logit contributions
 much-0.137
 of-0.137
-0.122
body-0.119
 important-0.119
TokenDon
Token ID3987
Feature activation+2.72e-01

Pos logit contributions
paper+0.077
pit+0.074
coat+0.071
ked+0.067
ro+0.065
Neg logit contributions
body-0.077
 of-0.070
 much-0.066
-0.064
old-0.063
Token't
Token ID470
Feature activation-1.53e-01

Pos logit contributions
paper+0.004
ro+0.004
coat+0.004
ked+0.004
balls+0.004
Neg logit contributions
old-0.006
 importantly-0.006
body-0.006
 sorts-0.006
weet-0.006
Token worry
Token ID5490
Feature activation+3.14e-01

Pos logit contributions
body+0.003
 of+0.003
 much+0.003
old+0.003
igators+0.003
Neg logit contributions
paper-0.003
ro-0.003
coat-0.003
pit-0.003
ked-0.003
Token,
Token ID11
Feature activation+2.02e+00

Pos logit contributions
paper+0.005
ro+0.005
pit+0.005
coat+0.005
ked+0.004
Neg logit contributions
body-0.008
old-0.007
 much-0.007
igators-0.007
 importantly-0.007
Token pushed
Token ID7121
Feature activation+5.64e-01

Pos logit contributions
paper+0.084
pit+0.082
ro+0.080
coat+0.079
ked+0.072
Neg logit contributions
 of-0.079
body-0.078
 much-0.076
old-0.072
 important-0.068
Token him
Token ID683
Feature activation+3.13e+00

Pos logit contributions
paper+0.011
ro+0.011
coat+0.010
pit+0.010
ked+0.010
Neg logit contributions
body-0.013
 of-0.012
 much-0.011
igators-0.011
old-0.011
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
paper+0.071
coat+0.068
ro+0.067
pit+0.067
ked+0.064
Neg logit contributions
body-0.061
 of-0.058
 much-0.056
old-0.052
 important-0.051
Token She
Token ID1375
Feature activation+2.88e+00

Pos logit contributions
pit+0.039
paper+0.036
coat+0.035
thank+0.035
ro+0.034
Neg logit contributions
 much-0.017
 of-0.017
 Everyone-0.013
-0.013
body-0.011
Token yelled
Token ID22760
Feature activation+2.87e+00

Pos logit contributions
paper+0.064
pit+0.063
ro+0.059
coat+0.059
ked+0.055
Neg logit contributions
 of-0.056
body-0.055
 much-0.054
old-0.050
 important-0.048
Token,
Token ID11
Feature activation+7.98e+00

Pos logit contributions
pit+0.079
paper+0.078
coat+0.078
ro+0.077
thank+0.074
Neg logit contributions
 of-0.043
 much-0.042
body-0.036
ug-0.033
 important-0.033
Token "
Token ID366
Feature activation+4.19e+00

Pos logit contributions
pit+0.209
paper+0.208
coat+0.193
thank+0.192
ro+0.192
Neg logit contributions
 of-0.129
 much-0.126
-0.105
body-0.104
 important-0.102
TokenNo
Token ID2949
Feature activation+1.11e+00

Pos logit contributions
paper+0.082
pit+0.081
coat+0.077
ro+0.077
ked+0.072
Neg logit contributions
body-0.097
 of-0.091
 much-0.089
-0.084
old-0.082
Token!
Token ID0
Feature activation+3.32e-02

Pos logit contributions
paper+0.025
coat+0.024
ro+0.024
pit+0.024
ked+0.023
Neg logit contributions
body-0.022
 of-0.020
 much-0.020
old-0.019
igators-0.018
Token Those
Token ID5845
Feature activation+9.38e-02

Pos logit contributions
paper+0.001
ro+0.001
coat+0.001
pit+0.001
ked+0.001
Neg logit contributions
body-0.001
 of-0.001
 much-0.001
old-0.001
 important-0.001
Token are
Token ID389
Feature activation-2.35e+00

Pos logit contributions
paper+0.002
coat+0.002
ro+0.002
pit+0.002
ked+0.002
Neg logit contributions
body-0.002
 of-0.002
 much-0.002
old-0.002
ug-0.002
Token L
Token ID406
Feature activation-4.07e+00

Pos logit contributions
paper+0.078
ro+0.075
pit+0.074
coat+0.074
ked+0.072
Neg logit contributions
body-0.044
 of-0.040
 much-0.040
old-0.037
 important-0.035
Tokenila
Token ID10102
Feature activation-2.48e+00

Pos logit contributions
body+0.082
 much+0.080
 of+0.075
 important+0.075
igators+0.074
Neg logit contributions
paper-0.087
ro-0.083
pit-0.083
coat-0.082
ked-0.076
Token!"
Token ID2474
Feature activation+2.90e+00

Pos logit contributions
body+0.050
 of+0.046
 much+0.046
old+0.043
igators+0.043
Neg logit contributions
paper-0.054
ro-0.051
coat-0.051
pit-0.051
ked-0.049
Token Tom
Token ID4186
Feature activation+7.41e-01

Pos logit contributions
paper+0.058
ro+0.056
coat+0.055
pit+0.054
ked+0.052
Neg logit contributions
body-0.065
 of-0.060
 much-0.059
old-0.056
igators-0.055
Token said
Token ID531
Feature activation+4.00e+00

Pos logit contributions
paper+0.016
pit+0.016
coat+0.015
ro+0.015
ked+0.014
Neg logit contributions
body-0.015
 of-0.015
 much-0.014
old-0.013
 important-0.012
Token,
Token ID11
Feature activation+7.96e+00

Pos logit contributions
paper+0.090
ro+0.086
coat+0.086
pit+0.085
ked+0.082
Neg logit contributions
body-0.079
 of-0.073
 much-0.072
old-0.067
 important-0.065
Token relieved
Token ID22598
Feature activation+1.03e+00

Pos logit contributions
paper+0.192
ro+0.179
coat+0.178
pit+0.178
suit+0.161
Neg logit contributions
 of-0.142
 much-0.136
body-0.133
-0.119
 important-0.115
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
paper+0.031
coat+0.030
pit+0.030
ro+0.027
ked+0.027
Neg logit contributions
 of-0.015
 much-0.013
body-0.012
 you-0.010
old-0.010
Token
Token ID198
Feature activation+9.46e-01

Pos logit contributions
paper+0.046
coat+0.046
pit+0.044
ked+0.042
thank+0.042
Neg logit contributions
 of-0.028
 much-0.026
 Everyone-0.020
 importantly-0.018
body-0.018
Token
Token ID198
Feature activation-1.40e-01

Pos logit contributions
paper+0.019
ro+0.018
coat+0.018
pit+0.018
ked+0.017
Neg logit contributions
body-0.021
 of-0.020
 much-0.019
old-0.018
igators-0.018
Token"
Token ID1
Feature activation+4.55e+00

Pos logit contributions
 of+0.003
body+0.003
 much+0.003
old+0.002
+0.002
Neg logit contributions
paper-0.003
coat-0.003
pit-0.003
ked-0.003
ro-0.003
Token sailed
Token ID33835
Feature activation+9.08e-01

Pos logit contributions
body+0.014
 of+0.014
 much+0.013
old+0.013
 important+0.012
Neg logit contributions
paper-0.015
coat-0.014
ro-0.014
pit-0.014
ked-0.013
Token for
Token ID329
Feature activation-1.13e+00

Pos logit contributions
paper+0.024
coat+0.023
ro+0.022
pit+0.022
ked+0.022
Neg logit contributions
 of-0.016
 much-0.014
body-0.014
 important-0.012
ug-0.012
Token days
Token ID1528
Feature activation-2.65e-01

Pos logit contributions
body+0.021
 of+0.019
 much+0.018
old+0.018
igators+0.018
Neg logit contributions
paper-0.027
ro-0.025
coat-0.025
pit-0.025
ked-0.024
Token,
Token ID11
Feature activation+1.78e+00

Pos logit contributions
body+0.006
 much+0.005
 of+0.005
igators+0.005
old+0.005
Neg logit contributions
paper-0.005
ro-0.005
coat-0.005
pit-0.005
ked-0.005
Token exploring
Token ID13504
Feature activation+1.81e-01

Pos logit contributions
paper+0.040
ro+0.039
coat+0.039
pit+0.038
ked+0.036
Neg logit contributions
 of-0.033
body-0.032
 much-0.032
 important-0.028
old-0.028
Token unch
Token ID16258
Feature activation-7.81e+00

Pos logit contributions
paper+0.005
coat+0.005
ro+0.005
pit+0.005
ked+0.005
Neg logit contributions
 of-0.003
body-0.003
 much-0.003
 important-0.002
old-0.002
Tokenarted
Token ID19112
Feature activation-7.51e-01

Pos logit contributions
body+0.181
 much+0.160
 of+0.155
 importantly+0.149
 important+0.146
Neg logit contributions
paper-0.157
ro-0.156
pit-0.149
ked-0.142
coat-0.139
Token waters
Token ID10150
Feature activation-3.32e-02

Pos logit contributions
 of+0.013
body+0.012
 much+0.012
 important+0.011
old+0.010
Neg logit contributions
paper-0.018
coat-0.018
pit-0.018
ro-0.017
ked-0.017
Token until
Token ID1566
Feature activation-1.41e+00

Pos logit contributions
 of+0.001
body+0.001
 much+0.001
old+0.001
 important+0.001
Neg logit contributions
paper-0.001
coat-0.001
pit-0.001
ro-0.001
ked-0.001
Token he
Token ID339
Feature activation+2.49e-02

Pos logit contributions
body+0.023
 of+0.022
 much+0.021
old+0.019
 important+0.018
Neg logit contributions
paper-0.036
coat-0.035
ro-0.035
pit-0.035
ked-0.034
Token came
Token ID1625
Feature activation-1.59e-01

Pos logit contributions
paper+0.000
coat+0.000
pit+0.000
ro+0.000
ked+0.000
Neg logit contributions
body-0.001
 of-0.001
 much-0.001
old-0.001
 important-0.001
Token,
Token ID11
Feature activation+9.74e-01

Pos logit contributions
paper+0.000
pit+0.000
coat+0.000
ro+0.000
ked+0.000
Neg logit contributions
body-0.000
 of-0.000
 much-0.000
old-0.000
igators-0.000
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
paper+0.023
pit+0.023
ro+0.022
coat+0.021
ked+0.021
Neg logit contributions
body-0.016
ug-0.015
old-0.014
 much-0.014
 of-0.014
Token farmer
Token ID18739
Feature activation-3.59e+00

Pos logit contributions
body+0.036
 of+0.034
 much+0.033
old+0.030
 important+0.030
Neg logit contributions
paper-0.046
ro-0.044
coat-0.044
pit-0.044
ked-0.042
Token in
Token ID287
Feature activation-1.58e+00

Pos logit contributions
body+0.063
 of+0.058
 much+0.057
old+0.053
 important+0.051
Neg logit contributions
paper-0.087
ro-0.084
coat-0.084
pit-0.083
ked-0.080
Token the
Token ID262
Feature activation-2.57e+00

Pos logit contributions
body+0.026
old+0.024
 of+0.023
 much+0.022
ug+0.022
Neg logit contributions
paper-0.037
ro-0.037
pit-0.036
coat-0.036
ked-0.034
Token de
Token ID390
Feature activation-7.68e+00

Pos logit contributions
body+0.051
 of+0.048
 much+0.047
 important+0.043
old+0.042
Neg logit contributions
paper-0.058
ro-0.056
coat-0.055
pit-0.055
ked-0.054
Tokenll
Token ID297
Feature activation-2.59e-01

Pos logit contributions
body+0.151
 of+0.150
 much+0.147
 important+0.146
 earthquake+0.146
Neg logit contributions
paper-0.144
coat-0.143
ro-0.136
thank-0.130
pit-0.129
Token".
Token ID1911
Feature activation-6.88e-01

Pos logit contributions
body+0.004
 much+0.004
 of+0.004
old+0.004
igators+0.004
Neg logit contributions
paper-0.006
ro-0.006
coat-0.006
pit-0.006
ked-0.006
Token Suddenly
Token ID24975
Feature activation+3.12e-02

Pos logit contributions
body+0.013
 of+0.013
 much+0.013
old+0.011
igators+0.011
Neg logit contributions
paper-0.016
coat-0.015
pit-0.015
ro-0.015
ked-0.014
Token,
Token ID11
Feature activation-1.29e-01

Pos logit contributions
paper+0.001
ro+0.001
coat+0.001
pit+0.001
ked+0.001
Neg logit contributions
body-0.001
 of-0.001
 much-0.001
old-0.001
 important-0.001
Token he
Token ID339
Feature activation+5.25e-01

Pos logit contributions
body+0.003
old+0.002
 of+0.002
ug+0.002
 much+0.002
Neg logit contributions
paper-0.003
ked-0.002
pit-0.002
ro-0.002
coat-0.002
Token there
Token ID612
Feature activation+2.19e+00

Pos logit contributions
paper+0.023
pit+0.021
ro+0.021
coat+0.021
ked+0.020
Neg logit contributions
body-0.026
 of-0.023
 much-0.022
old-0.022
igators-0.022
Token was
Token ID373
Feature activation-2.22e+00

Pos logit contributions
paper+0.046
coat+0.043
ro+0.043
pit+0.043
ked+0.041
Neg logit contributions
body-0.047
 of-0.045
 much-0.044
old-0.040
 important-0.040
Token a
Token ID257
Feature activation+1.44e-01

Pos logit contributions
body+0.052
igators+0.046
old+0.046
ug+0.045
 of+0.045
Neg logit contributions
coat-0.038
paper-0.038
ro-0.036
ked-0.034
pit-0.034
Token ready
Token ID3492
Feature activation+8.09e-01

Pos logit contributions
paper+0.003
coat+0.003
ro+0.003
pit+0.003
ked+0.003
Neg logit contributions
body-0.003
 of-0.002
old-0.002
igators-0.002
 much-0.002
Token as
Token ID355
Feature activation-3.16e+00

Pos logit contributions
paper+0.020
ro+0.019
coat+0.019
pit+0.019
ked+0.018
Neg logit contributions
body-0.014
 of-0.013
 much-0.012
old-0.012
igators-0.011
Tokenhr
Token ID11840
Feature activation-6.87e+00

Pos logit contributions
body+0.065
 of+0.056
old+0.056
igators+0.056
ug+0.053
Neg logit contributions
paper-0.068
ro-0.062
coat-0.060
pit-0.059
ked-0.057
Tokent
Token ID83
Feature activation-3.93e+00

Pos logit contributions
 importantly+0.113
body+0.112
 much+0.098
 years+0.090
 earthquake+0.089
Neg logit contributions
ro-0.162
paper-0.155
ked-0.151
coat-0.143
thank-0.137
Tokenay
Token ID323
Feature activation-1.79e+00

Pos logit contributions
body+0.070
+0.066
 importantly+0.065
ug+0.061
vere+0.060
Neg logit contributions
ro-0.083
paper-0.080
pit-0.075
coat-0.073
ware-0.071
Token.
Token ID13
Feature activation-2.02e+00

Pos logit contributions
body+0.033
 much+0.031
 of+0.029
 importantly+0.029
igators+0.028
Neg logit contributions
paper-0.042
ked-0.039
coat-0.039
ro-0.038
pit-0.038
Token Every
Token ID3887
Feature activation+1.76e-01

Pos logit contributions
body+0.038
 of+0.036
 much+0.035
old+0.032
 important+0.031
Neg logit contributions
paper-0.047
pit-0.045
coat-0.045
ro-0.045
ked-0.043
Token day
Token ID1110
Feature activation-1.29e+00

Pos logit contributions
ro+0.005
coat+0.004
paper+0.004
pit+0.004
thank+0.004
Neg logit contributions
body-0.003
old-0.002
igators-0.002
idy-0.002
 importantly-0.002
Token!
Token ID0
Feature activation-1.88e+00

Pos logit contributions
body+0.021
 much+0.014
igators+0.013
ug+0.012
old+0.011
Neg logit contributions
paper-0.090
pit-0.088
ro-0.088
coat-0.086
ked-0.083
Token<|endoftext|>
Token ID50256
Feature activation+6.97e-01

Pos logit contributions
body+0.053
 important+0.047
ug+0.047
 earthquake+0.047
 of+0.046
Neg logit contributions
paper-0.023
ro-0.023
pit-0.022
Or-0.020
ked-0.020
TokenOne
Token ID3198
Feature activation-6.20e-01

Pos logit contributions
paper+0.013
coat+0.012
pit+0.012
ro+0.012
ked+0.012
Neg logit contributions
body-0.016
 of-0.016
 much-0.015
old-0.014
igators-0.014
Token day
Token ID1110
Feature activation+2.36e-01

Pos logit contributions
body+0.011
ug+0.010
keys+0.010
idy+0.009
 importantly+0.009
Neg logit contributions
coat-0.013
ro-0.013
pit-0.013
paper-0.012
thank-0.011
Token,
Token ID11
Feature activation+6.04e-01

Pos logit contributions
paper+0.005
coat+0.005
ro+0.005
pit+0.005
ked+0.005
Neg logit contributions
body-0.005
 of-0.004
 much-0.004
old-0.004
igators-0.004
Token Sil
Token ID4243
Feature activation-6.66e+00

Pos logit contributions
paper+0.015
ro+0.014
coat+0.014
pit+0.014
ked+0.014
Neg logit contributions
body-0.011
 of-0.010
 much-0.009
old-0.009
 important-0.009
Tokenvia
Token ID8869
Feature activation-3.27e+00

Pos logit contributions
body+0.108
 of+0.097
 earthquake+0.096
 importantly+0.094
 much+0.094
Neg logit contributions
ro-0.165
paper-0.164
pit-0.157
coat-0.151
ked-0.150
Token was
Token ID373
Feature activation-1.04e+00

Pos logit contributions
body+0.079
 of+0.075
 much+0.074
old+0.069
 important+0.069
Neg logit contributions
paper-0.058
ro-0.055
coat-0.054
pit-0.054
ked-0.051
Token getting
Token ID1972
Feature activation-3.41e+00

Pos logit contributions
body+0.023
igators+0.022
idy+0.020
icated+0.020
old+0.020
Neg logit contributions
coat-0.020
paper-0.018
pit-0.018
ro-0.018
ked-0.018
Token ready
Token ID3492
Feature activation+1.07e+00

Pos logit contributions
body+0.067
 of+0.062
 much+0.060
old+0.057
igators+0.056
Neg logit contributions
paper-0.077
coat-0.073
ro-0.073
pit-0.072
ked-0.070
Token to
Token ID284
Feature activation+3.07e-01

Pos logit contributions
paper+0.023
pit+0.022
coat+0.022
ro+0.022
ked+0.021
Neg logit contributions
 of-0.023
body-0.022
 much-0.021
 important-0.019
old-0.019
Token's
Token ID338
Feature activation-9.68e-01

Pos logit contributions
paper+0.047
ro+0.045
coat+0.045
pit+0.045
ked+0.044
Neg logit contributions
body-0.029
 of-0.027
 much-0.026
old-0.024
ug-0.023
Token very
Token ID845
Feature activation-1.22e+00

Pos logit contributions
body+0.017
 of+0.016
 much+0.016
old+0.015
igators+0.014
Neg logit contributions
paper-0.024
ro-0.023
coat-0.022
pit-0.022
ked-0.021
Token mild
Token ID11607
Feature activation-4.98e-01

Pos logit contributions
body+0.021
 of+0.019
old+0.018
 much+0.018
ug+0.017
Neg logit contributions
paper-0.030
ro-0.029
coat-0.028
pit-0.028
ked-0.027
Token,
Token ID11
Feature activation+2.71e+00

Pos logit contributions
body+0.009
 of+0.009
 much+0.008
old+0.008
igators+0.008
Neg logit contributions
paper-0.012
ro-0.011
coat-0.011
pit-0.011
ked-0.011
Token like
Token ID588
Feature activation-2.25e+00

Pos logit contributions
paper+0.057
ro+0.054
coat+0.054
pit+0.054
ked+0.051
Neg logit contributions
body-0.057
 of-0.053
 much-0.052
old-0.049
 important-0.048
Token tal
Token ID3305
Feature activation-6.52e+00

Pos logit contributions
body+0.039
 of+0.035
 much+0.034
old+0.033
ug+0.032
Neg logit contributions
paper-0.055
ro-0.052
coat-0.051
pit-0.051
ked-0.049
Tokencum
Token ID36340
Feature activation-1.17e+00

Pos logit contributions
body+0.100
 much+0.095
 of+0.089
igators+0.088
 important+0.086
Neg logit contributions
paper-0.164
coat-0.152
pit-0.150
ro-0.145
ked-0.139
Token powder
Token ID11913
Feature activation-6.32e-01

Pos logit contributions
body+0.028
 of+0.027
 much+0.026
old+0.024
 important+0.024
Neg logit contributions
paper-0.021
ro-0.020
coat-0.020
pit-0.020
ked-0.019
Token.
Token ID13
Feature activation-9.81e-01

Pos logit contributions
body+0.011
 of+0.010
 much+0.010
 important+0.009
old+0.009
Neg logit contributions
paper-0.016
pit-0.015
ro-0.015
coat-0.015
ked-0.015
Token I
Token ID314
Feature activation+2.19e+00

Pos logit contributions
body+0.020
 of+0.018
 much+0.018
 important+0.017
old+0.017
Neg logit contributions
paper-0.021
ro-0.021
coat-0.020
pit-0.020
ked-0.019
Token'm
Token ID1101
Feature activation-2.02e+00

Pos logit contributions
paper+0.056
pit+0.053
coat+0.051
ked+0.050
ro+0.048
Neg logit contributions
 of-0.041
body-0.039
 much-0.039
-0.038
 important-0.035
TokenM
Token ID44
Feature activation-1.97e+00

Pos logit contributions
paper+0.022
ro+0.021
coat+0.021
pit+0.020
ked+0.019
Neg logit contributions
body-0.026
 of-0.024
 much-0.024
old-0.023
 important-0.022
Tokenolly
Token ID5098
Feature activation-1.77e+00

Pos logit contributions
body+0.037
igators+0.034
ug+0.033
 matters+0.032
 of+0.031
Neg logit contributions
paper-0.039
pit-0.037
coat-0.037
ro-0.035
aj-0.035
Token was
Token ID373
Feature activation-1.33e+00

Pos logit contributions
body+0.037
 of+0.037
 much+0.035
old+0.033
 important+0.032
Neg logit contributions
paper-0.037
pit-0.035
coat-0.034
ro-0.034
ked-0.033
Token watching
Token ID4964
Feature activation-1.60e+00

Pos logit contributions
body+0.029
old+0.025
igators+0.025
idy+0.024
 of+0.024
Neg logit contributions
coat-0.027
pit-0.025
ked-0.024
thank-0.024
paper-0.024
Token the
Token ID262
Feature activation-2.58e+00

Pos logit contributions
body+0.027
 of+0.024
 much+0.024
old+0.023
igators+0.022
Neg logit contributions
paper-0.040
ro-0.038
pit-0.038
coat-0.038
ked-0.037
Token Sh
Token ID911
Feature activation-6.46e+00

Pos logit contributions
body+0.054
old+0.049
 much+0.047
ug+0.047
 of+0.047
Neg logit contributions
paper-0.052
pit-0.049
ro-0.049
coat-0.047
thank-0.044
Tokenower
Token ID789
Feature activation-1.61e+00

Pos logit contributions
 much+0.113
body+0.112
 important+0.106
 earthquake+0.105
 of+0.105
Neg logit contributions
ro-0.150
coat-0.147
paper-0.142
pit-0.140
thank-0.123
Token.
Token ID13
Feature activation-2.47e+00

Pos logit contributions
body+0.031
 much+0.028
 of+0.028
old+0.027
 important+0.026
Neg logit contributions
paper-0.037
pit-0.035
ro-0.035
coat-0.034
ked-0.033
Token She
Token ID1375
Feature activation-5.29e-01

Pos logit contributions
body+0.051
 of+0.050
 much+0.049
old+0.045
igators+0.044
Neg logit contributions
paper-0.053
coat-0.049
pit-0.049
ro-0.048
ked-0.048
Token watched
Token ID7342
Feature activation-1.01e+00

Pos logit contributions
body+0.011
 of+0.011
 much+0.010
 important+0.009
old+0.009
Neg logit contributions
paper-0.011
coat-0.011
pit-0.011
ro-0.011
ked-0.010
Token the
Token ID262
Feature activation-1.98e+00

Pos logit contributions
body+0.017
 of+0.015
 much+0.015
old+0.014
igators+0.014
Neg logit contributions
paper-0.026
ro-0.024
coat-0.024
pit-0.024
ked-0.024
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
body+0.034
 much+0.030
 of+0.030
old+0.028
 importantly+0.028
Neg logit contributions
paper-0.033
ro-0.031
pit-0.031
ked-0.031
coat-0.030
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
body+0.032
 of+0.031
 much+0.030
old+0.028
igators+0.027
Neg logit contributions
paper-0.034
coat-0.032
pit-0.032
ro-0.032
ked-0.031
Token
Token ID198
Feature activation-6.17e-01

Pos logit contributions
paper+0.016
ro+0.016
coat+0.016
pit+0.014
ked+0.012
Neg logit contributions
body-0.028
 earthquake-0.026
igators-0.026
 important-0.025
ug-0.025
TokenOne
Token ID3198
Feature activation-1.32e+00

Pos logit contributions
body+0.013
 of+0.012
 much+0.012
old+0.011
igators+0.011
Neg logit contributions
paper-0.013
coat-0.013
ro-0.013
pit-0.012
ked-0.012
Token day
Token ID1110
Feature activation+6.71e-01

Pos logit contributions
body+0.025
 mistakes+0.024
ayed+0.024
ug+0.023
 importantly+0.023
Neg logit contributions
ro-0.028
suit-0.023
coat-0.022
thank-0.022
paper-0.021
Token Sh
Token ID911
Feature activation-6.34e+00

Pos logit contributions
paper+0.015
ro+0.014
coat+0.014
pit+0.014
ked+0.013
Neg logit contributions
body-0.014
 of-0.013
 much-0.012
old-0.012
igators-0.012
Tokenrimp
Token ID23750
Feature activation-3.26e+00

Pos logit contributions
body+0.123
 much+0.116
 of+0.116
 earthquake+0.108
 important+0.107
Neg logit contributions
paper-0.141
ro-0.141
coat-0.135
pit-0.133
ked-0.125
Token decided
Token ID3066
Feature activation+1.24e+00

Pos logit contributions
body+0.076
 of+0.074
 much+0.071
old+0.067
igators+0.066
Neg logit contributions
paper-0.061
coat-0.058
ro-0.058
pit-0.057
ked-0.054
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
pit+0.025
paper+0.025
coat+0.024
ro+0.024
ked+0.023
Neg logit contributions
 of-0.027
body-0.026
 much-0.026
 important-0.024
old-0.023
Token go
Token ID467
Feature activation+5.83e-01

Pos logit contributions
ro+0.021
paper+0.020
coat+0.020
pit+0.019
thank+0.017
Neg logit contributions
body-0.023
 much-0.022
 importantly-0.022
 of-0.021
old-0.020
Token visit
Token ID3187
Feature activation-1.65e+00

Pos logit contributions
paper+0.012
ro+0.011
coat+0.011
pit+0.011
ked+0.011
Neg logit contributions
body-0.013
 of-0.012
 much-0.012
old-0.011
 important-0.011
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
body+0.005
 of+0.005
 much+0.004
old+0.004
 important+0.004
Neg logit contributions
paper-0.007
coat-0.007
ro-0.007
pit-0.006
ked-0.006
Token It
Token ID632
Feature activation+6.58e-02

Pos logit contributions
body+0.021
 of+0.019
 much+0.019
 important+0.018
igators+0.017
Neg logit contributions
paper-0.029
ro-0.028
pit-0.028
coat-0.027
ked-0.027
Token was
Token ID373
Feature activation-2.34e+00

Pos logit contributions
paper+0.001
ro+0.001
coat+0.001
pit+0.001
ked+0.001
Neg logit contributions
body-0.001
 of-0.001
 much-0.001
 important-0.001
-0.001
Token covered
Token ID5017
Feature activation-2.30e+00

Pos logit contributions
 of+0.035
 much+0.035
body+0.034
 important+0.030
+0.027
Neg logit contributions
paper-0.065
ro-0.062
coat-0.062
pit-0.062
ked-0.060
Token in
Token ID287
Feature activation-1.81e+00

Pos logit contributions
body+0.047
 much+0.042
 of+0.042
old+0.041
igators+0.039
Neg logit contributions
paper-0.050
ro-0.048
pit-0.048
coat-0.048
ked-0.045
Token de
Token ID390
Feature activation-6.27e+00

Pos logit contributions
body+0.035
 of+0.033
 much+0.032
old+0.029
 important+0.029
Neg logit contributions
paper-0.041
coat-0.040
ro-0.040
pit-0.039
ked-0.038
Tokenw
Token ID86
Feature activation-3.35e-01

Pos logit contributions
body+0.122
 of+0.119
 much+0.114
old+0.110
 importantly+0.109
Neg logit contributions
paper-0.133
coat-0.127
ro-0.124
pit-0.121
ked-0.117
Token from
Token ID422
Feature activation-2.57e+00

Pos logit contributions
body+0.006
 of+0.005
 much+0.005
old+0.005
 important+0.005
Neg logit contributions
paper-0.008
coat-0.008
ro-0.008
pit-0.008
ked-0.008
Token the
Token ID262
Feature activation-9.07e-01

Pos logit contributions
body+0.042
 of+0.039
 much+0.038
old+0.035
 important+0.034
Neg logit contributions
paper-0.066
ro-0.064
coat-0.063
pit-0.063
ked-0.061
Token wet
Token ID9583
Feature activation+2.07e+00

Pos logit contributions
body+0.018
 of+0.017
 much+0.017
 important+0.016
old+0.015
Neg logit contributions
paper-0.020
coat-0.019
ro-0.019
pit-0.019
ked-0.018
Token rain
Token ID6290
Feature activation+1.59e+00

Pos logit contributions
paper+0.044
coat+0.041
ked+0.041
pit+0.040
ro+0.040
Neg logit contributions
body-0.043
 of-0.043
 much-0.040
 earthquake-0.037
 important-0.037
TokenJimmy
Token ID40335
Feature activation-7.58e-01

Pos logit contributions
paper+0.005
coat+0.005
pit+0.005
ro+0.005
ked+0.005
Neg logit contributions
 of-0.006
body-0.006
 much-0.006
old-0.005
-0.005
Token was
Token ID373
Feature activation-1.79e+00

Pos logit contributions
 of+0.015
+0.014
body+0.014
 much+0.013
old+0.013
Neg logit contributions
paper-0.018
pit-0.016
coat-0.016
ked-0.016
ro-0.016
Token always
Token ID1464
Feature activation-1.32e+00

Pos logit contributions
body+0.035
 of+0.030
igators+0.030
old+0.029
 much+0.029
Neg logit contributions
coat-0.039
pit-0.039
paper-0.039
ro-0.038
ked-0.036
Token excited
Token ID6568
Feature activation-1.11e+00

Pos logit contributions
body+0.023
 of+0.022
 much+0.022
 important+0.020
old+0.020
Neg logit contributions
paper-0.032
ro-0.031
coat-0.030
pit-0.030
ked-0.029
Token for
Token ID329
Feature activation-2.20e+00

Pos logit contributions
body+0.028
 much+0.026
 of+0.025
igators+0.025
ug+0.024
Neg logit contributions
paper-0.019
ro-0.018
pit-0.018
coat-0.018
ked-0.016
Token Wed
Token ID3300
Feature activation-6.22e+00

Pos logit contributions
body+0.046
igators+0.040
ug+0.039
 much+0.039
 of+0.038
Neg logit contributions
paper-0.045
ro-0.043
pit-0.042
coat-0.040
ked-0.038
Tokennes
Token ID2516
Feature activation-2.66e+00

Pos logit contributions
body+0.144
 of+0.129
igators+0.121
 much+0.119
 obstacle+0.119
Neg logit contributions
ro-0.119
paper-0.118
coat-0.116
thank-0.106
bel-0.103
Tokendays
Token ID12545
Feature activation-5.40e-01

Pos logit contributions
body+0.064
 much+0.054
 important+0.052
igators+0.052
 importantly+0.052
Neg logit contributions
ro-0.049
pit-0.049
coat-0.049
paper-0.048
ked-0.041
Token because
Token ID780
Feature activation-4.77e+00

Pos logit contributions
body+0.009
 of+0.009
 much+0.009
old+0.008
 important+0.008
Neg logit contributions
paper-0.013
ro-0.012
coat-0.012
pit-0.012
ked-0.012
Token it
Token ID340
Feature activation-1.10e+00

Pos logit contributions
body+0.111
ug+0.101
old+0.097
asm+0.097
ime+0.095
Neg logit contributions
paper-0.078
pit-0.077
ro-0.073
coat-0.071
 shoes-0.065
Token was
Token ID373
Feature activation-3.85e+00

Pos logit contributions
body+0.026
 of+0.024
 much+0.024
old+0.023
ug+0.023
Neg logit contributions
paper-0.020
coat-0.019
ro-0.019
pit-0.018
ked-0.017
Token Wed
Token ID3300
Feature activation-4.71e+00

Pos logit contributions
 of+0.004
body+0.004
 much+0.004
 important+0.004
old+0.004
Neg logit contributions
paper-0.004
pit-0.004
ro-0.004
coat-0.004
ked-0.004
Tokennes
Token ID2516
Feature activation-2.66e+00

Pos logit contributions
body+0.104
 of+0.094
 much+0.089
old+0.086
 importantly+0.085
Neg logit contributions
coat-0.098
paper-0.097
ro-0.097
pit-0.087
thank-0.085
Tokendays
Token ID12545
Feature activation-4.68e-01

Pos logit contributions
body+0.072
 obstacle+0.060
 importantly+0.059
igators+0.058
 much+0.058
Neg logit contributions
coat-0.043
paper-0.041
pit-0.041
ro-0.041
bel-0.035
Token because
Token ID780
Feature activation-4.42e+00

Pos logit contributions
body+0.010
igators+0.009
old+0.008
 much+0.008
 of+0.008
Neg logit contributions
coat-0.010
paper-0.010
ro-0.009
pit-0.009
ked-0.008
Token on
Token ID319
Feature activation-3.79e+00

Pos logit contributions
body+0.100
ug+0.093
asm+0.092
old+0.091
ime+0.089
Neg logit contributions
paper-0.073
coat-0.072
pit-0.071
ro-0.068
 shoes-0.067
Token Wed
Token ID3300
Feature activation-6.16e+00

Pos logit contributions
body+0.082
old+0.075
ug+0.074
 of+0.072
 importantly+0.072
Neg logit contributions
paper-0.073
coat-0.072
ro-0.072
pit-0.070
 shoes-0.061
Tokennes
Token ID2516
Feature activation-2.66e+00

Pos logit contributions
body+0.132
 of+0.125
 much+0.117
old+0.112
 important+0.112
Neg logit contributions
paper-0.126
coat-0.126
ro-0.126
pit-0.116
thank-0.112
Tokendays
Token ID12545
Feature activation-1.28e+00

Pos logit contributions
body+0.054
 of+0.049
 much+0.048
old+0.045
 important+0.044
Neg logit contributions
paper-0.059
coat-0.056
ro-0.056
pit-0.056
ked-0.052
Token she
Token ID673
Feature activation-1.63e+00

Pos logit contributions
body+0.027
 of+0.024
 much+0.024
old+0.023
igators+0.023
Neg logit contributions
paper-0.027
coat-0.027
ro-0.026
pit-0.025
ked-0.024
Token did
Token ID750
Feature activation-3.64e-01

Pos logit contributions
body+0.040
 of+0.036
 much+0.036
old+0.034
igators+0.034
Neg logit contributions
paper-0.029
coat-0.028
ro-0.028
pit-0.027
ked-0.025
Token something
Token ID1223
Feature activation-1.06e+00

Pos logit contributions
body+0.006
 much+0.006
 of+0.005
 important+0.005
old+0.005
Neg logit contributions
paper-0.010
ro-0.009
ked-0.009
coat-0.009
pit-0.009
Token are
Token ID389
Feature activation-2.58e+00

Pos logit contributions
body+0.011
 of+0.010
 much+0.010
old+0.009
ug+0.009
Neg logit contributions
paper-0.009
ro-0.009
coat-0.009
pit-0.009
ked-0.008
Token not
Token ID407
Feature activation-3.00e+00

Pos logit contributions
body+0.051
old+0.047
ug+0.044
 of+0.042
ime+0.041
Neg logit contributions
paper-0.056
pit-0.052
ro-0.052
coat-0.051
Or-0.049
Token flowers
Token ID12734
Feature activation-2.29e+00

Pos logit contributions
body+0.055
 of+0.048
old+0.048
ug+0.046
 much+0.045
Neg logit contributions
paper-0.069
coat-0.066
ro-0.066
pit-0.065
ked-0.063
Token,
Token ID11
Feature activation+2.53e+00

Pos logit contributions
body+0.047
 much+0.042
 of+0.042
old+0.041
ug+0.040
Neg logit contributions
paper-0.049
ro-0.046
coat-0.046
pit-0.045
ked-0.043
Token my
Token ID616
Feature activation+1.63e+00

Pos logit contributions
paper+0.059
ro+0.056
pit+0.056
coat+0.055
ked+0.053
Neg logit contributions
body-0.046
 of-0.043
 much-0.042
old-0.041
igators-0.039
Token de
Token ID390
Feature activation-6.10e+00

Pos logit contributions
paper+0.039
coat+0.037
ro+0.037
pit+0.036
ked+0.036
Neg logit contributions
body-0.030
 of-0.028
 much-0.027
 important-0.027
old-0.024
Tokenars
Token ID945
Feature activation-1.52e+00

Pos logit contributions
body+0.132
 of+0.129
 much+0.124
old+0.120
igators+0.120
Neg logit contributions
paper-0.113
coat-0.110
ro-0.106
ked-0.100
pit-0.099
Token.
Token ID13
Feature activation-1.77e+00

Pos logit contributions
body+0.028
 of+0.026
 much+0.025
old+0.023
 important+0.023
Neg logit contributions
paper-0.036
ro-0.035
coat-0.035
pit-0.035
ked-0.033
Token Those
Token ID5845
Feature activation-1.58e-01

Pos logit contributions
body+0.037
 of+0.034
 much+0.034
old+0.032
 important+0.031
Neg logit contributions
paper-0.037
ro-0.036
coat-0.035
pit-0.035
ked-0.034
Token are
Token ID389
Feature activation-2.93e+00

Pos logit contributions
body+0.003
 of+0.003
 much+0.003
old+0.003
 important+0.003
Neg logit contributions
paper-0.003
ro-0.003
coat-0.003
pit-0.003
ked-0.003
Token caul
Token ID46087
Feature activation-3.71e+00

Pos logit contributions
body+0.054
old+0.048
 of+0.046
ug+0.045
 much+0.044
Neg logit contributions
paper-0.067
ro-0.064
coat-0.062
pit-0.062
ked-0.061
Token to
Token ID284
Feature activation+1.36e+00

Pos logit contributions
body+0.006
 of+0.006
 much+0.006
 important+0.005
old+0.005
Neg logit contributions
paper-0.005
pit-0.004
coat-0.004
ro-0.004
ked-0.004
Token wait
Token ID4043
Feature activation+1.80e+00

Pos logit contributions
paper+0.030
pit+0.029
coat+0.029
ro+0.028
ked+0.028
Neg logit contributions
body-0.027
 of-0.026
 much-0.025
old-0.023
 important-0.023
Token until
Token ID1566
Feature activation-2.45e+00

Pos logit contributions
paper+0.046
pit+0.043
ked+0.043
coat+0.043
ro+0.043
Neg logit contributions
 of-0.031
 much-0.031
body-0.030
 important-0.028
old-0.025
Token tomorrow
Token ID9439
Feature activation+9.41e-03

Pos logit contributions
body+0.032
 of+0.029
 much+0.028
old+0.025
 important+0.024
Neg logit contributions
paper-0.071
ro-0.068
coat-0.068
pit-0.068
ked-0.066
Token.
Token ID13
Feature activation-1.92e+00

Pos logit contributions
paper+0.000
pit+0.000
coat+0.000
ro+0.000
ked+0.000
Neg logit contributions
body-0.000
 of-0.000
 much-0.000
old-0.000
 important-0.000
Token Because
Token ID4362
Feature activation-6.01e+00

Pos logit contributions
body+0.040
 of+0.036
 much+0.035
igators+0.034
old+0.034
Neg logit contributions
ro-0.040
paper-0.038
coat-0.038
pit-0.037
ked-0.036
Token it
Token ID340
Feature activation-5.92e-01

Pos logit contributions
body+0.073
old+0.061
igators+0.058
 much+0.057
 of+0.055
Neg logit contributions
paper-0.172
ro-0.169
coat-0.161
pit-0.161
ked-0.153
Token is
Token ID318
Feature activation-2.68e+00

Pos logit contributions
body+0.012
ug+0.011
 much+0.011
old+0.010
 importantly+0.010
Neg logit contributions
paper-0.013
ro-0.012
coat-0.012
pit-0.012
ked-0.011
Token time
Token ID640
Feature activation+1.61e+00

Pos logit contributions
body+0.043
 of+0.039
 much+0.037
old+0.036
ug+0.034
Neg logit contributions
paper-0.070
coat-0.067
ro-0.067
pit-0.067
ked-0.064
Token for
Token ID329
Feature activation-3.24e+00

Pos logit contributions
paper+0.032
ro+0.031
coat+0.031
pit+0.031
ked+0.029
Neg logit contributions
body-0.035
 of-0.034
 much-0.033
old-0.031
 important-0.030
Token bed
Token ID3996
Feature activation-9.22e-02

Pos logit contributions
body+0.057
 of+0.052
 much+0.050
old+0.049
igators+0.047
Neg logit contributions
paper-0.078
ro-0.076
coat-0.075
pit-0.073
ked-0.071
TokenHer
Token ID9360
Feature activation+1.43e+00

Pos logit contributions
body+0.010
 of+0.009
 much+0.009
old+0.009
 important+0.008
Neg logit contributions
paper-0.010
ro-0.010
coat-0.009
pit-0.009
ked-0.009
Token mom
Token ID1995
Feature activation-1.48e-01

Pos logit contributions
paper+0.037
ro+0.036
coat+0.034
pit+0.034
ked+0.034
Neg logit contributions
 of-0.024
 much-0.023
body-0.023
 important-0.022
old-0.020
Token said
Token ID531
Feature activation+2.86e+00

Pos logit contributions
 of+0.003
body+0.003
 much+0.003
old+0.003
 important+0.002
Neg logit contributions
paper-0.003
ro-0.003
pit-0.003
coat-0.003
ked-0.003
Token,
Token ID11
Feature activation+5.24e+00

Pos logit contributions
paper+0.067
ro+0.064
pit+0.064
coat+0.064
ked+0.061
Neg logit contributions
body-0.054
 of-0.051
 much-0.050
old-0.045
 important-0.045
Token "
Token ID366
Feature activation+2.35e+00

Pos logit contributions
pit+0.108
paper+0.107
ro+0.105
coat+0.103
ked+0.094
Neg logit contributions
 of-0.115
 much-0.112
body-0.109
 important-0.102
-0.099
TokenSte
Token ID7447
Feature activation-5.99e+00

Pos logit contributions
paper+0.043
pit+0.042
ro+0.041
coat+0.040
ked+0.039
Neg logit contributions
body-0.058
 of-0.055
 much-0.053
old-0.050
 important-0.049
Tokenak
Token ID461
Feature activation-1.03e+00

Pos logit contributions
body+0.113
 of+0.107
 importantly+0.106
 much+0.102
 important+0.097
Neg logit contributions
paper-0.132
ro-0.131
coat-0.120
pit-0.119
thank-0.114
Token is
Token ID318
Feature activation-2.42e+00

Pos logit contributions
body+0.020
 of+0.018
 much+0.018
old+0.017
 important+0.016
Neg logit contributions
paper-0.023
ro-0.022
coat-0.022
pit-0.022
ked-0.021
Token a
Token ID257
Feature activation+3.84e-01

Pos logit contributions
body+0.045
 of+0.040
ug+0.039
old+0.039
 much+0.038
Neg logit contributions
paper-0.056
coat-0.053
ro-0.052
pit-0.052
ked-0.050
Token type
Token ID2099
Feature activation-1.42e+00

Pos logit contributions
paper+0.010
ro+0.010
coat+0.010
pit+0.010
ked+0.009
Neg logit contributions
body-0.006
 of-0.006
 much-0.005
old-0.005
 important-0.005
Token of
Token ID286
Feature activation-1.27e-01

Pos logit contributions
body+0.010
 much+0.006
ug+0.006
 matters+0.006
old+0.006
Neg logit contributions
ro-0.049
coat-0.047
pit-0.046
paper-0.046
suit-0.044
Token jelly
Token ID27644
Feature activation-8.51e-02

Pos logit contributions
body+0.051
 of+0.048
 much+0.047
old+0.045
ug+0.045
Neg logit contributions
paper-0.046
pit-0.044
ro-0.044
coat-0.043
ked-0.040
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
body+0.002
 of+0.002
 much+0.002
old+0.001
 important+0.001
Neg logit contributions
paper-0.002
coat-0.002
ro-0.002
ked-0.002
pit-0.002
Token<|endoftext|>
Token ID50256
Feature activation+4.75e-01

Pos logit contributions
body+0.034
 of+0.031
 important+0.030
ug+0.030
igators+0.030
Neg logit contributions
ro-0.022
paper-0.021
pit-0.021
coat-0.020
ked-0.019
TokenJohn
Token ID7554
Feature activation-1.37e+00

Pos logit contributions
paper+0.010
coat+0.009
pit+0.009
ked+0.009
ro+0.009
Neg logit contributions
 of-0.010
body-0.010
 much-0.010
old-0.009
-0.009
Token loved
Token ID6151
Feature activation-8.80e-01

Pos logit contributions
 of+0.027
body+0.025
+0.025
old+0.025
 much+0.025
Neg logit contributions
paper-0.031
pit-0.029
coat-0.029
ked-0.028
ro-0.028
Token k
Token ID479
Feature activation-5.94e+00

Pos logit contributions
body+0.018
 of+0.018
 much+0.017
 important+0.016
old+0.016
Neg logit contributions
paper-0.019
pit-0.018
ro-0.018
coat-0.018
ked-0.018
Tokenites
Token ID2737
Feature activation-2.80e+00

Pos logit contributions
body+0.128
 much+0.116
 important+0.113
 of+0.113
 earthquake+0.110
Neg logit contributions
paper-0.124
coat-0.116
pit-0.115
ro-0.114
ked-0.110
Token.
Token ID13
Feature activation-2.20e+00

Pos logit contributions
body+0.053
 of+0.050
 much+0.049
old+0.045
 important+0.044
Neg logit contributions
paper-0.064
ro-0.061
coat-0.061
pit-0.061
ked-0.059
Token He
Token ID679
Feature activation+9.97e-02

Pos logit contributions
body+0.048
 of+0.047
 much+0.046
old+0.043
igators+0.042
Neg logit contributions
paper-0.044
coat-0.042
pit-0.041
ro-0.041
ked-0.040
Token looked
Token ID3114
Feature activation+1.43e+00

Pos logit contributions
paper+0.002
pit+0.002
ro+0.002
coat+0.002
ked+0.002
Neg logit contributions
body-0.002
 of-0.002
 much-0.002
 important-0.002
old-0.002
Token at
Token ID379
Feature activation-1.68e+00

Pos logit contributions
paper+0.034
coat+0.032
pit+0.032
ro+0.031
ked+0.031
Neg logit contributions
body-0.026
 of-0.023
 much-0.022
old-0.022
ug-0.021
Token and
Token ID290
Feature activation-2.54e+00

Pos logit contributions
paper+0.013
coat+0.013
pit+0.012
ked+0.012
ro+0.012
Neg logit contributions
 of-0.011
body-0.011
 much-0.010
 important-0.009
igators-0.009
Token wet
Token ID9583
Feature activation+2.85e-01

Pos logit contributions
body+0.045
 of+0.042
 much+0.041
old+0.037
 important+0.037
Neg logit contributions
paper-0.062
ro-0.059
coat-0.059
pit-0.059
ked-0.057
Token from
Token ID422
Feature activation-3.10e+00

Pos logit contributions
paper+0.007
coat+0.006
ro+0.006
pit+0.006
ked+0.006
Neg logit contributions
 of-0.005
body-0.005
 much-0.005
igators-0.005
 earthquake-0.004
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
body+0.057
 of+0.051
 much+0.050
old+0.048
ug+0.047
Neg logit contributions
paper-0.072
pit-0.070
ro-0.070
coat-0.069
ked-0.067
Token morning
Token ID3329
Feature activation+2.21e-01

Pos logit contributions
body+0.024
 of+0.023
 much+0.022
 important+0.021
igators+0.020
Neg logit contributions
paper-0.028
coat-0.026
ked-0.025
ro-0.025
pit-0.025
Token de
Token ID390
Feature activation-5.93e+00

Pos logit contributions
paper+0.005
ro+0.005
coat+0.005
pit+0.005
ked+0.005
Neg logit contributions
body-0.004
 of-0.004
 much-0.004
old-0.004
igators-0.003
Tokenw
Token ID86
Feature activation-5.45e-01

Pos logit contributions
body+0.115
 of+0.112
 much+0.107
old+0.102
 important+0.100
Neg logit contributions
paper-0.130
coat-0.125
ro-0.124
ked-0.121
pit-0.121
Token.
Token ID13
Feature activation-8.56e-01

Pos logit contributions
body+0.010
 of+0.009
 much+0.009
old+0.008
igators+0.008
Neg logit contributions
paper-0.013
coat-0.013
ro-0.013
pit-0.012
ked-0.012
Token She
Token ID1375
Feature activation+3.66e-02

Pos logit contributions
body+0.017
 of+0.016
 much+0.016
old+0.015
 important+0.014
Neg logit contributions
paper-0.019
ro-0.018
coat-0.018
pit-0.018
ked-0.017
Token pointed
Token ID6235
Feature activation+5.77e-01

Pos logit contributions
paper+0.001
pit+0.001
coat+0.001
ro+0.001
ked+0.001
Neg logit contributions
body-0.001
 of-0.001
 much-0.001
old-0.001
 important-0.001
Token at
Token ID379
Feature activation-1.49e+00

Pos logit contributions
paper+0.013
coat+0.013
ro+0.013
pit+0.013
ked+0.012
Neg logit contributions
 of-0.012
 much-0.011
body-0.011
old-0.010
-0.009
Token looks
Token ID3073
Feature activation-3.17e-01

Pos logit contributions
body+0.010
 of+0.009
 much+0.009
old+0.009
ug+0.008
Neg logit contributions
paper-0.015
coat-0.014
pit-0.014
ro-0.014
ked-0.012
Token wonderful
Token ID7932
Feature activation+4.15e-01

Pos logit contributions
body+0.005
 of+0.004
 much+0.004
old+0.004
 important+0.004
Neg logit contributions
paper-0.009
ro-0.008
coat-0.008
pit-0.008
ked-0.008
Token on
Token ID319
Feature activation-2.64e+00

Pos logit contributions
paper+0.011
coat+0.011
ro+0.011
ked+0.011
pit+0.011
Neg logit contributions
 of-0.006
body-0.006
 much-0.006
 important-0.005
-0.005
Token you
Token ID345
Feature activation-6.02e-01

Pos logit contributions
body+0.033
old+0.028
 of+0.027
 much+0.026
igators+0.025
Neg logit contributions
paper-0.076
ro-0.073
coat-0.071
pit-0.070
ked-0.068
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
body+0.011
 of+0.010
 much+0.010
old+0.010
igators+0.009
Neg logit contributions
paper-0.014
ro-0.013
pit-0.013
coat-0.013
ked-0.012
Token Sil
Token ID4243
Feature activation-5.91e+00

Pos logit contributions
paper+0.006
ro+0.006
pit+0.005
coat+0.005
ked+0.005
Neg logit contributions
body-0.005
 much-0.004
old-0.004
igators-0.004
 of-0.004
Tokenvia
Token ID8869
Feature activation-4.22e+00

Pos logit contributions
body+0.102
 of+0.097
 much+0.095
 importantly+0.086
 earthquake+0.086
Neg logit contributions
paper-0.142
ro-0.138
pit-0.136
coat-0.133
ked-0.129
Token!
Token ID0
Feature activation-2.59e+00

Pos logit contributions
body+0.069
 of+0.064
 much+0.062
old+0.057
 important+0.055
Neg logit contributions
paper-0.108
ro-0.104
coat-0.104
pit-0.103
ked-0.100
Token You
Token ID921
Feature activation+2.18e-01

Pos logit contributions
body+0.050
 much+0.045
 of+0.044
 important+0.042
 unexpected+0.041
Neg logit contributions
ro-0.058
paper-0.057
coat-0.056
pit-0.054
ked-0.054
Token look
Token ID804
Feature activation+1.52e+00

Pos logit contributions
paper+0.005
ro+0.005
pit+0.005
coat+0.005
ked+0.004
Neg logit contributions
body-0.004
 of-0.004
 much-0.004
old-0.004
 important-0.004
Token so
Token ID523
Feature activation-1.87e+00

Pos logit contributions
paper+0.042
coat+0.040
ro+0.040
pit+0.040
ked+0.039
Neg logit contributions
body-0.022
 of-0.020
 much-0.019
old-0.018
igators-0.017
Token of
Token ID286
Feature activation-1.36e+00

Pos logit contributions
body+0.005
igators+0.003
ug+0.002
old+0.002
 earthquake+0.002
Neg logit contributions
paper-0.039
ro-0.039
coat-0.037
pit-0.036
ked-0.036
Token water
Token ID1660
Feature activation+1.80e+00

Pos logit contributions
body+0.030
 of+0.029
 much+0.028
old+0.026
 important+0.026
Neg logit contributions
paper-0.027
ro-0.026
coat-0.026
pit-0.026
ked-0.024
Token from
Token ID422
Feature activation-3.42e+00

Pos logit contributions
pit+0.041
paper+0.041
coat+0.041
ro+0.040
ked+0.039
Neg logit contributions
 of-0.035
 much-0.031
body-0.030
 important-0.027
-0.026
Token the
Token ID262
Feature activation-9.71e-01

Pos logit contributions
body+0.049
 of+0.047
 much+0.046
 important+0.040
igators+0.040
Neg logit contributions
paper-0.095
coat-0.092
ro-0.090
pit-0.090
ked-0.089
Token morning
Token ID3329
Feature activation+1.69e+00

Pos logit contributions
body+0.019
 of+0.018
 much+0.018
 important+0.016
 earthquake+0.015
Neg logit contributions
paper-0.022
ked-0.022
coat-0.021
ro-0.021
pit-0.021
Token de
Token ID390
Feature activation-5.87e+00

Pos logit contributions
paper+0.036
pit+0.034
coat+0.034
ro+0.033
ked+0.033
Neg logit contributions
body-0.035
 of-0.035
 much-0.033
old-0.030
 important-0.030
Tokenw
Token ID86
Feature activation+1.74e-01

Pos logit contributions
body+0.112
 of+0.108
 much+0.104
old+0.098
igators+0.096
Neg logit contributions
paper-0.132
coat-0.126
ro-0.125
pit-0.122
ked-0.121
Token.
Token ID13
Feature activation-5.93e-01

Pos logit contributions
pit+0.004
paper+0.004
coat+0.004
ro+0.004
Or+0.004
Neg logit contributions
body-0.003
 of-0.003
 much-0.003
ug-0.003
 important-0.003
Token
Token ID198
Feature activation+9.14e-01

Pos logit contributions
 of+0.011
 much+0.011
body+0.011
old+0.009
 important+0.009
Neg logit contributions
paper-0.014
coat-0.013
pit-0.013
ked-0.013
ro-0.013
Token
Token ID198
Feature activation-7.32e-01

Pos logit contributions
ro+0.013
paper+0.013
coat+0.012
pit+0.011
Or+0.010
Neg logit contributions
body-0.024
igators-0.022
 earthquake-0.022
 important-0.021
 of-0.021
TokenThen
Token ID6423
Feature activation-1.15e+00

Pos logit contributions
body+0.016
 of+0.015
 much+0.014
old+0.014
igators+0.014
Neg logit contributions
ro-0.014
paper-0.014
coat-0.014
pit-0.014
ked-0.013
Token did
Token ID750
Feature activation-3.64e-01

Pos logit contributions
body+0.040
 of+0.036
 much+0.036
old+0.034
igators+0.034
Neg logit contributions
paper-0.029
coat-0.028
ro-0.028
pit-0.027
ked-0.025
Token something
Token ID1223
Feature activation-1.06e+00

Pos logit contributions
body+0.006
 much+0.006
 of+0.005
 important+0.005
old+0.005
Neg logit contributions
paper-0.010
ro-0.009
ked-0.009
coat-0.009
pit-0.009
Token special
Token ID2041
Feature activation-6.88e-01

Pos logit contributions
body+0.019
ug+0.018
old+0.016
asm+0.016
ime+0.016
Neg logit contributions
paper-0.025
coat-0.024
ro-0.022
pit-0.022
cast-0.022
Token.
Token ID13
Feature activation-2.23e+00

Pos logit contributions
body+0.013
 of+0.012
 much+0.012
old+0.011
 important+0.011
Neg logit contributions
paper-0.016
ro-0.015
coat-0.015
pit-0.015
ked-0.015
Token On
Token ID1550
Feature activation-2.09e+00

Pos logit contributions
body+0.051
 of+0.046
 much+0.045
ug+0.044
old+0.043
Neg logit contributions
paper-0.043
coat-0.042
ro-0.042
pit-0.041
ked-0.037
Token Wed
Token ID3300
Feature activation-5.85e+00

Pos logit contributions
body+0.039
old+0.034
 of+0.034
ug+0.033
 much+0.033
Neg logit contributions
pit-0.045
paper-0.045
ro-0.045
coat-0.045
Or-0.039
Tokennes
Token ID2516
Feature activation-2.72e+00

Pos logit contributions
body+0.101
 of+0.094
 much+0.092
old+0.084
 important+0.082
Neg logit contributions
paper-0.145
ro-0.139
coat-0.139
pit-0.138
ked-0.133
Tokendays
Token ID12545
Feature activation-1.21e+00

Pos logit contributions
body+0.054
 of+0.050
 much+0.049
old+0.045
 important+0.045
Neg logit contributions
paper-0.060
ro-0.058
coat-0.058
pit-0.058
ked-0.054
Token she
Token ID673
Feature activation-1.72e+00

Pos logit contributions
body+0.015
 of+0.014
 much+0.013
old+0.012
 important+0.011
Neg logit contributions
paper-0.036
ro-0.035
coat-0.035
pit-0.034
ked-0.033
Token liked
Token ID8288
Feature activation-2.11e+00

Pos logit contributions
body+0.041
 of+0.039
 much+0.039
old+0.036
 important+0.036
Neg logit contributions
paper-0.031
ro-0.029
pit-0.029
coat-0.029
ked-0.028
Token to
Token ID284
Feature activation-1.91e-01

Pos logit contributions
body+0.050
 of+0.049
 much+0.048
 important+0.044
old+0.044
Neg logit contributions
paper-0.038
pit-0.037
ro-0.036
coat-0.036
ked-0.035
Token was
Token ID373
Feature activation-2.45e+00

Pos logit contributions
paper+0.047
coat+0.044
ro+0.044
pit+0.044
ked+0.042
Neg logit contributions
body-0.049
 of-0.047
 much-0.046
old-0.043
 important-0.042
Token a
Token ID257
Feature activation-1.10e-01

Pos logit contributions
body+0.058
old+0.052
ug+0.052
igators+0.051
 of+0.050
Neg logit contributions
paper-0.041
coat-0.041
ro-0.039
pit-0.038
thank-0.037
Token three
Token ID1115
Feature activation-3.46e+00

Pos logit contributions
body+0.002
 of+0.002
old+0.002
igators+0.002
 much+0.002
Neg logit contributions
paper-0.002
coat-0.002
ro-0.002
pit-0.002
ked-0.002
Token-
Token ID12
Feature activation-4.85e+00

Pos logit contributions
body+0.058
+0.050
 matters+0.050
 possible+0.050
idy+0.048
Neg logit contributions
pit-0.081
ro-0.077
paper-0.076
coat-0.075
suit-0.069
Tokenyear
Token ID1941
Feature activation-2.01e+00

Pos logit contributions
body+0.075
 earthquake+0.060
 obstacle+0.059
 importantly+0.059
 decisions+0.057
Neg logit contributions
paper-0.126
coat-0.121
ro-0.116
pit-0.113
thank-0.112
Token-
Token ID12
Feature activation-5.83e+00

Pos logit contributions
body+0.049
+0.046
 matters+0.043
 importantly+0.042
igators+0.042
Neg logit contributions
ro-0.036
paper-0.035
coat-0.031
pit-0.029
ked-0.029
Tokenold
Token ID727
Feature activation-2.74e+00

Pos logit contributions
body+0.065
 obstacle+0.050
 earthquake+0.050
 tornado+0.048
 importantly+0.045
Neg logit contributions
paper-0.174
coat-0.166
ro-0.162
bel-0.158
thank-0.158
Token girl
Token ID2576
Feature activation-3.31e+00

Pos logit contributions
body+0.063
 of+0.059
 much+0.058
old+0.055
 important+0.054
Neg logit contributions
paper-0.053
ro-0.050
coat-0.050
pit-0.049
ked-0.047
Token called
Token ID1444
Feature activation-8.84e-01

Pos logit contributions
body+0.074
igators+0.063
 obstacle+0.060
idy+0.059
hest+0.058
Neg logit contributions
coat-0.066
ro-0.063
paper-0.060
pit-0.060
thank-0.056
Token Joy
Token ID14087
Feature activation-2.44e+00

Pos logit contributions
body+0.019
 of+0.016
 much+0.016
 important+0.015
ug+0.015
Neg logit contributions
ro-0.019
paper-0.019
coat-0.019
pit-0.018
ked-0.017
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
body+0.057
 earthquake+0.048
igators+0.048
ug+0.047
 much+0.047
Neg logit contributions
paper-0.046
ro-0.044
coat-0.044
pit-0.043
ked-0.042
Token two
Token ID734
Feature activation-3.66e+00

Pos logit contributions
body+0.067
old+0.060
ug+0.058
igators+0.056
 of+0.055
Neg logit contributions
paper-0.053
pit-0.049
coat-0.047
ro-0.046
ked-0.045
Token friends
Token ID2460
Feature activation-3.10e+00

Pos logit contributions
body+0.069
 matters+0.060
igators+0.057
 much+0.056
 of+0.055
Neg logit contributions
pit-0.083
paper-0.082
coat-0.079
ked-0.073
 necks-0.072
Token named
Token ID3706
Feature activation-2.63e+00

Pos logit contributions
body+0.076
igators+0.065
 matters+0.064
idy+0.063
 obstacle+0.061
Neg logit contributions
pit-0.052
coat-0.049
ked-0.048
paper-0.047
 boots-0.043
Token Alex
Token ID4422
Feature activation-2.50e+00

Pos logit contributions
body+0.061
 of+0.052
 much+0.051
 importantly+0.050
old+0.050
Neg logit contributions
paper-0.054
pit-0.051
coat-0.051
ro-0.050
ked-0.047
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
body+0.079
igators+0.072
 earthquake+0.072
 obstacle+0.067
 much+0.067
Neg logit contributions
ro-0.025
ked-0.024
coat-0.024
paper-0.023
pit-0.021
Token Rh
Token ID10323
Feature activation-5.83e+00

Pos logit contributions
body+0.022
 of+0.019
 much+0.018
old+0.017
ug+0.017
Neg logit contributions
paper-0.025
coat-0.024
pit-0.023
ro-0.023
ked-0.023
Tokenonda
Token ID13533
Feature activation-4.58e+00

Pos logit contributions
body+0.113
 much+0.106
 earthquake+0.105
igators+0.103
 of+0.102
Neg logit contributions
paper-0.125
pit-0.122
ro-0.113
coat-0.112
ked-0.110
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
body+0.121
igators+0.102
 earthquake+0.101
 unexpected+0.097
 much+0.096
Neg logit contributions
ked-0.067
paper-0.066
pit-0.064
ro-0.064
coat-0.060
Token Alex
Token ID4422
Feature activation-2.60e+00

Pos logit contributions
body+0.023
 of+0.022
 much+0.022
igators+0.020
old+0.020
Neg logit contributions
paper-0.028
coat-0.027
ro-0.026
pit-0.026
ked-0.026
Token wanted
Token ID2227
Feature activation+6.35e-01

Pos logit contributions
body+0.057
 of+0.054
 much+0.053
old+0.050
 important+0.049
Neg logit contributions
paper-0.052
ro-0.049
coat-0.049
pit-0.049
ked-0.047
Token to
Token ID284
Feature activation+6.63e-01

Pos logit contributions
paper+0.010
ro+0.009
coat+0.009
pit+0.008
ked+0.008
Neg logit contributions
body-0.018
 of-0.016
 much-0.016
ug-0.015
old-0.015
Token "
Token ID366
Feature activation+3.46e+00

Pos logit contributions
pit+0.134
coat+0.133
paper+0.130
ked+0.121
suit+0.117
Neg logit contributions
 much-0.083
 of-0.077
-0.076
body-0.068
 Everyone-0.064
TokenS
Token ID50
Feature activation-1.46e+00

Pos logit contributions
paper+0.076
coat+0.075
pit+0.075
ro+0.069
suit+0.066
Neg logit contributions
body-0.076
 of-0.067
-0.063
 you-0.061
 much-0.060
Tokenara
Token ID3301
Feature activation-1.64e+00

Pos logit contributions
body+0.038
 of+0.036
old+0.034
 much+0.033
ug+0.032
Neg logit contributions
paper-0.023
ro-0.022
coat-0.022
pit-0.022
Or-0.021
Token and
Token ID290
Feature activation+2.32e+00

Pos logit contributions
 of+0.033
body+0.031
 much+0.030
old+0.028
 important+0.027
Neg logit contributions
paper-0.038
pit-0.036
ro-0.035
coat-0.034
ked-0.034
Token Ben
Token ID3932
Feature activation-9.24e-01

Pos logit contributions
paper+0.050
coat+0.047
ro+0.047
pit+0.047
ked+0.045
Neg logit contributions
body-0.048
 of-0.046
 much-0.045
old-0.042
igators-0.041
Token,
Token ID11
Feature activation+4.48e+00

Pos logit contributions
 of+0.015
body+0.015
 much+0.014
old+0.013
+0.013
Neg logit contributions
paper-0.024
ro-0.023
pit-0.023
coat-0.023
ked-0.022
Token what
Token ID644
Feature activation+1.19e+00

Pos logit contributions
paper+0.117
ro+0.112
coat+0.111
pit+0.111
ked+0.107
Neg logit contributions
body-0.071
 of-0.064
 much-0.064
old-0.059
 important-0.057
Token did
Token ID750
Feature activation-2.40e+00

Pos logit contributions
paper+0.019
ro+0.017
pit+0.016
coat+0.016
ked+0.015
Neg logit contributions
body-0.029
ug-0.028
 importantly-0.026
asm-0.026
 obstacle-0.026
Token you
Token ID345
Feature activation+8.02e-01

Pos logit contributions
body+0.029
ug+0.028
 of+0.027
asm+0.027
 instant+0.026
Neg logit contributions
paper-0.065
ro-0.059
coat-0.058
balls-0.056
 shoes-0.056
Token do
Token ID466
Feature activation-8.75e-01

Pos logit contributions
paper+0.020
pit+0.019
ro+0.019
coat+0.019
ked+0.018
Neg logit contributions
body-0.014
 of-0.013
 much-0.013
old-0.012
 important-0.011
Token?
Token ID30
Feature activation-1.06e+00

Pos logit contributions
body+0.013
 of+0.012
 much+0.011
old+0.010
 important+0.010
Neg logit contributions
paper-0.025
coat-0.024
ro-0.024
pit-0.023
ked-0.023
Token like
Token ID588
Feature activation-5.78e-01

Pos logit contributions
paper+0.008
coat+0.008
pit+0.008
ro+0.008
ked+0.008
Neg logit contributions
body-0.007
 of-0.007
 much-0.007
old-0.006
 important-0.006
Token it
Token ID340
Feature activation-2.80e-01

Pos logit contributions
body+0.009
 of+0.008
 much+0.008
old+0.007
 important+0.007
Neg logit contributions
paper-0.015
coat-0.015
ro-0.015
pit-0.015
ked-0.014
Token?"
Token ID1701
Feature activation+6.35e-01

Pos logit contributions
 of+0.003
body+0.003
 much+0.003
 important+0.002
igators+0.002
Neg logit contributions
paper-0.009
coat-0.009
ro-0.008
ked-0.008
pit-0.008
Token Ben
Token ID3932
Feature activation+1.14e+00

Pos logit contributions
paper+0.014
coat+0.013
pit+0.013
ro+0.013
ked+0.013
Neg logit contributions
body-0.013
 of-0.012
 much-0.012
old-0.011
 important-0.011
Token asks
Token ID7893
Feature activation+2.59e+00

Pos logit contributions
paper+0.031
coat+0.030
pit+0.029
ro+0.028
ked+0.028
Neg logit contributions
 of-0.018
-0.016
body-0.015
 much-0.015
old-0.015
Token,
Token ID11
Feature activation+6.50e+00

Pos logit contributions
paper+0.069
coat+0.067
pit+0.066
ro+0.065
ked+0.062
Neg logit contributions
 of-0.042
body-0.039
 much-0.039
old-0.034
ug-0.032
Token showing
Token ID4478
Feature activation-4.84e-01

Pos logit contributions
paper+0.169
coat+0.152
pit+0.148
ro+0.148
suit+0.142
Neg logit contributions
 of-0.121
 much-0.112
-0.095
 God-0.093
 about-0.091
Token his
Token ID465
Feature activation+1.35e+00

Pos logit contributions
body+0.010
 of+0.009
 much+0.009
old+0.009
 important+0.009
Neg logit contributions
paper-0.010
ro-0.010
pit-0.010
coat-0.010
ked-0.009
Token image
Token ID2939
Feature activation+5.59e-01

Pos logit contributions
paper+0.030
coat+0.029
ro+0.029
pit+0.028
ked+0.027
Neg logit contributions
body-0.027
 of-0.026
 much-0.025
 important-0.023
old-0.023
Token to
Token ID284
Feature activation-1.53e-01

Pos logit contributions
paper+0.013
coat+0.013
ro+0.013
pit+0.012
ked+0.012
Neg logit contributions
 of-0.010
body-0.010
 much-0.010
old-0.009
 important-0.009
Token Anna
Token ID11735
Feature activation-1.15e+00

Pos logit contributions
body+0.003
 of+0.003
 much+0.003
old+0.003
 important+0.003
Neg logit contributions
paper-0.003
ro-0.003
pit-0.003
coat-0.003
ked-0.003
Token!
Token ID0
Feature activation-1.93e+00

Pos logit contributions
body+0.018
igators+0.016
old+0.016
 much+0.015
 importantly+0.015
Neg logit contributions
paper-0.022
coat-0.021
ro-0.021
pit-0.020
ked-0.019
Token He
Token ID679
Feature activation+1.01e+00

Pos logit contributions
body+0.038
 of+0.034
 much+0.034
 important+0.032
igators+0.032
Neg logit contributions
paper-0.043
ro-0.041
pit-0.040
coat-0.040
ked-0.039
Token picked
Token ID6497
Feature activation+8.85e-01

Pos logit contributions
paper+0.020
coat+0.019
pit+0.019
ro+0.018
ked+0.017
Neg logit contributions
 of-0.022
body-0.022
 much-0.022
-0.020
old-0.020
Token it
Token ID340
Feature activation+2.14e+00

Pos logit contributions
paper+0.024
ro+0.023
coat+0.023
pit+0.023
ked+0.022
Neg logit contributions
body-0.014
 of-0.012
 much-0.012
old-0.011
igators-0.011
Token up
Token ID510
Feature activation-5.68e-02

Pos logit contributions
paper+0.050
ro+0.047
coat+0.046
pit+0.046
ked+0.045
Neg logit contributions
body-0.042
 much-0.036
 of-0.036
old-0.035
igators-0.035
Token,
Token ID11
Feature activation+4.50e+00

Pos logit contributions
 of+0.001
 much+0.001
+0.001
body+0.001
 important+0.001
Neg logit contributions
paper-0.001
pit-0.001
coat-0.001
ro-0.001
 extract-0.001
Token and
Token ID290
Feature activation-8.63e-02

Pos logit contributions
paper+0.101
pit+0.099
ro+0.095
coat+0.094
ked+0.086
Neg logit contributions
 of-0.091
 much-0.087
body-0.085
-0.079
 important-0.076
Token it
Token ID340
Feature activation-3.41e-02

Pos logit contributions
 of+0.002
 much+0.001
body+0.001
+0.001
old+0.001
Neg logit contributions
coat-0.002
pit-0.002
ro-0.002
paper-0.002
suit-0.002
Token seemed
Token ID3947
Feature activation-1.22e+00

Pos logit contributions
 of+0.001
 much+0.001
body+0.001
+0.001
 important+0.001
Neg logit contributions
pit-0.001
paper-0.001
coat-0.001
ro-0.001
ked-0.001
Token to
Token ID284
Feature activation+3.51e-01

Pos logit contributions
 much+0.023
 of+0.023
body+0.022
 important+0.021
old+0.018
Neg logit contributions
pit-0.028
paper-0.028
coat-0.028
ro-0.028
ked-0.026
Token be
Token ID307
Feature activation-2.67e+00

Pos logit contributions
paper+0.008
pit+0.008
coat+0.008
ked+0.008
ro+0.007
Neg logit contributions
 of-0.007
body-0.007
 much-0.006
 important-0.006
-0.006
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
paper+0.028
coat+0.027
pit+0.027
ro+0.026
ked+0.026
Neg logit contributions
 of-0.022
body-0.021
 much-0.020
old-0.018
-0.017
Token
Token ID198
Feature activation-3.55e-01

Pos logit contributions
paper+0.022
ro+0.021
pit+0.021
coat+0.021
ked+0.019
Neg logit contributions
body-0.025
 of-0.023
 much-0.023
old-0.022
igators-0.022
TokenM
Token ID44
Feature activation-1.90e-01

Pos logit contributions
body+0.007
 of+0.007
 much+0.007
old+0.006
+0.006
Neg logit contributions
paper-0.008
coat-0.008
ro-0.008
ked-0.008
pit-0.007
Tokenia
Token ID544
Feature activation-9.15e-01

Pos logit contributions
body+0.003
 of+0.003
 much+0.003
old+0.003
 important+0.003
Neg logit contributions
paper-0.005
coat-0.005
ro-0.005
pit-0.005
ked-0.004
Token said
Token ID531
Feature activation+4.07e+00

Pos logit contributions
 of+0.017
old+0.015
+0.015
body+0.014
 much+0.013
Neg logit contributions
paper-0.023
pit-0.021
coat-0.021
ro-0.020
 extract-0.020
Token,
Token ID11
Feature activation+7.05e+00

Pos logit contributions
paper+0.099
ro+0.097
pit+0.097
coat+0.096
ked+0.090
Neg logit contributions
 of-0.070
body-0.069
 much-0.067
old-0.059
ug-0.058
Token "
Token ID366
Feature activation+3.60e+00

Pos logit contributions
paper+0.189
pit+0.186
ro+0.180
aj+0.177
coat+0.176
Neg logit contributions
 of-0.107
 much-0.092
-0.076
 you-0.074
body-0.073
TokenNo
Token ID2949
Feature activation+7.99e-01

Pos logit contributions
pit+0.072
paper+0.070
aj+0.067
coat+0.067
cast+0.066
Neg logit contributions
body-0.081
 of-0.072
 much-0.071
-0.069
 you-0.067
Token,
Token ID11
Feature activation+3.39e+00

Pos logit contributions
paper+0.014
pit+0.013
coat+0.013
ro+0.012
ked+0.012
Neg logit contributions
body-0.020
igators-0.017
 much-0.017
 of-0.017
old-0.017
Token Tim
Token ID5045
Feature activation-2.13e+00

Pos logit contributions
paper+0.079
ro+0.075
pit+0.075
coat+0.074
ked+0.072
Neg logit contributions
body-0.064
 of-0.059
 much-0.058
old-0.056
 important-0.053
Token,
Token ID11
Feature activation+3.85e+00

Pos logit contributions
body+0.039
 of+0.037
 much+0.036
old+0.033
 important+0.033
Neg logit contributions
paper-0.050
ro-0.048
coat-0.048
pit-0.048
ked-0.046
Token stopped
Token ID5025
Feature activation-6.42e-01

Pos logit contributions
 of+0.012
body+0.012
 much+0.012
 important+0.011
+0.010
Neg logit contributions
paper-0.011
pit-0.010
coat-0.010
ro-0.010
ked-0.009
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
 of+0.010
body+0.010
 much+0.010
+0.009
ug+0.009
Neg logit contributions
coat-0.016
paper-0.016
ro-0.016
pit-0.016
ked-0.015
Token Sarah
Token ID10490
Feature activation-1.13e+00

Pos logit contributions
body+0.008
 of+0.007
 much+0.007
old+0.006
 important+0.006
Neg logit contributions
paper-0.010
pit-0.009
ro-0.009
coat-0.009
ked-0.009
Token took
Token ID1718
Feature activation-3.83e-01

Pos logit contributions
 of+0.023
body+0.022
+0.021
 much+0.021
old+0.020
Neg logit contributions
paper-0.025
pit-0.023
ro-0.023
coat-0.023
ked-0.021
Token a
Token ID257
Feature activation+3.70e+00

Pos logit contributions
body+0.008
igators+0.007
 of+0.006
 importantly+0.006
 much+0.006
Neg logit contributions
paper-0.009
coat-0.008
ro-0.008
ked-0.008
pit-0.007
Token closer
Token ID5699
Feature activation+5.88e+00

Pos logit contributions
paper+0.082
coat+0.081
ro+0.079
pit+0.076
ked+0.074
Neg logit contributions
body-0.072
 of-0.065
 much-0.064
igators-0.063
old-0.063
Token look
Token ID804
Feature activation+1.46e+00

Pos logit contributions
paper+0.113
pit+0.110
coat+0.107
ro+0.107
ked+0.106
Neg logit contributions
 of-0.132
body-0.129
 much-0.124
old-0.112
-0.110
Token and
Token ID290
Feature activation-4.98e-01

Pos logit contributions
paper+0.030
coat+0.029
ro+0.029
pit+0.029
ked+0.027
Neg logit contributions
body-0.031
 of-0.030
 much-0.029
old-0.027
 important-0.026
Token saw
Token ID2497
Feature activation+4.15e-01

Pos logit contributions
 of+0.012
 much+0.012
body+0.011
+0.011
old+0.011
Neg logit contributions
coat-0.009
ro-0.009
paper-0.009
pit-0.009
ked-0.008
Token that
Token ID326
Feature activation-1.04e+00

Pos logit contributions
paper+0.011
ro+0.011
coat+0.011
pit+0.011
ked+0.010
Neg logit contributions
body-0.006
 of-0.006
 much-0.006
old-0.005
 important-0.005
Token the
Token ID262
Feature activation-9.37e-01

Pos logit contributions
body+0.019
 of+0.016
 much+0.016
old+0.016
ug+0.016
Neg logit contributions
paper-0.025
pit-0.024
coat-0.024
ro-0.023
ked-0.022
Token beach
Token ID10481
Feature activation+6.73e-01

Pos logit contributions
body+0.037
 of+0.035
 much+0.034
old+0.032
igators+0.031
Neg logit contributions
paper-0.040
ro-0.038
coat-0.038
pit-0.038
ked-0.036
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
paper+0.016
coat+0.016
ked+0.015
ro+0.015
pit+0.015
Neg logit contributions
 of-0.012
 much-0.012
body-0.011
 important-0.010
old-0.010
Token So
Token ID1406
Feature activation-1.29e+00

Pos logit contributions
body+0.031
ug+0.028
 of+0.028
igators+0.027
idy+0.026
Neg logit contributions
paper-0.020
pit-0.020
ro-0.019
coat-0.018
ked-0.017
Token Jack
Token ID3619
Feature activation-8.58e-01

Pos logit contributions
body+0.028
igators+0.025
old+0.025
ug+0.024
 of+0.024
Neg logit contributions
paper-0.025
ro-0.024
pit-0.024
ked-0.022
coat-0.022
Token and
Token ID290
Feature activation-7.23e-01

Pos logit contributions
body+0.019
 of+0.019
 much+0.018
old+0.017
 important+0.016
Neg logit contributions
paper-0.017
coat-0.016
pit-0.016
ro-0.016
ked-0.015
Token his
Token ID465
Feature activation+2.93e+00

Pos logit contributions
body+0.019
ug+0.017
igators+0.017
 of+0.017
 much+0.017
Neg logit contributions
paper-0.011
ro-0.010
pit-0.010
thank-0.009
ked-0.009
Token dad
Token ID9955
Feature activation+3.48e-01

Pos logit contributions
paper+0.058
ro+0.055
coat+0.055
pit+0.055
ked+0.052
Neg logit contributions
body-0.066
 of-0.062
 much-0.061
old-0.057
igators-0.056
Token parted
Token ID37813
Feature activation-2.83e-01

Pos logit contributions
paper+0.006
ro+0.006
coat+0.006
pit+0.006
ked+0.006
Neg logit contributions
body-0.008
 of-0.008
 much-0.008
old-0.007
 important-0.007
Token ways
Token ID2842
Feature activation-1.63e+00

Pos logit contributions
body+0.005
 much+0.004
 of+0.004
old+0.004
igators+0.004
Neg logit contributions
paper-0.007
ro-0.007
pit-0.007
coat-0.007
ked-0.006
Token with
Token ID351
Feature activation-3.59e+00

Pos logit contributions
body+0.039
 much+0.034
igators+0.034
 of+0.033
old+0.033
Neg logit contributions
paper-0.030
ro-0.029
pit-0.028
coat-0.027
ked-0.026
Token the
Token ID262
Feature activation-8.24e-01

Pos logit contributions
body+0.065
 of+0.057
 much+0.055
old+0.053
igators+0.052
Neg logit contributions
paper-0.087
ro-0.081
pit-0.081
coat-0.081
ked-0.080
TokenC
Token ID34
Feature activation-2.37e+00

Pos logit contributions
paper+0.088
pit+0.082
coat+0.077
ro+0.077
ked+0.075
Neg logit contributions
body-0.099
 of-0.091
 much-0.090
-0.085
 important-0.084
Tokenrows
Token ID8516
Feature activation-3.53e-01

Pos logit contributions
body+0.044
 of+0.043
 much+0.041
igators+0.037
old+0.037
Neg logit contributions
paper-0.056
ro-0.054
coat-0.052
pit-0.052
ked-0.049
Token don
Token ID836
Feature activation-3.39e-01

Pos logit contributions
body+0.007
 of+0.007
 much+0.006
old+0.006
igators+0.006
Neg logit contributions
paper-0.008
coat-0.008
pit-0.008
ro-0.008
ked-0.007
Token't
Token ID470
Feature activation-2.96e-01

Pos logit contributions
old+0.008
body+0.008
igators+0.008
 importantly+0.008
 sorts+0.008
Neg logit contributions
paper-0.005
driver-0.004
coat-0.004
ked-0.004
-0.004
Token like
Token ID588
Feature activation-3.45e-01

Pos logit contributions
body+0.007
old+0.006
 importantly+0.006
 much+0.006
 of+0.006
Neg logit contributions
paper-0.006
coat-0.006
ro-0.005
ked-0.005
thank-0.005
Token it
Token ID340
Feature activation+4.96e-01

Pos logit contributions
body+0.005
 of+0.005
 much+0.005
old+0.004
ug+0.004
Neg logit contributions
paper-0.009
ro-0.009
coat-0.009
pit-0.008
ked-0.008
Token when
Token ID618
Feature activation-2.86e+00

Pos logit contributions
paper+0.012
ro+0.012
coat+0.012
pit+0.012
ked+0.011
Neg logit contributions
body-0.009
 of-0.008
 much-0.008
old-0.007
 important-0.007
Token people
Token ID661
Feature activation-2.31e+00

Pos logit contributions
body+0.037
 of+0.032
 much+0.031
old+0.031
ug+0.028
Neg logit contributions
paper-0.081
ro-0.078
coat-0.077
pit-0.077
ked-0.074
Token come
Token ID1282
Feature activation+1.08e+00

Pos logit contributions
body+0.048
 of+0.046
 much+0.044
old+0.041
igators+0.040
Neg logit contributions
paper-0.050
pit-0.047
coat-0.047
ro-0.047
ked-0.045
Token too
Token ID1165
Feature activation-4.91e-01

Pos logit contributions
paper+0.025
pit+0.024
coat+0.024
ro+0.023
ked+0.022
Neg logit contributions
 of-0.021
body-0.020
 much-0.020
 important-0.018
-0.017
Token close
Token ID1969
Feature activation-5.75e-01

Pos logit contributions
body+0.010
old+0.009
 importantly+0.009
azy+0.009
ug+0.009
Neg logit contributions
paper-0.010
ro-0.009
coat-0.009
pit-0.009
ked-0.008
Token!
Token ID0
Feature activation-7.77e-01

Pos logit contributions
body+0.005
 of+0.005
 much+0.005
 important+0.004
old+0.004
Neg logit contributions
paper-0.007
coat-0.007
pit-0.007
ro-0.007
ked-0.006
Token She
Token ID1375
Feature activation+8.88e-01

Pos logit contributions
body+0.016
 of+0.016
 much+0.016
old+0.015
 important+0.014
Neg logit contributions
paper-0.016
coat-0.015
pit-0.015
ro-0.015
ked-0.015
Token moved
Token ID3888
Feature activation+6.18e-01

Pos logit contributions
paper+0.019
coat+0.017
pit+0.017
ro+0.017
ked+0.016
Neg logit contributions
body-0.019
 of-0.018
 much-0.018
old-0.017
 important-0.016
Token around
Token ID1088
Feature activation+3.61e-01

Pos logit contributions
paper+0.017
pit+0.016
coat+0.016
ro+0.016
ked+0.015
Neg logit contributions
 of-0.010
 much-0.010
body-0.008
-0.008
 important-0.008
Token the
Token ID262
Feature activation-4.30e-01

Pos logit contributions
paper+0.009
coat+0.009
pit+0.009
ked+0.009
ro+0.008
Neg logit contributions
 of-0.006
 much-0.006
body-0.006
 important-0.005
-0.005
Token me
Token ID502
Feature activation+9.53e-01

Pos logit contributions
body+0.009
 of+0.008
 much+0.008
 important+0.008
old+0.007
Neg logit contributions
paper-0.009
ked-0.009
coat-0.009
pit-0.009
ro-0.009
Tokenadow
Token ID4584
Feature activation+1.52e+00

Pos logit contributions
paper+0.027
ro+0.026
coat+0.026
pit+0.026
ked+0.026
Neg logit contributions
body-0.012
 of-0.012
 much-0.011
old-0.010
 important-0.009
Token again
Token ID757
Feature activation+1.08e-02

Pos logit contributions
paper+0.037
ked+0.034
coat+0.034
ro+0.033
pit+0.032
Neg logit contributions
 of-0.028
 much-0.027
body-0.025
 important-0.023
old-0.022
Token,
Token ID11
Feature activation+2.88e+00

Pos logit contributions
paper+0.000
coat+0.000
ked+0.000
pit+0.000
ro+0.000
Neg logit contributions
 of-0.000
 much-0.000
body-0.000
-0.000
 important-0.000
Token this
Token ID428
Feature activation-1.33e-01

Pos logit contributions
paper+0.070
ro+0.065
coat+0.064
pit+0.064
ked+0.062
Neg logit contributions
 of-0.051
 much-0.051
body-0.048
-0.045
 important-0.043
Token time
Token ID640
Feature activation-4.87e-01

Pos logit contributions
body+0.001
 of+0.001
 much+0.001
old+0.001
igators+0.001
Neg logit contributions
paper-0.004
ro-0.004
coat-0.004
pit-0.004
ked-0.004
Token feel
Token ID1254
Feature activation-3.59e+00

Pos logit contributions
paper+0.029
pit+0.029
ked+0.027
coat+0.027
ro+0.026
Neg logit contributions
body-0.021
 of-0.021
 much-0.021
 important-0.018
-0.018
Token better
Token ID1365
Feature activation+3.34e-01

Pos logit contributions
body+0.057
 of+0.050
old+0.047
ug+0.046
 much+0.046
Neg logit contributions
paper-0.090
coat-0.087
ro-0.087
pit-0.084
ked-0.083
Token and
Token ID290
Feature activation-1.53e+00

Pos logit contributions
paper+0.009
pit+0.008
coat+0.008
ked+0.008
ro+0.008
Neg logit contributions
 of-0.006
 much-0.005
body-0.005
 important-0.005
ug-0.004
Token you
Token ID345
Feature activation+1.16e+00

Pos logit contributions
body+0.027
 of+0.025
 much+0.024
old+0.022
 important+0.022
Neg logit contributions
paper-0.038
ro-0.036
coat-0.036
pit-0.036
ked-0.035
Token can
Token ID460
Feature activation+1.73e+00

Pos logit contributions
paper+0.021
pit+0.021
coat+0.020
ro+0.020
ked+0.019
Neg logit contributions
body-0.028
 of-0.027
 much-0.027
 important-0.025
old-0.025
Token go
Token ID467
Feature activation+1.52e+00

Pos logit contributions
paper+0.040
pit+0.038
ro+0.038
coat+0.038
ked+0.037
Neg logit contributions
body-0.032
 of-0.032
 much-0.031
old-0.028
 important-0.027
Token home
Token ID1363
Feature activation+3.46e-01

Pos logit contributions
paper+0.034
pit+0.033
ro+0.032
ked+0.032
coat+0.031
Neg logit contributions
 of-0.031
 much-0.030
body-0.029
 important-0.026
old-0.026
Token."
Token ID526
Feature activation+6.22e-01

Pos logit contributions
paper+0.009
coat+0.009
pit+0.008
ked+0.008
ro+0.008
Neg logit contributions
 of-0.006
body-0.006
 much-0.006
 important-0.005
old-0.005
Token
Token ID198
Feature activation+7.70e-01

Pos logit contributions
paper+0.013
coat+0.013
pit+0.013
ro+0.013
ked+0.012
Neg logit contributions
body-0.013
 of-0.012
 much-0.012
old-0.011
 important-0.011
Token
Token ID198
Feature activation-7.36e-01

Pos logit contributions
paper+0.015
ro+0.014
coat+0.014
pit+0.014
ked+0.013
Neg logit contributions
body-0.018
 of-0.016
 much-0.016
igators-0.015
 important-0.015
TokenBut
Token ID1537
Feature activation-1.28e+00

Pos logit contributions
body+0.017
 of+0.016
 much+0.016
old+0.015
 important+0.014
Neg logit contributions
paper-0.015
coat-0.014
pit-0.014
ro-0.014
ked-0.013
Token They
Token ID1119
Feature activation+2.82e-01

Pos logit contributions
body+0.010
 of+0.010
 much+0.010
old+0.009
 important+0.009
Neg logit contributions
paper-0.013
coat-0.013
pit-0.012
ro-0.012
ked-0.012
Token saw
Token ID2497
Feature activation-2.17e-01

Pos logit contributions
paper+0.006
ro+0.006
coat+0.006
pit+0.005
ked+0.005
Neg logit contributions
 of-0.006
 much-0.006
body-0.005
old-0.005
-0.005
Token trains
Token ID13404
Feature activation-9.97e-01

Pos logit contributions
body+0.004
 of+0.003
 much+0.003
old+0.003
 important+0.003
Neg logit contributions
paper-0.005
ro-0.005
coat-0.005
pit-0.005
ked-0.005
Token that
Token ID326
Feature activation-1.03e-01

Pos logit contributions
 of+0.019
body+0.019
 much+0.018
old+0.016
igators+0.016
Neg logit contributions
paper-0.023
ro-0.022
coat-0.021
pit-0.021
ked-0.021
Token were
Token ID547
Feature activation-2.87e+00

Pos logit contributions
body+0.002
 of+0.002
 much+0.002
old+0.002
 important+0.002
Neg logit contributions
paper-0.002
ro-0.002
coat-0.002
pit-0.002
ked-0.002
Token red
Token ID2266
Feature activation+9.43e-01

Pos logit contributions
body+0.047
 of+0.046
 much+0.045
 important+0.040
igators+0.039
Neg logit contributions
paper-0.073
ro-0.070
coat-0.070
pit-0.070
ked-0.067
Token,
Token ID11
Feature activation+3.17e+00

Pos logit contributions
paper+0.020
pit+0.019
coat+0.019
ro+0.018
ked+0.018
Neg logit contributions
 of-0.020
body-0.019
 much-0.018
ug-0.016
 important-0.016
Token blue
Token ID4171
Feature activation+2.64e-01

Pos logit contributions
paper+0.069
pit+0.068
coat+0.064
ro+0.064
ked+0.062
Neg logit contributions
 of-0.065
 much-0.063
body-0.061
 important-0.056
igators-0.053
Token,
Token ID11
Feature activation+3.83e+00

Pos logit contributions
paper+0.006
coat+0.006
pit+0.006
thank+0.006
Or+0.006
Neg logit contributions
 of-0.005
body-0.004
 much-0.004
ug-0.004
-0.004
Token green
Token ID4077
Feature activation+1.16e+00

Pos logit contributions
pit+0.085
paper+0.084
coat+0.079
ro+0.078
ked+0.077
Neg logit contributions
 of-0.078
 much-0.074
body-0.069
 important-0.064
-0.061
Token,
Token ID11
Feature activation+4.10e+00

Pos logit contributions
paper+0.028
coat+0.027
pit+0.026
ked+0.026
thank+0.026
Neg logit contributions
 of-0.022
body-0.018
 much-0.018
ug-0.017
-0.016
Token her
Token ID607
Feature activation-3.67e-01

Pos logit contributions
body+0.088
 of+0.080
 much+0.079
old+0.076
ug+0.074
Neg logit contributions
paper-0.079
coat-0.074
ro-0.074
pit-0.072
ked-0.069
Token toys
Token ID14958
Feature activation+2.36e-01

Pos logit contributions
 of+0.007
body+0.007
 much+0.007
 important+0.007
igators+0.006
Neg logit contributions
paper-0.008
ro-0.008
pit-0.008
coat-0.007
ked-0.007
Token and
Token ID290
Feature activation-2.61e-01

Pos logit contributions
paper+0.006
ro+0.006
ked+0.006
pit+0.006
coat+0.006
Neg logit contributions
 of-0.004
 much-0.004
body-0.004
 important-0.003
 year-0.003
Token talk
Token ID1561
Feature activation-4.84e-01

Pos logit contributions
body+0.005
 of+0.005
 much+0.005
 important+0.004
old+0.004
Neg logit contributions
paper-0.006
ro-0.006
pit-0.006
coat-0.005
ked-0.005
Token on
Token ID319
Feature activation-1.65e+00

Pos logit contributions
 of+0.009
 much+0.009
body+0.007
 important+0.007
old+0.007
Neg logit contributions
paper-0.012
ro-0.012
pit-0.012
coat-0.012
ked-0.011
Token her
Token ID607
Feature activation+1.54e-01

Pos logit contributions
body+0.034
 of+0.030
old+0.029
ug+0.028
 much+0.028
Neg logit contributions
paper-0.037
ro-0.035
coat-0.035
pit-0.034
ked-0.032
Token phone
Token ID3072
Feature activation+3.10e-01

Pos logit contributions
paper+0.003
ro+0.003
pit+0.003
coat+0.003
ked+0.003
Neg logit contributions
body-0.004
 of-0.004
 much-0.003
 important-0.003
old-0.003
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
paper+0.008
ro+0.008
pit+0.008
coat+0.008
ked+0.008
Neg logit contributions
 of-0.005
 much-0.004
body-0.004
 important-0.004
old-0.003
Token One
Token ID1881
Feature activation-3.83e-01

Pos logit contributions
 of+0.018
 much+0.018
body+0.018
 important+0.015
old+0.015
Neg logit contributions
paper-0.037
ked-0.035
pit-0.035
coat-0.035
ro-0.035
Token day
Token ID1110
Feature activation+7.28e-01

Pos logit contributions
body+0.008
ug+0.007
 importantly+0.007
 decisions+0.007
asm+0.007
Neg logit contributions
coat-0.008
ro-0.008
paper-0.007
pit-0.007
thank-0.007
Token,
Token ID11
Feature activation-1.09e-01

Pos logit contributions
paper+0.015
coat+0.015
ro+0.015
pit+0.014
ked+0.014
Neg logit contributions
body-0.015
 of-0.014
 much-0.014
igators-0.013
old-0.013
Token flowers
Token ID12734
Feature activation-1.08e+00

Pos logit contributions
 of+0.021
body+0.021
 much+0.021
 important+0.018
igators+0.018
Neg logit contributions
paper-0.027
coat-0.026
pit-0.026
ro-0.026
ked-0.025
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 of+0.018
 much+0.014
body+0.012
+0.011
igators+0.011
Neg logit contributions
paper-0.029
ro-0.028
pit-0.028
coat-0.028
aj-0.028
Token Soon
Token ID15894
Feature activation-1.11e+00

Pos logit contributions
body+0.022
 of+0.020
 much+0.020
old+0.019
 important+0.018
Neg logit contributions
paper-0.021
ro-0.021
coat-0.020
pit-0.020
ked-0.019
Token it
Token ID340
Feature activation-2.54e-01

Pos logit contributions
body+0.021
 of+0.019
 much+0.019
old+0.018
igators+0.017
Neg logit contributions
paper-0.027
ro-0.025
pit-0.025
coat-0.025
ked-0.024
Token saw
Token ID2497
Feature activation-1.23e+00

Pos logit contributions
 of+0.005
body+0.005
 much+0.005
+0.004
 important+0.004
Neg logit contributions
paper-0.006
coat-0.006
pit-0.006
ro-0.005
ked-0.005
Token something
Token ID1223
Feature activation-1.32e+00

Pos logit contributions
body+0.021
 of+0.019
 much+0.018
old+0.017
 important+0.016
Neg logit contributions
paper-0.031
ro-0.030
coat-0.030
pit-0.030
ked-0.029
Token new
Token ID649
Feature activation-9.24e-01

Pos logit contributions
body+0.025
old+0.022
ug+0.022
 of+0.021
igators+0.021
Neg logit contributions
paper-0.031
coat-0.028
pit-0.028
ro-0.028
ked-0.027
Token -
Token ID532
Feature activation-6.84e-01

Pos logit contributions
 of+0.019
body+0.019
 much+0.018
old+0.016
 important+0.016
Neg logit contributions
paper-0.020
coat-0.019
ro-0.019
pit-0.018
ked-0.018
Token a
Token ID257
Feature activation+3.47e-01

Pos logit contributions
body+0.013
old+0.012
 of+0.012
 much+0.011
ug+0.011
Neg logit contributions
paper-0.015
ro-0.015
coat-0.014
pit-0.014
ked-0.014
Token shiny
Token ID22441
Feature activation+1.86e+00

Pos logit contributions
paper+0.008
ro+0.008
coat+0.008
pit+0.008
ked+0.008
Neg logit contributions
body-0.006
 of-0.006
 much-0.006
 important-0.005
old-0.005
Token stone
Token ID7815
Feature activation-6.07e-01

Pos logit contributions
paper+0.038
coat+0.035
ked+0.035
pit+0.035
ro+0.035
Neg logit contributions
body-0.041
 of-0.039
 much-0.037
 important-0.035
 earthquake-0.033
Token said
Token ID531
Feature activation+4.19e+00

Pos logit contributions
paper+0.006
coat+0.006
ro+0.006
pit+0.006
ked+0.006
Neg logit contributions
body-0.007
 of-0.007
 much-0.007
old-0.006
 important-0.006
Token.
Token ID13
Feature activation+2.81e+00

Pos logit contributions
paper+0.102
pit+0.098
coat+0.098
ro+0.098
ked+0.093
Neg logit contributions
body-0.073
 of-0.071
 much-0.069
old-0.063
 important-0.061
Token "
Token ID366
Feature activation+4.36e+00

Pos logit contributions
coat+0.080
paper+0.079
pit+0.073
thank+0.072
fl+0.070
Neg logit contributions
 of-0.048
 much-0.044
-0.034
body-0.032
 about-0.031
TokenYou
Token ID1639
Feature activation+9.55e-01

Pos logit contributions
paper+0.098
coat+0.096
pit+0.095
ro+0.090
ked+0.087
Neg logit contributions
body-0.086
 of-0.081
 much-0.079
old-0.072
-0.071
Token might
Token ID1244
Feature activation+6.62e-01

Pos logit contributions
paper+0.019
coat+0.017
ro+0.017
ked+0.017
pit+0.016
Neg logit contributions
body-0.021
old-0.019
 much-0.019
 importantly-0.019
 of-0.019
Token damage
Token ID2465
Feature activation-3.10e-01

Pos logit contributions
paper+0.013
ro+0.013
coat+0.012
pit+0.011
ked+0.011
Neg logit contributions
body-0.015
old-0.014
 of-0.014
 importantly-0.013
ug-0.013
Token your
Token ID534
Feature activation+3.09e+00

Pos logit contributions
body+0.004
 of+0.004
 much+0.004
old+0.003
 important+0.003
Neg logit contributions
paper-0.009
ro-0.008
coat-0.008
pit-0.008
ked-0.008
Token suitcase
Token ID45391
Feature activation+1.95e+00

Pos logit contributions
paper+0.064
ked+0.060
pit+0.060
coat+0.058
ro+0.058
Neg logit contributions
body-0.067
 of-0.066
 much-0.064
 important-0.062
 year-0.056
Token or
Token ID393
Feature activation-4.34e-01

Pos logit contributions
paper+0.049
ked+0.048
coat+0.046
pit+0.046
ro+0.045
Neg logit contributions
 of-0.034
 much-0.031
body-0.031
 important-0.028
igators-0.026
Token someone
Token ID2130
Feature activation-2.09e+00

Pos logit contributions
body+0.006
 of+0.005
 much+0.005
old+0.005
 important+0.004
Neg logit contributions
paper-0.012
pit-0.012
coat-0.012
ro-0.012
ked-0.012
Token else
Token ID2073
Feature activation-3.89e-01

Pos logit contributions
body+0.026
 of+0.023
 much+0.022
ug+0.019
old+0.019
Neg logit contributions
paper-0.062
coat-0.059
ro-0.059
pit-0.058
ked-0.057
Token let
Token ID1309
Feature activation-6.70e-01

Pos logit contributions
paper+0.034
coat+0.033
pit+0.032
ro+0.032
ked+0.031
Neg logit contributions
body-0.023
 of-0.023
 much-0.021
 important-0.019
old-0.019
Token's
Token ID338
Feature activation-6.82e-01

Pos logit contributions
body+0.012
 importantly+0.012
igators+0.010
ug+0.010
 much+0.010
Neg logit contributions
paper-0.016
ro-0.014
coat-0.014
pit-0.014
 bisc-0.013
Token go
Token ID467
Feature activation+1.78e+00

Pos logit contributions
body+0.013
 of+0.012
 much+0.012
old+0.011
igators+0.011
Neg logit contributions
paper-0.016
ro-0.015
coat-0.015
pit-0.015
ked-0.014
Token."
Token ID526
Feature activation+1.51e-01

Pos logit contributions
paper+0.038
pit+0.036
ro+0.035
ked+0.035
coat+0.035
Neg logit contributions
 of-0.038
body-0.037
 much-0.036
old-0.033
 important-0.031
Token<|endoftext|>
Token ID50256
Feature activation+2.10e-01

Pos logit contributions
paper+0.004
coat+0.004
pit+0.004
ked+0.004
ro+0.003
Neg logit contributions
 of-0.003
 much-0.003
body-0.002
old-0.002
-0.002
TokenBen
Token ID11696
Feature activation+1.05e-01

Pos logit contributions
paper+0.003
ro+0.003
coat+0.003
pit+0.003
ked+0.003
Neg logit contributions
body-0.006
 of-0.005
 much-0.005
old-0.005
 important-0.005
Token and
Token ID290
Feature activation+2.19e-01

Pos logit contributions
paper+0.003
pit+0.003
ked+0.003
coat+0.002
bel+0.002
Neg logit contributions
 of-0.002
old-0.002
body-0.002
-0.002
 much-0.002
Token Lily
Token ID20037
Feature activation-1.01e-01

Pos logit contributions
paper+0.004
ro+0.004
coat+0.004
pit+0.004
ked+0.004
Neg logit contributions
body-0.005
 much-0.004
 of-0.004
 important-0.004
ug-0.004
Token were
Token ID547
Feature activation-8.91e-01

Pos logit contributions
 of+0.002
+0.002
body+0.002
 much+0.002
old+0.002
Neg logit contributions
paper-0.002
pit-0.002
ro-0.002
coat-0.002
ked-0.002
Token playing
Token ID2712
Feature activation+1.97e+00

Pos logit contributions
body+0.021
igators+0.017
idy+0.017
icated+0.016
ug+0.016
Neg logit contributions
pit-0.017
coat-0.017
aj-0.016
paper-0.015
ked-0.014
Token in
Token ID287
Feature activation-1.75e+00

Pos logit contributions
paper+0.056
coat+0.052
ro+0.052
pit+0.050
ked+0.050
Neg logit contributions
 of-0.030
body-0.028
 much-0.025
ug-0.023
old-0.022
Token to
Token ID284
Feature activation+2.23e+00

Pos logit contributions
pit+0.017
coat+0.017
ro+0.016
paper+0.016
suit+0.015
Neg logit contributions
 of-0.015
 much-0.014
old-0.012
 important-0.012
body-0.011
Token carefully
Token ID7773
Feature activation+2.10e+00

Pos logit contributions
pit+0.051
paper+0.050
coat+0.047
ro+0.047
ked+0.047
Neg logit contributions
 of-0.041
 much-0.040
body-0.039
-0.038
 matters-0.036
Token shave
Token ID34494
Feature activation+5.31e-01

Pos logit contributions
pit+0.046
coat+0.046
paper+0.045
ked+0.043
ro+0.043
Neg logit contributions
 of-0.041
 much-0.038
-0.036
body-0.034
 matters-0.033
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
paper+0.013
coat+0.013
ro+0.013
pit+0.013
ked+0.012
Neg logit contributions
body-0.009
 of-0.009
 much-0.009
 important-0.008
old-0.008
Token It
Token ID632
Feature activation+8.30e-01

Pos logit contributions
body+0.023
 of+0.022
 much+0.022
old+0.020
 important+0.019
Neg logit contributions
paper-0.024
coat-0.023
pit-0.023
ro-0.023
ked-0.022
Token was
Token ID373
Feature activation-2.15e+00

Pos logit contributions
pit+0.018
paper+0.017
coat+0.017
ro+0.017
ked+0.016
Neg logit contributions
 of-0.017
 much-0.016
body-0.016
-0.015
 important-0.015
Token perfect
Token ID2818
Feature activation-2.09e-01

Pos logit contributions
 much+0.028
 of+0.027
 important+0.024
body+0.023
 importantly+0.019
Neg logit contributions
paper-0.064
ro-0.064
pit-0.064
coat-0.063
ked-0.061
Token!
Token ID0
Feature activation-2.13e+00

Pos logit contributions
 of+0.003
body+0.003
 much+0.003
 important+0.003
old+0.002
Neg logit contributions
paper-0.006
coat-0.005
ro-0.005
pit-0.005
ked-0.005
Token He
Token ID679
Feature activation+7.66e-01

Pos logit contributions
body+0.055
 of+0.048
 important+0.047
 much+0.047
 unexpected+0.047
Neg logit contributions
ro-0.036
paper-0.034
coat-0.032
pit-0.032
ked-0.029
Token was
Token ID373
Feature activation-1.75e+00

Pos logit contributions
paper+0.015
pit+0.015
coat+0.015
ro+0.014
ked+0.014
Neg logit contributions
 of-0.017
body-0.016
 much-0.016
old-0.015
-0.015
Token so
Token ID523
Feature activation-2.79e+00

Pos logit contributions
body+0.022
 of+0.021
 much+0.021
old+0.018
 important+0.018
Neg logit contributions
paper-0.051
ro-0.050
coat-0.049
pit-0.049
ked-0.048
Token Jim
Token ID5395
Feature activation-1.22e+00

Pos logit contributions
body+0.017
 important+0.014
 much+0.014
 of+0.014
 importantly+0.014
Neg logit contributions
ro-0.012
paper-0.011
coat-0.010
ked-0.010
pit-0.010
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
body+0.024
 of+0.022
 much+0.022
old+0.020
igators+0.020
Neg logit contributions
paper-0.028
ro-0.026
coat-0.026
pit-0.026
ked-0.025
Token Jim
Token ID5395
Feature activation-2.09e+00

Pos logit contributions
body+0.022
 of+0.021
 much+0.021
old+0.018
igators+0.018
Neg logit contributions
paper-0.039
coat-0.038
ro-0.038
pit-0.037
ked-0.037
Token loved
Token ID6151
Feature activation-3.53e-01

Pos logit contributions
body+0.046
 of+0.043
 much+0.042
old+0.039
igators+0.039
Neg logit contributions
paper-0.042
ro-0.040
coat-0.040
pit-0.040
ked-0.038
Token playing
Token ID2712
Feature activation+1.66e+00

Pos logit contributions
body+0.008
 of+0.008
 much+0.008
 important+0.007
old+0.007
Neg logit contributions
paper-0.007
pit-0.007
ro-0.007
coat-0.007
ked-0.006
Token with
Token ID351
Feature activation-4.71e+00

Pos logit contributions
paper+0.046
ked+0.044
pit+0.043
coat+0.043
ro+0.043
Neg logit contributions
 of-0.025
body-0.023
 much-0.020
 important-0.018
ug-0.018
Token his
Token ID465
Feature activation-5.15e-01

Pos logit contributions
body+0.099
 of+0.093
 much+0.091
old+0.085
 important+0.083
Neg logit contributions
paper-0.100
ro-0.095
coat-0.095
pit-0.094
ked-0.090
Token toys
Token ID14958
Feature activation-1.89e-01

Pos logit contributions
body+0.010
 of+0.010
 much+0.009
 important+0.009
igators+0.008
Neg logit contributions
paper-0.012
ro-0.011
coat-0.011
pit-0.011
ked-0.011
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
 of+0.003
 much+0.003
body+0.003
 important+0.003
 year+0.002
Neg logit contributions
paper-0.005
ro-0.005
coat-0.005
ked-0.005
pit-0.005
Token but
Token ID475
Feature activation-2.91e+00

Pos logit contributions
paper+0.033
coat+0.032
ro+0.031
pit+0.031
ked+0.029
Neg logit contributions
body-0.029
 of-0.027
 much-0.026
old-0.026
ug-0.024
Token one
Token ID530
Feature activation-1.88e+00

Pos logit contributions
body+0.059
 of+0.054
 much+0.053
old+0.051
ug+0.050
Neg logit contributions
paper-0.063
pit-0.061
ro-0.060
coat-0.059
ked-0.056
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
ro+0.020
paper+0.020
coat+0.020
pit+0.018
ked+0.016
Neg logit contributions
body-0.035
igators-0.032
 earthquake-0.032
 much-0.031
ug-0.031
TokenL
Token ID43
Feature activation-3.43e+00

Pos logit contributions
body+0.015
 of+0.014
igators+0.014
 much+0.014
ug+0.014
Neg logit contributions
ro-0.007
coat-0.007
paper-0.007
pit-0.006
ked-0.006
Tokenily
Token ID813
Feature activation-1.65e-01

Pos logit contributions
body+0.060
 much+0.057
 of+0.054
 important+0.050
igators+0.050
Neg logit contributions
ro-0.079
paper-0.079
pit-0.078
coat-0.077
ked-0.073
Token was
Token ID373
Feature activation-9.90e-01

Pos logit contributions
body+0.003
 of+0.003
 much+0.003
old+0.003
 important+0.003
Neg logit contributions
paper-0.004
pit-0.003
ro-0.003
coat-0.003
ked-0.003
Token sad
Token ID6507
Feature activation+1.05e+00

Pos logit contributions
body+0.019
igators+0.016
icated+0.016
 of+0.016
old+0.016
Neg logit contributions
coat-0.022
ked-0.021
ro-0.020
paper-0.020
pit-0.020
Token because
Token ID780
Feature activation-4.04e+00

Pos logit contributions
paper+0.025
coat+0.024
pit+0.024
ro+0.024
ked+0.023
Neg logit contributions
body-0.019
 of-0.019
 much-0.018
old-0.017
 important-0.017
Token she
Token ID673
Feature activation+1.25e+00

Pos logit contributions
body+0.117
ug+0.113
asm+0.108
igators+0.107
old+0.104
Neg logit contributions
paper-0.038
pit-0.037
Or-0.031
coat-0.030
ro-0.030
Token really
Token ID1107
Feature activation-9.22e-01

Pos logit contributions
paper+0.026
ro+0.025
pit+0.024
coat+0.024
ked+0.023
Neg logit contributions
body-0.027
 of-0.026
 much-0.025
old-0.023
 important-0.023
Token wanted
Token ID2227
Feature activation+6.00e-01

Pos logit contributions
body+0.025
 of+0.023
 much+0.022
igators+0.022
old+0.021
Neg logit contributions
paper-0.014
ro-0.014
coat-0.014
ked-0.013
pit-0.012
Token to
Token ID284
Feature activation+1.36e+00

Pos logit contributions
paper+0.010
coat+0.009
ro+0.009
pit+0.009
ked+0.008
Neg logit contributions
body-0.016
 of-0.014
 much-0.014
ug-0.014
old-0.013
Token see
Token ID766
Feature activation-1.40e+00

Pos logit contributions
paper+0.023
coat+0.023
ro+0.023
pit+0.020
thank+0.020
Neg logit contributions
body-0.033
 much-0.030
 importantly-0.030
igators-0.029
 of-0.029
Token,
Token ID11
Feature activation+2.45e+00

Pos logit contributions
 of+0.002
 much+0.002
body+0.002
 important+0.002
old+0.001
Neg logit contributions
coat-0.004
paper-0.004
pit-0.004
ro-0.004
ked-0.004
Token and
Token ID290
Feature activation-2.02e+00

Pos logit contributions
paper+0.061
ro+0.059
coat+0.059
pit+0.058
ked+0.056
Neg logit contributions
body-0.042
 of-0.040
 much-0.039
old-0.035
 important-0.035
Token Sally
Token ID25737
Feature activation-1.97e+00

Pos logit contributions
body+0.040
 of+0.036
 much+0.035
ug+0.033
old+0.033
Neg logit contributions
paper-0.045
ro-0.043
pit-0.043
coat-0.043
ked-0.041
Token became
Token ID2627
Feature activation-2.48e+00

Pos logit contributions
body+0.041
 of+0.040
 much+0.039
old+0.037
 important+0.036
Neg logit contributions
paper-0.042
ro-0.040
pit-0.039
coat-0.039
ked-0.037
Token anxious
Token ID18116
Feature activation-1.74e-01

Pos logit contributions
body+0.042
 of+0.037
old+0.034
ug+0.033
 much+0.033
Neg logit contributions
paper-0.062
coat-0.059
ro-0.059
pit-0.058
ked-0.058
Token because
Token ID780
Feature activation-4.20e+00

Pos logit contributions
body+0.003
 of+0.003
 much+0.003
old+0.003
 important+0.003
Neg logit contributions
paper-0.004
ro-0.004
coat-0.004
pit-0.004
ked-0.004
Token she
Token ID673
Feature activation+4.45e-01

Pos logit contributions
body+0.100
ug+0.090
old+0.087
asm+0.086
igators+0.086
Neg logit contributions
paper-0.070
pit-0.067
coat-0.063
ro-0.062
ked-0.057
Token really
Token ID1107
Feature activation-1.41e+00

Pos logit contributions
paper+0.009
pit+0.008
ro+0.008
coat+0.008
ked+0.008
Neg logit contributions
body-0.010
 of-0.009
 much-0.009
old-0.009
 important-0.009
Token wanted
Token ID2227
Feature activation+9.64e-02

Pos logit contributions
body+0.034
 of+0.032
 much+0.032
old+0.030
 important+0.030
Neg logit contributions
paper-0.025
pit-0.024
coat-0.024
ro-0.024
ked-0.022
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
paper+0.002
coat+0.002
ro+0.002
pit+0.002
ked+0.002
Neg logit contributions
body-0.002
 of-0.002
 much-0.002
old-0.002
 important-0.002
Token play
Token ID711
Feature activation+1.22e+00

Pos logit contributions
paper+0.022
ro+0.021
coat+0.021
pit+0.020
ked+0.019
Neg logit contributions
body-0.031
 of-0.028
 much-0.028
old-0.027
igators-0.026
Token ready
Token ID3492
Feature activation+9.86e-01

Pos logit contributions
body+0.031
 of+0.031
 much+0.030
 important+0.026
old+0.026
Neg logit contributions
paper-0.056
ro-0.054
pit-0.053
coat-0.053
ked-0.052
Token to
Token ID284
Feature activation+9.68e-01

Pos logit contributions
paper+0.023
ro+0.022
pit+0.022
coat+0.022
ked+0.022
Neg logit contributions
 of-0.020
 much-0.019
body-0.017
 important-0.016
 year-0.015
Token pay
Token ID1414
Feature activation-1.65e+00

Pos logit contributions
paper+0.019
ro+0.018
coat+0.018
pit+0.018
ked+0.017
Neg logit contributions
body-0.021
 of-0.020
 much-0.020
old-0.019
igators-0.019
Token,
Token ID11
Feature activation+1.96e+00

Pos logit contributions
body+0.032
 of+0.029
 much+0.028
igators+0.028
old+0.027
Neg logit contributions
paper-0.038
ro-0.035
coat-0.035
pit-0.035
ked-0.033
Token but
Token ID475
Feature activation-1.98e+00

Pos logit contributions
paper+0.042
coat+0.040
ro+0.040
pit+0.039
ked+0.038
Neg logit contributions
 of-0.041
body-0.041
 much-0.039
 important-0.035
-0.035
Token Grand
Token ID5675
Feature activation-4.69e+00

Pos logit contributions
body+0.037
 of+0.035
 much+0.034
old+0.032
igators+0.031
Neg logit contributions
paper-0.046
ro-0.044
coat-0.044
pit-0.044
ked-0.042
Tokenpa
Token ID8957
Feature activation-3.27e+00

Pos logit contributions
body+0.070
 of+0.064
 much+0.063
old+0.056
igators+0.056
Neg logit contributions
paper-0.128
pit-0.122
coat-0.122
ro-0.121
ked-0.117
Token was
Token ID373
Feature activation-3.04e+00

Pos logit contributions
body+0.070
 of+0.069
 much+0.066
old+0.062
 important+0.061
Neg logit contributions
paper-0.067
ro-0.064
pit-0.064
coat-0.063
ked-0.060
Token last
Token ID938
Feature activation-1.21e+00

Pos logit contributions
body+0.052
 of+0.047
 much+0.045
old+0.043
igators+0.042
Neg logit contributions
paper-0.076
coat-0.074
ro-0.073
pit-0.073
ked-0.070
Token in
Token ID287
Feature activation-1.35e+00

Pos logit contributions
body+0.021
 of+0.019
 much+0.018
old+0.017
 important+0.017
Neg logit contributions
paper-0.030
ro-0.029
coat-0.029
pit-0.029
ked-0.028
Token line
Token ID1627
Feature activation-1.89e+00

Pos logit contributions
body+0.021
 of+0.020
 much+0.019
old+0.017
 important+0.016
Neg logit contributions
paper-0.037
coat-0.035
ro-0.035
pit-0.035
ked-0.034
Token drove
Token ID10357
Feature activation+4.18e-02

Pos logit contributions
body+0.017
 much+0.015
 of+0.015
old+0.014
igators+0.014
Neg logit contributions
paper-0.025
ro-0.025
coat-0.025
pit-0.024
ked-0.023
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
paper+0.001
ro+0.001
coat+0.001
pit+0.001
ked+0.001
Neg logit contributions
body-0.001
 of-0.001
 much-0.001
old-0.001
igators-0.001
Token Inside
Token ID14384
Feature activation+4.48e-02

Pos logit contributions
body+0.029
 of+0.026
 much+0.026
old+0.025
 important+0.024
Neg logit contributions
paper-0.029
ro-0.029
coat-0.028
pit-0.027
ked-0.026
Token was
Token ID373
Feature activation-3.61e+00

Pos logit contributions
paper+0.001
pit+0.001
coat+0.001
ro+0.001
ked+0.001
Neg logit contributions
body-0.001
 importantly-0.001
old-0.001
ug-0.001
 much-0.001
Token a
Token ID257
Feature activation-4.38e-01

Pos logit contributions
body+0.072
 of+0.062
old+0.061
ug+0.059
 much+0.059
Neg logit contributions
paper-0.081
coat-0.076
ro-0.075
pit-0.074
ked-0.073
Token mom
Token ID1995
Feature activation-3.90e+00

Pos logit contributions
body+0.008
 of+0.007
old+0.007
 much+0.007
igators+0.007
Neg logit contributions
paper-0.010
coat-0.010
pit-0.010
ro-0.010
ked-0.009
Token,
Token ID11
Feature activation+1.24e+00

Pos logit contributions
body+0.088
 much+0.078
igators+0.076
ug+0.074
 of+0.073
Neg logit contributions
paper-0.076
coat-0.075
pit-0.073
ro-0.071
ked-0.065
Token dad
Token ID9955
Feature activation-2.60e+00

Pos logit contributions
paper+0.029
coat+0.028
ro+0.027
pit+0.027
ked+0.026
Neg logit contributions
body-0.023
 of-0.021
 much-0.020
old-0.020
igators-0.019
Token and
Token ID290
Feature activation-8.63e-01

Pos logit contributions
body+0.058
 much+0.050
 of+0.050
igators+0.049
old+0.048
Neg logit contributions
paper-0.052
coat-0.050
ro-0.050
pit-0.049
ked-0.045
Token a
Token ID257
Feature activation+8.51e-01

Pos logit contributions
body+0.016
 of+0.015
 much+0.014
old+0.013
 important+0.013
Neg logit contributions
paper-0.021
coat-0.020
ro-0.020
pit-0.020
ked-0.019
Token little
Token ID1310
Feature activation-1.78e+00

Pos logit contributions
paper+0.023
ro+0.022
pit+0.021
ked+0.021
coat+0.021
Neg logit contributions
 much-0.013
body-0.013
 of-0.013
 important-0.012
old-0.010