TOP ACTIVATIONS
MAX = 13.561

" Can we fix the blanket , mom my ?"
do not want to separate the ball . It is ours
you . I will fix the pengu in . Then you
don 't have to fill the grill with only one thing
't you take turns wearing the hat and sitting on the
Can you help me split the fruit ?" The cow tried
helped the old lady fix the fence .
going to help us fix the screen ." The
you discuss how to share the sausage ?" Mom says .
impatient . We can share the mel on ." Tom said
Look . I have fixed the important roof . Let 's
" Can we repair the calendar , Mom ?" Lily
why he had fixed her the screen . "
she could help him move the cart . To his surprise
to wait for me on the slide . Then we can
, can you please replace the rock you moved from the
" I regret kicking the ball ," Lily said .
" Maybe we can share the ball with him . Maybe
said , " I broke the branch . I wanted to
late . The dog ripped the rocket apart and spat out

MIN ACTIVATIONS
MIN = -8.521

mom my explained that the doctors were going to fix something
the hospital , where the doctors said he had eaten poisonous
the land . Everywhere the butterfly flew , the dinosaur 's
people to the hospital where doctors can make them better ."
wizard . Every day the wizard used this staff for magic
. The yoga instructor was very friendly and charming
be safe ." So the sheep quickly slid through the hole
she kept hoping that the anchor would come back .
proceeded to guess what the shield was for . Was it
to shrink ! Everywhere the tide went it seemed to get
pa said that if the kid wanted to go outside ,
helpful . Every day the doctor asked the patient different questions
, no matter how the monster yelled . The
the air . Everywhere the creature had been was full of
The massage therapist asked him to lay down
of questions , while the wizard laughed and told him stories
from danger . Everywhere the knight went , he was careful
colorful day . Everywhere the person looked , there were bright
fix it . The pl umber came and fixed the f
to the hospital . The doctors and nurses there were very

INTERVAL 8.040 - 13.561
CONTAINS 5.834%

park . Claire was holding a k ite that had smooth
when Lily 's mom saw the broken plate , she didn
saw a strange boy in the corner . The boy looked
. Lily ran behind a big tree and waited for Tom
They waved goodbye to the neighbor and went back to

INTERVAL 2.520 - 8.040
CONTAINS 9.551%

to eat the leaves without any worries . <|endoftext|> Once upon
me find <|endoftext|> Once upon a time , there was a
smiled , happy to have her picture in a funny frame
and her mom my had a lovely dinner together . They
the ants had more than enough for the whole day .

INTERVAL -3.001 - 2.520
CONTAINS 70.420%

, look at my pe b ble !" Sara said ,
ries on it . She wanted to eat them , but
't sure what to do . He kept on holding
ate it anyway because she was a big girl . After
His sister was happy and they played a fun

INTERVAL -8.521 - -3.001
CONTAINS 14.195%

She walked to Farmer Joe and took the toy cars from
other and felt happy . They liked their room and their
, drink some water . Water is very healthy ." So
When they got to the house , Bobby opened the jar
with stars and moons . Lily and Mr . Sn ugg
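
NOTE ON THE INTERVALS
The four interval boundaries listed above are consistent with splitting the range from MIN (-8.521) to MAX (13.561) into four equal-width bins of about 5.52 each, and the four CONTAINS percentages sum to 100%. The sketch below only illustrates that reading. The function names, the four-bin default, and the rule that a value equal to MAX goes into the top bin are assumptions for illustration, not the dashboard's actual code.

```python
def equal_width_edges(min_act, max_act, n_bins=4):
    """Return n_bins + 1 evenly spaced bin edges from min_act to max_act."""
    width = (max_act - min_act) / n_bins
    # Append max_act explicitly so the last edge is exact, not a float sum.
    return [min_act + i * width for i in range(n_bins)] + [max_act]


def interval_shares(activations, edges):
    """Percentage of activations that fall into each bin.

    Bins are half-open [lo, hi), except the last one, which also
    includes its upper edge (assumed convention). Values outside
    the edge range are ignored.
    """
    counts = [0] * (len(edges) - 1)
    for a in activations:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= a < edges[i + 1] or (is_last and a == edges[-1]):
                counts[i] += 1
                break
    total = sum(counts)
    if total == 0:
        return [0.0] * len(counts)
    return [100.0 * c / total for c in counts]


# MIN and MAX as reported in this dump.
edges = equal_width_edges(-8.521, 13.561)
print([round(e, 4) for e in edges])
```

The inner edges come out near -3.0005, 2.52 and 8.0405, which agree with the boundaries shown above to within rounding at the third decimal.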
Token
Token ID198
Feature activation+1.90e+00

Pos logit contributions
 sight+0.054
 new+0.051
 biggest+0.048
 view+0.048
 world+0.047
Neg logit contributions
 could-0.037
 wouldn-0.033
 rode-0.032
 fought-0.032
es-0.032
Token"
Token ID1
Feature activation-1.74e+00

Pos logit contributions
 new+0.043
 sight+0.043
 big+0.039
 old+0.039
 elderly+0.038
Neg logit contributions
 could-0.032
 wouldn-0.032
 argued-0.030
 fought-0.030
 rode-0.029
TokenCan
Token ID6090
Feature activation-2.81e+00

Pos logit contributions
 could+0.029
 wouldn+0.027
 fought+0.026
 argued+0.026
 rode+0.025
Neg logit contributions
 sight-0.041
 new-0.040
 view-0.037
 world-0.037
 old-0.036
Token we
Token ID356
Feature activation-9.27e-01

Pos logit contributions
ched+0.045
 argued+0.044
 managed+0.044
 fought+0.042
oped+0.042
Neg logit contributions
 new-0.069
 sight-0.062
 old-0.062
 big-0.061
 sun-0.058
Token fix
Token ID4259
Feature activation+3.60e-01

Pos logit contributions
 could+0.016
 wouldn+0.016
es+0.016
 argued+0.016
ched+0.016
Neg logit contributions
 new-0.020
 sight-0.019
 big-0.019
 sun-0.018
 old-0.018
Token the
Token ID262
Feature activation+1.36e+01

Pos logit contributions
 sight+0.008
 new+0.008
 world+0.007
 view+0.007
 elderly+0.007
Neg logit contributions
 could-0.006
 wouldn-0.006
 argued-0.005
 fought-0.005
es-0.005
Token blanket
Token ID18447
Feature activation-3.18e+00

Pos logit contributions
igsaw+0.222
 sight+0.214
andel+0.210
err+0.202
 future+0.201
Neg logit contributions
 could-0.280
 wouldn-0.260
 let-0.244
 would-0.234
 snatched-0.225
Token,
Token ID11
Feature activation-2.26e+00

Pos logit contributions
 could+0.057
 argued+0.056
 wouldn+0.055
 fought+0.055
 rode+0.054
Neg logit contributions
 new-0.068
 sight-0.067
 big-0.062
 old-0.061
 world-0.060
Token mom
Token ID1995
Feature activation-1.47e+00

Pos logit contributions
 could+0.051
 wouldn+0.049
 argued+0.048
 fought+0.048
 rode+0.047
Neg logit contributions
 sight-0.039
 new-0.038
 old-0.033
 big-0.033
 view-0.033
Tokenmy
Token ID1820
Feature activation-3.07e+00

Pos logit contributions
 wouldn+0.031
 argued+0.031
ched+0.031
ked+0.030
 could+0.030
Neg logit contributions
 new-0.027
 old-0.025
 sight-0.025
 big-0.024
 elderly-0.023
Token?"
Token ID1701
Feature activation-2.40e+00

Pos logit contributions
ked+0.075
oped+0.074
ched+0.073
 argued+0.072
 wouldn+0.071
Neg logit contributions
 new-0.050
 old-0.048
 big-0.046
 elderly-0.042
 sight-0.042
Token do
Token ID466
Feature activation-8.34e-01

Pos logit contributions
es+0.053
ged+0.048
ked+0.048
med+0.047
oped+0.047
Neg logit contributions
 big-0.061
 whole-0.058
 new-0.057
 lot-0.057
 job-0.055
Token not
Token ID407
Feature activation+7.80e-01

Pos logit contributions
 could+0.017
 argued+0.016
 wouldn+0.016
oped+0.016
 rode+0.016
Neg logit contributions
 new-0.017
 sight-0.016
 big-0.015
 old-0.015
 future-0.014
Token want
Token ID765
Feature activation+8.11e-01

Pos logit contributions
 sight+0.017
 new+0.017
 big+0.017
 future+0.016
 old+0.016
Neg logit contributions
es-0.013
 could-0.013
 wouldn-0.013
ked-0.012
oped-0.012
Token to
Token ID284
Feature activation+1.20e+00

Pos logit contributions
 sight+0.020
 new+0.020
 big+0.018
 old+0.018
 view+0.018
Neg logit contributions
 could-0.012
 wouldn-0.012
 argued-0.011
 fought-0.011
 rode-0.011
Token separate
Token ID4553
Feature activation+1.97e+00

Pos logit contributions
 new+0.027
 sight+0.027
 big+0.025
 old+0.024
 elderly+0.024
Neg logit contributions
 could-0.021
 wouldn-0.021
es-0.021
ched-0.020
oped-0.019
Token the
Token ID262
Feature activation+1.35e+01

Pos logit contributions
 sight+0.044
 new+0.043
 world+0.039
 view+0.039
 elderly+0.039
Neg logit contributions
 could-0.034
 wouldn-0.032
 argued-0.031
 fought-0.031
 rode-0.030
Token ball
Token ID2613
Feature activation-1.06e+00

Pos logit contributions
 sight+0.234
andel+0.224
 new+0.216
err+0.215
 future+0.210
Neg logit contributions
 could-0.266
 argued-0.243
 fought-0.242
 rode-0.238
 scored-0.238
Token.
Token ID13
Feature activation-1.98e+00

Pos logit contributions
 could+0.019
 wouldn+0.019
 argued+0.018
 fought+0.018
 rode+0.018
Neg logit contributions
 new-0.023
 sight-0.023
 big-0.021
 old-0.021
 world-0.020
Token It
Token ID632
Feature activation-2.23e+00

Pos logit contributions
 argued+0.033
 wouldn+0.033
 fought+0.032
 managed+0.032
 rode+0.031
Neg logit contributions
 new-0.044
 sight-0.043
 sun-0.042
 big-0.041
 old-0.041
Token is
Token ID318
Feature activation+1.72e+00

Pos logit contributions
ged+0.042
 argued+0.041
oped+0.040
ked+0.039
 rode+0.039
Neg logit contributions
 new-0.046
 big-0.043
 sight-0.042
 old-0.040
 biggest-0.040
Token ours
Token ID16903
Feature activation-1.13e+00

Pos logit contributions
 sight+0.031
 new+0.030
 world+0.028
 view+0.027
 future+0.027
Neg logit contributions
 could-0.037
 wouldn-0.036
 fought-0.034
 argued-0.034
 managed-0.033
Token you
Token ID345
Feature activation-7.59e-01

Pos logit contributions
 could+0.012
 wouldn+0.011
 argued+0.011
 fought+0.011
 rode+0.010
Neg logit contributions
 sight-0.017
 new-0.017
 view-0.015
 old-0.015
 world-0.015
Token.
Token ID13
Feature activation-2.30e+00

Pos logit contributions
 could+0.015
 wouldn+0.015
 fought+0.014
 argued+0.014
 rode+0.014
Neg logit contributions
 new-0.015
 sight-0.015
 old-0.014
 big-0.014
 elderly-0.013
Token I
Token ID314
Feature activation-1.86e+00

Pos logit contributions
 argued+0.039
 wouldn+0.039
 could+0.039
 fought+0.038
 managed+0.036
Neg logit contributions
 sight-0.051
 new-0.051
 future-0.047
 old-0.046
 sun-0.046
Token will
Token ID481
Feature activation+3.04e-01

Pos logit contributions
 argued+0.040
 wouldn+0.039
ked+0.039
ged+0.038
es+0.037
Neg logit contributions
 new-0.033
 big-0.030
 sight-0.029
 whole-0.029
 old-0.028
Token fix
Token ID4259
Feature activation+7.03e-02

Pos logit contributions
 new+0.007
 big+0.006
 sight+0.006
 old+0.006
 elderly+0.006
Neg logit contributions
 could-0.005
 wouldn-0.005
oped-0.005
es-0.005
 argued-0.005
Token the
Token ID262
Feature activation+1.35e+01

Pos logit contributions
 sight+0.002
 new+0.002
 world+0.001
 view+0.001
 old+0.001
Neg logit contributions
 could-0.001
 wouldn-0.001
 argued-0.001
 fought-0.001
es-0.001
Token pengu
Token ID38373
Feature activation+3.76e+00

Pos logit contributions
 sight+0.262
igsaw+0.246
 new+0.240
err+0.232
 future+0.232
Neg logit contributions
 could-0.242
 fought-0.229
 argued-0.219
 snatched-0.216
 wouldn-0.216
Tokenin
Token ID259
Feature activation-6.14e-01

Pos logit contributions
 sight+0.077
 new+0.075
 old+0.067
 big+0.067
 view+0.067
Neg logit contributions
 could-0.073
 wouldn-0.070
 fought-0.068
 argued-0.068
 rode-0.066
Token.
Token ID13
Feature activation-2.16e+00

Pos logit contributions
 could+0.012
 wouldn+0.012
 argued+0.011
 managed+0.011
 rode+0.011
Neg logit contributions
 new-0.013
 sight-0.012
 world-0.011
 big-0.011
 old-0.011
Token Then
Token ID3244
Feature activation-2.01e+00

Pos logit contributions
 argued+0.035
 wouldn+0.034
 could+0.034
 fought+0.034
 managed+0.032
Neg logit contributions
 new-0.050
 sight-0.050
 sun-0.046
 old-0.046
 future-0.045
Token you
Token ID345
Feature activation-3.33e+00

Pos logit contributions
 argued+0.038
oped+0.038
 fought+0.037
ched+0.037
ked+0.036
Neg logit contributions
 new-0.045
 sight-0.041
 big-0.039
 sun-0.038
 old-0.038
Token don
Token ID836
Feature activation-7.16e-03

Pos logit contributions
ged+0.051
oped+0.048
ked+0.048
es+0.044
 argued+0.043
Neg logit contributions
 big-0.061
 new-0.059
 whole-0.055
 old-0.054
 large-0.054
Token't
Token ID470
Feature activation+4.40e-01

Pos logit contributions
 wouldn+0.000
 could+0.000
 argued+0.000
 fought+0.000
 managed+0.000
Neg logit contributions
 new-0.000
 sight-0.000
 big-0.000
 old-0.000
 elderly-0.000
Token have
Token ID423
Feature activation+1.13e+00

Pos logit contributions
 sight+0.010
 new+0.010
 big+0.009
 future+0.009
 world+0.009
Neg logit contributions
 wouldn-0.007
es-0.007
 could-0.007
ked-0.007
oped-0.007
Token to
Token ID284
Feature activation+6.92e-01

Pos logit contributions
 new+0.026
 sight+0.026
 big+0.024
 old+0.023
 future+0.022
Neg logit contributions
 could-0.019
 wouldn-0.019
oped-0.019
es-0.019
ched-0.018
Token fill
Token ID6070
Feature activation+1.32e+00

Pos logit contributions
 new+0.018
 sight+0.018
 big+0.016
 old+0.016
 future+0.016
Neg logit contributions
 could-0.010
 wouldn-0.009
es-0.009
 argued-0.008
ched-0.008
Token the
Token ID262
Feature activation+1.34e+01

Pos logit contributions
 sight+0.030
 doorway+0.028
 new+0.028
 world+0.027
 elderly+0.027
Neg logit contributions
 could-0.020
es-0.019
 wouldn-0.018
 fought-0.018
 argued-0.018
Token grill
Token ID29901
Feature activation-1.22e+00

Pos logit contributions
 sight+0.214
andel+0.210
igsaw+0.204
 particular+0.201
agg+0.184
Neg logit contributions
 could-0.268
 stole-0.242
 wouldn-0.239
 fought-0.237
 shared-0.235
Token with
Token ID351
Feature activation+1.17e+00

Pos logit contributions
 could+0.019
 wouldn+0.018
 fought+0.018
 argued+0.018
 rode+0.017
Neg logit contributions
 sight-0.029
 new-0.029
 view-0.026
 old-0.026
 big-0.026
Token only
Token ID691
Feature activation-4.87e-01

Pos logit contributions
 sight+0.016
 new+0.015
 world+0.013
 view+0.013
 elderly+0.013
Neg logit contributions
 could-0.030
 wouldn-0.029
 fought-0.029
 argued-0.029
 managed-0.027
Token one
Token ID530
Feature activation+1.90e+00

Pos logit contributions
 could+0.010
 wouldn+0.010
 argued+0.009
 fought+0.009
 rode+0.009
Neg logit contributions
 new-0.009
 sight-0.009
 old-0.008
 big-0.008
 elderly-0.008
Token thing
Token ID1517
Feature activation-2.32e+00

Pos logit contributions
 sight+0.042
 new+0.040
 view+0.037
 elderly+0.036
 world+0.036
Neg logit contributions
 could-0.035
 wouldn-0.032
 fought-0.031
 argued-0.030
 managed-0.029
Token't
Token ID470
Feature activation+1.84e-01

Pos logit contributions
 argued+0.012
 wouldn+0.012
 fought+0.012
 could+0.012
 managed+0.011
Neg logit contributions
 new-0.018
 sight-0.017
 big-0.016
 old-0.016
 sun-0.016
Token you
Token ID345
Feature activation-8.79e-01

Pos logit contributions
 sight+0.004
 new+0.004
 big+0.004
 old+0.004
 view+0.004
Neg logit contributions
 could-0.003
 wouldn-0.003
 argued-0.003
 fought-0.003
 rode-0.003
Token take
Token ID1011
Feature activation+9.45e-01

Pos logit contributions
 could+0.016
es+0.015
 wouldn+0.015
ged+0.015
ched+0.015
Neg logit contributions
 new-0.018
 sight-0.018
 big-0.017
 old-0.016
 elderly-0.016
Token turns
Token ID4962
Feature activation-3.18e-01

Pos logit contributions
 sight+0.019
 new+0.018
 world+0.018
 elderly+0.017
 view+0.017
Neg logit contributions
 could-0.017
 fought-0.017
 argued-0.017
 wouldn-0.016
 rode-0.016
Token wearing
Token ID5762
Feature activation+2.11e+00

Pos logit contributions
 wouldn+0.006
 could+0.006
 argued+0.006
ked+0.006
 managed+0.006
Neg logit contributions
 new-0.006
 sight-0.006
 big-0.006
 old-0.006
 world-0.005
Token the
Token ID262
Feature activation+1.34e+01

Pos logit contributions
 sight+0.045
 new+0.044
 world+0.040
 view+0.039
 elderly+0.039
Neg logit contributions
 could-0.039
 wouldn-0.037
 fought-0.036
 argued-0.036
 rode-0.035
Token hat
Token ID6877
Feature activation-3.65e+00

Pos logit contributions
 sight+0.261
igsaw+0.261
umm+0.242
err+0.240
ummy+0.237
Neg logit contributions
 could-0.243
 wouldn-0.232
 rode-0.220
 wrote-0.213
 let-0.207
Token and
Token ID290
Feature activation+3.19e-01

Pos logit contributions
 argued+0.075
 fought+0.072
 snatched+0.072
 managed+0.072
 wouldn+0.071
Neg logit contributions
 new-0.069
 sight-0.064
 big-0.062
 old-0.061
 world-0.060
Token sitting
Token ID5586
Feature activation-1.22e-02

Pos logit contributions
 new+0.006
 sight+0.006
 big+0.005
 old+0.005
 elderly+0.005
Neg logit contributions
 could-0.007
 wouldn-0.007
 argued-0.006
ched-0.006
 managed-0.006
Token on
Token ID319
Feature activation+8.86e-01

Pos logit contributions
 could+0.000
 wouldn+0.000
 fought+0.000
 argued+0.000
 rode+0.000
Neg logit contributions
 sight-0.000
 new-0.000
 view-0.000
 old-0.000
 big-0.000
Token the
Token ID262
Feature activation+1.06e+01

Pos logit contributions
 sight+0.020
 new+0.020
 biggest+0.018
 world+0.018
 view+0.018
Neg logit contributions
 could-0.015
 wouldn-0.014
 fought-0.014
 argued-0.013
 rode-0.013
TokenCan
Token ID6090
Feature activation-2.90e+00

Pos logit contributions
 wouldn+0.041
 could+0.040
 argued+0.039
 fought+0.038
es+0.037
Neg logit contributions
 sight-0.048
 new-0.047
 world-0.044
 big-0.043
 elderly-0.043
Token you
Token ID345
Feature activation-1.75e-01

Pos logit contributions
 argued+0.054
ched+0.053
oped+0.052
ged+0.052
es+0.052
Neg logit contributions
 new-0.062
 big-0.056
 old-0.055
 sight-0.054
 sun-0.051
Token help
Token ID1037
Feature activation-9.04e-01

Pos logit contributions
 could+0.003
 wouldn+0.003
 argued+0.003
es+0.003
 fought+0.003
Neg logit contributions
 new-0.004
 sight-0.004
 big-0.003
 old-0.003
 large-0.003
Token me
Token ID502
Feature activation+5.80e-01

Pos logit contributions
 could+0.023
 wouldn+0.023
 argued+0.022
 rode+0.022
 fought+0.022
Neg logit contributions
 new-0.013
 sight-0.013
 big-0.012
 old-0.011
 elderly-0.011
Token split
Token ID6626
Feature activation+1.30e+00

Pos logit contributions
 sight+0.013
 new+0.013
 world+0.012
 view+0.012
 old+0.012
Neg logit contributions
 could-0.010
 wouldn-0.009
 fought-0.009
 argued-0.009
 rode-0.009
Token the
Token ID262
Feature activation+1.34e+01

Pos logit contributions
 sight+0.030
 new+0.029
 world+0.027
 future+0.026
 view+0.026
Neg logit contributions
 could-0.022
 wouldn-0.020
 fought-0.019
 argued-0.019
es-0.019
Token fruit
Token ID8234
Feature activation-4.57e-02

Pos logit contributions
 sight+0.214
andel+0.209
igsaw+0.203
 former+0.198
 new+0.197
Neg logit contributions
 could-0.281
 wouldn-0.250
 snatched-0.248
 argued-0.247
 fought-0.246
Token?"
Token ID1701
Feature activation-1.73e+00

Pos logit contributions
 could+0.001
 wouldn+0.001
 fought+0.001
 argued+0.001
 rode+0.001
Neg logit contributions
 sight-0.001
 new-0.001
 view-0.001
 world-0.001
 old-0.001
Token The
Token ID383
Feature activation+9.81e+00

Pos logit contributions
 wouldn+0.034
 managed+0.032
ched+0.032
 argued+0.032
 could+0.032
Neg logit contributions
 new-0.036
 sight-0.034
 big-0.033
 old-0.032
 sun-0.031
Token cow
Token ID9875
Feature activation-5.35e+00

Pos logit contributions
igsaw+0.163
 sight+0.163
 view+0.141
 new+0.139
 ey+0.139
Neg logit contributions
 could-0.215
 let-0.203
 argued-0.202
 fought-0.200
 snatched-0.195
Token tried
Token ID3088
Feature activation-9.22e-01

Pos logit contributions
pped+0.069
 Leave+0.066
ched+0.066
oped+0.066
pping+0.065
Neg logit contributions
 new-0.133
 big-0.125
 sight-0.124
 old-0.123
 little-0.118
Token helped
Token ID4193
Feature activation-8.25e-01

Pos logit contributions
 Leave+0.034
 wouldn+0.034
es+0.032
 scored+0.032
 could+0.032
Neg logit contributions
 new-0.081
 big-0.080
 sight-0.077
 large-0.076
 old-0.076
Token the
Token ID262
Feature activation+9.65e+00

Pos logit contributions
 could+0.014
 wouldn+0.013
 argued+0.013
 fought+0.013
 rode+0.012
Neg logit contributions
 sight-0.019
 new-0.018
 world-0.017
 old-0.017
 view-0.017
Token old
Token ID1468
Feature activation+8.07e+00

Pos logit contributions
igsaw+0.112
 ey+0.104
 sight+0.099
umm+0.092
 former+0.082
Neg logit contributions
 could-0.248
 let-0.243
 argued-0.234
 begged-0.231
 hid-0.227
Token lady
Token ID10846
Feature activation-2.81e+00

Pos logit contributions
 sight+0.099
 new+0.091
 biggest+0.089
igsaw+0.086
 future+0.085
Neg logit contributions
 could-0.203
 argued-0.200
 fought-0.195
 wouldn-0.195
 rode-0.191
Token fix
Token ID4259
Feature activation+7.21e-01

Pos logit contributions
 could+0.048
 wouldn+0.047
 fought+0.046
 argued+0.046
 rode+0.045
Neg logit contributions
 new-0.063
 sight-0.060
 big-0.058
 old-0.057
 large-0.055
Token the
Token ID262
Feature activation+1.34e+01

Pos logit contributions
 sight+0.016
 new+0.016
 world+0.015
 view+0.014
 elderly+0.014
Neg logit contributions
 could-0.012
 wouldn-0.012
 argued-0.011
 fought-0.011
es-0.011
Token fence
Token ID13990
Feature activation-2.64e+00

Pos logit contributions
 sight+0.159
igsaw+0.156
umm+0.145
 ey+0.141
err+0.139
Neg logit contributions
 could-0.320
 wouldn-0.305
 hid-0.304
 snatched-0.295
 argued-0.295
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 could+0.049
 fought+0.049
 argued+0.049
 rode+0.048
 wouldn+0.047
Neg logit contributions
 new-0.055
 sight-0.053
 old-0.049
 big-0.049
 world-0.047
Token
Token ID220
Feature activation+7.53e-02

Pos logit contributions
 could+0.020
 wouldn+0.020
 fought+0.019
 argued+0.019
 rode+0.018
Neg logit contributions
 sight-0.029
 new-0.029
 big-0.026
 old-0.026
 view-0.026
Token
Token ID198
Feature activation+1.57e+00

Pos logit contributions
 new+0.002
 sight+0.001
 big+0.001
 old+0.001
 world+0.001
Neg logit contributions
 wouldn-0.001
 could-0.001
 argued-0.001
 fought-0.001
 rode-0.001
Token
Token ID198
Feature activation+1.82e+00

Pos logit contributions
 sight+0.033
 new+0.033
 big+0.029
 old+0.029
 view+0.029
Neg logit contributions
 could-0.029
 wouldn-0.029
 argued-0.028
 fought-0.028
 rode-0.027
Token going
Token ID1016
Feature activation-3.82e-01

Pos logit contributions
 sight+0.026
 new+0.026
 big+0.023
 old+0.023
 elderly+0.023
Neg logit contributions
 could-0.030
 wouldn-0.029
 argued-0.028
 fought-0.028
 rode-0.027
Token to
Token ID284
Feature activation+3.81e-01

Pos logit contributions
 argued+0.007
 could+0.007
 fought+0.006
 wouldn+0.006
es+0.006
Neg logit contributions
 new-0.009
 sight-0.009
 big-0.008
 old-0.008
 world-0.008
Token help
Token ID1037
Feature activation-1.31e+00

Pos logit contributions
 sight+0.009
 new+0.008
 big+0.008
 old+0.008
 view+0.008
Neg logit contributions
 could-0.007
 wouldn-0.006
 argued-0.006
 fought-0.006
 rode-0.006
Token us
Token ID514
Feature activation-2.96e-01

Pos logit contributions
 could+0.023
 wouldn+0.023
 argued+0.022
 fought+0.022
 rode+0.021
Neg logit contributions
 new-0.029
 sight-0.028
 big-0.026
 old-0.026
 world-0.025
Token fix
Token ID4259
Feature activation+3.13e-01

Pos logit contributions
 could+0.005
 argued+0.005
 wouldn+0.005
 fought+0.005
 rode+0.005
Neg logit contributions
 new-0.006
 sight-0.006
 big-0.006
 old-0.006
 view-0.006
Token the
Token ID262
Feature activation+1.34e+01

Pos logit contributions
 sight+0.007
 new+0.007
 view+0.007
 world+0.007
 elderly+0.007
Neg logit contributions
 could-0.005
 wouldn-0.005
 fought-0.005
 argued-0.005
es-0.004
Token screen
Token ID3159
Feature activation-3.07e+00

Pos logit contributions
 sight+0.175
andel+0.169
 new+0.155
err+0.155
igsaw+0.149
Neg logit contributions
 could-0.321
 wouldn-0.303
 fought-0.297
 wrote-0.288
 scored-0.287
Token."
Token ID526
Feature activation-2.34e+00

Pos logit contributions
 argued+0.055
 wouldn+0.054
 could+0.054
 fought+0.054
 rode+0.054
Neg logit contributions
 new-0.065
 sight-0.062
 big-0.060
 old-0.059
 world-0.058
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 wouldn+0.050
 fought+0.049
 argued+0.049
 could+0.048
 managed+0.046
Neg logit contributions
 new-0.044
 sight-0.043
 big-0.038
 old-0.038
 sun-0.038
Token
Token ID198
Feature activation+1.23e+00

Pos logit contributions
 new+0.032
 sight+0.032
 big+0.029
 old+0.028
 world+0.028
Neg logit contributions
 wouldn-0.030
 could-0.030
 argued-0.029
 fought-0.029
 rode-0.027
TokenThe
Token ID464
Feature activation+9.50e+00

Pos logit contributions
 new+0.021
 sight+0.020
 big+0.019
 elderly+0.019
 old+0.018
Neg logit contributions
 wouldn-0.028
 could-0.027
 argued-0.027
 fought-0.026
ched-0.026
Token you
Token ID345
Feature activation-7.12e-01

Pos logit contributions
 new+0.009
 sight+0.009
 big+0.008
 old+0.008
 sun+0.008
Neg logit contributions
 could-0.007
 wouldn-0.007
ched-0.007
 managed-0.007
oped-0.007
Token discuss
Token ID2112
Feature activation-1.23e+00

Pos logit contributions
es+0.012
 could+0.012
 wouldn+0.012
ched+0.011
 rode+0.011
Neg logit contributions
 new-0.015
 sight-0.015
 big-0.015
 old-0.014
 large-0.014
Token how
Token ID703
Feature activation-4.89e-01

Pos logit contributions
 could+0.022
 wouldn+0.022
 argued+0.021
 fought+0.021
 managed+0.021
Neg logit contributions
 new-0.027
 sight-0.026
 big-0.024
 old-0.024
 elderly-0.023
Token to
Token ID284
Feature activation+4.84e-02

Pos logit contributions
 wouldn+0.008
 could+0.008
 argued+0.008
 fought+0.008
 managed+0.008
Neg logit contributions
 new-0.011
 sight-0.011
 old-0.010
 big-0.010
 elderly-0.010
Token share
Token ID2648
Feature activation+2.89e-01

Pos logit contributions
 new+0.001
 sight+0.001
 big+0.001
 old+0.001
 view+0.001
Neg logit contributions
 could-0.001
 wouldn-0.001
 argued-0.001
es-0.001
ched-0.001
Token the
Token ID262
Feature activation+1.33e+01

Pos logit contributions
 new+0.007
 sight+0.006
 big+0.006
 old+0.006
 view+0.006
Neg logit contributions
 could-0.005
 wouldn-0.005
 argued-0.005
 fought-0.005
 rode-0.005
Token sausage
Token ID31889
Feature activation-1.35e+00

Pos logit contributions
 sight+0.261
andel+0.244
 new+0.238
 view+0.236
igsaw+0.233
Neg logit contributions
 could-0.233
 wouldn-0.222
 argued-0.220
 fought-0.213
 snatched-0.212
Token?"
Token ID1701
Feature activation-1.89e+00

Pos logit contributions
 could+0.023
 wouldn+0.022
 argued+0.021
 fought+0.021
 rode+0.020
Neg logit contributions
 sight-0.031
 new-0.030
 big-0.027
 old-0.027
 view-0.027
Token Mom
Token ID11254
Feature activation-4.28e+00

Pos logit contributions
 could+0.037
 let+0.032
 rode+0.031
 suggested+0.031
 fought+0.030
Neg logit contributions
 sight-0.038
 entire-0.035
 world-0.035
 biggest-0.034
 view-0.034
Token says
Token ID1139
Feature activation-1.97e+00

Pos logit contributions
 wouldn+0.065
 could+0.064
 fought+0.061
 managed+0.061
 argued+0.059
Neg logit contributions
 new-0.105
 sight-0.101
 old-0.099
 big-0.096
 elderly-0.095
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
 wouldn+0.043
 could+0.043
 rode+0.042
 fought+0.042
 argued+0.042
Neg logit contributions
 new-0.035
 sight-0.032
 old-0.032
 big-0.030
 elderly-0.028
Token impatient
Token ID33440
Feature activation-1.32e+00

Pos logit contributions
 sight+0.031
 new+0.028
 view+0.027
 world+0.027
 future+0.026
Neg logit contributions
 could-0.032
 wouldn-0.031
 fought-0.031
 argued-0.030
 managed-0.029
Token.
Token ID13
Feature activation-2.57e+00

Pos logit contributions
 wouldn+0.024
 could+0.024
 argued+0.023
 fought+0.023
 rode+0.023
Neg logit contributions
 new-0.028
 sight-0.028
 old-0.026
 big-0.025
 world-0.025
Token We
Token ID775
Feature activation-3.36e+00

Pos logit contributions
 argued+0.042
 wouldn+0.040
 fought+0.040
 managed+0.039
 rode+0.037
Neg logit contributions
 new-0.060
 sight-0.058
 sun-0.056
 old-0.055
 biggest-0.054
Token can
Token ID460
Feature activation-1.04e+00

Pos logit contributions
es+0.044
ged+0.043
fed+0.040
ked+0.040
oped+0.039
Neg logit contributions
 new-0.078
 big-0.078
 sight-0.076
 sun-0.074
 dark-0.074
Token share
Token ID2648
Feature activation+5.88e-03

Pos logit contributions
 could+0.016
 wouldn+0.016
 argued+0.015
es+0.015
ged+0.015
Neg logit contributions
 new-0.025
 sight-0.024
 big-0.023
 sun-0.022
 old-0.022
Token the
Token ID262
Feature activation+1.33e+01

Pos logit contributions
 sight+0.000
 new+0.000
 big+0.000
 old+0.000
 view+0.000
Neg logit contributions
 could-0.000
 wouldn-0.000
 argued-0.000
 fought-0.000
 rode-0.000
Token mel
Token ID7758
Feature activation+3.56e+00

Pos logit contributions
 sight+0.245
igsaw+0.234
 doorway+0.234
andel+0.233
 chamber+0.233
Neg logit contributions
 could-0.249
 fought-0.216
 wouldn-0.215
 wrote-0.214
 argued-0.213
Tokenon
Token ID261
Feature activation+4.74e-03

Pos logit contributions
 sight+0.104
 new+0.103
 old+0.095
 elderly+0.094
 big+0.094
Neg logit contributions
 could-0.038
 wouldn-0.034
 fought-0.032
 argued-0.032
 rode-0.031
Token."
Token ID526
Feature activation-2.19e+00

Pos logit contributions
 sight+0.000
 new+0.000
 view+0.000
 future+0.000
 world+0.000
Neg logit contributions
 could-0.000
 wouldn-0.000
 fought-0.000
 argued-0.000
es-0.000
Token Tom
Token ID4186
Feature activation-5.01e+00

Pos logit contributions
 could+0.030
 wouldn+0.029
 argued+0.028
 fought+0.027
 rode+0.026
Neg logit contributions
 sight-0.058
 new-0.057
 big-0.052
 old-0.052
 view-0.052
Token said
Token ID531
Feature activation-1.54e+00

Pos logit contributions
pping+0.085
oped+0.085
 wouldn+0.082
pped+0.080
uable+0.080
Neg logit contributions
 new-0.111
 old-0.106
 big-0.103
 little-0.102
 elderly-0.101
TokenLook
Token ID8567
Feature activation-1.68e+00

Pos logit contributions
 could+0.058
 wouldn+0.057
 fought+0.055
 argued+0.055
es+0.053
Neg logit contributions
 sight-0.067
 new-0.065
 world-0.061
 view-0.060
 elderly-0.060
Token.
Token ID13
Feature activation-3.76e+00

Pos logit contributions
 fought+0.031
 could+0.031
 argued+0.031
 rode+0.030
ched+0.029
Neg logit contributions
 new-0.036
 sight-0.034
 big-0.033
 old-0.031
 view-0.030
Token I
Token ID314
Feature activation-2.77e+00

Pos logit contributions
 argued+0.064
 wouldn+0.061
 fought+0.060
ched+0.058
ged+0.058
Neg logit contributions
 new-0.082
 sight-0.081
 sun-0.079
 elderly-0.077
 biggest-0.076
Token have
Token ID423
Feature activation+3.82e-02

Pos logit contributions
ked+0.037
 argued+0.035
es+0.034
ged+0.032
 wouldn+0.032
Neg logit contributions
 big-0.065
 whole-0.065
 sight-0.065
 new-0.064
 entire-0.063
Token fixed
Token ID5969
Feature activation+6.69e-01

Pos logit contributions
 new+0.001
 sight+0.001
 big+0.001
 old+0.001
 view+0.001
Neg logit contributions
 could-0.001
 wouldn-0.001
 argued-0.001
es-0.001
oped-0.001
Token the
Token ID262
Feature activation+1.33e+01

Pos logit contributions
 sight+0.015
 new+0.015
 old+0.013
 big+0.013
 world+0.013
Neg logit contributions
 could-0.012
 wouldn-0.011
 argued-0.011
 fought-0.011
 rode-0.010
Token important
Token ID1593
Feature activation+1.12e+01

Pos logit contributions
igsaw+0.202
 sight+0.182
err+0.176
 new+0.167
ummy+0.165
Neg logit contributions
 could-0.313
 wouldn-0.297
 fought-0.275
 won-0.264
 argued-0.264
Token roof
Token ID9753
Feature activation-3.19e+00

Pos logit contributions
 new+0.123
 sight+0.123
igsaw+0.117
 elderly+0.109
err+0.106
Neg logit contributions
 could-0.286
 wouldn-0.284
 fought-0.267
 hid-0.261
 argued-0.259
Token.
Token ID13
Feature activation-3.30e+00

Pos logit contributions
 could+0.056
 argued+0.055
 fought+0.054
 wouldn+0.054
 rode+0.053
Neg logit contributions
 new-0.069
 sight-0.068
 old-0.062
 big-0.062
 world-0.061
Token Let
Token ID3914
Feature activation-2.49e+00

Pos logit contributions
 argued+0.053
 fought+0.051
 wouldn+0.049
ged+0.048
 managed+0.047
Neg logit contributions
 new-0.076
 sight-0.075
 sun-0.072
 elderly-0.071
 world-0.071
Token's
Token ID338
Feature activation-4.73e-01

Pos logit contributions
 fought+0.032
ked+0.031
 could+0.031
 argued+0.031
oped+0.030
Neg logit contributions
 new-0.067
 sight-0.063
 big-0.063
 world-0.060
 old-0.060
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
 sight+0.032
 new+0.031
 view+0.028
 old+0.028
 big+0.028
Neg logit contributions
 could-0.026
 wouldn-0.024
 argued-0.024
 fought-0.024
 rode-0.023
Token"
Token ID1
Feature activation-2.11e+00

Pos logit contributions
 sight+0.026
 new+0.026
 big+0.023
 old+0.023
 elderly+0.023
Neg logit contributions
 could-0.025
 wouldn-0.024
 argued-0.024
 fought-0.023
 rode-0.023
TokenCan
Token ID6090
Feature activation-3.25e+00

Pos logit contributions
 could+0.036
 wouldn+0.035
 argued+0.034
 fought+0.034
 rode+0.032
Neg logit contributions
 sight-0.048
 new-0.047
 big-0.042
 old-0.042
 view-0.042
Token we
Token ID356
Feature activation-9.22e-01

Pos logit contributions
ched+0.057
 argued+0.056
ged+0.055
oped+0.055
es+0.055
Neg logit contributions
 new-0.075
 sight-0.066
 big-0.066
 old-0.066
 sun-0.063
Token repair
Token ID9185
Feature activation-2.34e-01

Pos logit contributions
 could+0.016
 wouldn+0.016
 argued+0.016
 fought+0.016
ched+0.015
Neg logit contributions
 new-0.020
 sight-0.020
 big-0.018
 sun-0.018
 old-0.018
Token the
Token ID262
Feature activation+1.33e+01

Pos logit contributions
 could+0.004
 wouldn+0.003
es+0.003
 fought+0.003
 argued+0.003
Neg logit contributions
 sight-0.005
 new-0.005
 world-0.005
 elderly-0.005
 view-0.005
Token calendar
Token ID11845
Feature activation-2.22e+00

Pos logit contributions
umm+0.156
err+0.155
 sight+0.155
andel+0.148
agg+0.147
Neg logit contributions
 could-0.294
 wouldn-0.272
 wrote-0.267
 let-0.252
es-0.250
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
 could+0.039
 wouldn+0.038
 argued+0.038
 fought+0.037
 rode+0.036
Neg logit contributions
 new-0.048
 sight-0.048
 big-0.044
 old-0.043
 world-0.043
Token Mom
Token ID11254
Feature activation-2.69e+00

Pos logit contributions
 could+0.038
 wouldn+0.037
 argued+0.036
 fought+0.036
 rode+0.034
Neg logit contributions
 sight-0.024
 new-0.023
 world-0.020
 view-0.020
 future-0.020
Token?"
Token ID1701
Feature activation-2.75e+00

Pos logit contributions
ched+0.061
ked+0.061
oped+0.060
 argued+0.060
 wouldn+0.059
Neg logit contributions
 new-0.046
 old-0.043
 elderly-0.041
 big-0.041
 sight-0.041
Token Lily
Token ID20037
Feature activation-5.15e+00

Pos logit contributions
 wouldn+0.049
 could+0.048
ched+0.046
 argued+0.046
 fought+0.046
Neg logit contributions
 new-0.062
 sight-0.061
 big-0.056
 old-0.056
 elderly-0.055
Token why
Token ID1521
Feature activation-2.30e+00

Pos logit contributions
 fought+0.067
 argued+0.066
 rode+0.065
 snatched+0.063
ched+0.062
Neg logit contributions
 new-0.083
 old-0.073
 big-0.073
 sight-0.073
 world-0.068
Token he
Token ID339
Feature activation-3.17e+00

Pos logit contributions
 argued+0.036
ked+0.035
ched+0.035
 fought+0.034
 snatched+0.033
Neg logit contributions
 new-0.058
 sight-0.055
 old-0.053
 big-0.053
 dark-0.050
Token had
Token ID550
Feature activation+7.93e-01

Pos logit contributions
ked+0.049
es+0.047
fed+0.046
 argued+0.044
icked+0.044
Neg logit contributions
 big-0.073
 new-0.072
 old-0.069
 large-0.068
 sight-0.068
Token fixed
Token ID5969
Feature activation+1.48e+00

Pos logit contributions
 sight+0.018
 new+0.018
 big+0.016
 old+0.016
 dark+0.015
Neg logit contributions
es-0.014
oped-0.014
 could-0.014
 wouldn-0.013
ked-0.013
Token her
Token ID607
Feature activation+7.31e+00

Pos logit contributions
 sight+0.038
 new+0.038
 old+0.035
 big+0.035
 view+0.034
Neg logit contributions
 could-0.020
 wouldn-0.019
 argued-0.019
 fought-0.019
 rode-0.018
Token the
Token ID262
Feature activation+1.32e+01

Pos logit contributions
 sight+0.092
 new+0.080
 view+0.078
 future+0.078
andel+0.075
Neg logit contributions
 could-0.186
 wouldn-0.178
 argued-0.168
es-0.168
 fought-0.164
Token screen
Token ID3159
Feature activation-2.33e+00

Pos logit contributions
andel+0.200
 former+0.191
 sight+0.182
 swamp+0.181
 surroundings+0.176
Neg logit contributions
 could-0.274
 wouldn-0.259
 wrote-0.244
 scored-0.239
 let-0.229
Token.
Token ID13
Feature activation-2.48e+00

Pos logit contributions
 could+0.045
 fought+0.044
 argued+0.044
 wouldn+0.043
 rode+0.043
Neg logit contributions
 new-0.047
 sight-0.046
 big-0.042
 old-0.042
 world-0.041
Token
Token ID198
Feature activation+2.05e-01

Pos logit contributions
 fought+0.050
 wouldn+0.049
 argued+0.049
 could+0.048
 managed+0.048
Neg logit contributions
 new-0.048
 sight-0.047
 old-0.043
 big-0.043
 world-0.042
Token
Token ID198
Feature activation+1.69e-01

Pos logit contributions
 sight+0.004
 new+0.004
 big+0.004
 old+0.004
 view+0.004
Neg logit contributions
 could-0.004
 wouldn-0.004
 argued-0.003
 fought-0.003
 rode-0.003
Token"
Token ID1
Feature activation-2.28e+00

Pos logit contributions
 new+0.003
 sight+0.003
 big+0.003
 world+0.003
 old+0.003
Neg logit contributions
 could-0.003
 wouldn-0.003
 fought-0.003
 argued-0.003
 rode-0.003
Token she
Token ID673
Feature activation-5.38e+00

Pos logit contributions
 argued+0.057
 fought+0.056
ched+0.055
oped+0.055
 rode+0.054
Neg logit contributions
 new-0.087
 sight-0.083
 old-0.078
 big-0.077
 elderly-0.075
Token could
Token ID714
Feature activation-1.96e+00

Pos logit contributions
ged+0.043
 argued+0.042
ked+0.042
pping+0.042
fed+0.040
Neg logit contributions
 new-0.155
 big-0.151
 old-0.149
 large-0.143
 little-0.143
Token help
Token ID1037
Feature activation-1.60e+00

Pos logit contributions
 argued+0.040
ked+0.038
es+0.038
 fought+0.037
ged+0.037
Neg logit contributions
 new-0.039
 big-0.037
 sight-0.035
 view-0.033
 large-0.033
Token him
Token ID683
Feature activation-1.60e+00

Pos logit contributions
 could+0.029
 wouldn+0.028
 fought+0.028
 argued+0.028
 rode+0.028
Neg logit contributions
 new-0.035
 sight-0.034
 big-0.031
 old-0.031
 world-0.030
Token move
Token ID1445
Feature activation-4.79e-01

Pos logit contributions
 argued+0.032
 fought+0.032
 could+0.032
 wouldn+0.032
ked+0.031
Neg logit contributions
 new-0.032
 big-0.029
 sight-0.028
 old-0.027
 biggest-0.027
Token the
Token ID262
Feature activation+1.32e+01

Pos logit contributions
 could+0.008
 wouldn+0.008
 argued+0.008
 fought+0.008
 rode+0.008
Neg logit contributions
 sight-0.011
 new-0.010
 big-0.009
 old-0.009
 view-0.009
Token cart
Token ID6383
Feature activation-3.26e+00

Pos logit contributions
 sight+0.243
 new+0.212
err+0.210
 beauty+0.201
 future+0.201
Neg logit contributions
 could-0.277
 wouldn-0.265
 argued-0.250
 rode-0.246
 fought-0.242
Token.
Token ID13
Feature activation-2.32e+00

Pos logit contributions
 could+0.062
 argued+0.060
 fought+0.060
 wouldn+0.060
 rode+0.058
Neg logit contributions
 new-0.066
 sight-0.064
 big-0.059
 old-0.059
 world-0.058
Token To
Token ID1675
Feature activation-2.55e+00

Pos logit contributions
 wouldn+0.043
 could+0.042
 fought+0.042
 argued+0.042
 rode+0.040
Neg logit contributions
 new-0.049
 sight-0.048
 old-0.044
 elderly-0.043
 big-0.043
Token his
Token ID465
Feature activation+4.33e+00

Pos logit contributions
 could+0.050
 wouldn+0.048
 argued+0.048
 fought+0.047
ked+0.046
Neg logit contributions
 new-0.050
 sight-0.050
 big-0.046
 old-0.044
 view-0.044
Token surprise
Token ID5975
Feature activation-2.43e+00

Pos logit contributions
 sight+0.055
 elderly+0.052
 new+0.051
 biggest+0.050
 world+0.049
Neg logit contributions
 could-0.102
 argued-0.101
 let-0.099
 fought-0.098
 managed-0.097
Token to
Token ID284
Feature activation+8.70e-01

Pos logit contributions
 new+0.020
 sight+0.020
 big+0.018
 old+0.018
 elderly+0.018
Neg logit contributions
 could-0.013
 wouldn-0.013
 argued-0.012
 fought-0.012
es-0.012
Token wait
Token ID4043
Feature activation-1.29e+00

Pos logit contributions
 sight+0.020
 new+0.020
 old+0.018
 big+0.018
 view+0.018
Neg logit contributions
 could-0.015
 wouldn-0.014
 argued-0.014
 fought-0.014
 rode-0.013
Token for
Token ID329
Feature activation+1.17e+00

Pos logit contributions
 could+0.023
 wouldn+0.023
 argued+0.022
 fought+0.022
 rode+0.021
Neg logit contributions
 sight-0.028
 new-0.027
 big-0.024
 old-0.024
 view-0.024
Token me
Token ID502
Feature activation-1.37e+00

Pos logit contributions
 sight+0.023
 new+0.021
 world+0.021
 biggest+0.021
 view+0.021
Neg logit contributions
 could-0.022
 fought-0.021
 argued-0.021
 let-0.020
 wouldn-0.020
Token on
Token ID319
Feature activation+6.51e-01

Pos logit contributions
 argued+0.025
 wouldn+0.025
 could+0.024
 rode+0.024
 fought+0.024
Neg logit contributions
 new-0.029
 sight-0.028
 big-0.026
 old-0.026
 elderly-0.025
Token the
Token ID262
Feature activation+1.32e+01

Pos logit contributions
 sight+0.014
 new+0.014
 world+0.012
 biggest+0.012
 view+0.012
Neg logit contributions
 could-0.012
 wouldn-0.011
 fought-0.011
 argued-0.011
 rode-0.010
Token slide
Token ID10649
Feature activation-2.71e+00

Pos logit contributions
 sight+0.228
igsaw+0.211
err+0.204
 new+0.203
andel+0.198
Neg logit contributions
 could-0.272
 wouldn-0.269
 rode-0.260
 let-0.242
 snatched-0.242
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
 wouldn+0.053
 could+0.052
 argued+0.052
 managed+0.051
ched+0.051
Neg logit contributions
 new-0.053
 sight-0.051
 old-0.048
 big-0.048
 elderly-0.046
Token Then
Token ID3244
Feature activation-1.45e+00

Pos logit contributions
 could+0.022
 wouldn+0.021
 argued+0.020
 fought+0.020
 rode+0.019
Neg logit contributions
 sight-0.039
 new-0.038
 big-0.035
 old-0.035
 view-0.035
Token we
Token ID356
Feature activation-3.10e+00

Pos logit contributions
 argued+0.025
ched+0.025
oped+0.025
 fought+0.024
 wouldn+0.024
Neg logit contributions
 new-0.033
 sight-0.032
 big-0.030
 sun-0.029
 old-0.029
Token can
Token ID460
Feature activation-2.90e-01

Pos logit contributions
es+0.042
 argued+0.042
ked+0.041
ged+0.041
 could+0.040
Neg logit contributions
 big-0.071
 new-0.071
 sight-0.070
 view-0.069
 world-0.068
Token,
Token ID11
Feature activation-6.42e-01

Pos logit contributions
 could+0.003
 wouldn+0.003
 fought+0.003
 argued+0.003
 rode+0.003
Neg logit contributions
 sight-0.005
 new-0.005
 view-0.004
 world-0.004
 old-0.004
Token can
Token ID460
Feature activation-2.60e+00

Pos logit contributions
 could+0.012
 wouldn+0.011
 fought+0.011
 argued+0.011
 rode+0.011
Neg logit contributions
 sight-0.014
 new-0.013
 view-0.012
 future-0.012
 world-0.012
Token you
Token ID345
Feature activation-1.34e-01

Pos logit contributions
 argued+0.041
 wouldn+0.041
 could+0.040
 fought+0.040
ched+0.039
Neg logit contributions
 new-0.061
 sight-0.058
 big-0.056
 old-0.055
 large-0.053
Token please
Token ID3387
Feature activation-1.37e+00

Pos logit contributions
 could+0.003
 wouldn+0.002
 argued+0.002
 fought+0.002
 rode+0.002
Neg logit contributions
 sight-0.003
 new-0.003
 big-0.002
 old-0.002
 view-0.002
Token replace
Token ID6330
Feature activation+2.25e+00

Pos logit contributions
 could+0.024
 wouldn+0.023
 argued+0.023
 fought+0.022
es+0.022
Neg logit contributions
 new-0.030
 sight-0.030
 big-0.028
 old-0.027
 view-0.027
Token the
Token ID262
Feature activation+1.32e+01

Pos logit contributions
 sight+0.047
 new+0.045
 world+0.043
 future+0.043
 entire+0.042
Neg logit contributions
 could-0.039
 wouldn-0.037
es-0.035
 fought-0.035
 argued-0.034
Token rock
Token ID3881
Feature activation-2.13e+00

Pos logit contributions
igsaw+0.264
 sight+0.209
err+0.200
andel+0.196
ummy+0.196
Neg logit contributions
 could-0.273
 hid-0.246
 wouldn-0.239
 let-0.239
 fought-0.228
Token you
Token ID345
Feature activation-1.44e-01

Pos logit contributions
 could+0.036
 wouldn+0.035
 fought+0.034
 argued+0.034
 rode+0.032
Neg logit contributions
 sight-0.048
 new-0.047
 big-0.043
 old-0.043
 view-0.043
Token moved
Token ID3888
Feature activation+3.09e-01

Pos logit contributions
 could+0.003
 wouldn+0.002
 argued+0.002
 fought+0.002
 rode+0.002
Neg logit contributions
 sight-0.003
 new-0.003
 big-0.003
 old-0.003
 world-0.003
Token from
Token ID422
Feature activation+9.56e-01

Pos logit contributions
 sight+0.007
 new+0.007
 big+0.006
 old+0.006
 world+0.006
Neg logit contributions
 could-0.005
 wouldn-0.005
 argued-0.005
 fought-0.005
 rode-0.005
Token the
Token ID262
Feature activation+1.12e+01

Pos logit contributions
 sight+0.019
 new+0.019
 world+0.017
 view+0.017
 biggest+0.017
Neg logit contributions
 could-0.018
 wouldn-0.018
 fought-0.017
 argued-0.017
 rode-0.016
Token
Token ID198
Feature activation+1.65e+00

Pos logit contributions
 sight+0.050
 new+0.048
 view+0.044
 biggest+0.044
 world+0.044
Neg logit contributions
 could-0.037
 wouldn-0.033
 fought-0.033
 argued-0.032
 rode-0.032
Token"
Token ID1
Feature activation-2.21e+00

Pos logit contributions
 sight+0.041
 new+0.040
 big+0.037
 old+0.036
 view+0.036
Neg logit contributions
 could-0.025
 wouldn-0.024
 argued-0.023
 fought-0.023
 rode-0.022
TokenI
Token ID40
Feature activation-7.49e-01

Pos logit contributions
 could+0.038
 wouldn+0.038
 argued+0.036
 fought+0.035
 rode+0.035
Neg logit contributions
 sight-0.049
 new-0.049
 big-0.044
 elderly-0.043
 world-0.043
Token regret
Token ID13721
Feature activation-2.32e+00

Pos logit contributions
 argued+0.013
 wouldn+0.013
ked+0.013
 rode+0.013
es+0.013
Neg logit contributions
 new-0.015
 sight-0.015
 big-0.015
 whole-0.014
 large-0.013
Token kicking
Token ID17997
Feature activation+5.08e-01

Pos logit contributions
 could+0.042
 rode+0.042
 argued+0.041
 wouldn+0.040
 managed+0.040
Neg logit contributions
 sight-0.049
 new-0.049
 big-0.045
 old-0.044
 elderly-0.043
Token the
Token ID262
Feature activation+1.32e+01

Pos logit contributions
 sight+0.011
 new+0.011
 old+0.010
 big+0.010
 view+0.010
Neg logit contributions
 could-0.009
 wouldn-0.009
 argued-0.008
 fought-0.008
 rode-0.008
Token ball
Token ID2613
Feature activation-1.45e+00

Pos logit contributions
 sight+0.252
andel+0.233
 former+0.232
 new+0.231
igsaw+0.231
Neg logit contributions
 could-0.250
 wouldn-0.226
 snatched-0.225
 fought-0.224
 scored-0.220
Token,"
Token ID553
Feature activation-3.21e+00

Pos logit contributions
 could+0.023
 wouldn+0.022
 fought+0.022
 argued+0.021
 rode+0.020
Neg logit contributions
 sight-0.034
 new-0.034
 old-0.030
 view-0.030
 world-0.030
Token Lily
Token ID20037
Feature activation-5.06e+00

Pos logit contributions
 wouldn+0.054
 could+0.054
 argued+0.051
 fought+0.051
ched+0.050
Neg logit contributions
 new-0.074
 sight-0.074
 old-0.068
 big-0.067
 elderly-0.066
Token said
Token ID531
Feature activation-1.53e+00

Pos logit contributions
oped+0.080
 wouldn+0.079
sels+0.075
pped+0.074
pping+0.074
Neg logit contributions
 old-0.114
 new-0.114
 big-0.109
 elderly-0.107
 sight-0.104
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
 could+0.032
 wouldn+0.032
 fought+0.032
 argued+0.032
 rode+0.031
Neg logit contributions
 new-0.029
 sight-0.026
 old-0.025
 big-0.025
 sun-0.023
Token "
Token ID366
Feature activation-2.30e+00

Pos logit contributions
 argued+0.078
 fought+0.074
 rode+0.070
ched+0.070
 managed+0.070
Neg logit contributions
 new-0.046
 old-0.041
 big-0.040
 sun-0.039
 sight-0.038
TokenMaybe
Token ID13300
Feature activation-9.33e-01

Pos logit contributions
 wouldn+0.049
 could+0.046
icked+0.045
 hid+0.044
 fought+0.044
Neg logit contributions
 world-0.040
 sight-0.039
 future-0.039
 new-0.037
 whole-0.036
Token we
Token ID356
Feature activation-3.36e+00

Pos logit contributions
 wouldn+0.015
 could+0.015
 argued+0.015
 fought+0.014
ched+0.014
Neg logit contributions
 new-0.022
 sight-0.021
 old-0.020
 big-0.020
 sun-0.019
Token can
Token ID460
Feature activation-1.56e-01

Pos logit contributions
ged+0.058
ked+0.056
fed+0.056
 rode+0.054
oped+0.054
Neg logit contributions
 big-0.068
 new-0.065
 world-0.061
 lot-0.060
 whole-0.060
Token share
Token ID2648
Feature activation+6.38e-01

Pos logit contributions
 wouldn+0.003
 could+0.003
ged+0.003
 argued+0.003
 fought+0.003
Neg logit contributions
 new-0.003
 big-0.003
 sight-0.003
 view-0.003
 large-0.003
Token the
Token ID262
Feature activation+1.32e+01

Pos logit contributions
 sight+0.014
 new+0.014
 big+0.012
 old+0.012
 view+0.012
Neg logit contributions
 could-0.011
 wouldn-0.011
 argued-0.011
 fought-0.011
 rode-0.010
Token ball
Token ID2613
Feature activation-1.50e+00

Pos logit contributions
andel+0.247
 sight+0.243
 doorway+0.223
umm+0.217
igsaw+0.214
Neg logit contributions
 could-0.250
 argued-0.230
 rode-0.229
 fought-0.229
 let-0.226
Token with
Token ID351
Feature activation-4.65e-01

Pos logit contributions
 could+0.026
 wouldn+0.025
 argued+0.024
 fought+0.024
 rode+0.023
Neg logit contributions
 sight-0.034
 new-0.033
 big-0.030
 old-0.030
 world-0.030
Token him
Token ID683
Feature activation-2.13e+00

Pos logit contributions
 could+0.008
 wouldn+0.008
 argued+0.008
 fought+0.008
 rode+0.007
Neg logit contributions
 sight-0.010
 new-0.010
 big-0.009
 old-0.009
 elderly-0.009
Token.
Token ID13
Feature activation-2.23e+00

Pos logit contributions
 argued+0.044
 rode+0.044
 snatched+0.043
 wouldn+0.043
 managed+0.042
Neg logit contributions
 new-0.039
 sight-0.036
 big-0.035
 old-0.034
 world-0.034
Token Maybe
Token ID6674
Feature activation-5.68e-01

Pos logit contributions
 argued+0.040
 managed+0.038
 fought+0.038
 wouldn+0.038
ged+0.038
Neg logit contributions
 new-0.047
 sight-0.045
 sun-0.044
 old-0.043
 biggest-0.043
Token said
Token ID531
Feature activation-2.62e+00

Pos logit contributions
ched+0.073
 wouldn+0.073
 Leave+0.073
 could+0.071
 argued+0.071
Neg logit contributions
 new-0.141
 sight-0.135
 big-0.133
 old-0.131
 little-0.127
Token,
Token ID11
Feature activation-3.17e+00

Pos logit contributions
 argued+0.060
 rode+0.059
 fought+0.057
 wrote+0.056
 wouldn+0.055
Neg logit contributions
 new-0.049
 big-0.042
 sight-0.041
 old-0.040
 large-0.036
Token "
Token ID366
Feature activation-2.10e+00

Pos logit contributions
 argued+0.074
 fought+0.073
ched+0.071
 rode+0.070
 managed+0.070
Neg logit contributions
 new-0.055
 sight-0.048
 old-0.048
 big-0.048
 sun-0.047
TokenI
Token ID40
Feature activation-1.47e+00

Pos logit contributions
 wouldn+0.037
 argued+0.036
 could+0.036
 fought+0.035
es+0.034
Neg logit contributions
 sight-0.044
 new-0.044
 world-0.041
 big-0.040
 elderly-0.040
Token broke
Token ID6265
Feature activation+9.78e-01

Pos logit contributions
 argued+0.021
ked+0.021
es+0.020
ged+0.019
fed+0.019
Neg logit contributions
 big-0.034
 sight-0.034
 new-0.033
 large-0.032
 whole-0.031
Token the
Token ID262
Feature activation+1.32e+01

Pos logit contributions
 sight+0.021
 new+0.020
 world+0.018
 future+0.018
 view+0.018
Neg logit contributions
 could-0.018
 wouldn-0.017
 fought-0.016
 argued-0.016
es-0.016
Token branch
Token ID8478
Feature activation-2.00e+00

Pos logit contributions
igsaw+0.172
 sight+0.169
andel+0.159
 new+0.157
err+0.154
Neg logit contributions
 could-0.323
 wouldn-0.310
 snatched-0.290
 fought-0.286
 hid-0.286
Token.
Token ID13
Feature activation-2.92e+00

Pos logit contributions
 could+0.035
 wouldn+0.034
 argued+0.033
 fought+0.033
 rode+0.032
Neg logit contributions
 sight-0.044
 new-0.044
 big-0.040
 old-0.039
 world-0.039
Token I
Token ID314
Feature activation-3.22e+00

Pos logit contributions
 argued+0.048
 fought+0.046
 wouldn+0.045
 managed+0.043
ged+0.043
Neg logit contributions
 new-0.066
 sight-0.065
 old-0.063
 sun-0.063
 elderly-0.062
Token wanted
Token ID2227
Feature activation-4.91e-02

Pos logit contributions
 argued+0.042
es+0.041
ked+0.041
ged+0.039
 rode+0.038
Neg logit contributions
 big-0.076
 new-0.075
 sight-0.074
 whole-0.071
 large-0.071
Token to
Token ID284
Feature activation-2.40e-01

Pos logit contributions
 could+0.001
 wouldn+0.001
 argued+0.001
 fought+0.001
 rode+0.001
Neg logit contributions
 new-0.001
 sight-0.001
 big-0.001
 old-0.001
 view-0.001
Token late
Token ID2739
Feature activation-2.88e+00

Pos logit contributions
 biggest+0.076
 sight+0.074
 doorway+0.072
umm+0.072
 world+0.070
Neg logit contributions
 could-0.095
 fought-0.086
 let-0.085
 managed-0.085
 stood-0.083
Token.
Token ID13
Feature activation-2.22e+00

Pos logit contributions
 argued+0.064
 could+0.064
 rode+0.063
 managed+0.063
 fought+0.063
Neg logit contributions
 sight-0.049
 new-0.049
 old-0.043
 big-0.042
 view-0.040
Token The
Token ID383
Feature activation+1.03e+01

Pos logit contributions
 could+0.036
 wouldn+0.035
 argued+0.034
 fought+0.034
 rode+0.033
Neg logit contributions
 sight-0.052
 new-0.051
 old-0.047
 big-0.046
 elderly-0.046
Token dog
Token ID3290
Feature activation-4.35e+00

Pos logit contributions
andel+0.198
igsaw+0.197
 sight+0.189
ummy+0.185
umm+0.184
Neg logit contributions
 could-0.193
 fought-0.175
 rode-0.174
 wouldn-0.168
 let-0.167
Token ripped
Token ID19551
Feature activation+8.55e-01

Pos logit contributions
uable+0.054
 argued+0.051
 wrote+0.051
 whoever+0.049
lets+0.048
Neg logit contributions
 big-0.111
 new-0.111
 sight-0.103
 large-0.102
 little-0.101
Token the
Token ID262
Feature activation+1.32e+01

Pos logit contributions
 sight+0.020
 new+0.019
 world+0.018
 view+0.018
 elderly+0.017
Neg logit contributions
 could-0.014
 wouldn-0.014
 argued-0.013
 fought-0.013
 rode-0.013
Token rocket
Token ID10701
Feature activation-7.86e-01

Pos logit contributions
 sight+0.229
andel+0.213
ummy+0.213
igsaw+0.212
 new+0.208
Neg logit contributions
 could-0.263
 wouldn-0.244
 fought-0.240
 scored-0.238
 rode-0.237
Token apart
Token ID5475
Feature activation+7.56e-01

Pos logit contributions
 could+0.014
 argued+0.014
 wouldn+0.014
 fought+0.013
 rode+0.013
Neg logit contributions
 new-0.017
 sight-0.017
 big-0.015
 old-0.015
 world-0.015
Token and
Token ID290
Feature activation-3.44e+00

Pos logit contributions
 sight+0.016
 new+0.016
 big+0.014
 old+0.014
 world+0.014
Neg logit contributions
 could-0.014
 wouldn-0.014
 argued-0.014
 rode-0.014
 fought-0.014
Token spat
Token ID15246
Feature activation-1.64e+00

Pos logit contributions
 argued+0.044
oped+0.044
 could+0.043
es+0.043
ched+0.041
Neg logit contributions
 new-0.086
 sight-0.085
 big-0.082
 dark-0.079
 old-0.079
Token out
Token ID503
Feature activation+1.74e+00

Pos logit contributions
 fought+0.030
 argued+0.030
 rode+0.029
 could+0.028
 shared+0.027
Neg logit contributions
 new-0.036
 sight-0.035
 big-0.034
 sun-0.031
 old-0.031
Token mom
Token ID1995
Feature activation-3.63e+00

Pos logit contributions
igsaw+0.085
 sight+0.084
andel+0.080
 view+0.077
umm+0.074
Neg logit contributions
 could-0.107
 wouldn-0.105
 rode-0.097
 fought-0.092
 hid-0.091
Tokenmy
Token ID1820
Feature activation-5.37e+00

Pos logit contributions
ched+0.082
ked+0.081
 Leave+0.081
es+0.079
 could+0.079
Neg logit contributions
 new-0.055
 elderly-0.049
 sight-0.048
 old-0.048
 big-0.048
Token explained
Token ID4893
Feature activation-2.37e+00

Pos logit contributions
 Leave+0.080
ked+0.074
sels+0.071
ched+0.071
 scored+0.069
Neg logit contributions
 new-0.126
 elderly-0.120
 old-0.119
 big-0.118
 little-0.114
Token that
Token ID326
Feature activation-1.44e+00

Pos logit contributions
 rode+0.040
oped+0.040
 fought+0.039
 managed+0.039
 argued+0.039
Neg logit contributions
 new-0.058
 sight-0.053
 old-0.052
 big-0.050
 elderly-0.047
Token the
Token ID262
Feature activation+8.12e+00

Pos logit contributions
 argued+0.021
 could+0.020
 fought+0.020
ked+0.020
 wouldn+0.020
Neg logit contributions
 new-0.036
 sight-0.035
 old-0.033
 big-0.032
 world-0.032
Token doctors
Token ID7519
Feature activation-8.52e+00

Pos logit contributions
 sight+0.125
 new+0.111
 view+0.103
igsaw+0.102
ummy+0.098
Neg logit contributions
 could-0.198
 wouldn-0.191
 rode-0.180
 fought-0.180
 argued-0.176
Token were
Token ID547
Feature activation+1.10e+00

Pos logit contributions
ked+0.103
oped+0.096
 snatched+0.096
ged+0.094
 fet+0.093
Neg logit contributions
 new-0.213
 old-0.204
 big-0.198
 world-0.195
 sight-0.189
Token going
Token ID1016
Feature activation-7.97e-01

Pos logit contributions
 new+0.021
 sight+0.020
 old+0.019
 big+0.019
 elderly+0.018
Neg logit contributions
 could-0.023
 wouldn-0.022
 argued-0.022
 fought-0.022
 rode-0.022
Token to
Token ID284
Feature activation-6.36e-01

Pos logit contributions
 could+0.014
 argued+0.014
 fought+0.014
 wouldn+0.013
 scored+0.013
Neg logit contributions
 new-0.018
 sight-0.017
 old-0.016
 big-0.016
 world-0.015
Token fix
Token ID4259
Feature activation-1.25e+00

Pos logit contributions
 could+0.012
 wouldn+0.011
 argued+0.011
 fought+0.011
 rode+0.010
Neg logit contributions
 sight-0.014
 new-0.014
 old-0.012
 big-0.012
 view-0.012
Token something
Token ID1223
Feature activation-7.22e-01

Pos logit contributions
 could+0.024
 wouldn+0.023
 argued+0.022
 fought+0.022
 rode+0.021
Neg logit contributions
 sight-0.026
 new-0.026
 big-0.023
 old-0.023
 view-0.023
Token the
Token ID262
Feature activation+9.12e+00

Pos logit contributions
 sight+0.011
 new+0.011
 biggest+0.010
 world+0.010
 view+0.010
Neg logit contributions
 could-0.008
 wouldn-0.008
 fought-0.008
 argued-0.008
 rode-0.008
Token hospital
Token ID4436
Feature activation-3.35e+00

Pos logit contributions
 sight+0.166
err+0.152
igsaw+0.152
andel+0.149
 view+0.146
Neg logit contributions
 could-0.177
 wouldn-0.164
 snatched-0.162
 stole-0.155
 let-0.154
Token,
Token ID11
Feature activation-2.31e+00

Pos logit contributions
 could+0.065
ched+0.062
 argued+0.062
 wouldn+0.062
 fought+0.062
Neg logit contributions
 new-0.069
 sight-0.066
 old-0.063
 big-0.060
 elderly-0.059
Token where
Token ID810
Feature activation-1.80e+00

Pos logit contributions
 could+0.042
ched+0.041
es+0.041
 argued+0.040
 fought+0.040
Neg logit contributions
 new-0.049
 sight-0.048
 old-0.044
 big-0.044
 elderly-0.043
Token the
Token ID262
Feature activation+8.51e+00

Pos logit contributions
 could+0.025
 argued+0.025
 wouldn+0.024
 fought+0.024
ched+0.024
Neg logit contributions
 new-0.045
 sight-0.044
 big-0.042
 old-0.041
 elderly-0.040
Token doctors
Token ID7519
Feature activation-8.33e+00

Pos logit contributions
 sight+0.155
igsaw+0.151
 new+0.136
ummy+0.134
 view+0.134
Neg logit contributions
 could-0.179
 wouldn-0.172
 fought-0.158
 rode-0.157
 snatched-0.152
Token said
Token ID531
Feature activation-3.37e+00

Pos logit contributions
ked+0.103
es+0.099
uable+0.098
lets+0.097
icked+0.097
Neg logit contributions
 new-0.190
 big-0.189
 sight-0.186
 window-0.182
 world-0.179
Token he
Token ID339
Feature activation-4.36e+00

Pos logit contributions
 argued+0.053
 fought+0.052
oped+0.052
 rode+0.051
ched+0.050
Neg logit contributions
 new-0.083
 sight-0.075
 old-0.075
 big-0.075
 sun-0.069
Token had
Token ID550
Feature activation-1.35e+00

Pos logit contributions
 Whoever+0.065
pping+0.065
 argued+0.065
ged+0.064
ked+0.064
Neg logit contributions
 new-0.092
 big-0.091
 old-0.087
 little-0.084
 whole-0.082
Token eaten
Token ID17065
Feature activation-6.02e-01

Pos logit contributions
es+0.024
oped+0.024
 could+0.023
ked+0.023
 wouldn+0.022
Neg logit contributions
 new-0.030
 sight-0.029
 big-0.027
 old-0.026
 dark-0.025
Token poisonous
Token ID32199
Feature activation+1.87e+00

Pos logit contributions
 could+0.010
 wouldn+0.009
 argued+0.009
 fought+0.009
 managed+0.009
Neg logit contributions
 sight-0.014
 new-0.014
 world-0.012
 elderly-0.012
 view-0.012
Token the
Token ID262
Feature activation+8.60e+00

Pos logit contributions
 could+0.003
 fought+0.003
 wouldn+0.003
 argued+0.003
 managed+0.003
Neg logit contributions
 sight-0.005
 new-0.005
 biggest-0.005
 elderly-0.005
 view-0.005
Token land
Token ID1956
Feature activation-3.62e+00

Pos logit contributions
 sight+0.086
 new+0.084
err+0.080
ummy+0.070
andel+0.067
Neg logit contributions
 could-0.226
 wouldn-0.210
 wrote-0.209
 shared-0.207
 fought-0.206
Token.
Token ID13
Feature activation-3.40e+00

Pos logit contributions
 argued+0.070
 fought+0.069
 wouldn+0.068
 could+0.068
 rode+0.067
Neg logit contributions
 new-0.071
 sight-0.070
 big-0.065
 world-0.064
 old-0.064
Token Everywhere
Token ID48201
Feature activation-2.63e+00

Pos logit contributions
 wouldn+0.064
 fought+0.063
 argued+0.063
 could+0.061
 managed+0.060
Neg logit contributions
 new-0.069
 sight-0.069
 sun-0.063
 elderly-0.062
 world-0.062
Token the
Token ID262
Feature activation+8.80e+00

Pos logit contributions
ked+0.046
 argued+0.046
ched+0.045
icked+0.045
 fought+0.043
Neg logit contributions
 new-0.061
 sight-0.059
 old-0.055
 big-0.054
 world-0.052
Token butterfly
Token ID35113
Feature activation-8.21e+00

Pos logit contributions
 sight+0.122
err+0.110
 future+0.109
andel+0.107
 new+0.105
Neg logit contributions
 could-0.216
 fought-0.195
 wrote-0.195
 rode-0.195
 let-0.190
Token flew
Token ID13112
Feature activation-1.78e+00

Pos logit contributions
ked+0.103
icked+0.101
uable+0.099
pped+0.099
 argued+0.098
Neg logit contributions
 new-0.203
 sight-0.195
 big-0.192
 sun-0.186
 world-0.185
Token,
Token ID11
Feature activation-2.07e+00

Pos logit contributions
 argued+0.032
 fought+0.031
 could+0.031
 wouldn+0.031
 rode+0.030
Neg logit contributions
 sight-0.039
 new-0.039
 big-0.035
 old-0.034
 view-0.034
Token the
Token ID262
Feature activation+9.85e+00

Pos logit contributions
 could+0.034
 wouldn+0.034
 argued+0.034
 fought+0.033
ched+0.031
Neg logit contributions
 new-0.048
 sight-0.047
 big-0.043
 old-0.043
 world-0.042
Token dinosaur
Token ID28257
Feature activation-4.86e+00

Pos logit contributions
 sight+0.136
 new+0.128
 future+0.117
err+0.113
 view+0.112
Neg logit contributions
 could-0.241
 fought-0.224
 wouldn-0.222
 argued-0.216
 scored-0.215
Token's
Token ID338
Feature activation+4.58e+00

Pos logit contributions
 argued+0.061
uable+0.059
icked+0.058
ked+0.058
pped+0.058
Neg logit contributions
 new-0.125
 big-0.119
 sight-0.115
 world-0.114
 old-0.112
Token people
Token ID661
Feature activation-1.11e+00

Pos logit contributions
 could+0.007
 wouldn+0.007
 argued+0.007
 fought+0.007
 rode+0.006
Neg logit contributions
 sight-0.006
 new-0.006
 view-0.005
 world-0.005
 old-0.005
Token to
Token ID284
Feature activation+1.95e-01

Pos logit contributions
 could+0.019
 wouldn+0.019
 argued+0.018
 fought+0.018
 managed+0.018
Neg logit contributions
 new-0.025
 sight-0.024
 big-0.022
 old-0.022
 elderly-0.021
Token the
Token ID262
Feature activation+9.40e+00

Pos logit contributions
 sight+0.004
 new+0.004
 view+0.004
 world+0.004
 old+0.004
Neg logit contributions
 could-0.004
 wouldn-0.004
 fought-0.003
 argued-0.003
 rode-0.003
Token hospital
Token ID4436
Feature activation-4.33e+00

Pos logit contributions
 sight+0.149
igsaw+0.148
err+0.130
ummy+0.130
 new+0.130
Neg logit contributions
 could-0.206
 wouldn-0.199
 fought-0.198
 rode-0.195
 argued-0.194
Token where
Token ID810
Feature activation-1.99e+00

Pos logit contributions
 could+0.079
 argued+0.078
 wouldn+0.076
 fought+0.076
 managed+0.076
Neg logit contributions
 new-0.092
 sight-0.086
 old-0.084
 big-0.083
 elderly-0.080
Token doctors
Token ID7519
Feature activation-8.15e+00

Pos logit contributions
 argued+0.040
 wouldn+0.040
 could+0.039
 fought+0.039
ked+0.039
Neg logit contributions
 new-0.038
 sight-0.036
 big-0.034
 old-0.034
 world-0.033
Token can
Token ID460
Feature activation-1.54e+00

Pos logit contributions
ked+0.130
ged+0.124
 argued+0.124
es+0.123
icked+0.122
Neg logit contributions
 new-0.175
 big-0.170
 old-0.161
 world-0.161
 sight-0.160
Token make
Token ID787
Feature activation+3.76e-01

Pos logit contributions
 could+0.027
 wouldn+0.027
 argued+0.026
 fought+0.026
 managed+0.026
Neg logit contributions
 new-0.034
 sight-0.032
 big-0.032
 old-0.031
 world-0.030
Token them
Token ID606
Feature activation+4.06e-01

Pos logit contributions
 sight+0.007
 new+0.007
 big+0.006
 old+0.006
 view+0.006
Neg logit contributions
 could-0.008
 wouldn-0.007
 argued-0.007
 fought-0.007
 rode-0.007
Token better
Token ID1365
Feature activation-6.40e-01

Pos logit contributions
 new+0.007
 sight+0.007
 big+0.006
 old+0.006
 view+0.006
Neg logit contributions
 could-0.009
 wouldn-0.009
 argued-0.008
 fought-0.008
 rode-0.008
Token."
Token ID526
Feature activation-1.86e+00

Pos logit contributions
 could+0.010
 wouldn+0.010
 argued+0.010
 fought+0.010
 rode+0.009
Neg logit contributions
 sight-0.015
 new-0.015
 old-0.014
 big-0.014
 world-0.013
Token wizard
Token ID18731
Feature activation-2.64e+00

Pos logit contributions
 sight+0.080
err+0.069
andel+0.066
 ey+0.066
ummy+0.065
Neg logit contributions
 could-0.207
 wouldn-0.194
 let-0.193
 shared-0.188
 fought-0.188
Token.
Token ID13
Feature activation-3.68e+00

Pos logit contributions
 argued+0.050
 wouldn+0.049
 could+0.048
 fought+0.048
 managed+0.048
Neg logit contributions
 new-0.051
 sight-0.049
 world-0.047
 old-0.047
 big-0.045
Token Every
Token ID3887
Feature activation+2.18e+00

Pos logit contributions
 wouldn+0.074
 argued+0.073
 fought+0.072
 managed+0.069
ched+0.069
Neg logit contributions
 new-0.071
 sight-0.069
 old-0.065
 world-0.065
 sun-0.064
Token day
Token ID1110
Feature activation-3.34e+00

Pos logit contributions
 sight+0.020
 new+0.019
 elderly+0.015
 biggest+0.015
 old+0.014
Neg logit contributions
 could-0.066
 wouldn-0.064
 fought-0.063
 argued-0.062
 rode-0.062
Token the
Token ID262
Feature activation+8.82e+00

Pos logit contributions
 argued+0.061
ked+0.060
oped+0.058
 wouldn+0.058
 fought+0.058
Neg logit contributions
 new-0.072
 sight-0.068
 big-0.065
 sun-0.065
 old-0.063
Token wizard
Token ID18731
Feature activation-8.14e+00

Pos logit contributions
 sight+0.168
 new+0.149
igsaw+0.146
err+0.146
ummy+0.145
Neg logit contributions
 could-0.178
 wouldn-0.160
 wrote-0.156
 rode-0.150
 fought-0.149
Token used
Token ID973
Feature activation-7.87e-01

Pos logit contributions
ked+0.087
 argued+0.086
ged+0.083
pped+0.082
oped+0.082
Neg logit contributions
 new-0.218
 big-0.206
 old-0.204
 sight-0.200
 world-0.193
Token this
Token ID428
Feature activation+4.81e+00

Pos logit contributions
 could+0.016
 wouldn+0.015
 argued+0.015
 fought+0.015
 rode+0.014
Neg logit contributions
 sight-0.016
 new-0.015
 old-0.014
 big-0.014
 world-0.014
Token staff
Token ID3085
Feature activation-4.77e+00

Pos logit contributions
 sight+0.064
 new+0.058
 elderly+0.056
 view+0.051
 future+0.051
Neg logit contributions
 could-0.125
 wouldn-0.121
ched-0.113
 managed-0.112
 fought-0.111
Token for
Token ID329
Feature activation-6.63e-01

Pos logit contributions
 argued+0.080
 fought+0.079
 wouldn+0.077
 rode+0.076
 could+0.076
Neg logit contributions
 new-0.107
 sight-0.101
 big-0.097
 old-0.097
 world-0.093
Token magic
Token ID5536
Feature activation+6.17e-01

Pos logit contributions
 could+0.013
 wouldn+0.013
 fought+0.013
 argued+0.013
 rode+0.012
Neg logit contributions
 sight-0.013
 new-0.012
 world-0.011
 view-0.011
 old-0.011
Token.
Token ID13
Feature activation-2.08e+00

Pos logit contributions
 could+0.023
 fought+0.023
 wouldn+0.023
ched+0.022
 rode+0.022
Neg logit contributions
 sight-0.027
 new-0.027
 old-0.024
 world-0.024
 big-0.024
Token
Token ID198
Feature activation+1.67e+00

Pos logit contributions
 wouldn+0.036
 could+0.035
 fought+0.035
 argued+0.034
ched+0.033
Neg logit contributions
 sight-0.046
 new-0.046
 world-0.042
 old-0.041
 big-0.041
Token
Token ID198
Feature activation+1.16e+00

Pos logit contributions
 sight+0.039
 new+0.038
 biggest+0.035
 old+0.034
 elderly+0.034
Neg logit contributions
 could-0.028
 wouldn-0.025
 argued-0.024
 rode-0.024
 fought-0.024
TokenThe
Token ID464
Feature activation+8.30e+00

Pos logit contributions
 sight+0.024
 new+0.024
 old+0.021
 big+0.021
 view+0.021
Neg logit contributions
 could-0.022
 wouldn-0.021
 argued-0.020
 fought-0.020
 rode-0.020
Token yoga
Token ID20351
Feature activation-4.58e-01

Pos logit contributions
 sight+0.127
 view+0.103
err+0.100
ummy+0.099
 new+0.098
Neg logit contributions
 could-0.195
 let-0.175
 wouldn-0.174
 scored-0.173
 rode-0.172
Token instructor
Token ID21187
Feature activation-8.08e+00

Pos logit contributions
 could+0.010
 wouldn+0.010
 argued+0.010
 fought+0.010
ked+0.010
Neg logit contributions
 new-0.008
 sight-0.008
 world-0.007
 big-0.007
 old-0.006
Token was
Token ID373
Feature activation+4.17e-01

Pos logit contributions
ked+0.100
uable+0.096
ched+0.093
 fought+0.090
pped+0.089
Neg logit contributions
 new-0.200
 big-0.189
 old-0.188
 sight-0.186
 world-0.181
Token very
Token ID845
Feature activation+1.51e+00

Pos logit contributions
 new+0.008
 sight+0.008
 big+0.008
 old+0.007
 elderly+0.007
Neg logit contributions
 could-0.008
 wouldn-0.008
 argued-0.008
 fought-0.008
 rode-0.008
Token friendly
Token ID8030
Feature activation-1.72e+00

Pos logit contributions
 sight+0.020
 new+0.019
 view+0.016
 world+0.016
 future+0.015
Neg logit contributions
 could-0.040
 wouldn-0.039
 fought-0.038
 argued-0.038
 managed-0.037
Token and
Token ID290
Feature activation-4.31e+00

Pos logit contributions
 rode+0.036
 fought+0.036
 wouldn+0.036
 argued+0.036
ched+0.036
Neg logit contributions
 new-0.032
 sight-0.032
 old-0.028
 world-0.028
 big-0.028
Token charming
Token ID23332
Feature activation-2.78e-01

Pos logit contributions
 argued+0.058
ched+0.058
oped+0.058
ked+0.057
es+0.057
Neg logit contributions
 new-0.111
 sight-0.104
 big-0.102
 old-0.102
 elderly-0.100
Token be
Token ID307
Feature activation+1.97e+00

Pos logit contributions
 could+0.009
 argued+0.009
es+0.009
 wouldn+0.009
 fought+0.009
Neg logit contributions
 new-0.011
 sight-0.011
 big-0.011
 view-0.010
 old-0.010
Token safe
Token ID3338
Feature activation-1.24e+00

Pos logit contributions
 new+0.035
 sight+0.034
 big+0.031
 old+0.030
 elderly+0.029
Neg logit contributions
 could-0.044
 wouldn-0.043
 argued-0.042
 fought-0.041
 rode-0.041
Token."
Token ID526
Feature activation-2.35e+00

Pos logit contributions
 argued+0.022
 could+0.022
 wouldn+0.022
 fought+0.022
 rode+0.021
Neg logit contributions
 sight-0.027
 new-0.027
 world-0.024
 old-0.024
 big-0.024
Token So
Token ID1406
Feature activation-3.07e+00

Pos logit contributions
 wouldn+0.048
 fought+0.046
 argued+0.046
 could+0.046
 managed+0.045
Neg logit contributions
 sight-0.045
 new-0.045
 sun-0.039
 big-0.039
 world-0.039
Token the
Token ID262
Feature activation+9.37e+00

Pos logit contributions
 argued+0.060
 fought+0.059
ched+0.058
 could+0.058
 wouldn+0.057
Neg logit contributions
 new-0.062
 sight-0.061
 big-0.056
 sun-0.055
 old-0.053
Token sheep
Token ID15900
Feature activation-7.97e+00

Pos logit contributions
 sight+0.156
 new+0.146
igsaw+0.139
 future+0.134
 view+0.129
Neg logit contributions
 could-0.216
 wouldn-0.201
 argued-0.196
 fought-0.195
 rode-0.192
Token quickly
Token ID2952
Feature activation-2.82e+00

Pos logit contributions
uable+0.090
 Leave+0.088
 could+0.088
 fought+0.085
 scored+0.085
Neg logit contributions
 new-0.193
 sight-0.188
 big-0.187
 old-0.181
 sun-0.179
Token slid
Token ID27803
Feature activation+4.85e-02

Pos logit contributions
 could+0.036
 wouldn+0.034
uable+0.033
 Did+0.032
 Leave+0.031
Neg logit contributions
 new-0.071
 old-0.069
 big-0.069
 sight-0.067
 sun-0.065
Token through
Token ID832
Feature activation+1.19e+00

Pos logit contributions
 sight+0.001
 new+0.001
 big+0.001
 view+0.001
 old+0.001
Neg logit contributions
 could-0.001
 wouldn-0.001
 fought-0.001
 argued-0.001
 rode-0.001
Token the
Token ID262
Feature activation+9.67e+00

Pos logit contributions
 sight+0.027
 new+0.027
 old+0.024
 big+0.024
 view+0.024
Neg logit contributions
 could-0.020
 wouldn-0.020
 argued-0.019
 fought-0.019
 rode-0.018
Token hole
Token ID7604
Feature activation-3.45e+00

Pos logit contributions
 sight+0.121
 new+0.114
ummy+0.107
igsaw+0.106
err+0.103
Neg logit contributions
 could-0.249
 hid-0.223
 fought-0.222
 wouldn-0.220
 managed-0.218
Token she
Token ID673
Feature activation-4.89e+00

Pos logit contributions
 fought+0.044
 argued+0.042
 rode+0.041
ched+0.041
 could+0.040
Neg logit contributions
 new-0.067
 sight-0.066
 old-0.060
 big-0.060
 sun-0.058
Token kept
Token ID4030
Feature activation+3.29e-01

Pos logit contributions
ked+0.068
 Leave+0.065
 argued+0.065
ged+0.064
es+0.063
Neg logit contributions
 new-0.114
 big-0.110
 sight-0.109
 old-0.105
 little-0.104
Token hoping
Token ID7725
Feature activation-6.44e-01

Pos logit contributions
 sight+0.006
 new+0.006
 big+0.006
 old+0.005
 view+0.005
Neg logit contributions
 could-0.007
 wouldn-0.007
 fought-0.007
 argued-0.007
ched-0.006
Token that
Token ID326
Feature activation-4.21e-01

Pos logit contributions
 could+0.010
 wouldn+0.009
 argued+0.008
 fought+0.008
 managed+0.008
Neg logit contributions
 sight-0.016
 new-0.016
 view-0.015
 world-0.014
 elderly-0.014
Token the
Token ID262
Feature activation+1.04e+01

Pos logit contributions
 could+0.007
 wouldn+0.007
 fought+0.007
 argued+0.007
 rode+0.007
Neg logit contributions
 sight-0.009
 new-0.009
 big-0.008
 old-0.008
 world-0.008
Token anchor
Token ID18021
Feature activation-7.97e+00

Pos logit contributions
igsaw+0.195
 sight+0.190
 new+0.187
err+0.180
ummy+0.176
Neg logit contributions
 could-0.209
 wouldn-0.196
 managed-0.180
 rode-0.179
 argued-0.177
Token would
Token ID561
Feature activation-9.18e-01

Pos logit contributions
 argued+0.079
 Leave+0.078
uable+0.078
es+0.076
ged+0.074
Neg logit contributions
 new-0.216
 sight-0.204
 big-0.203
 old-0.201
 world-0.193
Token come
Token ID1282
Feature activation-9.76e-01

Pos logit contributions
 could+0.015
 argued+0.015
es+0.015
 fought+0.015
 wouldn+0.015
Neg logit contributions
 new-0.020
 sight-0.020
 big-0.019
 view-0.018
 old-0.018
Token back
Token ID736
Feature activation-2.20e+00

Pos logit contributions
 could+0.021
 wouldn+0.020
 argued+0.020
 fought+0.020
 rode+0.019
Neg logit contributions
 sight-0.018
 new-0.018
 big-0.016
 old-0.015
 view-0.015
Token.
Token ID13
Feature activation-2.01e+00

Pos logit contributions
 could+0.038
 argued+0.037
 fought+0.037
 wouldn+0.036
 rode+0.035
Neg logit contributions
 new-0.049
 sight-0.049
 big-0.044
 old-0.044
 world-0.043
Token
Token ID220
Feature activation+1.56e-01

Pos logit contributions
 could+0.033
 wouldn+0.032
 fought+0.032
 argued+0.031
 rode+0.030
Neg logit contributions
 sight-0.047
 new-0.046
 big-0.042
 old-0.042
 world-0.042
Token proceeded
Token ID18814
Feature activation-2.97e+00

Pos logit contributions
ked+0.039
 wouldn+0.038
 argued+0.037
pping+0.037
uable+0.036
Neg logit contributions
 new-0.084
 big-0.082
 old-0.078
 sight-0.075
 dark-0.074
Token to
Token ID284
Feature activation-9.05e-01

Pos logit contributions
 could+0.053
 wouldn+0.052
 fought+0.052
 argued+0.051
ched+0.050
Neg logit contributions
 new-0.064
 sight-0.062
 old-0.059
 big-0.058
 elderly-0.056
Token guess
Token ID4724
Feature activation-2.48e+00

Pos logit contributions
 could+0.016
 wouldn+0.016
 argued+0.015
 fought+0.015
es+0.015
Neg logit contributions
 sight-0.019
 new-0.019
 big-0.018
 view-0.018
 old-0.017
Token what
Token ID644
Feature activation-1.47e+00

Pos logit contributions
 fought+0.040
 argued+0.040
 could+0.040
 wouldn+0.039
 rode+0.039
Neg logit contributions
 new-0.060
 sight-0.056
 big-0.053
 old-0.053
 view-0.051
Token the
Token ID262
Feature activation+8.48e+00

Pos logit contributions
 argued+0.026
ked+0.026
 wouldn+0.025
oped+0.025
 fought+0.024
Neg logit contributions
 new-0.035
 sight-0.032
 big-0.032
 old-0.031
 world-0.030
Token shield
Token ID7614
Feature activation-7.96e+00

Pos logit contributions
 sight+0.136
 new+0.129
ummy+0.124
andel+0.121
igsaw+0.120
Neg logit contributions
 could-0.191
 wouldn-0.177
 wrote-0.171
 fought-0.170
 rode-0.167
Token was
Token ID373
Feature activation+2.19e-03

Pos logit contributions
ked+0.113
 argued+0.107
ged+0.107
pped+0.100
oped+0.100
Neg logit contributions
 new-0.185
 sight-0.177
 big-0.176
 world-0.176
 old-0.173
Token for
Token ID329
Feature activation-2.29e+00

Pos logit contributions
 new+0.000
 sight+0.000
 old+0.000
 big+0.000
 world+0.000
Neg logit contributions
 argued-0.000
 could-0.000
 fought-0.000
 rode-0.000
 wouldn-0.000
Token.
Token ID13
Feature activation-2.76e+00

Pos logit contributions
 wouldn+0.046
 could+0.045
 argued+0.045
 rode+0.044
 fought+0.044
Neg logit contributions
 new-0.045
 sight-0.044
 old-0.040
 big-0.040
 world-0.039
Token Was
Token ID8920
Feature activation-2.48e+00

Pos logit contributions
 wouldn+0.052
 fought+0.051
 argued+0.050
 could+0.049
 managed+0.047
Neg logit contributions
 sight-0.057
 new-0.057
 sun-0.052
 world-0.051
 big-0.051
Token it
Token ID340
Feature activation-1.07e+00

Pos logit contributions
 argued+0.035
 fought+0.033
 could+0.032
 wouldn+0.032
ked+0.031
Neg logit contributions
 new-0.064
 sight-0.062
 big-0.060
 old-0.058
 view-0.057
Token to
Token ID284
Feature activation+4.84e-01

Pos logit contributions
 could+0.003
 wouldn+0.003
 fought+0.003
 argued+0.003
 rode+0.002
Neg logit contributions
 sight-0.005
 new-0.005
 elderly-0.005
 old-0.005
 view-0.005
Token shrink
Token ID22085
Feature activation+8.56e-01

Pos logit contributions
 sight+0.008
 new+0.008
 old+0.007
 world+0.007
 elderly+0.007
Neg logit contributions
 could-0.011
 wouldn-0.011
 argued-0.011
 fought-0.011
 rode-0.010
Token!
Token ID0
Feature activation-1.91e+00

Pos logit contributions
 sight+0.019
 new+0.019
 elderly+0.017
 future+0.017
 view+0.017
Neg logit contributions
 could-0.015
 wouldn-0.014
 fought-0.013
 argued-0.013
es-0.013
Token Everywhere
Token ID48201
Feature activation-1.35e+00

Pos logit contributions
 could+0.031
 wouldn+0.031
 argued+0.029
 fought+0.029
 managed+0.028
Neg logit contributions
 sight-0.045
 new-0.044
 big-0.040
 old-0.040
 world-0.040
Token the
Token ID262
Feature activation+9.89e+00

Pos logit contributions
 argued+0.023
ked+0.023
ched+0.022
icked+0.022
ppy+0.022
Neg logit contributions
 new-0.033
 sight-0.032
 big-0.030
 old-0.030
 dark-0.029
Token tide
Token ID20013
Feature activation-7.96e+00

Pos logit contributions
 former+0.126
igsaw+0.121
 purpose+0.117
andel+0.114
err+0.112
Neg logit contributions
 could-0.261
 let-0.230
 reached-0.224
 wouldn-0.224
 tried-0.224
Token went
Token ID1816
Feature activation-8.94e-01

Pos logit contributions
uable+0.100
ked+0.098
 Leave+0.097
ppy+0.097
pping+0.096
Neg logit contributions
 sight-0.191
 new-0.187
 big-0.183
 sun-0.179
 world-0.177
Token it
Token ID340
Feature activation-3.94e+00

Pos logit contributions
 could+0.016
 argued+0.016
 wouldn+0.016
 fought+0.016
 rode+0.016
Neg logit contributions
 new-0.019
 sight-0.019
 old-0.017
 world-0.017
 big-0.017
Token seemed
Token ID3947
Feature activation+9.71e-01

Pos logit contributions
 argued+0.053
ked+0.053
uable+0.053
 Leave+0.052
oped+0.051
Neg logit contributions
 new-0.097
 big-0.095
 sight-0.092
 dark-0.090
 old-0.087
Token to
Token ID284
Feature activation+1.94e-01

Pos logit contributions
 sight+0.020
 new+0.020
 big+0.018
 old+0.018
 world+0.018
Neg logit contributions
 could-0.018
 wouldn-0.018
 argued-0.018
 fought-0.017
 rode-0.017
Token get
Token ID651
Feature activation+1.15e+00

Pos logit contributions
 sight+0.004
 new+0.004
 big+0.004
 view+0.003
 old+0.003
Neg logit contributions
 could-0.004
 wouldn-0.004
es-0.004
 argued-0.004
 rode-0.004
Tokenpa
Token ID8957
Feature activation-5.74e+00

Pos logit contributions
 sight+0.057
 new+0.056
 old+0.050
 world+0.049
 elderly+0.049
Neg logit contributions
 could-0.068
 wouldn-0.066
 fought-0.064
 argued-0.064
 rode-0.062
Token said
Token ID531
Feature activation-3.18e+00

Pos logit contributions
ked+0.073
uable+0.073
pped+0.072
 Leave+0.072
ged+0.071
Neg logit contributions
 new-0.134
 old-0.131
 big-0.129
 little-0.127
 elderly-0.125
Token that
Token ID326
Feature activation-1.55e+00

Pos logit contributions
 argued+0.063
 fought+0.061
 rode+0.061
oped+0.060
lets+0.060
Neg logit contributions
 new-0.064
 big-0.058
 old-0.057
 sight-0.054
 little-0.054
Token if
Token ID611
Feature activation-1.67e+00

Pos logit contributions
ked+0.023
 argued+0.023
 fought+0.023
 could+0.022
oped+0.022
Neg logit contributions
 new-0.038
 sight-0.036
 big-0.034
 old-0.034
 world-0.034
Token the
Token ID262
Feature activation+1.02e+01

Pos logit contributions
 could+0.023
 argued+0.023
ched+0.023
 fought+0.022
 wouldn+0.022
Neg logit contributions
 new-0.043
 sight-0.042
 old-0.039
 big-0.039
 elderly-0.038
Token kid
Token ID5141
Feature activation-7.95e+00

Pos logit contributions
 sight+0.162
andel+0.152
igsaw+0.150
 new+0.148
 view+0.143
Neg logit contributions
 could-0.218
 argued-0.204
 wouldn-0.202
 fought-0.201
 let-0.196
Token wanted
Token ID2227
Feature activation-6.48e-01

Pos logit contributions
ked+0.093
ged+0.093
pped+0.092
oped+0.088
es+0.087
Neg logit contributions
 new-0.201
 sight-0.191
 big-0.189
 old-0.188
 world-0.180
Token to
Token ID284
Feature activation-8.12e-01

Pos logit contributions
 could+0.011
 wouldn+0.010
 argued+0.010
 fought+0.010
 rode+0.010
Neg logit contributions
 new-0.015
 sight-0.015
 big-0.014
 old-0.014
 view-0.013
Token go
Token ID467
Feature activation-1.10e+00

Pos logit contributions
 could+0.015
 wouldn+0.014
es+0.014
 argued+0.014
 fought+0.014
Neg logit contributions
 new-0.017
 sight-0.017
 big-0.016
 view-0.015
 old-0.015
Token outside
Token ID2354
Feature activation-2.93e+00

Pos logit contributions
 could+0.021
 wouldn+0.021
 argued+0.020
 fought+0.020
 rode+0.019
Neg logit contributions
 new-0.022
 sight-0.022
 big-0.020
 old-0.020
 view-0.020
Token,
Token ID11
Feature activation-2.64e+00

Pos logit contributions
 could+0.052
 argued+0.051
 wouldn+0.051
 fought+0.051
ched+0.050
Neg logit contributions
 new-0.064
 sight-0.063
 big-0.057
 old-0.056
 world-0.056
Token helpful
Token ID7613
Feature activation+7.39e-02

Pos logit contributions
 sight+0.018
 new+0.016
 view+0.015
 world+0.014
 biggest+0.014
Neg logit contributions
 could-0.042
 wouldn-0.041
 fought-0.040
 argued-0.039
 rode-0.039
Token.
Token ID13
Feature activation-2.54e+00

Pos logit contributions
 new+0.002
 sight+0.001
 old+0.001
 big+0.001
 world+0.001
Neg logit contributions
 could-0.001
 argued-0.001
 wouldn-0.001
 rode-0.001
 fought-0.001
Token Every
Token ID3887
Feature activation+2.52e+00

Pos logit contributions
 could+0.042
 wouldn+0.041
 argued+0.040
 fought+0.040
 rode+0.038
Neg logit contributions
 sight-0.058
 new-0.058
 old-0.052
 big-0.052
 world-0.052
Token day
Token ID1110
Feature activation-2.16e+00

Pos logit contributions
 sight+0.021
 new+0.020
 big+0.015
 old+0.015
 view+0.014
Neg logit contributions
 could-0.079
 wouldn-0.077
 argued-0.076
 fought-0.076
 rode-0.075
Token the
Token ID262
Feature activation+8.20e+00

Pos logit contributions
 argued+0.040
ched+0.039
ked+0.039
oped+0.038
 wouldn+0.038
Neg logit contributions
 new-0.046
 sight-0.045
 big-0.042
 sun-0.041
 old-0.041
Token doctor
Token ID6253
Feature activation-7.95e+00

Pos logit contributions
 sight+0.131
 new+0.116
 view+0.109
err+0.102
 world+0.101
Neg logit contributions
 could-0.193
 wouldn-0.185
 fought-0.180
 rode-0.179
 argued-0.176
Token asked
Token ID1965
Feature activation-3.10e+00

Pos logit contributions
ked+0.090
ged+0.089
pped+0.088
 argued+0.087
pping+0.085
Neg logit contributions
 new-0.203
 old-0.197
 big-0.196
 elderly-0.181
 little-0.181
Token the
Token ID262
Feature activation+7.63e+00

Pos logit contributions
 could+0.052
 wouldn+0.050
 argued+0.050
 fought+0.050
 rode+0.050
Neg logit contributions
 new-0.071
 sight-0.068
 old-0.064
 big-0.064
 elderly-0.062
Token patient
Token ID5827
Feature activation-2.93e-01

Pos logit contributions
 sight+0.129
 new+0.110
 view+0.108
igsaw+0.105
err+0.101
Neg logit contributions
 could-0.173
 wouldn-0.163
 fought-0.158
 argued-0.155
 rode-0.154
Token different
Token ID1180
Feature activation+3.49e+00

Pos logit contributions
 could+0.006
 wouldn+0.006
 argued+0.006
 fought+0.006
 rode+0.006
Neg logit contributions
 new-0.006
 sight-0.005
 old-0.005
 world-0.005
 big-0.005
Token questions
Token ID2683
Feature activation-4.72e+00

Pos logit contributions
 sight+0.048
 new+0.046
 view+0.039
 big+0.039
 world+0.038
Neg logit contributions
 could-0.090
 wouldn-0.088
 argued-0.086
 fought-0.086
 rode-0.084
Token,
Token ID11
Feature activation-2.00e+00

Pos logit contributions
 could+0.040
 argued+0.039
 wouldn+0.039
 fought+0.039
 rode+0.038
Neg logit contributions
 sight-0.045
 new-0.045
 big-0.040
 old-0.040
 view-0.039
Token no
Token ID645
Feature activation+2.67e+00

Pos logit contributions
es+0.038
 fought+0.038
icked+0.037
 could+0.037
ched+0.037
Neg logit contributions
 new-0.040
 sight-0.040
 old-0.036
 big-0.036
 world-0.035
Token matter
Token ID2300
Feature activation-1.57e+00

Pos logit contributions
 sight+0.046
 new+0.045
 biggest+0.044
 world+0.043
 elderly+0.043
Neg logit contributions
 could-0.053
 wouldn-0.052
 argued-0.052
 fought-0.051
 let-0.050
Token how
Token ID703
Feature activation+6.21e-01

Pos logit contributions
 could+0.023
 argued+0.023
 wouldn+0.022
 fought+0.022
 rode+0.022
Neg logit contributions
 new-0.039
 sight-0.038
 old-0.035
 big-0.035
 world-0.034
Token the
Token ID262
Feature activation+9.70e+00

Pos logit contributions
 sight+0.008
 new+0.007
 old+0.006
 big+0.006
 view+0.006
Neg logit contributions
 could-0.017
 wouldn-0.017
 argued-0.016
 fought-0.016
 rode-0.016
Token monster
Token ID9234
Feature activation-7.87e+00

Pos logit contributions
igsaw+0.155
andel+0.148
 sight+0.147
 new+0.145
ummy+0.141
Neg logit contributions
 could-0.214
 fought-0.197
 let-0.194
 wouldn-0.188
 hid-0.188
Token yelled
Token ID22760
Feature activation-2.83e+00

Pos logit contributions
ged+0.085
ked+0.083
 Leave+0.081
icked+0.080
sels+0.080
Neg logit contributions
 new-0.197
 sight-0.193
 big-0.190
 old-0.186
 little-0.181
Token.
Token ID13
Feature activation-2.29e+00

Pos logit contributions
 fought+0.059
 rode+0.058
 argued+0.058
 could+0.055
 wrote+0.054
Neg logit contributions
 new-0.056
 sight-0.052
 big-0.050
 old-0.048
 sun-0.046
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
 wouldn+0.041
 could+0.041
 fought+0.040
 argued+0.040
 rode+0.039
Neg logit contributions
 new-0.049
 sight-0.049
 old-0.044
 world-0.044
 elderly-0.044
Token
Token ID198
Feature activation+9.50e-01

Pos logit contributions
 sight+0.022
 new+0.022
 big+0.020
 old+0.020
 world+0.020
Neg logit contributions
 could-0.019
 wouldn-0.019
 argued-0.018
 fought-0.018
 rode-0.017
TokenThe
Token ID464
Feature activation+9.96e+00

Pos logit contributions
 new+0.018
 sight+0.018
 big+0.016
 world+0.016
 old+0.016
Neg logit contributions
 could-0.019
 wouldn-0.019
 fought-0.018
 argued-0.018
 rode-0.018
Token the
Token ID262
Feature activation+1.09e+01

Pos logit contributions
 sight+0.035
 new+0.034
 biggest+0.032
 elderly+0.032
 window+0.032
Neg logit contributions
 could-0.021
 fought-0.018
es-0.018
 wouldn-0.018
 argued-0.018
Token air
Token ID1633
Feature activation-1.59e+00

Pos logit contributions
 new+0.079
 sight+0.077
 elderly+0.064
err+0.060
andel+0.058
Neg logit contributions
 could-0.329
 wouldn-0.294
 shared-0.292
 wrote-0.291
 stood-0.290
Token.
Token ID13
Feature activation-2.32e+00

Pos logit contributions
 could+0.029
 wouldn+0.029
 fought+0.028
 argued+0.028
 rode+0.027
Neg logit contributions
 sight-0.034
 new-0.034
 big-0.031
 world-0.030
 view-0.030
Token Everywhere
Token ID48201
Feature activation-2.10e+00

Pos logit contributions
 could+0.037
 wouldn+0.037
 argued+0.036
 fought+0.036
 managed+0.034
Neg logit contributions
 sight-0.055
 new-0.054
 old-0.049
 sun-0.049
 big-0.049
Token the
Token ID262
Feature activation+1.03e+01

Pos logit contributions
 argued+0.037
ked+0.037
icked+0.036
ppy+0.035
ched+0.034
Neg logit contributions
 new-0.050
 sight-0.048
 old-0.046
 big-0.045
 dark-0.043
Token creature
Token ID7185
Feature activation-7.87e+00

Pos logit contributions
andel+0.136
 purpose+0.134
igsaw+0.133
 future+0.133
arl+0.132
Neg logit contributions
 could-0.242
 tried-0.209
 let-0.208
 fought-0.207
 hid-0.205
Token had
Token ID550
Feature activation+1.35e+00

Pos logit contributions
ked+0.105
uable+0.100
ged+0.097
ched+0.097
icked+0.097
Neg logit contributions
 new-0.196
 sight-0.191
 big-0.186
 world-0.181
 old-0.179
Token been
Token ID587
Feature activation+1.51e+00

Pos logit contributions
 sight+0.028
 new+0.027
 big+0.024
 view+0.024
 old+0.024
Neg logit contributions
 could-0.026
 wouldn-0.026
 fought-0.025
 argued-0.025
 rode-0.024
Token was
Token ID373
Feature activation+9.59e-01

Pos logit contributions
 new+0.033
 sight+0.033
 big+0.030
 old+0.029
 world+0.029
Neg logit contributions
 could-0.028
 wouldn-0.027
 argued-0.026
 fought-0.026
 rode-0.025
Token full
Token ID1336
Feature activation+8.26e-01

Pos logit contributions
 new+0.019
 sight+0.019
 big+0.017
 old+0.016
 world+0.016
Neg logit contributions
 could-0.019
 wouldn-0.019
 argued-0.019
 fought-0.019
 rode-0.018
Token of
Token ID286
Feature activation+4.62e+00

Pos logit contributions
 new+0.015
 sight+0.015
 old+0.013
 big+0.013
 world+0.013
Neg logit contributions
 wouldn-0.018
 argued-0.018
 could-0.018
 fought-0.018
 rode-0.017
Token
Token ID220
Feature activation-4.99e-01

Pos logit contributions
 wouldn+0.050
 fought+0.049
 could+0.049
 argued+0.049
 managed+0.046
Neg logit contributions
 new-0.060
 sight-0.060
 sun-0.053
 big-0.053
 old-0.053
Token
Token ID198
Feature activation+3.86e-01

Pos logit contributions
 argued+0.011
 fought+0.011
 wouldn+0.011
 could+0.010
 snatched+0.010
Neg logit contributions
 new-0.009
 sight-0.009
 big-0.008
 old-0.008
 world-0.008
Token
Token ID198
Feature activation+6.21e-01

Pos logit contributions
 new+0.008
 sight+0.008
 big+0.007
 old+0.007
 world+0.007
Neg logit contributions
 wouldn-0.008
 could-0.008
 argued-0.007
 fought-0.007
 snatched-0.007
TokenThe
Token ID464
Feature activation+8.39e+00

Pos logit contributions
 new+0.011
 sight+0.011
 big+0.010
 world+0.010
 elderly+0.010
Neg logit contributions
 wouldn-0.013
 could-0.013
 argued-0.013
 fought-0.013
ked-0.012
Token massage
Token ID26900
Feature activation-4.14e+00

Pos logit contributions
 sight+0.132
 new+0.123
 view+0.109
 biggest+0.106
 future+0.106
Neg logit contributions
 could-0.200
 wouldn-0.190
 rode-0.183
 argued-0.182
 let-0.178
Token therapist
Token ID24636
Feature activation-7.86e+00

Pos logit contributions
 fought+0.065
 argued+0.064
ked+0.064
 could+0.063
icked+0.062
Neg logit contributions
 new-0.090
 sight-0.090
 old-0.084
 big-0.083
 sun-0.082
Token asked
Token ID1965
Feature activation-3.19e+00

Pos logit contributions
uable+0.099
 fought+0.097
ked+0.096
pping+0.095
 Leave+0.095
Neg logit contributions
 new-0.191
 big-0.183
 old-0.177
 little-0.177
 sight-0.176
Token him
Token ID683
Feature activation-3.44e+00

Pos logit contributions
 could+0.049
 fought+0.048
 argued+0.048
 rode+0.048
 wouldn+0.047
Neg logit contributions
 new-0.077
 sight-0.073
 big-0.069
 old-0.069
 world-0.066
Token to
Token ID284
Feature activation-1.85e+00

Pos logit contributions
 fought+0.061
 argued+0.061
 rode+0.058
oped+0.057
 snatched+0.056
Neg logit contributions
 new-0.081
 big-0.071
 old-0.071
 sight-0.070
 biggest-0.066
Token lay
Token ID3830
Feature activation-1.10e+00

Pos logit contributions
 could+0.030
 wouldn+0.029
 fought+0.028
 argued+0.028
es+0.027
Neg logit contributions
 new-0.042
 sight-0.042
 big-0.039
 old-0.038
 view-0.038
Token down
Token ID866
Feature activation-2.56e+00

Pos logit contributions
 could+0.019
 fought+0.018
 argued+0.018
 wouldn+0.018
 rode+0.018
Neg logit contributions
 sight-0.025
 new-0.024
 old-0.022
 big-0.022
 world-0.021
Token of
Token ID286
Feature activation+3.04e+00

Pos logit contributions
 sight+0.028
 new+0.026
 biggest+0.024
 view+0.024
 future+0.024
Neg logit contributions
 could-0.029
 wouldn-0.027
 fought-0.026
 argued-0.026
es-0.026
Token questions
Token ID2683
Feature activation-4.14e+00

Pos logit contributions
 sight+0.052
 entire+0.045
 whole+0.045
 biggest+0.044
igsaw+0.044
Neg logit contributions
 could-0.066
 wouldn-0.061
 fought-0.059
 argued-0.059
es-0.058
Token,
Token ID11
Feature activation-2.30e+00

Pos logit contributions
 argued+0.080
 fought+0.079
 snatched+0.077
 rode+0.077
 wouldn+0.076
Neg logit contributions
 new-0.087
 sight-0.080
 old-0.077
 big-0.076
 world-0.074
Token while
Token ID981
Feature activation-2.19e+00

Pos logit contributions
 could+0.040
 argued+0.039
ched+0.039
 wouldn+0.039
 fought+0.039
Neg logit contributions
 new-0.050
 sight-0.049
 old-0.045
 big-0.045
 elderly-0.044
Token the
Token ID262
Feature activation+9.41e+00

Pos logit contributions
 could+0.032
 wouldn+0.032
 argued+0.031
ched+0.031
 fought+0.030
Neg logit contributions
 new-0.054
 sight-0.054
 big-0.049
 old-0.049
 elderly-0.048
Token wizard
Token ID18731
Feature activation-7.84e+00

Pos logit contributions
 sight+0.138
ummy+0.135
igsaw+0.133
 new+0.124
err+0.123
Neg logit contributions
 could-0.218
 wouldn-0.206
 rode-0.189
 let-0.189
 fought-0.189
Token laughed
Token ID13818
Feature activation-2.03e+00

Pos logit contributions
ked+0.086
pped+0.086
ged+0.085
 Leave+0.084
oped+0.084
Neg logit contributions
 new-0.200
 sight-0.190
 old-0.187
 big-0.186
 elderly-0.178
Token and
Token ID290
Feature activation-3.77e+00

Pos logit contributions
 fought+0.040
 argued+0.039
 rode+0.038
 could+0.038
 wouldn+0.037
Neg logit contributions
 new-0.043
 sight-0.041
 big-0.038
 old-0.037
 dark-0.036
Token told
Token ID1297
Feature activation-1.27e+00

Pos logit contributions
es+0.047
ched+0.047
oped+0.046
pped+0.046
pping+0.045
Neg logit contributions
 new-0.099
 sight-0.098
 old-0.091
 big-0.090
 sun-0.088
Token him
Token ID683
Feature activation-3.78e+00

Pos logit contributions
 could+0.014
 wouldn+0.013
 argued+0.012
 fought+0.012
 rode+0.012
Neg logit contributions
 sight-0.036
 new-0.036
 world-0.033
 view-0.033
 big-0.033
Token stories
Token ID3923
Feature activation-4.66e+00

Pos logit contributions
oped+0.066
 argued+0.066
 fought+0.064
ked+0.063
ged+0.063
Neg logit contributions
 new-0.090
 big-0.079
 old-0.079
 sight-0.078
 large-0.072
Token from
Token ID422
Feature activation+1.12e+00

Pos logit contributions
 could+0.050
 wouldn+0.049
 argued+0.048
 fought+0.047
 rode+0.046
Neg logit contributions
 new-0.058
 sight-0.057
 big-0.052
 old-0.052
 world-0.051
Token danger
Token ID3514
Feature activation-2.37e+00

Pos logit contributions
 sight+0.023
 new+0.022
 biggest+0.021
 world+0.020
 view+0.020
Neg logit contributions
 could-0.020
 wouldn-0.019
 fought-0.019
 argued-0.019
 rode-0.018
Token.
Token ID13
Feature activation-1.64e+00

Pos logit contributions
 argued+0.052
 wouldn+0.051
 fought+0.050
 snatched+0.050
 rode+0.049
Neg logit contributions
 new-0.042
 sight-0.040
 old-0.038
 big-0.038
 world-0.037
Token Everywhere
Token ID48201
Feature activation-1.69e+00

Pos logit contributions
 could+0.028
 wouldn+0.026
 fought+0.025
 argued+0.025
 rode+0.025
Neg logit contributions
 sight-0.038
 new-0.037
 big-0.034
 old-0.034
 view-0.034
Token the
Token ID262
Feature activation+9.23e+00

Pos logit contributions
 argued+0.028
ked+0.027
ched+0.026
icked+0.026
oped+0.025
Neg logit contributions
 new-0.042
 sight-0.041
 big-0.039
 old-0.039
 dark-0.038
Token knight
Token ID22062
Feature activation-7.84e+00

Pos logit contributions
 sight+0.191
 new+0.173
ummy+0.172
err+0.172
igsaw+0.169
Neg logit contributions
 could-0.181
 rode-0.159
 wrote-0.154
 fought-0.153
 wouldn-0.151
Token went
Token ID1816
Feature activation-9.75e-01

Pos logit contributions
ked+0.103
ched+0.102
 Leave+0.101
icked+0.101
sels+0.100
Neg logit contributions
 new-0.180
 sight-0.178
 big-0.174
 world-0.174
 old-0.171
Token,
Token ID11
Feature activation-2.02e+00

Pos logit contributions
 argued+0.019
 wouldn+0.018
 could+0.018
 fought+0.018
ched+0.018
Neg logit contributions
 new-0.021
 sight-0.020
 old-0.019
 big-0.019
 world-0.018
Token he
Token ID339
Feature activation-4.85e+00

Pos logit contributions
 could+0.031
 wouldn+0.030
 argued+0.029
 fought+0.029
ched+0.028
Neg logit contributions
 sight-0.049
 new-0.049
 big-0.044
 old-0.044
 world-0.044
Token was
Token ID373
Feature activation+6.33e-01

Pos logit contributions
 Leave+0.063
icked+0.063
ked+0.061
 argued+0.061
ged+0.061
Neg logit contributions
 new-0.116
 big-0.113
 old-0.111
 sight-0.109
 world-0.108
Token careful
Token ID8161
Feature activation-1.15e+00

Pos logit contributions
 new+0.009
 big+0.008
 sight+0.008
 old+0.008
 elderly+0.008
Neg logit contributions
 argued-0.016
 wouldn-0.016
 could-0.016
oped-0.016
ched-0.015
Token colorful
Token ID20239
Feature activation+7.95e+00

Pos logit contributions
 sight+0.098
andel+0.088
ummy+0.087
 elderly+0.086
err+0.086
Neg logit contributions
 could-0.235
 wouldn-0.234
 fought-0.226
 argued-0.223
 rode-0.221
Token day
Token ID1110
Feature activation-2.00e+00

Pos logit contributions
 sight+0.103
 new+0.100
 elderly+0.090
 biggest+0.088
 future+0.085
Neg logit contributions
 could-0.198
 wouldn-0.189
 argued-0.187
 fought-0.187
 rode-0.182
Token.
Token ID13
Feature activation-3.01e+00

Pos logit contributions
 wouldn+0.038
 could+0.037
 fought+0.037
 argued+0.037
 rode+0.036
Neg logit contributions
 new-0.041
 sight-0.040
 big-0.037
 old-0.036
 world-0.036
Token Everywhere
Token ID48201
Feature activation-2.56e+00

Pos logit contributions
 wouldn+0.052
 could+0.051
 fought+0.050
 argued+0.050
ched+0.048
Neg logit contributions
 sight-0.067
 new-0.066
 world-0.060
 old-0.060
 elderly-0.060
Token the
Token ID262
Feature activation+9.31e+00

Pos logit contributions
 argued+0.044
ched+0.044
ked+0.044
icked+0.042
pping+0.042
Neg logit contributions
 new-0.060
 sight-0.058
 old-0.054
 big-0.054
 dark-0.051
Token person
Token ID1048
Feature activation-7.83e+00

Pos logit contributions
igsaw+0.152
err+0.145
 former+0.142
 ey+0.139
ummy+0.138
Neg logit contributions
 could-0.188
 let-0.170
 fought-0.167
 tried-0.166
 decided-0.163
Token looked
Token ID3114
Feature activation-1.69e+00

Pos logit contributions
ked+0.097
ched+0.089
uable+0.088
pped+0.086
ged+0.085
Neg logit contributions
 new-0.197
 big-0.189
 sight-0.187
 old-0.186
 world-0.184
Token,
Token ID11
Feature activation-2.69e+00

Pos logit contributions
 argued+0.030
 fought+0.030
 could+0.029
 wouldn+0.029
 rode+0.029
Neg logit contributions
 new-0.038
 sight-0.035
 big-0.034
 old-0.033
 dark-0.031
Token there
Token ID612
Feature activation-4.93e+00

Pos logit contributions
 could+0.043
 wouldn+0.042
 argued+0.042
 fought+0.042
ched+0.040
Neg logit contributions
 new-0.063
 sight-0.062
 big-0.057
 old-0.056
 world-0.055
Token were
Token ID547
Feature activation+7.66e-01

Pos logit contributions
ked+0.071
 snatched+0.069
ppy+0.067
uable+0.067
pped+0.066
Neg logit contributions
 new-0.120
 big-0.111
 sight-0.111
 world-0.107
 old-0.107
Token bright
Token ID6016
Feature activation+4.30e+00

Pos logit contributions
 sight+0.013
 new+0.012
 big+0.011
 old+0.011
 view+0.011
Neg logit contributions
 could-0.018
 wouldn-0.017
 argued-0.017
 fought-0.017
 rode-0.016
Token fix
Token ID4259
Feature activation-6.59e-01

Pos logit contributions
 could+0.048
 wouldn+0.046
 argued+0.045
 fought+0.044
es+0.043
Neg logit contributions
 new-0.063
 sight-0.063
 big-0.057
 old-0.057
 elderly-0.056
Token it
Token ID340
Feature activation-1.82e+00

Pos logit contributions
 could+0.010
 wouldn+0.010
 argued+0.009
 fought+0.009
 rode+0.009
Neg logit contributions
 sight-0.016
 new-0.016
 old-0.014
 big-0.014
 view-0.014
Token.
Token ID13
Feature activation-3.25e+00

Pos logit contributions
 could+0.037
 argued+0.036
 rode+0.036
 fought+0.036
 snatched+0.035
Neg logit contributions
 new-0.036
 sight-0.032
 big-0.032
 old-0.031
 large-0.029
Token The
Token ID383
Feature activation+8.42e+00

Pos logit contributions
 fought+0.060
 wouldn+0.060
 argued+0.059
 could+0.059
ched+0.057
Neg logit contributions
 new-0.069
 sight-0.067
 old-0.062
 elderly-0.060
 sun-0.060
Token pl
Token ID458
Feature activation-1.45e-01

Pos logit contributions
 sight+0.159
igsaw+0.152
 chamber+0.136
err+0.136
 view+0.134
Neg logit contributions
 could-0.174
 wouldn-0.165
 fought-0.154
 let-0.150
 managed-0.149
Tokenumber
Token ID4494
Feature activation-7.75e+00

Pos logit contributions
 could+0.003
 wouldn+0.003
 argued+0.003
 fought+0.003
 rode+0.003
Neg logit contributions
 sight-0.003
 new-0.003
 elderly-0.002
 big-0.002
 old-0.002
Token came
Token ID1625
Feature activation-9.26e-01

Pos logit contributions
 argued+0.098
 Leave+0.096
uable+0.095
oped+0.095
ged+0.094
Neg logit contributions
 new-0.187
 big-0.183
 old-0.178
 sight-0.171
 large-0.167
Token and
Token ID290
Feature activation-4.25e+00

Pos logit contributions
 could+0.017
 wouldn+0.017
 argued+0.016
 fought+0.016
 rode+0.016
Neg logit contributions
 sight-0.020
 new-0.019
 big-0.017
 old-0.017
 view-0.017
Token fixed
Token ID5969
Feature activation-7.35e-01

Pos logit contributions
 argued+0.051
 Leave+0.049
 could+0.047
 wouldn+0.046
pping+0.046
Neg logit contributions
 new-0.113
 big-0.107
 old-0.105
 sight-0.103
 elderly-0.102
Token the
Token ID262
Feature activation+1.13e+01

Pos logit contributions
 could+0.013
 wouldn+0.012
 argued+0.012
 fought+0.012
 rode+0.011
Neg logit contributions
 sight-0.017
 new-0.017
 big-0.015
 old-0.015
 view-0.015
Token f
Token ID277
Feature activation+1.62e+00

Pos logit contributions
 sight+0.193
igsaw+0.177
 new+0.171
err+0.168
 view+0.164
Neg logit contributions
 could-0.240
 wouldn-0.233
 fought-0.210
 snatched-0.206
 managed-0.204
Token to
Token ID284
Feature activation+5.70e-01

Pos logit contributions
 sight+0.061
 new+0.059
 view+0.054
 world+0.053
 future+0.053
Neg logit contributions
 could-0.066
 wouldn-0.063
 argued-0.061
 fought-0.060
 rode-0.059
Token the
Token ID262
Feature activation+9.92e+00

Pos logit contributions
 sight+0.013
 new+0.013
 world+0.012
 biggest+0.012
 view+0.011
Neg logit contributions
 could-0.009
 wouldn-0.009
 argued-0.009
 fought-0.009
 rode-0.008
Token hospital
Token ID4436
Feature activation-3.90e+00

Pos logit contributions
 sight+0.132
 new+0.113
err+0.113
andel+0.106
ummy+0.106
Neg logit contributions
 could-0.240
 wouldn-0.226
 snatched-0.223
 fought-0.217
 rode-0.216
Token.
Token ID13
Feature activation-2.41e+00

Pos logit contributions
 could+0.077
 fought+0.075
 argued+0.075
ched+0.075
 rode+0.074
Neg logit contributions
 new-0.077
 sight-0.073
 old-0.070
 big-0.067
 elderly-0.065
Token The
Token ID383
Feature activation+8.40e+00

Pos logit contributions
 fought+0.042
 wouldn+0.042
 argued+0.042
 could+0.041
 managed+0.039
Neg logit contributions
 sight-0.053
 new-0.052
 old-0.048
 elderly-0.048
 big-0.047
Token doctors
Token ID7519
Feature activation-7.71e+00

Pos logit contributions
 sight+0.153
igsaw+0.144
 new+0.134
 view+0.131
err+0.131
Neg logit contributions
 could-0.172
 wouldn-0.164
 rode-0.153
 fought-0.150
 snatched-0.147
Token and
Token ID290
Feature activation-1.21e-01

Pos logit contributions
uable+0.089
ked+0.084
 Leave+0.082
icked+0.082
ched+0.082
Neg logit contributions
 new-0.179
 big-0.177
 sight-0.174
 old-0.174
 little-0.172
Token nurses
Token ID20669
Feature activation-5.33e+00

Pos logit contributions
 could+0.003
 wouldn+0.003
 argued+0.003
 fought+0.003
 rode+0.003
Neg logit contributions
 sight-0.002
 new-0.002
 world-0.002
 view-0.002
 future-0.002
Token there
Token ID612
Feature activation-4.73e+00

Pos logit contributions
uable+0.063
 scored+0.061
 Leave+0.056
 snatched+0.055
ched+0.055
Neg logit contributions
 new-0.128
 big-0.123
 old-0.122
 sight-0.122
 little-0.119
Token were
Token ID547
Feature activation+9.61e-01

Pos logit contributions
 snatched+0.064
 rode+0.064
ched+0.063
ked+0.062
 fought+0.062
Neg logit contributions
 new-0.118
 sight-0.112
 big-0.108
 old-0.103
 world-0.102
Token very
Token ID845
Feature activation+1.99e+00

Pos logit contributions
 new+0.018
 sight+0.018
 big+0.016
 old+0.016
 elderly+0.016
Neg logit contributions
 could-0.020
 wouldn-0.019
 argued-0.019
 fought-0.019
 rode-0.019
Token park
Token ID3952
Feature activation-1.24e+00

Pos logit contributions
andel+0.103
err+0.099
 amount+0.093
 sight+0.091
 decision+0.090
Neg logit contributions
 could-0.177
 rode-0.175
 played-0.170
 shared-0.160
 stood-0.160
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 could+0.020
 argued+0.019
 wouldn+0.019
 fought+0.018
 rode+0.018
Neg logit contributions
 sight-0.029
 new-0.028
 elderly-0.025
 view-0.025
 future-0.025
Token Claire
Token ID20758
Feature activation-4.61e+00

Pos logit contributions
 could+0.020
 wouldn+0.019
 fought+0.019
 argued+0.018
 rode+0.018
Neg logit contributions
 sight-0.030
 new-0.030
 big-0.027
 old-0.027
 view-0.027
Token was
Token ID373
Feature activation+2.86e+00

Pos logit contributions
uable+0.075
 argued+0.074
ked+0.073
 fought+0.072
ched+0.072
Neg logit contributions
 new-0.103
 big-0.100
 old-0.097
 large-0.095
 sight-0.093
Token holding
Token ID4769
Feature activation+9.18e-01

Pos logit contributions
 sight+0.057
 new+0.055
 future+0.050
 view+0.050
 world+0.049
Neg logit contributions
 could-0.056
 wouldn-0.053
 argued-0.052
 fought-0.051
 rode-0.051
Token a
Token ID257
Feature activation+9.62e+00

Pos logit contributions
 sight+0.019
 future+0.017
 new+0.017
 world+0.017
 view+0.016
Neg logit contributions
 could-0.016
es-0.016
 argued-0.015
 wouldn-0.015
 stood-0.015
Token k
Token ID479
Feature activation+3.72e+00

Pos logit contributions
 sight+0.122
 future+0.121
err+0.118
 entire+0.115
umm+0.113
Neg logit contributions
 could-0.221
 snatched-0.211
 wouldn-0.208
 scored-0.204
 rode-0.202
Tokenite
Token ID578
Feature activation-8.22e-01

Pos logit contributions
 sight+0.097
 new+0.096
 future+0.091
 old+0.089
 view+0.087
Neg logit contributions
 could-0.050
oped-0.045
es-0.044
 snatched-0.042
 fought-0.042
Token that
Token ID326
Feature activation-2.50e+00

Pos logit contributions
 could+0.015
 wouldn+0.014
 fought+0.014
 argued+0.014
 rode+0.013
Neg logit contributions
 sight-0.018
 new-0.018
 big-0.016
 old-0.016
 world-0.016
Token had
Token ID550
Feature activation+2.28e-01

Pos logit contributions
 argued+0.033
 fought+0.033
 could+0.032
 wouldn+0.032
 rode+0.031
Neg logit contributions
 new-0.064
 sight-0.064
 big-0.059
 old-0.058
 world-0.057
Token smooth
Token ID7209
Feature activation+3.44e+00

Pos logit contributions
 sight+0.004
 new+0.004
 big+0.004
 old+0.004
 view+0.004
Neg logit contributions
 could-0.005
 wouldn-0.005
 fought-0.004
 argued-0.004
es-0.004
Token when
Token ID618
Feature activation-8.09e-01

Pos logit contributions
 could+0.035
 wouldn+0.034
 argued+0.032
 fought+0.032
 rode+0.031
Neg logit contributions
 sight-0.050
 new-0.049
 world-0.045
 old-0.045
 view-0.045
Token Lily
Token ID20037
Feature activation-3.97e+00

Pos logit contributions
 could+0.012
 wouldn+0.011
 fought+0.010
 argued+0.010
 managed+0.010
Neg logit contributions
 sight-0.020
 new-0.020
 world-0.019
 elderly-0.019
 biggest-0.019
Token's
Token ID338
Feature activation+5.08e+00

Pos logit contributions
ked+0.056
fed+0.055
uable+0.053
ged+0.053
icked+0.052
Neg logit contributions
 big-0.092
 new-0.090
 sight-0.089
 little-0.087
 world-0.085
Token mom
Token ID1995
Feature activation-3.75e+00

Pos logit contributions
 sight+0.082
err+0.074
andel+0.073
 view+0.069
igsaw+0.069
Neg logit contributions
 could-0.109
 let-0.102
 wouldn-0.100
 fought-0.099
 rode-0.099
Token saw
Token ID2497
Feature activation-2.93e-01

Pos logit contributions
 argued+0.047
uable+0.046
 wouldn+0.045
ked+0.045
ched+0.044
Neg logit contributions
 new-0.093
 big-0.092
 sight-0.088
 window-0.088
 little-0.085
Token the
Token ID262
Feature activation+1.06e+01

Pos logit contributions
 could+0.005
 wouldn+0.005
 argued+0.004
 fought+0.004
 rode+0.004
Neg logit contributions
 sight-0.007
 new-0.007
 world-0.006
 view-0.006
 big-0.006
Token broken
Token ID5445
Feature activation+8.24e+00

Pos logit contributions
 sight+0.114
igsaw+0.096
 new+0.093
err+0.092
 future+0.089
Neg logit contributions
 could-0.287
 wouldn-0.281
 argued-0.272
 fought-0.270
 managed-0.265
Token plate
Token ID7480
Feature activation-1.86e+00

Pos logit contributions
 sight+0.114
 new+0.111
 old+0.093
 big+0.093
 view+0.092
Neg logit contributions
 could-0.215
 wouldn-0.209
 argued-0.205
 fought-0.205
 rode-0.200
Token,
Token ID11
Feature activation-1.97e+00

Pos logit contributions
 could+0.035
 argued+0.035
 fought+0.034
 wouldn+0.034
 rode+0.034
Neg logit contributions
 new-0.038
 sight-0.037
 big-0.035
 old-0.034
 world-0.033
Token she
Token ID673
Feature activation-4.19e+00

Pos logit contributions
 could+0.027
 wouldn+0.025
 argued+0.023
 fought+0.023
 rode+0.022
Neg logit contributions
 sight-0.052
 new-0.051
 world-0.047
 view-0.047
 future-0.047
Token didn
Token ID1422
Feature activation+4.84e-01

Pos logit contributions
ched+0.055
ked+0.054
uable+0.054
 argued+0.053
sels+0.053
Neg logit contributions
 new-0.103
 big-0.101
 little-0.094
 sight-0.094
 old-0.094
Token saw
Token ID2497
Feature activation+2.76e-01

Pos logit contributions
ked+0.077
 argued+0.074
ched+0.072
uable+0.072
icked+0.070
Neg logit contributions
 new-0.091
 big-0.088
 old-0.082
 sight-0.082
 large-0.081
Token a
Token ID257
Feature activation+9.79e+00

Pos logit contributions
 sight+0.005
 new+0.005
 world+0.005
 future+0.005
 entire+0.005
Neg logit contributions
 could-0.006
 wouldn-0.005
 rode-0.005
 fought-0.005
 argued-0.005
Token strange
Token ID6283
Feature activation+8.70e+00

Pos logit contributions
 sight+0.087
err+0.084
 future+0.083
ags+0.075
 surroundings+0.075
Neg logit contributions
 could-0.278
 wouldn-0.258
 snatched-0.256
 let-0.254
 scored-0.248
Token boy
Token ID2933
Feature activation-8.04e-01

Pos logit contributions
 new+0.110
 biggest+0.109
 sight+0.107
 future+0.105
 elderly+0.102
Neg logit contributions
 could-0.203
 wouldn-0.198
 fought-0.189
 snatched-0.188
 wrote-0.187
Token in
Token ID287
Feature activation+3.06e+00

Pos logit contributions
 wouldn+0.017
 could+0.016
 argued+0.016
 managed+0.016
ched+0.015
Neg logit contributions
 new-0.015
 sight-0.014
 world-0.013
 big-0.013
 old-0.013
Token the
Token ID262
Feature activation+9.21e+00

Pos logit contributions
 new+0.071
 sight+0.069
 biggest+0.068
 future+0.065
 world+0.065
Neg logit contributions
 could-0.045
 fought-0.042
 rode-0.042
 argued-0.041
 snatched-0.040
Token corner
Token ID5228
Feature activation-1.20e+00

Pos logit contributions
igsaw+0.126
andel+0.116
 sight+0.116
err+0.113
ummy+0.108
Neg logit contributions
 could-0.240
 rode-0.213
 wouldn-0.212
 hid-0.206
 stole-0.203
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
 could+0.026
 wouldn+0.026
 argued+0.025
 fought+0.025
 rode+0.024
Neg logit contributions
 sight-0.021
 new-0.021
 old-0.019
 big-0.019
 world-0.019
Token The
Token ID383
Feature activation+9.71e+00

Pos logit contributions
 wouldn+0.029
 could+0.028
 argued+0.028
 fought+0.027
ched+0.026
Neg logit contributions
 sight-0.038
 new-0.038
 big-0.035
 old-0.035
 sun-0.034
Token boy
Token ID2933
Feature activation-3.10e+00

Pos logit contributions
andel+0.207
 former+0.195
igsaw+0.195
 purpose+0.187
ummy+0.184
Neg logit contributions
 could-0.189
 did-0.157
 rode-0.154
 fought-0.153
 wouldn-0.150
Token looked
Token ID3114
Feature activation+1.44e+00

Pos logit contributions
ched+0.048
pping+0.046
ked+0.046
pped+0.046
sels+0.045
Neg logit contributions
 new-0.071
 big-0.068
 old-0.067
 sight-0.065
 large-0.064
Token.
Token ID13
Feature activation-1.96e+00

Pos logit contributions
 could+0.006
 wouldn+0.005
 fought+0.005
 argued+0.005
es+0.005
Neg logit contributions
 sight-0.008
 new-0.008
 elderly-0.007
 view-0.007
 biggest-0.007
Token Lily
Token ID20037
Feature activation-4.72e+00

Pos logit contributions
 could+0.033
 wouldn+0.031
 argued+0.030
 fought+0.030
 rode+0.029
Neg logit contributions
 sight-0.046
 new-0.045
 old-0.040
 big-0.040
 view-0.040
Token ran
Token ID4966
Feature activation+4.76e-02

Pos logit contributions
fed+0.075
uable+0.067
sels+0.067
pped+0.065
 Leave+0.065
Neg logit contributions
 big-0.109
 new-0.107
 old-0.105
 dark-0.098
 large-0.097
Token behind
Token ID2157
Feature activation-7.85e-01

Pos logit contributions
 sight+0.001
 new+0.001
 world+0.001
 old+0.001
 view+0.001
Neg logit contributions
 could-0.001
 wouldn-0.001
 fought-0.001
 argued-0.001
 rode-0.001
Token a
Token ID257
Feature activation+7.56e+00

Pos logit contributions
 could+0.015
 argued+0.014
 wouldn+0.014
 fought+0.014
 rode+0.013
Neg logit contributions
 sight-0.016
 new-0.016
 biggest-0.014
 elderly-0.014
 world-0.014
Token big
Token ID1263
Feature activation+8.50e+00

Pos logit contributions
 sight+0.093
 future+0.076
 new+0.076
err+0.075
 biggest+0.074
Neg logit contributions
 could-0.212
 let-0.187
 wouldn-0.185
 snatched-0.184
 managed-0.182
Token tree
Token ID5509
Feature activation-9.85e-01

Pos logit contributions
 sight+0.132
 new+0.130
 biggest+0.120
 future+0.116
 elderly+0.113
Neg logit contributions
 could-0.195
 fought-0.183
 argued-0.178
 snatched-0.178
 wouldn-0.178
Token and
Token ID290
Feature activation-3.80e+00

Pos logit contributions
 could+0.017
 wouldn+0.017
 argued+0.017
 fought+0.016
 rode+0.016
Neg logit contributions
 sight-0.022
 new-0.021
 big-0.019
 old-0.019
 world-0.019
Token waited
Token ID13488
Feature activation-2.24e+00

Pos logit contributions
es+0.043
oped+0.042
 argued+0.042
ched+0.041
 wouldn+0.040
Neg logit contributions
 new-0.101
 sight-0.098
 big-0.096
 old-0.095
 dark-0.094
Token for
Token ID329
Feature activation+1.25e+00

Pos logit contributions
 could+0.044
 wouldn+0.042
 argued+0.042
 fought+0.042
 rode+0.041
Neg logit contributions
 sight-0.045
 new-0.045
 big-0.039
 old-0.039
 view-0.039
Token Tom
Token ID4186
Feature activation-1.57e+00

Pos logit contributions
 sight+0.032
 world+0.032
 new+0.030
 biggest+0.030
 entire+0.030
Neg logit contributions
 could-0.016
 fought-0.014
 argued-0.014
 wouldn-0.014
 let-0.013
Token\n
Token ID198
Feature activation+1.18e+00

Pos logit contributions
 sight+0.035
 new+0.034
 view+0.031
 biggest+0.031
 world+0.031
Neg logit contributions
 could-0.028
 wouldn-0.026
 fought-0.025
 argued-0.025
 rode-0.025
TokenThey
Token ID2990
Feature activation-4.71e+00

Pos logit contributions
 sight+0.028
 new+0.027
 old+0.025
 big+0.025
 view+0.025
Neg logit contributions
 could-0.019
 wouldn-0.018
 argued-0.018
 fought-0.018
 rode-0.017
Token waved
Token ID26834
Feature activation-7.15e-01

Pos logit contributions
es+0.057
ched+0.057
 Leave+0.055
uable+0.054
oped+0.053
Neg logit contributions
 big-0.117
 new-0.114
 elderly-0.108
 old-0.107
 little-0.106
Token goodbye
Token ID24829
Feature activation-1.93e+00

Pos logit contributions
 wouldn+0.014
 could+0.014
 fought+0.013
 managed+0.013
 argued+0.013
Neg logit contributions
 new-0.015
 sight-0.014
 big-0.013
 old-0.013
 sun-0.013
Token to
Token ID284
Feature activation-9.78e-02

Pos logit contributions
 could+0.031
 wouldn+0.030
 argued+0.029
 fought+0.029
 managed+0.028
Neg logit contributions
 sight-0.045
 new-0.045
 old-0.041
 big-0.041
 elderly-0.040
Token the
Token ID262
Feature activation+1.01e+01

Pos logit contributions
 could+0.002
 wouldn+0.001
 argued+0.001
 fought+0.001
 rode+0.001
Neg logit contributions
 sight-0.002
 new-0.002
 world-0.002
 view-0.002
 old-0.002
Token neighbor
Token ID4780
Feature activation-1.80e+00

Pos logit contributions
 sight+0.158
igsaw+0.144
andel+0.141
 new+0.137
ummy+0.133
Neg logit contributions
 could-0.221
 fought-0.210
 rode-0.206
 wouldn-0.205
 let-0.201
Token and
Token ID290
Feature activation-3.14e+00

Pos logit contributions
ched+0.038
 managed+0.038
 wouldn+0.037
 argued+0.037
 fought+0.037
Neg logit contributions
 new-0.033
 old-0.030
 sun-0.029
 window-0.029
 sight-0.028
Token went
Token ID1816
Feature activation-1.35e-01

Pos logit contributions
ched+0.036
 could+0.035
pping+0.035
 wouldn+0.035
oped+0.035
Neg logit contributions
 new-0.085
 sight-0.082
 big-0.080
 old-0.078
 elderly-0.077
Token back
Token ID736
Feature activation-4.19e-01

Pos logit contributions
 could+0.003
 wouldn+0.003
 argued+0.003
 fought+0.003
 rode+0.003
Neg logit contributions
 sight-0.002
 new-0.002
 old-0.002
 big-0.002
 view-0.002
Token to
Token ID284
Feature activation+5.60e-01

Pos logit contributions
 could+0.006
 wouldn+0.006
 fought+0.006
 argued+0.006
 rode+0.006
Neg logit contributions
 sight-0.010
 new-0.010
 view-0.009
 world-0.009
 old-0.009
Token to
Token ID284
Feature activation+5.62e-01

Pos logit contributions
 could+0.023
 wouldn+0.022
 argued+0.022
 fought+0.021
ked+0.021
Neg logit contributions
 new-0.023
 sight-0.023
 big-0.020
 old-0.020
 future-0.019
Token eat
Token ID4483
Feature activation+2.28e-02

Pos logit contributions
 new+0.011
 sight+0.010
 big+0.010
 view+0.009
 old+0.009
Neg logit contributions
 could-0.012
 wouldn-0.011
ked-0.011
 argued-0.011
es-0.011
Token the
Token ID262
Feature activation+1.03e+01

Pos logit contributions
 sight+0.001
 new+0.000
 elderly+0.000
 world+0.000
 biggest+0.000
Neg logit contributions
 could-0.000
 wouldn-0.000
 argued-0.000
 fought-0.000
es-0.000
Token leaves
Token ID5667
Feature activation-2.58e+00

Pos logit contributions
 sight+0.139
igsaw+0.136
err+0.120
 new+0.120
 future+0.118
Neg logit contributions
 could-0.250
 wouldn-0.238
 fought-0.225
 argued-0.221
 let-0.219
Token without
Token ID1231
Feature activation+9.57e-01

Pos logit contributions
 could+0.045
 fought+0.045
 argued+0.044
 wouldn+0.044
 rode+0.043
Neg logit contributions
 new-0.056
 sight-0.055
 big-0.051
 old-0.050
 view-0.049
Token any
Token ID597
Feature activation+5.20e+00

Pos logit contributions
 sight+0.020
 new+0.020
 big+0.017
 old+0.017
 view+0.017
Neg logit contributions
 could-0.018
 wouldn-0.017
 argued-0.017
 fought-0.017
 rode-0.016
Token worries
Token ID18572
Feature activation-1.93e+00

Pos logit contributions
 sight+0.076
 new+0.070
 elderly+0.068
 world+0.066
err+0.066
Neg logit contributions
 could-0.126
 wouldn-0.120
 fought-0.118
 argued-0.118
 managed-0.115
Token.
Token ID13
Feature activation-2.59e+00

Pos logit contributions
 argued+0.038
 fought+0.038
 rode+0.037
 could+0.036
 wouldn+0.036
Neg logit contributions
 new-0.038
 sight-0.036
 big-0.035
 old-0.034
 sun-0.033
Token<|endoftext|>
Token ID50256
Feature activation+8.50e-01

Pos logit contributions
 wouldn+0.047
 could+0.047
 argued+0.047
 fought+0.047
 rode+0.044
Neg logit contributions
 sight-0.054
 new-0.054
 world-0.048
 old-0.048
 big-0.048
TokenOnce
Token ID7454
Feature activation+1.20e+00

Pos logit contributions
 sight+0.017
 new+0.017
 big+0.015
 world+0.015
 view+0.015
Neg logit contributions
 wouldn-0.017
 could-0.016
 argued-0.016
 fought-0.016
 rode-0.015
Token upon
Token ID2402
Feature activation+1.73e+00

Pos logit contributions
 sight+0.020
 new+0.018
 view+0.018
 elderly+0.018
 biggest+0.017
Neg logit contributions
 could-0.028
 wouldn-0.026
 fought-0.024
 argued-0.024
 rode-0.024
Token me
Token ID502
Feature activation-2.08e+00

Pos logit contributions
 could+0.029
 wouldn+0.028
 argued+0.027
 fought+0.027
 rode+0.026
Neg logit contributions
 sight-0.033
 new-0.032
 big-0.029
 old-0.029
 world-0.029
Token find
Token ID1064
Feature activation-2.08e-01

Pos logit contributions
 could+0.038
 argued+0.038
 fought+0.037
 wouldn+0.037
 rode+0.036
Neg logit contributions
 new-0.044
 sight-0.042
 big-0.040
 old-0.039
 world-0.038
Token<|endoftext|>
Token ID50256
Feature activation-6.67e-01

Pos logit contributions
 could+0.004
 wouldn+0.004
 fought+0.003
 argued+0.003
es+0.003
Neg logit contributions
 sight-0.004
 new-0.004
 world-0.004
 view-0.004
 elderly-0.004
TokenOnce
Token ID7454
Feature activation+5.14e-01

Pos logit contributions
 wouldn+0.016
 argued+0.016
 could+0.015
 fought+0.015
 managed+0.015
Neg logit contributions
 sight-0.011
 new-0.010
 world-0.009
 big-0.009
 view-0.009
Token upon
Token ID2402
Feature activation+1.43e+00

Pos logit contributions
 sight+0.008
 new+0.007
 view+0.007
 elderly+0.006
 biggest+0.006
Neg logit contributions
 could-0.013
 wouldn-0.012
 argued-0.012
 fought-0.012
 rode-0.011
Token a
Token ID257
Feature activation+5.75e+00

Pos logit contributions
 sight+0.029
 new+0.027
 future+0.025
 world+0.025
 window+0.025
Neg logit contributions
 could-0.028
 rode-0.025
 fought-0.024
 stood-0.024
 wouldn-0.024
Token time
Token ID640
Feature activation-8.46e-01

Pos logit contributions
 sight+0.086
 elderly+0.077
 doorway+0.075
 new+0.072
 beauty+0.072
Neg logit contributions
 could-0.136
 let-0.129
 managed-0.129
 rode-0.128
 fought-0.128
Token,
Token ID11
Feature activation+1.32e-01

Pos logit contributions
 could+0.014
 wouldn+0.013
 argued+0.013
 fought+0.013
 rode+0.012
Neg logit contributions
 sight-0.020
 new-0.020
 big-0.018
 old-0.018
 view-0.018
Token there
Token ID612
Feature activation-1.67e-01

Pos logit contributions
 sight+0.003
 new+0.003
 big+0.003
 old+0.003
 world+0.003
Neg logit contributions
 could-0.002
 wouldn-0.002
 argued-0.002
 fought-0.002
 rode-0.002
Token was
Token ID373
Feature activation+2.30e+00

Pos logit contributions
 argued+0.003
 fought+0.003
 wouldn+0.003
 snatched+0.002
 could+0.002
Neg logit contributions
 new-0.004
 sight-0.004
 big-0.004
 world-0.004
 old-0.004
Token a
Token ID257
Feature activation+7.97e+00

Pos logit contributions
 decision+0.051
 sight+0.051
 surroundings+0.050
 new+0.048
 world+0.048
Neg logit contributions
 could-0.037
 rode-0.033
 stood-0.033
 fought-0.032
 wrote-0.032
Token smiled
Token ID13541
Feature activation+1.53e-01

Pos logit contributions
 Leave+0.044
 argued+0.043
oped+0.043
 fought+0.043
pping+0.043
Neg logit contributions
 new-0.105
 sight-0.097
 big-0.097
 sun-0.093
 old-0.092
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 sight+0.003
 new+0.003
 world+0.003
 view+0.003
 future+0.003
Neg logit contributions
 could-0.003
 wouldn-0.003
 argued-0.002
 fought-0.002
 rode-0.002
Token happy
Token ID3772
Feature activation+8.62e-01

Pos logit contributions
es+0.024
 could+0.024
 fought+0.023
ched+0.023
 argued+0.023
Neg logit contributions
 new-0.019
 sight-0.019
 old-0.017
 elderly-0.017
 big-0.017
Token to
Token ID284
Feature activation+2.52e-01

Pos logit contributions
 sight+0.022
 new+0.022
 big+0.020
 old+0.020
 view+0.019
Neg logit contributions
 could-0.013
 wouldn-0.012
 argued-0.012
 fought-0.012
 rode-0.011
Token have
Token ID423
Feature activation-1.45e-01

Pos logit contributions
 sight+0.007
 new+0.007
 big+0.006
 old+0.006
 view+0.006
Neg logit contributions
 could-0.003
 wouldn-0.003
 argued-0.003
 fought-0.003
 rode-0.003
Token her
Token ID607
Feature activation+6.16e+00

Pos logit contributions
 wouldn+0.003
 could+0.003
 argued+0.002
ked+0.002
oped+0.002
Neg logit contributions
 new-0.003
 sight-0.003
 big-0.003
 sun-0.003
 future-0.003
Token picture
Token ID4286
Feature activation-2.31e-01

Pos logit contributions
 sight+0.081
andel+0.074
 view+0.071
 doorway+0.070
err+0.070
Neg logit contributions
 could-0.152
 fought-0.144
 wouldn-0.144
 argued-0.141
 managed-0.139
Token in
Token ID287
Feature activation+9.58e-01

Pos logit contributions
 could+0.004
 wouldn+0.004
 argued+0.004
 fought+0.004
 rode+0.004
Neg logit contributions
 sight-0.005
 new-0.005
 big-0.004
 old-0.004
 view-0.004
Token a
Token ID257
Feature activation+7.34e+00

Pos logit contributions
 sight+0.022
 new+0.022
 biggest+0.021
 view+0.021
 world+0.020
Neg logit contributions
 could-0.015
 wouldn-0.013
 fought-0.013
 argued-0.013
 managed-0.012
Token funny
Token ID8258
Feature activation+8.14e+00

Pos logit contributions
 sight+0.081
 elderly+0.071
err+0.070
 view+0.070
ummy+0.068
Neg logit contributions
 could-0.202
 wouldn-0.190
 fought-0.189
 snatched-0.189
 managed-0.187
Token frame
Token ID5739
Feature activation-3.48e-01

Pos logit contributions
 sight+0.127
 new+0.127
 elderly+0.121
 biggest+0.116
 old+0.111
Neg logit contributions
 could-0.183
 fought-0.175
 argued-0.175
 wouldn-0.175
 wrote-0.168
Token and
Token ID290
Feature activation+6.42e-01

Pos logit contributions
 argued+0.082
ked+0.081
 fought+0.081
uable+0.080
ged+0.079
Neg logit contributions
 new-0.104
 big-0.102
 old-0.100
 sight-0.099
 elderly-0.098
Token her
Token ID607
Feature activation+5.31e+00

Pos logit contributions
 sight+0.012
 new+0.012
 old+0.011
 big+0.011
 elderly+0.011
Neg logit contributions
 could-0.013
 wouldn-0.013
ched-0.013
 fought-0.013
 managed-0.013
Token mom
Token ID1995
Feature activation-1.97e+00

Pos logit contributions
andel+0.097
 sight+0.092
err+0.089
 view+0.089
 future+0.087
Neg logit contributions
 could-0.109
 wouldn-0.096
 fought-0.093
 shared-0.092
 hugged-0.091
Tokenmy
Token ID1820
Feature activation-2.65e+00

Pos logit contributions
ched+0.041
 could+0.041
 wouldn+0.041
 fought+0.040
 snatched+0.040
Neg logit contributions
 new-0.033
 sight-0.032
 big-0.030
 sun-0.030
 window-0.030
Token had
Token ID550
Feature activation+2.33e-01

Pos logit contributions
ched+0.032
uable+0.031
 wouldn+0.030
 scored+0.030
 could+0.029
Neg logit contributions
 new-0.064
 big-0.063
 sight-0.062
 sun-0.062
 whole-0.062
Token a
Token ID257
Feature activation+7.57e+00

Pos logit contributions
 new+0.004
 sight+0.004
 big+0.004
 whole+0.003
 elderly+0.003
Neg logit contributions
 wouldn-0.005
oped-0.005
ked-0.005
ged-0.005
 could-0.005
Token lovely
Token ID14081
Feature activation+7.88e+00

Pos logit contributions
 sight+0.091
andel+0.086
iring+0.085
 ey+0.085
 elderly+0.085
Neg logit contributions
 could-0.195
 wouldn-0.187
 managed-0.182
 shared-0.182
 rode-0.180
Token dinner
Token ID8073
Feature activation-1.71e+00

Pos logit contributions
 sight+0.098
 elderly+0.097
 biggest+0.096
 new+0.093
andel+0.092
Neg logit contributions
 could-0.187
 rode-0.179
 wouldn-0.178
 shared-0.177
 wrote-0.177
Token together
Token ID1978
Feature activation-1.15e+00

Pos logit contributions
 could+0.030
 wouldn+0.029
 argued+0.029
 fought+0.028
ched+0.028
Neg logit contributions
 new-0.037
 sight-0.037
 big-0.034
 world-0.033
 old-0.033
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
ched+0.024
 wouldn+0.024
 could+0.024
 rode+0.024
ked+0.024
Neg logit contributions
 sight-0.021
 new-0.021
 big-0.018
 world-0.018
 view-0.018
Token They
Token ID1119
Feature activation-3.51e+00

Pos logit contributions
 could+0.021
 wouldn+0.021
 fought+0.021
 argued+0.020
 managed+0.019
Neg logit contributions
 sight-0.030
 new-0.029
 big-0.026
 world-0.026
 old-0.026
Token the
Token ID262
Feature activation+8.82e+00

Pos logit contributions
 could+0.041
 wouldn+0.040
ched+0.039
 fought+0.038
 argued+0.038
Neg logit contributions
 new-0.082
 sight-0.081
 big-0.075
 sun-0.073
 old-0.073
Token ants
Token ID27842
Feature activation-5.10e+00

Pos logit contributions
 sight+0.114
 new+0.103
igsaw+0.093
 future+0.092
andel+0.092
Neg logit contributions
 could-0.220
 wouldn-0.209
 argued-0.208
 fought-0.206
 shared-0.201
Token had
Token ID550
Feature activation-2.28e-01

Pos logit contributions
uable+0.069
ked+0.067
 wouldn+0.066
 Leave+0.064
 rode+0.063
Neg logit contributions
 new-0.124
 big-0.117
 sight-0.116
 old-0.111
 large-0.109
Token more
Token ID517
Feature activation+3.75e+00

Pos logit contributions
 wouldn+0.005
oped+0.005
 could+0.005
ked+0.005
 rode+0.005
Neg logit contributions
 new-0.004
 sight-0.004
 big-0.003
 old-0.003
 large-0.003
Token than
Token ID621
Feature activation-7.99e-01

Pos logit contributions
 sight+0.047
 new+0.044
 elderly+0.038
 world+0.038
 view+0.037
Neg logit contributions
 could-0.102
 wouldn-0.098
 argued-0.097
 fought-0.097
 managed-0.094
Token enough
Token ID1576
Feature activation+2.81e+00

Pos logit contributions
 could+0.013
 wouldn+0.013
 argued+0.012
 fought+0.012
 rode+0.012
Neg logit contributions
 sight-0.018
 new-0.018
 big-0.016
 old-0.016
 view-0.016
Token for
Token ID329
Feature activation-4.79e-02

Pos logit contributions
 sight+0.060
 new+0.059
 old+0.052
 big+0.052
 view+0.052
Neg logit contributions
 could-0.052
 wouldn-0.050
 argued-0.048
 fought-0.048
 rode-0.047
Token the
Token ID262
Feature activation+9.94e+00

Pos logit contributions
 could+0.001
 wouldn+0.001
 fought+0.001
 argued+0.001
 rode+0.001
Neg logit contributions
 sight-0.001
 new-0.001
 big-0.001
 old-0.001
 view-0.001
Token whole
Token ID2187
Feature activation+8.34e+00

Pos logit contributions
 sight+0.117
 new+0.111
err+0.098
igsaw+0.098
 view+0.094
Neg logit contributions
 could-0.250
 wouldn-0.242
 argued-0.234
 fought-0.232
 let-0.227
Token day
Token ID1110
Feature activation-1.94e+00

Pos logit contributions
 new+0.081
 sight+0.077
 biggest+0.069
 elderly+0.068
 old+0.064
Neg logit contributions
 could-0.237
 argued-0.226
 wouldn-0.225
 fought-0.224
 managed-0.221
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
 wouldn+0.041
 fought+0.040
 argued+0.040
 rode+0.040
 could+0.039
Neg logit contributions
 sight-0.035
 new-0.035
 big-0.032
 world-0.031
 old-0.031
Token,
Token ID11
Feature activation-1.98e+00

Pos logit contributions
 argued+0.018
ked+0.018
 wouldn+0.018
ched+0.018
 could+0.017
Neg logit contributions
 new-0.015
 sight-0.014
 big-0.014
 old-0.013
 large-0.012
Token look
Token ID804
Feature activation-6.27e-01

Pos logit contributions
oped+0.039
 argued+0.037
ched+0.037
 wouldn+0.036
ged+0.036
Neg logit contributions
 new-0.044
 big-0.041
 sight-0.039
 old-0.039
 sun-0.039
Token at
Token ID379
Feature activation+1.69e-01

Pos logit contributions
 argued+0.012
 fought+0.012
 rode+0.012
 could+0.011
 wouldn+0.011
Neg logit contributions
 new-0.014
 sight-0.013
 big-0.013
 old-0.012
 large-0.012
Token my
Token ID616
Feature activation+8.09e+00

Pos logit contributions
 sight+0.004
 new+0.004
 big+0.004
 old+0.004
 view+0.004
Neg logit contributions
 could-0.003
 wouldn-0.002
 argued-0.002
 fought-0.002
 rode-0.002
Token pe
Token ID613
Feature activation+4.78e+00

Pos logit contributions
 sight+0.156
 new+0.150
igsaw+0.149
 future+0.145
 elderly+0.142
Neg logit contributions
 could-0.151
 wouldn-0.142
 argued-0.132
 fought-0.131
 stood-0.130
Tokenb
Token ID65
Feature activation+2.13e+00

Pos logit contributions
 sight+0.118
 new+0.115
 elderly+0.105
 world+0.105
 old+0.105
Neg logit contributions
 could-0.072
 wouldn-0.069
 fought-0.065
 argued-0.065
 rode-0.063
Tokenble
Token ID903
Feature activation-1.86e+00

Pos logit contributions
 new+0.031
 sight+0.031
 big+0.027
 view+0.027
 biggest+0.027
Neg logit contributions
 could-0.052
 argued-0.051
 fought-0.051
 wouldn-0.051
 rode-0.049
Token!"
Token ID2474
Feature activation-1.84e+00

Pos logit contributions
 could+0.031
 wouldn+0.031
 argued+0.030
 fought+0.030
 rode+0.030
Neg logit contributions
 new-0.042
 sight-0.041
 big-0.038
 old-0.038
 view-0.037
Token Sara
Token ID24799
Feature activation-4.63e+00

Pos logit contributions
 wouldn+0.035
 could+0.035
 fought+0.033
 argued+0.033
es+0.032
Neg logit contributions
 new-0.039
 sight-0.038
 big-0.035
 old-0.035
 elderly-0.034
Token said
Token ID531
Feature activation-1.13e+00

Pos logit contributions
pped+0.077
pping+0.072
 wouldn+0.072
fed+0.069
oped+0.069
Neg logit contributions
 new-0.104
 old-0.102
 big-0.100
 elderly-0.094
 small-0.094
Token,
Token ID11
Feature activation-8.42e-01

Pos logit contributions
 could+0.025
 wouldn+0.025
 fought+0.024
 rode+0.024
 argued+0.024
Neg logit contributions
 new-0.020
 sight-0.018
 big-0.018
 old-0.018
 sun-0.016
Tokenries
Token ID1678
Feature activation-8.74e-01

Pos logit contributions
 sight+0.105
 new+0.104
 old+0.093
 big+0.092
 world+0.092
Neg logit contributions
 could-0.093
 wouldn-0.089
 argued-0.087
 fought-0.087
 rode-0.084
Token on
Token ID319
Feature activation+2.20e+00

Pos logit contributions
 wouldn+0.017
 could+0.016
 fought+0.016
 argued+0.016
 rode+0.016
Neg logit contributions
 sight-0.017
 new-0.017
 big-0.016
 old-0.016
 sun-0.015
Token it
Token ID340
Feature activation-1.49e+00

Pos logit contributions
 sight+0.041
 new+0.041
 elderly+0.036
 world+0.036
 old+0.036
Neg logit contributions
 could-0.046
 wouldn-0.044
 argued-0.043
 fought-0.043
 rode-0.042
Token.
Token ID13
Feature activation-2.07e+00

Pos logit contributions
 fought+0.033
 argued+0.033
 rode+0.033
 managed+0.033
oped+0.033
Neg logit contributions
 new-0.026
 sight-0.025
 big-0.023
 old-0.023
 dark-0.021
Token She
Token ID1375
Feature activation-3.55e+00

Pos logit contributions
 fought+0.038
 wouldn+0.038
 argued+0.037
 could+0.036
 managed+0.036
Neg logit contributions
 new-0.045
 sight-0.044
 big-0.041
 sun-0.040
 old-0.040
Token wanted
Token ID2227
Feature activation-4.45e-02

Pos logit contributions
 wouldn+0.036
es+0.036
ched+0.036
oped+0.036
 could+0.034
Neg logit contributions
 new-0.099
 sight-0.095
 big-0.094
 old-0.092
 large-0.089
Token to
Token ID284
Feature activation+3.52e-01

Pos logit contributions
 could+0.001
 wouldn+0.001
 argued+0.001
 fought+0.001
 rode+0.001
Neg logit contributions
 sight-0.001
 new-0.001
 old-0.001
 world-0.001
 big-0.001
Token eat
Token ID4483
Feature activation+6.11e-02

Pos logit contributions
 sight+0.008
 new+0.008
 world+0.008
 old+0.008
 elderly+0.007
Neg logit contributions
 could-0.006
 wouldn-0.005
 argued-0.005
 fought-0.005
 rode-0.005
Token them
Token ID606
Feature activation-5.59e-01

Pos logit contributions
 sight+0.002
 new+0.002
 big+0.001
 old+0.001
 view+0.001
Neg logit contributions
 could-0.001
 wouldn-0.001
 fought-0.001
 argued-0.001
 rode-0.001
Token,
Token ID11
Feature activation-9.32e-01

Pos logit contributions
 could+0.011
 wouldn+0.010
 fought+0.010
 argued+0.010
 rode+0.010
Neg logit contributions
 new-0.012
 sight-0.011
 big-0.010
 view-0.010
 old-0.010
Token but
Token ID475
Feature activation-2.07e+00

Pos logit contributions
oped+0.019
es+0.019
 fought+0.019
ched+0.019
 could+0.018
Neg logit contributions
 new-0.019
 sight-0.017
 big-0.016
 old-0.016
 sun-0.016
Token™
Token ID8151
Feature activation-2.17e+00

Pos logit contributions
 sight+0.003
 new+0.003
 big+0.003
 world+0.003
 whole+0.003
Neg logit contributions
 argued-0.002
ched-0.002
 wouldn-0.002
ked-0.002
icked-0.002
Tokent
Token ID83
Feature activation+2.00e+00

Pos logit contributions
 could+0.034
 wouldn+0.032
 argued+0.031
 fought+0.031
 rode+0.030
Neg logit contributions
 sight-0.052
 new-0.052
 big-0.047
 old-0.047
 world-0.046
Token sure
Token ID1654
Feature activation-1.11e+00

Pos logit contributions
 sight+0.036
 new+0.036
 big+0.032
 old+0.032
 elderly+0.031
Neg logit contributions
 could-0.043
 wouldn-0.042
 argued-0.041
 fought-0.041
 rode-0.040
Token what
Token ID644
Feature activation-1.84e+00

Pos logit contributions
 argued+0.022
ked+0.022
 fought+0.022
ched+0.022
 snatched+0.022
Neg logit contributions
 new-0.023
 sight-0.020
 old-0.020
 big-0.019
 little-0.018
Token to
Token ID284
Feature activation-5.53e-01

Pos logit contributions
 argued+0.035
ked+0.035
oped+0.033
 snatched+0.033
 wouldn+0.033
Neg logit contributions
 new-0.042
 big-0.038
 old-0.037
 sight-0.037
 world-0.035
Token do
Token ID466
Feature activation-2.91e+00

Pos logit contributions
 could+0.009
 wouldn+0.009
 fought+0.009
 argued+0.009
 managed+0.009
Neg logit contributions
 new-0.013
 sight-0.012
 big-0.012
 old-0.012
 world-0.011
Token.
Token ID13
Feature activation-2.28e+00

Pos logit contributions
ked+0.070
 rode+0.069
 argued+0.069
 snatched+0.068
oped+0.068
Neg logit contributions
 new-0.049
 sight-0.044
 big-0.043
 old-0.043
 world-0.042
Token He
Token ID679
Feature activation-4.48e+00

Pos logit contributions
 fought+0.039
 could+0.039
 argued+0.038
 wouldn+0.038
es+0.037
Neg logit contributions
 new-0.051
 sight-0.050
 old-0.047
 elderly-0.047
 big-0.046
Token kept
Token ID4030
Feature activation-2.27e-01

Pos logit contributions
pped+0.055
ched+0.055
ged+0.055
ked+0.053
pping+0.053
Neg logit contributions
 new-0.115
 big-0.110
 old-0.109
 sight-0.107
 elderly-0.105
Token on
Token ID319
Feature activation+1.51e+00

Pos logit contributions
 could+0.004
 wouldn+0.004
 fought+0.004
 argued+0.004
 managed+0.004
Neg logit contributions
 sight-0.005
 new-0.005
 big-0.004
 old-0.004
 elderly-0.004
Token holding
Token ID4769
Feature activation-3.85e-01

Pos logit contributions
 sight+0.031
 new+0.031
 view+0.027
 old+0.027
 big+0.027
Neg logit contributions
 could-0.029
 wouldn-0.028
 argued-0.027
 fought-0.027
 rode-0.026
Token ate
Token ID15063
Feature activation-5.66e-01

Pos logit contributions
ked+0.058
 Leave+0.057
oped+0.056
es+0.055
ged+0.055
Neg logit contributions
 new-0.114
 big-0.108
 sight-0.108
 old-0.106
 elderly-0.104
Token it
Token ID340
Feature activation-1.57e+00

Pos logit contributions
 could+0.007
 wouldn+0.006
 fought+0.006
 argued+0.006
 rode+0.006
Neg logit contributions
 sight-0.016
 new-0.016
 big-0.014
 old-0.014
 view-0.014
Token anyway
Token ID6949
Feature activation-2.46e+00

Pos logit contributions
 could+0.028
 wouldn+0.027
 fought+0.026
 argued+0.026
 rode+0.026
Neg logit contributions
 sight-0.034
 new-0.034
 big-0.030
 old-0.030
 view-0.030
Token because
Token ID780
Feature activation-8.79e-01

Pos logit contributions
 could+0.046
 fought+0.046
 argued+0.045
 wouldn+0.045
 rode+0.044
Neg logit contributions
 new-0.051
 sight-0.050
 old-0.045
 big-0.045
 world-0.044
Token she
Token ID673
Feature activation-3.07e+00

Pos logit contributions
 could+0.009
 wouldn+0.008
 argued+0.008
 fought+0.008
 rode+0.007
Neg logit contributions
 sight-0.026
 new-0.026
 big-0.024
 old-0.024
 view-0.024
Token was
Token ID373
Feature activation+9.52e-01

Pos logit contributions
ked+0.043
oped+0.039
ged+0.039
 Leave+0.039
 rode+0.038
Neg logit contributions
 new-0.073
 big-0.072
 sight-0.070
 old-0.069
 large-0.067
Token a
Token ID257
Feature activation+6.37e+00

Pos logit contributions
 new+0.016
 sight+0.015
 old+0.014
 elderly+0.014
 big+0.014
Neg logit contributions
 could-0.023
 wouldn-0.022
oped-0.022
 fought-0.022
 argued-0.022
Token big
Token ID1263
Feature activation+7.66e+00

Pos logit contributions
 sight+0.107
 view+0.095
 new+0.094
err+0.092
ummy+0.091
Neg logit contributions
 could-0.144
 wouldn-0.133
 managed-0.129
 snatched-0.127
 fought-0.127
Token girl
Token ID2576
Feature activation-1.65e+00

Pos logit contributions
 sight+0.120
 new+0.120
 biggest+0.108
 view+0.106
err+0.103
Neg logit contributions
 could-0.170
 wouldn-0.164
 fought-0.161
 argued-0.159
 snatched-0.154
Token.
Token ID13
Feature activation-2.06e+00

Pos logit contributions
 argued+0.033
ched+0.032
 rode+0.032
 fought+0.031
icked+0.031
Neg logit contributions
 new-0.032
 sight-0.030
 old-0.028
 world-0.028
 big-0.028
Token After
Token ID2293
Feature activation-1.31e+00

Pos logit contributions
 fought+0.037
 wouldn+0.036
 could+0.035
 argued+0.035
 rode+0.034
Neg logit contributions
 sight-0.045
 new-0.045
 big-0.041
 sun-0.040
 world-0.040
Token\n
Token ID198
Feature activation+1.50e+00

Pos logit contributions
 could+0.029
 wouldn+0.028
 argued+0.027
 fought+0.027
 rode+0.026
Neg logit contributions
 sight-0.037
 new-0.037
 old-0.033
 big-0.033
 view-0.033
Token\n
Token ID198
Feature activation+1.33e+00

Pos logit contributions
 sight+0.032
 new+0.032
 big+0.028
 old+0.028
 world+0.028
Neg logit contributions
 could-0.027
 wouldn-0.027
 argued-0.026
 fought-0.026
 rode-0.025
TokenHis
Token ID6653
Feature activation+4.58e+00

Pos logit contributions
 new+0.027
 sight+0.026
 big+0.024
 old+0.023
 elderly+0.023
Neg logit contributions
 could-0.026
 wouldn-0.025
 argued-0.025
 fought-0.024
es-0.024
Token sister
Token ID6621
Feature activation-5.14e+00

Pos logit contributions
 sight+0.109
 view+0.107
 beauty+0.100
umm+0.098
 sunrise+0.096
Neg logit contributions
 could-0.071
 let-0.062
 wouldn-0.059
 fought-0.059
es-0.058
Token was
Token ID373
Feature activation+1.96e+00

Pos logit contributions
pping+0.074
pped+0.072
oped+0.071
ched+0.071
sels+0.069
Neg logit contributions
 new-0.120
 old-0.114
 big-0.112
 little-0.111
 sight-0.110
Token happy
Token ID3772
Feature activation-6.57e-01

Pos logit contributions
 new+0.029
 sight+0.028
 big+0.025
 old+0.025
 elderly+0.024
Neg logit contributions
 could-0.049
 wouldn-0.048
 argued-0.047
 fought-0.047
 rode-0.045
Token and
Token ID290
Feature activation-3.72e+00

Pos logit contributions
ched+0.013
 wouldn+0.013
oped+0.013
 argued+0.013
 rode+0.013
Neg logit contributions
 new-0.013
 sight-0.013
 old-0.012
 big-0.011
 sun-0.011
Token they
Token ID484
Feature activation-3.52e+00

Pos logit contributions
pping+0.060
pped+0.059
oped+0.058
ched+0.056
ppy+0.055
Neg logit contributions
 new-0.094
 old-0.087
 big-0.086
 elderly-0.085
 sun-0.079
Token played
Token ID2826
Feature activation+1.63e+00

Pos logit contributions
 Leave+0.052
es+0.052
ched+0.048
 loses+0.046
pping+0.044
Neg logit contributions
 big-0.076
 new-0.074
 sun-0.072
 bright-0.070
 elderly-0.070
Token a
Token ID257
Feature activation+7.20e+00

Pos logit contributions
 new+0.037
 sight+0.036
 big+0.033
 old+0.032
 view+0.032
Neg logit contributions
 could-0.029
 wouldn-0.028
 argued-0.027
 fought-0.027
 rode-0.026
Token fun
Token ID1257
Feature activation+6.48e+00

Pos logit contributions
 sight+0.116
 ey+0.106
ummy+0.100
andel+0.098
umm+0.096
Neg logit contributions
 scored-0.160
 could-0.160
 let-0.157
 wouldn-0.154
 fought-0.152
TokenShe
Token ID3347
Feature activation-4.74e+00

Pos logit contributions
 sight+0.031
 new+0.030
 old+0.028
 view+0.028
 world+0.028
Neg logit contributions
 could-0.016
 wouldn-0.015
 fought-0.015
 argued-0.015
 rode-0.014
Token walked
Token ID6807
Feature activation-8.04e-01

Pos logit contributions
 argued+0.056
 wouldn+0.056
 could+0.054
 fought+0.053
ched+0.053
Neg logit contributions
 new-0.126
 big-0.120
 sight-0.119
 old-0.118
 elderly-0.115
Token to
Token ID284
Feature activation-8.43e-01

Pos logit contributions
 could+0.012
 wouldn+0.011
 argued+0.011
 fought+0.010
 managed+0.010
Neg logit contributions
 sight-0.020
 new-0.020
 world-0.018
 biggest-0.018
 future-0.018
Token Farmer
Token ID37755
Feature activation-1.40e+00

Pos logit contributions
 could+0.012
 wouldn+0.011
 argued+0.011
 fought+0.011
 let+0.010
Neg logit contributions
 sight-0.021
 new-0.020
 world-0.020
 biggest-0.019
 future-0.019
Token Joe
Token ID5689
Feature activation-2.07e+00

Pos logit contributions
ched+0.026
 wouldn+0.026
 could+0.026
 rode+0.026
 argued+0.026
Neg logit contributions
 new-0.028
 sight-0.028
 old-0.026
 big-0.025
 elderly-0.025
Token and
Token ID290
Feature activation-3.82e+00

Pos logit contributions
 wouldn+0.043
 could+0.042
 fought+0.042
 argued+0.042
 managed+0.042
Neg logit contributions
 new-0.039
 old-0.036
 big-0.035
 window-0.034
 sight-0.034
Token took
Token ID1718
Feature activation-7.51e-01

Pos logit contributions
 wouldn+0.046
 could+0.045
 argued+0.044
 fought+0.042
ched+0.042
Neg logit contributions
 new-0.102
 sight-0.100
 big-0.095
 old-0.095
 elderly-0.093
Token the
Token ID262
Feature activation+9.75e+00

Pos logit contributions
 could+0.011
 wouldn+0.010
 argued+0.010
 fought+0.010
ched+0.010
Neg logit contributions
 sight-0.019
 new-0.018
 world-0.017
 elderly-0.017
 future-0.017
Token toy
Token ID13373
Feature activation+2.33e+00

Pos logit contributions
 sight+0.158
 view+0.137
 world+0.135
 future+0.132
ummy+0.132
Neg logit contributions
 could-0.215
 wouldn-0.208
 argued-0.207
 fought-0.207
 snatched-0.206
Token cars
Token ID5006
Feature activation-1.38e+00

Pos logit contributions
 sight+0.038
 new+0.037
 view+0.032
 old+0.032
 world+0.032
Neg logit contributions
 could-0.055
 wouldn-0.053
 fought-0.052
 argued-0.052
 rode-0.050
Token from
Token ID422
Feature activation+9.56e-01

Pos logit contributions
 could+0.025
 wouldn+0.024
 argued+0.024
 fought+0.024
 rode+0.023
Neg logit contributions
 sight-0.029
 new-0.029
 big-0.026
 old-0.026
 view-0.026
Token other
Token ID584
Feature activation-4.14e-01

Pos logit contributions
 sight+0.060
umm+0.054
igsaw+0.053
 former+0.052
 world+0.048
Neg logit contributions
 could-0.095
 let-0.092
 argued-0.085
es-0.085
 rode-0.085
Token and
Token ID290
Feature activation-3.19e+00

Pos logit contributions
 could+0.006
 wouldn+0.006
 argued+0.006
 fought+0.006
 rode+0.006
Neg logit contributions
 sight-0.010
 new-0.010
 view-0.009
 world-0.009
 future-0.009
Token felt
Token ID2936
Feature activation+1.89e+00

Pos logit contributions
 could+0.034
ched+0.034
 wouldn+0.033
oped+0.032
es+0.031
Neg logit contributions
 new-0.088
 sight-0.086
 big-0.083
 old-0.081
 sun-0.080
Token happy
Token ID3772
Feature activation-8.97e-02

Pos logit contributions
 sight+0.029
 new+0.028
 world+0.024
 view+0.024
 old+0.024
Neg logit contributions
 could-0.046
 wouldn-0.045
 fought-0.044
 argued-0.044
 rode-0.043
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
 could+0.002
 wouldn+0.002
 argued+0.002
 fought+0.002
 rode+0.001
Neg logit contributions
 sight-0.002
 new-0.002
 big-0.002
 old-0.002
 view-0.002
Token They
Token ID1119
Feature activation-3.84e+00

Pos logit contributions
 could+0.025
 wouldn+0.024
 argued+0.023
 fought+0.023
 rode+0.022
Neg logit contributions
 sight-0.046
 new-0.045
 view-0.042
 old-0.041
 big-0.041
Token liked
Token ID8288
Feature activation+9.43e-01

Pos logit contributions
es+0.045
ched+0.042
 Leave+0.041
oped+0.040
ged+0.040
Neg logit contributions
 big-0.098
 new-0.096
 sight-0.091
 large-0.091
 dark-0.091
Token their
Token ID511
Feature activation+8.58e+00

Pos logit contributions
 new+0.021
 sight+0.021
 big+0.019
 old+0.019
 elderly+0.018
Neg logit contributions
 could-0.017
 wouldn-0.016
 argued-0.016
 fought-0.016
 rode-0.015
Token room
Token ID2119
Feature activation-6.36e-01

Pos logit contributions
 sight+0.112
err+0.105
andel+0.102
 biggest+0.099
 new+0.099
Neg logit contributions
 could-0.205
 wouldn-0.197
 fought-0.196
 argued-0.190
 rode-0.185
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
 could+0.012
 wouldn+0.012
 fought+0.011
 argued+0.011
 rode+0.011
Neg logit contributions
 sight-0.013
 new-0.012
 view-0.011
 elderly-0.011
 world-0.011
Token their
Token ID511
Feature activation+7.90e+00

Pos logit contributions
ched+0.023
 wouldn+0.023
 could+0.023
oped+0.023
 argued+0.022
Neg logit contributions
 new-0.028
 sight-0.027
 big-0.025
 old-0.025
 sun-0.024
Token,
Token ID11
Feature activation-2.42e+00

Pos logit contributions
 argued+0.092
ked+0.087
 scored+0.086
 fought+0.086
 rode+0.085
Neg logit contributions
 new-0.067
 old-0.063
 big-0.063
 sight-0.060
 little-0.059
Token drink
Token ID4144
Feature activation-7.87e-01

Pos logit contributions
 argued+0.038
 fought+0.038
ked+0.038
oped+0.037
es+0.037
Neg logit contributions
 new-0.060
 big-0.055
 sight-0.055
 old-0.054
 sun-0.053
Token some
Token ID617
Feature activation+4.17e+00

Pos logit contributions
 could+0.016
 wouldn+0.016
 argued+0.015
 fought+0.015
 rode+0.015
Neg logit contributions
 sight-0.015
 new-0.015
 old-0.013
 world-0.013
 view-0.013
Token water
Token ID1660
Feature activation-2.66e+00

Pos logit contributions
 sight+0.063
 new+0.053
 world+0.052
err+0.051
igsaw+0.051
Neg logit contributions
 could-0.105
 wouldn-0.097
ched-0.092
 let-0.092
 argued-0.091
Token.
Token ID13
Feature activation-2.80e+00

Pos logit contributions
 could+0.050
 fought+0.049
 argued+0.049
 rode+0.049
 wouldn+0.048
Neg logit contributions
 new-0.054
 sight-0.054
 big-0.050
 old-0.049
 sun-0.049
Token Water
Token ID5638
Feature activation-5.13e+00

Pos logit contributions
 could+0.039
 fought+0.038
 wouldn+0.038
 argued+0.038
 rode+0.036
Neg logit contributions
 sight-0.070
 new-0.069
 sun-0.065
 world-0.064
 old-0.064
Token is
Token ID318
Feature activation-1.61e-02

Pos logit contributions
 rode+0.098
 fought+0.098
 argued+0.097
 managed+0.092
 wrote+0.091
Neg logit contributions
 new-0.104
 sight-0.100
 big-0.099
 sun-0.095
 old-0.093
Token very
Token ID845
Feature activation+2.64e+00

Pos logit contributions
 could+0.000
 wouldn+0.000
 argued+0.000
 fought+0.000
 rode+0.000
Neg logit contributions
 sight-0.000
 new-0.000
 big-0.000
 old-0.000
 view-0.000
Token healthy
Token ID5448
Feature activation-2.27e+00

Pos logit contributions
 sight+0.030
 new+0.025
 world+0.023
 view+0.023
 future+0.022
Neg logit contributions
 could-0.077
 wouldn-0.073
 fought-0.070
 argued-0.070
 shared-0.068
Token."
Token ID526
Feature activation-2.32e+00

Pos logit contributions
 argued+0.043
 rode+0.043
 fought+0.042
 wouldn+0.042
ked+0.042
Neg logit contributions
 new-0.047
 sight-0.045
 old-0.043
 big-0.042
 world-0.042
Token So
Token ID1406
Feature activation-2.51e+00

Pos logit contributions
 wouldn+0.043
ched+0.042
 could+0.042
 fought+0.042
 managed+0.041
Neg logit contributions
 sight-0.049
 new-0.048
 old-0.043
 view-0.043
 big-0.042
TokenWhen
Token ID2215
Feature activation-1.67e+00

Pos logit contributions
 new+0.025
 sight+0.024
 big+0.022
 whole+0.022
 world+0.022
Neg logit contributions
 wouldn-0.026
 argued-0.026
 could-0.026
 fought-0.025
ked-0.024
Token they
Token ID484
Feature activation-2.35e+00

Pos logit contributions
 could+0.020
 argued+0.019
 fought+0.019
 wouldn+0.019
ched+0.018
Neg logit contributions
 sight-0.046
 new-0.046
 big-0.042
 view-0.042
 old-0.041
Token got
Token ID1392
Feature activation+2.90e-01

Pos logit contributions
es+0.031
 wouldn+0.029
 fought+0.029
ked+0.027
 argued+0.027
Neg logit contributions
 new-0.060
 sight-0.059
 big-0.058
 whole-0.056
 large-0.054
Token to
Token ID284
Feature activation+1.53e+00

Pos logit contributions
 sight+0.004
 new+0.004
 big+0.003
 old+0.003
 view+0.003
Neg logit contributions
 could-0.008
 wouldn-0.007
 fought-0.007
 argued-0.007
 rode-0.007
Token the
Token ID262
Feature activation+9.61e+00

Pos logit contributions
 sight+0.035
 new+0.034
 world+0.031
 biggest+0.031
 old+0.031
Neg logit contributions
 could-0.025
 wouldn-0.024
 argued-0.023
 fought-0.023
 rode-0.023
Token house
Token ID2156
Feature activation-3.10e+00

Pos logit contributions
 sight+0.182
err+0.162
 new+0.158
 view+0.154
igsaw+0.154
Neg logit contributions
 could-0.189
 wouldn-0.178
 snatched-0.169
 stole-0.161
 shared-0.161
Token,
Token ID11
Feature activation-2.49e+00

Pos logit contributions
 argued+0.057
 fought+0.057
ched+0.057
 could+0.056
 wouldn+0.056
Neg logit contributions
 new-0.066
 sight-0.063
 big-0.058
 old-0.058
 world-0.056
Token Bobby
Token ID17155
Feature activation-4.40e+00

Pos logit contributions
 could+0.038
 wouldn+0.036
 argued+0.035
 fought+0.035
 rode+0.033
Neg logit contributions
 sight-0.061
 new-0.061
 big-0.055
 old-0.055
 view-0.055
Token opened
Token ID4721
Feature activation+9.52e-01

Pos logit contributions
icked+0.067
uable+0.066
ked+0.064
ched+0.063
 fought+0.060
Neg logit contributions
 new-0.098
 big-0.098
 little-0.095
 elderly-0.095
 old-0.095
Token the
Token ID262
Feature activation+1.01e+01

Pos logit contributions
 sight+0.021
 new+0.021
 big+0.019
 old+0.019
 view+0.019
Neg logit contributions
 could-0.017
 wouldn-0.016
 fought-0.016
 argued-0.016
 rode-0.015
Token jar
Token ID17379
Feature activation-2.28e+00

Pos logit contributions
 sight+0.165
 sunny+0.144
 future+0.144
 new+0.143
err+0.139
Neg logit contributions
 could-0.228
 wouldn-0.213
 snatched-0.204
 stole-0.191
 managed-0.190
Token with
Token ID351
Feature activation+2.07e+00

Pos logit contributions
 could+0.002
 wouldn+0.002
 argued+0.002
 fought+0.002
es+0.002
Neg logit contributions
 sight-0.003
 new-0.003
 future-0.002
 elderly-0.002
 world-0.002
Token stars
Token ID5788
Feature activation-2.20e+00

Pos logit contributions
 sight+0.031
 new+0.029
 future+0.028
 elderly+0.027
 world+0.027
Neg logit contributions
 could-0.049
 wouldn-0.046
 argued-0.045
 fought-0.045
 rode-0.044
Token and
Token ID290
Feature activation-1.05e+00

Pos logit contributions
 argued+0.042
 fought+0.042
 could+0.041
 wouldn+0.041
 rode+0.040
Neg logit contributions
 new-0.045
 sight-0.045
 big-0.042
 sun-0.040
 view-0.040
Token moons
Token ID31093
Feature activation-2.79e+00

Pos logit contributions
 could+0.025
 wouldn+0.024
 fought+0.023
 argued+0.023
 rode+0.023
Neg logit contributions
 sight-0.017
 new-0.017
 world-0.014
 old-0.014
 view-0.014
Token.
Token ID13
Feature activation-2.92e+00

Pos logit contributions
 argued+0.056
 wouldn+0.056
 fought+0.055
 could+0.054
 snatched+0.052
Neg logit contributions
 new-0.055
 sight-0.054
 big-0.051
 sun-0.050
 view-0.049
Token Lily
Token ID20037
Feature activation-4.90e+00

Pos logit contributions
 wouldn+0.054
 argued+0.053
 could+0.053
 fought+0.053
 managed+0.050
Neg logit contributions
 new-0.062
 sight-0.061
 old-0.055
 big-0.055
 sun-0.055
Token and
Token ID290
Feature activation-1.61e-01

Pos logit contributions
 argued+0.108
ked+0.108
ged+0.106
uable+0.105
 wouldn+0.104
Neg logit contributions
 new-0.076
 big-0.073
 old-0.072
 elderly-0.071
 sight-0.068
Token Mr
Token ID1770
Feature activation+5.84e-01

Pos logit contributions
 could+0.003
 managed+0.003
ched+0.003
 wouldn+0.003
 fought+0.003
Neg logit contributions
 new-0.003
 sight-0.003
 old-0.003
 elderly-0.003
 big-0.003
Token.
Token ID13
Feature activation+2.62e+00

Pos logit contributions
 new+0.011
 sight+0.010
 old+0.009
 big+0.009
 world+0.009
Neg logit contributions
 could-0.012
 rode-0.012
 wouldn-0.012
 argued-0.012
 managed-0.012
Token Sn
Token ID5489
Feature activation+9.51e-01

Pos logit contributions
 sight+0.061
 new+0.060
 world+0.054
 big+0.054
 elderly+0.054
Neg logit contributions
 could-0.044
 wouldn-0.043
 argued-0.042
 fought-0.041
 managed-0.041
Tokenugg
Token ID6837
Feature activation+1.73e+00

Pos logit contributions
 sight+0.019
 new+0.019
 world+0.017
 elderly+0.017
 old+0.017
Neg logit contributions
 could-0.018
 wouldn-0.018
 argued-0.017
 fought-0.017
 managed-0.017