TOP ACTIVATIONS
MAX = 0.108

help and they cleaned the television together . After they finished
It 's something we put flowers in to make the room
, can you fill the jug with water from the river
! Let 's roll the pumpkin down the hill !"
the loop and put the trains away ?" she asks .
" Watch ! Put the hay in the middle there and
by size and put the toys away in a box .
your drink and put your blocks away . It 's time
. They quickly put the carpet in their living room and
idea and they put the flag in a jar .
" Time to load the truck , Pete !" So Pete
and together they carried the squash to Benny 's bur row
helped the boy grind the wheat . When they finished ,
, and they put the turkey in their cart .
elephant decided to take the brick home . On
, and we put our toys away when we 're done
, the family loaded the yacht full of supplies and made
and Grand ma put the pie on the table and waited
the cage and put the flowers inside . The bird was
, can we put my flag in a jar so it

MIN ACTIVATIONS
MIN = -0.137

disappear ?" His mother smiled knowingly and said , " Well
sick ?" Joe said :" Don 't worry .
you forgive me ?" Tom said , " I forgive you
understand ?" Ben said , " Yes , sir
lost or broken ?" Anna said , looking at her black
the birds again ," mom said , smiling and kissing Anna
you forgive me ?" Sara said . Ben looked
and hurt ourselves ?" Ben said . He was scared and
mad at me ?" Lily said . " Really
not see it ." Lily said . She was
?" L ily said , " Thank you ,
we are sorry ," Lily said . " We wanted to
owner sees us ?" Ben said , " Don 't be
for it someday ." Anna said , putting the necklace back
know any jokes ?" Max said , " Yes , I
for now ?" her mom said , trying to cheer her
, it hurts !" Lily said . Tom felt
healthy and happy ," Tommy said . " I
you forgive me ?" Tom said . L ily
my duck back ," Lily said , trying to grab the

INTERVAL 0.047 - 0.108
CONTAINS 4.506%

asked him to clean the bathroom . Tim my didn 't
She loved to walk in the park with her mom my
decided to close the palace and go away . All the
rolled under the couch where it was even hotter .
he saw something shiny on the ground . He picked it

INTERVAL -0.014 - 0.047
CONTAINS 64.262%

looked sadly at the lock . There was no way to
little girl and the dove became great friends . They spent
she ate , and when she went to bed . She
sat down and Tom helped Tim read . Soon , Tim
to go away , and he couldn 't come back .

INTERVAL -0.076 - -0.014
CONTAINS 29.738%

L ily had an idea . She grabbed her t
pray again . But this time she prayed differently . She
to jog with her dad every morning . They would put
good girl who did not like to take things that did
the flowers ?" The butterfly moved its wings slowly

INTERVAL -0.137 - -0.076
CONTAINS 1.494%

go first , but Sue said , " No , I
job , Tom !" Mom says . She cl aps her
" Ok ," said her mom my . And
, a duck !" Lily says , pointing to the pond
. Her mom heard her cry and came running
Token help
Token ID1037
Feature activation-2.84e-02

Pos logit contributions
ubby+0.046
ited+0.044
 enough+0.043
um+0.042
vious+0.041
Neg logit contributions
tr-0.040
 ramp-0.039
 globe-0.039
 tracks-0.038
 float-0.038
Token and
Token ID290
Feature activation+3.62e-02

Pos logit contributions
ubby+0.043
 enough+0.042
ited+0.040
um+0.039
vious+0.038
Neg logit contributions
tr-0.048
 ramp-0.047
 globe-0.047
 float-0.047
 tracks-0.045
Token they
Token ID484
Feature activation+3.20e-02

Pos logit contributions
 float+0.050
tr+0.049
 ramp+0.049
 tracks+0.049
 globe+0.046
Neg logit contributions
ubby-0.067
ited-0.066
 enough-0.062
vious-0.061
um-0.059
Token cleaned
Token ID20750
Feature activation+3.62e-03

Pos logit contributions
 ramp+0.056
 globe+0.054
tr+0.054
 tracks+0.053
 float+0.053
Neg logit contributions
ited-0.047
 enough-0.045
 disagreed-0.045
ubby-0.044
vious-0.041
Token the
Token ID262
Feature activation+4.07e-02

Pos logit contributions
tr+0.006
 hunt+0.006
 ramp+0.006
 float+0.006
 globe+0.006
Neg logit contributions
 enough-0.006
ubby-0.005
ited-0.005
ised-0.005
 disagreed-0.005
Token television
Token ID5581
Feature activation+1.08e-01

Pos logit contributions
tr+0.054
 ramp+0.052
 float+0.052
 glide+0.050
 tracks+0.049
Neg logit contributions
ubby-0.076
ited-0.072
 enough-0.070
vious-0.070
um-0.069
Token together
Token ID1978
Feature activation+2.68e-02

Pos logit contributions
Ro+0.190
 Zig+0.182
 glide+0.177
 Fog+0.175
 ramp+0.171
Neg logit contributions
 enough-0.157
 disagreed-0.138
ised-0.135
um-0.133
arre-0.130
Token.
Token ID13
Feature activation+3.50e-02

Pos logit contributions
 ramp+0.042
tr+0.042
 tracks+0.040
 globe+0.040
 spaceship+0.039
Neg logit contributions
 enough-0.045
 disagreed-0.041
ited-0.040
ously-0.039
ubby-0.039
Token After
Token ID2293
Feature activation+3.68e-03

Pos logit contributions
 ramp+0.052
tr+0.052
 float+0.052
 globe+0.051
 tracks+0.050
Neg logit contributions
ubby-0.057
ited-0.057
 enough-0.055
um-0.054
 disagreed-0.053
Token they
Token ID484
Feature activation+2.34e-02

Pos logit contributions
 float+0.005
tr+0.005
 ramp+0.005
 tracks+0.005
 globe+0.005
Neg logit contributions
ubby-0.007
ited-0.007
 enough-0.006
vious-0.006
 disagreed-0.006
Token finished
Token ID5201
Feature activation+6.44e-03

Pos logit contributions
 float+0.040
tr+0.040
 tracks+0.039
 ramp+0.039
 rug+0.038
Neg logit contributions
ubby-0.035
ited-0.032
 enough-0.031
joy-0.030
vious-0.030
Token It
Token ID632
Feature activation+5.39e-02

Pos logit contributions
 hunt+0.081
 ramp+0.080
 float+0.080
 tracks+0.078
nob+0.078
Neg logit contributions
ubby-0.048
um-0.045
 enough-0.043
vious-0.040
eared-0.038
Token's
Token ID338
Feature activation-3.10e-04

Pos logit contributions
 ramp+0.102
fire+0.095
 glide+0.093
 globe+0.093
 outer+0.092
Neg logit contributions
 enough-0.075
vious-0.064
 disagreed-0.063
ubby-0.059
oke-0.055
Token something
Token ID1223
Feature activation+9.47e-03

Pos logit contributions
 enough+0.000
ubby+0.000
um+0.000
 disagreed+0.000
ige+0.000
Neg logit contributions
 ramp-0.001
 globe-0.001
nob-0.001
fire-0.001
 float-0.001
Token we
Token ID356
Feature activation+4.18e-02

Pos logit contributions
 ramp+0.020
nob+0.018
 glide+0.018
tr+0.017
 globe+0.017
Neg logit contributions
 enough-0.012
 disagreed-0.010
ubby-0.010
vious-0.010
 dear-0.010
Token put
Token ID1234
Feature activation+4.51e-02

Pos logit contributions
 ramp+0.069
tr+0.069
 float+0.067
 tracks+0.067
 globe+0.066
Neg logit contributions
ubby-0.064
ited-0.063
 enough-0.061
vious-0.058
 disagreed-0.058
Token flowers
Token ID12734
Feature activation+1.03e-01

Pos logit contributions
 ramp+0.068
tr+0.066
 float+0.064
 tracks+0.061
nob+0.061
Neg logit contributions
 enough-0.080
ubby-0.076
ited-0.070
 disagreed-0.069
um-0.068
Token in
Token ID287
Feature activation+6.86e-02

Pos logit contributions
 ramp+0.150
tr+0.146
 tracks+0.142
nob+0.139
 float+0.139
Neg logit contributions
 enough-0.182
ubby-0.179
ited-0.168
vious-0.162
um-0.160
Token to
Token ID284
Feature activation+5.66e-02

Pos logit contributions
tr+0.108
 ramp+0.106
 float+0.104
 tracks+0.102
 globe+0.100
Neg logit contributions
ubby-0.112
 enough-0.109
ited-0.107
 disagreed-0.099
vious-0.099
Token make
Token ID787
Feature activation+9.19e-03

Pos logit contributions
tr+0.086
 ramp+0.083
 globe+0.080
nob+0.076
 spaceship+0.074
Neg logit contributions
 enough-0.092
ubby-0.091
ited-0.088
vious-0.082
um-0.082
Token the
Token ID262
Feature activation+2.40e-02

Pos logit contributions
 ramp+0.016
tr+0.016
 globe+0.015
 float+0.015
nob+0.014
Neg logit contributions
 enough-0.014
ubby-0.013
um-0.013
vious-0.012
 disagreed-0.012
Token room
Token ID2119
Feature activation+4.84e-02

Pos logit contributions
tr+0.027
nob+0.027
 ramp+0.027
 globe+0.027
 float+0.027
Neg logit contributions
ubby-0.047
 enough-0.047
ited-0.045
vious-0.044
 End-0.043
Token,
Token ID11
Feature activation+5.03e-04

Pos logit contributions
ited+0.039
ubby+0.036
vious+0.034
 enough+0.032
ously+0.031
Neg logit contributions
tr-0.054
 tracks-0.053
 globe-0.052
 ramp-0.052
 float-0.052
Token can
Token ID460
Feature activation-1.81e-02

Pos logit contributions
tr+0.001
 ramp+0.001
 float+0.001
 tracks+0.001
 globe+0.001
Neg logit contributions
ubby-0.001
ited-0.001
 enough-0.001
vious-0.001
 disagreed-0.001
Token you
Token ID345
Feature activation+3.09e-02

Pos logit contributions
ubby+0.028
ited+0.026
um+0.026
 enough+0.026
 disagreed+0.026
Neg logit contributions
 ramp-0.030
tr-0.030
 globe-0.029
 tracks-0.028
 float-0.027
Token fill
Token ID6070
Feature activation+3.14e-02

Pos logit contributions
tr+0.046
 ramp+0.046
 float+0.045
 tracks+0.045
 globe+0.044
Neg logit contributions
ubby-0.052
ited-0.051
 enough-0.050
vious-0.048
 disagreed-0.047
Token the
Token ID262
Feature activation+4.95e-02

Pos logit contributions
tr+0.061
 ramp+0.059
 float+0.056
 globe+0.055
 tracks+0.055
Neg logit contributions
 enough-0.042
ubby-0.041
um-0.038
ited-0.037
illy-0.036
Token jug
Token ID45808
Feature activation+1.02e-01

Pos logit contributions
 ramp+0.065
tr+0.065
 float+0.064
 tracks+0.063
 hunt+0.061
Neg logit contributions
ubby-0.092
ited-0.088
 enough-0.087
um-0.084
vious-0.084
Token with
Token ID351
Feature activation+4.41e-02

Pos logit contributions
 tracks+0.138
tr+0.138
 ramp+0.137
 float+0.131
 globe+0.131
Neg logit contributions
ubby-0.181
ited-0.178
 enough-0.178
um-0.172
vious-0.168
Token water
Token ID1660
Feature activation+5.96e-02

Pos logit contributions
tr+0.050
 ramp+0.049
 float+0.048
 tracks+0.047
 globe+0.046
Neg logit contributions
ubby-0.091
ited-0.089
 enough-0.088
vious-0.084
 disagreed-0.084
Token from
Token ID422
Feature activation+2.26e-02

Pos logit contributions
tr+0.096
 ramp+0.096
 float+0.095
 tracks+0.094
 globe+0.092
Neg logit contributions
ubby-0.093
ited-0.092
 enough-0.088
vious-0.084
 disagreed-0.083
Token the
Token ID262
Feature activation+4.02e-02

Pos logit contributions
tr+0.039
 ramp+0.039
 float+0.037
 globe+0.037
 tracks+0.036
Neg logit contributions
ubby-0.034
 enough-0.033
ited-0.032
vious-0.029
 disagreed-0.029
Token river
Token ID7850
Feature activation+6.29e-02

Pos logit contributions
nob+0.055
tr+0.051
 ramp+0.051
 glide+0.049
 Zig+0.049
Neg logit contributions
ubby-0.073
 enough-0.071
um-0.067
 disagreed-0.065
vious-0.065
Token!
Token ID0
Feature activation+3.67e-02

Pos logit contributions
ubby+0.061
ited+0.060
 enough+0.059
vious+0.056
 disagreed+0.056
Neg logit contributions
tr-0.046
 ramp-0.046
 float-0.045
 tracks-0.044
 globe-0.043
Token Let
Token ID3914
Feature activation-1.92e-02

Pos logit contributions
 float+0.058
 hunt+0.057
 ramp+0.056
 globe+0.056
 tracks+0.055
Neg logit contributions
ubby-0.057
 enough-0.055
vious-0.054
oke-0.052
um-0.052
Token's
Token ID338
Feature activation+2.32e-02

Pos logit contributions
ubby+0.044
ited+0.043
 enough+0.041
vious+0.040
ously+0.040
Neg logit contributions
tr-0.018
 float-0.018
 tracks-0.018
 ramp-0.017
 hunt-0.016
Token roll
Token ID4836
Feature activation+2.03e-02

Pos logit contributions
tr+0.030
 ramp+0.029
 globe+0.029
 tracks+0.028
 float+0.027
Neg logit contributions
ubby-0.043
 enough-0.043
ited-0.042
 disagreed-0.040
um-0.040
Token the
Token ID262
Feature activation+4.77e-02

Pos logit contributions
 float+0.025
 ramp+0.025
 tracks+0.025
tr+0.024
 globe+0.024
Neg logit contributions
ited-0.040
ubby-0.040
ously-0.037
 enough-0.037
 disagreed-0.036
Token pumpkin
Token ID30089
Feature activation+1.02e-01

Pos logit contributions
 globe+0.058
 float+0.058
 tracks+0.055
 ramp+0.054
tr+0.054
Neg logit contributions
ited-0.095
ubby-0.093
iley-0.090
ously-0.090
 enough-0.089
Token down
Token ID866
Feature activation+5.11e-02

Pos logit contributions
tr+0.140
 globe+0.135
 ramp+0.134
 tracks+0.134
nob+0.133
Neg logit contributions
 enough-0.182
ubby-0.172
vious-0.163
ited-0.162
um-0.160
Token the
Token ID262
Feature activation+4.81e-02

Pos logit contributions
 float+0.079
 ramp+0.079
tr+0.078
 tracks+0.078
 globe+0.076
Neg logit contributions
ubby-0.083
ited-0.083
 enough-0.077
vious-0.075
 disagreed-0.075
Token hill
Token ID12788
Feature activation+5.54e-02

Pos logit contributions
tr+0.040
 float+0.035
 globe+0.035
nob+0.034
 ramp+0.034
Neg logit contributions
ubby-0.113
ited-0.111
 enough-0.110
vious-0.106
 disagreed-0.105
Token!"
Token ID2474
Feature activation+3.99e-03

Pos logit contributions
tr+0.084
 ramp+0.084
 float+0.084
 tracks+0.082
 globe+0.081
Neg logit contributions
ubby-0.092
ited-0.090
 enough-0.087
vious-0.083
 disagreed-0.082
Token
Token ID220
Feature activation-3.95e-02

Pos logit contributions
 ramp+0.007
tr+0.007
 spaceship+0.006
fire+0.006
 float+0.006
Neg logit contributions
ously-0.006
ited-0.006
 enough-0.005
 disagreed-0.005
um-0.005
Token the
Token ID262
Feature activation+3.36e-02

Pos logit contributions
ited+0.005
ubby+0.004
 disagreed+0.004
vious+0.004
ously+0.004
Neg logit contributions
 float-0.003
 tracks-0.003
 ramp-0.003
 globe-0.003
 rug-0.003
Token loop
Token ID9052
Feature activation+4.75e-02

Pos logit contributions
fire+0.043
 float+0.043
nob+0.042
fields+0.042
tr+0.042
Neg logit contributions
ubby-0.059
 enough-0.057
um-0.054
vious-0.052
 End-0.050
Token and
Token ID290
Feature activation+7.12e-02

Pos logit contributions
tr+0.074
 ramp+0.074
 float+0.074
 tracks+0.072
 globe+0.071
Neg logit contributions
ubby-0.077
ited-0.076
 enough-0.072
vious-0.069
 disagreed-0.069
Token put
Token ID1234
Feature activation+3.01e-02

Pos logit contributions
 globe+0.104
nob+0.102
tr+0.100
 ramp+0.097
cel+0.093
Neg logit contributions
 enough-0.114
ited-0.100
 disagreed-0.099
ubby-0.096
 worry-0.095
Token the
Token ID262
Feature activation+2.79e-02

Pos logit contributions
tr+0.066
fire+0.062
nob+0.061
 hunt+0.061
 ramp+0.059
Neg logit contributions
 enough-0.040
ubby-0.031
um-0.025
 End-0.024
vious-0.024
Token trains
Token ID13404
Feature activation+1.02e-01

Pos logit contributions
nob+0.026
tr+0.026
fire+0.025
fields+0.024
 hunt+0.024
Neg logit contributions
ubby-0.061
 enough-0.059
vious-0.056
 End-0.054
ited-0.053
Token away
Token ID1497
Feature activation+2.96e-02

Pos logit contributions
tr+0.118
 ramp+0.117
 float+0.117
 tracks+0.114
 globe+0.112
Neg logit contributions
ubby-0.205
ited-0.202
 enough-0.195
vious-0.189
 disagreed-0.187
Token?"
Token ID1701
Feature activation-7.77e-03

Pos logit contributions
 ramp+0.053
tr+0.052
 tracks+0.052
 float+0.051
 hunt+0.051
Neg logit contributions
ubby-0.040
 enough-0.039
ited-0.036
um-0.035
vious-0.034
Token she
Token ID673
Feature activation-3.40e-02

Pos logit contributions
ubby+0.014
ited+0.014
 enough+0.013
vious+0.013
ously+0.012
Neg logit contributions
 float-0.011
 tracks-0.010
tr-0.010
 ramp-0.010
 cozy-0.010
Token asks
Token ID7893
Feature activation-9.59e-02

Pos logit contributions
ubby+0.043
ited+0.042
 enough+0.039
vious+0.037
um+0.037
Neg logit contributions
 float-0.065
 ramp-0.065
tr-0.064
 tracks-0.064
 hunt-0.062
Token.
Token ID13
Feature activation+6.26e-03

Pos logit contributions
ubby+0.188
 enough+0.183
ited+0.182
um+0.173
 disagreed+0.172
Neg logit contributions
 ramp-0.120
tr-0.119
 float-0.115
 tracks-0.114
 globe-0.113
Token "
Token ID366
Feature activation+2.55e-02

Pos logit contributions
ubby+0.021
ited+0.021
 enough+0.020
vious+0.019
 disagreed+0.019
Neg logit contributions
tr-0.022
 ramp-0.021
 float-0.021
 tracks-0.021
 globe-0.021
TokenWatch
Token ID10723
Feature activation-3.67e-02

Pos logit contributions
tr+0.039
 float+0.039
 ramp+0.039
 tracks+0.038
 globe+0.038
Neg logit contributions
ubby-0.042
ited-0.041
 enough-0.040
vious-0.038
 disagreed-0.038
Token!
Token ID0
Feature activation+3.33e-02

Pos logit contributions
 enough+0.062
ubby+0.062
ited+0.057
um+0.057
illy+0.055
Neg logit contributions
tr-0.056
 ramp-0.054
 float-0.054
 globe-0.054
 tracks-0.051
Token Put
Token ID5930
Feature activation+9.20e-03

Pos logit contributions
 ramp+0.049
 float+0.048
tr+0.048
 tracks+0.047
 globe+0.047
Neg logit contributions
ubby-0.057
 enough-0.054
ited-0.053
vious-0.052
um-0.050
Token the
Token ID262
Feature activation+2.03e-02

Pos logit contributions
tr+0.013
 ramp+0.013
 float+0.013
 tracks+0.013
 globe+0.013
Neg logit contributions
ubby-0.016
ited-0.016
 enough-0.015
vious-0.015
 disagreed-0.015
Token hay
Token ID27678
Feature activation+1.01e-01

Pos logit contributions
tr+0.020
 ramp+0.020
 float+0.020
 tracks+0.019
 globe+0.019
Neg logit contributions
ubby-0.045
ited-0.044
 enough-0.043
vious-0.042
 disagreed-0.041
Token in
Token ID287
Feature activation+6.10e-02

Pos logit contributions
 ramp+0.132
tr+0.131
 tracks+0.130
 float+0.128
 globe+0.123
Neg logit contributions
ubby-0.187
 enough-0.183
ited-0.178
um-0.170
vious-0.170
Token the
Token ID262
Feature activation+6.05e-02

Pos logit contributions
 float+0.093
 ramp+0.093
tr+0.093
 tracks+0.091
 globe+0.089
Neg logit contributions
ubby-0.100
ited-0.100
 enough-0.095
vious-0.092
 disagreed-0.091
Token middle
Token ID3504
Feature activation+5.15e-02

Pos logit contributions
 float+0.067
 globe+0.066
tr+0.066
 ramp+0.065
nob+0.063
Neg logit contributions
 enough-0.125
ubby-0.123
ited-0.120
um-0.115
vious-0.113
Token there
Token ID612
Feature activation+3.85e-02

Pos logit contributions
tr+0.114
 ramp+0.113
 float+0.113
 tracks+0.112
 globe+0.110
Neg logit contributions
ubby-0.050
ited-0.049
 enough-0.045
vious-0.042
 disagreed-0.042
Token and
Token ID290
Feature activation+6.10e-02

Pos logit contributions
tr+0.051
 ramp+0.051
 float+0.050
 tracks+0.049
 globe+0.048
Neg logit contributions
ubby-0.071
ited-0.070
 enough-0.068
vious-0.065
 disagreed-0.065
Token by
Token ID416
Feature activation+2.96e-02

Pos logit contributions
 float+0.124
tr+0.124
 ramp+0.123
 tracks+0.121
 globe+0.119
Neg logit contributions
ubby-0.128
ited-0.126
 enough-0.119
vious-0.116
 disagreed-0.114
Token size
Token ID2546
Feature activation-6.22e-03

Pos logit contributions
 float+0.042
 ramp+0.042
tr+0.042
 tracks+0.041
 globe+0.040
Neg logit contributions
ited-0.052
ubby-0.052
 enough-0.050
vious-0.048
 disagreed-0.047
Token and
Token ID290
Feature activation+6.89e-02

Pos logit contributions
ited+0.011
ubby+0.011
 disagreed+0.010
vious+0.010
oph+0.010
Neg logit contributions
 tracks-0.009
 float-0.009
 ramp-0.009
tr-0.008
 globe-0.008
Token put
Token ID1234
Feature activation+3.40e-02

Pos logit contributions
tr+0.089
 ramp+0.088
 float+0.087
 tracks+0.085
 globe+0.085
Neg logit contributions
ubby-0.129
ited-0.127
 enough-0.125
vious-0.119
 disagreed-0.119
Token the
Token ID262
Feature activation+3.14e-02

Pos logit contributions
tr+0.072
 ramp+0.069
 float+0.066
 hunt+0.066
 globe+0.066
Neg logit contributions
 enough-0.040
ubby-0.038
ited-0.034
vious-0.032
um-0.032
Token toys
Token ID14958
Feature activation+1.01e-01

Pos logit contributions
tr+0.045
 ramp+0.045
 float+0.044
 globe+0.043
 tracks+0.043
Neg logit contributions
ubby-0.055
ited-0.054
 enough-0.053
vious-0.050
 disagreed-0.049
Token away
Token ID1497
Feature activation+3.50e-02

Pos logit contributions
tr+0.144
 ramp+0.141
 tracks+0.138
 float+0.132
 globe+0.132
Neg logit contributions
 enough-0.179
ubby-0.176
ited-0.172
vious-0.162
 disagreed-0.161
Token in
Token ID287
Feature activation+5.48e-02

Pos logit contributions
 ramp+0.051
tr+0.051
 float+0.048
 tracks+0.047
 globe+0.047
Neg logit contributions
 enough-0.062
ubby-0.059
ited-0.056
 disagreed-0.054
vious-0.054
Token a
Token ID257
Feature activation+6.37e-02

Pos logit contributions
tr+0.082
 ramp+0.082
 float+0.082
 tracks+0.080
 globe+0.079
Neg logit contributions
ubby-0.092
ited-0.091
 enough-0.087
vious-0.084
 disagreed-0.083
Token box
Token ID3091
Feature activation+5.22e-02

Pos logit contributions
tr+0.094
 float+0.090
 glide+0.089
 Zig+0.089
 globe+0.087
Neg logit contributions
ited-0.106
 enough-0.105
ubby-0.105
joy-0.093
 disagreed-0.093
Token.
Token ID13
Feature activation+2.92e-02

Pos logit contributions
tr+0.064
 ramp+0.064
 float+0.063
 tracks+0.062
 globe+0.061
Neg logit contributions
ubby-0.102
ited-0.101
 enough-0.097
vious-0.094
 disagreed-0.093
Token your
Token ID534
Feature activation+9.13e-03

Pos logit contributions
ubby+0.011
 enough+0.011
ited+0.010
um+0.009
illy+0.009
Neg logit contributions
tr-0.013
 ramp-0.013
 globe-0.012
 float-0.012
 tracks-0.012
Token drink
Token ID4144
Feature activation+3.90e-02

Pos logit contributions
tr+0.014
 globe+0.013
nob+0.013
 ramp+0.013
fire+0.013
Neg logit contributions
ubby-0.015
 enough-0.015
ited-0.014
 efforts-0.014
 End-0.013
Token and
Token ID290
Feature activation+6.55e-02

Pos logit contributions
tr+0.066
 float+0.066
 ramp+0.066
 tracks+0.065
 globe+0.064
Neg logit contributions
ubby-0.058
ited-0.057
 enough-0.053
vious-0.052
 disagreed-0.051
Token put
Token ID1234
Feature activation+2.66e-02

Pos logit contributions
 ramp+0.091
tr+0.090
 globe+0.087
 tracks+0.084
nob+0.084
Neg logit contributions
 enough-0.116
ited-0.113
ubby-0.112
 disagreed-0.107
vious-0.104
Token your
Token ID534
Feature activation+1.62e-02

Pos logit contributions
tr+0.048
 ramp+0.047
 float+0.044
 tracks+0.044
 hunt+0.044
Neg logit contributions
 enough-0.040
ubby-0.039
ited-0.036
um-0.035
vious-0.034
Token blocks
Token ID7021
Feature activation+1.01e-01

Pos logit contributions
 ramp+0.025
tr+0.025
 globe+0.023
 tracks+0.023
 float+0.023
Neg logit contributions
ubby-0.028
ited-0.026
 enough-0.026
 disagreed-0.024
vious-0.024
Token away
Token ID1497
Feature activation+2.82e-02

Pos logit contributions
 ramp+0.132
tr+0.132
 tracks+0.131
 float+0.129
 globe+0.126
Neg logit contributions
ubby-0.187
ited-0.184
 enough-0.183
vious-0.172
um-0.170
Token.
Token ID13
Feature activation+4.72e-02

Pos logit contributions
tr+0.040
 ramp+0.040
 float+0.040
 tracks+0.039
 globe+0.038
Neg logit contributions
ubby-0.050
ited-0.049
 enough-0.047
vious-0.045
 disagreed-0.045
Token It
Token ID632
Feature activation+5.05e-02

Pos logit contributions
 ramp+0.077
 float+0.077
 globe+0.076
 hunt+0.075
 tracks+0.074
Neg logit contributions
ubby-0.073
 enough-0.069
vious-0.065
um-0.064
ited-0.064
Token's
Token ID338
Feature activation+8.90e-03

Pos logit contributions
 ramp+0.087
 globe+0.086
 tracks+0.086
 outer+0.086
 track+0.084
Neg logit contributions
 enough-0.073
vious-0.066
 disagreed-0.065
ited-0.065
ubby-0.061
Token time
Token ID640
Feature activation+1.07e-02

Pos logit contributions
 ramp+0.014
tr+0.014
 globe+0.014
 float+0.013
 tracks+0.013
Neg logit contributions
 enough-0.015
ubby-0.015
ited-0.014
 disagreed-0.013
um-0.013
Token.
Token ID13
Feature activation+4.29e-02

Pos logit contributions
ubby+0.053
ited+0.052
 enough+0.052
vious+0.049
 disagreed+0.049
Neg logit contributions
tr-0.034
 ramp-0.033
 float-0.033
 tracks-0.033
 globe-0.032
Token They
Token ID1119
Feature activation+2.77e-02

Pos logit contributions
tr+0.062
 ramp+0.061
 float+0.060
 globe+0.060
nob+0.058
Neg logit contributions
 enough-0.073
ubby-0.073
ited-0.071
vious-0.069
 disagreed-0.068
Token quickly
Token ID2952
Feature activation-8.60e-03

Pos logit contributions
tr+0.044
 float+0.044
 ramp+0.044
 tracks+0.043
 globe+0.042
Neg logit contributions
ubby-0.044
ited-0.043
 enough-0.041
vious-0.040
 disagreed-0.039
Token put
Token ID1234
Feature activation+2.64e-02

Pos logit contributions
ubby+0.015
ited+0.014
 enough+0.013
vious+0.013
um+0.013
Neg logit contributions
tr-0.013
 float-0.013
 tracks-0.013
 ramp-0.013
 hunt-0.012
Token the
Token ID262
Feature activation+1.45e-02

Pos logit contributions
tr+0.046
 ramp+0.045
 float+0.044
nob+0.043
 hunt+0.043
Neg logit contributions
 enough-0.039
ubby-0.038
ited-0.037
 disagreed-0.034
vious-0.034
Token carpet
Token ID20710
Feature activation+1.01e-01

Pos logit contributions
tr+0.013
 hunt+0.011
nob+0.010
 glide+0.010
fire+0.010
Neg logit contributions
ubby-0.035
 End-0.032
 enough-0.031
um-0.031
vious-0.031
Token in
Token ID287
Feature activation+5.60e-02

Pos logit contributions
 ramp+0.143
tr+0.139
 tracks+0.135
 float+0.133
 hunt+0.130
Neg logit contributions
 enough-0.178
ubby-0.172
ited-0.169
um-0.159
vious-0.158
Token their
Token ID511
Feature activation+4.86e-02

Pos logit contributions
tr+0.088
 ramp+0.087
 float+0.086
 tracks+0.085
 globe+0.084
Neg logit contributions
ubby-0.091
ited-0.089
 enough-0.087
vious-0.082
 disagreed-0.082
Token living
Token ID2877
Feature activation+2.35e-02

Pos logit contributions
tr+0.069
nob+0.068
 ramp+0.065
 glide+0.064
 float+0.063
Neg logit contributions
ubby-0.083
 enough-0.078
ited-0.075
 End-0.075
joy-0.074
Token room
Token ID2119
Feature activation+4.34e-02

Pos logit contributions
 ramp+0.021
tr+0.021
 float+0.021
 tracks+0.020
 globe+0.019
Neg logit contributions
ubby-0.054
ited-0.054
 enough-0.052
vious-0.051
 disagreed-0.050
Token and
Token ID290
Feature activation+5.63e-02

Pos logit contributions
tr+0.072
 float+0.072
 tracks+0.072
 ramp+0.070
 globe+0.070
Neg logit contributions
ubby-0.068
ited-0.065
vious-0.061
 enough-0.059
um-0.058
Token idea
Token ID2126
Feature activation-4.31e-02

Pos logit contributions
ubby+0.040
ited+0.039
 enough+0.035
vious+0.033
 disagreed+0.032
Neg logit contributions
tr-0.089
 ramp-0.089
 float-0.089
 tracks-0.087
 globe-0.086
Token and
Token ID290
Feature activation+3.15e-02

Pos logit contributions
ubby+0.075
ited+0.074
vious+0.065
 End+0.063
years+0.062
Neg logit contributions
tr-0.065
 float-0.064
 tracks-0.064
 hunt-0.064
 globe-0.064
Token they
Token ID484
Feature activation+2.38e-02

Pos logit contributions
 float+0.042
tr+0.041
 tracks+0.041
 ramp+0.040
 cozy+0.038
Neg logit contributions
ubby-0.061
ited-0.058
vious-0.054
 enough-0.053
joy-0.052
Token put
Token ID1234
Feature activation+2.85e-02

Pos logit contributions
tr+0.035
 float+0.034
 ramp+0.033
 tracks+0.033
 globe+0.032
Neg logit contributions
ubby-0.043
ited-0.040
 enough-0.039
vious-0.038
joy-0.038
Token the
Token ID262
Feature activation+1.58e-02

Pos logit contributions
tr+0.047
 ramp+0.046
 hunt+0.045
 float+0.044
 tracks+0.043
Neg logit contributions
 enough-0.045
ubby-0.043
ited-0.042
um-0.039
 disagreed-0.039
Token flag
Token ID6056
Feature activation+1.01e-01

Pos logit contributions
 float+0.015
 ramp+0.014
tr+0.014
 tracks+0.014
 globe+0.014
Neg logit contributions
ubby-0.036
ited-0.036
 enough-0.034
vious-0.033
 disagreed-0.033
Token in
Token ID287
Feature activation+5.21e-02

Pos logit contributions
 ramp+0.157
Ro+0.153
 hunt+0.150
 Fog+0.149
 tracks+0.148
Neg logit contributions
 enough-0.161
ubby-0.142
ited-0.139
um-0.136
ously-0.132
Token a
Token ID257
Feature activation+6.11e-02

Pos logit contributions
tr+0.080
 ramp+0.080
 float+0.080
 tracks+0.078
 globe+0.077
Neg logit contributions
ubby-0.086
ited-0.085
 enough-0.082
vious-0.079
 disagreed-0.077
Token jar
Token ID17379
Feature activation+5.18e-02

Pos logit contributions
tr+0.078
nob+0.074
 globe+0.073
 float+0.072
 ramp+0.070
Neg logit contributions
ited-0.112
 enough-0.110
ubby-0.110
 disagreed-0.100
um-0.099
Token.
Token ID13
Feature activation+3.57e-02

Pos logit contributions
tr+0.080
 ramp+0.079
 float+0.079
 tracks+0.078
 globe+0.076
Neg logit contributions
ubby-0.085
ited-0.084
 enough-0.081
vious-0.077
 disagreed-0.076
Token
Token ID198
Feature activation+4.37e-03

Pos logit contributions
 ramp+0.056
 float+0.056
tr+0.055
 globe+0.055
 tracks+0.054
Neg logit contributions
ubby-0.055
 enough-0.054
ited-0.054
um-0.052
 disagreed-0.052
Token "
Token ID366
Feature activation+1.35e-02

Pos logit contributions
ubby+0.018
 enough+0.018
ited+0.017
 disagreed+0.017
um+0.017
Neg logit contributions
 ramp-0.023
tr-0.023
 float-0.023
 tracks-0.023
 globe-0.022
TokenTime
Token ID7575
Feature activation-3.18e-03

Pos logit contributions
 hunt+0.016
 float+0.016
tr+0.016
 hyd+0.015
 globe+0.015
Neg logit contributions
ubby-0.027
 enough-0.024
atted-0.024
oke-0.023
vious-0.023
Token to
Token ID284
Feature activation+3.95e-02

Pos logit contributions
ubby+0.005
ited+0.004
 enough+0.004
vious+0.004
 disagreed+0.004
Neg logit contributions
tr-0.006
 ramp-0.005
 float-0.005
 tracks-0.005
 globe-0.005
Token load
Token ID3440
Feature activation+2.00e-02

Pos logit contributions
tr+0.051
 ramp+0.050
 globe+0.048
 tracks+0.048
 float+0.047
Neg logit contributions
ubby-0.075
ited-0.073
 enough-0.072
um-0.070
vious-0.068
Token the
Token ID262
Feature activation+3.75e-02

Pos logit contributions
 ramp+0.029
tr+0.029
 float+0.029
 tracks+0.029
 globe+0.028
Neg logit contributions
ubby-0.034
ited-0.034
 enough-0.032
vious-0.031
 disagreed-0.031
Token truck
Token ID7779
Feature activation+1.01e-01

Pos logit contributions
tr+0.049
 ramp+0.048
 float+0.048
 tracks+0.047
 globe+0.046
Neg logit contributions
ubby-0.070
ited-0.069
 enough-0.068
vious-0.065
 disagreed-0.064
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
tr+0.136
 float+0.135
 ramp+0.134
 tracks+0.134
 globe+0.131
Neg logit contributions
ubby-0.178
 enough-0.176
ited-0.173
vious-0.163
um-0.162
Token Pete
Token ID17542
Feature activation+3.97e-03

Pos logit contributions
 float+0.020
 hunt+0.018
 ramp+0.018
 planet+0.018
 cozy+0.018
Neg logit contributions
ited-0.025
ubby-0.023
vious-0.022
eared-0.021
ously-0.021
Token!"
Token ID2474
Feature activation+8.33e-03

Pos logit contributions
tr+0.005
 ramp+0.005
 float+0.005
 tracks+0.004
 globe+0.004
Neg logit contributions
ubby-0.008
ited-0.008
 enough-0.008
vious-0.007
 disagreed-0.007
Token So
Token ID1406
Feature activation-2.95e-02

Pos logit contributions
tr+0.013
 ramp+0.013
 float+0.013
 tracks+0.012
 globe+0.012
Neg logit contributions
ubby-0.014
ited-0.014
 enough-0.013
vious-0.013
 disagreed-0.013
Token Pete
Token ID17542
Feature activation-2.21e-03

Pos logit contributions
ited+0.045
ubby+0.043
vious+0.042
oke+0.039
eared+0.039
Neg logit contributions
 float-0.051
 hunt-0.047
 tracks-0.046
 globe-0.046
 planet-0.046
Token and
Token ID290
Feature activation+2.95e-02

Pos logit contributions
ubby+0.053
ited+0.052
 enough+0.049
vious+0.048
oph+0.047
Neg logit contributions
 float-0.042
 tracks-0.041
 ramp-0.041
 hunt-0.040
tr-0.040
Token together
Token ID1978
Feature activation-4.83e-03

Pos logit contributions
 float+0.041
 ramp+0.041
tr+0.041
 tracks+0.040
 hunt+0.039
Neg logit contributions
ubby-0.054
ited-0.053
 enough-0.050
vious-0.049
um-0.048
Token they
Token ID484
Feature activation+2.99e-02

Pos logit contributions
ubby+0.009
ited+0.009
 enough+0.009
 disagreed+0.008
vious+0.008
Neg logit contributions
tr-0.006
 ramp-0.006
 float-0.006
 tracks-0.006
 globe-0.006
Token carried
Token ID5281
Feature activation+3.52e-03

Pos logit contributions
tr+0.048
 ramp+0.048
 float+0.047
 tracks+0.047
 globe+0.046
Neg logit contributions
ubby-0.048
ited-0.047
 enough-0.046
vious-0.043
 disagreed-0.043
Token the
Token ID262
Feature activation+2.57e-02

Pos logit contributions
tr+0.007
 ramp+0.006
 float+0.006
 globe+0.006
 tracks+0.006
Neg logit contributions
 enough-0.005
ubby-0.005
ited-0.005
 disagreed-0.004
um-0.004
Token squash
Token ID34613
Feature activation+1.00e-01

Pos logit contributions
 float+0.028
 globe+0.028
 hunt+0.026
 carpet+0.026
 cozy+0.025
Neg logit contributions
ited-0.055
ubby-0.054
iley-0.053
 enough-0.052
years-0.051
Token to
Token ID284
Feature activation+3.62e-02

Pos logit contributions
 ramp+0.168
tr+0.166
 Fog+0.159
cel+0.157
 float+0.156
Neg logit contributions
 enough-0.158
ubby-0.134
ously-0.133
 disagreed-0.133
um-0.132
Token Benny
Token ID44275
Feature activation+2.62e-02

Pos logit contributions
tr+0.065
 ramp+0.061
 globe+0.060
 tracks+0.059
 float+0.058
Neg logit contributions
 enough-0.052
ubby-0.052
ited-0.050
 disagreed-0.047
vious-0.046
Token's
Token ID338
Feature activation+6.52e-03

Pos logit contributions
tr+0.035
 ramp+0.035
 globe+0.034
 float+0.034
 tracks+0.033
Neg logit contributions
 enough-0.048
ubby-0.047
ited-0.047
 disagreed-0.044
vious-0.043
Token bur
Token ID4356
Feature activation+1.29e-02

Pos logit contributions
tr+0.008
 ramp+0.008
 float+0.008
 tracks+0.007
 globe+0.007
Neg logit contributions
ubby-0.013
ited-0.013
 enough-0.013
vious-0.012
 disagreed-0.012
Tokenrow
Token ID808
Feature activation+3.24e-02

Pos logit contributions
tr+0.019
 tracks+0.018
 float+0.018
 ramp+0.018
nob+0.018
Neg logit contributions
ited-0.023
ubby-0.022
 enough-0.022
 disagreed-0.021
vious-0.021
Token helped
Token ID4193
Feature activation-1.39e-02

Pos logit contributions
tr+0.082
 ramp+0.082
 float+0.081
 tracks+0.080
 globe+0.079
Neg logit contributions
ited-0.072
ubby-0.071
 enough-0.069
 disagreed-0.065
vious-0.065
Token the
Token ID262
Feature activation+1.00e-02

Pos logit contributions
 enough+0.018
ited+0.018
ubby+0.018
um+0.017
 disagreed+0.017
Neg logit contributions
tr-0.026
 ramp-0.025
 globe-0.025
 tracks-0.024
 spaceship-0.024
Token boy
Token ID2933
Feature activation+4.61e-02

Pos logit contributions
 ramp+0.018
 float+0.018
 tracks+0.018
tr+0.018
 globe+0.017
Neg logit contributions
ubby-0.015
ited-0.014
 enough-0.014
vious-0.013
um-0.013
Token grind
Token ID20078
Feature activation+2.89e-02

Pos logit contributions
 ramp+0.064
 globe+0.064
tr+0.063
 spaceship+0.061
 tracks+0.060
Neg logit contributions
 enough-0.081
ited-0.080
ubby-0.079
 disagreed-0.075
um-0.075
Token the
Token ID262
Feature activation+5.50e-02

Pos logit contributions
 float+0.038
 tracks+0.036
 ramp+0.035
tr+0.034
 globe+0.034
Neg logit contributions
ubby-0.055
ited-0.054
joy-0.051
vious-0.050
 disagreed-0.050
Token wheat
Token ID17135
Feature activation+1.00e-01

Pos logit contributions
tr+0.076
 float+0.075
 tracks+0.075
 ramp+0.075
 globe+0.072
Neg logit contributions
ubby-0.100
ited-0.099
 enough-0.093
vious-0.091
 disagreed-0.089
Token.
Token ID13
Feature activation+4.46e-02

Pos logit contributions
 spaceship+0.146
 ramp+0.143
 planet+0.139
 globe+0.137
 tracks+0.135
Neg logit contributions
 enough-0.179
um-0.166
ubby-0.154
 disagreed-0.148
ited-0.147
Token When
Token ID1649
Feature activation+1.89e-02

Pos logit contributions
 ramp+0.077
tr+0.077
 float+0.077
 globe+0.076
 tracks+0.076
Neg logit contributions
ubby-0.060
 enough-0.060
um-0.058
ited-0.056
vious-0.054
Token they
Token ID484
Feature activation+2.79e-02

Pos logit contributions
tr+0.033
 ramp+0.030
nob+0.030
 tracks+0.030
 float+0.029
Neg logit contributions
 enough-0.031
ited-0.027
ubby-0.026
um-0.026
 disagreed-0.026
Token finished
Token ID5201
Feature activation+1.07e-02

Pos logit contributions
 float+0.044
tr+0.041
 hunt+0.040
 globe+0.039
 ramp+0.039
Neg logit contributions
ubby-0.048
joy-0.043
ited-0.042
iley-0.042
heart-0.042
Token,
Token ID11
Feature activation+1.95e-02

Pos logit contributions
 float+0.020
 ramp+0.019
tr+0.019
 tracks+0.019
 globe+0.019
Neg logit contributions
ited-0.015
ubby-0.015
vious-0.014
 enough-0.013
 disagreed-0.013
Token,
Token ID11
Feature activation+6.89e-03

Pos logit contributions
ubby+0.107
ited+0.101
years+0.100
vious+0.096
joy+0.095
Neg logit contributions
 hunt-0.065
 tracks-0.060
tr-0.058
 track-0.057
 float-0.055
Token and
Token ID290
Feature activation+2.50e-02

Pos logit contributions
 float+0.009
 tracks+0.009
 ramp+0.009
 cozy+0.009
 hunt+0.009
Neg logit contributions
ubby-0.013
ited-0.013
um-0.012
ared-0.011
vious-0.011
Token they
Token ID484
Feature activation+2.96e-02

Pos logit contributions
 float+0.027
tr+0.027
 ramp+0.026
 tracks+0.026
 globe+0.025
Neg logit contributions
ubby-0.054
ited-0.052
vious-0.049
 enough-0.049
um-0.048
Token put
Token ID1234
Feature activation+3.36e-02

Pos logit contributions
tr+0.040
 float+0.040
 ramp+0.038
 globe+0.038
 tracks+0.038
Neg logit contributions
ubby-0.055
ited-0.053
vious-0.051
 enough-0.050
joy-0.049
Token the
Token ID262
Feature activation+2.06e-02

Pos logit contributions
tr+0.056
 ramp+0.056
 float+0.054
 tracks+0.052
nob+0.052
Neg logit contributions
 enough-0.053
ubby-0.051
ited-0.050
um-0.047
 disagreed-0.047
Token turkey
Token ID26876
Feature activation+9.99e-02

Pos logit contributions
 globe+0.028
 float+0.027
 hunt+0.026
 tracks+0.026
tr+0.026
Neg logit contributions
ited-0.038
ubby-0.037
ously-0.036
vious-0.036
 enough-0.036
Token in
Token ID287
Feature activation+5.98e-02

Pos logit contributions
 ramp+0.160
tr+0.149
 tracks+0.145
 float+0.145
Ro+0.144
Neg logit contributions
 enough-0.171
ubby-0.153
ited-0.149
 disagreed-0.145
ously-0.138
Token their
Token ID511
Feature activation+5.88e-02

Pos logit contributions
tr+0.091
 ramp+0.091
 float+0.091
 tracks+0.090
 globe+0.088
Neg logit contributions
ubby-0.100
ited-0.098
 enough-0.094
vious-0.091
 disagreed-0.089
Token cart
Token ID6383
Feature activation+7.07e-02

Pos logit contributions
tr+0.088
 ramp+0.087
 float+0.086
 globe+0.084
nob+0.083
Neg logit contributions
ubby-0.098
 enough-0.097
ited-0.096
 disagreed-0.088
um-0.088
Token.
Token ID13
Feature activation+3.79e-02

Pos logit contributions
tr+0.089
 ramp+0.088
 float+0.088
 tracks+0.086
 globe+0.084
Neg logit contributions
ubby-0.135
ited-0.134
 enough-0.130
vious-0.125
 disagreed-0.124
Token
Token ID198
Feature activation+9.94e-03

Pos logit contributions
tr+0.058
 ramp+0.058
 globe+0.057
 float+0.057
 tracks+0.056
Neg logit contributions
ubby-0.059
 enough-0.058
ited-0.057
um-0.056
 disagreed-0.055
Token elephant
Token ID20950
Feature activation+1.66e-02

Pos logit contributions
ubby+0.058
ited+0.055
 enough+0.054
um+0.052
vious+0.052
Neg logit contributions
tr-0.046
 ramp-0.045
 float-0.045
 tracks-0.043
 hunt-0.043
Token decided
Token ID3066
Feature activation-2.56e-02

Pos logit contributions
 ramp+0.032
 tracks+0.030
tr+0.030
 float+0.030
 globe+0.030
Neg logit contributions
 disagreed-0.021
ited-0.021
 enough-0.021
vious-0.020
ubby-0.020
Token to
Token ID284
Feature activation+1.84e-02

Pos logit contributions
 enough+0.033
ubby+0.033
ited+0.032
 disagreed+0.030
vious+0.030
Neg logit contributions
 ramp-0.048
tr-0.048
 float-0.047
 globe-0.046
 tracks-0.046
Token take
Token ID1011
Feature activation+9.43e-03

Pos logit contributions
 ramp+0.025
tr+0.025
 globe+0.024
 tracks+0.024
 spaceship+0.023
Neg logit contributions
ubby-0.032
ited-0.031
 enough-0.031
um-0.031
 disagreed-0.030
Token the
Token ID262
Feature activation+4.60e-02

Pos logit contributions
tr+0.018
 ramp+0.018
 globe+0.017
 float+0.017
 rug+0.017
Neg logit contributions
 enough-0.013
ubby-0.012
ited-0.012
vious-0.011
um-0.011
Token brick
Token ID17214
Feature activation+9.98e-02

Pos logit contributions
 ramp+0.064
 float+0.063
fire+0.061
tr+0.060
 tracks+0.060
Neg logit contributions
ubby-0.080
ited-0.078
 enough-0.077
um-0.075
vious-0.075
Token home
Token ID1363
Feature activation+2.89e-02

Pos logit contributions
 ramp+0.196
 globe+0.190
 Fog+0.188
 float+0.188
 glide+0.186
Neg logit contributions
 enough-0.128
vious-0.104
um-0.098
ited-0.096
ubby-0.094
Token.
Token ID13
Feature activation+4.47e-02

Pos logit contributions
 ramp+0.045
tr+0.044
 float+0.042
 tracks+0.042
 globe+0.041
Neg logit contributions
 enough-0.049
ubby-0.045
ited-0.045
um-0.043
 disagreed-0.043
Token
Token ID198
Feature activation+6.50e-03

Pos logit contributions
tr+0.084
 float+0.081
 ramp+0.080
 globe+0.080
 spaceship+0.080
Neg logit contributions
 enough-0.053
um-0.052
ubby-0.049
ited-0.049
oph-0.049
Token
Token ID198
Feature activation+2.60e-02

Pos logit contributions
tr+0.009
 ramp+0.009
 float+0.009
 tracks+0.009
 globe+0.009
Neg logit contributions
ubby-0.012
ited-0.011
 enough-0.011
vious-0.011
 disagreed-0.010
TokenOn
Token ID2202
Feature activation+3.24e-03

Pos logit contributions
tr+0.048
 float+0.048
 ramp+0.047
 tracks+0.047
 globe+0.047
Neg logit contributions
ited-0.034
ubby-0.033
 enough-0.031
vious-0.031
 disagreed-0.030
Token,
Token ID11
Feature activation+2.27e-02

Pos logit contributions
tr+0.092
 ramp+0.090
 float+0.090
 tracks+0.089
 globe+0.087
Neg logit contributions
ubby-0.105
ited-0.103
 enough-0.098
vious-0.095
 disagreed-0.093
Token and
Token ID290
Feature activation+3.94e-02

Pos logit contributions
 float+0.033
 tracks+0.032
tr+0.032
 ramp+0.032
 rug+0.031
Neg logit contributions
ubby-0.040
ited-0.040
vious-0.036
 enough-0.035
um-0.035
Token we
Token ID356
Feature activation+2.38e-02

Pos logit contributions
tr+0.050
 float+0.050
 ramp+0.050
 tracks+0.049
 globe+0.047
Neg logit contributions
ubby-0.075
ited-0.074
 enough-0.071
vious-0.069
 disagreed-0.068
Token put
Token ID1234
Feature activation+3.63e-02

Pos logit contributions
tr+0.034
 float+0.034
 ramp+0.034
 tracks+0.033
 globe+0.032
Neg logit contributions
ubby-0.042
ited-0.041
 enough-0.040
vious-0.038
um-0.038
Token our
Token ID674
Feature activation+2.63e-02

Pos logit contributions
tr+0.064
 ramp+0.062
 hunt+0.061
nob+0.058
 float+0.058
Neg logit contributions
 enough-0.056
ubby-0.052
ited-0.047
um-0.045
ised-0.044
Token toys
Token ID14958
Feature activation+9.97e-02

Pos logit contributions
tr+0.048
 ramp+0.046
 globe+0.044
 float+0.044
nob+0.044
Neg logit contributions
ubby-0.038
ited-0.036
 enough-0.035
 disagreed-0.034
um-0.032
Token away
Token ID1497
Feature activation+3.77e-02

Pos logit contributions
tr+0.130
 ramp+0.130
 tracks+0.128
 float+0.126
 globe+0.123
Neg logit contributions
ubby-0.184
ited-0.183
 enough-0.182
vious-0.170
 disagreed-0.169
Token when
Token ID618
Feature activation+3.87e-02

Pos logit contributions
 ramp+0.055
tr+0.055
 tracks+0.052
 float+0.052
 globe+0.051
Neg logit contributions
 enough-0.065
ubby-0.064
ited-0.063
 disagreed-0.058
vious-0.058
Token we
Token ID356
Feature activation+3.01e-02

Pos logit contributions
tr+0.052
 ramp+0.047
nob+0.046
 globe+0.045
 hunt+0.044
Neg logit contributions
 enough-0.077
ubby-0.073
 disagreed-0.069
um-0.068
ited-0.067
Token're
Token ID821
Feature activation+9.20e-03

Pos logit contributions
 ramp+0.052
 globe+0.051
tr+0.051
 tracks+0.050
 float+0.048
Neg logit contributions
ited-0.044
ubby-0.044
 disagreed-0.042
 enough-0.042
vious-0.041
Token done
Token ID1760
Feature activation-1.49e-02

Pos logit contributions
 float+0.015
tr+0.015
 ramp+0.015
 tracks+0.015
 globe+0.015
Neg logit contributions
ubby-0.014
ited-0.014
 enough-0.013
vious-0.012
 disagreed-0.012
Token,
Token ID11
Feature activation+1.25e-02

Pos logit contributions
ubby+0.012
ited+0.012
vious+0.011
 enough+0.010
 disagreed+0.010
Neg logit contributions
 float-0.020
 ramp-0.019
 tracks-0.019
tr-0.019
 globe-0.019
Token the
Token ID262
Feature activation-4.04e-02

Pos logit contributions
tr+0.023
nob+0.022
 ramp+0.022
 tracks+0.022
 globe+0.022
Neg logit contributions
 enough-0.018
ubby-0.016
ited-0.016
 disagreed-0.016
um-0.016
Token family
Token ID1641
Feature activation+2.32e-02

Pos logit contributions
ited+0.058
ubby+0.056
vious+0.053
 enough+0.053
ously+0.052
Neg logit contributions
 float-0.074
 ramp-0.073
 tracks-0.073
 globe-0.071
 spaceship-0.070
Token loaded
Token ID9639
Feature activation-1.32e-03

Pos logit contributions
 float+0.032
 ramp+0.032
 tracks+0.031
tr+0.031
 globe+0.030
Neg logit contributions
ubby-0.042
ited-0.041
 enough-0.039
vious-0.038
um-0.037
Token the
Token ID262
Feature activation+2.81e-02

Pos logit contributions
ubby+0.002
ited+0.002
 enough+0.002
vious+0.002
 disagreed+0.002
Neg logit contributions
tr-0.002
 ramp-0.002
 float-0.002
 tracks-0.002
 globe-0.002
Token yacht
Token ID43207
Feature activation+9.96e-02

Pos logit contributions
 float+0.028
 ramp+0.026
 tracks+0.026
 globe+0.026
tr+0.024
Neg logit contributions
ited-0.063
ubby-0.063
 enough-0.061
vious-0.059
ously-0.058
Token full
Token ID1336
Feature activation-4.26e-03

Pos logit contributions
tr+0.154
Ro+0.150
 Fog+0.148
 ramp+0.147
 tracks+0.144
Neg logit contributions
 enough-0.164
ited-0.143
um-0.143
ubby-0.142
ously-0.138
Token of
Token ID286
Feature activation+3.85e-02

Pos logit contributions
ited+0.005
ubby+0.005
vious+0.004
Fine+0.004
 disagreed+0.004
Neg logit contributions
 float-0.009
 tracks-0.009
 ramp-0.008
 globe-0.008
 hunt-0.008
Token supplies
Token ID9416
Feature activation+5.75e-02

Pos logit contributions
 float+0.041
 ramp+0.040
 tracks+0.039
tr+0.038
 globe+0.038
Neg logit contributions
ubby-0.082
ited-0.082
 enough-0.078
vious-0.077
 disagreed-0.076
Token and
Token ID290
Feature activation+5.13e-02

Pos logit contributions
tr+0.088
 ramp+0.087
 float+0.085
 tracks+0.084
 globe+0.083
Neg logit contributions
ubby-0.094
ited-0.093
 enough-0.093
 disagreed-0.086
vious-0.086
Token made
Token ID925
Feature activation-8.90e-04

Pos logit contributions
tr+0.076
 ramp+0.075
 float+0.074
 globe+0.072
 tracks+0.072
Neg logit contributions
ubby-0.087
ited-0.086
 enough-0.085
 disagreed-0.080
vious-0.079
Token and
Token ID290
Feature activation-9.85e-03

Pos logit contributions
tr+0.003
 float+0.003
 ramp+0.003
 tracks+0.003
 globe+0.003
Neg logit contributions
ubby-0.004
ited-0.003
 enough-0.003
vious-0.003
um-0.003
Token Grand
Token ID5675
Feature activation-4.96e-02

Pos logit contributions
ubby+0.016
ited+0.015
 enough+0.014
vious+0.014
 disagreed+0.014
Neg logit contributions
tr-0.016
 float-0.016
 ramp-0.016
 tracks-0.016
 globe-0.015
Tokenma
Token ID2611
Feature activation+2.52e-02

Pos logit contributions
ubby+0.068
ited+0.062
ously+0.060
joy+0.059
vious+0.059
Neg logit contributions
tr-0.091
 globe-0.087
nob-0.086
 rug-0.085
 tracks-0.083
Token put
Token ID1234
Feature activation+2.41e-02

Pos logit contributions
 ramp+0.046
tr+0.045
 tracks+0.044
 float+0.044
 globe+0.044
Neg logit contributions
ited-0.035
 enough-0.033
ubby-0.033
 disagreed-0.033
vious-0.031
Token the
Token ID262
Feature activation+2.48e-02

Pos logit contributions
tr+0.036
 ramp+0.036
 float+0.036
 tracks+0.035
 globe+0.035
Neg logit contributions
ubby-0.040
ited-0.040
 enough-0.039
vious-0.037
 disagreed-0.036
Token pie
Token ID2508
Feature activation+9.96e-02

Pos logit contributions
tr+0.029
 float+0.028
 ramp+0.028
 globe+0.028
 tracks+0.027
Neg logit contributions
ited-0.052
ubby-0.051
ously-0.050
iley-0.049
ody-0.048
Token on
Token ID319
Feature activation+1.93e-02

Pos logit contributions
 ramp+0.159
 tracks+0.152
 outer+0.148
 Fog+0.147
 Jupiter+0.146
Neg logit contributions
 enough-0.166
um-0.148
ubby-0.146
ited-0.143
 disagreed-0.135
Token the
Token ID262
Feature activation+4.50e-02

Pos logit contributions
 float+0.028
tr+0.028
 ramp+0.028
 tracks+0.028
 globe+0.027
Neg logit contributions
ubby-0.033
ited-0.033
vious-0.031
 enough-0.031
 disagreed-0.030
Token table
Token ID3084
Feature activation+4.65e-02

Pos logit contributions
 float+0.053
 ramp+0.052
nob+0.051
tr+0.051
 globe+0.050
Neg logit contributions
ubby-0.083
 enough-0.081
um-0.078
ited-0.078
ige-0.075
Token and
Token ID290
Feature activation+5.50e-02

Pos logit contributions
tr+0.076
 float+0.076
 ramp+0.076
 tracks+0.075
 globe+0.074
Neg logit contributions
ubby-0.073
ited-0.072
vious-0.065
 enough-0.065
um-0.063
Token waited
Token ID13488
Feature activation-3.46e-02

Pos logit contributions
 ramp+0.098
 float+0.094
tr+0.093
 globe+0.092
 tracks+0.092
Neg logit contributions
 enough-0.078
ited-0.076
 disagreed-0.073
ubby-0.070
vious-0.066
Token the
Token ID262
Feature activation+2.65e-02

Pos logit contributions
tr+0.037
 ramp+0.034
nob+0.033
 globe+0.032
 tracks+0.031
Neg logit contributions
 enough-0.019
ubby-0.016
joy-0.016
 understanding-0.015
ited-0.015
Token cage
Token ID17186
Feature activation+3.15e-02

Pos logit contributions
tr+0.031
 ramp+0.031
nob+0.030
 float+0.029
 globe+0.029
Neg logit contributions
ubby-0.053
 enough-0.053
ited-0.051
vious-0.048
 disagreed-0.048
Token and
Token ID290
Feature activation+4.27e-02

Pos logit contributions
tr+0.060
 ramp+0.060
 float+0.060
 tracks+0.059
 globe+0.058
Neg logit contributions
ubby-0.040
ited-0.039
 enough-0.038
vious-0.035
 disagreed-0.035
Token put
Token ID1234
Feature activation+2.27e-02

Pos logit contributions
 ramp+0.092
tr+0.092
 globe+0.087
 rug+0.085
nob+0.085
Neg logit contributions
 enough-0.050
 disagreed-0.049
ited-0.047
 understood-0.040
 argued-0.040
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
 ramp+0.045
tr+0.045
nob+0.041
 tracks+0.041
 hunt+0.040
Neg logit contributions
 enough-0.032
ubby-0.029
ited-0.026
um-0.026
vious-0.024
Token flowers
Token ID12734
Feature activation+9.95e-02

Pos logit contributions
tr+0.014
 ramp+0.014
 float+0.014
 tracks+0.014
 globe+0.013
Neg logit contributions
ubby-0.027
ited-0.027
 enough-0.026
vious-0.025
 disagreed-0.025
Token inside
Token ID2641
Feature activation+5.08e-02

Pos logit contributions
 ramp+0.186
tr+0.173
Ro+0.168
 tracks+0.167
 Fog+0.162
Neg logit contributions
 enough-0.151
ubby-0.133
ited-0.129
um-0.123
ously-0.118
Token.
Token ID13
Feature activation+4.34e-02

Pos logit contributions
tr+0.091
 ramp+0.090
 tracks+0.080
nob+0.078
 globe+0.078
Neg logit contributions
 enough-0.082
ubby-0.073
ited-0.071
 disagreed-0.070
joy-0.069
Token The
Token ID383
Feature activation-2.45e-02

Pos logit contributions
 globe+0.078
tr+0.077
 ramp+0.077
 float+0.073
 spaceship+0.072
Neg logit contributions
 enough-0.060
ubby-0.057
vious-0.057
ited-0.055
 disagreed-0.054
Token bird
Token ID6512
Feature activation+3.24e-02

Pos logit contributions
ubby+0.042
ited+0.041
 enough+0.039
vious+0.038
 disagreed+0.037
Neg logit contributions
tr-0.038
 ramp-0.037
 tracks-0.036
 float-0.036
nob-0.035
Token was
Token ID373
Feature activation-8.66e-03

Pos logit contributions
 ramp+0.054
tr+0.054
 float+0.053
 tracks+0.052
 globe+0.052
Neg logit contributions
ited-0.049
ubby-0.048
 enough-0.047
 disagreed-0.045
vious-0.044
Token,
Token ID11
Feature activation+3.42e-03

Pos logit contributions
ubby+0.028
ited+0.027
 enough+0.025
vious+0.024
 disagreed+0.023
Neg logit contributions
tr-0.048
 ramp-0.048
 float-0.048
 tracks-0.047
 globe-0.047
Token can
Token ID460
Feature activation-1.35e-02

Pos logit contributions
tr+0.005
 ramp+0.005
 float+0.005
 tracks+0.005
 globe+0.005
Neg logit contributions
ubby-0.006
ited-0.006
 enough-0.006
vious-0.005
 disagreed-0.005
Token we
Token ID356
Feature activation+3.34e-02

Pos logit contributions
ubby+0.022
 enough+0.022
ited+0.021
 disagreed+0.021
um+0.021
Neg logit contributions
tr-0.020
 ramp-0.020
 tracks-0.019
 globe-0.019
nob-0.019
Token put
Token ID1234
Feature activation+2.70e-02

Pos logit contributions
tr+0.041
 ramp+0.040
 tracks+0.040
 float+0.039
 globe+0.039
Neg logit contributions
ubby-0.065
ited-0.064
 enough-0.063
vious-0.060
 disagreed-0.059
Token my
Token ID616
Feature activation+3.47e-03

Pos logit contributions
tr+0.047
 ramp+0.045
 hunt+0.044
nob+0.043
fire+0.043
Neg logit contributions
 enough-0.043
ubby-0.039
vious-0.035
ited-0.035
um-0.034
Token flag
Token ID6056
Feature activation+9.93e-02

Pos logit contributions
 float+0.003
tr+0.003
 ramp+0.003
 tracks+0.003
 globe+0.003
Neg logit contributions
ubby-0.008
ited-0.007
 enough-0.007
vious-0.007
 disagreed-0.007
Token in
Token ID287
Feature activation+5.73e-02

Pos logit contributions
tr+0.118
 ramp+0.118
 float+0.117
 tracks+0.115
 globe+0.112
Neg logit contributions
ubby-0.196
ited-0.194
 enough-0.188
vious-0.181
 disagreed-0.179
Token a
Token ID257
Feature activation+6.63e-02

Pos logit contributions
 float+0.084
 ramp+0.083
 tracks+0.082
tr+0.082
 globe+0.081
Neg logit contributions
ited-0.101
ubby-0.099
vious-0.093
ously-0.091
 disagreed-0.091
Token jar
Token ID17379
Feature activation+5.52e-02

Pos logit contributions
nob+0.082
tr+0.081
 tracks+0.076
 float+0.076
 globe+0.075
Neg logit contributions
ited-0.122
ubby-0.121
 enough-0.121
 disagreed-0.109
vious-0.109
Token so
Token ID523
Feature activation+7.38e-03

Pos logit contributions
tr+0.078
 float+0.074
 ramp+0.071
 hunt+0.070
nob+0.066
Neg logit contributions
ubby-0.103
ited-0.100
ously-0.096
joy-0.095
iley-0.094
Token it
Token ID340
Feature activation+5.73e-02

Pos logit contributions
tr+0.009
 ramp+0.009
 float+0.009
 tracks+0.008
 globe+0.008
Neg logit contributions
ubby-0.015
ited-0.014
 enough-0.014
vious-0.014
 disagreed-0.014
Token disappear
Token ID10921
Feature activation-1.27e-02

Pos logit contributions
tr+0.039
 ramp+0.039
 globe+0.039
 tracks+0.039
 float+0.037
Neg logit contributions
ubby-0.039
ited-0.039
 enough-0.037
vious-0.037
 disagreed-0.037
Token?"
Token ID1701
Feature activation-3.48e-03

Pos logit contributions
ubby+0.019
ited+0.019
 enough+0.019
vious+0.017
 disagreed+0.017
Neg logit contributions
tr-0.021
 ramp-0.021
 float-0.021
 tracks-0.021
 globe-0.020
Token His
Token ID2399
Feature activation-6.59e-02

Pos logit contributions
ubby+0.005
ited+0.005
 enough+0.005
vious+0.005
 disagreed+0.005
Neg logit contributions
tr-0.006
 ramp-0.006
 float-0.006
 tracks-0.006
 globe-0.005
Token mother
Token ID2802
Feature activation-3.66e-02

Pos logit contributions
ited+0.129
ubby+0.121
vious+0.120
 enough+0.117
ously+0.112
Neg logit contributions
tr-0.092
 float-0.089
 ramp-0.089
 tracks-0.089
 rug-0.087
Token smiled
Token ID13541
Feature activation-8.44e-02

Pos logit contributions
ubby+0.056
ited+0.052
vious+0.050
 enough+0.050
joy+0.048
Neg logit contributions
tr-0.062
 float-0.060
 tracks-0.058
 rug-0.058
nob-0.057
Token knowingly
Token ID26526
Feature activation-1.25e-01

Pos logit contributions
ited+0.157
ubby+0.152
oke+0.142
Fine+0.141
vious+0.139
Neg logit contributions
 float-0.119
 cozy-0.118
tr-0.111
 spaceship-0.107
 tracks-0.106
Token and
Token ID290
Feature activation+1.09e-02

Pos logit contributions
ited+0.260
oph+0.240
 disagreed+0.240
Fine+0.239
vious+0.235
Neg logit contributions
 float-0.133
 hyd-0.129
 globe-0.126
 tracks-0.123
 cozy-0.123
Token said
Token ID531
Feature activation-9.98e-02

Pos logit contributions
 ramp+0.022
tr+0.022
 float+0.022
 tracks+0.022
 globe+0.022
Neg logit contributions
ited-0.013
ubby-0.013
 enough-0.012
 disagreed-0.011
vious-0.011
Token,
Token ID11
Feature activation-1.33e-02

Pos logit contributions
ited+0.155
ubby+0.144
vious+0.136
ody+0.124
oph+0.123
Neg logit contributions
 float-0.175
tr-0.175
 globe-0.173
 ramp-0.167
 tracks-0.166
Token "
Token ID366
Feature activation+1.23e-02

Pos logit contributions
ubby+0.020
ited+0.020
 enough+0.020
 disagreed+0.019
vious+0.018
Neg logit contributions
 ramp-0.022
tr-0.022
 float-0.021
 tracks-0.021
 globe-0.021
TokenWell
Token ID5779
Feature activation-7.18e-02

Pos logit contributions
tr+0.019
 ramp+0.018
 float+0.018
 globe+0.018
 tracks+0.018
Neg logit contributions
ubby-0.022
ited-0.021
 enough-0.020
vious-0.019
um-0.019
Token sick
Token ID6639
Feature activation-2.58e-02

Pos logit contributions
 float+0.015
tr+0.015
 ramp+0.015
 tracks+0.015
 globe+0.014
Neg logit contributions
ubby-0.020
ited-0.020
 enough-0.019
vious-0.019
ously-0.018
Token?"
Token ID1701
Feature activation-1.95e-02

Pos logit contributions
ited+0.044
ubby+0.042
ously+0.039
 enough+0.038
vious+0.038
Neg logit contributions
 float-0.040
tr-0.039
 ramp-0.038
 tracks-0.038
 hunt-0.037
Token
Token ID198
Feature activation-3.24e-02

Pos logit contributions
ubby+0.034
ited+0.033
 enough+0.032
vious+0.031
 disagreed+0.031
Neg logit contributions
tr-0.028
 ramp-0.028
 float-0.028
 tracks-0.027
 globe-0.027
Token
Token ID198
Feature activation-1.30e-02

Pos logit contributions
ubby+0.055
ited+0.052
 enough+0.051
um+0.049
vious+0.048
Neg logit contributions
tr-0.049
 ramp-0.048
 float-0.048
 tracks-0.047
 globe-0.046
TokenJoe
Token ID19585
Feature activation-5.26e-02

Pos logit contributions
ubby+0.016
ited+0.015
 enough+0.014
um+0.014
vious+0.013
Neg logit contributions
tr-0.026
 float-0.025
 ramp-0.025
 tracks-0.025
 globe-0.025
Token said
Token ID531
Feature activation-1.24e-01

Pos logit contributions
ubby+0.084
 enough+0.077
joy+0.074
ited+0.073
 intentions+0.072
Neg logit contributions
 float-0.084
tr-0.082
nob-0.082
 tracks-0.080
 ramp-0.078
Token:"
Token ID11097
Feature activation+1.91e-02

Pos logit contributions
ited+0.166
ubby+0.164
vious+0.146
 enough+0.143
oph+0.137
Neg logit contributions
 float-0.236
tr-0.233
 ramp-0.227
 globe-0.226
 tracks-0.225
TokenDon
Token ID3987
Feature activation-5.12e-02

Pos logit contributions
 tracks+0.034
 ramp+0.033
 spaceship+0.033
 float+0.032
tr+0.031
Neg logit contributions
ited-0.027
ubby-0.026
 disagreed-0.026
vious-0.025
Fine-0.025
Token't
Token ID470
Feature activation-5.05e-02

Pos logit contributions
ubby+0.081
ited+0.079
 enough+0.076
vious+0.073
um+0.072
Neg logit contributions
tr-0.083
 ramp-0.082
 float-0.082
 tracks-0.081
 globe-0.079
Token worry
Token ID5490
Feature activation-9.24e-02

Pos logit contributions
ubby+0.053
ited+0.051
 enough+0.048
vious+0.045
 disagreed+0.043
Neg logit contributions
 float-0.109
tr-0.109
 ramp-0.108
 tracks-0.107
 globe-0.105
Token.
Token ID13
Feature activation+3.15e-02

Pos logit contributions
ited+0.134
ubby+0.133
 enough+0.123
vious+0.121
 disagreed+0.118
Neg logit contributions
tr-0.161
 float-0.161
 ramp-0.161
 tracks-0.158
 globe-0.155
Token you
Token ID345
Feature activation+1.14e-02

Pos logit contributions
ubby+0.034
ited+0.033
 enough+0.033
vious+0.032
 disagreed+0.032
Neg logit contributions
tr-0.021
 ramp-0.021
 float-0.021
 tracks-0.020
 globe-0.020
Token forgive
Token ID20927
Feature activation-4.02e-02

Pos logit contributions
 float+0.017
tr+0.017
 ramp+0.017
 tracks+0.016
 hunt+0.016
Neg logit contributions
ubby-0.020
ited-0.020
 enough-0.019
vious-0.018
ously-0.018
Token me
Token ID502
Feature activation-1.62e-02

Pos logit contributions
ubby+0.053
ited+0.051
 enough+0.050
vious+0.047
um+0.046
Neg logit contributions
 ramp-0.076
tr-0.075
 float-0.074
 tracks-0.073
 globe-0.073
Token?"
Token ID1701
Feature activation-9.64e-03

Pos logit contributions
ubby+0.024
ited+0.024
 enough+0.023
vious+0.022
 disagreed+0.022
Neg logit contributions
tr-0.027
 ramp-0.027
 float-0.027
 tracks-0.027
 globe-0.026
Token Tom
Token ID4186
Feature activation-4.60e-02

Pos logit contributions
ubby+0.017
ited+0.017
 enough+0.017
vious+0.016
 disagreed+0.016
Neg logit contributions
tr-0.013
 ramp-0.013
 float-0.013
 tracks-0.013
 globe-0.013
Token said
Token ID531
Feature activation-1.23e-01

Pos logit contributions
ubby+0.085
ited+0.074
years+0.070
 ago+0.070
 enough+0.069
Neg logit contributions
 float-0.067
tr-0.066
 hunt-0.064
 ramp-0.063
 tracks-0.063
Token,
Token ID11
Feature activation-2.87e-02

Pos logit contributions
ited+0.209
ubby+0.205
vious+0.185
oph+0.185
 enough+0.182
Neg logit contributions
 float-0.191
tr-0.185
 ramp-0.184
 tracks-0.181
 globe-0.179
Token "
Token ID366
Feature activation+7.08e-03

Pos logit contributions
ubby+0.046
ited+0.046
 enough+0.044
vious+0.042
 disagreed+0.041
Neg logit contributions
tr-0.045
 ramp-0.045
 float-0.045
 tracks-0.044
 globe-0.043
TokenI
Token ID40
Feature activation-6.23e-03

Pos logit contributions
 ramp+0.011
tr+0.011
 tracks+0.011
 float+0.011
 globe+0.011
Neg logit contributions
ited-0.011
ubby-0.011
 enough-0.010
vious-0.010
 disagreed-0.010
Token forgive
Token ID20927
Feature activation-5.54e-02

Pos logit contributions
ubby+0.011
ited+0.011
 enough+0.010
um+0.010
illy+0.010
Neg logit contributions
tr-0.009
 float-0.009
 ramp-0.009
 hunt-0.008
 tracks-0.008
Token you
Token ID345
Feature activation-3.53e-02

Pos logit contributions
ubby+0.096
ited+0.093
 enough+0.092
vious+0.087
um+0.086
Neg logit contributions
tr-0.083
 ramp-0.082
 float-0.080
 tracks-0.080
 globe-0.079
Token understand
Token ID1833
Feature activation-7.65e-02

Pos logit contributions
ited+0.029
oph+0.027
ubby+0.027
joy+0.026
ously+0.026
Neg logit contributions
 float-0.027
tr-0.026
 breathe-0.025
 hunt-0.024
 cozy-0.024
Token?"
Token ID1701
Feature activation-1.07e-02

Pos logit contributions
ubby+0.118
ited+0.115
 enough+0.111
vious+0.106
 disagreed+0.104
Neg logit contributions
tr-0.126
 ramp-0.126
 float-0.125
 tracks-0.123
 globe-0.121
Token
Token ID198
Feature activation-3.19e-02

Pos logit contributions
ubby+0.019
ited+0.019
 enough+0.018
vious+0.017
 disagreed+0.017
Neg logit contributions
tr-0.015
 ramp-0.015
 float-0.015
 tracks-0.015
 globe-0.014
Token
Token ID198
Feature activation-8.50e-03

Pos logit contributions
ubby+0.055
ited+0.053
 enough+0.051
um+0.049
vious+0.049
Neg logit contributions
tr-0.047
 ramp-0.046
 float-0.046
 tracks-0.045
 globe-0.044
TokenBen
Token ID11696
Feature activation-4.08e-02

Pos logit contributions
ubby+0.010
ited+0.009
 enough+0.009
um+0.008
vious+0.008
Neg logit contributions
tr-0.017
 float-0.017
 ramp-0.017
 tracks-0.017
 globe-0.017
Token said
Token ID531
Feature activation-1.23e-01

Pos logit contributions
ubby+0.058
ited+0.049
 enough+0.047
illy+0.045
years+0.045
Neg logit contributions
tr-0.079
 float-0.075
nob-0.073
 globe-0.073
 hunt-0.073
Token,
Token ID11
Feature activation-4.13e-02

Pos logit contributions
ited+0.205
ubby+0.186
oph+0.176
vious+0.170
ody+0.168
Neg logit contributions
 float-0.209
 ramp-0.195
tr-0.195
 globe-0.193
 tracks-0.190
Token "
Token ID366
Feature activation-1.04e-02

Pos logit contributions
ubby+0.067
ited+0.066
 enough+0.064
vious+0.061
 disagreed+0.060
Neg logit contributions
tr-0.064
 ramp-0.064
 float-0.064
 tracks-0.062
 globe-0.061
TokenYes
Token ID5297
Feature activation-8.08e-02

Pos logit contributions
ubby+0.016
ited+0.015
 enough+0.015
vious+0.014
 disagreed+0.014
Neg logit contributions
tr-0.017
 ramp-0.017
 tracks-0.017
 float-0.017
 globe-0.017
Token,
Token ID11
Feature activation-2.78e-02

Pos logit contributions
ited+0.135
ubby+0.117
vious+0.117
eared+0.116
years+0.112
Neg logit contributions
 float-0.135
 ramp-0.128
 globe-0.124
 rug-0.123
 tracks-0.122
Token sir
Token ID15967
Feature activation-3.47e-02

Pos logit contributions
ubby+0.051
ited+0.050
 enough+0.049
vious+0.047
 disagreed+0.047
Neg logit contributions
tr-0.037
 ramp-0.037
 float-0.036
 tracks-0.036
 globe-0.035
Token lost
Token ID2626
Feature activation-2.00e-02

Pos logit contributions
ubby+0.003
ited+0.003
 enough+0.002
vious+0.002
 disagreed+0.002
Neg logit contributions
tr-0.002
 float-0.002
 ramp-0.002
 tracks-0.002
 globe-0.002
Token or
Token ID393
Feature activation+3.81e-02

Pos logit contributions
ited+0.034
ubby+0.032
vious+0.030
 disagreed+0.030
 enough+0.030
Neg logit contributions
 float-0.031
 tracks-0.030
tr-0.030
 ramp-0.030
 globe-0.029
Token broken
Token ID5445
Feature activation-1.44e-02

Pos logit contributions
tr+0.050
 ramp+0.050
 globe+0.049
 float+0.049
 tracks+0.048
Neg logit contributions
ubby-0.070
ited-0.069
 enough-0.068
vious-0.065
 disagreed-0.064
Token?"
Token ID1701
Feature activation-1.74e-02

Pos logit contributions
ited+0.025
ubby+0.024
vious+0.023
 enough+0.023
 disagreed+0.022
Neg logit contributions
 float-0.021
 ramp-0.021
tr-0.021
 tracks-0.021
 globe-0.020
Token Anna
Token ID11735
Feature activation-5.40e-02

Pos logit contributions
ited+0.032
ubby+0.032
vious+0.029
 enough+0.029
illy+0.028
Neg logit contributions
 float-0.024
 tracks-0.024
 ramp-0.023
tr-0.023
 rug-0.022
Token said
Token ID531
Feature activation-1.23e-01

Pos logit contributions
ubby+0.093
ited+0.092
 End+0.083
vious+0.082
joy+0.081
Neg logit contributions
 tracks-0.077
 float-0.076
tr-0.074
 hunt-0.072
 ramp-0.071
Token,
Token ID11
Feature activation-3.95e-02

Pos logit contributions
ubby+0.149
ited+0.143
 enough+0.143
 disagreed+0.129
vious+0.129
Neg logit contributions
 ramp-0.242
tr-0.241
 float-0.237
 tracks-0.236
 globe-0.234
Token looking
Token ID2045
Feature activation-6.20e-02

Pos logit contributions
ubby+0.064
ited+0.063
 enough+0.062
 disagreed+0.058
vious+0.058
Neg logit contributions
tr-0.061
 ramp-0.061
 tracks-0.059
 float-0.059
 globe-0.059
Token at
Token ID379
Feature activation-6.01e-02

Pos logit contributions
ited+0.101
ubby+0.099
 enough+0.092
oph+0.091
 disagreed+0.090
Neg logit contributions
 float-0.096
 tracks-0.095
tr-0.094
 ramp-0.092
 cozy-0.090
Token her
Token ID607
Feature activation-2.57e-02

Pos logit contributions
ited+0.108
ubby+0.106
vious+0.098
 disagreed+0.097
 enough+0.096
Neg logit contributions
 float-0.089
 tracks-0.088
 ramp-0.086
 rug-0.085
 globe-0.085
Token black
Token ID2042
Feature activation-1.33e-02

Pos logit contributions
ubby+0.042
 enough+0.040
ited+0.039
 disagreed+0.038
um+0.037
Neg logit contributions
tr-0.039
 ramp-0.038
nob-0.038
 hunt-0.037
 float-0.037
Token the
Token ID262
Feature activation+9.49e-03

Pos logit contributions
 ramp+0.105
tr+0.104
nob+0.102
 globe+0.099
fire+0.097
Neg logit contributions
 enough-0.074
ubby-0.063
ited-0.062
vious-0.060
 disagreed-0.059
Token birds
Token ID10087
Feature activation+4.40e-02

Pos logit contributions
tr+0.012
nob+0.012
 ramp+0.012
 tracks+0.011
 float+0.011
Neg logit contributions
ubby-0.018
 enough-0.017
ited-0.017
vious-0.016
 End-0.016
Token again
Token ID757
Feature activation+1.92e-02

Pos logit contributions
tr+0.076
 ramp+0.075
nob+0.072
 tracks+0.072
 rug+0.069
Neg logit contributions
 enough-0.067
ubby-0.064
ited-0.062
vious-0.059
 disagreed-0.058
Token,"
Token ID553
Feature activation-1.26e-02

Pos logit contributions
tr+0.025
 ramp+0.025
 tracks+0.023
nob+0.023
 globe+0.023
Neg logit contributions
ubby-0.037
 enough-0.037
ited-0.035
um-0.033
 disagreed-0.033
Token mom
Token ID1995
Feature activation-3.98e-02

Pos logit contributions
ubby+0.023
ited+0.023
 enough+0.021
vious+0.021
 disagreed+0.020
Neg logit contributions
 float-0.018
tr-0.017
 ramp-0.017
 tracks-0.017
 globe-0.017
Token said
Token ID531
Feature activation-1.23e-01

Pos logit contributions
ubby+0.077
joy+0.066
ited+0.066
 intentions+0.066
 efforts+0.063
Neg logit contributions
tr-0.055
nob-0.053
 rug-0.052
 float-0.051
 tracks-0.050
Token,
Token ID11
Feature activation-2.50e-02

Pos logit contributions
ited+0.208
ubby+0.203
vious+0.188
ody+0.180
oph+0.180
Neg logit contributions
 float-0.190
 tracks-0.184
tr-0.183
 ramp-0.180
 globe-0.176
Token smiling
Token ID16755
Feature activation-6.84e-02

Pos logit contributions
ubby+0.040
ited+0.039
 enough+0.038
 disagreed+0.036
joy+0.035
Neg logit contributions
tr-0.039
 ramp-0.039
 tracks-0.039
 float-0.038
 globe-0.037
Token and
Token ID290
Feature activation+2.17e-02

Pos logit contributions
ited+0.129
ubby+0.129
vious+0.114
illy+0.113
ody+0.113
Neg logit contributions
 float-0.092
tr-0.090
 tracks-0.085
 globe-0.085
 cozy-0.084
Token kissing
Token ID25847
Feature activation-4.53e-02

Pos logit contributions
 ramp+0.040
tr+0.039
nob+0.039
 globe+0.037
 planet+0.037
Neg logit contributions
 enough-0.030
 understanding-0.029
 disagreed-0.027
atted-0.027
joy-0.027
Token Anna
Token ID11735
Feature activation-1.41e-02

Pos logit contributions
ubby+0.079
ited+0.078
 enough+0.074
vious+0.072
 disagreed+0.071
Neg logit contributions
tr-0.065
 float-0.065
 ramp-0.064
 tracks-0.064
 globe-0.063
Token you
Token ID345
Feature activation+5.94e-03

Pos logit contributions
ubby+0.036
ited+0.035
 enough+0.034
vious+0.033
 disagreed+0.033
Neg logit contributions
tr-0.022
 ramp-0.022
 float-0.022
 tracks-0.022
 globe-0.021
Token forgive
Token ID20927
Feature activation-4.19e-02

Pos logit contributions
 float+0.007
tr+0.007
 ramp+0.006
 hunt+0.006
 tracks+0.006
Neg logit contributions
ubby-0.012
ited-0.012
 enough-0.012
joy-0.012
ously-0.011
Token me
Token ID502
Feature activation-2.09e-02

Pos logit contributions
ubby+0.063
ited+0.062
 enough+0.059
vious+0.056
 disagreed+0.055
Neg logit contributions
tr-0.071
 ramp-0.071
 float-0.070
 tracks-0.069
 globe-0.068
Token?"
Token ID1701
Feature activation-1.10e-02

Pos logit contributions
ited+0.035
ubby+0.035
 enough+0.032
ously+0.032
oph+0.032
Neg logit contributions
 float-0.032
tr-0.031
 tracks-0.030
 ramp-0.030
 hunt-0.030
Token Sara
Token ID24799
Feature activation-5.23e-02

Pos logit contributions
ubby+0.021
ited+0.021
 enough+0.020
vious+0.019
 disagreed+0.019
Neg logit contributions
 float-0.014
tr-0.013
 tracks-0.013
 ramp-0.013
 globe-0.013
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ubby+0.087
ited+0.082
 enough+0.077
 End+0.074
vious+0.074
Neg logit contributions
tr-0.082
 float-0.081
 tracks-0.079
 ramp-0.078
 hunt-0.076
Token.
Token ID13
Feature activation-6.94e-03

Pos logit contributions
ubby+0.254
ited+0.253
 enough+0.242
vious+0.236
 disagreed+0.233
Neg logit contributions
tr-0.134
 float-0.134
 ramp-0.132
 tracks-0.130
 globe-0.127
Token
Token ID198
Feature activation-3.07e-02

Pos logit contributions
ubby+0.011
 enough+0.011
ited+0.011
um+0.010
vious+0.010
Neg logit contributions
 ramp-0.011
tr-0.011
 float-0.010
 tracks-0.010
 globe-0.010
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
ubby+0.057
ited+0.056
 enough+0.054
vious+0.052
 disagreed+0.052
Neg logit contributions
tr-0.041
 ramp-0.040
 float-0.040
 tracks-0.040
 globe-0.039
TokenBen
Token ID11696
Feature activation-4.00e-02

Pos logit contributions
ited+0.019
ubby+0.019
 enough+0.019
vious+0.018
 disagreed+0.017
Neg logit contributions
 ramp-0.025
 float-0.025
tr-0.025
 tracks-0.025
 globe-0.024
Token looked
Token ID3114
Feature activation-5.45e-02

Pos logit contributions
ubby+0.066
ited+0.060
 enough+0.057
heart+0.055
illy+0.055
Neg logit contributions
tr-0.066
 float-0.065
 ramp-0.063
 globe-0.062
 tracks-0.061
Token and
Token ID290
Feature activation+5.39e-02

Pos logit contributions
ubby+0.007
ited+0.007
 enough+0.006
vious+0.006
 disagreed+0.006
Neg logit contributions
tr-0.005
 ramp-0.005
 float-0.005
 tracks-0.005
 globe-0.005
Token hurt
Token ID5938
Feature activation-3.43e-02

Pos logit contributions
tr+0.069
 globe+0.067
 ramp+0.067
 tracks+0.065
nob+0.062
Neg logit contributions
 enough-0.102
ited-0.101
ubby-0.100
vious-0.097
 disagreed-0.096
Token ourselves
Token ID6731
Feature activation+2.82e-02

Pos logit contributions
ubby+0.053
 enough+0.051
ited+0.048
um+0.047
vious+0.046
Neg logit contributions
 ramp-0.058
tr-0.058
 float-0.055
 globe-0.054
 hunt-0.054
Token?"
Token ID1701
Feature activation-1.65e-02

Pos logit contributions
tr+0.039
 ramp+0.038
 float+0.038
 tracks+0.037
 globe+0.037
Neg logit contributions
ubby-0.051
ited-0.050
 enough-0.049
vious-0.047
 disagreed-0.046
Token Ben
Token ID3932
Feature activation-5.67e-02

Pos logit contributions
ubby+0.031
ited+0.031
 enough+0.030
vious+0.029
 disagreed+0.028
Neg logit contributions
 float-0.021
 ramp-0.021
tr-0.021
 tracks-0.021
 globe-0.020
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ubby+0.092
ited+0.089
 enough+0.081
um+0.079
illy+0.078
Neg logit contributions
 float-0.090
 tracks-0.088
tr-0.087
 ramp-0.087
 hunt-0.086
Token.
Token ID13
Feature activation-4.40e-03

Pos logit contributions
ubby+0.224
ited+0.219
 enough+0.216
vious+0.205
 disagreed+0.204
Neg logit contributions
tr-0.165
 ramp-0.165
 float-0.162
 tracks-0.160
 globe-0.157
Token He
Token ID679
Feature activation-2.08e-02

Pos logit contributions
ubby+0.007
 enough+0.007
ited+0.007
um+0.006
joy+0.006
Neg logit contributions
tr-0.007
 ramp-0.007
 tracks-0.007
 globe-0.007
 float-0.007
Token was
Token ID373
Feature activation-3.59e-02

Pos logit contributions
ubby+0.035
ited+0.033
 enough+0.031
vious+0.031
um+0.030
Neg logit contributions
tr-0.033
 float-0.032
 ramp-0.032
 tracks-0.031
 rug-0.030
Token scared
Token ID12008
Feature activation-7.66e-02

Pos logit contributions
ubby+0.054
ited+0.054
 enough+0.051
vious+0.049
 disagreed+0.048
Neg logit contributions
tr-0.060
 float-0.060
 ramp-0.060
 tracks-0.059
 globe-0.057
Token and
Token ID290
Feature activation+1.68e-02

Pos logit contributions
ited+0.145
ubby+0.135
oph+0.134
ige+0.128
ody+0.127
Neg logit contributions
 float-0.107
 tracks-0.107
 rug-0.096
 ramp-0.096
 cozy-0.093
Token mad
Token ID8805
Feature activation-6.77e-02

Pos logit contributions
ited+0.054
ubby+0.052
um+0.048
vious+0.048
 enough+0.047
Neg logit contributions
 float-0.053
tr-0.052
 tracks-0.052
 ramp-0.052
 hunt-0.050
Token at
Token ID379
Feature activation-4.28e-02

Pos logit contributions
ited+0.144
 End+0.126
years+0.122
ubby+0.121
illy+0.119
Neg logit contributions
 float-0.073
tr-0.072
 ramp-0.065
 tracks-0.064
 hunt-0.064
Token me
Token ID502
Feature activation-1.44e-02

Pos logit contributions
ited+0.068
ubby+0.067
 enough+0.062
vious+0.061
ously+0.060
Neg logit contributions
 float-0.070
 tracks-0.068
 ramp-0.068
tr-0.068
 hunt-0.066
Token?"
Token ID1701
Feature activation-2.25e-02

Pos logit contributions
ubby+0.026
ited+0.026
 enough+0.024
vious+0.024
ously+0.023
Neg logit contributions
tr-0.020
 float-0.020
 ramp-0.019
 tracks-0.019
 hunt-0.019
Token Lily
Token ID20037
Feature activation-7.03e-02

Pos logit contributions
ubby+0.036
ited+0.035
 enough+0.034
vious+0.032
 disagreed+0.032
Neg logit contributions
tr-0.036
 ramp-0.036
 float-0.035
 tracks-0.035
 globe-0.035
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ubby+0.123
ited+0.116
 enough+0.105
 End+0.103
illy+0.103
Neg logit contributions
 tracks-0.106
 float-0.104
tr-0.101
 ramp-0.100
nob-0.099
Token.
Token ID13
Feature activation-1.42e-02

Pos logit contributions
ubby+0.257
ited+0.257
 enough+0.243
vious+0.239
 disagreed+0.235
Neg logit contributions
 float-0.132
tr-0.131
 ramp-0.128
 tracks-0.127
 globe-0.123
Token
Token ID198
Feature activation-3.82e-02

Pos logit contributions
ubby+0.021
 enough+0.020
ited+0.020
um+0.019
vious+0.019
Neg logit contributions
 ramp-0.024
tr-0.024
 float-0.023
 tracks-0.023
 globe-0.023
Token
Token ID198
Feature activation-2.14e-02

Pos logit contributions
ubby+0.066
ited+0.064
 enough+0.063
um+0.060
vious+0.060
Neg logit contributions
tr-0.055
 ramp-0.055
 float-0.054
 tracks-0.053
 globe-0.052
Token"
Token ID1
Feature activation-4.91e-03

Pos logit contributions
ubby+0.024
ited+0.023
 enough+0.022
vious+0.020
 disagreed+0.020
Neg logit contributions
tr-0.045
 ramp-0.044
 float-0.044
 tracks-0.044
 globe-0.043
TokenReally
Token ID26392
Feature activation-3.18e-02

Pos logit contributions
ubby+0.008
ited+0.007
 enough+0.007
vious+0.007
 disagreed+0.007
Neg logit contributions
 ramp-0.008
 tracks-0.008
tr-0.008
 float-0.008
 globe-0.008
Token not
Token ID407
Feature activation-2.47e-02

Pos logit contributions
ubby+0.070
 enough+0.068
ited+0.067
um+0.063
 disagreed+0.062
Neg logit contributions
tr-0.072
 ramp-0.071
 tracks-0.069
 float-0.069
 globe-0.069
Token see
Token ID766
Feature activation-2.95e-02

Pos logit contributions
ubby+0.038
ited+0.037
 enough+0.036
vious+0.034
 disagreed+0.034
Neg logit contributions
tr-0.041
 ramp-0.041
 float-0.040
 tracks-0.040
 globe-0.039
Token it
Token ID340
Feature activation+3.75e-02

Pos logit contributions
 enough+0.044
ubby+0.042
ited+0.040
vious+0.038
 disagreed+0.037
Neg logit contributions
 ramp-0.053
tr-0.053
 float-0.051
 globe-0.050
 tracks-0.049
Token."
Token ID526
Feature activation-3.63e-03

Pos logit contributions
 ramp+0.056
 globe+0.055
tr+0.055
 tracks+0.055
 float+0.053
Neg logit contributions
 enough-0.064
ubby-0.061
ited-0.061
vious-0.057
 disagreed-0.056
Token Lily
Token ID20037
Feature activation-5.43e-02

Pos logit contributions
ited+0.008
ubby+0.008
vious+0.007
 enough+0.007
 disagreed+0.007
Neg logit contributions
 float-0.004
 tracks-0.004
 ramp-0.004
tr-0.004
 globe-0.003
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ubby+0.105
ited+0.096
years+0.093
joy+0.087
eared+0.086
Neg logit contributions
 tracks-0.076
tr-0.075
 float-0.075
 ramp-0.075
 rug-0.074
Token.
Token ID13
Feature activation-3.57e-03

Pos logit contributions
ited+0.210
ubby+0.205
vious+0.192
 enough+0.184
 disagreed+0.181
Neg logit contributions
 float-0.187
tr-0.183
 tracks-0.180
 ramp-0.179
 globe-0.177
Token
Token ID198
Feature activation-2.54e-02

Pos logit contributions
ubby+0.006
ited+0.006
 enough+0.006
vious+0.005
 disagreed+0.005
Neg logit contributions
tr-0.005
 ramp-0.005
 float-0.005
 tracks-0.005
 globe-0.005
Token
Token ID198
Feature activation-9.41e-03

Pos logit contributions
ited+0.054
ubby+0.052
 enough+0.051
vious+0.050
 disagreed+0.049
Neg logit contributions
 tracks-0.029
tr-0.028
 ramp-0.028
 float-0.028
 globe-0.027
TokenShe
Token ID3347
Feature activation-1.95e-02

Pos logit contributions
ited+0.013
ubby+0.013
 enough+0.012
vious+0.012
 disagreed+0.011
Neg logit contributions
tr-0.017
 ramp-0.017
 tracks-0.017
 float-0.017
 globe-0.017
Token was
Token ID373
Feature activation-3.46e-02

Pos logit contributions
ubby+0.038
ited+0.035
vious+0.033
joy+0.033
 enough+0.033
Neg logit contributions
tr-0.026
 ramp-0.025
 float-0.025
 tracks-0.025
 rug-0.024
Token?"
Token ID1701
Feature activation-1.49e-02

Pos logit contributions
ubby+0.073
 enough+0.070
ited+0.067
um+0.064
 disagreed+0.063
Neg logit contributions
 ramp-0.095
tr-0.093
 tracks-0.093
 globe-0.092
 float-0.092
Token
Token ID198
Feature activation-3.33e-02

Pos logit contributions
ubby+0.025
ited+0.024
 enough+0.024
 disagreed+0.023
vious+0.023
Neg logit contributions
tr-0.023
 ramp-0.022
 float-0.022
 tracks-0.022
 globe-0.022
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
ubby+0.054
ited+0.050
 enough+0.049
um+0.048
illy+0.047
Neg logit contributions
tr-0.054
 ramp-0.052
 float-0.052
 globe-0.050
 tracks-0.050
TokenL
Token ID43
Feature activation-6.31e-02

Pos logit contributions
ubby+0.018
ited+0.017
um+0.015
Fine+0.015
iley+0.015
Neg logit contributions
tr-0.025
 float-0.025
 ramp-0.025
 tracks-0.024
 hunt-0.024
Tokenily
Token ID813
Feature activation-5.54e-02

Pos logit contributions
ited+0.098
 enough+0.090
 disagreed+0.088
ubby+0.084
ised+0.081
Neg logit contributions
tr-0.108
nob-0.107
 ramp-0.105
 rug-0.103
 globe-0.102
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ubby+0.086
ited+0.079
 enough+0.073
vious+0.070
illy+0.070
Neg logit contributions
tr-0.096
 float-0.094
 tracks-0.093
 ramp-0.092
nob-0.090
Token,
Token ID11
Feature activation-4.36e-02

Pos logit contributions
ited+0.169
ubby+0.161
vious+0.146
 enough+0.140
 disagreed+0.137
Neg logit contributions
 float-0.231
tr-0.229
 globe-0.222
 tracks-0.221
 ramp-0.221
Token "
Token ID366
Feature activation-1.13e-02

Pos logit contributions
ubby+0.062
ited+0.060
 enough+0.060
 disagreed+0.056
ously+0.055
Neg logit contributions
 ramp-0.075
 tracks-0.074
tr-0.074
 float-0.073
 hunt-0.071
TokenThank
Token ID10449
Feature activation-7.74e-02

Pos logit contributions
ited+0.016
ubby+0.016
Fine+0.015
vious+0.015
 disagreed+0.014
Neg logit contributions
 ramp-0.020
 tracks-0.020
nob-0.019
tr-0.019
 hunt-0.019
Token you
Token ID345
Feature activation-5.50e-02

Pos logit contributions
ubby+0.128
ited+0.121
 enough+0.121
um+0.116
vious+0.115
Neg logit contributions
tr-0.123
 ramp-0.123
 float-0.119
 tracks-0.118
 globe-0.115
Token,
Token ID11
Feature activation-3.09e-02

Pos logit contributions
ited+0.075
ubby+0.074
 enough+0.067
vious+0.066
 disagreed+0.065
Neg logit contributions
tr-0.102
 ramp-0.101
 float-0.101
 tracks-0.100
 globe-0.098
Token we
Token ID356
Feature activation-2.87e-02

Pos logit contributions
ited+0.023
ubby+0.023
 enough+0.022
vious+0.021
 disagreed+0.021
Neg logit contributions
tr-0.017
 float-0.017
 ramp-0.017
 tracks-0.017
 globe-0.017
Token are
Token ID389
Feature activation-3.04e-02

Pos logit contributions
ubby+0.054
ited+0.052
joy+0.051
ously+0.051
um+0.051
Neg logit contributions
tr-0.036
 float-0.035
 hunt-0.033
 hide-0.032
 cozy-0.032
Token sorry
Token ID7926
Feature activation-6.61e-02

Pos logit contributions
ubby+0.035
 enough+0.033
ited+0.032
vious+0.030
 disagreed+0.030
Neg logit contributions
tr-0.063
 ramp-0.063
 float-0.062
 tracks-0.062
 globe-0.061
Token,"
Token ID553
Feature activation-2.04e-02

Pos logit contributions
ited+0.128
ubby+0.117
vious+0.112
years+0.107
 disagreed+0.106
Neg logit contributions
 float-0.094
 tracks-0.091
 ramp-0.090
tr-0.089
 spaceship-0.086
Token Lily
Token ID20037
Feature activation-6.94e-02

Pos logit contributions
ited+0.041
ubby+0.041
 enough+0.039
vious+0.038
 disagreed+0.037
Neg logit contributions
 float-0.024
 ramp-0.024
 tracks-0.024
tr-0.024
 globe-0.023
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ubby+0.139
ited+0.129
 End+0.120
 intentions+0.114
joy+0.113
Neg logit contributions
 tracks-0.094
tr-0.091
 float-0.086
 ramp-0.085
 hunt-0.085
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
ited+0.256
ubby+0.256
 enough+0.242
vious+0.239
 disagreed+0.234
Neg logit contributions
 float-0.132
tr-0.131
 ramp-0.130
 tracks-0.128
 globe-0.123
Token "
Token ID366
Feature activation-8.05e-03

Pos logit contributions
ited+0.027
ubby+0.026
 enough+0.025
vious+0.024
 disagreed+0.024
Neg logit contributions
tr-0.022
 float-0.022
 ramp-0.022
 tracks-0.021
 globe-0.021
TokenWe
Token ID1135
Feature activation-2.18e-02

Pos logit contributions
ubby+0.015
ited+0.015
 enough+0.014
vious+0.014
 disagreed+0.014
Neg logit contributions
 ramp-0.011
tr-0.011
 float-0.010
 tracks-0.010
 globe-0.010
Token wanted
Token ID2227
Feature activation-5.35e-02

Pos logit contributions
ubby+0.035
ited+0.034
joy+0.033
oph+0.032
 enough+0.032
Neg logit contributions
tr-0.034
 float-0.034
 globe-0.031
 ramp-0.031
 tracks-0.031
Token to
Token ID284
Feature activation+7.37e-03

Pos logit contributions
 enough+0.063
ubby+0.049
 disagreed+0.046
 intentions+0.045
vious+0.044
Neg logit contributions
 ramp-0.117
 globe-0.114
tr-0.111
nob-0.109
fire-0.108
Token owner
Token ID4870
Feature activation-1.58e-03

Pos logit contributions
ited+0.043
ubby+0.043
ously+0.042
 enough+0.039
years+0.038
Neg logit contributions
 float-0.030
tr-0.029
 tracks-0.028
 ramp-0.028
 globe-0.027
Token sees
Token ID7224
Feature activation-2.17e-02

Pos logit contributions
ubby+0.003
ited+0.003
ously+0.003
 enough+0.003
joy+0.003
Neg logit contributions
tr-0.002
 float-0.002
 ramp-0.002
 hunt-0.002
 tracks-0.002
Token us
Token ID514
Feature activation+2.43e-02

Pos logit contributions
ubby+0.036
ited+0.035
 enough+0.034
vious+0.032
 disagreed+0.032
Neg logit contributions
tr-0.033
 ramp-0.033
 float-0.033
 tracks-0.033
 globe-0.032
Token?"
Token ID1701
Feature activation-1.27e-02

Pos logit contributions
 float+0.037
tr+0.036
 ramp+0.036
 tracks+0.036
 hunt+0.036
Neg logit contributions
ited-0.040
ubby-0.040
 enough-0.037
vious-0.036
ously-0.036
Token Ben
Token ID3932
Feature activation-4.92e-02

Pos logit contributions
ubby+0.022
ited+0.022
 enough+0.021
vious+0.020
 disagreed+0.020
Neg logit contributions
tr-0.018
 ramp-0.018
 float-0.018
 tracks-0.018
 globe-0.018
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ubby+0.078
ited+0.071
 enough+0.068
vious+0.065
eared+0.064
Neg logit contributions
 float-0.082
tr-0.080
 ramp-0.078
 hunt-0.077
 tracks-0.077
Token,
Token ID11
Feature activation-3.48e-02

Pos logit contributions
ited+0.166
ubby+0.162
vious+0.145
oph+0.142
 enough+0.141
Neg logit contributions
 float-0.230
tr-0.222
 ramp-0.222
 tracks-0.219
 globe-0.217
Token "
Token ID366
Feature activation+1.31e-02

Pos logit contributions
ubby+0.057
ited+0.056
 enough+0.054
vious+0.052
 disagreed+0.051
Neg logit contributions
tr-0.054
 ramp-0.053
 float-0.053
 tracks-0.052
 globe-0.052
TokenDon
Token ID3987
Feature activation-5.67e-02

Pos logit contributions
 ramp+0.023
 tracks+0.022
 float+0.022
tr+0.022
 globe+0.022
Neg logit contributions
ited-0.019
ubby-0.019
vious-0.018
 enough-0.017
 disagreed-0.017
Token't
Token ID470
Feature activation-5.44e-02

Pos logit contributions
ubby+0.094
ited+0.094
 enough+0.089
vious+0.086
 disagreed+0.085
Neg logit contributions
tr-0.087
 ramp-0.086
 float-0.086
 tracks-0.084
 globe-0.082
Token be
Token ID307
Feature activation-2.87e-02

Pos logit contributions
ubby+0.051
ited+0.050
 enough+0.046
vious+0.042
joy+0.040
Neg logit contributions
 float-0.126
tr-0.123
 ramp-0.122
 tracks-0.121
 hunt-0.120
Token for
Token ID329
Feature activation+9.45e-04

Pos logit contributions
 ramp+0.037
tr+0.036
 float+0.036
 tracks+0.035
 globe+0.035
Neg logit contributions
ubby-0.034
ited-0.033
 enough-0.033
vious-0.030
 disagreed-0.030
Token it
Token ID340
Feature activation+3.79e-02

Pos logit contributions
 float+0.001
 ramp+0.001
tr+0.001
 tracks+0.001
 globe+0.001
Neg logit contributions
ubby-0.002
ited-0.002
 enough-0.001
vious-0.001
 disagreed-0.001
Token someday
Token ID25580
Feature activation+6.28e-03

Pos logit contributions
 ramp+0.055
tr+0.055
 float+0.054
 tracks+0.054
 globe+0.053
Neg logit contributions
ubby-0.065
 enough-0.065
ited-0.064
vious-0.060
 disagreed-0.060
Token."
Token ID526
Feature activation-3.24e-03

Pos logit contributions
tr+0.008
 ramp+0.008
 float+0.008
 tracks+0.008
 globe+0.008
Neg logit contributions
ubby-0.012
ited-0.012
 enough-0.011
vious-0.011
 disagreed-0.011
Token Anna
Token ID11735
Feature activation-4.37e-02

Pos logit contributions
ubby+0.005
ited+0.005
 enough+0.005
vious+0.005
 disagreed+0.005
Neg logit contributions
 float-0.005
 ramp-0.005
tr-0.005
 tracks-0.005
 globe-0.005
Token said
Token ID531
Feature activation-1.22e-01

Pos logit contributions
ubby+0.077
ited+0.071
vious+0.065
 enough+0.065
 End+0.064
Neg logit contributions
 float-0.066
 tracks-0.066
tr-0.065
 hunt-0.063
 ramp-0.063
Token,
Token ID11
Feature activation-2.84e-02

Pos logit contributions
ubby+0.161
 enough+0.158
ited+0.154
ously+0.143
um+0.143
Neg logit contributions
tr-0.226
 ramp-0.225
 float-0.219
 tracks-0.219
 globe-0.217
Token putting
Token ID5137
Feature activation-1.03e-02

Pos logit contributions
ubby+0.045
ited+0.045
 enough+0.044
 disagreed+0.042
ously+0.041
Neg logit contributions
tr-0.045
 ramp-0.044
 tracks-0.043
 globe-0.042
 float-0.042
Token the
Token ID262
Feature activation+8.05e-03

Pos logit contributions
ubby+0.017
ited+0.017
 enough+0.017
vious+0.016
 disagreed+0.016
Neg logit contributions
tr-0.016
 ramp-0.015
 float-0.015
 tracks-0.015
 globe-0.015
Token necklace
Token ID38987
Feature activation+4.87e-02

Pos logit contributions
tr+0.006
 ramp+0.006
 float+0.006
 tracks+0.005
 globe+0.005
Neg logit contributions
ubby-0.020
ited-0.020
 enough-0.019
 disagreed-0.019
vious-0.019
Token back
Token ID736
Feature activation+2.49e-02

Pos logit contributions
 ramp+0.069
 tracks+0.066
 float+0.065
tr+0.064
 hunt+0.064
Neg logit contributions
 enough-0.087
ubby-0.082
ited-0.080
ously-0.076
um-0.076
Token know
Token ID760
Feature activation-6.60e-02

Pos logit contributions
ubby+0.041
ited+0.041
ously+0.038
oph+0.038
ised+0.038
Neg logit contributions
 float-0.043
 hunt-0.040
 soar-0.038
 glide-0.038
 ramp-0.037
Token any
Token ID597
Feature activation-3.86e-02

Pos logit contributions
 enough+0.065
ubby+0.057
 disagreed+0.047
vious+0.047
um+0.047
Neg logit contributions
tr-0.150
 ramp-0.149
 globe-0.144
fire-0.144
nob-0.143
Token jokes
Token ID14532
Feature activation-1.30e-02

Pos logit contributions
ubby+0.059
 enough+0.057
ited+0.055
vious+0.054
 disagreed+0.052
Neg logit contributions
tr-0.064
 ramp-0.063
 globe-0.060
nob-0.060
 float-0.060
Token?"
Token ID1701
Feature activation-1.10e-02

Pos logit contributions
ubby+0.021
ited+0.020
 enough+0.019
vious+0.019
 disagreed+0.018
Neg logit contributions
tr-0.021
 float-0.021
 ramp-0.020
 tracks-0.020
 globe-0.020
Token Max
Token ID5436
Feature activation-6.02e-02

Pos logit contributions
ubby+0.020
ited+0.019
 enough+0.019
vious+0.018
 disagreed+0.018
Neg logit contributions
 float-0.015
tr-0.015
 ramp-0.015
 tracks-0.015
 globe-0.014
Token said
Token ID531
Feature activation-1.21e-01

Pos logit contributions
ubby+0.093
ited+0.088
 enough+0.082
um+0.078
vious+0.078
Neg logit contributions
 float-0.102
tr-0.100
 ramp-0.099
 tracks-0.099
 hunt-0.096
Token,
Token ID11
Feature activation-3.15e-02

Pos logit contributions
ubby+0.218
ited+0.215
 enough+0.207
vious+0.199
 disagreed+0.197
Neg logit contributions
tr-0.169
 ramp-0.168
 float-0.168
 tracks-0.164
 globe-0.161
Token "
Token ID366
Feature activation+2.16e-03

Pos logit contributions
ubby+0.048
ited+0.047
 enough+0.046
 disagreed+0.043
vious+0.043
Neg logit contributions
tr-0.052
 ramp-0.051
 tracks-0.051
 float-0.050
 globe-0.049
TokenYes
Token ID5297
Feature activation-6.89e-02

Pos logit contributions
 ramp+0.004
 tracks+0.004
tr+0.004
 float+0.004
 globe+0.004
Neg logit contributions
ited-0.003
ubby-0.003
vious-0.003
 enough-0.003
 disagreed-0.003
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
ited+0.093
ubby+0.091
vious+0.083
 enough+0.080
 disagreed+0.079
Neg logit contributions
 float-0.130
 ramp-0.127
tr-0.124
 tracks-0.124
 globe-0.124
Token I
Token ID314
Feature activation-3.83e-02

Pos logit contributions
ubby+0.029
 enough+0.028
ited+0.028
vious+0.027
 disagreed+0.026
Neg logit contributions
tr-0.023
 ramp-0.022
 float-0.022
 tracks-0.022
 globe-0.022
Token for
Token ID329
Feature activation-6.50e-03

Pos logit contributions
tr+0.060
 float+0.059
 ramp+0.057
 tracks+0.056
 globe+0.056
Neg logit contributions
ited-0.059
ubby-0.059
 enough-0.053
ously-0.052
vious-0.052
Token now
Token ID783
Feature activation+1.77e-03

Pos logit contributions
ited+0.011
ubby+0.011
 enough+0.010
vious+0.010
ously+0.010
Neg logit contributions
 float-0.010
 tracks-0.010
 ramp-0.010
tr-0.010
 hunt-0.009
Token?"
Token ID1701
Feature activation-1.64e-02

Pos logit contributions
 float+0.002
 hunt+0.002
tr+0.002
 ramp+0.002
 tracks+0.002
Neg logit contributions
ited-0.003
ubby-0.003
ously-0.003
vious-0.003
years-0.003
Token her
Token ID607
Feature activation-6.81e-02

Pos logit contributions
ubby+0.026
ited+0.025
 enough+0.024
vious+0.023
 disagreed+0.023
Neg logit contributions
tr-0.027
 ramp-0.027
 float-0.026
 tracks-0.026
 globe-0.026
Token mom
Token ID1995
Feature activation-4.04e-02

Pos logit contributions
ited+0.111
ubby+0.110
 enough+0.105
vious+0.102
 disagreed+0.099
Neg logit contributions
tr-0.106
 ramp-0.106
 float-0.106
 tracks-0.104
 globe-0.102
Token said
Token ID531
Feature activation-1.21e-01

Pos logit contributions
ubby+0.070
ited+0.063
joy+0.062
 intentions+0.062
 enough+0.062
Neg logit contributions
tr-0.061
nob-0.060
 float-0.057
 rug-0.057
 tracks-0.057
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
ited+0.185
ubby+0.181
vious+0.167
oph+0.160
 enough+0.159
Neg logit contributions
 float-0.208
tr-0.202
 tracks-0.201
 ramp-0.198
 globe-0.195
Token trying
Token ID2111
Feature activation-3.46e-02

Pos logit contributions
ubby+0.039
ited+0.038
 enough+0.037
vious+0.035
 disagreed+0.035
Neg logit contributions
tr-0.032
 ramp-0.032
 float-0.032
 tracks-0.031
 globe-0.031
Token to
Token ID284
Feature activation+1.10e-02

Pos logit contributions
 enough+0.043
ubby+0.040
ited+0.039
 disagreed+0.038
ously+0.037
Neg logit contributions
 ramp-0.068
tr-0.066
 float-0.066
 globe-0.065
 tracks-0.064
Token cheer
Token ID14042
Feature activation-8.22e-02

Pos logit contributions
tr+0.014
 ramp+0.014
 float+0.014
 tracks+0.013
 globe+0.013
Neg logit contributions
ubby-0.021
ited-0.021
 enough-0.020
vious-0.020
 disagreed-0.020
Token her
Token ID607
Feature activation-4.13e-02

Pos logit contributions
ited+0.141
ubby+0.137
vious+0.125
 enough+0.124
 disagreed+0.123
Neg logit contributions
tr-0.124
 float-0.123
 tracks-0.122
 ramp-0.119
nob-0.118
Token,
Token ID11
Feature activation-6.09e-03

Pos logit contributions
ubby+0.044
ited+0.044
 enough+0.042
vious+0.040
 disagreed+0.040
Neg logit contributions
tr-0.041
 float-0.041
 ramp-0.041
 tracks-0.041
 globe-0.040
Token it
Token ID340
Feature activation+1.21e-02

Pos logit contributions
ubby+0.011
ited+0.011
 enough+0.011
vious+0.010
 disagreed+0.010
Neg logit contributions
 float-0.008
tr-0.008
 ramp-0.008
 tracks-0.008
 globe-0.008
Token hurts
Token ID20406
Feature activation-5.89e-02

Pos logit contributions
 ramp+0.018
 globe+0.018
 float+0.017
 tracks+0.017
fire+0.017
Neg logit contributions
ubby-0.020
 enough-0.020
ited-0.020
vious-0.019
 disagreed-0.019
Token!"
Token ID2474
Feature activation-1.51e-02

Pos logit contributions
ited+0.114
ubby+0.109
joy+0.103
vious+0.103
oph+0.102
Neg logit contributions
 float-0.075
 tracks-0.075
tr-0.075
 ramp-0.072
 rug-0.071
Token Lily
Token ID20037
Feature activation-6.54e-02

Pos logit contributions
ubby+0.028
ited+0.028
 enough+0.026
vious+0.025
 disagreed+0.025
Neg logit contributions
 float-0.021
 ramp-0.020
 tracks-0.020
tr-0.020
 globe-0.019
Token said
Token ID531
Feature activation-1.21e-01

Pos logit contributions
ubby+0.110
ited+0.107
 enough+0.098
joy+0.095
um+0.094
Neg logit contributions
 tracks-0.101
 float-0.100
tr-0.099
 ramp-0.099
 rug-0.095
Token.
Token ID13
Feature activation-6.98e-03

Pos logit contributions
ubby+0.221
ited+0.216
 enough+0.213
vious+0.202
 disagreed+0.201
Neg logit contributions
tr-0.166
 ramp-0.166
 float-0.162
 tracks-0.160
 globe-0.159
Token
Token ID198
Feature activation-3.03e-02

Pos logit contributions
ubby+0.011
ited+0.011
 enough+0.010
vious+0.010
 disagreed+0.010
Neg logit contributions
tr-0.011
 ramp-0.011
 float-0.011
 tracks-0.011
 globe-0.011
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
ubby+0.054
ited+0.053
 enough+0.051
vious+0.049
um+0.049
Neg logit contributions
tr-0.043
 ramp-0.042
 float-0.042
 tracks-0.041
 globe-0.041
TokenTom
Token ID13787
Feature activation-4.99e-02

Pos logit contributions
ubby+0.017
ited+0.016
 enough+0.015
vious+0.014
 disagreed+0.014
Neg logit contributions
tr-0.028
 ramp-0.028
 float-0.027
 tracks-0.027
 globe-0.027
Token felt
Token ID2936
Feature activation-6.01e-02

Pos logit contributions
ubby+0.088
ited+0.081
 enough+0.078
 intentions+0.076
vious+0.076
Neg logit contributions
tr-0.074
 float-0.071
 rug-0.070
 ramp-0.070
 tracks-0.070
Token healthy
Token ID5448
Feature activation-2.84e-02

Pos logit contributions
ubby+0.002
ited+0.002
 enough+0.002
vious+0.002
 disagreed+0.002
Neg logit contributions
tr-0.002
 ramp-0.002
 float-0.002
 tracks-0.002
 globe-0.002
Token and
Token ID290
Feature activation+3.45e-02

Pos logit contributions
ited+0.056
ubby+0.055
oph+0.052
illy+0.051
ody+0.050
Neg logit contributions
 hunt-0.034
 float-0.034
 tracks-0.034
 cozy-0.034
 ramp-0.033
Token happy
Token ID3772
Feature activation-2.97e-02

Pos logit contributions
tr+0.057
 ramp+0.057
 float+0.056
 tracks+0.056
 globe+0.055
Neg logit contributions
ubby-0.053
ited-0.052
 enough-0.050
vious-0.048
 disagreed-0.048
Token,"
Token ID553
Feature activation-1.92e-02

Pos logit contributions
ubby+0.059
ited+0.059
 enough+0.056
vious+0.055
 disagreed+0.054
Neg logit contributions
tr-0.035
 float-0.035
 ramp-0.035
 tracks-0.034
 globe-0.033
Token Tommy
Token ID19919
Feature activation-5.91e-02

Pos logit contributions
ubby+0.035
ited+0.035
 enough+0.033
vious+0.033
 disagreed+0.032
Neg logit contributions
 float-0.026
tr-0.026
 ramp-0.026
 tracks-0.025
 globe-0.025
Token said
Token ID531
Feature activation-1.21e-01

Pos logit contributions
ubby+0.119
ited+0.106
joy+0.102
 End+0.101
years+0.098
Neg logit contributions
tr-0.077
 tracks-0.076
 float-0.074
 hunt-0.073
nob-0.072
Token.
Token ID13
Feature activation-9.02e-03

Pos logit contributions
ubby+0.255
ited+0.253
 enough+0.243
vious+0.236
 disagreed+0.233
Neg logit contributions
tr-0.131
 float-0.131
 ramp-0.130
 tracks-0.127
 globe-0.123
Token
Token ID198
Feature activation-3.32e-02

Pos logit contributions
ubby+0.016
ited+0.015
 enough+0.015
vious+0.014
um+0.014
Neg logit contributions
tr-0.013
 ramp-0.013
 float-0.013
 tracks-0.012
 globe-0.012
Token
Token ID198
Feature activation-1.56e-02

Pos logit contributions
ited+0.064
ubby+0.064
 enough+0.062
vious+0.060
 disagreed+0.059
Neg logit contributions
 ramp-0.042
tr-0.042
 float-0.041
 tracks-0.041
 globe-0.040
Token"
Token ID1
Feature activation-1.09e-02

Pos logit contributions
ited+0.024
ubby+0.023
 enough+0.023
vious+0.023
 disagreed+0.022
Neg logit contributions
 ramp-0.027
tr-0.026
 tracks-0.026
 float-0.025
 globe-0.025
TokenI
Token ID40
Feature activation-1.58e-02

Pos logit contributions
ubby+0.018
ited+0.018
 enough+0.017
vious+0.016
 disagreed+0.016
Neg logit contributions
tr-0.017
 float-0.017
 ramp-0.017
 tracks-0.016
 globe-0.016
Token you
Token ID345
Feature activation+4.40e-03

Pos logit contributions
ubby+0.036
ited+0.035
 enough+0.034
vious+0.033
 disagreed+0.033
Neg logit contributions
tr-0.025
 ramp-0.025
 tracks-0.025
 float-0.025
 globe-0.024
Token forgive
Token ID20927
Feature activation-4.62e-02

Pos logit contributions
 float+0.006
tr+0.006
 ramp+0.006
 tracks+0.006
 hunt+0.006
Neg logit contributions
ubby-0.008
ited-0.008
 enough-0.007
vious-0.007
 disagreed-0.007
Token me
Token ID502
Feature activation-2.29e-02

Pos logit contributions
ubby+0.063
ited+0.062
 enough+0.059
vious+0.056
 disagreed+0.055
Neg logit contributions
tr-0.084
 ramp-0.084
 float-0.084
 tracks-0.082
 globe-0.081
Token?"
Token ID1701
Feature activation-2.20e-02

Pos logit contributions
ubby+0.035
ited+0.035
 enough+0.033
vious+0.032
 disagreed+0.032
Neg logit contributions
 float-0.038
tr-0.037
 ramp-0.037
 tracks-0.036
 hunt-0.036
Token Tom
Token ID4186
Feature activation-5.79e-02

Pos logit contributions
ited+0.043
ubby+0.043
 enough+0.040
vious+0.039
 disagreed+0.038
Neg logit contributions
 float-0.028
 ramp-0.027
 tracks-0.027
tr-0.027
 globe-0.026
Token said
Token ID531
Feature activation-1.21e-01

Pos logit contributions
ubby+0.106
ited+0.100
 End+0.094
 efforts+0.094
ised+0.093
Neg logit contributions
 float-0.077
 tracks-0.075
 hunt-0.074
 ramp-0.073
 rug-0.073
Token.
Token ID13
Feature activation-7.68e-03

Pos logit contributions
ubby+0.245
ited+0.243
 enough+0.234
vious+0.227
 disagreed+0.224
Neg logit contributions
tr-0.140
 float-0.139
 ramp-0.139
 tracks-0.135
 globe-0.132
Token
Token ID198
Feature activation-3.28e-02

Pos logit contributions
ubby+0.012
 enough+0.011
ited+0.011
um+0.011
vious+0.011
Neg logit contributions
tr-0.012
 float-0.012
 ramp-0.012
 tracks-0.012
 hunt-0.012
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
ited+0.064
ubby+0.063
 enough+0.061
vious+0.059
 disagreed+0.059
Neg logit contributions
 ramp-0.041
 float-0.041
tr-0.041
 tracks-0.040
 globe-0.039
TokenL
Token ID43
Feature activation-6.09e-02

Pos logit contributions
ited+0.020
ubby+0.020
 enough+0.019
 disagreed+0.018
vious+0.018
Neg logit contributions
 ramp-0.025
tr-0.024
 tracks-0.024
 float-0.023
 globe-0.023
Tokenily
Token ID813
Feature activation-4.94e-02

Pos logit contributions
ited+0.112
 disagreed+0.103
 hesitation+0.101
 enough+0.099
ised+0.098
Neg logit contributions
nob-0.089
tr-0.087
 ramp-0.084
 rug-0.081
 globe-0.075
Token my
Token ID616
Feature activation-3.37e-02

Pos logit contributions
 enough+0.060
ubby+0.057
ited+0.053
vious+0.052
 disagreed+0.052
Neg logit contributions
 ramp-0.087
tr-0.087
 globe-0.084
 float-0.083
nob-0.082
Token duck
Token ID22045
Feature activation+1.96e-02

Pos logit contributions
ubby+0.063
 enough+0.060
ited+0.059
vious+0.056
 disagreed+0.056
Neg logit contributions
tr-0.047
 ramp-0.044
nob-0.043
 tracks-0.042
 float-0.042
Token back
Token ID736
Feature activation+1.30e-02

Pos logit contributions
tr+0.028
 ramp+0.028
 float+0.028
 tracks+0.027
 globe+0.027
Neg logit contributions
ubby-0.034
ited-0.034
 enough-0.033
vious-0.031
 disagreed-0.031
Token,"
Token ID553
Feature activation-2.20e-02

Pos logit contributions
tr+0.015
 ramp+0.015
 float+0.014
 tracks+0.014
 globe+0.014
Neg logit contributions
ubby-0.027
ited-0.026
 enough-0.026
vious-0.025
 disagreed-0.025
Token Lily
Token ID20037
Feature activation-6.14e-02

Pos logit contributions
ubby+0.036
ited+0.036
 enough+0.033
vious+0.032
um+0.031
Neg logit contributions
 float-0.035
 ramp-0.035
tr-0.034
 tracks-0.034
 globe-0.033
Token said
Token ID531
Feature activation-1.21e-01

Pos logit contributions
ubby+0.118
ited+0.109
years+0.101
joy+0.099
 End+0.098
Neg logit contributions
tr-0.085
 tracks-0.083
 float-0.082
 hunt-0.080
 rug-0.080
Token,
Token ID11
Feature activation-3.00e-02

Pos logit contributions
ubby+0.218
ited+0.218
 enough+0.205
vious+0.201
 disagreed+0.197
Neg logit contributions
tr-0.167
 float-0.166
 ramp-0.165
 tracks-0.162
 globe-0.159
Token trying
Token ID2111
Feature activation-5.19e-02

Pos logit contributions
ubby+0.049
ited+0.049
 enough+0.047
vious+0.045
 disagreed+0.044
Neg logit contributions
tr-0.046
 float-0.046
 ramp-0.046
 tracks-0.045
 globe-0.044
Token to
Token ID284
Feature activation-1.29e-03

Pos logit contributions
 enough+0.060
 disagreed+0.055
ubby+0.055
ited+0.053
ously+0.053
Neg logit contributions
 ramp-0.106
 float-0.102
tr-0.101
 globe-0.101
 tracks-0.101
Token grab
Token ID5552
Feature activation-4.26e-02

Pos logit contributions
ubby+0.002
 enough+0.002
ited+0.002
vious+0.002
 disagreed+0.002
Neg logit contributions
tr-0.002
 globe-0.002
 ramp-0.002
 tracks-0.002
 spaceship-0.002
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
ubby+0.063
 enough+0.062
ited+0.061
vious+0.057
um+0.056
Neg logit contributions
tr-0.074
 ramp-0.073
 float-0.072
 tracks-0.070
 globe-0.070
Token asked
Token ID1965
Feature activation-7.27e-02

Pos logit contributions
ubby+0.045
joy+0.043
 intentions+0.043
ited+0.042
 hesitation+0.042
Neg logit contributions
tr-0.028
 hunt-0.027
 tracks-0.027
 float-0.026
 carpet-0.026
Token him
Token ID683
Feature activation-2.26e-02

Pos logit contributions
ited+0.156
ubby+0.148
vious+0.141
oph+0.132
ody+0.132
Neg logit contributions
 hunt-0.091
 Fl-0.086
nob-0.084
 tracks-0.083
 rug-0.083
Token to
Token ID284
Feature activation+2.85e-03

Pos logit contributions
ubby+0.031
ited+0.030
 enough+0.030
vious+0.028
 disagreed+0.028
Neg logit contributions
tr-0.040
 ramp-0.040
 float-0.040
 tracks-0.040
 globe-0.039
Token clean
Token ID3424
Feature activation-1.92e-03

Pos logit contributions
tr+0.004
 ramp+0.004
 globe+0.004
 tracks+0.004
 float+0.003
Neg logit contributions
ubby-0.005
ited-0.005
 enough-0.005
um-0.005
 disagreed-0.005
Token the
Token ID262
Feature activation+3.71e-02

Pos logit contributions
 enough+0.003
ubby+0.003
ited+0.003
um+0.003
vious+0.003
Neg logit contributions
tr-0.003
 ramp-0.003
 float-0.003
 globe-0.003
fire-0.003
Token bathroom
Token ID12436
Feature activation+5.63e-02

Pos logit contributions
tr+0.040
nob+0.036
 ramp+0.035
 globe+0.034
 float+0.034
Neg logit contributions
ubby-0.080
 enough-0.079
ited-0.075
um-0.074
vious-0.073
Token.
Token ID13
Feature activation+2.59e-02

Pos logit contributions
tr+0.067
 ramp+0.066
 float+0.066
 tracks+0.065
 globe+0.063
Neg logit contributions
ubby-0.112
ited-0.110
 enough-0.107
vious-0.103
 disagreed-0.102
Token Tim
Token ID5045
Feature activation-2.36e-02

Pos logit contributions
tr+0.035
 ramp+0.035
 float+0.035
 tracks+0.034
 globe+0.033
Neg logit contributions
ubby-0.048
ited-0.047
 enough-0.045
vious-0.044
 disagreed-0.043
Tokenmy
Token ID1820
Feature activation-1.35e-02

Pos logit contributions
 hesitation+0.047
joy+0.047
ubby+0.046
 intentions+0.044
 enough+0.043
Neg logit contributions
nob-0.030
 rug-0.030
tr-0.029
 tracks-0.026
 carpet-0.026
Token didn
Token ID1422
Feature activation-4.88e-02

Pos logit contributions
ubby+0.025
joy+0.025
 hesitation+0.024
 enough+0.023
ited+0.023
Neg logit contributions
 rug-0.018
tr-0.018
 float-0.017
nob-0.017
 tracks-0.017
Token't
Token ID470
Feature activation-1.98e-02

Pos logit contributions
ubby+0.074
ited+0.071
 enough+0.070
vious+0.066
um+0.066
Neg logit contributions
tr-0.081
 ramp-0.080
 float-0.079
 tracks-0.079
 globe-0.078
Token She
Token ID1375
Feature activation-1.19e-03

Pos logit contributions
 ramp+0.025
 float+0.025
tr+0.025
 tracks+0.025
 globe+0.024
Neg logit contributions
ubby-0.020
ited-0.020
 enough-0.019
vious-0.018
 disagreed-0.018
Token loved
Token ID6151
Feature activation-3.08e-02

Pos logit contributions
ubby+0.001
ited+0.001
 disagreed+0.001
 enough+0.001
vious+0.001
Neg logit contributions
 ramp-0.002
 float-0.002
tr-0.002
 tracks-0.002
 globe-0.002
Token to
Token ID284
Feature activation+2.09e-02

Pos logit contributions
ubby+0.043
ited+0.042
 enough+0.041
vious+0.038
 disagreed+0.038
Neg logit contributions
 ramp-0.055
tr-0.055
 float-0.054
 globe-0.053
 tracks-0.053
Token walk
Token ID2513
Feature activation+8.31e-03

Pos logit contributions
tr+0.027
 ramp+0.026
 globe+0.026
fire+0.025
 tracks+0.024
Neg logit contributions
ubby-0.037
ited-0.037
 enough-0.036
 disagreed-0.035
um-0.035
Token in
Token ID287
Feature activation+6.08e-02

Pos logit contributions
 ramp+0.016
tr+0.014
 globe+0.014
fire+0.013
 glide+0.013
Neg logit contributions
 enough-0.012
ubby-0.011
um-0.011
ited-0.010
ised-0.010
Token the
Token ID262
Feature activation+5.90e-02

Pos logit contributions
tr+0.110
 ramp+0.105
 float+0.104
 globe+0.102
fire+0.100
Neg logit contributions
ubby-0.086
 enough-0.085
ited-0.083
vious-0.076
um-0.074
Token park
Token ID3952
Feature activation+3.54e-02

Pos logit contributions
fire+0.094
 ramp+0.093
 globe+0.091
 float+0.089
nob+0.089
Neg logit contributions
 enough-0.091
ubby-0.091
um-0.088
vious-0.087
ited-0.087
Token with
Token ID351
Feature activation+3.47e-02

Pos logit contributions
 ramp+0.059
tr+0.054
 globe+0.053
 spaceship+0.052
 float+0.052
Neg logit contributions
 enough-0.057
ited-0.055
ubby-0.053
 disagreed-0.053
um-0.051
Token her
Token ID607
Feature activation-5.30e-03

Pos logit contributions
 ramp+0.078
tr+0.075
fire+0.074
 globe+0.074
 float+0.072
Neg logit contributions
ubby-0.036
 enough-0.035
um-0.034
ited-0.033
illy-0.030
Token mom
Token ID1995
Feature activation-1.75e-03

Pos logit contributions
ubby+0.007
 enough+0.006
um+0.005
joy+0.005
 disagreed+0.005
Neg logit contributions
nob-0.011
 ramp-0.011
tr-0.010
fire-0.010
 float-0.010
Tokenmy
Token ID1820
Feature activation-4.99e-03

Pos logit contributions
ubby+0.002
ited+0.002
 enough+0.002
vious+0.002
 disagreed+0.002
Neg logit contributions
 ramp-0.003
tr-0.003
 float-0.003
 tracks-0.003
 globe-0.003
Token decided
Token ID3066
Feature activation-2.88e-02

Pos logit contributions
ubby+0.031
ited+0.030
vious+0.027
joy+0.027
Fine+0.026
Neg logit contributions
 float-0.030
 tracks-0.029
tr-0.029
 hunt-0.029
 spaceship-0.029
Token to
Token ID284
Feature activation+1.88e-02

Pos logit contributions
ubby+0.053
ited+0.052
vious+0.047
um+0.045
illy+0.044
Neg logit contributions
 float-0.042
 tracks-0.042
tr-0.041
 hunt-0.040
 ramp-0.040
Token close
Token ID1969
Feature activation-5.90e-03

Pos logit contributions
tr+0.024
 ramp+0.024
 tracks+0.023
 globe+0.023
 float+0.023
Neg logit contributions
ubby-0.035
ited-0.035
 enough-0.034
 disagreed-0.033
vious-0.032
Token the
Token ID262
Feature activation+3.23e-02

Pos logit contributions
ited+0.011
ubby+0.011
vious+0.010
 enough+0.010
 disagreed+0.010
Neg logit contributions
 float-0.008
 tracks-0.008
 ramp-0.008
tr-0.008
 globe-0.008
Token palace
Token ID20562
Feature activation+5.46e-02

Pos logit contributions
 float+0.040
tr+0.040
 ramp+0.039
 tracks+0.037
 globe+0.036
Neg logit contributions
ubby-0.062
 enough-0.061
ited-0.061
vious-0.057
 disagreed-0.057
Token and
Token ID290
Feature activation+6.32e-02

Pos logit contributions
tr+0.083
 ramp+0.083
 float+0.082
 tracks+0.080
 globe+0.079
Neg logit contributions
ubby-0.089
ited-0.088
 enough-0.087
vious-0.081
 disagreed-0.081
Token go
Token ID467
Feature activation+3.19e-02

Pos logit contributions
tr+0.089
 ramp+0.088
 globe+0.086
 float+0.082
 tracks+0.082
Neg logit contributions
 enough-0.111
ited-0.107
 disagreed-0.105
ubby-0.104
vious-0.100
Token away
Token ID1497
Feature activation+1.61e-02

Pos logit contributions
tr+0.055
 globe+0.049
 ramp+0.049
 float+0.047
 planet+0.046
Neg logit contributions
 enough-0.052
ubby-0.047
ited-0.045
ously-0.043
joy-0.043
Token.
Token ID13
Feature activation+5.22e-02

Pos logit contributions
tr+0.023
 ramp+0.022
 float+0.021
 tracks+0.021
 globe+0.021
Neg logit contributions
 enough-0.029
ubby-0.029
ited-0.028
 disagreed-0.027
vious-0.026
Token All
Token ID1439
Feature activation+1.49e-02

Pos logit contributions
tr+0.079
 float+0.078
 ramp+0.078
 globe+0.076
 tracks+0.074
Neg logit contributions
ubby-0.086
 enough-0.085
ited-0.082
vious-0.079
um-0.078
Token the
Token ID262
Feature activation-5.27e-03

Pos logit contributions
 float+0.022
 tracks+0.022
 ramp+0.021
 globe+0.020
 spaceship+0.020
Neg logit contributions
ubby-0.025
ited-0.024
um-0.023
oph-0.022
 enough-0.022
Token rolled
Token ID11686
Feature activation+2.08e-03

Pos logit contributions
tr+0.032
 ramp+0.032
 float+0.031
 tracks+0.031
 globe+0.030
Neg logit contributions
ubby-0.045
ited-0.044
 enough-0.043
vious-0.041
 disagreed-0.041
Token under
Token ID739
Feature activation+1.92e-02

Pos logit contributions
tr+0.002
 ramp+0.002
 float+0.002
 tracks+0.002
 globe+0.002
Neg logit contributions
ubby-0.004
ited-0.004
 enough-0.004
vious-0.004
 disagreed-0.004
Token the
Token ID262
Feature activation+4.53e-02

Pos logit contributions
tr+0.033
 ramp+0.032
 float+0.032
 tracks+0.032
 globe+0.031
Neg logit contributions
ubby-0.029
ited-0.028
 enough-0.027
vious-0.026
 disagreed-0.025
Token couch
Token ID18507
Feature activation+4.22e-02

Pos logit contributions
 float+0.047
 glide+0.047
 soar+0.043
tr+0.042
 ramp+0.042
Neg logit contributions
ubby-0.089
um-0.086
 enough-0.084
vious-0.084
illy-0.082
Token where
Token ID810
Feature activation+2.05e-02

Pos logit contributions
 ramp+0.057
 float+0.057
 globe+0.056
tr+0.056
 tracks+0.055
Neg logit contributions
 enough-0.075
ited-0.074
ubby-0.073
 disagreed-0.070
vious-0.069
Token it
Token ID340
Feature activation+5.02e-02

Pos logit contributions
 ramp+0.031
tr+0.030
 float+0.030
 globe+0.029
 tracks+0.029
Neg logit contributions
ubby-0.034
 enough-0.034
ited-0.034
 disagreed-0.032
vious-0.032
Token was
Token ID373
Feature activation+1.48e-02

Pos logit contributions
 globe+0.094
 ramp+0.093
 outer+0.091
 tracks+0.090
fire+0.089
Neg logit contributions
 disagreed-0.066
 enough-0.062
ited-0.061
vious-0.057
 argued-0.053
Token even
Token ID772
Feature activation+2.08e-02

Pos logit contributions
 ramp+0.025
 globe+0.024
 tracks+0.024
tr+0.023
 spaceship+0.023
Neg logit contributions
 enough-0.024
ited-0.022
 disagreed-0.021
ubby-0.021
vious-0.020
Token hotter
Token ID37546
Feature activation+1.97e-03

Pos logit contributions
tr+0.030
 ramp+0.029
 float+0.029
 tracks+0.029
 globe+0.028
Neg logit contributions
ubby-0.037
ited-0.036
 enough-0.035
vious-0.034
 disagreed-0.033
Token.
Token ID13
Feature activation+3.82e-02

Pos logit contributions
tr+0.003
 ramp+0.003
 float+0.003
 tracks+0.002
 globe+0.002
Neg logit contributions
 enough-0.004
ubby-0.004
ited-0.004
um-0.003
vious-0.003
Token
Token ID220
Feature activation-2.33e-02

Pos logit contributions
 ramp+0.061
 float+0.060
tr+0.060
 globe+0.060
 hunt+0.059
Neg logit contributions
ubby-0.058
ited-0.057
 enough-0.057
um-0.056
 disagreed-0.055
Token he
Token ID339
Feature activation+2.30e-02

Pos logit contributions
tr+0.099
 ramp+0.096
 globe+0.095
 float+0.091
nob+0.090
Neg logit contributions
 disagreed-0.051
 enough-0.049
ited-0.048
vious-0.044
ody-0.042
Token saw
Token ID2497
Feature activation-3.75e-02

Pos logit contributions
tr+0.047
 ramp+0.047
 globe+0.046
 float+0.046
 tracks+0.045
Neg logit contributions
ited-0.026
ubby-0.026
 enough-0.025
 disagreed-0.024
vious-0.023
Token something
Token ID1223
Feature activation+1.59e-02

Pos logit contributions
ited+0.075
ubby+0.074
 disagreed+0.064
um+0.064
oph+0.064
Neg logit contributions
 tracks-0.051
 float-0.049
tr-0.048
 ramp-0.047
 rug-0.046
Token shiny
Token ID22441
Feature activation-1.72e-02

Pos logit contributions
 ramp+0.028
tr+0.027
nob+0.027
 globe+0.026
 blasted+0.026
Neg logit contributions
 enough-0.023
 disagreed-0.021
ubby-0.020
um-0.019
illy-0.019
Token on
Token ID319
Feature activation+2.22e-02

Pos logit contributions
ubby+0.036
ited+0.036
vious+0.033
 disagreed+0.033
 enough+0.033
Neg logit contributions
 float-0.019
tr-0.019
 ramp-0.018
 globe-0.018
 tracks-0.018
Token the
Token ID262
Feature activation+5.71e-02

Pos logit contributions
tr+0.036
 ramp+0.035
 float+0.035
 tracks+0.034
 globe+0.034
Neg logit contributions
ubby-0.035
ited-0.034
 enough-0.033
vious-0.032
 disagreed-0.031
Token ground
Token ID2323
Feature activation+5.02e-02

Pos logit contributions
tr+0.073
nob+0.067
 ramp+0.067
 float+0.065
 spaceship+0.065
Neg logit contributions
ubby-0.103
 enough-0.099
um-0.098
vious-0.097
ige-0.097
Token.
Token ID13
Feature activation+4.63e-02

Pos logit contributions
 ramp+0.070
tr+0.070
 tracks+0.066
 float+0.066
 globe+0.066
Neg logit contributions
 enough-0.088
ited-0.084
ubby-0.084
vious-0.081
 disagreed-0.081
Token He
Token ID679
Feature activation+2.59e-02

Pos logit contributions
tr+0.073
 ramp+0.072
 float+0.072
 tracks+0.071
 globe+0.070
Neg logit contributions
ubby-0.075
ited-0.074
 enough-0.071
vious-0.068
 disagreed-0.067
Token picked
Token ID6497
Feature activation-1.18e-02

Pos logit contributions
tr+0.044
 float+0.044
 ramp+0.043
 tracks+0.043
 globe+0.042
Neg logit contributions
ubby-0.039
ited-0.038
 enough-0.036
vious-0.035
um-0.033
Token it
Token ID340
Feature activation+4.62e-02

Pos logit contributions
ubby+0.022
ited+0.022
 enough+0.021
vious+0.020
 disagreed+0.020
Neg logit contributions
tr-0.016
 float-0.015
 ramp-0.015
 tracks-0.015
 globe-0.015
Token looked
Token ID3114
Feature activation-1.93e-02

Pos logit contributions
 ramp+0.079
tr+0.078
 globe+0.076
 float+0.076
 tracks+0.074
Neg logit contributions
 enough-0.054
 disagreed-0.052
ited-0.052
ously-0.046
 argued-0.045
Token sadly
Token ID21098
Feature activation-2.37e-02

Pos logit contributions
ubby+0.031
ited+0.031
 enough+0.030
vious+0.028
 disagreed+0.028
Neg logit contributions
tr-0.031
 ramp-0.031
 float-0.030
 tracks-0.030
 globe-0.029
Token at
Token ID379
Feature activation-2.50e-02

Pos logit contributions
ubby+0.030
 enough+0.030
ited+0.029
 disagreed+0.027
um+0.026
Neg logit contributions
 ramp-0.046
tr-0.046
 globe-0.045
 float-0.044
 tracks-0.044
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
ubby+0.042
ited+0.041
 enough+0.039
vious+0.038
 disagreed+0.037
Neg logit contributions
 ramp-0.038
 float-0.038
tr-0.038
 tracks-0.038
 globe-0.037
Token lock
Token ID5793
Feature activation+1.29e-02

Pos logit contributions
 float+0.012
tr+0.011
 ramp+0.011
 tracks+0.011
nob+0.011
Neg logit contributions
ubby-0.022
 enough-0.022
ited-0.022
vious-0.021
um-0.021
Token.
Token ID13
Feature activation+3.67e-02

Pos logit contributions
tr+0.015
 ramp+0.015
 float+0.015
 tracks+0.015
 globe+0.015
Neg logit contributions
ubby-0.025
ited-0.025
 enough-0.024
vious-0.024
 disagreed-0.023
Token There
Token ID1318
Feature activation-2.84e-03

Pos logit contributions
tr+0.050
 ramp+0.050
 float+0.050
 tracks+0.049
 globe+0.048
Neg logit contributions
ubby-0.067
ited-0.066
 enough-0.064
vious-0.061
 disagreed-0.061
Token was
Token ID373
Feature activation+1.18e-02

Pos logit contributions
ubby+0.005
ited+0.005
vious+0.004
 enough+0.004
oph+0.004
Neg logit contributions
 float-0.004
 tracks-0.004
 ramp-0.004
tr-0.004
 globe-0.004
Token no
Token ID645
Feature activation-3.00e-02

Pos logit contributions
tr+0.019
 tracks+0.019
 ramp+0.018
 float+0.018
 globe+0.018
Neg logit contributions
ubby-0.020
ited-0.020
vious-0.018
um-0.017
oph-0.017
Token way
Token ID835
Feature activation+1.03e-02

Pos logit contributions
ubby+0.054
ited+0.053
 enough+0.053
vious+0.050
 disagreed+0.049
Neg logit contributions
tr-0.041
 ramp-0.041
 float-0.041
 tracks-0.040
 globe-0.040
Token to
Token ID284
Feature activation+1.05e-02

Pos logit contributions
tr+0.017
 ramp+0.016
 globe+0.016
fire+0.016
 float+0.016
Neg logit contributions
 enough-0.017
ubby-0.015
ously-0.014
vious-0.014
um-0.013
Token little
Token ID1310
Feature activation-4.81e-02

Pos logit contributions
ited+0.091
ubby+0.091
ously+0.088
vious+0.086
 disagreed+0.085
Neg logit contributions
 float-0.062
 tracks-0.060
 ramp-0.058
 cozy-0.057
tr-0.057
Token girl
Token ID2576
Feature activation+5.94e-03

Pos logit contributions
ubby+0.077
arre+0.075
ited+0.075
oke+0.072
years+0.072
Neg logit contributions
 float-0.075
 tracks-0.072
 cozy-0.070
 spaceship-0.069
 ramp-0.069
Token and
Token ID290
Feature activation-5.27e-03

Pos logit contributions
tr+0.009
 float+0.009
 tracks+0.009
 ramp+0.009
 globe+0.008
Neg logit contributions
ubby-0.010
ited-0.010
 enough-0.009
vious-0.009
um-0.009
Token the
Token ID262
Feature activation-4.07e-02

Pos logit contributions
ubby+0.009
ited+0.009
vious+0.008
 enough+0.008
um+0.008
Neg logit contributions
 float-0.008
 tracks-0.008
tr-0.008
 ramp-0.008
 globe-0.008
Token dove
Token ID38862
Feature activation+3.35e-02

Pos logit contributions
ited+0.090
ubby+0.089
vious+0.084
 enough+0.084
 disagreed+0.083
Neg logit contributions
 float-0.041
 tracks-0.040
 ramp-0.039
tr-0.038
 cozy-0.037
Token became
Token ID2627
Feature activation-5.14e-03

Pos logit contributions
 ramp+0.055
tr+0.054
 globe+0.053
 float+0.052
 tracks+0.050
Neg logit contributions
ited-0.051
 enough-0.051
ubby-0.050
 disagreed-0.048
ously-0.046
Token great
Token ID1049
Feature activation-2.03e-02

Pos logit contributions
ubby+0.008
ited+0.008
 enough+0.008
vious+0.007
 disagreed+0.007
Neg logit contributions
tr-0.008
 ramp-0.008
 float-0.008
 tracks-0.008
 globe-0.008
Token friends
Token ID2460
Feature activation-1.31e-02

Pos logit contributions
ited+0.030
ubby+0.030
vious+0.027
oph+0.026
um+0.026
Neg logit contributions
 float-0.035
 tracks-0.034
 ramp-0.034
 hunt-0.033
tr-0.033
Token.
Token ID13
Feature activation+4.44e-02

Pos logit contributions
ubby+0.023
ited+0.022
 enough+0.022
vious+0.021
 disagreed+0.020
Neg logit contributions
tr-0.019
 ramp-0.019
 float-0.019
 tracks-0.018
 globe-0.018
Token They
Token ID1119
Feature activation+3.47e-02

Pos logit contributions
 ramp+0.073
tr+0.073
 float+0.072
 globe+0.070
 tracks+0.070
Neg logit contributions
ubby-0.068
ited-0.066
 enough-0.065
vious-0.062
um-0.061
Token spent
Token ID3377
Feature activation-1.13e-02

Pos logit contributions
 ramp+0.057
tr+0.056
 float+0.055
 tracks+0.055
 globe+0.054
Neg logit contributions
ited-0.054
ubby-0.053
 enough-0.052
 disagreed-0.049
vious-0.048
Token she
Token ID673
Feature activation+1.26e-02

Pos logit contributions
tr+0.029
 ramp+0.029
nob+0.028
fire+0.028
 hunt+0.027
Neg logit contributions
 enough-0.023
ubby-0.020
 End-0.020
joy-0.020
 disagreed-0.019
Token ate
Token ID15063
Feature activation-3.33e-03

Pos logit contributions
 float+0.019
tr+0.019
 ramp+0.019
 tracks+0.018
 globe+0.018
Neg logit contributions
ubby-0.021
ited-0.021
 enough-0.020
vious-0.019
um-0.019
Token,
Token ID11
Feature activation+1.77e-02

Pos logit contributions
 enough+0.005
ubby+0.005
ited+0.004
um+0.004
vious+0.004
Neg logit contributions
tr-0.006
 ramp-0.006
 tracks-0.006
 globe-0.006
 float-0.006
Token and
Token ID290
Feature activation+3.38e-02

Pos logit contributions
tr+0.028
 ramp+0.027
 float+0.027
 tracks+0.027
 globe+0.026
Neg logit contributions
ubby-0.029
ited-0.028
 enough-0.027
vious-0.026
 disagreed-0.026
Token when
Token ID618
Feature activation+6.35e-03

Pos logit contributions
 float+0.049
tr+0.049
 tracks+0.048
 ramp+0.046
 globe+0.046
Neg logit contributions
ubby-0.062
ited-0.059
vious-0.054
um-0.053
 enough-0.053
Token she
Token ID673
Feature activation+7.01e-03

Pos logit contributions
tr+0.011
 ramp+0.011
nob+0.011
 globe+0.011
 tracks+0.011
Neg logit contributions
 enough-0.009
 End-0.008
ubby-0.008
 disagreed-0.008
ited-0.008
Token went
Token ID1816
Feature activation-1.57e-03

Pos logit contributions
 float+0.011
tr+0.010
 ramp+0.010
 tracks+0.010
 globe+0.010
Neg logit contributions
ubby-0.012
ited-0.012
 enough-0.011
vious-0.011
um-0.011
Token to
Token ID284
Feature activation+3.54e-02

Pos logit contributions
 enough+0.002
ubby+0.001
joy+0.001
um+0.001
ously+0.001
Neg logit contributions
tr-0.003
 globe-0.003
 ramp-0.003
nob-0.003
 spir-0.003
Token bed
Token ID3996
Feature activation-1.11e-02

Pos logit contributions
tr+0.041
 globe+0.037
fire+0.035
 ramp+0.034
nob+0.034
Neg logit contributions
 enough-0.074
ubby-0.071
ited-0.069
illy-0.067
 hesitation-0.067
Token.
Token ID13
Feature activation+4.45e-02

Pos logit contributions
ubby+0.018
ited+0.018
 enough+0.017
vious+0.016
 disagreed+0.016
Neg logit contributions
tr-0.018
 ramp-0.017
 float-0.017
 tracks-0.017
 globe-0.017
Token She
Token ID1375
Feature activation+3.17e-02

Pos logit contributions
 globe+0.091
 ramp+0.090
fire+0.090
 float+0.087
 hunt+0.087
Neg logit contributions
ubby-0.050
 enough-0.048
um-0.047
vious-0.046
oph-0.045
Token sat
Token ID3332
Feature activation-2.08e-02

Pos logit contributions
tr+0.020
 ramp+0.019
 float+0.019
 tracks+0.019
 globe+0.019
Neg logit contributions
ubby-0.023
ited-0.022
 enough-0.021
vious-0.021
 disagreed-0.020
Token down
Token ID866
Feature activation+1.78e-02

Pos logit contributions
ubby+0.043
ited+0.042
 enough+0.042
vious+0.040
 disagreed+0.040
Neg logit contributions
tr-0.023
 ramp-0.023
 float-0.023
 tracks-0.022
 globe-0.021
Token and
Token ID290
Feature activation+5.92e-02

Pos logit contributions
 ramp+0.033
tr+0.033
 float+0.033
 tracks+0.032
 hunt+0.032
Neg logit contributions
 enough-0.023
ubby-0.023
ited-0.021
um-0.020
vious-0.020
Token Tom
Token ID4186
Feature activation+1.82e-02

Pos logit contributions
 ramp+0.120
 globe+0.109
 tracks+0.109
 float+0.108
 rug+0.107
Neg logit contributions
 disagreed-0.078
ited-0.075
 enough-0.074
 argued-0.069
 understood-0.065
Token helped
Token ID4193
Feature activation-3.24e-02

Pos logit contributions
 ramp+0.031
 float+0.031
 globe+0.030
tr+0.030
 tracks+0.029
Neg logit contributions
 enough-0.027
ited-0.027
 disagreed-0.026
ubby-0.024
vious-0.024
Token Tim
Token ID5045
Feature activation+3.30e-02

Pos logit contributions
ubby+0.041
ited+0.041
 enough+0.040
vious+0.037
 disagreed+0.037
Neg logit contributions
tr-0.061
 ramp-0.061
 float-0.061
 tracks-0.060
 globe-0.059
Token read
Token ID1100
Feature activation-3.17e-03

Pos logit contributions
 ramp+0.048
 globe+0.047
 float+0.046
 spaceship+0.045
 tracks+0.044
Neg logit contributions
 enough-0.058
ited-0.056
 disagreed-0.053
ubby-0.052
vious-0.052
Token.
Token ID13
Feature activation+4.26e-02

Pos logit contributions
 enough+0.005
ited+0.004
ubby+0.004
 disagreed+0.004
um+0.004
Neg logit contributions
tr-0.005
 ramp-0.005
 float-0.005
 globe-0.005
 tracks-0.005
Token Soon
Token ID15894
Feature activation-2.66e-02

Pos logit contributions
 float+0.076
 ramp+0.075
 globe+0.075
tr+0.074
 hunt+0.073
Neg logit contributions
 enough-0.055
ubby-0.055
um-0.053
ited-0.053
 disagreed-0.052
Token,
Token ID11
Feature activation+2.22e-02

Pos logit contributions
ited+0.042
ubby+0.041
vious+0.036
heart+0.034
ody+0.034
Neg logit contributions
 float-0.046
 tracks-0.045
 ramp-0.045
 spaceship-0.044
 globe-0.043
Token Tim
Token ID5045
Feature activation+2.47e-03

Pos logit contributions
tr+0.037
 ramp+0.036
 float+0.036
 tracks+0.035
nob+0.035
Neg logit contributions
 enough-0.034
ubby-0.032
ited-0.032
 disagreed-0.031
vious-0.030
Token to
Token ID284
Feature activation+1.97e-02

Pos logit contributions
ubby+0.017
ited+0.017
um+0.015
illy+0.015
 End+0.015
Neg logit contributions
 float-0.014
 tracks-0.014
tr-0.013
 hunt-0.013
 ramp-0.013
Token go
Token ID467
Feature activation+2.02e-02

Pos logit contributions
tr+0.027
 ramp+0.026
 float+0.026
 tracks+0.026
 globe+0.025
Neg logit contributions
ubby-0.036
ited-0.036
 enough-0.035
vious-0.033
 disagreed-0.033
Token away
Token ID1497
Feature activation+3.57e-03

Pos logit contributions
tr+0.027
 ramp+0.027
 float+0.027
 tracks+0.027
 globe+0.026
Neg logit contributions
ubby-0.037
ited-0.036
 enough-0.035
vious-0.034
 disagreed-0.033
Token,
Token ID11
Feature activation+2.19e-02

Pos logit contributions
 float+0.006
 ramp+0.006
tr+0.006
 tracks+0.006
 globe+0.006
Neg logit contributions
ubby-0.005
ited-0.005
 enough-0.005
vious-0.004
 disagreed-0.004
Token and
Token ID290
Feature activation+3.01e-02

Pos logit contributions
 float+0.034
 ramp+0.034
tr+0.034
 tracks+0.033
 globe+0.032
Neg logit contributions
ubby-0.036
ited-0.035
 enough-0.033
vious-0.033
um-0.032
Token he
Token ID339
Feature activation+1.69e-02

Pos logit contributions
tr+0.045
 ramp+0.045
 float+0.044
 tracks+0.044
 globe+0.043
Neg logit contributions
ubby-0.050
ited-0.050
 enough-0.049
vious-0.046
 disagreed-0.046
Token couldn
Token ID3521
Feature activation-2.65e-02

Pos logit contributions
tr+0.028
 ramp+0.027
 float+0.027
 tracks+0.027
 globe+0.026
Neg logit contributions
ubby-0.026
ited-0.026
 enough-0.024
vious-0.023
um-0.023
Token't
Token ID470
Feature activation+1.27e-02

Pos logit contributions
ubby+0.040
ited+0.039
 enough+0.039
vious+0.037
um+0.036
Neg logit contributions
tr-0.044
 ramp-0.043
 float-0.043
 tracks-0.043
 globe-0.042
Token come
Token ID1282
Feature activation+6.04e-03

Pos logit contributions
tr+0.020
 tracks+0.018
 ramp+0.018
 globe+0.018
 rug+0.017
Neg logit contributions
ited-0.021
ubby-0.021
 enough-0.021
 disagreed-0.020
vious-0.020
Token back
Token ID736
Feature activation+3.14e-02

Pos logit contributions
tr+0.010
 globe+0.009
 ramp+0.009
fire+0.009
nob+0.009
Neg logit contributions
 enough-0.010
ubby-0.010
um-0.009
ously-0.008
vious-0.008
Token.
Token ID13
Feature activation+4.21e-02

Pos logit contributions
tr+0.050
 ramp+0.049
 globe+0.047
 float+0.047
 tracks+0.046
Neg logit contributions
 enough-0.052
ubby-0.051
ited-0.049
vious-0.046
 disagreed-0.046
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
tr+0.000
 ramp+0.000
 float+0.000
 tracks+0.000
 globe+0.000
Neg logit contributions
ubby-0.000
ited-0.000
 enough-0.000
vious-0.000
 disagreed-0.000
TokenL
Token ID43
Feature activation-4.81e-02

Pos logit contributions
tr+0.017
 ramp+0.017
 tracks+0.016
nob+0.016
 float+0.016
Neg logit contributions
ubby-0.023
ited-0.023
 enough-0.023
vious-0.021
 disagreed-0.021
Tokenily
Token ID813
Feature activation-1.41e-02

Pos logit contributions
ited+0.083
 hesitation+0.081
joy+0.080
 protests+0.079
 disagreed+0.078
Neg logit contributions
nob-0.068
 carpet-0.063
 rug-0.061
tr-0.057
 planet-0.055
Token had
Token ID550
Feature activation-1.58e-02

Pos logit contributions
ubby+0.029
ited+0.027
 enough+0.026
vious+0.026
joy+0.025
Neg logit contributions
 tracks-0.018
tr-0.017
 ramp-0.017
nob-0.016
 float-0.016
Token an
Token ID281
Feature activation-2.27e-02

Pos logit contributions
ubby+0.031
ited+0.029
um+0.027
heart+0.027
oke+0.026
Neg logit contributions
tr-0.021
 tracks-0.021
 float-0.021
 hunt-0.021
 cozy-0.020
Token idea
Token ID2126
Feature activation-4.49e-02

Pos logit contributions
ited+0.031
joy+0.030
oph+0.029
ubby+0.028
ously+0.027
Neg logit contributions
 tracks-0.042
 float-0.041
 ramp-0.040
nob-0.039
 rug-0.039
Token.
Token ID13
Feature activation+3.57e-02

Pos logit contributions
ited+0.077
ubby+0.077
 enough+0.077
 disagreed+0.072
vious+0.072
Neg logit contributions
tr-0.065
 ramp-0.065
 float-0.064
 tracks-0.062
 globe-0.061
Token She
Token ID1375
Feature activation+1.16e-02

Pos logit contributions
tr+0.050
 ramp+0.050
 tracks+0.050
 float+0.049
 globe+0.047
Neg logit contributions
ited-0.067
ubby-0.065
 enough-0.059
vious-0.058
 disagreed-0.057
Token grabbed
Token ID13176
Feature activation-2.86e-02

Pos logit contributions
tr+0.018
 ramp+0.018
 float+0.018
 tracks+0.018
 hunt+0.017
Neg logit contributions
ubby-0.019
ited-0.018
 enough-0.018
vious-0.017
um-0.017
Token her
Token ID607
Feature activation-8.65e-03

Pos logit contributions
ubby+0.042
 enough+0.041
ited+0.041
vious+0.037
um+0.037
Neg logit contributions
tr-0.050
 ramp-0.050
 float-0.049
 globe-0.048
 tracks-0.047
Token t
Token ID256
Feature activation-8.30e-03

Pos logit contributions
ubby+0.015
 enough+0.014
ited+0.014
um+0.013
 disagreed+0.013
Neg logit contributions
 ramp-0.012
tr-0.012
nob-0.012
 float-0.012
 hunt-0.012
Token pray
Token ID12472
Feature activation-2.32e-02

Pos logit contributions
tr+0.021
 ramp+0.021
 globe+0.020
 float+0.020
 tracks+0.020
Neg logit contributions
ubby-0.022
ited-0.022
 enough-0.022
 disagreed-0.020
vious-0.020
Token again
Token ID757
Feature activation+3.99e-03

Pos logit contributions
ubby+0.044
ited+0.043
 enough+0.043
 disagreed+0.040
vious+0.040
Neg logit contributions
 ramp-0.030
tr-0.029
 float-0.029
 tracks-0.028
 globe-0.028
Token.
Token ID13
Feature activation+4.56e-02

Pos logit contributions
 ramp+0.005
tr+0.005
 float+0.005
 globe+0.005
 tracks+0.005
Neg logit contributions
ubby-0.008
ited-0.008
 enough-0.008
vious-0.007
 disagreed-0.007
Token But
Token ID887
Feature activation-3.99e-03

Pos logit contributions
tr+0.069
 float+0.067
 tracks+0.066
 ramp+0.066
 globe+0.063
Neg logit contributions
ubby-0.078
ited-0.077
 enough-0.072
vious-0.069
 disagreed-0.068
Token this
Token ID428
Feature activation-3.09e-02

Pos logit contributions
ubby+0.007
ited+0.006
vious+0.006
ously+0.005
 enough+0.005
Neg logit contributions
 float-0.006
tr-0.006
 tracks-0.006
 cozy-0.006
 hunt-0.006
Token time
Token ID640
Feature activation-2.69e-02

Pos logit contributions
ubby+0.052
years+0.050
ited+0.047
ously+0.046
ody+0.045
Neg logit contributions
 float-0.048
tr-0.047
 tracks-0.046
 globe-0.045
 planet-0.045
Token she
Token ID673
Feature activation+2.18e-02

Pos logit contributions
ubby+0.049
ited+0.046
vious+0.041
arre+0.039
ody+0.039
Neg logit contributions
 tracks-0.038
 float-0.038
tr-0.038
 cozy-0.037
nob-0.036
Token prayed
Token ID32168
Feature activation-3.86e-02

Pos logit contributions
tr+0.036
 ramp+0.036
 float+0.036
 tracks+0.035
 globe+0.035
Neg logit contributions
ubby-0.033
ited-0.032
 enough-0.031
vious-0.030
 disagreed-0.029
Token differently
Token ID10338
Feature activation-3.02e-02

Pos logit contributions
ubby+0.068
ited+0.066
vious+0.061
ody+0.058
 disagreed+0.058
Neg logit contributions
tr-0.058
 float-0.056
 tracks-0.056
 ramp-0.054
nob-0.054
Token.
Token ID13
Feature activation+4.35e-02

Pos logit contributions
ubby+0.058
ited+0.058
vious+0.054
 enough+0.053
oph+0.052
Neg logit contributions
 float-0.038
tr-0.037
 tracks-0.036
 ramp-0.036
 globe-0.036
Token She
Token ID1375
Feature activation+2.46e-02

Pos logit contributions
tr+0.067
 float+0.066
 ramp+0.065
 tracks+0.065
 globe+0.063
Neg logit contributions
ubby-0.072
ited-0.071
 enough-0.067
vious-0.065
 disagreed-0.064
Token to
Token ID284
Feature activation+7.87e-03

Pos logit contributions
ited+0.066
ubby+0.065
 enough+0.060
vious+0.059
 disagreed+0.057
Neg logit contributions
 float-0.062
tr-0.061
 tracks-0.060
 ramp-0.059
 hunt-0.057
Token jog
Token ID48342
Feature activation-1.71e-03

Pos logit contributions
tr+0.009
 ramp+0.008
 tracks+0.008
 float+0.008
 globe+0.008
Neg logit contributions
ubby-0.017
ited-0.016
 enough-0.016
vious-0.015
 disagreed-0.015
Token with
Token ID351
Feature activation+2.05e-02

Pos logit contributions
ubby+0.003
 enough+0.003
ited+0.003
vious+0.003
um+0.003
Neg logit contributions
 ramp-0.003
tr-0.003
 globe-0.003
 tracks-0.002
nob-0.002
Token her
Token ID607
Feature activation-9.08e-03

Pos logit contributions
tr+0.041
 ramp+0.041
 globe+0.040
fire+0.039
 float+0.039
Neg logit contributions
ubby-0.025
 enough-0.024
ited-0.023
um-0.022
vious-0.021
Token dad
Token ID9955
Feature activation+4.34e-03

Pos logit contributions
ubby+0.012
 enough+0.011
ited+0.010
 disagreed+0.010
um+0.010
Neg logit contributions
tr-0.017
nob-0.017
 ramp-0.017
fire-0.017
 float-0.017
Token every
Token ID790
Feature activation-2.53e-02

Pos logit contributions
 ramp+0.007
tr+0.006
 float+0.006
 globe+0.006
 tracks+0.006
Neg logit contributions
 enough-0.007
ubby-0.007
ited-0.007
um-0.007
 disagreed-0.006
Token morning
Token ID3329
Feature activation+3.08e-04

Pos logit contributions
ubby+0.043
ited+0.043
 enough+0.041
vious+0.039
 disagreed+0.039
Neg logit contributions
tr-0.037
 ramp-0.037
 float-0.037
 tracks-0.036
 globe-0.036
Token.
Token ID13
Feature activation+2.85e-02

Pos logit contributions
 ramp+0.001
 globe+0.001
tr+0.001
nob+0.000
 spaceship+0.000
Neg logit contributions
 enough-0.000
ubby-0.000
um-0.000
ited-0.000
 disagreed-0.000
Token They
Token ID1119
Feature activation+3.41e-02

Pos logit contributions
tr+0.043
 ramp+0.043
 float+0.042
 tracks+0.042
 globe+0.041
Neg logit contributions
ubby-0.047
ited-0.046
 enough-0.045
vious-0.043
 disagreed-0.042
Token would
Token ID561
Feature activation+1.94e-02

Pos logit contributions
tr+0.048
 float+0.048
 ramp+0.048
 tracks+0.047
 globe+0.046
Neg logit contributions
ubby-0.060
ited-0.059
 enough-0.057
vious-0.055
 disagreed-0.054
Token put
Token ID1234
Feature activation+2.74e-02

Pos logit contributions
tr+0.021
 ramp+0.020
 globe+0.019
 tracks+0.019
 float+0.019
Neg logit contributions
ubby-0.041
ited-0.040
 enough-0.040
 disagreed-0.038
vious-0.038
Token good
Token ID922
Feature activation-5.06e-02

Pos logit contributions
ubby+0.016
ited+0.016
 enough+0.015
vious+0.015
 disagreed+0.014
Neg logit contributions
tr-0.020
 ramp-0.020
 float-0.020
 globe-0.020
 tracks-0.020
Token girl
Token ID2576
Feature activation-1.76e-02

Pos logit contributions
ited+0.084
ubby+0.077
vious+0.073
ised+0.072
 disagreed+0.072
Neg logit contributions
tr-0.080
 float-0.080
 ramp-0.079
 spaceship-0.078
 globe-0.078
Token who
Token ID508
Feature activation+8.29e-03

Pos logit contributions
ubby+0.029
ited+0.029
 enough+0.029
vious+0.027
 disagreed+0.027
Neg logit contributions
 ramp-0.026
tr-0.026
 float-0.026
 tracks-0.026
 globe-0.025
Token did
Token ID750
Feature activation-4.19e-02

Pos logit contributions
 ramp+0.016
tr+0.015
 float+0.015
 tracks+0.015
 globe+0.015
Neg logit contributions
ited-0.011
ubby-0.011
 enough-0.010
 disagreed-0.010
vious-0.010
Token not
Token ID407
Feature activation-1.03e-02

Pos logit contributions
 enough+0.064
ubby+0.062
ited+0.061
 disagreed+0.058
vious+0.056
Neg logit contributions
tr-0.071
 ramp-0.071
 float-0.068
 tracks-0.067
 globe-0.066
Token like
Token ID588
Feature activation-1.72e-02

Pos logit contributions
ubby+0.014
ited+0.014
 enough+0.013
vious+0.012
 disagreed+0.012
Neg logit contributions
tr-0.019
 ramp-0.019
 float-0.019
 tracks-0.019
 globe-0.018
Token to
Token ID284
Feature activation-1.44e-02

Pos logit contributions
ubby+0.030
ited+0.030
 enough+0.028
vious+0.027
 disagreed+0.027
Neg logit contributions
 float-0.025
tr-0.025
 tracks-0.025
 ramp-0.025
 globe-0.024
Token take
Token ID1011
Feature activation-2.03e-02

Pos logit contributions
ubby+0.027
ited+0.027
 enough+0.026
vious+0.025
 disagreed+0.025
Neg logit contributions
 float-0.019
tr-0.018
 ramp-0.018
 tracks-0.018
 globe-0.017
Token things
Token ID1243
Feature activation+4.84e-02

Pos logit contributions
ubby+0.032
 enough+0.032
ited+0.031
vious+0.029
 disagreed+0.029
Neg logit contributions
tr-0.033
 ramp-0.033
 float-0.032
 globe-0.031
 tracks-0.031
Token that
Token ID326
Feature activation+3.25e-02

Pos logit contributions
tr+0.088
 ramp+0.088
nob+0.083
 tracks+0.083
 globe+0.082
Neg logit contributions
 enough-0.072
ubby-0.065
ited-0.062
vious-0.060
 disagreed-0.060
Token did
Token ID750
Feature activation-1.94e-02

Pos logit contributions
 ramp+0.068
tr+0.066
 tracks+0.064
nob+0.064
 float+0.063
Neg logit contributions
 disagreed-0.037
 enough-0.034
ited-0.034
ubby-0.032
vious-0.032
Token the
Token ID262
Feature activation+6.77e-03

Pos logit contributions
 enough+0.011
ubby+0.011
vious+0.010
ited+0.009
 disagreed+0.009
Neg logit contributions
tr-0.015
 ramp-0.014
 globe-0.014
nob-0.014
fire-0.013
Token flowers
Token ID12734
Feature activation+4.78e-02

Pos logit contributions
tr+0.009
 ramp+0.009
 float+0.009
 tracks+0.008
nob+0.008
Neg logit contributions
ubby-0.013
ited-0.012
 enough-0.012
vious-0.012
 disagreed-0.012
Token?"
Token ID1701
Feature activation-1.36e-03

Pos logit contributions
tr+0.088
 ramp+0.088
 tracks+0.085
nob+0.084
 globe+0.082
Neg logit contributions
ubby-0.068
 enough-0.066
ited-0.061
vious-0.061
 disagreed-0.057
Token
Token ID198
Feature activation-2.31e-02

Pos logit contributions
ubby+0.002
ited+0.002
 enough+0.002
vious+0.002
um+0.002
Neg logit contributions
tr-0.002
 float-0.002
 ramp-0.002
 globe-0.002
 tracks-0.002
Token
Token ID198
Feature activation+3.71e-05

Pos logit contributions
ubby+0.039
ited+0.038
 enough+0.037
um+0.036
vious+0.036
Neg logit contributions
tr-0.035
 ramp-0.034
 float-0.034
 tracks-0.033
 globe-0.033
TokenThe
Token ID464
Feature activation-2.64e-02

Pos logit contributions
 ramp+0.000
tr+0.000
 tracks+0.000
 float+0.000
 globe+0.000
Neg logit contributions
ubby-0.000
ited-0.000
 enough-0.000
 disagreed-0.000
illy-0.000
Token butterfly
Token ID35113
Feature activation-2.72e-03

Pos logit contributions
ubby+0.049
ited+0.049
 enough+0.047
vious+0.045
 disagreed+0.045
Neg logit contributions
tr-0.035
 float-0.034
 ramp-0.034
 tracks-0.034
 globe-0.033
Token moved
Token ID3888
Feature activation-3.40e-02

Pos logit contributions
ited+0.004
 disagreed+0.003
 enough+0.003
vious+0.003
 understood+0.003
Neg logit contributions
 ramp-0.005
 tracks-0.005
tr-0.005
nob-0.005
 globe-0.005
Token its
Token ID663
Feature activation+1.95e-02

Pos logit contributions
ubby+0.067
ited+0.066
 enough+0.063
vious+0.061
 disagreed+0.060
Neg logit contributions
tr-0.042
 float-0.042
 ramp-0.041
 tracks-0.041
 globe-0.039
Token wings
Token ID12098
Feature activation+8.12e-03

Pos logit contributions
 ramp+0.030
tr+0.030
nob+0.029
 globe+0.029
fire+0.028
Neg logit contributions
ubby-0.032
 enough-0.031
ited-0.029
um-0.028
 disagreed-0.028
Token slowly
Token ID6364
Feature activation+2.12e-03

Pos logit contributions
 ramp+0.015
tr+0.015
 float+0.015
 tracks+0.015
 globe+0.015
Neg logit contributions
ubby-0.011
ited-0.011
 enough-0.010
vious-0.010
 disagreed-0.009
Token go
Token ID467
Feature activation+1.29e-02

Pos logit contributions
tr+0.018
 globe+0.017
 ramp+0.017
fire+0.016
 tracks+0.016
Neg logit contributions
ited-0.026
ubby-0.026
 enough-0.026
vious-0.026
 disagreed-0.025
Token first
Token ID717
Feature activation+6.91e-04

Pos logit contributions
tr+0.020
 ramp+0.019
 globe+0.019
 spaceship+0.019
 tracks+0.018
Neg logit contributions
 enough-0.021
ubby-0.021
ited-0.021
vious-0.020
 disagreed-0.019
Token,
Token ID11
Feature activation+9.45e-03

Pos logit contributions
 float+0.001
 ramp+0.001
 tracks+0.001
tr+0.001
 hunt+0.001
Neg logit contributions
ited-0.001
ubby-0.001
vious-0.001
oph-0.001
um-0.001
Token but
Token ID475
Feature activation-8.41e-03

Pos logit contributions
tr+0.014
 ramp+0.014
 float+0.014
 tracks+0.014
 globe+0.013
Neg logit contributions
ubby-0.016
ited-0.016
 enough-0.015
vious-0.015
 disagreed-0.014
Token Sue
Token ID26089
Feature activation-2.11e-02

Pos logit contributions
ubby+0.017
ited+0.016
vious+0.015
 enough+0.015
illy+0.015
Neg logit contributions
 float-0.011
tr-0.010
 ramp-0.010
 tracks-0.010
 globe-0.010
Token said
Token ID531
Feature activation-9.34e-02

Pos logit contributions
ubby+0.032
ited+0.030
 enough+0.029
vious+0.028
joy+0.028
Neg logit contributions
tr-0.035
 float-0.035
 tracks-0.035
 ramp-0.034
 hunt-0.034
Token,
Token ID11
Feature activation-1.19e-02

Pos logit contributions
ited+0.144
ubby+0.140
vious+0.124
oph+0.119
ody+0.119
Neg logit contributions
 float-0.164
tr-0.161
 tracks-0.157
 ramp-0.156
 globe-0.154
Token "
Token ID366
Feature activation+1.25e-02

Pos logit contributions
ubby+0.019
ited+0.019
 enough+0.018
vious+0.018
 disagreed+0.017
Neg logit contributions
tr-0.018
 ramp-0.018
 float-0.018
 tracks-0.018
 globe-0.018
TokenNo
Token ID2949
Feature activation-7.67e-02

Pos logit contributions
tr+0.025
 float+0.024
 ramp+0.024
 tracks+0.024
 globe+0.023
Neg logit contributions
ubby-0.016
 enough-0.014
ited-0.014
ously-0.013
vious-0.013
Token,
Token ID11
Feature activation-7.12e-03

Pos logit contributions
ited+0.105
ubby+0.103
vious+0.093
 enough+0.088
years+0.086
Neg logit contributions
 ramp-0.146
 tracks-0.146
 float-0.144
tr-0.142
 rug-0.141
Token I
Token ID314
Feature activation-1.38e-02

Pos logit contributions
ubby+0.013
ited+0.013
 enough+0.012
vious+0.012
um+0.012
Neg logit contributions
 float-0.010
tr-0.010
 ramp-0.010
 tracks-0.010
 globe-0.009
Token job
Token ID1693
Feature activation-4.70e-02

Pos logit contributions
ubby+0.064
 enough+0.061
ited+0.060
vious+0.056
um+0.055
Neg logit contributions
tr-0.084
 ramp-0.083
 tracks-0.083
 float-0.082
nob-0.082
Token,
Token ID11
Feature activation-5.27e-03

Pos logit contributions
ited+0.073
ubby+0.068
vious+0.065
years+0.060
ously+0.060
Neg logit contributions
 ramp-0.084
 float-0.084
 tracks-0.081
 rug-0.080
 globe-0.079
Token Tom
Token ID4186
Feature activation-3.78e-02

Pos logit contributions
ited+0.009
ubby+0.008
ously+0.008
years+0.007
joy+0.007
Neg logit contributions
 ramp-0.009
 float-0.009
 tracks-0.009
 rug-0.009
 track-0.008
Token!"
Token ID2474
Feature activation+4.47e-03

Pos logit contributions
ited+0.075
ubby+0.073
vious+0.068
years+0.068
ously+0.066
Neg logit contributions
 tracks-0.046
 ramp-0.046
 hunt-0.046
 float-0.045
tr-0.045
Token Mom
Token ID11254
Feature activation-3.59e-02

Pos logit contributions
 tracks+0.006
 float+0.006
 ramp+0.006
tr+0.006
 rug+0.006
Neg logit contributions
ited-0.008
ubby-0.008
 enough-0.008
vious-0.008
ously-0.007
Token says
Token ID1139
Feature activation-9.84e-02

Pos logit contributions
ubby+0.048
ited+0.047
 enough+0.045
vious+0.043
 disagreed+0.042
Neg logit contributions
tr-0.066
 ramp-0.066
 float-0.066
 tracks-0.065
 globe-0.064
Token.
Token ID13
Feature activation+6.46e-04

Pos logit contributions
ubby+0.191
 enough+0.188
ited+0.186
 disagreed+0.177
vious+0.176
Neg logit contributions
tr-0.123
 ramp-0.121
 float-0.117
 globe-0.116
 tracks-0.116
Token She
Token ID1375
Feature activation-2.66e-03

Pos logit contributions
 tracks+0.001
tr+0.001
 ramp+0.001
 float+0.001
 rug+0.001
Neg logit contributions
ited-0.001
ubby-0.001
 enough-0.001
vious-0.001
illy-0.001
Token cl
Token ID537
Feature activation-4.57e-02

Pos logit contributions
ubby+0.004
ited+0.004
 enough+0.004
vious+0.004
 disagreed+0.004
Neg logit contributions
tr-0.004
 ramp-0.004
 float-0.004
 tracks-0.004
 globe-0.004
Tokenaps
Token ID1686
Feature activation-3.67e-02

Pos logit contributions
ubby+0.068
ited+0.067
 enough+0.065
vious+0.061
 disagreed+0.061
Neg logit contributions
tr-0.078
 ramp-0.077
 float-0.077
 tracks-0.076
 globe-0.075
Token her
Token ID607
Feature activation-3.73e-02

Pos logit contributions
ited+0.072
ubby+0.069
oph+0.066
oke+0.065
 disagreed+0.065
Neg logit contributions
 float-0.048
 tracks-0.046
 cozy-0.044
 spaceship-0.044
 rug-0.043
Token
Token ID198
Feature activation-5.02e-03

Pos logit contributions
 ramp+0.025
 float+0.025
tr+0.024
 hunt+0.024
 globe+0.024
Neg logit contributions
ubby-0.025
 enough-0.024
um-0.024
ited-0.024
 disagreed-0.024
Token
Token ID198
Feature activation+9.47e-03

Pos logit contributions
ubby+0.009
ited+0.008
 enough+0.008
um+0.008
vious+0.008
Neg logit contributions
 ramp-0.007
tr-0.007
 float-0.007
 tracks-0.007
 globe-0.007
Token"
Token ID1
Feature activation+1.07e-02

Pos logit contributions
tr+0.016
 tracks+0.016
 float+0.016
nob+0.016
 globe+0.015
Neg logit contributions
ubby-0.015
ited-0.014
 enough-0.014
vious-0.013
joy-0.012
TokenOk
Token ID18690
Feature activation-5.19e-02

Pos logit contributions
tr+0.016
 float+0.016
 ramp+0.015
 globe+0.015
 tracks+0.015
Neg logit contributions
ubby-0.019
 enough-0.018
ited-0.018
vious-0.017
um-0.016
Token,"
Token ID553
Feature activation-5.31e-03

Pos logit contributions
ited+0.096
ubby+0.093
vious+0.091
years+0.086
eared+0.084
Neg logit contributions
 float-0.074
 tracks-0.071
 ramp-0.070
 globe-0.070
tr-0.070
Token said
Token ID531
Feature activation-8.31e-02

Pos logit contributions
ubby+0.007
ited+0.007
 enough+0.006
vious+0.006
 disagreed+0.006
Neg logit contributions
tr-0.010
 ramp-0.010
 float-0.010
 tracks-0.010
 globe-0.010
Token her
Token ID607
Feature activation-6.80e-02

Pos logit contributions
ubby+0.118
ited+0.115
 enough+0.112
 disagreed+0.105
vious+0.105
Neg logit contributions
 ramp-0.147
tr-0.147
 float-0.145
 tracks-0.143
 globe-0.141
Token mom
Token ID1995
Feature activation-3.86e-02

Pos logit contributions
ubby+0.106
ited+0.104
 enough+0.101
vious+0.095
 disagreed+0.095
Neg logit contributions
tr-0.112
 ramp-0.111
 float-0.111
 tracks-0.109
 globe-0.107
Tokenmy
Token ID1820
Feature activation-3.25e-02

Pos logit contributions
ubby+0.050
ited+0.048
 enough+0.045
vious+0.043
 disagreed+0.041
Neg logit contributions
tr-0.075
 float-0.073
 ramp-0.073
 tracks-0.073
 globe-0.071
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
ubby+0.058
 enough+0.058
ited+0.058
 disagreed+0.055
um+0.054
Neg logit contributions
 ramp-0.045
tr-0.044
 float-0.043
 globe-0.043
 tracks-0.042
Token And
Token ID843
Feature activation+1.70e-02

Pos logit contributions
 ramp+0.027
 float+0.027
tr+0.027
 tracks+0.026
 globe+0.026
Neg logit contributions
ubby-0.027
 enough-0.026
ited-0.026
 disagreed-0.025
um-0.025
Token,
Token ID11
Feature activation-5.70e-03

Pos logit contributions
ited+0.082
ubby+0.078
vious+0.074
years+0.074
ously+0.071
Neg logit contributions
 hunt-0.087
tr-0.087
 ramp-0.086
 globe-0.086
 float-0.086
Token a
Token ID257
Feature activation-1.14e-02

Pos logit contributions
ubby+0.010
 enough+0.010
ited+0.010
vious+0.009
 disagreed+0.009
Neg logit contributions
tr-0.008
 ramp-0.008
 float-0.008
 tracks-0.008
 globe-0.008
Token duck
Token ID22045
Feature activation+1.20e-02

Pos logit contributions
ubby+0.022
 enough+0.021
joy+0.019
ited+0.019
 disagreed+0.019
Neg logit contributions
nob-0.015
tr-0.014
 tracks-0.013
 hunt-0.012
fire-0.012
Token!"
Token ID2474
Feature activation-7.89e-03

Pos logit contributions
tr+0.020
 ramp+0.020
 tracks+0.019
 float+0.019
nob+0.019
Neg logit contributions
ubby-0.019
ited-0.018
 enough-0.017
vious-0.016
 disagreed-0.016
Token Lily
Token ID20037
Feature activation-4.92e-02

Pos logit contributions
ubby+0.015
ited+0.015
 enough+0.014
vious+0.013
um+0.013
Neg logit contributions
 float-0.011
 tracks-0.011
 ramp-0.011
tr-0.010
 globe-0.010
Token says
Token ID1139
Feature activation-1.08e-01

Pos logit contributions
ubby+0.062
ited+0.061
 enough+0.060
vious+0.057
 disagreed+0.057
Neg logit contributions
tr-0.094
 ramp-0.093
 float-0.092
 globe-0.091
 tracks-0.090
Token,
Token ID11
Feature activation-3.51e-02

Pos logit contributions
ubby+0.173
 enough+0.172
ited+0.166
 disagreed+0.160
ously+0.158
Neg logit contributions
tr-0.169
 ramp-0.169
 globe-0.163
 tracks-0.162
 float-0.161
Token pointing
Token ID10609
Feature activation-4.90e-02

Pos logit contributions
ubby+0.057
ited+0.056
 enough+0.056
 disagreed+0.053
ously+0.051
Neg logit contributions
tr-0.055
 ramp-0.053
 tracks-0.051
 globe-0.051
nob-0.049
Token to
Token ID284
Feature activation-1.54e-02

Pos logit contributions
ited+0.075
ubby+0.074
vious+0.066
 enough+0.065
illy+0.065
Neg logit contributions
 float-0.083
tr-0.080
 tracks-0.080
 ramp-0.078
 globe-0.078
Token the
Token ID262
Feature activation+1.33e-02

Pos logit contributions
 enough+0.022
ubby+0.021
ited+0.020
vious+0.019
 disagreed+0.019
Neg logit contributions
tr-0.028
 ramp-0.027
 globe-0.026
 float-0.026
nob-0.026
Token pond
Token ID16723
Feature activation+1.26e-02

Pos logit contributions
tr+0.015
nob+0.015
 ramp+0.013
 float+0.013
 tracks+0.013
Neg logit contributions
ubby-0.028
 enough-0.027
ited-0.026
vious-0.025
 disagreed-0.025
Token.
Token ID13
Feature activation+2.64e-02

Pos logit contributions
 enough+0.105
ubby+0.095
um+0.090
ously+0.089
vious+0.085
Neg logit contributions
 ramp-0.109
 globe-0.105
 hunt-0.102
 tracks-0.100
tr-0.099
Token
Token ID198
Feature activation-2.63e-03

Pos logit contributions
 ramp+0.050
 float+0.049
 globe+0.048
fire+0.046
 hunt+0.046
Neg logit contributions
ubby-0.034
 enough-0.033
um-0.033
 disagreed-0.033
ited-0.032
Token
Token ID198
Feature activation+8.88e-03

Pos logit contributions
ubby+0.005
ited+0.004
 enough+0.004
um+0.004
vious+0.004
Neg logit contributions
 ramp-0.004
tr-0.004
 float-0.004
 tracks-0.004
 globe-0.004
TokenHer
Token ID9360
Feature activation-6.12e-02

Pos logit contributions
 ramp+0.016
 float+0.016
tr+0.016
 globe+0.016
 tracks+0.016
Neg logit contributions
ubby-0.012
ited-0.012
 enough-0.011
um-0.011
vious-0.011
Token mom
Token ID1995
Feature activation-2.81e-02

Pos logit contributions
ubby+0.093
ited+0.089
 enough+0.087
um+0.082
vious+0.082
Neg logit contributions
 ramp-0.103
tr-0.102
 float-0.102
 tracks-0.100
 globe-0.099
Token heard
Token ID2982
Feature activation-7.75e-02

Pos logit contributions
ubby+0.056
 intentions+0.050
ited+0.049
vious+0.049
um+0.049
Neg logit contributions
 tracks-0.037
nob-0.034
 hunt-0.034
tr-0.034
 rug-0.033
Token her
Token ID607
Feature activation-4.73e-02

Pos logit contributions
ubby+0.161
ited+0.157
ared+0.142
 disagreed+0.142
um+0.141
Neg logit contributions
 tracks-0.097
 rug-0.095
 float-0.094
 carpet-0.091
nob-0.089
Token cry
Token ID3960
Feature activation-7.57e-02

Pos logit contributions
ubby+0.078
ited+0.078
 enough+0.072
vious+0.070
oph+0.069
Neg logit contributions
 tracks-0.074
tr-0.074
 float-0.073
nob-0.071
 ramp-0.070
Token and
Token ID290
Feature activation+1.95e-02

Pos logit contributions
ubby+0.099
ited+0.098
 enough+0.091
vious+0.088
 disagreed+0.085
Neg logit contributions
tr-0.143
 float-0.142
 ramp-0.140
 tracks-0.139
 globe-0.136
Token came
Token ID1625
Feature activation-3.63e-02

Pos logit contributions
 float+0.031
tr+0.031
 ramp+0.031
 tracks+0.031
 globe+0.030
Neg logit contributions
ubby-0.031
ited-0.030
 enough-0.028
vious-0.027
um-0.027
Token running
Token ID2491
Feature activation-1.40e-02

Pos logit contributions
ubby+0.056
 enough+0.054
ited+0.054
vious+0.050
 disagreed+0.050
Neg logit contributions
tr-0.060
 ramp-0.059
 float-0.059
 globe-0.058
 tracks-0.057