TOP ACTIVATIONS
MAX = 0.301

boy . Everyday , his father would give him a belt
Oh little Lucy " it said in a deep and scary
the pet shop , the father saw a little frog with
. One day , his father took him to the airport
be sad little bird " said the cloud . " Come
. Every day , the father put on his work clothes
upon a time , a mother and a daughter were sitting
day , the mother and father tried to make ends meet
her mother mention to her father that there was trouble .
right here " his Mom said , pointing to the entrance
I 'm adding yarn " said grandma . "
, â the boy said .
crack ling fire , her father said , " Tomorrow night
hair ly fruit ?" Her father smiled and said , "
in the pond with her father . She was trying to
. Every day , the father would go to collect lunch
don 't like what you said , it made me mad
said again , " I said who 's there ?"
sky . " Look !", said M andy . " It
in the garden with her father , who began to explain

MIN ACTIVATIONS
MIN = -0.463

was filled with delicious ch ocol ates . He was happy
not eat too many ch ocol ates and his friend learned
as he opened the ch ocol ates and started eating them
find a box of ch ocol ates . "
was a box of ch ocol ates . " For me
box and share the ch ocol ates . They are happy
she paid for the ch ocol ates and the two of
and started eating the ch ocol ates . The shop keeper
oll ip ops , ch ocol ates , g ummies ,
son is eating our ch ocol ates !"
a big box of ch ocol ates . It was so
a big box of ch ocol ates ! <|endoftext|> Once upon
colorful cand ies and ch ocol ates . She said to
looking at all the ch ocol ates , Lex i chose
It was sweet , ch ocol at y and warm ,
a large selection of ch ocol ates . Ell a picked
enjoyed her cool , ch ocol at y treat !
't have all these ch ocol ates ! Now , go
, cup cakes and ch ocol ates .
he ate all the ch ocol ates and sweets . Then

INTERVAL 0.110 - 0.301
CONTAINS 6.599%

. Every day , her dad asked her a lot of
Lily and Max hugged and said sorry to each other .
waited and waited , but mummy didn 't come and find
structure that he was very proud of . He wanted to
looked at the bag and said , " Mom my ,

INTERVAL -0.081 - 0.110
CONTAINS 79.619%

, Tim found his basketball very quickly and went to play
a special day . The little boy was so excited .
play somewhere safe and fun ." L ily and
She grabbed her passport and said , " I
The fire man said he liked to slide too . He

INTERVAL -0.272 - -0.081
CONTAINS 13.742%

hugs Rex . He says , " Rex , you are
out what Kitty did . They were very angry . They
gathered up a handful of sand and watched it slip away
sing songs . One day , she found a shiny rock
with her blanket covering her , it quickly ran away into

INTERVAL -0.463 - -0.272
CONTAINS 0.041%

they put it in a v ase inside . Lily was
the broken glass and the ol ives . They washed the
Wh isk ers . Wh isk ers looked at Spot and
ers . Mr . Wh isk ers loved to study .
Lily 's mom saw the v ase and asked where she
Token boy
Token ID2933
Feature activation+1.58e-01

Pos logit contributions
 said+0.083
 asked+0.077
 say+0.070
 understand+0.065
 know+0.064
Neg logit contributions
ocol-0.276
igs-0.234
icer-0.231
 degrees-0.229
ipping-0.226
Token.
Token ID13
Feature activation+4.38e-02

Pos logit contributions
 said+0.215
 asked+0.200
 say+0.198
 mother+0.195
 understand+0.193
Neg logit contributions
ocol-0.379
igs-0.309
icer-0.304
ions-0.301
 degrees-0.297
Token Everyday
Token ID48154
Feature activation+1.11e-01

Pos logit contributions
 said+0.049
 asked+0.048
 say+0.042
 grand+0.040
 know+0.040
Neg logit contributions
ocol-0.124
igs-0.108
 degrees-0.106
ipping-0.104
Too-0.101
Token,
Token ID11
Feature activation-6.30e-02

Pos logit contributions
 said+0.118
 asked+0.106
 mom+0.099
 dad+0.095
 mother+0.091
Neg logit contributions
ocol-0.345
igs-0.290
 degrees-0.289
icer-0.280
orns-0.278
Token his
Token ID465
Feature activation-5.39e-03

Pos logit contributions
ocol+0.162
igs+0.132
icer+0.130
 degrees+0.129
ions+0.128
Neg logit contributions
 said-0.083
 asked-0.078
 say-0.074
 know-0.070
 understand-0.070
Token father
Token ID2988
Feature activation+2.90e-01

Pos logit contributions
ocol+0.014
igs+0.013
 algae+0.012
icer+0.012
 degrees+0.012
Neg logit contributions
 asked-0.005
 said-0.005
 say-0.004
 know-0.004
 understand-0.004
Token would
Token ID561
Feature activation+5.70e-02

Pos logit contributions
 mother+0.275
 say+0.274
 understand+0.264
 said+0.262
 know+0.259
Neg logit contributions
ocol-0.716
igs-0.610
ions-0.607
icer-0.607
arine-0.606
Token give
Token ID1577
Feature activation-3.72e-02

Pos logit contributions
 said+0.063
 asked+0.054
 mother+0.053
 woman+0.051
 lost+0.050
Neg logit contributions
ocol-0.148
igs-0.124
arine-0.122
ipping-0.118
ions-0.118
Token him
Token ID683
Feature activation-1.70e-02

Pos logit contributions
ocol+0.078
igs+0.060
icer+0.059
 degrees+0.058
ions+0.057
Neg logit contributions
 said-0.065
 asked-0.062
 say-0.060
 mother-0.059
 understand-0.059
Token a
Token ID257
Feature activation+8.31e-02

Pos logit contributions
ocol+0.047
igs+0.039
 degrees+0.039
icer+0.039
ions+0.038
Neg logit contributions
 said-0.019
 asked-0.018
 say-0.016
 mother-0.015
 understand-0.015
Token belt
Token ID10999
Feature activation-2.33e-02

Pos logit contributions
 asked+0.113
 said+0.112
 say+0.091
 wondered+0.090
 know+0.087
Neg logit contributions
ocol-0.195
igs-0.160
 algae-0.157
 salsa-0.154
arine-0.154
TokenOh
Token ID5812
Feature activation-6.64e-02

Pos logit contributions
ocol+0.165
igs+0.123
 degrees+0.122
icer+0.122
ions+0.118
Neg logit contributions
 said-0.172
 asked-0.165
 say-0.160
 mother-0.155
 understand-0.153
Token little
Token ID1310
Feature activation+7.51e-02

Pos logit contributions
ocol+0.147
 degrees+0.116
mm+0.115
igs+0.115
ions+0.114
Neg logit contributions
 said-0.103
 asked-0.103
 say-0.097
 understand-0.096
 write-0.094
Token Lucy
Token ID22162
Feature activation+5.19e-02

Pos logit contributions
 said+0.088
 asked+0.085
 understand+0.082
 know+0.079
 say+0.078
Neg logit contributions
ocol-0.197
ipping-0.161
 degrees-0.159
igs-0.157
icer-0.156
Token"
Token ID1
Feature activation-1.31e-01

Pos logit contributions
 said+0.080
 asked+0.078
 understand+0.078
 say+0.075
 know+0.072
Neg logit contributions
ocol-0.112
 degrees-0.088
ipping-0.087
igs-0.086
icer-0.086
Token it
Token ID340
Feature activation-2.77e-02

Pos logit contributions
ocol+0.312
 degrees+0.288
mm+0.279
orns+0.278
ions+0.271
Neg logit contributions
 mother-0.123
 wish-0.121
 father-0.120
 parent-0.120
 understand-0.118
Token said
Token ID531
Feature activation+2.90e-01

Pos logit contributions
ocol+0.056
 degrees+0.051
ions+0.048
wic+0.047
icer+0.047
Neg logit contributions
 understand-0.033
 smile-0.030
 write-0.029
 contain-0.029
 dad-0.028
Token in
Token ID287
Feature activation+5.54e-02

Pos logit contributions
 said+0.341
 asked+0.305
 say+0.303
 mom+0.294
 mother+0.291
Neg logit contributions
ocol-0.802
igs-0.679
icer-0.676
orns-0.661
 degrees-0.654
Token a
Token ID257
Feature activation+7.68e-02

Pos logit contributions
 said+0.061
 asked+0.057
 say+0.052
 understand+0.049
 mother+0.048
Neg logit contributions
ocol-0.156
igs-0.129
 degrees-0.127
icer-0.127
ions-0.126
Token deep
Token ID2769
Feature activation-6.05e-02

Pos logit contributions
 said+0.094
 asked+0.088
 say+0.079
 understand+0.077
 know+0.075
Neg logit contributions
ocol-0.195
 degrees-0.159
icer-0.156
igs-0.152
 centimeters-0.151
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
ocol+0.150
igs+0.123
icer+0.121
 degrees+0.116
ipping+0.116
Neg logit contributions
 said-0.083
 asked-0.077
 say-0.074
 understand-0.074
 know-0.069
Token scary
Token ID14343
Feature activation-3.50e-02

Pos logit contributions
 understand+0.015
 mother+0.014
 member+0.013
 father+0.013
 friend+0.013
Neg logit contributions
ocol-0.027
 degrees-0.022
 algae-0.022
Too-0.021
igs-0.021
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
ocol+0.286
igs+0.241
icer+0.236
 degrees+0.235
ions+0.232
Neg logit contributions
 said-0.109
 asked-0.102
 say-0.094
 mother-0.089
 understand-0.088
Token pet
Token ID4273
Feature activation+6.93e-02

Pos logit contributions
 said+0.018
 asked+0.018
 say+0.016
 know+0.016
 understand+0.016
Neg logit contributions
ocol-0.028
icer-0.022
igs-0.022
 degrees-0.022
arine-0.021
Token shop
Token ID6128
Feature activation+6.02e-02

Pos logit contributions
 said+0.093
 asked+0.091
 say+0.086
 understand+0.086
 know+0.084
Neg logit contributions
ocol-0.169
igs-0.136
icer-0.135
ions-0.135
 degrees-0.134
Token,
Token ID11
Feature activation-7.08e-02

Pos logit contributions
 said+0.076
 understand+0.075
 asked+0.074
 know+0.074
 say+0.073
Neg logit contributions
ocol-0.143
 degrees-0.116
icer-0.115
ipping-0.115
mm-0.114
Token the
Token ID262
Feature activation+1.02e-02

Pos logit contributions
ocol+0.175
igs+0.142
 degrees+0.141
icer+0.140
ions+0.138
Neg logit contributions
 said-0.096
 asked-0.091
 say-0.086
 mother-0.082
 understand-0.082
Token father
Token ID2988
Feature activation+2.90e-01

Pos logit contributions
 said+0.013
 asked+0.012
 say+0.011
 know+0.011
 understand+0.011
Neg logit contributions
ocol-0.027
igs-0.022
 degrees-0.022
icer-0.022
arine-0.021
Token saw
Token ID2497
Feature activation+7.27e-02

Pos logit contributions
 mother+0.220
 understand+0.220
 said+0.219
 know+0.211
 say+0.210
Neg logit contributions
ocol-0.822
 degrees-0.683
igs-0.678
ipping-0.678
icer-0.676
Token a
Token ID257
Feature activation+7.41e-02

Pos logit contributions
 said+0.084
 asked+0.080
 say+0.076
 mother+0.072
 know+0.071
Neg logit contributions
ocol-0.196
igs-0.160
 degrees-0.160
icer-0.158
ions-0.157
Token little
Token ID1310
Feature activation+8.22e-02

Pos logit contributions
 said+0.088
 asked+0.085
 say+0.078
 know+0.075
 understand+0.074
Neg logit contributions
ocol-0.195
 degrees-0.160
icer-0.159
igs-0.159
arine-0.154
Token frog
Token ID21264
Feature activation+1.19e-02

Pos logit contributions
 said+0.083
 asked+0.077
 say+0.070
 understand+0.070
 know+0.069
Neg logit contributions
ocol-0.232
 degrees-0.190
igs-0.188
ipping-0.188
icer-0.187
Token with
Token ID351
Feature activation-6.42e-02

Pos logit contributions
 said+0.015
 asked+0.014
 say+0.013
 mother+0.012
 understand+0.012
Neg logit contributions
ocol-0.032
igs-0.026
 degrees-0.026
icer-0.026
ipping-0.025
Token.
Token ID13
Feature activation+4.74e-02

Pos logit contributions
 know+0.200
 mother+0.194
 understand+0.192
 said+0.188
 father+0.182
Neg logit contributions
ocol-0.240
ipping-0.221
icer-0.216
 diet-0.207
-0.206
Token One
Token ID1881
Feature activation-5.58e-02

Pos logit contributions
 said+0.055
 asked+0.054
 say+0.049
 know+0.046
 grand+0.045
Neg logit contributions
ocol-0.133
igs-0.112
 degrees-0.111
ipping-0.108
orns-0.107
Token day
Token ID1110
Feature activation+2.03e-03

Pos logit contributions
ocol+0.078
mm+0.068
Spl+0.068
icer+0.067
Too+0.065
Neg logit contributions
 say-0.098
 asked-0.096
 said-0.096
 write-0.095
 father-0.093
Token,
Token ID11
Feature activation-7.04e-02

Pos logit contributions
 said+0.002
 asked+0.002
 mom+0.002
 lost+0.002
 dad+0.001
Neg logit contributions
ocol-0.006
icer-0.005
igs-0.005
 degrees-0.005
icious-0.005
Token his
Token ID465
Feature activation-9.23e-03

Pos logit contributions
ocol+0.183
igs+0.151
icer+0.148
 degrees+0.147
ions+0.146
Neg logit contributions
 said-0.091
 asked-0.086
 say-0.081
 understand-0.077
 know-0.076
Token father
Token ID2988
Feature activation+2.90e-01

Pos logit contributions
igs+0.018
ocol+0.018
+0.017
balls+0.017
ipping+0.016
Neg logit contributions
 know-0.011
 said-0.011
 say-0.011
 asked-0.011
 understand-0.010
Token took
Token ID1718
Feature activation-1.91e-02

Pos logit contributions
 wish+0.218
 understand+0.214
 mother+0.209
 smile+0.207
 know+0.195
Neg logit contributions
ocol-0.718
ar-0.623
ions-0.615
arine-0.610
igs-0.609
Token him
Token ID683
Feature activation-1.98e-02

Pos logit contributions
ocol+0.046
igs+0.037
 degrees+0.037
icer+0.036
ipping+0.036
Neg logit contributions
 said-0.030
 asked-0.028
 say-0.027
 mother-0.025
 know-0.025
Token to
Token ID284
Feature activation+8.97e-02

Pos logit contributions
ocol+0.048
ions+0.043
igs+0.043
orns+0.042
ips+0.041
Neg logit contributions
 asked-0.021
 said-0.020
 smile-0.018
 mother-0.018
 say-0.018
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
 said+0.104
 asked+0.093
 mother+0.088
 say+0.084
 understand+0.081
Neg logit contributions
ocol-0.237
igs-0.195
icer-0.191
arine-0.191
 degrees-0.189
Token airport
Token ID9003
Feature activation+9.04e-03

Pos logit contributions
 said+0.018
 know+0.018
 understand+0.017
 say+0.017
 asked+0.017
Neg logit contributions
ocol-0.018
icer-0.017
Too-0.015
arine-0.015
 algae-0.015
Token be
Token ID307
Feature activation-8.96e-02

Pos logit contributions
ocol+0.034
Too+0.027
 pleasures+0.026
 algae+0.026
igs+0.026
Neg logit contributions
 said-0.022
 asked-0.019
 woman-0.019
 grandmother-0.019
 wrote-0.018
Token sad
Token ID6507
Feature activation+4.99e-02

Pos logit contributions
ocol+0.254
igs+0.211
 degrees+0.210
 centimeters+0.205
icer+0.203
Neg logit contributions
 said-0.085
 say-0.076
 mother-0.075
 asked-0.074
 write-0.072
Token little
Token ID1310
Feature activation+6.59e-02

Pos logit contributions
 said+0.075
 asked+0.070
 woman+0.067
 say+0.067
 write+0.067
Neg logit contributions
ocol-0.109
 degrees-0.087
igs-0.085
 centimeters-0.084
 pleasures-0.083
Token bird
Token ID6512
Feature activation+1.13e-01

Pos logit contributions
 said+0.080
 asked+0.075
 say+0.070
 know+0.070
 understand+0.069
Neg logit contributions
ocol-0.170
 degrees-0.140
ipping-0.137
igs-0.137
 centimeters-0.135
Token"
Token ID1
Feature activation-1.38e-01

Pos logit contributions
 said+0.137
 asked+0.136
 mother+0.129
 dad+0.124
 say+0.123
Neg logit contributions
ocol-0.274
igs-0.227
 degrees-0.227
ipping-0.223
 centimeters-0.222
Token said
Token ID531
Feature activation+2.89e-01

Pos logit contributions
ocol+0.348
orns+0.328
 degrees+0.317
ions+0.311
mm+0.309
Neg logit contributions
 mother-0.106
 wish-0.103
 parent-0.095
 father-0.094
 smile-0.090
Token the
Token ID262
Feature activation+6.53e-03

Pos logit contributions
 said+0.342
 asked+0.325
 say+0.302
 mother+0.289
 understand+0.288
Neg logit contributions
ocol-0.763
igs-0.628
 degrees-0.624
ions-0.607
ipping-0.596
Token cloud
Token ID6279
Feature activation+4.53e-02

Pos logit contributions
 said+0.010
 asked+0.010
 write+0.010
 say+0.009
 wrote+0.009
Neg logit contributions
ocol-0.012
icer-0.011
 degrees-0.011
Too-0.010
 centimeters-0.010
Token.
Token ID13
Feature activation+5.45e-02

Pos logit contributions
 know+0.046
 asked+0.046
 said+0.045
 say+0.045
 write+0.044
Neg logit contributions
ocol-0.111
 degrees-0.095
ions-0.094
ar-0.092
 centimeters-0.090
Token "
Token ID366
Feature activation-8.69e-02

Pos logit contributions
 said+0.075
 asked+0.071
 say+0.068
 mother+0.065
 lost+0.064
Neg logit contributions
ocol-0.134
igs-0.109
 degrees-0.108
icer-0.107
ions-0.106
TokenCome
Token ID16773
Feature activation-1.71e-01

Pos logit contributions
ocol+0.202
igs+0.166
 degrees+0.157
icer+0.152
mm+0.150
Neg logit contributions
 said-0.145
 asked-0.144
 understand-0.131
 say-0.130
 know-0.125
Token.
Token ID13
Feature activation+3.15e-02

Pos logit contributions
ocol+0.064
ions+0.054
igs+0.053
ipping+0.053
icer+0.052
Neg logit contributions
 said-0.029
 know-0.028
 asked-0.027
 understand-0.027
 say-0.027
Token Every
Token ID3887
Feature activation+1.12e-01

Pos logit contributions
 said+0.046
 asked+0.044
 say+0.041
 mother+0.039
 know+0.038
Neg logit contributions
ocol-0.078
igs-0.063
 degrees-0.062
icer-0.062
ipping-0.061
Token day
Token ID1110
Feature activation+1.10e-03

Pos logit contributions
 said+0.155
 asked+0.146
 say+0.142
 understand+0.134
 know+0.133
Neg logit contributions
ocol-0.276
igs-0.226
ions-0.220
icer-0.220
 degrees-0.218
Token,
Token ID11
Feature activation-6.73e-02

Pos logit contributions
 said+0.001
 asked+0.001
 say+0.001
 mother+0.001
 lost+0.001
Neg logit contributions
ocol-0.003
igs-0.003
icer-0.002
 degrees-0.002
ipping-0.002
Token the
Token ID262
Feature activation+9.70e-03

Pos logit contributions
ocol+0.185
igs+0.154
 degrees+0.152
icer+0.152
ions+0.149
Neg logit contributions
 said-0.077
 asked-0.072
 say-0.068
 mother-0.063
 understand-0.062
Token father
Token ID2988
Feature activation+2.89e-01

Pos logit contributions
 said+0.010
 asked+0.009
 say+0.009
 understand+0.009
 know+0.009
Neg logit contributions
ocol-0.025
igs-0.022
icer-0.021
 algae-0.021
ions-0.021
Token put
Token ID1234
Feature activation-9.10e-03

Pos logit contributions
 know+0.276
 say+0.273
 wish+0.269
 understand+0.267
 said+0.265
Neg logit contributions
ocol-0.698
igs-0.612
ions-0.604
orns-0.595
ipping-0.584
Token on
Token ID319
Feature activation+2.61e-02

Pos logit contributions
ocol+0.026
igs+0.022
 degrees+0.022
icer+0.021
 centimeters+0.021
Neg logit contributions
 said-0.010
 asked-0.009
 say-0.009
 lost-0.008
 mother-0.008
Token his
Token ID465
Feature activation-1.38e-02

Pos logit contributions
 said+0.031
 asked+0.030
 say+0.028
 mother+0.026
 dad+0.026
Neg logit contributions
ocol-0.071
 degrees-0.060
igs-0.059
icer-0.059
mm-0.059
Token work
Token ID670
Feature activation-6.71e-02

Pos logit contributions
ocol+0.035
igs+0.029
 degrees+0.029
ipping+0.028
icer+0.028
Neg logit contributions
 said-0.017
 asked-0.016
 say-0.015
 know-0.015
 understand-0.014
Token clothes
Token ID8242
Feature activation-6.21e-02

Pos logit contributions
ocol+0.184
igs+0.156
 degrees+0.153
ions+0.152
icer+0.151
Neg logit contributions
 said-0.071
 asked-0.067
 say-0.063
 mother-0.061
 know-0.059
Token upon
Token ID2402
Feature activation-5.21e-02

Pos logit contributions
ocol+0.029
icer+0.024
igs+0.023
ips+0.020
ions+0.020
Neg logit contributions
 write-0.035
 say-0.035
 whisper-0.034
 understand-0.033
 read-0.033
Token a
Token ID257
Feature activation+8.82e-02

Pos logit contributions
ocol+0.124
 centimeters+0.104
 algae+0.103
igs+0.103
icer+0.102
Neg logit contributions
 said-0.064
 asked-0.061
 mother-0.060
 say-0.058
 father-0.056
Token time
Token ID640
Feature activation-1.19e-01

Pos logit contributions
 said+0.145
 asked+0.138
 say+0.131
 understand+0.126
 mother+0.126
Neg logit contributions
ocol-0.203
igs-0.159
ions-0.158
 degrees-0.158
ipping-0.154
Token,
Token ID11
Feature activation-6.09e-02

Pos logit contributions
ocol+0.374
 degrees+0.320
arine+0.316
igs+0.314
icer+0.310
Neg logit contributions
 said-0.108
 asked-0.093
 mom-0.083
 lost-0.082
 say-0.079
Token a
Token ID257
Feature activation+8.51e-02

Pos logit contributions
ocol+0.144
icer+0.121
igs+0.119
ions+0.116
 degrees+0.114
Neg logit contributions
 said-0.087
 say-0.082
 asked-0.082
 know-0.077
 understand-0.077
Token mother
Token ID2802
Feature activation+2.88e-01

Pos logit contributions
 said+0.055
 asked+0.048
 say+0.042
 mother+0.037
 understand+0.035
Neg logit contributions
ocol-0.278
igs-0.239
 degrees-0.236
icer-0.235
ipping-0.232
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 understand+0.326
 mother+0.308
 said+0.298
 say+0.292
 grand+0.290
Neg logit contributions
ocol-0.730
ions-0.598
orns-0.598
icer-0.590
mm-0.585
Token a
Token ID257
Feature activation+8.34e-02

Pos logit contributions
 said+0.019
 understand+0.018
 say+0.018
 asked+0.018
 lost+0.017
Neg logit contributions
ocol-0.036
ions-0.029
igs-0.029
mm-0.028
 algae-0.028
Token daughter
Token ID4957
Feature activation+2.16e-01

Pos logit contributions
 said+0.074
 asked+0.068
 say+0.065
 know+0.062
 understand+0.059
Neg logit contributions
ocol-0.240
igs-0.200
icer-0.199
 degrees-0.199
ipping-0.193
Token were
Token ID547
Feature activation+1.26e-02

Pos logit contributions
 said+0.284
 asked+0.266
 say+0.262
 understand+0.259
 know+0.255
Neg logit contributions
ocol-0.530
ions-0.426
icer-0.423
ipping-0.419
igs-0.418
Token sitting
Token ID5586
Feature activation-4.38e-02

Pos logit contributions
 said+0.021
 asked+0.020
 say+0.019
 mother+0.018
 understand+0.018
Neg logit contributions
ocol-0.028
igs-0.023
icer-0.022
 degrees-0.022
ipping-0.022
Token day
Token ID1110
Feature activation+1.90e-03

Pos logit contributions
 said+0.136
 asked+0.128
 say+0.117
 mother+0.113
 lost+0.109
Neg logit contributions
ocol-0.334
igs-0.279
 degrees-0.275
icer-0.273
ipping-0.268
Token,
Token ID11
Feature activation-6.42e-02

Pos logit contributions
 said+0.002
 asked+0.002
 say+0.002
 mother+0.002
 mom+0.002
Neg logit contributions
ocol-0.005
igs-0.005
icer-0.004
 degrees-0.004
ipping-0.004
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
ocol+0.168
igs+0.138
ions+0.135
icer+0.135
 degrees+0.134
Neg logit contributions
 said-0.079
 asked-0.074
 say-0.071
 understand-0.068
 know-0.067
Token mother
Token ID2802
Feature activation+2.82e-01

Pos logit contributions
 said+0.015
 asked+0.013
 say+0.013
 know+0.013
 understand+0.012
Neg logit contributions
ocol-0.041
icer-0.035
 algae-0.035
 degrees-0.035
dogs-0.034
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
 understand+0.306
 mother+0.302
 grand+0.295
 say+0.294
 know+0.292
Neg logit contributions
ocol-0.618
ions-0.563
orns-0.561
igs-0.556
ins-0.529
Token father
Token ID2988
Feature activation+2.88e-01

Pos logit contributions
 said+0.011
 say+0.010
 asked+0.009
 understand+0.009
 know+0.009
Neg logit contributions
ocol-0.027
igs-0.022
ions-0.022
 degrees-0.021
 algae-0.021
Token tried
Token ID3088
Feature activation+2.68e-02

Pos logit contributions
 mother+0.306
 said+0.278
 woman+0.274
 say+0.271
 wish+0.269
Neg logit contributions
ocol-0.654
ions-0.576
arine-0.548
orns-0.547
igs-0.547
Token to
Token ID284
Feature activation+8.83e-02

Pos logit contributions
 said+0.016
 asked+0.014
 say+0.012
 mother+0.011
 understand+0.010
Neg logit contributions
ocol-0.089
igs-0.076
icer-0.076
 degrees-0.075
ipping-0.074
Token make
Token ID787
Feature activation+4.52e-03

Pos logit contributions
 said+0.104
 asked+0.089
 mother+0.085
 woman+0.083
 lost+0.083
Neg logit contributions
ocol-0.222
igs-0.185
arine-0.183
 algae-0.177
 degrees-0.177
Token ends
Token ID5645
Feature activation+1.91e-02

Pos logit contributions
 said+0.006
 asked+0.005
 say+0.005
 know+0.005
 mother+0.005
Neg logit contributions
ocol-0.012
icer-0.010
igs-0.010
 degrees-0.010
ipping-0.010
Token meet
Token ID1826
Feature activation-1.74e-02

Pos logit contributions
 said+0.023
 asked+0.022
 say+0.020
 understand+0.019
 lost+0.019
Neg logit contributions
ocol-0.052
icer-0.045
mm-0.044
Too-0.044
 degrees-0.044
Token her
Token ID607
Feature activation+5.94e-02

Pos logit contributions
 said+0.034
 asked+0.033
 say+0.031
 mother+0.029
 lost+0.028
Neg logit contributions
ocol-0.077
 degrees-0.064
ions-0.063
igs-0.062
 centimeters-0.060
Token mother
Token ID2802
Feature activation+2.77e-01

Pos logit contributions
 asked+0.056
 wrote+0.048
 said+0.048
 know+0.046
 write+0.043
Neg logit contributions
ocol-0.135
 degrees-0.118
ipping-0.117
-0.117
igs-0.113
Token mention
Token ID3068
Feature activation+5.88e-02

Pos logit contributions
 said+0.353
 dad+0.345
 grand+0.343
 asked+0.339
 mother+0.337
Neg logit contributions
ocol-0.634
ipping-0.497
 degrees-0.484
 centimeters-0.470
 dams-0.462
Token to
Token ID284
Feature activation+8.56e-02

Pos logit contributions
 said+0.071
 asked+0.068
 say+0.062
 know+0.061
 lost+0.060
Neg logit contributions
ocol-0.155
 degrees-0.125
ions-0.125
igs-0.123
ipping-0.120
Token her
Token ID607
Feature activation+5.95e-02

Pos logit contributions
 said+0.101
 asked+0.094
 lost+0.084
 say+0.082
 mother+0.081
Neg logit contributions
ocol-0.225
 degrees-0.181
igs-0.179
ions-0.178
arine-0.176
Token father
Token ID2988
Feature activation+2.87e-01

Pos logit contributions
 asked+0.072
 said+0.070
 know+0.060
 lost+0.059
 rang+0.058
Neg logit contributions
ocol-0.139
 degrees-0.114
ions-0.114
ipping-0.111
igs-0.109
Token that
Token ID326
Feature activation+1.89e-01

Pos logit contributions
 said+0.328
 asked+0.316
 lost+0.282
 mother+0.280
 say+0.279
Neg logit contributions
ocol-0.767
 degrees-0.621
ions-0.610
igs-0.609
ipping-0.605
Token there
Token ID612
Feature activation+6.10e-02

Pos logit contributions
 said+0.200
 asked+0.197
 say+0.189
 know+0.187
 mother+0.187
Neg logit contributions
ocol-0.485
ions-0.407
 degrees-0.398
igs-0.394
orns-0.389
Token was
Token ID373
Feature activation-1.25e-02

Pos logit contributions
 said+0.062
 asked+0.061
 mother+0.057
 say+0.056
 know+0.054
Neg logit contributions
ocol-0.160
ions-0.133
 degrees-0.133
icer-0.130
igs-0.130
Token trouble
Token ID5876
Feature activation-1.00e-01

Pos logit contributions
ocol+0.038
igs+0.032
icer+0.032
 degrees+0.031
ipping+0.031
Neg logit contributions
 said-0.012
 asked-0.011
 say-0.009
 mother-0.009
 lost-0.009
Token.
Token ID13
Feature activation+3.59e-02

Pos logit contributions
ocol+0.305
igs+0.261
icer+0.259
mm+0.254
 degrees+0.253
Neg logit contributions
 said-0.091
 asked-0.083
 say-0.076
 lost-0.069
 understand-0.067
Token right
Token ID826
Feature activation-7.85e-03

Pos logit contributions
ocol+0.229
igs+0.189
 degrees+0.184
icer+0.184
ipping+0.182
Neg logit contributions
 said-0.118
 asked-0.110
 say-0.104
 mother-0.099
 understand-0.098
Token here
Token ID994
Feature activation-5.71e-02

Pos logit contributions
ocol+0.020
 degrees+0.016
igs+0.016
icer+0.016
ions+0.016
Neg logit contributions
 said-0.010
 asked-0.009
 say-0.009
 know-0.009
 understand-0.009
Token"
Token ID1
Feature activation-1.33e-01

Pos logit contributions
ocol+0.140
 degrees+0.121
 centimeters+0.118
igs+0.117
ions+0.116
Neg logit contributions
 said-0.067
 asked-0.064
 say-0.062
 mother-0.061
 woman-0.059
Token his
Token ID465
Feature activation-1.51e-02

Pos logit contributions
ocol+0.344
mm+0.303
 degrees+0.299
ions+0.294
orns+0.293
Neg logit contributions
 wish-0.114
 lost-0.112
 mother-0.111
 know-0.107
 asked-0.107
Token Mom
Token ID11254
Feature activation+1.19e-01

Pos logit contributions
ocol+0.037
mm+0.032
 centimeters+0.032
+0.032
 degrees+0.031
Neg logit contributions
 asked-0.015
 said-0.015
 understand-0.014
 wrote-0.013
 whispered-0.013
Token said
Token ID531
Feature activation+2.87e-01

Pos logit contributions
 spirit+0.080
 wish+0.078
 understand+0.078
 woman+0.073
 pilot+0.072
Neg logit contributions
ocol-0.294
 degrees-0.270
mm-0.262
arine-0.259
ions-0.256
Token,
Token ID11
Feature activation-6.64e-02

Pos logit contributions
 said+0.301
 asked+0.278
 say+0.257
 mother+0.240
 understand+0.234
Neg logit contributions
ocol-0.826
igs-0.694
icer-0.690
 degrees-0.678
ipping-0.670
Token pointing
Token ID10609
Feature activation+4.80e-03

Pos logit contributions
ocol+0.184
igs+0.152
icer+0.149
ipping+0.147
 degrees+0.147
Neg logit contributions
 said-0.086
 asked-0.079
 say-0.073
 mother-0.067
 dad-0.066
Token to
Token ID284
Feature activation+8.47e-02

Pos logit contributions
 said+0.005
 asked+0.005
 say+0.005
 understand+0.004
 know+0.004
Neg logit contributions
ocol-0.013
 degrees-0.011
igs-0.010
icer-0.010
arine-0.010
Token the
Token ID262
Feature activation+9.77e-03

Pos logit contributions
 said+0.093
 asked+0.085
 say+0.077
 mother+0.075
 lost+0.072
Neg logit contributions
ocol-0.234
igs-0.192
 degrees-0.191
icer-0.189
arine-0.187
Token entrance
Token ID10384
Feature activation+9.92e-03

Pos logit contributions
 said+0.019
 asked+0.018
 say+0.017
 understand+0.017
 mother+0.016
Neg logit contributions
ocol-0.019
icer-0.015
igs-0.014
 degrees-0.014
 centimeters-0.014
TokenI
Token ID40
Feature activation+4.36e-02

Pos logit contributions
ocol+0.271
Too+0.229
ions+0.223
icer+0.220
+0.217
Neg logit contributions
 said-0.225
 say-0.219
 asked-0.210
 smile-0.209
 mother-0.209
Token'm
Token ID1101
Feature activation-9.01e-02

Pos logit contributions
 mother+0.055
 said+0.054
 woman+0.051
 asked+0.050
 say+0.050
Neg logit contributions
ocol-0.107
 degrees-0.084
icer-0.084
igs-0.083
arine-0.081
Token adding
Token ID4375
Feature activation-1.46e-01

Pos logit contributions
ocol+0.202
igs+0.158
 degrees+0.155
icer+0.154
ions+0.151
Neg logit contributions
 said-0.150
 asked-0.142
 say-0.137
 mother-0.132
 understand-0.132
Token yarn
Token ID21181
Feature activation-3.95e-02

Pos logit contributions
ocol+0.334
igs+0.268
icer+0.267
 degrees+0.266
ipping+0.262
Neg logit contributions
 said-0.237
 asked-0.223
 say-0.214
 mother-0.201
 know-0.200
Token"
Token ID1
Feature activation-1.37e-01

Pos logit contributions
ocol+0.110
igs+0.095
ions+0.094
 degrees+0.092
orns+0.090
Neg logit contributions
 said-0.038
 asked-0.037
 say-0.034
 mother-0.033
 understand-0.033
Token said
Token ID531
Feature activation+2.87e-01

Pos logit contributions
ocol+0.311
mm+0.278
 degrees+0.272
ions+0.270
orns+0.264
Neg logit contributions
 mother-0.147
 lost-0.146
 wish-0.143
 father-0.140
 asked-0.138
Token grandma
Token ID49890
Feature activation+1.77e-01

Pos logit contributions
 said+0.380
 asked+0.357
 say+0.336
 mother+0.318
 know+0.310
Neg logit contributions
ocol-0.739
igs-0.612
icer-0.603
 degrees-0.599
ipping-0.590
Token.
Token ID13
Feature activation+6.41e-02

Pos logit contributions
 said+0.204
 asked+0.199
 say+0.185
 understand+0.183
 lost+0.181
Neg logit contributions
ocol-0.458
 degrees-0.374
ions-0.373
igs-0.371
arine-0.365
Token
Token ID198
Feature activation+2.86e-02

Pos logit contributions
 said+0.087
 lost+0.082
 say+0.081
 asked+0.080
 understand+0.079
Neg logit contributions
ocol-0.150
icer-0.120
mm-0.120
ions-0.119
 degrees-0.119
Token
Token ID198
Feature activation+2.80e-02

Pos logit contributions
 said+0.055
 lost+0.053
 asked+0.052
 say+0.052
 smile+0.050
Neg logit contributions
ocol-0.049
icer-0.038
ions-0.038
igs-0.037
 degrees-0.037
Token"
Token ID1
Feature activation-1.38e-01

Pos logit contributions
 said+0.054
 lost+0.051
 asked+0.050
 say+0.050
 know+0.048
Neg logit contributions
ocol-0.051
 degrees-0.039
igs-0.038
icer-0.038
ions-0.038
Token,
Token ID11
Feature activation-5.80e-02

Pos logit contributions
 asked+0.130
 said+0.128
 mother+0.126
 woman+0.122
 lost+0.118
Neg logit contributions
ocol-0.192
icer-0.154
ions-0.153
mm-0.151
ipping-0.150
Tokenâ
Token ID22940
Feature activation-9.20e-02

Pos logit contributions
ocol+0.134
ions+0.106
igs+0.103
 algae+0.103
 degrees+0.102
Neg logit contributions
 said-0.079
 asked-0.077
 mother-0.072
 lost-0.071
 say-0.068
Token
Token ID26391
Feature activation-4.73e-02

Pos logit contributions
ocol+0.193
igs+0.154
 degrees+0.149
icer+0.146
ions+0.142
Neg logit contributions
 said-0.172
 asked-0.165
 say-0.160
 understand-0.147
 know-0.147
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
ocol+0.110
igs+0.089
 degrees+0.087
icer+0.087
ipping+0.085
Neg logit contributions
 said-0.074
 asked-0.071
 say-0.067
 mother-0.064
 lost-0.063
Token boy
Token ID2933
Feature activation+1.48e-01

Pos logit contributions
 said+0.030
 asked+0.029
 say+0.027
 know+0.027
 understand+0.027
Neg logit contributions
ocol-0.053
icer-0.042
igs-0.042
 degrees-0.042
ions-0.041
Token said
Token ID531
Feature activation+2.86e-01

Pos logit contributions
 mother+0.138
 said+0.138
 understand+0.137
 asked+0.132
 say+0.129
Neg logit contributions
ocol-0.398
 degrees-0.329
igs-0.325
icer-0.324
ipping-0.320
Token.
Token ID13
Feature activation+3.29e-02

Pos logit contributions
 said+0.381
 asked+0.364
 say+0.343
 mother+0.327
 know+0.324
Neg logit contributions
ocol-0.720
igs-0.581
 degrees-0.581
ipping-0.563
ions-0.563
Token
Token ID6184
Feature activation-1.50e-01

Pos logit contributions
 said+0.045
 asked+0.043
 say+0.039
 mother+0.037
 know+0.036
Neg logit contributions
ocol-0.086
igs-0.072
 degrees-0.070
icer-0.069
ipping-0.068
Token
Token ID95
Feature activation-1.35e-01

Pos logit contributions
ocol+0.269
orns+0.223
igs+0.218
ions+0.209
 degrees+0.208
Neg logit contributions
 said-0.297
 asked-0.287
 say-0.286
 open-0.268
 write-0.263
Token
Token ID26391
Feature activation-5.91e-02

Pos logit contributions
ocol+0.288
igs+0.240
ions+0.218
 degrees+0.218
orns+0.212
Neg logit contributions
 said-0.248
 asked-0.239
 say-0.235
 write-0.218
 know-0.216
Token
Token ID129
Feature activation-2.01e-01

Pos logit contributions
ocol+0.155
igs+0.126
 degrees+0.124
icer+0.122
ions+0.122
Neg logit contributions
 said-0.081
 asked-0.075
 say-0.070
 know-0.067
 lost-0.066
Token crack
Token ID8469
Feature activation-1.79e-01

Pos logit contributions
 asked+0.023
 said+0.023
 know+0.021
 understand+0.021
 say+0.020
Neg logit contributions
ocol-0.029
icer-0.024
 degrees-0.023
 algae-0.023
igs-0.023
Tokenling
Token ID1359
Feature activation+4.44e-02

Pos logit contributions
ocol+0.475
igs+0.395
 degrees+0.389
icer+0.387
ions+0.380
Neg logit contributions
 said-0.219
 asked-0.207
 say-0.194
 mother-0.184
 understand-0.181
Token fire
Token ID2046
Feature activation-1.13e-01

Pos logit contributions
 said+0.053
 asked+0.050
 say+0.047
 mother+0.045
 understand+0.044
Neg logit contributions
ocol-0.118
igs-0.098
 degrees-0.097
icer-0.096
ions-0.095
Token,
Token ID11
Feature activation-6.63e-02

Pos logit contributions
ocol+0.270
 degrees+0.220
igs+0.219
icer+0.218
ions+0.216
Neg logit contributions
 said-0.157
 asked-0.153
 say-0.144
 understand-0.143
 mother-0.140
Token her
Token ID607
Feature activation+5.99e-02

Pos logit contributions
ocol+0.169
igs+0.139
icer+0.137
 degrees+0.136
ipping+0.134
Neg logit contributions
 said-0.091
 asked-0.086
 say-0.081
 mother-0.076
 lost-0.075
Token father
Token ID2988
Feature activation+2.86e-01

Pos logit contributions
 asked+0.057
 said+0.055
 say+0.051
 know+0.049
 understand+0.048
Neg logit contributions
ocol-0.163
 degrees-0.139
igs-0.137
 centimeters-0.133
icer-0.130
Token said
Token ID531
Feature activation+2.71e-01

Pos logit contributions
 said+0.267
 asked+0.256
 mother+0.254
 understand+0.248
 say+0.244
Neg logit contributions
ocol-0.772
 degrees-0.636
ipping-0.630
igs-0.629
arine-0.625
Token,
Token ID11
Feature activation-7.81e-02

Pos logit contributions
 said+0.340
 asked+0.317
 say+0.295
 mother+0.279
 understand+0.275
Neg logit contributions
ocol-0.729
igs-0.608
icer-0.606
 degrees-0.591
ipping-0.583
Token "
Token ID366
Feature activation-8.30e-02

Pos logit contributions
ocol+0.194
igs+0.159
icer+0.158
 degrees+0.152
ipping+0.152
Neg logit contributions
 said-0.119
 asked-0.113
 say-0.106
 mother-0.101
 understand-0.099
TokenTomorrow
Token ID49488
Feature activation-4.42e-03

Pos logit contributions
ocol+0.194
igs+0.163
 degrees+0.152
icer+0.145
ipping+0.144
Neg logit contributions
 said-0.141
 asked-0.138
 say-0.125
 understand-0.122
 mother-0.118
Token night
Token ID1755
Feature activation+2.42e-02

Pos logit contributions
ocol+0.011
igs+0.009
 degrees+0.009
icer+0.009
mm+0.009
Neg logit contributions
 said-0.006
 asked-0.006
 say-0.005
 mother-0.005
 understand-0.005
Token hair
Token ID4190
Feature activation-2.62e-02

Pos logit contributions
 said+0.118
 asked+0.114
 say+0.105
 understand+0.103
 know+0.099
Neg logit contributions
ocol-0.167
igs-0.130
icer-0.129
 degrees-0.128
arine-0.126
Tokenly
Token ID306
Feature activation+9.02e-02

Pos logit contributions
ocol+0.061
igs+0.048
mm+0.048
icer+0.048
ipping+0.047
Neg logit contributions
 said-0.038
 asked-0.038
 know-0.034
 understand-0.034
 lost-0.034
Token fruit
Token ID8234
Feature activation-1.01e-01

Pos logit contributions
 said+0.117
 asked+0.111
 say+0.103
 understand+0.100
 mother+0.099
Neg logit contributions
ocol-0.232
igs-0.189
 degrees-0.186
icer-0.184
arine-0.182
Token?"
Token ID1701
Feature activation+1.19e-01

Pos logit contributions
ocol+0.280
igs+0.234
icer+0.231
 degrees+0.228
ions+0.227
Neg logit contributions
 said-0.109
 asked-0.103
 say-0.095
 understand-0.092
 mother-0.091
Token Her
Token ID2332
Feature activation+2.84e-02

Pos logit contributions
 lost+0.155
 said+0.149
 say+0.142
 know+0.140
 understand+0.140
Neg logit contributions
ocol-0.259
-0.210
-0.209
ipping-0.206
icer-0.205
Token father
Token ID2988
Feature activation+2.86e-01

Pos logit contributions
case+0.025
 say+0.025
 rang+0.025
 said+0.024
 know+0.024
Neg logit contributions
ocol-0.064
icer-0.058
 degrees-0.057
ipping-0.055
igs-0.054
Token smiled
Token ID13541
Feature activation+1.50e-01

Pos logit contributions
 know+0.235
 wish+0.219
 understand+0.214
 lost+0.212
 parent+0.208
Neg logit contributions
ocol-0.725
ipping-0.622
ar-0.615
 degrees-0.605
omp-0.598
Token and
Token ID290
Feature activation+1.12e-02

Pos logit contributions
 know+0.204
 said+0.202
 say+0.198
 asked+0.191
 write+0.186
Neg logit contributions
ocol-0.319
ipping-0.274
 degrees-0.251
icer-0.249
 centimeters-0.247
Token said
Token ID531
Feature activation+2.76e-01

Pos logit contributions
 understand+0.007
 know+0.006
 lost+0.006
 parent+0.006
 spirit+0.005
Neg logit contributions
ocol-0.030
mm-0.027
corn-0.027
igs-0.026
ar-0.026
Token,
Token ID11
Feature activation-7.56e-02

Pos logit contributions
 said+0.328
 asked+0.303
 say+0.276
 mother+0.260
 understand+0.255
Neg logit contributions
ocol-0.764
icer-0.645
igs-0.643
 degrees-0.625
orns-0.624
Token "
Token ID366
Feature activation-8.88e-02

Pos logit contributions
ocol+0.184
igs+0.150
icer+0.147
 degrees+0.145
ions+0.142
Neg logit contributions
 said-0.116
 asked-0.111
 say-0.103
 mother-0.099
 understand-0.096
Token in
Token ID287
Feature activation+5.46e-02

Pos logit contributions
ocol+0.476
igs+0.387
 degrees+0.385
mm+0.384
icer+0.381
Neg logit contributions
 said-0.185
 asked-0.167
 say-0.153
 lost-0.146
 dad-0.139
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
 said+0.046
 asked+0.044
 say+0.038
 mother+0.035
 lost+0.035
Neg logit contributions
ocol-0.169
igs-0.144
 degrees-0.142
icer-0.141
ipping-0.141
Token pond
Token ID16723
Feature activation-7.11e-02

Pos logit contributions
 said+0.018
 asked+0.017
 say+0.016
 lost+0.015
 mother+0.015
Neg logit contributions
ocol-0.028
igs-0.022
 degrees-0.022
ions-0.021
icer-0.021
Token with
Token ID351
Feature activation-6.33e-02

Pos logit contributions
ocol+0.196
igs+0.164
 degrees+0.161
icer+0.161
ipping+0.158
Neg logit contributions
 said-0.081
 asked-0.076
 say-0.071
 mother-0.066
 lost-0.065
Token her
Token ID607
Feature activation+6.19e-02

Pos logit contributions
ocol+0.167
igs+0.137
ions+0.135
 degrees+0.132
orns+0.132
Neg logit contributions
 said-0.074
 asked-0.068
 say-0.067
 understand-0.062
 mother-0.060
Token father
Token ID2988
Feature activation+2.86e-01

Pos logit contributions
 asked+0.063
 understand+0.061
 know+0.060
 say+0.058
 write+0.057
Neg logit contributions
ocol-0.153
 rods-0.138
-0.137
igs-0.137
icer-0.133
Token.
Token ID13
Feature activation+4.13e-02

Pos logit contributions
 said+0.319
 asked+0.301
 say+0.281
 know+0.280
 understand+0.279
Neg logit contributions
ocol-0.755
 degrees-0.613
igs-0.611
icer-0.606
ions-0.605
Token She
Token ID1375
Feature activation+3.90e-02

Pos logit contributions
 said+0.062
 asked+0.058
 say+0.056
 lost+0.055
 understand+0.053
Neg logit contributions
ocol-0.095
icer-0.076
igs-0.076
 degrees-0.076
arine-0.074
Token was
Token ID373
Feature activation-1.52e-02

Pos logit contributions
 say+0.032
 understand+0.032
 mother+0.030
 aunt+0.029
 angel+0.029
Neg logit contributions
ocol-0.096
igs-0.088
icer-0.086
 degrees-0.085
Spl-0.084
Token trying
Token ID2111
Feature activation-6.07e-02

Pos logit contributions
ocol+0.039
igs+0.033
icer+0.032
 degrees+0.032
 centimeters+0.032
Neg logit contributions
 said-0.017
 say-0.016
 asked-0.016
 mother-0.015
 understand-0.015
Token to
Token ID284
Feature activation+8.42e-02

Pos logit contributions
ocol+0.214
icer+0.183
igs+0.181
 degrees+0.181
mm+0.180
Neg logit contributions
 said-0.029
 asked-0.024
 say-0.019
 lost-0.017
 mother-0.016
Token.
Token ID13
Feature activation+5.28e-02

Pos logit contributions
 said+0.032
 asked+0.030
 say+0.029
 know+0.028
 understand+0.028
Neg logit contributions
ocol-0.078
igs-0.065
 degrees-0.065
icer-0.065
ipping-0.064
Token Every
Token ID3887
Feature activation+1.14e-01

Pos logit contributions
 said+0.070
 asked+0.069
 say+0.062
 grand+0.059
 mother+0.059
Neg logit contributions
ocol-0.140
igs-0.119
ipping-0.117
 degrees-0.117
 centimeters-0.112
Token day
Token ID1110
Feature activation-3.48e-03

Pos logit contributions
 said+0.133
 asked+0.125
 say+0.113
 mother+0.110
 lost+0.107
Neg logit contributions
ocol-0.325
igs-0.268
 degrees-0.267
icer-0.264
mm-0.259
Token,
Token ID11
Feature activation-6.97e-02

Pos logit contributions
ocol+0.010
icer+0.008
igs+0.008
 degrees+0.008
ipping+0.008
Neg logit contributions
 said-0.004
 asked-0.004
 say-0.003
 mother-0.003
 mom-0.003
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
ocol+0.184
igs+0.152
icer+0.149
 degrees+0.148
ions+0.147
Neg logit contributions
 said-0.086
 asked-0.080
 say-0.076
 understand-0.072
 know-0.072
Token father
Token ID2988
Feature activation+2.86e-01

Pos logit contributions
 said+0.011
 know+0.010
 asked+0.010
 say+0.009
 answered+0.009
Neg logit contributions
ocol-0.027
 algae-0.024
dogs-0.024
icer-0.023
 degrees-0.023
Token would
Token ID561
Feature activation+4.79e-02

Pos logit contributions
 mother+0.277
 know+0.275
 say+0.270
 wish+0.264
 understand+0.261
Neg logit contributions
ocol-0.678
igs-0.604
orns-0.591
ions-0.585
ipping-0.575
Token go
Token ID467
Feature activation-6.66e-02

Pos logit contributions
 said+0.054
 asked+0.046
 mother+0.044
 lost+0.043
 woman+0.042
Neg logit contributions
ocol-0.123
igs-0.102
arine-0.101
ipping-0.100
 degrees-0.099
Token to
Token ID284
Feature activation+8.58e-02

Pos logit contributions
ocol+0.190
igs+0.166
ipping+0.161
icer+0.159
ions+0.158
Neg logit contributions
 said-0.061
 asked-0.055
 mother-0.051
 say-0.048
 understand-0.048
Token collect
Token ID2824
Feature activation-1.50e-01

Pos logit contributions
 said+0.108
 asked+0.097
 mother+0.092
 answered+0.084
 understand+0.084
Neg logit contributions
ocol-0.219
igs-0.179
icer-0.176
arine-0.173
ions-0.170
Token lunch
Token ID9965
Feature activation-1.89e-01

Pos logit contributions
ocol+0.331
 degrees+0.263
igs+0.263
icer+0.262
ipping+0.257
Neg logit contributions
 said-0.256
 asked-0.242
 say-0.231
 lost-0.221
 know-0.217
Token don
Token ID836
Feature activation-1.66e-01

Pos logit contributions
 mother+0.075
 said+0.071
 father+0.070
 woman+0.070
 asked+0.069
Neg logit contributions
ocol-0.111
icer-0.087
 degrees-0.085
Too-0.085
arine-0.084
Token't
Token ID470
Feature activation-1.03e-02

Pos logit contributions
ocol+0.405
igs+0.327
 degrees+0.322
icer+0.321
ipping+0.312
Neg logit contributions
 said-0.245
 asked-0.234
 say-0.221
 mother-0.208
 understand-0.207
Token like
Token ID588
Feature activation-7.55e-03

Pos logit contributions
ocol+0.028
igs+0.023
 degrees+0.022
icer+0.022
ipping+0.022
Neg logit contributions
 said-0.012
 asked-0.012
 say-0.011
 mother-0.010
 lost-0.010
Token what
Token ID644
Feature activation+8.72e-03

Pos logit contributions
ocol+0.021
igs+0.017
icer+0.017
 degrees+0.017
mm+0.017
Neg logit contributions
 said-0.009
 asked-0.009
 say-0.008
 lost-0.008
 dad-0.008
Token you
Token ID345
Feature activation-2.07e-02

Pos logit contributions
 said+0.012
 asked+0.012
 say+0.011
 mother+0.011
 understand+0.011
Neg logit contributions
ocol-0.021
igs-0.017
 degrees-0.017
ions-0.017
icer-0.017
Token said
Token ID531
Feature activation+2.85e-01

Pos logit contributions
ocol+0.053
icer+0.043
 degrees+0.042
arine+0.042
ipping+0.042
Neg logit contributions
 mother-0.023
 woman-0.023
 grandmother-0.022
 asked-0.022
 said-0.022
Token,
Token ID11
Feature activation-6.81e-02

Pos logit contributions
 said+0.357
 asked+0.338
 say+0.315
 mother+0.300
 lost+0.294
Neg logit contributions
ocol-0.751
igs-0.620
 degrees-0.613
icer-0.608
ipping-0.598
Token it
Token ID340
Feature activation-2.86e-02

Pos logit contributions
ocol+0.175
igs+0.144
icer+0.143
 degrees+0.140
ipping+0.140
Neg logit contributions
 said-0.095
 asked-0.089
 say-0.086
 mother-0.081
 understand-0.079
Token made
Token ID925
Feature activation+1.95e-02

Pos logit contributions
ocol+0.083
igs+0.072
 degrees+0.072
icer+0.071
ipping+0.070
Neg logit contributions
 said-0.024
 asked-0.022
 mother-0.020
 say-0.020
 dad-0.019
Token me
Token ID502
Feature activation+1.15e-02

Pos logit contributions
 said+0.032
 asked+0.030
 say+0.029
 know+0.027
 mother+0.027
Neg logit contributions
ocol-0.046
igs-0.036
icer-0.036
 degrees-0.036
ipping-0.035
Token mad
Token ID8805
Feature activation+1.53e-02

Pos logit contributions
 asked+0.013
 said+0.012
 mother+0.012
 father+0.011
 woman+0.011
Neg logit contributions
ocol-0.030
igs-0.025
 degrees-0.025
 centimeters-0.025
icer-0.024
Token said
Token ID531
Feature activation+2.74e-01

Pos logit contributions
 daddy+0.094
 know+0.085
 dad+0.085
 write+0.085
 father+0.083
Neg logit contributions
ocol-0.246
ar-0.228
Too-0.225
izing-0.222
 degrees-0.220
Token again
Token ID757
Feature activation-4.59e-02

Pos logit contributions
 said+0.349
 asked+0.329
 say+0.309
 mother+0.292
 understand+0.287
Neg logit contributions
ocol-0.719
igs-0.593
 degrees-0.584
icer-0.583
ipping-0.571
Token,
Token ID11
Feature activation-7.39e-02

Pos logit contributions
ocol+0.125
igs+0.104
icer+0.103
 degrees+0.102
ipping+0.100
Neg logit contributions
 said-0.056
 asked-0.053
 say-0.049
 mother-0.046
 understand-0.045
Token "
Token ID366
Feature activation-8.83e-02

Pos logit contributions
ocol+0.188
igs+0.153
icer+0.151
 degrees+0.147
orns+0.146
Neg logit contributions
 said-0.111
 asked-0.106
 say-0.098
 mother-0.093
 lost-0.090
TokenI
Token ID40
Feature activation+4.56e-02

Pos logit contributions
ocol+0.181
igs+0.140
 degrees+0.138
icer+0.138
ipping+0.134
Neg logit contributions
 said-0.163
 asked-0.156
 say-0.150
 mother-0.145
 lost-0.143
Token said
Token ID531
Feature activation+2.85e-01

Pos logit contributions
 mother+0.067
 woman+0.065
 dad+0.064
 said+0.064
 father+0.064
Neg logit contributions
ocol-0.094
ar-0.074
icer-0.071
Too-0.071
igs-0.070
Token who
Token ID508
Feature activation+1.55e-01

Pos logit contributions
 said+0.395
 asked+0.377
 say+0.352
 mother+0.332
 understand+0.332
Neg logit contributions
ocol-0.707
igs-0.578
 degrees-0.574
icer-0.555
ions-0.554
Token's
Token ID338
Feature activation+4.76e-02

Pos logit contributions
 mother+0.206
 said+0.204
 asked+0.199
 say+0.192
 understand+0.188
Neg logit contributions
ocol-0.362
ions-0.298
igs-0.295
icer-0.288
 degrees-0.288
Token there
Token ID612
Feature activation+5.60e-02

Pos logit contributions
 said+0.065
 asked+0.061
 say+0.059
 understand+0.056
 know+0.055
Neg logit contributions
ocol-0.117
igs-0.097
 degrees-0.096
ions-0.093
arine-0.093
Token?"
Token ID1701
Feature activation+1.23e-01

Pos logit contributions
 said+0.072
 asked+0.071
 mother+0.067
 say+0.066
 understand+0.065
Neg logit contributions
ocol-0.129
 degrees-0.110
ions-0.107
icer-0.106
igs-0.106
Token
Token ID220
Feature activation-2.69e-03

Pos logit contributions
 mother+0.163
 said+0.157
 lost+0.156
 parent+0.156
 father+0.155
Neg logit contributions
ocol-0.255
Too-0.220
 degrees-0.220
mm-0.219
arine-0.216
Token sky
Token ID6766
Feature activation+8.57e-02

Pos logit contributions
 said+0.025
 asked+0.024
 understand+0.024
 say+0.023
 know+0.023
Neg logit contributions
ocol-0.026
icer-0.022
orns-0.021
igs-0.021
 algae-0.021
Token.
Token ID13
Feature activation+5.81e-02

Pos logit contributions
 said+0.084
 asked+0.078
 say+0.072
 mother+0.067
 understand+0.066
Neg logit contributions
ocol-0.248
igs-0.209
 degrees-0.207
icer-0.206
ipping-0.203
Token "
Token ID366
Feature activation-8.27e-02

Pos logit contributions
 said+0.083
 asked+0.079
 say+0.074
 mother+0.071
 lost+0.069
Neg logit contributions
ocol-0.147
igs-0.119
 degrees-0.118
icer-0.117
ipping-0.115
TokenLook
Token ID8567
Feature activation-1.20e-01

Pos logit contributions
ocol+0.186
igs+0.158
 degrees+0.146
ipping+0.140
mm+0.137
Neg logit contributions
 asked-0.146
 said-0.144
 say-0.131
 understand-0.129
 write-0.127
Token!",
Token ID40754
Feature activation+1.22e-01

Pos logit contributions
ocol+0.280
igs+0.225
ipping+0.222
icer+0.219
mm+0.216
Neg logit contributions
 asked-0.173
 said-0.169
 understand-0.163
 woman-0.159
 know-0.156
Token said
Token ID531
Feature activation+2.85e-01

Pos logit contributions
 understand+0.085
 pilot+0.073
 lost+0.071
 know+0.070
 aunt+0.069
Neg logit contributions
ocol-0.347
igs-0.301
ions-0.296
mm-0.290
 degrees-0.288
Token M
Token ID337
Feature activation-6.22e-02

Pos logit contributions
 said+0.476
 understand+0.474
 asked+0.466
 say+0.463
 know+0.459
Neg logit contributions
ocol-0.554
ions-0.426
igs-0.414
ipping-0.404
ar-0.395
Tokenandy
Token ID10757
Feature activation-5.76e-02

Pos logit contributions
ocol+0.123
igs+0.094
arine+0.092
icer+0.090
 degrees+0.090
Neg logit contributions
 said-0.123
 asked-0.115
 mother-0.112
 say-0.111
 dad-0.107
Token.
Token ID13
Feature activation+5.63e-02

Pos logit contributions
ocol+0.119
 degrees+0.099
igs+0.098
ipping+0.095
 centimeters+0.093
Neg logit contributions
 understand-0.075
 say-0.072
 wish-0.071
 grand-0.071
 know-0.071
Token "
Token ID366
Feature activation-8.45e-02

Pos logit contributions
 said+0.084
 asked+0.081
 say+0.075
 mother+0.072
 lost+0.070
Neg logit contributions
ocol-0.139
igs-0.112
 degrees-0.111
icer-0.110
ions-0.107
TokenIt
Token ID1026
Feature activation+1.46e-02

Pos logit contributions
ocol+0.179
igs+0.140
 degrees+0.136
icer+0.135
ipping+0.132
Neg logit contributions
 said-0.154
 asked-0.149
 say-0.142
 understand-0.136
 mother-0.136
Token in
Token ID287
Feature activation+6.00e-02

Pos logit contributions
ocol+0.314
igs+0.264
icer+0.262
ipping+0.257
 degrees+0.257
Neg logit contributions
 said-0.120
 asked-0.111
 say-0.105
 know-0.098
 understand-0.098
Token the
Token ID262
Feature activation+1.25e-02

Pos logit contributions
 said+0.053
 asked+0.049
 say+0.044
 mother+0.041
 lost+0.040
Neg logit contributions
ocol-0.182
igs-0.155
 degrees-0.153
icer-0.152
ipping-0.150
Token garden
Token ID11376
Feature activation+2.50e-02

Pos logit contributions
 said+0.019
 asked+0.018
 say+0.017
 understand+0.017
 know+0.016
Neg logit contributions
ocol-0.030
icer-0.025
igs-0.024
ipping-0.023
 degrees-0.023
Token with
Token ID351
Feature activation-5.72e-02

Pos logit contributions
 said+0.033
 asked+0.031
 say+0.029
 mother+0.027
 lost+0.027
Neg logit contributions
ocol-0.065
igs-0.054
 degrees-0.053
icer-0.052
ions-0.051
Token her
Token ID607
Feature activation+6.59e-02

Pos logit contributions
ocol+0.144
ions+0.116
igs+0.115
 algae+0.110
icer+0.109
Neg logit contributions
 said-0.073
 asked-0.067
 say-0.066
 understand-0.062
 woman-0.060
Token father
Token ID2988
Feature activation+2.85e-01

Pos logit contributions
 know+0.074
 understand+0.073
 say+0.069
 asked+0.068
 write+0.064
Neg logit contributions
ocol-0.160
-0.145
igs-0.143
mons-0.141
icer-0.139
Token,
Token ID11
Feature activation-7.02e-02

Pos logit contributions
 said+0.333
 asked+0.313
 say+0.296
 know+0.285
 understand+0.284
Neg logit contributions
ocol-0.764
igs-0.626
icer-0.623
 degrees-0.620
ions-0.611
Token who
Token ID508
Feature activation+1.62e-01

Pos logit contributions
ocol+0.166
icer+0.136
ions+0.134
igs+0.134
ipping+0.133
Neg logit contributions
 said-0.094
 asked-0.088
 know-0.088
 say-0.087
 understand-0.085
Token began
Token ID2540
Feature activation+6.12e-02

Pos logit contributions
 mother+0.194
 say+0.186
 understand+0.180
 father+0.177
 said+0.177
Neg logit contributions
ocol-0.369
ions-0.313
icer-0.310
mm-0.299
igs-0.296
Token to
Token ID284
Feature activation+8.87e-02

Pos logit contributions
 said+0.037
 asked+0.032
 say+0.028
 mother+0.024
 lost+0.023
Neg logit contributions
ocol-0.202
igs-0.174
 degrees-0.172
icer-0.172
ipping-0.169
Token explain
Token ID4727
Feature activation+1.03e-01

Pos logit contributions
 said+0.105
 asked+0.095
 mother+0.087
 say+0.086
 lost+0.082
Neg logit contributions
ocol-0.238
igs-0.195
 degrees-0.192
icer-0.191
arine-0.189
Token was
Token ID373
Feature activation-2.06e-02

Pos logit contributions
ocol+0.106
igs+0.089
 degrees+0.088
ipping+0.087
icer+0.086
Neg logit contributions
 said-0.048
 asked-0.046
 say-0.043
 mother-0.042
 know-0.040
Token filled
Token ID5901
Feature activation-1.98e-02

Pos logit contributions
ocol+0.052
igs+0.042
 degrees+0.042
arine+0.041
ions+0.041
Neg logit contributions
 said-0.026
 asked-0.024
 say-0.024
 mother-0.023
 know-0.023
Token with
Token ID351
Feature activation-7.61e-02

Pos logit contributions
ocol+0.052
igs+0.043
 degrees+0.042
icer+0.042
ipping+0.041
Neg logit contributions
 said-0.025
 asked-0.024
 say-0.022
 mother-0.021
 lost-0.021
Token delicious
Token ID12625
Feature activation-6.38e-02

Pos logit contributions
ocol+0.188
igs+0.147
 degrees+0.145
ions+0.144
icer+0.141
Neg logit contributions
 said-0.105
 asked-0.102
 say-0.094
 mother-0.092
 know-0.089
Token ch
Token ID442
Feature activation-1.84e-02

Pos logit contributions
ocol+0.133
icer+0.107
igs+0.107
 degrees+0.104
arine+0.103
Neg logit contributions
 said-0.114
 asked-0.109
 say-0.105
 mother-0.100
 lost-0.099
Tokenocol
Token ID4668
Feature activation-4.63e-01

Pos logit contributions
mm+0.009
ocol+0.009
Too+0.009
icer+0.008
arine+0.008
Neg logit contributions
 said-0.051
 asked-0.050
 know-0.050
 smile-0.050
 say-0.049
Tokenates
Token ID689
Feature activation-1.16e-01

Pos logit contributions
ocol+0.638
Too+0.603
 algae+0.481
abouts+0.473
 pleasures+0.472
Neg logit contributions
 said-1.012
 asked-0.966
 mother-0.894
 answered-0.894
 dad-0.888
Token.
Token ID13
Feature activation+2.68e-02

Pos logit contributions
ocol+0.309
 degrees+0.256
igs+0.255
ions+0.251
ipping+0.250
Neg logit contributions
 said-0.134
 asked-0.131
 say-0.119
 mother-0.116
 know-0.115
Token He
Token ID679
Feature activation-1.43e-02

Pos logit contributions
 said+0.036
 asked+0.034
 say+0.033
 mother+0.031
 lost+0.031
Neg logit contributions
ocol-0.068
igs-0.056
icer-0.055
 degrees-0.055
ipping-0.054
Token was
Token ID373
Feature activation-2.27e-02

Pos logit contributions
ocol+0.037
igs+0.032
 degrees+0.031
 centimeters+0.030
ions+0.030
Neg logit contributions
 say-0.012
 understand-0.012
 know-0.012
 said-0.011
 mother-0.011
Token happy
Token ID3772
Feature activation+4.45e-02

Pos logit contributions
ocol+0.053
 centimeters+0.043
ions+0.042
ipping+0.042
 degrees+0.042
Neg logit contributions
 said-0.028
 say-0.026
 asked-0.026
 mother-0.026
 write-0.026
Token not
Token ID407
Feature activation-2.36e-03

Pos logit contributions
 said+0.167
 asked+0.155
 say+0.143
 mother+0.132
 understand+0.129
Neg logit contributions
ocol-0.494
igs-0.416
 degrees-0.411
icer-0.410
ipping-0.403
Token eat
Token ID4483
Feature activation-1.45e-01

Pos logit contributions
ocol+0.006
igs+0.005
arine+0.005
 degrees+0.005
icer+0.005
Neg logit contributions
 said-0.003
 asked-0.002
 mother-0.002
 say-0.002
 woman-0.002
Token too
Token ID1165
Feature activation-4.56e-02

Pos logit contributions
ocol+0.346
ipping+0.292
Too+0.292
icer+0.285
igs+0.284
Neg logit contributions
 said-0.215
 asked-0.202
 lost-0.193
 say-0.189
 big-0.188
Token many
Token ID867
Feature activation-1.96e-02

Pos logit contributions
ocol+0.107
icer+0.084
igs+0.084
 degrees+0.084
ipping+0.083
Neg logit contributions
 said-0.073
 asked-0.070
 say-0.065
 lost-0.064
 know-0.064
Token ch
Token ID442
Feature activation-2.26e-02

Pos logit contributions
ocol+0.043
igs+0.034
icer+0.034
 degrees+0.033
ipping+0.033
Neg logit contributions
 said-0.034
 asked-0.033
 say-0.031
 lost-0.029
 mother-0.029
Tokenocol
Token ID4668
Feature activation-4.63e-01

Pos logit contributions
ocol+0.006
Too+0.005
mm+0.004
icious+0.004
 degrees+0.004
Neg logit contributions
 asked-0.070
 said-0.070
 know-0.069
 smile-0.068
 lost-0.068
Tokenates
Token ID689
Feature activation-1.20e-01

Pos logit contributions
ocol+0.616
Too+0.516
 degrees+0.451
 algae+0.443
igs+0.427
Neg logit contributions
 said-1.064
 asked-1.031
 say-0.958
 dad-0.957
 friend-0.956
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
ocol+0.336
igs+0.281
icer+0.277
 degrees+0.276
ipping+0.271
Neg logit contributions
 said-0.134
 asked-0.124
 say-0.115
 mother-0.107
 lost-0.106
Token his
Token ID465
Feature activation-3.14e-02

Pos logit contributions
ocol+0.036
igs+0.030
 degrees+0.029
ions+0.028
arine+0.028
Neg logit contributions
 said-0.017
 asked-0.016
 mother-0.015
 say-0.015
 woman-0.014
Token friend
Token ID1545
Feature activation+1.58e-01

Pos logit contributions
ocol+0.080
igs+0.065
 degrees+0.064
icer+0.064
arine+0.062
Neg logit contributions
 said-0.041
 asked-0.040
 say-0.037
 understand-0.035
 know-0.035
Token learned
Token ID4499
Feature activation+6.55e-02

Pos logit contributions
 said+0.141
 asked+0.134
 say+0.133
 mother+0.127
 understand+0.121
Neg logit contributions
ocol-0.434
 degrees-0.368
igs-0.364
arine-0.357
ions-0.355
Token as
Token ID355
Feature activation-7.32e-02

Pos logit contributions
ocol+0.203
 degrees+0.172
igs+0.171
ipping+0.166
icer+0.165
Neg logit contributions
 said-0.082
 asked-0.079
 say-0.072
 mother-0.070
 know-0.068
Token he
Token ID339
Feature activation+5.06e-02

Pos logit contributions
ocol+0.181
igs+0.148
 degrees+0.146
ions+0.143
icer+0.141
Neg logit contributions
 said-0.101
 asked-0.096
 say-0.091
 know-0.086
 mother-0.086
Token opened
Token ID4721
Feature activation+2.75e-02

Pos logit contributions
 mother+0.045
 said+0.045
 say+0.044
 asked+0.042
 understand+0.042
Neg logit contributions
ocol-0.136
 degrees-0.117
igs-0.115
icer-0.113
ipping-0.112
Token the
Token ID262
Feature activation+1.50e-03

Pos logit contributions
 said+0.036
 asked+0.033
 say+0.032
 mother+0.030
 understand+0.030
Neg logit contributions
ocol-0.071
igs-0.059
 degrees-0.058
icer-0.058
ions-0.057
Token ch
Token ID442
Feature activation-1.57e-02

Pos logit contributions
 said+0.002
 asked+0.002
 understand+0.002
 know+0.002
 say+0.002
Neg logit contributions
ocol-0.004
icer-0.003
 degrees-0.003
igs-0.003
 centimeters-0.003
Tokenocol
Token ID4668
Feature activation-4.63e-01

Pos logit contributions
ocol+0.012
mm+0.007
icer+0.007
 degrees+0.007
arine+0.006
Neg logit contributions
 said-0.046
 asked-0.045
 say-0.044
 mother-0.043
 smile-0.043
Tokenates
Token ID689
Feature activation-1.19e-01

Pos logit contributions
ocol+0.587
Too+0.533
 algae+0.464
 degrees+0.446
 pleasures+0.439
Neg logit contributions
 said-1.033
 asked-1.000
 mother-0.929
 dad-0.929
 say-0.929
Token and
Token ID290
Feature activation-9.23e-03

Pos logit contributions
ocol+0.317
igs+0.262
 degrees+0.262
icer+0.260
ions+0.259
Neg logit contributions
 said-0.139
 asked-0.133
 say-0.123
 know-0.119
 mother-0.118
Token started
Token ID2067
Feature activation+4.50e-02

Pos logit contributions
ocol+0.023
 degrees+0.019
igs+0.019
arine+0.019
ions+0.018
Neg logit contributions
 said-0.009
 understand-0.009
 asked-0.009
 mother-0.009
 know-0.009
Token eating
Token ID6600
Feature activation-1.63e-01

Pos logit contributions
 said+0.052
 asked+0.049
 say+0.045
 mother+0.043
 lost+0.042
Neg logit contributions
ocol-0.123
igs-0.103
 degrees-0.101
icer-0.101
ipping-0.099
Token them
Token ID606
Feature activation-1.10e-01

Pos logit contributions
ocol+0.427
Too+0.366
icer+0.364
igs+0.363
ipping+0.361
Neg logit contributions
 said-0.211
 asked-0.190
 say-0.184
 mom-0.171
 big-0.171
Token find
Token ID1064
Feature activation+1.99e-02

Pos logit contributions
ocol+0.288
igs+0.242
icer+0.237
arine+0.230
 algae+0.224
Neg logit contributions
 said-0.146
 mother-0.141
 asked-0.140
 woman-0.138
 aunt-0.135
Token a
Token ID257
Feature activation+4.28e-02

Pos logit contributions
 said+0.028
 asked+0.028
 say+0.026
 understand+0.025
 write+0.025
Neg logit contributions
ocol-0.047
ions-0.039
igs-0.039
 degrees-0.037
icer-0.037
Token box
Token ID3091
Feature activation+3.72e-02

Pos logit contributions
 said+0.057
 asked+0.055
 say+0.051
 understand+0.048
 know+0.048
Neg logit contributions
ocol-0.107
igs-0.089
icer-0.087
 degrees-0.087
arine-0.085
Token of
Token ID286
Feature activation+8.22e-02

Pos logit contributions
 asked+0.047
 understand+0.046
 said+0.046
 know+0.046
 mother+0.045
Neg logit contributions
ocol-0.085
ions-0.074
igs-0.071
ipping-0.070
ips-0.069
Token ch
Token ID442
Feature activation-2.53e-02

Pos logit contributions
 said+0.140
 asked+0.135
 say+0.127
 mother+0.124
 understand+0.123
Neg logit contributions
ocol-0.181
igs-0.142
 degrees-0.138
icer-0.138
ions-0.135
Tokenocol
Token ID4668
Feature activation-4.60e-01

Pos logit contributions
Too+0.020
mm+0.020
icious+0.018
+0.018
 degrees+0.017
Neg logit contributions
 know-0.063
 open-0.063
 dad-0.063
 said-0.062
 big-0.062
Tokenates
Token ID689
Feature activation-1.13e-01

Pos logit contributions
ocol+0.675
Too+0.651
 pleasures+0.534
 degrees+0.532
 algae+0.528
Neg logit contributions
 said-0.941
 asked-0.887
 dad-0.871
 mother-0.856
 write-0.853
Token.
Token ID13
Feature activation+1.45e-03

Pos logit contributions
ocol+0.325
igs+0.273
 degrees+0.269
icer+0.269
ipping+0.264
Neg logit contributions
 said-0.116
 asked-0.108
 say-0.099
 mother-0.092
 understand-0.091
Token
Token ID198
Feature activation-1.73e-02

Pos logit contributions
 said+0.002
 asked+0.002
 say+0.001
 mom+0.001
 dad+0.001
Neg logit contributions
ocol-0.004
igs-0.003
 degrees-0.003
icer-0.003
ipping-0.003
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
ocol+0.046
igs+0.039
icer+0.038
 degrees+0.038
ions+0.038
Neg logit contributions
 said-0.019
 asked-0.018
 say-0.017
 mother-0.017
 lost-0.016
Token"
Token ID1
Feature activation-1.65e-01

Pos logit contributions
ocol+0.044
icer+0.034
ipping+0.034
 degrees+0.033
igs+0.033
Neg logit contributions
 said-0.029
 asked-0.028
 say-0.024
 saw-0.024
 know-0.023
Token was
Token ID373
Feature activation-2.02e-02

Pos logit contributions
ocol+0.098
igs+0.081
icer+0.081
ions+0.080
 degrees+0.079
Neg logit contributions
 said-0.043
 asked-0.042
 say-0.040
 understand-0.039
 know-0.038
Token a
Token ID257
Feature activation+6.23e-02

Pos logit contributions
ocol+0.053
 degrees+0.045
ions+0.045
igs+0.044
 centimeters+0.044
Neg logit contributions
 said-0.022
 asked-0.021
 say-0.021
 know-0.019
 understand-0.019
Token box
Token ID3091
Feature activation+4.40e-02

Pos logit contributions
 said+0.069
 asked+0.067
 say+0.060
 know+0.059
 understand+0.057
Neg logit contributions
ocol-0.162
 degrees-0.136
igs-0.134
 centimeters-0.133
icer-0.132
Token of
Token ID286
Feature activation+9.31e-02

Pos logit contributions
 know+0.055
 asked+0.054
 said+0.054
 understand+0.053
 mother+0.052
Neg logit contributions
ocol-0.104
ipping-0.087
ions-0.086
 centimeters-0.083
igs-0.082
Token ch
Token ID442
Feature activation-1.79e-02

Pos logit contributions
 said+0.143
 asked+0.141
 say+0.130
 know+0.128
 understand+0.127
Neg logit contributions
ocol-0.218
igs-0.166
ions-0.166
 degrees-0.163
icer-0.159
Tokenocol
Token ID4668
Feature activation-4.59e-01

Pos logit contributions
mm+0.008
ocol+0.007
arine+0.006
Too+0.006
icer+0.005
Neg logit contributions
 said-0.053
 asked-0.052
 know-0.051
 open-0.051
 say-0.051
Tokenates
Token ID689
Feature activation-1.09e-01

Pos logit contributions
ocol+0.638
Too+0.609
 algae+0.521
 pleasures+0.514
 salsa+0.480
Neg logit contributions
 said-0.967
 asked-0.926
 know-0.864
 mother-0.862
 answered-0.861
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
ocol+0.287
 degrees+0.236
igs+0.236
ions+0.232
ipping+0.232
Neg logit contributions
 said-0.131
 asked-0.127
 say-0.118
 mother-0.113
 know-0.113
Token "
Token ID366
Feature activation-9.88e-02

Pos logit contributions
 said+0.024
 asked+0.023
 say+0.021
 mother+0.020
 know+0.019
Neg logit contributions
ocol-0.052
igs-0.043
 degrees-0.042
icer-0.042
ipping-0.041
TokenFor
Token ID1890
Feature activation-9.04e-02

Pos logit contributions
ocol+0.216
igs+0.185
 degrees+0.172
ipping+0.168
icer+0.162
Neg logit contributions
 said-0.172
 asked-0.171
 say-0.158
 understand-0.153
 write-0.150
Token me
Token ID502
Feature activation+5.38e-03

Pos logit contributions
ocol+0.236
igs+0.196
icer+0.195
 degrees+0.193
ipping+0.191
Neg logit contributions
 said-0.118
 asked-0.112
 say-0.105
 know-0.098
 mother-0.097
Token box
Token ID3091
Feature activation+3.57e-02

Pos logit contributions
ocol+0.055
icer+0.046
igs+0.046
 degrees+0.045
ions+0.044
Neg logit contributions
 said-0.024
 asked-0.024
 say-0.021
 understand-0.021
 know-0.020
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
 asked+0.045
 said+0.044
 understand+0.043
 know+0.043
 say+0.043
Neg logit contributions
ocol-0.080
ips-0.069
ions-0.069
igs-0.069
ipping-0.068
Token share
Token ID2648
Feature activation-5.92e-02

Pos logit contributions
ocol+0.054
igs+0.044
 algae+0.043
arine+0.042
icer+0.042
Neg logit contributions
 said-0.020
 asked-0.020
 friend-0.019
 mother-0.019
 aunt-0.018
Token the
Token ID262
Feature activation-2.03e-02

Pos logit contributions
ocol+0.175
igs+0.147
 degrees+0.145
icer+0.145
ipping+0.144
Neg logit contributions
 said-0.061
 asked-0.056
 say-0.051
 mother-0.046
 lost-0.046
Token ch
Token ID442
Feature activation-2.55e-02

Pos logit contributions
ocol+0.052
igs+0.043
 degrees+0.042
icer+0.042
ipping+0.041
Neg logit contributions
 said-0.027
 asked-0.026
 say-0.024
 mother-0.023
 lost-0.023
Tokenocol
Token ID4668
Feature activation-4.59e-01

Pos logit contributions
ocol+0.009
Too+0.009
+0.008
 degrees+0.008
mm+0.006
Neg logit contributions
 know-0.074
 said-0.073
 say-0.073
 write-0.072
 friend-0.072
Tokenates
Token ID689
Feature activation-1.14e-01

Pos logit contributions
ocol+0.669
Too+0.637
 algae+0.536
 degrees+0.535
ipping+0.534
Neg logit contributions
 said-0.911
 asked-0.854
 write-0.846
 dad-0.840
 mother-0.834
Token.
Token ID13
Feature activation+2.15e-03

Pos logit contributions
ocol+0.314
igs+0.265
 degrees+0.261
icer+0.260
ions+0.258
Neg logit contributions
 said-0.121
 asked-0.117
 say-0.106
 mother-0.101
 lost-0.100
Token They
Token ID1119
Feature activation-1.20e-01

Pos logit contributions
 said+0.003
 asked+0.003
 say+0.003
 mother+0.003
 know+0.003
Neg logit contributions
ocol-0.005
icer-0.004
igs-0.004
 degrees-0.004
ipping-0.004
Token are
Token ID389
Feature activation-5.49e-02

Pos logit contributions
ocol+0.276
igs+0.228
icer+0.221
arine+0.218
 algae+0.216
Neg logit contributions
 mother-0.143
 woman-0.143
 said-0.142
 asked-0.140
 aunt-0.137
Token happy
Token ID3772
Feature activation+4.01e-02

Pos logit contributions
ocol+0.133
igs+0.118
 centimeters+0.115
 degrees+0.111
 algae+0.110
Neg logit contributions
 said-0.059
 asked-0.057
 understand-0.054
 say-0.053
 write-0.053
Token she
Token ID673
Feature activation+7.40e-02

Pos logit contributions
ocol+0.133
igs+0.108
 degrees+0.107
icer+0.106
ipping+0.105
Neg logit contributions
 said-0.073
 asked-0.069
 say-0.067
 understand-0.064
 mother-0.063
Token paid
Token ID3432
Feature activation-7.27e-02

Pos logit contributions
 understand+0.067
 know+0.062
 spirit+0.060
 say+0.059
 woman+0.057
Neg logit contributions
ocol-0.176
 degrees-0.156
igs-0.151
ar-0.150
 centimeters-0.147
Token for
Token ID329
Feature activation-6.07e-02

Pos logit contributions
ocol+0.177
 degrees+0.144
igs+0.143
ions+0.140
ipping+0.137
Neg logit contributions
 said-0.098
 asked-0.097
 say-0.088
 lost-0.088
 understand-0.087
Token the
Token ID262
Feature activation-7.65e-03

Pos logit contributions
ocol+0.182
igs+0.158
icer+0.156
ipping+0.155
 degrees+0.153
Neg logit contributions
 said-0.057
 asked-0.054
 say-0.048
 dad-0.046
 mom-0.045
Token ch
Token ID442
Feature activation-2.03e-02

Pos logit contributions
ocol+0.018
 degrees+0.015
icer+0.015
igs+0.014
arine+0.014
Neg logit contributions
 said-0.011
 asked-0.011
 say-0.010
 know-0.010
 understand-0.010
Tokenocol
Token ID4668
Feature activation-4.59e-01

Pos logit contributions
ocol+0.001
Too-0.002
mm-0.002
 algae-0.002
abouts-0.002
Neg logit contributions
 said-0.067
 asked-0.066
 say-0.065
 know-0.065
 smile-0.065
Tokenates
Token ID689
Feature activation-1.13e-01

Pos logit contributions
ocol+0.630
Too+0.557
 algae+0.505
abouts+0.476
 degrees+0.470
Neg logit contributions
 said-0.993
 asked-0.944
 mother-0.911
 dad-0.902
 say-0.898
Token and
Token ID290
Feature activation-6.12e-03

Pos logit contributions
ocol+0.300
 degrees+0.248
igs+0.247
ions+0.246
icer+0.243
Neg logit contributions
 said-0.130
 asked-0.127
 say-0.115
 know-0.111
 mother-0.110
Token the
Token ID262
Feature activation-8.47e-03

Pos logit contributions
ocol+0.016
 degrees+0.014
igs+0.014
ions+0.014
mm+0.013
Neg logit contributions
 understand-0.005
 said-0.005
 know-0.005
 asked-0.005
 say-0.005
Token two
Token ID734
Feature activation+1.39e-01

Pos logit contributions
ocol+0.020
icer+0.016
 degrees+0.016
igs+0.015
arine+0.015
Neg logit contributions
 said-0.013
 asked-0.012
 say-0.012
 know-0.011
 understand-0.011
Token of
Token ID286
Feature activation+9.13e-02

Pos logit contributions
 said+0.158
 asked+0.151
 say+0.144
 understand+0.136
 mother+0.135
Neg logit contributions
ocol-0.377
igs-0.309
 degrees-0.308
icer-0.301
ipping-0.298
Token and
Token ID290
Feature activation-1.24e-03

Pos logit contributions
 know+0.056
 understand+0.055
 asked+0.054
 say+0.053
 parent+0.052
Neg logit contributions
ocol-0.094
ipping-0.082
ions-0.082
icer-0.080
 degrees-0.079
Token started
Token ID2067
Feature activation+4.32e-02

Pos logit contributions
ocol+0.003
icer+0.003
ions+0.003
arine+0.002
 degrees+0.002
Neg logit contributions
 understand-0.001
 know-0.001
 parent-0.001
 say-0.001
 aunt-0.001
Token eating
Token ID6600
Feature activation-1.61e-01

Pos logit contributions
 said+0.035
 asked+0.033
 say+0.029
 mother+0.026
 lost+0.026
Neg logit contributions
ocol-0.134
igs-0.114
icer-0.113
 degrees-0.113
mm-0.111
Token the
Token ID262
Feature activation-1.03e-03

Pos logit contributions
ocol+0.507
Too+0.441
igs+0.438
rim+0.436
ipping+0.434
Neg logit contributions
 said-0.143
 asked-0.119
 say-0.107
 big-0.106
 mom-0.104
Token ch
Token ID442
Feature activation-1.68e-02

Pos logit contributions
ocol+0.002
icer+0.002
 degrees+0.002
igs+0.002
 centimeters+0.002
Neg logit contributions
 said-0.002
 asked-0.002
 say-0.002
 mother-0.002
 understand-0.002
Tokenocol
Token ID4668
Feature activation-4.58e-01

Pos logit contributions
mm+0.001
Too+0.001
ocol+0.001
abouts+0.000
Sp+0.000
Neg logit contributions
 said-0.051
 know-0.051
 say-0.050
 smile-0.050
 open-0.050
Tokenates
Token ID689
Feature activation-1.10e-01

Pos logit contributions
ocol+0.631
Too+0.591
 algae+0.512
 degrees+0.497
 pleasures+0.490
Neg logit contributions
 said-0.959
 asked-0.908
 mother-0.866
 dad-0.861
 write-0.856
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
ocol+0.305
igs+0.255
 degrees+0.253
icer+0.250
ions+0.249
Neg logit contributions
 said-0.117
 asked-0.112
 say-0.103
 mother-0.098
 understand-0.096
Token The
Token ID383
Feature activation-6.17e-02

Pos logit contributions
 said+0.026
 asked+0.025
 say+0.023
 mother+0.022
 know+0.022
Neg logit contributions
ocol-0.047
igs-0.039
 degrees-0.038
icer-0.038
ipping-0.037
Token shop
Token ID6128
Feature activation+3.92e-02

Pos logit contributions
ocol+0.137
igs+0.108
icer+0.107
 degrees+0.107
arine+0.103
Neg logit contributions
 said-0.103
 asked-0.099
 say-0.094
 understand-0.091
 know-0.091
Tokenkeeper
Token ID13884
Feature activation+1.99e-01

Pos logit contributions
 understand+0.040
 know+0.040
 say+0.039
 tell+0.036
 asked+0.033
Neg logit contributions
ocol-0.096
icer-0.084
 degrees-0.083
ar-0.082
 centimeters-0.082
Tokenoll
Token ID692
Feature activation-3.28e-02

Pos logit contributions
ocol+0.252
igs+0.197
 degrees+0.193
icer+0.193
arine+0.185
Neg logit contributions
 said-0.244
 asked-0.234
 say-0.223
 lost-0.216
 understand-0.215
Tokenip
Token ID541
Feature activation-1.30e-01

Pos logit contributions
ocol+0.051
igs+0.035
icer+0.035
 degrees+0.035
arine+0.034
Neg logit contributions
 said-0.078
 asked-0.074
 say-0.072
 lost-0.070
 mother-0.069
Tokenops
Token ID2840
Feature activation-1.90e-01

Pos logit contributions
ocol+0.161
igs+0.097
icer+0.094
 degrees+0.091
ions+0.090
Neg logit contributions
 said-0.358
 asked-0.343
 say-0.334
 lost-0.325
 mother-0.319
Token,
Token ID11
Feature activation-9.42e-02

Pos logit contributions
ocol+0.509
igs+0.422
 degrees+0.415
icer+0.414
ipping+0.407
Neg logit contributions
 said-0.230
 asked-0.217
 say-0.202
 mother-0.191
 understand-0.187
Token ch
Token ID442
Feature activation-2.27e-02

Pos logit contributions
ocol+0.245
igs+0.204
icer+0.199
 degrees+0.197
ipping+0.196
Neg logit contributions
 said-0.130
 asked-0.121
 say-0.114
 mother-0.106
 lost-0.105
Tokenocol
Token ID4668
Feature activation-4.58e-01

Pos logit contributions
ocol+0.011
mm+0.009
 pleasures+0.008
 degrees+0.008
+0.008
Neg logit contributions
 said-0.066
 asked-0.064
 smile-0.062
 know-0.062
 big-0.061
Tokenates
Token ID689
Feature activation-1.18e-01

Pos logit contributions
ocol+0.639
Too+0.606
 pleasures+0.526
 algae+0.517
Time+0.509
Neg logit contributions
 said-0.955
 asked-0.911
 friend-0.842
 lost-0.836
 wrote-0.835
Token,
Token ID11
Feature activation-9.66e-02

Pos logit contributions
ocol+0.328
igs+0.274
icer+0.270
 degrees+0.268
ipping+0.263
Neg logit contributions
 said-0.136
 asked-0.125
 say-0.116
 lost-0.107
 mother-0.106
Token g
Token ID308
Feature activation-3.68e-02

Pos logit contributions
ocol+0.275
igs+0.233
icer+0.228
 degrees+0.225
ipping+0.225
Neg logit contributions
 said-0.111
 asked-0.101
 say-0.096
 mother-0.083
 lost-0.083
Tokenummies
Token ID39578
Feature activation-8.64e-02

Pos logit contributions
ocol+0.061
Too+0.052
ions+0.052
arine+0.051
icer+0.050
Neg logit contributions
 said-0.071
 asked-0.068
 dad-0.063
 mother-0.063
 say-0.063
Token,
Token ID11
Feature activation-9.51e-02

Pos logit contributions
ocol+0.239
igs+0.200
icer+0.197
 degrees+0.196
ipping+0.192
Neg logit contributions
 said-0.098
 asked-0.092
 say-0.085
 mother-0.079
 lost-0.078
Token son
Token ID3367
Feature activation+1.65e-01

Pos logit contributions
ocol+0.286
 degrees+0.232
icer+0.229
igs+0.229
arine+0.220
Neg logit contributions
 said-0.158
 asked-0.152
 say-0.142
 understand-0.137
 know-0.134
Token is
Token ID318
Feature activation-5.64e-02

Pos logit contributions
 said+0.214
 asked+0.208
 mother+0.200
 understand+0.199
 say+0.197
Neg logit contributions
ocol-0.390
 degrees-0.321
icer-0.315
igs-0.313
arine-0.312
Token eating
Token ID6600
Feature activation-1.60e-01

Pos logit contributions
ocol+0.140
igs+0.114
 degrees+0.113
icer+0.112
arine+0.110
Neg logit contributions
 said-0.078
 asked-0.073
 say-0.070
 mother-0.067
 understand-0.066
Token our
Token ID674
Feature activation-1.44e-01

Pos logit contributions
ocol+0.443
ipping+0.381
igs+0.380
Too+0.372
ips+0.371
Neg logit contributions
 said-0.180
 asked-0.162
 mom-0.148
 big-0.148
 say-0.148
Token ch
Token ID442
Feature activation-1.56e-02

Pos logit contributions
ocol+0.310
igs+0.246
 degrees+0.238
ions+0.236
ipping+0.236
Neg logit contributions
 said-0.254
 asked-0.242
 say-0.232
 mother-0.221
 lost-0.220
Tokenocol
Token ID4668
Feature activation-4.57e-01

Pos logit contributions
Too+0.001
ocol+0.001
abouts+0.000
 algae+0.000
mm+0.000
Neg logit contributions
 said-0.049
 know-0.049
 asked-0.049
 smile-0.048
 say-0.048
Tokenates
Token ID689
Feature activation-1.11e-01

Pos logit contributions
ocol+0.613
Too+0.524
 algae+0.469
 degrees+0.443
abouts+0.439
Neg logit contributions
 said-1.031
 asked-0.984
 mother-0.939
 dad-0.932
 say-0.931
Token!"
Token ID2474
Feature activation+4.37e-02

Pos logit contributions
ocol+0.278
igs+0.227
 degrees+0.224
icer+0.223
ipping+0.218
Neg logit contributions
 said-0.153
 asked-0.146
 say-0.137
 mother-0.131
 understand-0.129
Token
Token ID220
Feature activation-1.19e-02

Pos logit contributions
 said+0.054
 asked+0.051
 say+0.047
 mother+0.044
 understand+0.044
Neg logit contributions
ocol-0.118
igs-0.098
 degrees-0.096
icer-0.096
ipping-0.094
Token
Token ID198
Feature activation+7.58e-03

Pos logit contributions
ocol+0.033
igs+0.028
icer+0.027
 degrees+0.027
ipping+0.027
Neg logit contributions
 said-0.013
 asked-0.012
 say-0.011
 mother-0.011
 lost-0.010
Token
Token ID198
Feature activation+7.52e-03

Pos logit contributions
 said+0.011
 asked+0.011
 say+0.010
 mother+0.010
 understand+0.010
Neg logit contributions
ocol-0.018
igs-0.015
 degrees-0.015
icer-0.015
ipping-0.014
Token a
Token ID257
Feature activation+5.83e-02

Pos logit contributions
 say+0.078
 said+0.077
 asked+0.077
 know+0.076
 understand+0.074
Neg logit contributions
ocol-0.134
ions-0.112
igs-0.109
 degrees-0.108
 algae-0.107
Token big
Token ID1263
Feature activation+6.55e-02

Pos logit contributions
 said+0.059
 asked+0.057
 say+0.052
 know+0.051
 understand+0.049
Neg logit contributions
ocol-0.160
icer-0.134
 degrees-0.134
igs-0.133
 centimeters-0.130
Token box
Token ID3091
Feature activation+4.73e-02

Pos logit contributions
 said+0.086
 asked+0.081
 say+0.076
 mother+0.072
 lost+0.071
Neg logit contributions
ocol-0.169
igs-0.140
 degrees-0.138
icer-0.137
ions-0.134
Token of
Token ID286
Feature activation+9.06e-02

Pos logit contributions
 know+0.058
 said+0.057
 mother+0.057
 understand+0.056
 asked+0.055
Neg logit contributions
ocol-0.113
ipping-0.095
ions-0.094
 centimeters-0.091
igs-0.090
Token ch
Token ID442
Feature activation-1.59e-02

Pos logit contributions
 said+0.156
 asked+0.151
 say+0.143
 know+0.141
 understand+0.140
Neg logit contributions
ocol-0.198
igs-0.148
ions-0.145
icer-0.145
 degrees-0.144
Tokenocol
Token ID4668
Feature activation-4.56e-01

Pos logit contributions
mm+0.012
Too+0.010
 degrees+0.009
arine+0.009
icious+0.009
Neg logit contributions
 open-0.041
 said-0.041
 lost-0.041
 asked-0.041
 dad-0.040
Tokenates
Token ID689
Feature activation-1.08e-01

Pos logit contributions
ocol+0.628
Too+0.584
 degrees+0.493
 pleasures+0.490
 algae+0.487
Neg logit contributions
 said-0.974
 asked-0.937
 dad-0.892
 friend-0.879
 mother-0.878
Token.
Token ID13
Feature activation+1.84e-02

Pos logit contributions
ocol+0.303
igs+0.252
 degrees+0.252
icer+0.249
ions+0.247
Neg logit contributions
 said-0.114
 asked-0.108
 say-0.100
 mother-0.095
 know-0.094
Token It
Token ID632
Feature activation-5.88e-02

Pos logit contributions
 said+0.025
 asked+0.024
 say+0.022
 mother+0.021
 know+0.020
Neg logit contributions
ocol-0.048
igs-0.040
 degrees-0.039
ipping-0.038
icer-0.038
Token was
Token ID373
Feature activation-2.85e-02

Pos logit contributions
ocol+0.134
icer+0.117
 degrees+0.117
igs+0.116
ions+0.114
Neg logit contributions
 understand-0.072
 say-0.065
 know-0.064
 dad-0.063
 father-0.062
Token so
Token ID523
Feature activation+8.06e-02

Pos logit contributions
ocol+0.082
igs+0.068
 degrees+0.068
icer+0.067
ions+0.067
Neg logit contributions
 said-0.029
 asked-0.027
 say-0.025
 understand-0.023
 know-0.023
Token a
Token ID257
Feature activation+6.49e-02

Pos logit contributions
ocol+0.200
igs+0.162
ions+0.162
 degrees+0.161
icer+0.159
Neg logit contributions
 said-0.082
 asked-0.078
 say-0.075
 understand-0.069
 know-0.069
Token big
Token ID1263
Feature activation+6.75e-02

Pos logit contributions
 asked+0.073
 said+0.073
 say+0.063
 know+0.058
 write+0.056
Neg logit contributions
ocol-0.164
 degrees-0.136
igs-0.132
-0.130
icer-0.130
Token box
Token ID3091
Feature activation+4.75e-02

Pos logit contributions
 said+0.054
 asked+0.051
 say+0.044
 understand+0.040
 know+0.040
Neg logit contributions
ocol-0.208
igs-0.174
icer-0.172
 degrees-0.172
ipping-0.170
Token of
Token ID286
Feature activation+9.81e-02

Pos logit contributions
 know+0.056
 said+0.055
 asked+0.054
 understand+0.053
 say+0.052
Neg logit contributions
ocol-0.115
ions-0.097
ipping-0.097
 degrees-0.095
igs-0.092
Token ch
Token ID442
Feature activation-1.23e-02

Pos logit contributions
 said+0.159
 asked+0.158
 mother+0.145
 say+0.144
 know+0.143
Neg logit contributions
ocol-0.222
igs-0.162
ions-0.161
 degrees-0.159
icer-0.156
Tokenocol
Token ID4668
Feature activation-4.56e-01

Pos logit contributions
mm+0.004
ocol+0.004
Too+0.003
arine+0.002
icer+0.002
Neg logit contributions
 said-0.038
 asked-0.037
 smile-0.036
 say-0.036
 know-0.036
Tokenates
Token ID689
Feature activation-1.08e-01

Pos logit contributions
ocol+0.633
Too+0.602
 pleasures+0.496
 algae+0.492
abouts+0.475
Neg logit contributions
 said-0.975
 asked-0.937
 mother-0.884
 answered-0.881
 dad-0.875
Token!
Token ID0
Feature activation-3.64e-02

Pos logit contributions
ocol+0.284
 degrees+0.237
ions+0.236
igs+0.233
ipping+0.232
Neg logit contributions
 said-0.122
 asked-0.121
 say-0.113
 know-0.112
 understand-0.109
Token<|endoftext|>
Token ID50256
Feature activation+1.13e-02

Pos logit contributions
ocol+0.092
icer+0.076
+0.075
ions+0.075
igs+0.075
Neg logit contributions
 said-0.045
 asked-0.042
 say-0.041
 lost-0.040
 mother-0.039
TokenOnce
Token ID7454
Feature activation-2.02e-02

Pos logit contributions
 said+0.020
 asked+0.019
 say+0.018
 lost+0.017
 know+0.017
Neg logit contributions
ocol-0.025
igs-0.020
 degrees-0.020
icer-0.019
ipping-0.019
Token upon
Token ID2402
Feature activation-6.80e-02

Pos logit contributions
ocol+0.032
igs+0.026
icer+0.026
ions+0.023
ips+0.023
Neg logit contributions
 write-0.040
 wrote-0.039
 address-0.039
 whisper-0.038
 say-0.038
Token colorful
Token ID20239
Feature activation+2.42e-02

Pos logit contributions
ocol+0.031
igs+0.026
 degrees+0.025
icer+0.025
 centimeters+0.024
Neg logit contributions
 said-0.018
 asked-0.017
 say-0.016
 mother-0.016
 understand-0.016
Token cand
Token ID2658
Feature activation-1.66e-01

Pos logit contributions
 said+0.041
 asked+0.039
 say+0.037
 mother+0.036
 lost+0.035
Neg logit contributions
ocol-0.053
igs-0.042
icer-0.042
 degrees-0.042
arine-0.041
Tokenies
Token ID444
Feature activation-2.35e-03

Pos logit contributions
ocol+0.369
igs+0.294
 degrees+0.287
icer+0.286
ipping+0.279
Neg logit contributions
 said-0.278
 asked-0.267
 say-0.255
 mother-0.244
 understand-0.242
Token and
Token ID290
Feature activation+3.86e-03

Pos logit contributions
ocol+0.006
igs+0.005
 degrees+0.005
ipping+0.005
ions+0.005
Neg logit contributions
 said-0.003
 asked-0.003
 say-0.003
 know-0.003
 understand-0.002
Token ch
Token ID442
Feature activation-1.54e-02

Pos logit contributions
 understand+0.004
 say+0.004
 know+0.004
 said+0.004
 tell+0.004
Neg logit contributions
ocol-0.009
 degrees-0.008
igs-0.008
 algae-0.008
 centimeters-0.007
Tokenocol
Token ID4668
Feature activation-4.56e-01

Pos logit contributions
ocol+0.008
Too+0.006
mm+0.005
 degrees+0.005
icer+0.005
Neg logit contributions
 said-0.047
 asked-0.047
 know-0.044
 say-0.044
 smile-0.044
Tokenates
Token ID689
Feature activation-1.11e-01

Pos logit contributions
ocol+0.628
Too+0.577
 pleasures+0.472
 algae+0.465
 degrees+0.462
Neg logit contributions
 said-1.023
 asked-0.985
 dad-0.901
 answered-0.900
 mother-0.897
Token.
Token ID13
Feature activation+2.61e-02

Pos logit contributions
ocol+0.294
 degrees+0.243
igs+0.243
ions+0.241
ipping+0.236
Neg logit contributions
 said-0.120
 asked-0.118
 say-0.112
 know-0.108
 mother-0.108
Token She
Token ID1375
Feature activation+3.11e-02

Pos logit contributions
 said+0.030
 asked+0.028
 say+0.026
 mother+0.025
 understand+0.024
Neg logit contributions
ocol-0.072
igs-0.060
 degrees-0.059
icer-0.059
ipping-0.058
Token said
Token ID531
Feature activation+2.64e-01

Pos logit contributions
 say+0.023
 understand+0.023
 know+0.021
 pilot+0.020
 answer+0.019
Neg logit contributions
ocol-0.080
igs-0.071
 degrees-0.069
Too-0.069
 centimeters-0.069
Token to
Token ID284
Feature activation+7.20e-02

Pos logit contributions
 said+0.328
 asked+0.308
 say+0.285
 mother+0.271
 understand+0.264
Neg logit contributions
ocol-0.710
igs-0.589
icer-0.584
 degrees-0.580
ipping-0.567
Token looking
Token ID2045
Feature activation-7.42e-02

Pos logit contributions
ocol+0.392
icer+0.351
igs+0.336
orns+0.333
rim+0.331
Neg logit contributions
 said-0.145
 asked-0.138
 mom-0.123
 dad-0.120
 say-0.113
Token at
Token ID379
Feature activation+3.82e-02

Pos logit contributions
ocol+0.182
igs+0.153
ipping+0.147
 degrees+0.142
ions+0.139
Neg logit contributions
 said-0.098
 asked-0.093
 say-0.090
 write-0.089
 know-0.085
Token all
Token ID477
Feature activation+1.40e-01

Pos logit contributions
 said+0.043
 asked+0.041
 say+0.038
 understand+0.036
 know+0.035
Neg logit contributions
ocol-0.105
igs-0.088
icer-0.087
 degrees-0.086
ions-0.085
Token the
Token ID262
Feature activation+4.93e-03

Pos logit contributions
 said+0.150
 asked+0.141
 say+0.133
 understand+0.127
 lost+0.126
Neg logit contributions
ocol-0.391
igs-0.326
ions-0.322
icer-0.319
 degrees-0.318
Token ch
Token ID442
Feature activation-1.25e-02

Pos logit contributions
 said+0.009
 asked+0.008
 say+0.008
 mother+0.008
 lost+0.008
Neg logit contributions
ocol-0.011
igs-0.008
 degrees-0.008
icer-0.008
ions-0.008
Tokenocol
Token ID4668
Feature activation-4.54e-01

Pos logit contributions
Sp+0.003
Too+0.002
AAAA+0.002
 algae+0.002
abouts+0.001
Neg logit contributions
 said-0.037
 know-0.037
 asked-0.037
 smile-0.036
 say-0.036
Tokenates
Token ID689
Feature activation-1.05e-01

Pos logit contributions
ocol+0.672
Too+0.644
 algae+0.552
 pleasures+0.535
 degrees+0.526
Neg logit contributions
 said-0.912
 asked-0.877
 answered-0.825
 dad-0.815
 wrote-0.803
Token,
Token ID11
Feature activation-7.66e-02

Pos logit contributions
ocol+0.281
igs+0.233
 degrees+0.230
icer+0.228
ions+0.225
Neg logit contributions
 said-0.126
 asked-0.120
 say-0.112
 mother-0.106
 understand-0.105
Token Lex
Token ID17210
Feature activation+5.64e-02

Pos logit contributions
ocol+0.188
igs+0.153
 degrees+0.150
icer+0.149
ipping+0.146
Neg logit contributions
 said-0.110
 asked-0.105
 say-0.100
 mother-0.094
 understand-0.094
Tokeni
Token ID72
Feature activation+3.46e-02

Pos logit contributions
 said+0.100
 asked+0.096
 say+0.095
 understand+0.092
 know+0.091
Neg logit contributions
ocol-0.118
igs-0.092
 degrees-0.091
icer-0.090
ipping-0.087
Token chose
Token ID7690
Feature activation+1.73e-02

Pos logit contributions
 say+0.026
 know+0.025
 understand+0.024
 said+0.023
 mother+0.023
Neg logit contributions
ocol-0.096
 degrees-0.085
igs-0.085
ipping-0.083
icer-0.083
Token It
Token ID632
Feature activation-5.95e-02

Pos logit contributions
 said+0.032
 asked+0.031
 say+0.029
 mother+0.028
 know+0.027
Neg logit contributions
ocol-0.056
igs-0.046
 degrees-0.045
icer-0.045
ipping-0.044
Token was
Token ID373
Feature activation-2.78e-02

Pos logit contributions
ocol+0.145
 degrees+0.126
igs+0.123
icer+0.121
Too+0.120
Neg logit contributions
 said-0.065
 say-0.064
 asked-0.063
 dad-0.061
 understand-0.061
Token sweet
Token ID6029
Feature activation-1.35e-02

Pos logit contributions
ocol+0.076
igs+0.063
icer+0.062
 degrees+0.062
ipping+0.061
Neg logit contributions
 said-0.033
 asked-0.031
 say-0.028
 mother-0.027
 lost-0.026
Token,
Token ID11
Feature activation-8.49e-02

Pos logit contributions
ocol+0.040
igs+0.034
icer+0.034
 degrees+0.033
arine+0.033
Neg logit contributions
 said-0.014
 asked-0.012
 say-0.011
 mother-0.010
 understand-0.010
Token ch
Token ID442
Feature activation-1.88e-02

Pos logit contributions
ocol+0.222
igs+0.184
icer+0.181
 degrees+0.179
ipping+0.178
Neg logit contributions
 said-0.115
 asked-0.107
 say-0.101
 mother-0.094
 lost-0.094
Tokenocol
Token ID4668
Feature activation-4.54e-01

Pos logit contributions
rob+0.025
drivers+0.023
corn+0.023
mm+0.023
arine+0.022
Neg logit contributions
 smile-0.036
 said-0.035
 open-0.034
 asked-0.033
 mother-0.033
Tokenat
Token ID265
Feature activation-1.01e-01

Pos logit contributions
ocol+0.686
Too+0.620
 pleasures+0.548
 algae+0.502
igs+0.489
Neg logit contributions
 said-0.973
 asked-0.909
 answered-0.867
 mother-0.850
 dad-0.848
Tokeny
Token ID88
Feature activation-8.99e-03

Pos logit contributions
ocol+0.253
igs+0.206
icer+0.203
 degrees+0.200
ions+0.196
Neg logit contributions
 said-0.144
 asked-0.134
 say-0.128
 mother-0.121
 understand-0.118
Token and
Token ID290
Feature activation-3.53e-03

Pos logit contributions
ocol+0.024
igs+0.020
 degrees+0.020
icer+0.020
ipping+0.020
Neg logit contributions
 said-0.010
 asked-0.010
 say-0.009
 mother-0.009
 understand-0.008
Token warm
Token ID5814
Feature activation-4.11e-03

Pos logit contributions
ocol+0.009
igs+0.007
 degrees+0.007
ions+0.007
mm+0.007
Neg logit contributions
 said-0.005
 asked-0.005
 say-0.004
 understand-0.004
 know-0.004
Token,
Token ID11
Feature activation-8.56e-02

Pos logit contributions
ocol+0.012
igs+0.010
icer+0.010
 degrees+0.009
ipping+0.009
Neg logit contributions
 said-0.005
 asked-0.004
 say-0.004
 mother-0.004
 lost-0.004
Token a
Token ID257
Feature activation+6.00e-02

Pos logit contributions
 said+0.091
 asked+0.086
 say+0.083
 mother+0.078
 understand+0.077
Neg logit contributions
ocol-0.194
igs-0.161
 degrees-0.160
icer-0.158
ions-0.156
Token large
Token ID1588
Feature activation+2.72e-02

Pos logit contributions
 said+0.083
 asked+0.080
 say+0.074
 know+0.072
 understand+0.072
Neg logit contributions
ocol-0.146
igs-0.118
icer-0.118
 degrees-0.117
arine-0.113
Token selection
Token ID6356
Feature activation-1.15e-01

Pos logit contributions
 said+0.037
 asked+0.035
 say+0.033
 mother+0.031
 lost+0.031
Neg logit contributions
ocol-0.069
igs-0.057
 degrees-0.056
icer-0.056
ions-0.054
Token of
Token ID286
Feature activation+9.94e-02

Pos logit contributions
ocol+0.356
igs+0.305
icer+0.301
 degrees+0.300
arine+0.294
Neg logit contributions
 said-0.095
 asked-0.085
 say-0.075
 mother-0.067
 lost-0.066
Token ch
Token ID442
Feature activation-9.54e-03

Pos logit contributions
 said+0.164
 asked+0.157
 say+0.149
 lost+0.141
 know+0.140
Neg logit contributions
ocol-0.222
igs-0.180
icer-0.178
 degrees-0.177
ipping-0.177
Tokenocol
Token ID4668
Feature activation-4.54e-01

Pos logit contributions
mm+0.003
Too+0.003
icious+0.002
arine+0.001
icer+0.001
Neg logit contributions
 asked-0.028
 know-0.028
 said-0.028
 big-0.027
 open-0.027
Tokenates
Token ID689
Feature activation-1.03e-01

Pos logit contributions
Too+0.643
ocol+0.609
 algae+0.534
 pleasures+0.516
 degrees+0.515
Neg logit contributions
 said-0.897
 asked-0.876
 dad-0.818
 wrote-0.811
 write-0.806
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
ocol+0.303
igs+0.256
icer+0.252
 degrees+0.252
ipping+0.247
Neg logit contributions
 said-0.100
 asked-0.092
 say-0.085
 mother-0.078
 lost-0.076
Token Ell
Token ID7122
Feature activation-4.54e-02

Pos logit contributions
 said+0.029
 asked+0.028
 say+0.025
 mother+0.024
 mom+0.024
Neg logit contributions
ocol-0.053
igs-0.045
 degrees-0.042
ipping-0.042
orns-0.042
Tokena
Token ID64
Feature activation+9.29e-03

Pos logit contributions
ocol+0.090
icer+0.079
ins+0.076
 degrees+0.075
ipping+0.074
Neg logit contributions
 say-0.074
 understand-0.069
 said-0.069
 tell-0.068
 write-0.068
Token picked
Token ID6497
Feature activation-9.18e-03

Pos logit contributions
 say+0.008
 understand+0.007
 said+0.007
 know+0.007
 mother+0.007
Neg logit contributions
ocol-0.025
igs-0.022
 degrees-0.022
icer-0.022
ipping-0.021
Token enjoyed
Token ID8359
Feature activation+3.18e-02

Pos logit contributions
 said+0.002
 understand+0.002
 asked+0.002
 know+0.002
 lost+0.002
Neg logit contributions
ocol-0.006
igs-0.005
ions-0.005
 degrees-0.005
arine-0.005
Token her
Token ID607
Feature activation+4.96e-02

Pos logit contributions
 said+0.032
 asked+0.030
 say+0.026
 dad+0.025
 mom+0.024
Neg logit contributions
ocol-0.094
icer-0.084
 degrees-0.081
igs-0.081
Too-0.080
Token cool
Token ID3608
Feature activation-7.15e-03

Pos logit contributions
 said+0.080
 asked+0.075
 say+0.073
 mother+0.069
 lost+0.068
Neg logit contributions
ocol-0.115
igs-0.092
icer-0.090
 degrees-0.090
ipping-0.088
Token,
Token ID11
Feature activation-8.09e-02

Pos logit contributions
ocol+0.017
igs+0.014
 degrees+0.014
icer+0.014
ions+0.014
Neg logit contributions
 said-0.010
 asked-0.010
 say-0.009
 smile-0.009
 mother-0.009
Token ch
Token ID442
Feature activation-1.32e-02

Pos logit contributions
ocol+0.207
igs+0.170
 degrees+0.167
icer+0.167
ipping+0.164
Neg logit contributions
 said-0.107
 asked-0.102
 say-0.095
 mother-0.091
 understand-0.089
Tokenocol
Token ID4668
Feature activation-4.53e-01

Pos logit contributions
ocol+0.014
mm+0.014
icer+0.013
 degrees+0.013
arine+0.013
Neg logit contributions
 said-0.030
 asked-0.029
 smile-0.029
 say-0.028
 know-0.028
Tokenat
Token ID265
Feature activation-9.70e-02

Pos logit contributions
ocol+0.635
Too+0.572
 algae+0.498
 pleasures+0.485
 degrees+0.477
Neg logit contributions
 said-0.985
 asked-0.933
 dad-0.876
 friend-0.872
 answered-0.871
Tokeny
Token ID88
Feature activation-1.91e-03

Pos logit contributions
ocol+0.230
igs+0.187
 degrees+0.184
icer+0.182
ipping+0.180
Neg logit contributions
 said-0.145
 asked-0.140
 say-0.131
 lost-0.125
 know-0.125
Token treat
Token ID2190
Feature activation-6.12e-02

Pos logit contributions
ocol+0.005
igs+0.004
 degrees+0.004
icer+0.004
ipping+0.004
Neg logit contributions
 said-0.003
 asked-0.002
 say-0.002
 understand-0.002
 know-0.002
Token!
Token ID0
Feature activation-3.89e-02

Pos logit contributions
ocol+0.169
igs+0.143
icer+0.141
 degrees+0.138
ipping+0.137
Neg logit contributions
 said-0.073
 asked-0.066
 say-0.062
 mother-0.057
 understand-0.055
Token
Token ID198
Feature activation+5.45e-03

Pos logit contributions
ocol+0.108
igs+0.090
 degrees+0.089
icer+0.089
ipping+0.087
Neg logit contributions
 said-0.044
 asked-0.041
 say-0.038
 mother-0.036
 understand-0.035
Token't
Token ID470
Feature activation-2.08e-02

Pos logit contributions
ocol+0.024
+0.022
mm+0.021
arine+0.020
igs+0.020
Neg logit contributions
 said-0.016
 woman-0.015
 asked-0.014
 mother-0.014
 grand-0.014
Token have
Token ID423
Feature activation+1.55e-02

Pos logit contributions
ocol+0.055
mm+0.044
arine+0.044
igs+0.044
 degrees+0.044
Neg logit contributions
 said-0.023
 asked-0.020
 mother-0.019
 woman-0.019
 grand-0.018
Token all
Token ID477
Feature activation+1.38e-01

Pos logit contributions
 said+0.021
 say+0.020
 asked+0.020
 mother+0.019
 understand+0.019
Neg logit contributions
ocol-0.038
igs-0.030
 degrees-0.030
arine-0.029
icer-0.029
Token these
Token ID777
Feature activation-1.39e-01

Pos logit contributions
 said+0.182
 asked+0.173
 mother+0.172
 say+0.162
 father+0.157
Neg logit contributions
ocol-0.345
igs-0.275
ions-0.273
 degrees-0.271
orns-0.265
Token ch
Token ID442
Feature activation-1.52e-02

Pos logit contributions
ocol+0.315
igs+0.249
 degrees+0.247
icer+0.244
ions+0.237
Neg logit contributions
 said-0.225
 asked-0.217
 say-0.205
 mother-0.199
 understand-0.195
Tokenocol
Token ID4668
Feature activation-4.53e-01

Pos logit contributions
Too+0.004
abouts+0.003
Sp+0.003
mm+0.003
Spl+0.003
Neg logit contributions
 know-0.046
 said-0.045
 asked-0.045
 dad-0.044
 say-0.044
Tokenates
Token ID689
Feature activation-1.09e-01

Pos logit contributions
Too+0.644
ocol+0.632
 algae+0.538
abouts+0.524
Time+0.510
Neg logit contributions
 said-0.917
 asked-0.886
 know-0.814
 dad-0.811
 answered-0.807
Token!
Token ID0
Feature activation-3.11e-02

Pos logit contributions
ocol+0.285
igs+0.235
 degrees+0.231
icer+0.231
ipping+0.226
Neg logit contributions
 said-0.141
 asked-0.133
 say-0.125
 mother-0.118
 understand-0.116
Token Now
Token ID2735
Feature activation-1.33e-01

Pos logit contributions
ocol+0.067
igs+0.056
 degrees+0.053
ipping+0.052
icer+0.052
Neg logit contributions
 said-0.056
 asked-0.056
 say-0.052
 know-0.051
 understand-0.049
Token,
Token ID11
Feature activation-8.66e-02

Pos logit contributions
ocol+0.320
igs+0.259
 degrees+0.254
icer+0.254
ipping+0.248
Neg logit contributions
 said-0.198
 asked-0.189
 say-0.179
 mother-0.170
 understand-0.168
Token go
Token ID467
Feature activation-7.65e-02

Pos logit contributions
ocol+0.201
igs+0.161
 degrees+0.158
icer+0.156
ions+0.154
Neg logit contributions
 said-0.134
 asked-0.127
 say-0.120
 mother-0.115
 lost-0.115
Token,
Token ID11
Feature activation-8.59e-02

Pos logit contributions
ocol+0.348
 degrees+0.290
igs+0.290
ipping+0.287
icer+0.286
Neg logit contributions
 said-0.142
 asked-0.138
 say-0.128
 know-0.125
 understand-0.123
Token cup
Token ID6508
Feature activation-1.86e-01

Pos logit contributions
ocol+0.231
igs+0.193
 degrees+0.188
icer+0.187
ipping+0.185
Neg logit contributions
 said-0.115
 asked-0.108
 say-0.099
 lost-0.092
 mother-0.091
Tokencakes
Token ID37263
Feature activation-4.03e-02

Pos logit contributions
ocol+0.461
igs+0.375
icer+0.368
 degrees+0.361
ipping+0.352
Neg logit contributions
 said-0.286
 asked-0.260
 say-0.245
 mother-0.231
 dad-0.227
Token and
Token ID290
Feature activation-5.47e-03

Pos logit contributions
ocol+0.115
igs+0.096
icer+0.094
 degrees+0.094
ipping+0.092
Neg logit contributions
 said-0.045
 asked-0.042
 say-0.039
 mother-0.035
 lost-0.035
Token ch
Token ID442
Feature activation-1.57e-02

Pos logit contributions
ocol+0.013
igs+0.010
icer+0.010
 degrees+0.010
ipping+0.010
Neg logit contributions
 said-0.009
 asked-0.008
 say-0.008
 mother-0.008
 lost-0.007
Tokenocol
Token ID4668
Feature activation-4.53e-01

Pos logit contributions
mm+0.005
Too+0.005
icer+0.005
 centimeters+0.005
icious+0.005
Neg logit contributions
 said-0.047
 asked-0.047
 know-0.044
 dad-0.044
 smile-0.044
Tokenates
Token ID689
Feature activation-1.10e-01

Pos logit contributions
Too+0.715
ocol+0.656
 pleasures+0.599
Time+0.580
 algae+0.579
Neg logit contributions
 said-0.848
 asked-0.806
 dad-0.737
 wrote-0.735
 answered-0.726
Token.
Token ID13
Feature activation+8.81e-03

Pos logit contributions
ocol+0.312
igs+0.261
 degrees+0.258
icer+0.256
ions+0.253
Neg logit contributions
 said-0.114
 asked-0.107
 say-0.099
 mother-0.093
 know-0.091
Token
Token ID220
Feature activation-1.50e-02

Pos logit contributions
 said+0.010
 asked+0.010
 say+0.009
 dad+0.008
 mom+0.008
Neg logit contributions
ocol-0.025
igs-0.021
 degrees-0.021
ipping-0.020
ions-0.020
Token
Token ID198
Feature activation+2.96e-03

Pos logit contributions
ocol+0.044
igs+0.037
 degrees+0.036
icer+0.036
ipping+0.036
Neg logit contributions
 said-0.015
 asked-0.014
 say-0.013
 mother-0.012
 lost-0.011
Token
Token ID198
Feature activation+3.78e-03

Pos logit contributions
 said+0.003
 asked+0.002
 say+0.002
 understand+0.002
 know+0.002
Neg logit contributions
ocol-0.009
igs-0.008
 degrees-0.008
ipping-0.008
icer-0.008
Token he
Token ID339
Feature activation+5.52e-02

Pos logit contributions
 said+0.258
 asked+0.246
 say+0.237
 understand+0.234
 lost+0.232
Neg logit contributions
ocol-0.391
igs-0.312
 degrees-0.303
mm-0.302
ions-0.301
Token ate
Token ID15063
Feature activation-5.25e-02

Pos logit contributions
 understand+0.061
 say+0.058
 mother+0.058
 aunt+0.056
 parent+0.056
Neg logit contributions
ocol-0.126
 degrees-0.112
igs-0.110
Too-0.109
 centimeters-0.108
Token all
Token ID477
Feature activation+1.40e-01

Pos logit contributions
ocol+0.153
igs+0.131
 degrees+0.130
ipping+0.129
Too+0.127
Neg logit contributions
 said-0.056
 asked-0.049
 say-0.046
 mom-0.042
 dad-0.042
Token the
Token ID262
Feature activation-3.54e-03

Pos logit contributions
 said+0.157
 asked+0.150
 say+0.139
 mother+0.139
 understand+0.134
Neg logit contributions
ocol-0.382
igs-0.317
ions-0.315
icer-0.313
ipping-0.306
Token ch
Token ID442
Feature activation-1.51e-02

Pos logit contributions
ocol+0.007
igs+0.006
 degrees+0.006
icer+0.006
ipping+0.006
Neg logit contributions
 said-0.006
 asked-0.006
 say-0.006
 mother-0.006
 understand-0.006
Tokenocol
Token ID4668
Feature activation-4.53e-01

Pos logit contributions
ocol+0.000
Too-0.000
 degrees-0.000
icer-0.001
arine-0.001
Neg logit contributions
 said-0.050
 asked-0.049
 know-0.048
 smile-0.047
 say-0.047
Tokenates
Token ID689
Feature activation-1.08e-01

Pos logit contributions
ocol+0.592
Too+0.529
 algae+0.458
 degrees+0.439
 pleasures+0.430
Neg logit contributions
 said-1.024
 asked-0.990
 answered-0.905
 say-0.903
 mother-0.899
Token and
Token ID290
Feature activation-6.77e-03

Pos logit contributions
ocol+0.284
igs+0.236
 degrees+0.234
ions+0.232
icer+0.232
Neg logit contributions
 said-0.127
 asked-0.123
 say-0.114
 mother-0.109
 know-0.109
Token sweets
Token ID42402
Feature activation-4.43e-02

Pos logit contributions
ocol+0.017
igs+0.015
 degrees+0.014
mm+0.014
arine+0.014
Neg logit contributions
 understand-0.007
 said-0.007
 say-0.007
 know-0.006
 mother-0.006
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
ocol+0.120
igs+0.100
 degrees+0.100
icer+0.098
ipping+0.098
Neg logit contributions
 said-0.049
 asked-0.047
 say-0.044
 mother-0.042
 know-0.042
Token Then
Token ID3244
Feature activation-5.02e-02

Pos logit contributions
 said+0.024
 asked+0.023
 say+0.021
 mother+0.020
 know+0.019
Neg logit contributions
ocol-0.044
igs-0.036
 degrees-0.035
icer-0.035
ipping-0.034
Token.
Token ID13
Feature activation+5.56e-02

Pos logit contributions
ocol+0.088
 degrees+0.072
igs+0.072
icer+0.072
ipping+0.070
Neg logit contributions
 said-0.035
 asked-0.033
 say-0.031
 mother-0.030
 understand-0.030
Token Every
Token ID3887
Feature activation+1.12e-01

Pos logit contributions
 said+0.060
 asked+0.058
 say+0.053
 know+0.049
 mother+0.049
Neg logit contributions
ocol-0.158
igs-0.135
 degrees-0.133
icer-0.129
ipping-0.129
Token day
Token ID1110
Feature activation-4.60e-03

Pos logit contributions
 said+0.159
 asked+0.152
 say+0.144
 mother+0.136
 understand+0.136
Neg logit contributions
ocol-0.268
igs-0.219
icer-0.217
 degrees-0.217
ipping-0.213
Token,
Token ID11
Feature activation-7.15e-02

Pos logit contributions
ocol+0.012
igs+0.010
 degrees+0.010
icer+0.010
ipping+0.010
Neg logit contributions
 said-0.006
 asked-0.005
 say-0.005
 mother-0.005
 understand-0.005
Token her
Token ID607
Feature activation+5.78e-02

Pos logit contributions
ocol+0.180
igs+0.147
icer+0.144
ipping+0.144
 degrees+0.144
Neg logit contributions
 said-0.093
 asked-0.088
 say-0.084
 know-0.080
 understand-0.080
Token dad
Token ID9955
Feature activation+2.35e-01

Pos logit contributions
 asked+0.059
 know+0.056
 said+0.056
 understand+0.053
 say+0.049
Neg logit contributions
ocol-0.134
-0.118
ipping-0.116
mm-0.114
 degrees-0.113
Token asked
Token ID1965
Feature activation+2.27e-01

Pos logit contributions
 mother+0.219
 know+0.213
 understand+0.210
 wish+0.209
 said+0.205
Neg logit contributions
ocol-0.577
ions-0.490
arine-0.484
igs-0.483
ipping-0.480
Token her
Token ID607
Feature activation+5.75e-02

Pos logit contributions
 said+0.210
 asked+0.197
 say+0.178
 mother+0.158
 understand+0.157
Neg logit contributions
ocol-0.688
igs-0.600
icer-0.594
 degrees-0.585
ipping-0.574
Token a
Token ID257
Feature activation+7.40e-02

Pos logit contributions
 said+0.067
 lost+0.062
 asked+0.061
 say+0.058
 whispered+0.054
Neg logit contributions
ocol-0.134
ions-0.113
 cents-0.110
 degrees-0.109
igs-0.108
Token lot
Token ID1256
Feature activation-2.07e-01

Pos logit contributions
 said+0.084
 asked+0.080
 say+0.072
 understand+0.071
 know+0.070
Neg logit contributions
ocol-0.193
 degrees-0.157
igs-0.155
icer-0.155
arine-0.155
Token of
Token ID286
Feature activation+9.99e-02

Pos logit contributions
ocol+0.590
igs+0.492
ions+0.491
 degrees+0.485
ipping+0.475
Neg logit contributions
 said-0.213
 asked-0.203
 say-0.184
 lost-0.177
 know-0.174
Token Lily
Token ID20037
Feature activation-8.26e-02

Pos logit contributions
 said+0.018
 asked+0.018
 say+0.017
 understand+0.017
 lost+0.017
Neg logit contributions
ocol-0.024
igs-0.019
mm-0.019
icer-0.019
 degrees-0.019
Token and
Token ID290
Feature activation-1.22e-02

Pos logit contributions
Too+0.158
ipping+0.152
igs+0.148
mm+0.148
ede+0.147
Neg logit contributions
 say-0.086
 pilot-0.084
 aunt-0.084
 grand-0.084
 wish-0.081
Token Max
Token ID5436
Feature activation+2.36e-02

Pos logit contributions
ocol+0.026
igs+0.021
icer+0.021
ipping+0.021
arine+0.021
Neg logit contributions
 said-0.017
 asked-0.017
 know-0.015
 understand-0.015
 lost-0.015
Token hugged
Token ID41130
Feature activation+1.59e-01

Pos logit contributions
 grandmother+0.030
 woman+0.030
 grand+0.029
 pilot+0.029
 grandfather+0.028
Neg logit contributions
Too-0.038
ocol-0.037
ar-0.037
mm-0.037
orting-0.037
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
 said+0.215
 asked+0.211
 lost+0.193
 say+0.193
 know+0.192
Neg logit contributions
ocol-0.351
igs-0.296
ipping-0.288
icer-0.284
mm-0.283
Token said
Token ID531
Feature activation+2.52e-01

Pos logit contributions
ocol+0.021
mm+0.020
igs+0.020
Too+0.019
 algae+0.019
Neg logit contributions
 name-0.012
 pilot-0.012
 father-0.011
case-0.011
 mother-0.011
Token sorry
Token ID7926
Feature activation-2.73e-03

Pos logit contributions
 said+0.363
 asked+0.343
 say+0.322
 mother+0.307
 understand+0.306
Neg logit contributions
ocol-0.632
icer-0.516
igs-0.514
 degrees-0.507
ions-0.493
Token to
Token ID284
Feature activation+6.74e-02

Pos logit contributions
ocol+0.008
icer+0.007
igs+0.007
 degrees+0.007
ipping+0.007
Neg logit contributions
 said-0.002
 asked-0.002
 say-0.002
 mother-0.002
 understand-0.002
Token each
Token ID1123
Feature activation+6.46e-02

Pos logit contributions
 said+0.090
 asked+0.084
 answered+0.077
 say+0.074
 lost+0.074
Neg logit contributions
ocol-0.159
igs-0.132
mm-0.129
arine-0.128
 degrees-0.126
Token other
Token ID584
Feature activation+5.86e-02

Pos logit contributions
 said+0.081
 asked+0.076
 lost+0.073
 grand+0.072
 rang+0.071
Neg logit contributions
ocol-0.147
ions-0.121
ipping-0.120
icer-0.120
orns-0.119
Token.
Token ID13
Feature activation+6.85e-03

Pos logit contributions
 said+0.069
 asked+0.066
 say+0.061
 mother+0.059
 know+0.056
Neg logit contributions
ocol-0.153
igs-0.127
 degrees-0.127
ipping-0.125
icer-0.124
Token waited
Token ID13488
Feature activation-2.53e-02

Pos logit contributions
ocol+0.017
 degrees+0.015
igs+0.015
icer+0.015
arine+0.015
Neg logit contributions
 understand-0.006
 say-0.006
 know-0.006
 mother-0.006
 parent-0.005
Token and
Token ID290
Feature activation+4.13e-04

Pos logit contributions
ocol+0.065
orns+0.053
icer+0.053
igs+0.052
mm+0.052
Neg logit contributions
 said-0.036
 asked-0.034
 say-0.032
 dad-0.031
 mother-0.030
Token waited
Token ID13488
Feature activation-2.63e-02

Pos logit contributions
 said+0.000
 asked+0.000
 say+0.000
 lost+0.000
 know+0.000
Neg logit contributions
ocol-0.001
igs-0.001
icer-0.001
orns-0.001
 degrees-0.001
Token,
Token ID11
Feature activation-7.83e-02

Pos logit contributions
ocol+0.066
orns+0.055
icer+0.055
igs+0.053
mm+0.053
Neg logit contributions
 said-0.038
 asked-0.036
 say-0.034
 dad-0.032
 mother-0.032
Token but
Token ID475
Feature activation-3.54e-02

Pos logit contributions
ocol+0.230
igs+0.196
orns+0.193
ions+0.189
icer+0.188
Neg logit contributions
 said-0.099
 asked-0.094
 say-0.084
 dad-0.077
 saw-0.074
Token mummy
Token ID38567
Feature activation+1.84e-01

Pos logit contributions
ocol+0.097
igs+0.082
orns+0.081
 degrees+0.079
ips+0.078
Neg logit contributions
 said-0.046
 asked-0.044
 say-0.038
 mother-0.038
 lost-0.037
Token didn
Token ID1422
Feature activation-1.08e-01

Pos logit contributions
 know+0.136
 say+0.135
 asked+0.133
 said+0.133
 understand+0.132
Neg logit contributions
ocol-0.504
 degrees-0.443
ions-0.431
ipping-0.428
igs-0.425
Token't
Token ID470
Feature activation-1.77e-02

Pos logit contributions
ocol+0.312
igs+0.260
orns+0.258
icer+0.256
 degrees+0.254
Neg logit contributions
 said-0.118
 asked-0.113
 say-0.107
 open-0.101
 understand-0.100
Token come
Token ID1282
Feature activation-9.93e-02

Pos logit contributions
ocol+0.040
 degrees+0.034
 pleasures+0.034
 centimeters+0.033
ipping+0.032
Neg logit contributions
 said-0.020
 aunt-0.018
 mother-0.018
 asked-0.017
 grand-0.017
Token and
Token ID290
Feature activation+3.06e-04

Pos logit contributions
ocol+0.304
igs+0.258
 degrees+0.254
icer+0.254
mm+0.249
Neg logit contributions
 said-0.085
 asked-0.079
 say-0.072
 mother-0.067
 lost-0.065
Token find
Token ID1064
Feature activation+3.10e-02

Pos logit contributions
 said+0.000
 understand+0.000
 asked+0.000
 mother+0.000
 woman+0.000
Neg logit contributions
ocol-0.001
 degrees-0.001
ions-0.001
arine-0.001
ipping-0.001
Token structure
Token ID4645
Feature activation-1.62e-02

Pos logit contributions
 said+0.110
 asked+0.106
 say+0.099
 understand+0.096
 mother+0.096
Neg logit contributions
ocol-0.156
igs-0.126
icer-0.124
 degrees-0.120
ipping-0.119
Token that
Token ID326
Feature activation+1.79e-01

Pos logit contributions
ocol+0.039
igs+0.033
icer+0.032
ions+0.032
ipping+0.031
Neg logit contributions
 said-0.022
 asked-0.021
 mother-0.020
 say-0.020
 understand-0.020
Token he
Token ID339
Feature activation+5.38e-02

Pos logit contributions
 said+0.185
 understand+0.182
 say+0.179
 mother+0.179
 woman+0.174
Neg logit contributions
ocol-0.430
icer-0.385
igs-0.378
mm-0.371
arine-0.368
Token was
Token ID373
Feature activation-2.01e-02

Pos logit contributions
 say+0.059
 mother+0.059
 woman+0.057
 understand+0.056
 smile+0.054
Neg logit contributions
ocol-0.128
igs-0.119
icer-0.116
 centimeters-0.113
Too-0.110
Token very
Token ID845
Feature activation+4.12e-02

Pos logit contributions
ocol+0.048
igs+0.043
 centimeters+0.040
ions+0.039
 degrees+0.039
Neg logit contributions
 said-0.025
 mother-0.024
 say-0.023
 woman-0.023
 asked-0.022
Token proud
Token ID6613
Feature activation+1.43e-01

Pos logit contributions
 said+0.047
 asked+0.043
 say+0.042
 mother+0.041
 understand+0.040
Neg logit contributions
ocol-0.111
igs-0.094
 degrees-0.089
 centimeters-0.089
icer-0.088
Token of
Token ID286
Feature activation+9.51e-02

Pos logit contributions
 said+0.140
 asked+0.130
 say+0.123
 understand+0.116
 mother+0.114
Neg logit contributions
ocol-0.409
igs-0.346
ions-0.341
 degrees-0.339
icer-0.336
Token.
Token ID13
Feature activation+3.86e-02

Pos logit contributions
 said+0.095
 asked+0.094
 mom+0.078
 say+0.077
 know+0.075
Neg logit contributions
ocol-0.290
 degrees-0.247
icer-0.245
arine-0.241
ipping-0.240
Token He
Token ID679
Feature activation-1.01e-02

Pos logit contributions
 said+0.047
 asked+0.045
 say+0.041
 mother+0.038
 know+0.038
Neg logit contributions
ocol-0.106
igs-0.087
 degrees-0.086
icer-0.085
ipping-0.084
Token wanted
Token ID2227
Feature activation+8.37e-02

Pos logit contributions
ocol+0.021
igs+0.021
Spl+0.020
 rods+0.019
 centimeters+0.019
Neg logit contributions
 understand-0.011
 woman-0.011
 mother-0.010
 grandmother-0.010
 say-0.010
Token to
Token ID284
Feature activation+7.80e-02

Pos logit contributions
 said+0.058
 asked+0.051
 say+0.046
 mother+0.042
 understand+0.040
Neg logit contributions
ocol-0.265
igs-0.229
icer-0.224
ions-0.223
 degrees-0.223
Token looked
Token ID3114
Feature activation+5.17e-02

Pos logit contributions
 understand+0.012
 say+0.011
 pilot+0.011
 mother+0.010
 smile+0.010
Neg logit contributions
ocol-0.030
igs-0.029
icer-0.028
-0.028
corn-0.027
Token at
Token ID379
Feature activation+4.09e-02

Pos logit contributions
 said+0.075
 write+0.074
 say+0.074
 understand+0.074
 know+0.072
Neg logit contributions
ocol-0.112
igs-0.092
icer-0.088
ipping-0.086
 algae-0.084
Token the
Token ID262
Feature activation+1.92e-03

Pos logit contributions
 said+0.059
 understand+0.055
 say+0.054
 asked+0.053
 write+0.052
Neg logit contributions
ocol-0.093
igs-0.079
ions-0.079
icer-0.078
orns-0.077
Token bag
Token ID6131
Feature activation-3.74e-02

Pos logit contributions
 said+0.003
 asked+0.003
 say+0.003
 know+0.003
 understand+0.003
Neg logit contributions
ocol-0.004
icer-0.003
igs-0.003
 degrees-0.003
arine-0.003
Token and
Token ID290
Feature activation-2.77e-03

Pos logit contributions
ocol+0.085
ipping+0.075
ions+0.074
igs+0.074
icer+0.072
Neg logit contributions
 know-0.047
 asked-0.046
 said-0.046
 say-0.045
 understand-0.044
Token said
Token ID531
Feature activation+2.63e-01

Pos logit contributions
ocol+0.007
igs+0.006
 algae+0.006
abouts+0.006
 degrees+0.006
Neg logit contributions
 understand-0.002
 know-0.002
 aunt-0.002
 pilot-0.002
 parent-0.002
Token,
Token ID11
Feature activation-8.02e-02

Pos logit contributions
 said+0.303
 asked+0.278
 say+0.254
 mother+0.242
 mom+0.236
Neg logit contributions
ocol-0.741
icer-0.629
igs-0.617
 degrees-0.608
orns-0.602
Token "
Token ID366
Feature activation-9.70e-02

Pos logit contributions
ocol+0.222
igs+0.185
icer+0.184
orns+0.182
 degrees+0.177
Neg logit contributions
 said-0.108
 asked-0.101
 say-0.091
 mother-0.088
 dad-0.087
TokenMom
Token ID29252
Feature activation+6.10e-02

Pos logit contributions
ocol+0.233
igs+0.193
 degrees+0.186
ipping+0.180
ions+0.178
Neg logit contributions
 said-0.154
 asked-0.150
 say-0.139
 mother-0.133
 understand-0.131
Tokenmy
Token ID1820
Feature activation+1.24e-01

Pos logit contributions
 said+0.062
 asked+0.059
 say+0.054
 understand+0.053
 lost+0.051
Neg logit contributions
ocol-0.169
icer-0.142
igs-0.141
ipping-0.139
 degrees-0.139
Token,
Token ID11
Feature activation-7.66e-02

Pos logit contributions
 said+0.172
 asked+0.166
 say+0.155
 understand+0.152
 mother+0.149
Neg logit contributions
ocol-0.299
igs-0.242
icer-0.240
ipping-0.240
 degrees-0.239
Token,
Token ID11
Feature activation-8.09e-02

Pos logit contributions
ocol+0.043
 degrees+0.037
igs+0.036
icer+0.036
arine+0.035
Neg logit contributions
 said-0.012
 asked-0.011
 mom-0.009
 lost-0.009
 saw-0.008
Token Tim
Token ID5045
Feature activation-8.76e-02

Pos logit contributions
ocol+0.214
 degrees+0.174
igs+0.173
icer+0.172
ipping+0.167
Neg logit contributions
 said-0.119
 asked-0.111
 say-0.103
 mom-0.098
 lost-0.097
Token found
Token ID1043
Feature activation-9.86e-05

Pos logit contributions
ocol+0.204
icer+0.190
igs+0.190
orns+0.189
ipping+0.187
Neg logit contributions
 mother-0.087
 understand-0.084
 say-0.082
 said-0.078
 aunt-0.078
Token his
Token ID465
Feature activation-2.27e-02

Pos logit contributions
ocol+0.000
igs+0.000
icer+0.000
ions+0.000
 degrees+0.000
Neg logit contributions
 said-0.000
 asked-0.000
 say-0.000
 mother-0.000
 understand-0.000
Token basketball
Token ID9669
Feature activation-1.05e-01

Pos logit contributions
ocol+0.035
igs+0.028
ipping+0.028
+0.027
ips+0.026
Neg logit contributions
 said-0.048
 asked-0.047
 say-0.045
 understand-0.045
 know-0.044
Token very
Token ID845
Feature activation+3.69e-02

Pos logit contributions
ocol+0.298
igs+0.249
 degrees+0.245
icer+0.245
ipping+0.240
Neg logit contributions
 said-0.114
 asked-0.106
 say-0.098
 mother-0.091
 lost-0.090
Token quickly
Token ID2952
Feature activation+6.30e-02

Pos logit contributions
 said+0.053
 say+0.051
 asked+0.050
 woman+0.048
 mother+0.048
Neg logit contributions
ocol-0.087
igs-0.075
icer-0.070
 degrees-0.070
 centimeters-0.069
Token and
Token ID290
Feature activation-1.33e-03

Pos logit contributions
 said+0.062
 asked+0.058
 say+0.053
 mother+0.050
 understand+0.049
Neg logit contributions
ocol-0.182
igs-0.154
 degrees-0.152
icer-0.151
ipping-0.149
Token went
Token ID1816
Feature activation-1.54e-02

Pos logit contributions
ocol+0.003
igs+0.003
 algae+0.003
ions+0.002
 salsa+0.002
Neg logit contributions
 understand-0.001
 mother-0.001
 aunt-0.001
 woman-0.001
 know-0.001
Token to
Token ID284
Feature activation+7.68e-02

Pos logit contributions
ocol+0.045
igs+0.039
ipping+0.038
ips+0.038
ions+0.038
Neg logit contributions
 said-0.013
 asked-0.012
 mother-0.011
 understand-0.011
 say-0.011
Token play
Token ID711
Feature activation-3.75e-02

Pos logit contributions
 said+0.100
 woman+0.098
 pilot+0.092
 mother+0.092
 asked+0.087
Neg logit contributions
ocol-0.170
igs-0.140
ipping-0.135
 salsa-0.133
icer-0.133
Token a
Token ID257
Feature activation+7.81e-02

Pos logit contributions
ocol+0.117
icer+0.096
igs+0.095
 degrees+0.093
Too+0.092
Neg logit contributions
 said-0.077
 asked-0.074
 say-0.070
 dad-0.068
 mother-0.067
Token special
Token ID2041
Feature activation+3.79e-02

Pos logit contributions
 said+0.107
 asked+0.103
 understand+0.092
 say+0.092
 know+0.089
Neg logit contributions
ocol-0.181
igs-0.147
-0.147
 degrees-0.147
ipping-0.142
Token day
Token ID1110
Feature activation-2.34e-03

Pos logit contributions
 said+0.054
 asked+0.052
 say+0.048
 know+0.046
 understand+0.046
Neg logit contributions
ocol-0.092
igs-0.075
 degrees-0.074
icer-0.073
ipping-0.073
Token.
Token ID13
Feature activation+4.91e-02

Pos logit contributions
ocol+0.007
igs+0.006
 degrees+0.006
icer+0.006
ipping+0.005
Neg logit contributions
 said-0.002
 asked-0.002
 say-0.002
 mother-0.002
 understand-0.002
Token The
Token ID383
Feature activation-3.85e-02

Pos logit contributions
 said+0.077
 asked+0.073
 say+0.070
 lost+0.067
 understand+0.067
Neg logit contributions
ocol-0.113
igs-0.090
icer-0.089
 degrees-0.088
ions-0.086
Token little
Token ID1310
Feature activation+7.04e-02

Pos logit contributions
ocol+0.103
 degrees+0.085
icer+0.085
igs+0.083
 algae+0.083
Neg logit contributions
 said-0.042
 asked-0.041
 know-0.039
 understand-0.037
 say-0.037
Token boy
Token ID2933
Feature activation+1.38e-01

Pos logit contributions
 said+0.057
 asked+0.053
 understand+0.052
 know+0.051
 say+0.051
Neg logit contributions
ocol-0.207
icer-0.176
igs-0.175
ipping-0.172
 degrees-0.172
Token was
Token ID373
Feature activation-1.31e-02

Pos logit contributions
 understand+0.116
 say+0.111
 mother+0.104
 know+0.102
 woman+0.100
Neg logit contributions
ocol-0.340
 degrees-0.306
igs-0.290
 centimeters-0.289
ar-0.285
Token so
Token ID523
Feature activation+9.18e-02

Pos logit contributions
ocol+0.032
icer+0.026
igs+0.026
 degrees+0.026
ipping+0.026
Neg logit contributions
 said-0.016
 say-0.015
 understand-0.015
 know-0.015
 mother-0.015
Token excited
Token ID6568
Feature activation+9.28e-02

Pos logit contributions
 said+0.085
 say+0.084
 aunt+0.083
 woman+0.082
 mother+0.081
Neg logit contributions
ocol-0.245
 degrees-0.202
igs-0.201
ipping-0.197
mm-0.194
Token.
Token ID13
Feature activation+4.60e-02

Pos logit contributions
 said+0.099
 asked+0.098
 understand+0.097
 know+0.095
 say+0.094
Neg logit contributions
ocol-0.238
 degrees-0.200
igs-0.197
ipping-0.197
ions-0.195
Token play
Token ID711
Feature activation-4.82e-02

Pos logit contributions
ocol+0.278
igs+0.233
icer+0.233
 degrees+0.230
ions+0.226
Neg logit contributions
 said-0.121
 asked-0.115
 say-0.111
 mother-0.100
 understand-0.098
Token somewhere
Token ID7382
Feature activation-9.42e-02

Pos logit contributions
ocol+0.156
icer+0.132
mm+0.131
 seventeen+0.130
arine+0.129
Neg logit contributions
 said-0.034
 big-0.034
 asked-0.031
 friend-0.031
 open-0.031
Token safe
Token ID3338
Feature activation+9.35e-04

Pos logit contributions
ocol+0.268
igs+0.229
 degrees+0.224
ipping+0.218
ions+0.217
Neg logit contributions
 said-0.095
 asked-0.089
 say-0.082
 mother-0.079
 understand-0.076
Token and
Token ID290
Feature activation-2.69e-02

Pos logit contributions
 said+0.001
 asked+0.001
 say+0.001
 lost+0.001
 mother+0.001
Neg logit contributions
ocol-0.003
igs-0.002
icer-0.002
 degrees-0.002
mm-0.002
Token fun
Token ID1257
Feature activation-7.56e-02

Pos logit contributions
ocol+0.071
igs+0.059
 degrees+0.058
icer+0.057
ipping+0.056
Neg logit contributions
 said-0.033
 asked-0.031
 say-0.028
 mother-0.028
 lost-0.026
Token."
Token ID526
Feature activation+2.19e-02

Pos logit contributions
ocol+0.223
icer+0.188
igs+0.187
 degrees+0.184
mm+0.183
Neg logit contributions
 said-0.074
 asked-0.071
 say-0.063
 lost-0.058
 understand-0.057
Token
Token ID198
Feature activation-2.01e-02

Pos logit contributions
 said+0.024
 asked+0.023
 say+0.021
 mother+0.020
 understand+0.020
Neg logit contributions
ocol-0.060
igs-0.050
icer-0.049
mm-0.049
 degrees-0.049
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
ocol+0.042
igs+0.036
icer+0.035
mm+0.034
orns+0.034
Neg logit contributions
 asked-0.028
 said-0.026
 pilot-0.026
 mother-0.025
 rang-0.025
TokenL
Token ID43
Feature activation-2.68e-01

Pos logit contributions
ocol+0.035
igs+0.025
 degrees+0.025
icer+0.025
ipping+0.025
Neg logit contributions
 said-0.040
 asked-0.038
 say-0.037
 mother-0.036
 understand-0.035
Tokenily
Token ID813
Feature activation-8.38e-03

Pos logit contributions
ocol+0.734
igs+0.610
 degrees+0.600
icer+0.600
arine+0.580
Neg logit contributions
 said-0.333
 asked-0.283
 say-0.282
 dad-0.265
 understand-0.253
Token and
Token ID290
Feature activation-2.65e-02

Pos logit contributions
ocol+0.015
Too+0.014
ipping+0.014
mm+0.014
igs+0.014
Neg logit contributions
 know-0.011
 understand-0.010
 say-0.010
 mother-0.010
 aunt-0.010
Token
Token ID198
Feature activation+2.38e-02

Pos logit contributions
ocol+0.213
igs+0.179
icer+0.176
 degrees+0.175
ions+0.172
Neg logit contributions
 said-0.078
 asked-0.072
 say-0.065
 mother-0.061
 lost-0.060
Token
Token ID198
Feature activation+1.88e-02

Pos logit contributions
 said+0.033
 asked+0.032
 say+0.029
 know+0.028
 understand+0.028
Neg logit contributions
ocol-0.061
igs-0.050
 degrees-0.050
icer-0.049
ipping-0.048
TokenShe
Token ID3347
Feature activation+6.68e-02

Pos logit contributions
 said+0.034
 asked+0.033
 say+0.031
 understand+0.030
 know+0.030
Neg logit contributions
ocol-0.041
 degrees-0.032
igs-0.032
icer-0.031
ipping-0.031
Token grabbed
Token ID13176
Feature activation-5.48e-02

Pos logit contributions
 understand+0.062
 mother+0.061
 know+0.057
 say+0.057
 woman+0.057
Neg logit contributions
ocol-0.165
igs-0.144
 degrees-0.138
Too-0.137
ar-0.135
Token her
Token ID607
Feature activation+4.31e-02

Pos logit contributions
ocol+0.150
igs+0.123
ions+0.121
icer+0.119
 degrees+0.119
Neg logit contributions
 said-0.063
 asked-0.060
 say-0.056
 understand-0.054
 know-0.052
Token passport
Token ID17981
Feature activation+2.43e-02

Pos logit contributions
 asked+0.062
 know+0.060
 understand+0.059
 said+0.059
 say+0.051
Neg logit contributions
ocol-0.104
icer-0.083
igs-0.082
ions-0.078
 degrees-0.078
Token and
Token ID290
Feature activation-6.69e-03

Pos logit contributions
 asked+0.036
 know+0.035
 understand+0.033
 said+0.033
 wish+0.032
Neg logit contributions
ocol-0.046
ions-0.042
igs-0.041
ipping-0.041
 degrees-0.039
Token said
Token ID531
Feature activation+2.52e-01

Pos logit contributions
ocol+0.016
igs+0.014
 degrees+0.014
ions+0.014
ipping+0.013
Neg logit contributions
 understand-0.007
 know-0.007
 mother-0.006
 say-0.006
 said-0.006
Token,
Token ID11
Feature activation-9.29e-02

Pos logit contributions
 said+0.306
 asked+0.284
 say+0.260
 mother+0.248
 understand+0.243
Neg logit contributions
ocol-0.691
icer-0.586
igs-0.577
 degrees-0.564
ipping-0.556
Token "
Token ID366
Feature activation-1.03e-01

Pos logit contributions
ocol+0.244
igs+0.203
icer+0.201
orns+0.197
 degrees+0.193
Neg logit contributions
 said-0.133
 asked-0.125
 say-0.117
 mother-0.112
 dad-0.109
TokenI
Token ID40
Feature activation+3.49e-02

Pos logit contributions
ocol+0.221
igs+0.183
 degrees+0.173
icer+0.164
 centimeters+0.164
Neg logit contributions
 said-0.186
 asked-0.181
 say-0.170
 write-0.162
 understand-0.162
Token The
Token ID383
Feature activation-7.22e-02

Pos logit contributions
ocol+0.011
igs+0.010
 degrees+0.009
icer+0.009
ipping+0.009
Neg logit contributions
 said-0.005
 asked-0.005
 say-0.005
 mother-0.005
 understand-0.004
Token fire
Token ID2046
Feature activation-1.59e-01

Pos logit contributions
ocol+0.121
igs+0.088
icer+0.086
 degrees+0.086
 centimeters+0.083
Neg logit contributions
 said-0.150
 asked-0.144
 say-0.139
 know-0.139
 understand-0.138
Tokenman
Token ID805
Feature activation+9.15e-02

Pos logit contributions
ocol+0.342
icer+0.281
ions+0.279
 degrees+0.278
igs+0.275
Neg logit contributions
 said-0.225
 asked-0.223
 know-0.218
 say-0.217
 understand-0.217
Token said
Token ID531
Feature activation+2.46e-01

Pos logit contributions
 know+0.078
 understand+0.077
 mother+0.076
 father+0.075
 parent+0.073
Neg logit contributions
ocol-0.218
ar-0.196
 degrees-0.196
igs-0.196
mm-0.196
Token he
Token ID339
Feature activation+3.82e-02

Pos logit contributions
 said+0.314
 asked+0.284
 say+0.260
 mom+0.245
 dad+0.245
Neg logit contributions
ocol-0.680
icer-0.592
igs-0.578
ipping-0.565
orns-0.558
Token liked
Token ID8288
Feature activation+6.12e-02

Pos logit contributions
 mother+0.040
 say+0.038
 daddy+0.037
 father+0.036
 smile+0.036
Neg logit contributions
ocol-0.083
 degrees-0.076
ar-0.074
Too-0.073
arine-0.072
Token to
Token ID284
Feature activation+6.77e-02

Pos logit contributions
 said+0.071
 asked+0.066
 say+0.061
 mother+0.057
 lost+0.056
Neg logit contributions
ocol-0.170
igs-0.142
icer-0.141
 degrees-0.140
ipping-0.139
Token slide
Token ID10649
Feature activation-4.22e-02

Pos logit contributions
 said+0.076
 asked+0.071
 mother+0.066
 lost+0.065
 say+0.062
Neg logit contributions
ocol-0.181
igs-0.152
 degrees-0.147
arine-0.145
mm-0.144
Token too
Token ID1165
Feature activation-4.93e-02

Pos logit contributions
ocol+0.126
igs+0.104
 degrees+0.104
icer+0.103
arine+0.103
Neg logit contributions
 said-0.044
 asked-0.039
 say-0.036
 lost-0.032
 mom-0.031
Token.
Token ID13
Feature activation-4.82e-03

Pos logit contributions
ocol+0.152
igs+0.128
icer+0.128
 degrees+0.126
ipping+0.126
Neg logit contributions
 said-0.044
 asked-0.040
 say-0.035
 know-0.032
 understand-0.032
Token He
Token ID679
Feature activation-2.79e-02

Pos logit contributions
ocol+0.013
igs+0.011
 degrees+0.011
icer+0.011
ipping+0.011
Neg logit contributions
 said-0.006
 asked-0.006
 say-0.005
 mother-0.005
 know-0.005
Token hugs
Token ID40657
Feature activation+6.72e-02

Pos logit contributions
ocol+0.076
igs+0.063
icer+0.063
 degrees+0.062
 algae+0.060
Neg logit contributions
 said-0.050
 mother-0.050
 asked-0.049
 understand-0.049
 woman-0.047
Token Rex
Token ID17853
Feature activation-1.12e-01

Pos logit contributions
 said+0.085
 asked+0.080
 say+0.076
 mother+0.073
 dad+0.069
Neg logit contributions
ocol-0.181
igs-0.149
 degrees-0.149
icer-0.147
ipping-0.144
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
ocol+0.313
igs+0.265
 degrees+0.263
icer+0.261
ipping+0.257
Neg logit contributions
 said-0.113
 asked-0.108
 say-0.097
 mother-0.092
 know-0.091
Token He
Token ID679
Feature activation-3.26e-02

Pos logit contributions
ocol+0.028
igs+0.023
icer+0.023
 degrees+0.023
ipping+0.023
Neg logit contributions
 said-0.014
 asked-0.013
 say-0.012
 mother-0.012
 know-0.011
Token says
Token ID1139
Feature activation+2.14e-01

Pos logit contributions
ocol+0.078
igs+0.068
 algae+0.067
 degrees+0.066
icer+0.066
Neg logit contributions
 mother-0.032
 understand-0.032
 asked-0.031
 daddy-0.031
 woman-0.031
Token,
Token ID11
Feature activation-1.02e-01

Pos logit contributions
 said+0.238
 asked+0.213
 say+0.198
 mom+0.189
 mother+0.187
Neg logit contributions
ocol-0.615
icer-0.536
 degrees-0.511
igs-0.511
arine-0.509
Token "
Token ID366
Feature activation-1.16e-01

Pos logit contributions
ocol+0.279
icer+0.233
igs+0.232
 degrees+0.224
ipping+0.223
Neg logit contributions
 said-0.141
 asked-0.132
 say-0.119
 mom-0.116
 mother-0.114
TokenRex
Token ID47389
Feature activation-1.01e-01

Pos logit contributions
ocol+0.257
igs+0.205
 degrees+0.197
icer+0.196
ipping+0.191
Neg logit contributions
 said-0.210
 asked-0.200
 say-0.191
 mother-0.181
 understand-0.181
Token,
Token ID11
Feature activation-9.97e-02

Pos logit contributions
ocol+0.303
igs+0.250
icer+0.249
ions+0.244
 degrees+0.244
Neg logit contributions
 said-0.116
 asked-0.099
 say-0.096
 lost-0.088
 mom-0.083
Token you
Token ID345
Feature activation-4.19e-02

Pos logit contributions
ocol+0.252
igs+0.207
icer+0.203
 degrees+0.203
ipping+0.200
Neg logit contributions
 said-0.140
 asked-0.132
 say-0.126
 mother-0.119
 know-0.117
Token are
Token ID389
Feature activation-5.04e-02

Pos logit contributions
ocol+0.078
igs+0.063
icer+0.061
 algae+0.060
 centimeters+0.060
Neg logit contributions
 woman-0.072
 mother-0.071
 father-0.069
 grandmother-0.069
 asked-0.068
Token out
Token ID503
Feature activation+4.28e-02

Pos logit contributions
ocol+0.014
igs+0.012
icer+0.012
 degrees+0.012
ipping+0.011
Neg logit contributions
 said-0.006
 asked-0.006
 say-0.006
 lost-0.005
 know-0.005
Token what
Token ID644
Feature activation-2.72e-03

Pos logit contributions
 said+0.047
 asked+0.046
 say+0.043
 mother+0.043
 understand+0.041
Neg logit contributions
ocol-0.115
ions-0.096
igs-0.095
ipping-0.094
 degrees-0.094
Token Kitty
Token ID21168
Feature activation-2.73e-02

Pos logit contributions
ocol+0.006
ions+0.005
igs+0.005
 degrees+0.005
orns+0.005
Neg logit contributions
 say-0.003
 said-0.003
 mother-0.003
 asked-0.003
 understand-0.003
Token did
Token ID750
Feature activation-5.95e-03

Pos logit contributions
ocol+0.079
 degrees+0.068
igs+0.067
icer+0.067
ipping+0.067
Neg logit contributions
 said-0.019
 understand-0.018
 say-0.018
 mother-0.018
 asked-0.018
Token.
Token ID13
Feature activation+2.90e-02

Pos logit contributions
ocol+0.017
igs+0.015
 degrees+0.014
icer+0.014
ipping+0.014
Neg logit contributions
 said-0.006
 asked-0.005
 say-0.005
 mother-0.005
 understand-0.004
Token They
Token ID1119
Feature activation-1.04e-01

Pos logit contributions
 said+0.046
 asked+0.044
 say+0.041
 mother+0.040
 know+0.039
Neg logit contributions
ocol-0.069
igs-0.055
icer-0.055
 degrees-0.054
ipping-0.053
Token were
Token ID547
Feature activation-2.71e-03

Pos logit contributions
ocol+0.201
igs+0.174
 algae+0.173
ions+0.173
 dams+0.169
Neg logit contributions
 grandmother-0.123
 woman-0.120
 mother-0.117
 letter-0.116
 tutor-0.114
Token very
Token ID845
Feature activation+3.36e-02

Pos logit contributions
ocol+0.007
igs+0.006
 centimeters+0.006
 degrees+0.006
ips+0.006
Neg logit contributions
 said-0.003
 write-0.003
 say-0.003
 mother-0.003
 asked-0.003
Token angry
Token ID7954
Feature activation+6.26e-02

Pos logit contributions
 said+0.032
 write+0.031
 asked+0.031
 mother+0.030
 say+0.030
Neg logit contributions
ocol-0.088
igs-0.076
 degrees-0.072
 centimeters-0.071
ips-0.070
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
 said+0.054
 asked+0.046
 say+0.040
 lost+0.038
 dad+0.034
Neg logit contributions
ocol-0.204
icer-0.174
igs-0.172
arine-0.171
 degrees-0.168
Token They
Token ID1119
Feature activation-1.04e-01

Pos logit contributions
 said+0.046
 asked+0.043
 say+0.040
 mother+0.039
 know+0.038
Neg logit contributions
ocol-0.067
igs-0.053
icer-0.052
 degrees-0.052
ipping-0.051
Token gathered
Token ID9272
Feature activation-3.98e-02

Pos logit contributions
ocol+0.038
 degrees+0.034
igs+0.034
Too+0.033
Spl+0.033
Neg logit contributions
 understand-0.015
 say-0.015
 letter-0.014
 know-0.014
 aunt-0.014
Token up
Token ID510
Feature activation+8.80e-02

Pos logit contributions
ocol+0.097
igs+0.079
ions+0.077
icer+0.076
 degrees+0.076
Neg logit contributions
 said-0.055
 asked-0.053
 say-0.051
 mother-0.049
 write-0.049
Token a
Token ID257
Feature activation+5.70e-02

Pos logit contributions
 said+0.100
 asked+0.093
 say+0.086
 mother+0.081
 lost+0.079
Neg logit contributions
ocol-0.245
igs-0.204
 degrees-0.201
icer-0.201
ipping-0.197
Token handful
Token ID10089
Feature activation-1.53e-01

Pos logit contributions
 asked+0.072
 said+0.070
 say+0.062
 know+0.061
 understand+0.061
Neg logit contributions
ocol-0.137
igs-0.114
icer-0.113
 degrees-0.113
 algae-0.111
Token of
Token ID286
Feature activation+9.50e-02

Pos logit contributions
ocol+0.466
igs+0.396
 degrees+0.391
icer+0.390
ipping+0.383
Neg logit contributions
 said-0.133
 asked-0.121
 say-0.110
 mother-0.100
 lost-0.097
Token sand
Token ID6450
Feature activation-1.40e-01

Pos logit contributions
 said+0.173
 asked+0.168
 say+0.160
 mother+0.157
 understand+0.156
Neg logit contributions
ocol-0.199
igs-0.152
ions-0.146
icer-0.145
 degrees-0.144
Token and
Token ID290
Feature activation-1.32e-03

Pos logit contributions
ocol+0.391
igs+0.327
 degrees+0.322
icer+0.321
ipping+0.315
Neg logit contributions
 said-0.156
 asked-0.146
 say-0.135
 mother-0.126
 lost-0.124
Token watched
Token ID7342
Feature activation+1.60e-03

Pos logit contributions
ocol+0.003
igs+0.003
ions+0.003
 degrees+0.003
 algae+0.003
Neg logit contributions
 understand-0.002
 said-0.002
 say-0.002
 know-0.002
 asked-0.002
Token it
Token ID340
Feature activation-4.15e-02

Pos logit contributions
 said+0.002
 asked+0.002
 say+0.002
 mother+0.002
 understand+0.002
Neg logit contributions
ocol-0.004
igs-0.004
icer-0.004
 degrees-0.004
ipping-0.003
Token slip
Token ID13819
Feature activation-1.47e-01

Pos logit contributions
ocol+0.109
igs+0.090
 degrees+0.089
icer+0.089
ipping+0.087
Neg logit contributions
 said-0.053
 asked-0.050
 say-0.047
 mother-0.044
 understand-0.043
Token away
Token ID1497
Feature activation+9.36e-03

Pos logit contributions
ocol+0.453
mm+0.383
igs+0.380
icer+0.375
 degrees+0.374
Neg logit contributions
 said-0.139
 asked-0.126
 say-0.112
 lost-0.111
 open-0.102
Token sing
Token ID1702
Feature activation+2.61e-02

Pos logit contributions
 said+0.001
 asked+0.001
 lost+0.001
 answered+0.001
 mother+0.001
Neg logit contributions
ocol-0.002
igs-0.002
 algae-0.002
 salsa-0.002
 pic-0.002
Token songs
Token ID7259
Feature activation-7.24e-03

Pos logit contributions
 said+0.041
 asked+0.039
 say+0.037
 mother+0.036
 lost+0.035
Neg logit contributions
ocol-0.059
igs-0.048
 degrees-0.047
ions-0.046
icer-0.046
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
ocol+0.015
ions+0.013
igs+0.012
ipping+0.012
ips+0.012
Neg logit contributions
 said-0.010
 asked-0.010
 know-0.010
 mother-0.010
 say-0.010
Token One
Token ID1881
Feature activation-7.73e-02

Pos logit contributions
 said+0.032
 asked+0.030
 say+0.029
 mother+0.027
 understand+0.027
Neg logit contributions
ocol-0.049
igs-0.040
 degrees-0.039
icer-0.039
ipping-0.038
Token day
Token ID1110
Feature activation-8.74e-03

Pos logit contributions
ocol+0.160
ions+0.130
igs+0.130
mm+0.128
icer+0.127
Neg logit contributions
 said-0.117
 asked-0.116
 say-0.110
 understand-0.108
 know-0.106
Token,
Token ID11
Feature activation-8.26e-02

Pos logit contributions
ocol+0.026
icer+0.023
 degrees+0.022
igs+0.022
arine+0.022
Neg logit contributions
 said-0.009
 asked-0.009
 say-0.007
 lost-0.007
 mom-0.007
Token she
Token ID673
Feature activation+8.19e-02

Pos logit contributions
ocol+0.196
igs+0.159
icer+0.155
 degrees+0.154
ions+0.154
Neg logit contributions
 said-0.119
 asked-0.113
 say-0.110
 know-0.106
 understand-0.106
Token found
Token ID1043
Feature activation-4.32e-04

Pos logit contributions
 understand+0.079
 say+0.076
 mother+0.072
 know+0.071
 smile+0.069
Neg logit contributions
ocol-0.173
igs-0.158
 algae-0.158
Spl-0.157
CR-0.156
Token a
Token ID257
Feature activation+6.18e-02

Pos logit contributions
ocol+0.001
ions+0.001
orns+0.001
igs+0.001
 algae+0.001
Neg logit contributions
 say-0.001
 smile-0.001
 asked-0.001
 said-0.001
 know-0.001
Token shiny
Token ID22441
Feature activation-3.16e-02

Pos logit contributions
 said+0.073
 asked+0.072
 know+0.067
 say+0.065
 understand+0.065
Neg logit contributions
ocol-0.150
igs-0.127
 algae-0.127
icer-0.127
arine-0.124
Token rock
Token ID3881
Feature activation+1.51e-02

Pos logit contributions
ocol+0.078
igs+0.063
icer+0.062
ipping+0.061
 degrees+0.060
Neg logit contributions
 said-0.045
 asked-0.044
 say-0.041
 understand-0.040
 know-0.040
Token with
Token ID351
Feature activation-8.18e-02

Pos logit contributions
ocol+0.010
igs+0.009
ipping+0.008
ions+0.008
 degrees+0.008
Neg logit contributions
 understand-0.003
 asked-0.003
 said-0.003
 know-0.003
 mother-0.003
Token her
Token ID607
Feature activation+4.69e-02

Pos logit contributions
ocol+0.204
ions+0.163
igs+0.152
arine+0.151
 degrees+0.149
Neg logit contributions
 asked-0.096
 said-0.093
 rang-0.089
 mother-0.086
 say-0.084
Token blanket
Token ID18447
Feature activation+3.08e-02

Pos logit contributions
 asked+0.055
 said+0.052
 rang+0.046
 understand+0.046
 say+0.046
Neg logit contributions
ocol-0.121
igs-0.099
 degrees-0.098
ions-0.097
icer-0.094
Token covering
Token ID9505
Feature activation-8.44e-02

Pos logit contributions
 asked+0.038
 know+0.037
 said+0.036
 understand+0.035
 say+0.034
Neg logit contributions
ocol-0.073
ions-0.062
igs-0.061
ipping-0.061
 degrees-0.060
Token her
Token ID607
Feature activation+4.70e-02

Pos logit contributions
ocol+0.243
igs+0.206
icer+0.205
 degrees+0.204
 centimeters+0.202
Neg logit contributions
 said-0.091
 asked-0.080
 say-0.076
 mother-0.070
 mom-0.069
Token,
Token ID11
Feature activation-8.31e-02

Pos logit contributions
 said+0.069
 asked+0.068
 say+0.063
 understand+0.062
 know+0.060
Neg logit contributions
ocol-0.110
igs-0.093
 degrees-0.092
icer-0.090
ipping-0.087
Token it
Token ID340
Feature activation-3.89e-02

Pos logit contributions
ocol+0.235
igs+0.197
icer+0.195
 degrees+0.194
ions+0.189
Neg logit contributions
 said-0.093
 asked-0.086
 say-0.080
 mother-0.074
 lost-0.073
Token quickly
Token ID2952
Feature activation+6.19e-02

Pos logit contributions
ocol+0.104
 degrees+0.091
igs+0.090
ions+0.087
ipping+0.087
Neg logit contributions
 said-0.036
 asked-0.035
 say-0.034
 know-0.033
 understand-0.033
Token ran
Token ID4966
Feature activation-4.25e-02

Pos logit contributions
 said+0.067
 asked+0.064
 say+0.059
 mother+0.059
 understand+0.056
Neg logit contributions
ocol-0.168
igs-0.141
 degrees-0.140
ipping-0.136
arine-0.134
Token away
Token ID1497
Feature activation+1.16e-02

Pos logit contributions
ocol+0.126
igs+0.107
icer+0.106
 degrees+0.105
mm+0.104
Neg logit contributions
 said-0.041
 asked-0.038
 say-0.035
 lost-0.032
 mother-0.032
Token into
Token ID656
Feature activation+1.31e-02

Pos logit contributions
 said+0.012
 asked+0.012
 say+0.011
 mother+0.010
 understand+0.010
Neg logit contributions
ocol-0.033
igs-0.027
 degrees-0.027
icer-0.027
ipping-0.026
Token they
Token ID484
Feature activation-2.41e-02

Pos logit contributions
ocol+0.014
igs+0.012
ions+0.012
ipping+0.012
arine+0.011
Neg logit contributions
 said-0.006
 asked-0.006
 know-0.006
 understand-0.006
 mother-0.006
Token put
Token ID1234
Feature activation-2.93e-02

Pos logit contributions
ocol+0.056
igs+0.049
+0.048
ions+0.047
arine+0.047
Neg logit contributions
 know-0.025
 say-0.024
 woman-0.024
 understand-0.024
 tutor-0.024
Token it
Token ID340
Feature activation-4.74e-02

Pos logit contributions
ocol+0.080
 degrees+0.067
 centimeters+0.066
igs+0.065
icer+0.065
Neg logit contributions
 said-0.038
 asked-0.034
 say-0.032
 lost-0.031
 mom-0.030
Token in
Token ID287
Feature activation+3.84e-02

Pos logit contributions
ocol+0.148
igs+0.124
 degrees+0.124
icer+0.124
mm+0.121
Neg logit contributions
 said-0.039
 asked-0.036
 say-0.031
 lost-0.029
 mother-0.029
Token a
Token ID257
Feature activation+5.03e-02

Pos logit contributions
 said+0.034
 asked+0.031
 say+0.028
 mother+0.025
 lost+0.025
Neg logit contributions
ocol-0.120
igs-0.103
 degrees-0.101
icer-0.100
ipping-0.100
Token v
Token ID410
Feature activation-2.84e-01

Pos logit contributions
 said+0.071
 asked+0.067
 say+0.064
 mother+0.060
 understand+0.060
Neg logit contributions
ocol-0.125
igs-0.102
 degrees-0.100
icer-0.100
ipping-0.098
Tokenase
Token ID589
Feature activation-1.11e-01

Pos logit contributions
ocol+0.479
 degrees+0.339
igs+0.331
icer+0.319
 centimeters+0.316
Neg logit contributions
 said-0.641
 asked-0.615
 say-0.591
 mother-0.577
 understand-0.566
Token inside
Token ID2641
Feature activation+1.10e-01

Pos logit contributions
ocol+0.273
igs+0.238
ions+0.236
 degrees+0.231
ipping+0.230
Neg logit contributions
 asked-0.130
 said-0.126
 know-0.122
 mother-0.121
 say-0.120
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
 said+0.105
 asked+0.097
 say+0.088
 mother+0.081
 lost+0.079
Neg logit contributions
ocol-0.328
igs-0.277
 degrees-0.273
icer-0.272
ipping-0.268
Token Lily
Token ID20037
Feature activation-8.13e-02

Pos logit contributions
 said+0.025
 asked+0.024
 say+0.021
 mom+0.021
 mother+0.020
Neg logit contributions
ocol-0.044
igs-0.035
icer-0.035
 degrees-0.035
orns-0.034
Token was
Token ID373
Feature activation-3.08e-02

Pos logit contributions
ocol+0.190
igs+0.180
 degrees+0.173
ipping+0.172
icer+0.171
Neg logit contributions
 say-0.081
 understand-0.076
 mother-0.075
 know-0.074
 pilot-0.072
Token the
Token ID262
Feature activation-2.50e-02

Pos logit contributions
 said+0.004
 asked+0.003
 mom+0.003
 dad+0.003
 say+0.003
Neg logit contributions
ocol-0.010
icer-0.009
igs-0.009
 degrees-0.009
ipping-0.008
Token broken
Token ID5445
Feature activation-3.08e-02

Pos logit contributions
ocol+0.051
igs+0.039
icer+0.038
 degrees+0.038
ions+0.036
Neg logit contributions
 said-0.047
 asked-0.046
 say-0.043
 know-0.042
 understand-0.042
Token glass
Token ID5405
Feature activation-1.22e-01

Pos logit contributions
ocol+0.072
igs+0.058
 degrees+0.057
icer+0.056
mm+0.056
Neg logit contributions
 said-0.049
 asked-0.047
 say-0.044
 mother-0.043
 lost-0.042
Token and
Token ID290
Feature activation-2.76e-02

Pos logit contributions
ocol+0.359
igs+0.300
 degrees+0.298
icer+0.298
mm+0.291
Neg logit contributions
 said-0.122
 asked-0.110
 say-0.101
 mother-0.093
 lost-0.092
Token the
Token ID262
Feature activation-2.42e-02

Pos logit contributions
ocol+0.074
igs+0.061
 degrees+0.061
ions+0.060
icer+0.059
Neg logit contributions
 said-0.030
 asked-0.029
 say-0.027
 mother-0.027
 understand-0.027
Token ol
Token ID25776
Feature activation-3.13e-01

Pos logit contributions
ocol+0.052
igs+0.040
icer+0.039
 degrees+0.039
arine+0.038
Neg logit contributions
 said-0.042
 asked-0.041
 say-0.038
 know-0.038
 understand-0.037
Tokenives
Token ID1083
Feature activation-1.18e-01

Pos logit contributions
ocol+0.652
icer+0.524
igs+0.511
 degrees+0.503
Too+0.495
Neg logit contributions
 said-0.586
 asked-0.528
 say-0.526
 write-0.490
 open-0.487
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
ocol+0.334
igs+0.280
 degrees+0.277
icer+0.274
ions+0.273
Neg logit contributions
 said-0.119
 asked-0.112
 say-0.103
 mother-0.098
 know-0.096
Token They
Token ID1119
Feature activation-1.23e-01

Pos logit contributions
ocol+0.037
igs+0.031
icer+0.030
ipping+0.030
 degrees+0.030
Neg logit contributions
 said-0.021
 asked-0.021
 mom-0.018
 say-0.018
 dad-0.018
Token washed
Token ID18989
Feature activation-5.83e-02

Pos logit contributions
ocol+0.220
 wra+0.205
 vinegar+0.203
 soda+0.196
mons+0.196
Neg logit contributions
 grandmother-0.148
 angel-0.147
 letter-0.145
 pilot-0.145
 tutor-0.144
Token the
Token ID262
Feature activation-2.55e-02

Pos logit contributions
ocol+0.166
igs+0.139
 degrees+0.138
ipping+0.134
mm+0.134
Neg logit contributions
 said-0.071
 asked-0.064
 say-0.060
 mom-0.059
 dad-0.056
Token Wh
Token ID854
Feature activation-2.55e-01

Pos logit contributions
ocol+0.018
igs+0.015
 degrees+0.015
icer+0.015
ipping+0.014
Neg logit contributions
 said-0.012
 asked-0.012
 say-0.011
 mother-0.010
 lost-0.010
Tokenisk
Token ID1984
Feature activation-3.76e-01

Pos logit contributions
ocol+0.372
igs+0.239
 degrees+0.230
mm+0.215
Too+0.213
Neg logit contributions
 said-0.655
 asked-0.626
 say-0.625
 lost-0.603
 aunt-0.590
Tokeners
Token ID364
Feature activation-9.90e-02

Pos logit contributions
ocol+0.687
 degrees+0.493
icer+0.488
igs+0.482
mm+0.475
Neg logit contributions
 said-0.820
 asked-0.783
 say-0.753
 lost-0.731
 understand-0.725
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
ocol+0.261
igs+0.216
 degrees+0.213
ipping+0.213
icer+0.213
Neg logit contributions
 said-0.112
 asked-0.106
 say-0.101
 know-0.098
 understand-0.096
Token Wh
Token ID854
Feature activation-2.56e-01

Pos logit contributions
 said+0.029
 asked+0.028
 say+0.026
 mom+0.025
 aunt+0.025
Neg logit contributions
ocol-0.046
igs-0.037
ipping-0.036
ions-0.036
 degrees-0.036
Tokenisk
Token ID1984
Feature activation-3.77e-01

Pos logit contributions
ocol+0.455
 degrees+0.304
igs+0.294
dogs+0.289
 rect+0.289
Neg logit contributions
 said-0.588
 lost-0.546
 asked-0.544
 say-0.537
 know-0.516
Tokeners
Token ID364
Feature activation-1.02e-01

Pos logit contributions
ocol+0.669
 degrees+0.466
icer+0.457
igs+0.452
ipping+0.444
Neg logit contributions
 said-0.848
 asked-0.809
 say-0.769
 lost-0.759
 understand-0.757
Token looked
Token ID3114
Feature activation+4.48e-02

Pos logit contributions
ocol+0.275
igs+0.239
ipping+0.236
icer+0.235
 degrees+0.233
Neg logit contributions
 say-0.076
 mother-0.073
 know-0.073
 understand-0.072
 said-0.071
Token at
Token ID379
Feature activation+3.62e-02

Pos logit contributions
 said+0.054
 asked+0.053
 say+0.049
 write+0.048
 mother+0.048
Neg logit contributions
ocol-0.116
igs-0.096
icer-0.093
 degrees-0.093
ipping-0.093
Token Spot
Token ID15899
Feature activation-5.17e-02

Pos logit contributions
 said+0.054
 asked+0.052
 say+0.049
 mother+0.047
 understand+0.046
Neg logit contributions
ocol-0.087
igs-0.071
 degrees-0.069
icer-0.069
ipping-0.068
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
ocol+0.167
igs+0.141
 degrees+0.139
icer+0.138
ions+0.137
Neg logit contributions
 said-0.041
 asked-0.036
 say-0.032
 lost-0.029
 mother-0.028
Tokeners
Token ID364
Feature activation-7.86e-02

Pos logit contributions
ocol+0.545
igs+0.364
 degrees+0.359
icer+0.351
ipping+0.344
Neg logit contributions
 said-0.910
 asked-0.880
 say-0.841
 lost-0.826
 know-0.821
Token.
Token ID13
Feature activation+5.92e-02

Pos logit contributions
ocol+0.194
icer+0.166
mm+0.164
igs+0.164
 degrees+0.161
Neg logit contributions
 said-0.085
 say-0.081
 asked-0.080
 mother-0.078
 know-0.075
Token Mr
Token ID1770
Feature activation-3.24e-02

Pos logit contributions
 said+0.073
 asked+0.070
 say+0.065
 grand+0.063
 mom+0.063
Neg logit contributions
ocol-0.165
 degrees-0.138
igs-0.138
ipping-0.136
 centimeters-0.135
Token.
Token ID13
Feature activation+6.37e-02

Pos logit contributions
ocol+0.092
igs+0.078
icer+0.077
 degrees+0.077
orns+0.075
Neg logit contributions
 said-0.032
 asked-0.031
 say-0.028
 mother-0.026
 understand-0.026
Token Wh
Token ID854
Feature activation-2.36e-01

Pos logit contributions
 asked+0.111
 said+0.110
 lost+0.105
 mom+0.105
 grand+0.104
Neg logit contributions
ocol-0.139
igs-0.112
 degrees-0.112
 pleasures-0.109
Too-0.108
Tokenisk
Token ID1984
Feature activation-3.69e-01

Pos logit contributions
ocol+0.322
igs+0.190
 degrees+0.186
ions+0.179
ipping+0.171
Neg logit contributions
 said-0.633
 asked-0.604
 lost-0.594
 say-0.580
 know-0.568
Tokeners
Token ID364
Feature activation-8.28e-02

Pos logit contributions
ocol+0.504
igs+0.329
 degrees+0.320
icer+0.316
ipping+0.304
Neg logit contributions
 said-0.943
 asked-0.915
 say-0.883
 lost-0.860
 understand-0.858
Token loved
Token ID6151
Feature activation+9.35e-02

Pos logit contributions
ocol+0.178
mm+0.167
icer+0.166
igs+0.161
Too+0.160
Neg logit contributions
 mother-0.087
 say-0.083
 father-0.082
 woman-0.080
 grandmother-0.079
Token to
Token ID284
Feature activation+8.18e-02

Pos logit contributions
 said+0.041
 asked+0.037
 say+0.025
 lost+0.023
 mother+0.020
Neg logit contributions
ocol-0.332
icer-0.288
ipping-0.284
igs-0.284
mm-0.284
Token study
Token ID2050
Feature activation-6.34e-02

Pos logit contributions
 said+0.088
 asked+0.081
 mother+0.072
 lost+0.071
 say+0.070
Neg logit contributions
ocol-0.227
igs-0.193
 degrees-0.185
arine-0.183
ions-0.182
Token.
Token ID13
Feature activation+6.37e-02

Pos logit contributions
ocol+0.180
icer+0.149
igs+0.148
ipping+0.147
arine+0.146
Neg logit contributions
 said-0.071
 asked-0.066
 say-0.059
 grand-0.056
 lost-0.056
Token Lily
Token ID20037
Feature activation-8.50e-02

Pos logit contributions
ocol+0.211
igs+0.172
 degrees+0.169
icer+0.167
ions+0.167
Neg logit contributions
 said-0.121
 asked-0.115
 say-0.110
 understand-0.105
 know-0.104
Token's
Token ID338
Feature activation+2.09e-02

Pos logit contributions
ocol+0.190
 degrees+0.180
icer+0.179
ipping+0.178
igs+0.172
Neg logit contributions
 understand-0.081
 know-0.079
 say-0.078
 mother-0.069
 wish-0.068
Token mom
Token ID1995
Feature activation+1.81e-01

Pos logit contributions
 said+0.018
 asked+0.017
 know+0.016
 say+0.016
 understand+0.015
Neg logit contributions
ocol-0.059
icer-0.051
igs-0.050
ipping-0.049
 degrees-0.049
Token saw
Token ID2497
Feature activation+4.77e-02

Pos logit contributions
 wish+0.139
 understand+0.138
 spirit+0.137
 know+0.137
 name+0.122
Neg logit contributions
ocol-0.449
ipping-0.412
 degrees-0.403
icer-0.399
-0.386
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
 said+0.050
 asked+0.045
 say+0.041
 lost+0.038
 mother+0.038
Neg logit contributions
ocol-0.140
igs-0.119
 degrees-0.117
icer-0.117
arine-0.115
Token v
Token ID410
Feature activation-2.87e-01

Pos logit contributions
ocol+0.040
icer+0.033
igs+0.031
 degrees+0.031
 centimeters+0.030
Neg logit contributions
 said-0.025
 asked-0.025
 know-0.023
 understand-0.022
 say-0.022
Tokenase
Token ID589
Feature activation-1.14e-01

Pos logit contributions
ocol+0.499
 degrees+0.363
igs+0.357
 algae+0.327
arine+0.326
Neg logit contributions
 said-0.628
 asked-0.599
 say-0.580
 mother-0.561
 lost-0.559
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
ocol+0.281
ipping+0.239
igs+0.238
 degrees+0.237
ions+0.233
Neg logit contributions
 asked-0.134
 said-0.133
 know-0.125
 mother-0.123
 say-0.122
Token asked
Token ID1965
Feature activation+2.06e-01

Pos logit contributions
ocol+0.032
igs+0.028
 degrees+0.028
ions+0.027
Too+0.027
Neg logit contributions
 understand-0.008
 say-0.007
 wish-0.007
 mother-0.007
 name-0.007
Token where
Token ID810
Feature activation-3.89e-02

Pos logit contributions
 said+0.213
 asked+0.201
 say+0.180
 mom+0.175
 dad+0.171
Neg logit contributions
ocol-0.610
icer-0.534
igs-0.533
ipping-0.514
 degrees-0.512
Token she
Token ID673
Feature activation+7.04e-02

Pos logit contributions
ocol+0.074
ions+0.058
mm+0.056
 degrees+0.056
igs+0.055
Neg logit contributions
 said-0.060
 understand-0.059
 say-0.059
 asked-0.059
 mother-0.059