TOP ACTIVATIONS
MAX = 9.366

They said , " L adies and gentlemen , welcome to
. He says " L adies and gentlemen , boys and
here , â said Earth Man . Lu
not toys . They said lasers were dangerous .
lasers anymore . They said lasers were not toys . They
that read , " Fore ver Friends ".
of the planets is called Earth . That 's our planet
on a modest planet called Earth ." Tim my didn 't
live on a planet called Earth . It is blue and
, and everyone called it Earth . One
why , and she said trucks were dangerous .
there was a planet called Earth . Many animals lived on
have swords and shields and helmets . We have a dragon
smiles . She says the bikes are ready . She says
a giant blue planet called Earth . Everyone on Earth was
bikes as rockets and our helmets as space suits ."
and rockets . They find helmets and scar ves and gloves
bike . She gives them helmets and tells them to be
always wore their gear : helmets , pads and gloves .
was a happy planet called Earth . The planet had lots

MIN ACTIVATIONS
MIN = -10.484

crown and playing in the freshly cut grass . It looked
to play and explore the freshly organized lawn . <|endoftext|> Once
he saw a basket of freshly baked bread . It smelled
very special toy for L ott ie . It was a
the sweet aroma of a freshly baked cake .
he could do in the o asis . He could have
it away in a wheel bar row . Once
?" D ott ie per ked up and
One day , L ott ie 's mom went to
dolls around in a wheel bar row she had found in
of fun watching the ac rob ats and clown s .
. When D ott ie arrived at the lab
she was inside , D ott ie saw lots of colorful
kitchen and made a fresh batch of sa us ages .
house and made a fresh batch of blue berry muff ins
mom would bake a fresh batch of cookies for her to
of soil into the wheel bar row . It was heavy
the church and noticed a freshly dug hole in the ground
a big box of ch ocol ates ! <|endoftext|> Once upon
help her make a new batch of dough . Max was

INTERVAL 4.403 - 9.366
CONTAINS 0.360%

the other kids . Come in ." Anna and
, " By e - bye !" and ran off .
bye and went on their way . Bobby and Daisy were
flowers . He knew the grass was safe and harmless .
known for his strength , courage and success . <|endoftext|> Once

INTERVAL -0.559 - 4.403
CONTAINS 68.031%

there was a dog named Spot . Spot loved to sail
to rest when it was tired so it could have the
. " Hi , I am Tip ," said
tried and tried . He tried to make something with the
dog said , " Please don 't take me . I

INTERVAL -5.522 - -0.559
CONTAINS 31.570%

's dad had taught him what to do in case of
had to wait until after dinner . Tim my
it . She felt a sharp shock ! Ethan quickly looked
Ben loved to play in the forest with his friends .
he should go anyway and make new friends . Tim my

INTERVAL -10.484 - -5.522
CONTAINS 0.039%

They picked out some l oll ip ops and headed to
going to fly in an aer opl ane . She jumped
send the letter in a few days , but then time
. So , the l oll ipop decided to find some
street , he began to sne e ze .
Token They
Token ID1119
Feature activation+2.16e-01

Pos logit contributions
 ho+0.001
 distance+0.001
 direction+0.001
 flap+0.001
arse+0.001
Neg logit contributions
affles-0.003
affle-0.002
atmeal-0.002
 breakfast-0.002
ic-0.002
Token said
Token ID531
Feature activation-4.13e-01

Pos logit contributions
packs+0.005
 leads+0.005
 ho+0.005
 distance+0.005
 steering+0.005
Neg logit contributions
affles-0.010
affle-0.009
ic-0.008
atmeal-0.008
ilt-0.008
Token,
Token ID11
Feature activation+1.84e+00

Pos logit contributions
affles+0.020
atmeal+0.019
affle+0.018
ic+0.018
ilt+0.017
Neg logit contributions
 leads-0.006
 ho-0.006
packs-0.006
 slows-0.006
 crashing-0.006
Token "
Token ID366
Feature activation+9.44e-01

Pos logit contributions
 direction+0.043
packs+0.043
 distance+0.042
 leads+0.040
 ho+0.039
Neg logit contributions
affles-0.074
atmeal-0.064
affle-0.064
ic-0.060
ils-0.059
TokenL
Token ID43
Feature activation-5.43e-01

Pos logit contributions
packs+0.025
 distance+0.025
 direction+0.025
 leads+0.025
 ho+0.024
Neg logit contributions
affles-0.036
atmeal-0.031
affle-0.031
ic-0.029
ils-0.028
Tokenadies
Token ID50192
Feature activation+9.37e+00

Pos logit contributions
affles+0.012
atmeal+0.010
affle+0.009
ic+0.009
ils+0.008
Neg logit contributions
packs-0.022
 leads-0.022
 pays-0.022
 ho-0.021
 slows-0.021
Token and
Token ID290
Feature activation+3.39e+00

Pos logit contributions
 steep+0.273
 distance+0.269
arse+0.269
 slows+0.269
 flap+0.266
Neg logit contributions
affles-0.285
 breakfast-0.246
ic-0.241
atmeal-0.234
ils-0.233
Token gentlemen
Token ID28527
Feature activation+4.43e+00

Pos logit contributions
packs+0.054
 ho+0.053
 leads+0.053
 distance+0.051
 direction+0.051
Neg logit contributions
affles-0.166
atmeal-0.150
affle-0.149
ic-0.144
ilt-0.137
Token,
Token ID11
Feature activation+1.66e+00

Pos logit contributions
 distance+0.096
 direction+0.091
 crashing+0.085
 slows+0.084
 flap+0.083
Neg logit contributions
affles-0.187
atmeal-0.168
affle-0.163
ic-0.159
ils-0.157
Token welcome
Token ID7062
Feature activation+4.20e+00

Pos logit contributions
packs+0.046
 direction+0.046
 distance+0.046
 leads+0.046
 ho+0.045
Neg logit contributions
affles-0.059
atmeal-0.050
affle-0.050
ic-0.047
ils-0.046
Token to
Token ID284
Feature activation-5.17e-01

Pos logit contributions
 distance+0.113
 direction+0.111
 slows+0.110
 leads+0.108
 crashing+0.107
Neg logit contributions
affles-0.153
atmeal-0.131
affle-0.129
ic-0.126
ils-0.124
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
 ho+0.023
packs+0.023
arse+0.023
 direction+0.023
 slows+0.022
Neg logit contributions
affles-0.025
atmeal-0.022
ic-0.022
affle-0.021
ils-0.020
Token He
Token ID679
Feature activation-9.03e-02

Pos logit contributions
packs+0.003
 leads+0.003
 direction+0.003
 distance+0.003
 ho+0.003
Neg logit contributions
affles-0.008
atmeal-0.007
affle-0.007
ic-0.007
ils-0.007
Token says
Token ID1139
Feature activation-2.92e-01

Pos logit contributions
affles+0.004
affle+0.003
ils+0.003
atmeal+0.003
making+0.003
Neg logit contributions
 distance-0.002
arse-0.002
 crashing-0.002
 direction-0.002
packs-0.002
Token "
Token ID366
Feature activation+1.85e+00

Pos logit contributions
affles+0.013
atmeal+0.012
affle+0.012
ic+0.011
ils+0.011
Neg logit contributions
packs-0.005
 direction-0.005
 distance-0.005
 leads-0.005
 ho-0.005
TokenL
Token ID43
Feature activation-2.15e-02

Pos logit contributions
 distance+0.040
packs+0.039
 leads+0.039
 direction+0.038
 ho+0.038
Neg logit contributions
affles-0.081
affle-0.071
atmeal-0.070
ic-0.066
ils-0.065
Tokenadies
Token ID50192
Feature activation+9.14e+00

Pos logit contributions
affles+0.001
atmeal+0.000
affle+0.000
ic+0.000
ils+0.000
Neg logit contributions
packs-0.001
 leads-0.001
 ho-0.001
 pays-0.001
 direction-0.001
Token and
Token ID290
Feature activation+3.30e+00

Pos logit contributions
arse+0.272
 steep+0.269
 slows+0.269
 distance+0.264
 flap+0.262
Neg logit contributions
affles-0.262
 breakfast-0.225
ic-0.225
affle-0.212
ils-0.211
Token gentlemen
Token ID28527
Feature activation+4.37e+00

Pos logit contributions
packs+0.061
 direction+0.060
 distance+0.059
 leads+0.059
 ho+0.058
Neg logit contributions
affles-0.148
atmeal-0.131
affle-0.130
ic-0.125
ils-0.123
Token,
Token ID11
Feature activation+1.45e+00

Pos logit contributions
 distance+0.099
 direction+0.095
 crashing+0.087
packs+0.086
 slows+0.086
Neg logit contributions
affles-0.181
atmeal-0.160
affle-0.158
ic-0.152
ils-0.151
Token boys
Token ID6510
Feature activation+3.47e+00

Pos logit contributions
packs+0.040
 direction+0.040
 distance+0.040
 leads+0.039
 ho+0.039
Neg logit contributions
affles-0.052
atmeal-0.045
affle-0.044
ic-0.042
ils-0.041
Token and
Token ID290
Feature activation+1.82e+00

Pos logit contributions
 direction+0.080
 distance+0.080
packs+0.079
 leads+0.076
 slows+0.075
Neg logit contributions
affles-0.139
atmeal-0.122
affle-0.121
ic-0.117
ils-0.115
Token here
Token ID994
Feature activation+1.57e+00

Pos logit contributions
packs+0.045
 direction+0.044
 leads+0.044
 distance+0.044
 ho+0.044
Neg logit contributions
affles-0.079
atmeal-0.069
affle-0.068
ic-0.065
ils-0.063
Token,
Token ID11
Feature activation-1.26e+00

Pos logit contributions
packs+0.038
 direction+0.037
 distance+0.037
 leads+0.037
 ho+0.037
Neg logit contributions
affles-0.062
atmeal-0.054
affle-0.054
ic-0.051
ils-0.050
Tokenâ
Token ID22940
Feature activation-2.36e+00

Pos logit contributions
affles+0.047
atmeal+0.041
affle+0.040
ic+0.039
ils+0.038
Neg logit contributions
packs-0.032
 leads-0.029
 direction-0.029
 ho-0.029
 distance-0.028
Token
Token ID26391
Feature activation-2.92e+00

Pos logit contributions
affles+0.090
atmeal+0.076
affle+0.074
ils+0.071
ic+0.071
Neg logit contributions
 pays-0.062
 distance-0.061
 direction-0.060
 leads-0.059
 slows-0.059
Token said
Token ID531
Feature activation+1.93e-01

Pos logit contributions
affles+0.108
atmeal+0.100
ic+0.092
affle+0.090
ils+0.087
Neg logit contributions
 flap-0.072
 direction-0.071
 distance-0.071
 leads-0.071
 steering-0.070
Token Earth
Token ID3668
Feature activation+9.06e+00

Pos logit contributions
 ho+0.003
 leads+0.003
packs+0.003
 direction+0.003
 distance+0.003
Neg logit contributions
affles-0.009
atmeal-0.008
affle-0.008
ic-0.008
ils-0.007
Token Man
Token ID1869
Feature activation+2.67e+00

Pos logit contributions
packs+0.206
 leads+0.204
 direction+0.203
 distance+0.201
 ho+0.201
Neg logit contributions
affles-0.371
atmeal-0.325
affle-0.321
ic-0.308
ils-0.299
Token.
Token ID13
Feature activation+5.54e-01

Pos logit contributions
packs+0.062
 direction+0.061
 leads+0.061
 distance+0.061
 ho+0.060
Neg logit contributions
affles-0.108
atmeal-0.094
affle-0.093
ic-0.090
ils-0.087
Token
Token ID198
Feature activation+9.80e-01

Pos logit contributions
packs+0.012
 ho+0.012
 flap+0.012
 leads+0.012
 distance+0.012
Neg logit contributions
affles-0.022
affle-0.020
atmeal-0.020
ic-0.019
 breakfast-0.018
Token
Token ID198
Feature activation+9.67e-01

Pos logit contributions
 distance+0.012
 direction+0.012
packs+0.012
 leads+0.011
 slows+0.011
Neg logit contributions
affles-0.050
atmeal-0.047
affle-0.045
ils-0.044
ic-0.043
TokenLu
Token ID25596
Feature activation+1.31e+00

Pos logit contributions
 distance+0.024
 Have+0.023
 crashing+0.022
 tow+0.022
 direction+0.022
Neg logit contributions
affles-0.039
atmeal-0.035
affle-0.034
ils-0.032
ic-0.031
Token not
Token ID407
Feature activation+3.13e+00

Pos logit contributions
packs+0.082
 distance+0.081
 direction+0.081
 leads+0.080
 ho+0.079
Neg logit contributions
affles-0.093
affle-0.078
atmeal-0.078
ic-0.073
ils-0.072
Token toys
Token ID14958
Feature activation+4.42e+00

Pos logit contributions
 distance+0.092
 direction+0.091
packs+0.091
 leads+0.090
 slows+0.089
Neg logit contributions
affles-0.107
affle-0.090
atmeal-0.089
ils-0.085
ic-0.084
Token.
Token ID13
Feature activation+7.05e-01

Pos logit contributions
 direction+0.112
packs+0.110
 distance+0.109
 slows+0.107
 ho+0.106
Neg logit contributions
affles-0.163
atmeal-0.143
affle-0.140
ils-0.138
ic-0.135
Token They
Token ID1119
Feature activation+7.65e-01

Pos logit contributions
 ho+0.013
 distance+0.013
packs+0.013
 steering+0.013
 flap+0.013
Neg logit contributions
affles-0.031
affle-0.027
atmeal-0.027
ic-0.026
ils-0.025
Token said
Token ID531
Feature activation-1.28e-01

Pos logit contributions
 leads+0.018
 ho+0.017
 distance+0.017
 flap+0.017
 direction+0.017
Neg logit contributions
affles-0.032
atmeal-0.029
affle-0.028
ic-0.028
at-0.025
Token lasers
Token ID34628
Feature activation+8.99e+00

Pos logit contributions
affles+0.006
atmeal+0.006
affle+0.006
ic+0.005
at+0.005
Neg logit contributions
 leads-0.002
 crashing-0.002
 ho-0.002
 slows-0.002
 pays-0.002
Token were
Token ID547
Feature activation+2.82e+00

Pos logit contributions
 distance+0.214
 direction+0.212
packs+0.211
 ho+0.208
 steering+0.200
Neg logit contributions
affles-0.351
atmeal-0.305
affle-0.303
ic-0.295
ils-0.294
Token dangerous
Token ID4923
Feature activation+7.49e-01

Pos logit contributions
packs+0.085
 distance+0.084
 direction+0.084
 leads+0.083
 ho+0.081
Neg logit contributions
affles-0.096
affle-0.081
atmeal-0.080
ils-0.075
ic-0.074
Token.
Token ID13
Feature activation+7.79e-01

Pos logit contributions
 distance+0.016
packs+0.016
 direction+0.016
 ho+0.016
 leads+0.015
Neg logit contributions
affles-0.031
atmeal-0.027
affle-0.027
ic-0.026
ils-0.026
Token
Token ID198
Feature activation+8.38e-01

Pos logit contributions
 ho+0.016
 distance+0.016
packs+0.016
 steering+0.015
 flap+0.015
Neg logit contributions
affles-0.033
affle-0.029
atmeal-0.028
ic-0.027
ott-0.026
Token
Token ID198
Feature activation+8.77e-01

Pos logit contributions
packs+0.015
 leads+0.014
 direction+0.014
 ho+0.014
 distance+0.014
Neg logit contributions
affles-0.039
atmeal-0.034
affle-0.034
ic-0.033
ils-0.032
Token lasers
Token ID34628
Feature activation+5.63e+00

Pos logit contributions
affles+0.008
atmeal+0.007
affle+0.007
ic+0.007
at+0.006
Neg logit contributions
packs-0.007
 leads-0.007
 ho-0.007
 direction-0.007
 pays-0.007
Token anymore
Token ID7471
Feature activation-8.78e-01

Pos logit contributions
packs+0.143
 direction+0.142
 leads+0.141
 distance+0.141
 ho+0.140
Neg logit contributions
affles-0.215
atmeal-0.186
affle-0.184
ic-0.176
ils-0.171
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
affles+0.039
atmeal+0.035
affle+0.035
ic+0.033
ils+0.032
Neg logit contributions
packs-0.017
 leads-0.017
 ho-0.017
 direction-0.016
 distance-0.016
Token They
Token ID1119
Feature activation+5.20e-01

Pos logit contributions
 steering+0.004
 ho+0.004
 flap+0.004
 distance+0.004
packs+0.004
Neg logit contributions
affles-0.008
affle-0.007
atmeal-0.007
ott-0.007
ic-0.007
Token said
Token ID531
Feature activation-1.28e-01

Pos logit contributions
 leads+0.013
 ho+0.012
 distance+0.012
 flap+0.012
 direction+0.012
Neg logit contributions
affles-0.021
atmeal-0.019
affle-0.018
ic-0.018
ilt-0.017
Token lasers
Token ID34628
Feature activation+8.87e+00

Pos logit contributions
affles+0.006
atmeal+0.006
affle+0.006
ic+0.005
ilt+0.005
Neg logit contributions
 leads-0.002
 ho-0.002
 crashing-0.002
 slows-0.002
 pays-0.002
Token were
Token ID547
Feature activation+2.73e+00

Pos logit contributions
 distance+0.213
 direction+0.210
packs+0.207
 ho+0.204
 steering+0.199
Neg logit contributions
affles-0.343
atmeal-0.297
affle-0.295
ils-0.290
ic-0.289
Token not
Token ID407
Feature activation+3.13e+00

Pos logit contributions
packs+0.082
 distance+0.081
 direction+0.081
 leads+0.080
 ho+0.079
Neg logit contributions
affles-0.093
affle-0.078
atmeal-0.078
ic-0.073
ils-0.072
Token toys
Token ID14958
Feature activation+4.42e+00

Pos logit contributions
 distance+0.092
 direction+0.091
packs+0.091
 leads+0.090
 slows+0.089
Neg logit contributions
affles-0.107
affle-0.090
atmeal-0.089
ils-0.085
ic-0.084
Token.
Token ID13
Feature activation+7.05e-01

Pos logit contributions
 direction+0.112
packs+0.110
 distance+0.109
 slows+0.107
 ho+0.106
Neg logit contributions
affles-0.163
atmeal-0.143
affle-0.140
ils-0.138
ic-0.135
Token They
Token ID1119
Feature activation+7.65e-01

Pos logit contributions
 ho+0.013
 distance+0.013
packs+0.013
 steering+0.013
 flap+0.013
Neg logit contributions
affles-0.031
affle-0.027
atmeal-0.027
ic-0.026
ils-0.025
Token that
Token ID326
Feature activation-3.81e+00

Pos logit contributions
affles+0.085
atmeal+0.075
affle+0.074
ic+0.070
ils+0.069
Neg logit contributions
packs-0.038
 direction-0.037
 distance-0.037
 ho-0.037
 leads-0.037
Token read
Token ID1100
Feature activation-1.19e+00

Pos logit contributions
affles+0.153
affle+0.133
atmeal+0.130
ic+0.122
ils+0.116
Neg logit contributions
 leads-0.098
 direction-0.096
 distance-0.096
packs-0.094
 ho-0.093
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
affles+0.049
atmeal+0.043
affle+0.043
ic+0.041
ils+0.039
Neg logit contributions
packs-0.028
 leads-0.027
 direction-0.027
 distance-0.027
 ho-0.027
Token "
Token ID366
Feature activation-1.44e+00

Pos logit contributions
affles+0.052
atmeal+0.046
affle+0.045
ic+0.043
ils+0.042
Neg logit contributions
packs-0.028
 direction-0.027
 distance-0.027
 leads-0.027
 ho-0.027
TokenFore
Token ID16351
Feature activation+4.49e+00

Pos logit contributions
affles+0.058
affle+0.051
atmeal+0.050
ils+0.046
ic+0.045
Neg logit contributions
packs-0.036
 distance-0.036
 ho-0.035
 direction-0.034
 steering-0.034
Tokenver
Token ID332
Feature activation+8.61e+00

Pos logit contributions
 distance+0.116
 direction+0.114
packs+0.112
 leads+0.110
 slows+0.109
Neg logit contributions
affles-0.170
atmeal-0.149
affle-0.147
ils-0.140
ic-0.139
Token Friends
Token ID14213
Feature activation+2.10e+00

Pos logit contributions
 distance+0.230
packs+0.228
 leads+0.226
 slows+0.226
 direction+0.225
Neg logit contributions
affles-0.299
ic-0.258
atmeal-0.256
affle-0.252
ils-0.251
Token".
Token ID1911
Feature activation-1.15e+00

Pos logit contributions
 ho+0.049
 leads+0.048
 flap+0.047
 direction+0.047
 distance+0.047
Neg logit contributions
affles-0.087
atmeal-0.078
affle-0.075
ic-0.072
making-0.067
Token
Token ID220
Feature activation-1.81e+00

Pos logit contributions
affles+0.052
atmeal+0.046
affle+0.046
ic+0.044
ils+0.043
Neg logit contributions
packs-0.021
 leads-0.021
 direction-0.021
 ho-0.020
 distance-0.020
Token
Token ID198
Feature activation+6.01e-01

Pos logit contributions
affles+0.079
atmeal+0.069
affle+0.069
ic+0.067
ils+0.065
Neg logit contributions
 distance-0.036
 ho-0.035
 direction-0.035
 leads-0.034
packs-0.034
Token
Token ID198
Feature activation+7.23e-01

Pos logit contributions
 distance+0.008
packs+0.008
 direction+0.008
 ho+0.007
 leads+0.007
Neg logit contributions
affles-0.031
atmeal-0.028
affle-0.028
ils-0.026
ic-0.026
Token of
Token ID286
Feature activation+6.65e-01

Pos logit contributions
packs+0.042
 ho+0.042
 crashing+0.041
 steering+0.041
 distance+0.041
Neg logit contributions
affles-0.061
atmeal-0.053
affle-0.052
ic-0.050
ils-0.050
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
 leads+0.020
 direction+0.020
 ho+0.019
packs+0.019
 pays+0.019
Neg logit contributions
affles-0.023
atmeal-0.020
affle-0.020
ic-0.019
ils-0.017
Token planets
Token ID14705
Feature activation+4.80e+00

Pos logit contributions
packs+0.033
 slows+0.032
 crashing+0.031
 leads+0.030
 steering+0.030
Neg logit contributions
affles-0.033
affle-0.028
ils-0.027
atmeal-0.027
ic-0.026
Token is
Token ID318
Feature activation+1.07e+00

Pos logit contributions
 direction+0.098
packs+0.098
 distance+0.097
 ho+0.095
 leads+0.094
Neg logit contributions
affles-0.204
atmeal-0.181
affle-0.178
ic-0.172
ils-0.171
Token called
Token ID1444
Feature activation+4.33e+00

Pos logit contributions
packs+0.030
 leads+0.030
 distance+0.030
 slows+0.030
 direction+0.029
Neg logit contributions
affles-0.036
affle-0.030
atmeal-0.030
ils-0.029
ic-0.028
Token Earth
Token ID3668
Feature activation+8.61e+00

Pos logit contributions
 distance+0.103
packs+0.103
 direction+0.101
 leads+0.098
 slows+0.097
Neg logit contributions
affles-0.169
affle-0.146
atmeal-0.143
ic-0.139
ils-0.139
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
 distance+0.202
 direction+0.199
packs+0.198
 slows+0.193
 leads+0.193
Neg logit contributions
affles-0.339
atmeal-0.295
affle-0.293
ic-0.284
ils-0.281
Token That
Token ID1320
Feature activation-2.14e+00

Pos logit contributions
packs+0.039
 distance+0.037
 direction+0.036
 ho+0.035
 leads+0.034
Neg logit contributions
affles-0.057
affle-0.049
atmeal-0.048
ic-0.046
umbers-0.045
Token's
Token ID338
Feature activation+1.46e-02

Pos logit contributions
affles+0.076
atmeal+0.067
ils+0.066
ic+0.065
affle+0.065
Neg logit contributions
packs-0.055
 crashing-0.053
 steering-0.052
 ho-0.052
 distance-0.051
Token our
Token ID674
Feature activation-1.31e-01

Pos logit contributions
packs+0.000
 distance+0.000
 slows+0.000
 leads+0.000
 direction+0.000
Neg logit contributions
affles-0.000
affle-0.000
atmeal-0.000
ils-0.000
ic-0.000
Token planet
Token ID5440
Feature activation+4.08e+00

Pos logit contributions
affles+0.004
affle+0.004
ils+0.003
 breakfast+0.003
atmeal+0.003
Neg logit contributions
packs-0.004
 slows-0.004
 crashing-0.004
arse-0.004
 distance-0.004
Token on
Token ID319
Feature activation+1.73e-01

Pos logit contributions
packs+0.031
 direction+0.031
 leads+0.031
 distance+0.031
 ho+0.030
Neg logit contributions
affles-0.044
atmeal-0.038
affle-0.038
ic-0.036
ils-0.035
Token a
Token ID257
Feature activation-1.23e+00

Pos logit contributions
packs+0.005
 direction+0.005
 leads+0.005
 distance+0.005
 ho+0.005
Neg logit contributions
affles-0.006
atmeal-0.005
affle-0.005
ic-0.005
ils-0.005
Token modest
Token ID12949
Feature activation-2.36e+00

Pos logit contributions
affles+0.046
atmeal+0.040
affle+0.040
ic+0.038
ils+0.037
Neg logit contributions
packs-0.032
 leads-0.031
 pays-0.030
 direction-0.030
 ho-0.030
Token planet
Token ID5440
Feature activation+1.63e+00

Pos logit contributions
affles+0.083
atmeal+0.073
affle+0.073
ic+0.071
ilt+0.068
Neg logit contributions
packs-0.062
 leads-0.062
 pays-0.060
 slows-0.060
 lets-0.059
Token called
Token ID1444
Feature activation+3.58e+00

Pos logit contributions
packs+0.041
 direction+0.040
 leads+0.040
 distance+0.040
 ho+0.040
Neg logit contributions
affles-0.062
atmeal-0.054
affle-0.053
ic-0.051
ils-0.050
Token Earth
Token ID3668
Feature activation+8.28e+00

Pos logit contributions
 distance+0.096
 direction+0.090
packs+0.090
stretched+0.088
 horror+0.086
Neg logit contributions
affles-0.121
affle-0.103
ils-0.102
atmeal-0.101
ic-0.099
Token."
Token ID526
Feature activation-1.37e+00

Pos logit contributions
 distance+0.212
 direction+0.211
 leads+0.207
packs+0.206
 slows+0.206
Neg logit contributions
affles-0.305
atmeal-0.264
affle-0.262
ic-0.253
ils-0.250
Token Tim
Token ID5045
Feature activation-7.29e-01

Pos logit contributions
affles+0.057
atmeal+0.050
affle+0.050
ic+0.048
ils+0.046
Neg logit contributions
packs-0.030
 ho-0.030
 direction-0.030
 distance-0.030
 leads-0.030
Tokenmy
Token ID1820
Feature activation-9.33e-01

Pos logit contributions
affles+0.029
atmeal+0.026
affle+0.026
ic+0.025
ils+0.024
Neg logit contributions
packs-0.017
 leads-0.017
 distance-0.017
 direction-0.016
 crashing-0.016
Token didn
Token ID1422
Feature activation-4.61e-01

Pos logit contributions
affles+0.037
atmeal+0.032
affle+0.032
ic+0.030
ils+0.030
Neg logit contributions
packs-0.023
 leads-0.022
 direction-0.022
 ho-0.022
 distance-0.022
Token't
Token ID470
Feature activation+8.45e-01

Pos logit contributions
affles+0.016
atmeal+0.014
affle+0.014
ils+0.013
ic+0.013
Neg logit contributions
 leads-0.013
 slows-0.013
packs-0.013
 crashing-0.013
 ho-0.013
Token live
Token ID2107
Feature activation+5.14e-02

Pos logit contributions
 flap+0.009
 leads+0.009
 ho+0.008
packs+0.008
 distance+0.008
Neg logit contributions
affles-0.034
atmeal-0.031
ic-0.031
affle-0.031
illy-0.028
Token on
Token ID319
Feature activation+6.09e-01

Pos logit contributions
packs+0.001
 direction+0.001
 leads+0.001
 distance+0.001
 ho+0.001
Neg logit contributions
affles-0.002
atmeal-0.002
affle-0.002
ic-0.001
ils-0.001
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
packs+0.016
 ho+0.016
 direction+0.016
 distance+0.016
 leads+0.015
Neg logit contributions
affles-0.024
atmeal-0.021
affle-0.021
ic-0.020
ils-0.019
Token planet
Token ID5440
Feature activation+1.37e+00

Pos logit contributions
affles+0.047
atmeal+0.041
affle+0.040
ic+0.039
ils+0.038
Neg logit contributions
packs-0.030
 leads-0.030
 direction-0.029
 pays-0.029
 ho-0.029
Token called
Token ID1444
Feature activation+3.84e+00

Pos logit contributions
packs+0.029
 direction+0.028
 leads+0.028
 distance+0.028
 ho+0.028
Neg logit contributions
affles-0.058
atmeal-0.051
affle-0.051
ic-0.049
ils-0.048
Token Earth
Token ID3668
Feature activation+8.19e+00

Pos logit contributions
 distance+0.101
 direction+0.098
packs+0.096
 slows+0.094
 leads+0.094
Neg logit contributions
affles-0.140
affle-0.118
atmeal-0.117
ils-0.115
ic-0.111
Token.
Token ID13
Feature activation+8.37e-01

Pos logit contributions
 distance+0.188
 direction+0.186
 leads+0.183
packs+0.182
 crashing+0.182
Neg logit contributions
affles-0.325
atmeal-0.281
affle-0.281
ic-0.271
ils-0.269
Token It
Token ID632
Feature activation-7.34e-01

Pos logit contributions
 distance+0.021
 direction+0.020
packs+0.019
arse+0.019
 way+0.019
Neg logit contributions
affles-0.031
affle-0.026
atmeal-0.026
 breakfast-0.025
chini-0.024
Token is
Token ID318
Feature activation+6.25e-01

Pos logit contributions
affles+0.032
atmeal+0.028
affle+0.028
ic+0.027
ils+0.027
Neg logit contributions
packs-0.014
 distance-0.014
 direction-0.014
 ho-0.014
 leads-0.014
Token blue
Token ID4171
Feature activation+1.20e+00

Pos logit contributions
packs+0.019
 distance+0.019
arse+0.019
 leads+0.019
 slows+0.019
Neg logit contributions
affles-0.019
affle-0.016
ils-0.015
atmeal-0.015
 breakfast-0.014
Token and
Token ID290
Feature activation+1.61e-01

Pos logit contributions
 distance+0.024
 direction+0.024
packs+0.024
 leads+0.024
 pays+0.024
Neg logit contributions
affles-0.050
atmeal-0.045
affle-0.044
ic-0.043
ils-0.041
Token,
Token ID11
Feature activation-9.15e-01

Pos logit contributions
affles+0.025
atmeal+0.022
affle+0.022
ic+0.021
ils+0.021
Neg logit contributions
packs-0.012
 distance-0.012
 direction-0.012
 leads-0.012
 ho-0.012
Token and
Token ID290
Feature activation-2.05e-02

Pos logit contributions
affles+0.032
atmeal+0.028
affle+0.027
ic+0.027
ils+0.026
Neg logit contributions
packs-0.026
arse-0.024
 leads-0.023
 distance-0.023
 direction-0.023
Token everyone
Token ID2506
Feature activation+2.85e+00

Pos logit contributions
affles+0.001
affle+0.001
atmeal+0.001
ic+0.001
ils+0.001
Neg logit contributions
 ho-0.001
 direction-0.001
 leads-0.001
packs-0.001
 distance-0.001
Token called
Token ID1444
Feature activation+4.24e+00

Pos logit contributions
packs+0.063
 leads+0.063
 direction+0.062
 crashing+0.060
 slows+0.060
Neg logit contributions
affles-0.121
atmeal-0.107
affle-0.106
ic-0.101
umbers-0.093
Token it
Token ID340
Feature activation+4.40e-01

Pos logit contributions
 distance+0.152
packs+0.146
 direction+0.145
 slows+0.143
 pays+0.142
Neg logit contributions
affles-0.108
ils-0.087
 breakfast-0.085
affle-0.085
atmeal-0.082
Token Earth
Token ID3668
Feature activation+8.17e+00

Pos logit contributions
 distance+0.012
 direction+0.012
packs+0.012
 slows+0.011
 steering+0.011
Neg logit contributions
affles-0.016
atmeal-0.014
affle-0.013
ils-0.013
ic-0.013
Token.
Token ID13
Feature activation+3.31e-01

Pos logit contributions
 distance+0.196
packs+0.195
 direction+0.194
 ho+0.191
 leads+0.191
Neg logit contributions
affles-0.320
atmeal-0.279
affle-0.277
ic-0.268
ils-0.263
Token
Token ID220
Feature activation-2.22e+00

Pos logit contributions
 ho+0.008
packs+0.008
 pays+0.008
 crashing+0.008
 steering+0.008
Neg logit contributions
affles-0.012
atmeal-0.010
affle-0.010
 breakfast-0.010
ic-0.010
Token
Token ID198
Feature activation+8.14e-01

Pos logit contributions
affles+0.092
atmeal+0.079
affle+0.079
ic+0.077
ils+0.073
Neg logit contributions
 ho-0.048
 distance-0.047
 steering-0.046
 leads-0.046
 direction-0.045
Token
Token ID198
Feature activation+8.34e-01

Pos logit contributions
packs+0.017
 ho+0.017
 leads+0.017
 pays+0.016
 steering+0.016
Neg logit contributions
affles-0.034
atmeal-0.030
affle-0.030
ic-0.029
ils-0.028
TokenOne
Token ID3198
Feature activation+7.03e-01

Pos logit contributions
packs+0.021
 leads+0.021
 direction+0.021
 distance+0.021
 ho+0.021
Neg logit contributions
affles-0.032
atmeal-0.028
affle-0.028
ic-0.026
ils-0.026
Token why
Token ID1521
Feature activation-2.26e-01

Pos logit contributions
packs+0.061
 leads+0.060
 direction+0.059
 ho+0.059
 distance+0.058
Neg logit contributions
affles-0.121
atmeal-0.107
affle-0.106
ic-0.102
ils-0.099
Token,
Token ID11
Feature activation+2.40e-01

Pos logit contributions
affles+0.009
affle+0.008
atmeal+0.008
ic+0.007
ils+0.007
Neg logit contributions
packs-0.006
 leads-0.006
 ho-0.006
 direction-0.006
 crashing-0.006
Token and
Token ID290
Feature activation+8.97e-01

Pos logit contributions
 leads+0.006
packs+0.006
 ho+0.006
 direction+0.006
 distance+0.006
Neg logit contributions
affles-0.009
atmeal-0.008
affle-0.008
ic-0.008
ils-0.008
Token she
Token ID673
Feature activation-3.94e-02

Pos logit contributions
packs+0.022
 leads+0.022
 ho+0.022
 direction+0.021
 pays+0.021
Neg logit contributions
affles-0.036
affle-0.031
atmeal-0.031
ic-0.030
ils-0.029
Token said
Token ID531
Feature activation+3.41e-01

Pos logit contributions
affles+0.002
atmeal+0.001
affle+0.001
ic+0.001
ils+0.001
Neg logit contributions
 leads-0.001
 pays-0.001
 ho-0.001
 slows-0.001
 lets-0.001
Token trucks
Token ID13960
Feature activation+8.04e+00

Pos logit contributions
 ho+0.009
 leads+0.009
packs+0.008
 direction+0.008
 crashing+0.008
Neg logit contributions
affles-0.013
affle-0.012
atmeal-0.011
ic-0.011
ils-0.010
Token were
Token ID547
Feature activation+3.36e+00

Pos logit contributions
packs+0.157
 leads+0.155
 direction+0.154
 distance+0.153
 ho+0.152
Neg logit contributions
affles-0.355
atmeal-0.314
affle-0.311
ic-0.299
ils-0.291
Token dangerous
Token ID4923
Feature activation+8.79e-01

Pos logit contributions
 ho+0.085
packs+0.085
 direction+0.085
 leads+0.084
 distance+0.084
Neg logit contributions
affles-0.130
atmeal-0.113
affle-0.111
ic-0.108
ils-0.103
Token.
Token ID13
Feature activation+5.28e-01

Pos logit contributions
packs+0.018
 leads+0.018
 direction+0.017
 ho+0.017
 distance+0.017
Neg logit contributions
affles-0.039
atmeal-0.034
affle-0.034
ic-0.032
ils-0.032
Token
Token ID198
Feature activation+1.14e+00

Pos logit contributions
 distance+0.013
 flap+0.013
 ho+0.012
 direction+0.012
packs+0.012
Neg logit contributions
affles-0.020
atmeal-0.018
affle-0.017
ic-0.017
 breakfast-0.016
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
packs+0.016
 direction+0.016
 distance+0.016
 leads+0.015
 ho+0.015
Neg logit contributions
affles-0.057
atmeal-0.051
affle-0.051
ic-0.049
ils-0.049
Token there
Token ID612
Feature activation+7.70e-01

Pos logit contributions
affles+0.048
atmeal+0.041
affle+0.041
ic+0.038
ils+0.037
Neg logit contributions
 distance-0.041
packs-0.041
 direction-0.041
 leads-0.041
 ho-0.041
Token was
Token ID373
Feature activation-2.16e-01

Pos logit contributions
packs+0.021
 direction+0.021
 leads+0.021
 distance+0.021
 ho+0.021
Neg logit contributions
affles-0.028
atmeal-0.024
affle-0.024
ic-0.022
ils-0.022
Token a
Token ID257
Feature activation+9.97e-01

Pos logit contributions
affles+0.009
atmeal+0.008
ic+0.008
affle+0.008
ils+0.007
Neg logit contributions
packs-0.005
 ho-0.005
 distance-0.005
 direction-0.005
 thanks-0.005
Token planet
Token ID5440
Feature activation+2.60e+00

Pos logit contributions
 ho+0.024
 distance+0.023
 direction+0.023
 leads+0.023
packs+0.023
Neg logit contributions
affles-0.040
atmeal-0.035
affle-0.034
ic-0.033
ils-0.033
Token called
Token ID1444
Feature activation+3.12e+00

Pos logit contributions
packs+0.072
 leads+0.071
 direction+0.071
 ho+0.070
 distance+0.070
Neg logit contributions
affles-0.093
atmeal-0.080
affle-0.079
ic-0.076
ils-0.073
Token Earth
Token ID3668
Feature activation+7.78e+00

Pos logit contributions
 distance+0.094
 direction+0.090
packs+0.087
stretched+0.084
 slows+0.084
Neg logit contributions
affles-0.099
affle-0.084
ils-0.082
 breakfast-0.081
atmeal-0.080
Token.
Token ID13
Feature activation+4.91e-01

Pos logit contributions
packs+0.187
 distance+0.186
 direction+0.185
 leads+0.185
 pays+0.183
Neg logit contributions
affles-0.302
atmeal-0.263
affle-0.262
ic-0.254
ils-0.247
Token Many
Token ID4650
Feature activation-7.19e-01

Pos logit contributions
packs+0.012
 direction+0.012
 distance+0.012
 ho+0.012
 leads+0.012
Neg logit contributions
affles-0.019
atmeal-0.016
affle-0.016
ic-0.016
ils-0.015
Token animals
Token ID4695
Feature activation+2.54e+00

Pos logit contributions
affles+0.020
atmeal+0.016
affle+0.016
ic+0.015
ils+0.015
Neg logit contributions
packs-0.026
 distance-0.025
 leads-0.025
 direction-0.025
 ho-0.025
Token lived
Token ID5615
Feature activation-3.37e-01

Pos logit contributions
 direction+0.083
 ho+0.083
 distance+0.082
 pays+0.081
packs+0.081
Neg logit contributions
affles-0.077
atmeal-0.063
affle-0.062
ils-0.061
ic-0.057
Token on
Token ID319
Feature activation-6.22e-01

Pos logit contributions
affles+0.010
affle+0.008
atmeal+0.008
ic+0.008
ils+0.008
Neg logit contributions
 slows-0.011
 distance-0.011
 direction-0.011
 pays-0.011
 leads-0.011
Token have
Token ID423
Feature activation-1.34e+00

Pos logit contributions
 leads+0.011
 ho+0.011
packs+0.011
 pays+0.011
 flap+0.011
Neg logit contributions
affles-0.053
atmeal-0.049
ic-0.048
affle-0.048
making-0.044
Token swords
Token ID19735
Feature activation+5.56e+00

Pos logit contributions
affles+0.047
atmeal+0.040
affle+0.039
ic+0.037
ils+0.037
Neg logit contributions
packs-0.039
 distance-0.039
 direction-0.039
 leads-0.038
 ho-0.038
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
 direction+0.120
 distance+0.116
 crashing+0.116
 leads+0.115
 slows+0.115
Neg logit contributions
affles-0.227
atmeal-0.201
affle-0.197
ic-0.193
ils-0.193
Token shields
Token ID21132
Feature activation+6.17e+00

Pos logit contributions
 crashing+0.030
 distance+0.030
 direction+0.030
arse+0.029
 slows+0.028
Neg logit contributions
affles-0.035
atmeal-0.029
ils-0.028
affle-0.028
 breakfast-0.026
Token and
Token ID290
Feature activation+6.28e-01

Pos logit contributions
 direction+0.149
 leads+0.145
packs+0.144
 distance+0.144
 ho+0.143
Neg logit contributions
affles-0.236
atmeal-0.207
affle-0.203
ic-0.199
ils-0.196
Token helmets
Token ID28359
Feature activation+7.76e+00

Pos logit contributions
 crashing+0.018
 direction+0.018
arse+0.018
 distance+0.017
packs+0.017
Neg logit contributions
affles-0.021
atmeal-0.018
ils-0.017
affle-0.017
 breakfast-0.016
Token.
Token ID13
Feature activation+9.23e-01

Pos logit contributions
 direction+0.192
 leads+0.186
 distance+0.186
packs+0.185
 ho+0.183
Neg logit contributions
affles-0.287
atmeal-0.254
affle-0.246
ic-0.246
ils-0.236
Token We
Token ID775
Feature activation+1.11e+00

Pos logit contributions
 distance+0.025
 direction+0.023
arse+0.023
stretched+0.021
packs+0.021
Neg logit contributions
affles-0.033
 breakfast-0.028
 pasta-0.027
affle-0.027
 soup-0.027
Token have
Token ID423
Feature activation-1.26e+00

Pos logit contributions
 leads+0.013
 ho+0.012
packs+0.012
 pays+0.011
 flap+0.011
Neg logit contributions
affles-0.058
atmeal-0.053
affle-0.052
ic-0.052
ils-0.049
Token a
Token ID257
Feature activation+5.23e-01

Pos logit contributions
affles+0.041
atmeal+0.034
affle+0.034
ils+0.033
 breakfast+0.031
Neg logit contributions
 distance-0.038
packs-0.038
 direction-0.038
 crashing-0.038
 leads-0.037
Token dragon
Token ID10441
Feature activation+1.40e+00

Pos logit contributions
hole+0.018
packs+0.017
 direction+0.017
stretched+0.017
 Hold+0.017
Neg logit contributions
affles-0.014
 breakfast-0.013
affle-0.013
 soup-0.012
atmeal-0.012
Token smiles
Token ID21845
Feature activation+4.06e+00

Pos logit contributions
affles+0.037
affle+0.033
atmeal+0.032
ils+0.031
ic+0.029
Neg logit contributions
 distance-0.018
 direction-0.017
packs-0.017
 crashing-0.017
arse-0.016
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
 direction+0.109
 distance+0.107
packs+0.103
 flap+0.102
 ho+0.102
Neg logit contributions
affles-0.139
atmeal-0.121
affle-0.119
ic-0.118
ils-0.117
Token She
Token ID1375
Feature activation-2.15e-01

Pos logit contributions
arse+0.015
 steering+0.015
 flap+0.014
 distance+0.014
 direction+0.014
Neg logit contributions
affles-0.017
affle-0.014
atmeal-0.014
ott-0.013
 breakfast-0.013
Token says
Token ID1139
Feature activation-1.90e-01

Pos logit contributions
affles+0.009
affle+0.008
atmeal+0.008
ils+0.008
ic+0.007
Neg logit contributions
 distance-0.005
 direction-0.005
 crashing-0.005
arse-0.005
packs-0.004
Token the
Token ID262
Feature activation+3.83e+00

Pos logit contributions
affles+0.009
atmeal+0.008
affle+0.008
ic+0.008
ilt+0.007
Neg logit contributions
 leads-0.003
 ho-0.003
 slows-0.003
packs-0.003
 pays-0.003
Token bikes
Token ID16715
Feature activation+7.75e+00

Pos logit contributions
packs+0.098
 ho+0.090
 leads+0.089
 distance+0.088
arse+0.087
Neg logit contributions
affles-0.140
affle-0.123
atmeal-0.121
ic-0.119
 breakfast-0.113
Token are
Token ID389
Feature activation+2.56e+00

Pos logit contributions
 distance+0.157
 direction+0.154
packs+0.154
 ho+0.152
 crashing+0.149
Neg logit contributions
affles-0.322
atmeal-0.284
affle-0.279
ils-0.277
ic-0.276
Token ready
Token ID3492
Feature activation+1.35e+00

Pos logit contributions
 slows+0.079
 direction+0.079
 distance+0.078
packs+0.078
 leads+0.076
Neg logit contributions
affles-0.079
affle-0.066
ils-0.064
atmeal-0.064
 breakfast-0.060
Token.
Token ID13
Feature activation+4.30e-01

Pos logit contributions
 distance+0.034
packs+0.034
 direction+0.034
 slows+0.034
 steering+0.033
Neg logit contributions
affles-0.049
atmeal-0.043
ic-0.042
affle-0.042
ils-0.040
Token She
Token ID1375
Feature activation-4.84e-01

Pos logit contributions
 distance+0.013
 flap+0.012
 direction+0.012
arse+0.012
 steering+0.012
Neg logit contributions
affles-0.014
atmeal-0.012
affle-0.012
 breakfast-0.011
ott-0.011
Token says
Token ID1139
Feature activation-2.35e-01

Pos logit contributions
affles+0.020
affle+0.018
atmeal+0.017
ils+0.017
ic+0.016
Neg logit contributions
 distance-0.012
 crashing-0.011
 direction-0.011
arse-0.010
packs-0.010
Token a
Token ID257
Feature activation+1.24e+00

Pos logit contributions
affles+0.068
affle+0.059
atmeal+0.057
ic+0.054
ils+0.052
Neg logit contributions
 leads-0.036
 direction-0.036
 distance-0.035
packs-0.035
 pays-0.035
Token giant
Token ID6175
Feature activation-1.33e-01

Pos logit contributions
packs+0.035
 leads+0.034
 pays+0.033
 direction+0.033
 slows+0.033
Neg logit contributions
affles-0.043
affle-0.038
atmeal-0.037
ic-0.035
ilt-0.033
Token blue
Token ID4171
Feature activation-1.94e-01

Pos logit contributions
affles+0.005
atmeal+0.004
affle+0.004
ic+0.004
ils+0.004
Neg logit contributions
packs-0.004
 direction-0.004
 distance-0.004
 leads-0.004
 ho-0.003
Token planet
Token ID5440
Feature activation+1.30e+00

Pos logit contributions
affles+0.007
atmeal+0.006
affle+0.006
ic+0.006
ils+0.006
Neg logit contributions
packs-0.005
 leads-0.005
 distance-0.005
 direction-0.005
 pays-0.005
Token called
Token ID1444
Feature activation+2.56e+00

Pos logit contributions
packs+0.031
 direction+0.031
 leads+0.031
 distance+0.031
 ho+0.031
Neg logit contributions
affles-0.051
atmeal-0.044
affle-0.044
ic-0.042
ils-0.041
Token Earth
Token ID3668
Feature activation+7.75e+00

Pos logit contributions
 distance+0.073
packs+0.073
 direction+0.072
 leads+0.071
 slows+0.070
Neg logit contributions
affles-0.088
affle-0.075
atmeal-0.075
ic-0.071
ils-0.070
Token.
Token ID13
Feature activation+9.22e-01

Pos logit contributions
packs+0.183
 distance+0.183
 direction+0.182
 leads+0.182
 ho+0.179
Neg logit contributions
affles-0.301
atmeal-0.263
affle-0.260
ic-0.252
ils-0.248
Token Everyone
Token ID11075
Feature activation+2.28e+00

Pos logit contributions
 distance+0.022
packs+0.022
 leads+0.022
 ho+0.022
 direction+0.022
Neg logit contributions
affles-0.034
affle-0.030
atmeal-0.029
ic-0.028
ils-0.027
Token on
Token ID319
Feature activation-9.27e-01

Pos logit contributions
packs+0.064
 distance+0.063
 direction+0.063
 leads+0.063
 ho+0.062
Neg logit contributions
affles-0.081
atmeal-0.069
affle-0.069
ic-0.066
ils-0.064
Token Earth
Token ID3668
Feature activation+3.77e+00

Pos logit contributions
affles+0.029
affle+0.024
atmeal+0.024
ils+0.024
ic+0.024
Neg logit contributions
packs-0.028
 distance-0.027
 leads-0.027
 slows-0.027
 ho-0.027
Token was
Token ID373
Feature activation-1.10e-01

Pos logit contributions
 distance+0.096
packs+0.096
 ho+0.095
 direction+0.094
 leads+0.093
Neg logit contributions
affles-0.138
atmeal-0.119
affle-0.116
ic-0.114
ils-0.114
Token bikes
Token ID16715
Feature activation+4.91e+00

Pos logit contributions
affles+0.019
atmeal+0.016
affle+0.016
ic+0.015
ils+0.015
Neg logit contributions
packs-0.014
 distance-0.014
 direction-0.014
 leads-0.014
 crashing-0.014
Token as
Token ID355
Feature activation+1.51e+00

Pos logit contributions
 direction+0.110
 leads+0.108
 distance+0.108
 slows+0.106
packs+0.106
Neg logit contributions
affles-0.199
atmeal-0.177
affle-0.172
ic-0.167
ils-0.163
Token rockets
Token ID20293
Feature activation+5.97e+00

Pos logit contributions
packs+0.044
 direction+0.043
 distance+0.043
 leads+0.043
 ho+0.043
Neg logit contributions
affles-0.052
atmeal-0.045
affle-0.044
ic-0.042
ils-0.041
Token and
Token ID290
Feature activation+6.03e-01

Pos logit contributions
 direction+0.136
 distance+0.133
packs+0.131
 leads+0.131
 ho+0.129
Neg logit contributions
affles-0.239
atmeal-0.212
affle-0.207
ic-0.202
ils-0.197
Token our
Token ID674
Feature activation+4.30e-01

Pos logit contributions
 direction+0.016
 distance+0.015
packs+0.015
arse+0.015
 crashing+0.015
Neg logit contributions
affles-0.021
atmeal-0.019
affle-0.018
ils-0.017
 breakfast-0.017
Token helmets
Token ID28359
Feature activation+7.70e+00

Pos logit contributions
 leads+0.012
 distance+0.012
 direction+0.011
 pays+0.011
packs+0.011
Neg logit contributions
affles-0.015
affle-0.013
atmeal-0.013
ils-0.012
 breakfast-0.012
Token as
Token ID355
Feature activation+1.55e+00

Pos logit contributions
 direction+0.197
 distance+0.194
 aston+0.194
 horror+0.190
packs+0.189
Neg logit contributions
affles-0.266
atmeal-0.243
ic-0.235
affle-0.230
ils-0.220
Token space
Token ID2272
Feature activation+2.98e+00

Pos logit contributions
 distance+0.043
 direction+0.042
packs+0.042
 leads+0.042
 ho+0.041
Neg logit contributions
affles-0.056
atmeal-0.048
affle-0.047
ic-0.045
ils-0.044
Token suits
Token ID14803
Feature activation+1.94e+00

Pos logit contributions
 direction+0.074
 distance+0.074
packs+0.072
 leads+0.072
 ho+0.071
Neg logit contributions
affles-0.114
atmeal-0.099
affle-0.098
ic-0.095
ils-0.093
Token."
Token ID526
Feature activation-1.64e+00

Pos logit contributions
 direction+0.047
 distance+0.046
packs+0.046
 leads+0.046
 ho+0.045
Neg logit contributions
affles-0.076
atmeal-0.067
affle-0.065
ic-0.063
ils-0.062
Token
Token ID198
Feature activation+1.76e-01

Pos logit contributions
affles+0.053
affle+0.045
atmeal+0.043
 breakfast+0.042
 pasta+0.041
Neg logit contributions
arse-0.051
 steering-0.049
 ho-0.048
 crashing-0.048
 distance-0.048
Token and
Token ID290
Feature activation+2.89e-01

Pos logit contributions
 direction+0.118
 distance+0.118
packs+0.113
 ho+0.112
 leads+0.111
Neg logit contributions
affles-0.158
atmeal-0.139
ic-0.136
ils-0.132
affle-0.132
Token rockets
Token ID20293
Feature activation+5.91e+00

Pos logit contributions
 distance+0.008
 direction+0.008
packs+0.008
 leads+0.008
 ho+0.008
Neg logit contributions
affles-0.010
atmeal-0.008
affle-0.008
ils-0.008
ic-0.008
Token.
Token ID13
Feature activation+3.16e-01

Pos logit contributions
 direction+0.149
 distance+0.146
packs+0.144
 ho+0.142
 leads+0.142
Neg logit contributions
affles-0.217
atmeal-0.192
affle-0.185
ic-0.185
ils-0.181
Token They
Token ID1119
Feature activation+5.11e-01

Pos logit contributions
 distance+0.006
 direction+0.006
 ho+0.006
 flap+0.006
 leads+0.006
Neg logit contributions
affles-0.014
atmeal-0.012
affle-0.012
ic-0.011
 breakfast-0.011
Token find
Token ID1064
Feature activation-2.51e+00

Pos logit contributions
packs+0.011
 leads+0.011
 ho+0.011
 distance+0.011
 direction+0.011
Neg logit contributions
affles-0.021
atmeal-0.019
affle-0.019
ic-0.018
ils-0.017
Token helmets
Token ID28359
Feature activation+7.61e+00

Pos logit contributions
affles+0.086
atmeal+0.074
affle+0.073
ic+0.069
ils+0.065
Neg logit contributions
packs-0.075
 direction-0.074
 leads-0.074
 distance-0.073
 ho-0.073
Token and
Token ID290
Feature activation+7.24e-01

Pos logit contributions
 direction+0.201
 aston+0.198
 horror+0.192
 distance+0.192
packs+0.189
Neg logit contributions
affles-0.237
atmeal-0.222
ic-0.218
ils-0.201
at-0.201
Token scar
Token ID10153
Feature activation-1.50e+00

Pos logit contributions
arse+0.022
 distance+0.022
 Nice+0.022
 crashing+0.022
 slows+0.022
Neg logit contributions
affles-0.022
ils-0.017
atmeal-0.017
affle-0.017
 breakfast-0.017
Tokenves
Token ID1158
Feature activation+2.66e+00

Pos logit contributions
affles+0.042
atmeal+0.036
affle+0.035
ic+0.031
ott+0.030
Neg logit contributions
 leads-0.052
 steering-0.051
 pays-0.051
 ho-0.051
packs-0.051
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
 direction+0.067
 distance+0.065
arse+0.063
 aston+0.062
 slows+0.062
Neg logit contributions
affles-0.094
atmeal-0.082
ils-0.081
ic-0.079
affle-0.079
Token gloves
Token ID18051
Feature activation+1.75e+00

Pos logit contributions
arse+0.038
 Nice+0.038
 distance+0.038
 direction+0.038
 crashing+0.037
Neg logit contributions
affles-0.040
atmeal-0.033
ils-0.032
 breakfast-0.032
affle-0.031
Token bike
Token ID7161
Feature activation+8.49e-01

Pos logit contributions
affles+0.084
atmeal+0.076
affle+0.075
 breakfast+0.073
 pancakes+0.067
Neg logit contributions
 pays-0.057
 lets-0.053
 leads-0.053
 aims-0.053
 slows-0.053
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
 crashing+0.021
 direction+0.021
 slows+0.021
 leads+0.021
packs+0.021
Neg logit contributions
affles-0.030
atmeal-0.027
affle-0.026
ic-0.026
ils-0.025
Token She
Token ID1375
Feature activation-7.16e-01

Pos logit contributions
 distance+0.005
 flap+0.005
 direction+0.005
packs+0.005
 ho+0.005
Neg logit contributions
affles-0.006
affle-0.005
atmeal-0.005
 breakfast-0.004
ott-0.004
Token gives
Token ID3607
Feature activation-1.33e-01

Pos logit contributions
affles+0.031
affle+0.027
atmeal+0.027
ils+0.025
ic+0.024
Neg logit contributions
 distance-0.016
 direction-0.015
 crashing-0.014
packs-0.014
arse-0.014
Token them
Token ID606
Feature activation+1.72e-01

Pos logit contributions
affles+0.004
atmeal+0.003
affle+0.003
ic+0.003
ilt+0.003
Neg logit contributions
 ho-0.005
packs-0.005
 leads-0.005
 direction-0.005
 distance-0.004
Token helmets
Token ID28359
Feature activation+7.59e+00

Pos logit contributions
packs+0.005
 direction+0.005
 leads+0.005
 distance+0.005
 ho+0.005
Neg logit contributions
affles-0.006
atmeal-0.005
affle-0.005
ic-0.005
ils-0.005
Token and
Token ID290
Feature activation+5.16e-01

Pos logit contributions
 aston+0.214
 horror+0.210
 direction+0.209
 distance+0.206
 slows+0.203
Neg logit contributions
affles-0.224
atmeal-0.202
ic-0.202
 breakfast-0.193
at-0.188
Token tells
Token ID4952
Feature activation+6.50e-01

Pos logit contributions
 distance+0.015
 crashing+0.014
 direction+0.014
arse+0.014
packs+0.013
Neg logit contributions
affles-0.019
atmeal-0.015
affle-0.015
 breakfast-0.014
ils-0.014
Token them
Token ID606
Feature activation+2.42e+00

Pos logit contributions
 ho+0.021
 leads+0.021
 crashing+0.021
 pays+0.020
 direction+0.020
Neg logit contributions
affles-0.020
atmeal-0.018
affle-0.017
ic-0.017
ilt-0.015
Token to
Token ID284
Feature activation+3.34e-01

Pos logit contributions
packs+0.049
 leads+0.049
 direction+0.047
 ho+0.047
 pays+0.046
Neg logit contributions
affles-0.106
atmeal-0.094
affle-0.093
ic-0.090
ils-0.086
Token be
Token ID307
Feature activation-2.23e+00

Pos logit contributions
packs+0.008
 direction+0.007
 leads+0.007
 pays+0.007
 distance+0.007
Neg logit contributions
affles-0.013
atmeal-0.012
affle-0.012
ic-0.011
ils-0.011
Token always
Token ID1464
Feature activation+2.36e-01

Pos logit contributions
 direction+0.021
packs+0.021
 leads+0.021
 pays+0.021
 distance+0.020
Neg logit contributions
affles-0.029
atmeal-0.026
affle-0.025
ic-0.024
ils-0.024
Token wore
Token ID12408
Feature activation-9.36e-01

Pos logit contributions
 horror+0.007
packs+0.007
 direction+0.007
 aston+0.007
 leads+0.007
Neg logit contributions
affles-0.007
atmeal-0.006
affle-0.005
ils-0.005
ic-0.005
Token their
Token ID511
Feature activation-7.77e-01

Pos logit contributions
affles+0.034
affle+0.029
ils+0.028
atmeal+0.028
 breakfast+0.027
Neg logit contributions
 distance-0.024
 pays-0.024
 leads-0.023
 direction-0.023
 slows-0.023
Token gear
Token ID7733
Feature activation+2.57e+00

Pos logit contributions
affles+0.028
affle+0.024
atmeal+0.024
 breakfast+0.022
ils+0.021
Neg logit contributions
 distance-0.020
 pays-0.020
 horror-0.020
 leads-0.020
 Nice-0.020
Token:
Token ID25
Feature activation-6.93e-02

Pos logit contributions
 crashing+0.064
 distance+0.064
 leads+0.064
 direction+0.063
 pays+0.063
Neg logit contributions
affles-0.095
atmeal-0.082
affle-0.082
ic-0.080
ils-0.078
Token helmets
Token ID28359
Feature activation+7.57e+00

Pos logit contributions
affles+0.002
atmeal+0.002
affle+0.002
 breakfast+0.002
ic+0.002
Neg logit contributions
 direction-0.002
 distance-0.002
packs-0.002
 leads-0.002
arse-0.002
Token,
Token ID11
Feature activation-2.57e-01

Pos logit contributions
 aston+0.197
packs+0.195
 horror+0.194
 direction+0.190
 distance+0.187
Neg logit contributions
affles-0.247
ic-0.230
atmeal-0.224
affle-0.214
at-0.206
Token pads
Token ID21226
Feature activation+4.47e+00

Pos logit contributions
affles+0.009
atmeal+0.007
affle+0.007
 breakfast+0.007
 pancakes+0.006
Neg logit contributions
 direction-0.007
arse-0.007
 horror-0.007
 distance-0.007
 pays-0.007
Token and
Token ID290
Feature activation+8.61e-01

Pos logit contributions
 direction+0.111
 aston+0.109
 awe+0.108
packs+0.107
 horror+0.106
Neg logit contributions
affles-0.168
atmeal-0.155
affle-0.145
ic-0.144
at-0.136
Token gloves
Token ID18051
Feature activation+1.83e+00

Pos logit contributions
arse+0.025
 distance+0.024
 direction+0.024
 Nice+0.023
 aston+0.023
Neg logit contributions
affles-0.027
atmeal-0.022
 breakfast-0.022
ils-0.022
 pancakes-0.021
Token.
Token ID13
Feature activation+4.65e-01

Pos logit contributions
packs+0.044
 direction+0.044
 distance+0.043
 leads+0.043
 ho+0.042
Neg logit contributions
affles-0.071
atmeal-0.062
affle-0.061
ic-0.060
ils-0.057
Token was
Token ID373
Feature activation-5.34e-01

Pos logit contributions
packs+0.018
 direction+0.018
 leads+0.018
 distance+0.018
 ho+0.017
Neg logit contributions
affles-0.022
atmeal-0.019
affle-0.019
ic-0.018
ils-0.017
Token a
Token ID257
Feature activation+9.92e-01

Pos logit contributions
affles+0.021
atmeal+0.019
ic+0.019
affle+0.018
ils+0.017
Neg logit contributions
packs-0.013
 ho-0.013
 distance-0.012
 direction-0.012
 thanks-0.012
Token happy
Token ID3772
Feature activation-3.04e+00

Pos logit contributions
packs+0.025
 direction+0.025
 distance+0.025
 leads+0.025
 ho+0.025
Neg logit contributions
affles-0.038
atmeal-0.033
affle-0.033
ic-0.031
ils-0.030
Token planet
Token ID5440
Feature activation+2.87e+00

Pos logit contributions
affles+0.121
atmeal+0.105
affle+0.103
ic+0.100
ils+0.098
Neg logit contributions
 ho-0.075
 distance-0.074
packs-0.072
 direction-0.072
 leads-0.072
Token called
Token ID1444
Feature activation+2.35e+00

Pos logit contributions
packs+0.075
 leads+0.074
 direction+0.073
 ho+0.073
 distance+0.073
Neg logit contributions
affles-0.108
atmeal-0.094
affle-0.092
ic-0.089
ils-0.086
Token Earth
Token ID3668
Feature activation+7.47e+00

Pos logit contributions
 distance+0.070
 direction+0.066
packs+0.064
 slows+0.062
stretched+0.062
Neg logit contributions
affles-0.074
affle-0.063
ils-0.063
 breakfast-0.063
atmeal-0.059
Token.
Token ID13
Feature activation+3.67e-01

Pos logit contributions
packs+0.177
 distance+0.177
 direction+0.176
 leads+0.176
 pays+0.173
Neg logit contributions
affles-0.292
atmeal-0.255
affle-0.254
ic-0.245
ils-0.239
Token The
Token ID383
Feature activation+9.39e-01

Pos logit contributions
packs+0.009
 distance+0.009
 direction+0.009
 ho+0.009
 leads+0.009
Neg logit contributions
affles-0.014
atmeal-0.012
affle-0.012
ic-0.011
ils-0.011
Token planet
Token ID5440
Feature activation+2.42e+00

Pos logit contributions
packs+0.024
 leads+0.023
 pays+0.022
 direction+0.022
 ho+0.022
Neg logit contributions
affles-0.035
affle-0.031
atmeal-0.030
ic-0.029
ils-0.029
Token had
Token ID550
Feature activation-8.84e-01

Pos logit contributions
packs+0.067
 direction+0.066
 leads+0.066
 distance+0.065
 ho+0.065
Neg logit contributions
affles-0.087
atmeal-0.075
affle-0.074
ic-0.071
ils-0.068
Token lots
Token ID6041
Feature activation-1.08e+00

Pos logit contributions
affles+0.022
atmeal+0.018
ils+0.018
ic+0.017
affle+0.017
Neg logit contributions
packs-0.032
 leads-0.031
arse-0.030
 pays-0.030
 slows-0.030
Token crown
Token ID12389
Feature activation-5.92e-01

Pos logit contributions
affles+0.025
affle+0.022
atmeal+0.021
ic+0.020
ils+0.020
Neg logit contributions
packs-0.016
 flap-0.015
 direction-0.015
 leads-0.015
 ho-0.015
Token and
Token ID290
Feature activation+1.83e-01

Pos logit contributions
affles+0.026
affle+0.023
atmeal+0.022
ils+0.021
ic+0.020
Neg logit contributions
packs-0.013
 ho-0.013
 leads-0.013
 distance-0.012
 direction-0.012
Token playing
Token ID2712
Feature activation-2.49e+00

Pos logit contributions
packs+0.005
 leads+0.005
 ho+0.005
 direction+0.005
 pays+0.005
Neg logit contributions
affles-0.007
affle-0.006
atmeal-0.006
ic-0.005
at-0.005
Token in
Token ID287
Feature activation+1.12e+00

Pos logit contributions
affles+0.150
atmeal+0.133
affle+0.132
at+0.127
illy+0.124
Neg logit contributions
 flap-0.028
 low-0.026
 but-0.021
 higher-0.020
 close-0.019
Token the
Token ID262
Feature activation-2.38e+00

Pos logit contributions
 distance+0.030
 direction+0.030
packs+0.030
 leads+0.029
 ho+0.029
Neg logit contributions
affles-0.045
atmeal-0.039
affle-0.038
ic-0.036
ils-0.034
Token freshly
Token ID29026
Feature activation-1.05e+01

Pos logit contributions
affles+0.099
atmeal+0.086
affle+0.086
ic+0.083
ils+0.080
Neg logit contributions
 distance-0.055
 direction-0.054
packs-0.053
 leads-0.053
 ho-0.052
Token cut
Token ID2005
Feature activation-4.62e+00

Pos logit contributions
affles+0.404
affle+0.329
ic+0.301
illy+0.294
umbers+0.290
Neg logit contributions
 way-0.279
 distance-0.277
 direction-0.273
 pays-0.266
 leads-0.266
Token grass
Token ID8701
Feature activation+3.28e+00

Pos logit contributions
affles+0.186
affle+0.157
atmeal+0.152
making+0.137
dd+0.135
Neg logit contributions
 direction-0.121
 leads-0.118
 distance-0.118
packs-0.116
 way-0.116
Token.
Token ID13
Feature activation-4.83e-01

Pos logit contributions
packs+0.067
 distance+0.066
 ho+0.066
 direction+0.065
 leads+0.064
Neg logit contributions
affles-0.147
affle-0.128
atmeal-0.127
ic-0.118
ils-0.117
Token It
Token ID632
Feature activation-1.66e+00

Pos logit contributions
affles+0.018
atmeal+0.016
affle+0.016
ic+0.015
ils+0.015
Neg logit contributions
 distance-0.012
 ho-0.012
packs-0.012
 steering-0.012
 direction-0.012
Token looked
Token ID3114
Feature activation-6.18e-01

Pos logit contributions
affles+0.077
affle+0.067
atmeal+0.064
ic+0.059
umbers+0.058
Neg logit contributions
 leads-0.037
 pays-0.036
 way-0.036
 direction-0.035
 thanks-0.034
Token to
Token ID284
Feature activation-1.04e+00

Pos logit contributions
packs+0.036
 leads+0.036
 direction+0.036
 ho+0.036
 distance+0.036
Neg logit contributions
affles-0.058
atmeal-0.051
affle-0.050
ic-0.048
ils-0.047
Token play
Token ID711
Feature activation-1.42e+00

Pos logit contributions
affles+0.042
atmeal+0.036
affle+0.036
ic+0.035
ils+0.033
Neg logit contributions
packs-0.026
 leads-0.025
 distance-0.025
 direction-0.025
 crashing-0.025
Token and
Token ID290
Feature activation-7.97e-02

Pos logit contributions
affles+0.069
atmeal+0.061
affle+0.060
ic+0.060
at+0.055
Neg logit contributions
packs-0.023
 direction-0.022
 leads-0.022
 distance-0.022
 flap-0.021
Token explore
Token ID7301
Feature activation-9.42e-01

Pos logit contributions
affles+0.003
atmeal+0.003
affle+0.003
ic+0.003
ils+0.003
Neg logit contributions
packs-0.002
 leads-0.002
 direction-0.002
 distance-0.002
 ho-0.002
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
affles+0.037
atmeal+0.032
affle+0.031
ic+0.030
ils+0.029
Neg logit contributions
packs-0.024
 direction-0.024
 distance-0.024
 leads-0.024
 ho-0.023
Token freshly
Token ID29026
Feature activation-9.36e+00

Pos logit contributions
affles+0.039
atmeal+0.034
affle+0.034
ic+0.032
ils+0.032
Neg logit contributions
packs-0.028
 leads-0.027
 ho-0.027
 direction-0.027
 distance-0.026
Token organized
Token ID8389
Feature activation-2.44e+00

Pos logit contributions
affles+0.350
affle+0.294
atmeal+0.286
ic+0.277
ils+0.256
Neg logit contributions
 distance-0.257
 direction-0.257
packs-0.252
 leads-0.251
 way-0.243
Token lawn
Token ID21133
Feature activation+1.71e+00

Pos logit contributions
affles+0.102
atmeal+0.090
affle+0.089
ic+0.085
ils+0.083
Neg logit contributions
packs-0.054
 direction-0.053
 distance-0.053
 leads-0.053
 ho-0.052
Token.
Token ID13
Feature activation-4.42e-01

Pos logit contributions
packs+0.040
 direction+0.039
 leads+0.039
 ho+0.039
 distance+0.039
Neg logit contributions
affles-0.068
atmeal-0.060
affle-0.059
ic-0.057
ils-0.055
Token<|endoftext|>
Token ID50256
Feature activation-3.41e-01

Pos logit contributions
affles+0.018
affle+0.016
atmeal+0.015
ic+0.015
ott+0.014
Neg logit contributions
 steering-0.010
 ho-0.010
 distance-0.010
 flap-0.010
packs-0.010
TokenOnce
Token ID7454
Feature activation-2.96e-01

Pos logit contributions
affles+0.013
atmeal+0.011
affle+0.011
ic+0.011
ils+0.010
Neg logit contributions
 ho-0.009
packs-0.008
 leads-0.008
 direction-0.008
 steering-0.008
Token he
Token ID339
Feature activation-5.29e-01

Pos logit contributions
affles+0.033
affle+0.028
atmeal+0.028
ic+0.028
ils+0.027
Neg logit contributions
 ho-0.021
 distance-0.021
 direction-0.021
 leads-0.020
 steering-0.020
Token saw
Token ID2497
Feature activation-2.19e+00

Pos logit contributions
affles+0.021
atmeal+0.018
affle+0.018
ic+0.018
ils+0.017
Neg logit contributions
 ho-0.013
 leads-0.013
 distance-0.013
 direction-0.013
packs-0.013
Token a
Token ID257
Feature activation+7.95e-01

Pos logit contributions
affles+0.111
affle+0.097
atmeal+0.097
ic+0.096
ilt+0.094
Neg logit contributions
 thanks-0.038
 direction-0.036
 way-0.035
 distance-0.034
 hello-0.033
Token basket
Token ID7988
Feature activation-1.36e-01

Pos logit contributions
 distance+0.021
 direction+0.021
 ho+0.021
 leads+0.020
packs+0.020
Neg logit contributions
affles-0.030
atmeal-0.026
affle-0.025
ic-0.024
ils-0.024
Token of
Token ID286
Feature activation-1.46e+00

Pos logit contributions
affles+0.005
atmeal+0.004
affle+0.004
ic+0.004
ils+0.004
Neg logit contributions
packs-0.004
 leads-0.004
 direction-0.004
 pays-0.004
 ho-0.004
Token freshly
Token ID29026
Feature activation-9.00e+00

Pos logit contributions
affles+0.048
ic+0.045
atmeal+0.045
affle+0.043
making+0.043
Neg logit contributions
 ho-0.041
 thanks-0.041
 way-0.040
 distance-0.040
 direction-0.040
Token baked
Token ID22979
Feature activation-5.00e+00

Pos logit contributions
affles+0.318
affle+0.254
ic+0.236
ilt+0.222
atmeal+0.220
Neg logit contributions
 way-0.263
 distance-0.256
 leads-0.254
 ho-0.253
 direction-0.248
Token bread
Token ID8509
Feature activation-1.09e+00

Pos logit contributions
affles+0.194
affle+0.166
atmeal+0.160
making+0.154
ott+0.149
Neg logit contributions
 leads-0.127
 thanks-0.125
 way-0.122
packs-0.118
 ho-0.116
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
affles+0.050
affle+0.045
atmeal+0.044
ic+0.042
making+0.040
Neg logit contributions
packs-0.019
 ho-0.019
 leads-0.019
 distance-0.018
 thanks-0.018
Token It
Token ID632
Feature activation-1.56e+00

Pos logit contributions
affles+0.010
atmeal+0.009
affle+0.009
ic+0.008
ils+0.008
Neg logit contributions
 leads-0.004
 direction-0.004
packs-0.004
 ho-0.004
 distance-0.004
Token smelled
Token ID37609
Feature activation-1.43e+00

Pos logit contributions
affles+0.079
affle+0.068
atmeal+0.066
ic+0.060
umbers+0.059
Neg logit contributions
 leads-0.030
 lets-0.027
 way-0.026
 pays-0.026
 waved-0.026
Token very
Token ID845
Feature activation-3.24e+00

Pos logit contributions
 distance+0.021
 direction+0.020
 ho+0.020
 leads+0.020
 steering+0.020
Neg logit contributions
affles-0.030
atmeal-0.026
affle-0.025
ic-0.025
ils-0.025
Token special
Token ID2041
Feature activation-1.62e+00

Pos logit contributions
affles+0.130
atmeal+0.119
ils+0.116
ic+0.113
affle+0.112
Neg logit contributions
 way-0.073
 direction-0.073
 ho-0.068
 leads-0.067
packs-0.066
Token toy
Token ID13373
Feature activation-8.68e-01

Pos logit contributions
affles+0.056
atmeal+0.047
affle+0.045
ils+0.045
ic+0.044
Neg logit contributions
 direction-0.048
 leads-0.048
 distance-0.048
 ho-0.047
packs-0.047
Token for
Token ID329
Feature activation-4.83e+00

Pos logit contributions
affles+0.039
atmeal+0.034
affle+0.033
ic+0.032
ils+0.032
Neg logit contributions
 leads-0.017
 ho-0.017
packs-0.017
 steering-0.017
 direction-0.016
Token L
Token ID406
Feature activation-3.70e+00

Pos logit contributions
affles+0.205
affle+0.192
atmeal+0.186
ilt+0.183
at+0.181
Neg logit contributions
 ho-0.112
 crashing-0.099
 waving-0.095
 tow-0.092
 thanks-0.088
Tokenott
Token ID1252
Feature activation-8.96e+00

Pos logit contributions
affles+0.083
ils+0.064
affle+0.063
 pancakes+0.059
 breakfast+0.058
Neg logit contributions
 pays-0.136
packs-0.135
 Maybe-0.125
yt-0.125
 leads-0.124
Tokenie
Token ID494
Feature activation-2.07e+00

Pos logit contributions
affles+0.224
atmeal+0.185
affle+0.165
umbers+0.163
ic+0.161
Neg logit contributions
packs-0.329
 way-0.307
 pays-0.304
 direction-0.295
 leads-0.293
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
affles+0.089
atmeal+0.078
affle+0.077
ic+0.074
ils+0.072
Neg logit contributions
packs-0.044
 direction-0.043
 leads-0.043
 distance-0.043
 ho-0.043
Token It
Token ID632
Feature activation-1.59e+00

Pos logit contributions
affles+0.010
atmeal+0.009
affle+0.008
ic+0.008
ils+0.008
Neg logit contributions
packs-0.005
 direction-0.005
 leads-0.005
 distance-0.005
 ho-0.005
Token was
Token ID373
Feature activation+2.25e-01

Pos logit contributions
affles+0.072
affle+0.062
atmeal+0.061
ic+0.055
ils+0.055
Neg logit contributions
 leads-0.037
 pays-0.034
 way-0.034
 direction-0.034
 ho-0.033
Token a
Token ID257
Feature activation+9.10e-01

Pos logit contributions
 ho+0.006
 direction+0.006
 distance+0.006
packs+0.006
 leads+0.006
Neg logit contributions
affles-0.009
atmeal-0.008
affle-0.007
ic-0.007
ils-0.007
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
affles+0.060
atmeal+0.054
affle+0.054
ase+0.051
ic+0.051
Neg logit contributions
 thanks-0.030
 distance-0.027
 crashing-0.027
 direction-0.027
 awe-0.027
Token sweet
Token ID6029
Feature activation-9.77e-01

Pos logit contributions
 way+0.046
 distance+0.043
 direction+0.042
 ho+0.041
 steering+0.039
Neg logit contributions
affles-0.054
affle-0.045
ils-0.044
atmeal-0.043
dd-0.042
Token aroma
Token ID31242
Feature activation-1.41e+00

Pos logit contributions
affles+0.035
affle+0.029
atmeal+0.029
ils+0.027
umbers+0.026
Neg logit contributions
 direction-0.028
 way-0.028
 thanks-0.028
 leads-0.028
 distance-0.027
Token of
Token ID286
Feature activation-5.99e-02

Pos logit contributions
affles+0.063
affle+0.055
atmeal+0.054
ic+0.049
ils+0.049
Neg logit contributions
 distance-0.029
 crashing-0.029
 leads-0.029
 ho-0.029
 direction-0.028
Token a
Token ID257
Feature activation-1.59e-01

Pos logit contributions
affles+0.002
atmeal+0.002
affle+0.002
ic+0.002
ils+0.002
Neg logit contributions
 distance-0.002
 thanks-0.002
 direction-0.002
 ho-0.002
 leads-0.002
Token freshly
Token ID29026
Feature activation-8.63e+00

Pos logit contributions
affles+0.006
atmeal+0.005
affle+0.005
ils+0.005
ic+0.005
Neg logit contributions
 distance-0.004
 direction-0.004
 ho-0.004
 leads-0.004
 crashing-0.004
Token baked
Token ID22979
Feature activation-5.34e+00

Pos logit contributions
affles+0.264
affle+0.207
atmeal+0.186
ic+0.186
lf+0.183
Neg logit contributions
 pays-0.263
 way-0.261
 leads-0.261
 distance-0.256
 thanks-0.255
Token cake
Token ID12187
Feature activation-2.59e+00

Pos logit contributions
affles+0.176
affle+0.141
atmeal+0.135
ils+0.125
making+0.122
Neg logit contributions
 distance-0.175
 way-0.172
 leads-0.169
 direction-0.169
 ho-0.166
Token.
Token ID13
Feature activation-3.90e-01

Pos logit contributions
affles+0.127
affle+0.113
atmeal+0.107
ils+0.100
ic+0.099
Neg logit contributions
 distance-0.044
 leads-0.043
 ho-0.043
packs-0.042
 direction-0.041
Token
Token ID198
Feature activation+3.69e-01

Pos logit contributions
affles+0.019
affle+0.016
atmeal+0.016
ic+0.016
ils+0.016
Neg logit contributions
 leads-0.007
 direction-0.007
packs-0.006
 distance-0.006
 pays-0.006
Token
Token ID198
Feature activation+4.71e-01

Pos logit contributions
packs+0.006
 direction+0.006
 leads+0.006
 distance+0.006
 ho+0.006
Neg logit contributions
affles-0.018
atmeal-0.016
affle-0.016
ic-0.015
ils-0.015
Token he
Token ID339
Feature activation-1.68e+00

Pos logit contributions
packs+0.035
 direction+0.035
 distance+0.035
 leads+0.035
 ho+0.035
Neg logit contributions
affles-0.040
atmeal-0.034
affle-0.033
ic-0.032
ils-0.031
Token could
Token ID714
Feature activation+4.51e-01

Pos logit contributions
affles+0.085
affle+0.076
ic+0.075
atmeal+0.074
ilt+0.071
Neg logit contributions
 pays-0.027
 lets-0.024
 leads-0.024
 distance-0.024
 ho-0.023
Token do
Token ID466
Feature activation-4.43e+00

Pos logit contributions
 flap+0.009
 distance+0.009
 follow+0.008
 way+0.008
 forget+0.008
Neg logit contributions
affles-0.020
ic-0.018
affle-0.018
atmeal-0.017
ils-0.017
Token in
Token ID287
Feature activation+1.57e+00

Pos logit contributions
affles+0.202
affle+0.179
atmeal+0.178
ic+0.169
ilt+0.160
Neg logit contributions
packs-0.085
 leads-0.083
 direction-0.082
 pays-0.081
 distance-0.081
Token the
Token ID262
Feature activation-1.92e+00

Pos logit contributions
 direction+0.043
 distance+0.043
 ho+0.043
packs+0.043
 leads+0.043
Neg logit contributions
affles-0.059
atmeal-0.051
affle-0.050
ic-0.048
ils-0.046
Token o
Token ID267
Feature activation-8.56e+00

Pos logit contributions
affles+0.070
atmeal+0.058
affle+0.058
ic+0.056
ils+0.054
Neg logit contributions
 distance-0.060
 direction-0.058
 way-0.055
 leads-0.054
 ho-0.053
Tokenasis
Token ID17765
Feature activation-4.12e+00

Pos logit contributions
affles+0.225
affle+0.181
ils+0.173
ic+0.167
atmeal+0.164
Neg logit contributions
 distance-0.319
 direction-0.310
 leads-0.306
packs-0.305
 ho-0.300
Token.
Token ID13
Feature activation-5.41e-01

Pos logit contributions
affles+0.181
affle+0.159
atmeal+0.156
ic+0.150
ils+0.146
Neg logit contributions
packs-0.084
 distance-0.084
 direction-0.083
 leads-0.083
 ho-0.081
Token He
Token ID679
Feature activation-5.25e-01

Pos logit contributions
affles+0.022
atmeal+0.019
affle+0.019
ic+0.018
ils+0.017
Neg logit contributions
packs-0.013
 ho-0.013
 distance-0.012
 flap-0.012
 direction-0.012
Token could
Token ID714
Feature activation+7.93e-01

Pos logit contributions
affles+0.023
atmeal+0.020
affle+0.020
ic+0.020
ilt+0.018
Neg logit contributions
 ho-0.012
 leads-0.011
 pays-0.011
 lets-0.011
 waved-0.011
Token have
Token ID423
Feature activation-3.06e+00

Pos logit contributions
 flap+0.016
 ho+0.015
 distance+0.015
 leads+0.014
 crashing+0.014
Neg logit contributions
affles-0.037
ic-0.033
affle-0.032
atmeal-0.032
ils-0.031
Token it
Token ID340
Feature activation-1.83e+00

Pos logit contributions
affles+0.031
atmeal+0.026
affle+0.025
ic+0.024
ils+0.022
Neg logit contributions
 direction-0.030
 distance-0.030
packs-0.030
 ho-0.029
 leads-0.029
Token away
Token ID1497
Feature activation+9.51e-01

Pos logit contributions
affles+0.065
atmeal+0.058
affle+0.056
ils+0.054
ic+0.054
Neg logit contributions
 leads-0.047
 slows-0.046
packs-0.046
 pays-0.046
 direction-0.046
Token in
Token ID287
Feature activation+1.70e+00

Pos logit contributions
 slows+0.026
 leads+0.026
packs+0.026
 pays+0.025
 ho+0.025
Neg logit contributions
affles-0.031
atmeal-0.028
ic-0.027
affle-0.027
ils-0.026
Token a
Token ID257
Feature activation-2.02e+00

Pos logit contributions
 slows+0.055
 leads+0.055
packs+0.054
 ho+0.053
 pays+0.053
Neg logit contributions
affles-0.050
atmeal-0.042
affle-0.042
ic-0.041
ils-0.038
Token wheel
Token ID7825
Feature activation-2.10e+00

Pos logit contributions
affles+0.074
atmeal+0.065
affle+0.064
ic+0.061
ils+0.059
Neg logit contributions
packs-0.054
 leads-0.052
 slows-0.051
 pays-0.051
 ho-0.051
Tokenbar
Token ID5657
Feature activation-8.56e+00

Pos logit contributions
affles+0.085
atmeal+0.075
affle+0.074
ic+0.072
ils+0.069
Neg logit contributions
packs-0.048
 leads-0.047
 direction-0.047
 distance-0.047
 ho-0.047
Tokenrow
Token ID808
Feature activation-2.97e+00

Pos logit contributions
affles+0.220
atmeal+0.179
affle+0.175
ic+0.164
ils+0.152
Neg logit contributions
 leads-0.321
 distance-0.321
packs-0.321
 direction-0.320
 ho-0.317
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
affles+0.114
atmeal+0.101
affle+0.099
ic+0.098
ils+0.093
Neg logit contributions
packs-0.072
 leads-0.071
 direction-0.070
 ho-0.069
 slows-0.069
Token
Token ID198
Feature activation+5.91e-01

Pos logit contributions
affles+0.008
atmeal+0.007
affle+0.007
ic+0.007
ils+0.007
Neg logit contributions
packs-0.003
 ho-0.003
 distance-0.003
 direction-0.003
 leads-0.003
Token
Token ID198
Feature activation+6.45e-01

Pos logit contributions
 distance+0.008
packs+0.008
 direction+0.008
 leads+0.007
 ho+0.007
Neg logit contributions
affles-0.030
atmeal-0.027
affle-0.027
ic-0.026
ils-0.026
TokenOnce
Token ID7454
Feature activation-3.82e-01

Pos logit contributions
 distance+0.012
 crashing+0.011
 Have+0.011
 slows+0.011
 tow+0.011
Neg logit contributions
affles-0.030
atmeal-0.027
affle-0.026
ic-0.026
ils-0.025
Token?"
Token ID1701
Feature activation-3.29e+00

Pos logit contributions
packs+0.034
 direction+0.033
 leads+0.033
 distance+0.033
 ho+0.033
Neg logit contributions
affles-0.061
atmeal-0.053
affle-0.053
ic-0.050
ils-0.048
Token
Token ID220
Feature activation-2.44e+00

Pos logit contributions
affles+0.178
atmeal+0.161
affle+0.158
ils+0.157
ic+0.150
Neg logit contributions
 direction-0.035
 distance-0.035
 leads-0.034
 slows-0.031
 ho-0.031
Token
Token ID198
Feature activation+3.22e-01

Pos logit contributions
affles+0.112
atmeal+0.100
affle+0.099
ic+0.096
ils+0.092
Neg logit contributions
packs-0.043
 distance-0.043
 direction-0.042
 ho-0.042
 leads-0.042
Token
Token ID198
Feature activation+5.16e-01

Pos logit contributions
 distance+0.004
 direction+0.004
 ho+0.004
 leads+0.004
packs+0.004
Neg logit contributions
affles-0.017
atmeal-0.015
affle-0.015
ils-0.014
ic-0.014
TokenD
Token ID35
Feature activation-2.41e+00

Pos logit contributions
 distance+0.014
 ho+0.014
 direction+0.014
 crashing+0.014
 leads+0.014
Neg logit contributions
affles-0.019
atmeal-0.017
affle-0.016
ils-0.015
ic-0.014
Tokenott
Token ID1252
Feature activation-8.42e+00

Pos logit contributions
affles+0.049
atmeal+0.034
affle+0.032
ils+0.032
 breakfast+0.031
Neg logit contributions
 pays-0.107
 leads-0.102
packs-0.099
 slows-0.098
 direction-0.098
Tokenie
Token ID494
Feature activation-2.13e+00

Pos logit contributions
affles+0.236
atmeal+0.216
 breakfast+0.198
 pancakes+0.190
 dessert+0.178
Neg logit contributions
 pays-0.288
packs-0.274
 direction-0.270
 way-0.262
 leads-0.258
Token per
Token ID583
Feature activation+3.21e+00

Pos logit contributions
affles+0.094
atmeal+0.083
affle+0.080
ic+0.074
ils+0.073
Neg logit contributions
 leads-0.048
 direction-0.048
 pays-0.047
 ho-0.047
 slows-0.045
Tokenked
Token ID9091
Feature activation+2.26e+00

Pos logit contributions
packs+0.061
 distance+0.060
 direction+0.059
 leads+0.058
 pays+0.055
Neg logit contributions
affles-0.143
atmeal-0.127
affle-0.127
ic-0.124
ils-0.119
Token up
Token ID510
Feature activation+1.56e+00

Pos logit contributions
 direction+0.056
 distance+0.055
 ho+0.055
 leads+0.054
 flap+0.053
Neg logit contributions
affles-0.094
atmeal-0.083
affle-0.080
ic-0.076
ils-0.074
Token and
Token ID290
Feature activation+1.95e-01

Pos logit contributions
packs+0.035
 pays+0.034
 leads+0.034
 direction+0.034
 ho+0.033
Neg logit contributions
affles-0.061
atmeal-0.054
affle-0.053
ic-0.053
ils-0.050
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
packs+0.014
 direction+0.013
 distance+0.013
 leads+0.013
 ho+0.012
Neg logit contributions
affles-0.052
atmeal-0.048
affle-0.047
ic-0.045
ils-0.045
TokenOne
Token ID3198
Feature activation+7.62e-01

Pos logit contributions
 crashing+0.016
 direction+0.016
 distance+0.016
 slows+0.015
packs+0.015
Neg logit contributions
affles-0.049
atmeal-0.046
affle-0.044
ils-0.041
ic-0.041
Token day
Token ID1110
Feature activation+3.74e-01

Pos logit contributions
 distance+0.017
 ho+0.016
 direction+0.016
 thanks+0.016
 leads+0.016
Neg logit contributions
affles-0.033
atmeal-0.029
ic-0.028
affle-0.028
ils-0.028
Token,
Token ID11
Feature activation-7.73e-01

Pos logit contributions
 distance+0.006
packs+0.006
 direction+0.006
 leads+0.006
 pays+0.006
Neg logit contributions
affles-0.017
atmeal-0.016
affle-0.015
ic-0.015
ils-0.015
Token L
Token ID406
Feature activation-3.56e+00

Pos logit contributions
affles+0.034
affle+0.030
atmeal+0.030
ic+0.028
ils+0.027
Neg logit contributions
 ho-0.017
packs-0.017
 direction-0.017
 leads-0.017
 crashing-0.016
Tokenott
Token ID1252
Feature activation-8.41e+00

Pos logit contributions
affles+0.070
affle+0.049
ils+0.049
atmeal+0.046
making+0.043
Neg logit contributions
 pays-0.153
packs-0.149
 leads-0.145
 thanks-0.143
 direction-0.141
Tokenie
Token ID494
Feature activation-1.66e+00

Pos logit contributions
affles+0.230
atmeal+0.197
 breakfast+0.182
affle+0.177
umbers+0.176
Neg logit contributions
packs-0.292
 pays-0.287
 way-0.272
 direction-0.266
 leads-0.265
Token's
Token ID338
Feature activation+2.16e-01

Pos logit contributions
affles+0.065
atmeal+0.056
affle+0.056
ic+0.053
ils+0.051
Neg logit contributions
packs-0.041
 leads-0.041
 direction-0.041
 distance-0.041
 ho-0.040
Token mom
Token ID1995
Feature activation+6.72e-01

Pos logit contributions
packs+0.006
 pays+0.006
 leads+0.006
 direction+0.006
 distance+0.006
Neg logit contributions
affles-0.007
atmeal-0.006
affle-0.006
ic-0.006
 breakfast-0.006
Token went
Token ID1816
Feature activation+1.01e+00

Pos logit contributions
 leads+0.018
 pays+0.017
 ho+0.017
packs+0.017
 distance+0.017
Neg logit contributions
affles-0.026
affle-0.022
atmeal-0.022
ic-0.020
ils-0.020
Token to
Token ID284
Feature activation-9.27e-01

Pos logit contributions
 leads+0.021
 distance+0.020
packs+0.020
 direction+0.020
 ho+0.020
Neg logit contributions
affles-0.045
atmeal-0.040
affle-0.039
ic-0.038
ils-0.036
Token dolls
Token ID36062
Feature activation+2.07e+00

Pos logit contributions
packs+0.012
 distance+0.012
 leads+0.012
 direction+0.012
 ho+0.012
Neg logit contributions
affles-0.011
atmeal-0.009
affle-0.009
ic-0.009
ils-0.009
Token around
Token ID1088
Feature activation-5.96e-01

Pos logit contributions
packs+0.050
 direction+0.049
 leads+0.049
 distance+0.049
 slows+0.049
Neg logit contributions
affles-0.078
atmeal-0.069
affle-0.067
ic-0.066
ils-0.064
Token in
Token ID287
Feature activation+1.85e+00

Pos logit contributions
affles+0.028
atmeal+0.025
affle+0.024
ic+0.023
ils+0.022
Neg logit contributions
 direction-0.012
 ho-0.011
 distance-0.011
packs-0.011
 crashing-0.011
Token a
Token ID257
Feature activation-1.52e+00

Pos logit contributions
 slows+0.058
 leads+0.056
packs+0.055
 steering+0.055
 pays+0.055
Neg logit contributions
affles-0.054
ic-0.045
affle-0.045
ils-0.044
atmeal-0.044
Token wheel
Token ID7825
Feature activation-2.16e+00

Pos logit contributions
affles+0.063
atmeal+0.055
affle+0.054
ic+0.052
ils+0.050
Neg logit contributions
 distance-0.036
 direction-0.036
packs-0.035
 leads-0.035
 ho-0.035
Tokenbar
Token ID5657
Feature activation-8.41e+00

Pos logit contributions
affles+0.092
atmeal+0.081
affle+0.080
ic+0.077
ils+0.075
Neg logit contributions
packs-0.046
 direction-0.045
 distance-0.045
 leads-0.045
 ho-0.045
Tokenrow
Token ID808
Feature activation-3.08e+00

Pos logit contributions
affles+0.230
atmeal+0.189
affle+0.185
ic+0.174
ils+0.161
Neg logit contributions
 leads-0.303
 distance-0.301
packs-0.301
 direction-0.301
 ho-0.298
Token she
Token ID673
Feature activation-2.08e+00

Pos logit contributions
affles+0.130
atmeal+0.114
affle+0.113
ic+0.109
ils+0.106
Neg logit contributions
packs-0.066
 leads-0.065
 direction-0.065
 distance-0.064
 ho-0.064
Token had
Token ID550
Feature activation-4.81e-01

Pos logit contributions
affles+0.090
affle+0.078
atmeal+0.078
ic+0.076
ilt+0.072
Neg logit contributions
 ho-0.044
 pays-0.043
 leads-0.043
 direction-0.042
packs-0.042
Token found
Token ID1043
Feature activation-3.58e+00

Pos logit contributions
affles+0.019
atmeal+0.017
affle+0.017
ic+0.016
ils+0.015
Neg logit contributions
packs-0.012
 direction-0.012
 ho-0.012
 distance-0.012
 leads-0.012
Token in
Token ID287
Feature activation+2.68e+00

Pos logit contributions
affles+0.138
atmeal+0.119
affle+0.118
ic+0.112
ils+0.108
Neg logit contributions
packs-0.092
 direction-0.091
 leads-0.090
 distance-0.090
 ho-0.090
Token of
Token ID286
Feature activation-1.40e+00

Pos logit contributions
affles+0.057
atmeal+0.048
affle+0.047
ic+0.045
ils+0.043
Neg logit contributions
 ho-0.053
packs-0.053
 direction-0.052
 leads-0.052
 distance-0.051
Token fun
Token ID1257
Feature activation-1.05e+00

Pos logit contributions
affles+0.052
affle+0.044
atmeal+0.044
ic+0.041
ils+0.040
Neg logit contributions
packs-0.038
 distance-0.038
 leads-0.038
 direction-0.038
 ho-0.038
Token watching
Token ID4964
Feature activation-3.42e-01

Pos logit contributions
affles+0.045
atmeal+0.040
affle+0.039
ic+0.037
ils+0.036
Neg logit contributions
packs-0.023
 direction-0.022
 leads-0.022
 ho-0.022
 distance-0.022
Token the
Token ID262
Feature activation+7.79e-01

Pos logit contributions
affles+0.013
atmeal+0.011
affle+0.011
ic+0.010
ils+0.010
Neg logit contributions
packs-0.009
 leads-0.009
 direction-0.009
 ho-0.009
 crashing-0.009
Token ac
Token ID936
Feature activation-6.15e+00

Pos logit contributions
 leads+0.021
 distance+0.021
 direction+0.021
 ho+0.021
 pays+0.020
Neg logit contributions
affles-0.029
atmeal-0.025
affle-0.024
ic-0.023
ils-0.022
Tokenrob
Token ID22609
Feature activation-8.31e+00

Pos logit contributions
affles+0.189
 breakfast+0.151
ils+0.148
affle+0.147
 pancakes+0.144
Neg logit contributions
 distance-0.182
 leads-0.177
 pays-0.175
 thanks-0.168
 slows-0.166
Tokenats
Token ID1381
Feature activation+1.03e+00

Pos logit contributions
affles+0.273
affle+0.244
ott+0.241
atmeal+0.236
ic+0.227
Neg logit contributions
packs-0.232
 ho-0.210
 leads-0.208
 pays-0.206
 way-0.201
Token and
Token ID290
Feature activation+4.47e-01

Pos logit contributions
packs+0.022
 distance+0.022
 leads+0.022
 crashing+0.022
 direction+0.022
Neg logit contributions
affles-0.044
affle-0.039
atmeal-0.039
ic-0.036
ils-0.036
Token clown
Token ID25573
Feature activation+1.20e+00

Pos logit contributions
 direction+0.013
packs+0.013
 leads+0.012
 distance+0.012
 slows+0.012
Neg logit contributions
affles-0.015
atmeal-0.013
affle-0.013
ils-0.012
ic-0.012
Tokens
Token ID82
Feature activation+1.99e-01

Pos logit contributions
packs+0.022
 ho+0.022
 leads+0.021
 direction+0.021
 distance+0.021
Neg logit contributions
affles-0.055
atmeal-0.048
affle-0.048
ic-0.046
ils-0.045
Token.
Token ID13
Feature activation-7.78e-02

Pos logit contributions
 leads+0.005
 direction+0.005
 distance+0.005
packs+0.005
 pays+0.005
Neg logit contributions
affles-0.007
atmeal-0.007
affle-0.006
ic-0.006
ils-0.006
Token.
Token ID13
Feature activation+6.65e-02

Pos logit contributions
packs+0.044
 direction+0.043
 ho+0.043
 distance+0.042
 leads+0.041
Neg logit contributions
affles-0.097
affle-0.086
atmeal-0.085
ic-0.081
ilt-0.078
Token
Token ID198
Feature activation+6.82e-01

Pos logit contributions
 steering+0.002
packs+0.002
 pays+0.002
 ho+0.002
 distance+0.002
Neg logit contributions
affles-0.003
atmeal-0.002
affle-0.002
ic-0.002
ott-0.002
Token
Token ID198
Feature activation+7.47e-01

Pos logit contributions
packs+0.012
 leads+0.011
 direction+0.011
 ho+0.011
 distance+0.011
Neg logit contributions
affles-0.032
atmeal-0.028
affle-0.028
ic-0.027
ils-0.026
TokenWhen
Token ID2215
Feature activation+2.58e+00

Pos logit contributions
 distance+0.020
 direction+0.020
 ho+0.020
 leads+0.019
packs+0.019
Neg logit contributions
affles-0.028
atmeal-0.024
affle-0.024
ic-0.023
ils-0.022
Token D
Token ID360
Feature activation-2.40e+00

Pos logit contributions
packs+0.077
 direction+0.075
 leads+0.074
 steering+0.074
 distance+0.074
Neg logit contributions
affles-0.088
atmeal-0.076
affle-0.073
ic-0.070
ils-0.067
Tokenott
Token ID1252
Feature activation-8.29e+00

Pos logit contributions
affles+0.041
ils+0.029
atmeal+0.026
affle+0.026
ic+0.023
Neg logit contributions
 pays-0.109
 leads-0.105
 direction-0.104
 ho-0.104
packs-0.103
Tokenie
Token ID494
Feature activation-1.75e+00

Pos logit contributions
affles+0.223
atmeal+0.200
 breakfast+0.179
 pancakes+0.177
umbers+0.177
Neg logit contributions
 pays-0.291
packs-0.280
 way-0.271
 leads-0.267
 direction-0.264
Token arrived
Token ID5284
Feature activation-3.88e-01

Pos logit contributions
affles+0.067
atmeal+0.059
affle+0.058
ic+0.055
ils+0.054
Neg logit contributions
packs-0.044
 direction-0.043
 distance-0.043
 leads-0.043
 ho-0.043
Token at
Token ID379
Feature activation-6.25e-01

Pos logit contributions
affles+0.016
atmeal+0.014
affle+0.014
ic+0.013
ils+0.013
Neg logit contributions
packs-0.009
 direction-0.009
 leads-0.009
 distance-0.009
 ho-0.009
Token the
Token ID262
Feature activation+8.47e-01

Pos logit contributions
affles+0.028
atmeal+0.024
affle+0.024
ic+0.023
ils+0.022
Neg logit contributions
 distance-0.015
 leads-0.015
packs-0.015
 direction-0.015
 way-0.015
Token lab
Token ID2248
Feature activation+9.93e-01

Pos logit contributions
 leads+0.019
 distance+0.019
 direction+0.019
 pays+0.019
 ho+0.018
Neg logit contributions
affles-0.036
atmeal-0.032
affle-0.031
ic-0.030
ils-0.029
Token she
Token ID673
Feature activation-9.98e-01

Pos logit contributions
affles+0.020
affle+0.017
atmeal+0.017
ic+0.016
ils+0.015
Neg logit contributions
packs-0.010
 leads-0.010
 ho-0.009
 direction-0.009
 pays-0.009
Token was
Token ID373
Feature activation+5.97e-01

Pos logit contributions
affles+0.038
atmeal+0.033
affle+0.033
ic+0.031
ils+0.030
Neg logit contributions
packs-0.026
 direction-0.025
 leads-0.025
 distance-0.025
 ho-0.025
Token inside
Token ID2641
Feature activation-3.31e-01

Pos logit contributions
 direction+0.015
 distance+0.015
 ho+0.015
packs+0.015
 leads+0.015
Neg logit contributions
affles-0.024
atmeal-0.021
affle-0.020
ic-0.020
ils-0.019
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
affles+0.016
atmeal+0.014
affle+0.014
ic+0.014
ils+0.013
Neg logit contributions
packs-0.005
 direction-0.005
 leads-0.005
 distance-0.005
 ho-0.005
Token D
Token ID360
Feature activation-1.53e+00

Pos logit contributions
affles+0.044
affle+0.039
atmeal+0.037
ic+0.036
ils+0.034
Neg logit contributions
 direction-0.028
 distance-0.028
 ho-0.028
packs-0.027
 leads-0.027
Tokenott
Token ID1252
Feature activation-8.28e+00

Pos logit contributions
affles+0.035
atmeal+0.027
affle+0.026
ic+0.024
ils+0.024
Neg logit contributions
packs-0.062
 ho-0.061
 leads-0.061
 direction-0.061
 distance-0.061
Tokenie
Token ID494
Feature activation-2.05e+00

Pos logit contributions
affles+0.226
atmeal+0.185
affle+0.173
umbers+0.166
 breakfast+0.161
Neg logit contributions
packs-0.297
 way-0.294
 pays-0.288
 direction-0.288
 leads-0.274
Token saw
Token ID2497
Feature activation-2.59e+00

Pos logit contributions
affles+0.082
affle+0.070
atmeal+0.070
ic+0.067
ils+0.063
Neg logit contributions
 direction-0.052
 ho-0.052
 distance-0.051
 leads-0.051
packs-0.050
Token lots
Token ID6041
Feature activation-1.83e-01

Pos logit contributions
affles+0.110
atmeal+0.099
affle+0.094
ic+0.093
ils+0.088
Neg logit contributions
 direction-0.064
 distance-0.063
 thanks-0.059
 flap-0.058
 waving-0.057
Token of
Token ID286
Feature activation-1.63e+00

Pos logit contributions
affles+0.007
atmeal+0.007
affle+0.006
ic+0.006
ils+0.006
Neg logit contributions
 distance-0.004
 direction-0.004
 leads-0.004
 thanks-0.004
 crashing-0.004
Token colorful
Token ID20239
Feature activation-2.29e+00

Pos logit contributions
affles+0.060
atmeal+0.059
ic+0.055
at+0.054
affle+0.052
Neg logit contributions
 direction-0.043
 way-0.041
 distance-0.040
 ho-0.039
 crashing-0.038
Token kitchen
Token ID9592
Feature activation-1.54e+00

Pos logit contributions
affles+0.024
atmeal+0.020
affle+0.020
ils+0.020
ic+0.019
Neg logit contributions
 distance-0.015
 direction-0.015
 way-0.013
 leads-0.013
packs-0.013
Token and
Token ID290
Feature activation+1.06e-01

Pos logit contributions
affles+0.069
atmeal+0.061
affle+0.060
ic+0.058
ils+0.057
Neg logit contributions
packs-0.029
 direction-0.028
 leads-0.028
 distance-0.028
 ho-0.028
Token made
Token ID925
Feature activation-2.59e+00

Pos logit contributions
 distance+0.002
 leads+0.002
 direction+0.002
 ho+0.002
 pays+0.002
Neg logit contributions
affles-0.005
affle-0.004
atmeal-0.004
ic-0.004
ilt-0.004
Token a
Token ID257
Feature activation+1.89e-01

Pos logit contributions
affles+0.107
atmeal+0.101
affle+0.095
at+0.094
ase+0.093
Neg logit contributions
 thanks-0.059
 way-0.055
 direction-0.051
 leads-0.051
 ho-0.050
Token fresh
Token ID4713
Feature activation-8.20e-01

Pos logit contributions
packs+0.005
 direction+0.005
 leads+0.005
 distance+0.005
 ho+0.005
Neg logit contributions
affles-0.007
atmeal-0.006
affle-0.006
ic-0.006
ils-0.005
Token batch
Token ID15458
Feature activation-8.27e+00

Pos logit contributions
affles+0.025
atmeal+0.021
affle+0.020
ils+0.019
ic+0.018
Neg logit contributions
 distance-0.028
 direction-0.028
 leads-0.027
 ho-0.027
 pays-0.026
Token of
Token ID286
Feature activation-1.03e+00

Pos logit contributions
affles+0.362
atmeal+0.326
affle+0.319
ils+0.291
ilt+0.290
Neg logit contributions
 leads-0.160
 thanks-0.159
 crashing-0.155
 ho-0.153
 pays-0.151
Token sa
Token ID473
Feature activation-4.86e+00

Pos logit contributions
affles+0.032
atmeal+0.029
affle+0.026
ils+0.026
at+0.026
Neg logit contributions
 thanks-0.033
 direction-0.033
 way-0.032
 distance-0.032
 leads-0.032
Tokenus
Token ID385
Feature activation-6.14e+00

Pos logit contributions
affles+0.126
affle+0.100
atmeal+0.099
ilt+0.091
ils+0.089
Neg logit contributions
 ho-0.176
packs-0.176
 pays-0.170
 leads-0.167
arse-0.165
Tokenages
Token ID1095
Feature activation-2.44e+00

Pos logit contributions
affles+0.184
affle+0.155
atmeal+0.151
ilt+0.143
ic+0.137
Neg logit contributions
 ho-0.205
 pays-0.196
 leads-0.196
packs-0.193
 thanks-0.184
Token.
Token ID13
Feature activation-2.11e-02

Pos logit contributions
affles+0.105
atmeal+0.092
affle+0.092
ic+0.086
ils+0.085
Neg logit contributions
packs-0.051
 leads-0.051
 ho-0.050
 distance-0.050
 direction-0.050
Token house
Token ID2156
Feature activation+3.99e-02

Pos logit contributions
affles+0.011
atmeal+0.009
affle+0.009
ic+0.009
ils+0.009
Neg logit contributions
packs-0.007
 direction-0.007
 leads-0.007
 distance-0.007
 ho-0.007
Token and
Token ID290
Feature activation+5.89e-01

Pos logit contributions
 direction+0.001
packs+0.001
 leads+0.001
 ho+0.001
 distance+0.001
Neg logit contributions
affles-0.002
atmeal-0.001
affle-0.001
ic-0.001
ils-0.001
Token made
Token ID925
Feature activation-2.61e+00

Pos logit contributions
 leads+0.014
packs+0.014
 distance+0.013
 ho+0.013
 pays+0.013
Neg logit contributions
affles-0.026
ic-0.023
affle-0.023
atmeal-0.021
ilt-0.021
Token a
Token ID257
Feature activation+2.71e-01

Pos logit contributions
affles+0.111
atmeal+0.105
affle+0.102
at+0.096
ase+0.096
Neg logit contributions
 thanks-0.053
 way-0.049
 leads-0.049
 ho-0.047
 smiles-0.043
Token fresh
Token ID4713
Feature activation-4.54e-01

Pos logit contributions
 distance+0.006
 ho+0.006
 direction+0.006
 flap+0.006
 steering+0.006
Neg logit contributions
affles-0.011
atmeal-0.010
affle-0.009
ils-0.009
ic-0.009
Token batch
Token ID15458
Feature activation-8.24e+00

Pos logit contributions
affles+0.014
ils+0.011
atmeal+0.011
affle+0.011
making+0.010
Neg logit contributions
 distance-0.016
 thanks-0.016
 direction-0.016
 ho-0.016
 way-0.016
Token of
Token ID286
Feature activation-7.76e-01

Pos logit contributions
affles+0.360
atmeal+0.326
affle+0.320
making+0.290
ilt+0.287
Neg logit contributions
 leads-0.162
 thanks-0.160
 ho-0.152
 pays-0.150
 crashing-0.147
Token blue
Token ID4171
Feature activation-6.80e-01

Pos logit contributions
affles+0.025
atmeal+0.022
affle+0.021
ils+0.021
at+0.020
Neg logit contributions
 thanks-0.026
 ho-0.024
 way-0.024
 hug-0.024
 waving-0.024
Tokenberry
Token ID8396
Feature activation-4.02e+00

Pos logit contributions
affles+0.024
affle+0.021
atmeal+0.020
ils+0.020
ilt+0.019
Neg logit contributions
 ho-0.019
packs-0.019
 leads-0.018
 thanks-0.018
 steering-0.018
Token muff
Token ID27563
Feature activation-5.81e+00

Pos logit contributions
affles+0.129
affle+0.111
dd+0.101
ilt+0.100
atmeal+0.098
Neg logit contributions
 leads-0.124
 ho-0.121
 thanks-0.118
packs-0.114
 way-0.114
Tokenins
Token ID1040
Feature activation-3.08e+00

Pos logit contributions
 accident+0.147
affles+0.147
affle+0.145
 operation+0.142
ZE+0.142
Neg logit contributions
packs-0.162
 ho-0.138
ocular-0.136
s-0.134
stretched-0.133
Token mom
Token ID1995
Feature activation+6.62e-01

Pos logit contributions
 ho+0.006
 leads+0.006
packs+0.006
 direction+0.006
 way+0.006
Neg logit contributions
affles-0.012
atmeal-0.010
affle-0.010
ic-0.010
ils-0.010
Token would
Token ID561
Feature activation+7.85e-01

Pos logit contributions
 leads+0.012
 pays+0.012
packs+0.012
 ho+0.012
 distance+0.011
Neg logit contributions
affles-0.030
affle-0.027
atmeal-0.026
ic-0.025
ils-0.025
Token bake
Token ID28450
Feature activation-2.83e+00

Pos logit contributions
 leads+0.020
 ho+0.020
packs+0.020
 distance+0.020
 direction+0.020
Neg logit contributions
affles-0.029
atmeal-0.025
affle-0.025
ic-0.024
ils-0.023
Token a
Token ID257
Feature activation-4.91e-01

Pos logit contributions
affles+0.102
affle+0.096
ott+0.090
atmeal+0.088
dd+0.087
Neg logit contributions
 thanks-0.069
 leads-0.069
 ho-0.068
packs-0.064
 lets-0.063
Token fresh
Token ID4713
Feature activation-1.10e+00

Pos logit contributions
affles+0.016
dd+0.014
ils+0.014
affle+0.013
atmeal+0.013
Neg logit contributions
 way-0.015
 distance-0.015
 ho-0.014
 hug-0.014
 direction-0.014
Token batch
Token ID15458
Feature activation-8.23e+00

Pos logit contributions
affles+0.038
affle+0.032
ils+0.031
atmeal+0.030
dd+0.030
Neg logit contributions
 distance-0.033
 thanks-0.032
 leads-0.032
 way-0.032
 hug-0.030
Token of
Token ID286
Feature activation-1.16e+00

Pos logit contributions
affles+0.375
affle+0.350
atmeal+0.346
making+0.330
ilt+0.327
Neg logit contributions
 thanks-0.118
 leads-0.113
 pays-0.103
 ho-0.100
 forward-0.094
Token cookies
Token ID14746
Feature activation-1.45e+00

Pos logit contributions
affles+0.040
affle+0.039
dd+0.037
atmeal+0.036
at+0.036
Neg logit contributions
 thanks-0.032
 way-0.031
 leads-0.030
 hug-0.029
 lets-0.028
Token for
Token ID329
Feature activation-4.93e+00

Pos logit contributions
affles+0.067
affle+0.060
atmeal+0.057
ic+0.055
ilt+0.054
Neg logit contributions
 leads-0.026
packs-0.026
 ho-0.024
 distance-0.024
 pays-0.024
Token her
Token ID607
Feature activation-1.39e-01

Pos logit contributions
affles+0.182
affle+0.180
at+0.172
atmeal+0.167
ic+0.166
Neg logit contributions
 ho-0.123
 way-0.104
packs-0.102
 thanks-0.099
 waving-0.095
Token to
Token ID284
Feature activation-1.82e+00

Pos logit contributions
affles+0.006
affle+0.005
atmeal+0.005
ic+0.005
ilt+0.005
Neg logit contributions
packs-0.003
 ho-0.003
 leads-0.003
 way-0.003
 direction-0.003
Token of
Token ID286
Feature activation-8.39e-01

Pos logit contributions
affles+0.123
atmeal+0.107
affle+0.105
ic+0.102
ils+0.098
Neg logit contributions
packs-0.083
 direction-0.082
 leads-0.082
 ho-0.081
 distance-0.081
Token soil
Token ID9260
Feature activation+3.31e-01

Pos logit contributions
affles+0.033
atmeal+0.031
affle+0.029
ic+0.029
at+0.027
Neg logit contributions
 direction-0.022
 leads-0.022
 distance-0.021
 way-0.021
packs-0.021
Token into
Token ID656
Feature activation-3.17e-01

Pos logit contributions
packs+0.008
 slows+0.008
 direction+0.008
 ho+0.008
 distance+0.008
Neg logit contributions
affles-0.012
atmeal-0.011
affle-0.010
ic-0.010
ils-0.010
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
affles+0.014
atmeal+0.012
affle+0.012
at+0.011
ic+0.011
Neg logit contributions
 direction-0.008
 distance-0.008
packs-0.008
 leads-0.008
 steering-0.008
Token wheel
Token ID7825
Feature activation-1.75e+00

Pos logit contributions
affles+0.050
affle+0.043
atmeal+0.043
ic+0.041
ils+0.040
Neg logit contributions
 distance-0.032
 direction-0.031
 ho-0.029
 leads-0.029
 way-0.028
Tokenbar
Token ID5657
Feature activation-8.18e+00

Pos logit contributions
affles+0.064
atmeal+0.057
ic+0.055
affle+0.055
ils+0.051
Neg logit contributions
packs-0.044
 leads-0.044
 direction-0.044
 ho-0.044
 distance-0.044
Tokenrow
Token ID808
Feature activation-2.64e+00

Pos logit contributions
affles+0.208
atmeal+0.168
affle+0.164
ic+0.154
ils+0.142
Neg logit contributions
packs-0.309
 distance-0.309
 leads-0.309
 direction-0.309
 ho-0.307
Token.
Token ID13
Feature activation-9.64e-02

Pos logit contributions
affles+0.097
atmeal+0.086
ic+0.086
affle+0.084
ils+0.079
Neg logit contributions
packs-0.066
 ho-0.065
 leads-0.064
 slows-0.064
 direction-0.064
Token It
Token ID632
Feature activation-1.24e+00

Pos logit contributions
affles+0.003
atmeal+0.003
affle+0.003
 breakfast+0.003
ic+0.003
Neg logit contributions
 flap-0.003
 ho-0.003
packs-0.003
 distance-0.002
 leads-0.002
Token was
Token ID373
Feature activation+4.98e-01

Pos logit contributions
affles+0.056
atmeal+0.047
affle+0.047
ils+0.043
ic+0.043
Neg logit contributions
 leads-0.029
 direction-0.027
 way-0.026
 pays-0.026
 flap-0.026
Token heavy
Token ID4334
Feature activation+7.82e-01

Pos logit contributions
packs+0.014
 direction+0.014
 leads+0.014
 distance+0.014
 ho+0.014
Neg logit contributions
affles-0.018
atmeal-0.015
affle-0.015
ic-0.014
ils-0.014
Token the
Token ID262
Feature activation+1.01e-01

Pos logit contributions
affles+0.021
atmeal+0.018
affle+0.018
ic+0.017
ils+0.016
Neg logit contributions
 distance-0.013
 direction-0.013
 leads-0.013
 ho-0.013
packs-0.013
Token church
Token ID4928
Feature activation-1.61e+00

Pos logit contributions
 distance+0.003
 direction+0.003
 leads+0.002
 way+0.002
 steering+0.002
Neg logit contributions
affles-0.004
affle-0.004
atmeal-0.004
ic-0.004
ils-0.003
Token and
Token ID290
Feature activation-3.78e-01

Pos logit contributions
affles+0.070
atmeal+0.062
affle+0.061
ic+0.058
ils+0.057
Neg logit contributions
packs-0.033
 direction-0.033
 distance-0.033
 leads-0.032
 ho-0.032
Token noticed
Token ID6810
Feature activation-2.82e+00

Pos logit contributions
affles+0.016
affle+0.014
atmeal+0.013
ic+0.013
ilt+0.012
Neg logit contributions
 distance-0.010
 direction-0.010
 ho-0.009
 leads-0.009
 pays-0.009
Token a
Token ID257
Feature activation+6.08e-01

Pos logit contributions
affles+0.101
atmeal+0.088
affle+0.086
ic+0.077
ilt+0.075
Neg logit contributions
 direction-0.085
 distance-0.083
 leads-0.083
 crashing-0.082
 ho-0.082
Token freshly
Token ID29026
Feature activation-8.18e+00

Pos logit contributions
 direction+0.016
 distance+0.016
packs+0.016
 leads+0.015
 ho+0.015
Neg logit contributions
affles-0.023
atmeal-0.020
affle-0.020
ic-0.019
ils-0.018
Token dug
Token ID18735
Feature activation-4.15e+00

Pos logit contributions
affles+0.300
atmeal+0.238
affle+0.238
ic+0.225
ils+0.213
Neg logit contributions
 distance-0.242
 direction-0.239
 ho-0.231
 way-0.229
 flap-0.228
Token hole
Token ID7604
Feature activation-1.40e+00

Pos logit contributions
affles+0.168
atmeal+0.147
affle+0.144
ic+0.133
ils+0.129
Neg logit contributions
 direction-0.103
 distance-0.102
 leads-0.102
packs-0.099
 crashing-0.098
Token in
Token ID287
Feature activation+2.05e+00

Pos logit contributions
affles+0.047
atmeal+0.040
affle+0.040
ic+0.040
ils+0.038
Neg logit contributions
packs-0.040
 leads-0.039
 pays-0.039
 direction-0.039
 distance-0.039
Token the
Token ID262
Feature activation-9.65e-01

Pos logit contributions
packs+0.065
 leads+0.064
 ho+0.064
 slows+0.063
 pays+0.063
Neg logit contributions
affles-0.062
atmeal-0.052
affle-0.052
ic-0.050
ils-0.048
Token ground
Token ID2323
Feature activation+2.30e-01

Pos logit contributions
affles+0.036
atmeal+0.030
affle+0.030
ic+0.029
ils+0.028
Neg logit contributions
 distance-0.027
 direction-0.027
 ho-0.026
 leads-0.026
packs-0.026
Token a
Token ID257
Feature activation+6.34e-01

Pos logit contributions
affles+0.001
atmeal+0.001
affle+0.001
ic+0.001
ils+0.001
Neg logit contributions
packs-0.001
 direction-0.001
 leads-0.001
 distance-0.001
 ho-0.001
Token big
Token ID1263
Feature activation+6.32e-01

Pos logit contributions
packs+0.018
 leads+0.017
 pays+0.016
 slows+0.016
 lets+0.016
Neg logit contributions
affles-0.020
affle-0.019
atmeal-0.018
ic-0.017
 breakfast-0.017
Token box
Token ID3091
Feature activation+3.37e-02

Pos logit contributions
packs+0.013
 pays+0.012
 leads+0.012
 slows+0.011
 lets+0.011
Neg logit contributions
affles-0.025
affle-0.024
atmeal-0.022
 breakfast-0.022
ic-0.022
Token of
Token ID286
Feature activation-1.44e+00

Pos logit contributions
packs+0.001
 slows+0.001
 pays+0.001
 direction+0.001
 leads+0.001
Neg logit contributions
affles-0.001
atmeal-0.001
ic-0.001
affle-0.001
ils-0.001
Token ch
Token ID442
Feature activation-1.52e+00

Pos logit contributions
affles+0.050
atmeal+0.044
affle+0.042
ic+0.041
ils+0.039
Neg logit contributions
packs-0.042
 direction-0.041
 distance-0.041
 leads-0.041
 ho-0.041
Tokenocol
Token ID4668
Feature activation-8.17e+00

Pos logit contributions
affles+0.037
atmeal+0.031
affle+0.029
ic+0.027
ils+0.026
Neg logit contributions
packs-0.061
 ho-0.060
 direction-0.059
 leads-0.059
 distance-0.058
Tokenates
Token ID689
Feature activation-5.35e-01

Pos logit contributions
affles+0.200
atmeal+0.158
affle+0.155
ic+0.143
ils+0.137
Neg logit contributions
 direction-0.320
packs-0.320
 distance-0.319
 leads-0.319
 ho-0.315
Token!
Token ID0
Feature activation-8.91e-01

Pos logit contributions
affles+0.021
atmeal+0.019
affle+0.019
ic+0.018
ils+0.017
Neg logit contributions
packs-0.012
 direction-0.012
 leads-0.012
 distance-0.012
 ho-0.012
Token<|endoftext|>
Token ID50256
Feature activation-3.36e-01

Pos logit contributions
affles+0.035
atmeal+0.032
affle+0.031
ic+0.030
ott+0.029
Neg logit contributions
 ho-0.020
packs-0.019
 leads-0.019
 steering-0.019
 flap-0.019
TokenOnce
Token ID7454
Feature activation-4.02e-01

Pos logit contributions
affles+0.012
atmeal+0.011
affle+0.010
ic+0.010
ils+0.010
Neg logit contributions
 ho-0.009
packs-0.009
 steering-0.009
 direction-0.009
 distance-0.009
Token upon
Token ID2402
Feature activation-2.24e-01

Pos logit contributions
affles+0.017
atmeal+0.015
affle+0.015
ic+0.014
ils+0.014
Neg logit contributions
 ho-0.008
 distance-0.008
 steering-0.008
 leads-0.008
 direction-0.008
Token help
Token ID1037
Feature activation+1.84e+00

Pos logit contributions
 ho+0.018
 leads+0.018
 distance+0.018
 direction+0.018
packs+0.018
Neg logit contributions
affles-0.036
atmeal-0.032
affle-0.031
ic-0.031
ils-0.029
Token her
Token ID607
Feature activation+3.53e-01

Pos logit contributions
 leads+0.054
 ho+0.052
packs+0.051
 flap+0.051
 distance+0.051
Neg logit contributions
affles-0.066
atmeal-0.058
affle-0.057
ic-0.054
at-0.050
Token make
Token ID787
Feature activation-2.14e+00

Pos logit contributions
packs+0.008
 leads+0.008
 direction+0.008
 ho+0.008
 flap+0.008
Neg logit contributions
affles-0.015
affle-0.013
atmeal-0.013
ic-0.012
ils-0.012
Token a
Token ID257
Feature activation-2.67e-01

Pos logit contributions
affles+0.092
atmeal+0.087
at+0.086
affle+0.084
ase+0.083
Neg logit contributions
 thanks-0.041
 way-0.041
 leads-0.040
 flap-0.038
 pays-0.035
Token new
Token ID649
Feature activation-3.54e-03

Pos logit contributions
affles+0.010
atmeal+0.008
affle+0.008
ils+0.008
ic+0.008
Neg logit contributions
 direction-0.008
 way-0.008
 distance-0.008
 flap-0.008
 ho-0.008
Token batch
Token ID15458
Feature activation-8.17e+00

Pos logit contributions
affles+0.000
atmeal+0.000
affle+0.000
ils+0.000
at+0.000
Neg logit contributions
 way-0.000
 direction-0.000
 leads-0.000
 flap-0.000
 ho-0.000
Token of
Token ID286
Feature activation-1.25e+00

Pos logit contributions
affles+0.371
atmeal+0.340
affle+0.334
ic+0.304
making+0.303
Neg logit contributions
 leads-0.148
 thanks-0.139
 pays-0.133
 ho-0.125
 crashing-0.123
Token dough
Token ID15756
Feature activation-1.16e+00

Pos logit contributions
affles+0.044
atmeal+0.040
affle+0.040
at+0.038
dd+0.037
Neg logit contributions
 way-0.035
 leads-0.035
 flap-0.035
 thanks-0.034
 direction-0.033
Token.
Token ID13
Feature activation-5.26e-01

Pos logit contributions
affles+0.054
affle+0.049
atmeal+0.048
ic+0.045
ils+0.044
Neg logit contributions
 leads-0.022
packs-0.022
 pays-0.020
 ho-0.020
 flap-0.019
Token Max
Token ID5436
Feature activation+5.20e-01

Pos logit contributions
affles+0.024
atmeal+0.021
affle+0.021
ic+0.020
ils+0.020
Neg logit contributions
packs-0.010
 distance-0.010
 direction-0.010
 ho-0.010
 leads-0.009
Token was
Token ID373
Feature activation-3.54e-02

Pos logit contributions
 leads+0.011
 pays+0.011
 lets+0.010
 ho+0.010
 way+0.010
Neg logit contributions
affles-0.024
atmeal-0.020
affle-0.020
ic-0.020
ilt-0.019
Token the
Token ID262
Feature activation+9.18e-01

Pos logit contributions
affles+0.027
atmeal+0.024
affle+0.023
ic+0.023
ilt+0.021
Neg logit contributions
 ho-0.020
packs-0.020
 leads-0.019
 direction-0.019
 distance-0.019
Token other
Token ID584
Feature activation+6.95e-01

Pos logit contributions
packs+0.028
 leads+0.027
 direction+0.027
 distance+0.027
 ho+0.027
Neg logit contributions
affles-0.031
atmeal-0.026
affle-0.026
ic-0.024
ils-0.023
Token kids
Token ID3988
Feature activation+9.42e-01

Pos logit contributions
 slows+0.024
 crashing+0.024
 ho+0.024
 distance+0.024
 direction+0.023
Neg logit contributions
affles-0.020
atmeal-0.016
affle-0.016
ils-0.015
ic-0.015
Token.
Token ID13
Feature activation+7.99e-01

Pos logit contributions
 direction+0.025
 distance+0.024
 slows+0.024
 leads+0.024
 ho+0.024
Neg logit contributions
affles-0.034
atmeal-0.030
affle-0.029
ils-0.028
ic-0.028
Token Come
Token ID7911
Feature activation+1.70e+00

Pos logit contributions
arse+0.024
 direction+0.023
 flap+0.023
packs+0.023
 ho+0.023
Neg logit contributions
affles-0.025
affle-0.022
atmeal-0.020
ily-0.020
 pasta-0.020
Token in
Token ID287
Feature activation+4.79e+00

Pos logit contributions
 direction+0.050
 distance+0.049
 pays+0.048
 slows+0.048
 charges+0.047
Neg logit contributions
affles-0.054
atmeal-0.049
affle-0.047
 breakfast-0.044
at-0.043
Token."
Token ID526
Feature activation-1.39e+00

Pos logit contributions
 slows+0.120
packs+0.118
 distance+0.116
 direction+0.116
 steering+0.116
Neg logit contributions
affles-0.176
affle-0.152
atmeal-0.151
ic-0.147
 breakfast-0.144
Token
Token ID198
Feature activation+4.16e-01

Pos logit contributions
affles+0.047
atmeal+0.039
affle+0.039
ott+0.039
 breakfast+0.038
Neg logit contributions
 steering-0.039
arse-0.038
 crashing-0.037
 ho-0.037
 flap-0.036
Token
Token ID198
Feature activation+4.81e-01

Pos logit contributions
packs+0.009
 ho+0.009
 leads+0.009
 direction+0.009
 steering+0.008
Neg logit contributions
affles-0.017
affle-0.015
atmeal-0.015
ic-0.015
ils-0.014
TokenAnna
Token ID31160
Feature activation-7.74e-01

Pos logit contributions
packs+0.015
 leads+0.014
 direction+0.014
 ho+0.014
 distance+0.014
Neg logit contributions
affles-0.016
atmeal-0.013
affle-0.013
ic-0.013
ils-0.012
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
affles+0.030
atmeal+0.027
affle+0.026
ils+0.026
ic+0.025
Neg logit contributions
packs-0.018
 distance-0.018
 direction-0.018
 crashing-0.018
 steering-0.017
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
affles+0.013
atmeal+0.012
affle+0.012
ic+0.012
ils+0.012
Neg logit contributions
packs-0.006
 distance-0.005
 direction-0.005
 ho-0.005
 leads-0.005
Token "
Token ID366
Feature activation+3.28e-01

Pos logit contributions
packs+0.029
 direction+0.027
 distance+0.026
 leads+0.026
 pays+0.025
Neg logit contributions
affles-0.051
atmeal-0.046
ic-0.045
affle-0.044
ils-0.042
TokenBy
Token ID3886
Feature activation+2.64e+00

Pos logit contributions
packs+0.011
 leads+0.010
 direction+0.010
 ho+0.010
 slam+0.010
Neg logit contributions
affles-0.010
atmeal-0.009
ic-0.009
affle-0.008
ils-0.008
Tokene
Token ID68
Feature activation-7.03e-01

Pos logit contributions
 ho+0.089
 direction+0.088
 leads+0.088
 flap+0.088
 distance+0.088
Neg logit contributions
affles-0.081
atmeal-0.067
affle-0.066
ic-0.061
ils-0.057
Token-
Token ID12
Feature activation-2.52e+00

Pos logit contributions
affles+0.034
atmeal+0.030
affle+0.030
ic+0.028
ils+0.028
Neg logit contributions
 ho-0.012
 leads-0.011
packs-0.011
 direction-0.011
 distance-0.011
Tokenbye
Token ID16390
Feature activation+4.43e+00

Pos logit contributions
affles+0.124
affle+0.111
atmeal+0.108
ils+0.101
ic+0.101
Neg logit contributions
 distance-0.039
 ho-0.038
 pays-0.037
 direction-0.037
 slows-0.037
Token!"
Token ID2474
Feature activation-2.53e+00

Pos logit contributions
packs+0.103
 direction+0.101
 distance+0.099
 leads+0.099
 pays+0.096
Neg logit contributions
affles-0.174
atmeal-0.155
affle-0.151
ic-0.149
ils-0.145
Token and
Token ID290
Feature activation+3.03e-01

Pos logit contributions
affles+0.094
atmeal+0.087
ic+0.083
affle+0.081
 breakfast+0.079
Neg logit contributions
 direction-0.058
packs-0.058
 steering-0.057
 distance-0.056
 ho-0.056
Token ran
Token ID4966
Feature activation+1.88e+00

Pos logit contributions
packs+0.008
 direction+0.008
 leads+0.008
 distance+0.008
 ho+0.008
Neg logit contributions
affles-0.011
atmeal-0.010
affle-0.010
ic-0.009
ils-0.009
Token off
Token ID572
Feature activation+1.97e+00

Pos logit contributions
packs+0.040
 leads+0.039
 direction+0.039
 distance+0.038
 ho+0.038
Neg logit contributions
affles-0.079
affle-0.070
atmeal-0.070
ic-0.067
ils-0.066
Token.
Token ID13
Feature activation+6.77e-03

Pos logit contributions
packs+0.046
 direction+0.045
 leads+0.045
 pays+0.044
 slows+0.044
Neg logit contributions
affles-0.075
atmeal-0.066
affle-0.065
ic-0.064
ils-0.063
Token bye
Token ID33847
Feature activation+4.13e+00

Pos logit contributions
 ho+0.052
 leads+0.052
 direction+0.051
packs+0.051
 pays+0.051
Neg logit contributions
affles-0.064
atmeal-0.055
affle-0.054
ic-0.050
ils-0.048
Token and
Token ID290
Feature activation+6.01e-01

Pos logit contributions
packs+0.113
 distance+0.111
 direction+0.110
 flap+0.110
 leads+0.108
Neg logit contributions
affles-0.138
ic-0.124
atmeal-0.123
ils-0.119
affle-0.118
Token went
Token ID1816
Feature activation+1.05e+00

Pos logit contributions
packs+0.016
 direction+0.015
 leads+0.015
 distance+0.015
 slows+0.015
Neg logit contributions
affles-0.021
atmeal-0.019
affle-0.018
ic-0.018
ils-0.018
Token on
Token ID319
Feature activation+7.11e-01

Pos logit contributions
 leads+0.029
 pays+0.029
packs+0.029
 ho+0.028
 direction+0.028
Neg logit contributions
affles-0.035
atmeal-0.031
affle-0.030
ils-0.029
ic-0.029
Token their
Token ID511
Feature activation-1.81e+00

Pos logit contributions
 slows+0.020
arse+0.020
hole+0.020
 lets+0.020
 leads+0.020
Neg logit contributions
affles-0.022
 breakfast-0.020
ils-0.018
 dinner-0.018
affle-0.018
Token way
Token ID835
Feature activation+5.37e+00

Pos logit contributions
affles+0.067
 breakfast+0.063
atmeal+0.060
ils+0.060
 dinner+0.058
Neg logit contributions
 crashing-0.038
packs-0.038
stretched-0.038
hole-0.038
arl-0.037
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
 pays+0.200
 lets+0.199
 aims+0.196
 charges+0.194
 leads+0.192
Neg logit contributions
 all-0.094
 breakfast-0.090
 dinner-0.087
ie-0.087
 that-0.084
Token Bobby
Token ID17155
Feature activation-1.29e+00

Pos logit contributions
affles+0.005
 breakfast+0.005
ott+0.005
ic+0.005
affle+0.005
Neg logit contributions
 ho-0.006
packs-0.005
 distance-0.005
 brake-0.005
 flap-0.005
Token and
Token ID290
Feature activation+1.11e-01

Pos logit contributions
affles+0.048
atmeal+0.042
affle+0.042
ic+0.041
ils+0.040
Neg logit contributions
packs-0.033
 direction-0.031
 leads-0.031
 distance-0.031
 steering-0.031
Token Daisy
Token ID40355
Feature activation+6.76e-01

Pos logit contributions
packs+0.003
 direction+0.003
 leads+0.003
 ho+0.003
 distance+0.003
Neg logit contributions
affles-0.004
atmeal-0.004
affle-0.004
ic-0.003
ils-0.003
Token were
Token ID547
Feature activation-2.05e-01

Pos logit contributions
packs+0.018
 direction+0.018
 leads+0.017
 steering+0.017
 crashing+0.017
Neg logit contributions
affles-0.023
atmeal-0.020
ils-0.019
affle-0.019
ic-0.019
Token flowers
Token ID12734
Feature activation+1.74e+00

Pos logit contributions
affles+0.040
atmeal+0.034
affle+0.034
ic+0.032
ils+0.031
Neg logit contributions
 direction-0.029
packs-0.029
 distance-0.029
 leads-0.029
 ho-0.029
Token.
Token ID13
Feature activation-7.54e-02

Pos logit contributions
packs+0.040
 direction+0.040
 leads+0.039
 distance+0.039
 ho+0.039
Neg logit contributions
affles-0.071
atmeal-0.062
affle-0.061
ic-0.059
ils-0.057
Token He
Token ID679
Feature activation-1.05e-01

Pos logit contributions
affles+0.003
atmeal+0.003
affle+0.003
ic+0.003
ils+0.003
Neg logit contributions
packs-0.002
 ho-0.002
 direction-0.002
 leads-0.002
 distance-0.002
Token knew
Token ID2993
Feature activation-1.22e+00

Pos logit contributions
affles+0.005
ic+0.004
atmeal+0.004
affle+0.004
ilt+0.004
Neg logit contributions
 ho-0.002
 leads-0.002
 distance-0.002
 way-0.002
 flap-0.002
Token the
Token ID262
Feature activation+1.87e+00

Pos logit contributions
affles+0.043
affle+0.037
atmeal+0.036
ic+0.036
ils+0.033
Neg logit contributions
 ho-0.036
 direction-0.036
 leads-0.036
packs-0.035
 distance-0.035
Token grass
Token ID8701
Feature activation+5.72e+00

Pos logit contributions
packs+0.054
 leads+0.052
 pays+0.051
 direction+0.050
 slows+0.050
Neg logit contributions
affles-0.066
atmeal-0.057
affle-0.056
ic-0.053
ils-0.051
Token was
Token ID373
Feature activation+1.34e+00

Pos logit contributions
packs+0.147
 direction+0.144
 leads+0.143
 distance+0.142
 ho+0.142
Neg logit contributions
affles-0.216
atmeal-0.189
affle-0.185
ic-0.178
ils-0.173
Token safe
Token ID3338
Feature activation+8.82e-01

Pos logit contributions
 distance+0.035
 direction+0.035
packs+0.035
 leads+0.035
 ho+0.035
Neg logit contributions
affles-0.051
atmeal-0.044
affle-0.043
ic-0.042
ils-0.040
Token and
Token ID290
Feature activation-1.38e-01

Pos logit contributions
 slows+0.023
packs+0.023
 steering+0.023
arse+0.022
 flap+0.022
Neg logit contributions
affles-0.030
atmeal-0.027
ic-0.026
affle-0.025
ils-0.025
Token harmless
Token ID23585
Feature activation+5.15e-01

Pos logit contributions
affles+0.005
atmeal+0.004
affle+0.004
ic+0.004
ils+0.004
Neg logit contributions
packs-0.004
 leads-0.004
 direction-0.004
 ho-0.004
 distance-0.004
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
packs+0.012
 distance+0.012
 direction+0.012
 leads+0.012
 ho+0.012
Neg logit contributions
affles-0.020
atmeal-0.017
affle-0.017
ic-0.017
ils-0.016
Token known
Token ID1900
Feature activation+1.12e+00

Pos logit contributions
 distance+0.009
 direction+0.009
 ho+0.009
packs+0.008
 crashing+0.008
Neg logit contributions
affles-0.014
affle-0.012
atmeal-0.012
ic-0.011
ils-0.011
Token for
Token ID329
Feature activation-2.59e+00

Pos logit contributions
packs+0.032
 leads+0.031
 direction+0.030
 distance+0.030
 slows+0.030
Neg logit contributions
affles-0.038
affle-0.033
atmeal-0.032
ic-0.031
ils-0.031
Token his
Token ID465
Feature activation+1.27e+00

Pos logit contributions
affles+0.096
atmeal+0.084
affle+0.082
ic+0.078
ilt+0.074
Neg logit contributions
 ho-0.072
 direction-0.071
 leads-0.070
 crashing-0.070
 distance-0.070
Token strength
Token ID4202
Feature activation+6.27e+00

Pos logit contributions
packs+0.036
 distance+0.034
 leads+0.034
 slows+0.033
 direction+0.033
Neg logit contributions
affles-0.044
affle-0.038
atmeal-0.037
ic-0.036
ils-0.035
Token,
Token ID11
Feature activation-8.05e-01

Pos logit contributions
packs+0.162
 slows+0.160
 leads+0.158
 distance+0.157
 direction+0.155
Neg logit contributions
affles-0.214
atmeal-0.190
ic-0.187
affle-0.181
ils-0.179
Token courage
Token ID11917
Feature activation+6.34e+00

Pos logit contributions
affles+0.027
atmeal+0.024
affle+0.023
ic+0.022
ils+0.022
Neg logit contributions
packs-0.022
arse-0.021
 leads-0.021
 slows-0.021
 direction-0.021
Token and
Token ID290
Feature activation+4.09e-01

Pos logit contributions
packs+0.181
 slows+0.171
 direction+0.171
 distance+0.170
 leads+0.167
Neg logit contributions
affles-0.200
ic-0.178
atmeal-0.175
ils-0.174
affle-0.169
Token success
Token ID1943
Feature activation+1.37e+00

Pos logit contributions
packs+0.012
 distance+0.011
 ho+0.011
 leads+0.011
 direction+0.011
Neg logit contributions
affles-0.014
atmeal-0.012
affle-0.012
ic-0.012
ils-0.012
Token.
Token ID13
Feature activation-8.37e-02

Pos logit contributions
packs+0.033
 leads+0.033
 direction+0.033
 distance+0.033
 ho+0.033
Neg logit contributions
affles-0.054
atmeal-0.047
affle-0.047
ic-0.044
ils-0.043
Token<|endoftext|>
Token ID50256
Feature activation-5.56e-01

Pos logit contributions
affles+0.004
atmeal+0.003
affle+0.003
ic+0.003
ils+0.003
Neg logit contributions
packs-0.002
 ho-0.002
 leads-0.002
 direction-0.002
 distance-0.002
TokenOnce
Token ID7454
Feature activation-4.62e-01

Pos logit contributions
affles+0.022
atmeal+0.019
affle+0.019
ic+0.018
ils+0.017
Neg logit contributions
 ho-0.013
packs-0.013
 leads-0.013
 direction-0.013
 distance-0.013
Token there
Token ID612
Feature activation+6.52e-01

Pos logit contributions
affles+0.049
atmeal+0.042
affle+0.042
ic+0.039
ils+0.038
Neg logit contributions
 distance-0.043
 leads-0.043
packs-0.043
 direction-0.043
 pays-0.042
Token was
Token ID373
Feature activation-3.91e-01

Pos logit contributions
packs+0.018
 direction+0.018
 leads+0.018
 distance+0.018
 ho+0.018
Neg logit contributions
affles-0.023
atmeal-0.020
affle-0.020
ic-0.019
ils-0.018
Token a
Token ID257
Feature activation+9.15e-01

Pos logit contributions
affles+0.016
atmeal+0.014
ic+0.014
affle+0.014
ils+0.013
Neg logit contributions
packs-0.009
 ho-0.009
 distance-0.009
 thanks-0.008
 direction-0.008
Token dog
Token ID3290
Feature activation+7.27e-02

Pos logit contributions
 ho+0.022
 distance+0.022
packs+0.022
 direction+0.022
 leads+0.022
Neg logit contributions
affles-0.036
atmeal-0.031
affle-0.031
ic-0.030
ils-0.029
Token named
Token ID3706
Feature activation+4.80e-01

Pos logit contributions
 distance+0.001
 ho+0.001
packs+0.001
 leads+0.001
 direction+0.001
Neg logit contributions
affles-0.003
atmeal-0.003
affle-0.003
ic-0.003
ils-0.003
Token Spot
Token ID15899
Feature activation+2.34e+00

Pos logit contributions
packs+0.010
 direction+0.010
 distance+0.009
 leads+0.009
 pays+0.009
Neg logit contributions
affles-0.021
atmeal-0.018
affle-0.018
ic-0.017
ils-0.017
Token.
Token ID13
Feature activation+5.87e-01

Pos logit contributions
 direction+0.064
 leads+0.063
 slows+0.063
packs+0.063
 distance+0.062
Neg logit contributions
affles-0.083
atmeal-0.072
affle-0.072
ils-0.067
ic-0.066
Token Spot
Token ID15899
Feature activation+7.83e-01

Pos logit contributions
 pays+0.015
packs+0.015
 direction+0.015
arse+0.014
 distance+0.014
Neg logit contributions
affles-0.022
affle-0.019
atmeal-0.019
 breakfast-0.018
ic-0.016
Token loved
Token ID6151
Feature activation-2.39e-01

Pos logit contributions
packs+0.021
 leads+0.021
 direction+0.021
 distance+0.021
 ho+0.021
Neg logit contributions
affles-0.029
atmeal-0.025
affle-0.025
ic-0.024
ils-0.023
Token to
Token ID284
Feature activation-2.00e+00

Pos logit contributions
affles+0.011
affle+0.010
atmeal+0.010
ic+0.010
ilt+0.009
Neg logit contributions
 ho-0.004
packs-0.004
 leads-0.004
 distance-0.004
 crashing-0.004
Token sail
Token ID14936
Feature activation+1.30e+00

Pos logit contributions
affles+0.089
ic+0.078
atmeal+0.078
affle+0.078
ils+0.075
Neg logit contributions
 ho-0.039
 leads-0.039
packs-0.038
 distance-0.038
 flap-0.037
Token to
Token ID284
Feature activation-4.25e-01

Pos logit contributions
affles+0.113
affle+0.101
atmeal+0.097
ic+0.092
at+0.089
Neg logit contributions
 direction-0.050
 pays-0.049
 leads-0.049
 distance-0.049
 way-0.048
Token rest
Token ID1334
Feature activation+3.58e-01

Pos logit contributions
affles+0.018
atmeal+0.015
affle+0.015
ic+0.015
ils+0.014
Neg logit contributions
packs-0.009
 distance-0.009
 direction-0.009
 leads-0.009
 ho-0.009
Token when
Token ID618
Feature activation+1.84e+00

Pos logit contributions
packs+0.009
 direction+0.009
 distance+0.009
 leads+0.009
 ho+0.009
Neg logit contributions
affles-0.014
atmeal-0.012
affle-0.012
ic-0.012
ils-0.011
Token it
Token ID340
Feature activation-1.91e+00

Pos logit contributions
 direction+0.054
 distance+0.054
 ho+0.053
 leads+0.053
packs+0.052
Neg logit contributions
affles-0.067
affle-0.058
atmeal-0.055
ic-0.054
ils-0.051
Token was
Token ID373
Feature activation+6.03e-01

Pos logit contributions
affles+0.081
affle+0.070
ic+0.067
atmeal+0.065
ils+0.063
Neg logit contributions
 leads-0.048
 pays-0.045
 distance-0.043
 ho-0.043
 lets-0.043
Token tired
Token ID10032
Feature activation-5.78e-03

Pos logit contributions
 distance+0.016
 direction+0.016
 ho+0.016
packs+0.016
 leads+0.015
Neg logit contributions
affles-0.023
atmeal-0.020
affle-0.019
ic-0.019
ils-0.018
Token so
Token ID523
Feature activation-2.76e+00

Pos logit contributions
affles+0.000
atmeal+0.000
ic+0.000
affle+0.000
ils+0.000
Neg logit contributions
packs-0.000
 leads-0.000
 steering-0.000
 slows-0.000
 direction-0.000
Token it
Token ID340
Feature activation-2.62e+00

Pos logit contributions
affles+0.101
atmeal+0.088
affle+0.086
ic+0.082
ils+0.080
Neg logit contributions
packs-0.074
 direction-0.073
 leads-0.073
 distance-0.073
 ho-0.072
Token could
Token ID714
Feature activation+8.87e-01

Pos logit contributions
affles+0.113
affle+0.098
atmeal+0.096
ic+0.093
ils+0.088
Neg logit contributions
 leads-0.060
 pays-0.058
 direction-0.057
 distance-0.057
 ho-0.056
Token have
Token ID423
Feature activation-3.18e+00

Pos logit contributions
 distance+0.018
 ho+0.018
 flap+0.018
 leads+0.018
 direction+0.017
Neg logit contributions
affles-0.040
affle-0.034
ic-0.034
atmeal-0.034
ils-0.033
Token the
Token ID262
Feature activation-2.36e-01

Pos logit contributions
affles+0.124
affle+0.107
atmeal+0.103
ic+0.099
ils+0.092
Neg logit contributions
 direction-0.087
 distance-0.086
 way-0.086
 leads-0.084
 thanks-0.084
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
 steering+0.039
packs+0.038
 distance+0.037
 aston+0.037
 pays+0.037
Neg logit contributions
affles-0.032
affle-0.024
ic-0.024
atmeal-0.023
ils-0.023
Token
Token ID198
Feature activation+7.39e-01

Pos logit contributions
 steering+0.010
arse+0.010
 aston+0.010
packs+0.010
 horror+0.009
Neg logit contributions
affles-0.013
atmeal-0.011
affle-0.011
ic-0.010
 breakfast-0.010
Token
Token ID198
Feature activation+7.75e-01

Pos logit contributions
 ho+0.015
packs+0.015
 leads+0.015
 direction+0.014
 steering+0.014
Neg logit contributions
affles-0.032
affle-0.028
atmeal-0.027
ic-0.027
ils-0.026
Token"
Token ID1
Feature activation-1.48e+00

Pos logit contributions
packs+0.023
 steering+0.022
 horror+0.022
 direction+0.022
 leads+0.022
Neg logit contributions
affles-0.025
atmeal-0.021
affle-0.021
ic-0.020
ils-0.020
TokenHi
Token ID17250
Feature activation+2.01e+00

Pos logit contributions
affles+0.049
atmeal+0.044
ic+0.042
affle+0.041
ils+0.039
Neg logit contributions
packs-0.042
 leads-0.041
 pays-0.040
 direction-0.039
 steering-0.039
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
 distance+0.052
 steering+0.050
packs+0.048
 crashing+0.048
 direction+0.047
Neg logit contributions
affles-0.074
affle-0.065
atmeal-0.064
ic-0.063
ils-0.062
Token I
Token ID314
Feature activation+1.31e+00

Pos logit contributions
packs+0.044
 direction+0.039
 distance+0.039
stretched+0.037
arse+0.036
Neg logit contributions
affles-0.040
affle-0.035
atmeal-0.032
ic-0.029
ils-0.029
Token am
Token ID716
Feature activation-1.39e+00

Pos logit contributions
packs+0.035
 direction+0.035
 leads+0.034
 steering+0.034
 distance+0.034
Neg logit contributions
affles-0.048
atmeal-0.042
affle-0.042
ic-0.039
ils-0.038
Token Tip
Token ID23095
Feature activation+2.93e+00

Pos logit contributions
affles+0.058
atmeal+0.052
affle+0.050
ic+0.050
ils+0.048
Neg logit contributions
 ho-0.032
 leads-0.031
 direction-0.031
packs-0.030
 distance-0.030
Token,"
Token ID553
Feature activation-4.53e+00

Pos logit contributions
 ho+0.062
 leads+0.060
packs+0.060
 direction+0.059
 distance+0.059
Neg logit contributions
affles-0.127
atmeal-0.113
affle-0.110
ic-0.105
ils-0.102
Token said
Token ID531
Feature activation-2.50e+00

Pos logit contributions
affles+0.191
atmeal+0.167
affle+0.165
ic+0.160
ils+0.158
Neg logit contributions
packs-0.093
 distance-0.091
 direction-0.089
 crashing-0.089
 pays-0.087
Token tried
Token ID3088
Feature activation+6.53e-01

Pos logit contributions
affles+0.053
affle+0.047
atmeal+0.046
ic+0.045
ilt+0.044
Neg logit contributions
 leads-0.022
 ho-0.022
 waved-0.022
 lets-0.020
packs-0.020
Token and
Token ID290
Feature activation+3.58e-01

Pos logit contributions
 direction+0.013
packs+0.013
 ho+0.013
 crashing+0.013
 leads+0.013
Neg logit contributions
affles-0.029
atmeal-0.026
affle-0.026
ic-0.024
ils-0.024
Token tried
Token ID3088
Feature activation+9.09e-01

Pos logit contributions
 leads+0.008
 pays+0.008
 steering+0.008
 ho+0.008
packs+0.007
Neg logit contributions
affles-0.015
affle-0.014
atmeal-0.013
ic-0.013
ilt-0.012
Token.
Token ID13
Feature activation-1.85e-01

Pos logit contributions
packs+0.020
 direction+0.019
 leads+0.019
 distance+0.019
 ho+0.019
Neg logit contributions
affles-0.039
atmeal-0.034
affle-0.034
ic-0.032
ils-0.031
Token He
Token ID679
Feature activation-4.62e-02

Pos logit contributions
affles+0.008
atmeal+0.007
affle+0.007
ic+0.007
ils+0.007
Neg logit contributions
packs-0.003
 ho-0.003
 distance-0.003
 direction-0.003
 leads-0.003
Token tried
Token ID3088
Feature activation+8.77e-01

Pos logit contributions
affles+0.002
atmeal+0.002
affle+0.002
ilt+0.002
ic+0.002
Neg logit contributions
 waved-0.001
 ho-0.001
 way-0.001
 steering-0.001
 leads-0.001
Token to
Token ID284
Feature activation-4.41e-01

Pos logit contributions
 direction+0.018
packs+0.018
 leads+0.018
 ho+0.018
 distance+0.018
Neg logit contributions
affles-0.038
atmeal-0.034
affle-0.033
ic-0.032
ils-0.031
Token make
Token ID787
Feature activation-2.10e+00

Pos logit contributions
affles+0.022
atmeal+0.020
affle+0.019
ic+0.018
ils+0.018
Neg logit contributions
 flap-0.009
 follow-0.008
 steering-0.008
 ho-0.008
 close-0.007
Token something
Token ID1223
Feature activation-1.23e+00

Pos logit contributions
affles+0.090
atmeal+0.083
affle+0.079
at+0.076
ase+0.074
Neg logit contributions
 way-0.047
 thanks-0.047
 leads-0.047
 direction-0.047
 ho-0.046
Token with
Token ID351
Feature activation-6.69e-01

Pos logit contributions
affles+0.044
atmeal+0.037
affle+0.037
ic+0.036
ils+0.034
Neg logit contributions
packs-0.035
 distance-0.034
 direction-0.034
 ho-0.033
 leads-0.033
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
affles+0.024
atmeal+0.020
affle+0.020
ic+0.019
ils+0.018
Neg logit contributions
 direction-0.020
packs-0.019
 distance-0.019
 ho-0.019
 leads-0.019
Token dog
Token ID3290
Feature activation+7.20e-01

Pos logit contributions
 leads+0.028
 ho+0.028
 direction+0.027
 distance+0.026
 slows+0.026
Neg logit contributions
affles-0.049
atmeal-0.042
affle-0.042
ic-0.038
ilt-0.037
Token said
Token ID531
Feature activation+4.90e-01

Pos logit contributions
 leads+0.017
 ho+0.017
 direction+0.016
 pays+0.016
 distance+0.016
Neg logit contributions
affles-0.030
atmeal-0.026
affle-0.026
ic-0.025
ilt-0.023
Token,
Token ID11
Feature activation+2.29e+00

Pos logit contributions
 ho+0.007
 leads+0.006
 crashing+0.006
 slows+0.006
 direction+0.006
Neg logit contributions
affles-0.025
atmeal-0.023
affle-0.022
ic-0.022
ilt-0.021
Token "
Token ID366
Feature activation+1.49e+00

Pos logit contributions
packs+0.047
 direction+0.045
 distance+0.044
 leads+0.044
 ho+0.042
Neg logit contributions
affles-0.099
atmeal-0.088
affle-0.087
ic-0.084
ils-0.082
TokenPlease
Token ID5492
Feature activation+3.70e+00

Pos logit contributions
 distance+0.035
 direction+0.033
 ho+0.033
 crashing+0.032
 slows+0.032
Neg logit contributions
affles-0.064
affle-0.056
atmeal-0.055
ils-0.051
ilt-0.050
Token don
Token ID836
Feature activation+1.11e+00

Pos logit contributions
packs+0.102
 direction+0.092
 distance+0.092
 steering+0.084
stretched+0.084
Neg logit contributions
affles-0.139
affle-0.125
atmeal-0.123
ils-0.112
ic-0.111
Token't
Token ID470
Feature activation+2.06e+00

Pos logit contributions
packs+0.036
 direction+0.036
 distance+0.036
 leads+0.035
 ho+0.035
Neg logit contributions
affles-0.035
atmeal-0.029
affle-0.029
ic-0.028
ils-0.026
Token take
Token ID1011
Feature activation-7.13e-01

Pos logit contributions
packs+0.051
 leads+0.048
 distance+0.048
 direction+0.048
 pays+0.047
Neg logit contributions
affles-0.082
atmeal-0.071
affle-0.071
ic-0.068
ils-0.066
Token me
Token ID502
Feature activation-1.09e-01

Pos logit contributions
affles+0.025
atmeal+0.021
affle+0.021
ic+0.020
ils+0.019
Neg logit contributions
 direction-0.021
packs-0.021
 leads-0.021
 ho-0.021
 distance-0.021
Token.
Token ID13
Feature activation+1.95e+00

Pos logit contributions
affles+0.004
atmeal+0.003
affle+0.003
ils+0.003
ic+0.003
Neg logit contributions
packs-0.003
 leads-0.003
 slows-0.003
 direction-0.003
 distance-0.003
Token I
Token ID314
Feature activation+3.09e-01

Pos logit contributions
packs+0.043
 distance+0.043
 direction+0.041
 leads+0.040
 ho+0.039
Neg logit contributions
affles-0.082
affle-0.072
atmeal-0.071
ic-0.067
ils-0.065
Token's
Token ID338
Feature activation-5.09e-01

Pos logit contributions
affles+0.048
atmeal+0.042
affle+0.041
ic+0.040
ils+0.039
Neg logit contributions
packs-0.037
 steering-0.035
 distance-0.035
 crashing-0.035
 ho-0.035
Token dad
Token ID9955
Feature activation+3.92e-01

Pos logit contributions
affles+0.019
affle+0.017
atmeal+0.017
ic+0.015
ils+0.015
Neg logit contributions
packs-0.014
 leads-0.013
 distance-0.013
 direction-0.013
 ho-0.013
Token had
Token ID550
Feature activation-2.96e-01

Pos logit contributions
packs+0.010
 distance+0.010
 direction+0.010
 ho+0.009
 crashing+0.009
Neg logit contributions
affles-0.015
atmeal-0.013
affle-0.013
ic-0.013
ils-0.012
Token taught
Token ID7817
Feature activation+1.54e+00

Pos logit contributions
affles+0.011
atmeal+0.010
affle+0.010
ic+0.009
ils+0.009
Neg logit contributions
packs-0.008
 direction-0.008
 leads-0.008
 distance-0.008
 ho-0.008
Token him
Token ID683
Feature activation+9.00e-01

Pos logit contributions
 direction+0.056
packs+0.056
 leads+0.055
 distance+0.055
 ho+0.055
Neg logit contributions
affles-0.043
atmeal-0.035
affle-0.034
ic-0.032
ils-0.030
Token what
Token ID644
Feature activation-1.06e+00

Pos logit contributions
 leads+0.027
packs+0.026
 direction+0.026
 pays+0.026
 ho+0.026
Neg logit contributions
affles-0.031
atmeal-0.027
affle-0.026
ic-0.025
ils-0.024
Token to
Token ID284
Feature activation-1.38e+00

Pos logit contributions
affles+0.040
atmeal+0.035
affle+0.034
ic+0.033
ils+0.032
Neg logit contributions
packs-0.028
 distance-0.027
 direction-0.027
 ho-0.027
 leads-0.027
Token do
Token ID466
Feature activation-3.21e+00

Pos logit contributions
affles+0.045
atmeal+0.038
affle+0.037
ic+0.035
ils+0.034
Neg logit contributions
 leads-0.044
 direction-0.044
 distance-0.044
packs-0.043
 ho-0.043
Token in
Token ID287
Feature activation+2.30e+00

Pos logit contributions
affles+0.134
atmeal+0.118
affle+0.117
ic+0.113
ils+0.110
Neg logit contributions
packs-0.069
 direction-0.069
 distance-0.068
 leads-0.068
 ho-0.068
Token case
Token ID1339
Feature activation-3.16e+00

Pos logit contributions
packs+0.065
 direction+0.064
 leads+0.064
 distance+0.064
 ho+0.064
Neg logit contributions
affles-0.082
atmeal-0.070
affle-0.069
ic-0.066
ils-0.064
Token of
Token ID286
Feature activation-1.61e+00

Pos logit contributions
affles+0.117
atmeal+0.102
affle+0.100
ic+0.095
ils+0.091
Neg logit contributions
 leads-0.085
packs-0.085
 direction-0.084
 distance-0.083
 ho-0.083
Token had
Token ID550
Feature activation+2.08e+00

Pos logit contributions
 pays+0.059
 leads+0.058
 ho+0.057
 slows+0.054
 lets+0.054
Neg logit contributions
affles-0.159
atmeal-0.140
affle-0.139
ic-0.135
ilt-0.134
Token to
Token ID284
Feature activation-7.60e-01

Pos logit contributions
 ho+0.035
packs+0.035
 pays+0.035
 leads+0.034
 direction+0.034
Neg logit contributions
affles-0.101
affle-0.091
atmeal-0.089
ic-0.087
ils-0.083
Token wait
Token ID4043
Feature activation+1.58e+00

Pos logit contributions
affles+0.033
atmeal+0.029
affle+0.029
ic+0.028
ils+0.027
Neg logit contributions
packs-0.016
 distance-0.016
 leads-0.015
 direction-0.015
 ho-0.015
Token until
Token ID1566
Feature activation+5.13e-01

Pos logit contributions
packs+0.038
 distance+0.038
 direction+0.038
 leads+0.037
 ho+0.037
Neg logit contributions
affles-0.062
atmeal-0.054
affle-0.054
ic-0.051
ils-0.050
Token after
Token ID706
Feature activation-1.00e+00

Pos logit contributions
 distance+0.014
 direction+0.013
 ho+0.013
packs+0.013
 leads+0.013
Neg logit contributions
affles-0.020
affle-0.017
atmeal-0.017
ic-0.017
ils-0.016
Token dinner
Token ID8073
Feature activation-1.77e+00

Pos logit contributions
affles+0.035
affle+0.031
atmeal+0.030
ic+0.030
ils+0.029
Neg logit contributions
 ho-0.029
packs-0.029
 leads-0.028
 way-0.028
 waving-0.028
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
affles+0.082
affle+0.072
atmeal+0.071
ils+0.067
ic+0.066
Neg logit contributions
 way-0.033
 leads-0.033
 pays-0.032
 distance-0.032
 direction-0.031
Token
Token ID198
Feature activation+7.47e-01

Pos logit contributions
affles+0.006
atmeal+0.005
affle+0.005
ic+0.005
ott+0.005
Neg logit contributions
 ho-0.003
packs-0.003
 flap-0.003
 steering-0.003
 distance-0.003
Token
Token ID198
Feature activation+8.30e-01

Pos logit contributions
packs+0.013
 ho+0.013
 leads+0.013
 direction+0.013
 distance+0.013
Neg logit contributions
affles-0.034
atmeal-0.030
affle-0.030
ic-0.029
ils-0.028
TokenTim
Token ID14967
Feature activation-1.42e+00

Pos logit contributions
 direction+0.026
 distance+0.026
packs+0.026
 leads+0.025
 ho+0.025
Neg logit contributions
affles-0.028
atmeal-0.023
affle-0.023
ic-0.022
ils-0.021
Tokenmy
Token ID1820
Feature activation-1.36e+00

Pos logit contributions
affles+0.068
atmeal+0.060
affle+0.060
ic+0.058
ils+0.057
Neg logit contributions
packs-0.023
 direction-0.023
 pays-0.022
 leads-0.022
 distance-0.022
Token it
Token ID340
Feature activation-2.42e+00

Pos logit contributions
affles+0.005
atmeal+0.004
affle+0.004
ic+0.004
ils+0.003
Neg logit contributions
packs-0.004
 direction-0.004
 ho-0.004
 leads-0.004
 distance-0.004
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
affles+0.086
atmeal+0.074
affle+0.073
ic+0.073
ils+0.073
Neg logit contributions
packs-0.064
 distance-0.064
 direction-0.061
 leads-0.060
 slows-0.060
Token She
Token ID1375
Feature activation-9.62e-01

Pos logit contributions
affles+0.006
atmeal+0.006
affle+0.006
ic+0.005
ils+0.005
Neg logit contributions
packs-0.003
 direction-0.003
 leads-0.003
 ho-0.003
 distance-0.003
Token felt
Token ID2936
Feature activation-2.68e+00

Pos logit contributions
affles+0.044
atmeal+0.040
affle+0.039
ic+0.036
ilt+0.035
Neg logit contributions
 ho-0.021
 direction-0.019
 waved-0.019
 leads-0.019
 steering-0.018
Token a
Token ID257
Feature activation+3.42e-01

Pos logit contributions
affles+0.096
atmeal+0.082
affle+0.081
ic+0.077
ils+0.075
Neg logit contributions
packs-0.076
 leads-0.074
 direction-0.074
 distance-0.074
 ho-0.073
Token sharp
Token ID7786
Feature activation-8.93e-01

Pos logit contributions
packs+0.010
 leads+0.009
 pays+0.009
 direction+0.008
 slows+0.008
Neg logit contributions
affles-0.012
affle-0.011
ic-0.010
atmeal-0.010
ilt-0.010
Token shock
Token ID6380
Feature activation+6.76e-01

Pos logit contributions
affles+0.035
atmeal+0.030
affle+0.029
ic+0.028
ils+0.027
Neg logit contributions
 direction-0.022
 ho-0.022
 leads-0.022
packs-0.022
 distance-0.022
Token!
Token ID0
Feature activation-5.27e-01

Pos logit contributions
packs+0.016
 direction+0.016
 leads+0.016
 distance+0.016
 ho+0.016
Neg logit contributions
affles-0.026
atmeal-0.023
affle-0.023
ic-0.022
ils-0.021
Token Ethan
Token ID28926
Feature activation-2.46e+00

Pos logit contributions
affles+0.023
atmeal+0.021
affle+0.020
ic+0.020
ils+0.019
Neg logit contributions
packs-0.010
 direction-0.010
 leads-0.010
 distance-0.010
 ho-0.010
Token quickly
Token ID2952
Feature activation+1.86e+00

Pos logit contributions
affles+0.104
atmeal+0.090
affle+0.090
ic+0.083
ils+0.080
Neg logit contributions
 ho-0.058
 leads-0.056
 direction-0.056
 pays-0.055
 distance-0.055
Token looked
Token ID3114
Feature activation-6.17e-01

Pos logit contributions
 leads+0.041
 ho+0.041
 direction+0.041
 distance+0.041
packs+0.040
Neg logit contributions
affles-0.079
atmeal-0.069
affle-0.068
ic-0.065
ils-0.063
Token Ben
Token ID3932
Feature activation-1.60e+00

Pos logit contributions
 direction+0.013
 distance+0.013
packs+0.013
 steering+0.013
 leads+0.012
Neg logit contributions
affles-0.019
affle-0.016
atmeal-0.016
ic-0.015
 breakfast-0.014
Token loved
Token ID6151
Feature activation-3.74e-01

Pos logit contributions
affles+0.061
atmeal+0.053
affle+0.052
ic+0.051
ils+0.048
Neg logit contributions
packs-0.042
 leads-0.041
 direction-0.041
 distance-0.041
 ho-0.041
Token to
Token ID284
Feature activation-2.16e+00

Pos logit contributions
affles+0.019
ic+0.017
affle+0.017
ilt+0.017
atmeal+0.017
Neg logit contributions
 ho-0.005
 crashing-0.005
 way-0.005
 leads-0.005
packs-0.005
Token play
Token ID711
Feature activation-1.95e+00

Pos logit contributions
affles+0.095
ic+0.087
affle+0.083
atmeal+0.083
ils+0.082
Neg logit contributions
 flap-0.043
 ho-0.042
 leads-0.042
packs-0.041
 distance-0.040
Token in
Token ID287
Feature activation+1.27e+00

Pos logit contributions
affles+0.092
atmeal+0.084
ic+0.083
izz+0.079
illy+0.078
Neg logit contributions
 flap-0.032
 ho-0.028
 leads-0.027
 steering-0.026
 follow-0.026
Token the
Token ID262
Feature activation-1.68e+00

Pos logit contributions
 distance+0.032
 leads+0.032
packs+0.032
 direction+0.032
 ho+0.032
Neg logit contributions
affles-0.051
atmeal-0.044
affle-0.043
ic-0.042
ils-0.041
Token forest
Token ID8222
Feature activation-1.04e-01

Pos logit contributions
affles+0.068
atmeal+0.057
ic+0.057
affle+0.057
ils+0.056
Neg logit contributions
 distance-0.042
 direction-0.041
 way-0.039
 ho-0.039
 flap-0.038
Token with
Token ID351
Feature activation-1.16e+00

Pos logit contributions
affles+0.004
atmeal+0.004
affle+0.004
ic+0.004
ils+0.003
Neg logit contributions
packs-0.002
 distance-0.002
 direction-0.002
 leads-0.002
 ho-0.002
Token his
Token ID465
Feature activation-2.06e-01

Pos logit contributions
affles+0.041
atmeal+0.036
ic+0.035
affle+0.035
ils+0.033
Neg logit contributions
 ho-0.034
packs-0.034
 direction-0.033
 leads-0.033
 distance-0.033
Token friends
Token ID2460
Feature activation+1.13e+00

Pos logit contributions
affles+0.006
affle+0.005
atmeal+0.005
ic+0.005
ils+0.004
Neg logit contributions
 leads-0.007
 pays-0.007
packs-0.007
 distance-0.007
 slows-0.007
Token.
Token ID13
Feature activation+7.22e-02

Pos logit contributions
 distance+0.029
 direction+0.029
 slows+0.029
packs+0.028
 leads+0.028
Neg logit contributions
affles-0.043
atmeal-0.037
affle-0.037
ic-0.035
ils-0.035
Token he
Token ID339
Feature activation-3.22e-01

Pos logit contributions
affles+0.060
affle+0.053
atmeal+0.051
ic+0.051
ils+0.049
Neg logit contributions
 leads-0.037
packs-0.037
 ho-0.036
 distance-0.036
 direction-0.036
Token should
Token ID815
Feature activation-5.03e-01

Pos logit contributions
affles+0.016
atmeal+0.014
affle+0.014
ic+0.014
ilt+0.013
Neg logit contributions
 pays-0.005
 leads-0.005
 lets-0.005
 thanks-0.005
 slows-0.005
Token go
Token ID467
Feature activation+5.72e-01

Pos logit contributions
affles+0.022
atmeal+0.019
affle+0.019
ic+0.019
ils+0.018
Neg logit contributions
 leads-0.010
 ho-0.010
 distance-0.010
 flap-0.010
 direction-0.010
Token anyway
Token ID6949
Feature activation-8.41e-01

Pos logit contributions
packs+0.013
 distance+0.013
 direction+0.013
 leads+0.013
 ho+0.013
Neg logit contributions
affles-0.024
atmeal-0.021
affle-0.021
ic-0.020
ils-0.019
Token and
Token ID290
Feature activation+6.80e-01

Pos logit contributions
affles+0.032
atmeal+0.028
affle+0.027
ic+0.026
ils+0.025
Neg logit contributions
 leads-0.021
 direction-0.021
 distance-0.021
packs-0.020
 ho-0.020
Token make
Token ID787
Feature activation-2.33e+00

Pos logit contributions
packs+0.019
 direction+0.019
 distance+0.019
 ho+0.019
 leads+0.019
Neg logit contributions
affles-0.024
atmeal-0.021
affle-0.020
ic-0.019
ils-0.019
Token new
Token ID649
Feature activation+6.22e-01

Pos logit contributions
affles+0.089
atmeal+0.084
ase+0.083
at+0.079
affle+0.078
Neg logit contributions
 way-0.057
 thanks-0.057
 ho-0.055
 direction-0.052
 pays-0.051
Token friends
Token ID2460
Feature activation+4.61e-01

Pos logit contributions
packs+0.020
 direction+0.019
 leads+0.019
 ho+0.018
 distance+0.018
Neg logit contributions
affles-0.021
atmeal-0.018
affle-0.018
ic-0.017
ils-0.017
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
packs+0.009
 direction+0.009
 distance+0.009
 leads+0.009
 crashing+0.009
Neg logit contributions
affles-0.021
affle-0.018
atmeal-0.018
ic-0.017
ils-0.017
Token Tim
Token ID5045
Feature activation-1.20e+00

Pos logit contributions
affles+0.007
atmeal+0.006
affle+0.006
ic+0.006
ils+0.006
Neg logit contributions
packs-0.003
 direction-0.003
 leads-0.003
 distance-0.003
 ho-0.003
Tokenmy
Token ID1820
Feature activation-1.27e+00

Pos logit contributions
affles+0.058
atmeal+0.052
affle+0.051
ils+0.049
ic+0.049
Neg logit contributions
packs-0.019
 direction-0.019
 distance-0.019
 pays-0.019
 ho-0.018
Token They
Token ID1119
Feature activation+5.99e-01

Pos logit contributions
affles+0.008
atmeal+0.007
affle+0.007
ic+0.007
ils+0.006
Neg logit contributions
packs-0.004
 leads-0.004
 direction-0.004
 distance-0.004
 ho-0.004
Token picked
Token ID6497
Feature activation-3.47e+00

Pos logit contributions
 ho+0.012
packs+0.012
 leads+0.012
 steering+0.011
 waved+0.011
Neg logit contributions
affles-0.028
affle-0.025
atmeal-0.024
ic-0.024
ilt-0.022
Token out
Token ID503
Feature activation+2.96e+00

Pos logit contributions
affles+0.159
atmeal+0.142
affle+0.139
lf+0.133
ic+0.127
Neg logit contributions
 leads-0.073
 thanks-0.068
 pays-0.067
 ho-0.067
packs-0.066
Token some
Token ID617
Feature activation-1.47e+00

Pos logit contributions
packs+0.075
 leads+0.073
 thanks+0.070
 ho+0.069
 direction+0.069
Neg logit contributions
affles-0.127
atmeal-0.114
affle-0.113
ic-0.105
ilt-0.098
Token l
Token ID300
Feature activation-3.86e+00

Pos logit contributions
affles+0.048
atmeal+0.042
affle+0.040
ic+0.039
ils+0.037
Neg logit contributions
 leads-0.045
 direction-0.045
 ho-0.045
packs-0.044
 distance-0.044
Tokenoll
Token ID692
Feature activation-6.02e+00

Pos logit contributions
affles+0.084
atmeal+0.068
ic+0.063
lf+0.063
affle+0.060
Neg logit contributions
packs-0.153
 ho-0.148
 leads-0.145
 pays-0.142
stretched-0.141
Tokenip
Token ID541
Feature activation-5.47e+00

Pos logit contributions
affles+0.174
atmeal+0.143
affle+0.143
ilt+0.139
ic+0.136
Neg logit contributions
packs-0.209
 ho-0.205
 leads-0.203
 pays-0.196
 slows-0.186
Tokenops
Token ID2840
Feature activation-1.97e+00

Pos logit contributions
affles+0.130
affle+0.102
atmeal+0.095
ilt+0.089
ils+0.085
Neg logit contributions
 leads-0.217
 ho-0.216
 pays-0.214
packs-0.207
 way-0.204
Token and
Token ID290
Feature activation-6.48e-02

Pos logit contributions
affles+0.088
atmeal+0.078
affle+0.077
ic+0.075
ils+0.073
Neg logit contributions
 direction-0.037
packs-0.037
 distance-0.036
 leads-0.036
 ho-0.036
Token headed
Token ID9153
Feature activation+2.97e+00

Pos logit contributions
affles+0.003
affle+0.003
ic+0.002
atmeal+0.002
ilt+0.002
Neg logit contributions
 leads-0.001
 ho-0.001
 lets-0.001
packs-0.001
 way-0.001
Token to
Token ID284
Feature activation-1.33e+00

Pos logit contributions
 direction+0.076
packs+0.076
 leads+0.076
 ho+0.075
 distance+0.075
Neg logit contributions
affles-0.110
atmeal-0.096
affle-0.096
ic-0.089
ils-0.088
Token going
Token ID1016
Feature activation+1.73e+00

Pos logit contributions
 crashing+0.019
 ho+0.019
 waving+0.019
 steering+0.019
 thanks+0.018
Neg logit contributions
affles-0.044
affle-0.039
ic-0.038
atmeal-0.037
ils-0.036
Token to
Token ID284
Feature activation-7.29e-01

Pos logit contributions
 direction+0.032
packs+0.032
 distance+0.032
 leads+0.032
 ho+0.032
Neg logit contributions
affles-0.080
affle-0.070
atmeal-0.070
ic-0.067
ils-0.066
Token fly
Token ID6129
Feature activation+2.01e+00

Pos logit contributions
affles+0.034
affle+0.029
ic+0.029
atmeal+0.029
ils+0.028
Neg logit contributions
 flap-0.016
 distance-0.016
 way-0.016
 leads-0.015
 crashing-0.015
Token in
Token ID287
Feature activation+2.39e+00

Pos logit contributions
 direction+0.042
 distance+0.042
packs+0.040
 flap+0.039
 leads+0.038
Neg logit contributions
affles-0.097
atmeal-0.086
affle-0.085
ic-0.078
ils-0.074
Token an
Token ID281
Feature activation-1.61e+00

Pos logit contributions
 direction+0.067
packs+0.067
 distance+0.067
 leads+0.065
 ho+0.065
Neg logit contributions
affles-0.090
atmeal-0.078
affle-0.077
ic-0.072
ils-0.070
Token aer
Token ID9551
Feature activation-5.97e+00

Pos logit contributions
affles+0.059
atmeal+0.051
affle+0.049
ic+0.048
ils+0.046
Neg logit contributions
 direction-0.048
 distance-0.047
packs-0.044
 way-0.043
 flap-0.043
Tokenopl
Token ID20106
Feature activation-9.17e-01

Pos logit contributions
affles+0.264
affle+0.230
atmeal+0.229
ils+0.223
ic+0.218
Neg logit contributions
 distance-0.121
 direction-0.117
packs-0.110
 pays-0.108
 slows-0.107
Tokenane
Token ID1531
Feature activation-1.19e+00

Pos logit contributions
affles+0.036
atmeal+0.031
affle+0.031
ic+0.029
ils+0.029
Neg logit contributions
packs-0.023
 direction-0.022
 leads-0.022
 distance-0.022
 ho-0.022
Token.
Token ID13
Feature activation+4.13e-02

Pos logit contributions
affles+0.050
atmeal+0.044
affle+0.044
ic+0.042
ils+0.041
Neg logit contributions
packs-0.026
 direction-0.026
 distance-0.026
 leads-0.026
 ho-0.025
Token She
Token ID1375
Feature activation-7.69e-01

Pos logit contributions
packs+0.001
 direction+0.001
 leads+0.001
 distance+0.001
 ho+0.001
Neg logit contributions
affles-0.002
atmeal-0.002
affle-0.002
ic-0.002
ils-0.002
Token jumped
Token ID11687
Feature activation+1.46e+00

Pos logit contributions
affles+0.029
atmeal+0.025
affle+0.025
ic+0.024
ils+0.024
Neg logit contributions
packs-0.020
 distance-0.019
 direction-0.019
 leads-0.019
 ho-0.019
Token send
Token ID3758
Feature activation+1.46e+00

Pos logit contributions
packs+0.015
 direction+0.015
 leads+0.014
 distance+0.014
 slows+0.014
Neg logit contributions
affles-0.027
atmeal-0.024
affle-0.024
ic-0.023
ils-0.022
Token the
Token ID262
Feature activation+1.91e+00

Pos logit contributions
packs+0.045
 direction+0.044
 leads+0.044
 distance+0.044
 ho+0.044
Neg logit contributions
affles-0.048
atmeal-0.041
affle-0.040
ic-0.038
ils-0.037
Token letter
Token ID3850
Feature activation+1.37e+00

Pos logit contributions
packs+0.059
 crashing+0.058
 ho+0.058
 slows+0.057
 leads+0.057
Neg logit contributions
affles-0.061
atmeal-0.051
affle-0.050
ic-0.049
ils-0.048
Token in
Token ID287
Feature activation+2.29e+00

Pos logit contributions
 slows+0.033
 crashing+0.033
packs+0.033
 direction+0.033
 ho+0.033
Neg logit contributions
affles-0.049
atmeal-0.044
ic-0.042
ils-0.042
affle-0.042
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
 slows+0.069
 leads+0.068
packs+0.068
 ho+0.066
 direction+0.066
Neg logit contributions
affles-0.071
atmeal-0.059
affle-0.059
ic-0.058
ils-0.057
Token few
Token ID1178
Feature activation-5.73e+00

Pos logit contributions
affles+0.040
affle+0.035
atmeal+0.035
ic+0.033
ils+0.031
Neg logit contributions
packs-0.040
 slows-0.038
 leads-0.038
 pays-0.037
 ho-0.037
Token days
Token ID1528
Feature activation+1.92e+00

Pos logit contributions
affles+0.170
atmeal+0.138
affle+0.137
ic+0.130
ils+0.129
Neg logit contributions
packs-0.194
 ho-0.191
 distance-0.191
 leads-0.191
 direction-0.190
Token,
Token ID11
Feature activation-1.56e+00

Pos logit contributions
 direction+0.043
packs+0.042
 leads+0.042
 distance+0.042
 slows+0.042
Neg logit contributions
affles-0.076
atmeal-0.069
ic-0.066
affle-0.065
ils-0.063
Token but
Token ID475
Feature activation-7.08e-01

Pos logit contributions
affles+0.057
atmeal+0.052
affle+0.049
ic+0.048
ils+0.047
Neg logit contributions
packs-0.039
 direction-0.037
arse-0.037
 slows-0.036
 distance-0.036
Token then
Token ID788
Feature activation+1.46e-02

Pos logit contributions
affles+0.026
atmeal+0.023
affle+0.022
ic+0.021
ils+0.021
Neg logit contributions
packs-0.019
 direction-0.019
 leads-0.019
 distance-0.019
 ho-0.018
Token time
Token ID640
Feature activation+3.61e+00

Pos logit contributions
 leads+0.000
 direction+0.000
packs+0.000
 ho+0.000
 distance+0.000
Neg logit contributions
affles-0.001
affle-0.001
atmeal-0.001
ic-0.001
ilt-0.001
Token.
Token ID13
Feature activation-3.22e-01

Pos logit contributions
affles+0.023
atmeal+0.020
affle+0.020
ic+0.019
ils+0.019
Neg logit contributions
packs-0.010
 direction-0.010
 ho-0.010
 distance-0.010
 leads-0.010
Token So
Token ID1406
Feature activation+1.48e+00

Pos logit contributions
affles+0.014
atmeal+0.013
affle+0.013
ic+0.012
ils+0.012
Neg logit contributions
packs-0.006
 distance-0.006
 ho-0.006
 crashing-0.006
 direction-0.006
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
packs+0.038
 distance+0.035
 crashing+0.034
 steering+0.034
 pays+0.033
Neg logit contributions
affles-0.053
ils-0.047
ic-0.046
atmeal-0.046
affle-0.045
Token the
Token ID262
Feature activation+1.55e+00

Pos logit contributions
affles+0.041
affle+0.036
atmeal+0.035
ic+0.033
ils+0.032
Neg logit contributions
 direction-0.028
 leads-0.028
 ho-0.028
 distance-0.027
 flap-0.027
Token l
Token ID300
Feature activation-3.78e+00

Pos logit contributions
 ho+0.046
 way+0.045
 distance+0.045
 direction+0.044
 leads+0.042
Neg logit contributions
affles-0.055
atmeal-0.046
ic-0.045
ilt-0.045
ils-0.045
Tokenoll
Token ID692
Feature activation-6.28e+00

Pos logit contributions
affles+0.097
making+0.086
atmeal+0.085
ic+0.083
 pancakes+0.076
Neg logit contributions
 ho-0.127
 lets-0.124
stretched-0.123
 slows-0.122
 slips-0.119
Tokenipop
Token ID42800
Feature activation-2.87e+00

Pos logit contributions
affles+0.233
ilt+0.225
ic+0.211
making+0.204
atmeal+0.196
Neg logit contributions
 ho-0.156
 leads-0.138
 lets-0.137
packs-0.131
 pays-0.129
Token decided
Token ID3066
Feature activation+1.22e+00

Pos logit contributions
affles+0.121
affle+0.104
atmeal+0.102
ic+0.100
ils+0.092
Neg logit contributions
 leads-0.068
 distance-0.066
 ho-0.066
 direction-0.065
packs-0.065
Token to
Token ID284
Feature activation-2.24e-01

Pos logit contributions
packs+0.031
 distance+0.030
 direction+0.030
 leads+0.029
 ho+0.029
Neg logit contributions
affles-0.046
atmeal-0.041
affle-0.039
ic-0.038
ils-0.038
Token find
Token ID1064
Feature activation-2.72e+00

Pos logit contributions
affles+0.010
atmeal+0.009
affle+0.009
ic+0.008
ils+0.008
Neg logit contributions
 leads-0.005
 direction-0.004
 ho-0.004
packs-0.004
 distance-0.004
Token some
Token ID617
Feature activation-1.02e+00

Pos logit contributions
affles+0.098
atmeal+0.084
affle+0.082
ic+0.079
ils+0.073
Neg logit contributions
 direction-0.080
 leads-0.080
 distance-0.079
 ho-0.079
packs-0.078
Token street
Token ID4675
Feature activation+2.18e+00

Pos logit contributions
affles+0.024
atmeal+0.022
affle+0.021
ic+0.021
ils+0.021
Neg logit contributions
 distance-0.008
 direction-0.008
 way-0.008
 crashing-0.007
 flap-0.007
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
packs+0.032
 direction+0.031
 leads+0.031
 distance+0.031
 ho+0.031
Neg logit contributions
affles-0.108
atmeal-0.097
affle-0.096
ic-0.093
ils-0.091
Token he
Token ID339
Feature activation-9.50e-01

Pos logit contributions
affles+0.039
atmeal+0.033
affle+0.033
ic+0.031
ils+0.030
Neg logit contributions
packs-0.030
 direction-0.030
 leads-0.030
 distance-0.030
 ho-0.030
Token began
Token ID2540
Feature activation+1.54e+00

Pos logit contributions
affles+0.042
affle+0.037
atmeal+0.036
ic+0.036
ils+0.034
Neg logit contributions
 distance-0.022
 direction-0.021
 ho-0.020
 leads-0.020
packs-0.019
Token to
Token ID284
Feature activation-1.13e+00

Pos logit contributions
packs+0.031
 direction+0.031
 distance+0.031
 leads+0.031
 ho+0.030
Neg logit contributions
affles-0.068
atmeal-0.060
affle-0.059
ic-0.057
ils-0.056
Token sne
Token ID10505
Feature activation-7.00e+00

Pos logit contributions
affles+0.051
affle+0.045
ic+0.045
atmeal+0.044
ils+0.043
Neg logit contributions
 distance-0.023
 direction-0.023
 leads-0.022
 flap-0.022
 ho-0.022
Tokene
Token ID68
Feature activation-8.51e-01

Pos logit contributions
affles+0.227
atmeal+0.203
ic+0.200
lf+0.193
iflower+0.193
Neg logit contributions
 direction-0.201
packs-0.196
 distance-0.193
 leads-0.185
 ho-0.185
Tokenze
Token ID2736
Feature activation-2.93e+00

Pos logit contributions
affles+0.023
atmeal+0.019
affle+0.018
ils+0.016
ic+0.016
Neg logit contributions
 ho-0.033
 crashing-0.032
 slows-0.031
 steering-0.031
 leads-0.030
Token.
Token ID13
Feature activation+1.05e-03

Pos logit contributions
affles+0.161
atmeal+0.147
affle+0.141
ott+0.138
ils+0.133
Neg logit contributions
 crashing-0.034
 direction-0.031
 distance-0.030
 way-0.029
 ho-0.029
Token
Token ID198
Feature activation+7.73e-01

Pos logit contributions
packs+0.000
 leads+0.000
 ho+0.000
 distance+0.000
 pays+0.000
Neg logit contributions
affles-0.000
affle-0.000
atmeal-0.000
ic-0.000
ils-0.000
Token
Token ID198
Feature activation+8.34e-01

Pos logit contributions
 ho+0.016
packs+0.016
 leads+0.016
 pays+0.015
 distance+0.015
Neg logit contributions
affles-0.032
affle-0.028
atmeal-0.028
ic-0.027
ils-0.026