TOP ACTIVATIONS
MAX = 7.003

. The owl told Wh isk ers , " To be
. The vet showed Wh isk ers the x - ray
was so sad and cried all the way home . She
lonely bird and felt sad all the time . One day
lonely bird and felt sad all the time . One day
mom smiled and explained that nature is all around us ,
Sarah was scared and happy all at once !
because she was so happy all the time . But Joy
mom smiled and explained that nature is powerful , and wise
cough and felt so tired all the time . He was
pill was so lazy that all it did was swim .
a smart cat named Wh isk ers . Wh isk ers
tree . This tree loved nature and enjoyed looking around it
happy that she hugged Wh isk ers . " Thank you
a naughty cat named Wh isk ers . Wh isk ers
little girl thought maybe the all igator was too stubborn to
j olly cat named Wh isk ers . Wh isk ers
nice and kind , so all the other creatures in the
ch ubby cat named Wh isk ers . Tim my loved
Ch ir py and Wh isk ers knew that poison was

MIN ACTIVATIONS
MIN = -6.114

Do you want to be happy ?" S ara
! Can I have a slice too ?
down . " Did you have fun , kids
cake . " Did you like the tour ,
, " How about this mel on , Tom ? It
satisfied . " Did you like your food ?"
fun . " Did you like the dragon slide
! Can I have a slice ?" Anna nodded
maybe , we will be happy ." The end
cookies . " Did you have fun in the
them . " Did you have fun ?" she
other . " Did you like the green car
a pearl in this fancy oy ster !" His Mom smiled
other . " Did you make your wish ?"
them . " Did you have fun , kids
adventure . " Did you have fun ?" their
't you share and be happy ?" S ara
" Really ? Did you ever lose your boats
. We helped each other ," Lily said .
nice . " Did you like the clown ?"

INTERVAL 3.724 - 7.003
CONTAINS 0.315%

toys by the dog stand all day . <|endoftext|> There was
not have the same things as everyone else . He learned
" No way ! That all igator is too deep in
He hopped to the music all day long , until the
face and two hands . Every day , the clock went

INTERVAL 0.445 - 3.724
CONTAINS 34.584%

. She loved to take baths in her bath tub every
the tuna and then he walked back inside with his mom
everyday . One morning , the mall closed its doors to
delicious . Lily was happy too because she made her brother
inside . He knew that he had managed to help the

INTERVAL -2.835 - 0.445
CONTAINS 63.933%

hugged and said goodbye . Ben went home with his mom
exam . L ily started to feel better
<|endoftext|> Once upon a time , there was a little girl
know you like Patch . Would you like to play with
On a and her friends had so much fun at the

INTERVAL -6.114 - -2.835
CONTAINS 1.167%

. She felt worried and angry . She knocked
" Let me go ! Please , I want to go
finished , they were very proud of their work .
They put each block in a box that matched its color
throw it . You could hurt someone or lose it .
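
The per-token records in this dump fuse each label with its value: `Token.` is the label `Token` followed by the token text `.`, `Token ID13` is token ID 13, and `rated+0.065` is a vocabulary token followed by its signed logit contribution. The following is a minimal sketch of how such records could be split back into fields. The function name `parse_records` and the dictionary keys are hypothetical, chosen for illustration only, and the layout it assumes is inferred from the records in this dump rather than from any documented export format.

```python
import re

# A contribution line is "<vocab token><signed decimal>", e.g. " supposed+0.060".
# The non-greedy prefix keeps tokens such as "?" or "e" intact ("?-0.121").
CONTRIB = re.compile(r"^(.*?)([+-]\d+\.\d+)$")
# An activation line is "Feature activation<signed scientific notation>".
ACT = re.compile(r"^Feature activation([+-]?\d+\.\d+e[+-]\d+)$")


def parse_records(lines):
    """Split fused tooltip lines into one dict per token (assumed layout)."""
    records = []
    current = None   # record being filled
    bucket = None    # list currently receiving contribution lines
    for i, line in enumerate(lines):
        nxt = lines[i + 1] if i + 1 < len(lines) else ""
        # A record starts at a "Token<text>" line directly followed by "Token ID<n>".
        if line.startswith("Token") and nxt.startswith("Token ID"):
            current = {
                "token": line[len("Token"):],  # leading space is part of the token
                "token_id": None,
                "activation": None,
                "pos": [],
                "neg": [],
            }
            records.append(current)
            bucket = None
        elif current is None:
            continue  # skip everything before the first record
        elif line.startswith("Token ID"):
            current["token_id"] = int(line[len("Token ID"):])
        elif (m := ACT.match(line)):
            current["activation"] = float(m.group(1))
        elif line == "Pos logit contributions":
            bucket = current["pos"]
        elif line == "Neg logit contributions":
            bucket = current["neg"]
        elif bucket is not None and (m := CONTRIB.match(line)):
            bucket.append((m.group(1), float(m.group(2))))
    return records


if __name__ == "__main__":
    sample = [
        "Token.", "Token ID13", "Feature activation+1.03e+00", "",
        "Pos logit contributions", "rated+0.065", " supposed+0.060",
        "Neg logit contributions", "aw-0.034",
    ]
    for record in parse_records(sample):
        print(record)
```

The parser keeps leading spaces in token text because they distinguish word-initial tokens (` The`) from continuation tokens (`isk`, `ers`).
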
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
rated+0.065
 supposed+0.060
lling+0.060
igators+0.058
 attention+0.057
Neg logit contributions
aw-0.034
ra-0.033
light-0.031
era-0.030
iolet-0.028
Token The
Token ID383
Feature activation+1.89e+00

Pos logit contributions
 supposed+0.027
lling+0.027
igators+0.026
rated+0.026
 attention+0.026
Neg logit contributions
ra-0.025
mm-0.024
era-0.023
aw-0.023
iolet-0.023
Token owl
Token ID39610
Feature activation-1.71e-01

Pos logit contributions
rated+0.050
igators+0.050
 supposed+0.050
lling+0.049
 sizes+0.044
Neg logit contributions
ra-0.046
 ha-0.041
aw-0.039
mm-0.039
P-0.037
Token told
Token ID1297
Feature activation+1.61e+00

Pos logit contributions
ra+0.004
era+0.004
mm+0.004
iolet+0.004
aw+0.003
Neg logit contributions
 supposed-0.005
igators-0.005
 attention-0.004
rated-0.004
lling-0.004
Token Wh
Token ID854
Feature activation+2.60e+00

Pos logit contributions
 supposed+0.043
rated+0.043
igators+0.042
lling+0.041
 attention+0.040
Neg logit contributions
ra-0.039
aw-0.036
mm-0.036
iolet-0.034
era-0.034
Tokenisk
Token ID1984
Feature activation+7.00e+00

Pos logit contributions
igators+0.054
 attention+0.052
 supposed+0.050
rated+0.049
lling+0.049
Neg logit contributions
ra-0.080
ara-0.075
iolet-0.075
era-0.075
mm-0.075
Tokeners
Token ID364
Feature activation+3.69e+00

Pos logit contributions
rated+0.204
 supposed+0.201
 attention+0.197
 way+0.195
igators+0.188
Neg logit contributions
ra-0.147
iolet-0.143
aw-0.143
era-0.142
mm-0.129
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
rated+0.108
 supposed+0.107
igators+0.104
lling+0.102
 attention+0.100
Neg logit contributions
ra-0.074
era-0.072
mm-0.070
aw-0.070
iolet-0.068
Token "
Token ID366
Feature activation+7.41e-01

Pos logit contributions
rated+0.054
igators+0.051
 supposed+0.050
lling+0.050
 attention+0.049
Neg logit contributions
M-0.031
aw-0.031
mmm-0.031
ra-0.030
mm-0.030
TokenTo
Token ID2514
Feature activation-8.82e-01

Pos logit contributions
 supposed+0.020
igators+0.019
 attention+0.019
rated+0.019
lling+0.018
Neg logit contributions
ra-0.018
era-0.017
mm-0.016
iolet-0.016
aw-0.016
Token be
Token ID307
Feature activation-2.14e+00

Pos logit contributions
ra+0.024
era+0.023
mm+0.023
iolet+0.023
aw+0.022
Neg logit contributions
 supposed-0.020
igators-0.019
 attention-0.019
 way-0.019
rated-0.018
Token.
Token ID13
Feature activation+1.67e-03

Pos logit contributions
ra+0.003
era+0.003
light+0.003
aw+0.003
iolet+0.003
Neg logit contributions
 supposed-0.005
igators-0.005
rated-0.005
lling-0.005
 attention-0.004
Token The
Token ID383
Feature activation+8.16e-01

Pos logit contributions
igators+0.000
 supposed+0.000
 having+0.000
rated+0.000
 attention+0.000
Neg logit contributions
ra-0.000
era-0.000
mm-0.000
iolet-0.000
aw-0.000
Token vet
Token ID20202
Feature activation-3.86e-04

Pos logit contributions
 supposed+0.024
igators+0.023
 attention+0.023
rated+0.023
lling+0.022
Neg logit contributions
ra-0.018
era-0.016
mm-0.016
iolet-0.016
aw-0.015
Token showed
Token ID3751
Feature activation+6.66e-01

Pos logit contributions
ra+0.000
mm+0.000
era+0.000
iolet+0.000
mmm+0.000
Neg logit contributions
 supposed-0.000
igators-0.000
rated-0.000
 attention-0.000
 way-0.000
Token Wh
Token ID854
Feature activation+1.92e+00

Pos logit contributions
 supposed+0.017
rated+0.016
igators+0.015
 attention+0.015
lling+0.015
Neg logit contributions
ra-0.018
era-0.016
aw-0.016
mm-0.016
iolet-0.016
Tokenisk
Token ID1984
Feature activation+6.79e+00

Pos logit contributions
igators+0.037
 supposed+0.035
 attention+0.034
rated+0.034
 way+0.033
Neg logit contributions
ra-0.063
era-0.059
iolet-0.057
ara-0.057
mm-0.057
Tokeners
Token ID364
Feature activation+2.76e+00

Pos logit contributions
 supposed+0.189
rated+0.188
 way+0.183
 attention+0.182
igators+0.179
Neg logit contributions
ra-0.156
era-0.146
iolet-0.145
aw-0.144
mm-0.134
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
rated+0.077
 supposed+0.076
igators+0.070
lling+0.068
 causing+0.067
Neg logit contributions
ra-0.063
era-0.059
aw-0.058
iolet-0.057
mm-0.056
Token x
Token ID2124
Feature activation-3.66e-01

Pos logit contributions
igators+0.031
 supposed+0.030
rated+0.029
lling+0.029
 attention+0.028
Neg logit contributions
ra-0.030
era-0.027
mm-0.026
iolet-0.026
aw-0.025
Token-
Token ID12
Feature activation+1.85e+00

Pos logit contributions
ra+0.008
mm+0.008
era+0.008
iolet+0.008
ara+0.008
Neg logit contributions
 supposed-0.010
 attention-0.009
igators-0.009
rated-0.009
 valuable-0.009
Tokenray
Token ID2433
Feature activation-9.02e-01

Pos logit contributions
 supposed+0.048
igators+0.047
 attention+0.047
rated+0.046
 valuable+0.045
Neg logit contributions
ra-0.044
era-0.042
mm-0.042
iolet-0.040
ara-0.038
Token was
Token ID373
Feature activation+1.15e-01

Pos logit contributions
mm+0.019
ra+0.018
mmm+0.018
era+0.017
mmmm+0.017
Neg logit contributions
 supposed-0.012
 way-0.011
 having-0.011
 around-0.011
rated-0.010
Token so
Token ID523
Feature activation+3.79e-01

Pos logit contributions
rated+0.002
igators+0.002
 attention+0.002
 supposed+0.002
lling+0.002
Neg logit contributions
ra-0.004
era-0.004
iolet-0.004
aw-0.004
mm-0.004
Token sad
Token ID6507
Feature activation-4.31e-01

Pos logit contributions
 supposed+0.005
igators+0.005
 attention+0.005
rated+0.005
lling+0.004
Neg logit contributions
ra-0.014
era-0.014
mm-0.014
iolet-0.013
aw-0.013
Token and
Token ID290
Feature activation+2.45e+00

Pos logit contributions
ra+0.013
era+0.012
mm+0.012
iolet+0.012
ink+0.012
Neg logit contributions
 supposed-0.009
 attention-0.009
igators-0.008
rated-0.008
 having-0.008
Token cried
Token ID16896
Feature activation+8.28e-01

Pos logit contributions
 supposed+0.055
igators+0.053
 attention+0.052
rated+0.051
lling+0.050
Neg logit contributions
ra-0.068
era-0.063
mm-0.062
aw-0.061
iolet-0.060
Token all
Token ID477
Feature activation+6.27e+00

Pos logit contributions
 supposed+0.021
igators+0.021
rated+0.020
 attention+0.020
lling+0.020
Neg logit contributions
ra-0.021
era-0.020
mm-0.019
iolet-0.019
aw-0.018
Token the
Token ID262
Feature activation+2.21e+00

Pos logit contributions
rated+0.193
rying+0.191
 supposed+0.188
lling+0.187
 fastest+0.185
Neg logit contributions
aster-0.080
ad-0.079
ossible-0.069
on-0.068
aw-0.068
Token way
Token ID835
Feature activation+2.62e+00

Pos logit contributions
 supposed+0.047
rated+0.046
igators+0.046
lling+0.045
 sizes+0.042
Neg logit contributions
ra-0.060
era-0.054
 ha-0.054
aw-0.052
ara-0.051
Token home
Token ID1363
Feature activation+7.51e-01

Pos logit contributions
lling+0.071
igators+0.070
rated+0.070
 supposed+0.068
 sizes+0.066
Neg logit contributions
ra-0.058
light-0.052
era-0.052
apy-0.050
ap-0.049
Token.
Token ID13
Feature activation+2.47e-01

Pos logit contributions
igators+0.024
 supposed+0.023
rated+0.022
lling+0.022
 valuable+0.021
Neg logit contributions
ra-0.014
era-0.013
light-0.012
iolet-0.012
mm-0.011
Token She
Token ID1375
Feature activation-1.09e+00

Pos logit contributions
 supposed+0.006
igators+0.006
 having+0.006
rated+0.006
 attention+0.006
Neg logit contributions
ra-0.006
era-0.006
mm-0.006
iolet-0.006
aw-0.005
Token lonely
Token ID21757
Feature activation+5.69e-01

Pos logit contributions
 supposed+0.037
rated+0.037
lling+0.036
igators+0.036
 attention+0.034
Neg logit contributions
ra-0.072
mm-0.067
era-0.067
ink-0.065
ara-0.065
Token bird
Token ID6512
Feature activation-6.85e-01

Pos logit contributions
 supposed+0.013
igators+0.013
 attention+0.012
rated+0.012
 way+0.012
Neg logit contributions
ra-0.016
era-0.015
mm-0.015
iolet-0.014
aw-0.014
Token and
Token ID290
Feature activation+2.24e+00

Pos logit contributions
ra+0.016
era+0.015
aw+0.014
mm+0.014
iolet+0.014
Neg logit contributions
 supposed-0.019
rated-0.018
 attention-0.018
igators-0.018
lling-0.017
Token felt
Token ID2936
Feature activation+1.71e+00

Pos logit contributions
 supposed+0.057
igators+0.055
 attention+0.054
rated+0.054
lling+0.053
Neg logit contributions
ra-0.056
era-0.053
mm-0.052
iolet-0.051
aw-0.050
Token sad
Token ID6507
Feature activation-2.93e-01

Pos logit contributions
rated+0.032
 supposed+0.031
lling+0.029
igators+0.029
 attention+0.028
Neg logit contributions
ra-0.051
aw-0.049
apy-0.047
mm-0.046
era-0.046
Token all
Token ID477
Feature activation+6.19e+00

Pos logit contributions
ra+0.007
era+0.007
mm+0.007
iolet+0.007
aw+0.006
Neg logit contributions
 supposed-0.008
igators-0.007
rated-0.007
 attention-0.007
lling-0.007
Token the
Token ID262
Feature activation+3.54e+00

Pos logit contributions
 goats+0.134
 treadmill+0.133
rated+0.130
 tempting+0.129
rig+0.129
Neg logit contributions
on-0.131
e-0.124
?-0.121
light-0.121
aster-0.118
Token time
Token ID640
Feature activation+2.02e+00

Pos logit contributions
ounded+0.084
rying+0.083
ried+0.082
 supposed+0.082
rated+0.081
Neg logit contributions
aw-0.072
 ha-0.069
 frost-0.069
ra-0.069
ara-0.067
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
 supposed+0.061
rated+0.061
igators+0.059
 attention+0.059
lling+0.057
Neg logit contributions
ra-0.039
aw-0.037
era-0.037
light-0.035
mm-0.035
Token One
Token ID1881
Feature activation+1.96e+00

Pos logit contributions
 supposed+0.041
igators+0.039
 attention+0.039
rated+0.039
lling+0.038
Neg logit contributions
ra-0.033
era-0.031
mm-0.031
iolet-0.030
ara-0.029
Token day
Token ID1110
Feature activation-1.77e+00

Pos logit contributions
 supposed+0.030
lling+0.028
 causing+0.028
igators+0.027
 attention+0.026
Neg logit contributions
ra-0.067
light-0.066
ara-0.066
era-0.065
mm-0.065
Token lonely
Token ID21757
Feature activation+5.77e-01

Pos logit contributions
 supposed+0.039
rated+0.038
lling+0.038
igators+0.038
 sizes+0.036
Neg logit contributions
ra-0.073
mm-0.068
era-0.067
ink-0.066
ara-0.065
Token bird
Token ID6512
Feature activation-6.03e-01

Pos logit contributions
 supposed+0.014
igators+0.013
 attention+0.013
rated+0.013
lling+0.012
Neg logit contributions
ra-0.016
era-0.015
mm-0.015
iolet-0.014
aw-0.014
Token and
Token ID290
Feature activation+2.33e+00

Pos logit contributions
ra+0.014
era+0.013
aw+0.013
iolet+0.012
mm+0.012
Neg logit contributions
 supposed-0.016
igators-0.016
rated-0.016
 attention-0.016
lling-0.015
Token felt
Token ID2936
Feature activation+1.71e+00

Pos logit contributions
 supposed+0.057
igators+0.055
 attention+0.054
rated+0.054
lling+0.053
Neg logit contributions
ra-0.061
era-0.057
mm-0.057
iolet-0.055
aw-0.054
Token sad
Token ID6507
Feature activation-3.48e-01

Pos logit contributions
rated+0.033
 supposed+0.031
igators+0.030
lling+0.030
 attention+0.028
Neg logit contributions
ra-0.050
aw-0.048
apy-0.046
era-0.046
mm-0.045
Token all
Token ID477
Feature activation+6.15e+00

Pos logit contributions
ra+0.009
era+0.008
mm+0.008
iolet+0.008
aw+0.008
Neg logit contributions
 supposed-0.009
igators-0.009
rated-0.008
 attention-0.008
lling-0.008
Token the
Token ID262
Feature activation+3.56e+00

Pos logit contributions
 goats+0.133
 treadmill+0.127
 harming+0.127
 tempting+0.126
 supposed+0.125
Neg logit contributions
on-0.132
e-0.126
light-0.124
?-0.123
aw-0.122
Token time
Token ID640
Feature activation+2.04e+00

Pos logit contributions
rying+0.084
ounded+0.083
ried+0.082
rated+0.082
 supposed+0.082
Neg logit contributions
aw-0.072
ra-0.069
 ha-0.069
ara-0.068
 frost-0.067
Token.
Token ID13
Feature activation+1.39e+00

Pos logit contributions
 supposed+0.060
rated+0.060
igators+0.059
 attention+0.058
lling+0.057
Neg logit contributions
ra-0.041
aw-0.039
era-0.039
mm-0.037
light-0.037
Token One
Token ID1881
Feature activation+1.86e+00

Pos logit contributions
 supposed+0.039
igators+0.037
 attention+0.037
rated+0.037
lling+0.036
Neg logit contributions
ra-0.032
era-0.030
mm-0.030
iolet-0.029
aw-0.028
Token day
Token ID1110
Feature activation-1.92e+00

Pos logit contributions
 supposed+0.027
lling+0.025
igators+0.025
 causing+0.025
 attention+0.024
Neg logit contributions
ra-0.064
light-0.063
era-0.063
mm-0.063
ara-0.063
Token mom
Token ID1995
Feature activation-1.30e+00

Pos logit contributions
 supposed+0.025
igators+0.025
 attention+0.024
rated+0.024
lling+0.024
Neg logit contributions
ra-0.015
era-0.014
mm-0.014
iolet-0.013
aw-0.013
Token smiled
Token ID13541
Feature activation+3.98e-01

Pos logit contributions
ra+0.032
mm+0.029
era+0.029
iolet+0.029
mmm+0.027
Neg logit contributions
 supposed-0.033
igators-0.032
 way-0.031
rated-0.031
lling-0.031
Token and
Token ID290
Feature activation+9.19e-01

Pos logit contributions
 supposed+0.007
 attention+0.007
 around+0.007
rated+0.006
 old+0.006
Neg logit contributions
ra-0.013
mm-0.013
era-0.013
iolet-0.012
mmm-0.012
Token explained
Token ID4893
Feature activation+9.94e-01

Pos logit contributions
 supposed+0.025
igators+0.024
 attention+0.023
 way+0.023
 having+0.023
Neg logit contributions
ra-0.022
era-0.020
mm-0.020
iolet-0.019
aw-0.018
Token that
Token ID326
Feature activation+2.09e+00

Pos logit contributions
 supposed+0.029
rated+0.028
igators+0.028
lling+0.027
 attention+0.026
Neg logit contributions
ra-0.021
aw-0.020
era-0.020
mm-0.020
iolet-0.019
Token nature
Token ID3450
Feature activation+6.12e+00

Pos logit contributions
rated+0.059
 supposed+0.058
lling+0.054
 attention+0.052
ping+0.051
Neg logit contributions
mm-0.046
ra-0.044
light-0.043
aw-0.042
era-0.041
Token is
Token ID318
Feature activation+5.89e-01

Pos logit contributions
 supposed+0.199
 attention+0.192
rated+0.188
 harming+0.181
ping+0.177
Neg logit contributions
ra-0.097
ink-0.095
aw-0.094
light-0.091
era-0.082
Token all
Token ID477
Feature activation+3.73e+00

Pos logit contributions
rated+0.012
lling+0.012
igators+0.011
ping+0.011
ooters+0.011
Neg logit contributions
ra-0.015
light-0.014
M-0.014
era-0.014
mmm-0.013
Token around
Token ID1088
Feature activation+1.40e+00

Pos logit contributions
rated+0.084
 treadmill+0.081
ooters+0.073
ping+0.073
lling+0.072
Neg logit contributions
light-0.086
aw-0.079
ra-0.074
ap-0.074
?-0.073
Token us
Token ID514
Feature activation+1.99e+00

Pos logit contributions
 supposed+0.037
rated+0.036
igators+0.034
lling+0.034
 attention+0.034
Neg logit contributions
ra-0.033
light-0.030
aw-0.029
era-0.029
mm-0.029
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
rated+0.054
igators+0.054
 supposed+0.053
lling+0.053
 causing+0.050
Neg logit contributions
ra-0.040
light-0.039
aw-0.036
ap-0.035
ink-0.035
Token Sarah
Token ID10490
Feature activation+1.31e+00

Pos logit contributions
ra+0.008
era+0.008
mm+0.007
iolet+0.007
ara+0.007
Neg logit contributions
 supposed-0.011
igators-0.011
rated-0.011
lling-0.011
 attention-0.011
Token was
Token ID373
Feature activation-6.54e-01

Pos logit contributions
 supposed+0.033
igators+0.032
rated+0.031
 attention+0.031
lling+0.030
Neg logit contributions
ra-0.034
era-0.032
mm-0.031
iolet-0.031
aw-0.030
Token scared
Token ID12008
Feature activation+6.97e-01

Pos logit contributions
ra+0.019
era+0.019
aw+0.018
iolet+0.017
ink+0.017
Neg logit contributions
igators-0.014
rated-0.014
 attention-0.013
 sizes-0.012
 supposed-0.012
Token and
Token ID290
Feature activation+2.29e+00

Pos logit contributions
igators+0.020
 supposed+0.020
rated+0.019
lling+0.018
 valuable+0.018
Neg logit contributions
ra-0.015
era-0.014
aw-0.014
ara-0.013
iolet-0.013
Token happy
Token ID3772
Feature activation-2.58e+00

Pos logit contributions
 supposed+0.062
igators+0.061
 attention+0.059
rated+0.058
 valuable+0.057
Neg logit contributions
ra-0.054
era-0.049
aw-0.047
mm-0.045
iolet-0.045
Token all
Token ID477
Feature activation+6.07e+00

Pos logit contributions
ra+0.072
mm+0.068
era+0.068
iolet+0.066
ara+0.064
Neg logit contributions
 supposed-0.060
 attention-0.057
igators-0.056
rated-0.055
 way-0.054
Token at
Token ID379
Feature activation-6.88e-01

Pos logit contributions
 supposed+0.172
 harming+0.163
rated+0.161
 valuable+0.161
 unnecessary+0.160
Neg logit contributions
aw-0.104
on-0.101
light-0.100
era-0.098
ara-0.097
Token once
Token ID1752
Feature activation-6.27e-01

Pos logit contributions
ra+0.015
era+0.014
on+0.013
mm+0.013
iolet+0.012
Neg logit contributions
rated-0.021
igators-0.019
 sizes-0.018
 valuable-0.018
ooters-0.018
Token!
Token ID0
Feature activation-3.33e-01

Pos logit contributions
ra+0.015
era+0.014
mm+0.013
iolet+0.013
aw+0.013
Neg logit contributions
 supposed-0.017
igators-0.017
rated-0.016
 attention-0.016
lling-0.016
Token\n
Token ID198
Feature activation+5.43e-01

Pos logit contributions
ra+0.008
era+0.007
mm+0.007
iolet+0.007
ara+0.007
Neg logit contributions
 supposed-0.009
igators-0.009
rated-0.009
 attention-0.009
lling-0.009
Token\n
Token ID198
Feature activation-2.46e-02

Pos logit contributions
igators+0.016
rated+0.016
 supposed+0.015
 attention+0.015
lling+0.015
Neg logit contributions
ra-0.011
era-0.011
mm-0.011
iolet-0.010
ara-0.010
Token because
Token ID780
Feature activation+4.32e+00

Pos logit contributions
rated+0.051
 supposed+0.049
igators+0.048
 attention+0.047
lling+0.047
Neg logit contributions
ra-0.035
era-0.033
aw-0.032
iolet-0.031
light-0.031
Token she
Token ID673
Feature activation+1.06e+00

Pos logit contributions
 supposed+0.128
rated+0.128
 attention+0.123
igators+0.121
lling+0.120
Neg logit contributions
ra-0.088
era-0.084
mm-0.083
iolet-0.079
aw-0.077
Token was
Token ID373
Feature activation+3.51e-01

Pos logit contributions
 supposed+0.022
igators+0.021
rated+0.021
 attention+0.021
lling+0.020
Neg logit contributions
ra-0.032
era-0.030
mm-0.030
iolet-0.029
aw-0.028
Token so
Token ID523
Feature activation+1.39e+00

Pos logit contributions
rated+0.009
igators+0.008
 attention+0.007
lling+0.007
 sizes+0.007
Neg logit contributions
ra-0.010
era-0.009
iolet-0.009
aw-0.009
 ha-0.008
Token happy
Token ID3772
Feature activation-1.94e+00

Pos logit contributions
rated+0.024
igators+0.023
 supposed+0.023
 attention+0.023
lling+0.022
Neg logit contributions
ra-0.046
era-0.044
mm-0.043
iolet-0.042
aw-0.042
Token all
Token ID477
Feature activation+6.06e+00

Pos logit contributions
ra+0.053
era+0.050
mm+0.049
iolet+0.048
aw+0.047
Neg logit contributions
 supposed-0.045
igators-0.045
rated-0.044
 attention-0.043
lling-0.042
Token the
Token ID262
Feature activation+3.11e+00

Pos logit contributions
 harming+0.158
rated+0.158
itionally+0.149
 supposed+0.149
 winding+0.147
Neg logit contributions
aw-0.121
on-0.110
era-0.103
room-0.099
ara-0.099
Token time
Token ID640
Feature activation+2.04e+00

Pos logit contributions
rated+0.080
rying+0.071
ried+0.069
 causing+0.069
ounded+0.068
Neg logit contributions
ra-0.066
era-0.064
P-0.062
 frost-0.062
 ha-0.062
Token.
Token ID13
Feature activation+9.82e-01

Pos logit contributions
 supposed+0.055
rated+0.054
igators+0.054
 attention+0.052
lling+0.051
Neg logit contributions
ra-0.047
era-0.045
mm-0.043
aw-0.042
iolet-0.042
Token But
Token ID887
Feature activation+4.31e-01

Pos logit contributions
 supposed+0.026
igators+0.025
 attention+0.024
rated+0.024
lling+0.024
Neg logit contributions
ra-0.024
era-0.023
mm-0.022
iolet-0.022
aw-0.021
Token Joy
Token ID14087
Feature activation+9.35e-01

Pos logit contributions
 supposed+0.010
igators+0.010
 having+0.010
 attention+0.010
 way+0.010
Neg logit contributions
ra-0.011
mm-0.010
iolet-0.010
era-0.010
aw-0.010
Token mom
Token ID1995
Feature activation-1.52e+00

Pos logit contributions
 supposed+0.003
igators+0.003
rated+0.003
lling+0.003
 attention+0.003
Neg logit contributions
ra-0.002
era-0.001
mm-0.001
iolet-0.001
ara-0.001
Token smiled
Token ID13541
Feature activation-2.18e-02

Pos logit contributions
ra+0.040
mm+0.039
iolet+0.036
mmm+0.036
era+0.036
Neg logit contributions
igators-0.034
lling-0.034
 supposed-0.034
 way-0.034
 having-0.033
Token and
Token ID290
Feature activation+6.31e-01

Pos logit contributions
ra+0.001
mm+0.001
era+0.001
iolet+0.001
mmm+0.001
Neg logit contributions
 around-0.000
 attention-0.000
 old-0.000
 supposed-0.000
 valuable-0.000
Token explained
Token ID4893
Feature activation+8.01e-01

Pos logit contributions
 supposed+0.017
igators+0.016
 attention+0.016
 way+0.016
 having+0.016
Neg logit contributions
ra-0.015
mm-0.014
era-0.014
iolet-0.014
ara-0.013
Token that
Token ID326
Feature activation+2.16e+00

Pos logit contributions
 supposed+0.022
rated+0.022
igators+0.021
lling+0.021
 way+0.020
Neg logit contributions
ra-0.018
aw-0.017
era-0.017
mm-0.017
iolet-0.016
Token nature
Token ID3450
Feature activation+6.02e+00

Pos logit contributions
rated+0.059
 supposed+0.057
lling+0.052
 attention+0.051
igators+0.050
Neg logit contributions
ra-0.050
mm-0.049
light-0.048
era-0.046
aw-0.046
Token is
Token ID318
Feature activation+5.19e-01

Pos logit contributions
 supposed+0.185
 attention+0.178
rated+0.177
ping+0.167
 harming+0.165
Neg logit contributions
ra-0.110
ink-0.101
aw-0.099
light-0.096
era-0.094
Token powerful
Token ID3665
Feature activation-2.58e-01

Pos logit contributions
rated+0.011
igators+0.010
ping+0.010
lling+0.010
ooters+0.009
Neg logit contributions
ra-0.014
era-0.013
M-0.013
light-0.012
mmm-0.012
Token,
Token ID11
Feature activation+9.86e-01

Pos logit contributions
ra+0.005
light+0.005
aw+0.004
era+0.004
ap+0.004
Neg logit contributions
 supposed-0.008
rated-0.008
lling-0.007
igators-0.007
ping-0.007
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
rated+0.024
 supposed+0.023
igators+0.023
lling+0.022
 attention+0.021
Neg logit contributions
ra-0.025
mm-0.024
aw-0.024
light-0.023
era-0.023
Token wise
Token ID10787
Feature activation+7.48e-01

Pos logit contributions
rated+0.047
 supposed+0.046
lling+0.043
igators+0.043
ping+0.043
Neg logit contributions
ra-0.040
mm-0.036
light-0.035
aw-0.035
era-0.035
Token cough
Token ID22094
Feature activation+1.06e+00

Pos logit contributions
ra+0.017
era+0.016
mm+0.016
iolet+0.015
aw+0.015
Neg logit contributions
 supposed-0.020
igators-0.019
rated-0.018
 attention-0.018
lling-0.018
Token and
Token ID290
Feature activation+7.55e-01

Pos logit contributions
igators+0.028
 supposed+0.027
rated+0.027
 attention+0.026
lling+0.026
Neg logit contributions
ra-0.025
aw-0.023
era-0.023
light-0.022
iolet-0.022
Token felt
Token ID2936
Feature activation+1.18e+00

Pos logit contributions
 supposed+0.018
igators+0.017
 attention+0.016
 way+0.016
 having+0.016
Neg logit contributions
ra-0.021
era-0.020
iolet-0.019
mm-0.019
ara-0.018
Token so
Token ID523
Feature activation+1.85e+00

Pos logit contributions
rated+0.025
igators+0.024
 supposed+0.024
lling+0.024
 sizes+0.022
Neg logit contributions
ra-0.031
apy-0.030
aw-0.030
mm-0.029
light-0.029
Token tired
Token ID10032
Feature activation+1.85e+00

Pos logit contributions
 supposed+0.033
igators+0.032
rated+0.031
lling+0.031
 attention+0.031
Neg logit contributions
ra-0.059
mm-0.056
apy-0.054
ink-0.054
aw-0.054
Token all
Token ID477
Feature activation+6.00e+00

Pos logit contributions
 supposed+0.053
igators+0.053
rated+0.052
lling+0.050
 attention+0.049
Neg logit contributions
ra-0.038
aw-0.036
era-0.036
mm-0.035
iolet-0.034
Token the
Token ID262
Feature activation+3.14e+00

Pos logit contributions
 supposed+0.156
 valuable+0.149
itionally+0.148
 harming+0.146
lling+0.146
Neg logit contributions
aster-0.118
on-0.114
light-0.112
aw-0.108
ad-0.099
Token time
Token ID640
Feature activation+1.97e+00

Pos logit contributions
igators+0.072
 supposed+0.072
lling+0.070
rated+0.069
rying+0.069
Neg logit contributions
ara-0.066
 frost-0.066
mm-0.066
aw-0.064
light-0.064
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
rated+0.059
igators+0.058
 supposed+0.058
 attention+0.056
lling+0.055
Neg logit contributions
ra-0.039
era-0.036
aw-0.036
light-0.035
mm-0.035
Token He
Token ID679
Feature activation-5.92e-01

Pos logit contributions
 supposed+0.032
igators+0.031
rated+0.030
 attention+0.030
lling+0.030
Neg logit contributions
ra-0.026
era-0.024
mm-0.024
iolet-0.023
ara-0.023
Token was
Token ID373
Feature activation-1.79e-01

Pos logit contributions
ra+0.017
mm+0.017
era+0.016
iolet+0.016
mmm+0.016
Neg logit contributions
 supposed-0.013
rated-0.013
igators-0.012
 way-0.012
 attention-0.012
Token pill
Token ID9582
Feature activation+1.70e+00

Pos logit contributions
 supposed+0.037
 way+0.036
 attention+0.036
 valuable+0.035
igators+0.035
Neg logit contributions
ra-0.055
era-0.053
mm-0.052
aw-0.051
iolet-0.051
Token was
Token ID373
Feature activation+1.12e-01

Pos logit contributions
 supposed+0.040
igators+0.039
 attention+0.038
rated+0.038
lling+0.037
Neg logit contributions
ra-0.046
era-0.043
mm-0.043
iolet-0.042
aw-0.041
Token so
Token ID523
Feature activation+1.80e+00

Pos logit contributions
rated+0.002
igators+0.002
lling+0.002
 attention+0.002
 supposed+0.002
Neg logit contributions
ra-0.003
light-0.003
ink-0.003
era-0.003
mm-0.003
Token lazy
Token ID16931
Feature activation+1.84e+00

Pos logit contributions
 supposed+0.031
igators+0.029
rated+0.029
lling+0.028
 attention+0.028
Neg logit contributions
ra-0.061
mm-0.058
era-0.057
ink-0.055
ara-0.055
Token that
Token ID326
Feature activation+2.45e+00

Pos logit contributions
 supposed+0.051
rated+0.051
igators+0.050
 attention+0.050
lling+0.049
Neg logit contributions
ra-0.041
era-0.038
mm-0.038
aw-0.037
iolet-0.037
Token all
Token ID477
Feature activation+5.98e+00

Pos logit contributions
 supposed+0.065
igators+0.063
rated+0.062
 attention+0.062
lling+0.060
Neg logit contributions
ra-0.059
era-0.055
mm-0.055
iolet-0.053
aw-0.051
Token it
Token ID340
Feature activation+1.44e+00

Pos logit contributions
 supposed+0.159
rated+0.159
 attention+0.150
lling+0.148
 valuable+0.145
Neg logit contributions
light-0.124
ra-0.118
mm-0.117
on-0.116
aw-0.116
Token did
Token ID750
Feature activation+8.97e-01

Pos logit contributions
 supposed+0.040
 attention+0.039
rated+0.038
igators+0.037
lling+0.036
Neg logit contributions
ra-0.033
era-0.030
light-0.028
mm-0.028
aw-0.028
Token was
Token ID373
Feature activation+5.77e-01

Pos logit contributions
igators+0.025
lling+0.025
rated+0.025
 supposed+0.024
 attention+0.023
Neg logit contributions
ra-0.018
era-0.016
mm-0.015
light-0.015
apy-0.015
Token swim
Token ID9422
Feature activation+4.13e+00

Pos logit contributions
lling+0.014
rated+0.014
igators+0.014
 automobiles+0.013
 sizes+0.013
Neg logit contributions
ra-0.012
light-0.011
era-0.011
 please-0.010
apy-0.010
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
 supposed+0.100
igators+0.092
 attention+0.090
rated+0.087
lling+0.087
Neg logit contributions
ra-0.110
aw-0.103
era-0.103
iolet-0.099
light-0.098
Token a
Token ID257
Feature activation-4.74e-01

Pos logit contributions
ra+0.045
era+0.042
mm+0.042
iolet+0.041
ara+0.039
Neg logit contributions
 supposed-0.046
igators-0.043
 attention-0.042
 having-0.042
 valuable-0.041
Token smart
Token ID4451
Feature activation+1.19e+00

Pos logit contributions
ra+0.016
era+0.015
mm+0.015
iolet+0.015
aw+0.015
Neg logit contributions
 supposed-0.008
igators-0.007
 attention-0.007
rated-0.007
 way-0.007
Token cat
Token ID3797
Feature activation-1.34e+00

Pos logit contributions
 supposed+0.027
 attention+0.025
igators+0.025
rated+0.025
lling+0.025
Neg logit contributions
ra-0.034
era-0.032
mm-0.031
ara-0.031
iolet-0.031
Token named
Token ID3706
Feature activation+1.71e-01

Pos logit contributions
ra+0.033
mm+0.031
era+0.030
iolet+0.029
mmm+0.028
Neg logit contributions
 supposed-0.036
igators-0.035
 having-0.033
 around-0.033
 way-0.032
Token Wh
Token ID854
Feature activation+1.71e+00

Pos logit contributions
 supposed+0.005
rated+0.005
lling+0.005
igators+0.005
 attention+0.005
Neg logit contributions
ra-0.004
era-0.004
mm-0.004
ara-0.003
iolet-0.003
Tokenisk
Token ID1984
Feature activation+5.98e+00

Pos logit contributions
 supposed+0.028
igators+0.028
 attention+0.027
rated+0.027
lling+0.026
Neg logit contributions
ra-0.058
era-0.056
mm-0.055
iolet-0.055
ara-0.054
Tokeners
Token ID364
Feature activation+1.88e+00

Pos logit contributions
rated+0.171
 attention+0.167
 supposed+0.165
 valuable+0.160
 way+0.160
Neg logit contributions
ra-0.128
iolet-0.126
era-0.125
aw-0.121
room-0.118
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
 supposed+0.042
igators+0.039
 having+0.038
 way+0.038
 attention+0.038
Neg logit contributions
ra-0.053
mm-0.049
era-0.049
iolet-0.047
mmm-0.047
Token Wh
Token ID854
Feature activation+1.33e+00

Pos logit contributions
 supposed+0.041
lling+0.039
rated+0.039
igators+0.039
 attention+0.038
Neg logit contributions
ra-0.042
mm-0.042
era-0.039
iolet-0.038
ara-0.038
Tokenisk
Token ID1984
Feature activation+5.67e+00

Pos logit contributions
 attention+0.026
igators+0.025
rated+0.024
 supposed+0.024
 valuable+0.024
Neg logit contributions
ra-0.041
mm-0.041
era-0.040
iolet-0.040
ara-0.040
Tokeners
Token ID364
Feature activation+1.87e+00

Pos logit contributions
 attention+0.164
rated+0.163
 supposed+0.159
 way+0.156
 valuable+0.152
Neg logit contributions
ra-0.117
iolet-0.116
era-0.114
aw-0.111
room-0.109
Token tree
Token ID5509
Feature activation-7.88e-01

Pos logit contributions
ra+0.005
era+0.004
mm+0.004
iolet+0.004
aw+0.004
Neg logit contributions
 supposed-0.003
igators-0.003
rated-0.003
 attention-0.003
 way-0.003
Token.
Token ID13
Feature activation+8.47e-01

Pos logit contributions
ra+0.019
era+0.018
mm+0.017
iolet+0.017
aw+0.017
Neg logit contributions
 supposed-0.021
igators-0.021
 attention-0.020
rated-0.020
lling-0.020
Token This
Token ID770
Feature activation+6.35e-01

Pos logit contributions
 supposed+0.021
igators+0.020
rated+0.019
 having+0.019
 way+0.019
Neg logit contributions
ra-0.022
era-0.021
iolet-0.020
mm-0.020
aw-0.020
Token tree
Token ID5509
Feature activation+2.34e-01

Pos logit contributions
 supposed+0.014
igators+0.013
 attention+0.013
rated+0.013
lling+0.013
Neg logit contributions
ra-0.019
era-0.018
mm-0.017
iolet-0.017
aw-0.017
Token loved
Token ID6151
Feature activation+2.67e+00

Pos logit contributions
 supposed+0.006
 attention+0.006
rated+0.006
igators+0.006
lling+0.006
Neg logit contributions
ra-0.005
mm-0.005
era-0.005
aw-0.005
ara-0.005
Token nature
Token ID3450
Feature activation+5.98e+00

Pos logit contributions
rated+0.073
lling+0.071
igators+0.063
 attention+0.062
 supposed+0.061
Neg logit contributions
ra-0.065
aw-0.062
light-0.058
mm-0.056
era-0.056
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
 supposed+0.183
rated+0.177
 attention+0.174
 harming+0.172
lling+0.166
Neg logit contributions
aw-0.121
light-0.112
era-0.106
mm-0.104
ink-0.103
Token enjoyed
Token ID8359
Feature activation+9.00e-02

Pos logit contributions
 supposed+0.052
lling+0.052
rated+0.050
igators+0.050
 attention+0.049
Neg logit contributions
ra-0.045
light-0.042
apy-0.042
mm-0.042
aw-0.041
Token looking
Token ID2045
Feature activation+3.22e+00

Pos logit contributions
rated+0.002
igators+0.002
 supposed+0.002
 attention+0.002
lling+0.002
Neg logit contributions
ra-0.002
aw-0.002
mm-0.002
ink-0.002
M-0.002
Token around
Token ID1088
Feature activation+2.99e+00

Pos logit contributions
 supposed+0.086
lling+0.077
igators+0.076
rated+0.076
 blowing+0.073
Neg logit contributions
aw-0.078
ra-0.075
ap-0.070
um-0.068
aps-0.068
Token it
Token ID340
Feature activation+1.47e+00

Pos logit contributions
 supposed+0.088
igators+0.085
rated+0.084
lling+0.080
 attention+0.079
Neg logit contributions
ra-0.064
era-0.058
mm-0.056
aw-0.055
iolet-0.055
Token happy
Token ID3772
Feature activation-2.56e+00

Pos logit contributions
 supposed+0.010
igators+0.010
rated+0.009
 attention+0.009
lling+0.009
Neg logit contributions
ra-0.030
era-0.029
mm-0.028
iolet-0.028
aw-0.027
Token that
Token ID326
Feature activation+2.44e+00

Pos logit contributions
ra+0.075
era+0.071
mm+0.071
iolet+0.069
aw+0.067
Neg logit contributions
 supposed-0.055
igators-0.052
 attention-0.052
rated-0.050
lling-0.049
Token she
Token ID673
Feature activation+2.23e+00

Pos logit contributions
rated+0.076
 supposed+0.075
igators+0.075
lling+0.072
 attention+0.072
Neg logit contributions
ra-0.048
mm-0.045
era-0.044
iolet-0.042
aw-0.041
Token hugged
Token ID41130
Feature activation+4.22e-01

Pos logit contributions
 supposed+0.051
lling+0.047
igators+0.046
rated+0.046
 having+0.046
Neg logit contributions
ra-0.061
mm-0.060
era-0.060
iolet-0.058
ara-0.056
Token Wh
Token ID854
Feature activation+1.47e+00

Pos logit contributions
 supposed+0.011
igators+0.011
rated+0.011
lling+0.010
 attention+0.010
Neg logit contributions
ra-0.011
era-0.010
mm-0.010
aw-0.010
iolet-0.010
Tokenisk
Token ID1984
Feature activation+5.97e+00

Pos logit contributions
 supposed+0.024
igators+0.024
 attention+0.023
rated+0.023
 way+0.022
Neg logit contributions
ra-0.051
era-0.049
mm-0.048
iolet-0.048
ara-0.047
Tokeners
Token ID364
Feature activation+2.58e+00

Pos logit contributions
 supposed+0.164
rated+0.163
igators+0.156
 way+0.155
 attention+0.154
Neg logit contributions
ra-0.139
iolet-0.130
era-0.130
aw-0.129
mm-0.125
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 supposed+0.067
igators+0.067
rated+0.065
lling+0.060
 attention+0.060
Neg logit contributions
ra-0.063
era-0.060
iolet-0.058
aw-0.057
mm-0.056
Token "
Token ID366
Feature activation+2.71e-01

Pos logit contributions
 supposed+0.029
lling+0.029
igators+0.029
rated+0.029
 attention+0.027
Neg logit contributions
mm-0.022
ra-0.022
era-0.021
iolet-0.021
ara-0.021
TokenThank
Token ID10449
Feature activation-2.32e+00

Pos logit contributions
 supposed+0.007
igators+0.006
lling+0.006
rated+0.006
 having+0.006
Neg logit contributions
ra-0.007
era-0.007
mm-0.007
iolet-0.006
ink-0.006
Token you
Token ID345
Feature activation+3.17e+00

Pos logit contributions
era+0.057
ra+0.056
light+0.053
iolet+0.053
ara+0.052
Neg logit contributions
 supposed-0.055
 attention-0.054
igators-0.053
 having-0.053
 causing-0.051
Token a
Token ID257
Feature activation-5.70e-01

Pos logit contributions
ra+0.055
era+0.052
mm+0.050
iolet+0.049
ink+0.047
Neg logit contributions
 supposed-0.050
igators-0.046
 having-0.046
 valuable-0.043
 important-0.043
Token naughty
Token ID45128
Feature activation+1.12e+00

Pos logit contributions
ra+0.020
era+0.019
mm+0.019
iolet+0.018
aw+0.018
Neg logit contributions
 supposed-0.009
igators-0.008
 valuable-0.008
 way-0.008
 attention-0.008
Token cat
Token ID3797
Feature activation-1.50e+00

Pos logit contributions
 supposed+0.025
igators+0.024
 attention+0.024
rated+0.024
lling+0.023
Neg logit contributions
ra-0.031
era-0.030
mm-0.029
iolet-0.029
aw-0.028
Token named
Token ID3706
Feature activation+4.68e-02

Pos logit contributions
ra+0.038
mm+0.035
era+0.035
iolet+0.033
mmm+0.033
Neg logit contributions
 supposed-0.039
igators-0.039
 having-0.036
 around-0.036
 way-0.036
Token Wh
Token ID854
Feature activation+1.69e+00

Pos logit contributions
 supposed+0.001
rated+0.001
lling+0.001
igators+0.001
 attention+0.001
Neg logit contributions
ra-0.001
era-0.001
mm-0.001
ara-0.001
iolet-0.001
Tokenisk
Token ID1984
Feature activation+5.96e+00

Pos logit contributions
 supposed+0.026
igators+0.025
 attention+0.024
rated+0.024
lling+0.023
Neg logit contributions
ra-0.059
era-0.057
mm-0.056
iolet-0.056
ara-0.054
Tokeners
Token ID364
Feature activation+2.01e+00

Pos logit contributions
rated+0.167
 attention+0.162
 supposed+0.160
 way+0.156
igators+0.155
Neg logit contributions
ra-0.133
iolet-0.128
era-0.128
aw-0.124
mm-0.121
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
 supposed+0.046
igators+0.043
 having+0.042
 attention+0.042
 way+0.042
Neg logit contributions
ra-0.056
mm-0.052
era-0.052
iolet-0.050
mmm-0.049
Token Wh
Token ID854
Feature activation+1.34e+00

Pos logit contributions
 supposed+0.040
lling+0.039
igators+0.038
rated+0.038
 attention+0.038
Neg logit contributions
ra-0.041
mm-0.040
era-0.038
iolet-0.037
ara-0.036
Tokenisk
Token ID1984
Feature activation+5.70e+00

Pos logit contributions
 attention+0.025
igators+0.025
rated+0.024
 supposed+0.024
lling+0.023
Neg logit contributions
ra-0.042
mm-0.041
era-0.041
iolet-0.041
ara-0.040
Tokeners
Token ID364
Feature activation+1.94e+00

Pos logit contributions
rated+0.163
 attention+0.162
 supposed+0.157
 way+0.156
 valuable+0.151
Neg logit contributions
ra-0.120
iolet-0.118
era-0.116
aw-0.114
room-0.110
Token little
Token ID1310
Feature activation+1.59e+00

Pos logit contributions
 supposed+0.060
 attention+0.049
rated+0.047
 sizes+0.047
 blowing+0.047
Neg logit contributions
um-0.036
iolet-0.033
era-0.032
M-0.032
P-0.032
Token girl
Token ID2576
Feature activation-5.35e-01

Pos logit contributions
 supposed+0.052
 valuable+0.043
 attention+0.043
rated+0.043
lling+0.043
Neg logit contributions
ra-0.032
era-0.030
iolet-0.030
ara-0.030
aw-0.028
Token thought
Token ID1807
Feature activation+1.99e+00

Pos logit contributions
mm+0.015
ra+0.015
era+0.014
iolet+0.014
mmm+0.014
Neg logit contributions
 supposed-0.012
 way-0.011
igators-0.011
rated-0.011
lling-0.011
Token maybe
Token ID3863
Feature activation+1.81e+00

Pos logit contributions
rated+0.064
 supposed+0.059
lling+0.057
 sizes+0.057
igators+0.055
Neg logit contributions
aw-0.036
on-0.035
mm-0.035
ra-0.034
apy-0.033
Token the
Token ID262
Feature activation+2.68e+00

Pos logit contributions
 supposed+0.054
rated+0.052
igators+0.050
lling+0.050
 attention+0.049
Neg logit contributions
ra-0.038
mm-0.036
aw-0.034
era-0.034
ara-0.034
Token all
Token ID477
Feature activation+5.95e+00

Pos logit contributions
 supposed+0.083
 attention+0.074
rated+0.072
 blowing+0.068
rying+0.067
Neg logit contributions
era-0.046
iolet-0.046
um-0.046
mm-0.044
P-0.043
Tokenigator
Token ID23823
Feature activation+1.91e+00

Pos logit contributions
 supposed+0.100
 attention+0.090
 harming+0.082
 winding+0.081
rated+0.077
Neg logit contributions
aw-0.193
on-0.174
ra-0.174
era-0.172
light-0.170
Token was
Token ID373
Feature activation+1.40e-01

Pos logit contributions
 supposed+0.048
igators+0.046
 attention+0.045
rated+0.045
lling+0.044
Neg logit contributions
ra-0.049
era-0.046
mm-0.045
iolet-0.044
aw-0.043
Token too
Token ID1165
Feature activation+1.55e+00

Pos logit contributions
rated+0.003
igators+0.003
ooters+0.003
 treadmill+0.003
 attention+0.003
Neg logit contributions
ra-0.003
aw-0.003
era-0.003
iolet-0.003
light-0.003
Token stubborn
Token ID24444
Feature activation+2.32e-01

Pos logit contributions
 supposed+0.029
 attention+0.029
rated+0.029
igators+0.029
 causing+0.027
Neg logit contributions
ra-0.047
aw-0.045
ara-0.044
era-0.043
ink-0.043
Token to
Token ID284
Feature activation-8.10e-02

Pos logit contributions
igators+0.007
rated+0.007
 supposed+0.006
lling+0.006
 winding+0.006
Neg logit contributions
ra-0.004
aw-0.004
era-0.004
iolet-0.004
ara-0.004
Token j
Token ID474
Feature activation+1.75e+00

Pos logit contributions
ra+0.011
era+0.011
mm+0.011
iolet+0.011
aw+0.010
Neg logit contributions
 supposed-0.006
igators-0.005
rated-0.005
 attention-0.005
 way-0.005
Tokenolly
Token ID5098
Feature activation+9.56e-01

Pos logit contributions
 supposed+0.069
rated+0.067
igators+0.066
 attention+0.066
lling+0.065
Neg logit contributions
ra-0.020
era-0.018
mm-0.017
iolet-0.016
ara-0.016
Token cat
Token ID3797
Feature activation-1.20e+00

Pos logit contributions
 supposed+0.021
igators+0.020
 attention+0.019
rated+0.019
lling+0.019
Neg logit contributions
ra-0.028
era-0.026
mm-0.026
iolet-0.025
aw-0.025
Token named
Token ID3706
Feature activation+2.18e-01

Pos logit contributions
ra+0.030
mm+0.028
era+0.028
iolet+0.027
mmm+0.026
Neg logit contributions
 supposed-0.031
igators-0.030
 having-0.029
 around-0.029
 way-0.028
Token Wh
Token ID854
Feature activation+1.77e+00

Pos logit contributions
rated+0.006
lling+0.006
 supposed+0.006
igators+0.006
 attention+0.006
Neg logit contributions
ra-0.005
ara-0.004
mm-0.004
era-0.004
iolet-0.004
Tokenisk
Token ID1984
Feature activation+5.95e+00

Pos logit contributions
 supposed+0.027
igators+0.026
 attention+0.025
rated+0.025
lling+0.024
Neg logit contributions
ra-0.063
era-0.060
mm-0.060
iolet-0.059
ara-0.057
Tokeners
Token ID364
Feature activation+1.96e+00

Pos logit contributions
rated+0.166
 attention+0.161
 supposed+0.160
 valuable+0.155
igators+0.155
Neg logit contributions
ra-0.134
iolet-0.129
era-0.129
aw-0.124
mm-0.120
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
 supposed+0.046
igators+0.043
 attention+0.042
 having+0.042
rated+0.041
Neg logit contributions
ra-0.054
mm-0.050
era-0.050
iolet-0.048
ara-0.047
Token Wh
Token ID854
Feature activation+1.27e+00

Pos logit contributions
 supposed+0.043
lling+0.043
rated+0.041
igators+0.041
 attention+0.040
Neg logit contributions
mm-0.038
ra-0.038
era-0.034
ara-0.034
iolet-0.034
Tokenisk
Token ID1984
Feature activation+5.54e+00

Pos logit contributions
 attention+0.024
igators+0.024
 supposed+0.022
rated+0.022
 valuable+0.021
Neg logit contributions
ra-0.041
mm-0.040
era-0.039
iolet-0.039
ara-0.039
Tokeners
Token ID364
Feature activation+1.85e+00

Pos logit contributions
rated+0.156
 attention+0.156
 supposed+0.153
 way+0.150
 valuable+0.146
Neg logit contributions
ra-0.119
iolet-0.117
era-0.116
aw-0.112
mm-0.109
Token nice
Token ID3621
Feature activation+4.77e-01

Pos logit contributions
 supposed+0.043
igators+0.042
rated+0.041
 attention+0.039
lling+0.039
Neg logit contributions
ra-0.100
era-0.094
mm-0.093
iolet-0.091
ara-0.090
Token and
Token ID290
Feature activation+2.40e+00

Pos logit contributions
 supposed+0.011
igators+0.011
rated+0.011
 attention+0.011
lling+0.010
Neg logit contributions
ra-0.013
era-0.012
mm-0.012
iolet-0.012
aw-0.012
Token kind
Token ID1611
Feature activation-2.15e-03

Pos logit contributions
igators+0.053
 supposed+0.053
rated+0.053
lling+0.052
 attention+0.049
Neg logit contributions
ra-0.066
mm-0.062
era-0.060
aw-0.059
light-0.058
Token,
Token ID11
Feature activation+1.80e+00

Pos logit contributions
ra+0.000
mm+0.000
era+0.000
iolet+0.000
ara+0.000
Neg logit contributions
 supposed-0.000
igators-0.000
 attention-0.000
rated-0.000
 way-0.000
Token so
Token ID523
Feature activation+1.85e+00

Pos logit contributions
rated+0.043
igators+0.042
 supposed+0.042
lling+0.040
 attention+0.039
Neg logit contributions
ra-0.049
mm-0.045
era-0.044
aw-0.044
iolet-0.042
Token all
Token ID477
Feature activation+5.93e+00

Pos logit contributions
 supposed+0.041
 attention+0.040
 valuable+0.040
 having+0.040
 important+0.039
Neg logit contributions
ra-0.052
era-0.050
iolet-0.048
mm-0.047
aw-0.045
Token the
Token ID262
Feature activation+3.01e+00

Pos logit contributions
 supposed+0.177
rated+0.176
lling+0.165
 treadmill+0.165
 winding+0.164
Neg logit contributions
ra-0.097
light-0.094
aw-0.092
mm-0.090
on-0.089
Token other
Token ID584
Feature activation+2.54e+00

Pos logit contributions
 supposed+0.079
rated+0.075
igators+0.074
lling+0.070
 causing+0.069
Neg logit contributions
ra-0.073
era-0.066
ara-0.066
mm-0.064
iolet-0.063
Token creatures
Token ID8109
Feature activation+3.19e+00

Pos logit contributions
 supposed+0.050
igators+0.048
rated+0.046
 attention+0.045
lling+0.043
Neg logit contributions
ra-0.079
era-0.075
mm-0.074
iolet-0.073
aw-0.073
Token in
Token ID287
Feature activation+1.99e+00

Pos logit contributions
 supposed+0.075
rated+0.071
 attention+0.071
igators+0.069
lling+0.067
Neg logit contributions
ra-0.086
era-0.080
aw-0.078
iolet-0.078
mm-0.077
Token the
Token ID262
Feature activation-8.15e-01

Pos logit contributions
 supposed+0.041
igators+0.039
rated+0.039
 attention+0.038
lling+0.037
Neg logit contributions
ra-0.060
era-0.057
mm-0.056
iolet-0.055
aw-0.054
Token ch
Token ID442
Feature activation+2.27e-01

Pos logit contributions
ra+0.028
era+0.027
mm+0.027
iolet+0.026
aw+0.026
Neg logit contributions
 supposed-0.015
igators-0.014
 attention-0.014
 way-0.014
rated-0.014
Tokenubby
Token ID38393
Feature activation-1.20e-01

Pos logit contributions
igators+0.004
 supposed+0.004
 attention+0.004
 old+0.004
 around+0.004
Neg logit contributions
mm-0.007
era-0.007
mmm-0.007
iolet-0.007
light-0.007
Token cat
Token ID3797
Feature activation-1.71e+00

Pos logit contributions
ra+0.003
era+0.003
mm+0.003
iolet+0.003
aw+0.003
Neg logit contributions
 supposed-0.003
igators-0.003
 attention-0.003
rated-0.003
 way-0.003
Token named
Token ID3706
Feature activation+2.15e-01

Pos logit contributions
ra+0.039
era+0.037
mm+0.037
iolet+0.035
ara+0.034
Neg logit contributions
 supposed-0.048
igators-0.046
 having-0.044
 attention-0.044
 way-0.044
Token Wh
Token ID854
Feature activation+1.48e+00

Pos logit contributions
 supposed+0.005
igators+0.005
rated+0.005
 attention+0.005
lling+0.005
Neg logit contributions
ra-0.006
era-0.005
mm-0.005
iolet-0.005
aw-0.005
Tokenisk
Token ID1984
Feature activation+5.93e+00

Pos logit contributions
 supposed+0.023
igators+0.022
 attention+0.022
rated+0.022
lling+0.021
Neg logit contributions
ra-0.051
era-0.050
mm-0.049
iolet-0.048
ara-0.047
Tokeners
Token ID364
Feature activation+2.30e+00

Pos logit contributions
rated+0.177
 attention+0.174
 supposed+0.172
 way+0.170
 valuable+0.168
Neg logit contributions
ra-0.119
iolet-0.118
era-0.114
aw-0.114
room-0.111
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 supposed+0.065
igators+0.065
rated+0.064
 attention+0.064
lling+0.061
Neg logit contributions
ra-0.051
era-0.048
iolet-0.046
mm-0.046
aw-0.045
Token Tim
Token ID5045
Feature activation+6.58e-01

Pos logit contributions
 supposed+0.029
 attention+0.028
rated+0.028
lling+0.028
igators+0.027
Neg logit contributions
ra-0.028
mm-0.028
era-0.027
iolet-0.026
mmm-0.025
Tokenmy
Token ID1820
Feature activation-2.00e-02

Pos logit contributions
 attention+0.015
igators+0.015
 supposed+0.014
 valuable+0.014
lling+0.014
Neg logit contributions
ra-0.018
ara-0.017
era-0.017
aw-0.017
iolet-0.017
Token loved
Token ID6151
Feature activation+1.62e+00

Pos logit contributions
ra+0.001
era+0.001
mm+0.001
iolet+0.001
aw+0.001
Neg logit contributions
 supposed-0.000
igators-0.000
rated-0.000
 attention-0.000
lling-0.000
Token Ch
Token ID609
Feature activation+5.41e-01

Pos logit contributions
ra+0.050
mm+0.047
era+0.047
ink+0.045
iolet+0.045
Neg logit contributions
 having-0.036
 supposed-0.035
igators-0.035
 way-0.034
 around-0.033
Tokenir
Token ID343
Feature activation+1.14e+00

Pos logit contributions
 supposed+0.018
igators+0.017
rated+0.017
 attention+0.016
lling+0.016
Neg logit contributions
ra-0.009
mm-0.009
era-0.009
aw-0.008
iolet-0.008
Tokenpy
Token ID9078
Feature activation+7.72e-01

Pos logit contributions
 supposed+0.020
igators+0.019
rated+0.019
 attention+0.018
lling+0.018
Neg logit contributions
ra-0.038
mm-0.036
era-0.036
iolet-0.035
ara-0.034
Token and
Token ID290
Feature activation+8.09e-01

Pos logit contributions
igators+0.019
 supposed+0.019
rated+0.019
lling+0.018
 attention+0.018
Neg logit contributions
ra-0.020
mm-0.019
era-0.019
iolet-0.018
ara-0.017
Token Wh
Token ID854
Feature activation+1.14e+00

Pos logit contributions
 supposed+0.019
igators+0.019
 attention+0.018
rated+0.018
lling+0.017
Neg logit contributions
ra-0.022
era-0.021
mm-0.020
iolet-0.020
aw-0.019
Tokenisk
Token ID1984
Feature activation+5.92e+00

Pos logit contributions
 supposed+0.019
igators+0.018
rated+0.018
 attention+0.017
lling+0.017
Neg logit contributions
ra-0.039
era-0.037
mm-0.037
iolet-0.036
aw-0.035
Tokeners
Token ID364
Feature activation+1.84e+00

Pos logit contributions
 supposed+0.160
 attention+0.158
rated+0.157
 way+0.154
 valuable+0.151
Neg logit contributions
ra-0.136
era-0.130
iolet-0.129
aw-0.127
mm-0.119
Token knew
Token ID2993
Feature activation+4.86e-01

Pos logit contributions
igators+0.043
 supposed+0.043
rated+0.042
lling+0.041
 way+0.041
Neg logit contributions
ra-0.048
mm-0.046
era-0.045
iolet-0.044
ara-0.043
Token that
Token ID326
Feature activation+1.28e+00

Pos logit contributions
 supposed+0.013
igators+0.013
rated+0.013
lling+0.012
 attention+0.012
Neg logit contributions
ra-0.011
era-0.011
mm-0.010
aw-0.010
iolet-0.010
Token poison
Token ID8764
Feature activation+1.16e+00

Pos logit contributions
rated+0.035
 supposed+0.035
igators+0.034
lling+0.033
 attention+0.031
Neg logit contributions
ra-0.028
light-0.027
era-0.025
mm-0.025
aw-0.025
Token was
Token ID373
Feature activation-2.21e-01

Pos logit contributions
 supposed+0.033
 attention+0.031
igators+0.030
rated+0.030
lling+0.030
Neg logit contributions
ra-0.025
era-0.023
aw-0.023
light-0.022
iolet-0.022
Token Do
Token ID2141
Feature activation-4.00e+00

Pos logit contributions
 supposed+0.004
igators+0.004
 having+0.004
 valuable+0.004
 attention+0.004
Neg logit contributions
ra-0.006
era-0.005
aw-0.005
light-0.005
iolet-0.005
Token you
Token ID345
Feature activation-1.34e-01

Pos logit contributions
ra+0.089
era+0.081
light+0.078
iolet+0.078
mm+0.076
Neg logit contributions
 supposed-0.114
igators-0.110
 attention-0.109
 having-0.106
 valuable-0.106
Token want
Token ID765
Feature activation-1.97e+00

Pos logit contributions
ra+0.004
mm+0.004
era+0.004
iolet+0.004
aw+0.004
Neg logit contributions
 supposed-0.003
rated-0.003
 attention-0.003
igators-0.002
 valuable-0.002
Token to
Token ID284
Feature activation-1.33e+00

Pos logit contributions
ra+0.078
era+0.073
mm+0.072
ink+0.072
iolet+0.071
Neg logit contributions
 supposed-0.024
 attention-0.023
igators-0.022
 old-0.021
 valuable-0.021
Token be
Token ID307
Feature activation-1.98e+00

Pos logit contributions
ra+0.041
mm+0.040
aw+0.039
light+0.039
era+0.039
Neg logit contributions
 attention-0.025
 supposed-0.025
 way-0.025
 having-0.024
 old-0.023
Token happy
Token ID3772
Feature activation-6.11e+00

Pos logit contributions
ra+0.064
mm+0.062
era+0.062
iolet+0.059
ara+0.059
Neg logit contributions
 supposed-0.038
igators-0.035
 attention-0.034
 having-0.033
 valuable-0.033
Token?"
Token ID1701
Feature activation-3.22e+00

Pos logit contributions
ra+0.148
era+0.137
mm+0.132
aw+0.131
iolet+0.128
Neg logit contributions
 having-0.161
 supposed-0.160
 attention-0.159
 around-0.156
igators-0.156
Token
Token ID198
Feature activation-3.26e-01

Pos logit contributions
ra+0.104
light+0.097
era+0.094
ink+0.091
mm+0.091
Neg logit contributions
 having-0.058
 old-0.057
 supposed-0.055
 way-0.054
 attention-0.054
Token
Token ID198
Feature activation-6.93e-01

Pos logit contributions
ra+0.010
light+0.009
era+0.009
mm+0.009
iolet+0.009
Neg logit contributions
 supposed-0.006
 old-0.006
 valuable-0.006
igators-0.006
 having-0.006
TokenS
Token ID50
Feature activation-2.76e+00

Pos logit contributions
light+0.016
ra+0.015
era+0.013
etti+0.013
mm+0.013
Neg logit contributions
 valuable-0.019
igators-0.018
 old-0.018
 supposed-0.018
 important-0.017
Tokenara
Token ID3301
Feature activation-9.67e-01

Pos logit contributions
apy+0.035
light+0.032
 rectangle+0.028
ra+0.027
mmm+0.025
Neg logit contributions
 supposed-0.092
 attention-0.091
 old-0.090
rated-0.089
 way-0.087
Token!
Token ID0
Feature activation-1.35e+00

Pos logit contributions
ra+0.058
era+0.054
iolet+0.053
mm+0.052
aw+0.050
Neg logit contributions
 supposed-0.066
 having-0.063
 attention-0.063
igators-0.062
 around-0.062
Token Can
Token ID1680
Feature activation-3.63e+00

Pos logit contributions
ra+0.037
era+0.035
iolet+0.034
light+0.034
aw+0.033
Neg logit contributions
 supposed-0.030
igators-0.029
 having-0.028
 valuable-0.027
 attention-0.027
Token I
Token ID314
Feature activation-2.59e+00

Pos logit contributions
ra+0.091
era+0.088
iolet+0.087
aw+0.084
light+0.083
Neg logit contributions
 having-0.085
igators-0.083
 supposed-0.083
 attention-0.082
 valuable-0.080
Token have
Token ID423
Feature activation-4.11e+00

Pos logit contributions
aw+0.093
iolet+0.089
era+0.088
light+0.085
mm+0.085
Neg logit contributions
 supposed-0.039
 having-0.034
 over-0.034
 around-0.033
 way-0.033
Token a
Token ID257
Feature activation-1.71e+00

Pos logit contributions
era+0.125
ra+0.124
iolet+0.120
mm+0.115
room+0.113
Neg logit contributions
 supposed-0.085
 attention-0.080
 valuable-0.078
 way-0.078
igators-0.077
Token slice
Token ID16416
Feature activation-5.74e+00

Pos logit contributions
ra+0.043
era+0.042
mm+0.042
iolet+0.041
aw+0.040
Neg logit contributions
 supposed-0.043
 way-0.042
 valuable-0.042
 attention-0.041
 old-0.040
Token too
Token ID1165
Feature activation-1.37e+00

Pos logit contributions
ra+0.176
era+0.159
iolet+0.156
ink+0.156
apy+0.154
Neg logit contributions
 over-0.118
 around-0.118
 attention-0.112
 supposed-0.110
 way-0.106
Token?
Token ID30
Feature activation-1.51e+00

Pos logit contributions
ra+0.032
era+0.029
light+0.029
iolet+0.029
mm+0.029
Neg logit contributions
 supposed-0.038
 old-0.036
 important-0.036
rated-0.035
 way-0.035
Tokenâ
Token ID22940
Feature activation-4.51e-01

Pos logit contributions
ra+0.041
era+0.039
mm+0.038
aw+0.038
iolet+0.038
Neg logit contributions
 supposed-0.032
igators-0.032
 having-0.031
 attention-0.031
 valuable-0.031
Token
Token ID26391
Feature activation-2.93e+00

Pos logit contributions
ra+0.012
era+0.011
mm+0.011
iolet+0.011
aw+0.011
Neg logit contributions
 supposed-0.011
igators-0.010
 attention-0.010
rated-0.010
lling-0.010
Token
Token ID198
Feature activation+4.36e-02

Pos logit contributions
ra+0.056
era+0.052
iolet+0.050
mm+0.049
aw+0.044
Neg logit contributions
 supposed-0.090
 way-0.086
 having-0.085
rated-0.083
 around-0.082
Token down
Token ID866
Feature activation-2.17e+00

Pos logit contributions
 supposed+0.078
rated+0.072
igators+0.071
lling+0.068
 valuable+0.066
Neg logit contributions
ra-0.076
era-0.072
aw-0.069
iolet-0.069
ara-0.067
Token.
Token ID13
Feature activation-3.90e-01

Pos logit contributions
ra+0.064
era+0.061
mm+0.059
iolet+0.059
aw+0.057
Neg logit contributions
 supposed-0.047
rated-0.045
igators-0.044
 attention-0.043
lling-0.042
Token
Token ID198
Feature activation-3.68e-01

Pos logit contributions
ra+0.010
era+0.010
mm+0.010
iolet+0.009
aw+0.009
Neg logit contributions
 supposed-0.009
igators-0.009
 attention-0.009
rated-0.009
 valuable-0.009
Token
Token ID198
Feature activation-8.56e-01

Pos logit contributions
ra+0.009
era+0.008
mm+0.008
iolet+0.008
aw+0.008
Neg logit contributions
 supposed-0.010
igators-0.009
 attention-0.009
rated-0.009
lling-0.009
Token"
Token ID1
Feature activation-1.11e+00

Pos logit contributions
ra+0.020
mm+0.019
era+0.018
iolet+0.017
light+0.017
Neg logit contributions
igators-0.023
 valuable-0.022
 supposed-0.022
 having-0.022
 around-0.021
TokenDid
Token ID11633
Feature activation-5.69e+00

Pos logit contributions
ra+0.028
mm+0.026
ink+0.026
mmm+0.025
era+0.024
Neg logit contributions
igators-0.027
lling-0.027
 supposed-0.026
rated-0.026
 attention-0.025
Token you
Token ID345
Feature activation-1.09e+00

Pos logit contributions
ra+0.114
era+0.105
mm+0.104
iolet+0.100
aw+0.096
Neg logit contributions
 supposed-0.176
igators-0.170
rated-0.168
 attention-0.167
lling-0.163
Token have
Token ID423
Feature activation-3.46e+00

Pos logit contributions
ra+0.031
mm+0.031
era+0.031
iolet+0.031
ara+0.030
Neg logit contributions
 supposed-0.024
igators-0.023
 way-0.022
 attention-0.022
rated-0.022
Token fun
Token ID1257
Feature activation-1.43e+00

Pos logit contributions
ra+0.119
era+0.115
mm+0.113
iolet+0.111
ara+0.108
Neg logit contributions
 supposed-0.059
igators-0.055
 attention-0.053
 way-0.051
rated-0.051
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
ra+0.038
mm+0.037
era+0.036
iolet+0.035
aw+0.034
Neg logit contributions
 around-0.034
 having-0.033
igators-0.032
 supposed-0.032
 happening-0.031
Token kids
Token ID3988
Feature activation-2.47e+00

Pos logit contributions
ra+0.053
era+0.048
mm+0.048
iolet+0.047
aw+0.046
Neg logit contributions
 supposed-0.060
igators-0.059
 attention-0.058
rated-0.056
 way-0.056
Token cake
Token ID12187
Feature activation-2.58e+00

Pos logit contributions
ra+0.054
era+0.051
mm+0.050
iolet+0.049
aw+0.048
Neg logit contributions
 supposed-0.036
 way-0.035
 valuable-0.035
 attention-0.035
igators-0.035
Token.
Token ID13
Feature activation-6.27e-02

Pos logit contributions
ra+0.079
era+0.073
mm+0.072
iolet+0.071
apy+0.070
Neg logit contributions
 supposed-0.050
 attention-0.050
 way-0.049
 around-0.048
 having-0.048
Token
Token ID198
Feature activation-2.29e-01

Pos logit contributions
ra+0.002
era+0.002
mm+0.002
iolet+0.002
aw+0.002
Neg logit contributions
 supposed-0.001
igators-0.001
 attention-0.001
rated-0.001
 valuable-0.001
Token
Token ID198
Feature activation-7.22e-01

Pos logit contributions
ra+0.005
era+0.005
mm+0.005
iolet+0.005
aw+0.005
Neg logit contributions
 supposed-0.006
igators-0.006
rated-0.006
 attention-0.006
lling-0.006
Token"
Token ID1
Feature activation-5.59e-01

Pos logit contributions
ra+0.015
era+0.014
mm+0.013
iolet+0.013
light+0.013
Neg logit contributions
 supposed-0.021
igators-0.021
 valuable-0.020
rated-0.020
 attention-0.020
TokenDid
Token ID11633
Feature activation-5.68e+00

Pos logit contributions
ra+0.014
era+0.013
mm+0.012
iolet+0.012
ink+0.012
Neg logit contributions
 supposed-0.014
igators-0.014
lling-0.014
rated-0.013
 valuable-0.013
Token you
Token ID345
Feature activation-9.49e-01

Pos logit contributions
ra+0.117
era+0.109
mm+0.107
iolet+0.103
aw+0.100
Neg logit contributions
 supposed-0.172
igators-0.166
rated-0.164
 attention-0.163
lling-0.160
Token like
Token ID588
Feature activation-3.79e-01

Pos logit contributions
ra+0.029
era+0.028
mm+0.028
iolet+0.028
ara+0.028
Neg logit contributions
 supposed-0.019
igators-0.018
 having-0.018
 attention-0.017
 way-0.017
Token the
Token ID262
Feature activation-5.01e-01

Pos logit contributions
ra+0.009
mm+0.009
era+0.008
aw+0.008
iolet+0.008
Neg logit contributions
rated-0.010
 supposed-0.010
lling-0.009
igators-0.009
 attention-0.009
Token tour
Token ID4205
Feature activation-1.71e+00

Pos logit contributions
ra+0.014
era+0.013
mm+0.013
iolet+0.013
aw+0.012
Neg logit contributions
 supposed-0.012
igators-0.011
rated-0.011
 attention-0.011
lling-0.011
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
ra+0.039
era+0.036
iolet+0.034
mm+0.034
apy+0.033
Neg logit contributions
igators-0.047
 supposed-0.047
 way-0.045
 around-0.045
 attention-0.045
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
 attention+0.012
 having+0.012
 old+0.012
 around+0.012
igators+0.011
Neg logit contributions
ra-0.017
era-0.016
light-0.016
iolet-0.015
ink-0.015
Token "
Token ID366
Feature activation+1.86e-01

Pos logit contributions
 supposed+0.021
igators+0.020
rated+0.019
 attention+0.019
lling+0.019
Neg logit contributions
ra-0.034
era-0.032
mm-0.032
iolet-0.031
aw-0.030
TokenHow
Token ID2437
Feature activation-1.40e+00

Pos logit contributions
 valuable+0.004
igators+0.004
 sizes+0.004
 attention+0.004
 supposed+0.004
Neg logit contributions
ra-0.005
ink-0.005
era-0.005
apy-0.005
light-0.005
Token about
Token ID546
Feature activation-1.72e+00

Pos logit contributions
ra+0.051
iolet+0.048
era+0.047
light+0.046
ink+0.046
Neg logit contributions
 supposed-0.022
 valuable-0.021
 important-0.021
 attention-0.019
 old-0.019
Token this
Token ID428
Feature activation-2.13e+00

Pos logit contributions
ra+0.047
era+0.046
iolet+0.045
mm+0.043
light+0.043
Neg logit contributions
 supposed-0.038
igators-0.038
 attention-0.036
 having-0.036
 valuable-0.035
Token mel
Token ID7758
Feature activation-5.68e+00

Pos logit contributions
ra+0.043
era+0.041
iolet+0.040
mm+0.039
light+0.037
Neg logit contributions
 supposed-0.064
 attention-0.061
igators-0.061
rated-0.061
 way-0.060
Tokenon
Token ID261
Feature activation-4.48e+00

Pos logit contributions
ra+0.071
era+0.070
iolet+0.062
apy+0.051
ara+0.050
Neg logit contributions
 supposed-0.215
igators-0.202
 valuable-0.199
 having-0.198
lling-0.198
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
ra+0.084
era+0.080
iolet+0.076
aw+0.070
mm+0.070
Neg logit contributions
 supposed-0.142
igators-0.135
 way-0.134
 attention-0.133
rated-0.133
Token Tom
Token ID4186
Feature activation-2.62e+00

Pos logit contributions
iolet+0.055
ink+0.055
ra+0.055
light+0.054
P+0.054
Neg logit contributions
 old-0.058
 attention-0.058
 having-0.056
 way-0.056
 supposed-0.053
Token?
Token ID30
Feature activation-1.06e+00

Pos logit contributions
ra+0.039
era+0.035
mm+0.034
iolet+0.033
aw+0.031
Neg logit contributions
 supposed-0.094
igators-0.091
rated-0.090
 attention-0.089
lling-0.088
Token It
Token ID632
Feature activation-9.59e-01

Pos logit contributions
iolet+0.031
era+0.030
oph+0.029
aw+0.029
ra+0.028
Neg logit contributions
 having-0.019
 valuable-0.018
igators-0.018
 supposed-0.017
 around-0.017
Token satisfied
Token ID11378
Feature activation-4.58e+00

Pos logit contributions
igators+0.019
 supposed+0.018
rated+0.017
 attention+0.016
lling+0.015
Neg logit contributions
ra-0.036
mm-0.035
era-0.034
aw-0.034
apy-0.033
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
ra+0.153
era+0.142
iolet+0.141
mm+0.136
umber+0.134
Neg logit contributions
 having-0.082
 attention-0.079
 supposed-0.076
 old-0.074
 happening-0.072
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
ra+0.011
era+0.010
mm+0.010
iolet+0.010
aw+0.009
Neg logit contributions
 supposed-0.011
igators-0.010
 attention-0.010
rated-0.010
 valuable-0.010
Token
Token ID198
Feature activation-7.94e-01

Pos logit contributions
ra+0.006
era+0.005
mm+0.005
iolet+0.005
ara+0.005
Neg logit contributions
 supposed-0.008
rated-0.008
igators-0.008
 attention-0.007
lling-0.007
Token"
Token ID1
Feature activation-7.88e-01

Pos logit contributions
ra+0.015
era+0.014
mm+0.014
iolet+0.013
light+0.013
Neg logit contributions
 supposed-0.025
igators-0.024
rated-0.023
lling-0.023
 attention-0.023
TokenDid
Token ID11633
Feature activation-5.66e+00

Pos logit contributions
ra+0.020
era+0.018
ink+0.018
aw+0.017
light+0.017
Neg logit contributions
 supposed-0.019
igators-0.019
lling-0.019
rated-0.018
 having-0.018
Token you
Token ID345
Feature activation-9.12e-01

Pos logit contributions
ra+0.128
era+0.119
iolet+0.117
mm+0.113
aw+0.111
Neg logit contributions
igators-0.156
 supposed-0.155
 attention-0.152
 way-0.147
 having-0.147
Token like
Token ID588
Feature activation-3.75e-01

Pos logit contributions
ra+0.032
iolet+0.031
era+0.031
mm+0.030
ara+0.030
Neg logit contributions
 supposed-0.015
igators-0.014
 having-0.014
 attention-0.013
rated-0.013
Token your
Token ID534
Feature activation-5.69e-01

Pos logit contributions
ra+0.008
mm+0.008
era+0.007
aw+0.007
iolet+0.007
Neg logit contributions
 supposed-0.011
rated-0.011
igators-0.010
 attention-0.010
lling-0.010
Token food
Token ID2057
Feature activation-2.08e+00

Pos logit contributions
ra+0.013
mm+0.012
era+0.012
iolet+0.011
ara+0.011
Neg logit contributions
 supposed-0.016
igators-0.016
rated-0.016
lling-0.015
 attention-0.015
Token?"
Token ID1701
Feature activation-4.54e+00

Pos logit contributions
ra+0.044
era+0.041
iolet+0.038
mm+0.037
aw+0.036
Neg logit contributions
 supposed-0.061
igators-0.061
 way-0.059
 attention-0.059
 old-0.058
Token fun
Token ID1257
Feature activation-1.77e-01

Pos logit contributions
 supposed+0.000
igators+0.000
rated+0.000
lling+0.000
 attention+0.000
Neg logit contributions
ra-0.000
era-0.000
mm-0.000
iolet-0.000
aw-0.000
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
ra+0.004
era+0.004
mm+0.004
iolet+0.004
aw+0.004
Neg logit contributions
 supposed-0.005
igators-0.005
rated-0.004
 attention-0.004
lling-0.004
Token
Token ID198
Feature activation+6.11e-02

Pos logit contributions
 supposed+0.003
igators+0.003
rated+0.003
lling+0.003
 attention+0.003
Neg logit contributions
ra-0.002
era-0.002
mm-0.002
iolet-0.002
ara-0.002
Token
Token ID198
Feature activation-5.36e-01

Pos logit contributions
 supposed+0.002
igators+0.002
rated+0.002
 attention+0.002
lling+0.002
Neg logit contributions
ra-0.001
era-0.001
mm-0.001
iolet-0.001
aw-0.001
Token"
Token ID1
Feature activation-4.51e-01

Pos logit contributions
ra+0.011
era+0.010
mm+0.010
iolet+0.010
light+0.009
Neg logit contributions
 supposed-0.016
igators-0.015
lling-0.015
rated-0.015
 valuable-0.015
TokenDid
Token ID11633
Feature activation-5.65e+00

Pos logit contributions
ra+0.010
mm+0.009
era+0.009
mmm+0.009
ink+0.009
Neg logit contributions
 supposed-0.012
igators-0.012
lling-0.012
rated-0.012
 having-0.012
Token you
Token ID345
Feature activation-1.01e+00

Pos logit contributions
ra+0.122
era+0.114
mm+0.110
iolet+0.110
aw+0.105
Neg logit contributions
 supposed-0.163
igators-0.160
 attention-0.157
rated-0.154
 way-0.152
Token like
Token ID588
Feature activation-3.86e-01

Pos logit contributions
ra+0.032
mm+0.031
era+0.031
iolet+0.031
ara+0.030
Neg logit contributions
 supposed-0.020
igators-0.019
 attention-0.019
rated-0.018
 way-0.018
Token the
Token ID262
Feature activation-1.72e-01

Pos logit contributions
ra+0.008
mm+0.008
aw+0.008
era+0.007
mmm+0.007
Neg logit contributions
rated-0.012
lling-0.011
 supposed-0.011
 attention-0.010
 valuable-0.010
Token dragon
Token ID10441
Feature activation-2.72e+00

Pos logit contributions
ra+0.005
era+0.005
mm+0.005
iolet+0.005
ara+0.005
Neg logit contributions
 supposed-0.004
igators-0.004
 attention-0.003
rated-0.003
 way-0.003
Token slide
Token ID10649
Feature activation-6.75e-01

Pos logit contributions
ra+0.049
era+0.046
mm+0.044
iolet+0.043
aw+0.041
Neg logit contributions
 supposed-0.088
igators-0.086
rated-0.084
 attention-0.084
 way-0.082
Token!
Token ID0
Feature activation-9.33e-01

Pos logit contributions
 supposed+0.029
 attention+0.028
 important+0.026
igators+0.026
 valuable+0.026
Neg logit contributions
ra-0.038
era-0.034
iolet-0.033
light-0.033
ink-0.033
Token Can
Token ID1680
Feature activation-3.96e+00

Pos logit contributions
ra+0.025
era+0.024
aw+0.023
iolet+0.022
mm+0.022
Neg logit contributions
 supposed-0.022
igators-0.021
 having-0.020
 valuable-0.020
 attention-0.019
Token I
Token ID314
Feature activation-3.09e+00

Pos logit contributions
ra+0.103
aw+0.102
era+0.102
iolet+0.099
light+0.095
Neg logit contributions
 having-0.088
igators-0.087
 attention-0.085
 supposed-0.085
 way-0.082
Token have
Token ID423
Feature activation-4.27e+00

Pos logit contributions
aw+0.112
iolet+0.105
era+0.105
mm+0.102
ra+0.101
Neg logit contributions
 supposed-0.047
 way-0.043
 over-0.040
 having-0.040
 important-0.039
Token a
Token ID257
Feature activation-2.30e+00

Pos logit contributions
era+0.136
ra+0.132
iolet+0.130
room+0.124
ink+0.124
Neg logit contributions
 supposed-0.085
 attention-0.080
 having-0.078
 way-0.077
 old-0.076
Token slice
Token ID16416
Feature activation-5.65e+00

Pos logit contributions
ra+0.058
era+0.056
mm+0.055
iolet+0.054
aw+0.053
Neg logit contributions
 supposed-0.058
 way-0.058
 valuable-0.056
 attention-0.056
 old-0.055
Token?"
Token ID1701
Feature activation-2.96e+00

Pos logit contributions
ra+0.187
apy+0.173
era+0.173
ink+0.168
iolet+0.166
Neg logit contributions
 over-0.103
 around-0.101
 day-0.093
 supposed-0.092
 attention-0.091
Token
Token ID198
Feature activation-5.02e-02

Pos logit contributions
ra+0.098
aw+0.089
era+0.089
mm+0.089
ink+0.087
Neg logit contributions
 having-0.054
 way-0.053
 old-0.053
 attention-0.052
 around-0.051
Token
Token ID198
Feature activation-5.90e-01

Pos logit contributions
ra+0.002
mm+0.001
era+0.001
light+0.001
mmm+0.001
Neg logit contributions
 old-0.001
 around-0.001
 valuable-0.001
 supposed-0.001
 having-0.001
TokenAnna
Token ID31160
Feature activation-1.11e+00

Pos logit contributions
ra+0.014
mm+0.014
ink+0.013
mmm+0.013
etti+0.012
Neg logit contributions
 valuable-0.015
 old-0.015
 important-0.015
igators-0.015
 around-0.014
Token nodded
Token ID14464
Feature activation-1.39e+00

Pos logit contributions
mm+0.034
ra+0.032
mmm+0.032
mmmm+0.031
apy+0.031
Neg logit contributions
 way-0.021
 around-0.020
 over-0.020
 old-0.019
 having-0.019
Token maybe
Token ID3863
Feature activation+1.49e-01

Pos logit contributions
ra+0.014
era+0.013
mm+0.013
iolet+0.013
aw+0.012
Neg logit contributions
 supposed-0.016
igators-0.015
rated-0.015
 attention-0.015
lling-0.015
Token,
Token ID11
Feature activation-9.67e-01

Pos logit contributions
 attention+0.003
 way+0.003
 having+0.003
 supposed+0.003
 happening+0.003
Neg logit contributions
iolet-0.004
ra-0.004
aw-0.004
era-0.004
ink-0.004
Token we
Token ID356
Feature activation-8.92e-01

Pos logit contributions
ra+0.022
era+0.020
mm+0.020
iolet+0.019
ara+0.018
Neg logit contributions
 supposed-0.028
rated-0.027
igators-0.027
lling-0.026
 attention-0.025
Token will
Token ID481
Feature activation-1.76e+00

Pos logit contributions
ra+0.024
era+0.023
mm+0.023
iolet+0.022
aw+0.022
Neg logit contributions
 supposed-0.021
igators-0.020
 attention-0.020
rated-0.020
lling-0.019
Token be
Token ID307
Feature activation-1.42e+00

Pos logit contributions
ra+0.049
mm+0.048
era+0.048
iolet+0.047
aw+0.046
Neg logit contributions
 supposed-0.040
 attention-0.037
 way-0.037
igators-0.036
 having-0.036
Token happy
Token ID3772
Feature activation-5.64e+00

Pos logit contributions
ra+0.048
mm+0.046
era+0.046
iolet+0.044
ara+0.043
Neg logit contributions
 supposed-0.025
igators-0.023
 attention-0.023
 having-0.022
 way-0.022
Token."
Token ID526
Feature activation-3.13e+00

Pos logit contributions
ra+0.162
era+0.153
mm+0.151
iolet+0.148
aw+0.144
Neg logit contributions
 supposed-0.125
igators-0.121
 attention-0.119
rated-0.115
 having-0.113
Token
Token ID198
Feature activation-3.35e-01

Pos logit contributions
ra+0.109
ink+0.098
era+0.095
light+0.094
aw+0.094
Neg logit contributions
 having-0.053
 attention-0.049
 old-0.048
igators-0.048
 way-0.046
Token
Token ID198
Feature activation-8.60e-01

Pos logit contributions
ra+0.010
era+0.009
mm+0.009
iolet+0.009
light+0.008
Neg logit contributions
 supposed-0.007
igators-0.007
 attention-0.007
 valuable-0.007
 having-0.007
TokenThe
Token ID464
Feature activation-4.72e-01

Pos logit contributions
ra+0.020
mm+0.018
era+0.017
our+0.017
int+0.017
Neg logit contributions
 valuable-0.021
igators-0.021
 important-0.020
 having-0.020
 blowing-0.020
Token end
Token ID886
Feature activation+1.82e+00

Pos logit contributions
ra+0.015
era+0.014
mm+0.014
iolet+0.014
ara+0.014
Neg logit contributions
 supposed-0.009
igators-0.009
rated-0.008
 attention-0.008
lling-0.008
Token cookies
Token ID14746
Feature activation-5.45e-01

Pos logit contributions
ra+0.019
era+0.018
mm+0.018
iolet+0.018
aw+0.017
Neg logit contributions
 supposed-0.015
igators-0.015
rated-0.014
 attention-0.014
lling-0.014
Token.
Token ID13
Feature activation-4.63e-01

Pos logit contributions
ra+0.012
era+0.011
aw+0.011
iolet+0.010
mm+0.010
Neg logit contributions
rated-0.015
 supposed-0.015
igators-0.015
 valuable-0.014
 attention-0.014
Token
Token ID198
Feature activation-4.42e-01

Pos logit contributions
ra+0.011
era+0.011
mm+0.010
iolet+0.010
aw+0.010
Neg logit contributions
 supposed-0.012
igators-0.012
 attention-0.012
rated-0.012
 way-0.011
Token
Token ID198
Feature activation-9.62e-01

Pos logit contributions
ra+0.010
era+0.010
mm+0.010
iolet+0.009
aw+0.009
Neg logit contributions
 supposed-0.012
igators-0.012
rated-0.011
 attention-0.011
lling-0.011
Token"
Token ID1
Feature activation-9.21e-01

Pos logit contributions
ra+0.020
era+0.019
mm+0.018
light+0.018
iolet+0.018
Neg logit contributions
 supposed-0.028
igators-0.028
 valuable-0.027
 attention-0.026
 having-0.026
TokenDid
Token ID11633
Feature activation-5.63e+00

Pos logit contributions
ra+0.023
era+0.021
mm+0.021
iolet+0.020
ink+0.020
Neg logit contributions
igators-0.023
 supposed-0.023
lling-0.022
rated-0.022
 attention-0.022
Token you
Token ID345
Feature activation-1.16e+00

Pos logit contributions
ra+0.104
mm+0.099
era+0.095
iolet+0.088
ara+0.088
Neg logit contributions
 supposed-0.185
rated-0.178
igators-0.174
lling-0.173
 attention-0.172
Token have
Token ID423
Feature activation-3.55e+00

Pos logit contributions
ra+0.035
era+0.033
mm+0.033
iolet+0.033
aw+0.031
Neg logit contributions
 supposed-0.025
igators-0.024
 attention-0.023
rated-0.023
 way-0.022
Token fun
Token ID1257
Feature activation-1.46e+00

Pos logit contributions
ra+0.104
era+0.098
mm+0.097
iolet+0.095
aw+0.093
Neg logit contributions
 supposed-0.077
igators-0.074
 attention-0.072
rated-0.072
lling-0.069
Token in
Token ID287
Feature activation+1.87e-01

Pos logit contributions
ra+0.041
mm+0.039
era+0.038
iolet+0.037
aw+0.036
Neg logit contributions
 having-0.032
 around-0.032
 supposed-0.031
igators-0.031
 happening-0.030
Token the
Token ID262
Feature activation-2.44e+00

Pos logit contributions
 supposed+0.004
igators+0.004
rated+0.004
 attention+0.004
lling+0.004
Neg logit contributions
ra-0.005
era-0.005
mm-0.005
iolet-0.005
ara-0.005
Token them
Token ID606
Feature activation+7.51e-01

Pos logit contributions
ra+0.041
era+0.039
mm+0.038
iolet+0.037
aw+0.036
Neg logit contributions
 supposed-0.045
igators-0.044
rated-0.043
 attention-0.042
lling-0.042
Token.
Token ID13
Feature activation-7.39e-01

Pos logit contributions
rated+0.021
 supposed+0.021
igators+0.021
lling+0.021
 attention+0.020
Neg logit contributions
ra-0.016
era-0.015
iolet-0.014
aw-0.014
mm-0.014
Token
Token ID198
Feature activation-2.88e-01

Pos logit contributions
ra+0.020
era+0.019
mm+0.018
iolet+0.018
light+0.017
Neg logit contributions
 supposed-0.017
 attention-0.017
igators-0.017
 having-0.016
 valuable-0.016
Token
Token ID198
Feature activation-7.76e-01

Pos logit contributions
ra+0.007
era+0.006
mm+0.006
iolet+0.006
ara+0.006
Neg logit contributions
 supposed-0.008
igators-0.008
rated-0.008
 attention-0.008
lling-0.007
Token"
Token ID1
Feature activation-1.03e+00

Pos logit contributions
ra+0.016
era+0.015
mm+0.015
iolet+0.014
light+0.013
Neg logit contributions
 supposed-0.023
igators-0.023
 valuable-0.022
 attention-0.022
 having-0.022
TokenDid
Token ID11633
Feature activation-5.63e+00

Pos logit contributions
ra+0.024
era+0.022
mm+0.021
ink+0.020
mmm+0.020
Neg logit contributions
 supposed-0.028
igators-0.027
lling-0.026
 attention-0.026
rated-0.026
Token you
Token ID345
Feature activation-1.29e+00

Pos logit contributions
ra+0.107
mm+0.099
era+0.099
iolet+0.092
ara+0.090
Neg logit contributions
 supposed-0.180
rated-0.174
igators-0.173
 attention-0.170
lling-0.169
Token have
Token ID423
Feature activation-3.58e+00

Pos logit contributions
ra+0.041
era+0.040
iolet+0.039
mm+0.039
aw+0.038
Neg logit contributions
 supposed-0.024
igators-0.023
 attention-0.022
rated-0.022
 having-0.022
Token fun
Token ID1257
Feature activation-1.54e+00

Pos logit contributions
ra+0.093
mm+0.087
era+0.086
iolet+0.084
aw+0.083
Neg logit contributions
 supposed-0.088
igators-0.086
rated-0.085
 attention-0.083
lling-0.082
Token?"
Token ID1701
Feature activation-4.48e+00

Pos logit contributions
ra+0.039
era+0.037
mm+0.037
iolet+0.035
aw+0.034
Neg logit contributions
 having-0.037
 supposed-0.037
igators-0.037
 around-0.037
 happening-0.035
Token she
Token ID673
Feature activation-2.33e+00

Pos logit contributions
ra+0.133
era+0.126
mm+0.123
iolet+0.123
light+0.121
Neg logit contributions
 old-0.095
igators-0.091
 having-0.090
 attention-0.090
 around-0.088
Token other
Token ID584
Feature activation+4.33e-01

Pos logit contributions
ra+0.052
era+0.050
iolet+0.049
mm+0.049
ara+0.048
Neg logit contributions
 supposed-0.021
igators-0.020
rated-0.019
 attention-0.018
lling-0.018
Token.
Token ID13
Feature activation-7.69e-01

Pos logit contributions
 supposed+0.014
igators+0.013
rated+0.013
lling+0.013
 causing+0.012
Neg logit contributions
ra-0.008
light-0.008
era-0.008
iolet-0.007
aw-0.007
Token
Token ID198
Feature activation-3.69e-01

Pos logit contributions
ra+0.021
era+0.019
mm+0.019
iolet+0.018
aw+0.018
Neg logit contributions
 supposed-0.019
igators-0.018
 attention-0.018
 having-0.017
 valuable-0.017
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
ra+0.009
era+0.009
mm+0.008
iolet+0.008
aw+0.008
Neg logit contributions
 supposed-0.010
igators-0.009
 attention-0.009
rated-0.009
lling-0.009
Token"
Token ID1
Feature activation-1.03e+00

Pos logit contributions
ra+0.024
era+0.022
mm+0.021
iolet+0.021
light+0.020
Neg logit contributions
 supposed-0.026
igators-0.026
 valuable-0.025
 having-0.025
 attention-0.025
TokenDid
Token ID11633
Feature activation-5.61e+00

Pos logit contributions
ra+0.026
era+0.023
ink+0.023
mm+0.023
mmm+0.022
Neg logit contributions
igators-0.025
 supposed-0.025
lling-0.025
rated-0.024
 attention-0.024
Token you
Token ID345
Feature activation-1.26e+00

Pos logit contributions
ra+0.117
era+0.108
iolet+0.104
mm+0.104
aw+0.100
Neg logit contributions
 supposed-0.166
igators-0.164
 attention-0.161
rated-0.157
 way-0.155
Token like
Token ID588
Feature activation-3.92e-01

Pos logit contributions
ra+0.037
iolet+0.036
era+0.036
mm+0.036
aw+0.035
Neg logit contributions
 supposed-0.027
igators-0.026
 attention-0.025
rated-0.025
 having-0.025
Token the
Token ID262
Feature activation-4.39e-01

Pos logit contributions
ra+0.009
mm+0.009
era+0.009
light+0.008
aw+0.008
Neg logit contributions
rated-0.011
 supposed-0.010
lling-0.010
igators-0.010
 attention-0.009
Token green
Token ID4077
Feature activation-1.73e+00

Pos logit contributions
ra+0.012
era+0.012
mm+0.012
iolet+0.011
aw+0.011
Neg logit contributions
 supposed-0.010
igators-0.010
 attention-0.009
rated-0.009
 way-0.009
Token car
Token ID1097
Feature activation-1.41e+00

Pos logit contributions
ra+0.047
era+0.045
mm+0.044
iolet+0.044
aw+0.043
Neg logit contributions
igators-0.039
 attention-0.038
 supposed-0.038
 way-0.038
rated-0.037
Token a
Token ID257
Feature activation-1.38e+00

Pos logit contributions
era+0.035
iolet+0.034
ra+0.034
mm+0.033
ara+0.032
Neg logit contributions
 attention-0.027
 supposed-0.027
 old-0.025
 valuable-0.025
 around-0.024
Token pearl
Token ID43836
Feature activation-2.69e+00

Pos logit contributions
ra+0.032
era+0.031
mm+0.031
iolet+0.030
aw+0.029
Neg logit contributions
 supposed-0.037
 way-0.035
 attention-0.035
 valuable-0.035
igators-0.035
Token in
Token ID287
Feature activation+3.20e-01

Pos logit contributions
era+0.067
ra+0.067
mm+0.066
iolet+0.066
aw+0.065
Neg logit contributions
 supposed-0.066
 attention-0.062
 way-0.060
 around-0.060
 having-0.060
Token this
Token ID428
Feature activation-3.27e+00

Pos logit contributions
 supposed+0.005
igators+0.005
 way+0.005
 attention+0.005
lling+0.005
Neg logit contributions
ra-0.011
era-0.010
mm-0.010
iolet-0.010
aw-0.010
Token fancy
Token ID14996
Feature activation-3.37e+00

Pos logit contributions
ra+0.092
era+0.088
mm+0.086
iolet+0.085
aw+0.082
Neg logit contributions
 supposed-0.073
igators-0.071
 attention-0.069
rated-0.069
 way-0.068
Token oy
Token ID35104
Feature activation-5.60e+00

Pos logit contributions
mm+0.089
era+0.088
ra+0.086
iolet+0.084
apy+0.083
Neg logit contributions
 way-0.087
 attention-0.077
 old-0.077
 supposed-0.076
 valuable-0.074
Tokenster
Token ID1706
Feature activation-3.34e+00

Pos logit contributions
apy+0.127
era+0.116
room+0.113
iolet+0.112
Jake+0.108
Neg logit contributions
 way-0.149
 attention-0.143
 supposed-0.140
 old-0.139
 valuable-0.138
Token!"
Token ID2474
Feature activation-2.69e+00

Pos logit contributions
ra+0.088
era+0.085
mm+0.084
iolet+0.082
apy+0.079
Neg logit contributions
 supposed-0.079
igators-0.076
 way-0.075
 attention-0.074
rated-0.073
Token His
Token ID2399
Feature activation+1.22e-01

Pos logit contributions
ra+0.088
era+0.079
iolet+0.076
aw+0.076
mm+0.076
Neg logit contributions
 way-0.053
 having-0.052
 attention-0.051
 supposed-0.051
 around-0.050
Token Mom
Token ID11254
Feature activation-1.52e+00

Pos logit contributions
 supposed+0.004
igators+0.004
rated+0.004
lling+0.004
 attention+0.004
Neg logit contributions
ra-0.002
era-0.002
mm-0.002
iolet-0.002
ara-0.002
Token smiled
Token ID13541
Feature activation+1.04e-01

Pos logit contributions
mm+0.042
ra+0.040
iolet+0.039
mmm+0.039
era+0.038
Neg logit contributions
 around-0.033
 having-0.031
 way-0.031
lling-0.031
 supposed-0.031
Token other
Token ID584
Feature activation+2.34e+00

Pos logit contributions
ra+0.010
era+0.009
iolet+0.009
ara+0.009
aw+0.009
Neg logit contributions
 supposed-0.006
rated-0.006
igators-0.006
 happening-0.005
 causing-0.005
Token.
Token ID13
Feature activation-3.53e-01

Pos logit contributions
 supposed+0.075
igators+0.073
rated+0.072
 causing+0.070
lling+0.067
Neg logit contributions
ra-0.045
era-0.042
light-0.040
iolet-0.039
aw-0.038
Token
Token ID198
Feature activation+2.93e-01

Pos logit contributions
ra+0.011
mm+0.010
era+0.010
ink+0.010
aw+0.010
Neg logit contributions
 supposed-0.006
 having-0.006
igators-0.006
 valuable-0.006
 attention-0.006
Token
Token ID198
Feature activation-3.47e-01

Pos logit contributions
 supposed+0.008
igators+0.007
 attention+0.007
rated+0.007
lling+0.007
Neg logit contributions
ra-0.007
era-0.007
mm-0.007
iolet-0.007
aw-0.006
Token"
Token ID1
Feature activation-6.24e-01

Pos logit contributions
ra+0.009
mm+0.008
era+0.008
iolet+0.007
aw+0.007
Neg logit contributions
igators-0.008
 valuable-0.008
 supposed-0.008
 having-0.008
lling-0.008
TokenDid
Token ID11633
Feature activation-5.58e+00

Pos logit contributions
ra+0.017
era+0.015
mm+0.015
ink+0.015
iolet+0.015
Neg logit contributions
 supposed-0.014
igators-0.014
lling-0.014
 valuable-0.014
rated-0.013
Token you
Token ID345
Feature activation-9.24e-01

Pos logit contributions
ra+0.113
era+0.105
mm+0.104
iolet+0.100
aw+0.096
Neg logit contributions
 supposed-0.170
igators-0.164
rated-0.162
 attention-0.161
lling-0.158
Token make
Token ID787
Feature activation-3.02e+00

Pos logit contributions
mm+0.030
ra+0.030
iolet+0.029
era+0.029
ara+0.029
Neg logit contributions
 supposed-0.017
igators-0.016
 attention-0.016
 having-0.015
 around-0.015
Token your
Token ID534
Feature activation-2.99e-01

Pos logit contributions
ra+0.084
era+0.080
mm+0.078
iolet+0.078
ara+0.076
Neg logit contributions
 supposed-0.070
 attention-0.067
 having-0.065
 old-0.065
igators-0.065
Token wish
Token ID4601
Feature activation-2.85e+00

Pos logit contributions
ra+0.006
era+0.006
mm+0.006
iolet+0.006
aw+0.005
Neg logit contributions
 supposed-0.009
igators-0.009
rated-0.008
 attention-0.008
lling-0.008
Token?"
Token ID1701
Feature activation-4.57e+00

Pos logit contributions
ra+0.058
era+0.053
mm+0.052
iolet+0.051
aw+0.049
Neg logit contributions
 supposed-0.087
igators-0.084
 attention-0.084
rated-0.083
 way-0.081
Token them
Token ID606
Feature activation+2.66e+00

Pos logit contributions
ra+0.013
era+0.013
iolet+0.012
mm+0.012
aw+0.012
Neg logit contributions
 supposed-0.013
rated-0.012
igators-0.012
lling-0.012
 valuable-0.012
Token.
Token ID13
Feature activation+6.47e-02

Pos logit contributions
 supposed+0.078
rated+0.075
igators+0.074
lling+0.072
 valuable+0.070
Neg logit contributions
ra-0.058
era-0.056
iolet-0.054
aw-0.052
mm-0.052
Token
Token ID198
Feature activation+1.20e-01

Pos logit contributions
 supposed+0.002
igators+0.002
rated+0.002
 attention+0.002
lling+0.002
Neg logit contributions
ra-0.002
era-0.001
mm-0.001
iolet-0.001
aw-0.001
Token
Token ID198
Feature activation-4.07e-01

Pos logit contributions
 supposed+0.003
igators+0.003
rated+0.003
 attention+0.003
lling+0.003
Neg logit contributions
ra-0.003
era-0.002
mm-0.002
iolet-0.002
ara-0.002
Token"
Token ID1
Feature activation-6.23e-01

Pos logit contributions
ra+0.008
era+0.008
mm+0.008
iolet+0.007
light+0.007
Neg logit contributions
 supposed-0.012
igators-0.012
 attention-0.011
lling-0.011
 valuable-0.011
TokenDid
Token ID11633
Feature activation-5.51e+00

Pos logit contributions
ra+0.015
era+0.014
mm+0.013
ink+0.013
iolet+0.013
Neg logit contributions
 supposed-0.016
igators-0.016
 attention-0.016
lling-0.016
rated-0.016
Token you
Token ID345
Feature activation-1.15e+00

Pos logit contributions
ra+0.101
mm+0.094
era+0.093
iolet+0.087
aw+0.085
Neg logit contributions
 supposed-0.181
rated-0.174
igators-0.173
 attention-0.169
lling-0.169
Token have
Token ID423
Feature activation-3.50e+00

Pos logit contributions
ra+0.035
mm+0.034
era+0.033
iolet+0.033
aw+0.032
Neg logit contributions
 supposed-0.024
igators-0.023
 attention-0.023
rated-0.022
 way-0.022
Token fun
Token ID1257
Feature activation-1.49e+00

Pos logit contributions
ra+0.125
era+0.120
mm+0.118
iolet+0.116
aw+0.113
Neg logit contributions
 supposed-0.054
igators-0.051
 attention-0.049
rated-0.048
 way-0.047
Token,
Token ID11
Feature activation-2.40e+00

Pos logit contributions
ra+0.041
mm+0.039
era+0.038
iolet+0.037
aw+0.036
Neg logit contributions
 having-0.034
 around-0.034
igators-0.034
 supposed-0.033
 attention-0.032
Token kids
Token ID3988
Feature activation-2.31e+00

Pos logit contributions
ra+0.055
era+0.051
mm+0.050
iolet+0.049
aw+0.048
Neg logit contributions
 supposed-0.067
igators-0.065
 attention-0.064
rated-0.063
 way-0.062
Token adventure
Token ID8855
Feature activation+2.91e+00

Pos logit contributions
igators+0.039
 supposed+0.039
rated+0.039
lling+0.038
 causing+0.036
Neg logit contributions
ra-0.048
era-0.044
mm-0.044
ara-0.041
aw-0.040
Token.
Token ID13
Feature activation-6.41e-02

Pos logit contributions
 supposed+0.091
rated+0.090
igators+0.087
 attention+0.083
 valuable+0.082
Neg logit contributions
ra-0.056
era-0.053
aw-0.053
iolet-0.049
mm-0.048
Token
Token ID198
Feature activation+3.50e-01

Pos logit contributions
ra+0.002
era+0.002
mm+0.002
iolet+0.002
aw+0.001
Neg logit contributions
 supposed-0.002
igators-0.002
 attention-0.001
rated-0.001
lling-0.001
Token
Token ID198
Feature activation-1.73e-01

Pos logit contributions
 supposed+0.010
rated+0.010
igators+0.010
 attention+0.009
lling+0.009
Neg logit contributions
ra-0.008
era-0.007
mm-0.007
ara-0.007
aw-0.007
Token"
Token ID1
Feature activation-4.31e-01

Pos logit contributions
ra+0.003
era+0.003
mm+0.003
iolet+0.003
aw+0.003
Neg logit contributions
 supposed-0.006
igators-0.005
rated-0.005
 attention-0.005
lling-0.005
TokenDid
Token ID11633
Feature activation-5.47e+00

Pos logit contributions
ra+0.011
era+0.010
mm+0.010
ink+0.010
mmm+0.010
Neg logit contributions
 supposed-0.010
igators-0.010
lling-0.010
 having-0.010
 causing-0.010
Token you
Token ID345
Feature activation-8.08e-01

Pos logit contributions
ra+0.110
era+0.102
mm+0.100
iolet+0.097
aw+0.093
Neg logit contributions
 supposed-0.168
igators-0.163
rated-0.160
 attention-0.160
lling-0.156
Token have
Token ID423
Feature activation-3.38e+00

Pos logit contributions
ra+0.025
iolet+0.024
mm+0.024
era+0.024
ara+0.023
Neg logit contributions
 supposed-0.016
igators-0.016
 attention-0.015
rated-0.015
 way-0.015
Token fun
Token ID1257
Feature activation-1.30e+00

Pos logit contributions
ra+0.126
era+0.121
mm+0.118
iolet+0.117
ara+0.115
Neg logit contributions
 supposed-0.050
igators-0.046
 attention-0.045
 way-0.044
 having-0.042
Token?"
Token ID1701
Feature activation-4.30e+00

Pos logit contributions
ra+0.036
mm+0.035
era+0.033
ara+0.033
iolet+0.033
Neg logit contributions
 around-0.030
 having-0.030
 supposed-0.028
igators-0.028
 happening-0.027
Token their
Token ID511
Feature activation-5.21e-01

Pos logit contributions
ra+0.121
era+0.114
mm+0.113
iolet+0.113
light+0.111
Neg logit contributions
 old-0.103
 having-0.097
igators-0.096
 attention-0.094
 supposed-0.093
Token't
Token ID470
Feature activation+2.55e-01

Pos logit contributions
ra+0.052
mm+0.049
era+0.047
iolet+0.045
ink+0.045
Neg logit contributions
 supposed-0.072
rated-0.072
 attention-0.071
igators-0.070
lling-0.067
Token you
Token ID345
Feature activation-9.27e-02

Pos logit contributions
 supposed+0.006
 attention+0.006
 having+0.006
igators+0.006
 way+0.006
Neg logit contributions
ra-0.006
era-0.006
iolet-0.006
ink-0.006
aw-0.006
Token share
Token ID2648
Feature activation-2.62e+00

Pos logit contributions
mm+0.003
ra+0.003
era+0.003
iolet+0.003
light+0.003
Neg logit contributions
 supposed-0.002
igators-0.002
 attention-0.002
 way-0.002
rated-0.002
Token and
Token ID290
Feature activation-6.55e-01

Pos logit contributions
ra+0.070
era+0.068
mm+0.065
iolet+0.064
aw+0.062
Neg logit contributions
 supposed-0.061
igators-0.059
 attention-0.059
 way-0.057
 valuable-0.057
Token be
Token ID307
Feature activation-1.15e+00

Pos logit contributions
ra+0.017
era+0.016
mm+0.015
iolet+0.015
aw+0.015
Neg logit contributions
 supposed-0.017
igators-0.016
 attention-0.016
rated-0.016
 way-0.015
Token happy
Token ID3772
Feature activation-5.47e+00

Pos logit contributions
ra+0.039
era+0.037
mm+0.037
iolet+0.036
aw+0.035
Neg logit contributions
 supposed-0.020
igators-0.019
rated-0.018
 attention-0.018
lling-0.017
Token?"
Token ID1701
Feature activation-3.62e+00

Pos logit contributions
ra+0.117
era+0.110
mm+0.105
iolet+0.104
aw+0.101
Neg logit contributions
 supposed-0.161
 attention-0.157
igators-0.156
 having-0.155
 around-0.152
Token
Token ID198
Feature activation-6.49e-01

Pos logit contributions
ra+0.123
light+0.118
ink+0.114
aw+0.113
era+0.112
Neg logit contributions
 having-0.064
 old-0.062
 eager-0.055
 attention-0.054
 way-0.053
Token
Token ID198
Feature activation-1.10e+00

Pos logit contributions
ra+0.023
light+0.021
era+0.021
mm+0.020
iolet+0.020
Neg logit contributions
 old-0.011
 valuable-0.010
 having-0.009
 supposed-0.009
 important-0.009
TokenS
Token ID50
Feature activation-2.36e+00

Pos logit contributions
light+0.030
etti+0.030
ra+0.030
apped+0.027
ifier+0.026
Neg logit contributions
 valuable-0.023
 old-0.022
 important-0.022
 lesson-0.019
 eager-0.019
Tokenara
Token ID3301
Feature activation-3.67e-01

Pos logit contributions
apy+0.037
light+0.030
 rectangle+0.029
bridge+0.026
 Sorry+0.026
Neg logit contributions
rated-0.073
 old-0.072
 supposed-0.071
 attention-0.071
 way-0.068
Token
Token ID198
Feature activation+2.22e-02

Pos logit contributions
ra+0.001
era+0.001
mm+0.001
iolet+0.001
light+0.001
Neg logit contributions
 supposed-0.001
igators-0.001
 attention-0.001
 valuable-0.001
 having-0.001
Token
Token ID198
Feature activation-5.10e-01

Pos logit contributions
 supposed+0.001
igators+0.001
 attention+0.001
rated+0.001
 valuable+0.001
Neg logit contributions
ra-0.001
era-0.001
mm-0.001
iolet-0.001
light-0.000
Token"
Token ID1
Feature activation-2.66e-01

Pos logit contributions
ra+0.010
mm+0.009
era+0.009
light+0.009
iolet+0.008
Neg logit contributions
igators-0.016
 supposed-0.015
 valuable-0.015
 attention-0.015
lling-0.015
TokenReally
Token ID26392
Feature activation-4.56e+00

Pos logit contributions
ra+0.007
era+0.006
mm+0.006
iolet+0.006
light+0.006
Neg logit contributions
 supposed-0.007
igators-0.007
lling-0.006
 attention-0.006
 valuable-0.006
Token?
Token ID30
Feature activation-1.71e+00

Pos logit contributions
ra+0.089
era+0.079
iolet+0.075
aw+0.074
light+0.073
Neg logit contributions
 supposed-0.139
 attention-0.136
 having-0.132
rated-0.130
 important-0.128
Token Did
Token ID7731
Feature activation-5.46e+00

Pos logit contributions
ra+0.048
era+0.047
aw+0.046
iolet+0.046
ara+0.044
Neg logit contributions
igators-0.034
 having-0.034
 supposed-0.034
 valuable-0.031
 around-0.031
Token you
Token ID345
Feature activation-7.44e-01

Pos logit contributions
ra+0.132
iolet+0.125
era+0.124
light+0.121
ink+0.118
Neg logit contributions
igators-0.139
 attention-0.137
 supposed-0.134
 having-0.133
 way-0.131
Token ever
Token ID1683
Feature activation-2.89e-02

Pos logit contributions
ra+0.021
era+0.020
mm+0.020
iolet+0.019
aw+0.019
Neg logit contributions
 supposed-0.017
igators-0.017
rated-0.016
 attention-0.016
 way-0.016
Token lose
Token ID4425
Feature activation-2.91e+00

Pos logit contributions
ra+0.001
era+0.001
mm+0.001
iolet+0.001
aw+0.001
Neg logit contributions
 supposed-0.001
igators-0.001
 attention-0.001
rated-0.001
lling-0.001
Token your
Token ID534
Feature activation-1.65e-01

Pos logit contributions
ra+0.081
era+0.079
mm+0.077
iolet+0.076
ink+0.073
Neg logit contributions
 attention-0.067
igators-0.063
 supposed-0.063
 around-0.063
 old-0.062
Token boats
Token ID16659
Feature activation-3.64e-02

Pos logit contributions
ra+0.004
mm+0.003
era+0.003
light+0.003
aw+0.003
Neg logit contributions
 supposed-0.005
rated-0.005
igators-0.005
lling-0.004
 causing-0.004
Token.
Token ID13
Feature activation-4.55e-01

Pos logit contributions
ra+0.054
era+0.051
mm+0.050
iolet+0.049
aw+0.048
Neg logit contributions
 supposed-0.045
igators-0.043
rated-0.042
 attention-0.041
lling-0.040
Token We
Token ID775
Feature activation-9.38e-01

Pos logit contributions
ra+0.013
era+0.012
iolet+0.012
ink+0.011
aw+0.011
Neg logit contributions
 having-0.009
 supposed-0.009
 valuable-0.009
igators-0.009
 sizes-0.009
Token helped
Token ID4193
Feature activation-1.44e+00

Pos logit contributions
mm+0.028
iolet+0.027
era+0.027
aw+0.026
ara+0.026
Neg logit contributions
 supposed-0.020
igators-0.018
 having-0.018
rated-0.018
 sizes-0.018
Token each
Token ID1123
Feature activation-2.43e+00

Pos logit contributions
iolet+0.037
mm+0.037
ra+0.037
era+0.036
ink+0.035
Neg logit contributions
 attention-0.034
 supposed-0.033
 way-0.032
igators-0.031
 having-0.031
Token other
Token ID584
Feature activation+7.07e-01

Pos logit contributions
mm+0.105
ra+0.104
era+0.100
ink+0.100
mmm+0.099
Neg logit contributions
 way-0.021
 supposed-0.018
igators-0.018
 attention-0.016
lling-0.015
Token,"
Token ID553
Feature activation-5.46e+00

Pos logit contributions
 supposed+0.017
igators+0.017
rated+0.016
 attention+0.016
lling+0.016
Neg logit contributions
ra-0.019
era-0.018
mm-0.017
iolet-0.017
aw-0.016
Token Lily
Token ID20037
Feature activation-1.69e+00

Pos logit contributions
ra+0.158
ink+0.147
light+0.146
era+0.138
iolet+0.137
Neg logit contributions
 old-0.132
 having-0.130
 attention-0.124
igators-0.122
 way-0.119
Token said
Token ID531
Feature activation-1.75e+00

Pos logit contributions
ra+0.054
mm+0.050
era+0.048
apy+0.047
iolet+0.047
Neg logit contributions
 supposed-0.032
 old-0.031
 having-0.031
 way-0.030
 valuable-0.030
Token.
Token ID13
Feature activation-7.55e-01

Pos logit contributions
ra+0.044
era+0.041
mm+0.041
iolet+0.040
aw+0.039
Neg logit contributions
 supposed-0.045
igators-0.044
 attention-0.043
rated-0.043
lling-0.041
Token\n
Token ID198
Feature activation-4.34e-01

Pos logit contributions
ra+0.024
mm+0.022
era+0.022
light+0.022
ink+0.021
Neg logit contributions
 supposed-0.014
 having-0.014
 old-0.013
 attention-0.013
 valuable-0.013
Token\n
Token ID198
Feature activation-1.12e+00

Pos logit contributions
ra+0.012
era+0.011
mm+0.011
iolet+0.011
light+0.011
Neg logit contributions
 supposed-0.010
 attention-0.009
igators-0.009
 having-0.009
 valuable-0.009
Token nice
Token ID3621
Feature activation-1.54e+00

Pos logit contributions
 supposed+0.024
rated+0.024
igators+0.023
 attention+0.022
lling+0.022
Neg logit contributions
ra-0.040
era-0.038
mm-0.037
aw-0.037
iolet-0.036
Token.
Token ID13
Feature activation-4.27e-02

Pos logit contributions
ra+0.041
mm+0.038
era+0.038
iolet+0.037
ara+0.036
Neg logit contributions
 supposed-0.038
igators-0.037
 attention-0.036
rated-0.035
 way-0.035
Token\n
Token ID198
Feature activation+2.40e-01

Pos logit contributions
ra+0.002
ink+0.001
era+0.001
light+0.001
iolet+0.001
Neg logit contributions
igators-0.001
 old-0.001
 having-0.001
 valuable-0.001
 attention-0.001
Token\n
Token ID198
Feature activation-3.38e-01

Pos logit contributions
 supposed+0.006
igators+0.006
 attention+0.006
rated+0.005
 valuable+0.005
Neg logit contributions
ra-0.006
era-0.006
mm-0.006
iolet-0.006
aw-0.005
Token"
Token ID1
Feature activation-3.80e-01

Pos logit contributions
ra+0.008
mm+0.007
era+0.007
iolet+0.007
light+0.007
Neg logit contributions
igators-0.009
 supposed-0.009
 valuable-0.009
lling-0.008
 having-0.008
TokenDid
Token ID11633
Feature activation-5.45e+00

Pos logit contributions
ra+0.013
ink+0.012
era+0.011
oppy+0.011
mm+0.011
Neg logit contributions
igators-0.007
 sizes-0.006
lling-0.006
 having-0.006
 valuable-0.006
Token you
Token ID345
Feature activation-6.70e-01

Pos logit contributions
ra+0.120
era+0.111
iolet+0.108
mm+0.106
light+0.102
Neg logit contributions
igators-0.154
 supposed-0.153
 attention-0.150
 having-0.145
 way-0.145
Token like
Token ID588
Feature activation-3.07e-01

Pos logit contributions
mm+0.021
ra+0.021
era+0.021
iolet+0.021
ara+0.020
Neg logit contributions
 supposed-0.012
igators-0.012
 having-0.012
 attention-0.011
 around-0.011
Token the
Token ID262
Feature activation-3.94e-01

Pos logit contributions
ra+0.008
mm+0.007
aw+0.007
era+0.007
iolet+0.007
Neg logit contributions
rated-0.008
 supposed-0.008
lling-0.007
 attention-0.007
igators-0.007
Token clown
Token ID25573
Feature activation-3.28e+00

Pos logit contributions
ra+0.010
era+0.009
mm+0.009
iolet+0.009
aw+0.009
Neg logit contributions
 supposed-0.010
igators-0.010
rated-0.010
 attention-0.010
lling-0.009
Token?"
Token ID1701
Feature activation-4.26e+00

Pos logit contributions
ra+0.070
era+0.066
iolet+0.062
mm+0.062
ara+0.058
Neg logit contributions
 supposed-0.095
igators-0.094
 having-0.090
 around-0.088
 way-0.088
Token toys
Token ID14958
Feature activation-9.85e-02

Pos logit contributions
mm+0.076
ra+0.076
era+0.075
iolet+0.074
ara+0.073
Neg logit contributions
 supposed-0.029
 attention-0.027
 around-0.025
igators-0.024
 having-0.024
Token by
Token ID416
Feature activation-7.36e-01

Pos logit contributions
ra+0.003
era+0.002
mm+0.002
iolet+0.002
aw+0.002
Neg logit contributions
 supposed-0.002
igators-0.002
 attention-0.002
rated-0.002
 way-0.002
Token the
Token ID262
Feature activation-1.36e+00

Pos logit contributions
ra+0.020
era+0.018
mm+0.018
iolet+0.017
ara+0.017
Neg logit contributions
 supposed-0.018
igators-0.017
rated-0.017
 attention-0.017
lling-0.016
Token dog
Token ID3290
Feature activation-1.62e+00

Pos logit contributions
ra+0.038
era+0.035
mm+0.034
iolet+0.033
aw+0.033
Neg logit contributions
 supposed-0.031
rated-0.030
igators-0.030
lling-0.029
 attention-0.029
Token stand
Token ID1302
Feature activation+1.27e+00

Pos logit contributions
ra+0.039
era+0.037
mm+0.036
iolet+0.035
aw+0.035
Neg logit contributions
 supposed-0.043
igators-0.041
 attention-0.041
rated-0.041
lling-0.040
Token all
Token ID477
Feature activation+4.37e+00

Pos logit contributions
 supposed+0.031
igators+0.030
 attention+0.029
rated+0.029
lling+0.028
Neg logit contributions
ra-0.033
era-0.031
mm-0.030
aw-0.030
iolet-0.030
Token day
Token ID1110
Feature activation+4.43e-01

Pos logit contributions
 supposed+0.078
 harming+0.068
 valuable+0.066
 attention+0.065
rated+0.062
Neg logit contributions
ra-0.129
aw-0.128
light-0.124
era-0.124
ara-0.123
Token.
Token ID13
Feature activation-8.47e-02

Pos logit contributions
igators+0.011
rated+0.011
 supposed+0.011
 attention+0.011
lling+0.010
Neg logit contributions
ra-0.011
era-0.010
ink-0.010
mm-0.010
aw-0.009
Token<|endoftext|>
Token ID50256
Feature activation+1.28e+00

Pos logit contributions
ra+0.002
era+0.002
mm+0.002
iolet+0.002
aw+0.002
Neg logit contributions
igators-0.002
 supposed-0.002
 having-0.002
 way-0.002
rated-0.002
TokenThere
Token ID1858
Feature activation+9.16e-01

Pos logit contributions
 supposed+0.036
igators+0.035
rated+0.035
 attention+0.035
lling+0.034
Neg logit contributions
ra-0.028
era-0.027
mm-0.026
iolet-0.026
ara-0.025
Token was
Token ID373
Feature activation-1.25e+00

Pos logit contributions
 way+0.021
 supposed+0.021
igators+0.020
 having+0.020
rated+0.019
Neg logit contributions
ra-0.025
era-0.023
mm-0.023
iolet-0.022
mmm-0.022
Token not
Token ID407
Feature activation+1.48e-01

Pos logit contributions
ra+0.011
era+0.011
mm+0.010
iolet+0.010
ara+0.010
Neg logit contributions
 supposed-0.012
igators-0.012
rated-0.011
 attention-0.011
lling-0.011
Token have
Token ID423
Feature activation-2.78e+00

Pos logit contributions
rated+0.004
 supposed+0.004
igators+0.004
lling+0.004
 attention+0.004
Neg logit contributions
ra-0.003
era-0.003
mm-0.003
iolet-0.003
aw-0.003
Token the
Token ID262
Feature activation+3.47e-01

Pos logit contributions
ra+0.065
aw+0.061
era+0.057
mm+0.057
iolet+0.056
Neg logit contributions
rated-0.077
 supposed-0.072
igators-0.072
lling-0.071
 attention-0.067
Token same
Token ID976
Feature activation-1.08e+00

Pos logit contributions
lling+0.009
igators+0.008
rated+0.008
 supposed+0.008
ping+0.008
Neg logit contributions
ra-0.009
era-0.008
mm-0.008
aw-0.008
 ha-0.008
Token things
Token ID1243
Feature activation+1.50e+00

Pos logit contributions
ra+0.030
era+0.026
aw+0.026
mm+0.026
iolet+0.026
Neg logit contributions
 supposed-0.025
igators-0.025
lling-0.024
rated-0.024
 attention-0.023
Token as
Token ID355
Feature activation+4.18e+00

Pos logit contributions
rated+0.043
igators+0.040
 supposed+0.039
lling+0.038
ping+0.037
Neg logit contributions
ra-0.034
aw-0.030
mm-0.030
era-0.029
light-0.028
Token everyone
Token ID2506
Feature activation+1.56e+00

Pos logit contributions
rated+0.110
igators+0.100
lling+0.099
ping+0.091
 supposed+0.090
Neg logit contributions
ra-0.104
mm-0.097
era-0.094
mmm-0.091
aw-0.090
Token else
Token ID2073
Feature activation-2.41e-01

Pos logit contributions
rated+0.043
 attention+0.040
 supposed+0.039
ping+0.038
 winding+0.038
Neg logit contributions
ra-0.035
aw-0.032
era-0.031
light-0.029
ink-0.028
Token.
Token ID13
Feature activation-8.64e-01

Pos logit contributions
ra+0.006
era+0.005
aw+0.005
mm+0.005
light+0.005
Neg logit contributions
rated-0.006
igators-0.006
 supposed-0.006
lling-0.006
 attention-0.006
Token He
Token ID679
Feature activation-1.57e+00

Pos logit contributions
ra+0.021
era+0.020
mm+0.019
iolet+0.019
aw+0.018
Neg logit contributions
 supposed-0.023
igators-0.022
 attention-0.022
rated-0.022
lling-0.021
Token learned
Token ID4499
Feature activation+6.44e-01

Pos logit contributions
ra+0.043
era+0.041
mm+0.041
iolet+0.040
aw+0.039
Neg logit contributions
 supposed-0.036
igators-0.034
rated-0.033
 attention-0.033
 way-0.032
Token "
Token ID366
Feature activation+1.39e+00

Pos logit contributions
igators+0.069
 supposed+0.068
rated+0.066
 attention+0.064
lling+0.063
Neg logit contributions
ra-0.069
era-0.066
mm-0.066
M-0.064
iolet-0.064
TokenNo
Token ID2949
Feature activation-1.49e-01

Pos logit contributions
 supposed+0.030
rated+0.029
igators+0.029
lling+0.028
 valuable+0.028
Neg logit contributions
ra-0.040
mm-0.038
era-0.038
iolet-0.037
ink-0.036
Token way
Token ID835
Feature activation+1.46e+00

Pos logit contributions
ra+0.003
era+0.003
mm+0.003
iolet+0.002
ara+0.002
Neg logit contributions
 supposed-0.005
igators-0.005
rated-0.005
lling-0.005
 attention-0.005
Token!
Token ID0
Feature activation-1.23e-01

Pos logit contributions
 supposed+0.050
igators+0.049
rated+0.048
lling+0.048
 valuable+0.047
Neg logit contributions
ra-0.024
era-0.022
mm-0.019
iolet-0.019
ara-0.019
Token That
Token ID1320
Feature activation+9.27e-01

Pos logit contributions
ra+0.003
era+0.003
mm+0.003
iolet+0.003
aw+0.003
Neg logit contributions
 supposed-0.003
igators-0.003
rated-0.003
 attention-0.003
lling-0.003
Token all
Token ID477
Feature activation+4.49e+00

Pos logit contributions
 supposed+0.023
 attention+0.023
ping+0.021
rated+0.020
lling+0.020
Neg logit contributions
aw-0.021
ra-0.021
era-0.020
mm-0.019
iolet-0.019
Tokenigator
Token ID23823
Feature activation+2.94e-01

Pos logit contributions
 supposed+0.094
 harming+0.086
 valuable+0.082
 attention+0.080
rated+0.080
Neg logit contributions
aw-0.124
ra-0.112
era-0.111
mm-0.110
light-0.110
Token is
Token ID318
Feature activation-2.15e-01

Pos logit contributions
 supposed+0.008
igators+0.007
 attention+0.007
rated+0.007
lling+0.007
Neg logit contributions
ra-0.007
era-0.007
mm-0.007
iolet-0.007
aw-0.006
Token too
Token ID1165
Feature activation+1.45e+00

Pos logit contributions
ra+0.004
?+0.004
 please+0.004
light+0.004
 just+0.004
Neg logit contributions
lling-0.005
rated-0.005
 cultures-0.005
igators-0.005
ping-0.005
Token deep
Token ID2769
Feature activation-4.81e-01

Pos logit contributions
lling+0.030
 having+0.029
 attention+0.029
 supposed+0.028
ping+0.028
Neg logit contributions
aw-0.038
ara-0.037
ink-0.036
ra-0.036
apy-0.035
Token in
Token ID287
Feature activation+4.34e-01

Pos logit contributions
ra+0.010
aw+0.009
light+0.008
era+0.008
ara+0.008
Neg logit contributions
igators-0.014
 supposed-0.013
 valuable-0.013
lling-0.013
rated-0.012
Token He
Token ID679
Feature activation-6.35e-01

Pos logit contributions
igators+0.019
 supposed+0.018
 way+0.018
 having+0.018
 valuable+0.017
Neg logit contributions
ra-0.025
era-0.023
mm-0.022
iolet-0.022
aw-0.021
Token hopped
Token ID45230
Feature activation+2.74e+00

Pos logit contributions
ra+0.018
era+0.017
aw+0.016
iolet+0.016
ara+0.016
Neg logit contributions
 supposed-0.015
 attention-0.015
igators-0.014
rated-0.014
lling-0.013
Token to
Token ID284
Feature activation-6.04e-01

Pos logit contributions
 supposed+0.054
rated+0.050
igators+0.050
 sizes+0.047
lling+0.045
Neg logit contributions
ra-0.086
era-0.081
iolet-0.081
aw-0.079
M-0.078
Token the
Token ID262
Feature activation-3.86e-01

Pos logit contributions
ra+0.015
era+0.015
iolet+0.014
mm+0.014
aw+0.014
Neg logit contributions
 supposed-0.015
igators-0.014
rated-0.014
lling-0.014
 attention-0.013
Token music
Token ID2647
Feature activation+7.13e-01

Pos logit contributions
ra+0.011
era+0.011
mm+0.010
iolet+0.010
aw+0.010
Neg logit contributions
 supposed-0.008
igators-0.008
rated-0.008
 attention-0.008
lling-0.008
Token all
Token ID477
Feature activation+4.25e+00

Pos logit contributions
 supposed+0.014
igators+0.014
 way+0.014
 attention+0.013
 around+0.013
Neg logit contributions
ra-0.022
mm-0.021
era-0.020
iolet-0.020
mmm-0.020
Token day
Token ID1110
Feature activation+4.70e-01

Pos logit contributions
 supposed+0.078
 valuable+0.067
 harming+0.065
 attention+0.065
 tempting+0.064
Neg logit contributions
ra-0.131
aw-0.131
light-0.127
era-0.126
ara-0.125
Token long
Token ID890
Feature activation-1.11e+00

Pos logit contributions
igators+0.013
rated+0.012
 supposed+0.012
 attention+0.012
lling+0.011
Neg logit contributions
ra-0.012
era-0.011
iolet-0.010
aw-0.010
light-0.010
Token,
Token ID11
Feature activation-4.40e-02

Pos logit contributions
ra+0.023
aw+0.021
era+0.021
mm+0.021
iolet+0.020
Neg logit contributions
 supposed-0.033
rated-0.033
igators-0.032
 attention-0.031
lling-0.031
Token until
Token ID1566
Feature activation-4.57e-02

Pos logit contributions
ra+0.001
aw+0.001
light+0.001
era+0.001
room+0.001
Neg logit contributions
igators-0.001
rated-0.001
 supposed-0.001
 attention-0.001
 valuable-0.001
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
ra+0.001
era+0.001
iolet+0.001
mm+0.001
aw+0.001
Neg logit contributions
 way-0.001
 having-0.001
 supposed-0.001
 attention-0.001
 around-0.001
Token face
Token ID1986
Feature activation-1.30e+00

Pos logit contributions
mm+0.012
era+0.012
iolet+0.012
ra+0.012
ara+0.012
Neg logit contributions
 supposed-0.005
 way-0.005
 valuable-0.005
 important-0.005
 old-0.005
Token and
Token ID290
Feature activation+6.84e-01

Pos logit contributions
ra+0.035
mm+0.032
era+0.032
aw+0.032
light+0.031
Neg logit contributions
igators-0.031
 supposed-0.030
rated-0.030
 attention-0.029
lling-0.029
Token two
Token ID734
Feature activation+9.22e-01

Pos logit contributions
 supposed+0.014
 having+0.014
 old+0.014
 way+0.014
 important+0.013
Neg logit contributions
iolet-0.020
ra-0.020
era-0.020
mm-0.019
ara-0.019
Token hands
Token ID2832
Feature activation+5.10e-01

Pos logit contributions
 supposed+0.020
igators+0.019
rated+0.019
 attention+0.018
lling+0.018
Neg logit contributions
ra-0.027
era-0.026
mm-0.025
aw-0.025
iolet-0.025
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 supposed+0.014
igators+0.014
rated+0.014
 attention+0.014
lling+0.013
Neg logit contributions
ra-0.011
era-0.011
mm-0.010
iolet-0.010
aw-0.010
Token Every
Token ID3887
Feature activation+4.32e+00

Pos logit contributions
igators+0.025
 having+0.024
 supposed+0.024
 way+0.023
 old+0.023
Neg logit contributions
ra-0.033
era-0.030
iolet-0.029
mm-0.028
aw-0.028
Token day
Token ID1110
Feature activation+2.34e-01

Pos logit contributions
 supposed+0.043
igators+0.039
rated+0.036
 attention+0.036
lling+0.033
Neg logit contributions
ra-0.178
era-0.171
mm-0.170
iolet-0.167
aw-0.164
Token,
Token ID11
Feature activation+3.62e-01

Pos logit contributions
 way+0.005
igators+0.005
 supposed+0.005
 around+0.005
 having+0.005
Neg logit contributions
ra-0.006
era-0.006
iolet-0.006
mm-0.006
mmm-0.005
Token the
Token ID262
Feature activation+2.06e+00

Pos logit contributions
igators+0.006
 having+0.005
 attention+0.005
 old+0.005
 way+0.005
Neg logit contributions
ra-0.013
era-0.012
iolet-0.012
mm-0.012
ara-0.012
Token clock
Token ID8801
Feature activation-8.83e-02

Pos logit contributions
 supposed+0.058
igators+0.057
rated+0.056
 attention+0.055
lling+0.054
Neg logit contributions
ra-0.046
era-0.043
mm-0.042
iolet-0.041
aw-0.040
Token went
Token ID1816
Feature activation+3.35e+00

Pos logit contributions
ra+0.003
era+0.003
mm+0.003
iolet+0.003
mmm+0.003
Neg logit contributions
 supposed-0.002
igators-0.001
rated-0.001
 attention-0.001
 way-0.001
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
 supposed+0.017
igators+0.016
 attention+0.016
rated+0.016
 way+0.015
Neg logit contributions
ra-0.018
era-0.017
mm-0.017
iolet-0.017
aw-0.016
Token She
Token ID1375
Feature activation+2.31e-01

Pos logit contributions
 supposed+0.043
igators+0.042
 attention+0.041
rated+0.041
lling+0.040
Neg logit contributions
ra-0.032
era-0.030
mm-0.030
iolet-0.029
aw-0.028
Token loved
Token ID6151
Feature activation+2.04e+00

Pos logit contributions
 supposed+0.004
igators+0.004
rated+0.004
 attention+0.004
lling+0.004
Neg logit contributions
ra-0.007
era-0.007
mm-0.007
iolet-0.007
aw-0.007
Token to
Token ID284
Feature activation+7.65e-01

Pos logit contributions
rated+0.055
lling+0.049
igators+0.048
 supposed+0.045
 attention+0.044
Neg logit contributions
ra-0.049
apy-0.047
light-0.046
M-0.045
era-0.045
Token take
Token ID1011
Feature activation+1.63e+00

Pos logit contributions
 supposed+0.014
igators+0.014
 attention+0.014
 way+0.013
rated+0.013
Neg logit contributions
ra-0.025
era-0.024
aw-0.024
mm-0.024
iolet-0.023
Token baths
Token ID49076
Feature activation+3.14e+00

Pos logit contributions
igators+0.032
rated+0.032
 supposed+0.032
lling+0.029
 attention+0.028
Neg logit contributions
ra-0.051
mm-0.047
era-0.047
aw-0.046
iolet-0.046
Token in
Token ID287
Feature activation+1.29e+00

Pos logit contributions
 supposed+0.076
igators+0.073
rated+0.072
 attention+0.070
 valuable+0.067
Neg logit contributions
ra-0.084
era-0.080
mm-0.078
iolet-0.077
aw-0.075
Token her
Token ID607
Feature activation-1.74e+00

Pos logit contributions
 supposed+0.031
rated+0.030
igators+0.030
 attention+0.029
lling+0.029
Neg logit contributions
ra-0.034
era-0.033
mm-0.032
iolet-0.031
ara-0.031
Token bath
Token ID7837
Feature activation+2.00e-01

Pos logit contributions
ra+0.049
era+0.047
mm+0.046
iolet+0.045
aw+0.044
Neg logit contributions
 supposed-0.039
igators-0.037
 attention-0.037
rated-0.036
 way-0.035
Tokentub
Token ID37995
Feature activation+3.82e-01

Pos logit contributions
 supposed+0.005
igators+0.005
 attention+0.004
rated+0.004
 way+0.004
Neg logit contributions
ra-0.005
era-0.005
mm-0.005
iolet-0.005
aw-0.005
Token every
Token ID790
Feature activation+3.34e+00

Pos logit contributions
 supposed+0.010
igators+0.009
rated+0.009
 attention+0.009
lling+0.009
Neg logit contributions
ra-0.010
era-0.009
mm-0.009
iolet-0.009
aw-0.009
Token the
Token ID262
Feature activation+4.91e-01

Pos logit contributions
ra+0.019
era+0.018
mm+0.018
iolet+0.017
aw+0.017
Neg logit contributions
 supposed-0.019
igators-0.018
rated-0.018
 attention-0.018
lling-0.017
Token tuna
Token ID38883
Feature activation+1.09e+00

Pos logit contributions
 supposed+0.013
igators+0.013
rated+0.012
lling+0.011
 attention+0.011
Neg logit contributions
ra-0.012
era-0.012
mm-0.011
iolet-0.011
ara-0.011
Token and
Token ID290
Feature activation-6.75e-02

Pos logit contributions
 supposed+0.027
igators+0.026
rated+0.025
 attention+0.025
lling+0.024
Neg logit contributions
ra-0.028
era-0.027
mm-0.026
iolet-0.026
aw-0.025
Token then
Token ID788
Feature activation-1.51e+00

Pos logit contributions
ra+0.002
era+0.002
mm+0.002
iolet+0.002
ara+0.002
Neg logit contributions
 supposed-0.002
 way-0.002
 attention-0.002
igators-0.002
 having-0.002
Token he
Token ID339
Feature activation-1.25e+00

Pos logit contributions
ra+0.041
mm+0.040
era+0.040
iolet+0.040
mmm+0.038
Neg logit contributions
 supposed-0.034
 way-0.033
 attention-0.033
igators-0.032
 having-0.032
Token walked
Token ID6807
Feature activation+2.86e+00

Pos logit contributions
mm+0.035
ra+0.035
era+0.034
iolet+0.033
mmm+0.033
Neg logit contributions
 way-0.027
 supposed-0.027
 having-0.025
lling-0.025
 around-0.024
Token back
Token ID736
Feature activation-7.87e-01

Pos logit contributions
 supposed+0.042
igators+0.040
 attention+0.039
rated+0.038
lling+0.036
Neg logit contributions
ra-0.104
mm-0.099
era-0.099
iolet-0.097
aw-0.095
Token inside
Token ID2641
Feature activation-8.74e-01

Pos logit contributions
ra+0.024
era+0.022
light+0.021
iolet+0.021
aps+0.021
Neg logit contributions
rated-0.016
igators-0.016
 supposed-0.016
lling-0.015
 sizes-0.015
Token with
Token ID351
Feature activation-4.14e-01

Pos logit contributions
ra+0.025
era+0.024
mm+0.023
iolet+0.023
aw+0.022
Neg logit contributions
 supposed-0.019
igators-0.019
rated-0.018
 attention-0.018
lling-0.018
Token his
Token ID465
Feature activation-1.26e+00

Pos logit contributions
ra+0.012
mm+0.012
era+0.012
iolet+0.011
ara+0.011
Neg logit contributions
 attention-0.009
 supposed-0.009
 way-0.009
 old-0.009
 valuable-0.008
Token mom
Token ID1995
Feature activation-1.94e+00

Pos logit contributions
ra+0.024
era+0.021
mm+0.021
iolet+0.020
ara+0.019
Neg logit contributions
 supposed-0.041
igators-0.040
rated-0.040
lling-0.039
 attention-0.038
Token everyday
Token ID10908
Feature activation+2.65e+00

Pos logit contributions
 supposed+0.013
rated+0.013
igators+0.012
lling+0.012
 attention+0.012
Neg logit contributions
ra-0.009
era-0.008
light-0.008
mm-0.008
iolet-0.008
Token.
Token ID13
Feature activation+2.00e-01

Pos logit contributions
 supposed+0.075
igators+0.075
rated+0.073
 attention+0.072
lling+0.072
Neg logit contributions
ra-0.058
era-0.053
aw-0.051
mm-0.051
light-0.050
Token One
Token ID1881
Feature activation+1.64e+00

Pos logit contributions
igators+0.004
 supposed+0.004
 having+0.004
 way+0.004
 valuable+0.004
Neg logit contributions
ra-0.006
era-0.005
mm-0.005
iolet-0.005
ink-0.005
Token morning
Token ID3329
Feature activation-9.99e-01

Pos logit contributions
 supposed+0.020
lling+0.019
igators+0.018
 attention+0.018
rated+0.018
Neg logit contributions
ra-0.062
era-0.060
mm-0.060
ara-0.059
aw-0.059
Token,
Token ID11
Feature activation-1.46e-02

Pos logit contributions
ra+0.028
era+0.027
mm+0.027
iolet+0.025
ara+0.024
Neg logit contributions
 way-0.022
 around-0.021
 having-0.021
 supposed-0.021
 causing-0.021
Token the
Token ID262
Feature activation+1.70e+00

Pos logit contributions
ra+0.000
mm+0.000
era+0.000
iolet+0.000
ara+0.000
Neg logit contributions
 supposed-0.000
igators-0.000
 attention-0.000
 having-0.000
 way-0.000
Token mall
Token ID17374
Feature activation+1.42e+00

Pos logit contributions
 supposed+0.040
rated+0.039
igators+0.039
lling+0.036
 attention+0.036
Neg logit contributions
ra-0.047
era-0.044
mm-0.043
aw-0.042
ara-0.042
Token closed
Token ID4838
Feature activation+1.86e+00

Pos logit contributions
 supposed+0.035
igators+0.033
rated+0.033
 attention+0.032
 way+0.032
Neg logit contributions
ra-0.037
mm-0.035
era-0.035
iolet-0.034
aw-0.032
Token its
Token ID663
Feature activation-3.01e-02

Pos logit contributions
 supposed+0.036
igators+0.035
 attention+0.035
rated+0.035
 way+0.033
Neg logit contributions
ra-0.057
era-0.054
mm-0.054
iolet-0.053
aw-0.051
Token doors
Token ID8215
Feature activation-1.23e-01

Pos logit contributions
ra+0.001
era+0.001
mm+0.001
iolet+0.001
aw+0.001
Neg logit contributions
 supposed-0.001
igators-0.001
rated-0.001
 attention-0.001
lling-0.001
Token to
Token ID284
Feature activation-5.99e-01

Pos logit contributions
ra+0.003
era+0.003
light+0.003
mm+0.003
aw+0.003
Neg logit contributions
igators-0.003
 supposed-0.003
rated-0.003
lling-0.003
 attention-0.003
Token delicious
Token ID12625
Feature activation-1.44e+00

Pos logit contributions
 supposed+0.005
igators+0.005
rated+0.005
 attention+0.005
lling+0.005
Neg logit contributions
ra-0.007
mm-0.007
era-0.007
iolet-0.006
aw-0.006
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
ra+0.036
era+0.034
mm+0.034
iolet+0.033
ara+0.032
Neg logit contributions
 supposed-0.037
igators-0.035
 attention-0.035
rated-0.034
 way-0.033
Token Lily
Token ID20037
Feature activation+2.51e-01

Pos logit contributions
 supposed+0.000
igators+0.000
rated+0.000
 attention+0.000
 way+0.000
Neg logit contributions
ra-0.000
era-0.000
mm-0.000
iolet-0.000
aw-0.000
Token was
Token ID373
Feature activation-8.19e-01

Pos logit contributions
 way+0.005
 having+0.005
 supposed+0.005
lling+0.004
 around+0.004
Neg logit contributions
ra-0.008
mm-0.007
era-0.007
iolet-0.007
mmm-0.007
Token happy
Token ID3772
Feature activation-3.09e+00

Pos logit contributions
ra+0.026
era+0.026
mm+0.025
iolet+0.024
aw+0.024
Neg logit contributions
igators-0.015
rated-0.015
 attention-0.014
 supposed-0.013
lling-0.013
Token too
Token ID1165
Feature activation+1.79e+00

Pos logit contributions
ra+0.103
era+0.096
mm+0.096
iolet+0.094
aw+0.092
Neg logit contributions
 supposed-0.057
 having-0.054
 attention-0.053
 way-0.051
 around-0.050
Token because
Token ID780
Feature activation+3.85e+00

Pos logit contributions
igators+0.049
 supposed+0.046
rated+0.046
 attention+0.045
lling+0.044
Neg logit contributions
ra-0.042
era-0.041
mm-0.041
iolet-0.039
aw-0.038
Token she
Token ID673
Feature activation+8.73e-01

Pos logit contributions
 supposed+0.107
igators+0.103
rated+0.101
 attention+0.101
lling+0.098
Neg logit contributions
ra-0.089
era-0.083
mm-0.082
iolet-0.080
aw-0.077
Token made
Token ID925
Feature activation-6.87e-01

Pos logit contributions
 supposed+0.019
igators+0.016
lling+0.016
rated+0.016
 valuable+0.016
Neg logit contributions
ra-0.026
era-0.025
mm-0.025
iolet-0.025
aw-0.024
Token her
Token ID607
Feature activation+4.93e-01

Pos logit contributions
ra+0.019
era+0.018
mm+0.018
iolet+0.017
ara+0.017
Neg logit contributions
 supposed-0.016
 attention-0.015
igators-0.015
rated-0.014
 valuable-0.014
Token brother
Token ID3956
Feature activation-9.94e-01

Pos logit contributions
igators+0.016
 supposed+0.016
rated+0.016
lling+0.016
 causing+0.015
Neg logit contributions
ra-0.009
mm-0.008
era-0.008
iolet-0.007
ara-0.007
Token inside
Token ID2641
Feature activation+9.11e-01

Pos logit contributions
ra+0.001
era+0.001
mm+0.001
iolet+0.001
aw+0.001
Neg logit contributions
igators-0.001
 supposed-0.001
rated-0.001
 attention-0.000
lling-0.000
Token.
Token ID13
Feature activation-2.18e-01

Pos logit contributions
rated+0.026
igators+0.025
 supposed+0.025
 attention+0.023
lling+0.023
Neg logit contributions
ra-0.020
era-0.018
light-0.018
aw-0.017
iolet-0.017
Token He
Token ID679
Feature activation-1.33e+00

Pos logit contributions
ra+0.006
era+0.005
mm+0.005
iolet+0.005
aw+0.005
Neg logit contributions
 supposed-0.005
igators-0.005
 attention-0.005
rated-0.005
 valuable-0.005
Token knew
Token ID2993
Feature activation-1.83e-01

Pos logit contributions
ra+0.038
mm+0.037
era+0.036
iolet+0.036
mmm+0.034
Neg logit contributions
 supposed-0.029
igators-0.028
rated-0.027
 way-0.027
 valuable-0.027
Token that
Token ID326
Feature activation+1.11e+00

Pos logit contributions
ra+0.004
era+0.004
mm+0.004
iolet+0.004
aw+0.004
Neg logit contributions
 supposed-0.005
igators-0.005
rated-0.005
 attention-0.005
lling-0.005
Token he
Token ID339
Feature activation+9.93e-01

Pos logit contributions
 supposed+0.030
rated+0.030
igators+0.030
lling+0.028
 attention+0.028
Neg logit contributions
ra-0.025
mm-0.023
era-0.023
light-0.023
aw-0.021
Token had
Token ID550
Feature activation-1.60e+00

Pos logit contributions
 attention+0.023
igators+0.023
 supposed+0.023
rated+0.022
lling+0.021
Neg logit contributions
ra-0.027
era-0.024
mm-0.023
aw-0.023
light-0.023
Token managed
Token ID5257
Feature activation-4.23e-01

Pos logit contributions
ra+0.044
era+0.041
iolet+0.040
mm+0.040
aw+0.039
Neg logit contributions
igators-0.037
 supposed-0.036
rated-0.036
 attention-0.034
lling-0.034
Token to
Token ID284
Feature activation-8.27e-01

Pos logit contributions
ra+0.010
aw+0.010
era+0.010
iolet+0.009
ara+0.009
Neg logit contributions
igators-0.010
rated-0.010
 sizes-0.009
lling-0.009
 supposed-0.009
Token help
Token ID1037
Feature activation-2.96e-01

Pos logit contributions
ra+0.023
era+0.021
mm+0.021
iolet+0.021
aw+0.020
Neg logit contributions
 supposed-0.019
igators-0.019
rated-0.018
 attention-0.018
lling-0.018
Token the
Token ID262
Feature activation+2.75e-01

Pos logit contributions
ra+0.007
era+0.007
mm+0.007
iolet+0.007
aw+0.007
Neg logit contributions
 supposed-0.007
rated-0.007
igators-0.007
lling-0.007
 valuable-0.007
Token hugged
Token ID41130
Feature activation+3.45e-02

Pos logit contributions
igators+0.001
 way+0.001
 around+0.001
lling+0.001
 having+0.001
Neg logit contributions
mm-0.002
mmm-0.002
ra-0.002
mmmm-0.002
aps-0.002
Token and
Token ID290
Feature activation-4.33e-01

Pos logit contributions
 supposed+0.001
igators+0.001
rated+0.001
 attention+0.001
lling+0.001
Neg logit contributions
ra-0.001
era-0.001
mm-0.001
iolet-0.001
aw-0.001
Token said
Token ID531
Feature activation+3.93e-01

Pos logit contributions
ra+0.011
mm+0.010
era+0.010
iolet+0.010
mmm+0.010
Neg logit contributions
 supposed-0.011
igators-0.011
 attention-0.011
 way-0.010
rated-0.010
Token goodbye
Token ID24829
Feature activation-1.35e+00

Pos logit contributions
 supposed+0.010
igators+0.010
 attention+0.010
rated+0.009
lling+0.009
Neg logit contributions
ra-0.010
era-0.009
mm-0.009
iolet-0.009
ara-0.009
Token.
Token ID13
Feature activation+6.08e-01

Pos logit contributions
ra+0.037
era+0.035
mm+0.034
iolet+0.033
aw+0.033
Neg logit contributions
 supposed-0.032
igators-0.031
rated-0.030
 attention-0.029
lling-0.029
Token Ben
Token ID3932
Feature activation+3.86e-01

Pos logit contributions
 supposed+0.012
igators+0.012
 attention+0.012
 having+0.011
 valuable+0.011
Neg logit contributions
ra-0.018
era-0.017
mm-0.017
iolet-0.017
ink-0.016
Token went
Token ID1816
Feature activation+2.47e+00

Pos logit contributions
 supposed+0.009
 way+0.008
igators+0.008
rated+0.008
lling+0.008
Neg logit contributions
ra-0.011
mm-0.010
era-0.010
iolet-0.010
mmm-0.010
Token home
Token ID1363
Feature activation+4.55e-01

Pos logit contributions
rated+0.056
 sizes+0.054
 supposed+0.052
igators+0.052
lling+0.050
Neg logit contributions
ra-0.069
era-0.068
iolet-0.067
aw-0.067
ara-0.066
Token with
Token ID351
Feature activation+8.02e-03

Pos logit contributions
igators+0.014
 supposed+0.013
rated+0.013
lling+0.012
 attention+0.012
Neg logit contributions
ra-0.010
era-0.009
iolet-0.009
mm-0.008
light-0.008
Token his
Token ID465
Feature activation-2.31e-01

Pos logit contributions
 supposed+0.000
 attention+0.000
igators+0.000
 way+0.000
 old+0.000
Neg logit contributions
ra-0.000
era-0.000
mm-0.000
iolet-0.000
mmm-0.000
Token mom
Token ID1995
Feature activation-1.22e+00

Pos logit contributions
ra+0.006
mm+0.005
era+0.005
iolet+0.005
ara+0.005
Neg logit contributions
rated-0.006
 supposed-0.006
lling-0.006
igators-0.006
 causing-0.006
Token exam
Token ID2814
Feature activation-1.14e+00

Pos logit contributions
igators+0.016
 supposed+0.015
rated+0.015
lling+0.014
 attention+0.014
Neg logit contributions
ra-0.025
era-0.024
mm-0.023
iolet-0.023
aw-0.022
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
ra+0.029
era+0.027
mm+0.027
iolet+0.026
aw+0.025
Neg logit contributions
 supposed-0.029
igators-0.028
 attention-0.027
rated-0.027
lling-0.026
Token (single space)
Token ID220
Feature activation-7.57e-01

Pos logit contributions
ra+0.003
era+0.003
mm+0.003
iolet+0.003
ara+0.003
Neg logit contributions
 supposed-0.003
igators-0.003
rated-0.003
 attention-0.003
lling-0.003
Token\n
Token ID198
Feature activation+3.16e-01

Pos logit contributions
ra+0.017
era+0.016
mm+0.016
iolet+0.016
aw+0.015
Neg logit contributions
 supposed-0.021
igators-0.020
rated-0.020
 attention-0.020
lling-0.020
Token\n
Token ID198
Feature activation-9.37e-02

Pos logit contributions
 supposed+0.009
igators+0.009
 attention+0.008
rated+0.008
lling+0.008
Neg logit contributions
ra-0.007
era-0.007
mm-0.007
iolet-0.007
ara-0.006
TokenL
Token ID43
Feature activation-1.42e-01

Pos logit contributions
ra+0.002
era+0.002
mm+0.002
iolet+0.002
aw+0.002
Neg logit contributions
 supposed-0.003
 valuable-0.003
igators-0.003
lling-0.003
rated-0.003
Tokenily
Token ID813
Feature activation-4.13e-01

Pos logit contributions
ra+0.003
era+0.003
mm+0.003
iolet+0.003
light+0.003
Neg logit contributions
 supposed-0.004
igators-0.004
rated-0.004
 attention-0.004
lling-0.004
Token started
Token ID2067
Feature activation+1.92e-01

Pos logit contributions
ra+0.011
mm+0.011
era+0.011
iolet+0.011
mmm+0.010
Neg logit contributions
 supposed-0.009
igators-0.009
rated-0.009
 way-0.009
lling-0.009
Token to
Token ID284
Feature activation-7.98e-01

Pos logit contributions
rated+0.005
igators+0.005
lling+0.004
 supposed+0.004
 attention+0.004
Neg logit contributions
ra-0.005
era-0.005
iolet-0.005
apy-0.005
 ha-0.005
Token feel
Token ID1254
Feature activation+4.66e-01

Pos logit contributions
ra+0.021
era+0.020
mm+0.020
iolet+0.019
ara+0.019
Neg logit contributions
 supposed-0.019
igators-0.019
rated-0.018
 attention-0.018
lling-0.017
Token better
Token ID1365
Feature activation-1.87e+00

Pos logit contributions
 supposed+0.009
rated+0.009
igators+0.009
lling+0.008
 attention+0.008
Neg logit contributions
ra-0.014
era-0.013
aw-0.013
mm-0.013
apy-0.013
Token<|endoftext|>
Token ID50256
Feature activation+2.05e+00

Pos logit contributions
ra+0.011
era+0.010
mm+0.010
iolet+0.010
aw+0.010
Neg logit contributions
 supposed-0.009
igators-0.009
 valuable-0.008
 way-0.008
 having-0.008
TokenOnce
Token ID7454
Feature activation+1.31e+00

Pos logit contributions
 supposed+0.060
igators+0.059
 attention+0.058
rated+0.058
lling+0.056
Neg logit contributions
ra-0.043
era-0.041
iolet-0.040
mm-0.040
ara-0.039
Token upon
Token ID2402
Feature activation+7.50e-02

Pos logit contributions
 supposed+0.034
igators+0.032
rated+0.032
lling+0.032
 attention+0.031
Neg logit contributions
ra-0.032
era-0.031
mm-0.030
ara-0.030
iolet-0.029
Token a
Token ID257
Feature activation-8.79e-01

Pos logit contributions
 supposed+0.002
 having+0.002
igators+0.002
 attention+0.002
 valuable+0.002
Neg logit contributions
ra-0.002
mm-0.002
era-0.002
iolet-0.002
ink-0.002
Token time
Token ID640
Feature activation-1.08e+00

Pos logit contributions
ra+0.031
era+0.029
mm+0.029
iolet+0.029
aw+0.028
Neg logit contributions
 supposed-0.014
igators-0.013
rated-0.013
 attention-0.012
 way-0.012
Token,
Token ID11
Feature activation-2.13e-02

Pos logit contributions
ra+0.021
era+0.019
mm+0.019
iolet+0.018
ara+0.018
Neg logit contributions
 supposed-0.034
igators-0.033
rated-0.033
 attention-0.033
lling-0.032
Token there
Token ID612
Feature activation+9.75e-01

Pos logit contributions
ra+0.001
era+0.001
mm+0.001
iolet+0.001
ara+0.001
Neg logit contributions
 supposed-0.001
 attention-0.000
igators-0.000
rated-0.000
lling-0.000
Token was
Token ID373
Feature activation-1.47e+00

Pos logit contributions
 supposed+0.022
igators+0.021
 way+0.021
rated+0.021
 attention+0.020
Neg logit contributions
ra-0.027
era-0.025
mm-0.025
iolet-0.025
aw-0.024
Token a
Token ID257
Feature activation-3.55e-01

Pos logit contributions
ra+0.037
era+0.035
mm+0.035
iolet+0.034
ara+0.032
Neg logit contributions
 supposed-0.038
igators-0.036
 having-0.035
 attention-0.035
 valuable-0.034
Token little
Token ID1310
Feature activation-1.85e-01

Pos logit contributions
ra+0.012
era+0.012
mm+0.012
iolet+0.011
aw+0.011
Neg logit contributions
 supposed-0.006
igators-0.005
 attention-0.005
rated-0.005
 valuable-0.005
Token girl
Token ID2576
Feature activation-2.54e+00

Pos logit contributions
ra+0.006
era+0.006
mm+0.006
iolet+0.005
mmm+0.005
Neg logit contributions
igators-0.003
 supposed-0.003
 attention-0.003
 way-0.003
rated-0.003
Token know
Token ID760
Feature activation-7.74e-01

Pos logit contributions
mm+0.028
ra+0.028
era+0.027
aw+0.027
iolet+0.027
Neg logit contributions
 supposed-0.019
 valuable-0.016
 way-0.016
rated-0.016
 important-0.016
Token you
Token ID345
Feature activation+1.97e+00

Pos logit contributions
ra+0.022
era+0.020
mm+0.020
iolet+0.020
ara+0.019
Neg logit contributions
 supposed-0.018
igators-0.017
 attention-0.017
 having-0.016
 way-0.016
Token like
Token ID588
Feature activation+7.53e-01

Pos logit contributions
 supposed+0.048
igators+0.045
 attention+0.044
rated+0.044
lling+0.043
Neg logit contributions
ra-0.053
era-0.050
mm-0.050
iolet-0.048
aw-0.047
Token Patch
Token ID17106
Feature activation+8.04e-01

Pos logit contributions
rated+0.024
lling+0.022
 supposed+0.021
igators+0.021
 attention+0.020
Neg logit contributions
ra-0.015
mm-0.014
aw-0.014
era-0.014
iolet-0.013
Token.
Token ID13
Feature activation+8.57e-01

Pos logit contributions
 supposed+0.022
igators+0.022
rated+0.021
 attention+0.021
lling+0.021
Neg logit contributions
ra-0.018
era-0.017
mm-0.017
iolet-0.017
aw-0.016
Token Would
Token ID10928
Feature activation-1.31e+00

Pos logit contributions
igators+0.020
 supposed+0.020
 attention+0.019
rated+0.019
 having+0.019
Neg logit contributions
ra-0.023
era-0.022
iolet-0.021
mm-0.021
aw-0.021
Token you
Token ID345
Feature activation+1.48e+00

Pos logit contributions
ra+0.027
era+0.025
mm+0.024
iolet+0.024
aw+0.023
Neg logit contributions
 supposed-0.040
igators-0.038
 attention-0.038
rated-0.037
 valuable-0.037
Token like
Token ID588
Feature activation+8.91e-02

Pos logit contributions
 supposed+0.027
 attention+0.026
rated+0.024
igators+0.024
 valuable+0.024
Neg logit contributions
ra-0.049
mm-0.047
era-0.047
iolet-0.045
aw-0.045
Token to
Token ID284
Feature activation-4.27e-01

Pos logit contributions
 supposed+0.001
igators+0.001
 attention+0.001
 way+0.001
 having+0.001
Neg logit contributions
ra-0.003
era-0.003
mm-0.003
iolet-0.003
ara-0.003
Token play
Token ID711
Feature activation-1.99e-01

Pos logit contributions
aw+0.015
ra+0.015
mm+0.015
era+0.015
iolet+0.014
Neg logit contributions
 attention-0.006
 way-0.006
 supposed-0.006
 having-0.006
igators-0.005
Token with
Token ID351
Feature activation-2.69e-01

Pos logit contributions
ra+0.006
era+0.006
mm+0.006
light+0.005
ara+0.005
Neg logit contributions
 around-0.004
 having-0.004
igators-0.004
 supposed-0.004
 old-0.004
TokenOn
Token ID2202
Feature activation+9.49e-01

Pos logit contributions
 supposed+0.007
 valuable+0.007
igators+0.006
lling+0.006
 important+0.006
Neg logit contributions
ra-0.007
mm-0.007
era-0.006
light-0.006
ink-0.006
Tokena
Token ID64
Feature activation-1.54e+00

Pos logit contributions
 supposed+0.017
igators+0.016
 way+0.016
 attention+0.016
 having+0.015
Neg logit contributions
ra-0.031
mm-0.029
era-0.029
iolet-0.029
aw-0.028
Token and
Token ID290
Feature activation+3.81e-01

Pos logit contributions
ra+0.041
era+0.040
mm+0.040
iolet+0.038
ara+0.037
Neg logit contributions
 supposed-0.036
igators-0.035
rated-0.034
 attention-0.034
 way-0.033
Token her
Token ID607
Feature activation+2.93e-01

Pos logit contributions
 supposed+0.009
igators+0.009
 attention+0.009
rated+0.008
 way+0.008
Neg logit contributions
ra-0.010
mm-0.010
era-0.010
iolet-0.009
aw-0.009
Token friends
Token ID2460
Feature activation-1.68e-01

Pos logit contributions
 supposed+0.008
igators+0.008
rated+0.008
lling+0.007
 attention+0.007
Neg logit contributions
ra-0.007
era-0.007
mm-0.006
iolet-0.006
aw-0.006
Token had
Token ID550
Feature activation-1.83e+00

Pos logit contributions
ra+0.005
mm+0.005
era+0.005
iolet+0.004
mmm+0.004
Neg logit contributions
 supposed-0.004
igators-0.004
 way-0.003
rated-0.003
 attention-0.003
Token so
Token ID523
Feature activation+2.94e-03

Pos logit contributions
ra+0.047
era+0.045
aw+0.044
iolet+0.044
mm+0.043
Neg logit contributions
rated-0.044
 supposed-0.043
igators-0.043
 attention-0.042
lling-0.041
Token much
Token ID881
Feature activation+6.81e-01

Pos logit contributions
rated+0.000
igators+0.000
 supposed+0.000
 attention+0.000
lling+0.000
Neg logit contributions
ra-0.000
mm-0.000
era-0.000
aw-0.000
iolet-0.000
Token fun
Token ID1257
Feature activation+8.45e-01

Pos logit contributions
rated+0.013
igators+0.012
 supposed+0.012
lling+0.011
 attention+0.011
Neg logit contributions
ra-0.021
mm-0.021
era-0.021
aw-0.020
iolet-0.020
Token at
Token ID379
Feature activation-5.19e-01

Pos logit contributions
 supposed+0.023
igators+0.022
rated+0.022
 attention+0.021
lling+0.021
Neg logit contributions
ra-0.020
era-0.019
mm-0.019
iolet-0.018
aw-0.018
Token the
Token ID262
Feature activation-3.11e-01

Pos logit contributions
ra+0.013
era+0.013
iolet+0.012
mm+0.012
room+0.011
Neg logit contributions
rated-0.013
igators-0.013
 supposed-0.013
 attention-0.012
lling-0.012
Token.
Token ID13
Feature activation+2.44e-01

Pos logit contributions
ra+0.012
era+0.011
mm+0.010
aw+0.010
light+0.010
Neg logit contributions
igators-0.014
 supposed-0.014
rated-0.014
lling-0.014
 attention-0.013
Token She
Token ID1375
Feature activation-1.87e+00

Pos logit contributions
 having+0.004
 supposed+0.004
 old+0.004
 valuable+0.004
 attention+0.004
Neg logit contributions
ra-0.008
ink-0.007
light-0.007
int-0.007
mm-0.007
Token felt
Token ID2936
Feature activation-1.16e-01

Pos logit contributions
mm+0.063
mmm+0.062
mmmm+0.059
iolet+0.059
ap+0.057
Neg logit contributions
 old-0.032
 way-0.031
 supposed-0.031
 around-0.029
 having-0.029
Token worried
Token ID7960
Feature activation-9.48e-01

Pos logit contributions
ra+0.003
era+0.003
aw+0.003
mm+0.003
iolet+0.003
Neg logit contributions
 supposed-0.002
rated-0.002
igators-0.002
lling-0.002
 attention-0.002
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
ra+0.023
era+0.022
mm+0.022
aw+0.021
iolet+0.021
Neg logit contributions
 supposed-0.025
igators-0.024
rated-0.024
 attention-0.023
lling-0.023
Token angry
Token ID7954
Feature activation-2.98e+00

Pos logit contributions
igators+0.028
 supposed+0.027
rated+0.027
 sizes+0.027
lling+0.026
Neg logit contributions
ra-0.037
aw-0.034
era-0.032
apy-0.032
 ha-0.031
Token.
Token ID13
Feature activation-1.26e-01

Pos logit contributions
ra+0.083
mm+0.079
era+0.078
iolet+0.076
mmm+0.074
Neg logit contributions
 supposed-0.068
 attention-0.067
igators-0.063
 having-0.063
rated-0.062
Token
Token ID198
Feature activation-6.59e-03

Pos logit contributions
ra+0.004
mm+0.003
light+0.003
ink+0.003
era+0.003
Neg logit contributions
 supposed-0.003
 having-0.003
 attention-0.003
 old-0.003
 way-0.003
Token
Token ID198
Feature activation-9.13e-01

Pos logit contributions
ra+0.000
era+0.000
mm+0.000
iolet+0.000
aw+0.000
Neg logit contributions
 supposed-0.000
igators-0.000
 attention-0.000
rated-0.000
 valuable-0.000
TokenShe
Token ID3347
Feature activation-2.27e+00

Pos logit contributions
ra+0.022
light+0.019
mm+0.019
era+0.019
iolet+0.019
Neg logit contributions
 supposed-0.023
 having-0.022
 old-0.022
 winding-0.022
 valuable-0.022
Token knocked
Token ID13642
Feature activation-1.79e+00

Pos logit contributions
mm+0.075
mmm+0.072
iolet+0.069
mmmm+0.068
ra+0.068
Neg logit contributions
 old-0.041
 way-0.040
 supposed-0.040
 causing-0.039
 around-0.038
Token "
Token ID366
Feature activation-9.33e-02

Pos logit contributions
igators+0.010
 supposed+0.010
rated+0.010
 attention+0.009
lling+0.009
Neg logit contributions
ra-0.013
era-0.012
mm-0.012
iolet-0.011
aw-0.011
TokenLet
Token ID5756
Feature activation-6.88e-01

Pos logit contributions
mm+0.002
ra+0.002
era+0.002
iolet+0.002
ink+0.002
Neg logit contributions
 supposed-0.002
lling-0.002
rated-0.002
igators-0.002
 causing-0.002
Token me
Token ID502
Feature activation-2.49e-01

Pos logit contributions
ra+0.017
era+0.016
mm+0.016
iolet+0.015
aw+0.015
Neg logit contributions
 supposed-0.018
igators-0.018
 attention-0.017
rated-0.017
lling-0.016
Token go
Token ID467
Feature activation+1.62e+00

Pos logit contributions
ra+0.007
mm+0.007
era+0.007
iolet+0.007
aw+0.006
Neg logit contributions
 supposed-0.006
 attention-0.005
igators-0.005
 way-0.005
rated-0.005
Token!
Token ID0
Feature activation-7.58e-01

Pos logit contributions
 supposed+0.038
igators+0.037
rated+0.036
lling+0.035
 attention+0.035
Neg logit contributions
ra-0.044
era-0.041
mm-0.040
iolet-0.040
ara-0.039
Token Please
Token ID4222
Feature activation-2.92e+00

Pos logit contributions
ra+0.018
era+0.017
mm+0.017
aw+0.017
iolet+0.017
Neg logit contributions
 supposed-0.019
igators-0.019
 attention-0.018
 having-0.018
rated-0.018
Token,
Token ID11
Feature activation-8.65e-01

Pos logit contributions
ra+0.077
era+0.074
mm+0.074
iolet+0.072
aw+0.071
Neg logit contributions
 supposed-0.069
 attention-0.066
igators-0.066
rated-0.065
 way-0.064
Token I
Token ID314
Feature activation-1.81e+00

Pos logit contributions
ra+0.020
era+0.019
mm+0.019
iolet+0.019
aw+0.018
Neg logit contributions
 supposed-0.023
igators-0.022
 attention-0.022
rated-0.022
 way-0.022
Token want
Token ID765
Feature activation-1.00e+00

Pos logit contributions
ra+0.050
mm+0.049
era+0.048
iolet+0.047
aw+0.046
Neg logit contributions
 supposed-0.041
igators-0.039
rated-0.039
 attention-0.038
 way-0.038
Token to
Token ID284
Feature activation-4.46e-01

Pos logit contributions
ra+0.034
era+0.032
mm+0.032
iolet+0.031
aw+0.031
Neg logit contributions
 supposed-0.017
igators-0.016
rated-0.016
 attention-0.016
lling-0.015
Token go
Token ID467
Feature activation+1.58e+00

Pos logit contributions
iolet+0.013
mm+0.013
era+0.013
aw+0.013
ra+0.013
Neg logit contributions
 supposed-0.009
 way-0.009
 attention-0.008
igators-0.008
 having-0.008
Token finished
Token ID5201
Feature activation-2.06e+00

Pos logit contributions
ra+0.014
mm+0.014
era+0.013
iolet+0.013
aw+0.013
Neg logit contributions
 supposed-0.012
igators-0.012
 way-0.011
 having-0.011
lling-0.011
Token,
Token ID11
Feature activation-1.38e+00

Pos logit contributions
ra+0.045
era+0.043
mm+0.042
iolet+0.041
aw+0.040
Neg logit contributions
 supposed-0.059
igators-0.057
 attention-0.056
rated-0.055
lling-0.055
Token they
Token ID484
Feature activation-1.91e-01

Pos logit contributions
ra+0.040
era+0.038
mm+0.037
iolet+0.037
aw+0.035
Neg logit contributions
 supposed-0.030
igators-0.029
 attention-0.029
rated-0.028
 way-0.028
Token were
Token ID547
Feature activation-1.56e+00

Pos logit contributions
mm+0.006
ra+0.006
iolet+0.006
era+0.006
mmm+0.005
Neg logit contributions
 supposed-0.004
igators-0.004
 around-0.004
lling-0.004
 having-0.004
Token very
Token ID845
Feature activation+1.41e+00

Pos logit contributions
ra+0.047
era+0.045
mm+0.045
iolet+0.044
aw+0.043
Neg logit contributions
 supposed-0.032
igators-0.031
 attention-0.030
rated-0.030
lling-0.029
Token proud
Token ID6613
Feature activation-2.86e+00

Pos logit contributions
 supposed+0.021
igators+0.019
 attention+0.018
rated+0.018
 valuable+0.017
Neg logit contributions
ra-0.052
era-0.050
mm-0.049
iolet-0.049
ara-0.048
Token of
Token ID286
Feature activation+9.44e-01

Pos logit contributions
ra+0.084
mm+0.078
era+0.078
iolet+0.076
ara+0.073
Neg logit contributions
 supposed-0.064
igators-0.060
 attention-0.060
 having-0.058
 valuable-0.057
Token their
Token ID511
Feature activation+1.62e+00

Pos logit contributions
 supposed+0.023
rated+0.022
igators+0.022
 attention+0.021
lling+0.021
Neg logit contributions
ra-0.026
era-0.024
mm-0.024
iolet-0.023
aw-0.023
Token work
Token ID670
Feature activation+2.74e+00

Pos logit contributions
 supposed+0.039
igators+0.039
rated+0.037
lling+0.037
 causing+0.035
Neg logit contributions
ra-0.044
era-0.040
mm-0.040
light-0.038
aw-0.038
Token.
Token ID13
Feature activation-1.87e-01

Pos logit contributions
 supposed+0.075
igators+0.074
rated+0.073
 attention+0.071
lling+0.069
Neg logit contributions
ra-0.062
era-0.059
mm-0.056
iolet-0.056
aw-0.055
Token
Token ID198
Feature activation+5.75e-02

Pos logit contributions
ra+0.005
mm+0.004
era+0.004
iolet+0.004
ara+0.004
Neg logit contributions
 supposed-0.005
igators-0.005
rated-0.005
 attention-0.005
lling-0.005
Token They
Token ID1119
Feature activation+7.17e-01

Pos logit contributions
 supposed+0.019
igators+0.018
rated+0.018
 attention+0.018
lling+0.018
Neg logit contributions
ra-0.016
era-0.015
mm-0.015
iolet-0.015
ara-0.014
Token put
Token ID1234
Feature activation+4.20e-01

Pos logit contributions
 supposed+0.013
 having+0.012
igators+0.012
 around+0.012
 sizes+0.012
Neg logit contributions
mm-0.024
mmm-0.023
iolet-0.023
aw-0.022
ra-0.022
Token each
Token ID1123
Feature activation+1.57e-01

Pos logit contributions
 supposed+0.010
igators+0.009
rated+0.009
 attention+0.009
lling+0.009
Neg logit contributions
ra-0.012
era-0.011
mm-0.011
iolet-0.011
aw-0.011
Token block
Token ID2512
Feature activation+4.02e-01

Pos logit contributions
 supposed+0.004
igators+0.004
 attention+0.004
rated+0.004
 way+0.004
Neg logit contributions
ra-0.004
era-0.004
mm-0.004
iolet-0.004
aw-0.003
Token in
Token ID287
Feature activation+5.34e-01

Pos logit contributions
igators+0.007
 attention+0.007
 supposed+0.007
 around+0.006
 way+0.006
Neg logit contributions
ra-0.013
mm-0.013
era-0.013
iolet-0.012
mmm-0.012
Token a
Token ID257
Feature activation-3.17e+00

Pos logit contributions
 way+0.010
 having+0.010
 old+0.010
 sizes+0.010
 attention+0.010
Neg logit contributions
mm-0.016
ra-0.016
int-0.016
aw-0.016
era-0.016
Token box
Token ID3091
Feature activation-1.53e+00

Pos logit contributions
mm+0.100
ra+0.100
era+0.098
iolet+0.095
mmm+0.094
Neg logit contributions
 way-0.060
 supposed-0.058
igators-0.057
 valuable-0.057
 old-0.056
Token that
Token ID326
Feature activation-1.78e+00

Pos logit contributions
ra+0.044
mm+0.043
era+0.042
iolet+0.042
ara+0.041
Neg logit contributions
 supposed-0.032
 attention-0.031
 way-0.030
igators-0.030
 around-0.030
Token matched
Token ID14451
Feature activation-1.10e+00

Pos logit contributions
ra+0.052
iolet+0.051
mmm+0.051
mm+0.050
era+0.050
Neg logit contributions
 way-0.038
 supposed-0.036
igators-0.036
 around-0.034
 having-0.034
Token its
Token ID663
Feature activation+1.81e-01

Pos logit contributions
ra+0.032
mm+0.030
era+0.030
iolet+0.030
ara+0.029
Neg logit contributions
 supposed-0.024
igators-0.023
 attention-0.023
 way-0.022
rated-0.022
Token color
Token ID3124
Feature activation-1.23e-01

Pos logit contributions
 supposed+0.004
 attention+0.004
 way+0.004
igators+0.004
rated+0.004
Neg logit contributions
ra-0.005
era-0.005
iolet-0.005
mm-0.005
ara-0.005
Token throw
Token ID3714
Feature activation-2.44e-01

Pos logit contributions
lling+0.019
 supposed+0.019
rated+0.018
igators+0.018
 attention+0.018
Neg logit contributions
ra-0.016
era-0.015
mm-0.015
aw-0.014
iolet-0.014
Token it
Token ID340
Feature activation+6.57e-01

Pos logit contributions
ra+0.006
aw+0.006
mm+0.006
ara+0.006
era+0.006
Neg logit contributions
 supposed-0.006
rated-0.006
lling-0.006
igators-0.006
 winding-0.005
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
 supposed+0.015
rated+0.015
igators+0.015
lling+0.014
 sizes+0.014
Neg logit contributions
ra-0.017
aw-0.016
era-0.016
mm-0.015
iolet-0.015
Token You
Token ID921
Feature activation+3.98e-01

Pos logit contributions
ra+0.004
era+0.004
iolet+0.004
mm+0.004
light+0.004
Neg logit contributions
igators-0.003
 supposed-0.003
 having-0.003
 valuable-0.003
 old-0.003
Token could
Token ID714
Feature activation-1.76e+00

Pos logit contributions
 supposed+0.008
igators+0.008
rated+0.008
 attention+0.008
 way+0.007
Neg logit contributions
ra-0.012
era-0.011
mm-0.011
iolet-0.011
ara-0.011
Token hurt
Token ID5938
Feature activation-2.84e+00

Pos logit contributions
ra+0.052
era+0.049
mm+0.048
iolet+0.047
aw+0.046
Neg logit contributions
 supposed-0.038
igators-0.036
rated-0.035
 attention-0.035
lling-0.034
Token someone
Token ID2130
Feature activation+2.98e-01

Pos logit contributions
ra+0.061
era+0.058
mm+0.057
iolet+0.055
light+0.053
Neg logit contributions
 supposed-0.082
igators-0.080
 attention-0.079
 way-0.076
lling-0.076
Token or
Token ID393
Feature activation-1.07e+00

Pos logit contributions
rated+0.009
 supposed+0.009
 winding+0.009
lling+0.009
 attention+0.009
Neg logit contributions
ra-0.005
aw-0.005
mm-0.004
ara-0.004
light-0.004
Token lose
Token ID4425
Feature activation-1.44e+00

Pos logit contributions
ra+0.030
era+0.029
mm+0.028
iolet+0.028
aw+0.027
Neg logit contributions
 supposed-0.024
igators-0.024
 attention-0.023
rated-0.023
 way-0.022
Token it
Token ID340
Feature activation-1.69e-01

Pos logit contributions
ra+0.039
era+0.038
mm+0.037
iolet+0.036
ara+0.035
Neg logit contributions
 supposed-0.034
 attention-0.033
igators-0.033
rated-0.031
 way-0.031
Token.
Token ID13
Feature activation-1.66e-01

Pos logit contributions
ra+0.004
mm+0.003
aw+0.003
era+0.003
iolet+0.003
Neg logit contributions
 supposed-0.005
rated-0.005
lling-0.005
igators-0.005
 attention-0.004