TOP ACTIVATIONS
MAX = 0.143

put it in our photo album ." L ily
Let 's un pack our bags ." Tim my helped his
Let 's un pack our bags ." So , Lily helped
We have to carry our bags ." L ily
them to come to her birthday party . Her mom understood
we need to pack our bags first ." L
family . They packed their bags and got into the car
We can go on a holiday !" Mill ie
grandparents to come to her birthday party .
I have to deliver some papers to our neighbor ."
we borrow it from the store ?" Jack was
need to go to the store and take some more ."
his family went to the shopping centre . Tim had the
them up and have a party . It will be very
We 'll finish all our shopping and get a huge pizza
you get me for my birthday ?" Her mom and dad
put them back on the book case when we 're done
Okay , let 's go shopping for some paint and markers
We can go on our trip now !" Tim my smiled
Now let 's finish our shopping and then we can go

MIN ACTIVATIONS
MIN = -0.145

that . It made a loud , angry noise and sprayed
But then she hears a loud buzz . She notices a
when suddenly he heard a loud roar . He was scared
eyes . It made a loud sound . It stood up
hurt . It makes a loud noise . The other pige
that . It made a loud noise and flew out of
the hive and made a loud noise . The bees came
. The bicycle makes a loud noise . Ben falls off
jumped away and made a loud cro ak .
eating , he heard a loud noise . It was a
skull . He heard a loud noise . The hole started
mouth . It made a loud noise that sounded like a
eyes . It made a loud sound . " C aw
playing , Benny heard a loud noise . It was a
. The blocks made a loud noise . They scattered all
Suddenly , he heard a loud noise . " What was
. The tube makes a loud his s and pops .
at them and made a loud noise . Lily and Tom
down . It makes a loud noise . It breaks into
Suddenly , they heard a loud noise . It was their

INTERVAL 0.071 - 0.143
CONTAINS 0.353%

the exam . Let 's go ." Lily says .
today . Let 's play tomorrow ." Tommy forg ave him
a board game or a puzzle ?" J ill
it ! Can we put it in my stamp collection ?"
, it 's time to go home now . I have

INTERVAL -0.001 - 0.071
CONTAINS 50.915%

" Why are there so many people here ?" Her mom
. She didn 't want him to have it , so
ill wondered what it would be like to fly like the
to make a bigger castle . So , he joined more
the best paper and even the prett iest stickers . They

INTERVAL -0.073 - -0.001
CONTAINS 48.515%

. Anna said , " I love you , Luna .
. <|endoftext|> One day , a boy named Tom went to
. <|endoftext|> Anna liked to read books . She had many
sorry . They realized that they had made a bad choice
, like houses , towers , and cars . They had

INTERVAL -0.145 - -0.073
CONTAINS 0.217%

. Butter flies have antennas and legs . Your bug has
eat , Lily noticed a squirrel watching them from a nearby
edge of the branch he stopped to take a look around
Suddenly he heard a sound . It was quick and
He liked bees . They made honey and whist led .
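
Note on the INTERVAL blocks above: the four ranges are equal-width bins (0.072 wide) spanning MIN = -0.145 to MAX = 0.143. The sketch below shows one way such bin edges and CONTAINS percentages could be computed. It assumes each percentage is the share of sampled token activations falling in that bin, which this dump does not state. The function name interval_shares and its inputs are hypothetical, not taken from the tool that produced the dump.

```python
def interval_shares(activations, n_bins=4):
    """Split [min, max] into equal-width bins; return (edges, percent per bin).

    Assumption: "CONTAINS x%" is the share of activations inside a bin.
    """
    lo, hi = min(activations), max(activations)
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    counts = [0] * n_bins
    for a in activations:
        # Clamp the index so the maximum value lands in the last bin.
        idx = min(int((a - lo) / width), n_bins - 1) if width > 0 else 0
        counts[idx] += 1
    total = len(activations)
    return edges, [100.0 * c / total for c in counts]


# Bin edges implied by the MIN and MAX values reported in this dump.
lo, hi, n_bins = -0.145, 0.143, 4
reported_edges = [round(lo + i * (hi - lo) / n_bins, 3) for i in range(n_bins + 1)]
print(reported_edges)
```

The printed edges line up with the INTERVAL headings (-0.145, -0.073, -0.001, 0.071, 0.143). The percentages themselves cannot be reproduced here, because the underlying activations are not part of this dump.
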
Token put
Token ID1234
Feature activation+6.64e-02

Pos logit contributions
 grandparents+0.075
ari+0.074
 destination+0.071
 online+0.067
 home+0.066
Neg logit contributions
bee-0.055
 sting-0.052
 bleed-0.049
 ow-0.048
 spit-0.047
Token it
Token ID340
Feature activation+8.39e-02

Pos logit contributions
ari+0.207
 grandparents+0.200
 destination+0.192
ait+0.187
 driveway+0.184
Neg logit contributions
 ow-0.020
bee-0.018
 flies-0.017
 bleed-0.017
gging-0.017
Token in
Token ID287
Feature activation+7.67e-02

Pos logit contributions
 grandparents+0.210
 destination+0.186
ari+0.186
 workshop+0.182
 grandson+0.179
Neg logit contributions
bee-0.100
 bleed-0.082
 sting-0.081
 rust-0.079
ling-0.077
Token our
Token ID674
Feature activation+7.78e-02

Pos logit contributions
ari+0.204
 grandparents+0.174
cases+0.172
ival+0.167
 boarded+0.165
Neg logit contributions
 ow-0.067
 itself-0.067
 sting-0.066
 slim-0.065
bee-0.064
Token photo
Token ID4590
Feature activation+1.17e-01

Pos logit contributions
ari+0.117
ait+0.108
ris+0.105
 grandparents+0.104
 online+0.099
Neg logit contributions
 ow-0.144
 slim-0.135
 sting-0.133
 spit-0.127
 sounds-0.124
Token album
Token ID5062
Feature activation+1.43e-01

Pos logit contributions
 grandparents+0.270
 destination+0.264
 boarded+0.245
 harbour+0.241
 workshop+0.240
Neg logit contributions
bee-0.126
der-0.098
bees-0.089
le-0.087
ickets-0.085
Token."
Token ID526
Feature activation+1.94e-02

Pos logit contributions
 grandparents+0.350
ilia+0.332
 workshop+0.320
 destination+0.320
 boarded+0.316
Neg logit contributions
bee-0.110
der-0.082
bees-0.075
.-0.074
ze-0.067
Token↵
Token ID198
Feature activation+1.64e-02

Pos logit contributions
 grandparents+0.042
 destination+0.042
ari+0.038
 home+0.038
 grandma+0.035
Neg logit contributions
bee-0.030
 ow-0.026
gging-0.026
 bleed-0.026
 sting-0.025
Token↵
Token ID198
Feature activation+8.22e-03

Pos logit contributions
 grandparents+0.035
 destination+0.035
 home+0.032
ari+0.032
 grandma+0.031
Neg logit contributions
bee-0.026
 ow-0.023
gging-0.022
 sting-0.022
 bleed-0.022
TokenL
Token ID43
Feature activation-1.42e-03

Pos logit contributions
 grandparents+0.018
 destination+0.018
 home+0.017
ari+0.017
 grandma+0.016
Neg logit contributions
bee-0.013
gging-0.011
 ow-0.011
 sting-0.011
 bleed-0.011
Tokenily
Token ID813
Feature activation+2.93e-02

Pos logit contributions
bee+0.002
 ow+0.002
gging+0.002
 bleed+0.002
 sting+0.002
Neg logit contributions
 grandparents-0.003
 destination-0.003
 home-0.003
ari-0.003
 grandma-0.003
TokenLet
Token ID5756
Feature activation+5.98e-03

Pos logit contributions
bee+0.034
 ow+0.030
gging+0.030
 sting+0.029
 bleed+0.029
Neg logit contributions
 grandparents-0.043
 home-0.041
 destination-0.040
 grandma-0.040
ari-0.039
Token's
Token ID338
Feature activation+5.37e-02

Pos logit contributions
 grandparents+0.013
 destination+0.012
 home+0.012
ari+0.012
 grandma+0.011
Neg logit contributions
bee-0.010
 ow-0.009
gging-0.008
 sting-0.008
 bleed-0.008
Token un
Token ID555
Feature activation+7.08e-02

Pos logit contributions
ari+0.132
 grandparents+0.127
 destination+0.117
 online+0.117
 home+0.113
Neg logit contributions
bee-0.054
 ow-0.053
 spit-0.052
 bleed-0.052
 sting-0.051
Tokenpack
Token ID8002
Feature activation+7.45e-02

Pos logit contributions
 grandparents+0.091
 destination+0.083
 grandma+0.078
 grandmother+0.074
 home+0.074
Neg logit contributions
hing-0.148
ze-0.145
its-0.141
 bleed-0.138
ez-0.138
Token our
Token ID674
Feature activation+6.14e-02

Pos logit contributions
ari+0.183
 grandparents+0.168
 destination+0.163
 driveway+0.154
 boarded+0.154
Neg logit contributions
 ow-0.079
 him-0.068
bee-0.068
 slim-0.065
 foam-0.063
Token bags
Token ID11668
Feature activation+1.34e-01

Pos logit contributions
ari+0.116
 grandparents+0.102
 destination+0.102
nes+0.099
ait+0.098
Neg logit contributions
 ow-0.105
bee-0.090
 spit-0.089
 slim-0.088
 hurts-0.086
Token."
Token ID526
Feature activation-1.11e-02

Pos logit contributions
 grandparents+0.346
 cousins+0.331
 granddaughter+0.325
 grandmother+0.323
 recess+0.314
Neg logit contributions
.-0.063
 and-0.050
 like-0.049
 him-0.045
 he-0.043
Token Tim
Token ID5045
Feature activation-5.48e-04

Pos logit contributions
bee+0.023
 ow+0.020
 sting+0.020
 bleed+0.020
gging+0.020
Neg logit contributions
 grandparents-0.020
 home-0.019
 destination-0.019
 grandma-0.018
ari-0.018
Tokenmy
Token ID1820
Feature activation+3.47e-02

Pos logit contributions
bee+0.001
 ow+0.001
gging+0.001
 bleed+0.001
 sting+0.001
Neg logit contributions
 grandparents-0.001
 destination-0.001
 home-0.001
ari-0.001
 grandma-0.001
Token helped
Token ID4193
Feature activation+9.23e-03

Pos logit contributions
 grandparents+0.089
 destination+0.088
 home+0.084
ari+0.082
 grandma+0.080
Neg logit contributions
bee-0.042
gging-0.032
 spit-0.031
 ow-0.029
 sting-0.029
Token his
Token ID465
Feature activation+6.41e-02

Pos logit contributions
 grandparents+0.022
 destination+0.021
ari+0.021
 home+0.020
 grandma+0.019
Neg logit contributions
bee-0.013
gging-0.011
 bleed-0.011
 ow-0.011
 sting-0.010
TokenLet
Token ID5756
Feature activation+1.17e-02

Pos logit contributions
 grandparents+0.005
 destination+0.005
 home+0.005
ari+0.005
 grandma+0.005
Neg logit contributions
bee-0.003
 ow-0.003
gging-0.003
 bleed-0.003
 sting-0.003
Token's
Token ID338
Feature activation+6.20e-02

Pos logit contributions
 grandparents+0.026
 destination+0.025
ari+0.024
 home+0.024
 grandma+0.023
Neg logit contributions
bee-0.018
 ow-0.016
 sting-0.015
gging-0.015
 bleed-0.015
Token un
Token ID555
Feature activation+7.54e-02

Pos logit contributions
ari+0.157
 grandparents+0.146
 online+0.141
aj+0.139
ouses+0.137
Neg logit contributions
 ow-0.052
 spit-0.052
bee-0.050
 sting-0.050
 seem-0.049
Tokenpack
Token ID8002
Feature activation+8.23e-02

Pos logit contributions
 grandparents+0.096
 destination+0.089
 grandma+0.088
 home+0.083
 cousin+0.082
Neg logit contributions
hing-0.154
ze-0.146
its-0.143
gging-0.140
ez-0.138
Token our
Token ID674
Feature activation+8.02e-02

Pos logit contributions
ari+0.200
 destination+0.183
 grandparents+0.180
 driveway+0.177
 harbour+0.177
Neg logit contributions
 ow-0.085
bee-0.077
 sounds-0.069
 him-0.067
 slim-0.065
Token bags
Token ID11668
Feature activation+1.32e-01

Pos logit contributions
ari+0.143
eline+0.127
ris+0.126
enna+0.120
nes+0.120
Neg logit contributions
 ow-0.121
 sounds-0.117
 noise-0.111
 slim-0.111
 hurts-0.107
Token."
Token ID526
Feature activation+3.13e-02

Pos logit contributions
 grandparents+0.336
 cousins+0.323
 granddaughter+0.315
 recess+0.314
 grandmother+0.310
Neg logit contributions
.-0.064
 and-0.045
 like-0.045
cr-0.041
 hurts-0.040
Token So
Token ID1406
Feature activation+3.33e-02

Pos logit contributions
 grandparents+0.073
 destination+0.071
ari+0.066
 home+0.064
 passport+0.061
Neg logit contributions
bee-0.043
gging-0.037
 ow-0.036
 sting-0.035
 spit-0.035
Token,
Token ID11
Feature activation+9.43e-03

Pos logit contributions
 grandparents+0.082
ari+0.080
 destination+0.079
 workshop+0.074
 harbour+0.074
Neg logit contributions
bee-0.041
 ow-0.036
 sting-0.034
gging-0.030
 slim-0.030
Token Lily
Token ID20037
Feature activation+7.05e-02

Pos logit contributions
 grandparents+0.018
 destination+0.018
ari+0.017
 home+0.017
 grandma+0.016
Neg logit contributions
bee-0.017
 ow-0.015
 sting-0.015
gging-0.015
 bleed-0.015
Token helped
Token ID4193
Feature activation+3.36e-02

Pos logit contributions
 destination+0.165
 grandparents+0.165
 recess+0.157
 home+0.152
 France+0.147
Neg logit contributions
bee-0.092
 spit-0.070
ze-0.064
 wr-0.061
gging-0.059
Token We
Token ID775
Feature activation+4.20e-02

Pos logit contributions
 destination+0.088
 grandparents+0.088
 home+0.085
ari+0.082
 online+0.078
Neg logit contributions
bee-0.041
 ow-0.036
 bleed-0.035
gging-0.034
 sting-0.032
Token have
Token ID423
Feature activation+7.57e-02

Pos logit contributions
 grandparents+0.102
ari+0.097
 destination+0.096
 grandma+0.091
 home+0.090
Neg logit contributions
bee-0.050
 bleed-0.048
 sting-0.048
 seem-0.042
 ow-0.040
Token to
Token ID284
Feature activation+4.88e-02

Pos logit contributions
ari+0.168
 grandparents+0.150
 destination+0.146
 gallery+0.138
eline+0.137
Neg logit contributions
 ow-0.093
 sting-0.088
 slim-0.087
 th-0.086
 bleed-0.082
Token carry
Token ID3283
Feature activation+6.15e-02

Pos logit contributions
ari+0.088
 grandparents+0.084
 destination+0.076
 online+0.074
 home+0.071
Neg logit contributions
bee-0.088
 sting-0.086
 bleed-0.085
 spit-0.083
gging-0.082
Token our
Token ID674
Feature activation+5.79e-02

Pos logit contributions
ari+0.157
 destination+0.137
 grandparents+0.136
 gallery+0.131
 driveway+0.130
Neg logit contributions
 ow-0.060
bee-0.057
 bleed-0.055
gging-0.055
 sting-0.055
Token bags
Token ID11668
Feature activation+1.31e-01

Pos logit contributions
ari+0.105
eline+0.098
 grandparents+0.094
 boarded+0.090
ris+0.090
Neg logit contributions
 ow-0.095
bee-0.090
 slim-0.086
 sting-0.084
 bleed-0.081
Token."
Token ID526
Feature activation+2.41e-02

Pos logit contributions
 grandparents+0.338
 cousins+0.315
 granddaughter+0.310
 recess+0.309
 grandmother+0.304
Neg logit contributions
.-0.065
 like-0.062
pe-0.061
 himself-0.055
 slim-0.054
Token↵
Token ID198
Feature activation+2.23e-02

Pos logit contributions
 destination+0.056
 grandparents+0.056
ari+0.053
 home+0.051
 passports+0.049
Neg logit contributions
bee-0.031
 ow-0.027
gging-0.026
 sting-0.026
 bleed-0.025
Token↵
Token ID198
Feature activation+2.35e-02

Pos logit contributions
 destination+0.050
 grandparents+0.050
ari+0.046
 home+0.046
 online+0.044
Neg logit contributions
bee-0.031
 ow-0.027
gging-0.027
 sting-0.026
 bleed-0.026
TokenL
Token ID43
Feature activation-4.88e-03

Pos logit contributions
 grandparents+0.058
 destination+0.058
 home+0.056
ari+0.055
 grandma+0.052
Neg logit contributions
bee-0.029
gging-0.024
 sting-0.022
ching-0.022
 bleed-0.021
Tokenily
Token ID813
Feature activation+1.63e-02

Pos logit contributions
bee+0.007
 ow+0.006
 bleed+0.006
gging+0.006
 sting+0.006
Neg logit contributions
 grandparents-0.011
 destination-0.010
 home-0.010
ari-0.010
 grandma-0.010
Token them
Token ID606
Feature activation+7.48e-02

Pos logit contributions
ari+0.073
 destination+0.070
 grandparents+0.067
 boarded+0.061
 backpack+0.060
Neg logit contributions
bee-0.063
 ow-0.062
 sting-0.060
 rust-0.056
 bleed-0.056
Token to
Token ID284
Feature activation+2.51e-03

Pos logit contributions
 grandparents+0.149
ari+0.149
 destination+0.142
 workshop+0.142
 harbour+0.139
Neg logit contributions
 bleed-0.094
 seem-0.092
bee-0.090
 rust-0.088
 sting-0.085
Token come
Token ID1282
Feature activation+7.10e-02

Pos logit contributions
 grandparents+0.005
 destination+0.005
 home+0.005
ari+0.005
 grandma+0.005
Neg logit contributions
bee-0.004
 ow-0.004
gging-0.004
 bleed-0.004
 sting-0.004
Token to
Token ID284
Feature activation+6.01e-02

Pos logit contributions
ari+0.136
 grandparents+0.134
ilia+0.127
 passports+0.127
 list+0.125
Neg logit contributions
 bleed-0.095
bee-0.093
 sting-0.089
fly-0.089
 tweet-0.087
Token her
Token ID607
Feature activation+8.95e-02

Pos logit contributions
ari+0.130
 destination+0.115
belt+0.115
 grandparents+0.115
suits+0.114
Neg logit contributions
 ow-0.086
 sting-0.080
bee-0.076
 conclusions-0.073
 spit-0.073
Token birthday
Token ID10955
Feature activation+1.30e-01

Pos logit contributions
ilia+0.161
 boarded+0.157
ari+0.155
ait+0.154
eline+0.154
Neg logit contributions
 ow-0.110
 sounds-0.108
ze-0.106
bee-0.104
 sting-0.100
Token party
Token ID2151
Feature activation+1.01e-01

Pos logit contributions
 boarded+0.211
belt+0.188
 grandparents+0.186
ounded+0.180
 harbour+0.180
Neg logit contributions
 ow-0.191
.-0.179
bee-0.173
le-0.172
 sounds-0.167
Token.
Token ID13
Feature activation+3.68e-02

Pos logit contributions
 grandparents+0.273
 boarded+0.264
 harbour+0.259
 destination+0.259
ari+0.256
Neg logit contributions
bee-0.053
 ow-0.034
ze-0.032
 rust-0.032
.-0.029
Token Her
Token ID2332
Feature activation+5.77e-02

Pos logit contributions
 destination+0.104
 grandparents+0.102
ari+0.099
 boarded+0.096
 home+0.096
Neg logit contributions
bee-0.021
 ow-0.017
gging-0.016
 spit-0.013
 sting-0.012
Token mom
Token ID1995
Feature activation+3.61e-02

Pos logit contributions
 destination+0.068
ari+0.068
pack+0.062
ait+0.061
ris+0.060
Neg logit contributions
 ow-0.139
bee-0.128
 sounds-0.116
 noise-0.115
 sting-0.115
Token understood
Token ID7247
Feature activation-4.27e-03

Pos logit contributions
 destination+0.076
 grandparents+0.072
 home+0.069
 recess+0.067
ari+0.066
Neg logit contributions
bee-0.060
 ow-0.049
 spit-0.048
 sting-0.048
 rust-0.047
Token we
Token ID356
Feature activation+4.35e-02

Pos logit contributions
 grandparents+0.010
 destination+0.010
 home+0.009
ari+0.009
 grandma+0.009
Neg logit contributions
bee-0.009
 ow-0.008
gging-0.008
 sting-0.008
 bleed-0.008
Token need
Token ID761
Feature activation+5.51e-02

Pos logit contributions
 grandparents+0.106
 destination+0.099
ari+0.098
 grandma+0.093
 home+0.091
Neg logit contributions
bee-0.053
 sting-0.049
 bleed-0.048
 seem-0.044
 ow-0.043
Token to
Token ID284
Feature activation+3.93e-02

Pos logit contributions
ari+0.106
 destination+0.105
 grandparents+0.100
eline+0.093
ris+0.091
Neg logit contributions
 ow-0.083
 sting-0.079
 rust-0.077
 mosquitoes-0.075
bee-0.075
Token pack
Token ID2353
Feature activation+5.87e-02

Pos logit contributions
 grandparents+0.088
ari+0.087
 online+0.080
 destination+0.079
 grandma+0.076
Neg logit contributions
bee-0.054
 sting-0.051
 bleed-0.050
 ow-0.047
gging-0.046
Token our
Token ID674
Feature activation+7.05e-02

Pos logit contributions
ari+0.129
 destination+0.124
 grandparents+0.120
 workshop+0.113
 harbour+0.113
Neg logit contributions
 ow-0.073
bee-0.072
 sting-0.068
 bleed-0.068
 slim-0.067
Token bags
Token ID11668
Feature activation+1.30e-01

Pos logit contributions
ari+0.111
ris+0.102
 gallery+0.093
 grandparents+0.093
eline+0.093
Neg logit contributions
 ow-0.131
bee-0.128
 sting-0.120
 sounds-0.120
 slim-0.118
Token first
Token ID717
Feature activation+8.99e-02

Pos logit contributions
 grandparents+0.320
 cousins+0.305
 destination+0.293
 granddaughter+0.291
 cousin+0.291
Neg logit contributions
.-0.083
 like-0.078
 himself-0.069
ling-0.063
 sensation-0.060
Token."
Token ID526
Feature activation+2.26e-02

Pos logit contributions
 grandparents+0.212
ari+0.208
 destination+0.197
 boarded+0.191
 harbour+0.189
Neg logit contributions
bee-0.092
 ow-0.069
 sting-0.068
 seem-0.064
 slim-0.061
Token↵
Token ID198
Feature activation+1.45e-02

Pos logit contributions
 grandparents+0.052
 destination+0.051
 home+0.046
ari+0.046
 grandma+0.044
Neg logit contributions
bee-0.032
 ow-0.028
gging-0.028
 bleed-0.027
 sting-0.027
Token↵
Token ID198
Feature activation+6.68e-04

Pos logit contributions
 grandparents+0.031
 destination+0.030
 home+0.028
ari+0.028
 grandma+0.027
Neg logit contributions
bee-0.023
 ow-0.020
gging-0.020
 sting-0.020
 bleed-0.020
TokenL
Token ID43
Feature activation-2.11e-03

Pos logit contributions
 grandparents+0.001
 destination+0.001
 home+0.001
ari+0.001
 grandma+0.001
Neg logit contributions
bee-0.001
 ow-0.001
gging-0.001
 bleed-0.001
 sting-0.001
Token family
Token ID1641
Feature activation+4.18e-02

Pos logit contributions
ari+0.108
eline+0.104
 destination+0.103
pack+0.102
 harbour+0.095
Neg logit contributions
 ow-0.118
bee-0.107
 sting-0.093
 rust-0.093
 mosquito-0.088
Token.
Token ID13
Feature activation+2.40e-02

Pos logit contributions
 destination+0.118
 grandparents+0.111
ari+0.111
 boarded+0.106
 grandma+0.102
Neg logit contributions
bee-0.036
 sensation-0.019
gging-0.018
 ow-0.018
 rust-0.017
Token They
Token ID1119
Feature activation+6.26e-02

Pos logit contributions
 grandparents+0.056
 destination+0.055
ari+0.052
 home+0.052
 passports+0.048
Neg logit contributions
bee-0.031
 ow-0.026
gging-0.025
 sensation-0.024
 sting-0.024
Token packed
Token ID11856
Feature activation+4.97e-02

Pos logit contributions
 destination+0.141
 grandparents+0.130
ari+0.125
aj+0.120
 grandmother+0.120
Neg logit contributions
bee-0.102
hing-0.074
 rum-0.073
 rust-0.073
 ow-0.070
Token their
Token ID511
Feature activation+8.07e-02

Pos logit contributions
ari+0.109
 destination+0.106
 grandparents+0.103
 workshop+0.091
 driveway+0.089
Neg logit contributions
bee-0.068
 ow-0.066
 rust-0.063
 slim-0.063
 sting-0.062
Token bags
Token ID11668
Feature activation+1.26e-01

Pos logit contributions
ari+0.128
ris+0.114
 destination+0.111
 gallery+0.110
nes+0.108
Neg logit contributions
 ow-0.148
 slim-0.135
bee-0.134
 sounds-0.129
 rust-0.123
Token and
Token ID290
Feature activation+6.96e-02

Pos logit contributions
 principal+0.346
 grandparents+0.345
 cousin+0.338
 granddaughter+0.336
 recess+0.332
Neg logit contributions
 and-0.023
 rust-0.017
 with-0.015
ling-0.012
 he-0.009
Token got
Token ID1392
Feature activation+8.04e-02

Pos logit contributions
 destination+0.154
ari+0.152
 grandparents+0.139
 grandmother+0.128
ris+0.127
Neg logit contributions
bee-0.102
 spit-0.085
 ow-0.082
 rust-0.080
 wr-0.077
Token into
Token ID656
Feature activation+8.26e-02

Pos logit contributions
ari+0.193
 destination+0.182
ris+0.164
 grandparents+0.160
 workshop+0.155
Neg logit contributions
 him-0.090
 spit-0.086
 gag-0.082
You-0.081
 tick-0.080
Token the
Token ID262
Feature activation+6.92e-02

Pos logit contributions
ari+0.243
 destination+0.206
belt+0.199
ival+0.190
ris+0.190
Neg logit contributions
 ow-0.065
 noise-0.054
 th-0.052
 sensation-0.052
 slim-0.052
Token car
Token ID1097
Feature activation+8.19e-02

Pos logit contributions
ari+0.124
eline+0.119
 grandparents+0.118
ait+0.115
nes+0.113
Neg logit contributions
 ow-0.102
 foam-0.090
 sounds-0.089
 slim-0.089
 noise-0.088
Token We
Token ID775
Feature activation+4.16e-02

Pos logit contributions
 destination+0.103
 grandparents+0.102
 home+0.100
ari+0.094
 backpack+0.092
Neg logit contributions
bee-0.054
 ow-0.046
 bleed-0.045
gging-0.044
 sensation-0.042
Token can
Token ID460
Feature activation+3.07e-02

Pos logit contributions
 grandparents+0.090
 destination+0.085
 grandma+0.082
ari+0.081
 home+0.078
Neg logit contributions
bee-0.067
 sting-0.057
 bleed-0.055
ching-0.052
 ow-0.052
Token go
Token ID467
Feature activation+8.12e-02

Pos logit contributions
 grandparents+0.059
ari+0.057
 destination+0.055
 home+0.054
 online+0.053
Neg logit contributions
bee-0.051
 ow-0.045
 bleed-0.045
 sting-0.045
gging-0.044
Token on
Token ID319
Feature activation+7.76e-02

Pos logit contributions
 grandparents+0.152
ari+0.141
 cousin+0.137
 grandson+0.135
 granddaughter+0.135
Neg logit contributions
bee-0.101
 seem-0.099
ching-0.098
 bleed-0.098
 rust-0.097
Token a
Token ID257
Feature activation+7.36e-02

Pos logit contributions
ari+0.180
 grandparents+0.158
eline+0.152
 helper+0.150
 buyer+0.147
Neg logit contributions
 ow-0.097
 mosquitoes-0.084
bee-0.078
 th-0.078
 sting-0.078
Token holiday
Token ID9912
Feature activation+1.25e-01

Pos logit contributions
 grandparents+0.100
ari+0.096
 granddaughter+0.090
 grandson+0.087
 destination+0.086
Neg logit contributions
bee-0.149
 ow-0.148
 rust-0.137
 slim-0.135
 sting-0.134
Token!"
Token ID2474
Feature activation-8.28e-03

Pos logit contributions
 grandparents+0.256
ilia+0.248
 granddaughter+0.244
 boarded+0.242
 grandmother+0.240
Neg logit contributions
 sounds-0.109
le-0.103
bee-0.100
h-0.097
.-0.095
Token↵
Token ID198
Feature activation+4.72e-03

Pos logit contributions
bee+0.017
gging+0.015
 ow+0.015
 bleed+0.014
 sting+0.014
Neg logit contributions
 grandparents-0.015
 home-0.014
 destination-0.014
 grandma-0.013
ari-0.013
Token↵
Token ID198
Feature activation+9.57e-03

Pos logit contributions
 grandparents+0.009
 destination+0.009
 home+0.009
ari+0.009
 grandma+0.008
Neg logit contributions
bee-0.008
 ow-0.007
gging-0.007
 bleed-0.007
 sting-0.007
TokenMill
Token ID22603
Feature activation-8.30e-03

Pos logit contributions
 grandparents+0.019
 destination+0.019
 home+0.018
ari+0.018
 grandma+0.017
Neg logit contributions
bee-0.017
 ow-0.015
gging-0.014
 sting-0.014
 bleed-0.014
Tokenie
Token ID494
Feature activation+2.68e-02

Pos logit contributions
bee+0.011
 bleed+0.009
 ow+0.009
gging+0.009
 sting+0.009
Neg logit contributions
 grandparents-0.020
 destination-0.019
 home-0.019
ari-0.019
 grandma-0.018
Token grandparents
Token ID28571
Feature activation+5.20e-02

Pos logit contributions
ari+0.125
ait+0.116
eline+0.109
ris+0.107
 boarded+0.104
Neg logit contributions
 ow-0.177
 sounds-0.154
 happen-0.149
 noise-0.149
 hurts-0.148
Token to
Token ID284
Feature activation+1.84e-02

Pos logit contributions
 grandparents+0.105
 destination+0.103
ari+0.100
 recess+0.090
 driveway+0.089
Neg logit contributions
bee-0.078
 seem-0.068
 sting-0.067
 rust-0.066
 bleed-0.065
Token come
Token ID1282
Feature activation+5.07e-02

Pos logit contributions
 grandparents+0.037
 destination+0.035
ari+0.035
 home+0.033
 online+0.032
Neg logit contributions
bee-0.031
 sting-0.028
 ow-0.028
 bleed-0.028
gging-0.027
Token to
Token ID284
Feature activation+4.78e-02

Pos logit contributions
ari+0.109
 grandparents+0.109
 destination+0.100
 passports+0.100
 list+0.098
Neg logit contributions
bee-0.062
 bleed-0.060
 sting-0.057
 rust-0.054
 twitch-0.053
Token her
Token ID607
Feature activation+8.65e-02

Pos logit contributions
ari+0.100
 destination+0.091
 grandparents+0.091
belt+0.086
suits+0.085
Neg logit contributions
 ow-0.072
bee-0.068
 sting-0.067
 bleed-0.062
 conclusions-0.062
Token birthday
Token ID10955
Feature activation+1.25e-01

Pos logit contributions
ilia+0.154
 boarded+0.151
ari+0.151
eline+0.150
ait+0.149
Neg logit contributions
 ow-0.108
 sounds-0.107
ze-0.103
 rust-0.102
bee-0.100
Token party
Token ID2151
Feature activation+9.64e-02

Pos logit contributions
 boarded+0.208
belt+0.190
ounded+0.186
 grandparents+0.182
packing+0.178
Neg logit contributions
 ow-0.178
.-0.170
bee-0.164
le-0.158
 sounds-0.152
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
 grandparents+0.277
 boarded+0.272
 destination+0.264
ari+0.262
 harbour+0.262
Neg logit contributions
bee-0.032
.-0.017
 rust-0.015
ze-0.014
 ow-0.012
Token␣
Token ID220
Feature activation+2.24e-02

Pos logit contributions
 destination+0.054
 grandparents+0.054
ari+0.051
 home+0.050
 online+0.048
Neg logit contributions
bee-0.032
 ow-0.028
gging-0.028
 sting-0.025
 bleed-0.025
Token↵
Token ID198
Feature activation+1.88e-02

Pos logit contributions
 grandparents+0.048
 destination+0.047
 home+0.044
ari+0.044
 grandma+0.042
Neg logit contributions
bee-0.035
gging-0.030
 ow-0.030
 bleed-0.030
 sting-0.029
Token↵
Token ID198
Feature activation-3.91e-03

Pos logit contributions
 grandparents+0.040
 destination+0.039
 home+0.036
ari+0.036
 grandma+0.035
Neg logit contributions
bee-0.030
gging-0.026
 ow-0.026
 sting-0.025
 bleed-0.025
Token I
Token ID314
Feature activation+2.80e-02

Pos logit contributions
 grandparents+0.043
 destination+0.042
 home+0.040
ari+0.039
 grandma+0.038
Neg logit contributions
bee-0.025
 ow-0.021
 bleed-0.021
gging-0.020
 sting-0.020
Token have
Token ID423
Feature activation+6.51e-02

Pos logit contributions
 grandparents+0.061
 destination+0.059
ari+0.057
 home+0.055
 grandma+0.054
Neg logit contributions
bee-0.041
 bleed-0.038
 sting-0.038
 ow-0.034
ching-0.034
Token to
Token ID284
Feature activation+3.00e-02

Pos logit contributions
ari+0.152
 destination+0.141
 grandparents+0.138
ris+0.132
 gallery+0.125
Neg logit contributions
 ow-0.083
 sting-0.076
 mosquitoes-0.074
 slim-0.070
 th-0.068
Token deliver
Token ID5203
Feature activation+4.86e-02

Pos logit contributions
 grandparents+0.064
ari+0.063
 destination+0.060
 home+0.057
 online+0.057
Neg logit contributions
bee-0.046
 bleed-0.042
 sting-0.041
 ow-0.040
gging-0.039
Token some
Token ID617
Feature activation+4.88e-02

Pos logit contributions
ari+0.111
 grandparents+0.109
 destination+0.109
eline+0.097
 driveway+0.097
Neg logit contributions
 ow-0.058
bee-0.056
 sting-0.052
 mosquitoes-0.051
 slim-0.051
Token papers
Token ID9473
Feature activation+1.24e-01

Pos logit contributions
 destination+0.095
 grandparents+0.093
ari+0.088
 home+0.082
eline+0.081
Neg logit contributions
bee-0.080
 ow-0.075
 rust-0.072
 sting-0.072
 mosquitoes-0.069
Token to
Token ID284
Feature activation+5.94e-02

Pos logit contributions
 grandparents+0.297
 destination+0.276
 workshop+0.269
ilia+0.260
 grandmother+0.258
Neg logit contributions
bee-0.111
der-0.096
 like-0.092
 seem-0.089
 sting-0.088
Token our
Token ID674
Feature activation+7.50e-02

Pos logit contributions
ari+0.120
 destination+0.115
 grandparents+0.114
suits+0.101
 backpack+0.100
Neg logit contributions
 ow-0.089
 sting-0.088
bee-0.084
 spit-0.083
 bleed-0.081
Token neighbor
Token ID4780
Feature activation+7.38e-02

Pos logit contributions
ari+0.098
 boarded+0.085
eline+0.085
 online+0.082
ris+0.081
Neg logit contributions
 ow-0.147
bee-0.145
 sting-0.143
 mosquitoes-0.139
 sounds-0.136
Token."
Token ID526
Feature activation+1.32e-02

Pos logit contributions
 grandparents+0.176
 destination+0.173
 recess+0.163
ari+0.161
 passports+0.157
Neg logit contributions
bee-0.080
 ow-0.065
 sting-0.059
 itch-0.056
 bleed-0.056
Token↵
Token ID198
Feature activation+1.93e-02

Pos logit contributions
 grandparents+0.028
 destination+0.027
ari+0.026
 home+0.025
 grandma+0.024
Neg logit contributions
bee-0.022
 ow-0.019
gging-0.019
 sting-0.018
 bleed-0.018
Token we
Token ID356
Feature activation+5.93e-02

Pos logit contributions
 destination+0.091
 grandparents+0.087
ari+0.087
 home+0.085
 driveway+0.078
Neg logit contributions
bee-0.058
 ow-0.055
gging-0.050
 rust-0.049
 slim-0.049
Token borrow
Token ID8804
Feature activation+4.88e-02

Pos logit contributions
 grandparents+0.104
ari+0.099
 destination+0.099
aj+0.092
 carriage+0.090
Neg logit contributions
 seem-0.098
 spit-0.097
 ow-0.094
 sting-0.093
bee-0.090
Token it
Token ID340
Feature activation+8.63e-02

Pos logit contributions
ari+0.125
 grandparents+0.123
 destination+0.120
 driveway+0.113
 boarded+0.113
Neg logit contributions
bee-0.044
 ow-0.043
gging-0.039
 sting-0.037
 slim-0.034
Token from
Token ID422
Feature activation+7.47e-02

Pos logit contributions
 grandparents+0.204
 destination+0.189
ari+0.181
 cousins+0.180
 boarded+0.180
Neg logit contributions
bee-0.096
 ow-0.085
 sting-0.077
 rust-0.076
 happen-0.073
Token the
Token ID262
Feature activation+3.87e-02

Pos logit contributions
ari+0.173
 grandparents+0.148
 destination+0.148
 boarded+0.146
 driveway+0.141
Neg logit contributions
bee-0.085
 ow-0.084
gging-0.082
 him-0.077
 sting-0.074
Token store
Token ID3650
Feature activation+1.24e-01

Pos logit contributions
ari+0.056
 grandparents+0.055
 destination+0.052
 boarded+0.052
eline+0.051
Neg logit contributions
 ow-0.078
bee-0.074
 slim-0.073
 sensation-0.071
 rust-0.070
Token?"
Token ID1701
Feature activation-2.68e-02

Pos logit contributions
 grandparents+0.278
 recess+0.271
 destination+0.267
 carriage+0.261
 driveway+0.261
Neg logit contributions
Why-0.083
bee-0.080
 rust-0.068
 noise-0.064
 sounded-0.063
Token↵
Token ID198
Feature activation+3.32e-03

Pos logit contributions
bee+0.069
 bleed+0.062
 ow+0.062
 sting+0.061
 rust+0.059
Neg logit contributions
 grandparents-0.034
 home-0.032
 grandma-0.031
 destination-0.030
 online-0.028
Token↵
Token ID198
Feature activation-2.45e-02

Pos logit contributions
 grandparents+0.007
 destination+0.006
 home+0.006
ari+0.006
 grandma+0.006
Neg logit contributions
bee-0.006
gging-0.005
 ow-0.005
 bleed-0.005
 sting-0.005
TokenJack
Token ID14295
Feature activation-2.30e-02

Pos logit contributions
bee+0.054
 bleed+0.050
 ow+0.049
 sting+0.048
 sensation+0.046
Neg logit contributions
 grandparents-0.039
 destination-0.035
 grandma-0.034
 home-0.034
ari-0.032
Token was
Token ID373
Feature activation-1.49e-03

Pos logit contributions
bee+0.038
 bleed+0.035
 ow+0.035
 sensation+0.033
 sting+0.033
Neg logit contributions
 grandparents-0.047
 destination-0.045
 home-0.044
 grandma-0.043
ari-0.042
Token need
Token ID761
Feature activation+3.85e-02

Pos logit contributions
 grandparents+0.028
 destination+0.027
ari+0.026
 home+0.026
 grandma+0.025
Neg logit contributions
bee-0.020
 bleed-0.017
 sting-0.017
 ow-0.017
gging-0.017
Token to
Token ID284
Feature activation+4.65e-03

Pos logit contributions
ari+0.072
 destination+0.071
 grandparents+0.070
 home+0.060
eline+0.060
Neg logit contributions
bee-0.065
 ow-0.063
 sting-0.062
 bleed-0.058
 mosquitoes-0.057
Token go
Token ID467
Feature activation+1.00e-01

Pos logit contributions
 grandparents+0.008
 destination+0.008
 home+0.008
ari+0.008
 grandma+0.007
Neg logit contributions
bee-0.009
 ow-0.008
gging-0.008
 bleed-0.008
 sting-0.008
Token to
Token ID284
Feature activation+6.76e-02

Pos logit contributions
 grandparents+0.218
ari+0.215
bek+0.211
eria+0.210
aughter+0.202
Neg logit contributions
 hurt-0.085
 him-0.082
 bleed-0.081
 whistle-0.074
 hurts-0.071
Token the
Token ID262
Feature activation+7.89e-02

Pos logit contributions
ari+0.180
ouses+0.156
 grandparents+0.154
suits+0.152
eline+0.150
Neg logit contributions
 ow-0.072
 sting-0.066
 bleed-0.064
 noise-0.060
 conclusions-0.059
Token store
Token ID3650
Feature activation+1.24e-01

Pos logit contributions
ouses+0.134
packs+0.132
ari+0.132
eline+0.131
 grandparents+0.126
Neg logit contributions
 ow-0.133
 mosquito-0.119
 mosquitoes-0.117
 rust-0.116
 sounds-0.115
Token and
Token ID290
Feature activation+6.44e-02

Pos logit contributions
 recess+0.320
 grandparents+0.318
 carriage+0.305
 destination+0.300
 suitcase+0.295
Neg logit contributions
.-0.066
 and-0.049
 like-0.047
 sometimes-0.039
 noise-0.037
Token take
Token ID1011
Feature activation+4.12e-02

Pos logit contributions
ari+0.123
 grandparents+0.114
 destination+0.111
ris+0.102
ouses+0.099
Neg logit contributions
 bleed-0.101
 spit-0.099
 sting-0.097
 seem-0.097
 ow-0.093
Token some
Token ID617
Feature activation+7.18e-02

Pos logit contributions
ari+0.097
 grandparents+0.095
 destination+0.093
 gallery+0.084
 driveway+0.082
Neg logit contributions
 ow-0.051
bee-0.050
 sting-0.047
ez-0.047
 bleed-0.047
Token more
Token ID517
Feature activation+4.10e-02

Pos logit contributions
ari+0.154
 grandparents+0.154
 destination+0.153
eline+0.145
 grandmother+0.138
Neg logit contributions
bee-0.084
 ow-0.082
 sting-0.077
 noise-0.074
 mosquitoes-0.072
Token."
Token ID526
Feature activation-1.72e-02

Pos logit contributions
 grandparents+0.092
 destination+0.091
ari+0.088
 grandmother+0.081
 home+0.080
Neg logit contributions
bee-0.063
 sting-0.050
 ow-0.049
gging-0.049
its-0.047
Token his
Token ID465
Feature activation+2.28e-02

Pos logit contributions
 destination+0.040
 grandparents+0.039
ari+0.038
 home+0.035
 online+0.034
Neg logit contributions
bee-0.029
 ow-0.025
 bleed-0.023
 sting-0.023
gging-0.023
Token family
Token ID1641
Feature activation+4.27e-02

Pos logit contributions
 grandparents+0.028
 destination+0.027
ari+0.025
 home+0.024
 online+0.022
Neg logit contributions
bee-0.057
 ow-0.052
 sting-0.050
 bleed-0.050
gging-0.049
Token went
Token ID1816
Feature activation+6.32e-02

Pos logit contributions
 grandparents+0.093
 destination+0.093
ari+0.087
 grandma+0.083
 home+0.082
Neg logit contributions
bee-0.063
 ow-0.052
 sting-0.049
 rust-0.049
ching-0.048
Token to
Token ID284
Feature activation+4.72e-02

Pos logit contributions
 grandparents+0.127
 destination+0.118
ari+0.112
ilia+0.106
ait+0.105
Neg logit contributions
bee-0.084
ching-0.083
 rust-0.080
 buzzing-0.079
 bleed-0.079
Token the
Token ID262
Feature activation+5.05e-02

Pos logit contributions
ari+0.118
 grandparents+0.106
 destination+0.104
belt+0.099
ouses+0.098
Neg logit contributions
 ow-0.050
bee-0.049
 bleed-0.049
 sting-0.048
 rust-0.044
Token shopping
Token ID9735
Feature activation+1.24e-01

Pos logit contributions
 grandparents+0.081
ari+0.080
 destination+0.073
ouses+0.072
eline+0.072
Neg logit contributions
 ow-0.089
 rust-0.084
bee-0.084
 sting-0.082
 slim-0.079
Token centre
Token ID7372
Feature activation+5.56e-02

Pos logit contributions
 grandparents+0.227
iper+0.227
aley+0.220
 destination+0.215
aries+0.214
Neg logit contributions
.-0.142
 he-0.122
 sounds-0.118
 sound-0.114
 noise-0.113
Token.
Token ID13
Feature activation+4.98e-02

Pos logit contributions
 grandparents+0.134
 destination+0.131
ari+0.122
 home+0.118
 grandma+0.118
Neg logit contributions
bee-0.063
 sensation-0.051
 ow-0.047
 rust-0.047
 sting-0.045
Token Tim
Token ID5045
Feature activation+6.30e-02

Pos logit contributions
 destination+0.120
 grandparents+0.119
 home+0.113
ari+0.109
 passports+0.108
Neg logit contributions
bee-0.051
 ow-0.043
gging-0.043
 sensation-0.040
 slim-0.040
Token had
Token ID550
Feature activation+5.11e-02

Pos logit contributions
 grandparents+0.157
 recess+0.155
 destination+0.155
 passports+0.151
 France+0.150
Neg logit contributions
bee-0.057
gging-0.040
ez-0.037
 ow-0.037
ling-0.036
Token the
Token ID262
Feature activation+3.67e-02

Pos logit contributions
ari+0.117
 grandparents+0.116
 destination+0.114
 grandmother+0.101
 online+0.100
Neg logit contributions
bee-0.058
 ow-0.057
 slim-0.056
 rust-0.053
 spit-0.052
Token them
Token ID606
Feature activation+5.63e-02

Pos logit contributions
 grandparents+0.070
 destination+0.066
ari+0.066
 home+0.063
 workshop+0.060
Neg logit contributions
bee-0.043
 ow-0.036
 sting-0.035
gging-0.034
 bleed-0.034
Token up
Token ID510
Feature activation+8.66e-02

Pos logit contributions
 grandparents+0.119
ari+0.112
 destination+0.106
 workshop+0.102
 grandson+0.102
Neg logit contributions
bee-0.094
 sting-0.079
 rust-0.076
 bleed-0.076
ze-0.074
Token and
Token ID290
Feature activation+3.82e-02

Pos logit contributions
 grandparents+0.202
ari+0.194
 destination+0.187
ilia+0.180
 workshop+0.179
Neg logit contributions
bee-0.105
 sting-0.088
 slim-0.087
 buzzing-0.085
p-0.079
Token have
Token ID423
Feature activation+4.92e-02

Pos logit contributions
 grandparents+0.083
ari+0.082
 destination+0.080
 home+0.075
 grandma+0.071
Neg logit contributions
bee-0.056
 sting-0.051
 bleed-0.050
 ow-0.048
 rust-0.047
Token a
Token ID257
Feature activation+5.96e-02

Pos logit contributions
ari+0.099
 grandparents+0.097
 destination+0.097
 home+0.086
pack+0.084
Neg logit contributions
bee-0.076
 ow-0.075
 sting-0.068
 rust-0.066
 slim-0.066
Token party
Token ID2151
Feature activation+1.23e-01

Pos logit contributions
ari+0.105
 grandparents+0.105
 destination+0.100
 boarded+0.089
 online+0.088
Neg logit contributions
bee-0.108
 sting-0.103
 ow-0.098
 slim-0.090
 spit-0.090
Token.
Token ID13
Feature activation+5.29e-02

Pos logit contributions
 grandparents+0.288
 destination+0.280
 workshop+0.280
 harbour+0.277
 boarded+0.265
Neg logit contributions
bee-0.133
ze-0.094
bees-0.094
p-0.090
ms-0.084
Token It
Token ID632
Feature activation+2.85e-02

Pos logit contributions
 destination+0.124
 grandparents+0.123
 home+0.121
ari+0.113
 shopping+0.108
Neg logit contributions
bee-0.064
 ow-0.054
 bleed-0.053
gging-0.052
 sting-0.051
Token will
Token ID481
Feature activation+3.08e-02

Pos logit contributions
 grandparents+0.064
 destination+0.060
 home+0.059
ari+0.057
 grandma+0.056
Neg logit contributions
bee-0.042
 sting-0.037
 ow-0.037
 bleed-0.036
 rust-0.034
Token be
Token ID307
Feature activation+7.51e-02

Pos logit contributions
 grandparents+0.074
ari+0.071
 destination+0.070
 home+0.067
 grandma+0.065
Neg logit contributions
bee-0.040
 sting-0.037
 ow-0.035
 bleed-0.035
 rust-0.033
Token very
Token ID845
Feature activation+6.65e-02

Pos logit contributions
ari+0.178
 grandparents+0.166
 destination+0.162
 doorstep+0.158
rolley+0.156
Neg logit contributions
 ow-0.095
bee-0.092
 slim-0.089
 sting-0.086
 buzzing-0.081
Token We
Token ID775
Feature activation+4.49e-02

Pos logit contributions
 grandparents+0.073
 destination+0.071
 home+0.069
ari+0.067
 grandma+0.064
Neg logit contributions
bee-0.042
 ow-0.037
 bleed-0.036
gging-0.035
 sting-0.034
Token'll
Token ID1183
Feature activation+2.88e-02

Pos logit contributions
 grandparents+0.102
 destination+0.094
ari+0.091
 grandma+0.090
 grandmother+0.086
Neg logit contributions
bee-0.064
 sting-0.058
 bleed-0.056
 seem-0.052
 ow-0.052
Token finish
Token ID5461
Feature activation+4.75e-02

Pos logit contributions
 grandparents+0.059
ari+0.057
 destination+0.055
 home+0.052
 online+0.052
Neg logit contributions
bee-0.045
 sting-0.043
 bleed-0.042
 ow-0.042
 spit-0.040
Token all
Token ID477
Feature activation+3.07e-02

Pos logit contributions
ari+0.103
 grandparents+0.098
 destination+0.094
 boarded+0.084
 cousin+0.084
Neg logit contributions
bee-0.072
 sting-0.065
 ow-0.065
gging-0.064
 bleed-0.062
Token our
Token ID674
Feature activation+1.15e-02

Pos logit contributions
 grandparents+0.071
 destination+0.068
ari+0.065
 grandmother+0.061
 backpack+0.061
Neg logit contributions
bee-0.042
 sensation-0.037
 sting-0.037
 ow-0.036
 mosquitoes-0.036
Token shopping
Token ID9735
Feature activation+1.23e-01

Pos logit contributions
 grandparents+0.022
 destination+0.021
ari+0.020
 home+0.020
 grandma+0.019
Neg logit contributions
bee-0.022
 ow-0.019
 sting-0.019
gging-0.019
 bleed-0.019
Token and
Token ID290
Feature activation+3.68e-02

Pos logit contributions
 grandparents+0.291
Kelly+0.275
Jessica+0.273
 destination+0.270
 grandmother+0.270
Neg logit contributions
.-0.087
 sounds-0.067
der-0.067
 himself-0.061
 is-0.060
Token get
Token ID651
Feature activation+4.62e-02

Pos logit contributions
ari+0.076
 grandparents+0.075
 destination+0.074
 home+0.067
 cousin+0.064
Neg logit contributions
bee-0.055
 bleed-0.053
 sting-0.052
 ow-0.052
 spit-0.051
Token a
Token ID257
Feature activation+5.86e-02

Pos logit contributions
ari+0.106
 grandparents+0.103
 destination+0.101
 driveway+0.087
ris+0.087
Neg logit contributions
 sting-0.058
 ow-0.058
 bleed-0.057
bee-0.055
 spit-0.055
Token huge
Token ID3236
Feature activation+2.14e-02

Pos logit contributions
 grandparents+0.125
ari+0.119
 destination+0.119
eline+0.111
packs+0.106
Neg logit contributions
 sting-0.083
bee-0.079
 ow-0.077
 spit-0.076
 slim-0.076
Token pizza
Token ID14256
Feature activation+9.00e-02

Pos logit contributions
 grandparents+0.039
 destination+0.036
ari+0.035
 home+0.033
 grandma+0.033
Neg logit contributions
bee-0.042
 ow-0.038
 sting-0.037
 sensation-0.037
 bleed-0.037
Token you
Token ID345
Feature activation+2.31e-02

Pos logit contributions
 grandparents+0.030
 destination+0.030
 home+0.029
ari+0.029
 grandma+0.027
Neg logit contributions
bee-0.022
 ow-0.019
 sting-0.018
gging-0.018
 bleed-0.018
Token get
Token ID651
Feature activation+5.72e-02

Pos logit contributions
 grandparents+0.061
 destination+0.058
ari+0.057
 home+0.055
 grandma+0.054
Neg logit contributions
bee-0.027
 ow-0.021
 sting-0.021
gging-0.021
 bleed-0.020
Token me
Token ID502
Feature activation+7.74e-02

Pos logit contributions
 grandparents+0.124
ari+0.122
 destination+0.118
packs+0.109
 driveway+0.109
Neg logit contributions
 ow-0.071
bee-0.069
 sting-0.068
 slim-0.068
 rust-0.067
Token for
Token ID329
Feature activation+7.15e-02

Pos logit contributions
 grandparents+0.164
 destination+0.151
ari+0.142
 grandmother+0.140
 driveway+0.135
Neg logit contributions
bee-0.105
 sting-0.099
 ow-0.098
 bleed-0.094
 pinch-0.094
Token my
Token ID616
Feature activation+5.03e-02

Pos logit contributions
ari+0.167
ris+0.154
suits+0.152
 destination+0.150
 grandparents+0.149
Neg logit contributions
 ow-0.094
bee-0.078
ze-0.070
gging-0.069
 sting-0.068
Token birthday
Token ID10955
Feature activation+1.23e-01

Pos logit contributions
ari+0.068
 destination+0.068
 grandparents+0.065
eline+0.059
 boarded+0.058
Neg logit contributions
bee-0.109
 ow-0.106
 sting-0.099
 spit-0.096
 sensation-0.095
Token?"
Token ID1701
Feature activation+2.71e-02

Pos logit contributions
 grandparents+0.207
 terminal+0.205
 boarded+0.202
 harbour+0.198
bara+0.195
Neg logit contributions
 ow-0.191
bee-0.158
 joke-0.154
 spit-0.153
 pinch-0.152
Token Her
Token ID2332
Feature activation+4.09e-02

Pos logit contributions
 destination+0.063
 grandparents+0.060
ari+0.059
 home+0.056
 online+0.054
Neg logit contributions
bee-0.036
 ow-0.031
gging-0.030
 sensation-0.029
 sting-0.028
Token mom
Token ID1995
Feature activation+1.42e-02

Pos logit contributions
ari+0.073
 destination+0.071
ris+0.063
 online+0.063
 harbour+0.062
Neg logit contributions
 ow-0.078
bee-0.074
 spit-0.070
 sting-0.067
 bleed-0.063
Token and
Token ID290
Feature activation+2.52e-02

Pos logit contributions
 grandparents+0.029
 destination+0.029
 home+0.027
ari+0.026
 grandma+0.025
Neg logit contributions
bee-0.025
 ow-0.021
gging-0.021
 sting-0.020
 bleed-0.020
Token dad
Token ID9955
Feature activation+4.01e-02

Pos logit contributions
 destination+0.029
ari+0.026
 grandparents+0.024
 home+0.024
 online+0.021
Neg logit contributions
bee-0.068
 ow-0.062
gging-0.061
 rust-0.058
 spit-0.058
Token put
Token ID1234
Feature activation+6.46e-02

Pos logit contributions
 grandparents+0.044
ari+0.043
 destination+0.041
 home+0.039
 online+0.038
Neg logit contributions
bee-0.038
 sting-0.035
 bleed-0.035
 ow-0.034
gging-0.033
Token them
Token ID606
Feature activation+8.73e-02

Pos logit contributions
ari+0.156
 grandparents+0.151
 destination+0.142
 doorstep+0.134
 boarded+0.133
Neg logit contributions
 ow-0.071
bee-0.067
 bleed-0.067
 sting-0.064
gging-0.063
Token back
Token ID736
Feature activation+8.86e-02

Pos logit contributions
 grandparents+0.204
ari+0.194
 destination+0.187
 harbour+0.175
 workshop+0.175
Neg logit contributions
bee-0.105
 bleed-0.104
 sting-0.091
ze-0.087
ling-0.084
Token on
Token ID319
Feature activation+8.28e-02

Pos logit contributions
 grandparents+0.189
 destination+0.180
ari+0.173
 workshop+0.159
 grandmother+0.159
Neg logit contributions
bee-0.125
 bleed-0.104
 ow-0.103
 sting-0.097
ling-0.097
Token the
Token ID262
Feature activation+4.72e-02

Pos logit contributions
ari+0.225
 grandparents+0.192
 boarded+0.189
 destination+0.183
ouses+0.177
Neg logit contributions
 ow-0.076
 mosquitoes-0.068
 sting-0.066
gging-0.066
bee-0.065
Token book
Token ID1492
Feature activation+1.23e-01

Pos logit contributions
 grandparents+0.084
ari+0.082
 destination+0.081
eline+0.077
 boarded+0.077
Neg logit contributions
 ow-0.086
bee-0.085
 sting-0.078
 sensation-0.078
 bleed-0.078
Tokencase
Token ID7442
Feature activation+1.17e-01

Pos logit contributions
 grandparents+0.277
 boarded+0.247
 cousins+0.242
ilia+0.235
 destination+0.234
Neg logit contributions
der-0.134
bee-0.122
 ow-0.118
hell-0.115
 sting-0.111
Token when
Token ID618
Feature activation+6.20e-02

Pos logit contributions
 grandparents+0.282
 boarded+0.249
ari+0.249
 destination+0.247
 harbour+0.243
Neg logit contributions
bee-0.124
der-0.098
 sting-0.097
 sensation-0.094
 ow-0.093
Token we
Token ID356
Feature activation+3.14e-02

Pos logit contributions
ari+0.144
 grandparents+0.131
 destination+0.128
ris+0.127
eline+0.124
Neg logit contributions
 ow-0.078
bee-0.072
 sting-0.070
 mosquitoes-0.069
 slim-0.067
Token're
Token ID821
Feature activation+7.15e-02

Pos logit contributions
 grandparents+0.071
 destination+0.068
ari+0.066
 grandma+0.063
 home+0.063
Neg logit contributions
bee-0.045
 bleed-0.043
 sting-0.042
 ow-0.039
 spit-0.039
Token done
Token ID1760
Feature activation+4.53e-02

Pos logit contributions
ari+0.134
 grandparents+0.132
 destination+0.119
 passports+0.107
ris+0.107
Neg logit contributions
 ow-0.122
 sting-0.121
 bleed-0.120
bee-0.117
 spit-0.117
TokenOkay
Token ID16454
Feature activation+1.62e-02

Pos logit contributions
bee+0.048
 ow+0.046
 bleed+0.045
 sting+0.044
gging+0.044
Neg logit contributions
 grandparents-0.048
ari-0.043
 destination-0.043
 home-0.042
 grandma-0.042
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
 grandparents+0.034
 destination+0.032
ari+0.031
 home+0.030
 grandma+0.029
Neg logit contributions
bee-0.027
 ow-0.024
gging-0.023
 sting-0.022
 bleed-0.022
Token let
Token ID1309
Feature activation+1.13e-02

Pos logit contributions
 grandparents+0.033
 destination+0.031
 home+0.030
ari+0.030
 grandma+0.029
Neg logit contributions
bee-0.026
 ow-0.022
 sting-0.021
gging-0.021
 bleed-0.021
Token's
Token ID338
Feature activation+2.26e-02

Pos logit contributions
 grandparents+0.024
 destination+0.024
ari+0.023
 home+0.023
 grandma+0.022
Neg logit contributions
bee-0.018
 ow-0.015
gging-0.015
 sting-0.015
 bleed-0.015
Token go
Token ID467
Feature activation+5.22e-02

Pos logit contributions
 grandparents+0.047
ari+0.046
 destination+0.045
 home+0.043
 grandma+0.042
Neg logit contributions
bee-0.035
 sting-0.031
 ow-0.031
 bleed-0.030
gging-0.030
Token shopping
Token ID9735
Feature activation+1.23e-01

Pos logit contributions
 grandparents+0.110
 destination+0.100
ari+0.095
 grandma+0.095
 cousin+0.093
Neg logit contributions
bee-0.079
 bleed-0.068
 sting-0.068
ching-0.068
ze-0.063
Token for
Token ID329
Feature activation+6.96e-02

Pos logit contributions
 grandparents+0.309
ari+0.301
 boarded+0.299
 grandmother+0.291
 cousin+0.290
Neg logit contributions
.-0.070
 sounds-0.051
!-0.043
 noise-0.041
 it-0.035
Token some
Token ID617
Feature activation+5.86e-02

Pos logit contributions
ari+0.185
 grandparents+0.164
 destination+0.164
 driveway+0.154
belt+0.153
Neg logit contributions
 ow-0.076
 th-0.056
gging-0.055
 slim-0.054
 mosquitoes-0.054
Token paint
Token ID7521
Feature activation+6.61e-02

Pos logit contributions
 destination+0.108
 grandparents+0.107
ari+0.102
 home+0.094
 harbour+0.093
Neg logit contributions
bee-0.097
 ow-0.088
 mosquitoes-0.087
 sting-0.082
 spit-0.081
Token and
Token ID290
Feature activation+4.88e-02

Pos logit contributions
 grandparents+0.148
 destination+0.137
 boarded+0.128
ari+0.127
 home+0.125
Neg logit contributions
bee-0.091
 ow-0.072
 rust-0.068
 sensation-0.068
ze-0.066
Token markers
Token ID19736
Feature activation+6.60e-02

Pos logit contributions
ari+0.096
 grandparents+0.096
 destination+0.093
 home+0.085
 cousin+0.081
Neg logit contributions
bee-0.073
 ow-0.066
 bleed-0.066
 sting-0.066
 rust-0.065
Token We
Token ID775
Feature activation+3.68e-02

Pos logit contributions
 destination+0.071
 grandparents+0.071
 home+0.069
ari+0.065
 grandma+0.063
Neg logit contributions
bee-0.033
 ow-0.028
 bleed-0.027
gging-0.026
 sensation-0.025
Token can
Token ID460
Feature activation+2.53e-02

Pos logit contributions
 grandparents+0.088
 destination+0.086
ari+0.081
 grandma+0.080
 home+0.077
Neg logit contributions
bee-0.049
 sting-0.041
 bleed-0.041
 ow-0.039
ching-0.038
Token go
Token ID467
Feature activation+7.97e-02

Pos logit contributions
 grandparents+0.049
ari+0.047
 destination+0.047
 home+0.046
 online+0.044
Neg logit contributions
bee-0.042
 bleed-0.038
 ow-0.038
 sting-0.037
gging-0.037
Token on
Token ID319
Feature activation+7.41e-02

Pos logit contributions
 grandparents+0.147
ari+0.131
 album+0.128
aughter+0.128
 grandson+0.127
Neg logit contributions
 bleed-0.114
 seem-0.104
 rust-0.102
 him-0.101
 sting-0.100
Token our
Token ID674
Feature activation+8.67e-02

Pos logit contributions
ari+0.186
eline+0.164
 grandparents+0.157
ouses+0.155
aughter+0.154
Neg logit contributions
 ow-0.083
 mosquitoes-0.070
 th-0.067
 bleed-0.066
ze-0.064
Token trip
Token ID5296
Feature activation+1.23e-01

Pos logit contributions
ari+0.138
eline+0.125
rolley+0.116
ris+0.115
 online+0.115
Neg logit contributions
 ow-0.173
 sting-0.154
bee-0.152
ze-0.150
 sounds-0.149
Token now
Token ID783
Feature activation+6.20e-02

Pos logit contributions
 grandparents+0.288
 granddaughter+0.264
 cousins+0.259
 boarded+0.258
ari+0.258
Neg logit contributions
pe-0.080
 him-0.077
le-0.077
.-0.076
ze-0.075
Token!"
Token ID2474
Feature activation-2.05e-02

Pos logit contributions
 grandparents+0.146
ari+0.137
 destination+0.131
 boarded+0.129
 workshop+0.126
Neg logit contributions
bee-0.079
 ow-0.073
gging-0.069
 sting-0.069
 bleed-0.066
Token Tim
Token ID5045
Feature activation+2.52e-05

Pos logit contributions
bee+0.045
 sting+0.039
 ow+0.038
gging+0.038
 bleed+0.038
Neg logit contributions
 grandparents-0.034
 home-0.033
 grandma-0.032
 destination-0.030
ari-0.030
Tokenmy
Token ID1820
Feature activation+1.54e-02

Pos logit contributions
 grandparents+0.000
 destination+0.000
 home+0.000
ari+0.000
 grandma+0.000
Neg logit contributions
bee-0.000
 ow-0.000
gging-0.000
 bleed-0.000
 sting-0.000
Token smiled
Token ID13541
Feature activation+2.22e-02

Pos logit contributions
 grandparents+0.038
 destination+0.037
 home+0.035
ari+0.035
 grandma+0.034
Neg logit contributions
bee-0.021
gging-0.017
 ow-0.017
 sting-0.016
 bleed-0.016
Token Now
Token ID2735
Feature activation+1.51e-02

Pos logit contributions
 grandparents+0.008
 destination+0.007
 home+0.007
ari+0.007
 grandma+0.007
Neg logit contributions
bee-0.006
 ow-0.005
gging-0.005
 bleed-0.005
 sting-0.005
Token let
Token ID1309
Feature activation+1.88e-02

Pos logit contributions
 grandparents+0.031
 destination+0.030
ari+0.029
 home+0.029
 online+0.027
Neg logit contributions
bee-0.025
 ow-0.022
 sting-0.022
gging-0.021
 bleed-0.021
Token's
Token ID338
Feature activation+1.14e-02

Pos logit contributions
 grandparents+0.042
 destination+0.041
ari+0.040
 home+0.038
 online+0.038
Neg logit contributions
bee-0.027
 ow-0.024
 sting-0.023
gging-0.022
 bleed-0.022
Token finish
Token ID5461
Feature activation+4.74e-02

Pos logit contributions
 grandparents+0.024
 destination+0.023
ari+0.022
 home+0.022
 grandma+0.021
Neg logit contributions
bee-0.019
 ow-0.016
 bleed-0.016
 sting-0.016
gging-0.016
Token our
Token ID674
Feature activation+4.72e-02

Pos logit contributions
ari+0.100
 grandparents+0.090
 destination+0.088
 boarded+0.081
 home+0.076
Neg logit contributions
bee-0.074
gging-0.069
 bleed-0.068
 sting-0.068
 ow-0.067
Token shopping
Token ID9735
Feature activation+1.22e-01

Pos logit contributions
ari+0.076
 grandparents+0.073
eline+0.071
 destination+0.070
 harbour+0.066
Neg logit contributions
 ow-0.090
 sting-0.085
bee-0.085
 bleed-0.085
 hurts-0.082
Token and
Token ID290
Feature activation+4.86e-02

Pos logit contributions
enna+0.302
Kelly+0.301
 grandparents+0.299
 boarded+0.292
 harbour+0.290
Neg logit contributions
.-0.071
 sounds-0.059
 noise-0.049
 is-0.041
 noises-0.039
Token then
Token ID788
Feature activation+3.27e-02

Pos logit contributions
ari+0.106
 grandparents+0.101
 destination+0.101
 home+0.088
ris+0.087
Neg logit contributions
 bleed-0.066
 sting-0.062
 spit-0.062
bee-0.062
 ow-0.058
Token we
Token ID356
Feature activation+4.52e-02

Pos logit contributions
 grandparents+0.070
ari+0.068
 destination+0.067
 home+0.062
 grandma+0.059
Neg logit contributions
bee-0.047
 sting-0.044
 bleed-0.043
 ow-0.041
gging-0.040
Token can
Token ID460
Feature activation+2.24e-02

Pos logit contributions
 grandparents+0.114
 destination+0.109
ari+0.105
 grandma+0.104
 home+0.100
Neg logit contributions
bee-0.048
 sting-0.043
 bleed-0.043
ching-0.038
 spit-0.038
Token go
Token ID467
Feature activation+9.12e-02

Pos logit contributions
 grandparents+0.045
ari+0.043
 destination+0.042
 home+0.041
 online+0.040
Neg logit contributions
bee-0.036
 bleed-0.032
 sting-0.032
 ow-0.032
gging-0.031
Token that
Token ID326
Feature activation-4.98e-02

Pos logit contributions
bee+0.090
 bleed+0.080
gging+0.080
 sensation+0.076
its+0.074
Neg logit contributions
 home-0.081
 grandma-0.077
 grandparents-0.076
 destination-0.072
 online-0.065
Token.
Token ID13
Feature activation-2.51e-02

Pos logit contributions
bee+0.100
gging+0.096
 bleed+0.092
 ow+0.086
its+0.086
Neg logit contributions
 home-0.085
 destination-0.072
 grandma-0.071
 grandparents-0.066
 online-0.064
Token It
Token ID632
Feature activation-7.42e-02

Pos logit contributions
bee+0.043
gging+0.033
 bleed+0.033
 sting+0.032
 ow+0.032
Neg logit contributions
 destination-0.053
 home-0.053
 grandparents-0.053
 grandma-0.051
ari-0.051
Token made
Token ID925
Feature activation-5.07e-02

Pos logit contributions
gging+0.149
 conclusions+0.145
ickets+0.141
bee+0.139
hing+0.138
Neg logit contributions
 home-0.098
 bought-0.093
 packed-0.092
 grandma-0.091
 destination-0.090
Token a
Token ID257
Feature activation-7.49e-02

Pos logit contributions
bee+0.127
gging+0.111
bees+0.105
pillar+0.100
 bleed+0.100
Neg logit contributions
 home-0.079
 grandma-0.070
 destination-0.065
 grandparents-0.062
 Grand-0.055
Token loud
Token ID7812
Feature activation-1.39e-01

Pos logit contributions
bee+0.149
gging+0.140
ez+0.133
hing+0.129
bees+0.123
Neg logit contributions
 home-0.136
 destination-0.130
 grandparents-0.117
 grandma-0.115
 list-0.110
Token,
Token ID11
Feature activation-5.65e-02

Pos logit contributions
pill+0.256
Nice+0.254
Fine+0.229
Buzz+0.226
izards+0.226
Neg logit contributions
 home-0.218
 door-0.181
 room-0.163
 for-0.159
 grandma-0.158
Token angry
Token ID7954
Feature activation-7.86e-02

Pos logit contributions
bee+0.113
gging+0.101
ickets+0.098
its+0.098
hing+0.094
Neg logit contributions
 home-0.093
 grandma-0.090
 grandparents-0.084
 destination-0.084
 online-0.082
Token noise
Token ID7838
Feature activation-5.77e-02

Pos logit contributions
pill+0.150
ching+0.140
hing+0.139
ched+0.136
ez+0.135
Neg logit contributions
 home-0.147
 grandma-0.121
 dad-0.113
 helper-0.107
 destination-0.103
Token and
Token ID290
Feature activation-5.23e-02

Pos logit contributions
bee+0.117
gging+0.112
 bleed+0.109
 gag+0.107
its+0.107
Neg logit contributions
 home-0.083
 grandma-0.076
 grandparents-0.075
 online-0.074
 destination-0.074
Token sprayed
Token ID35729
Feature activation-6.86e-02

Pos logit contributions
bee+0.098
gging+0.088
ity+0.080
its+0.079
ickets+0.079
Neg logit contributions
 home-0.093
 grandma-0.088
 destination-0.087
 grandparents-0.086
 online-0.081
Token But
Token ID887
Feature activation-9.75e-03

Pos logit contributions
bee+0.044
 bleed+0.035
 ow+0.035
gging+0.035
 sensation+0.035
Neg logit contributions
 grandparents-0.047
 home-0.047
 grandma-0.047
 destination-0.046
ari-0.042
Token then
Token ID788
Feature activation-2.20e-02

Pos logit contributions
bee+0.019
gging+0.017
 bleed+0.017
 ow+0.016
 sensation+0.016
Neg logit contributions
 grandparents-0.017
 home-0.017
 destination-0.017
 grandma-0.016
ari-0.015
Token she
Token ID673
Feature activation-3.68e-02

Pos logit contributions
bee+0.048
 sensation+0.043
gging+0.042
 bleed+0.042
its+0.040
Neg logit contributions
 grandparents-0.036
 home-0.035
 grandma-0.035
 destination-0.034
ari-0.030
Token hears
Token ID23180
Feature activation-4.51e-02

Pos logit contributions
bee+0.077
 sensation+0.073
gging+0.070
 bleed+0.067
hing+0.066
Neg logit contributions
 grandparents-0.060
 destination-0.059
 grandma-0.056
 home-0.055
 backpack-0.052
Token a
Token ID257
Feature activation-9.54e-02

Pos logit contributions
bee+0.111
 bleed+0.098
gging+0.097
 sensation+0.095
its+0.094
Neg logit contributions
 grandma-0.066
 grandparents-0.062
 home-0.061
 Grand-0.055
 grandmother-0.052
Token loud
Token ID7812
Feature activation-1.36e-01

Pos logit contributions
ez+0.201
bee+0.193
gging+0.185
ched+0.180
ickets+0.177
Neg logit contributions
 home-0.124
 door-0.121
 grand-0.115
 list-0.115
 grandma-0.114
Token buzz
Token ID14713
Feature activation-8.68e-02

Pos logit contributions
pill+0.215
 conclusions+0.211
bee+0.203
Fine+0.201
ickets+0.200
Neg logit contributions
 home-0.228
 grandma-0.191
 door-0.180
 Grand-0.173
 downstairs-0.171
Token.
Token ID13
Feature activation-3.87e-02

Pos logit contributions
gging+0.189
bee+0.186
 ow+0.181
 disl+0.179
 gag+0.179
Neg logit contributions
 home-0.101
 downstairs-0.083
ari-0.079
 online-0.078
 downtown-0.073
Token She
Token ID1375
Feature activation-6.81e-02

Pos logit contributions
bee+0.072
gging+0.057
 bleed+0.056
py+0.055
hing+0.054
Neg logit contributions
 grandma-0.075
 home-0.075
 grandparents-0.073
 Grand-0.070
 destination-0.069
Token notices
Token ID19748
Feature activation-5.54e-02

Pos logit contributions
 sensation+0.135
bee+0.134
gging+0.118
hing+0.115
pe+0.113
Neg logit contributions
 destination-0.111
 grandparents-0.106
 grandma-0.102
 home-0.097
 backpack-0.092
Token a
Token ID257
Feature activation-4.49e-02

Pos logit contributions
bee+0.136
 bleed+0.127
 sensation+0.123
gging+0.120
its+0.119
Neg logit contributions
 grandma-0.076
 grandparents-0.070
 home-0.069
 Grand-0.062
 destination-0.057
Token when
Token ID618
Feature activation-3.72e-02

Pos logit contributions
bee+0.050
 ow+0.045
 bleed+0.044
 sting+0.043
gging+0.042
Neg logit contributions
 home-0.039
 grandparents-0.036
 destination-0.036
ari-0.033
 grandma-0.033
Token suddenly
Token ID6451
Feature activation-4.86e-02

Pos logit contributions
bee+0.070
gging+0.059
its+0.059
 bleed+0.057
ez+0.055
Neg logit contributions
 home-0.076
 grandparents-0.073
 destination-0.072
 grandma-0.071
 online-0.062
Token he
Token ID339
Feature activation-1.94e-02

Pos logit contributions
bee+0.109
its+0.104
 bleed+0.101
gging+0.099
 sensation+0.097
Neg logit contributions
 home-0.073
 grandma-0.065
 destination-0.065
 grandparents-0.064
 backpack-0.055
Token heard
Token ID2982
Feature activation-5.04e-02

Pos logit contributions
bee+0.038
gging+0.034
 bleed+0.034
 sensation+0.034
 ow+0.032
Neg logit contributions
 grandparents-0.034
 destination-0.033
 home-0.033
ari-0.030
 grandma-0.030
Token a
Token ID257
Feature activation-8.08e-02

Pos logit contributions
bee+0.132
gging+0.115
 bleed+0.115
its+0.113
 sting+0.111
Neg logit contributions
 grandma-0.063
 home-0.062
 grandparents-0.060
 destination-0.051
 Grand-0.049
Token loud
Token ID7812
Feature activation-1.36e-01

Pos logit contributions
bee+0.161
ez+0.154
gging+0.148
ped+0.141
ched+0.135
Neg logit contributions
 grandparents-0.131
 grandma-0.125
 home-0.125
 destination-0.122
 grand-0.120
Token roar
Token ID34532
Feature activation-6.96e-02

Pos logit contributions
bee+0.211
Ble+0.205
 conclusions+0.202
pill+0.199
its+0.199
Neg logit contributions
 home-0.236
 grandma-0.201
 station-0.197
 door-0.194
 destination-0.185
Token.
Token ID13
Feature activation-2.84e-02

Pos logit contributions
bee+0.156
gging+0.149
 bleed+0.142
 ow+0.140
 gag+0.138
Neg logit contributions
 home-0.093
 destination-0.085
ari-0.081
 grandparents-0.080
 downstairs-0.077
Token He
Token ID679
Feature activation-3.53e-02

Pos logit contributions
bee+0.051
 bleed+0.041
 ow+0.040
 sting+0.040
gging+0.040
Neg logit contributions
 grandparents-0.058
 home-0.057
 grandma-0.056
 destination-0.055
ari-0.054
Token was
Token ID373
Feature activation-1.71e-02

Pos logit contributions
bee+0.068
 sensation+0.064
gging+0.063
 bleed+0.062
 mosquitoes+0.059
Neg logit contributions
 grandparents-0.061
 destination-0.058
 home-0.056
 grandma-0.054
ari-0.054
Token scared
Token ID12008
Feature activation-3.42e-02

Pos logit contributions
bee+0.030
gging+0.025
 sensation+0.024
 ow+0.024
 sting+0.023
Neg logit contributions
 grandparents-0.035
 home-0.035
 destination-0.034
 grandma-0.033
ari-0.031
Token eyes
Token ID2951
Feature activation-2.60e-02

Pos logit contributions
bee+0.022
gging+0.019
 ow+0.018
 sensation+0.018
 bleed+0.018
Neg logit contributions
 grandparents-0.023
 destination-0.022
 home-0.022
 grandma-0.020
ari-0.020
Token.
Token ID13
Feature activation-9.13e-03

Pos logit contributions
bee+0.050
gging+0.043
 ow+0.042
 bleed+0.042
 sensation+0.041
Neg logit contributions
 home-0.048
 destination-0.047
 grandparents-0.047
ari-0.043
 grandma-0.043
Token It
Token ID632
Feature activation-5.15e-02

Pos logit contributions
bee+0.014
gging+0.011
 bleed+0.011
 ow+0.011
 sensation+0.011
Neg logit contributions
 grandparents-0.021
 destination-0.020
 home-0.020
 grandma-0.020
ari-0.019
Token made
Token ID925
Feature activation-3.64e-02

Pos logit contributions
gging+0.100
bee+0.096
 conclusions+0.091
its+0.089
hing+0.087
Neg logit contributions
 grandparents-0.079
 grandma-0.078
 home-0.076
 destination-0.076
 boarded-0.074
Token a
Token ID257
Feature activation-5.89e-02

Pos logit contributions
bee+0.082
gging+0.073
 bleed+0.068
 sensation+0.068
bees+0.066
Neg logit contributions
 home-0.059
 grandma-0.055
 grandparents-0.055
 destination-0.052
 Grand-0.046
Token loud
Token ID7812
Feature activation-1.36e-01

Pos logit contributions
bee+0.114
gging+0.110
ez+0.101
hing+0.093
bees+0.092
Neg logit contributions
 home-0.107
 destination-0.105
 grandparents-0.103
 grandma-0.098
 list-0.090
Token sound
Token ID2128
Feature activation-7.21e-02

Pos logit contributions
pill+0.235
Nice+0.218
Fine+0.217
If+0.216
gging+0.211
Neg logit contributions
 home-0.217
 door-0.183
 grandma-0.176
 room-0.166
 window-0.161
Token.
Token ID13
Feature activation-9.71e-03

Pos logit contributions
bee+0.166
gging+0.162
pill+0.161
Buzz+0.157
its+0.152
Neg logit contributions
 home-0.087
 grandma-0.071
 downstairs-0.071
 online-0.068
 grandparents-0.062
Token It
Token ID632
Feature activation-6.56e-02

Pos logit contributions
bee+0.016
gging+0.013
 bleed+0.013
 ow+0.013
 sensation+0.013
Neg logit contributions
 grandparents-0.021
 destination-0.020
 home-0.020
 grandma-0.020
ari-0.019
Token stood
Token ID6204
Feature activation-1.41e-02

Pos logit contributions
gging+0.125
bee+0.120
 conclusions+0.112
its+0.107
oles+0.105
Neg logit contributions
 grandma-0.097
 grandparents-0.096
 boarded-0.095
 packed-0.094
 home-0.093
Token up
Token ID510
Feature activation-9.24e-03

Pos logit contributions
bee+0.032
gging+0.028
 ow+0.028
 bleed+0.027
 sting+0.027
Neg logit contributions
 home-0.022
 grandparents-0.021
 destination-0.021
 grandma-0.020
ari-0.018
Token hurt
Token ID5938
Feature activation-2.89e-02

Pos logit contributions
bee+0.037
gging+0.031
 sensation+0.030
bees+0.029
 ow+0.029
Neg logit contributions
 home-0.040
 grandparents-0.037
 destination-0.037
 grandma-0.036
 online-0.033
Token.
Token ID13
Feature activation+1.01e-03

Pos logit contributions
bee+0.061
gging+0.052
 sensation+0.052
 sting+0.050
 ow+0.050
Neg logit contributions
 home-0.052
 grandparents-0.045
 grandma-0.045
 destination-0.043
ari-0.041
Token It
Token ID632
Feature activation-8.20e-02

Pos logit contributions
 grandparents+0.002
 destination+0.002
 home+0.002
ari+0.002
 grandma+0.002
Neg logit contributions
bee-0.001
 ow-0.001
gging-0.001
 bleed-0.001
 sting-0.001
Token makes
Token ID1838
Feature activation-2.75e-02

Pos logit contributions
bee+0.156
gging+0.143
bees+0.132
 sensation+0.129
ity+0.127
Neg logit contributions
 home-0.144
 grandma-0.137
 destination-0.131
 grandparents-0.127
 backpack-0.127
Token a
Token ID257
Feature activation-6.21e-02

Pos logit contributions
bee+0.065
gging+0.055
 sensation+0.053
bees+0.052
 sting+0.050
Neg logit contributions
 home-0.045
 grandma-0.043
 grandparents-0.042
 destination-0.041
 cousin-0.035
Token loud
Token ID7812
Feature activation-1.35e-01

Pos logit contributions
bee+0.126
gging+0.113
bees+0.104
ez+0.103
hing+0.099
Neg logit contributions
 home-0.116
 destination-0.106
 grandma-0.105
 grandparents-0.103
 backpack-0.089
Token noise
Token ID7838
Feature activation-5.19e-02

Pos logit contributions
pill+0.249
bee+0.233
izards+0.222
bees+0.220
oles+0.215
Neg logit contributions
 home-0.210
 grandma-0.174
 door-0.165
 room-0.153
 back-0.149
Token.
Token ID13
Feature activation+6.74e-03

Pos logit contributions
bee+0.119
gging+0.108
 sensation+0.105
pill+0.102
 gag+0.101
Neg logit contributions
 home-0.073
 grandma-0.065
 downstairs-0.062
 grandparents-0.060
 online-0.059
Token The
Token ID383
Feature activation-2.90e-02

Pos logit contributions
 grandparents+0.016
 destination+0.015
 home+0.015
ari+0.015
 grandma+0.014
Neg logit contributions
bee-0.010
 ow-0.008
gging-0.008
 bleed-0.008
 sting-0.008
Token other
Token ID584
Feature activation-2.51e-02

Pos logit contributions
bee+0.054
ez+0.047
gging+0.047
 bleed+0.046
 sensation+0.045
Neg logit contributions
 home-0.054
 grandparents-0.053
 grandma-0.051
 destination-0.051
ari-0.047
Token pige
Token ID32372
Feature activation-3.62e-02

Pos logit contributions
bee+0.046
 bleed+0.042
 sensation+0.040
gging+0.040
ez+0.039
Neg logit contributions
 home-0.047
 grandparents-0.046
 destination-0.045
 grandma-0.045
ari-0.043
Token that
Token ID326
Feature activation-5.01e-02

Pos logit contributions
bee+0.093
gging+0.083
 bleed+0.082
 sensation+0.078
hing+0.077
Neg logit contributions
 home-0.076
 grandma-0.072
 grandparents-0.070
 destination-0.066
 online-0.060
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
bee+0.106
gging+0.099
 bleed+0.096
hing+0.090
ickets+0.090
Neg logit contributions
 home-0.081
 destination-0.070
 grandma-0.068
 online-0.062
 grandparents-0.061
Token It
Token ID632
Feature activation-7.63e-02

Pos logit contributions
bee+0.034
 ow+0.026
 bleed+0.026
gging+0.026
 rust+0.026
Neg logit contributions
 grandparents-0.044
 destination-0.043
 grandma-0.043
 home-0.043
ari-0.040
Token made
Token ID925
Feature activation-5.08e-02

Pos logit contributions
gging+0.157
hing+0.149
bee+0.148
 conclusions+0.146
ickets+0.146
Neg logit contributions
 home-0.096
 bought-0.095
 packed-0.093
 grandma-0.092
 boarded-0.089
Token a
Token ID257
Feature activation-7.16e-02

Pos logit contributions
bee+0.128
gging+0.112
bees+0.106
 sensation+0.104
 bleed+0.103
Neg logit contributions
 home-0.073
 grandma-0.071
 grandparents-0.064
 destination-0.064
 Grand-0.055
Token loud
Token ID7812
Feature activation-1.35e-01

Pos logit contributions
bee+0.151
gging+0.141
ez+0.134
hing+0.131
bees+0.125
Neg logit contributions
 home-0.125
 destination-0.119
 grandma-0.108
 grandparents-0.107
 list-0.101
Token noise
Token ID7838
Feature activation-5.61e-02

Pos logit contributions
pill+0.236
Nice+0.234
bee+0.217
Fine+0.214
If+0.208
Neg logit contributions
 home-0.201
 door-0.177
 room-0.167
 grandma-0.160
 for-0.155
Token and
Token ID290
Feature activation-6.21e-02

Pos logit contributions
bee+0.129
gging+0.119
 gag+0.117
 bleed+0.114
its+0.114
Neg logit contributions
 home-0.072
 online-0.064
 grandma-0.064
 downstairs-0.060
 grandparents-0.059
Token flew
Token ID13112
Feature activation-2.05e-02

Pos logit contributions
bee+0.123
gging+0.111
hing+0.101
ickets+0.100
its+0.098
Neg logit contributions
 grandma-0.102
 home-0.101
 grandparents-0.096
 destination-0.093
 Grand-0.091
Token out
Token ID503
Feature activation-3.24e-02

Pos logit contributions
bee+0.055
gging+0.050
 ow+0.050
 bleed+0.047
 sensation+0.047
Neg logit contributions
 home-0.025
 destination-0.022
 grandparents-0.020
 grandma-0.019
 online-0.019
Token of
Token ID286
Feature activation+2.71e-04

Pos logit contributions
bee+0.079
gging+0.074
 ow+0.068
 bleed+0.065
 sensation+0.065
Neg logit contributions
 home-0.045
 grandparents-0.040
 grandma-0.039
 destination-0.039
 online-0.037
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
bee+0.087
 bleed+0.077
gging+0.076
 sting+0.074
bees+0.074
Neg logit contributions
 home-0.051
 grandma-0.046
 destination-0.044
 grandparents-0.043
 Grand-0.038
Token hive
Token ID35881
Feature activation-3.26e-02

Pos logit contributions
bee+0.020
gging+0.017
 bleed+0.016
 sensation+0.016
ez+0.016
Neg logit contributions
 grandparents-0.036
 home-0.036
 destination-0.035
 grandma-0.034
ari-0.033
Token and
Token ID290
Feature activation-1.51e-02

Pos logit contributions
bee+0.071
gging+0.063
 ow+0.062
 bleed+0.057
ez+0.056
Neg logit contributions
 home-0.053
 destination-0.051
 grandparents-0.047
 online-0.045
 grandma-0.045
Token made
Token ID925
Feature activation-4.71e-02

Pos logit contributions
bee+0.029
gging+0.026
 ow+0.024
 bleed+0.024
 sensation+0.024
Neg logit contributions
 grandparents-0.027
 home-0.027
 destination-0.026
 grandma-0.026
 online-0.024
Token a
Token ID257
Feature activation-5.01e-02

Pos logit contributions
bee+0.115
gging+0.103
 bleed+0.095
 sensation+0.094
bees+0.092
Neg logit contributions
 home-0.069
 grandma-0.065
 grandparents-0.062
 destination-0.058
 presents-0.052
Token loud
Token ID7812
Feature activation-1.34e-01

Pos logit contributions
bee+0.099
gging+0.090
ez+0.082
hing+0.076
bees+0.076
Neg logit contributions
 home-0.094
 destination-0.090
 grandparents-0.089
 grandma-0.086
 list-0.077
Token noise
Token ID7838
Feature activation-5.94e-02

Pos logit contributions
pill+0.233
bee+0.231
Nice+0.227
gging+0.213
izards+0.213
Neg logit contributions
 home-0.209
 door-0.165
 grandma-0.156
 room-0.155
 for-0.148
Token.
Token ID13
Feature activation+8.39e-05

Pos logit contributions
bee+0.135
gging+0.125
pill+0.115
 thorn+0.114
Buzz+0.114
Neg logit contributions
 home-0.080
 grandma-0.071
 downstairs-0.069
 online-0.068
 grandparents-0.068
Token The
Token ID383
Feature activation-3.27e-02

Pos logit contributions
 grandparents+0.000
 destination+0.000
 home+0.000
ari+0.000
 grandma+0.000
Neg logit contributions
bee-0.000
 ow-0.000
gging-0.000
 bleed-0.000
 sting-0.000
Token bees
Token ID17002
Feature activation-6.57e-02

Pos logit contributions
bee+0.039
gging+0.036
ez+0.033
bery+0.030
hing+0.030
Neg logit contributions
 home-0.083
 grandparents-0.079
 destination-0.077
 grandma-0.076
 helper-0.072
Token came
Token ID1625
Feature activation-3.66e-02

Pos logit contributions
bee+0.119
gging+0.105
 sensation+0.103
 gag+0.101
ity+0.100
Neg logit contributions
 home-0.106
 destination-0.100
 boarded-0.097
 grandparents-0.095
 online-0.090
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
bee+0.120
gging+0.105
 ow+0.103
 sensation+0.100
bees+0.097
Neg logit contributions
 home-0.090
 destination-0.081
 online-0.076
 grandma-0.073
 grandparents-0.072
Token The
Token ID383
Feature activation-1.58e-02

Pos logit contributions
bee+0.044
 bleed+0.037
gging+0.036
 sensation+0.036
 ow+0.035
Neg logit contributions
 grandparents-0.038
 grandma-0.037
 home-0.037
 destination-0.036
 online-0.033
Token bicycle
Token ID17026
Feature activation-5.58e-02

Pos logit contributions
bee+0.035
gging+0.031
 ow+0.030
 sting+0.029
 sensation+0.029
Neg logit contributions
 grandparents-0.026
 home-0.025
 destination-0.024
 grandma-0.023
 backpack-0.021
Token makes
Token ID1838
Feature activation-5.71e-02

Pos logit contributions
bee+0.113
gging+0.097
 sensation+0.096
bees+0.091
 ow+0.086
Neg logit contributions
 destination-0.098
 grandma-0.091
 home-0.091
 grandparents-0.087
 online-0.085
Token a
Token ID257
Feature activation-6.78e-02

Pos logit contributions
bee+0.145
gging+0.129
bees+0.119
pe+0.119
oles+0.115
Neg logit contributions
 home-0.080
 grandma-0.077
 grandparents-0.067
 destination-0.065
 presents-0.059
Token loud
Token ID7812
Feature activation-1.34e-01

Pos logit contributions
bee+0.136
gging+0.123
bees+0.113
ez+0.112
py+0.108
Neg logit contributions
 home-0.119
 destination-0.116
 grandma-0.113
 grandparents-0.110
 list-0.100
Token noise
Token ID7838
Feature activation-6.17e-02

Pos logit contributions
pill+0.233
bee+0.228
oles+0.226
izards+0.220
bees+0.219
Neg logit contributions
 home-0.206
 grandma-0.175
 door-0.170
 room-0.163
 back-0.159
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
bee+0.141
gging+0.130
Buzz+0.127
 gag+0.124
pill+0.124
Neg logit contributions
 home-0.078
 grandma-0.072
 online-0.069
 downstairs-0.068
 grandparents-0.063
Token Ben
Token ID3932
Feature activation-6.13e-02

Pos logit contributions
bee+0.038
 bleed+0.031
gging+0.031
 sensation+0.030
 rust+0.030
Neg logit contributions
 grandparents-0.037
 grandma-0.036
 home-0.035
 destination-0.035
ari-0.033
Token falls
Token ID8953
Feature activation-9.11e-03

Pos logit contributions
bee+0.114
 sensation+0.106
bees+0.093
ity+0.092
Buzz+0.089
Neg logit contributions
 destination-0.104
 grandma-0.104
 grandparents-0.102
 home-0.101
 backpack-0.097
Token off
Token ID572
Feature activation-3.19e-03

Pos logit contributions
bee+0.021
gging+0.019
 ow+0.019
 sting+0.018
 sensation+0.018
Neg logit contributions
 grandparents-0.013
 home-0.013
 destination-0.013
 grandma-0.012
ari-0.011
Token jumped
Token ID11687
Feature activation-2.77e-02

Pos logit contributions
 conclusions+0.152
gging+0.150
bee+0.140
ickets+0.139
hing+0.137
Neg logit contributions
 home-0.098
 packed-0.095
 bought-0.093
 grandma-0.088
 boarded-0.087
Token away
Token ID1497
Feature activation-2.20e-02

Pos logit contributions
bee+0.069
gging+0.062
 ow+0.060
 sensation+0.060
 bleed+0.059
Neg logit contributions
 home-0.038
 destination-0.033
 online-0.029
 grandparents-0.029
 grandma-0.027
Token and
Token ID290
Feature activation-3.27e-02

Pos logit contributions
bee+0.047
gging+0.042
 ow+0.041
ez+0.040
 bleed+0.039
Neg logit contributions
 home-0.037
 destination-0.034
 grandparents-0.033
 grandma-0.031
 online-0.030
Token made
Token ID925
Feature activation-4.31e-02

Pos logit contributions
bee+0.062
gging+0.055
its+0.052
 sensation+0.050
ity+0.050
Neg logit contributions
 home-0.060
 grandparents-0.055
 destination-0.055
 grandma-0.055
 online-0.051
Token a
Token ID257
Feature activation-5.33e-02

Pos logit contributions
bee+0.110
gging+0.096
pe+0.091
bees+0.090
 sensation+0.089
Neg logit contributions
 home-0.063
 grandma-0.058
 grandparents-0.053
 destination-0.049
 cousin-0.044
Token loud
Token ID7812
Feature activation-1.34e-01

Pos logit contributions
bee+0.112
gging+0.102
ez+0.096
bees+0.091
hing+0.091
Neg logit contributions
 home-0.097
 destination-0.092
 grandparents-0.086
 grandma-0.083
 list-0.077
Token cro
Token ID6763
Feature activation-2.72e-02

Pos logit contributions
pill+0.224
Nice+0.223
bee+0.215
If+0.214
gging+0.212
Neg logit contributions
 home-0.221
 door-0.168
 party-0.162
 room-0.160
 bed-0.157
Tokenak
Token ID461
Feature activation-7.88e-02

Pos logit contributions
bee+0.061
 ow+0.058
gging+0.055
 bleed+0.054
 sensation+0.053
Neg logit contributions
ari-0.039
 home-0.038
 grandparents-0.036
 destination-0.035
 online-0.033
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
bee+0.182
gging+0.171
 ow+0.158
Buzz+0.151
 mosquitoes+0.151
Neg logit contributions
 home-0.121
 destination-0.100
 online-0.087
ari-0.084
 grandma-0.083
Token\n
Token ID198
Feature activation+7.58e-03

Pos logit contributions
 grandparents+0.027
 destination+0.027
 home+0.025
ari+0.025
 grandma+0.023
Neg logit contributions
bee-0.022
 ow-0.020
 sting-0.019
gging-0.019
 bleed-0.019
Token\n
Token ID198
Feature activation-7.02e-03

Pos logit contributions
 grandparents+0.015
 destination+0.015
 home+0.014
ari+0.014
 grandma+0.014
Neg logit contributions
bee-0.013
 ow-0.011
gging-0.011
 sting-0.011
 bleed-0.011
Token eating
Token ID6600
Feature activation-2.29e-02

Pos logit contributions
 grandparents+0.018
 destination+0.017
ari+0.016
 home+0.016
 grandma+0.016
Neg logit contributions
bee-0.017
gging-0.015
 ow-0.014
 sting-0.014
 bleed-0.014
Token,
Token ID11
Feature activation-7.99e-03

Pos logit contributions
bee+0.051
gging+0.045
 bleed+0.044
 ow+0.044
its+0.043
Neg logit contributions
 home-0.037
 destination-0.034
 grandparents-0.033
 grandma-0.032
 online-0.031
Token he
Token ID339
Feature activation-2.03e-03

Pos logit contributions
bee+0.014
gging+0.012
 bleed+0.012
 ow+0.012
 sting+0.011
Neg logit contributions
 grandparents-0.017
 destination-0.016
 home-0.016
 grandma-0.015
ari-0.015
Token heard
Token ID2982
Feature activation-2.70e-02

Pos logit contributions
bee+0.004
gging+0.003
 ow+0.003
 bleed+0.003
 sting+0.003
Neg logit contributions
 grandparents-0.004
 destination-0.004
 home-0.004
ari-0.003
 grandma-0.003
Token a
Token ID257
Feature activation-7.74e-02

Pos logit contributions
bee+0.061
 bleed+0.054
gging+0.053
 sting+0.052
 sensation+0.052
Neg logit contributions
 grandparents-0.043
 grandma-0.041
 home-0.041
 destination-0.039
ari-0.036
Token loud
Token ID7812
Feature activation-1.34e-01

Pos logit contributions
bee+0.154
ez+0.150
gging+0.144
ped+0.133
ched+0.133
Neg logit contributions
 grandparents-0.118
 home-0.117
 destination-0.114
 grandma-0.113
 grand-0.109
Token noise
Token ID7838
Feature activation-6.75e-02

Pos logit contributions
bee+0.200
pill+0.197
bery+0.196
 conclusions+0.194
its+0.194
Neg logit contributions
 home-0.229
 grandma-0.194
 station-0.187
 door-0.184
 party-0.177
Token.
Token ID13
Feature activation+7.37e-03

Pos logit contributions
bee+0.152
gging+0.141
 gag+0.138
its+0.137
 bleed+0.136
Neg logit contributions
 home-0.084
 downstairs-0.082
 online-0.081
 grandparents-0.077
 grandma-0.075
Token It
Token ID632
Feature activation-5.22e-02

Pos logit contributions
 grandparents+0.019
 destination+0.018
 home+0.018
ari+0.017
 grandma+0.017
Neg logit contributions
bee-0.009
gging-0.007
 ow-0.007
 bleed-0.007
 sting-0.007
Token was
Token ID373
Feature activation-2.35e-02

Pos logit contributions
bee+0.101
gging+0.095
 conclusions+0.092
 bleed+0.086
 sensation+0.085
Neg logit contributions
 destination-0.085
 grandparents-0.083
 home-0.082
 grandma-0.078
 mailed-0.073
Token a
Token ID257
Feature activation-1.50e-02

Pos logit contributions
bee+0.051
gging+0.044
its+0.043
 bleed+0.042
 sting+0.041
Neg logit contributions
 grandparents-0.039
 home-0.039
 grandma-0.038
 destination-0.036
 online-0.033
Token skull
Token ID14511
Feature activation-5.17e-02

Pos logit contributions
gging+0.070
bee+0.066
its+0.058
ez+0.058
 ow+0.056
Neg logit contributions
 home-0.087
 backpack-0.080
 grandparents-0.080
 grandma-0.078
 destination-0.077
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
bee+0.115
gging+0.107
 ow+0.102
 bleed+0.097
 sensation+0.097
Neg logit contributions
 home-0.078
 destination-0.069
 online-0.066
 grandparents-0.063
 backpack-0.063
Token He
Token ID679
Feature activation-3.72e-02

Pos logit contributions
bee+0.031
gging+0.024
 ow+0.023
 sting+0.023
 bleed+0.022
Neg logit contributions
 grandparents-0.044
 home-0.043
 grandma-0.043
 destination-0.043
ari-0.040
Token heard
Token ID2982
Feature activation-2.91e-02

Pos logit contributions
bee+0.070
 sensation+0.063
gging+0.062
hing+0.059
 bleed+0.059
Neg logit contributions
 grandparents-0.063
 grandma-0.060
 destination-0.057
 home-0.056
 boarded-0.056
Token a
Token ID257
Feature activation-8.94e-02

Pos logit contributions
bee+0.068
gging+0.059
 bleed+0.056
 sensation+0.055
bees+0.055
Neg logit contributions
 grandma-0.046
 grandparents-0.045
 home-0.045
 destination-0.040
 Grand-0.037
Token loud
Token ID7812
Feature activation-1.33e-01

Pos logit contributions
bee+0.172
ez+0.167
gging+0.164
ched+0.151
ped+0.149
Neg logit contributions
 grandma-0.128
 grandparents-0.126
 door-0.126
 home-0.125
 grand-0.122
Token noise
Token ID7838
Feature activation-6.76e-02

Pos logit contributions
pill+0.215
bee+0.203
oles+0.195
bery+0.192
bees+0.189
Neg logit contributions
 home-0.212
 grandma-0.194
 door-0.181
 downstairs-0.175
 station-0.172
Token.
Token ID13
Feature activation-8.37e-03

Pos logit contributions
bee+0.159
gging+0.145
pill+0.144
Buzz+0.143
 gag+0.142
Neg logit contributions
 downstairs-0.083
 home-0.083
 online-0.076
 grandma-0.074
 upstairs-0.072
Token The
Token ID383
Feature activation-5.63e-02

Pos logit contributions
bee+0.011
gging+0.009
 ow+0.009
 sting+0.009
 bleed+0.009
Neg logit contributions
 grandparents-0.021
 destination-0.020
 home-0.020
 grandma-0.020
ari-0.019
Token hole
Token ID7604
Feature activation-6.74e-02

Pos logit contributions
gging+0.097
bee+0.096
its+0.088
hing+0.087
ez+0.085
Neg logit contributions
 home-0.106
 grandma-0.102
 grandparents-0.102
 backpack-0.096
 room-0.093
Token started
Token ID2067
Feature activation-3.24e-02

Pos logit contributions
bee+0.132
gging+0.128
 conclusions+0.125
 bleed+0.119
 gag+0.118
Neg logit contributions
 destination-0.096
 home-0.093
ari-0.090
 online-0.089
 grandma-0.089
Token mouth
Token ID5422
Feature activation-4.52e-02

Pos logit contributions
 grandparents+0.006
 destination+0.006
 home+0.006
ari+0.006
 grandma+0.006
Neg logit contributions
bee-0.005
 ow-0.004
gging-0.004
 bleed-0.004
 sting-0.004
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
bee+0.098
 ow+0.086
gging+0.084
 mosquitoes+0.082
 bleed+0.080
Neg logit contributions
 home-0.075
 destination-0.068
 grandparents-0.065
 online-0.064
 grandma-0.064
Token It
Token ID632
Feature activation-5.27e-02

Pos logit contributions
bee+0.022
gging+0.018
 bleed+0.017
 ow+0.017
 sensation+0.017
Neg logit contributions
 grandparents-0.037
 destination-0.035
 grandma-0.035
 home-0.035
ari-0.034
Token made
Token ID925
Feature activation-3.70e-02

Pos logit contributions
bee+0.102
gging+0.102
 sensation+0.097
 conclusions+0.096
its+0.096
Neg logit contributions
 grandparents-0.077
 grandma-0.076
 home-0.073
 destination-0.072
 boarded-0.072
Token a
Token ID257
Feature activation-5.58e-02

Pos logit contributions
bee+0.087
gging+0.076
 sensation+0.073
 bleed+0.071
bees+0.070
Neg logit contributions
 home-0.057
 grandma-0.054
 grandparents-0.054
 destination-0.049
 presents-0.045
Token loud
Token ID7812
Feature activation-1.33e-01

Pos logit contributions
bee+0.102
gging+0.094
ez+0.088
 mosquitoes+0.083
bees+0.082
Neg logit contributions
 home-0.107
 destination-0.106
 grandparents-0.105
 grandma-0.100
 list-0.090
Token noise
Token ID7838
Feature activation-7.19e-02

Pos logit contributions
pill+0.220
Fine+0.217
Nice+0.211
bee+0.208
If+0.205
Neg logit contributions
 home-0.214
 door-0.186
 grandma-0.184
 room-0.173
 window-0.164
Token that
Token ID326
Feature activation-2.36e-02

Pos logit contributions
bee+0.171
gging+0.161
 gag+0.158
Buzz+0.158
its+0.158
Neg logit contributions
 home-0.082
 online-0.075
 downstairs-0.074
 grandma-0.073
 grandparents-0.067
Token sounded
Token ID14846
Feature activation-4.55e-02

Pos logit contributions
bee+0.043
gging+0.038
 bleed+0.036
 ow+0.035
 sensation+0.035
Neg logit contributions
 grandparents-0.046
 destination-0.044
 home-0.043
 grandma-0.043
ari-0.040
Token like
Token ID588
Feature activation-4.03e-02

Pos logit contributions
bee+0.087
gging+0.076
its+0.073
ez+0.073
 bleed+0.070
Neg logit contributions
 home-0.083
 grandparents-0.078
 destination-0.078
 grandma-0.077
 online-0.074
Token a
Token ID257
Feature activation-4.26e-02

Pos logit contributions
bee+0.084
gging+0.076
 bleed+0.073
its+0.071
 sensation+0.069
Neg logit contributions
 grandparents-0.068
 home-0.067
 grandma-0.065
 destination-0.063
 online-0.057
Token eyes
Token ID2951
Feature activation-4.71e-02

Pos logit contributions
 mosquitoes+0.135
bee+0.132
Buzz+0.131
 sensation+0.129
gging+0.129
Neg logit contributions
 home-0.135
 destination-0.116
 backpack-0.115
 grandparents-0.114
 grandma-0.111
Token.
Token ID13
Feature activation+5.38e-04

Pos logit contributions
bee+0.095
 bleed+0.083
gging+0.083
 sensation+0.082
 mosquitoes+0.082
Neg logit contributions
 home-0.077
 destination-0.076
 grandparents-0.073
 grandma-0.070
 online-0.068
Token It
Token ID632
Feature activation-6.11e-02

Pos logit contributions
 grandparents+0.001
 destination+0.001
 home+0.001
ari+0.001
 grandma+0.001
Neg logit contributions
bee-0.001
 ow-0.001
gging-0.001
 bleed-0.001
 sting-0.001
Token made
Token ID925
Feature activation-4.88e-02

Pos logit contributions
gging+0.122
 conclusions+0.121
bee+0.120
its+0.119
 bleed+0.116
Neg logit contributions
 grandma-0.083
 grandparents-0.082
 home-0.082
 boarded-0.078
 bought-0.078
Token a
Token ID257
Feature activation-5.50e-02

Pos logit contributions
bee+0.115
gging+0.102
 sensation+0.097
 bleed+0.097
pe+0.094
Neg logit contributions
 home-0.072
 grandma-0.069
 grandparents-0.066
 destination-0.062
 presents-0.056
Token loud
Token ID7812
Feature activation-1.33e-01

Pos logit contributions
bee+0.105
gging+0.096
ez+0.093
 mosquitoes+0.084
bees+0.084
Neg logit contributions
 destination-0.102
 home-0.102
 grandparents-0.099
 grandma-0.094
 list-0.087
Token sound
Token ID2128
Feature activation-7.68e-02

Pos logit contributions
pill+0.235
Nice+0.222
Fine+0.215
bee+0.215
izards+0.211
Neg logit contributions
 home-0.210
 door-0.176
 grandma-0.168
 room-0.160
 destination-0.156
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
bee+0.178
Buzz+0.176
pill+0.173
gging+0.171
Fine+0.169
Neg logit contributions
 home-0.090
 grandma-0.075
 online-0.072
 downstairs-0.069
 grandparents-0.065
Token "
Token ID366
Feature activation-5.94e-02

Pos logit contributions
 grandparents+0.025
 destination+0.024
 home+0.023
ari+0.023
 grandma+0.022
Neg logit contributions
bee-0.015
 ow-0.013
gging-0.013
 sting-0.013
 sensation-0.012
TokenC
Token ID34
Feature activation-2.97e-02

Pos logit contributions
bee+0.099
gging+0.089
 ow+0.088
 sensation+0.083
bees+0.081
Neg logit contributions
 grandparents-0.100
 boarded-0.097
 grandma-0.096
 downtown-0.092
ari-0.091
Tokenaw
Token ID707
Feature activation-4.08e-02

Pos logit contributions
bee+0.051
 sensation+0.044
 bleed+0.042
 ow+0.042
 slim+0.040
Neg logit contributions
 destination-0.062
ari-0.062
 home-0.061
 grandparents-0.060
 grandma-0.055
Token playing
Token ID2712
Feature activation-3.46e-02

Pos logit contributions
bee+0.003
 ow+0.003
gging+0.003
 bleed+0.003
 sting+0.003
Neg logit contributions
 grandparents-0.004
 destination-0.004
 home-0.004
ari-0.004
 grandma-0.004
Token,
Token ID11
Feature activation-3.99e-02

Pos logit contributions
bee+0.082
 bleed+0.074
gging+0.073
 ow+0.073
its+0.072
Neg logit contributions
 home-0.051
 destination-0.042
 online-0.041
 grandparents-0.040
 shopping-0.040
Token Benny
Token ID44275
Feature activation-2.58e-02

Pos logit contributions
bee+0.087
its+0.082
gging+0.079
 bleed+0.075
 sensation+0.073
Neg logit contributions
 home-0.063
 grandparents-0.061
 grandma-0.059
 destination-0.055
 online-0.053
Token heard
Token ID2982
Feature activation-4.27e-02

Pos logit contributions
bee+0.055
 sensation+0.051
 bleed+0.051
gging+0.051
 ow+0.050
Neg logit contributions
 grandparents-0.039
 home-0.038
 destination-0.038
 grandma-0.034
ari-0.034
Token a
Token ID257
Feature activation-7.67e-02

Pos logit contributions
bee+0.106
gging+0.094
 bleed+0.094
its+0.093
 sting+0.091
Neg logit contributions
 grandma-0.056
 grandparents-0.056
 home-0.054
 Grand-0.045
ari-0.045
Token loud
Token ID7812
Feature activation-1.33e-01

Pos logit contributions
bee+0.147
ez+0.140
gging+0.140
ped+0.125
its+0.125
Neg logit contributions
 grandparents-0.124
 home-0.122
 grandma-0.118
 destination-0.116
 grand-0.112
Token noise
Token ID7838
Feature activation-7.05e-02

Pos logit contributions
 conclusions+0.198
bee+0.198
bery+0.195
its+0.195
Ble+0.193
Neg logit contributions
 home-0.226
 grandma-0.191
 door-0.186
 station-0.183
 office-0.175
Token.
Token ID13
Feature activation-3.77e-02

Pos logit contributions
bee+0.162
gging+0.154
 gag+0.149
its+0.147
Buzz+0.146
Neg logit contributions
 home-0.086
 downstairs-0.082
 online-0.081
 upstairs-0.077
 grandparents-0.073
Token It
Token ID632
Feature activation-7.39e-02

Pos logit contributions
bee+0.066
its+0.055
 bleed+0.053
gging+0.053
 sting+0.051
Neg logit contributions
 home-0.077
 grandparents-0.075
 grandma-0.073
 destination-0.072
ari-0.068
Token was
Token ID373
Feature activation-3.43e-02

Pos logit contributions
bee+0.162
 conclusions+0.161
gging+0.157
bees+0.141
 gag+0.140
Neg logit contributions
 home-0.092
 destination-0.090
 grandparents-0.085
 grandma-0.083
 delivered-0.079
Token a
Token ID257
Feature activation-1.79e-02

Pos logit contributions
bee+0.078
gging+0.070
its+0.068
 bleed+0.064
bery+0.063
Neg logit contributions
 home-0.054
 grandparents-0.052
 grandma-0.052
 destination-0.047
 Grand-0.044
Token.
Token ID13
Feature activation-4.74e-03

Pos logit contributions
bee+0.051
gging+0.043
 ow+0.041
 bleed+0.038
 sting+0.038
Neg logit contributions
 home-0.047
 destination-0.046
 grandparents-0.044
 grandma-0.043
 online-0.041
Token The
Token ID383
Feature activation-5.61e-03

Pos logit contributions
bee+0.008
gging+0.007
 ow+0.007
 bleed+0.007
 sting+0.007
Neg logit contributions
 grandparents-0.010
 destination-0.009
 home-0.009
 grandma-0.009
ari-0.009
Token blocks
Token ID7021
Feature activation-3.70e-02

Pos logit contributions
bee+0.012
gging+0.011
 ow+0.010
 bleed+0.010
 sting+0.010
Neg logit contributions
 grandparents-0.009
 destination-0.009
 home-0.009
 grandma-0.008
ari-0.008
Token made
Token ID925
Feature activation-4.25e-02

Pos logit contributions
bee+0.070
gging+0.062
 ow+0.057
 sensation+0.057
 bleed+0.054
Neg logit contributions
 home-0.065
 destination-0.064
 grandparents-0.062
 grandma-0.059
 boarded-0.057
Token a
Token ID257
Feature activation-5.36e-02

Pos logit contributions
bee+0.105
gging+0.093
pe+0.086
bees+0.085
 sensation+0.084
Neg logit contributions
 grandma-0.061
 home-0.060
 grandparents-0.056
 destination-0.053
 Grand-0.047
Token loud
Token ID7812
Feature activation-1.33e-01

Pos logit contributions
bee+0.106
gging+0.097
ez+0.089
bees+0.086
 sensation+0.081
Neg logit contributions
 destination-0.096
 home-0.095
 grandparents-0.092
 grandma-0.090
 list-0.082
Token noise
Token ID7838
Feature activation-6.54e-02

Pos logit contributions
pill+0.228
bee+0.226
oles+0.216
Buzz+0.211
bees+0.210
Neg logit contributions
 home-0.203
 door-0.175
 grandma-0.172
 room-0.169
 downstairs-0.159
Token.
Token ID13
Feature activation+3.08e-04

Pos logit contributions
bee+0.148
gging+0.140
Buzz+0.133
pill+0.131
 gag+0.131
Neg logit contributions
 home-0.081
 downstairs-0.074
 grandma-0.074
 online-0.074
 grandparents-0.067
Token They
Token ID1119
Feature activation-2.19e-02

Pos logit contributions
 grandparents+0.001
 destination+0.001
 home+0.001
ari+0.001
 grandma+0.001
Neg logit contributions
bee-0.001
 ow-0.000
gging-0.000
 bleed-0.000
 sting-0.000
Token scattered
Token ID16830
Feature activation-1.56e-02

Pos logit contributions
bee+0.040
gging+0.036
 sensation+0.034
 ow+0.033
 bleed+0.033
Neg logit contributions
 grandparents-0.040
 home-0.039
 destination-0.038
 grandma-0.037
 boarded-0.035
Token all
Token ID477
Feature activation-1.77e-02

Pos logit contributions
bee+0.035
gging+0.030
 ow+0.029
 bleed+0.029
 sting+0.028
Neg logit contributions
 home-0.025
 grandparents-0.024
 destination-0.024
 grandma-0.023
 online-0.021
Token Suddenly
Token ID24975
Feature activation-2.67e-02

Pos logit contributions
bee+0.036
gging+0.030
 bleed+0.030
 ow+0.030
 sting+0.030
Neg logit contributions
 grandparents-0.044
 home-0.043
 destination-0.043
 grandma-0.041
ari-0.040
Token,
Token ID11
Feature activation-2.47e-02

Pos logit contributions
bee+0.062
gging+0.056
 sensation+0.056
its+0.056
 bleed+0.055
Neg logit contributions
 home-0.039
 grandparents-0.038
 destination-0.036
 grandma-0.035
ari-0.032
Token he
Token ID339
Feature activation-1.77e-02

Pos logit contributions
bee+0.044
its+0.039
gging+0.039
 bleed+0.036
 sting+0.035
Neg logit contributions
 home-0.050
 grandparents-0.049
 grandma-0.047
 destination-0.046
 online-0.044
Token heard
Token ID2982
Feature activation-4.06e-02

Pos logit contributions
bee+0.037
gging+0.033
 bleed+0.033
 sensation+0.033
its+0.031
Neg logit contributions
 grandparents-0.029
 destination-0.028
 home-0.028
ari-0.026
 grandma-0.026
Token a
Token ID257
Feature activation-7.91e-02

Pos logit contributions
bee+0.099
gging+0.088
 bleed+0.088
its+0.087
 sting+0.085
Neg logit contributions
 grandparents-0.056
 home-0.055
 grandma-0.055
 destination-0.048
ari-0.046
Token loud
Token ID7812
Feature activation-1.33e-01

Pos logit contributions
bee+0.158
ez+0.153
gging+0.152
ped+0.140
its+0.135
Neg logit contributions
 grandparents-0.123
 home-0.120
 destination-0.114
 grandma-0.114
 grand-0.114
Token noise
Token ID7838
Feature activation-7.33e-02

Pos logit contributions
Ble+0.208
its+0.207
 conclusions+0.203
bery+0.203
bee+0.201
Neg logit contributions
 home-0.225
 door-0.182
 grandma-0.181
 station-0.178
 room-0.171
Token.
Token ID13
Feature activation-3.28e-02

Pos logit contributions
bee+0.174
gging+0.166
 gag+0.165
its+0.161
 bleed+0.158
Neg logit contributions
 home-0.085
 downstairs-0.084
 online-0.079
 upstairs-0.075
 grandparents-0.072
Token "
Token ID366
Feature activation-5.41e-02

Pos logit contributions
bee+0.056
its+0.046
gging+0.045
 bleed+0.045
 sensation+0.043
Neg logit contributions
 home-0.068
 grandparents-0.068
 grandma-0.065
 destination-0.064
ari-0.061
TokenWhat
Token ID2061
Feature activation-3.14e-02

Pos logit contributions
bee+0.089
gging+0.080
der+0.075
its+0.074
 ow+0.074
Neg logit contributions
 grandparents-0.101
 home-0.092
 boarded-0.092
 grandma-0.091
 destination-0.088
Token was
Token ID373
Feature activation+4.95e-04

Pos logit contributions
bee+0.059
 sensation+0.053
gging+0.052
 bleed+0.049
 sting+0.048
Neg logit contributions
 grandparents-0.061
 grandma-0.057
 home-0.056
 destination-0.055
ari-0.055
Token.
Token ID13
Feature activation-2.40e-02

Pos logit contributions
bee+0.143
gging+0.123
 ow+0.116
 sensation+0.115
bees+0.114
Neg logit contributions
 home-0.094
 destination-0.078
 online-0.076
 grandma-0.069
 downstairs-0.067
Token The
Token ID383
Feature activation-1.68e-02

Pos logit contributions
bee+0.044
gging+0.035
 bleed+0.035
 sensation+0.034
 ow+0.033
Neg logit contributions
 grandparents-0.046
 grandma-0.046
 home-0.045
 destination-0.045
ari-0.042
Token tube
Token ID12403
Feature activation-5.69e-02

Pos logit contributions
bee+0.034
gging+0.031
 ow+0.028
ez+0.028
 sensation+0.028
Neg logit contributions
 grandparents-0.030
 home-0.029
 destination-0.028
 grandma-0.028
ari-0.025
Token makes
Token ID1838
Feature activation-4.84e-02

Pos logit contributions
bee+0.110
gging+0.093
 sensation+0.091
 ow+0.083
ity+0.083
Neg logit contributions
 destination-0.101
 home-0.100
 grandma-0.093
 grandparents-0.092
 online-0.089
Token a
Token ID257
Feature activation-6.86e-02

Pos logit contributions
bee+0.123
gging+0.109
 sensation+0.100
bees+0.100
pe+0.099
Neg logit contributions
 home-0.068
 grandma-0.062
 grandparents-0.056
 destination-0.056
 Grand-0.047
Token loud
Token ID7812
Feature activation-1.33e-01

Pos logit contributions
bee+0.138
gging+0.128
ez+0.119
bees+0.113
hing+0.111
Neg logit contributions
 home-0.119
 destination-0.111
 grandma-0.107
 grandparents-0.106
 list-0.102
Token his
Token ID465
Feature activation-3.35e-02

Pos logit contributions
pill+0.224
bee+0.211
Nice+0.207
izards+0.204
oles+0.200
Neg logit contributions
 home-0.208
 door-0.174
 grandma-0.166
 room-0.164
 for-0.164
Tokens
Token ID82
Feature activation-7.78e-02

Pos logit contributions
bee+0.063
gging+0.058
its+0.053
ching+0.053
 mosquitoes+0.053
Neg logit contributions
 home-0.063
 destination-0.059
 grandparents-0.059
 grandma-0.057
 backpack-0.052
Token and
Token ID290
Feature activation-4.17e-02

Pos logit contributions
pill+0.153
bee+0.142
Run+0.142
Buzz+0.141
Raw+0.140
Neg logit contributions
 home-0.136
 grandma-0.104
 destination-0.101
 backpack-0.096
 dad-0.092
Token pops
Token ID26384
Feature activation-5.18e-02

Pos logit contributions
bee+0.079
gging+0.069
hing+0.063
ickets+0.062
 sensation+0.061
Neg logit contributions
 grandma-0.078
 home-0.078
 grandparents-0.076
 destination-0.076
 backpack-0.071
Token.
Token ID13
Feature activation-2.28e-02

Pos logit contributions
bee+0.104
gging+0.101
pill+0.088
 slim+0.087
 sting+0.087
Neg logit contributions
 home-0.089
 grandma-0.078
 destination-0.076
 grandparents-0.071
 online-0.069
Token at
Token ID379
Feature activation+2.28e-02

Pos logit contributions
bee+0.100
 ow+0.089
 sensation+0.088
gging+0.088
 bleed+0.085
Neg logit contributions
 home-0.068
 destination-0.064
 grandparents-0.060
 online-0.056
ari-0.055
Token them
Token ID606
Feature activation-4.35e-02

Pos logit contributions
 grandparents+0.052
ari+0.049
 destination+0.049
 home+0.045
 online+0.044
Neg logit contributions
bee-0.034
 ow-0.030
gging-0.029
 sting-0.028
 sensation-0.027
Token and
Token ID290
Feature activation-4.91e-02

Pos logit contributions
bee+0.089
 ow+0.078
 sensation+0.078
gging+0.076
 mosquitoes+0.072
Neg logit contributions
 home-0.078
 destination-0.072
 grandparents-0.069
 grandma-0.065
 online-0.063
Token made
Token ID925
Feature activation-3.55e-02

Pos logit contributions
bee+0.092
ickets+0.081
gging+0.080
its+0.076
hing+0.075
Neg logit contributions
 grandma-0.083
 home-0.081
 grandparents-0.081
 destination-0.078
 Grand-0.076
Token a
Token ID257
Feature activation-6.80e-02

Pos logit contributions
bee+0.078
gging+0.067
 sensation+0.063
bees+0.062
 ow+0.061
Neg logit contributions
 home-0.062
 grandma-0.060
 grandparents-0.059
 destination-0.056
 online-0.049
Token loud
Token ID7812
Feature activation-1.33e-01

Pos logit contributions
bee+0.129
gging+0.120
ez+0.111
bees+0.110
hing+0.108
Neg logit contributions
 home-0.125
 destination-0.115
 grandparents-0.108
 grandma-0.107
 list-0.103
Token noise
Token ID7838
Feature activation-6.92e-02

Pos logit contributions
pill+0.233
Nice+0.215
bee+0.210
Fine+0.205
 mosquitoes+0.204
Neg logit contributions
 home-0.215
 door-0.180
 grandma-0.167
 list-0.161
 room-0.156
Token.
Token ID13
Feature activation-2.12e-03

Pos logit contributions
bee+0.158
gging+0.149
pill+0.142
 ow+0.141
Buzz+0.140
Neg logit contributions
 home-0.085
 downstairs-0.076
 online-0.075
 grandma-0.073
 grandparents-0.070
Token Lily
Token ID20037
Feature activation-1.89e-02

Pos logit contributions
bee+0.003
gging+0.003
 ow+0.003
 bleed+0.003
 sting+0.003
Neg logit contributions
 grandparents-0.005
 destination-0.005
 home-0.005
 grandma-0.004
ari-0.004
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
bee+0.038
 sensation+0.034
 ow+0.034
 bleed+0.033
gging+0.033
Neg logit contributions
 grandparents-0.032
 home-0.030
 destination-0.029
 grandma-0.029
ari-0.028
Token Tom
Token ID4186
Feature activation-6.33e-02

Pos logit contributions
bee+0.022
gging+0.019
 sensation+0.019
 bleed+0.019
 ow+0.019
Neg logit contributions
 grandparents-0.019
 home-0.018
 destination-0.018
 grandma-0.018
ari-0.016
Token down
Token ID866
Feature activation-6.58e-03

Pos logit contributions
bee+0.052
gging+0.047
 ow+0.045
 sensation+0.045
 sting+0.043
Neg logit contributions
 home-0.027
 destination-0.026
 grandparents-0.025
 grandma-0.023
 online-0.022
Token.
Token ID13
Feature activation-1.38e-03

Pos logit contributions
bee+0.011
gging+0.009
 ow+0.009
 bleed+0.009
 sting+0.009
Neg logit contributions
 grandparents-0.014
 destination-0.013
 home-0.013
 grandma-0.013
ari-0.013
Token It
Token ID632
Feature activation-5.64e-02

Pos logit contributions
bee+0.002
gging+0.001
 ow+0.001
 bleed+0.001
 sting+0.001
Neg logit contributions
 grandparents-0.003
 destination-0.003
 home-0.003
ari-0.003
 grandma-0.003
Token makes
Token ID1838
Feature activation-2.85e-02

Pos logit contributions
bee+0.108
gging+0.101
bees+0.088
 sensation+0.086
ity+0.084
Neg logit contributions
 destination-0.104
 home-0.104
 grandma-0.096
 grandparents-0.092
 backpack-0.090
Token a
Token ID257
Feature activation-6.76e-02

Pos logit contributions
bee+0.066
gging+0.058
 sensation+0.054
 bleed+0.053
bees+0.052
Neg logit contributions
 home-0.044
 grandparents-0.044
 grandma-0.043
 destination-0.043
 cousin-0.035
Token loud
Token ID7812
Feature activation-1.32e-01

Pos logit contributions
bee+0.132
gging+0.125
ez+0.115
bees+0.107
hing+0.105
Neg logit contributions
 home-0.119
 destination-0.118
 grandma-0.113
 grandparents-0.113
 list-0.101
Token noise
Token ID7838
Feature activation-5.73e-02

Pos logit contributions
pill+0.220
bee+0.210
oles+0.209
gging+0.205
Nice+0.199
Neg logit contributions
 home-0.208
 grandma-0.187
 door-0.184
 room-0.181
 bed-0.164
Token.
Token ID13
Feature activation-5.81e-03

Pos logit contributions
bee+0.129
gging+0.121
 sensation+0.113
 gag+0.113
 bleed+0.111
Neg logit contributions
 home-0.073
 grandma-0.069
 downstairs-0.068
 online-0.064
 grandparents-0.064
Token It
Token ID632
Feature activation-6.33e-02

Pos logit contributions
bee+0.009
gging+0.007
 ow+0.007
 bleed+0.007
 sting+0.007
Neg logit contributions
 grandparents-0.013
 destination-0.012
 home-0.012
 grandma-0.012
ari-0.012
Token breaks
Token ID9457
Feature activation-6.23e-03

Pos logit contributions
bee+0.124
gging+0.118
bees+0.103
 sensation+0.103
ity+0.101
Neg logit contributions
 destination-0.110
 home-0.104
 grandma-0.101
 grandparents-0.099
 backpack-0.093
Token into
Token ID656
Feature activation+9.29e-03

Pos logit contributions
bee+0.014
gging+0.012
 ow+0.012
 bleed+0.012
 sting+0.012
Neg logit contributions
 grandparents-0.010
 destination-0.010
 home-0.010
 grandma-0.009
ari-0.009
TokenSuddenly
Token ID38582
Feature activation-9.11e-03

Pos logit contributions
 grandparents+0.002
 destination+0.002
 home+0.002
ari+0.002
 grandma+0.002
Neg logit contributions
bee-0.002
 ow-0.001
gging-0.001
 bleed-0.001
 sting-0.001
Token,
Token ID11
Feature activation+3.82e-03

Pos logit contributions
bee+0.019
gging+0.017
 bleed+0.017
 ow+0.016
 sensation+0.016
Neg logit contributions
 grandparents-0.016
 destination-0.015
 home-0.015
 grandma-0.014
ari-0.014
Token they
Token ID484
Feature activation+1.79e-03

Pos logit contributions
 grandparents+0.008
 destination+0.007
 home+0.007
ari+0.007
 grandma+0.007
Neg logit contributions
bee-0.007
 ow-0.006
gging-0.006
 bleed-0.006
 sting-0.006
Token heard
Token ID2982
Feature activation-3.07e-02

Pos logit contributions
 grandparents+0.004
 destination+0.004
 home+0.003
ari+0.003
 grandma+0.003
Neg logit contributions
bee-0.003
 ow-0.003
gging-0.003
 bleed-0.003
 sting-0.003
Token a
Token ID257
Feature activation-8.05e-02

Pos logit contributions
bee+0.070
gging+0.062
 bleed+0.061
 sensation+0.059
 sting+0.059
Neg logit contributions
 grandparents-0.049
 grandma-0.048
 home-0.047
 destination-0.044
ari-0.040
Token loud
Token ID7812
Feature activation-1.32e-01

Pos logit contributions
bee+0.157
ez+0.154
gging+0.148
ched+0.139
ped+0.136
Neg logit contributions
 grandparents-0.132
 destination-0.125
 grandma-0.124
 home-0.122
 grand-0.118
Token noise
Token ID7838
Feature activation-5.53e-02

Pos logit contributions
bee+0.201
Ble+0.198
pill+0.197
its+0.196
bery+0.195
Neg logit contributions
 home-0.224
 grandma-0.197
 station-0.186
 door-0.182
 party-0.180
Token.
Token ID13
Feature activation+4.28e-04

Pos logit contributions
bee+0.123
gging+0.114
 bleed+0.109
 gag+0.107
its+0.107
Neg logit contributions
 home-0.079
 downstairs-0.074
 grandparents-0.074
 grandma-0.072
 online-0.072
Token It
Token ID632
Feature activation-4.56e-02

Pos logit contributions
 grandparents+0.001
 destination+0.001
 home+0.001
ari+0.001
 grandma+0.001
Neg logit contributions
bee-0.001
 ow-0.000
gging-0.000
 bleed-0.000
 sting-0.000
Token was
Token ID373
Feature activation-1.41e-02

Pos logit contributions
bee+0.087
gging+0.082
 bleed+0.075
 conclusions+0.074
 sensation+0.073
Neg logit contributions
 grandparents-0.078
 destination-0.078
 home-0.075
 grandma-0.074
 boarded-0.067
Token their
Token ID511
Feature activation+8.92e-03

Pos logit contributions
bee+0.030
gging+0.027
 bleed+0.025
 sting+0.025
 ow+0.025
Neg logit contributions
 grandparents-0.024
 home-0.023
 destination-0.023
 grandma-0.023
 online-0.019
Token the
Token ID262
Feature activation+2.13e-02

Pos logit contributions
ari+0.064
 destination+0.060
 grandparents+0.060
 online+0.054
 home+0.053
Neg logit contributions
 ow-0.050
bee-0.050
 sting-0.048
gging-0.046
 bleed-0.046
Token exam
Token ID2814
Feature activation+7.54e-02

Pos logit contributions
 grandparents+0.035
 destination+0.034
ari+0.033
 home+0.032
 online+0.030
Neg logit contributions
bee-0.044
 ow-0.039
 sting-0.039
 sensation-0.038
 bleed-0.038
Token.
Token ID13
Feature activation+2.39e-02

Pos logit contributions
 grandparents+0.190
 destination+0.188
 harbour+0.170
 home+0.170
 boarded+0.169
Neg logit contributions
bee-0.081
 sting-0.059
 bleed-0.059
 ow-0.058
gging-0.056
Token Let
Token ID3914
Feature activation+1.38e-02

Pos logit contributions
 destination+0.056
 grandparents+0.055
 home+0.054
ari+0.051
 grandma+0.049
Neg logit contributions
bee-0.032
 ow-0.028
 bleed-0.027
gging-0.026
 sting-0.026
Token's
Token ID338
Feature activation+1.15e-02

Pos logit contributions
 grandparents+0.031
 destination+0.030
ari+0.029
 home+0.028
 online+0.027
Neg logit contributions
bee-0.021
 ow-0.018
 sting-0.018
gging-0.017
 bleed-0.017
Token go
Token ID467
Feature activation+7.79e-02

Pos logit contributions
 grandparents+0.024
 destination+0.023
ari+0.023
 home+0.022
 grandma+0.021
Neg logit contributions
bee-0.019
 ow-0.016
 bleed-0.016
 sting-0.016
gging-0.016
Token."
Token ID526
Feature activation+3.45e-02

Pos logit contributions
 grandparents+0.159
 destination+0.144
 cousins+0.135
 grandmother+0.130
 harbour+0.130
Neg logit contributions
 bleed-0.097
bee-0.095
p-0.089
 sting-0.087
ching-0.087
Token Lily
Token ID20037
Feature activation+1.49e-02

Pos logit contributions
 destination+0.086
 grandparents+0.083
ari+0.077
 home+0.075
pack+0.073
Neg logit contributions
bee-0.037
 ow-0.036
 sting-0.034
 spit-0.034
 bleed-0.033
Token says
Token ID1139
Feature activation-9.17e-03

Pos logit contributions
 grandparents+0.033
 destination+0.032
 home+0.031
ari+0.030
 grandma+0.029
Neg logit contributions
bee-0.023
gging-0.019
 bleed-0.019
 ow-0.019
 sting-0.019
Token.
Token ID13
Feature activation+6.78e-03

Pos logit contributions
bee+0.015
 ow+0.013
gging+0.013
 sensation+0.013
 sting+0.013
Neg logit contributions
 grandparents-0.019
 home-0.019
 destination-0.018
 grandma-0.018
ari-0.018
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
 grandparents+0.014
 destination+0.014
 home+0.013
ari+0.013
 grandma+0.013
Neg logit contributions
bee-0.011
 ow-0.010
gging-0.010
 bleed-0.010
 sting-0.010
Token today
Token ID1909
Feature activation+3.90e-02

Pos logit contributions
 grandparents+0.011
 destination+0.010
 home+0.010
ari+0.010
 grandma+0.009
Neg logit contributions
bee-0.009
 ow-0.008
gging-0.008
 bleed-0.008
 sting-0.008
Token.
Token ID13
Feature activation+1.39e-03

Pos logit contributions
 grandparents+0.091
 destination+0.086
ari+0.084
 home+0.082
 grandma+0.080
Neg logit contributions
bee-0.054
 ow-0.047
 sting-0.044
gging-0.043
 bleed-0.042
Token Let
Token ID3914
Feature activation+1.52e-02

Pos logit contributions
 grandparents+0.003
 destination+0.003
 home+0.003
ari+0.003
 grandma+0.003
Neg logit contributions
bee-0.002
 ow-0.002
gging-0.002
 bleed-0.002
 sting-0.002
Token's
Token ID338
Feature activation-4.60e-03

Pos logit contributions
 grandparents+0.034
 destination+0.033
ari+0.033
 home+0.031
 online+0.030
Neg logit contributions
bee-0.022
 ow-0.020
 sting-0.019
gging-0.019
 bleed-0.018
Token play
Token ID711
Feature activation+3.90e-02

Pos logit contributions
bee+0.009
gging+0.008
 ow+0.008
 bleed+0.007
 sting+0.007
Neg logit contributions
 grandparents-0.009
 destination-0.008
 home-0.008
 grandma-0.008
ari-0.008
Token tomorrow
Token ID9439
Feature activation+8.12e-02

Pos logit contributions
 grandparents+0.081
ari+0.078
 destination+0.076
 boarded+0.069
eline+0.069
Neg logit contributions
bee-0.062
ze-0.059
 ow-0.057
gging-0.056
 sting-0.055
Token."
Token ID526
Feature activation+1.43e-02

Pos logit contributions
 grandparents+0.188
ari+0.174
 destination+0.169
 boarded+0.166
 workshop+0.161
Neg logit contributions
bee-0.102
 ow-0.086
gging-0.081
 sting-0.079
ze-0.078
Token Tommy
Token ID19919
Feature activation-2.09e-04

Pos logit contributions
 grandparents+0.030
 destination+0.030
ari+0.029
 home+0.028
 grandma+0.026
Neg logit contributions
bee-0.023
gging-0.020
 ow-0.020
 bleed-0.019
 sensation-0.019
Token forg
Token ID5784
Feature activation-3.36e-03

Pos logit contributions
bee+0.000
 ow+0.000
gging+0.000
 bleed+0.000
 sting+0.000
Neg logit contributions
 grandparents-0.000
 destination-0.000
 home-0.000
ari-0.000
 grandma-0.000
Tokenave
Token ID1015
Feature activation-1.63e-02

Pos logit contributions
bee+0.007
 ow+0.007
gging+0.007
 bleed+0.006
 sting+0.006
Neg logit contributions
 grandparents-0.005
 destination-0.005
 home-0.005
ari-0.005
 grandma-0.005
Token him
Token ID683
Feature activation-1.51e-02

Pos logit contributions
bee+0.035
 sting+0.031
 bleed+0.031
 ow+0.031
gging+0.031
Neg logit contributions
 grandparents-0.028
 home-0.026
 grandma-0.026
 destination-0.026
ari-0.022
Token a
Token ID257
Feature activation+2.65e-02

Pos logit contributions
 grandparents+0.008
 destination+0.008
 home+0.007
ari+0.007
 grandma+0.007
Neg logit contributions
bee-0.008
 ow-0.007
gging-0.007
 bleed-0.007
 sting-0.007
Token board
Token ID3096
Feature activation+5.26e-02

Pos logit contributions
 grandparents+0.048
ari+0.047
 destination+0.045
 home+0.043
 grandma+0.042
Neg logit contributions
bee-0.050
 ow-0.044
 sting-0.044
 sensation-0.043
 bleed-0.042
Token game
Token ID983
Feature activation+5.18e-02

Pos logit contributions
 grandparents+0.112
 destination+0.100
ari+0.095
 grandma+0.094
 home+0.093
Neg logit contributions
bee-0.085
 sting-0.076
 ow-0.074
 sensation-0.072
ching-0.071
Token or
Token ID393
Feature activation+1.65e-02

Pos logit contributions
 grandparents+0.110
 destination+0.102
ari+0.096
 home+0.095
 grandma+0.093
Neg logit contributions
bee-0.080
 ow-0.070
 sting-0.068
 sensation-0.065
ching-0.065
Token a
Token ID257
Feature activation+1.83e-02

Pos logit contributions
 grandparents+0.032
 destination+0.031
ari+0.030
 home+0.030
 grandma+0.028
Neg logit contributions
bee-0.029
 ow-0.026
 sting-0.026
 bleed-0.025
gging-0.025
Token puzzle
Token ID15027
Feature activation+7.83e-02

Pos logit contributions
 grandparents+0.034
 destination+0.033
ari+0.032
 home+0.031
 grandma+0.030
Neg logit contributions
bee-0.034
 ow-0.030
 sting-0.030
 bleed-0.029
 sensation-0.029
Token?"
Token ID1701
Feature activation-1.50e-02

Pos logit contributions
 grandparents+0.175
ari+0.155
 destination+0.154
eline+0.149
 grandmother+0.148
Neg logit contributions
bee-0.107
 sting-0.102
 ow-0.100
 mosquitoes-0.092
 sensation-0.091
Token
Token ID198
Feature activation-7.09e-03

Pos logit contributions
bee+0.033
 ow+0.029
 bleed+0.029
 sting+0.028
gging+0.028
Neg logit contributions
 grandparents-0.025
 home-0.023
 grandma-0.023
 destination-0.023
ari-0.022
Token
Token ID198
Feature activation-1.18e-03

Pos logit contributions
bee+0.014
 ow+0.012
 bleed+0.012
gging+0.012
 sensation+0.012
Neg logit contributions
 grandparents-0.013
 destination-0.012
 home-0.012
 grandma-0.012
ari-0.012
TokenJ
Token ID41
Feature activation-1.13e-02

Pos logit contributions
bee+0.002
 ow+0.002
gging+0.002
 bleed+0.002
 sting+0.002
Neg logit contributions
 grandparents-0.002
 destination-0.002
 home-0.002
ari-0.002
 grandma-0.002
Tokenill
Token ID359
Feature activation-3.39e-02

Pos logit contributions
bee+0.021
 bleed+0.019
 ow+0.019
gging+0.018
 sting+0.018
Neg logit contributions
 grandparents-0.021
 destination-0.020
 home-0.020
ari-0.020
 grandma-0.019
Token it
Token ID340
Feature activation+4.56e-02

Pos logit contributions
 destination+0.051
 grandparents+0.050
ari+0.050
 home+0.046
 online+0.044
Neg logit contributions
bee-0.036
 ow-0.033
 sting-0.030
gging-0.030
 bleed-0.029
Token!
Token ID0
Feature activation+2.61e-02

Pos logit contributions
 grandparents+0.105
 destination+0.102
ari+0.099
 boarded+0.096
 passports+0.096
Neg logit contributions
bee-0.065
 ow-0.055
ze-0.047
 sting-0.047
 bleed-0.046
Token Can
Token ID1680
Feature activation+1.62e-02

Pos logit contributions
 destination+0.064
 grandparents+0.062
 home+0.061
ari+0.058
 grandma+0.055
Neg logit contributions
bee-0.033
 ow-0.028
 bleed-0.026
gging-0.025
 sting-0.025
Token we
Token ID356
Feature activation+4.15e-02

Pos logit contributions
 grandparents+0.037
 destination+0.037
 home+0.035
ari+0.035
 online+0.032
Neg logit contributions
bee-0.023
 ow-0.020
gging-0.019
 sting-0.019
 bleed-0.019
Token put
Token ID1234
Feature activation+5.22e-02

Pos logit contributions
 grandparents+0.082
ari+0.080
 destination+0.079
 home+0.075
 online+0.073
Neg logit contributions
bee-0.066
 sting-0.062
 ow-0.061
 bleed-0.057
 spit-0.056
Token it
Token ID340
Feature activation+8.11e-02

Pos logit contributions
ari+0.147
 grandparents+0.146
 destination+0.140
 driveway+0.132
 boarded+0.131
Neg logit contributions
bee-0.038
 ow-0.037
gging-0.033
 sting-0.032
 bleed-0.029
Token in
Token ID287
Feature activation+8.20e-02

Pos logit contributions
 grandparents+0.174
ari+0.154
 workshop+0.148
 destination+0.148
 passports+0.146
Neg logit contributions
bee-0.118
 ow-0.108
 sting-0.102
 bleed-0.100
ze-0.099
Token my
Token ID616
Feature activation+7.66e-02

Pos logit contributions
ari+0.224
ouses+0.193
cases+0.191
 grandparents+0.190
 boarded+0.185
Neg logit contributions
 ow-0.064
 slim-0.060
 itself-0.059
 sting-0.057
 it-0.057
Token stamp
Token ID17977
Feature activation+8.93e-02

Pos logit contributions
 grandparents+0.116
ari+0.114
 destination+0.113
ris+0.110
ait+0.110
Neg logit contributions
 ow-0.143
 sting-0.127
 thorn-0.123
 slim-0.119
 itch-0.116
Token collection
Token ID4947
Feature activation+1.04e-01

Pos logit contributions
 grandparents+0.188
ari+0.171
 boarded+0.170
 shopping+0.169
ait+0.167
Neg logit contributions
bee-0.120
 ow-0.097
 sting-0.096
ze-0.095
fly-0.091
Token?"
Token ID1701
Feature activation+2.10e-02

Pos logit contributions
 grandparents+0.233
 cousins+0.218
 boarded+0.208
 passports+0.205
 paperwork+0.202
Neg logit contributions
bee-0.114
 ow-0.099
ickets-0.094
 sensation-0.092
 rust-0.091
Token,
Token ID11
Feature activation-2.07e-02

Pos logit contributions
 grandparents+0.034
 destination+0.033
ari+0.031
 home+0.031
 online+0.030
Neg logit contributions
bee-0.027
 ow-0.024
 sting-0.023
gging-0.022
 bleed-0.022
Token it
Token ID340
Feature activation-3.02e-02

Pos logit contributions
bee+0.039
gging+0.035
 bleed+0.033
 sensation+0.033
 ow+0.032
Neg logit contributions
 grandparents-0.040
 home-0.038
 grandma-0.038
 destination-0.037
ari-0.034
Token's
Token ID338
Feature activation+3.42e-02

Pos logit contributions
bee+0.067
gging+0.061
 sensation+0.059
 ow+0.056
 bleed+0.056
Neg logit contributions
 grandparents-0.047
 destination-0.044
 home-0.043
 grandma-0.042
ari-0.041
Token time
Token ID640
Feature activation+6.39e-02

Pos logit contributions
ari+0.081
 grandparents+0.080
 destination+0.078
 carriage+0.069
 backpack+0.069
Neg logit contributions
bee-0.046
 ow-0.044
 sting-0.041
 itch-0.040
 sensation-0.040
Token to
Token ID284
Feature activation+8.76e-03

Pos logit contributions
 destination+0.127
 grandparents+0.125
ari+0.122
 workshop+0.109
 buyer+0.108
Neg logit contributions
bee-0.099
 sting-0.089
 bleed-0.088
 ow-0.084
 hurts-0.082
Token go
Token ID467
Feature activation+7.29e-02

Pos logit contributions
 grandparents+0.018
 destination+0.017
ari+0.016
 home+0.016
 grandma+0.016
Neg logit contributions
bee-0.015
 ow-0.013
 bleed-0.013
gging-0.013
 sting-0.013
Token home
Token ID1363
Feature activation+9.16e-02

Pos logit contributions
 grandparents+0.125
ari+0.109
eria+0.108
 grandmother+0.101
 grandson+0.101
Neg logit contributions
 bleed-0.129
 sting-0.118
 poke-0.117
 pinch-0.117
 hurts-0.116
Token now
Token ID783
Feature activation+5.07e-02

Pos logit contributions
ari+0.236
 grandparents+0.229
 boarded+0.220
 destination+0.219
ival+0.219
Neg logit contributions
 bleed-0.056
 sting-0.051
 hurts-0.049
 noise-0.049
 ow-0.048
Token.
Token ID13
Feature activation+1.82e-02

Pos logit contributions
 grandparents+0.123
ari+0.121
 destination+0.113
 boarded+0.109
 online+0.105
Neg logit contributions
bee-0.062
 ow-0.058
 bleed-0.057
 sting-0.056
gging-0.050
Token I
Token ID314
Feature activation+2.80e-02

Pos logit contributions
 grandparents+0.043
 destination+0.042
 home+0.040
ari+0.039
 grandma+0.038
Neg logit contributions
bee-0.025
 ow-0.021
 bleed-0.021
gging-0.020
 sting-0.020
Token have
Token ID423
Feature activation+6.51e-02

Pos logit contributions
 grandparents+0.061
 destination+0.059
ari+0.057
 home+0.055
 grandma+0.054
Neg logit contributions
bee-0.041
 bleed-0.038
 sting-0.038
 ow-0.034
ching-0.034
Token "
Token ID366
Feature activation+1.67e-02

Pos logit contributions
 destination+0.041
 grandparents+0.040
ari+0.038
 home+0.037
 grandma+0.035
Neg logit contributions
bee-0.026
 ow-0.022
gging-0.022
 sting-0.021
 rust-0.020
TokenWhy
Token ID5195
Feature activation+3.14e-03

Pos logit contributions
 grandparents+0.043
 destination+0.042
 home+0.041
ari+0.040
 grandma+0.039
Neg logit contributions
bee-0.020
 ow-0.015
gging-0.015
 sensation-0.014
 sting-0.014
Token are
Token ID389
Feature activation+5.07e-02

Pos logit contributions
 grandparents+0.008
 destination+0.008
 home+0.008
ari+0.007
 grandma+0.007
Neg logit contributions
bee-0.004
 ow-0.003
gging-0.003
 bleed-0.003
 sting-0.003
Token there
Token ID612
Feature activation+4.86e-02

Pos logit contributions
 destination+0.120
 grandparents+0.117
 home+0.110
ari+0.110
ris+0.107
Neg logit contributions
 ow-0.064
bee-0.064
 rust-0.056
 mosquitoes-0.054
 sting-0.053
Token so
Token ID523
Feature activation+3.44e-02

Pos logit contributions
 grandparents+0.099
 destination+0.092
ari+0.091
ris+0.086
 home+0.086
Neg logit contributions
 ow-0.076
bee-0.074
 rust-0.067
 sting-0.066
gging-0.061
Token many
Token ID867
Feature activation+5.01e-02

Pos logit contributions
 grandparents+0.059
 destination+0.058
ari+0.052
 home+0.049
 grandson+0.049
Neg logit contributions
bee-0.064
 ow-0.064
 sting-0.059
 rust-0.058
gging-0.058
Token people
Token ID661
Feature activation+4.83e-02

Pos logit contributions
 grandparents+0.107
 destination+0.104
ari+0.102
 home+0.099
 grandmother+0.096
Neg logit contributions
bee-0.076
 ow-0.070
 mosquitoes-0.068
gging-0.067
 sting-0.062
Token here
Token ID994
Feature activation+5.98e-02

Pos logit contributions
 grandparents+0.100
 destination+0.095
ari+0.088
 grandma+0.086
 home+0.085
Neg logit contributions
bee-0.080
 sting-0.070
gging-0.070
 ow-0.068
 bleed-0.066
Token?"
Token ID1701
Feature activation+5.57e-03

Pos logit contributions
 grandparents+0.122
 destination+0.113
ari+0.113
 grandma+0.104
 grandmother+0.103
Neg logit contributions
bee-0.092
 ow-0.081
 sting-0.080
gging-0.079
 rust-0.075
Token Her
Token ID2332
Feature activation+4.48e-02

Pos logit contributions
 grandparents+0.013
 destination+0.013
 home+0.012
ari+0.012
 grandma+0.012
Neg logit contributions
bee-0.008
 ow-0.007
gging-0.007
 sting-0.006
 bleed-0.006
Token mom
Token ID1995
Feature activation-2.13e-02

Pos logit contributions
ari+0.074
 destination+0.071
 boarded+0.067
ris+0.064
eline+0.064
Neg logit contributions
 ow-0.091
bee-0.089
 sting-0.078
gging-0.073
 rust-0.073
Token.
Token ID13
Feature activation-2.85e-04

Pos logit contributions
 grandparents+0.029
 destination+0.027
 home+0.026
ari+0.026
 grandma+0.026
Neg logit contributions
bee-0.018
 ow-0.015
 bleed-0.015
 sting-0.015
gging-0.015
Token She
Token ID1375
Feature activation-9.07e-03

Pos logit contributions
bee+0.000
 ow+0.000
gging+0.000
 bleed+0.000
 sting+0.000
Neg logit contributions
 grandparents-0.001
 destination-0.001
 home-0.001
ari-0.001
 grandma-0.001
Token didn
Token ID1422
Feature activation-1.67e-02

Pos logit contributions
bee+0.017
 ow+0.015
gging+0.014
 bleed+0.014
 sensation+0.014
Neg logit contributions
 grandparents-0.017
 destination-0.017
 home-0.016
ari-0.016
 grandma-0.016
Token't
Token ID470
Feature activation-1.61e-02

Pos logit contributions
bee+0.028
 ow+0.024
gging+0.024
 sensation+0.024
 bleed+0.022
Neg logit contributions
 grandparents-0.035
 destination-0.034
 home-0.033
 grandma-0.032
ari-0.031
Token want
Token ID765
Feature activation+7.99e-03

Pos logit contributions
bee+0.030
 ow+0.028
gging+0.026
 sensation+0.026
 sting+0.025
Neg logit contributions
 home-0.029
 grandparents-0.028
 destination-0.027
ari-0.027
 grandma-0.026
Token him
Token ID683
Feature activation+1.15e-02

Pos logit contributions
 grandparents+0.012
 destination+0.012
ari+0.011
 home+0.011
 grandma+0.010
Neg logit contributions
bee-0.018
 ow-0.016
 sting-0.016
gging-0.016
 bleed-0.016
Token to
Token ID284
Feature activation-1.92e-02

Pos logit contributions
 grandparents+0.018
 destination+0.016
ari+0.015
 grandma+0.015
 home+0.014
Neg logit contributions
bee-0.026
 ow-0.024
 bleed-0.024
gging-0.024
 sting-0.023
Token have
Token ID423
Feature activation+1.75e-02

Pos logit contributions
bee+0.039
 sensation+0.033
 ow+0.033
gging+0.032
ez+0.032
Neg logit contributions
 home-0.035
 destination-0.034
 grandparents-0.033
 grandma-0.031
ari-0.029
Token it
Token ID340
Feature activation+2.44e-02

Pos logit contributions
 grandparents+0.037
ari+0.035
 destination+0.035
 home+0.032
 grandma+0.032
Neg logit contributions
bee-0.028
 ow-0.027
 sting-0.026
 bleed-0.025
gging-0.025
Token,
Token ID11
Feature activation-3.06e-02

Pos logit contributions
 grandparents+0.056
 destination+0.051
ari+0.051
 grandma+0.049
 grandmother+0.047
Neg logit contributions
bee-0.036
 bleed-0.032
 ow-0.032
 sting-0.031
gging-0.030
Token so
Token ID523
Feature activation-1.99e-02

Pos logit contributions
bee+0.066
gging+0.058
 ow+0.057
 bleed+0.056
 sting+0.056
Neg logit contributions
 grandparents-0.050
 home-0.049
 destination-0.047
 grandma-0.047
 online-0.041
Tokenill
Token ID359
Feature activation-2.26e-02

Pos logit contributions
 grandparents+0.028
 destination+0.027
 home+0.026
 grandma+0.025
ari+0.024
Neg logit contributions
bee-0.022
gging-0.018
 ow-0.018
 sting-0.017
 bleed-0.017
Token wondered
Token ID14028
Feature activation-8.48e-03

Pos logit contributions
bee+0.040
 bleed+0.037
gging+0.036
 sensation+0.036
 ow+0.036
Neg logit contributions
 grandparents-0.043
 destination-0.041
 home-0.040
ari-0.039
 grandma-0.038
Token what
Token ID644
Feature activation-3.84e-04

Pos logit contributions
bee+0.016
 bleed+0.014
gging+0.014
 ow+0.014
 sting+0.014
Neg logit contributions
 grandparents-0.016
 destination-0.015
 home-0.015
 grandma-0.015
ari-0.014
Token it
Token ID340
Feature activation-1.45e-02

Pos logit contributions
bee+0.001
gging+0.001
 ow+0.001
 bleed+0.001
 sting+0.001
Neg logit contributions
 grandparents-0.001
 destination-0.001
 home-0.001
ari-0.001
 grandma-0.001
Token would
Token ID561
Feature activation-3.17e-02

Pos logit contributions
bee+0.026
gging+0.024
 bleed+0.023
 sensation+0.022
 ow+0.022
Neg logit contributions
 grandparents-0.029
 destination-0.027
 home-0.027
 grandma-0.026
ari-0.026
Token be
Token ID307
Feature activation+2.95e-02

Pos logit contributions
bee+0.056
gging+0.051
ez+0.049
 ow+0.045
 bleed+0.045
Neg logit contributions
 home-0.065
 destination-0.063
 grandparents-0.062
 grandma-0.058
ari-0.053
Token like
Token ID588
Feature activation+2.73e-02

Pos logit contributions
 grandparents+0.073
 destination+0.070
ari+0.069
 grandma+0.065
 home+0.062
Neg logit contributions
bee-0.039
 ow-0.031
 sting-0.031
 sensation-0.031
 bleed-0.030
Token to
Token ID284
Feature activation-4.83e-02

Pos logit contributions
 grandparents+0.053
 destination+0.052
ari+0.049
 home+0.047
 grandma+0.046
Neg logit contributions
bee-0.049
 ow-0.043
 sting-0.041
 sensation-0.040
gging-0.040
Token fly
Token ID6129
Feature activation-1.23e-02

Pos logit contributions
bee+0.094
gging+0.091
ching+0.083
 sensation+0.082
ez+0.082
Neg logit contributions
 home-0.083
 destination-0.075
 grandma-0.075
 grandparents-0.073
 party-0.064
Token like
Token ID588
Feature activation-3.78e-03

Pos logit contributions
bee+0.025
gging+0.022
 ow+0.021
 bleed+0.021
 sensation+0.021
Neg logit contributions
 home-0.022
 grandparents-0.022
 destination-0.022
 grandma-0.020
ari-0.019
Token the
Token ID262
Feature activation-6.33e-03

Pos logit contributions
bee+0.007
gging+0.006
 ow+0.006
 bleed+0.006
 sting+0.006
Neg logit contributions
 grandparents-0.007
 destination-0.007
 home-0.007
 grandma-0.006
ari-0.006
Token to
Token ID284
Feature activation-1.32e-02

Pos logit contributions
 grandparents+0.006
 destination+0.005
 home+0.005
ari+0.005
 grandma+0.005
Neg logit contributions
bee-0.010
 ow-0.009
gging-0.009
 bleed-0.009
 sting-0.009
Token make
Token ID787
Feature activation-2.97e-02

Pos logit contributions
bee+0.026
gging+0.023
 ow+0.023
 bleed+0.022
 sensation+0.022
Neg logit contributions
 grandparents-0.024
 destination-0.023
 home-0.023
 grandma-0.022
ari-0.021
Token a
Token ID257
Feature activation+2.17e-02

Pos logit contributions
bee+0.060
gging+0.051
 bleed+0.050
 sting+0.050
 ow+0.050
Neg logit contributions
 grandparents-0.055
 home-0.054
 destination-0.052
 grandma-0.052
 shopping-0.045
Token bigger
Token ID5749
Feature activation-7.06e-04

Pos logit contributions
 grandparents+0.040
ari+0.037
 destination+0.037
 home+0.035
 grandma+0.034
Neg logit contributions
bee-0.040
 ow-0.037
 sting-0.036
 bleed-0.036
 sensation-0.035
Token castle
Token ID16669
Feature activation+1.67e-02

Pos logit contributions
bee+0.001
gging+0.001
 ow+0.001
 bleed+0.001
 sting+0.001
Neg logit contributions
 grandparents-0.001
 destination-0.001
 home-0.001
ari-0.001
 grandma-0.001
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
 grandparents+0.037
 destination+0.035
ari+0.034
 home+0.033
 grandma+0.033
Neg logit contributions
bee-0.026
 ow-0.022
 bleed-0.022
 sting-0.022
 sensation-0.021
Token So
Token ID1406
Feature activation+2.08e-02

Pos logit contributions
 grandparents+0.027
 destination+0.026
 home+0.025
ari+0.025
 grandma+0.024
Neg logit contributions
bee-0.011
gging-0.009
 ow-0.009
 bleed-0.009
 sensation-0.009
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
 grandparents+0.051
ari+0.049
 destination+0.048
 home+0.046
 online+0.045
Neg logit contributions
bee-0.026
 ow-0.021
 sensation-0.021
gging-0.021
 sting-0.020
Token he
Token ID339
Feature activation+3.00e-02

Pos logit contributions
 grandparents+0.032
 destination+0.031
ari+0.030
 home+0.030
 grandma+0.028
Neg logit contributions
bee-0.015
 ow-0.012
gging-0.012
 bleed-0.011
 sensation-0.011
Token joined
Token ID5399
Feature activation+5.17e-03

Pos logit contributions
 grandparents+0.062
 destination+0.060
 home+0.057
ari+0.055
 grandma+0.055
Neg logit contributions
bee-0.055
 sting-0.043
 spit-0.043
gging-0.042
 ow-0.042
Token more
Token ID517
Feature activation+7.12e-03

Pos logit contributions
 grandparents+0.011
 destination+0.010
ari+0.010
 home+0.010
 grandma+0.009
Neg logit contributions
bee-0.009
gging-0.008
 ow-0.008
 bleed-0.008
 sting-0.007
Token the
Token ID262
Feature activation+3.18e-03

Pos logit contributions
 grandparents+0.017
 destination+0.016
 home+0.015
ari+0.015
 grandma+0.015
Neg logit contributions
bee-0.016
gging-0.014
 ow-0.014
 sting-0.014
 bleed-0.014
Token best
Token ID1266
Feature activation+1.89e-02

Pos logit contributions
 grandparents+0.006
 destination+0.005
 home+0.005
ari+0.005
 grandma+0.005
Neg logit contributions
bee-0.006
 ow-0.006
gging-0.006
 bleed-0.006
 sting-0.006
Token paper
Token ID3348
Feature activation+1.91e-02

Pos logit contributions
 grandparents+0.031
 destination+0.029
ari+0.029
 home+0.028
 grandma+0.027
Neg logit contributions
bee-0.039
 ow-0.034
 sensation-0.034
 sting-0.033
gging-0.033
Token and
Token ID290
Feature activation+2.93e-02

Pos logit contributions
 grandparents+0.037
 destination+0.035
 home+0.034
ari+0.033
 grandma+0.033
Neg logit contributions
bee-0.035
 ow-0.030
 sting-0.029
gging-0.029
 bleed-0.029
Token even
Token ID772
Feature activation+2.26e-02

Pos logit contributions
 grandparents+0.057
 destination+0.056
ari+0.055
 home+0.053
 grandma+0.050
Neg logit contributions
bee-0.051
 ow-0.044
 rust-0.043
 sensation-0.043
 sting-0.043
Token the
Token ID262
Feature activation+8.94e-03

Pos logit contributions
 grandparents+0.043
 destination+0.041
ari+0.039
 home+0.038
 grandma+0.037
Neg logit contributions
bee-0.042
 ow-0.037
 sting-0.036
 sensation-0.036
gging-0.036
Token prett
Token ID46442
Feature activation+8.47e-04

Pos logit contributions
 grandparents+0.015
 destination+0.015
 home+0.014
ari+0.014
 grandma+0.013
Neg logit contributions
bee-0.018
 ow-0.016
gging-0.016
 sting-0.016
 sensation-0.016
Tokeniest
Token ID6386
Feature activation+1.07e-02

Pos logit contributions
 grandparents+0.002
 destination+0.002
 home+0.002
ari+0.002
 grandma+0.002
Neg logit contributions
bee-0.001
 ow-0.001
gging-0.001
 bleed-0.001
 sting-0.001
Token stickers
Token ID28568
Feature activation+3.37e-02

Pos logit contributions
 grandparents+0.020
 destination+0.019
 home+0.018
ari+0.018
 grandma+0.017
Neg logit contributions
bee-0.021
 ow-0.018
gging-0.017
 sensation-0.017
 sting-0.017
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
 grandparents+0.072
 destination+0.068
 grandma+0.063
ari+0.063
 home+0.063
Neg logit contributions
bee-0.057
 ow-0.045
 sting-0.045
 sensation-0.045
 bleed-0.044
Token They
Token ID1119
Feature activation+2.86e-02

Pos logit contributions
 grandparents+0.048
 destination+0.046
 home+0.045
ari+0.044
 grandma+0.042
Neg logit contributions
bee-0.031
gging-0.027
 ow-0.026
 sensation-0.026
 sting-0.025
Token.
Token ID13
Feature activation-2.31e-02

Pos logit contributions
bee+0.070
gging+0.064
 ow+0.062
 bleed+0.061
 mosquitoes+0.059
Neg logit contributions
 home-0.066
 grandparents-0.065
 destination-0.063
 grandma-0.062
 online-0.059
Token Anna
Token ID11735
Feature activation-2.06e-02

Pos logit contributions
bee+0.043
 sensation+0.036
gging+0.036
 bleed+0.035
 ow+0.034
Neg logit contributions
 grandparents-0.046
 grandma-0.044
 home-0.042
 destination-0.042
ari-0.040
Token said
Token ID531
Feature activation-3.08e-02

Pos logit contributions
bee+0.036
gging+0.032
 sensation+0.032
 ow+0.032
 bleed+0.032
Neg logit contributions
 grandparents-0.040
 destination-0.038
 home-0.037
 grandma-0.037
ari-0.036
Token,
Token ID11
Feature activation-3.05e-02

Pos logit contributions
bee+0.073
 sensation+0.069
gging+0.067
 bleed+0.067
its+0.064
Neg logit contributions
 home-0.042
 grandma-0.040
 grandparents-0.040
 destination-0.034
ari-0.034
Token "
Token ID366
Feature activation-5.66e-02

Pos logit contributions
bee+0.064
 bleed+0.059
 sensation+0.058
gging+0.058
its+0.058
Neg logit contributions
 grandparents-0.048
 grandma-0.045
 home-0.044
ari-0.042
 destination-0.042
TokenI
Token ID40
Feature activation-4.36e-03

Pos logit contributions
bee+0.117
gging+0.114
its+0.113
oles+0.111
 ow+0.110
Neg logit contributions
 grandparents-0.070
 grandma-0.067
ari-0.061
 Eve-0.060
 boarded-0.059
Token love
Token ID1842
Feature activation-1.95e-02

Pos logit contributions
bee+0.008
gging+0.007
 ow+0.007
 bleed+0.006
 sensation+0.006
Neg logit contributions
 grandparents-0.009
 destination-0.009
 home-0.009
ari-0.008
 grandma-0.008
Token you
Token ID345
Feature activation+1.25e-02

Pos logit contributions
bee+0.038
gging+0.035
 sensation+0.035
 bleed+0.033
 ow+0.032
Neg logit contributions
 grandparents-0.037
 home-0.035
 grandma-0.034
 destination-0.032
ari-0.030
Token,
Token ID11
Feature activation-2.18e-02

Pos logit contributions
 grandparents+0.027
 destination+0.026
ari+0.025
 home+0.024
 grandma+0.023
Neg logit contributions
bee-0.021
 ow-0.018
gging-0.017
 sting-0.017
 bleed-0.017
Token Luna
Token ID23694
Feature activation-1.76e-02

Pos logit contributions
bee+0.038
gging+0.036
 sensation+0.035
 bleed+0.032
its+0.032
Neg logit contributions
 grandparents-0.044
 grandma-0.043
 home-0.041
 destination-0.041
ari-0.039
Token.
Token ID13
Feature activation-3.05e-02

Pos logit contributions
bee+0.030
gging+0.027
 bleed+0.026
 sensation+0.026
 ow+0.026
Neg logit contributions
 grandparents-0.036
 home-0.034
 destination-0.034
 grandma-0.033
ari-0.032
Token.
Token ID13
Feature activation+2.87e-02

Pos logit contributions
bee+0.003
 ow+0.002
gging+0.002
 bleed+0.002
 sting+0.002
Neg logit contributions
 grandparents-0.004
 destination-0.004
 home-0.004
ari-0.003
 grandma-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.26e-05

Pos logit contributions
 destination+0.063
 grandparents+0.062
 home+0.061
ari+0.059
 online+0.057
Neg logit contributions
bee-0.038
 ow-0.036
gging-0.035
 bleed-0.034
 sting-0.033
TokenOne
Token ID3198
Feature activation-1.10e-02

Pos logit contributions
bee+0.000
 ow+0.000
gging+0.000
 bleed+0.000
 sting+0.000
Neg logit contributions
 grandparents-0.000
 destination-0.000
 home-0.000
ari-0.000
 grandma-0.000
Token day
Token ID1110
Feature activation-6.90e-03

Pos logit contributions
bee+0.028
gging+0.026
 bleed+0.026
 ow+0.025
 sting+0.025
Neg logit contributions
 grandparents-0.014
 home-0.013
 destination-0.013
 grandma-0.012
ari-0.012
Token,
Token ID11
Feature activation-4.05e-02

Pos logit contributions
bee+0.014
 bleed+0.012
gging+0.012
 ow+0.012
 sensation+0.012
Neg logit contributions
 grandparents-0.012
 destination-0.012
 home-0.011
 grandma-0.011
ari-0.011
Token a
Token ID257
Feature activation-2.78e-03

Pos logit contributions
bee+0.105
 bleed+0.094
gging+0.092
bees+0.090
ickets+0.089
Neg logit contributions
 home-0.054
 grandma-0.049
 grandparents-0.044
 online-0.041
 destination-0.039
Token boy
Token ID2933
Feature activation-2.24e-02

Pos logit contributions
bee+0.006
gging+0.005
 ow+0.005
 bleed+0.005
 sting+0.005
Neg logit contributions
 grandparents-0.005
 destination-0.005
 home-0.005
ari-0.004
 grandma-0.004
Token named
Token ID3706
Feature activation+7.90e-03

Pos logit contributions
bee+0.056
gging+0.052
 bleed+0.051
 sensation+0.050
 ow+0.050
Neg logit contributions
 grandparents-0.028
 home-0.027
 grandma-0.025
ari-0.024
 destination-0.023
Token Tom
Token ID4186
Feature activation-1.40e-02

Pos logit contributions
 grandparents+0.018
 destination+0.017
 home+0.016
ari+0.016
 grandma+0.015
Neg logit contributions
bee-0.012
 ow-0.011
gging-0.010
 bleed-0.010
 sting-0.010
Token went
Token ID1816
Feature activation+3.89e-02

Pos logit contributions
bee+0.029
gging+0.026
 bleed+0.025
 ow+0.025
 sensation+0.025
Neg logit contributions
 grandparents-0.024
 home-0.023
 destination-0.022
 grandma-0.022
ari-0.021
Token to
Token ID284
Feature activation+2.75e-02

Pos logit contributions
 grandparents+0.076
 destination+0.075
ari+0.064
 grandma+0.062
 grandmother+0.060
Neg logit contributions
bee-0.068
 bleed-0.062
 sting-0.061
ching-0.060
 rust-0.060
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
bee+0.005
 ow+0.004
gging+0.004
 bleed+0.004
 sting+0.004
Neg logit contributions
 grandparents-0.006
 destination-0.006
 home-0.006
ari-0.006
 grandma-0.006
Token<|endoftext|>
Token ID50256
Feature activation-7.52e-03

Pos logit contributions
 grandparents+0.024
 destination+0.024
 home+0.023
ari+0.022
 grandma+0.021
Neg logit contributions
bee-0.015
 ow-0.013
gging-0.013
 sting-0.012
 bleed-0.012
TokenAnna
Token ID31160
Feature activation+3.06e-02

Pos logit contributions
bee+0.016
gging+0.014
 ow+0.014
 bleed+0.014
 sting+0.014
Neg logit contributions
 grandparents-0.013
 destination-0.012
 home-0.012
ari-0.012
 grandma-0.011
Token liked
Token ID8288
Feature activation+1.59e-02

Pos logit contributions
 grandparents+0.073
 destination+0.073
 home+0.067
ari+0.067
 grandma+0.064
Neg logit contributions
bee-0.042
 ow-0.034
 sting-0.034
gging-0.033
ez-0.032
Token to
Token ID284
Feature activation-3.12e-02

Pos logit contributions
 destination+0.024
 grandparents+0.023
ari+0.022
 home+0.020
 grandma+0.019
Neg logit contributions
bee-0.036
 ow-0.032
gging-0.031
 sting-0.031
 bleed-0.031
Token read
Token ID1100
Feature activation-1.03e-02

Pos logit contributions
bee+0.069
gging+0.064
 ow+0.062
 bleed+0.062
 sensation+0.062
Neg logit contributions
 grandparents-0.048
 grandma-0.045
 home-0.044
 destination-0.042
 online-0.039
Token books
Token ID3835
Feature activation+3.83e-02

Pos logit contributions
bee+0.019
 bleed+0.017
gging+0.016
 sting+0.016
 ow+0.016
Neg logit contributions
 grandparents-0.020
 destination-0.019
 home-0.019
 grandma-0.019
ari-0.018
Token.
Token ID13
Feature activation-3.18e-02

Pos logit contributions
 grandparents+0.100
 destination+0.098
ari+0.090
 home+0.090
 grandma+0.087
Neg logit contributions
bee-0.044
gging-0.033
 rust-0.032
 ow-0.032
 sensation-0.031
Token She
Token ID1375
Feature activation+2.30e-02

Pos logit contributions
bee+0.060
 bleed+0.054
 sting+0.052
gging+0.052
 ow+0.050
Neg logit contributions
 grandparents-0.061
 grandma-0.060
 home-0.057
 online-0.055
 destination-0.055
Token had
Token ID550
Feature activation+2.38e-02

Pos logit contributions
 destination+0.055
 grandparents+0.054
 home+0.052
ari+0.050
 grandma+0.047
Neg logit contributions
bee-0.034
 ow-0.028
 sting-0.026
gging-0.025
 bleed-0.025
Token many
Token ID867
Feature activation+1.34e-02

Pos logit contributions
 destination+0.051
 grandparents+0.049
ari+0.047
 home+0.045
 online+0.041
Neg logit contributions
bee-0.038
 ow-0.035
 sting-0.032
 rust-0.031
 bleed-0.031
Token sorry
Token ID7926
Feature activation-3.55e-02

Pos logit contributions
bee+0.056
gging+0.045
 sensation+0.043
bees+0.043
 ow+0.042
Neg logit contributions
 grandparents-0.049
 home-0.049
 grandma-0.047
 destination-0.045
 online-0.043
Token.
Token ID13
Feature activation+9.99e-03

Pos logit contributions
bee+0.082
gging+0.073
 ow+0.069
 sensation+0.068
 slim+0.066
Neg logit contributions
 home-0.059
 grandma-0.055
 grandparents-0.055
 destination-0.049
 online-0.049
Token They
Token ID1119
Feature activation-1.02e-02

Pos logit contributions
 grandparents+0.024
 destination+0.024
 home+0.023
ari+0.022
 grandma+0.021
Neg logit contributions
bee-0.013
gging-0.011
 ow-0.011
 sting-0.011
 bleed-0.011
Token realized
Token ID6939
Feature activation-2.27e-04

Pos logit contributions
bee+0.020
gging+0.017
 sensation+0.017
 ow+0.017
 bleed+0.017
Neg logit contributions
 grandparents-0.019
 destination-0.018
 home-0.018
 grandma-0.017
ari-0.017
Token that
Token ID326
Feature activation-2.64e-02

Pos logit contributions
bee+0.000
gging+0.000
 ow+0.000
 bleed+0.000
 sting+0.000
Neg logit contributions
 grandparents-0.000
 destination-0.000
 home-0.000
ari-0.000
 grandma-0.000
Token they
Token ID484
Feature activation-6.58e-03

Pos logit contributions
bee+0.061
gging+0.056
 ow+0.053
 bleed+0.053
 sting+0.052
Neg logit contributions
 grandparents-0.042
 grandma-0.040
 home-0.038
 destination-0.034
 online-0.032
Token had
Token ID550
Feature activation+2.23e-02

Pos logit contributions
bee+0.012
gging+0.010
 ow+0.010
 sensation+0.010
 bleed+0.010
Neg logit contributions
 grandparents-0.013
 destination-0.012
 home-0.012
ari-0.012
 grandma-0.012
Token made
Token ID925
Feature activation-3.58e-03

Pos logit contributions
 grandparents+0.044
 destination+0.044
ari+0.043
 home+0.039
 online+0.038
Neg logit contributions
bee-0.038
 ow-0.035
 sting-0.035
 bleed-0.034
gging-0.034
Token a
Token ID257
Feature activation+8.91e-03

Pos logit contributions
bee+0.007
gging+0.006
 ow+0.006
 bleed+0.006
 sting+0.006
Neg logit contributions
 grandparents-0.007
 destination-0.006
 home-0.006
 grandma-0.006
ari-0.006
Token bad
Token ID2089
Feature activation-2.10e-02

Pos logit contributions
 grandparents+0.017
 destination+0.016
ari+0.015
 home+0.015
 grandma+0.015
Neg logit contributions
bee-0.017
 ow-0.015
 sting-0.015
 bleed-0.015
gging-0.014
Token choice
Token ID3572
Feature activation-5.31e-03

Pos logit contributions
bee+0.044
gging+0.041
 ow+0.039
 bleed+0.039
 rust+0.037
Neg logit contributions
 grandparents-0.036
 home-0.035
 destination-0.034
 grandma-0.034
 cousin-0.030
Token,
Token ID11
Feature activation-3.66e-02

Pos logit contributions
 grandparents+0.042
 destination+0.040
ari+0.038
 home+0.038
 grandma+0.037
Neg logit contributions
bee-0.023
 ow-0.019
 sting-0.019
 bleed-0.019
gging-0.018
Token like
Token ID588
Feature activation-2.53e-02

Pos logit contributions
bee+0.073
gging+0.066
ez+0.066
ching+0.066
 ow+0.065
Neg logit contributions
 grandparents-0.061
 grandma-0.060
 home-0.057
 destination-0.056
 online-0.053
Token houses
Token ID7777
Feature activation+3.25e-03

Pos logit contributions
bee+0.057
gging+0.052
 bleed+0.050
 sensation+0.050
 sting+0.050
Neg logit contributions
 grandparents-0.038
 home-0.037
 destination-0.036
 grandma-0.036
 online-0.033
Token,
Token ID11
Feature activation-3.67e-02

Pos logit contributions
 grandparents+0.007
 destination+0.006
 home+0.006
ari+0.006
 grandma+0.006
Neg logit contributions
bee-0.006
 ow-0.005
gging-0.005
 bleed-0.005
 sting-0.005
Token towers
Token ID18028
Feature activation+8.49e-03

Pos logit contributions
bee+0.089
gging+0.084
 sensation+0.082
 sting+0.082
 bleed+0.082
Neg logit contributions
 grandparents-0.044
 grandma-0.043
 destination-0.040
 home-0.040
 online-0.036
Token,
Token ID11
Feature activation-2.36e-02

Pos logit contributions
 grandparents+0.017
 destination+0.016
 home+0.016
ari+0.016
 grandma+0.015
Neg logit contributions
bee-0.015
 ow-0.013
gging-0.013
 sting-0.012
 bleed-0.012
Token and
Token ID290
Feature activation-3.31e-02

Pos logit contributions
bee+0.052
gging+0.048
 bleed+0.047
 sting+0.047
 sensation+0.047
Neg logit contributions
 grandparents-0.035
 destination-0.033
 grandma-0.033
 home-0.033
ari-0.031
Token cars
Token ID5006
Feature activation+7.89e-03

Pos logit contributions
bee+0.076
gging+0.069
 sensation+0.067
 sting+0.067
 bleed+0.066
Neg logit contributions
 grandparents-0.050
 grandma-0.045
 home-0.045
 destination-0.045
 online-0.041
Token.
Token ID13
Feature activation-3.95e-02

Pos logit contributions
 grandparents+0.019
 destination+0.018
 home+0.017
ari+0.017
 grandma+0.017
Neg logit contributions
bee-0.011
 ow-0.009
gging-0.009
 bleed-0.009
 sting-0.009
Token They
Token ID1119
Feature activation+5.59e-04

Pos logit contributions
bee+0.085
 bleed+0.082
gging+0.078
 sensation+0.077
 ow+0.075
Neg logit contributions
 home-0.061
 grandparents-0.060
 destination-0.059
 grandma-0.058
ari-0.055
Token had
Token ID550
Feature activation+1.28e-02

Pos logit contributions
 grandparents+0.001
 destination+0.001
 home+0.001
ari+0.001
 grandma+0.001
Neg logit contributions
bee-0.001
 ow-0.001
gging-0.001
 bleed-0.001
 sting-0.001
Token.
Token ID13
Feature activation-5.73e-02

Pos logit contributions
bee+0.065
gging+0.062
 sensation+0.059
 bleed+0.058
 ow+0.057
Neg logit contributions
 home-0.064
 grandparents-0.063
 grandma-0.060
ari-0.059
 destination-0.059
Token Butter
Token ID18971
Feature activation-7.05e-02

Pos logit contributions
bee+0.109
 sensation+0.099
 rust+0.096
gging+0.095
ling+0.092
Neg logit contributions
 grandma-0.103
 grandparents-0.101
 Grand-0.098
 cousin-0.092
ari-0.091
Tokenflies
Token ID27959
Feature activation-1.01e-01

Pos logit contributions
 sensation+0.102
bee+0.095
 bleed+0.095
Leave+0.095
gging+0.093
Neg logit contributions
 home-0.139
 grandparents-0.135
ari-0.133
 destination-0.131
 grandma-0.131
Token have
Token ID423
Feature activation-4.61e-02

Pos logit contributions
bee+0.184
 sensation+0.167
gging+0.161
 gag+0.155
 ow+0.153
Neg logit contributions
 home-0.171
 destination-0.157
 grandparents-0.153
 online-0.144
 grandma-0.140
Token antennas
Token ID43813
Feature activation-2.51e-02

Pos logit contributions
bee+0.090
gging+0.080
 sensation+0.078
 bleed+0.075
its+0.072
Neg logit contributions
 home-0.084
 grandparents-0.081
 grandma-0.076
 destination-0.075
 presents-0.069
Token and
Token ID290
Feature activation-7.51e-02

Pos logit contributions
bee+0.050
gging+0.044
 ow+0.044
 bleed+0.044
 sensation+0.044
Neg logit contributions
 grandparents-0.043
 home-0.042
 destination-0.041
ari-0.039
 grandma-0.039
Token legs
Token ID7405
Feature activation-5.66e-02

Pos logit contributions
bee+0.128
gging+0.117
 sensation+0.117
its+0.108
hing+0.107
Neg logit contributions
 grandparents-0.141
 home-0.139
 grandma-0.137
 destination-0.125
 presents-0.123
Token.
Token ID13
Feature activation-4.25e-02

Pos logit contributions
bee+0.116
 sensation+0.107
 ow+0.106
gging+0.104
 bleed+0.100
Neg logit contributions
 home-0.095
 grandparents-0.085
 grandma-0.082
 destination-0.081
 online-0.080
Token Your
Token ID3406
Feature activation-2.73e-02

Pos logit contributions
bee+0.078
 sensation+0.071
gging+0.067
 bleed+0.066
 rust+0.064
Neg logit contributions
 grandparents-0.081
 grandma-0.077
ari-0.074
 home-0.072
 destination-0.072
Token bug
Token ID5434
Feature activation-5.76e-02

Pos logit contributions
bee+0.050
gging+0.046
 bleed+0.043
 sensation+0.043
 ow+0.043
Neg logit contributions
 grandparents-0.053
 home-0.052
 destination-0.050
 grandma-0.049
ari-0.044
Token has
Token ID468
Feature activation-4.32e-02

Pos logit contributions
bee+0.098
 sensation+0.096
gging+0.090
 bleed+0.083
Run+0.081
Neg logit contributions
 home-0.106
 grandparents-0.105
 destination-0.102
 grandma-0.100
ari-0.098
Token eat
Token ID4483
Feature activation-1.14e-02

Pos logit contributions
 grandparents+0.010
 destination+0.010
 home+0.009
ari+0.009
 grandma+0.009
Neg logit contributions
bee-0.008
 ow-0.007
gging-0.006
 bleed-0.006
 sting-0.006
Token,
Token ID11
Feature activation+1.39e-02

Pos logit contributions
bee+0.023
 bleed+0.021
gging+0.021
 ow+0.020
 sting+0.020
Neg logit contributions
 grandparents-0.019
 home-0.019
 destination-0.019
 grandma-0.018
ari-0.017
Token Lily
Token ID20037
Feature activation+1.34e-02

Pos logit contributions
 grandparents+0.027
 destination+0.027
ari+0.026
 home+0.025
 grandma+0.023
Neg logit contributions
bee-0.025
 ow-0.022
 sting-0.021
gging-0.021
 sensation-0.021
Token noticed
Token ID6810
Feature activation-2.64e-02

Pos logit contributions
 grandparents+0.026
 destination+0.025
 home+0.024
ari+0.024
 grandma+0.023
Neg logit contributions
bee-0.024
 ow-0.021
gging-0.021
 sting-0.021
 bleed-0.020
Token a
Token ID257
Feature activation-1.97e-02

Pos logit contributions
bee+0.055
 bleed+0.050
gging+0.048
 sting+0.047
 sensation+0.047
Neg logit contributions
 grandparents-0.046
 grandma-0.044
 home-0.044
 destination-0.042
 online-0.038
Token squirrel
Token ID33039
Feature activation-9.22e-02

Pos logit contributions
bee+0.040
gging+0.036
 bleed+0.035
 ow+0.034
 sensation+0.034
Neg logit contributions
 grandparents-0.035
 destination-0.033
 home-0.033
 grandma-0.031
ari-0.030
Token watching
Token ID4964
Feature activation-6.43e-02

Pos logit contributions
 sensation+0.180
gging+0.180
 ow+0.173
bee+0.173
Ble+0.172
Neg logit contributions
 home-0.140
 shopping-0.120
ari-0.115
 bought-0.105
 online-0.103
Token them
Token ID606
Feature activation-5.05e-02

Pos logit contributions
bee+0.137
 bleed+0.124
 sensation+0.122
gging+0.122
its+0.121
Neg logit contributions
 home-0.110
 grandma-0.098
 grandparents-0.090
 online-0.084
 destination-0.083
Token from
Token ID422
Feature activation-8.13e-03

Pos logit contributions
bee+0.112
 ow+0.102
gging+0.100
 sensation+0.097
ez+0.093
Neg logit contributions
 home-0.085
 destination-0.078
 grandparents-0.073
 online-0.068
 shopping-0.068
Token a
Token ID257
Feature activation+7.13e-04

Pos logit contributions
bee+0.016
gging+0.014
 ow+0.014
 bleed+0.013
 sting+0.013
Neg logit contributions
 grandparents-0.015
 home-0.015
 destination-0.014
 grandma-0.014
ari-0.013
Token nearby
Token ID6716
Feature activation+6.98e-03

Pos logit contributions
 grandparents+0.001
 destination+0.001
 home+0.001
ari+0.001
 grandma+0.001
Neg logit contributions
bee-0.001
 ow-0.001
gging-0.001
 bleed-0.001
 sting-0.001
Token edge
Token ID5743
Feature activation-2.68e-02

Pos logit contributions
bee+0.018
gging+0.016
 bleed+0.015
 ow+0.015
 sting+0.015
Neg logit contributions
 grandparents-0.018
 destination-0.018
 home-0.017
 grandma-0.016
ari-0.016
Token of
Token ID286
Feature activation-2.67e-03

Pos logit contributions
bee+0.059
 bleed+0.053
gging+0.053
 ow+0.051
 sting+0.051
Neg logit contributions
 destination-0.042
 home-0.042
 grandparents-0.041
ari-0.039
 grandma-0.038
Token the
Token ID262
Feature activation+7.82e-03

Pos logit contributions
bee+0.005
gging+0.005
 ow+0.005
 bleed+0.005
 sting+0.005
Neg logit contributions
 grandparents-0.005
 destination-0.005
 home-0.005
 grandma-0.004
ari-0.004
Token branch
Token ID8478
Feature activation-2.14e-02

Pos logit contributions
 grandparents+0.017
 destination+0.016
 home+0.015
ari+0.015
 grandma+0.015
Neg logit contributions
bee-0.013
 ow-0.011
 sting-0.011
gging-0.011
 bleed-0.011
Token he
Token ID339
Feature activation-7.75e-03

Pos logit contributions
bee+0.045
gging+0.040
 ow+0.040
 bleed+0.040
 sting+0.038
Neg logit contributions
 destination-0.036
 home-0.035
 grandparents-0.035
ari-0.033
 grandma-0.032
Token stopped
Token ID5025
Feature activation-8.45e-02

Pos logit contributions
bee+0.015
gging+0.013
 ow+0.013
 bleed+0.013
 sensation+0.013
Neg logit contributions
 grandparents-0.014
 destination-0.014
 home-0.013
ari-0.013
 grandma-0.013
Token to
Token ID284
Feature activation-4.70e-02

Pos logit contributions
bee+0.218
gging+0.198
its+0.197
 ow+0.194
 gag+0.191
Neg logit contributions
 home-0.121
 destination-0.091
 shopping-0.084
 grandparents-0.076
 grandma-0.076
Token take
Token ID1011
Feature activation-7.33e-03

Pos logit contributions
bee+0.103
gging+0.092
oles+0.089
ez+0.087
 ow+0.086
Neg logit contributions
 home-0.076
 destination-0.072
 grandparents-0.065
 grandma-0.065
 shopping-0.059
Token a
Token ID257
Feature activation-1.93e-02

Pos logit contributions
bee+0.014
gging+0.012
 ow+0.012
 bleed+0.011
 sting+0.011
Neg logit contributions
 grandparents-0.014
 destination-0.014
 home-0.014
 grandma-0.013
ari-0.013
Token look
Token ID804
Feature activation-2.80e-03

Pos logit contributions
bee+0.036
gging+0.032
 ow+0.030
 sensation+0.030
 bleed+0.029
Neg logit contributions
 home-0.038
 grandparents-0.037
 destination-0.037
 grandma-0.034
ari-0.033
Token around
Token ID1088
Feature activation-3.25e-02

Pos logit contributions
bee+0.005
 ow+0.005
gging+0.005
 bleed+0.004
 sting+0.004
Neg logit contributions
 grandparents-0.005
 destination-0.005
 home-0.005
ari-0.005
 grandma-0.005
Token\n
Token ID198
Feature activation-1.25e-02

Pos logit contributions
 grandparents+0.017
 destination+0.016
 home+0.015
ari+0.015
 grandma+0.015
Neg logit contributions
bee-0.014
gging-0.012
 ow-0.012
 bleed-0.012
 sting-0.012
TokenSuddenly
Token ID38582
Feature activation-1.74e-02

Pos logit contributions
bee+0.023
 ow+0.021
 bleed+0.020
 sting+0.020
gging+0.019
Neg logit contributions
 grandparents-0.024
 destination-0.023
 home-0.023
ari-0.022
 grandma-0.022
Token he
Token ID339
Feature activation-1.83e-02

Pos logit contributions
bee+0.037
 bleed+0.033
gging+0.033
 sensation+0.032
its+0.032
Neg logit contributions
 grandparents-0.029
 home-0.028
 destination-0.028
 grandma-0.027
ari-0.025
Token heard
Token ID2982
Feature activation-3.48e-02

Pos logit contributions
bee+0.037
gging+0.034
 bleed+0.034
 sensation+0.034
 ow+0.032
Neg logit contributions
 grandparents-0.031
 destination-0.030
 home-0.029
 grandma-0.028
ari-0.027
Token a
Token ID257
Feature activation-7.45e-02

Pos logit contributions
bee+0.081
 bleed+0.072
gging+0.071
 sting+0.069
its+0.069
Neg logit contributions
 grandparents-0.053
 home-0.051
 grandma-0.051
 destination-0.047
ari-0.043
Token sound
Token ID2128
Feature activation-7.74e-02

Pos logit contributions
bee+0.143
ez+0.135
gging+0.132
ped+0.123
ched+0.122
Neg logit contributions
 grandparents-0.129
 home-0.121
 grandma-0.121
 destination-0.119
 grand-0.114
Token.
Token ID13
Feature activation-3.48e-02

Pos logit contributions
bee+0.174
 gag+0.172
 bleed+0.171
gging+0.169
pill+0.167
Neg logit contributions
 home-0.090
 downstairs-0.081
 online-0.081
 grandma-0.075
 upstairs-0.074
Token It
Token ID632
Feature activation-7.01e-02

Pos logit contributions
bee+0.057
 bleed+0.048
 sting+0.047
gging+0.047
its+0.046
Neg logit contributions
 grandparents-0.074
 grandma-0.072
 home-0.072
 destination-0.070
ari-0.069
Token was
Token ID373
Feature activation-4.03e-02

Pos logit contributions
bee+0.145
gging+0.140
 conclusions+0.137
 mosquitoes+0.130
 bleed+0.128
Neg logit contributions
 grandparents-0.098
 destination-0.098
 home-0.095
 grandma-0.094
 boarded-0.087
Token quick
Token ID2068
Feature activation-5.23e-02

Pos logit contributions
bee+0.094
gging+0.084
its+0.084
 bleed+0.079
ez+0.078
Neg logit contributions
 home-0.063
 grandma-0.061
 grandparents-0.061
 destination-0.054
 Grand-0.053
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
bee+0.117
gging+0.113
its+0.111
 bleed+0.109
Ble+0.107
Neg logit contributions
 home-0.080
 grandparents-0.073
 destination-0.068
 grandma-0.067
ari-0.062
Token He
Token ID679
Feature activation-3.55e-02

Pos logit contributions
bee+0.035
gging+0.029
 bleed+0.029
 ow+0.029
 sting+0.029
Neg logit contributions
 grandparents-0.032
 grandma-0.030
 home-0.030
 destination-0.029
ari-0.028
Token liked
Token ID8288
Feature activation-5.77e-02

Pos logit contributions
bee+0.064
 bleed+0.059
gging+0.058
 sensation+0.058
 ow+0.057
Neg logit contributions
 grandparents-0.065
 grandma-0.060
 destination-0.059
 home-0.056
ari-0.055
Token bees
Token ID17002
Feature activation-8.76e-02

Pos logit contributions
bee+0.136
gging+0.124
 bleed+0.123
its+0.117
ching+0.117
Neg logit contributions
 home-0.081
 grandma-0.080
 grandparents-0.077
 Grand-0.066
 destination-0.064
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
bee+0.177
Ble+0.169
 sensation+0.169
gging+0.167
Run+0.166
Neg logit contributions
 home-0.127
 grandparents-0.107
 destination-0.104
 online-0.103
 grandma-0.101
Token They
Token ID1119
Feature activation-3.81e-02

Pos logit contributions
bee+0.027
gging+0.022
 bleed+0.022
 sting+0.022
 ow+0.022
Neg logit contributions
 grandparents-0.031
 grandma-0.029
 home-0.029
 destination-0.029
ari-0.028
Token made
Token ID925
Feature activation-7.66e-02

Pos logit contributions
bee+0.069
gging+0.066
 sensation+0.065
 bleed+0.062
ching+0.061
Neg logit contributions
 grandparents-0.067
 grandma-0.061
 home-0.060
 destination-0.059
 boarded-0.057
Token honey
Token ID12498
Feature activation-4.69e-02

Pos logit contributions
bee+0.170
gging+0.153
Leave+0.144
 bleed+0.138
 sensation+0.138
Neg logit contributions
 home-0.127
 grandma-0.116
 grandparents-0.106
 online-0.094
 invitations-0.094
Token and
Token ID290
Feature activation-8.51e-02

Pos logit contributions
bee+0.094
gging+0.088
ez+0.082
 ow+0.081
Run+0.081
Neg logit contributions
 home-0.080
 grandparents-0.077
 destination-0.076
 grandma-0.073
ari-0.070
Token whist
Token ID32262
Feature activation-4.29e-02

Pos logit contributions
gging+0.139
bee+0.137
hing+0.126
its+0.125
ching+0.123
Neg logit contributions
 home-0.152
 grandma-0.148
 grandparents-0.145
 presents-0.135
 grand-0.132
Tokenled
Token ID992
Feature activation-5.12e-02

Pos logit contributions
 bleed+0.065
bee+0.065
 ow+0.061
 sensation+0.060
 sting+0.059
Neg logit contributions
 home-0.079
 grandparents-0.077
 grandma-0.072
ari-0.071
 destination-0.071
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
bee+0.116
gging+0.104
 ow+0.098
its+0.097
 bleed+0.096
Neg logit contributions
 home-0.085
 grandma-0.075
 grandparents-0.075
 destination-0.073
 online-0.068