TOP ACTIVATIONS
MAX = 0.165

elephants , lions and ac rob ats . They are amazed
, roses , and sun fl owers . One
going to fly in an aer opl ane . She jumped
clown s juggling and ac rob ats doing flips . But
room and spotted an ash tr ay . It had lots
, roses , and sun fl owers . They waved to
play with lamps and ash tr ays anymore .
take a ride in a aer opl ane . He dream
in and found an ash tr ay . Lucy thought it
dressed up in her fan ci est clothes and asked her
they also see an ash tr ay on the table .
room and saw an ash tr ay . She remembered her
could even turn flip - fl ops ! Everyone was very
saw clown s and ac rob ats and lions and elephants
chains , glitter y snow fl akes , and a picture
She saw lots of snow fl akes , glitter ing in
cigarette and using an ash tr ay . L
cave and saw an ash tr ay by the shore .
he sp ied an ash tr ay . He picked up
a garden full of wild fl owers . He decided to

MIN ACTIVATIONS
MIN = -0.157

! Alice was so thrilled as she took a big bite
fly again and all cheered as the butterfly flew away .
felt so happy and grateful as she skipped home . She
The other raven s cheered as the winner walked away proudly
Jim felt satisfied as he went back to his
The sock seemed so happy as she skipped around .
The farmer was very pleased as the elephants finally made it
<|endoftext|> The sun was shining as Lucy stepped out into the
said goodbye . Luke smiled as he watched the deaf man
oak tree felt so happy as it sailed across the lake
k ite looked so adorable as it shot up and down
. Ruby was so sad as she had lost her sweet
The rock felt so happy as it z ipped around the
. His eyes spark led as he watched the hero 's
, the box looked good as new . The
new ! <|endoftext|> Jimmy smiled as he raced down the hill
and the hero was cheered as a hero .
The fox was so surprised as he had never seen a
for it !" Jack smiled as he handed money to the
noise . It seemed happy as it flew around .

INTERVAL 0.084 - 0.165
CONTAINS 0.040%

smiled and said , " Well , next time , bring
O w , ow , ow ! It hurts !" Ben
!" shouted Sam , " we can buy something with this
he never squeezed an ash tr ay or anything else again
O w , ow , ow !" Ben cried . He

INTERVAL 0.004 - 0.084
CONTAINS 42.044%

down . They saw big trees , blue rivers , and
fell to the ground . She felt a sharp pain and
in circles . She was having so much fun ! Then
Her friend said yes and gave her the violin .
a girl named Lily . She loved taking pictures with her

INTERVAL -0.077 - 0.004
CONTAINS 57.753%

" Free " on it . She hoped that
L ily and Ben are happy . They bow
have help from his friend . Together , they went down
time . He had to delay some of his work until
he ran away . P ete quickly ran to

INTERVAL -0.157 - -0.077
CONTAINS 0.163%

while holding them as tight as he could , so they
. But as soon as he reached the entrance ,
She understands . She knows Ben is curious . He likes
so cold outside that my breath froze on my face !"
and played with the beetle as they looked for more da
Token elephants
Token ID25368
Feature activation+5.67e-02

Pos logit contributions
 Everything+0.035
 earthquake+0.033
 emergency+0.033
loe+0.032
py+0.032
Neg logit contributions
 than-0.021
ho-0.021
s-0.019
 far-0.018
 unt-0.018
Token,
Token ID11
Feature activation+2.07e-02

Pos logit contributions
 Everything+0.118
 earthquake+0.110
 emergency+0.110
py+0.108
 ambulance+0.107
Neg logit contributions
ho-0.079
 than-0.077
bel-0.070
s-0.069
outing-0.069
Token lions
Token ID31420
Feature activation+5.93e-02

Pos logit contributions
 Everything+0.051
 earthquake+0.048
 emergency+0.048
loe+0.047
py+0.046
Neg logit contributions
 than-0.024
ho-0.023
s-0.020
 far-0.019
 unt-0.019
Token and
Token ID290
Feature activation+2.23e-02

Pos logit contributions
 Everything+0.128
 earthquake+0.120
 emergency+0.119
py+0.117
 ambulance+0.116
Neg logit contributions
ho-0.080
 than-0.079
s-0.070
bel-0.067
 unt-0.067
Token ac
Token ID936
Feature activation+9.86e-02

Pos logit contributions
 Everything+0.042
loe+0.041
 earthquake+0.039
 emergency+0.039
 operation+0.037
Neg logit contributions
 than-0.037
ho-0.036
s-0.034
 far-0.034
bel-0.033
Tokenrob
Token ID22609
Feature activation+1.65e-01

Pos logit contributions
 Everything+0.080
 earthquake+0.066
 emergency+0.065
py+0.064
rell+0.062
Neg logit contributions
ho-0.264
 than-0.261
bel-0.245
outing-0.245
s-0.245
Tokenats
Token ID1381
Feature activation-1.47e-03

Pos logit contributions
 Everything+0.252
 emergency+0.235
py+0.234
 earthquake+0.231
rell+0.230
Neg logit contributions
ho-0.321
 than-0.305
bel-0.288
 unt-0.282
s-0.279
Token.
Token ID13
Feature activation+7.01e-03

Pos logit contributions
ho+0.001
 than+0.001
s+0.001
 unt+0.001
 far+0.001
Neg logit contributions
 Everything-0.004
 earthquake-0.004
 emergency-0.004
py-0.004
loe-0.004
Token They
Token ID1119
Feature activation+3.87e-03

Pos logit contributions
 earthquake+0.015
py+0.015
 Everything+0.015
 emergency+0.015
loe+0.014
Neg logit contributions
 than-0.009
ho-0.009
s-0.009
 far-0.008
bel-0.008
Token are
Token ID389
Feature activation+2.84e-02

Pos logit contributions
 Everything+0.008
 earthquake+0.008
 emergency+0.007
py+0.007
 ambulance+0.007
Neg logit contributions
ho-0.006
 than-0.006
s-0.005
 unt-0.005
 far-0.005
Token amazed
Token ID24562
Feature activation-8.71e-03

Pos logit contributions
 Everything+0.061
 earthquake+0.057
 emergency+0.056
loe+0.055
py+0.055
Neg logit contributions
 than-0.040
ho-0.039
s-0.034
 far-0.033
 unt-0.033
Token,
Token ID11
Feature activation+1.49e-02

Pos logit contributions
 Everything+0.011
 bears+0.010
rell+0.010
py+0.010
 ambulance+0.010
Neg logit contributions
ho-0.008
outing-0.008
 than-0.007
bel-0.007
ater-0.007
Token roses
Token ID42152
Feature activation+4.00e-02

Pos logit contributions
 Everything+0.031
 earthquake+0.029
py+0.028
 emergency+0.028
 thunder+0.028
Neg logit contributions
ho-0.021
 than-0.021
s-0.019
outing-0.019
ater-0.019
Token,
Token ID11
Feature activation+1.55e-02

Pos logit contributions
 Everything+0.065
rell+0.065
py+0.063
 bears+0.062
 Beth+0.059
Neg logit contributions
 than-0.063
bel-0.061
outing-0.061
ho-0.061
ater-0.061
Token and
Token ID290
Feature activation+1.37e-02

Pos logit contributions
 Everything+0.034
 bears+0.032
py+0.032
 thunder+0.031
rell+0.031
Neg logit contributions
ho-0.019
 than-0.019
outing-0.018
ater-0.017
bel-0.017
Token sun
Token ID4252
Feature activation+3.55e-02

Pos logit contributions
 Everything+0.029
 earthquake+0.027
 emergency+0.027
loe+0.026
py+0.026
Neg logit contributions
ho-0.020
 than-0.019
s-0.017
 unt-0.017
bel-0.017
Tokenfl
Token ID2704
Feature activation+1.57e-01

Pos logit contributions
 Everything+0.041
 earthquake+0.036
 emergency+0.036
py+0.034
 ambulance+0.033
Neg logit contributions
ho-0.085
 than-0.084
s-0.080
bel-0.077
 unt-0.077
Tokenowers
Token ID3618
Feature activation+1.42e-02

Pos logit contributions
 Everything+0.281
 earthquake+0.265
 emergency+0.265
 ambulance+0.249
 hurricane+0.246
Neg logit contributions
ho-0.285
 than-0.277
s-0.267
bel-0.245
ater-0.243
Token.
Token ID13
Feature activation+2.88e-03

Pos logit contributions
 Everything+0.025
rell+0.025
py+0.025
 bears+0.025
 lightning+0.024
Neg logit contributions
ho-0.019
outing-0.019
ater-0.019
bel-0.018
ike-0.018
Token
Token ID198
Feature activation+5.66e-03

Pos logit contributions
 earthquake+0.006
 emergency+0.006
py+0.006
 hurricane+0.006
 ambulance+0.006
Neg logit contributions
s-0.003
 than-0.003
ho-0.003
 unt-0.003
 far-0.003
Token
Token ID198
Feature activation+8.23e-03

Pos logit contributions
 Everything+0.011
loe+0.010
 emergency+0.009
py+0.009
 earthquake+0.009
Neg logit contributions
ho-0.009
 than-0.009
ater-0.008
outing-0.008
bel-0.008
TokenOne
Token ID3198
Feature activation-1.03e-02

Pos logit contributions
 Everything+0.013
 emergency+0.012
 operation+0.012
py+0.012
 earthquake+0.012
Neg logit contributions
ho-0.016
 than-0.015
bel-0.015
ater-0.014
outing-0.014
Token going
Token ID1016
Feature activation+2.60e-02

Pos logit contributions
 Everything+0.016
 earthquake+0.015
 emergency+0.015
py+0.014
 ambulance+0.014
Neg logit contributions
ho-0.014
 than-0.014
s-0.013
 unt-0.013
bel-0.012
Token to
Token ID284
Feature activation-1.63e-02

Pos logit contributions
 Everything+0.077
 earthquake+0.074
loe+0.073
 emergency+0.073
py+0.073
Neg logit contributions
 than-0.016
ho-0.014
s-0.012
 far-0.011
bel-0.011
Token fly
Token ID6129
Feature activation-2.77e-02

Pos logit contributions
ho+0.023
 than+0.023
 unt+0.021
s+0.020
 far+0.020
Neg logit contributions
 Everything-0.034
 earthquake-0.031
 emergency-0.031
py-0.031
loe-0.030
Token in
Token ID287
Feature activation-4.41e-04

Pos logit contributions
ho+0.039
outing+0.036
 than+0.034
ater+0.033
ucker+0.033
Neg logit contributions
 Everything-0.056
 thunder-0.053
py-0.053
 earthquake-0.052
 lightning-0.052
Token an
Token ID281
Feature activation+4.04e-02

Pos logit contributions
 than+0.001
ho+0.000
 far+0.000
 unt+0.000
s+0.000
Neg logit contributions
 Everything-0.001
py-0.001
 earthquake-0.001
 emergency-0.001
rell-0.001
Token aer
Token ID9551
Feature activation+1.55e-01

Pos logit contributions
 Everything+0.076
loe+0.069
 bears+0.067
py+0.067
 Wolf+0.067
Neg logit contributions
 than-0.066
 far-0.061
ho-0.060
 unt-0.058
ater-0.056
Tokenopl
Token ID20106
Feature activation+8.00e-02

Pos logit contributions
 Everything+0.083
 earthquake+0.058
 emergency+0.057
py+0.050
 ambulance+0.049
Neg logit contributions
ho-0.461
 than-0.460
s-0.437
ater-0.430
 unt-0.429
Tokenane
Token ID1531
Feature activation+4.73e-02

Pos logit contributions
 Everything+0.096
 earthquake+0.085
 emergency+0.082
py+0.080
loe+0.080
Neg logit contributions
 than-0.185
ho-0.185
s-0.171
 unt-0.170
bel-0.167
Token.
Token ID13
Feature activation+3.63e-03

Pos logit contributions
 Everything+0.102
 earthquake+0.100
 emergency+0.099
 operation+0.098
 thunder+0.097
Neg logit contributions
ho-0.060
outing-0.053
 than-0.052
ucker-0.050
ater-0.048
Token She
Token ID1375
Feature activation+2.28e-02

Pos logit contributions
 Everything+0.006
 earthquake+0.006
 emergency+0.006
py+0.006
 ambulance+0.006
Neg logit contributions
ho-0.006
 than-0.006
s-0.006
bel-0.006
 unt-0.006
Token jumped
Token ID11687
Feature activation-2.65e-02

Pos logit contributions
 Everything+0.047
py+0.043
 lightning+0.042
 bears+0.042
 thunder+0.042
Neg logit contributions
 than-0.032
ho-0.032
outing-0.029
ater-0.028
bel-0.027
Token clown
Token ID25573
Feature activation+1.16e-02

Pos logit contributions
 Everything+0.010
 earthquake+0.010
 emergency+0.010
rell+0.010
py+0.009
Neg logit contributions
ho-0.005
 than-0.005
 far-0.005
s-0.005
bel-0.005
Tokens
Token ID82
Feature activation-2.42e-03

Pos logit contributions
 Everything+0.036
 earthquake+0.034
 emergency+0.034
py+0.034
 ambulance+0.033
Neg logit contributions
 than-0.005
ho-0.005
ater-0.003
 unt-0.003
bel-0.003
Token juggling
Token ID48097
Feature activation-3.74e-02

Pos logit contributions
ho+0.003
 than+0.003
s+0.003
ater+0.003
 unt+0.003
Neg logit contributions
 Everything-0.005
 earthquake-0.005
 emergency-0.005
py-0.005
 ambulance-0.005
Token and
Token ID290
Feature activation-2.13e-03

Pos logit contributions
ho+0.056
 than+0.053
outing+0.049
s+0.048
 unt+0.048
Neg logit contributions
 Everything-0.075
 earthquake-0.069
 emergency-0.069
py-0.068
 bears-0.068
Token ac
Token ID936
Feature activation+7.97e-02

Pos logit contributions
ho+0.003
 than+0.003
s+0.003
 unt+0.003
bel+0.003
Neg logit contributions
 Everything-0.004
 emergency-0.004
 earthquake-0.004
loe-0.004
 ambulance-0.004
Tokenrob
Token ID22609
Feature activation+1.54e-01

Pos logit contributions
 Everything+0.047
rell+0.041
py+0.038
loe+0.034
 earthquake+0.034
Neg logit contributions
ho-0.226
 than-0.220
outing-0.214
bel-0.213
 unt-0.209
Tokenats
Token ID1381
Feature activation-1.12e-02

Pos logit contributions
 Everything+0.243
 emergency+0.224
 earthquake+0.222
py+0.222
rell+0.219
Neg logit contributions
ho-0.298
 than-0.285
bel-0.262
s-0.259
 unt-0.259
Token doing
Token ID1804
Feature activation+2.87e-02

Pos logit contributions
ho+0.013
 than+0.013
s+0.011
bel+0.011
 unt+0.011
Neg logit contributions
 Everything-0.026
 earthquake-0.024
 emergency-0.024
py-0.024
 ambulance-0.024
Token flips
Token ID45971
Feature activation-4.78e-02

Pos logit contributions
 Everything+0.058
 earthquake+0.054
 emergency+0.053
py+0.052
 ambulance+0.052
Neg logit contributions
 than-0.043
ho-0.043
s-0.039
 unt-0.038
bel-0.037
Token.
Token ID13
Feature activation-2.35e-03

Pos logit contributions
ho+0.059
 than+0.057
s+0.051
outing+0.049
ater+0.049
Neg logit contributions
 Everything-0.108
 earthquake-0.101
 emergency-0.100
py-0.099
 ambulance-0.098
Token But
Token ID887
Feature activation+2.35e-02

Pos logit contributions
ho+0.003
s+0.003
 than+0.003
 far+0.002
 unt+0.002
Neg logit contributions
 earthquake-0.005
 emergency-0.005
py-0.005
 ambulance-0.005
 nightmares-0.005
Token room
Token ID2119
Feature activation+2.53e-02

Pos logit contributions
 Everything+0.100
 emergency+0.089
 earthquake+0.085
 ambulance+0.085
 bears+0.082
Neg logit contributions
ho-0.158
 than-0.148
s-0.139
outing-0.136
 unt-0.135
Token and
Token ID290
Feature activation+9.02e-03

Pos logit contributions
 Everything+0.052
 emergency+0.051
 earthquake+0.050
 ambulance+0.049
 thunder+0.048
Neg logit contributions
ho-0.036
ucker-0.033
ighed-0.031
fish-0.030
onna-0.029
Token spotted
Token ID13489
Feature activation-6.35e-03

Pos logit contributions
 Everything+0.022
py+0.021
 earthquake+0.021
 emergency+0.021
loe+0.020
Neg logit contributions
ho-0.010
 than-0.010
s-0.009
 far-0.008
bel-0.008
Token an
Token ID281
Feature activation+2.44e-02

Pos logit contributions
 than+0.006
ho+0.006
 far+0.005
 unt+0.005
s+0.005
Neg logit contributions
 Everything-0.016
loe-0.015
 earthquake-0.015
py-0.015
rell-0.015
Token ash
Token ID12530
Feature activation+6.26e-02

Pos logit contributions
 Everything+0.053
loe+0.048
py+0.047
rell+0.047
 nightmares+0.046
Neg logit contributions
 than-0.034
ho-0.032
 unt-0.030
 far-0.029
ater-0.029
Tokentr
Token ID2213
Feature activation+1.53e-01

Pos logit contributions
 Everything+0.111
 earthquake+0.098
 emergency+0.098
py+0.097
loe+0.097
Neg logit contributions
ho-0.113
 than-0.111
s-0.101
bel-0.098
 unt-0.098
Tokenay
Token ID323
Feature activation+7.42e-02

Pos logit contributions
 Everything+0.202
 earthquake+0.179
py+0.177
 emergency+0.176
 bears+0.172
Neg logit contributions
ho-0.336
s-0.332
 than-0.330
bel-0.308
outing-0.308
Token.
Token ID13
Feature activation+4.00e-05

Pos logit contributions
 Everything+0.173
 earthquake+0.156
py+0.156
 emergency+0.154
loe+0.154
Neg logit contributions
ho-0.090
 than-0.089
s-0.081
 unt-0.077
 far-0.074
Token It
Token ID632
Feature activation+6.57e-03

Pos logit contributions
py+0.000
 earthquake+0.000
 emergency+0.000
 ambulance+0.000
 nightmares+0.000
Neg logit contributions
 than-0.000
ho-0.000
s-0.000
 unt-0.000
 far-0.000
Token had
Token ID550
Feature activation-7.60e-03

Pos logit contributions
 Everything+0.015
 earthquake+0.014
 emergency+0.014
 ambulance+0.014
py+0.014
Neg logit contributions
ho-0.008
 than-0.008
s-0.007
 unt-0.007
bel-0.007
Token lots
Token ID6041
Feature activation-4.67e-03

Pos logit contributions
 than+0.010
ho+0.009
 unt+0.009
 far+0.009
s+0.008
Neg logit contributions
 Everything-0.016
 earthquake-0.016
 emergency-0.016
loe-0.016
py-0.015
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 Everything+0.033
 earthquake+0.031
 emergency+0.031
py+0.030
loe+0.030
Neg logit contributions
ho-0.015
 than-0.015
s-0.013
 unt-0.012
 far-0.012
Token roses
Token ID42152
Feature activation+2.81e-02

Pos logit contributions
 Everything+0.027
 earthquake+0.025
 emergency+0.025
loe+0.025
 operation+0.024
Neg logit contributions
 than-0.014
ho-0.014
s-0.012
 far-0.012
bel-0.012
Token,
Token ID11
Feature activation+7.04e-03

Pos logit contributions
 Everything+0.062
 earthquake+0.058
 emergency+0.058
py+0.057
 ambulance+0.057
Neg logit contributions
ho-0.035
 than-0.035
bel-0.030
s-0.030
 unt-0.030
Token and
Token ID290
Feature activation+8.21e-03

Pos logit contributions
 Everything+0.017
 earthquake+0.016
 emergency+0.016
loe+0.016
py+0.016
Neg logit contributions
ho-0.008
 than-0.008
s-0.007
bel-0.006
 unt-0.006
Token sun
Token ID4252
Feature activation+1.74e-02

Pos logit contributions
 Everything+0.017
loe+0.016
 earthquake+0.015
 emergency+0.015
rell+0.015
Neg logit contributions
 than-0.013
ho-0.012
 far-0.011
s-0.011
bel-0.011
Tokenfl
Token ID2704
Feature activation+1.52e-01

Pos logit contributions
 Everything+0.031
 earthquake+0.028
 emergency+0.028
py+0.027
 ambulance+0.027
Neg logit contributions
ho-0.031
 than-0.031
s-0.029
bel-0.027
outing-0.027
Tokenowers
Token ID3618
Feature activation+6.60e-04

Pos logit contributions
 Everything+0.283
 earthquake+0.266
 emergency+0.264
 nightmare+0.253
 thunder+0.250
Neg logit contributions
ho-0.266
s-0.260
 than-0.260
bel-0.236
outing-0.231
Token.
Token ID13
Feature activation-8.58e-03

Pos logit contributions
 Everything+0.001
 earthquake+0.001
py+0.001
 emergency+0.001
 bears+0.001
Neg logit contributions
ho-0.001
 than-0.001
bel-0.001
outing-0.001
ater-0.001
Token They
Token ID1119
Feature activation-3.61e-03

Pos logit contributions
ho+0.011
 than+0.011
s+0.010
 far+0.010
outing+0.009
Neg logit contributions
 earthquake-0.018
 emergency-0.018
 Everything-0.018
py-0.017
rell-0.017
Token waved
Token ID26834
Feature activation-7.40e-03

Pos logit contributions
 than+0.005
ho+0.005
s+0.004
outing+0.004
ater+0.004
Neg logit contributions
 Everything-0.007
py-0.007
 earthquake-0.007
 emergency-0.007
 bears-0.007
Token to
Token ID284
Feature activation-5.75e-03

Pos logit contributions
ho+0.009
 than+0.008
s+0.007
ater+0.007
outing+0.007
Neg logit contributions
 Everything-0.017
 emergency-0.016
 earthquake-0.016
py-0.016
 thunder-0.016
Token play
Token ID711
Feature activation+1.62e-02

Pos logit contributions
 than+0.002
ho+0.002
 unt+0.002
s+0.002
ater+0.002
Neg logit contributions
 Everything-0.005
loe-0.005
py-0.005
 emergency-0.005
rell-0.004
Token with
Token ID351
Feature activation+2.93e-04

Pos logit contributions
 Everything+0.049
 earthquake+0.046
rell+0.046
py+0.046
loe+0.046
Neg logit contributions
 than-0.009
ho-0.008
s-0.006
 unt-0.006
bel-0.006
Token lamps
Token ID32209
Feature activation+4.65e-02

Pos logit contributions
 Everything+0.001
 earthquake+0.001
 emergency+0.001
py+0.001
 ambulance+0.001
Neg logit contributions
ho-0.000
 than-0.000
s-0.000
 unt-0.000
bel-0.000
Token and
Token ID290
Feature activation+2.44e-02

Pos logit contributions
 Everything+0.078
 earthquake+0.073
 emergency+0.073
 ambulance+0.071
py+0.071
Neg logit contributions
ho-0.081
 than-0.078
bel-0.072
s-0.072
 unt-0.072
Token ash
Token ID12530
Feature activation+6.39e-02

Pos logit contributions
 Everything+0.047
loe+0.043
 earthquake+0.042
rell+0.042
py+0.042
Neg logit contributions
 than-0.040
ho-0.039
bel-0.036
s-0.035
 unt-0.035
Tokentr
Token ID2213
Feature activation+1.52e-01

Pos logit contributions
 Everything+0.121
 earthquake+0.109
 emergency+0.108
loe+0.106
py+0.105
Neg logit contributions
ho-0.109
 than-0.108
s-0.098
bel-0.095
 unt-0.093
Tokenays
Token ID592
Feature activation-1.13e-02

Pos logit contributions
 Everything+0.221
 earthquake+0.201
 emergency+0.193
py+0.192
 hurricane+0.185
Neg logit contributions
 than-0.319
s-0.319
ho-0.316
outing-0.296
bel-0.292
Token anymore
Token ID7471
Feature activation+1.22e-02

Pos logit contributions
 than+0.013
ho+0.012
s+0.011
 far+0.010
 unt+0.010
Neg logit contributions
 Everything-0.028
 earthquake-0.026
 emergency-0.026
py-0.025
loe-0.025
Token.
Token ID13
Feature activation-2.39e-03

Pos logit contributions
 Everything+0.029
 earthquake+0.028
 emergency+0.028
py+0.027
 thunder+0.027
Neg logit contributions
ho-0.013
 than-0.012
outing-0.011
bel-0.011
ater-0.011
Token
Token ID198
Feature activation-8.54e-03

Pos logit contributions
ho+0.004
 than+0.004
ater+0.004
bel+0.004
 unt+0.004
Neg logit contributions
 Everything-0.005
 earthquake-0.004
 thunder-0.004
 emergency-0.004
 nightmares-0.004
Token
Token ID198
Feature activation-6.10e-03

Pos logit contributions
 than+0.016
ho+0.016
ater+0.015
outing+0.015
bel+0.014
Neg logit contributions
 Everything-0.015
 ambulance-0.012
 emergency-0.012
 thunder-0.012
loe-0.012
Token take
Token ID1011
Feature activation+3.76e-03

Pos logit contributions
ho+0.029
 than+0.029
s+0.026
 unt+0.026
bel+0.025
Neg logit contributions
 Everything-0.042
 earthquake-0.039
py-0.038
 emergency-0.038
 ambulance-0.037
Token a
Token ID257
Feature activation-2.13e-02

Pos logit contributions
py+0.010
loe+0.010
mo+0.010
 hurricane+0.010
 Everything+0.010
Neg logit contributions
 than-0.002
 far-0.002
 unt-0.002
 with-0.002
s-0.002
Token ride
Token ID6594
Feature activation-1.48e-02

Pos logit contributions
ho+0.034
 than+0.033
s+0.031
 unt+0.031
 far+0.030
Neg logit contributions
 Everything-0.041
 earthquake-0.037
py-0.037
 emergency-0.037
loe-0.036
Token in
Token ID287
Feature activation-1.04e-02

Pos logit contributions
ho+0.021
 than+0.021
s+0.018
bel+0.018
 unt+0.018
Neg logit contributions
 Everything-0.032
 earthquake-0.029
 emergency-0.029
py-0.029
 ambulance-0.028
Token a
Token ID257
Feature activation+3.26e-04

Pos logit contributions
 than+0.013
ho+0.013
s+0.012
 unt+0.011
 far+0.011
Neg logit contributions
 Everything-0.023
 earthquake-0.022
py-0.022
 emergency-0.021
rell-0.021
Token aer
Token ID9551
Feature activation+1.51e-01

Pos logit contributions
 Everything+0.001
py+0.001
 earthquake+0.001
loe+0.001
rell+0.001
Neg logit contributions
ho-0.001
 than-0.000
s-0.000
 far-0.000
 unt-0.000
Tokenopl
Token ID20106
Feature activation+7.89e-02

Pos logit contributions
 Everything+0.083
 earthquake+0.058
 emergency+0.057
py+0.051
 ambulance+0.048
Neg logit contributions
ho-0.448
 than-0.447
s-0.425
ater-0.418
 unt-0.418
Tokenane
Token ID1531
Feature activation+4.05e-02

Pos logit contributions
 Everything+0.095
 earthquake+0.083
 emergency+0.082
loe+0.079
py+0.079
Neg logit contributions
 than-0.183
ho-0.182
s-0.168
 unt-0.168
bel-0.165
Token.
Token ID13
Feature activation+3.33e-03

Pos logit contributions
 Everything+0.086
 operation+0.085
 emergency+0.085
 ambulance+0.084
 earthquake+0.084
Neg logit contributions
ho-0.054
outing-0.047
 than-0.047
ucker-0.045
ater-0.042
Token He
Token ID679
Feature activation-2.12e-04

Pos logit contributions
 Everything+0.005
 earthquake+0.005
 emergency+0.005
 ambulance+0.004
rell+0.004
Neg logit contributions
ho-0.007
 than-0.007
s-0.006
bel-0.006
outing-0.006
Token dream
Token ID4320
Feature activation+2.44e-03

Pos logit contributions
ho+0.000
 than+0.000
s+0.000
outing+0.000
ater+0.000
Neg logit contributions
 Everything-0.000
 emergency-0.000
 earthquake-0.000
py-0.000
 ambulance-0.000
Token in
Token ID287
Feature activation-1.15e-02

Pos logit contributions
 Everything+0.020
 earthquake+0.018
 thunder+0.018
 emergency+0.018
 lightning+0.018
Neg logit contributions
ho-0.016
 than-0.015
ater-0.014
outing-0.014
bel-0.014
Token and
Token ID290
Feature activation+5.50e-03

Pos logit contributions
ho+0.018
 than+0.017
s+0.015
outing+0.015
bel+0.015
Neg logit contributions
 Everything-0.023
 earthquake-0.022
 emergency-0.021
 thunder-0.021
 ambulance-0.021
Token found
Token ID1043
Feature activation+6.11e-04

Pos logit contributions
 Everything+0.011
 thunder+0.010
 lightning+0.010
 Beth+0.009
 earthquake+0.009
Neg logit contributions
 than-0.008
ho-0.008
ater-0.008
ucker-0.008
eed-0.008
Token an
Token ID281
Feature activation+1.53e-02

Pos logit contributions
 Everything+0.001
 earthquake+0.001
 emergency+0.001
py+0.001
loe+0.001
Neg logit contributions
ho-0.001
 than-0.001
s-0.001
 unt-0.001
 far-0.001
Token ash
Token ID12530
Feature activation+6.01e-02

Pos logit contributions
 Everything+0.031
 earthquake+0.028
 emergency+0.028
py+0.028
loe+0.028
Neg logit contributions
 than-0.023
ho-0.022
s-0.020
 unt-0.020
ater-0.019
Tokentr
Token ID2213
Feature activation+1.51e-01

Pos logit contributions
 Everything+0.092
 earthquake+0.087
 emergency+0.086
 ambulance+0.083
py+0.082
Neg logit contributions
 than-0.115
ho-0.114
s-0.107
outing-0.106
bel-0.105
Tokenay
Token ID323
Feature activation+7.70e-02

Pos logit contributions
 Everything+0.184
 earthquake+0.161
 emergency+0.160
py+0.157
 bears+0.152
Neg logit contributions
ho-0.349
 than-0.344
s-0.337
outing-0.318
bel-0.315
Token.
Token ID13
Feature activation+2.11e-03

Pos logit contributions
 Everything+0.203
 emergency+0.187
 earthquake+0.187
py+0.185
 nightmares+0.183
Neg logit contributions
ho-0.071
 than-0.069
s-0.060
 unt-0.056
 far-0.053
Token Lucy
Token ID22162
Feature activation+7.92e-04

Pos logit contributions
py+0.004
 emergency+0.004
 earthquake+0.004
 Everything+0.004
 ambulance+0.004
Neg logit contributions
 than-0.003
ho-0.003
s-0.003
 far-0.003
 unt-0.003
Token thought
Token ID1807
Feature activation+9.03e-03

Pos logit contributions
 Everything+0.001
 bears+0.001
 lightning+0.001
 thunder+0.001
py+0.001
Neg logit contributions
 than-0.001
ho-0.001
bel-0.001
eed-0.001
ater-0.001
Token it
Token ID340
Feature activation-3.24e-02

Pos logit contributions
 Everything+0.020
 earthquake+0.019
 emergency+0.019
py+0.018
 ambulance+0.018
Neg logit contributions
ho-0.012
 than-0.012
s-0.010
 unt-0.010
bel-0.010
Token dressed
Token ID12049
Feature activation-1.58e-02

Pos logit contributions
ho+0.028
 far+0.028
 than+0.027
s+0.027
 unt+0.026
Neg logit contributions
 earthquake-0.066
 Everything-0.066
py-0.065
loe-0.064
 emergency-0.063
Token up
Token ID510
Feature activation-2.25e-02

Pos logit contributions
ho+0.023
 than+0.021
outing+0.019
ucker+0.019
onna+0.018
Neg logit contributions
 Everything-0.032
 ambulance-0.030
 emergency-0.029
 bears-0.029
 television-0.029
Token in
Token ID287
Feature activation-3.08e-02

Pos logit contributions
ho+0.031
outing+0.029
ucker+0.028
ater+0.028
atching+0.027
Neg logit contributions
 Everything-0.042
 thunder-0.041
 earthquake-0.040
 lightning-0.040
 bears-0.039
Token her
Token ID607
Feature activation+1.62e-03

Pos logit contributions
ho+0.043
 than+0.043
s+0.038
bel+0.037
 unt+0.037
Neg logit contributions
 Everything-0.065
 earthquake-0.061
 emergency-0.060
py-0.059
 ambulance-0.059
Token fan
Token ID4336
Feature activation+4.16e-02

Pos logit contributions
 Everything+0.004
 earthquake+0.003
py+0.003
 emergency+0.003
loe+0.003
Neg logit contributions
 than-0.002
ho-0.002
s-0.002
bel-0.002
 unt-0.002
Tokenci
Token ID979
Feature activation+1.51e-01

Pos logit contributions
 Everything+0.098
 earthquake+0.092
 emergency+0.092
py+0.090
 ambulance+0.090
Neg logit contributions
ho-0.047
 than-0.046
s-0.039
outing-0.039
 unt-0.038
Tokenest
Token ID395
Feature activation+4.75e-02

Pos logit contributions
 Everything+0.149
 earthquake+0.145
 hurricane+0.137
 lightning+0.134
 emergency+0.132
Neg logit contributions
 than-0.364
s-0.362
ho-0.355
ater-0.326
 carefully-0.319
Token clothes
Token ID8242
Feature activation-2.08e-03

Pos logit contributions
 Everything+0.102
 earthquake+0.091
loe+0.091
py+0.090
 emergency+0.086
Neg logit contributions
 than-0.075
ho-0.065
s-0.064
bel-0.062
ater-0.060
Token and
Token ID290
Feature activation-2.04e-03

Pos logit contributions
ho+0.003
 than+0.002
ater+0.002
outing+0.002
bel+0.002
Neg logit contributions
 Everything-0.004
 earthquake-0.004
rell-0.004
 ambulance-0.004
 emergency-0.004
Token asked
Token ID1965
Feature activation+1.44e-02

Pos logit contributions
 than+0.003
ho+0.003
s+0.003
bel+0.002
 unt+0.002
Neg logit contributions
 Everything-0.004
 earthquake-0.004
 emergency-0.004
py-0.004
loe-0.004
Token her
Token ID607
Feature activation-1.74e-02

Pos logit contributions
 Everything+0.034
 earthquake+0.032
rell+0.032
 emergency+0.032
py+0.032
Neg logit contributions
 than-0.016
ho-0.016
s-0.014
 far-0.014
 unt-0.013
Token they
Token ID484
Feature activation+1.21e-02

Pos logit contributions
 Everything+0.068
 earthquake+0.064
 emergency+0.064
py+0.062
 thunder+0.061
Neg logit contributions
ho-0.043
 than-0.042
s-0.038
ater-0.037
bel-0.037
Token also
Token ID635
Feature activation-9.53e-03

Pos logit contributions
 Everything+0.025
 earthquake+0.023
 emergency+0.023
 ambulance+0.022
py+0.022
Neg logit contributions
ho-0.017
 than-0.017
bel-0.016
s-0.015
outing-0.015
Token see
Token ID766
Feature activation-6.46e-03

Pos logit contributions
ho+0.014
s+0.012
 than+0.012
outing+0.012
bel+0.012
Neg logit contributions
 Everything-0.020
 earthquake-0.019
 emergency-0.019
 ambulance-0.018
 thunder-0.018
Token an
Token ID281
Feature activation+1.67e-02

Pos logit contributions
 than+0.007
 far+0.006
ho+0.006
s+0.006
 unt+0.005
Neg logit contributions
 Everything-0.016
loe-0.015
rell-0.015
py-0.015
 earthquake-0.015
Token ash
Token ID12530
Feature activation+6.21e-02

Pos logit contributions
 Everything+0.036
loe+0.033
rell+0.032
py+0.031
 Wolf+0.031
Neg logit contributions
 than-0.024
ho-0.021
 far-0.021
 unt-0.021
ater-0.020
Tokentr
Token ID2213
Feature activation+1.51e-01

Pos logit contributions
 Everything+0.110
 earthquake+0.097
 emergency+0.097
py+0.096
loe+0.095
Neg logit contributions
ho-0.113
 than-0.111
s-0.102
bel-0.098
 unt-0.097
Tokenay
Token ID323
Feature activation+8.27e-02

Pos logit contributions
 Everything+0.210
 earthquake+0.186
py+0.184
 emergency+0.182
 bears+0.178
Neg logit contributions
s-0.323
ho-0.318
 than-0.315
bel-0.297
outing-0.293
Token on
Token ID319
Feature activation+2.97e-02

Pos logit contributions
 Everything+0.191
 earthquake+0.174
 nightmares+0.170
py+0.170
 emergency+0.169
Neg logit contributions
 than-0.101
ho-0.098
s-0.095
 unt-0.088
 far-0.086
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
 Everything+0.064
 earthquake+0.060
 emergency+0.059
py+0.059
 ambulance+0.058
Neg logit contributions
ho-0.040
 than-0.040
s-0.035
 unt-0.034
bel-0.034
Token table
Token ID3084
Feature activation-2.39e-02

Pos logit contributions
 Everything+0.023
 earthquake+0.021
 emergency+0.021
py+0.021
 ambulance+0.020
Neg logit contributions
ho-0.019
 than-0.018
s-0.017
bel-0.016
outing-0.016
Token.
Token ID13
Feature activation+7.87e-03

Pos logit contributions
ho+0.034
bel+0.029
 unt+0.029
 than+0.028
ucker+0.028
Neg logit contributions
 Everything-0.048
 ambulance-0.046
 earthquake-0.046
 emergency-0.046
 thunder-0.043
Token room
Token ID2119
Feature activation+1.51e-02

Pos logit contributions
 Everything+0.030
py+0.030
rell+0.028
loe+0.028
 earthquake+0.028
Neg logit contributions
 than-0.030
ho-0.029
 far-0.029
s-0.028
 unt-0.026
Token and
Token ID290
Feature activation+3.96e-03

Pos logit contributions
 Everything+0.028
 earthquake+0.026
 emergency+0.026
 thunder+0.025
 lightning+0.025
Neg logit contributions
ho-0.024
ucker-0.022
ater-0.022
outing-0.021
bel-0.021
Token saw
Token ID2497
Feature activation+8.89e-03

Pos logit contributions
 Everything+0.010
 earthquake+0.009
 emergency+0.009
py+0.009
 ambulance+0.009
Neg logit contributions
ho-0.005
 than-0.004
s-0.004
ater-0.004
outing-0.004
Token an
Token ID281
Feature activation+1.92e-02

Pos logit contributions
 Everything+0.021
 earthquake+0.020
rell+0.020
py+0.020
 emergency+0.020
Neg logit contributions
ho-0.010
 than-0.009
 far-0.009
s-0.008
 unt-0.008
Token ash
Token ID12530
Feature activation+6.05e-02

Pos logit contributions
 Everything+0.042
loe+0.038
py+0.038
rell+0.038
 Everybody+0.036
Neg logit contributions
 than-0.027
ho-0.025
 unt-0.023
 far-0.023
ater-0.022
Tokentr
Token ID2213
Feature activation+1.50e-01

Pos logit contributions
 Everything+0.112
 emergency+0.098
 earthquake+0.098
loe+0.098
py+0.097
Neg logit contributions
ho-0.107
 than-0.105
s-0.095
bel-0.092
 unt-0.090
Tokenay
Token ID323
Feature activation+7.41e-02

Pos logit contributions
 Everything+0.206
 earthquake+0.185
py+0.183
 emergency+0.183
 bears+0.182
Neg logit contributions
s-0.320
ho-0.319
 than-0.314
outing-0.294
bel-0.294
Token.
Token ID13
Feature activation+2.61e-03

Pos logit contributions
 Everything+0.175
 earthquake+0.160
 emergency+0.158
py+0.158
rell+0.157
Neg logit contributions
ho-0.087
 than-0.084
s-0.078
 unt-0.074
 far-0.071
Token She
Token ID1375
Feature activation+2.29e-02

Pos logit contributions
py+0.005
 earthquake+0.005
 emergency+0.005
 bears+0.005
 hurricane+0.005
Neg logit contributions
s-0.004
 than-0.003
ho-0.003
 unt-0.003
 far-0.003
Token remembered
Token ID12086
Feature activation-1.37e-02

Pos logit contributions
 Everything+0.051
 emergency+0.046
 earthquake+0.046
py+0.046
 thunder+0.045
Neg logit contributions
 than-0.030
ho-0.030
ater-0.026
s-0.025
bel-0.025
Token her
Token ID607
Feature activation-8.46e-03

Pos logit contributions
ho+0.018
 than+0.017
s+0.015
 unt+0.015
bel+0.015
Neg logit contributions
 Everything-0.031
 earthquake-0.029
 emergency-0.028
py-0.028
 ambulance-0.028
Token could
Token ID714
Feature activation+1.17e-02

Pos logit contributions
 Everything+0.020
 earthquake+0.019
 emergency+0.018
py+0.018
 ambulance+0.018
Neg logit contributions
ho-0.012
 than-0.012
s-0.011
outing-0.011
bel-0.011
Token even
Token ID772
Feature activation+2.54e-02

Pos logit contributions
 Everything+0.023
 earthquake+0.021
py+0.021
 emergency+0.021
 ambulance+0.020
Neg logit contributions
ho-0.019
 than-0.018
s-0.017
outing-0.016
ater-0.016
Token turn
Token ID1210
Feature activation-3.00e-02

Pos logit contributions
 Everything+0.053
 earthquake+0.049
 emergency+0.049
py+0.048
 ambulance+0.048
Neg logit contributions
ho-0.036
 than-0.035
s-0.032
outing-0.031
ater-0.031
Token flip
Token ID14283
Feature activation-4.65e-02

Pos logit contributions
 than+0.045
ho+0.040
s+0.040
 far+0.038
 unt+0.036
Neg logit contributions
 Everything-0.064
 earthquake-0.061
loe-0.060
 emergency-0.059
py-0.058
Token-
Token ID12
Feature activation+5.20e-02

Pos logit contributions
 than+0.061
ho+0.059
s+0.055
bel+0.051
 far+0.051
Neg logit contributions
 Everything-0.104
 earthquake-0.098
 emergency-0.097
loe-0.095
py-0.095
Tokenfl
Token ID2704
Feature activation+1.50e-01

Pos logit contributions
 Everything+0.104
 earthquake+0.099
 emergency+0.098
py+0.095
 ambulance+0.094
Neg logit contributions
ho-0.079
 than-0.079
s-0.072
bel-0.069
ater-0.068
Tokenops
Token ID2840
Feature activation+1.83e-02

Pos logit contributions
 Everything+0.242
 emergency+0.223
 earthquake+0.221
 nightmare+0.209
 hurricane+0.206
Neg logit contributions
ho-0.293
 than-0.293
s-0.289
bel-0.266
ater-0.258
Token!
Token ID0
Feature activation-9.20e-03

Pos logit contributions
 Everything+0.037
 earthquake+0.036
rell+0.035
py+0.035
 emergency+0.035
Neg logit contributions
ho-0.026
 than-0.024
bel-0.022
outing-0.022
ater-0.021
Token Everyone
Token ID11075
Feature activation+2.72e-02

Pos logit contributions
 than+0.010
s+0.009
ho+0.009
 far+0.008
 unt+0.008
Neg logit contributions
py-0.022
 emergency-0.021
 earthquake-0.021
 ambulance-0.021
 nightmares-0.021
Token was
Token ID373
Feature activation+2.36e-02

Pos logit contributions
 Everything+0.058
 earthquake+0.054
 emergency+0.054
py+0.053
 ambulance+0.052
Neg logit contributions
ho-0.038
 than-0.037
s-0.033
bel-0.032
 unt-0.032
Token very
Token ID845
Feature activation+9.02e-03

Pos logit contributions
 Everything+0.049
 earthquake+0.045
 emergency+0.045
py+0.044
 ambulance+0.043
Neg logit contributions
 than-0.035
ho-0.034
s-0.031
 unt-0.030
bel-0.030
Token saw
Token ID2497
Feature activation+6.66e-03

Pos logit contributions
ho+0.004
 than+0.004
ater+0.003
outing+0.003
s+0.003
Neg logit contributions
 Everything-0.005
py-0.005
 bears-0.005
 emergency-0.005
 thunder-0.005
Token clown
Token ID25573
Feature activation+1.19e-02

Pos logit contributions
 Everything+0.013
 earthquake+0.013
 emergency+0.012
loe+0.012
py+0.012
Neg logit contributions
ho-0.010
 than-0.010
s-0.009
 far-0.009
bel-0.009
Tokens
Token ID82
Feature activation+8.73e-04

Pos logit contributions
 Everything+0.039
 earthquake+0.037
 emergency+0.037
py+0.036
 ambulance+0.036
Neg logit contributions
ho-0.004
 than-0.003
s-0.002
bel-0.001
 unt-0.001
Token and
Token ID290
Feature activation+5.53e-03

Pos logit contributions
 Everything+0.002
 earthquake+0.002
 emergency+0.002
py+0.002
 ambulance+0.002
Neg logit contributions
ho-0.001
 than-0.001
s-0.001
ater-0.001
bel-0.001
Token ac
Token ID936
Feature activation+8.33e-02

Pos logit contributions
 Everything+0.011
 earthquake+0.010
loe+0.010
 emergency+0.010
rell+0.010
Neg logit contributions
 than-0.009
ho-0.009
s-0.008
bel-0.008
 far-0.007
Tokenrob
Token ID22609
Feature activation+1.50e-01

Pos logit contributions
 Everything+0.075
 earthquake+0.063
 emergency+0.062
py+0.059
 ambulance+0.057
Neg logit contributions
ho-0.219
 than-0.218
s-0.206
bel-0.202
 unt-0.202
Tokenats
Token ID1381
Feature activation-1.00e-02

Pos logit contributions
 Everything+0.226
 emergency+0.212
py+0.211
rell+0.208
 earthquake+0.208
Neg logit contributions
ho-0.295
 than-0.279
bel-0.265
 unt-0.259
 far-0.257
Token and
Token ID290
Feature activation+1.86e-02

Pos logit contributions
ho+0.009
 than+0.009
s+0.008
 unt+0.007
 far+0.007
Neg logit contributions
 Everything-0.026
 earthquake-0.025
 emergency-0.024
py-0.024
loe-0.024
Token lions
Token ID31420
Feature activation+5.43e-02

Pos logit contributions
 Everything+0.037
loe+0.035
 earthquake+0.034
 emergency+0.034
py+0.033
Neg logit contributions
 than-0.029
ho-0.029
s-0.027
bel-0.026
 far-0.026
Token and
Token ID290
Feature activation+2.19e-02

Pos logit contributions
 Everything+0.138
 earthquake+0.130
 emergency+0.129
py+0.127
 ambulance+0.125
Neg logit contributions
ho-0.054
 than-0.054
s-0.046
 unt-0.042
bel-0.042
Token elephants
Token ID25368
Feature activation+6.20e-02

Pos logit contributions
 Everything+0.038
loe+0.037
 earthquake+0.034
 emergency+0.034
 operation+0.033
Neg logit contributions
 than-0.040
ho-0.040
s-0.037
 far-0.037
bel-0.036
Token chains
Token ID14659
Feature activation+3.68e-02

Pos logit contributions
 Everything+0.002
 earthquake+0.002
 emergency+0.002
py+0.002
 ambulance+0.002
Neg logit contributions
ho-0.001
 than-0.001
s-0.001
bel-0.001
 unt-0.001
Token,
Token ID11
Feature activation+2.03e-02

Pos logit contributions
 Everything+0.081
 earthquake+0.075
 emergency+0.075
py+0.074
 ambulance+0.073
Neg logit contributions
ho-0.049
 than-0.048
s-0.043
bel-0.041
 unt-0.041
Token glitter
Token ID31133
Feature activation+3.81e-02

Pos logit contributions
 Everything+0.044
 earthquake+0.041
 emergency+0.041
loe+0.040
 hurricane+0.039
Neg logit contributions
 than-0.030
ho-0.028
s-0.025
 unt-0.025
bel-0.024
Tokeny
Token ID88
Feature activation+6.89e-03

Pos logit contributions
 Everything+0.081
py+0.076
 bears+0.075
 emergency+0.075
rell+0.075
Neg logit contributions
ho-0.049
 than-0.047
 unt-0.043
bel-0.043
outing-0.042
Token snow
Token ID6729
Feature activation+5.67e-02

Pos logit contributions
 Everything+0.015
 emergency+0.014
 earthquake+0.014
loe+0.014
 ambulance+0.014
Neg logit contributions
 than-0.010
ho-0.010
s-0.009
ater-0.008
bel-0.008
Tokenfl
Token ID2704
Feature activation+1.49e-01

Pos logit contributions
 Everything+0.116
 earthquake+0.107
 emergency+0.107
 ambulance+0.102
py+0.102
Neg logit contributions
ho-0.088
 than-0.086
s-0.079
bel-0.074
ater-0.073
Tokenakes
Token ID1124
Feature activation+4.17e-02

Pos logit contributions
 Everything+0.230
 emergency+0.220
 earthquake+0.218
 nightmare+0.208
 thunder+0.204
Neg logit contributions
s-0.305
ho-0.302
 than-0.301
bel-0.274
ater-0.263
Token,
Token ID11
Feature activation+2.45e-02

Pos logit contributions
 Everything+0.083
 earthquake+0.080
rell+0.079
 bears+0.078
py+0.078
Neg logit contributions
ho-0.056
 than-0.053
bel-0.052
outing-0.051
ater-0.050
Token and
Token ID290
Feature activation+2.04e-02

Pos logit contributions
 Everything+0.063
 earthquake+0.060
 emergency+0.060
loe+0.058
py+0.057
Neg logit contributions
 than-0.025
ho-0.024
s-0.020
 unt-0.019
bel-0.019
Token a
Token ID257
Feature activation+8.54e-03

Pos logit contributions
 Everything+0.043
loe+0.041
 earthquake+0.040
 emergency+0.040
py+0.039
Neg logit contributions
 than-0.030
ho-0.028
s-0.026
 unt-0.025
 far-0.025
Token picture
Token ID4286
Feature activation+4.10e-02

Pos logit contributions
 Everything+0.019
loe+0.017
py+0.017
 earthquake+0.017
rell+0.017
Neg logit contributions
ho-0.012
 than-0.012
s-0.011
 far-0.010
 unt-0.010
Token She
Token ID1375
Feature activation+2.57e-02

Pos logit contributions
 Everything+0.018
 earthquake+0.017
 emergency+0.017
py+0.017
 ambulance+0.017
Neg logit contributions
ho-0.016
 than-0.016
s-0.015
 unt-0.014
bel-0.014
Token saw
Token ID2497
Feature activation+9.11e-03

Pos logit contributions
 Everything+0.056
 emergency+0.051
 earthquake+0.051
py+0.050
 ambulance+0.050
Neg logit contributions
 than-0.035
ho-0.035
s-0.030
outing-0.030
ater-0.030
Token lots
Token ID6041
Feature activation+9.31e-03

Pos logit contributions
 Everything+0.021
 earthquake+0.021
loe+0.020
 emergency+0.020
rell+0.020
Neg logit contributions
ho-0.010
 than-0.010
 far-0.009
s-0.009
 unt-0.009
Token of
Token ID286
Feature activation+6.95e-02

Pos logit contributions
 Everything+0.019
 earthquake+0.017
 emergency+0.017
loe+0.017
py+0.017
Neg logit contributions
 than-0.014
ho-0.013
s-0.013
 unt-0.012
 far-0.012
Token snow
Token ID6729
Feature activation+5.93e-02

Pos logit contributions
loe+0.155
aries+0.150
 Everything+0.143
py+0.137
rell+0.137
Neg logit contributions
 far-0.098
 than-0.096
s-0.085
 else-0.079
 carefully-0.079
Tokenfl
Token ID2704
Feature activation+1.48e-01

Pos logit contributions
 Everything+0.128
 earthquake+0.120
 emergency+0.119
py+0.117
 ambulance+0.115
Neg logit contributions
ho-0.080
 than-0.079
s-0.070
 unt-0.068
bel-0.068
Tokenakes
Token ID1124
Feature activation+2.99e-02

Pos logit contributions
 Everything+0.212
 emergency+0.196
 earthquake+0.194
 nightmare+0.185
 hurricane+0.182
Neg logit contributions
 than-0.314
s-0.313
ho-0.309
bel-0.287
outing-0.286
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
 Everything+0.063
 earthquake+0.060
 emergency+0.059
py+0.059
 thunder+0.058
Neg logit contributions
ho-0.040
 than-0.038
bel-0.035
ater-0.035
outing-0.034
Token glitter
Token ID31133
Feature activation+2.80e-02

Pos logit contributions
 Everything+0.032
 earthquake+0.030
 emergency+0.030
loe+0.029
py+0.029
Neg logit contributions
 than-0.015
ho-0.015
s-0.013
 unt-0.012
bel-0.012
Tokening
Token ID278
Feature activation+4.87e-02

Pos logit contributions
 Everything+0.060
py+0.056
rell+0.056
 bears+0.056
 earthquake+0.055
Neg logit contributions
ho-0.035
 than-0.034
bel-0.032
 unt-0.031
ater-0.031
Token in
Token ID287
Feature activation-5.03e-03

Pos logit contributions
 Everything+0.117
 earthquake+0.110
 emergency+0.109
loe+0.108
 ambulance+0.106
Neg logit contributions
 than-0.059
ho-0.058
s-0.052
bel-0.046
 far-0.046
Token cigarette
Token ID17779
Feature activation+6.87e-02

Pos logit contributions
ho+0.018
 than+0.017
s+0.016
bel+0.015
 unt+0.015
Neg logit contributions
 Everything-0.023
 earthquake-0.022
 emergency-0.021
py-0.021
 ambulance-0.021
Token and
Token ID290
Feature activation-3.32e-03

Pos logit contributions
 Everything+0.160
 earthquake+0.150
 emergency+0.149
py+0.147
 ambulance+0.145
Neg logit contributions
ho-0.082
 than-0.081
s-0.071
bel-0.067
 unt-0.067
Token using
Token ID1262
Feature activation-2.30e-02

Pos logit contributions
ho+0.005
 than+0.005
s+0.004
 unt+0.004
bel+0.004
Neg logit contributions
 Everything-0.007
 earthquake-0.007
 emergency-0.006
py-0.006
loe-0.006
Token an
Token ID281
Feature activation+9.05e-03

Pos logit contributions
 than+0.029
ho+0.028
s+0.026
 unt+0.026
 far+0.024
Neg logit contributions
 Everything-0.051
 earthquake-0.049
py-0.049
 emergency-0.048
loe-0.048
Token ash
Token ID12530
Feature activation+5.88e-02

Pos logit contributions
 Everything+0.017
py+0.015
 earthquake+0.015
loe+0.015
rell+0.014
Neg logit contributions
 than-0.015
ho-0.015
s-0.014
 unt-0.014
outing-0.013
Tokentr
Token ID2213
Feature activation+1.48e-01

Pos logit contributions
 Everything+0.098
 earthquake+0.089
 emergency+0.088
py+0.087
loe+0.085
Neg logit contributions
ho-0.109
 than-0.108
s-0.099
bel-0.096
 unt-0.096
Tokenay
Token ID323
Feature activation+7.73e-02

Pos logit contributions
 Everything+0.169
 earthquake+0.149
 emergency+0.146
py+0.142
 ambulance+0.137
Neg logit contributions
ho-0.353
 than-0.350
s-0.339
outing-0.323
bel-0.323
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 Everything+0.174
 earthquake+0.164
 emergency+0.163
py+0.159
 ambulance+0.158
Neg logit contributions
ho-0.097
 than-0.097
s-0.087
 unt-0.082
ater-0.081
Token
Token ID198
Feature activation-6.16e-03

Pos logit contributions
ho+0.022
 than+0.022
ater+0.020
outing+0.020
bel+0.020
Neg logit contributions
 Everything-0.021
 earthquake-0.019
 emergency-0.018
 ambulance-0.018
rell-0.018
Token
Token ID198
Feature activation-1.82e-03

Pos logit contributions
ho+0.010
 than+0.010
ater+0.009
outing+0.009
bel+0.008
Neg logit contributions
 Everything-0.012
loe-0.011
 emergency-0.011
py-0.010
 ambulance-0.010
TokenL
Token ID43
Feature activation-1.81e-03

Pos logit contributions
ho+0.003
 than+0.002
bec+0.002
fish+0.002
outing+0.002
Neg logit contributions
 Everything-0.004
 ambulance-0.003
 Everybody-0.003
 emergency-0.003
loe-0.003
Token cave
Token ID11527
Feature activation+1.90e-02

Pos logit contributions
 Everything+0.056
loe+0.053
py+0.052
rell+0.051
 earthquake+0.048
Neg logit contributions
 far-0.051
 than-0.049
ho-0.048
s-0.047
fish-0.045
Token and
Token ID290
Feature activation+8.39e-03

Pos logit contributions
 Everything+0.035
 earthquake+0.035
 thunder+0.034
 ambulance+0.034
 emergency+0.034
Neg logit contributions
ho-0.027
ucker-0.026
ater-0.025
outing-0.024
ighed-0.023
Token saw
Token ID2497
Feature activation+9.70e-03

Pos logit contributions
 Everything+0.018
 earthquake+0.017
 emergency+0.017
 thunder+0.017
py+0.016
Neg logit contributions
ho-0.011
 than-0.011
ater-0.010
s-0.010
outing-0.010
Token an
Token ID281
Feature activation+1.88e-02

Pos logit contributions
 Everything+0.023
 earthquake+0.022
 emergency+0.021
rell+0.021
loe+0.021
Neg logit contributions
ho-0.011
 than-0.011
 far-0.010
s-0.010
 unt-0.009
Token ash
Token ID12530
Feature activation+5.75e-02

Pos logit contributions
 Everything+0.045
loe+0.042
rell+0.040
 nightmares+0.039
aries+0.039
Neg logit contributions
 than-0.023
ho-0.020
 far-0.019
 unt-0.019
ater-0.018
Tokentr
Token ID2213
Feature activation+1.48e-01

Pos logit contributions
 Everything+0.109
 earthquake+0.100
 emergency+0.099
py+0.097
 ambulance+0.096
Neg logit contributions
ho-0.094
 than-0.093
s-0.085
bel-0.082
 unt-0.082
Tokenay
Token ID323
Feature activation+7.02e-02

Pos logit contributions
 Everything+0.185
 earthquake+0.162
 emergency+0.161
py+0.155
 bears+0.152
Neg logit contributions
ho-0.340
 than-0.333
s-0.327
outing-0.309
bel-0.308
Token by
Token ID416
Feature activation+9.94e-03

Pos logit contributions
 Everything+0.169
 earthquake+0.154
 emergency+0.154
py+0.151
loe+0.151
Neg logit contributions
ho-0.081
 than-0.080
s-0.071
 unt-0.067
 far-0.065
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
 Everything+0.019
 earthquake+0.018
 emergency+0.018
 thunder+0.018
py+0.018
Neg logit contributions
ho-0.015
 than-0.014
s-0.014
outing-0.014
bel-0.014
Token shore
Token ID15191
Feature activation-2.18e-02

Pos logit contributions
 Everything+0.026
loe+0.024
rell+0.023
py+0.022
 emergency+0.021
Neg logit contributions
ho-0.023
 than-0.023
 far-0.021
ater-0.020
 unt-0.019
Token.
Token ID13
Feature activation-9.45e-03

Pos logit contributions
ho+0.023
 than+0.022
 unt+0.019
s+0.019
bel+0.019
Neg logit contributions
 Everything-0.053
 earthquake-0.050
 emergency-0.050
py-0.049
 ambulance-0.049
Token he
Token ID339
Feature activation+1.86e-02

Pos logit contributions
ho+0.027
 far+0.026
 than+0.025
s+0.022
bel+0.018
Neg logit contributions
rell-0.095
 Everything-0.094
py-0.094
loe-0.092
 ambulance-0.089
Token sp
Token ID599
Feature activation+2.85e-02

Pos logit contributions
 Everything+0.037
 earthquake+0.034
 emergency+0.034
 thunder+0.034
py+0.033
Neg logit contributions
 than-0.028
ho-0.027
outing-0.024
bel-0.024
ater-0.023
Tokenied
Token ID798
Feature activation+1.11e-02

Pos logit contributions
 Everything+0.051
 emergency+0.047
loe+0.047
py+0.047
 earthquake+0.047
Neg logit contributions
ho-0.051
 than-0.051
 unt-0.042
s-0.042
 appet-0.042
Token an
Token ID281
Feature activation+1.92e-02

Pos logit contributions
 Everything+0.026
 earthquake+0.024
 emergency+0.024
py+0.024
loe+0.024
Neg logit contributions
 than-0.014
ho-0.013
s-0.012
 unt-0.012
 far-0.011
Token ash
Token ID12530
Feature activation+5.84e-02

Pos logit contributions
 Everything+0.036
loe+0.032
rell+0.031
 Wolf+0.030
 nightmares+0.030
Neg logit contributions
 than-0.032
ho-0.030
 unt-0.029
ater-0.028
 far-0.028
Tokentr
Token ID2213
Feature activation+1.48e-01

Pos logit contributions
 Everything+0.094
 earthquake+0.086
 emergency+0.085
py+0.083
 ambulance+0.081
Neg logit contributions
ho-0.111
 than-0.110
s-0.102
 unt-0.099
bel-0.099
Tokenay
Token ID323
Feature activation+8.01e-02

Pos logit contributions
 Everything+0.175
 earthquake+0.154
 emergency+0.151
py+0.149
 bears+0.143
Neg logit contributions
ho-0.347
 than-0.342
s-0.334
bel-0.318
outing-0.316
Token.
Token ID13
Feature activation-2.76e-04

Pos logit contributions
 Everything+0.202
 earthquake+0.190
 emergency+0.189
py+0.186
 ambulance+0.183
Neg logit contributions
ho-0.081
 than-0.080
s-0.069
 unt-0.064
bel-0.063
Token He
Token ID679
Feature activation+2.17e-03

Pos logit contributions
ho+0.000
 than+0.000
s+0.000
 unt+0.000
bel+0.000
Neg logit contributions
 Everything-0.000
 earthquake-0.000
 emergency-0.000
py-0.000
 ambulance-0.000
Token picked
Token ID6497
Feature activation+1.35e-02

Pos logit contributions
 Everything+0.005
 earthquake+0.004
 emergency+0.004
py+0.004
 bears+0.004
Neg logit contributions
 than-0.003
ho-0.003
s-0.003
outing-0.003
ater-0.003
Token up
Token ID510
Feature activation-6.92e-03

Pos logit contributions
 Everything+0.030
 earthquake+0.029
 emergency+0.029
py+0.028
loe+0.028
Neg logit contributions
 than-0.017
ho-0.017
s-0.015
 unt-0.015
 far-0.014
Token a
Token ID257
Feature activation-1.97e-03

Pos logit contributions
 than+0.004
 far+0.003
ho+0.003
s+0.003
 unt+0.003
Neg logit contributions
 Everything-0.007
 earthquake-0.007
loe-0.007
py-0.007
 emergency-0.007
Token garden
Token ID11376
Feature activation+4.70e-03

Pos logit contributions
ho+0.002
 than+0.002
 far+0.002
s+0.002
 unt+0.002
Neg logit contributions
 Everything-0.005
py-0.004
loe-0.004
rell-0.004
 earthquake-0.004
Token full
Token ID1336
Feature activation-1.26e-02

Pos logit contributions
 Everything+0.010
 earthquake+0.010
 ambulance+0.010
 emergency+0.010
 bears+0.010
Neg logit contributions
ho-0.006
 than-0.005
outing-0.005
bel-0.005
s-0.005
Token of
Token ID286
Feature activation+3.81e-02

Pos logit contributions
ho+0.019
 than+0.019
s+0.017
 far+0.016
 unt+0.016
Neg logit contributions
 Everything-0.026
 earthquake-0.024
py-0.023
 emergency-0.023
loe-0.023
Token wild
Token ID4295
Feature activation+1.86e-02

Pos logit contributions
loe+0.084
 Everything+0.081
py+0.079
rell+0.078
 earthquake+0.076
Neg logit contributions
 than-0.048
 far-0.048
 carefully-0.041
ho-0.039
 unt-0.039
Tokenfl
Token ID2704
Feature activation+1.48e-01

Pos logit contributions
loe+0.039
 Everything+0.037
 meantime+0.033
 Story+0.033
aries+0.033
Neg logit contributions
 than-0.032
s-0.027
ho-0.025
 far-0.023
 with-0.022
Tokenowers
Token ID3618
Feature activation+4.04e-03

Pos logit contributions
 Everything+0.332
 emergency+0.305
 nightmare+0.303
 earthquake+0.297
 thunder+0.295
Neg logit contributions
s-0.206
ho-0.199
 than-0.193
bel-0.169
ater-0.164
Token.
Token ID13
Feature activation-2.89e-03

Pos logit contributions
 Everything+0.009
 earthquake+0.009
 emergency+0.009
py+0.009
 bears+0.009
Neg logit contributions
ho-0.005
 than-0.004
bel-0.004
outing-0.004
ater-0.004
Token He
Token ID679
Feature activation+4.69e-04

Pos logit contributions
 than+0.004
s+0.004
ho+0.004
 far+0.004
 unt+0.003
Neg logit contributions
 emergency-0.006
 earthquake-0.006
py-0.006
 ambulance-0.006
 nightmares-0.005
Token decided
Token ID3066
Feature activation+6.42e-04

Pos logit contributions
 Everything+0.001
 earthquake+0.001
 emergency+0.001
py+0.001
 ambulance+0.001
Neg logit contributions
 than-0.001
ho-0.001
s-0.001
outing-0.001
ater-0.001
Token to
Token ID284
Feature activation-1.31e-02

Pos logit contributions
 Everything+0.002
 emergency+0.001
 thunder+0.001
 earthquake+0.001
 ambulance+0.001
Neg logit contributions
ho-0.001
ater-0.001
outing-0.001
 than-0.001
ucker-0.001
Token!
Token ID0
Feature activation-8.79e-03

Pos logit contributions
ho+0.023
 than+0.021
outing+0.019
s+0.019
ater+0.018
Neg logit contributions
 Everything-0.049
 earthquake-0.046
 emergency-0.045
 thunder-0.045
 ambulance-0.044
Token Alice
Token ID14862
Feature activation-4.33e-02

Pos logit contributions
 than+0.017
ho+0.017
s+0.016
 unt+0.015
bel+0.015
Neg logit contributions
 Everything-0.013
py-0.013
 earthquake-0.013
 emergency-0.013
 ambulance-0.012
Token was
Token ID373
Feature activation-4.36e-03

Pos logit contributions
ho+0.057
 than+0.057
outing+0.050
bel+0.050
ater+0.048
Neg logit contributions
 Everything-0.094
py-0.085
 earthquake-0.085
 emergency-0.085
loe-0.084
Token so
Token ID523
Feature activation-1.51e-02

Pos logit contributions
 than+0.007
ho+0.007
 unt+0.006
s+0.006
bel+0.006
Neg logit contributions
 Everything-0.009
 earthquake-0.008
 emergency-0.008
py-0.008
 ambulance-0.008
Token thrilled
Token ID20536
Feature activation-3.57e-02

Pos logit contributions
ho+0.026
 than+0.025
ater+0.024
ike+0.024
ro+0.022
Neg logit contributions
 Everything-0.027
 emergency-0.024
 earthquake-0.023
 thunder-0.023
 lightning-0.023
Token as
Token ID355
Feature activation-1.36e-01

Pos logit contributions
ho+0.049
ater+0.044
outing+0.044
bel+0.043
ighed+0.041
Neg logit contributions
 Everything-0.076
 earthquake-0.070
 thunder-0.068
 emergency-0.068
 Beth-0.067
Token she
Token ID673
Feature activation+1.12e-03

Pos logit contributions
ho+0.203
outing+0.176
ater+0.175
 than+0.173
bel+0.165
Neg logit contributions
 Everything-0.280
 earthquake-0.261
 thunder-0.259
 nightmares-0.258
 emergency-0.257
Token took
Token ID1718
Feature activation+3.93e-03

Pos logit contributions
 Everything+0.002
 earthquake+0.002
 emergency+0.002
 bears+0.002
py+0.002
Neg logit contributions
 than-0.002
ho-0.002
outing-0.001
bel-0.001
s-0.001
Token a
Token ID257
Feature activation-2.50e-02

Pos logit contributions
py+0.009
 Everything+0.009
 earthquake+0.009
 emergency+0.009
 ambulance+0.009
Neg logit contributions
 than-0.004
 unt-0.004
 far-0.004
s-0.004
ho-0.004
Token big
Token ID1263
Feature activation+1.66e-02

Pos logit contributions
 than+0.036
ho+0.034
outing+0.031
ater+0.029
bel+0.029
Neg logit contributions
 Everything-0.052
 earthquake-0.049
 emergency-0.049
 hurricane-0.048
 thunder-0.047
Token bite
Token ID13197
Feature activation-4.59e-02

Pos logit contributions
 Everything+0.039
 earthquake+0.037
 emergency+0.036
py+0.036
 ambulance+0.035
Neg logit contributions
ho-0.019
 than-0.019
s-0.017
 unt-0.016
bel-0.016
Token fly
Token ID6129
Feature activation-4.04e-02

Pos logit contributions
 than+0.025
ho+0.025
 unt+0.022
s+0.022
bel+0.021
Neg logit contributions
 Everything-0.035
 earthquake-0.032
 emergency-0.032
py-0.031
loe-0.031
Token again
Token ID757
Feature activation+3.94e-02

Pos logit contributions
 than+0.050
ho+0.048
s+0.043
 far+0.041
 unt+0.040
Neg logit contributions
 Everything-0.096
 earthquake-0.090
 emergency-0.090
 ambulance-0.087
loe-0.087
Token and
Token ID290
Feature activation-9.96e-03

Pos logit contributions
 Everything+0.092
 earthquake+0.088
 emergency+0.087
py+0.085
 thunder+0.084
Neg logit contributions
ho-0.045
 than-0.043
s-0.038
outing-0.038
bel-0.037
Token all
Token ID477
Feature activation+1.81e-02

Pos logit contributions
ho+0.014
 than+0.014
s+0.012
 unt+0.012
bel+0.012
Neg logit contributions
 Everything-0.021
 earthquake-0.020
 emergency-0.020
py-0.020
 ambulance-0.019
Token cheered
Token ID34767
Feature activation-8.40e-03

Pos logit contributions
 Everything+0.040
 earthquake+0.038
 emergency+0.038
 ambulance+0.037
loe+0.036
Neg logit contributions
ho-0.024
 than-0.024
s-0.021
bel-0.020
 unt-0.020
Token as
Token ID355
Feature activation-1.33e-01

Pos logit contributions
ho+0.009
 than+0.009
s+0.008
outing+0.007
 unt+0.007
Neg logit contributions
 Everything-0.021
 earthquake-0.019
 emergency-0.019
py-0.019
 ambulance-0.019
Token the
Token ID262
Feature activation-1.35e-02

Pos logit contributions
ho+0.149
 than+0.148
s+0.127
 far+0.122
bel+0.121
Neg logit contributions
 Everything-0.319
 emergency-0.301
 earthquake-0.300
py-0.296
 ambulance-0.295
Token butterfly
Token ID35113
Feature activation-3.16e-02

Pos logit contributions
ho+0.022
 than+0.020
 far+0.020
 unt+0.019
ater+0.019
Neg logit contributions
 Everything-0.026
rell-0.025
loe-0.025
 Kenny-0.023
 Beth-0.023
Token flew
Token ID13112
Feature activation-3.08e-02

Pos logit contributions
ho+0.042
 than+0.042
s+0.037
 unt+0.036
bel+0.035
Neg logit contributions
 Everything-0.069
 earthquake-0.064
 emergency-0.064
py-0.062
 ambulance-0.062
Token away
Token ID1497
Feature activation-4.84e-03

Pos logit contributions
ho+0.047
 than+0.046
s+0.042
 unt+0.040
bel+0.040
Neg logit contributions
 Everything-0.062
 earthquake-0.058
 emergency-0.057
py-0.056
 ambulance-0.055
Token.
Token ID13
Feature activation-1.51e-03

Pos logit contributions
ho+0.006
outing+0.005
 than+0.005
ater+0.005
ucker+0.005
Neg logit contributions
 Everything-0.011
 earthquake-0.010
 thunder-0.010
 emergency-0.010
py-0.010
Token felt
Token ID2936
Feature activation-5.97e-03

Pos logit contributions
ho+0.070
 than+0.069
s+0.062
 unt+0.059
bel+0.058
Neg logit contributions
 Everything-0.122
 earthquake-0.115
 emergency-0.114
py-0.111
 ambulance-0.110
Token so
Token ID523
Feature activation-9.53e-03

Pos logit contributions
ho+0.010
 than+0.010
outing+0.009
ater+0.009
s+0.009
Neg logit contributions
 Everything-0.011
 emergency-0.010
 earthquake-0.010
 thunder-0.010
 ambulance-0.010
Token happy
Token ID3772
Feature activation-5.49e-03

Pos logit contributions
ho+0.016
 than+0.015
ater+0.014
outing+0.014
bel+0.014
Neg logit contributions
 Everything-0.019
 earthquake-0.016
 emergency-0.016
 thunder-0.016
 Everybody-0.016
Token and
Token ID290
Feature activation-4.01e-02

Pos logit contributions
ho+0.007
outing+0.006
ater+0.006
bel+0.006
 than+0.006
Neg logit contributions
 Everything-0.012
 earthquake-0.012
 thunder-0.011
 nightmares-0.011
 emergency-0.011
Token grateful
Token ID14066
Feature activation-2.03e-02

Pos logit contributions
ater+0.063
ho+0.062
outing+0.061
 than+0.058
ge+0.053
Neg logit contributions
 Everything-0.080
 earthquake-0.070
 thunder-0.070
 Beth-0.069
 Everybody-0.069
Token as
Token ID355
Feature activation-1.31e-01

Pos logit contributions
ater+0.027
ho+0.026
outing+0.026
 unt+0.025
bel+0.025
Neg logit contributions
 Everything-0.041
 earthquake-0.039
 thunder-0.038
 emergency-0.038
 nightmares-0.037
Token she
Token ID673
Feature activation+2.95e-03

Pos logit contributions
ho+0.170
 than+0.169
s+0.147
 unt+0.142
 far+0.141
Neg logit contributions
 Everything-0.294
 emergency-0.274
 earthquake-0.273
 ambulance-0.268
py-0.268
Token skipped
Token ID26684
Feature activation-3.29e-02

Pos logit contributions
 Everything+0.006
 earthquake+0.006
 emergency+0.006
py+0.006
 ambulance+0.006
Neg logit contributions
ho-0.004
 than-0.004
s-0.004
outing-0.004
bel-0.004
Token home
Token ID1363
Feature activation+1.43e-02

Pos logit contributions
 than+0.055
ho+0.054
s+0.050
 far+0.048
 unt+0.048
Neg logit contributions
 Everything-0.062
 earthquake-0.058
 emergency-0.058
py-0.056
 ambulance-0.054
Token.
Token ID13
Feature activation+3.24e-03

Pos logit contributions
 Everything+0.031
 earthquake+0.029
 emergency+0.029
 ambulance+0.029
 bears+0.029
Neg logit contributions
ho-0.018
outing-0.016
 than-0.016
ater-0.015
fish-0.014
Token She
Token ID1375
Feature activation+1.98e-02

Pos logit contributions
 emergency+0.006
 earthquake+0.006
 Everything+0.006
 ambulance+0.006
py+0.006
Neg logit contributions
 than-0.005
ho-0.005
s-0.004
 unt-0.004
 far-0.004
TokenThe
Token ID464
Feature activation-1.17e-02

Pos logit contributions
ho+0.001
 than+0.001
s+0.001
outing+0.001
ater+0.001
Neg logit contributions
 Everything-0.002
 emergency-0.002
 earthquake-0.002
 ambulance-0.002
py-0.002
Token other
Token ID584
Feature activation+2.13e-02

Pos logit contributions
ho+0.017
 than+0.016
 far+0.015
 unt+0.014
s+0.014
Neg logit contributions
 Everything-0.025
loe-0.023
 earthquake-0.023
py-0.023
rell-0.023
Token raven
Token ID37735
Feature activation-4.23e-02

Pos logit contributions
 Everything+0.038
 earthquake+0.035
 emergency+0.034
py+0.034
loe+0.033
Neg logit contributions
 than-0.038
ho-0.038
s-0.034
 far-0.033
 unt-0.033
Tokens
Token ID82
Feature activation-1.51e-02

Pos logit contributions
ho+0.054
 than+0.053
s+0.048
 unt+0.045
bel+0.044
Neg logit contributions
 Everything-0.095
 earthquake-0.090
 emergency-0.089
py-0.087
 ambulance-0.086
Token cheered
Token ID34767
Feature activation-1.62e-02

Pos logit contributions
ho+0.027
 than+0.026
s+0.023
 far+0.023
 unt+0.023
Neg logit contributions
 Everything-0.027
 earthquake-0.025
rell-0.025
py-0.024
 emergency-0.024
Token as
Token ID355
Feature activation-1.31e-01

Pos logit contributions
ho+0.018
 than+0.017
s+0.015
bel+0.015
outing+0.015
Neg logit contributions
 Everything-0.039
 earthquake-0.036
 emergency-0.036
py-0.036
 ambulance-0.036
Token the
Token ID262
Feature activation-7.80e-03

Pos logit contributions
ho+0.172
 than+0.169
s+0.151
bel+0.145
ater+0.145
Neg logit contributions
 Everything-0.287
 earthquake-0.268
 emergency-0.266
py-0.261
 ambulance-0.258
Token winner
Token ID8464
Feature activation-4.64e-02

Pos logit contributions
ho+0.011
 than+0.011
 far+0.010
 unt+0.010
s+0.010
Neg logit contributions
 Everything-0.016
loe-0.015
 earthquake-0.015
rell-0.015
py-0.015
Token walked
Token ID6807
Feature activation-7.46e-03

Pos logit contributions
ho+0.063
 than+0.061
outing+0.054
s+0.054
bel+0.054
Neg logit contributions
 Everything-0.099
 earthquake-0.092
 emergency-0.092
py-0.090
 ambulance-0.089
Token away
Token ID1497
Feature activation-1.07e-03

Pos logit contributions
ho+0.010
 than+0.010
s+0.009
bel+0.008
 unt+0.008
Neg logit contributions
 Everything-0.016
 earthquake-0.015
 emergency-0.015
py-0.015
 ambulance-0.015
Token proudly
Token ID23017
Feature activation-5.22e-02

Pos logit contributions
ho+0.001
outing+0.001
 than+0.001
ater+0.001
s+0.001
Neg logit contributions
 Everything-0.002
 earthquake-0.002
 emergency-0.002
 thunder-0.002
py-0.002
Token
Token ID198
Feature activation-9.48e-03

Pos logit contributions
ho+0.038
 than+0.037
bel+0.034
ater+0.034
s+0.034
Neg logit contributions
 Everything-0.034
 earthquake-0.030
 emergency-0.029
py-0.029
rell-0.029
Token
Token ID198
Feature activation-5.65e-03

Pos logit contributions
ho+0.015
 than+0.015
ater+0.013
bel+0.013
outing+0.013
Neg logit contributions
 Everything-0.019
loe-0.017
 emergency-0.017
py-0.017
 Everybody-0.016
TokenJim
Token ID18050
Feature activation-6.64e-02

Pos logit contributions
ho+0.010
 than+0.009
bel+0.009
ater+0.008
ucker+0.008
Neg logit contributions
 Everything-0.011
 Everybody-0.009
 emergency-0.009
loe-0.009
 earthquake-0.009
Token felt
Token ID2936
Feature activation-1.20e-02

Pos logit contributions
ho+0.087
 than+0.086
s+0.075
bel+0.073
outing+0.073
Neg logit contributions
 Everything-0.146
 earthquake-0.136
 emergency-0.135
py-0.133
 ambulance-0.132
Token satisfied
Token ID11378
Feature activation-3.37e-02

Pos logit contributions
ho+0.018
 than+0.017
s+0.016
ater+0.015
outing+0.015
Neg logit contributions
 Everything-0.025
 earthquake-0.023
 emergency-0.023
 ambulance-0.022
 thunder-0.022
Token as
Token ID355
Feature activation-1.30e-01

Pos logit contributions
ho+0.046
outing+0.039
s+0.038
ater+0.037
bel+0.036
Neg logit contributions
 Everything-0.074
 thunder-0.068
 earthquake-0.068
 emergency-0.068
 lightning-0.066
Token he
Token ID339
Feature activation-7.36e-03

Pos logit contributions
ho+0.173
 than+0.162
s+0.149
ater+0.148
outing+0.147
Neg logit contributions
 Everything-0.287
 earthquake-0.272
 emergency-0.266
 thunder-0.261
py-0.261
Token went
Token ID1816
Feature activation+6.85e-03

Pos logit contributions
ho+0.011
 than+0.010
outing+0.009
s+0.009
bel+0.009
Neg logit contributions
 Everything-0.015
 earthquake-0.014
 emergency-0.014
py-0.014
 thunder-0.014
Token back
Token ID736
Feature activation+1.51e-02

Pos logit contributions
 Everything+0.018
 earthquake+0.017
py+0.017
loe+0.017
rell+0.017
Neg logit contributions
 than-0.006
ho-0.005
s-0.005
 far-0.005
 unt-0.005
Token to
Token ID284
Feature activation-3.17e-02

Pos logit contributions
 Everything+0.042
 earthquake+0.040
 emergency+0.039
py+0.039
rell+0.038
Neg logit contributions
 than-0.012
ho-0.011
s-0.010
bel-0.009
 unt-0.009
Token his
Token ID465
Feature activation-1.51e-02

Pos logit contributions
 than+0.048
ho+0.047
 unt+0.043
 far+0.043
s+0.042
Neg logit contributions
 Everything-0.062
py-0.058
rell-0.058
 earthquake-0.057
 emergency-0.057
Token The
Token ID383
Feature activation+1.62e-02

Pos logit contributions
 Everything+0.014
 earthquake+0.012
 emergency+0.012
py+0.012
 ambulance+0.012
Neg logit contributions
ho-0.015
 than-0.014
s-0.013
bel-0.013
 unt-0.013
Token sock
Token ID32263
Feature activation-1.33e-02

Pos logit contributions
 Everything+0.031
 earthquake+0.028
 emergency+0.028
py+0.028
rell+0.028
Neg logit contributions
 than-0.026
ho-0.026
s-0.024
 unt-0.023
ater-0.023
Token seemed
Token ID3947
Feature activation-2.45e-02

Pos logit contributions
ho+0.018
 than+0.018
s+0.016
 unt+0.015
bel+0.015
Neg logit contributions
 Everything-0.029
 earthquake-0.027
 emergency-0.027
py-0.026
 ambulance-0.026
Token so
Token ID523
Feature activation-2.02e-02

Pos logit contributions
ho+0.038
ater+0.034
outing+0.034
atching+0.034
ucker+0.032
Neg logit contributions
 Everything-0.049
 thunder-0.046
 lightning-0.044
 nightmares-0.044
 nightmare-0.042
Token happy
Token ID3772
Feature activation-9.48e-03

Pos logit contributions
ho+0.036
ater+0.030
bel+0.030
 than+0.030
outing+0.029
Neg logit contributions
 Everything-0.039
 thunder-0.034
rell-0.034
 earthquake-0.034
 Everybody-0.034
Token as
Token ID355
Feature activation-1.30e-01

Pos logit contributions
ho+0.012
outing+0.011
ater+0.011
bel+0.011
 unt+0.010
Neg logit contributions
 Everything-0.021
 earthquake-0.020
 thunder-0.019
 emergency-0.019
 lightning-0.019
Token she
Token ID673
Feature activation+1.51e-02

Pos logit contributions
ho+0.183
outing+0.166
ater+0.164
 than+0.162
bel+0.154
Neg logit contributions
 Everything-0.268
 thunder-0.254
 lightning-0.251
py-0.251
 earthquake-0.249
Token skipped
Token ID26684
Feature activation-2.82e-02

Pos logit contributions
 Everything+0.030
py+0.027
 bears+0.027
 earthquake+0.027
 thunder+0.027
Neg logit contributions
 than-0.022
ho-0.022
outing-0.020
bel-0.019
s-0.019
Token around
Token ID1088
Feature activation+3.63e-03

Pos logit contributions
ho+0.043
 than+0.042
s+0.038
outing+0.037
bel+0.037
Neg logit contributions
 Everything-0.056
 earthquake-0.052
 emergency-0.051
py-0.051
 ambulance-0.050
Token.
Token ID13
Feature activation+5.31e-03

Pos logit contributions
 Everything+0.008
 earthquake+0.008
 emergency+0.008
py+0.008
rell+0.008
Neg logit contributions
 than-0.004
ho-0.004
s-0.004
 unt-0.004
 far-0.004
Token
Token ID198
Feature activation+1.13e-03

Pos logit contributions
 Everything+0.011
 earthquake+0.010
 emergency+0.010
py+0.010
 ambulance+0.010
Neg logit contributions
ho-0.008
 than-0.008
s-0.007
bel-0.007
 unt-0.007
TokenThe
Token ID464
Feature activation-1.41e-02

Pos logit contributions
ho+0.006
 than+0.005
s+0.005
outing+0.005
ater+0.004
Neg logit contributions
 Everything-0.010
 Everybody-0.009
 emergency-0.009
 earthquake-0.009
loe-0.009
Token farmer
Token ID18739
Feature activation-5.61e-02

Pos logit contributions
ho+0.021
 than+0.020
 far+0.018
s+0.017
 unt+0.017
Neg logit contributions
 Everything-0.030
loe-0.028
rell-0.027
 nightmares-0.026
 Beth-0.026
Token was
Token ID373
Feature activation+4.06e-04

Pos logit contributions
ho+0.068
 than+0.065
s+0.059
 unt+0.057
 far+0.054
Neg logit contributions
 Everything-0.132
 earthquake-0.126
 emergency-0.124
py-0.120
rell-0.120
Token very
Token ID845
Feature activation-1.72e-02

Pos logit contributions
 Everything+0.001
 earthquake+0.001
 emergency+0.001
py+0.001
rell+0.001
Neg logit contributions
 than-0.001
ho-0.001
 unt-0.001
 far-0.001
s-0.001
Token pleased
Token ID10607
Feature activation-2.91e-02

Pos logit contributions
ho+0.030
ater+0.027
 than+0.027
eed+0.026
s+0.026
Neg logit contributions
 Everything-0.028
 emergency-0.027
 earthquake-0.026
 hurricane-0.026
 thunder-0.026
Token as
Token ID355
Feature activation-1.29e-01

Pos logit contributions
ho+0.038
ater+0.035
outing+0.034
bel+0.033
ighed+0.032
Neg logit contributions
 Everything-0.061
 bears-0.056
 thunder-0.055
py-0.055
 Wolf-0.055
Token the
Token ID262
Feature activation-1.99e-02

Pos logit contributions
ho+0.169
 than+0.165
s+0.147
outing+0.142
ater+0.142
Neg logit contributions
 Everything-0.286
 earthquake-0.268
 emergency-0.266
py-0.262
 thunder-0.257
Token elephants
Token ID25368
Feature activation+9.26e-03

Pos logit contributions
ho+0.031
 than+0.030
 far+0.030
 unt+0.027
 respectful+0.027
Neg logit contributions
loe-0.038
 Everything-0.038
rell-0.035
 nightmares-0.033
py-0.033
Token finally
Token ID3443
Feature activation-2.32e-02

Pos logit contributions
 Everything+0.019
 earthquake+0.018
 emergency+0.018
py+0.018
 ambulance+0.017
Neg logit contributions
ho-0.013
 than-0.013
s-0.012
 unt-0.011
bel-0.011
Token made
Token ID925
Feature activation-2.52e-02

Pos logit contributions
ho+0.037
 than+0.036
outing+0.035
bel+0.033
ater+0.033
Neg logit contributions
 Everything-0.042
py-0.039
 earthquake-0.038
rell-0.038
 bears-0.038
Token it
Token ID340
Feature activation-6.90e-03

Pos logit contributions
 than+0.032
ho+0.030
s+0.029
 far+0.028
 unt+0.028
Neg logit contributions
 Everything-0.055
 earthquake-0.053
py-0.052
loe-0.051
rell-0.051
Token<|endoftext|>
Token ID50256
Feature activation-9.40e-03

Pos logit contributions
 emergency+0.020
 earthquake+0.020
 ambulance+0.019
py+0.019
 hurricane+0.019
Neg logit contributions
 than-0.008
s-0.007
ho-0.007
 far-0.006
 unt-0.006
TokenThe
Token ID464
Feature activation+1.36e-02

Pos logit contributions
s+0.011
ho+0.010
 than+0.009
 far+0.009
 with+0.008
Neg logit contributions
 earthquake-0.023
 emergency-0.023
 ambulance-0.022
 hurricane-0.022
py-0.021
Token sun
Token ID4252
Feature activation+4.82e-03

Pos logit contributions
 Everything+0.029
loe+0.029
rell+0.029
py+0.027
 emergency+0.027
Neg logit contributions
ho-0.017
 far-0.017
 than-0.017
s-0.015
fish-0.014
Token was
Token ID373
Feature activation+3.45e-03

Pos logit contributions
 Everything+0.011
 earthquake+0.011
 emergency+0.011
 ambulance+0.011
 nightmares+0.011
Neg logit contributions
ho-0.006
s-0.006
 than-0.005
fish-0.005
bel-0.004
Token shining
Token ID22751
Feature activation+1.89e-02

Pos logit contributions
 Everything+0.006
 earthquake+0.006
 emergency+0.006
py+0.006
loe+0.006
Neg logit contributions
 than-0.006
ho-0.006
 far-0.005
s-0.005
 unt-0.005
Token as
Token ID355
Feature activation-1.28e-01

Pos logit contributions
 Everything+0.044
 earthquake+0.041
 emergency+0.041
py+0.041
loe+0.040
Neg logit contributions
 than-0.023
ho-0.022
s-0.020
 far-0.019
 unt-0.018
Token Lucy
Token ID22162
Feature activation+1.33e-02

Pos logit contributions
 than+0.170
ho+0.168
s+0.149
 far+0.143
bel+0.142
Neg logit contributions
 Everything-0.282
 earthquake-0.262
 emergency-0.262
py-0.259
 ambulance-0.256
Token stepped
Token ID10764
Feature activation-3.12e-03

Pos logit contributions
 Everything+0.026
 earthquake+0.024
 emergency+0.024
 ambulance+0.023
py+0.023
Neg logit contributions
ho-0.020
 than-0.020
outing-0.018
bel-0.018
ater-0.017
Token out
Token ID503
Feature activation-3.46e-03

Pos logit contributions
ho+0.005
 than+0.005
s+0.005
 far+0.004
 unt+0.004
Neg logit contributions
 Everything-0.006
 earthquake-0.006
 emergency-0.006
py-0.005
loe-0.005
Token into
Token ID656
Feature activation-1.67e-02

Pos logit contributions
 than+0.004
ho+0.004
s+0.004
bel+0.003
 unt+0.003
Neg logit contributions
 Everything-0.008
 earthquake-0.008
 emergency-0.008
py-0.007
 ambulance-0.007
Token the
Token ID262
Feature activation+3.47e-02

Pos logit contributions
 than+0.018
ho+0.017
 far+0.016
 unt+0.016
s+0.015
Neg logit contributions
 Everything-0.039
py-0.038
 earthquake-0.037
rell-0.037
 emergency-0.037
Token said
Token ID531
Feature activation+3.93e-03

Pos logit contributions
ho+0.029
 than+0.029
ater+0.026
outing+0.026
bel+0.025
Neg logit contributions
 Everything-0.048
 earthquake-0.044
 emergency-0.044
py-0.044
 bears-0.043
Token goodbye
Token ID24829
Feature activation-3.28e-02

Pos logit contributions
 Everything+0.008
 earthquake+0.007
 emergency+0.007
py+0.007
 ambulance+0.007
Neg logit contributions
ho-0.006
 than-0.006
s-0.005
bel-0.005
 unt-0.005
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
ho+0.038
outing+0.037
ucker+0.036
ater+0.035
fish+0.033
Neg logit contributions
 Everything-0.071
 Wolf-0.065
 earthquake-0.065
 thunder-0.065
 emergency-0.064
Token Luke
Token ID11336
Feature activation-5.96e-02

Pos logit contributions
ho+0.045
 than+0.044
bel+0.041
ater+0.040
outing+0.040
Neg logit contributions
 Everything-0.037
 earthquake-0.031
 Everybody-0.030
 emergency-0.030
loe-0.030
Token smiled
Token ID13541
Feature activation-3.44e-02

Pos logit contributions
 than+0.084
ho+0.083
outing+0.073
s+0.073
bel+0.072
Neg logit contributions
 Everything-0.125
 earthquake-0.114
 emergency-0.113
py-0.113
 ambulance-0.111
Token as
Token ID355
Feature activation-1.28e-01

Pos logit contributions
ho+0.048
ater+0.043
 than+0.042
outing+0.042
bel+0.040
Neg logit contributions
 Everything-0.073
 thunder-0.066
 bears-0.065
py-0.064
 ambulance-0.064
Token he
Token ID339
Feature activation-5.48e-03

Pos logit contributions
ho+0.160
 than+0.156
s+0.138
ater+0.135
outing+0.135
Neg logit contributions
 Everything-0.293
 earthquake-0.274
 emergency-0.270
py-0.266
 thunder-0.264
Token watched
Token ID7342
Feature activation+2.19e-02

Pos logit contributions
 than+0.008
ho+0.008
outing+0.007
s+0.007
bel+0.007
Neg logit contributions
 Everything-0.011
 earthquake-0.010
 emergency-0.010
py-0.010
 bears-0.010
Token the
Token ID262
Feature activation-2.92e-03

Pos logit contributions
 Everything+0.049
 earthquake+0.046
 emergency+0.045
py+0.045
rell+0.045
Neg logit contributions
 than-0.028
ho-0.027
s-0.024
 unt-0.024
bel-0.023
Token deaf
Token ID21815
Feature activation+3.32e-04

Pos logit contributions
 than+0.005
ho+0.005
s+0.004
 unt+0.004
bel+0.004
Neg logit contributions
 Everything-0.006
 earthquake-0.005
py-0.005
 emergency-0.005
loe-0.005
Token man
Token ID582
Feature activation-2.19e-02

Pos logit contributions
 Everything+0.001
 earthquake+0.001
 emergency+0.001
py+0.001
 ambulance+0.001
Neg logit contributions
ho-0.000
 than-0.000
s-0.000
bel-0.000
 unt-0.000
Token oak
Token ID26210
Feature activation+1.92e-02

Pos logit contributions
 Everything+0.008
loe+0.008
rell+0.007
py+0.007
 earthquake+0.007
Neg logit contributions
ho-0.006
 far-0.005
 than-0.005
fish-0.005
s-0.005
Token tree
Token ID5509
Feature activation-1.72e-02

Pos logit contributions
 Everything+0.039
 earthquake+0.037
 emergency+0.037
loe+0.036
py+0.036
Neg logit contributions
ho-0.029
 than-0.028
s-0.027
 far-0.025
 unt-0.024
Token felt
Token ID2936
Feature activation-1.56e-02

Pos logit contributions
ho+0.024
 than+0.024
s+0.021
ater+0.021
bel+0.021
Neg logit contributions
 Everything-0.036
 earthquake-0.033
 emergency-0.033
py-0.033
 ambulance-0.032
Token so
Token ID523
Feature activation-1.40e-02

Pos logit contributions
ho+0.027
 than+0.025
s+0.024
outing+0.023
ater+0.023
Neg logit contributions
 Everything-0.030
 earthquake-0.027
 emergency-0.026
 thunder-0.026
 lightning-0.025
Token happy
Token ID3772
Feature activation-1.60e-02

Pos logit contributions
ho+0.023
ater+0.021
 than+0.021
ber+0.020
outing+0.020
Neg logit contributions
 Everything-0.027
 earthquake-0.025
 thunder-0.024
 hurricane-0.024
 lightning-0.024
Token as
Token ID355
Feature activation-1.28e-01

Pos logit contributions
ho+0.019
outing+0.017
ater+0.016
 than+0.016
bel+0.016
Neg logit contributions
 Everything-0.037
 earthquake-0.035
 thunder-0.035
 emergency-0.034
py-0.034
Token it
Token ID340
Feature activation+5.72e-03

Pos logit contributions
ho+0.185
 than+0.177
outing+0.166
ater+0.166
s+0.164
Neg logit contributions
 Everything-0.263
 earthquake-0.246
 thunder-0.241
 emergency-0.240
 lightning-0.239
Token sailed
Token ID33835
Feature activation-3.93e-02

Pos logit contributions
 Everything+0.011
py+0.010
 earthquake+0.010
 thunder+0.010
 lightning+0.010
Neg logit contributions
ho-0.009
 than-0.009
outing-0.008
ater-0.008
s-0.008
Token across
Token ID1973
Feature activation-2.42e-02

Pos logit contributions
ho+0.059
 than+0.057
s+0.053
outing+0.052
bel+0.052
Neg logit contributions
 Everything-0.078
 earthquake-0.072
 emergency-0.071
py-0.071
 ambulance-0.069
Token the
Token ID262
Feature activation+5.35e-03

Pos logit contributions
ho+0.031
 than+0.031
s+0.027
 far+0.026
 unt+0.026
Neg logit contributions
 Everything-0.054
 earthquake-0.051
 emergency-0.051
py-0.050
 ambulance-0.050
Token lake
Token ID13546
Feature activation-2.93e-02

Pos logit contributions
 Everything+0.010
loe+0.010
py+0.009
 nightmares+0.009
 emergency+0.008
Neg logit contributions
ho-0.009
 than-0.009
 far-0.009
s-0.008
 unt-0.008
Token k
Token ID479
Feature activation+6.41e-05

Pos logit contributions
 Everything+0.019
py+0.019
loe+0.018
 emergency+0.018
 earthquake+0.018
Neg logit contributions
 than-0.012
ho-0.011
 far-0.010
s-0.010
 unt-0.010
Tokenite
Token ID578
Feature activation-3.05e-02

Pos logit contributions
 Everything+0.000
 earthquake+0.000
 emergency+0.000
py+0.000
 ambulance+0.000
Neg logit contributions
ho-0.000
 than-0.000
s-0.000
 unt-0.000
bel-0.000
Token looked
Token ID3114
Feature activation-1.62e-02

Pos logit contributions
 than+0.042
ho+0.042
ater+0.037
outing+0.037
s+0.037
Neg logit contributions
 Everything-0.064
 earthquake-0.059
 emergency-0.059
py-0.058
 ambulance-0.057
Token so
Token ID523
Feature activation-2.44e-02

Pos logit contributions
ho+0.028
 than+0.026
ater+0.024
bel+0.024
outing+0.024
Neg logit contributions
 Everything-0.030
 emergency-0.027
 earthquake-0.027
 thunder-0.027
 lightning-0.026
Token adorable
Token ID29012
Feature activation-9.43e-03

Pos logit contributions
ho+0.037
 than+0.035
ater+0.034
bel+0.032
outing+0.032
Neg logit contributions
 Everything-0.050
 thunder-0.045
 earthquake-0.045
rell-0.044
 lightning-0.044
Token as
Token ID355
Feature activation-1.28e-01

Pos logit contributions
ho+0.013
bel+0.013
outing+0.012
ater+0.012
ucker+0.012
Neg logit contributions
 Everything-0.019
 thunder-0.017
 earthquake-0.017
 bears-0.017
 emergency-0.017
Token it
Token ID340
Feature activation+1.10e-03

Pos logit contributions
ho+0.174
 than+0.167
s+0.151
ater+0.149
outing+0.149
Neg logit contributions
 Everything-0.274
 earthquake-0.257
 emergency-0.253
py-0.251
 thunder-0.249
Token shot
Token ID2823
Feature activation-4.71e-02

Pos logit contributions
 Everything+0.002
py+0.002
 thunder+0.002
 lightning+0.002
rell+0.002
Neg logit contributions
 than-0.002
ho-0.002
outing-0.002
ater-0.002
 cheaply-0.001
Token up
Token ID510
Feature activation-1.60e-02

Pos logit contributions
 than+0.068
ho+0.067
s+0.061
 far+0.056
 unt+0.056
Neg logit contributions
 Everything-0.100
 earthquake-0.096
 emergency-0.095
loe-0.092
 ambulance-0.092
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
ho+0.018
 than+0.018
s+0.016
bel+0.015
outing+0.015
Neg logit contributions
 Everything-0.038
 earthquake-0.035
 emergency-0.035
py-0.035
 ambulance-0.034
Token down
Token ID866
Feature activation+9.05e-03

Pos logit contributions
ho+0.017
 than+0.016
s+0.015
 unt+0.015
 far+0.015
Neg logit contributions
 Everything-0.024
 emergency-0.023
loe-0.023
 earthquake-0.023
py-0.023
Token.
Token ID13
Feature activation+8.43e-03

Pos logit contributions
 thunder+0.012
 earthquake+0.012
 lightning+0.012
 Everything+0.012
 emergency+0.012
Neg logit contributions
ho-0.010
outing-0.009
 than-0.008
s-0.008
bel-0.008
Token Ruby
Token ID10888
Feature activation-1.47e-02

Pos logit contributions
 emergency+0.014
 earthquake+0.014
 ambulance+0.013
 hurricane+0.013
py+0.013
Neg logit contributions
 than-0.014
s-0.014
ho-0.014
 unt-0.013
 far-0.013
Token was
Token ID373
Feature activation+1.45e-03

Pos logit contributions
 than+0.021
ho+0.021
outing+0.018
bel+0.018
s+0.018
Neg logit contributions
 Everything-0.030
 earthquake-0.028
 emergency-0.028
py-0.027
 ambulance-0.027
Token so
Token ID523
Feature activation-1.04e-02

Pos logit contributions
 Everything+0.003
 earthquake+0.003
 emergency+0.003
py+0.003
loe+0.003
Neg logit contributions
 than-0.002
ho-0.002
s-0.002
 unt-0.002
 far-0.002
Token sad
Token ID6507
Feature activation+3.26e-03

Pos logit contributions
 than+0.019
ho+0.019
ater+0.019
eed+0.017
outing+0.017
Neg logit contributions
 Everything-0.017
 earthquake-0.015
 emergency-0.015
 thunder-0.015
 lightning-0.015
Token as
Token ID355
Feature activation-1.27e-01

Pos logit contributions
 Everything+0.007
 earthquake+0.007
 emergency+0.007
 thunder+0.007
 ambulance+0.006
Neg logit contributions
ho-0.004
bel-0.004
ater-0.004
outing-0.004
 than-0.004
Token she
Token ID673
Feature activation+1.07e-02

Pos logit contributions
ho+0.169
 than+0.158
outing+0.157
ater+0.156
s+0.148
Neg logit contributions
 Everything-0.269
 earthquake-0.261
py-0.259
 thunder-0.257
 lightning-0.257
Token had
Token ID550
Feature activation-1.22e-03

Pos logit contributions
 Everything+0.023
 earthquake+0.021
py+0.021
 emergency+0.021
 bears+0.021
Neg logit contributions
 than-0.015
ho-0.014
bel-0.013
s-0.012
outing-0.012
Token lost
Token ID2626
Feature activation-9.78e-03

Pos logit contributions
ho+0.002
 than+0.002
outing+0.002
bel+0.002
s+0.002
Neg logit contributions
 Everything-0.002
 earthquake-0.002
 emergency-0.002
 thunder-0.002
 nightmares-0.002
Token her
Token ID607
Feature activation-1.21e-02

Pos logit contributions
 than+0.013
ho+0.012
 unt+0.011
s+0.010
 far+0.010
Neg logit contributions
 Everything-0.022
 earthquake-0.021
 emergency-0.021
py-0.020
loe-0.020
Token sweet
Token ID6029
Feature activation-8.88e-03

Pos logit contributions
ho+0.018
outing+0.017
bel+0.017
 than+0.016
s+0.016
Neg logit contributions
 emergency-0.023
 earthquake-0.023
 operation-0.022
 Everything-0.022
 nightmare-0.021
TokenThe
Token ID464
Feature activation-2.25e-03

Pos logit contributions
 Everything+0.016
 earthquake+0.015
 emergency+0.015
py+0.015
 ambulance+0.014
Neg logit contributions
ho-0.007
 than-0.007
s-0.006
 unt-0.006
bel-0.006
Token rock
Token ID3881
Feature activation-2.60e-02

Pos logit contributions
ho+0.004
 than+0.003
 far+0.003
 unt+0.003
fish+0.003
Neg logit contributions
 Everything-0.005
loe-0.004
rell-0.004
py-0.004
aries-0.004
Token felt
Token ID2936
Feature activation-1.53e-02

Pos logit contributions
ho+0.036
 than+0.035
s+0.031
bel+0.030
 unt+0.030
Neg logit contributions
 Everything-0.056
 earthquake-0.052
 emergency-0.052
py-0.051
 ambulance-0.050
Token so
Token ID523
Feature activation-1.52e-02

Pos logit contributions
ho+0.025
 than+0.024
s+0.022
outing+0.022
ater+0.022
Neg logit contributions
 Everything-0.030
 earthquake-0.027
 emergency-0.027
 thunder-0.026
py-0.026
Token happy
Token ID3772
Feature activation-1.12e-02

Pos logit contributions
ho+0.026
 than+0.023
ater+0.022
bel+0.022
outing+0.022
Neg logit contributions
 Everything-0.030
 earthquake-0.027
 thunder-0.027
 emergency-0.026
 lightning-0.026
Token as
Token ID355
Feature activation-1.27e-01

Pos logit contributions
ho+0.014
outing+0.012
ater+0.012
bel+0.012
 than+0.011
Neg logit contributions
 Everything-0.025
 earthquake-0.024
 thunder-0.024
 emergency-0.023
 lightning-0.023
Token it
Token ID340
Feature activation+1.69e-03

Pos logit contributions
ho+0.191
 than+0.180
outing+0.172
ater+0.171
s+0.168
Neg logit contributions
 Everything-0.255
 earthquake-0.239
 thunder-0.235
 lightning-0.233
 emergency-0.233
Token z
Token ID1976
Feature activation+1.60e-02

Pos logit contributions
 Everything+0.003
py+0.003
 earthquake+0.003
 emergency+0.003
 thunder+0.003
Neg logit contributions
ho-0.003
 than-0.002
outing-0.002
s-0.002
ater-0.002
Tokenipped
Token ID3949
Feature activation-1.95e-02

Pos logit contributions
 Everything+0.022
py+0.020
 earthquake+0.020
 emergency+0.019
loe+0.019
Neg logit contributions
 than-0.034
ho-0.033
s-0.031
 unt-0.030
bel-0.030
Token around
Token ID1088
Feature activation-1.17e-02

Pos logit contributions
 than+0.028
ho+0.026
s+0.024
 far+0.024
 unt+0.023
Neg logit contributions
 Everything-0.043
 earthquake-0.041
 emergency-0.040
loe-0.040
py-0.039
Token the
Token ID262
Feature activation+8.85e-03

Pos logit contributions
 than+0.014
ho+0.013
 far+0.011
s+0.011
 unt+0.011
Neg logit contributions
 Everything-0.028
 emergency-0.027
rell-0.026
 earthquake-0.026
 ambulance-0.026
Token.
Token ID13
Feature activation-8.22e-04

Pos logit contributions
ho+0.002
outing+0.002
ater+0.002
 than+0.002
bel+0.002
Neg logit contributions
 Everything-0.004
 earthquake-0.004
 emergency-0.004
 thunder-0.004
 bears-0.003
Token His
Token ID2399
Feature activation-2.05e-02

Pos logit contributions
 than+0.001
ho+0.001
s+0.001
 unt+0.001
bel+0.001
Neg logit contributions
 Everything-0.001
 emergency-0.001
 earthquake-0.001
py-0.001
 ambulance-0.001
Token eyes
Token ID2951
Feature activation-4.33e-02

Pos logit contributions
 than+0.029
ho+0.028
s+0.027
 far+0.027
 unt+0.026
Neg logit contributions
 Everything-0.041
rell-0.040
py-0.039
bridge-0.036
loe-0.036
Token spark
Token ID9009
Feature activation+5.43e-03

Pos logit contributions
ho+0.073
 than+0.073
s+0.066
ater+0.065
outing+0.065
Neg logit contributions
 Everything-0.078
 earthquake-0.071
 emergency-0.071
py-0.070
rell-0.069
Tokenled
Token ID992
Feature activation-5.61e-02

Pos logit contributions
 Everything+0.012
 emergency+0.011
 earthquake+0.011
py+0.011
rell+0.011
Neg logit contributions
ho-0.007
 than-0.007
outing-0.006
ater-0.006
 unt-0.006
Token as
Token ID355
Feature activation-1.27e-01

Pos logit contributions
ho+0.074
outing+0.065
ater+0.065
 than+0.065
ucker+0.065
Neg logit contributions
 Everything-0.118
 thunder-0.111
 emergency-0.110
 earthquake-0.109
 lightning-0.109
Token he
Token ID339
Feature activation-5.48e-03

Pos logit contributions
ho+0.151
 than+0.150
s+0.131
bel+0.125
 unt+0.124
Neg logit contributions
 Everything-0.296
 earthquake-0.278
 emergency-0.276
py-0.271
 ambulance-0.269
Token watched
Token ID7342
Feature activation+1.91e-02

Pos logit contributions
ho+0.008
 than+0.008
outing+0.007
s+0.007
ater+0.007
Neg logit contributions
 Everything-0.011
 earthquake-0.010
 emergency-0.010
py-0.010
 bears-0.010
Token the
Token ID262
Feature activation-4.20e-03

Pos logit contributions
 Everything+0.045
 earthquake+0.042
 emergency+0.042
py+0.042
rell+0.042
Neg logit contributions
 than-0.023
ho-0.022
s-0.019
 far-0.018
 unt-0.018
Token hero
Token ID4293
Feature activation-1.19e-02

Pos logit contributions
s+0.009
 than+0.009
 far+0.008
ho+0.008
 unt+0.008
Neg logit contributions
 Everything-0.005
py-0.005
loe-0.005
mo-0.005
rell-0.004
Token's
Token ID338
Feature activation-1.09e-02

Pos logit contributions
 than+0.016
ho+0.016
s+0.015
 unt+0.014
bel+0.013
Neg logit contributions
 Everything-0.026
 earthquake-0.025
 emergency-0.024
py-0.024
loe-0.023
Token,
Token ID11
Feature activation+5.89e-03

Pos logit contributions
 Everything+0.026
py+0.026
 earthquake+0.025
 emergency+0.024
 nightmares+0.024
Neg logit contributions
ho-0.010
 than-0.010
s-0.009
 unt-0.009
 far-0.009
Token the
Token ID262
Feature activation-2.20e-02

Pos logit contributions
 Everything+0.014
 earthquake+0.013
 emergency+0.013
py+0.013
 ambulance+0.013
Neg logit contributions
ho-0.007
 than-0.007
s-0.006
bel-0.006
 unt-0.006
Token box
Token ID3091
Feature activation-1.30e-03

Pos logit contributions
ho+0.032
 than+0.031
s+0.029
 unt+0.028
bel+0.028
Neg logit contributions
 Everything-0.045
py-0.042
 earthquake-0.042
 emergency-0.042
rell-0.041
Token looked
Token ID3114
Feature activation-2.53e-02

Pos logit contributions
 than+0.002
ho+0.002
ater+0.002
outing+0.002
eed+0.002
Neg logit contributions
 Everything-0.002
 emergency-0.002
 earthquake-0.002
py-0.002
 ambulance-0.002
Token good
Token ID922
Feature activation-1.87e-02

Pos logit contributions
ho+0.048
ater+0.040
 than+0.040
outing+0.040
ber+0.039
Neg logit contributions
 Everything-0.044
 emergency-0.040
 earthquake-0.039
 thunder-0.039
 lightning-0.038
Token as
Token ID355
Feature activation-1.27e-01

Pos logit contributions
ho+0.029
bel+0.028
outing+0.026
chard+0.025
ighed+0.025
Neg logit contributions
 Everything-0.034
 earthquake-0.033
 bears-0.032
 emergency-0.032
 nightmares-0.031
Token new
Token ID649
Feature activation+1.70e-02

Pos logit contributions
ho+0.200
 than+0.182
ater+0.173
outing+0.172
s+0.168
Neg logit contributions
 Everything-0.250
 earthquake-0.234
 emergency-0.231
 thunder-0.229
 lightning-0.228
Token.
Token ID13
Feature activation-5.10e-03

Pos logit contributions
 Everything+0.040
 earthquake+0.039
 emergency+0.038
 ambulance+0.037
py+0.037
Neg logit contributions
ho-0.019
 than-0.017
outing-0.016
s-0.015
bel-0.015
Token
Token ID198
Feature activation-5.80e-03

Pos logit contributions
ho+0.009
 than+0.009
s+0.008
 unt+0.008
bel+0.008
Neg logit contributions
 Everything-0.009
 earthquake-0.009
 emergency-0.008
py-0.008
 ambulance-0.008
Token
Token ID198
Feature activation-2.85e-03

Pos logit contributions
ho+0.009
 than+0.009
outing+0.008
ater+0.008
bel+0.008
Neg logit contributions
 Everything-0.012
 emergency-0.011
 earthquake-0.011
py-0.010
loe-0.010
TokenThe
Token ID464
Feature activation-1.29e-02

Pos logit contributions
ho+0.004
 than+0.004
outing+0.004
bec+0.003
ater+0.003
Neg logit contributions
 Everything-0.006
 Everybody-0.005
loe-0.005
 emergency-0.005
 earthquake-0.005
Token new
Token ID649
Feature activation+2.19e-02

Pos logit contributions
ho+0.156
 than+0.148
s+0.135
outing+0.134
ater+0.133
Neg logit contributions
 Everything-0.218
 earthquake-0.206
 emergency-0.204
py-0.199
 thunder-0.198
Token!
Token ID0
Feature activation-1.77e-02

Pos logit contributions
 Everything+0.056
py+0.052
loe+0.052
 earthquake+0.052
 emergency+0.051
Neg logit contributions
 than-0.024
ho-0.021
s-0.019
 unt-0.018
bel-0.017
Token<|endoftext|>
Token ID50256
Feature activation-1.19e-02

Pos logit contributions
 than+0.021
ho+0.021
s+0.020
 unt+0.018
bel+0.017
Neg logit contributions
 Everything-0.040
 earthquake-0.039
 emergency-0.039
py-0.038
 ambulance-0.037
TokenJimmy
Token ID40335
Feature activation-4.53e-03

Pos logit contributions
s+0.011
 than+0.009
ho+0.009
 far+0.009
 with+0.008
Neg logit contributions
 earthquake-0.031
 emergency-0.031
 hurricane-0.030
py-0.029
 ambulance-0.029
Token smiled
Token ID13541
Feature activation-1.47e-02

Pos logit contributions
ho+0.005
 than+0.005
outing+0.005
bel+0.005
s+0.005
Neg logit contributions
 Everything-0.010
 earthquake-0.010
 emergency-0.010
py-0.010
 ambulance-0.009
Token as
Token ID355
Feature activation-1.27e-01

Pos logit contributions
ho+0.020
 than+0.017
outing+0.016
ater+0.016
 unt+0.016
Neg logit contributions
 Everything-0.032
 ambulance-0.030
 thunder-0.030
 cartoons-0.029
 earthquake-0.029
Token he
Token ID339
Feature activation-1.86e-03

Pos logit contributions
ho+0.189
 than+0.176
outing+0.163
s+0.162
ater+0.161
Neg logit contributions
 Everything-0.265
 earthquake-0.244
 emergency-0.239
 thunder-0.238
py-0.235
Token raced
Token ID32897
Feature activation-1.89e-02

Pos logit contributions
ho+0.003
 than+0.002
s+0.002
bel+0.002
 unt+0.002
Neg logit contributions
 Everything-0.004
 earthquake-0.004
 emergency-0.004
py-0.004
 ambulance-0.004
Token down
Token ID866
Feature activation+6.20e-03

Pos logit contributions
 than+0.020
ho+0.019
s+0.018
 unt+0.016
bel+0.016
Neg logit contributions
 Everything-0.047
 earthquake-0.044
 emergency-0.044
py-0.044
loe-0.043
Token the
Token ID262
Feature activation+4.17e-03

Pos logit contributions
 Everything+0.012
 earthquake+0.011
 thunder+0.011
 emergency+0.011
 lightning+0.011
Neg logit contributions
ho-0.010
 than-0.010
outing-0.009
ater-0.009
ucker-0.009
Token hill
Token ID12788
Feature activation-1.42e-02

Pos logit contributions
 Everything+0.007
 earthquake+0.007
 emergency+0.007
 ambulance+0.006
rell+0.006
Neg logit contributions
ho-0.007
 than-0.007
s-0.007
outing-0.007
bel-0.007
Token and
Token ID290
Feature activation-2.75e-02

Pos logit contributions
ho+0.012
 than+0.010
outing+0.010
ater+0.010
bel+0.010
Neg logit contributions
 Everything-0.024
 earthquake-0.023
 emergency-0.023
 thunder-0.023
 bears-0.022
Token the
Token ID262
Feature activation-1.50e-02

Pos logit contributions
ho+0.038
 than+0.038
ater+0.036
outing+0.035
s+0.032
Neg logit contributions
 Everything-0.059
 earthquake-0.053
 thunder-0.052
 emergency-0.052
py-0.052
Token hero
Token ID4293
Feature activation-3.47e-02

Pos logit contributions
 far+0.022
 unt+0.021
 than+0.020
 carefully+0.019
 with+0.019
Neg logit contributions
 Everything-0.029
rell-0.028
loe-0.028
py-0.027
mo-0.027
Token was
Token ID373
Feature activation-1.53e-03

Pos logit contributions
 than+0.043
ho+0.043
s+0.040
 unt+0.038
bel+0.035
Neg logit contributions
 Everything-0.080
 earthquake-0.077
 emergency-0.076
 hurricane-0.072
 ambulance-0.072
Token cheered
Token ID34767
Feature activation-1.25e-02

Pos logit contributions
 than+0.002
ho+0.002
 unt+0.002
s+0.002
 far+0.002
Neg logit contributions
 Everything-0.003
 earthquake-0.003
 emergency-0.003
py-0.003
loe-0.003
Token as
Token ID355
Feature activation-1.26e-01

Pos logit contributions
ho+0.015
 than+0.014
ater+0.013
s+0.013
outing+0.013
Neg logit contributions
 Everything-0.029
 emergency-0.027
 earthquake-0.027
py-0.027
 thunder-0.026
Token a
Token ID257
Feature activation-1.77e-02

Pos logit contributions
ho+0.156
 than+0.155
s+0.136
bel+0.130
 unt+0.129
Neg logit contributions
 Everything-0.289
 earthquake-0.271
 emergency-0.270
py-0.266
 ambulance-0.263
Token hero
Token ID4293
Feature activation-3.23e-03

Pos logit contributions
ho+0.026
 than+0.025
 unt+0.023
s+0.023
 far+0.023
Neg logit contributions
 Everything-0.037
py-0.034
rell-0.034
loe-0.034
 earthquake-0.033
Token.
Token ID13
Feature activation-3.69e-03

Pos logit contributions
ho+0.004
 than+0.003
s+0.003
 unt+0.003
bel+0.003
Neg logit contributions
 Everything-0.008
 earthquake-0.008
 emergency-0.007
py-0.007
loe-0.007
Token
Token ID220
Feature activation+4.66e-02

Pos logit contributions
ho+0.006
 than+0.006
s+0.006
 unt+0.005
bel+0.005
Neg logit contributions
 Everything-0.007
 earthquake-0.006
 emergency-0.006
py-0.006
 ambulance-0.006
Token
Token ID198
Feature activation-7.73e-03

Pos logit contributions
 Everything+0.106
 earthquake+0.102
 emergency+0.101
py+0.099
 ambulance+0.098
Neg logit contributions
 than-0.055
ho-0.055
s-0.050
 unt-0.047
bel-0.046
Token The
Token ID383
Feature activation+3.18e-04

Pos logit contributions
ho+0.020
 than+0.020
s+0.020
 unt+0.017
 far+0.017
Neg logit contributions
 earthquake-0.027
 emergency-0.027
 Everything-0.026
py-0.026
 ambulance-0.025
Token fox
Token ID21831
Feature activation-4.48e-02

Pos logit contributions
 Everything+0.000
 emergency+0.000
 earthquake+0.000
py+0.000
 ambulance+0.000
Neg logit contributions
 than-0.001
ho-0.001
s-0.001
bel-0.001
outing-0.001
Token was
Token ID373
Feature activation-1.04e-03

Pos logit contributions
ho+0.058
 than+0.058
s+0.051
bel+0.049
 unt+0.049
Neg logit contributions
 Everything-0.099
 earthquake-0.092
 emergency-0.092
py-0.090
 ambulance-0.089
Token so
Token ID523
Feature activation-1.15e-02

Pos logit contributions
ho+0.002
 than+0.001
s+0.001
bel+0.001
 unt+0.001
Neg logit contributions
 Everything-0.002
 earthquake-0.002
 emergency-0.002
py-0.002
 ambulance-0.002
Token surprised
Token ID6655
Feature activation-1.30e-02

Pos logit contributions
ho+0.017
 than+0.016
ater+0.015
outing+0.014
s+0.014
Neg logit contributions
 Everything-0.024
 emergency-0.021
 earthquake-0.021
py-0.021
 thunder-0.021
Token as
Token ID355
Feature activation-1.26e-01

Pos logit contributions
ho+0.018
bel+0.017
ater+0.017
chard+0.016
outing+0.016
Neg logit contributions
 Everything-0.026
 thunder-0.024
 emergency-0.024
 earthquake-0.024
 Beth-0.023
Token he
Token ID339
Feature activation-6.41e-03

Pos logit contributions
ho+0.174
 than+0.168
ater+0.160
outing+0.155
s+0.151
Neg logit contributions
 Everything-0.267
 earthquake-0.247
 thunder-0.244
 emergency-0.244
py-0.242
Token had
Token ID550
Feature activation-5.16e-03

Pos logit contributions
 than+0.009
ho+0.009
ater+0.008
bel+0.008
outing+0.008
Neg logit contributions
 Everything-0.013
py-0.012
 bears-0.012
 thunder-0.012
 emergency-0.012
Token never
Token ID1239
Feature activation+5.36e-03

Pos logit contributions
ho+0.008
 than+0.008
outing+0.007
s+0.007
bel+0.007
Neg logit contributions
 Everything-0.010
 emergency-0.009
py-0.009
 earthquake-0.009
 thunder-0.009
Token seen
Token ID1775
Feature activation-1.04e-02

Pos logit contributions
 Everything+0.011
 earthquake+0.010
 emergency+0.010
py+0.010
 thunder+0.010
Neg logit contributions
ho-0.008
 than-0.008
s-0.007
outing-0.007
ater-0.007
Token a
Token ID257
Feature activation-1.06e-02

Pos logit contributions
 than+0.015
ho+0.013
 far+0.013
 unt+0.013
s+0.012
Neg logit contributions
 Everything-0.021
loe-0.020
 earthquake-0.020
 hurricane-0.020
 emergency-0.020
Token for
Token ID329
Feature activation+1.31e-02

Pos logit contributions
ho+0.023
 than+0.022
s+0.020
bel+0.020
outing+0.019
Neg logit contributions
 Everything-0.034
 emergency-0.032
 earthquake-0.032
 ambulance-0.031
py-0.031
Token it
Token ID340
Feature activation-1.38e-04

Pos logit contributions
 Everything+0.029
 earthquake+0.026
py+0.026
 emergency+0.026
rell+0.026
Neg logit contributions
 than-0.018
ho-0.017
 unt-0.015
s-0.015
 far-0.015
Token!"
Token ID2474
Feature activation-1.16e-02

Pos logit contributions
ho+0.000
 than+0.000
s+0.000
bel+0.000
outing+0.000
Neg logit contributions
 Everything-0.000
 earthquake-0.000
 emergency-0.000
py-0.000
 ambulance-0.000
Token Jack
Token ID3619
Feature activation-3.94e-02

Pos logit contributions
ho+0.024
 than+0.024
bel+0.022
s+0.022
outing+0.022
Neg logit contributions
 Everything-0.017
 earthquake-0.015
 emergency-0.015
py-0.014
rell-0.014
Token smiled
Token ID13541
Feature activation-2.88e-02

Pos logit contributions
 than+0.060
ho+0.058
bel+0.052
outing+0.051
ater+0.051
Neg logit contributions
 Everything-0.078
 bears-0.069
loe-0.069
 ambulance-0.068
py-0.068
Token as
Token ID355
Feature activation-1.26e-01

Pos logit contributions
ho+0.044
ater+0.036
outing+0.036
bel+0.036
 than+0.036
Neg logit contributions
 Everything-0.059
 ambulance-0.054
 thunder-0.054
 emergency-0.054
 bears-0.053
Token he
Token ID339
Feature activation-4.26e-03

Pos logit contributions
ho+0.155
 than+0.151
s+0.134
ater+0.130
outing+0.129
Neg logit contributions
 Everything-0.289
 earthquake-0.274
 emergency-0.270
py-0.264
 thunder-0.262
Token handed
Token ID10158
Feature activation-1.75e-02

Pos logit contributions
 than+0.006
ho+0.006
outing+0.005
bel+0.005
ater+0.005
Neg logit contributions
 Everything-0.009
 bears-0.008
 emergency-0.008
 ambulance-0.008
 Wolf-0.008
Token money
Token ID1637
Feature activation-4.81e-02

Pos logit contributions
 than+0.020
ho+0.020
s+0.019
 unt+0.018
 far+0.018
Neg logit contributions
 Everything-0.040
 earthquake-0.039
 emergency-0.038
py-0.038
rell-0.038
Token to
Token ID284
Feature activation-2.14e-02

Pos logit contributions
ho+0.058
outing+0.052
bel+0.052
ater+0.049
 than+0.049
Neg logit contributions
 Everything-0.104
 bears-0.099
 ambulance-0.097
 emergency-0.096
rell-0.093
Token the
Token ID262
Feature activation-1.14e-03

Pos logit contributions
 than+0.027
ho+0.026
s+0.025
 unt+0.025
 far+0.024
Neg logit contributions
 Everything-0.046
 earthquake-0.044
 emergency-0.043
py-0.043
rell-0.043
Token noise
Token ID7838
Feature activation+1.36e-02

Pos logit contributions
 Everything+0.048
loe+0.045
py+0.042
 emergency+0.041
 nightmares+0.041
Neg logit contributions
 than-0.041
ho-0.040
s-0.037
 far-0.036
 unt-0.036
Token.
Token ID13
Feature activation+4.63e-03

Pos logit contributions
 Everything+0.033
 earthquake+0.032
 emergency+0.031
 thunder+0.031
py+0.031
Neg logit contributions
ho-0.014
 than-0.014
s-0.012
outing-0.012
bel-0.011
Token It
Token ID632
Feature activation+1.21e-02

Pos logit contributions
py+0.009
 emergency+0.009
 earthquake+0.009
mo+0.009
 ambulance+0.008
Neg logit contributions
 than-0.006
ho-0.006
s-0.006
 far-0.006
On-0.005
Token seemed
Token ID3947
Feature activation-2.91e-02

Pos logit contributions
 Everything+0.025
 earthquake+0.023
py+0.023
 emergency+0.023
 thunder+0.023
Neg logit contributions
ho-0.017
 than-0.017
ater-0.015
outing-0.015
s-0.015
Token happy
Token ID3772
Feature activation-1.40e-02

Pos logit contributions
ho+0.042
ater+0.038
outing+0.038
atching+0.037
ucker+0.036
Neg logit contributions
 thunder-0.060
 Everything-0.060
 lightning-0.057
 nightmares-0.056
 earthquake-0.055
Token as
Token ID355
Feature activation-1.25e-01

Pos logit contributions
ho+0.018
outing+0.016
bel+0.015
ater+0.015
 unt+0.014
Neg logit contributions
 Everything-0.031
 earthquake-0.030
 thunder-0.030
 bears-0.029
py-0.029
Token it
Token ID340
Feature activation+2.94e-03

Pos logit contributions
ho+0.190
 than+0.173
outing+0.167
ater+0.165
s+0.161
Neg logit contributions
 Everything-0.253
 earthquake-0.239
 thunder-0.237
 lightning-0.234
py-0.233
Token flew
Token ID13112
Feature activation-2.53e-02

Pos logit contributions
 Everything+0.006
py+0.005
rell+0.005
 bears+0.005
 thunder+0.005
Neg logit contributions
ho-0.004
 than-0.004
outing-0.004
ater-0.004
 cheaply-0.004
Token around
Token ID1088
Feature activation-7.33e-03

Pos logit contributions
ho+0.038
 than+0.036
outing+0.034
ater+0.034
bel+0.034
Neg logit contributions
 Everything-0.049
 earthquake-0.046
py-0.046
rell-0.045
 thunder-0.045
Token.
Token ID13
Feature activation+1.79e-03

Pos logit contributions
 than+0.008
ho+0.008
 far+0.007
s+0.007
 unt+0.007
Neg logit contributions
 Everything-0.018
 emergency-0.017
 earthquake-0.017
rell-0.016
loe-0.016
Token
Token ID198
Feature activation-3.00e-04

Pos logit contributions
 emergency+0.004
 earthquake+0.004
py+0.004
 ambulance+0.003
 Everything+0.003
Neg logit contributions
 than-0.002
ho-0.002
s-0.002
 far-0.002
 unt-0.002
Token smiled
Token ID13541
Feature activation-2.65e-02

Pos logit contributions
 Everything+0.026
 earthquake+0.025
 emergency+0.025
py+0.024
rell+0.024
Neg logit contributions
ho-0.013
 than-0.013
s-0.011
 unt-0.011
bel-0.011
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
ho+0.037
 than+0.033
ater+0.033
bel+0.032
outing+0.032
Neg logit contributions
 Everything-0.056
 thunder-0.051
 ambulance-0.050
 emergency-0.050
py-0.050
Token said
Token ID531
Feature activation+3.68e-03

Pos logit contributions
 than+0.027
ho+0.026
ater+0.025
outing+0.022
bel+0.022
Neg logit contributions
 Everything-0.055
 Beth-0.050
 thunder-0.049
 lightning-0.048
 ambulance-0.048
Token,
Token ID11
Feature activation-2.33e-02

Pos logit contributions
 Everything+0.008
 earthquake+0.007
 emergency+0.007
py+0.007
 ambulance+0.007
Neg logit contributions
 than-0.005
ho-0.005
s-0.005
ater-0.005
bel-0.005
Token "
Token ID366
Feature activation-5.88e-03

Pos logit contributions
 than+0.032
ho+0.032
s+0.028
ater+0.028
bel+0.028
Neg logit contributions
 Everything-0.050
 earthquake-0.046
 emergency-0.045
py-0.045
 ambulance-0.044
TokenWell
Token ID5779
Feature activation+8.55e-02

Pos logit contributions
ho+0.009
 than+0.009
s+0.008
ater+0.008
bel+0.008
Neg logit contributions
 Everything-0.012
py-0.010
 earthquake-0.010
loe-0.010
 thunder-0.010
Token,
Token ID11
Feature activation+8.54e-03

Pos logit contributions
 Everything+0.207
 helicopters+0.198
 cartoons+0.196
 emergency+0.195
 ambulance+0.193
Neg logit contributions
 than-0.081
ho-0.072
 far-0.070
 unt-0.068
s-0.065
Token next
Token ID1306
Feature activation+9.60e-03

Pos logit contributions
 Everything+0.019
 earthquake+0.018
 emergency+0.018
rell+0.018
py+0.018
Neg logit contributions
 than-0.010
ho-0.010
s-0.009
 unt-0.009
 far-0.009
Token time
Token ID640
Feature activation-6.58e-02

Pos logit contributions
 Everything+0.016
py+0.015
rell+0.014
 ambulance+0.014
 earthquake+0.014
Neg logit contributions
 than-0.018
ho-0.017
 far-0.016
 unt-0.016
s-0.016
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
 than+0.077
ho+0.075
s+0.070
 far+0.065
 unt+0.063
Neg logit contributions
 Everything-0.151
py-0.144
 earthquake-0.144
 emergency-0.144
rell-0.141
Token bring
Token ID2222
Feature activation-1.31e-02

Pos logit contributions
 Everything+0.034
 earthquake+0.032
 emergency+0.032
py+0.032
rell+0.032
Neg logit contributions
 than-0.017
ho-0.016
s-0.014
 unt-0.014
 far-0.014
TokenO
Token ID46
Feature activation+4.70e-02

Pos logit contributions
 than+0.041
ho+0.041
 unt+0.037
s+0.036
ater+0.036
Neg logit contributions
 Everything-0.053
 earthquake-0.050
 emergency-0.049
py-0.049
rell-0.048
Tokenw
Token ID86
Feature activation+1.17e-02

Pos logit contributions
 earthquake+0.077
 Everything+0.072
loe+0.070
 emergency+0.069
rell+0.068
Neg logit contributions
ho-0.081
chard-0.079
bel-0.078
 than-0.077
 unt-0.074
Token,
Token ID11
Feature activation+1.79e-02

Pos logit contributions
 Everything+0.030
 emergency+0.027
 hurricane+0.027
 earthquake+0.027
 nightmares+0.027
Neg logit contributions
 than-0.012
ho-0.012
s-0.012
ater-0.010
 unt-0.010
Token ow
Token ID12334
Feature activation+8.01e-02

Pos logit contributions
 Everything+0.021
rell+0.018
 earthquake+0.018
py+0.018
 emergency+0.018
Neg logit contributions
ho-0.042
 than-0.040
s-0.039
 unt-0.038
 far-0.037
Token,
Token ID11
Feature activation+2.26e-02

Pos logit contributions
 Everything+0.175
 earthquake+0.163
 emergency+0.162
py+0.160
 ambulance+0.158
Neg logit contributions
ho-0.107
 than-0.106
s-0.095
 unt-0.090
bel-0.090
Token ow
Token ID12334
Feature activation+8.47e-02

Pos logit contributions
 Everything+0.027
 earthquake+0.023
 emergency+0.023
py+0.023
rell+0.023
Neg logit contributions
ho-0.053
 than-0.052
s-0.049
 unt-0.048
 far-0.047
Token!
Token ID0
Feature activation+1.12e-02

Pos logit contributions
 Everything+0.190
 earthquake+0.176
 emergency+0.175
py+0.173
 ambulance+0.170
Neg logit contributions
ho-0.109
 than-0.109
s-0.097
 unt-0.092
bel-0.091
Token It
Token ID632
Feature activation+2.28e-02

Pos logit contributions
py+0.020
 emergency+0.019
 earthquake+0.019
rell+0.019
 Everything+0.019
Neg logit contributions
ho-0.018
s-0.017
 than-0.017
outing-0.016
ater-0.016
Token hurts
Token ID20406
Feature activation+1.34e-02

Pos logit contributions
 Everything+0.048
 earthquake+0.044
 emergency+0.044
loe+0.043
 ambulance+0.042
Neg logit contributions
ho-0.034
 than-0.032
s-0.032
 unt-0.030
 far-0.028
Token!"
Token ID2474
Feature activation-2.91e-03

Pos logit contributions
 Everything+0.032
 emergency+0.029
 earthquake+0.029
rell+0.029
py+0.029
Neg logit contributions
 than-0.017
ho-0.016
s-0.014
 far-0.014
 unt-0.013
Token Ben
Token ID3932
Feature activation-5.99e-02

Pos logit contributions
ho+0.004
 than+0.004
s+0.003
bel+0.003
ater+0.003
Neg logit contributions
 Everything-0.007
 earthquake-0.006
 emergency-0.006
py-0.006
 ambulance-0.006
Token!"
Token ID2474
Feature activation+8.31e-03

Pos logit contributions
 Everything+0.146
 emergency+0.136
 earthquake+0.134
 ambulance+0.133
py+0.133
Neg logit contributions
ho-0.070
 than-0.070
s-0.065
 unt-0.059
 far-0.057
Token shouted
Token ID17293
Feature activation-1.79e-03

Pos logit contributions
 Everything+0.017
 earthquake+0.015
 Beth+0.015
 thunder+0.015
 lightning+0.015
Neg logit contributions
ho-0.013
ater-0.012
 than-0.012
outing-0.011
bel-0.011
Token Sam
Token ID3409
Feature activation-4.28e-02

Pos logit contributions
ho+0.003
 than+0.003
s+0.003
 unt+0.003
ater+0.003
Neg logit contributions
 Everything-0.003
 emergency-0.003
 earthquake-0.003
py-0.003
rell-0.003
Token,
Token ID11
Feature activation-4.99e-03

Pos logit contributions
ho+0.061
 than+0.059
ater+0.057
bel+0.056
outing+0.056
Neg logit contributions
 Everything-0.082
py-0.077
 Beth-0.076
 ambulance-0.074
rell-0.074
Token "
Token ID366
Feature activation+8.98e-03

Pos logit contributions
 than+0.008
ho+0.008
ater+0.007
s+0.007
bel+0.007
Neg logit contributions
 Everything-0.010
 thunder-0.009
 earthquake-0.009
 emergency-0.009
 ambulance-0.009
Tokenwe
Token ID732
Feature activation+8.93e-02

Pos logit contributions
 Everything+0.015
 Boom+0.012
 lightning+0.012
 earthquake+0.012
py+0.012
Neg logit contributions
 than-0.016
ho-0.016
ucker-0.015
outing-0.015
s-0.015
Token can
Token ID460
Feature activation+1.90e-02

Pos logit contributions
 Everything+0.192
 earthquake+0.183
 emergency+0.181
py+0.178
 ambulance+0.176
Neg logit contributions
ho-0.120
 than-0.120
 unt-0.108
s-0.106
ater-0.105
Token buy
Token ID2822
Feature activation-1.32e-03

Pos logit contributions
 Everything+0.030
 earthquake+0.027
 emergency+0.027
 operation+0.026
py+0.026
Neg logit contributions
 than-0.036
ho-0.035
outing-0.034
bel-0.033
ber-0.032
Token something
Token ID1223
Feature activation-3.45e-02

Pos logit contributions
ho+0.002
 than+0.002
bel+0.002
outing+0.002
ater+0.002
Neg logit contributions
 Everything-0.003
 emergency-0.002
 earthquake-0.002
 bears-0.002
 ambulance-0.002
Token with
Token ID351
Feature activation-7.82e-03

Pos logit contributions
ho+0.052
outing+0.045
bel+0.045
ater+0.045
ighed+0.043
Neg logit contributions
 Everything-0.067
 emergency-0.065
 earthquake-0.062
 ambulance-0.062
 operation-0.061
Token this
Token ID428
Feature activation+9.87e-03

Pos logit contributions
ho+0.013
bel+0.012
 than+0.012
outing+0.011
s+0.011
Neg logit contributions
 Everything-0.015
 emergency-0.014
 earthquake-0.014
 bears-0.013
py-0.013
Token he
Token ID339
Feature activation-1.84e-02

Pos logit contributions
 than+0.011
ho+0.010
s+0.009
 far+0.009
 unt+0.009
Neg logit contributions
 Everything-0.023
py-0.022
 emergency-0.021
 ambulance-0.021
 earthquake-0.021
Token never
Token ID1239
Feature activation+6.17e-03

Pos logit contributions
ho+0.025
 than+0.025
s+0.022
 unt+0.021
bel+0.021
Neg logit contributions
 Everything-0.040
 earthquake-0.037
 emergency-0.037
py-0.036
 ambulance-0.036
Token squeezed
Token ID29650
Feature activation+2.29e-03

Pos logit contributions
 Everything+0.013
 earthquake+0.012
 emergency+0.012
 thunder+0.012
py+0.012
Neg logit contributions
 than-0.008
ho-0.008
ater-0.007
bel-0.007
s-0.007
Token an
Token ID281
Feature activation+4.48e-03

Pos logit contributions
 Everything+0.005
loe+0.005
 earthquake+0.005
rell+0.005
 hurricane+0.005
Neg logit contributions
 than-0.003
 far-0.003
ho-0.003
s-0.003
 unt-0.003
Token ash
Token ID12530
Feature activation+5.38e-02

Pos logit contributions
 Everything+0.008
loe+0.007
 earthquake+0.007
 emergency+0.007
rell+0.007
Neg logit contributions
 than-0.008
ho-0.007
ater-0.007
 unt-0.007
s-0.006
Tokentr
Token ID2213
Feature activation+1.39e-01

Pos logit contributions
 Everything+0.100
loe+0.089
 earthquake+0.088
 emergency+0.088
py+0.087
Neg logit contributions
ho-0.096
 than-0.093
s-0.083
bel-0.082
 unt-0.082
Tokenay
Token ID323
Feature activation+7.61e-02

Pos logit contributions
 Everything+0.182
 earthquake+0.163
 emergency+0.159
py+0.157
 bears+0.155
Neg logit contributions
ho-0.312
 than-0.307
s-0.303
bel-0.286
outing-0.284
Token or
Token ID393
Feature activation+7.50e-03

Pos logit contributions
 Everything+0.172
 earthquake+0.160
 emergency+0.158
rell+0.155
py+0.155
Neg logit contributions
ho-0.099
 than-0.098
s-0.090
 unt-0.086
 far-0.083
Token anything
Token ID1997
Feature activation+6.74e-03

Pos logit contributions
 Everything+0.016
 earthquake+0.015
 emergency+0.015
py+0.014
 ambulance+0.014
Neg logit contributions
ho-0.010
 than-0.010
s-0.009
bel-0.009
 unt-0.009
Token else
Token ID2073
Feature activation-7.24e-03

Pos logit contributions
 Everything+0.016
 earthquake+0.015
 emergency+0.015
 thunder+0.015
 nightmares+0.015
Neg logit contributions
ho-0.007
 than-0.006
outing-0.006
ater-0.006
bel-0.006
Token again
Token ID757
Feature activation+4.97e-02

Pos logit contributions
ho+0.008
 than+0.008
s+0.007
outing+0.007
ater+0.007
Neg logit contributions
 Everything-0.017
 earthquake-0.016
 emergency-0.016
py-0.016
 thunder-0.016
TokenO
Token ID46
Feature activation+5.41e-02

Pos logit contributions
 than+0.046
ho+0.046
s+0.042
outing+0.041
bel+0.041
Neg logit contributions
 Everything-0.038
 earthquake-0.034
 emergency-0.034
 thunder-0.034
 ambulance-0.033
Tokenw
Token ID86
Feature activation+1.86e-02

Pos logit contributions
 earthquake+0.085
opl+0.079
 emergency+0.077
rell+0.076
igh+0.076
Neg logit contributions
chard-0.092
bel-0.091
ho-0.083
 than-0.083
 unw-0.081
Token,
Token ID11
Feature activation+2.32e-02

Pos logit contributions
 Everything+0.045
 emergency+0.041
 earthquake+0.041
 hurricane+0.040
 nightmares+0.040
Neg logit contributions
 than-0.021
ho-0.021
s-0.020
ater-0.018
 unt-0.018
Token ow
Token ID12334
Feature activation+8.66e-02

Pos logit contributions
 Everything+0.025
 earthquake+0.022
 emergency+0.021
py+0.020
 ambulance+0.020
Neg logit contributions
 than-0.057
ho-0.057
s-0.053
bel-0.053
outing-0.052
Token,
Token ID11
Feature activation+2.95e-02

Pos logit contributions
 earthquake+0.164
 Everything+0.163
 emergency+0.159
py+0.155
 ambulance+0.154
Neg logit contributions
ho-0.132
 than-0.128
bel-0.123
 unt-0.115
ater-0.114
Token ow
Token ID12334
Feature activation+9.07e-02

Pos logit contributions
 Everything+0.031
 earthquake+0.027
 emergency+0.027
py+0.025
 ambulance+0.025
Neg logit contributions
 than-0.073
ho-0.072
s-0.068
bel-0.067
outing-0.067
Token!"
Token ID2474
Feature activation-2.67e-03

Pos logit contributions
 earthquake+0.181
 Everything+0.178
 emergency+0.175
 thunder+0.172
 lightning+0.171
Neg logit contributions
ho-0.129
 than-0.124
bel-0.121
outing-0.110
 unt-0.110
Token Ben
Token ID3932
Feature activation-5.62e-02

Pos logit contributions
ho+0.004
 than+0.004
ater+0.003
bel+0.003
outing+0.003
Neg logit contributions
 Everything-0.006
 earthquake-0.005
 thunder-0.005
 emergency-0.005
 ambulance-0.005
Token cried
Token ID16896
Feature activation+1.03e-02

Pos logit contributions
ho+0.091
 than+0.088
bel+0.081
ater+0.080
 cheaply+0.080
Neg logit contributions
 Everything-0.099
 bears-0.096
 ambulance-0.094
 thunder-0.092
py-0.092
Token.
Token ID13
Feature activation-5.38e-03

Pos logit contributions
 Everything+0.029
 earthquake+0.028
loe+0.028
 emergency+0.027
rell+0.027
Neg logit contributions
 than-0.007
ho-0.007
s-0.006
 unt-0.005
 far-0.005
Token He
Token ID679
Feature activation-1.14e-02

Pos logit contributions
ho+0.007
 than+0.007
s+0.006
bel+0.006
 unt+0.006
Neg logit contributions
 Everything-0.012
 earthquake-0.011
 emergency-0.011
py-0.011
 ambulance-0.011
Token down
Token ID866
Feature activation+1.77e-02

Pos logit contributions
ho+0.006
 than+0.006
s+0.006
bel+0.005
 unt+0.005
Neg logit contributions
 Everything-0.007
 earthquake-0.007
 emergency-0.007
py-0.007
loe-0.007
Token.
Token ID13
Feature activation+9.98e-03

Pos logit contributions
 Everything+0.038
 earthquake+0.035
 emergency+0.035
 thunder+0.034
py+0.034
Neg logit contributions
ho-0.025
 than-0.023
outing-0.022
ater-0.021
 unt-0.021
Token They
Token ID1119
Feature activation+8.96e-03

Pos logit contributions
py+0.020
 ambulance+0.019
 emergency+0.019
 nightmares+0.019
 operation+0.019
Neg logit contributions
ho-0.014
 than-0.014
s-0.012
 far-0.012
 carefully-0.011
Token saw
Token ID2497
Feature activation+7.41e-03

Pos logit contributions
 Everything+0.017
py+0.015
 earthquake+0.015
 emergency+0.015
 bears+0.015
Neg logit contributions
 than-0.015
ho-0.014
outing-0.014
ater-0.013
bel-0.013
Token big
Token ID1263
Feature activation+8.66e-03

Pos logit contributions
loe+0.019
 Everything+0.019
 earthquake+0.018
rell+0.018
py+0.018
Neg logit contributions
ho-0.007
 than-0.007
 far-0.007
s-0.006
bel-0.005
Token trees
Token ID7150
Feature activation+2.31e-02

Pos logit contributions
loe+0.015
 Everything+0.014
py+0.013
 operation+0.012
 Everybody+0.012
Neg logit contributions
 than-0.018
ho-0.016
 far-0.016
s-0.015
 high-0.014
Token,
Token ID11
Feature activation+8.26e-03

Pos logit contributions
 Everything+0.044
 earthquake+0.041
rell+0.041
 thunder+0.040
 lightning+0.040
Neg logit contributions
ho-0.034
ater-0.034
bel-0.033
 than-0.033
outing-0.032
Token blue
Token ID4171
Feature activation+2.47e-02

Pos logit contributions
 Everything+0.013
 emergency+0.011
 earthquake+0.011
loe+0.011
py+0.011
Neg logit contributions
 than-0.017
ho-0.017
s-0.015
 far-0.015
bel-0.015
Token rivers
Token ID18180
Feature activation-1.05e-02

Pos logit contributions
 Everything+0.039
 emergency+0.036
loe+0.035
 earthquake+0.035
 operation+0.034
Neg logit contributions
 than-0.049
ho-0.049
s-0.046
 far-0.043
ater-0.042
Token,
Token ID11
Feature activation+7.14e-03

Pos logit contributions
ho+0.016
bel+0.015
ater+0.015
outing+0.015
 than+0.015
Neg logit contributions
 Everything-0.020
rell-0.019
 earthquake-0.018
 thunder-0.018
py-0.018
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
 Everything+0.017
 earthquake+0.016
 emergency+0.015
py+0.015
 thunder+0.015
Neg logit contributions
ho-0.008
 than-0.008
s-0.007
outing-0.007
bel-0.007
Token fell
Token ID3214
Feature activation+5.69e-03

Pos logit contributions
 than+0.005
ho+0.005
ater+0.005
bel+0.004
eed+0.004
Neg logit contributions
 Everything-0.005
 thunder-0.005
 lightning-0.004
 bears-0.004
 ambulance-0.004
Token to
Token ID284
Feature activation-1.71e-03

Pos logit contributions
 Everything+0.011
 earthquake+0.011
py+0.011
 emergency+0.011
rell+0.011
Neg logit contributions
ho-0.008
 than-0.008
ater-0.007
bel-0.007
outing-0.007
Token the
Token ID262
Feature activation+2.11e-02

Pos logit contributions
 than+0.003
ho+0.003
s+0.002
bel+0.002
outing+0.002
Neg logit contributions
 Everything-0.003
 earthquake-0.003
 emergency-0.003
 thunder-0.003
py-0.003
Token ground
Token ID2323
Feature activation+4.78e-03

Pos logit contributions
 Everything+0.033
 earthquake+0.030
 emergency+0.030
py+0.029
 ambulance+0.029
Neg logit contributions
ho-0.042
 than-0.041
s-0.038
bel-0.037
outing-0.037
Token.
Token ID13
Feature activation-2.81e-03

Pos logit contributions
 thunder+0.008
 earthquake+0.008
 lightning+0.008
 ambulance+0.008
 emergency+0.008
Neg logit contributions
eed-0.007
ater-0.007
bel-0.007
ucker-0.007
ho-0.007
Token She
Token ID1375
Feature activation+2.41e-02

Pos logit contributions
 than+0.004
ho+0.004
ater+0.004
s+0.004
bel+0.004
Neg logit contributions
 Everything-0.006
 earthquake-0.005
 emergency-0.005
py-0.005
 thunder-0.005
Token felt
Token ID2936
Feature activation+6.96e-03

Pos logit contributions
 Everything+0.047
 earthquake+0.043
 emergency+0.043
py+0.043
 bears+0.043
Neg logit contributions
ho-0.036
 than-0.036
bel-0.032
outing-0.032
ater-0.032
Token a
Token ID257
Feature activation-1.49e-02

Pos logit contributions
 Everything+0.014
 earthquake+0.013
 emergency+0.013
py+0.012
 ambulance+0.012
Neg logit contributions
ho-0.011
 than-0.011
s-0.010
bel-0.009
 unt-0.009
Token sharp
Token ID7786
Feature activation+2.73e-02

Pos logit contributions
ho+0.029
 than+0.029
bel+0.027
bec+0.027
outing+0.027
Neg logit contributions
 Everything-0.022
 earthquake-0.021
 emergency-0.021
 thunder-0.021
 lightning-0.020
Token pain
Token ID2356
Feature activation+1.66e-02

Pos logit contributions
 Everything+0.024
 earthquake+0.023
 emergency+0.022
py+0.021
 ambulance+0.021
Neg logit contributions
ho-0.070
 than-0.067
s-0.066
bel-0.066
outing-0.065
Token and
Token ID290
Feature activation+5.13e-03

Pos logit contributions
 earthquake+0.030
 Everything+0.030
py+0.030
 emergency+0.030
 thunder+0.030
Neg logit contributions
ho-0.024
 than-0.023
bel-0.023
s-0.022
outing-0.022
Token in
Token ID287
Feature activation+2.24e-03

Pos logit contributions
 Everything+0.031
 earthquake+0.029
 emergency+0.029
py+0.029
 ambulance+0.028
Neg logit contributions
ho-0.019
 than-0.019
s-0.017
 unt-0.016
bel-0.016
Token circles
Token ID13332
Feature activation+1.53e-04

Pos logit contributions
 Everything+0.005
py+0.005
 earthquake+0.005
rell+0.005
 emergency+0.005
Neg logit contributions
 than-0.003
 far-0.002
ho-0.002
 unt-0.002
s-0.002
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
 Everything+0.000
 earthquake+0.000
 emergency+0.000
 ambulance+0.000
py+0.000
Neg logit contributions
ho-0.000
 than-0.000
outing-0.000
s-0.000
ater-0.000
Token She
Token ID1375
Feature activation+3.15e-02

Pos logit contributions
 Everything+0.019
 emergency+0.019
 earthquake+0.018
py+0.018
 ambulance+0.018
Neg logit contributions
ho-0.016
 than-0.016
s-0.015
bel-0.014
 far-0.014
Token was
Token ID373
Feature activation+2.52e-02

Pos logit contributions
 Everything+0.064
 emergency+0.058
 earthquake+0.058
 ambulance+0.057
py+0.057
Neg logit contributions
ho-0.047
 than-0.045
outing-0.041
ater-0.040
s-0.039
Token having
Token ID1719
Feature activation+3.62e-02

Pos logit contributions
 Everything+0.052
 earthquake+0.049
 emergency+0.049
py+0.048
loe+0.047
Neg logit contributions
 than-0.037
ho-0.035
 unt-0.032
 far-0.032
s-0.032
Token so
Token ID523
Feature activation+5.92e-03

Pos logit contributions
py+0.089
 earthquake+0.089
 emergency+0.087
mo+0.086
 Everything+0.085
Neg logit contributions
 than-0.039
 far-0.037
 unt-0.035
s-0.030
On-0.027
Token much
Token ID881
Feature activation+1.03e-02

Pos logit contributions
 Everything+0.013
py+0.012
 earthquake+0.012
 emergency+0.012
 ambulance+0.012
Neg logit contributions
 than-0.008
ho-0.008
s-0.008
 far-0.007
 unt-0.007
Token fun
Token ID1257
Feature activation+8.82e-03

Pos logit contributions
 Everything+0.025
py+0.024
loe+0.023
 emergency+0.023
 ambulance+0.023
Neg logit contributions
 than-0.013
s-0.010
 far-0.010
 unt-0.010
ho-0.009
Token!
Token ID0
Feature activation-3.84e-03

Pos logit contributions
 Everything+0.023
 earthquake+0.021
py+0.021
 emergency+0.021
loe+0.021
Neg logit contributions
 than-0.009
ho-0.008
s-0.007
 unt-0.007
bel-0.007
Token Then
Token ID3244
Feature activation+1.27e-02

Pos logit contributions
ho+0.007
 than+0.007
s+0.006
 unt+0.006
bel+0.006
Neg logit contributions
 Everything-0.007
 emergency-0.006
 earthquake-0.006
py-0.006
 ambulance-0.006
Token Her
Token ID2332
Feature activation+8.93e-03

Pos logit contributions
 than+0.009
ho+0.009
ater+0.008
outing+0.008
bel+0.008
Neg logit contributions
 Everything-0.010
 earthquake-0.009
 emergency-0.009
 ambulance-0.009
py-0.009
Token friend
Token ID1545
Feature activation-1.04e-03

Pos logit contributions
 Everything+0.020
 earthquake+0.018
 emergency+0.018
loe+0.018
py+0.018
Neg logit contributions
ho-0.012
 than-0.012
s-0.011
 unt-0.011
bel-0.010
Token said
Token ID531
Feature activation+7.08e-03

Pos logit contributions
ho+0.001
 than+0.001
s+0.001
 unt+0.001
bel+0.001
Neg logit contributions
 Everything-0.003
 earthquake-0.003
 emergency-0.002
py-0.002
 ambulance-0.002
Token yes
Token ID3763
Feature activation-1.69e-02

Pos logit contributions
 Everything+0.015
 earthquake+0.013
 emergency+0.013
 thunder+0.013
py+0.013
Neg logit contributions
 than-0.010
ho-0.010
ater-0.009
bel-0.009
s-0.009
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
bel+0.027
ho+0.026
chard+0.025
outing+0.024
 than+0.024
Neg logit contributions
 Everything-0.031
 ambulance-0.030
 emergency-0.030
 earthquake-0.029
 operation-0.029
Token gave
Token ID2921
Feature activation+8.83e-03

Pos logit contributions
ho+0.018
 than+0.018
s+0.016
ater+0.015
bel+0.015
Neg logit contributions
 Everything-0.038
 earthquake-0.036
 emergency-0.036
py-0.035
 ambulance-0.035
Token her
Token ID607
Feature activation-1.20e-02

Pos logit contributions
 Everything+0.020
 earthquake+0.020
 emergency+0.019
py+0.019
 hurricane+0.019
Neg logit contributions
ho-0.010
 than-0.010
s-0.010
 unt-0.009
 far-0.009
Token the
Token ID262
Feature activation-8.52e-03

Pos logit contributions
ho+0.017
 than+0.017
s+0.015
outing+0.014
ater+0.014
Neg logit contributions
 Everything-0.026
 earthquake-0.024
 emergency-0.024
py-0.023
 ambulance-0.023
Token violin
Token ID38283
Feature activation+4.10e-03

Pos logit contributions
ho+0.013
 than+0.013
s+0.012
 unt+0.012
bel+0.011
Neg logit contributions
 Everything-0.017
 earthquake-0.016
 emergency-0.015
py-0.015
 ambulance-0.015
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
 Everything+0.010
 earthquake+0.009
 ambulance+0.009
rell+0.009
 emergency+0.009
Neg logit contributions
ho-0.005
 than-0.004
outing-0.004
ater-0.004
bel-0.004
Token
Token ID198
Feature activation-8.90e-03

Pos logit contributions
ho+0.020
 than+0.020
ater+0.018
outing+0.018
bel+0.018
Neg logit contributions
 Everything-0.023
rell-0.019
 earthquake-0.019
 emergency-0.019
loe-0.019
Token a
Token ID257
Feature activation-2.31e-02

Pos logit contributions
 Everything+0.043
 earthquake+0.040
 emergency+0.040
py+0.040
 ambulance+0.039
Neg logit contributions
 than-0.020
ho-0.019
 far-0.017
s-0.017
 unt-0.017
Token girl
Token ID2576
Feature activation+3.62e-02

Pos logit contributions
ho+0.032
 than+0.032
s+0.028
bel+0.027
outing+0.027
Neg logit contributions
 Everything-0.049
 earthquake-0.046
 emergency-0.046
py-0.045
 ambulance-0.045
Token named
Token ID3706
Feature activation+1.32e-02

Pos logit contributions
 Everything+0.044
 emergency+0.042
 earthquake+0.042
py+0.041
 ambulance+0.040
Neg logit contributions
ho-0.078
 than-0.076
outing-0.074
bel-0.073
eed-0.073
Token Lily
Token ID20037
Feature activation+8.39e-04

Pos logit contributions
 Everything+0.024
 emergency+0.022
 earthquake+0.022
py+0.021
 ambulance+0.021
Neg logit contributions
ho-0.023
 than-0.022
s-0.021
 unt-0.020
bel-0.020
Token.
Token ID13
Feature activation+7.79e-03

Pos logit contributions
py+0.001
 Everything+0.001
 lightning+0.001
 earthquake+0.001
 thunder+0.001
Neg logit contributions
ho-0.001
outing-0.001
bel-0.001
eed-0.001
ucker-0.001
Token She
Token ID1375
Feature activation+1.84e-02

Pos logit contributions
 Everything+0.015
 emergency+0.015
 earthquake+0.014
 ambulance+0.014
py+0.014
Neg logit contributions
ho-0.012
 than-0.012
s-0.011
 far-0.010
 unt-0.010
Token loved
Token ID6151
Feature activation-1.79e-02

Pos logit contributions
 Everything+0.037
 earthquake+0.034
 emergency+0.034
py+0.034
 thunder+0.033
Neg logit contributions
ho-0.028
 than-0.027
outing-0.025
s-0.024
ater-0.024
Token taking
Token ID2263
Feature activation+2.82e-02

Pos logit contributions
ho+0.012
 than+0.011
s+0.009
 unt+0.008
bel+0.008
Neg logit contributions
 Everything-0.052
 earthquake-0.049
 emergency-0.048
loe-0.048
py-0.048
Token pictures
Token ID5986
Feature activation+1.76e-02

Pos logit contributions
 Everything+0.052
loe+0.051
py+0.051
 earthquake+0.051
 hurricane+0.051
Neg logit contributions
 than-0.045
s-0.042
 unt-0.041
 far-0.041
ater-0.038
Token with
Token ID351
Feature activation-6.11e-03

Pos logit contributions
 Everything+0.044
 earthquake+0.041
 emergency+0.041
py+0.040
 ambulance+0.040
Neg logit contributions
ho-0.018
 than-0.017
s-0.015
outing-0.014
ater-0.014
Token her
Token ID607
Feature activation+7.38e-03

Pos logit contributions
ho+0.008
 than+0.008
s+0.007
 unt+0.007
bel+0.007
Neg logit contributions
 Everything-0.014
 earthquake-0.013
 emergency-0.012
py-0.012
 ambulance-0.012
Token "
Token ID366
Feature activation-9.47e-03

Pos logit contributions
 Everything+0.010
 earthquake+0.010
 emergency+0.010
py+0.010
rell+0.010
Neg logit contributions
ho-0.005
 than-0.004
s-0.004
bel-0.004
 far-0.004
TokenFree
Token ID11146
Feature activation+3.69e-02

Pos logit contributions
ho+0.014
 than+0.014
s+0.012
 unt+0.012
outing+0.012
Neg logit contributions
 Everything-0.020
 earthquake-0.018
 emergency-0.018
py-0.018
 ambulance-0.017
Token"
Token ID1
Feature activation-3.93e-02

Pos logit contributions
 Everything+0.068
 earthquake+0.065
 hurricane+0.064
py+0.063
 emergency+0.063
Neg logit contributions
 than-0.059
ho-0.056
ater-0.055
 unt-0.053
eed-0.051
Token on
Token ID319
Feature activation+1.58e-02

Pos logit contributions
ho+0.047
 than+0.045
s+0.040
outing+0.039
ater+0.038
Neg logit contributions
 Everything-0.091
 earthquake-0.085
 emergency-0.084
py-0.083
 ambulance-0.083
Token it
Token ID340
Feature activation+2.23e-03

Pos logit contributions
 Everything+0.034
 earthquake+0.032
 emergency+0.032
py+0.031
 ambulance+0.031
Neg logit contributions
 than-0.021
ho-0.021
s-0.019
 unt-0.018
bel-0.018
Token.
Token ID13
Feature activation+2.78e-03

Pos logit contributions
 Everything+0.006
 earthquake+0.005
 emergency+0.005
py+0.005
 ambulance+0.005
Neg logit contributions
ho-0.002
 than-0.002
s-0.002
bel-0.002
outing-0.002
Token
Token ID198
Feature activation+9.23e-04

Pos logit contributions
 Everything+0.006
 earthquake+0.005
 emergency+0.005
py+0.005
 ambulance+0.005
Neg logit contributions
ho-0.004
 than-0.004
s-0.004
bel-0.004
 unt-0.004
Token
Token ID198
Feature activation+4.03e-03

Pos logit contributions
 Everything+0.002
loe+0.002
 emergency+0.002
py+0.002
 earthquake+0.002
Neg logit contributions
ho-0.001
 than-0.001
ater-0.001
bel-0.001
outing-0.001
TokenShe
Token ID3347
Feature activation+1.81e-02

Pos logit contributions
 Everything+0.008
loe+0.007
 Everybody+0.007
 earthquake+0.007
 thunder+0.007
Neg logit contributions
ho-0.007
 than-0.006
ucker-0.006
fish-0.006
outing-0.006
Token hoped
Token ID10719
Feature activation+2.22e-02

Pos logit contributions
 Everything+0.040
 emergency+0.037
 earthquake+0.036
py+0.036
 ambulance+0.036
Neg logit contributions
ho-0.023
 than-0.023
s-0.020
outing-0.020
ater-0.020
Token that
Token ID326
Feature activation+2.87e-03

Pos logit contributions
 Everything+0.056
 earthquake+0.053
rell+0.053
py+0.053
 emergency+0.053
Neg logit contributions
 than-0.021
ho-0.021
s-0.019
 far-0.018
 unt-0.017
Token
Token ID198
Feature activation-9.45e-03

Pos logit contributions
 than+0.010
ho+0.010
s+0.009
 unt+0.009
bel+0.008
Neg logit contributions
 Everything-0.017
 earthquake-0.017
 emergency-0.017
py-0.016
 ambulance-0.016
Token
Token ID198
Feature activation-6.51e-03

Pos logit contributions
ho+0.017
 than+0.017
ater+0.015
outing+0.015
bel+0.015
Neg logit contributions
 Everything-0.017
loe-0.015
 Everybody-0.015
py-0.014
 thunder-0.014
TokenL
Token ID43
Feature activation-9.80e-03

Pos logit contributions
ho+0.012
 than+0.010
bel+0.010
bec+0.010
outing+0.009
Neg logit contributions
 Everything-0.011
 Everybody-0.010
 ambulance-0.010
 meantime-0.009
loe-0.009
Tokenily
Token ID813
Feature activation-1.52e-02

Pos logit contributions
ho+0.020
 than+0.019
bel+0.018
s+0.018
outing+0.018
Neg logit contributions
 Everything-0.015
 earthquake-0.013
py-0.013
 emergency-0.013
 ambulance-0.013
Token and
Token ID290
Feature activation-4.76e-03

Pos logit contributions
ho+0.016
 than+0.016
bel+0.014
outing+0.013
s+0.013
Neg logit contributions
 Everything-0.037
 earthquake-0.034
py-0.034
 emergency-0.034
 ambulance-0.034
Token Ben
Token ID3932
Feature activation-6.30e-02

Pos logit contributions
 than+0.007
ho+0.007
bel+0.007
outing+0.006
ater+0.006
Neg logit contributions
 Everything-0.010
 earthquake-0.009
 thunder-0.009
 bears-0.009
 ambulance-0.009
Token are
Token ID389
Feature activation+7.97e-03

Pos logit contributions
ho+0.104
 than+0.104
s+0.094
outing+0.094
bel+0.094
Neg logit contributions
 Everything-0.116
 earthquake-0.106
 emergency-0.104
py-0.104
 ambulance-0.102
Token happy
Token ID3772
Feature activation-4.58e-03

Pos logit contributions
 Everything+0.019
 earthquake+0.018
 emergency+0.018
py+0.017
 ambulance+0.017
Neg logit contributions
 than-0.009
ho-0.009
s-0.008
 unt-0.008
 far-0.008
Token.
Token ID13
Feature activation-1.75e-03

Pos logit contributions
ho+0.006
 than+0.005
ater+0.005
bel+0.005
outing+0.005
Neg logit contributions
 Everything-0.010
 thunder-0.010
 bears-0.009
 earthquake-0.009
py-0.009
Token They
Token ID1119
Feature activation-1.08e-02

Pos logit contributions
ho+0.003
 than+0.003
bel+0.003
outing+0.003
ater+0.003
Neg logit contributions
 Everything-0.003
 earthquake-0.003
 emergency-0.003
py-0.003
 thunder-0.003
Token bow
Token ID9563
Feature activation-3.33e-02

Pos logit contributions
 than+0.017
ho+0.017
outing+0.016
bel+0.016
s+0.016
Neg logit contributions
 Everything-0.020
py-0.018
 earthquake-0.018
rell-0.018
 bears-0.018
Token have
Token ID423
Feature activation-4.21e-03

Pos logit contributions
ho+0.043
 than+0.043
 unt+0.040
s+0.038
 far+0.038
Neg logit contributions
 Everything-0.065
 nightmares-0.059
 emergency-0.059
 earthquake-0.059
rell-0.058
Token help
Token ID1037
Feature activation+1.27e-02

Pos logit contributions
 than+0.006
ho+0.005
s+0.005
 unt+0.005
 far+0.005
Neg logit contributions
 Everything-0.009
 earthquake-0.009
 emergency-0.009
loe-0.009
 hurricane-0.009
Token from
Token ID422
Feature activation+3.92e-02

Pos logit contributions
 Everything+0.026
 earthquake+0.024
 emergency+0.024
loe+0.024
py+0.024
Neg logit contributions
 than-0.019
ho-0.018
s-0.017
 unt-0.016
 far-0.016
Token his
Token ID465
Feature activation-7.29e-03

Pos logit contributions
 Everything+0.090
loe+0.084
rell+0.083
 Everybody+0.079
 ambulance+0.078
Neg logit contributions
ho-0.051
 than-0.048
 far-0.047
 unt-0.044
s-0.040
Token friend
Token ID1545
Feature activation+1.82e-03

Pos logit contributions
ho+0.011
 than+0.011
s+0.010
 unt+0.010
 far+0.010
Neg logit contributions
 Everything-0.015
 earthquake-0.013
loe-0.013
rell-0.013
py-0.013
Token.
Token ID13
Feature activation-9.64e-03

Pos logit contributions
 Everything+0.005
 earthquake+0.004
loe+0.004
 emergency+0.004
 hurricane+0.004
Neg logit contributions
 than-0.002
ho-0.002
s-0.002
 unt-0.002
 far-0.002
Token Together
Token ID17083
Feature activation-1.41e-02

Pos logit contributions
ho+0.016
 than+0.015
ater+0.014
bel+0.014
outing+0.014
Neg logit contributions
 Everything-0.019
 earthquake-0.017
rell-0.016
py-0.016
 emergency-0.016
Token,
Token ID11
Feature activation-8.81e-03

Pos logit contributions
ho+0.018
 than+0.018
s+0.016
 unt+0.015
bel+0.015
Neg logit contributions
 Everything-0.032
 earthquake-0.030
 emergency-0.030
py-0.029
 ambulance-0.029
Token they
Token ID484
Feature activation-8.83e-03

Pos logit contributions
ho+0.007
 than+0.007
s+0.006
ater+0.006
bel+0.006
Neg logit contributions
 Everything-0.024
 earthquake-0.022
 emergency-0.022
py-0.022
 ambulance-0.022
Token went
Token ID1816
Feature activation+1.87e-02

Pos logit contributions
 than+0.012
ho+0.012
s+0.010
outing+0.010
bel+0.010
Neg logit contributions
 Everything-0.019
 earthquake-0.018
 emergency-0.017
py-0.017
 ambulance-0.017
Token down
Token ID866
Feature activation+1.44e-02

Pos logit contributions
 Everything+0.043
 earthquake+0.041
 emergency+0.040
py+0.039
loe+0.039
Neg logit contributions
 than-0.023
ho-0.023
s-0.021
 far-0.020
 unt-0.020
Token time
Token ID640
Feature activation-6.66e-02

Pos logit contributions
ho+0.043
 than+0.043
s+0.039
bel+0.037
outing+0.037
Neg logit contributions
 Everything-0.054
 earthquake-0.050
 emergency-0.050
py-0.049
 ambulance-0.048
Token.
Token ID13
Feature activation+8.46e-04

Pos logit contributions
ho+0.077
 than+0.068
outing+0.062
bel+0.061
s+0.061
Neg logit contributions
 Everything-0.156
 earthquake-0.150
 emergency-0.148
 ambulance-0.144
 thunder-0.143
Token He
Token ID679
Feature activation+1.27e-03

Pos logit contributions
py+0.002
 earthquake+0.002
 emergency+0.002
 ambulance+0.002
rell+0.002
Neg logit contributions
s-0.001
 than-0.001
ho-0.001
 far-0.001
 unt-0.001
Token had
Token ID550
Feature activation+1.38e-02

Pos logit contributions
 Everything+0.003
 earthquake+0.003
 emergency+0.003
rell+0.003
py+0.003
Neg logit contributions
ho-0.001
 than-0.001
s-0.001
 unt-0.001
 far-0.001
Token to
Token ID284
Feature activation+2.51e-03

Pos logit contributions
 Everything+0.032
 earthquake+0.030
py+0.030
 emergency+0.030
loe+0.029
Neg logit contributions
 than-0.016
ho-0.016
 unt-0.014
s-0.014
 far-0.014
Token delay
Token ID5711
Feature activation-2.34e-02

Pos logit contributions
 Everything+0.006
py+0.005
rell+0.005
 earthquake+0.005
 emergency+0.005
Neg logit contributions
 than-0.003
ho-0.003
 unt-0.003
 far-0.003
ater-0.003
Token some
Token ID617
Feature activation-8.03e-03

Pos logit contributions
 than+0.034
ho+0.033
s+0.029
 unt+0.029
bel+0.028
Neg logit contributions
 Everything-0.049
 earthquake-0.045
py-0.045
 emergency-0.045
 ambulance-0.044
Token of
Token ID286
Feature activation+4.29e-02

Pos logit contributions
 than+0.013
ho+0.011
 unt+0.011
 far+0.011
s+0.011
Neg logit contributions
 Everything-0.016
py-0.015
rell-0.014
loe-0.014
 earthquake-0.014
Token his
Token ID465
Feature activation+5.08e-03

Pos logit contributions
 Everything+0.094
py+0.087
rell+0.086
 earthquake+0.086
 emergency+0.085
Neg logit contributions
 than-0.057
ho-0.053
 unt-0.050
 far-0.049
s-0.047
Token work
Token ID670
Feature activation-1.36e-02

Pos logit contributions
 Everything+0.010
py+0.010
rell+0.009
loe+0.009
 earthquake+0.009
Neg logit contributions
 than-0.008
ho-0.007
s-0.007
bel-0.007
 far-0.007
Token until
Token ID1566
Feature activation+5.96e-03

Pos logit contributions
ho+0.016
 than+0.015
s+0.013
outing+0.013
ater+0.013
Neg logit contributions
 Everything-0.031
 earthquake-0.030
 emergency-0.030
 ambulance-0.029
py-0.029
Token he
Token ID339
Feature activation-6.38e-03

Pos logit contributions
ho+0.005
 than+0.005
s+0.004
 unt+0.004
 far+0.004
Neg logit contributions
 Everything-0.011
 earthquake-0.010
 emergency-0.010
py-0.010
rell-0.010
Token ran
Token ID4966
Feature activation+1.33e-02

Pos logit contributions
ho+0.008
 than+0.008
s+0.007
 unt+0.007
bel+0.007
Neg logit contributions
 Everything-0.015
 earthquake-0.014
 emergency-0.014
py-0.013
rell-0.013
Token away
Token ID1497
Feature activation+9.75e-03

Pos logit contributions
 Everything+0.025
 earthquake+0.023
 emergency+0.023
py+0.023
 ambulance+0.023
Neg logit contributions
ho-0.021
 than-0.021
s-0.019
outing-0.019
bel-0.019
Token.
Token ID13
Feature activation-5.39e-03

Pos logit contributions
 earthquake+0.019
 Everything+0.019
 emergency+0.018
 thunder+0.018
 nightmares+0.018
Neg logit contributions
ho-0.013
outing-0.013
ater-0.012
ucker-0.011
s-0.011
Token
Token ID198
Feature activation-2.65e-03

Pos logit contributions
 than+0.007
ho+0.007
s+0.007
 unt+0.006
 far+0.006
Neg logit contributions
 earthquake-0.011
 emergency-0.011
py-0.011
 ambulance-0.011
 Everything-0.011
Token
Token ID198
Feature activation-1.29e-04

Pos logit contributions
ho+0.003
 than+0.003
s+0.003
 unt+0.002
 far+0.002
Neg logit contributions
 Everything-0.006
 earthquake-0.006
 emergency-0.006
py-0.006
rell-0.006
TokenP
Token ID47
Feature activation-1.31e-02

Pos logit contributions
ho+0.000
 than+0.000
s+0.000
 unt+0.000
bel+0.000
Neg logit contributions
 Everything-0.000
 earthquake-0.000
 emergency-0.000
py-0.000
 ambulance-0.000
Tokenete
Token ID14471
Feature activation-6.66e-02

Pos logit contributions
 than+0.021
ho+0.020
bel+0.017
ater+0.017
s+0.017
Neg logit contributions
 Everything-0.026
loe-0.024
 emergency-0.023
py-0.023
 earthquake-0.023
Token quickly
Token ID2952
Feature activation-2.13e-03

Pos logit contributions
ho+0.088
 than+0.088
s+0.077
bel+0.075
outing+0.075
Neg logit contributions
 Everything-0.145
 earthquake-0.135
 emergency-0.134
py-0.132
 ambulance-0.131
Token ran
Token ID4966
Feature activation+1.32e-02

Pos logit contributions
 than+0.003
ho+0.003
outing+0.003
bel+0.003
s+0.003
Neg logit contributions
 Everything-0.005
 emergency-0.004
 earthquake-0.004
 ambulance-0.004
py-0.004
Token to
Token ID284
Feature activation-2.14e-03

Pos logit contributions
 Everything+0.026
 earthquake+0.024
 emergency+0.024
py+0.024
loe+0.023
Neg logit contributions
 than-0.021
ho-0.020
s-0.019
 unt-0.018
 far-0.018
Token while
Token ID981
Feature activation-5.91e-02

Pos logit contributions
ho+0.006
 than+0.006
s+0.006
 unt+0.005
 far+0.005
Neg logit contributions
 Everything-0.009
 earthquake-0.009
 emergency-0.009
py-0.009
loe-0.009
Token holding
Token ID4769
Feature activation+2.27e-02

Pos logit contributions
ho+0.083
 than+0.080
s+0.071
 far+0.069
bel+0.068
Neg logit contributions
 Everything-0.128
 emergency-0.118
py-0.118
 earthquake-0.118
rell-0.118
Token them
Token ID606
Feature activation-9.40e-04

Pos logit contributions
 Everything+0.053
 earthquake+0.052
py+0.051
 emergency+0.051
rell+0.050
Neg logit contributions
 than-0.027
ho-0.025
s-0.023
 far-0.023
 unt-0.023
Token as
Token ID355
Feature activation-9.71e-02

Pos logit contributions
ho+0.001
 than+0.001
bel+0.001
ater+0.001
outing+0.001
Neg logit contributions
 Everything-0.002
 earthquake-0.002
 emergency-0.002
 ambulance-0.002
py-0.002
Token tight
Token ID5381
Feature activation-2.61e-03

Pos logit contributions
 than+0.098
ho+0.095
ater+0.085
s+0.085
bel+0.080
Neg logit contributions
 Everything-0.232
 earthquake-0.223
 emergency-0.219
 lightning-0.218
 thunder-0.218
Token as
Token ID355
Feature activation-9.12e-02

Pos logit contributions
ho+0.003
 than+0.003
outing+0.002
ater+0.002
bel+0.002
Neg logit contributions
 Everything-0.006
 earthquake-0.006
 emergency-0.006
 bears-0.006
 ambulance-0.006
Token he
Token ID339
Feature activation+1.47e-02

Pos logit contributions
ho+0.112
 than+0.110
s+0.098
ater+0.096
bel+0.095
Neg logit contributions
 Everything-0.205
 earthquake-0.194
 emergency-0.191
py-0.188
 thunder-0.187
Token could
Token ID714
Feature activation+1.48e-02

Pos logit contributions
 Everything+0.033
 earthquake+0.030
 emergency+0.030
py+0.030
 ambulance+0.029
Neg logit contributions
 than-0.019
ho-0.019
s-0.016
outing-0.016
bel-0.016
Token,
Token ID11
Feature activation+2.89e-02

Pos logit contributions
 Everything+0.034
 earthquake+0.031
 emergency+0.031
py+0.031
 ambulance+0.031
Neg logit contributions
ho-0.017
 than-0.017
outing-0.016
ater-0.015
bel-0.015
Token so
Token ID523
Feature activation+1.05e-02

Pos logit contributions
 Everything+0.067
 earthquake+0.063
 emergency+0.063
py+0.062
loe+0.061
Neg logit contributions
 than-0.035
ho-0.034
s-0.030
 unt-0.029
bel-0.028
Token they
Token ID484
Feature activation-6.58e-05

Pos logit contributions
 Everything+0.025
 earthquake+0.023
 emergency+0.023
py+0.022
 ambulance+0.022
Neg logit contributions
ho-0.013
 than-0.012
s-0.011
bel-0.010
 unt-0.010
Token.
Token ID13
Feature activation+6.23e-03

Pos logit contributions
 Everything+0.000
 earthquake+0.000
 emergency+0.000
py+0.000
 ambulance+0.000
Neg logit contributions
ho-0.000
 than-0.000
s-0.000
 unt-0.000
bel-0.000
Token
Token ID198
Feature activation+4.26e-03

Pos logit contributions
 Everything+0.010
 earthquake+0.010
 emergency+0.010
py+0.009
 ambulance+0.009
Neg logit contributions
ho-0.011
 than-0.011
s-0.011
 unt-0.010
bel-0.010
TokenBut
Token ID1537
Feature activation+1.42e-02

Pos logit contributions
 Everything+0.009
loe+0.008
py+0.007
 Everybody+0.007
 Wolf+0.007
Neg logit contributions
ho-0.007
 than-0.007
outing-0.006
ater-0.006
chard-0.006
Token as
Token ID355
Feature activation-9.34e-02

Pos logit contributions
 Everything+0.032
 earthquake+0.030
 emergency+0.030
py+0.029
 ambulance+0.029
Neg logit contributions
ho-0.018
 than-0.017
ater-0.015
s-0.015
outing-0.015
Token soon
Token ID2582
Feature activation+2.43e-02

Pos logit contributions
ho+0.142
ater+0.138
 than+0.134
outing+0.133
ucker+0.123
Neg logit contributions
 Everything-0.178
 lightning-0.169
 thunder-0.168
 earthquake-0.166
 nightmares-0.162
Token as
Token ID355
Feature activation-8.52e-02

Pos logit contributions
 Everything+0.062
 earthquake+0.058
 emergency+0.057
 thunder+0.056
rell+0.056
Neg logit contributions
ho-0.026
 than-0.022
outing-0.021
ater-0.020
bel-0.020
Token he
Token ID339
Feature activation+1.52e-02

Pos logit contributions
ho+0.122
 than+0.116
ater+0.112
outing+0.109
bel+0.103
Neg logit contributions
 Everything-0.176
 earthquake-0.163
 thunder-0.162
 lightning-0.162
 emergency-0.160
Token reached
Token ID4251
Feature activation-2.36e-02

Pos logit contributions
 Everything+0.031
 earthquake+0.028
 emergency+0.028
py+0.027
 thunder+0.027
Neg logit contributions
ho-0.023
 than-0.023
outing-0.020
s-0.020
bel-0.020
Token the
Token ID262
Feature activation+1.67e-02

Pos logit contributions
 than+0.029
ho+0.028
 far+0.026
s+0.026
 unt+0.025
Neg logit contributions
 Everything-0.053
 earthquake-0.051
 emergency-0.050
py-0.049
 ambulance-0.049
Token entrance
Token ID10384
Feature activation-3.24e-02

Pos logit contributions
 Everything+0.028
loe+0.026
py+0.026
rell+0.025
 earthquake+0.025
Neg logit contributions
ho-0.031
 than-0.030
s-0.028
 far-0.028
 unt-0.027
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
ho+0.045
 than+0.045
s+0.040
ater+0.039
 unt+0.039
Neg logit contributions
 Everything-0.068
 earthquake-0.064
 emergency-0.063
py-0.062
 ambulance-0.061
Token She
Token ID1375
Feature activation+1.28e-02

Pos logit contributions
ho+0.002
 than+0.002
s+0.002
 unt+0.002
bel+0.002
Neg logit contributions
 Everything-0.002
 earthquake-0.002
 emergency-0.002
py-0.002
 ambulance-0.002
Token understands
Token ID14759
Feature activation-3.74e-03

Pos logit contributions
 Everything+0.026
 ambulance+0.024
 emergency+0.023
 bears+0.023
 thunder+0.023
Neg logit contributions
ho-0.020
 than-0.019
bel-0.017
outing-0.017
ater-0.017
Token.
Token ID13
Feature activation-2.00e-03

Pos logit contributions
ho+0.004
 than+0.003
s+0.003
 unt+0.003
bel+0.003
Neg logit contributions
 Everything-0.010
 earthquake-0.009
 emergency-0.009
py-0.009
 ambulance-0.009
Token She
Token ID1375
Feature activation+1.02e-02

Pos logit contributions
 than+0.003
ho+0.003
s+0.003
 unt+0.003
 far+0.003
Neg logit contributions
 Everything-0.004
 earthquake-0.003
 emergency-0.003
py-0.003
rell-0.003
Token knows
Token ID4206
Feature activation+8.52e-03

Pos logit contributions
 Everything+0.021
 ambulance+0.019
 emergency+0.019
 bears+0.019
py+0.019
Neg logit contributions
ho-0.015
 than-0.015
bel-0.013
outing-0.013
ater-0.013
Token Ben
Token ID3932
Feature activation-8.17e-02

Pos logit contributions
 Everything+0.020
 earthquake+0.019
 emergency+0.018
rell+0.018
py+0.018
Neg logit contributions
 than-0.010
ho-0.010
s-0.009
 unt-0.008
 far-0.008
Token is
Token ID318
Feature activation-8.25e-03

Pos logit contributions
ho+0.121
 than+0.116
bel+0.108
outing+0.104
ater+0.103
Neg logit contributions
 Everything-0.166
 emergency-0.151
 earthquake-0.151
py-0.150
 ambulance-0.149
Token curious
Token ID11040
Feature activation-2.11e-02

Pos logit contributions
ho+0.013
 than+0.012
s+0.011
bel+0.011
outing+0.010
Neg logit contributions
 Everything-0.017
 earthquake-0.016
 emergency-0.016
py-0.015
 ambulance-0.015
Token.
Token ID13
Feature activation-5.16e-03

Pos logit contributions
 than+0.017
ho+0.014
s+0.013
 far+0.010
outing+0.010
Neg logit contributions
 Everything-0.058
 earthquake-0.056
loe-0.055
 emergency-0.055
py-0.055
Token He
Token ID679
Feature activation-1.40e-02

Pos logit contributions
 than+0.009
ho+0.008
s+0.008
 unt+0.007
outing+0.007
Neg logit contributions
 Everything-0.009
 earthquake-0.009
 emergency-0.009
py-0.009
rell-0.009
Token likes
Token ID7832
Feature activation-4.79e-02

Pos logit contributions
ho+0.021
 than+0.020
bel+0.018
outing+0.018
s+0.017
Neg logit contributions
 Everything-0.029
 emergency-0.026
 ambulance-0.026
 earthquake-0.026
py-0.026
Token so
Token ID523
Feature activation-1.68e-02

Pos logit contributions
 than+0.024
 unt+0.021
 far+0.020
ho+0.020
outing+0.019
Neg logit contributions
 Everything-0.040
loe-0.037
rell-0.036
py-0.036
 earthquake-0.036
Token cold
Token ID4692
Feature activation+3.02e-04

Pos logit contributions
ho+0.024
 than+0.023
bel+0.021
ater+0.020
s+0.020
Neg logit contributions
 Everything-0.035
 earthquake-0.033
 emergency-0.033
 thunder-0.032
py-0.032
Token outside
Token ID2354
Feature activation-2.17e-02

Pos logit contributions
 Everything+0.001
py+0.001
 emergency+0.001
loe+0.001
 earthquake+0.001
Neg logit contributions
 than-0.000
ho-0.000
s-0.000
 far-0.000
 unt-0.000
Token that
Token ID326
Feature activation+7.70e-03

Pos logit contributions
 than+0.024
ho+0.022
s+0.020
 far+0.018
bel+0.017
Neg logit contributions
 Everything-0.055
 earthquake-0.050
py-0.050
 emergency-0.050
rell-0.050
Token my
Token ID616
Feature activation-2.14e-02

Pos logit contributions
 Everything+0.018
loe+0.018
py+0.018
rell+0.017
 ambulance+0.016
Neg logit contributions
 than-0.009
s-0.008
ho-0.008
 far-0.008
 unt-0.007
Token breath
Token ID8033
Feature activation-9.12e-02

Pos logit contributions
 than+0.035
ho+0.034
s+0.031
ater+0.030
bel+0.030
Neg logit contributions
 Everything-0.042
py-0.038
loe-0.037
rell-0.037
 earthquake-0.037
Token froze
Token ID37785
Feature activation-8.02e-03

Pos logit contributions
ho+0.146
 than+0.144
s+0.130
bel+0.126
 unt+0.126
Neg logit contributions
 Everything-0.174
 earthquake-0.161
 emergency-0.160
py-0.157
 ambulance-0.155
Token on
Token ID319
Feature activation+7.34e-03

Pos logit contributions
 than+0.009
ho+0.008
s+0.007
 far+0.007
 unt+0.006
Neg logit contributions
 Everything-0.021
loe-0.019
 emergency-0.019
 earthquake-0.019
rell-0.019
Token my
Token ID616
Feature activation-1.80e-02

Pos logit contributions
 Everything+0.016
py+0.014
loe+0.014
rell+0.014
 earthquake+0.014
Neg logit contributions
 than-0.011
ho-0.010
 far-0.009
s-0.009
ater-0.009
Token face
Token ID1986
Feature activation-1.46e-02

Pos logit contributions
 than+0.026
ho+0.026
s+0.023
ater+0.022
bel+0.022
Neg logit contributions
 Everything-0.038
 earthquake-0.034
 emergency-0.034
py-0.034
loe-0.034
Token!"
Token ID2474
Feature activation-1.11e-02

Pos logit contributions
 than+0.015
ho+0.015
s+0.013
 unt+0.012
ater+0.012
Neg logit contributions
 Everything-0.036
 earthquake-0.034
 emergency-0.034
py-0.034
 ambulance-0.033
Token and
Token ID290
Feature activation-7.78e-04

Pos logit contributions
ho+0.027
 than+0.026
s+0.023
outing+0.022
bel+0.022
Neg logit contributions
 Everything-0.056
 earthquake-0.053
 emergency-0.053
py-0.052
 ambulance-0.052
Token played
Token ID2826
Feature activation+1.99e-02

Pos logit contributions
ho+0.001
 than+0.001
s+0.001
 far+0.001
 unt+0.001
Neg logit contributions
 Everything-0.002
 emergency-0.002
 earthquake-0.002
py-0.002
 ambulance-0.002
Token with
Token ID351
Feature activation-1.48e-02

Pos logit contributions
 Everything+0.049
 earthquake+0.046
 emergency+0.046
py+0.045
rell+0.045
Neg logit contributions
 than-0.021
ho-0.021
s-0.018
 unt-0.018
 far-0.017
Token the
Token ID262
Feature activation+8.89e-03

Pos logit contributions
ho+0.020
 than+0.020
s+0.018
 unt+0.017
ater+0.017
Neg logit contributions
 Everything-0.032
 earthquake-0.030
 emergency-0.030
py-0.029
 ambulance-0.029
Token beetle
Token ID45489
Feature activation-3.86e-02

Pos logit contributions
 Everything+0.019
 earthquake+0.018
py+0.018
 emergency+0.018
 thunder+0.017
Neg logit contributions
ho-0.012
 than-0.011
bel-0.011
outing-0.010
s-0.010
Token as
Token ID355
Feature activation-9.57e-02

Pos logit contributions
 than+0.043
ho+0.043
s+0.038
 unt+0.035
bel+0.034
Neg logit contributions
 Everything-0.094
 earthquake-0.088
 emergency-0.087
 ambulance-0.085
loe-0.085
Token they
Token ID484
Feature activation+2.22e-02

Pos logit contributions
ho+0.118
 than+0.116
s+0.102
bel+0.097
 unt+0.097
Neg logit contributions
 Everything-0.219
 earthquake-0.205
 emergency-0.204
py-0.201
 ambulance-0.198
Token looked
Token ID3114
Feature activation+6.56e-03

Pos logit contributions
 Everything+0.047
 earthquake+0.044
 emergency+0.044
py+0.043
 ambulance+0.043
Neg logit contributions
ho-0.032
 than-0.031
s-0.028
 unt-0.027
bel-0.027
Token for
Token ID329
Feature activation+7.63e-03

Pos logit contributions
 Everything+0.015
 earthquake+0.014
 emergency+0.014
py+0.014
 ambulance+0.014
Neg logit contributions
ho-0.008
 than-0.008
s-0.007
bel-0.007
outing-0.007
Token more
Token ID517
Feature activation-2.77e-03

Pos logit contributions
 Everything+0.015
 emergency+0.013
 earthquake+0.013
py+0.013
 bears+0.013
Neg logit contributions
ho-0.012
 than-0.012
bel-0.011
outing-0.011
s-0.011
Token da
Token ID12379
Feature activation+1.78e-02

Pos logit contributions
ho+0.005
outing+0.004
 than+0.004
bel+0.004
 unt+0.004
Neg logit contributions
 Everything-0.005
py-0.005
 earthquake-0.005
 bears-0.005
 nightmares-0.005