TOP ACTIVATIONS
MAX = 11.355

One day , something magical happened . A little girl has
joy . But then something happened . The son quickly forgot
one day , something special happened . The cat
One day , something magical happened . A little frog escaped
Then , something magical happened . As soon as Suzanne
Then , something magical happened . Amy felt a warm
park entrance , something unexpected happened . A big sign stopped
my crying and asked what happened . Tim my told him
day , something very different happened ! A wave ! It
But then , something bad happened . Lily accidentally knocked over
insect 's back . Nothing happened , so he touched it
Tim my told her what happened and she said , "
but then a magical thing happened . A big , pink
room and saw what had happened . She told Lily 's
Jack asked his mom what happened . She laughed and said
over and saw what had happened . She was sad because
But then something amazing happened . Sir Robert was able
her meal , something strange happened . Lily started to shrink
very sad about what had happened and never wrote about sailing
one day , something special happened . The old

MIN ACTIVATIONS
MIN = -6.789

and sharp teeth . It bar ks loud .
and sharp teeth . It bar ks loud .
not like water . He bar ks and runs away .
and sharp teeth . It bar ks loud . Lily and
said I 'm the fastest bar ber in town !" Everyone
sharp teeth . The dog bar ks loud . Anna is
. The dog bar ks and runs away .
?" The dog bar ks . He says ,
. The bar ber smiled and said ,
the ball . The dog bar ks . "
. He broke the chocolate bar in half and gave one
?" The dog bar ks . He does not
they all knew the great bar ber was the fastest in
The dog is angry and bar ks at them . It
dog is angry . It bar ks and runs to them
dog watches them . He bar ks and cheers . He
. The puppy bar ks and jumps out of
Spot is scared . He bar ks at the truck and
thought the man was the bar ber . She
. It runs fast and bar ks loud . Tom and

INTERVAL 6.819 - 11.355
CONTAINS 0.048%

knew that no matter what happened , they would always be
One day , something strange happened and the pilot had to
. It looked even y umm ier . Lily wanted to
One day , something special happened as he polished the toy
hands , good things would happen to it .

INTERVAL 2.283 - 6.819
CONTAINS 7.223%

so peaceful and content in Little Mark 's palm . He
. They peek ed inside the house . It
them together . The sound of the drums filled the air
and hung it up to dry . The next day ,
fly away , but the hawk was too fast . So

INTERVAL -2.253 - 2.283
CONTAINS 85.774%

his tail and bar ks . " Max ,
dead . Ben felt sad . He wanted to help the
it was time to go home , Bobby was so happy
One day , Lily saw a boy who was sad
are happy to be alive . <|endoftext|> L ily and Tom

INTERVAL -6.789 - -2.253
CONTAINS 6.955%

that his mum had replied with kindness instead of sc olding
boy . His name was Jake . Every morning he loved
soon . She wanted to see more books . She wanted
Everyone loved the film and hoped it would last forever .
nodded and they began to play mar bles . Sarah was
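
Note on the interval boundaries: the four INTERVAL ranges above are consistent with splitting the span from MIN to MAX into four equal-width bins of width (11.355 - (-6.789)) / 4 = 4.536. This is inferred from the printed numbers; the dashboard does not state how it bins. A minimal Python sketch under that assumption (the helper name `equal_width_bins` is hypothetical, not part of any tool):

```python
def equal_width_bins(lo, hi, n_bins):
    """Split [lo, hi] into n_bins equal-width intervals, highest interval first.

    Assumption: the dashboard bins activations this way; this is inferred
    from the printed boundaries, not from the tool's documentation.
    """
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    # Round to three decimals to match the precision shown in the dump.
    return [
        (round(edges[i], 3), round(edges[i + 1], 3))
        for i in reversed(range(n_bins))
    ]


# MIN and MAX as printed in the dump.
bins = equal_width_bins(-6.789, 11.355, 4)

# "CONTAINS" percentages as printed, highest interval first.
shares = [0.048, 7.223, 85.774, 6.955]

for (low, high), share in zip(bins, shares):
    print(f"INTERVAL {low} - {high}  CONTAINS {share}%")
```

The printed percentages sum to 100%, which fits the reading that each sampled activation falls into exactly one of the four bins.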
TokenOne
Token ID3198
Feature activation-5.02e-01

Pos logit contributions
urg+0.021
 accidents+0.020
ruck+0.020
icles+0.019
umm+0.019
Neg logit contributions
ars-0.018
 likes-0.017
 studies-0.017
 seven-0.017
 clipboard-0.017
Token day
Token ID1110
Feature activation+9.11e-01

Pos logit contributions
ars+0.031
 clipboard+0.031
 studies+0.030
 hips+0.029
 likes+0.028
Neg logit contributions
 accidents-0.025
urg-0.024
ruck-0.024
icles-0.023
ull-0.022
Token,
Token ID11
Feature activation+1.21e-01

Pos logit contributions
urg+0.068
 accidents+0.067
ruck+0.065
icles+0.063
umm+0.061
Neg logit contributions
ars-0.034
 studies-0.033
 likes-0.033
 clipboard-0.031
 seven-0.031
Token something
Token ID1223
Feature activation+3.47e+00

Pos logit contributions
urg+0.007
 accidents+0.007
ruck+0.006
icles+0.006
umm+0.006
Neg logit contributions
ars-0.007
 likes-0.007
 studies-0.007
 clipboard-0.006
 seven-0.006
Token magical
Token ID10883
Feature activation+2.23e+00

Pos logit contributions
 accidents+0.212
urg+0.203
ruck+0.192
icles+0.182
ull+0.179
Neg logit contributions
ars-0.181
 clipboard-0.174
 studies-0.173
 likes-0.166
 seven-0.163
Token happened
Token ID3022
Feature activation+9.74e+00

Pos logit contributions
urg+0.105
 accidents+0.102
ruck+0.097
icles+0.093
umm+0.090
Neg logit contributions
ars-0.146
 likes-0.142
 studies-0.142
 clipboard-0.138
 seven-0.138
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
urg+0.618
 accidents+0.609
ruck+0.587
icles+0.561
umm+0.537
Neg logit contributions
ars-0.478
 studies-0.458
 likes-0.450
 clipboard-0.446
 seven-0.435
Token A
Token ID317
Feature activation+1.61e+00

Pos logit contributions
urg+0.014
 accidents+0.014
ruck+0.013
icles+0.013
umm+0.012
Neg logit contributions
ars-0.011
 likes-0.011
 studies-0.011
 clipboard-0.010
 seven-0.010
Token little
Token ID1310
Feature activation+4.20e+00

Pos logit contributions
 accidents+0.095
urg+0.089
ruck+0.087
umm+0.082
ull+0.082
Neg logit contributions
ars-0.087
 studies-0.081
 likes-0.081
ighed-0.077
 clipboard-0.077
Token girl
Token ID2576
Feature activation+2.72e-01

Pos logit contributions
 accidents+0.227
ruck+0.200
urg+0.194
umm+0.191
ter+0.187
Neg logit contributions
ars-0.268
 studies-0.245
 clipboard-0.239
ossible-0.234
 likes-0.232
Token has
Token ID468
Feature activation-2.04e+00

Pos logit contributions
urg+0.017
 accidents+0.017
ruck+0.016
icles+0.015
umm+0.015
Neg logit contributions
ars-0.014
 studies-0.013
 clipboard-0.012
 likes-0.012
 seven-0.012
Token joy
Token ID8716
Feature activation+1.94e+00

Pos logit contributions
ars+0.136
 likes+0.130
 studies+0.129
 clipboard+0.126
 seven+0.126
Neg logit contributions
 accidents-0.139
urg-0.135
ruck-0.131
icles-0.123
umm-0.120
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
urg+0.151
 accidents+0.146
icles+0.141
ruck+0.141
umm+0.140
Neg logit contributions
ars-0.064
 likes-0.064
 studies-0.063
 clipboard-0.061
 seven-0.059
Token But
Token ID887
Feature activation-2.27e+00

Pos logit contributions
urg+0.017
 accidents+0.017
ruck+0.016
icles+0.016
umm+0.015
Neg logit contributions
 likes-0.011
ars-0.011
 studies-0.011
 seven-0.010
 clipboard-0.010
Token then
Token ID788
Feature activation-1.96e+00

Pos logit contributions
ars+0.118
 likes+0.112
 studies+0.112
 clipboard+0.110
 seven+0.108
Neg logit contributions
urg-0.140
 accidents-0.138
ruck-0.132
icles-0.128
umm-0.125
Token something
Token ID1223
Feature activation+3.29e+00

Pos logit contributions
ars+0.067
 likes+0.063
 studies+0.063
 clipboard+0.060
 seven+0.060
Neg logit contributions
urg-0.155
 accidents-0.151
ruck-0.148
icles-0.145
umm-0.143
Token happened
Token ID3022
Feature activation+9.73e+00

Pos logit contributions
urg+0.212
 accidents+0.201
ruck+0.198
icles+0.194
umm+0.193
Neg logit contributions
ars-0.163
 likes-0.156
 studies-0.154
 seven-0.150
 clipboard-0.148
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
 accidents+0.604
urg+0.586
ruck+0.568
icles+0.530
ull+0.506
Neg logit contributions
ars-0.501
 studies-0.471
 clipboard-0.462
 likes-0.461
 seven-0.454
Token The
Token ID383
Feature activation-1.48e-01

Pos logit contributions
urg+0.020
 accidents+0.020
ruck+0.020
rest+0.019
umm+0.018
Neg logit contributions
 likes-0.008
 seven-0.008
 clipboard-0.008
 studies-0.008
 bar-0.008
Token son
Token ID3367
Feature activation+2.71e+00

Pos logit contributions
ars+0.008
 likes+0.007
 studies+0.007
 clipboard+0.007
 seven+0.007
Neg logit contributions
urg-0.009
 accidents-0.009
ruck-0.008
icles-0.008
umm-0.008
Token quickly
Token ID2952
Feature activation-7.63e-01

Pos logit contributions
urg+0.181
 accidents+0.177
ruck+0.171
icles+0.165
umm+0.162
Neg logit contributions
ars-0.124
 likes-0.119
 studies-0.119
 clipboard-0.115
 seven-0.114
Token forgot
Token ID16453
Feature activation+3.37e+00

Pos logit contributions
ars+0.036
 likes+0.035
 studies+0.034
 clipboard+0.033
 seven+0.033
Neg logit contributions
urg-0.050
 accidents-0.049
ruck-0.047
icles-0.046
umm-0.045
Token one
Token ID530
Feature activation-1.39e+00

Pos logit contributions
ars+0.154
 clipboard+0.135
 studies+0.131
ighed+0.128
 likes+0.127
Neg logit contributions
 accidents-0.121
icles-0.101
urg-0.100
ruck-0.099
 Bobby-0.095
Token day
Token ID1110
Feature activation-4.02e-01

Pos logit contributions
ars+0.084
 clipboard+0.079
 studies+0.079
 hips+0.075
 seven+0.074
Neg logit contributions
 accidents-0.075
ruck-0.071
urg-0.069
ull-0.065
umm-0.064
Token,
Token ID11
Feature activation-3.60e-01

Pos logit contributions
ars+0.011
 likes+0.011
 studies+0.011
 seven+0.010
 clipboard+0.010
Neg logit contributions
urg-0.034
 accidents-0.033
ruck-0.033
umm-0.032
icles-0.032
Token something
Token ID1223
Feature activation+3.27e+00

Pos logit contributions
ars+0.017
 likes+0.016
 studies+0.016
 clipboard+0.016
 seven+0.016
Neg logit contributions
urg-0.024
 accidents-0.023
ruck-0.022
icles-0.022
umm-0.021
Token special
Token ID2041
Feature activation+6.74e-01

Pos logit contributions
urg+0.208
 accidents+0.200
ruck+0.195
icles+0.190
umm+0.188
Neg logit contributions
ars-0.162
 likes-0.157
 studies-0.156
 seven-0.150
 clipboard-0.150
Token happened
Token ID3022
Feature activation+9.73e+00

Pos logit contributions
urg+0.020
 accidents+0.017
umm+0.015
ruck+0.015
icles+0.014
Neg logit contributions
 likes-0.058
ars-0.057
 studies-0.056
 bar-0.054
 think-0.054
Token.
Token ID13
Feature activation+3.24e-01

Pos logit contributions
 accidents+0.597
urg+0.586
ruck+0.570
icles+0.533
ull+0.502
Neg logit contributions
ars-0.499
 studies-0.473
 clipboard-0.466
 likes-0.458
 seven-0.456
Token
Token ID198
Feature activation+3.02e-01

Pos logit contributions
urg+0.024
 accidents+0.023
ruck+0.023
umm+0.022
icles+0.021
Neg logit contributions
ars-0.013
 studies-0.013
 likes-0.013
 clipboard-0.012
 seven-0.012
Token
Token ID198
Feature activation+3.12e-01

Pos logit contributions
urg+0.020
 accidents+0.020
ruck+0.019
icles+0.018
umm+0.018
Neg logit contributions
ars-0.014
 studies-0.013
 likes-0.013
 clipboard-0.013
 seven-0.013
TokenThe
Token ID464
Feature activation-3.09e-01

Pos logit contributions
ruck+0.017
 accidents+0.017
urg+0.017
icles+0.016
ull+0.015
Neg logit contributions
 studies-0.017
 likes-0.016
 clipboard-0.016
ars-0.016
 pocket-0.016
Token cat
Token ID3797
Feature activation+5.16e-01

Pos logit contributions
ars+0.017
 studies+0.017
 clipboard+0.016
 seven+0.016
 likes+0.016
Neg logit contributions
 accidents-0.017
urg-0.016
ruck-0.016
umm-0.015
olk-0.015
TokenOne
Token ID3198
Feature activation-1.98e+00

Pos logit contributions
ruck+0.017
 accidents+0.016
urg+0.016
icles+0.016
ull+0.015
Neg logit contributions
 studies-0.012
 clipboard-0.012
ars-0.012
 pocket-0.012
 likes-0.011
Token day
Token ID1110
Feature activation-4.14e-01

Pos logit contributions
ars+0.103
 studies+0.100
 clipboard+0.099
 likes+0.097
 seven+0.095
Neg logit contributions
urg-0.119
 accidents-0.117
ruck-0.113
icles-0.108
umm-0.106
Token,
Token ID11
Feature activation-3.88e-01

Pos logit contributions
 likes+0.008
ars+0.007
 seven+0.006
 bar+0.006
 sees+0.006
Neg logit contributions
urg-0.041
umm-0.040
ruck-0.038
 accidents-0.038
icles-0.037
Token something
Token ID1223
Feature activation+3.28e+00

Pos logit contributions
 likes+0.016
ars+0.016
 seven+0.016
 bar+0.015
 clipboard+0.015
Neg logit contributions
urg-0.028
ruck-0.026
umm-0.026
 accidents-0.026
icles-0.025
Token magical
Token ID10883
Feature activation+1.94e+00

Pos logit contributions
urg+0.223
 accidents+0.212
ruck+0.211
icles+0.207
umm+0.205
Neg logit contributions
ars-0.150
 likes-0.145
 studies-0.141
 seven-0.138
 clipboard-0.134
Token happened
Token ID3022
Feature activation+9.71e+00

Pos logit contributions
urg+0.053
 accidents+0.045
ruck+0.043
umm+0.042
icles+0.041
Neg logit contributions
 likes-0.169
ars-0.169
 studies-0.163
 sees-0.161
 seven-0.160
Token.
Token ID13
Feature activation+2.76e-01

Pos logit contributions
urg+0.643
 accidents+0.637
ruck+0.612
icles+0.588
umm+0.565
Neg logit contributions
ars-0.447
 studies-0.428
 likes-0.421
 clipboard-0.417
 seven-0.408
Token A
Token ID317
Feature activation+1.50e+00

Pos logit contributions
urg+0.017
 accidents+0.017
ruck+0.016
icles+0.016
umm+0.016
Neg logit contributions
ars-0.014
 likes-0.013
 studies-0.013
 clipboard-0.013
 seven-0.013
Token little
Token ID1310
Feature activation+4.21e+00

Pos logit contributions
 accidents+0.086
urg+0.083
ruck+0.079
umm+0.077
olk+0.075
Neg logit contributions
ars-0.081
 studies-0.078
 likes-0.077
 clipboard-0.075
ossible-0.073
Token frog
Token ID21264
Feature activation-5.07e-01

Pos logit contributions
 accidents+0.223
ruck+0.194
urg+0.193
umm+0.190
olk+0.186
Neg logit contributions
ars-0.270
 studies-0.253
 clipboard-0.249
 hips-0.239
ossible-0.238
Token escaped
Token ID13537
Feature activation+2.14e+00

Pos logit contributions
ars+0.026
 studies+0.026
 clipboard+0.025
 seven+0.025
 likes+0.024
Neg logit contributions
urg-0.030
 accidents-0.030
ruck-0.028
icles-0.026
ull-0.026
Token
Token ID198
Feature activation+3.26e-01

Pos logit contributions
urg+0.022
 accidents+0.021
ruck+0.021
icles+0.020
umm+0.019
Neg logit contributions
ars-0.015
 likes-0.015
 studies-0.015
 clipboard-0.014
 seven-0.014
TokenThen
Token ID6423
Feature activation+1.01e+00

Pos logit contributions
urg+0.022
 accidents+0.021
ruck+0.020
icles+0.020
umm+0.020
Neg logit contributions
ars-0.015
 likes-0.015
 studies-0.015
 clipboard-0.014
 seven-0.014
Token,
Token ID11
Feature activation-3.43e-01

Pos logit contributions
urg+0.085
ruck+0.082
 accidents+0.081
umm+0.079
icles+0.078
Neg logit contributions
ars-0.029
 likes-0.028
 seven-0.027
 studies-0.027
 clipboard-0.026
Token something
Token ID1223
Feature activation+3.28e+00

Pos logit contributions
 seven+0.014
 think+0.014
 likes+0.013
 pocket+0.013
 clipboard+0.013
Neg logit contributions
urg-0.025
ruck-0.023
umm-0.023
 accidents-0.022
icles-0.022
Token magical
Token ID10883
Feature activation+2.01e+00

Pos logit contributions
urg+0.207
 accidents+0.195
ruck+0.193
icles+0.188
umm+0.186
Neg logit contributions
ars-0.166
 likes-0.160
 studies-0.158
 seven-0.154
 clipboard-0.152
Token happened
Token ID3022
Feature activation+9.71e+00

Pos logit contributions
urg+0.059
 accidents+0.049
ruck+0.046
umm+0.045
icles+0.045
Neg logit contributions
ars-0.169
 likes-0.168
 studies-0.164
 sees-0.160
 bar-0.160
Token.
Token ID13
Feature activation+3.08e-01

Pos logit contributions
 accidents+0.612
urg+0.604
ruck+0.580
icles+0.552
umm+0.524
Neg logit contributions
ars-0.483
 studies-0.459
 likes-0.450
 clipboard-0.447
 seven-0.433
Token As
Token ID1081
Feature activation+2.96e+00

Pos logit contributions
urg+0.021
 accidents+0.020
ruck+0.020
umm+0.018
icles+0.018
Neg logit contributions
 studies-0.014
 seven-0.013
 likes-0.013
 clipboard-0.013
ars-0.013
Token soon
Token ID2582
Feature activation-4.65e-01

Pos logit contributions
urg+0.229
umm+0.217
icles+0.209
ruck+0.207
 accidents+0.201
Neg logit contributions
 seven-0.109
 think-0.103
 high-0.103
 pocket-0.100
 swing-0.100
Token as
Token ID355
Feature activation-9.58e-01

Pos logit contributions
 seven+0.015
ars+0.015
 bar+0.014
 studies+0.014
 pocket+0.014
Neg logit contributions
urg-0.036
eddy-0.034
umm-0.034
icles-0.034
ruck-0.033
Token Suzanne
Token ID45587
Feature activation+4.83e-01

Pos logit contributions
 seven+0.030
 think+0.030
 studies+0.028
ars+0.028
 high+0.028
Neg logit contributions
urg-0.078
ruck-0.072
umm-0.072
icles-0.071
 accidents-0.070
Token
Token ID198
Feature activation+3.42e-01

Pos logit contributions
urg+0.021
 accidents+0.021
ruck+0.020
icles+0.019
umm+0.019
Neg logit contributions
ars-0.017
 studies-0.016
 likes-0.016
 clipboard-0.016
 seven-0.016
TokenThen
Token ID6423
Feature activation+9.93e-01

Pos logit contributions
 accidents+0.016
ruck+0.014
urg+0.014
Mill+0.013
olk+0.013
Neg logit contributions
 clipboard-0.021
 studies-0.021
 pocket-0.021
ars-0.020
ebra-0.020
Token,
Token ID11
Feature activation-3.41e-01

Pos logit contributions
 accidents+0.065
urg+0.061
ruck+0.058
icles+0.058
olk+0.056
Neg logit contributions
ars-0.048
 studies-0.046
 clipboard-0.045
 likes-0.043
 seven-0.042
Token something
Token ID1223
Feature activation+3.27e+00

Pos logit contributions
ars+0.020
 studies+0.019
 likes+0.019
 clipboard+0.019
 seven+0.018
Neg logit contributions
 accidents-0.019
urg-0.018
ruck-0.017
icles-0.017
umm-0.016
Token magical
Token ID10883
Feature activation+2.00e+00

Pos logit contributions
urg+0.175
 accidents+0.170
ruck+0.163
icles+0.157
umm+0.153
Neg logit contributions
ars-0.193
 likes-0.187
 studies-0.187
 clipboard-0.182
 seven-0.181
Token happened
Token ID3022
Feature activation+9.70e+00

Pos logit contributions
urg+0.053
 accidents+0.045
ruck+0.043
icles+0.040
umm+0.040
Neg logit contributions
ars-0.172
 likes-0.171
 studies-0.167
 seven-0.164
 bar-0.163
Token.
Token ID13
Feature activation+3.08e-01

Pos logit contributions
urg+0.624
 accidents+0.623
ruck+0.594
icles+0.571
umm+0.544
Neg logit contributions
ars-0.469
 studies-0.448
 likes-0.440
 clipboard-0.434
 seven-0.425
Token Amy
Token ID14235
Feature activation+2.26e-01

Pos logit contributions
 accidents+0.016
urg+0.015
ruck+0.014
icles+0.014
umm+0.013
Neg logit contributions
ars-0.019
 clipboard-0.018
 studies-0.018
 likes-0.018
 pocket-0.017
Token felt
Token ID2936
Feature activation+4.13e-01

Pos logit contributions
urg+0.012
 accidents+0.012
ruck+0.011
 happened+0.011
icles+0.010
Neg logit contributions
ars-0.013
 clipboard-0.012
 mat-0.012
 seven-0.011
 studies-0.011
Token a
Token ID257
Feature activation-4.80e-02

Pos logit contributions
 accidents+0.025
urg+0.024
ruck+0.023
icles+0.022
umm+0.021
Neg logit contributions
ars-0.022
 studies-0.021
 likes-0.021
 seven-0.021
 clipboard-0.021
Token warm
Token ID5814
Feature activation+3.63e+00

Pos logit contributions
ars+0.002
 likes+0.002
 studies+0.002
 clipboard+0.002
 seven+0.002
Neg logit contributions
urg-0.003
 accidents-0.003
ruck-0.003
icles-0.003
umm-0.003
Token park
Token ID3952
Feature activation-2.05e+00

Pos logit contributions
ars+0.027
 studies+0.026
 clipboard+0.025
 likes+0.025
ossible+0.024
Neg logit contributions
 accidents-0.033
urg-0.032
ruck-0.031
icles-0.031
umm-0.031
Token entrance
Token ID10384
Feature activation+1.25e+00

Pos logit contributions
ars+0.085
 studies+0.079
 likes+0.079
 clipboard+0.076
 seven+0.074
Neg logit contributions
 accidents-0.148
urg-0.146
ruck-0.139
icles-0.137
umm-0.133
Token,
Token ID11
Feature activation-3.29e-01

Pos logit contributions
 accidents+0.090
urg+0.089
ruck+0.086
icles+0.084
umm+0.081
Neg logit contributions
ars-0.051
 studies-0.049
 likes-0.048
 clipboard-0.047
 seven-0.046
Token something
Token ID1223
Feature activation+3.36e+00

Pos logit contributions
ars+0.014
 studies+0.013
 likes+0.013
 seven+0.013
 clipboard+0.013
Neg logit contributions
urg-0.023
 accidents-0.022
ruck-0.022
icles-0.021
umm-0.021
Token unexpected
Token ID10059
Feature activation+4.48e+00

Pos logit contributions
urg+0.209
 accidents+0.202
ruck+0.196
icles+0.190
umm+0.187
Neg logit contributions
ars-0.170
 likes-0.164
 studies-0.163
 clipboard-0.158
 seven-0.157
Token happened
Token ID3022
Feature activation+9.70e+00

Pos logit contributions
urg+0.105
 accidents+0.089
ruck+0.081
umm+0.078
icles+0.076
Neg logit contributions
ars-0.404
 likes-0.399
 studies-0.394
 seven-0.386
 sees-0.385
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
urg+0.645
 accidents+0.636
ruck+0.613
icles+0.592
umm+0.573
Neg logit contributions
ars-0.444
 studies-0.426
 likes-0.423
 clipboard-0.411
 seven-0.404
Token A
Token ID317
Feature activation+1.61e+00

Pos logit contributions
urg+0.021
 accidents+0.020
ruck+0.020
icles+0.019
umm+0.019
Neg logit contributions
ars-0.013
 studies-0.013
 likes-0.013
 clipboard-0.012
 seven-0.012
Token big
Token ID1263
Feature activation-1.21e-01

Pos logit contributions
 accidents+0.103
urg+0.096
ruck+0.096
umm+0.090
ull+0.087
Neg logit contributions
ars-0.077
 studies-0.075
 likes-0.074
ossible-0.070
 clipboard-0.069
Token sign
Token ID1051
Feature activation-2.59e+00

Pos logit contributions
ars+0.007
 studies+0.006
 clipboard+0.006
ossible+0.006
ighed+0.006
Neg logit contributions
 accidents-0.008
ruck-0.007
urg-0.007
olk-0.006
umm-0.006
Token stopped
Token ID5025
Feature activation+3.46e+00

Pos logit contributions
ars+0.145
 studies+0.142
 likes+0.138
 clipboard+0.137
 think+0.131
Neg logit contributions
 accidents-0.147
ruck-0.138
urg-0.135
icles-0.128
 happened-0.125
Tokenmy
Token ID1820
Feature activation+1.81e-01

Pos logit contributions
urg+0.117
 accidents+0.115
ruck+0.111
icles+0.106
umm+0.102
Neg logit contributions
ars-0.099
 studies-0.095
 likes-0.095
 clipboard-0.092
 seven-0.092
Token crying
Token ID13774
Feature activation+1.03e+00

Pos logit contributions
urg+0.013
 accidents+0.013
ruck+0.013
icles+0.012
umm+0.012
Neg logit contributions
ars-0.007
 likes-0.007
 studies-0.007
 clipboard-0.007
 seven-0.007
Token and
Token ID290
Feature activation+6.58e-01

Pos logit contributions
urg+0.076
 accidents+0.075
ruck+0.073
icles+0.070
umm+0.069
Neg logit contributions
ars-0.041
 studies-0.039
 likes-0.038
 clipboard-0.037
 seven-0.037
Token asked
Token ID1965
Feature activation+1.30e+00

Pos logit contributions
urg+0.052
umm+0.051
icles+0.049
 accidents+0.049
olk+0.048
Neg logit contributions
 likes-0.023
 sees-0.021
 bar-0.021
 clipboard-0.021
 think-0.021
Token what
Token ID644
Feature activation+8.09e-01

Pos logit contributions
urg+0.089
umm+0.086
olk+0.082
ruck+0.082
 accidents+0.081
Neg logit contributions
 likes-0.059
 think-0.056
 sees-0.055
ars-0.054
 bar-0.054
Token happened
Token ID3022
Feature activation+9.70e+00

Pos logit contributions
urg+0.056
umm+0.051
icles+0.050
olk+0.049
ruck+0.048
Neg logit contributions
 likes-0.045
 bar-0.041
 sees-0.041
 studies-0.041
ars-0.038
Token.
Token ID13
Feature activation+3.31e-01

Pos logit contributions
urg+0.674
 accidents+0.665
ruck+0.641
icles+0.620
umm+0.606
Neg logit contributions
ars-0.422
 studies-0.402
 likes-0.399
 clipboard-0.387
 seven-0.385
Token Tim
Token ID5045
Feature activation+1.92e+00

Pos logit contributions
 accidents+0.011
urg+0.011
ruck+0.010
icles+0.010
umm+0.009
Neg logit contributions
ars-0.027
 studies-0.025
 likes-0.025
 clipboard-0.025
 seven-0.024
Tokenmy
Token ID1820
Feature activation+1.90e-01

Pos logit contributions
urg+0.104
 accidents+0.102
ruck+0.099
icles+0.093
olk+0.086
Neg logit contributions
ars-0.114
 studies-0.109
 seven-0.105
 clipboard-0.105
 mat-0.105
Token told
Token ID1297
Feature activation+1.51e+00

Pos logit contributions
urg+0.013
 accidents+0.013
ruck+0.013
icles+0.012
umm+0.012
Neg logit contributions
ars-0.008
 studies-0.008
 likes-0.008
 clipboard-0.007
 seven-0.007
Token him
Token ID683
Feature activation+6.85e-01

Pos logit contributions
urg+0.097
umm+0.093
 accidents+0.091
ruck+0.091
olk+0.087
Neg logit contributions
ars-0.071
 studies-0.069
 bar-0.069
 clipboard-0.069
 likes-0.068
Token day
Token ID1110
Feature activation-4.11e-01

Pos logit contributions
ars+0.076
 studies+0.072
 clipboard+0.071
 likes+0.069
 seven+0.069
Neg logit contributions
 accidents-0.078
urg-0.075
ruck-0.074
icles-0.069
umm-0.069
Token,
Token ID11
Feature activation-3.48e-01

Pos logit contributions
 likes+0.010
ars+0.009
 studies+0.009
 seven+0.008
 bar+0.008
Neg logit contributions
urg-0.037
umm-0.036
ruck-0.036
 accidents-0.035
icles-0.035
Token something
Token ID1223
Feature activation+3.29e+00

Pos logit contributions
ars+0.018
 likes+0.018
 studies+0.018
 clipboard+0.017
 seven+0.017
Neg logit contributions
urg-0.021
ruck-0.020
 accidents-0.020
umm-0.019
icles-0.019
Token very
Token ID845
Feature activation-3.94e+00

Pos logit contributions
urg+0.209
 accidents+0.195
ruck+0.195
icles+0.191
umm+0.191
Neg logit contributions
ars-0.166
 likes-0.162
 studies-0.158
 seven-0.153
 clipboard-0.151
Token different
Token ID1180
Feature activation+1.81e-01

Pos logit contributions
ars+0.260
 clipboard+0.248
 studies+0.238
ossible+0.235
ebra+0.234
Neg logit contributions
 accidents-0.203
urg-0.172
ruck-0.166
asting-0.158
 happened-0.156
Token happened
Token ID3022
Feature activation+9.69e+00

Pos logit contributions
urg+0.009
umm+0.008
 accidents+0.007
ruck+0.007
olk+0.007
Neg logit contributions
 likes-0.012
 studies-0.011
ars-0.011
 think-0.011
 bar-0.011
Token!
Token ID0
Feature activation+9.38e-01

Pos logit contributions
urg+0.634
 accidents+0.629
ruck+0.606
icles+0.578
umm+0.554
Neg logit contributions
ars-0.455
 studies-0.431
 likes-0.427
 clipboard-0.421
 seven-0.415
Token A
Token ID317
Feature activation+1.56e+00

Pos logit contributions
urg+0.073
 accidents+0.070
ruck+0.068
umm+0.065
icles+0.063
Neg logit contributions
ars-0.034
 studies-0.033
 likes-0.033
 clipboard-0.032
 seven-0.032
Token wave
Token ID6769
Feature activation-1.98e+00

Pos logit contributions
 accidents+0.092
urg+0.089
ruck+0.086
umm+0.081
olk+0.080
Neg logit contributions
ars-0.084
 studies-0.080
 likes-0.079
 seven-0.077
 clipboard-0.075
Token!
Token ID0
Feature activation+9.31e-01

Pos logit contributions
ars+0.120
 likes+0.118
 studies+0.116
 clipboard+0.113
 seven+0.113
Neg logit contributions
urg-0.102
 accidents-0.099
ruck-0.094
icles-0.091
umm-0.090
Token It
Token ID632
Feature activation+1.13e+00

Pos logit contributions
urg+0.073
 accidents+0.071
ruck+0.069
umm+0.065
icles+0.064
Neg logit contributions
ars-0.033
 likes-0.032
 studies-0.031
 seven-0.031
 clipboard-0.031
TokenBut
Token ID1537
Feature activation-3.58e+00

Pos logit contributions
 accidents+0.019
urg+0.018
ruck+0.018
icles+0.016
asting+0.016
Neg logit contributions
 studies-0.025
ars-0.025
 clipboard-0.025
 pocket-0.024
 likes-0.024
Token then
Token ID788
Feature activation-1.92e+00

Pos logit contributions
ars+0.129
 likes+0.127
 studies+0.125
 seven+0.121
 clipboard+0.119
Neg logit contributions
urg-0.273
 accidents-0.263
ruck-0.259
icles-0.251
umm-0.250
Token,
Token ID11
Feature activation-3.10e-01

Pos logit contributions
 likes+0.052
 studies+0.048
ars+0.048
 seven+0.047
 sees+0.047
Neg logit contributions
urg-0.167
ruck-0.162
umm-0.160
icles-0.157
 accidents-0.153
Token something
Token ID1223
Feature activation+3.28e+00

Pos logit contributions
 likes+0.014
 seven+0.013
 bar+0.013
 pocket+0.013
 think+0.013
Neg logit contributions
urg-0.022
umm-0.021
ruck-0.021
icles-0.019
ull-0.019
Token bad
Token ID2089
Feature activation+3.14e+00

Pos logit contributions
urg+0.219
ruck+0.204
icles+0.203
umm+0.201
 accidents+0.198
Neg logit contributions
ars-0.157
 likes-0.153
 studies-0.146
 seven-0.143
 bar-0.142
Token happened
Token ID3022
Feature activation+9.69e+00

Pos logit contributions
urg+0.102
umm+0.088
icles+0.081
 accidents+0.080
ruck+0.080
Neg logit contributions
 likes-0.262
ars-0.254
 sees-0.248
 studies-0.248
 bar-0.245
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
 accidents+0.640
urg+0.639
ruck+0.609
icles+0.584
umm+0.563
Neg logit contributions
ars-0.453
 studies-0.430
 likes-0.424
 clipboard-0.420
 seven-0.414
Token Lily
Token ID20037
Feature activation+1.01e+00

Pos logit contributions
urg+0.024
 accidents+0.023
ruck+0.023
icles+0.022
umm+0.022
Neg logit contributions
ars-0.018
 likes-0.017
 studies-0.017
 seven-0.017
 clipboard-0.017
Token accidentally
Token ID14716
Feature activation+4.61e+00

Pos logit contributions
urg+0.062
 accidents+0.061
ruck+0.058
icles+0.054
olk+0.053
Neg logit contributions
ars-0.052
 studies-0.049
 clipboard-0.049
 seven-0.048
 likes-0.048
Token knocked
Token ID13642
Feature activation+2.87e+00

Pos logit contributions
umm+0.351
urg+0.339
 accidents+0.324
icles+0.316
ruck+0.313
Neg logit contributions
 likes-0.173
 bar-0.173
 sees-0.171
 studies-0.168
 clipboard-0.168
Token over
Token ID625
Feature activation-1.10e+00

Pos logit contributions
urg+0.180
 accidents+0.171
ruck+0.164
umm+0.161
icles+0.159
Neg logit contributions
ars-0.145
 likes-0.141
 studies-0.140
 seven-0.138
 clipboard-0.136
Token insect
Token ID12833
Feature activation+9.86e-02

Pos logit contributions
ars+0.034
 studies+0.034
 clipboard+0.033
 likes+0.031
 pedal+0.031
Neg logit contributions
 accidents-0.030
urg-0.029
ruck-0.027
olk-0.026
ull-0.025
Token's
Token ID338
Feature activation+2.52e+00

Pos logit contributions
 accidents+0.006
urg+0.006
icles+0.005
ruck+0.005
olk+0.005
Neg logit contributions
ars-0.005
 studies-0.005
 likes-0.005
 clipboard-0.005
 seven-0.005
Token back
Token ID736
Feature activation-9.65e-01

Pos logit contributions
urg+0.176
 accidents+0.171
ruck+0.163
olk+0.157
umm+0.157
Neg logit contributions
ars-0.109
 studies-0.104
 likes-0.099
 clipboard-0.095
ossible-0.091
Token.
Token ID13
Feature activation+2.38e-01

Pos logit contributions
ars+0.048
 studies+0.047
 likes+0.045
 clipboard+0.045
 seven+0.043
Neg logit contributions
urg-0.060
 accidents-0.060
ruck-0.057
icles-0.056
olk-0.053
Token Nothing
Token ID10528
Feature activation+1.71e+00

Pos logit contributions
 accidents+0.017
urg+0.017
ruck+0.017
icles+0.016
olk+0.015
Neg logit contributions
 likes-0.009
 studies-0.009
 bar-0.009
 seven-0.009
 clipboard-0.009
Token happened
Token ID3022
Feature activation+9.69e+00

Pos logit contributions
urg+0.103
 accidents+0.099
ruck+0.096
icles+0.094
umm+0.092
Neg logit contributions
ars-0.091
 likes-0.088
 studies-0.087
 seven-0.084
 clipboard-0.084
Token,
Token ID11
Feature activation-3.33e-01

Pos logit contributions
 accidents+0.600
urg+0.598
ruck+0.575
icles+0.546
umm+0.517
Neg logit contributions
ars-0.488
 studies-0.465
 likes-0.460
 clipboard-0.448
 seven-0.441
Token so
Token ID523
Feature activation+2.92e-01

Pos logit contributions
ars+0.015
 likes+0.015
 studies+0.015
 clipboard+0.014
 seven+0.014
Neg logit contributions
urg-0.022
 accidents-0.021
ruck-0.021
icles-0.020
umm-0.020
Token he
Token ID339
Feature activation-1.29e-01

Pos logit contributions
urg+0.020
 accidents+0.019
ruck+0.019
icles+0.018
umm+0.018
Neg logit contributions
ars-0.013
 likes-0.013
 studies-0.013
 seven-0.012
 clipboard-0.012
Token touched
Token ID12615
Feature activation+3.42e-01

Pos logit contributions
ars+0.007
 studies+0.007
 clipboard+0.006
 likes+0.006
 mat+0.006
Neg logit contributions
urg-0.008
 accidents-0.007
ruck-0.007
icles-0.007
ull-0.007
Token it
Token ID340
Feature activation+9.46e-01

Pos logit contributions
 accidents+0.016
urg+0.016
ruck+0.015
icles+0.014
umm+0.014
Neg logit contributions
ars-0.023
 studies-0.022
 likes-0.022
 clipboard-0.021
 seven-0.021
Token Tim
Token ID5045
Feature activation+1.84e+00

Pos logit contributions
urg+0.015
 accidents+0.014
ruck+0.014
umm+0.013
icles+0.013
Neg logit contributions
 studies-0.027
 likes-0.027
ars-0.027
 clipboard-0.026
 seven-0.026
Tokenmy
Token ID1820
Feature activation+1.52e-01

Pos logit contributions
 accidents+0.093
urg+0.093
ruck+0.090
icles+0.082
eddy+0.076
Neg logit contributions
ars-0.116
 studies-0.110
 seven-0.108
 mat-0.107
 clipboard-0.107
Token told
Token ID1297
Feature activation+1.53e+00

Pos logit contributions
 accidents+0.009
urg+0.009
ruck+0.009
ull+0.008
icles+0.008
Neg logit contributions
ars-0.008
 mat-0.007
 studies-0.007
 seven-0.007
 likes-0.007
Token her
Token ID607
Feature activation-7.29e-01

Pos logit contributions
umm+0.092
urg+0.091
 accidents+0.085
olk+0.084
ruck+0.084
Neg logit contributions
 studies-0.073
 think-0.073
 clipboard-0.072
ars-0.072
 pedal-0.070
Token what
Token ID644
Feature activation+8.45e-01

Pos logit contributions
 likes+0.040
 studies+0.040
ars+0.039
 clipboard+0.039
 think+0.039
Neg logit contributions
urg-0.041
umm-0.041
ruck-0.037
icles-0.037
 accidents-0.037
Token happened
Token ID3022
Feature activation+9.69e+00

Pos logit contributions
urg+0.044
umm+0.040
icles+0.039
olk+0.038
ruck+0.036
Neg logit contributions
 likes-0.054
 studies-0.053
ars-0.052
 think-0.051
 bar-0.051
Token and
Token ID290
Feature activation+6.52e-01

Pos logit contributions
 accidents+0.645
urg+0.601
ruck+0.600
icles+0.530
ull+0.526
Neg logit contributions
ars-0.484
 seven-0.446
 clipboard-0.441
 studies-0.439
 mat-0.431
Token she
Token ID673
Feature activation-1.40e-01

Pos logit contributions
urg+0.046
umm+0.045
ruck+0.044
 accidents+0.044
icles+0.044
Neg logit contributions
 studies-0.026
 likes-0.026
ars-0.025
 think-0.025
 clipboard-0.025
Token said
Token ID531
Feature activation+7.11e-03

Pos logit contributions
ars+0.007
 studies+0.007
 mat+0.007
 clipboard+0.006
 likes+0.006
Neg logit contributions
urg-0.009
 accidents-0.009
ruck-0.008
icles-0.008
ull-0.008
Token,
Token ID11
Feature activation-3.14e-01

Pos logit contributions
urg+0.001
umm+0.000
ruck+0.000
icles+0.000
 accidents+0.000
Neg logit contributions
 likes-0.000
ars-0.000
 studies-0.000
 clipboard-0.000
 think-0.000
Token "
Token ID366
Feature activation+1.15e+00

Pos logit contributions
ars+0.015
 likes+0.014
 studies+0.014
 clipboard+0.014
 seven+0.014
Neg logit contributions
urg-0.020
 accidents-0.020
ruck-0.019
umm-0.019
icles-0.019
Token but
Token ID475
Feature activation+1.67e-01

Pos logit contributions
 likes+0.028
ars+0.028
 studies+0.026
 clipboard+0.026
 bar+0.026
Neg logit contributions
urg-0.073
ruck-0.069
umm-0.069
 accidents-0.069
icles-0.068
Token then
Token ID788
Feature activation-1.93e+00

Pos logit contributions
 accidents+0.011
urg+0.010
ruck+0.010
icles+0.010
olk+0.009
Neg logit contributions
ars-0.008
 studies-0.008
 likes-0.008
 clipboard-0.008
 seven-0.007
Token a
Token ID257
Feature activation-3.91e-02

Pos logit contributions
ars+0.100
 likes+0.093
 studies+0.093
 clipboard+0.089
 seven+0.089
Neg logit contributions
 accidents-0.122
urg-0.119
ruck-0.112
icles-0.108
olk-0.104
Token magical
Token ID10883
Feature activation+2.03e+00

Pos logit contributions
ars+0.002
 studies+0.002
 likes+0.002
 clipboard+0.002
 seven+0.002
Neg logit contributions
 accidents-0.003
urg-0.003
ruck-0.003
umm-0.002
icles-0.002
Token thing
Token ID1517
Feature activation+1.52e+00

Pos logit contributions
 accidents+0.122
urg+0.119
ruck+0.117
umm+0.110
ull+0.107
Neg logit contributions
ars-0.109
 studies-0.101
 likes-0.100
 clipboard-0.097
 seven-0.096
Token happened
Token ID3022
Feature activation+9.69e+00

Pos logit contributions
urg+0.061
 accidents+0.056
umm+0.054
ruck+0.053
icles+0.052
Neg logit contributions
 likes-0.110
ars-0.109
 studies-0.107
 sees-0.106
 bar-0.105
Token.
Token ID13
Feature activation+3.94e-01

Pos logit contributions
 accidents+0.598
urg+0.592
ruck+0.567
icles+0.535
ull+0.509
Neg logit contributions
ars-0.492
 studies-0.469
 likes-0.462
 seven-0.453
 clipboard-0.451
Token A
Token ID317
Feature activation+1.55e+00

Pos logit contributions
urg+0.029
 accidents+0.028
ruck+0.027
icles+0.026
umm+0.026
Neg logit contributions
ars-0.016
 likes-0.016
 studies-0.016
 clipboard-0.015
 seven-0.015
Token big
Token ID1263
Feature activation-7.38e-02

Pos logit contributions
urg+0.103
 accidents+0.101
ruck+0.097
icles+0.094
umm+0.093
Neg logit contributions
ars-0.072
 likes-0.069
 studies-0.069
 clipboard-0.066
 seven-0.066
Token,
Token ID11
Feature activation-2.44e-01

Pos logit contributions
ars+0.004
 studies+0.004
ighed+0.004
 clipboard+0.004
 likes+0.004
Neg logit contributions
 accidents-0.005
urg-0.004
ruck-0.004
olk-0.004
umm-0.004
Token pink
Token ID11398
Feature activation-6.29e-01

Pos logit contributions
ars+0.011
 studies+0.010
 likes+0.010
 clipboard+0.010
 seven+0.010
Neg logit contributions
 accidents-0.017
urg-0.016
ruck-0.016
icles-0.015
umm-0.015
Token room
Token ID2119
Feature activation-9.67e-01

Pos logit contributions
 accidents+0.013
ruck+0.013
urg+0.012
olk+0.012
umm+0.012
Neg logit contributions
ars-0.012
 likes-0.011
 studies-0.011
ighed-0.011
 clipboard-0.011
Token and
Token ID290
Feature activation+5.86e-01

Pos logit contributions
ars+0.049
 likes+0.045
 studies+0.045
 clipboard+0.044
 seven+0.044
Neg logit contributions
urg-0.060
 accidents-0.059
ruck-0.058
icles-0.054
umm-0.052
Token saw
Token ID2497
Feature activation-2.93e-01

Pos logit contributions
urg+0.040
 accidents+0.040
ruck+0.038
icles+0.037
umm+0.035
Neg logit contributions
ars-0.026
 studies-0.024
 likes-0.024
 clipboard-0.024
 seven-0.023
Token what
Token ID644
Feature activation+7.70e-01

Pos logit contributions
ars+0.016
 likes+0.016
 studies+0.016
 clipboard+0.015
 seven+0.015
Neg logit contributions
urg-0.017
 accidents-0.016
ruck-0.015
umm-0.015
icles-0.015
Token had
Token ID550
Feature activation+4.53e-01

Pos logit contributions
urg+0.050
ruck+0.045
umm+0.045
icles+0.044
 accidents+0.044
Neg logit contributions
 likes-0.038
 studies-0.037
ars-0.037
 bar-0.036
 sees-0.036
Token happened
Token ID3022
Feature activation+9.69e+00

Pos logit contributions
umm+0.024
urg+0.024
olk+0.022
icles+0.022
eddy+0.021
Neg logit contributions
 sees-0.026
 likes-0.025
 cannot-0.024
 seven-0.024
 studies-0.024
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
 accidents+0.615
urg+0.613
ruck+0.590
icles+0.558
umm+0.531
Neg logit contributions
ars-0.475
 studies-0.448
 likes-0.442
 clipboard-0.440
 seven-0.435
Token She
Token ID1375
Feature activation-5.61e-01

Pos logit contributions
urg+0.018
 accidents+0.017
ruck+0.016
icles+0.016
umm+0.016
Neg logit contributions
ars-0.015
 likes-0.015
 studies-0.015
 clipboard-0.014
 seven-0.014
Token told
Token ID1297
Feature activation+1.27e+00

Pos logit contributions
ars+0.032
 mat+0.029
 studies+0.028
 clipboard+0.028
 likes+0.028
Neg logit contributions
urg-0.032
 accidents-0.032
ruck-0.029
ull-0.027
 Baby-0.027
Token Lily
Token ID20037
Feature activation+7.37e-01

Pos logit contributions
urg+0.080
umm+0.077
ruck+0.076
 accidents+0.075
icles+0.072
Neg logit contributions
 studies-0.058
 clipboard-0.057
 think-0.056
 bar-0.056
ars-0.055
Token's
Token ID338
Feature activation+2.25e+00

Pos logit contributions
urg+0.045
 accidents+0.043
ruck+0.042
umm+0.041
icles+0.041
Neg logit contributions
ars-0.038
 likes-0.037
 studies-0.037
 clipboard-0.036
 seven-0.035
Token Jack
Token ID3619
Feature activation+1.23e+00

Pos logit contributions
urg+0.042
 accidents+0.040
ruck+0.039
icles+0.037
umm+0.035
Neg logit contributions
ars-0.049
 studies-0.049
 likes-0.049
 seven-0.047
 clipboard-0.046
Token asked
Token ID1965
Feature activation+1.33e+00

Pos logit contributions
urg+0.064
 accidents+0.063
ruck+0.059
ull+0.054
icles+0.053
Neg logit contributions
ars-0.073
 clipboard-0.069
 studies-0.069
 mat-0.069
 seven-0.068
Token his
Token ID465
Feature activation-3.57e-01

Pos logit contributions
urg+0.077
 accidents+0.076
ruck+0.073
icles+0.070
umm+0.067
Neg logit contributions
ars-0.075
 likes-0.071
 studies-0.071
 clipboard-0.068
 seven-0.068
Token mom
Token ID1995
Feature activation-7.46e-01

Pos logit contributions
ars+0.026
tail+0.023
ighed+0.023
oir+0.022
 likes+0.022
Neg logit contributions
 accidents-0.016
urg-0.015
ruck-0.013
ull-0.012
eddy-0.012
Token what
Token ID644
Feature activation+8.38e-01

Pos logit contributions
ars+0.048
 studies+0.044
 likes+0.044
 clipboard+0.044
 seven+0.043
Neg logit contributions
 accidents-0.038
urg-0.037
ruck-0.035
icles-0.033
umm-0.031
Token happened
Token ID3022
Feature activation+9.69e+00

Pos logit contributions
urg+0.051
 accidents+0.047
ruck+0.046
icles+0.046
umm+0.045
Neg logit contributions
 likes-0.045
ars-0.045
 studies-0.044
 clipboard-0.042
 seven-0.042
Token.
Token ID13
Feature activation+2.68e-01

Pos logit contributions
 accidents+0.613
urg+0.592
ruck+0.575
icles+0.532
ull+0.513
Neg logit contributions
ars-0.495
 studies-0.457
 clipboard-0.451
 seven-0.446
 likes-0.445
Token She
Token ID1375
Feature activation-3.81e-01

Pos logit contributions
urg+0.018
 accidents+0.017
ruck+0.017
icles+0.016
umm+0.016
Neg logit contributions
ars-0.013
 likes-0.012
 studies-0.012
 clipboard-0.012
 seven-0.012
Token laughed
Token ID13818
Feature activation-6.62e-01

Pos logit contributions
ars+0.020
 studies+0.018
 mat+0.018
 clipboard+0.018
 likes+0.017
Neg logit contributions
urg-0.024
 accidents-0.023
ruck-0.022
icles-0.021
ull-0.021
Token and
Token ID290
Feature activation+5.97e-01

Pos logit contributions
ars+0.025
 likes+0.023
 studies+0.023
 clipboard+0.022
 seven+0.022
Neg logit contributions
urg-0.050
 accidents-0.049
ruck-0.048
icles-0.047
umm-0.046
Token said
Token ID531
Feature activation-5.79e-03

Pos logit contributions
urg+0.047
 accidents+0.046
ruck+0.045
icles+0.043
umm+0.043
Neg logit contributions
ars-0.021
 studies-0.019
 likes-0.019
 clipboard-0.019
 seven-0.019
Token over
Token ID625
Feature activation-1.15e+00

Pos logit contributions
urg+0.277
umm+0.275
 accidents+0.271
icles+0.263
olk+0.259
Neg logit contributions
ars-0.155
 studies-0.154
 seven-0.152
 think-0.152
 clipboard-0.151
Token and
Token ID290
Feature activation+6.66e-01

Pos logit contributions
 studies+0.032
 likes+0.032
ars+0.032
 think+0.030
 sees+0.030
Neg logit contributions
urg-0.096
icles-0.091
 accidents-0.090
umm-0.090
ruck-0.089
Token saw
Token ID2497
Feature activation-2.23e-01

Pos logit contributions
urg+0.058
 accidents+0.056
ruck+0.055
umm+0.055
icles+0.054
Neg logit contributions
ars-0.017
 likes-0.017
 studies-0.016
 clipboard-0.016
 seven-0.015
Token what
Token ID644
Feature activation+8.00e-01

Pos logit contributions
 sees+0.009
 think+0.009
 likes+0.008
 bar+0.008
 thinks+0.008
Neg logit contributions
umm-0.017
urg-0.017
burn-0.014
olk-0.014
keeping-0.014
Token had
Token ID550
Feature activation+3.26e-01

Pos logit contributions
urg+0.042
umm+0.037
icles+0.033
olk+0.031
rily+0.031
Neg logit contributions
 likes-0.057
 sees-0.056
 cannot-0.053
ars-0.052
 has-0.052
Token happened
Token ID3022
Feature activation+9.69e+00

Pos logit contributions
umm+0.018
urg+0.016
icles+0.016
olk+0.015
eddy+0.014
Neg logit contributions
 sees-0.020
 cannot-0.020
 likes-0.019
ars-0.019
 stick-0.018
Token.
Token ID13
Feature activation+3.71e-01

Pos logit contributions
urg+0.671
 accidents+0.664
ruck+0.640
icles+0.617
umm+0.602
Neg logit contributions
ars-0.420
 studies-0.398
 likes-0.397
 clipboard-0.386
 seven-0.385
Token She
Token ID1375
Feature activation-3.12e-01

Pos logit contributions
urg+0.028
 accidents+0.026
ruck+0.026
icles+0.025
umm+0.025
Neg logit contributions
 likes-0.014
ars-0.014
 studies-0.014
 clipboard-0.013
 seven-0.013
Token was
Token ID373
Feature activation+1.55e+00

Pos logit contributions
ars+0.010
 likes+0.010
 studies+0.010
 seven+0.009
 clipboard+0.009
Neg logit contributions
urg-0.025
 accidents-0.024
ruck-0.023
umm-0.023
icles-0.023
Token sad
Token ID6507
Feature activation+2.15e-01

Pos logit contributions
urg+0.125
 accidents+0.121
ruck+0.120
umm+0.119
icles+0.118
Neg logit contributions
ars-0.049
 likes-0.047
 studies-0.047
 clipboard-0.046
 seven-0.044
Token because
Token ID780
Feature activation+5.96e-01

Pos logit contributions
urg+0.016
icles+0.015
ruck+0.015
 accidents+0.015
umm+0.015
Neg logit contributions
 likes-0.008
 studies-0.008
ars-0.008
 think-0.008
 clipboard-0.008
Token\n
Token ID198
Feature activation+2.99e-01

Pos logit contributions
urg+0.016
ruck+0.016
 accidents+0.016
ull+0.015
icles+0.015
Neg logit contributions
 studies-0.016
 likes-0.016
ars-0.015
 clipboard-0.015
 seven-0.015
TokenBut
Token ID1537
Feature activation-3.56e+00

Pos logit contributions
ruck+0.016
 accidents+0.016
urg+0.015
icles+0.013
ull+0.013
Neg logit contributions
 pocket-0.016
 mat-0.016
 studies-0.016
ars-0.016
 likes-0.016
Token then
Token ID788
Feature activation-1.94e+00

Pos logit contributions
ars+0.149
 likes+0.146
 studies+0.145
 seven+0.142
 clipboard+0.140
Neg logit contributions
urg-0.254
 accidents-0.243
ruck-0.237
icles-0.233
umm-0.231
Token something
Token ID1223
Feature activation+3.25e+00

Pos logit contributions
ars+0.066
 likes+0.064
 studies+0.063
 seven+0.061
 clipboard+0.060
Neg logit contributions
urg-0.153
 accidents-0.148
ruck-0.145
icles-0.142
umm-0.141
Token amazing
Token ID4998
Feature activation+1.25e+00

Pos logit contributions
urg+0.198
 accidents+0.190
ruck+0.184
icles+0.180
umm+0.177
Neg logit contributions
ars-0.171
 likes-0.163
 studies-0.162
 seven-0.158
 clipboard-0.158
Token happened
Token ID3022
Feature activation+9.68e+00

Pos logit contributions
urg+0.031
 accidents+0.027
umm+0.025
ruck+0.025
icles+0.024
Neg logit contributions
ars-0.111
 likes-0.109
 studies-0.106
 seven-0.106
 sees-0.104
Token.
Token ID13
Feature activation+2.07e-01

Pos logit contributions
 accidents+0.580
urg+0.561
ruck+0.545
icles+0.512
ull+0.486
Neg logit contributions
ars-0.515
 studies-0.498
 likes-0.484
 clipboard-0.481
 seven-0.469
Token Sir
Token ID7361
Feature activation+1.23e+00

Pos logit contributions
urg+0.013
 accidents+0.013
ruck+0.012
icles+0.012
umm+0.012
Neg logit contributions
 studies-0.010
ars-0.010
 likes-0.010
 seven-0.010
 clipboard-0.010
Token Robert
Token ID5199
Feature activation+1.82e+00

Pos logit contributions
urg+0.070
 accidents+0.065
ruck+0.062
icles+0.062
olk+0.061
Neg logit contributions
 likes-0.071
ars-0.070
 studies-0.069
 seven-0.069
 sees-0.066
Token was
Token ID373
Feature activation+1.59e+00

Pos logit contributions
urg+0.123
 accidents+0.119
ruck+0.116
icles+0.111
umm+0.108
Neg logit contributions
ars-0.082
 studies-0.079
 likes-0.078
 clipboard-0.076
 seven-0.075
Token able
Token ID1498
Feature activation+4.13e+00

Pos logit contributions
urg+0.103
 accidents+0.103
ruck+0.099
icles+0.093
ull+0.089
Neg logit contributions
ars-0.076
 likes-0.072
 studies-0.072
 clipboard-0.068
 seven-0.067
Token her
Token ID607
Feature activation-7.03e-01

Pos logit contributions
urg+0.193
 accidents+0.187
ruck+0.185
icles+0.176
umm+0.175
Neg logit contributions
ars-0.168
 likes-0.167
 studies-0.166
 clipboard-0.161
 seven-0.160
Token meal
Token ID9799
Feature activation+6.92e+00

Pos logit contributions
ars+0.055
 studies+0.053
 likes+0.053
 clipboard+0.052
 seven+0.052
Neg logit contributions
urg-0.025
 accidents-0.024
ruck-0.022
icles-0.021
umm-0.020
Token,
Token ID11
Feature activation-3.84e-01

Pos logit contributions
urg+0.558
 accidents+0.543
ruck+0.533
icles+0.517
umm+0.507
Neg logit contributions
ars-0.224
 likes-0.211
 studies-0.210
 clipboard-0.199
 seven-0.197
Token something
Token ID1223
Feature activation+3.25e+00

Pos logit contributions
ars+0.017
 likes+0.017
 studies+0.017
 clipboard+0.016
 seven+0.016
Neg logit contributions
urg-0.026
 accidents-0.025
ruck-0.025
icles-0.024
umm-0.024
Token strange
Token ID6283
Feature activation+2.79e+00

Pos logit contributions
urg+0.215
ruck+0.200
icles+0.197
umm+0.195
 accidents+0.187
Neg logit contributions
 likes-0.160
ars-0.156
 bar-0.151
 studies-0.150
 sees-0.148
Token happened
Token ID3022
Feature activation+9.68e+00

Pos logit contributions
urg+0.091
umm+0.082
 accidents+0.077
ruck+0.077
olk+0.071
Neg logit contributions
 likes-0.225
ars-0.216
 sees-0.215
 studies-0.215
 bar-0.214
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
urg+0.664
 accidents+0.654
ruck+0.630
icles+0.610
umm+0.596
Neg logit contributions
ars-0.427
 studies-0.408
 likes-0.407
 clipboard-0.394
 seven-0.389
Token Lily
Token ID20037
Feature activation+1.01e+00

Pos logit contributions
urg+0.026
ruck+0.025
 accidents+0.024
icles+0.023
umm+0.023
Neg logit contributions
 likes-0.010
 studies-0.010
ars-0.010
 clipboard-0.010
 bar-0.009
Token started
Token ID2067
Feature activation+1.72e+00

Pos logit contributions
urg+0.060
 accidents+0.059
ruck+0.055
icles+0.052
olk+0.051
Neg logit contributions
ars-0.054
 studies-0.050
 clipboard-0.050
 seven-0.050
 likes-0.049
Token to
Token ID284
Feature activation+4.39e-01

Pos logit contributions
urg+0.104
ruck+0.100
 accidents+0.099
icles+0.098
umm+0.098
Neg logit contributions
 studies-0.088
 likes-0.086
 clipboard-0.085
 bar-0.085
ars-0.083
Token shrink
Token ID22085
Feature activation+2.29e+00

Pos logit contributions
urg+0.031
 accidents+0.031
ruck+0.029
icles+0.028
olk+0.028
Neg logit contributions
ars-0.019
 seven-0.017
 likes-0.017
 studies-0.017
 clipboard-0.016
Token very
Token ID845
Feature activation-3.94e+00

Pos logit contributions
urg+0.111
 accidents+0.110
ruck+0.105
icles+0.101
umm+0.099
Neg logit contributions
ars-0.070
 likes-0.066
 studies-0.066
 clipboard-0.064
 seven-0.063
Token sad
Token ID6507
Feature activation+1.94e-01

Pos logit contributions
 mat+0.185
ars+0.183
 studies+0.179
 seven+0.175
 clipboard+0.174
Neg logit contributions
 accidents-0.263
urg-0.244
ruck-0.228
ull-0.224
 returning-0.213
Token about
Token ID546
Feature activation+4.02e+00

Pos logit contributions
urg+0.013
 accidents+0.013
ruck+0.013
icles+0.012
umm+0.012
Neg logit contributions
ars-0.008
 studies-0.008
 likes-0.008
 clipboard-0.008
 seven-0.008
Token what
Token ID644
Feature activation+8.55e-01

Pos logit contributions
urg+0.231
 accidents+0.217
ruck+0.215
icles+0.212
umm+0.208
Neg logit contributions
 studies-0.219
 likes-0.216
ars-0.215
 clipboard-0.208
 think-0.206
Token had
Token ID550
Feature activation+3.43e-01

Pos logit contributions
urg+0.052
 accidents+0.051
ruck+0.049
icles+0.048
umm+0.047
Neg logit contributions
ars-0.044
 likes-0.043
 studies-0.043
 clipboard-0.041
 seven-0.041
Token happened
Token ID3022
Feature activation+9.68e+00

Pos logit contributions
urg+0.013
umm+0.012
icles+0.012
olk+0.011
ruck+0.010
Neg logit contributions
 likes-0.026
 think-0.026
 studies-0.026
 sees-0.026
ars-0.025
Token and
Token ID290
Feature activation+6.39e-01

Pos logit contributions
 accidents+0.590
urg+0.562
ruck+0.561
icles+0.503
 returning+0.481
Neg logit contributions
ars-0.506
 clipboard-0.474
 studies-0.463
 likes-0.463
 seven-0.462
Token never
Token ID1239
Feature activation-2.19e+00

Pos logit contributions
 accidents+0.039
urg+0.039
ruck+0.036
icles+0.035
umm+0.034
Neg logit contributions
ars-0.034
 likes-0.031
 studies-0.031
 clipboard-0.031
 seven-0.031
Token wrote
Token ID2630
Feature activation+1.41e+00

Pos logit contributions
 studies+0.083
 likes+0.083
ars+0.081
 sees+0.081
 think+0.081
Neg logit contributions
urg-0.163
umm-0.158
icles-0.158
olk-0.154
 accidents-0.154
Token about
Token ID546
Feature activation+4.05e+00

Pos logit contributions
urg+0.073
 accidents+0.070
ruck+0.067
icles+0.065
umm+0.064
Neg logit contributions
ars-0.086
 likes-0.083
 studies-0.083
 clipboard-0.081
 seven-0.081
Token sailing
Token ID28499
Feature activation+4.67e-01

Pos logit contributions
urg+0.253
ruck+0.234
 accidents+0.233
umm+0.232
icles+0.229
Neg logit contributions
 studies-0.201
 likes-0.197
ars-0.191
 clipboard-0.189
 think-0.189
Token one
Token ID530
Feature activation-1.35e+00

Pos logit contributions
ars+0.131
 clipboard+0.118
 studies+0.117
 likes+0.112
 mat+0.111
Neg logit contributions
 accidents-0.132
urg-0.121
ruck-0.120
icles-0.119
ter-0.107
Token day
Token ID1110
Feature activation-4.46e-01

Pos logit contributions
ars+0.075
 studies+0.071
 clipboard+0.071
 likes+0.068
 seven+0.067
Neg logit contributions
 accidents-0.078
urg-0.077
ruck-0.075
icles-0.070
umm-0.070
Token,
Token ID11
Feature activation-3.31e-01

Pos logit contributions
 likes+0.009
ars+0.008
 sees+0.007
 studies+0.007
 seven+0.007
Neg logit contributions
urg-0.043
umm-0.042
ruck-0.040
icles-0.040
 accidents-0.039
Token something
Token ID1223
Feature activation+3.32e+00

Pos logit contributions
 likes+0.014
ars+0.014
 studies+0.013
 seven+0.013
 clipboard+0.013
Neg logit contributions
urg-0.024
ruck-0.022
umm-0.022
 accidents-0.022
icles-0.021
Token special
Token ID2041
Feature activation+7.40e-01

Pos logit contributions
urg+0.221
ruck+0.206
umm+0.205
 accidents+0.205
icles+0.203
Neg logit contributions
ars-0.158
 likes-0.155
 studies-0.148
 seven-0.145
 sees-0.143
Token happened
Token ID3022
Feature activation+9.68e+00

Pos logit contributions
urg+0.025
 accidents+0.020
umm+0.020
ruck+0.019
olk+0.019
Neg logit contributions
 likes-0.061
ars-0.059
 studies-0.058
 sees-0.057
 bar-0.057
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
urg+0.640
 accidents+0.636
ruck+0.610
icles+0.587
umm+0.563
Neg logit contributions
ars-0.449
 studies-0.428
 likes-0.423
 clipboard-0.416
 seven-0.410
Token\n
Token ID198
Feature activation+2.55e-01

Pos logit contributions
urg+0.020
 accidents+0.019
ruck+0.019
umm+0.018
icles+0.017
Neg logit contributions
ars-0.009
 likes-0.009
 studies-0.009
 clipboard-0.008
 seven-0.008
Token\n
Token ID198
Feature activation+2.54e-01

Pos logit contributions
urg+0.017
 accidents+0.017
ruck+0.016
icles+0.016
umm+0.016
Neg logit contributions
ars-0.011
 likes-0.011
 studies-0.011
 clipboard-0.010
 seven-0.010
TokenThe
Token ID464
Feature activation-2.95e-01

Pos logit contributions
urg+0.017
 accidents+0.016
ruck+0.016
icles+0.015
umm+0.015
Neg logit contributions
ars-0.012
 studies-0.011
 likes-0.011
 clipboard-0.011
 seven-0.011
Token old
Token ID1468
Feature activation-1.20e+00

Pos logit contributions
ars+0.011
 studies+0.011
 likes+0.010
 clipboard+0.010
 seven+0.010
Neg logit contributions
urg-0.022
 accidents-0.022
ruck-0.021
icles-0.020
umm-0.020
Token and
Token ID290
Feature activation+2.47e-01

Pos logit contributions
ars+0.007
 studies+0.007
 likes+0.006
 clipboard+0.006
 seven+0.006
Neg logit contributions
urg-0.012
 accidents-0.012
ruck-0.011
icles-0.011
umm-0.011
Token sharp
Token ID7786
Feature activation-1.30e+00

Pos logit contributions
 accidents+0.016
urg+0.014
ruck+0.013
olk+0.013
icles+0.013
Neg logit contributions
ars-0.012
 studies-0.012
 clipboard-0.011
ossible-0.010
ighed-0.010
Token teeth
Token ID9941
Feature activation+2.75e-01

Pos logit contributions
ars+0.072
 studies+0.071
 likes+0.069
 clipboard+0.068
 seven+0.067
Neg logit contributions
urg-0.074
 accidents-0.073
ruck-0.070
icles-0.067
umm-0.064
Token.
Token ID13
Feature activation-3.76e-02

Pos logit contributions
urg+0.020
 accidents+0.019
ruck+0.019
icles+0.018
umm+0.018
Neg logit contributions
ars-0.011
 likes-0.011
 studies-0.011
 clipboard-0.010
 seven-0.010
Token It
Token ID632
Feature activation+6.77e-01

Pos logit contributions
ars+0.002
 likes+0.002
 studies+0.002
 seven+0.002
 clipboard+0.002
Neg logit contributions
urg-0.002
 accidents-0.002
ruck-0.002
umm-0.002
icles-0.002
Token bar
Token ID2318
Feature activation-6.79e+00

Pos logit contributions
 accidents+0.039
 happened+0.036
urg+0.035
 lava+0.033
 Twe+0.033
Neg logit contributions
ars-0.035
 clipboard-0.029
 locker-0.029
 wrist-0.029
robat-0.028
Tokenks
Token ID591
Feature activation-3.85e+00

Pos logit contributions
 clipboard+0.246
ars+0.244
 studies+0.244
 hips+0.243
 pocket+0.240
Neg logit contributions
urg-0.466
 accidents-0.465
ruck-0.448
ull-0.439
 happened-0.435
Token loud
Token ID7812
Feature activation-1.59e+00

Pos logit contributions
ars+0.158
 studies+0.153
 likes+0.149
 clipboard+0.146
 seven+0.142
Neg logit contributions
urg-0.275
 accidents-0.270
ruck-0.261
icles-0.252
umm-0.245
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
ars+0.064
 studies+0.061
 likes+0.061
 clipboard+0.059
 seven+0.058
Neg logit contributions
urg-0.115
 accidents-0.113
ruck-0.109
icles-0.106
umm-0.104
Token\n
Token ID198
Feature activation-1.18e-01

Pos logit contributions
ars+0.007
 likes+0.007
 studies+0.007
 seven+0.006
 clipboard+0.006
Neg logit contributions
urg-0.010
 accidents-0.010
ruck-0.009
icles-0.009
umm-0.009
Token\n
Token ID198
Feature activation-1.25e-01

Pos logit contributions
ars+0.006
 studies+0.006
 likes+0.006
 clipboard+0.005
 seven+0.005
Neg logit contributions
urg-0.007
 accidents-0.007
ruck-0.007
icles-0.007
ull-0.006
Token and
Token ID290
Feature activation+3.49e-01

Pos logit contributions
ars+0.008
 studies+0.007
 likes+0.007
 clipboard+0.007
 seven+0.007
Neg logit contributions
urg-0.012
 accidents-0.012
ruck-0.011
icles-0.011
olk-0.011
Token sharp
Token ID7786
Feature activation-1.32e+00

Pos logit contributions
 accidents+0.021
urg+0.019
 happened+0.018
ruck+0.018
icles+0.017
Neg logit contributions
ars-0.019
 studies-0.018
 clipboard-0.016
ighed-0.016
 poster-0.016
Token teeth
Token ID9941
Feature activation+2.18e-01

Pos logit contributions
 studies+0.077
ars+0.077
 likes+0.073
 clipboard+0.073
 seven+0.072
Neg logit contributions
urg-0.070
 accidents-0.068
ruck-0.065
icles-0.063
olk-0.060
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
urg+0.015
 accidents+0.015
ruck+0.015
icles+0.014
umm+0.014
Neg logit contributions
ars-0.009
 studies-0.009
 likes-0.009
 clipboard-0.008
 seven-0.008
Token It
Token ID632
Feature activation+6.93e-01

Pos logit contributions
urg+0.007
 accidents+0.007
ruck+0.007
icles+0.007
umm+0.006
Neg logit contributions
ars-0.007
 studies-0.007
 likes-0.006
 clipboard-0.006
 seven-0.006
Token bar
Token ID2318
Feature activation-6.78e+00

Pos logit contributions
 accidents+0.039
 happened+0.036
urg+0.035
 Twe+0.033
 lava+0.033
Neg logit contributions
ars-0.037
 clipboard-0.032
 locker-0.031
 wrist-0.031
aye-0.030
Tokenks
Token ID591
Feature activation-3.85e+00

Pos logit contributions
 clipboard+0.248
ars+0.248
 studies+0.246
 pocket+0.246
 hips+0.246
Neg logit contributions
urg-0.466
 accidents-0.458
ruck-0.447
ull-0.436
 happened-0.429
Token loud
Token ID7812
Feature activation-1.58e+00

Pos logit contributions
ars+0.162
 studies+0.157
 likes+0.152
 clipboard+0.149
 seven+0.144
Neg logit contributions
urg-0.272
 accidents-0.268
ruck-0.258
icles-0.249
umm-0.241
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
ars+0.063
 studies+0.060
 likes+0.059
 clipboard+0.058
 seven+0.057
Neg logit contributions
urg-0.115
 accidents-0.113
ruck-0.109
icles-0.106
umm-0.103
Token\n
Token ID198
Feature activation-3.55e-02

Pos logit contributions
ars+0.001
 likes+0.001
 studies+0.001
 clipboard+0.001
 seven+0.001
Neg logit contributions
urg-0.001
 accidents-0.001
ruck-0.001
icles-0.001
umm-0.001
Token\n
Token ID198
Feature activation-4.31e-02

Pos logit contributions
ars+0.002
 studies+0.002
 likes+0.002
 clipboard+0.002
 seven+0.002
Neg logit contributions
urg-0.002
 accidents-0.002
ruck-0.002
icles-0.002
umm-0.002
Token not
Token ID407
Feature activation-2.19e+00

Pos logit contributions
ars+0.225
 studies+0.217
ebra+0.210
 pocket+0.208
 likes+0.206
Neg logit contributions
 accidents-0.162
urg-0.153
ruck-0.133
asting-0.131
icles-0.126
Token like
Token ID588
Feature activation-2.70e+00

Pos logit contributions
 studies+0.092
ars+0.089
 clipboard+0.084
 seven+0.084
 likes+0.083
Neg logit contributions
urg-0.152
ruck-0.152
 accidents-0.151
icles-0.139
ull-0.136
Token water
Token ID1660
Feature activation+1.31e+00

Pos logit contributions
 likes+0.127
 bar+0.123
 studies+0.123
ars+0.121
 mat+0.120
Neg logit contributions
urg-0.175
umm-0.166
ruck-0.162
icles-0.157
 accidents-0.156
Token.
Token ID13
Feature activation-9.81e-02

Pos logit contributions
urg+0.097
ruck+0.092
 accidents+0.092
umm+0.090
icles+0.088
Neg logit contributions
 likes-0.051
ars-0.051
 studies-0.047
 bar-0.047
 sees-0.046
Token He
Token ID679
Feature activation-1.40e+00

Pos logit contributions
ars+0.005
 studies+0.005
 likes+0.005
 clipboard+0.005
 seven+0.005
Neg logit contributions
urg-0.006
 accidents-0.006
icles-0.005
ruck-0.005
umm-0.005
Token bar
Token ID2318
Feature activation-6.74e+00

Pos logit contributions
ars+0.059
 clipboard+0.058
 wrist+0.057
 backpack+0.056
 locker+0.056
Neg logit contributions
 accidents-0.084
urg-0.084
 happened-0.081
 Baby-0.071
 spilling-0.070
Tokenks
Token ID591
Feature activation-3.76e+00

Pos logit contributions
tail+0.278
 hips+0.275
 pedal+0.270
 belt+0.268
 pocket+0.266
Neg logit contributions
 accidents-0.430
urg-0.421
ull-0.401
 happened-0.399
ruck-0.396
Token and
Token ID290
Feature activation+8.80e-02

Pos logit contributions
 studies+0.185
ars+0.180
 pocket+0.173
 clipboard+0.172
 mat+0.162
Neg logit contributions
 accidents-0.242
urg-0.234
ruck-0.223
icles-0.208
ull-0.205
Token runs
Token ID4539
Feature activation-2.21e+00

Pos logit contributions
 accidents+0.006
urg+0.005
 spilling+0.005
 Baby+0.005
 Twe+0.004
Neg logit contributions
ars-0.004
 studies-0.004
belt-0.004
ooter-0.004
ebra-0.004
Token away
Token ID1497
Feature activation-2.01e+00

Pos logit contributions
 studies+0.119
ars+0.118
 clipboard+0.113
 likes+0.112
 pocket+0.109
Neg logit contributions
 accidents-0.128
urg-0.120
ruck-0.118
icles-0.112
umm-0.104
Token.
Token ID13
Feature activation-9.99e-02

Pos logit contributions
ars+0.109
 studies+0.104
 likes+0.101
 clipboard+0.101
 seven+0.098
Neg logit contributions
 accidents-0.120
urg-0.118
ruck-0.113
icles-0.106
umm-0.101
Token and
Token ID290
Feature activation+6.67e-02

Pos logit contributions
ars+0.008
 studies+0.008
 likes+0.008
 clipboard+0.008
 seven+0.007
Neg logit contributions
urg-0.016
 accidents-0.016
ruck-0.016
icles-0.015
umm-0.015
Token sharp
Token ID7786
Feature activation-1.28e+00

Pos logit contributions
 accidents+0.004
urg+0.004
olk+0.004
ruck+0.004
icles+0.004
Neg logit contributions
ars-0.003
 studies-0.003
 clipboard-0.003
ighed-0.003
ossible-0.003
Token teeth
Token ID9941
Feature activation+3.00e-01

Pos logit contributions
ars+0.067
 studies+0.065
 likes+0.064
 clipboard+0.063
 seven+0.062
Neg logit contributions
urg-0.077
 accidents-0.075
ruck-0.072
icles-0.070
umm-0.068
Token.
Token ID13
Feature activation-2.14e-01

Pos logit contributions
urg+0.023
 accidents+0.022
ruck+0.021
umm+0.021
icles+0.021
Neg logit contributions
ars-0.011
 likes-0.011
 studies-0.010
 clipboard-0.010
 seven-0.010
Token It
Token ID632
Feature activation+6.51e-01

Pos logit contributions
ars+0.011
 likes+0.011
 studies+0.010
 seven+0.010
 clipboard+0.010
Neg logit contributions
urg-0.013
 accidents-0.013
ruck-0.013
icles-0.012
umm-0.012
Token bar
Token ID2318
Feature activation-6.73e+00

Pos logit contributions
 accidents+0.037
 happened+0.035
urg+0.033
 lava+0.033
 flames+0.031
Neg logit contributions
ars-0.033
 clipboard-0.029
robat-0.028
belt-0.028
 wrist-0.028
Tokenks
Token ID591
Feature activation-3.83e+00

Pos logit contributions
ars+0.237
 clipboard+0.236
 studies+0.235
 hips+0.229
 mat+0.228
Neg logit contributions
urg-0.470
 accidents-0.470
ruck-0.453
ull-0.443
 happened-0.428
Token loud
Token ID7812
Feature activation-1.56e+00

Pos logit contributions
ars+0.149
 likes+0.143
 studies+0.140
 seven+0.136
 clipboard+0.136
Neg logit contributions
urg-0.284
 accidents-0.278
ruck-0.269
icles-0.263
umm-0.260
Token.
Token ID13
Feature activation-2.82e-01

Pos logit contributions
ars+0.059
 likes+0.057
 studies+0.056
 seven+0.054
 clipboard+0.054
Neg logit contributions
urg-0.116
 accidents-0.113
ruck-0.110
icles-0.107
umm-0.106
Token Lily
Token ID20037
Feature activation+2.92e-01

Pos logit contributions
ars+0.014
 likes+0.014
 studies+0.014
 seven+0.013
 clipboard+0.013
Neg logit contributions
urg-0.018
 accidents-0.017
ruck-0.017
icles-0.016
umm-0.016
Token and
Token ID290
Feature activation+9.43e-03

Pos logit contributions
 accidents+0.014
urg+0.013
 happened+0.012
ruck+0.012
aste+0.011
Neg logit contributions
ars-0.017
 clipboard-0.016
ossible-0.015
 seven-0.015
 headset-0.015
Token said
Token ID531
Feature activation-3.05e-01

Pos logit contributions
ars+0.024
 clipboard+0.022
 mat+0.022
 studies+0.021
 seven+0.020
Neg logit contributions
 accidents-0.030
ruck-0.030
urg-0.029
 returning-0.026
 Mill-0.026
Token I
Token ID314
Feature activation-1.49e+00

Pos logit contributions
ars+0.016
 studies+0.015
 likes+0.015
 clipboard+0.015
 seven+0.014
Neg logit contributions
urg-0.018
 accidents-0.018
ruck-0.017
icles-0.017
umm-0.016
Token'm
Token ID1101
Feature activation+5.53e-01

Pos logit contributions
ars+0.078
 mat+0.074
 clipboard+0.074
 seven+0.073
 studies+0.071
Neg logit contributions
ruck-0.083
urg-0.080
 accidents-0.079
 happened-0.077
ull-0.073
Token the
Token ID262
Feature activation-6.31e-01

Pos logit contributions
urg+0.031
 accidents+0.031
ruck+0.029
icles+0.028
umm+0.027
Neg logit contributions
ars-0.032
 likes-0.030
 studies-0.030
 clipboard-0.029
 seven-0.028
Token fastest
Token ID14162
Feature activation-2.27e+00

Pos logit contributions
ars+0.036
ighed+0.031
ossible+0.031
 studies+0.031
 likes+0.030
Neg logit contributions
 accidents-0.036
umm-0.035
ruck-0.035
icles-0.032
urg-0.032
Token bar
Token ID2318
Feature activation-6.73e+00

Pos logit contributions
ars+0.118
 studies+0.111
 clipboard+0.107
 likes+0.106
ossible+0.103
Neg logit contributions
ruck-0.135
 accidents-0.134
urg-0.128
icles-0.123
umm-0.121
Tokenber
Token ID527
Feature activation-3.62e+00

Pos logit contributions
ars+0.289
 studies+0.272
ossible+0.267
 clipboard+0.266
 hips+0.262
Neg logit contributions
ruck-0.434
urg-0.413
 accidents-0.409
umm-0.405
icles-0.404
Token in
Token ID287
Feature activation-2.21e+00

Pos logit contributions
ars+0.212
 clipboard+0.202
 studies+0.201
 mat+0.193
 likes+0.192
Neg logit contributions
ruck-0.179
 accidents-0.178
urg-0.174
icles-0.165
umm-0.155
Token town
Token ID3240
Feature activation-1.94e+00

Pos logit contributions
ars+0.125
 likes+0.119
 studies+0.117
 clipboard+0.117
 seven+0.112
Neg logit contributions
 accidents-0.123
urg-0.123
ruck-0.118
icles-0.112
umm-0.109
Token!"
Token ID2474
Feature activation+4.34e-01

Pos logit contributions
ars+0.114
 clipboard+0.109
 studies+0.106
 mat+0.103
 likes+0.103
Neg logit contributions
ruck-0.099
urg-0.098
 accidents-0.098
icles-0.090
umm-0.085
Token Everyone
Token ID11075
Feature activation+7.68e-01

Pos logit contributions
 accidents+0.026
urg+0.026
icles+0.026
ruck+0.026
 Baby+0.024
Neg logit contributions
ars-0.021
 studies-0.020
 likes-0.020
 clipboard-0.019
 seven-0.019
Token sharp
Token ID7786
Feature activation-1.33e+00

Pos logit contributions
ars+0.091
 likes+0.089
 seven+0.087
 studies+0.085
 clipboard+0.084
Neg logit contributions
urg-0.154
 accidents-0.148
ruck-0.144
icles-0.141
umm-0.140
Token teeth
Token ID9941
Feature activation+3.11e-01

Pos logit contributions
ars+0.078
 studies+0.077
 clipboard+0.073
 likes+0.073
 seven+0.072
Neg logit contributions
urg-0.071
 accidents-0.070
ruck-0.067
icles-0.064
olk-0.061
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
urg+0.021
 accidents+0.020
ruck+0.020
icles+0.019
olk+0.018
Neg logit contributions
ars-0.014
 studies-0.014
 likes-0.013
 clipboard-0.013
 seven-0.013
Token The
Token ID383
Feature activation-5.88e-01

Pos logit contributions
ars+0.005
 studies+0.005
 likes+0.005
 clipboard+0.005
 seven+0.005
Neg logit contributions
 accidents-0.007
urg-0.007
ruck-0.006
icles-0.006
umm-0.006
Token dog
Token ID3290
Feature activation-1.64e+00

Pos logit contributions
 studies+0.035
ars+0.034
 seven+0.032
ighed+0.031
 hips+0.030
Neg logit contributions
 accidents-0.032
ruck-0.028
urg-0.025
olk-0.025
burn-0.025
Token bar
Token ID2318
Feature activation-6.73e+00

Pos logit contributions
ars+0.073
ossible+0.073
 clipboard+0.072
belt+0.071
 pocket+0.069
Neg logit contributions
 accidents-0.098
urg-0.088
 happened-0.084
 coming-0.080
 lava-0.076
Tokenks
Token ID591
Feature activation-3.84e+00

Pos logit contributions
 hips+0.309
 studies+0.301
tail+0.297
 pocket+0.294
 clipboard+0.291
Neg logit contributions
ull-0.400
urg-0.392
ruck-0.385
 accidents-0.383
 happened-0.368
Token loud
Token ID7812
Feature activation-1.57e+00

Pos logit contributions
ars+0.158
 studies+0.154
 likes+0.147
 clipboard+0.146
 seven+0.140
Neg logit contributions
urg-0.274
 accidents-0.271
ruck-0.260
icles-0.251
umm-0.242
Token.
Token ID13
Feature activation-1.22e-01

Pos logit contributions
ars+0.064
 likes+0.061
 studies+0.061
 clipboard+0.059
 seven+0.058
Neg logit contributions
urg-0.113
 accidents-0.111
ruck-0.107
icles-0.104
umm-0.103
Token Anna
Token ID11735
Feature activation-2.81e+00

Pos logit contributions
ars+0.006
 studies+0.006
 clipboard+0.005
 likes+0.005
 seven+0.005
Neg logit contributions
 accidents-0.008
urg-0.008
icles-0.007
ruck-0.007
umm-0.007
Token is
Token ID318
Feature activation-1.06e+00

Pos logit contributions
belt+0.135
 clipboard+0.135
ars+0.134
 pocket+0.130
ossible+0.129
Neg logit contributions
 accidents-0.155
urg-0.150
 happened-0.131
ruck-0.130
 Ow-0.127
Token.
Token ID13
Feature activation-4.15e-01

Pos logit contributions
ars+0.153
 studies+0.148
 likes+0.148
 clipboard+0.141
 seven+0.141
Neg logit contributions
urg-0.229
 accidents-0.225
ruck-0.215
icles-0.209
umm-0.200
Token\n
Token ID198
Feature activation-4.82e-01

Pos logit contributions
 studies+0.026
ars+0.024
 likes+0.023
 pocket+0.022
tail+0.022
Neg logit contributions
 accidents-0.020
urg-0.019
ruck-0.019
icles-0.019
ull-0.018
Token\n
Token ID198
Feature activation-4.61e-01

Pos logit contributions
 studies+0.025
ars+0.024
 clipboard+0.024
 likes+0.024
 seven+0.024
Neg logit contributions
urg-0.028
 accidents-0.028
ruck-0.027
icles-0.026
ull-0.025
TokenThe
Token ID464
Feature activation-8.93e-01

Pos logit contributions
 studies+0.034
 clipboard+0.031
 hips+0.030
 pocket+0.030
 likes+0.030
Neg logit contributions
 accidents-0.018
ruck-0.018
urg-0.016
ull-0.016
eddy-0.015
Token dog
Token ID3290
Feature activation-1.70e+00

Pos logit contributions
 studies+0.046
ars+0.046
 hips+0.045
ighed+0.044
tail+0.041
Neg logit contributions
 accidents-0.053
ruck-0.050
urg-0.046
ter-0.043
eddy-0.043
Token bar
Token ID2318
Feature activation-6.73e+00

Pos logit contributions
ars+0.073
ossible+0.071
belt+0.071
 clipboard+0.070
aye+0.068
Neg logit contributions
 accidents-0.107
urg-0.095
 happened-0.087
 coming-0.085
 lava-0.083
Tokenks
Token ID591
Feature activation-3.78e+00

Pos logit contributions
tail+0.337
 studies+0.320
 pedal+0.319
 hips+0.317
 pocket+0.310
Neg logit contributions
ull-0.375
urg-0.364
 accidents-0.351
ier-0.348
ruck-0.341
Token and
Token ID290
Feature activation+3.77e-03

Pos logit contributions
 studies+0.187
ars+0.182
 clipboard+0.176
 pocket+0.174
 likes+0.169
Neg logit contributions
 accidents-0.241
urg-0.237
ruck-0.224
icles-0.213
ull-0.207
Token runs
Token ID4539
Feature activation-2.21e+00

Pos logit contributions
 accidents+0.000
urg+0.000
 spilling+0.000
 lava+0.000
icles+0.000
Neg logit contributions
ars-0.000
ooter-0.000
ossible-0.000
 studies-0.000
belt-0.000
Token away
Token ID1497
Feature activation-2.04e+00

Pos logit contributions
 studies+0.123
ars+0.121
 clipboard+0.119
 likes+0.118
 pocket+0.114
Neg logit contributions
 accidents-0.120
urg-0.114
ruck-0.113
icles-0.107
umm-0.099
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
ars+0.112
 studies+0.109
 clipboard+0.105
 likes+0.104
 seven+0.101
Neg logit contributions
 accidents-0.119
urg-0.116
ruck-0.109
icles-0.103
ull-0.099
Token?"
Token ID1701
Feature activation-1.58e-01

Pos logit contributions
ars+0.139
 likes+0.133
 studies+0.132
 clipboard+0.127
 seven+0.126
Neg logit contributions
urg-0.243
 accidents-0.238
ruck-0.231
icles-0.224
umm-0.221
Token\n
Token ID198
Feature activation-4.10e-01

Pos logit contributions
ars+0.009
 studies+0.008
 likes+0.008
 clipboard+0.008
 seven+0.008
Neg logit contributions
 accidents-0.009
urg-0.009
icles-0.008
ruck-0.008
 Baby-0.008
Token\n
Token ID198
Feature activation-4.02e-01

Pos logit contributions
 studies+0.023
ars+0.022
 likes+0.022
 clipboard+0.022
 seven+0.021
Neg logit contributions
urg-0.022
 accidents-0.022
ruck-0.021
icles-0.021
ull-0.020
TokenThe
Token ID464
Feature activation-9.56e-01

Pos logit contributions
 studies+0.031
 hips+0.028
 likes+0.028
 clipboard+0.028
ars+0.027
Neg logit contributions
 accidents-0.014
ruck-0.014
urg-0.014
icles-0.013
Ab-0.013
Token dog
Token ID3290
Feature activation-1.70e+00

Pos logit contributions
 studies+0.051
ars+0.051
 hips+0.048
 clipboard+0.046
 seven+0.046
Neg logit contributions
 accidents-0.056
ruck-0.052
urg-0.048
eddy-0.047
olk-0.045
Token bar
Token ID2318
Feature activation-6.72e+00

Pos logit contributions
belt+0.078
ars+0.077
aye+0.070
ossible+0.069
ang+0.068
Neg logit contributions
 accidents-0.101
urg-0.092
 happened-0.089
 coming-0.082
 lava-0.080
Tokenks
Token ID591
Feature activation-3.92e+00

Pos logit contributions
 hips+0.329
tail+0.325
 studies+0.319
 pedal+0.314
 pocket+0.308
Neg logit contributions
ull-0.384
urg-0.358
 happened-0.342
ier-0.334
 accidents-0.327
Token.
Token ID13
Feature activation-3.23e-01

Pos logit contributions
 studies+0.206
ars+0.201
 clipboard+0.194
 pocket+0.193
 likes+0.182
Neg logit contributions
 accidents-0.238
urg-0.233
ruck-0.219
icles-0.207
ull-0.205
Token He
Token ID679
Feature activation-1.47e+00

Pos logit contributions
ars+0.018
 studies+0.018
 clipboard+0.017
 likes+0.016
 poster+0.016
Neg logit contributions
urg-0.018
 accidents-0.017
icles-0.017
umm-0.016
ruck-0.016
Token says
Token ID1139
Feature activation-3.36e+00

Pos logit contributions
ars+0.067
belt+0.064
aye+0.061
ooter+0.061
ossible+0.059
Neg logit contributions
 happened-0.084
 accidents-0.082
urg-0.080
 spending-0.069
 such-0.068
Token,
Token ID11
Feature activation-9.87e-01

Pos logit contributions
ars+0.152
 studies+0.149
 likes+0.138
 clipboard+0.137
 pocket+0.132
Neg logit contributions
 accidents-0.227
urg-0.225
icles-0.207
ruck-0.201
ull-0.199
Token.
Token ID13
Feature activation+7.16e-02

Pos logit contributions
urg+0.028
 accidents+0.026
ruck+0.026
icles+0.025
umm+0.025
Neg logit contributions
ars-0.017
 likes-0.017
 studies-0.016
 seven-0.016
 clipboard-0.015
Token␣
Token ID220
Feature activation-3.69e+00

Pos logit contributions
urg+0.004
 accidents+0.004
icles+0.004
ruck+0.004
umm+0.004
Neg logit contributions
ars-0.004
 studies-0.004
 likes-0.004
 clipboard-0.004
 mat-0.004
Token\n
Token ID198
Feature activation+1.81e-01

Pos logit contributions
ars+0.162
 studies+0.155
 likes+0.155
 clipboard+0.150
 seven+0.149
Neg logit contributions
urg-0.253
 accidents-0.247
ruck-0.239
icles-0.232
umm-0.227
Token\n
Token ID198
Feature activation+2.10e-01

Pos logit contributions
urg+0.009
ruck+0.009
 accidents+0.009
icles+0.009
ull+0.009
Neg logit contributions
 studies-0.010
 likes-0.010
ars-0.010
 clipboard-0.010
 pocket-0.010
TokenThe
Token ID464
Feature activation-5.50e-01

Pos logit contributions
ruck+0.010
 accidents+0.010
urg+0.010
ull+0.010
Mill+0.010
Neg logit contributions
 studies-0.012
 clipboard-0.012
 likes-0.012
 pocket-0.012
ars-0.012
Token bar
Token ID2318
Feature activation-6.72e+00

Pos logit contributions
ars+0.036
ighed+0.034
tail+0.034
 clipboard+0.032
 likes+0.032
Neg logit contributions
 accidents-0.026
ruck-0.025
urg-0.024
umm-0.023
icles-0.021
Tokenber
Token ID527
Feature activation-3.54e+00

Pos logit contributions
 hips+0.299
ars+0.297
 mat+0.295
ossible+0.292
 studies+0.289
Neg logit contributions
ruck-0.396
urg-0.395
ull-0.387
 accidents-0.374
umm-0.366
Token smiled
Token ID13541
Feature activation-9.61e-01

Pos logit contributions
ars+0.181
 mat+0.171
 clipboard+0.162
 studies+0.161
oir+0.159
Neg logit contributions
urg-0.199
 accidents-0.196
ruck-0.191
icles-0.181
 Baby-0.174
Token and
Token ID290
Feature activation+4.98e-01

Pos logit contributions
ars+0.044
 studies+0.041
 likes+0.040
 clipboard+0.039
 seven+0.038
Neg logit contributions
urg-0.066
 accidents-0.065
ruck-0.062
icles-0.060
umm-0.059
Token said
Token ID531
Feature activation-2.65e-01

Pos logit contributions
 accidents+0.033
urg+0.033
ruck+0.030
 Mill+0.030
icles+0.030
Neg logit contributions
ars-0.023
 studies-0.021
 mat-0.020
 clipboard-0.020
 likes-0.019
Token,
Token ID11
Feature activation-3.06e-01

Pos logit contributions
ars+0.012
 studies+0.011
 likes+0.011
 clipboard+0.011
 pocket+0.010
Neg logit contributions
urg-0.018
 accidents-0.018
ruck-0.017
icles-0.017
umm-0.016
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
 studies+0.150
ars+0.150
 likes+0.145
 clipboard+0.144
 seven+0.138
Neg logit contributions
urg-0.125
 accidents-0.123
ruck-0.117
icles-0.113
olk-0.105
Token ball
Token ID2613
Feature activation-1.17e+00

Pos logit contributions
 studies+0.052
ars+0.051
 likes+0.047
 clipboard+0.047
 seven+0.047
Neg logit contributions
 accidents-0.076
ruck-0.073
urg-0.070
icles-0.068
umm-0.066
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
 studies+0.069
 clipboard+0.067
ars+0.065
 pocket+0.060
 likes+0.060
Neg logit contributions
 accidents-0.060
urg-0.057
ruck-0.057
icles-0.055
ull-0.053
Token The
Token ID383
Feature activation-6.55e-01

Pos logit contributions
ars+0.005
 studies+0.005
 clipboard+0.005
 likes+0.005
 seven+0.005
Neg logit contributions
 accidents-0.006
urg-0.006
icles-0.005
ruck-0.005
umm-0.005
Token dog
Token ID3290
Feature activation-1.71e+00

Pos logit contributions
 studies+0.034
ars+0.033
 seven+0.031
 likes+0.030
 clipboard+0.030
Neg logit contributions
 accidents-0.041
ruck-0.037
urg-0.035
icles-0.034
olk-0.034
Token bar
Token ID2318
Feature activation-6.71e+00

Pos logit contributions
ars+0.075
 clipboard+0.074
 studies+0.072
ossible+0.071
 hips+0.069
Neg logit contributions
 accidents-0.106
urg-0.095
 happened-0.086
icles-0.084
 lava-0.083
Tokenks
Token ID591
Feature activation-3.77e+00

Pos logit contributions
 studies+0.307
tail+0.305
 hips+0.302
 clipboard+0.296
 pedal+0.295
Neg logit contributions
ull-0.390
urg-0.382
 accidents-0.375
ruck-0.372
 happened-0.345
Token.
Token ID13
Feature activation-8.93e-02

Pos logit contributions
ars+0.167
 studies+0.166
 clipboard+0.158
 likes+0.156
 pocket+0.151
Neg logit contributions
urg-0.255
 accidents-0.253
ruck-0.241
icles-0.232
umm-0.222
Token\n
Token ID198
Feature activation-1.75e-01

Pos logit contributions
ars+0.004
 studies+0.004
 clipboard+0.004
 likes+0.004
 seven+0.004
Neg logit contributions
urg-0.005
 accidents-0.005
icles-0.005
ruck-0.005
umm-0.005
Token\n
Token ID198
Feature activation-1.74e-01

Pos logit contributions
ars+0.008
 studies+0.008
 likes+0.008
 clipboard+0.008
 seven+0.008
Neg logit contributions
urg-0.011
 accidents-0.011
ruck-0.010
icles-0.010
ull-0.009
Token"
Token ID1
Feature activation-1.89e+00

Pos logit contributions
 studies+0.009
 clipboard+0.009
 hips+0.008
 pocket+0.008
 pedal+0.008
Neg logit contributions
 accidents-0.010
urg-0.010
asting-0.009
ruck-0.009
icles-0.009
Token.
Token ID13
Feature activation-1.98e-01

Pos logit contributions
ars+0.022
 studies+0.021
 likes+0.021
 seven+0.020
 clipboard+0.020
Neg logit contributions
urg-0.028
 accidents-0.027
ruck-0.026
icles-0.025
ull-0.024
Token He
Token ID679
Feature activation-1.23e+00

Pos logit contributions
ars+0.011
 studies+0.011
 likes+0.010
 clipboard+0.010
 seven+0.010
Neg logit contributions
urg-0.011
 accidents-0.011
ruck-0.010
icles-0.010
umm-0.010
Token broke
Token ID6265
Feature activation+2.58e+00

Pos logit contributions
ars+0.070
 studies+0.063
 clipboard+0.063
tail+0.063
 mat+0.062
Neg logit contributions
urg-0.068
 accidents-0.063
ruck-0.059
ull-0.057
icles-0.057
Token the
Token ID262
Feature activation-9.08e-01

Pos logit contributions
urg+0.180
umm+0.174
ruck+0.171
 accidents+0.170
icles+0.155
Neg logit contributions
 seven-0.108
 bar-0.101
 likes-0.097
 clipboard-0.096
 pocket-0.096
Token chocolate
Token ID11311
Feature activation+1.51e+00

Pos logit contributions
ars+0.038
 studies+0.036
 likes+0.033
 clipboard+0.033
 think+0.032
Neg logit contributions
 accidents-0.064
ruck-0.062
urg-0.060
icles-0.059
olk-0.058
Token bar
Token ID2318
Feature activation-6.71e+00

Pos logit contributions
urg+0.107
 accidents+0.105
ruck+0.102
icles+0.099
umm+0.097
Neg logit contributions
ars-0.063
 studies-0.060
 likes-0.060
 clipboard-0.058
 seven-0.057
Token in
Token ID287
Feature activation-2.35e+00

Pos logit contributions
 studies+0.335
 likes+0.320
ars+0.313
 clipboard+0.311
 think+0.310
Neg logit contributions
ruck-0.398
urg-0.390
 accidents-0.379
icles-0.374
umm-0.363
Token half
Token ID2063
Feature activation+1.67e+00

Pos logit contributions
 seven+0.119
ars+0.116
 likes+0.116
 pocket+0.111
 clipboard+0.111
Neg logit contributions
urg-0.146
 accidents-0.142
umm-0.133
ruck-0.132
icles-0.127
Token and
Token ID290
Feature activation+1.54e-01

Pos logit contributions
urg+0.114
 accidents+0.112
ruck+0.108
icles+0.105
umm+0.103
Neg logit contributions
ars-0.073
 studies-0.071
 likes-0.070
 clipboard-0.068
 seven-0.067
Token gave
Token ID2921
Feature activation+1.13e+00

Pos logit contributions
urg+0.010
 accidents+0.010
ruck+0.010
icles+0.009
umm+0.009
Neg logit contributions
ars-0.007
 likes-0.007
 studies-0.007
 clipboard-0.007
 seven-0.007
Token one
Token ID530
Feature activation-1.63e+00

Pos logit contributions
urg+0.063
 accidents+0.059
ruck+0.058
umm+0.056
icles+0.053
Neg logit contributions
ars-0.064
 likes-0.064
 seven-0.064
 sees-0.060
 clipboard-0.060
Token?"
Token ID1701
Feature activation-1.19e-02

Pos logit contributions
ars+0.152
 studies+0.148
 clipboard+0.141
 likes+0.141
 seven+0.137
Neg logit contributions
urg-0.225
 accidents-0.223
ruck-0.213
icles-0.209
umm-0.200
Token\n
Token ID198
Feature activation-1.60e-01

Pos logit contributions
ars+0.001
 studies+0.001
suit+0.001
 locker+0.001
ighed+0.001
Neg logit contributions
 accidents-0.000
 Joey-0.000
 Twe-0.000
 Bobby-0.000
urg-0.000
Token\n
Token ID198
Feature activation-1.53e-01

Pos logit contributions
 studies+0.009
ars+0.009
 likes+0.008
 seven+0.008
 clipboard+0.008
Neg logit contributions
urg-0.009
 accidents-0.009
ruck-0.008
ull-0.008
icles-0.008
TokenThe
Token ID464
Feature activation-8.04e-01

Pos logit contributions
 studies+0.011
 hips+0.010
 pocket+0.009
 likes+0.009
 backpack+0.009
Neg logit contributions
 accidents-0.006
ull-0.005
ruck-0.005
urg-0.005
Mill-0.005
Token dog
Token ID3290
Feature activation-1.67e+00

Pos logit contributions
 studies+0.048
ars+0.048
 hips+0.046
 seven+0.044
ighed+0.044
Neg logit contributions
 accidents-0.042
eddy-0.037
ruck-0.036
urg-0.033
burn-0.032
Token bar
Token ID2318
Feature activation-6.70e+00

Pos logit contributions
ars+0.084
belt+0.078
ossible+0.074
ang+0.073
aye+0.071
Neg logit contributions
 accidents-0.097
urg-0.089
 happened-0.087
 coming-0.081
 lava-0.079
Tokenks
Token ID591
Feature activation-3.85e+00

Pos logit contributions
tail+0.335
 hips+0.330
 studies+0.320
 pocket+0.315
 pedal+0.312
Neg logit contributions
ull-0.394
urg-0.357
 happened-0.346
 accidents-0.326
ier-0.320
Token.
Token ID13
Feature activation+2.01e-02

Pos logit contributions
 studies+0.198
ars+0.194
 pocket+0.187
 clipboard+0.186
 mat+0.175
Neg logit contributions
 accidents-0.240
urg-0.232
ruck-0.221
ull-0.209
icles-0.207
Token He
Token ID679
Feature activation-1.35e+00

Pos logit contributions
 accidents+0.001
urg+0.001
icles+0.001
umm+0.001
ull+0.001
Neg logit contributions
ars-0.001
 studies-0.001
 clipboard-0.001
 poster-0.001
 think-0.001
Token does
Token ID857
Feature activation-3.47e+00

Pos logit contributions
ars+0.064
belt+0.056
ossible+0.054
tail+0.054
ooter+0.053
Neg logit contributions
 accidents-0.079
urg-0.078
 happened-0.078
 spending-0.068
 coming-0.066
Token not
Token ID407
Feature activation-2.33e+00

Pos logit contributions
ars+0.241
 studies+0.227
ebra+0.227
 pocket+0.224
ang+0.222
Neg logit contributions
 accidents-0.148
urg-0.146
ull-0.121
ruck-0.120
 theirs-0.115
Token they
Token ID484
Feature activation+1.49e+00

Pos logit contributions
ars+0.077
 clipboard+0.072
 studies+0.072
 likes+0.069
ighed+0.066
Neg logit contributions
 accidents-0.092
urg-0.088
icles-0.083
 Bobby-0.080
ruck-0.080
Token all
Token ID477
Feature activation-1.02e+00

Pos logit contributions
urg+0.065
ruck+0.063
rest+0.057
asting+0.057
 accidents+0.057
Neg logit contributions
 clipboard-0.095
ars-0.092
 mat-0.092
 hips-0.089
 studies-0.089
Token knew
Token ID2993
Feature activation-3.49e-01

Pos logit contributions
 clipboard+0.060
ars+0.059
 hips+0.056
 mat+0.056
 studies+0.055
Neg logit contributions
ruck-0.054
urg-0.052
 accidents-0.049
ull-0.048
 Mill-0.046
Token the
Token ID262
Feature activation-5.78e-01

Pos logit contributions
ars+0.018
 studies+0.017
 likes+0.017
 clipboard+0.017
 seven+0.017
Neg logit contributions
urg-0.022
 accidents-0.020
ruck-0.020
umm-0.019
icles-0.019
Token great
Token ID1049
Feature activation+2.54e+00

Pos logit contributions
ars+0.031
ighed+0.029
ossible+0.028
tail+0.027
ang+0.026
Neg logit contributions
 accidents-0.033
ruck-0.033
umm-0.031
icles-0.029
 Baby-0.029
Token bar
Token ID2318
Feature activation-6.70e+00

Pos logit contributions
ruck+0.155
 accidents+0.154
urg+0.144
umm+0.141
icles+0.136
Neg logit contributions
ars-0.132
ighed-0.116
 seven-0.112
ossible-0.112
tail-0.110
Tokenber
Token ID527
Feature activation-3.63e+00

Pos logit contributions
ars+0.265
 hips+0.258
ossible+0.257
ighed+0.256
 mat+0.254
Neg logit contributions
ruck-0.438
 accidents-0.423
urg-0.419
umm-0.405
icles-0.396
Token was
Token ID373
Feature activation+1.37e+00

Pos logit contributions
ars+0.195
 clipboard+0.187
 studies+0.182
 mat+0.181
 seven+0.180
Neg logit contributions
 accidents-0.198
urg-0.191
ruck-0.190
icles-0.179
umm-0.171
Token the
Token ID262
Feature activation-5.43e-01

Pos logit contributions
urg+0.079
 accidents+0.078
ruck+0.075
icles+0.072
umm+0.070
Neg logit contributions
ars-0.075
 likes-0.072
 studies-0.072
 clipboard-0.070
 seven-0.069
Token fastest
Token ID14162
Feature activation-2.32e+00

Pos logit contributions
ars+0.034
ossible+0.031
ighed+0.031
tail+0.030
 likes+0.030
Neg logit contributions
umm-0.026
 accidents-0.026
ruck-0.025
ter-0.022
icles-0.022
Token in
Token ID287
Feature activation-2.18e+00

Pos logit contributions
ars+0.116
 studies+0.107
 clipboard+0.105
 likes+0.105
ossible+0.102
Neg logit contributions
ruck-0.141
 accidents-0.140
urg-0.135
umm-0.130
icles-0.128
Token The
Token ID383
Feature activation-5.49e-01

Pos logit contributions
 likes+0.010
ars+0.010
 clipboard+0.009
 studies+0.009
 seven+0.009
Neg logit contributions
urg-0.017
ruck-0.016
 accidents-0.016
umm-0.015
icles-0.015
Token dog
Token ID3290
Feature activation-1.73e+00

Pos logit contributions
ars+0.030
 studies+0.029
ighed+0.027
 seven+0.027
 hips+0.027
Neg logit contributions
 accidents-0.032
ruck-0.029
urg-0.027
olk-0.026
umm-0.026
Token is
Token ID318
Feature activation-1.04e+00

Pos logit contributions
ossible+0.086
ars+0.084
 clipboard+0.081
 pocket+0.079
aye+0.078
Neg logit contributions
 accidents-0.101
urg-0.093
 happened-0.087
 coming-0.083
 lava-0.083
Token angry
Token ID7954
Feature activation-2.36e+00

Pos logit contributions
ars+0.049
 likes+0.047
 studies+0.047
 clipboard+0.046
 seven+0.045
Neg logit contributions
urg-0.068
 accidents-0.067
ruck-0.065
icles-0.063
umm-0.062
Token and
Token ID290
Feature activation+5.91e-02

Pos logit contributions
 likes+0.079
ars+0.076
 sees+0.069
 studies+0.069
 seven+0.068
Neg logit contributions
urg-0.192
 accidents-0.180
umm-0.179
icles-0.177
ruck-0.176
Token bar
Token ID2318
Feature activation-6.70e+00

Pos logit contributions
 accidents+0.004
urg+0.003
 happened+0.003
 spilling+0.003
ruck+0.003
Neg logit contributions
ars-0.003
 studies-0.003
 clipboard-0.003
ossible-0.003
 pocket-0.003
Tokenks
Token ID591
Feature activation-3.80e+00

Pos logit contributions
 studies+0.231
ars+0.227
 clipboard+0.226
 pocket+0.224
 mat+0.224
Neg logit contributions
urg-0.482
 accidents-0.476
ruck-0.473
ull-0.450
icles-0.430
Token at
Token ID379
Feature activation-2.04e+00

Pos logit contributions
ars+0.170
 studies+0.167
 likes+0.160
 clipboard+0.159
 seven+0.153
Neg logit contributions
urg-0.257
 accidents-0.255
ruck-0.244
icles-0.234
umm-0.226
Token them
Token ID606
Feature activation+5.87e-01

Pos logit contributions
 seven+0.115
 likes+0.111
ars+0.109
 think+0.109
 sees+0.109
Neg logit contributions
umm-0.113
urg-0.110
 accidents-0.105
ruck-0.102
olk-0.096
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
urg+0.040
 accidents+0.040
ruck+0.038
icles+0.037
umm+0.036
Neg logit contributions
ars-0.026
 studies-0.025
 likes-0.024
 clipboard-0.024
 seven-0.023
Token It
Token ID632
Feature activation+6.73e-01

Pos logit contributions
ars+0.011
 likes+0.011
 studies+0.010
 clipboard+0.010
 seven+0.010
Neg logit contributions
urg-0.015
 accidents-0.015
ruck-0.015
icles-0.014
umm-0.014
Token dog
Token ID3290
Feature activation-1.64e+00

Pos logit contributions
ars+0.028
 studies+0.028
 seven+0.026
ighed+0.026
 hips+0.025
Neg logit contributions
 accidents-0.033
ruck-0.031
urg-0.029
olk-0.028
icles-0.027
Token is
Token ID318
Feature activation-9.79e-01

Pos logit contributions
ars+0.086
ossible+0.083
 studies+0.078
aye+0.078
 clipboard+0.078
Neg logit contributions
 accidents-0.090
urg-0.083
 happened-0.080
 coming-0.072
 Baby-0.070
Token angry
Token ID7954
Feature activation-2.33e+00

Pos logit contributions
ars+0.044
 studies+0.043
 likes+0.042
 clipboard+0.041
 seven+0.040
Neg logit contributions
urg-0.066
 accidents-0.065
ruck-0.062
icles-0.060
umm-0.058
Token.
Token ID13
Feature activation-2.96e-02

Pos logit contributions
ars+0.083
 likes+0.082
 studies+0.077
 seven+0.075
 clipboard+0.073
Neg logit contributions
urg-0.181
 accidents-0.173
ruck-0.169
icles-0.167
umm-0.167
Token It
Token ID632
Feature activation+6.58e-01

Pos logit contributions
ars+0.002
 studies+0.002
 likes+0.001
 clipboard+0.001
 seven+0.001
Neg logit contributions
urg-0.002
 accidents-0.002
ruck-0.002
icles-0.002
umm-0.002
Token bar
Token ID2318
Feature activation-6.69e+00

Pos logit contributions
 accidents+0.040
 happened+0.037
urg+0.037
 lava+0.033
 coming+0.033
Neg logit contributions
ars-0.032
 wrist-0.029
 studies-0.028
 clipboard-0.028
 pocket-0.028
Tokenks
Token ID591
Feature activation-3.82e+00

Pos logit contributions
ossible+0.272
 pocket+0.271
 studies+0.268
 hips+0.266
tail+0.266
Neg logit contributions
 accidents-0.435
urg-0.422
ruck-0.404
ull-0.399
 happened-0.392
Token and
Token ID290
Feature activation+1.38e-01

Pos logit contributions
 studies+0.173
ars+0.170
 clipboard+0.162
 pocket+0.158
 likes+0.158
Neg logit contributions
 accidents-0.256
urg-0.255
ruck-0.242
icles-0.229
ull-0.220
Token runs
Token ID4539
Feature activation-2.18e+00

Pos logit contributions
 accidents+0.009
urg+0.008
 spilling+0.008
 returning+0.007
 happened+0.007
Neg logit contributions
ars-0.007
 studies-0.006
ossible-0.006
ooter-0.006
ebra-0.006
Token to
Token ID284
Feature activation+1.41e-01

Pos logit contributions
ars+0.111
 studies+0.111
 likes+0.106
 clipboard+0.106
 pocket+0.102
Neg logit contributions
 accidents-0.131
urg-0.127
ruck-0.124
icles-0.117
umm-0.111
Token them
Token ID606
Feature activation+5.89e-01

Pos logit contributions
urg+0.009
 accidents+0.009
ruck+0.008
icles+0.008
umm+0.008
Neg logit contributions
ars-0.007
 likes-0.007
 studies-0.007
 clipboard-0.007
 seven-0.007
Token dog
Token ID3290
Feature activation-1.33e+00

Pos logit contributions
 accidents+0.013
ruck+0.012
urg+0.012
icles+0.011
umm+0.011
Neg logit contributions
ars-0.012
 studies-0.011
ighed-0.011
 clipboard-0.011
 likes-0.011
Token watches
Token ID16860
Feature activation-1.98e+00

Pos logit contributions
ars+0.064
 clipboard+0.059
ossible+0.056
aye+0.055
tail+0.055
Neg logit contributions
 accidents-0.080
urg-0.073
 happened-0.072
 Twe-0.065
 Baby-0.064
Token them
Token ID606
Feature activation+8.36e-01

Pos logit contributions
ars+0.105
 likes+0.102
 seven+0.102
 clipboard+0.099
 studies+0.099
Neg logit contributions
urg-0.115
 accidents-0.111
ruck-0.108
umm-0.107
icles-0.103
Token.
Token ID13
Feature activation+4.41e-01

Pos logit contributions
urg+0.053
 accidents+0.052
ruck+0.050
icles+0.048
umm+0.044
Neg logit contributions
ars-0.041
 studies-0.039
 seven-0.039
 clipboard-0.039
 likes-0.038
Token He
Token ID679
Feature activation-1.14e+00

Pos logit contributions
urg+0.024
 accidents+0.024
icles+0.023
ruck+0.023
umm+0.022
Neg logit contributions
ars-0.025
 studies-0.024
 clipboard-0.023
 likes-0.023
 seven-0.022
Token bar
Token ID2318
Feature activation-6.69e+00

Pos logit contributions
ars+0.051
aye+0.049
 clipboard+0.049
belt+0.048
 hips+0.047
Neg logit contributions
 accidents-0.066
 happened-0.063
urg-0.062
 such-0.055
ruck-0.054
Tokenks
Token ID591
Feature activation-3.83e+00

Pos logit contributions
 hips+0.314
 mat+0.310
tail+0.306
 belt+0.303
 pocket+0.299
Neg logit contributions
ull-0.375
urg-0.369
ruck-0.368
 accidents-0.365
 happened-0.350
Token and
Token ID290
Feature activation+4.08e-01

Pos logit contributions
 studies+0.174
ars+0.173
 clipboard+0.163
 pocket+0.162
 likes+0.159
Neg logit contributions
 accidents-0.256
urg-0.255
ruck-0.243
icles-0.229
ull-0.221
Token cheers
Token ID34550
Feature activation-2.10e+00

Pos logit contributions
 accidents+0.026
urg+0.023
 returning+0.021
 lava+0.021
 spilling+0.021
Neg logit contributions
ars-0.021
ooter-0.018
belt-0.017
 studies-0.017
ossible-0.017
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
ars+0.087
 likes+0.084
 studies+0.082
 clipboard+0.080
 seven+0.080
Neg logit contributions
urg-0.150
 accidents-0.147
ruck-0.142
icles-0.138
umm-0.137
Token He
Token ID679
Feature activation-1.27e+00

Pos logit contributions
urg+0.007
 accidents+0.007
icles+0.007
ruck+0.007
ull+0.007
Neg logit contributions
ars-0.007
 studies-0.007
 likes-0.007
 clipboard-0.007
 pocket-0.006
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
ars+0.129
 likes+0.121
 clipboard+0.120
 studies+0.120
 seven+0.116
Neg logit contributions
urg-0.250
 accidents-0.245
ruck-0.240
umm-0.235
icles-0.233
Token\n
Token ID198
Feature activation-3.67e-01

Pos logit contributions
ars+0.011
 studies+0.011
 likes+0.010
 clipboard+0.010
 seven+0.010
Neg logit contributions
 accidents-0.013
urg-0.013
ruck-0.012
icles-0.012
olk-0.011
Token\n
Token ID198
Feature activation-3.67e-01

Pos logit contributions
ars+0.017
 studies+0.017
 likes+0.017
 clipboard+0.016
 seven+0.016
Neg logit contributions
urg-0.023
 accidents-0.023
ruck-0.022
icles-0.021
olk-0.021
TokenThe
Token ID464
Feature activation-9.18e-01

Pos logit contributions
 studies+0.021
 likes+0.019
ars+0.018
 clipboard+0.018
 hips+0.018
Neg logit contributions
 accidents-0.020
ruck-0.019
urg-0.019
Mill-0.018
eddy-0.018
Token puppy
Token ID26188
Feature activation+8.83e-01

Pos logit contributions
ars+0.053
 studies+0.052
 clipboard+0.049
 hips+0.049
 likes+0.049
Neg logit contributions
 accidents-0.051
ruck-0.047
urg-0.046
eddy-0.043
umm-0.042
Token bar
Token ID2318
Feature activation-6.69e+00

Pos logit contributions
 accidents+0.055
urg+0.050
 happened+0.047
 coming+0.042
icles+0.042
Neg logit contributions
ars-0.043
ossible-0.037
belt-0.035
ang-0.035
 clipboard-0.035
Tokenks
Token ID591
Feature activation-3.90e+00

Pos logit contributions
 hips+0.321
tail+0.319
 own+0.311
 pedal+0.311
 mat+0.309
Neg logit contributions
ull-0.381
urg-0.358
 happened-0.339
 accidents-0.337
ier-0.329
Token and
Token ID290
Feature activation+7.03e-02

Pos logit contributions
 studies+0.187
ars+0.187
 clipboard+0.176
 pocket+0.174
 likes+0.169
Neg logit contributions
 accidents-0.251
urg-0.248
ruck-0.235
icles-0.223
ull-0.218
Token jumps
Token ID18045
Feature activation-2.02e+00

Pos logit contributions
 accidents+0.004
urg+0.004
 returning+0.004
 Baby+0.004
 spilling+0.004
Neg logit contributions
ars-0.004
ooter-0.003
belt-0.003
ossible-0.003
 studies-0.003
Token out
Token ID503
Feature activation+1.33e+00

Pos logit contributions
ars+0.105
 studies+0.102
 likes+0.101
 clipboard+0.099
 seven+0.097
Neg logit contributions
urg-0.123
 accidents-0.121
ruck-0.116
icles-0.112
umm-0.109
Token of
Token ID286
Feature activation+3.68e+00

Pos logit contributions
urg+0.063
 accidents+0.061
ruck+0.058
icles+0.056
umm+0.054
Neg logit contributions
ars-0.087
 studies-0.084
 likes-0.084
 clipboard-0.082
 seven-0.082
Token Spot
Token ID15899
Feature activation+7.17e-02

Pos logit contributions
ars+0.025
 studies+0.024
 clipboard+0.024
 likes+0.023
 seven+0.023
Neg logit contributions
 accidents-0.015
urg-0.013
icles-0.012
ruck-0.011
 Baby-0.010
Token is
Token ID318
Feature activation-1.09e+00

Pos logit contributions
urg+0.004
 accidents+0.004
 happened+0.004
icles+0.004
ruck+0.004
Neg logit contributions
 clipboard-0.003
 studies-0.003
 seven-0.003
ars-0.003
 headset-0.003
Token scared
Token ID12008
Feature activation-6.13e-01

Pos logit contributions
ars+0.052
 studies+0.050
 clipboard+0.046
 likes+0.046
ossible+0.046
Neg logit contributions
 accidents-0.072
urg-0.070
ruck-0.067
icles-0.063
ull-0.062
Token.
Token ID13
Feature activation-4.07e-01

Pos logit contributions
 likes+0.020
ars+0.020
 seven+0.018
 bar+0.017
 think+0.017
Neg logit contributions
urg-0.051
umm-0.047
 accidents-0.046
ruck-0.046
icles-0.046
Token He
Token ID679
Feature activation-1.38e+00

Pos logit contributions
ars+0.019
 likes+0.018
 studies+0.018
 clipboard+0.018
 seven+0.017
Neg logit contributions
urg-0.027
 accidents-0.026
ruck-0.026
icles-0.025
umm-0.024
Token bar
Token ID2318
Feature activation-6.69e+00

Pos logit contributions
 wrist+0.064
aye+0.059
ars+0.059
 locker+0.057
tail+0.057
Neg logit contributions
 happened-0.078
urg-0.077
 accidents-0.076
 Baby-0.066
 Ow-0.065
Tokenks
Token ID591
Feature activation-3.80e+00

Pos logit contributions
tail+0.294
 studies+0.280
 pocket+0.280
 mat+0.277
 hips+0.272
Neg logit contributions
urg-0.412
 accidents-0.405
ruck-0.397
ull-0.397
 happened-0.374
Token at
Token ID379
Feature activation-2.03e+00

Pos logit contributions
 studies+0.190
ars+0.184
 pocket+0.176
 clipboard+0.173
 likes+0.166
Neg logit contributions
 accidents-0.245
urg-0.238
ruck-0.225
icles-0.212
ull-0.206
Token the
Token ID262
Feature activation-1.18e+00

Pos logit contributions
ars+0.119
 studies+0.114
 likes+0.114
 clipboard+0.111
 seven+0.110
Neg logit contributions
urg-0.111
 accidents-0.109
ruck-0.104
icles-0.100
umm-0.097
Token truck
Token ID7779
Feature activation-3.57e+00

Pos logit contributions
 studies+0.047
ars+0.047
 likes+0.042
tail+0.042
 seven+0.041
Neg logit contributions
ruck-0.084
urg-0.082
 accidents-0.082
icles-0.080
umm-0.075
Token and
Token ID290
Feature activation-5.03e-02

Pos logit contributions
ars+0.174
 studies+0.171
 clipboard+0.161
ossible+0.161
 seven+0.159
Neg logit contributions
 accidents-0.215
urg-0.214
icles-0.208
ruck-0.207
umm-0.182
Token thought
Token ID1807
Feature activation-4.07e+00

Pos logit contributions
urg+0.017
ruck+0.017
 accidents+0.017
 happened+0.016
 Baby+0.015
Neg logit contributions
ars-0.020
 mat-0.018
 clipboard-0.018
 locker-0.018
oir-0.018
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
ars+0.217
 studies+0.206
 likes+0.204
 clipboard+0.200
 seven+0.198
Neg logit contributions
urg-0.245
 accidents-0.245
ruck-0.231
icles-0.225
umm-0.211
Token man
Token ID582
Feature activation-3.88e+00

Pos logit contributions
ars+0.060
 studies+0.055
 seven+0.054
 hips+0.053
ang+0.053
Neg logit contributions
 accidents-0.058
ruck-0.055
 happened-0.051
icles-0.050
burn-0.048
Token was
Token ID373
Feature activation+1.01e+00

Pos logit contributions
ars+0.224
 clipboard+0.215
 studies+0.214
 mat+0.211
 seven+0.210
Neg logit contributions
 accidents-0.201
urg-0.193
ruck-0.189
icles-0.182
ull-0.172
Token the
Token ID262
Feature activation-1.02e+00

Pos logit contributions
urg+0.071
 accidents+0.069
ruck+0.067
icles+0.065
umm+0.064
Neg logit contributions
ars-0.044
 likes-0.043
 studies-0.042
 clipboard-0.041
 seven-0.040
Token bar
Token ID2318
Feature activation-6.68e+00

Pos logit contributions
ars+0.061
 studies+0.056
ossible+0.052
 hips+0.052
tail+0.051
Neg logit contributions
ruck-0.054
 accidents-0.054
umm-0.050
ter-0.049
burn-0.049
Tokenber
Token ID527
Feature activation-3.48e+00

Pos logit contributions
 studies+0.272
ars+0.267
ossible+0.264
oir+0.254
 hips+0.252
Neg logit contributions
ruck-0.443
 accidents-0.435
urg-0.422
eddy-0.419
umm-0.419
Token.
Token ID13
Feature activation-9.23e-02

Pos logit contributions
ars+0.180
 studies+0.174
 clipboard+0.168
ossible+0.167
 seven+0.165
Neg logit contributions
 accidents-0.196
ruck-0.185
urg-0.185
icles-0.179
umm-0.172
Token\n
Token ID198
Feature activation-1.04e-01

Pos logit contributions
ars+0.005
 studies+0.005
 clipboard+0.004
 likes+0.004
 seven+0.004
Neg logit contributions
 accidents-0.006
urg-0.005
ruck-0.005
icles-0.005
umm-0.005
Token\n
Token ID198
Feature activation-1.04e-01

Pos logit contributions
 studies+0.006
ars+0.005
 clipboard+0.005
 seven+0.005
 likes+0.005
Neg logit contributions
ruck-0.006
urg-0.006
 accidents-0.006
icles-0.005
ull-0.005
TokenShe
Token ID3347
Feature activation-1.87e+00

Pos logit contributions
 studies+0.007
 hips+0.006
 clipboard+0.006
 pocket+0.006
 nods+0.006
Neg logit contributions
ruck-0.005
 accidents-0.005
asting-0.005
Mill-0.004
urg-0.004
Token.
Token ID13
Feature activation-2.94e-02

Pos logit contributions
ars+0.059
 studies+0.057
 clipboard+0.056
 likes+0.054
 seven+0.053
Neg logit contributions
 accidents-0.080
urg-0.079
ruck-0.076
icles-0.073
umm-0.070
Token It
Token ID632
Feature activation+7.49e-01

Pos logit contributions
ars+0.002
 likes+0.001
 studies+0.001
 clipboard+0.001
 seven+0.001
Neg logit contributions
urg-0.002
 accidents-0.002
ruck-0.002
icles-0.002
umm-0.002
Token runs
Token ID4539
Feature activation-2.19e+00

Pos logit contributions
 accidents+0.042
 happened+0.039
urg+0.037
 lava+0.036
 Baby+0.034
Neg logit contributions
ars-0.038
 clipboard-0.033
 wrist-0.032
aye-0.032
robat-0.032
Token fast
Token ID3049
Feature activation-4.44e+00

Pos logit contributions
 studies+0.139
ars+0.138
 clipboard+0.133
 likes+0.129
 pocket+0.128
Neg logit contributions
 accidents-0.103
ruck-0.097
urg-0.095
icles-0.089
ter-0.081
Token and
Token ID290
Feature activation+1.28e-01

Pos logit contributions
ars+0.220
 studies+0.204
 clipboard+0.202
 likes+0.196
 pocket+0.188
Neg logit contributions
 accidents-0.288
urg-0.278
ruck-0.277
icles-0.256
umm-0.241
Token bar
Token ID2318
Feature activation-6.68e+00

Pos logit contributions
 accidents+0.008
 spilling+0.007
 Twe+0.007
 pouring+0.006
 spending+0.006
Neg logit contributions
ars-0.007
ossible-0.006
ebra-0.006
robat-0.006
 clipboard-0.006
Tokenks
Token ID591
Feature activation-3.73e+00

Pos logit contributions
 studies+0.238
ars+0.235
 clipboard+0.225
 mat+0.218
 pocket+0.216
Neg logit contributions
urg-0.468
ruck-0.462
 accidents-0.461
ull-0.441
icles-0.432
Token loud
Token ID7812
Feature activation-1.46e+00

Pos logit contributions
ars+0.172
 studies+0.169
 likes+0.163
 clipboard+0.161
 pocket+0.154
Neg logit contributions
urg-0.247
 accidents-0.243
ruck-0.234
icles-0.225
umm-0.217
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
ars+0.067
 studies+0.064
 likes+0.063
 clipboard+0.062
 seven+0.060
Neg logit contributions
urg-0.097
 accidents-0.096
ruck-0.092
icles-0.089
umm-0.086
Token Tom
Token ID4186
Feature activation-1.84e+00

Pos logit contributions
ars+0.007
 likes+0.007
 studies+0.007
 clipboard+0.007
 seven+0.007
Neg logit contributions
urg-0.010
 accidents-0.009
ruck-0.009
icles-0.009
umm-0.009
Token and
Token ID290
Feature activation+4.55e-02

Pos logit contributions
ars+0.118
 clipboard+0.115
tail+0.112
ossible+0.111
 headset+0.111
Neg logit contributions
 accidents-0.067
ruck-0.065
urg-0.064
aste-0.057
 Baby-0.055
Token knew
Token ID2993
Feature activation-3.80e-01

Pos logit contributions
urg+0.032
 accidents+0.030
ruck+0.030
icles+0.028
ull+0.028
Neg logit contributions
ars-0.031
 studies-0.030
 clipboard-0.029
 mat-0.029
 likes-0.028
Token that
Token ID326
Feature activation-6.19e-02

Pos logit contributions
 studies+0.016
 likes+0.015
 think+0.015
ars+0.015
 pocket+0.015
Neg logit contributions
urg-0.027
umm-0.025
ruck-0.025
olk-0.025
icles-0.024
Token no
Token ID645
Feature activation+1.84e-01

Pos logit contributions
 likes+0.003
 studies+0.003
 think+0.003
ars+0.003
 seven+0.003
Neg logit contributions
urg-0.004
ruck-0.003
olk-0.003
umm-0.003
icles-0.003
Token matter
Token ID2300
Feature activation+4.63e+00

Pos logit contributions
urg+0.011
 accidents+0.011
ruck+0.010
icles+0.010
umm+0.010
Neg logit contributions
ars-0.010
 likes-0.009
 studies-0.009
 seven-0.009
 clipboard-0.009
Token what
Token ID644
Feature activation+4.78e-01

Pos logit contributions
urg+0.292
umm+0.285
olk+0.268
 accidents+0.249
ruck+0.248
Neg logit contributions
 likes-0.236
 studies-0.226
 think-0.223
ars-0.211
 seven-0.207
Token happened
Token ID3022
Feature activation+9.20e+00

Pos logit contributions
urg+0.029
 accidents+0.028
ruck+0.027
icles+0.026
umm+0.026
Neg logit contributions
ars-0.025
 likes-0.024
 studies-0.024
 clipboard-0.023
 seven-0.023
Token,
Token ID11
Feature activation-6.25e-01

Pos logit contributions
 accidents+0.629
urg+0.614
ruck+0.589
icles+0.561
umm+0.543
Neg logit contributions
ars-0.405
 studies-0.384
 clipboard-0.380
 likes-0.374
 seven-0.370
Token they
Token ID484
Feature activation+1.25e+00

Pos logit contributions
 likes+0.033
 studies+0.032
ars+0.032
 think+0.031
 seven+0.031
Neg logit contributions
urg-0.038
ruck-0.035
 accidents-0.034
icles-0.034
umm-0.034
Token would
Token ID561
Feature activation-2.67e-01

Pos logit contributions
 accidents+0.068
urg+0.067
ruck+0.066
icles+0.060
ull+0.059
Neg logit contributions
ars-0.068
 mat-0.067
 clipboard-0.067
 studies-0.066
 seven-0.064
Token always
Token ID1464
Feature activation-5.51e-01

Pos logit contributions
ars+0.015
 clipboard+0.014
 likes+0.014
 studies+0.014
 seven+0.014
Neg logit contributions
urg-0.015
 accidents-0.014
ruck-0.014
icles-0.013
umm-0.012
Token be
Token ID307
Feature activation-1.66e+00

Pos logit contributions
ars+0.028
 clipboard+0.026
 seven+0.026
 mat+0.026
 studies+0.025
Neg logit contributions
 accidents-0.033
urg-0.032
ruck-0.031
 returning-0.029
icles-0.028
TokenOne
Token ID3198
Feature activation-1.85e+00

Pos logit contributions
ruck+0.013
urg+0.012
 accidents+0.012
ull+0.012
icles+0.011
Neg logit contributions
 studies-0.015
 likes-0.015
 clipboard-0.014
ars-0.014
 sees-0.014
Token day
Token ID1110
Feature activation-1.91e-01

Pos logit contributions
 clipboard+0.116
ars+0.113
 studies+0.112
 hips+0.109
 likes+0.104
Neg logit contributions
 accidents-0.092
ruck-0.090
urg-0.089
ull-0.085
icles-0.081
Token,
Token ID11
Feature activation-4.06e-01

Pos logit contributions
ars+0.005
 seven+0.005
 likes+0.005
 studies+0.005
 clipboard+0.005
Neg logit contributions
urg-0.016
 accidents-0.016
ruck-0.015
umm-0.015
icles-0.015
Token something
Token ID1223
Feature activation+3.16e+00

Pos logit contributions
ars+0.023
 likes+0.022
 seven+0.022
 studies+0.022
 clipboard+0.022
Neg logit contributions
urg-0.023
 accidents-0.022
ruck-0.021
umm-0.021
icles-0.020
Token strange
Token ID6283
Feature activation+2.75e+00

Pos logit contributions
urg+0.194
 accidents+0.194
ruck+0.183
icles+0.175
umm+0.171
Neg logit contributions
ars-0.160
 studies-0.156
 likes-0.154
 clipboard-0.152
 seven-0.147
Token happened
Token ID3022
Feature activation+9.40e+00

Pos logit contributions
urg+0.070
 accidents+0.064
umm+0.059
ruck+0.058
olk+0.052
Neg logit contributions
 likes-0.237
ars-0.237
 studies-0.229
 seven-0.228
 sees-0.226
Token and
Token ID290
Feature activation+3.13e-01

Pos logit contributions
urg+0.619
 accidents+0.608
ruck+0.586
icles+0.567
umm+0.550
Neg logit contributions
ars-0.439
 studies-0.421
 likes-0.418
 clipboard-0.407
 seven-0.401
Token the
Token ID262
Feature activation-4.85e-01

Pos logit contributions
urg+0.020
 accidents+0.020
ruck+0.019
icles+0.018
umm+0.018
Neg logit contributions
ars-0.015
 studies-0.015
 likes-0.015
 clipboard-0.014
 seven-0.014
Token pilot
Token ID8022
Feature activation-7.79e-01

Pos logit contributions
ars+0.026
 studies+0.025
 likes+0.025
 clipboard+0.024
ossible+0.024
Neg logit contributions
 accidents-0.028
ruck-0.027
urg-0.027
umm-0.025
icles-0.025
Token had
Token ID550
Feature activation+1.86e-01

Pos logit contributions
ars+0.039
 likes+0.038
 studies+0.037
 clipboard+0.036
 seven+0.036
Neg logit contributions
urg-0.048
 accidents-0.047
ruck-0.046
icles-0.044
umm-0.043
Token to
Token ID284
Feature activation+4.11e-01

Pos logit contributions
urg+0.011
 accidents+0.011
ruck+0.010
umm+0.010
icles+0.010
Neg logit contributions
ars-0.010
 studies-0.010
 likes-0.010
 seven-0.010
 clipboard-0.009
Token.
Token ID13
Feature activation+3.99e-01

Pos logit contributions
urg+0.179
 accidents+0.176
ruck+0.167
umm+0.165
icles+0.164
Neg logit contributions
ars-0.092
 likes-0.090
 seven-0.087
 studies-0.083
 clipboard-0.082
Token It
Token ID632
Feature activation+9.41e-01

Pos logit contributions
urg+0.022
 accidents+0.022
ruck+0.020
icles+0.020
umm+0.020
Neg logit contributions
ars-0.023
 studies-0.022
 likes-0.022
 clipboard-0.021
 seven-0.021
Token looked
Token ID3114
Feature activation-5.54e-01

Pos logit contributions
 accidents+0.052
urg+0.052
ruck+0.047
icles+0.047
umm+0.045
Neg logit contributions
ars-0.055
 studies-0.052
 clipboard-0.051
 likes-0.049
 seven-0.049
Token even
Token ID772
Feature activation-1.43e-02

Pos logit contributions
ars+0.025
 studies+0.023
 likes+0.023
 clipboard+0.022
 seven+0.022
Neg logit contributions
urg-0.038
 accidents-0.038
ruck-0.036
icles-0.035
umm-0.034
Token y
Token ID331
Feature activation+2.45e+00

Pos logit contributions
ars+0.001
 likes+0.001
 studies+0.001
 clipboard+0.001
 seven+0.001
Neg logit contributions
urg-0.001
 accidents-0.001
ruck-0.001
icles-0.001
umm-0.001
Tokenumm
Token ID13929
Feature activation+9.13e+00

Pos logit contributions
ruck+0.088
 accidents+0.084
urg+0.084
 harming+0.083
ull+0.080
Neg logit contributions
ars-0.171
 likes-0.166
 seven-0.162
 bar-0.159
 pocket-0.155
Tokenier
Token ID959
Feature activation+6.83e+00

Pos logit contributions
urg+0.353
 accidents+0.348
 harming+0.301
ruck+0.289
 Frankie+0.285
Neg logit contributions
 likes-0.656
ars-0.652
 seven-0.618
 bar-0.597
 studies-0.596
Token.
Token ID13
Feature activation+4.04e-01

Pos logit contributions
urg+0.465
 accidents+0.456
ruck+0.440
icles+0.427
umm+0.419
Neg logit contributions
ars-0.304
 likes-0.291
 studies-0.291
 clipboard-0.281
 seven-0.279
Token Lily
Token ID20037
Feature activation+8.49e-01

Pos logit contributions
 accidents+0.021
urg+0.021
ruck+0.019
icles+0.019
umm+0.019
Neg logit contributions
ars-0.024
 studies-0.023
 likes-0.023
 clipboard-0.023
 seven-0.023
Token wanted
Token ID2227
Feature activation+4.29e-01

Pos logit contributions
 accidents+0.043
urg+0.043
ruck+0.038
ull+0.035
 happened+0.035
Neg logit contributions
ars-0.052
 clipboard-0.048
 seven-0.048
 studies-0.047
 mat-0.047
Token to
Token ID284
Feature activation+2.61e-01

Pos logit contributions
urg+0.032
umm+0.029
ruck+0.029
 accidents+0.028
icles+0.027
Neg logit contributions
 bar-0.019
 seven-0.018
 likes-0.018
 studies-0.018
 think-0.018
TokenOne
Token ID3198
Feature activation-2.05e+00

Pos logit contributions
ruck+0.017
urg+0.017
 accidents+0.017
icles+0.016
asting+0.016
Neg logit contributions
 studies-0.011
ars-0.011
 likes-0.011
 clipboard-0.011
 pocket-0.010
Token day
Token ID1110
Feature activation-4.58e-01

Pos logit contributions
ars+0.114
 studies+0.108
 clipboard+0.108
 likes+0.105
 seven+0.103
Neg logit contributions
 accidents-0.118
urg-0.118
ruck-0.115
icles-0.107
umm-0.105
Token,
Token ID11
Feature activation-3.90e-01

Pos logit contributions
 likes+0.008
ars+0.008
 seven+0.008
 bar+0.007
 sees+0.007
Neg logit contributions
urg-0.044
umm-0.042
 accidents-0.041
ruck-0.041
icles-0.041
Token something
Token ID1223
Feature activation+3.21e+00

Pos logit contributions
 likes+0.016
ars+0.016
 bar+0.016
 studies+0.016
 seven+0.016
Neg logit contributions
urg-0.028
ruck-0.025
umm-0.025
 accidents-0.025
icles-0.024
Token special
Token ID2041
Feature activation+6.45e-01

Pos logit contributions
urg+0.227
 accidents+0.212
ruck+0.211
icles+0.210
umm+0.209
Neg logit contributions
ars-0.138
 likes-0.134
 studies-0.130
 seven-0.127
 bar-0.125
Token happened
Token ID3022
Feature activation+9.55e+00

Pos logit contributions
urg+0.020
 accidents+0.016
umm+0.015
olk+0.015
ruck+0.015
Neg logit contributions
 likes-0.054
ars-0.053
 studies-0.052
 bar-0.051
 seven-0.051
Token as
Token ID355
Feature activation-9.78e-01

Pos logit contributions
 accidents+0.596
urg+0.594
ruck+0.571
icles+0.542
umm+0.511
Neg logit contributions
ars-0.478
 studies-0.452
 likes-0.445
 clipboard-0.444
 seven-0.433
Token he
Token ID339
Feature activation-2.98e-01

Pos logit contributions
 likes+0.042
ars+0.041
 studies+0.041
 seven+0.041
 think+0.039
Neg logit contributions
urg-0.068
umm-0.063
 accidents-0.063
olk-0.062
icles-0.061
Token polished
Token ID23895
Feature activation+3.98e-01

Pos logit contributions
ars+0.015
 clipboard+0.015
 mat+0.015
 seven+0.014
 studies+0.014
Neg logit contributions
urg-0.017
 accidents-0.016
ruck-0.016
 happened-0.015
ull-0.014
Token the
Token ID262
Feature activation-7.05e-01

Pos logit contributions
 accidents+0.020
urg+0.020
ruck+0.019
icles+0.019
umm+0.017
Neg logit contributions
ars-0.025
 studies-0.024
 likes-0.023
 clipboard-0.023
ossible-0.023
Token toy
Token ID13373
Feature activation-1.43e-01

Pos logit contributions
ars+0.034
 studies+0.032
 likes+0.032
 clipboard+0.031
 seven+0.030
Neg logit contributions
urg-0.046
 accidents-0.045
ruck-0.044
icles-0.042
umm-0.041
Token hands
Token ID2832
Feature activation-1.05e+00

Pos logit contributions
ars+0.029
 studies+0.025
 mat+0.025
 pose+0.025
 seven+0.024
Neg logit contributions
 accidents-0.040
umm-0.038
urg-0.037
olk-0.037
 Baby-0.036
Token,
Token ID11
Feature activation-3.52e-01

Pos logit contributions
ars+0.043
 studies+0.041
 likes+0.040
 clipboard+0.039
 seven+0.039
Neg logit contributions
urg-0.074
 accidents-0.073
ruck-0.071
icles-0.069
umm-0.067
Token good
Token ID922
Feature activation+2.64e+00

Pos logit contributions
ars+0.019
 studies+0.018
 likes+0.018
 clipboard+0.018
 seven+0.017
Neg logit contributions
 accidents-0.021
urg-0.020
ruck-0.019
icles-0.019
umm-0.018
Token things
Token ID1243
Feature activation+3.27e-01

Pos logit contributions
urg+0.159
 accidents+0.152
icles+0.146
ruck+0.144
umm+0.140
Neg logit contributions
 likes-0.137
ars-0.136
 studies-0.136
 clipboard-0.131
 seven-0.127
Token would
Token ID561
Feature activation+2.28e-02

Pos logit contributions
 accidents+0.020
urg+0.020
ruck+0.019
icles+0.018
umm+0.018
Neg logit contributions
ars-0.017
 studies-0.016
 likes-0.016
 clipboard-0.015
 seven-0.015
Token happen
Token ID1645
Feature activation+6.83e+00

Pos logit contributions
 accidents+0.001
urg+0.001
ruck+0.001
icles+0.001
umm+0.001
Neg logit contributions
ars-0.001
 studies-0.001
 likes-0.001
 clipboard-0.001
 seven-0.001
Token to
Token ID284
Feature activation+4.64e-01

Pos logit contributions
urg+0.435
 accidents+0.429
ruck+0.412
icles+0.398
umm+0.390
Neg logit contributions
ars-0.333
 studies-0.318
 likes-0.316
 clipboard-0.310
 seven-0.303
Token it
Token ID340
Feature activation+8.80e-01

Pos logit contributions
urg+0.025
 accidents+0.024
ruck+0.023
icles+0.022
umm+0.021
Neg logit contributions
ars-0.028
 likes-0.027
 studies-0.027
 clipboard-0.026
 seven-0.026
Token.
Token ID13
Feature activation+2.96e-01

Pos logit contributions
 accidents+0.056
urg+0.056
ruck+0.053
icles+0.051
umm+0.050
Neg logit contributions
ars-0.043
 studies-0.040
 clipboard-0.040
 likes-0.039
 seven-0.039
Token\n
Token ID198
Feature activation+3.55e-01

Pos logit contributions
urg+0.020
 accidents+0.020
ruck+0.019
icles+0.019
umm+0.018
Neg logit contributions
ars-0.013
 studies-0.012
 likes-0.012
 clipboard-0.012
 seven-0.012
Token\n
Token ID198
Feature activation+3.57e-01

Pos logit contributions
urg+0.022
 accidents+0.021
ruck+0.021
icles+0.020
ull+0.019
Neg logit contributions
ars-0.017
 studies-0.017
 likes-0.017
 clipboard-0.017
 seven-0.017
Token so
Token ID523
Feature activation+9.94e-02

Pos logit contributions
urg+0.079
 accidents+0.078
ruck+0.074
icles+0.071
umm+0.068
Neg logit contributions
ars-0.074
 studies-0.071
 likes-0.070
 clipboard-0.068
 seven-0.066
Token peaceful
Token ID12309
Feature activation+2.76e+00

Pos logit contributions
urg+0.007
 accidents+0.007
ruck+0.007
icles+0.006
ull+0.006
Neg logit contributions
ars-0.004
 studies-0.004
 clipboard-0.004
 mat-0.004
 seven-0.004
Token and
Token ID290
Feature activation+5.65e-01

Pos logit contributions
urg+0.192
 accidents+0.186
ruck+0.180
icles+0.176
umm+0.173
Neg logit contributions
ars-0.118
 likes-0.114
 studies-0.113
 seven-0.109
 clipboard-0.108
Token content
Token ID2695
Feature activation+2.53e+00

Pos logit contributions
urg+0.037
 accidents+0.036
ruck+0.035
icles+0.034
umm+0.033
Neg logit contributions
ars-0.027
 likes-0.026
 studies-0.026
 clipboard-0.025
 seven-0.025
Token in
Token ID287
Feature activation-2.17e+00

Pos logit contributions
urg+0.186
 accidents+0.179
ruck+0.175
icles+0.172
umm+0.169
Neg logit contributions
ars-0.097
 likes-0.094
 studies-0.092
 seven-0.089
 bar-0.087
Token Little
Token ID7703
Feature activation+2.51e+00

Pos logit contributions
ars+0.113
 studies+0.110
 likes+0.110
 seven+0.107
 clipboard+0.107
Neg logit contributions
urg-0.132
 accidents-0.129
ruck-0.123
icles-0.120
umm-0.118
Token Mark
Token ID2940
Feature activation-2.91e+00

Pos logit contributions
urg+0.129
 accidents+0.125
ruck+0.120
icles+0.115
umm+0.112
Neg logit contributions
ars-0.154
 likes-0.149
 studies-0.149
 clipboard-0.145
 seven-0.144
Token's
Token ID338
Feature activation+2.38e+00

Pos logit contributions
ars+0.142
 studies+0.138
 clipboard+0.136
 likes+0.134
 mat+0.131
Neg logit contributions
urg-0.178
 accidents-0.174
ruck-0.168
icles-0.157
umm-0.153
Token palm
Token ID18057
Feature activation-1.90e-01

Pos logit contributions
 accidents+0.149
urg+0.140
umm+0.135
ruck+0.132
olk+0.131
Neg logit contributions
ars-0.120
 likes-0.110
oir-0.108
 dice-0.108
 seven-0.107
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
ars+0.010
 clipboard+0.010
 studies+0.009
 think+0.009
oir+0.009
Neg logit contributions
 accidents-0.011
urg-0.011
ruck-0.011
umm-0.010
icles-0.010
Token He
Token ID679
Feature activation-1.12e+00

Pos logit contributions
urg+0.020
 accidents+0.020
ruck+0.019
icles+0.019
umm+0.018
Neg logit contributions
ars-0.014
 likes-0.014
 studies-0.014
 clipboard-0.013
 seven-0.013
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
ars+0.001
 studies+0.001
 likes+0.001
 clipboard+0.001
 seven+0.001
Neg logit contributions
urg-0.002
 accidents-0.002
ruck-0.002
icles-0.002
umm-0.002
Token\n
Token ID198
Feature activation+1.62e-01

Pos logit contributions
urg+0.007
 accidents+0.007
icles+0.006
ruck+0.006
ull+0.006
Neg logit contributions
ars-0.007
 studies-0.007
 likes-0.006
 clipboard-0.006
 pedal-0.006
Token\n
Token ID198
Feature activation+1.74e-01

Pos logit contributions
urg+0.009
 accidents+0.009
ruck+0.009
icles+0.008
ull+0.008
Neg logit contributions
 studies-0.009
 likes-0.008
ars-0.008
 clipboard-0.008
 seven-0.008
TokenThey
Token ID2990
Feature activation+3.41e-01

Pos logit contributions
urg+0.008
 accidents+0.008
ruck+0.008
ull+0.008
icles+0.008
Neg logit contributions
 studies-0.011
 clipboard-0.010
 likes-0.010
 pocket-0.010
 backpack-0.010
Token peek
Token ID27185
Feature activation-1.11e+00

Pos logit contributions
urg+0.017
ruck+0.015
 accidents+0.015
icles+0.015
 Baby+0.014
Neg logit contributions
ars-0.020
 mat-0.020
 studies-0.019
 clipboard-0.019
 likes-0.018
Tokened
Token ID276
Feature activation+4.95e+00

Pos logit contributions
ars+0.074
 studies+0.072
 likes+0.072
 clipboard+0.070
 seven+0.070
Neg logit contributions
urg-0.051
 accidents-0.049
ruck-0.047
icles-0.045
umm-0.043
Token inside
Token ID2641
Feature activation+5.66e-01

Pos logit contributions
 accidents+0.268
urg+0.265
ruck+0.248
icles+0.239
olk+0.228
Neg logit contributions
ars-0.297
 studies-0.277
 likes-0.271
 clipboard-0.269
 mat-0.257
Token the
Token ID262
Feature activation-8.58e-01

Pos logit contributions
urg+0.036
 accidents+0.035
ruck+0.034
icles+0.033
umm+0.033
Neg logit contributions
ars-0.027
 likes-0.026
 studies-0.026
 clipboard-0.026
 seven-0.025
Token house
Token ID2156
Feature activation-1.05e+00

Pos logit contributions
ars+0.047
 likes+0.042
 seven+0.042
 clipboard+0.041
 hips+0.041
Neg logit contributions
 accidents-0.053
urg-0.048
ruck-0.048
olk-0.046
umm-0.044
Token.
Token ID13
Feature activation+1.14e-01

Pos logit contributions
ars+0.052
 likes+0.048
 studies+0.047
 clipboard+0.047
 seven+0.046
Neg logit contributions
 accidents-0.065
urg-0.065
ruck-0.063
icles-0.061
umm-0.058
Token It
Token ID632
Feature activation+9.33e-01

Pos logit contributions
urg+0.007
 accidents+0.006
ruck+0.006
icles+0.006
umm+0.006
Neg logit contributions
ars-0.006
 studies-0.006
 likes-0.006
 clipboard-0.006
 seven-0.006
Token them
Token ID606
Feature activation+1.02e+00

Pos logit contributions
 accidents+0.177
urg+0.176
ruck+0.166
icles+0.159
umm+0.146
Neg logit contributions
ars-0.187
 studies-0.177
 likes-0.175
 clipboard-0.172
 seven-0.168
Token together
Token ID1978
Feature activation-9.02e-01

Pos logit contributions
urg+0.063
 accidents+0.062
ruck+0.059
icles+0.058
umm+0.055
Neg logit contributions
ars-0.051
 likes-0.048
 studies-0.048
 clipboard-0.048
 seven-0.047
Token.
Token ID13
Feature activation+2.31e-01

Pos logit contributions
ars+0.045
 studies+0.042
 clipboard+0.042
 likes+0.042
 seven+0.040
Neg logit contributions
urg-0.056
 accidents-0.055
ruck-0.054
icles-0.051
umm-0.048
Token The
Token ID383
Feature activation-2.06e-01

Pos logit contributions
urg+0.012
 accidents+0.012
ruck+0.012
icles+0.011
umm+0.011
Neg logit contributions
ars-0.014
 studies-0.013
 likes-0.013
 clipboard-0.013
 seven-0.013
Token sound
Token ID2128
Feature activation+8.77e-01

Pos logit contributions
ars+0.013
ighed+0.013
tail+0.012
 studies+0.012
 clipboard+0.012
Neg logit contributions
 accidents-0.010
ruck-0.010
urg-0.009
icles-0.008
asting-0.008
Token of
Token ID286
Feature activation+4.04e+00

Pos logit contributions
urg+0.053
 accidents+0.049
ruck+0.048
icles+0.048
umm+0.047
Neg logit contributions
ars-0.046
 likes-0.046
 studies-0.044
 seven-0.043
 clipboard-0.042
Token the
Token ID262
Feature activation-6.84e-01

Pos logit contributions
urg+0.248
 accidents+0.238
ruck+0.228
icles+0.227
umm+0.226
Neg logit contributions
ars-0.204
 likes-0.201
 studies-0.200
 seven-0.197
 clipboard-0.194
Token drums
Token ID24730
Feature activation-1.06e+00

Pos logit contributions
ars+0.041
 studies+0.041
ighed+0.040
 clipboard+0.039
 pocket+0.039
Neg logit contributions
 accidents-0.034
ruck-0.034
urg-0.032
icles-0.027
asting-0.026
Token filled
Token ID5901
Feature activation+1.94e+00

Pos logit contributions
ars+0.057
 studies+0.056
 clipboard+0.055
 likes+0.054
 seven+0.054
Neg logit contributions
urg-0.061
 accidents-0.060
ruck-0.059
icles-0.055
umm-0.052
Token the
Token ID262
Feature activation-6.87e-01

Pos logit contributions
urg+0.122
 accidents+0.120
ruck+0.116
icles+0.112
umm+0.111
Neg logit contributions
ars-0.096
 likes-0.094
 studies-0.091
 seven-0.091
 clipboard-0.088
Token air
Token ID1633
Feature activation+2.88e-01

Pos logit contributions
ars+0.034
 studies+0.033
 clipboard+0.033
tail+0.031
 bar+0.031
Neg logit contributions
ruck-0.040
 accidents-0.040
urg-0.039
olk-0.035
umm-0.035
Token and
Token ID290
Feature activation+6.84e-01

Pos logit contributions
ars+0.207
 studies+0.195
 likes+0.191
 seven+0.190
 clipboard+0.181
Neg logit contributions
ruck-0.247
urg-0.243
 accidents-0.241
icles-0.238
burn-0.213
Token hung
Token ID9174
Feature activation+1.10e+00

Pos logit contributions
 accidents+0.040
urg+0.038
ruck+0.038
icles+0.035
 happened+0.034
Neg logit contributions
ars-0.038
 studies-0.033
 seven-0.032
ebra-0.032
 clipboard-0.032
Token it
Token ID340
Feature activation+1.23e+00

Pos logit contributions
urg+0.059
 accidents+0.058
ruck+0.055
icles+0.054
umm+0.050
Neg logit contributions
ars-0.066
 studies-0.064
 likes-0.063
 clipboard-0.061
 seven-0.060
Token up
Token ID510
Feature activation+4.74e-01

Pos logit contributions
 accidents+0.061
urg+0.060
ruck+0.057
icles+0.056
asting+0.048
Neg logit contributions
ars-0.075
 studies-0.073
 likes-0.072
 clipboard-0.072
ossible-0.070
Token to
Token ID284
Feature activation+6.57e-01

Pos logit contributions
urg+0.028
 accidents+0.028
ruck+0.028
icles+0.027
umm+0.025
Neg logit contributions
ars-0.025
 studies-0.024
 likes-0.023
 clipboard-0.023
 think-0.022
Token dry
Token ID5894
Feature activation+2.50e+00

Pos logit contributions
urg+0.038
 accidents+0.037
ruck+0.035
icles+0.034
umm+0.033
Neg logit contributions
ars-0.037
 likes-0.035
 studies-0.035
 clipboard-0.034
 seven-0.034
Token.
Token ID13
Feature activation+5.11e-01

Pos logit contributions
urg+0.167
 accidents+0.160
ruck+0.155
umm+0.152
icles+0.150
Neg logit contributions
ars-0.110
 likes-0.108
 studies-0.105
 seven-0.105
 clipboard-0.103
Token The
Token ID383
Feature activation-5.18e-02

Pos logit contributions
urg+0.031
 accidents+0.031
ruck+0.030
icles+0.029
umm+0.028
Neg logit contributions
ars-0.026
 studies-0.025
 likes-0.025
 clipboard-0.024
 seven-0.024
Token next
Token ID1306
Feature activation-1.35e+00

Pos logit contributions
ars+0.003
 studies+0.003
 likes+0.003
ighed+0.003
 clipboard+0.002
Neg logit contributions
 accidents-0.003
ruck-0.003
urg-0.003
icles-0.003
umm-0.003
Token day
Token ID1110
Feature activation-2.47e-01

Pos logit contributions
ars+0.074
 studies+0.071
 clipboard+0.071
 likes+0.070
 seven+0.067
Neg logit contributions
 accidents-0.077
urg-0.076
ruck-0.075
icles-0.071
umm-0.070
Token,
Token ID11
Feature activation-2.55e-01

Pos logit contributions
ars+0.006
 likes+0.005
 studies+0.005
 seven+0.005
 sees+0.005
Neg logit contributions
urg-0.022
 accidents-0.021
umm-0.021
icles-0.021
ruck-0.021
Token fly
Token ID6129
Feature activation-9.33e-01

Pos logit contributions
urg+0.026
 accidents+0.025
ruck+0.024
icles+0.024
umm+0.023
Neg logit contributions
ars-0.016
 studies-0.015
 likes-0.015
 seven-0.014
 clipboard-0.014
Token away
Token ID1497
Feature activation-1.91e+00

Pos logit contributions
 studies+0.056
ars+0.055
 clipboard+0.053
 likes+0.051
ossible+0.048
Neg logit contributions
 accidents-0.052
icles-0.048
urg-0.046
ter-0.045
 Joey-0.045
Token,
Token ID11
Feature activation-5.49e-01

Pos logit contributions
ars+0.079
 likes+0.076
 studies+0.075
 clipboard+0.072
 seven+0.071
Neg logit contributions
urg-0.137
 accidents-0.136
ruck-0.130
icles-0.127
umm-0.124
Token but
Token ID475
Feature activation-5.94e-02

Pos logit contributions
ars+0.029
 studies+0.027
 likes+0.026
 clipboard+0.025
 seven+0.025
Neg logit contributions
 accidents-0.035
urg-0.033
ruck-0.032
icles-0.030
 Mill-0.029
Token the
Token ID262
Feature activation-7.59e-01

Pos logit contributions
ars+0.004
 studies+0.004
 likes+0.004
 clipboard+0.004
 seven+0.004
Neg logit contributions
 accidents-0.003
urg-0.003
icles-0.003
ruck-0.003
olk-0.002
Token hawk
Token ID48710
Feature activation+3.39e+00

Pos logit contributions
ars+0.051
 studies+0.051
 likes+0.050
 clipboard+0.049
 seven+0.048
Neg logit contributions
 accidents-0.034
urg-0.033
ruck-0.031
icles-0.029
umm-0.029
Token was
Token ID373
Feature activation+1.21e+00

Pos logit contributions
 accidents+0.177
urg+0.172
ruck+0.160
icles+0.157
ull+0.147
Neg logit contributions
 studies-0.199
ars-0.196
 clipboard-0.194
 seven-0.185
 mat-0.185
Token too
Token ID1165
Feature activation-3.48e+00

Pos logit contributions
 accidents+0.080
urg+0.077
ruck+0.074
icles+0.071
ull+0.067
Neg logit contributions
ars-0.059
 studies-0.057
 likes-0.055
 clipboard-0.055
 mat-0.051
Token fast
Token ID3049
Feature activation-4.31e+00

Pos logit contributions
ars+0.125
 studies+0.123
 likes+0.117
 clipboard+0.117
 seven+0.111
Neg logit contributions
urg-0.264
 accidents-0.263
ruck-0.253
icles-0.242
olk-0.237
Token.
Token ID13
Feature activation+7.99e-02

Pos logit contributions
ars+0.170
 likes+0.165
 studies+0.163
 seven+0.154
 clipboard+0.154
Neg logit contributions
urg-0.317
 accidents-0.305
ruck-0.295
icles-0.292
umm-0.289
Token So
Token ID1406
Feature activation-2.49e-01

Pos logit contributions
urg+0.006
 accidents+0.005
ruck+0.005
icles+0.005
umm+0.005
Neg logit contributions
ars-0.003
 likes-0.003
 studies-0.003
 clipboard-0.003
 seven-0.003
Token his
Token ID465
Feature activation+1.46e+00

Pos logit contributions
ars+0.111
 pocket+0.098
belt+0.095
ebra+0.092
ang+0.092
Neg logit contributions
 accidents-0.105
urg-0.101
 happened-0.091
ruck-0.089
icles-0.086
Token tail
Token ID7894
Feature activation-7.21e-01

Pos logit contributions
 accidents+0.083
urg+0.083
ruck+0.082
icles+0.078
umm+0.072
Neg logit contributions
ars-0.077
 studies-0.067
 likes-0.065
 seven-0.064
ang-0.064
Token and
Token ID290
Feature activation+5.15e-01

Pos logit contributions
ars+0.033
 studies+0.033
 clipboard+0.031
 likes+0.031
 seven+0.030
Neg logit contributions
urg-0.047
 accidents-0.046
ruck-0.045
icles-0.044
umm-0.041
Token bar
Token ID2318
Feature activation-6.47e+00

Pos logit contributions
 accidents+0.032
urg+0.031
ruck+0.029
icles+0.028
 happened+0.027
Neg logit contributions
ars-0.027
 studies-0.024
 clipboard-0.023
 seven-0.022
ossible-0.022
Tokenks
Token ID591
Feature activation-3.57e+00

Pos logit contributions
 studies+0.236
 mat+0.232
ars+0.224
 likes+0.215
 pedal+0.212
Neg logit contributions
urg-0.466
ruck-0.448
 accidents-0.444
ull-0.435
icles-0.424
Token.
Token ID13
Feature activation+4.92e-02

Pos logit contributions
ars+0.172
 studies+0.171
 likes+0.163
 clipboard+0.160
 pocket+0.159
Neg logit contributions
urg-0.227
 accidents-0.223
ruck-0.213
icles-0.205
ull-0.195
Token\n
Token ID198
Feature activation-3.46e-02

Pos logit contributions
urg+0.003
 accidents+0.003
icles+0.003
ruck+0.002
 Baby+0.002
Neg logit contributions
ars-0.003
 studies-0.003
 likes-0.003
 clipboard-0.002
 seven-0.002
Token\n
Token ID198
Feature activation-1.49e-02

Pos logit contributions
 studies+0.002
 clipboard+0.002
 likes+0.002
ars+0.002
 pocket+0.002
Neg logit contributions
ruck-0.002
urg-0.002
ull-0.002
 accidents-0.002
icles-0.001
Token"
Token ID1
Feature activation-1.65e+00

Pos logit contributions
 studies+0.001
 clipboard+0.001
 likes+0.001
 hips+0.001
 pocket+0.001
Neg logit contributions
ruck-0.001
Mill-0.001
 accidents-0.001
ull-0.001
Ab-0.001
TokenMax
Token ID11518
Feature activation-1.47e+00

Pos logit contributions
 studies+0.106
ars+0.102
 likes+0.098
 clipboard+0.097
 pocket+0.096
Neg logit contributions
 accidents-0.078
ruck-0.075
urg-0.074
icles-0.073
umm-0.068
Token,
Token ID11
Feature activation-5.52e-01

Pos logit contributions
ars+0.064
 studies+0.063
 clipboard+0.060
 mat+0.057
 seven+0.056
Neg logit contributions
urg-0.099
 accidents-0.097
ruck-0.092
icles-0.090
umm-0.085
Token dead
Token ID2636
Feature activation+3.41e+00

Pos logit contributions
urg+0.073
 accidents+0.071
ruck+0.069
icles+0.067
umm+0.067
Neg logit contributions
ars-0.042
 likes-0.042
 studies-0.040
 seven-0.040
 clipboard-0.040
Token.
Token ID13
Feature activation-4.47e-01

Pos logit contributions
urg+0.268
 accidents+0.253
ruck+0.250
icles+0.246
umm+0.243
Neg logit contributions
 likes-0.120
ars-0.119
 studies-0.114
 seven-0.107
 bar-0.105
Token Ben
Token ID3932
Feature activation-2.34e+00

Pos logit contributions
ars+0.021
 studies+0.020
 likes+0.020
 clipboard+0.019
 seven+0.019
Neg logit contributions
urg-0.030
 accidents-0.029
ruck-0.028
icles-0.027
umm-0.027
Token felt
Token ID2936
Feature activation-1.71e-01

Pos logit contributions
ars+0.145
 mat+0.136
 clipboard+0.135
 seven+0.134
tail+0.134
Neg logit contributions
 accidents-0.110
urg-0.108
 happened-0.100
ull-0.094
ruck-0.093
Token sad
Token ID6507
Feature activation-6.70e-02

Pos logit contributions
ars+0.008
 studies+0.007
 seven+0.007
 clipboard+0.007
 likes+0.007
Neg logit contributions
 accidents-0.012
urg-0.011
ruck-0.011
icles-0.010
olk-0.010
Token.
Token ID13
Feature activation-4.98e-01

Pos logit contributions
ars+0.003
 studies+0.003
 clipboard+0.003
 likes+0.003
 seven+0.003
Neg logit contributions
 accidents-0.005
urg-0.005
ruck-0.005
icles-0.004
umm-0.004
Token He
Token ID679
Feature activation-1.33e+00

Pos logit contributions
ars+0.022
 studies+0.021
 likes+0.021
 clipboard+0.021
 seven+0.020
Neg logit contributions
urg-0.034
 accidents-0.033
ruck-0.032
icles-0.031
umm-0.031
Token wanted
Token ID2227
Feature activation+9.26e-02

Pos logit contributions
ars+0.073
 mat+0.072
 studies+0.070
 poster+0.069
 clipboard+0.068
Neg logit contributions
urg-0.070
 accidents-0.070
 happened-0.067
ruck-0.066
ull-0.060
Token to
Token ID284
Feature activation+1.47e-01

Pos logit contributions
urg+0.006
umm+0.006
 accidents+0.006
ruck+0.006
icles+0.005
Neg logit contributions
 likes-0.004
 studies-0.004
ars-0.004
 clipboard-0.004
 bar-0.004
Token help
Token ID1037
Feature activation-2.23e+00

Pos logit contributions
urg+0.010
 accidents+0.010
ruck+0.010
icles+0.010
olk+0.009
Neg logit contributions
ars-0.006
 studies-0.006
 likes-0.006
 seven-0.006
 clipboard-0.005
Token the
Token ID262
Feature activation-1.08e+00

Pos logit contributions
ars+0.101
 likes+0.098
 clipboard+0.096
 studies+0.095
 seven+0.094
Neg logit contributions
urg-0.150
 accidents-0.146
ruck-0.142
umm-0.139
icles-0.135
Token it
Token ID340
Feature activation+8.41e-01

Pos logit contributions
urg+0.101
ruck+0.091
 harming+0.089
umm+0.087
icles+0.086
Neg logit contributions
 pocket-0.060
 studies-0.058
 bar-0.057
 think-0.056
 high-0.056
Token was
Token ID373
Feature activation+1.41e+00

Pos logit contributions
urg+0.057
 accidents+0.056
ruck+0.055
olk+0.053
umm+0.053
Neg logit contributions
ars-0.037
 likes-0.036
 studies-0.035
 seven-0.035
 sees-0.035
Token time
Token ID640
Feature activation+3.44e+00

Pos logit contributions
 accidents+0.099
urg+0.099
umm+0.097
keeping+0.094
ruck+0.093
Neg logit contributions
 seven-0.060
 high-0.054
 low-0.053
 likes-0.050
ossible-0.049
Token to
Token ID284
Feature activation+3.60e-01

Pos logit contributions
urg+0.228
 accidents+0.224
umm+0.219
icles+0.210
ruck+0.210
Neg logit contributions
ars-0.154
 seven-0.151
 studies-0.151
 likes-0.151
 sees-0.148
Token go
Token ID467
Feature activation+6.86e-01

Pos logit contributions
 accidents+0.024
urg+0.024
ruck+0.023
umm+0.022
icles+0.022
Neg logit contributions
 think-0.016
 studies-0.016
ars-0.016
 likes-0.015
 seven-0.015
Token home
Token ID1363
Feature activation-1.66e+00

Pos logit contributions
urg+0.039
 accidents+0.038
ruck+0.036
icles+0.035
umm+0.034
Neg logit contributions
ars-0.039
 likes-0.038
 studies-0.037
 clipboard-0.036
 seven-0.036
Token,
Token ID11
Feature activation-4.37e-01

Pos logit contributions
ars+0.050
 studies+0.050
 likes+0.048
 clipboard+0.047
 seven+0.045
Neg logit contributions
urg-0.134
 accidents-0.132
ruck-0.129
icles-0.126
umm-0.125
Token Bobby
Token ID17155
Feature activation+3.49e+00

Pos logit contributions
 think+0.030
 pocket+0.030
 studies+0.030
 pedal+0.030
 likes+0.029
Neg logit contributions
urg-0.019
ruck-0.015
umm-0.015
olk-0.015
icles-0.014
Token was
Token ID373
Feature activation+1.42e+00

Pos logit contributions
urg+0.222
 accidents+0.214
ruck+0.210
icles+0.199
umm+0.189
Neg logit contributions
ars-0.172
 studies-0.161
 likes-0.160
 clipboard-0.156
 seven-0.156
Token so
Token ID523
Feature activation+1.41e-01

Pos logit contributions
urg+0.105
 accidents+0.103
ruck+0.100
icles+0.097
umm+0.096
Neg logit contributions
ars-0.056
 studies-0.053
 likes-0.053
 clipboard-0.051
 seven-0.051
Token happy
Token ID3772
Feature activation-1.89e-01

Pos logit contributions
urg+0.011
 accidents+0.011
ruck+0.011
icles+0.010
umm+0.010
Neg logit contributions
ars-0.005
 studies-0.004
 likes-0.004
 clipboard-0.004
 seven-0.004
Token\n
Token ID198
Feature activation+3.12e-01

Pos logit contributions
urg+0.019
 accidents+0.018
ruck+0.018
icles+0.018
ull+0.017
Neg logit contributions
ars-0.017
 studies-0.016
 likes-0.016
 clipboard-0.016
 seven-0.016
TokenOne
Token ID3198
Feature activation-2.09e+00

Pos logit contributions
 accidents+0.020
ruck+0.020
urg+0.019
icles+0.018
asting+0.018
Neg logit contributions
ars-0.014
 clipboard-0.014
 studies-0.014
 pocket-0.013
 hips-0.013
Token day
Token ID1110
Feature activation-4.82e-01

Pos logit contributions
ars+0.116
 studies+0.111
 clipboard+0.110
 likes+0.106
 seven+0.106
Neg logit contributions
urg-0.117
 accidents-0.116
ruck-0.115
icles-0.106
umm-0.103
Token,
Token ID11
Feature activation-4.06e-01

Pos logit contributions
 likes+0.007
ars+0.006
 sees+0.005
 seven+0.005
 bar+0.005
Neg logit contributions
urg-0.050
umm-0.048
ruck-0.047
icles-0.046
keeping-0.046
Token Lily
Token ID20037
Feature activation+8.80e-01

Pos logit contributions
 likes+0.021
 seven+0.020
ars+0.020
 bar+0.019
 clipboard+0.019
Neg logit contributions
urg-0.026
ruck-0.024
umm-0.024
 accidents-0.023
icles-0.023
Token saw
Token ID2497
Feature activation-3.33e-01

Pos logit contributions
urg+0.066
 accidents+0.064
icles+0.062
ruck+0.062
umm+0.061
Neg logit contributions
 likes-0.034
ars-0.033
 studies-0.033
 sees-0.031
 seven-0.030
Token a
Token ID257
Feature activation-2.17e-01

Pos logit contributions
 sees+0.011
 bar+0.011
 likes+0.011
 clipboard+0.010
 seven+0.010
Neg logit contributions
umm-0.027
urg-0.026
icles-0.024
ruck-0.024
 accidents-0.023
Token boy
Token ID2933
Feature activation-3.38e-01

Pos logit contributions
ars+0.011
 studies+0.010
 likes+0.010
 seven+0.010
 clipboard+0.009
Neg logit contributions
 accidents-0.014
urg-0.013
ruck-0.013
umm-0.012
icles-0.012
Token who
Token ID508
Feature activation-1.31e+00

Pos logit contributions
ars+0.015
 likes+0.015
 studies+0.015
 clipboard+0.014
 seven+0.014
Neg logit contributions
urg-0.023
 accidents-0.022
ruck-0.021
icles-0.021
umm-0.021
Token was
Token ID373
Feature activation+1.45e+00

Pos logit contributions
ars+0.072
 clipboard+0.067
 seven+0.067
 studies+0.066
 mat+0.066
Neg logit contributions
 accidents-0.075
urg-0.073
ruck-0.072
icles-0.067
 happened-0.065
Token sad
Token ID6507
Feature activation+8.48e-02

Pos logit contributions
urg+0.091
 accidents+0.090
ruck+0.086
icles+0.083
umm+0.080
Neg logit contributions
ars-0.074
 likes-0.069
 studies-0.069
 clipboard-0.067
 seven-0.066
Token are
Token ID389
Feature activation-6.23e-01

Pos logit contributions
asting+0.015
urg+0.015
ruck+0.015
 Baby+0.015
 Wil+0.014
Neg logit contributions
ang-0.017
 clipboard-0.017
ebra-0.017
 hips-0.017
ars-0.017
Token happy
Token ID3772
Feature activation-4.02e-01

Pos logit contributions
ars+0.028
 likes+0.027
 studies+0.026
 clipboard+0.026
 seven+0.025
Neg logit contributions
urg-0.043
 accidents-0.042
ruck-0.040
icles-0.039
umm-0.039
Token to
Token ID284
Feature activation+8.60e-02

Pos logit contributions
 likes+0.015
 studies+0.014
ars+0.014
 sees+0.013
 think+0.013
Neg logit contributions
urg-0.031
ruck-0.029
icles-0.029
umm-0.029
 accidents-0.029
Token be
Token ID307
Feature activation-1.80e+00

Pos logit contributions
urg+0.006
ruck+0.006
 accidents+0.006
icles+0.005
olk+0.005
Neg logit contributions
ars-0.004
 studies-0.004
 likes-0.004
 clipboard-0.003
 seven-0.003
Token alive
Token ID6776
Feature activation+1.95e+00

Pos logit contributions
ars+0.077
 seven+0.075
 likes+0.072
 clipboard+0.065
 sees+0.065
Neg logit contributions
umm-0.128
 accidents-0.127
urg-0.126
icles-0.122
olk-0.118
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
urg+0.134
 accidents+0.132
ruck+0.128
icles+0.124
umm+0.121
Neg logit contributions
ars-0.086
 studies-0.081
 likes-0.080
 clipboard-0.079
 seven-0.077
Token<|endoftext|>
Token ID50256
Feature activation+5.82e-01

Pos logit contributions
ars+0.010
 likes+0.009
 studies+0.009
 clipboard+0.009
 seven+0.009
Neg logit contributions
urg-0.014
 accidents-0.014
ruck-0.013
icles-0.013
umm-0.013
TokenL
Token ID43
Feature activation+6.43e-02

Pos logit contributions
urg+0.038
 accidents+0.037
ruck+0.035
icles+0.034
umm+0.034
Neg logit contributions
ars-0.028
 likes-0.027
 studies-0.027
 clipboard-0.026
 seven-0.026
Tokenily
Token ID813
Feature activation+7.76e-01

Pos logit contributions
 accidents+0.005
umm+0.005
urg+0.005
icles+0.004
ruck+0.004
Neg logit contributions
ars-0.002
 likes-0.002
 seven-0.002
ossible-0.002
 sees-0.002
Token and
Token ID290
Feature activation+4.39e-01

Pos logit contributions
urg+0.049
ruck+0.049
 accidents+0.047
 happened+0.044
olk+0.044
Neg logit contributions
ars-0.036
 clipboard-0.035
 hips-0.032
 studies-0.031
 seven-0.031
Token Tom
Token ID4186
Feature activation-1.46e+00

Pos logit contributions
urg+0.027
 accidents+0.027
ruck+0.026
icles+0.025
umm+0.025
Neg logit contributions
ars-0.022
 likes-0.022
 studies-0.022
 seven-0.021
 clipboard-0.021
Token that
Token ID326
Feature activation+9.29e-03

Pos logit contributions
ars+0.024
 likes+0.024
 studies+0.024
 seven+0.023
 clipboard+0.023
Neg logit contributions
urg-0.038
 accidents-0.036
ruck-0.036
icles-0.035
umm-0.035
Token his
Token ID465
Feature activation-6.29e-01

Pos logit contributions
 accidents+0.001
urg+0.001
ruck+0.001
icles+0.001
umm+0.001
Neg logit contributions
ars-0.000
 studies-0.000
 likes-0.000
 clipboard-0.000
 seven-0.000
Token mum
Token ID25682
Feature activation-1.69e-02

Pos logit contributions
ars+0.045
 clipboard+0.040
tail+0.039
 likes+0.039
 seven+0.039
Neg logit contributions
 accidents-0.030
urg-0.028
umm-0.026
 Joey-0.024
olk-0.024
Token had
Token ID550
Feature activation+1.32e-01

Pos logit contributions
ars+0.001
 seven+0.001
 clipboard+0.001
 mat+0.001
 studies+0.001
Neg logit contributions
 accidents-0.001
ruck-0.001
urg-0.001
icles-0.001
 happened-0.001
Token replied
Token ID8712
Feature activation+1.35e+00

Pos logit contributions
urg+0.009
umm+0.009
 accidents+0.009
icles+0.009
olk+0.009
Neg logit contributions
ars-0.006
 likes-0.006
 studies-0.006
 think-0.005
 seven-0.005
Token with
Token ID351
Feature activation-2.53e+00

Pos logit contributions
urg+0.095
 accidents+0.095
ruck+0.091
icles+0.088
umm+0.085
Neg logit contributions
ars-0.056
 studies-0.054
 likes-0.053
 clipboard-0.051
 seven-0.051
Token kindness
Token ID23887
Feature activation+3.30e+00

Pos logit contributions
ars+0.153
 studies+0.146
 likes+0.146
 clipboard+0.142
 seven+0.139
Neg logit contributions
 accidents-0.132
urg-0.129
ruck-0.123
icles-0.117
umm-0.112
Token instead
Token ID2427
Feature activation-3.10e+00

Pos logit contributions
urg+0.221
 accidents+0.220
ruck+0.213
icles+0.205
umm+0.199
Neg logit contributions
ars-0.147
 studies-0.141
 likes-0.139
 clipboard-0.136
 seven-0.136
Token of
Token ID286
Feature activation+3.93e+00

Pos logit contributions
ars+0.153
 likes+0.148
 studies+0.146
 seven+0.141
 clipboard+0.141
Neg logit contributions
urg-0.195
 accidents-0.188
ruck-0.182
umm-0.176
icles-0.176
Token sc
Token ID629
Feature activation-3.88e+00

Pos logit contributions
 accidents+0.200
urg+0.200
ruck+0.187
icles+0.179
umm+0.170
Neg logit contributions
ars-0.247
 likes-0.235
 studies-0.233
 seven-0.230
 clipboard-0.229
Tokenolding
Token ID33266
Feature activation-5.76e-01

Pos logit contributions
 likes+0.236
 clipboard+0.234
 seven+0.233
 poster+0.229
oir+0.226
Neg logit contributions
 accidents-0.176
urg-0.175
ruck-0.167
icking-0.153
umm-0.152
Token boy
Token ID2933
Feature activation+1.51e-01

Pos logit contributions
urg+0.037
 accidents+0.036
ruck+0.036
umm+0.034
icles+0.032
Neg logit contributions
ars-0.034
 studies-0.031
 likes-0.030
 clipboard-0.030
 seven-0.030
Token.
Token ID13
Feature activation+8.98e-02

Pos logit contributions
urg+0.010
 accidents+0.009
ruck+0.009
icles+0.009
ull+0.008
Neg logit contributions
ars-0.007
 studies-0.007
 likes-0.007
 clipboard-0.007
 seven-0.007
Token His
Token ID2399
Feature activation+1.63e+00

Pos logit contributions
 accidents+0.005
urg+0.005
icles+0.005
ruck+0.005
 Joey+0.005
Neg logit contributions
ars-0.005
 studies-0.005
 clipboard-0.005
 likes-0.004
 seven-0.004
Token name
Token ID1438
Feature activation-3.20e+00

Pos logit contributions
urg+0.099
 accidents+0.097
ruck+0.092
icles+0.088
umm+0.088
Neg logit contributions
ars-0.084
 seven-0.072
 studies-0.068
 likes-0.067
tail-0.066
Token was
Token ID373
Feature activation+1.14e+00

Pos logit contributions
ars+0.173
 likes+0.167
 studies+0.167
 clipboard+0.162
 seven+0.161
Neg logit contributions
urg-0.187
 accidents-0.182
ruck-0.175
icles-0.169
umm-0.166
Token Jake
Token ID14757
Feature activation-2.70e+00

Pos logit contributions
urg+0.060
 accidents+0.058
ruck+0.056
icles+0.054
umm+0.054
Neg logit contributions
 likes-0.066
ars-0.066
 studies-0.065
 seven-0.064
 clipboard-0.064
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
ars+0.147
 studies+0.136
 seven+0.129
 mat+0.129
 clipboard+0.129
Neg logit contributions
urg-0.153
 accidents-0.153
ruck-0.152
icles-0.140
eddy-0.134
Token Every
Token ID3887
Feature activation-1.17e+00

Pos logit contributions
ruck+0.007
 Bobby+0.006
 accidents+0.006
icles+0.006
urg+0.006
Neg logit contributions
ars-0.009
 clipboard-0.008
 studies-0.008
 mat-0.008
 seven-0.008
Token morning
Token ID3329
Feature activation+7.97e-01

Pos logit contributions
ars+0.076
 clipboard+0.074
 hips+0.070
ighed+0.070
 likes+0.069
Neg logit contributions
 accidents-0.054
urg-0.053
ruck-0.052
ull-0.049
 happened-0.047
Token he
Token ID339
Feature activation-4.74e-01

Pos logit contributions
urg+0.054
 accidents+0.053
ruck+0.051
icles+0.050
umm+0.048
Neg logit contributions
ars-0.036
 likes-0.034
 studies-0.034
 clipboard-0.033
 seven-0.033
Token loved
Token ID6151
Feature activation-1.54e+00

Pos logit contributions
ars+0.031
 clipboard+0.027
 poster+0.027
 locker+0.027
ighed+0.027
Neg logit contributions
urg-0.022
ruck-0.021
 accidents-0.020
 happened-0.020
 Baby-0.018
Token soon
Token ID2582
Feature activation-6.71e-01

Pos logit contributions
ars+0.123
 clipboard+0.117
 likes+0.115
 seven+0.113
ebra+0.112
Neg logit contributions
 accidents-0.098
ruck-0.092
urg-0.090
keeping-0.085
 returning-0.082
Token.
Token ID13
Feature activation-2.23e-01

Pos logit contributions
ars+0.031
 studies+0.030
 likes+0.030
 clipboard+0.030
 seven+0.029
Neg logit contributions
urg-0.043
 accidents-0.043
ruck-0.041
icles-0.039
umm-0.038
Token She
Token ID1375
Feature activation-8.86e-01

Pos logit contributions
ars+0.011
 seven+0.010
 clipboard+0.010
 studies+0.010
 likes+0.010
Neg logit contributions
 accidents-0.014
urg-0.013
ruck-0.013
icles-0.013
umm-0.013
Token wanted
Token ID2227
Feature activation+2.72e-01

Pos logit contributions
ars+0.050
 clipboard+0.046
 mat+0.045
 hips+0.044
 poster+0.043
Neg logit contributions
urg-0.045
 accidents-0.043
ruck-0.041
 happened-0.041
 Baby-0.038
Token to
Token ID284
Feature activation+2.91e-01

Pos logit contributions
urg+0.018
umm+0.017
 accidents+0.017
ruck+0.017
icles+0.016
Neg logit contributions
 studies-0.013
 seven-0.012
 bar-0.012
 likes-0.012
 clipboard-0.012
Token see
Token ID766
Feature activation-2.37e+00

Pos logit contributions
urg+0.018
ruck+0.017
 accidents+0.017
icles+0.016
 Mill+0.016
Neg logit contributions
ars-0.014
 likes-0.013
 seven-0.013
ang-0.013
 clipboard-0.012
Token more
Token ID517
Feature activation+7.65e-02

Pos logit contributions
ars+0.129
 studies+0.124
 likes+0.124
 clipboard+0.121
 seven+0.120
Neg logit contributions
urg-0.138
 accidents-0.135
ruck-0.129
icles-0.125
umm-0.121
Token books
Token ID3835
Feature activation-3.18e+00

Pos logit contributions
 accidents+0.004
ruck+0.004
urg+0.004
reating+0.004
icles+0.003
Neg logit contributions
ars-0.004
 seven-0.004
ighed-0.004
 hips-0.004
 clipboard-0.004
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
ars+0.174
 likes+0.170
 clipboard+0.169
 studies+0.168
 seven+0.166
Neg logit contributions
 accidents-0.175
urg-0.167
ruck-0.166
icles-0.156
umm-0.147
Token She
Token ID1375
Feature activation-8.70e-01

Pos logit contributions
ars+0.012
 seven+0.011
 clipboard+0.011
 studies+0.011
 likes+0.011
Neg logit contributions
 accidents-0.013
urg-0.012
umm-0.012
ruck-0.012
icles-0.012
Token wanted
Token ID2227
Feature activation+2.70e-01

Pos logit contributions
ars+0.050
 clipboard+0.045
 mat+0.045
 poster+0.043
tail+0.043
Neg logit contributions
urg-0.043
 accidents-0.041
 happened-0.040
ruck-0.039
 Baby-0.036
Token Everyone
Token ID11075
Feature activation+9.99e-01

Pos logit contributions
urg+0.010
 accidents+0.009
ruck+0.009
icles+0.009
umm+0.009
Neg logit contributions
ars-0.007
 studies-0.007
 likes-0.007
 clipboard-0.007
 seven-0.007
Token loved
Token ID6151
Feature activation-1.35e+00

Pos logit contributions
 accidents+0.061
urg+0.060
ruck+0.059
ull+0.056
icles+0.054
Neg logit contributions
ars-0.051
 clipboard-0.048
 studies-0.048
 seven-0.047
 pocket-0.046
Token the
Token ID262
Feature activation-6.02e-01

Pos logit contributions
 likes+0.071
 studies+0.070
ars+0.070
 seven+0.068
 clipboard+0.067
Neg logit contributions
urg-0.080
 accidents-0.075
umm-0.075
ruck-0.074
icles-0.073
Token film
Token ID2646
Feature activation+3.03e+00

Pos logit contributions
ars+0.037
 studies+0.037
 seven+0.036
 clipboard+0.035
 likes+0.035
Neg logit contributions
urg-0.029
 accidents-0.029
ruck-0.028
icles-0.025
umm-0.025
Token and
Token ID290
Feature activation+5.23e-01

Pos logit contributions
 accidents+0.195
urg+0.194
ruck+0.186
icles+0.178
umm+0.176
Neg logit contributions
ars-0.141
 studies-0.133
 clipboard-0.132
 likes-0.132
 seven-0.130
Token hoped
Token ID10719
Feature activation-2.72e+00

Pos logit contributions
urg+0.034
 accidents+0.033
ruck+0.032
icles+0.031
umm+0.031
Neg logit contributions
ars-0.025
 likes-0.024
 studies-0.024
 clipboard-0.023
 seven-0.023
Token it
Token ID340
Feature activation+8.37e-01

Pos logit contributions
ars+0.129
 likes+0.129
 studies+0.127
 seven+0.125
 think+0.121
Neg logit contributions
urg-0.174
umm-0.165
olk-0.162
 accidents-0.159
ruck-0.158
Token would
Token ID561
Feature activation-7.90e-02

Pos logit contributions
 accidents+0.040
urg+0.040
ruck+0.038
icles+0.036
rest+0.035
Neg logit contributions
ars-0.050
 clipboard-0.049
 studies-0.049
 mat-0.048
 pocket-0.047
Token last
Token ID938
Feature activation-9.64e-02

Pos logit contributions
ars+0.004
 studies+0.004
 likes+0.004
 clipboard+0.004
 mat+0.004
Neg logit contributions
urg-0.005
 accidents-0.005
ruck-0.005
icles-0.004
ull-0.004
Token forever
Token ID8097
Feature activation+1.29e+00

Pos logit contributions
ars+0.006
 likes+0.006
 studies+0.006
 clipboard+0.006
 seven+0.006
Neg logit contributions
 accidents-0.005
urg-0.005
ruck-0.004
icles-0.004
umm-0.004
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
urg+0.084
 accidents+0.083
ruck+0.081
icles+0.077
umm+0.076
Neg logit contributions
ars-0.060
 studies-0.058
 likes-0.057
 clipboard-0.056
 seven-0.054
Token nodded
Token ID14464
Feature activation-2.27e+00

Pos logit contributions
urg+0.070
 accidents+0.065
 Bobby+0.064
 Joey+0.063
 Mill+0.063
Neg logit contributions
ars-0.076
 mat-0.064
 clipboard-0.064
 locker-0.062
 studies-0.061
Token and
Token ID290
Feature activation+5.48e-01

Pos logit contributions
ars+0.112
 studies+0.106
 likes+0.103
 clipboard+0.100
 seven+0.098
Neg logit contributions
urg-0.149
 accidents-0.146
ruck-0.139
icles-0.136
umm-0.129
Token they
Token ID484
Feature activation+1.61e+00

Pos logit contributions
urg+0.034
 accidents+0.034
ruck+0.032
icles+0.031
 Mill+0.031
Neg logit contributions
ars-0.028
 studies-0.026
 clipboard-0.025
 likes-0.025
 mat-0.024
Token began
Token ID2540
Feature activation+1.73e+00

Pos logit contributions
urg+0.078
ruck+0.071
icles+0.071
 Bobby+0.068
 Joey+0.067
Neg logit contributions
 studies-0.099
 clipboard-0.098
ars-0.097
 poster-0.096
 mat-0.094
Token to
Token ID284
Feature activation+2.25e-01

Pos logit contributions
urg+0.097
 accidents+0.095
ruck+0.091
icles+0.088
umm+0.087
Neg logit contributions
ars-0.097
 studies-0.095
 likes-0.094
 clipboard-0.092
 seven-0.092
Token play
Token ID711
Feature activation-4.62e+00

Pos logit contributions
urg+0.015
ruck+0.014
icles+0.014
 accidents+0.014
 Joey+0.014
Neg logit contributions
ars-0.011
 clipboard-0.009
 likes-0.009
 studies-0.009
ebra-0.009
Token mar
Token ID1667
Feature activation-2.55e+00

Pos logit contributions
 clipboard+0.307
 studies+0.306
ars+0.297
 likes+0.283
 trusts+0.282
Neg logit contributions
 accidents-0.200
urg-0.175
 such-0.169
ull-0.161
ruck-0.157
Tokenbles
Token ID7689
Feature activation-3.88e+00

Pos logit contributions
 likes+0.140
 studies+0.139
 clipboard+0.138
 seven+0.133
 bar+0.130
Neg logit contributions
urg-0.149
 accidents-0.136
ruck-0.134
icles-0.129
eddy-0.123
Token.
Token ID13
Feature activation+2.40e-01

Pos logit contributions
 clipboard+0.245
 studies+0.241
 likes+0.237
ars+0.230
 seven+0.224
Neg logit contributions
 accidents-0.186
urg-0.185
ruck-0.179
icles-0.172
ull-0.160
Token Sarah
Token ID10490
Feature activation+5.74e-01

Pos logit contributions
urg+0.015
 accidents+0.014
ruck+0.014
icles+0.013
umm+0.013
Neg logit contributions
 likes-0.012
ars-0.012
 studies-0.012
 seven-0.012
 clipboard-0.012
Token was
Token ID373
Feature activation+1.17e+00

Pos logit contributions
urg+0.029
ruck+0.028
 accidents+0.027
 Bobby+0.026
 happened+0.026
Neg logit contributions
ars-0.033
 clipboard-0.031
ossible-0.030
 studies-0.030
oir-0.030