TOP ACTIVATIONS
MAX = 11.738

using his problem - s olving skills . From that day
The storm th und ered and roared above ,
Every time it th und ered afterwards , the famous
sc ooter . She tr und led along the sidewalk all
day . They worked delic ately and carefully and nothing broke
aded in the water delic ately with his fishing rod .
uff led his hair affection ately . The
to be restored to its former glory ."
the village back to its former glory !"
, leaving it to its former state .
through the water and delic ately scoop ed up some of
of sprink les and delic ately spread them over the ice
dad taught her about ig lo os . Now , she
he got it so the rightful owner could find it .
put them back in their rightful place . After
back the village to its former state . With the help
had just heard about ig lo os and she wanted to
. Sue learned that ig lo os are made of snow
little girl 's shoulders affection ately . Sarah smiled
the stolen squares to their rightful place . The

MIN ACTIVATIONS
MIN = -11.843

?" L ila and Ben look at the ice - cream
Tom and Lily look at the salad . It
were twins who liked to look at the stars at night
Tom and Mia look at Mom . They see
mom called him over to look at something . She pointed
higher and higher . They look at each other 's towers
Tim my thought it was a great idea and
Lily . She loved to look at herself in the mirror
drink some water . They look around . They see their
Lily and Ben like to look at the bugs and wave
playing . They look out the window . They
then Tom made his doll look crazy and I tried to
want to get closer and look inside . "
over to take a closer look and saw that it was
liked to dig holes and look for treasures . Sara found
Lily . She loved to look up at the sky and
lf . She likes to look at the pictures and the
, " S ara , look , a cloth !"
Ell ie 's friend thought it was crazy how much Ellie
reach the top . They look down and see a green

INTERVAL 5.843 - 11.738
CONTAINS 1.577%

fun . What can we do ?" They pick
was awesome ! Let 's do it again soon ?"
always thought of ways she could help her family and friends
were very popular and his siblings praised him . Max was
from the top of the chim ney ! <|endoftext|> Once upon

INTERVAL -0.052 - 5.843
CONTAINS 53.405%

felt very content , knowing her mum had stepped into her
they got angry and started to quarrel . They were both
famous mel on around . They pushed it along the ground
he looked there were tall trees with deep roots . As
my smiled and she said , " It doesn 't matter

INTERVAL -5.947 - -0.052
CONTAINS 43.026%

frightened by the wild girl . The wild girl
It turned out that the mighty lake was home
of himself and let out a big roar of happiness .
on the floor . Sarah was so busy playing
fountain with water . They ran to the fountain .

INTERVAL -11.843 - -5.947
CONTAINS 1.991%

ines he is one of them . He puts on a
a naughty dragon . You lied to us . You are
got his turn and he threw the light really high .
go outside and play but it was raining . She looked
Mom and Dad hugged them and gave them some cookies
Token using
Token ID1262
Feature activation+1.06e+00

Pos logit contributions
 Kang+0.028
fl+0.028
 Australian+0.027
stack+0.027
umm+0.027
Neg logit contributions
 closer-0.028
 sure-0.027
 sigh-0.026
 bite-0.026
 harder-0.026
Token his
Token ID465
Feature activation+1.53e+00

Pos logit contributions
stack+0.049
 Kang+0.047
 mastering+0.047
olving+0.047
 lived+0.046
Neg logit contributions
 closer-0.029
 bite-0.028
 sigh-0.027
ax-0.025
 harder-0.024
Token problem
Token ID1917
Feature activation+3.76e+00

Pos logit contributions
stack+0.060
 mastering+0.059
und+0.059
 lived+0.057
olving+0.057
Neg logit contributions
 closer-0.051
 bite-0.050
 sigh-0.049
 sandwich-0.045
 slice-0.044
Token-
Token ID12
Feature activation+1.49e+00

Pos logit contributions
stack+0.199
 Kang+0.195
 Australian+0.193
 mastering+0.190
ric+0.189
Neg logit contributions
 closer-0.079
 sigh-0.067
 bite-0.067
hell-0.064
 harder-0.062
Tokens
Token ID82
Feature activation+3.58e+00

Pos logit contributions
 mastering+0.063
stack+0.063
olving+0.061
 lived+0.060
 Kang+0.059
Neg logit contributions
 closer-0.045
 bite-0.040
ax-0.040
awk-0.040
 sigh-0.040
Tokenolving
Token ID10890
Feature activation+1.17e+01

Pos logit contributions
stack+0.153
 Kang+0.147
und+0.143
ric+0.143
 Australian+0.141
Neg logit contributions
 closer-0.113
 bite-0.113
 sigh-0.109
 harder-0.100
 slice-0.099
Token skills
Token ID4678
Feature activation+3.61e+00

Pos logit contributions
 mastering+0.430
 Kang+0.415
stack+0.415
 Australian+0.398
 lived+0.397
Neg logit contributions
 closer-0.411
 bite-0.406
 sigh-0.392
ls-0.391
hell-0.379
Token.
Token ID13
Feature activation+2.99e-01

Pos logit contributions
 mastering+0.182
stack+0.179
 Kang+0.178
olving+0.173
ric+0.172
Neg logit contributions
 closer-0.081
ax-0.069
 bite-0.069
 sigh-0.068
hell-0.066
Token From
Token ID3574
Feature activation+3.66e+00

Pos logit contributions
 mastering+0.012
olving+0.012
ating+0.012
stack+0.011
tails+0.011
Neg logit contributions
ax-0.009
 sigh-0.009
 closer-0.009
five-0.008
hell-0.008
Token that
Token ID326
Feature activation-2.99e+00

Pos logit contributions
 mastering+0.169
olving+0.167
stack+0.161
und+0.156
 lived+0.156
Neg logit contributions
 closer-0.100
 sigh-0.089
 bite-0.089
ax-0.088
ls-0.086
Token day
Token ID1110
Feature activation+2.89e+00

Pos logit contributions
 bite+0.107
 closer+0.107
 sigh+0.101
 slice+0.094
 sip+0.092
Neg logit contributions
 mastering-0.109
stack-0.106
olving-0.105
ric-0.105
arians-0.104
Token
Token ID198
Feature activation-8.12e-01

Pos logit contributions
 closer+0.047
 bite+0.045
 sigh+0.044
awk+0.043
ax+0.042
Neg logit contributions
 mastering-0.064
olving-0.064
stack-0.062
 lived-0.061
und-0.060
Token
Token ID198
Feature activation-8.08e-01

Pos logit contributions
 closer+0.029
awk+0.028
 sigh+0.028
 bite+0.027
ax+0.027
Neg logit contributions
 mastering-0.029
 lived-0.028
stack-0.028
und-0.028
olving-0.027
TokenThe
Token ID464
Feature activation+5.06e+00

Pos logit contributions
 closer+0.029
 bite+0.027
 sigh+0.026
ax+0.026
awk+0.025
Neg logit contributions
 mastering-0.030
olving-0.030
stack-0.029
 lived-0.027
ric-0.027
Token storm
Token ID6388
Feature activation-5.79e-01

Pos logit contributions
 mastering+0.215
und+0.212
stack+0.203
olving+0.203
ric+0.201
Neg logit contributions
 closer-0.169
 bite-0.159
 sigh-0.153
 slice-0.139
 harder-0.138
Token th
Token ID294
Feature activation+1.75e+00

Pos logit contributions
 closer+0.022
 sigh+0.020
 bite+0.020
ls+0.019
 sip+0.018
Neg logit contributions
 mastering-0.022
olving-0.021
ric-0.020
und-0.020
stack-0.020
Tokenund
Token ID917
Feature activation+1.15e+01

Pos logit contributions
 mastering+0.062
stack+0.058
olving+0.057
 Zo+0.056
 rarely+0.054
Neg logit contributions
awk-0.065
ls-0.061
les-0.060
umbs-0.058
 bite-0.058
Tokenered
Token ID1068
Feature activation-3.03e+00

Pos logit contributions
 mastering+0.622
olving+0.605
ric+0.598
stack+0.592
 Kang+0.577
Neg logit contributions
ls-0.258
 closer-0.211
 bite-0.202
les-0.194
awk-0.189
Token and
Token ID290
Feature activation-3.90e+00

Pos logit contributions
 closer+0.075
 sigh+0.066
 bite+0.066
awk+0.064
ls+0.063
Neg logit contributions
 mastering-0.147
olving-0.144
stack-0.141
 lived-0.137
und-0.135
Token roared
Token ID45615
Feature activation-8.58e-01

Pos logit contributions
 closer+0.122
 bite+0.111
 sigh+0.110
ls+0.103
awk+0.103
Neg logit contributions
 mastering-0.167
olving-0.164
stack-0.156
und-0.152
ric-0.152
Token above
Token ID2029
Feature activation-3.08e+00

Pos logit contributions
 closer+0.024
 sigh+0.021
 bite+0.021
awk+0.020
ax+0.020
Neg logit contributions
 mastering-0.039
olving-0.038
stack-0.038
ric-0.036
 lived-0.036
Token,
Token ID11
Feature activation+2.09e+00

Pos logit contributions
 closer+0.081
 bite+0.070
 sigh+0.070
 back+0.064
ls+0.061
Neg logit contributions
 mastering-0.156
olving-0.151
ric-0.147
stack-0.144
 Kang-0.141
Token
Token ID198
Feature activation-7.68e-01

Pos logit contributions
awk+0.031
 closer+0.031
 bite+0.029
 sigh+0.029
ls+0.029
Neg logit contributions
 lived-0.024
 mastering-0.023
und-0.023
stack-0.022
olving-0.022
TokenEvery
Token ID6109
Feature activation+2.71e+00

Pos logit contributions
 closer+0.025
 bite+0.023
 sigh+0.023
ax+0.023
hell+0.022
Neg logit contributions
 mastering-0.030
olving-0.030
stack-0.029
ric-0.029
 lived-0.028
Token time
Token ID640
Feature activation+9.27e-01

Pos logit contributions
 mastering+0.103
olving+0.101
arians+0.093
ric+0.093
und+0.093
Neg logit contributions
 closer-0.102
 bite-0.098
 sigh-0.095
 sip-0.088
 sandwich-0.086
Token it
Token ID340
Feature activation-9.19e+00

Pos logit contributions
ric+0.057
 former+0.055
olving+0.054
 mastering+0.054
stack+0.054
Neg logit contributions
 closer-0.018
 it-0.014
 out-0.012
 back-0.012
 harder-0.012
Token th
Token ID294
Feature activation+1.65e+00

Pos logit contributions
 closer+0.304
 bite+0.287
 sigh+0.280
ls+0.254
 falls+0.242
Neg logit contributions
 mastering-0.401
ric-0.393
olving-0.386
stack-0.384
 perspective-0.376
Tokenund
Token ID917
Feature activation+1.15e+01

Pos logit contributions
 mastering+0.048
stack+0.045
spe+0.044
 rarely+0.043
olving+0.043
Neg logit contributions
awk-0.072
ls-0.066
ax-0.065
iffs-0.064
irted-0.064
Tokenered
Token ID1068
Feature activation-3.07e+00

Pos logit contributions
ric+0.548
 mastering+0.545
olving+0.524
stack+0.510
 neighbourhood+0.508
Neg logit contributions
ls-0.338
 closer-0.290
 bite-0.279
 falls-0.263
les-0.260
Token afterwards
Token ID12979
Feature activation+3.07e+00

Pos logit contributions
 closer+0.087
 sigh+0.073
 bite+0.072
ls+0.072
awk+0.069
Neg logit contributions
 mastering-0.144
olving-0.141
stack-0.139
ric-0.134
 lived-0.132
Token,
Token ID11
Feature activation+2.03e+00

Pos logit contributions
ric+0.155
 mastering+0.148
olving+0.143
stack+0.140
 perspective+0.136
Neg logit contributions
 closer-0.085
 sigh-0.077
 bite-0.075
ls-0.072
 back-0.068
Token the
Token ID262
Feature activation+8.39e-01

Pos logit contributions
olving+0.104
 mastering+0.104
ric+0.101
und+0.100
stack+0.099
Neg logit contributions
 closer-0.049
 sigh-0.040
 bite-0.039
 harder-0.037
ls-0.036
Token famous
Token ID5863
Feature activation+3.08e+00

Pos logit contributions
 mastering+0.036
olving+0.034
und+0.034
stack+0.033
ric+0.033
Neg logit contributions
 closer-0.028
 bite-0.026
 sigh-0.026
 harder-0.023
awk-0.023
Token sc
Token ID629
Feature activation+1.21e+00

Pos logit contributions
 closer+0.001
 bite+0.001
 sigh+0.001
 sandwich+0.001
awk+0.001
Neg logit contributions
 mastering-0.002
und-0.002
olving-0.002
ating-0.002
stack-0.002
Tokenooter
Token ID25141
Feature activation-1.93e+00

Pos logit contributions
 mastering+0.046
olving+0.045
stack+0.044
 lived+0.043
und+0.042
Neg logit contributions
 closer-0.041
 bite-0.039
 sigh-0.038
awk-0.037
ax-0.037
Token.
Token ID13
Feature activation-4.51e-02

Pos logit contributions
 closer+0.052
 back+0.046
 bite+0.045
 sigh+0.045
 look+0.043
Neg logit contributions
stack-0.089
olving-0.088
 mastering-0.087
 importantly-0.083
 Kang-0.083
Token She
Token ID1375
Feature activation+3.70e+00

Pos logit contributions
 closer+0.002
 sigh+0.002
 bite+0.001
ax+0.001
awk+0.001
Neg logit contributions
 mastering-0.002
olving-0.002
stack-0.002
 lived-0.002
und-0.002
Token tr
Token ID491
Feature activation+8.16e+00

Pos logit contributions
olving+0.137
 mastering+0.135
stack+0.135
und+0.124
ric+0.122
Neg logit contributions
 closer-0.132
 sigh-0.126
 bite-0.124
ax-0.120
ls-0.117
Tokenund
Token ID917
Feature activation+1.14e+01

Pos logit contributions
 mastering+0.231
olving+0.230
stack+0.221
 lived+0.209
 Kang+0.199
Neg logit contributions
 closer-0.342
ax-0.339
awk-0.333
 sigh-0.332
 bite-0.330
Tokenled
Token ID992
Feature activation-1.53e+00

Pos logit contributions
 mastering+0.533
olving+0.523
stack+0.513
 lived+0.497
und+0.492
Neg logit contributions
 closer-0.290
 bite-0.267
 sigh-0.267
awk-0.262
ax-0.259
Token along
Token ID1863
Feature activation+2.81e+00

Pos logit contributions
 closer+0.050
 bite+0.048
awk+0.047
 sigh+0.047
ax+0.047
Neg logit contributions
 mastering-0.059
olving-0.057
stack-0.056
 lived-0.056
und-0.054
Token the
Token ID262
Feature activation+4.94e-01

Pos logit contributions
olving+0.146
stack+0.144
ric+0.142
 mastering+0.141
 Kang+0.140
Neg logit contributions
 closer-0.062
 bite-0.055
 sigh-0.054
 harder-0.046
five-0.046
Token sidewalk
Token ID24058
Feature activation+4.99e+00

Pos logit contributions
 mastering+0.019
olving+0.019
stack+0.018
und+0.018
 lived+0.018
Neg logit contributions
 closer-0.017
 sigh-0.016
 bite-0.016
ax-0.015
awk-0.015
Token all
Token ID477
Feature activation-9.22e-01

Pos logit contributions
 mastering+0.244
olving+0.243
stack+0.243
 Kang+0.240
ric+0.233
Neg logit contributions
 closer-0.111
 bite-0.101
 sigh-0.100
five-0.093
 harder-0.086
Token day
Token ID1110
Feature activation+2.89e+00

Pos logit contributions
 closer+0.009
 bite+0.009
 harder+0.008
 sigh+0.008
five+0.007
Neg logit contributions
arians-0.021
stack-0.021
 Kang-0.020
 Zo-0.019
 mastering-0.019
Token.
Token ID13
Feature activation+5.20e-01

Pos logit contributions
 mastering+0.129
olving+0.127
stack+0.124
 lived+0.122
und+0.119
Neg logit contributions
 closer-0.080
 bite-0.074
 sigh-0.073
ax-0.072
awk-0.070
Token They
Token ID1119
Feature activation+2.83e+00

Pos logit contributions
 mastering+0.020
olving+0.020
 lived+0.019
ating+0.019
stack+0.019
Neg logit contributions
ax-0.017
 closer-0.017
 sigh-0.016
five-0.015
awk-0.015
Token worked
Token ID3111
Feature activation-3.95e-01

Pos logit contributions
 mastering+0.123
olving+0.122
 situations+0.118
stack+0.116
 Kang+0.115
Neg logit contributions
 closer-0.084
 bite-0.080
 sigh-0.077
ax-0.075
 slice-0.073
Token delic
Token ID8675
Feature activation+2.66e+00

Pos logit contributions
 closer+0.009
 sigh+0.008
 bite+0.007
ls+0.007
 harder+0.007
Neg logit contributions
stack-0.021
 mastering-0.020
olving-0.020
 lived-0.019
 Kang-0.019
Tokenately
Token ID1286
Feature activation+1.14e+01

Pos logit contributions
 forest+0.091
stack+0.091
 lived+0.089
jack+0.086
 mastering+0.086
Neg logit contributions
ax-0.101
awk-0.095
 closer-0.094
ls-0.092
 sigh-0.092
Token and
Token ID290
Feature activation-3.75e+00

Pos logit contributions
stack+0.515
 mastering+0.502
 Kang+0.488
 Zo+0.482
jack+0.479
Neg logit contributions
 closer-0.308
 bite-0.303
 sigh-0.284
five-0.269
 harder-0.262
Token carefully
Token ID7773
Feature activation+1.75e+00

Pos logit contributions
 closer+0.109
 bite+0.102
 sigh+0.102
ax+0.097
 harder+0.092
Neg logit contributions
stack-0.161
olving-0.160
 Kang-0.160
 mastering-0.160
 Australian-0.152
Token and
Token ID290
Feature activation-3.78e+00

Pos logit contributions
stack+0.081
 Kang+0.081
 mastering+0.080
 Australian+0.079
ric+0.078
Neg logit contributions
 closer-0.047
 bite-0.043
 sigh-0.043
 harder-0.039
 sip-0.039
Token nothing
Token ID2147
Feature activation-5.01e+00

Pos logit contributions
 closer+0.108
 bite+0.098
 sigh+0.098
ax+0.097
bs+0.089
Neg logit contributions
 mastering-0.167
olving-0.164
stack-0.161
ric-0.158
 Kang-0.157
Token broke
Token ID6265
Feature activation+3.10e-01

Pos logit contributions
 closer+0.167
 sigh+0.155
 bite+0.154
ls+0.148
ax+0.145
Neg logit contributions
stack-0.203
 mastering-0.202
olving-0.196
ric-0.191
 Kang-0.188
Tokenaded
Token ID5286
Feature activation+1.37e+00

Pos logit contributions
 mastering+0.053
olving+0.052
 lived+0.050
und+0.050
stack+0.049
Neg logit contributions
 closer-0.042
 sigh-0.039
 bite-0.039
awk-0.037
ax-0.037
Token in
Token ID287
Feature activation+3.61e+00

Pos logit contributions
 mastering+0.051
olving+0.050
 lived+0.050
und+0.049
stack+0.049
Neg logit contributions
 sigh-0.045
 closer-0.044
ax-0.043
 bite-0.042
awk-0.042
Token the
Token ID262
Feature activation+1.08e+00

Pos logit contributions
 mastering+0.165
olving+0.162
stack+0.159
 lived+0.156
und+0.151
Neg logit contributions
 closer-0.103
 bite-0.096
 sigh-0.093
awk-0.085
ax-0.085
Token water
Token ID1660
Feature activation-2.37e-01

Pos logit contributions
 mastering+0.043
olving+0.043
stack+0.042
 lived+0.042
 forest+0.041
Neg logit contributions
 closer-0.032
 sigh-0.031
 bite-0.030
awk-0.030
ax-0.030
Token delic
Token ID8675
Feature activation+2.76e+00

Pos logit contributions
 closer+0.006
 bite+0.005
 sigh+0.005
awk+0.005
ax+0.005
Neg logit contributions
 mastering-0.011
olving-0.011
stack-0.011
 lived-0.011
und-0.010
Tokenately
Token ID1286
Feature activation+1.13e+01

Pos logit contributions
 mastering+0.090
olving+0.087
stack+0.086
 lived+0.083
und+0.081
Neg logit contributions
 closer-0.108
 bite-0.103
awk-0.103
ax-0.103
 sigh-0.103
Token with
Token ID351
Feature activation-7.38e-01

Pos logit contributions
 mastering+0.520
olving+0.508
stack+0.490
 lived+0.480
und+0.475
Neg logit contributions
 closer-0.306
 bite-0.289
awk-0.278
 sigh-0.276
ax-0.264
Token his
Token ID465
Feature activation+1.84e+00

Pos logit contributions
 closer+0.023
 sigh+0.022
awk+0.022
ax+0.022
ls+0.022
Neg logit contributions
 mastering-0.028
 lived-0.028
olving-0.028
und-0.026
stack-0.026
Token fishing
Token ID12478
Feature activation+6.41e-01

Pos logit contributions
 mastering+0.086
olving+0.084
stack+0.082
 lived+0.080
und+0.079
Neg logit contributions
 closer-0.048
 bite-0.044
 sigh-0.044
awk-0.044
ax-0.043
Token rod
Token ID15299
Feature activation+1.34e+00

Pos logit contributions
 mastering+0.032
olving+0.030
stack+0.029
und+0.028
 lived+0.028
Neg logit contributions
 closer-0.016
 bite-0.015
awk-0.014
 sigh-0.014
 sandwich-0.014
Token.
Token ID13
Feature activation+5.24e-01

Pos logit contributions
 mastering+0.063
olving+0.061
stack+0.059
und+0.058
 lived+0.057
Neg logit contributions
 closer-0.035
 bite-0.032
 sigh-0.031
ls-0.030
awk-0.030
Tokenuff
Token ID1648
Feature activation+4.99e+00

Pos logit contributions
 mastering+0.087
stack+0.086
 lived+0.084
olving+0.083
 tre+0.079
Neg logit contributions
ax-0.085
 closer-0.084
awk-0.082
 sigh-0.078
 bite-0.076
Tokenled
Token ID992
Feature activation-1.29e+00

Pos logit contributions
 mastering+0.233
olving+0.228
 lived+0.223
stack+0.222
und+0.219
Neg logit contributions
 closer-0.123
 bite-0.115
 sigh-0.114
awk-0.111
ax-0.107
Token his
Token ID465
Feature activation+1.42e+00

Pos logit contributions
 closer+0.035
 sigh+0.031
 bite+0.031
ax+0.029
awk+0.029
Neg logit contributions
 mastering-0.061
olving-0.059
stack-0.058
 lived-0.057
und-0.056
Token hair
Token ID4190
Feature activation+3.30e+00

Pos logit contributions
 mastering+0.073
und+0.071
olving+0.070
stack+0.070
 lived+0.067
Neg logit contributions
 closer-0.032
 sigh-0.031
 sandwich-0.031
 bite-0.030
 back-0.030
Token affection
Token ID17696
Feature activation+1.21e+00

Pos logit contributions
 mastering+0.179
olving+0.172
stack+0.171
 lived+0.168
 Kang+0.167
Neg logit contributions
 closer-0.062
 bite-0.056
 sigh-0.055
ax-0.051
hell-0.050
Tokenately
Token ID1286
Feature activation+1.13e+01

Pos logit contributions
 mastering+0.045
stack+0.043
olving+0.042
 lived+0.041
 Australian+0.040
Neg logit contributions
 closer-0.044
ax-0.041
ls-0.039
awk-0.039
 sigh-0.038
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
 mastering+0.517
olving+0.508
stack+0.496
 lived+0.486
und+0.479
Neg logit contributions
 closer-0.294
 sigh-0.270
 bite-0.269
awk-0.264
ax-0.262
Token
Token ID220
Feature activation-1.70e+00

Pos logit contributions
 mastering+0.015
olving+0.015
stack+0.014
 lived+0.014
ating+0.014
Neg logit contributions
 closer-0.007
 sigh-0.006
ax-0.006
 bite-0.006
awk-0.006
Token
Token ID198
Feature activation-9.51e-01

Pos logit contributions
 closer+0.052
 bite+0.050
 sigh+0.049
awk+0.049
ax+0.047
Neg logit contributions
 mastering-0.070
olving-0.069
stack-0.066
 lived-0.066
und-0.065
Token
Token ID198
Feature activation-9.39e-01

Pos logit contributions
 closer+0.026
 sigh+0.024
 bite+0.024
ax+0.023
awk+0.023
Neg logit contributions
 mastering-0.043
olving-0.042
stack-0.041
 lived-0.040
und-0.039
TokenThe
Token ID464
Feature activation+4.98e+00

Pos logit contributions
 closer+0.023
 slice+0.021
ax+0.020
 sigh+0.020
five+0.019
Neg logit contributions
ric-0.044
olving-0.044
 mastering-0.044
stack-0.044
 situations-0.042
Token to
Token ID284
Feature activation+1.32e+00

Pos logit contributions
 mastering+0.144
olving+0.138
stack+0.137
 lived+0.133
und+0.131
Neg logit contributions
 closer-0.067
 bite-0.059
 sigh-0.058
awk-0.058
ax-0.057
Token be
Token ID307
Feature activation+2.56e+00

Pos logit contributions
 mastering+0.061
olving+0.059
stack+0.058
und+0.057
 lived+0.056
Neg logit contributions
 closer-0.036
 bite-0.034
 sigh-0.033
 look-0.029
 sip-0.029
Token restored
Token ID15032
Feature activation-1.15e+00

Pos logit contributions
stack+0.121
 mastering+0.121
olving+0.117
und+0.113
 Kang+0.112
Neg logit contributions
 closer-0.067
 sigh-0.059
 bite-0.059
awk-0.055
ax-0.054
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
 closer+0.025
 bite+0.021
 sigh+0.021
awk+0.020
ax+0.020
Neg logit contributions
 mastering-0.060
olving-0.058
stack-0.058
 lived-0.056
und-0.055
Token its
Token ID663
Feature activation+7.90e-01

Pos logit contributions
stack+0.058
 mastering+0.058
jack+0.056
olving+0.056
und+0.055
Neg logit contributions
 closer-0.035
 bite-0.033
 sigh-0.031
 look-0.028
 sip-0.027
Token former
Token ID1966
Feature activation+1.13e+01

Pos logit contributions
und+0.030
 mastering+0.030
ric+0.028
stack+0.028
ating+0.028
Neg logit contributions
 closer-0.029
 bite-0.026
 sigh-0.025
 back-0.025
 harder-0.024
Token glory
Token ID13476
Feature activation+1.01e+01

Pos logit contributions
stack+0.264
 mastering+0.248
jack+0.235
 Zo+0.226
und+0.225
Neg logit contributions
 sigh-0.540
 closer-0.531
 bite-0.527
ls-0.509
awk-0.509
Token."
Token ID526
Feature activation-2.87e+00

Pos logit contributions
 mastering+0.519
olving+0.501
stack+0.501
und+0.485
 lived+0.483
Neg logit contributions
 closer-0.216
 bite-0.185
 sigh-0.185
ax-0.180
awk-0.177
Token
Token ID220
Feature activation-1.50e+00

Pos logit contributions
 closer+0.074
 sigh+0.068
ax+0.065
 bite+0.064
awk+0.063
Neg logit contributions
 mastering-0.138
olving-0.136
stack-0.130
 lived-0.124
jack-0.124
Token
Token ID198
Feature activation-8.25e-01

Pos logit contributions
 closer+0.049
 bite+0.047
 sigh+0.046
awk+0.045
ax+0.043
Neg logit contributions
 mastering-0.059
olving-0.058
stack-0.057
 lived-0.056
und-0.056
Token
Token ID198
Feature activation-8.44e-01

Pos logit contributions
 closer+0.024
 bite+0.022
 sigh+0.022
awk+0.022
ax+0.022
Neg logit contributions
 mastering-0.035
olving-0.035
stack-0.034
 lived-0.033
und-0.033
Token the
Token ID262
Feature activation+8.76e-01

Pos logit contributions
 closer+0.007
 sigh+0.007
 bite+0.007
awk+0.006
ax+0.006
Neg logit contributions
 mastering-0.016
olving-0.016
stack-0.016
 lived-0.016
und-0.015
Token village
Token ID7404
Feature activation+9.88e+00

Pos logit contributions
 mastering+0.029
olving+0.027
stack+0.027
und+0.026
 lived+0.026
Neg logit contributions
 closer-0.035
 bite-0.034
 sigh-0.034
awk-0.033
ax-0.032
Token back
Token ID736
Feature activation-2.99e+00

Pos logit contributions
 mastering+0.460
olving+0.446
stack+0.442
 lived+0.427
und+0.426
Neg logit contributions
 closer-0.262
 bite-0.236
 sigh-0.236
awk-0.224
ax-0.221
Token to
Token ID284
Feature activation+1.30e+00

Pos logit contributions
 closer+0.071
 bite+0.061
awk+0.059
 sigh+0.059
ls+0.056
Neg logit contributions
 mastering-0.151
olving-0.144
stack-0.144
 lived-0.139
jack-0.137
Token its
Token ID663
Feature activation+6.59e-01

Pos logit contributions
 mastering+0.057
stack+0.056
jack+0.056
olving+0.055
und+0.053
Neg logit contributions
 closer-0.036
 bite-0.035
 sigh-0.033
 slice-0.031
 sip-0.030
Token former
Token ID1966
Feature activation+1.12e+01

Pos logit contributions
 mastering+0.025
und+0.024
ric+0.024
stack+0.023
 lived+0.023
Neg logit contributions
 closer-0.024
 bite-0.023
 sigh-0.022
 back-0.021
 sandwich-0.021
Token glory
Token ID13476
Feature activation+1.01e+01

Pos logit contributions
stack+0.246
 mastering+0.238
jack+0.226
olving+0.210
 Zo+0.210
Neg logit contributions
 sigh-0.555
 bite-0.546
 closer-0.544
awk-0.523
hell-0.522
Token!"
Token ID2474
Feature activation-3.51e+00

Pos logit contributions
 mastering+0.506
olving+0.487
stack+0.486
und+0.471
jack+0.469
Neg logit contributions
 closer-0.225
 sigh-0.196
 bite-0.196
ax-0.187
ls-0.184
Token
Token ID220
Feature activation-1.57e+00

Pos logit contributions
 closer+0.090
 sigh+0.088
ax+0.086
five+0.083
 bite+0.081
Neg logit contributions
olving-0.166
 mastering-0.163
stack-0.151
 lived-0.149
ating-0.148
Token
Token ID198
Feature activation-9.36e-01

Pos logit contributions
 closer+0.046
 bite+0.043
 sigh+0.043
awk+0.042
ax+0.041
Neg logit contributions
 mastering-0.067
olving-0.066
stack-0.064
 lived-0.063
und-0.063
Token
Token ID198
Feature activation-9.26e-01

Pos logit contributions
 closer+0.025
 sigh+0.023
 bite+0.022
ax+0.022
awk+0.022
Neg logit contributions
 mastering-0.043
olving-0.043
stack-0.042
 lived-0.040
und-0.040
Token,
Token ID11
Feature activation+2.12e+00

Pos logit contributions
 mastering+0.273
olving+0.261
 Kang+0.248
stack+0.248
 lived+0.247
Neg logit contributions
 closer-0.135
 bite-0.119
 sigh-0.117
ls-0.106
ax-0.105
Token leaving
Token ID4305
Feature activation-2.10e+00

Pos logit contributions
 mastering+0.097
olving+0.095
stack+0.093
 lived+0.091
und+0.089
Neg logit contributions
 closer-0.055
awk-0.051
 bite-0.051
 sigh-0.051
ax-0.050
Token it
Token ID340
Feature activation-9.00e+00

Pos logit contributions
 closer+0.058
 bite+0.051
 sigh+0.051
ax+0.049
ls+0.048
Neg logit contributions
 mastering-0.097
olving-0.094
stack-0.092
und-0.090
 Kang-0.088
Token to
Token ID284
Feature activation+1.30e+00

Pos logit contributions
 closer+0.229
 bite+0.207
 sigh+0.204
awk+0.186
ls+0.185
Neg logit contributions
 mastering-0.431
olving-0.421
stack-0.414
 Kang-0.401
und-0.400
Token its
Token ID663
Feature activation+7.36e-01

Pos logit contributions
 mastering+0.056
olving+0.055
stack+0.053
und+0.052
 lived+0.052
Neg logit contributions
 closer-0.039
 bite-0.037
 sigh-0.036
awk-0.033
 sip-0.033
Token former
Token ID1966
Feature activation+1.12e+01

Pos logit contributions
 mastering+0.031
und+0.030
olving+0.029
ating+0.028
 lived+0.028
Neg logit contributions
 closer-0.026
 bite-0.023
 sigh-0.022
 sandwich-0.021
 back-0.021
Token state
Token ID1181
Feature activation+5.94e+00

Pos logit contributions
stack+0.273
 mastering+0.260
jack+0.252
olving+0.237
 Zo+0.234
Neg logit contributions
 sigh-0.528
 closer-0.522
 bite-0.517
awk-0.494
ls-0.493
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
 mastering+0.298
stack+0.288
 Kang+0.284
olving+0.283
 Australian+0.281
Neg logit contributions
 closer-0.140
 sigh-0.116
 bite-0.114
ax-0.112
hell-0.108
Token
Token ID220
Feature activation-1.66e+00

Pos logit contributions
 mastering+0.015
olving+0.015
stack+0.014
 lived+0.014
ating+0.014
Neg logit contributions
 closer-0.009
 sigh-0.008
ax-0.008
 bite-0.008
awk-0.007
Token
Token ID198
Feature activation-9.28e-01

Pos logit contributions
 closer+0.049
 bite+0.046
 sigh+0.045
awk+0.045
ax+0.044
Neg logit contributions
 mastering-0.070
olving-0.069
stack-0.067
 lived-0.066
und-0.065
Token
Token ID198
Feature activation-9.18e-01

Pos logit contributions
 closer+0.028
awk+0.026
 bite+0.026
 sigh+0.026
ax+0.025
Neg logit contributions
 mastering-0.039
olving-0.038
stack-0.037
 lived-0.037
und-0.036
Token through
Token ID832
Feature activation+4.08e+00

Pos logit contributions
 closer+0.014
 sigh+0.012
 bite+0.012
ax+0.010
awk+0.010
Neg logit contributions
 mastering-0.033
olving-0.033
stack-0.033
 lived-0.033
ric-0.032
Token the
Token ID262
Feature activation+1.17e+00

Pos logit contributions
stack+0.210
ric+0.209
 mastering+0.209
 Kang+0.207
olving+0.206
Neg logit contributions
 closer-0.103
 bite-0.088
 sigh-0.086
 sip-0.080
 harder-0.073
Token water
Token ID1660
Feature activation-2.96e-01

Pos logit contributions
 mastering+0.057
olving+0.055
stack+0.052
 rarely+0.050
 Kang+0.050
Neg logit contributions
 closer-0.034
 sigh-0.032
 bite-0.030
awk-0.028
 sip-0.028
Token and
Token ID290
Feature activation-3.76e+00

Pos logit contributions
 closer+0.008
 bite+0.007
 sigh+0.006
ax+0.006
 sip+0.006
Neg logit contributions
 mastering-0.015
olving-0.014
 lived-0.014
 Kang-0.013
ric-0.013
Token delic
Token ID8675
Feature activation+2.62e+00

Pos logit contributions
 closer+0.106
 sigh+0.094
 bite+0.094
ax+0.091
awk+0.089
Neg logit contributions
 mastering-0.170
olving-0.168
stack-0.162
 lived-0.159
und-0.158
Tokenately
Token ID1286
Feature activation+1.12e+01

Pos logit contributions
 mastering+0.078
stack+0.074
olving+0.074
 lived+0.072
 Kang+0.069
Neg logit contributions
 closer-0.109
ax-0.105
 sigh-0.104
awk-0.104
 bite-0.103
Token scoop
Token ID23529
Feature activation-3.56e+00

Pos logit contributions
 mastering+0.507
olving+0.501
 Zo+0.488
stack+0.485
 Kang+0.482
Neg logit contributions
 closer-0.319
 bite-0.296
 sigh-0.275
ls-0.259
 sip-0.256
Tokened
Token ID276
Feature activation-4.27e-01

Pos logit contributions
 sigh+0.096
 bite+0.093
 closer+0.092
awk+0.090
ax+0.088
Neg logit contributions
olving-0.150
 mastering-0.147
 lived-0.146
und-0.145
stack-0.144
Token up
Token ID510
Feature activation-5.49e+00

Pos logit contributions
 closer+0.008
 sigh+0.008
awk+0.008
ax+0.008
 bite+0.008
Neg logit contributions
 mastering-0.022
olving-0.021
stack-0.021
 lived-0.021
und-0.021
Token some
Token ID617
Feature activation-5.72e+00

Pos logit contributions
 closer+0.155
 sigh+0.140
 bite+0.140
ls+0.131
ax+0.130
Neg logit contributions
 mastering-0.250
olving-0.245
stack-0.239
 lived-0.234
und-0.230
Token of
Token ID286
Feature activation+3.64e+00

Pos logit contributions
 closer+0.159
 bite+0.141
 sigh+0.136
ls+0.133
 sip+0.133
Neg logit contributions
 mastering-0.273
stack-0.268
olving-0.264
ric-0.254
und-0.254
Token of
Token ID286
Feature activation+3.47e+00

Pos logit contributions
 closer+0.027
 sigh+0.027
awk+0.026
 bite+0.025
ax+0.025
Neg logit contributions
 mastering-0.028
olving-0.028
und-0.027
 tre-0.026
ating-0.026
Token sprink
Token ID27562
Feature activation-4.48e+00

Pos logit contributions
 mastering+0.145
 lived+0.140
olving+0.137
stack+0.135
und+0.134
Neg logit contributions
 closer-0.097
awk-0.094
 sigh-0.091
ax-0.087
ls-0.085
Tokenles
Token ID829
Feature activation-3.28e+00

Pos logit contributions
 closer+0.058
 sigh+0.048
 bite+0.045
awk+0.045
ax+0.042
Neg logit contributions
 mastering-0.263
olving-0.260
und-0.253
 lived-0.252
stack-0.251
Token and
Token ID290
Feature activation-3.86e+00

Pos logit contributions
 closer+0.080
 bite+0.073
 sigh+0.073
awk+0.071
ax+0.071
Neg logit contributions
 mastering-0.157
olving-0.154
stack-0.150
 lived-0.148
und-0.145
Token delic
Token ID8675
Feature activation+2.55e+00

Pos logit contributions
 closer+0.107
 sigh+0.099
 bite+0.099
awk+0.097
ax+0.096
Neg logit contributions
 mastering-0.172
olving-0.168
stack-0.164
 lived-0.161
und-0.158
Tokenately
Token ID1286
Feature activation+1.12e+01

Pos logit contributions
stack+0.087
 mastering+0.087
 forest+0.085
 lived+0.084
olving+0.084
Neg logit contributions
ax-0.092
 closer-0.090
awk-0.088
 sigh-0.088
 bite-0.087
Token spread
Token ID4104
Feature activation-4.79e+00

Pos logit contributions
 mastering+0.528
olving+0.525
stack+0.522
 Kang+0.497
jack+0.494
Neg logit contributions
 closer-0.273
 bite-0.272
 sigh-0.252
awk-0.241
 sip-0.241
Token them
Token ID606
Feature activation-6.34e+00

Pos logit contributions
awk+0.147
ax+0.144
 sigh+0.142
 burger+0.138
ls+0.136
Neg logit contributions
olving-0.174
 mastering-0.168
 lived-0.161
 tre-0.161
und-0.158
Token over
Token ID625
Feature activation-5.57e+00

Pos logit contributions
awk+0.194
ax+0.191
 sigh+0.184
 burger+0.178
five+0.177
Neg logit contributions
olving-0.244
 mastering-0.240
stack-0.228
und-0.226
 lived-0.223
Token the
Token ID262
Feature activation+8.35e-01

Pos logit contributions
 closer+0.153
 bite+0.135
 sigh+0.133
ax+0.125
 slice+0.123
Neg logit contributions
 mastering-0.260
stack-0.256
olving-0.254
 lived-0.252
ric-0.246
Token ice
Token ID4771
Feature activation+8.97e-01

Pos logit contributions
 mastering+0.039
olving+0.039
stack+0.038
 lived+0.037
und+0.036
Neg logit contributions
 closer-0.021
awk-0.019
 sigh-0.019
ax-0.019
 bite-0.019
Token dad
Token ID9955
Feature activation-4.29e+00

Pos logit contributions
 mastering+0.022
olving+0.022
stack+0.022
und+0.021
 lived+0.021
Neg logit contributions
 closer-0.010
 sigh-0.009
 bite-0.009
ax-0.008
awk-0.008
Token taught
Token ID7817
Feature activation-1.66e-01

Pos logit contributions
 closer+0.130
 bite+0.128
 sigh+0.127
ax+0.116
 yawn+0.113
Neg logit contributions
 mastering-0.181
olving-0.179
stack-0.176
und-0.170
ric-0.168
Token her
Token ID607
Feature activation+4.31e-01

Pos logit contributions
 closer+0.005
 bite+0.004
 sigh+0.004
ax+0.004
awk+0.004
Neg logit contributions
 mastering-0.007
olving-0.007
stack-0.007
 lived-0.007
und-0.007
Token about
Token ID546
Feature activation+1.59e-01

Pos logit contributions
 mastering+0.020
stack+0.019
olving+0.019
und+0.019
jack+0.018
Neg logit contributions
 closer-0.012
 sigh-0.011
 bite-0.011
ax-0.010
awk-0.010
Token ig
Token ID45329
Feature activation+1.00e+01

Pos logit contributions
olving+0.008
 mastering+0.008
jack+0.008
und+0.007
spe+0.007
Neg logit contributions
 closer-0.004
 bite-0.004
 sigh-0.004
 yawn-0.003
 sandwiches-0.003
Tokenlo
Token ID5439
Feature activation+1.12e+01

Pos logit contributions
 mastering+0.314
stack+0.297
olving+0.288
 lived+0.275
 Australian+0.275
Neg logit contributions
 closer-0.408
ls-0.398
ax-0.394
 sigh-0.394
awk-0.392
Tokenos
Token ID418
Feature activation+2.38e-01

Pos logit contributions
 mastering+0.444
olving+0.432
stack+0.417
 tre+0.391
und+0.391
Neg logit contributions
 closer-0.362
 bite-0.346
 sigh-0.346
ls-0.343
ax-0.342
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
 mastering+0.012
olving+0.012
stack+0.012
 perspective+0.011
und+0.011
Neg logit contributions
 closer-0.006
 sigh-0.005
 bite-0.005
ls-0.005
hell-0.005
Token Now
Token ID2735
Feature activation+3.00e+00

Pos logit contributions
 mastering+0.013
olving+0.013
stack+0.012
ating+0.012
und+0.012
Neg logit contributions
 closer-0.008
 sigh-0.008
ax-0.008
 bite-0.008
awk-0.008
Token,
Token ID11
Feature activation+2.28e+00

Pos logit contributions
 mastering+0.122
olving+0.119
stack+0.116
ric+0.112
und+0.111
Neg logit contributions
 closer-0.098
 bite-0.091
 sigh-0.091
ax-0.088
awk-0.087
Token she
Token ID673
Feature activation-9.26e-01

Pos logit contributions
 mastering+0.124
olving+0.120
ric+0.119
und+0.118
arians+0.116
Neg logit contributions
 closer-0.049
 bite-0.041
 sigh-0.036
 sandwich-0.033
ax-0.033
Token he
Token ID339
Feature activation-6.97e-01

Pos logit contributions
 closer+0.094
 sigh+0.086
 bite+0.086
ax+0.083
awk+0.081
Neg logit contributions
 mastering-0.155
olving-0.151
stack-0.148
und-0.144
 lived-0.142
Token got
Token ID1392
Feature activation-2.63e+00

Pos logit contributions
 closer+0.025
 bite+0.024
 sigh+0.023
ls+0.022
ax+0.022
Neg logit contributions
 mastering-0.026
olving-0.026
stack-0.025
und-0.024
 Kang-0.024
Token it
Token ID340
Feature activation-8.96e+00

Pos logit contributions
awk+0.067
ax+0.064
 sigh+0.063
 closer+0.063
five+0.061
Neg logit contributions
 mastering-0.115
olving-0.111
 lived-0.111
stack-0.108
 tre-0.107
Token so
Token ID523
Feature activation-9.65e-02

Pos logit contributions
 closer+0.214
 bite+0.192
 sigh+0.188
awk+0.181
ax+0.180
Neg logit contributions
 mastering-0.442
olving-0.428
stack-0.421
 lived-0.409
und-0.408
Token the
Token ID262
Feature activation+9.95e-01

Pos logit contributions
awk+0.002
ax+0.002
 closer+0.002
 sigh+0.002
 bite+0.002
Neg logit contributions
 lived-0.004
olving-0.004
 mastering-0.004
 tre-0.004
stack-0.004
Token rightful
Token ID43133
Feature activation+1.12e+01

Pos logit contributions
 mastering+0.041
olving+0.041
stack+0.040
 lived+0.039
jack+0.037
Neg logit contributions
 closer-0.030
awk-0.028
 sigh-0.028
 bite-0.028
ax-0.027
Token owner
Token ID4870
Feature activation+1.94e+00

Pos logit contributions
olving+0.393
 lived+0.389
 mastering+0.387
stack+0.383
 tre+0.364
Neg logit contributions
 closer-0.394
 sigh-0.383
ax-0.379
awk-0.376
 bite-0.371
Token could
Token ID714
Feature activation+7.25e+00

Pos logit contributions
 mastering+0.081
olving+0.078
stack+0.076
 Kang+0.075
und+0.074
Neg logit contributions
 closer-0.063
 bite-0.059
 sigh-0.058
ax-0.054
hell-0.053
Token find
Token ID1064
Feature activation-2.66e+00

Pos logit contributions
 mastering+0.330
olving+0.323
stack+0.312
 lived+0.306
und+0.305
Neg logit contributions
 closer-0.198
 bite-0.188
 sigh-0.184
ax-0.174
awk-0.173
Token it
Token ID340
Feature activation-8.98e+00

Pos logit contributions
 closer+0.065
 sigh+0.060
 bite+0.060
awk+0.059
ax+0.058
Neg logit contributions
 mastering-0.126
olving-0.124
stack-0.121
 lived-0.119
und-0.117
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
 closer+0.235
 bite+0.210
 sigh+0.204
ax+0.192
awk+0.189
Neg logit contributions
 mastering-0.429
olving-0.412
stack-0.408
 Kang-0.397
und-0.396
Token put
Token ID1234
Feature activation-3.71e+00

Pos logit contributions
 closer+0.094
 bite+0.085
 sigh+0.084
 sip+0.076
ax+0.076
Neg logit contributions
 mastering-0.175
olving-0.173
stack-0.169
und-0.166
 Kang-0.164
Token them
Token ID606
Feature activation-6.11e+00

Pos logit contributions
awk+0.091
 sigh+0.087
 closer+0.086
ax+0.085
 bite+0.084
Neg logit contributions
 mastering-0.166
olving-0.165
 lived-0.164
und-0.159
stack-0.159
Token back
Token ID736
Feature activation-2.68e+00

Pos logit contributions
 closer+0.183
awk+0.179
 sigh+0.177
ax+0.176
 bite+0.176
Neg logit contributions
 mastering-0.245
olving-0.244
 lived-0.233
stack-0.233
und-0.227
Token in
Token ID287
Feature activation+3.66e+00

Pos logit contributions
 closer+0.079
 bite+0.070
 sigh+0.069
ls+0.067
awk+0.066
Neg logit contributions
 mastering-0.119
olving-0.114
stack-0.114
 Kang-0.110
 lived-0.109
Token their
Token ID511
Feature activation+1.28e+00

Pos logit contributions
 mastering+0.178
stack+0.176
olving+0.172
 Kang+0.171
jack+0.171
Neg logit contributions
 closer-0.092
 bite-0.087
 sigh-0.083
 sip-0.076
 slice-0.075
Token rightful
Token ID43133
Feature activation+1.11e+01

Pos logit contributions
 mastering+0.058
stack+0.055
olving+0.055
 Kang+0.054
und+0.054
Neg logit contributions
 sigh-0.038
 closer-0.037
 bite-0.037
 sandwich-0.035
 sip-0.034
Token place
Token ID1295
Feature activation+5.74e+00

Pos logit contributions
 mastering+0.381
olving+0.370
stack+0.359
 lived+0.349
und+0.343
Neg logit contributions
 closer-0.423
 sigh-0.399
 bite-0.398
awk-0.393
ax-0.390
Token.
Token ID13
Feature activation+4.63e-01

Pos logit contributions
 mastering+0.299
 Kang+0.285
stack+0.283
olving+0.281
ric+0.279
Neg logit contributions
 closer-0.126
 bite-0.108
 sigh-0.106
hell-0.098
ls-0.095
Token
Token ID198
Feature activation-5.56e-01

Pos logit contributions
 mastering+0.021
olving+0.021
stack+0.020
 lived+0.020
und+0.019
Neg logit contributions
 closer-0.012
 sigh-0.011
ax-0.011
 bite-0.011
awk-0.011
Token
Token ID198
Feature activation-5.52e-01

Pos logit contributions
 closer+0.019
awk+0.019
 bite+0.018
 sigh+0.018
ls+0.018
Neg logit contributions
 lived-0.020
 mastering-0.020
olving-0.019
und-0.019
stack-0.019
TokenAfter
Token ID3260
Feature activation+5.46e+00

Pos logit contributions
 closer+0.019
 bite+0.017
 sigh+0.017
ax+0.017
awk+0.017
Neg logit contributions
 mastering-0.022
olving-0.021
stack-0.021
 lived-0.020
und-0.020
Token back
Token ID736
Feature activation-3.22e+00

Pos logit contributions
 closer+0.125
 sigh+0.115
 bite+0.115
awk+0.113
ax+0.112
Neg logit contributions
 mastering-0.196
olving-0.192
stack-0.188
 lived-0.183
und-0.181
Token the
Token ID262
Feature activation+7.54e-01

Pos logit contributions
 closer+0.075
 bite+0.063
 sigh+0.059
awk+0.058
ls+0.058
Neg logit contributions
 mastering-0.166
stack-0.163
olving-0.160
jack-0.153
 Kang-0.151
Token village
Token ID7404
Feature activation+9.70e+00

Pos logit contributions
 mastering+0.028
olving+0.028
stack+0.027
 lived+0.026
und+0.026
Neg logit contributions
 closer-0.027
 sigh-0.025
 bite-0.025
awk-0.024
ax-0.024
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
 mastering+0.456
olving+0.444
stack+0.437
und+0.425
 lived+0.424
Neg logit contributions
 closer-0.253
 sigh-0.228
 bite-0.227
ax-0.214
awk-0.214
Token its
Token ID663
Feature activation+4.63e-01

Pos logit contributions
jack+0.053
stack+0.052
 mastering+0.052
olving+0.051
und+0.051
Neg logit contributions
 closer-0.026
 bite-0.024
 sigh-0.024
 look-0.023
 sure-0.022
Token former
Token ID1966
Feature activation+1.11e+01

Pos logit contributions
und+0.020
ric+0.019
 mastering+0.019
 lived+0.018
olving+0.018
Neg logit contributions
 closer-0.016
 bite-0.014
 back-0.014
 sigh-0.013
 sandwich-0.013
Token state
Token ID1181
Feature activation+5.75e+00

Pos logit contributions
stack+0.241
 mastering+0.231
jack+0.220
olving+0.208
und+0.207
Neg logit contributions
 sigh-0.555
 closer-0.546
 bite-0.542
awk-0.518
hell-0.517
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
 mastering+0.297
stack+0.293
 Kang+0.290
 Australian+0.288
 Zo+0.285
Neg logit contributions
 closer-0.120
 sigh-0.099
 back-0.097
ax-0.095
hell-0.093
Token With
Token ID2080
Feature activation+3.14e+00

Pos logit contributions
 mastering+0.006
olving+0.006
ating+0.006
stack+0.006
jack+0.006
Neg logit contributions
 closer-0.002
 sigh-0.002
ax-0.002
five-0.002
 bite-0.002
Token the
Token ID262
Feature activation+7.52e-01

Pos logit contributions
 mastering+0.158
olving+0.155
stack+0.154
und+0.149
ric+0.147
Neg logit contributions
 closer-0.076
 bite-0.069
 sigh-0.065
 slice-0.062
 sip-0.059
Token help
Token ID1037
Feature activation+5.36e-01

Pos logit contributions
 mastering+0.032
olving+0.031
stack+0.030
und+0.030
 lived+0.030
Neg logit contributions
 closer-0.024
 sigh-0.022
 bite-0.022
awk-0.021
ax-0.021
Token had
Token ID550
Feature activation+2.23e+00

Pos logit contributions
 mastering+0.170
olving+0.168
stack+0.162
ric+0.153
und+0.152
Neg logit contributions
 closer-0.125
 sigh-0.117
 bite-0.116
ax-0.111
awk-0.108
Token just
Token ID655
Feature activation-6.63e-01

Pos logit contributions
 mastering+0.097
olving+0.095
stack+0.092
 lived+0.092
und+0.090
Neg logit contributions
 closer-0.063
 sigh-0.059
awk-0.059
 bite-0.058
ax-0.057
Token heard
Token ID2982
Feature activation-2.44e+00

Pos logit contributions
 closer+0.019
 bite+0.017
 sigh+0.017
ax+0.016
awk+0.016
Neg logit contributions
 mastering-0.029
stack-0.029
olving-0.029
 Kang-0.028
und-0.028
Token about
Token ID546
Feature activation+2.15e-01

Pos logit contributions
 closer+0.061
 sigh+0.056
 bite+0.055
ax+0.054
awk+0.053
Neg logit contributions
 mastering-0.116
olving-0.113
stack-0.112
 lived-0.108
und-0.107
Token ig
Token ID45329
Feature activation+9.99e+00

Pos logit contributions
stack+0.011
jack+0.011
umm+0.011
olving+0.011
 mastering+0.011
Neg logit contributions
 closer-0.005
 bite-0.004
 sigh-0.004
 look-0.004
 sure-0.003
Tokenlo
Token ID5439
Feature activation+1.11e+01

Pos logit contributions
 mastering+0.223
stack+0.213
olving+0.205
 lived+0.199
 Kang+0.198
Neg logit contributions
 closer-0.490
ax-0.488
ls-0.485
 sigh-0.479
awk-0.477
Tokenos
Token ID418
Feature activation+2.14e-01

Pos logit contributions
 mastering+0.419
olving+0.409
stack+0.395
 tre+0.392
 perspective+0.378
Neg logit contributions
ax-0.368
 closer-0.367
ls-0.356
 sigh-0.356
 bite-0.352
Token and
Token ID290
Feature activation-3.98e+00

Pos logit contributions
olving+0.010
 mastering+0.010
stack+0.010
 tre+0.010
 perspective+0.010
Neg logit contributions
 closer-0.006
 sigh-0.005
 bite-0.005
ls-0.005
hell-0.005
Token she
Token ID673
Feature activation-8.77e-01

Pos logit contributions
 closer+0.108
 sigh+0.104
 bite+0.098
 yawn+0.095
 look+0.092
Neg logit contributions
olving-0.178
stack-0.172
 mastering-0.171
 Kang-0.169
jack-0.168
Token wanted
Token ID2227
Feature activation+2.57e-01

Pos logit contributions
 closer+0.028
 sigh+0.027
 bite+0.025
 yawn+0.025
bs+0.024
Neg logit contributions
olving-0.038
 mastering-0.037
stack-0.036
umm-0.034
 situations-0.034
Token to
Token ID284
Feature activation+1.36e+00

Pos logit contributions
 mastering+0.014
olving+0.014
stack+0.014
und+0.014
 lived+0.014
Neg logit contributions
 closer-0.005
 bite-0.004
 sigh-0.004
ax-0.003
awk-0.003
Token.
Token ID13
Feature activation+4.70e-01

Pos logit contributions
 mastering+0.130
olving+0.127
stack+0.124
 lived+0.121
und+0.120
Neg logit contributions
 closer-0.085
 sigh-0.079
 bite-0.079
awk-0.077
ax-0.077
Token Sue
Token ID26089
Feature activation-2.92e+00

Pos logit contributions
 mastering+0.022
olving+0.022
stack+0.021
 lived+0.021
und+0.020
Neg logit contributions
 closer-0.012
 sigh-0.011
 bite-0.011
awk-0.011
ax-0.011
Token learned
Token ID4499
Feature activation+2.39e-01

Pos logit contributions
 closer+0.086
 sigh+0.081
 bite+0.080
ax+0.078
awk+0.077
Neg logit contributions
 mastering-0.126
olving-0.125
stack-0.120
und-0.116
ric-0.113
Token that
Token ID326
Feature activation-2.98e+00

Pos logit contributions
 mastering+0.011
olving+0.010
stack+0.010
 lived+0.010
und+0.010
Neg logit contributions
 closer-0.007
 sigh-0.006
 bite-0.006
awk-0.006
ax-0.006
Token ig
Token ID45329
Feature activation+9.94e+00

Pos logit contributions
 closer+0.086
 bite+0.079
 sigh+0.077
 sip+0.070
 sandwich+0.068
Neg logit contributions
und-0.133
 mastering-0.131
stack-0.130
olving-0.129
jack-0.126
Tokenlo
Token ID5439
Feature activation+1.11e+01

Pos logit contributions
 mastering+0.241
stack+0.219
olving+0.209
 Australian+0.199
 perspective+0.199
Neg logit contributions
 closer-0.480
ls-0.472
awk-0.464
 sigh-0.463
ax-0.459
Tokenos
Token ID418
Feature activation+1.77e-01

Pos logit contributions
 mastering+0.471
olving+0.448
stack+0.433
 perspective+0.417
 completing+0.408
Neg logit contributions
 closer-0.336
ls-0.327
 bite-0.324
 sigh-0.322
awk-0.317
Token are
Token ID389
Feature activation+4.53e+00

Pos logit contributions
 mastering+0.007
olving+0.007
stack+0.007
 perspective+0.006
 Kang+0.006
Neg logit contributions
 closer-0.006
 bite-0.006
 sigh-0.006
ls-0.006
hell-0.005
Token made
Token ID925
Feature activation-1.57e+00

Pos logit contributions
und+0.255
stack+0.245
 mastering+0.235
 Zo+0.233
 perspective+0.231
Neg logit contributions
 closer-0.108
 bite-0.082
 harder-0.081
 sigh-0.075
 back-0.074
Token of
Token ID286
Feature activation+3.70e+00

Pos logit contributions
 closer+0.043
 bite+0.038
 sigh+0.038
ls+0.036
awk+0.036
Neg logit contributions
 mastering-0.073
stack-0.072
olving-0.069
und-0.068
ric-0.067
Token snow
Token ID6729
Feature activation+3.16e+00

Pos logit contributions
�+0.172
 mastering+0.168
stack+0.168
olving+0.163
jack+0.163
Neg logit contributions
 closer-0.106
 bite-0.101
 sigh-0.099
 look-0.094
 sandwiches-0.090
Token little
Token ID1310
Feature activation+3.27e+00

Pos logit contributions
 mastering+0.030
stack+0.029
olving+0.029
und+0.029
 lived+0.029
Neg logit contributions
 closer-0.018
 sigh-0.017
 bite-0.017
ax-0.016
awk-0.016
Token girl
Token ID2576
Feature activation+3.39e+00

Pos logit contributions
 mastering+0.144
olving+0.141
stack+0.139
 lived+0.136
und+0.135
Neg logit contributions
 closer-0.096
 sigh-0.088
 bite-0.088
awk-0.085
ax-0.084
Token's
Token ID338
Feature activation+1.20e+00

Pos logit contributions
 mastering+0.154
olving+0.152
stack+0.148
 lived+0.146
und+0.145
Neg logit contributions
 closer-0.094
 sigh-0.085
 bite-0.084
awk-0.080
ax-0.079
Token shoulders
Token ID12450
Feature activation-7.69e-02

Pos logit contributions
 mastering+0.051
olving+0.050
stack+0.049
 lived+0.048
und+0.047
Neg logit contributions
 closer-0.036
 sigh-0.034
 bite-0.034
awk-0.033
ax-0.033
Token affection
Token ID17696
Feature activation+9.22e-01

Pos logit contributions
 closer+0.002
awk+0.002
 sigh+0.002
 bite+0.002
ax+0.002
Neg logit contributions
 mastering-0.003
olving-0.003
stack-0.003
 lived-0.003
und-0.003
Tokenately
Token ID1286
Feature activation+1.11e+01

Pos logit contributions
 mastering+0.037
stack+0.036
olving+0.035
ric+0.035
 lived+0.035
Neg logit contributions
 closer-0.032
ax-0.028
ls-0.028
awk-0.027
 sigh-0.027
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
 mastering+0.484
olving+0.474
stack+0.465
 lived+0.455
und+0.449
Neg logit contributions
 closer-0.315
 bite-0.293
 sigh-0.291
awk-0.283
ax-0.280
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
 mastering+0.006
olving+0.006
stack+0.006
 lived+0.005
ating+0.005
Neg logit contributions
 closer-0.004
 sigh-0.003
ax-0.003
 bite-0.003
awk-0.003
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
 closer+0.037
awk+0.036
 bite+0.036
 sigh+0.035
ax+0.034
Neg logit contributions
 mastering-0.045
olving-0.044
 lived-0.043
stack-0.042
und-0.042
TokenSarah
Token ID29284
Feature activation-7.16e-01

Pos logit contributions
 closer+0.040
 sigh+0.037
 bite+0.036
ax+0.036
awk+0.035
Neg logit contributions
 mastering-0.044
olving-0.043
stack-0.043
 lived-0.041
und-0.040
Token smiled
Token ID13541
Feature activation-3.27e+00

Pos logit contributions
 sigh+0.019
 closer+0.019
 bite+0.018
ax+0.017
 slice+0.016
Neg logit contributions
olving-0.033
 mastering-0.032
stack-0.032
 situations-0.031
 Kang-0.030
Token the
Token ID262
Feature activation+9.18e-01

Pos logit contributions
 closer+0.037
 sigh+0.035
awk+0.035
 bite+0.035
ax+0.034
Neg logit contributions
 mastering-0.069
olving-0.068
stack-0.067
 lived-0.065
und-0.064
Token stolen
Token ID9909
Feature activation-1.99e-01

Pos logit contributions
 mastering+0.036
olving+0.036
stack+0.034
 tre+0.034
 forest+0.033
Neg logit contributions
 closer-0.028
awk-0.028
 sigh-0.027
ax-0.026
ls-0.026
Token squares
Token ID24438
Feature activation+1.42e+00

Pos logit contributions
 sigh+0.006
awk+0.006
 closer+0.006
 bite+0.005
bs+0.005
Neg logit contributions
olving-0.008
 mastering-0.008
stack-0.008
 lived-0.007
 forest-0.007
Token to
Token ID284
Feature activation+1.30e+00

Pos logit contributions
 mastering+0.058
olving+0.058
stack+0.057
 lived+0.055
und+0.054
Neg logit contributions
 closer-0.041
 sigh-0.041
awk-0.041
 bite-0.039
ax-0.038
Token their
Token ID511
Feature activation+1.07e+00

Pos logit contributions
 mastering+0.057
olving+0.057
stack+0.053
und+0.053
 lived+0.053
Neg logit contributions
 closer-0.038
 bite-0.036
 sigh-0.034
ax-0.033
 sip-0.032
Token rightful
Token ID43133
Feature activation+1.11e+01

Pos logit contributions
 mastering+0.048
und+0.047
olving+0.046
ating+0.045
ric+0.044
Neg logit contributions
 closer-0.031
 bite-0.031
 sandwich-0.030
 sigh-0.029
 sandwiches-0.028
Token place
Token ID1295
Feature activation+5.76e+00

Pos logit contributions
olving+0.388
 mastering+0.388
 lived+0.377
stack+0.375
 tre+0.367
Neg logit contributions
 closer-0.394
awk-0.380
 sigh-0.379
ax-0.372
 bite-0.370
Token.
Token ID13
Feature activation+3.50e-01

Pos logit contributions
 mastering+0.278
olving+0.270
 Kang+0.260
stack+0.259
 lived+0.256
Neg logit contributions
 closer-0.148
 bite-0.132
 sigh-0.130
ax-0.122
ls-0.118
Token
Token ID198
Feature activation-7.78e-01

Pos logit contributions
olving+0.017
 mastering+0.017
ating+0.016
stack+0.015
 tre+0.015
Neg logit contributions
 closer-0.009
 sigh-0.008
ax-0.008
 bite-0.008
five-0.007
Token
Token ID198
Feature activation-7.69e-01

Pos logit contributions
 closer+0.026
awk+0.026
 sigh+0.025
 bite+0.025
ax+0.024
Neg logit contributions
 lived-0.029
stack-0.029
 mastering-0.028
und-0.028
olving-0.027
TokenThe
Token ID464
Feature activation+5.14e+00

Pos logit contributions
 closer+0.028
 bite+0.026
 sigh+0.026
ax+0.026
awk+0.025
Neg logit contributions
 mastering-0.028
olving-0.028
stack-0.026
ric-0.025
 lived-0.025
Token?"
Token ID1701
Feature activation-9.17e+00

Pos logit contributions
 closer+0.026
 bite+0.023
ax+0.022
five+0.022
 sigh+0.022
Neg logit contributions
 mastering-0.074
olving-0.071
stack-0.071
und-0.069
 Australian-0.068
Token L
Token ID406
Feature activation-3.80e-01

Pos logit contributions
 closer+0.228
 sigh+0.211
 bite+0.208
ax+0.203
awk+0.202
Neg logit contributions
 mastering-0.440
olving-0.440
stack-0.420
ating-0.403
und-0.401
Tokenila
Token ID10102
Feature activation-2.13e+00

Pos logit contributions
 closer+0.011
 bite+0.010
 sigh+0.010
awk+0.009
 slice+0.009
Neg logit contributions
 mastering-0.017
olving-0.016
 lived-0.016
und-0.016
stack-0.016
Token and
Token ID290
Feature activation-5.84e+00

Pos logit contributions
 closer+0.048
 bite+0.044
awk+0.043
 sigh+0.043
ax+0.042
Neg logit contributions
 mastering-0.104
olving-0.102
 lived-0.100
stack-0.099
und-0.098
Token Ben
Token ID3932
Feature activation-4.80e+00

Pos logit contributions
 closer+0.104
 sigh+0.095
 bite+0.093
awk+0.088
ax+0.088
Neg logit contributions
 mastering-0.320
olving-0.316
stack-0.308
 lived-0.299
und-0.298
Token look
Token ID804
Feature activation-1.05e+01

Pos logit contributions
 closer+0.115
 bite+0.105
awk+0.104
ax+0.102
 sigh+0.101
Neg logit contributions
 mastering-0.224
 lived-0.218
olving-0.218
stack-0.216
und-0.211
Token at
Token ID379
Feature activation+9.27e-01

Pos logit contributions
five+0.363
 bite+0.356
iffs+0.354
umbs+0.348
 slice+0.345
Neg logit contributions
 lived-0.360
 mastering-0.336
und-0.321
 tre-0.313
olving-0.300
Token the
Token ID262
Feature activation-8.05e-02

Pos logit contributions
 mastering+0.040
olving+0.039
stack+0.038
 lived+0.037
und+0.037
Neg logit contributions
 closer-0.027
 sigh-0.025
 bite-0.025
awk-0.025
ax-0.025
Token ice
Token ID4771
Feature activation+3.46e-01

Pos logit contributions
 closer+0.002
 bite+0.002
 sigh+0.002
awk+0.002
ax+0.002
Neg logit contributions
 mastering-0.004
olving-0.003
stack-0.003
 lived-0.003
und-0.003
Token-
Token ID12
Feature activation+6.05e-01

Pos logit contributions
 mastering+0.016
olving+0.016
stack+0.016
 lived+0.015
und+0.015
Neg logit contributions
 closer-0.009
 sigh-0.008
awk-0.008
 bite-0.008
ax-0.008
Tokencream
Token ID36277
Feature activation+4.71e-01

Pos logit contributions
jack+0.019
und+0.018
ating+0.018
olving+0.018
ric+0.018
Neg logit contributions
 sigh-0.024
 closer-0.022
 bite-0.022
 sip-0.021
awk-0.021
Token
Token ID198
Feature activation-2.35e+00

Pos logit contributions
 closer+0.021
 sigh+0.019
 bite+0.019
awk+0.019
ax+0.019
Neg logit contributions
 mastering-0.032
olving-0.032
stack-0.031
 lived-0.031
und-0.030
Token
Token ID198
Feature activation-1.78e+00

Pos logit contributions
 closer+0.065
 sigh+0.059
awk+0.059
ax+0.059
 bite+0.059
Neg logit contributions
 mastering-0.105
olving-0.104
stack-0.100
 lived-0.095
ating-0.095
TokenTom
Token ID13787
Feature activation-2.52e+00

Pos logit contributions
 closer+0.053
ax+0.049
awk+0.047
 sigh+0.047
 bite+0.047
Neg logit contributions
 mastering-0.076
olving-0.076
stack-0.073
ric-0.069
 lived-0.069
Token and
Token ID290
Feature activation-5.60e+00

Pos logit contributions
 closer+0.064
 sigh+0.062
 bite+0.060
ax+0.059
awk+0.058
Neg logit contributions
 mastering-0.119
olving-0.118
stack-0.115
 Kang-0.109
 lived-0.108
Token Lily
Token ID20037
Feature activation-3.69e+00

Pos logit contributions
 closer+0.117
 bite+0.105
 sigh+0.105
awk+0.101
ax+0.100
Neg logit contributions
 mastering-0.289
olving-0.284
stack-0.278
 lived-0.272
und-0.270
Token look
Token ID804
Feature activation-1.05e+01

Pos logit contributions
 closer+0.122
 sigh+0.120
 bite+0.115
ax+0.114
awk+0.111
Neg logit contributions
 mastering-0.149
olving-0.149
stack-0.142
 Kang-0.135
ating-0.134
Token at
Token ID379
Feature activation+1.17e+00

Pos logit contributions
 bite+0.421
five+0.404
 slice+0.403
iffs+0.397
 sigh+0.391
Neg logit contributions
 lived-0.308
 mastering-0.304
und-0.282
olving-0.273
 tre-0.270
Token the
Token ID262
Feature activation-5.96e-02

Pos logit contributions
 mastering+0.050
olving+0.049
stack+0.048
 lived+0.047
und+0.046
Neg logit contributions
 closer-0.035
 sigh-0.033
 bite-0.033
awk-0.032
ax-0.032
Token salad
Token ID20698
Feature activation-3.86e+00

Pos logit contributions
 closer+0.002
 bite+0.002
 sigh+0.002
awk+0.002
ax+0.002
Neg logit contributions
 mastering-0.003
olving-0.003
stack-0.002
 lived-0.002
und-0.002
Token.
Token ID13
Feature activation-7.26e-01

Pos logit contributions
 closer+0.100
 bite+0.091
 sigh+0.086
 sandwich+0.085
ls+0.084
Neg logit contributions
 mastering-0.184
olving-0.177
und-0.170
stack-0.169
 lived-0.169
Token It
Token ID632
Feature activation-2.63e+00

Pos logit contributions
 closer+0.022
 sigh+0.020
 bite+0.020
ax+0.019
awk+0.019
Neg logit contributions
 mastering-0.031
olving-0.030
stack-0.029
 lived-0.029
und-0.028
Token were
Token ID547
Feature activation+4.68e+00

Pos logit contributions
ax+0.067
 sigh+0.064
 closer+0.063
awk+0.062
 bite+0.062
Neg logit contributions
 mastering-0.166
olving-0.161
 lived-0.161
stack-0.159
und-0.157
Token twins
Token ID20345
Feature activation+5.91e-01

Pos logit contributions
 mastering+0.196
olving+0.193
stack+0.188
 lived+0.182
und+0.182
Neg logit contributions
 closer-0.143
 bite-0.131
 sigh-0.131
ax-0.128
awk-0.127
Token who
Token ID508
Feature activation-2.52e-01

Pos logit contributions
olving+0.026
 mastering+0.025
stack+0.025
 Kang+0.024
und+0.024
Neg logit contributions
 closer-0.018
 sigh-0.017
 bite-0.016
ax-0.015
awk-0.015
Token liked
Token ID8288
Feature activation+2.15e+00

Pos logit contributions
 closer+0.008
 sigh+0.008
 bite+0.008
bs+0.008
ax+0.007
Neg logit contributions
 mastering-0.010
olving-0.010
stack-0.010
 Kang-0.009
 perspective-0.009
Token to
Token ID284
Feature activation+2.73e-01

Pos logit contributions
stack+0.134
 mastering+0.131
olving+0.129
und+0.128
 Kang+0.126
Neg logit contributions
 closer-0.028
 bite-0.024
 sigh-0.022
 burger-0.017
 sandwiches-0.017
Token look
Token ID804
Feature activation-1.04e+01

Pos logit contributions
 mastering+0.012
olving+0.011
stack+0.011
und+0.011
 Kang+0.011
Neg logit contributions
 closer-0.008
 bite-0.008
 sigh-0.008
 sip-0.007
awk-0.007
Token at
Token ID379
Feature activation+1.45e+00

Pos logit contributions
 bite+0.335
 sigh+0.329
awk+0.326
ax+0.321
hell+0.318
Neg logit contributions
 mastering-0.394
olving-0.380
 lived-0.365
 tre-0.350
 forest-0.342
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
stack+0.069
 mastering+0.069
olving+0.067
 Kang+0.065
jack+0.064
Neg logit contributions
 closer-0.039
 bite-0.034
 sigh-0.033
ls-0.030
 sip-0.030
Token stars
Token ID5788
Feature activation-9.16e-01

Pos logit contributions
 mastering+0.004
olving+0.004
stack+0.004
und+0.004
 lived+0.004
Neg logit contributions
 closer-0.004
 bite-0.003
 sigh-0.003
 sandwich-0.003
awk-0.003
Token at
Token ID379
Feature activation+1.63e+00

Pos logit contributions
 closer+0.024
 bite+0.021
 sigh+0.020
ls+0.020
awk+0.019
Neg logit contributions
 mastering-0.044
olving-0.043
stack-0.042
und-0.041
 lived-0.041
Token night
Token ID1755
Feature activation+1.02e+00

Pos logit contributions
 mastering+0.070
olving+0.068
stack+0.067
und+0.065
jack+0.064
Neg logit contributions
 closer-0.049
 bite-0.045
 sigh-0.044
awk-0.041
ax-0.040
Token
Token ID198
Feature activation-1.81e+00

Pos logit contributions
 closer+0.228
 sigh+0.211
 bite+0.208
ax+0.203
awk+0.202
Neg logit contributions
 mastering-0.440
olving-0.440
stack-0.420
ating-0.403
und-0.401
Token
Token ID198
Feature activation-1.65e+00

Pos logit contributions
 closer+0.051
 sigh+0.047
 bite+0.047
awk+0.047
ax+0.046
Neg logit contributions
 mastering-0.079
olving-0.078
stack-0.076
 lived-0.075
und-0.073
TokenTom
Token ID13787
Feature activation-2.32e+00

Pos logit contributions
 closer+0.046
ax+0.042
 sigh+0.042
 bite+0.042
awk+0.041
Neg logit contributions
olving-0.075
 mastering-0.075
stack-0.071
ric-0.068
ating-0.067
Token and
Token ID290
Feature activation-5.46e+00

Pos logit contributions
 closer+0.058
 sigh+0.054
 bite+0.053
awk+0.052
ax+0.052
Neg logit contributions
 mastering-0.110
olving-0.108
stack-0.106
 lived-0.103
und-0.102
Token Mia
Token ID32189
Feature activation-3.33e+00

Pos logit contributions
 closer+0.100
 sigh+0.089
 bite+0.089
awk+0.083
ax+0.082
Neg logit contributions
 mastering-0.298
olving-0.294
stack-0.287
und-0.279
 lived-0.279
Token look
Token ID804
Feature activation-1.04e+01

Pos logit contributions
 closer+0.101
 sigh+0.095
 bite+0.094
awk+0.092
ax+0.091
Neg logit contributions
 mastering-0.141
olving-0.138
stack-0.134
 lived-0.129
und-0.128
Token at
Token ID379
Feature activation+1.23e+00

Pos logit contributions
 bite+0.405
five+0.399
 slice+0.392
iffs+0.390
umbs+0.387
Neg logit contributions
 lived-0.311
 mastering-0.292
und-0.283
 tre-0.265
olving-0.259
Token Mom
Token ID11254
Feature activation-5.69e+00

Pos logit contributions
 mastering+0.051
olving+0.050
stack+0.049
 lived+0.048
und+0.047
Neg logit contributions
 closer-0.037
 sigh-0.035
 bite-0.035
awk-0.035
ax-0.034
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
 closer+0.149
 sigh+0.138
 bite+0.136
awk+0.132
ax+0.132
Neg logit contributions
 mastering-0.263
olving-0.259
stack-0.253
 lived-0.247
und-0.244
Token They
Token ID1119
Feature activation+2.19e+00

Pos logit contributions
 closer+0.016
 sigh+0.016
 bite+0.015
 sip+0.015
ax+0.014
Neg logit contributions
 mastering-0.022
olving-0.021
stack-0.020
ating-0.020
 lived-0.020
Token see
Token ID766
Feature activation-6.21e+00

Pos logit contributions
 mastering+0.104
olving+0.102
 lived+0.101
stack+0.101
und+0.098
Neg logit contributions
 closer-0.052
 bite-0.046
awk-0.046
ls-0.046
ax-0.046
Token mom
Token ID1995
Feature activation-6.48e+00

Pos logit contributions
 mastering+0.020
olving+0.019
stack+0.018
 lived+0.018
und+0.018
Neg logit contributions
 closer-0.025
 sigh-0.024
 bite-0.024
awk-0.024
ax-0.024
Token called
Token ID1444
Feature activation-1.73e+00

Pos logit contributions
 closer+0.182
 bite+0.168
 sigh+0.167
awk+0.167
ax+0.164
Neg logit contributions
 mastering-0.282
olving-0.276
stack-0.270
 lived-0.267
und-0.261
Token him
Token ID683
Feature activation-2.58e+00

Pos logit contributions
 closer+0.043
 bite+0.039
 sigh+0.039
ax+0.037
awk+0.037
Neg logit contributions
 mastering-0.083
olving-0.081
stack-0.078
 lived-0.077
und-0.077
Token over
Token ID625
Feature activation-6.18e+00

Pos logit contributions
 closer+0.066
 sigh+0.056
 bite+0.056
ax+0.053
awk+0.051
Neg logit contributions
 mastering-0.127
olving-0.123
stack-0.119
und-0.117
ating-0.116
Token to
Token ID284
Feature activation+3.91e-01

Pos logit contributions
 closer+0.121
 sigh+0.102
 bite+0.100
ax+0.087
awk+0.083
Neg logit contributions
 mastering-0.339
olving-0.335
stack-0.326
 lived-0.325
ric-0.320
Token look
Token ID804
Feature activation-1.04e+01

Pos logit contributions
olving+0.018
 mastering+0.018
stack+0.017
ating+0.016
und+0.016
Neg logit contributions
 closer-0.011
 bite-0.011
 sigh-0.011
 sip-0.010
awk-0.009
Token at
Token ID379
Feature activation+1.47e+00

Pos logit contributions
 bite+0.312
awk+0.312
 sigh+0.303
ax+0.300
 closer+0.300
Neg logit contributions
 mastering-0.397
 lived-0.385
olving-0.376
 tre-0.364
und-0.363
Token something
Token ID1223
Feature activation-5.04e+00

Pos logit contributions
 mastering+0.069
olving+0.067
stack+0.064
 Kang+0.063
 lived+0.062
Neg logit contributions
 closer-0.040
 bite-0.037
 sigh-0.036
ax-0.033
ls-0.032
Token.
Token ID13
Feature activation-3.21e-01

Pos logit contributions
 closer+0.136
 bite+0.120
 sigh+0.120
ls+0.116
ax+0.112
Neg logit contributions
 mastering-0.238
olving-0.235
stack-0.230
und-0.222
ric-0.218
Token She
Token ID1375
Feature activation+3.25e+00

Pos logit contributions
 closer+0.010
 sigh+0.009
ax+0.009
 bite+0.008
awk+0.008
Neg logit contributions
 mastering-0.014
olving-0.014
ating-0.013
stack-0.013
und-0.012
Token pointed
Token ID6235
Feature activation-5.82e+00

Pos logit contributions
 mastering+0.143
olving+0.143
stack+0.138
und+0.131
ric+0.129
Neg logit contributions
 closer-0.095
 sigh-0.088
 bite-0.085
ax-0.083
awk-0.081
Token higher
Token ID2440
Feature activation-4.64e+00

Pos logit contributions
ax+0.057
five+0.056
 bite+0.056
 sigh+0.056
 slice+0.054
Neg logit contributions
olving-0.099
 lived-0.097
 mastering-0.097
und-0.092
 tre-0.091
Token and
Token ID290
Feature activation-5.05e+00

Pos logit contributions
 closer+0.110
 bite+0.099
ls+0.092
 sigh+0.090
awk+0.090
Neg logit contributions
 mastering-0.239
olving-0.230
stack-0.222
 lived-0.222
arians-0.217
Token higher
Token ID2440
Feature activation-4.26e+00

Pos logit contributions
 closer+0.100
 sigh+0.087
 bite+0.086
awk+0.082
ax+0.080
Neg logit contributions
 mastering-0.268
olving-0.264
stack-0.258
 lived-0.252
und-0.252
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
 closer+0.112
 bite+0.096
ls+0.095
 sigh+0.093
 harder+0.090
Neg logit contributions
 mastering-0.206
olving-0.199
stack-0.195
 lived-0.194
 Kang-0.193
Token They
Token ID1119
Feature activation+1.78e+00

Pos logit contributions
 closer+0.010
 sigh+0.009
ax+0.009
 bite+0.009
awk+0.009
Neg logit contributions
 mastering-0.014
olving-0.014
ating-0.014
stack-0.013
 tre-0.013
Token look
Token ID804
Feature activation-1.03e+01

Pos logit contributions
 mastering+0.079
olving+0.078
stack+0.075
und+0.073
 Kang+0.072
Neg logit contributions
 closer-0.051
 bite-0.048
 sigh-0.047
awk-0.044
ax-0.044
Token at
Token ID379
Feature activation+1.69e+00

Pos logit contributions
umbs+0.318
iffs+0.314
 bite+0.310
awk+0.309
ax+0.307
Neg logit contributions
 mastering-0.384
 lived-0.379
 tre-0.360
olving-0.357
 forest-0.342
Token each
Token ID1123
Feature activation-1.34e+00

Pos logit contributions
 mastering+0.074
olving+0.072
stack+0.070
 lived+0.068
und+0.068
Neg logit contributions
 closer-0.050
 bite-0.046
 sigh-0.045
awk-0.043
ax-0.043
Token other
Token ID584
Feature activation+8.10e+00

Pos logit contributions
 closer+0.043
 bite+0.042
 sigh+0.038
awk+0.038
ax+0.037
Neg logit contributions
 mastering-0.056
stack-0.056
olving-0.056
 lived-0.053
und-0.053
Token's
Token ID338
Feature activation+9.95e-01

Pos logit contributions
 mastering+0.441
stack+0.430
olving+0.429
 Kang+0.421
und+0.419
Neg logit contributions
 closer-0.155
 bite-0.139
 sigh-0.139
 sandwich-0.118
 slice-0.118
Token towers
Token ID18028
Feature activation+7.68e-01

Pos logit contributions
 mastering+0.041
stack+0.039
und+0.039
 lived+0.039
olving+0.039
Neg logit contributions
 closer-0.032
 bite-0.031
 sigh-0.030
awk-0.028
 slice-0.027
Token
Token ID198
Feature activation-2.35e+00

Pos logit contributions
 mastering+0.191
olving+0.189
 lived+0.183
stack+0.180
 tre+0.175
Neg logit contributions
 closer-0.107
awk-0.107
ax-0.104
five-0.100
 bite-0.099
Token
Token ID198
Feature activation-1.78e+00

Pos logit contributions
 closer+0.065
 sigh+0.059
awk+0.059
ax+0.059
 bite+0.059
Neg logit contributions
 mastering-0.105
olving-0.104
stack-0.100
 lived-0.095
ating-0.095
TokenTim
Token ID14967
Feature activation+6.08e-01

Pos logit contributions
 closer+0.053
ax+0.049
awk+0.047
 sigh+0.047
 bite+0.047
Neg logit contributions
 mastering-0.076
olving-0.076
stack-0.073
ric-0.069
 lived-0.069
Tokenmy
Token ID1820
Feature activation-4.59e+00

Pos logit contributions
 mastering+0.028
olving+0.027
stack+0.027
 lived+0.025
und+0.025
Neg logit contributions
 closer-0.016
 sigh-0.016
 bite-0.015
awk-0.015
ax-0.015
Token thought
Token ID1807
Feature activation-6.01e+00

Pos logit contributions
 closer+0.153
 sigh+0.149
 bite+0.145
ax+0.141
awk+0.140
Neg logit contributions
 mastering-0.180
olving-0.179
stack-0.173
und-0.161
 Kang-0.161
Token it
Token ID340
Feature activation-1.03e+01

Pos logit contributions
 closer+0.166
awk+0.164
 bite+0.158
 sigh+0.154
ax+0.154
Neg logit contributions
 mastering-0.261
olving-0.260
 lived-0.247
stack-0.245
und-0.238
Token was
Token ID373
Feature activation+4.06e+00

Pos logit contributions
 closer+0.413
 bite+0.381
 sigh+0.378
hell+0.352
 harder+0.345
Neg logit contributions
 mastering-0.369
stack-0.356
olving-0.350
 Kang-0.334
und-0.327
Token a
Token ID257
Feature activation-4.75e+00

Pos logit contributions
 mastering+0.203
olving+0.199
stack+0.198
und+0.192
 lived+0.187
Neg logit contributions
 closer-0.096
 bite-0.086
 sigh-0.084
ax-0.081
awk-0.078
Token great
Token ID1049
Feature activation+3.03e+00

Pos logit contributions
 closer+0.135
 bite+0.125
 sigh+0.123
awk+0.114
ax+0.112
Neg logit contributions
 mastering-0.215
olving-0.209
stack-0.208
und-0.203
 lived-0.200
Token idea
Token ID2126
Feature activation+2.61e-01

Pos logit contributions
olving+0.098
 mastering+0.096
 lived+0.088
 forest+0.087
 tre+0.087
Neg logit contributions
ls-0.118
ax-0.116
 closer-0.113
awk-0.113
bs-0.110
Token and
Token ID290
Feature activation-4.55e+00

Pos logit contributions
 mastering+0.013
stack+0.013
olving+0.013
ric+0.012
 Kang+0.012
Neg logit contributions
 closer-0.006
 bite-0.006
 sigh-0.005
hell-0.005
ax-0.005
Token Lily
Token ID20037
Feature activation-4.49e+00

Pos logit contributions
 closer+0.050
 sigh+0.047
awk+0.047
 bite+0.047
ax+0.045
Neg logit contributions
 mastering-0.078
stack-0.076
olving-0.076
 lived-0.076
und-0.074
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 closer+0.107
 sigh+0.094
 bite+0.089
ax+0.089
awk+0.087
Neg logit contributions
olving-0.223
 mastering-0.221
stack-0.217
umm-0.208
 Kang-0.206
Token She
Token ID1375
Feature activation+2.66e+00

Pos logit contributions
 closer+0.031
ax+0.029
 sigh+0.029
 sip+0.027
 bite+0.027
Neg logit contributions
 mastering-0.045
olving-0.044
ating-0.043
stack-0.041
 tre-0.041
Token loved
Token ID6151
Feature activation+1.76e+00

Pos logit contributions
olving+0.103
 mastering+0.100
stack+0.098
 situations+0.096
umm+0.094
Neg logit contributions
 closer-0.090
 sigh-0.089
 bite-0.087
 yawn-0.083
 dropped-0.081
Token to
Token ID284
Feature activation+1.43e-01

Pos logit contributions
stack+0.103
 mastering+0.100
olving+0.100
und+0.096
jack+0.096
Neg logit contributions
 closer-0.026
 bite-0.025
 sigh-0.022
 burger-0.020
 sandwiches-0.020
Token look
Token ID804
Feature activation-1.03e+01

Pos logit contributions
 mastering+0.006
olving+0.006
stack+0.006
und+0.005
 lived+0.005
Neg logit contributions
 closer-0.004
 bite-0.004
 sigh-0.004
awk-0.004
ax-0.004
Token at
Token ID379
Feature activation+1.49e+00

Pos logit contributions
 bite+0.373
 sigh+0.365
five+0.364
 slice+0.354
awk+0.353
Neg logit contributions
 mastering-0.346
olving-0.326
 lived-0.314
 tre-0.311
 forest-0.309
Token herself
Token ID5223
Feature activation+8.92e-01

Pos logit contributions
 mastering+0.067
stack+0.065
olving+0.065
und+0.062
 Kang+0.061
Neg logit contributions
 closer-0.042
 bite-0.038
 sigh-0.037
ls-0.036
ax-0.035
Token in
Token ID287
Feature activation+3.14e+00

Pos logit contributions
 mastering+0.042
stack+0.041
olving+0.040
 Zo+0.040
und+0.039
Neg logit contributions
 closer-0.023
ls-0.021
 bite-0.020
awk-0.019
 sigh-0.018
Token the
Token ID262
Feature activation+3.23e-01

Pos logit contributions
 mastering+0.135
olving+0.132
stack+0.132
und+0.125
 lived+0.125
Neg logit contributions
 closer-0.094
 bite-0.087
 sigh-0.087
ax-0.080
awk-0.080
Token mirror
Token ID10162
Feature activation-7.15e-01

Pos logit contributions
 mastering+0.010
olving+0.010
stack+0.010
 lived+0.009
und+0.009
Neg logit contributions
 closer-0.014
 sigh-0.013
 bite-0.013
awk-0.013
ax-0.012
Token drink
Token ID4144
Feature activation-3.96e+00

Pos logit contributions
 closer+0.165
 bite+0.153
 sigh+0.151
awk+0.145
ax+0.143
Neg logit contributions
 mastering-0.257
olving-0.252
stack-0.244
und-0.237
 lived-0.236
Token some
Token ID617
Feature activation-5.92e+00

Pos logit contributions
awk+0.101
 closer+0.101
ax+0.100
 sigh+0.099
 bite+0.092
Neg logit contributions
 lived-0.169
olving-0.166
 mastering-0.165
stack-0.165
und-0.162
Token water
Token ID1660
Feature activation-1.71e+00

Pos logit contributions
 closer+0.124
 bite+0.115
 sigh+0.108
ls+0.102
awk+0.102
Neg logit contributions
 mastering-0.316
olving-0.307
stack-0.302
und-0.292
ric-0.288
Token.
Token ID13
Feature activation-7.02e-01

Pos logit contributions
 closer+0.050
 sigh+0.047
awk+0.046
ax+0.046
 bite+0.046
Neg logit contributions
 mastering-0.071
olving-0.070
stack-0.070
 lived-0.068
und-0.067
Token They
Token ID1119
Feature activation+1.87e+00

Pos logit contributions
 closer+0.021
 sigh+0.019
ax+0.019
 bite+0.019
awk+0.019
Neg logit contributions
 mastering-0.030
olving-0.030
ating-0.028
stack-0.027
und-0.027
Token look
Token ID804
Feature activation-1.03e+01

Pos logit contributions
 mastering+0.081
olving+0.079
stack+0.077
 lived+0.076
und+0.075
Neg logit contributions
 closer-0.054
 sigh-0.050
 bite-0.050
awk-0.049
ax-0.049
Token around
Token ID1088
Feature activation-1.25e+00

Pos logit contributions
five+0.384
umbs+0.381
iffs+0.377
ax+0.367
awk+0.360
Neg logit contributions
 lived-0.321
 mastering-0.303
 tre-0.279
 completing-0.270
 forest-0.270
Token.
Token ID13
Feature activation+6.35e-02

Pos logit contributions
 closer+0.027
 bite+0.024
 sigh+0.024
awk+0.023
ax+0.023
Neg logit contributions
 mastering-0.064
olving-0.063
stack-0.061
 lived-0.060
und-0.059
Token They
Token ID1119
Feature activation+2.25e+00

Pos logit contributions
 mastering+0.003
olving+0.003
stack+0.003
 lived+0.002
ating+0.002
Neg logit contributions
 closer-0.002
 sigh-0.002
 bite-0.002
ax-0.002
awk-0.002
Token see
Token ID766
Feature activation-6.24e+00

Pos logit contributions
 mastering+0.107
 lived+0.107
stack+0.106
olving+0.105
und+0.100
Neg logit contributions
 closer-0.050
awk-0.046
ax-0.046
ls-0.045
hell-0.044
Token their
Token ID511
Feature activation+4.66e-01

Pos logit contributions
 closer+0.144
awk+0.138
ax+0.137
 sigh+0.135
 bite+0.133
Neg logit contributions
 mastering-0.295
olving-0.292
 lived-0.289
stack-0.288
und-0.277
Token Lily
Token ID20037
Feature activation-3.22e+00

Pos logit contributions
 closer+0.035
ax+0.032
 sigh+0.031
 bite+0.030
awk+0.030
Neg logit contributions
 mastering-0.056
olving-0.056
ating-0.053
 tre-0.049
stack-0.049
Token and
Token ID290
Feature activation-5.45e+00

Pos logit contributions
 closer+0.102
 sigh+0.099
 bite+0.097
ax+0.094
awk+0.094
Neg logit contributions
 mastering-0.130
olving-0.129
stack-0.126
 Kang-0.118
und-0.118
Token Ben
Token ID3932
Feature activation-4.51e+00

Pos logit contributions
 closer+0.101
 bite+0.093
 sigh+0.093
ax+0.088
awk+0.087
Neg logit contributions
 mastering-0.296
olving-0.291
stack-0.282
und-0.277
ating-0.273
Token like
Token ID588
Feature activation-8.54e-01

Pos logit contributions
 closer+0.142
 sigh+0.134
 bite+0.133
awk+0.129
ax+0.128
Neg logit contributions
 mastering-0.187
olving-0.185
stack-0.178
und-0.171
 lived-0.167
Token to
Token ID284
Feature activation+3.06e-01

Pos logit contributions
 closer+0.015
 bite+0.014
 sigh+0.013
awk+0.012
ls+0.011
Neg logit contributions
 mastering-0.048
olving-0.046
stack-0.046
und-0.045
 lived-0.045
Token look
Token ID804
Feature activation-1.03e+01

Pos logit contributions
 mastering+0.012
olving+0.012
stack+0.012
 lived+0.012
und+0.011
Neg logit contributions
 closer-0.009
 sigh-0.009
ax-0.009
awk-0.009
 bite-0.009
Token at
Token ID379
Feature activation+1.46e+00

Pos logit contributions
 bite+0.352
 sigh+0.349
hell+0.344
five+0.340
 slice+0.338
Neg logit contributions
 mastering-0.352
 lived-0.331
olving-0.326
 forest-0.309
 tre-0.308
Token the
Token ID262
Feature activation+4.38e-01

Pos logit contributions
 mastering+0.063
olving+0.062
stack+0.060
 lived+0.059
und+0.058
Neg logit contributions
 closer-0.043
 bite-0.040
 sigh-0.040
awk-0.039
ax-0.038
Token bugs
Token ID11316
Feature activation+9.07e-02

Pos logit contributions
 mastering+0.018
olving+0.018
stack+0.017
 lived+0.017
und+0.017
Neg logit contributions
 closer-0.014
 bite-0.013
 sigh-0.012
awk-0.012
ax-0.012
Token and
Token ID290
Feature activation-4.74e+00

Pos logit contributions
 mastering+0.004
olving+0.004
stack+0.004
 lived+0.004
und+0.004
Neg logit contributions
 closer-0.002
 bite-0.002
 sigh-0.002
awk-0.002
ax-0.002
Token wave
Token ID6769
Feature activation-6.27e+00

Pos logit contributions
 closer+0.128
 bite+0.121
 sigh+0.116
awk+0.109
 sip+0.107
Neg logit contributions
 mastering-0.223
olving-0.221
stack-0.212
und-0.211
ating-0.206
Token playing
Token ID2712
Feature activation-5.95e-01

Pos logit contributions
 closer+0.023
 bite+0.023
 sigh+0.022
awk+0.022
ax+0.021
Neg logit contributions
 mastering-0.018
stack-0.018
olving-0.017
und-0.016
jack-0.016
Token.
Token ID13
Feature activation-6.06e-01

Pos logit contributions
 closer+0.018
 bite+0.016
 sigh+0.016
awk+0.016
ax+0.015
Neg logit contributions
 mastering-0.026
olving-0.025
stack-0.024
 lived-0.023
und-0.023
Token
Token ID198
Feature activation-1.39e+00

Pos logit contributions
 closer+0.016
ax+0.015
 sigh+0.014
 bite+0.014
awk+0.013
Neg logit contributions
 mastering-0.028
olving-0.028
ating-0.027
stack-0.025
 tre-0.025
Token
Token ID198
Feature activation-1.40e+00

Pos logit contributions
 closer+0.038
 sigh+0.035
 bite+0.035
awk+0.034
ax+0.034
Neg logit contributions
 mastering-0.063
olving-0.061
stack-0.060
 lived-0.059
und-0.058
TokenThey
Token ID2990
Feature activation-2.90e-01

Pos logit contributions
 closer+0.042
ax+0.038
 bite+0.037
 sigh+0.037
awk+0.037
Neg logit contributions
olving-0.060
 mastering-0.060
stack-0.057
ric-0.055
ating-0.054
Token look
Token ID804
Feature activation-1.03e+01

Pos logit contributions
 closer+0.008
 bite+0.008
 sigh+0.008
ax+0.007
 sip+0.007
Neg logit contributions
olving-0.013
 mastering-0.013
stack-0.012
ating-0.012
 Kang-0.012
Token out
Token ID503
Feature activation-4.54e+00

Pos logit contributions
iffs+0.370
umbs+0.367
 bite+0.365
hell+0.361
awk+0.359
Neg logit contributions
 mastering-0.322
 lived-0.315
 tre-0.287
olving-0.280
 forest-0.278
Token the
Token ID262
Feature activation+4.01e-01

Pos logit contributions
 closer+0.149
awk+0.147
ax+0.145
 bite+0.144
 sigh+0.143
Neg logit contributions
 mastering-0.172
olving-0.165
stack-0.158
 lived-0.157
und-0.153
Token window
Token ID4324
Feature activation+2.92e-01

Pos logit contributions
 mastering+0.017
olving+0.016
stack+0.016
 lived+0.015
und+0.015
Neg logit contributions
 closer-0.013
 bite-0.012
 sigh-0.012
awk-0.012
ax-0.011
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
 mastering+0.015
olving+0.015
stack+0.015
 lived+0.015
und+0.014
Neg logit contributions
 closer-0.006
 sigh-0.005
 bite-0.005
ax-0.005
awk-0.005
Token They
Token ID1119
Feature activation+2.55e+00

Pos logit contributions
 closer+0.005
 sigh+0.005
ax+0.004
 bite+0.004
 sip+0.004
Neg logit contributions
 mastering-0.007
olving-0.007
ating-0.007
stack-0.007
jack-0.007
Token then
Token ID788
Feature activation-5.63e+00

Pos logit contributions
olving+0.004
 mastering+0.004
stack+0.003
ating+0.003
und+0.003
Neg logit contributions
 closer-0.004
 bite-0.003
 sigh-0.003
ls-0.003
 sandwiches-0.003
Token Tom
Token ID4186
Feature activation-5.25e+00

Pos logit contributions
 closer+0.161
 bite+0.150
 sigh+0.150
awk+0.149
ax+0.147
Neg logit contributions
 mastering-0.242
olving-0.236
stack-0.231
 lived-0.228
und-0.223
Token made
Token ID925
Feature activation-3.48e+00

Pos logit contributions
 closer+0.139
 sigh+0.129
 bite+0.128
awk+0.125
ax+0.124
Neg logit contributions
 mastering-0.240
olving-0.236
stack-0.230
 lived-0.223
und-0.221
Token his
Token ID465
Feature activation+3.87e-01

Pos logit contributions
 closer+0.088
awk+0.083
 sigh+0.082
 bite+0.082
ax+0.080
Neg logit contributions
 mastering-0.159
olving-0.157
stack-0.151
 lived-0.150
und-0.147
Token doll
Token ID3654
Feature activation-2.24e-01

Pos logit contributions
 mastering+0.017
olving+0.016
stack+0.016
und+0.016
 lived+0.016
Neg logit contributions
 closer-0.011
 bite-0.011
 sigh-0.011
awk-0.010
ax-0.010
Token look
Token ID804
Feature activation-1.03e+01

Pos logit contributions
 closer+0.006
 sigh+0.005
 bite+0.005
awk+0.005
ax+0.005
Neg logit contributions
 mastering-0.010
olving-0.010
stack-0.010
 lived-0.010
und-0.010
Token crazy
Token ID7165
Feature activation+1.24e+00

Pos logit contributions
awk+0.312
ax+0.295
five+0.294
iffs+0.294
 sigh+0.291
Neg logit contributions
 mastering-0.400
 lived-0.386
 tre-0.386
olving-0.382
und-0.357
Token and
Token ID290
Feature activation-4.85e+00

Pos logit contributions
 mastering+0.063
stack+0.062
olving+0.060
 Kang+0.060
 lived+0.059
Neg logit contributions
 closer-0.029
 bite-0.026
 sigh-0.025
ax-0.023
ls-0.023
Token I
Token ID314
Feature activation-3.13e+00

Pos logit contributions
 closer+0.119
 sigh+0.108
 bite+0.108
ls+0.096
ax+0.094
Neg logit contributions
 mastering-0.237
olving-0.235
stack-0.232
und-0.226
 Kang-0.221
Token tried
Token ID3088
Feature activation-2.49e+00

Pos logit contributions
 closer+0.104
 bite+0.098
 sigh+0.098
awk+0.095
ax+0.093
Neg logit contributions
 mastering-0.125
olving-0.121
stack-0.119
und-0.112
 lived-0.111
Token to
Token ID284
Feature activation+6.76e-01

Pos logit contributions
 closer+0.052
 sigh+0.046
 bite+0.046
awk+0.045
ax+0.045
Neg logit contributions
 mastering-0.128
olving-0.126
stack-0.123
 lived-0.121
und-0.119
Token want
Token ID765
Feature activation-6.92e+00

Pos logit contributions
 closer+0.084
 sigh+0.080
 bite+0.080
awk+0.079
ax+0.077
Neg logit contributions
 mastering-0.093
olving-0.091
stack-0.088
 lived-0.084
und-0.082
Token to
Token ID284
Feature activation+3.38e-02

Pos logit contributions
 closer+0.174
awk+0.161
 sigh+0.161
ax+0.158
 bite+0.156
Neg logit contributions
 mastering-0.316
olving-0.307
 lived-0.302
stack-0.301
und-0.291
Token get
Token ID651
Feature activation-4.62e+00

Pos logit contributions
 mastering+0.002
olving+0.001
stack+0.001
und+0.001
 lived+0.001
Neg logit contributions
 closer-0.001
 bite-0.001
 sigh-0.001
awk-0.001
ax-0.001
Token closer
Token ID5699
Feature activation-8.77e+00

Pos logit contributions
awk+0.116
hell+0.115
 sigh+0.114
ax+0.114
 closer+0.111
Neg logit contributions
 mastering-0.197
olving-0.189
 lived-0.189
stack-0.184
 tre-0.183
Token and
Token ID290
Feature activation-5.26e+00

Pos logit contributions
 closer+0.210
 bite+0.190
 sigh+0.189
awk+0.185
ax+0.182
Neg logit contributions
 mastering-0.426
olving-0.417
stack-0.407
 lived-0.398
und-0.393
Token look
Token ID804
Feature activation-1.03e+01

Pos logit contributions
 closer+0.104
ax+0.100
awk+0.099
 sigh+0.097
 bite+0.096
Neg logit contributions
 mastering-0.263
olving-0.258
stack-0.257
 lived-0.251
und-0.245
Token inside
Token ID2641
Feature activation-1.75e+00

Pos logit contributions
 bite+0.297
awk+0.294
 sigh+0.292
ax+0.289
 closer+0.286
Neg logit contributions
 mastering-0.408
 lived-0.393
olving-0.390
und-0.377
 tre-0.374
Token.
Token ID13
Feature activation-6.27e-01

Pos logit contributions
 closer+0.040
 bite+0.034
 sigh+0.034
ls+0.032
ax+0.031
Neg logit contributions
 mastering-0.090
olving-0.090
stack-0.087
 Kang-0.086
ric-0.084
Token
Token ID198
Feature activation-1.35e+00

Pos logit contributions
 closer+0.019
 sigh+0.018
ax+0.017
 bite+0.017
awk+0.017
Neg logit contributions
 mastering-0.027
olving-0.027
ating-0.025
stack-0.025
 lived-0.024
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
 closer+0.047
 bite+0.046
 sigh+0.045
awk+0.045
ls+0.043
Neg logit contributions
 lived-0.048
 mastering-0.046
und-0.046
olving-0.046
stack-0.045
Token"
Token ID1
Feature activation+7.97e-01

Pos logit contributions
 closer+0.037
 sigh+0.033
 bite+0.033
ax+0.033
awk+0.033
Neg logit contributions
 mastering-0.048
olving-0.047
stack-0.046
 lived-0.044
ric-0.043
Token over
Token ID625
Feature activation-7.17e+00

Pos logit contributions
 bite+0.098
 sigh+0.097
awk+0.094
 slice+0.094
ax+0.093
Neg logit contributions
 mastering-0.183
stack-0.182
olving-0.180
 lived-0.178
und-0.173
Token to
Token ID284
Feature activation+1.30e-01

Pos logit contributions
 closer+0.158
 sigh+0.138
 bite+0.137
ax+0.130
awk+0.130
Neg logit contributions
 mastering-0.367
olving-0.361
stack-0.353
 lived-0.345
und-0.342
Token take
Token ID1011
Feature activation-4.78e+00

Pos logit contributions
 mastering+0.006
olving+0.006
stack+0.005
 lived+0.005
und+0.005
Neg logit contributions
 closer-0.004
 bite-0.004
 sigh-0.004
awk-0.003
ax-0.003
Token a
Token ID257
Feature activation-4.83e+00

Pos logit contributions
 closer+0.108
 bite+0.098
 sigh+0.097
awk+0.094
ax+0.093
Neg logit contributions
 mastering-0.238
olving-0.235
stack-0.231
 lived-0.224
und-0.221
Token closer
Token ID5699
Feature activation-8.79e+00

Pos logit contributions
ax+0.123
irted+0.118
irts+0.114
ls+0.113
ws+0.112
Neg logit contributions
 je-0.193
 forest-0.184
olving-0.181
tails-0.181
 mastering-0.180
Token look
Token ID804
Feature activation-1.03e+01

Pos logit contributions
 closer+0.210
ax+0.199
 sigh+0.194
awk+0.192
 bite+0.190
Neg logit contributions
 mastering-0.403
olving-0.399
stack-0.396
 lived-0.387
und-0.383
Token and
Token ID290
Feature activation-4.83e+00

Pos logit contributions
 closer+0.301
 bite+0.289
 sigh+0.289
awk+0.284
ax+0.281
Neg logit contributions
 mastering-0.424
olving-0.415
 lived-0.402
stack-0.402
und-0.393
Token saw
Token ID2497
Feature activation-2.15e+00

Pos logit contributions
 closer+0.121
awk+0.115
 bite+0.114
 sigh+0.114
ax+0.113
Neg logit contributions
 mastering-0.222
olving-0.216
stack-0.213
 lived-0.211
und-0.205
Token that
Token ID326
Feature activation-4.00e+00

Pos logit contributions
 closer+0.045
 bite+0.038
 sigh+0.037
ax+0.037
ls+0.036
Neg logit contributions
 mastering-0.113
olving-0.111
stack-0.109
 lived-0.104
und-0.104
Token it
Token ID340
Feature activation-9.53e+00

Pos logit contributions
 closer+0.094
 sigh+0.079
 bite+0.079
ax+0.073
ls+0.071
Neg logit contributions
 mastering-0.203
olving-0.200
stack-0.194
und-0.191
ric-0.189
Token was
Token ID373
Feature activation+4.55e+00

Pos logit contributions
 closer+0.375
 bite+0.345
 sigh+0.344
ls+0.320
 yawn+0.310
Neg logit contributions
 mastering-0.346
olving-0.345
stack-0.332
und-0.316
 Kang-0.315
Token liked
Token ID8288
Feature activation+2.27e+00

Pos logit contributions
 mastering+0.047
olving+0.046
stack+0.044
und+0.042
 Kang+0.042
Neg logit contributions
 closer-0.032
 sigh-0.030
 bite-0.030
ax-0.028
awk-0.028
Token to
Token ID284
Feature activation+4.89e-01

Pos logit contributions
 mastering+0.129
stack+0.128
olving+0.128
und+0.125
 lived+0.122
Neg logit contributions
 closer-0.038
 bite-0.034
 sigh-0.032
 burger-0.028
 slice-0.027
Token dig
Token ID3100
Feature activation-2.15e+00

Pos logit contributions
 mastering+0.021
olving+0.020
stack+0.020
und+0.019
 lived+0.019
Neg logit contributions
 closer-0.015
 bite-0.014
 sigh-0.014
awk-0.013
ax-0.013
Token holes
Token ID10421
Feature activation+9.24e-01

Pos logit contributions
ax+0.069
 sigh+0.067
awk+0.066
 closer+0.066
 bite+0.063
Neg logit contributions
 lived-0.082
olving-0.082
 mastering-0.081
stack-0.078
und-0.078
Token and
Token ID290
Feature activation-4.73e+00

Pos logit contributions
 mastering+0.044
olving+0.043
stack+0.042
 lived+0.041
und+0.041
Neg logit contributions
 closer-0.023
 sigh-0.021
 bite-0.021
awk-0.021
ax-0.020
Token look
Token ID804
Feature activation-1.02e+01

Pos logit contributions
 closer+0.132
 bite+0.123
 sigh+0.120
awk+0.114
 sip+0.111
Neg logit contributions
 mastering-0.214
olving-0.209
stack-0.202
und-0.199
 lived-0.197
Token for
Token ID329
Feature activation+1.90e-01

Pos logit contributions
ax+0.334
awk+0.331
 bite+0.326
five+0.324
 sigh+0.324
Neg logit contributions
 mastering-0.374
 lived-0.360
olving-0.355
 tre-0.338
 completing-0.330
Token treasures
Token ID29561
Feature activation+4.52e+00

Pos logit contributions
 mastering+0.009
stack+0.009
olving+0.009
und+0.008
 lived+0.008
Neg logit contributions
 closer-0.005
 bite-0.005
 sigh-0.004
ls-0.004
awk-0.004
Token.
Token ID13
Feature activation-4.96e-01

Pos logit contributions
 mastering+0.189
olving+0.187
stack+0.182
 lived+0.178
und+0.175
Neg logit contributions
 closer-0.132
 sigh-0.126
awk-0.124
 bite-0.124
ax-0.122
Token Sara
Token ID24799
Feature activation-3.37e+00

Pos logit contributions
 closer+0.013
 sigh+0.011
ax+0.011
 bite+0.011
awk+0.011
Neg logit contributions
 mastering-0.024
olving-0.023
ating-0.022
stack-0.022
und-0.021
Token found
Token ID1043
Feature activation-3.78e+00

Pos logit contributions
 closer+0.114
 sigh+0.108
 bite+0.108
ax+0.105
awk+0.104
Neg logit contributions
 mastering-0.130
olving-0.128
stack-0.123
und-0.118
ric-0.116
Token Lily
Token ID20037
Feature activation-5.18e+00

Pos logit contributions
awk+0.003
 closer+0.003
ax+0.003
 bite+0.003
five+0.002
Neg logit contributions
 mastering-0.004
olving-0.004
 lived-0.004
stack-0.004
 rarely-0.003
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
 closer+0.152
 sigh+0.141
 bite+0.141
awk+0.139
ax+0.137
Neg logit contributions
 mastering-0.222
olving-0.217
stack-0.212
 lived-0.206
und-0.204
Token She
Token ID1375
Feature activation+2.34e+00

Pos logit contributions
 closer+0.027
 sigh+0.025
ax+0.024
 Anna+0.023
 sip+0.023
Neg logit contributions
 mastering-0.049
olving-0.048
ating-0.047
 tre-0.043
stack-0.043
Token loved
Token ID6151
Feature activation+1.75e+00

Pos logit contributions
 mastering+0.098
olving+0.097
stack+0.093
und+0.088
 Kang+0.088
Neg logit contributions
 closer-0.072
 sigh-0.069
 bite-0.068
ax-0.065
awk-0.064
Token to
Token ID284
Feature activation+4.56e-01

Pos logit contributions
stack+0.092
 mastering+0.091
olving+0.091
und+0.088
jack+0.088
Neg logit contributions
 bite-0.034
 closer-0.034
 sigh-0.032
 burger-0.028
 sandwiches-0.028
Token look
Token ID804
Feature activation-1.02e+01

Pos logit contributions
 mastering+0.019
olving+0.018
stack+0.018
 lived+0.017
und+0.017
Neg logit contributions
 closer-0.014
 bite-0.013
 sigh-0.013
awk-0.013
ax-0.013
Token up
Token ID510
Feature activation-6.20e+00

Pos logit contributions
 bite+0.386
 sigh+0.378
five+0.373
umbs+0.369
 slice+0.368
Neg logit contributions
 mastering-0.321
olving-0.300
 lived-0.289
 tre-0.287
 forest-0.280
Token at
Token ID379
Feature activation+1.73e+00

Pos logit contributions
 closer+0.184
 sigh+0.184
 bite+0.182
awk+0.180
ax+0.179
Neg logit contributions
 mastering-0.248
olving-0.243
 lived-0.228
und-0.228
stack-0.228
Token the
Token ID262
Feature activation+3.06e-01

Pos logit contributions
 mastering+0.077
olving+0.075
stack+0.074
und+0.071
 lived+0.071
Neg logit contributions
 closer-0.050
 bite-0.045
 sigh-0.045
ax-0.043
awk-0.043
Token sky
Token ID6766
Feature activation+6.63e-01

Pos logit contributions
 mastering+0.012
olving+0.012
stack+0.011
und+0.011
 lived+0.011
Neg logit contributions
 closer-0.010
 bite-0.010
 sigh-0.009
awk-0.009
ax-0.009
Token and
Token ID290
Feature activation-4.51e+00

Pos logit contributions
 mastering+0.034
olving+0.033
stack+0.032
 lived+0.032
und+0.032
Neg logit contributions
 closer-0.015
 bite-0.013
 sigh-0.013
awk-0.012
ax-0.012
Tokenlf
Token ID1652
Feature activation+5.18e-01

Pos logit contributions
 closer+0.024
 bite+0.024
 sigh+0.023
ax+0.022
ls+0.022
Neg logit contributions
olving-0.037
 mastering-0.037
 Kang-0.036
stack-0.035
ric-0.034
Token.
Token ID13
Feature activation-6.36e-01

Pos logit contributions
 mastering+0.022
 Kang+0.022
olving+0.022
stack+0.021
 Zo+0.021
Neg logit contributions
 closer-0.015
 bite-0.014
 sigh-0.013
ax-0.013
 slice-0.013
Token She
Token ID1375
Feature activation+2.80e+00

Pos logit contributions
 closer+0.018
ax+0.016
 sigh+0.016
 bite+0.015
five+0.015
Neg logit contributions
 mastering-0.028
olving-0.028
ating-0.027
stack-0.025
ric-0.025
Token likes
Token ID7832
Feature activation+1.71e+00

Pos logit contributions
olving+0.117
 mastering+0.116
stack+0.113
 Kang+0.110
ating+0.106
Neg logit contributions
 closer-0.086
 sigh-0.086
 bite-0.082
ax-0.075
 sip-0.074
Token to
Token ID284
Feature activation+6.16e-01

Pos logit contributions
stack+0.094
 mastering+0.093
olving+0.092
 Kang+0.090
und+0.089
Neg logit contributions
 closer-0.031
 bite-0.030
 sigh-0.028
 slice-0.024
 burger-0.023
Token look
Token ID804
Feature activation-1.02e+01

Pos logit contributions
 mastering+0.027
olving+0.027
stack+0.026
und+0.025
 Kang+0.025
Neg logit contributions
 bite-0.017
 closer-0.017
 sigh-0.017
 sip-0.015
awk-0.015
Token at
Token ID379
Feature activation+1.69e+00

Pos logit contributions
 bite+0.396
umbs+0.389
awk+0.384
 sigh+0.383
five+0.383
Neg logit contributions
 mastering-0.310
olving-0.282
 lived-0.275
 tre-0.270
 completing-0.267
Token the
Token ID262
Feature activation-1.37e-02

Pos logit contributions
 mastering+0.076
stack+0.075
olving+0.074
 Kang+0.072
 lived+0.071
Neg logit contributions
 closer-0.047
 bite-0.042
 sigh-0.042
ls-0.039
ax-0.039
Token pictures
Token ID5986
Feature activation-1.73e+00

Pos logit contributions
 closer+0.000
 bite+0.000
 sigh+0.000
hell+0.000
 sandwich+0.000
Neg logit contributions
 mastering-0.001
olving-0.001
stack-0.001
 lived-0.001
und-0.001
Token and
Token ID290
Feature activation-4.45e+00

Pos logit contributions
 closer+0.048
 sigh+0.044
 bite+0.044
awk+0.044
ax+0.043
Neg logit contributions
 mastering-0.077
olving-0.075
stack-0.073
 lived-0.072
und-0.071
Token the
Token ID262
Feature activation+2.22e-01

Pos logit contributions
 closer+0.111
 bite+0.103
 sigh+0.102
awk+0.090
 sip+0.089
Neg logit contributions
 mastering-0.212
olving-0.212
stack-0.210
und-0.205
 Kang-0.200
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
 closer+0.215
 sigh+0.199
 bite+0.197
awk+0.192
ax+0.191
Neg logit contributions
 mastering-0.320
olving-0.313
stack-0.301
und-0.291
 Kang-0.286
Token "
Token ID366
Feature activation+3.74e+00

Pos logit contributions
 closer+0.005
 bite+0.005
 sigh+0.005
awk+0.005
ax+0.005
Neg logit contributions
 mastering-0.005
olving-0.005
stack-0.005
 lived-0.005
und-0.005
TokenS
Token ID50
Feature activation-1.24e+00

Pos logit contributions
 mastering+0.147
olving+0.144
stack+0.134
 lived+0.130
 completing+0.129
Neg logit contributions
 closer-0.130
 bite-0.123
five-0.121
awk-0.120
 sigh-0.120
Tokenara
Token ID3301
Feature activation-3.33e+00

Pos logit contributions
 closer+0.024
 sigh+0.021
 bite+0.021
awk+0.021
ax+0.020
Neg logit contributions
 mastering-0.066
olving-0.065
stack-0.064
 lived-0.063
und-0.062
Token,
Token ID11
Feature activation+6.81e-01

Pos logit contributions
 bite+0.108
 closer+0.108
 sigh+0.107
awk+0.104
ax+0.098
Neg logit contributions
 mastering-0.136
olving-0.129
stack-0.126
 Kang-0.119
 perspective-0.118
Token look
Token ID804
Feature activation-1.02e+01

Pos logit contributions
 mastering+0.030
olving+0.030
stack+0.029
und+0.028
 lived+0.028
Neg logit contributions
 closer-0.019
 bite-0.018
 sigh-0.018
awk-0.017
ax-0.017
Token,
Token ID11
Feature activation+9.84e-01

Pos logit contributions
five+0.356
 bite+0.354
 sigh+0.348
awk+0.340
iffs+0.339
Neg logit contributions
 lived-0.320
 mastering-0.318
und-0.311
olving-0.290
 forest-0.287
Token a
Token ID257
Feature activation-4.44e+00

Pos logit contributions
 mastering+0.044
olving+0.043
stack+0.043
 lived+0.042
und+0.041
Neg logit contributions
 closer-0.026
 sigh-0.024
 bite-0.024
awk-0.024
ax-0.023
Token cloth
Token ID16270
Feature activation-3.93e-01

Pos logit contributions
 closer+0.111
 sigh+0.102
 bite+0.102
awk+0.100
ax+0.100
Neg logit contributions
 mastering-0.207
olving-0.203
stack-0.199
 lived-0.195
und-0.192
Token!"
Token ID2474
Feature activation-4.56e+00

Pos logit contributions
 closer+0.011
 bite+0.010
 sigh+0.010
awk+0.010
ax+0.009
Neg logit contributions
 mastering-0.018
olving-0.018
stack-0.017
 lived-0.017
und-0.016
Token
Token ID198
Feature activation-1.30e+00

Pos logit contributions
 closer+0.155
 bite+0.148
awk+0.145
 sigh+0.140
hell+0.138
Neg logit contributions
stack-0.166
 mastering-0.164
 lived-0.160
und-0.158
olving-0.154
TokenEll
Token ID30639
Feature activation-1.29e+00

Pos logit contributions
 mastering+0.063
olving+0.061
stack+0.059
und+0.058
 lived+0.058
Neg logit contributions
 closer-0.037
 bite-0.034
 sigh-0.033
ax-0.032
awk-0.031
Tokenie
Token ID494
Feature activation-1.99e+00

Pos logit contributions
 closer+0.041
 bite+0.039
awk+0.039
 sigh+0.038
ax+0.037
Neg logit contributions
stack-0.049
 mastering-0.049
olving-0.049
 Kang-0.047
 Zo-0.045
Token's
Token ID338
Feature activation-5.24e-01

Pos logit contributions
 sigh+0.057
 closer+0.056
 bite+0.053
 yawn+0.050
awk+0.050
Neg logit contributions
 mastering-0.087
olving-0.086
stack-0.079
 Kang-0.079
umm-0.079
Token friend
Token ID1545
Feature activation+8.38e-01

Pos logit contributions
 closer+0.014
 bite+0.014
 sigh+0.013
 sandwich+0.013
 slice+0.012
Neg logit contributions
 mastering-0.025
und-0.024
ating-0.023
olving-0.023
stack-0.023
Token thought
Token ID1807
Feature activation-5.66e+00

Pos logit contributions
 mastering+0.036
olving+0.036
stack+0.034
ric+0.034
 Kang+0.033
Neg logit contributions
 closer-0.027
 sigh-0.024
 bite-0.023
ax-0.023
awk-0.022
Token it
Token ID340
Feature activation-1.02e+01

Pos logit contributions
 closer+0.140
 sigh+0.126
 bite+0.124
ax+0.117
ls+0.117
Neg logit contributions
 mastering-0.273
olving-0.267
stack-0.265
 Kang-0.254
und-0.254
Token was
Token ID373
Feature activation+3.92e+00

Pos logit contributions
 closer+0.442
 bite+0.408
 sigh+0.407
awk+0.385
hell+0.385
Neg logit contributions
 mastering-0.324
olving-0.307
stack-0.304
und-0.284
 Kang-0.283
Token crazy
Token ID7165
Feature activation+9.55e-01

Pos logit contributions
 mastering+0.197
stack+0.194
olving+0.191
und+0.188
 Kang+0.184
Neg logit contributions
 closer-0.095
 bite-0.083
 sigh-0.080
ax-0.076
 sandwich-0.073
Token how
Token ID703
Feature activation-2.68e+00

Pos logit contributions
 mastering+0.044
olving+0.043
stack+0.043
 lived+0.041
und+0.041
Neg logit contributions
 closer-0.025
 sigh-0.023
 bite-0.023
awk-0.022
ax-0.022
Token much
Token ID881
Feature activation-3.32e+00

Pos logit contributions
 closer+0.060
 sigh+0.052
 bite+0.052
ls+0.049
awk+0.049
Neg logit contributions
 mastering-0.137
olving-0.134
stack-0.134
und-0.128
 lived-0.127
Token Ellie
Token ID44801
Feature activation-1.23e+00

Pos logit contributions
 closer+0.073
 bite+0.067
 sigh+0.063
ls+0.056
 sip+0.054
Neg logit contributions
 mastering-0.177
stack-0.175
ric-0.173
olving-0.172
und-0.168
Token reach
Token ID3151
Feature activation-4.26e+00

Pos logit contributions
ax+0.222
awk+0.214
 burger+0.205
hell+0.202
 bite+0.201
Neg logit contributions
 mastering-0.191
 lived-0.190
olving-0.183
 tre-0.180
 forest-0.172
Token the
Token ID262
Feature activation-8.46e-01

Pos logit contributions
 closer+0.116
 sigh+0.108
 bite+0.107
awk+0.107
ax+0.105
Neg logit contributions
 mastering-0.188
olving-0.185
stack-0.181
 lived-0.177
und-0.174
Token top
Token ID1353
Feature activation-1.31e+00

Pos logit contributions
 closer+0.026
 bite+0.025
 sigh+0.024
awk+0.023
ax+0.023
Neg logit contributions
 mastering-0.036
olving-0.035
 lived-0.033
stack-0.033
und-0.033
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
 closer+0.037
 bite+0.034
 sigh+0.034
ax+0.033
awk+0.032
Neg logit contributions
 mastering-0.058
olving-0.057
stack-0.055
 lived-0.054
und-0.053
Token They
Token ID1119
Feature activation+1.78e+00

Pos logit contributions
 closer+0.005
ax+0.004
 sigh+0.004
 bite+0.004
awk+0.004
Neg logit contributions
 mastering-0.007
olving-0.007
ating-0.007
stack-0.006
ric-0.006
Token look
Token ID804
Feature activation-1.02e+01

Pos logit contributions
 mastering+0.081
olving+0.080
stack+0.075
ating+0.074
 Kang+0.074
Neg logit contributions
 closer-0.050
 bite-0.048
 sigh-0.047
ax-0.044
awk-0.043
Token down
Token ID866
Feature activation-1.54e-01

Pos logit contributions
umbs+0.382
iffs+0.379
awk+0.372
five+0.367
 bite+0.360
Neg logit contributions
 mastering-0.318
 lived-0.307
 tre-0.284
olving-0.277
 completing-0.275
Token and
Token ID290
Feature activation-4.54e+00

Pos logit contributions
awk+0.004
 bite+0.004
hell+0.004
 sigh+0.004
five+0.004
Neg logit contributions
 mastering-0.006
olving-0.006
stack-0.006
und-0.006
 lived-0.006
Token see
Token ID766
Feature activation-6.23e+00

Pos logit contributions
awk+0.095
 closer+0.094
ax+0.092
hell+0.091
 bite+0.089
Neg logit contributions
 mastering-0.221
 lived-0.215
stack-0.215
olving-0.211
 tre-0.203
Token a
Token ID257
Feature activation-4.24e+00

Pos logit contributions
 closer+0.149
 sigh+0.130
 bite+0.130
ax+0.124
awk+0.123
Neg logit contributions
 mastering-0.309
olving-0.302
stack-0.295
 lived-0.287
und-0.285
Token green
Token ID4077
Feature activation+6.85e-03

Pos logit contributions
 closer+0.102
 bite+0.092
 sigh+0.091
awk+0.083
ax+0.082
Neg logit contributions
 mastering-0.211
olving-0.207
stack-0.201
 lived-0.199
und-0.198
Token fun
Token ID1257
Feature activation-7.34e-01

Pos logit contributions
awk+0.093
ax+0.092
 closer+0.089
five+0.084
 sigh+0.083
Neg logit contributions
 lived-0.163
 mastering-0.163
olving-0.161
stack-0.156
 tre-0.155
Token.
Token ID13
Feature activation+6.55e-01

Pos logit contributions
 closer+0.018
 bite+0.016
 sigh+0.016
awk+0.015
ax+0.015
Neg logit contributions
 mastering-0.035
olving-0.035
stack-0.034
 lived-0.033
und-0.033
Token What
Token ID1867
Feature activation+2.53e+00

Pos logit contributions
 mastering+0.026
olving+0.025
stack+0.024
und+0.024
 lived+0.023
Neg logit contributions
 closer-0.021
 sigh-0.020
 bite-0.020
awk-0.019
ax-0.019
Token can
Token ID460
Feature activation+4.72e+00

Pos logit contributions
 lived+0.087
olving+0.084
 mastering+0.083
 tre+0.073
und+0.072
Neg logit contributions
 closer-0.095
awk-0.095
ax-0.094
 sigh-0.091
ls-0.087
Token we
Token ID356
Feature activation-4.61e+00

Pos logit contributions
 mastering+0.183
 lived+0.182
olving+0.176
stack+0.174
ating+0.167
Neg logit contributions
awk-0.150
 closer-0.149
ax-0.144
hell-0.139
 sigh-0.133
Token do
Token ID466
Feature activation+6.61e+00

Pos logit contributions
 closer+0.158
 bite+0.156
 sigh+0.149
awk+0.143
 sip+0.142
Neg logit contributions
 mastering-0.178
stack-0.171
olving-0.168
und-0.164
 Kang-0.162
Token?"
Token ID1701
Feature activation-4.90e+00

Pos logit contributions
 mastering+0.279
olving+0.277
 lived+0.264
stack+0.257
ating+0.255
Neg logit contributions
 closer-0.195
awk-0.188
 sigh-0.187
ax-0.185
 bite-0.182
Token
Token ID198
Feature activation-7.57e-01

Pos logit contributions
 closer+0.145
 bite+0.134
awk+0.134
 sigh+0.133
ax+0.132
Neg logit contributions
 mastering-0.210
olving-0.204
stack-0.201
 lived-0.198
und-0.194
Token
Token ID198
Feature activation-7.62e-01

Pos logit contributions
awk+0.029
 closer+0.029
ax+0.029
 bite+0.028
hell+0.027
Neg logit contributions
 lived-0.024
 mastering-0.022
und-0.022
olving-0.021
 neighbourhood-0.021
TokenThey
Token ID2990
Feature activation+8.60e-01

Pos logit contributions
 closer+0.021
 sigh+0.019
 bite+0.019
awk+0.019
ax+0.019
Neg logit contributions
 mastering-0.035
olving-0.034
stack-0.034
 lived-0.032
und-0.032
Token pick
Token ID2298
Feature activation-1.55e+00

Pos logit contributions
 mastering+0.040
 lived+0.039
stack+0.039
olving+0.038
und+0.037
Neg logit contributions
ls-0.019
awk-0.019
 closer-0.018
hell-0.018
 burger-0.018
Token was
Token ID373
Feature activation+4.64e+00

Pos logit contributions
 mastering+0.103
olving+0.100
stack+0.098
und+0.093
ric+0.092
Neg logit contributions
 closer-0.071
 bite-0.067
 sigh-0.066
 sandwich-0.061
ls-0.059
Token awesome
Token ID7427
Feature activation-9.04e-01

Pos logit contributions
stack+0.245
 mastering+0.239
olving+0.237
und+0.236
jack+0.226
Neg logit contributions
 closer-0.108
 bite-0.089
 sigh-0.085
 harder-0.083
ax-0.079
Token!
Token ID0
Feature activation+1.83e-01

Pos logit contributions
 closer+0.024
 sigh+0.022
ls+0.021
 bite+0.021
 sandwich+0.020
Neg logit contributions
 mastering-0.042
stack-0.042
olving-0.042
ric-0.042
 Australian-0.039
Token Let
Token ID3914
Feature activation-3.41e-01

Pos logit contributions
 mastering+0.009
olving+0.008
ating+0.008
 forest+0.008
stack+0.008
Neg logit contributions
 closer-0.005
five-0.004
 sigh-0.004
ax-0.004
iffs-0.004
Token's
Token ID338
Feature activation+1.42e+00

Pos logit contributions
 closer+0.011
 bite+0.010
 sigh+0.010
awk+0.010
ax+0.010
Neg logit contributions
 lived-0.013
 mastering-0.013
und-0.012
olving-0.012
stack-0.012
Token do
Token ID466
Feature activation+6.29e+00

Pos logit contributions
 mastering+0.072
olving+0.071
 importantly+0.069
und+0.069
ating+0.068
Neg logit contributions
 bite-0.033
 closer-0.031
 look-0.030
 sigh-0.028
 sip-0.027
Token it
Token ID340
Feature activation-9.29e+00

Pos logit contributions
 mastering+0.335
olving+0.323
 lived+0.318
stack+0.311
und+0.310
Neg logit contributions
 closer-0.116
awk-0.111
ax-0.106
 bite-0.104
 sigh-0.102
Token again
Token ID757
Feature activation-3.77e+00

Pos logit contributions
 closer+0.216
 sigh+0.201
awk+0.198
 bite+0.198
ax+0.193
Neg logit contributions
 mastering-0.450
olving-0.441
stack-0.429
 lived-0.426
und-0.421
Token soon
Token ID2582
Feature activation-5.58e-01

Pos logit contributions
 closer+0.096
 bite+0.087
 sigh+0.087
awk+0.085
ax+0.084
Neg logit contributions
 mastering-0.177
olving-0.174
stack-0.171
 lived-0.167
und-0.164
Token?"
Token ID1701
Feature activation-5.14e+00

Pos logit contributions
 closer+0.013
 sigh+0.011
 bite+0.011
awk+0.011
ax+0.011
Neg logit contributions
 mastering-0.027
olving-0.027
stack-0.026
 lived-0.026
und-0.026
Token
Token ID220
Feature activation-1.89e+00

Pos logit contributions
 sigh+0.119
 closer+0.118
ax+0.103
five+0.102
awk+0.102
Neg logit contributions
olving-0.254
 mastering-0.246
ating-0.238
stack-0.236
jack-0.233
Token always
Token ID1464
Feature activation+4.74e+00

Pos logit contributions
 mastering+0.133
olving+0.131
stack+0.127
und+0.121
ric+0.117
Neg logit contributions
 closer-0.157
 bite-0.152
 sigh-0.151
ax-0.144
awk-0.143
Token thought
Token ID1807
Feature activation-4.59e+00

Pos logit contributions
stack+0.213
 mastering+0.211
olving+0.210
 Kang+0.198
und+0.197
Neg logit contributions
 closer-0.131
 sigh-0.130
 bite-0.127
 yawn-0.117
awk-0.112
Token of
Token ID286
Feature activation+3.23e+00

Pos logit contributions
 closer+0.123
 sigh+0.113
 bite+0.113
awk+0.110
ax+0.110
Neg logit contributions
 mastering-0.208
olving-0.204
stack-0.200
 lived-0.195
und-0.193
Token ways
Token ID2842
Feature activation+8.12e+00

Pos logit contributions
stack+0.170
jack+0.165
 mastering+0.161
 Kang+0.158
und+0.157
Neg logit contributions
 closer-0.067
 bite-0.061
 sigh-0.059
 look-0.056
 sure-0.055
Token she
Token ID673
Feature activation-1.08e+00

Pos logit contributions
 mastering+0.492
stack+0.489
 Kang+0.482
olving+0.479
und+0.477
Neg logit contributions
 closer-0.112
 bite-0.071
 sigh-0.066
 back-0.062
ax-0.057
Token could
Token ID714
Feature activation+6.98e+00

Pos logit contributions
 closer+0.040
 bite+0.037
 sigh+0.036
ls+0.035
bs+0.034
Neg logit contributions
stack-0.041
 mastering-0.039
olving-0.039
und-0.039
ric-0.038
Token help
Token ID1037
Feature activation+6.49e-01

Pos logit contributions
 mastering+0.319
olving+0.310
und+0.309
stack+0.307
 Kang+0.302
Neg logit contributions
 bite-0.192
 closer-0.189
 sigh-0.177
 sip-0.161
 look-0.157
Token her
Token ID607
Feature activation+1.53e-01

Pos logit contributions
stack+0.024
 mastering+0.024
olving+0.024
und+0.023
 lived+0.023
Neg logit contributions
 closer-0.023
 bite-0.022
 sigh-0.021
ax-0.021
awk-0.021
Token family
Token ID1641
Feature activation+3.06e+00

Pos logit contributions
stack+0.007
und+0.007
 mastering+0.006
olving+0.006
jack+0.006
Neg logit contributions
 closer-0.005
 bite-0.005
 sigh-0.004
awk-0.004
 sandwich-0.004
Token and
Token ID290
Feature activation-4.14e+00

Pos logit contributions
und+0.164
stack+0.164
 mastering+0.162
 Zo+0.161
 Kang+0.160
Neg logit contributions
 closer-0.064
 bite-0.059
 look-0.051
 sigh-0.049
 back-0.049
Token friends
Token ID2460
Feature activation+3.50e+00

Pos logit contributions
 closer+0.125
 look+0.121
 bite+0.119
 sigh+0.113
 sure+0.111
Neg logit contributions
und-0.181
 Kang-0.179
umm-0.172
stack-0.170
 mastering-0.169
Token were
Token ID547
Feature activation+6.33e+00

Pos logit contributions
 mastering+0.082
olving+0.079
ric+0.078
 Kang+0.074
stack+0.073
Neg logit contributions
 closer-0.077
 bite-0.072
 sigh-0.072
ls-0.068
 sip-0.065
Token very
Token ID845
Feature activation+2.06e+00

Pos logit contributions
 mastering+0.293
olving+0.289
stack+0.283
und+0.278
 lived+0.275
Neg logit contributions
 closer-0.173
 bite-0.154
 sigh-0.153
ax-0.143
 slice-0.136
Token popular
Token ID2968
Feature activation+4.00e+00

Pos logit contributions
 mastering+0.103
olving+0.101
stack+0.100
 lived+0.097
und+0.097
Neg logit contributions
 closer-0.047
 sigh-0.042
 bite-0.042
awk-0.040
ax-0.040
Token and
Token ID290
Feature activation-3.71e+00

Pos logit contributions
ric+0.230
stack+0.221
 Australian+0.221
 Kang+0.215
 mastering+0.213
Neg logit contributions
 closer-0.068
 sigh-0.056
 back-0.053
 bite-0.053
ls-0.053
Token his
Token ID465
Feature activation+1.80e+00

Pos logit contributions
 closer+0.094
 sigh+0.080
 bite+0.079
 sure+0.073
 burger+0.071
Neg logit contributions
 mastering-0.179
olving-0.176
ric-0.176
und-0.173
stack-0.173
Token siblings
Token ID20569
Feature activation+6.20e+00

Pos logit contributions
 mastering+0.072
und+0.069
stack+0.068
olving+0.066
ric+0.065
Neg logit contributions
 closer-0.060
 sigh-0.059
 bite-0.058
 sandwich-0.055
ls-0.055
Token praised
Token ID15342
Feature activation-1.20e+00

Pos logit contributions
olving+0.245
 mastering+0.238
 Kang+0.236
stack+0.225
 situations+0.224
Neg logit contributions
 sigh-0.204
 closer-0.195
 bite-0.193
ax-0.181
 yawn-0.179
Token him
Token ID683
Feature activation-1.20e+00

Pos logit contributions
 closer+0.032
 sigh+0.027
 bite+0.026
 back+0.024
 harder+0.024
Neg logit contributions
stack-0.059
 mastering-0.058
ric-0.057
olving-0.056
 Kang-0.056
Token.
Token ID13
Feature activation+4.78e-01

Pos logit contributions
 closer+0.027
 sigh+0.024
 bite+0.024
hell+0.022
ax+0.021
Neg logit contributions
 mastering-0.061
olving-0.059
stack-0.059
und-0.058
 lived-0.057
Token Max
Token ID5436
Feature activation-2.57e-01

Pos logit contributions
olving+0.022
 mastering+0.021
ating+0.021
 tre+0.019
stack+0.019
Neg logit contributions
ax-0.012
 closer-0.012
 sigh-0.012
 Anna-0.011
 Dad-0.011
Token was
Token ID373
Feature activation+5.22e+00

Pos logit contributions
 sigh+0.008
 closer+0.008
 bite+0.007
 yawn+0.007
ax+0.007
Neg logit contributions
olving-0.011
 mastering-0.011
stack-0.010
 situations-0.010
 perspective-0.010
Token from
Token ID422
Feature activation+4.49e+00

Pos logit contributions
 mastering+0.098
olving+0.096
stack+0.094
 lived+0.092
und+0.091
Neg logit contributions
 closer-0.055
 sigh-0.050
 bite-0.050
awk-0.048
ax-0.047
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
 mastering+0.196
olving+0.196
und+0.187
stack+0.185
 lived+0.183
Neg logit contributions
 closer-0.135
 bite-0.120
 sigh-0.115
awk-0.107
ls-0.106
Token top
Token ID1353
Feature activation+2.82e-01

Pos logit contributions
 mastering+0.045
und+0.043
olving+0.042
ating+0.042
 lived+0.039
Neg logit contributions
 closer-0.049
 bite-0.047
 sigh-0.047
 back-0.041
 slice-0.041
Token of
Token ID286
Feature activation+3.76e+00

Pos logit contributions
 mastering+0.015
olving+0.014
stack+0.014
 lived+0.014
ric+0.014
Neg logit contributions
 closer-0.007
 bite-0.006
 sigh-0.005
ls-0.005
 harder-0.005
Token the
Token ID262
Feature activation+1.25e+00

Pos logit contributions
 mastering+0.180
olving+0.179
und+0.170
arians+0.170
stack+0.170
Neg logit contributions
 closer-0.099
 bite-0.090
 sigh-0.085
 sip-0.074
ax-0.074
Token chim
Token ID18205
Feature activation+9.63e+00

Pos logit contributions
 mastering+0.038
olving+0.035
und+0.034
ating+0.033
 lived+0.032
Neg logit contributions
 closer-0.057
 sigh-0.055
 bite-0.055
 slice-0.049
ls-0.049
Tokenney
Token ID1681
Feature activation+6.47e+00

Pos logit contributions
 mastering+0.326
 lived+0.288
olving+0.286
 Australian+0.274
 completing+0.272
Neg logit contributions
 closer-0.357
 sigh-0.347
ax-0.335
ls-0.334
 slice-0.331
Token!
Token ID0
Feature activation+4.32e-01

Pos logit contributions
 mastering+0.339
olving+0.325
 Kang+0.318
 Australian+0.314
 Zo+0.314
Neg logit contributions
 closer-0.146
 bite-0.123
 sigh-0.122
 back-0.108
 harder-0.107
Token<|endoftext|>
Token ID50256
Feature activation-2.85e+00

Pos logit contributions
 mastering+0.020
olving+0.020
stack+0.019
 lived+0.019
ating+0.018
Neg logit contributions
 closer-0.011
 sigh-0.010
ax-0.010
awk-0.010
 bite-0.010
TokenOnce
Token ID7454
Feature activation+2.46e+00

Pos logit contributions
 closer+0.090
 sigh+0.083
 bite+0.082
ax+0.081
ls+0.080
Neg logit contributions
 mastering-0.119
olving-0.115
stack-0.109
 rarely-0.104
 tre-0.104
Token upon
Token ID2402
Feature activation+3.93e+00

Pos logit contributions
 lived+0.079
olving+0.074
 mastering+0.071
stack+0.068
und+0.066
Neg logit contributions
five-0.096
 bite-0.095
awk-0.095
 closer-0.094
 sigh-0.093
Token felt
Token ID2936
Feature activation+9.85e-01

Pos logit contributions
 mastering+0.186
olving+0.185
stack+0.178
und+0.172
ric+0.169
Neg logit contributions
 closer-0.115
 sigh-0.107
 bite-0.106
ax-0.100
awk-0.097
Token very
Token ID845
Feature activation+2.12e+00

Pos logit contributions
 mastering+0.044
olving+0.042
 lived+0.041
stack+0.041
und+0.040
Neg logit contributions
 closer-0.027
awk-0.025
 bite-0.025
 sigh-0.025
ax-0.025
Token content
Token ID2695
Feature activation+2.03e+00

Pos logit contributions
 mastering+0.098
olving+0.095
 lived+0.093
 tre+0.091
stack+0.089
Neg logit contributions
 closer-0.051
awk-0.050
 bite-0.050
ax-0.050
hell-0.048
Token,
Token ID11
Feature activation+2.39e+00

Pos logit contributions
 mastering+0.095
olving+0.093
stack+0.091
 lived+0.089
und+0.089
Neg logit contributions
 closer-0.052
 sigh-0.048
 bite-0.047
ax-0.045
awk-0.045
Token knowing
Token ID6970
Feature activation+3.76e-01

Pos logit contributions
 mastering+0.099
olving+0.095
stack+0.093
 lived+0.091
 tre+0.089
Neg logit contributions
 closer-0.073
awk-0.070
 bite-0.069
ls-0.069
ax-0.068
Token her
Token ID607
Feature activation+5.29e-01

Pos logit contributions
olving+0.018
 mastering+0.018
stack+0.018
ric+0.018
und+0.018
Neg logit contributions
 closer-0.009
 sigh-0.008
 bite-0.007
ax-0.007
ls-0.007
Token mum
Token ID25682
Feature activation-2.25e+00

Pos logit contributions
 mastering+0.024
olving+0.023
stack+0.023
und+0.022
 lived+0.022
Neg logit contributions
 closer-0.015
 sigh-0.014
 bite-0.014
awk-0.013
ax-0.013
Token had
Token ID550
Feature activation+2.48e+00

Pos logit contributions
 closer+0.070
 sigh+0.064
 bite+0.061
ax+0.058
 yawn+0.056
Neg logit contributions
olving-0.096
 mastering-0.095
 Kang-0.093
und-0.092
 situations-0.092
Token stepped
Token ID10764
Feature activation-1.64e+00

Pos logit contributions
 mastering+0.106
olving+0.104
stack+0.103
 lived+0.099
und+0.099
Neg logit contributions
 closer-0.074
 sigh-0.068
 bite-0.068
ax-0.066
awk-0.066
Token into
Token ID656
Feature activation+3.17e+00

Pos logit contributions
 bite+0.038
awk+0.038
 sigh+0.037
 closer+0.037
ax+0.037
Neg logit contributions
 mastering-0.076
olving-0.075
stack-0.073
und-0.071
 lived-0.070
Token her
Token ID607
Feature activation+5.45e-01

Pos logit contributions
 mastering+0.142
olving+0.140
stack+0.138
 lived+0.134
und+0.133
Neg logit contributions
 closer-0.090
 sigh-0.083
 bite-0.082
ax-0.076
 sip-0.075
Token they
Token ID484
Feature activation-2.98e+00

Pos logit contributions
 mastering+0.109
olving+0.107
stack+0.105
 lived+0.103
und+0.102
Neg logit contributions
 closer-0.048
 bite-0.043
 sigh-0.042
ax-0.041
awk-0.041
Token got
Token ID1392
Feature activation-2.77e+00

Pos logit contributions
five+0.115
 closer+0.113
awk+0.113
 sigh+0.111
iffs+0.110
Neg logit contributions
 lived-0.101
 mastering-0.090
olving-0.087
 tre-0.086
stack-0.085
Token angry
Token ID7954
Feature activation-2.20e+00

Pos logit contributions
hell+0.073
awk+0.073
five+0.072
ax+0.071
 sigh+0.070
Neg logit contributions
 mastering-0.120
olving-0.116
 lived-0.111
 tre-0.111
ric-0.108
Token and
Token ID290
Feature activation-3.88e+00

Pos logit contributions
 closer+0.043
 sigh+0.038
 bite+0.038
awk+0.036
ax+0.036
Neg logit contributions
 mastering-0.116
stack-0.115
olving-0.114
 lived-0.109
 Kang-0.109
Token started
Token ID2067
Feature activation+4.24e+00

Pos logit contributions
 closer+0.124
 sigh+0.116
 bite+0.116
awk+0.113
ax+0.112
Neg logit contributions
 mastering-0.157
olving-0.155
stack-0.150
 lived-0.144
und-0.144
Token to
Token ID284
Feature activation+1.19e+00

Pos logit contributions
 mastering+0.229
stack+0.228
olving+0.227
 lived+0.223
und+0.218
Neg logit contributions
 closer-0.081
 bite-0.074
 sigh-0.072
 sip-0.062
awk-0.061
Token quarrel
Token ID41998
Feature activation-1.03e+00

Pos logit contributions
 mastering+0.060
olving+0.059
stack+0.058
 Kang+0.056
ating+0.056
Neg logit contributions
 bite-0.027
 closer-0.027
 sigh-0.027
 sip-0.024
awk-0.023
Token.
Token ID13
Feature activation+2.66e-01

Pos logit contributions
 closer+0.029
 sigh+0.027
awk+0.027
 bite+0.027
ax+0.027
Neg logit contributions
 mastering-0.045
olving-0.044
stack-0.043
 lived-0.042
und-0.041
Token They
Token ID1119
Feature activation+2.97e+00

Pos logit contributions
 mastering+0.010
olving+0.010
stack+0.010
 lived+0.010
und+0.010
Neg logit contributions
 closer-0.009
 bite-0.008
awk-0.008
 sigh-0.008
hell-0.008
Token were
Token ID547
Feature activation+6.00e+00

Pos logit contributions
olving+0.133
 mastering+0.130
stack+0.126
 Kang+0.123
ating+0.123
Neg logit contributions
 closer-0.090
 bite-0.086
 sigh-0.084
ax-0.077
 sip-0.077
Token both
Token ID1111
Feature activation+9.89e-02

Pos logit contributions
 mastering+0.254
olving+0.249
stack+0.243
 lived+0.237
und+0.234
Neg logit contributions
 closer-0.180
 sigh-0.167
 bite-0.167
awk-0.164
ax-0.163
Token famous
Token ID5863
Feature activation+3.46e+00

Pos logit contributions
 mastering+0.035
olving+0.035
stack+0.033
 lived+0.033
 tre+0.032
Neg logit contributions
 closer-0.035
 sigh-0.033
awk-0.033
ls-0.032
 bite-0.032
Token mel
Token ID7758
Feature activation+3.09e+00

Pos logit contributions
 mastering+0.140
olving+0.137
stack+0.133
 lived+0.130
und+0.129
Neg logit contributions
 closer-0.110
 bite-0.104
 sigh-0.103
awk-0.101
ax-0.101
Tokenon
Token ID261
Feature activation-3.92e+00

Pos logit contributions
 mastering+0.142
olving+0.140
stack+0.137
 lived+0.134
und+0.132
Neg logit contributions
 closer-0.081
 bite-0.075
 sigh-0.074
ax-0.073
awk-0.073
Token around
Token ID1088
Feature activation-7.72e-01

Pos logit contributions
 closer+0.100
 bite+0.089
 sigh+0.087
awk+0.085
ax+0.083
Neg logit contributions
 mastering-0.188
olving-0.181
stack-0.179
 lived-0.175
und-0.173
Token.
Token ID13
Feature activation+3.89e-01

Pos logit contributions
 closer+0.018
 bite+0.017
 sigh+0.015
 sip+0.014
 slice+0.013
Neg logit contributions
olving-0.039
 mastering-0.039
 Kang-0.037
stack-0.037
ric-0.037
Token They
Token ID1119
Feature activation+3.16e+00

Pos logit contributions
 mastering+0.017
olving+0.017
stack+0.016
ating+0.015
 lived+0.015
Neg logit contributions
 closer-0.012
 sigh-0.011
 bite-0.011
ax-0.011
awk-0.010
Token pushed
Token ID7121
Feature activation-2.59e+00

Pos logit contributions
olving+0.146
 mastering+0.145
stack+0.138
umm+0.137
ric+0.136
Neg logit contributions
 closer-0.094
 bite-0.085
 sigh-0.081
bs-0.077
ax-0.076
Token it
Token ID340
Feature activation-8.87e+00

Pos logit contributions
ax+0.075
 sigh+0.074
awk+0.072
 burger+0.072
 closer+0.071
Neg logit contributions
 mastering-0.105
olving-0.101
und-0.099
 lived-0.097
stack-0.095
Token along
Token ID1863
Feature activation+3.68e+00

Pos logit contributions
 closer+0.196
 bite+0.171
 sigh+0.168
awk+0.156
ls+0.152
Neg logit contributions
 mastering-0.456
olving-0.443
stack-0.440
 lived-0.432
und-0.425
Token the
Token ID262
Feature activation+9.88e-01

Pos logit contributions
ric+0.188
stack+0.187
olving+0.186
 mastering+0.186
 Kang+0.183
Neg logit contributions
 closer-0.084
 bite-0.074
 harder-0.067
 sigh-0.067
 sip-0.059
Token ground
Token ID2323
Feature activation+4.67e+00

Pos logit contributions
 mastering+0.034
olving+0.033
stack+0.032
 lived+0.032
und+0.031
Neg logit contributions
 closer-0.037
 bite-0.035
 sigh-0.035
awk-0.035
ax-0.034
Token he
Token ID339
Feature activation-2.92e-01

Pos logit contributions
 mastering+0.119
olving+0.117
stack+0.113
ric+0.111
und+0.108
Neg logit contributions
 closer-0.051
ls-0.044
 bite-0.044
 sigh-0.043
ax-0.039
Token looked
Token ID3114
Feature activation-5.66e+00

Pos logit contributions
 closer+0.008
 sigh+0.008
 bite+0.008
awk+0.007
ax+0.007
Neg logit contributions
 mastering-0.013
olving-0.013
stack-0.012
und-0.012
 lived-0.012
Token there
Token ID612
Feature activation-2.76e+00

Pos logit contributions
 closer+0.181
 sigh+0.168
 bite+0.168
awk+0.165
ax+0.163
Neg logit contributions
 mastering-0.230
olving-0.225
stack-0.219
 lived-0.211
und-0.209
Token were
Token ID547
Feature activation+6.16e+00

Pos logit contributions
 closer+0.093
 bite+0.086
 look+0.083
 sigh+0.083
ls+0.082
Neg logit contributions
 mastering-0.116
olving-0.111
 former-0.111
arians-0.110
stack-0.107
Token tall
Token ID7331
Feature activation+1.69e+00

Pos logit contributions
 mastering+0.310
olving+0.304
stack+0.303
arians+0.283
�+0.282
Neg logit contributions
 closer-0.155
 sigh-0.133
ls-0.129
 bite-0.126
ax-0.117
Token trees
Token ID7150
Feature activation+5.17e+00

Pos logit contributions
stack+0.075
 mastering+0.075
olving+0.075
und+0.071
ric+0.071
Neg logit contributions
 sigh-0.048
 closer-0.048
ls-0.046
 bite-0.045
hell-0.042
Token with
Token ID351
Feature activation-8.27e-01

Pos logit contributions
 mastering+0.281
olving+0.275
stack+0.267
und+0.259
arians+0.258
Neg logit contributions
 closer-0.107
 bite-0.094
 sigh-0.092
ls-0.083
hell-0.077
Token deep
Token ID2769
Feature activation+1.14e+00

Pos logit contributions
 closer+0.019
 bite+0.016
 sigh+0.014
 sandwiches+0.013
 back+0.013
Neg logit contributions
 mastering-0.045
stack-0.044
arians-0.044
olving-0.043
�-0.043
Token roots
Token ID11135
Feature activation+3.46e+00

Pos logit contributions
 mastering+0.047
stack+0.046
olving+0.045
jack+0.041
ric+0.041
Neg logit contributions
 closer-0.038
ls-0.036
 sigh-0.035
 bite-0.035
awk-0.034
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
 mastering+0.171
olving+0.167
stack+0.162
 lived+0.158
und+0.156
Neg logit contributions
 closer-0.085
 bite-0.076
 sigh-0.075
ls-0.072
awk-0.069
Token As
Token ID1081
Feature activation+1.01e+00

Pos logit contributions
 mastering+0.012
olving+0.012
ating+0.012
stack+0.011
und+0.011
Neg logit contributions
 closer-0.008
 sigh-0.007
 bite-0.007
ax-0.007
 sip-0.006
Tokenmy
Token ID1820
Feature activation-2.97e+00

Pos logit contributions
 closer+0.124
 sigh+0.117
 bite+0.111
ax+0.111
awk+0.103
Neg logit contributions
 mastering-0.214
olving-0.210
stack-0.206
 lived-0.197
 Kang-0.197
Token smiled
Token ID13541
Feature activation-2.94e+00

Pos logit contributions
 closer+0.076
 sigh+0.072
 bite+0.069
ax+0.067
awk+0.062
Neg logit contributions
 mastering-0.141
olving-0.140
stack-0.135
und-0.131
 lived-0.130
Token and
Token ID290
Feature activation-3.96e+00

Pos logit contributions
 closer+0.068
 bite+0.068
awk+0.068
ax+0.066
 sigh+0.065
Neg logit contributions
 mastering-0.137
olving-0.134
 lived-0.129
stack-0.129
und-0.126
Token she
Token ID673
Feature activation-1.11e+00

Pos logit contributions
 closer+0.079
 bite+0.070
 sigh+0.070
awk+0.069
ax+0.068
Neg logit contributions
 mastering-0.207
olving-0.203
stack-0.199
 lived-0.196
und-0.193
Token said
Token ID531
Feature activation-3.15e+00

Pos logit contributions
 closer+0.025
 sigh+0.023
 bite+0.023
ax+0.022
awk+0.022
Neg logit contributions
 mastering-0.055
olving-0.054
stack-0.053
 lived-0.052
und-0.051
Token,
Token ID11
Feature activation+2.09e+00

Pos logit contributions
 closer+0.114
 sigh+0.106
 bite+0.103
ls+0.101
ax+0.101
Neg logit contributions
 mastering-0.117
olving-0.114
stack-0.112
ric-0.107
und-0.107
Token "
Token ID366
Feature activation+4.98e+00

Pos logit contributions
olving+0.083
 mastering+0.083
und+0.079
stack+0.078
 situations+0.077
Neg logit contributions
 closer-0.069
 sigh-0.065
 bite-0.061
ax-0.059
 slice-0.058
TokenIt
Token ID1026
Feature activation-3.25e+00

Pos logit contributions
 mastering+0.206
olving+0.201
stack+0.199
 lived+0.192
und+0.184
Neg logit contributions
 closer-0.159
ax-0.148
 sigh-0.141
awk-0.139
 bite-0.139
Token doesn
Token ID1595
Feature activation-2.70e+00

Pos logit contributions
 closer+0.080
 sigh+0.073
 bite+0.070
ls+0.066
awk+0.066
Neg logit contributions
 mastering-0.160
stack-0.156
olving-0.155
und-0.148
 Kang-0.146
Token't
Token ID470
Feature activation-4.12e+00

Pos logit contributions
awk+0.112
 closer+0.111
ax+0.105
 slice+0.105
five+0.103
Neg logit contributions
 mastering-0.081
olving-0.077
stack-0.073
 lived-0.073
und-0.070
Token matter
Token ID2300
Feature activation+3.26e+00

Pos logit contributions
 closer+0.151
awk+0.143
ax+0.142
 sigh+0.142
ls+0.141
Neg logit contributions
 mastering-0.143
olving-0.140
 lived-0.134
stack-0.133
und-0.128
Token frightened
Token ID24776
Feature activation+1.13e+00

Pos logit contributions
 mastering+0.069
olving+0.067
stack+0.066
 lived+0.065
und+0.064
Neg logit contributions
 closer-0.042
 sigh-0.039
 bite-0.038
awk-0.038
ax-0.037
Token by
Token ID416
Feature activation+8.41e+00

Pos logit contributions
 mastering+0.059
olving+0.058
stack+0.056
ric+0.054
ating+0.053
Neg logit contributions
 closer-0.027
 sigh-0.024
 bite-0.023
awk-0.022
ls-0.021
Token the
Token ID262
Feature activation+6.34e-01

Pos logit contributions
olving+0.357
 mastering+0.356
stack+0.349
jack+0.344
ating+0.338
Neg logit contributions
 closer-0.260
 bite-0.246
 sigh-0.236
 sip-0.211
awk-0.207
Token wild
Token ID4295
Feature activation+1.87e+00

Pos logit contributions
 mastering+0.026
olving+0.025
und+0.024
 lived+0.024
ating+0.023
Neg logit contributions
 closer-0.023
 bite-0.021
 sigh-0.020
awk-0.019
 sandwich-0.018
Token girl
Token ID2576
Feature activation+3.45e+00

Pos logit contributions
 mastering+0.081
olving+0.075
und+0.073
 completing+0.070
ating+0.069
Neg logit contributions
 closer-0.066
 bite-0.061
awk-0.057
 sigh-0.056
 yawn-0.055
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
 mastering+0.172
olving+0.171
stack+0.162
ating+0.161
jack+0.160
Neg logit contributions
 closer-0.085
 bite-0.077
 sigh-0.075
awk-0.067
ax-0.066
Token
Token ID198
Feature activation-7.93e-01

Pos logit contributions
 closer+0.003
 sigh+0.003
ax+0.003
 Anna+0.002
awk+0.002
Neg logit contributions
olving-0.005
 mastering-0.005
ating-0.005
 tre-0.004
stack-0.004
Token
Token ID198
Feature activation-7.52e-01

Pos logit contributions
 closer+0.024
 bite+0.023
awk+0.023
 sigh+0.023
ax+0.022
Neg logit contributions
 mastering-0.032
olving-0.031
stack-0.031
 lived-0.031
und-0.030
TokenThe
Token ID464
Feature activation+5.13e+00

Pos logit contributions
 closer+0.024
 sigh+0.022
 bite+0.022
ax+0.022
awk+0.021
Neg logit contributions
olving-0.031
 mastering-0.031
stack-0.029
ric-0.029
 lived-0.028
Token wild
Token ID4295
Feature activation+2.10e+00

Pos logit contributions
 mastering+0.214
und+0.207
olving+0.207
stack+0.201
 lived+0.196
Neg logit contributions
 closer-0.174
 sigh-0.155
 bite-0.155
awk-0.148
ax-0.141
Token girl
Token ID2576
Feature activation+3.63e+00

Pos logit contributions
 mastering+0.091
und+0.086
olving+0.085
ric+0.083
stack+0.081
Neg logit contributions
 closer-0.072
awk-0.064
 bite-0.064
 sigh-0.061
ls-0.060
Token
Token ID198
Feature activation-9.70e-01

Pos logit contributions
 mastering+0.007
olving+0.007
ating+0.006
stack+0.006
 lived+0.006
Neg logit contributions
 closer-0.004
ax-0.004
 sigh-0.004
five-0.004
 bite-0.004
Token
Token ID198
Feature activation-9.77e-01

Pos logit contributions
awk+0.035
 closer+0.035
 bite+0.033
 sigh+0.033
ax+0.032
Neg logit contributions
 mastering-0.034
 lived-0.034
stack-0.033
olving-0.032
und-0.032
TokenIt
Token ID1026
Feature activation-3.31e+00

Pos logit contributions
 closer+0.033
ax+0.030
 bite+0.030
 sigh+0.030
hell+0.029
Neg logit contributions
olving-0.038
 mastering-0.038
stack-0.037
ric-0.035
 lived-0.035
Token turned
Token ID2900
Feature activation-4.55e+00

Pos logit contributions
 closer+0.137
 sigh+0.127
 bite+0.125
ls+0.121
 sip+0.115
Neg logit contributions
 mastering-0.112
olving-0.108
stack-0.103
ric-0.099
 perspective-0.098
Token out
Token ID503
Feature activation-5.01e+00

Pos logit contributions
 closer+0.099
 bite+0.096
awk+0.096
 sigh+0.095
ax+0.094
Neg logit contributions
 mastering-0.220
olving-0.218
stack-0.211
 lived-0.210
und-0.207
Token that
Token ID326
Feature activation-3.30e+00

Pos logit contributions
 closer+0.131
 sigh+0.111
 bite+0.105
ls+0.102
ax+0.097
Neg logit contributions
 mastering-0.248
olving-0.242
stack-0.242
ric-0.237
 lived-0.232
Token the
Token ID262
Feature activation+7.12e-01

Pos logit contributions
 closer+0.088
 sigh+0.075
 bite+0.075
ls+0.071
ax+0.068
Neg logit contributions
 mastering-0.160
olving-0.155
ric-0.150
stack-0.149
und-0.146
Token mighty
Token ID18680
Feature activation+3.21e+00

Pos logit contributions
 mastering+0.026
olving+0.025
stack+0.025
 lived+0.024
und+0.023
Neg logit contributions
 closer-0.025
 bite-0.024
awk-0.024
 sigh-0.024
ax-0.024
Token lake
Token ID13546
Feature activation+5.17e+00

Pos logit contributions
 mastering+0.134
olving+0.132
stack+0.129
 lived+0.128
und+0.123
Neg logit contributions
 closer-0.096
 bite-0.090
 sigh-0.090
awk-0.089
ax-0.089
Token was
Token ID373
Feature activation+4.53e+00

Pos logit contributions
 mastering+0.192
olving+0.181
und+0.168
ric+0.168
 Kang+0.166
Neg logit contributions
 closer-0.198
 sigh-0.186
 bite-0.183
ls-0.174
 sandwich-0.171
Token home
Token ID1363
Feature activation+1.25e+00

Pos logit contributions
 mastering+0.206
olving+0.200
stack+0.199
und+0.196
 Kang+0.191
Neg logit contributions
 closer-0.133
 bite-0.115
 sigh-0.113
ax-0.105
 slice-0.102
Token of
Token ID286
Feature activation+3.70e+00

Pos logit contributions
 closer+0.025
ax+0.022
 bite+0.022
 sigh+0.022
ls+0.020
Neg logit contributions
stack-0.058
olving-0.057
 mastering-0.057
ric-0.056
 lived-0.055
Token himself
Token ID2241
Feature activation+3.04e+00

Pos logit contributions
olving+0.187
 Kang+0.186
arians+0.185
 Australian+0.183
�+0.182
Neg logit contributions
 closer-0.087
 sigh-0.076
 it-0.073
 bite-0.068
 harder-0.065
Token and
Token ID290
Feature activation-3.74e+00

Pos logit contributions
olving+0.141
 mastering+0.140
stack+0.139
 lived+0.135
und+0.135
Neg logit contributions
 closer-0.081
 sigh-0.072
 bite-0.071
ax-0.070
ls-0.067
Token let
Token ID1309
Feature activation-3.87e+00

Pos logit contributions
 closer+0.094
 sigh+0.082
 bite+0.081
ax+0.077
ls+0.072
Neg logit contributions
olving-0.186
 mastering-0.178
stack-0.177
ric-0.176
und-0.175
Token out
Token ID503
Feature activation-4.28e+00

Pos logit contributions
 closer+0.107
 sigh+0.098
 bite+0.098
awk+0.097
ax+0.096
Neg logit contributions
 mastering-0.174
olving-0.171
stack-0.167
 lived-0.163
und-0.161
Token a
Token ID257
Feature activation-3.62e+00

Pos logit contributions
 closer+0.097
 sigh+0.089
awk+0.089
ax+0.089
 bite+0.088
Neg logit contributions
 mastering-0.207
olving-0.203
 lived-0.198
stack-0.195
und-0.193
Token big
Token ID1263
Feature activation-8.06e-01

Pos logit contributions
 closer+0.090
 bite+0.082
 sigh+0.082
awk+0.077
ax+0.076
Neg logit contributions
 mastering-0.175
olving-0.173
stack-0.168
 lived-0.165
und-0.164
Token roar
Token ID34532
Feature activation-5.81e-01

Pos logit contributions
 closer+0.019
ax+0.017
awk+0.017
 sigh+0.016
 bite+0.016
Neg logit contributions
 mastering-0.039
olving-0.038
stack-0.038
 lived-0.037
und-0.036
Token of
Token ID286
Feature activation+3.71e+00

Pos logit contributions
 closer+0.017
 bite+0.015
 sigh+0.015
ax+0.014
awk+0.014
Neg logit contributions
 mastering-0.026
olving-0.026
stack-0.025
 lived-0.025
und-0.024
Token happiness
Token ID12157
Feature activation-8.36e-01

Pos logit contributions
olving+0.171
 mastering+0.170
stack+0.162
 lived+0.158
und+0.158
Neg logit contributions
 closer-0.103
 bite-0.097
 sigh-0.095
 sip-0.086
ax-0.085
Token.
Token ID13
Feature activation+5.68e-01

Pos logit contributions
 closer+0.021
 bite+0.019
 sigh+0.018
ax+0.017
 sandwich+0.016
Neg logit contributions
 mastering-0.042
olving-0.041
stack-0.040
 lived-0.038
 rarely-0.038
Token on
Token ID319
Feature activation+1.21e+00

Pos logit contributions
 mastering+0.079
olving+0.078
stack+0.077
ric+0.074
und+0.074
Neg logit contributions
 closer-0.054
 bite-0.049
 sigh-0.049
ls-0.045
hell-0.045
Token the
Token ID262
Feature activation+8.78e-01

Pos logit contributions
olving+0.060
stack+0.060
 Kang+0.059
 mastering+0.059
jack+0.058
Neg logit contributions
 closer-0.027
 bite-0.026
 sigh-0.024
 harder-0.022
 sip-0.021
Token floor
Token ID4314
Feature activation+3.17e+00

Pos logit contributions
 mastering+0.033
olving+0.032
stack+0.031
 lived+0.031
 tre+0.030
Neg logit contributions
 closer-0.031
awk-0.030
ax-0.029
 sigh-0.029
 bite-0.028
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
 Kang+0.159
stack+0.158
 mastering+0.157
 lived+0.155
 Australian+0.154
Neg logit contributions
 closer-0.071
 bite-0.064
 sigh-0.060
hell-0.055
 slice-0.055
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
 mastering+0.008
olving+0.008
stack+0.008
 lived+0.008
und+0.007
Neg logit contributions
 closer-0.007
 sigh-0.007
 bite-0.007
awk-0.007
ax-0.007
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
awk+0.039
 closer+0.039
 bite+0.038
 sigh+0.038
 burger+0.037
Neg logit contributions
 mastering-0.035
 lived-0.035
olving-0.034
und-0.033
stack-0.033
TokenSarah
Token ID29284
Feature activation-6.21e-01

Pos logit contributions
 closer+0.036
 sigh+0.033
 bite+0.033
ax+0.033
awk+0.033
Neg logit contributions
 mastering-0.040
olving-0.039
stack-0.038
 lived-0.037
und-0.036
Token was
Token ID373
Feature activation+4.56e+00

Pos logit contributions
 closer+0.018
 sigh+0.018
 bite+0.018
ax+0.017
 yawn+0.016
Neg logit contributions
olving-0.026
stack-0.026
 mastering-0.026
 Kang-0.024
ric-0.024
Token so
Token ID523
Feature activation-4.80e-01

Pos logit contributions
olving+0.210
stack+0.208
und+0.204
 mastering+0.202
 lived+0.198
Neg logit contributions
 closer-0.124
 sigh-0.111
 bite-0.108
ax-0.106
 slice-0.097
Token busy
Token ID8179
Feature activation+4.26e-01

Pos logit contributions
 closer+0.010
awk+0.010
 bite+0.009
 sigh+0.009
ls+0.009
Neg logit contributions
 mastering-0.025
olving-0.024
 lived-0.023
stack-0.023
und-0.022
Token playing
Token ID2712
Feature activation+1.74e+00

Pos logit contributions
stack+0.016
 Kang+0.015
 lived+0.015
jack+0.015
ric+0.015
Neg logit contributions
 closer-0.015
 sigh-0.015
 bite-0.015
 sip-0.013
 harder-0.013
Token fountain
Token ID29420
Feature activation+1.07e+00

Pos logit contributions
 closer+0.078
awk+0.071
 sigh+0.071
ax+0.070
 bite+0.070
Neg logit contributions
 mastering-0.192
olving-0.189
stack-0.186
 lived-0.183
und-0.180
Token with
Token ID351
Feature activation-1.08e+00

Pos logit contributions
 mastering+0.044
olving+0.043
stack+0.043
 lived+0.042
und+0.041
Neg logit contributions
 closer-0.032
 sigh-0.030
awk-0.030
 bite-0.030
ax-0.030
Token water
Token ID1660
Feature activation-3.16e-01

Pos logit contributions
awk+0.028
 closer+0.027
 sigh+0.027
ax+0.027
 bite+0.025
Neg logit contributions
 lived-0.046
 mastering-0.046
olving-0.045
stack-0.045
 tre-0.045
Token.
Token ID13
Feature activation+3.44e-01

Pos logit contributions
 closer+0.007
 bite+0.006
 sigh+0.006
ls+0.006
ax+0.006
Neg logit contributions
 mastering-0.017
olving-0.016
stack-0.015
 lived-0.015
 tre-0.015
Token They
Token ID1119
Feature activation+2.90e+00

Pos logit contributions
 lived+0.010
und+0.009
stack+0.009
 Kang+0.009
 mastering+0.009
Neg logit contributions
awk-0.015
 closer-0.014
 bite-0.014
hell-0.014
ax-0.013
Token ran
Token ID4966
Feature activation-1.99e+00

Pos logit contributions
 lived+0.096
 mastering+0.096
stack+0.091
olving+0.091
und+0.088
Neg logit contributions
 closer-0.107
awk-0.105
 sigh-0.102
 bite-0.102
ls-0.100
Token to
Token ID284
Feature activation+1.25e+00

Pos logit contributions
hell+0.070
ax+0.066
awk+0.066
 sandwich+0.065
 slice+0.065
Neg logit contributions
stack-0.068
 mastering-0.066
olving-0.063
 tre-0.062
jack-0.061
Token the
Token ID262
Feature activation+9.98e-01

Pos logit contributions
 lived+0.033
stack+0.031
 Sc+0.031
 mastering+0.029
ric+0.029
Neg logit contributions
hell-0.058
ls-0.056
awk-0.052
ives-0.051
ache-0.051
Token fountain
Token ID29420
Feature activation+1.03e+00

Pos logit contributions
stack+0.035
 lived+0.034
 mastering+0.033
olving+0.033
 forest+0.032
Neg logit contributions
awk-0.035
ax-0.035
 closer-0.034
 sigh-0.034
 bite-0.033
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
stack+0.048
 mastering+0.047
olving+0.047
 lived+0.046
und+0.045
Neg logit contributions
 closer-0.024
awk-0.024
ax-0.023
 sigh-0.023
 bite-0.022
Token
Token ID198
Feature activation-7.54e-01

Pos logit contributions
 lived+0.010
stack+0.009
und+0.009
 Kang+0.009
 mastering+0.009
Neg logit contributions
awk-0.015
hell-0.015
 closer-0.014
ax-0.014
 bite-0.014
Tokenines
Token ID1127
Feature activation+2.10e+00

Pos logit contributions
 mastering+0.010
olving+0.009
und+0.009
 lived+0.009
ric+0.009
Neg logit contributions
 closer-0.010
 bite-0.010
awk-0.010
ls-0.010
 sandwich-0.009
Token he
Token ID339
Feature activation-5.98e-01

Pos logit contributions
 mastering+0.106
 Kang+0.105
olving+0.104
stack+0.102
 perspective+0.102
Neg logit contributions
 closer-0.047
 sigh-0.044
 bite-0.042
ls-0.038
hell-0.037
Token is
Token ID318
Feature activation+2.59e+00

Pos logit contributions
 closer+0.018
 sigh+0.017
 bite+0.017
 falls+0.016
awk+0.015
Neg logit contributions
stack-0.026
 mastering-0.026
olving-0.026
 Kang-0.024
jack-0.024
Token one
Token ID530
Feature activation-7.17e+00

Pos logit contributions
stack+0.116
olving+0.115
und+0.115
 mastering+0.114
 perspective+0.111
Neg logit contributions
 closer-0.074
 sigh-0.062
 bite-0.061
ax-0.061
 back-0.058
Token of
Token ID286
Feature activation+3.77e+00

Pos logit contributions
 closer+0.209
 bite+0.178
 slice+0.167
 sandwich+0.164
 burger+0.158
Neg logit contributions
olving-0.336
und-0.334
 mastering-0.333
stack-0.326
ric-0.324
Token them
Token ID606
Feature activation-6.01e+00

Pos logit contributions
 mastering+0.179
olving+0.177
stack+0.174
und+0.172
 lived+0.170
Neg logit contributions
 closer-0.098
 bite-0.088
 sigh-0.085
ax-0.079
 sip-0.075
Token.
Token ID13
Feature activation+4.38e-01

Pos logit contributions
 closer+0.124
 bite+0.097
 sigh+0.093
 back+0.084
 yawn+0.081
Neg logit contributions
stack-0.324
 mastering-0.322
olving-0.320
und-0.318
ric-0.310
Token He
Token ID679
Feature activation+5.03e+00

Pos logit contributions
 mastering+0.018
olving+0.017
stack+0.017
ating+0.016
 lived+0.016
Neg logit contributions
 closer-0.014
 sigh-0.013
ax-0.013
 bite-0.012
awk-0.012
Token puts
Token ID7584
Feature activation-3.82e+00

Pos logit contributions
 mastering+0.230
olving+0.227
stack+0.223
und+0.213
 lived+0.213
Neg logit contributions
 closer-0.134
 sigh-0.126
 bite-0.124
awk-0.119
ax-0.118
Token on
Token ID319
Feature activation+1.71e+00

Pos logit contributions
 sigh+0.107
awk+0.107
 closer+0.104
ax+0.103
 bite+0.103
Neg logit contributions
 mastering-0.152
 lived-0.151
stack-0.150
olving-0.148
und-0.147
Token a
Token ID257
Feature activation-3.75e+00

Pos logit contributions
 mastering+0.063
olving+0.062
stack+0.060
und+0.059
 lived+0.059
Neg logit contributions
 closer-0.057
awk-0.056
ax-0.055
 sigh-0.054
 bite-0.053
Token a
Token ID257
Feature activation-4.12e+00

Pos logit contributions
 mastering+0.198
olving+0.197
stack+0.193
und+0.188
 lived+0.186
Neg logit contributions
 closer-0.099
 bite-0.090
 sigh-0.089
awk-0.084
ax-0.084
Token naughty
Token ID45128
Feature activation+2.08e+00

Pos logit contributions
 closer+0.091
 bite+0.083
 sigh+0.082
awk+0.076
ls+0.072
Neg logit contributions
 mastering-0.211
olving-0.208
stack-0.204
und-0.198
 lived-0.198
Token dragon
Token ID10441
Feature activation+1.60e+00

Pos logit contributions
 mastering+0.088
olving+0.086
stack+0.084
 lived+0.082
und+0.081
Neg logit contributions
 closer-0.063
 sigh-0.058
 bite-0.058
awk-0.058
ax-0.057
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
 mastering+0.080
olving+0.077
stack+0.074
ric+0.072
und+0.072
Neg logit contributions
 closer-0.039
 bite-0.037
 sigh-0.035
awk-0.034
ls-0.033
Token You
Token ID921
Feature activation+1.71e-01

Pos logit contributions
 mastering+0.013
olving+0.013
 perspective+0.012
ating+0.012
stack+0.012
Neg logit contributions
five-0.010
awk-0.009
 sigh-0.009
 closer-0.009
 bite-0.009
Token lied
Token ID19837
Feature activation-6.98e+00

Pos logit contributions
 mastering+0.007
olving+0.007
stack+0.007
und+0.006
 Kang+0.006
Neg logit contributions
 closer-0.006
 bite-0.005
 sigh-0.005
awk-0.005
ax-0.005
Token to
Token ID284
Feature activation+1.12e+00

Pos logit contributions
 closer+0.152
 sigh+0.139
 bite+0.138
awk+0.135
ax+0.135
Neg logit contributions
 mastering-0.346
olving-0.342
stack-0.334
 lived-0.330
und-0.326
Token us
Token ID514
Feature activation-1.25e+00

Pos logit contributions
 mastering+0.048
olving+0.048
stack+0.046
 lived+0.044
und+0.043
Neg logit contributions
 closer-0.033
 bite-0.033
 sigh-0.032
awk-0.031
ax-0.030
Token.
Token ID13
Feature activation+2.90e-01

Pos logit contributions
 closer+0.032
 sigh+0.029
 bite+0.029
awk+0.029
ax+0.028
Neg logit contributions
 mastering-0.058
olving-0.057
stack-0.056
 lived-0.055
und-0.054
Token You
Token ID921
Feature activation+1.13e-01

Pos logit contributions
 mastering+0.012
olving+0.011
 perspective+0.011
ating+0.011
stack+0.011
Neg logit contributions
awk-0.009
 closer-0.009
 sigh-0.009
five-0.009
 bite-0.009
Token are
Token ID389
Feature activation+3.99e+00

Pos logit contributions
 mastering+0.005
olving+0.004
stack+0.004
und+0.004
 lived+0.004
Neg logit contributions
 closer-0.004
 bite-0.004
 sigh-0.004
awk-0.003
ax-0.003
Token got
Token ID1392
Feature activation-2.92e+00

Pos logit contributions
 mastering+0.186
olving+0.185
stack+0.178
und+0.169
 lived+0.168
Neg logit contributions
 closer-0.159
 sigh-0.150
 bite-0.149
ax-0.145
awk-0.143
Token his
Token ID465
Feature activation+1.36e+00

Pos logit contributions
awk+0.071
 closer+0.069
 bite+0.069
 sigh+0.068
ax+0.067
Neg logit contributions
 mastering-0.134
olving-0.129
stack-0.124
 lived-0.123
und-0.122
Token turn
Token ID1210
Feature activation-3.51e+00

Pos logit contributions
 mastering+0.057
olving+0.055
stack+0.054
 lived+0.053
und+0.052
Neg logit contributions
 closer-0.042
 bite-0.039
 sigh-0.039
awk-0.038
ax-0.037
Token and
Token ID290
Feature activation-4.04e+00

Pos logit contributions
 closer+0.084
 sigh+0.077
 bite+0.077
awk+0.076
ax+0.075
Neg logit contributions
 mastering-0.167
olving-0.164
stack-0.160
 lived-0.157
und-0.155
Token he
Token ID339
Feature activation-8.83e-01

Pos logit contributions
 closer+0.108
awk+0.105
 sigh+0.102
ax+0.101
 bite+0.101
Neg logit contributions
 mastering-0.178
olving-0.172
stack-0.170
 lived-0.170
und-0.165
Token threw
Token ID9617
Feature activation-6.47e+00

Pos logit contributions
 closer+0.027
 sigh+0.026
 bite+0.026
awk+0.025
ax+0.025
Neg logit contributions
 mastering-0.036
olving-0.036
stack-0.035
 lived-0.034
und-0.033
Token the
Token ID262
Feature activation+8.03e-01

Pos logit contributions
awk+0.183
 closer+0.183
ax+0.181
 sigh+0.180
ls+0.176
Neg logit contributions
 mastering-0.266
olving-0.256
stack-0.250
 lived-0.248
und-0.246
Token light
Token ID1657
Feature activation-5.30e-01

Pos logit contributions
 mastering+0.033
olving+0.031
stack+0.031
und+0.030
 lived+0.030
Neg logit contributions
 closer-0.027
 bite-0.025
 sigh-0.024
awk-0.024
 sip-0.023
Token really
Token ID1107
Feature activation-7.77e-01

Pos logit contributions
 closer+0.013
 bite+0.012
 sigh+0.012
awk+0.011
ls+0.011
Neg logit contributions
 mastering-0.026
olving-0.025
stack-0.025
 lived-0.024
und-0.024
Token high
Token ID1029
Feature activation+3.83e-01

Pos logit contributions
 closer+0.017
ax+0.017
 sigh+0.016
awk+0.016
 bite+0.016
Neg logit contributions
 mastering-0.037
olving-0.036
 lived-0.035
und-0.035
stack-0.035
Token.
Token ID13
Feature activation+1.20e-01

Pos logit contributions
 mastering+0.019
olving+0.018
stack+0.018
 lived+0.018
ric+0.017
Neg logit contributions
 closer-0.010
 bite-0.009
ls-0.009
 sigh-0.008
awk-0.008
Token go
Token ID467
Feature activation-2.96e+00

Pos logit contributions
 mastering+0.053
olving+0.052
stack+0.050
 lived+0.050
und+0.048
Neg logit contributions
 closer-0.032
 sigh-0.030
awk-0.030
 bite-0.029
ax-0.029
Token outside
Token ID2354
Feature activation-1.48e+00

Pos logit contributions
hell+0.098
ax+0.095
awk+0.087
 slice+0.087
 sigh+0.086
Neg logit contributions
 mastering-0.111
 forest-0.110
 lived-0.105
olving-0.103
 solving-0.102
Token and
Token ID290
Feature activation-3.95e+00

Pos logit contributions
 closer+0.022
 bite+0.018
 sigh+0.017
 back+0.013
 sip+0.013
Neg logit contributions
olving-0.088
 mastering-0.087
stack-0.086
ric-0.085
 lived-0.084
Token play
Token ID711
Feature activation+1.31e+00

Pos logit contributions
 closer+0.121
 bite+0.112
 sigh+0.111
awk+0.105
ax+0.103
Neg logit contributions
 mastering-0.169
olving-0.167
stack-0.160
und-0.158
 lived-0.155
Token but
Token ID475
Feature activation+7.05e-01

Pos logit contributions
 mastering+0.063
olving+0.062
stack+0.061
ric+0.061
und+0.058
Neg logit contributions
 closer-0.034
 bite-0.031
 sigh-0.030
 sip-0.029
awk-0.028
Token it
Token ID340
Feature activation-9.15e+00

Pos logit contributions
und+0.037
 mastering+0.037
olving+0.037
ric+0.036
arians+0.036
Neg logit contributions
 closer-0.016
 sigh-0.014
 bite-0.013
 back-0.012
 yawn-0.012
Token was
Token ID373
Feature activation+4.67e+00

Pos logit contributions
 closer+0.352
 bite+0.338
 sigh+0.326
 sip+0.304
 harder+0.303
Neg logit contributions
 mastering-0.349
olving-0.334
stack-0.326
arians-0.319
ric-0.319
Token raining
Token ID43079
Feature activation+1.69e+00

Pos logit contributions
 mastering+0.230
olving+0.227
stack+0.225
und+0.220
 lived+0.215
Neg logit contributions
 closer-0.113
 bite-0.103
 sigh-0.101
awk-0.090
ax-0.090
Token.
Token ID13
Feature activation+3.05e-01

Pos logit contributions
 mastering+0.069
olving+0.068
stack+0.066
 lived+0.065
und+0.064
Neg logit contributions
 closer-0.052
awk-0.049
 sigh-0.048
ax-0.048
 bite-0.048
Token She
Token ID1375
Feature activation+4.08e+00

Pos logit contributions
 mastering+0.013
olving+0.013
ating+0.012
stack+0.012
ric+0.012
Neg logit contributions
 closer-0.009
 sigh-0.009
 bite-0.008
awk-0.008
ax-0.008
Token looked
Token ID3114
Feature activation-6.02e+00

Pos logit contributions
 mastering+0.161
olving+0.159
stack+0.154
und+0.146
 lived+0.144
Neg logit contributions
 closer-0.138
 sigh-0.129
 bite-0.128
awk-0.124
ax-0.124
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
 closer+0.047
awk+0.045
 bite+0.045
 sandwich+0.044
ax+0.044
Neg logit contributions
 mastering-0.034
 lived-0.034
olving-0.033
und-0.033
stack-0.032
TokenMom
Token ID29252
Feature activation-4.79e+00

Pos logit contributions
 closer+0.038
 bite+0.035
 sigh+0.035
awk+0.035
ax+0.034
Neg logit contributions
 mastering-0.046
olving-0.045
stack-0.044
 lived-0.043
und-0.042
Token and
Token ID290
Feature activation-4.22e+00

Pos logit contributions
awk+0.157
 ours+0.152
 closer+0.142
irt+0.140
ls+0.139
Neg logit contributions
 mastering-0.180
 lived-0.178
olving-0.170
jack-0.170
stack-0.168
Token Dad
Token ID17415
Feature activation-5.80e+00

Pos logit contributions
hell+0.125
awk+0.112
ls+0.105
 closer+0.101
irt+0.094
Neg logit contributions
 lived-0.176
 mastering-0.175
 Zo-0.171
stack-0.170
jack-0.165
Token hugged
Token ID41130
Feature activation-5.12e+00

Pos logit contributions
awk+0.197
 closer+0.192
ls+0.185
 bite+0.182
ax+0.180
Neg logit contributions
 mastering-0.212
 lived-0.204
olving-0.198
stack-0.196
und-0.191
Token them
Token ID606
Feature activation-6.44e+00

Pos logit contributions
ax+0.149
awk+0.148
hell+0.147
 closer+0.146
ls+0.143
Neg logit contributions
 mastering-0.192
 lived-0.188
olving-0.186
 Zo-0.184
und-0.184
Token and
Token ID290
Feature activation-4.25e+00

Pos logit contributions
awk+0.209
ax+0.201
ls+0.189
 bite+0.186
hell+0.184
Neg logit contributions
 mastering-0.223
stack-0.210
 tre-0.209
 lived-0.208
jack-0.208
Token gave
Token ID2921
Feature activation-4.19e+00

Pos logit contributions
 closer+0.104
awk+0.101
ax+0.097
 bite+0.095
hell+0.094
Neg logit contributions
 mastering-0.199
olving-0.191
 lived-0.191
stack-0.190
und-0.182
Token them
Token ID606
Feature activation-6.45e+00

Pos logit contributions
hell+0.131
awk+0.129
ax+0.128
 sigh+0.128
ls+0.123
Neg logit contributions
 lived-0.146
und-0.137
 Zo-0.135
olving-0.135
stack-0.135
Token some
Token ID617
Feature activation-6.14e+00

Pos logit contributions
awk+0.157
ax+0.156
 sigh+0.156
 closer+0.156
ls+0.154
Neg logit contributions
olving-0.263
 lived-0.263
 mastering-0.260
 tre-0.258
 Zo-0.255
Token cookies
Token ID14746
Feature activation-7.43e-01

Pos logit contributions
 closer+0.151
 sigh+0.138
 bite+0.137
awk+0.135
ax+0.135
Neg logit contributions
 mastering-0.290
olving-0.285
stack-0.278
 lived-0.274
und-0.269