TOP ACTIVATIONS
MAX = 0.342

soon nothing but a small spe ck in the sky .
. They were blue and spe ck led . L ila
away , a bright blue spe ck in the sky .
The eggs were blue and spe ck led . L ila
The eggs were blue and spe ck led . They looked
of the frog . He spe ared the lazy frog and
John chose some carrots and cuc umbers to take home .
They were filled with fresh cuc umber and cheese . It
like lettuce , tomatoes and cuc umbers . He smiled as
garden . They cut the cuc umber and put it in
cut the lettuce and the cuc umber . She said he
. And this is a cuc umber . It is green
saw carrots , potatoes , cuc umbers and tomatoes . He
house . They wash the cuc umber and cut it into
. You found a nice cuc umber . It will be
thank you . We love cuc umbers . And we love
happy to see our big cuc umber ." Lily says .
the bowl . Lucy put cuc umbers , tomatoes and some
tomato , and then a cuc umber . Lily and Tom
has lettuce and tomatoes and cuc umbers . And it has

MIN ACTIVATIONS
MIN = -0.207

two squirrel s kept qu arre ling until finally Sammy said
friends didn 't stop qu arre ling and were getting louder
and saw two birds qu arre ling ! They were squ
walking by saw them qu arre ling . She was very
but today they were qu arre ling . Jenna wanted to
mum and understood that qu arre lling was not a good
on , they never qu arre led again over clothes .
together . They never qu arre led again . <|endoftext|> Once
and their children , qu arre ling .
he found a rabbit qu arre lling with a squirrel .
. The bears stopped qu arre ling and asked him for
Finally , after much qu arre ling , one of the
said , " Stop qu arre ling , boys . You
, why are you qu arre ling ?" But the birds
Lily and Jack never qu arre led again . They learned
told them to stop qu arre lling and to share the
she was sorry for qu arre ling . From that day
Tom and Mia stop qu arre ling . They see the
They learn that qu arre ling is bad and sharing
on , they never qu arre led again and played together

INTERVAL 0.205 - 0.342
CONTAINS 0.015%

replied . Fl umm ox ed , the little
four more times ," the gen ie said , " I
and grab the ju ic iest fruit . One day she
the biggest , fl uff iest do ggy he had ever
as he opened the ch ocol ates and started eating them

INTERVAL 0.067 - 0.205
CONTAINS 7.718%

to the shop and see if they have any diamonds available
like so much fun ! ” So she
. The witch gave Tom some candy , and
He was so disappointed and he sadly had to go home
go to the doctor now ," Mom said .

INTERVAL -0.070 - 0.067
CONTAINS 86.510%

Max . Max had a long tail and loved to w
He walked closer to find out and , suddenly
You might break it or hurt yourselves ," she says .
’ t be fixed . The little girl
From then on , Tim my learned that sometimes the simple

INTERVAL -0.207 - -0.070
CONTAINS 5.757%

some boxes . She hopped up on top of them and
. The two birds flew together , and the
there was a monkey named Max . Max lived in a
to wash your face ." Spot did not like water ,
He walked up to the pond , getting closer and closer
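
The four INTERVAL bounds listed above are consistent with equal-width bins spanning MIN = -0.207 to MAX = 0.342, and the four CONTAINS percentages sum to 100%. The following is a minimal sketch under that assumption, not the tool's actual code: the function names `equal_width_edges` and `bin_shares` are hypothetical, and the sample activations are made up for illustration. Note that the sketch orders bins from lowest to highest, whereas the listing above runs from highest to lowest.

```python
def equal_width_edges(lo: float, hi: float, n_bins: int) -> list[float]:
    """Return n_bins + 1 evenly spaced bin edges from lo to hi."""
    step = (hi - lo) / n_bins
    # Append hi explicitly so the final edge is exact.
    return [lo + i * step for i in range(n_bins)] + [hi]


def bin_shares(values: list[float], edges: list[float]) -> list[float]:
    """Percentage of values in each half-open bin [edge_i, edge_{i+1}).

    The last bin also includes its upper edge. Values outside
    [edges[0], edges[-1]] are left uncounted.
    """
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break
    return [100.0 * c / len(values) for c in counts]


# MIN and MAX are taken from the listing above; four bins is an assumption
# based on the four INTERVAL blocks.
edges = equal_width_edges(-0.207, 0.342, 4)

# Hypothetical activations, one or two per bin, purely for illustration.
sample = [-0.2, 0.0, 0.0, 0.1, 0.3]
shares = bin_shares(sample, edges)
```

With the real per-token activations in place of `sample`, `shares` would play the role of the CONTAINS figures, if the equal-width assumption holds.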
Token soon
Token ID2582
Feature activation+6.63e-02

Pos logit contributions
 walks+0.031
 walk+0.031
 conversation+0.031
 jog+0.030
 skip+0.030
Neg logit contributions
rier-0.044
isher-0.037
bars-0.035
rily-0.035
iest-0.034
Token nothing
Token ID2147
Feature activation+6.18e-02

Pos logit contributions
rier+0.123
rily+0.096
iest+0.093
isher+0.093
 spe+0.092
Neg logit contributions
 walks-0.108
 walk-0.107
 conversation-0.106
 jog-0.105
 joke-0.101
Token but
Token ID475
Feature activation+3.89e-02

Pos logit contributions
rier+0.123
rily+0.098
isher+0.097
bars+0.095
bar+0.092
Neg logit contributions
 walk-0.097
 walks-0.096
 jog-0.092
 conversation-0.092
 skip-0.089
Token a
Token ID257
Feature activation-2.59e-03

Pos logit contributions
rier+0.062
rily+0.046
isher+0.045
iest+0.045
bars+0.044
Neg logit contributions
 walks-0.075
 walk-0.074
 conversation-0.073
 jog-0.073
 skip-0.070
Token small
Token ID1402
Feature activation-2.47e-02

Pos logit contributions
 walks+0.004
 walk+0.004
 conversation+0.004
 jog+0.004
 skip+0.004
Neg logit contributions
rier-0.005
rily-0.004
isher-0.004
iest-0.004
 spe-0.004
Token spe
Token ID693
Feature activation+3.42e-01

Pos logit contributions
 walk+0.043
 walks+0.043
 conversation+0.042
 jog+0.042
 skip+0.040
Neg logit contributions
rier-0.044
rily-0.034
isher-0.033
bars-0.033
iest-0.032
Tokenck
Token ID694
Feature activation+7.32e-03

Pos logit contributions
rier+0.763
rily+0.617
bars+0.606
 diamonds+0.603
isher+0.602
Neg logit contributions
 walk-0.439
 walks-0.434
 conversation-0.425
 jog-0.416
 quarrel-0.400
Token in
Token ID287
Feature activation-6.04e-02

Pos logit contributions
rier+0.012
bars+0.009
isher+0.009
rily+0.009
 spe+0.009
Neg logit contributions
 walk-0.014
 walks-0.013
 jog-0.013
 conversation-0.013
 skip-0.012
Token the
Token ID262
Feature activation+8.00e-03

Pos logit contributions
 walks+0.104
 walk+0.103
 conversation+0.100
 jog+0.099
 skip+0.097
Neg logit contributions
rier-0.113
isher-0.087
bars-0.087
rily-0.085
bar-0.084
Token sky
Token ID6766
Feature activation-2.30e-02

Pos logit contributions
rier+0.014
rily+0.010
 diamonds+0.010
 spe+0.010
iest+0.010
Neg logit contributions
 walks-0.015
 walk-0.014
 conversation-0.014
 jog-0.014
ggies-0.014
Token.
Token ID13
Feature activation-8.88e-03

Pos logit contributions
 walk+0.035
 walks+0.035
 conversation+0.034
 jog+0.034
 skip+0.032
Neg logit contributions
rier-0.046
rily-0.036
isher-0.036
bars-0.035
iest-0.035
Token.
Token ID13
Feature activation-6.52e-03

Pos logit contributions
rier+0.037
rily+0.029
iest+0.028
isher+0.028
bars+0.028
Neg logit contributions
 walks-0.032
 walk-0.032
 conversation-0.031
 jog-0.031
 skip-0.030
Token They
Token ID1119
Feature activation-1.06e-01

Pos logit contributions
 walks+0.010
 jog+0.010
 walk+0.010
 conversation+0.009
 skip+0.009
Neg logit contributions
rier-0.013
iest-0.011
isher-0.010
bul-0.010
 diamonds-0.010
Token were
Token ID547
Feature activation+1.01e-02

Pos logit contributions
 walk+0.151
 walks+0.146
 jog+0.140
 conversation+0.140
 quarrel+0.138
Neg logit contributions
rier-0.237
rily-0.186
isher-0.185
iest-0.184
eddy-0.176
Token blue
Token ID4171
Feature activation+3.27e-02

Pos logit contributions
rier+0.022
isher+0.018
iest+0.018
rily+0.017
bars+0.017
Neg logit contributions
 walks-0.014
 walk-0.014
 conversation-0.014
 jog-0.013
 quarrel-0.013
Token and
Token ID290
Feature activation+8.81e-03

Pos logit contributions
rier+0.064
isher+0.050
rily+0.050
iest+0.048
bars+0.047
Neg logit contributions
 walks-0.053
 walk-0.053
 conversation-0.050
 jog-0.049
 stroll-0.048
Token spe
Token ID693
Feature activation+3.37e-01

Pos logit contributions
rier+0.019
isher+0.015
rily+0.015
iest+0.014
bars+0.014
Neg logit contributions
 walk-0.013
 walks-0.013
 quarrel-0.013
 jog-0.012
 skip-0.012
Tokenck
Token ID694
Feature activation+4.84e-03

Pos logit contributions
rier+0.696
rily+0.566
isher+0.560
iest+0.544
 spe+0.542
Neg logit contributions
 walks-0.478
 walk-0.476
 jog-0.466
 conversation-0.464
 skip-0.446
Tokenled
Token ID992
Feature activation+1.02e-02

Pos logit contributions
rier+0.008
rily+0.006
isher+0.006
bars+0.006
bul+0.006
Neg logit contributions
 walks-0.009
 walk-0.009
 conversation-0.009
 jog-0.009
 stroll-0.008
Token.
Token ID13
Feature activation-5.33e-03

Pos logit contributions
rier+0.018
rily+0.014
isher+0.014
iest+0.013
bars+0.013
Neg logit contributions
 walks-0.018
 walk-0.018
 conversation-0.017
 jog-0.017
 skip-0.016
Token L
Token ID406
Feature activation+1.10e-01

Pos logit contributions
 walks+0.010
 walk+0.010
 jog+0.010
 conversation+0.010
 skip+0.009
Neg logit contributions
rier-0.009
rily-0.007
iest-0.006
isher-0.006
bars-0.006
Tokenila
Token ID10102
Feature activation+9.07e-02

Pos logit contributions
rier+0.173
rily+0.127
isher+0.123
 spe+0.120
iest+0.120
Neg logit contributions
 walks-0.217
 walk-0.215
 jog-0.211
 conversation-0.211
 skip-0.204
Token away
Token ID1497
Feature activation-5.64e-02

Pos logit contributions
rier+0.075
isher+0.064
bars+0.063
iest+0.062
rily+0.061
Neg logit contributions
 walk-0.040
 walks-0.039
 jog-0.038
 skip-0.036
 conversation-0.036
Token,
Token ID11
Feature activation-2.43e-02

Pos logit contributions
 walk+0.094
 walks+0.094
 jog+0.090
 conversation+0.090
 skip+0.087
Neg logit contributions
rier-0.103
isher-0.082
rily-0.080
bars-0.079
iest-0.079
Token a
Token ID257
Feature activation-4.12e-03

Pos logit contributions
 walk+0.037
 walks+0.037
 jog+0.036
 conversation+0.035
 skip+0.034
Neg logit contributions
rier-0.049
isher-0.040
iest-0.040
rily-0.039
bars-0.038
Token bright
Token ID6016
Feature activation-1.41e-02

Pos logit contributions
 walk+0.006
 walks+0.006
 conversation+0.006
 jog+0.006
 skip+0.006
Neg logit contributions
rier-0.008
rily-0.007
isher-0.007
iest-0.006
bars-0.006
Token blue
Token ID4171
Feature activation+2.33e-02

Pos logit contributions
 walk+0.023
 conversation+0.022
 jog+0.022
 walks+0.022
 skip+0.021
Neg logit contributions
rier-0.027
rily-0.021
isher-0.021
bars-0.021
iest-0.020
Token spe
Token ID693
Feature activation+3.36e-01

Pos logit contributions
rier+0.045
rily+0.035
isher+0.034
bars+0.034
iest+0.033
Neg logit contributions
 walk-0.038
 walks-0.037
 jog-0.037
 conversation-0.037
 skip-0.036
Tokenck
Token ID694
Feature activation+1.45e-03

Pos logit contributions
rier+0.719
rily+0.575
bars+0.565
ter+0.565
 diamonds+0.565
Neg logit contributions
 walk-0.459
 walks-0.453
 conversation-0.447
 jog-0.433
 quarrel-0.418
Token in
Token ID287
Feature activation-6.79e-02

Pos logit contributions
rier+0.002
isher+0.002
bars+0.002
rily+0.002
 spe+0.002
Neg logit contributions
 walk-0.003
 walks-0.003
 jog-0.003
 conversation-0.002
 stroll-0.002
Token the
Token ID262
Feature activation+7.75e-03

Pos logit contributions
 walks+0.111
 walk+0.108
 jog+0.102
 conversation+0.101
 skip+0.099
Neg logit contributions
rier-0.137
ancer-0.109
bars-0.108
isher-0.108
iest-0.106
Token sky
Token ID6766
Feature activation-2.95e-02

Pos logit contributions
rier+0.013
rily+0.010
isher+0.009
iest+0.009
 spe+0.009
Neg logit contributions
 walks-0.014
 walk-0.014
 conversation-0.014
 jog-0.014
 quarrel-0.013
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
 walk+0.045
 walks+0.045
 conversation+0.043
 jog+0.043
 skip+0.041
Neg logit contributions
rier-0.059
isher-0.046
rily-0.046
iest-0.045
bars-0.045
Token The
Token ID383
Feature activation-5.30e-03

Pos logit contributions
 walks+0.020
 jog+0.019
 walk+0.019
 learn+0.018
 skip+0.018
Neg logit contributions
rier-0.025
iest-0.022
bul-0.021
isher-0.020
 diamonds-0.020
Token eggs
Token ID9653
Feature activation+1.28e-01

Pos logit contributions
 walk+0.010
 conversation+0.009
 quarrel+0.009
 walks+0.009
 fight+0.009
Neg logit contributions
rier-0.009
rily-0.008
isher-0.008
bars-0.008
iest-0.007
Token were
Token ID547
Feature activation+5.83e-03

Pos logit contributions
rier+0.278
isher+0.219
rily+0.219
bars+0.211
iest+0.208
Neg logit contributions
 walk-0.180
 walks-0.173
 conversation-0.170
 jog-0.166
 quarrel-0.165
Token blue
Token ID4171
Feature activation+2.87e-02

Pos logit contributions
rier+0.012
isher+0.011
iest+0.010
bars+0.010
umm+0.010
Neg logit contributions
 walk-0.008
 walks-0.008
 quarrel-0.008
 conversation-0.008
 fight-0.007
Token and
Token ID290
Feature activation+8.64e-04

Pos logit contributions
rier+0.059
isher+0.047
rily+0.047
iest+0.045
bars+0.044
Neg logit contributions
 walks-0.043
 walk-0.042
 conversation-0.040
 jog-0.039
 skip-0.039
Token spe
Token ID693
Feature activation+3.35e-01

Pos logit contributions
rier+0.002
isher+0.002
rily+0.002
umm+0.001
eddy+0.001
Neg logit contributions
 walks-0.001
 walk-0.001
 quarrel-0.001
 run-0.001
 joke-0.001
Tokenck
Token ID694
Feature activation+8.93e-04

Pos logit contributions
rier+0.731
rily+0.592
isher+0.579
iest+0.577
bars+0.572
Neg logit contributions
 walks-0.446
 walk-0.441
 conversation-0.429
 jog-0.424
 quarrel-0.402
Tokenled
Token ID992
Feature activation+5.21e-03

Pos logit contributions
rier+0.002
isher+0.001
rily+0.001
bars+0.001
bul+0.001
Neg logit contributions
 walks-0.002
 walk-0.002
 conversation-0.002
 jog-0.002
 stroll-0.001
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
rier+0.009
isher+0.007
iest+0.007
bars+0.007
rily+0.007
Neg logit contributions
 walks-0.009
 walk-0.009
 jog-0.008
 conversation-0.008
 skip-0.008
Token L
Token ID406
Feature activation+1.09e-01

Pos logit contributions
 walks+0.023
 jog+0.022
 walk+0.022
 conversation+0.022
 skip+0.021
Neg logit contributions
rier-0.021
iest-0.017
isher-0.016
bul-0.016
 diamonds-0.016
Tokenila
Token ID10102
Feature activation+8.61e-02

Pos logit contributions
rier+0.176
rily+0.130
isher+0.127
 spe+0.124
bars+0.124
Neg logit contributions
 walks-0.208
 walk-0.206
 jog-0.203
 conversation-0.201
 skip-0.196
Token The
Token ID383
Feature activation-9.03e-06

Pos logit contributions
 walks+0.010
 jog+0.009
 conversation+0.009
 learn+0.009
 walk+0.009
Neg logit contributions
rier-0.018
iest-0.016
isher-0.016
ener-0.015
bars-0.015
Token eggs
Token ID9653
Feature activation+1.23e-01

Feature not active on prev token;
no predictions were affected.
Token were
Token ID547
Feature activation+4.75e-03

Pos logit contributions
rier+0.264
isher+0.210
rily+0.208
bars+0.204
iest+0.200
Neg logit contributions
 walk-0.175
 walks-0.167
 conversation-0.163
 jog-0.161
 quarrel-0.158
Token blue
Token ID4171
Feature activation+2.67e-02

Pos logit contributions
rier+0.010
isher+0.009
bars+0.009
iest+0.009
umm+0.009
Neg logit contributions
 walk-0.006
 walks-0.006
 conversation-0.006
 quarrel-0.006
 fight-0.005
Token and
Token ID290
Feature activation-2.33e-03

Pos logit contributions
rier+0.055
isher+0.045
rily+0.044
iest+0.043
bars+0.042
Neg logit contributions
 walks-0.040
 walk-0.039
 conversation-0.036
 jog-0.036
 skip-0.035
Token spe
Token ID693
Feature activation+3.34e-01

Pos logit contributions
 walk+0.003
 walks+0.003
 quarrel+0.003
 chase+0.003
 skip+0.003
Neg logit contributions
rier-0.005
isher-0.004
rily-0.004
bars-0.004
eddy-0.004
Tokenck
Token ID694
Feature activation+1.10e-03

Pos logit contributions
rier+0.770
rily+0.626
iest+0.617
bars+0.617
bul+0.612
Neg logit contributions
 walks-0.408
 walk-0.401
 conversation-0.386
 jog-0.372
 quarrel-0.367
Tokenled
Token ID992
Feature activation+2.40e-03

Pos logit contributions
rier+0.002
isher+0.001
bars+0.001
rily+0.001
bul+0.001
Neg logit contributions
 walks-0.002
 walk-0.002
 conversation-0.002
 jog-0.002
 stroll-0.002
Token.
Token ID13
Feature activation-7.40e-03

Pos logit contributions
rier+0.005
isher+0.004
bars+0.004
iest+0.004
rily+0.004
Neg logit contributions
 walks-0.004
 walk-0.004
 jog-0.004
 conversation-0.003
 skip-0.003
Token They
Token ID1119
Feature activation-1.13e-01

Pos logit contributions
 walks+0.009
 jog+0.008
 walk+0.008
 conversation+0.008
 learn+0.008
Neg logit contributions
rier-0.017
iest-0.015
isher-0.015
bars-0.014
ener-0.014
Token looked
Token ID3114
Feature activation-3.47e-02

Pos logit contributions
 walk+0.155
 walks+0.153
 jog+0.145
 conversation+0.143
 quarrel+0.142
Neg logit contributions
rier-0.254
isher-0.203
rily-0.203
iest-0.199
bars-0.196
Token of
Token ID286
Feature activation+1.33e-01

Pos logit contributions
 walk+0.101
 walks+0.100
 jog+0.096
 skip+0.094
 conversation+0.092
Neg logit contributions
rier-0.152
isher-0.127
rily-0.123
 spe-0.121
ter-0.118
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
rier+0.234
rily+0.178
isher+0.174
iest+0.174
bars+0.171
Neg logit contributions
 walks-0.242
 walk-0.238
 conversation-0.230
 jog-0.230
 quarrel-0.221
Token frog
Token ID21264
Feature activation-8.48e-02

Pos logit contributions
rier+0.026
rily+0.020
isher+0.020
iest+0.020
bars+0.020
Neg logit contributions
 walks-0.020
 walk-0.020
 conversation-0.020
 jog-0.020
 skip-0.019
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
 walks+0.130
 conversation+0.129
 walk+0.128
 jog+0.126
 quarrel+0.120
Neg logit contributions
rier-0.164
rily-0.131
isher-0.125
iest-0.125
bars-0.124
Token He
Token ID679
Feature activation+2.82e-02

Pos logit contributions
 walks+0.017
 walk+0.017
 jog+0.016
 conversation+0.016
 stroll+0.016
Neg logit contributions
rier-0.024
iest-0.019
isher-0.019
rily-0.019
 spe-0.018
Token spe
Token ID693
Feature activation+3.34e-01

Pos logit contributions
rier+0.063
rily+0.050
isher+0.049
iest+0.048
ter+0.047
Neg logit contributions
 walks-0.040
 walk-0.039
 jog-0.037
 conversation-0.037
 joke-0.036
Tokenared
Token ID1144
Feature activation-7.57e-02

Pos logit contributions
rier+0.782
rily+0.637
 diamonds+0.629
iest+0.626
ter+0.625
Neg logit contributions
 walks-0.394
 walk-0.389
 conversation-0.379
 quarrel-0.358
 jog-0.358
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
 walks+0.126
 walk+0.125
 conversation+0.120
 jog+0.120
 skip+0.115
Neg logit contributions
rier-0.139
isher-0.109
rily-0.107
bars-0.106
iest-0.106
Token lazy
Token ID16931
Feature activation-3.45e-02

Pos logit contributions
rier+0.022
rily+0.017
isher+0.017
ter+0.017
iest+0.017
Neg logit contributions
 walks-0.018
 walk-0.018
 conversation-0.017
 jog-0.017
 skip-0.017
Token frog
Token ID21264
Feature activation-8.29e-02

Pos logit contributions
 walks+0.057
 walk+0.056
 conversation+0.056
 jog+0.055
 learn+0.053
Neg logit contributions
rier-0.064
rily-0.048
ter-0.048
iest-0.048
isher-0.047
Token and
Token ID290
Feature activation-2.52e-03

Pos logit contributions
 walks+0.126
 conversation+0.126
 jog+0.124
 walk+0.124
 learn+0.117
Neg logit contributions
rier-0.160
rily-0.128
 spe-0.122
bar-0.122
ter-0.122
Token John
Token ID1757
Feature activation+2.76e-02

Pos logit contributions
ggies+0.052
 walks+0.052
astics+0.051
 conversation+0.050
ggy+0.050
Neg logit contributions
rier-0.041
 spe-0.036
bars-0.035
ter-0.035
bul-0.033
Token chose
Token ID7690
Feature activation+1.18e-01

Pos logit contributions
rier+0.056
rily+0.044
isher+0.043
iest+0.043
 spe+0.042
Neg logit contributions
 walks-0.042
 walk-0.042
 conversation-0.040
 jog-0.040
 stroll-0.038
Token some
Token ID617
Feature activation+2.36e-02

Pos logit contributions
rier+0.232
rily+0.181
isher+0.177
 spe+0.174
iest+0.170
Neg logit contributions
 walks-0.191
 walk-0.189
 jog-0.182
 conversation-0.180
 skip-0.173
Token carrots
Token ID34397
Feature activation+4.85e-02

Pos logit contributions
rier+0.045
rily+0.035
isher+0.034
iest+0.033
bars+0.033
Neg logit contributions
 walks-0.038
 walk-0.038
 jog-0.037
 conversation-0.037
 skip-0.036
Token and
Token ID290
Feature activation-2.33e-03

Pos logit contributions
rier+0.105
isher+0.087
umm+0.086
 spe+0.085
bars+0.084
Neg logit contributions
 walk-0.061
 jog-0.058
 walks-0.056
 skip-0.055
 learn-0.055
Token cuc
Token ID38421
Feature activation+3.09e-01

Pos logit contributions
 walks+0.004
 walk+0.004
 jog+0.004
 conversation+0.004
 skip+0.004
Neg logit contributions
rier-0.004
isher-0.003
iest-0.003
rily-0.003
bul-0.003
Tokenumbers
Token ID17024
Feature activation+1.45e-01

Pos logit contributions
 spe+0.508
rier+0.506
bul+0.484
 diamonds+0.448
mith+0.440
Neg logit contributions
arre-0.418
 walks-0.412
ither-0.412
astics-0.396
 walk-0.388
Token to
Token ID284
Feature activation-1.24e-02

Pos logit contributions
rier+0.302
umm+0.261
isher+0.255
bars+0.255
bul+0.254
Neg logit contributions
 walk-0.175
 walks-0.169
 learn-0.163
 jog-0.159
 joke-0.156
Token take
Token ID1011
Feature activation-1.45e-02

Pos logit contributions
 walk+0.015
 walks+0.015
 jog+0.015
 conversation+0.014
 joke+0.014
Neg logit contributions
rier-0.029
rily-0.023
isher-0.023
bars-0.023
iest-0.023
Token home
Token ID1363
Feature activation+1.76e-03

Pos logit contributions
 walks+0.025
 walk+0.024
 conversation+0.023
 jog+0.023
 stroll+0.022
Neg logit contributions
rier-0.027
bars-0.021
rily-0.020
isher-0.020
bul-0.020
Token.
Token ID13
Feature activation-1.45e-03

Pos logit contributions
rier+0.003
isher+0.003
bars+0.003
 spe+0.003
rily+0.003
Neg logit contributions
 walk-0.003
 walks-0.003
 jog-0.002
 learn-0.002
 conversation-0.002
Token They
Token ID1119
Feature activation-1.09e-01

Pos logit contributions
 walk+0.012
 jog+0.012
 walks+0.011
 skip+0.011
 quarrel+0.011
Neg logit contributions
rier-0.010
 spe-0.008
rily-0.008
eddy-0.008
ter-0.008
Token were
Token ID547
Feature activation+1.04e-02

Pos logit contributions
 walk+0.148
 walks+0.145
 jog+0.139
 conversation+0.139
 learn+0.134
Neg logit contributions
rier-0.247
rily-0.195
isher-0.193
bars-0.191
iest-0.190
Token filled
Token ID5901
Feature activation+1.69e-01

Pos logit contributions
rier+0.023
bars+0.020
isher+0.020
bul+0.019
ener+0.019
Neg logit contributions
 walks-0.014
 learn-0.013
 conversation-0.013
 joke-0.013
 walk-0.013
Token with
Token ID351
Feature activation-2.04e-02

Pos logit contributions
rier+0.278
rily+0.213
isher+0.202
iest+0.201
ter+0.198
Neg logit contributions
 walks-0.313
 walk-0.311
 jog-0.305
 conversation-0.304
 stroll-0.292
Token fresh
Token ID4713
Feature activation+4.54e-02

Pos logit contributions
 walks+0.029
 walk+0.028
 conversation+0.028
 jog+0.027
 learn+0.026
Neg logit contributions
rier-0.044
bars-0.036
bul-0.035
isher-0.035
bar-0.035
Token cuc
Token ID38421
Feature activation+3.08e-01

Pos logit contributions
rier+0.091
bars+0.073
rily+0.072
bul+0.071
bar+0.071
Neg logit contributions
 walks-0.071
 walk-0.068
 conversation-0.066
 jog-0.065
 stroll-0.062
Tokenumber
Token ID4494
Feature activation+1.96e-01

Pos logit contributions
rier+0.491
bul+0.459
 spe+0.445
bars+0.427
 diamonds+0.424
Neg logit contributions
ither-0.457
 walk-0.454
 walks-0.451
 jog-0.446
 learn-0.443
Token and
Token ID290
Feature activation+8.31e-03

Pos logit contributions
rier+0.363
rily+0.284
isher+0.278
iest+0.273
bars+0.273
Neg logit contributions
 walks-0.320
 walk-0.319
 conversation-0.312
 jog-0.309
 skip-0.296
Token cheese
Token ID9891
Feature activation+8.87e-02

Pos logit contributions
rier+0.020
bars+0.016
isher+0.016
rily+0.016
bul+0.016
Neg logit contributions
 walk-0.011
 joke-0.010
 walks-0.010
 learn-0.010
 laugh-0.010
Token.
Token ID13
Feature activation-3.35e-03

Pos logit contributions
rier+0.175
bars+0.140
isher+0.140
rily+0.137
bul+0.134
Neg logit contributions
 walks-0.135
 walk-0.134
 conversation-0.130
 jog-0.127
 joke-0.125
Token It
Token ID632
Feature activation+2.32e-02

Pos logit contributions
 walk+0.006
 jog+0.006
 skip+0.006
 walks+0.006
 fight+0.006
Neg logit contributions
rier-0.005
 spe-0.004
rily-0.004
ter-0.004
eddy-0.004
Token like
Token ID588
Feature activation+3.80e-02

Pos logit contributions
 walk+0.095
 walks+0.095
 jog+0.091
 conversation+0.089
 skip+0.088
Neg logit contributions
rier-0.111
rily-0.088
isher-0.087
 spe-0.081
bars-0.080
Token lettuce
Token ID39406
Feature activation+7.72e-02

Pos logit contributions
rier+0.074
rily+0.059
isher+0.058
iest+0.056
bars+0.055
Neg logit contributions
 walks-0.062
 walk-0.060
 conversation-0.058
 jog-0.058
 joke-0.057
Token,
Token ID11
Feature activation-1.54e-02

Pos logit contributions
rier+0.161
isher+0.136
bars+0.135
rily+0.131
bul+0.128
Neg logit contributions
 walk-0.106
 walks-0.105
 jog-0.100
 joke-0.100
 fight-0.099
Token tomatoes
Token ID23972
Feature activation+7.09e-02

Pos logit contributions
 walks+0.025
 walk+0.024
 joke+0.024
 chase+0.023
 jog+0.023
Neg logit contributions
rier-0.029
isher-0.025
bars-0.024
iest-0.023
can-0.022
Token and
Token ID290
Feature activation+7.71e-03

Pos logit contributions
rier+0.155
isher+0.130
bars+0.126
rily+0.126
bul+0.121
Neg logit contributions
 walk-0.095
 walks-0.094
 jog-0.089
 skip-0.088
 learn-0.088
Token cuc
Token ID38421
Feature activation+3.05e-01

Pos logit contributions
rier+0.015
isher+0.012
bars+0.012
eddy+0.012
iest+0.012
Neg logit contributions
 walks-0.013
 walk-0.013
 chase-0.012
 learn-0.012
 joke-0.012
Tokenumbers
Token ID17024
Feature activation+1.47e-01

Pos logit contributions
 spe+0.488
rier+0.483
bul+0.475
bars+0.445
mith+0.430
Neg logit contributions
arre-0.430
 walks-0.421
 learn-0.420
ither-0.418
astics-0.405
Token.
Token ID13
Feature activation-2.50e-03

Pos logit contributions
rier+0.324
bars+0.282
isher+0.276
umm+0.272
bul+0.272
Neg logit contributions
 walk-0.166
 walks-0.164
 learn-0.162
 fight-0.154
 jog-0.153
Token He
Token ID679
Feature activation+3.54e-02

Pos logit contributions
 walks+0.003
 jog+0.003
 walk+0.003
 conversation+0.003
 skip+0.003
Neg logit contributions
rier-0.006
iest-0.005
isher-0.005
ier-0.005
rily-0.005
Token smiled
Token ID13541
Feature activation+1.62e-02

Pos logit contributions
rier+0.076
rily+0.059
isher+0.059
iest+0.058
bars+0.058
Neg logit contributions
 walks-0.052
 walk-0.050
 jog-0.048
 conversation-0.048
 joke-0.047
Token as
Token ID355
Feature activation+4.52e-02

Pos logit contributions
rier+0.034
isher+0.034
bars+0.033
iest+0.032
 spe+0.032
Neg logit contributions
 jog-0.018
 walk-0.018
 walks-0.017
 conversation-0.017
 learn-0.017
Token garden
Token ID11376
Feature activation-7.52e-02

Pos logit contributions
rier+0.046
rily+0.037
isher+0.036
ter+0.036
 diamonds+0.036
Neg logit contributions
 walks-0.026
 walk-0.025
 conversation-0.025
 jog-0.025
 stroll-0.024
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
 walk+0.112
 walks+0.112
 conversation+0.106
 jog+0.105
 learn+0.103
Neg logit contributions
rier-0.154
isher-0.125
iest-0.125
rily-0.122
bars-0.121
Token They
Token ID1119
Feature activation-9.45e-02

Pos logit contributions
rier+0.039
rily+0.031
 spe+0.030
 diamonds+0.030
ter+0.030
Neg logit contributions
 walk-0.028
 walks-0.028
 conversation-0.027
 jog-0.026
 quarrel-0.026
Token cut
Token ID2005
Feature activation+1.08e-01

Pos logit contributions
 conversation+0.112
ggies+0.110
 walks+0.109
 stroll+0.102
 walk+0.102
Neg logit contributions
rier-0.207
 spe-0.174
rily-0.171
isher-0.169
bars-0.167
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
rier+0.206
bars+0.177
isher+0.167
bul+0.166
iest+0.165
Neg logit contributions
 walks-0.158
 walk-0.153
 learn-0.144
 chase-0.142
 conversation-0.142
Token cuc
Token ID38421
Feature activation+3.05e-01

Pos logit contributions
rier+0.036
bars+0.030
isher+0.029
bul+0.029
rily+0.029
Neg logit contributions
 walk-0.038
 walks-0.038
 conversation-0.036
 jog-0.036
 joke-0.036
Tokenumber
Token ID4494
Feature activation+1.86e-01

Pos logit contributions
 spe+0.504
rier+0.488
bul+0.476
 diamonds+0.470
bars+0.466
Neg logit contributions
ither-0.439
 learn-0.423
 achieve-0.395
 obey-0.392
 walk-0.388
Token and
Token ID290
Feature activation+2.23e-02

Pos logit contributions
rier+0.367
isher+0.296
iest+0.291
 diamonds+0.289
rily+0.288
Neg logit contributions
 walk-0.283
 walks-0.279
 learn-0.262
 jog-0.261
 conversation-0.256
Token put
Token ID1234
Feature activation+8.41e-02

Pos logit contributions
rier+0.050
iest+0.041
rily+0.041
isher+0.040
eddy+0.039
Neg logit contributions
 walk-0.031
 walks-0.031
 jog-0.029
 learn-0.029
 skip-0.028
Token it
Token ID340
Feature activation+2.16e-02

Pos logit contributions
rier+0.158
bars+0.138
iest+0.130
ancer+0.127
umm+0.126
Neg logit contributions
 walks-0.133
 walk-0.125
 fight-0.117
 chase-0.113
 play-0.112
Token in
Token ID287
Feature activation-5.43e-02

Pos logit contributions
rier+0.044
bars+0.035
isher+0.035
iest+0.035
rily+0.034
Neg logit contributions
 walk-0.032
 walks-0.032
 jog-0.030
 skip-0.029
 conversation-0.029
Token cut
Token ID2005
Feature activation+1.08e-01

Pos logit contributions
 walks+0.015
 walk+0.014
 conversation+0.014
 jog+0.014
 stroll+0.013
Neg logit contributions
rier-0.023
rily-0.019
isher-0.018
iest-0.018
 spe-0.018
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
rier+0.203
bars+0.175
 spe+0.172
isher+0.172
bul+0.168
Neg logit contributions
 walks-0.159
 walk-0.148
 learn-0.146
 fight-0.137
 skip-0.136
Token lettuce
Token ID39406
Feature activation+6.73e-02

Pos logit contributions
rier+0.026
rily+0.021
isher+0.021
bars+0.020
iest+0.020
Neg logit contributions
 walk-0.023
 walks-0.023
 conversation-0.022
 jog-0.022
 joke-0.021
Token and
Token ID290
Feature activation+3.53e-03

Pos logit contributions
rier+0.131
isher+0.113
bars+0.110
 spe+0.107
iest+0.107
Neg logit contributions
 walk-0.099
 walks-0.094
 jog-0.093
 learn-0.092
 skip-0.092
Token the
Token ID262
Feature activation+1.39e-02

Pos logit contributions
rier+0.007
iest+0.006
eddy+0.006
isher+0.006
rily+0.006
Neg logit contributions
 walks-0.006
 walk-0.006
 learn-0.005
 joke-0.005
 fight-0.005
Token cuc
Token ID38421
Feature activation+3.04e-01

Pos logit contributions
rier+0.025
rily+0.020
isher+0.020
bars+0.020
iest+0.020
Neg logit contributions
 walk-0.023
 walks-0.023
 conversation-0.022
 jog-0.022
 joke-0.022
Tokenumber
Token ID4494
Feature activation+1.81e-01

Pos logit contributions
rier+0.480
 spe+0.454
bul+0.449
 diamonds+0.432
bars+0.429
Neg logit contributions
ither-0.457
 learn-0.446
 walk-0.442
 walks-0.439
arre-0.429
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
rier+0.378
rily+0.300
isher+0.298
 spe+0.292
 diamonds+0.292
Neg logit contributions
 walk-0.261
 walks-0.260
 learn-0.242
 jog-0.242
 conversation-0.241
Token She
Token ID1375
Feature activation-2.63e-02

Pos logit contributions
 walk+0.018
 walks+0.018
 conversation+0.018
 jog+0.017
 stroll+0.017
Neg logit contributions
rier-0.020
rily-0.015
bars-0.015
isher-0.015
bar-0.015
Token said
Token ID531
Feature activation+2.39e-02

Pos logit contributions
 walks+0.037
 walk+0.036
 joke+0.035
 conversation+0.034
 learn+0.033
Neg logit contributions
rier-0.062
rily-0.047
iest-0.046
isher-0.045
umm-0.045
Token he
Token ID339
Feature activation+8.59e-02

Pos logit contributions
rier+0.043
isher+0.038
bars+0.036
rily+0.036
umm+0.035
Neg logit contributions
 walks-0.040
 joke-0.037
 walk-0.037
 jog-0.037
 conversation-0.037
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
rier+0.168
bars+0.157
isher+0.152
bar+0.151
bul+0.150
Neg logit contributions
 walks-0.094
 walk-0.085
 lawn-0.082
 joke-0.081
 jog-0.080
Token And
Token ID843
Feature activation+2.34e-02

Pos logit contributions
rier+0.045
isher+0.039
iest+0.037
rily+0.037
ier+0.035
Neg logit contributions
 learn-0.033
 walks-0.032
 jog-0.030
 skip-0.030
 joke-0.030
Token this
Token ID428
Feature activation+2.21e-02

Pos logit contributions
rier+0.042
isher+0.036
bars+0.035
eddy+0.035
bul+0.034
Neg logit contributions
 walks-0.040
 learn-0.040
 walk-0.039
 skip-0.039
 jog-0.038
Token is
Token ID318
Feature activation-3.29e-02

Pos logit contributions
rier+0.048
isher+0.039
bars+0.039
bul+0.039
rily+0.038
Neg logit contributions
 walks-0.030
 walk-0.030
 joke-0.030
 conversation-0.029
 learn-0.028
Token a
Token ID257
Feature activation+5.42e-03

Pos logit contributions
 walks+0.044
 joke+0.042
 learn+0.040
 walk+0.040
 fight+0.039
Neg logit contributions
rier-0.070
bars-0.063
isher-0.062
iest-0.059
eddy-0.057
Token cuc
Token ID38421
Feature activation+3.03e-01

Pos logit contributions
rier+0.011
bars+0.009
isher+0.009
rily+0.009
iest+0.009
Neg logit contributions
 walk-0.008
 conversation-0.008
 joke-0.008
 walks-0.008
 jog-0.008
Tokenumber
Token ID4494
Feature activation+1.90e-01

Pos logit contributions
rier+0.467
 diamonds+0.462
bul+0.462
 spe+0.455
bars+0.450
Neg logit contributions
ither-0.418
ggy-0.394
 jog-0.388
 walk-0.383
 learn-0.381
Token.
Token ID13
Feature activation+2.21e-02

Pos logit contributions
rier+0.391
rily+0.324
bars+0.322
umm+0.321
isher+0.319
Neg logit contributions
 walk-0.265
ggy-0.256
 walks-0.255
 learn-0.255
 joke-0.249
Token It
Token ID632
Feature activation+2.90e-02

Pos logit contributions
rier+0.047
isher+0.039
rily+0.038
iest+0.038
bars+0.036
Neg logit contributions
 learn-0.031
 walks-0.031
 jog-0.029
 skip-0.029
 joke-0.028
Token is
Token ID318
Feature activation-3.15e-02

Pos logit contributions
rier+0.064
isher+0.052
rily+0.052
bars+0.051
iest+0.050
Neg logit contributions
 walks-0.040
 walk-0.039
 joke-0.037
 jog-0.037
 learn-0.036
Token green
Token ID4077
Feature activation+9.02e-03

Pos logit contributions
 walks+0.042
 joke+0.040
 walk+0.039
 learn+0.038
 quarrel+0.038
Neg logit contributions
rier-0.067
bars-0.060
isher-0.059
iest-0.056
bul-0.055
Token saw
Token ID2497
Feature activation+3.84e-02

Pos logit contributions
rier+0.050
rily+0.040
isher+0.039
 spe+0.038
iest+0.038
Neg logit contributions
 walks-0.041
 walk-0.041
 conversation-0.040
 jog-0.040
 skip-0.039
Token carrots
Token ID34397
Feature activation+4.34e-02

Pos logit contributions
rier+0.087
isher+0.071
rily+0.070
bars+0.069
bul+0.068
Neg logit contributions
 walks-0.053
 conversation-0.050
 walk-0.049
 joke-0.047
 jog-0.047
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
rier+0.085
isher+0.068
rily+0.067
bars+0.066
bul+0.064
Neg logit contributions
 walk-0.068
 walks-0.067
 jog-0.066
 conversation-0.064
 skip-0.063
Token potatoes
Token ID18821
Feature activation+7.34e-02

Pos logit contributions
 walks+0.027
 walk+0.026
 jog+0.025
 conversation+0.025
 joke+0.025
Neg logit contributions
rier-0.033
isher-0.027
bars-0.026
rily-0.026
iest-0.025
Token,
Token ID11
Feature activation-1.60e-02

Pos logit contributions
rier+0.138
rily+0.107
isher+0.105
iest+0.104
bars+0.103
Neg logit contributions
 walks-0.120
 walk-0.119
 conversation-0.116
 jog-0.116
 skip-0.111
Token cuc
Token ID38421
Feature activation+3.03e-01

Pos logit contributions
 walks+0.026
 walk+0.025
 jog+0.024
 conversation+0.023
 chase+0.023
Neg logit contributions
rier-0.034
isher-0.027
bars-0.026
bar-0.026
rily-0.026
Tokenumbers
Token ID17024
Feature activation+1.45e-01

Pos logit contributions
rier+0.505
bul+0.419
 spe+0.413
bars+0.394
iest+0.385
Neg logit contributions
 walks-0.508
 walk-0.506
 jog-0.495
 conversation-0.487
 stroll-0.478
Token and
Token ID290
Feature activation+3.79e-03

Pos logit contributions
rier+0.301
isher+0.242
bars+0.239
rily+0.238
bul+0.237
Neg logit contributions
 walk-0.206
 walks-0.205
 jog-0.195
 conversation-0.194
 quarrel-0.187
Token tomatoes
Token ID23972
Feature activation+6.52e-02

Pos logit contributions
rier+0.007
isher+0.006
rily+0.006
iest+0.005
bars+0.005
Neg logit contributions
 walks-0.007
 walk-0.006
 jog-0.006
 conversation-0.006
 learn-0.006
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
rier+0.144
isher+0.120
bul+0.115
 spe+0.115
bars+0.115
Neg logit contributions
 walk-0.084
 walks-0.084
 jog-0.079
 conversation-0.076
 skip-0.075
Token He
Token ID679
Feature activation+3.03e-02

Pos logit contributions
 walks+0.026
 walk+0.026
 jog+0.025
 conversation+0.025
 skip+0.024
Neg logit contributions
rier-0.026
rily-0.020
isher-0.019
bars-0.019
iest-0.019
Token house
Token ID2156
Feature activation-2.52e-02

Pos logit contributions
rier+0.041
isher+0.031
rily+0.031
 cuc+0.031
 diamonds+0.031
Neg logit contributions
 walks-0.030
ggies-0.029
 walk-0.029
 conversation-0.028
 jog-0.028
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
 walk+0.037
 walks+0.036
 conversation+0.035
 jog+0.035
 learn+0.034
Neg logit contributions
rier-0.052
rily-0.042
isher-0.042
iest-0.041
 diamonds-0.041
Token They
Token ID1119
Feature activation-1.00e-01

Pos logit contributions
rier+0.034
rily+0.028
 spe+0.027
 diamonds+0.027
ter+0.026
Neg logit contributions
 walks-0.022
 walk-0.022
 conversation-0.022
 quarrel-0.021
 fight-0.021
Token wash
Token ID13502
Feature activation+1.24e-02

Pos logit contributions
 conversation+0.126
ggies+0.125
 walks+0.120
 stroll+0.114
ggy+0.114
Neg logit contributions
rier-0.210
 spe-0.180
 cuc-0.173
rily-0.172
isher-0.170
Token the
Token ID262
Feature activation+2.10e-02

Pos logit contributions
rier+0.026
rily+0.020
isher+0.020
iest+0.020
bars+0.019
Neg logit contributions
 walks-0.018
 walk-0.018
 conversation-0.018
 jog-0.017
 skip-0.017
Token cuc
Token ID38421
Feature activation+3.02e-01

Pos logit contributions
rier+0.038
rily+0.030
bars+0.030
iest+0.029
isher+0.029
Neg logit contributions
 walk-0.036
 walks-0.035
 conversation-0.035
 jog-0.034
 joke-0.033
Tokenumber
Token ID4494
Feature activation+1.80e-01

Pos logit contributions
 spe+0.505
rier+0.485
 diamonds+0.476
bul+0.468
bars+0.450
Neg logit contributions
ither-0.420
 learn-0.408
 jog-0.385
 walk-0.384
 obey-0.379
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
rier+0.343
rily+0.268
isher+0.264
iest+0.261
bars+0.257
Neg logit contributions
 walk-0.290
 walks-0.289
 jog-0.279
 conversation-0.278
 skip-0.267
Token cut
Token ID2005
Feature activation+1.07e-01

Pos logit contributions
rier+0.039
rily+0.031
isher+0.031
iest+0.031
bars+0.030
Neg logit contributions
 walk-0.022
 walks-0.022
 jog-0.021
 conversation-0.020
 skip-0.020
Token it
Token ID340
Feature activation+1.88e-02

Pos logit contributions
rier+0.199
bars+0.177
bul+0.167
isher+0.167
 spe+0.167
Neg logit contributions
 walks-0.149
 walk-0.141
 learn-0.135
 jog-0.129
 fight-0.128
Token into
Token ID656
Feature activation-4.22e-02

Pos logit contributions
rier+0.033
rily+0.026
isher+0.025
iest+0.025
bars+0.024
Neg logit contributions
 walks-0.033
 walk-0.032
 conversation-0.032
 jog-0.032
 skip-0.030
Token.
Token ID13
Feature activation+1.45e-02

Pos logit contributions
rier+0.015
ter+0.012
rily+0.012
iest+0.012
bars+0.012
Neg logit contributions
 conversation-0.012
 walks-0.012
 walk-0.012
 stroll-0.011
 jog-0.011
Token You
Token ID921
Feature activation-2.85e-02

Pos logit contributions
rier+0.033
rily+0.027
isher+0.027
iest+0.027
bars+0.026
Neg logit contributions
 walks-0.018
 walk-0.017
 jog-0.017
 conversation-0.017
 skip-0.016
Token found
Token ID1043
Feature activation+9.52e-02

Pos logit contributions
 walks+0.042
 conversation+0.041
 walk+0.040
 jog+0.040
 stroll+0.039
Neg logit contributions
rier-0.054
iest-0.045
 spe-0.044
ter-0.044
isher-0.044
Token a
Token ID257
Feature activation+1.08e-02

Pos logit contributions
rier+0.207
bars+0.178
rily+0.171
iest+0.168
bul+0.168
Neg logit contributions
 walks-0.143
 walk-0.118
 conversation-0.117
 fight-0.114
 chase-0.113
Token nice
Token ID3621
Feature activation-3.63e-02

Pos logit contributions
rier+0.021
rily+0.016
isher+0.016
iest+0.016
 spe+0.016
Neg logit contributions
 walks-0.017
 walk-0.017
 conversation-0.016
 jog-0.016
 stroll-0.015
Token cuc
Token ID38421
Feature activation+3.01e-01

Pos logit contributions
 walk+0.054
 walks+0.053
 conversation+0.052
 jog+0.052
 skip+0.049
Neg logit contributions
rier-0.075
rily-0.060
isher-0.058
bars-0.058
iest-0.058
Tokenumber
Token ID4494
Feature activation+1.83e-01

Pos logit contributions
 spe+0.501
bul+0.480
 diamonds+0.470
rier+0.468
bars+0.443
Neg logit contributions
 learn-0.393
ither-0.389
arre-0.388
 jog-0.383
 walk-0.372
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
rier+0.376
rily+0.311
 spe+0.307
 diamonds+0.304
bars+0.300
Neg logit contributions
 walk-0.259
 walks-0.249
 learn-0.248
 fight-0.248
 jog-0.239
Token It
Token ID632
Feature activation+2.02e-02

Pos logit contributions
rier+0.021
iest+0.017
rily+0.016
bars+0.016
isher+0.016
Neg logit contributions
 walks-0.017
 learn-0.016
 jog-0.016
 skip-0.016
 walk-0.016
Token will
Token ID481
Feature activation+3.17e-02

Pos logit contributions
rier+0.047
rily+0.038
isher+0.037
bars+0.036
iest+0.035
Neg logit contributions
 walks-0.026
 walk-0.026
 joke-0.025
 fight-0.024
 quarrel-0.024
Token be
Token ID307
Feature activation-5.86e-02

Pos logit contributions
rier+0.082
rily+0.072
isher+0.071
bars+0.071
eddy+0.069
Neg logit contributions
 walk-0.033
 learn-0.032
 fight-0.032
 joke-0.030
 jog-0.030
Token thank
Token ID5875
Feature activation+1.47e-01

Pos logit contributions
 walk+0.011
 jog+0.011
 joke+0.011
 skip+0.010
 run+0.010
Neg logit contributions
rier-0.012
rily-0.010
bars-0.009
isher-0.009
rol-0.009
Token you
Token ID345
Feature activation-9.01e-04

Pos logit contributions
rier+0.236
rily+0.178
bars+0.172
isher+0.171
iest+0.170
Neg logit contributions
 walk-0.287
 walks-0.286
 jog-0.279
 conversation-0.276
 skip-0.270
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 learn+0.001
 walk+0.001
 conversation+0.001
 walks+0.001
 jog+0.001
Neg logit contributions
rier-0.002
isher-0.002
bars-0.002
bul-0.002
 diamonds-0.002
Token We
Token ID775
Feature activation-1.14e-02

Pos logit contributions
rier+0.032
rily+0.026
iest+0.026
ier+0.025
bars+0.025
Neg logit contributions
 walks-0.027
 jog-0.027
 skip-0.027
 walk-0.027
 learn-0.027
Token love
Token ID1842
Feature activation-8.65e-03

Pos logit contributions
 walks+0.018
ggies+0.017
 conversation+0.017
 walk+0.017
 stroll+0.017
Neg logit contributions
rier-0.020
isher-0.017
iest-0.017
ter-0.017
 spe-0.016
Token cuc
Token ID38421
Feature activation+3.01e-01

Pos logit contributions
 walks+0.016
 walk+0.015
 conversation+0.015
 joke+0.015
 jog+0.015
Neg logit contributions
rier-0.016
rily-0.012
bars-0.012
iest-0.012
isher-0.012
Tokenumbers
Token ID17024
Feature activation+1.43e-01

Pos logit contributions
rier+0.499
 spe+0.467
bul+0.444
 diamonds+0.429
bars+0.420
Neg logit contributions
 learn-0.445
 walk-0.444
 jog-0.442
ither-0.438
 walks-0.437
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
rier+0.309
bars+0.262
isher+0.259
rily+0.257
bul+0.255
Neg logit contributions
 walks-0.178
 walk-0.176
 learn-0.176
 jog-0.166
 conversation-0.166
Token And
Token ID843
Feature activation+1.93e-02

Pos logit contributions
rier+0.041
iest+0.035
rily+0.034
bars+0.034
ier+0.033
Neg logit contributions
 learn-0.026
 walks-0.025
 jog-0.025
 walk-0.024
 chase-0.024
Token we
Token ID356
Feature activation+2.71e-04

Pos logit contributions
rier+0.038
bars+0.033
bul+0.032
isher+0.031
iest+0.031
Neg logit contributions
 learn-0.033
 walk-0.031
 play-0.030
 jog-0.030
 skip-0.030
Token love
Token ID1842
Feature activation-8.02e-03

Pos logit contributions
rier+0.000
isher+0.000
ter+0.000
 spe+0.000
iest+0.000
Neg logit contributions
 walks-0.000
 conversation-0.000
 walk-0.000
 jog-0.000
ggies-0.000
Token happy
Token ID3772
Feature activation-1.92e-02

Pos logit contributions
 walks+0.085
 walk+0.082
 conversation+0.080
 jog+0.079
 quarrel+0.076
Neg logit contributions
rier-0.136
bars-0.112
iest-0.111
rily-0.111
isher-0.111
Token to
Token ID284
Feature activation-9.71e-03

Pos logit contributions
 walk+0.031
 walks+0.031
 jog+0.030
 learn+0.030
 skip+0.030
Neg logit contributions
rier-0.037
isher-0.031
rily-0.030
bars-0.029
bar-0.028
Token see
Token ID766
Feature activation+1.74e-02

Pos logit contributions
 walks+0.011
 conversation+0.011
 walk+0.011
ggies+0.011
 jog+0.011
Neg logit contributions
rier-0.022
 spe-0.018
rily-0.018
 diamonds-0.017
ter-0.017
Token our
Token ID674
Feature activation-2.97e-02

Pos logit contributions
rier+0.035
isher+0.030
bars+0.030
umm+0.029
bar+0.029
Neg logit contributions
 walks-0.026
 learn-0.024
 walk-0.024
 play-0.023
 skip-0.022
Token big
Token ID1263
Feature activation+2.16e-03

Pos logit contributions
 walks+0.050
 walk+0.050
 conversation+0.049
 jog+0.049
 quarrel+0.047
Neg logit contributions
rier-0.054
rily-0.042
isher-0.041
iest-0.040
bars-0.040
Token cuc
Token ID38421
Feature activation+3.00e-01

Pos logit contributions
rier+0.004
bars+0.003
rily+0.003
bul+0.003
bar+0.003
Neg logit contributions
 walks-0.004
 walk-0.004
 conversation-0.004
 quarrel-0.003
 fight-0.003
Tokenumber
Token ID4494
Feature activation+1.83e-01

Pos logit contributions
 spe+0.457
rier+0.453
bul+0.437
bars+0.428
 diamonds+0.427
Neg logit contributions
ither-0.439
 learn-0.434
 walk-0.422
 jog-0.421
 achieve-0.421
Token."
Token ID526
Feature activation+4.07e-02

Pos logit contributions
rier+0.348
isher+0.286
 spe+0.281
bars+0.277
rily+0.276
Neg logit contributions
 walk-0.286
 walks-0.281
 learn-0.273
 jog-0.267
 conversation-0.265
Token Lily
Token ID20037
Feature activation-4.18e-02

Pos logit contributions
rier+0.088
isher+0.072
rily+0.070
iest+0.068
bars+0.068
Neg logit contributions
 walk-0.064
 jog-0.064
 walks-0.064
 skip-0.061
 learn-0.061
Token says
Token ID1139
Feature activation+2.05e-02

Pos logit contributions
ggies+0.085
ggy+0.082
 stroll+0.081
 conversation+0.081
 walk+0.080
Neg logit contributions
rier-0.055
rily-0.041
bars-0.039
 cuc-0.039
ener-0.038
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
rier+0.038
rily+0.029
bars+0.027
iest+0.027
isher+0.027
Neg logit contributions
 walks-0.035
 walk-0.034
 conversation-0.034
 jog-0.034
 quarrel-0.033
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
 walks+0.102
 walk+0.100
 conversation+0.096
 jog+0.093
 quarrel+0.093
Neg logit contributions
rier-0.129
bars-0.105
isher-0.103
iest-0.102
bar-0.099
Token bowl
Token ID9396
Feature activation+1.14e-01

Pos logit contributions
rier+0.020
bars+0.017
bul+0.017
isher+0.017
rily+0.016
Neg logit contributions
 walk-0.017
 walks-0.017
 conversation-0.016
 jog-0.016
 quarrel-0.016
Token.
Token ID13
Feature activation-1.58e-02

Pos logit contributions
rier+0.255
 spe+0.221
 diamonds+0.213
isher+0.204
bul+0.200
Neg logit contributions
 walk-0.135
 learn-0.126
 jog-0.122
 skip-0.122
 play-0.120
Token Lucy
Token ID22162
Feature activation-4.33e-02

Pos logit contributions
 walk+0.025
 walks+0.025
 jog+0.024
 conversation+0.024
 joke+0.023
Neg logit contributions
rier-0.031
rily-0.024
 spe-0.023
bars-0.023
isher-0.023
Token put
Token ID1234
Feature activation+7.93e-02

Pos logit contributions
 walks+0.064
 walk+0.064
 conversation+0.062
 jog+0.061
 skip+0.059
Neg logit contributions
rier-0.088
rily-0.070
isher-0.069
iest-0.068
bars-0.067
Token cuc
Token ID38421
Feature activation+2.99e-01

Pos logit contributions
rier+0.168
bars+0.149
eddy+0.139
bul+0.138
umm+0.137
Neg logit contributions
 walks-0.109
 walk-0.098
 conversation-0.091
 fight-0.088
 chase-0.088
Tokenumbers
Token ID17024
Feature activation+1.39e-01

Pos logit contributions
rier+0.519
 spe+0.506
bul+0.464
bars+0.462
 diamonds+0.450
Neg logit contributions
ither-0.410
arre-0.385
 walk-0.382
 learn-0.382
 skip-0.373
Token,
Token ID11
Feature activation-1.54e-02

Pos logit contributions
rier+0.299
bars+0.256
umm+0.254
 spe+0.245
isher+0.241
Neg logit contributions
 walk-0.179
 walks-0.168
 skip-0.161
 jog-0.160
 learn-0.151
Token tomatoes
Token ID23972
Feature activation+6.06e-02

Pos logit contributions
 walks+0.026
 walk+0.026
 learn+0.023
 chase+0.023
 skip+0.023
Neg logit contributions
rier-0.031
eddy-0.025
iest-0.025
isher-0.024
bars-0.023
Token and
Token ID290
Feature activation+1.56e-03

Pos logit contributions
rier+0.133
isher+0.108
umm+0.104
bars+0.103
rily+0.103
Neg logit contributions
 walk-0.083
 walks-0.078
 jog-0.075
 skip-0.075
 stroll-0.071
Token some
Token ID617
Feature activation+5.39e-03

Pos logit contributions
rier+0.003
eddy+0.002
isher+0.002
iest+0.002
bars+0.002
Neg logit contributions
 walks-0.003
 walk-0.003
 learn-0.002
 chase-0.002
 run-0.002
Token tomato
Token ID24240
Feature activation+4.88e-02

Pos logit contributions
 walks+0.003
 walk+0.003
 conversation+0.003
 jog+0.003
 skip+0.003
Neg logit contributions
rier-0.004
rily-0.003
isher-0.003
iest-0.003
 spe-0.003
Token,
Token ID11
Feature activation-1.56e-02

Pos logit contributions
rier+0.100
rily+0.079
isher+0.079
bars+0.078
iest+0.076
Neg logit contributions
 walks-0.074
 walk-0.073
 jog-0.070
 skip-0.068
 joke-0.067
Token and
Token ID290
Feature activation+3.57e-03

Pos logit contributions
 walks+0.024
 walk+0.023
 chase+0.022
 jog+0.022
 skip+0.022
Neg logit contributions
rier-0.032
iest-0.026
isher-0.026
rily-0.025
bars-0.025
Token then
Token ID788
Feature activation+4.30e-03

Pos logit contributions
rier+0.008
iest+0.007
umm+0.006
isher+0.006
rily+0.006
Neg logit contributions
 walks-0.005
 walk-0.005
 chase-0.004
 fight-0.004
 run-0.004
Token a
Token ID257
Feature activation-2.05e-03

Pos logit contributions
rier+0.008
rily+0.006
isher+0.006
iest+0.006
bars+0.006
Neg logit contributions
 walks-0.007
 walk-0.007
 jog-0.007
 conversation-0.007
 skip-0.007
Token cuc
Token ID38421
Feature activation+2.97e-01

Pos logit contributions
 walk+0.003
 walks+0.003
 conversation+0.003
 jog+0.003
 joke+0.003
Neg logit contributions
rier-0.004
rily-0.003
isher-0.003
bars-0.003
iest-0.003
Tokenumber
Token ID4494
Feature activation+1.78e-01

Pos logit contributions
rier+0.478
bul+0.457
 diamonds+0.449
 spe+0.447
bars+0.431
Neg logit contributions
ither-0.444
 walk-0.407
 jog-0.405
ggy-0.402
 skip-0.396
Token.
Token ID13
Feature activation-4.26e-03

Pos logit contributions
rier+0.384
rily+0.307
umm+0.295
bars+0.293
isher+0.289
Neg logit contributions
 walks-0.270
 walk-0.269
 fight-0.247
 jog-0.246
 chase-0.242
Token Lily
Token ID20037
Feature activation-4.58e-02

Pos logit contributions
 walks+0.004
 jog+0.004
 walk+0.004
 conversation+0.004
 learn+0.004
Neg logit contributions
rier-0.011
iest-0.010
isher-0.009
bars-0.009
ener-0.009
Token and
Token ID290
Feature activation+4.63e-03

Pos logit contributions
 walks+0.079
 walk+0.079
 conversation+0.077
 jog+0.077
 stroll+0.074
Neg logit contributions
rier-0.079
rily-0.062
isher-0.059
 spe-0.059
iest-0.059
Token Tom
Token ID4186
Feature activation-1.45e-03

Pos logit contributions
rier+0.009
rily+0.007
isher+0.007
iest+0.007
bars+0.007
Neg logit contributions
 walks-0.008
 walk-0.007
 conversation-0.007
 jog-0.007
 skip-0.007
Token has
Token ID468
Feature activation-6.16e-02

Pos logit contributions
rier+0.050
isher+0.041
rily+0.041
iest+0.039
bars+0.039
Neg logit contributions
 walks-0.030
 walk-0.029
 joke-0.028
 quarrel-0.027
 conversation-0.027
Token lettuce
Token ID39406
Feature activation+6.25e-02

Pos logit contributions
 walks+0.082
 joke+0.068
 walk+0.068
 conversation+0.065
 learn+0.065
Neg logit contributions
rier-0.144
isher-0.125
ancer-0.122
bars-0.121
rily-0.119
Token and
Token ID290
Feature activation+1.73e-02

Pos logit contributions
rier+0.132
bars+0.116
isher+0.116
bul+0.112
iest+0.107
Neg logit contributions
 walks-0.079
 walk-0.079
 fight-0.075
 quarrel-0.073
 skip-0.073
Token tomatoes
Token ID23972
Feature activation+5.95e-02

Pos logit contributions
rier+0.035
isher+0.030
ancer+0.029
bars+0.028
iest+0.028
Neg logit contributions
 walks-0.028
 walk-0.026
 chase-0.025
 joke-0.024
 learn-0.024
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
rier+0.132
isher+0.115
bars+0.111
bul+0.108
 spe+0.107
Neg logit contributions
 walks-0.073
 walk-0.072
 fight-0.068
 learn-0.066
 jog-0.066
Token cuc
Token ID38421
Feature activation+2.97e-01

Pos logit contributions
rier+0.030
isher+0.025
iest+0.025
rily+0.024
bars+0.024
Neg logit contributions
 walks-0.026
 walk-0.024
 chase-0.024
 joke-0.023
 learn-0.023
Tokenumbers
Token ID17024
Feature activation+1.39e-01

Pos logit contributions
rier+0.484
bul+0.479
 spe+0.468
bars+0.436
 diamonds+0.414
Neg logit contributions
 walks-0.434
ither-0.431
arre-0.411
 learn-0.407
 walk-0.405
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
rier+0.320
bars+0.289
isher+0.279
bul+0.279
umm+0.274
Neg logit contributions
 walks-0.143
 walk-0.140
 fight-0.136
 learn-0.133
 quarrel-0.130
Token And
Token ID843
Feature activation+1.84e-02

Pos logit contributions
rier+0.030
iest+0.026
isher+0.025
rily+0.024
ier+0.024
Neg logit contributions
 walks-0.012
 jog-0.011
 walk-0.011
 conversation-0.011
 learn-0.011
Token it
Token ID340
Feature activation+2.11e-02

Pos logit contributions
rier+0.033
isher+0.026
iest+0.026
bul+0.026
bars+0.025
Neg logit contributions
 walks-0.032
 walk-0.032
 learn-0.031
 jog-0.031
 skip-0.030
Token has
Token ID468
Feature activation-6.29e-02

Pos logit contributions
rier+0.043
rily+0.034
isher+0.034
iest+0.033
bars+0.033
Neg logit contributions
 walks-0.032
 walk-0.031
 conversation-0.030
 jog-0.030
 joke-0.029
Token two
Token ID734
Feature activation-7.95e-02

Pos logit contributions
rier+0.021
isher+0.016
iest+0.016
rily+0.016
bars+0.016
Neg logit contributions
 walks-0.018
 walk-0.018
 conversation-0.017
 jog-0.017
 skip-0.017
Token squirrel
Token ID33039
Feature activation-1.04e-01

Pos logit contributions
 walks+0.126
 walk+0.125
 jog+0.122
 conversation+0.122
 skip+0.115
Neg logit contributions
rier-0.159
rily-0.129
isher-0.122
bul-0.120
 spe-0.119
Tokens
Token ID82
Feature activation-6.29e-02

Pos logit contributions
 walks+0.171
 walk+0.170
 conversation+0.165
 jog+0.165
 skip+0.157
Neg logit contributions
rier-0.198
rily-0.154
isher-0.148
iest-0.146
 diamonds-0.144
Token kept
Token ID4030
Feature activation+3.66e-02

Pos logit contributions
 walks+0.096
 walk+0.094
 conversation+0.093
 jog+0.092
 joke+0.090
Neg logit contributions
rier-0.118
isher-0.093
bars-0.093
rily-0.092
iest-0.092
Token qu
Token ID627
Feature activation-9.99e-02

Pos logit contributions
rier+0.077
rily+0.063
 spe+0.062
 diamonds+0.062
isher+0.062
Neg logit contributions
 conversation-0.046
 walk-0.046
 walks-0.045
 jog-0.045
 joke-0.044
Tokenarre
Token ID9624
Feature activation-2.07e-01

Pos logit contributions
 walk+0.126
 walks+0.125
 conversation+0.114
 jog+0.111
 chase+0.110
Neg logit contributions
rier-0.232
bar-0.194
rily-0.194
isher-0.193
eddy-0.188
Tokenling
Token ID1359
Feature activation-1.33e-01

Pos logit contributions
 walks+0.284
 walk+0.282
 conversation+0.274
 jog+0.272
 skip+0.259
Neg logit contributions
rier-0.443
rily-0.358
isher-0.351
iest-0.348
bars-0.344
Token until
Token ID1566
Feature activation+3.77e-02

Pos logit contributions
 walks+0.229
 walk+0.228
 conversation+0.223
 jog+0.222
 skip+0.213
Neg logit contributions
rier-0.237
rily-0.182
isher-0.178
iest-0.176
bars-0.173
Token finally
Token ID3443
Feature activation+9.19e-02

Pos logit contributions
rier+0.061
rily+0.045
isher+0.044
iest+0.043
bars+0.042
Neg logit contributions
 walks-0.072
 walk-0.072
 jog-0.070
 conversation-0.070
 skip-0.067
Token Sammy
Token ID44067
Feature activation-2.75e-02

Pos logit contributions
rier+0.155
rily+0.116
isher+0.115
iest+0.111
bars+0.111
Neg logit contributions
 walks-0.171
 walk-0.170
 jog-0.165
 conversation-0.165
 skip-0.159
Token said
Token ID531
Feature activation+9.24e-03

Pos logit contributions
 walks+0.042
 walk+0.041
 conversation+0.041
 jog+0.040
 skip+0.039
Neg logit contributions
rier-0.053
rily-0.042
iest-0.041
isher-0.041
bars-0.040
Token friends
Token ID2460
Feature activation-8.98e-02

Pos logit contributions
 walks+0.129
 walk+0.127
 jog+0.125
 conversation+0.125
 stroll+0.120
Neg logit contributions
rier-0.146
rily-0.112
iest-0.112
isher-0.109
 diamonds-0.109
Token didn
Token ID1422
Feature activation-3.18e-02

Pos logit contributions
 walks+0.153
astics+0.151
 jog+0.148
seek+0.146
 conversation+0.145
Neg logit contributions
rier-0.139
ter-0.117
bar-0.111
bars-0.111
 of-0.110
Token't
Token ID470
Feature activation+5.21e-03

Pos logit contributions
 conversation+0.065
 walk+0.063
 walks+0.063
 stroll+0.061
 jog+0.061
Neg logit contributions
rier-0.043
rily-0.030
bar-0.029
bars-0.029
can-0.029
Token stop
Token ID2245
Feature activation-7.83e-02

Pos logit contributions
rier+0.009
 if+0.008
 though+0.007
 of+0.007
ter+0.007
Neg logit contributions
 walks-0.009
ggies-0.008
astics-0.008
bes-0.008
 lawn-0.008
Token qu
Token ID627
Feature activation-9.37e-02

Pos logit contributions
ggies+0.118
seek+0.116
ggy+0.116
astics+0.115
live+0.110
Neg logit contributions
rier-0.146
ter-0.122
 if-0.113
rily-0.113
 diamonds-0.113
Tokenarre
Token ID9624
Feature activation-2.06e-01

Pos logit contributions
 walk+0.128
 walks+0.125
 jog+0.118
 lookout+0.116
 obey+0.115
Neg logit contributions
rier-0.194
ain-0.176
bar-0.172
ents-0.170
rily-0.164
Tokenling
Token ID1359
Feature activation-1.27e-01

Pos logit contributions
 walks+0.329
 walk+0.326
 jog+0.324
 skip+0.315
 conversation+0.312
Neg logit contributions
rier-0.408
rily-0.323
iest-0.318
isher-0.307
ter-0.305
Token and
Token ID290
Feature activation-8.40e-03

Pos logit contributions
 walks+0.236
 walk+0.236
 conversation+0.232
 jog+0.232
 skip+0.224
Neg logit contributions
rier-0.209
rily-0.154
ter-0.148
iest-0.144
 diamonds-0.139
Token were
Token ID547
Feature activation-2.75e-03

Pos logit contributions
 walks+0.013
 conversation+0.013
 jog+0.013
 walk+0.013
ggies+0.012
Neg logit contributions
rier-0.015
ter-0.012
 spe-0.012
rily-0.012
 diamonds-0.012
Token getting
Token ID1972
Feature activation-2.36e-02

Pos logit contributions
 walk+0.005
 walks+0.005
 jog+0.005
 stroll+0.005
 conversation+0.005
Neg logit contributions
rier-0.004
ter-0.003
 spe-0.003
rily-0.003
 diamonds-0.003
Token louder
Token ID27089
Feature activation-6.64e-02

Pos logit contributions
 walks+0.034
 walk+0.034
 conversation+0.033
 jog+0.033
 joke+0.032
Neg logit contributions
rier-0.048
rily-0.037
ter-0.037
isher-0.037
bars-0.037
Token and
Token ID290
Feature activation-9.45e-03

Pos logit contributions
 walk+0.240
 walks+0.238
 jog+0.227
 conversation+0.224
 stroll+0.217
Neg logit contributions
rier-0.302
isher-0.246
iest-0.239
bars-0.236
rily-0.235
Token saw
Token ID2497
Feature activation+3.37e-02

Pos logit contributions
 walk+0.013
 walks+0.013
 conversation+0.012
 jog+0.012
 stroll+0.012
Neg logit contributions
rier-0.021
iest-0.017
isher-0.017
rily-0.017
bars-0.016
Token two
Token ID734
Feature activation-8.04e-02

Pos logit contributions
rier+0.070
rily+0.057
isher+0.056
bars+0.055
can+0.053
Neg logit contributions
 walks-0.050
 walk-0.047
 conversation-0.047
 jog-0.045
 joke-0.045
Token birds
Token ID10087
Feature activation-8.46e-02

Pos logit contributions
 walk+0.128
 walks+0.128
 jog+0.126
 conversation+0.125
 skip+0.120
Neg logit contributions
rier-0.152
rily-0.117
iest-0.115
 spe-0.115
 diamonds-0.114
Token qu
Token ID627
Feature activation-9.05e-02

Pos logit contributions
 walks+0.126
 conversation+0.125
 jog+0.124
 learn+0.121
 walk+0.121
Neg logit contributions
rier-0.160
ter-0.133
rily-0.128
 diamonds-0.127
 spe-0.126
Tokenarre
Token ID9624
Feature activation-2.06e-01

Pos logit contributions
 walk+0.167
 jog+0.165
 walks+0.158
 learn+0.152
 obey+0.151
Neg logit contributions
rier-0.155
ain-0.134
bar-0.132
rily-0.131
rol-0.128
Tokenling
Token ID1359
Feature activation-1.29e-01

Pos logit contributions
 walks+0.322
 jog+0.320
 walk+0.320
 conversation+0.308
 skip+0.308
Neg logit contributions
rier-0.412
rily-0.331
iest-0.323
ter-0.310
isher-0.310
Token!
Token ID0
Feature activation+1.75e-03

Pos logit contributions
 jog+0.233
 walk+0.233
 conversation+0.230
 walks+0.229
 learn+0.227
Neg logit contributions
rier-0.216
rily-0.166
ter-0.158
 diamonds-0.152
 spe-0.151
Token They
Token ID1119
Feature activation-1.13e-01

Pos logit contributions
rier+0.003
rily+0.003
isher+0.003
bars+0.003
iest+0.003
Neg logit contributions
 walks-0.003
 walk-0.003
 conversation-0.003
 jog-0.003
 skip-0.003
Token were
Token ID547
Feature activation+2.62e-04

Pos logit contributions
 walks+0.190
 walk+0.182
 jog+0.182
 conversation+0.181
ggies+0.176
Neg logit contributions
rier-0.193
ter-0.154
rily-0.152
 diamonds-0.152
 spe-0.151
Token squ
Token ID2809
Feature activation+7.77e-04

Pos logit contributions
rier+0.000
rily+0.000
ter+0.000
 spe+0.000
 diamonds+0.000
Neg logit contributions
 walk-0.000
 walks-0.000
 jog-0.000
 conversation-0.000
 skip-0.000
Token walking
Token ID6155
Feature activation-5.88e-02

Pos logit contributions
 walk+0.030
 jog+0.029
 walks+0.029
 conversation+0.029
 stroll+0.028
Neg logit contributions
rier-0.042
ter-0.033
 spe-0.032
rily-0.032
iest-0.031
Token by
Token ID416
Feature activation-9.35e-02

Pos logit contributions
 walks+0.099
 walk+0.098
 conversation+0.096
 jog+0.095
 skip+0.092
Neg logit contributions
rier-0.108
rily-0.084
isher-0.082
iest-0.081
bars-0.080
Token saw
Token ID2497
Feature activation+3.26e-02

Pos logit contributions
 jog+0.187
 walk+0.184
 walks+0.181
 stroll+0.180
 conversation+0.179
Neg logit contributions
rier-0.118
ter-0.098
rily-0.089
 if-0.087
iest-0.084
Token them
Token ID606
Feature activation-1.36e-02

Pos logit contributions
rier+0.064
rily+0.050
iest+0.048
bars+0.048
isher+0.048
Neg logit contributions
 walks-0.055
 walk-0.052
 conversation-0.051
 jog-0.050
 quarrel-0.048
Token qu
Token ID627
Feature activation-9.54e-02

Pos logit contributions
astics+0.028
 walks+0.028
ggies+0.027
 stroll+0.027
 conversation+0.027
Neg logit contributions
rier-0.016
 of-0.013
ter-0.012
 if-0.012
bars-0.011
Tokenarre
Token ID9624
Feature activation-2.05e-01

Pos logit contributions
 walk+0.137
 jog+0.128
 walks+0.128
 skip+0.121
 chase+0.120
Neg logit contributions
rier-0.203
bar-0.175
ain-0.175
isher-0.173
eddy-0.172
Tokenling
Token ID1359
Feature activation-1.27e-01

Pos logit contributions
 walks+0.328
 walk+0.326
 jog+0.321
 conversation+0.315
 skip+0.313
Neg logit contributions
rier-0.406
rily-0.320
iest-0.317
isher-0.307
ter-0.304
Token.
Token ID13
Feature activation-3.31e-03

Pos logit contributions
 walk+0.238
 jog+0.235
 walks+0.234
 conversation+0.234
 stroll+0.229
Neg logit contributions
rier-0.200
rily-0.150
ter-0.139
iest-0.136
bul-0.133
Token She
Token ID1375
Feature activation-2.11e-02

Pos logit contributions
 walk+0.005
 walks+0.005
 jog+0.005
 conversation+0.005
 joke+0.005
Neg logit contributions
rier-0.006
rily-0.005
bars-0.005
 spe-0.004
bul-0.004
Token was
Token ID373
Feature activation-2.18e-02

Pos logit contributions
 walks+0.031
 walk+0.030
 conversation+0.030
 jog+0.029
 quarrel+0.028
Neg logit contributions
rier-0.044
rily-0.035
isher-0.034
iest-0.034
bars-0.034
Token very
Token ID845
Feature activation-1.76e-02

Pos logit contributions
 walk+0.034
 jog+0.033
 conversation+0.032
 walks+0.032
 stroll+0.032
Neg logit contributions
rier-0.042
ter-0.033
 spe-0.032
rily-0.032
bul-0.031
Token but
Token ID475
Feature activation+3.59e-02

Pos logit contributions
 walks+0.036
 walk+0.036
 conversation+0.035
 jog+0.035
 skip+0.034
Neg logit contributions
rier-0.035
isher-0.027
rily-0.026
iest-0.025
bars-0.025
Token today
Token ID1909
Feature activation-3.39e-02

Pos logit contributions
rier+0.063
rily+0.048
isher+0.047
iest+0.046
bars+0.045
Neg logit contributions
 walk-0.065
 walks-0.064
 conversation-0.063
 jog-0.063
 quarrel-0.061
Token they
Token ID484
Feature activation-1.79e-02

Pos logit contributions
 walk+0.061
 walks+0.061
 conversation+0.060
 jog+0.059
 quarrel+0.057
Neg logit contributions
rier-0.060
rily-0.046
isher-0.045
bars-0.044
 spe-0.043
Token were
Token ID547
Feature activation+9.05e-04

Pos logit contributions
 walks+0.028
 walk+0.028
 conversation+0.027
 jog+0.027
 skip+0.026
Neg logit contributions
rier-0.035
rily-0.027
isher-0.027
iest-0.027
bars-0.026
Token qu
Token ID627
Feature activation-8.96e-02

Pos logit contributions
rier+0.002
rily+0.001
isher+0.001
iest+0.001
bars+0.001
Neg logit contributions
 walks-0.001
 walk-0.001
 conversation-0.001
 jog-0.001
 skip-0.001
Tokenarre
Token ID9624
Feature activation-2.05e-01

Pos logit contributions
 walks+0.098
 walk+0.094
 jog+0.087
 chase+0.083
 conversation+0.082
Neg logit contributions
rier-0.226
rily-0.193
bar-0.190
isher-0.189
eddy-0.188
Tokenling
Token ID1359
Feature activation-1.30e-01

Pos logit contributions
 walks+0.288
 walk+0.286
 conversation+0.279
 jog+0.277
 skip+0.263
Neg logit contributions
rier-0.429
rily-0.346
isher-0.339
iest-0.335
bars-0.332
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
 walks+0.221
 walk+0.220
 conversation+0.215
 jog+0.215
 skip+0.206
Neg logit contributions
rier-0.232
rily-0.180
iest-0.174
isher-0.173
ter-0.170
Token Jenna
Token ID45377
Feature activation+8.90e-02

Pos logit contributions
 walks+0.017
 walk+0.017
 conversation+0.017
 jog+0.017
 learn+0.016
Neg logit contributions
rier-0.027
rily-0.022
isher-0.021
iest-0.021
 spe-0.021
Token wanted
Token ID2227
Feature activation-5.13e-03

Pos logit contributions
rier+0.162
rily+0.127
isher+0.123
iest+0.122
bars+0.121
Neg logit contributions
 walks-0.146
 walk-0.145
 conversation-0.142
 jog-0.142
 skip-0.136
Token to
Token ID284
Feature activation-1.82e-02

Pos logit contributions
 walks+0.010
 walk+0.010
 jog+0.010
 conversation+0.010
 skip+0.009
Neg logit contributions
rier-0.008
rily-0.006
iest-0.006
isher-0.006
 diamonds-0.006
Token mum
Token ID25682
Feature activation-1.43e-02

Pos logit contributions
|+0.102
 stroll+0.101
iver+0.100
acks+0.099
ither+0.098
Neg logit contributions
rier-0.039
eddy-0.019
keeper-0.018
 captain-0.017
 of-0.017
Token and
Token ID290
Feature activation-5.03e-03

Pos logit contributions
 jog+0.026
 walks+0.025
 conversation+0.025
ggies+0.025
 skip+0.025
Neg logit contributions
rier-0.022
rily-0.017
bars-0.016
ener-0.016
ter-0.016
Token understood
Token ID7247
Feature activation-2.74e-02

Pos logit contributions
 walks+0.008
 walk+0.008
 conversation+0.008
 jog+0.008
 stroll+0.008
Neg logit contributions
rier-0.009
rily-0.007
ter-0.007
isher-0.007
 diamonds-0.007
Token that
Token ID326
Feature activation-6.27e-03

Pos logit contributions
 walks+0.052
 walk+0.052
 conversation+0.051
 jog+0.051
 stroll+0.050
Neg logit contributions
rier-0.043
rily-0.032
ter-0.030
iest-0.030
bul-0.030
Token qu
Token ID627
Feature activation-9.08e-02

Pos logit contributions
 walks+0.012
 walk+0.012
 conversation+0.012
 jog+0.012
 stroll+0.012
Neg logit contributions
rier-0.009
rily-0.007
iest-0.007
isher-0.007
 diamonds-0.007
Tokenarre
Token ID9624
Feature activation-2.04e-01

Pos logit contributions
 walk+0.104
 walks+0.102
 jog+0.098
 skip+0.089
 conversation+0.087
Neg logit contributions
rier-0.221
isher-0.192
bar-0.184
rily-0.184
iest-0.184
Tokenlling
Token ID2680
Feature activation-9.90e-02

Pos logit contributions
 walks+0.316
 walk+0.313
 jog+0.305
 conversation+0.304
 skip+0.292
Neg logit contributions
rier-0.403
rily-0.319
isher-0.313
iest-0.313
bars-0.305
Token was
Token ID373
Feature activation-2.56e-02

Pos logit contributions
 walks+0.167
 walk+0.166
 conversation+0.164
 jog+0.162
 stroll+0.156
Neg logit contributions
rier-0.178
rily-0.137
iest-0.135
isher-0.133
bars-0.132
Token not
Token ID407
Feature activation-3.98e-02

Pos logit contributions
 walks+0.043
 walk+0.043
 conversation+0.042
 jog+0.042
 stroll+0.040
Neg logit contributions
rier-0.046
rily-0.035
isher-0.035
iest-0.034
ter-0.034
Token a
Token ID257
Feature activation-5.14e-03

Pos logit contributions
 walks+0.066
 walk+0.066
 conversation+0.064
 jog+0.064
 quarrel+0.062
Neg logit contributions
rier-0.073
rily-0.058
isher-0.057
bars-0.056
iest-0.055
Token good
Token ID922
Feature activation-4.13e-02

Pos logit contributions
 walks+0.008
 walk+0.007
 stroll+0.007
 jog+0.007
 conversation+0.007
Neg logit contributions
rier-0.011
 spe-0.008
bul-0.008
iest-0.008
 diamonds-0.008
Token on
Token ID319
Feature activation-7.21e-02

Pos logit contributions
 walks+0.041
 walk+0.041
 jog+0.040
 conversation+0.040
 skip+0.038
Neg logit contributions
rier-0.047
rily-0.037
isher-0.036
bars-0.035
 spe-0.035
Token,
Token ID11
Feature activation-1.64e-02

Pos logit contributions
 walk+0.128
 walks+0.128
 conversation+0.126
 jog+0.125
 joke+0.121
Neg logit contributions
rier-0.120
iest-0.092
rily-0.091
isher-0.088
 spe-0.088
Token they
Token ID484
Feature activation-1.94e-02

Pos logit contributions
 walks+0.029
 walk+0.029
 jog+0.028
 conversation+0.028
 skip+0.027
Neg logit contributions
rier-0.029
rily-0.022
isher-0.022
iest-0.021
bars-0.021
Token never
Token ID1239
Feature activation+5.71e-02

Pos logit contributions
 walks+0.028
 walk+0.028
 conversation+0.027
 jog+0.027
 skip+0.026
Neg logit contributions
rier-0.040
rily-0.032
isher-0.032
iest-0.031
bars-0.031
Token qu
Token ID627
Feature activation-8.95e-02

Pos logit contributions
rier+0.098
 spe+0.088
ter+0.086
bul+0.084
bars+0.083
Neg logit contributions
ggies-0.086
 walks-0.082
ggy-0.081
 lawn-0.080
 conversation-0.079
Tokenarre
Token ID9624
Feature activation-2.04e-01

Pos logit contributions
 walk+0.102
 walks+0.098
 jog+0.095
 lookout+0.089
 conversation+0.087
Neg logit contributions
rier-0.208
isher-0.183
ain-0.182
eddy-0.182
bar-0.180
Tokenled
Token ID992
Feature activation-4.53e-04

Pos logit contributions
 walks+0.350
 walk+0.348
 jog+0.339
 conversation+0.339
 skip+0.326
Neg logit contributions
rier-0.366
rily-0.282
isher-0.276
iest-0.274
bars-0.268
Token again
Token ID757
Feature activation-4.97e-02

Pos logit contributions
 walks+0.001
 walk+0.001
 jog+0.001
 conversation+0.001
 skip+0.001
Neg logit contributions
rier-0.001
bars-0.001
isher-0.001
rily-0.001
bar-0.001
Token over
Token ID625
Feature activation-1.53e-01

Pos logit contributions
 walks+0.089
 walk+0.088
 conversation+0.087
 jog+0.087
 stroll+0.085
Neg logit contributions
rier-0.080
rily-0.060
 spe-0.059
iest-0.059
ter-0.059
Token clothes
Token ID8242
Feature activation+3.31e-02

Pos logit contributions
 walks+0.270
 walk+0.269
 conversation+0.263
 jog+0.262
 skip+0.251
Neg logit contributions
rier-0.268
rily-0.206
isher-0.200
iest-0.198
bars-0.196
Token.
Token ID13
Feature activation-8.17e-03

Pos logit contributions
rier+0.061
isher+0.050
 spe+0.049
rily+0.049
iest+0.048
Neg logit contributions
 walk-0.054
 walks-0.052
 jog-0.052
 conversation-0.051
 quarrel-0.049
Token together
Token ID1978
Feature activation-6.04e-02

Pos logit contributions
 walk+0.277
 walks+0.273
 conversation+0.269
 jog+0.268
 stroll+0.267
Neg logit contributions
rier-0.231
iest-0.170
ter-0.169
rily-0.165
isher-0.164
Token.
Token ID13
Feature activation-9.26e-03

Pos logit contributions
 walks+0.105
 walk+0.104
 conversation+0.102
 jog+0.102
 skip+0.098
Neg logit contributions
rier-0.105
rily-0.080
iest-0.078
isher-0.077
ter-0.076
Token They
Token ID1119
Feature activation-1.12e-01

Pos logit contributions
 walk+0.015
 walks+0.015
 jog+0.015
 skip+0.014
 conversation+0.014
Neg logit contributions
rier-0.016
rily-0.013
bar-0.013
bars-0.013
isher-0.012
Token never
Token ID1239
Feature activation+5.41e-02

Pos logit contributions
 walks+0.163
 conversation+0.157
ggies+0.155
 jog+0.153
 walk+0.152
Neg logit contributions
rier-0.192
bar-0.170
 spe-0.169
ter-0.167
isher-0.165
Token qu
Token ID627
Feature activation-9.36e-02

Pos logit contributions
rier+0.078
 spe+0.072
bar+0.071
ter+0.071
ener+0.069
Neg logit contributions
ggies-0.088
bes-0.085
ggy-0.084
 lawn-0.084
 walks-0.083
Tokenarre
Token ID9624
Feature activation-2.03e-01

Pos logit contributions
 walk+0.125
 jog+0.114
 walks+0.113
 lookout+0.104
 conversation+0.103
Neg logit contributions
rier-0.199
bar-0.179
isher-0.178
ain-0.178
rol-0.175
Tokenled
Token ID992
Feature activation-6.14e-04

Pos logit contributions
 walk+0.350
 walks+0.350
 jog+0.345
 conversation+0.337
 skip+0.332
Neg logit contributions
rier-0.365
rily-0.283
iest-0.281
isher-0.278
ter-0.267
Token again
Token ID757
Feature activation-5.26e-02

Pos logit contributions
 walks+0.001
 walk+0.001
 jog+0.001
 conversation+0.001
 skip+0.001
Neg logit contributions
rier-0.001
rily-0.001
isher-0.001
bars-0.001
iest-0.001
Token.
Token ID13
Feature activation-5.08e-03

Pos logit contributions
 walks+0.097
 walk+0.095
 jog+0.094
 conversation+0.094
 stroll+0.092
Neg logit contributions
rier-0.080
iest-0.060
rily-0.059
 spe-0.059
ter-0.059
Token<|endoftext|>
Token ID50256
Feature activation-2.01e-02

Pos logit contributions
 walk+0.009
 walks+0.008
 jog+0.008
 skip+0.008
 conversation+0.008
Neg logit contributions
rier-0.008
rily-0.007
bar-0.007
bars-0.007
isher-0.007
TokenOnce
Token ID7454
Feature activation+5.34e-02

Pos logit contributions
 walk+0.033
 walks+0.033
 conversation+0.032
 jog+0.032
 quarrel+0.031
Neg logit contributions
rier-0.036
rily-0.028
iest-0.028
ter-0.027
isher-0.027
Token and
Token ID290
Feature activation-2.60e-03

Pos logit contributions
 walks+0.069
 conversation+0.067
 jog+0.066
 walk+0.066
 joke+0.064
Neg logit contributions
rier-0.087
ter-0.071
bars-0.070
rily-0.070
bar-0.068
Token their
Token ID511
Feature activation-4.61e-02

Pos logit contributions
 walks+0.004
 walk+0.004
 jog+0.004
 conversation+0.004
 skip+0.004
Neg logit contributions
rier-0.005
rily-0.004
 diamonds-0.004
iest-0.004
isher-0.004
Token children
Token ID1751
Feature activation-6.65e-02

Pos logit contributions
 jog+0.071
 walks+0.070
 skip+0.070
 walk+0.070
astics+0.068
Neg logit contributions
rier-0.084
isher-0.064
 spe-0.063
 diamonds-0.063
ener-0.062
Token,
Token ID11
Feature activation-2.16e-02

Pos logit contributions
 walks+0.115
 jog+0.113
 conversation+0.113
 joke+0.111
astics+0.111
Neg logit contributions
rier-0.096
 if-0.079
ter-0.077
bars-0.077
 of-0.076
Token qu
Token ID627
Feature activation-8.96e-02

Pos logit contributions
 walks+0.037
 walk+0.037
 jog+0.036
 conversation+0.036
 stroll+0.035
Neg logit contributions
rier-0.035
ter-0.027
 spe-0.027
rily-0.027
 diamonds-0.027
Tokenarre
Token ID9624
Feature activation-2.03e-01

Pos logit contributions
 walk+0.151
 walks+0.145
 jog+0.144
 skip+0.134
 chase+0.134
Neg logit contributions
rier-0.163
ain-0.147
rily-0.140
eddy-0.139
bar-0.138
Tokenling
Token ID1359
Feature activation-1.29e-01

Pos logit contributions
 walks+0.323
 walk+0.318
 jog+0.315
 conversation+0.305
 skip+0.304
Neg logit contributions
rier-0.405
rily-0.324
iest-0.320
isher-0.308
ter-0.304
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
 walk+0.234
 jog+0.232
 walks+0.232
 skip+0.225
 conversation+0.224
Neg logit contributions
rier-0.211
rily-0.162
ter-0.153
iest-0.144
isher-0.140
Token␣
Token ID220
Feature activation+2.18e-02

Pos logit contributions
 walks+0.016
 walk+0.016
 conversation+0.016
 jog+0.015
 stroll+0.015
Neg logit contributions
rier-0.031
iest-0.026
isher-0.026
rily-0.026
 spe-0.025
Token\n
Token ID198
Feature activation-1.27e-02

Pos logit contributions
rier+0.048
isher+0.039
rily+0.039
iest+0.038
 spe+0.038
Neg logit contributions
 walks-0.028
 walk-0.028
 conversation-0.027
 jog-0.027
 learn-0.026
Token\n
Token ID198
Feature activation-1.31e-02

Pos logit contributions
 walks+0.018
 walk+0.018
 conversation+0.018
 jog+0.018
 skip+0.017
Neg logit contributions
rier-0.026
rily-0.021
isher-0.020
iest-0.020
bars-0.020
Token he
Token ID339
Feature activation+7.64e-02

Pos logit contributions
 walk+0.045
 jog+0.044
 walks+0.043
 stroll+0.042
 skip+0.041
Neg logit contributions
rier-0.038
isher-0.031
eddy-0.029
rily-0.028
bars-0.028
Token found
Token ID1043
Feature activation+8.40e-02

Pos logit contributions
rier+0.172
rily+0.136
isher+0.134
iest+0.133
bars+0.132
Neg logit contributions
 walk-0.108
 walks-0.107
 jog-0.103
 conversation-0.102
 skip-0.097
Token a
Token ID257
Feature activation-8.87e-03

Pos logit contributions
rier+0.219
bars+0.192
umm+0.189
rily+0.188
isher+0.184
Neg logit contributions
 walks-0.085
ggy-0.069
 chase-0.066
 walk-0.065
 learn-0.065
Token rabbit
Token ID22746
Feature activation-3.65e-02

Pos logit contributions
 walks+0.015
 walk+0.014
 jog+0.014
 conversation+0.014
 learn+0.013
Neg logit contributions
rier-0.016
 spe-0.013
rily-0.013
 diamonds-0.012
iest-0.012
Token qu
Token ID627
Feature activation-8.82e-02

Pos logit contributions
 conversation+0.057
 walks+0.057
 walk+0.056
 jog+0.055
 quarrel+0.054
Neg logit contributions
rier-0.066
ter-0.053
rily-0.053
bar-0.051
 spe-0.051
Tokenarre
Token ID9624
Feature activation-2.03e-01

Pos logit contributions
 walks+0.143
 walk+0.141
 jog+0.134
 fight+0.129
 chase+0.129
Neg logit contributions
rier-0.164
isher-0.141
rily-0.140
bar-0.139
eddy-0.139
Tokenlling
Token ID2680
Feature activation-9.94e-02

Pos logit contributions
 walk+0.258
 walks+0.258
 conversation+0.250
 jog+0.246
 joke+0.234
Neg logit contributions
rier-0.449
rily-0.364
isher-0.360
bars-0.357
 spe-0.356
Token with
Token ID351
Feature activation-3.02e-02

Pos logit contributions
 walks+0.176
 walk+0.175
 conversation+0.172
 jog+0.171
 skip+0.164
Neg logit contributions
rier-0.172
rily-0.132
isher-0.128
iest-0.126
bars-0.124
Token a
Token ID257
Feature activation-8.38e-03

Pos logit contributions
 walks+0.049
 walk+0.048
 conversation+0.047
 jog+0.047
 skip+0.045
Neg logit contributions
rier-0.058
rily-0.045
isher-0.044
iest-0.044
bars-0.043
Token squirrel
Token ID33039
Feature activation-9.48e-02

Pos logit contributions
 walks+0.014
 walk+0.013
 jog+0.013
 conversation+0.013
ggies+0.012
Neg logit contributions
rier-0.015
 spe-0.012
rily-0.012
bul-0.012
bar-0.012
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
 walks+0.171
 conversation+0.171
 jog+0.168
 walk+0.167
 joke+0.162
Neg logit contributions
rier-0.149
bar-0.119
rily-0.116
ter-0.116
bars-0.113
Token.
Token ID13
Feature activation-9.40e-03

Pos logit contributions
rier+0.007
rily+0.006
isher+0.006
 spe+0.006
bars+0.006
Neg logit contributions
 walk-0.005
 walks-0.005
 jog-0.005
 conversation-0.005
 skip-0.005
Token The
Token ID383
Feature activation-5.48e-03

Pos logit contributions
 walks+0.014
 conversation+0.013
 walk+0.013
 jog+0.013
 stroll+0.013
Neg logit contributions
rier-0.020
iest-0.016
isher-0.016
rily-0.015
 spe-0.015
Token bears
Token ID13062
Feature activation-3.94e-02

Pos logit contributions
 walks+0.011
 walk+0.011
 conversation+0.011
 quarrel+0.011
 fight+0.010
Neg logit contributions
rier-0.009
rily-0.007
isher-0.007
bars-0.007
bul-0.007
Token stopped
Token ID5025
Feature activation-4.70e-02

Pos logit contributions
 walks+0.068
 jog+0.065
 conversation+0.065
astics+0.064
ggies+0.064
Neg logit contributions
rier-0.064
rily-0.052
 spe-0.051
ter-0.051
 diamonds-0.049
Token qu
Token ID627
Feature activation-8.74e-02

Pos logit contributions
 walks+0.070
ggies+0.069
 conversation+0.069
 walk+0.068
 jog+0.068
Neg logit contributions
rier-0.089
ter-0.073
 spe-0.070
bul-0.070
rily-0.070
Tokenarre
Token ID9624
Feature activation-2.02e-01

Pos logit contributions
 walks+0.145
 walk+0.145
 jog+0.141
 lookout+0.132
 obey+0.132
Neg logit contributions
rier-0.165
rily-0.145
ain-0.143
bar-0.141
eddy-0.139
Tokenling
Token ID1359
Feature activation-1.27e-01

Pos logit contributions
 walks+0.300
 walk+0.296
 jog+0.292
 conversation+0.288
 skip+0.281
Neg logit contributions
rier-0.418
rily-0.337
iest-0.325
isher-0.321
ter-0.317
Token and
Token ID290
Feature activation-9.67e-03

Pos logit contributions
 walks+0.230
 walk+0.229
 jog+0.226
 conversation+0.226
 skip+0.219
Neg logit contributions
rier-0.213
rily-0.162
ter-0.151
iest-0.150
isher-0.150
Token asked
Token ID1965
Feature activation+5.24e-02

Pos logit contributions
 walks+0.014
 walk+0.014
 conversation+0.014
 jog+0.014
ggies+0.013
Neg logit contributions
rier-0.019
rily-0.015
 spe-0.015
ter-0.015
 diamonds-0.015
Token him
Token ID683
Feature activation+2.06e-02

Pos logit contributions
rier+0.091
isher+0.071
bars+0.070
rily+0.070
bul+0.070
Neg logit contributions
 walks-0.095
 walk-0.092
 conversation-0.090
 jog-0.089
 learn-0.087
Token for
Token ID329
Feature activation+8.96e-02

Pos logit contributions
rier+0.032
rily+0.023
isher+0.022
iest+0.022
bars+0.022
Neg logit contributions
 walks-0.041
 walk-0.040
 conversation-0.040
 jog-0.039
 stroll-0.038
Token Finally
Token ID9461
Feature activation+6.95e-02

Pos logit contributions
 walks+0.019
 walk+0.019
 conversation+0.018
 jog+0.018
 skip+0.017
Neg logit contributions
rier-0.027
rily-0.022
isher-0.021
iest-0.021
bars-0.021
Token,
Token ID11
Feature activation-2.40e-02

Pos logit contributions
rier+0.132
rily+0.103
isher+0.101
bars+0.100
iest+0.099
Neg logit contributions
 walks-0.114
 walk-0.113
 conversation-0.110
 jog-0.109
 skip-0.105
Token after
Token ID706
Feature activation-1.08e-02

Pos logit contributions
 walk+0.045
 walks+0.045
 conversation+0.044
 jog+0.044
 skip+0.042
Neg logit contributions
rier-0.039
rily-0.030
isher-0.029
iest-0.028
bars-0.028
Token much
Token ID881
Feature activation+5.76e-02

Pos logit contributions
 jog+0.021
 walk+0.020
 conversation+0.020
 skip+0.020
 walks+0.020
Neg logit contributions
rier-0.015
 diamonds-0.012
ter-0.012
iest-0.011
 spe-0.011
Token qu
Token ID627
Feature activation-9.39e-02

Pos logit contributions
rier+0.087
 of+0.075
 diamonds+0.072
 spe+0.070
iest+0.070
Neg logit contributions
 jog-0.095
|-0.092
astics-0.092
 lawn-0.091
 skip-0.091
Tokenarre
Token ID9624
Feature activation-2.01e-01

Pos logit contributions
 walk+0.155
 jog+0.148
 walks+0.147
 obey+0.137
 chase+0.134
Neg logit contributions
rier-0.172
ain-0.150
bar-0.144
rily-0.141
ic-0.141
Tokenling
Token ID1359
Feature activation-1.26e-01

Pos logit contributions
 walks+0.318
 walk+0.316
 jog+0.312
 conversation+0.302
 skip+0.302
Neg logit contributions
rier-0.397
iest-0.318
rily-0.314
isher-0.303
ter-0.299
Token,
Token ID11
Feature activation-2.27e-02

Pos logit contributions
 walks+0.222
 walk+0.222
 jog+0.218
 conversation+0.217
 skip+0.209
Neg logit contributions
rier-0.217
rily-0.165
iest-0.159
ter-0.158
isher-0.158
Token one
Token ID530
Feature activation+5.69e-02

Pos logit contributions
 walks+0.042
 walk+0.042
 jog+0.041
 conversation+0.041
 skip+0.040
Neg logit contributions
rier-0.037
rily-0.028
isher-0.028
iest-0.027
bars-0.027
Token of
Token ID286
Feature activation+1.30e-01

Pos logit contributions
rier+0.082
rily+0.058
isher+0.056
bars+0.054
iest+0.054
Neg logit contributions
 walk-0.120
 walks-0.120
 conversation-0.118
 jog-0.117
 joke-0.114
Token the
Token ID262
Feature activation+7.28e-03

Pos logit contributions
rier+0.235
rily+0.179
bars+0.173
iest+0.173
isher+0.169
Neg logit contributions
 walks-0.232
 walk-0.226
 conversation-0.221
 jog-0.220
 quarrel-0.214
Token said
Token ID531
Feature activation+2.24e-02

Pos logit contributions
 walks+0.038
 conversation+0.037
 walk+0.037
 jog+0.035
 quarrel+0.035
Neg logit contributions
rier-0.055
rily-0.043
iest-0.043
isher-0.042
bars-0.041
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
rier+0.043
rily+0.034
isher+0.034
iest+0.033
bars+0.033
Neg logit contributions
 walks-0.035
 walk-0.035
 conversation-0.034
 jog-0.034
 quarrel-0.033
Token "
Token ID366
Feature activation+5.40e-02

Pos logit contributions
 walks+0.028
 walk+0.028
 conversation+0.027
 jog+0.027
 quarrel+0.026
Neg logit contributions
rier-0.021
isher-0.016
rily-0.016
iest-0.015
bars-0.015
TokenStop
Token ID19485
Feature activation-1.30e-01

Pos logit contributions
rier+0.125
rily+0.104
 spe+0.100
iest+0.099
isher+0.098
Neg logit contributions
 walk-0.066
 walks-0.066
 conversation-0.065
 jog-0.064
 quarrel-0.061
Token qu
Token ID627
Feature activation-8.90e-02

Pos logit contributions
 walks+0.173
 walk+0.171
 conversation+0.167
 jog+0.166
 stroll+0.158
Neg logit contributions
rier-0.280
rily-0.224
ter-0.223
isher-0.222
iest-0.219
Tokenarre
Token ID9624
Feature activation-2.01e-01

Pos logit contributions
 walk+0.107
 walks+0.107
 jog+0.095
 stroll+0.093
 lookout+0.087
Neg logit contributions
rier-0.199
bar-0.180
isher-0.176
ain-0.175
ac-0.173
Tokenling
Token ID1359
Feature activation-1.23e-01

Pos logit contributions
 walks+0.309
 walk+0.306
 jog+0.298
 conversation+0.293
 stroll+0.288
Neg logit contributions
rier-0.406
iest-0.321
rily-0.321
isher-0.319
ter-0.311
Token,
Token ID11
Feature activation-1.27e-02

Pos logit contributions
 walks+0.213
 walk+0.212
 conversation+0.208
 jog+0.207
 skip+0.199
Neg logit contributions
rier-0.218
rily-0.168
isher-0.164
iest-0.161
bars-0.160
Token boys
Token ID6510
Feature activation-7.45e-02

Pos logit contributions
 walk+0.021
 walks+0.020
 jog+0.020
 quarrel+0.020
 fight+0.020
Neg logit contributions
rier-0.023
rily-0.019
iest-0.018
bul-0.018
 spe-0.018
Token.
Token ID13
Feature activation-2.12e-03

Pos logit contributions
 walks+0.125
 walk+0.125
 conversation+0.122
 jog+0.121
 skip+0.117
Neg logit contributions
rier-0.136
rily-0.105
isher-0.103
iest-0.101
bars-0.100
Token You
Token ID921
Feature activation-3.57e-02

Pos logit contributions
 walks+0.004
 walk+0.004
 conversation+0.004
 jog+0.004
 skip+0.004
Neg logit contributions
rier-0.003
rily-0.002
isher-0.002
iest-0.002
bars-0.002
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
 walk+0.101
 jog+0.100
 walks+0.100
 conversation+0.099
 joke+0.097
Neg logit contributions
rier-0.093
isher-0.074
bars-0.073
rily-0.073
bar-0.069
Token why
Token ID1521
Feature activation+1.09e-03

Pos logit contributions
 walk+0.026
 jog+0.026
 joke+0.025
 walks+0.025
 conversation+0.025
Neg logit contributions
rier-0.031
isher-0.026
rily-0.026
bars-0.025
iest-0.025
Token are
Token ID389
Feature activation-2.80e-02

Pos logit contributions
rier+0.002
rily+0.002
iest+0.002
isher+0.002
bars+0.002
Neg logit contributions
 walks-0.002
 walk-0.002
 conversation-0.002
 jog-0.002
 stroll-0.002
Token you
Token ID345
Feature activation-6.76e-03

Pos logit contributions
 walks+0.052
 walk+0.051
 jog+0.050
 conversation+0.048
 quarrel+0.048
Neg logit contributions
rier-0.048
rily-0.039
isher-0.037
iest-0.036
bul-0.035
Token qu
Token ID627
Feature activation-8.75e-02

Pos logit contributions
 walks+0.010
 walk+0.010
 conversation+0.009
 jog+0.009
 skip+0.009
Neg logit contributions
rier-0.014
rily-0.011
isher-0.011
ter-0.011
iest-0.011
Tokenarre
Token ID9624
Feature activation-2.01e-01

Pos logit contributions
 walks+0.080
 walk+0.078
 jog+0.071
 conversation+0.064
 chase+0.063
Neg logit contributions
rier-0.233
rily-0.198
bar-0.197
isher-0.195
ter-0.194
Tokenling
Token ID1359
Feature activation-1.25e-01

Pos logit contributions
 walks+0.262
 walk+0.261
 conversation+0.255
 jog+0.252
 quarrel+0.241
Neg logit contributions
rier-0.436
rily-0.354
isher-0.348
 spe-0.347
bars-0.344
Token?"
Token ID1701
Feature activation+6.59e-02

Pos logit contributions
 jog+0.231
 walk+0.229
 conversation+0.227
 learn+0.224
 joke+0.224
Neg logit contributions
rier-0.201
rily-0.155
isher-0.155
 spe-0.154
bars-0.152
Token But
Token ID887
Feature activation+3.29e-03

Pos logit contributions
rier+0.118
rily+0.093
ter+0.091
bar+0.091
bul+0.089
Neg logit contributions
 walk-0.105
 walks-0.104
 jog-0.102
 conversation-0.101
 skip-0.099
Token the
Token ID262
Feature activation+7.09e-03

Pos logit contributions
rier+0.005
isher+0.004
rily+0.004
iest+0.004
bars+0.004
Neg logit contributions
 walk-0.006
 walks-0.006
 jog-0.006
 conversation-0.006
 quarrel-0.006
Token birds
Token ID10087
Feature activation-7.85e-02

Pos logit contributions
rier+0.013
rily+0.009
ter+0.009
iest+0.009
 spe+0.009
Neg logit contributions
 walks-0.012
 walk-0.012
 jog-0.012
 conversation-0.012
 skip-0.012
Token Lily
Token ID20037
Feature activation-4.71e-02

Pos logit contributions
 walk+0.030
 walks+0.030
 conversation+0.030
 jog+0.029
 quarrel+0.028
Neg logit contributions
rier-0.031
rily-0.024
isher-0.024
bars-0.023
iest-0.023
Token and
Token ID290
Feature activation-1.52e-03

Pos logit contributions
 walks+0.080
 walk+0.080
 conversation+0.078
 jog+0.077
 stroll+0.075
Neg logit contributions
rier-0.083
rily-0.064
iest-0.063
isher-0.063
ter-0.062
Token Jack
Token ID3619
Feature activation-2.60e-02

Pos logit contributions
ggies+0.003
ggy+0.003
 lookout+0.003
astics+0.003
seek+0.003
Neg logit contributions
rier-0.001
ter-0.001
 diamonds-0.001
 if-0.001
 cuc-0.001
Token never
Token ID1239
Feature activation+5.32e-02

Pos logit contributions
 walks+0.043
 conversation+0.042
 stroll+0.042
 walk+0.041
ggies+0.041
Neg logit contributions
rier-0.040
 spe-0.035
ter-0.034
bul-0.034
iest-0.033
Token qu
Token ID627
Feature activation-9.22e-02

Pos logit contributions
rier+0.082
 spe+0.080
ter+0.078
bul+0.076
bar+0.074
Neg logit contributions
ggies-0.083
bes-0.079
 walks-0.079
ggy-0.079
 lawn-0.076
Tokenarre
Token ID9624
Feature activation-2.00e-01

Pos logit contributions
 walk+0.114
 walks+0.111
 jog+0.106
 lookout+0.103
 stroll+0.095
Neg logit contributions
rier-0.207
isher-0.184
eddy-0.181
ain-0.181
ac-0.180
Tokenled
Token ID992
Feature activation+1.12e-03

Pos logit contributions
 walks+0.364
 walk+0.362
 jog+0.354
 conversation+0.351
 skip+0.341
Neg logit contributions
rier-0.342
rily-0.261
isher-0.256
iest-0.256
ter-0.247
Token again
Token ID757
Feature activation-4.97e-02

Pos logit contributions
rier+0.002
rily+0.001
isher+0.001
bars+0.001
iest+0.001
Neg logit contributions
 walks-0.002
 walk-0.002
 jog-0.002
 conversation-0.002
 skip-0.002
Token.
Token ID13
Feature activation-7.97e-03

Pos logit contributions
 walks+0.098
 walk+0.097
 stroll+0.097
 jog+0.097
ggies+0.097
Neg logit contributions
rier-0.062
 if-0.049
 though-0.045
ter-0.045
 spe-0.045
Token They
Token ID1119
Feature activation-1.12e-01

Pos logit contributions
 walk+0.011
 walks+0.011
 jog+0.010
 conversation+0.010
 skip+0.010
Neg logit contributions
rier-0.017
rily-0.014
isher-0.014
bul-0.014
iest-0.014
Token learned
Token ID4499
Feature activation-2.19e-02

Pos logit contributions
 walks+0.190
 conversation+0.184
ggies+0.183
 stroll+0.181
 walk+0.178
Neg logit contributions
rier-0.168
 spe-0.148
ter-0.147
bul-0.141
bar-0.138
Token told
Token ID1297
Feature activation+6.53e-03

Pos logit contributions
 walk+0.002
 walks+0.002
 conversation+0.002
 jog+0.002
 quarrel+0.002
Neg logit contributions
rier-0.004
rily-0.003
umm-0.003
iest-0.003
isher-0.003
Token them
Token ID606
Feature activation-1.27e-02

Pos logit contributions
rier+0.009
ter+0.006
rily+0.006
 diamonds+0.006
isher+0.006
Neg logit contributions
 jog-0.013
 walk-0.013
 walks-0.013
 conversation-0.013
 skip-0.013
Token to
Token ID284
Feature activation-2.05e-02

Pos logit contributions
 walks+0.027
 walk+0.027
 jog+0.027
 stroll+0.027
 conversation+0.026
Neg logit contributions
rier-0.016
 if-0.011
 of-0.011
bars-0.010
 diamonds-0.010
Token stop
Token ID2245
Feature activation-7.40e-02

Pos logit contributions
 walks+0.032
 stroll+0.029
 hikes+0.029
 conversation+0.029
ggies+0.028
Neg logit contributions
rier-0.037
ter-0.032
bar-0.030
 if-0.029
isher-0.029
Token qu
Token ID627
Feature activation-8.99e-02

Pos logit contributions
 walks+0.116
 walk+0.115
 conversation+0.113
 jog+0.112
 stroll+0.110
Neg logit contributions
rier-0.143
ter-0.111
rily-0.111
isher-0.107
 diamonds-0.105
Tokenarre
Token ID9624
Feature activation-2.00e-01

Pos logit contributions
 walk+0.135
 walks+0.132
 jog+0.124
 stroll+0.119
 lookout+0.116
Neg logit contributions
rier-0.181
ain-0.162
eddy-0.157
bar-0.156
ic-0.155
Tokenlling
Token ID2680
Feature activation-9.74e-02

Pos logit contributions
 walks+0.348
 walk+0.344
 jog+0.342
 skip+0.336
 stroll+0.335
Neg logit contributions
rier-0.375
rily-0.291
iest-0.288
isher-0.277
ier-0.277
Token and
Token ID290
Feature activation-4.58e-04

Pos logit contributions
 walks+0.170
 walk+0.168
 conversation+0.166
 jog+0.165
 stroll+0.160
Neg logit contributions
rier-0.172
rily-0.132
iest-0.126
isher-0.124
bars-0.124
Token to
Token ID284
Feature activation-2.05e-02

Pos logit contributions
 walks+0.001
 walk+0.001
 conversation+0.001
 jog+0.001
 stroll+0.001
Neg logit contributions
rier-0.001
rily-0.001
isher-0.001
ter-0.001
bars-0.001
Token share
Token ID2648
Feature activation-9.42e-02

Pos logit contributions
 stroll+0.029
 walks+0.028
 conversation+0.028
ggies+0.027
 hikes+0.026
Neg logit contributions
rier-0.038
ter-0.032
 if-0.031
isher-0.030
bar-0.030
Token the
Token ID262
Feature activation+7.64e-03

Pos logit contributions
 walks+0.153
 walk+0.152
 conversation+0.148
 jog+0.147
 learn+0.142
Neg logit contributions
rier-0.178
rily-0.139
isher-0.137
iest-0.136
bars-0.135
Token she
Token ID673
Feature activation+4.03e-02

Pos logit contributions
rier+0.035
rily+0.023
iest+0.023
 diamonds+0.022
 if+0.022
Neg logit contributions
 walk-0.057
 walks-0.057
 conversation-0.057
 jog-0.056
 stroll-0.056
Token was
Token ID373
Feature activation-2.18e-02

Pos logit contributions
rier+0.069
ter+0.052
isher+0.052
iest+0.052
bars+0.052
Neg logit contributions
 walks-0.068
 walk-0.068
 jog-0.068
 stroll-0.066
 conversation-0.066
Token sorry
Token ID7926
Feature activation-4.41e-02

Pos logit contributions
 walk+0.034
 walks+0.033
 jog+0.033
 conversation+0.032
 stroll+0.032
Neg logit contributions
rier-0.044
ter-0.034
rily-0.033
iest-0.033
isher-0.032
Token for
Token ID329
Feature activation+9.48e-02

Pos logit contributions
 walks+0.074
 walk+0.074
 jog+0.072
 conversation+0.072
 learn+0.070
Neg logit contributions
rier-0.080
isher-0.064
rily-0.064
 spe-0.063
iest-0.060
Token qu
Token ID627
Feature activation-9.11e-02

Pos logit contributions
rier+0.172
ter+0.135
 diamonds+0.132
rily+0.129
 spe+0.122
Neg logit contributions
 jog-0.151
 walks-0.150
 walk-0.150
 conversation-0.148
 stroll-0.147
Tokenarre
Token ID9624
Feature activation-2.00e-01

Pos logit contributions
 walks+0.121
 walk+0.117
 jog+0.106
 lookout+0.103
 stroll+0.102
Neg logit contributions
rier-0.211
isher-0.180
ain-0.175
eddy-0.174
ac-0.173
Tokenling
Token ID1359
Feature activation-1.21e-01

Pos logit contributions
 walks+0.305
 walk+0.302
 jog+0.295
 conversation+0.289
 skip+0.283
Neg logit contributions
rier-0.408
rily-0.323
iest-0.320
isher-0.316
ter-0.307
Token.
Token ID13
Feature activation-8.46e-03

Pos logit contributions
 walks+0.198
 walk+0.197
 conversation+0.193
 jog+0.192
 skip+0.184
Neg logit contributions
rier-0.226
rily-0.176
isher-0.172
iest-0.170
bars-0.168
Token From
Token ID3574
Feature activation-1.84e-02

Pos logit contributions
 walk+0.015
 walks+0.014
 jog+0.014
 conversation+0.014
 fight+0.013
Neg logit contributions
rier-0.015
rily-0.012
bars-0.011
bar-0.011
ter-0.011
Token that
Token ID326
Feature activation+7.22e-04

Pos logit contributions
 walks+0.038
 walk+0.038
 conversation+0.037
 jog+0.037
 skip+0.036
Neg logit contributions
rier-0.027
rily-0.019
isher-0.019
iest-0.018
ter-0.018
Token day
Token ID1110
Feature activation-2.21e-02

Pos logit contributions
rier+0.001
iest+0.001
bul+0.001
ter+0.001
isher+0.001
Neg logit contributions
 walks-0.001
 walk-0.001
 jog-0.001
 stroll-0.001
ggies-0.001
Token Tom
Token ID4186
Feature activation-3.03e-03

Pos logit contributions
rier+0.015
iest+0.012
isher+0.012
rily+0.012
bars+0.012
Neg logit contributions
 walks-0.013
 walk-0.012
 jog-0.012
 conversation-0.012
 learn-0.012
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 conversation+0.006
 walk+0.006
 stroll+0.006
ggies+0.006
 jog+0.006
Neg logit contributions
rier-0.004
rily-0.003
 spe-0.003
ter-0.003
bars-0.003
Token Mia
Token ID32189
Feature activation-1.89e-02

Pos logit contributions
rier+0.035
rily+0.028
isher+0.027
iest+0.027
bars+0.027
Neg logit contributions
 walks-0.022
 walk-0.021
 jog-0.021
 conversation-0.020
 skip-0.020
Token stop
Token ID2245
Feature activation-7.02e-02

Pos logit contributions
ggies+0.032
astics+0.030
seek+0.030
 topics+0.029
 conversation+0.029
Neg logit contributions
rier-0.031
 if-0.026
 of-0.026
 spe-0.026
ter-0.025
Token qu
Token ID627
Feature activation-9.43e-02

Pos logit contributions
ggies+0.125
seek+0.123
astics+0.121
ggy+0.119
 conversation+0.118
Neg logit contributions
rier-0.111
ter-0.090
rily-0.086
 if-0.086
 of-0.084
Tokenarre
Token ID9624
Feature activation-2.00e-01

Pos logit contributions
 walk+0.164
 stroll+0.155
 walks+0.154
 jog+0.147
 lookout+0.145
Neg logit contributions
rier-0.167
eddy-0.135
ain-0.132
ac-0.131
bar-0.131
Tokenling
Token ID1359
Feature activation-1.23e-01

Pos logit contributions
 walks+0.316
 walk+0.316
 jog+0.308
 conversation+0.306
 stroll+0.301
Neg logit contributions
rier-0.395
rily-0.311
iest-0.306
isher-0.300
ter-0.296
Token.
Token ID13
Feature activation+6.63e-03

Pos logit contributions
 conversation+0.218
 walk+0.217
 walks+0.216
 jog+0.214
 stroll+0.210
Neg logit contributions
rier-0.214
rily-0.163
ter-0.154
isher-0.147
iest-0.146
Token They
Token ID1119
Feature activation-1.05e-01

Pos logit contributions
rier+0.012
 spe+0.009
rily+0.009
bars+0.008
ter+0.008
Neg logit contributions
 walk-0.011
 jog-0.011
 conversation-0.011
 stroll-0.011
 skip-0.011
Token see
Token ID766
Feature activation+1.20e-02

Pos logit contributions
seek+0.165
ggies+0.165
 conversation+0.160
 conversations+0.158
astics+0.157
Neg logit contributions
rier-0.178
 spe-0.152
 if-0.143
ter-0.142
ener-0.137
Token the
Token ID262
Feature activation+9.37e-03

Pos logit contributions
rier+0.022
rily+0.017
isher+0.016
iest+0.016
bars+0.016
Neg logit contributions
 walks-0.021
 walk-0.021
 conversation-0.020
 jog-0.020
 skip-0.019
Token
Token ID198
Feature activation+7.26e-03

Pos logit contributions
rier+0.018
rily+0.016
eddy+0.015
iest+0.015
bars+0.015
Neg logit contributions
 walk-0.010
 walks-0.010
 jog-0.009
 conversation-0.009
 quarrel-0.009
TokenThey
Token ID2990
Feature activation-1.02e-01

Pos logit contributions
rier+0.012
can+0.010
bar+0.010
eddy+0.010
ter+0.010
Neg logit contributions
 walks-0.012
 walk-0.012
 conversation-0.012
 stroll-0.012
 quarrel-0.011
Token learn
Token ID2193
Feature activation-1.18e-01

Pos logit contributions
 conversation+0.127
 walks+0.125
ggies+0.124
 stroll+0.119
 walk+0.117
Neg logit contributions
rier-0.216
 spe-0.178
isher-0.176
rily-0.176
ter-0.175
Token that
Token ID326
Feature activation+2.92e-03

Pos logit contributions
 walk+0.246
 walks+0.242
 stroll+0.239
 jog+0.239
 conversation+0.236
Neg logit contributions
rier-0.160
bul-0.117
 diamonds-0.111
rily-0.110
bars-0.109
Token qu
Token ID627
Feature activation-8.45e-02

Pos logit contributions
rier+0.005
rily+0.004
isher+0.004
iest+0.004
bars+0.004
Neg logit contributions
 walks-0.005
 walk-0.005
 conversation-0.005
 jog-0.005
 skip-0.005
Tokenarre
Token ID9624
Feature activation-1.99e-01

Pos logit contributions
 walk+0.102
 walks+0.098
 jog+0.094
 conversation+0.089
 stroll+0.088
Neg logit contributions
rier-0.200
isher-0.171
rily-0.164
eddy-0.164
bul-0.164
Tokenling
Token ID1359
Feature activation-1.21e-01

Pos logit contributions
 walks+0.303
 walk+0.301
 conversation+0.295
 jog+0.292
 skip+0.278
Neg logit contributions
rier-0.391
rily-0.310
isher-0.302
 spe-0.298
bars-0.298
Token is
Token ID318
Feature activation-4.00e-02

Pos logit contributions
 walks+0.199
 walk+0.198
 conversation+0.194
 jog+0.193
 skip+0.185
Neg logit contributions
rier-0.224
rily-0.174
isher-0.171
iest-0.169
bars-0.167
Token bad
Token ID2089
Feature activation-4.45e-02

Pos logit contributions
 walk+0.061
 walks+0.061
 conversation+0.060
 jog+0.059
 stroll+0.058
Neg logit contributions
rier-0.079
ter-0.061
rily-0.061
 diamonds-0.060
isher-0.060
Token and
Token ID290
Feature activation+1.22e-02

Pos logit contributions
 walks+0.080
 walk+0.079
 conversation+0.078
 jog+0.077
 joke+0.074
Neg logit contributions
rier-0.077
rily-0.059
isher-0.057
iest-0.056
bars-0.056
Token sharing
Token ID7373
Feature activation-7.48e-02

Pos logit contributions
rier+0.022
rily+0.017
isher+0.016
 spe+0.016
ter+0.016
Neg logit contributions
 walks-0.021
 walk-0.021
 conversation-0.020
 jog-0.020
 stroll-0.020
Token on
Token ID319
Feature activation-7.31e-02

Pos logit contributions
 walks+0.043
 walk+0.043
 conversation+0.042
 jog+0.042
 skip+0.040
Neg logit contributions
rier-0.044
rily-0.034
isher-0.033
iest-0.033
bars-0.033
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
 walks+0.131
 walk+0.130
 conversation+0.128
 jog+0.127
 skip+0.123
Neg logit contributions
rier-0.122
rily-0.092
iest-0.092
isher-0.090
 spe-0.089
Token they
Token ID484
Feature activation-1.90e-02

Pos logit contributions
 walk+0.020
 walks+0.020
 jog+0.019
 conversation+0.019
 quarrel+0.019
Neg logit contributions
rier-0.021
rily-0.016
isher-0.015
iest-0.015
 spe-0.015
Token never
Token ID1239
Feature activation+5.65e-02

Pos logit contributions
 walks+0.028
 conversation+0.027
 walk+0.027
ggies+0.027
 stroll+0.026
Neg logit contributions
rier-0.035
ter-0.030
 spe-0.029
bul-0.029
isher-0.029
Token qu
Token ID627
Feature activation-8.88e-02

Pos logit contributions
ter+0.082
 spe+0.082
rier+0.081
 if+0.080
 though+0.079
Neg logit contributions
ggies-0.090
bes-0.088
ggy-0.085
 lawn-0.082
ither-0.082
Tokenarre
Token ID9624
Feature activation-1.98e-01

Pos logit contributions
 walk+0.114
 walks+0.108
 jog+0.106
 lookout+0.103
 chase+0.096
Neg logit contributions
rier-0.191
eddy-0.174
isher-0.171
ain-0.171
ac-0.168
Tokenled
Token ID992
Feature activation+5.03e-03

Pos logit contributions
 walks+0.370
 walk+0.369
 jog+0.362
 conversation+0.356
 skip+0.350
Neg logit contributions
rier-0.329
rily-0.248
isher-0.245
iest-0.245
ter-0.235
Token again
Token ID757
Feature activation-4.60e-02

Pos logit contributions
rier+0.009
rily+0.007
isher+0.007
bars+0.007
iest+0.006
Neg logit contributions
 walks-0.009
 walk-0.009
 jog-0.009
 conversation-0.009
 skip-0.008
Token and
Token ID290
Feature activation+3.98e-03

Pos logit contributions
 walks+0.088
 walk+0.087
 jog+0.086
 stroll+0.086
 conversation+0.086
Neg logit contributions
rier-0.064
ter-0.047
 spe-0.047
 if-0.046
iest-0.046
Token played
Token ID2826
Feature activation-4.55e-02

Pos logit contributions
rier+0.007
ter+0.005
 spe+0.005
 diamonds+0.005
bul+0.005
Neg logit contributions
 walks-0.007
 walk-0.007
 conversation-0.007
 jog-0.007
 stroll-0.006
Token together
Token ID1978
Feature activation-5.60e-02

Pos logit contributions
 stroll+0.089
 walk+0.088
 quarrel+0.086
 jog+0.085
 conversation+0.085
Neg logit contributions
rier-0.064
 if-0.044
rily-0.044
ter-0.043
 for-0.038
Token replied
Token ID8712
Feature activation+7.89e-03

Pos logit contributions
rier+0.140
rily+0.110
isher+0.107
iest+0.106
bars+0.105
Neg logit contributions
 walks-0.121
 walk-0.120
 conversation-0.117
 jog-0.117
 skip-0.112
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
rier+0.016
isher+0.014
rily+0.013
bars+0.013
 spe+0.013
Neg logit contributions
 walks-0.011
 jog-0.011
 walk-0.011
ggy-0.011
ggies-0.010
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
 jog+0.018
 walks+0.017
 stroll+0.016
 walk+0.016
 skip+0.016
Neg logit contributions
rier-0.026
iest-0.023
isher-0.022
ier-0.021
 diamonds-0.021
Token
Token ID198
Feature activation-1.50e-02

Pos logit contributions
 walks+0.023
 walk+0.022
 conversation+0.022
 jog+0.022
 skip+0.021
Neg logit contributions
rier-0.032
rily-0.026
isher-0.025
iest-0.025
bars-0.025
TokenFl
Token ID7414
Feature activation+1.94e-02

Pos logit contributions
 jog+0.020
 walk+0.020
 walks+0.020
 conversation+0.019
 skip+0.019
Neg logit contributions
rier-0.034
rily-0.027
isher-0.027
 diamonds-0.026
bul-0.026
Tokenumm
Token ID13929
Feature activation+2.07e-01

Pos logit contributions
rier+0.030
isher+0.023
ter+0.023
eddy+0.023
ancer+0.022
Neg logit contributions
 conversation-0.037
 walks-0.036
 chase-0.035
 joke-0.034
 conversations-0.034
Tokenox
Token ID1140
Feature activation-3.24e-02

Pos logit contributions
rier+0.366
rily+0.281
 spe+0.270
 diamonds+0.259
bars+0.254
Neg logit contributions
 walk-0.366
 conversation-0.360
 walks-0.358
 jog-0.353
 joke-0.351
Tokened
Token ID276
Feature activation-3.55e-03

Pos logit contributions
 walks+0.053
 walk+0.053
 conversation+0.052
 jog+0.051
 skip+0.050
Neg logit contributions
rier-0.060
rily-0.047
isher-0.046
iest-0.046
ter-0.045
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
 walks+0.006
 walk+0.006
 jog+0.006
 conversation+0.005
 skip+0.005
Neg logit contributions
rier-0.007
isher-0.005
bars-0.005
bul-0.005
umm-0.005
Token the
Token ID262
Feature activation+6.75e-03

Pos logit contributions
 walk+0.030
 jog+0.030
 walks+0.030
 skip+0.028
 conversation+0.027
Neg logit contributions
rier-0.029
isher-0.024
rily-0.023
iest-0.023
umm-0.022
Token little
Token ID1310
Feature activation-1.19e-02

Pos logit contributions
rier+0.010
rily+0.007
isher+0.007
iest+0.007
bars+0.007
Neg logit contributions
 walks-0.014
 walk-0.014
 jog-0.014
 conversation-0.014
 joke-0.013
Token four
Token ID1440
Feature activation+3.40e-02

Pos logit contributions
 walks+0.058
 walk+0.058
 jog+0.056
 conversation+0.055
 skip+0.054
Neg logit contributions
rier-0.070
rily-0.056
isher-0.056
 spe-0.054
iest-0.053
Token more
Token ID517
Feature activation+2.41e-02

Pos logit contributions
rier+0.064
isher+0.053
rily+0.052
ter+0.052
eddy+0.051
Neg logit contributions
 walks-0.058
 walk-0.056
 conversation-0.053
 skip-0.052
 jog-0.052
Token times
Token ID1661
Feature activation+3.10e-02

Pos logit contributions
rier+0.042
isher+0.032
rily+0.032
ter+0.031
bars+0.031
Neg logit contributions
 walks-0.043
 walk-0.043
 conversation-0.042
 jog-0.041
 skip-0.040
Token,"
Token ID553
Feature activation+7.44e-02

Pos logit contributions
rier+0.063
isher+0.054
rily+0.053
 spe+0.052
eddy+0.050
Neg logit contributions
 walk-0.045
 jog-0.043
 walks-0.043
 skip-0.041
 learn-0.041
Token the
Token ID262
Feature activation+1.02e-02

Pos logit contributions
rier+0.147
isher+0.124
rily+0.120
iest+0.118
bars+0.118
Neg logit contributions
 walk-0.116
 walks-0.115
 learn-0.112
 conversation-0.110
 fight-0.107
Token gen
Token ID2429
Feature activation+2.29e-01

Pos logit contributions
rier+0.013
rily+0.011
bars+0.011
isher+0.010
bar+0.009
Neg logit contributions
 conversation-0.021
 walk-0.021
 joke-0.021
 walks-0.021
 quarrel-0.020
Tokenie
Token ID494
Feature activation+8.37e-02

Pos logit contributions
rier+0.314
 spe+0.223
 diamonds+0.222
rily+0.213
bars+0.211
Neg logit contributions
 conversation-0.479
 walk-0.472
 walks-0.468
 joke-0.460
 quarrel-0.456
Token said
Token ID531
Feature activation+2.72e-02

Pos logit contributions
rier+0.175
isher+0.144
bars+0.138
iest+0.137
 diamonds+0.137
Neg logit contributions
 walks-0.124
 walk-0.120
 conversation-0.116
 jog-0.111
 joke-0.111
Token,
Token ID11
Feature activation-2.16e-02

Pos logit contributions
rier+0.056
isher+0.052
bars+0.049
iest+0.049
 diamonds+0.049
Neg logit contributions
 walk-0.035
 walks-0.034
 conversation-0.033
 jog-0.032
 learn-0.031
Token "
Token ID366
Feature activation+4.77e-02

Pos logit contributions
 walks+0.040
 walk+0.039
 conversation+0.038
 jog+0.038
 quarrel+0.037
Neg logit contributions
rier-0.036
isher-0.030
iest-0.029
bars-0.027
eddy-0.027
TokenI
Token ID40
Feature activation+1.71e-02

Pos logit contributions
rier+0.100
rily+0.082
 spe+0.080
isher+0.080
 diamonds+0.080
Neg logit contributions
 walk-0.067
 jog-0.066
 learn-0.064
 skip-0.063
 walks-0.062
Token and
Token ID290
Feature activation-3.95e-03

Pos logit contributions
 walk+0.069
 skip+0.064
 jog+0.063
 walks+0.062
 learn+0.061
Neg logit contributions
rier-0.084
isher-0.070
rily-0.067
eddy-0.066
 diamonds-0.064
Token grab
Token ID5552
Feature activation-1.99e-02

Pos logit contributions
 conversation+0.004
 walks+0.004
ggies+0.004
 walk+0.004
 jog+0.004
Neg logit contributions
rier-0.009
 spe-0.008
bars-0.008
rily-0.008
 diamonds-0.008
Token the
Token ID262
Feature activation+4.46e-03

Pos logit contributions
 walks+0.031
 walk+0.030
 jog+0.029
 conversation+0.029
 skip+0.028
Neg logit contributions
rier-0.039
isher-0.031
rily-0.031
bars-0.031
iest-0.031
Token ju
Token ID7544
Feature activation+2.45e-01

Pos logit contributions
rier+0.010
isher+0.009
rily+0.009
eddy+0.009
bul+0.009
Neg logit contributions
 walk-0.006
 walks-0.005
 chase-0.005
 conversation-0.005
 skip-0.005
Tokenic
Token ID291
Feature activation+1.61e-01

Pos logit contributions
rier+0.279
rily+0.181
isher+0.174
iest+0.168
bars+0.161
Neg logit contributions
 walks-0.581
 walk-0.578
 jog-0.572
 conversation-0.568
 skip-0.556
Tokeniest
Token ID6386
Feature activation+2.43e-01

Pos logit contributions
rier+0.165
 spe+0.133
eddy+0.127
isher+0.119
rily+0.118
Neg logit contributions
 walk-0.373
ggy-0.368
 walks-0.362
 joke-0.357
 run-0.356
Token fruit
Token ID8234
Feature activation-3.06e-02

Pos logit contributions
rier+0.564
rily+0.478
isher+0.474
bars+0.452
eddy+0.444
Neg logit contributions
 walk-0.311
 walks-0.304
 conversation-0.282
 jog-0.281
 skip-0.279
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
 walk+0.039
 walks+0.036
 skip+0.035
 chase+0.033
 run+0.033
Neg logit contributions
rier-0.066
isher-0.059
ancer-0.055
bul-0.055
eddy-0.055
Token One
Token ID1881
Feature activation+2.71e-02

Pos logit contributions
 walks+0.033
 walk+0.032
 conversation+0.032
 jog+0.031
 quarrel+0.030
Neg logit contributions
rier-0.042
rily-0.034
bars-0.033
isher-0.033
iest-0.033
Token day
Token ID1110
Feature activation-2.48e-02

Pos logit contributions
rier+0.057
rily+0.046
isher+0.045
iest+0.044
bars+0.044
Neg logit contributions
 walks-0.039
 walk-0.039
 conversation-0.038
 jog-0.038
 skip-0.036
Token she
Token ID673
Feature activation+3.34e-02

Pos logit contributions
 conversation+0.047
 walks+0.046
 walk+0.046
 jog+0.045
 quarrel+0.045
Neg logit contributions
rier-0.036
iest-0.028
ter-0.028
 spe-0.028
 diamonds-0.028
Token the
Token ID262
Feature activation+6.87e-03

Pos logit contributions
rier+0.017
isher+0.014
rily+0.014
umm+0.014
bars+0.014
Neg logit contributions
 walks-0.012
 walk-0.011
 jog-0.011
 learn-0.011
 chase-0.011
Token biggest
Token ID4094
Feature activation+7.84e-02

Pos logit contributions
rier+0.014
ter+0.011
 spe+0.010
 diamonds+0.010
iest+0.010
Neg logit contributions
 walks-0.010
astics-0.010
 walk-0.010
 conversation-0.010
ggies-0.010
Token,
Token ID11
Feature activation-1.95e-02

Pos logit contributions
rier+0.152
rily+0.118
iest+0.117
 spe+0.116
isher+0.114
Neg logit contributions
 walks-0.123
 walk-0.121
 conversation-0.118
 jog-0.118
 stroll-0.113
Token fl
Token ID781
Feature activation+1.90e-02

Pos logit contributions
 walks+0.029
 walk+0.029
 jog+0.028
 conversation+0.028
 joke+0.027
Neg logit contributions
rier-0.039
isher-0.032
rily-0.032
bars-0.031
iest-0.030
Tokenuff
Token ID1648
Feature activation+1.21e-02

Pos logit contributions
rier+0.039
rily+0.033
 diamonds+0.032
 spe+0.032
isher+0.032
Neg logit contributions
 walk-0.025
 jog-0.025
 walks-0.024
 conversation-0.024
 skip-0.024
Tokeniest
Token ID6386
Feature activation+2.37e-01

Pos logit contributions
rier+0.017
 spe+0.013
umm+0.013
rily+0.013
bul+0.013
Neg logit contributions
 walk-0.024
 walks-0.023
 jog-0.023
ggy-0.022
ggies-0.022
Token do
Token ID466
Feature activation-9.71e-02

Pos logit contributions
rier+0.437
rily+0.345
isher+0.333
bars+0.326
bul+0.319
Neg logit contributions
 walk-0.398
 walks-0.397
 conversation-0.388
 jog-0.388
 joke-0.373
Tokenggy
Token ID19970
Feature activation-8.70e-02

Pos logit contributions
 walks+0.040
 walk+0.040
 conversation+0.036
 jog+0.036
 skip+0.030
Neg logit contributions
rier-0.293
iest-0.256
rily-0.255
 spe-0.248
ter-0.247
Token he
Token ID339
Feature activation+8.06e-02

Pos logit contributions
 walks+0.145
 walk+0.145
 conversation+0.145
 jog+0.143
 skip+0.137
Neg logit contributions
rier-0.153
rily-0.119
ter-0.116
iest-0.115
 diamonds-0.113
Token had
Token ID550
Feature activation-2.08e-02

Pos logit contributions
rier+0.179
rily+0.144
isher+0.142
bars+0.138
iest+0.137
Neg logit contributions
 walks-0.112
 walk-0.110
 jog-0.106
 conversation-0.106
 quarrel-0.100
Token ever
Token ID1683
Feature activation+8.79e-02

Pos logit contributions
 walks+0.038
 walk+0.038
 conversation+0.037
 jog+0.037
 skip+0.035
Neg logit contributions
rier-0.037
rily-0.028
isher-0.027
bars-0.026
bul-0.026
Token as
Token ID355
Feature activation+3.42e-02

Pos logit contributions
rier+0.028
rily+0.022
iest+0.021
isher+0.021
bars+0.021
Neg logit contributions
 walks-0.024
 walk-0.024
 conversation-0.024
 jog-0.024
 skip-0.023
Token he
Token ID339
Feature activation+8.67e-02

Pos logit contributions
rier+0.053
isher+0.040
rily+0.039
bars+0.037
iest+0.036
Neg logit contributions
 walks-0.068
 walk-0.067
 jog-0.065
 conversation-0.064
 skip-0.063
Token opened
Token ID4721
Feature activation+6.39e-02

Pos logit contributions
rier+0.167
rily+0.133
isher+0.130
iest+0.129
bars+0.128
Neg logit contributions
 walks-0.132
 walk-0.132
 conversation-0.130
 jog-0.128
 skip-0.123
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
rier+0.121
iest+0.096
isher+0.095
bars+0.095
ter+0.094
Neg logit contributions
 walks-0.106
 walk-0.101
 jog-0.096
 conversation-0.095
 stroll-0.092
Token ch
Token ID442
Feature activation-1.09e-02

Pos logit contributions
rier+0.018
rily+0.014
isher+0.013
iest+0.013
bars+0.013
Neg logit contributions
 walks-0.022
 walk-0.022
 conversation-0.021
 jog-0.021
 skip-0.020
Tokenocol
Token ID4668
Feature activation+2.64e-01

Pos logit contributions
 walks+0.024
 walk+0.024
 jog+0.024
 conversation+0.023
 stroll+0.023
Neg logit contributions
rier-0.013
rily-0.009
ter-0.009
 diamonds-0.009
bars-0.009
Tokenates
Token ID689
Feature activation+4.89e-02

Pos logit contributions
rier+0.538
rily+0.426
bars+0.420
 spe+0.416
 diamonds+0.407
Neg logit contributions
 walks-0.379
 walk-0.352
 conversation-0.345
 jog-0.334
ggies-0.333
Token and
Token ID290
Feature activation-8.13e-03

Pos logit contributions
rier+0.097
 spe+0.082
isher+0.079
bul+0.077
umm+0.076
Neg logit contributions
 walk-0.073
 walks-0.069
 jog-0.068
 learn-0.064
 skip-0.064
Token started
Token ID2067
Feature activation-2.71e-02

Pos logit contributions
 walks+0.012
 walk+0.012
 jog+0.012
 conversation+0.012
 skip+0.011
Neg logit contributions
rier-0.017
rily-0.013
isher-0.013
iest-0.013
bars-0.013
Token eating
Token ID6600
Feature activation-4.87e-02

Pos logit contributions
 walk+0.043
 walks+0.042
 conversation+0.042
 jog+0.042
 skip+0.040
Neg logit contributions
rier-0.049
rily-0.039
 spe-0.038
bars-0.037
bar-0.037
Token them
Token ID606
Feature activation-1.55e-02

Pos logit contributions
 walks+0.083
 walk+0.080
 jog+0.077
 conversation+0.076
 joke+0.074
Neg logit contributions
rier-0.088
isher-0.070
bars-0.070
iest-0.069
bul-0.068
Token to
Token ID284
Feature activation-1.11e-02

Pos logit contributions
 walk+0.014
 jog+0.014
 walks+0.013
 learn+0.013
 play+0.013
Neg logit contributions
rier-0.020
 diamonds-0.015
 spe-0.015
umm-0.015
rily-0.015
Token the
Token ID262
Feature activation+1.67e-02

Pos logit contributions
 walks+0.019
 walk+0.019
 jog+0.018
 learn+0.018
 fight+0.017
Neg logit contributions
rier-0.023
iest-0.018
isher-0.018
rily-0.017
bars-0.017
Token shop
Token ID6128
Feature activation+5.25e-02

Pos logit contributions
rier+0.035
rily+0.029
iest+0.029
isher+0.028
bars+0.028
Neg logit contributions
 walk-0.024
 walks-0.024
 jog-0.024
 conversation-0.023
 skip-0.022
Token and
Token ID290
Feature activation+1.79e-02

Pos logit contributions
ocol+0.111
rier+0.110
 spe+0.108
bul+0.108
 diamonds+0.107
Neg logit contributions
 walk-0.063
 learn-0.062
 play-0.058
 jog-0.056
 fight-0.053
Token see
Token ID766
Feature activation+1.65e-02

Pos logit contributions
rier+0.049
iest+0.043
rily+0.041
isher+0.041
ier+0.040
Neg logit contributions
 walk-0.019
 walks-0.017
 jog-0.017
 learn-0.017
 fight-0.016
Token if
Token ID611
Feature activation+1.45e-01

Pos logit contributions
rier+0.032
umm+0.027
 spe+0.026
isher+0.025
iest+0.025
Neg logit contributions
 walks-0.028
 walk-0.026
 learn-0.026
 jog-0.025
 play-0.025
Token they
Token ID484
Feature activation-1.08e-02

Pos logit contributions
rier+0.264
umm+0.224
bul+0.224
rily+0.223
isher+0.223
Neg logit contributions
 walks-0.260
 walk-0.260
 jog-0.252
 learn-0.236
 play-0.231
Token have
Token ID423
Feature activation-5.59e-02

Pos logit contributions
 walks+0.015
 walk+0.015
 jog+0.014
 conversation+0.014
 skip+0.014
Neg logit contributions
rier-0.023
rily-0.019
isher-0.018
iest-0.018
bars-0.018
Token any
Token ID597
Feature activation+5.80e-02

Pos logit contributions
 walks+0.104
 walk+0.096
 jog+0.090
 conversation+0.090
 learn+0.088
Neg logit contributions
rier-0.109
rily-0.085
umm-0.083
bars-0.081
eddy-0.081
Token diamonds
Token ID30984
Feature activation+1.68e-01

Pos logit contributions
rier+0.112
rily+0.095
can+0.091
ter+0.090
bars+0.089
Neg logit contributions
 walks-0.095
 walk-0.092
 conversation-0.086
 jog-0.085
 fight-0.084
Token available
Token ID1695
Feature activation+1.31e-01

Pos logit contributions
rier+0.349
umm+0.286
 spe+0.283
isher+0.282
bul+0.278
Neg logit contributions
 walk-0.235
 walks-0.226
 jog-0.218
 learn-0.213
 fight-0.211
Token like
Token ID588
Feature activation+1.41e-02

Pos logit contributions
 walks+0.108
 walk+0.106
 jog+0.103
 conversation+0.101
 joke+0.099
Neg logit contributions
rier-0.130
rily-0.103
bars-0.102
isher-0.102
bul-0.098
Token so
Token ID523
Feature activation+3.79e-02

Pos logit contributions
rier+0.033
rily+0.028
isher+0.026
ener+0.026
bars+0.025
Neg logit contributions
 walks-0.019
 walk-0.018
 jog-0.018
 conversation-0.017
 skip-0.017
Token much
Token ID881
Feature activation+5.93e-02

Pos logit contributions
rier+0.066
iest+0.050
rily+0.050
 diamonds+0.049
bars+0.048
Neg logit contributions
 walk-0.066
 walks-0.065
 conversation-0.064
 jog-0.064
 quarrel-0.061
Token fun
Token ID1257
Feature activation-7.47e-02

Pos logit contributions
rier+0.149
rily+0.124
iest+0.123
isher+0.123
 diamonds+0.121
Neg logit contributions
 walks-0.058
 walk-0.058
 conversation-0.056
 jog-0.055
 skip-0.051
Token!
Token ID0
Feature activation+5.44e-04

Pos logit contributions
 walks+0.125
 walk+0.122
 jog+0.119
 conversation+0.119
 skip+0.114
Neg logit contributions
rier-0.140
rily-0.109
isher-0.107
bars-0.106
iest-0.103
Tokenâ
Token ID22940
Feature activation+8.14e-02

Pos logit contributions
rier+0.001
iest+0.001
rily+0.001
isher+0.001
 spe+0.001
Neg logit contributions
 walks-0.001
 jog-0.001
 walk-0.001
 skip-0.001
 learn-0.001
Token€
Token ID26391
Feature activation-5.30e-02

Pos logit contributions
rier+0.154
ter+0.142
isher+0.141
iest+0.137
ener+0.130
Neg logit contributions
 conversation-0.112
 walks-0.111
 walk-0.106
 jog-0.105
 quarrel-0.104
Token
Token ID198
Feature activation-1.49e-02

Pos logit contributions
 walk+0.085
 jog+0.084
 conversation+0.084
 walks+0.083
 joke+0.082
Neg logit contributions
rier-0.089
 spe-0.072
can-0.069
bars-0.069
 diamonds-0.069
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
 quarrel+0.027
 walk+0.027
 fight+0.027
 walks+0.027
 jog+0.026
Neg logit contributions
rier-0.020
rily-0.017
eddy-0.016
iest-0.016
 of-0.016
TokenSo
Token ID2396
Feature activation-6.25e-02

Pos logit contributions
 walks+0.024
 conversation+0.023
 walk+0.023
 fight+0.022
 quarrel+0.022
Neg logit contributions
rier-0.024
rily-0.019
can-0.019
ter-0.019
iest-0.019
Token she
Token ID673
Feature activation+3.61e-02

Pos logit contributions
 walk+0.099
 jog+0.096
 walks+0.096
 skip+0.094
 conversation+0.092
Neg logit contributions
rier-0.127
isher-0.102
rily-0.102
bul-0.098
bars-0.097
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
rier+0.092
rily+0.074
bars+0.068
bul+0.067
bar+0.067
Neg logit contributions
 conversation-0.107
 jog-0.106
ggies-0.105
 walks-0.105
 joke-0.103
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
 walks+0.021
 walk+0.021
 conversation+0.020
 jog+0.020
 skip+0.019
Neg logit contributions
rier-0.027
rily-0.021
isher-0.021
iest-0.021
ter-0.020
Token
Token ID198
Feature activation-1.37e-02

Pos logit contributions
 walks+0.020
 walk+0.020
 conversation+0.019
 jog+0.019
 skip+0.018
Neg logit contributions
rier-0.029
rily-0.023
isher-0.023
iest-0.023
bars-0.022
TokenThe
Token ID464
Feature activation+1.57e-02

Pos logit contributions
 walks+0.023
 walk+0.023
 conversation+0.022
 jog+0.022
 stroll+0.021
Neg logit contributions
rier-0.025
rily-0.019
bars-0.019
iest-0.018
isher-0.018
Token witch
Token ID16365
Feature activation+5.97e-02

Pos logit contributions
rier+0.025
iest+0.018
ter+0.018
 spe+0.017
ener+0.017
Neg logit contributions
 walks-0.029
 walk-0.029
 jog-0.028
 conversation-0.028
 stroll-0.028
Token gave
Token ID2921
Feature activation+1.00e-01

Pos logit contributions
rier+0.107
rily+0.083
 spe+0.081
isher+0.080
iest+0.080
Neg logit contributions
 walks-0.099
 walk-0.098
 jog-0.097
 conversation-0.096
 skip-0.093
Token Tom
Token ID4186
Feature activation-1.08e-02

Pos logit contributions
rier+0.229
bul+0.185
 spe+0.184
rily+0.181
iest+0.180
Neg logit contributions
 walks-0.148
 walk-0.136
 conversation-0.125
 learn-0.122
 jog-0.120
Token some
Token ID617
Feature activation+7.23e-03

Pos logit contributions
 conversation+0.019
 walks+0.019
 walk+0.019
 jog+0.018
 quarrel+0.018
Neg logit contributions
rier-0.018
rily-0.014
isher-0.014
iest-0.014
bars-0.014
Token candy
Token ID18550
Feature activation-1.93e-02

Pos logit contributions
rier+0.014
rily+0.011
isher+0.011
iest+0.010
bars+0.010
Neg logit contributions
 walks-0.012
 walk-0.011
 conversation-0.011
 jog-0.011
 skip-0.011
Token,
Token ID11
Feature activation-1.85e-02

Pos logit contributions
 walk+0.028
 walks+0.027
 jog+0.025
 conversation+0.024
 skip+0.024
Neg logit contributions
rier-0.040
isher-0.032
 spe-0.032
bars-0.032
rily-0.032
Token and
Token ID290
Feature activation-3.51e-03

Pos logit contributions
 walks+0.029
 walk+0.028
 conversation+0.027
 jog+0.026
 learn+0.026
Neg logit contributions
rier-0.040
isher-0.031
rily-0.031
iest-0.031
umm-0.030
Token He
Token ID679
Feature activation+1.78e-02

Pos logit contributions
 walks+0.039
 walk+0.038
 conversation+0.038
 jog+0.037
 stroll+0.036
Neg logit contributions
rier-0.051
rily-0.041
iest-0.040
isher-0.040
 spe-0.039
Token was
Token ID373
Feature activation-2.77e-02

Pos logit contributions
rier+0.033
rily+0.026
isher+0.026
 spe+0.026
ter+0.025
Neg logit contributions
 walks-0.028
 walk-0.028
 jog-0.028
 conversation-0.028
 skip-0.027
Token so
Token ID523
Feature activation+3.47e-02

Pos logit contributions
 walks+0.048
 walk+0.048
 conversation+0.047
 jog+0.046
 skip+0.045
Neg logit contributions
rier-0.049
rily-0.038
isher-0.037
iest-0.037
bars-0.036
Token disappointed
Token ID11679
Feature activation-2.83e-02

Pos logit contributions
rier+0.072
rily+0.056
 spe+0.055
iest+0.054
isher+0.054
Neg logit contributions
 walks-0.051
 walk-0.050
 conversation-0.049
 jog-0.049
 stroll-0.047
Token and
Token ID290
Feature activation-6.25e-03

Pos logit contributions
 walk+0.043
 walks+0.043
 jog+0.042
 conversation+0.041
 learn+0.039
Neg logit contributions
rier-0.056
rily-0.046
isher-0.046
 spe-0.044
bars-0.044
Token he
Token ID339
Feature activation+7.82e-02

Pos logit contributions
 walks+0.010
 walk+0.010
 conversation+0.010
 jog+0.010
 skip+0.010
Neg logit contributions
rier-0.012
rily-0.009
isher-0.009
ter-0.009
bars-0.009
Token sadly
Token ID21098
Feature activation+5.53e-02

Pos logit contributions
rier+0.144
rily+0.116
 spe+0.112
isher+0.112
ter+0.111
Neg logit contributions
 walks-0.121
 walk-0.121
 conversation-0.120
 jog-0.120
 skip-0.116
Token had
Token ID550
Feature activation-1.98e-02

Pos logit contributions
rier+0.096
 spe+0.078
rily+0.078
ter+0.078
 diamonds+0.077
Neg logit contributions
 conversation-0.086
 joke-0.085
 jog-0.084
ggies-0.083
 lawn-0.082
Token to
Token ID284
Feature activation-1.83e-02

Pos logit contributions
 walks+0.034
 walk+0.033
 conversation+0.032
 jog+0.032
 quarrel+0.031
Neg logit contributions
rier-0.039
rily-0.031
isher-0.030
iest-0.030
ter-0.029
Token go
Token ID467
Feature activation-1.54e-02

Pos logit contributions
 conversation+0.027
ggies+0.026
seek+0.025
 conversations+0.025
 walks+0.024
Neg logit contributions
rier-0.032
ter-0.027
 if-0.025
bar-0.025
rily-0.025
Token home
Token ID1363
Feature activation-5.44e-03

Pos logit contributions
 conversation+0.027
 walks+0.027
 joke+0.027
 walk+0.027
ggies+0.027
Neg logit contributions
rier-0.024
iest-0.018
rily-0.018
bar-0.018
ter-0.018
Token go
Token ID467
Feature activation-3.00e-03

Pos logit contributions
 conversation+0.009
 stroll+0.008
 walks+0.008
ggies+0.008
 conversations+0.008
Neg logit contributions
rier-0.010
ter-0.009
bar-0.009
 if-0.008
 spe-0.008
Token to
Token ID284
Feature activation-5.83e-03

Pos logit contributions
 walks+0.005
 walk+0.005
 conversation+0.005
 jog+0.005
 skip+0.005
Neg logit contributions
rier-0.005
rily-0.004
isher-0.004
iest-0.004
bars-0.004
Token the
Token ID262
Feature activation+1.89e-02

Pos logit contributions
 walks+0.011
 walk+0.010
 conversation+0.010
 jog+0.010
 stroll+0.010
Neg logit contributions
rier-0.009
rily-0.007
ter-0.007
isher-0.007
 spe-0.007
Token doctor
Token ID6253
Feature activation+9.01e-02

Pos logit contributions
rier+0.037
ter+0.028
 spe+0.028
rily+0.028
isher+0.028
Neg logit contributions
 walks-0.029
 walk-0.028
 conversation-0.028
 stroll-0.027
 jog-0.027
Token now
Token ID783
Feature activation+5.93e-02

Pos logit contributions
rier+0.181
iest+0.146
rily+0.145
 diamonds+0.144
isher+0.143
Neg logit contributions
 walk-0.142
 walks-0.140
 jog-0.134
 conversation-0.134
 learn-0.131
Token,"
Token ID553
Feature activation+9.73e-02

Pos logit contributions
rier+0.107
rily+0.084
isher+0.083
iest+0.081
 spe+0.080
Neg logit contributions
 walks-0.102
 walk-0.102
 jog-0.098
 conversation-0.098
 learn-0.095
Token Mom
Token ID11254
Feature activation-1.61e-02

Pos logit contributions
rier+0.206
isher+0.176
rily+0.173
iest+0.161
bar+0.159
Neg logit contributions
 walk-0.147
 conversation-0.143
 learn-0.143
 walks-0.142
 jog-0.141
Token said
Token ID531
Feature activation+3.94e-02

Pos logit contributions
 walks+0.025
 walk+0.025
 conversation+0.025
 jog+0.024
 quarrel+0.023
Neg logit contributions
rier-0.033
rily-0.026
 diamonds-0.025
isher-0.025
iest-0.025
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
rier+0.073
rily+0.056
bars+0.053
ter+0.053
isher+0.053
Neg logit contributions
 walks-0.067
 walk-0.066
 jog-0.065
 conversation-0.065
 joke-0.063
Token
Token ID198
Feature activation+6.08e-03

Pos logit contributions
rier+0.027
rily+0.021
isher+0.021
bars+0.021
 spe+0.021
Neg logit contributions
 walks-0.021
 walk-0.020
 conversation-0.020
 jog-0.020
 joke-0.019
Token
Token ID198
Feature activation+6.37e-03

Pos logit contributions
rier+0.010
rily+0.008
bars+0.008
eddy+0.008
 spe+0.008
Neg logit contributions
 walk-0.010
 conversation-0.010
 walks-0.010
 quarrel-0.010
 joke-0.010
Token Max
Token ID5436
Feature activation-8.19e-02

Pos logit contributions
rier+0.107
ter+0.089
rily+0.087
 spe+0.085
bar+0.084
Neg logit contributions
 conversation-0.097
 walks-0.094
 learn-0.092
 walk-0.092
 quarrel-0.091
Token.
Token ID13
Feature activation-1.97e-02

Pos logit contributions
 walks+0.134
 walk+0.133
 conversation+0.132
 jog+0.130
 skip+0.126
Neg logit contributions
rier-0.149
rily-0.118
isher-0.114
ter-0.113
iest-0.113
Token Max
Token ID5436
Feature activation-8.17e-02

Pos logit contributions
 conversation+0.030
 walk+0.029
 walks+0.029
 jog+0.028
ggy+0.028
Neg logit contributions
rier-0.034
bars-0.029
 spe-0.027
eddy-0.027
bar-0.027
Token had
Token ID550
Feature activation-2.07e-02

Pos logit contributions
 conversation+0.151
 blame+0.147
 skip+0.143
 stroll+0.143
 conversations+0.141
Neg logit contributions
 spe-0.088
eddy-0.081
 if-0.079
rier-0.078
ter-0.076
Token a
Token ID257
Feature activation-9.99e-03

Pos logit contributions
 walk+0.039
 conversation+0.039
 jog+0.038
 quarrel+0.037
 walks+0.037
Neg logit contributions
rier-0.028
 spe-0.023
 diamonds-0.022
bar-0.022
ter-0.021
Token long
Token ID890
Feature activation+1.54e-02

Pos logit contributions
seek+0.020
 walks+0.019
arming+0.019
 obey+0.018
|+0.018
Neg logit contributions
rier-0.012
 spe-0.011
 v-0.009
bar-0.009
 cuc-0.009
Token tail
Token ID7894
Feature activation-4.59e-02

Pos logit contributions
rier+0.021
bar+0.020
 spe+0.019
iest+0.017
 eighty+0.017
Neg logit contributions
 blame-0.024
 walks-0.024
 obey-0.024
 achieve-0.023
 apologise-0.023
Token and
Token ID290
Feature activation-2.02e-03

Pos logit contributions
 walks+0.088
 walk+0.086
 conversation+0.085
 jog+0.085
 quarrel+0.082
Neg logit contributions
rier-0.072
rily-0.053
iest-0.052
isher-0.052
ter-0.051
Token loved
Token ID6151
Feature activation-2.50e-02

Pos logit contributions
 walks+0.003
 walk+0.003
 jog+0.003
 conversation+0.003
 joke+0.003
Neg logit contributions
rier-0.004
rily-0.003
isher-0.003
iest-0.003
bars-0.003
Token to
Token ID284
Feature activation-1.51e-02

Pos logit contributions
 walk+0.051
 conversation+0.050
 walks+0.050
 jog+0.050
 skip+0.049
Neg logit contributions
rier-0.031
 spe-0.024
 diamonds-0.023
bul-0.022
ter-0.022
Token w
Token ID266
Feature activation+3.33e-03

Pos logit contributions
arre+0.019
Enough+0.018
 conversation+0.015
imming+0.015
 neighbourhood+0.015
Neg logit contributions
 spe-0.027
 v-0.025
bar-0.025
 cuc-0.024
 if-0.024
Token
Token ID198
Feature activation-1.15e-02

Pos logit contributions
 walk+0.021
 walks+0.021
 conversation+0.020
 jog+0.020
 skip+0.019
Neg logit contributions
rier-0.023
rily-0.018
bars-0.018
isher-0.017
bar-0.017
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
 walks+0.018
 walk+0.018
 conversation+0.018
 jog+0.017
 quarrel+0.017
Neg logit contributions
rier-0.021
rily-0.017
iest-0.017
bars-0.016
isher-0.016
TokenHe
Token ID1544
Feature activation+3.02e-02

Pos logit contributions
 conversation+0.018
 quarrel+0.017
 walks+0.017
 joke+0.017
 walk+0.017
Neg logit contributions
rier-0.020
can-0.018
bars-0.017
bar-0.017
iest-0.016
Token walked
Token ID6807
Feature activation+5.44e-03

Pos logit contributions
rier+0.059
rily+0.048
isher+0.047
 spe+0.047
bul+0.047
Neg logit contributions
 walks-0.042
 conversation-0.042
 walk-0.042
 jog-0.041
 skip-0.040
Token closer
Token ID5699
Feature activation-4.19e-02

Pos logit contributions
rier+0.009
rily+0.007
iest+0.007
ter+0.007
isher+0.007
Neg logit contributions
 walks-0.009
 conversation-0.009
 walk-0.009
 jog-0.009
 joke-0.009
Token to
Token ID284
Feature activation-1.19e-02

Pos logit contributions
 walks+0.076
 walk+0.075
 conversation+0.074
 jog+0.074
 skip+0.071
Neg logit contributions
rier-0.071
rily-0.054
isher-0.052
iest-0.052
bars-0.051
Token find
Token ID1064
Feature activation-9.23e-03

Pos logit contributions
 walk+0.022
 walks+0.022
 jog+0.021
 conversation+0.021
 learn+0.020
Neg logit contributions
rier-0.021
rily-0.016
isher-0.016
iest-0.015
bars-0.015
Token out
Token ID503
Feature activation-2.55e-02

Pos logit contributions
 walks+0.012
 walk+0.011
 learn+0.011
 chase+0.010
 jog+0.010
Neg logit contributions
rier-0.022
rily-0.019
umm-0.019
bars-0.019
isher-0.018
Token and
Token ID290
Feature activation+6.13e-05

Pos logit contributions
 walks+0.042
 walk+0.041
 conversation+0.040
 jog+0.040
 skip+0.038
Neg logit contributions
rier-0.049
rily-0.038
isher-0.038
iest-0.037
bars-0.037
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
rier+0.000
rily+0.000
ter+0.000
 diamonds+0.000
isher+0.000
Neg logit contributions
 walks-0.000
 conversation-0.000
 walk-0.000
 jog-0.000
ggies-0.000
Token suddenly
Token ID6451
Feature activation+4.24e-02

Pos logit contributions
 walks+0.031
 walk+0.031
 jog+0.030
 conversation+0.030
 skip+0.029
Neg logit contributions
rier-0.025
rily-0.019
isher-0.018
iest-0.018
bars-0.018
Token You
Token ID921
Feature activation-2.37e-02

Pos logit contributions
rier+0.056
rily+0.046
isher+0.045
iest+0.045
 diamonds+0.043
Neg logit contributions
 walks-0.041
 jog-0.039
 learn-0.039
 walk-0.039
 skip-0.038
Token might
Token ID1244
Feature activation+4.61e-02

Pos logit contributions
 walks+0.036
 walk+0.036
 jog+0.035
 conversation+0.035
 skip+0.034
Neg logit contributions
rier-0.047
rily-0.037
isher-0.037
iest-0.036
bars-0.036
Token break
Token ID2270
Feature activation-2.06e-02

Pos logit contributions
rier+0.092
 spe+0.077
ter+0.074
bar+0.073
iest+0.072
Neg logit contributions
 conversation-0.063
 walks-0.062
ggies-0.059
 stroll-0.058
 conversations-0.056
Token it
Token ID340
Feature activation+2.43e-02

Pos logit contributions
 walks+0.038
 walk+0.037
 jog+0.036
 learn+0.036
 conversation+0.035
Neg logit contributions
rier-0.034
rily-0.026
isher-0.026
 spe-0.026
umm-0.025
Token or
Token ID393
Feature activation+1.01e-01

Pos logit contributions
rier+0.045
isher+0.037
 spe+0.036
rily+0.035
 diamonds+0.035
Neg logit contributions
 walk-0.040
 walks-0.039
 jog-0.038
 learn-0.037
 fight-0.037
Token hurt
Token ID5938
Feature activation-5.70e-02

Pos logit contributions
rier+0.239
rily+0.197
isher+0.194
iest+0.192
bars+0.191
Neg logit contributions
 walks-0.114
 walk-0.113
 conversation-0.110
 jog-0.108
 skip-0.102
Token yourselves
Token ID27012
Feature activation-1.51e-03

Pos logit contributions
 walks+0.092
 walk+0.090
 learn+0.088
 jog+0.087
 quarrel+0.085
Neg logit contributions
rier-0.105
isher-0.086
rily-0.085
bul-0.084
bars-0.083
Token,"
Token ID553
Feature activation+9.93e-02

Pos logit contributions
 walk+0.002
 walks+0.002
 jog+0.002
 fight+0.002
 learn+0.002
Neg logit contributions
rier-0.003
isher-0.002
umm-0.002
iest-0.002
 spe-0.002
Token she
Token ID673
Feature activation+5.32e-02

Pos logit contributions
rier+0.207
rily+0.169
isher+0.166
iest+0.159
ter+0.153
Neg logit contributions
 walk-0.162
 walks-0.159
 jog-0.156
 conversation-0.154
 learn-0.153
Token says
Token ID1139
Feature activation+2.48e-02

Pos logit contributions
rier+0.097
rily+0.076
isher+0.074
bars+0.073
iest+0.073
Neg logit contributions
 walk-0.087
 walks-0.087
 conversation-0.085
 jog-0.085
 stroll-0.082
Token.
Token ID13
Feature activation+2.11e-02

Pos logit contributions
rier+0.049
rily+0.039
isher+0.038
iest+0.037
bars+0.037
Neg logit contributions
 walks-0.038
 walk-0.038
 conversation-0.037
 jog-0.037
 skip-0.035
Token€
Token ID26391
Feature activation-5.58e-02

Pos logit contributions
rier+0.136
iest+0.125
isher+0.122
bar+0.120
ter+0.119
Neg logit contributions
 conversation-0.105
 walks-0.095
 lawn-0.094
 conversations-0.093
 stroll-0.093
Token™
Token ID8151
Feature activation+1.48e-02

Pos logit contributions
 walk+0.079
 jog+0.078
 joke+0.078
 conversation+0.078
 walks+0.077
Neg logit contributions
rier-0.100
bars-0.082
 spe-0.079
rily-0.079
can-0.079
Tokent
Token ID83
Feature activation+7.31e-02

Pos logit contributions
rier+0.024
isher+0.019
 diamonds+0.018
 spe+0.018
rily+0.018
Neg logit contributions
 walk-0.028
 conversation-0.027
 jog-0.027
 walks-0.027
ggies-0.026
Token be
Token ID307
Feature activation-6.43e-02

Pos logit contributions
rier+0.156
ter+0.131
iest+0.128
bul+0.127
rily+0.126
Neg logit contributions
 walks-0.099
 conversation-0.095
ggies-0.089
astics-0.087
 conversations-0.086
Token fixed
Token ID5969
Feature activation+1.32e-01

Pos logit contributions
 walk+0.101
 walks+0.100
 conversation+0.098
 quarrel+0.097
 fight+0.097
Neg logit contributions
rier-0.128
rily-0.102
isher-0.101
bars-0.100
iest-0.099
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
rier+0.280
isher+0.240
 spe+0.236
 cuc+0.233
bars+0.231
Neg logit contributions
 walks-0.170
 walk-0.162
 jog-0.156
 conversation-0.149
 skip-0.148
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
 walks+0.019
 walk+0.019
 conversation+0.019
 jog+0.019
 skip+0.018
Neg logit contributions
rier-0.029
rily-0.023
isher-0.023
iest-0.023
bars-0.022
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
 walk+0.025
 quarrel+0.025
 walks+0.025
 fight+0.025
 joke+0.024
Neg logit contributions
rier-0.014
 of-0.012
eddy-0.012
can-0.012
rily-0.011
TokenThe
Token ID464
Feature activation+1.87e-02

Pos logit contributions
 walks+0.023
 walk+0.023
 fight+0.023
 joke+0.022
 conversation+0.022
Neg logit contributions
rier-0.017
can-0.014
ter-0.013
iest-0.013
eddy-0.013
Token little
Token ID1310
Feature activation-1.38e-02

Pos logit contributions
rier+0.031
rily+0.022
iest+0.022
isher+0.022
ter+0.022
Neg logit contributions
 walks-0.034
 conversation-0.033
 walk-0.033
 jog-0.032
 quarrel-0.032
Token girl
Token ID2576
Feature activation+6.91e-03

Pos logit contributions
 walk+0.027
 jog+0.027
 skip+0.026
 walks+0.026
 conversation+0.025
Neg logit contributions
rier-0.023
rily-0.018
bar-0.017
isher-0.016
 spe-0.016
Token From
Token ID3574
Feature activation-2.61e-02

Pos logit contributions
 walks+0.024
 walk+0.024
 conversation+0.023
 jog+0.023
 skip+0.022
Neg logit contributions
rier-0.035
rily-0.028
isher-0.028
iest-0.027
bars-0.027
Token then
Token ID788
Feature activation+3.30e-03

Pos logit contributions
 walks+0.049
 walk+0.049
 jog+0.048
 conversation+0.047
 skip+0.045
Neg logit contributions
rier-0.044
rily-0.033
iest-0.031
 spe-0.031
isher-0.031
Token on
Token ID319
Feature activation-7.65e-02

Pos logit contributions
rier+0.006
rily+0.005
isher+0.004
iest+0.004
bars+0.004
Neg logit contributions
 walks-0.006
 walk-0.006
 conversation-0.006
 jog-0.005
 skip-0.005
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
 walks+0.111
 walk+0.110
 jog+0.105
 conversation+0.105
ggies+0.099
Neg logit contributions
rier-0.165
rily-0.132
isher-0.131
bars-0.127
bul-0.125
Token Tim
Token ID5045
Feature activation-5.79e-03

Pos logit contributions
 walk+0.032
 walks+0.031
 jog+0.031
 conversation+0.030
 fight+0.030
Neg logit contributions
rier-0.035
rily-0.028
isher-0.028
bars-0.026
iest-0.025
Tokenmy
Token ID1820
Feature activation-2.17e-02

Pos logit contributions
 walks+0.010
 walk+0.010
 conversation+0.010
 jog+0.010
 learn+0.009
Neg logit contributions
rier-0.011
rily-0.008
 spe-0.008
isher-0.008
 diamonds-0.008
Token learned
Token ID4499
Feature activation-2.55e-02

Pos logit contributions
 walks+0.033
 walk+0.033
 conversation+0.032
 jog+0.032
 stroll+0.031
Neg logit contributions
rier-0.041
iest-0.032
rily-0.032
isher-0.032
 spe-0.032
Token that
Token ID326
Feature activation-5.49e-03

Pos logit contributions
 walks+0.050
 walk+0.050
 conversation+0.049
 jog+0.049
 skip+0.047
Neg logit contributions
rier-0.040
rily-0.029
isher-0.028
iest-0.028
bars-0.028
Token sometimes
Token ID3360
Feature activation+1.23e-03

Pos logit contributions
 walks+0.011
 walk+0.011
 jog+0.011
 conversation+0.010
 skip+0.010
Neg logit contributions
rier-0.009
rily-0.007
isher-0.006
iest-0.006
bars-0.006
Token the
Token ID262
Feature activation+6.34e-03

Pos logit contributions
rier+0.002
 diamonds+0.001
iest+0.001
ter+0.001
 spe+0.001
Neg logit contributions
 conversation-0.003
 walks-0.003
 walk-0.003
 jog-0.002
 stroll-0.002
Token simple
Token ID2829
Feature activation+2.26e-02

Pos logit contributions
rier+0.012
 diamonds+0.010
ter+0.010
 spe+0.010
iest+0.010
Neg logit contributions
seek-0.008
ggies-0.008
 walks-0.008
 conversation-0.008
astics-0.008
Token some
Token ID617
Feature activation-4.83e-05

Pos logit contributions
rier+0.201
rily+0.159
isher+0.158
bars+0.154
iest+0.153
Neg logit contributions
 walks-0.152
 walk-0.149
 conversation-0.144
 jog-0.142
 stroll-0.137
Token boxes
Token ID10559
Feature activation+4.30e-02

Pos logit contributions
 walks+0.000
 walk+0.000
 conversation+0.000
 jog+0.000
 stroll+0.000
Neg logit contributions
rier-0.000
rily-0.000
iest-0.000
isher-0.000
 diamonds-0.000
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
rier+0.086
isher+0.070
rily+0.066
 spe+0.066
umm+0.065
Neg logit contributions
 walk-0.068
 walks-0.063
 jog-0.061
 skip-0.061
 quarrel-0.060
Token She
Token ID1375
Feature activation-2.75e-02

Pos logit contributions
 walks+0.020
 walk+0.019
 conversation+0.019
 jog+0.019
 stroll+0.018
Neg logit contributions
rier-0.029
isher-0.023
iest-0.023
rily-0.023
ter-0.022
Token hopped
Token ID45230
Feature activation+4.07e-02

Pos logit contributions
 walks+0.037
 walk+0.036
 conversation+0.035
 jog+0.034
 skip+0.033
Neg logit contributions
rier-0.062
rily-0.049
iest-0.049
isher-0.049
bars-0.048
Token up
Token ID510
Feature activation-1.52e-01

Pos logit contributions
rier+0.081
isher+0.063
rily+0.063
iest+0.063
bars+0.063
Neg logit contributions
 walks-0.063
 walk-0.063
 jog-0.061
 conversation-0.060
 skip-0.059
Token on
Token ID319
Feature activation-7.67e-02

Pos logit contributions
 walks+0.270
 walk+0.269
 conversation+0.261
 jog+0.259
 skip+0.250
Neg logit contributions
rier-0.267
rily-0.203
isher-0.201
iest-0.198
bars-0.195
Token top
Token ID1353
Feature activation-1.98e-02

Pos logit contributions
 walks+0.144
 walk+0.143
 conversation+0.141
 jog+0.140
 skip+0.135
Neg logit contributions
rier-0.124
rily-0.093
isher-0.090
iest-0.089
bars-0.088
Token of
Token ID286
Feature activation+1.28e-01

Pos logit contributions
 walk+0.043
 walks+0.042
 conversation+0.041
 jog+0.040
 stroll+0.039
Neg logit contributions
rier-0.028
isher-0.020
ter-0.020
rily-0.019
 diamonds-0.018
Token them
Token ID606
Feature activation-1.64e-02

Pos logit contributions
rier+0.233
rily+0.178
isher+0.177
iest+0.175
bars+0.173
Neg logit contributions
 walks-0.223
 walk-0.221
 conversation-0.212
 jog-0.210
 skip-0.204
Token and
Token ID290
Feature activation-6.88e-03

Pos logit contributions
 walks+0.029
 conversation+0.029
 jog+0.029
 walk+0.028
 joke+0.028
Neg logit contributions
rier-0.027
rily-0.021
bars-0.020
bar-0.020
 diamonds-0.020
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
rier+0.037
rily+0.029
isher+0.029
iest+0.028
bars+0.028
Neg logit contributions
 walks-0.030
 walk-0.030
 jog-0.029
 conversation-0.029
 skip-0.028
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
 walks+0.016
 walk+0.015
 conversation+0.015
 jog+0.015
 learn+0.014
Neg logit contributions
rier-0.039
iest-0.033
isher-0.032
rily-0.032
 spe-0.031
Token
Token ID198
Feature activation-1.03e-02

Pos logit contributions
 walk+0.012
 walks+0.012
 jog+0.011
 conversation+0.011
 quarrel+0.011
Neg logit contributions
rier-0.026
rily-0.022
iest-0.021
isher-0.021
bars-0.021
TokenThe
Token ID464
Feature activation+1.92e-02

Pos logit contributions
 walks+0.014
 walk+0.014
 conversation+0.013
 jog+0.013
 skip+0.012
Neg logit contributions
rier-0.022
rily-0.018
isher-0.018
iest-0.018
bars-0.018
Token two
Token ID734
Feature activation-7.84e-02

Pos logit contributions
rier+0.037
rily+0.029
isher+0.028
iest+0.028
 spe+0.028
Neg logit contributions
 walks-0.030
 walk-0.030
 conversation-0.029
 jog-0.029
 skip-0.028
Token birds
Token ID10087
Feature activation-8.15e-02

Pos logit contributions
 walks+0.115
 walk+0.115
 conversation+0.111
 jog+0.108
 quarrel+0.104
Neg logit contributions
rier-0.165
rily-0.135
isher-0.131
bars-0.128
bul-0.126
Token flew
Token ID13112
Feature activation+3.63e-02

Pos logit contributions
 walks+0.125
 walk+0.121
 jog+0.120
 conversation+0.119
 joke+0.115
Neg logit contributions
rier-0.156
rily-0.126
ter-0.124
iest-0.123
isher-0.123
Token together
Token ID1978
Feature activation-5.76e-02

Pos logit contributions
rier+0.069
rily+0.054
isher+0.053
iest+0.052
bars+0.052
Neg logit contributions
 walks-0.058
 walk-0.058
 conversation-0.056
 jog-0.056
 skip-0.054
Token,
Token ID11
Feature activation-2.31e-02

Pos logit contributions
 walks+0.105
 conversation+0.104
 walk+0.103
 jog+0.103
ggies+0.102
Neg logit contributions
rier-0.091
rily-0.070
 spe-0.067
 diamonds-0.066
iest-0.064
Token and
Token ID290
Feature activation-6.13e-03

Pos logit contributions
 walks+0.034
 walk+0.034
 conversation+0.034
 jog+0.033
 joke+0.032
Neg logit contributions
rier-0.046
rily-0.037
 spe-0.035
isher-0.035
ter-0.035
Token the
Token ID262
Feature activation+7.80e-03

Pos logit contributions
 walks+0.011
 walk+0.011
 conversation+0.010
 jog+0.010
 quarrel+0.010
Neg logit contributions
rier-0.012
rily-0.009
isher-0.009
bars-0.009
iest-0.009
Token there
Token ID612
Feature activation-1.71e-02

Pos logit contributions
 walk+0.054
 walks+0.054
 jog+0.053
 conversation+0.052
 skip+0.051
Neg logit contributions
rier-0.043
isher-0.031
rily-0.031
bars-0.031
iest-0.030
Token was
Token ID373
Feature activation-3.31e-02

Pos logit contributions
 walk+0.024
 walks+0.023
 conversation+0.023
 jog+0.023
 joke+0.022
Neg logit contributions
rier-0.038
rily-0.030
isher-0.030
bars-0.029
bul-0.028
Token a
Token ID257
Feature activation-1.27e-02

Pos logit contributions
 walk+0.056
 walks+0.055
 conversation+0.055
 jog+0.054
 quarrel+0.052
Neg logit contributions
rier-0.058
rily-0.046
iest-0.044
 diamonds-0.044
 spe-0.044
Token monkey
Token ID21657
Feature activation-1.70e-02

Pos logit contributions
 walks+0.023
 walk+0.023
 conversation+0.023
 jog+0.022
 stroll+0.022
Neg logit contributions
rier-0.021
 spe-0.016
rily-0.016
ter-0.015
isher-0.015
Token named
Token ID3706
Feature activation+5.83e-02

Pos logit contributions
 walks+0.025
 walk+0.025
 conversation+0.024
 jog+0.024
 stroll+0.023
Neg logit contributions
rier-0.034
rily-0.027
iest-0.027
ter-0.026
 spe-0.026
Token Max
Token ID5436
Feature activation-9.06e-02

Pos logit contributions
rier+0.128
isher+0.103
iest+0.103
rily+0.101
bul+0.100
Neg logit contributions
 walks-0.085
 walk-0.083
 jog-0.078
 conversation-0.077
 chase-0.075
Token.
Token ID13
Feature activation-2.50e-02

Pos logit contributions
 walks+0.142
 walk+0.141
 conversation+0.138
 jog+0.137
 skip+0.131
Neg logit contributions
rier-0.176
rily-0.139
isher-0.136
iest-0.134
bars-0.133
Token Max
Token ID5436
Feature activation-9.04e-02

Pos logit contributions
 walks+0.033
 walk+0.032
 conversation+0.032
 jog+0.031
 skip+0.030
Neg logit contributions
rier-0.054
rily-0.044
isher-0.043
iest-0.043
bars-0.043
Token lived
Token ID5615
Feature activation+1.64e-02

Pos logit contributions
 conversation+0.158
 walk+0.151
 jog+0.150
 stroll+0.149
 walks+0.149
Neg logit contributions
rier-0.127
 spe-0.117
ter-0.111
eddy-0.108
bul-0.108
Token in
Token ID287
Feature activation-6.76e-02

Pos logit contributions
rier+0.030
rily+0.024
isher+0.023
bars+0.023
iest+0.023
Neg logit contributions
 walks-0.027
 walk-0.027
 conversation-0.027
 jog-0.027
 skip-0.026
Token a
Token ID257
Feature activation-1.31e-02

Pos logit contributions
 walks+0.102
 walk+0.099
 jog+0.095
 conversation+0.093
 learn+0.090
Neg logit contributions
rier-0.144
rily-0.115
isher-0.115
bars-0.112
iest-0.111
Token to
Token ID284
Feature activation-1.27e-02

Pos logit contributions
rier+0.003
rily+0.002
isher+0.002
bul+0.002
 spe+0.002
Neg logit contributions
 walk-0.003
 walks-0.003
 conversation-0.003
 jog-0.003
 learn-0.003
Token wash
Token ID13502
Feature activation+6.27e-03

Pos logit contributions
 walks+0.017
 conversation+0.017
ggies+0.017
seek+0.016
 conversations+0.016
Neg logit contributions
rier-0.025
ter-0.022
rily-0.021
iest-0.020
 cuc-0.020
Token your
Token ID534
Feature activation+2.33e-02

Pos logit contributions
rier+0.010
rily+0.007
isher+0.007
iest+0.007
bars+0.007
Neg logit contributions
 walks-0.012
 walk-0.012
 conversation-0.012
 jog-0.012
 skip-0.011
Token face
Token ID1986
Feature activation-6.88e-02

Pos logit contributions
rier+0.043
rily+0.034
isher+0.031
 diamonds+0.031
 spe+0.030
Neg logit contributions
 conversation-0.038
 walks-0.038
seek-0.037
 jog-0.037
 walk-0.037
Token."
Token ID526
Feature activation+3.06e-02

Pos logit contributions
 walk+0.114
 walks+0.111
 jog+0.110
 conversation+0.107
 skip+0.107
Neg logit contributions
rier-0.125
isher-0.099
 spe-0.097
rily-0.095
 diamonds-0.094
Token Spot
Token ID15899
Feature activation-8.80e-02

Pos logit contributions
rier+0.045
 Pe+0.035
 I+0.034
rily+0.034
 spe+0.034
Neg logit contributions
 joke-0.053
 conversation-0.052
 quarrel-0.051
 lookout-0.050
 walk-0.050
Token did
Token ID750
Feature activation-4.04e-02

Pos logit contributions
 walks+0.144
 conversation+0.142
 stroll+0.141
 walk+0.139
 jog+0.138
Neg logit contributions
rier-0.140
ter-0.117
rily-0.116
 spe-0.112
 cuc-0.110
Token not
Token ID407
Feature activation-3.17e-02

Pos logit contributions
 conversation+0.075
 stroll+0.071
 neighborhood+0.069
 conversations+0.069
 walk+0.069
Neg logit contributions
rier-0.065
rily-0.051
 if-0.050
 though-0.045
ter-0.045
Token like
Token ID588
Feature activation+1.76e-02

Pos logit contributions
 walks+0.056
ggies+0.055
 conversation+0.053
astics+0.052
 stroll+0.051
Neg logit contributions
rier-0.052
iest-0.042
ter-0.041
ier-0.040
 if-0.040
Token water
Token ID1660
Feature activation-8.64e-03

Pos logit contributions
rier+0.028
rily+0.021
 diamonds+0.020
 spe+0.020
ter+0.020
Neg logit contributions
 walk-0.033
 conversation-0.032
 walks-0.032
 jog-0.032
 skip-0.031
Token,
Token ID11
Feature activation-1.17e-02

Pos logit contributions
 walks+0.015
 walk+0.015
 conversation+0.015
 jog+0.014
 stroll+0.014
Neg logit contributions
rier-0.015
rily-0.012
iest-0.011
ter-0.011
isher-0.011
TokenHe
Token ID1544
Feature activation+2.53e-02

Pos logit contributions
 walks+0.021
 walk+0.021
 conversation+0.021
 quarrel+0.020
 fight+0.020
Neg logit contributions
rier-0.025
bar-0.020
rily-0.020
iest-0.020
can-0.019
Token walked
Token ID6807
Feature activation+8.69e-04

Pos logit contributions
rier+0.051
rily+0.041
isher+0.040
iest+0.039
bars+0.039
Neg logit contributions
 walks-0.038
 walk-0.037
 conversation-0.036
 jog-0.036
 skip-0.035
Token up
Token ID510
Feature activation-1.52e-01

Pos logit contributions
rier+0.002
rily+0.001
iest+0.001
isher+0.001
ter+0.001
Neg logit contributions
 walks-0.001
 walk-0.001
 conversation-0.001
 jog-0.001
 quarrel-0.001
Token to
Token ID284
Feature activation-2.20e-02

Pos logit contributions
 walks+0.279
 walk+0.278
 conversation+0.272
 jog+0.270
 skip+0.260
Neg logit contributions
rier-0.257
rily-0.194
isher-0.189
iest-0.187
bars-0.184
Token the
Token ID262
Feature activation+6.92e-03

Pos logit contributions
 walk+0.040
 walks+0.040
 jog+0.038
 conversation+0.037
 learn+0.037
Neg logit contributions
rier-0.041
rily-0.031
isher-0.031
bars-0.030
iest-0.030
Token pond
Token ID16723
Feature activation-8.19e-02

Pos logit contributions
rier+0.015
ter+0.012
rily+0.012
 diamonds+0.012
 spe+0.011
Neg logit contributions
 walks-0.010
 conversation-0.009
 walk-0.009
ggies-0.009
astics-0.009
Token,
Token ID11
Feature activation-2.64e-02

Pos logit contributions
 walk+0.132
 walks+0.130
 conversation+0.127
 jog+0.127
 skip+0.122
Neg logit contributions
rier-0.156
isher-0.122
rily-0.121
iest-0.120
 spe-0.118
Token getting
Token ID1972
Feature activation-2.61e-02

Pos logit contributions
 walks+0.043
 walk+0.043
 jog+0.042
 conversation+0.042
 skip+0.040
Neg logit contributions
rier-0.050
rily-0.039
isher-0.038
iest-0.038
bars-0.037
Token closer
Token ID5699
Feature activation-5.16e-02

Pos logit contributions
 walks+0.038
 walk+0.038
 jog+0.037
 conversation+0.036
 skip+0.035
Neg logit contributions
rier-0.054
rily-0.043
iest-0.043
isher-0.042
bars-0.041
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
 walks+0.077
 walk+0.077
 jog+0.072
 conversation+0.072
 stroll+0.069
Neg logit contributions
rier-0.105
isher-0.090
iest-0.083
rily-0.083
bul-0.082
Token closer
Token ID5699
Feature activation-5.20e-02

Pos logit contributions
 walks+0.021
 walk+0.020
 jog+0.020
 conversation+0.020
 skip+0.019
Neg logit contributions
rier-0.021
rily-0.016
isher-0.015
iest-0.015
bars-0.015