TOP ACTIVATIONS
MAX = 0.283

both skipped and jumped mer rily as they made their way
deed . She made a mental note to be more thoughtful
S ki pping mer rily along , Hawk came across
The girl eyed it hung rily . She thought it looked
. He pur red hung rily and decided it would be
happy and sailed away mer rily . <|endoftext|> Once there was
Polly looked at it hung rily
watched as she skipped mer rily away into the sunset .
It turns around and pe cks Tom on the nose .
the tree . It pe cks Anna on the head .
the box . It pe cks the bread .
the snow man and pe cks at it . It knocks
It flies down and pe cks at the coin . It
R ib bit , rib bit ! Go away , go
" R ib bit , rib bit ! Go
truck like Ben . Tru cks are fun . Doll s
It turns around and pe cks Tom on the nose .
other birds . It pe cks at the seeds and makes
R ib bit said , " Would you
to the ball and pe cks it . The ball makes

MIN ACTIVATIONS
MIN = -0.290

girl felt even p rou der . She was happy that
. Red felt even sad der . But then
This made her even glad der and she smiled with joy
look even smaller and sad der . <|endoftext|> Once upon a
even happier and p rou der . He displays his sail
. Tim felt even sad der now , and he cried
said no , the mad der Sara got . Then ,
. Lily felt even sad der than before . From that
Jack got even mad der and threw his toy on
it made her even sad der . Suddenly ,
and it was even red der . Suddenly , she had
little boy felt even sad der . He started to cry
and Lily felt even sad der . From that day on
nothing made it p rou der . The
, so Tim got mad der . He pushed Jack away
car made him even sad der . He asked everyone if
. He felt even sad der now . But then ,
. He felt even sad der . Max sat
found something even we ir der ! There were little plastic
." Lily felt even sad der than before .

INTERVAL 0.140 - 0.283
CONTAINS 0.270%

to swim . She sw am every chance she could -
She loved to eat jelly sandwiches . One day , her
mummy bear had one more suggestion . She asked , "
" We don 't have enough money to buy the toy
mom thought about it and finally agreed . Emma was relieved

INTERVAL -0.003 - 0.140
CONTAINS 57.611%

But then , something incredible happened . Lily 's mom
helmet and strong armour with a shield to protect him .
and had to take a nap . When he
Sorry , Lily . I did not mean to be mean
. She got excited and ran up to it . She

INTERVAL -0.147 - -0.003
CONTAINS 41.841%

Lily . Did you have fun at the park ?" Lily
too deep !" she said , pointing at the soil .
money to pay for it and the little girl was very
said , " But you know what we can do ?
like him . The old pigeon was so

INTERVAL -0.290 - -0.147
CONTAINS 0.277%

The audience cheers and cl aps . They are relieved
together !" But Mitt ens just wanted to sleep .
stir , stir !" Ben echoed . " We have to
calling for him ! The p anda zoom ed to his
fl owers . Eventually the gard ener came back with lots
Token both
Token ID1111
Feature activation-7.19e-02

Pos logit contributions
 mental+0.039
rily+0.037
bits+0.035
cks+0.035
bit+0.034
Neg logit contributions
der-0.035
Stud-0.031
 tr-0.031
 Lu-0.030
 pian-0.030
Token skipped
Token ID26684
Feature activation+1.92e-02

Pos logit contributions
der+0.073
Stud+0.067
 pian+0.064
 escal+0.063
 tr+0.063
Neg logit contributions
 mental-0.069
rily-0.065
bits-0.061
cks-0.060
bit-0.060
Token and
Token ID290
Feature activation-8.69e-03

Pos logit contributions
 mental+0.018
rily+0.017
bits+0.016
bit+0.015
cks+0.015
Neg logit contributions
der-0.020
Stud-0.018
Ju-0.017
 optical-0.017
 pian-0.017
Token jumped
Token ID11687
Feature activation+2.46e-02

Pos logit contributions
der+0.008
 tr+0.008
 Lu+0.008
Stud+0.008
Ste+0.008
Neg logit contributions
 mental-0.008
rily-0.008
cks-0.008
bits-0.008
bit-0.007
Token mer
Token ID4017
Feature activation-3.84e-02

Pos logit contributions
 mental+0.026
rily+0.025
cks+0.024
bits+0.023
bit+0.023
Neg logit contributions
der-0.022
 tr-0.020
Stud-0.020
 Lu-0.020
Ste-0.019
Tokenrily
Token ID28904
Feature activation+2.83e-01

Pos logit contributions
der+0.066
 tr+0.063
 Lu+0.062
Stud+0.061
 pian+0.061
Neg logit contributions
 mental-0.009
rily-0.006
bits-0.006
cks-0.005
bit-0.004
Token as
Token ID355
Feature activation+1.02e-02

Pos logit contributions
 mental+0.403
cks+0.395
ulously+0.376
bits+0.374
 acquaintance+0.367
Neg logit contributions
der-0.190
 tr-0.167
 Lu-0.134
 inside-0.116
 sub-0.114
Token they
Token ID484
Feature activation-2.44e-02

Pos logit contributions
 mental+0.010
cks+0.010
rily+0.010
bits+0.009
bit+0.009
Neg logit contributions
der-0.010
 Lu-0.009
 tr-0.009
Ste-0.009
Stud-0.009
Token made
Token ID925
Feature activation+5.78e-03

Pos logit contributions
der+0.020
 tr+0.019
 Lu+0.018
Ste+0.017
Ju+0.017
Neg logit contributions
 mental-0.028
rily-0.027
cks-0.026
bits-0.025
bit-0.025
Token their
Token ID511
Feature activation-4.53e-02

Pos logit contributions
 mental+0.008
cks+0.008
bel+0.008
bits+0.008
 achieve+0.008
Neg logit contributions
 Lu-0.002
 tr-0.002
der-0.002
 fog-0.002
 p-0.002
Token way
Token ID835
Feature activation+1.06e-01

Pos logit contributions
 tr+0.035
 p+0.030
 r+0.024
 t+0.023
 m+0.021
Neg logit contributions
cks-0.065
astered-0.064
iece-0.063
 granting-0.063
izer-0.063
Token deed
Token ID28637
Feature activation+5.57e-02

Pos logit contributions
der+0.011
 tr+0.011
 Lu+0.009
leaf+0.009
 archae+0.009
Neg logit contributions
cks-0.017
rily-0.016
 mental-0.015
bel-0.014
bits-0.014
Token.
Token ID13
Feature activation-4.07e-03

Pos logit contributions
cks+0.081
 mental+0.080
bit+0.076
bits+0.075
rily+0.075
Neg logit contributions
der-0.033
 tr-0.027
 Lu-0.024
 escal-0.019
Ste-0.019
Token She
Token ID1375
Feature activation+5.67e-03

Pos logit contributions
der+0.003
 Lu+0.003
Stud+0.003
 tr+0.003
Ste+0.003
Neg logit contributions
 mental-0.005
cks-0.004
rily-0.004
bits-0.004
bit-0.004
Token made
Token ID925
Feature activation-5.25e-03

Pos logit contributions
 mental+0.006
rily+0.006
cks+0.006
bits+0.005
bit+0.005
Neg logit contributions
der-0.005
 tr-0.005
 Lu-0.005
Stud-0.005
Ste-0.005
Token a
Token ID257
Feature activation+1.62e-03

Pos logit contributions
 Lu+0.003
 just+0.002
leaf+0.002
 all+0.002
 tr+0.002
Neg logit contributions
cks-0.007
ho-0.007
bit-0.007
 lair-0.007
bel-0.007
Token mental
Token ID5110
Feature activation+2.82e-01

Pos logit contributions
cks+0.002
rily+0.002
bits+0.002
ases+0.002
bit+0.002
Neg logit contributions
 tr-0.001
der-0.001
 pian-0.001
 Lu-0.001
Ste-0.001
Token note
Token ID3465
Feature activation+3.27e-02

Pos logit contributions
rily+0.295
cks+0.287
 mental+0.281
bits+0.273
bit+0.259
Neg logit contributions
der-0.246
 tr-0.238
Stud-0.222
Ste-0.222
leaf-0.220
Token to
Token ID284
Feature activation-3.43e-03

Pos logit contributions
cks+0.043
 mental+0.042
rily+0.041
bit+0.040
bits+0.040
Neg logit contributions
der-0.022
 tr-0.019
 Lu-0.017
Stud-0.016
leaf-0.016
Token be
Token ID307
Feature activation-4.78e-02

Pos logit contributions
der+0.003
 tr+0.003
 Lu+0.003
Stud+0.003
Ste+0.003
Neg logit contributions
 mental-0.004
rily-0.003
cks-0.003
bits-0.003
bit-0.003
Token more
Token ID517
Feature activation-3.96e-02

Pos logit contributions
der+0.039
 tr+0.036
Stud+0.035
 Lu+0.035
Ste+0.034
Neg logit contributions
 mental-0.053
rily-0.052
cks-0.050
bits-0.048
bit-0.047
Token thoughtful
Token ID22677
Feature activation+9.01e-04

Pos logit contributions
der+0.040
 tr+0.037
 Lu+0.036
Stud+0.036
Ste+0.035
Neg logit contributions
 mental-0.038
rily-0.035
cks-0.034
bits-0.034
bit-0.032
Token
Token ID198
Feature activation-6.96e-03

Pos logit contributions
 Lu+0.003
Ste+0.003
Stud+0.003
der+0.003
 tr+0.003
Neg logit contributions
rily-0.010
 mental-0.010
cks-0.010
bits-0.009
bit-0.009
TokenS
Token ID50
Feature activation-4.31e-02

Pos logit contributions
der+0.006
 tr+0.006
Stud+0.006
 Lu+0.006
Ste+0.005
Neg logit contributions
 mental-0.007
rily-0.007
cks-0.007
bits-0.007
bit-0.006
Tokenki
Token ID4106
Feature activation+9.08e-03

Pos logit contributions
der+0.037
 tr+0.033
Stud+0.033
Ste+0.033
 Lu+0.032
Neg logit contributions
 mental-0.046
rily-0.046
cks-0.044
bits-0.042
bit-0.041
Tokenpping
Token ID2105
Feature activation-1.91e-02

Pos logit contributions
 mental+0.008
rily+0.008
bits+0.007
cks+0.007
bit+0.007
Neg logit contributions
der-0.009
Stud-0.009
 pian-0.009
Ste-0.009
leaf-0.009
Token mer
Token ID4017
Feature activation-3.97e-02

Pos logit contributions
der+0.012
 tr+0.011
 Lu+0.011
Stud+0.010
 escal+0.010
Neg logit contributions
 mental-0.025
rily-0.024
cks-0.024
bit-0.022
 acquaintance-0.022
Tokenrily
Token ID28904
Feature activation+2.82e-01

Pos logit contributions
der+0.054
 tr+0.050
Stud+0.049
 Lu+0.049
 pian+0.048
Neg logit contributions
 mental-0.024
rily-0.021
cks-0.020
bits-0.020
bit-0.019
Token along
Token ID1863
Feature activation+4.01e-03

Pos logit contributions
 mental+0.419
cks+0.403
rily+0.391
ucker+0.390
bits+0.382
Neg logit contributions
 tr-0.176
der-0.166
 just-0.136
 r-0.129
 listened-0.116
Token,
Token ID11
Feature activation-6.36e-03

Pos logit contributions
rily+0.006
 mental+0.006
ucker+0.006
bit+0.006
cks+0.006
Neg logit contributions
 tr-0.002
der-0.002
 Lu-0.002
 r-0.002
 sub-0.002
Token Hawk
Token ID26698
Feature activation-3.07e-02

Pos logit contributions
der+0.005
 tr+0.005
 Lu+0.004
Stud+0.004
 pian+0.004
Neg logit contributions
 mental-0.007
rily-0.007
cks-0.007
bit-0.007
bits-0.007
Token came
Token ID1625
Feature activation+2.37e-03

Pos logit contributions
 tr+0.017
der+0.015
 kn+0.014
 r+0.014
 just+0.014
Neg logit contributions
rily-0.044
 mental-0.044
 acquaintance-0.041
cks-0.040
bit-0.040
Token across
Token ID1973
Feature activation-1.10e-02

Pos logit contributions
 mental+0.003
rily+0.003
cks+0.003
bit+0.003
bits+0.003
Neg logit contributions
der-0.002
 tr-0.002
 Lu-0.001
Stud-0.001
 sub-0.001
Token The
Token ID383
Feature activation+2.07e-03

Pos logit contributions
der+0.003
 Lu+0.003
 tr+0.003
Stud+0.003
Ste+0.003
Neg logit contributions
 mental-0.004
rily-0.004
cks-0.004
bits-0.004
bit-0.004
Token girl
Token ID2576
Feature activation+2.11e-02

Pos logit contributions
 mental+0.002
rily+0.002
cks+0.002
bits+0.002
bit+0.002
Neg logit contributions
der-0.002
 tr-0.001
Stud-0.001
 Lu-0.001
Ste-0.001
Token eyed
Token ID45320
Feature activation+9.03e-02

Pos logit contributions
 mental+0.022
rily+0.021
cks+0.021
bits+0.020
bit+0.020
Neg logit contributions
der-0.019
 tr-0.017
 Lu-0.017
Stud-0.017
Ste-0.016
Token it
Token ID340
Feature activation+6.03e-04

Pos logit contributions
 mental+0.117
rily+0.107
cks+0.106
bit+0.105
bits+0.104
Neg logit contributions
der-0.066
 Lu-0.064
 tr-0.062
 pian-0.056
Stud-0.053
Token hung
Token ID9174
Feature activation+8.37e-02

Pos logit contributions
 mental+0.001
cks+0.001
bit+0.001
ases+0.001
rily+0.001
Neg logit contributions
 tr-0.000
der-0.000
 Lu-0.000
 r-0.000
 just-0.000
Tokenrily
Token ID28904
Feature activation+2.77e-01

Pos logit contributions
 mental+0.087
rily+0.079
cks+0.079
bits+0.078
bit+0.076
Neg logit contributions
der-0.077
 tr-0.071
 Lu-0.070
Stud-0.068
 escal-0.068
Token.
Token ID13
Feature activation-3.88e-03

Pos logit contributions
 mental+0.416
cks+0.399
bits+0.397
 acquaintance+0.391
bit+0.387
Neg logit contributions
der-0.175
 tr-0.151
 Lu-0.105
 just-0.100
 inside-0.096
Token She
Token ID1375
Feature activation+1.26e-02

Pos logit contributions
der+0.003
 Lu+0.003
 tr+0.003
Stud+0.003
Ste+0.003
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.005
bits-0.004
bit-0.004
Token thought
Token ID1807
Feature activation-9.47e-03

Pos logit contributions
 mental+0.012
rily+0.011
cks+0.010
bits+0.010
bit+0.010
Neg logit contributions
der-0.013
Stud-0.012
 escal-0.011
Ste-0.011
 Lu-0.011
Token it
Token ID340
Feature activation+1.09e-03

Pos logit contributions
der+0.006
 Lu+0.005
 tr+0.005
 pian+0.005
 fog+0.004
Neg logit contributions
bit-0.013
 mental-0.013
cks-0.013
 acquaintance-0.013
bits-0.013
Token looked
Token ID3114
Feature activation+3.06e-03

Pos logit contributions
cks+0.001
rily+0.001
 mental+0.001
bit+0.001
bits+0.001
Neg logit contributions
 tr-0.001
 Lu-0.001
der-0.001
Stud-0.001
 pian-0.001
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
 mental+0.077
bits+0.068
bit+0.066
rily+0.066
ulously+0.066
Neg logit contributions
der-0.032
 tr-0.029
 inside-0.021
 r-0.021
 p-0.020
Token He
Token ID679
Feature activation-1.77e-03

Pos logit contributions
der+0.011
 pian+0.010
 tr+0.010
 Lu+0.010
leaf+0.010
Neg logit contributions
 mental-0.010
rily-0.009
bits-0.009
bit-0.009
cks-0.008
Token pur
Token ID1308
Feature activation-1.97e-02

Pos logit contributions
der+0.002
 tr+0.001
 Lu+0.001
Stud+0.001
Ste+0.001
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.002
bit-0.002
Tokenred
Token ID445
Feature activation+1.70e-02

Pos logit contributions
der+0.019
 tr+0.016
 Lu+0.016
Stud+0.016
Ste+0.016
Neg logit contributions
 mental-0.021
rily-0.019
cks-0.018
bel-0.018
 choice-0.017
Token hung
Token ID9174
Feature activation+7.41e-02

Pos logit contributions
 mental+0.019
rily+0.018
cks+0.018
bits+0.017
bit+0.017
Neg logit contributions
der-0.014
 tr-0.013
 Lu-0.013
Stud-0.012
Ste-0.012
Tokenrily
Token ID28904
Feature activation+2.74e-01

Pos logit contributions
 mental+0.097
 acquaintance+0.090
 breakthrough+0.085
cks+0.085
bits+0.084
Neg logit contributions
der-0.049
 tr-0.040
 Lu-0.040
 archae-0.038
 escal-0.036
Token and
Token ID290
Feature activation-1.84e-02

Pos logit contributions
 mental+0.430
cks+0.409
 acquaintance+0.398
bits+0.397
ulously+0.392
Neg logit contributions
der-0.166
 tr-0.125
 just-0.092
 inside-0.082
 Lu-0.075
Token decided
Token ID3066
Feature activation+4.36e-02

Pos logit contributions
der+0.019
Stud+0.018
 escal+0.018
Ste+0.018
 pian+0.017
Neg logit contributions
 mental-0.016
rily-0.015
cks-0.014
bits-0.014
 mun-0.014
Token it
Token ID340
Feature activation-8.69e-03

Pos logit contributions
 mental+0.054
rily+0.052
cks+0.051
bits+0.051
bit+0.050
Neg logit contributions
der-0.033
 tr-0.030
 Lu-0.028
Stud-0.028
Ste-0.027
Token would
Token ID561
Feature activation+6.77e-02

Pos logit contributions
der+0.007
 tr+0.007
 Lu+0.006
Stud+0.006
Ste+0.006
Neg logit contributions
 mental-0.011
rily-0.010
cks-0.010
bits-0.009
bit-0.009
Token be
Token ID307
Feature activation-5.49e-02

Pos logit contributions
rily+0.078
 mental+0.077
cks+0.076
bits+0.071
bel+0.067
Neg logit contributions
der-0.052
 tr-0.050
 Lu-0.045
Ste-0.045
Stud-0.045
Token happy
Token ID3772
Feature activation+4.70e-02

Pos logit contributions
 mental+0.011
rily+0.011
cks+0.010
bits+0.010
bit+0.010
Neg logit contributions
der-0.008
 Lu-0.008
Ste-0.007
 tr-0.007
Stud-0.007
Token and
Token ID290
Feature activation-1.44e-02

Pos logit contributions
 mental+0.052
rily+0.050
cks+0.049
bits+0.048
bit+0.047
Neg logit contributions
der-0.041
 tr-0.036
 Lu-0.035
Stud-0.035
Ste-0.034
Token sailed
Token ID33835
Feature activation+7.60e-02

Pos logit contributions
der+0.013
 tr+0.012
 Lu+0.012
Stud+0.012
 escal+0.011
Neg logit contributions
 mental-0.014
rily-0.014
cks-0.014
bits-0.013
bit-0.013
Token away
Token ID1497
Feature activation+1.91e-02

Pos logit contributions
 mental+0.094
cks+0.094
 acquaintance+0.087
rily+0.086
bits+0.084
Neg logit contributions
der-0.052
 tr-0.050
 Lu-0.046
lessly-0.043
 fog-0.042
Token mer
Token ID4017
Feature activation-4.55e-02

Pos logit contributions
 mental+0.020
rily+0.019
cks+0.019
bits+0.018
bit+0.018
Neg logit contributions
der-0.017
 tr-0.015
Stud-0.015
 Lu-0.015
 pian-0.015
Tokenrily
Token ID28904
Feature activation+2.72e-01

Pos logit contributions
der+0.066
 tr+0.060
 Lu+0.058
Stud+0.058
 pian+0.058
Neg logit contributions
 mental-0.025
bits-0.021
cks-0.020
rily-0.020
bit-0.019
Token.
Token ID13
Feature activation-4.83e-03

Pos logit contributions
 mental+0.435
cks+0.428
ucker+0.405
bits+0.404
 acquaintance+0.400
Neg logit contributions
der-0.157
 tr-0.114
 just-0.090
 part-0.079
 inside-0.076
Token<|endoftext|>
Token ID50256
Feature activation-1.96e-02

Pos logit contributions
der+0.005
 tr+0.004
 pian+0.004
Stud+0.004
 escal+0.004
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.004
bits-0.004
bit-0.004
TokenOnce
Token ID7454
Feature activation+3.43e-02

Pos logit contributions
 Lu+0.010
der+0.010
 tr+0.010
Ste+0.009
Stud+0.009
Neg logit contributions
 mental-0.027
rily-0.026
cks-0.025
bits-0.025
bit-0.025
Token there
Token ID612
Feature activation+3.85e-02

Pos logit contributions
 mental+0.043
rily+0.042
cks+0.041
bits+0.040
bit+0.040
Neg logit contributions
der-0.023
 tr-0.021
 Lu-0.021
Stud-0.020
Ste-0.019
Token was
Token ID373
Feature activation+7.09e-03

Pos logit contributions
 mental+0.050
rily+0.050
cks+0.049
bits+0.048
bit+0.047
Neg logit contributions
 Lu-0.024
 tr-0.024
der-0.022
Stud-0.021
Ste-0.020
Token Polly
Token ID36898
Feature activation+6.79e-02

Pos logit contributions
der+0.005
 tr+0.005
 Lu+0.004
Stud+0.004
Ste+0.004
Neg logit contributions
cks-0.010
 mental-0.009
rily-0.009
bits-0.009
bel-0.009
Token looked
Token ID3114
Feature activation-4.78e-03

Pos logit contributions
 mental+0.071
rily+0.068
cks+0.066
bits+0.065
bit+0.063
Neg logit contributions
der-0.061
 tr-0.056
 Lu-0.054
Stud-0.054
Ste-0.053
Token at
Token ID379
Feature activation-9.14e-02

Pos logit contributions
der+0.004
 tr+0.003
 Lu+0.003
Stud+0.003
Ste+0.003
Neg logit contributions
 mental-0.006
cks-0.005
bits-0.005
rily-0.005
bit-0.005
Token it
Token ID340
Feature activation-7.57e-03

Pos logit contributions
der+0.054
 tr+0.051
 Lu+0.050
 Mind+0.041
Stud+0.040
Neg logit contributions
bel-0.127
 mental-0.126
cks-0.121
astered-0.114
rily-0.114
Token hung
Token ID9174
Feature activation+7.51e-02

Pos logit contributions
der+0.004
 tr+0.004
 just+0.003
 Lu+0.003
 r+0.002
Neg logit contributions
cks-0.011
 mental-0.011
ases-0.011
bit-0.011
bits-0.011
Tokenrily
Token ID28904
Feature activation+2.68e-01

Pos logit contributions
 mental+0.077
cks+0.070
 acquaintance+0.068
bits+0.066
bit+0.064
Neg logit contributions
der-0.075
 tr-0.065
 Lu-0.064
 escal-0.060
Ste-0.059
Token
Token ID6184
Feature activation-4.79e-02

Pos logit contributions
 mental+0.415
cks+0.411
bits+0.408
 acquaintance+0.399
bit+0.389
Neg logit contributions
der-0.161
 tr-0.117
 just-0.081
 inside-0.069
 Lu-0.064
Token
Token ID95
Feature activation-7.12e-02

Pos logit contributions
der+0.061
 tr+0.058
Stud+0.057
 Lu+0.056
 pian+0.056
Neg logit contributions
 mental-0.032
rily-0.030
cks-0.027
bits-0.026
bit-0.026
Token
Token ID26391
Feature activation-8.40e-02

Pos logit contributions
der+0.081
 pian+0.073
 tr+0.073
 Lu+0.071
 ambul+0.070
Neg logit contributions
 mental-0.062
bit-0.059
bits-0.056
 Day-0.054
rily-0.054
Token
Token ID447
Feature activation-1.17e-01

Pos logit contributions
der+0.069
 tr+0.063
Stud+0.061
 Lu+0.061
Ste+0.060
Neg logit contributions
 mental-0.094
rily-0.091
cks-0.087
bits-0.086
bit-0.083
Token
Token ID250
Feature activation-7.63e-02

Pos logit contributions
Ste+0.091
der+0.090
 tr+0.089
Ju+0.088
Stud+0.084
Neg logit contributions
rily-0.133
 mental-0.128
bits-0.124
cks-0.123
bit-0.116
Token watched
Token ID7342
Feature activation-6.78e-02

Pos logit contributions
der+0.014
 tr+0.014
 Lu+0.012
Ste+0.011
 pian+0.011
Neg logit contributions
 mental-0.024
rily-0.024
bits-0.024
cks-0.023
bit-0.022
Token as
Token ID355
Feature activation-6.20e-03

Pos logit contributions
der+0.043
 Lu+0.033
 tr+0.027
 just+0.025
 all+0.025
Neg logit contributions
 mental-0.097
 acquaintance-0.096
cks-0.096
iate-0.092
bits-0.091
Token she
Token ID673
Feature activation-1.20e-02

Pos logit contributions
der+0.004
 Lu+0.004
 tr+0.003
 lightning+0.003
 fog+0.003
Neg logit contributions
cks-0.008
 mental-0.008
bits-0.008
bit-0.008
rily-0.008
Token skipped
Token ID26684
Feature activation+8.93e-04

Pos logit contributions
der+0.012
 tr+0.011
 Lu+0.010
Stud+0.010
Ste+0.010
Neg logit contributions
 mental-0.012
rily-0.011
cks-0.011
bits-0.011
bit-0.010
Token mer
Token ID4017
Feature activation-5.55e-02

Pos logit contributions
 mental+0.001
cks+0.001
rily+0.001
bits+0.001
bit+0.001
Neg logit contributions
der-0.001
 tr-0.001
 Lu-0.001
 escal-0.001
Ste-0.001
Tokenrily
Token ID28904
Feature activation+2.64e-01

Pos logit contributions
der+0.091
 tr+0.084
 Lu+0.082
 pian+0.081
Stud+0.080
Neg logit contributions
 mental-0.021
bits-0.014
cks-0.012
bit-0.012
 acquaintance-0.010
Token away
Token ID1497
Feature activation+1.54e-02

Pos logit contributions
 mental+0.416
cks+0.384
 acquaintance+0.379
bits+0.375
ulously+0.370
Neg logit contributions
der-0.168
 tr-0.132
 Lu-0.090
 inside-0.080
 just-0.080
Token into
Token ID656
Feature activation-2.76e-02

Pos logit contributions
 mental+0.016
rily+0.016
cks+0.015
bits+0.015
bit+0.014
Neg logit contributions
der-0.013
Stud-0.012
 tr-0.012
 Lu-0.012
Ste-0.012
Token the
Token ID262
Feature activation-5.02e-03

Pos logit contributions
 Lu+0.016
der+0.016
 tr+0.015
 fog+0.014
 pian+0.014
Neg logit contributions
 mental-0.037
rily-0.036
bit-0.035
cks-0.035
bits-0.034
Token sunset
Token ID26428
Feature activation-1.38e-01

Pos logit contributions
der+0.004
Stud+0.004
 tr+0.004
 Lu+0.004
Ste+0.004
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.005
bits-0.005
bit-0.005
Token.
Token ID13
Feature activation-9.67e-03

Pos logit contributions
der+0.101
 tr+0.091
 Lu+0.083
 pian+0.078
 escal+0.077
Neg logit contributions
 mental-0.178
cks-0.164
rily-0.164
bits-0.160
bit-0.156
Token It
Token ID632
Feature activation-1.49e-02

Pos logit contributions
 mental+0.010
rily+0.010
cks+0.009
bits+0.009
bit+0.009
Neg logit contributions
der-0.009
 tr-0.008
Stud-0.008
 Lu-0.008
Ste-0.008
Token turns
Token ID4962
Feature activation+6.18e-03

Pos logit contributions
der+0.017
Stud+0.016
 Lu+0.016
Mad+0.016
Ste+0.016
Neg logit contributions
 mental-0.011
cks-0.011
rily-0.011
bit-0.010
bits-0.010
Token around
Token ID1088
Feature activation-4.58e-02

Pos logit contributions
 mental+0.007
rily+0.006
cks+0.006
bits+0.006
bit+0.006
Neg logit contributions
der-0.006
 tr-0.005
 Lu-0.005
Stud-0.005
Ste-0.005
Token and
Token ID290
Feature activation-2.99e-03

Pos logit contributions
der+0.013
 tr+0.013
 just+0.010
 Lu+0.009
 fog+0.008
Neg logit contributions
 mental-0.083
 acquaintance-0.078
rily-0.076
bel-0.074
bits-0.074
Token pe
Token ID613
Feature activation-1.03e-01

Pos logit contributions
der+0.003
 tr+0.003
 Lu+0.003
Stud+0.003
 pian+0.003
Neg logit contributions
 mental-0.003
rily-0.003
bits-0.003
bit-0.002
cks-0.002
Tokencks
Token ID4657
Feature activation+2.53e-01

Pos logit contributions
der+0.138
 tr+0.131
 Lu+0.128
Stud+0.126
 pian+0.125
Neg logit contributions
 mental-0.064
rily-0.049
 Day-0.046
 acquaintance-0.044
bits-0.043
Token Tom
Token ID4186
Feature activation+4.32e-02

Pos logit contributions
 mental+0.377
rily+0.361
 acquaintance+0.360
bel+0.357
bits+0.352
Neg logit contributions
der-0.122
 tr-0.105
 Lu-0.082
 pian-0.079
 escal-0.074
Token on
Token ID319
Feature activation+3.75e-02

Pos logit contributions
 mental+0.079
bel+0.076
umm+0.076
 choice+0.075
rily+0.073
Neg logit contributions
 tr-0.014
 r-0.009
 cl-0.008
 just-0.007
 p-0.006
Token the
Token ID262
Feature activation+1.49e-02

Pos logit contributions
 mental+0.041
rily+0.039
cks+0.037
bits+0.037
bit+0.036
Neg logit contributions
der-0.033
 tr-0.030
 Lu-0.029
Stud-0.028
Ste-0.028
Token nose
Token ID9686
Feature activation+7.28e-02

Pos logit contributions
 mental+0.017
rily+0.016
bits+0.015
cks+0.015
bel+0.015
Neg logit contributions
der-0.013
 tr-0.013
 escal-0.011
 Lu-0.011
 pian-0.011
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
 mental+0.117
cks+0.108
 acquaintance+0.106
bel+0.105
rily+0.104
Neg logit contributions
der-0.042
 tr-0.030
 Lu-0.021
 bud-0.019
 pian-0.019
Token the
Token ID262
Feature activation+1.45e-02

Pos logit contributions
 mental+0.013
rily+0.012
bits+0.012
cks+0.011
bit+0.011
Neg logit contributions
der-0.009
 Lu-0.008
 tr-0.008
Stud-0.008
Ste-0.007
Token tree
Token ID5509
Feature activation-1.88e-02

Pos logit contributions
 mental+0.017
rily+0.017
bits+0.016
cks+0.016
bit+0.015
Neg logit contributions
der-0.011
 tr-0.010
 Lu-0.010
Stud-0.010
 escal-0.009
Token.
Token ID13
Feature activation+2.61e-02

Pos logit contributions
der+0.007
 tr+0.005
 r+0.004
 p+0.004
 just+0.003
Neg logit contributions
 mental-0.033
rily-0.030
 acquaintance-0.029
bits-0.029
bit-0.029
Token It
Token ID632
Feature activation-1.36e-02

Pos logit contributions
 mental+0.028
rily+0.027
cks+0.026
bits+0.026
bit+0.025
Neg logit contributions
der-0.023
 tr-0.020
 Lu-0.020
Stud-0.020
Ste-0.019
Token pe
Token ID613
Feature activation-1.04e-01

Pos logit contributions
der+0.016
Stud+0.015
 Lu+0.015
 tr+0.015
Ste+0.015
Neg logit contributions
 mental-0.010
rily-0.009
cks-0.009
bits-0.009
bit-0.008
Tokencks
Token ID4657
Feature activation+2.53e-01

Pos logit contributions
der+0.156
 tr+0.147
Stud+0.144
leaf+0.142
 Lu+0.141
Neg logit contributions
 mental-0.047
rily-0.038
 Day-0.030
bits-0.030
bit-0.028
Token Anna
Token ID11735
Feature activation+4.03e-02

Pos logit contributions
 mental+0.330
bel+0.310
 acquaintance+0.310
rily+0.303
bits+0.301
Neg logit contributions
der-0.181
 tr-0.153
 Lu-0.133
 pian-0.133
 r-0.127
Token on
Token ID319
Feature activation+3.63e-02

Pos logit contributions
 mental+0.066
bel+0.060
rily+0.060
 choice+0.058
 acquaintance+0.058
Neg logit contributions
 tr-0.017
der-0.016
 r-0.013
 pian-0.011
 cl-0.010
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
 mental+0.035
rily+0.033
cks+0.032
bits+0.031
bit+0.030
Neg logit contributions
der-0.036
Stud-0.033
 tr-0.033
 Lu-0.032
Ste-0.032
Token head
Token ID1182
Feature activation+5.36e-02

Pos logit contributions
 mental+0.016
rily+0.015
cks+0.014
bits+0.014
bit+0.014
Neg logit contributions
der-0.014
Stud-0.013
 Lu-0.013
Ste-0.013
 tr-0.013
Token.
Token ID13
Feature activation+2.67e-02

Pos logit contributions
 mental+0.086
rily+0.076
 acquaintance+0.076
bits+0.076
cks+0.074
Neg logit contributions
der-0.029
 tr-0.020
 Lu-0.017
Ju-0.015
 pian-0.015
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
 mental+0.008
rily+0.007
cks+0.007
bits+0.007
bit+0.007
Neg logit contributions
der-0.008
 tr-0.007
Stud-0.007
 pian-0.007
 Lu-0.007
Token box
Token ID3091
Feature activation+5.00e-02

Pos logit contributions
 mental+0.015
rily+0.014
cks+0.014
bits+0.014
bit+0.013
Neg logit contributions
der-0.013
 tr-0.012
 Lu-0.011
 escal-0.011
Stud-0.011
Token.
Token ID13
Feature activation+2.11e-02

Pos logit contributions
 mental+0.097
bit+0.088
 acquaintance+0.088
ulously+0.087
 breakthrough+0.087
Neg logit contributions
der-0.011
 tr-0.003
 bud-0.002
 inside-0.001
.-0.000
Token It
Token ID632
Feature activation-1.34e-02

Pos logit contributions
 mental+0.029
bits+0.028
rily+0.028
cks+0.028
bit+0.026
Neg logit contributions
der-0.013
 Lu-0.012
Ste-0.011
Stud-0.011
 tr-0.010
Token pe
Token ID613
Feature activation-1.05e-01

Pos logit contributions
der+0.014
 tr+0.013
Stud+0.013
 Lu+0.013
Ste+0.013
Neg logit contributions
 mental-0.012
rily-0.011
cks-0.011
bits-0.011
bit-0.010
Tokencks
Token ID4657
Feature activation+2.51e-01

Pos logit contributions
der+0.128
 tr+0.117
leaf+0.116
Ju+0.116
 archae+0.116
Neg logit contributions
 mental-0.071
 Day-0.054
rily-0.053
 acquaintance-0.053
bits-0.052
Token the
Token ID262
Feature activation+1.49e-02

Pos logit contributions
 mental+0.360
bel+0.347
 acquaintance+0.345
rily+0.336
bits+0.333
Neg logit contributions
der-0.141
 tr-0.119
 r-0.096
 Lu-0.092
 escal-0.090
Token bread
Token ID8509
Feature activation+5.18e-02

Pos logit contributions
 mental+0.015
bits+0.015
rily+0.014
cks+0.013
bel+0.013
Neg logit contributions
 tr-0.014
der-0.014
Ju-0.012
leaf-0.012
 escal-0.012
Token.
Token ID13
Feature activation+2.21e-02

Pos logit contributions
 mental+0.084
 acquaintance+0.078
ulously+0.076
bel+0.075
 breakthrough+0.075
Neg logit contributions
der-0.027
 tr-0.020
 bud-0.017
 r-0.015
 p-0.014
Token
Token ID198
Feature activation+1.07e-02

Pos logit contributions
 mental+0.030
bits+0.029
rily+0.029
cks+0.029
bit+0.028
Neg logit contributions
der-0.014
 Lu-0.012
Ste-0.011
Stud-0.011
 tr-0.011
Token
Token ID198
Feature activation+1.09e-02

Pos logit contributions
rily+0.013
 mental+0.013
bits+0.013
cks+0.013
bit+0.013
Neg logit contributions
der-0.007
 Lu-0.006
Stud-0.006
 tr-0.006
Ste-0.006
Token the
Token ID262
Feature activation+8.65e-03

Pos logit contributions
 mental+0.011
rily+0.010
bel+0.010
cks+0.010
bit+0.009
Neg logit contributions
 Lu-0.005
 tr-0.004
der-0.004
 escal-0.004
Ju-0.004
Token snow
Token ID6729
Feature activation-3.11e-02

Pos logit contributions
 mental+0.010
rily+0.010
cks+0.010
bits+0.010
bit+0.009
Neg logit contributions
der-0.007
 tr-0.006
 Lu-0.006
 escal-0.006
Stud-0.005
Tokenman
Token ID805
Feature activation+1.35e-01

Pos logit contributions
der+0.036
 tr+0.032
 Lu+0.028
Ju+0.028
 archae+0.027
Neg logit contributions
 mental-0.031
rily-0.025
cks-0.025
bits-0.024
ulously-0.024
Token and
Token ID290
Feature activation-1.99e-03

Pos logit contributions
 mental+0.239
bel+0.225
bits+0.224
ulously+0.221
bit+0.219
Neg logit contributions
 tr-0.035
der-0.027
 p-0.024
 just-0.023
 inside-0.020
Token pe
Token ID613
Feature activation-1.07e-01

Pos logit contributions
der+0.002
 tr+0.002
Stud+0.002
 Lu+0.002
Ste+0.002
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.001
bit-0.001
Tokencks
Token ID4657
Feature activation+2.50e-01

Pos logit contributions
der+0.123
 tr+0.119
 Lu+0.114
 archae+0.113
 pian+0.111
Neg logit contributions
 mental-0.082
 Day-0.065
rily-0.062
bits-0.061
 acquaintance-0.060
Token at
Token ID379
Feature activation-7.90e-02

Pos logit contributions
 mental+0.350
rily+0.334
bits+0.326
bel+0.320
 acquaintance+0.317
Neg logit contributions
der-0.142
 tr-0.123
 Lu-0.107
 pian-0.105
 escal-0.103
Token it
Token ID340
Feature activation+2.35e-03

Pos logit contributions
der+0.054
 Lu+0.051
 tr+0.049
 escal+0.045
Stud+0.043
Neg logit contributions
 mental-0.102
rily-0.095
bel-0.092
bits-0.091
bit-0.089
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
 mental+0.004
 acquaintance+0.003
bits+0.003
bel+0.003
bit+0.003
Neg logit contributions
der-0.001
 tr-0.001
 r-0.001
 p-0.001
 just-0.001
Token It
Token ID632
Feature activation-1.69e-02

Pos logit contributions
 mental+0.014
rily+0.013
bits+0.013
cks+0.013
bit+0.012
Neg logit contributions
der-0.009
 Lu-0.008
Stud-0.008
 tr-0.008
Ste-0.008
Token knocks
Token ID36539
Feature activation+8.60e-02

Pos logit contributions
der+0.020
Mad+0.019
Stud+0.019
 Lu+0.019
Ste+0.019
Neg logit contributions
cks-0.013
 mental-0.012
rily-0.012
bit-0.011
bits-0.010
Token It
Token ID632
Feature activation-1.58e-02

Pos logit contributions
 mental+0.032
bits+0.031
rily+0.031
cks+0.031
bel+0.030
Neg logit contributions
der-0.011
 Lu-0.008
 pian-0.006
 Mind-0.006
Stud-0.006
Token flies
Token ID17607
Feature activation+1.28e-01

Pos logit contributions
der+0.017
 tr+0.015
Stud+0.015
 Lu+0.015
Ste+0.015
Neg logit contributions
 mental-0.014
rily-0.014
cks-0.013
bits-0.013
bit-0.012
Token down
Token ID866
Feature activation+6.89e-03

Pos logit contributions
 mental+0.230
 acquaintance+0.212
uces+0.201
 supervision+0.195
astered+0.195
Neg logit contributions
 inside-0.051
der-0.046
 just-0.044
 tr-0.042
 over-0.041
Token and
Token ID290
Feature activation+4.20e-03

Pos logit contributions
 mental+0.011
 acquaintance+0.010
rily+0.010
bit+0.009
cks+0.009
Neg logit contributions
der-0.004
 tr-0.003
 fog-0.003
 Lu-0.003
 r-0.003
Token pe
Token ID613
Feature activation-1.05e-01

Pos logit contributions
 mental+0.004
rily+0.004
bits+0.003
bit+0.003
cks+0.003
Neg logit contributions
der-0.005
 tr-0.004
Stud-0.004
 pian-0.004
 Lu-0.004
Tokencks
Token ID4657
Feature activation+2.50e-01

Pos logit contributions
der+0.114
 archae+0.097
 kn+0.097
 tr+0.095
Stud+0.095
Neg logit contributions
 mental-0.093
 Day-0.080
 acquaintance-0.079
 eater-0.076
 Whenever-0.075
Token at
Token ID379
Feature activation-7.81e-02

Pos logit contributions
 mental+0.369
 acquaintance+0.354
bel+0.344
bits+0.341
rily+0.338
Neg logit contributions
der-0.139
 tr-0.109
 r-0.093
 pian-0.091
 fog-0.085
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
der+0.047
 Lu+0.038
 lightning+0.035
 tr+0.034
 fog+0.032
Neg logit contributions
 mental-0.119
bel-0.109
rily-0.104
astered-0.102
bits-0.102
Token coin
Token ID10752
Feature activation+4.75e-02

Pos logit contributions
 mental+0.017
bits+0.016
rily+0.015
cks+0.015
bel+0.015
Neg logit contributions
der-0.010
 tr-0.010
 p-0.009
 fog-0.009
 bud-0.008
Token.
Token ID13
Feature activation+2.20e-02

Pos logit contributions
 mental+0.089
 acquaintance+0.086
ulously+0.082
bits+0.081
 breakthrough+0.081
Neg logit contributions
der-0.018
 tr-0.008
 bud-0.008
 inside-0.006
 just-0.004
Token It
Token ID632
Feature activation-1.49e-02

Pos logit contributions
 mental+0.034
bits+0.032
rily+0.032
cks+0.032
bel+0.031
Neg logit contributions
der-0.013
 Lu-0.010
Ste-0.008
 BO-0.008
 pian-0.008
TokenR
Token ID49
Feature activation-9.53e-02

Pos logit contributions
der+0.007
 tr+0.006
 Lu+0.006
Stud+0.006
Ste+0.006
Neg logit contributions
 mental-0.012
rily-0.012
cks-0.011
bits-0.011
bit-0.011
Tokenib
Token ID571
Feature activation+8.22e-02

Pos logit contributions
der+0.108
 tr+0.098
 Lu+0.094
Stud+0.094
Ju+0.093
Neg logit contributions
 mental-0.085
rily-0.081
bits-0.071
cks-0.070
bel-0.069
Tokenbit
Token ID2545
Feature activation+2.49e-01

Pos logit contributions
 mental+0.048
bel+0.037
rily+0.037
 granting+0.032
bones+0.032
Neg logit contributions
der-0.123
 tr-0.113
 bud-0.104
arre-0.102
Ste-0.098
Token,
Token ID11
Feature activation+9.16e-03

Pos logit contributions
rily+0.381
 mental+0.379
bel+0.363
bits+0.362
 acquaintance+0.354
Neg logit contributions
der-0.108
 tr-0.104
 bud-0.082
ha-0.073
 Lu-0.073
Token rib
Token ID12183
Feature activation-8.12e-02

Pos logit contributions
 mental+0.010
rily+0.010
bits+0.009
cks+0.009
bit+0.009
Neg logit contributions
der-0.008
 tr-0.007
 Lu-0.007
 pian-0.007
Stud-0.007
Tokenbit
Token ID2545
Feature activation+2.49e-01

Pos logit contributions
der+0.130
 tr+0.125
Ste+0.122
Stud+0.120
 Lu+0.119
Neg logit contributions
rily-0.027
 mental-0.025
bel-0.020
 acquaintance-0.012
cks-0.011
Token!
Token ID0
Feature activation+1.80e-02

Pos logit contributions
 mental+0.367
rily+0.358
bits+0.345
bel+0.343
 acquaintance+0.336
Neg logit contributions
der-0.126
 tr-0.114
 Lu-0.093
Ste-0.088
 bud-0.087
Token Go
Token ID1514
Feature activation+7.65e-02

Pos logit contributions
 mental+0.024
cks+0.023
bits+0.023
bel+0.023
ulously+0.022
Neg logit contributions
der-0.015
 tr-0.011
 Lu-0.011
 Pupp-0.010
Ju-0.010
Token away
Token ID1497
Feature activation+3.24e-02

Pos logit contributions
rily+0.091
 mental+0.091
cks+0.084
bits+0.079
 acquaintance+0.076
Neg logit contributions
 tr-0.058
 Lu-0.054
Ste-0.053
Ju-0.053
der-0.052
Token,
Token ID11
Feature activation+9.32e-03

Pos logit contributions
 mental+0.037
rily+0.036
cks+0.035
bits+0.034
bit+0.033
Neg logit contributions
der-0.026
 tr-0.023
Stud-0.023
 Lu-0.023
Ste-0.022
Token go
Token ID467
Feature activation+9.30e-02

Pos logit contributions
rily+0.011
bits+0.011
 mental+0.011
cks+0.010
bel+0.010
Neg logit contributions
 tr-0.008
der-0.007
 Lu-0.006
Ste-0.006
 archae-0.006
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
 mental+0.027
rily+0.026
bits+0.025
cks+0.025
bit+0.024
Neg logit contributions
der-0.019
 Lu-0.016
Stud-0.016
 tr-0.016
Ste-0.015
Token
Token ID198
Feature activation+1.16e-02

Pos logit contributions
rily+0.017
 mental+0.016
bits+0.015
bit+0.015
cks+0.015
Neg logit contributions
der-0.006
 Lu-0.005
<-0.005
Stud-0.005
 tr-0.005
Token"
Token ID1
Feature activation-9.70e-03

Pos logit contributions
 mental+0.012
cks+0.012
rily+0.011
bits+0.011
bit+0.011
Neg logit contributions
der-0.011
 tr-0.009
 Lu-0.009
 pian-0.009
Stud-0.009
TokenR
Token ID49
Feature activation-9.53e-02

Pos logit contributions
der+0.007
 tr+0.006
 Lu+0.006
Stud+0.006
Ste+0.006
Neg logit contributions
 mental-0.012
rily-0.012
cks-0.011
bits-0.011
bit-0.011
Tokenib
Token ID571
Feature activation+8.22e-02

Pos logit contributions
der+0.108
 tr+0.098
 Lu+0.094
Stud+0.094
Ju+0.093
Neg logit contributions
 mental-0.085
rily-0.081
bits-0.071
cks-0.070
bel-0.069
Tokenbit
Token ID2545
Feature activation+2.49e-01

Pos logit contributions
 mental+0.048
bel+0.037
rily+0.037
 granting+0.032
bones+0.032
Neg logit contributions
der-0.123
 tr-0.113
 bud-0.104
arre-0.102
Ste-0.098
Token,
Token ID11
Feature activation+9.16e-03

Pos logit contributions
rily+0.381
 mental+0.379
bel+0.363
bits+0.362
 acquaintance+0.354
Neg logit contributions
der-0.108
 tr-0.104
 bud-0.082
ha-0.073
 Lu-0.073
Token rib
Token ID12183
Feature activation-8.12e-02

Pos logit contributions
 mental+0.010
rily+0.010
bits+0.009
cks+0.009
bit+0.009
Neg logit contributions
der-0.008
 tr-0.007
 Lu-0.007
 pian-0.007
Stud-0.007
Tokenbit
Token ID2545
Feature activation+2.49e-01

Pos logit contributions
der+0.130
 tr+0.125
Ste+0.122
Stud+0.120
 Lu+0.119
Neg logit contributions
rily-0.027
 mental-0.025
bel-0.020
 acquaintance-0.012
cks-0.011
Token!
Token ID0
Feature activation+1.80e-02

Pos logit contributions
 mental+0.367
rily+0.358
bits+0.345
bel+0.343
 acquaintance+0.336
Neg logit contributions
der-0.126
 tr-0.114
 Lu-0.093
Ste-0.088
 bud-0.087
Token Go
Token ID1514
Feature activation+7.65e-02

Pos logit contributions
 mental+0.024
cks+0.023
bits+0.023
bel+0.023
ulously+0.022
Neg logit contributions
der-0.015
 tr-0.011
 Lu-0.011
 Pupp-0.010
Ju-0.010
Token truck
Token ID7779
Feature activation+7.99e-03

Pos logit contributions
 mental+0.011
rily+0.010
cks+0.009
bits+0.009
bit+0.009
Neg logit contributions
der-0.011
Stud-0.010
 Lu-0.010
 tr-0.009
 escal-0.009
Token like
Token ID588
Feature activation-1.52e-02

Pos logit contributions
 mental+0.012
rily+0.012
cks+0.011
bits+0.011
ulously+0.011
Neg logit contributions
der-0.005
 tr-0.004
 pian-0.004
 Lu-0.003
Ju-0.003
Token Ben
Token ID3932
Feature activation+9.57e-02

Pos logit contributions
der+0.010
 tr+0.010
 pian+0.009
 Lu+0.009
 escal+0.009
Neg logit contributions
 mental-0.021
rily-0.020
bits-0.019
 acquaintance-0.019
cks-0.019
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 mental+0.160
bel+0.154
bits+0.151
ulously+0.147
rily+0.146
Neg logit contributions
der-0.036
 tr-0.031
 pian-0.021
Ste-0.021
 just-0.021
Token Tru
Token ID13616
Feature activation-1.41e-01

Pos logit contributions
 mental+0.021
rily+0.020
cks+0.019
bits+0.019
bit+0.018
Neg logit contributions
der-0.015
 tr-0.013
 Lu-0.013
Stud-0.013
 pian-0.013
Tokencks
Token ID4657
Feature activation+2.49e-01

Pos logit contributions
der+0.250
 tr+0.239
 Lu+0.236
Stud+0.236
Ste+0.233
Neg logit contributions
 mental-0.024
rily-0.018
cks-0.013
bits-0.011
bit-0.007
Token are
Token ID389
Feature activation-4.15e-02

Pos logit contributions
 mental+0.345
rily+0.333
bel+0.323
 acquaintance+0.323
bits+0.315
Neg logit contributions
der-0.147
 tr-0.138
 pian-0.112
 escal-0.107
 Lu-0.105
Token fun
Token ID1257
Feature activation-2.27e-02

Pos logit contributions
der+0.030
 tr+0.029
 pian+0.027
 Lu+0.026
Stud+0.025
Neg logit contributions
 mental-0.051
rily-0.051
bits-0.049
 acquaintance-0.045
cks-0.045
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
der+0.014
 tr+0.013
 pian+0.012
 Lu+0.011
 escal+0.011
Neg logit contributions
 mental-0.033
rily-0.030
cks-0.029
bits-0.029
 acquaintance-0.028
Token Doll
Token ID22209
Feature activation-9.78e-03

Pos logit contributions
 mental+0.015
bit+0.014
rily+0.014
cks+0.014
bits+0.013
Neg logit contributions
der-0.020
Stud-0.018
 tr-0.018
arre-0.018
 escal-0.018
Tokens
Token ID82
Feature activation+1.15e-03

Pos logit contributions
der+0.007
 tr+0.006
 pian+0.006
 Lu+0.006
Ju+0.005
Neg logit contributions
rily-0.013
 mental-0.012
 acquaintance-0.011
bits-0.011
ulously-0.011
Token It
Token ID632
Feature activation-1.99e-02

Pos logit contributions
 mental+0.006
rily+0.006
cks+0.005
bits+0.005
bit+0.005
Neg logit contributions
der-0.005
 tr-0.005
Stud-0.005
 Lu-0.005
Ste-0.005
Token turns
Token ID4962
Feature activation+5.40e-04

Pos logit contributions
der+0.023
Stud+0.022
 Lu+0.022
Mad+0.022
Ste+0.022
Neg logit contributions
 mental-0.015
cks-0.014
rily-0.014
bit-0.013
bits-0.013
Token around
Token ID1088
Feature activation-5.00e-02

Pos logit contributions
 mental+0.001
rily+0.001
cks+0.001
bits+0.001
bit+0.001
Neg logit contributions
der-0.000
 tr-0.000
 Lu-0.000
Stud-0.000
Ste-0.000
Token and
Token ID290
Feature activation-6.75e-03

Pos logit contributions
der+0.012
 tr+0.012
 just+0.010
 Lu+0.009
 r+0.009
Neg logit contributions
 mental-0.093
 acquaintance-0.088
rily-0.085
bel-0.084
bits-0.082
Token pe
Token ID613
Feature activation-1.09e-01

Pos logit contributions
der+0.007
 tr+0.007
 Lu+0.006
Stud+0.006
 pian+0.006
Neg logit contributions
 mental-0.006
rily-0.006
bits-0.006
bit-0.006
cks-0.005
Tokencks
Token ID4657
Feature activation+2.49e-01

Pos logit contributions
der+0.147
 tr+0.140
 Lu+0.136
Stud+0.135
 pian+0.134
Neg logit contributions
 mental-0.065
rily-0.051
 Day-0.046
 acquaintance-0.044
bits-0.043
Token Tom
Token ID4186
Feature activation+3.91e-02

Pos logit contributions
 mental+0.370
rily+0.353
 acquaintance+0.350
bel+0.348
bits+0.344
Neg logit contributions
der-0.123
 tr-0.103
 Lu-0.082
 pian-0.080
 escal-0.073
Token on
Token ID319
Feature activation+3.21e-02

Pos logit contributions
 mental+0.072
bel+0.070
 choice+0.069
umm+0.068
 acquaintance+0.067
Neg logit contributions
 tr-0.012
 r-0.008
 cl-0.007
 bar-0.006
 just-0.005
Token the
Token ID262
Feature activation+1.01e-02

Pos logit contributions
 mental+0.033
rily+0.032
cks+0.031
bits+0.030
bit+0.029
Neg logit contributions
der-0.030
 tr-0.027
 Lu-0.026
Stud-0.026
Ste-0.026
Token nose
Token ID9686
Feature activation+6.88e-02

Pos logit contributions
 mental+0.011
rily+0.011
bits+0.010
cks+0.010
bel+0.010
Neg logit contributions
der-0.009
 tr-0.008
 Lu-0.007
 escal-0.007
 pian-0.007
Token.
Token ID13
Feature activation+7.47e-03

Pos logit contributions
 mental+0.108
cks+0.099
 acquaintance+0.097
bel+0.096
rily+0.095
Neg logit contributions
der-0.041
 tr-0.030
 Lu-0.022
 pian-0.021
 bud-0.020
Token other
Token ID584
Feature activation-6.13e-02

Pos logit contributions
rily+0.009
 mental+0.009
bits+0.009
cks+0.008
bit+0.008
Neg logit contributions
 church-0.003
der-0.003
 lightning-0.003
 fog-0.003
 archae-0.003
Token birds
Token ID10087
Feature activation-2.17e-03

Pos logit contributions
der+0.047
 tr+0.039
 archae+0.038
 pian+0.037
 optical+0.037
Neg logit contributions
 mental-0.079
rily-0.075
bits-0.070
cks-0.067
bel-0.066
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
der+0.001
 tr+0.001
 Lu+0.001
Ste+0.001
 pian+0.001
Neg logit contributions
 mental-0.003
rily-0.003
bits-0.003
bit-0.002
cks-0.002
Token It
Token ID632
Feature activation-2.01e-02

Pos logit contributions
 mental+0.013
bits+0.013
rily+0.013
cks+0.012
bit+0.012
Neg logit contributions
der-0.007
 Lu-0.007
Ste-0.006
Stud-0.006
 pian-0.006
Token pe
Token ID613
Feature activation-1.11e-01

Pos logit contributions
der+0.024
Stud+0.023
Mad+0.023
 Lu+0.023
Ste+0.023
Neg logit contributions
 mental-0.014
cks-0.014
rily-0.013
bit-0.012
bits-0.011
Tokencks
Token ID4657
Feature activation+2.47e-01

Pos logit contributions
der+0.147
 tr+0.134
leaf+0.133
Stud+0.132
 Lu+0.132
Neg logit contributions
 mental-0.072
rily-0.057
bits-0.052
 Day-0.051
bit-0.049
Token at
Token ID379
Feature activation-8.28e-02

Pos logit contributions
 mental+0.352
rily+0.330
bits+0.323
 acquaintance+0.322
bel+0.319
Neg logit contributions
der-0.145
 tr-0.120
 pian-0.105
 Lu-0.101
 escal-0.096
Token the
Token ID262
Feature activation+6.36e-03

Pos logit contributions
der+0.057
 Lu+0.051
 tr+0.047
Ste+0.045
 church+0.043
Neg logit contributions
 mental-0.111
rily-0.102
bel-0.102
bits-0.100
cks-0.095
Token seeds
Token ID11904
Feature activation+1.57e-02

Pos logit contributions
 mental+0.008
rily+0.008
bits+0.007
cks+0.007
 acquaintance+0.007
Neg logit contributions
der-0.005
 tr-0.004
 archae-0.004
 fog-0.004
 pian-0.004
Token and
Token ID290
Feature activation-2.23e-03

Pos logit contributions
 mental+0.025
 acquaintance+0.022
bits+0.022
bel+0.022
bit+0.022
Neg logit contributions
der-0.009
 tr-0.006
 r-0.005
 p-0.005
 bud-0.004
Token makes
Token ID1838
Feature activation+5.94e-02

Pos logit contributions
der+0.002
Stud+0.002
Ste+0.002
 Lu+0.002
Mad+0.002
Neg logit contributions
 mental-0.002
cks-0.002
rily-0.002
bit-0.002
bits-0.002
Token
Token ID220
Feature activation+1.52e-04

Pos logit contributions
der+0.001
 tr+0.001
 Lu+0.001
 Pupp+0.001
Ju+0.001
Neg logit contributions
 mental-0.002
cks-0.002
bel-0.002
rily-0.002
bits-0.002
Token
Token ID198
Feature activation-1.69e-03

Pos logit contributions
 mental+0.000
rily+0.000
bits+0.000
bel+0.000
cks+0.000
Neg logit contributions
der-0.000
Ste-0.000
 Lu-0.000
Stud-0.000
 tr-0.000
Token
Token ID198
Feature activation-1.43e-03

Pos logit contributions
der+0.001
 tr+0.001
Stud+0.001
Ste+0.001
Ju+0.001
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.002
 supervision-0.002
TokenR
Token ID49
Feature activation-9.67e-02

Pos logit contributions
der+0.001
Ste+0.001
 tr+0.001
Ju+0.001
Stud+0.001
Neg logit contributions
rily-0.002
 mental-0.002
cks-0.002
 acquaintance-0.002
 supervision-0.002
Tokenib
Token ID571
Feature activation+7.93e-02

Pos logit contributions
der+0.091
Ste+0.077
arre+0.072
 archae+0.071
Stud+0.071
Neg logit contributions
 mental-0.108
bel-0.102
 granting-0.097
bits-0.094
rily-0.094
Tokenbit
Token ID2545
Feature activation+2.46e-01

Pos logit contributions
 mental+0.052
rily+0.041
bones+0.038
cks+0.037
 granting+0.037
Neg logit contributions
der-0.115
 tr-0.102
Ste-0.092
 archae-0.091
arre-0.091
Token said
Token ID531
Feature activation-4.30e-03

Pos logit contributions
 mental+0.278
rily+0.267
cks+0.260
bits+0.255
bel+0.240
Neg logit contributions
der-0.212
 tr-0.189
Ste-0.176
 Lu-0.175
Stud-0.172
Token,
Token ID11
Feature activation-1.17e-03

Pos logit contributions
der+0.002
 tr+0.002
 Lu+0.002
Ste+0.001
 Pupp+0.001
Neg logit contributions
cks-0.006
 mental-0.006
bits-0.006
rily-0.006
 acquaintance-0.006
Token "
Token ID366
Feature activation-1.22e-02

Pos logit contributions
der+0.001
 tr+0.001
Stud+0.001
 Lu+0.001
Ste+0.001
Neg logit contributions
 mental-0.001
rily-0.001
cks-0.001
bits-0.001
bit-0.001
TokenWould
Token ID17353
Feature activation+1.70e-01

Pos logit contributions
Ste+0.006
Stud+0.005
Ju+0.005
Bon+0.005
Sh+0.005
Neg logit contributions
 mental-0.020
rily-0.019
 lair-0.019
cks-0.019
 supervision-0.019
Token you
Token ID345
Feature activation+1.55e-02

Pos logit contributions
rily+0.282
 lair+0.264
 mental+0.253
bits+0.250
 supervision+0.250
Neg logit contributions
Ste-0.084
Rob-0.068
Bon-0.067
 tr-0.062
O-0.061
Token to
Token ID284
Feature activation+3.49e-03

Pos logit contributions
 mental+0.002
rily+0.001
cks+0.001
bits+0.001
bit+0.001
Neg logit contributions
der-0.001
 tr-0.001
 Lu-0.001
Ste-0.001
Stud-0.001
Token the
Token ID262
Feature activation+5.43e-03

Pos logit contributions
 mental+0.004
rily+0.004
bits+0.003
cks+0.003
bit+0.003
Neg logit contributions
der-0.003
 tr-0.003
 Lu-0.003
Stud-0.003
Ste-0.003
Token ball
Token ID2613
Feature activation+4.05e-02

Pos logit contributions
 mental+0.006
rily+0.005
cks+0.005
bits+0.005
bit+0.005
Neg logit contributions
der-0.005
 tr-0.004
Stud-0.004
 Lu-0.004
Ste-0.004
Token and
Token ID290
Feature activation-1.15e-03

Pos logit contributions
 mental+0.071
 acquaintance+0.064
rily+0.062
bits+0.062
bel+0.061
Neg logit contributions
der-0.020
 tr-0.013
 r-0.009
 p-0.008
 bud-0.008
Token pe
Token ID613
Feature activation-1.12e-01

Pos logit contributions
der+0.001
Stud+0.001
 Lu+0.001
 tr+0.001
Ste+0.001
Neg logit contributions
 mental-0.001
rily-0.001
cks-0.001
bits-0.001
bit-0.001
Tokencks
Token ID4657
Feature activation+2.46e-01

Pos logit contributions
der+0.152
 tr+0.143
Stud+0.140
leaf+0.139
Ju+0.139
Neg logit contributions
 mental-0.064
 Day-0.047
rily-0.047
bits-0.043
 acquaintance-0.042
Token it
Token ID340
Feature activation+3.15e-04

Pos logit contributions
 mental+0.358
rily+0.338
bits+0.334
bel+0.331
 acquaintance+0.331
Neg logit contributions
der-0.132
 tr-0.108
 pian-0.090
 Lu-0.089
 escal-0.086
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
 mental+0.000
 acquaintance+0.000
bits+0.000
rily+0.000
bel+0.000
Neg logit contributions
der-0.000
 tr-0.000
 r-0.000
 pian-0.000
 escal-0.000
Token The
Token ID383
Feature activation+7.81e-03

Pos logit contributions
 mental+0.021
rily+0.021
bits+0.020
cks+0.020
bit+0.019
Neg logit contributions
der-0.012
 Lu-0.010
Stud-0.010
Ste-0.010
 tr-0.010
Token ball
Token ID2613
Feature activation+4.14e-02

Pos logit contributions
rily+0.010
 mental+0.010
cks+0.009
bits+0.009
bel+0.009
Neg logit contributions
der-0.006
 tr-0.006
 pian-0.005
 fog-0.005
 lightning-0.005
Token makes
Token ID1838
Feature activation+5.91e-02

Pos logit contributions
 mental+0.038
rily+0.035
bits+0.033
 acquaintance+0.031
bit+0.030
Neg logit contributions
der-0.048
 tr-0.044
 r-0.039
 pian-0.039
 fog-0.038
Token girl
Token ID2576
Feature activation+8.07e-03

Pos logit contributions
der+0.007
 tr+0.007
 Lu+0.006
 pian+0.006
Stud+0.006
Neg logit contributions
cks-0.010
 mental-0.010
rily-0.010
bits-0.010
bit-0.010
Token felt
Token ID2936
Feature activation-1.43e-02

Pos logit contributions
cks+0.011
rily+0.011
 mental+0.011
bits+0.011
bit+0.011
Neg logit contributions
 tr-0.005
der-0.004
 Lu-0.004
 r-0.004
 kn-0.004
Token even
Token ID772
Feature activation-1.07e-01

Pos logit contributions
der+0.013
 Lu+0.012
 tr+0.012
Stud+0.012
Ste+0.012
Neg logit contributions
 mental-0.014
rily-0.014
cks-0.014
bits-0.013
bit-0.013
Token p
Token ID279
Feature activation-1.78e-01

Pos logit contributions
der+0.068
 tr+0.063
 r+0.061
 pian+0.057
 Lu+0.057
Neg logit contributions
rily-0.138
bit-0.135
cks-0.135
bits-0.132
 mental-0.129
Tokenrou
Token ID472
Feature activation+5.93e-02

Pos logit contributions
der+0.231
 tr+0.218
Stud+0.216
 Lu+0.215
Ste+0.213
Neg logit contributions
 mental-0.117
rily-0.112
cks-0.106
bits-0.102
bel-0.094
Tokender
Token ID1082
Feature activation-2.90e-01

Pos logit contributions
rily+0.096
 mental+0.094
bits+0.091
cks+0.090
bit+0.089
Neg logit contributions
Stud-0.016
 tr-0.015
 Lu-0.014
der-0.014
 pian-0.013
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
der+0.255
 tr+0.235
Stud+0.230
 Lu+0.230
Ste+0.224
Neg logit contributions
 mental-0.306
rily-0.294
cks-0.283
bits-0.279
bit-0.269
Token She
Token ID1375
Feature activation+3.07e-03

Pos logit contributions
der+0.010
 Lu+0.009
Stud+0.009
 tr+0.009
Ste+0.009
Neg logit contributions
 mental-0.013
rily-0.013
cks-0.013
bits-0.012
bit-0.012
Token was
Token ID373
Feature activation+1.37e-03

Pos logit contributions
 mental+0.003
rily+0.003
cks+0.003
bits+0.003
bit+0.003
Neg logit contributions
der-0.003
 tr-0.002
 Lu-0.002
Stud-0.002
 pian-0.002
Token happy
Token ID3772
Feature activation+3.98e-02

Pos logit contributions
 mental+0.001
rily+0.001
cks+0.001
bit+0.001
bits+0.001
Neg logit contributions
der-0.001
 Lu-0.001
 tr-0.001
Ste-0.001
Stud-0.001
Token that
Token ID326
Feature activation+5.32e-02

Pos logit contributions
 mental+0.050
cks+0.049
rily+0.048
bits+0.047
bit+0.047
Neg logit contributions
der-0.033
 tr-0.027
 Lu-0.026
Stud-0.025
 pian-0.024
Token.
Token ID13
Feature activation-5.84e-03

Pos logit contributions
 mental+0.083
cks+0.081
bits+0.078
 acquaintance+0.077
bit+0.077
Neg logit contributions
der-0.028
 tr-0.024
 just-0.017
 fog-0.017
 escal-0.017
Token Red
Token ID2297
Feature activation-8.03e-02

Pos logit contributions
der+0.006
 pian+0.006
 tr+0.006
leaf+0.005
 sub+0.005
Neg logit contributions
 mental-0.005
rily-0.005
bits-0.004
 Day-0.004
cks-0.004
Token felt
Token ID2936
Feature activation-5.50e-03

Pos logit contributions
 tr+0.051
der+0.050
Ste+0.045
 Lu+0.044
arre+0.040
Neg logit contributions
cks-0.107
rily-0.105
 mental-0.102
bits-0.099
ases-0.098
Token even
Token ID772
Feature activation-1.00e-01

Pos logit contributions
der+0.006
 tr+0.005
Stud+0.005
 Lu+0.005
Ste+0.005
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.005
bits-0.005
bit-0.004
Token sad
Token ID6507
Feature activation+5.45e-02

Pos logit contributions
der+0.051
 tr+0.048
 r+0.046
 fog+0.046
 p+0.042
Neg logit contributions
rily-0.140
bit-0.137
cks-0.135
bits-0.134
ings-0.133
Tokender
Token ID1082
Feature activation-2.88e-01

Pos logit contributions
 mental+0.071
cks+0.068
rily+0.068
bits+0.066
bit+0.065
Neg logit contributions
der-0.043
 tr-0.034
 Lu-0.032
Stud-0.031
 pian-0.030
Token.
Token ID13
Feature activation-4.23e-03

Pos logit contributions
der+0.252
 tr+0.237
 Lu+0.233
Stud+0.232
 pian+0.228
Neg logit contributions
 mental-0.298
rily-0.287
cks-0.276
bits-0.271
bit-0.261
Token
Token ID198
Feature activation-1.05e-02

Pos logit contributions
der+0.004
 tr+0.004
Stud+0.003
 Lu+0.003
 pian+0.003
Neg logit contributions
 mental-0.004
rily-0.004
cks-0.004
bits-0.004
bit-0.004
Token
Token ID198
Feature activation-1.08e-02

Pos logit contributions
der+0.007
Stud+0.007
 Lu+0.007
Ste+0.007
 tr+0.007
Neg logit contributions
rily-0.013
 mental-0.013
cks-0.012
bits-0.012
bit-0.012
TokenBut
Token ID1537
Feature activation-6.34e-02

Pos logit contributions
der+0.012
 Lu+0.011
 tr+0.010
 pian+0.010
leaf+0.010
Neg logit contributions
 mental-0.010
rily-0.008
bits-0.008
cks-0.008
ulously-0.008
Token then
Token ID788
Feature activation+5.01e-02

Pos logit contributions
der+0.067
 tr+0.062
 Lu+0.061
Stud+0.060
Ste+0.060
Neg logit contributions
 mental-0.057
rily-0.054
cks-0.052
bits-0.051
bit-0.049
Token This
Token ID770
Feature activation+2.29e-02

Pos logit contributions
der+0.004
 Lu+0.004
 tr+0.004
Ste+0.004
Stud+0.003
Neg logit contributions
 mental-0.006
cks-0.005
rily-0.005
bits-0.005
bit-0.005
Token made
Token ID925
Feature activation-6.80e-03

Pos logit contributions
 mental+0.027
rily+0.027
cks+0.025
bits+0.025
bel+0.023
Neg logit contributions
der-0.018
 tr-0.018
 Lu-0.016
Stud-0.016
 pian-0.016
Token her
Token ID607
Feature activation+5.85e-03

Pos logit contributions
 Lu+0.004
 Mind+0.003
 Pupp+0.003
 fog+0.003
 lightning+0.003
Neg logit contributions
rily-0.010
 lair-0.010
 mental-0.010
cks-0.010
bel-0.009
Token even
Token ID772
Feature activation-1.02e-01

Pos logit contributions
rily+0.009
cks+0.009
bits+0.009
ic+0.009
bones+0.009
Neg logit contributions
 tr-0.003
 just-0.002
 Lu-0.002
 p-0.002
 m-0.002
Token glad
Token ID9675
Feature activation+2.52e-02

Pos logit contributions
 tr+0.056
der+0.054
 r+0.050
 p+0.046
 pian+0.043
Neg logit contributions
rily-0.142
cks-0.139
bits-0.138
 mental-0.135
itched-0.135
Tokender
Token ID1082
Feature activation-2.87e-01

Pos logit contributions
 mental+0.033
cks+0.031
bits+0.031
rily+0.030
bit+0.029
Neg logit contributions
der-0.020
 tr-0.017
 Lu-0.017
 pian-0.015
Stud-0.015
Token and
Token ID290
Feature activation-1.60e-02

Pos logit contributions
der+0.265
 tr+0.249
Stud+0.246
 Lu+0.245
Ste+0.241
Neg logit contributions
 mental-0.282
rily-0.273
cks-0.263
bits-0.259
bit-0.249
Token she
Token ID673
Feature activation-6.86e-03

Pos logit contributions
der+0.013
 tr+0.012
 Lu+0.011
 r+0.010
Stud+0.010
Neg logit contributions
rily-0.019
 mental-0.019
cks-0.018
bit-0.018
izer-0.017
Token smiled
Token ID13541
Feature activation-7.28e-02

Pos logit contributions
der+0.007
Stud+0.006
 escal+0.006
 Lu+0.006
 pian+0.006
Neg logit contributions
 mental-0.006
rily-0.006
bits-0.006
cks-0.006
bit-0.006
Token with
Token ID351
Feature activation+2.73e-02

Pos logit contributions
der+0.054
 tr+0.043
 Lu+0.039
Stud+0.035
 pian+0.035
Neg logit contributions
 mental-0.099
cks-0.097
bits-0.090
rily-0.087
bit-0.087
Token joy
Token ID8716
Feature activation-6.15e-02

Pos logit contributions
rily+0.033
cks+0.031
bel+0.031
 mental+0.030
z+0.030
Neg logit contributions
der-0.020
 tr-0.018
 Lu-0.018
leaf-0.018
Ste-0.017
Token look
Token ID804
Feature activation-8.99e-03

Pos logit contributions
der+0.095
 tr+0.088
 Lu+0.087
Stud+0.082
 pian+0.081
Neg logit contributions
 mental-0.115
rily-0.113
cks-0.112
bits-0.109
bit-0.108
Token even
Token ID772
Feature activation-9.98e-02

Pos logit contributions
der+0.008
 tr+0.007
 Lu+0.007
Stud+0.007
 pian+0.006
Neg logit contributions
 mental-0.010
rily-0.009
cks-0.009
bits-0.009
bit-0.009
Token smaller
Token ID4833
Feature activation+8.96e-02

Pos logit contributions
der+0.090
 tr+0.081
 Lu+0.078
Stud+0.078
Ste+0.076
Neg logit contributions
 mental-0.105
rily-0.101
cks-0.098
bits-0.097
bit-0.094
Token and
Token ID290
Feature activation-1.34e-02

Pos logit contributions
 mental+0.121
cks+0.112
rily+0.109
bits+0.108
 acquaintance+0.105
Neg logit contributions
der-0.067
 tr-0.055
 Lu-0.050
 escal-0.045
 pian-0.044
Token sad
Token ID6507
Feature activation+5.81e-02

Pos logit contributions
der+0.014
Stud+0.013
 tr+0.013
Ste+0.013
 Lu+0.013
Neg logit contributions
 mental-0.012
rily-0.011
cks-0.011
bits-0.011
bit-0.010
Tokender
Token ID1082
Feature activation-2.87e-01

Pos logit contributions
 mental+0.078
cks+0.074
bits+0.072
rily+0.072
bit+0.070
Neg logit contributions
der-0.045
 tr-0.035
 Lu-0.034
 pian-0.031
Stud-0.030
Token.
Token ID13
Feature activation-2.86e-03

Pos logit contributions
der+0.238
 tr+0.209
 Lu+0.205
Stud+0.205
Ste+0.199
Neg logit contributions
 mental-0.326
rily-0.312
cks-0.300
bits-0.297
bit-0.290
Token<|endoftext|>
Token ID50256
Feature activation-1.75e-02

Pos logit contributions
der+0.002
 Lu+0.002
Stud+0.002
 tr+0.002
Ste+0.002
Neg logit contributions
 mental-0.003
rily-0.003
cks-0.003
bits-0.003
bit-0.003
TokenOnce
Token ID7454
Feature activation+3.49e-02

Pos logit contributions
 Lu+0.010
der+0.009
 tr+0.009
Ste+0.009
Stud+0.009
Neg logit contributions
 mental-0.024
rily-0.024
cks-0.023
bits-0.022
bit-0.022
Token upon
Token ID2402
Feature activation-4.45e-02

Pos logit contributions
 mental+0.046
rily+0.044
cks+0.044
bit+0.043
bits+0.043
Neg logit contributions
der-0.022
 Lu-0.019
 tr-0.019
Stud-0.018
Ste-0.017
Token a
Token ID257
Feature activation-2.21e-03

Pos logit contributions
der+0.029
Ste+0.027
 tr+0.026
 Lu+0.025
Stud+0.025
Neg logit contributions
 mental-0.059
rily-0.057
cks-0.055
bits-0.054
bit-0.053
Token even
Token ID772
Feature activation-1.03e-01

Pos logit contributions
bits+0.025
rily+0.025
cks+0.024
bit+0.024
z+0.024
Neg logit contributions
der-0.014
 Lu-0.013
Ste-0.012
Stud-0.012
leaf-0.012
Token happier
Token ID23030
Feature activation+8.47e-02

Pos logit contributions
 tr+0.067
der+0.064
 r+0.063
 kn+0.060
leaf+0.060
Neg logit contributions
rily-0.125
bits-0.124
bit-0.118
ings-0.117
cks-0.117
Token and
Token ID290
Feature activation-9.44e-03

Pos logit contributions
 mental+0.108
rily+0.101
bits+0.100
cks+0.099
bit+0.097
Neg logit contributions
der-0.066
 tr-0.053
Stud-0.051
 Lu-0.050
Ste-0.048
Token p
Token ID279
Feature activation-1.67e-01

Pos logit contributions
der+0.008
 tr+0.007
leaf+0.006
 Lu+0.006
 r+0.006
Neg logit contributions
rily-0.011
bit-0.011
bits-0.011
cks-0.010
 mental-0.010
Tokenrou
Token ID472
Feature activation+6.73e-02

Pos logit contributions
der+0.211
 tr+0.197
 Lu+0.193
Stud+0.193
Ste+0.189
Neg logit contributions
 mental-0.116
rily-0.108
bits-0.099
cks-0.098
bit-0.096
Tokender
Token ID1082
Feature activation-2.86e-01

Pos logit contributions
 mental+0.112
rily+0.110
cks+0.107
bits+0.106
bit+0.104
Neg logit contributions
der-0.017
 tr-0.014
Stud-0.013
 Lu-0.013
 pian-0.011
Token.
Token ID13
Feature activation+1.02e-03

Pos logit contributions
der+0.236
 tr+0.205
Stud+0.202
 Lu+0.201
Ste+0.195
Neg logit contributions
 mental-0.330
rily-0.315
cks-0.302
bits-0.300
bit-0.293
Token He
Token ID679
Feature activation+4.94e-03

Pos logit contributions
 mental+0.001
rily+0.001
cks+0.001
bits+0.001
bit+0.001
Neg logit contributions
der-0.001
 tr-0.001
 Lu-0.001
Stud-0.001
Ste-0.001
Token displays
Token ID11298
Feature activation+6.41e-04

Pos logit contributions
 mental+0.005
rily+0.004
cks+0.004
bits+0.004
bit+0.004
Neg logit contributions
der-0.005
 tr-0.005
Stud-0.004
 Lu-0.004
Ste-0.004
Token his
Token ID465
Feature activation-2.70e-02

Pos logit contributions
 mental+0.001
uces+0.001
 acquaintance+0.001
astered+0.001
rily+0.001
Neg logit contributions
der-0.000
 tr-0.000
 all-0.000
 fog-0.000
 Lu-0.000
Token sail
Token ID14936
Feature activation+5.76e-02

Pos logit contributions
 tr+0.023
der+0.019
 fog+0.017
 pian+0.017
 archae+0.016
Neg logit contributions
rily-0.033
cks-0.031
bits-0.031
 mental-0.031
astered-0.028
Token.
Token ID13
Feature activation-3.72e-03

Pos logit contributions
 mental+0.021
rily+0.020
cks+0.019
bits+0.019
 acquaintance+0.018
Neg logit contributions
 tr-0.008
der-0.008
 just-0.005
 Lu-0.005
 r-0.005
Token Tim
Token ID5045
Feature activation-3.12e-02

Pos logit contributions
der+0.003
 tr+0.003
Stud+0.003
 Lu+0.003
Ste+0.003
Neg logit contributions
 mental-0.004
rily-0.004
cks-0.004
bits-0.004
bit-0.004
Token felt
Token ID2936
Feature activation-5.25e-03

Pos logit contributions
 tr+0.018
 Lu+0.014
 p+0.012
der+0.012
 r+0.012
Neg logit contributions
rily-0.048
cks-0.046
 mental-0.045
bel-0.045
uggets-0.043
Token even
Token ID772
Feature activation-1.02e-01

Pos logit contributions
der+0.004
Ste+0.004
 Lu+0.004
 tr+0.004
Stud+0.004
Neg logit contributions
rily-0.006
cks-0.006
 mental-0.005
bits-0.005
bit-0.005
Token sad
Token ID6507
Feature activation+5.57e-02

Pos logit contributions
der+0.056
 tr+0.054
 r+0.053
 fog+0.051
 kn+0.049
Neg logit contributions
rily-0.136
bits-0.132
cks-0.131
ings-0.131
bit-0.129
Tokender
Token ID1082
Feature activation-2.86e-01

Pos logit contributions
 mental+0.072
cks+0.069
rily+0.069
bits+0.067
bit+0.066
Neg logit contributions
der-0.042
 tr-0.034
 Lu-0.032
Stud-0.031
 pian-0.030
Token now
Token ID783
Feature activation+8.95e-02

Pos logit contributions
der+0.251
 tr+0.230
Stud+0.226
 Lu+0.226
Ste+0.220
Neg logit contributions
 mental-0.303
rily-0.291
cks-0.280
bits-0.276
bit-0.267
Token,
Token ID11
Feature activation-8.46e-03

Pos logit contributions
 mental+0.106
rily+0.103
cks+0.102
bits+0.101
bit+0.098
Neg logit contributions
der-0.075
 tr-0.064
 Lu-0.060
Stud-0.058
Ste-0.057
Token and
Token ID290
Feature activation-1.21e-02

Pos logit contributions
der+0.006
 tr+0.005
 Lu+0.004
Stud+0.004
Ste+0.004
Neg logit contributions
 mental-0.012
cks-0.011
bits-0.011
rily-0.011
bit-0.011
Token he
Token ID339
Feature activation-2.10e-02

Pos logit contributions
der+0.008
 tr+0.007
 Lu+0.007
 fog+0.006
 r+0.006
Neg logit contributions
rily-0.016
cks-0.016
 mental-0.016
bit-0.016
bits-0.016
Token cried
Token ID16896
Feature activation-8.33e-03

Pos logit contributions
der+0.017
 tr+0.015
 Lu+0.014
Ste+0.014
Stud+0.013
Neg logit contributions
rily-0.025
 mental-0.025
cks-0.024
bits-0.024
bit-0.022
Token said
Token ID531
Feature activation-1.11e-02

Pos logit contributions
 tr+0.013
der+0.012
Ste+0.011
 Lu+0.011
Ju+0.010
Neg logit contributions
rily-0.023
cks-0.022
 mental-0.022
bits-0.022
bit-0.021
Token no
Token ID645
Feature activation+8.92e-02

Pos logit contributions
der+0.010
 tr+0.009
Stud+0.009
 Lu+0.009
Ste+0.009
Neg logit contributions
 mental-0.012
rily-0.011
cks-0.011
bits-0.011
bit-0.010
Token,
Token ID11
Feature activation-9.38e-03

Pos logit contributions
 mental+0.093
rily+0.084
bits+0.081
bit+0.080
cks+0.079
Neg logit contributions
der-0.077
Stud-0.074
 escal-0.073
leaf-0.073
 Lu-0.072
Token the
Token ID262
Feature activation+2.05e-03

Pos logit contributions
der+0.008
 tr+0.006
 Lu+0.006
Stud+0.006
 pian+0.006
Neg logit contributions
 mental-0.011
rily-0.011
cks-0.011
bits-0.011
bit-0.011
Token mad
Token ID8805
Feature activation-6.04e-04

Pos logit contributions
cks+0.003
rily+0.002
 mental+0.002
bits+0.002
bit+0.002
Neg logit contributions
der-0.002
 tr-0.002
 fog-0.001
 Lu-0.001
 pian-0.001
Tokender
Token ID1082
Feature activation-2.86e-01

Pos logit contributions
der+0.001
 tr+0.000
 Lu+0.000
 pian+0.000
 archae+0.000
Neg logit contributions
 mental-0.001
cks-0.001
rily-0.001
bits-0.001
ulously-0.001
Token Sara
Token ID24799
Feature activation+2.68e-02

Pos logit contributions
der+0.254
 tr+0.228
Stud+0.224
 Lu+0.223
Ste+0.217
Neg logit contributions
 mental-0.306
rily-0.293
cks-0.281
bits-0.278
bit-0.270
Token got
Token ID1392
Feature activation-7.00e-02

Pos logit contributions
cks+0.041
 mental+0.040
rily+0.039
bits+0.039
 acquaintance+0.038
Neg logit contributions
der-0.016
 tr-0.013
 just-0.012
 Lu-0.010
 r-0.009
Token.
Token ID13
Feature activation+7.80e-03

Pos logit contributions
der+0.037
Ste+0.036
 Lu+0.036
 tr+0.035
 just+0.035
Neg logit contributions
rily-0.095
bit-0.094
 mental-0.093
bits-0.093
cks-0.093
Token Then
Token ID3244
Feature activation+8.98e-02

Pos logit contributions
 mental+0.009
cks+0.008
rily+0.008
bits+0.008
bit+0.008
Neg logit contributions
der-0.007
 Lu-0.006
 tr-0.006
Stud-0.006
Ste-0.006
Token,
Token ID11
Feature activation-8.03e-03

Pos logit contributions
 mental+0.110
cks+0.108
rily+0.108
bits+0.106
bit+0.100
Neg logit contributions
der-0.065
 tr-0.062
 Lu-0.062
Stud-0.056
Ste-0.055
Token.
Token ID13
Feature activation-7.46e-04

Pos logit contributions
der+0.004
 Lu+0.004
 tr+0.004
 fog+0.003
 Mind+0.003
Neg logit contributions
 mental-0.008
cks-0.008
bits-0.007
bit-0.007
 acquaintance-0.007
Token Lily
Token ID20037
Feature activation+7.44e-03

Pos logit contributions
der+0.001
 Lu+0.001
Stud+0.000
Ste+0.000
 tr+0.000
Neg logit contributions
 mental-0.001
cks-0.001
rily-0.001
bits-0.001
bit-0.001
Token felt
Token ID2936
Feature activation-1.20e-03

Pos logit contributions
 mental+0.008
rily+0.008
cks+0.008
bits+0.008
bit+0.007
Neg logit contributions
der-0.006
 tr-0.006
 Lu-0.005
Ste-0.005
Stud-0.005
Token even
Token ID772
Feature activation-9.78e-02

Pos logit contributions
der+0.001
 tr+0.001
Stud+0.001
 Lu+0.001
Ste+0.001
Neg logit contributions
 mental-0.001
rily-0.001
cks-0.001
bits-0.001
bit-0.001
Token sad
Token ID6507
Feature activation+5.74e-02

Pos logit contributions
 r+0.039
der+0.036
 fog+0.034
 kn+0.033
 p+0.032
Neg logit contributions
rily-0.153
bit-0.148
ings-0.148
cks-0.147
bits-0.146
Tokender
Token ID1082
Feature activation-2.86e-01

Pos logit contributions
 mental+0.073
rily+0.069
cks+0.069
bits+0.067
bit+0.066
Neg logit contributions
der-0.046
 tr-0.036
 Lu-0.035
Stud-0.034
 pian-0.033
Token than
Token ID621
Feature activation-5.12e-03

Pos logit contributions
der+0.247
 tr+0.235
Stud+0.229
 Lu+0.228
 pian+0.225
Neg logit contributions
 mental-0.298
rily-0.287
cks-0.276
bits-0.271
bit-0.260
Token before
Token ID878
Feature activation+5.79e-03

Pos logit contributions
der+0.005
 tr+0.005
Stud+0.005
 Lu+0.005
Ste+0.005
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.004
bits-0.004
bit-0.004
Token.
Token ID13
Feature activation+6.06e-04

Pos logit contributions
 mental+0.008
rily+0.008
bit+0.008
bits+0.008
cks+0.008
Neg logit contributions
der-0.004
 Lu-0.003
 tr-0.003
 sub-0.002
Ste-0.002
Token From
Token ID3574
Feature activation+1.30e-02

Pos logit contributions
rily+0.001
 mental+0.001
cks+0.001
bits+0.001
bit+0.001
Neg logit contributions
der-0.000
 Lu-0.000
Stud-0.000
Ste-0.000
 tr-0.000
Token that
Token ID326
Feature activation+5.83e-02

Pos logit contributions
 mental+0.012
rily+0.011
cks+0.011
bits+0.011
bit+0.010
Neg logit contributions
der-0.014
 tr-0.013
 Lu-0.012
Stud-0.012
Ste-0.012
Token
Token ID198
Feature activation-8.34e-03

Pos logit contributions
der+0.007
 Lu+0.006
Stud+0.006
 tr+0.006
Ste+0.006
Neg logit contributions
 mental-0.012
rily-0.012
cks-0.011
bits-0.011
bit-0.011
TokenJack
Token ID14295
Feature activation-5.73e-02

Pos logit contributions
der+0.008
 tr+0.007
 Lu+0.007
 pian+0.007
 escal+0.007
Neg logit contributions
 mental-0.008
rily-0.008
cks-0.007
bits-0.007
bit-0.007
Token got
Token ID1392
Feature activation-7.46e-02

Pos logit contributions
der+0.042
 tr+0.040
 Lu+0.035
arre+0.033
Ste+0.033
Neg logit contributions
 mental-0.072
cks-0.070
rily-0.069
bits-0.066
 acquaintance-0.064
Token even
Token ID772
Feature activation-1.02e-01

Pos logit contributions
 Lu+0.052
 tr+0.051
leaf+0.049
Ste+0.049
der+0.048
Neg logit contributions
cks-0.088
rily-0.088
bit-0.087
 mental-0.084
bits-0.082
Token mad
Token ID8805
Feature activation+1.41e-03

Pos logit contributions
 r+0.046
 tr+0.046
der+0.041
arre+0.040
 deter+0.040
Neg logit contributions
rily-0.151
cks-0.145
bits-0.145
bit-0.144
itched-0.139
Tokender
Token ID1082
Feature activation-2.85e-01

Pos logit contributions
 mental+0.002
cks+0.002
bits+0.002
rily+0.002
bit+0.002
Neg logit contributions
der-0.001
 tr-0.001
 Lu-0.001
Stud-0.001
arre-0.001
Token and
Token ID290
Feature activation-1.80e-02

Pos logit contributions
der+0.231
 tr+0.200
Stud+0.196
 Lu+0.195
Ste+0.190
Neg logit contributions
 mental-0.332
rily-0.319
cks-0.306
bits-0.304
bit-0.297
Token threw
Token ID9617
Feature activation+8.40e-03

Pos logit contributions
der+0.014
 tr+0.012
 Lu+0.012
Ste+0.011
Stud+0.010
Neg logit contributions
 mental-0.022
rily-0.022
bits-0.021
cks-0.021
bit-0.021
Token his
Token ID465
Feature activation-2.68e-02

Pos logit contributions
 mental+0.010
rily+0.010
cks+0.010
bits+0.009
bit+0.009
Neg logit contributions
der-0.006
 tr-0.006
 Lu-0.005
Stud-0.005
Ste-0.005
Token toy
Token ID13373
Feature activation-2.68e-02

Pos logit contributions
 tr+0.024
der+0.021
<+0.018
 fog+0.017
 archae+0.017
Neg logit contributions
 mental-0.031
bits-0.030
cks-0.030
rily-0.029
ulously-0.027
Token on
Token ID319
Feature activation+2.60e-02

Pos logit contributions
der+0.023
 tr+0.022
 pian+0.017
 Lu+0.017
 escal+0.016
Neg logit contributions
 mental-0.033
cks-0.031
rily-0.030
bits-0.029
 acquaintance-0.029
Token it
Token ID340
Feature activation-8.30e-03

Pos logit contributions
der+0.014
 tr+0.013
 Lu+0.013
Stud+0.012
 pian+0.012
Neg logit contributions
 mental-0.019
rily-0.018
cks-0.017
bits-0.017
bit-0.017
Token made
Token ID925
Feature activation-6.85e-03

Pos logit contributions
der+0.009
leaf+0.008
Ste+0.008
Stud+0.008
 archae+0.008
Neg logit contributions
 mental-0.007
rily-0.006
cks-0.006
bits-0.006
bel-0.006
Token her
Token ID607
Feature activation+2.67e-03

Pos logit contributions
 Lu+0.004
der+0.003
 thunder+0.003
 earthquakes+0.003
 lightning+0.003
Neg logit contributions
cks-0.010
 mental-0.009
rily-0.009
bit-0.009
 lair-0.009
Token even
Token ID772
Feature activation-1.00e-01

Pos logit contributions
cks+0.004
rily+0.004
bits+0.004
ases+0.003
izer+0.003
Neg logit contributions
 tr-0.002
 Lu-0.002
 church-0.001
der-0.001
 just-0.001
Token sad
Token ID6507
Feature activation+5.49e-02

Pos logit contributions
der+0.064
 tr+0.064
 pian+0.056
 r+0.054
 Lu+0.054
Neg logit contributions
cks-0.127
bits-0.127
rily-0.125
bit-0.122
 mental-0.121
Tokender
Token ID1082
Feature activation-2.85e-01

Pos logit contributions
 mental+0.069
cks+0.067
rily+0.066
bits+0.065
bit+0.063
Neg logit contributions
der-0.043
 tr-0.035
 Lu-0.034
Stud-0.032
 pian-0.032
Token.
Token ID13
Feature activation-4.01e-03

Pos logit contributions
der+0.242
 tr+0.228
Stud+0.224
 Lu+0.222
Ste+0.219
Neg logit contributions
 mental-0.302
rily-0.293
cks-0.280
bits-0.277
bit-0.265
Token
Token ID198
Feature activation-9.75e-03

Pos logit contributions
der+0.004
 tr+0.003
Stud+0.003
 pian+0.003
 Lu+0.003
Neg logit contributions
 mental-0.004
rily-0.004
cks-0.004
bits-0.004
bit-0.004
Token
Token ID198
Feature activation-9.65e-03

Pos logit contributions
der+0.007
 Lu+0.006
Stud+0.006
Ste+0.006
 tr+0.006
Neg logit contributions
 mental-0.012
rily-0.012
cks-0.011
bits-0.011
bit-0.011
TokenSuddenly
Token ID38582
Feature activation+4.77e-02

Pos logit contributions
der+0.011
 tr+0.010
 archae+0.010
 pian+0.010
 escal+0.010
Neg logit contributions
 mental-0.008
rily-0.007
bits-0.007
cks-0.007
ulously-0.007
Token,
Token ID11
Feature activation-1.31e-02

Pos logit contributions
 mental+0.035
rily+0.034
 Day+0.031
bits+0.031
cks+0.030
Neg logit contributions
der-0.055
Stud-0.053
leaf-0.052
Ju-0.051
 escal-0.050
Token and
Token ID290
Feature activation-1.78e-02

Pos logit contributions
der+0.004
 tr+0.003
 Lu+0.003
 r+0.002
 just+0.002
Neg logit contributions
 mental-0.009
cks-0.009
 acquaintance-0.009
bits-0.009
bit-0.009
Token it
Token ID340
Feature activation-6.14e-03

Pos logit contributions
der+0.017
 tr+0.015
Stud+0.015
 Lu+0.015
Ste+0.015
Neg logit contributions
 mental-0.018
rily-0.017
cks-0.016
bits-0.016
bit-0.015
Token was
Token ID373
Feature activation+7.80e-03

Pos logit contributions
der+0.006
Ste+0.006
Stud+0.006
leaf+0.006
 tr+0.006
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.005
bits-0.005
bit-0.005
Token even
Token ID772
Feature activation-1.01e-01

Pos logit contributions
 mental+0.009
rily+0.009
cks+0.009
bits+0.008
ases+0.008
Neg logit contributions
der-0.006
 Lu-0.006
 tr-0.006
 fog-0.005
 sub-0.005
Token red
Token ID2266
Feature activation+6.20e-02

Pos logit contributions
der+0.082
 tr+0.076
 Lu+0.072
Stud+0.070
 pian+0.070
Neg logit contributions
 mental-0.115
rily-0.113
cks-0.109
bits-0.108
bit-0.106
Tokender
Token ID1082
Feature activation-2.84e-01

Pos logit contributions
 mental+0.084
cks+0.078
rily+0.075
bits+0.074
bit+0.072
Neg logit contributions
der-0.048
 tr-0.041
 Lu-0.037
 pian-0.034
 escal-0.033
Token.
Token ID13
Feature activation-7.29e-03

Pos logit contributions
der+0.266
 tr+0.262
Stud+0.259
Ste+0.258
 Lu+0.256
Neg logit contributions
 mental-0.256
rily-0.255
cks-0.242
bits-0.238
bit-0.224
Token Suddenly
Token ID24975
Feature activation+8.08e-02

Pos logit contributions
der+0.007
 pian+0.007
 tr+0.007
 archae+0.006
Stud+0.006
Neg logit contributions
 mental-0.007
rily-0.006
bits-0.006
 Day-0.006
 acquaintance-0.006
Token,
Token ID11
Feature activation-1.41e-02

Pos logit contributions
 mental+0.062
rily+0.061
bits+0.054
cks+0.054
 Day+0.053
Neg logit contributions
der-0.089
Stud-0.086
leaf-0.084
Ste-0.083
Ju-0.081
Token she
Token ID673
Feature activation-6.67e-03

Pos logit contributions
der+0.017
Stud+0.016
arre+0.016
 pian+0.016
 tr+0.015
Neg logit contributions
 mental-0.010
rily-0.010
bel-0.009
ulously-0.008
bits-0.008
Token had
Token ID550
Feature activation-3.58e-02

Pos logit contributions
der+0.006
 tr+0.005
 Lu+0.005
Stud+0.005
 pian+0.005
Neg logit contributions
 mental-0.008
rily-0.007
bits-0.007
cks-0.007
bit-0.007
Token little
Token ID1310
Feature activation+3.40e-02

Pos logit contributions
rily+0.001
bits+0.001
cks+0.001
 mental+0.001
ings+0.001
Neg logit contributions
 tr-0.000
 fog-0.000
 lightning-0.000
der-0.000
 r-0.000
Token boy
Token ID2933
Feature activation+3.99e-02

Pos logit contributions
 mental+0.053
rily+0.051
cks+0.050
bits+0.048
anks+0.046
Neg logit contributions
 tr-0.018
 lightning-0.015
der-0.014
 fog-0.014
 p-0.014
Token felt
Token ID2936
Feature activation-3.97e-03

Pos logit contributions
 mental+0.051
rily+0.051
cks+0.050
bits+0.050
bit+0.047
Neg logit contributions
der-0.028
 tr-0.027
 Lu-0.024
Ste-0.023
Ju-0.022
Token even
Token ID772
Feature activation-1.00e-01

Pos logit contributions
der+0.004
 tr+0.003
 Lu+0.003
Stud+0.003
Ste+0.003
Neg logit contributions
 mental-0.004
rily-0.004
cks-0.004
bits-0.004
bit-0.003
Token sad
Token ID6507
Feature activation+5.93e-02

Pos logit contributions
 r+0.044
der+0.040
 kn+0.038
 fog+0.037
 p+0.036
Neg logit contributions
rily-0.150
bits-0.147
bit-0.147
ings-0.146
room-0.144
Tokender
Token ID1082
Feature activation-2.84e-01

Pos logit contributions
 mental+0.079
cks+0.076
rily+0.075
bits+0.075
bit+0.074
Neg logit contributions
der-0.047
 tr-0.035
 Lu-0.032
Stud-0.031
 pian-0.029
Token.
Token ID13
Feature activation-1.44e-03

Pos logit contributions
der+0.235
 tr+0.201
Stud+0.196
 Lu+0.196
Ste+0.189
Neg logit contributions
 mental-0.331
rily-0.316
cks-0.303
bits-0.302
bit-0.296
Token He
Token ID679
Feature activation+5.16e-03

Pos logit contributions
der+0.001
 Lu+0.001
 tr+0.001
Stud+0.001
Ste+0.001
Neg logit contributions
cks-0.002
 mental-0.002
rily-0.002
bits-0.002
bel-0.002
Token started
Token ID2067
Feature activation-7.92e-02

Pos logit contributions
 mental+0.006
rily+0.006
cks+0.006
bits+0.006
bit+0.005
Neg logit contributions
der-0.004
 tr-0.004
 Lu-0.003
Ste-0.003
Stud-0.003
Token to
Token ID284
Feature activation-1.94e-03

Pos logit contributions
der+0.058
 tr+0.053
Stud+0.050
 Lu+0.049
 pian+0.047
Neg logit contributions
 mental-0.098
rily-0.098
bit-0.093
cks-0.093
bits-0.092
Token cry
Token ID3960
Feature activation-4.88e-02

Pos logit contributions
der+0.002
 tr+0.001
 Lu+0.001
Stud+0.001
Ste+0.001
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.002
bit-0.002
Token and
Token ID290
Feature activation-1.73e-02

Pos logit contributions
 mental+0.104
rily+0.098
cks+0.095
bits+0.095
bit+0.093
Neg logit contributions
der-0.068
 tr-0.057
 Lu-0.055
Stud-0.054
Ste-0.052
Token Lily
Token ID20037
Feature activation+7.43e-03

Pos logit contributions
der+0.017
 tr+0.016
 Lu+0.015
Stud+0.015
Ste+0.015
Neg logit contributions
 mental-0.017
rily-0.016
cks-0.015
bits-0.015
bit-0.014
Token felt
Token ID2936
Feature activation-4.89e-03

Pos logit contributions
cks+0.010
rily+0.010
bits+0.010
 mental+0.010
bit+0.009
Neg logit contributions
der-0.005
 tr-0.004
 Lu-0.004
Ste-0.004
arre-0.004
Token even
Token ID772
Feature activation-1.01e-01

Pos logit contributions
der+0.005
 tr+0.004
Stud+0.004
 Lu+0.004
 pian+0.004
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.004
bits-0.004
bit-0.004
Token sad
Token ID6507
Feature activation+5.90e-02

Pos logit contributions
der+0.081
 tr+0.069
Ste+0.067
 r+0.067
leaf+0.067
Neg logit contributions
rily-0.114
bits-0.111
bit-0.111
 mental-0.110
cks-0.108
Tokender
Token ID1082
Feature activation-2.83e-01

Pos logit contributions
 mental+0.075
rily+0.072
cks+0.071
bits+0.071
bit+0.069
Neg logit contributions
der-0.046
 tr-0.036
 Lu-0.035
Stud-0.034
 pian-0.033
Token.
Token ID13
Feature activation-2.10e-03

Pos logit contributions
der+0.245
 tr+0.227
 Lu+0.222
Stud+0.221
 pian+0.216
Neg logit contributions
 mental-0.302
rily-0.290
cks-0.279
bits-0.275
bit-0.265
Token From
Token ID3574
Feature activation+1.04e-02

Pos logit contributions
der+0.002
Stud+0.002
 Lu+0.002
 tr+0.002
Ste+0.001
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.002
bit-0.002
Token that
Token ID326
Feature activation+5.24e-02

Pos logit contributions
 mental+0.010
rily+0.010
bits+0.009
cks+0.009
bit+0.009
Neg logit contributions
der-0.010
 tr-0.009
 Lu-0.009
Stud-0.009
Ste-0.009
Token day
Token ID1110
Feature activation+4.90e-02

Pos logit contributions
rily+0.057
cks+0.055
 mental+0.054
bit+0.050
bits+0.049
Neg logit contributions
der-0.045
 tr-0.044
Stud-0.044
 Lu-0.043
Ste-0.041
Token on
Token ID319
Feature activation+2.45e-02

Pos logit contributions
 mental+0.049
rily+0.048
cks+0.046
bits+0.045
bit+0.044
Neg logit contributions
der-0.046
 tr-0.042
 Lu-0.042
Stud-0.041
Ste-0.041
Token nothing
Token ID2147
Feature activation-1.01e-02

Pos logit contributions
der+0.012
 tr+0.011
 Lu+0.011
Stud+0.010
 escal+0.010
Neg logit contributions
 mental-0.017
rily-0.016
cks-0.016
bits-0.016
bit-0.015
Token made
Token ID925
Feature activation-2.39e-03

Pos logit contributions
der+0.009
Stud+0.008
 tr+0.008
 Lu+0.008
 pian+0.008
Neg logit contributions
 mental-0.010
rily-0.010
cks-0.009
bits-0.009
bit-0.009
Token it
Token ID340
Feature activation-5.20e-03

Pos logit contributions
 Lu+0.001
der+0.001
 lightning+0.001
 fog+0.001
 earthquakes+0.001
Neg logit contributions
 mental-0.003
bel-0.003
rily-0.003
cks-0.003
bits-0.003
Token p
Token ID279
Feature activation-1.69e-01

Pos logit contributions
der+0.004
 tr+0.004
 Lu+0.004
 sub+0.004
Stud+0.004
Neg logit contributions
 mental-0.006
rily-0.006
bits-0.005
cks-0.005
bit-0.005
Tokenrou
Token ID472
Feature activation+6.88e-02

Pos logit contributions
der+0.194
 tr+0.178
 Lu+0.175
Stud+0.175
Ste+0.172
Neg logit contributions
 mental-0.139
rily-0.131
cks-0.123
bits-0.122
bit-0.117
Tokender
Token ID1082
Feature activation-2.83e-01

Pos logit contributions
 mental+0.122
rily+0.121
cks+0.117
bits+0.117
bit+0.115
Neg logit contributions
der-0.009
Stud-0.005
 tr-0.005
 Lu-0.004
 pian-0.003
Token.
Token ID13
Feature activation-5.39e-03

Pos logit contributions
der+0.256
 tr+0.229
Stud+0.224
 Lu+0.224
Ste+0.218
Neg logit contributions
 mental-0.301
rily-0.288
cks-0.275
bits-0.273
bit-0.265
Token
Token ID220
Feature activation-6.39e-03

Pos logit contributions
der+0.005
 tr+0.005
Stud+0.005
 pian+0.005
 escal+0.005
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.005
bits-0.005
bit-0.005
Token
Token ID198
Feature activation-8.50e-03

Pos logit contributions
der+0.005
 Lu+0.004
Stud+0.004
 tr+0.004
Ste+0.004
Neg logit contributions
 mental-0.008
rily-0.008
bits-0.007
cks-0.007
bit-0.007
Token
Token ID198
Feature activation-8.94e-03

Pos logit contributions
Ste+0.005
 Lu+0.005
Stud+0.005
der+0.005
Mad+0.004
Neg logit contributions
rily-0.012
 mental-0.011
cks-0.011
bits-0.011
bit-0.011
TokenThe
Token ID464
Feature activation+2.79e-03

Pos logit contributions
der+0.008
 tr+0.008
 Lu+0.007
Stud+0.007
 pian+0.007
Neg logit contributions
 mental-0.009
rily-0.009
cks-0.008
bits-0.008
bit-0.008
Token,
Token ID11
Feature activation-8.47e-03

Pos logit contributions
der+0.023
 tr+0.019
 Lu+0.017
 earthquakes+0.012
 just+0.012
Neg logit contributions
 mental-0.077
bits-0.073
rily-0.071
 acquaintance-0.070
 mun-0.070
Token so
Token ID523
Feature activation+1.09e-01

Pos logit contributions
der+0.006
 tr+0.005
 Lu+0.005
Stud+0.005
Ste+0.005
Neg logit contributions
 mental-0.011
cks-0.011
bits-0.011
rily-0.011
bit-0.011
Token Tim
Token ID5045
Feature activation-3.20e-02

Pos logit contributions
 mental+0.121
rily+0.116
cks+0.112
bits+0.110
bit+0.108
Neg logit contributions
der-0.094
 tr-0.084
 Lu-0.083
Stud-0.082
Ste-0.080
Token got
Token ID1392
Feature activation-6.90e-02

Pos logit contributions
 tr+0.021
der+0.019
 just+0.017
 Lu+0.017
Ste+0.015
Neg logit contributions
cks-0.045
 mental-0.044
rily-0.044
bits-0.042
 acquaintance-0.041
Token mad
Token ID8805
Feature activation+7.44e-03

Pos logit contributions
 tr+0.046
 Lu+0.045
leaf+0.044
Ste+0.043
der+0.042
Neg logit contributions
rily-0.084
 mental-0.081
bit-0.080
bits-0.079
cks-0.078
Tokender
Token ID1082
Feature activation-2.83e-01

Pos logit contributions
 mental+0.010
cks+0.009
bits+0.009
rily+0.009
bit+0.009
Neg logit contributions
der-0.006
 tr-0.004
 Lu-0.004
Stud-0.004
Ste-0.004
Token.
Token ID13
Feature activation+2.26e-03

Pos logit contributions
der+0.222
 tr+0.184
Stud+0.180
 Lu+0.178
Ste+0.172
Neg logit contributions
 mental-0.348
rily-0.331
cks-0.319
bits-0.318
bit-0.313
Token He
Token ID679
Feature activation+7.44e-03

Pos logit contributions
 mental+0.002
rily+0.002
cks+0.002
bits+0.002
bit+0.002
Neg logit contributions
der-0.002
 tr-0.002
 Lu-0.002
Stud-0.002
Ste-0.002
Token pushed
Token ID7121
Feature activation-2.05e-02

Pos logit contributions
 mental+0.007
rily+0.007
cks+0.006
bit+0.006
bits+0.006
Neg logit contributions
der-0.007
Stud-0.007
 Lu-0.007
 escal-0.006
 pian-0.006
Token Jack
Token ID3619
Feature activation-3.84e-02

Pos logit contributions
 tr+0.013
der+0.013
 Lu+0.012
Ste+0.012
Stud+0.011
Neg logit contributions
 mental-0.026
rily-0.025
cks-0.025
bel-0.024
bits-0.024
Token away
Token ID1497
Feature activation+2.36e-02

Pos logit contributions
 tr+0.026
der+0.023
 r+0.019
 Lu+0.019
 escal+0.019
Neg logit contributions
 mental-0.050
rily-0.047
bits-0.047
bit-0.046
 acquaintance-0.045
Token car
Token ID1097
Feature activation+4.31e-02

Pos logit contributions
cks+0.005
 mental+0.005
rily+0.005
bits+0.005
bit+0.004
Neg logit contributions
 tr-0.003
der-0.003
 pian-0.003
 Lu-0.003
 fog-0.003
Token made
Token ID925
Feature activation-1.60e-03

Pos logit contributions
 mental+0.077
cks+0.074
bit+0.072
rily+0.072
 acquaintance+0.071
Neg logit contributions
der-0.017
 tr-0.016
 just-0.013
 all-0.012
 inside-0.010
Token him
Token ID683
Feature activation+1.09e-01

Pos logit contributions
der+0.001
 just+0.001
 tr+0.001
 Lu+0.001
 r+0.001
Neg logit contributions
cks-0.002
 mental-0.002
rily-0.002
bit-0.002
 acquaintance-0.002
Token even
Token ID772
Feature activation-9.82e-02

Pos logit contributions
bits+0.154
rily+0.151
cks+0.150
uces+0.147
izer+0.146
Neg logit contributions
 tr-0.053
der-0.050
 just-0.045
 all-0.042
 too-0.042
Token sad
Token ID6507
Feature activation+6.00e-02

Pos logit contributions
der+0.053
 tr+0.043
 r+0.042
 pian+0.036
 sub+0.036
Neg logit contributions
bits-0.140
rily-0.139
bit-0.137
 mental-0.135
cks-0.133
Tokender
Token ID1082
Feature activation-2.83e-01

Pos logit contributions
 mental+0.089
cks+0.084
bits+0.082
rily+0.080
bit+0.079
Neg logit contributions
der-0.044
 tr-0.032
 Lu-0.027
 pian-0.025
 sub-0.024
Token.
Token ID13
Feature activation-3.85e-03

Pos logit contributions
der+0.239
 tr+0.198
Stud+0.194
 Lu+0.192
Ste+0.187
Neg logit contributions
 mental-0.334
rily-0.315
cks-0.302
bits-0.300
bit-0.295
Token He
Token ID679
Feature activation+3.14e-03

Pos logit contributions
der+0.003
 Lu+0.003
 tr+0.003
Stud+0.003
Ste+0.003
Neg logit contributions
 mental-0.004
rily-0.004
cks-0.004
bits-0.004
bit-0.004
Token asked
Token ID1965
Feature activation-1.75e-02

Pos logit contributions
 mental+0.004
rily+0.003
cks+0.003
bits+0.003
bit+0.003
Neg logit contributions
der-0.003
 tr-0.002
 Lu-0.002
Stud-0.002
Ste-0.002
Token everyone
Token ID2506
Feature activation-9.12e-02

Pos logit contributions
der+0.012
 Lu+0.011
 tr+0.011
Stud+0.010
 pian+0.009
Neg logit contributions
 mental-0.022
cks-0.022
rily-0.022
bit-0.021
bits-0.020
Token if
Token ID611
Feature activation+9.02e-02

Pos logit contributions
der+0.074
 tr+0.067
 Lu+0.065
Stud+0.063
 pian+0.060
Neg logit contributions
 mental-0.105
rily-0.102
cks-0.100
bit-0.099
bits-0.099
Token.
Token ID13
Feature activation-2.95e-03

Pos logit contributions
 mental+0.091
cks+0.089
 acquaintance+0.086
bits+0.086
ases+0.085
Neg logit contributions
der-0.026
 tr-0.022
 just-0.016
 fog-0.016
 bud-0.015
Token He
Token ID679
Feature activation+4.38e-03

Pos logit contributions
der+0.003
Stud+0.003
 pian+0.003
 tr+0.003
 escal+0.003
Neg logit contributions
 mental-0.003
rily-0.003
bits-0.002
cks-0.002
bit-0.002
Token felt
Token ID2936
Feature activation-1.71e-03

Pos logit contributions
 mental+0.004
rily+0.004
cks+0.004
bits+0.004
bit+0.004
Neg logit contributions
der-0.004
Stud-0.004
 tr-0.004
 Lu-0.004
 pian-0.004
Token even
Token ID772
Feature activation-9.75e-02

Pos logit contributions
der+0.002
 tr+0.002
 Lu+0.002
Stud+0.002
Ste+0.002
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.001
bit-0.001
Token sad
Token ID6507
Feature activation+5.80e-02

Pos logit contributions
der+0.049
 tr+0.045
 r+0.045
 fog+0.044
 p+0.041
Neg logit contributions
rily-0.138
bit-0.134
cks-0.132
bits-0.132
ings-0.131
Tokender
Token ID1082
Feature activation-2.83e-01

Pos logit contributions
 mental+0.076
cks+0.073
rily+0.072
bits+0.071
bit+0.071
Neg logit contributions
der-0.046
 tr-0.035
 Lu-0.033
Stud-0.031
 pian-0.030
Token now
Token ID783
Feature activation+9.17e-02

Pos logit contributions
der+0.243
 tr+0.215
Stud+0.211
 Lu+0.211
Ste+0.205
Neg logit contributions
 mental-0.314
rily-0.301
cks-0.289
bits-0.286
bit-0.278
Token.
Token ID13
Feature activation-1.64e-03

Pos logit contributions
 mental+0.107
rily+0.104
cks+0.101
bits+0.100
bit+0.098
Neg logit contributions
der-0.075
 tr-0.066
 Lu-0.063
Stud-0.062
Ste-0.060
Token But
Token ID887
Feature activation-5.52e-02

Pos logit contributions
der+0.002
 tr+0.001
 pian+0.001
Stud+0.001
 Lu+0.001
Neg logit contributions
 mental-0.002
rily-0.002
bits-0.001
cks-0.001
bit-0.001
Token then
Token ID788
Feature activation+5.31e-02

Pos logit contributions
der+0.057
 tr+0.052
 Lu+0.051
Stud+0.051
Ste+0.050
Neg logit contributions
 mental-0.051
rily-0.049
cks-0.047
bits-0.046
bit-0.045
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
 mental+0.052
rily+0.048
cks+0.046
bits+0.045
bit+0.044
Neg logit contributions
der-0.052
Stud-0.047
Ste-0.047
leaf-0.046
 pian-0.046
Token.
Token ID13
Feature activation-1.34e-03

Pos logit contributions
der+0.080
 tr+0.072
 Lu+0.070
 escal+0.066
Stud+0.058
Neg logit contributions
 mental-0.180
rily-0.175
bel-0.173
bit-0.172
bits-0.170
Token He
Token ID679
Feature activation+6.62e-03

Pos logit contributions
der+0.001
 tr+0.001
 Lu+0.001
Stud+0.001
Ste+0.001
Neg logit contributions
 mental-0.001
rily-0.001
cks-0.001
bits-0.001
bit-0.001
Token felt
Token ID2936
Feature activation+5.91e-04

Pos logit contributions
 mental+0.005
 mun+0.004
rily+0.004
am+0.004
bit+0.004
Neg logit contributions
der-0.008
 pian-0.007
 escal-0.007
Stud-0.007
 outage-0.007
Token even
Token ID772
Feature activation-9.39e-02

Pos logit contributions
 mental+0.001
rily+0.001
cks+0.001
bits+0.001
bit+0.001
Neg logit contributions
der-0.001
 tr-0.001
 Lu-0.001
Stud-0.001
Ste-0.000
Token sad
Token ID6507
Feature activation+6.03e-02

Pos logit contributions
 tr+0.041
 r+0.040
der+0.040
 fog+0.038
 p+0.036
Neg logit contributions
rily-0.139
bit-0.136
ings-0.133
bits-0.132
cks-0.132
Tokender
Token ID1082
Feature activation-2.83e-01

Pos logit contributions
 mental+0.079
rily+0.075
cks+0.075
bits+0.073
bit+0.072
Neg logit contributions
der-0.047
 tr-0.038
 Lu-0.035
Stud-0.034
 pian-0.032
Token.
Token ID13
Feature activation-1.40e-04

Pos logit contributions
der+0.247
 tr+0.227
 Lu+0.223
Stud+0.223
 pian+0.217
Neg logit contributions
 mental-0.300
rily-0.288
cks-0.278
bits-0.273
bit-0.264
Token
Token ID198
Feature activation-6.65e-03

Pos logit contributions
der+0.000
 tr+0.000
Stud+0.000
 Lu+0.000
Ste+0.000
Neg logit contributions
 mental-0.000
rily-0.000
cks-0.000
bits-0.000
bit-0.000
Token
Token ID198
Feature activation-6.49e-03

Pos logit contributions
der+0.004
 Lu+0.004
Stud+0.003
 tr+0.003
Ste+0.003
Neg logit contributions
rily-0.009
 mental-0.009
bit-0.008
bits-0.008
cks-0.008
TokenMax
Token ID11518
Feature activation-6.52e-03

Pos logit contributions
der+0.007
 Lu+0.006
 tr+0.006
Stud+0.006
 pian+0.006
Neg logit contributions
 mental-0.006
rily-0.006
cks-0.006
bits-0.006
bit-0.005
Token sat
Token ID3332
Feature activation-6.88e-02

Pos logit contributions
der+0.005
 tr+0.005
 Lu+0.004
Ste+0.004
Stud+0.004
Neg logit contributions
rily-0.008
 mental-0.008
cks-0.008
bits-0.007
bel-0.007
Token found
Token ID1043
Feature activation+1.33e-02

Pos logit contributions
 tr+0.003
der+0.003
Ju+0.002
 Lu+0.002
 r+0.002
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.005
bit-0.005
bits-0.005
Token something
Token ID1223
Feature activation-7.61e-02

Pos logit contributions
 mental+0.020
cks+0.020
bit+0.019
 acquaintance+0.018
rily+0.018
Neg logit contributions
der-0.007
 Lu-0.006
 tr-0.005
 fog-0.005
 lightning-0.004
Token even
Token ID772
Feature activation-9.86e-02

Pos logit contributions
der+0.059
 tr+0.058
 Lu+0.052
Stud+0.051
 pian+0.050
Neg logit contributions
bit-0.090
bits-0.090
 mental-0.089
cks-0.087
rily-0.087
Token we
Token ID356
Feature activation-3.97e-02

Pos logit contributions
der+0.068
 tr+0.060
 r+0.053
 fog+0.051
Ju+0.051
Neg logit contributions
 mental-0.127
rily-0.122
cks-0.122
bits-0.121
bit-0.120
Tokenir
Token ID343
Feature activation+2.04e-02

Pos logit contributions
der+0.038
 tr+0.035
Stud+0.034
 Lu+0.034
Ste+0.033
Neg logit contributions
 mental-0.040
rily-0.038
cks-0.036
bits-0.036
bit-0.035
Tokender
Token ID1082
Feature activation-2.81e-01

Pos logit contributions
 mental+0.016
rily+0.014
cks+0.013
bits+0.013
bit+0.012
Neg logit contributions
der-0.025
 tr-0.023
 Lu-0.022
Stud-0.022
Ste-0.022
Token!
Token ID0
Feature activation-7.15e-03

Pos logit contributions
der+0.267
 tr+0.245
 Lu+0.241
Stud+0.241
Ste+0.235
Neg logit contributions
 mental-0.279
rily-0.268
cks-0.257
bits-0.253
bit-0.244
Token There
Token ID1318
Feature activation+2.68e-02

Pos logit contributions
der+0.007
 tr+0.006
Stud+0.006
 pian+0.006
Ste+0.006
Neg logit contributions
 mental-0.007
rily-0.007
cks-0.007
bits-0.007
bit-0.006
Token were
Token ID547
Feature activation-1.27e-03

Pos logit contributions
cks+0.037
 mental+0.036
bit+0.036
rily+0.036
bits+0.034
Neg logit contributions
 Lu-0.015
 tr-0.015
der-0.014
 escal-0.012
Stud-0.012
Token little
Token ID1310
Feature activation+3.20e-02

Pos logit contributions
der+0.001
 tr+0.001
 Lu+0.001
 fog+0.001
 escal+0.001
Neg logit contributions
 mental-0.002
cks-0.002
bit-0.001
bits-0.001
rily-0.001
Token plastic
Token ID7309
Feature activation+5.46e-03

Pos logit contributions
 mental+0.034
cks+0.032
rily+0.032
bits+0.031
bit+0.030
Neg logit contributions
der-0.030
 tr-0.028
 Lu-0.026
 pian-0.026
 escal-0.025
Token."
Token ID526
Feature activation-4.65e-03

Pos logit contributions
 mental+0.011
rily+0.010
bits+0.010
cks+0.010
bel+0.009
Neg logit contributions
der-0.005
 tr-0.004
 Lu-0.004
 pian-0.003
Ste-0.003
Token Lily
Token ID20037
Feature activation+1.25e-02

Pos logit contributions
der+0.004
 tr+0.003
 Lu+0.003
Stud+0.003
Ste+0.003
Neg logit contributions
cks-0.006
 mental-0.006
rily-0.005
bits-0.005
bit-0.005
Token felt
Token ID2936
Feature activation-6.44e-05

Pos logit contributions
 mental+0.013
rily+0.012
cks+0.011
bits+0.011
bit+0.011
Neg logit contributions
der-0.012
 tr-0.011
Stud-0.011
 Lu-0.011
 pian-0.010
Token even
Token ID772
Feature activation-9.70e-02

Pos logit contributions
der+0.000
Stud+0.000
Ste+0.000
 tr+0.000
 Lu+0.000
Neg logit contributions
rily-0.000
 mental-0.000
bits-0.000
cks-0.000
bit-0.000
Token sad
Token ID6507
Feature activation+5.79e-02

Pos logit contributions
 r+0.042
der+0.037
 kn+0.034
 p+0.033
 tr+0.032
Neg logit contributions
rily-0.152
bits-0.149
bit-0.148
ings-0.147
cks-0.142
Tokender
Token ID1082
Feature activation-2.81e-01

Pos logit contributions
 mental+0.074
rily+0.071
cks+0.071
bits+0.070
bit+0.069
Neg logit contributions
der-0.046
 tr-0.036
 Lu-0.034
Stud-0.034
 pian-0.032
Token than
Token ID621
Feature activation-8.01e-04

Pos logit contributions
der+0.233
 tr+0.213
 Lu+0.209
Stud+0.209
Ste+0.203
Neg logit contributions
 mental-0.312
rily-0.299
cks-0.288
bits-0.285
bit-0.275
Token before
Token ID878
Feature activation+1.11e-02

Pos logit contributions
der+0.001
 tr+0.001
Stud+0.001
 Lu+0.001
Ste+0.001
Neg logit contributions
 mental-0.001
rily-0.001
cks-0.001
bits-0.001
bit-0.001
Token.
Token ID13
Feature activation+5.64e-03

Pos logit contributions
 mental+0.016
bit+0.016
rily+0.016
bits+0.016
cks+0.016
Neg logit contributions
der-0.006
 tr-0.005
 Lu-0.005
Stud-0.004
 sub-0.004
Token
Token ID198
Feature activation-4.85e-03

Pos logit contributions
 mental+0.006
rily+0.006
cks+0.006
bits+0.006
bit+0.005
Neg logit contributions
der-0.005
 tr-0.004
Stud-0.004
 Lu-0.004
Ste-0.004
Token
Token ID198
Feature activation-4.57e-03

Pos logit contributions
der+0.003
 Lu+0.002
Stud+0.002
Ste+0.002
<+0.002
Neg logit contributions
rily-0.007
 mental-0.006
bit-0.006
cks-0.006
bits-0.006
Token to
Token ID284
Feature activation-5.45e-04

Pos logit contributions
der+0.045
 tr+0.040
Stud+0.039
 Lu+0.039
Ste+0.039
Neg logit contributions
 mental-0.049
rily-0.047
cks-0.045
bits-0.044
bit-0.043
Token swim
Token ID9422
Feature activation-1.00e-03

Pos logit contributions
der+0.001
 tr+0.001
 Lu+0.000
Stud+0.000
 pian+0.000
Neg logit contributions
 mental-0.001
rily-0.001
cks-0.000
bits-0.000
bit-0.000
Token.
Token ID13
Feature activation+9.62e-04

Pos logit contributions
der+0.001
Stud+0.001
 Lu+0.001
 pian+0.001
Ste+0.001
Neg logit contributions
rily-0.001
 mental-0.001
bel-0.001
bits-0.001
cks-0.001
Token She
Token ID1375
Feature activation+1.33e-02

Pos logit contributions
 mental+0.001
rily+0.001
bits+0.001
cks+0.001
bit+0.001
Neg logit contributions
der-0.001
 tr-0.001
Stud-0.001
 Lu-0.001
Ste-0.001
Token sw
Token ID1509
Feature activation-6.06e-02

Pos logit contributions
 mental+0.016
rily+0.016
cks+0.015
bits+0.015
bit+0.014
Neg logit contributions
 tr-0.010
der-0.010
 Lu-0.009
Stud-0.009
 pian-0.009
Tokenam
Token ID321
Feature activation+1.95e-01

Pos logit contributions
der+0.091
 tr+0.088
arre+0.084
 Lu+0.084
Stud+0.083
Neg logit contributions
 mental-0.027
rily-0.020
cks-0.017
 granting-0.016
bits-0.016
Token every
Token ID790
Feature activation+9.91e-02

Pos logit contributions
 mental+0.234
rily+0.221
cks+0.218
bits+0.207
bit+0.206
Neg logit contributions
der-0.142
 tr-0.142
Stud-0.133
 Lu-0.132
Ste-0.128
Token chance
Token ID2863
Feature activation+1.18e-01

Pos logit contributions
 mental+0.129
cks+0.128
rily+0.120
anks+0.116
bit+0.116
Neg logit contributions
 tr-0.073
 fog-0.068
 Lu-0.067
der-0.064
Ste-0.062
Token she
Token ID673
Feature activation+1.54e-03

Pos logit contributions
 mental+0.148
rily+0.146
cks+0.139
bits+0.134
bit+0.133
Neg logit contributions
der-0.088
 tr-0.082
 pian-0.078
 Lu-0.078
Stud-0.077
Token could
Token ID714
Feature activation+6.17e-02

Pos logit contributions
 mental+0.002
rily+0.002
cks+0.002
bits+0.002
ulously+0.002
Neg logit contributions
 tr-0.001
der-0.001
 Lu-0.001
 r-0.001
 pian-0.001
Token -
Token ID532
Feature activation-3.52e-02

Pos logit contributions
 mental+0.066
rily+0.063
cks+0.061
bits+0.060
bit+0.058
Neg logit contributions
der-0.054
 tr-0.051
 Lu-0.049
Ste-0.047
Stud-0.047
Token She
Token ID1375
Feature activation+1.85e-03

Pos logit contributions
der+0.008
 Lu+0.008
 tr+0.008
Stud+0.007
Ste+0.007
Neg logit contributions
 mental-0.016
rily-0.015
cks-0.015
bits-0.014
bel-0.014
Token loved
Token ID6151
Feature activation-6.14e-02

Pos logit contributions
 mental+0.002
rily+0.002
cks+0.002
bits+0.002
bit+0.002
Neg logit contributions
der-0.002
Stud-0.001
 tr-0.001
 Lu-0.001
 escal-0.001
Token to
Token ID284
Feature activation-1.17e-02

Pos logit contributions
der+0.059
 tr+0.052
Stud+0.052
 Lu+0.052
Ste+0.051
Neg logit contributions
 mental-0.061
rily-0.058
cks-0.056
bits-0.056
bit-0.053
Token eat
Token ID4483
Feature activation+1.69e-02

Pos logit contributions
 tr+0.011
der+0.011
 Lu+0.010
 pian+0.010
Stud+0.010
Neg logit contributions
rily-0.011
 mental-0.011
cks-0.011
bits-0.010
bit-0.010
Token jelly
Token ID27644
Feature activation-1.07e-02

Pos logit contributions
rily+0.011
 mental+0.011
cks+0.010
 sandwiches+0.010
 slices+0.009
Neg logit contributions
der-0.021
 outage-0.021
Mad-0.020
Stud-0.020
leaf-0.019
Token sandwiches
Token ID29777
Feature activation+1.42e-01

Pos logit contributions
der+0.009
 tr+0.008
Ju+0.007
Stud+0.007
 Lu+0.007
Neg logit contributions
 mental-0.013
cks-0.012
ulously-0.012
 acquaintance-0.011
rily-0.011
Token.
Token ID13
Feature activation-9.81e-03

Pos logit contributions
 mental+0.176
rily+0.163
cks+0.163
bits+0.157
bit+0.154
Neg logit contributions
der-0.109
 tr-0.096
 Lu-0.090
Stud-0.089
Ste-0.088
Token One
Token ID1881
Feature activation-4.72e-02

Pos logit contributions
der+0.009
leaf+0.008
 tr+0.008
Stud+0.008
 escal+0.008
Neg logit contributions
 mental-0.010
rily-0.009
bits-0.009
bit-0.009
cks-0.008
Token day
Token ID1110
Feature activation+5.02e-02

Pos logit contributions
der+0.059
 tr+0.054
Stud+0.053
 pian+0.053
 Lu+0.053
Neg logit contributions
 mental-0.034
rily-0.031
bits-0.030
cks-0.029
bit-0.028
Token,
Token ID11
Feature activation-1.33e-02

Pos logit contributions
ings+0.019
itched+0.018
 wherever+0.018
 whenever+0.016
 mental+0.015
Neg logit contributions
der-0.078
leaf-0.077
Stud-0.073
 escal-0.072
 archae-0.071
Token her
Token ID607
Feature activation-2.80e-03

Pos logit contributions
der+0.009
 tr+0.009
 Lu+0.009
Ste+0.008
Stud+0.008
Neg logit contributions
 mental-0.017
cks-0.016
rily-0.016
bits-0.016
bit-0.016
Token mummy
Token ID38567
Feature activation+2.29e-03

Pos logit contributions
 mental+0.013
rily+0.012
bits+0.011
cks+0.011
bit+0.011
Neg logit contributions
der-0.010
 tr-0.009
Stud-0.009
 Lu-0.009
Ste-0.009
Token bear
Token ID6842
Feature activation+1.79e-02

Pos logit contributions
 mental+0.003
rily+0.002
cks+0.002
bits+0.002
bit+0.002
Neg logit contributions
der-0.002
 tr-0.002
Stud-0.002
 Lu-0.002
Ste-0.002
Token had
Token ID550
Feature activation-2.72e-02

Pos logit contributions
 mental+0.019
rily+0.018
cks+0.017
bits+0.017
bit+0.016
Neg logit contributions
der-0.016
 tr-0.015
Stud-0.014
 Lu-0.014
Ste-0.014
Token one
Token ID530
Feature activation-1.05e-01

Pos logit contributions
der+0.025
 tr+0.023
Stud+0.022
 Lu+0.022
Ste+0.022
Neg logit contributions
 mental-0.028
rily-0.028
cks-0.027
bits-0.026
bit-0.025
Token more
Token ID517
Feature activation-2.91e-02

Pos logit contributions
 tr+0.076
der+0.062
 pian+0.059
 p+0.057
 part+0.056
Neg logit contributions
rily-0.154
cks-0.143
bits-0.139
 mental-0.135
uces-0.131
Token suggestion
Token ID13052
Feature activation+1.71e-01

Pos logit contributions
 tr+0.022
der+0.022
 pian+0.021
 Lu+0.020
Stud+0.020
Neg logit contributions
 mental-0.034
rily-0.034
cks-0.033
bits-0.032
bit-0.031
Token.
Token ID13
Feature activation+5.47e-03

Pos logit contributions
 mental+0.228
rily+0.227
cks+0.220
bits+0.214
bit+0.213
Neg logit contributions
der-0.115
 tr-0.102
 Lu-0.092
Stud-0.089
 pian-0.083
Token She
Token ID1375
Feature activation+1.72e-02

Pos logit contributions
 mental+0.006
rily+0.006
cks+0.005
bits+0.005
bit+0.005
Neg logit contributions
der-0.005
 tr-0.004
 Lu-0.004
Stud-0.004
Ste-0.004
Token asked
Token ID1965
Feature activation-5.39e-03

Pos logit contributions
 mental+0.016
rily+0.015
bits+0.014
cks+0.014
bit+0.014
Neg logit contributions
der-0.017
Stud-0.016
 escal-0.015
 Lu-0.015
 pian-0.015
Token,
Token ID11
Feature activation+3.30e-04

Pos logit contributions
der+0.005
 Lu+0.004
 tr+0.004
 pian+0.004
Stud+0.004
Neg logit contributions
cks-0.006
 mental-0.006
bit-0.006
rily-0.006
bits-0.006
Token "
Token ID366
Feature activation-9.59e-03

Pos logit contributions
 mental+0.000
rily+0.000
bits+0.000
 granting+0.000
cks+0.000
Neg logit contributions
der-0.000
 escal-0.000
 tr-0.000
Stud-0.000
 pian-0.000
Token "
Token ID366
Feature activation-1.20e-02

Pos logit contributions
 mental+0.001
rily+0.001
bits+0.001
cks+0.001
 granting+0.001
Neg logit contributions
der-0.001
Stud-0.001
 pian-0.001
 escal-0.001
 tr-0.001
TokenWe
Token ID1135
Feature activation+2.00e-02

Pos logit contributions
der+0.011
 tr+0.010
 Lu+0.010
 escal+0.010
Stud+0.009
Neg logit contributions
 mental-0.012
rily-0.012
bits-0.011
cks-0.011
bel-0.011
Token don
Token ID836
Feature activation-2.24e-02

Pos logit contributions
 mental+0.016
rily+0.013
bits+0.013
bit+0.013
 collection+0.012
Neg logit contributions
der-0.025
 Lu-0.022
 tr-0.022
Stud-0.022
 pian-0.021
Token't
Token ID470
Feature activation-3.28e-03

Pos logit contributions
der+0.028
 tr+0.027
 archae+0.027
Ju+0.026
leaf+0.026
Neg logit contributions
 mental-0.015
cks-0.014
bits-0.013
rily-0.013
itched-0.013
Token have
Token ID423
Feature activation-4.98e-02

Pos logit contributions
der+0.004
 tr+0.003
Stud+0.003
 Lu+0.003
 pian+0.003
Neg logit contributions
 mental-0.003
bit-0.003
rily-0.003
bits-0.003
cks-0.003
Token enough
Token ID1576
Feature activation+1.50e-01

Pos logit contributions
der+0.055
 tr+0.050
Stud+0.050
 Lu+0.050
Ste+0.049
Neg logit contributions
 mental-0.043
rily-0.041
cks-0.039
bits-0.038
bit-0.037
Token money
Token ID1637
Feature activation+5.46e-03

Pos logit contributions
 mental+0.087
rily+0.080
 choice+0.075
cks+0.067
bits+0.067
Neg logit contributions
der-0.203
Stud-0.192
 Lu-0.190
 tr-0.189
arre-0.188
Token to
Token ID284
Feature activation+6.35e-03

Pos logit contributions
 mental+0.006
rily+0.006
cks+0.006
bits+0.006
bit+0.005
Neg logit contributions
der-0.005
 tr-0.004
 Lu-0.004
Stud-0.004
Ste-0.004
Token buy
Token ID2822
Feature activation+7.95e-02

Pos logit contributions
rily+0.007
bits+0.006
 mental+0.006
cks+0.006
bit+0.006
Neg logit contributions
Ste-0.005
Mad-0.005
 church-0.005
 escal-0.005
 Lu-0.005
Token the
Token ID262
Feature activation+9.85e-03

Pos logit contributions
 mental+0.068
rily+0.065
cks+0.059
bits+0.058
bit+0.056
Neg logit contributions
der-0.087
 tr-0.081
Stud-0.081
 outage-0.079
 pian-0.078
Token toy
Token ID13373
Feature activation-2.06e-02

Pos logit contributions
 mental+0.009
rily+0.008
cks+0.007
bits+0.007
bit+0.007
Neg logit contributions
der-0.011
 Lu-0.010
 tr-0.010
Stud-0.010
Ste-0.009
Token mom
Token ID1995
Feature activation-2.63e-03

Pos logit contributions
 tr+0.013
der+0.012
 Lu+0.012
 pian+0.010
Ste+0.010
Neg logit contributions
cks-0.026
bits-0.025
rily-0.024
 mental-0.024
bit-0.024
Token thought
Token ID1807
Feature activation-1.48e-02

Pos logit contributions
der+0.003
 escal+0.003
Stud+0.003
 optical+0.003
 archae+0.003
Neg logit contributions
 mental-0.002
rily-0.002
bits-0.002
bit-0.002
cks-0.002
Token about
Token ID546
Feature activation-1.26e-01

Pos logit contributions
der+0.009
 Lu+0.008
 tr+0.008
 escal+0.007
 pian+0.007
Neg logit contributions
cks-0.020
bit-0.020
rily-0.020
 mental-0.020
 acquaintance-0.019
Token it
Token ID340
Feature activation-9.05e-03

Pos logit contributions
der+0.105
 tr+0.095
 Lu+0.093
Stud+0.092
 pian+0.087
Neg logit contributions
 mental-0.144
rily-0.142
cks-0.137
bits-0.136
bit-0.135
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
der+0.004
 tr+0.003
 Lu+0.003
 just+0.002
Stud+0.002
Neg logit contributions
cks-0.016
rily-0.015
bit-0.015
bel-0.014
bits-0.014
Token finally
Token ID3443
Feature activation+1.42e-01

Pos logit contributions
der+0.015
 tr+0.013
 Lu+0.013
Stud+0.013
Ste+0.013
Neg logit contributions
 mental-0.019
rily-0.018
cks-0.018
bits-0.018
bit-0.017
Token agreed
Token ID4987
Feature activation-1.87e-02

Pos logit contributions
 mental+0.100
rily+0.094
bits+0.082
 able+0.080
 mun+0.078
Neg logit contributions
Stud-0.167
der-0.166
 pian-0.162
 ambul-0.160
 optical-0.159
Token.
Token ID13
Feature activation-4.59e-03

Pos logit contributions
der+0.012
 tr+0.010
 Lu+0.009
Ste+0.008
Stud+0.008
Neg logit contributions
cks-0.027
 mental-0.026
bit-0.025
bits-0.025
rily-0.025
Token Emma
Token ID18966
Feature activation-6.55e-02

Pos logit contributions
der+0.004
 tr+0.004
 Lu+0.004
Stud+0.004
Ste+0.004
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.005
bits-0.004
bit-0.004
Token was
Token ID373
Feature activation+4.46e-03

Pos logit contributions
der+0.061
Stud+0.055
 tr+0.054
 Lu+0.053
Ste+0.053
Neg logit contributions
 mental-0.067
rily-0.063
bits-0.060
cks-0.060
bit-0.058
Token relieved
Token ID22598
Feature activation+3.06e-03

Pos logit contributions
rily+0.004
 mental+0.004
cks+0.004
bit+0.004
bits+0.004
Neg logit contributions
 Lu-0.004
der-0.004
Ste-0.004
 tr-0.004
Stud-0.004
Token
Token ID198
Feature activation-3.70e-03

Pos logit contributions
 Lu+0.002
Ste+0.002
Stud+0.002
der+0.002
<+0.002
Neg logit contributions
rily-0.006
 mental-0.005
cks-0.005
bit-0.005
bits-0.005
TokenBut
Token ID1537
Feature activation-5.95e-02

Pos logit contributions
der+0.003
 tr+0.003
 pian+0.003
 Lu+0.003
 escal+0.003
Neg logit contributions
 mental-0.004
rily-0.004
bits-0.003
cks-0.003
ulously-0.003
Token then
Token ID788
Feature activation+4.86e-02

Pos logit contributions
der+0.056
 tr+0.051
Stud+0.051
 Lu+0.050
 pian+0.049
Neg logit contributions
 mental-0.059
rily-0.057
cks-0.054
bits-0.054
bit-0.052
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
 mental+0.050
rily+0.048
cks+0.045
bits+0.045
bit+0.044
Neg logit contributions
der-0.044
 tr-0.039
Stud-0.039
 Lu-0.038
Ste-0.038
Token something
Token ID1223
Feature activation-7.64e-02

Pos logit contributions
der+0.013
Stud+0.011
 tr+0.011
 pian+0.011
 escal+0.011
Neg logit contributions
 mental-0.016
rily-0.015
bits-0.014
cks-0.014
bit-0.014
Token incredible
Token ID8082
Feature activation+1.47e-03

Pos logit contributions
der+0.072
Stud+0.064
 Lu+0.063
 tr+0.063
Ste+0.062
Neg logit contributions
 mental-0.079
rily-0.072
cks-0.068
bits-0.067
bel-0.065
Token happened
Token ID3022
Feature activation-2.80e-02

Pos logit contributions
 mental+0.002
rily+0.002
cks+0.002
bits+0.002
bit+0.002
Neg logit contributions
der-0.001
 tr-0.001
 Lu-0.001
Stud-0.001
 pian-0.001
Token.
Token ID13
Feature activation-1.84e-03

Pos logit contributions
der+0.015
 tr+0.013
Stud+0.011
 Lu+0.011
 sub+0.010
Neg logit contributions
 mental-0.042
rily-0.039
bit-0.039
bits-0.038
cks-0.038
Token Lily
Token ID20037
Feature activation+6.54e-03

Pos logit contributions
der+0.002
 tr+0.002
Stud+0.002
 archae+0.002
 pian+0.002
Neg logit contributions
 mental-0.002
rily-0.002
 Day-0.002
bits-0.002
 acquaintance-0.001
Token's
Token ID338
Feature activation-5.23e-02

Pos logit contributions
 mental+0.008
rily+0.008
cks+0.008
bits+0.008
bit+0.008
Neg logit contributions
 tr-0.005
der-0.005
 Lu-0.004
arre-0.004
 pian-0.004
Token mom
Token ID1995
Feature activation+7.46e-04

Pos logit contributions
 tr+0.039
der+0.031
 fog+0.028
 Lu+0.027
 pian+0.025
Neg logit contributions
cks-0.079
bits-0.072
rily-0.072
 mental-0.071
bit-0.069
Token helmet
Token ID14335
Feature activation-4.46e-02

Pos logit contributions
 mental+0.013
rily+0.013
bit+0.012
bits+0.012
cks+0.012
Neg logit contributions
der-0.023
 Lu-0.022
Stud-0.021
Ste-0.021
leaf-0.021
Token and
Token ID290
Feature activation-1.65e-02

Pos logit contributions
der+0.029
 tr+0.027
 just+0.021
 pian+0.019
Stud+0.018
Neg logit contributions
 mental-0.066
cks-0.062
rily-0.061
 acquaintance-0.058
bits-0.057
Token strong
Token ID1913
Feature activation-1.69e-02

Pos logit contributions
der+0.012
 tr+0.012
 Lu+0.011
 pian+0.011
Stud+0.010
Neg logit contributions
 mental-0.020
rily-0.020
cks-0.019
bits-0.018
bit-0.017
Token armour
Token ID18588
Feature activation+8.33e-02

Pos logit contributions
 tr+0.015
der+0.015
 pian+0.013
 fog+0.012
 sub+0.012
Neg logit contributions
rily-0.019
 mental-0.019
cks-0.018
glers-0.017
 acquaintance-0.017
Token with
Token ID351
Feature activation+2.73e-02

Pos logit contributions
 mental+0.134
cks+0.123
rily+0.122
 acquaintance+0.117
uces+0.116
Neg logit contributions
der-0.048
 tr-0.037
 just-0.033
 inside-0.025
 outage-0.023
Token a
Token ID257
Feature activation-1.20e-03

Pos logit contributions
 mental+0.042
rily+0.042
cks+0.038
ucker+0.038
uces+0.037
Neg logit contributions
 tr-0.013
der-0.013
 lightning-0.011
 all-0.010
 fog-0.010
Token shield
Token ID7614
Feature activation+5.95e-04

Pos logit contributions
der+0.001
 Lu+0.001
Stud+0.001
Ste+0.001
 escal+0.001
Neg logit contributions
 mental-0.001
rily-0.001
bit-0.001
cks-0.001
bel-0.001
Token to
Token ID284
Feature activation-6.07e-03

Pos logit contributions
 mental+0.001
rily+0.001
ulously+0.001
 acquaintance+0.001
bits+0.001
Neg logit contributions
der-0.000
 tr-0.000
 just-0.000
 inside-0.000
 pian-0.000
Token protect
Token ID1805
Feature activation-1.65e-02

Pos logit contributions
der+0.006
 tr+0.005
 Lu+0.005
Stud+0.005
 pian+0.005
Neg logit contributions
 mental-0.006
rily-0.006
cks-0.006
bits-0.006
bit-0.005
Token him
Token ID683
Feature activation+1.08e-01

Pos logit contributions
der+0.012
 tr+0.012
 Lu+0.011
 escal+0.010
Stud+0.010
Neg logit contributions
 mental-0.020
rily-0.018
 mun-0.018
cks-0.018
 acquaintance-0.018
Token.
Token ID13
Feature activation-5.47e-03

Pos logit contributions
 mental+0.164
bits+0.156
ulously+0.151
cks+0.150
 Deliver+0.149
Neg logit contributions
der-0.053
 tr-0.050
 escal-0.035
 sub-0.033
 Lu-0.030
Token and
Token ID290
Feature activation-5.96e-03

Pos logit contributions
 mental+0.008
rily+0.008
cks+0.008
bits+0.008
bit+0.007
Neg logit contributions
der-0.006
 tr-0.006
Stud-0.006
 Lu-0.006
Ste-0.005
Token had
Token ID550
Feature activation-2.72e-02

Pos logit contributions
der+0.005
 tr+0.004
 Lu+0.004
Ste+0.004
Stud+0.004
Neg logit contributions
 mental-0.007
rily-0.007
cks-0.007
bits-0.007
bit-0.007
Token to
Token ID284
Feature activation+4.58e-03

Pos logit contributions
 pian+0.030
der+0.030
leaf+0.030
 Lu+0.028
Stud+0.028
Neg logit contributions
 mental-0.020
bits-0.018
ulously-0.018
rily-0.017
 Day-0.017
Token take
Token ID1011
Feature activation+3.52e-02

Pos logit contributions
 mental+0.004
rily+0.004
cks+0.004
bits+0.004
bit+0.004
Neg logit contributions
der-0.005
 tr-0.004
 Lu-0.004
Stud-0.004
Ste-0.004
Token a
Token ID257
Feature activation+1.01e-02

Pos logit contributions
 mental+0.036
rily+0.034
cks+0.033
bits+0.033
bit+0.032
Neg logit contributions
der-0.033
Stud-0.029
 tr-0.029
Ste-0.029
 Lu-0.029
Token nap
Token ID25422
Feature activation-1.93e-03

Pos logit contributions
rily+0.011
cks+0.011
bits+0.011
 mental+0.011
bit+0.011
Neg logit contributions
 tr-0.008
der-0.008
 Lu-0.008
Ste-0.007
 pian-0.007
Token.
Token ID13
Feature activation+5.11e-03

Pos logit contributions
der+0.001
 tr+0.001
 Lu+0.001
Ju+0.001
Stud+0.001
Neg logit contributions
 mental-0.003
cks-0.003
bit-0.003
 acquaintance-0.002
bits-0.002
Token
Token ID220
Feature activation-1.63e-03

Pos logit contributions
 mental+0.004
rily+0.004
bits+0.004
 Day+0.004
bit+0.003
Neg logit contributions
der-0.006
 pian-0.005
leaf-0.005
Stud-0.005
 archae-0.005
Token
Token ID198
Feature activation-1.75e-03

Pos logit contributions
der+0.001
 Lu+0.001
Stud+0.001
 tr+0.001
Ste+0.001
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.002
bit-0.002
TokenWhen
Token ID2215
Feature activation+6.69e-02

Pos logit contributions
der+0.002
 tr+0.002
 pian+0.002
 Lu+0.002
 escal+0.002
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.001
bits-0.001
bit-0.001
Token he
Token ID339
Feature activation-7.82e-03

Pos logit contributions
 mental+0.064
rily+0.058
bits+0.056
cks+0.056
bit+0.053
Neg logit contributions
der-0.069
Stud-0.061
 pian-0.060
 tr-0.060
leaf-0.060
TokenSorry
Token ID14385
Feature activation-8.37e-02

Pos logit contributions
der+0.025
 tr+0.023
 escal+0.022
 Lu+0.022
 optical+0.021
Neg logit contributions
 mental-0.020
rily-0.019
cks-0.018
bit-0.018
bel-0.017
Token,
Token ID11
Feature activation-3.40e-03

Pos logit contributions
der+0.076
Stud+0.069
 tr+0.068
Ste+0.068
 Lu+0.067
Neg logit contributions
 mental-0.085
rily-0.081
cks-0.077
bits-0.076
bit-0.075
Token Lily
Token ID20037
Feature activation+1.30e-02

Pos logit contributions
der+0.003
Stud+0.003
 tr+0.003
Ste+0.003
 Lu+0.003
Neg logit contributions
 mental-0.003
rily-0.003
cks-0.003
bits-0.003
bit-0.003
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
 mental+0.020
rily+0.020
cks+0.019
bits+0.019
bel+0.018
Neg logit contributions
der-0.006
 Lu-0.005
 tr-0.005
arre-0.004
Ste-0.004
Token I
Token ID314
Feature activation-5.53e-02

Pos logit contributions
 mental+0.010
rily+0.010
cks+0.009
bits+0.009
bit+0.009
Neg logit contributions
der-0.011
 tr-0.010
Stud-0.010
Ste-0.009
 Lu-0.009
Token did
Token ID750
Feature activation+1.81e-02

Pos logit contributions
der+0.066
 escal+0.062
Stud+0.059
 pian+0.058
 Lu+0.057
Neg logit contributions
 mental-0.046
 choice-0.038
 acquaintance-0.037
 mun-0.037
bit-0.036
Token not
Token ID407
Feature activation-1.25e-02

Pos logit contributions
 mental+0.014
cks+0.013
 choice+0.011
rily+0.011
bit+0.011
Neg logit contributions
der-0.022
 Lu-0.020
 escal-0.020
Stud-0.020
leaf-0.020
Token mean
Token ID1612
Feature activation-4.07e-02

Pos logit contributions
der+0.014
 Lu+0.013
 escal+0.012
Stud+0.012
 tr+0.012
Neg logit contributions
 mental-0.011
bit-0.010
cks-0.010
bits-0.009
 choice-0.009
Token to
Token ID284
Feature activation+2.39e-03

Pos logit contributions
der+0.035
 tr+0.031
 Lu+0.031
Stud+0.031
Ste+0.030
Neg logit contributions
 mental-0.045
rily-0.044
cks-0.042
bits-0.041
bit-0.040
Token be
Token ID307
Feature activation-3.98e-02

Pos logit contributions
rily+0.003
 mental+0.003
bits+0.003
cks+0.002
bit+0.002
Neg logit contributions
der-0.002
 tr-0.002
 Lu-0.002
Stud-0.002
 pian-0.002
Token mean
Token ID1612
Feature activation-3.87e-02

Pos logit contributions
der+0.035
 tr+0.031
Ste+0.030
 Lu+0.030
Stud+0.030
Neg logit contributions
 mental-0.043
rily-0.042
bits-0.040
cks-0.039
bit-0.038
Token.
Token ID13
Feature activation+5.03e-03

Pos logit contributions
der+0.026
 tr+0.022
 Lu+0.018
 escal+0.018
Stud+0.018
Neg logit contributions
 mental-0.066
bit-0.061
rily-0.061
 acquaintance-0.058
bits-0.058
Token She
Token ID1375
Feature activation+1.60e-02

Pos logit contributions
cks+0.007
rily+0.007
 mental+0.006
bit+0.006
bits+0.006
Neg logit contributions
der-0.003
Stud-0.003
 tr-0.003
 Lu-0.003
 escal-0.003
Token got
Token ID1392
Feature activation-6.57e-02

Pos logit contributions
 mental+0.009
am+0.008
 swoop+0.008
ulously+0.008
 sudden+0.008
Neg logit contributions
der-0.021
leaf-0.020
 archae-0.020
 outage-0.020
 escal-0.020
Token excited
Token ID6568
Feature activation+5.97e-02

Pos logit contributions
der+0.059
 tr+0.053
Stud+0.052
 Lu+0.052
Ste+0.051
Neg logit contributions
 mental-0.070
rily-0.067
cks-0.064
bits-0.063
bit-0.062
Token and
Token ID290
Feature activation-5.57e-03

Pos logit contributions
 mental+0.079
rily+0.075
cks+0.074
bit+0.072
bits+0.071
Neg logit contributions
der-0.042
 tr-0.037
 Lu-0.034
Stud-0.034
 escal-0.033
Token ran
Token ID4966
Feature activation+4.70e-02

Pos logit contributions
der+0.005
 tr+0.005
 Lu+0.004
Stud+0.004
Ste+0.004
Neg logit contributions
 mental-0.006
rily-0.006
cks-0.005
bits-0.005
bit-0.005
Token up
Token ID510
Feature activation+4.02e-02

Pos logit contributions
 mental+0.025
 wherever+0.024
bits+0.023
bel+0.022
 whenever+0.019
Neg logit contributions
der-0.072
arre-0.065
 pian-0.063
 ambul-0.059
 mosquit-0.058
Token to
Token ID284
Feature activation+3.55e-03

Pos logit contributions
 mental+0.041
rily+0.039
cks+0.038
bits+0.037
bit+0.036
Neg logit contributions
der-0.038
 tr-0.034
Stud-0.033
 Lu-0.033
Ste-0.032
Token it
Token ID340
Feature activation+4.45e-03

Pos logit contributions
 mental+0.003
rily+0.003
bits+0.003
cks+0.003
bit+0.003
Neg logit contributions
der-0.004
Stud-0.004
 pian-0.003
 tr-0.003
Ste-0.003
Token.
Token ID13
Feature activation+5.72e-03

Pos logit contributions
 mental+0.005
rily+0.005
cks+0.005
bits+0.005
bit+0.004
Neg logit contributions
der-0.004
 tr-0.003
Stud-0.003
 Lu-0.003
Ste-0.003
Token She
Token ID1375
Feature activation+1.68e-02

Pos logit contributions
 mental+0.007
rily+0.007
cks+0.007
bit+0.006
bits+0.006
Neg logit contributions
der-0.004
 tr-0.004
Stud-0.004
 Lu-0.004
 escal-0.004
Token Lily
Token ID20037
Feature activation+1.64e-02

Pos logit contributions
der+0.001
 tr+0.001
 pian+0.001
 Lu+0.001
Stud+0.001
Neg logit contributions
cks-0.002
 mental-0.002
bits-0.002
rily-0.002
bit-0.002
Token.
Token ID13
Feature activation+6.93e-03

Pos logit contributions
 mental+0.024
rily+0.024
cks+0.023
bits+0.023
bel+0.023
Neg logit contributions
der-0.008
 Lu-0.007
 tr-0.006
Ju-0.005
 pian-0.005
Token Did
Token ID7731
Feature activation+6.70e-02

Pos logit contributions
 mental+0.007
rily+0.007
cks+0.006
bits+0.006
bit+0.006
Neg logit contributions
der-0.007
 tr-0.006
Stud-0.006
 Lu-0.006
Ste-0.006
Token you
Token ID345
Feature activation+1.49e-02

Pos logit contributions
 mental+0.079
rily+0.077
bits+0.074
cks+0.072
bel+0.070
Neg logit contributions
der-0.055
 tr-0.050
 Lu-0.049
 pian-0.046
Stud-0.045
Token have
Token ID423
Feature activation-5.25e-02

Pos logit contributions
 mental+0.010
bit+0.009
 mun+0.009
 achieve+0.008
 wherever+0.008
Neg logit contributions
der-0.020
Stud-0.018
 escal-0.018
 Lu-0.018
 pian-0.018
Token fun
Token ID1257
Feature activation-1.97e-02

Pos logit contributions
der+0.038
 tr+0.037
Ste+0.036
 Lu+0.035
Stud+0.034
Neg logit contributions
rily-0.064
 mental-0.063
bits-0.058
cks-0.058
bel-0.056
Token at
Token ID379
Feature activation-7.86e-02

Pos logit contributions
der+0.019
Ste+0.018
 escal+0.017
Stud+0.017
 Lu+0.017
Neg logit contributions
rily-0.017
cks-0.017
 mental-0.017
bits-0.017
bel-0.016
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
der+0.070
 tr+0.064
 Lu+0.063
Stud+0.063
Ste+0.061
Neg logit contributions
 mental-0.083
rily-0.080
cks-0.077
bits-0.075
bit-0.074
Token park
Token ID3952
Feature activation-2.04e-02

Pos logit contributions
rily+0.016
cks+0.015
 mental+0.015
bits+0.015
bit+0.014
Neg logit contributions
 tr-0.006
der-0.005
 escal-0.005
 fog-0.005
 pian-0.005
Token?"
Token ID1701
Feature activation-8.06e-02

Pos logit contributions
der+0.008
 tr+0.008
Ju+0.007
 pian+0.006
 Lu+0.006
Neg logit contributions
 mental-0.033
cks-0.031
rily-0.031
bits-0.030
bit-0.030
Token Lily
Token ID20037
Feature activation+1.78e-02

Pos logit contributions
der+0.081
 tr+0.074
Stud+0.073
 Lu+0.072
 escal+0.071
Neg logit contributions
 mental-0.077
rily-0.072
bits-0.068
cks-0.068
bit-0.065
Token too
Token ID1165
Feature activation-1.09e-01

Pos logit contributions
 mental+0.067
rily+0.063
bits+0.061
cks+0.060
bit+0.059
Neg logit contributions
der-0.048
 tr-0.045
 Lu-0.042
 pian-0.041
Stud-0.041
Token deep
Token ID2769
Feature activation-2.91e-03

Pos logit contributions
der+0.120
Stud+0.109
 tr+0.108
 pian+0.106
 Lu+0.106
Neg logit contributions
 mental-0.093
rily-0.088
cks-0.086
bits-0.085
bit-0.080
Token!"
Token ID2474
Feature activation-6.15e-03

Pos logit contributions
der+0.002
 tr+0.002
 Lu+0.002
Stud+0.002
Ste+0.002
Neg logit contributions
 mental-0.003
rily-0.003
cks-0.003
bits-0.003
bit-0.003
Token she
Token ID673
Feature activation+5.80e-04

Pos logit contributions
der+0.006
 tr+0.005
Stud+0.005
 Lu+0.005
Ste+0.005
Neg logit contributions
 mental-0.006
rily-0.006
cks-0.006
bits-0.006
bit-0.006
Token said
Token ID531
Feature activation-8.34e-03

Pos logit contributions
 mental+0.000
rily+0.000
bits+0.000
bit+0.000
cks+0.000
Neg logit contributions
der-0.001
Stud-0.001
 escal-0.001
leaf-0.001
 pian-0.001
Token,
Token ID11
Feature activation-4.40e-03

Pos logit contributions
der+0.006
 tr+0.006
 Lu+0.005
Stud+0.005
Ste+0.005
Neg logit contributions
 mental-0.010
cks-0.010
rily-0.010
bits-0.009
bit-0.009
Token pointing
Token ID10609
Feature activation+3.76e-03

Pos logit contributions
der+0.005
 tr+0.004
Stud+0.004
 Lu+0.004
Ste+0.004
Neg logit contributions
 mental-0.004
rily-0.004
cks-0.004
bits-0.003
bit-0.003
Token at
Token ID379
Feature activation-7.95e-02

Pos logit contributions
rily+0.003
 mental+0.003
bits+0.003
cks+0.003
bel+0.003
Neg logit contributions
der-0.004
Stud-0.004
Ste-0.004
Mad-0.004
 pian-0.004
Token the
Token ID262
Feature activation+9.08e-03

Pos logit contributions
der+0.047
 tr+0.042
 Lu+0.042
 lightning+0.035
 earthquakes+0.033
Neg logit contributions
 mental-0.116
rily-0.111
bit-0.105
cks-0.105
astered-0.103
Token soil
Token ID9260
Feature activation-7.33e-02

Pos logit contributions
 mental+0.008
rily+0.007
bits+0.006
bit+0.006
 ultimate+0.006
Neg logit contributions
 pian-0.010
Mad-0.010
 Lu-0.010
der-0.010
Stud-0.009
Token.
Token ID13
Feature activation+7.67e-04

Pos logit contributions
der+0.044
 tr+0.030
 bud+0.019
 r+0.019
 p+0.018
Neg logit contributions
 mental-0.127
bit-0.113
 acquaintance-0.111
cks-0.110
ulously-0.108
Token money
Token ID1637
Feature activation-1.08e-03

Pos logit contributions
cks+0.000
rily+0.000
bits+0.000
ases+0.000
 mental+0.000
Neg logit contributions
 tr-0.000
der-0.000
Ste-0.000
 Lu-0.000
 escal-0.000
Token to
Token ID284
Feature activation-4.77e-03

Pos logit contributions
der+0.001
 tr+0.001
 Lu+0.001
Ste+0.001
 escal+0.001
Neg logit contributions
 mental-0.001
cks-0.001
rily-0.001
bit-0.001
bits-0.001
Token pay
Token ID1414
Feature activation+5.29e-02

Pos logit contributions
Ste+0.004
 Lu+0.004
Ju+0.004
 tr+0.004
der+0.003
Neg logit contributions
cks-0.005
rily-0.005
bits-0.005
 mental-0.005
bit-0.005
Token for
Token ID329
Feature activation-4.09e-02

Pos logit contributions
 mental+0.068
cks+0.067
rily+0.065
 acquaintance+0.064
bit+0.064
Neg logit contributions
der-0.034
 tr-0.029
 Lu-0.029
Stud-0.026
Ste-0.026
Token it
Token ID340
Feature activation-5.84e-03

Pos logit contributions
 Lu+0.028
der+0.028
Ste+0.026
 tr+0.026
 escal+0.025
Neg logit contributions
 mental-0.049
rily-0.048
bit-0.048
cks-0.047
bits-0.047
Token and
Token ID290
Feature activation-1.54e-02

Pos logit contributions
der+0.003
 tr+0.003
 Lu+0.003
Ste+0.002
 escal+0.002
Neg logit contributions
cks-0.008
 mental-0.008
rily-0.008
bit-0.008
bits-0.008
Token the
Token ID262
Feature activation+1.61e-03

Pos logit contributions
der+0.012
 Lu+0.012
 tr+0.011
Ste+0.010
 escal+0.010
Neg logit contributions
cks-0.018
rily-0.017
bit-0.017
 mental-0.017
bits-0.017
Token little
Token ID1310
Feature activation+3.11e-02

Pos logit contributions
cks+0.002
rily+0.002
bits+0.002
 mental+0.002
bit+0.002
Neg logit contributions
 tr-0.001
der-0.001
Ste-0.001
 p-0.001
 escal-0.001
Token girl
Token ID2576
Feature activation+1.71e-02

Pos logit contributions
cks+0.049
 mental+0.047
bits+0.044
rily+0.042
bit+0.042
Neg logit contributions
 tr-0.017
der-0.016
 Lu-0.015
 pian-0.015
 p-0.013
Token was
Token ID373
Feature activation+7.87e-03

Pos logit contributions
cks+0.022
 mental+0.020
rily+0.020
bits+0.020
bit+0.020
Neg logit contributions
der-0.012
 tr-0.011
 Lu-0.011
Ste-0.010
Ju-0.010
Token very
Token ID845
Feature activation+1.07e-02

Pos logit contributions
bit+0.009
cks+0.009
bits+0.009
 mental+0.009
rily+0.009
Neg logit contributions
Ste-0.006
 Lu-0.005
der-0.005
Mad-0.005
 just-0.005
Token said
Token ID531
Feature activation-2.93e-03

Pos logit contributions
cks+0.031
rily+0.029
bit+0.029
bits+0.029
 acquaintance+0.028
Neg logit contributions
 tr-0.012
der-0.011
 kn-0.011
 just-0.010
 Lu-0.010
Token,
Token ID11
Feature activation-4.56e-04

Pos logit contributions
der+0.002
 tr+0.002
 Lu+0.002
Stud+0.002
 pian+0.002
Neg logit contributions
 mental-0.004
cks-0.004
rily-0.004
bits-0.004
bit-0.004
Token "
Token ID366
Feature activation-1.42e-02

Pos logit contributions
der+0.000
 tr+0.000
Stud+0.000
 Lu+0.000
 pian+0.000
Neg logit contributions
 mental-0.001
rily-0.000
cks-0.000
bits-0.000
bit-0.000
TokenBut
Token ID1537
Feature activation-5.88e-02

Pos logit contributions
der+0.013
 tr+0.011
Stud+0.011
 Lu+0.011
Ste+0.011
Neg logit contributions
 mental-0.015
rily-0.015
cks-0.014
bits-0.014
bit-0.013
Token you
Token ID345
Feature activation+1.01e-02

Pos logit contributions
der+0.051
 tr+0.046
 Lu+0.045
Stud+0.044
 pian+0.043
Neg logit contributions
 mental-0.065
rily-0.062
cks-0.061
bits-0.060
bit-0.058
Token know
Token ID760
Feature activation-2.33e-02

Pos logit contributions
 mental+0.008
bit+0.007
bits+0.006
ulously+0.006
 breakthrough+0.006
Neg logit contributions
der-0.012
 Lu-0.011
leaf-0.011
 pian-0.011
 escal-0.011
Token what
Token ID644
Feature activation+2.31e-03

Pos logit contributions
der+0.022
Stud+0.021
Ste+0.021
 tr+0.020
leaf+0.020
Neg logit contributions
 mental-0.022
rily-0.021
cks-0.020
bits-0.020
bit-0.019
Token we
Token ID356
Feature activation-3.54e-02

Pos logit contributions
 mental+0.002
 Day+0.002
cks+0.002
 granting+0.002
bits+0.002
Neg logit contributions
der-0.002
 escal-0.002
 Lu-0.002
 tr-0.002
Ste-0.002
Token can
Token ID460
Feature activation+1.73e-02

Pos logit contributions
der+0.037
 Lu+0.033
 tr+0.032
 pian+0.032
Stud+0.032
Neg logit contributions
 mental-0.035
bits-0.030
cks-0.030
rily-0.030
bit-0.029
Token do
Token ID466
Feature activation+5.11e-02

Pos logit contributions
 mental+0.018
rily+0.018
cks+0.017
bits+0.016
bit+0.016
Neg logit contributions
der-0.016
 tr-0.014
Stud-0.014
 Lu-0.014
Ste-0.014
Token?
Token ID30
Feature activation-7.07e-02

Pos logit contributions
 mental+0.054
rily+0.052
cks+0.050
bits+0.049
bit+0.048
Neg logit contributions
der-0.046
 tr-0.041
Stud-0.041
 Lu-0.040
Ste-0.039
Token like
Token ID588
Feature activation-1.26e-02

Pos logit contributions
der+0.012
 tr+0.010
 Lu+0.008
 just+0.007
 pian+0.007
Neg logit contributions
 mental-0.037
cks-0.035
rily-0.034
bits-0.034
ulously-0.033
Token him
Token ID683
Feature activation+1.17e-01

Pos logit contributions
der+0.007
 tr+0.006
 Lu+0.006
 fog+0.006
 just+0.005
Neg logit contributions
 mental-0.020
cks-0.018
bel-0.018
rily-0.018
bits-0.018
Token.
Token ID13
Feature activation+2.82e-03

Pos logit contributions
 mental+0.189
rily+0.181
cks+0.180
bits+0.176
ulously+0.173
Neg logit contributions
der-0.050
 tr-0.047
 Lu-0.036
 just-0.032
Ste-0.028
Token
Token ID220
Feature activation-3.16e-03

Pos logit contributions
 mental+0.003
rily+0.003
cks+0.003
bits+0.003
bit+0.003
Neg logit contributions
der-0.003
 tr-0.002
 Lu-0.002
Stud-0.002
Ste-0.002
Token
Token ID198
Feature activation-3.91e-03

Pos logit contributions
der+0.002
 Lu+0.002
Stud+0.002
 tr+0.002
Ste+0.002
Neg logit contributions
 mental-0.004
rily-0.004
bits-0.004
cks-0.004
bit-0.004
Token
Token ID198
Feature activation-4.22e-03

Pos logit contributions
Ste+0.002
 Lu+0.002
Stud+0.002
<+0.001
 tr+0.001
Neg logit contributions
rily-0.006
 mental-0.006
elled-0.006
cks-0.006
 choice-0.006
TokenThe
Token ID464
Feature activation+8.21e-03

Pos logit contributions
der+0.003
Stud+0.003
Ste+0.003
 tr+0.003
 Lu+0.003
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.005
bits-0.004
bit-0.004
Token old
Token ID1468
Feature activation+2.18e-02

Pos logit contributions
rily+0.011
cks+0.010
 mental+0.010
bits+0.010
bel+0.010
Neg logit contributions
der-0.005
Ste-0.005
 tr-0.005
 church-0.005
 escal-0.005
Token pigeon
Token ID46387
Feature activation-2.20e-02

Pos logit contributions
 mental+0.024
rily+0.023
cks+0.022
bits+0.021
bit+0.021
Neg logit contributions
der-0.019
 tr-0.017
 Lu-0.017
Stud-0.017
 pian-0.017
Token was
Token ID373
Feature activation+1.69e-02

Pos logit contributions
der+0.019
 tr+0.018
 Lu+0.017
Stud+0.016
Ste+0.016
Neg logit contributions
 mental-0.024
rily-0.023
cks-0.023
bits-0.022
bit-0.021
Token so
Token ID523
Feature activation+1.15e-01

Pos logit contributions
 mental+0.020
rily+0.019
cks+0.019
bel+0.019
 trigger+0.018
Neg logit contributions
 Lu-0.012
Ste-0.012
 just-0.011
 part-0.011
der-0.011
Token
Token ID198
Feature activation+7.50e-05

Pos logit contributions
Stud+0.000
der+0.000
 Lu+0.000
<+0.000
Ste+0.000
Neg logit contributions
rily-0.000
 mental-0.000
bits-0.000
bit-0.000
elled-0.000
TokenThe
Token ID464
Feature activation+3.57e-03

Pos logit contributions
 mental+0.000
rily+0.000
cks+0.000
bits+0.000
bit+0.000
Neg logit contributions
der-0.000
 tr-0.000
 Lu-0.000
Stud-0.000
 pian-0.000
Token audience
Token ID5386
Feature activation-1.31e-01

Pos logit contributions
 mental+0.002
 choice+0.002
bel+0.002
rily+0.002
 acquaintance+0.002
Neg logit contributions
der-0.004
Ste-0.004
Stud-0.004
arre-0.004
Mad-0.004
Token cheers
Token ID34550
Feature activation+6.30e-02

Pos logit contributions
der+0.114
Ste+0.106
Stud+0.102
 Lu+0.101
Mad+0.100
Neg logit contributions
 mental-0.137
cks-0.132
rily-0.124
bits-0.121
bit-0.118
Token and
Token ID290
Feature activation-8.89e-03

Pos logit contributions
 mental+0.101
 acquaintance+0.098
cks+0.098
rily+0.094
ulously+0.093
Neg logit contributions
der-0.036
 tr-0.024
lessly-0.018
 just-0.017
 pian-0.017
Token cl
Token ID537
Feature activation-1.59e-01

Pos logit contributions
der+0.009
 tr+0.008
 Lu+0.008
Stud+0.007
 pian+0.007
Neg logit contributions
 mental-0.009
rily-0.009
bits-0.008
cks-0.008
bit-0.008
Tokenaps
Token ID1686
Feature activation+7.35e-02

Pos logit contributions
der+0.219
 tr+0.211
 escal+0.207
 Lu+0.207
Stud+0.205
Neg logit contributions
 mental-0.093
cks-0.089
rily-0.086
bit-0.081
bits-0.081
Token.
Token ID13
Feature activation+8.16e-03

Pos logit contributions
 mental+0.114
 acquaintance+0.111
cks+0.110
uces+0.109
 mun+0.107
Neg logit contributions
der-0.039
 tr-0.027
lessly-0.022
 Lu-0.021
 fog-0.021
Token They
Token ID1119
Feature activation+2.62e-03

Pos logit contributions
 mental+0.007
rily+0.007
cks+0.007
bits+0.007
bit+0.006
Neg logit contributions
der-0.009
 tr-0.008
 Lu-0.008
Stud-0.008
Ste-0.007
Token are
Token ID389
Feature activation-4.87e-02

Pos logit contributions
 mental+0.002
cks+0.002
bit+0.002
rily+0.002
bits+0.002
Neg logit contributions
der-0.003
Stud-0.003
Ste-0.002
 Lu-0.002
Mad-0.002
Token relieved
Token ID22598
Feature activation+5.85e-03

Pos logit contributions
der+0.035
 tr+0.034
 Lu+0.034
 pian+0.032
Stud+0.029
Neg logit contributions
rily-0.058
 mental-0.058
bits-0.054
ulously-0.053
bit-0.052
Token together
Token ID1978
Feature activation-1.54e-02

Pos logit contributions
 mental+0.049
rily+0.047
cks+0.045
bits+0.045
bit+0.043
Neg logit contributions
der-0.042
 tr-0.038
Stud-0.037
 Lu-0.037
Ste-0.036
Token!"
Token ID2474
Feature activation-2.29e-02

Pos logit contributions
der+0.013
 tr+0.011
 Lu+0.011
Stud+0.011
Ste+0.011
Neg logit contributions
 mental-0.018
rily-0.017
cks-0.016
bits-0.016
bit-0.016
Token But
Token ID887
Feature activation-6.19e-02

Pos logit contributions
 Lu+0.016
der+0.015
 tr+0.015
Ste+0.015
Stud+0.014
Neg logit contributions
 mental-0.029
cks-0.028
rily-0.027
bits-0.027
bit-0.026
Token Mitt
Token ID16627
Feature activation+5.18e-02

Pos logit contributions
der+0.048
 Lu+0.048
 tr+0.046
Ste+0.042
 pian+0.041
Neg logit contributions
 mental-0.073
cks-0.069
rily-0.068
bits-0.067
bit-0.063
Tokenens
Token ID641
Feature activation+1.26e-03

Pos logit contributions
rily+0.056
 mental+0.056
cks+0.054
bel+0.051
bits+0.050
Neg logit contributions
der-0.046
 tr-0.041
 Lu-0.041
Ste-0.039
Ju-0.039
Token just
Token ID655
Feature activation-1.60e-01

Pos logit contributions
 mental+0.001
bit+0.001
rily+0.001
cks+0.001
am+0.001
Neg logit contributions
leaf-0.001
Stud-0.001
Mad-0.001
 escal-0.001
 pian-0.001
Token wanted
Token ID2227
Feature activation-7.47e-03

Pos logit contributions
der+0.141
 tr+0.127
 Lu+0.125
Stud+0.124
Ste+0.121
Neg logit contributions
 mental-0.173
rily-0.166
cks-0.159
bits-0.157
bit-0.152
Token to
Token ID284
Feature activation-9.64e-03

Pos logit contributions
der+0.005
 tr+0.005
 Lu+0.005
Stud+0.004
Ste+0.004
Neg logit contributions
rily-0.010
 mental-0.010
cks-0.009
bits-0.009
bit-0.009
Token sleep
Token ID3993
Feature activation-1.92e-02

Pos logit contributions
der+0.010
 tr+0.009
 Lu+0.009
Stud+0.009
Ste+0.009
Neg logit contributions
 mental-0.009
rily-0.009
cks-0.008
bits-0.008
bit-0.008
Token.
Token ID13
Feature activation-7.10e-03

Pos logit contributions
der+0.015
 tr+0.013
 Lu+0.013
Ste+0.012
 pian+0.012
Neg logit contributions
 mental-0.023
cks-0.022
rily-0.022
bits-0.021
bit-0.021
Token
Token ID198
Feature activation-1.31e-02

Pos logit contributions
der+0.005
 Lu+0.005
 tr+0.005
Ste+0.005
Stud+0.004
Neg logit contributions
cks-0.009
rily-0.009
 mental-0.008
bits-0.008
bel-0.008
Token stir
Token ID11240
Feature activation-6.61e-02

Pos logit contributions
der+0.002
 tr+0.002
 Lu+0.002
Stud+0.002
 pian+0.002
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.002
bit-0.002
Token,
Token ID11
Feature activation-1.96e-03

Pos logit contributions
der+0.050
 tr+0.045
 Lu+0.044
Ju+0.040
Stud+0.040
Neg logit contributions
 mental-0.079
rily-0.073
bits-0.071
 acquaintance-0.071
cks-0.070
Token stir
Token ID11240
Feature activation-6.58e-02

Pos logit contributions
der+0.002
 tr+0.002
 Lu+0.001
Stud+0.001
 pian+0.001
Neg logit contributions
 mental-0.002
rily-0.002
cks-0.002
bits-0.002
bit-0.002
Token!"
Token ID2474
Feature activation-7.67e-05

Pos logit contributions
der+0.051
 tr+0.046
 Lu+0.044
Stud+0.042
Ju+0.041
Neg logit contributions
 mental-0.078
rily-0.073
bits-0.071
cks-0.070
 acquaintance-0.069
Token Ben
Token ID3932
Feature activation+9.26e-02

Pos logit contributions
der+0.000
Stud+0.000
 tr+0.000
 escal+0.000
 optical+0.000
Neg logit contributions
 mental-0.000
rily-0.000
cks-0.000
bits-0.000
bit-0.000
Token echoed
Token ID22211
Feature activation-1.54e-01

Pos logit contributions
 mental+0.071
rily+0.071
cks+0.067
bits+0.066
bit+0.066
Neg logit contributions
der-0.101
Mad-0.099
 escal-0.098
Stud-0.097
 Lu-0.095
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
der+0.109
 tr+0.097
 Lu+0.096
 pian+0.079
 escal+0.077
Neg logit contributions
 mental-0.203
cks-0.189
bits-0.177
 acquaintance-0.177
bit-0.174
Token "
Token ID366
Feature activation-1.66e-02

Pos logit contributions
 mental+0.013
rily+0.012
cks+0.012
bits+0.012
bit+0.012
Neg logit contributions
der-0.008
 tr-0.007
 Lu-0.007
Stud-0.007
Ste-0.007
TokenWe
Token ID1135
Feature activation+1.67e-02

Pos logit contributions
der+0.014
 tr+0.013
Stud+0.012
 Lu+0.012
Ste+0.012
Neg logit contributions
 mental-0.018
rily-0.018
cks-0.017
bits-0.017
bit-0.016
Token have
Token ID423
Feature activation-5.72e-02

Pos logit contributions
 mental+0.014
bit+0.013
cks+0.012
bits+0.012
rily+0.011
Neg logit contributions
der-0.020
Mad-0.018
Ste-0.018
 Lu-0.018
 escal-0.018
Token to
Token ID284
Feature activation+4.73e-04

Pos logit contributions
der+0.058
 Lu+0.054
Stud+0.053
 tr+0.052
 escal+0.052
Neg logit contributions
 mental-0.052
rily-0.048
cks-0.047
bits-0.046
bit-0.046
Token calling
Token ID4585
Feature activation+5.71e-02

Pos logit contributions
 mental+0.006
rily+0.006
cks+0.006
bits+0.005
bel+0.005
Neg logit contributions
der-0.009
 tr-0.008
Stud-0.008
Ju-0.008
leaf-0.007
Token for
Token ID329
Feature activation-4.26e-02

Pos logit contributions
 mental+0.057
cks+0.055
rily+0.055
bits+0.052
bit+0.051
Neg logit contributions
der-0.054
 tr-0.049
 Lu-0.049
Stud-0.047
 escal-0.046
Token him
Token ID683
Feature activation+1.10e-01

Pos logit contributions
 Lu+0.040
 tr+0.040
der+0.039
 escal+0.037
Stud+0.037
Neg logit contributions
rily-0.040
 mental-0.040
cks-0.038
bel-0.037
bits-0.037
Token!
Token ID0
Feature activation-7.01e-03

Pos logit contributions
cks+0.209
rily+0.201
ulously+0.200
bits+0.195
bit+0.194
Neg logit contributions
 inside-0.029
 about-0.025
 tr-0.025
 just-0.023
 Lu-0.019
Token The
Token ID383
Feature activation-2.99e-03

Pos logit contributions
der+0.008
 pian+0.007
leaf+0.007
Stud+0.007
 optical+0.007
Neg logit contributions
 mental-0.006
rily-0.006
 Day-0.005
bits-0.005
cks-0.005
Token p
Token ID279
Feature activation-1.70e-01

Pos logit contributions
der+0.001
 tr+0.001
 Lu+0.001
Stud+0.001
Ste+0.001
Neg logit contributions
 mental-0.005
rily-0.005
cks-0.004
bits-0.004
bit-0.004
Tokenanda
Token ID5282
Feature activation+5.07e-02

Pos logit contributions
der+0.250
Bon+0.235
Stud+0.234
Ste+0.232
leaf+0.232
Neg logit contributions
am-0.077
bel-0.075
 mental-0.071
cks-0.071
rily-0.066
Token zoom
Token ID19792
Feature activation+4.97e-02

Pos logit contributions
 mental+0.056
rily+0.054
cks+0.052
bits+0.051
bit+0.049
Neg logit contributions
der-0.044
 tr-0.040
 Lu-0.039
Stud-0.038
Ste-0.037
Tokened
Token ID276
Feature activation+3.65e-02

Pos logit contributions
 mental+0.061
cks+0.055
rily+0.054
 acquaintance+0.052
bits+0.050
Neg logit contributions
der-0.042
 tr-0.041
 Lu-0.037
 escal-0.034
 pian-0.034
Token to
Token ID284
Feature activation-3.89e-03

Pos logit contributions
 mental+0.046
cks+0.044
bit+0.042
rily+0.042
 acquaintance+0.042
Neg logit contributions
 tr-0.027
der-0.025
 Lu-0.025
 escal-0.022
 fog-0.021
Token his
Token ID465
Feature activation-2.74e-02

Pos logit contributions
der+0.004
 tr+0.003
 Lu+0.003
Stud+0.003
Ste+0.003
Neg logit contributions
 mental-0.004
rily-0.004
cks-0.003
bits-0.003
bit-0.003
Tokenfl
Token ID2704
Feature activation-6.11e-02

Pos logit contributions
der+0.049
 tr+0.041
 Lu+0.039
 pian+0.036
Stud+0.036
Neg logit contributions
 mental-0.080
rily-0.072
cks-0.071
bits-0.070
 acquaintance-0.067
Tokenowers
Token ID3618
Feature activation-2.10e-02

Pos logit contributions
der+0.055
 tr+0.049
 Lu+0.048
Stud+0.048
Ste+0.047
Neg logit contributions
 mental-0.065
rily-0.062
cks-0.059
bits-0.059
bit-0.057
Token.
Token ID13
Feature activation-3.83e-04

Pos logit contributions
der+0.017
 tr+0.015
 Lu+0.014
Stud+0.014
Ste+0.014
Neg logit contributions
 mental-0.025
rily-0.023
cks-0.022
bits-0.022
bit-0.021
Token Eventually
Token ID16178
Feature activation+1.38e-01

Pos logit contributions
der+0.000
 Lu+0.000
 tr+0.000
Ste+0.000
Stud+0.000
Neg logit contributions
 mental-0.000
cks-0.000
rily-0.000
bits-0.000
bel-0.000
Token the
Token ID262
Feature activation-1.73e-02

Pos logit contributions
 mental+0.170
rily+0.163
cks+0.157
bits+0.157
bit+0.150
Neg logit contributions
der-0.103
 tr-0.094
 Lu-0.093
Stud-0.085
Ste-0.085
Token gard
Token ID16985
Feature activation-1.53e-01

Pos logit contributions
 tr+0.012
der+0.012
 Lu+0.010
 fog+0.010
 archae+0.010
Neg logit contributions
 mental-0.022
cks-0.022
rily-0.022
bits-0.021
bit-0.019
Tokenener
Token ID877
Feature activation+4.65e-02

Pos logit contributions
der+0.142
 archae+0.101
arre+0.094
 Lu+0.091
leaf+0.089
Neg logit contributions
 mental-0.195
cks-0.178
bits-0.162
bit-0.158
 acquaintance-0.155
Token came
Token ID1625
Feature activation-1.94e-02

Pos logit contributions
 mental+0.060
rily+0.059
bits+0.057
cks+0.056
 acquaintance+0.052
Neg logit contributions
der-0.033
 tr-0.031
 r-0.027
 Lu-0.027
arre-0.026
Token back
Token ID736
Feature activation-3.91e-02

Pos logit contributions
der+0.012
 tr+0.011
 Lu+0.011
Ste+0.010
 sub+0.010
Neg logit contributions
 mental-0.025
cks-0.024
bits-0.023
rily-0.023
bit-0.023
Token with
Token ID351
Feature activation+2.12e-02

Pos logit contributions
der+0.030
 tr+0.027
 Lu+0.027
Ste+0.025
Stud+0.025
Neg logit contributions
 mental-0.046
rily-0.044
cks-0.043
bits-0.043
bit-0.042
Token lots
Token ID6041
Feature activation-1.45e-03

Pos logit contributions
 mental+0.031
rily+0.031
anks+0.029
cks+0.029
bel+0.029
Neg logit contributions
 fog-0.008
 lightning-0.008
 tr-0.008
der-0.007
 Lu-0.007