TOP ACTIVATIONS
MAX = 0.160

said Sarah ’ s mom .
ily said , " B run o , you are a
said Ali ’ s mom .
day , Sam ’ s Mom said to him
Yes , that ’ s it . You guessed
you ?" B run o said , " Of
day , Jane ’ s mom my gave her
Baby ’ s eyes lit up .
the little girl ’ s l imes were dry
Lu cy ’ s mother took her to
Yes , that ’ s rob in â
Yes , that ’ s exactly right . See
day , Lucy ’ s dad took her and
M olly ’ s mom filled up a
, the girl ’ s mom my gave her
don ’ t worry ! I will
Yes , that ’ s a great idea .
day , Rachel ’ s mom came to play
â Sam ’ s mum said .
day , Tim ’ s mom gave him a

MIN ACTIVATIONS
MIN = -0.141

was the brightest it could possibly be . Ab bi had
bigger until it filled the entire box . It was the
loud that it filled the entire house ! Everyone cl apped
big that it filled the entire garden . But something strange
Some people live on y ach ts and travel around the
shapes and colors filled the me adow , and it was
until he had filled the entire sky with a mystery .
big fuzzy blanket covering the entire park ! Ell a was
wide that it filled the entire room . Every day ,
big it almost covered her entire face . She even shouted
The giant had filled the entire castle with colorful toys and
's house was built long ago . L ily
it seemed to fill the entire room . They were having
, until it covered the entire sky . The
These toys will make the children happy !" The man was
where they will make many children happy . I hope you
something that would make M andy feel better .
bit of water to make o at soup . She wanted
, boy ! Leave those kids alone !" The
For breakfast the board made o atmeal with fresh strawberries .

INTERVAL 0.085 - 0.160
CONTAINS 0.451%

but they did not know where it was . They said
she had shown her mother how much she loved her ,
him inside and showed him how to be more careful when
told her that this was where she worked . Lily was
much it had grown and how rich it looked . She

INTERVAL 0.009 - 0.085
CONTAINS 31.072%

her . <|endoftext|> Once upon a time , there was a
went inside feeling scared and her mom gave her a big
help . " Please , help me remove the trap from
she did a great job . Lily was happy and relieved
the chain was swinging really high . Bob smiled and laughed

INTERVAL -0.066 - 0.009
CONTAINS 68.038%

case and parked the car . She climbed out and looked
help you find your toy ." Tim and the
head was spinning . John 's Mom saw him
what it was and where it was going . They decided
was too late . They didn 't make it out of

INTERVAL -0.141 - -0.066
CONTAINS 0.438%

friends . The sun was always shining , and the pony
of trees and Tommy could see a laser in the sky
noticed something rare in the ocean . It was a special
so lovely . I will be very happy if you give
dry . He had seen children try to climb it before
Token
Token ID26391
Feature activation-1.55e-02

Pos logit contributions
iece+0.044
 myself+0.043
aming+0.039
Ta+0.038
CR+0.036
Neg logit contributions
 tight-0.058
ucker-0.058
 understands-0.057
 fault-0.057
 stern-0.055
Token said
Token ID531
Feature activation-1.00e-02

Pos logit contributions
 myself+0.025
iece+0.025
CR+0.023
aming+0.023
oy+0.023
Neg logit contributions
 tight-0.018
 fault-0.018
 understands-0.017
ucker-0.017
 stern-0.017
Token Sarah
Token ID10490
Feature activation+5.37e-02

Pos logit contributions
 myself+0.017
iece+0.016
CR+0.015
aming+0.014
th+0.014
Neg logit contributions
ucker-0.012
 fault-0.012
 understands-0.012
 tight-0.011
run-0.010
Tokenâ
Token ID22940
Feature activation-1.11e-02

Pos logit contributions
arl+0.070
ucker+0.069
 fault+0.065
icks+0.060
 shrug+0.060
Neg logit contributions
 myself-0.079
iece-0.072
CR-0.069
th-0.068
aming-0.067
Token
Token ID26391
Feature activation+1.61e-02

Pos logit contributions
iece+0.013
 myself+0.013
aming+0.011
Ta+0.011
Now+0.010
Neg logit contributions
ucker-0.019
 fault-0.019
 tight-0.019
 understands-0.019
run-0.018
Token
Token ID8151
Feature activation+1.60e-01

Pos logit contributions
 tight+0.019
 fault+0.019
ucker+0.019
 understands+0.019
 vain+0.018
Neg logit contributions
 myself-0.025
iece-0.024
 seek-0.024
CR-0.023
oy-0.023
Tokens
Token ID82
Feature activation+3.10e-02

Pos logit contributions
 tight+0.227
 shrug+0.225
 understands+0.225
ucker+0.223
 stern+0.217
Neg logit contributions
 myself-0.207
iece-0.206
aming-0.185
CR-0.184
mmmm-0.184
Token mom
Token ID1995
Feature activation+3.98e-02

Pos logit contributions
ucker+0.031
 tight+0.029
 understands+0.029
run+0.028
 fault+0.027
Neg logit contributions
 myself-0.056
iece-0.054
aming-0.049
 endings-0.049
CR-0.048
Token.
Token ID13
Feature activation+2.21e-02

Pos logit contributions
ucker+0.048
arl+0.047
 fault+0.047
 tight+0.045
 lot+0.044
Neg logit contributions
 myself-0.061
iece-0.058
aming-0.054
oy-0.053
th-0.053
Token
Token ID6184
Feature activation-4.01e-03

Pos logit contributions
 fault+0.022
ucker+0.022
 understands+0.022
 tight+0.022
 stern+0.020
Neg logit contributions
 myself-0.040
iece-0.038
aming-0.036
CR-0.036
Ta-0.035
Token
Token ID95
Feature activation-4.19e-02

Pos logit contributions
 myself+0.006
iece+0.006
aming+0.006
CR+0.005
mmm+0.005
Neg logit contributions
 fault-0.006
 understands-0.005
ucker-0.005
 tight-0.005
 stern-0.005
Tokenily
Token ID813
Feature activation-3.09e-02

Pos logit contributions
iece+0.007
 nights+0.006
light+0.006
 beaches+0.006
 myself+0.006
Neg logit contributions
arl-0.010
run-0.009
 understands-0.009
 fault-0.009
ucker-0.009
Token said
Token ID531
Feature activation-1.67e-02

Pos logit contributions
iece+0.057
 myself+0.056
Ta+0.052
aming+0.052
CR+0.051
Neg logit contributions
 tight-0.032
 understands-0.031
ucker-0.030
 stern-0.028
 fault-0.028
Token,
Token ID11
Feature activation+2.70e-02

Pos logit contributions
 myself+0.028
iece+0.028
oy+0.025
CR+0.024
 seek+0.024
Neg logit contributions
ucker-0.020
 fault-0.018
 tight-0.018
run-0.018
 understands-0.018
Token "
Token ID366
Feature activation+2.35e-02

Pos logit contributions
 fault+0.025
ucker+0.023
 vain+0.020
 understands+0.019
run+0.017
Neg logit contributions
iece-0.055
 myself-0.055
Now-0.051
mmm-0.050
Next-0.050
TokenB
Token ID33
Feature activation-6.98e-03

Pos logit contributions
 tight+0.030
 understands+0.029
 "+0.028
ucker+0.028
 +0.028
Neg logit contributions
iece-0.036
 myself-0.034
aming-0.032
oy-0.031
CR-0.031
Tokenrun
Token ID5143
Feature activation+1.60e-01

Pos logit contributions
 myself+0.011
iece+0.011
aming+0.010
Ta+0.010
CR+0.010
Neg logit contributions
 tight-0.008
 understands-0.008
ucker-0.008
 stern-0.008
 fault-0.008
Tokeno
Token ID78
Feature activation+2.17e-02

Pos logit contributions
 tight+0.191
ucker+0.189
 fault+0.184
 stern+0.181
 understands+0.181
Neg logit contributions
 myself-0.255
iece-0.255
aming-0.235
Ta-0.234
Now-0.222
Token,
Token ID11
Feature activation-1.83e-03

Pos logit contributions
 fault+0.028
run+0.027
ucker+0.027
arl+0.026
 understands+0.026
Neg logit contributions
 myself-0.034
iece-0.030
oy-0.030
aming-0.029
 seek-0.028
Token you
Token ID345
Feature activation-1.66e-02

Pos logit contributions
iece+0.002
 myself+0.002
aming+0.002
Ta+0.002
CR+0.002
Neg logit contributions
 tight-0.003
 understands-0.003
ucker-0.003
 fault-0.003
 "-0.003
Token are
Token ID389
Feature activation+1.52e-02

Pos logit contributions
iece+0.031
 myself+0.031
aming+0.029
Ta+0.029
CR+0.027
Neg logit contributions
 tight-0.016
ucker-0.016
 understands-0.015
 fault-0.015
 stern-0.015
Token a
Token ID257
Feature activation+9.91e-03

Pos logit contributions
 tight+0.017
 understands+0.016
ucker+0.015
 fault+0.015
 stern+0.015
Neg logit contributions
iece-0.027
 myself-0.026
Ta-0.025
aming-0.024
CR-0.024
Token
Token ID26391
Feature activation-1.15e-02

Pos logit contributions
iece+0.043
 myself+0.041
aming+0.038
Ta+0.037
CR+0.035
Neg logit contributions
 tight-0.052
ucker-0.051
 understands-0.050
 fault-0.049
 stern-0.049
Token said
Token ID531
Feature activation-1.26e-02

Pos logit contributions
iece+0.019
 myself+0.019
aming+0.018
Ta+0.017
CR+0.017
Neg logit contributions
 tight-0.013
ucker-0.013
 fault-0.012
 understands-0.012
run-0.012
Token Ali
Token ID12104
Feature activation+3.92e-02

Pos logit contributions
 myself+0.023
iece+0.022
CR+0.020
oy+0.019
aming+0.019
Neg logit contributions
ucker-0.014
 understands-0.013
 fault-0.013
 tight-0.012
 vain-0.012
Tokenâ
Token ID22940
Feature activation-1.41e-02

Pos logit contributions
ucker+0.051
arl+0.045
 understands+0.044
 fault+0.043
 lot+0.041
Neg logit contributions
 myself-0.064
iece-0.062
 later-0.054
oy-0.054
th-0.053
Token
Token ID26391
Feature activation+1.59e-02

Pos logit contributions
iece+0.018
 myself+0.017
aming+0.016
Ta+0.015
CR+0.014
Neg logit contributions
ucker-0.022
 tight-0.022
 fault-0.022
 understands-0.022
 stern-0.021
Token
Token ID8151
Feature activation+1.59e-01

Pos logit contributions
ucker+0.020
 tight+0.020
 fault+0.019
 understands+0.019
arl+0.018
Neg logit contributions
iece-0.025
 myself-0.025
aming-0.023
 seek-0.023
CR-0.022
Tokens
Token ID82
Feature activation+1.64e-02

Pos logit contributions
ucker+0.239
 shrug+0.237
 understands+0.237
 tight+0.229
 stern+0.223
Neg logit contributions
 myself-0.198
iece-0.193
aming-0.174
CR-0.171
mmmm-0.168
Token mom
Token ID1995
Feature activation+3.75e-02

Pos logit contributions
ucker+0.015
 understands+0.012
run+0.012
ip+0.012
burn+0.011
Neg logit contributions
 myself-0.033
iece-0.031
 endings-0.029
aming-0.028
 seek-0.028
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
ucker+0.050
 understands+0.048
 fault+0.046
 lot+0.046
 stern+0.045
Neg logit contributions
 myself-0.057
iece-0.056
aming-0.049
 later-0.048
oy-0.046
Token
Token ID220
Feature activation+7.23e-02

Pos logit contributions
 fault+0.014
ucker+0.014
 tight+0.014
 understands+0.013
 stern+0.013
Neg logit contributions
 myself-0.027
iece-0.027
aming-0.024
Ta-0.024
CR-0.024
Token
Token ID198
Feature activation-8.24e-03

Pos logit contributions
 tight+0.076
ucker+0.074
 understands+0.073
 fault+0.072
 stern+0.072
Neg logit contributions
 myself-0.126
iece-0.126
aming-0.116
Ta-0.113
CR-0.112
Token day
Token ID1110
Feature activation-7.01e-03

Pos logit contributions
 tight+0.002
 understands+0.002
ucker+0.002
 "+0.002
 fault+0.002
Neg logit contributions
iece-0.003
 myself-0.003
Ta-0.003
aming-0.003
 endings-0.003
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
iece+0.017
 myself+0.016
Ta+0.016
aming+0.015
ands+0.015
Neg logit contributions
 tight-0.004
 "-0.004
 stern-0.004
 understands-0.003
 -0.003
Token Sam
Token ID3409
Feature activation+2.34e-02

Pos logit contributions
 tight+0.013
 "+0.013
 +0.011
 lot+0.011
 stern+0.010
Neg logit contributions
iece-0.023
ands-0.021
 myself-0.021
Ta-0.020
 nights-0.020
Tokenâ
Token ID22940
Feature activation-2.71e-02

Pos logit contributions
 tight+0.018
 would+0.017
 understands+0.016
 "+0.015
 will+0.015
Neg logit contributions
iece-0.052
 myself-0.051
mmmm-0.049
Ta-0.048
mmm-0.048
Token
Token ID26391
Feature activation-3.18e-04

Pos logit contributions
iece+0.036
 myself+0.035
 endings+0.034
 pirates+0.034
bees+0.033
Neg logit contributions
 tight-0.038
 -0.036
arl-0.035
 understands-0.034
ucker-0.034
Token
Token ID8151
Feature activation+1.57e-01

Pos logit contributions
iece+0.001
 myself+0.000
aming+0.000
oy+0.000
Ta+0.000
Neg logit contributions
 fault-0.000
 tight-0.000
ucker-0.000
 understands-0.000
 stern-0.000
Tokens
Token ID82
Feature activation+1.51e-02

Pos logit contributions
ucker+0.188
 understands+0.185
 tight+0.183
 shrug+0.180
 stern+0.178
Neg logit contributions
iece-0.252
 myself-0.238
aming-0.229
CR-0.218
Ta-0.210
Token Mom
Token ID11254
Feature activation+2.07e-02

Pos logit contributions
 tight+0.015
ucker+0.015
 understands+0.014
 fault+0.014
 stern+0.014
Neg logit contributions
iece-0.028
 myself-0.027
aming-0.026
Ta-0.026
CR-0.025
Token said
Token ID531
Feature activation-4.47e-03

Pos logit contributions
 fault+0.026
ucker+0.025
arl+0.024
 stern+0.024
 tight+0.024
Neg logit contributions
 myself-0.032
aming-0.032
iece-0.031
Now-0.029
Ta-0.029
Token to
Token ID284
Feature activation+3.75e-02

Pos logit contributions
iece+0.008
 myself+0.008
aming+0.008
Ta+0.008
CR+0.008
Neg logit contributions
 tight-0.004
ucker-0.004
 understands-0.004
 fault-0.004
run-0.004
Token him
Token ID683
Feature activation+1.52e-02

Pos logit contributions
 tight+0.039
ucker+0.038
 understands+0.037
 fault+0.037
 stern+0.036
Neg logit contributions
iece-0.068
 myself-0.066
aming-0.062
Ta-0.062
CR-0.059
TokenYes
Token ID5297
Feature activation+4.23e-02

Pos logit contributions
 tight+0.020
 understands+0.019
ucker+0.019
 fault+0.019
 "+0.019
Neg logit contributions
iece-0.020
 myself-0.019
aming-0.018
Ta-0.017
 endings-0.016
Token,
Token ID11
Feature activation+2.41e-02

Pos logit contributions
ucker+0.054
 fault+0.054
 understands+0.052
 tight+0.049
ometer+0.048
Neg logit contributions
 myself-0.067
iece-0.063
th-0.058
mmm-0.058
oy-0.058
Token that
Token ID326
Feature activation+5.90e-02

Pos logit contributions
 tight+0.031
ucker+0.029
 understands+0.029
 fault+0.028
run+0.028
Neg logit contributions
iece-0.038
 myself-0.037
aming-0.035
Ta-0.034
CR-0.033
Tokenâ
Token ID22940
Feature activation-2.20e-02

Pos logit contributions
ucker+0.060
 tight+0.059
 fault+0.057
arl+0.056
 understands+0.056
Neg logit contributions
 myself-0.106
iece-0.105
aming-0.096
Ta-0.094
 seek-0.093
Token
Token ID26391
Feature activation-1.18e-03

Pos logit contributions
iece+0.026
 myself+0.026
aming+0.023
Ta+0.023
CR+0.021
Neg logit contributions
 tight-0.036
ucker-0.036
 fault-0.036
 understands-0.036
run-0.035
Token
Token ID8151
Feature activation+1.56e-01

Pos logit contributions
 myself+0.002
iece+0.002
CR+0.002
aming+0.002
mmm+0.002
Neg logit contributions
 tight-0.001
 fault-0.001
ucker-0.001
 understands-0.001
 stern-0.001
Tokens
Token ID82
Feature activation+2.45e-02

Pos logit contributions
 tight+0.169
 shrug+0.167
 understands+0.165
 stern+0.165
 fault+0.163
Neg logit contributions
 myself-0.253
iece-0.244
CR-0.232
mmmm-0.231
mmm-0.229
Token it
Token ID340
Feature activation+3.19e-02

Pos logit contributions
ucker+0.031
 understands+0.028
 fault+0.028
 tight+0.028
arl+0.028
Neg logit contributions
 myself-0.041
iece-0.038
aming-0.036
oy-0.034
 seek-0.033
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 tight+0.034
ucker+0.033
 understands+0.033
 fault+0.032
 stern+0.031
Neg logit contributions
iece-0.056
 myself-0.055
aming-0.052
Ta-0.051
CR-0.049
Token You
Token ID921
Feature activation-1.21e-02

Pos logit contributions
 tight+0.012
ucker+0.012
 understands+0.011
 fault+0.011
 stern+0.011
Neg logit contributions
iece-0.019
 myself-0.018
aming-0.017
Ta-0.017
CR-0.016
Token guessed
Token ID25183
Feature activation-4.09e-02

Pos logit contributions
iece+0.024
 myself+0.024
aming+0.022
Ta+0.022
CR+0.022
Neg logit contributions
 tight-0.010
ucker-0.010
 understands-0.010
 fault-0.010
 stern-0.009
Token you
Token ID345
Feature activation-5.28e-02

Pos logit contributions
 tight+0.029
ucker+0.028
 understands+0.028
 fault+0.028
 stern+0.027
Neg logit contributions
iece-0.017
 myself-0.016
aming-0.014
Ta-0.014
CR-0.013
Token?"
Token ID1701
Feature activation-2.49e-02

Pos logit contributions
iece+0.082
 myself+0.081
aming+0.075
Ta+0.074
CR+0.071
Neg logit contributions
 tight-0.067
ucker-0.066
 understands-0.065
 fault-0.065
 stern-0.063
Token
Token ID198
Feature activation-5.33e-03

Pos logit contributions
 myself+0.029
iece+0.029
Next+0.028
Now+0.028
Ta+0.027
Neg logit contributions
 fault-0.040
 vain-0.038
ucker-0.037
 understands-0.036
arl-0.035
Token
Token ID198
Feature activation-5.03e-03

Pos logit contributions
 myself+0.008
iece+0.008
aming+0.008
Ta+0.008
Now+0.008
Neg logit contributions
 fault-0.007
ucker-0.006
 understands-0.006
 vain-0.006
 tight-0.006
TokenB
Token ID33
Feature activation+9.14e-03

Pos logit contributions
iece+0.007
 myself+0.006
aming+0.005
 endings+0.005
Ta+0.005
Neg logit contributions
 tight-0.008
ucker-0.008
 stern-0.008
 understands-0.008
 "-0.008
Tokenrun
Token ID5143
Feature activation+1.56e-01

Pos logit contributions
 tight+0.012
ucker+0.012
 understands+0.012
 fault+0.012
 stern+0.011
Neg logit contributions
iece-0.014
 myself-0.013
aming-0.012
Ta-0.012
CR-0.012
Tokeno
Token ID78
Feature activation+3.10e-02

Pos logit contributions
 tight+0.191
ucker+0.188
 fault+0.185
 understands+0.184
 stern+0.180
Neg logit contributions
iece-0.247
 myself-0.244
aming-0.227
Ta-0.224
CR-0.215
Token said
Token ID531
Feature activation-1.31e-02

Pos logit contributions
 tight+0.031
 understands+0.030
ucker+0.030
 stern+0.028
 fault+0.028
Neg logit contributions
iece-0.058
 myself-0.057
aming-0.053
Ta-0.053
CR-0.051
Token,
Token ID11
Feature activation+2.47e-02

Pos logit contributions
iece+0.019
 myself+0.019
 seek+0.017
oy+0.017
 later+0.016
Neg logit contributions
ucker-0.019
 fault-0.018
 understands-0.017
arl-0.017
 vain-0.017
Token "
Token ID366
Feature activation+2.63e-02

Pos logit contributions
 fault+0.027
ucker+0.024
uffy+0.021
 vain+0.020
 understands+0.019
Neg logit contributions
iece-0.050
 myself-0.048
Now-0.044
Next-0.044
 seek-0.044
TokenOf
Token ID5189
Feature activation+5.98e-02

Pos logit contributions
 tight+0.025
 understands+0.022
 will+0.021
 "+0.021
ucker+0.021
Neg logit contributions
iece-0.053
aming-0.049
 myself-0.048
 nights-0.046
oy-0.045
Token day
Token ID1110
Feature activation-2.82e-02

Pos logit contributions
iece+0.037
 myself+0.035
Ta+0.033
aming+0.032
 endings+0.032
Neg logit contributions
 tight-0.023
 lot-0.020
 understands-0.019
 stern-0.019
 "-0.019
Token,
Token ID11
Feature activation+3.91e-03

Pos logit contributions
iece+0.068
 myself+0.066
Ta+0.064
aming+0.063
ands+0.061
Neg logit contributions
 "-0.016
 tight-0.016
 stern-0.014
 lot-0.014
 -0.013
Token Jane
Token ID12091
Feature activation+6.33e-03

Pos logit contributions
 tight+0.004
 "+0.003
 lot+0.003
 Spot+0.003
 +0.003
Neg logit contributions
iece-0.008
aming-0.008
Ta-0.008
ands-0.008
 myself-0.008
Tokenâ
Token ID22940
Feature activation-2.66e-02

Pos logit contributions
 understands+0.006
 tight+0.006
 would+0.006
 will+0.006
ucker+0.005
Neg logit contributions
 myself-0.012
iece-0.012
Ta-0.012
mmmm-0.011
ands-0.011
Token
Token ID26391
Feature activation-5.24e-03

Pos logit contributions
iece+0.037
 myself+0.037
 endings+0.035
 pirates+0.034
 nights+0.034
Neg logit contributions
 tight-0.034
 -0.033
ucker-0.033
 understands-0.032
 Together-0.032
Token
Token ID8151
Feature activation+1.55e-01

Pos logit contributions
iece+0.009
 myself+0.009
aming+0.009
CR+0.008
Ta+0.008
Neg logit contributions
 tight-0.006
 fault-0.005
ucker-0.005
 understands-0.005
 stern-0.005
Tokens
Token ID82
Feature activation+1.60e-02

Pos logit contributions
 tight+0.205
ucker+0.197
 stern+0.196
 shrug+0.196
 understands+0.194
Neg logit contributions
iece-0.231
 myself-0.212
aming-0.208
CR-0.196
Ta-0.192
Token mom
Token ID1995
Feature activation+2.43e-02

Pos logit contributions
 tight+0.015
ucker+0.015
 understands+0.014
 fault+0.014
run+0.014
Neg logit contributions
iece-0.030
 myself-0.030
aming-0.028
Ta-0.028
CR-0.027
Tokenmy
Token ID1820
Feature activation+1.30e-02

Pos logit contributions
 tight+0.028
 fault+0.027
ucker+0.027
 understands+0.026
run+0.026
Neg logit contributions
iece-0.039
 myself-0.039
aming-0.038
Ta-0.036
CR-0.035
Token gave
Token ID2921
Feature activation+3.58e-02

Pos logit contributions
 tight+0.015
 fault+0.015
ucker+0.015
run+0.014
 understands+0.014
Neg logit contributions
iece-0.021
 myself-0.021
aming-0.020
Ta-0.019
CR-0.019
Token her
Token ID607
Feature activation+1.58e-02

Pos logit contributions
 tight+0.045
ucker+0.044
 understands+0.044
 fault+0.043
 stern+0.042
Neg logit contributions
iece-0.057
 myself-0.055
aming-0.051
Ta-0.051
CR-0.049
Token
Token ID198
Feature activation-1.37e-02

Pos logit contributions
iece+0.005
 myself+0.005
aming+0.005
Ta+0.005
CR+0.005
Neg logit contributions
 tight-0.003
ucker-0.003
 understands-0.003
 fault-0.003
 stern-0.003
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
iece+0.032
 myself+0.029
Ta+0.029
aming+0.029
bees+0.029
Neg logit contributions
 -0.009
 tight-0.008
 "-0.008
run-0.007
 understands-0.007
TokenBaby
Token ID36534
Feature activation-1.48e-02

Pos logit contributions
iece+0.021
 seaw+0.018
 nights+0.018
 endings+0.017
bees+0.017
Neg logit contributions
 -0.020
 tight-0.019
 "-0.018
run-0.018
 stern-0.017
Tokenâ
Token ID22940
Feature activation-2.61e-02

Pos logit contributions
iece+0.025
 myself+0.024
aming+0.023
Ta+0.022
CR+0.022
Neg logit contributions
 tight-0.017
ucker-0.017
 fault-0.017
 understands-0.016
run-0.016
Token
Token ID26391
Feature activation-8.81e-04

Pos logit contributions
iece+0.035
 myself+0.034
aming+0.031
Ta+0.031
 endings+0.030
Neg logit contributions
 tight-0.038
ucker-0.037
 understands-0.037
 fault-0.036
 stern-0.035
Token
Token ID8151
Feature activation+1.55e-01

Pos logit contributions
iece+0.002
 myself+0.002
aming+0.001
oy+0.001
CR+0.001
Neg logit contributions
 tight-0.001
 fault-0.001
 understands-0.001
ucker-0.001
 stern-0.001
Tokens
Token ID82
Feature activation+9.55e-03

Pos logit contributions
 tight+0.187
 shrug+0.183
 stern+0.177
 understands+0.173
ucker+0.173
Neg logit contributions
iece-0.241
 myself-0.228
aming-0.219
Now-0.210
mmmm-0.208
Token eyes
Token ID2951
Feature activation+5.65e-02

Pos logit contributions
ucker+0.009
 tight+0.009
 understands+0.009
 fault+0.009
run+0.008
Neg logit contributions
iece-0.017
 myself-0.017
aming-0.016
Ta-0.016
 endings-0.015
Token lit
Token ID6578
Feature activation-2.48e-02

Pos logit contributions
 tight+0.072
ucker+0.072
 fault+0.071
 understands+0.071
run+0.068
Neg logit contributions
iece-0.086
 myself-0.084
aming-0.079
Ta-0.077
CR-0.073
Token up
Token ID510
Feature activation+3.16e-02

Pos logit contributions
iece+0.050
 myself+0.049
Ta+0.047
aming+0.046
CR+0.045
Neg logit contributions
 tight-0.021
ucker-0.019
 understands-0.019
 fault-0.019
 stern-0.018
Token.
Token ID13
Feature activation-1.51e-03

Pos logit contributions
 tight+0.033
 stern+0.029
ucker+0.029
 "+0.029
 understands+0.028
Neg logit contributions
iece-0.059
 myself-0.057
Ta-0.056
aming-0.054
CR-0.052
Token the
Token ID262
Feature activation+3.73e-02

Pos logit contributions
 tight+0.008
 "+0.008
 stern+0.007
 understands+0.007
 fault+0.007
Neg logit contributions
iece-0.015
 myself-0.014
Ta-0.014
aming-0.014
ands-0.013
Token little
Token ID1310
Feature activation-4.98e-03

Pos logit contributions
 tight+0.042
 fault+0.039
 understands+0.039
ucker+0.038
 stern+0.038
Neg logit contributions
iece-0.066
 myself-0.064
aming-0.060
Ta-0.059
CR-0.057
Token girl
Token ID2576
Feature activation-1.14e-02

Pos logit contributions
iece+0.009
 myself+0.008
aming+0.008
Ta+0.008
CR+0.008
Neg logit contributions
 tight-0.006
ucker-0.005
 understands-0.005
 fault-0.005
 stern-0.005
Tokenâ
Token ID22940
Feature activation-2.42e-02

Pos logit contributions
iece+0.019
 myself+0.019
aming+0.018
Ta+0.017
CR+0.017
Neg logit contributions
 tight-0.013
ucker-0.013
 fault-0.012
 understands-0.012
run-0.012
Token
Token ID26391
Feature activation+6.02e-03

Pos logit contributions
iece+0.032
 myself+0.031
aming+0.029
Ta+0.028
CR+0.027
Neg logit contributions
 tight-0.036
ucker-0.036
 understands-0.035
 fault-0.035
 stern-0.034
Token
Token ID8151
Feature activation+1.55e-01

Pos logit contributions
 fault+0.007
 tight+0.007
 understands+0.007
arl+0.007
ucker+0.006
Neg logit contributions
 myself-0.010
iece-0.010
aming-0.009
 seek-0.009
CR-0.009
Tokens
Token ID82
Feature activation+1.24e-02

Pos logit contributions
 tight+0.198
 shrug+0.193
 sho+0.186
 stern+0.186
 cane+0.185
Neg logit contributions
iece-0.224
 myself-0.212
aming-0.208
CR-0.197
mmmm-0.194
Token l
Token ID300
Feature activation+4.92e-02

Pos logit contributions
ucker+0.013
 tight+0.013
 fault+0.012
 understands+0.012
run+0.012
Neg logit contributions
 myself-0.022
iece-0.021
aming-0.020
Ta-0.019
 endings-0.019
Tokenimes
Token ID999
Feature activation+3.84e-02

Pos logit contributions
 tight+0.063
 fault+0.061
ucker+0.060
 understands+0.057
 "+0.057
Neg logit contributions
 myself-0.077
iece-0.077
aming-0.073
Ta-0.072
CR-0.067
Token were
Token ID547
Feature activation-2.08e-02

Pos logit contributions
ucker+0.044
 fault+0.043
 tight+0.042
 understands+0.041
 stern+0.039
Neg logit contributions
 myself-0.064
iece-0.064
aming-0.060
Ta-0.057
CR-0.056
Token dry
Token ID5894
Feature activation-1.91e-02

Pos logit contributions
 myself+0.034
iece+0.033
aming+0.032
 seek+0.029
Ta+0.029
Neg logit contributions
ucker-0.024
 understands-0.024
 fault-0.024
 tight-0.023
arl-0.023
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
 tight+0.081
ucker+0.078
 stern+0.076
 understands+0.076
 fault+0.076
Neg logit contributions
iece-0.129
 myself-0.127
aming-0.117
Ta-0.115
CR-0.115
TokenLu
Token ID25596
Feature activation+7.89e-02

Pos logit contributions
iece+0.020
 endings+0.018
 myself+0.017
 nights+0.017
bees+0.017
Neg logit contributions
 -0.016
 tight-0.015
 "-0.015
run-0.014
 Together-0.013
Tokency
Token ID948
Feature activation+4.24e-03

Pos logit contributions
 fault+0.091
ucker+0.089
 tight+0.087
 understands+0.084
 stern+0.084
Neg logit contributions
iece-0.132
 myself-0.128
aming-0.126
Ta-0.122
CR-0.116
Tokenâ
Token ID22940
Feature activation-2.75e-02

Pos logit contributions
 tight+0.004
 understands+0.004
 would+0.003
 stern+0.003
 "+0.003
Neg logit contributions
iece-0.008
 myself-0.008
Ta-0.008
aming-0.008
mmmm-0.008
Token
Token ID26391
Feature activation-4.14e-03

Pos logit contributions
iece+0.040
 pirates+0.039
 myself+0.039
 endings+0.038
 seaw+0.037
Neg logit contributions
 tight-0.035
 understands-0.033
ucker-0.033
ishly-0.032
 -0.031
Token
Token ID8151
Feature activation+1.54e-01

Pos logit contributions
iece+0.008
 myself+0.007
aming+0.007
CR+0.007
Ta+0.007
Neg logit contributions
 tight-0.004
 fault-0.004
 understands-0.004
ucker-0.004
 stern-0.004
Tokens
Token ID82
Feature activation+1.13e-02

Pos logit contributions
 tight+0.189
 shrug+0.184
ucker+0.179
 stern+0.178
 understands+0.178
Neg logit contributions
iece-0.237
 myself-0.225
aming-0.217
Now-0.205
mmmm-0.204
Token mother
Token ID2802
Feature activation-3.71e-04

Pos logit contributions
 tight+0.011
ucker+0.010
 understands+0.010
 fault+0.010
 stern+0.010
Neg logit contributions
iece-0.021
 myself-0.021
aming-0.020
Ta-0.020
CR-0.019
Token took
Token ID1718
Feature activation-5.46e-03

Pos logit contributions
iece+0.001
 myself+0.001
aming+0.001
Ta+0.001
 behold+0.001
Neg logit contributions
 fault-0.000
ucker-0.000
 tight-0.000
arl-0.000
ometer-0.000
Token her
Token ID607
Feature activation-1.17e-02

Pos logit contributions
iece+0.009
 myself+0.008
aming+0.008
Ta+0.008
CR+0.007
Neg logit contributions
 tight-0.007
ucker-0.007
 understands-0.007
 fault-0.007
 stern-0.006
Token to
Token ID284
Feature activation-2.97e-02

Pos logit contributions
iece+0.022
 myself+0.021
aming+0.020
Ta+0.020
CR+0.019
Neg logit contributions
ucker-0.012
 understands-0.011
 tight-0.011
 fault-0.010
arl-0.010
TokenYes
Token ID5297
Feature activation+4.43e-02

Pos logit contributions
iece+0.001
 myself+0.001
aming+0.001
Ta+0.001
CR+0.001
Neg logit contributions
 tight-0.001
 understands-0.001
ucker-0.001
 fault-0.001
run-0.001
Token,
Token ID11
Feature activation+2.42e-02

Pos logit contributions
ucker+0.054
 fault+0.054
 understands+0.052
 tight+0.050
ometer+0.049
Neg logit contributions
 myself-0.072
iece-0.068
aming-0.063
oy-0.063
mmm-0.063
Token that
Token ID326
Feature activation+5.71e-02

Pos logit contributions
 tight+0.030
 understands+0.028
 "+0.028
ucker+0.028
run+0.028
Neg logit contributions
iece-0.040
 myself-0.038
aming-0.037
Ta-0.036
CR-0.034
Tokenâ
Token ID22940
Feature activation-2.89e-02

Pos logit contributions
 tight+0.054
ucker+0.054
 fault+0.052
 understands+0.051
 stern+0.049
Neg logit contributions
iece-0.106
 myself-0.105
aming-0.098
Ta-0.097
CR-0.094
Token
Token ID26391
Feature activation-1.01e-02

Pos logit contributions
iece+0.034
 myself+0.034
aming+0.030
Ta+0.029
CR+0.028
Neg logit contributions
 fault-0.048
ucker-0.048
 tight-0.048
 understands-0.047
run-0.046
Token
Token ID8151
Feature activation+1.53e-01

Pos logit contributions
 myself+0.016
iece+0.016
CR+0.015
aming+0.015
Ta+0.015
Neg logit contributions
 tight-0.012
ucker-0.012
 fault-0.011
 understands-0.011
 stern-0.011
Tokens
Token ID82
Feature activation+1.61e-02

Pos logit contributions
 tight+0.172
burn+0.169
 fault+0.168
ucker+0.165
 shrug+0.164
Neg logit contributions
 myself-0.241
iece-0.228
CR-0.223
mmmm-0.222
mmm-0.221
Token rob
Token ID3857
Feature activation+1.53e-02

Pos logit contributions
ucker+0.020
 understands+0.019
 tight+0.019
 fault+0.019
 stern+0.018
Neg logit contributions
 myself-0.026
iece-0.025
aming-0.023
oy-0.022
 seek-0.022
Tokenin
Token ID259
Feature activation+2.14e-02

Pos logit contributions
 tight+0.021
ucker+0.021
 fault+0.020
 understands+0.020
 stern+0.020
Neg logit contributions
iece-0.022
 myself-0.022
aming-0.020
Ta-0.020
CR-0.019
Tokenâ
Token ID22940
Feature activation-4.72e-02

Pos logit contributions
 tight+0.022
ucker+0.021
 understands+0.021
 fault+0.021
 stern+0.020
Neg logit contributions
iece-0.039
 myself-0.037
Ta-0.036
aming-0.036
CR-0.034
Token
Token ID26391
Feature activation-1.93e-02

Pos logit contributions
iece+0.059
 myself+0.058
aming+0.053
Ta+0.052
CR+0.049
Neg logit contributions
 tight-0.074
ucker-0.073
 understands-0.072
 fault-0.072
 stern-0.070
TokenYes
Token ID5297
Feature activation+3.73e-02

Pos logit contributions
 tight+0.007
 understands+0.006
 fault+0.006
ucker+0.006
 "+0.006
Neg logit contributions
iece-0.007
 myself-0.007
aming-0.006
Ta-0.006
CR-0.006
Token,
Token ID11
Feature activation+2.32e-02

Pos logit contributions
ucker+0.048
 fault+0.048
 understands+0.047
 tight+0.044
run+0.043
Neg logit contributions
 myself-0.058
iece-0.054
oy-0.050
aming-0.050
mmm-0.050
Token that
Token ID326
Feature activation+5.67e-02

Pos logit contributions
 tight+0.028
ucker+0.027
 understands+0.027
 fault+0.026
 stern+0.026
Neg logit contributions
iece-0.038
 myself-0.037
aming-0.035
Ta-0.034
CR-0.033
Tokenâ
Token ID22940
Feature activation-2.82e-02

Pos logit contributions
ucker+0.058
 tight+0.057
 fault+0.054
arl+0.054
 understands+0.054
Neg logit contributions
iece-0.101
 myself-0.101
aming-0.093
Ta-0.090
CR-0.089
Token
Token ID26391
Feature activation-7.54e-03

Pos logit contributions
iece+0.033
 myself+0.033
aming+0.030
Ta+0.029
CR+0.028
Neg logit contributions
 fault-0.047
ucker-0.047
 tight-0.047
 understands-0.046
run-0.044
Token
Token ID8151
Feature activation+1.53e-01

Pos logit contributions
 myself+0.012
iece+0.012
CR+0.012
oy+0.011
aming+0.011
Neg logit contributions
 tight-0.009
ucker-0.008
 fault-0.008
 understands-0.008
 stern-0.008
Tokens
Token ID82
Feature activation+2.03e-02

Pos logit contributions
 tight+0.168
burn+0.167
 shrug+0.166
ucker+0.162
 sore+0.161
Neg logit contributions
 myself-0.240
iece-0.229
CR-0.220
mmm-0.220
mmmm-0.220
Token exactly
Token ID3446
Feature activation+5.99e-02

Pos logit contributions
ucker+0.026
 understands+0.025
 tight+0.024
arl+0.024
 fault+0.024
Neg logit contributions
 myself-0.032
iece-0.030
aming-0.028
oy-0.026
 seek-0.026
Token right
Token ID826
Feature activation+1.05e-02

Pos logit contributions
ucker+0.077
 tight+0.069
 fault+0.068
ometer+0.068
 understands+0.067
Neg logit contributions
 myself-0.101
iece-0.094
oy-0.085
 seek-0.085
th-0.085
Token.
Token ID13
Feature activation+5.72e-03

Pos logit contributions
ucker+0.014
 understands+0.013
 tight+0.013
 fault+0.013
run+0.012
Neg logit contributions
 myself-0.016
iece-0.016
aming-0.015
oy-0.014
ach-0.014
Token See
Token ID4091
Feature activation-3.34e-02

Pos logit contributions
 tight+0.006
ucker+0.006
 fault+0.006
 understands+0.006
 stern+0.006
Neg logit contributions
 myself-0.010
iece-0.010
aming-0.009
Ta-0.009
CR-0.009
Token day
Token ID1110
Feature activation-1.42e-02

Pos logit contributions
ucker+0.067
run+0.067
 fault+0.066
arl+0.066
 understands+0.063
Neg logit contributions
 myself-0.043
iece-0.042
aming-0.040
 nights-0.039
 seek-0.038
Token,
Token ID11
Feature activation+1.02e-02

Pos logit contributions
iece+0.028
 myself+0.028
aming+0.026
Ta+0.026
CR+0.025
Neg logit contributions
 tight-0.012
ucker-0.012
 understands-0.012
 fault-0.011
 stern-0.011
Token Lucy
Token ID22162
Feature activation+4.97e-02

Pos logit contributions
 tight+0.007
 "+0.006
 understands+0.006
 lot+0.006
 stern+0.005
Neg logit contributions
iece-0.023
 myself-0.022
Ta-0.022
aming-0.021
ands-0.021
Tokenâ
Token ID22940
Feature activation-2.49e-02

Pos logit contributions
 tight+0.052
 understands+0.049
ucker+0.048
 fault+0.045
 would+0.045
Neg logit contributions
 myself-0.092
iece-0.092
Ta-0.085
mmmm-0.083
aming-0.081
Token
Token ID26391
Feature activation-4.78e-03

Pos logit contributions
iece+0.035
 myself+0.034
 endings+0.031
 seaw+0.031
 pirates+0.030
Neg logit contributions
 tight-0.034
ucker-0.033
 understands-0.032
arl-0.032
 fault-0.031
Token
Token ID8151
Feature activation+1.52e-01

Pos logit contributions
iece+0.008
aming+0.008
 myself+0.008
CR+0.007
oy+0.007
Neg logit contributions
 tight-0.005
 fault-0.005
arl-0.005
ucker-0.005
 stern-0.005
Tokens
Token ID82
Feature activation+1.78e-02

Pos logit contributions
ucker+0.184
 understands+0.183
 shrug+0.181
 tight+0.178
 fault+0.176
Neg logit contributions
iece-0.244
 myself-0.228
aming-0.225
CR-0.207
Ta-0.206
Token dad
Token ID9955
Feature activation+1.44e-03

Pos logit contributions
ucker+0.019
 understands+0.017
 fault+0.017
run+0.017
 tight+0.016
Neg logit contributions
iece-0.031
 myself-0.030
aming-0.030
Ta-0.028
 nights-0.028
Token took
Token ID1718
Feature activation-2.02e-03

Pos logit contributions
 fault+0.002
ucker+0.002
 tight+0.002
arl+0.002
 understands+0.002
Neg logit contributions
iece-0.002
 myself-0.002
aming-0.002
Ta-0.002
CR-0.002
Token her
Token ID607
Feature activation-4.92e-03

Pos logit contributions
iece+0.004
 myself+0.003
aming+0.003
Ta+0.003
CR+0.003
Neg logit contributions
 tight-0.002
 understands-0.002
 stern-0.002
 fault-0.002
ucker-0.002
Token and
Token ID290
Feature activation-1.26e-02

Pos logit contributions
iece+0.010
 myself+0.009
aming+0.009
Ta+0.009
 nights+0.008
Neg logit contributions
ucker-0.004
 understands-0.004
 tight-0.004
 fault-0.004
arl-0.004
Token
Token ID198
Feature activation-6.39e-03

Pos logit contributions
iece+0.019
 myself+0.018
aming+0.017
Ta+0.017
 endings+0.016
Neg logit contributions
 tight-0.009
ucker-0.008
 understands-0.008
 fault-0.008
run-0.008
TokenM
Token ID44
Feature activation+3.03e-02

Pos logit contributions
iece+0.010
 endings+0.008
 myself+0.008
 nights+0.008
Ta+0.008
Neg logit contributions
 tight-0.010
run-0.009
 "-0.009
 stern-0.009
 lot-0.009
Tokenolly
Token ID5098
Feature activation-2.45e-02

Pos logit contributions
ucker+0.041
 tight+0.040
 understands+0.039
 fault+0.038
arl+0.037
Neg logit contributions
iece-0.048
 myself-0.045
Ta-0.042
aming-0.042
 seek-0.040
Tokenâ
Token ID22940
Feature activation-2.77e-02

Pos logit contributions
iece+0.046
 myself+0.045
Ta+0.042
aming+0.041
 endings+0.040
Neg logit contributions
 tight-0.026
 understands-0.024
 stern-0.023
ucker-0.023
 "-0.023
Token
Token ID26391
Feature activation-4.53e-03

Pos logit contributions
iece+0.036
 myself+0.035
aming+0.032
Ta+0.031
CR+0.030
Neg logit contributions
 tight-0.043
ucker-0.042
 understands-0.042
 fault-0.042
 stern-0.040
Token
Token ID8151
Feature activation+1.52e-01

Pos logit contributions
iece+0.008
 myself+0.008
aming+0.007
CR+0.007
oy+0.007
Neg logit contributions
 tight-0.005
 fault-0.005
ucker-0.005
 understands-0.005
 stern-0.005
Tokens
Token ID82
Feature activation+1.62e-02

Pos logit contributions
 shrug+0.174
ucker+0.173
 understands+0.172
 tight+0.169
 stern+0.163
Neg logit contributions
iece-0.245
 myself-0.236
aming-0.222
CR-0.212
mmmm-0.210
Token mom
Token ID1995
Feature activation+2.40e-02

Pos logit contributions
ucker+0.016
 fault+0.016
 understands+0.016
 tight+0.015
arl+0.015
Neg logit contributions
 myself-0.027
iece-0.027
aming-0.026
 nights-0.025
th-0.025
Token filled
Token ID5901
Feature activation-3.66e-02

Pos logit contributions
 fault+0.032
 tight+0.031
ucker+0.031
arl+0.030
 understands+0.030
Neg logit contributions
iece-0.035
 myself-0.034
aming-0.033
Ta-0.031
th-0.030
Token up
Token ID510
Feature activation+2.54e-03

Pos logit contributions
iece+0.071
 myself+0.066
Ta+0.065
aming+0.064
CR+0.064
Neg logit contributions
 tight-0.037
ucker-0.033
 stern-0.032
 understands-0.031
 "-0.031
Token a
Token ID257
Feature activation-3.45e-02

Pos logit contributions
 tight+0.003
 stern+0.002
ucker+0.002
 "+0.002
 understands+0.002
Neg logit contributions
iece-0.005
 myself-0.005
Ta-0.004
aming-0.004
CR-0.004
Token,
Token ID11
Feature activation+3.67e-03

Pos logit contributions
iece+0.038
Ta+0.037
 myself+0.037
aming+0.036
mmmm+0.035
Neg logit contributions
 tight-0.015
 "-0.013
 fault-0.013
 understands-0.013
 stern-0.012
Token the
Token ID262
Feature activation+4.40e-02

Pos logit contributions
 tight+0.004
 "+0.004
 +0.003
 stern+0.003
 lot+0.003
Neg logit contributions
iece-0.007
Ta-0.007
aming-0.007
 myself-0.007
ands-0.007
Token girl
Token ID2576
Feature activation-1.37e-02

Pos logit contributions
 tight+0.049
 stern+0.043
 fault+0.043
 lot+0.043
 understands+0.042
Neg logit contributions
iece-0.081
 myself-0.078
aming-0.073
Ta-0.072
CR-0.071
Tokenâ
Token ID22940
Feature activation-2.72e-02

Pos logit contributions
iece+0.025
 myself+0.024
aming+0.023
Ta+0.023
CR+0.022
Neg logit contributions
 tight-0.014
 understands-0.014
ucker-0.014
 fault-0.013
 stern-0.013
Token
Token ID26391
Feature activation-3.22e-04

Pos logit contributions
iece+0.037
 myself+0.037
 endings+0.035
 pirates+0.035
 seaw+0.034
Neg logit contributions
 tight-0.036
ucker-0.035
 understands-0.034
ishly-0.033
 -0.033
Token
Token ID8151
Feature activation+1.52e-01

Pos logit contributions
iece+0.001
 myself+0.001
aming+0.001
CR+0.000
Ta+0.000
Neg logit contributions
 tight-0.000
 fault-0.000
 understands-0.000
ucker-0.000
 stern-0.000
Tokens
Token ID82
Feature activation+1.37e-02

Pos logit contributions
 tight+0.200
 shrug+0.192
 stern+0.188
 understands+0.187
ucker+0.187
Neg logit contributions
iece-0.223
 myself-0.207
aming-0.203
CR-0.194
Now-0.189
Token mom
Token ID1995
Feature activation+2.75e-02

Pos logit contributions
 tight+0.013
ucker+0.013
 understands+0.012
 fault+0.012
 stern+0.012
Neg logit contributions
iece-0.026
 myself-0.026
aming-0.024
Ta-0.024
CR-0.023
Tokenmy
Token ID1820
Feature activation+1.62e-02

Pos logit contributions
 fault+0.032
 tight+0.032
ucker+0.031
 understands+0.030
run+0.030
Neg logit contributions
 myself-0.044
iece-0.043
aming-0.042
Ta-0.039
Now-0.039
Token gave
Token ID2921
Feature activation+4.08e-02

Pos logit contributions
 fault+0.019
 tight+0.018
ucker+0.018
run+0.018
 understands+0.017
Neg logit contributions
iece-0.026
 myself-0.026
aming-0.025
CR-0.023
Now-0.023
Token her
Token ID607
Feature activation+1.28e-02

Pos logit contributions
ucker+0.057
 fault+0.056
 understands+0.056
 tight+0.055
 stern+0.053
Neg logit contributions
 myself-0.058
iece-0.056
aming-0.052
Ta-0.051
 nights-0.048
Token
Token ID129
Feature activation+6.23e-02

Pos logit contributions
 fault+0.028
 tight+0.028
ucker+0.027
 understands+0.027
 stern+0.026
Neg logit contributions
 myself-0.041
iece-0.040
CR-0.037
aming-0.037
oy-0.036
Token
Token ID241
Feature activation+3.62e-02

Pos logit contributions
ucker+0.081
 tight+0.080
 understands+0.080
 fault+0.080
run+0.076
Neg logit contributions
 myself-0.096
iece-0.095
aming-0.087
Ta-0.084
CR-0.082
Tokendon
Token ID9099
Feature activation+1.02e-02

Pos logit contributions
 tight+0.036
ucker+0.034
 understands+0.034
 fault+0.033
run+0.033
Neg logit contributions
iece-0.067
 myself-0.065
aming-0.062
Ta-0.062
 nights-0.058
Tokenâ
Token ID22940
Feature activation-1.91e-02

Pos logit contributions
 tight+0.013
ucker+0.013
 understands+0.012
 fault+0.012
 stern+0.012
Neg logit contributions
 myself-0.016
iece-0.016
aming-0.015
Ta-0.015
CR-0.014
Token
Token ID26391
Feature activation+9.74e-03

Pos logit contributions
iece+0.024
 myself+0.024
aming+0.022
Ta+0.021
CR+0.020
Neg logit contributions
 tight-0.030
ucker-0.029
 understands-0.029
 fault-0.029
 stern-0.028
Token
Token ID8151
Feature activation+1.51e-01

Pos logit contributions
 tight+0.011
 fault+0.011
 understands+0.011
ucker+0.010
 stern+0.010
Neg logit contributions
 myself-0.016
iece-0.016
CR-0.015
aming-0.015
 seek-0.015
Tokent
Token ID83
Feature activation+1.67e-02

Pos logit contributions
 tight+0.199
 understands+0.193
 stern+0.190
ucker+0.189
 shrug+0.188
Neg logit contributions
iece-0.219
 myself-0.217
aming-0.194
mmmm-0.194
CR-0.194
Token worry
Token ID5490
Feature activation+2.39e-02

Pos logit contributions
 tight+0.020
run+0.019
 fault+0.018
ucker+0.018
 "+0.018
Neg logit contributions
iece-0.028
Ta-0.027
aming-0.026
 myself-0.026
CR-0.024
Token!
Token ID0
Feature activation+1.85e-02

Pos logit contributions
ucker+0.027
 understands+0.027
 fault+0.026
 tight+0.025
 stern+0.024
Neg logit contributions
 myself-0.040
iece-0.038
th-0.035
aming-0.035
ach-0.034
Token I
Token ID314
Feature activation-7.32e-03

Pos logit contributions
 tight+0.019
ucker+0.018
 understands+0.018
 fault+0.017
run+0.017
Neg logit contributions
iece-0.034
 myself-0.033
aming-0.031
Ta-0.031
CR-0.030
Token will
Token ID481
Feature activation+1.45e-02

Pos logit contributions
iece+0.012
 myself+0.012
aming+0.012
Ta+0.011
 seek+0.011
Neg logit contributions
ucker-0.008
 tight-0.008
 fault-0.007
 understands-0.007
 stern-0.007
TokenYes
Token ID5297
Feature activation+3.96e-02

Pos logit contributions
 tight+0.004
 understands+0.004
 fault+0.004
ucker+0.004
run+0.004
Neg logit contributions
iece-0.005
 myself-0.005
aming-0.004
Ta-0.004
oy-0.004
Token,
Token ID11
Feature activation+2.25e-02

Pos logit contributions
ucker+0.051
 fault+0.050
 understands+0.049
 tight+0.046
run+0.046
Neg logit contributions
 myself-0.063
iece-0.059
th-0.053
mmm-0.053
CR-0.053
Token that
Token ID326
Feature activation+5.79e-02

Pos logit contributions
 tight+0.028
ucker+0.027
 understands+0.026
 fault+0.026
 stern+0.026
Neg logit contributions
iece-0.036
 myself-0.035
aming-0.033
Ta-0.033
CR-0.031
Tokenâ
Token ID22940
Feature activation-2.56e-02

Pos logit contributions
ucker+0.055
 tight+0.055
 fault+0.052
 understands+0.052
 stern+0.050
Neg logit contributions
iece-0.108
 myself-0.107
aming-0.099
Ta-0.098
CR-0.096
Token
Token ID26391
Feature activation-5.21e-03

Pos logit contributions
iece+0.031
 myself+0.030
aming+0.027
Ta+0.027
CR+0.025
Neg logit contributions
 tight-0.042
ucker-0.042
 fault-0.041
 understands-0.041
run-0.040
Token
Token ID8151
Feature activation+1.51e-01

Pos logit contributions
 myself+0.008
iece+0.008
CR+0.008
aming+0.008
mmm+0.008
Neg logit contributions
 tight-0.006
 fault-0.006
ucker-0.006
 understands-0.006
 stern-0.006
Tokens
Token ID82
Feature activation+2.39e-02

Pos logit contributions
 shrug+0.173
 tight+0.173
ucker+0.171
 understands+0.170
 stern+0.169
Neg logit contributions
 myself-0.236
iece-0.229
CR-0.215
mmmm-0.212
aming-0.211
Token a
Token ID257
Feature activation+4.41e-02

Pos logit contributions
ucker+0.027
 tight+0.026
 understands+0.026
 fault+0.025
 stern+0.025
Neg logit contributions
 myself-0.041
iece-0.040
aming-0.037
Ta-0.035
 endings-0.035
Token great
Token ID1049
Feature activation+3.34e-02

Pos logit contributions
ucker+0.053
 fault+0.048
 understands+0.045
run+0.045
 tight+0.044
Neg logit contributions
 myself-0.073
iece-0.072
 seek-0.069
Ta-0.068
aming-0.068
Token idea
Token ID2126
Feature activation+3.60e-03

Pos logit contributions
ucker+0.041
 tight+0.038
 understands+0.038
 fault+0.037
run+0.037
Neg logit contributions
iece-0.054
 myself-0.054
aming-0.049
Ta-0.049
 seek-0.049
Token.
Token ID13
Feature activation+1.65e-02

Pos logit contributions
 tight+0.004
 understands+0.003
 fault+0.003
ucker+0.003
run+0.003
Neg logit contributions
iece-0.007
 myself-0.006
Ta-0.006
aming-0.006
CR-0.006
Token day
Token ID1110
Feature activation-1.60e-02

Pos logit contributions
iece+0.011
 myself+0.010
Ta+0.010
 endings+0.009
aming+0.009
Neg logit contributions
 tight-0.006
 lot-0.005
 "-0.005
ucker-0.005
 stern-0.005
Token,
Token ID11
Feature activation+7.29e-03

Pos logit contributions
 myself+0.038
Ta+0.036
ands+0.035
iece+0.035
aming+0.034
Neg logit contributions
 "-0.011
 tight-0.009
 understands-0.009
 stern-0.008
 -0.008
Token Rachel
Token ID15984
Feature activation-4.18e-03

Pos logit contributions
 "+0.011
 tight+0.011
 stern+0.010
 understands+0.010
 +0.010
Neg logit contributions
 myself-0.010
ands-0.009
Ta-0.009
iece-0.009
ach-0.009
Tokenâ
Token ID22940
Feature activation-3.49e-02

Pos logit contributions
 myself+0.010
iece+0.010
mmmm+0.009
mmm+0.009
Ta+0.009
Neg logit contributions
 would-0.003
 tight-0.003
 understands-0.003
 will-0.003
 won-0.003
Token
Token ID26391
Feature activation-1.13e-02

Pos logit contributions
iece+0.049
 myself+0.048
 pirates+0.046
 seaw+0.046
bridge+0.046
Neg logit contributions
 tight-0.044
ucker-0.041
 understands-0.041
 fault-0.040
 -0.039
Token
Token ID8151
Feature activation+1.51e-01

Pos logit contributions
iece+0.021
 myself+0.020
aming+0.019
Ta+0.019
CR+0.018
Neg logit contributions
 tight-0.011
ucker-0.011
 understands-0.011
 fault-0.011
 stern-0.010
Tokens
Token ID82
Feature activation+9.48e-03

Pos logit contributions
ucker+0.193
 understands+0.189
 tight+0.187
 fault+0.182
 shrug+0.182
Neg logit contributions
iece-0.239
 myself-0.220
aming-0.216
CR-0.203
Ta-0.199
Token mom
Token ID1995
Feature activation+1.76e-02

Pos logit contributions
ucker+0.012
 understands+0.011
run+0.010
arl+0.010
ip+0.010
Neg logit contributions
 pirates-0.014
iece-0.014
 myself-0.014
aming-0.014
 endings-0.013
Token came
Token ID1625
Feature activation+1.06e-03

Pos logit contributions
 tight+0.019
ucker+0.019
 understands+0.018
 fault+0.018
 stern+0.018
Neg logit contributions
iece-0.031
 myself-0.030
aming-0.028
Ta-0.028
CR-0.027
Token to
Token ID284
Feature activation-1.33e-02

Pos logit contributions
 tight+0.001
run+0.001
 "+0.001
 stern+0.001
 understands+0.001
Neg logit contributions
iece-0.002
 myself-0.002
Ta-0.002
mmmm-0.002
mmm-0.002
Token play
Token ID711
Feature activation+8.16e-03

Pos logit contributions
iece+0.027
aming+0.025
 myself+0.025
Ta+0.024
 nights+0.024
Neg logit contributions
 hug-0.012
 tight-0.012
 "-0.011
 smile-0.011
 stern-0.011
Tokenâ
Token ID22940
Feature activation-2.88e-02

Pos logit contributions
iece+0.029
 myself+0.027
Ta+0.026
aming+0.026
CR+0.025
Neg logit contributions
 tight-0.021
 "-0.019
run-0.018
 understands-0.018
 stern-0.018
Token
Token ID26391
Feature activation-6.29e-03

Pos logit contributions
iece+0.035
 myself+0.035
aming+0.032
Ta+0.031
CR+0.029
Neg logit contributions
 tight-0.046
ucker-0.046
 understands-0.045
 fault-0.045
 stern-0.044
Token Sam
Token ID3409
Feature activation+1.79e-02

Pos logit contributions
 myself+0.010
iece+0.010
CR+0.009
aming+0.009
oy+0.009
Neg logit contributions
 tight-0.007
 understands-0.007
 fault-0.007
ucker-0.007
 stern-0.007
Tokenâ
Token ID22940
Feature activation-1.73e-02

Pos logit contributions
ucker+0.021
 tight+0.020
 fault+0.020
arl+0.020
 understands+0.019
Neg logit contributions
 myself-0.029
iece-0.028
aming-0.026
CR-0.025
oy-0.025
Token
Token ID26391
Feature activation+8.20e-04

Pos logit contributions
iece+0.020
 myself+0.020
aming+0.018
Ta+0.017
Now+0.016
Neg logit contributions
ucker-0.029
 tight-0.029
 fault-0.029
 understands-0.029
 stern-0.028
Token
Token ID8151
Feature activation+1.51e-01

Pos logit contributions
 tight+0.001
 understands+0.001
 fault+0.001
ucker+0.001
 stern+0.001
Neg logit contributions
 myself-0.001
iece-0.001
oy-0.001
CR-0.001
 seek-0.001
Tokens
Token ID82
Feature activation+1.69e-02

Pos logit contributions
 tight+0.192
 shrug+0.192
 understands+0.190
 stern+0.184
ucker+0.184
Neg logit contributions
 myself-0.218
iece-0.213
aming-0.194
mmmm-0.193
CR-0.193
Token mum
Token ID25682
Feature activation+9.59e-03

Pos logit contributions
ucker+0.016
 understands+0.015
 tight+0.015
 fault+0.014
 stern+0.014
Neg logit contributions
 myself-0.032
iece-0.031
aming-0.029
Ta-0.028
 endings-0.028
Token said
Token ID531
Feature activation+6.92e-04

Pos logit contributions
 fault+0.012
ucker+0.012
 tight+0.011
arl+0.011
 understands+0.011
Neg logit contributions
 myself-0.015
iece-0.014
aming-0.013
oy-0.013
th-0.013
Token.
Token ID13
Feature activation+3.80e-02

Pos logit contributions
ucker+0.001
 fault+0.001
 understands+0.001
 tight+0.001
arl+0.001
Neg logit contributions
 myself-0.001
iece-0.001
th-0.001
CR-0.001
aming-0.001
Token
Token ID6184
Feature activation+4.62e-03

Pos logit contributions
 tight+0.034
ucker+0.034
 understands+0.034
 fault+0.034
 stern+0.031
Neg logit contributions
 myself-0.072
iece-0.072
aming-0.066
Ta-0.065
CR-0.065
Token day
Token ID1110
Feature activation-2.01e-02

Pos logit contributions
iece+0.005
 myself+0.005
Ta+0.004
aming+0.004
CR+0.004
Neg logit contributions
 tight-0.004
ucker-0.004
 understands-0.004
 fault-0.004
 stern-0.004
Token,
Token ID11
Feature activation+6.77e-03

Pos logit contributions
iece+0.044
 myself+0.044
Ta+0.042
aming+0.042
ands+0.040
Neg logit contributions
 "-0.015
 tight-0.015
 stern-0.013
 lot-0.013
 -0.012
Token Tim
Token ID5045
Feature activation+8.59e-03

Pos logit contributions
 tight+0.007
 "+0.007
 lot+0.006
 Spot+0.006
 stern+0.006
Neg logit contributions
iece-0.014
Ta-0.013
aming-0.013
 myself-0.012
ands-0.012
Tokenâ
Token ID22940
Feature activation-2.79e-02

Pos logit contributions
 tight+0.007
 understands+0.007
ucker+0.007
 fault+0.006
 would+0.006
Neg logit contributions
iece-0.018
 myself-0.017
Ta-0.017
mmmm-0.016
aming-0.016
Token
Token ID26391
Feature activation-4.87e-03

Pos logit contributions
iece+0.041
 myself+0.040
 seaw+0.038
 pirates+0.038
 endings+0.038
Neg logit contributions
 tight-0.035
ucker-0.032
arl-0.032
 -0.032
 Together-0.031
Token
Token ID8151
Feature activation+1.51e-01

Pos logit contributions
iece+0.008
 myself+0.008
aming+0.008
CR+0.008
oy+0.008
Neg logit contributions
 tight-0.005
 fault-0.005
 understands-0.005
ucker-0.005
 stern-0.005
Tokens
Token ID82
Feature activation+1.58e-02

Pos logit contributions
 understands+0.177
 tight+0.176
ucker+0.174
 shrug+0.170
 stern+0.169
Neg logit contributions
iece-0.245
 myself-0.231
aming-0.224
CR-0.212
Ta-0.210
Token mom
Token ID1995
Feature activation+2.26e-02

Pos logit contributions
ucker+0.017
 understands+0.016
run+0.016
 tight+0.015
 fault+0.015
Neg logit contributions
 myself-0.026
iece-0.026
aming-0.024
 endings-0.024
 nights-0.024
Token gave
Token ID2921
Feature activation+3.40e-02

Pos logit contributions
 tight+0.025
 fault+0.024
ucker+0.024
 understands+0.024
run+0.023
Neg logit contributions
iece-0.038
 myself-0.038
aming-0.036
Ta-0.035
CR-0.034
Token him
Token ID683
Feature activation-3.38e-03

Pos logit contributions
ucker+0.060
 tight+0.059
 understands+0.058
 fault+0.058
 stern+0.057
Neg logit contributions
 myself-0.036
iece-0.036
aming-0.032
Ta-0.031
CR-0.029
Token a
Token ID257
Feature activation+2.24e-02

Pos logit contributions
 myself+0.004
iece+0.004
aming+0.004
Ta+0.004
oy+0.004
Neg logit contributions
ucker-0.005
 understands-0.005
 tight-0.005
 fault-0.005
 stern-0.005
Token was
Token ID373
Feature activation-2.44e-02

Pos logit contributions
iece+0.046
 myself+0.045
aming+0.042
Ta+0.042
CR+0.040
Neg logit contributions
 tight-0.028
ucker-0.027
 understands-0.027
 fault-0.026
 stern-0.025
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
 myself+0.037
aming+0.036
iece+0.036
 seek+0.033
 nights+0.032
Neg logit contributions
arl-0.031
ucker-0.030
ometer-0.030
 understands-0.030
 fault-0.029
Token brightest
Token ID33871
Feature activation-1.02e-02

Pos logit contributions
iece+0.021
 myself+0.020
aming+0.019
Ta+0.019
CR+0.019
Neg logit contributions
 tight-0.016
 fault-0.015
ucker-0.014
 understands-0.014
 stern-0.014
Token it
Token ID340
Feature activation-1.77e-02

Pos logit contributions
 myself+0.016
iece+0.016
aming+0.015
 nights+0.014
Ta+0.014
Neg logit contributions
ucker-0.012
 tight-0.012
 understands-0.012
 fault-0.012
arl-0.012
Token could
Token ID714
Feature activation-3.65e-02

Pos logit contributions
iece+0.032
 myself+0.031
aming+0.029
Ta+0.029
CR+0.028
Neg logit contributions
ucker-0.018
 tight-0.018
 fault-0.017
 understands-0.017
 stern-0.016
Token possibly
Token ID5457
Feature activation-1.41e-01

Pos logit contributions
iece+0.064
Ta+0.060
 myself+0.058
aming+0.058
CR+0.057
Neg logit contributions
 tight-0.042
 hug-0.039
 understands-0.039
 fault-0.038
 "-0.038
Token be
Token ID307
Feature activation-8.57e-02

Pos logit contributions
iece+0.237
 myself+0.222
aming+0.221
Ta+0.220
CR+0.209
Neg logit contributions
 tight-0.168
 understands-0.155
 fault-0.155
ucker-0.153
run-0.152
Token.
Token ID13
Feature activation-6.64e-03

Pos logit contributions
iece+0.153
 myself+0.148
Ta+0.143
aming+0.138
CR+0.134
Neg logit contributions
 tight-0.092
ucker-0.087
 fault-0.086
 understands-0.085
 stern-0.083
Token Ab
Token ID2275
Feature activation+4.08e-03

Pos logit contributions
 myself+0.013
iece+0.013
aming+0.012
Ta+0.012
mmmm+0.012
Neg logit contributions
ucker-0.005
 tight-0.005
 fault-0.005
 understands-0.005
 stern-0.005
Tokenbi
Token ID8482
Feature activation+6.55e-02

Pos logit contributions
 tight+0.003
 stern+0.003
 fault+0.003
ometer+0.003
 "+0.003
Neg logit contributions
iece-0.008
 myself-0.008
aming-0.008
mmmm-0.008
 behold-0.008
Token had
Token ID550
Feature activation+1.02e-02

Pos logit contributions
ucker+0.071
 tight+0.071
 fault+0.069
arl+0.067
 understands+0.067
Neg logit contributions
iece-0.112
 myself-0.110
aming-0.103
Ta-0.101
CR-0.098
Token bigger
Token ID5749
Feature activation+5.95e-03

Pos logit contributions
 myself+0.049
iece+0.048
aming+0.044
 seek+0.043
Ta+0.043
Neg logit contributions
ucker-0.043
 fault-0.043
 understands-0.042
arl-0.041
 tight-0.040
Token until
Token ID1566
Feature activation-4.47e-02

Pos logit contributions
 tight+0.007
ucker+0.007
 understands+0.007
 fault+0.007
 stern+0.007
Neg logit contributions
iece-0.010
 myself-0.009
aming-0.009
Ta-0.009
CR-0.008
Token it
Token ID340
Feature activation+2.49e-02

Pos logit contributions
iece+0.076
 myself+0.074
aming+0.070
Ta+0.069
CR+0.066
Neg logit contributions
 tight-0.051
ucker-0.050
 understands-0.049
 fault-0.048
 stern-0.047
Token filled
Token ID5901
Feature activation-5.26e-02

Pos logit contributions
 tight+0.024
 would+0.022
 won+0.022
 understands+0.022
 will+0.021
Neg logit contributions
 myself-0.050
iece-0.047
Ta-0.045
mmmm-0.045
ands-0.044
Token the
Token ID262
Feature activation-6.94e-02

Pos logit contributions
iece+0.095
 myself+0.091
Ta+0.088
aming+0.087
CR+0.084
Neg logit contributions
 tight-0.057
ucker-0.052
 understands-0.051
 fault-0.050
 stern-0.050
Token entire
Token ID2104
Feature activation-1.41e-01

Pos logit contributions
 myself+0.129
iece+0.126
oy+0.124
bees+0.123
mmmm+0.121
Neg logit contributions
 tight-0.073
 understands-0.064
 basket-0.062
 lot-0.060
 will-0.059
Token box
Token ID3091
Feature activation+3.78e-02

Pos logit contributions
iece+0.250
 myself+0.243
Ta+0.232
aming+0.222
mmmm+0.220
Neg logit contributions
 tight-0.162
 understands-0.147
 fault-0.142
 basket-0.140
 lot-0.139
Token.
Token ID13
Feature activation-7.36e-03

Pos logit contributions
 tight+0.038
ucker+0.035
 understands+0.035
 fault+0.034
 stern+0.034
Neg logit contributions
iece-0.070
 myself-0.069
Ta-0.065
aming-0.065
CR-0.062
Token It
Token ID632
Feature activation-3.68e-03

Pos logit contributions
 myself+0.012
iece+0.012
Ta+0.011
aming+0.011
CR+0.010
Neg logit contributions
ucker-0.009
 fault-0.009
 tight-0.008
 understands-0.008
arl-0.008
Token was
Token ID373
Feature activation-3.13e-02

Pos logit contributions
iece+0.007
 myself+0.007
Ta+0.007
mmmm+0.007
aming+0.007
Neg logit contributions
 tight-0.003
 understands-0.003
 would-0.003
 will-0.003
 won-0.003
Token the
Token ID262
Feature activation-1.49e-02

Pos logit contributions
iece+0.046
 myself+0.046
aming+0.043
Ta+0.041
CR+0.039
Neg logit contributions
ucker-0.042
 understands-0.041
 tight-0.041
 fault-0.040
arl-0.039
Token loud
Token ID7812
Feature activation-2.31e-03

Pos logit contributions
iece+0.065
 myself+0.064
aming+0.059
Ta+0.057
 behold+0.057
Neg logit contributions
ucker-0.037
 fault-0.036
 understands-0.034
 tight-0.034
run-0.034
Token that
Token ID326
Feature activation-1.53e-02

Pos logit contributions
iece+0.004
 myself+0.004
aming+0.004
Ta+0.004
CR+0.004
Neg logit contributions
 tight-0.003
ucker-0.002
 understands-0.002
 fault-0.002
 stern-0.002
Token it
Token ID340
Feature activation-4.21e-03

Pos logit contributions
iece+0.025
 myself+0.024
aming+0.022
 nights+0.022
 behold+0.022
Neg logit contributions
ucker-0.019
arl-0.018
 fault-0.018
 tight-0.018
 understands-0.018
Token filled
Token ID5901
Feature activation-6.19e-02

Pos logit contributions
iece+0.007
 myself+0.007
aming+0.007
Ta+0.007
CR+0.006
Neg logit contributions
 tight-0.005
ucker-0.005
 understands-0.004
 fault-0.004
 stern-0.004
Token the
Token ID262
Feature activation-7.62e-02

Pos logit contributions
iece+0.122
Ta+0.118
 myself+0.115
aming+0.111
CR+0.110
Neg logit contributions
 tight-0.060
 "-0.048
 lot-0.048
 stern-0.048
 fault-0.048
Token entire
Token ID2104
Feature activation-1.40e-01

Pos logit contributions
iece+0.114
 myself+0.111
Ta+0.102
aming+0.102
mmmm+0.099
Neg logit contributions
 tight-0.104
 understands-0.098
 fault-0.098
ucker-0.097
 stern-0.094
Token house
Token ID2156
Feature activation-6.53e-02

Pos logit contributions
iece+0.213
 myself+0.209
aming+0.194
Ta+0.191
CR+0.183
Neg logit contributions
 tight-0.185
ucker-0.181
 understands-0.178
 fault-0.177
 stern-0.172
Token!
Token ID0
Feature activation-9.70e-03

Pos logit contributions
iece+0.121
 myself+0.118
aming+0.111
Ta+0.105
CR+0.104
Neg logit contributions
ucker-0.064
 understands-0.063
 tight-0.062
 fault-0.061
 stern-0.058
Token Everyone
Token ID11075
Feature activation-2.26e-02

Pos logit contributions
iece+0.016
 myself+0.016
Ta+0.015
Now+0.015
aming+0.015
Neg logit contributions
ucker-0.011
 fault-0.011
 tight-0.010
 stern-0.010
arl-0.010
Token cl
Token ID537
Feature activation-9.59e-03

Pos logit contributions
iece+0.039
 myself+0.039
aming+0.036
Ta+0.036
CR+0.035
Neg logit contributions
 tight-0.025
ucker-0.024
 understands-0.024
 fault-0.023
 stern-0.023
Tokenapped
Token ID6320
Feature activation+4.90e-03

Pos logit contributions
 myself+0.020
iece+0.020
Ta+0.019
 seek+0.019
aming+0.018
Neg logit contributions
ucker-0.007
 tight-0.007
 understands-0.007
run-0.006
 fault-0.006
Token big
Token ID1263
Feature activation+1.10e-02

Pos logit contributions
iece+0.019
 myself+0.019
Ta+0.018
aming+0.018
CR+0.017
Neg logit contributions
 tight-0.009
ucker-0.008
 understands-0.008
 fault-0.008
 stern-0.007
Token that
Token ID326
Feature activation+5.21e-03

Pos logit contributions
 tight+0.012
ucker+0.012
 understands+0.012
 fault+0.012
run+0.011
Neg logit contributions
iece-0.019
 myself-0.019
aming-0.017
Ta-0.017
CR-0.017
Token it
Token ID340
Feature activation+1.09e-02

Pos logit contributions
ucker+0.006
 tight+0.006
 fault+0.006
 understands+0.006
run+0.005
Neg logit contributions
iece-0.009
 myself-0.009
aming-0.008
Ta-0.008
CR-0.008
Token filled
Token ID5901
Feature activation-5.49e-02

Pos logit contributions
 tight+0.010
 understands+0.009
 would+0.009
ucker+0.009
 will+0.009
Neg logit contributions
iece-0.022
 myself-0.021
Ta-0.020
mmmm-0.020
aming-0.020
Token the
Token ID262
Feature activation-6.62e-02

Pos logit contributions
iece+0.115
Ta+0.111
oy+0.104
aming+0.104
 myself+0.103
Neg logit contributions
 tight-0.050
 "-0.040
 lot-0.038
ucker-0.038
 stern-0.036
Token entire
Token ID2104
Feature activation-1.39e-01

Pos logit contributions
iece+0.099
 myself+0.096
Ta+0.093
oy+0.091
 behold+0.087
Neg logit contributions
 tight-0.094
 understands-0.087
 fault-0.085
 basket-0.084
ucker-0.084
Token garden
Token ID11376
Feature activation-4.29e-02

Pos logit contributions
iece+0.228
 myself+0.218
Ta+0.214
aming+0.200
 behold+0.195
Neg logit contributions
 tight-0.174
 understands-0.164
 fault-0.160
ucker-0.160
 "-0.155
Token.
Token ID13
Feature activation-4.26e-03

Pos logit contributions
iece+0.077
 myself+0.076
aming+0.071
Ta+0.070
CR+0.068
Neg logit contributions
 tight-0.044
ucker-0.043
 understands-0.042
 fault-0.042
 stern-0.040
Token But
Token ID887
Feature activation-4.40e-02

Pos logit contributions
iece+0.009
 myself+0.009
aming+0.009
Ta+0.009
CR+0.008
Neg logit contributions
 tight-0.003
 understands-0.003
ucker-0.003
 "-0.003
 fault-0.003
Token something
Token ID1223
Feature activation+1.87e-02

Pos logit contributions
Ta+0.074
iece+0.074
 myself+0.072
aming+0.070
ands+0.069
Neg logit contributions
 tight-0.055
 "-0.051
 understands-0.048
 stern-0.047
 would-0.046
Token strange
Token ID6283
Feature activation-3.37e-02

Pos logit contributions
ucker+0.024
arl+0.023
 fault+0.023
 tight+0.022
 understands+0.022
Neg logit contributions
iece-0.028
 myself-0.027
aming-0.027
CR-0.026
Ta-0.025
Token Some
Token ID2773
Feature activation-1.53e-02

Pos logit contributions
iece+0.011
 myself+0.010
aming+0.010
Ta+0.010
 endings+0.010
Neg logit contributions
 tight-0.005
 "-0.004
run-0.004
 understands-0.004
ucker-0.004
Token people
Token ID661
Feature activation-3.39e-02

Pos logit contributions
iece+0.022
 myself+0.021
Ta+0.019
aming+0.019
CR+0.018
Neg logit contributions
 tight-0.022
 understands-0.022
 lot-0.021
ucker-0.021
 will-0.021
Token live
Token ID2107
Feature activation-5.33e-02

Pos logit contributions
 myself+0.055
iece+0.055
aming+0.051
Ta+0.050
oy+0.049
Neg logit contributions
ucker-0.038
 fault-0.038
 tight-0.038
 understands-0.037
 stern-0.036
Token on
Token ID319
Feature activation+5.88e-02

Pos logit contributions
iece+0.085
 myself+0.084
aming+0.078
Ta+0.077
CR+0.074
Neg logit contributions
 tight-0.065
ucker-0.064
 understands-0.063
 fault-0.062
 stern-0.060
Token y
Token ID331
Feature activation+6.99e-03

Pos logit contributions
 tight+0.060
ucker+0.056
 fault+0.055
 understands+0.055
 stern+0.053
Neg logit contributions
iece-0.109
 myself-0.105
Ta-0.100
aming-0.100
CR-0.095
Tokenach
Token ID620
Feature activation-1.33e-01

Pos logit contributions
ucker+0.010
 tight+0.010
 fault+0.009
arl+0.009
 understands+0.009
Neg logit contributions
iece-0.011
 myself-0.010
Ta-0.010
 seek-0.009
aming-0.009
Tokents
Token ID912
Feature activation+4.37e-02

Pos logit contributions
iece+0.164
Ta+0.159
 myself+0.155
aming+0.143
CR+0.138
Neg logit contributions
 tight-0.215
 understands-0.204
ucker-0.203
 fault-0.199
 stern-0.198
Token and
Token ID290
Feature activation-4.45e-02

Pos logit contributions
 tight+0.046
 fault+0.039
 stern+0.038
 understands+0.038
 will+0.038
Neg logit contributions
iece-0.084
Ta-0.083
aming-0.077
 myself-0.077
CR-0.074
Token travel
Token ID3067
Feature activation-2.08e-02

Pos logit contributions
iece+0.074
 myself+0.069
aming+0.068
Ta+0.068
CR+0.063
Neg logit contributions
 tight-0.055
 understands-0.051
 "-0.050
ucker-0.050
 will-0.050
Token around
Token ID1088
Feature activation-2.62e-02

Pos logit contributions
 myself+0.036
iece+0.036
aming+0.033
Ta+0.032
CR+0.032
Neg logit contributions
ucker-0.023
 fault-0.022
 understands-0.022
 tight-0.022
 stern-0.021
Token the
Token ID262
Feature activation-2.28e-02

Pos logit contributions
Ta+0.053
iece+0.053
aming+0.049
 myself+0.048
mmmm+0.047
Neg logit contributions
 tight-0.025
 "-0.022
 will-0.021
 lot-0.020
 fault-0.019
Token shapes
Token ID15268
Feature activation-3.40e-02

Pos logit contributions
 myself+0.015
iece+0.015
aming+0.014
 behold+0.014
 nights+0.014
Neg logit contributions
 fault-0.015
ometer-0.015
arl-0.015
ucker-0.015
 stern-0.014
Token and
Token ID290
Feature activation-4.63e-02

Pos logit contributions
iece+0.056
 myself+0.055
aming+0.052
Ta+0.051
CR+0.049
Neg logit contributions
 tight-0.040
ucker-0.039
 understands-0.038
 fault-0.038
 stern-0.037
Token colors
Token ID7577
Feature activation-3.22e-02

Pos logit contributions
iece+0.064
 myself+0.060
aming+0.056
Ta+0.056
CR+0.055
Neg logit contributions
 tight-0.071
 understands-0.068
 "-0.067
 lot-0.066
ucker-0.065
Token filled
Token ID5901
Feature activation-7.17e-02

Pos logit contributions
iece+0.052
 myself+0.050
aming+0.048
th+0.043
bees+0.043
Neg logit contributions
 understands-0.038
 fault-0.038
ucker-0.038
 tight-0.038
 stern-0.037
Token the
Token ID262
Feature activation-7.28e-02

Pos logit contributions
iece+0.135
Ta+0.127
 myself+0.125
CR+0.122
aming+0.120
Neg logit contributions
 tight-0.078
 "-0.065
 lot-0.065
 stern-0.064
ucker-0.063
Token me
Token ID502
Feature activation-1.31e-01

Pos logit contributions
iece+0.113
 myself+0.100
Ta+0.098
CR+0.098
mmmm+0.097
Neg logit contributions
 tight-0.107
 understands-0.097
 basket-0.096
 lot-0.095
 will-0.094
Tokenadow
Token ID4584
Feature activation-5.94e-02

Pos logit contributions
Ta+0.183
iece+0.180
 endings+0.170
 myself+0.168
CR+0.167
Neg logit contributions
 tight-0.193
ucker-0.175
run-0.174
 understands-0.170
 smile-0.167
Token,
Token ID11
Feature activation-3.97e-02

Pos logit contributions
iece+0.107
 myself+0.105
Ta+0.102
aming+0.099
CR+0.097
Neg logit contributions
 tight-0.063
ucker-0.057
 understands-0.056
 stern-0.056
 fault-0.055
Token and
Token ID290
Feature activation-2.56e-02

Pos logit contributions
iece+0.066
 myself+0.064
Ta+0.060
aming+0.060
CR+0.058
Neg logit contributions
 tight-0.049
ucker-0.045
 understands-0.045
 stern-0.044
 fault-0.044
Token it
Token ID340
Feature activation+4.41e-03

Pos logit contributions
iece+0.043
 myself+0.042
aming+0.040
Ta+0.039
CR+0.038
Neg logit contributions
 tight-0.029
ucker-0.028
 understands-0.028
 fault-0.028
 stern-0.027
Token was
Token ID373
Feature activation-3.64e-02

Pos logit contributions
ucker+0.005
 fault+0.005
 tight+0.005
arl+0.005
run+0.005
Neg logit contributions
iece-0.007
 myself-0.007
aming-0.007
Ta-0.006
CR-0.006
Token until
Token ID1566
Feature activation-3.93e-02

Pos logit contributions
 tight+0.010
 "+0.008
 lot+0.008
ucker+0.008
run+0.008
Neg logit contributions
iece-0.021
Ta-0.020
 myself-0.020
aming-0.019
CR-0.018
Token he
Token ID339
Feature activation-6.69e-03

Pos logit contributions
 myself+0.066
iece+0.066
Ta+0.063
aming+0.063
ands+0.058
Neg logit contributions
 tight-0.048
 "-0.043
 lot-0.042
 -0.041
 stern-0.041
Token had
Token ID550
Feature activation+2.04e-02

Pos logit contributions
 myself+0.015
iece+0.014
Ta+0.013
ands+0.013
ach+0.013
Neg logit contributions
 would-0.006
 won-0.005
 will-0.005
 understands-0.005
 tight-0.005
Token filled
Token ID5901
Feature activation-4.21e-02

Pos logit contributions
 tight+0.023
 fault+0.023
 understands+0.022
ucker+0.022
 stern+0.021
Neg logit contributions
iece-0.035
 myself-0.034
Ta-0.032
aming-0.032
oy-0.030
Token the
Token ID262
Feature activation-5.89e-02

Pos logit contributions
iece+0.086
Ta+0.084
CR+0.080
aming+0.078
mmmm+0.077
Neg logit contributions
 tight-0.041
 "-0.034
 -0.031
 lot-0.030
ucker-0.030
Token entire
Token ID2104
Feature activation-1.30e-01

Pos logit contributions
iece+0.107
 myself+0.095
CR+0.094
mmm+0.093
mmmm+0.093
Neg logit contributions
 tight-0.072
 lot-0.065
 basket-0.064
 understands-0.062
 will-0.060
Token sky
Token ID6766
Feature activation-6.35e-02

Pos logit contributions
iece+0.221
 myself+0.203
Ta+0.200
aming+0.188
CR+0.184
Neg logit contributions
 tight-0.166
 understands-0.153
 lot-0.150
 basket-0.148
 fault-0.145
Token with
Token ID351
Feature activation-2.77e-03

Pos logit contributions
iece+0.122
Ta+0.120
 myself+0.118
CR+0.112
aming+0.111
Neg logit contributions
 tight-0.062
ucker-0.054
 fault-0.052
 stern-0.051
 "-0.051
Token a
Token ID257
Feature activation-7.49e-03

Pos logit contributions
iece+0.005
 myself+0.005
Ta+0.004
aming+0.004
CR+0.004
Neg logit contributions
 tight-0.003
 understands-0.003
 fault-0.003
 lot-0.003
ucker-0.003
Token mystery
Token ID10715
Feature activation-6.43e-03

Pos logit contributions
 myself+0.012
iece+0.012
aming+0.012
Ta+0.011
 seek+0.011
Neg logit contributions
ucker-0.008
 fault-0.008
run-0.008
 understands-0.008
arl-0.008
Token.
Token ID13
Feature activation+1.62e-04

Pos logit contributions
iece+0.011
 myself+0.011
aming+0.011
Ta+0.010
CR+0.010
Neg logit contributions
ucker-0.007
 tight-0.007
 understands-0.007
 fault-0.006
 stern-0.006
Token big
Token ID1263
Feature activation-1.16e-02

Pos logit contributions
 seek+0.030
 myself+0.030
iece+0.029
aming+0.029
Ta+0.028
Neg logit contributions
 fault-0.021
ucker-0.021
run-0.020
 understands-0.020
icks-0.020
Token fuzzy
Token ID34669
Feature activation-1.10e-02

Pos logit contributions
 myself+0.018
iece+0.018
 seek+0.017
aming+0.017
oy+0.016
Neg logit contributions
ucker-0.014
 fault-0.013
run-0.013
 understands-0.013
arl-0.012
Token blanket
Token ID18447
Feature activation+4.07e-02

Pos logit contributions
iece+0.017
 myself+0.017
aming+0.016
Ta+0.016
 seek+0.016
Neg logit contributions
ucker-0.013
 understands-0.013
 fault-0.013
 tight-0.013
run-0.012
Token covering
Token ID9505
Feature activation-5.87e-02

Pos logit contributions
 fault+0.052
ucker+0.052
 understands+0.052
 tight+0.049
run+0.047
Neg logit contributions
iece-0.064
 myself-0.061
aming-0.059
CR-0.054
 seek-0.054
Token the
Token ID262
Feature activation-6.14e-02

Pos logit contributions
iece+0.110
Ta+0.105
 myself+0.104
aming+0.102
CR+0.100
Neg logit contributions
 tight-0.062
 "-0.053
 understands-0.052
 stern-0.052
ucker-0.052
Token entire
Token ID2104
Feature activation-1.28e-01

Pos logit contributions
iece+0.096
 myself+0.089
Ta+0.086
oy+0.084
aming+0.082
Neg logit contributions
 tight-0.088
 basket-0.081
 lot-0.078
 will-0.077
 understands-0.077
Token park
Token ID3952
Feature activation-3.18e-02

Pos logit contributions
iece+0.200
 myself+0.196
aming+0.182
Ta+0.180
CR+0.172
Neg logit contributions
 tight-0.163
ucker-0.160
 understands-0.158
 fault-0.156
 stern-0.151
Token!
Token ID0
Feature activation-1.78e-02

Pos logit contributions
iece+0.057
 myself+0.057
aming+0.053
 seek+0.051
Ta+0.050
Neg logit contributions
ucker-0.033
 understands-0.032
 fault-0.031
 tight-0.031
arl-0.030
Token Ell
Token ID7122
Feature activation-3.43e-03

Pos logit contributions
iece+0.034
 myself+0.034
aming+0.032
Ta+0.032
CR+0.031
Neg logit contributions
 tight-0.016
ucker-0.016
 understands-0.015
 fault-0.015
 stern-0.014
Tokena
Token ID64
Feature activation-1.73e-02

Pos logit contributions
iece+0.006
 myself+0.006
Ta+0.005
aming+0.005
 endings+0.005
Neg logit contributions
ucker-0.004
 tight-0.004
 understands-0.004
run-0.004
 stern-0.004
Token was
Token ID373
Feature activation-6.86e-03

Pos logit contributions
iece+0.036
 myself+0.035
Ta+0.034
mmmm+0.033
ands+0.033
Neg logit contributions
 tight-0.016
 understands-0.015
 "-0.014
 stern-0.014
 would-0.014
Token wide
Token ID3094
Feature activation-1.59e-02

Pos logit contributions
iece+0.050
 myself+0.050
aming+0.047
Ta+0.047
CR+0.045
Neg logit contributions
 tight-0.023
ucker-0.023
 fault-0.023
 understands-0.023
 stern-0.021
Token that
Token ID326
Feature activation-7.90e-03

Pos logit contributions
iece+0.027
 myself+0.027
aming+0.025
Ta+0.024
CR+0.024
Neg logit contributions
ucker-0.018
 tight-0.017
 understands-0.017
 fault-0.017
 stern-0.016
Token it
Token ID340
Feature activation+5.12e-03

Pos logit contributions
iece+0.014
 myself+0.014
aming+0.013
Ta+0.013
CR+0.013
Neg logit contributions
 tight-0.008
ucker-0.008
 understands-0.008
 fault-0.008
 stern-0.007
Token filled
Token ID5901
Feature activation-5.64e-02

Pos logit contributions
 tight+0.005
ucker+0.005
 fault+0.005
 understands+0.005
 stern+0.005
Neg logit contributions
iece-0.009
 myself-0.009
aming-0.008
Ta-0.008
CR-0.008
Token the
Token ID262
Feature activation-6.37e-02

Pos logit contributions
iece+0.103
Ta+0.097
aming+0.090
 myself+0.090
CR+0.090
Neg logit contributions
 tight-0.067
 "-0.057
ucker-0.055
 lot-0.055
 stern-0.053
Token entire
Token ID2104
Feature activation-1.28e-01

Pos logit contributions
iece+0.104
 myself+0.100
Ta+0.095
oy+0.094
mmmm+0.094
Neg logit contributions
 tight-0.085
 basket-0.077
 lot-0.074
 understands-0.072
ucker-0.071
Token room
Token ID2119
Feature activation-2.57e-02

Pos logit contributions
iece+0.215
 myself+0.209
Ta+0.198
aming+0.191
CR+0.185
Neg logit contributions
 tight-0.154
 understands-0.142
ucker-0.142
 fault-0.139
 basket-0.136
Token.
Token ID13
Feature activation+8.92e-03

Pos logit contributions
iece+0.049
 myself+0.048
aming+0.045
Ta+0.045
CR+0.043
Neg logit contributions
 tight-0.024
ucker-0.023
 understands-0.023
 fault-0.022
 stern-0.021
Token Every
Token ID3887
Feature activation+4.57e-02

Pos logit contributions
 tight+0.006
ucker+0.006
 understands+0.006
 fault+0.006
 stern+0.006
Neg logit contributions
iece-0.019
 myself-0.019
aming-0.018
Ta-0.017
CR-0.017
Token day
Token ID1110
Feature activation-1.66e-02

Pos logit contributions
ucker+0.066
 tight+0.064
 fault+0.064
 understands+0.064
 stern+0.062
Neg logit contributions
iece-0.063
 myself-0.061
aming-0.057
Ta-0.056
CR-0.054
Token,
Token ID11
Feature activation+4.36e-03

Pos logit contributions
iece+0.033
 myself+0.033
aming+0.031
Ta+0.031
CR+0.029
Neg logit contributions
 tight-0.015
ucker-0.013
 understands-0.013
 fault-0.013
 "-0.013
Token big
Token ID1263
Feature activation-2.56e-03

Pos logit contributions
 myself+0.051
iece+0.049
aming+0.048
Ta+0.046
 seek+0.045
Neg logit contributions
 fault-0.037
ucker-0.036
 understands-0.035
ometer-0.035
ishly-0.035
Token it
Token ID340
Feature activation-7.29e-03

Pos logit contributions
iece+0.004
 myself+0.004
aming+0.004
Ta+0.004
CR+0.004
Neg logit contributions
 tight-0.003
ucker-0.003
 understands-0.003
 fault-0.003
 stern-0.003
Token almost
Token ID2048
Feature activation-8.34e-02

Pos logit contributions
iece+0.013
 myself+0.013
aming+0.012
Ta+0.012
CR+0.011
Neg logit contributions
 tight-0.008
ucker-0.008
 understands-0.007
 fault-0.007
 stern-0.007
Token covered
Token ID5017
Feature activation-7.10e-02

Pos logit contributions
 myself+0.133
iece+0.131
aming+0.121
CR+0.117
Ta+0.115
Neg logit contributions
ucker-0.101
 understands-0.099
 fault-0.097
 tight-0.097
ometer-0.092
Token her
Token ID607
Feature activation-5.97e-02

Pos logit contributions
iece+0.123
Ta+0.118
 myself+0.118
aming+0.112
CR+0.110
Neg logit contributions
 tight-0.081
 fault-0.071
 stern-0.071
ucker-0.071
 "-0.070
Token entire
Token ID2104
Feature activation-1.26e-01

Pos logit contributions
Ta+0.125
CR+0.120
iece+0.119
Now+0.118
mmmm+0.116
Neg logit contributions
 tight-0.054
 smile-0.044
 fault-0.041
 hug-0.040
 stern-0.039
Token face
Token ID1986
Feature activation-3.53e-02

Pos logit contributions
iece+0.234
 myself+0.224
Ta+0.221
aming+0.214
mmmm+0.209
Neg logit contributions
 tight-0.130
 fault-0.120
 understands-0.118
ucker-0.112
 stern-0.112
Token.
Token ID13
Feature activation-7.47e-03

Pos logit contributions
iece+0.067
 myself+0.066
aming+0.062
Ta+0.062
CR+0.060
Neg logit contributions
 tight-0.033
ucker-0.032
 understands-0.031
 fault-0.031
 stern-0.029
Token She
Token ID1375
Feature activation-5.66e-03

Pos logit contributions
 myself+0.012
iece+0.012
Ta+0.011
Now+0.011
CR+0.011
Neg logit contributions
ucker-0.009
 fault-0.009
 tight-0.008
 basket-0.008
 understands-0.008
Token even
Token ID772
Feature activation-4.00e-02

Pos logit contributions
iece+0.010
 myself+0.010
aming+0.009
Ta+0.009
CR+0.009
Neg logit contributions
 tight-0.006
ucker-0.006
 understands-0.006
 fault-0.006
 stern-0.006
Token shouted
Token ID17293
Feature activation-3.55e-02

Pos logit contributions
 myself+0.074
iece+0.073
aming+0.067
Ta+0.066
mmmm+0.066
Neg logit contributions
 tight-0.041
 understands-0.040
ucker-0.039
 stern-0.038
run-0.037
Token The
Token ID383
Feature activation+9.93e-03

Pos logit contributions
 myself+0.026
iece+0.026
Ta+0.024
aming+0.023
CR+0.023
Neg logit contributions
ucker-0.016
 fault-0.015
 tight-0.015
 understands-0.015
 stern-0.014
Token giant
Token ID6175
Feature activation-1.87e-02

Pos logit contributions
 tight+0.014
 understands+0.013
 stern+0.012
 lot+0.012
 "+0.012
Neg logit contributions
iece-0.016
 myself-0.015
CR-0.014
aming-0.014
Ta-0.014
Token had
Token ID550
Feature activation-3.69e-03

Pos logit contributions
iece+0.039
 myself+0.039
Ta+0.036
ands+0.036
CR+0.035
Neg logit contributions
 tight-0.017
 understands-0.015
 would-0.015
 "-0.014
 stern-0.014
Token filled
Token ID5901
Feature activation-4.50e-02

Pos logit contributions
iece+0.006
 myself+0.006
aming+0.006
Ta+0.005
CR+0.005
Neg logit contributions
 tight-0.004
 understands-0.004
 fault-0.004
 "-0.004
 stern-0.004
Token the
Token ID262
Feature activation-5.10e-02

Pos logit contributions
iece+0.089
Ta+0.084
CR+0.083
 myself+0.079
ands+0.079
Neg logit contributions
 tight-0.047
 stern-0.039
 "-0.038
 basket-0.036
 -0.036
Token entire
Token ID2104
Feature activation-1.25e-01

Pos logit contributions
iece+0.091
 myself+0.088
ands+0.084
CR+0.084
bees+0.082
Neg logit contributions
 tight-0.061
 understands-0.055
 basket-0.054
 lot-0.052
 hug-0.052
Token castle
Token ID16669
Feature activation-5.81e-02

Pos logit contributions
iece+0.228
 myself+0.215
Ta+0.210
CR+0.197
aming+0.196
Neg logit contributions
 tight-0.145
 understands-0.132
 basket-0.127
 lot-0.126
 "-0.121
Token with
Token ID351
Feature activation-3.14e-03

Pos logit contributions
Ta+0.107
iece+0.104
 myself+0.102
CR+0.098
mmmm+0.097
Neg logit contributions
 tight-0.063
 "-0.053
 basket-0.052
 stern-0.052
 understands-0.052
Token colorful
Token ID20239
Feature activation-4.63e-02

Pos logit contributions
iece+0.005
 myself+0.005
aming+0.005
Ta+0.005
CR+0.005
Neg logit contributions
 tight-0.004
 understands-0.004
 fault-0.003
ucker-0.003
 stern-0.003
Token toys
Token ID14958
Feature activation-1.60e-02

Pos logit contributions
iece+0.077
 myself+0.076
Ta+0.073
aming+0.070
CR+0.069
Neg logit contributions
 tight-0.058
 lot-0.056
 basket-0.055
 hug-0.055
 smile-0.055
Token and
Token ID290
Feature activation-3.93e-02

Pos logit contributions
iece+0.028
 myself+0.027
aming+0.026
Ta+0.025
CR+0.024
Neg logit contributions
 tight-0.017
ucker-0.017
 understands-0.017
 fault-0.017
 stern-0.016
Token's
Token ID338
Feature activation-1.91e-02

Pos logit contributions
 tight+0.004
ucker+0.004
 fault+0.004
 understands+0.004
 stern+0.004
Neg logit contributions
iece-0.005
 myself-0.005
aming-0.005
Ta-0.005
CR-0.005
Token house
Token ID2156
Feature activation-1.39e-02

Pos logit contributions
iece+0.023
 myself+0.021
 nights+0.021
 endings+0.021
 walls+0.021
Neg logit contributions
ucker-0.031
run-0.028
 understands-0.028
ip-0.028
oun-0.027
Token was
Token ID373
Feature activation-1.83e-02

Pos logit contributions
iece+0.019
 myself+0.017
aming+0.017
CR+0.017
th+0.016
Neg logit contributions
ucker-0.021
arl-0.020
 fault-0.020
oun-0.019
run-0.019
Token built
Token ID3170
Feature activation-9.92e-02

Pos logit contributions
iece+0.028
aming+0.028
 myself+0.028
 seek+0.026
CR+0.025
Neg logit contributions
ucker-0.023
 understands-0.023
 fault-0.022
run-0.021
ip-0.021
Token long
Token ID890
Feature activation-7.08e-02

Pos logit contributions
Ta+0.206
iece+0.203
 myself+0.201
mmmm+0.191
aming+0.187
Neg logit contributions
 tight-0.083
 "-0.067
 fault-0.065
 stern-0.064
 basket-0.062
Token ago
Token ID2084
Feature activation-1.24e-01

Pos logit contributions
iece+0.114
 myself+0.112
Ta+0.110
aming+0.105
mmmm+0.101
Neg logit contributions
 tight-0.089
 fault-0.083
 understands-0.080
ucker-0.080
 stern-0.079
Token.
Token ID13
Feature activation-9.82e-04

Pos logit contributions
iece+0.252
 myself+0.248
Ta+0.240
aming+0.235
CR+0.226
Neg logit contributions
 tight-0.099
 fault-0.092
ucker-0.091
 understands-0.089
 stern-0.086
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
 myself+0.001
iece+0.001
Ta+0.001
CR+0.001
Now+0.001
Neg logit contributions
 fault-0.001
 vain-0.001
ucker-0.001
 understands-0.001
run-0.001
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
 myself+0.026
iece+0.026
aming+0.025
Ta+0.025
Now+0.024
Neg logit contributions
 fault-0.019
ucker-0.019
 tight-0.019
 understands-0.018
ishly-0.018
TokenL
Token ID43
Feature activation-1.01e-02

Pos logit contributions
iece+0.015
 myself+0.014
 endings+0.012
Ta+0.012
aming+0.012
Neg logit contributions
 tight-0.023
 lot-0.021
 "-0.021
ucker-0.021
 stern-0.021
Tokenily
Token ID813
Feature activation-2.96e-02

Pos logit contributions
iece+0.011
oy+0.010
 myself+0.010
Ta+0.010
 seek+0.010
Neg logit contributions
 understands-0.017
 tight-0.017
ucker-0.017
 fault-0.017
run-0.016
Token it
Token ID340
Feature activation+1.11e-02

Pos logit contributions
ucker+0.004
 tight+0.004
 fault+0.004
 understands+0.004
arl+0.004
Neg logit contributions
iece-0.005
 myself-0.005
aming-0.005
Ta-0.004
 nights-0.004
Token seemed
Token ID3947
Feature activation-3.66e-04

Pos logit contributions
 tight+0.012
ucker+0.012
 fault+0.011
 understands+0.011
 stern+0.011
Neg logit contributions
iece-0.019
 myself-0.019
aming-0.018
Ta-0.018
CR-0.017
Token to
Token ID284
Feature activation-1.43e-02

Pos logit contributions
 myself+0.001
iece+0.001
aming+0.001
 nights+0.000
 seek+0.000
Neg logit contributions
ucker-0.000
 understands-0.000
arl-0.000
uffy-0.000
 fault-0.000
Token fill
Token ID6070
Feature activation-3.06e-02

Pos logit contributions
iece+0.023
 myself+0.022
Ta+0.021
aming+0.021
CR+0.020
Neg logit contributions
 tight-0.018
ucker-0.017
 fault-0.017
 understands-0.017
 stern-0.016
Token the
Token ID262
Feature activation-5.33e-02

Pos logit contributions
iece+0.056
 myself+0.055
Ta+0.053
aming+0.052
CR+0.050
Neg logit contributions
 tight-0.032
ucker-0.029
 fault-0.029
 understands-0.028
 stern-0.028
Token entire
Token ID2104
Feature activation-1.24e-01

Pos logit contributions
iece+0.079
 myself+0.077
Ta+0.072
aming+0.070
mmmm+0.070
Neg logit contributions
 tight-0.074
 fault-0.069
 understands-0.068
ucker-0.067
 stern-0.066
Token room
Token ID2119
Feature activation-2.57e-02

Pos logit contributions
iece+0.184
 myself+0.180
aming+0.167
Ta+0.164
CR+0.157
Neg logit contributions
 tight-0.166
ucker-0.164
 understands-0.161
 fault-0.159
 stern-0.155
Token.
Token ID13
Feature activation-3.39e-03

Pos logit contributions
iece+0.047
 myself+0.046
aming+0.043
Ta+0.043
CR+0.041
Neg logit contributions
 tight-0.025
ucker-0.025
 understands-0.024
 fault-0.024
 stern-0.023
Token They
Token ID1119
Feature activation-4.71e-03

Pos logit contributions
iece+0.006
 myself+0.006
aming+0.006
Ta+0.006
CR+0.006
Neg logit contributions
 tight-0.003
ucker-0.003
 understands-0.003
 fault-0.003
 stern-0.003
Token were
Token ID547
Feature activation-8.36e-03

Pos logit contributions
iece+0.008
 myself+0.007
aming+0.007
Ta+0.007
 behold+0.007
Neg logit contributions
ucker-0.006
 fault-0.006
arl-0.006
 understands-0.005
 tight-0.005
Token having
Token ID1719
Feature activation-4.94e-02

Pos logit contributions
iece+0.012
 myself+0.012
aming+0.012
Ta+0.010
 nights+0.010
Neg logit contributions
ucker-0.012
arl-0.011
 understands-0.011
 fault-0.011
 tight-0.011
Token,
Token ID11
Feature activation-2.05e-02

Pos logit contributions
 tight+0.023
ucker+0.023
 understands+0.023
 fault+0.022
 stern+0.021
Neg logit contributions
iece-0.042
 myself-0.041
aming-0.038
Ta-0.038
CR-0.037
Token until
Token ID1566
Feature activation-3.39e-02

Pos logit contributions
iece+0.033
 myself+0.032
Ta+0.031
aming+0.030
CR+0.029
Neg logit contributions
 tight-0.026
 understands-0.024
 "-0.024
 stern-0.024
 fault-0.024
Token it
Token ID340
Feature activation+3.36e-02

Pos logit contributions
 myself+0.060
iece+0.060
Ta+0.057
aming+0.056
ands+0.053
Neg logit contributions
 tight-0.038
 "-0.034
 stern-0.033
 understands-0.033
 lot-0.033
Token covered
Token ID5017
Feature activation-6.62e-02

Pos logit contributions
 tight+0.032
 understands+0.030
 would+0.030
 will+0.029
 won+0.029
Neg logit contributions
 myself-0.066
iece-0.064
Ta-0.060
mmmm-0.059
ands-0.058
Token the
Token ID262
Feature activation-4.90e-02

Pos logit contributions
Ta+0.114
iece+0.114
 myself+0.111
aming+0.104
CR+0.103
Neg logit contributions
 tight-0.077
 "-0.066
 fault-0.064
 stern-0.064
 understands-0.063
Token entire
Token ID2104
Feature activation-1.21e-01

Pos logit contributions
iece+0.074
 myself+0.074
Ta+0.069
oy+0.065
CR+0.065
Neg logit contributions
 tight-0.068
 understands-0.063
 fault-0.063
 lot-0.061
 basket-0.061
Token sky
Token ID6766
Feature activation-6.45e-02

Pos logit contributions
iece+0.190
 myself+0.185
Ta+0.174
aming+0.169
CR+0.163
Neg logit contributions
 tight-0.156
 understands-0.148
ucker-0.147
 fault-0.147
 stern-0.142
Token.
Token ID13
Feature activation-2.18e-03

Pos logit contributions
iece+0.124
 myself+0.122
Ta+0.122
aming+0.115
CR+0.112
Neg logit contributions
 tight-0.061
ucker-0.053
 fault-0.052
 understands-0.052
 stern-0.051
Token
Token ID198
Feature activation-1.52e-02

Pos logit contributions
iece+0.004
 myself+0.004
Ta+0.003
aming+0.003
CR+0.003
Neg logit contributions
ucker-0.002
 fault-0.002
 tight-0.002
 understands-0.002
 stern-0.002
Token
Token ID198
Feature activation-1.36e-02

Pos logit contributions
iece+0.028
 myself+0.027
aming+0.026
Ta+0.026
CR+0.025
Neg logit contributions
 tight-0.015
ucker-0.015
 fault-0.014
 understands-0.014
 stern-0.014
TokenThe
Token ID464
Feature activation+2.45e-02

Pos logit contributions
iece+0.018
 myself+0.017
 endings+0.015
aming+0.015
Ta+0.015
Neg logit contributions
 tight-0.022
 understands-0.021
 "-0.020
 stern-0.020
 lot-0.020
TokenThese
Token ID4711
Feature activation+2.06e-02

Pos logit contributions
 tight+0.031
 will+0.030
 would+0.029
 understands+0.029
 +0.029
Neg logit contributions
iece-0.043
 myself-0.039
uring-0.038
 endings-0.038
oy-0.037
Token toys
Token ID14958
Feature activation+1.25e-02

Pos logit contributions
 tight+0.021
 understands+0.020
 fault+0.019
ucker+0.019
 stern+0.018
Neg logit contributions
iece-0.038
 myself-0.037
aming-0.036
Ta-0.035
CR-0.034
Token will
Token ID481
Feature activation-8.90e-03

Pos logit contributions
ucker+0.012
 tight+0.012
 understands+0.012
 fault+0.012
 stern+0.011
Neg logit contributions
 myself-0.023
iece-0.022
aming-0.021
Ta-0.020
CR-0.020
Token make
Token ID787
Feature activation-3.52e-02

Pos logit contributions
iece+0.017
Ta+0.015
 myself+0.015
aming+0.015
CR+0.014
Neg logit contributions
 tight-0.010
 will-0.009
 hug-0.009
 lot-0.009
ucker-0.008
Token the
Token ID262
Feature activation-3.91e-02

Pos logit contributions
iece+0.053
Ta+0.049
aming+0.046
Now+0.044
CR+0.043
Neg logit contributions
 tight-0.055
 lot-0.049
 will-0.048
 "-0.047
 fault-0.046
Token children
Token ID1751
Feature activation-1.20e-01

Pos logit contributions
iece+0.069
 myself+0.067
Ta+0.064
aming+0.064
CR+0.061
Neg logit contributions
 tight-0.045
 understands-0.041
 fault-0.041
 basket-0.040
 lot-0.039
Token happy
Token ID3772
Feature activation-4.37e-02

Pos logit contributions
iece+0.232
 myself+0.225
Ta+0.216
aming+0.214
CR+0.209
Neg logit contributions
 tight-0.112
ucker-0.103
 understands-0.101
 fault-0.100
 stern-0.098
Token!"
Token ID2474
Feature activation-1.63e-02

Pos logit contributions
iece+0.084
Ta+0.083
aming+0.079
 myself+0.075
CR+0.074
Neg logit contributions
 tight-0.043
 fault-0.042
 will-0.040
 "-0.040
 understands-0.039
Token The
Token ID383
Feature activation+2.07e-02

Pos logit contributions
iece+0.032
 myself+0.032
aming+0.030
Ta+0.029
CR+0.029
Neg logit contributions
 tight-0.014
ucker-0.014
 fault-0.014
 understands-0.014
 stern-0.013
Token man
Token ID582
Feature activation+2.34e-03

Pos logit contributions
ucker+0.032
 fault+0.031
run+0.031
ometer+0.027
uffy+0.027
Neg logit contributions
iece-0.030
 myself-0.029
aming-0.028
 pirates-0.027
Ta-0.026
Token was
Token ID373
Feature activation+5.01e-03

Pos logit contributions
 tight+0.002
 understands+0.002
ucker+0.002
 stern+0.002
 fault+0.002
Neg logit contributions
iece-0.004
 myself-0.004
aming-0.004
Ta-0.004
CR-0.004
Token where
Token ID810
Feature activation+7.19e-02

Pos logit contributions
iece+0.032
 myself+0.032
aming+0.029
Ta+0.029
CR+0.028
Neg logit contributions
 tight-0.020
ucker-0.020
 understands-0.019
 fault-0.019
 stern-0.019
Token they
Token ID484
Feature activation-3.50e-02

Pos logit contributions
 tight+0.096
 understands+0.093
 fault+0.091
ucker+0.091
 "+0.089
Neg logit contributions
iece-0.109
 myself-0.104
aming-0.100
Ta-0.099
mmmm-0.092
Token will
Token ID481
Feature activation-4.17e-03

Pos logit contributions
iece+0.071
 myself+0.069
aming+0.066
Ta+0.066
CR+0.063
Neg logit contributions
 tight-0.029
 understands-0.027
ucker-0.027
 fault-0.026
 stern-0.025
Token make
Token ID787
Feature activation-2.65e-02

Pos logit contributions
iece+0.007
 myself+0.007
Ta+0.007
aming+0.007
CR+0.006
Neg logit contributions
 tight-0.005
 fault-0.004
 understands-0.004
ucker-0.004
 "-0.004
Token many
Token ID867
Feature activation-5.89e-02

Pos logit contributions
iece+0.046
Ta+0.044
aming+0.042
 myself+0.041
CR+0.040
Neg logit contributions
 tight-0.032
 fault-0.029
 understands-0.029
 "-0.029
 lot-0.029
Token children
Token ID1751
Feature activation-1.19e-01

Pos logit contributions
iece+0.095
 myself+0.089
Ta+0.087
aming+0.084
CR+0.081
Neg logit contributions
 tight-0.078
 understands-0.074
 lot-0.073
ucker-0.071
 fault-0.071
Token happy
Token ID3772
Feature activation-4.15e-02

Pos logit contributions
iece+0.216
 myself+0.209
Ta+0.203
aming+0.199
CR+0.192
Neg logit contributions
 tight-0.124
ucker-0.116
 understands-0.116
 fault-0.115
 stern-0.111
Token.
Token ID13
Feature activation+5.86e-03

Pos logit contributions
Ta+0.090
iece+0.088
aming+0.083
 myself+0.082
mmmm+0.080
Neg logit contributions
 tight-0.030
 fault-0.030
 "-0.030
 will-0.029
 understands-0.028
Token I
Token ID314
Feature activation-1.19e-03

Pos logit contributions
 tight+0.005
ucker+0.005
 understands+0.005
 fault+0.005
 stern+0.005
Neg logit contributions
iece-0.012
 myself-0.011
aming-0.011
Ta-0.011
CR-0.010
Token hope
Token ID2911
Feature activation+6.06e-02

Pos logit contributions
iece+0.002
 myself+0.002
aming+0.002
Ta+0.002
CR+0.002
Neg logit contributions
 tight-0.001
ucker-0.001
 understands-0.001
 fault-0.001
 stern-0.001
Token you
Token ID345
Feature activation-1.83e-02

Pos logit contributions
 tight+0.087
 understands+0.082
 fault+0.082
 "+0.081
 stern+0.080
Neg logit contributions
iece-0.090
 myself-0.082
aming-0.082
Ta-0.081
CR-0.075
Token something
Token ID1223
Feature activation-7.64e-03

Pos logit contributions
iece+0.055
 myself+0.054
aming+0.050
Ta+0.049
CR+0.048
Neg logit contributions
 tight-0.045
 understands-0.042
 fault-0.042
 stern-0.041
run-0.040
Token that
Token ID326
Feature activation-2.64e-02

Pos logit contributions
 myself+0.011
iece+0.011
aming+0.010
 seek+0.010
Ta+0.010
Neg logit contributions
ucker-0.010
arl-0.010
 tight-0.010
 understands-0.010
 fault-0.010
Token would
Token ID561
Feature activation-1.06e-02

Pos logit contributions
iece+0.048
 myself+0.048
aming+0.044
Ta+0.044
CR+0.042
Neg logit contributions
ucker-0.026
 tight-0.026
 fault-0.025
 understands-0.025
arl-0.024
Token make
Token ID787
Feature activation-3.48e-02

Pos logit contributions
iece+0.020
 myself+0.018
aming+0.018
Ta+0.018
CR+0.017
Neg logit contributions
 tight-0.012
 fault-0.011
 hug-0.011
 understands-0.011
 stern-0.011
Token M
Token ID337
Feature activation-4.03e-02

Pos logit contributions
iece+0.061
Ta+0.059
 myself+0.058
aming+0.058
CR+0.055
Neg logit contributions
 tight-0.039
 fault-0.036
ucker-0.035
 understands-0.035
 stern-0.034
Tokenandy
Token ID10757
Feature activation-1.19e-01

Pos logit contributions
iece+0.064
Ta+0.063
CR+0.059
glass+0.058
bridge+0.057
Neg logit contributions
ucker-0.056
 fault-0.052
uffy-0.050
run-0.049
arl-0.047
Token feel
Token ID1254
Feature activation-4.51e-02

Pos logit contributions
iece+0.232
 myself+0.228
Ta+0.218
aming+0.217
CR+0.211
Neg logit contributions
 tight-0.107
ucker-0.100
 understands-0.099
 fault-0.097
 stern-0.096
Token better
Token ID1365
Feature activation+5.92e-04

Pos logit contributions
iece+0.088
 myself+0.087
aming+0.082
Ta+0.079
CR+0.078
Neg logit contributions
ucker-0.040
 tight-0.038
 understands-0.037
 fault-0.037
run-0.036
Token.
Token ID13
Feature activation-1.72e-03

Pos logit contributions
ucker+0.001
 tight+0.001
 understands+0.001
 fault+0.001
run+0.000
Neg logit contributions
iece-0.001
 myself-0.001
aming-0.001
Ta-0.001
CR-0.001
Token
Token ID220
Feature activation+6.92e-02

Pos logit contributions
 myself+0.003
iece+0.003
aming+0.003
Ta+0.003
CR+0.002
Neg logit contributions
 tight-0.002
 fault-0.002
ucker-0.002
 understands-0.002
 stern-0.002
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
 tight+0.070
ucker+0.067
 understands+0.066
 fault+0.066
 stern+0.065
Neg logit contributions
iece-0.124
 myself-0.124
aming-0.115
Ta-0.113
CR-0.111
Token bit
Token ID1643
Feature activation+4.03e-02

Pos logit contributions
iece+0.059
 myself+0.058
aming+0.054
Ta+0.054
CR+0.052
Neg logit contributions
 tight-0.029
ucker-0.028
 understands-0.027
 fault-0.027
 stern-0.026
Token of
Token ID286
Feature activation+1.08e-02

Pos logit contributions
 tight+0.053
ucker+0.052
 understands+0.051
 fault+0.051
 stern+0.049
Neg logit contributions
iece-0.062
 myself-0.060
aming-0.056
Ta-0.055
CR-0.053
Token water
Token ID1660
Feature activation-1.42e-02

Pos logit contributions
ucker+0.015
 fault+0.015
 understands+0.015
ometer+0.015
 tight+0.014
Neg logit contributions
 myself-0.015
iece-0.014
aming-0.014
 nights-0.013
Ta-0.013
Token to
Token ID284
Feature activation-3.78e-02

Pos logit contributions
iece+0.023
 myself+0.023
aming+0.021
CR+0.020
 nights+0.019
Neg logit contributions
 understands-0.017
ucker-0.017
 fault-0.016
 tight-0.016
run-0.015
Token make
Token ID787
Feature activation-2.87e-02

Pos logit contributions
iece+0.068
 myself+0.064
Ta+0.062
aming+0.061
CR+0.060
Neg logit contributions
 tight-0.042
ucker-0.038
 stern-0.037
 lot-0.037
 understands-0.037
Token o
Token ID267
Feature activation-1.18e-01

Pos logit contributions
iece+0.053
Ta+0.051
 myself+0.049
aming+0.047
CR+0.047
Neg logit contributions
 tight-0.033
 lot-0.028
ucker-0.028
 "-0.028
 stern-0.028
Tokenat
Token ID265
Feature activation-5.87e-02

Pos logit contributions
iece+0.136
 myself+0.131
Ta+0.129
aming+0.116
 endings+0.115
Neg logit contributions
ucker-0.199
 tight-0.195
run-0.186
ip-0.186
 understands-0.183
Token soup
Token ID17141
Feature activation-6.35e-02

Pos logit contributions
iece+0.101
Ta+0.097
 myself+0.097
aming+0.092
CR+0.091
Neg logit contributions
 tight-0.068
ucker-0.064
 fault-0.061
 stern-0.060
 understands-0.060
Token.
Token ID13
Feature activation+1.50e-03

Pos logit contributions
iece+0.133
Ta+0.131
 myself+0.127
aming+0.123
CR+0.122
Neg logit contributions
 tight-0.052
ucker-0.046
 fault-0.045
 "-0.043
 will-0.043
Token She
Token ID1375
Feature activation-9.96e-03

Pos logit contributions
ucker+0.001
 tight+0.001
 fault+0.001
 understands+0.001
 stern+0.001
Neg logit contributions
iece-0.003
 myself-0.003
aming-0.003
Ta-0.003
CR-0.003
Token wanted
Token ID2227
Feature activation-3.53e-03

Pos logit contributions
iece+0.017
 myself+0.017
Ta+0.016
aming+0.015
 endings+0.015
Neg logit contributions
 tight-0.012
ucker-0.011
 understands-0.011
 fault-0.010
 stern-0.010
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
iece+0.005
Ta+0.005
 myself+0.005
aming+0.005
 endings+0.005
Neg logit contributions
 tight-0.003
 understands-0.003
 stern-0.003
 fault-0.003
 will-0.003
Token boy
Token ID2933
Feature activation-3.74e-02

Pos logit contributions
iece+0.021
 myself+0.020
aming+0.019
Ta+0.019
ands+0.018
Neg logit contributions
 tight-0.011
 understands-0.011
 "-0.010
 will-0.010
 stern-0.010
Token!
Token ID0
Feature activation+4.99e-03

Pos logit contributions
iece+0.068
 myself+0.065
Ta+0.064
aming+0.062
 endings+0.059
Neg logit contributions
 tight-0.041
 understands-0.037
ucker-0.037
 stern-0.036
 fault-0.036
Token Leave
Token ID17446
Feature activation+9.51e-03

Pos logit contributions
 tight+0.006
ucker+0.006
 understands+0.005
 fault+0.005
run+0.005
Neg logit contributions
iece-0.008
 myself-0.008
aming-0.008
Ta-0.008
CR-0.007
Token those
Token ID883
Feature activation-5.36e-02

Pos logit contributions
ucker+0.015
 fault+0.015
 tight+0.015
 understands+0.015
run+0.014
Neg logit contributions
 myself-0.012
iece-0.012
aming-0.010
Ta-0.010
CR-0.010
Token kids
Token ID3988
Feature activation-1.18e-01

Pos logit contributions
iece+0.078
 myself+0.070
Ta+0.069
mmmm+0.067
mmm+0.067
Neg logit contributions
 tight-0.078
 understands-0.075
run-0.074
 stern-0.071
 fault-0.071
Token alone
Token ID3436
Feature activation-4.70e-02

Pos logit contributions
iece+0.199
Ta+0.187
 myself+0.186
aming+0.178
 endings+0.176
Neg logit contributions
 tight-0.145
ucker-0.130
 fault-0.128
 understands-0.128
 stern-0.127
Token!"
Token ID2474
Feature activation-3.57e-02

Pos logit contributions
Ta+0.091
iece+0.090
mmmm+0.080
aming+0.078
 behold+0.077
Neg logit contributions
 tight-0.051
 will-0.044
 fault-0.042
 understands-0.041
 "-0.041
Token\n
Token ID198
Feature activation-7.87e-03

Pos logit contributions
iece+0.065
 myself+0.061
aming+0.060
Ta+0.057
 endings+0.054
Neg logit contributions
 tight-0.042
 stern-0.039
 understands-0.039
 "-0.039
ucker-0.038
Token\n
Token ID198
Feature activation-1.01e-02

Pos logit contributions
 myself+0.013
iece+0.013
aming+0.012
CR+0.012
Ta+0.012
Neg logit contributions
 fault-0.009
ucker-0.009
 tight-0.009
 understands-0.008
ishly-0.008
TokenThe
Token ID464
Feature activation+3.73e-02

Pos logit contributions
iece+0.010
 myself+0.010
aming+0.008
Ta+0.008
CR+0.008
Neg logit contributions
 fault-0.019
ucker-0.019
 tight-0.019
 understands-0.018
run-0.018
Token For
Token ID1114
Feature activation-8.98e-03

Pos logit contributions
 myself+0.000
iece+0.000
mmmm+0.000
Now+0.000
mmm+0.000
Neg logit contributions
ucker-0.000
 stern-0.000
 tight-0.000
 fault-0.000
arl-0.000
Token breakfast
Token ID12607
Feature activation-1.70e-02

Pos logit contributions
iece+0.015
Ta+0.013
 myself+0.013
CR+0.013
aming+0.013
Neg logit contributions
 tight-0.012
 lot-0.011
 "-0.011
 understands-0.010
 -0.010
Token the
Token ID262
Feature activation-1.54e-02

Pos logit contributions
iece+0.032
 myself+0.032
aming+0.030
Ta+0.029
CR+0.029
Neg logit contributions
 tight-0.016
ucker-0.016
 understands-0.015
 fault-0.015
 stern-0.014
Token board
Token ID3096
Feature activation-6.28e-03

Pos logit contributions
iece+0.028
 myself+0.026
Ta+0.025
aming+0.025
CR+0.025
Neg logit contributions
 tight-0.019
 lot-0.018
 basket-0.016
 cart-0.016
 sho-0.016
Token made
Token ID925
Feature activation-5.40e-02

Pos logit contributions
iece+0.012
 myself+0.012
aming+0.011
Ta+0.011
CR+0.011
Neg logit contributions
 tight-0.006
ucker-0.006
 understands-0.005
 fault-0.005
 stern-0.005
Token o
Token ID267
Feature activation-1.18e-01

Pos logit contributions
iece+0.112
Ta+0.108
 myself+0.104
aming+0.102
CR+0.101
Neg logit contributions
 tight-0.050
 "-0.044
 lot-0.044
 understands-0.041
ucker-0.041
Tokenatmeal
Token ID45280
Feature activation-3.24e-02

Pos logit contributions
iece+0.133
 myself+0.128
Ta+0.123
aming+0.113
 endings+0.112
Neg logit contributions
ucker-0.198
 tight-0.192
 understands-0.185
run-0.185
ip-0.183
Token with
Token ID351
Feature activation-1.03e-02

Pos logit contributions
iece+0.059
 myself+0.058
Ta+0.057
aming+0.055
CR+0.054
Neg logit contributions
 tight-0.034
ucker-0.031
 understands-0.030
 fault-0.030
 stern-0.029
Token fresh
Token ID4713
Feature activation-1.48e-02

Pos logit contributions
iece+0.017
 myself+0.017
aming+0.015
Ta+0.015
CR+0.015
Neg logit contributions
 tight-0.013
ucker-0.012
 understands-0.012
 fault-0.012
 stern-0.011
Token strawberries
Token ID36973
Feature activation-3.58e-02

Pos logit contributions
iece+0.022
 myself+0.021
Ta+0.020
aming+0.019
CR+0.019
Neg logit contributions
 tight-0.021
ucker-0.020
 understands-0.020
 fault-0.019
 "-0.019
Token.
Token ID13
Feature activation-6.71e-04

Pos logit contributions
iece+0.069
 myself+0.068
aming+0.064
Ta+0.063
CR+0.061
Neg logit contributions
 tight-0.032
ucker-0.032
 understands-0.031
 fault-0.031
 stern-0.029
Token but
Token ID475
Feature activation-4.31e-02

Pos logit contributions
iece+0.022
 myself+0.022
aming+0.020
Ta+0.019
CR+0.019
Neg logit contributions
 tight-0.021
ucker-0.021
 understands-0.020
 fault-0.020
 stern-0.019
Token they
Token ID484
Feature activation+1.29e-02

Pos logit contributions
 myself+0.069
iece+0.067
aming+0.061
 seek+0.061
 behold+0.059
Neg logit contributions
ucker-0.053
 fault-0.051
 understands-0.050
arl-0.049
 tight-0.048
Token did
Token ID750
Feature activation+9.83e-03

Pos logit contributions
ucker+0.017
arl+0.017
 fault+0.017
run+0.016
 tight+0.016
Neg logit contributions
 myself-0.018
iece-0.018
aming-0.017
 behold-0.017
 seek-0.017
Token not
Token ID407
Feature activation+4.28e-02

Pos logit contributions
ucker+0.014
arl+0.014
run+0.013
 understands+0.013
ometer+0.013
Neg logit contributions
 myself-0.015
iece-0.013
 later-0.013
CR-0.012
 behold-0.012
Token know
Token ID760
Feature activation-1.09e-02

Pos logit contributions
 tight+0.053
ucker+0.053
 understands+0.052
 fault+0.052
 stern+0.050
Neg logit contributions
iece-0.068
 myself-0.067
aming-0.062
Ta-0.061
CR-0.059
Token where
Token ID810
Feature activation+1.13e-01

Pos logit contributions
 myself+0.017
iece+0.016
 later+0.015
 nights+0.015
th+0.014
Neg logit contributions
ucker-0.015
arl-0.014
 understands-0.013
ometer-0.013
oun-0.013
Token it
Token ID340
Feature activation+9.48e-03

Pos logit contributions
ucker+0.148
arl+0.134
 understands+0.132
 tight+0.132
 fault+0.131
Neg logit contributions
iece-0.175
 myself-0.174
 endings-0.158
 nights-0.157
CR-0.156
Token was
Token ID373
Feature activation-2.09e-02

Pos logit contributions
ucker+0.011
 tight+0.011
 fault+0.011
 understands+0.010
 stern+0.010
Neg logit contributions
iece-0.016
 myself-0.015
aming-0.014
Ta-0.014
CR-0.014
Token.
Token ID13
Feature activation+5.32e-03

Pos logit contributions
iece+0.042
 myself+0.041
Ta+0.039
aming+0.039
CR+0.037
Neg logit contributions
 tight-0.018
 fault-0.017
ucker-0.017
 understands-0.016
 stern-0.016
Token They
Token ID1119
Feature activation+1.64e-03

Pos logit contributions
 tight+0.005
ucker+0.005
 understands+0.005
 fault+0.005
run+0.005
Neg logit contributions
iece-0.010
 myself-0.009
Ta-0.009
aming-0.009
CR-0.009
Token said
Token ID531
Feature activation-2.93e-02

Pos logit contributions
 fault+0.002
ucker+0.002
arl+0.002
 tight+0.002
 understands+0.002
Neg logit contributions
iece-0.002
 myself-0.002
aming-0.002
Ta-0.002
 behold-0.002
Token she
Token ID673
Feature activation-1.74e-02

Pos logit contributions
iece+0.039
 myself+0.039
aming+0.036
Ta+0.035
CR+0.033
Neg logit contributions
 tight-0.031
ucker-0.031
 understands-0.030
 fault-0.030
 stern-0.029
Token had
Token ID550
Feature activation-5.71e-03

Pos logit contributions
iece+0.031
 myself+0.031
aming+0.029
Ta+0.028
CR+0.027
Neg logit contributions
ucker-0.018
 tight-0.017
 fault-0.017
 understands-0.017
arl-0.016
Token shown
Token ID3402
Feature activation-5.19e-03

Pos logit contributions
iece+0.009
 myself+0.009
aming+0.008
Ta+0.008
CR+0.008
Neg logit contributions
 tight-0.007
ucker-0.007
 understands-0.007
 fault-0.007
 stern-0.006
Token her
Token ID607
Feature activation+2.19e-02

Pos logit contributions
 myself+0.008
iece+0.007
aming+0.007
 endings+0.006
th+0.006
Neg logit contributions
ucker-0.007
 understands-0.007
uffy-0.007
ometer-0.007
arl-0.007
Token mother
Token ID2802
Feature activation-1.41e-02

Pos logit contributions
arl+0.034
ucker+0.034
oun+0.030
ounces+0.030
run+0.030
Neg logit contributions
 myself-0.029
aming-0.026
 nights-0.026
iece-0.024
 later-0.024
Token how
Token ID703
Feature activation+1.06e-01

Pos logit contributions
 myself+0.019
iece+0.019
aming+0.018
 behold+0.017
 later+0.016
Neg logit contributions
arl-0.024
ucker-0.022
 fault-0.020
oun-0.020
ometer-0.020
Token much
Token ID881
Feature activation+5.06e-02

Pos logit contributions
 fault+0.112
ucker+0.111
burn+0.110
ishly+0.109
arl+0.109
Neg logit contributions
 myself-0.194
aming-0.171
iece-0.164
CR-0.162
th-0.156
Token she
Token ID673
Feature activation-4.64e-02

Pos logit contributions
 tight+0.069
ucker+0.069
 understands+0.067
 fault+0.067
run+0.065
Neg logit contributions
 myself-0.072
iece-0.071
aming-0.065
Ta-0.064
CR-0.062
Token loved
Token ID6151
Feature activation+2.32e-02

Pos logit contributions
iece+0.086
 myself+0.084
aming+0.079
Ta+0.079
CR+0.076
Neg logit contributions
 tight-0.046
ucker-0.044
 understands-0.044
 fault-0.043
 stern-0.041
Token her
Token ID607
Feature activation+1.28e-02

Pos logit contributions
ucker+0.030
 understands+0.029
arl+0.029
 fault+0.028
 tight+0.027
Neg logit contributions
 myself-0.036
iece-0.034
aming-0.032
 nights-0.031
 endings-0.031
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
ucker+0.014
 understands+0.014
arl+0.013
 tight+0.013
run+0.013
Neg logit contributions
 myself-0.022
iece-0.022
aming-0.020
 nights-0.019
 seek-0.019
Token him
Token ID683
Feature activation-1.70e-02

Pos logit contributions
 tight+0.001
 understands+0.001
ucker+0.001
 fault+0.001
 stern+0.001
Neg logit contributions
iece-0.002
 myself-0.002
aming-0.002
Ta-0.002
CR-0.002
Token inside
Token ID2641
Feature activation+4.45e-03

Pos logit contributions
iece+0.036
 myself+0.036
Ta+0.034
aming+0.034
mmmm+0.033
Neg logit contributions
 tight-0.013
 fault-0.011
 understands-0.011
ucker-0.011
 stern-0.011
Token and
Token ID290
Feature activation-5.62e-03

Pos logit contributions
ucker+0.005
 understands+0.005
 tight+0.005
 fault+0.005
run+0.005
Neg logit contributions
iece-0.007
 myself-0.007
aming-0.006
Ta-0.006
CR-0.006
Token showed
Token ID3751
Feature activation-3.79e-03

Pos logit contributions
iece+0.010
 myself+0.010
aming+0.009
Ta+0.009
CR+0.009
Neg logit contributions
 tight-0.006
 understands-0.006
ucker-0.006
 fault-0.006
 stern-0.006
Token him
Token ID683
Feature activation-1.52e-02

Pos logit contributions
iece+0.004
 myself+0.003
aming+0.003
Ta+0.003
th+0.003
Neg logit contributions
ucker-0.007
 tight-0.007
 fault-0.007
 understands-0.007
arl-0.007
Token how
Token ID703
Feature activation+1.02e-01

Pos logit contributions
 myself+0.025
iece+0.025
aming+0.022
Ta+0.022
CR+0.021
Neg logit contributions
ucker-0.019
 tight-0.018
 understands-0.017
 fault-0.017
arl-0.017
Token to
Token ID284
Feature activation-1.17e-02

Pos logit contributions
ucker+0.071
 tight+0.069
 understands+0.067
 fault+0.067
 stern+0.061
Neg logit contributions
 myself-0.216
iece-0.214
aming-0.202
Ta-0.199
CR-0.197
Token be
Token ID307
Feature activation-5.39e-02

Pos logit contributions
iece+0.021
 myself+0.020
aming+0.020
Ta+0.020
 endings+0.019
Neg logit contributions
 tight-0.013
ucker-0.012
 hug-0.011
 fault-0.011
 understands-0.011
Token more
Token ID517
Feature activation-2.72e-03

Pos logit contributions
iece+0.093
 myself+0.091
aming+0.085
Ta+0.085
CR+0.081
Neg logit contributions
 tight-0.061
ucker-0.059
 understands-0.058
 fault-0.057
 stern-0.056
Token careful
Token ID8161
Feature activation-2.82e-02

Pos logit contributions
iece+0.005
 myself+0.004
Ta+0.004
aming+0.004
mmmm+0.004
Neg logit contributions
 tight-0.004
ucker-0.003
 understands-0.003
 fault-0.003
 stern-0.003
Token when
Token ID618
Feature activation-1.21e-02

Pos logit contributions
Ta+0.050
iece+0.046
 myself+0.045
mmmm+0.044
aming+0.044
Neg logit contributions
 tight-0.033
 fault-0.030
 "-0.029
run-0.029
 stern-0.029
Token told
Token ID1297
Feature activation-2.01e-02

Pos logit contributions
iece+0.016
 myself+0.015
aming+0.015
Ta+0.014
oy+0.014
Neg logit contributions
 fault-0.012
 tight-0.012
ucker-0.012
arl-0.011
 understands-0.011
Token her
Token ID607
Feature activation+2.98e-02

Pos logit contributions
iece+0.039
 myself+0.037
Ta+0.037
aming+0.037
 seek+0.035
Neg logit contributions
 tight-0.020
 "-0.018
 stern-0.018
 lot-0.018
 hug-0.017
Token that
Token ID326
Feature activation+2.72e-02

Pos logit contributions
ucker+0.038
arl+0.037
run+0.034
 understands+0.034
 tight+0.033
Neg logit contributions
iece-0.045
 myself-0.045
 endings-0.042
 later-0.041
th-0.041
Token this
Token ID428
Feature activation-3.46e-03

Pos logit contributions
 tight+0.028
 "+0.028
 fault+0.027
 understands+0.026
ucker+0.026
Neg logit contributions
iece-0.050
 myself-0.048
aming-0.047
Ta-0.047
oy-0.044
Token was
Token ID373
Feature activation-5.20e-03

Pos logit contributions
iece+0.006
 myself+0.006
CR+0.005
aming+0.005
 nights+0.005
Neg logit contributions
ucker-0.004
 tight-0.004
 understands-0.004
run-0.004
arl-0.003
Token where
Token ID810
Feature activation+8.56e-02

Pos logit contributions
 myself+0.007
aming+0.007
CR+0.007
iece+0.007
 seek+0.007
Neg logit contributions
ucker-0.007
 understands-0.007
run-0.007
ishly-0.007
arl-0.007
Token she
Token ID673
Feature activation-4.11e-02

Pos logit contributions
 tight+0.101
ucker+0.099
 understands+0.097
 fault+0.096
 stern+0.093
Neg logit contributions
iece-0.141
 myself-0.139
aming-0.129
Ta-0.127
CR-0.123
Token worked
Token ID3111
Feature activation-5.08e-03

Pos logit contributions
iece+0.079
 myself+0.076
Ta+0.073
aming+0.073
mmmm+0.069
Neg logit contributions
 tight-0.039
 understands-0.037
 fault-0.036
ucker-0.036
 will-0.034
Token.
Token ID13
Feature activation+1.78e-03

Pos logit contributions
iece+0.009
 myself+0.009
aming+0.009
Ta+0.009
CR+0.008
Neg logit contributions
 tight-0.005
ucker-0.005
 understands-0.005
 fault-0.005
 stern-0.004
Token Lily
Token ID20037
Feature activation-2.02e-02

Pos logit contributions
 fault+0.002
ucker+0.002
 understands+0.002
 tight+0.002
run+0.002
Neg logit contributions
 myself-0.003
iece-0.003
CR-0.003
Ta-0.003
aming-0.003
Token was
Token ID373
Feature activation-8.62e-06

Pos logit contributions
iece+0.036
 myself+0.036
Ta+0.033
aming+0.033
CR+0.032
Neg logit contributions
 tight-0.022
 understands-0.021
ucker-0.021
 fault-0.020
 stern-0.020
Token much
Token ID881
Feature activation+5.10e-02

Pos logit contributions
 fault+0.127
ucker+0.123
arl+0.120
burn+0.117
run+0.116
Neg logit contributions
 myself-0.174
iece-0.163
aming-0.159
 behold-0.149
CR-0.149
Token it
Token ID340
Feature activation-3.56e-03

Pos logit contributions
 tight+0.062
ucker+0.059
 understands+0.059
 fault+0.058
 stern+0.056
Neg logit contributions
iece-0.085
 myself-0.082
aming-0.078
Ta-0.077
CR-0.073
Token had
Token ID550
Feature activation-1.09e-02

Pos logit contributions
iece+0.006
 myself+0.006
aming+0.006
Ta+0.006
CR+0.005
Neg logit contributions
 tight-0.004
ucker-0.004
 fault-0.004
 understands-0.004
run-0.004
Token grown
Token ID7334
Feature activation-9.48e-02

Pos logit contributions
iece+0.018
 myself+0.017
Ta+0.017
aming+0.016
mmmm+0.015
Neg logit contributions
 tight-0.013
 understands-0.013
 fault-0.013
 "-0.012
 will-0.012
Token and
Token ID290
Feature activation-3.32e-02

Pos logit contributions
iece+0.152
 myself+0.148
aming+0.138
Ta+0.137
CR+0.131
Neg logit contributions
 tight-0.118
ucker-0.115
 understands-0.113
 fault-0.112
 stern-0.109
Token how
Token ID703
Feature activation+9.69e-02

Pos logit contributions
iece+0.054
 myself+0.052
Ta+0.050
aming+0.049
CR+0.047
Neg logit contributions
 tight-0.041
 understands-0.039
ucker-0.039
 fault-0.038
 stern-0.037
Token rich
Token ID5527
Feature activation-3.92e-02

Pos logit contributions
 fault+0.110
ucker+0.107
arl+0.107
burn+0.103
ishly+0.103
Neg logit contributions
 myself-0.164
iece-0.157
aming-0.147
CR-0.143
 nights-0.140
Token it
Token ID340
Feature activation-2.12e-02

Pos logit contributions
iece+0.064
 myself+0.063
aming+0.058
Ta+0.057
CR+0.055
Neg logit contributions
 tight-0.047
ucker-0.046
 fault-0.046
 understands-0.045
run-0.044
Token looked
Token ID3114
Feature activation-7.28e-03

Pos logit contributions
iece+0.037
 myself+0.036
aming+0.034
Ta+0.033
CR+0.033
Neg logit contributions
 tight-0.023
ucker-0.023
 fault-0.022
 understands-0.022
 stern-0.021
Token.
Token ID13
Feature activation-7.78e-03

Pos logit contributions
iece+0.013
 myself+0.013
aming+0.012
Ta+0.012
CR+0.012
Neg logit contributions
ucker-0.007
 tight-0.007
 understands-0.007
 fault-0.007
 stern-0.006
Token She
Token ID1375
Feature activation-7.49e-03

Pos logit contributions
 myself+0.014
iece+0.014
Ta+0.013
CR+0.013
aming+0.013
Neg logit contributions
ucker-0.008
 fault-0.008
 tight-0.007
 understands-0.007
run-0.007
Token her
Token ID607
Feature activation+6.64e-03

Pos logit contributions
 tight+0.069
ucker+0.068
 understands+0.066
 fault+0.066
 stern+0.064
Neg logit contributions
iece-0.087
 myself-0.085
aming-0.079
Ta-0.078
CR-0.075
Token.
Token ID13
Feature activation+4.88e-03

Pos logit contributions
 understands+0.009
ucker+0.009
arl+0.009
 fault+0.009
run+0.008
Neg logit contributions
 myself-0.010
iece-0.010
aming-0.009
 seek-0.008
 later-0.008
Token<|endoftext|>
Token ID50256
Feature activation+5.81e-04

Pos logit contributions
 fault+0.006
run+0.006
ucker+0.006
 vain+0.006
 stern+0.006
Neg logit contributions
 myself-0.007
iece-0.007
Now-0.007
Ta-0.007
 nights-0.006
TokenOnce
Token ID7454
Feature activation+3.41e-02

Pos logit contributions
ucker+0.001
 fault+0.001
ishly+0.001
 stern+0.001
ometer+0.001
Neg logit contributions
iece-0.001
 myself-0.001
aming-0.001
Now-0.001
CR-0.001
Token upon
Token ID2402
Feature activation-1.28e-02

Pos logit contributions
ishly+0.067
burn+0.062
ucker+0.061
 stern+0.061
 shrug+0.061
Neg logit contributions
iece-0.027
 later-0.027
 myself-0.025
CR-0.023
 nights-0.022
Token a
Token ID257
Feature activation+8.16e-02

Pos logit contributions
iece+0.025
 myself+0.023
ach+0.023
oy+0.023
ands+0.022
Neg logit contributions
 tight-0.013
 "-0.013
 lot-0.012
 -0.012
 hug-0.011
Token time
Token ID640
Feature activation+3.01e-02

Pos logit contributions
ucker+0.105
 tight+0.104
 understands+0.104
 fault+0.102
run+0.100
Neg logit contributions
iece-0.118
 myself-0.117
aming-0.112
Ta-0.108
CR-0.105
Token,
Token ID11
Feature activation+3.10e-02

Pos logit contributions
 tight+0.027
ucker+0.027
 understands+0.026
 fault+0.026
 stern+0.025
Neg logit contributions
iece-0.058
 myself-0.057
aming-0.054
Ta-0.053
CR-0.051
Token there
Token ID612
Feature activation+3.78e-02

Pos logit contributions
 tight+0.037
 understands+0.034
ucker+0.034
 "+0.034
 stern+0.033
Neg logit contributions
iece-0.053
 myself-0.052
Ta-0.049
aming-0.048
 endings-0.046
Token was
Token ID373
Feature activation+1.73e-02

Pos logit contributions
ucker+0.050
arl+0.050
 tight+0.049
 fault+0.049
 stern+0.048
Neg logit contributions
iece-0.055
 myself-0.054
aming-0.051
Ta-0.049
CR-0.048
Token a
Token ID257
Feature activation+2.31e-02

Pos logit contributions
 tight+0.015
 lot+0.013
 "+0.013
 Spot+0.011
 +0.011
Neg logit contributions
iece-0.041
ands-0.036
 myself-0.036
fire-0.035
 endings-0.035
Token went
Token ID1816
Feature activation-1.70e-02

Pos logit contributions
iece+0.015
 myself+0.015
Ta+0.014
aming+0.014
CR+0.013
Neg logit contributions
 tight-0.010
 understands-0.009
ucker-0.009
 fault-0.009
 stern-0.009
Token inside
Token ID2641
Feature activation-7.11e-03

Pos logit contributions
iece+0.029
 myself+0.029
aming+0.027
 seek+0.027
 nights+0.026
Neg logit contributions
ucker-0.020
 understands-0.019
 fault-0.018
 vain-0.016
 tight-0.016
Token feeling
Token ID4203
Feature activation-3.98e-02

Pos logit contributions
iece+0.013
 myself+0.013
Ta+0.012
aming+0.012
CR+0.011
Neg logit contributions
 tight-0.007
ucker-0.007
 understands-0.007
 fault-0.007
 stern-0.007
Token scared
Token ID12008
Feature activation-7.86e-03

Pos logit contributions
iece+0.073
 myself+0.072
aming+0.071
Ta+0.066
 nights+0.065
Neg logit contributions
ucker-0.039
 fault-0.037
ometer-0.035
 understands-0.035
run-0.035
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
Ta+0.015
iece+0.015
 myself+0.014
aming+0.014
CR+0.013
Neg logit contributions
 tight-0.008
 "-0.007
 fault-0.007
 stern-0.007
run-0.007
Token her
Token ID607
Feature activation+4.33e-02

Pos logit contributions
ucker+0.014
 tight+0.014
 fault+0.014
 understands+0.014
run+0.013
Neg logit contributions
iece-0.022
 myself-0.022
aming-0.020
Ta-0.020
CR-0.019
Token mom
Token ID1995
Feature activation+2.57e-02

Pos logit contributions
ucker+0.048
arl+0.044
run+0.043
 understands+0.043
icks+0.041
Neg logit contributions
 myself-0.073
iece-0.071
 seek-0.069
 nights-0.067
aming-0.067
Token gave
Token ID2921
Feature activation+3.08e-02

Pos logit contributions
 fault+0.040
arl+0.039
ucker+0.038
 understands+0.035
icks+0.035
Neg logit contributions
iece-0.035
aming-0.033
 myself-0.031
th-0.031
 nights-0.030
Token her
Token ID607
Feature activation+1.54e-02

Pos logit contributions
 tight+0.042
ucker+0.041
 understands+0.041
 fault+0.041
 stern+0.039
Neg logit contributions
iece-0.045
 myself-0.044
aming-0.041
Ta-0.040
CR-0.038
Token a
Token ID257
Feature activation+1.71e-02

Pos logit contributions
ucker+0.029
arl+0.028
 understands+0.027
 fault+0.026
ometer+0.026
Neg logit contributions
 myself-0.015
iece-0.014
aming-0.013
Ta-0.012
th-0.012
Token big
Token ID1263
Feature activation+2.30e-02

Pos logit contributions
ucker+0.020
ishly+0.019
 fault+0.018
ours+0.018
 wrists+0.016
Neg logit contributions
Ta-0.028
 seek-0.025
 seaw-0.025
aming-0.024
th-0.023
Token help
Token ID1037
Feature activation+4.00e-02

Pos logit contributions
iece+0.022
 myself+0.021
aming+0.020
Ta+0.020
CR+0.019
Neg logit contributions
 tight-0.014
ucker-0.014
 understands-0.013
 fault-0.013
 stern-0.012
Token.
Token ID13
Feature activation-7.77e-03

Pos logit contributions
 tight+0.028
 "+0.024
run+0.023
 stern+0.022
ucker+0.022
Neg logit contributions
Ta-0.088
iece-0.088
 myself-0.085
aming-0.085
Now-0.081
Token "
Token ID366
Feature activation-1.13e-02

Pos logit contributions
 myself+0.013
iece+0.013
CR+0.012
mmmm+0.011
Ta+0.011
Neg logit contributions
 fault-0.009
ucker-0.009
 understands-0.008
burn-0.008
arl-0.008
TokenPlease
Token ID5492
Feature activation-1.75e-02

Pos logit contributions
iece+0.021
 myself+0.020
inations+0.019
uring+0.018
 endings+0.018
Neg logit contributions
 will-0.012
 would-0.011
 tight-0.010
 "-0.010
burn-0.010
Token,
Token ID11
Feature activation-1.20e-02

Pos logit contributions
 myself+0.029
iece+0.028
aming+0.025
 seek+0.025
CR+0.025
Neg logit contributions
ucker-0.021
 fault-0.021
 understands-0.020
 tight-0.020
run-0.019
Token help
Token ID1037
Feature activation+4.99e-02

Pos logit contributions
 myself+0.022
iece+0.021
Ta+0.020
 seek+0.020
aming+0.020
Neg logit contributions
ucker-0.012
 fault-0.012
 tight-0.011
 understands-0.011
run-0.011
Token me
Token ID502
Feature activation-5.17e-02

Pos logit contributions
 tight+0.076
ucker+0.072
 "+0.071
 understands+0.070
 will+0.069
Neg logit contributions
iece-0.070
Ta-0.065
aming-0.062
 myself-0.062
CR-0.057
Token remove
Token ID4781
Feature activation+1.24e-02

Pos logit contributions
iece+0.091
Ta+0.088
aming+0.082
 myself+0.082
 endings+0.081
Neg logit contributions
 tight-0.062
ucker-0.055
run-0.054
 stern-0.054
 hug-0.053
Token the
Token ID262
Feature activation-1.11e-02

Pos logit contributions
 tight+0.013
ucker+0.012
 understands+0.012
 fault+0.012
 stern+0.011
Neg logit contributions
iece-0.023
 myself-0.021
aming-0.021
Ta-0.021
CR-0.020
Token trap
Token ID12840
Feature activation-5.76e-03

Pos logit contributions
iece+0.019
 myself+0.018
aming+0.018
Ta+0.017
mmmm+0.016
Neg logit contributions
 tight-0.014
 basket-0.013
 lot-0.013
 understands-0.013
 will-0.013
Token from
Token ID422
Feature activation+1.25e-02

Pos logit contributions
iece+0.010
Ta+0.010
aming+0.010
 myself+0.010
Now+0.009
Neg logit contributions
 tight-0.007
 fault-0.006
 stern-0.006
 will-0.006
 "-0.006
Token she
Token ID673
Feature activation-5.32e-03

Pos logit contributions
 tight+0.038
 understands+0.037
ucker+0.036
 fault+0.036
 stern+0.035
Neg logit contributions
iece-0.048
 myself-0.046
aming-0.044
Ta-0.043
CR-0.041
Token did
Token ID750
Feature activation+8.65e-03

Pos logit contributions
 myself+0.008
iece+0.008
aming+0.008
th+0.007
Ta+0.007
Neg logit contributions
arl-0.007
ucker-0.006
 tight-0.006
 fault-0.006
 stern-0.006
Token a
Token ID257
Feature activation+9.95e-03

Pos logit contributions
ucker+0.014
arl+0.013
 understands+0.013
 fault+0.013
run+0.013
Neg logit contributions
 myself-0.011
iece-0.011
aming-0.010
CR-0.009
th-0.009
Token great
Token ID1049
Feature activation+7.57e-03

Pos logit contributions
ucker+0.013
run+0.011
ishly+0.011
arl+0.011
bi+0.011
Neg logit contributions
 myself-0.017
iece-0.016
 seek-0.015
th-0.015
 endings-0.015
Token job
Token ID1693
Feature activation+4.35e-03

Pos logit contributions
ucker+0.010
run+0.010
 understands+0.010
 fault+0.009
arl+0.009
Neg logit contributions
iece-0.011
 myself-0.011
th-0.010
 endings-0.010
 seek-0.010
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
 tight+0.005
ucker+0.005
 understands+0.004
 fault+0.004
 stern+0.004
Neg logit contributions
iece-0.008
 myself-0.008
aming-0.007
Ta-0.007
CR-0.007
Token Lily
Token ID20037
Feature activation-1.64e-02

Pos logit contributions
 fault+0.013
burn+0.013
ucker+0.012
 understands+0.012
 vain+0.012
Neg logit contributions
 myself-0.015
iece-0.015
CR-0.014
Now-0.014
Ta-0.013
Token was
Token ID373
Feature activation+4.26e-03

Pos logit contributions
iece+0.027
 myself+0.027
aming+0.025
Ta+0.025
CR+0.024
Neg logit contributions
ucker-0.019
 fault-0.018
 tight-0.018
run-0.018
 understands-0.018
Token happy
Token ID3772
Feature activation-5.68e-03

Pos logit contributions
arl+0.007
ucker+0.007
uffy+0.006
ometer+0.006
 fault+0.006
Neg logit contributions
 myself-0.006
aming-0.006
iece-0.006
 seek-0.005
Now-0.005
Token and
Token ID290
Feature activation+1.39e-02

Pos logit contributions
iece+0.010
 myself+0.010
aming+0.009
Ta+0.009
CR+0.009
Neg logit contributions
 tight-0.006
ucker-0.006
 understands-0.006
 fault-0.006
 stern-0.005
Token relieved
Token ID22598
Feature activation+5.37e-02

Pos logit contributions
 fault+0.020
ucker+0.018
arl+0.018
run+0.018
uffy+0.017
Neg logit contributions
 myself-0.021
iece-0.021
aming-0.020
Ta-0.019
 seek-0.019
Token the
Token ID262
Feature activation+3.72e-02

Pos logit contributions
iece+0.002
 myself+0.002
Ta+0.002
aming+0.002
ands+0.002
Neg logit contributions
 tight-0.001
 "-0.001
 lot-0.001
 understands-0.001
 stern-0.001
Token chain
Token ID6333
Feature activation+4.20e-02

Pos logit contributions
 tight+0.045
 understands+0.041
 fault+0.041
 "+0.040
 lot+0.039
Neg logit contributions
 myself-0.063
iece-0.061
aming-0.058
Ta-0.057
mmmm-0.056
Token was
Token ID373
Feature activation-1.67e-02

Pos logit contributions
 tight+0.044
 understands+0.042
ucker+0.040
 "+0.039
 fault+0.039
Neg logit contributions
 myself-0.075
iece-0.074
Ta-0.070
aming-0.069
mmmm-0.067
Token swinging
Token ID25635
Feature activation+3.10e-02

Pos logit contributions
 myself+0.025
iece+0.025
aming+0.025
 seek+0.022
CR+0.022
Neg logit contributions
 understands-0.022
ucker-0.022
ometer-0.021
arl-0.021
uffy-0.021
Token really
Token ID1107
Feature activation-4.64e-02

Pos logit contributions
 tight+0.032
ucker+0.031
 understands+0.030
 fault+0.030
 stern+0.029
Neg logit contributions
iece-0.056
 myself-0.055
aming-0.052
Ta-0.051
CR-0.049
Token high
Token ID1029
Feature activation+1.12e-02

Pos logit contributions
 myself+0.090
iece+0.082
aming+0.079
CR+0.078
 seek+0.076
Neg logit contributions
ucker-0.045
 understands-0.044
 fault-0.044
ometer-0.043
run-0.043
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
ucker+0.012
 understands+0.012
 tight+0.012
 fault+0.012
run+0.011
Neg logit contributions
iece-0.019
 myself-0.019
aming-0.018
Ta-0.017
CR-0.017
Token Bob
Token ID5811
Feature activation+1.15e-02

Pos logit contributions
iece+0.022
 myself+0.022
aming+0.020
Ta+0.020
CR+0.019
Neg logit contributions
 tight-0.012
ucker-0.012
 understands-0.011
 fault-0.011
 stern-0.011
Token smiled
Token ID13541
Feature activation+2.24e-02

Pos logit contributions
 tight+0.012
 understands+0.012
ucker+0.012
 fault+0.011
run+0.011
Neg logit contributions
iece-0.021
 myself-0.020
Ta-0.019
aming-0.019
 endings-0.018
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
 tight+0.028
ucker+0.027
 understands+0.027
 fault+0.027
 stern+0.026
Neg logit contributions
iece-0.036
 myself-0.035
aming-0.033
Ta-0.032
CR-0.031
Token laughed
Token ID13818
Feature activation+3.73e-02

Pos logit contributions
 fault+0.016
ucker+0.016
arl+0.015
 tight+0.015
 understands+0.015
Neg logit contributions
 myself-0.020
iece-0.020
aming-0.019
Ta-0.018
 seek-0.018
Tokencase
Token ID7442
Feature activation+3.42e-02

Pos logit contributions
ucker+0.058
 tight+0.057
 understands+0.057
 fault+0.056
run+0.054
Neg logit contributions
iece-0.082
 myself-0.080
aming-0.074
Ta-0.073
CR-0.070
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
 tight+0.040
ucker+0.040
 understands+0.039
 fault+0.039
 stern+0.037
Neg logit contributions
iece-0.056
 myself-0.055
aming-0.051
Ta-0.050
CR-0.049
Token parked
Token ID19584
Feature activation-1.13e-02

Pos logit contributions
iece+0.029
 myself+0.028
aming+0.027
Ta+0.026
CR+0.025
Neg logit contributions
 tight-0.021
ucker-0.020
 understands-0.020
 fault-0.019
 stern-0.019
Token the
Token ID262
Feature activation-1.57e-02

Pos logit contributions
iece+0.017
 myself+0.016
 later+0.014
aming+0.014
 behold+0.014
Neg logit contributions
ucker-0.018
 understands-0.016
 fault-0.016
run-0.015
uffy-0.015
Token car
Token ID1097
Feature activation+1.75e-02

Pos logit contributions
iece+0.031
 myself+0.031
Ta+0.029
aming+0.029
CR+0.028
Neg logit contributions
ucker-0.014
 understands-0.013
 fault-0.013
 tight-0.013
run-0.012
Token.
Token ID13
Feature activation-4.51e-03

Pos logit contributions
ucker+0.018
 understands+0.017
 fault+0.017
 tight+0.016
 stern+0.015
Neg logit contributions
iece-0.033
 myself-0.032
aming-0.030
Ta-0.029
CR-0.028
Token She
Token ID1375
Feature activation-1.05e-02

Pos logit contributions
iece+0.008
 myself+0.008
Ta+0.008
CR+0.007
aming+0.007
Neg logit contributions
ucker-0.005
 fault-0.004
 tight-0.004
 understands-0.004
 stern-0.004
Token climbed
Token ID19952
Feature activation-2.05e-02

Pos logit contributions
iece+0.018
 myself+0.018
aming+0.017
Ta+0.017
CR+0.016
Neg logit contributions
 tight-0.011
ucker-0.011
 understands-0.011
 fault-0.011
 stern-0.011
Token out
Token ID503
Feature activation+2.41e-02

Pos logit contributions
iece+0.037
 myself+0.036
CR+0.033
aming+0.032
Now+0.031
Neg logit contributions
ucker-0.024
 fault-0.022
 understands-0.022
run-0.021
 stern-0.020
Token and
Token ID290
Feature activation-2.93e-02

Pos logit contributions
 tight+0.030
 understands+0.027
ucker+0.027
 fault+0.027
run+0.027
Neg logit contributions
iece-0.040
 myself-0.039
Ta-0.037
aming-0.037
 endings-0.035
Token looked
Token ID3114
Feature activation+2.76e-02

Pos logit contributions
 myself+0.053
iece+0.053
aming+0.050
 endings+0.049
mmmm+0.049
Neg logit contributions
 tight-0.032
 understands-0.029
 stern-0.027
 "-0.027
 will-0.027
Token help
Token ID1037
Feature activation+1.39e-02

Pos logit contributions
iece+0.140
 myself+0.135
aming+0.128
Ta+0.127
CR+0.122
Neg logit contributions
 tight-0.085
ucker-0.082
 understands-0.081
 fault-0.080
run-0.078
Token you
Token ID345
Feature activation-6.33e-02

Pos logit contributions
 tight+0.027
 "+0.026
ucker+0.025
 hug+0.025
run+0.025
Neg logit contributions
iece-0.015
Ta-0.013
aming-0.013
CR-0.011
 nights-0.011
Token find
Token ID1064
Feature activation-4.36e-02

Pos logit contributions
iece+0.113
Ta+0.109
aming+0.104
 myself+0.104
CR+0.100
Neg logit contributions
 tight-0.071
 fault-0.063
ucker-0.063
run-0.061
 "-0.061
Token your
Token ID534
Feature activation-2.02e-02

Pos logit contributions
iece+0.078
Ta+0.070
aming+0.070
 myself+0.066
CR+0.065
Neg logit contributions
 tight-0.052
 understands-0.047
 "-0.046
 fault-0.046
run-0.045
Token toy
Token ID13373
Feature activation+8.74e-03

Pos logit contributions
iece+0.037
 myself+0.036
aming+0.034
Ta+0.034
CR+0.032
Neg logit contributions
 tight-0.020
ucker-0.020
 understands-0.019
 fault-0.019
 stern-0.018
Token."
Token ID526
Feature activation-1.45e-02

Pos logit contributions
 tight+0.007
 fault+0.007
 "+0.006
 will+0.006
ometer+0.006
Neg logit contributions
iece-0.018
Ta-0.018
aming-0.017
 myself-0.017
 behold-0.016
Token\n
Token ID198
Feature activation-1.54e-02

Pos logit contributions
iece+0.030
 myself+0.029
aming+0.028
Ta+0.027
CR+0.027
Neg logit contributions
 tight-0.012
ucker-0.011
 understands-0.011
 fault-0.011
 stern-0.010
Token\n
Token ID198
Feature activation-1.35e-02

Pos logit contributions
iece+0.034
 myself+0.031
 nights+0.030
 endings+0.030
aming+0.030
Neg logit contributions
 tight-0.012
 -0.011
 "-0.011
run-0.011
ucker-0.011
TokenTim
Token ID14967
Feature activation-9.70e-03

Pos logit contributions
iece+0.022
 nights+0.019
 endings+0.018
ands+0.017
 myself+0.017
Neg logit contributions
 tight-0.019
run-0.018
 -0.018
 stern-0.018
 "-0.017
Token and
Token ID290
Feature activation+3.93e-02

Pos logit contributions
iece+0.016
 myself+0.016
aming+0.015
Ta+0.015
CR+0.014
Neg logit contributions
 tight-0.011
ucker-0.011
 fault-0.011
 understands-0.010
 stern-0.010
Token the
Token ID262
Feature activation+4.23e-02

Pos logit contributions
ucker+0.053
 fault+0.050
 tight+0.050
 understands+0.049
arl+0.048
Neg logit contributions
 myself-0.060
iece-0.060
 seek-0.053
Ta-0.053
 behold-0.052
Token head
Token ID1182
Feature activation+4.34e-02

Pos logit contributions
 tight+0.024
ucker+0.024
 understands+0.023
 fault+0.023
 stern+0.022
Neg logit contributions
iece-0.046
 myself-0.045
aming-0.043
Ta-0.042
CR-0.041
Token was
Token ID373
Feature activation-3.81e-03

Pos logit contributions
 tight+0.049
ucker+0.049
 understands+0.048
 fault+0.047
 stern+0.046
Neg logit contributions
iece-0.073
 myself-0.071
aming-0.067
Ta-0.066
CR-0.064
Token spinning
Token ID19493
Feature activation-1.85e-03

Pos logit contributions
 myself+0.006
iece+0.006
aming+0.006
CR+0.005
 seek+0.005
Neg logit contributions
 understands-0.005
ucker-0.005
 fault-0.005
 tight-0.004
ometer-0.004
Token.
Token ID13
Feature activation+2.69e-03

Pos logit contributions
iece+0.004
 myself+0.003
Ta+0.003
aming+0.003
CR+0.003
Neg logit contributions
 tight-0.002
ucker-0.002
 fault-0.002
 understands-0.002
 stern-0.002
Token\n
Token ID198
Feature activation-1.35e-02

Pos logit contributions
ucker+0.002
 tight+0.002
 fault+0.002
 understands+0.002
 stern+0.002
Neg logit contributions
iece-0.005
 myself-0.005
aming-0.005
Ta-0.005
CR-0.005
Token\n
Token ID198
Feature activation-1.14e-02

Pos logit contributions
iece+0.027
 myself+0.027
Ta+0.025
aming+0.025
 endings+0.024
Neg logit contributions
 tight-0.012
 understands-0.011
ucker-0.011
run-0.011
 fault-0.010
TokenJohn
Token ID7554
Feature activation+9.14e-03

Pos logit contributions
iece+0.016
 myself+0.015
 endings+0.014
 seaw+0.014
ands+0.014
Neg logit contributions
 tight-0.018
 "-0.017
 -0.017
run-0.016
 stern-0.016
Token's
Token ID338
Feature activation-8.24e-03

Pos logit contributions
 tight+0.010
 understands+0.009
ucker+0.009
 stern+0.009
run+0.009
Neg logit contributions
 myself-0.016
iece-0.016
Ta-0.015
mmmm-0.015
aming-0.015
Token Mom
Token ID11254
Feature activation-4.52e-03

Pos logit contributions
iece+0.015
 myself+0.014
aming+0.013
Ta+0.013
 seek+0.013
Neg logit contributions
ucker-0.009
 understands-0.008
 tight-0.008
 fault-0.008
run-0.008
Token saw
Token ID2497
Feature activation-2.82e-02

Pos logit contributions
iece+0.007
 myself+0.007
aming+0.007
Ta+0.007
CR+0.006
Neg logit contributions
 fault-0.005
ucker-0.005
 tight-0.005
 understands-0.005
arl-0.005
Token him
Token ID683
Feature activation-2.17e-02

Pos logit contributions
 myself+0.040
iece+0.040
Ta+0.038
aming+0.037
 seek+0.036
Neg logit contributions
 tight-0.040
 understands-0.039
 "-0.038
 stern-0.038
 fault-0.037
Token what
Token ID644
Feature activation+7.48e-02

Pos logit contributions
 myself+0.067
iece+0.067
 later+0.061
 pirates+0.057
 himself+0.056
Neg logit contributions
ucker-0.074
arl-0.066
oun-0.062
 fault-0.062
 understands-0.061
Token it
Token ID340
Feature activation-1.62e-04

Pos logit contributions
arl+0.136
ucker+0.127
 fault+0.120
ishly+0.108
run+0.108
Neg logit contributions
 myself-0.089
 you-0.088
th-0.087
 nights-0.087
 endings-0.084
Token was
Token ID373
Feature activation-3.26e-02

Pos logit contributions
 myself+0.000
iece+0.000
aming+0.000
 seek+0.000
Ta+0.000
Neg logit contributions
ucker-0.000
 fault-0.000
arl-0.000
 tight-0.000
run-0.000
Token and
Token ID290
Feature activation-1.64e-02

Pos logit contributions
iece+0.062
 myself+0.061
Ta+0.059
aming+0.058
CR+0.055
Neg logit contributions
 tight-0.030
 understands-0.028
 fault-0.028
ucker-0.028
 "-0.027
Token where
Token ID810
Feature activation+9.24e-02

Pos logit contributions
 myself+0.027
iece+0.027
aming+0.024
 behold+0.024
 endings+0.024
Neg logit contributions
ucker-0.020
 fault-0.019
 tight-0.018
arl-0.018
 understands-0.018
Token it
Token ID340
Feature activation+4.09e-03

Pos logit contributions
ucker+0.141
arl+0.135
 fault+0.129
 understands+0.122
run+0.122
Neg logit contributions
 myself-0.126
iece-0.115
 endings-0.109
CR-0.109
 nights-0.109
Token was
Token ID373
Feature activation-3.19e-02

Pos logit contributions
ucker+0.005
 fault+0.005
 tight+0.004
arl+0.004
 understands+0.004
Neg logit contributions
iece-0.007
 myself-0.007
aming-0.006
CR-0.006
Ta-0.006
Token going
Token ID1016
Feature activation-3.00e-02

Pos logit contributions
iece+0.059
 myself+0.058
aming+0.054
Ta+0.054
CR+0.052
Neg logit contributions
 tight-0.032
ucker-0.030
 understands-0.030
 fault-0.030
 stern-0.029
Token.
Token ID13
Feature activation-3.99e-03

Pos logit contributions
 myself+0.056
iece+0.055
aming+0.051
CR+0.050
ach+0.049
Neg logit contributions
ucker-0.030
 understands-0.030
 fault-0.029
 tight-0.027
run-0.027
Token They
Token ID1119
Feature activation-6.83e-03

Pos logit contributions
iece+0.008
 myself+0.008
aming+0.008
Ta+0.008
ands+0.008
Neg logit contributions
 tight-0.003
 "-0.003
run-0.003
ucker-0.003
 -0.003
Token decided
Token ID3066
Feature activation-2.99e-02

Pos logit contributions
iece+0.011
 myself+0.010
aming+0.010
Ta+0.010
CR+0.009
Neg logit contributions
 tight-0.009
ucker-0.009
 understands-0.008
 fault-0.008
 stern-0.008
Token was
Token ID373
Feature activation-1.57e-02

Pos logit contributions
 tight+0.024
ucker+0.024
 understands+0.023
 fault+0.023
 stern+0.022
Neg logit contributions
iece-0.041
 myself-0.040
aming-0.038
Ta-0.038
CR-0.036
Token too
Token ID1165
Feature activation+1.49e-02

Pos logit contributions
 myself+0.022
aming+0.022
 later+0.020
 seek+0.020
 nights+0.019
Neg logit contributions
 understands-0.024
ucker-0.024
ometer-0.023
ip-0.022
arl-0.021
Token late
Token ID2739
Feature activation-4.49e-02

Pos logit contributions
ucker+0.027
 tight+0.027
 understands+0.027
 fault+0.027
run+0.026
Neg logit contributions
iece-0.015
 myself-0.014
aming-0.013
Ta-0.012
CR-0.012
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
iece+0.083
 myself+0.081
aming+0.074
CR+0.072
 endings+0.071
Neg logit contributions
ucker-0.046
 understands-0.045
 tight-0.043
 fault-0.042
run-0.041
Token They
Token ID1119
Feature activation+1.57e-02

Pos logit contributions
 tight+0.017
ucker+0.017
 understands+0.016
 fault+0.016
 stern+0.015
Neg logit contributions
iece-0.028
 myself-0.028
aming-0.026
Ta-0.026
CR-0.026
Token didn
Token ID1422
Feature activation+3.62e-03

Pos logit contributions
ucker+0.019
 fault+0.018
 understands+0.017
 tight+0.017
arl+0.017
Neg logit contributions
iece-0.026
 myself-0.024
aming-0.024
Ta-0.023
 nights-0.022
Token't
Token ID470
Feature activation+1.18e-02

Pos logit contributions
ucker+0.004
 tight+0.004
 understands+0.004
 fault+0.004
 stern+0.004
Neg logit contributions
 myself-0.006
iece-0.006
aming-0.005
Ta-0.005
CR-0.005
Token make
Token ID787
Feature activation+1.41e-02

Pos logit contributions
 tight+0.014
ucker+0.014
 understands+0.013
 fault+0.013
 stern+0.013
Neg logit contributions
iece-0.019
 myself-0.019
aming-0.018
Ta-0.018
CR-0.017
Token it
Token ID340
Feature activation-1.45e-02

Pos logit contributions
ucker+0.017
 understands+0.017
run+0.017
 tight+0.017
arl+0.016
Neg logit contributions
 myself-0.023
iece-0.022
aming-0.020
 endings-0.020
CR-0.019
Token out
Token ID503
Feature activation+2.13e-02

Pos logit contributions
iece+0.027
 myself+0.027
aming+0.025
Ta+0.025
CR+0.024
Neg logit contributions
 tight-0.014
ucker-0.014
 understands-0.013
 fault-0.013
 stern-0.013
Token of
Token ID286
Feature activation+3.42e-03

Pos logit contributions
ucker+0.024
 tight+0.024
 understands+0.024
 fault+0.023
 stern+0.022
Neg logit contributions
iece-0.036
 myself-0.036
aming-0.033
Ta-0.032
CR-0.032
Token friends
Token ID2460
Feature activation-3.79e-02

Pos logit contributions
ucker+0.018
 understands+0.017
 fault+0.017
 tight+0.017
 stern+0.017
Neg logit contributions
 myself-0.022
iece-0.021
aming-0.019
 seek-0.019
oy-0.019
Token.
Token ID13
Feature activation+7.41e-04

Pos logit contributions
 myself+0.049
iece+0.048
 later+0.045
 seek+0.042
 pirates+0.039
Neg logit contributions
ucker-0.067
 understands-0.061
arl-0.061
 fault-0.060
run-0.058
Token The
Token ID383
Feature activation+1.47e-02

Pos logit contributions
ucker+0.001
 fault+0.001
 tight+0.001
 stern+0.001
 understands+0.001
Neg logit contributions
iece-0.001
 myself-0.001
Ta-0.001
CR-0.001
Now-0.001
Token sun
Token ID4252
Feature activation-1.49e-02

Pos logit contributions
ucker+0.019
run+0.019
 fault+0.019
 stern+0.017
 understands+0.016
Neg logit contributions
 myself-0.024
iece-0.022
Now-0.022
aming-0.022
 nights-0.022
Token was
Token ID373
Feature activation-1.60e-02

Pos logit contributions
iece+0.024
 myself+0.023
aming+0.022
 nights+0.021
Ta+0.021
Neg logit contributions
 fault-0.018
ucker-0.018
 tight-0.017
arl-0.017
run-0.017
Token always
Token ID1464
Feature activation-7.34e-02

Pos logit contributions
 myself+0.029
iece+0.029
aming+0.028
Ta+0.026
 nights+0.026
Neg logit contributions
 understands-0.016
ucker-0.016
 fault-0.016
run-0.015
 tight-0.015
Token shining
Token ID22751
Feature activation+2.24e-02

Pos logit contributions
iece+0.126
 myself+0.125
aming+0.122
 seek+0.112
 nights+0.110
Neg logit contributions
 understands-0.083
 fault-0.082
ucker-0.081
run-0.078
 tight-0.075
Token,
Token ID11
Feature activation-3.12e-02

Pos logit contributions
ucker+0.027
 fault+0.026
 understands+0.026
run+0.025
 tight+0.025
Neg logit contributions
iece-0.037
 myself-0.036
aming-0.034
Ta-0.032
 nights-0.032
Token and
Token ID290
Feature activation-7.82e-03

Pos logit contributions
iece+0.049
 myself+0.048
aming+0.044
Ta+0.044
CR+0.042
Neg logit contributions
 tight-0.039
ucker-0.039
 understands-0.038
 fault-0.038
 stern-0.037
Token the
Token ID262
Feature activation+4.17e-02

Pos logit contributions
iece+0.012
 myself+0.012
 nights+0.011
aming+0.011
 seek+0.011
Neg logit contributions
ucker-0.010
 fault-0.010
run-0.009
arl-0.009
 tight-0.009
Token pony
Token ID26902
Feature activation+6.02e-02

Pos logit contributions
ucker+0.076
run+0.073
 fault+0.072
ometer+0.068
ours+0.067
Neg logit contributions
 myself-0.050
iece-0.045
 nights-0.044
aming-0.044
Now-0.043
Token of
Token ID286
Feature activation-3.22e-02

Pos logit contributions
iece+0.017
 myself+0.016
Ta+0.015
 endings+0.015
CR+0.014
Neg logit contributions
 tight-0.008
 lot-0.007
ucker-0.007
 "-0.007
 understands-0.007
Token trees
Token ID7150
Feature activation-3.55e-02

Pos logit contributions
iece+0.064
 myself+0.055
light+0.053
Ta+0.051
uring+0.051
Neg logit contributions
 lot-0.040
 tight-0.038
 basket-0.033
 understands-0.033
 "-0.033
Token and
Token ID290
Feature activation-5.74e-02

Pos logit contributions
 myself+0.049
iece+0.047
aming+0.045
 seek+0.041
Now+0.040
Neg logit contributions
 fault-0.052
 understands-0.050
ucker-0.050
run-0.049
 stern-0.049
Token Tommy
Token ID19919
Feature activation+6.93e-03

Pos logit contributions
iece+0.098
 myself+0.089
ands+0.085
aming+0.085
Ta+0.084
Neg logit contributions
 tight-0.071
 lot-0.068
 "-0.067
 understands-0.064
 -0.064
Token could
Token ID714
Feature activation-1.11e-02

Pos logit contributions
 fault+0.009
arl+0.009
ucker+0.008
 understands+0.008
run+0.008
Neg logit contributions
 myself-0.011
iece-0.011
aming-0.010
oy-0.010
CR-0.010
Token see
Token ID766
Feature activation-6.70e-02

Pos logit contributions
iece+0.016
 myself+0.016
aming+0.015
Ta+0.014
CR+0.014
Neg logit contributions
 tight-0.015
ucker-0.015
 understands-0.015
 fault-0.015
 stern-0.014
Token a
Token ID257
Feature activation-1.68e-02

Pos logit contributions
 myself+0.094
iece+0.086
 himself+0.079
aming+0.079
 seaw+0.079
Neg logit contributions
ucker-0.105
arl-0.102
ometer-0.098
 fault-0.098
 stern-0.096
Token laser
Token ID12855
Feature activation-3.70e-03

Pos logit contributions
 myself+0.026
iece+0.026
 seek+0.025
aming+0.025
Ta+0.025
Neg logit contributions
 fault-0.020
run-0.019
ucker-0.019
 understands-0.018
icks-0.018
Token in
Token ID287
Feature activation-1.27e-03

Pos logit contributions
iece+0.007
 myself+0.007
Ta+0.007
aming+0.007
CR+0.006
Neg logit contributions
 tight-0.003
ucker-0.003
 fault-0.003
 understands-0.003
 stern-0.003
Token the
Token ID262
Feature activation-4.58e-02

Pos logit contributions
 myself+0.002
iece+0.002
aming+0.002
 seek+0.002
Now+0.002
Neg logit contributions
ucker-0.002
 understands-0.002
 fault-0.002
ometer-0.001
 stern-0.001
Token sky
Token ID6766
Feature activation-5.67e-02

Pos logit contributions
iece+0.073
 myself+0.072
aming+0.065
mmmm+0.064
mmm+0.064
Neg logit contributions
 tight-0.060
 understands-0.055
 lot-0.054
 basket-0.054
 fault-0.052
Token noticed
Token ID6810
Feature activation-2.81e-02

Pos logit contributions
 would+0.008
 tight+0.007
 will+0.007
 understands+0.007
 won+0.007
Neg logit contributions
 myself-0.017
iece-0.016
Ta-0.016
mmmm-0.015
mmm-0.015
Token something
Token ID1223
Feature activation+1.85e-03

Pos logit contributions
iece+0.052
 myself+0.049
Ta+0.048
aming+0.046
ands+0.045
Neg logit contributions
 "-0.031
 tight-0.030
 stern-0.029
 understands-0.028
run-0.027
Token rare
Token ID4071
Feature activation-3.89e-02

Pos logit contributions
ucker+0.002
 tight+0.002
 fault+0.002
 understands+0.002
arl+0.002
Neg logit contributions
iece-0.003
 myself-0.003
aming-0.003
Ta-0.003
CR-0.003
Token in
Token ID287
Feature activation+2.34e-02

Pos logit contributions
iece+0.073
 myself+0.072
Ta+0.069
aming+0.067
CR+0.065
Neg logit contributions
 tight-0.038
 understands-0.036
ucker-0.035
 fault-0.035
 "-0.034
Token the
Token ID262
Feature activation-2.56e-02

Pos logit contributions
 tight+0.022
ucker+0.020
 understands+0.019
 fault+0.019
 stern+0.019
Neg logit contributions
iece-0.046
 myself-0.044
aming-0.042
Ta-0.042
CR-0.041
Token ocean
Token ID9151
Feature activation-7.85e-02

Pos logit contributions
iece+0.033
 myself+0.032
Ta+0.029
mmmm+0.029
mmm+0.029
Neg logit contributions
 tight-0.042
 basket-0.039
 lot-0.039
 will-0.038
 hug-0.038
Token.
Token ID13
Feature activation+1.64e-03

Pos logit contributions
Ta+0.164
iece+0.164
 myself+0.162
aming+0.154
CR+0.151
Neg logit contributions
 tight-0.058
 "-0.050
ucker-0.050
 understands-0.049
 stern-0.049
Token It
Token ID632
Feature activation-6.06e-03

Pos logit contributions
 tight+0.001
ucker+0.001
 understands+0.001
 "+0.001
 stern+0.001
Neg logit contributions
iece-0.004
 myself-0.004
aming-0.003
Ta-0.003
 nights-0.003
Token was
Token ID373
Feature activation-3.34e-02

Pos logit contributions
iece+0.012
 myself+0.012
Ta+0.011
aming+0.011
mmmm+0.011
Neg logit contributions
 tight-0.005
 understands-0.005
 "-0.005
ucker-0.005
 stern-0.005
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
iece+0.059
 myself+0.056
Ta+0.054
CR+0.052
mmmm+0.052
Neg logit contributions
 tight-0.041
 "-0.038
 fault-0.037
 stern-0.036
 understands-0.035
Token special
Token ID2041
Feature activation-3.66e-02

Pos logit contributions
iece+0.018
 myself+0.018
aming+0.017
Ta+0.017
 seek+0.017
Neg logit contributions
ucker-0.012
 understands-0.012
 fault-0.012
 tight-0.012
run-0.011
Token so
Token ID523
Feature activation-4.38e-02

Pos logit contributions
 myself+0.045
iece+0.040
aming+0.039
 seek+0.038
CR+0.037
Neg logit contributions
ucker-0.032
 understands-0.032
 fault-0.030
arl-0.029
ishly-0.029
Token lovely
Token ID14081
Feature activation-3.96e-02

Pos logit contributions
 myself+0.079
 seek+0.069
aming+0.068
Ta+0.068
iece+0.066
Neg logit contributions
 fault-0.049
ishly-0.046
burn-0.045
ucker-0.045
 understands-0.045
Token.
Token ID13
Feature activation-3.45e-03

Pos logit contributions
iece+0.072
Ta+0.068
 myself+0.067
aming+0.066
CR+0.062
Neg logit contributions
 tight-0.044
 fault-0.040
ucker-0.040
 understands-0.039
 "-0.039
Token I
Token ID314
Feature activation-7.89e-03

Pos logit contributions
iece+0.006
 myself+0.006
aming+0.005
Ta+0.005
CR+0.005
Neg logit contributions
 tight-0.004
ucker-0.004
 understands-0.004
 fault-0.004
 stern-0.004
Token will
Token ID481
Feature activation+4.23e-03

Pos logit contributions
 myself+0.013
iece+0.013
Ta+0.013
aming+0.013
 seek+0.012
Neg logit contributions
ucker-0.008
 tight-0.008
 fault-0.008
 understands-0.008
 stern-0.007
Token be
Token ID307
Feature activation-7.15e-02

Pos logit contributions
 tight+0.005
ucker+0.004
 understands+0.004
 fault+0.004
 stern+0.004
Neg logit contributions
iece-0.008
 myself-0.007
aming-0.007
Ta-0.007
CR-0.007
Token very
Token ID845
Feature activation-7.34e-04

Pos logit contributions
 myself+0.115
iece+0.111
aming+0.108
Ta+0.096
CR+0.096
Neg logit contributions
ucker-0.091
 fault-0.088
 understands-0.088
ometer-0.086
 tight-0.084
Token happy
Token ID3772
Feature activation-2.96e-02

Pos logit contributions
 myself+0.001
iece+0.001
aming+0.001
Ta+0.001
 seek+0.001
Neg logit contributions
 fault-0.001
ucker-0.001
 understands-0.001
 tight-0.001
run-0.001
Token if
Token ID611
Feature activation+1.58e-02

Pos logit contributions
Ta+0.061
iece+0.061
aming+0.056
CR+0.053
 behold+0.053
Neg logit contributions
 tight-0.027
 "-0.025
 fault-0.025
 will-0.024
run-0.023
Token you
Token ID345
Feature activation-7.76e-03

Pos logit contributions
 tight+0.026
 understands+0.024
 "+0.024
ucker+0.024
 fault+0.024
Neg logit contributions
iece-0.020
aming-0.018
Ta-0.018
 myself-0.018
Now-0.016
Token give
Token ID1577
Feature activation+2.78e-02

Pos logit contributions
iece+0.012
 myself+0.012
aming+0.011
Ta+0.011
CR+0.011
Neg logit contributions
 tight-0.010
ucker-0.009
 understands-0.009
 fault-0.009
 stern-0.009
Token dry
Token ID5894
Feature activation+6.84e-03

Pos logit contributions
burn+0.046
ucker+0.045
 fault+0.044
uffy+0.041
 understands+0.041
Neg logit contributions
 myself-0.074
iece-0.067
aming-0.065
 seek-0.064
CR-0.062
Token.
Token ID13
Feature activation+7.95e-03

Pos logit contributions
ucker+0.007
 tight+0.007
 understands+0.007
 fault+0.007
 stern+0.007
Neg logit contributions
iece-0.012
 myself-0.012
aming-0.011
 nights-0.010
Ta-0.010
Token He
Token ID679
Feature activation+3.16e-03

Pos logit contributions
ucker+0.008
 tight+0.008
 fault+0.008
 understands+0.008
arl+0.007
Neg logit contributions
 myself-0.014
iece-0.014
aming-0.013
Ta-0.013
CR-0.012
Token had
Token ID550
Feature activation+4.71e-03

Pos logit contributions
ucker+0.004
 fault+0.004
 tight+0.004
arl+0.004
 understands+0.003
Neg logit contributions
iece-0.005
 myself-0.005
aming-0.005
Ta-0.005
CR-0.004
Token seen
Token ID1775
Feature activation-3.78e-02

Pos logit contributions
 tight+0.006
ucker+0.006
 understands+0.005
 fault+0.005
 stern+0.005
Neg logit contributions
iece-0.008
 myself-0.008
aming-0.007
Ta-0.007
CR-0.007
Token children
Token ID1751
Feature activation-8.13e-02

Pos logit contributions
iece+0.058
 myself+0.058
aming+0.052
Ta+0.051
 nights+0.050
Neg logit contributions
ucker-0.050
 tight-0.048
 understands-0.046
arl-0.046
 fault-0.045
Token try
Token ID1949
Feature activation-4.93e-03

Pos logit contributions
iece+0.131
 myself+0.128
aming+0.120
Ta+0.115
CR+0.112
Neg logit contributions
 tight-0.098
ucker-0.098
 understands-0.096
 fault-0.095
 stern-0.090
Token to
Token ID284
Feature activation-2.29e-02

Pos logit contributions
 myself+0.009
iece+0.009
aming+0.008
 nights+0.008
CR+0.008
Neg logit contributions
ucker-0.005
arl-0.005
uffy-0.005
oun-0.004
 understands-0.004
Token climb
Token ID12080
Feature activation-1.59e-02

Pos logit contributions
iece+0.043
Ta+0.040
aming+0.039
CR+0.039
 myself+0.038
Neg logit contributions
 tight-0.025
 hug-0.023
 smile-0.022
 shrug-0.022
 will-0.021
Token it
Token ID340
Feature activation-1.55e-02

Pos logit contributions
 myself+0.027
iece+0.027
aming+0.024
CR+0.024
 nights+0.023
Neg logit contributions
ucker-0.019
 understands-0.018
 fault-0.017
arl-0.016
 tight-0.016
Token before
Token ID878
Feature activation-2.74e-02

Pos logit contributions
iece+0.027
 myself+0.027
aming+0.025
Ta+0.025
CR+0.024
Neg logit contributions
 tight-0.016
ucker-0.016
 understands-0.016
 fault-0.016
 stern-0.015