TOP ACTIVATIONS
MAX = 5.103

v ase on her bed side table . She remembered why
of laundry on the clothes line . " He wants to
lonely clock on his bed side table and he made sure
I fell off the clothes line and I can 't get
his hunger began to sub side . When the
down side - by - side . They were having so
perfume bottle on her bed side table and every night she
kept it on her bed side table . Every night before
a walk along the shore line . But something
together side - by - side . No
kept it by her bed side every night . One evening
Once they reached the other side , they stepped off the
back on the zip - line , z ipping all around
the heart on her bed side table . Each night ,
robe back onto the clothes line . He used all his
kept it on her bed side table and looked at it
as he explored the shore line . One day
it would hug the shore line and admire the wrecked boats
go back to the pool side , Mia . Maybe we
when it hugged the shore line again , it noticed a

MIN ACTIVATIONS
MIN = -3.099

!" " A ww !" Lily says . "
the ground . " E ww , what 's that ?"
A ww , okay . â
and said , " A ww www , do I have
be bitter , like e ww . Do you want to
. " E ww , this fish is sm
. " A ww , it is so cute
. " E ww !" she screamed .
at it said " ew ww !" One
friends said , " A ww , okay . But come
and said , " E ww , Bruno . You are
She said , " E ww , disgusting !" She threw
He says , " A ww , this is from Anna
He said , " E ww , this is the worst
Of course ! Now , let â
so proud of himself ! Now he could play with them
done , she smiled . Now she could wear her dress
's great , Molly ! Now , can you help me
're right , John ! Now , let 's see if
was over the moon ! Now she could easily find her

INTERVAL 3.052 - 5.103
CONTAINS 0.006%

She saw Lily and Wh isk ers looking sad . She
got back inside , Wh isk ers saw a teaspoon on
the things we eat can affect our dreams .
. After Wh isk ers got better , he
then accidentally pin ched Wh isk ers with the brush .

INTERVAL 1.002 - 3.052
CONTAINS 1.779%

went to the market to buy some vegetables to cook .
or hurt . L ily waved to the vet
waved back and the se ag ull flew away .
around you , then say ' I recognize the shield .'
many surprises ." L ily and Sam sniff ed

INTERVAL -1.049 - 1.002
CONTAINS 97.682%

She did not like clown s . She thought they were
was very interesting to see all the pretty colors in the
haircut . She asked her mom if she could get a
her heart sink . She wondered if Rose was in one
Elizabeth 's eyes lit up with excitement . She was ready

INTERVAL -3.099 - -1.049
CONTAINS 0.533%

Help ! Help !" Little Annie shouted from the top of
and hold each other 's hands . After the
, " Sam , you should not chew gum when you
a chair . He was smiling at her .
a three - year - old boy who loved exploring the
Token v
Token ID410
Feature activation+3.53e-01

Pos logit contributions
side+0.010
handed+0.010
ge+0.009
 unatt+0.009
thank+0.009
Neg logit contributions
 happens-0.009
 comes-0.008
apy-0.008
 Everything-0.008
 can-0.008
Tokenase
Token ID589
Feature activation+5.95e-01

Pos logit contributions
 distance+0.047
ge+0.047
side+0.044
 unatt+0.043
 lookout+0.042
Neg logit contributions
 Now-0.023
ay-0.021
 Everyone-0.020
 happens-0.020
apy-0.020
Token on
Token ID319
Feature activation+2.51e-01

Pos logit contributions
side+0.065
handed+0.062
 distance+0.062
ge+0.061
anut+0.061
Neg logit contributions
 happens-0.052
 comes-0.049
 Now-0.044
 can-0.043
apy-0.042
Token her
Token ID607
Feature activation+1.97e-01

Pos logit contributions
side+0.034
 unatt+0.034
anut+0.033
ge+0.033
handed+0.033
Neg logit contributions
 happens-0.017
 It-0.015
 comes-0.014
 Everyone-0.014
 Now-0.014
Token bed
Token ID3996
Feature activation+4.39e-01

Pos logit contributions
side+0.020
ge+0.019
handed+0.018
thank+0.017
 distance+0.017
Neg logit contributions
 happens-0.018
 can-0.017
 I-0.017
 comes-0.017
 We-0.016
Tokenside
Token ID1589
Feature activation+4.63e+00

Pos logit contributions
 distance+0.046
ge+0.046
anut+0.046
handed+0.046
 unatt+0.044
Neg logit contributions
 happens-0.038
 comes-0.034
 Now-0.033
 It-0.033
 Everyone-0.032
Token table
Token ID3084
Feature activation+6.17e-01

Pos logit contributions
 unatt+0.424
handed+0.398
anut+0.379
ge+0.373
side+0.371
Neg logit contributions
 happens-0.471
 comes-0.439
 Everyone-0.410
 People-0.402
apy-0.400
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
side+0.064
handed+0.062
 distance+0.061
ge+0.060
 unatt+0.060
Neg logit contributions
 happens-0.058
 comes-0.055
 Now-0.049
 can-0.048
 It-0.048
Token She
Token ID1375
Feature activation-5.73e-02

Pos logit contributions
side+0.016
handed+0.015
 distance+0.014
 unatt+0.014
thank+0.013
Neg logit contributions
 happens-0.014
 comes-0.012
apy-0.011
oke-0.010
oms-0.010
Token remembered
Token ID12086
Feature activation-3.93e-01

Pos logit contributions
 happens+0.006
 comes+0.005
 Everything+0.005
apy+0.005
oke+0.005
Neg logit contributions
side-0.006
handed-0.006
 distance-0.006
 unatt-0.006
thank-0.005
Token why
Token ID1521
Feature activation-3.09e-01

Pos logit contributions
 happens+0.034
 comes+0.029
 Everything+0.027
apy+0.026
 Now+0.026
Neg logit contributions
side-0.051
 distance-0.048
handed-0.048
 unatt-0.046
ge-0.045
Token of
Token ID286
Feature activation-3.30e-01

Pos logit contributions
side+0.082
handed+0.078
 unatt+0.076
 distance+0.076
thank+0.075
Neg logit contributions
 happens-0.012
 comes-0.009
apy-0.006
oke-0.005
 It-0.004
Token laundry
Token ID25724
Feature activation+6.82e-01

Pos logit contributions
 happens+0.035
 comes+0.031
 Everything+0.029
apy+0.029
 Now+0.029
Neg logit contributions
side-0.036
handed-0.033
 distance-0.033
 unatt-0.032
thank-0.030
Token on
Token ID319
Feature activation+2.28e-01

Pos logit contributions
side+0.069
handed+0.067
 distance+0.063
anut+0.062
 unatt+0.061
Neg logit contributions
 happens-0.069
 comes-0.065
 can-0.058
apy-0.056
oms-0.055
Token the
Token ID262
Feature activation-3.36e-02

Pos logit contributions
side+0.031
 unatt+0.030
handed+0.030
thank+0.029
anut+0.029
Neg logit contributions
 happens-0.016
 comes-0.014
 It-0.013
 turns-0.013
 Everything-0.013
Token clothes
Token ID8242
Feature activation+4.91e-01

Pos logit contributions
 happens+0.004
 comes+0.004
 Everything+0.003
 can+0.003
 Now+0.003
Neg logit contributions
side-0.003
ge-0.003
handed-0.003
 unatt-0.003
thank-0.003
Tokenline
Token ID1370
Feature activation+4.62e+00

Pos logit contributions
side+0.053
handed+0.052
 distance+0.051
 unatt+0.050
thank+0.050
Neg logit contributions
 happens-0.044
 comes-0.041
 can-0.038
 It-0.038
oke-0.037
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
side+0.492
handed+0.469
 unatt+0.466
 distance+0.456
 former+0.432
Neg logit contributions
 happens-0.447
 comes-0.423
 It-0.386
 Everything-0.378
oke-0.376
Token "
Token ID366
Feature activation+3.99e-01

Pos logit contributions
side+0.020
 distance+0.018
handed+0.018
 unatt+0.017
thank+0.016
Neg logit contributions
 happens-0.015
 comes-0.013
apy-0.011
 Everything-0.011
ours-0.011
TokenHe
Token ID1544
Feature activation+3.80e-01

Pos logit contributions
side+0.047
 unatt+0.044
 spine+0.043
handed+0.041
 distance+0.041
Neg logit contributions
 happens-0.036
 It-0.033
apy-0.032
 Now-0.032
 Everything-0.032
Token wants
Token ID3382
Feature activation-7.47e-01

Pos logit contributions
handed+0.045
 bank+0.043
side+0.042
anut+0.041
 unatt+0.040
Neg logit contributions
apy-0.027
 happens-0.027
 Everything-0.026
 importantly-0.026
 People-0.025
Token to
Token ID284
Feature activation+3.48e-01

Pos logit contributions
 happens+0.087
apy+0.078
 comes+0.078
 Everything+0.075
oke+0.075
Neg logit contributions
side-0.071
 distance-0.066
handed-0.065
 unatt-0.064
 bank-0.058
Token lonely
Token ID21757
Feature activation-5.16e-02

Pos logit contributions
 happens+0.022
 comes+0.019
 Everything+0.018
 People+0.017
apy+0.017
Neg logit contributions
side-0.033
handed-0.031
ge-0.031
anut-0.031
keeping-0.030
Token clock
Token ID8801
Feature activation-4.87e-01

Pos logit contributions
 happens+0.006
 comes+0.005
 Everything+0.005
apy+0.005
 Now+0.005
Neg logit contributions
side-0.006
 distance-0.005
handed-0.005
 unatt-0.005
thank-0.005
Token on
Token ID319
Feature activation+5.99e-02

Pos logit contributions
 happens+0.050
 comes+0.046
 Everything+0.042
apy+0.041
 can+0.041
Neg logit contributions
side-0.050
handed-0.048
 distance-0.046
 unatt-0.046
anut-0.045
Token his
Token ID465
Feature activation+5.68e-02

Pos logit contributions
side+0.007
handed+0.007
 unatt+0.007
anut+0.007
 distance+0.007
Neg logit contributions
 happens-0.005
 comes-0.004
 It-0.004
 Now-0.004
 Everything-0.004
Token bed
Token ID3996
Feature activation+2.59e-01

Pos logit contributions
side+0.006
handed+0.006
thank+0.006
ge+0.006
ety+0.005
Neg logit contributions
 happens-0.005
 comes-0.005
 turns-0.004
 Everything-0.004
 We-0.004
Tokenside
Token ID1589
Feature activation+4.51e+00

Pos logit contributions
anut+0.029
handed+0.028
ge+0.028
 distance+0.028
side+0.027
Neg logit contributions
 happens-0.021
 comes-0.019
 Now-0.018
 Everything-0.018
 People-0.018
Token table
Token ID3084
Feature activation+3.23e-01

Pos logit contributions
 unatt+0.431
anut+0.413
handed+0.409
ety+0.391
broken+0.386
Neg logit contributions
 happens-0.441
 comes-0.413
 Everyone-0.387
 People-0.386
 Show-0.369
Token and
Token ID290
Feature activation+1.27e-02

Pos logit contributions
side+0.035
handed+0.035
anut+0.034
 distance+0.033
 unatt+0.033
Neg logit contributions
 happens-0.027
 comes-0.026
 can-0.024
 Now-0.023
 It-0.022
Token he
Token ID339
Feature activation+2.24e-01

Pos logit contributions
side+0.001
 distance+0.001
handed+0.001
 unatt+0.001
thank+0.001
Neg logit contributions
 happens-0.001
 comes-0.001
 Everything-0.001
apy-0.001
oke-0.001
Token made
Token ID925
Feature activation+5.23e-01

Pos logit contributions
side+0.025
 distance+0.023
handed+0.023
 unatt+0.022
thank+0.021
Neg logit contributions
 happens-0.023
 comes-0.020
 Everything-0.019
apy-0.019
oke-0.019
Token sure
Token ID1654
Feature activation-3.17e-01

Pos logit contributions
side+0.061
handed+0.056
 distance+0.055
 unatt+0.054
thank+0.051
Neg logit contributions
 happens-0.051
 comes-0.045
oke-0.042
 Everything-0.042
apy-0.042
TokenI
Token ID40
Feature activation+3.68e-01

Pos logit contributions
side+0.033
 unatt+0.031
handed+0.030
ge+0.030
 spine+0.030
Neg logit contributions
 happens-0.023
 It-0.020
 Everything-0.020
 Now-0.020
apy-0.020
Token fell
Token ID3214
Feature activation-9.41e-02

Pos logit contributions
side+0.042
 distance+0.040
handed+0.040
 unatt+0.039
thank+0.038
Neg logit contributions
 happens-0.034
 Everything-0.030
apy-0.029
 comes-0.028
 Now-0.028
Token off
Token ID572
Feature activation-7.76e-01

Pos logit contributions
 happens+0.010
 comes+0.009
 Everything+0.008
 Now+0.008
 can+0.008
Neg logit contributions
side-0.010
handed-0.010
 distance-0.009
 unatt-0.009
anut-0.009
Token the
Token ID262
Feature activation-2.57e-01

Pos logit contributions
 happens+0.072
 comes+0.062
 Everything+0.059
apy+0.058
 Now+0.056
Neg logit contributions
side-0.096
handed-0.088
 distance-0.088
 unatt-0.087
thank-0.082
Token clothes
Token ID8242
Feature activation+1.80e-01

Pos logit contributions
 happens+0.024
 comes+0.021
 can+0.020
 Everything+0.019
 Now+0.019
Neg logit contributions
side-0.030
ge-0.028
handed-0.028
thank-0.026
 unatt-0.026
Tokenline
Token ID1370
Feature activation+4.47e+00

Pos logit contributions
side+0.021
handed+0.020
 distance+0.019
 unatt+0.018
anut+0.018
Neg logit contributions
 happens-0.016
 comes-0.015
 can-0.014
 I-0.014
 It-0.013
Token and
Token ID290
Feature activation-1.15e-01

Pos logit contributions
side+0.480
handed+0.440
 distance+0.431
 unatt+0.428
ge+0.404
Neg logit contributions
 happens-0.473
 comes-0.431
 Everything-0.401
apy-0.393
 Now-0.391
Token I
Token ID314
Feature activation-2.78e-01

Pos logit contributions
 happens+0.010
oke+0.008
 comes+0.008
 Everything+0.008
apy+0.008
Neg logit contributions
side-0.014
 unatt-0.014
 distance-0.014
handed-0.013
 bank-0.013
Token can
Token ID460
Feature activation-4.44e-01

Pos logit contributions
 happens+0.025
 comes+0.021
 Everything+0.020
apy+0.020
 Now+0.019
Neg logit contributions
side-0.035
handed-0.032
 distance-0.032
 unatt-0.032
thank-0.030
Token't
Token ID470
Feature activation-4.75e-01

Pos logit contributions
 happens+0.046
 comes+0.041
 Everything+0.039
apy+0.039
 Now+0.037
Neg logit contributions
side-0.051
handed-0.046
 distance-0.045
 unatt-0.045
ge-0.042
Token get
Token ID651
Feature activation+7.26e-03

Pos logit contributions
 happens+0.051
 comes+0.046
apy+0.043
 Everything+0.043
 It+0.043
Neg logit contributions
side-0.054
handed-0.045
 distance-0.044
 unatt-0.043
ge-0.041
Token his
Token ID465
Feature activation+8.05e-02

Pos logit contributions
side+0.005
handed+0.004
 distance+0.004
 unatt+0.004
ge+0.004
Neg logit contributions
 happens-0.005
 comes-0.005
apy-0.005
 Everything-0.005
 Now-0.005
Token hunger
Token ID16460
Feature activation-2.09e-02

Pos logit contributions
side+0.009
handed+0.008
 unatt+0.007
 distance+0.007
ge+0.007
Neg logit contributions
 happens-0.009
 comes-0.008
apy-0.008
 Everything-0.008
 Now-0.007
Token began
Token ID2540
Feature activation-4.99e-01

Pos logit contributions
 happens+0.002
 comes+0.002
apy+0.002
 Everything+0.002
 can+0.002
Neg logit contributions
side-0.002
handed-0.002
 distance-0.002
 unatt-0.002
anut-0.002
Token to
Token ID284
Feature activation+1.42e-01

Pos logit contributions
 happens+0.051
 comes+0.046
apy+0.043
 Everything+0.041
 It+0.040
Neg logit contributions
side-0.056
handed-0.054
 unatt-0.051
 distance-0.051
anut-0.049
Token sub
Token ID850
Feature activation-2.77e-01

Pos logit contributions
side+0.016
handed+0.015
anut+0.013
keeping+0.013
 unatt+0.013
Neg logit contributions
 happens-0.015
 comes-0.014
apy-0.013
 can-0.013
 Everything-0.012
Tokenside
Token ID1589
Feature activation+4.44e+00

Pos logit contributions
 Everyone+0.042
 Now+0.042
 It+0.040
 Everything+0.040
 happens+0.040
Neg logit contributions
anut-0.009
 unatt-0.009
 distance-0.008
 diet-0.008
 spine-0.008
Token.
Token ID13
Feature activation-4.79e-02

Pos logit contributions
 unatt+0.512
handed+0.501
side+0.494
anut+0.491
 distance+0.480
Neg logit contributions
 happens-0.363
 comes-0.320
 Everyone-0.294
apy-0.287
 We-0.286
Token
Token ID198
Feature activation-2.99e-02

Pos logit contributions
 happens+0.006
 comes+0.005
oms+0.005
ours+0.004
apy+0.004
Neg logit contributions
side-0.005
handed-0.004
 distance-0.004
thank-0.004
 bank-0.004
Token
Token ID198
Feature activation-4.96e-02

Pos logit contributions
 happens+0.003
 comes+0.003
 Everything+0.003
apy+0.003
ours+0.003
Neg logit contributions
side-0.003
handed-0.003
 distance-0.003
 unatt-0.003
thank-0.003
TokenWhen
Token ID2215
Feature activation+1.28e-01

Pos logit contributions
 happens+0.006
 comes+0.006
 Everything+0.005
apy+0.005
 Now+0.005
Neg logit contributions
side-0.004
handed-0.004
 distance-0.004
 unatt-0.004
thank-0.004
Token the
Token ID262
Feature activation-3.12e-01

Pos logit contributions
side+0.013
 distance+0.013
 unatt+0.011
handed+0.011
 bank+0.010
Neg logit contributions
 happens-0.014
oke-0.013
ours-0.013
icks-0.012
oms-0.012
Token down
Token ID866
Feature activation+5.64e-01

Pos logit contributions
 happens+0.068
 comes+0.061
 Everything+0.058
apy+0.058
 Now+0.057
Neg logit contributions
side-0.062
handed-0.056
 distance-0.055
 unatt-0.055
ge-0.051
Token side
Token ID1735
Feature activation+5.25e-01

Pos logit contributions
side+0.060
handed+0.057
 unatt+0.054
 distance+0.053
anut+0.051
Neg logit contributions
 happens-0.060
 comes-0.055
apy-0.051
 Everything-0.050
 Now-0.050
Token-
Token ID12
Feature activation+5.40e-01

Pos logit contributions
side+0.054
handed+0.050
 unatt+0.049
 distance+0.049
thank+0.046
Neg logit contributions
 happens-0.056
 comes-0.051
apy-0.048
 Everything-0.048
 Now-0.047
Tokenby
Token ID1525
Feature activation+1.68e-02

Pos logit contributions
 spine+0.049
 unatt+0.048
 former+0.047
side+0.046
 distance+0.046
Neg logit contributions
 happens-0.053
 comes-0.050
 Now-0.049
 Everything-0.048
 turns-0.048
Token-
Token ID12
Feature activation+5.37e-01

Pos logit contributions
side+0.002
handed+0.002
 distance+0.002
 unatt+0.002
thank+0.001
Neg logit contributions
 happens-0.002
 comes-0.002
 Everything-0.002
apy-0.002
 Now-0.002
Tokenside
Token ID1589
Feature activation+4.44e+00

Pos logit contributions
 spine+0.050
 unatt+0.048
 former+0.047
side+0.047
 distance+0.047
Neg logit contributions
 happens-0.050
 comes-0.048
 Now-0.047
 turns-0.046
 Everything-0.046
Token.
Token ID13
Feature activation-7.94e-02

Pos logit contributions
 unatt+0.512
handed+0.500
side+0.482
anut+0.475
thank+0.470
Neg logit contributions
 happens-0.352
 comes-0.313
apy-0.299
 Everyone-0.294
 We-0.288
Token They
Token ID1119
Feature activation-2.28e-01

Pos logit contributions
 happens+0.009
 comes+0.007
apy+0.007
oke+0.007
oms+0.007
Neg logit contributions
side-0.009
 distance-0.008
handed-0.008
 bank-0.007
 unatt-0.007
Token were
Token ID547
Feature activation+3.79e-01

Pos logit contributions
 happens+0.018
 comes+0.016
apy+0.015
 Everything+0.015
 turns+0.014
Neg logit contributions
side-0.030
handed-0.028
 unatt-0.027
 distance-0.027
ge-0.026
Token having
Token ID1719
Feature activation-1.46e-01

Pos logit contributions
side+0.034
 distance+0.030
handed+0.030
 unatt+0.030
thank+0.027
Neg logit contributions
 happens-0.048
 comes-0.043
 Everything-0.041
apy-0.041
 Now-0.040
Token so
Token ID523
Feature activation+1.05e-01

Pos logit contributions
 happens+0.019
 comes+0.017
apy+0.016
oke+0.016
ours+0.016
Neg logit contributions
side-0.013
 distance-0.011
handed-0.011
 unatt-0.011
thank-0.010
Token perfume
Token ID40094
Feature activation+1.55e-01

Pos logit contributions
 happens+0.030
 comes+0.027
 Now+0.026
 Everything+0.026
 turns+0.025
Neg logit contributions
side-0.037
ge-0.035
 unatt-0.035
handed-0.034
 distance-0.033
Token bottle
Token ID9294
Feature activation+4.99e-01

Pos logit contributions
side+0.016
handed+0.016
 distance+0.015
 unatt+0.015
ge+0.014
Neg logit contributions
 happens-0.016
 comes-0.014
apy-0.014
 Everything-0.013
 can-0.013
Token on
Token ID319
Feature activation+4.28e-02

Pos logit contributions
side+0.055
handed+0.053
 unatt+0.051
 distance+0.051
ge+0.050
Neg logit contributions
 happens-0.046
 comes-0.042
apy-0.040
 Everything-0.039
 It-0.038
Token her
Token ID607
Feature activation-2.88e-02

Pos logit contributions
handed+0.005
 unatt+0.005
side+0.005
 distance+0.005
ge+0.005
Neg logit contributions
 It-0.003
 happens-0.003
 comes-0.003
 Now-0.003
 Everyone-0.003
Token bed
Token ID3996
Feature activation+1.71e-01

Pos logit contributions
 happens+0.003
 It+0.003
 can+0.003
 comes+0.002
 Show+0.002
Neg logit contributions
side-0.003
handed-0.003
ge-0.003
thank-0.003
 distance-0.003
Tokenside
Token ID1589
Feature activation+4.42e+00

Pos logit contributions
anut+0.019
handed+0.019
ge+0.019
 distance+0.019
 unatt+0.018
Neg logit contributions
 happens-0.013
 Now-0.012
 comes-0.012
 It-0.012
 Everyone-0.011
Token table
Token ID3084
Feature activation+2.44e-01

Pos logit contributions
 unatt+0.413
handed+0.380
anut+0.363
ge+0.354
 distance+0.353
Neg logit contributions
 happens-0.435
 comes-0.418
 Everyone-0.400
 People-0.395
apy-0.383
Token and
Token ID290
Feature activation-1.60e-01

Pos logit contributions
handed+0.029
anut+0.028
 distance+0.028
side+0.028
 unatt+0.027
Neg logit contributions
 happens-0.017
 comes-0.016
 can-0.015
 of-0.015
 Now-0.015
Token every
Token ID790
Feature activation-5.74e-03

Pos logit contributions
 happens+0.018
 comes+0.015
 Everything+0.015
apy+0.015
oke+0.014
Neg logit contributions
side-0.017
handed-0.015
 distance-0.015
 unatt-0.015
thank-0.014
Token night
Token ID1755
Feature activation+1.43e-01

Pos logit contributions
 happens+0.001
 comes+0.001
 Everything+0.001
apy+0.001
 Now+0.001
Neg logit contributions
side-0.001
handed-0.001
 distance-0.001
 unatt-0.001
thank-0.000
Token she
Token ID673
Feature activation+4.66e-01

Pos logit contributions
side+0.017
handed+0.016
 distance+0.015
 unatt+0.015
ge+0.014
Neg logit contributions
 happens-0.014
 comes-0.012
 Everything-0.011
apy-0.011
 Now-0.011
Token kept
Token ID4030
Feature activation-5.70e-01

Pos logit contributions
side+0.004
 distance+0.003
handed+0.003
 unatt+0.003
thank+0.003
Neg logit contributions
 happens-0.003
 comes-0.003
 Everything-0.003
oke-0.003
apy-0.003
Token it
Token ID340
Feature activation-5.87e-01

Pos logit contributions
 happens+0.054
 comes+0.048
 Everything+0.044
apy+0.044
 Now+0.043
Neg logit contributions
side-0.070
handed-0.065
 distance-0.064
 unatt-0.063
ge-0.059
Token on
Token ID319
Feature activation+6.25e-02

Pos logit contributions
 happens+0.042
apy+0.042
 can+0.041
 comes+0.040
 of+0.039
Neg logit contributions
handed-0.073
side-0.067
 distance-0.063
thank-0.062
keeping-0.061
Token her
Token ID607
Feature activation+1.98e-02

Pos logit contributions
handed+0.009
anut+0.008
ppa+0.008
 unatt+0.008
broken+0.008
Neg logit contributions
 It-0.003
 happens-0.003
 Everyone-0.003
 comes-0.003
 Now-0.003
Token bed
Token ID3996
Feature activation+1.55e-01

Pos logit contributions
side+0.002
handed+0.002
ge+0.002
 distance+0.002
thank+0.002
Neg logit contributions
 happens-0.002
 can-0.002
 comes-0.002
 It-0.002
 turns-0.002
Tokenside
Token ID1589
Feature activation+4.41e+00

Pos logit contributions
 distance+0.019
anut+0.019
handed+0.019
ge+0.018
ety+0.018
Neg logit contributions
 of-0.010
 happens-0.009
ay-0.009
 Everyone-0.009
 can-0.008
Token table
Token ID3084
Feature activation+2.91e-01

Pos logit contributions
 unatt+0.444
handed+0.435
ge+0.424
anut+0.405
isk+0.402
Neg logit contributions
 happens-0.387
 comes-0.354
 Everyone-0.348
 People-0.341
 of-0.338
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
handed+0.036
 distance+0.035
anut+0.033
side+0.032
 unatt+0.032
Neg logit contributions
 happens-0.019
 comes-0.019
 can-0.018
 of-0.018
apy-0.016
Token Every
Token ID3887
Feature activation-1.01e-01

Pos logit contributions
 happens+0.022
 comes+0.019
apy+0.017
oke+0.017
 Everything+0.017
Neg logit contributions
side-0.029
handed-0.026
 distance-0.026
 unatt-0.025
thank-0.024
Token night
Token ID1755
Feature activation+5.31e-02

Pos logit contributions
 happens+0.011
 comes+0.010
 Everything+0.010
apy+0.010
 Now+0.009
Neg logit contributions
side-0.010
handed-0.010
 distance-0.009
 unatt-0.009
ge-0.009
Token before
Token ID878
Feature activation-3.53e-01

Pos logit contributions
side+0.007
 distance+0.006
handed+0.006
 unatt+0.006
thank+0.006
Neg logit contributions
 happens-0.005
 comes-0.004
 Everything-0.004
apy-0.004
ours-0.004
Token a
Token ID257
Feature activation+1.02e-01

Pos logit contributions
 happens+0.044
oke+0.041
 comes+0.039
apy+0.038
ours+0.038
Neg logit contributions
side-0.035
handed-0.032
 distance-0.031
 unatt-0.030
thank-0.029
Token walk
Token ID2513
Feature activation+3.78e-01

Pos logit contributions
side+0.010
handed+0.009
 distance+0.009
 unatt+0.009
thank+0.008
Neg logit contributions
 happens-0.012
 comes-0.011
 Everything-0.010
apy-0.010
 Now-0.010
Token along
Token ID1863
Feature activation-3.22e-01

Pos logit contributions
side+0.043
handed+0.042
 unatt+0.040
anut+0.040
 distance+0.038
Neg logit contributions
 happens-0.032
 comes-0.029
 It-0.028
 of-0.027
 We-0.027
Token the
Token ID262
Feature activation-1.20e-01

Pos logit contributions
 happens+0.025
 comes+0.023
 It+0.022
 Everyone+0.022
 Now+0.021
Neg logit contributions
side-0.039
anut-0.038
handed-0.038
 unatt-0.037
 spine-0.037
Token shore
Token ID15191
Feature activation+2.62e-01

Pos logit contributions
 happens+0.015
 comes+0.013
 Everything+0.013
 People+0.012
 Everyone+0.012
Neg logit contributions
side-0.011
 unatt-0.010
handed-0.009
thank-0.009
ge-0.009
Tokenline
Token ID1370
Feature activation+4.41e+00

Pos logit contributions
side+0.026
handed+0.024
 unatt+0.024
 distance+0.023
thank+0.022
Neg logit contributions
 happens-0.029
 comes-0.027
 Everything-0.024
 Now-0.024
 People-0.024
Token.
Token ID13
Feature activation+6.84e-02

Pos logit contributions
side+0.467
handed+0.443
 unatt+0.438
 distance+0.425
anut+0.417
Neg logit contributions
 happens-0.419
 comes-0.409
 Everything-0.371
 It-0.369
 Everyone-0.367
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
side+0.007
 distance+0.006
handed+0.006
thank+0.006
 unatt+0.005
Neg logit contributions
 happens-0.008
 comes-0.007
oke-0.007
apy-0.006
ours-0.006
Token
Token ID198
Feature activation+8.17e-02

Pos logit contributions
side+0.013
 unatt+0.012
handed+0.012
 distance+0.012
thank+0.012
Neg logit contributions
 happens-0.009
 comes-0.008
 Everything-0.008
apy-0.007
 Now-0.007
TokenBut
Token ID1537
Feature activation+4.61e-01

Pos logit contributions
side+0.008
 unatt+0.007
handed+0.007
 distance+0.007
ge+0.007
Neg logit contributions
 happens-0.010
 comes-0.009
 Everything-0.009
 Now-0.008
apy-0.008
Token something
Token ID1223
Feature activation-8.11e-02

Pos logit contributions
side+0.041
handed+0.038
 distance+0.037
 unatt+0.036
ge+0.033
Neg logit contributions
 happens-0.058
 comes-0.052
 Everything-0.050
apy-0.050
 Now-0.049
Token together
Token ID1978
Feature activation-6.38e-01

Pos logit contributions
 happens+0.060
 comes+0.055
 Everything+0.053
apy+0.053
 Now+0.052
Neg logit contributions
side-0.042
handed-0.040
 unatt-0.039
 distance-0.037
ge-0.035
Token side
Token ID1735
Feature activation+4.75e-01

Pos logit contributions
 happens+0.063
 comes+0.058
apy+0.055
 Everything+0.054
 It+0.052
Neg logit contributions
handed-0.069
side-0.067
 unatt-0.064
 distance-0.062
thank-0.061
Token-
Token ID12
Feature activation+3.95e-01

Pos logit contributions
side+0.050
handed+0.047
 unatt+0.046
 distance+0.045
thank+0.044
Neg logit contributions
 happens-0.050
 comes-0.045
apy-0.043
 Everything-0.042
 Now-0.041
Tokenby
Token ID1525
Feature activation-8.03e-02

Pos logit contributions
side+0.036
 unatt+0.036
 distance+0.036
handed+0.034
 spine+0.034
Neg logit contributions
 happens-0.043
 comes-0.038
apy-0.037
 Everything-0.036
 Now-0.036
Token-
Token ID12
Feature activation+4.05e-01

Pos logit contributions
 happens+0.009
 comes+0.008
 Everything+0.008
apy+0.008
 Now+0.008
Neg logit contributions
side-0.008
 distance-0.007
handed-0.007
 unatt-0.007
ge-0.007
Tokenside
Token ID1589
Feature activation+4.39e+00

Pos logit contributions
side+0.038
 distance+0.038
 unatt+0.037
handed+0.036
 spine+0.036
Neg logit contributions
 happens-0.042
 comes-0.038
apy-0.037
 Everything-0.036
 turns-0.036
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
 unatt+0.494
handed+0.491
side+0.479
anut+0.465
 distance+0.461
Neg logit contributions
 happens-0.374
 comes-0.338
apy-0.307
 Everyone-0.307
 People-0.299
Token
Token ID220
Feature activation+1.58e-01

Pos logit contributions
 happens+0.017
 comes+0.014
apy+0.013
ours+0.012
oke+0.012
Neg logit contributions
side-0.024
handed-0.022
 distance-0.021
 unatt-0.020
thank-0.020
Token
Token ID198
Feature activation-2.24e-01

Pos logit contributions
side+0.016
handed+0.015
 distance+0.013
ge+0.013
 unatt+0.013
Neg logit contributions
 happens-0.019
 comes-0.016
apy-0.015
 turns-0.014
ours-0.014
Token
Token ID198
Feature activation-2.30e-01

Pos logit contributions
 happens+0.021
 comes+0.018
 Everything+0.017
apy+0.017
 Now+0.017
Neg logit contributions
side-0.027
handed-0.025
 unatt-0.025
 distance-0.025
ge-0.023
TokenNo
Token ID2949
Feature activation-6.84e-01

Pos logit contributions
 happens+0.027
 comes+0.025
 Everything+0.024
apy+0.024
 Now+0.023
Neg logit contributions
side-0.022
handed-0.020
 unatt-0.020
 distance-0.019
ge-0.018
Token kept
Token ID4030
Feature activation-4.83e-01

Pos logit contributions
side+0.021
handed+0.020
 distance+0.019
 unatt+0.019
ge+0.018
Neg logit contributions
 happens-0.018
 comes-0.016
 Everything-0.015
apy-0.015
 Now-0.014
Token it
Token ID340
Feature activation-5.30e-01

Pos logit contributions
 happens+0.041
 comes+0.035
 Everything+0.033
apy+0.032
 Now+0.031
Neg logit contributions
side-0.064
handed-0.059
 distance-0.059
 unatt-0.058
ge-0.055
Token by
Token ID416
Feature activation+8.01e-02

Pos logit contributions
 happens+0.038
 can+0.038
 comes+0.037
apy+0.035
 of+0.035
Neg logit contributions
handed-0.066
side-0.061
 distance-0.057
thank-0.056
keeping-0.055
Token her
Token ID607
Feature activation+7.15e-02

Pos logit contributions
side+0.009
handed+0.008
 unatt+0.008
 distance+0.008
 bank+0.007
Neg logit contributions
 It-0.008
 happens-0.008
 Now-0.007
 comes-0.007
 Everything-0.007
Token bed
Token ID3996
Feature activation+1.57e-01

Pos logit contributions
side+0.009
handed+0.008
ge+0.008
 distance+0.008
anut+0.007
Neg logit contributions
 happens-0.006
 can-0.005
 turns-0.005
 comes-0.005
 Show-0.005
Tokenside
Token ID1589
Feature activation+4.39e+00

Pos logit contributions
handed+0.018
 distance+0.017
anut+0.017
keeping+0.017
ge+0.017
Neg logit contributions
 happens-0.013
 can-0.011
 comes-0.011
 of-0.011
 People-0.011
Token every
Token ID790
Feature activation+2.40e-02

Pos logit contributions
handed+0.454
 unatt+0.441
ge+0.435
anut+0.426
 distance+0.409
Neg logit contributions
 happens-0.383
 comes-0.355
 Everyone-0.342
 Show-0.341
 People-0.337
Token night
Token ID1755
Feature activation+1.75e-01

Pos logit contributions
side+0.003
handed+0.003
anut+0.002
 unatt+0.002
ge+0.002
Neg logit contributions
 happens-0.002
 comes-0.002
 Everything-0.002
 Now-0.002
apy-0.002
Token.
Token ID13
Feature activation-1.74e-01

Pos logit contributions
handed+0.017
side+0.017
ge+0.016
anut+0.016
 unatt+0.015
Neg logit contributions
 happens-0.020
 comes-0.018
 Everything-0.017
 It-0.017
 There-0.016
Token One
Token ID1881
Feature activation-1.92e-01

Pos logit contributions
 happens+0.017
 comes+0.014
 Everything+0.014
apy+0.013
 Now+0.013
Neg logit contributions
side-0.021
handed-0.020
 distance-0.019
 unatt-0.019
thank-0.018
Token evening
Token ID6180
Feature activation+3.21e-03

Pos logit contributions
 happens+0.020
 comes+0.018
 Everything+0.017
apy+0.017
 Now+0.016
Neg logit contributions
side-0.022
handed-0.020
 unatt-0.019
 distance-0.019
ge-0.019
Token Once
Token ID4874
Feature activation-9.55e-01

Pos logit contributions
 happens+0.011
 comes+0.009
apy+0.009
oke+0.008
 importantly+0.008
Neg logit contributions
side-0.014
 distance-0.013
handed-0.012
 bank-0.012
ro-0.012
Token they
Token ID484
Feature activation+2.09e-01

Pos logit contributions
 happens+0.075
 Everything+0.062
 comes+0.061
apy+0.060
 Now+0.058
Neg logit contributions
side-0.132
 distance-0.118
handed-0.118
 unatt-0.115
ge-0.113
Token reached
Token ID4251
Feature activation-4.29e-01

Pos logit contributions
side+0.023
 distance+0.022
 unatt+0.021
handed+0.021
ge+0.020
Neg logit contributions
 happens-0.022
apy-0.018
 comes-0.018
 Everything-0.018
 Now-0.018
Token the
Token ID262
Feature activation-2.19e-01

Pos logit contributions
 happens+0.049
oke+0.042
 Now+0.041
apy+0.041
 comes+0.040
Neg logit contributions
side-0.043
 distance-0.042
ge-0.038
 unatt-0.037
handed-0.037
Token other
Token ID584
Feature activation+1.20e+00

Pos logit contributions
 happens+0.026
 comes+0.023
apy+0.022
 Everything+0.022
 Now+0.022
Neg logit contributions
side-0.022
 distance-0.020
handed-0.019
 unatt-0.019
thank-0.017
Tokenside
Token ID1589
Feature activation+4.38e+00

Pos logit contributions
side+0.062
 distance+0.060
handed+0.050
 unatt+0.049
 bank+0.043
Neg logit contributions
 happens-0.199
apy-0.181
 comes-0.178
oke-0.175
 Everything-0.174
Token,
Token ID11
Feature activation-3.55e-01

Pos logit contributions
side+0.587
handed+0.544
 distance+0.540
 unatt+0.537
thank+0.508
Neg logit contributions
 happens-0.358
 comes-0.304
 Everything-0.283
apy-0.280
 Now-0.271
Token they
Token ID484
Feature activation+2.03e-01

Pos logit contributions
 happens+0.032
 comes+0.026
 Now+0.025
 Everything+0.025
icks+0.025
Neg logit contributions
side-0.044
 distance-0.042
handed-0.039
 unatt-0.038
ge-0.038
Token stepped
Token ID10764
Feature activation-3.76e-01

Pos logit contributions
side+0.024
handed+0.022
 distance+0.021
 unatt+0.021
thank+0.020
Neg logit contributions
 happens-0.020
 comes-0.018
 Everything-0.017
apy-0.017
 Now-0.016
Token off
Token ID572
Feature activation-7.49e-01

Pos logit contributions
 happens+0.044
 comes+0.040
 Everything+0.038
apy+0.038
 Now+0.037
Neg logit contributions
side-0.037
handed-0.033
 distance-0.033
 unatt-0.033
thank-0.030
Token the
Token ID262
Feature activation-2.00e-01

Pos logit contributions
 happens+0.077
 comes+0.066
 Everything+0.064
apy+0.062
 Now+0.061
Neg logit contributions
side-0.087
 distance-0.079
handed-0.077
 unatt-0.076
ge-0.073
Token back
Token ID736
Feature activation+2.44e-01

Pos logit contributions
 happens+0.022
 comes+0.020
 Everything+0.018
apy+0.018
 Now+0.018
Neg logit contributions
side-0.019
handed-0.018
 distance-0.018
 unatt-0.018
ge-0.016
Token on
Token ID319
Feature activation+1.04e-01

Pos logit contributions
side+0.024
handed+0.021
 distance+0.021
 unatt+0.021
thank+0.019
Neg logit contributions
 happens-0.029
 comes-0.026
 Everything-0.025
apy-0.025
 Now-0.024
Token the
Token ID262
Feature activation-1.61e-01

Pos logit contributions
handed+0.013
 unatt+0.013
side+0.013
anut+0.013
ge+0.013
Neg logit contributions
 happens-0.008
 It-0.007
 comes-0.007
 Now-0.007
 turns-0.006
Token zip
Token ID19974
Feature activation+1.02e-01

Pos logit contributions
 happens+0.017
 comes+0.015
 can+0.015
 Everything+0.014
 Now+0.014
Neg logit contributions
side-0.016
handed-0.015
ge-0.015
thank-0.014
 unatt-0.014
Token-
Token ID12
Feature activation+5.74e-01

Pos logit contributions
side+0.011
handed+0.010
 unatt+0.010
anut+0.010
 distance+0.010
Neg logit contributions
 happens-0.010
 comes-0.009
 Everything-0.008
 It-0.008
apy-0.008
Tokenline
Token ID1370
Feature activation+4.37e+00

Pos logit contributions
 unatt+0.061
 distance+0.060
side+0.059
 spine+0.059
ge+0.055
Neg logit contributions
 happens-0.056
 Now-0.048
 comes-0.048
 Everything-0.047
 Everyone-0.046
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
side+0.489
handed+0.469
 unatt+0.460
 distance+0.450
anut+0.432
Neg logit contributions
 happens-0.406
 comes-0.385
 Everything-0.356
 It-0.349
apy-0.345
Token z
Token ID1976
Feature activation-7.10e-02

Pos logit contributions
 happens+0.021
 comes+0.020
apy+0.019
 Now+0.019
 Everything+0.019
Neg logit contributions
side-0.019
handed-0.018
 unatt-0.017
 distance-0.017
ge-0.017
Tokenipping
Token ID4501
Feature activation+8.25e-01

Pos logit contributions
 happens+0.007
 comes+0.006
 Everything+0.006
apy+0.006
 People+0.005
Neg logit contributions
side-0.009
handed-0.008
 unatt-0.008
 distance-0.007
thank-0.007
Token all
Token ID477
Feature activation-3.85e-01

Pos logit contributions
side+0.098
handed+0.090
 distance+0.089
 unatt+0.089
ge+0.083
Neg logit contributions
 happens-0.080
 comes-0.070
 Everything-0.066
apy-0.066
 Now-0.064
Token around
Token ID1088
Feature activation+3.57e-02

Pos logit contributions
 happens+0.035
 comes+0.031
 Everything+0.029
apy+0.028
 Now+0.028
Neg logit contributions
side-0.047
handed-0.044
 unatt-0.044
 distance-0.043
ge-0.041
Token the
Token ID262
Feature activation-2.44e-01

Pos logit contributions
 happens+0.011
 comes+0.010
 Everything+0.009
apy+0.009
 Now+0.009
Neg logit contributions
side-0.013
handed-0.012
 unatt-0.012
 distance-0.012
ge-0.012
Token heart
Token ID2612
Feature activation-6.35e-01

Pos logit contributions
 happens+0.022
 comes+0.019
 Everything+0.018
apy+0.017
 can+0.017
Neg logit contributions
side-0.031
handed-0.028
ge-0.027
 unatt-0.026
anut-0.026
Token on
Token ID319
Feature activation+5.51e-02

Pos logit contributions
 happens+0.056
 comes+0.052
apy+0.051
oms+0.045
 can+0.044
Neg logit contributions
side-0.072
handed-0.072
 distance-0.066
thank-0.065
 unatt-0.065
Token her
Token ID607
Feature activation+1.25e-02

Pos logit contributions
handed+0.007
 unatt+0.007
side+0.007
ge+0.007
 distance+0.007
Neg logit contributions
 happens-0.004
 It-0.004
 comes-0.004
 Now-0.004
 Everyone-0.004
Token bed
Token ID3996
Feature activation+1.21e-01

Pos logit contributions
side+0.001
handed+0.001
thank+0.001
ge+0.001
anut+0.001
Neg logit contributions
 happens-0.001
 comes-0.001
 can-0.001
 Everything-0.001
 It-0.001
Tokenside
Token ID1589
Feature activation+4.36e+00

Pos logit contributions
anut+0.013
handed+0.013
ge+0.013
 distance+0.013
 unatt+0.012
Neg logit contributions
 happens-0.010
 comes-0.009
 It-0.009
 Now-0.009
apy-0.009
Token table
Token ID3084
Feature activation+2.17e-01

Pos logit contributions
 unatt+0.430
handed+0.415
anut+0.404
thank+0.397
side+0.392
Neg logit contributions
 happens-0.408
 comes-0.385
 Everyone-0.357
 People-0.352
 Show-0.348
Token.
Token ID13
Feature activation-5.93e-02

Pos logit contributions
handed+0.024
side+0.023
anut+0.022
 distance+0.022
 unatt+0.022
Neg logit contributions
 happens-0.019
 comes-0.018
apy-0.016
 can-0.016
 It-0.015
Token Each
Token ID5501
Feature activation+8.42e-02

Pos logit contributions
 happens+0.006
 comes+0.005
oke+0.005
 floats+0.005
apy+0.005
Neg logit contributions
side-0.007
handed-0.006
 distance-0.006
 sill-0.005
ro-0.005
Token night
Token ID1755
Feature activation+1.06e-01

Pos logit contributions
side+0.008
 distance+0.008
handed+0.007
 unatt+0.007
thank+0.007
Neg logit contributions
 happens-0.010
apy-0.009
 comes-0.009
oke-0.008
 Everything-0.008
Token,
Token ID11
Feature activation-2.75e-01

Pos logit contributions
side+0.012
 distance+0.011
 unatt+0.010
handed+0.010
thank+0.010
Neg logit contributions
 happens-0.010
ours-0.009
icks-0.009
oke-0.009
apy-0.009
Tokenrobe
Token ID25481
Feature activation+2.24e+00

Pos logit contributions
side+0.055
handed+0.051
 distance+0.049
 unatt+0.048
ge+0.047
Neg logit contributions
 happens-0.081
 comes-0.075
 Everything-0.069
apy-0.069
 Now-0.069
Token back
Token ID736
Feature activation+4.01e-02

Pos logit contributions
side+0.266
handed+0.260
anut+0.247
 unatt+0.241
keeping+0.238
Neg logit contributions
 happens-0.211
 comes-0.191
 can-0.171
 It-0.171
 Everything-0.167
Token onto
Token ID4291
Feature activation-4.77e-01

Pos logit contributions
side+0.004
handed+0.004
 unatt+0.004
 distance+0.004
thank+0.004
Neg logit contributions
 happens-0.004
 comes-0.004
apy-0.004
 Everything-0.003
 Now-0.003
Token the
Token ID262
Feature activation-2.87e-01

Pos logit contributions
 happens+0.026
 It+0.024
 Everyone+0.022
 comes+0.022
 Now+0.022
Neg logit contributions
side-0.074
 unatt-0.071
handed-0.070
thank-0.067
broken-0.067
Token clothes
Token ID8242
Feature activation+1.49e-01

Pos logit contributions
 happens+0.027
 comes+0.024
 can+0.023
 Everything+0.022
 Now+0.022
Neg logit contributions
side-0.035
handed-0.031
ge-0.030
 unatt-0.030
thank-0.029
Tokenline
Token ID1370
Feature activation+4.36e+00

Pos logit contributions
handed+0.018
side+0.017
anut+0.017
thank+0.016
 unatt+0.016
Neg logit contributions
 happens-0.012
 comes-0.012
 It-0.011
 can-0.011
oms-0.010
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
side+0.494
handed+0.488
 unatt+0.461
 distance+0.449
anut+0.443
Neg logit contributions
 happens-0.383
 comes-0.372
 It-0.346
apy-0.331
 Everything-0.330
Token He
Token ID679
Feature activation-3.74e-01

Pos logit contributions
 happens+0.018
 comes+0.016
oms+0.015
oke+0.014
ours+0.014
Neg logit contributions
side-0.015
handed-0.013
 distance-0.013
ro-0.013
 bank-0.012
Token used
Token ID973
Feature activation-1.86e-01

Pos logit contributions
 happens+0.039
 comes+0.034
 Everything+0.034
apy+0.032
 Now+0.032
Neg logit contributions
side-0.040
 distance-0.037
 unatt-0.036
handed-0.036
 bank-0.034
Token all
Token ID477
Feature activation-4.99e-01

Pos logit contributions
 happens+0.019
 comes+0.017
 Everything+0.016
oke+0.016
apy+0.016
Neg logit contributions
side-0.021
 distance-0.019
handed-0.019
 unatt-0.018
ge-0.018
Token his
Token ID465
Feature activation-2.37e-02

Pos logit contributions
 happens+0.050
oke+0.043
 comes+0.042
 Someone+0.041
 We+0.041
Neg logit contributions
side-0.056
 distance-0.053
handed-0.048
ge-0.047
 unatt-0.045
Token kept
Token ID4030
Feature activation-6.20e-01

Pos logit contributions
 happens+0.017
 comes+0.015
 Everything+0.015
apy+0.014
 Now+0.014
Neg logit contributions
side-0.015
handed-0.014
 distance-0.013
 unatt-0.013
thank-0.013
Token it
Token ID340
Feature activation-6.63e-01

Pos logit contributions
 happens+0.055
 comes+0.048
 Everything+0.044
apy+0.044
 Now+0.043
Neg logit contributions
side-0.080
handed-0.074
 distance-0.073
 unatt-0.073
thank-0.068
Token on
Token ID319
Feature activation+8.77e-03

Pos logit contributions
 happens+0.053
 can+0.052
 comes+0.052
apy+0.051
 of+0.050
Neg logit contributions
handed-0.074
side-0.072
thank-0.067
keeping-0.064
 unatt-0.063
Token her
Token ID607
Feature activation+1.30e-02

Pos logit contributions
handed+0.001
 unatt+0.001
side+0.001
anut+0.001
ppa+0.001
Neg logit contributions
 It-0.000
 comes-0.000
 Now-0.000
 happens-0.000
 Sam-0.000
Token bed
Token ID3996
Feature activation+1.51e-01

Pos logit contributions
side+0.001
handed+0.001
ge+0.001
thank+0.001
isk+0.001
Neg logit contributions
 happens-0.001
 can-0.001
 may-0.001
 comes-0.001
 is-0.001
Tokenside
Token ID1589
Feature activation+4.35e+00

Pos logit contributions
handed+0.019
ge+0.019
anut+0.019
ety+0.018
 distance+0.018
Neg logit contributions
 of-0.009
 happens-0.008
 It-0.008
ay-0.007
 can-0.007
Token table
Token ID3084
Feature activation+2.31e-01

Pos logit contributions
 unatt+0.449
handed+0.437
isk+0.408
ge+0.407
anut+0.407
Neg logit contributions
 happens-0.354
 comes-0.343
 of-0.329
 Everyone-0.316
 People-0.311
Token and
Token ID290
Feature activation-8.54e-02

Pos logit contributions
handed+0.030
anut+0.028
 distance+0.028
 unatt+0.028
side+0.028
Neg logit contributions
 comes-0.013
 happens-0.013
 of-0.012
 can-0.012
 It-0.011
Token looked
Token ID3114
Feature activation+1.82e-01

Pos logit contributions
 happens+0.009
 comes+0.008
 Everything+0.008
apy+0.007
 Now+0.007
Neg logit contributions
side-0.009
handed-0.009
 distance-0.009
 unatt-0.008
thank-0.008
Token at
Token ID379
Feature activation-3.18e-01

Pos logit contributions
side+0.021
 distance+0.019
handed+0.019
 unatt+0.019
thank+0.018
Neg logit contributions
 happens-0.018
 comes-0.016
 Everything-0.015
apy-0.015
 Now-0.015
Token it
Token ID340
Feature activation-6.51e-01

Pos logit contributions
 happens+0.019
 comes+0.015
 Everything+0.014
 It+0.014
 turns+0.013
Neg logit contributions
side-0.051
handed-0.047
 unatt-0.047
 distance-0.047
thank-0.046
Token as
Token ID355
Feature activation-7.00e-01

Pos logit contributions
side+0.020
handed+0.018
 distance+0.018
 unatt+0.017
thank+0.016
Neg logit contributions
 happens-0.023
 comes-0.021
apy-0.020
 Everything-0.020
 Now-0.019
Token he
Token ID339
Feature activation+4.52e-01

Pos logit contributions
 happens+0.072
 comes+0.064
 Everything+0.061
oke+0.059
 Now+0.058
Neg logit contributions
side-0.080
 distance-0.071
handed-0.071
 unatt-0.070
ge-0.067
Token explored
Token ID18782
Feature activation-6.40e-01

Pos logit contributions
side+0.053
handed+0.049
 distance+0.048
 unatt+0.048
 bank+0.046
Neg logit contributions
 happens-0.043
 Everything-0.037
apy-0.036
 comes-0.036
 Now-0.035
Token the
Token ID262
Feature activation-1.12e-01

Pos logit contributions
 happens+0.059
 comes+0.053
 Everything+0.048
apy+0.047
 It+0.047
Neg logit contributions
side-0.078
handed-0.073
 unatt-0.070
 distance-0.069
thank-0.067
Token shore
Token ID15191
Feature activation+2.33e-01

Pos logit contributions
 happens+0.013
 comes+0.012
 Everything+0.011
 Now+0.011
 can+0.011
Neg logit contributions
side-0.011
handed-0.010
ge-0.009
 unatt-0.009
 distance-0.009
Tokenline
Token ID1370
Feature activation+4.33e+00

Pos logit contributions
side+0.022
handed+0.021
 unatt+0.020
 distance+0.020
thank+0.019
Neg logit contributions
 happens-0.026
 comes-0.025
 Everything-0.023
apy-0.022
 Now-0.022
Token.
Token ID13
Feature activation-5.59e-02

Pos logit contributions
side+0.455
handed+0.441
 unatt+0.425
 distance+0.414
anut+0.400
Neg logit contributions
 happens-0.411
 comes-0.401
 Everything-0.370
 It-0.364
 People-0.362
Token
Token ID198
Feature activation+1.26e-01

Pos logit contributions
 happens+0.006
 comes+0.006
apy+0.005
oke+0.005
ours+0.005
Neg logit contributions
side-0.006
 distance-0.005
handed-0.005
 unatt-0.005
thank-0.005
Token
Token ID198
Feature activation+1.01e-01

Pos logit contributions
side+0.017
 unatt+0.016
handed+0.016
 distance+0.015
thank+0.015
Neg logit contributions
 happens-0.010
 comes-0.008
 Everything-0.008
 Now-0.008
 It-0.008
TokenOne
Token ID3198
Feature activation+5.12e-01

Pos logit contributions
side+0.008
 unatt+0.008
handed+0.008
 distance+0.007
keeping+0.007
Neg logit contributions
 happens-0.013
 Everything-0.012
 comes-0.012
 Now-0.012
 It-0.011
Token day
Token ID1110
Feature activation+1.13e-01

Pos logit contributions
side+0.044
 distance+0.041
handed+0.038
 unatt+0.037
thank+0.035
Neg logit contributions
 happens-0.067
 comes-0.060
 Everything-0.058
apy-0.057
oke-0.056
Token it
Token ID340
Feature activation-5.98e-01

Pos logit contributions
 happens+0.019
 comes+0.016
oke+0.016
 Everything+0.015
apy+0.015
Neg logit contributions
side-0.032
 distance-0.030
 unatt-0.029
handed-0.029
thank-0.028
Token would
Token ID561
Feature activation+2.59e-01

Pos logit contributions
 happens+0.055
 comes+0.048
 Everything+0.047
apy+0.047
 Now+0.045
Neg logit contributions
side-0.073
 distance-0.067
 unatt-0.066
handed-0.065
thank-0.062
Token hug
Token ID16225
Feature activation+8.78e-02

Pos logit contributions
side+0.035
handed+0.034
 unatt+0.031
 distance+0.030
keeping+0.029
Neg logit contributions
 happens-0.021
 comes-0.019
 can-0.018
 Everything-0.017
 It-0.017
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
side+0.011
handed+0.010
 distance+0.010
 unatt+0.010
ge+0.010
Neg logit contributions
 happens-0.008
 comes-0.007
 Everything-0.007
 Now-0.006
 It-0.006
Token shore
Token ID15191
Feature activation+1.12e-01

Pos logit contributions
 happens+0.021
 comes+0.018
 Everything+0.017
apy+0.017
 Now+0.017
Neg logit contributions
side-0.027
handed-0.025
 distance-0.025
 unatt-0.025
thank-0.023
Tokenline
Token ID1370
Feature activation+4.32e+00

Pos logit contributions
side+0.011
handed+0.010
 distance+0.010
 unatt+0.010
ge+0.009
Neg logit contributions
 happens-0.013
 comes-0.012
 Everything-0.011
apy-0.011
 Now-0.010
Token and
Token ID290
Feature activation+5.16e-03

Pos logit contributions
side+0.447
handed+0.414
 unatt+0.401
 distance+0.400
ge+0.375
Neg logit contributions
 happens-0.470
 comes-0.431
 Everything-0.405
apy-0.401
 Now-0.389
Token admire
Token ID21099
Feature activation+1.05e-01

Pos logit contributions
side+0.001
 distance+0.001
 unatt+0.000
handed+0.000
thank+0.000
Neg logit contributions
 happens-0.001
 comes-0.000
 Everything-0.000
oke-0.000
apy-0.000
Token the
Token ID262
Feature activation-2.12e-01

Pos logit contributions
side+0.012
 distance+0.012
 unatt+0.012
handed+0.011
ge+0.011
Neg logit contributions
 happens-0.010
 comes-0.009
oke-0.008
 Everything-0.008
apy-0.008
Token wrecked
Token ID44770
Feature activation-3.09e-02

Pos logit contributions
 happens+0.022
 comes+0.019
 Everything+0.018
apy+0.018
 Now+0.018
Neg logit contributions
side-0.024
handed-0.022
 distance-0.022
 unatt-0.021
thank-0.020
Token boats
Token ID16659
Feature activation-5.95e-02

Pos logit contributions
 happens+0.003
 comes+0.002
 Everything+0.002
apy+0.002
 Now+0.002
Neg logit contributions
side-0.004
 distance-0.004
handed-0.004
 unatt-0.003
ge-0.003
Token go
Token ID467
Feature activation-5.95e-01

Pos logit contributions
 happens+0.007
 comes+0.007
 Everything+0.006
apy+0.006
 Now+0.006
Neg logit contributions
side-0.011
handed-0.010
 unatt-0.010
 distance-0.009
ge-0.009
Token back
Token ID736
Feature activation+1.16e-01

Pos logit contributions
 happens+0.064
 comes+0.061
 Now+0.055
 Everything+0.054
 can+0.053
Neg logit contributions
side-0.064
handed-0.057
 unatt-0.056
 distance-0.054
thank-0.053
Token to
Token ID284
Feature activation+1.94e-01

Pos logit contributions
side+0.013
handed+0.011
 distance+0.011
 unatt+0.011
thank+0.011
Neg logit contributions
 happens-0.012
 comes-0.011
 Everything-0.010
apy-0.010
 Now-0.010
Token the
Token ID262
Feature activation-2.77e-01

Pos logit contributions
side+0.023
handed+0.021
 distance+0.021
 unatt+0.021
ge+0.019
Neg logit contributions
 happens-0.019
 comes-0.016
apy-0.016
 Everything-0.015
oke-0.015
Token pool
Token ID5933
Feature activation-3.09e-02

Pos logit contributions
 happens+0.028
 comes+0.026
 Everything+0.023
apy+0.023
 Now+0.023
Neg logit contributions
side-0.031
handed-0.028
ge-0.027
 unatt-0.027
thank-0.026
Tokenside
Token ID1589
Feature activation+4.31e+00

Pos logit contributions
 happens+0.002
 comes+0.002
 can+0.002
 We+0.002
oke+0.002
Neg logit contributions
side-0.004
handed-0.004
 unatt-0.004
 distance-0.004
ge-0.004
Token,
Token ID11
Feature activation-9.87e-02

Pos logit contributions
side+0.554
 unatt+0.541
handed+0.527
 distance+0.500
thank+0.496
Neg logit contributions
 happens-0.350
 comes-0.297
 says-0.265
 turns-0.262
 We-0.259
Token Mia
Token ID32189
Feature activation-7.94e-01

Pos logit contributions
 happens+0.009
apy+0.008
 Everything+0.007
 comes+0.007
 Now+0.007
Neg logit contributions
side-0.011
 distance-0.011
 unatt-0.011
handed-0.010
anut-0.010
Token.
Token ID13
Feature activation+3.95e-01

Pos logit contributions
 happens+0.088
 Everything+0.078
 comes+0.078
apy+0.077
 Now+0.074
Neg logit contributions
side-0.081
handed-0.074
 distance-0.073
 unatt-0.072
ge-0.068
Token Maybe
Token ID6674
Feature activation+5.90e-01

Pos logit contributions
side+0.050
 sill+0.049
 unatt+0.048
 spine+0.048
 distance+0.048
Neg logit contributions
 happens-0.031
 comes-0.023
oke-0.023
apy-0.023
 importantly-0.022
Token we
Token ID356
Feature activation+2.44e-01

Pos logit contributions
side+0.072
 distance+0.071
 unatt+0.070
handed+0.065
ge+0.063
Neg logit contributions
 happens-0.045
 Everything-0.042
oke-0.042
 Now-0.040
apy-0.040
Token when
Token ID618
Feature activation-1.30e-01

Pos logit contributions
 happens+0.037
 comes+0.032
 Everything+0.031
apy+0.030
oke+0.030
Neg logit contributions
side-0.043
 distance-0.039
handed-0.039
 unatt-0.038
thank-0.036
Token it
Token ID340
Feature activation-6.34e-01

Pos logit contributions
oke+0.016
ours+0.014
 happens+0.014
icks+0.014
oms+0.014
Neg logit contributions
 distance-0.012
side-0.012
 unatt-0.010
thank-0.010
 bank-0.010
Token hugged
Token ID41130
Feature activation-2.82e-01

Pos logit contributions
 happens+0.061
 comes+0.053
 Everything+0.053
apy+0.052
 Now+0.052
Neg logit contributions
side-0.073
 distance-0.068
 unatt-0.066
handed-0.065
ge-0.061
Token the
Token ID262
Feature activation-2.19e-01

Pos logit contributions
 happens+0.024
 comes+0.021
 Everything+0.019
apy+0.019
 Now+0.019
Neg logit contributions
side-0.037
handed-0.034
 distance-0.034
 unatt-0.034
ge-0.032
Token shore
Token ID15191
Feature activation+1.43e-01

Pos logit contributions
 happens+0.020
 comes+0.017
 Everything+0.016
apy+0.016
 Now+0.015
Neg logit contributions
side-0.028
handed-0.026
 distance-0.025
 unatt-0.025
thank-0.024
Tokenline
Token ID1370
Feature activation+4.30e+00

Pos logit contributions
side+0.017
 distance+0.016
 unatt+0.015
handed+0.015
 bank+0.014
Neg logit contributions
 happens-0.014
 comes-0.012
 Everything-0.012
 Now-0.011
apy-0.011
Token again
Token ID757
Feature activation+6.13e-01

Pos logit contributions
side+0.530
 distance+0.490
 unatt+0.475
handed+0.474
thank+0.446
Neg logit contributions
 happens-0.408
 comes-0.339
 Everything-0.319
 Now-0.319
apy-0.319
Token,
Token ID11
Feature activation-2.43e-01

Pos logit contributions
side+0.081
 distance+0.075
handed+0.074
 unatt+0.074
thank+0.071
Neg logit contributions
 happens-0.051
 comes-0.043
 Everything-0.041
apy-0.040
oke-0.039
Token it
Token ID340
Feature activation-5.98e-01

Pos logit contributions
 happens+0.022
oke+0.021
 comes+0.019
ours+0.019
 Now+0.018
Neg logit contributions
side-0.028
 distance-0.027
 unatt-0.026
handed-0.025
thank-0.024
Token noticed
Token ID6810
Feature activation+1.72e-01

Pos logit contributions
 happens+0.058
 comes+0.050
 Everything+0.046
apy+0.046
 Now+0.043
Neg logit contributions
side-0.073
handed-0.068
 unatt-0.065
 distance-0.065
ge-0.063
Token a
Token ID257
Feature activation-8.39e-03

Pos logit contributions
side+0.020
handed+0.018
 distance+0.018
 unatt+0.018
ge+0.017
Neg logit contributions
 happens-0.017
 comes-0.015
 Everything-0.014
apy-0.014
 Now-0.014
Token!"
Token ID2474
Feature activation-4.27e-02

Pos logit contributions
 happens+0.059
 Everything+0.055
apy+0.053
 People+0.050
 Everyone+0.050
Neg logit contributions
 distance-0.069
side-0.069
 unatt-0.068
handed-0.067
thank-0.063
Token
Token ID198
Feature activation+5.41e-02

Pos logit contributions
 happens+0.005
 comes+0.004
apy+0.004
 Everything+0.004
ours+0.004
Neg logit contributions
side-0.004
 distance-0.004
handed-0.004
 unatt-0.004
thank-0.004
Token
Token ID198
Feature activation+3.56e-02

Pos logit contributions
side+0.005
handed+0.005
 distance+0.005
 unatt+0.004
ge+0.004
Neg logit contributions
 happens-0.006
 comes-0.006
 Everything-0.005
apy-0.005
ours-0.005
Token"
Token ID1
Feature activation+3.04e-02

Pos logit contributions
side+0.005
 distance+0.004
handed+0.004
 unatt+0.004
ge+0.004
Neg logit contributions
 happens-0.003
 comes-0.003
 Everything-0.003
apy-0.002
ours-0.002
TokenA
Token ID32
Feature activation+4.75e-01

Pos logit contributions
side+0.003
handed+0.003
 unatt+0.003
 distance+0.003
ge+0.003
Neg logit contributions
 happens-0.003
 comes-0.003
apy-0.003
 Everything-0.003
 Now-0.003
Tokenww
Token ID1383
Feature activation-2.95e+00

Pos logit contributions
 distance+0.038
side+0.035
handed+0.032
 source+0.030
 tune+0.030
Neg logit contributions
 happens-0.056
ours-0.054
apy-0.053
 Everything-0.053
 comes-0.051
Token!"
Token ID2474
Feature activation-1.65e-01

Pos logit contributions
 Everything+0.340
 People+0.318
apy+0.314
 importantly+0.310
 Working+0.306
Neg logit contributions
side-0.220
 distance-0.207
 unatt-0.199
 away-0.199
handed-0.190
Token Lily
Token ID20037
Feature activation+3.92e-02

Pos logit contributions
 happens+0.016
 comes+0.014
oke+0.013
 Everything+0.013
ours+0.013
Neg logit contributions
side-0.019
 distance-0.018
handed-0.017
 unatt-0.017
thank-0.016
Token says
Token ID1139
Feature activation-2.72e-01

Pos logit contributions
side+0.006
 distance+0.005
handed+0.005
 bank+0.005
 sill+0.005
Neg logit contributions
 happens-0.002
apy-0.002
 Everything-0.002
 importantly-0.002
 People-0.002
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
 happens+0.036
 comes+0.033
 Everything+0.032
apy+0.032
 Now+0.031
Neg logit contributions
side-0.022
 distance-0.019
 unatt-0.019
handed-0.019
ge-0.017
Token "
Token ID366
Feature activation+2.96e-01

Pos logit contributions
side+0.016
 distance+0.016
ro+0.014
 sill+0.014
thank+0.014
Neg logit contributions
 happens-0.013
apy-0.010
 comes-0.010
oke-0.010
ye-0.010
Token the
Token ID262
Feature activation-3.31e-01

Pos logit contributions
 happens+0.003
 It+0.002
 comes+0.002
 Everything+0.002
 turns+0.002
Neg logit contributions
side-0.008
 unatt-0.008
handed-0.007
 spine-0.007
broken-0.007
Token ground
Token ID2323
Feature activation+7.20e-01

Pos logit contributions
 happens+0.026
 comes+0.024
 Everything+0.023
 Everyone+0.022
 can+0.022
Neg logit contributions
ge-0.037
side-0.036
handed-0.033
 unatt-0.033
broken-0.032
Token.
Token ID13
Feature activation-6.76e-02

Pos logit contributions
side+0.077
handed+0.076
anut+0.073
 unatt+0.072
 distance+0.071
Neg logit contributions
 happens-0.065
 comes-0.061
 can-0.056
 It-0.055
 Everything-0.054
Token "
Token ID366
Feature activation+2.82e-01

Pos logit contributions
 happens+0.008
 comes+0.006
apy+0.006
oke+0.006
ye+0.006
Neg logit contributions
side-0.007
handed-0.006
 distance-0.006
 bank-0.006
ro-0.006
TokenE
Token ID36
Feature activation-8.97e-02

Pos logit contributions
side+0.031
 spine+0.031
 unatt+0.030
handed+0.029
 bank+0.029
Neg logit contributions
 happens-0.025
 Now-0.023
 Everything-0.023
 It-0.023
apy-0.022
Tokenww
Token ID1383
Feature activation-2.89e+00

Pos logit contributions
 happens+0.010
 comes+0.009
 Everything+0.008
apy+0.008
 Now+0.008
Neg logit contributions
side-0.010
handed-0.009
 unatt-0.009
 distance-0.009
thank-0.008
Token,
Token ID11
Feature activation-3.04e-01

Pos logit contributions
 happens+0.291
 Everything+0.290
 It+0.274
 Someone+0.273
apy+0.263
Neg logit contributions
side-0.273
handed-0.246
 unatt-0.237
 bank-0.235
ge-0.230
Token what
Token ID644
Feature activation-7.41e-01

Pos logit contributions
 happens+0.029
apy+0.024
 comes+0.024
 Now+0.023
 Everything+0.023
Neg logit contributions
side-0.033
 unatt-0.033
 distance-0.032
handed-0.032
anut-0.029
Token's
Token ID338
Feature activation-7.05e-02

Pos logit contributions
 Now+0.067
apy+0.067
ours+0.061
 Everything+0.060
oms+0.059
Neg logit contributions
 unatt-0.068
handed-0.066
 distance-0.065
side-0.063
 missing-0.061
Token that
Token ID326
Feature activation+6.18e-01

Pos logit contributions
 happens+0.008
 comes+0.007
 Everything+0.006
apy+0.006
 Now+0.006
Neg logit contributions
side-0.008
handed-0.007
 distance-0.007
 unatt-0.007
thank-0.006
Token?"
Token ID1701
Feature activation-3.37e-01

Pos logit contributions
side+0.073
 distance+0.071
handed+0.069
 unatt+0.068
thank+0.065
Neg logit contributions
 happens-0.054
apy-0.048
 Everything-0.046
 comes-0.046
 Now-0.045
Token
Token ID95
Feature activation+6.73e-02

Pos logit contributions
 happens+0.007
 comes+0.006
 Everything+0.006
apy+0.006
 turns+0.006
Neg logit contributions
side-0.008
handed-0.007
 unatt-0.007
 distance-0.007
thank-0.006
Token
Token ID26391
Feature activation+1.05e+00

Pos logit contributions
side+0.007
handed+0.007
 distance+0.007
 unatt+0.007
thank+0.006
Neg logit contributions
 happens-0.007
 comes-0.006
apy-0.006
 Everything-0.006
ours-0.006
Token
Token ID129
Feature activation+1.25e+00

Pos logit contributions
side+0.104
handed+0.094
 unatt+0.089
 distance+0.087
j+0.087
Neg logit contributions
 happens-0.117
 comes-0.097
 Everything-0.095
ours-0.094
apy-0.093
Token
Token ID241
Feature activation+5.40e-01

Pos logit contributions
side+0.155
handed+0.142
 distance+0.142
 unatt+0.141
ge+0.133
Neg logit contributions
 happens-0.117
 comes-0.101
 Everything-0.096
 Now-0.094
apy-0.092
TokenA
Token ID32
Feature activation+6.89e-01

Pos logit contributions
side+0.061
 unatt+0.058
handed+0.056
 distance+0.055
anut+0.052
Neg logit contributions
 happens-0.055
 comes-0.048
 Now-0.048
apy-0.047
 Everything-0.047
Tokenww
Token ID1383
Feature activation-2.86e+00

Pos logit contributions
side+0.076
 distance+0.072
handed+0.067
 unatt+0.067
 bank+0.063
Neg logit contributions
 happens-0.070
 comes-0.060
apy-0.060
 Everything-0.059
ours-0.058
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
 Everything+0.285
 happens+0.283
 People+0.262
 Someone+0.261
apy+0.259
Neg logit contributions
side-0.265
handed-0.230
 unatt-0.227
ge-0.223
j-0.218
Token okay
Token ID8788
Feature activation+4.35e-01

Pos logit contributions
 happens+0.019
 comes+0.015
apy+0.015
icks+0.014
 Everything+0.014
Neg logit contributions
side-0.026
 unatt-0.024
 distance-0.024
handed-0.024
thank-0.022
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
side+0.049
 distance+0.045
 unatt+0.045
handed+0.044
ge+0.044
Neg logit contributions
 happens-0.042
 comes-0.036
 Everything-0.036
apy-0.035
 People-0.034
Tokenâ
Token ID22940
Feature activation-4.19e-02

Pos logit contributions
side+0.033
handed+0.032
 distance+0.030
 sill+0.030
 unatt+0.030
Neg logit contributions
 happens-0.025
 comes-0.019
apy-0.019
 turns-0.016
oms-0.016
Token
Token ID26391
Feature activation+1.05e+00

Pos logit contributions
 happens+0.006
apy+0.005
 comes+0.005
 Everything+0.005
ours+0.005
Neg logit contributions
side-0.003
handed-0.003
 distance-0.003
 unatt-0.003
thank-0.003
Token and
Token ID290
Feature activation-1.37e-01

Pos logit contributions
 happens+0.004
 comes+0.003
 Everything+0.003
apy+0.003
 It+0.003
Neg logit contributions
side-0.006
handed-0.006
 unatt-0.005
anut-0.005
 distance-0.005
Token said
Token ID531
Feature activation-4.47e-03

Pos logit contributions
 happens+0.013
 comes+0.012
 Everything+0.011
apy+0.011
oke+0.011
Neg logit contributions
side-0.016
 distance-0.014
 unatt-0.014
handed-0.014
thank-0.013
Token,
Token ID11
Feature activation-3.57e-01

Pos logit contributions
 happens+0.000
 comes+0.000
oke+0.000
 Everything+0.000
ours+0.000
Neg logit contributions
side-0.000
 unatt-0.000
handed-0.000
 distance-0.000
thank-0.000
Token "
Token ID366
Feature activation+1.57e-01

Pos logit contributions
 happens+0.020
 comes+0.015
 Everything+0.013
apy+0.013
 Now+0.012
Neg logit contributions
side-0.058
handed-0.054
 distance-0.054
 unatt-0.053
thank-0.051
TokenA
Token ID32
Feature activation+5.69e-01

Pos logit contributions
side+0.018
ge+0.017
 unatt+0.017
 bank+0.017
handed+0.016
Neg logit contributions
 happens-0.013
 Now-0.013
 Everything-0.013
 It-0.013
 See-0.012
Tokenww
Token ID1383
Feature activation-2.83e+00

Pos logit contributions
side+0.056
 distance+0.053
 unatt+0.049
handed+0.048
 bank+0.045
Neg logit contributions
 happens-0.064
 comes-0.057
 Everything-0.056
apy-0.055
 Now-0.054
Tokenwww
Token ID2503
Feature activation-1.71e+00

Pos logit contributions
 happens+0.305
 Everything+0.302
 People+0.289
 Someone+0.289
 It+0.288
Neg logit contributions
side-0.244
 unatt-0.212
handed-0.209
thank-0.197
 distance-0.197
Token,
Token ID11
Feature activation-3.14e-01

Pos logit contributions
 happens+0.189
 Everything+0.180
apy+0.175
 importantly+0.172
 People+0.172
Neg logit contributions
side-0.144
 unatt-0.128
handed-0.127
j-0.120
ro-0.118
Token do
Token ID466
Feature activation-2.96e-01

Pos logit contributions
 happens+0.029
 comes+0.024
apy+0.024
oke+0.023
 Now+0.023
Neg logit contributions
 unatt-0.034
side-0.034
 distance-0.033
handed-0.032
 sill-0.031
Token I
Token ID314
Feature activation-3.48e-01

Pos logit contributions
 happens+0.031
oke+0.031
 comes+0.030
ours+0.029
oms+0.029
Neg logit contributions
side-0.026
handed-0.025
 unatt-0.024
 bank-0.022
 distance-0.021
Token have
Token ID423
Feature activation-4.03e-01

Pos logit contributions
 happens+0.032
 comes+0.027
 Everything+0.026
apy+0.026
 Now+0.025
Neg logit contributions
side-0.043
handed-0.039
 distance-0.039
 unatt-0.039
thank-0.037
Token be
Token ID307
Feature activation+3.08e-01

Pos logit contributions
 happens+0.048
 comes+0.042
 can+0.040
 I+0.038
 turns+0.037
Neg logit contributions
side-0.056
handed-0.051
 unatt-0.045
 distance-0.045
keeping-0.044
Token bitter
Token ID12922
Feature activation+3.71e-01

Pos logit contributions
side+0.035
handed+0.032
 distance+0.031
 unatt+0.029
thank+0.028
Neg logit contributions
 happens-0.034
 comes-0.030
apy-0.027
 turns-0.026
 Everything-0.026
Token,
Token ID11
Feature activation+6.38e-02

Pos logit contributions
side+0.041
 distance+0.039
handed+0.038
anut+0.035
 unatt+0.034
Neg logit contributions
 happens-0.042
 comes-0.037
 can-0.033
 Now-0.032
oms-0.031
Token like
Token ID588
Feature activation+1.40e+00

Pos logit contributions
side+0.007
handed+0.007
 distance+0.006
 unatt+0.006
thank+0.006
Neg logit contributions
 happens-0.007
 comes-0.006
 Everything-0.006
apy-0.006
 It-0.006
Token e
Token ID304
Feature activation-3.60e-02

Pos logit contributions
side+0.174
handed+0.160
 unatt+0.160
 distance+0.159
ge+0.149
Neg logit contributions
 happens-0.126
 comes-0.109
 Everything-0.103
apy-0.102
 Now-0.099
Tokenww
Token ID1383
Feature activation-2.81e+00

Pos logit contributions
 happens+0.004
 comes+0.004
 Everything+0.004
apy+0.003
 Now+0.003
Neg logit contributions
side-0.004
handed-0.003
 unatt-0.003
 distance-0.003
ge-0.003
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
 Everything+0.294
 happens+0.294
 It+0.275
 See+0.268
 Someone+0.267
Neg logit contributions
side-0.281
 unatt-0.252
handed-0.247
ge-0.232
 distance-0.226
Token Do
Token ID2141
Feature activation-4.46e-01

Pos logit contributions
side+0.028
handed+0.027
 sill+0.027
 unatt+0.026
 distance+0.026
Neg logit contributions
 happens-0.017
 comes-0.014
apy-0.013
oms-0.013
oke-0.013
Token you
Token ID345
Feature activation-5.84e-02

Pos logit contributions
 happens+0.037
 comes+0.034
apy+0.033
 Everything+0.032
ours+0.031
Neg logit contributions
side-0.053
 unatt-0.050
handed-0.048
anut-0.048
 distance-0.047
Token want
Token ID765
Feature activation+2.46e-01

Pos logit contributions
 happens+0.006
 comes+0.005
 Everything+0.005
apy+0.005
 It+0.004
Neg logit contributions
side-0.007
handed-0.007
 distance-0.006
 unatt-0.006
ge-0.006
Token to
Token ID284
Feature activation+2.20e-01

Pos logit contributions
side+0.023
 unatt+0.021
 distance+0.021
handed+0.021
thank+0.019
Neg logit contributions
 happens-0.029
 comes-0.026
apy-0.025
 Everything-0.025
oke-0.025
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
side+0.059
handed+0.056
 unatt+0.054
 distance+0.054
ge+0.053
Neg logit contributions
 happens-0.055
 comes-0.053
 can-0.045
 It-0.045
 Now-0.044
Token
Token ID198
Feature activation-1.99e-01

Pos logit contributions
 happens+0.011
 comes+0.009
apy+0.008
 Everything+0.008
oke+0.008
Neg logit contributions
side-0.016
handed-0.015
 distance-0.015
 unatt-0.015
thank-0.014
Token
Token ID198
Feature activation-2.15e-01

Pos logit contributions
 happens+0.018
 comes+0.015
 Everything+0.015
 Now+0.014
apy+0.014
Neg logit contributions
side-0.025
 unatt-0.023
handed-0.023
 distance-0.023
thank-0.022
Token"
Token ID1
Feature activation+9.85e-02

Pos logit contributions
 happens+0.025
 comes+0.023
 Everything+0.023
apy+0.022
 Now+0.022
Neg logit contributions
side-0.020
 unatt-0.018
 distance-0.018
handed-0.018
thank-0.017
TokenE
Token ID36
Feature activation-9.60e-02

Pos logit contributions
side+0.011
 unatt+0.011
 spine+0.011
handed+0.011
ge+0.011
Neg logit contributions
 happens-0.009
apy-0.008
 comes-0.008
 Everything-0.008
 It-0.008
Tokenww
Token ID1383
Feature activation-2.81e+00

Pos logit contributions
 happens+0.010
 comes+0.009
apy+0.008
 Now+0.008
 Everything+0.008
Neg logit contributions
side-0.011
 distance-0.010
handed-0.010
 unatt-0.010
ge-0.009
Token,
Token ID11
Feature activation-3.05e-01

Pos logit contributions
 happens+0.291
 Everything+0.286
 People+0.263
apy+0.261
 It+0.260
Neg logit contributions
side-0.267
handed-0.245
anut-0.237
 bank-0.236
 unatt-0.234
Token this
Token ID428
Feature activation+4.40e-01

Pos logit contributions
 happens+0.031
apy+0.026
 comes+0.026
 Now+0.024
oke+0.024
Neg logit contributions
side-0.032
 unatt-0.032
handed-0.031
 distance-0.030
anut-0.029
Token fish
Token ID5916
Feature activation+2.37e-02

Pos logit contributions
side+0.053
 distance+0.051
handed+0.050
 unatt+0.048
 bank+0.048
Neg logit contributions
 happens-0.038
apy-0.035
 Everything-0.033
 comes-0.033
 Now-0.032
Token is
Token ID318
Feature activation-6.31e-01

Pos logit contributions
handed+0.003
side+0.003
 distance+0.003
 unatt+0.003
 bank+0.003
Neg logit contributions
 happens-0.002
 Everything-0.001
apy-0.001
 People-0.001
 Everyone-0.001
Token sm
Token ID895
Feature activation+5.10e-01

Pos logit contributions
 happens+0.077
 comes+0.069
 Everything+0.066
apy+0.065
 Now+0.064
Neg logit contributions
side-0.060
handed-0.054
 distance-0.053
 unatt-0.053
thank-0.048
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
 happens+0.068
 comes+0.061
 Everything+0.060
apy+0.059
 Now+0.058
Neg logit contributions
side-0.045
 distance-0.041
handed-0.041
 unatt-0.040
ge-0.036
Token
Token ID198
Feature activation+2.15e-01

Pos logit contributions
side+0.025
 distance+0.023
 sill+0.022
handed+0.022
ro+0.021
Neg logit contributions
 happens-0.027
 comes-0.023
oke-0.022
ye-0.022
opter-0.022
Token
Token ID198
Feature activation+1.91e-01

Pos logit contributions
side+0.021
handed+0.019
 distance+0.019
 unatt+0.017
ge+0.017
Neg logit contributions
 happens-0.025
 comes-0.022
 Everything-0.021
apy-0.021
ours-0.020
Token"
Token ID1
Feature activation+1.69e-01

Pos logit contributions
side+0.024
 distance+0.021
handed+0.021
 unatt+0.019
ge+0.019
Neg logit contributions
 happens-0.018
 comes-0.016
 Everything-0.016
oke-0.015
ours-0.014
TokenA
Token ID32
Feature activation+6.04e-01

Pos logit contributions
side+0.018
handed+0.017
 unatt+0.017
 distance+0.016
ge+0.016
Neg logit contributions
 happens-0.018
 comes-0.016
apy-0.015
 Everything-0.015
 Now-0.015
Tokenww
Token ID1383
Feature activation-2.81e+00

Pos logit contributions
 distance+0.045
side+0.040
handed+0.036
 bank+0.035
 tune+0.035
Neg logit contributions
ours-0.073
 happens-0.071
 Everything-0.071
oke-0.071
apy-0.070
Token,
Token ID11
Feature activation-8.40e-02

Pos logit contributions
 Everything+0.341
 People+0.316
 Someone+0.311
apy+0.305
 importantly+0.304
Neg logit contributions
side-0.198
 distance-0.183
 unatt-0.180
ge-0.175
 bank-0.173
Token it
Token ID340
Feature activation-5.28e-01

Pos logit contributions
 happens+0.009
apy+0.008
 Everything+0.008
oke+0.008
 People+0.008
Neg logit contributions
 unatt-0.007
side-0.007
 distance-0.006
 sill-0.006
handed-0.006
Token is
Token ID318
Feature activation-5.76e-01

Pos logit contributions
apy+0.037
 Everything+0.035
 importantly+0.033
 Everyone+0.033
 People+0.032
Neg logit contributions
 bank-0.064
 distance-0.063
side-0.063
handed-0.061
 unatt-0.061
Token so
Token ID523
Feature activation+2.35e-01

Pos logit contributions
 happens+0.063
 comes+0.056
 Everything+0.055
apy+0.055
 Now+0.053
Neg logit contributions
side-0.059
 distance-0.054
 unatt-0.053
handed-0.053
thank-0.049
Token cute
Token ID13779
Feature activation+3.81e-02

Pos logit contributions
 distance+0.017
side+0.016
handed+0.015
 unatt+0.015
anut+0.014
Neg logit contributions
 happens-0.030
 Everything-0.028
oke-0.027
ours-0.027
 comes-0.027
Token.
Token ID13
Feature activation-1.35e-01

Pos logit contributions
handed+0.024
side+0.023
anut+0.023
 unatt+0.023
thank+0.022
Neg logit contributions
 happens-0.019
 comes-0.018
 can-0.017
 of-0.016
 It-0.016
Token
Token ID198
Feature activation-1.37e-01

Pos logit contributions
 happens+0.011
 comes+0.009
apy+0.009
 Everything+0.008
oke+0.008
Neg logit contributions
side-0.018
handed-0.017
 distance-0.016
 unatt-0.016
thank-0.016
Token
Token ID198
Feature activation-1.48e-01

Pos logit contributions
 happens+0.013
 comes+0.011
 Everything+0.011
apy+0.010
 Now+0.010
Neg logit contributions
side-0.017
handed-0.015
 distance-0.015
 unatt-0.015
thank-0.014
Token"
Token ID1
Feature activation+8.87e-02

Pos logit contributions
 happens+0.018
 comes+0.016
 Everything+0.016
 Now+0.016
apy+0.016
Neg logit contributions
side-0.013
 unatt-0.012
 distance-0.012
handed-0.012
thank-0.011
TokenE
Token ID36
Feature activation+2.98e-02

Pos logit contributions
side+0.010
 unatt+0.010
 spine+0.010
handed+0.010
ge+0.009
Neg logit contributions
 happens-0.008
apy-0.007
 Everything-0.007
 Now-0.007
 comes-0.007
Tokenww
Token ID1383
Feature activation-2.80e+00

Pos logit contributions
side+0.003
handed+0.003
 distance+0.003
 unatt+0.003
thank+0.003
Neg logit contributions
 happens-0.003
 comes-0.003
 Everything-0.003
apy-0.003
 Now-0.002
Token!"
Token ID2474
Feature activation-2.25e-01

Pos logit contributions
 happens+0.278
 Everything+0.278
 People+0.259
 It+0.257
 There+0.254
Neg logit contributions
side-0.266
handed-0.238
thank-0.231
 unatt-0.229
ge-0.229
Token she
Token ID673
Feature activation+5.84e-01

Pos logit contributions
 happens+0.020
 comes+0.018
 Everything+0.017
apy+0.017
 Now+0.016
Neg logit contributions
side-0.028
handed-0.026
 distance-0.026
 unatt-0.026
ge-0.024
Token screamed
Token ID25421
Feature activation-6.86e-01

Pos logit contributions
side+0.077
handed+0.072
 distance+0.071
 unatt+0.071
ge+0.067
Neg logit contributions
 happens-0.050
 comes-0.042
 Everything-0.039
apy-0.039
 Now-0.038
Token.
Token ID13
Feature activation-2.50e-02

Pos logit contributions
 happens+0.054
 It+0.053
apy+0.053
 comes+0.051
 We+0.048
Neg logit contributions
side-0.086
handed-0.083
 distance-0.081
 unatt-0.075
anut-0.074
Token
Token ID198
Feature activation-7.83e-02

Pos logit contributions
 happens+0.002
 comes+0.002
 Everything+0.001
apy+0.001
 Now+0.001
Neg logit contributions
side-0.004
handed-0.003
 distance-0.003
 unatt-0.003
ge-0.003
Token at
Token ID379
Feature activation-2.05e-01

Pos logit contributions
side+0.039
handed+0.035
 distance+0.034
 unatt+0.033
thank+0.031
Neg logit contributions
 happens-0.054
 comes-0.049
 Everything-0.047
apy-0.046
 Now-0.045
Token it
Token ID340
Feature activation-4.73e-01

Pos logit contributions
 happens+0.019
 comes+0.016
 Everything+0.015
apy+0.015
 Now+0.015
Neg logit contributions
side-0.026
handed-0.024
 distance-0.023
 unatt-0.023
thank-0.022
Token said
Token ID531
Feature activation+4.01e-01

Pos logit contributions
 happens+0.054
 comes+0.048
 can+0.044
 Everything+0.043
apy+0.042
Neg logit contributions
side-0.049
handed-0.046
 distance-0.045
 unatt-0.043
thank-0.042
Token "
Token ID366
Feature activation+4.32e-01

Pos logit contributions
side+0.043
 unatt+0.040
 distance+0.038
handed+0.036
 bank+0.035
Neg logit contributions
 happens-0.037
 Now-0.034
 comes-0.033
ours-0.032
oke-0.032
Tokenew
Token ID413
Feature activation-8.63e-03

Pos logit contributions
side+0.037
 spine+0.035
 unatt+0.035
handed+0.035
anut+0.033
Neg logit contributions
 happens-0.046
 comes-0.045
 Everything-0.042
 It-0.042
 turns-0.042
Tokenww
Token ID1383
Feature activation-2.78e+00

Pos logit contributions
 happens+0.001
 comes+0.001
apy+0.001
 Everything+0.001
 There+0.001
Neg logit contributions
side-0.001
handed-0.001
 unatt-0.001
 distance-0.001
 bank-0.001
Token!"
Token ID2474
Feature activation-1.33e-01

Pos logit contributions
 happens+0.278
 Everything+0.263
 importantly+0.245
apy+0.245
 It+0.242
Neg logit contributions
side-0.281
handed-0.240
 bank-0.233
 unatt-0.232
ater-0.232
Token
Token ID220
Feature activation+4.79e-01

Pos logit contributions
 happens+0.010
 comes+0.008
 Everything+0.008
apy+0.008
 Now+0.008
Neg logit contributions
side-0.019
handed-0.017
 distance-0.017
 unatt-0.017
ge-0.016
Token
Token ID198
Feature activation+4.31e-02

Pos logit contributions
side+0.052
handed+0.048
 distance+0.047
 unatt+0.045
ge+0.043
Neg logit contributions
 happens-0.051
 comes-0.045
apy-0.042
 Everything-0.041
 Now-0.040
Token
Token ID198
Feature activation+2.38e-02

Pos logit contributions
side+0.005
 unatt+0.005
handed+0.005
 distance+0.005
ge+0.005
Neg logit contributions
 happens-0.004
 comes-0.003
 Everything-0.003
 Now-0.003
apy-0.003
TokenOne
Token ID3198
Feature activation+5.55e-01

Pos logit contributions
side+0.002
handed+0.002
 unatt+0.002
 distance+0.002
ge+0.001
Neg logit contributions
 happens-0.003
 comes-0.003
 Everything-0.003
apy-0.003
 Now-0.003
Token friends
Token ID2460
Feature activation-1.10e+00

Pos logit contributions
 happens+0.005
 comes+0.004
 Everything+0.004
apy+0.004
 Now+0.004
Neg logit contributions
side-0.007
handed-0.007
 distance-0.007
 unatt-0.007
thank-0.006
Token said
Token ID531
Feature activation-1.05e-02

Pos logit contributions
 happens+0.099
 comes+0.086
 Everything+0.081
apy+0.079
 Now+0.077
Neg logit contributions
side-0.139
handed-0.128
 distance-0.127
 unatt-0.126
ge-0.119
Token,
Token ID11
Feature activation-2.96e-01

Pos logit contributions
 happens+0.001
 comes+0.001
 Everything+0.001
ours+0.001
 Now+0.001
Neg logit contributions
side-0.001
handed-0.001
 unatt-0.001
 distance-0.001
thank-0.001
Token "
Token ID366
Feature activation+9.32e-02

Pos logit contributions
 happens+0.017
 comes+0.014
 Everything+0.012
apy+0.012
 Now+0.011
Neg logit contributions
side-0.047
handed-0.043
 distance-0.043
 unatt-0.043
thank-0.041
TokenA
Token ID32
Feature activation+4.31e-01

Pos logit contributions
side+0.010
 unatt+0.010
handed+0.010
 distance+0.009
ge+0.009
Neg logit contributions
 happens-0.009
 comes-0.008
 Now-0.008
 Everything-0.008
apy-0.008
Tokenww
Token ID1383
Feature activation-2.78e+00

Pos logit contributions
side+0.042
 distance+0.042
handed+0.038
 unatt+0.036
 bank+0.035
Neg logit contributions
 happens-0.048
ours-0.043
 comes-0.042
 Everything-0.042
apy-0.041
Token,
Token ID11
Feature activation-1.93e-01

Pos logit contributions
 Everything+0.282
 happens+0.279
 People+0.258
 It+0.256
 Someone+0.255
Neg logit contributions
side-0.261
handed-0.229
 unatt-0.226
thank-0.219
ge-0.218
Token okay
Token ID8788
Feature activation+4.07e-01

Pos logit contributions
 happens+0.019
apy+0.016
ours+0.015
 Everything+0.015
 comes+0.015
Neg logit contributions
side-0.021
 unatt-0.020
 distance-0.020
handed-0.019
thank-0.018
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
side+0.038
 unatt+0.035
 distance+0.034
handed+0.034
ge+0.032
Neg logit contributions
 happens-0.047
apy-0.042
 Everything-0.042
oke-0.041
 comes-0.041
Token But
Token ID887
Feature activation+1.08e-01

Pos logit contributions
 happens+0.001
 comes+0.001
apy+0.001
oms+0.001
 turns+0.001
Neg logit contributions
side-0.002
ro-0.002
 sill-0.002
handed-0.002
 spine-0.002
Token come
Token ID1282
Feature activation+2.76e-01

Pos logit contributions
side+0.012
 unatt+0.011
 distance+0.011
handed+0.011
thank+0.010
Neg logit contributions
 happens-0.010
 comes-0.009
apy-0.009
oke-0.009
 Everything-0.009
Token and
Token ID290
Feature activation+1.75e-01

Pos logit contributions
 happens+0.002
 comes+0.002
 It+0.002
apy+0.001
 can+0.001
Neg logit contributions
handed-0.003
 unatt-0.003
side-0.003
 distance-0.003
 former-0.003
Token said
Token ID531
Feature activation+2.22e-01

Pos logit contributions
side+0.020
 distance+0.018
handed+0.018
 unatt+0.018
thank+0.017
Neg logit contributions
 happens-0.018
 comes-0.016
 Everything-0.015
apy-0.015
 Now-0.015
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
side+0.020
 unatt+0.017
 bank+0.016
handed+0.016
anut+0.016
Neg logit contributions
 happens-0.022
 Everything-0.021
 Now-0.021
ay-0.020
 comes-0.020
Token "
Token ID366
Feature activation+3.82e-01

Pos logit contributions
 happens+0.007
 comes+0.005
 Everything+0.005
apy+0.005
 Now+0.005
Neg logit contributions
side-0.017
 distance-0.015
 unatt-0.015
handed-0.015
thank-0.015
TokenE
Token ID36
Feature activation+9.35e-02

Pos logit contributions
side+0.044
 unatt+0.043
 spine+0.041
handed+0.041
 distance+0.040
Neg logit contributions
 happens-0.037
apy-0.033
 Now-0.032
 comes-0.031
 turns-0.030
Tokenww
Token ID1383
Feature activation-2.77e+00

Pos logit contributions
side+0.009
handed+0.009
 unatt+0.008
 distance+0.008
thank+0.008
Neg logit contributions
 happens-0.010
 Everything-0.010
 comes-0.009
apy-0.009
 There-0.009
Token,
Token ID11
Feature activation-9.40e-02

Pos logit contributions
 Everything+0.302
 happens+0.298
 Someone+0.278
 It+0.278
 People+0.278
Neg logit contributions
side-0.235
ge-0.212
handed-0.206
 bank-0.200
 unatt-0.200
Token Bruno
Token ID31045
Feature activation-3.66e-02

Pos logit contributions
 happens+0.009
 comes+0.008
apy+0.008
 Everything+0.008
 Now+0.008
Neg logit contributions
side-0.010
 unatt-0.010
 distance-0.009
handed-0.009
 bank-0.009
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
 happens+0.005
apy+0.005
 Everything+0.005
 People+0.004
 importantly+0.004
Neg logit contributions
side-0.003
handed-0.002
 distance-0.002
 bank-0.002
 unatt-0.002
Token You
Token ID921
Feature activation-5.27e-01

Pos logit contributions
side+0.022
handed+0.022
 sill+0.021
ro+0.021
 distance+0.020
Neg logit contributions
 happens-0.016
 comes-0.013
oms-0.012
oke-0.012
 turns-0.011
Token are
Token ID389
Feature activation+2.47e-01

Pos logit contributions
 happens+0.042
 Everything+0.035
 Now+0.035
 comes+0.034
apy+0.034
Neg logit contributions
side-0.067
 distance-0.064
handed-0.063
 unatt-0.063
 bank-0.061
Token She
Token ID1375
Feature activation-9.09e-02

Pos logit contributions
side+0.012
 distance+0.011
handed+0.011
 sill+0.010
ro+0.010
Neg logit contributions
 happens-0.012
 comes-0.010
ye-0.010
oke-0.010
oms-0.010
Token said
Token ID531
Feature activation+2.42e-01

Pos logit contributions
 happens+0.009
 comes+0.008
 Everything+0.008
apy+0.008
 Now+0.008
Neg logit contributions
side-0.010
 distance-0.009
handed-0.009
 unatt-0.009
thank-0.009
Token,
Token ID11
Feature activation-1.22e-01

Pos logit contributions
side+0.018
 unatt+0.017
 away+0.015
 distance+0.015
 herself+0.015
Neg logit contributions
 happens-0.026
 Everything-0.025
 importantly-0.025
 Now-0.025
 People-0.024
Token "
Token ID366
Feature activation+4.02e-01

Pos logit contributions
 happens+0.010
 comes+0.007
apy+0.007
 Everything+0.007
 turns+0.007
Neg logit contributions
side-0.017
 distance-0.015
 unatt-0.015
thank-0.014
handed-0.014
TokenE
Token ID36
Feature activation+8.62e-02

Pos logit contributions
side+0.049
 unatt+0.045
handed+0.044
ge+0.043
 distance+0.041
Neg logit contributions
 happens-0.039
 comes-0.034
 It-0.034
apy-0.034
 Now-0.033
Tokenww
Token ID1383
Feature activation-2.74e+00

Pos logit contributions
side+0.009
handed+0.008
 unatt+0.008
 distance+0.008
thank+0.008
Neg logit contributions
 happens-0.010
 Everything-0.009
 comes-0.009
 There-0.008
apy-0.008
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
 Everything+0.314
 happens+0.305
 People+0.294
 Someone+0.283
 It+0.281
Neg logit contributions
side-0.222
ge-0.201
anut-0.191
 unatt-0.189
handed-0.188
Token disgusting
Token ID23374
Feature activation+2.87e-01

Pos logit contributions
 happens+0.013
apy+0.011
 comes+0.010
 Everything+0.010
 Now+0.010
Neg logit contributions
 unatt-0.012
side-0.012
 distance-0.012
anut-0.011
handed-0.011
Token!"
Token ID2474
Feature activation+1.83e-02

Pos logit contributions
side+0.027
handed+0.026
 unatt+0.025
 distance+0.025
thank+0.024
Neg logit contributions
 happens-0.030
 Everything-0.028
apy-0.026
 There-0.026
 People-0.026
Token She
Token ID1375
Feature activation-1.43e-01

Pos logit contributions
side+0.002
handed+0.002
 distance+0.002
 unatt+0.002
thank+0.002
Neg logit contributions
 happens-0.002
 comes-0.001
 Everything-0.001
apy-0.001
 Now-0.001
Token threw
Token ID9617
Feature activation+3.30e-01

Pos logit contributions
 happens+0.015
 Everything+0.013
 comes+0.013
apy+0.012
 People+0.012
Neg logit contributions
side-0.015
 distance-0.013
handed-0.013
 unatt-0.013
thank-0.013
Token He
Token ID679
Feature activation-2.01e-01

Pos logit contributions
side+0.014
handed+0.012
 distance+0.012
ro+0.012
 sill+0.012
Neg logit contributions
 happens-0.013
 comes-0.012
apy-0.011
oms-0.011
oke-0.011
Token says
Token ID1139
Feature activation-3.49e-01

Pos logit contributions
 happens+0.010
apy+0.010
 Everything+0.010
 Now+0.009
 People+0.009
Neg logit contributions
side-0.027
 bank-0.027
handed-0.026
 distance-0.026
thank-0.026
Token,
Token ID11
Feature activation-1.43e-01

Pos logit contributions
 Everything+0.037
 happens+0.035
 Now+0.035
ours+0.035
 importantly+0.034
Neg logit contributions
side-0.026
 unatt-0.025
handed-0.021
ro-0.021
 bank-0.020
Token "
Token ID366
Feature activation+3.21e-01

Pos logit contributions
 happens+0.011
apy+0.008
 comes+0.008
 Everything+0.008
 turns+0.008
Neg logit contributions
side-0.020
 unatt-0.017
 distance-0.017
handed-0.016
thank-0.016
TokenA
Token ID32
Feature activation+6.02e-01

Pos logit contributions
side+0.037
 unatt+0.036
 distance+0.035
 spine+0.034
handed+0.033
Neg logit contributions
 happens-0.031
 Now-0.027
 It-0.027
 comes-0.026
apy-0.026
Tokenww
Token ID1383
Feature activation-2.69e+00

Pos logit contributions
side+0.051
 distance+0.047
handed+0.043
 unatt+0.042
thank+0.042
Neg logit contributions
 Everything-0.069
 happens-0.066
apy-0.066
ours-0.063
 Now-0.061
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
 Everything+0.310
 People+0.285
 happens+0.282
 Someone+0.279
 It+0.278
Neg logit contributions
side-0.216
 unatt-0.190
handed-0.187
thank-0.174
ge-0.173
Token this
Token ID428
Feature activation+5.05e-01

Pos logit contributions
 happens+0.010
apy+0.009
 Everything+0.009
 comes+0.009
 Now+0.009
Neg logit contributions
side-0.013
 unatt-0.012
handed-0.012
 distance-0.012
anut-0.011
Token is
Token ID318
Feature activation-5.95e-01

Pos logit contributions
side+0.072
 distance+0.067
handed+0.067
 unatt+0.065
 bank+0.065
Neg logit contributions
 happens-0.030
apy-0.029
 Everything-0.028
 comes-0.024
 Now-0.024
Token from
Token ID422
Feature activation+5.65e-01

Pos logit contributions
 happens+0.058
apy+0.052
 Everything+0.052
 comes+0.051
 Now+0.049
Neg logit contributions
side-0.065
 unatt-0.060
 distance-0.060
anut-0.057
handed-0.057
Token Anna
Token ID11735
Feature activation-1.74e-01

Pos logit contributions
side+0.081
handed+0.075
 distance+0.075
 unatt+0.073
ge+0.071
Neg logit contributions
 happens-0.041
 comes-0.035
 Everything-0.030
 Now-0.030
apy-0.030
Token He
Token ID679
Feature activation-1.58e-01

Pos logit contributions
side+0.012
 distance+0.011
handed+0.010
 sill+0.010
thank+0.010
Neg logit contributions
 happens-0.009
 comes-0.008
oms-0.007
oke-0.007
osh-0.007
Token said
Token ID531
Feature activation+3.20e-01

Pos logit contributions
 happens+0.015
 Everything+0.013
 comes+0.013
apy+0.013
 Now+0.012
Neg logit contributions
side-0.017
 distance-0.016
handed-0.016
 unatt-0.016
thank-0.016
Token,
Token ID11
Feature activation+5.87e-02

Pos logit contributions
side+0.027
 unatt+0.024
handed+0.023
anut+0.021
 bank+0.021
Neg logit contributions
 happens-0.033
 Everything-0.032
 comes-0.030
 People-0.030
 importantly-0.030
Token "
Token ID366
Feature activation+4.52e-01

Pos logit contributions
side+0.008
 unatt+0.006
 distance+0.006
thank+0.006
handed+0.006
Neg logit contributions
 happens-0.005
 comes-0.004
 turns-0.004
ours-0.004
oms-0.004
TokenE
Token ID36
Feature activation+1.84e-01

Pos logit contributions
side+0.050
 unatt+0.049
handed+0.046
ge+0.046
 spine+0.046
Neg logit contributions
 happens-0.047
 Now-0.041
 comes-0.041
 It-0.039
apy-0.039
Tokenww
Token ID1383
Feature activation-2.67e+00

Pos logit contributions
side+0.020
handed+0.018
thank+0.017
 unatt+0.017
 distance+0.017
Neg logit contributions
 happens-0.019
 Everything-0.018
 comes-0.017
 There-0.017
apy-0.016
Token,
Token ID11
Feature activation+4.54e-02

Pos logit contributions
 Everything+0.301
 happens+0.298
 Someone+0.279
 It+0.276
 People+0.275
Neg logit contributions
side-0.229
handed-0.189
ge-0.187
thank-0.183
anut-0.183
Token this
Token ID428
Feature activation+5.32e-01

Pos logit contributions
side+0.004
 unatt+0.004
handed+0.004
 distance+0.004
anut+0.004
Neg logit contributions
 happens-0.005
apy-0.004
oms-0.004
 comes-0.004
 Everything-0.004
Token is
Token ID318
Feature activation-5.01e-01

Pos logit contributions
side+0.078
handed+0.073
 distance+0.072
thank+0.070
 unatt+0.070
Neg logit contributions
 happens-0.031
 Everything-0.027
apy-0.026
 comes-0.025
 Now-0.024
Token the
Token ID262
Feature activation-8.40e-02

Pos logit contributions
 happens+0.054
 comes+0.048
 Everything+0.047
apy+0.046
 Now+0.045
Neg logit contributions
side-0.052
 unatt-0.047
 distance-0.047
handed-0.046
anut-0.044
Token worst
Token ID5290
Feature activation+1.81e-01

Pos logit contributions
 happens+0.007
 comes+0.006
 can+0.005
 turns+0.005
 I+0.005
Neg logit contributions
side-0.011
ge-0.011
handed-0.011
broken-0.010
 unatt-0.010
Token
Token ID129
Feature activation+1.22e+00

Pos logit contributions
side+0.111
handed+0.100
 distance+0.100
 unatt+0.096
thank+0.093
Neg logit contributions
 happens-0.104
 comes-0.089
 Everything-0.083
apy-0.083
ours-0.081
Token
Token ID241
Feature activation+3.13e-01

Pos logit contributions
side+0.126
 distance+0.119
 unatt+0.118
handed+0.114
ge+0.111
Neg logit contributions
 happens-0.131
 Now-0.121
 comes-0.116
 Everything-0.116
 Everyone-0.113
TokenOf
Token ID5189
Feature activation+7.15e-02

Pos logit contributions
side+0.038
 unatt+0.036
handed+0.035
 distance+0.034
anut+0.032
Neg logit contributions
 happens-0.029
 Now-0.025
 comes-0.025
 Everything-0.025
 It-0.025
Token course
Token ID1781
Feature activation+5.67e-02

Pos logit contributions
side+0.009
 distance+0.008
handed+0.008
 unatt+0.008
thank+0.007
Neg logit contributions
 happens-0.007
 comes-0.006
apy-0.006
ours-0.005
 Everything-0.005
Token!
Token ID0
Feature activation-3.80e-01

Pos logit contributions
side+0.006
 distance+0.005
 unatt+0.005
handed+0.005
ge+0.005
Neg logit contributions
 happens-0.006
ours-0.005
 comes-0.005
apy-0.005
 Everything-0.005
Token Now
Token ID2735
Feature activation-2.63e+00

Pos logit contributions
 happens+0.033
 comes+0.026
apy+0.025
ours+0.022
icks+0.021
Neg logit contributions
side-0.049
handed-0.045
 distance-0.044
ro-0.043
 sill-0.043
Token,
Token ID11
Feature activation-4.42e-01

Pos logit contributions
 happens+0.221
apy+0.190
ours+0.180
 comes+0.178
oke+0.177
Neg logit contributions
side-0.331
handed-0.297
 distance-0.296
 unatt-0.292
ge-0.290
Token let
Token ID1309
Feature activation-2.42e-01

Pos logit contributions
 happens+0.043
apy+0.037
 comes+0.036
ours+0.035
oke+0.034
Neg logit contributions
side-0.051
 distance-0.046
 unatt-0.046
handed-0.045
thank-0.043
Tokenâ
Token ID22940
Feature activation+5.77e-03

Pos logit contributions
 happens+0.027
apy+0.024
 comes+0.023
ours+0.022
 importantly+0.022
Neg logit contributions
side-0.024
handed-0.023
 unatt-0.021
 distance-0.021
 bank-0.020
Token
Token ID26391
Feature activation+1.04e+00

Pos logit contributions
side+0.001
handed+0.000
 distance+0.000
 unatt+0.000
ge+0.000
Neg logit contributions
 happens-0.001
 comes-0.001
 Everything-0.001
 Now-0.001
apy-0.001
Token
Token ID8151
Feature activation+1.69e+00

Pos logit contributions
side+0.112
handed+0.098
 distance+0.094
 unatt+0.091
j+0.090
Neg logit contributions
 happens-0.115
 comes-0.093
apy-0.092
ours-0.091
 importantly-0.090
Token so
Token ID523
Feature activation+1.23e-01

Pos logit contributions
 happens+0.106
 comes+0.095
 Everything+0.091
apy+0.090
 Now+0.089
Neg logit contributions
side-0.079
handed-0.071
 distance-0.070
 unatt-0.069
thank-0.064
Token proud
Token ID6613
Feature activation-8.09e-01

Pos logit contributions
side+0.012
handed+0.011
 distance+0.011
 unatt+0.011
thank+0.010
Neg logit contributions
 happens-0.014
 comes-0.013
 Everything-0.012
apy-0.012
 Now-0.012
Token of
Token ID286
Feature activation-6.28e-01

Pos logit contributions
 happens+0.063
 comes+0.053
 Everything+0.051
apy+0.051
ours+0.049
Neg logit contributions
side-0.110
 distance-0.102
handed-0.101
 unatt-0.100
thank-0.096
Token himself
Token ID2241
Feature activation+1.17e+00

Pos logit contributions
 happens+0.090
 comes+0.081
apy+0.078
 Everything+0.078
ours+0.077
Neg logit contributions
side-0.046
handed-0.041
 distance-0.039
 unatt-0.039
ge-0.035
Token!
Token ID0
Feature activation-4.99e-01

Pos logit contributions
side+0.118
handed+0.107
 distance+0.104
 unatt+0.104
ge+0.098
Neg logit contributions
 happens-0.135
 comes-0.119
 Everything-0.115
apy-0.114
 Now-0.110
Token Now
Token ID2735
Feature activation-2.56e+00

Pos logit contributions
 happens+0.056
 comes+0.049
apy+0.044
ours+0.044
oms+0.043
Neg logit contributions
side-0.053
handed-0.050
 distance-0.047
thank-0.043
keeping-0.042
Token he
Token ID339
Feature activation-3.44e-03

Pos logit contributions
 happens+0.257
ours+0.233
apy+0.224
oke+0.216
 comes+0.215
Neg logit contributions
side-0.285
handed-0.253
 distance-0.252
 unatt-0.240
ge-0.237
Token could
Token ID714
Feature activation-2.40e-01

Pos logit contributions
 happens+0.000
apy+0.000
 Everything+0.000
ours+0.000
 There+0.000
Neg logit contributions
side-0.000
thank-0.000
handed-0.000
 distance-0.000
 unatt-0.000
Token play
Token ID711
Feature activation-1.67e-02

Pos logit contributions
 happens+0.026
 comes+0.022
 Everything+0.021
oke+0.021
apy+0.021
Neg logit contributions
side-0.026
handed-0.024
 distance-0.024
 unatt-0.023
thank-0.022
Token with
Token ID351
Feature activation-1.55e-01

Pos logit contributions
 happens+0.002
 comes+0.002
 Everything+0.002
apy+0.002
 Now+0.002
Neg logit contributions
side-0.002
handed-0.001
 distance-0.001
 unatt-0.001
ge-0.001
Token them
Token ID606
Feature activation+1.80e-02

Pos logit contributions
 happens+0.015
 comes+0.013
apy+0.013
 Everything+0.013
 Now+0.012
Neg logit contributions
side-0.018
 distance-0.017
handed-0.017
 unatt-0.016
thank-0.016
Token done
Token ID1760
Feature activation+7.84e-02

Pos logit contributions
 happens+0.080
 comes+0.072
 Everything+0.069
apy+0.066
 Now+0.064
Neg logit contributions
side-0.094
handed-0.087
 distance-0.085
 unatt-0.084
ge-0.081
Token,
Token ID11
Feature activation-5.73e-01

Pos logit contributions
side+0.010
handed+0.009
 distance+0.009
 unatt+0.009
ge+0.009
Neg logit contributions
 happens-0.007
 comes-0.006
 Everything-0.005
apy-0.005
 Now-0.005
Token she
Token ID673
Feature activation+2.63e-01

Pos logit contributions
 happens+0.061
 comes+0.053
oke+0.052
ours+0.051
 Everything+0.050
Neg logit contributions
side-0.063
 distance-0.059
handed-0.057
 unatt-0.056
thank-0.052
Token smiled
Token ID13541
Feature activation-2.56e-01

Pos logit contributions
side+0.028
handed+0.026
 distance+0.026
 unatt+0.025
thank+0.024
Neg logit contributions
 happens-0.028
 comes-0.024
 Everything-0.024
apy-0.024
 Now-0.023
Token.
Token ID13
Feature activation-3.24e-01

Pos logit contributions
 happens+0.025
 comes+0.023
 It+0.022
 Everything+0.022
apy+0.021
Neg logit contributions
side-0.029
handed-0.028
 distance-0.026
 unatt-0.026
anut-0.025
Token Now
Token ID2735
Feature activation-2.55e+00

Pos logit contributions
 happens+0.038
 comes+0.032
oke+0.031
apy+0.031
 floats+0.030
Neg logit contributions
side-0.032
handed-0.028
 distance-0.027
ro-0.026
thank-0.025
Token she
Token ID673
Feature activation+2.19e-01

Pos logit contributions
 happens+0.275
oke+0.262
ours+0.258
apy+0.252
 Now+0.238
Neg logit contributions
side-0.253
 distance-0.224
handed-0.217
 unatt-0.211
thank-0.199
Token could
Token ID714
Feature activation-3.94e-01

Pos logit contributions
side+0.020
thank+0.019
handed+0.019
 distance+0.019
 unatt+0.018
Neg logit contributions
 happens-0.023
apy-0.021
 Everything-0.021
 People-0.020
ours-0.020
Token wear
Token ID5806
Feature activation+1.43e-01

Pos logit contributions
 happens+0.043
 comes+0.038
 Everything+0.036
apy+0.035
oke+0.035
Neg logit contributions
side-0.042
handed-0.039
 distance-0.038
 unatt-0.038
thank-0.036
Token her
Token ID607
Feature activation-3.05e-01

Pos logit contributions
side+0.015
handed+0.013
 distance+0.013
 unatt+0.013
thank+0.012
Neg logit contributions
 happens-0.016
 comes-0.014
 Everything-0.013
apy-0.013
oke-0.013
Token dress
Token ID6576
Feature activation+2.88e-01

Pos logit contributions
 happens+0.032
 comes+0.028
 Everything+0.026
apy+0.026
 Now+0.026
Neg logit contributions
side-0.034
handed-0.031
 distance-0.030
 unatt-0.029
ge-0.029
Token's
Token ID338
Feature activation+3.15e-01

Pos logit contributions
side+0.074
handed+0.069
 distance+0.068
 unatt+0.067
 bank+0.064
Neg logit contributions
 happens-0.052
apy-0.048
 comes-0.048
 Everything-0.046
 Now-0.045
Token great
Token ID1049
Feature activation-4.55e-01

Pos logit contributions
side+0.033
handed+0.030
 unatt+0.030
 distance+0.030
thank+0.027
Neg logit contributions
 happens-0.035
 comes-0.032
 Everything-0.030
apy-0.030
 Now-0.029
Token,
Token ID11
Feature activation+1.90e-03

Pos logit contributions
 happens+0.051
apy+0.047
 Everything+0.047
 comes+0.046
oke+0.043
Neg logit contributions
side-0.043
handed-0.039
 distance-0.039
 unatt-0.038
thank-0.038
Token Molly
Token ID30236
Feature activation-2.17e-01

Pos logit contributions
side+0.000
handed+0.000
 distance+0.000
 unatt+0.000
thank+0.000
Neg logit contributions
 happens-0.000
 comes-0.000
apy-0.000
 Everything-0.000
ours-0.000
Token!
Token ID0
Feature activation-3.99e-01

Pos logit contributions
 happens+0.024
apy+0.022
 Everything+0.022
 comes+0.021
 People+0.021
Neg logit contributions
side-0.020
handed-0.018
 distance-0.018
thank-0.018
 unatt-0.018
Token Now
Token ID2735
Feature activation-2.54e+00

Pos logit contributions
 happens+0.041
 comes+0.034
apy+0.032
oms+0.031
 floats+0.030
Neg logit contributions
side-0.043
handed-0.042
 distance-0.037
thank-0.037
ro-0.036
Token,
Token ID11
Feature activation-4.42e-01

Pos logit contributions
 happens+0.220
apy+0.192
 Everything+0.185
 comes+0.183
oke+0.182
Neg logit contributions
side-0.310
handed-0.279
 unatt-0.277
 distance-0.277
ge-0.267
Token can
Token ID460
Feature activation-6.33e-01

Pos logit contributions
 happens+0.040
 comes+0.034
apy+0.033
 Everything+0.033
oke+0.032
Neg logit contributions
side-0.054
 distance-0.049
handed-0.049
 unatt-0.049
thank-0.046
Token you
Token ID345
Feature activation-1.31e-01

Pos logit contributions
 happens+0.052
 comes+0.043
oke+0.043
ours+0.042
apy+0.041
Neg logit contributions
side-0.077
handed-0.075
 unatt-0.073
 distance-0.070
ge-0.068
Token help
Token ID1037
Feature activation-1.40e-01

Pos logit contributions
 happens+0.012
 comes+0.011
 Everything+0.010
apy+0.010
 Now+0.010
Neg logit contributions
side-0.016
handed-0.015
 distance-0.015
 unatt-0.015
ge-0.014
Token me
Token ID502
Feature activation+1.22e-01

Pos logit contributions
 happens+0.012
 comes+0.010
 Everything+0.009
 Now+0.009
 Everyone+0.009
Neg logit contributions
side-0.019
handed-0.017
 distance-0.017
 unatt-0.017
ge-0.016
Token're
Token ID821
Feature activation+3.47e-01

Pos logit contributions
 happens+0.036
apy+0.034
ours+0.032
oke+0.032
 Now+0.031
Neg logit contributions
side-0.049
handed-0.046
 distance-0.045
thank-0.045
 unatt-0.044
Token right
Token ID826
Feature activation+7.78e-01

Pos logit contributions
side+0.034
 distance+0.032
handed+0.031
 unatt+0.031
thank+0.030
Neg logit contributions
 happens-0.040
apy-0.035
 comes-0.035
oke-0.033
ours-0.033
Token,
Token ID11
Feature activation-3.14e-01

Pos logit contributions
side+0.095
handed+0.090
 distance+0.088
 unatt+0.087
ge+0.083
Neg logit contributions
 happens-0.071
 comes-0.063
 Now-0.058
 Everything-0.057
 It-0.055
Token John
Token ID1757
Feature activation-7.21e-01

Pos logit contributions
 happens+0.029
apy+0.024
 comes+0.024
ours+0.022
 Everything+0.022
Neg logit contributions
side-0.037
 distance-0.035
 unatt-0.034
handed-0.034
thank-0.032
Token!
Token ID0
Feature activation-5.20e-01

Pos logit contributions
 happens+0.085
 comes+0.075
apy+0.074
 Everything+0.074
 Now+0.070
Neg logit contributions
side-0.070
handed-0.063
 distance-0.062
 unatt-0.061
thank-0.059
Token Now
Token ID2735
Feature activation-2.53e+00

Pos logit contributions
 happens+0.047
apy+0.036
 comes+0.036
oms+0.034
ls+0.034
Neg logit contributions
side-0.058
 sill-0.056
thank-0.054
j-0.053
ro-0.053
Token,
Token ID11
Feature activation-4.88e-01

Pos logit contributions
 happens+0.231
apy+0.210
oke+0.205
ours+0.198
 Everything+0.192
Neg logit contributions
side-0.287
 distance-0.264
 unatt-0.262
handed-0.259
thank-0.254
Token let
Token ID1309
Feature activation-2.78e-01

Pos logit contributions
 happens+0.049
apy+0.042
oke+0.042
ours+0.041
 comes+0.040
Neg logit contributions
side-0.051
 distance-0.048
 unatt-0.048
handed-0.046
thank-0.044
Token's
Token ID338
Feature activation-1.82e-01

Pos logit contributions
 happens+0.033
apy+0.032
 importantly+0.030
 comes+0.029
ours+0.029
Neg logit contributions
handed-0.024
side-0.023
 distance-0.021
 unatt-0.020
 bank-0.019
Token see
Token ID766
Feature activation-2.40e-01

Pos logit contributions
 happens+0.018
 comes+0.016
 Everything+0.015
apy+0.015
 Now+0.015
Neg logit contributions
side-0.021
handed-0.019
 distance-0.019
 unatt-0.019
thank-0.018
Token if
Token ID611
Feature activation+3.74e-01

Pos logit contributions
 happens+0.023
 comes+0.020
apy+0.019
 Everything+0.019
 Now+0.019
Neg logit contributions
side-0.028
handed-0.026
 unatt-0.026
 distance-0.025
ge-0.024
Token was
Token ID373
Feature activation-8.50e-01

Pos logit contributions
 happens+0.032
 comes+0.028
 Everything+0.027
apy+0.027
 Now+0.026
Neg logit contributions
side-0.038
handed-0.035
 distance-0.034
 unatt-0.034
thank-0.032
Token over
Token ID625
Feature activation+2.32e-01

Pos logit contributions
 happens+0.095
 comes+0.087
 Everything+0.084
apy+0.083
 Now+0.081
Neg logit contributions
side-0.086
handed-0.078
 distance-0.075
 unatt-0.074
ge-0.072
Token the
Token ID262
Feature activation-2.10e-01

Pos logit contributions
side+0.027
handed+0.025
 unatt+0.025
 distance+0.024
anut+0.023
Neg logit contributions
 happens-0.022
 comes-0.021
 Everything-0.019
 Now-0.019
apy-0.018
Token moon
Token ID8824
Feature activation-1.23e+00

Pos logit contributions
 happens+0.022
 comes+0.020
 Everything+0.019
 Now+0.018
apy+0.018
Neg logit contributions
side-0.022
handed-0.020
 unatt-0.020
ge-0.020
 distance-0.019
Token!
Token ID0
Feature activation-6.82e-01

Pos logit contributions
 happens+0.118
 comes+0.113
apy+0.102
 Everything+0.101
 can+0.100
Neg logit contributions
side-0.128
handed-0.125
 unatt-0.119
 distance-0.117
anut-0.114
Token Now
Token ID2735
Feature activation-2.53e+00

Pos logit contributions
 happens+0.076
 comes+0.064
ours+0.059
oke+0.058
oms+0.057
Neg logit contributions
side-0.074
 distance-0.067
handed-0.066
thank-0.060
ro-0.059
Token she
Token ID673
Feature activation+4.23e-01

Pos logit contributions
 happens+0.257
ours+0.231
oke+0.229
apy+0.226
icks+0.215
Neg logit contributions
side-0.271
 distance-0.240
handed-0.234
 unatt-0.228
ge-0.224
Token could
Token ID714
Feature activation-1.92e-01

Pos logit contributions
side+0.040
 distance+0.038
thank+0.038
handed+0.037
 unatt+0.037
Neg logit contributions
 happens-0.041
 Everything-0.039
apy-0.038
ours-0.037
 People-0.036
Token easily
Token ID3538
Feature activation-2.07e-01

Pos logit contributions
 happens+0.022
 comes+0.019
 Everything+0.018
oke+0.018
apy+0.018
Neg logit contributions
side-0.020
 distance-0.018
handed-0.018
 unatt-0.017
thank-0.016
Token find
Token ID1064
Feature activation+5.90e-01

Pos logit contributions
 happens+0.021
 comes+0.019
apy+0.017
 Everything+0.017
 It+0.017
Neg logit contributions
side-0.024
handed-0.022
 unatt-0.021
 distance-0.021
thank-0.019
Token her
Token ID607
Feature activation-1.19e-01

Pos logit contributions
side+0.056
 distance+0.054
handed+0.052
 unatt+0.050
 sill+0.045
Neg logit contributions
 happens-0.066
oke-0.062
apy-0.058
ours-0.058
 comes-0.058
Token She
Token ID1375
Feature activation-2.03e-01

Pos logit contributions
 happens+0.008
osh+0.007
apy+0.007
 comes+0.006
ye+0.006
Neg logit contributions
side-0.008
 distance-0.007
handed-0.007
 unatt-0.007
ro-0.006
Token saw
Token ID2497
Feature activation+1.70e-02

Pos logit contributions
 happens+0.025
apy+0.022
 Everything+0.022
 People+0.021
 comes+0.021
Neg logit contributions
side-0.018
 distance-0.016
 unatt-0.016
handed-0.016
broken-0.015
Token Lily
Token ID20037
Feature activation+9.72e-02

Pos logit contributions
side+0.002
 distance+0.002
handed+0.002
 unatt+0.002
thank+0.002
Neg logit contributions
 happens-0.001
 comes-0.001
 Everything-0.001
apy-0.001
 Now-0.001
Token and
Token ID290
Feature activation+1.23e-01

Pos logit contributions
side+0.011
 distance+0.010
handed+0.010
 unatt+0.010
thank+0.010
Neg logit contributions
 happens-0.009
 comes-0.009
 Everything-0.008
apy-0.008
 It-0.008
Token Wh
Token ID854
Feature activation+7.08e-01

Pos logit contributions
side+0.014
handed+0.013
 distance+0.013
 unatt+0.013
thank+0.012
Neg logit contributions
 happens-0.012
 comes-0.011
 Everything-0.010
apy-0.010
 Now-0.010
Tokenisk
Token ID1984
Feature activation+3.50e+00

Pos logit contributions
side+0.032
 distance+0.028
 unatt+0.024
handed+0.024
thank+0.019
Neg logit contributions
 happens-0.120
 comes-0.115
 Everything-0.110
oke-0.108
 Now-0.108
Tokeners
Token ID364
Feature activation+4.46e-01

Pos logit contributions
side+0.350
 distance+0.321
 unatt+0.315
handed+0.313
ge+0.288
Neg logit contributions
 happens-0.402
 comes-0.363
 Everything-0.341
apy-0.334
 Now-0.333
Token looking
Token ID2045
Feature activation-9.83e-01

Pos logit contributions
side+0.051
handed+0.047
 distance+0.046
 unatt+0.046
thank+0.044
Neg logit contributions
 happens-0.044
 comes-0.040
 Everything-0.037
apy-0.036
oke-0.035
Token sad
Token ID6507
Feature activation+4.00e-01

Pos logit contributions
 happens+0.124
 comes+0.111
 Everything+0.107
apy+0.106
 Now+0.105
Neg logit contributions
side-0.088
handed-0.079
 distance-0.079
 unatt-0.078
thank-0.071
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
side+0.046
handed+0.042
 distance+0.041
 unatt+0.039
anut+0.039
Neg logit contributions
 happens-0.043
 comes-0.039
 Everything-0.035
 can-0.034
 It-0.034
Token She
Token ID1375
Feature activation-1.04e-01

Pos logit contributions
side+0.002
handed+0.002
 distance+0.002
 unatt+0.002
 bank+0.002
Neg logit contributions
 happens-0.002
 comes-0.002
apy-0.001
osh-0.001
ours-0.001
Token got
Token ID1392
Feature activation+2.05e-01

Pos logit contributions
side+0.046
 distance+0.042
handed+0.042
 unatt+0.041
thank+0.039
Neg logit contributions
 happens-0.037
 comes-0.032
 Everything-0.031
apy-0.030
 Now-0.030
Token back
Token ID736
Feature activation+3.03e-01

Pos logit contributions
side+0.017
 distance+0.016
 unatt+0.015
handed+0.015
thank+0.013
Neg logit contributions
 happens-0.027
 comes-0.024
oke-0.024
 Everything-0.024
 Now-0.023
Token inside
Token ID2641
Feature activation+1.69e-01

Pos logit contributions
side+0.026
 distance+0.024
handed+0.024
 unatt+0.023
ge+0.022
Neg logit contributions
 happens-0.040
 Everything-0.034
 comes-0.033
oke-0.033
 People-0.033
Token,
Token ID11
Feature activation-1.02e-04

Pos logit contributions
side+0.022
 distance+0.020
handed+0.020
 unatt+0.019
ge+0.018
Neg logit contributions
 happens-0.015
 comes-0.012
 Everything-0.012
apy-0.012
ours-0.012
Token Wh
Token ID854
Feature activation+9.05e-01

Pos logit contributions
 happens+0.000
 comes+0.000
apy+0.000
 Everything+0.000
 It+0.000
Neg logit contributions
side-0.000
handed-0.000
 unatt-0.000
 distance-0.000
ge-0.000
Tokenisk
Token ID1984
Feature activation+3.76e+00

Pos logit contributions
side+0.059
 distance+0.056
 unatt+0.046
handed+0.045
thank+0.042
Neg logit contributions
 comes-0.130
 happens-0.129
oke-0.127
 Now-0.120
ours-0.117
Tokeners
Token ID364
Feature activation+6.44e-01

Pos logit contributions
side+0.372
 distance+0.338
 unatt+0.334
handed+0.333
ge+0.307
Neg logit contributions
 happens-0.440
 comes-0.395
 Everything-0.373
apy-0.366
 Now-0.365
Token saw
Token ID2497
Feature activation+1.86e-01

Pos logit contributions
side+0.078
handed+0.072
 distance+0.071
 unatt+0.071
ge+0.066
Neg logit contributions
 happens-0.061
 comes-0.054
 Everything-0.050
apy-0.049
 Now-0.048
Token a
Token ID257
Feature activation+2.19e-01

Pos logit contributions
side+0.024
 distance+0.022
 unatt+0.022
handed+0.022
ge+0.021
Neg logit contributions
 happens-0.017
 comes-0.015
 Everything-0.014
apy-0.013
 Now-0.013
Token teaspoon
Token ID22326
Feature activation-1.29e-01

Pos logit contributions
side+0.025
handed+0.023
 unatt+0.022
 distance+0.022
ge+0.021
Neg logit contributions
 happens-0.023
 comes-0.020
apy-0.019
 Everything-0.019
oke-0.018
Token on
Token ID319
Feature activation+2.50e-01

Pos logit contributions
 happens+0.013
 comes+0.012
apy+0.011
oke+0.010
 can+0.010
Neg logit contributions
side-0.015
handed-0.014
 distance-0.013
 unatt-0.013
thank-0.013
Token the
Token ID262
Feature activation+8.16e-02

Pos logit contributions
 unatt+0.037
side+0.035
ge+0.035
 distance+0.034
 bank+0.032
Neg logit contributions
oke-0.037
ours-0.036
 happens-0.036
icks-0.035
icked-0.035
Token things
Token ID1243
Feature activation-2.99e-01

Pos logit contributions
side+0.009
handed+0.009
 distance+0.008
ge+0.008
 unatt+0.008
Neg logit contributions
 happens-0.008
 comes-0.007
 Everything-0.007
 can-0.007
apy-0.007
Token we
Token ID356
Feature activation+7.63e-01

Pos logit contributions
 Now+0.020
 happens+0.019
 Everything+0.019
apy+0.018
oke+0.018
Neg logit contributions
 unatt-0.037
side-0.036
 distance-0.035
 bank-0.034
handed-0.034
Token eat
Token ID4483
Feature activation+3.88e-01

Pos logit contributions
side+0.089
 unatt+0.086
 distance+0.085
 bank+0.084
handed+0.083
Neg logit contributions
 happens-0.066
apy-0.059
 Now-0.056
ours-0.055
 comes-0.055
Token can
Token ID460
Feature activation+2.29e-03

Pos logit contributions
side+0.041
handed+0.037
 distance+0.037
 unatt+0.036
thank+0.034
Neg logit contributions
 happens-0.043
 comes-0.038
 Everything-0.036
apy-0.036
 Now-0.035
Token affect
Token ID2689
Feature activation+3.24e+00

Pos logit contributions
side+0.000
 distance+0.000
 unatt+0.000
handed+0.000
 spine+0.000
Neg logit contributions
 happens-0.000
 comes-0.000
 can-0.000
 It-0.000
 Everything-0.000
Token our
Token ID674
Feature activation+1.69e-03

Pos logit contributions
side+0.318
handed+0.291
 distance+0.285
 unatt+0.282
thank+0.264
Neg logit contributions
 happens-0.374
 comes-0.333
apy-0.322
 Everything-0.321
 Now-0.311
Token dreams
Token ID10625
Feature activation+3.11e-01

Pos logit contributions
side+0.000
handed+0.000
 distance+0.000
 unatt+0.000
ge+0.000
Neg logit contributions
 happens-0.000
 comes-0.000
 Everything-0.000
apy-0.000
 Now-0.000
Token.
Token ID13
Feature activation+4.43e-01

Pos logit contributions
side+0.028
 unatt+0.025
 distance+0.025
handed+0.025
ge+0.022
Neg logit contributions
 happens-0.038
 comes-0.034
 Everything-0.034
apy-0.034
 Now-0.033
Token
Token ID220
Feature activation+5.63e-01

Pos logit contributions
side+0.048
handed+0.044
 distance+0.044
 unatt+0.041
thank+0.040
Neg logit contributions
 happens-0.048
 comes-0.041
apy-0.039
oms-0.037
ours-0.036
Token
Token ID198
Feature activation+8.14e-02

Pos logit contributions
side+0.054
handed+0.051
 unatt+0.046
 distance+0.045
anut+0.043
Neg logit contributions
 happens-0.068
 comes-0.059
apy-0.055
 turns-0.053
 Everything-0.052
Token.
Token ID13
Feature activation+3.22e-01

Pos logit contributions
 happens+0.009
 comes+0.009
 Everything+0.008
apy+0.008
 Now+0.008
Neg logit contributions
side-0.008
handed-0.008
 distance-0.008
 unatt-0.007
ge-0.007
Token
Token ID198
Feature activation+8.94e-02

Pos logit contributions
side+0.038
handed+0.034
 distance+0.034
 unatt+0.032
 bank+0.031
Neg logit contributions
 happens-0.033
 comes-0.028
apy-0.026
ours-0.025
oms-0.024
Token
Token ID198
Feature activation+5.49e-02

Pos logit contributions
side+0.012
 unatt+0.012
handed+0.011
 distance+0.011
thank+0.011
Neg logit contributions
 happens-0.007
 comes-0.006
 Now-0.006
 Everything-0.006
 It-0.006
TokenAfter
Token ID3260
Feature activation+8.97e-02

Pos logit contributions
side+0.005
handed+0.005
 unatt+0.005
 distance+0.005
 spine+0.005
Neg logit contributions
 happens-0.006
 comes-0.006
 Now-0.006
 Everything-0.006
apy-0.006
Token Wh
Token ID854
Feature activation+1.01e+00

Pos logit contributions
side+0.010
handed+0.009
 unatt+0.008
 distance+0.008
thank+0.008
Neg logit contributions
 happens-0.010
 comes-0.009
oke-0.008
ours-0.008
 Everything-0.008
Tokenisk
Token ID1984
Feature activation+3.75e+00

Pos logit contributions
side+0.060
 distance+0.059
 unatt+0.050
chard+0.046
handed+0.044
Neg logit contributions
 happens-0.147
 comes-0.146
oke-0.145
 Now-0.141
 Everything-0.136
Tokeners
Token ID364
Feature activation+6.75e-01

Pos logit contributions
side+0.386
 distance+0.367
 unatt+0.363
handed+0.348
 spine+0.338
Neg logit contributions
 happens-0.414
 comes-0.367
 Everything-0.344
 Now-0.344
 People-0.330
Token got
Token ID1392
Feature activation+1.32e-01

Pos logit contributions
side+0.082
handed+0.075
 distance+0.074
 unatt+0.074
thank+0.069
Neg logit contributions
 happens-0.064
 comes-0.056
 Everything-0.053
apy-0.052
 Now-0.051
Token better
Token ID1365
Feature activation-3.10e-01

Pos logit contributions
side+0.012
handed+0.011
 distance+0.011
 unatt+0.011
thank+0.010
Neg logit contributions
 happens-0.016
 comes-0.015
 Everything-0.014
apy-0.014
 Now-0.014
Token,
Token ID11
Feature activation+9.38e-03

Pos logit contributions
 happens+0.026
 comes+0.023
apy+0.021
 Everything+0.020
 Now+0.020
Neg logit contributions
side-0.041
handed-0.038
 unatt-0.038
 distance-0.038
anut-0.036
Token he
Token ID339
Feature activation+3.94e-01

Pos logit contributions
side+0.001
handed+0.001
 unatt+0.001
 distance+0.001
thank+0.001
Neg logit contributions
 happens-0.001
 comes-0.001
 Everything-0.001
apy-0.001
 Now-0.001
Token then
Token ID788
Feature activation+9.47e-01

Pos logit contributions
side+0.052
handed+0.049
 distance+0.048
 unatt+0.048
thank+0.045
Neg logit contributions
 happens-0.033
 comes-0.028
apy-0.026
 Everything-0.026
 Now-0.025
Token accidentally
Token ID14716
Feature activation+3.04e-01

Pos logit contributions
side+0.103
 distance+0.091
 unatt+0.091
handed+0.090
 bank+0.089
Neg logit contributions
 happens-0.095
 comes-0.083
oke-0.083
ours-0.081
 Now-0.081
Token pin
Token ID6757
Feature activation+8.43e-01

Pos logit contributions
side+0.030
handed+0.028
 unatt+0.027
 distance+0.027
ge+0.025
Neg logit contributions
 happens-0.034
 comes-0.031
 Everything-0.030
 Now-0.029
apy-0.029
Tokenched
Token ID1740
Feature activation+2.16e-01

Pos logit contributions
side+0.102
handed+0.094
 distance+0.094
 unatt+0.092
thank+0.086
Neg logit contributions
 happens-0.082
 comes-0.071
apy-0.067
 Everything-0.067
 Now-0.064
Token Wh
Token ID854
Feature activation+7.29e-01

Pos logit contributions
side+0.029
handed+0.026
 distance+0.026
 unatt+0.026
thank+0.024
Neg logit contributions
 happens-0.019
 comes-0.017
 It-0.016
 Everything-0.016
apy-0.016
Tokenisk
Token ID1984
Feature activation+3.55e+00

Pos logit contributions
side+0.041
 distance+0.040
 unatt+0.033
handed+0.032
thank+0.028
Neg logit contributions
 happens-0.113
 comes-0.110
oke-0.104
 Everything-0.103
 Now-0.103
Tokeners
Token ID364
Feature activation+4.60e-01

Pos logit contributions
side+0.351
 distance+0.321
 unatt+0.316
handed+0.314
thank+0.289
Neg logit contributions
 happens-0.415
 comes-0.371
 Everything-0.352
apy-0.347
 Now-0.344
Token with
Token ID351
Feature activation-1.54e-01

Pos logit contributions
side+0.051
 distance+0.048
thank+0.048
handed+0.046
 former+0.045
Neg logit contributions
 happens-0.043
 comes-0.040
 can-0.037
apy-0.037
 It-0.035
Token the
Token ID262
Feature activation-3.38e-01

Pos logit contributions
 happens+0.015
 comes+0.013
apy+0.013
 Everything+0.013
 Now+0.012
Neg logit contributions
side-0.018
handed-0.017
 distance-0.016
 unatt-0.016
thank-0.015
Token brush
Token ID14093
Feature activation+5.54e-01

Pos logit contributions
 happens+0.035
 comes+0.032
apy+0.030
 Everything+0.030
 can+0.029
Neg logit contributions
side-0.038
handed-0.035
 unatt-0.034
 distance-0.033
ge-0.032
Token.
Token ID13
Feature activation-3.93e-02

Pos logit contributions
handed+0.060
side+0.059
 distance+0.057
anut+0.056
 unatt+0.056
Neg logit contributions
 happens-0.054
apy-0.049
 comes-0.048
 It-0.047
 can-0.045
Token went
Token ID1816
Feature activation-4.16e-01

Pos logit contributions
side+0.083
handed+0.076
 distance+0.075
 unatt+0.074
ge+0.070
Neg logit contributions
 happens-0.073
 comes-0.065
 Everything-0.061
apy-0.060
 Now-0.059
Token to
Token ID284
Feature activation+2.07e-01

Pos logit contributions
 happens+0.046
 comes+0.042
 Everything+0.039
apy+0.038
 Now+0.038
Neg logit contributions
side-0.044
handed-0.041
 unatt-0.039
 distance-0.039
ge-0.037
Token the
Token ID262
Feature activation-1.22e-01

Pos logit contributions
side+0.028
 unatt+0.025
handed+0.025
 distance+0.025
ge+0.024
Neg logit contributions
 happens-0.017
 comes-0.016
 Everything-0.014
 It-0.014
 turns-0.013
Token market
Token ID1910
Feature activation+9.03e-01

Pos logit contributions
 happens+0.011
 comes+0.010
 Everything+0.010
 Now+0.009
 can+0.009
Neg logit contributions
ge-0.015
side-0.014
 unatt-0.013
handed-0.013
broken-0.012
Token to
Token ID284
Feature activation+2.32e-01

Pos logit contributions
side+0.112
 unatt+0.111
 former+0.109
handed+0.106
anut+0.104
Neg logit contributions
 of-0.057
 comes-0.055
 Now-0.053
 It-0.053
 We-0.051
Token buy
Token ID2822
Feature activation+1.40e+00

Pos logit contributions
side+0.022
 unatt+0.019
ge+0.018
 distance+0.018
handed+0.018
Neg logit contributions
 happens-0.028
 comes-0.027
 Everything-0.025
 It-0.025
 Now-0.024
Token some
Token ID617
Feature activation+1.75e-01

Pos logit contributions
side+0.153
handed+0.138
 distance+0.138
 unatt+0.137
ge+0.128
Neg logit contributions
 happens-0.150
 comes-0.133
 Everything-0.126
apy-0.125
 Now-0.122
Token vegetables
Token ID13701
Feature activation+1.36e-01

Pos logit contributions
side+0.016
 distance+0.015
handed+0.015
thank+0.014
 bank+0.014
Neg logit contributions
 happens-0.021
 comes-0.018
 Everything-0.018
oke-0.018
apy-0.017
Token to
Token ID284
Feature activation+2.59e-01

Pos logit contributions
side+0.014
handed+0.013
 distance+0.013
 unatt+0.013
ge+0.012
Neg logit contributions
 happens-0.014
 comes-0.013
 can-0.012
 It-0.012
 Everything-0.012
Token cook
Token ID4255
Feature activation+4.08e-01

Pos logit contributions
side+0.026
handed+0.023
 unatt+0.023
 distance+0.023
ge+0.021
Neg logit contributions
 happens-0.030
 comes-0.027
 Everything-0.026
apy-0.026
 Now-0.025
Token.
Token ID13
Feature activation-9.22e-02

Pos logit contributions
side+0.039
handed+0.035
 distance+0.035
 unatt+0.035
ge+0.032
Neg logit contributions
 happens-0.049
 comes-0.044
apy-0.042
 Everything-0.042
 Now-0.041
Token or
Token ID393
Feature activation-6.08e-03

Pos logit contributions
side+0.006
handed+0.006
 distance+0.006
 unatt+0.005
ge+0.005
Neg logit contributions
 happens-0.008
 comes-0.007
 Everything-0.007
 Now-0.007
apy-0.007
Token hurt
Token ID5938
Feature activation+7.46e-01

Pos logit contributions
 happens+0.001
 comes+0.001
apy+0.001
 Everything+0.001
 can+0.001
Neg logit contributions
side-0.001
handed-0.001
 distance-0.001
 unatt-0.001
thank-0.000
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
side+0.080
handed+0.071
 distance+0.070
 unatt+0.069
thank+0.068
Neg logit contributions
 happens-0.082
 comes-0.076
 Everything-0.069
 It-0.068
apy-0.067
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
side+0.020
handed+0.018
 distance+0.018
 unatt+0.017
thank+0.016
Neg logit contributions
 happens-0.016
 comes-0.013
apy-0.012
ours-0.012
 Everything-0.012
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
side+0.019
 unatt+0.018
handed+0.018
 distance+0.018
thank+0.017
Neg logit contributions
 happens-0.012
 comes-0.010
 Everything-0.010
 Now-0.010
apy-0.010
TokenL
Token ID43
Feature activation+1.48e+00

Pos logit contributions
side+0.012
 unatt+0.011
handed+0.011
 distance+0.011
 spine+0.011
Neg logit contributions
 happens-0.014
 Now-0.013
 comes-0.013
apy-0.013
 Everything-0.012
Tokenily
Token ID813
Feature activation+3.58e-01

Pos logit contributions
side+0.190
 distance+0.175
handed+0.173
 unatt+0.172
thank+0.163
Neg logit contributions
 happens-0.133
 comes-0.115
 Everything-0.107
apy-0.104
 Now-0.104
Token waved
Token ID26834
Feature activation-5.12e-01

Pos logit contributions
side+0.043
handed+0.039
 unatt+0.039
 distance+0.039
ge+0.037
Neg logit contributions
 happens-0.035
 comes-0.032
 Everything-0.029
 Now-0.028
apy-0.028
Token to
Token ID284
Feature activation+1.91e-01

Pos logit contributions
 happens+0.059
 comes+0.052
 Everything+0.050
apy+0.050
 Now+0.049
Neg logit contributions
side-0.051
 distance-0.046
handed-0.046
 unatt-0.046
thank-0.042
Token the
Token ID262
Feature activation-2.11e-01

Pos logit contributions
side+0.029
 unatt+0.026
handed+0.026
 distance+0.025
broken+0.024
Neg logit contributions
 happens-0.013
 comes-0.012
 Everything-0.011
 It-0.010
 Everyone-0.010
Token vet
Token ID20202
Feature activation+1.36e-01

Pos logit contributions
 happens+0.022
 comes+0.020
 Everything+0.018
apy+0.017
oke+0.017
Neg logit contributions
side-0.025
handed-0.022
 unatt-0.022
 distance-0.021
ge-0.021
Token waved
Token ID26834
Feature activation-5.93e-01

Pos logit contributions
side+0.031
 distance+0.028
handed+0.028
 unatt+0.028
thank+0.026
Neg logit contributions
 happens-0.025
 comes-0.022
 Everything-0.021
apy-0.020
 Now-0.020
Token back
Token ID736
Feature activation+2.35e-01

Pos logit contributions
 happens+0.063
 comes+0.056
 Everything+0.053
apy+0.053
 Now+0.052
Neg logit contributions
side-0.065
handed-0.059
 distance-0.059
 unatt-0.058
thank-0.054
Token and
Token ID290
Feature activation-1.73e-01

Pos logit contributions
side+0.024
handed+0.022
 distance+0.022
 unatt+0.021
ge+0.020
Neg logit contributions
 happens-0.027
 comes-0.024
 Everything-0.023
apy-0.023
 Now-0.022
Token the
Token ID262
Feature activation-2.65e-01

Pos logit contributions
 happens+0.017
 comes+0.015
 Everything+0.014
apy+0.014
ours+0.014
Neg logit contributions
side-0.020
 distance-0.019
 unatt-0.018
handed-0.018
thank-0.017
Token se
Token ID384
Feature activation+1.19e+00

Pos logit contributions
 happens+0.024
 comes+0.021
 Everything+0.019
 turns+0.019
apy+0.018
Neg logit contributions
side-0.034
handed-0.030
 unatt-0.030
ge-0.029
 distance-0.029
Tokenag
Token ID363
Feature activation+1.56e+00

Pos logit contributions
side+0.126
 distance+0.116
 unatt+0.114
handed+0.110
ge+0.102
Neg logit contributions
 happens-0.132
 comes-0.119
oke-0.112
 Now-0.109
 Everything-0.107
Tokenull
Token ID724
Feature activation-1.46e-01

Pos logit contributions
side+0.230
 unatt+0.217
handed+0.215
 distance+0.213
thank+0.202
Neg logit contributions
 happens-0.107
 comes-0.090
apy-0.085
 Everything-0.081
 Now-0.077
Token flew
Token ID13112
Feature activation-1.57e-02

Pos logit contributions
 happens+0.013
 comes+0.011
apy+0.011
 Everything+0.010
oke+0.010
Neg logit contributions
side-0.019
handed-0.018
 unatt-0.017
 distance-0.017
thank-0.016
Token away
Token ID1497
Feature activation+5.42e-01

Pos logit contributions
 happens+0.002
 comes+0.002
 It+0.002
 turns+0.002
 Everything+0.002
Neg logit contributions
handed-0.001
side-0.001
 unatt-0.001
anut-0.001
ge-0.001
Token.
Token ID13
Feature activation-8.34e-02

Pos logit contributions
side+0.066
handed+0.066
anut+0.061
 unatt+0.060
ppa+0.060
Neg logit contributions
 happens-0.045
 comes-0.042
apy-0.038
 can-0.035
 It-0.035
Token
Token ID220
Feature activation+2.69e-01

Pos logit contributions
 happens+0.009
 comes+0.008
apy+0.007
ours+0.007
oke+0.007
Neg logit contributions
side-0.009
handed-0.008
 distance-0.008
thank-0.008
 unatt-0.007
Token around
Token ID1088
Feature activation-1.49e-03

Pos logit contributions
side+0.128
 distance+0.115
 unatt+0.114
handed+0.112
thank+0.105
Neg logit contributions
 happens-0.137
apy-0.124
 Everything-0.120
 comes-0.120
ours-0.116
Token you
Token ID345
Feature activation+3.87e-02

Pos logit contributions
 happens+0.000
 comes+0.000
apy+0.000
 Everything+0.000
oke+0.000
Neg logit contributions
side-0.000
 distance-0.000
 unatt-0.000
handed-0.000
ge-0.000
Token,
Token ID11
Feature activation-1.89e-01

Pos logit contributions
side+0.004
 distance+0.004
 unatt+0.004
handed+0.004
ge+0.004
Neg logit contributions
 happens-0.004
apy-0.003
 comes-0.003
 Everything-0.003
 Now-0.003
Token then
Token ID788
Feature activation+1.05e+00

Pos logit contributions
 happens+0.023
 comes+0.021
 Everything+0.020
apy+0.020
 Now+0.019
Neg logit contributions
side-0.018
 distance-0.016
handed-0.016
 unatt-0.016
thank-0.014
Token say
Token ID910
Feature activation-3.14e-02

Pos logit contributions
side+0.127
 distance+0.120
 unatt+0.120
handed+0.118
thank+0.115
Neg logit contributions
 happens-0.084
apy-0.075
oke-0.073
 Everything-0.072
ours-0.071
Token '
Token ID705
Feature activation+1.11e+00

Pos logit contributions
 happens+0.003
ours+0.002
oke+0.002
 comes+0.002
apy+0.002
Neg logit contributions
side-0.004
 unatt-0.003
thank-0.003
 distance-0.003
handed-0.003
TokenI
Token ID40
Feature activation+2.80e-01

Pos logit contributions
side+0.111
handed+0.100
 distance+0.100
 unatt+0.099
 spine+0.090
Neg logit contributions
 happens-0.125
 comes-0.112
 Everything-0.107
apy-0.106
 Now-0.105
Token recognize
Token ID7564
Feature activation+5.98e-01

Pos logit contributions
side+0.032
 distance+0.030
handed+0.030
thank+0.029
 unatt+0.028
Neg logit contributions
 happens-0.026
apy-0.022
 Everything-0.022
 comes-0.021
ours-0.021
Token the
Token ID262
Feature activation-2.53e-01

Pos logit contributions
side+0.068
handed+0.062
 unatt+0.061
 distance+0.060
thank+0.056
Neg logit contributions
 happens-0.058
 comes-0.052
apy-0.051
oke-0.049
 Everything-0.049
Token shield
Token ID7614
Feature activation-6.41e-02

Pos logit contributions
 happens+0.026
 comes+0.023
 Everything+0.021
apy+0.020
 Now+0.020
Neg logit contributions
side-0.029
handed-0.027
 unatt-0.026
 distance-0.026
ge-0.025
Token.'
Token ID2637
Feature activation-3.78e-01

Pos logit contributions
 happens+0.007
 comes+0.006
 Everything+0.006
apy+0.006
 Now+0.005
Neg logit contributions
side-0.007
handed-0.006
 distance-0.006
 unatt-0.006
thank-0.006
Token many
Token ID867
Feature activation+1.45e-01

Pos logit contributions
 happens+0.075
 comes+0.067
 Everything+0.064
apy+0.064
 Now+0.062
Neg logit contributions
side-0.077
 distance-0.070
handed-0.070
 unatt-0.069
thank-0.064
Token surprises
Token ID24072
Feature activation-3.20e-01

Pos logit contributions
side+0.014
 distance+0.014
 unatt+0.013
handed+0.013
 bank+0.012
Neg logit contributions
 happens-0.016
 Everything-0.015
 Now-0.014
 comes-0.014
oke-0.014
Token."
Token ID526
Feature activation-1.28e-01

Pos logit contributions
 happens+0.040
 Everything+0.037
apy+0.036
 comes+0.035
 Now+0.035
Neg logit contributions
side-0.028
 unatt-0.025
 distance-0.024
handed-0.024
thank-0.022
Token
Token ID198
Feature activation+2.75e-01

Pos logit contributions
 happens+0.014
 comes+0.013
apy+0.012
 Everything+0.012
oke+0.012
Neg logit contributions
side-0.013
 distance-0.012
handed-0.012
 unatt-0.011
thank-0.011
Token
Token ID198
Feature activation+2.49e-01

Pos logit contributions
side+0.032
handed+0.030
 distance+0.029
 unatt+0.029
thank+0.027
Neg logit contributions
 happens-0.027
 comes-0.024
 Everything-0.022
apy-0.022
 Now-0.022
TokenL
Token ID43
Feature activation+1.42e+00

Pos logit contributions
side+0.023
handed+0.021
 unatt+0.021
 distance+0.020
thank+0.019
Neg logit contributions
 happens-0.030
 comes-0.027
 Now-0.026
apy-0.026
 Everything-0.026
Tokenily
Token ID813
Feature activation+2.79e-01

Pos logit contributions
side+0.159
handed+0.149
 unatt+0.145
 distance+0.141
ge+0.135
Neg logit contributions
 happens-0.143
 comes-0.125
apy-0.124
 Everything-0.120
 Now-0.114
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
side+0.028
handed+0.025
 distance+0.025
 unatt+0.025
ge+0.023
Neg logit contributions
 happens-0.032
 comes-0.029
 Everything-0.027
 Now-0.027
apy-0.027
Token Sam
Token ID3409
Feature activation-6.75e-01

Pos logit contributions
side+0.032
handed+0.029
 distance+0.029
 unatt+0.029
ge+0.028
Neg logit contributions
 happens-0.016
 comes-0.013
 Everything-0.012
apy-0.012
 Now-0.011
Token sniff
Token ID26300
Feature activation-1.15e-01

Pos logit contributions
 happens+0.050
 comes+0.043
 Everything+0.039
apy+0.038
 Now+0.036
Neg logit contributions
side-0.096
handed-0.089
 unatt-0.088
 distance-0.087
ge-0.084
Tokened
Token ID276
Feature activation+1.19e+00

Pos logit contributions
 happens+0.011
 comes+0.010
 Everything+0.010
apy+0.009
 Now+0.009
Neg logit contributions
side-0.013
handed-0.012
 distance-0.012
 unatt-0.012
thank-0.011
Token She
Token ID1375
Feature activation-5.12e-02

Pos logit contributions
side+0.011
handed+0.010
 distance+0.010
 unatt+0.010
 sill+0.009
Neg logit contributions
 happens-0.011
 comes-0.009
apy-0.008
ye-0.008
oke-0.008
Token did
Token ID750
Feature activation-4.85e-02

Pos logit contributions
 happens+0.005
 comes+0.004
apy+0.004
 Everything+0.004
 Now+0.004
Neg logit contributions
side-0.006
handed-0.006
 unatt-0.006
 distance-0.006
ge-0.005
Token not
Token ID407
Feature activation+4.10e-01

Pos logit contributions
 happens+0.006
 comes+0.005
apy+0.005
 turns+0.005
 Everything+0.005
Neg logit contributions
side-0.005
handed-0.004
 distance-0.004
 unatt-0.004
thank-0.004
Token like
Token ID588
Feature activation+1.36e+00

Pos logit contributions
side+0.051
handed+0.045
 unatt+0.044
 distance+0.043
ge+0.040
Neg logit contributions
 happens-0.041
 comes-0.037
 Everything-0.032
 turns-0.032
apy-0.031
Token clown
Token ID25573
Feature activation-3.93e-01

Pos logit contributions
side+0.161
handed+0.148
 distance+0.148
 unatt+0.147
ge+0.137
Neg logit contributions
 happens-0.130
 comes-0.113
 Everything-0.108
apy-0.108
 Now-0.105
Tokens
Token ID82
Feature activation+9.62e-01

Pos logit contributions
 happens+0.034
 comes+0.029
 Everything+0.028
apy+0.027
 Now+0.026
Neg logit contributions
side-0.051
handed-0.047
 distance-0.046
 unatt-0.046
thank-0.044
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
side+0.100
handed+0.092
 distance+0.090
 unatt+0.085
anut+0.083
Neg logit contributions
 happens-0.110
 comes-0.100
 Everything-0.088
 can-0.088
apy-0.088
Token She
Token ID1375
Feature activation+1.11e-03

Pos logit contributions
side+0.018
 distance+0.016
handed+0.016
 unatt+0.016
thank+0.015
Neg logit contributions
 happens-0.016
 comes-0.013
apy-0.013
oke-0.012
 Everything-0.012
Token thought
Token ID1807
Feature activation-1.39e-02

Pos logit contributions
side+0.000
handed+0.000
 unatt+0.000
 distance+0.000
ge+0.000
Neg logit contributions
 happens-0.000
 comes-0.000
apy-0.000
 Everything-0.000
 Now-0.000
Token they
Token ID484
Feature activation+1.74e-01

Pos logit contributions
 happens+0.001
 Now+0.001
 Everything+0.001
 comes+0.001
ours+0.001
Neg logit contributions
side-0.002
 distance-0.001
 unatt-0.001
handed-0.001
ge-0.001
Token were
Token ID547
Feature activation+3.85e-01

Pos logit contributions
side+0.025
handed+0.023
 distance+0.023
 unatt+0.023
ge+0.021
Neg logit contributions
 happens-0.013
 comes-0.011
 Everything-0.010
apy-0.010
 turns-0.010
Token was
Token ID373
Feature activation-6.08e-02

Pos logit contributions
 happens+0.033
 comes+0.028
 Everything+0.026
apy+0.025
 Now+0.024
Neg logit contributions
side-0.053
handed-0.051
 unatt-0.049
 distance-0.049
ge-0.047
Token very
Token ID845
Feature activation+2.78e-01

Pos logit contributions
 happens+0.007
 comes+0.006
apy+0.006
 Everything+0.006
oke+0.006
Neg logit contributions
side-0.006
 distance-0.005
 unatt-0.005
handed-0.005
thank-0.005
Token interesting
Token ID3499
Feature activation+3.69e-01

Pos logit contributions
side+0.021
 distance+0.021
handed+0.019
 unatt+0.019
anut+0.018
Neg logit contributions
 happens-0.037
 comes-0.033
oke-0.032
ours-0.032
 Everything-0.032
Token to
Token ID284
Feature activation+2.81e-01

Pos logit contributions
side+0.028
 unatt+0.024
 distance+0.024
handed+0.023
ge+0.021
Neg logit contributions
 happens-0.049
apy-0.046
 Everything-0.045
 comes-0.045
oke-0.044
Token see
Token ID766
Feature activation+3.23e-02

Pos logit contributions
side+0.033
handed+0.029
 unatt+0.029
 distance+0.029
ge+0.027
Neg logit contributions
 happens-0.029
 comes-0.026
 Everything-0.025
apy-0.024
 Now-0.024
Token all
Token ID477
Feature activation-2.72e-01

Pos logit contributions
side+0.004
 unatt+0.003
handed+0.003
 distance+0.003
ge+0.003
Neg logit contributions
 happens-0.003
 comes-0.003
apy-0.003
 Everything-0.003
 Now-0.003
Token the
Token ID262
Feature activation-8.53e-02

Pos logit contributions
 happens+0.026
 comes+0.022
apy+0.022
 Everything+0.021
 Now+0.021
Neg logit contributions
side-0.032
 distance-0.031
handed-0.030
 unatt-0.030
thank-0.028
Token pretty
Token ID2495
Feature activation+3.40e-01

Pos logit contributions
 happens+0.008
 comes+0.008
 Everything+0.007
 turns+0.007
 People+0.007
Neg logit contributions
side-0.010
handed-0.009
 unatt-0.009
 distance-0.009
ge-0.009
Token colors
Token ID7577
Feature activation+1.52e-01

Pos logit contributions
side+0.032
 distance+0.031
handed+0.029
 unatt+0.028
thank+0.027
Neg logit contributions
 happens-0.037
oke-0.035
 Everything-0.034
apy-0.033
 Now-0.033
Token in
Token ID287
Feature activation+3.29e-01

Pos logit contributions
side+0.014
 distance+0.013
 unatt+0.013
handed+0.012
ge+0.012
Neg logit contributions
 happens-0.018
 comes-0.016
 Everything-0.016
 Now-0.016
apy-0.016
Token the
Token ID262
Feature activation-7.89e-02

Pos logit contributions
side+0.049
handed+0.045
 unatt+0.044
thank+0.044
 spine+0.044
Neg logit contributions
 happens-0.022
 comes-0.019
 It-0.017
 Everything-0.017
apy-0.017
Token haircut
Token ID45548
Feature activation+3.43e-01

Pos logit contributions
side+0.007
 distance+0.007
handed+0.007
 unatt+0.007
thank+0.006
Neg logit contributions
 happens-0.008
 comes-0.008
 Everything-0.007
apy-0.007
 Now-0.007
Token.
Token ID13
Feature activation-8.55e-02

Pos logit contributions
side+0.039
handed+0.037
 distance+0.036
 unatt+0.035
anut+0.034
Neg logit contributions
 happens-0.034
 comes-0.030
apy-0.027
 can-0.027
 Everything-0.027
Token She
Token ID1375
Feature activation+1.38e-01

Pos logit contributions
 happens+0.008
 comes+0.006
apy+0.006
 Everything+0.006
 Now+0.006
Neg logit contributions
side-0.011
handed-0.010
 distance-0.010
 unatt-0.010
thank-0.009
Token asked
Token ID1965
Feature activation-8.22e-01

Pos logit contributions
side+0.017
handed+0.016
 distance+0.016
 unatt+0.016
ge+0.015
Neg logit contributions
 happens-0.013
 comes-0.011
 Everything-0.010
apy-0.010
 Now-0.010
Token her
Token ID607
Feature activation+9.97e-02

Pos logit contributions
 happens+0.116
 comes+0.104
apy+0.104
oke+0.102
 Everything+0.100
Neg logit contributions
side-0.059
 unatt-0.050
handed-0.048
 distance-0.048
thank-0.047
Token mom
Token ID1995
Feature activation+4.40e-01

Pos logit contributions
side+0.009
handed+0.008
 distance+0.008
 unatt+0.008
anut+0.007
Neg logit contributions
 happens-0.012
 comes-0.010
apy-0.010
 Everything-0.010
 Now-0.010
Token if
Token ID611
Feature activation+5.51e-01

Pos logit contributions
side+0.047
handed+0.043
 distance+0.042
 unatt+0.042
thank+0.039
Neg logit contributions
 happens-0.048
 comes-0.043
 Everything-0.041
apy-0.040
 Now-0.039
Token she
Token ID673
Feature activation+7.29e-01

Pos logit contributions
side+0.046
 unatt+0.043
 distance+0.041
handed+0.039
thank+0.039
Neg logit contributions
 happens-0.062
oke-0.061
 Now-0.060
icks-0.060
ay-0.059
Token could
Token ID714
Feature activation+3.09e-02

Pos logit contributions
side+0.071
thank+0.070
handed+0.070
 unatt+0.069
ety+0.069
Neg logit contributions
 happens-0.064
apy-0.062
 Everything-0.060
 comes-0.059
oms-0.059
Token get
Token ID651
Feature activation+7.51e-02

Pos logit contributions
side+0.003
handed+0.003
 distance+0.003
 unatt+0.003
thank+0.003
Neg logit contributions
 happens-0.003
 comes-0.003
 Everything-0.003
apy-0.003
 Now-0.003
Token a
Token ID257
Feature activation+8.89e-02

Pos logit contributions
side+0.008
handed+0.007
 distance+0.007
 unatt+0.007
thank+0.006
Neg logit contributions
 happens-0.009
 comes-0.008
 Everything-0.007
apy-0.007
 Now-0.007
Token her
Token ID607
Feature activation+8.01e-02

Pos logit contributions
side+0.062
handed+0.055
 distance+0.054
 unatt+0.054
ge+0.053
Neg logit contributions
 happens-0.060
 comes-0.053
 Everything-0.051
apy-0.050
 Now-0.048
Token heart
Token ID2612
Feature activation-4.93e-01

Pos logit contributions
side+0.010
 distance+0.009
handed+0.009
 unatt+0.009
 bank+0.008
Neg logit contributions
 happens-0.007
 comes-0.006
apy-0.006
 Everything-0.006
oke-0.006
Token sink
Token ID14595
Feature activation+3.96e-01

Pos logit contributions
 happens+0.056
 comes+0.052
apy+0.050
 can+0.047
 Everything+0.047
Neg logit contributions
side-0.045
handed-0.045
 unatt-0.042
 distance-0.042
anut-0.040
Token.
Token ID13
Feature activation+1.18e-01

Pos logit contributions
side+0.042
handed+0.041
 distance+0.039
 unatt+0.039
anut+0.037
Neg logit contributions
 happens-0.040
 comes-0.038
apy-0.033
 can-0.033
 It-0.032
Token She
Token ID1375
Feature activation-1.23e-01

Pos logit contributions
side+0.014
handed+0.012
 distance+0.012
 unatt+0.012
thank+0.011
Neg logit contributions
 happens-0.012
 comes-0.010
apy-0.010
oke-0.009
ours-0.009
Token wondered
Token ID14028
Feature activation-1.33e-01

Pos logit contributions
 happens+0.013
 comes+0.011
 Everything+0.011
apy+0.011
 Now+0.010
Neg logit contributions
side-0.013
handed-0.012
 distance-0.012
 unatt-0.012
thank-0.011
Token if
Token ID611
Feature activation+5.45e-01

Pos logit contributions
 happens+0.012
 comes+0.010
 Everything+0.009
apy+0.009
 Now+0.009
Neg logit contributions
side-0.017
handed-0.016
 unatt-0.016
 distance-0.016
ge-0.015
Token Rose
Token ID8049
Feature activation-2.65e-01

Pos logit contributions
side+0.056
 unatt+0.050
 distance+0.049
handed+0.049
 missing+0.045
Neg logit contributions
 happens-0.053
oke-0.053
apy-0.050
 Now-0.049
 comes-0.049
Token was
Token ID373
Feature activation-2.42e-01

Pos logit contributions
 happens+0.025
 comes+0.021
apy+0.021
 Everything+0.021
 Now+0.020
Neg logit contributions
side-0.032
handed-0.029
 distance-0.029
 unatt-0.028
ge-0.027
Token in
Token ID287
Feature activation+2.71e-01

Pos logit contributions
 happens+0.026
 comes+0.023
 Everything+0.021
apy+0.021
 Now+0.020
Neg logit contributions
side-0.027
handed-0.024
 distance-0.024
 unatt-0.023
thank-0.022
Token one
Token ID530
Feature activation+6.95e-01

Pos logit contributions
side+0.038
handed+0.036
anut+0.035
 unatt+0.035
thank+0.035
Neg logit contributions
 happens-0.019
 comes-0.017
 It-0.015
 turns-0.015
 can-0.015
TokenElizabeth
Token ID43568
Feature activation-6.23e-01

Pos logit contributions
 happens+0.002
 comes+0.002
 Everything+0.002
apy+0.002
 Now+0.002
Neg logit contributions
side-0.002
handed-0.002
 distance-0.002
 unatt-0.002
thank-0.002
Token's
Token ID338
Feature activation-1.61e-01

Pos logit contributions
 happens+0.063
 comes+0.055
 Everything+0.053
apy+0.052
ours+0.051
Neg logit contributions
side-0.071
handed-0.064
 distance-0.063
 unatt-0.061
thank-0.061
Token eyes
Token ID2951
Feature activation-1.14e+00

Pos logit contributions
 happens+0.014
 comes+0.012
 Everything+0.011
apy+0.011
 Now+0.010
Neg logit contributions
side-0.022
handed-0.020
 distance-0.020
 unatt-0.019
ge-0.019
Token lit
Token ID6578
Feature activation+7.19e-01

Pos logit contributions
 happens+0.112
 comes+0.100
apy+0.098
 Everything+0.092
 It+0.090
Neg logit contributions
handed-0.127
side-0.126
 unatt-0.122
anut-0.120
 distance-0.118
Token up
Token ID510
Feature activation+8.55e-01

Pos logit contributions
 unatt+0.081
handed+0.078
side+0.078
 spine+0.077
 distance+0.077
Neg logit contributions
 happens-0.067
 turns-0.063
 It-0.060
 comes-0.056
 Everyone-0.055
Token with
Token ID351
Feature activation-1.27e-01

Pos logit contributions
handed+0.114
side+0.107
keeping+0.105
 unatt+0.105
anut+0.104
Neg logit contributions
 of-0.054
 happens-0.050
apy-0.050
 comes-0.049
 It-0.048
Token excitement
Token ID14067
Feature activation-5.37e-01

Pos logit contributions
 happens+0.011
 comes+0.010
 Everything+0.009
apy+0.009
 Now+0.009
Neg logit contributions
side-0.016
handed-0.015
 unatt-0.015
 distance-0.014
ge-0.014
Token.
Token ID13
Feature activation-2.19e-02

Pos logit contributions
 happens+0.057
 comes+0.052
 Everything+0.048
apy+0.048
 It+0.047
Neg logit contributions
side-0.055
handed-0.053
 distance-0.052
 unatt-0.051
anut-0.048
Token She
Token ID1375
Feature activation-2.14e-01

Pos logit contributions
 happens+0.002
 comes+0.002
apy+0.002
ye+0.002
oke+0.002
Neg logit contributions
side-0.002
 distance-0.002
handed-0.002
thank-0.002
 sill-0.002
Token was
Token ID373
Feature activation-4.36e-01

Pos logit contributions
 happens+0.024
 comes+0.022
 Everything+0.021
ours+0.021
oke+0.021
Neg logit contributions
side-0.021
 distance-0.018
thank-0.018
broken-0.018
handed-0.018
Token ready
Token ID3492
Feature activation-1.06e+00

Pos logit contributions
 happens+0.053
 comes+0.047
 Everything+0.045
apy+0.045
 Now+0.044
Neg logit contributions
side-0.042
handed-0.037
 distance-0.037
 unatt-0.037
thank-0.034
TokenHelp
Token ID22087
Feature activation-3.84e-01

Pos logit contributions
 happens+0.019
 comes+0.016
apy+0.016
 Everything+0.016
 Now+0.015
Neg logit contributions
side-0.020
 unatt-0.018
handed-0.018
 distance-0.017
ge-0.017
Token!
Token ID0
Feature activation-4.36e-01

Pos logit contributions
 happens+0.037
 comes+0.032
apy+0.031
 Everything+0.031
 Now+0.029
Neg logit contributions
side-0.046
handed-0.042
 distance-0.042
 unatt-0.042
ge-0.039
Token Help
Token ID10478
Feature activation-1.02e+00

Pos logit contributions
 happens+0.042
 comes+0.034
apy+0.032
oke+0.031
ours+0.031
Neg logit contributions
side-0.050
handed-0.049
 unatt-0.047
 distance-0.046
ro-0.042
Token!"
Token ID2474
Feature activation-2.93e-01

Pos logit contributions
 happens+0.102
apy+0.089
 comes+0.089
 Everything+0.086
 Now+0.083
Neg logit contributions
side-0.116
 distance-0.107
handed-0.106
 unatt-0.105
ge-0.098
Token Little
Token ID7703
Feature activation-3.17e-01

Pos logit contributions
 happens+0.026
 comes+0.023
 Everything+0.021
apy+0.021
 Now+0.021
Neg logit contributions
side-0.037
handed-0.034
 distance-0.034
 unatt-0.034
ge-0.032
Token Annie
Token ID25860
Feature activation-1.46e+00

Pos logit contributions
 happens+0.026
apy+0.022
 comes+0.021
ours+0.021
icks+0.020
Neg logit contributions
side-0.041
 distance-0.038
handed-0.038
 unatt-0.037
thank-0.036
Token shouted
Token ID17293
Feature activation-3.58e-01

Pos logit contributions
 happens+0.142
 comes+0.126
apy+0.117
 Everything+0.116
 Now+0.112
Neg logit contributions
side-0.175
handed-0.160
 unatt-0.158
 distance-0.156
thank-0.149
Token from
Token ID422
Feature activation+4.49e-01

Pos logit contributions
 happens+0.039
 comes+0.035
apy+0.034
 Everything+0.033
 It+0.032
Neg logit contributions
side-0.039
handed-0.035
 distance-0.035
 unatt-0.034
thank-0.032
Token the
Token ID262
Feature activation-3.81e-01

Pos logit contributions
side+0.049
handed+0.044
 distance+0.043
 unatt+0.043
thank+0.040
Neg logit contributions
 happens-0.049
 comes-0.044
 Everything-0.041
apy-0.041
 Now-0.040
Token top
Token ID1353
Feature activation+2.36e-01

Pos logit contributions
 happens+0.039
 comes+0.035
 Everything+0.033
apy+0.032
 Now+0.031
Neg logit contributions
side-0.043
handed-0.039
 unatt-0.038
ge-0.038
thank-0.037
Token of
Token ID286
Feature activation-6.79e-01

Pos logit contributions
side+0.040
handed+0.040
 unatt+0.039
thank+0.038
anut+0.038
Neg logit contributions
 happens-0.009
 comes-0.007
apy-0.005
 Everything-0.005
oke-0.005
Token and
Token ID290
Feature activation+1.23e-01

Pos logit contributions
side+0.046
handed+0.045
anut+0.044
 distance+0.043
ge+0.043
Neg logit contributions
 happens-0.034
 comes-0.031
 Now-0.029
 It-0.029
 can-0.027
Token hold
Token ID1745
Feature activation-3.25e-01

Pos logit contributions
side+0.011
 distance+0.010
 unatt+0.010
handed+0.010
thank+0.009
Neg logit contributions
 happens-0.015
 comes-0.013
oke-0.013
 Everything-0.013
apy-0.013
Token each
Token ID1123
Feature activation-1.35e-01

Pos logit contributions
 happens+0.038
oke+0.033
 comes+0.033
 Everything+0.032
apy+0.031
Neg logit contributions
side-0.032
 distance-0.029
handed-0.028
 unatt-0.028
ge-0.025
Token other
Token ID584
Feature activation+1.21e+00

Pos logit contributions
 happens+0.014
 comes+0.013
 Everything+0.012
 Now+0.012
 turns+0.011
Neg logit contributions
side-0.016
handed-0.014
 unatt-0.014
ge-0.013
thank-0.013
Token's
Token ID338
Feature activation-7.86e-02

Pos logit contributions
thank+0.137
handed+0.133
side+0.128
 unatt+0.123
 distance+0.121
Neg logit contributions
 happens-0.112
 can-0.096
 comes-0.095
 turns-0.090
 It-0.089
Token hands
Token ID2832
Feature activation-1.18e+00

Pos logit contributions
 happens+0.007
 comes+0.007
 Everything+0.006
apy+0.006
 Now+0.006
Neg logit contributions
side-0.010
handed-0.009
 unatt-0.009
 distance-0.009
thank-0.008
Token.
Token ID13
Feature activation-1.09e-01

Pos logit contributions
 happens+0.114
 can+0.100
 comes+0.099
apy+0.097
 It+0.094
Neg logit contributions
handed-0.125
side-0.123
 distance-0.120
thank-0.116
 unatt-0.115
Token
Token ID198
Feature activation-6.60e-02

Pos logit contributions
 happens+0.011
 comes+0.010
apy+0.009
 Everything+0.009
oke+0.009
Neg logit contributions
side-0.013
handed-0.011
 distance-0.011
 unatt-0.011
thank-0.011
Token
Token ID198
Feature activation-7.48e-02

Pos logit contributions
 happens+0.005
 comes+0.005
 Everything+0.004
 Now+0.004
 It+0.004
Neg logit contributions
side-0.009
 unatt-0.008
handed-0.008
 distance-0.008
thank-0.008
TokenAfter
Token ID3260
Feature activation+4.31e-02

Pos logit contributions
 happens+0.008
 Now+0.008
 Everything+0.007
apy+0.007
 Everyone+0.007
Neg logit contributions
 unatt-0.007
handed-0.007
side-0.007
 distance-0.007
keeping-0.007
Token the
Token ID262
Feature activation-3.19e-01

Pos logit contributions
side+0.005
 distance+0.004
 unatt+0.004
handed+0.004
thank+0.004
Neg logit contributions
 happens-0.005
 comes-0.004
oke-0.004
ours-0.004
 Everything-0.004
Token,
Token ID11
Feature activation-1.17e-01

Pos logit contributions
side+0.100
handed+0.091
 unatt+0.090
 distance+0.090
thank+0.085
Neg logit contributions
 happens-0.080
 comes-0.068
 Everything-0.065
apy-0.065
 Now-0.063
Token "
Token ID366
Feature activation+2.71e-01

Pos logit contributions
 happens+0.009
 comes+0.007
apy+0.006
 Everything+0.006
ours+0.006
Neg logit contributions
side-0.017
 distance-0.015
 unatt-0.015
handed-0.015
thank-0.015
TokenSam
Token ID16305
Feature activation+1.36e-01

Pos logit contributions
side+0.029
 unatt+0.026
handed+0.026
 distance+0.026
ge+0.024
Neg logit contributions
 happens-0.029
 comes-0.026
 Everything-0.025
 Now-0.025
apy-0.025
Token,
Token ID11
Feature activation-1.35e-01

Pos logit contributions
side+0.015
handed+0.014
 distance+0.014
 unatt+0.013
thank+0.013
Neg logit contributions
 happens-0.013
apy-0.012
 Everything-0.011
 importantly-0.011
 People-0.011
Token you
Token ID345
Feature activation-4.64e-02

Pos logit contributions
 happens+0.012
apy+0.010
 Now+0.010
ours+0.010
 Everything+0.010
Neg logit contributions
 unatt-0.015
side-0.014
 distance-0.014
handed-0.014
 bank-0.013
Token should
Token ID815
Feature activation-1.06e+00

Pos logit contributions
apy+0.004
ours+0.004
 happens+0.004
 Now+0.004
 importantly+0.004
Neg logit contributions
 distance-0.004
 bank-0.004
handed-0.004
 unatt-0.004
side-0.004
Token not
Token ID407
Feature activation+5.69e-01

Pos logit contributions
 happens+0.125
 comes+0.110
apy+0.107
oke+0.107
 Everything+0.106
Neg logit contributions
side-0.100
handed-0.092
 distance-0.090
 unatt-0.088
 bank-0.082
Token chew
Token ID34722
Feature activation+5.21e-01

Pos logit contributions
side+0.060
handed+0.055
 distance+0.055
 unatt+0.054
thank+0.050
Neg logit contributions
 happens-0.063
 comes-0.055
 Everything-0.053
apy-0.053
 Now-0.052
Token gum
Token ID27142
Feature activation+1.15e+00

Pos logit contributions
side+0.065
 distance+0.060
handed+0.060
 unatt+0.058
thank+0.055
Neg logit contributions
 happens-0.049
 comes-0.042
apy-0.039
 Everything-0.039
 Now-0.038
Token when
Token ID618
Feature activation+5.27e-02

Pos logit contributions
 distance+0.139
side+0.133
handed+0.128
 spine+0.122
 former+0.119
Neg logit contributions
 happens-0.117
 comes-0.094
 can-0.091
oke-0.083
 Go-0.083
Token you
Token ID345
Feature activation+2.15e-01

Pos logit contributions
side+0.006
 unatt+0.006
handed+0.006
 distance+0.006
 bank+0.006
Neg logit contributions
 happens-0.004
oke-0.004
 Everything-0.004
oms-0.004
 comes-0.003
Token a
Token ID257
Feature activation+6.69e-02

Pos logit contributions
side+0.034
 unatt+0.031
anut+0.031
handed+0.030
ge+0.030
Neg logit contributions
 happens-0.018
 comes-0.017
 It-0.016
 Everything-0.016
 turns-0.015
Token chair
Token ID5118
Feature activation-8.81e-02

Pos logit contributions
side+0.007
handed+0.006
 distance+0.006
 unatt+0.006
thank+0.005
Neg logit contributions
 happens-0.008
 comes-0.007
 Everything-0.007
apy-0.007
 Now-0.006
Token.
Token ID13
Feature activation+3.63e-02

Pos logit contributions
 comes+0.008
 happens+0.008
 There+0.007
 It+0.007
 can+0.006
Neg logit contributions
side-0.009
handed-0.009
anut-0.009
 unatt-0.009
 distance-0.009
Token He
Token ID679
Feature activation-1.59e-01

Pos logit contributions
side+0.004
 distance+0.003
handed+0.003
 bank+0.003
 sill+0.003
Neg logit contributions
 happens-0.004
apy-0.004
 comes-0.004
ours-0.003
icks-0.003
Token was
Token ID373
Feature activation-2.06e-01

Pos logit contributions
 happens+0.018
 comes+0.015
 Everything+0.015
apy+0.015
ours+0.015
Neg logit contributions
side-0.016
 distance-0.015
handed-0.014
 unatt-0.014
thank-0.014
Token smiling
Token ID16755
Feature activation-1.19e+00

Pos logit contributions
 happens+0.021
 comes+0.020
 Everything+0.019
apy+0.019
 It+0.018
Neg logit contributions
side-0.022
handed-0.020
 unatt-0.020
 distance-0.019
ge-0.019
Token at
Token ID379
Feature activation-2.38e-01

Pos logit contributions
 happens+0.123
 comes+0.110
 Everything+0.104
apy+0.102
 Now+0.101
Neg logit contributions
side-0.133
handed-0.122
 distance-0.121
 unatt-0.119
ge-0.112
Token her
Token ID607
Feature activation+7.26e-02

Pos logit contributions
 happens+0.025
 comes+0.023
 Everything+0.022
 It+0.021
 Now+0.021
Neg logit contributions
side-0.027
handed-0.024
 unatt-0.023
 distance-0.023
thank-0.022
Token.
Token ID13
Feature activation-1.77e-02

Pos logit contributions
side+0.008
handed+0.007
 unatt+0.007
 distance+0.007
thank+0.007
Neg logit contributions
 happens-0.007
 comes-0.007
 Everything-0.006
apy-0.006
oke-0.006
Token
Token ID220
Feature activation+3.89e-01

Pos logit contributions
 happens+0.002
 comes+0.002
apy+0.002
ours+0.002
oms+0.002
Neg logit contributions
side-0.002
 distance-0.002
handed-0.001
 sill-0.001
 bank-0.001
Token
Token ID198
Feature activation+4.27e-02

Pos logit contributions
side+0.044
handed+0.040
 distance+0.039
 unatt+0.039
ge+0.037
Neg logit contributions
 happens-0.040
 comes-0.035
apy-0.033
 Everything-0.032
 Now-0.031
Token a
Token ID257
Feature activation-3.30e-02

Pos logit contributions
 happens+0.014
 comes+0.013
 Everything+0.012
apy+0.012
 Now+0.012
Neg logit contributions
side-0.017
handed-0.016
 distance-0.015
 unatt-0.015
ge-0.015
Token three
Token ID1115
Feature activation+2.44e-01

Pos logit contributions
 happens+0.003
 comes+0.003
 Everything+0.003
apy+0.003
 Now+0.003
Neg logit contributions
side-0.004
handed-0.003
 unatt-0.003
 distance-0.003
ge-0.003
Token-
Token ID12
Feature activation+4.09e-01

Pos logit contributions
side+0.020
 distance+0.019
handed+0.018
 unatt+0.018
ge+0.016
Neg logit contributions
 happens-0.032
 comes-0.028
 Everything-0.028
apy-0.027
 Now-0.027
Tokenyear
Token ID1941
Feature activation-1.45e-01

Pos logit contributions
side+0.062
 distance+0.059
 unatt+0.058
 spine+0.056
handed+0.056
Neg logit contributions
 happens-0.023
 comes-0.018
apy-0.018
 Everything-0.018
oke-0.018
Token-
Token ID12
Feature activation+4.01e-01

Pos logit contributions
 happens+0.014
 comes+0.012
 Everything+0.012
apy+0.012
 Now+0.011
Neg logit contributions
side-0.017
handed-0.016
 distance-0.016
 unatt-0.015
thank-0.015
Tokenold
Token ID727
Feature activation-1.28e+00

Pos logit contributions
side+0.070
 distance+0.067
 unatt+0.067
handed+0.064
 spine+0.064
Neg logit contributions
 happens-0.013
 comes-0.009
 Everything-0.008
apy-0.008
 Now-0.007
Token boy
Token ID2933
Feature activation-2.50e-03

Pos logit contributions
 happens+0.142
 comes+0.127
 Everything+0.121
apy+0.119
 Now+0.117
Neg logit contributions
side-0.135
handed-0.121
 distance-0.120
 unatt-0.119
thank-0.111
Token who
Token ID508
Feature activation-3.26e-01

Pos logit contributions
 happens+0.000
 comes+0.000
 Everything+0.000
apy+0.000
 Now+0.000
Neg logit contributions
side-0.000
handed-0.000
 distance-0.000
 unatt-0.000
thank-0.000
Token loved
Token ID6151
Feature activation+1.08e-01

Pos logit contributions
 happens+0.037
ours+0.032
apy+0.032
oke+0.031
oms+0.031
Neg logit contributions
side-0.032
handed-0.030
 distance-0.029
thank-0.028
 unatt-0.027
Token exploring
Token ID13504
Feature activation-1.39e+00

Pos logit contributions
side+0.011
handed+0.010
 distance+0.010
 unatt+0.010
thank+0.009
Neg logit contributions
 happens-0.012
 comes-0.011
 Everything-0.010
apy-0.010
 Now-0.010
Token the
Token ID262
Feature activation-2.39e-01

Pos logit contributions
 happens+0.152
 comes+0.135
 Everything+0.128
apy+0.127
 Now+0.124
Neg logit contributions
side-0.150
handed-0.136
 distance-0.135
 unatt-0.134
thank-0.125