TOP ACTIVATIONS
MAX = 0.081

small animal she had never seen before . It was white
ostr ich is flexible too ," she said .
it 's a special rock ," Martha said .
Mia . I have not seen him ." They both decided
This line is too slow ," Tom said .
. " I have never seen a dead foot before ."
the fish are not hungry ," Sara says .
I don 't like that ," Sam says . He pushes
you are not my friend ," Sam said . " You
saw things he had never seen before . He was amazed
car that Lily had never seen before . Lily loved the
knowing what day it is ," Lily explained . " See
Hi , what is your name ?" Tom asked the new
" I have a plan ," Sam says . " You
, the gum is gone ," Tim says .
Hello , what is your name ?" Lily asks the new
flower isn 't growing anymore ," Tim my said .
are looking for penn ies ," Tom said . " We
But I am nos y ," says Lily . " I
" I want the jam ," Tom said .

MIN ACTIVATIONS
MIN = -0.112

Everyday he would take the av oc ados and scatter them
That was the setup that fore shadow ed the bad ending
room while she ran an err and . The girl was
as he opened the ch ocol ates and started eating them
the tree to its once former beauty . He was happy
Grand ma 's words fore shadow ed a happy future
It turned out that the av oc ados were bad and
. He gave the friendly nurses a wave goodbye . They
As the farmer scattered the av oc ados , he noticed
was a box of ch ocol ates . Lily loved ch
took Sara the 3 - year - old to the park
She told her three - year - old daughter to attach
took her little three - year - old daughter to a
dad took their 3 - year - old Jimmy to the
twenty - eight , twenty - nine , thirty , thirty
the sky before they walked fore shadow ed the down pour
, but it was all fore shadow ed by the kind
back the village to its former state . With the help
and the frog suggested catching tad p oles in a nearby
something new - there were tad p oles swimming around in

INTERVAL 0.033 - 0.081
CONTAINS 2.706%

and broke . This made the little girl very sad because
Can you please forgive us ?" they say together .
bird flying high in the sky . She wanted to fly
a little bird in the water . It was a sour
some food to feed my tum my ."

INTERVAL -0.015 - 0.033
CONTAINS 79.918%

They have an idea . " Okay , we
girl so happy ! She was so excited
," Mia says . " I like the rain . It
fun . <|endoftext|> S ara and Tom were playing at the
hears a strange noise outside . He is scared . He

INTERVAL -0.063 - -0.015
CONTAINS 17.343%

bird . The bird did not like the cat 's wink
coming . So the brave boy and his friends
. Tom hugged Lily and said , " I
excited ly . The friends hopped on board the
, " S ue , we can use this salt to

INTERVAL -0.112 - -0.063
CONTAINS 0.034%

her best friend . The moral of the story is that
return . The next morning , the pink elephant
They gave her a band - aid and helped her stand
course !" Together , they worked on the
gift . The next day , the children of
Token small
Token ID1402
Feature activation+8.93e-04

Pos logit contributions
opus+0.039
ement+0.034
uffy+0.033
imi+0.033
 replied+0.032
Neg logit contributions
 err-0.038
 across-0.037
 even-0.036
wood-0.035
 quite-0.033
Token animal
Token ID5044
Feature activation+2.55e-02

Pos logit contributions
opus+0.001
uffy+0.001
uddle+0.001
 replied+0.001
imi+0.001
Neg logit contributions
 err-0.002
wood-0.002
keeping-0.002
 across-0.002
beh-0.002
Token she
Token ID673
Feature activation+2.22e-02

Pos logit contributions
opus+0.051
uffy+0.045
imi+0.044
uddle+0.042
 replied+0.042
Neg logit contributions
 err-0.035
 across-0.035
keeping-0.032
 even-0.032
wood-0.032
Token had
Token ID550
Feature activation-1.85e-03

Pos logit contributions
opus+0.040
uffy+0.033
uddle+0.033
imi+0.032
ama+0.031
Neg logit contributions
 err-0.036
 across-0.035
wood-0.035
beh-0.035
keeping-0.035
Token never
Token ID1239
Feature activation-2.47e-02

Pos logit contributions
 err+0.002
wood+0.002
keeping+0.002
beh+0.002
 across+0.002
Neg logit contributions
opus-0.004
uffy-0.003
 replied-0.003
uddle-0.003
imi-0.003
Token seen
Token ID1775
Feature activation+8.09e-02

Pos logit contributions
wood+0.033
 err+0.032
beh+0.030
keeping+0.029
gent+0.028
Neg logit contributions
opus-0.049
 replied-0.044
uffy-0.040
uddle-0.040
 ow-0.040
Token before
Token ID878
Feature activation+3.44e-02

Pos logit contributions
opus+0.181
 replied+0.153
uffy+0.152
imi+0.150
uddle+0.150
Neg logit contributions
wood-0.097
 err-0.095
beh-0.093
keeping-0.090
 distances-0.081
Token.
Token ID13
Feature activation+7.65e-03

Pos logit contributions
opus+0.051
uffy+0.041
 replied+0.040
uddle+0.039
imi+0.039
Neg logit contributions
 err-0.067
wood-0.066
keeping-0.065
beh-0.065
 across-0.062
Token It
Token ID632
Feature activation+2.85e-02

Pos logit contributions
opus+0.013
uffy+0.010
 replied+0.010
uddle+0.010
imi+0.010
Neg logit contributions
 err-0.014
wood-0.014
keeping-0.013
beh-0.013
 across-0.013
Token was
Token ID373
Feature activation-2.30e-03

Pos logit contributions
opus+0.053
uffy+0.044
uddle+0.044
imi+0.042
ama+0.041
Neg logit contributions
 err-0.045
 across-0.045
wood-0.044
 even-0.043
beh-0.043
Token white
Token ID2330
Feature activation+8.30e-03

Pos logit contributions
keeping+0.004
beh+0.004
 err+0.004
wood+0.003
 distances+0.003
Neg logit contributions
opus-0.004
 replied-0.004
imi-0.004
uffy-0.004
ilo-0.003
Token ostr
Token ID42387
Feature activation+1.53e-02

Pos logit contributions
 err+0.014
beh+0.014
keeping+0.013
wood+0.013
 distances+0.013
Neg logit contributions
opus-0.014
 replied-0.012
uffy-0.012
imi-0.011
uddle-0.011
Tokenich
Token ID488
Feature activation+1.80e-02

Pos logit contributions
opus+0.020
 replied+0.017
uddle+0.016
pron+0.016
imi+0.015
Neg logit contributions
wood-0.031
beh-0.030
keeping-0.030
 err-0.029
 across-0.029
Token is
Token ID318
Feature activation+6.21e-03

Pos logit contributions
opus+0.027
uffy+0.021
imi+0.021
uddle+0.021
 replied+0.020
Neg logit contributions
 err-0.035
wood-0.034
beh-0.034
keeping-0.033
 across-0.033
Token flexible
Token ID12846
Feature activation+1.26e-03

Pos logit contributions
opus+0.010
 replied+0.009
uffy+0.009
imi+0.008
uddle+0.008
Neg logit contributions
 err-0.011
keeping-0.011
wood-0.010
beh-0.010
 distances-0.010
Token too
Token ID1165
Feature activation+9.58e-03

Pos logit contributions
opus+0.002
 replied+0.001
uffy+0.001
uddle+0.001
imi+0.001
Neg logit contributions
beh-0.003
 err-0.003
keeping-0.003
wood-0.002
gent-0.002
Token,"
Token ID553
Feature activation+8.02e-02

Pos logit contributions
opus+0.017
uddle+0.014
uffy+0.014
imi+0.014
 replied+0.013
Neg logit contributions
 err-0.015
 across-0.015
wood-0.015
 even-0.015
 distances-0.014
Token she
Token ID673
Feature activation+1.71e-02

Pos logit contributions
opus+0.175
uddle+0.145
uffy+0.134
ement+0.134
actus+0.126
Neg logit contributions
 even-0.120
 err-0.111
 themselves-0.108
 across-0.108
 some-0.099
Token said
Token ID531
Feature activation+2.52e-02

Pos logit contributions
opus+0.024
uffy+0.018
uddle+0.018
ama+0.017
ement+0.017
Neg logit contributions
 err-0.034
wood-0.034
 even-0.034
beh-0.034
 across-0.033
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
opus+0.047
uffy+0.038
uddle+0.038
imi+0.037
 replied+0.035
Neg logit contributions
 err-0.041
wood-0.041
 across-0.040
keeping-0.040
beh-0.039
Token
Token ID198
Feature activation+2.00e-02

Pos logit contributions
opus+0.031
uddle+0.026
uffy+0.025
imi+0.025
inkle+0.024
Neg logit contributions
 across-0.024
wood-0.023
 err-0.023
keeping-0.022
 distances-0.022
Token
Token ID198
Feature activation+2.20e-02

Pos logit contributions
opus+0.045
ement+0.036
uddle+0.036
uffy+0.036
pron+0.035
Neg logit contributions
 err-0.024
 across-0.024
keeping-0.024
 themselves-0.023
and-0.023
Token it
Token ID340
Feature activation+3.81e-03

Pos logit contributions
opus+0.034
uffy+0.027
uddle+0.026
 replied+0.026
imi+0.026
Neg logit contributions
 err-0.041
wood-0.040
keeping-0.039
beh-0.039
 across-0.038
Token's
Token ID338
Feature activation-6.45e-04

Pos logit contributions
opus+0.006
uffy+0.005
uddle+0.005
imi+0.005
 replied+0.005
Neg logit contributions
 err-0.007
wood-0.007
beh-0.007
keeping-0.007
 across-0.007
Token a
Token ID257
Feature activation+1.53e-02

Pos logit contributions
keeping+0.001
beh+0.001
 err+0.001
and+0.001
gent+0.001
Neg logit contributions
opus-0.001
 replied-0.001
imi-0.001
uffy-0.001
 ow-0.001
Token special
Token ID2041
Feature activation+3.07e-03

Pos logit contributions
opus+0.026
uffy+0.021
uddle+0.020
 replied+0.020
imi+0.020
Neg logit contributions
 err-0.027
keeping-0.027
beh-0.026
wood-0.026
 distances-0.025
Token rock
Token ID3881
Feature activation+1.96e-02

Pos logit contributions
opus+0.005
uffy+0.004
 replied+0.004
uddle+0.004
imi+0.004
Neg logit contributions
 err-0.006
beh-0.006
keeping-0.006
wood-0.005
 distances-0.005
Token,"
Token ID553
Feature activation+7.92e-02

Pos logit contributions
opus+0.026
 replied+0.020
uffy+0.020
uddle+0.019
imi+0.018
Neg logit contributions
 err-0.042
keeping-0.041
beh-0.041
wood-0.040
 distances-0.039
Token Martha
Token ID27243
Feature activation+2.98e-02

Pos logit contributions
opus+0.105
uddle+0.085
uffy+0.080
imi+0.070
pron+0.067
Neg logit contributions
wood-0.167
 err-0.166
keeping-0.161
beh-0.160
 even-0.159
Token said
Token ID531
Feature activation+7.87e-03

Pos logit contributions
opus+0.032
uffy+0.023
uddle+0.022
imi+0.021
 replied+0.019
Neg logit contributions
wood-0.070
 err-0.070
keeping-0.069
beh-0.069
 across-0.067
Token.
Token ID13
Feature activation+8.36e-03

Pos logit contributions
opus+0.012
 replied+0.010
uffy+0.010
uddle+0.009
imi+0.009
Neg logit contributions
 err-0.015
wood-0.014
keeping-0.014
beh-0.014
 distances-0.014
Token
Token ID198
Feature activation+8.80e-03

Pos logit contributions
opus+0.013
uffy+0.011
 replied+0.011
uddle+0.010
imi+0.010
Neg logit contributions
 err-0.016
wood-0.015
keeping-0.015
beh-0.015
 across-0.014
Token
Token ID198
Feature activation+9.78e-03

Pos logit contributions
opus+0.017
uddle+0.014
uffy+0.014
imi+0.013
 replied+0.013
Neg logit contributions
 err-0.013
wood-0.013
keeping-0.013
 across-0.012
beh-0.012
Token Mia
Token ID32189
Feature activation+1.60e-02

Pos logit contributions
opus+0.041
uffy+0.033
uddle+0.033
ement+0.031
inkle+0.031
Neg logit contributions
 err-0.053
 across-0.049
 even-0.049
wood-0.049
keeping-0.049
Token.
Token ID13
Feature activation-5.28e-03

Pos logit contributions
opus+0.036
pron+0.031
rescent+0.031
ement+0.030
uddle+0.029
Neg logit contributions
wood-0.018
beh-0.018
 across-0.018
 even-0.017
ried-0.017
Token I
Token ID314
Feature activation+1.83e-03

Pos logit contributions
wood+0.009
keeping+0.008
 err+0.008
beh+0.008
 across+0.008
Neg logit contributions
opus-0.009
inkle-0.008
pron-0.008
ement-0.008
uddle-0.008
Token have
Token ID423
Feature activation-1.25e-02

Pos logit contributions
opus+0.004
uffy+0.003
 replied+0.003
uddle+0.003
imi+0.003
Neg logit contributions
 err-0.003
wood-0.002
keeping-0.002
beh-0.002
 across-0.002
Token not
Token ID407
Feature activation-2.66e-04

Pos logit contributions
 err+0.017
wood+0.017
beh+0.016
keeping+0.016
 distances+0.015
Neg logit contributions
opus-0.025
uffy-0.022
 replied-0.021
uddle-0.021
imi-0.021
Token seen
Token ID1775
Feature activation+7.91e-02

Pos logit contributions
 err+0.000
wood+0.000
 distances+0.000
keeping+0.000
beh+0.000
Neg logit contributions
opus-0.000
 replied-0.000
uffy-0.000
imi-0.000
 dear-0.000
Token him
Token ID683
Feature activation+3.25e-02

Pos logit contributions
opus+0.135
uffy+0.113
uddle+0.113
 replied+0.108
imi+0.104
Neg logit contributions
 err-0.137
wood-0.133
keeping-0.132
 across-0.130
beh-0.128
Token."
Token ID526
Feature activation+2.93e-02

Pos logit contributions
opus+0.061
uffy+0.050
 replied+0.050
uddle+0.049
imi+0.048
Neg logit contributions
 err-0.052
 across-0.050
wood-0.050
keeping-0.049
beh-0.049
Token They
Token ID1119
Feature activation-1.05e-02

Pos logit contributions
opus+0.058
inkle+0.043
pron+0.043
uffy+0.042
ement+0.042
Neg logit contributions
wood-0.047
 err-0.046
 across-0.046
keeping-0.045
and-0.044
Token both
Token ID1111
Feature activation-6.46e-03

Pos logit contributions
 err+0.016
wood+0.015
 across+0.015
beh+0.014
keeping+0.014
Neg logit contributions
opus-0.022
uffy-0.018
uddle-0.017
imi-0.017
ilo-0.017
Token decided
Token ID3066
Feature activation-2.63e-02

Pos logit contributions
 err+0.011
wood+0.010
keeping+0.010
beh+0.010
 across+0.010
Neg logit contributions
opus-0.012
uffy-0.010
uddle-0.009
imi-0.009
 replied-0.009
Token This
Token ID770
Feature activation+2.26e-02

Pos logit contributions
opus+0.018
uffy+0.014
uddle+0.014
 replied+0.014
imi+0.013
Neg logit contributions
 err-0.022
wood-0.021
keeping-0.021
beh-0.021
 across-0.020
Token line
Token ID1627
Feature activation-5.85e-03

Pos logit contributions
opus+0.031
uffy+0.024
imi+0.024
 replied+0.023
uddle+0.023
Neg logit contributions
 err-0.046
wood-0.045
 across-0.044
keeping-0.043
 even-0.042
Token is
Token ID318
Feature activation+1.99e-02

Pos logit contributions
 err+0.014
wood+0.013
keeping+0.013
beh+0.013
 across+0.013
Neg logit contributions
opus-0.006
uffy-0.005
 replied-0.005
uddle-0.005
imi-0.004
Token too
Token ID1165
Feature activation+1.65e-02

Pos logit contributions
opus+0.028
 replied+0.025
uffy+0.024
imi+0.024
pron+0.023
Neg logit contributions
keeping-0.039
wood-0.037
 err-0.037
beh-0.036
and-0.036
Token slow
Token ID3105
Feature activation-1.03e-02

Pos logit contributions
opus+0.034
 replied+0.028
ement+0.027
uffy+0.027
uddle+0.027
Neg logit contributions
 err-0.025
 across-0.023
wood-0.022
beh-0.021
 even-0.021
Token,"
Token ID553
Feature activation+7.91e-02

Pos logit contributions
beh+0.022
keeping+0.022
 err+0.022
wood+0.021
uries+0.020
Neg logit contributions
opus-0.012
 replied-0.010
uffy-0.010
uddle-0.009
imi-0.009
Token Tom
Token ID4186
Feature activation+2.20e-02

Pos logit contributions
opus+0.155
uffy+0.131
uddle+0.127
inkle+0.117
imi+0.117
Neg logit contributions
 err-0.126
 even-0.124
 across-0.115
 themselves-0.115
wood-0.109
Token said
Token ID531
Feature activation+1.41e-02

Pos logit contributions
opus+0.031
uffy+0.024
uddle+0.024
imi+0.022
ama+0.021
Neg logit contributions
 even-0.045
wood-0.044
 err-0.044
beh-0.044
 across-0.043
Token.
Token ID13
Feature activation+4.61e-03

Pos logit contributions
opus+0.024
uffy+0.019
imi+0.019
uddle+0.019
 replied+0.018
Neg logit contributions
 err-0.025
wood-0.024
keeping-0.024
beh-0.024
 across-0.024
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
opus+0.009
uffy+0.008
uddle+0.008
imi+0.008
inkle+0.008
Neg logit contributions
 across-0.006
 err-0.006
wood-0.006
keeping-0.006
 even-0.006
Token
Token ID198
Feature activation+1.68e-02

Pos logit contributions
opus+0.027
uffy+0.022
ement+0.022
uddle+0.022
pron+0.021
Neg logit contributions
 across-0.014
 err-0.014
and-0.013
wood-0.013
keeping-0.013
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
opus+0.038
uddle+0.031
uffy+0.031
imi+0.029
ement+0.029
Neg logit contributions
wood-0.031
beh-0.031
 across-0.030
 err-0.030
 even-0.029
Token "
Token ID366
Feature activation-1.55e-02

Pos logit contributions
opus+0.027
uffy+0.022
uddle+0.022
imi+0.022
inkle+0.021
Neg logit contributions
wood-0.022
 across-0.022
 err-0.021
keeping-0.021
 distances-0.020
TokenI
Token ID40
Feature activation-1.13e-02

Pos logit contributions
 err+0.034
wood+0.032
keeping+0.029
beh+0.029
 throughout+0.028
Neg logit contributions
opus-0.023
ara-0.016
on-0.015
pron-0.015
 replied-0.015
Token have
Token ID423
Feature activation-2.34e-02

Pos logit contributions
 err+0.018
wood+0.017
keeping+0.017
beh+0.017
 across+0.016
Neg logit contributions
opus-0.021
uffy-0.018
 replied-0.017
uddle-0.017
imi-0.017
Token never
Token ID1239
Feature activation-2.64e-02

Pos logit contributions
wood+0.036
 err+0.035
keeping+0.034
beh+0.034
ried+0.031
Neg logit contributions
opus-0.044
 replied-0.037
uffy-0.037
 ow-0.036
imi-0.035
Token seen
Token ID1775
Feature activation+7.88e-02

Pos logit contributions
wood+0.041
 err+0.037
 themselves+0.036
keeping+0.036
gent+0.034
Neg logit contributions
opus-0.046
 replied-0.044
 ow-0.040
uffy-0.037
?-0.037
Token a
Token ID257
Feature activation+3.58e-02

Pos logit contributions
opus+0.165
uffy+0.143
uddle+0.139
 replied+0.138
imi+0.136
Neg logit contributions
 err-0.106
wood-0.102
keeping-0.101
beh-0.099
 across-0.098
Token dead
Token ID2636
Feature activation+3.34e-02

Pos logit contributions
opus+0.060
ement+0.053
 replied+0.053
uffy+0.051
imi+0.050
Neg logit contributions
 across-0.062
 err-0.059
 even-0.057
wood-0.055
 quite-0.050
Token foot
Token ID2366
Feature activation+4.02e-02

Pos logit contributions
opus+0.051
uffy+0.042
 replied+0.040
uddle+0.040
imi+0.040
Neg logit contributions
 err-0.063
wood-0.061
keeping-0.061
 across-0.061
beh-0.059
Token before
Token ID878
Feature activation+3.15e-02

Pos logit contributions
opus+0.070
uffy+0.058
 replied+0.056
uddle+0.056
imi+0.055
Neg logit contributions
 err-0.068
wood-0.067
keeping-0.066
beh-0.065
 across-0.063
Token."
Token ID526
Feature activation+4.26e-02

Pos logit contributions
opus+0.049
uffy+0.040
uddle+0.038
 replied+0.038
imi+0.038
Neg logit contributions
 err-0.060
wood-0.058
keeping-0.058
beh-0.057
 across-0.056
Token the
Token ID262
Feature activation-1.99e-03

Pos logit contributions
 err+0.001
keeping+0.001
wood+0.001
beh+0.001
 across+0.001
Neg logit contributions
opus-0.001
 replied-0.000
uffy-0.000
uddle-0.000
imi-0.000
Token fish
Token ID5916
Feature activation+2.33e-02

Pos logit contributions
 err+0.003
wood+0.003
keeping+0.003
 across+0.003
beh+0.003
Neg logit contributions
opus-0.004
uffy-0.003
 replied-0.003
imi-0.003
uddle-0.003
Token are
Token ID389
Feature activation+1.87e-02

Pos logit contributions
opus+0.031
uffy+0.024
 replied+0.023
uddle+0.023
imi+0.022
Neg logit contributions
 err-0.050
keeping-0.049
wood-0.049
beh-0.048
 across-0.047
Token not
Token ID407
Feature activation+1.06e-04

Pos logit contributions
opus+0.030
uffy+0.024
 replied+0.024
uddle+0.023
imi+0.023
Neg logit contributions
 err-0.035
keeping-0.034
wood-0.034
beh-0.033
 distances-0.032
Token hungry
Token ID14720
Feature activation+5.19e-02

Pos logit contributions
opus+0.000
 replied+0.000
uffy+0.000
uddle+0.000
imi+0.000
Neg logit contributions
 err-0.000
keeping-0.000
wood-0.000
beh-0.000
 distances-0.000
Token,"
Token ID553
Feature activation+7.87e-02

Pos logit contributions
opus+0.056
uddle+0.041
 replied+0.040
uffy+0.040
imi+0.035
Neg logit contributions
beh-0.122
 err-0.119
wood-0.118
keeping-0.117
gent-0.112
Token Sara
Token ID24799
Feature activation+2.53e-02

Pos logit contributions
opus+0.132
uffy+0.105
uddle+0.103
imi+0.101
 replied+0.096
Neg logit contributions
 err-0.144
wood-0.138
keeping-0.137
beh-0.136
 across-0.133
Token says
Token ID1139
Feature activation+2.75e-02

Pos logit contributions
opus+0.022
uffy+0.014
uddle+0.013
imi+0.012
pron+0.012
Neg logit contributions
 err-0.065
wood-0.065
beh-0.063
 across-0.063
keeping-0.063
Token.
Token ID13
Feature activation+8.11e-03

Pos logit contributions
opus+0.046
uffy+0.037
uddle+0.036
imi+0.036
 replied+0.035
Neg logit contributions
 err-0.050
wood-0.049
keeping-0.048
beh-0.047
 across-0.047
Token
Token ID198
Feature activation+1.34e-02

Pos logit contributions
opus+0.013
uffy+0.011
 replied+0.011
uddle+0.011
imi+0.011
Neg logit contributions
 err-0.015
wood-0.014
keeping-0.014
beh-0.014
 across-0.013
Token
Token ID198
Feature activation+1.46e-02

Pos logit contributions
opus+0.028
uffy+0.023
uddle+0.023
pron+0.023
ement+0.023
Neg logit contributions
 err-0.018
 across-0.018
wood-0.017
keeping-0.017
 themselves-0.016
Token I
Token ID314
Feature activation-4.08e-03

Pos logit contributions
opus+0.050
uffy+0.043
uddle+0.041
ement+0.039
imi+0.039
Neg logit contributions
 err-0.054
wood-0.051
 even-0.051
beh-0.051
 across-0.051
Token don
Token ID836
Feature activation+1.45e-02

Pos logit contributions
 err+0.005
wood+0.005
beh+0.005
keeping+0.005
 across+0.005
Neg logit contributions
opus-0.009
uffy-0.007
imi-0.007
 replied-0.007
uddle-0.007
Token't
Token ID470
Feature activation-1.72e-03

Pos logit contributions
opus+0.022
uffy+0.018
uddle+0.017
 replied+0.017
imi+0.017
Neg logit contributions
 err-0.028
beh-0.026
wood-0.026
 across-0.026
keeping-0.026
Token like
Token ID588
Feature activation+4.00e-02

Pos logit contributions
 err+0.002
wood+0.002
keeping+0.002
beh+0.002
 across+0.002
Neg logit contributions
opus-0.004
uffy-0.003
 replied-0.003
uddle-0.003
imi-0.003
Token that
Token ID326
Feature activation+3.26e-02

Pos logit contributions
opus+0.078
uffy+0.070
uddle+0.065
 replied+0.064
ama+0.063
Neg logit contributions
 err-0.060
 across-0.059
 distances-0.058
wood-0.056
 even-0.056
Token,"
Token ID553
Feature activation+7.87e-02

Pos logit contributions
opus+0.058
uffy+0.048
imi+0.046
uddle+0.045
ama+0.044
Neg logit contributions
 across-0.054
wood-0.054
 err-0.054
 distances-0.051
 even-0.051
Token Sam
Token ID3409
Feature activation+2.84e-02

Pos logit contributions
opus+0.100
uffy+0.075
uddle+0.072
 replied+0.070
imi+0.069
Neg logit contributions
 err-0.174
wood-0.171
keeping-0.169
beh-0.168
 across-0.164
Token says
Token ID1139
Feature activation+1.70e-02

Pos logit contributions
opus+0.028
uffy+0.018
uddle+0.017
imi+0.016
ama+0.016
Neg logit contributions
 err-0.072
wood-0.070
beh-0.069
 across-0.068
 distances-0.067
Token.
Token ID13
Feature activation+7.97e-03

Pos logit contributions
opus+0.027
uffy+0.022
uddle+0.021
 replied+0.021
imi+0.021
Neg logit contributions
 err-0.032
wood-0.031
keeping-0.031
beh-0.031
 across-0.030
Token He
Token ID679
Feature activation+1.69e-02

Pos logit contributions
opus+0.016
uffy+0.013
uddle+0.013
imi+0.012
pron+0.012
Neg logit contributions
wood-0.012
 err-0.012
 across-0.012
keeping-0.012
beh-0.011
Token pushes
Token ID20070
Feature activation+1.30e-02

Pos logit contributions
opus+0.028
uffy+0.022
imi+0.022
ama+0.021
uddle+0.021
Neg logit contributions
 err-0.030
wood-0.028
beh-0.028
 across-0.028
 distances-0.027
Token you
Token ID345
Feature activation-1.48e-02

Pos logit contributions
opus+0.015
uddle+0.012
uffy+0.012
ement+0.012
 replied+0.012
Neg logit contributions
 err-0.012
 across-0.012
keeping-0.012
wood-0.012
 even-0.012
Token are
Token ID389
Feature activation+2.53e-02

Pos logit contributions
beh+0.019
 even+0.019
 err+0.019
 across+0.018
wood+0.018
Neg logit contributions
opus-0.032
ement-0.027
pron-0.027
imi-0.027
uffy-0.027
Token not
Token ID407
Feature activation+4.55e-04

Pos logit contributions
opus+0.046
uffy+0.039
uddle+0.038
 replied+0.037
imi+0.037
Neg logit contributions
 err-0.042
beh-0.040
wood-0.040
keeping-0.040
 across-0.039
Token my
Token ID616
Feature activation+2.16e-02

Pos logit contributions
opus+0.001
 replied+0.001
uffy+0.001
imi+0.001
 dear+0.001
Neg logit contributions
wood-0.001
 err-0.001
keeping-0.001
 distances-0.001
beh-0.001
Token friend
Token ID1545
Feature activation+5.44e-02

Pos logit contributions
opus+0.032
 replied+0.025
uffy+0.024
uddle+0.024
pron+0.023
Neg logit contributions
 err-0.042
wood-0.041
keeping-0.039
 even-0.039
beh-0.039
Token,"
Token ID553
Feature activation+7.84e-02

Pos logit contributions
opus+0.091
uffy+0.073
uddle+0.072
imi+0.071
ama+0.069
Neg logit contributions
 err-0.095
keeping-0.093
beh-0.092
wood-0.092
 across-0.091
Token Sam
Token ID3409
Feature activation+2.25e-02

Pos logit contributions
opus+0.125
uddle+0.102
uffy+0.100
pron+0.095
ement+0.092
Neg logit contributions
 err-0.145
wood-0.144
 even-0.140
 across-0.138
 themselves-0.136
Token said
Token ID531
Feature activation+2.26e-02

Pos logit contributions
opus+0.034
uffy+0.026
uddle+0.025
ama+0.024
imi+0.024
Neg logit contributions
 even-0.045
 err-0.044
 across-0.043
beh-0.043
wood-0.043
Token.
Token ID13
Feature activation+9.27e-03

Pos logit contributions
opus+0.039
uffy+0.031
imi+0.030
uddle+0.030
 replied+0.030
Neg logit contributions
 err-0.040
wood-0.039
keeping-0.039
beh-0.038
 across-0.038
Token "
Token ID366
Feature activation-1.22e-02

Pos logit contributions
opus+0.017
uffy+0.014
uddle+0.014
imi+0.014
 replied+0.013
Neg logit contributions
 across-0.015
wood-0.015
 err-0.015
keeping-0.014
 distances-0.014
TokenYou
Token ID1639
Feature activation-1.33e-02

Pos logit contributions
wood+0.023
 err+0.022
keeping+0.020
beh+0.020
 across+0.019
Neg logit contributions
opus-0.021
uffy-0.015
ara-0.015
 replied-0.015
imi-0.015
Token saw
Token ID2497
Feature activation+1.89e-02

Pos logit contributions
opus+0.026
 replied+0.021
uffy+0.020
uddle+0.019
imi+0.019
Neg logit contributions
 err-0.038
keeping-0.038
wood-0.037
beh-0.037
 distances-0.035
Token things
Token ID1243
Feature activation+2.35e-03

Pos logit contributions
opus+0.036
 replied+0.030
uffy+0.030
uddle+0.029
imi+0.029
Neg logit contributions
 err-0.029
beh-0.029
keeping-0.028
wood-0.028
 distances-0.026
Token he
Token ID339
Feature activation+1.65e-02

Pos logit contributions
opus+0.003
 replied+0.003
uffy+0.003
uddle+0.003
ement+0.003
Neg logit contributions
 err-0.005
keeping-0.005
beh-0.004
wood-0.004
gent-0.004
Token had
Token ID550
Feature activation+3.02e-03

Pos logit contributions
opus+0.024
 replied+0.019
uffy+0.019
uddle+0.018
imi+0.017
Neg logit contributions
 err-0.034
keeping-0.033
wood-0.032
beh-0.032
 distances-0.030
Token never
Token ID1239
Feature activation-2.09e-02

Pos logit contributions
opus+0.006
 replied+0.005
uffy+0.005
imi+0.005
uddle+0.005
Neg logit contributions
 err-0.004
wood-0.004
beh-0.004
keeping-0.004
 distances-0.004
Token seen
Token ID1775
Feature activation+7.84e-02

Pos logit contributions
 err+0.027
wood+0.025
beh+0.024
keeping+0.023
 distances+0.022
Neg logit contributions
opus-0.044
 replied-0.039
uffy-0.037
uddle-0.036
imi-0.036
Token before
Token ID878
Feature activation+3.06e-02

Pos logit contributions
opus+0.163
 replied+0.137
ement+0.137
uffy+0.131
imi+0.129
Neg logit contributions
wood-0.106
beh-0.105
 err-0.104
keeping-0.098
ried-0.091
Token.
Token ID13
Feature activation+3.83e-03

Pos logit contributions
opus+0.043
 replied+0.034
uffy+0.034
uddle+0.033
imi+0.033
Neg logit contributions
 err-0.061
beh-0.061
wood-0.060
keeping-0.059
 across-0.056
Token He
Token ID679
Feature activation+1.10e-02

Pos logit contributions
opus+0.006
uffy+0.005
 replied+0.005
uddle+0.005
imi+0.005
Neg logit contributions
 err-0.007
wood-0.007
keeping-0.007
beh-0.007
 across-0.006
Token was
Token ID373
Feature activation+8.29e-03

Pos logit contributions
opus+0.014
 replied+0.012
uddle+0.010
uffy+0.010
imi+0.010
Neg logit contributions
keeping-0.024
 err-0.024
beh-0.023
wood-0.023
 distances-0.022
Token amazed
Token ID24562
Feature activation+2.75e-02

Pos logit contributions
opus+0.013
 replied+0.011
uffy+0.010
 sincere+0.010
imi+0.010
Neg logit contributions
 err-0.015
keeping-0.015
wood-0.015
beh-0.015
 distances-0.014
Token car
Token ID1097
Feature activation+7.80e-03

Pos logit contributions
opus+0.035
uffy+0.030
uddle+0.029
imi+0.029
ement+0.028
Neg logit contributions
 err-0.033
 across-0.033
 even-0.032
wood-0.032
keeping-0.029
Token that
Token ID326
Feature activation+5.81e-02

Pos logit contributions
opus+0.014
uffy+0.011
uddle+0.011
imi+0.011
 replied+0.011
Neg logit contributions
 err-0.013
wood-0.013
keeping-0.013
 across-0.013
beh-0.012
Token Lily
Token ID20037
Feature activation+2.04e-02

Pos logit contributions
opus+0.098
uffy+0.077
uddle+0.076
imi+0.074
 replied+0.073
Neg logit contributions
 err-0.104
wood-0.101
keeping-0.100
beh-0.100
 across-0.099
Token had
Token ID550
Feature activation-6.13e-03

Pos logit contributions
opus+0.043
ement+0.036
imi+0.035
ilo+0.035
uddle+0.035
Neg logit contributions
 even-0.029
 across-0.027
wood-0.026
beh-0.026
keeping-0.026
Token never
Token ID1239
Feature activation-2.35e-02

Pos logit contributions
 err+0.008
wood+0.008
keeping+0.008
beh+0.008
 across+0.008
Neg logit contributions
opus-0.013
uffy-0.011
 replied-0.011
uddle-0.010
imi-0.010
Token seen
Token ID1775
Feature activation+7.83e-02

Pos logit contributions
wood+0.031
 err+0.030
keeping+0.028
beh+0.027
 distances+0.027
Neg logit contributions
opus-0.047
 replied-0.043
uffy-0.040
pron-0.040
uddle-0.039
Token before
Token ID878
Feature activation+2.94e-02

Pos logit contributions
opus+0.184
uffy+0.158
 replied+0.157
uddle+0.156
imi+0.155
Neg logit contributions
 err-0.085
wood-0.083
keeping-0.081
beh-0.080
 distances-0.071
Token.
Token ID13
Feature activation+2.87e-03

Pos logit contributions
opus+0.039
uffy+0.031
 replied+0.030
uddle+0.029
imi+0.029
Neg logit contributions
wood-0.060
 err-0.060
beh-0.060
keeping-0.059
ried-0.056
Token Lily
Token ID20037
Feature activation+5.10e-03

Pos logit contributions
opus+0.007
inkle+0.006
ister+0.006
uffy+0.005
ement+0.005
Neg logit contributions
 across-0.003
 err-0.003
and-0.003
keeping-0.003
 even-0.003
Token loved
Token ID6151
Feature activation-1.22e-03

Pos logit contributions
opus+0.010
uffy+0.008
imi+0.008
ilo+0.008
uddle+0.008
Neg logit contributions
 even-0.007
 err-0.007
wood-0.007
beh-0.007
 across-0.007
Token the
Token ID262
Feature activation+1.18e-02

Pos logit contributions
wood+0.002
 err+0.002
keeping+0.002
beh+0.002
 across+0.002
Neg logit contributions
opus-0.002
 replied-0.002
uffy-0.002
imi-0.002
uddle-0.002
Token knowing
Token ID6970
Feature activation+2.42e-02

Pos logit contributions
wood+0.010
beh+0.009
keeping+0.009
 err+0.009
 across+0.009
Neg logit contributions
opus-0.010
 replied-0.008
imi-0.008
uffy-0.008
uddle-0.008
Token what
Token ID644
Feature activation+3.01e-02

Pos logit contributions
opus+0.038
uffy+0.030
 replied+0.029
uddle+0.029
imi+0.029
Neg logit contributions
 err-0.046
wood-0.045
keeping-0.045
beh-0.044
 across-0.043
Token day
Token ID1110
Feature activation+3.50e-02

Pos logit contributions
opus+0.046
uffy+0.037
 replied+0.036
uddle+0.035
imi+0.035
Neg logit contributions
 err-0.058
wood-0.057
keeping-0.056
beh-0.056
 across-0.054
Token it
Token ID340
Feature activation+3.37e-02

Pos logit contributions
opus+0.051
uffy+0.040
uddle+0.039
 replied+0.038
imi+0.038
Neg logit contributions
 err-0.070
wood-0.069
keeping-0.069
beh-0.067
 across-0.066
Token is
Token ID318
Feature activation-5.98e-03

Pos logit contributions
opus+0.047
uffy+0.036
uddle+0.034
imi+0.034
ama+0.033
Neg logit contributions
wood-0.069
 err-0.068
beh-0.067
 across-0.066
keeping-0.066
Token,"
Token ID553
Feature activation+7.82e-02

Pos logit contributions
 err+0.012
beh+0.012
keeping+0.012
wood+0.012
 distances+0.011
Neg logit contributions
opus-0.008
 replied-0.007
uffy-0.006
uddle-0.006
imi-0.006
Token Lily
Token ID20037
Feature activation+1.95e-02

Pos logit contributions
opus+0.140
uddle+0.114
inkle+0.107
ement+0.106
uffy+0.105
Neg logit contributions
 err-0.137
 even-0.132
wood-0.127
 across-0.126
 themselves-0.123
Token explained
Token ID4893
Feature activation+3.81e-03

Pos logit contributions
opus+0.027
uffy+0.021
uddle+0.020
imi+0.019
inkle+0.018
Neg logit contributions
wood-0.040
 even-0.040
 err-0.039
 across-0.039
beh-0.039
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
opus+0.007
uffy+0.006
uddle+0.006
imi+0.005
pron+0.005
Neg logit contributions
 err-0.006
wood-0.006
keeping-0.006
 across-0.006
beh-0.006
Token "
Token ID366
Feature activation-1.94e-02

Pos logit contributions
opus+0.038
uddle+0.032
inkle+0.032
ement+0.031
uffy+0.030
Neg logit contributions
 across-0.022
wood-0.021
 err-0.020
keeping-0.019
 along-0.019
TokenSee
Token ID6214
Feature activation+1.26e-02

Pos logit contributions
 err+0.044
wood+0.041
beh+0.040
keeping+0.039
ried+0.038
Neg logit contributions
opus-0.026
on-0.017
ara-0.017
pron-0.017
rescent-0.016
TokenHi
Token ID17250
Feature activation+3.51e-02

Pos logit contributions
 err+0.076
keeping+0.072
beh+0.071
wood+0.070
 distances+0.067
Neg logit contributions
opus-0.063
 replied-0.052
ara-0.050
uffy-0.045
pron-0.045
Token,
Token ID11
Feature activation+2.66e-02

Pos logit contributions
opus+0.070
pron+0.063
imi+0.059
ama+0.058
ilo+0.058
Neg logit contributions
 across-0.047
 even-0.046
wood-0.043
 along-0.041
 over-0.038
Token what
Token ID644
Feature activation+2.91e-02

Pos logit contributions
opus+0.043
uffy+0.039
inkle+0.037
uddle+0.036
ama+0.036
Neg logit contributions
 even-0.044
 across-0.042
 err-0.042
wood-0.042
 along-0.040
Token is
Token ID318
Feature activation+4.30e-02

Pos logit contributions
opus+0.047
uffy+0.038
uddle+0.037
 replied+0.037
imi+0.036
Neg logit contributions
 err-0.053
wood-0.052
beh-0.051
keeping-0.051
 across-0.050
Token your
Token ID534
Feature activation+2.76e-02

Pos logit contributions
opus+0.078
uffy+0.067
uddle+0.066
imi+0.063
ama+0.062
Neg logit contributions
 err-0.071
wood-0.068
 across-0.068
beh-0.067
keeping-0.066
Token name
Token ID1438
Feature activation+7.81e-02

Pos logit contributions
opus+0.048
uffy+0.038
ement+0.037
uddle+0.037
imi+0.035
Neg logit contributions
wood-0.046
 across-0.046
 err-0.045
 even-0.042
keeping-0.042
Token?"
Token ID1701
Feature activation+6.21e-02

Pos logit contributions
opus+0.090
uffy+0.071
uddle+0.071
imi+0.065
ilo+0.061
Neg logit contributions
 err-0.172
 across-0.169
wood-0.168
beh-0.160
 even-0.159
Token Tom
Token ID4186
Feature activation+2.92e-02

Pos logit contributions
opus+0.099
uffy+0.082
uddle+0.081
imi+0.077
pron+0.072
Neg logit contributions
 err-0.118
 across-0.115
wood-0.113
beh-0.110
 even-0.110
Token asked
Token ID1965
Feature activation+2.34e-02

Pos logit contributions
opus+0.056
uffy+0.047
uddle+0.047
imi+0.044
ama+0.043
Neg logit contributions
wood-0.045
 err-0.045
beh-0.044
keeping-0.043
 across-0.043
Token the
Token ID262
Feature activation+9.51e-03

Pos logit contributions
opus+0.037
uffy+0.030
uddle+0.029
 replied+0.028
imi+0.028
Neg logit contributions
 err-0.044
wood-0.043
keeping-0.043
beh-0.042
 across-0.041
Token new
Token ID649
Feature activation-1.81e-02

Pos logit contributions
opus+0.019
uffy+0.017
ama+0.017
imi+0.017
inkle+0.016
Neg logit contributions
 across-0.014
 err-0.014
 even-0.013
wood-0.012
 along-0.012
Token"
Token ID1
Feature activation-3.03e-02

Pos logit contributions
opus+0.018
uffy+0.015
 replied+0.014
uddle+0.014
imi+0.014
Neg logit contributions
 err-0.022
wood-0.022
keeping-0.022
beh-0.022
 across-0.021
TokenI
Token ID40
Feature activation-1.39e-02

Pos logit contributions
keeping+0.058
 err+0.057
wood+0.057
beh+0.054
 distances+0.053
Neg logit contributions
opus-0.049
 replied-0.037
ara-0.036
uffy-0.036
pron-0.036
Token have
Token ID423
Feature activation-2.17e-02

Pos logit contributions
keeping+0.021
 err+0.020
wood+0.019
beh+0.019
 distances+0.019
Neg logit contributions
opus-0.027
uffy-0.024
 replied-0.024
uddle-0.023
imi-0.022
Token a
Token ID257
Feature activation+1.57e-02

Pos logit contributions
beh+0.036
keeping+0.034
wood+0.032
 err+0.031
ried+0.030
Neg logit contributions
opus-0.038
 replied-0.033
uddle-0.033
 spoon-0.032
 dear-0.032
Token plan
Token ID1410
Feature activation+3.26e-03

Pos logit contributions
opus+0.025
uffy+0.020
imi+0.019
 replied+0.019
uddle+0.018
Neg logit contributions
 err-0.030
wood-0.029
 across-0.028
keeping-0.028
beh-0.027
Token,"
Token ID553
Feature activation+7.81e-02

Pos logit contributions
opus+0.003
 replied+0.002
uffy+0.002
imi+0.002
 says+0.002
Neg logit contributions
beh-0.008
 err-0.008
wood-0.007
 increasingly-0.007
keeping-0.007
Token Sam
Token ID3409
Feature activation+2.10e-02

Pos logit contributions
opus+0.094
uffy+0.071
 replied+0.068
uddle+0.067
imi+0.066
Neg logit contributions
 err-0.176
wood-0.173
keeping-0.172
beh-0.170
 across-0.166
Token says
Token ID1139
Feature activation+1.43e-02

Pos logit contributions
opus+0.017
uffy+0.011
uddle+0.010
imi+0.009
 replied+0.009
Neg logit contributions
 err-0.055
wood-0.055
beh-0.054
keeping-0.054
 across-0.053
Token.
Token ID13
Feature activation-3.33e-03

Pos logit contributions
opus+0.023
uffy+0.019
uddle+0.018
imi+0.018
 replied+0.018
Neg logit contributions
 err-0.026
wood-0.026
keeping-0.025
beh-0.025
 across-0.025
Token "
Token ID366
Feature activation-2.85e-02

Pos logit contributions
 err+0.006
wood+0.006
keeping+0.006
beh+0.006
 across+0.006
Neg logit contributions
opus-0.006
uffy-0.005
uddle-0.004
imi-0.004
 replied-0.004
TokenYou
Token ID1639
Feature activation-2.74e-02

Pos logit contributions
 err+0.063
beh+0.057
wood+0.056
 throughout+0.053
ur+0.052
Neg logit contributions
opus-0.037
ara-0.030
pron-0.027
on-0.026
Anna-0.026
Token,
Token ID11
Feature activation+3.23e-02

Pos logit contributions
 err+0.013
wood+0.012
beh+0.012
keeping+0.012
 across+0.012
Neg logit contributions
opus-0.010
uffy-0.008
uddle-0.008
 replied-0.007
imi-0.007
Token the
Token ID262
Feature activation+1.62e-02

Pos logit contributions
opus+0.061
uffy+0.051
uddle+0.049
 replied+0.048
imi+0.048
Neg logit contributions
 err-0.051
wood-0.048
beh-0.048
 across-0.048
keeping-0.047
Token gum
Token ID27142
Feature activation+2.28e-02

Pos logit contributions
opus+0.025
uffy+0.021
imi+0.020
uddle+0.019
 replied+0.019
Neg logit contributions
 err-0.030
wood-0.029
 across-0.029
keeping-0.028
beh-0.028
Token is
Token ID318
Feature activation+1.94e-02

Pos logit contributions
opus+0.030
uffy+0.023
imi+0.022
 replied+0.022
ama+0.021
Neg logit contributions
 err-0.047
wood-0.046
 across-0.046
keeping-0.045
beh-0.045
Token gone
Token ID3750
Feature activation+2.53e-02

Pos logit contributions
opus+0.029
 replied+0.024
uffy+0.024
imi+0.023
uddle+0.023
Neg logit contributions
keeping-0.037
 err-0.037
wood-0.036
beh-0.036
 distances-0.034
Token,"
Token ID553
Feature activation+7.80e-02

Pos logit contributions
opus+0.034
uffy+0.027
 replied+0.026
uddle+0.026
imi+0.025
Neg logit contributions
 err-0.052
wood-0.051
beh-0.051
keeping-0.051
 distances-0.049
Token Tim
Token ID5045
Feature activation+3.35e-02

Pos logit contributions
opus+0.141
uffy+0.115
uddle+0.113
imi+0.109
pron+0.107
Neg logit contributions
 err-0.131
wood-0.128
keeping-0.124
beh-0.124
 across-0.122
Token says
Token ID1139
Feature activation+2.74e-02

Pos logit contributions
opus+0.036
uffy+0.024
pron+0.024
uddle+0.022
ama+0.020
Neg logit contributions
wood-0.081
 err-0.081
beh-0.079
 across-0.079
keeping-0.078
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
opus+0.042
uffy+0.034
 replied+0.033
uddle+0.032
imi+0.032
Neg logit contributions
 err-0.053
wood-0.051
keeping-0.051
beh-0.050
 across-0.049
Token
Token ID198
Feature activation+1.65e-02

Pos logit contributions
opus+0.019
uffy+0.016
 replied+0.016
uddle+0.015
imi+0.015
Neg logit contributions
 err-0.021
wood-0.020
keeping-0.020
beh-0.020
 distances-0.019
Token
Token ID198
Feature activation+1.81e-02

Pos logit contributions
opus+0.033
uffy+0.027
uddle+0.026
 replied+0.026
imi+0.025
Neg logit contributions
 err-0.024
wood-0.024
keeping-0.024
 across-0.023
beh-0.023
TokenHello
Token ID15496
Feature activation+3.06e-02

Pos logit contributions
 err+0.057
keeping+0.054
wood+0.053
beh+0.052
 distances+0.048
Neg logit contributions
opus-0.062
 replied-0.051
ara-0.048
uffy-0.048
imi-0.048
Token,
Token ID11
Feature activation+2.92e-02

Pos logit contributions
opus+0.055
pron+0.049
imi+0.047
uddle+0.046
inkle+0.046
Neg logit contributions
 across-0.044
wood-0.042
 even-0.042
 err-0.039
 along-0.039
Token what
Token ID644
Feature activation+3.20e-02

Pos logit contributions
opus+0.045
uffy+0.044
inkle+0.042
ama+0.041
uddle+0.039
Neg logit contributions
 even-0.048
 err-0.046
 across-0.046
 along-0.044
wood-0.043
Token is
Token ID318
Feature activation+4.43e-02

Pos logit contributions
opus+0.053
uffy+0.044
uddle+0.043
 replied+0.042
imi+0.042
Neg logit contributions
 err-0.056
beh-0.055
wood-0.055
keeping-0.054
 across-0.054
Token your
Token ID534
Feature activation+3.23e-02

Pos logit contributions
opus+0.078
uddle+0.067
uffy+0.067
imi+0.063
ama+0.062
Neg logit contributions
 err-0.075
wood-0.071
 across-0.071
beh-0.070
keeping-0.069
Token name
Token ID1438
Feature activation+7.79e-02

Pos logit contributions
opus+0.051
uffy+0.041
imi+0.040
 replied+0.040
uddle+0.039
Neg logit contributions
 err-0.061
wood-0.059
keeping-0.058
 across-0.057
beh-0.057
Token?"
Token ID1701
Feature activation+5.78e-02

Pos logit contributions
opus+0.086
uffy+0.067
uddle+0.066
imi+0.061
ilo+0.055
Neg logit contributions
 err-0.176
 across-0.172
wood-0.172
beh-0.165
keeping-0.165
Token Lily
Token ID20037
Feature activation+2.33e-02

Pos logit contributions
opus+0.106
uffy+0.089
uddle+0.088
imi+0.085
ama+0.081
Neg logit contributions
 err-0.096
 across-0.091
wood-0.091
keeping-0.089
 even-0.088
Token asks
Token ID7893
Feature activation+2.10e-02

Pos logit contributions
opus+0.030
uffy+0.023
uddle+0.022
imi+0.021
 replied+0.021
Neg logit contributions
 err-0.050
wood-0.050
keeping-0.049
beh-0.049
 across-0.048
Token the
Token ID262
Feature activation+1.44e-02

Pos logit contributions
opus+0.032
uffy+0.026
 replied+0.025
uddle+0.025
imi+0.025
Neg logit contributions
 err-0.040
wood-0.039
keeping-0.039
beh-0.038
 across-0.037
Token new
Token ID649
Feature activation-1.56e-02

Pos logit contributions
opus+0.026
uffy+0.024
imi+0.024
ama+0.023
ilo+0.022
Neg logit contributions
 across-0.024
 err-0.024
 even-0.021
wood-0.021
 along-0.021
Token flower
Token ID15061
Feature activation+3.11e-02

Pos logit contributions
 err+0.013
wood+0.013
keeping+0.013
 across+0.012
beh+0.012
Neg logit contributions
opus-0.010
 replied-0.008
uffy-0.008
imi-0.008
uddle-0.008
Token isn
Token ID2125
Feature activation+3.30e-02

Pos logit contributions
opus+0.045
ilo+0.033
uffy+0.033
imi+0.032
ama+0.032
Neg logit contributions
 err-0.058
wood-0.057
 across-0.057
keeping-0.056
 even-0.055
Token't
Token ID470
Feature activation+1.51e-02

Pos logit contributions
opus+0.046
uffy+0.037
uddle+0.036
 replied+0.035
imi+0.034
Neg logit contributions
 err-0.067
wood-0.065
beh-0.065
keeping-0.064
 across-0.064
Token growing
Token ID3957
Feature activation+3.15e-02

Pos logit contributions
opus+0.026
 replied+0.022
imi+0.021
uffy+0.021
pron+0.020
Neg logit contributions
 err-0.026
wood-0.025
keeping-0.025
beh-0.025
 distances-0.024
Token anymore
Token ID7471
Feature activation+2.92e-02

Pos logit contributions
opus+0.041
uffy+0.032
 replied+0.032
uddle+0.032
imi+0.031
Neg logit contributions
 err-0.066
keeping-0.066
beh-0.065
wood-0.064
 distances-0.062
Token,"
Token ID553
Feature activation+7.78e-02

Pos logit contributions
opus+0.031
 replied+0.024
uffy+0.024
uddle+0.022
imi+0.022
Neg logit contributions
 err-0.068
wood-0.067
beh-0.067
keeping-0.065
ried-0.064
Token Tim
Token ID5045
Feature activation+2.89e-02

Pos logit contributions
opus+0.159
uffy+0.130
uddle+0.128
rescent+0.124
pron+0.122
Neg logit contributions
 err-0.115
 even-0.111
beh-0.109
 themselves-0.108
wood-0.104
Tokenmy
Token ID1820
Feature activation+2.23e-02

Pos logit contributions
opus+0.066
uddle+0.053
uffy+0.051
ement+0.051
inkle+0.050
Neg logit contributions
wood-0.037
beh-0.036
keeping-0.035
 err-0.034
 across-0.034
Token said
Token ID531
Feature activation+1.19e-02

Pos logit contributions
opus+0.033
uddle+0.025
uffy+0.024
ement+0.021
imi+0.021
Neg logit contributions
 even-0.047
beh-0.044
 err-0.042
wood-0.042
 across-0.042
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
opus+0.021
uffy+0.017
uddle+0.016
imi+0.016
 replied+0.016
Neg logit contributions
 err-0.021
wood-0.020
keeping-0.020
beh-0.020
 across-0.020
Token
Token ID198
Feature activation+8.83e-03

Pos logit contributions
opus+0.048
uffy+0.039
uddle+0.038
imi+0.038
inkle+0.037
Neg logit contributions
 across-0.030
wood-0.029
 err-0.029
keeping-0.028
 even-0.027
Token are
Token ID389
Feature activation+1.39e-02

Pos logit contributions
 err+0.029
wood+0.028
keeping+0.028
beh+0.027
 across+0.026
Neg logit contributions
opus-0.036
uffy-0.030
uddle-0.030
 replied-0.030
imi-0.030
Token looking
Token ID2045
Feature activation+1.82e-02

Pos logit contributions
opus+0.029
uffy+0.025
 replied+0.024
uddle+0.024
imi+0.024
Neg logit contributions
 err-0.019
wood-0.019
keeping-0.018
beh-0.018
 across-0.017
Token for
Token ID329
Feature activation-5.06e-04

Pos logit contributions
opus+0.041
uffy+0.035
 replied+0.034
uddle+0.034
imi+0.034
Neg logit contributions
 err-0.022
wood-0.021
keeping-0.021
beh-0.021
 across-0.020
Token penn
Token ID22429
Feature activation-2.10e-02

Pos logit contributions
wood+0.001
beh+0.001
keeping+0.001
 err+0.001
ried+0.001
Neg logit contributions
opus-0.001
 dear-0.001
 replied-0.001
 ow-0.001
 mom-0.001
Tokenies
Token ID444
Feature activation+1.75e-02

Pos logit contributions
 err+0.049
 across+0.047
beh+0.046
wood+0.046
keeping+0.046
Neg logit contributions
opus-0.024
uffy-0.018
uddle-0.017
imi-0.016
 replied-0.016
Token,"
Token ID553
Feature activation+7.78e-02

Pos logit contributions
opus+0.020
uffy+0.015
 replied+0.014
uddle+0.013
imi+0.012
Neg logit contributions
 err-0.040
beh-0.040
wood-0.039
keeping-0.038
 distances-0.037
Token Tom
Token ID4186
Feature activation+2.63e-02

Pos logit contributions
opus+0.121
uffy+0.098
uddle+0.094
imi+0.092
 replied+0.088
Neg logit contributions
 err-0.150
wood-0.145
keeping-0.145
beh-0.142
 across-0.140
Token said
Token ID531
Feature activation+2.60e-02

Pos logit contributions
opus+0.031
uffy+0.022
uddle+0.022
imi+0.021
 replied+0.019
Neg logit contributions
 err-0.060
wood-0.060
keeping-0.059
beh-0.058
 across-0.058
Token.
Token ID13
Feature activation+3.49e-03

Pos logit contributions
opus+0.040
uffy+0.032
 replied+0.032
uddle+0.031
imi+0.031
Neg logit contributions
 err-0.050
wood-0.049
keeping-0.048
beh-0.048
 across-0.046
Token "
Token ID366
Feature activation-2.02e-02

Pos logit contributions
opus+0.006
uffy+0.004
 replied+0.004
uddle+0.004
imi+0.004
Neg logit contributions
 err-0.007
wood-0.006
keeping-0.006
beh-0.006
 across-0.006
TokenWe
Token ID1135
Feature activation-1.95e-02

Pos logit contributions
 err+0.042
wood+0.039
beh+0.035
 throughout+0.034
keeping+0.033
Neg logit contributions
opus-0.031
ara-0.023
on-0.021
uffy-0.020
pron-0.020
TokenBut
Token ID1537
Feature activation+1.56e-02

Pos logit contributions
wood+0.041
 err+0.040
keeping+0.040
beh+0.038
 across+0.036
Neg logit contributions
opus-0.043
uffy-0.034
ara-0.034
 replied-0.033
uddle-0.033
Token I
Token ID314
Feature activation+9.94e-03

Pos logit contributions
opus+0.027
uddle+0.022
uffy+0.022
imi+0.021
 replied+0.021
Neg logit contributions
wood-0.026
 err-0.026
 across-0.025
keeping-0.025
beh-0.024
Token am
Token ID716
Feature activation+1.78e-02

Pos logit contributions
opus+0.020
uffy+0.018
 replied+0.017
uddle+0.017
imi+0.017
Neg logit contributions
 err-0.014
keeping-0.014
wood-0.013
beh-0.013
 distances-0.012
Token nos
Token ID43630
Feature activation+2.37e-02

Pos logit contributions
opus+0.026
uffy+0.022
 replied+0.021
imi+0.020
ama+0.020
Neg logit contributions
wood-0.034
 err-0.034
keeping-0.034
beh-0.032
 distances-0.032
Tokeny
Token ID88
Feature activation+2.17e-02

Pos logit contributions
opus+0.048
uffy+0.041
 replied+0.040
uddle+0.040
imi+0.039
Neg logit contributions
 err-0.033
wood-0.033
keeping-0.033
beh-0.032
 across-0.031
Token,"
Token ID553
Feature activation+7.77e-02

Pos logit contributions
opus+0.029
uffy+0.023
 replied+0.022
uddle+0.022
imi+0.022
Neg logit contributions
 err-0.045
wood-0.044
beh-0.044
keeping-0.043
 across-0.042
Token says
Token ID1139
Feature activation+4.27e-02

Pos logit contributions
opus+0.115
uffy+0.089
uddle+0.086
imi+0.080
ama+0.078
Neg logit contributions
 err-0.161
 even-0.150
beh-0.148
 themselves-0.148
 across-0.146
Token Lily
Token ID20037
Feature activation+6.92e-04

Pos logit contributions
opus+0.082
uddle+0.067
uffy+0.063
imi+0.063
pron+0.060
Neg logit contributions
 err-0.073
 across-0.069
 themselves-0.067
wood-0.067
beh-0.066
Token.
Token ID13
Feature activation+5.06e-03

Pos logit contributions
opus+0.001
uddle+0.001
uffy+0.001
imi+0.001
pron+0.001
Neg logit contributions
wood-0.001
beh-0.001
 across-0.001
keeping-0.001
 err-0.001
Token "
Token ID366
Feature activation-1.99e-02

Pos logit contributions
opus+0.010
uddle+0.008
uffy+0.008
imi+0.008
inkle+0.008
Neg logit contributions
 across-0.008
 err-0.007
keeping-0.007
wood-0.007
 distances-0.007
TokenI
Token ID40
Feature activation-2.60e-03

Pos logit contributions
 err+0.043
wood+0.042
beh+0.039
keeping+0.037
 throughout+0.037
Neg logit contributions
opus-0.029
ara-0.020
on-0.019
ister-0.018
�-0.017
Token"
Token ID1
Feature activation-1.96e-02

Pos logit contributions
opus+0.022
uffy+0.018
 replied+0.017
uddle+0.017
imi+0.017
Neg logit contributions
 err-0.025
wood-0.025
keeping-0.024
beh-0.024
 across-0.023
TokenI
Token ID40
Feature activation-7.74e-03

Pos logit contributions
 err+0.036
wood+0.035
beh+0.034
keeping+0.034
 across+0.032
Neg logit contributions
opus-0.033
uffy-0.026
 replied-0.026
imi-0.026
uddle-0.026
Token want
Token ID765
Feature activation+1.91e-02

Pos logit contributions
 err+0.012
keeping+0.011
wood+0.011
beh+0.011
 across+0.010
Neg logit contributions
opus-0.015
uffy-0.013
 replied-0.013
uddle-0.013
imi-0.012
Token the
Token ID262
Feature activation+8.26e-03

Pos logit contributions
opus+0.043
uffy+0.037
 replied+0.036
uddle+0.036
imi+0.036
Neg logit contributions
 err-0.023
wood-0.022
keeping-0.022
beh-0.022
 across-0.021
Token jam
Token ID16853
Feature activation+4.57e-03

Pos logit contributions
opus+0.014
uffy+0.012
 replied+0.011
imi+0.011
ement+0.011
Neg logit contributions
 err-0.014
 across-0.014
wood-0.014
keeping-0.013
 even-0.013
Token,"
Token ID553
Feature activation+7.76e-02

Pos logit contributions
opus+0.006
uffy+0.005
uddle+0.005
 replied+0.005
imi+0.004
Neg logit contributions
 err-0.010
beh-0.010
wood-0.010
keeping-0.010
 distances-0.009
Token Tom
Token ID4186
Feature activation+2.59e-02

Pos logit contributions
opus+0.135
uffy+0.107
uddle+0.104
pron+0.102
imi+0.098
Neg logit contributions
 err-0.144
 themselves-0.137
 even-0.134
 across-0.132
wood-0.129
Token said
Token ID531
Feature activation+2.48e-02

Pos logit contributions
opus+0.033
uffy+0.025
uddle+0.024
imi+0.023
ement+0.022
Neg logit contributions
wood-0.056
 err-0.056
beh-0.055
 across-0.054
 even-0.054
Token.
Token ID13
Feature activation+3.42e-03

Pos logit contributions
opus+0.042
uffy+0.034
uddle+0.033
imi+0.033
 replied+0.032
Neg logit contributions
 err-0.044
wood-0.043
keeping-0.043
beh-0.042
 across-0.042
Token
Token ID198
Feature activation+1.65e-02

Pos logit contributions
opus+0.006
uffy+0.005
 replied+0.005
uddle+0.004
imi+0.004
Neg logit contributions
 err-0.006
wood-0.006
keeping-0.006
beh-0.006
 distances-0.006
Token
Token ID198
Feature activation+1.91e-02

Pos logit contributions
opus+0.035
uffy+0.029
imi+0.028
uddle+0.028
 replied+0.028
Neg logit contributions
 err-0.022
 across-0.022
wood-0.021
keeping-0.020
and-0.020
Token Everyday
Token ID48154
Feature activation-2.28e-02

Pos logit contributions
 err+0.007
wood+0.006
keeping+0.006
beh+0.006
 across+0.006
Neg logit contributions
opus-0.008
uffy-0.007
uddle-0.007
imi-0.007
 replied-0.007
Token he
Token ID339
Feature activation-1.02e-02

Pos logit contributions
 err+0.047
beh+0.046
keeping+0.046
wood+0.046
 distances+0.044
Neg logit contributions
opus-0.031
 replied-0.025
uffy-0.025
imi-0.023
uddle-0.022
Token would
Token ID561
Feature activation-3.39e-02

Pos logit contributions
 err+0.018
keeping+0.018
wood+0.018
beh+0.018
 distances+0.016
Neg logit contributions
opus-0.016
 replied-0.014
uffy-0.014
uddle-0.013
imi-0.013
Token take
Token ID1011
Feature activation-2.33e-02

Pos logit contributions
 err+0.053
keeping+0.052
beh+0.052
wood+0.051
and+0.047
Neg logit contributions
opus-0.061
 replied-0.056
uffy-0.050
imi-0.050
inkle-0.048
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
beh+0.038
wood+0.038
 err+0.037
keeping+0.037
ried+0.037
Neg logit contributions
opus-0.037
 replied-0.033
ilo-0.031
uffy-0.031
imi-0.030
Token av
Token ID1196
Feature activation-1.02e-01

Pos logit contributions
beh+0.027
keeping+0.027
 err+0.027
wood+0.026
 distances+0.025
Neg logit contributions
opus-0.022
uffy-0.017
 replied-0.017
uddle-0.017
imi-0.016
Tokenoc
Token ID420
Feature activation-3.10e-02

Pos logit contributions
 err+0.215
 themselves+0.209
keeping+0.208
beh+0.195
ried+0.194
Neg logit contributions
opus-0.142
on-0.107
 replied-0.096
�-0.096
ama-0.095
Tokenados
Token ID22484
Feature activation-4.68e-02

Pos logit contributions
 err+0.057
 across+0.054
beh+0.053
keeping+0.053
wood+0.052
Neg logit contributions
opus-0.052
ama-0.040
ement-0.039
uffy-0.039
 replied-0.039
Token and
Token ID290
Feature activation-1.39e-02

Pos logit contributions
beh+0.075
 err+0.074
gent+0.069
keeping+0.069
 distances+0.063
Neg logit contributions
opus-0.082
 replied-0.071
 box-0.067
uffy-0.066
on-0.064
Token scatter
Token ID41058
Feature activation-1.15e-02

Pos logit contributions
beh+0.025
and+0.024
 err+0.023
keeping+0.023
|+0.023
Neg logit contributions
 replied-0.021
opus-0.020
 spoon-0.017
 fork-0.017
 mom-0.017
Token them
Token ID606
Feature activation-2.37e-02

Pos logit contributions
beh+0.019
 err+0.019
keeping+0.018
wood+0.017
uries+0.017
Neg logit contributions
opus-0.018
 replied-0.017
ement-0.016
 box-0.016
uddle-0.016
Token That
Token ID1320
Feature activation-6.86e-03

Pos logit contributions
opus+0.003
uffy+0.002
uddle+0.002
imi+0.002
 replied+0.002
Neg logit contributions
 err-0.003
wood-0.003
keeping-0.003
beh-0.003
 across-0.003
Token was
Token ID373
Feature activation+1.68e-03

Pos logit contributions
 err+0.012
wood+0.012
keeping+0.012
beh+0.011
 across+0.011
Neg logit contributions
opus-0.012
uffy-0.010
 replied-0.009
uddle-0.009
imi-0.009
Token the
Token ID262
Feature activation+1.08e-03

Pos logit contributions
opus+0.002
 replied+0.002
 sincere+0.002
 ow+0.002
 sorry+0.002
Neg logit contributions
keeping-0.003
beh-0.003
and-0.003
 err-0.003
wood-0.003
Token setup
Token ID9058
Feature activation-2.45e-02

Pos logit contributions
opus+0.002
uffy+0.002
 replied+0.001
uddle+0.001
imi+0.001
Neg logit contributions
 err-0.002
wood-0.002
keeping-0.002
beh-0.002
 across-0.002
Token that
Token ID326
Feature activation+1.53e-02

Pos logit contributions
 err+0.041
beh+0.041
wood+0.040
keeping+0.040
 distances+0.037
Neg logit contributions
opus-0.042
 replied-0.036
uffy-0.035
uddle-0.033
imi-0.033
Token fore
Token ID1674
Feature activation-9.99e-02

Pos logit contributions
opus+0.023
 replied+0.019
uffy+0.018
uddle+0.018
imi+0.018
Neg logit contributions
 err-0.029
keeping-0.029
wood-0.028
beh-0.028
 distances-0.027
Tokenshadow
Token ID19106
Feature activation-2.59e-02

Pos logit contributions
keeping+0.139
wood+0.128
 err+0.126
 across+0.122
 streams+0.120
Neg logit contributions
opus-0.194
 replied-0.170
uffy-0.159
ama-0.159
uddle-0.156
Tokened
Token ID276
Feature activation-1.40e-02

Pos logit contributions
 err+0.043
keeping+0.041
wood+0.040
beh+0.039
 distances+0.039
Neg logit contributions
opus-0.047
uffy-0.040
 replied-0.039
uddle-0.038
imi-0.038
Token the
Token ID262
Feature activation-1.50e-02

Pos logit contributions
 err+0.022
beh+0.021
keeping+0.021
wood+0.020
 distances+0.020
Neg logit contributions
opus-0.025
 replied-0.022
uffy-0.021
imi-0.020
ama-0.020
Token bad
Token ID2089
Feature activation-4.17e-02

Pos logit contributions
 err+0.027
beh+0.027
keeping+0.027
wood+0.026
 distances+0.026
Neg logit contributions
opus-0.024
uffy-0.019
 replied-0.019
uddle-0.019
imi-0.019
Token ending
Token ID7464
Feature activation-1.14e-02

Pos logit contributions
beh+0.078
 err+0.074
and+0.073
keeping+0.073
 distances+0.069
Neg logit contributions
opus-0.067
 replied-0.056
uffy-0.054
ama-0.052
imi-0.051
Token room
Token ID2119
Feature activation-2.57e-04

Pos logit contributions
opus+0.016
uffy+0.012
 replied+0.012
imi+0.012
uddle+0.012
Neg logit contributions
 err-0.025
wood-0.024
keeping-0.024
 across-0.023
beh-0.023
Token while
Token ID981
Feature activation-6.44e-03

Pos logit contributions
beh+0.000
wood+0.000
keeping+0.000
 err+0.000
 distances+0.000
Neg logit contributions
opus-0.000
uffy-0.000
 replied-0.000
uddle-0.000
imi-0.000
Token she
Token ID673
Feature activation-4.15e-03

Pos logit contributions
 err+0.011
keeping+0.010
wood+0.010
 across+0.010
beh+0.010
Neg logit contributions
opus-0.012
uffy-0.010
uddle-0.010
 replied-0.010
imi-0.009
Token ran
Token ID4966
Feature activation-4.33e-02

Pos logit contributions
 err+0.007
wood+0.007
keeping+0.007
beh+0.007
 across+0.007
Neg logit contributions
opus-0.007
uffy-0.006
 replied-0.006
uddle-0.006
imi-0.006
Token an
Token ID281
Feature activation-1.28e-02

Pos logit contributions
beh+0.085
wood+0.070
keeping+0.070
cr+0.069
 distances+0.068
Neg logit contributions
 window-0.056
 box-0.055
 replied-0.053
 mom-0.052
uddle-0.050
Token err
Token ID11454
Feature activation-9.28e-02

Pos logit contributions
wood+0.023
keeping+0.022
beh+0.022
 distances+0.021
 across+0.021
Neg logit contributions
opus-0.020
imi-0.017
 replied-0.017
uffy-0.016
ama-0.016
Tokenand
Token ID392
Feature activation-2.81e-02

Pos logit contributions
wood+0.106
fl+0.098
 across+0.095
ried+0.086
cr+0.084
Neg logit contributions
opus-0.194
ister-0.176
olly-0.172
imi-0.171
uddle-0.169
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
 distances+0.049
wood+0.048
keeping+0.048
beh+0.047
Circ+0.047
Neg logit contributions
opus-0.040
uffy-0.037
ama-0.034
ister-0.034
uddle-0.033
Token The
Token ID383
Feature activation-1.73e-02

Pos logit contributions
 err+0.029
wood+0.028
keeping+0.028
beh+0.028
 across+0.027
Neg logit contributions
opus-0.029
uffy-0.024
 replied-0.024
uddle-0.023
imi-0.023
Token girl
Token ID2576
Feature activation+7.11e-04

Pos logit contributions
 err+0.039
wood+0.037
keeping+0.037
 across+0.036
beh+0.036
Neg logit contributions
opus-0.021
uffy-0.016
 replied-0.016
uddle-0.015
imi-0.015
Token was
Token ID373
Feature activation-1.21e-02

Pos logit contributions
opus+0.001
uffy+0.001
 replied+0.001
uddle+0.001
imi+0.001
Neg logit contributions
 err-0.001
wood-0.001
keeping-0.001
beh-0.001
 across-0.001
Token as
Token ID355
Feature activation-5.05e-04

Pos logit contributions
beh+0.113
 err+0.108
wood+0.108
keeping+0.106
ried+0.103
Neg logit contributions
opus-0.064
 replied-0.055
uddle-0.048
uffy-0.048
ement-0.045
Token he
Token ID339
Feature activation-1.66e-03

Pos logit contributions
 err+0.001
wood+0.001
keeping+0.001
beh+0.001
 distances+0.001
Neg logit contributions
opus-0.001
uffy-0.001
 replied-0.001
imi-0.001
uddle-0.001
Token opened
Token ID4721
Feature activation-5.55e-03

Pos logit contributions
 err+0.003
wood+0.003
keeping+0.003
beh+0.003
 across+0.003
Neg logit contributions
opus-0.003
uffy-0.002
 replied-0.002
uddle-0.002
imi-0.002
Token the
Token ID262
Feature activation+1.35e-02

Pos logit contributions
beh+0.009
 err+0.009
keeping+0.008
wood+0.008
and+0.008
Neg logit contributions
opus-0.010
 replied-0.009
uffy-0.009
uddle-0.008
imi-0.008
Token ch
Token ID442
Feature activation-1.62e-02

Pos logit contributions
opus+0.014
uffy+0.009
uddle+0.009
 replied+0.009
imi+0.009
Neg logit contributions
 err-0.032
beh-0.032
keeping-0.032
wood-0.032
 distances-0.031
Tokenocol
Token ID4668
Feature activation-9.03e-02

Pos logit contributions
keeping+0.030
 err+0.030
wood+0.029
beh+0.029
 across+0.028
Neg logit contributions
opus-0.026
uddle-0.021
uffy-0.021
 replied-0.020
imi-0.020
Tokenates
Token ID689
Feature activation-8.82e-03

Pos logit contributions
 err+0.200
 even+0.182
 across+0.171
beh+0.169
 explore+0.163
Neg logit contributions
opus-0.123
ama-0.103
uffy-0.098
inkle-0.097
on-0.096
Token and
Token ID290
Feature activation+1.26e-02

Pos logit contributions
 err+0.015
beh+0.015
wood+0.014
keeping+0.014
 distances+0.014
Neg logit contributions
opus-0.015
uffy-0.012
 replied-0.012
uddle-0.012
imi-0.012
Token started
Token ID2067
Feature activation-1.31e-02

Pos logit contributions
opus+0.020
 replied+0.018
uffy+0.016
uddle+0.015
imi+0.015
Neg logit contributions
 err-0.022
beh-0.022
wood-0.021
keeping-0.021
and-0.021
Token eating
Token ID6600
Feature activation+3.21e-03

Pos logit contributions
 err+0.018
beh+0.018
keeping+0.018
wood+0.018
uries+0.017
Neg logit contributions
opus-0.024
 replied-0.022
uffy-0.020
pron-0.019
imi-0.019
Token them
Token ID606
Feature activation-1.39e-02

Pos logit contributions
opus+0.005
 replied+0.004
ement+0.004
uddle+0.004
 fork+0.004
Neg logit contributions
beh-0.006
keeping-0.005
 err-0.005
wood-0.005
ried-0.005
Token the
Token ID262
Feature activation+5.26e-03

Pos logit contributions
opus+0.005
 replied+0.004
ement+0.004
uffy+0.004
uddle+0.004
Neg logit contributions
beh-0.006
 err-0.006
keeping-0.006
 distances-0.006
uries-0.006
Token tree
Token ID5509
Feature activation-7.27e-03

Pos logit contributions
opus+0.007
uddle+0.006
uffy+0.006
 replied+0.005
imi+0.005
Neg logit contributions
wood-0.011
beh-0.011
keeping-0.011
 err-0.011
ried-0.010
Token to
Token ID284
Feature activation+2.04e-02

Pos logit contributions
beh+0.014
 err+0.013
keeping+0.013
wood+0.013
 distances+0.012
Neg logit contributions
opus-0.010
 replied-0.009
uffy-0.009
uddle-0.009
 box-0.008
Token its
Token ID663
Feature activation+5.97e-03

Pos logit contributions
opus+0.032
 replied+0.030
imi+0.027
uffy+0.027
inkle+0.027
Neg logit contributions
beh-0.035
 err-0.033
wood-0.033
keeping-0.032
 distances-0.032
Token once
Token ID1752
Feature activation-3.92e-03

Pos logit contributions
opus+0.010
uddle+0.009
 replied+0.009
uffy+0.009
ama+0.008
Neg logit contributions
beh-0.010
keeping-0.010
wood-0.010
gent-0.009
 err-0.009
Token former
Token ID1966
Feature activation-8.99e-02

Pos logit contributions
beh+0.007
keeping+0.007
 err+0.007
wood+0.007
uries+0.007
Neg logit contributions
opus-0.006
 replied-0.006
inkle-0.005
ement-0.005
uddle-0.005
Token beauty
Token ID8737
Feature activation-6.40e-03

Pos logit contributions
beh+0.166
Circ+0.159
cr+0.158
keeping+0.155
 across+0.153
Neg logit contributions
inkle-0.123
ement-0.119
uddle-0.110
 dear-0.110
opus-0.109
Token.
Token ID13
Feature activation-7.99e-03

Pos logit contributions
 err+0.014
beh+0.014
keeping+0.013
wood+0.013
 distances+0.013
Neg logit contributions
opus-0.008
 replied-0.006
uffy-0.006
uddle-0.006
ement-0.006
Token He
Token ID679
Feature activation+7.66e-03

Pos logit contributions
 err+0.016
beh+0.015
keeping+0.015
wood+0.014
 distances+0.014
Neg logit contributions
opus-0.012
 replied-0.010
uffy-0.010
ama-0.009
imi-0.009
Token was
Token ID373
Feature activation-6.34e-03

Pos logit contributions
opus+0.011
 replied+0.010
uffy+0.008
uddle+0.008
imi+0.008
Neg logit contributions
 err-0.016
keeping-0.015
wood-0.015
beh-0.015
 distances-0.014
Token happy
Token ID3772
Feature activation+1.03e-02

Pos logit contributions
 err+0.010
keeping+0.010
wood+0.010
beh+0.010
 distances+0.009
Neg logit contributions
 replied-0.010
opus-0.010
 sincere-0.009
ama-0.009
 shocked-0.009
Token
Token ID198
Feature activation-7.97e-03

Pos logit contributions
 err+0.012
 across+0.011
keeping+0.011
wood+0.010
and+0.010
Neg logit contributions
opus-0.018
uddle-0.015
uffy-0.015
imi-0.014
 replied-0.014
TokenGrand
Token ID23581
Feature activation-1.37e-02

Pos logit contributions
 err+0.012
wood+0.012
keeping+0.012
beh+0.012
 across+0.011
Neg logit contributions
opus-0.015
 replied-0.013
uffy-0.013
imi-0.012
uddle-0.012
Tokenma
Token ID2611
Feature activation-1.35e-03

Pos logit contributions
 err+0.034
wood+0.033
keeping+0.033
beh+0.033
 distances+0.031
Neg logit contributions
opus-0.014
uffy-0.009
uddle-0.009
 replied-0.009
pron-0.008
Token's
Token ID338
Feature activation-1.09e-02

Pos logit contributions
 err+0.002
wood+0.002
keeping+0.002
beh+0.002
 across+0.002
Neg logit contributions
opus-0.002
uffy-0.002
 replied-0.002
uddle-0.002
imi-0.002
Token words
Token ID2456
Feature activation-6.94e-03

Pos logit contributions
 err+0.020
wood+0.020
keeping+0.019
 across+0.019
beh+0.019
Neg logit contributions
opus-0.018
uffy-0.014
uddle-0.014
imi-0.014
 replied-0.013
Token fore
Token ID1674
Feature activation-8.81e-02

Pos logit contributions
 err+0.012
keeping+0.012
wood+0.012
beh+0.012
 across+0.011
Neg logit contributions
opus-0.012
uffy-0.010
 replied-0.010
uddle-0.009
imi-0.009
Tokenshadow
Token ID19106
Feature activation-2.42e-02

Pos logit contributions
keeping+0.124
wood+0.116
 err+0.113
 streams+0.108
 across+0.108
Neg logit contributions
opus-0.169
 replied-0.153
uffy-0.143
pron-0.140
ama-0.139
Tokened
Token ID276
Feature activation-1.63e-02

Pos logit contributions
 err+0.044
keeping+0.042
wood+0.042
beh+0.042
 across+0.040
Neg logit contributions
opus-0.040
uffy-0.033
 replied-0.032
uddle-0.032
imi-0.031
Token a
Token ID257
Feature activation-1.20e-03

Pos logit contributions
 err+0.023
keeping+0.023
beh+0.022
wood+0.021
 distances+0.021
Neg logit contributions
opus-0.031
 replied-0.028
uffy-0.027
imi-0.026
ama-0.026
Token happy
Token ID3772
Feature activation-1.29e-02

Pos logit contributions
 err+0.002
keeping+0.002
wood+0.002
beh+0.002
 distances+0.002
Neg logit contributions
opus-0.002
 replied-0.002
uffy-0.002
uddle-0.002
imi-0.002
Token future
Token ID2003
Feature activation-3.73e-02

Pos logit contributions
beh+0.025
 err+0.024
wood+0.024
keeping+0.024
 distances+0.024
Neg logit contributions
opus-0.020
 replied-0.017
uffy-0.015
uddle-0.015
imi-0.015
Token It
Token ID632
Feature activation+8.95e-03

Pos logit contributions
 err+0.012
keeping+0.012
wood+0.012
beh+0.011
 across+0.011
Neg logit contributions
opus-0.014
uddle-0.012
uffy-0.012
imi-0.011
 replied-0.011
Token turned
Token ID2900
Feature activation-1.37e-02

Pos logit contributions
opus+0.014
uffy+0.012
 replied+0.012
uddle+0.011
imi+0.011
Neg logit contributions
 err-0.016
wood-0.016
keeping-0.016
beh-0.016
 distances-0.015
Token out
Token ID503
Feature activation-1.21e-02

Pos logit contributions
keeping+0.017
 err+0.016
beh+0.016
wood+0.016
 distances+0.015
Neg logit contributions
opus-0.030
 replied-0.026
uffy-0.026
uddle-0.026
imi-0.025
Token that
Token ID326
Feature activation+2.69e-02

Pos logit contributions
 err+0.017
keeping+0.016
wood+0.016
beh+0.016
 distances+0.015
Neg logit contributions
opus-0.025
uffy-0.021
 replied-0.021
uddle-0.021
imi-0.021
Token the
Token ID262
Feature activation-7.69e-03

Pos logit contributions
opus+0.045
 replied+0.038
uffy+0.037
imi+0.036
uddle+0.035
Neg logit contributions
 err-0.047
beh-0.046
keeping-0.046
wood-0.046
 distances-0.044
Token av
Token ID1196
Feature activation-8.63e-02

Pos logit contributions
 err+0.016
wood+0.016
beh+0.016
keeping+0.016
 distances+0.015
Neg logit contributions
opus-0.010
uffy-0.008
uddle-0.007
 replied-0.007
imi-0.007
Tokenoc
Token ID420
Feature activation-2.40e-02

Pos logit contributions
 err+0.177
keeping+0.168
beh+0.164
wood+0.164
 themselves+0.164
Neg logit contributions
opus-0.126
 replied-0.098
uddle-0.094
uffy-0.091
on-0.090
Tokenados
Token ID22484
Feature activation-2.01e-02

Pos logit contributions
 err+0.044
 across+0.041
wood+0.041
beh+0.041
keeping+0.040
Neg logit contributions
opus-0.040
uffy-0.032
uddle-0.031
 replied-0.031
ama-0.031
Token were
Token ID547
Feature activation-2.40e-02

Pos logit contributions
 err+0.033
wood+0.032
keeping+0.032
beh+0.031
 across+0.031
Neg logit contributions
opus-0.036
uffy-0.030
imi-0.029
uddle-0.029
 replied-0.028
Token bad
Token ID2089
Feature activation-3.19e-02

Pos logit contributions
 err+0.045
keeping+0.044
wood+0.044
beh+0.043
and+0.041
Neg logit contributions
opus-0.033
 replied-0.030
 sincere-0.029
uffy-0.029
 ow-0.028
Token and
Token ID290
Feature activation+7.72e-03

Pos logit contributions
 err+0.051
beh+0.050
wood+0.049
keeping+0.049
 distances+0.047
Neg logit contributions
opus-0.057
uffy-0.048
 replied-0.048
uddle-0.047
imi-0.047
Token.
Token ID13
Feature activation-8.94e-03

Pos logit contributions
 err+0.042
beh+0.041
wood+0.041
keeping+0.039
 distances+0.038
Neg logit contributions
opus-0.027
 replied-0.021
uffy-0.021
uddle-0.020
ement-0.020
Token He
Token ID679
Feature activation+1.01e-02

Pos logit contributions
 err+0.016
wood+0.015
keeping+0.015
beh+0.015
 across+0.015
Neg logit contributions
opus-0.015
uffy-0.012
 replied-0.012
uddle-0.012
imi-0.012
Token gave
Token ID2921
Feature activation+7.64e-03

Pos logit contributions
opus+0.016
uffy+0.013
 replied+0.013
uddle+0.013
imi+0.013
Neg logit contributions
 err-0.019
wood-0.018
keeping-0.018
beh-0.018
 across-0.017
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
opus+0.012
 replied+0.011
uffy+0.010
ama+0.010
 mom+0.010
Neg logit contributions
beh-0.013
 err-0.013
keeping-0.013
 distances-0.013
wood-0.012
Token friendly
Token ID8030
Feature activation-3.00e-02

Pos logit contributions
 err+0.023
wood+0.022
keeping+0.022
beh+0.022
 across+0.022
Neg logit contributions
opus-0.015
uffy-0.012
 replied-0.011
uddle-0.011
imi-0.011
Token nurses
Token ID20669
Feature activation-8.50e-02

Pos logit contributions
keeping+0.061
 err+0.061
beh+0.061
wood+0.060
 distances+0.059
Neg logit contributions
opus-0.041
uffy-0.032
 replied-0.031
uddle-0.031
imi-0.030
Token a
Token ID257
Feature activation-2.33e-02

Pos logit contributions
beh+0.144
keeping+0.141
 distances+0.138
 err+0.136
wood+0.136
Neg logit contributions
opus-0.122
 replied-0.113
on-0.112
 box-0.111
 journal-0.107
Token wave
Token ID6769
Feature activation-2.10e-02

Pos logit contributions
 err+0.045
wood+0.044
keeping+0.043
 across+0.043
beh+0.042
Neg logit contributions
opus-0.035
uffy-0.028
imi-0.027
uddle-0.027
 replied-0.027
Token goodbye
Token ID24829
Feature activation-1.37e-02

Pos logit contributions
 err+0.033
wood+0.032
beh+0.032
keeping+0.032
 distances+0.030
Neg logit contributions
opus-0.039
 replied-0.033
uffy-0.033
uddle-0.031
imi-0.031
Token.
Token ID13
Feature activation-8.46e-03

Pos logit contributions
beh+0.026
 err+0.025
wood+0.023
 distances+0.023
keeping+0.023
Neg logit contributions
opus-0.020
 replied-0.019
uffy-0.017
uddle-0.016
ama-0.016
Token They
Token ID1119
Feature activation-1.57e-02

Pos logit contributions
 err+0.014
wood+0.014
keeping+0.014
beh+0.013
 across+0.013
Neg logit contributions
opus-0.015
uffy-0.012
uddle-0.012
 replied-0.012
imi-0.012
Token As
Token ID1081
Feature activation-2.71e-02

Pos logit contributions
 err+0.034
beh+0.033
wood+0.033
keeping+0.033
 distances+0.031
Neg logit contributions
opus-0.028
 replied-0.022
uffy-0.022
uddle-0.021
imi-0.021
Token the
Token ID262
Feature activation-3.48e-02

Pos logit contributions
 err+0.055
beh+0.054
keeping+0.053
wood+0.053
 distances+0.051
Neg logit contributions
opus-0.038
 replied-0.030
uffy-0.029
imi-0.028
uddle-0.027
Token farmer
Token ID18739
Feature activation-1.43e-02

Pos logit contributions
 err+0.064
keeping+0.062
beh+0.062
wood+0.062
 across+0.059
Neg logit contributions
opus-0.057
 replied-0.045
uffy-0.045
uddle-0.044
imi-0.043
Token scattered
Token ID16830
Feature activation-2.05e-02

Pos logit contributions
 err+0.026
keeping+0.025
wood+0.025
beh+0.025
 distances+0.023
Neg logit contributions
opus-0.024
 replied-0.020
uffy-0.019
uddle-0.019
imi-0.018
Token the
Token ID262
Feature activation-2.18e-02

Pos logit contributions
beh+0.038
keeping+0.035
 err+0.034
uries+0.034
wood+0.033
Neg logit contributions
opus-0.031
 replied-0.029
ement-0.026
 box-0.026
on-0.026
Token av
Token ID1196
Feature activation-8.47e-02

Pos logit contributions
beh+0.043
keeping+0.040
 err+0.040
 themselves+0.039
wood+0.039
Neg logit contributions
opus-0.034
 replied-0.026
pron-0.026
 box-0.026
uddle-0.026
Tokenoc
Token ID420
Feature activation-1.94e-02

Pos logit contributions
 err+0.178
keeping+0.169
beh+0.166
 themselves+0.165
ried+0.162
Neg logit contributions
opus-0.121
on-0.090
 replied-0.089
ama-0.085
uddle-0.083
Tokenados
Token ID22484
Feature activation-2.95e-02

Pos logit contributions
 err+0.036
 across+0.034
beh+0.034
keeping+0.033
wood+0.033
Neg logit contributions
opus-0.032
ama-0.025
uffy-0.024
ement-0.024
uddle-0.024
Token,
Token ID11
Feature activation-2.84e-03

Pos logit contributions
 err+0.055
beh+0.055
keeping+0.053
wood+0.051
uries+0.050
Neg logit contributions
opus-0.047
 replied-0.040
uddle-0.037
uffy-0.036
ement-0.034
Token he
Token ID339
Feature activation+1.66e-03

Pos logit contributions
 err+0.006
beh+0.006
keeping+0.006
wood+0.006
 distances+0.005
Neg logit contributions
opus-0.004
 replied-0.003
uffy-0.003
imi-0.003
uddle-0.003
Token noticed
Token ID6810
Feature activation+8.35e-03

Pos logit contributions
opus+0.003
uffy+0.002
uddle+0.002
imi+0.002
 replied+0.002
Neg logit contributions
 err-0.003
wood-0.003
keeping-0.003
beh-0.003
 across-0.003
Token was
Token ID373
Feature activation-4.30e-03

Pos logit contributions
opus+0.022
uffy+0.018
uddle+0.017
 replied+0.017
imi+0.017
Neg logit contributions
 err-0.026
wood-0.025
keeping-0.025
beh-0.025
 across-0.024
Token a
Token ID257
Feature activation+2.36e-02

Pos logit contributions
beh+0.007
 err+0.007
keeping+0.007
wood+0.006
 distances+0.006
Neg logit contributions
opus-0.008
 replied-0.007
uffy-0.007
imi-0.007
ilo-0.006
Token box
Token ID3091
Feature activation+1.95e-02

Pos logit contributions
opus+0.036
uffy+0.029
imi+0.029
 replied+0.028
ement+0.028
Neg logit contributions
 err-0.044
 across-0.043
wood-0.042
keeping-0.041
 even-0.041
Token of
Token ID286
Feature activation-1.80e-02

Pos logit contributions
opus+0.039
uffy+0.034
uddle+0.033
imi+0.033
 replied+0.032
Neg logit contributions
 err-0.028
wood-0.027
keeping-0.027
 across-0.027
beh-0.026
Token ch
Token ID442
Feature activation-1.29e-02

Pos logit contributions
beh+0.036
wood+0.035
keeping+0.035
 err+0.035
ried+0.034
Neg logit contributions
opus-0.024
 replied-0.020
uddle-0.019
imi-0.019
uffy-0.019
Tokenocol
Token ID4668
Feature activation-8.45e-02

Pos logit contributions
 err+0.023
keeping+0.023
wood+0.022
beh+0.022
 across+0.021
Neg logit contributions
opus-0.021
uffy-0.018
uddle-0.018
imi-0.017
ilo-0.017
Tokenates
Token ID689
Feature activation+1.14e-03

Pos logit contributions
 err+0.180
 even+0.162
 across+0.158
beh+0.153
 increasingly+0.149
Neg logit contributions
opus-0.120
uffy-0.101
ama-0.101
on-0.099
imi-0.096
Token.
Token ID13
Feature activation-4.37e-03

Pos logit contributions
opus+0.002
uffy+0.001
 replied+0.001
uddle+0.001
imi+0.001
Neg logit contributions
 err-0.002
beh-0.002
wood-0.002
keeping-0.002
 distances-0.002
Token Lily
Token ID20037
Feature activation+6.00e-03

Pos logit contributions
wood+0.007
 err+0.007
 across+0.007
keeping+0.007
 distances+0.007
Neg logit contributions
opus-0.008
uffy-0.007
uddle-0.006
imi-0.006
 replied-0.006
Token loved
Token ID6151
Feature activation-3.07e-03

Pos logit contributions
opus+0.011
uffy+0.009
uddle+0.008
imi+0.008
 replied+0.008
Neg logit contributions
 err-0.010
wood-0.010
keeping-0.010
beh-0.009
 across-0.009
Token ch
Token ID442
Feature activation+7.88e-03

Pos logit contributions
 err+0.005
beh+0.005
wood+0.005
keeping+0.005
 across+0.005
Neg logit contributions
opus-0.005
imi-0.004
 replied-0.004
uffy-0.004
ilo-0.004
Token took
Token ID1718
Feature activation-3.97e-02

Pos logit contributions
 err+0.019
wood+0.019
keeping+0.019
beh+0.018
 across+0.018
Neg logit contributions
opus-0.025
uffy-0.021
 replied-0.021
uddle-0.021
imi-0.021
Token Sara
Token ID24799
Feature activation-3.96e-02

Pos logit contributions
keeping+0.062
beh+0.062
wood+0.061
 distances+0.061
 err+0.061
Neg logit contributions
opus-0.066
uffy-0.060
 replied-0.058
imi-0.056
uddle-0.055
Token the
Token ID262
Feature activation-1.43e-02

Pos logit contributions
beh+0.056
keeping+0.055
 distances+0.055
wood+0.054
 err+0.054
Neg logit contributions
opus-0.074
uffy-0.069
 replied-0.065
ama-0.064
imi-0.062
Token 3
Token ID513
Feature activation-5.69e-02

Pos logit contributions
 err+0.029
wood+0.028
keeping+0.027
 across+0.027
beh+0.027
Neg logit contributions
opus-0.020
uffy-0.016
 replied-0.016
uddle-0.015
imi-0.015
Token-
Token ID12
Feature activation-6.02e-02

Pos logit contributions
wood+0.114
keeping+0.109
beh+0.108
 distances+0.106
 err+0.106
Neg logit contributions
opus-0.084
 replied-0.065
uffy-0.065
uddle-0.063
imi-0.062
Tokenyear
Token ID1941
Feature activation-8.41e-02

Pos logit contributions
wood+0.076
 distances+0.076
 err+0.071
 across+0.071
keeping+0.070
Neg logit contributions
opus-0.118
on-0.110
 replied-0.105
ama-0.102
pron-0.100
Token-
Token ID12
Feature activation-6.04e-02

Pos logit contributions
wood+0.160
keeping+0.159
 distances+0.157
 across+0.156
beh+0.154
Neg logit contributions
opus-0.123
ama-0.096
uddle-0.095
 replied-0.094
uffy-0.094
Tokenold
Token ID727
Feature activation-3.10e-02

Pos logit contributions
wood+0.120
 distances+0.117
 across+0.117
 err+0.115
keeping+0.112
Neg logit contributions
opus-0.079
on-0.066
 replied-0.062
ama-0.062
uffy-0.060
Token to
Token ID284
Feature activation-1.02e-02

Pos logit contributions
 err+0.063
wood+0.063
beh+0.062
 distances+0.062
keeping+0.062
Neg logit contributions
opus-0.039
 replied-0.032
uffy-0.031
uddle-0.029
imi-0.028
Token the
Token ID262
Feature activation+1.41e-02

Pos logit contributions
beh+0.016
wood+0.016
keeping+0.016
and+0.016
 distances+0.015
Neg logit contributions
 replied-0.015
opus-0.015
 mom-0.014
 Jake-0.014
 daddy-0.013
Token park
Token ID3952
Feature activation+4.61e-03

Pos logit contributions
opus+0.020
uffy+0.016
uddle+0.016
 replied+0.015
imi+0.015
Neg logit contributions
beh-0.028
keeping-0.028
 err-0.028
wood-0.027
 distances-0.027
Token She
Token ID1375
Feature activation-1.22e-03

Pos logit contributions
 err+0.038
wood+0.038
keeping+0.037
beh+0.037
 across+0.036
Neg logit contributions
opus-0.034
uffy-0.028
uddle-0.027
 replied-0.027
imi-0.026
Token told
Token ID1297
Feature activation-8.59e-03

Pos logit contributions
 err+0.002
wood+0.002
keeping+0.002
beh+0.002
 across+0.002
Neg logit contributions
opus-0.002
uffy-0.002
uddle-0.002
 replied-0.002
imi-0.002
Token her
Token ID607
Feature activation-1.45e-02

Pos logit contributions
 err+0.016
wood+0.015
keeping+0.015
beh+0.015
 across+0.015
Neg logit contributions
opus-0.014
uffy-0.012
 replied-0.011
imi-0.011
uddle-0.011
Token three
Token ID1115
Feature activation-4.86e-02

Pos logit contributions
 err+0.033
wood+0.032
keeping+0.032
 across+0.031
 even+0.031
Neg logit contributions
opus-0.018
uffy-0.012
uddle-0.012
ement-0.012
inkle-0.012
Token-
Token ID12
Feature activation-6.26e-02

Pos logit contributions
 err+0.087
wood+0.086
keeping+0.085
beh+0.084
 across+0.082
Neg logit contributions
opus-0.081
uffy-0.066
 replied-0.065
uddle-0.064
imi-0.063
Tokenyear
Token ID1941
Feature activation-8.37e-02

Pos logit contributions
wood+0.074
 err+0.073
keeping+0.069
 distances+0.069
 across+0.067
Neg logit contributions
opus-0.138
 replied-0.122
imi-0.117
uffy-0.117
ama-0.114
Token-
Token ID12
Feature activation-5.96e-02

Pos logit contributions
wood+0.157
keeping+0.154
 across+0.153
beh+0.151
 distances+0.151
Neg logit contributions
opus-0.129
 replied-0.101
ama-0.100
uffy-0.099
uddle-0.099
Tokenold
Token ID727
Feature activation-2.86e-02

Pos logit contributions
wood+0.115
 err+0.112
 across+0.110
 distances+0.109
keeping+0.109
Neg logit contributions
opus-0.089
 replied-0.071
uffy-0.068
imi-0.068
ama-0.067
Token daughter
Token ID4957
Feature activation-4.06e-02

Pos logit contributions
 err+0.055
wood+0.054
keeping+0.053
beh+0.053
 across+0.051
Neg logit contributions
opus-0.044
uffy-0.035
uddle-0.034
 replied-0.034
imi-0.033
Token to
Token ID284
Feature activation-1.09e-02

Pos logit contributions
 err+0.082
keeping+0.080
beh+0.080
wood+0.080
 distances+0.078
Neg logit contributions
opus-0.055
 replied-0.047
uffy-0.046
imi-0.043
ama-0.042
Token attach
Token ID10199
Feature activation-1.88e-02

Pos logit contributions
wood+0.016
beh+0.016
keeping+0.016
 distances+0.016
 err+0.016
Neg logit contributions
opus-0.019
 replied-0.018
uffy-0.016
ama-0.016
imi-0.016
Token took
Token ID1718
Feature activation-3.68e-02

Pos logit contributions
 err+0.011
keeping+0.011
wood+0.011
beh+0.011
 across+0.010
Neg logit contributions
opus-0.011
uffy-0.009
 replied-0.009
uddle-0.009
imi-0.009
Token her
Token ID607
Feature activation-2.73e-02

Pos logit contributions
keeping+0.053
wood+0.053
beh+0.053
 err+0.051
 distances+0.049
Neg logit contributions
opus-0.068
 replied-0.060
uffy-0.059
imi-0.058
ama-0.057
Token little
Token ID1310
Feature activation-2.58e-02

Pos logit contributions
beh+0.055
 err+0.055
keeping+0.055
wood+0.054
 distances+0.052
Neg logit contributions
opus-0.038
uffy-0.031
 replied-0.030
uddle-0.029
imi-0.029
Token three
Token ID1115
Feature activation-5.43e-02

Pos logit contributions
beh+0.057
 err+0.057
keeping+0.056
wood+0.055
 distances+0.055
Neg logit contributions
opus-0.031
uffy-0.024
 replied-0.023
uddle-0.022
imi-0.022
Token-
Token ID12
Feature activation-5.82e-02

Pos logit contributions
wood+0.102
keeping+0.101
beh+0.099
 err+0.095
 across+0.094
Neg logit contributions
opus-0.085
uffy-0.069
 replied-0.069
imi-0.068
uddle-0.067
Tokenyear
Token ID1941
Feature activation-8.36e-02

Pos logit contributions
 err+0.073
 distances+0.073
wood+0.072
keeping+0.067
 across+0.066
Neg logit contributions
opus-0.116
on-0.106
 replied-0.102
ama-0.098
pron-0.097
Token-
Token ID12
Feature activation-5.97e-02

Pos logit contributions
keeping+0.154
wood+0.153
beh+0.151
 err+0.149
 across+0.149
Neg logit contributions
opus-0.129
ama-0.101
 replied-0.100
uffy-0.099
uddle-0.098
Tokenold
Token ID727
Feature activation-2.99e-02

Pos logit contributions
wood+0.114
 err+0.113
 distances+0.113
 across+0.111
keeping+0.109
Neg logit contributions
opus-0.084
on-0.070
ama-0.066
 replied-0.065
uffy-0.062
Token daughter
Token ID4957
Feature activation-4.78e-02

Pos logit contributions
 err+0.063
beh+0.062
wood+0.062
keeping+0.061
 distances+0.060
Neg logit contributions
opus-0.037
 replied-0.029
uffy-0.028
imi-0.027
uddle-0.026
Token to
Token ID284
Feature activation-6.39e-03

Pos logit contributions
beh+0.078
keeping+0.073
 distances+0.073
 err+0.072
wood+0.067
Neg logit contributions
opus-0.075
 brother-0.070
 replied-0.070
 box-0.068
ama-0.068
Token a
Token ID257
Feature activation+1.63e-02

Pos logit contributions
beh+0.011
keeping+0.011
wood+0.011
 err+0.010
 distances+0.010
Neg logit contributions
opus-0.010
 replied-0.010
pron-0.009
 mom-0.008
imi-0.008
Token dad
Token ID9955
Feature activation-1.55e-02

Pos logit contributions
 distances+0.046
 err+0.045
beh+0.044
uries+0.044
keeping+0.044
Neg logit contributions
opus-0.025
 replied-0.024
 daddy-0.019
 mom-0.018
uffy-0.018
Token took
Token ID1718
Feature activation-3.06e-02

Pos logit contributions
 err+0.024
wood+0.024
keeping+0.023
 across+0.023
beh+0.022
Neg logit contributions
opus-0.029
uddle-0.024
uffy-0.024
imi-0.024
ement-0.023
Token their
Token ID511
Feature activation-1.86e-02

Pos logit contributions
beh+0.056
 err+0.056
wood+0.055
keeping+0.055
 distances+0.054
Neg logit contributions
opus-0.043
 replied-0.038
uffy-0.037
imi-0.035
ama-0.034
Token 3
Token ID513
Feature activation-5.18e-02

Pos logit contributions
 err+0.036
wood+0.034
keeping+0.034
 across+0.033
beh+0.033
Neg logit contributions
opus-0.030
uffy-0.024
uddle-0.023
 replied-0.023
imi-0.023
Token-
Token ID12
Feature activation-5.69e-02

Pos logit contributions
wood+0.105
keeping+0.099
beh+0.098
 across+0.096
 distances+0.095
Neg logit contributions
opus-0.075
 replied-0.060
uffy-0.057
uddle-0.057
imi-0.057
Tokenyear
Token ID1941
Feature activation-8.34e-02

Pos logit contributions
 distances+0.100
wood+0.099
 err+0.094
 across+0.094
keeping+0.092
Neg logit contributions
opus-0.083
on-0.076
 replied-0.072
ons-0.068
ma-0.067
Token-
Token ID12
Feature activation-5.83e-02

Pos logit contributions
keeping+0.165
wood+0.164
 across+0.160
beh+0.159
 distances+0.159
Neg logit contributions
opus-0.112
ama-0.086
uddle-0.086
 replied-0.085
ement-0.081
Tokenold
Token ID727
Feature activation-2.47e-02

Pos logit contributions
wood+0.124
 distances+0.122
 across+0.121
 err+0.119
keeping+0.117
Neg logit contributions
opus-0.067
on-0.055
 replied-0.052
ons-0.050
ama-0.048
Token Jimmy
Token ID12963
Feature activation-3.03e-02

Pos logit contributions
 err+0.051
beh+0.050
wood+0.050
keeping+0.049
 distances+0.049
Neg logit contributions
opus-0.032
 replied-0.026
uffy-0.026
uddle-0.024
imi-0.023
Token to
Token ID284
Feature activation-1.89e-03

Pos logit contributions
 err+0.046
beh+0.046
keeping+0.045
 distances+0.045
wood+0.043
Neg logit contributions
opus-0.051
 replied-0.048
uffy-0.046
ama-0.043
 brother-0.041
Token the
Token ID262
Feature activation+1.60e-02

Pos logit contributions
beh+0.004
wood+0.003
keeping+0.003
ried+0.003
and+0.003
Neg logit contributions
 replied-0.003
opus-0.002
 mom-0.002
 daddy-0.002
 Jake-0.002
Token twenty
Token ID8208
Feature activation-5.38e-02

Pos logit contributions
 err+0.028
wood+0.027
keeping+0.027
beh+0.027
 across+0.026
Neg logit contributions
opus-0.032
uffy-0.027
 replied-0.026
uddle-0.026
imi-0.026
Token-
Token ID12
Feature activation-8.02e-02

Pos logit contributions
 err+0.103
wood+0.102
keeping+0.102
beh+0.099
 across+0.096
Neg logit contributions
opus-0.083
uffy-0.066
 replied-0.065
uddle-0.064
imi-0.064
Tokeneight
Token ID26022
Feature activation-4.19e-02

Pos logit contributions
wood+0.128
 err+0.124
keeping+0.121
 across+0.108
 distances+0.106
Neg logit contributions
opus-0.149
 replied-0.131
ement-0.124
imi-0.122
on-0.121
Token,
Token ID11
Feature activation-2.76e-02

Pos logit contributions
 err+0.087
wood+0.085
keeping+0.084
beh+0.083
 across+0.081
Neg logit contributions
opus-0.058
uffy-0.045
uddle-0.043
 replied-0.043
imi-0.042
Token twenty
Token ID8208
Feature activation-6.08e-02

Pos logit contributions
 err+0.037
keeping+0.036
wood+0.036
beh+0.035
 across+0.033
Neg logit contributions
opus-0.059
 replied-0.051
uffy-0.050
imi-0.049
uddle-0.048
Token-
Token ID12
Feature activation-8.32e-02

Pos logit contributions
keeping+0.117
wood+0.116
 err+0.113
beh+0.109
 across+0.105
Neg logit contributions
opus-0.096
 replied-0.080
uffy-0.078
ement-0.077
imi-0.077
Tokennine
Token ID30888
Feature activation-4.25e-02

Pos logit contributions
wood+0.143
 err+0.140
keeping+0.137
beh+0.123
 across+0.121
Neg logit contributions
opus-0.140
 replied-0.123
on-0.119
ement-0.116
imi-0.116
Token,
Token ID11
Feature activation-2.81e-02

Pos logit contributions
 err+0.091
wood+0.089
keeping+0.089
beh+0.088
 across+0.085
Neg logit contributions
opus-0.055
uffy-0.043
 replied-0.041
uddle-0.040
imi-0.040
Token thirty
Token ID12277
Feature activation-6.15e-02

Pos logit contributions
keeping+0.039
 err+0.039
beh+0.038
wood+0.038
 across+0.034
Neg logit contributions
opus-0.059
 replied-0.051
uffy-0.049
imi-0.049
ama-0.047
Token,
Token ID11
Feature activation-2.71e-02

Pos logit contributions
keeping+0.116
 err+0.114
wood+0.114
beh+0.109
 across+0.106
Neg logit contributions
opus-0.098
 replied-0.082
uffy-0.080
ement-0.079
imi-0.078
Token thirty
Token ID12277
Feature activation-6.16e-02

Pos logit contributions
 err+0.037
keeping+0.036
beh+0.035
wood+0.035
 across+0.032
Neg logit contributions
opus-0.058
 replied-0.050
uffy-0.049
imi-0.048
ama-0.047
Token the
Token ID262
Feature activation+2.08e-02

Pos logit contributions
beh+0.038
wood+0.035
 err+0.034
keeping+0.034
uries+0.033
Neg logit contributions
opus-0.030
 replied-0.029
imi-0.027
uffy-0.027
 box-0.026
Token sky
Token ID6766
Feature activation+2.79e-02

Pos logit contributions
opus+0.032
uffy+0.025
uddle+0.025
 replied+0.025
imi+0.024
Neg logit contributions
beh-0.040
 err-0.039
keeping-0.039
wood-0.039
 distances-0.037
Token before
Token ID878
Feature activation-7.04e-03

Pos logit contributions
opus+0.041
 replied+0.035
uffy+0.033
uddle+0.032
imi+0.031
Neg logit contributions
 err-0.055
beh-0.054
keeping-0.053
wood-0.052
 distances-0.049
Token they
Token ID484
Feature activation-1.35e-02

Pos logit contributions
 err+0.012
beh+0.012
wood+0.012
keeping+0.012
 across+0.011
Neg logit contributions
opus-0.012
 replied-0.010
uffy-0.010
imi-0.010
uddle-0.010
Token walked
Token ID6807
Feature activation-3.38e-02

Pos logit contributions
 err+0.021
wood+0.021
keeping+0.020
beh+0.020
 across+0.020
Neg logit contributions
opus-0.026
uffy-0.021
uddle-0.021
imi-0.021
 replied-0.021
Token fore
Token ID1674
Feature activation-8.31e-02

Pos logit contributions
beh+0.050
 err+0.047
keeping+0.045
wood+0.044
 distances+0.043
Neg logit contributions
opus-0.063
 replied-0.061
uddle-0.056
uffy-0.055
ement-0.053
Tokenshadow
Token ID19106
Feature activation-2.31e-02

Pos logit contributions
keeping+0.113
wood+0.106
gent+0.105
 err+0.103
 distances+0.099
Neg logit contributions
opus-0.154
 replied-0.143
uddle-0.135
uffy-0.130
ama-0.127
Tokened
Token ID276
Feature activation-1.44e-02

Pos logit contributions
 err+0.033
wood+0.032
keeping+0.031
beh+0.031
 distances+0.030
Neg logit contributions
opus-0.047
uffy-0.040
 replied-0.039
uddle-0.039
imi-0.038
Token the
Token ID262
Feature activation-1.33e-02

Pos logit contributions
beh+0.023
 err+0.023
wood+0.022
keeping+0.022
 distances+0.021
Neg logit contributions
opus-0.025
 replied-0.022
uffy-0.021
uddle-0.021
ement-0.020
Token down
Token ID866
Feature activation-1.17e-02

Pos logit contributions
 err+0.024
beh+0.024
keeping+0.024
wood+0.024
 distances+0.023
Neg logit contributions
opus-0.021
 replied-0.017
uffy-0.017
uddle-0.017
imi-0.016
Tokenpour
Token ID48681
Feature activation-3.34e-02

Pos logit contributions
 err+0.020
beh+0.019
keeping+0.019
wood+0.019
 distances+0.018
Neg logit contributions
opus-0.020
 replied-0.017
uffy-0.017
uddle-0.016
ama-0.016
Token,
Token ID11
Feature activation+3.69e-03

Pos logit contributions
 err+0.012
beh+0.012
wood+0.012
keeping+0.012
 distances+0.012
Neg logit contributions
opus-0.011
uffy-0.009
 replied-0.009
uddle-0.008
imi-0.008
Token but
Token ID475
Feature activation-7.98e-03

Pos logit contributions
opus+0.006
 replied+0.006
uffy+0.005
imi+0.005
uddle+0.005
Neg logit contributions
 err-0.006
beh-0.006
wood-0.006
keeping-0.006
 distances-0.005
Token it
Token ID340
Feature activation+2.38e-02

Pos logit contributions
 err+0.018
wood+0.018
beh+0.018
keeping+0.017
 distances+0.017
Neg logit contributions
opus-0.009
 replied-0.008
uffy-0.007
ama-0.007
imi-0.007
Token was
Token ID373
Feature activation-4.52e-03

Pos logit contributions
opus+0.035
 replied+0.028
uffy+0.028
imi+0.026
uddle+0.026
Neg logit contributions
 err-0.048
keeping-0.047
wood-0.046
beh-0.045
 distances-0.044
Token all
Token ID477
Feature activation-6.25e-03

Pos logit contributions
keeping+0.009
wood+0.008
 err+0.008
beh+0.008
and+0.008
Neg logit contributions
 replied-0.006
opus-0.006
 sincere-0.006
pron-0.006
imi-0.005
Token fore
Token ID1674
Feature activation-8.31e-02

Pos logit contributions
 err+0.010
keeping+0.010
wood+0.010
 distances+0.010
beh+0.009
Neg logit contributions
opus-0.010
 replied-0.009
uffy-0.009
uddle-0.008
imi-0.008
Tokenshadow
Token ID19106
Feature activation-2.66e-02

Pos logit contributions
keeping+0.109
wood+0.104
 err+0.097
 across+0.096
 distances+0.092
Neg logit contributions
opus-0.171
 replied-0.147
uffy-0.145
uddle-0.143
ama-0.141
Tokened
Token ID276
Feature activation-1.45e-02

Pos logit contributions
 err+0.051
keeping+0.050
wood+0.050
beh+0.049
 distances+0.047
Neg logit contributions
opus-0.041
uffy-0.033
 replied-0.032
uddle-0.031
imi-0.031
Token by
Token ID416
Feature activation+4.02e-04

Pos logit contributions
 err+0.022
beh+0.021
keeping+0.021
wood+0.021
 distances+0.020
Neg logit contributions
opus-0.026
 replied-0.023
uffy-0.023
uddle-0.022
imi-0.022
Token the
Token ID262
Feature activation+7.97e-03

Pos logit contributions
opus+0.001
 replied+0.001
uffy+0.001
imi+0.001
ama+0.001
Neg logit contributions
 err-0.001
beh-0.001
wood-0.001
keeping-0.001
 distances-0.001
Token kind
Token ID1611
Feature activation-3.54e-02

Pos logit contributions
opus+0.013
uffy+0.011
uddle+0.010
 replied+0.010
imi+0.010
Neg logit contributions
 err-0.014
wood-0.014
keeping-0.014
beh-0.014
 distances-0.013
Token back
Token ID736
Feature activation-4.27e-02

Pos logit contributions
beh+0.027
 err+0.026
ried+0.026
keeping+0.026
 distances+0.025
Neg logit contributions
opus-0.022
 replied-0.021
ement-0.020
uddle-0.020
 mom-0.019
Token the
Token ID262
Feature activation-5.99e-03

Pos logit contributions
beh+0.078
uries+0.077
 distances+0.077
ern+0.071
 err+0.071
Neg logit contributions
 mom-0.051
 box-0.048
 grandma-0.044
 daddy-0.043
uffy-0.043
Token village
Token ID7404
Feature activation-4.09e-02

Pos logit contributions
wood+0.011
beh+0.011
 err+0.011
keeping+0.011
 distances+0.011
Neg logit contributions
opus-0.009
uddle-0.008
uffy-0.007
 replied-0.007
 box-0.007
Token to
Token ID284
Feature activation+1.46e-02

Pos logit contributions
beh+0.070
 err+0.069
 distances+0.065
wood+0.065
gent+0.065
Neg logit contributions
opus-0.056
 box-0.053
 replied-0.051
uddle-0.050
ement-0.048
Token its
Token ID663
Feature activation+2.45e-03

Pos logit contributions
opus+0.021
 replied+0.020
inkle+0.018
 sincere+0.018
 mom+0.018
Neg logit contributions
beh-0.027
wood-0.025
 err-0.025
ried-0.025
keeping-0.025
Token former
Token ID1966
Feature activation-8.29e-02

Pos logit contributions
opus+0.004
uddle+0.004
uffy+0.004
 replied+0.003
ama+0.003
Neg logit contributions
beh-0.004
keeping-0.004
wood-0.004
gent-0.004
 err-0.004
Token state
Token ID1181
Feature activation-6.46e-03

Pos logit contributions
beh+0.162
Circ+0.145
cr+0.145
keeping+0.144
wood+0.142
Neg logit contributions
inkle-0.114
ement-0.106
uddle-0.105
 replied-0.100
opus-0.098
Token.
Token ID13
Feature activation-8.10e-03

Pos logit contributions
beh+0.014
 err+0.014
wood+0.013
keeping+0.013
 distances+0.012
Neg logit contributions
opus-0.008
ement-0.006
 replied-0.006
uddle-0.006
uffy-0.006
Token With
Token ID2080
Feature activation-3.92e-02

Pos logit contributions
 err+0.017
beh+0.016
wood+0.015
keeping+0.015
 distances+0.015
Neg logit contributions
opus-0.012
uffy-0.010
 replied-0.010
ama-0.009
imi-0.009
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
keeping+0.074
beh+0.073
wood+0.068
ried+0.068
and+0.067
Neg logit contributions
 replied-0.059
opus-0.056
 mom-0.050
ilo-0.050
ement-0.049
Token help
Token ID1037
Feature activation-2.63e-02

Pos logit contributions
 err+0.021
keeping+0.020
wood+0.020
beh+0.020
 distances+0.019
Neg logit contributions
opus-0.022
 replied-0.018
uffy-0.017
uddle-0.017
ama-0.017
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
beh+0.023
keeping+0.022
 err+0.022
wood+0.022
gent+0.020
Neg logit contributions
 replied-0.017
opus-0.017
 dear-0.014
 says-0.014
 fork-0.014
Token the
Token ID262
Feature activation-1.00e-02

Pos logit contributions
 err+0.024
beh+0.024
keeping+0.023
wood+0.022
and+0.022
Neg logit contributions
 replied-0.019
opus-0.018
 mom-0.015
 says-0.015
 Jake-0.014
Token frog
Token ID21264
Feature activation+2.52e-03

Pos logit contributions
 err+0.020
beh+0.019
keeping+0.019
wood+0.019
 distances+0.018
Neg logit contributions
opus-0.015
 replied-0.012
uddle-0.012
uffy-0.012
imi-0.011
Token suggested
Token ID5220
Feature activation-1.26e-02

Pos logit contributions
opus+0.004
 replied+0.003
uffy+0.003
uddle+0.003
imi+0.003
Neg logit contributions
 err-0.005
keeping-0.005
wood-0.005
beh-0.005
 distances-0.004
Token catching
Token ID16508
Feature activation-5.16e-03

Pos logit contributions
 err+0.023
wood+0.023
keeping+0.023
beh+0.023
 distances+0.022
Neg logit contributions
opus-0.020
 replied-0.017
uffy-0.016
uddle-0.016
imi-0.015
Token tad
Token ID36264
Feature activation-8.26e-02

Pos logit contributions
keeping+0.009
uries+0.009
 err+0.009
beh+0.009
gent+0.009
Neg logit contributions
opus-0.007
 replied-0.007
ls-0.007
uddle-0.006
 says-0.006
Tokenp
Token ID79
Feature activation-2.45e-02

Pos logit contributions
keeping+0.136
 err+0.136
wood+0.123
Ice+0.114
Circ+0.109
Neg logit contributions
opus-0.118
 replied-0.106
ia-0.105
inkle-0.100
isy-0.096
Tokenoles
Token ID4316
Feature activation-1.98e-02

Pos logit contributions
 err+0.042
keeping+0.041
beh+0.039
wood+0.039
gent+0.037
Neg logit contributions
opus-0.042
uffy-0.036
uddle-0.036
 replied-0.035
imi-0.034
Token in
Token ID287
Feature activation-3.74e-03

Pos logit contributions
 err+0.038
keeping+0.037
beh+0.036
gent+0.035
wood+0.035
Neg logit contributions
opus-0.028
 replied-0.024
uddle-0.024
uffy-0.023
imi-0.022
Token a
Token ID257
Feature activation+2.55e-02

Pos logit contributions
beh+0.008
 err+0.007
wood+0.007
keeping+0.007
seek+0.007
Neg logit contributions
 replied-0.004
opus-0.004
 box-0.004
 mom-0.004
 sincere-0.004
Token nearby
Token ID6716
Feature activation+1.60e-02

Pos logit contributions
opus+0.042
uddle+0.034
uffy+0.034
 replied+0.034
imi+0.032
Neg logit contributions
keeping-0.045
 err-0.045
wood-0.045
beh-0.045
 distances-0.042
Token something
Token ID1223
Feature activation+5.42e-05

Pos logit contributions
opus+0.026
uffy+0.022
 replied+0.022
uddle+0.021
imi+0.021
Neg logit contributions
 err-0.024
wood-0.023
keeping-0.023
beh-0.023
 across-0.022
Token new
Token ID649
Feature activation-2.58e-02

Pos logit contributions
opus+0.000
 replied+0.000
uddle+0.000
uffy+0.000
imi+0.000
Neg logit contributions
keeping-0.000
wood-0.000
 err-0.000
beh-0.000
 distances-0.000
Token-
Token ID12
Feature activation-5.14e-02

Pos logit contributions
 err+0.050
beh+0.049
wood+0.047
keeping+0.046
uries+0.045
Neg logit contributions
opus-0.038
 replied-0.032
uddle-0.031
uffy-0.030
ement-0.030
Token there
Token ID612
Feature activation+2.53e-02

Pos logit contributions
 err+0.088
wood+0.084
keeping+0.082
 distances+0.079
beh+0.077
Neg logit contributions
opus-0.092
 replied-0.075
uffy-0.071
imi-0.068
uddle-0.068
Token were
Token ID547
Feature activation-1.94e-02

Pos logit contributions
opus+0.034
 replied+0.030
uffy+0.027
imi+0.026
ement+0.026
Neg logit contributions
 err-0.053
beh-0.052
wood-0.050
keeping-0.050
 distances-0.047
Token tad
Token ID36264
Feature activation-8.26e-02

Pos logit contributions
 err+0.030
beh+0.029
keeping+0.029
wood+0.029
gent+0.026
Neg logit contributions
opus-0.036
 replied-0.031
uddle-0.030
uffy-0.030
imi-0.029
Tokenp
Token ID79
Feature activation-1.34e-02

Pos logit contributions
keeping+0.135
 err+0.128
wood+0.123
Circ+0.112
 hikes+0.106
Neg logit contributions
opus-0.132
ement-0.110
 replied-0.108
uddle-0.107
inkle-0.104
Tokenoles
Token ID4316
Feature activation+6.81e-03

Pos logit contributions
 err+0.025
wood+0.024
keeping+0.024
beh+0.024
 across+0.023
Neg logit contributions
opus-0.022
uffy-0.017
uddle-0.017
 replied-0.017
imi-0.017
Token swimming
Token ID14899
Feature activation-1.30e-02

Pos logit contributions
opus+0.012
uffy+0.009
 replied+0.009
uddle+0.009
imi+0.009
Neg logit contributions
 err-0.012
wood-0.012
keeping-0.011
beh-0.011
 distances-0.011
Token around
Token ID1088
Feature activation+1.21e-02

Pos logit contributions
beh+0.019
 err+0.019
keeping+0.019
 distances+0.018
uries+0.017
Neg logit contributions
opus-0.023
uddle-0.021
 replied-0.020
uffy-0.019
 box-0.018
Token in
Token ID287
Feature activation-1.31e-03

Pos logit contributions
opus+0.016
 replied+0.014
imi+0.012
uffy+0.012
ement+0.012
Neg logit contributions
beh-0.025
 err-0.024
wood-0.024
keeping-0.023
uries-0.022
Token and
Token ID290
Feature activation+2.21e-02

Pos logit contributions
opus+0.019
uffy+0.016
uddle+0.015
 replied+0.015
imi+0.015
Neg logit contributions
 err-0.027
wood-0.026
beh-0.026
keeping-0.026
 distances-0.025
Token broke
Token ID6265
Feature activation+1.81e-02

Pos logit contributions
opus+0.038
uffy+0.030
uddle+0.030
imi+0.029
 replied+0.029
Neg logit contributions
 err-0.040
keeping-0.038
wood-0.038
 across-0.038
beh-0.037
Token.
Token ID13
Feature activation+4.83e-04

Pos logit contributions
opus+0.033
uffy+0.029
uddle+0.028
imi+0.028
 replied+0.027
Neg logit contributions
 err-0.029
beh-0.027
wood-0.027
keeping-0.026
 distances-0.025
Token This
Token ID770
Feature activation+1.17e-02

Pos logit contributions
opus+0.001
uffy+0.001
uddle+0.001
ement+0.001
imi+0.001
Neg logit contributions
 across-0.001
 err-0.001
keeping-0.001
wood-0.001
beh-0.001
Token made
Token ID925
Feature activation+1.39e-02

Pos logit contributions
opus+0.020
uffy+0.016
uddle+0.016
 replied+0.016
imi+0.016
Neg logit contributions
 err-0.021
wood-0.020
keeping-0.020
beh-0.019
 across-0.019
Token the
Token ID262
Feature activation+3.45e-02

Pos logit contributions
opus+0.029
uffy+0.024
 replied+0.024
uddle+0.024
imi+0.024
Neg logit contributions
 err-0.019
wood-0.019
keeping-0.019
beh-0.018
 across-0.018
Token little
Token ID1310
Feature activation+3.56e-03

Pos logit contributions
opus+0.045
uffy+0.035
 replied+0.033
imi+0.033
uddle+0.032
Neg logit contributions
 err-0.075
keeping-0.072
wood-0.072
 across-0.070
beh-0.069
Token girl
Token ID2576
Feature activation+1.14e-02

Pos logit contributions
opus+0.004
uffy+0.003
 replied+0.003
uddle+0.003
imi+0.003
Neg logit contributions
 err-0.008
wood-0.008
beh-0.008
keeping-0.008
 distances-0.007
Token very
Token ID845
Feature activation+1.35e-03

Pos logit contributions
opus+0.022
uffy+0.018
 replied+0.018
uddle+0.018
imi+0.018
Neg logit contributions
 err-0.017
wood-0.017
keeping-0.017
beh-0.016
 across-0.016
Token sad
Token ID6507
Feature activation+1.29e-02

Pos logit contributions
opus+0.002
uffy+0.002
imi+0.002
 replied+0.002
uddle+0.002
Neg logit contributions
 err-0.003
beh-0.003
keeping-0.003
wood-0.003
 across-0.003
Token because
Token ID780
Feature activation+8.73e-04

Pos logit contributions
opus+0.021
uffy+0.017
 replied+0.017
uddle+0.017
imi+0.017
Neg logit contributions
 err-0.023
wood-0.023
beh-0.023
keeping-0.023
 distances-0.022
Token Can
Token ID1680
Feature activation+1.74e-02

Pos logit contributions
opus+0.028
uffy+0.023
uddle+0.022
imi+0.022
 replied+0.022
Neg logit contributions
 err-0.029
wood-0.029
keeping-0.028
beh-0.028
 across-0.027
Token you
Token ID345
Feature activation-4.39e-03

Pos logit contributions
opus+0.029
uddle+0.025
uffy+0.024
imi+0.023
ement+0.022
Neg logit contributions
 err-0.029
beh-0.028
wood-0.028
keeping-0.028
 even-0.028
Token please
Token ID3387
Feature activation+1.58e-02

Pos logit contributions
 err+0.007
wood+0.006
beh+0.006
 across+0.006
keeping+0.006
Neg logit contributions
opus-0.009
uffy-0.007
uddle-0.007
imi-0.007
 replied-0.007
Token forgive
Token ID20927
Feature activation+4.87e-02

Pos logit contributions
opus+0.034
uffy+0.028
uddle+0.028
imi+0.027
 replied+0.027
Neg logit contributions
 err-0.021
wood-0.021
keeping-0.020
beh-0.020
 across-0.019
Token us
Token ID514
Feature activation+1.13e-02

Pos logit contributions
opus+0.084
uffy+0.070
uddle+0.068
 replied+0.064
imi+0.064
Neg logit contributions
 err-0.083
wood-0.082
keeping-0.081
 across-0.079
beh-0.078
Token?"
Token ID1701
Feature activation+5.33e-02

Pos logit contributions
opus+0.010
uffy+0.007
 replied+0.006
uddle+0.006
imi+0.006
Neg logit contributions
 err-0.029
keeping-0.029
wood-0.029
beh-0.028
 across-0.027
Token they
Token ID484
Feature activation+1.99e-02

Pos logit contributions
opus+0.114
uffy+0.103
uddle+0.099
imi+0.094
pron+0.093
Neg logit contributions
 even-0.074
 across-0.074
 err-0.071
 themselves-0.069
wood-0.068
Token say
Token ID910
Feature activation+1.73e-02

Pos logit contributions
opus+0.038
uffy+0.033
uddle+0.032
pron+0.030
ama+0.030
Neg logit contributions
 err-0.032
 even-0.031
wood-0.031
 across-0.030
 distances-0.030
Token together
Token ID1978
Feature activation-1.97e-02

Pos logit contributions
opus+0.032
uffy+0.027
uddle+0.026
imi+0.026
 replied+0.024
Neg logit contributions
 err-0.028
keeping-0.028
wood-0.027
 across-0.027
beh-0.027
Token.
Token ID13
Feature activation-2.26e-04

Pos logit contributions
 err+0.029
 across+0.028
 even+0.026
 along+0.026
wood+0.026
Neg logit contributions
opus-0.042
uffy-0.035
imi-0.033
uddle-0.033
ama-0.033
Token
Token ID198
Feature activation+3.07e-03

Pos logit contributions
 across+0.000
wood+0.000
keeping+0.000
 err+0.000
 themselves+0.000
Neg logit contributions
opus-0.000
uffy-0.000
uddle-0.000
imi-0.000
pron-0.000
Token bird
Token ID6512
Feature activation+1.51e-02

Pos logit contributions
opus+0.033
ement+0.030
uffy+0.029
 replied+0.029
imi+0.029
Neg logit contributions
 err-0.035
 across-0.034
 even-0.033
wood-0.031
 quite-0.029
Token flying
Token ID7348
Feature activation-3.91e-03

Pos logit contributions
opus+0.032
uffy+0.027
 replied+0.026
uddle+0.026
imi+0.026
Neg logit contributions
 err-0.021
wood-0.020
keeping-0.020
beh-0.019
 across-0.019
Token high
Token ID1029
Feature activation-2.66e-02

Pos logit contributions
keeping+0.005
 err+0.005
wood+0.005
beh+0.005
 distances+0.005
Neg logit contributions
opus-0.008
 replied-0.007
uffy-0.007
uddle-0.007
imi-0.006
Token in
Token ID287
Feature activation+4.61e-03

Pos logit contributions
beh+0.040
keeping+0.039
 err+0.039
uries+0.038
wood+0.037
Neg logit contributions
opus-0.050
uddle-0.042
 replied-0.042
uffy-0.041
imi-0.040
Token the
Token ID262
Feature activation+4.98e-02

Pos logit contributions
opus+0.006
imi+0.006
 replied+0.006
 ow+0.006
ara+0.005
Neg logit contributions
beh-0.009
 err-0.009
wood-0.008
uries-0.008
ried-0.008
Token sky
Token ID6766
Feature activation+4.33e-02

Pos logit contributions
opus+0.078
uffy+0.063
 replied+0.060
uddle+0.059
imi+0.059
Neg logit contributions
 err-0.094
wood-0.092
keeping-0.091
 across-0.091
beh-0.089
Token.
Token ID13
Feature activation+2.66e-03

Pos logit contributions
opus+0.058
 replied+0.046
uffy+0.045
uddle+0.044
imi+0.043
Neg logit contributions
 err-0.091
beh-0.088
wood-0.088
keeping-0.087
 distances-0.083
Token She
Token ID1375
Feature activation+1.49e-02

Pos logit contributions
opus+0.006
uffy+0.005
ement+0.005
imi+0.005
inkle+0.005
Neg logit contributions
 across-0.004
 err-0.003
keeping-0.003
wood-0.003
 even-0.003
Token wanted
Token ID2227
Feature activation+3.23e-03

Pos logit contributions
opus+0.028
uffy+0.024
uddle+0.024
ement+0.023
imi+0.023
Neg logit contributions
 err-0.022
 across-0.022
 even-0.021
beh-0.021
wood-0.021
Token to
Token ID284
Feature activation+8.37e-03

Pos logit contributions
opus+0.005
 replied+0.005
 mom+0.005
imi+0.005
on+0.004
Neg logit contributions
beh-0.005
 err-0.005
keeping-0.005
wood-0.005
ried-0.005
Token fly
Token ID6129
Feature activation+1.79e-03

Pos logit contributions
opus+0.016
 replied+0.014
imi+0.013
uddle+0.012
uffy+0.012
Neg logit contributions
 err-0.013
wood-0.013
keeping-0.013
beh-0.012
 distances-0.012
Token a
Token ID257
Feature activation+1.42e-02

Pos logit contributions
opus+0.022
 replied+0.019
uffy+0.018
uddle+0.018
imi+0.018
Neg logit contributions
beh-0.017
 err-0.017
keeping-0.017
wood-0.017
 distances-0.016
Token little
Token ID1310
Feature activation-5.58e-03

Pos logit contributions
opus+0.022
uffy+0.019
imi+0.018
 replied+0.018
ement+0.017
Neg logit contributions
 err-0.026
wood-0.025
 across-0.025
keeping-0.024
 even-0.023
Token bird
Token ID6512
Feature activation+1.50e-02

Pos logit contributions
beh+0.012
 err+0.012
keeping+0.012
wood+0.011
 distances+0.011
Neg logit contributions
opus-0.007
uddle-0.006
 replied-0.006
uffy-0.006
imi-0.005
Token in
Token ID287
Feature activation-4.28e-04

Pos logit contributions
opus+0.029
uffy+0.024
 replied+0.024
uddle+0.024
imi+0.024
Neg logit contributions
 err-0.023
wood-0.022
keeping-0.022
beh-0.022
 across-0.021
Token the
Token ID262
Feature activation+3.00e-02

Pos logit contributions
beh+0.001
 err+0.001
wood+0.001
keeping+0.001
ried+0.001
Neg logit contributions
opus-0.001
 replied-0.001
imi-0.001
uffy-0.001
ilo-0.001
Token water
Token ID1660
Feature activation+3.69e-02

Pos logit contributions
opus+0.058
uffy+0.049
 replied+0.047
uddle+0.047
imi+0.047
Neg logit contributions
 err-0.046
wood-0.045
keeping-0.044
beh-0.043
 across-0.043
Token.
Token ID13
Feature activation-3.14e-03

Pos logit contributions
opus+0.062
uffy+0.051
uddle+0.049
imi+0.049
 replied+0.049
Neg logit contributions
 err-0.065
wood-0.064
keeping-0.063
beh-0.061
 across-0.061
Token It
Token ID632
Feature activation+2.25e-02

Pos logit contributions
wood+0.005
 err+0.005
keeping+0.005
 across+0.005
beh+0.004
Neg logit contributions
opus-0.006
uffy-0.005
uddle-0.005
imi-0.005
 replied-0.005
Token was
Token ID373
Feature activation-9.70e-03

Pos logit contributions
opus+0.034
 replied+0.028
uffy+0.027
uddle+0.026
imi+0.026
Neg logit contributions
 err-0.044
keeping-0.043
wood-0.042
beh-0.042
 distances-0.040
Token a
Token ID257
Feature activation+1.10e-02

Pos logit contributions
keeping+0.016
beh+0.015
 err+0.015
wood+0.014
and+0.014
Neg logit contributions
opus-0.017
 replied-0.016
imi-0.015
 ow-0.014
uffy-0.014
Token sour
Token ID11348
Feature activation+8.09e-03

Pos logit contributions
opus+0.018
uffy+0.015
imi+0.014
 replied+0.014
uddle+0.014
Neg logit contributions
 err-0.020
wood-0.019
 across-0.018
keeping-0.018
beh-0.018
Token some
Token ID617
Feature activation-1.96e-02

Pos logit contributions
opus+0.008
 replied+0.006
uffy+0.006
imi+0.006
uddle+0.006
Neg logit contributions
 err-0.005
keeping-0.005
beh-0.005
wood-0.005
 distances-0.004
Token food
Token ID2057
Feature activation-3.46e-03

Pos logit contributions
keeping+0.034
beh+0.033
wood+0.032
 err+0.032
ried+0.031
Neg logit contributions
opus-0.033
 replied-0.028
uddle-0.028
imi-0.028
uffy-0.027
Token to
Token ID284
Feature activation+8.30e-03

Pos logit contributions
beh+0.006
 err+0.005
keeping+0.005
wood+0.005
 distances+0.005
Neg logit contributions
opus-0.006
uffy-0.005
 replied-0.005
uddle-0.005
 box-0.005
Token feed
Token ID3745
Feature activation+2.78e-02

Pos logit contributions
opus+0.014
 replied+0.012
imi+0.012
uffy+0.012
uddle+0.011
Neg logit contributions
beh-0.014
wood-0.014
 err-0.014
keeping-0.014
 across-0.013
Token my
Token ID616
Feature activation-2.27e-03

Pos logit contributions
opus+0.036
 replied+0.030
uffy+0.029
imi+0.028
uddle+0.027
Neg logit contributions
beh-0.057
 err-0.056
keeping-0.054
 distances-0.053
uries-0.052
Token tum
Token ID11814
Feature activation+3.88e-02

Pos logit contributions
 err+0.004
beh+0.004
wood+0.004
keeping+0.004
 across+0.004
Neg logit contributions
opus-0.004
uffy-0.003
uddle-0.003
 replied-0.003
imi-0.003
Tokenmy
Token ID1820
Feature activation+1.38e-02

Pos logit contributions
opus+0.062
uffy+0.051
imi+0.049
 replied+0.049
uddle+0.049
Neg logit contributions
 err-0.073
wood-0.070
keeping-0.070
beh-0.069
 across-0.067
Token."
Token ID526
Feature activation+2.48e-02

Pos logit contributions
opus+0.017
uffy+0.014
 replied+0.014
uddle+0.013
 ow+0.013
Neg logit contributions
 err-0.030
beh-0.028
keeping-0.028
wood-0.027
 distances-0.026
Token
Token ID220
Feature activation+1.87e-02

Pos logit contributions
opus+0.047
uffy+0.038
uddle+0.037
imi+0.037
ement+0.035
Neg logit contributions
 err-0.039
wood-0.038
keeping-0.038
beh-0.037
 across-0.037
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
opus+0.038
uddle+0.031
uffy+0.030
pron+0.030
 replied+0.030
Neg logit contributions
 err-0.026
wood-0.026
 across-0.025
keeping-0.025
beh-0.025
Token
Token ID198
Feature activation-8.65e-03

Pos logit contributions
 err+0.017
wood+0.016
keeping+0.016
 across+0.016
beh+0.016
Neg logit contributions
opus-0.021
uffy-0.017
uddle-0.017
imi-0.017
 replied-0.017
Token They
Token ID1119
Feature activation+1.17e-02

Pos logit contributions
 err+0.016
beh+0.015
keeping+0.015
wood+0.015
 distances+0.014
Neg logit contributions
opus-0.022
uffy-0.019
 replied-0.019
imi-0.019
uddle-0.019
Token have
Token ID423
Feature activation-3.05e-02

Pos logit contributions
opus+0.022
uffy+0.019
 replied+0.018
uddle+0.018
imi+0.018
Neg logit contributions
 err-0.018
wood-0.017
keeping-0.017
beh-0.017
 across-0.016
Token an
Token ID281
Feature activation-1.09e-02

Pos logit contributions
 err+0.043
beh+0.042
wood+0.042
keeping+0.042
ried+0.038
Neg logit contributions
opus-0.060
 replied-0.053
uffy-0.051
imi-0.050
 sincere-0.050
Token idea
Token ID2126
Feature activation+2.41e-03

Pos logit contributions
keeping+0.022
wood+0.022
beh+0.022
 err+0.021
 across+0.021
Neg logit contributions
opus-0.015
 replied-0.013
imi-0.012
ement-0.012
uffy-0.011
Token.
Token ID13
Feature activation-1.76e-02

Pos logit contributions
opus+0.004
uffy+0.004
uddle+0.003
imi+0.003
 replied+0.003
Neg logit contributions
 err-0.004
keeping-0.004
wood-0.004
 across-0.004
beh-0.004
Token
Token ID198
Feature activation-2.31e-03

Pos logit contributions
 err+0.030
beh+0.030
keeping+0.028
wood+0.028
 distances+0.026
Neg logit contributions
opus-0.030
 replied-0.027
uffy-0.026
imi-0.025
uddle-0.025
Token
Token ID198
Feature activation-2.81e-03

Pos logit contributions
 err+0.004
keeping+0.004
wood+0.004
 across+0.003
beh+0.003
Neg logit contributions
opus-0.004
uffy-0.004
uddle-0.003
 replied-0.003
imi-0.003
Token"
Token ID1
Feature activation-3.70e-02

Pos logit contributions
 err+0.006
wood+0.006
keeping+0.006
beh+0.006
 distances+0.005
Neg logit contributions
opus-0.004
uffy-0.003
 replied-0.003
uddle-0.003
imi-0.003
TokenOkay
Token ID16454
Feature activation+3.34e-03

Pos logit contributions
 err+0.065
wood+0.061
keeping+0.061
beh+0.061
 distances+0.057
Neg logit contributions
opus-0.065
 replied-0.053
ara-0.052
pron-0.051
uffy-0.050
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
opus+0.006
uddle+0.005
uffy+0.005
imi+0.005
ement+0.004
Neg logit contributions
 err-0.006
wood-0.006
 across-0.005
 even-0.005
keeping-0.005
Token we
Token ID356
Feature activation-2.73e-02

Pos logit contributions
opus+0.016
uffy+0.013
uddle+0.013
imi+0.012
 replied+0.012
Neg logit contributions
 even-0.019
 err-0.019
wood-0.019
 across-0.018
 distances-0.018
Token girl
Token ID2576
Feature activation+2.07e-03

Pos logit contributions
 err+0.000
wood+0.000
keeping+0.000
beh+0.000
 across+0.000
Neg logit contributions
opus-0.000
uffy-0.000
 replied-0.000
uddle-0.000
imi-0.000
Token so
Token ID523
Feature activation-7.77e-03

Pos logit contributions
opus+0.004
 replied+0.003
uffy+0.003
uddle+0.003
imi+0.003
Neg logit contributions
 err-0.003
wood-0.003
keeping-0.003
beh-0.003
 distances-0.003
Token happy
Token ID3772
Feature activation+1.27e-02

Pos logit contributions
 err+0.012
beh+0.012
keeping+0.012
wood+0.012
 across+0.011
Neg logit contributions
opus-0.015
uffy-0.012
imi-0.012
uddle-0.012
 replied-0.012
Token!
Token ID0
Feature activation-9.13e-03

Pos logit contributions
opus+0.017
 replied+0.014
uffy+0.014
uddle+0.013
imi+0.013
Neg logit contributions
 err-0.027
wood-0.026
beh-0.025
keeping-0.025
 distances-0.024
Token
Token ID220
Feature activation+1.86e-02

Pos logit contributions
 err+0.016
wood+0.016
keeping+0.016
beh+0.015
 across+0.015
Neg logit contributions
opus-0.016
uffy-0.013
uddle-0.012
 replied-0.012
imi-0.012
Token
Token ID198
Feature activation-7.75e-03

Pos logit contributions
opus+0.033
uffy+0.027
uddle+0.026
 replied+0.026
imi+0.026
Neg logit contributions
 err-0.031
wood-0.030
keeping-0.030
beh-0.030
 across-0.029
Token
Token ID198
Feature activation-6.69e-03

Pos logit contributions
 err+0.015
wood+0.014
beh+0.014
keeping+0.014
 distances+0.013
Neg logit contributions
opus-0.012
uffy-0.010
 replied-0.010
imi-0.010
uddle-0.010
TokenShe
Token ID3347
Feature activation-8.40e-04

Pos logit contributions
wood+0.014
 err+0.013
keeping+0.013
beh+0.012
 distances+0.012
Neg logit contributions
opus-0.010
 replied-0.008
imi-0.008
uffy-0.008
ara-0.008
Token was
Token ID373
Feature activation-8.22e-03

Pos logit contributions
 err+0.002
wood+0.002
keeping+0.002
beh+0.001
 across+0.001
Neg logit contributions
opus-0.001
 replied-0.001
uffy-0.001
uddle-0.001
imi-0.001
Token so
Token ID523
Feature activation+3.07e-03

Pos logit contributions
wood+0.013
 err+0.013
keeping+0.013
beh+0.012
 distances+0.012
Neg logit contributions
opus-0.014
 replied-0.013
 sincere-0.013
uffy-0.012
 shocked-0.012
Token excited
Token ID6568
Feature activation+5.69e-03

Pos logit contributions
opus+0.005
uffy+0.004
 replied+0.004
uddle+0.004
imi+0.004
Neg logit contributions
 err-0.005
wood-0.005
keeping-0.005
beh-0.005
 across-0.005
Token,"
Token ID553
Feature activation+6.31e-02

Pos logit contributions
opus+0.023
uffy+0.019
 replied+0.019
imi+0.018
uddle+0.017
Neg logit contributions
beh-0.043
 err-0.042
wood-0.041
keeping-0.040
ried-0.038
Token Mia
Token ID32189
Feature activation+9.53e-03

Pos logit contributions
opus+0.098
uddle+0.079
uffy+0.076
imi+0.070
ement+0.069
Neg logit contributions
 err-0.121
wood-0.119
keeping-0.117
beh-0.114
 across-0.114
Token says
Token ID1139
Feature activation+1.05e-02

Pos logit contributions
opus+0.011
uffy+0.007
uddle+0.007
imi+0.006
pron+0.006
Neg logit contributions
 err-0.023
wood-0.022
beh-0.022
 across-0.022
keeping-0.022
Token.
Token ID13
Feature activation-6.70e-03

Pos logit contributions
opus+0.017
uffy+0.013
 replied+0.013
uddle+0.013
imi+0.013
Neg logit contributions
 err-0.020
wood-0.019
keeping-0.019
beh-0.019
 across-0.018
Token "
Token ID366
Feature activation-2.49e-02

Pos logit contributions
 err+0.013
wood+0.012
keeping+0.012
beh+0.012
 across+0.012
Neg logit contributions
opus-0.011
uffy-0.009
uddle-0.008
 replied-0.008
imi-0.008
TokenI
Token ID40
Feature activation-1.40e-02

Pos logit contributions
wood+0.052
 err+0.051
keeping+0.048
beh+0.047
 throughout+0.045
Neg logit contributions
opus-0.035
ara-0.028
pron-0.025
uffy-0.025
on-0.023
Token like
Token ID588
Feature activation+1.85e-02

Pos logit contributions
keeping+0.022
 err+0.021
wood+0.021
beh+0.020
 distances+0.020
Neg logit contributions
opus-0.026
uffy-0.023
 replied-0.023
uddle-0.022
imi-0.021
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
opus+0.027
imi+0.022
uffy+0.021
 replied+0.021
uddle+0.020
Neg logit contributions
wood-0.035
beh-0.035
keeping-0.035
 err-0.034
ried-0.033
Token rain
Token ID6290
Feature activation-1.04e-02

Pos logit contributions
opus+0.018
uffy+0.014
 replied+0.014
uddle+0.014
imi+0.014
Neg logit contributions
 err-0.022
wood-0.022
keeping-0.022
beh-0.021
 across-0.021
Token.
Token ID13
Feature activation+3.25e-03

Pos logit contributions
 err+0.021
beh+0.021
wood+0.020
keeping+0.020
 distances+0.019
Neg logit contributions
opus-0.014
uffy-0.012
uddle-0.012
imi-0.011
 replied-0.011
Token It
Token ID632
Feature activation+2.51e-02

Pos logit contributions
opus+0.005
uffy+0.004
 replied+0.004
imi+0.004
uddle+0.004
Neg logit contributions
 err-0.006
keeping-0.006
wood-0.006
beh-0.006
 across-0.006
Token fun
Token ID1257
Feature activation+7.72e-03

Pos logit contributions
beh+0.049
 err+0.049
wood+0.048
keeping+0.047
ried+0.044
Neg logit contributions
opus-0.044
 replied-0.041
uddle-0.038
imi-0.037
uffy-0.037
Token.
Token ID13
Feature activation-1.97e-03

Pos logit contributions
opus+0.012
uffy+0.010
 replied+0.009
uddle+0.009
imi+0.009
Neg logit contributions
 err-0.015
wood-0.015
keeping-0.014
beh-0.014
 across-0.014
Token<|endoftext|>
Token ID50256
Feature activation+7.07e-03

Pos logit contributions
 err+0.003
wood+0.003
keeping+0.003
beh+0.003
 across+0.003
Neg logit contributions
opus-0.004
uffy-0.003
 replied-0.003
uddle-0.003
imi-0.003
TokenS
Token ID50
Feature activation+2.57e-05

Pos logit contributions
opus+0.012
uffy+0.010
 replied+0.009
uddle+0.009
imi+0.009
Neg logit contributions
 err-0.013
wood-0.012
keeping-0.012
beh-0.012
 across-0.012
Tokenara
Token ID3301
Feature activation+1.72e-02

Pos logit contributions
opus+0.000
 sorry+0.000
 sincere+0.000
 dear+0.000
 ow+0.000
Neg logit contributions
 across-0.000
wood-0.000
 err-0.000
Circ-0.000
 farms-0.000
Token and
Token ID290
Feature activation-4.95e-03

Pos logit contributions
opus+0.028
uffy+0.023
 replied+0.023
uddle+0.022
imi+0.022
Neg logit contributions
 err-0.032
wood-0.030
keeping-0.030
beh-0.030
 across-0.028
Token Tom
Token ID4186
Feature activation+1.15e-02

Pos logit contributions
 err+0.009
wood+0.009
keeping+0.009
beh+0.009
 distances+0.009
Neg logit contributions
opus-0.008
 replied-0.007
uffy-0.006
imi-0.006
ama-0.006
Token were
Token ID547
Feature activation-1.18e-02

Pos logit contributions
opus+0.019
uffy+0.016
 replied+0.015
uddle+0.015
imi+0.015
Neg logit contributions
 err-0.021
wood-0.020
keeping-0.020
beh-0.020
 across-0.019
Token playing
Token ID2712
Feature activation+2.47e-02

Pos logit contributions
 err+0.018
 across+0.017
beh+0.016
wood+0.016
keeping+0.016
Neg logit contributions
opus-0.024
uddle-0.019
uffy-0.019
 replied-0.019
imi-0.018
Token at
Token ID379
Feature activation+6.76e-03

Pos logit contributions
opus+0.056
uffy+0.048
 replied+0.047
uddle+0.047
imi+0.047
Neg logit contributions
 err-0.030
wood-0.029
keeping-0.028
beh-0.028
 across-0.026
Token the
Token ID262
Feature activation+3.38e-02

Pos logit contributions
opus+0.010
imi+0.009
uffy+0.009
 replied+0.009
uddle+0.008
Neg logit contributions
beh-0.012
 err-0.012
wood-0.012
keeping-0.011
 distances-0.011
Token hears
Token ID23180
Feature activation-2.25e-03

Pos logit contributions
opus+0.038
uddle+0.031
uffy+0.030
ement+0.030
 replied+0.029
Neg logit contributions
wood-0.031
keeping-0.031
 err-0.030
 across-0.030
beh-0.029
Token a
Token ID257
Feature activation+1.69e-02

Pos logit contributions
 err+0.003
wood+0.003
keeping+0.003
beh+0.003
 across+0.002
Neg logit contributions
opus-0.005
uffy-0.004
 replied-0.004
uddle-0.004
imi-0.004
Token strange
Token ID6283
Feature activation+1.12e-03

Pos logit contributions
opus+0.029
uffy+0.025
ama+0.023
uddle+0.023
ement+0.023
Neg logit contributions
 err-0.030
 across-0.027
 even-0.026
wood-0.026
 distances-0.024
Token noise
Token ID7838
Feature activation+2.47e-02

Pos logit contributions
opus+0.002
uffy+0.001
uddle+0.001
 replied+0.001
imi+0.001
Neg logit contributions
 err-0.002
wood-0.002
keeping-0.002
beh-0.002
 across-0.002
Token outside
Token ID2354
Feature activation+2.96e-02

Pos logit contributions
opus+0.041
uffy+0.033
 replied+0.033
uddle+0.032
imi+0.032
Neg logit contributions
 err-0.044
wood-0.043
keeping-0.043
beh-0.042
 across-0.041
Token.
Token ID13
Feature activation-1.94e-03

Pos logit contributions
opus+0.055
uffy+0.046
uddle+0.045
 replied+0.043
imi+0.043
Neg logit contributions
 err-0.049
keeping-0.047
wood-0.047
 across-0.046
beh-0.045
Token He
Token ID679
Feature activation+1.71e-02

Pos logit contributions
 err+0.003
wood+0.003
keeping+0.003
beh+0.003
 across+0.003
Neg logit contributions
opus-0.003
uffy-0.003
uddle-0.003
imi-0.003
 replied-0.003
Token is
Token ID318
Feature activation-1.53e-03

Pos logit contributions
opus+0.026
uffy+0.021
uddle+0.020
imi+0.020
 replied+0.020
Neg logit contributions
 err-0.033
wood-0.032
 across-0.031
keeping-0.031
beh-0.031
Token scared
Token ID12008
Feature activation+1.78e-02

Pos logit contributions
 err+0.003
wood+0.003
keeping+0.003
beh+0.003
 distances+0.003
Neg logit contributions
opus-0.002
uffy-0.002
 replied-0.002
imi-0.002
uddle-0.002
Token.
Token ID13
Feature activation+5.84e-03

Pos logit contributions
opus+0.033
uffy+0.027
uddle+0.027
 replied+0.027
imi+0.026
Neg logit contributions
 err-0.029
wood-0.028
keeping-0.028
 across-0.027
beh-0.027
Token He
Token ID679
Feature activation+1.81e-02

Pos logit contributions
opus+0.011
uffy+0.009
uddle+0.009
imi+0.009
 replied+0.008
Neg logit contributions
wood-0.009
 err-0.009
keeping-0.009
 across-0.009
beh-0.009
Token bird
Token ID6512
Feature activation+1.15e-03

Pos logit contributions
opus+0.001
ement+0.000
uffy+0.000
imi+0.000
uddle+0.000
Neg logit contributions
 err-0.001
 across-0.000
wood-0.000
 even-0.000
keeping-0.000
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
opus+0.002
uffy+0.002
 replied+0.002
uddle+0.002
imi+0.002
Neg logit contributions
 err-0.002
wood-0.002
keeping-0.002
beh-0.002
 across-0.002
Token The
Token ID383
Feature activation-1.35e-02

Pos logit contributions
wood+0.026
 err+0.026
 across+0.025
keeping+0.024
beh+0.024
Neg logit contributions
opus-0.037
uffy-0.031
uddle-0.031
imi-0.030
ama-0.029
Token bird
Token ID6512
Feature activation+1.10e-02

Pos logit contributions
 err+0.027
 across+0.025
 even+0.024
wood+0.024
beh+0.024
Neg logit contributions
opus-0.020
uffy-0.015
ement-0.015
uddle-0.015
 replied-0.014
Token did
Token ID750
Feature activation-6.37e-03

Pos logit contributions
opus+0.018
uddle+0.014
uffy+0.014
ama+0.013
imi+0.013
Neg logit contributions
 even-0.020
 across-0.019
wood-0.019
beh-0.019
 err-0.019
Token not
Token ID407
Feature activation-1.70e-02

Pos logit contributions
wood+0.013
keeping+0.013
 err+0.012
beh+0.012
gent+0.012
Neg logit contributions
opus-0.009
 replied-0.007
uffy-0.007
uddle-0.007
imi-0.007
Token like
Token ID588
Feature activation+2.66e-02

Pos logit contributions
 err+0.026
keeping+0.025
wood+0.025
beh+0.023
 distances+0.022
Neg logit contributions
opus-0.031
 replied-0.029
imi-0.026
 sincere-0.025
uffy-0.025
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
opus+0.050
uffy+0.042
 replied+0.041
imi+0.041
uddle+0.041
Neg logit contributions
 err-0.041
wood-0.040
keeping-0.040
beh-0.039
 across-0.037
Token cat
Token ID3797
Feature activation+1.18e-02

Pos logit contributions
opus+0.018
uffy+0.014
uddle+0.013
imi+0.013
ama+0.013
Neg logit contributions
 err-0.020
 across-0.019
keeping-0.019
wood-0.019
beh-0.018
Token's
Token ID338
Feature activation-1.21e-03

Pos logit contributions
opus+0.022
uffy+0.017
uddle+0.017
imi+0.017
ama+0.017
Neg logit contributions
 err-0.019
 across-0.019
keeping-0.019
beh-0.019
wood-0.019
Token wink
Token ID41955
Feature activation+1.52e-02

Pos logit contributions
 err+0.002
wood+0.002
keeping+0.002
beh+0.002
 across+0.002
Neg logit contributions
opus-0.002
uffy-0.002
 replied-0.002
uddle-0.002
imi-0.002
Token coming
Token ID2406
Feature activation-7.52e-03

Pos logit contributions
keeping+0.010
beh+0.009
and+0.009
 err+0.009
uries+0.009
Neg logit contributions
opus-0.008
 replied-0.008
 sincere-0.007
ement-0.007
 sorry-0.007
Token.
Token ID13
Feature activation-2.67e-03

Pos logit contributions
beh+0.011
keeping+0.011
 err+0.011
 distances+0.010
wood+0.010
Neg logit contributions
opus-0.014
 replied-0.012
uddle-0.012
uffy-0.012
ama-0.012
Token
Token ID198
Feature activation-9.57e-03

Pos logit contributions
 err+0.005
wood+0.004
keeping+0.004
beh+0.004
 across+0.004
Neg logit contributions
opus-0.005
uffy-0.004
 replied-0.004
uddle-0.004
imi-0.004
Token
Token ID198
Feature activation-7.68e-03

Pos logit contributions
 err+0.016
wood+0.015
keeping+0.015
beh+0.015
 across+0.015
Neg logit contributions
opus-0.017
uffy-0.014
uddle-0.014
 replied-0.014
imi-0.014
TokenSo
Token ID2396
Feature activation-2.76e-02

Pos logit contributions
keeping+0.014
wood+0.014
 err+0.013
beh+0.013
ried+0.012
Neg logit contributions
opus-0.014
 replied-0.011
uffy-0.010
uddle-0.010
ama-0.010
Token the
Token ID262
Feature activation-2.64e-02

Pos logit contributions
 err+0.045
wood+0.044
keeping+0.043
beh+0.043
 across+0.041
Neg logit contributions
opus-0.051
uffy-0.042
 replied-0.041
uddle-0.041
imi-0.040
Token brave
Token ID14802
Feature activation-3.95e-02

Pos logit contributions
 err+0.051
wood+0.050
keeping+0.050
beh+0.049
 across+0.048
Neg logit contributions
opus-0.040
uffy-0.032
uddle-0.031
 replied-0.031
imi-0.030
Token boy
Token ID2933
Feature activation-1.40e-02

Pos logit contributions
 err+0.089
wood+0.087
keeping+0.087
beh+0.086
 across+0.084
Neg logit contributions
opus-0.048
uffy-0.036
 replied-0.034
uddle-0.034
imi-0.033
Token and
Token ID290
Feature activation-8.69e-03

Pos logit contributions
 err+0.024
keeping+0.023
wood+0.022
beh+0.022
 distances+0.021
Neg logit contributions
opus-0.025
 replied-0.021
uffy-0.020
uddle-0.020
imi-0.019
Token his
Token ID465
Feature activation-1.73e-02

Pos logit contributions
 err+0.017
uries+0.016
keeping+0.016
 distances+0.016
beh+0.016
Neg logit contributions
 replied-0.011
opus-0.011
 brother-0.010
 mom-0.009
 teacher-0.009
Token friends
Token ID2460
Feature activation-2.65e-02

Pos logit contributions
 err+0.034
wood+0.033
keeping+0.033
beh+0.033
 across+0.032
Neg logit contributions
opus-0.026
uffy-0.020
uddle-0.020
 replied-0.020
imi-0.019
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
 err+0.009
beh+0.009
wood+0.008
keeping+0.008
 distances+0.008
Neg logit contributions
opus-0.006
uffy-0.005
 replied-0.004
uddle-0.004
imi-0.004
Token
Token ID198
Feature activation+3.68e-03

Pos logit contributions
opus+0.030
uddle+0.025
uffy+0.024
imi+0.024
maid+0.023
Neg logit contributions
 across-0.014
 themselves-0.013
keeping-0.013
and-0.012
 distances-0.012
Token
Token ID198
Feature activation+6.57e-03

Pos logit contributions
opus+0.008
uddle+0.007
uffy+0.007
pron+0.007
ement+0.007
Neg logit contributions
 across-0.005
 err-0.005
keeping-0.004
 themselves-0.004
wood-0.004
TokenTom
Token ID13787
Feature activation+1.89e-02

Pos logit contributions
opus+0.011
uffy+0.009
uddle+0.009
 replied+0.009
imi+0.009
Neg logit contributions
 err-0.012
keeping-0.011
beh-0.011
wood-0.011
 across-0.011
Token hugged
Token ID41130
Feature activation+2.51e-02

Pos logit contributions
opus+0.042
ement+0.035
imi+0.035
ama+0.034
uffy+0.034
Neg logit contributions
 even-0.025
 across-0.023
ried-0.022
 over-0.020
wood-0.018
Token Lily
Token ID20037
Feature activation-2.70e-02

Pos logit contributions
opus+0.044
uffy+0.036
 replied+0.035
uddle+0.035
imi+0.035
Neg logit contributions
 err-0.043
wood-0.042
keeping-0.042
beh-0.042
 across-0.040
Token and
Token ID290
Feature activation+2.37e-02

Pos logit contributions
 err+0.046
wood+0.044
keeping+0.044
beh+0.043
 across+0.042
Neg logit contributions
opus-0.048
uffy-0.040
 replied-0.039
uddle-0.038
imi-0.038
Token said
Token ID531
Feature activation+3.47e-03

Pos logit contributions
opus+0.036
uffy+0.028
uddle+0.027
ama+0.026
imi+0.025
Neg logit contributions
 err-0.048
wood-0.046
beh-0.046
keeping-0.046
 even-0.045
Token,
Token ID11
Feature activation-2.06e-02

Pos logit contributions
opus+0.006
uddle+0.005
uffy+0.005
imi+0.005
pron+0.005
Neg logit contributions
 across-0.006
 even-0.005
 along-0.005
 err-0.005
 themselves-0.005
Token "
Token ID366
Feature activation-3.00e-02

Pos logit contributions
 err+0.034
 across+0.034
wood+0.034
keeping+0.033
beh+0.032
Neg logit contributions
opus-0.036
uffy-0.030
uddle-0.029
imi-0.029
 replied-0.028
TokenI
Token ID40
Feature activation-1.68e-02

Pos logit contributions
 err+0.040
keeping+0.039
beh+0.039
wood+0.039
 across+0.038
Neg logit contributions
opus-0.063
uffy-0.055
uddle-0.053
imi-0.052
 replied-0.052
Token excited
Token ID6568
Feature activation-5.17e-03

Pos logit contributions
 err+0.032
keeping+0.032
wood+0.031
beh+0.031
 distances+0.029
Neg logit contributions
opus-0.031
 replied-0.028
uffy-0.026
uddle-0.025
imi-0.025
Tokenly
Token ID306
Feature activation-1.83e-02

Pos logit contributions
 err+0.009
wood+0.009
keeping+0.009
beh+0.009
 across+0.009
Neg logit contributions
opus-0.009
uffy-0.007
uddle-0.007
imi-0.007
 replied-0.007
Token.
Token ID13
Feature activation-4.16e-03

Pos logit contributions
 err+0.036
wood+0.035
beh+0.034
keeping+0.034
 distances+0.033
Neg logit contributions
opus-0.026
 replied-0.022
uffy-0.021
uddle-0.020
imi-0.019
Token
Token ID198
Feature activation-1.17e-02

Pos logit contributions
wood+0.007
 err+0.006
keeping+0.006
 across+0.006
beh+0.006
Neg logit contributions
opus-0.008
uffy-0.006
imi-0.006
uddle-0.006
 replied-0.006
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
 err+0.017
 across+0.016
wood+0.016
keeping+0.016
and+0.016
Neg logit contributions
opus-0.023
uffy-0.020
uddle-0.019
imi-0.019
ement-0.019
TokenThe
Token ID464
Feature activation-2.88e-02

Pos logit contributions
 err+0.021
wood+0.021
keeping+0.021
beh+0.021
 distances+0.020
Neg logit contributions
opus-0.018
 replied-0.013
uddle-0.013
uffy-0.013
ara-0.012
Token friends
Token ID2460
Feature activation-9.70e-03

Pos logit contributions
 err+0.053
wood+0.052
keeping+0.050
 across+0.049
 even+0.048
Neg logit contributions
opus-0.046
uffy-0.039
uddle-0.037
imi-0.036
 replied-0.036
Token hopped
Token ID45230
Feature activation-4.72e-02

Pos logit contributions
 err+0.016
wood+0.015
keeping+0.015
beh+0.015
 across+0.014
Neg logit contributions
opus-0.018
uffy-0.015
 replied-0.015
uddle-0.014
imi-0.014
Token on
Token ID319
Feature activation-2.84e-02

Pos logit contributions
beh+0.081
keeping+0.079
uries+0.078
 former+0.073
 distances+0.073
Neg logit contributions
 replied-0.075
opus-0.071
uddle-0.063
 box-0.062
 says-0.060
Token board
Token ID3096
Feature activation-2.76e-02

Pos logit contributions
beh+0.056
wood+0.052
keeping+0.051
uries+0.049
ried+0.049
Neg logit contributions
opus-0.038
 replied-0.037
uddle-0.034
 mom-0.034
 box-0.033
Token the
Token ID262
Feature activation-1.06e-02

Pos logit contributions
beh+0.058
keeping+0.053
wood+0.052
 err+0.050
uries+0.049
Neg logit contributions
opus-0.037
 replied-0.034
uddle-0.031
ement-0.029
 box-0.029
Token,
Token ID11
Feature activation-2.44e-02

Pos logit contributions
 err+0.006
wood+0.005
keeping+0.005
beh+0.005
 across+0.005
Neg logit contributions
opus-0.004
uffy-0.003
uddle-0.003
 replied-0.003
imi-0.003
Token "
Token ID366
Feature activation-3.56e-02

Pos logit contributions
 err+0.043
wood+0.043
keeping+0.042
 across+0.041
beh+0.041
Neg logit contributions
opus-0.040
uffy-0.033
uddle-0.032
imi-0.031
 replied-0.031
TokenS
Token ID50
Feature activation-8.48e-03

Pos logit contributions
 err+0.070
wood+0.066
beh+0.064
keeping+0.063
 across+0.062
Neg logit contributions
opus-0.053
ara-0.047
 replied-0.044
ama-0.040
uffy-0.039
Tokenue
Token ID518
Feature activation-4.80e-04

Pos logit contributions
 err+0.020
beh+0.020
wood+0.020
keeping+0.019
 across+0.019
Neg logit contributions
opus-0.009
 replied-0.006
pron-0.006
imi-0.005
uddle-0.005
Token,
Token ID11
Feature activation-8.91e-03

Pos logit contributions
wood+0.001
 across+0.001
 even+0.001
keeping+0.001
beh+0.001
Neg logit contributions
opus-0.001
uddle-0.001
ement-0.001
pron-0.001
rescent-0.001
Token we
Token ID356
Feature activation-2.70e-02

Pos logit contributions
 err+0.017
wood+0.017
keeping+0.017
beh+0.016
 across+0.016
Neg logit contributions
opus-0.014
uffy-0.011
 replied-0.011
uddle-0.011
imi-0.011
Token can
Token ID460
Feature activation+4.66e-03

Pos logit contributions
 err+0.042
wood+0.042
beh+0.041
keeping+0.041
 across+0.040
Neg logit contributions
opus-0.051
uffy-0.042
imi-0.041
uddle-0.041
 replied-0.040
Token use
Token ID779
Feature activation-1.36e-02

Pos logit contributions
opus+0.010
uffy+0.009
uddle+0.008
imi+0.008
ement+0.008
Neg logit contributions
 err-0.006
wood-0.006
 even-0.006
beh-0.006
keeping-0.005
Token this
Token ID428
Feature activation-8.54e-03

Pos logit contributions
beh+0.026
keeping+0.025
 err+0.024
wood+0.023
ried+0.023
Neg logit contributions
opus-0.020
uffy-0.017
 replied-0.017
imi-0.017
uddle-0.016
Token salt
Token ID8268
Feature activation-1.72e-02

Pos logit contributions
beh+0.017
keeping+0.016
 err+0.016
wood+0.016
 distances+0.015
Neg logit contributions
opus-0.013
uffy-0.011
uddle-0.011
 replied-0.010
imi-0.010
Token to
Token ID284
Feature activation+1.13e-02

Pos logit contributions
 err+0.028
beh+0.028
keeping+0.026
 distances+0.025
wood+0.025
Neg logit contributions
opus-0.031
uffy-0.027
uddle-0.027
 replied-0.026
imi-0.025
Token her
Token ID607
Feature activation-5.49e-03

Pos logit contributions
opus+0.033
 replied+0.028
uffy+0.028
imi+0.027
ement+0.026
Neg logit contributions
 err-0.030
wood-0.030
beh-0.030
keeping-0.029
 distances-0.027
Token best
Token ID1266
Feature activation-2.02e-02

Pos logit contributions
 err+0.011
wood+0.010
 across+0.010
keeping+0.010
 even+0.010
Neg logit contributions
opus-0.009
imi-0.006
uffy-0.006
 replied-0.006
uddle-0.006
Token friend
Token ID1545
Feature activation-9.51e-03

Pos logit contributions
 err+0.042
wood+0.041
keeping+0.041
beh+0.040
 across+0.040
Neg logit contributions
opus-0.028
uffy-0.021
uddle-0.020
imi-0.020
 replied-0.020
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
 err+0.016
wood+0.015
beh+0.015
keeping+0.015
 distances+0.014
Neg logit contributions
opus-0.017
uffy-0.014
 replied-0.014
imi-0.014
uddle-0.014
Token The
Token ID383
Feature activation-1.12e-02

Pos logit contributions
 err+0.019
keeping+0.019
wood+0.019
 across+0.018
beh+0.018
Neg logit contributions
opus-0.021
uffy-0.017
uddle-0.017
imi-0.016
 replied-0.016
Token moral
Token ID6573
Feature activation-7.00e-02

Pos logit contributions
 err+0.020
keeping+0.019
wood+0.019
beh+0.019
 across+0.018
Neg logit contributions
opus-0.019
uffy-0.016
uddle-0.015
imi-0.015
 replied-0.015
Token of
Token ID286
Feature activation-7.46e-03

Pos logit contributions
wood+0.151
ried+0.146
 distances+0.145
keeping+0.145
beh+0.144
Neg logit contributions
opus-0.082
 replied-0.073
ara-0.068
ama-0.063
imi-0.063
Token the
Token ID262
Feature activation-1.65e-02

Pos logit contributions
wood+0.013
beh+0.013
 err+0.013
keeping+0.012
 distances+0.012
Neg logit contributions
opus-0.012
 replied-0.011
imi-0.011
uffy-0.010
ama-0.010
Token story
Token ID1621
Feature activation-2.53e-02

Pos logit contributions
 err+0.040
wood+0.039
keeping+0.039
beh+0.039
 across+0.038
Neg logit contributions
opus-0.017
uffy-0.012
uddle-0.011
 replied-0.011
imi-0.011
Token is
Token ID318
Feature activation-7.74e-03

Pos logit contributions
 err+0.062
wood+0.061
keeping+0.061
beh+0.060
 distances+0.058
Neg logit contributions
opus-0.025
uffy-0.018
 replied-0.017
uddle-0.017
imi-0.016
Token that
Token ID326
Feature activation+1.37e-02

Pos logit contributions
 err+0.015
keeping+0.015
wood+0.015
 distances+0.014
beh+0.014
Neg logit contributions
opus-0.012
uffy-0.010
 replied-0.009
imi-0.009
ama-0.009
Token return
Token ID1441
Feature activation-1.61e-02

Pos logit contributions
beh+0.038
 err+0.037
wood+0.037
keeping+0.036
gent+0.033
Neg logit contributions
opus-0.037
 replied-0.034
uffy-0.031
inkle-0.030
ister-0.030
Token.
Token ID13
Feature activation-4.78e-03

Pos logit contributions
 err+0.031
beh+0.031
wood+0.029
 distances+0.028
keeping+0.027
Neg logit contributions
opus-0.022
 replied-0.020
uffy-0.019
uddle-0.018
ement-0.018
Token
Token ID198
Feature activation-6.90e-03

Pos logit contributions
 err+0.008
wood+0.008
keeping+0.008
beh+0.008
 across+0.007
Neg logit contributions
opus-0.008
uffy-0.007
 replied-0.007
uddle-0.007
imi-0.007
Token
Token ID198
Feature activation-7.73e-03

Pos logit contributions
 err+0.012
wood+0.012
keeping+0.012
beh+0.012
 across+0.011
Neg logit contributions
opus-0.012
uffy-0.010
 replied-0.009
uddle-0.009
imi-0.009
TokenThe
Token ID464
Feature activation-1.84e-02

Pos logit contributions
wood+0.014
keeping+0.013
 err+0.013
beh+0.013
ried+0.012
Neg logit contributions
opus-0.014
 replied-0.011
uffy-0.011
imi-0.011
ama-0.010
Token next
Token ID1306
Feature activation-6.58e-02

Pos logit contributions
 err+0.034
wood+0.033
keeping+0.033
beh+0.033
 across+0.032
Neg logit contributions
opus-0.029
uffy-0.024
 replied-0.023
uddle-0.023
imi-0.023
Token morning
Token ID3329
Feature activation+9.74e-03

Pos logit contributions
 err+0.084
keeping+0.084
beh+0.082
wood+0.081
 distances+0.077
Neg logit contributions
opus-0.145
 replied-0.124
uffy-0.123
uddle-0.120
imi-0.119
Token,
Token ID11
Feature activation-5.16e-03

Pos logit contributions
opus+0.015
uffy+0.013
imi+0.012
uddle+0.012
 replied+0.011
Neg logit contributions
 err-0.018
wood-0.018
 across-0.018
keeping-0.017
 distances-0.016
Token the
Token ID262
Feature activation-1.66e-02

Pos logit contributions
 err+0.008
wood+0.008
keeping+0.008
beh+0.007
 distances+0.007
Neg logit contributions
opus-0.010
uffy-0.008
 replied-0.008
uddle-0.008
imi-0.008
Token pink
Token ID11398
Feature activation-9.70e-03

Pos logit contributions
 err+0.031
wood+0.030
keeping+0.030
beh+0.029
 across+0.029
Neg logit contributions
opus-0.027
uffy-0.022
uddle-0.021
imi-0.021
 replied-0.021
Token elephant
Token ID20950
Feature activation+3.28e-03

Pos logit contributions
 err+0.018
wood+0.018
keeping+0.018
 across+0.017
 distances+0.017
Neg logit contributions
opus-0.015
uffy-0.012
uddle-0.012
imi-0.011
 replied-0.011
Token They
Token ID1119
Feature activation-1.11e-02

Pos logit contributions
 err+0.022
wood+0.022
keeping+0.021
beh+0.021
 across+0.020
Neg logit contributions
opus-0.027
uffy-0.022
 replied-0.022
uddle-0.022
imi-0.021
Token gave
Token ID2921
Feature activation-7.64e-03

Pos logit contributions
 err+0.020
wood+0.019
keeping+0.019
beh+0.019
 across+0.018
Neg logit contributions
opus-0.019
uffy-0.015
 replied-0.015
uddle-0.015
imi-0.015
Token her
Token ID607
Feature activation-3.63e-02

Pos logit contributions
 err+0.011
beh+0.011
wood+0.011
keeping+0.011
 distances+0.011
Neg logit contributions
opus-0.014
 replied-0.013
uffy-0.013
imi-0.012
uddle-0.012
Token a
Token ID257
Feature activation-1.46e-02

Pos logit contributions
 err+0.050
keeping+0.050
beh+0.050
wood+0.050
 distances+0.048
Neg logit contributions
opus-0.072
 replied-0.063
uffy-0.063
uddle-0.061
imi-0.061
Token band
Token ID4097
Feature activation-5.05e-02

Pos logit contributions
 err+0.029
wood+0.027
 across+0.027
keeping+0.026
beh+0.026
Neg logit contributions
opus-0.022
uffy-0.017
imi-0.017
uddle-0.017
 replied-0.017
Token-
Token ID12
Feature activation-6.59e-02

Pos logit contributions
 distances+0.081
 err+0.080
wood+0.079
beh+0.078
keeping+0.077
Neg logit contributions
opus-0.086
 replied-0.079
uffy-0.074
uddle-0.071
ama-0.068
Tokenaid
Token ID1698
Feature activation-2.85e-02

Pos logit contributions
keeping+0.128
 distances+0.127
wood+0.126
 err+0.125
uries+0.117
Neg logit contributions
opus-0.096
 replied-0.078
uffy-0.070
on-0.070
ara-0.068
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
 err+0.048
beh+0.046
 distances+0.045
keeping+0.045
wood+0.044
Neg logit contributions
opus-0.050
 replied-0.044
uffy-0.043
uddle-0.041
imi-0.040
Token helped
Token ID4193
Feature activation+1.85e-03

Pos logit contributions
 err+0.023
keeping+0.023
wood+0.023
beh+0.023
 distances+0.022
Neg logit contributions
opus-0.020
 replied-0.018
uffy-0.017
uddle-0.016
imi-0.016
Token her
Token ID607
Feature activation-1.82e-02

Pos logit contributions
opus+0.003
uffy+0.003
 replied+0.003
imi+0.003
ama+0.003
Neg logit contributions
wood-0.003
 err-0.003
beh-0.003
 distances-0.003
keeping-0.003
Token stand
Token ID1302
Feature activation-1.56e-02

Pos logit contributions
 err+0.032
keeping+0.032
beh+0.031
wood+0.031
 distances+0.030
Neg logit contributions
opus-0.029
uffy-0.026
 replied-0.026
uddle-0.024
imi-0.024
Token course
Token ID1781
Feature activation+1.94e-03

Pos logit contributions
 err+0.023
beh+0.022
wood+0.022
 across+0.021
keeping+0.021
Neg logit contributions
opus-0.019
uddle-0.015
uffy-0.015
imi-0.014
 replied-0.014
Token!"
Token ID2474
Feature activation+4.45e-03

Pos logit contributions
opus+0.003
uffy+0.002
uddle+0.002
imi+0.002
 replied+0.002
Neg logit contributions
 err-0.004
wood-0.004
keeping-0.004
 across-0.004
beh-0.004
Token
Token ID220
Feature activation-2.42e-03

Pos logit contributions
opus+0.007
 replied+0.006
uffy+0.006
uddle+0.005
imi+0.005
Neg logit contributions
 err-0.008
wood-0.008
beh-0.008
keeping-0.008
 distances-0.008
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
 err+0.004
wood+0.004
keeping+0.004
beh+0.004
 across+0.004
Neg logit contributions
opus-0.005
uffy-0.004
uddle-0.004
 replied-0.004
imi-0.004
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
 err+0.024
keeping+0.023
wood+0.023
beh+0.023
 across+0.022
Neg logit contributions
opus-0.026
uffy-0.021
uddle-0.021
 replied-0.020
imi-0.020
TokenTogether
Token ID41631
Feature activation-6.85e-02

Pos logit contributions
wood+0.026
keeping+0.024
 err+0.024
beh+0.023
 distances+0.023
Neg logit contributions
opus-0.021
 replied-0.018
uffy-0.017
imi-0.017
ama-0.017
Token,
Token ID11
Feature activation-2.40e-02

Pos logit contributions
 err+0.141
wood+0.139
keeping+0.138
beh+0.137
 across+0.132
Neg logit contributions
opus-0.095
uffy-0.075
 replied-0.073
uddle-0.072
imi-0.071
Token they
Token ID484
Feature activation-3.45e-02

Pos logit contributions
wood+0.045
beh+0.044
 err+0.044
keeping+0.044
 distances+0.042
Neg logit contributions
opus-0.036
 replied-0.032
uffy-0.030
uddle-0.029
ama-0.029
Token worked
Token ID3111
Feature activation-4.21e-02

Pos logit contributions
 err+0.058
keeping+0.056
beh+0.055
wood+0.053
 distances+0.052
Neg logit contributions
opus-0.060
 replied-0.053
uffy-0.050
uddle-0.049
ama-0.047
Token on
Token ID319
Feature activation-2.73e-02

Pos logit contributions
keeping+0.081
beh+0.076
uries+0.074
wood+0.072
 distances+0.071
Neg logit contributions
 replied-0.056
ement-0.053
opus-0.052
uddle-0.051
uffy-0.050
Token the
Token ID262
Feature activation+4.16e-03

Pos logit contributions
beh+0.056
wood+0.051
uries+0.048
ried+0.048
keeping+0.047
Neg logit contributions
 box-0.033
uffy-0.032
 mom-0.032
 replied-0.030
 grandma-0.029
Token gift
Token ID6979
Feature activation-6.46e-03

Pos logit contributions
 err+0.057
beh+0.056
wood+0.056
keeping+0.055
 distances+0.054
Neg logit contributions
opus-0.045
 replied-0.036
uffy-0.036
uddle-0.035
imi-0.034
Token.
Token ID13
Feature activation-7.72e-03

Pos logit contributions
 err+0.011
wood+0.011
keeping+0.011
beh+0.010
 across+0.010
Neg logit contributions
opus-0.011
uffy-0.009
uddle-0.009
imi-0.009
 replied-0.009
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
wood+0.012
 err+0.012
keeping+0.012
 across+0.011
beh+0.011
Neg logit contributions
opus-0.015
uffy-0.012
uddle-0.012
imi-0.012
 replied-0.012
Token
Token ID198
Feature activation-1.37e-02

Pos logit contributions
 err+0.022
wood+0.021
keeping+0.021
beh+0.020
 across+0.020
Neg logit contributions
opus-0.025
uffy-0.021
uddle-0.020
 replied-0.020
imi-0.020
TokenThe
Token ID464
Feature activation-1.80e-02

Pos logit contributions
wood+0.024
keeping+0.024
 err+0.023
beh+0.022
 across+0.021
Neg logit contributions
opus-0.025
 replied-0.019
imi-0.019
uffy-0.019
uddle-0.018
Token next
Token ID1306
Feature activation-6.86e-02

Pos logit contributions
 err+0.035
wood+0.035
keeping+0.034
beh+0.034
 across+0.033
Neg logit contributions
opus-0.028
uffy-0.022
 replied-0.021
uddle-0.021
imi-0.021
Token day
Token ID1110
Feature activation-1.25e-03

Pos logit contributions
keeping+0.105
beh+0.100
 err+0.098
wood+0.097
 distances+0.096
Neg logit contributions
opus-0.142
 replied-0.119
uffy-0.115
uddle-0.114
imi-0.113
Token,
Token ID11
Feature activation-8.09e-04

Pos logit contributions
wood+0.002
 err+0.002
 across+0.002
keeping+0.002
beh+0.002
Neg logit contributions
opus-0.002
uffy-0.002
imi-0.002
uddle-0.002
pron-0.001
Token the
Token ID262
Feature activation-1.06e-02

Pos logit contributions
 err+0.001
wood+0.001
keeping+0.001
beh+0.001
 across+0.001
Neg logit contributions
opus-0.002
uffy-0.001
 replied-0.001
uddle-0.001
imi-0.001
Token children
Token ID1751
Feature activation-3.08e-03

Pos logit contributions
 err+0.021
wood+0.021
keeping+0.021
beh+0.020
 across+0.020
Neg logit contributions
opus-0.015
uffy-0.012
 replied-0.012
uddle-0.012
imi-0.012
Token of
Token ID286
Feature activation-3.89e-02

Pos logit contributions
 err+0.005
wood+0.005
keeping+0.005
beh+0.005
 across+0.004
Neg logit contributions
opus-0.006
uffy-0.005
uddle-0.005
 replied-0.005
imi-0.005