TOP ACTIVATIONS
MAX = 3.671

pretty kitten . From that day on , Lily made sure
hard work . From that day on , Jerry became known
helped him . From that day on , Billy and Harry
. From that day on , Nut ty made
motorcycles . He had a toy motorcycle that he would ride
helicopters . He had a toy helicopter that he played with
. From that day on , Jolly and his
nut together . From that day on , Nut ty knew
helicopters . He had a toy helicopter that he played with
good friends . From that day on , Max always replied
very happy . From that day on , Sue knew she
gray cloth . From that day onwards , Sandra always enjoyed
river bank . From that day on , Gig i knew
. From that day on , Mitt ens and
right thing . From that day on , Nut ty always
of fun . From that day on , Max and Luna
fun day . From that day on , they became good
each other . From that day on , Lily and Rose
with Fred . From that day on , Tim felt strong
together happily . From that day on , Lily always remembered

MIN ACTIVATIONS
MIN = -3.093

little boy carrying a box filled with empty boxes , wood
she found a huge box filled with costumes . She tried
I found a shell !" she said . Her mom smiled
my ! A flower !" she said .
this is so pretty !" she said . But then she
found a bath robe !" she said happily .
it too ?" Tim shared the marble with Sue . They
and content . B obby and the star were
Yes , you can go , but you need to be
shoe is so pretty !" she said to herself .
and gave it a good tidy . The nap
! A fire work !" she said . Tim looked up
their piles together and share their finds . They had more
Kate and the other was named Ben . They were playing
said it was a very special shell . The little girl
dress , but it didn 't fit her anymore . She
Joy and the other was named Jan . Jan was very
get the big pastry ?" she asked . Her mom said
are special ." B enny promised to show Ell
From that day forward the little girl was never without her

INTERVAL 1.980 - 3.671
CONTAINS 0.226%

other again . From that day on , Tim and Mo
. <|endoftext|> Sam and Alice were at the stadium . It
as can be . From then on , when Andy and
in the rain . From then on , she always looked
. <|endoftext|> Tom and Lily were twins who liked to play

INTERVAL 0.289 - 1.980
CONTAINS 30.615%

friend the fox . They would go to the station and
girl playing with a pink ball too . Lily wanted to
Mia lost her best friend because of a bad promise and
things in nature . Whenever he saw something he knew ,
. The friend said he wasn 't feeling

INTERVAL -1.402 - 0.289
CONTAINS 68.192%

," Sara said . They went to the kitchen
; maybe he could help clean up the waste .
thanked them . They started to build towers and houses and
Today was different , as today James came with his mum
loved the crystal and they all played together in the wide

INTERVAL -3.093 - -1.402
CONTAINS 0.967%

! She went to get it , and when she came
, her mom told her that if she didn 't clean
's what makes you happy and makes others happy too ."
water . The net was so big that it could catch
was so delicious ! The little girl was so happy that
Token pretty
Token ID2495
Feature activation-7.62e-01

Pos logit contributions
ro+0.032
uzz+0.028
emo+0.027
keeping+0.027
onna+0.025
Neg logit contributions
aw-0.023
 Other-0.023
 particularly-0.023
 middle-0.022
-0.021
Token kitten
Token ID42143
Feature activation+7.42e-01

Pos logit contributions
ro+0.031
uzz+0.028
emo+0.028
keeping+0.027
onna+0.026
Neg logit contributions
aw-0.032
 Other-0.030
 particularly-0.030
 middle-0.029
-0.028
Token.
Token ID13
Feature activation+9.13e-01

Pos logit contributions
aw+0.031
 Other+0.029
 middle+0.028
 particularly+0.027
+0.027
Neg logit contributions
ro-0.032
uzz-0.028
keeping-0.027
emo-0.026
 cheek-0.026
Token From
Token ID3574
Feature activation+5.88e-01

Pos logit contributions
aw+0.038
 middle+0.034
 particularly+0.034
 Other+0.033
ifying+0.032
Neg logit contributions
ro-0.040
uzz-0.035
emo-0.034
onna-0.032
keeping-0.032
Token that
Token ID326
Feature activation+2.78e-01

Pos logit contributions
aw+0.018
 Other+0.018
 middle+0.016
 particularly+0.016
+0.016
Neg logit contributions
ro-0.031
uzz-0.028
emo-0.027
keeping-0.027
 cheek-0.026
Token day
Token ID1110
Feature activation+3.67e+00

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.013
 particularly+0.013
+0.012
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.007
Token on
Token ID319
Feature activation+1.51e+00

Pos logit contributions
aw+0.180
ement+0.180
pieces+0.170
 Other+0.170
 pasture+0.168
Neg logit contributions
ro-0.086
 cheek-0.078
 moment-0.077
uzz-0.070
 seek-0.070
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
aw+0.081
ifying+0.070
 Other+0.069
bridge+0.067
 middle+0.067
Neg logit contributions
ro-0.054
emo-0.040
onna-0.039
uzz-0.039
 moment-0.038
Token Lily
Token ID20037
Feature activation+4.98e-01

Pos logit contributions
aw+0.062
ement+0.055
 middle+0.054
ifying+0.054
 Other+0.052
Neg logit contributions
ro-0.053
emo-0.046
uzz-0.041
onna-0.041
 L-0.040
Token made
Token ID925
Feature activation+1.70e-01

Pos logit contributions
aw+0.019
 Other+0.017
 middle+0.017
 particularly+0.016
+0.016
Neg logit contributions
ro-0.024
uzz-0.021
emo-0.021
keeping-0.020
onna-0.019
Token sure
Token ID1654
Feature activation-3.50e-01

Pos logit contributions
aw+0.008
 Other+0.007
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.007
uzz-0.006
emo-0.005
keeping-0.005
onna-0.005
Token hard
Token ID1327
Feature activation-4.52e-01

Pos logit contributions
ro+0.021
uzz+0.019
emo+0.018
keeping+0.018
onna+0.017
Neg logit contributions
aw-0.020
 Other-0.019
 middle-0.019
 particularly-0.019
-0.018
Token work
Token ID670
Feature activation-1.23e+00

Pos logit contributions
ro+0.012
uzz+0.011
emo+0.010
keeping+0.009
 cheek+0.009
Neg logit contributions
aw-0.025
 Other-0.025
 middle-0.024
 particularly-0.024
-0.023
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
ro+0.061
uzz+0.057
emo+0.056
 cheek+0.052
onna+0.051
Neg logit contributions
aw-0.043
 particularly-0.036
 Other-0.035
 middle-0.035
 course-0.033
Token From
Token ID3574
Feature activation+1.07e-01

Pos logit contributions
aw+0.065
 particularly+0.064
ifying+0.063
 middle+0.058
 Other+0.057
Neg logit contributions
ro-0.062
uzz-0.056
emo-0.054
keeping-0.053
 cheek-0.047
Token that
Token ID326
Feature activation-2.84e-01

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.003
 particularly+0.002
+0.002
Neg logit contributions
ro-0.006
uzz-0.006
keeping-0.006
emo-0.006
onna-0.005
Token day
Token ID1110
Feature activation+3.66e+00

Pos logit contributions
ro+0.012
uzz+0.010
keeping+0.010
emo+0.010
onna+0.009
Neg logit contributions
aw-0.012
 Other-0.012
 middle-0.012
 particularly-0.012
opl-0.011
Token on
Token ID319
Feature activation+1.54e+00

Pos logit contributions
aw+0.161
 particularly+0.158
 pasture+0.157
pieces+0.156
 Other+0.155
Neg logit contributions
ro-0.105
uzz-0.102
 seek-0.093
 moment-0.091
oys-0.090
Token,
Token ID11
Feature activation+2.18e+00

Pos logit contributions
aw+0.077
 Other+0.072
 particularly+0.072
ifying+0.070
pieces+0.069
Neg logit contributions
ro-0.053
emo-0.044
uzz-0.041
 common-0.041
 seek-0.041
Token Jerry
Token ID13075
Feature activation+5.61e-01

Pos logit contributions
ifying+0.113
aw+0.111
ridge+0.106
 particularly+0.105
 pasture+0.104
Neg logit contributions
ro-0.058
emo-0.055
 seek-0.050
 moment-0.048
 common-0.045
Token became
Token ID2627
Feature activation+1.34e+00

Pos logit contributions
aw+0.023
 Other+0.021
 middle+0.021
 particularly+0.020
+0.019
Neg logit contributions
ro-0.026
uzz-0.022
emo-0.021
keeping-0.021
onna-0.020
Token known
Token ID1900
Feature activation-2.06e-01

Pos logit contributions
aw+0.060
+0.056
 Other+0.056
ifying+0.055
opl+0.054
Neg logit contributions
ro-0.051
emo-0.047
uzz-0.046
 seek-0.042
keeping-0.040
Token helped
Token ID4193
Feature activation+7.85e-01

Pos logit contributions
aw+0.010
 Other+0.009
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.010
onna-0.009
Token him
Token ID683
Feature activation-3.28e-02

Pos logit contributions
aw+0.035
 Other+0.032
 middle+0.031
 particularly+0.030
+0.030
Neg logit contributions
ro-0.033
keeping-0.028
uzz-0.028
emo-0.027
 cheek-0.025
Token.
Token ID13
Feature activation+4.52e-01

Pos logit contributions
ro+0.001
uzz+0.001
keeping+0.001
emo+0.001
 cheek+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token From
Token ID3574
Feature activation+4.89e-01

Pos logit contributions
aw+0.017
 Other+0.015
 middle+0.015
 particularly+0.015
+0.014
Neg logit contributions
ro-0.022
uzz-0.019
emo-0.019
keeping-0.018
onna-0.017
Token that
Token ID326
Feature activation+6.14e-03

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.013
 particularly+0.013
+0.012
Neg logit contributions
ro-0.027
uzz-0.024
emo-0.023
keeping-0.023
 cheek-0.022
Token day
Token ID1110
Feature activation+3.66e+00

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
keeping-0.000
emo-0.000
onna-0.000
Token on
Token ID319
Feature activation+2.05e+00

Pos logit contributions
aw+0.157
 pasture+0.157
 Other+0.147
pieces+0.139
 particularly+0.138
Neg logit contributions
ro-0.109
 moment-0.102
emo-0.101
 itself-0.096
 seek-0.096
Token,
Token ID11
Feature activation+1.49e+00

Pos logit contributions
aw+0.111
 Other+0.098
ifying+0.093
pieces+0.092
 particularly+0.091
Neg logit contributions
ro-0.065
emo-0.055
 common-0.054
 seek-0.052
 moment-0.051
Token Billy
Token ID15890
Feature activation+8.31e-02

Pos logit contributions
aw+0.081
 Other+0.073
 middle+0.073
ifying+0.070
 particularly+0.070
Neg logit contributions
ro-0.044
emo-0.041
keeping-0.035
uzz-0.034
 seek-0.033
Token and
Token ID290
Feature activation-8.65e-02

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.003
 particularly+0.003
+0.002
Neg logit contributions
ro-0.004
uzz-0.004
keeping-0.004
emo-0.004
 cheek-0.003
Token Harry
Token ID5850
Feature activation+6.87e-01

Pos logit contributions
ro+0.004
uzz+0.003
emo+0.003
keeping+0.003
 cheek+0.003
Neg logit contributions
aw-0.004
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
ro+0.000
uzz+0.000
emo+0.000
keeping+0.000
onna+0.000
Neg logit contributions
aw-0.000
 Other-0.000
 middle-0.000
 particularly-0.000
-0.000
Token
Token ID198
Feature activation+1.17e+00

Pos logit contributions
aw+0.070
 particularly+0.065
ifying+0.063
 middle+0.062
 Other+0.061
Neg logit contributions
ro-0.056
emo-0.052
uzz-0.050
keeping-0.048
 itself-0.044
Token
Token ID198
Feature activation+2.22e+00

Pos logit contributions
aw+0.053
 Other+0.048
ifying+0.047
 particularly+0.047
 middle+0.046
Neg logit contributions
ro-0.046
emo-0.042
uzz-0.041
keeping-0.038
onna-0.037
TokenFrom
Token ID4863
Feature activation+3.06e-01

Pos logit contributions
ement+0.094
aw+0.094
 Other+0.090
ifying+0.086
 middle+0.082
Neg logit contributions
ro-0.080
emo-0.079
 itself-0.067
uzz-0.065
keeping-0.063
Token that
Token ID326
Feature activation+5.83e-01

Pos logit contributions
aw+0.009
 Other+0.008
 middle+0.008
 particularly+0.007
+0.007
Neg logit contributions
ro-0.017
uzz-0.015
emo-0.015
keeping-0.015
 cheek-0.014
Token day
Token ID1110
Feature activation+3.57e+00

Pos logit contributions
aw+0.033
 Other+0.029
 middle+0.028
ifying+0.028
 particularly+0.028
Neg logit contributions
ro-0.018
uzz-0.015
emo-0.014
keeping-0.013
 cheek-0.013
Token on
Token ID319
Feature activation+2.28e+00

Pos logit contributions
aw+0.151
pieces+0.144
ement+0.139
 Other+0.138
 particularly+0.138
Neg logit contributions
ro-0.114
uzz-0.104
emo-0.099
 moment-0.098
 itself-0.098
Token,
Token ID11
Feature activation+1.64e+00

Pos logit contributions
aw+0.135
pieces+0.123
ifying+0.123
bridge+0.119
 Other+0.118
Neg logit contributions
ro-0.054
emo-0.046
 moment-0.042
 itself-0.041
 common-0.039
Token Nut
Token ID11959
Feature activation+6.54e-02

Pos logit contributions
aw+0.081
ifying+0.073
 middle+0.072
ement+0.072
 Other+0.071
Neg logit contributions
ro-0.054
emo-0.052
 itself-0.043
uzz-0.042
keeping-0.041
Tokenty
Token ID774
Feature activation+1.15e+00

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.003
 particularly+0.003
+0.002
Neg logit contributions
ro-0.003
uzz-0.002
keeping-0.002
emo-0.002
onna-0.002
Token made
Token ID925
Feature activation+6.25e-01

Pos logit contributions
aw+0.045
 Other+0.043
 middle+0.042
ifying+0.041
 particularly+0.040
Neg logit contributions
ro-0.052
emo-0.043
keeping-0.043
uzz-0.043
onna-0.040
Token motorcycles
Token ID39404
Feature activation+1.20e+00

Pos logit contributions
ro+0.020
uzz+0.017
emo+0.016
keeping+0.015
 cheek+0.015
Neg logit contributions
aw-0.019
 Other-0.018
 particularly-0.017
 middle-0.017
-0.016
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
aw+0.055
 Other+0.054
 middle+0.052
Eventually+0.050
+0.050
Neg logit contributions
ro-0.045
uzz-0.038
keeping-0.036
emo-0.034
 cheek-0.033
Token He
Token ID679
Feature activation+1.44e+00

Pos logit contributions
aw+0.007
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.010
onna-0.010
Token had
Token ID550
Feature activation+2.20e+00

Pos logit contributions
aw+0.067
 Other+0.063
+0.062
ifying+0.061
 middle+0.061
Neg logit contributions
ro-0.057
emo-0.046
uzz-0.045
 seek-0.041
onna-0.040
Token a
Token ID257
Feature activation+2.57e+00

Pos logit contributions
+0.111
ement+0.103
aw+0.102
ishly+0.101
 Other+0.097
Neg logit contributions
ro-0.069
uzz-0.058
emo-0.058
keeping-0.056
icated-0.053
Token toy
Token ID13373
Feature activation+3.56e+00

Pos logit contributions
aw+0.145
 Other+0.124
+0.122
agged+0.122
pped+0.121
Neg logit contributions
ro-0.072
 common-0.065
 moment-0.062
 cheek-0.060
emo-0.060
Token motorcycle
Token ID18757
Feature activation+1.87e+00

Pos logit contributions
aw+0.171
Eventually+0.171
Hours+0.163
 Other+0.162
 Eventually+0.160
Neg logit contributions
ro-0.119
 cheek-0.090
 ante-0.084
guard-0.083
uzz-0.081
Token that
Token ID326
Feature activation+1.16e-01

Pos logit contributions
 Other+0.076
aw+0.074
Eventually+0.070
 middle+0.069
+0.069
Neg logit contributions
ro-0.084
uzz-0.066
keeping-0.061
 cheek-0.061
onna-0.061
Token he
Token ID339
Feature activation+8.88e-01

Pos logit contributions
aw+0.004
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.006
uzz-0.005
keeping-0.005
emo-0.005
 cheek-0.004
Token would
Token ID561
Feature activation+1.16e+00

Pos logit contributions
aw+0.037
 Other+0.035
 middle+0.033
+0.032
opl+0.031
Neg logit contributions
ro-0.040
emo-0.034
uzz-0.034
keeping-0.032
onna-0.030
Token ride
Token ID6594
Feature activation+2.43e-01

Pos logit contributions
aw+0.059
 Other+0.054
 middle+0.051
+0.050
ement+0.049
Neg logit contributions
ro-0.044
uzz-0.035
emo-0.033
onna-0.033
 cheek-0.032
Token helicopters
Token ID23890
Feature activation+1.19e+00

Pos logit contributions
ro+0.010
uzz+0.009
emo+0.008
keeping+0.008
 cheek+0.008
Neg logit contributions
aw-0.011
 Other-0.010
 particularly-0.010
 middle-0.010
-0.009
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
aw+0.056
 Other+0.055
+0.052
ifying+0.052
 middle+0.051
Neg logit contributions
ro-0.045
uzz-0.037
emo-0.034
keeping-0.033
 cheek-0.032
Token He
Token ID679
Feature activation+9.64e-01

Pos logit contributions
ro+0.016
keeping+0.015
uzz+0.015
emo+0.014
 cheek+0.014
Neg logit contributions
 Other-0.008
aw-0.008
 middle-0.007
 particularly-0.007
-0.007
Token had
Token ID550
Feature activation+1.63e+00

Pos logit contributions
aw+0.040
 Other+0.037
+0.036
ifying+0.035
 middle+0.035
Neg logit contributions
ro-0.043
emo-0.037
uzz-0.036
keeping-0.033
onna-0.033
Token a
Token ID257
Feature activation+2.31e+00

Pos logit contributions
+0.079
ement+0.077
aw+0.076
 Other+0.072
ishly+0.071
Neg logit contributions
ro-0.054
emo-0.046
uzz-0.046
keeping-0.045
 seek-0.041
Token toy
Token ID13373
Feature activation+3.55e+00

Pos logit contributions
aw+0.128
 Other+0.110
ement+0.109
ifying+0.106
opl+0.106
Neg logit contributions
ro-0.068
 common-0.061
emo-0.060
 cheek-0.060
 moment-0.056
Token helicopter
Token ID15392
Feature activation+1.50e+00

Pos logit contributions
aw+0.135
Eventually+0.130
 Eventually+0.121
oken+0.121
 Other+0.121
Neg logit contributions
ro-0.156
 cheek-0.135
guard-0.124
uzz-0.123
emo-0.120
Token that
Token ID326
Feature activation-2.43e-01

Pos logit contributions
 Other+0.066
aw+0.063
Eventually+0.059
+0.058
 middle+0.058
Neg logit contributions
ro-0.067
uzz-0.053
keeping-0.050
emo-0.049
 cheek-0.049
Token he
Token ID339
Feature activation+3.69e-01

Pos logit contributions
ro+0.012
uzz+0.011
keeping+0.011
emo+0.010
 cheek+0.010
Neg logit contributions
aw-0.008
 Other-0.008
 middle-0.008
 particularly-0.008
-0.007
Token played
Token ID2826
Feature activation+3.91e-02

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.012
 particularly+0.012
+0.012
Neg logit contributions
ro-0.018
uzz-0.016
emo-0.015
keeping-0.015
onna-0.014
Token with
Token ID351
Feature activation+9.32e-01

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.001
uzz-0.001
keeping-0.001
emo-0.001
onna-0.001
Token.
Token ID13
Feature activation-3.00e-01

Pos logit contributions
ro+0.022
uzz+0.019
emo+0.019
keeping+0.018
onna+0.018
Neg logit contributions
aw-0.017
 Other-0.016
 middle-0.015
 particularly-0.015
-0.014
Token
Token ID198
Feature activation-9.09e-01

Pos logit contributions
ro+0.016
uzz+0.014
keeping+0.014
emo+0.014
 cheek+0.014
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.008
 particularly-0.008
-0.008
Token
Token ID198
Feature activation+1.26e-01

Pos logit contributions
ro+0.050
keeping+0.043
uzz+0.043
 cheek+0.042
 textures+0.041
Neg logit contributions
aw-0.028
 Other-0.027
 middle-0.026
 particularly-0.024
Eventually-0.023
TokenFrom
Token ID4863
Feature activation+5.62e-02

Pos logit contributions
aw+0.004
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.006
uzz-0.006
emo-0.006
keeping-0.005
onna-0.005
Token that
Token ID326
Feature activation+3.33e-01

Pos logit contributions
aw+0.002
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.003
uzz-0.003
keeping-0.003
emo-0.003
onna-0.003
Token day
Token ID1110
Feature activation+3.53e+00

Pos logit contributions
aw+0.017
 Other+0.016
 middle+0.016
 particularly+0.015
+0.015
Neg logit contributions
ro-0.012
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token on
Token ID319
Feature activation+9.36e-01

Pos logit contributions
aw+0.160
pieces+0.148
 pasture+0.145
 Other+0.144
ement+0.140
Neg logit contributions
ro-0.114
uzz-0.099
emo-0.096
 moment-0.093
 cheek-0.092
Token,
Token ID11
Feature activation+9.15e-01

Pos logit contributions
aw+0.043
 Other+0.038
 particularly+0.037
ifying+0.036
 middle+0.036
Neg logit contributions
ro-0.040
emo-0.034
uzz-0.033
onna-0.030
keeping-0.030
Token Jolly
Token ID40276
Feature activation+4.99e-01

Pos logit contributions
aw+0.045
 Other+0.042
 middle+0.041
 particularly+0.041
+0.040
Neg logit contributions
ro-0.033
emo-0.028
uzz-0.026
keeping-0.024
onna-0.024
Token and
Token ID290
Feature activation-2.68e-01

Pos logit contributions
aw+0.022
 Other+0.021
 middle+0.020
 particularly+0.019
+0.019
Neg logit contributions
ro-0.021
uzz-0.018
emo-0.018
keeping-0.017
onna-0.016
Token his
Token ID465
Feature activation+1.33e+00

Pos logit contributions
ro+0.012
uzz+0.011
keeping+0.010
 cheek+0.010
emo+0.010
Neg logit contributions
aw-0.010
 Other-0.010
 middle-0.010
 particularly-0.009
 course-0.009
Token nut
Token ID6701
Feature activation-1.03e+00

Pos logit contributions
ro+0.039
uzz+0.035
keeping+0.033
emo+0.031
onna+0.031
Neg logit contributions
aw-0.029
 Other-0.028
 middle-0.027
 course-0.025
 particularly-0.025
Token together
Token ID1978
Feature activation+5.81e-02

Pos logit contributions
ro+0.050
uzz+0.042
emo+0.041
keeping+0.039
onna+0.039
Neg logit contributions
aw-0.039
 Other-0.036
 middle-0.035
 particularly-0.033
-0.032
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.003
uzz-0.002
emo-0.002
keeping-0.002
onna-0.002
Token From
Token ID3574
Feature activation+5.63e-01

Pos logit contributions
aw+0.078
 particularly+0.071
ifying+0.070
 middle+0.068
ement+0.067
Neg logit contributions
ro-0.068
emo-0.063
uzz-0.061
keeping-0.057
 itself-0.056
Token that
Token ID326
Feature activation-2.51e-02

Pos logit contributions
aw+0.017
 Other+0.017
 middle+0.016
 particularly+0.015
+0.015
Neg logit contributions
ro-0.030
uzz-0.027
emo-0.026
keeping-0.026
 cheek-0.025
Token day
Token ID1110
Feature activation+3.51e+00

Pos logit contributions
ro+0.001
uzz+0.001
keeping+0.001
emo+0.001
onna+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token on
Token ID319
Feature activation+1.69e+00

Pos logit contributions
aw+0.151
pieces+0.140
 pasture+0.137
ement+0.135
 particularly+0.134
Neg logit contributions
ro-0.114
uzz-0.107
 seek-0.101
emo-0.101
 moment-0.099
Token,
Token ID11
Feature activation+1.60e+00

Pos logit contributions
aw+0.092
ifying+0.081
pieces+0.081
 Other+0.080
 middle+0.077
Neg logit contributions
ro-0.053
emo-0.046
uzz-0.041
 itself-0.039
 moment-0.039
Token Nut
Token ID11959
Feature activation-1.11e-01

Pos logit contributions
aw+0.080
ifying+0.070
 middle+0.069
ement+0.069
 Other+0.069
Neg logit contributions
ro-0.054
emo-0.053
uzz-0.045
 itself-0.043
keeping-0.041
Tokenty
Token ID774
Feature activation+8.56e-01

Pos logit contributions
ro+0.005
uzz+0.004
keeping+0.004
emo+0.004
onna+0.004
Neg logit contributions
aw-0.005
 Other-0.004
 particularly-0.004
 middle-0.004
opl-0.004
Token knew
Token ID2993
Feature activation+4.53e-01

Pos logit contributions
aw+0.035
 Other+0.033
 middle+0.032
ifying+0.031
 particularly+0.031
Neg logit contributions
ro-0.038
emo-0.032
uzz-0.032
keeping-0.031
onna-0.030
Token helicopters
Token ID23890
Feature activation+1.32e+00

Pos logit contributions
ro+0.011
uzz+0.010
emo+0.009
keeping+0.009
 cheek+0.009
Neg logit contributions
aw-0.012
 Other-0.011
 particularly-0.011
 middle-0.011
-0.010
Token.
Token ID13
Feature activation-2.02e-01

Pos logit contributions
aw+0.065
 Other+0.063
Eventually+0.061
+0.060
 middle+0.060
Neg logit contributions
ro-0.046
uzz-0.038
 cheek-0.034
emo-0.034
keeping-0.034
Token He
Token ID679
Feature activation+8.55e-01

Pos logit contributions
ro+0.011
uzz+0.010
keeping+0.010
emo+0.010
 cheek+0.010
Neg logit contributions
aw-0.006
 Other-0.006
 middle-0.005
 particularly-0.005
-0.005
Token had
Token ID550
Feature activation+1.42e+00

Pos logit contributions
aw+0.036
 Other+0.034
+0.032
 middle+0.032
ifying+0.031
Neg logit contributions
ro-0.038
uzz-0.032
emo-0.032
keeping-0.029
onna-0.029
Token a
Token ID257
Feature activation+1.97e+00

Pos logit contributions
+0.068
aw+0.066
ement+0.065
 Other+0.064
opl+0.062
Neg logit contributions
ro-0.048
uzz-0.041
emo-0.040
keeping-0.039
 itself-0.036
Token toy
Token ID13373
Feature activation+3.51e+00

Pos logit contributions
aw+0.105
 Other+0.093
opl+0.089
ement+0.087
 Eventually+0.087
Neg logit contributions
ro-0.064
 cheek-0.055
emo-0.053
 common-0.053
 seek-0.049
Token helicopter
Token ID15392
Feature activation+1.25e+00

Pos logit contributions
aw+0.126
Eventually+0.120
oken+0.115
 Other+0.113
 Eventually+0.112
Neg logit contributions
ro-0.159
 cheek-0.142
uzz-0.129
guard-0.128
emo-0.122
Token that
Token ID326
Feature activation-1.62e-01

Pos logit contributions
 Other+0.052
aw+0.051
Eventually+0.046
 middle+0.046
+0.046
Neg logit contributions
ro-0.058
uzz-0.048
keeping-0.046
emo-0.044
 cheek-0.043
Token he
Token ID339
Feature activation+4.08e-01

Pos logit contributions
ro+0.008
uzz+0.007
keeping+0.007
emo+0.007
 cheek+0.007
Neg logit contributions
aw-0.006
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token played
Token ID2826
Feature activation+1.52e-01

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.014
 particularly+0.013
+0.013
Neg logit contributions
ro-0.020
uzz-0.017
emo-0.017
keeping-0.016
onna-0.016
Token with
Token ID351
Feature activation+7.51e-01

Pos logit contributions
aw+0.008
 Other+0.007
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.005
uzz-0.005
emo-0.004
keeping-0.004
onna-0.004
Token good
Token ID922
Feature activation+6.31e-01

Pos logit contributions
aw+0.032
 Other+0.030
+0.029
 middle+0.028
 particularly+0.028
Neg logit contributions
ro-0.037
uzz-0.035
emo-0.035
keeping-0.033
 seek-0.031
Token friends
Token ID2460
Feature activation+9.98e-01

Pos logit contributions
aw+0.022
 Other+0.021
 particularly+0.020
 middle+0.020
+0.019
Neg logit contributions
ro-0.032
uzz-0.028
keeping-0.028
emo-0.027
 cheek-0.025
Token.
Token ID13
Feature activation+8.20e-01

Pos logit contributions
aw+0.046
 Other+0.046
 middle+0.044
 particularly+0.043
+0.042
Neg logit contributions
ro-0.037
uzz-0.032
keeping-0.031
 cheek-0.029
emo-0.029
Token From
Token ID3574
Feature activation+4.33e-01

Pos logit contributions
aw+0.033
 particularly+0.029
 middle+0.029
 Other+0.029
ifying+0.028
Neg logit contributions
ro-0.037
uzz-0.033
emo-0.032
keeping-0.031
onna-0.030
Token that
Token ID326
Feature activation+2.91e-01

Pos logit contributions
aw+0.013
 Other+0.013
 middle+0.011
 particularly+0.011
+0.011
Neg logit contributions
ro-0.024
uzz-0.021
emo-0.021
keeping-0.021
 cheek-0.020
Token day
Token ID1110
Feature activation+3.51e+00

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.013
 particularly+0.013
+0.013
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.007
Token on
Token ID319
Feature activation+1.71e+00

Pos logit contributions
aw+0.170
 pasture+0.169
ement+0.168
pieces+0.160
ifying+0.159
Neg logit contributions
ro-0.078
 seek-0.077
 itself-0.073
uzz-0.073
emo-0.071
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
aw+0.092
ifying+0.082
 Other+0.078
 particularly+0.076
 middle+0.076
Neg logit contributions
ro-0.056
emo-0.045
 seek-0.045
uzz-0.044
 moment-0.044
Token Max
Token ID5436
Feature activation+3.31e-01

Pos logit contributions
aw+0.058
 middle+0.052
ifying+0.051
 Other+0.051
 particularly+0.050
Neg logit contributions
ro-0.033
uzz-0.027
emo-0.027
 seek-0.023
keeping-0.023
Token always
Token ID1464
Feature activation-9.97e-02

Pos logit contributions
aw+0.012
 Other+0.012
 middle+0.011
 particularly+0.011
+0.010
Neg logit contributions
ro-0.016
uzz-0.014
emo-0.014
keeping-0.013
onna-0.013
Token replied
Token ID8712
Feature activation+9.15e-01

Pos logit contributions
ro+0.005
uzz+0.005
emo+0.004
keeping+0.004
onna+0.004
Neg logit contributions
aw-0.003
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Token very
Token ID845
Feature activation-3.43e-01

Pos logit contributions
aw+0.007
 Other+0.007
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
onna-0.007
Token happy
Token ID3772
Feature activation-1.76e-01

Pos logit contributions
ro+0.016
keeping+0.013
uzz+0.013
emo+0.013
onna+0.012
Neg logit contributions
aw-0.014
 Other-0.013
 particularly-0.013
 middle-0.013
-0.012
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
ro+0.008
uzz+0.007
emo+0.007
keeping+0.007
onna+0.006
Neg logit contributions
aw-0.007
 Other-0.007
 particularly-0.006
 middle-0.006
-0.006
Token From
Token ID3574
Feature activation+3.36e-01

Pos logit contributions
 particularly+0.074
aw+0.073
ifying+0.069
 middle+0.067
+0.065
Neg logit contributions
ro-0.056
uzz-0.052
emo-0.045
keeping-0.044
 itself-0.040
Token that
Token ID326
Feature activation+2.71e-01

Pos logit contributions
aw+0.010
 Other+0.009
 middle+0.008
 particularly+0.008
+0.008
Neg logit contributions
ro-0.019
uzz-0.017
emo-0.017
keeping-0.016
 cheek-0.016
Token day
Token ID1110
Feature activation+3.49e+00

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.013
 particularly+0.012
+0.012
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.007
 cheek-0.007
Token on
Token ID319
Feature activation+1.85e+00

Pos logit contributions
 pasture+0.169
aw+0.168
opl+0.166
 particularly+0.166
ns+0.164
Neg logit contributions
uzz-0.072
 moment-0.071
oys-0.068
ro-0.067
 from-0.065
Token,
Token ID11
Feature activation+1.78e+00

Pos logit contributions
aw+0.099
 Other+0.090
ifying+0.090
 particularly+0.088
opl+0.086
Neg logit contributions
ro-0.057
 moment-0.045
 seek-0.044
uzz-0.043
 common-0.043
Token Sue
Token ID26089
Feature activation+6.58e-01

Pos logit contributions
aw+0.112
ifying+0.110
opl+0.106
 particularly+0.105
+0.105
Neg logit contributions
ro-0.034
uzz-0.028
emo-0.023
 seek-0.021
 moment-0.020
Token knew
Token ID2993
Feature activation+4.41e-01

Pos logit contributions
aw+0.026
 Other+0.025
 middle+0.024
 particularly+0.023
+0.022
Neg logit contributions
ro-0.030
uzz-0.026
emo-0.025
keeping-0.025
onna-0.024
Token she
Token ID673
Feature activation+5.23e-01

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.012
 particularly+0.012
+0.012
Neg logit contributions
ro-0.024
uzz-0.021
emo-0.021
keeping-0.021
onna-0.020
Token gray
Token ID12768
Feature activation-6.70e-01

Pos logit contributions
ro+0.043
emo+0.040
uzz+0.039
keeping+0.038
onna+0.036
Neg logit contributions
 middle-0.034
 Other-0.033
 particularly-0.033
aw-0.031
 course-0.030
Token cloth
Token ID16270
Feature activation-7.31e-01

Pos logit contributions
ro+0.027
emo+0.025
uzz+0.025
keeping+0.024
onna+0.023
Neg logit contributions
aw-0.028
 Other-0.027
 middle-0.027
 particularly-0.026
-0.025
Token.
Token ID13
Feature activation+7.80e-01

Pos logit contributions
ro+0.034
emo+0.030
uzz+0.030
keeping+0.029
onna+0.027
Neg logit contributions
aw-0.028
 Other-0.026
 particularly-0.025
 middle-0.025
-0.024
Token From
Token ID3574
Feature activation+5.79e-02

Pos logit contributions
aw+0.027
 particularly+0.024
 Other+0.024
 middle+0.024
ifying+0.022
Neg logit contributions
ro-0.039
uzz-0.035
emo-0.034
keeping-0.034
onna-0.032
Token that
Token ID326
Feature activation-6.03e-01

Pos logit contributions
aw+0.002
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.003
uzz-0.003
emo-0.003
keeping-0.003
onna-0.003
Token day
Token ID1110
Feature activation+3.46e+00

Pos logit contributions
ro+0.028
keeping+0.024
uzz+0.024
emo+0.024
onna+0.021
Neg logit contributions
 Other-0.024
 middle-0.023
 particularly-0.023
aw-0.023
 course-0.022
Token onwards
Token ID31719
Feature activation+1.22e+00

Pos logit contributions
 Other+0.157
aw+0.151
 pasture+0.146
opl+0.141
ns+0.140
Neg logit contributions
ro-0.101
uzz-0.091
 moment-0.089
 cheek-0.086
 from-0.083
Token,
Token ID11
Feature activation+2.47e+00

Pos logit contributions
aw+0.051
 Other+0.051
 particularly+0.048
opl+0.047
ifying+0.046
Neg logit contributions
ro-0.051
keeping-0.043
uzz-0.042
emo-0.042
 cheek-0.039
Token Sandra
Token ID32464
Feature activation+3.96e-01

Pos logit contributions
 Other+0.129
ifying+0.127
aw+0.126
lers+0.125
+0.122
Neg logit contributions
ro-0.062
 moment-0.054
emo-0.053
 common-0.051
uzz-0.051
Token always
Token ID1464
Feature activation-2.17e-01

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.014
 particularly+0.013
+0.013
Neg logit contributions
ro-0.019
uzz-0.016
emo-0.016
keeping-0.016
onna-0.015
Token enjoyed
Token ID8359
Feature activation+9.58e-02

Pos logit contributions
ro+0.011
uzz+0.009
emo+0.009
keeping+0.009
onna+0.009
Neg logit contributions
aw-0.008
 Other-0.007
 particularly-0.007
 middle-0.007
-0.007
Token river
Token ID7850
Feature activation+7.54e-02

Pos logit contributions
aw+0.037
 Other+0.035
+0.033
 particularly+0.032
ifying+0.032
Neg logit contributions
ro-0.034
 cheek-0.028
keeping-0.027
emo-0.027
uzz-0.026
Tokenbank
Token ID17796
Feature activation+2.79e-01

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.003
 particularly+0.003
+0.003
Neg logit contributions
ro-0.003
uzz-0.003
emo-0.003
keeping-0.003
onna-0.003
Token.
Token ID13
Feature activation+8.54e-01

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.011
 particularly+0.010
+0.010
Neg logit contributions
ro-0.012
uzz-0.010
keeping-0.010
emo-0.010
onna-0.009
Token From
Token ID3574
Feature activation+3.72e-01

Pos logit contributions
aw+0.036
 particularly+0.033
 Other+0.033
 middle+0.032
ifying+0.031
Neg logit contributions
ro-0.037
uzz-0.032
keeping-0.032
emo-0.031
onna-0.028
Token that
Token ID326
Feature activation+5.45e-02

Pos logit contributions
aw+0.011
 Other+0.010
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.021
uzz-0.019
emo-0.018
keeping-0.018
 cheek-0.017
Token day
Token ID1110
Feature activation+3.41e+00

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.002
keeping-0.002
emo-0.002
onna-0.002
Token on
Token ID319
Feature activation+1.72e+00

Pos logit contributions
aw+0.145
 Other+0.137
pieces+0.135
opl+0.129
ement+0.127
Neg logit contributions
ro-0.113
uzz-0.108
 moment-0.107
 cheek-0.099
 seek-0.097
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
aw+0.085
 Other+0.078
pieces+0.074
ifying+0.073
opl+0.073
Neg logit contributions
ro-0.061
uzz-0.051
keeping-0.049
 moment-0.048
emo-0.047
Token Gig
Token ID19525
Feature activation-1.25e-02

Pos logit contributions
aw+0.051
 Other+0.047
 middle+0.045
 particularly+0.045
opl+0.044
Neg logit contributions
ro-0.036
uzz-0.030
emo-0.030
keeping-0.029
onna-0.026
Tokeni
Token ID72
Feature activation+9.96e-01

Pos logit contributions
ro+0.001
uzz+0.000
keeping+0.000
emo+0.000
onna+0.000
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.000
 particularly-0.000
-0.000
Token knew
Token ID2993
Feature activation+6.73e-01

Pos logit contributions
aw+0.044
 Other+0.042
 middle+0.041
 particularly+0.039
ifying+0.039
Neg logit contributions
ro-0.041
uzz-0.034
emo-0.034
keeping-0.033
onna-0.031
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
aw+0.009
 Other+0.008
 middle+0.008
 particularly+0.007
+0.007
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.008
onna-0.008
Token
Token ID198
Feature activation+6.42e-01

Pos logit contributions
aw+0.056
 particularly+0.052
ifying+0.050
ement+0.048
 middle+0.048
Neg logit contributions
ro-0.058
uzz-0.051
emo-0.050
onna-0.045
keeping-0.045
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
aw+0.027
 Other+0.024
 middle+0.023
 particularly+0.023
+0.023
Neg logit contributions
ro-0.028
uzz-0.025
emo-0.024
keeping-0.023
onna-0.023
TokenFrom
Token ID4863
Feature activation+3.50e-01

Pos logit contributions
aw+0.046
 Other+0.044
ement+0.042
 middle+0.041
opl+0.040
Neg logit contributions
ro-0.047
emo-0.039
uzz-0.038
 seek-0.036
onna-0.035
Token that
Token ID326
Feature activation+3.30e-01

Pos logit contributions
aw+0.010
 Other+0.010
 middle+0.009
 particularly+0.009
+0.008
Neg logit contributions
ro-0.020
uzz-0.018
emo-0.017
keeping-0.017
 cheek-0.016
Token day
Token ID1110
Feature activation+3.41e+00

Pos logit contributions
aw+0.017
 Other+0.016
 middle+0.015
 particularly+0.015
+0.015
Neg logit contributions
ro-0.012
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token on
Token ID319
Feature activation+1.28e+00

Pos logit contributions
aw+0.153
pieces+0.140
ement+0.139
 Other+0.134
 particularly+0.132
Neg logit contributions
ro-0.115
uzz-0.100
 seek-0.095
 cheek-0.094
 moment-0.091
Token,
Token ID11
Feature activation+1.34e+00

Pos logit contributions
aw+0.066
ifying+0.057
 Other+0.056
opl+0.055
 particularly+0.055
Neg logit contributions
ro-0.048
uzz-0.038
emo-0.038
onna-0.036
 seek-0.035
Token Mitt
Token ID16627
Feature activation+2.02e-01

Pos logit contributions
aw+0.061
ement+0.055
 middle+0.053
opl+0.052
 Other+0.052
Neg logit contributions
ro-0.054
emo-0.046
uzz-0.042
 L-0.041
 seek-0.040
Tokenens
Token ID641
Feature activation+6.31e-01

Pos logit contributions
aw+0.009
 Other+0.009
 middle+0.009
 particularly+0.009
+0.008
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.006
keeping-0.006
onna-0.006
Token and
Token ID290
Feature activation+2.60e-02

Pos logit contributions
aw+0.027
 Other+0.025
 middle+0.024
 particularly+0.024
ifying+0.023
Neg logit contributions
ro-0.028
uzz-0.023
emo-0.023
keeping-0.022
onna-0.021
Token right
Token ID826
Feature activation+5.27e-01

Pos logit contributions
ro+0.006
uzz+0.005
keeping+0.005
emo+0.005
onna+0.005
Neg logit contributions
aw-0.005
 Other-0.005
 middle-0.005
 particularly-0.004
-0.004
Token thing
Token ID1517
Feature activation+2.00e-01

Pos logit contributions
aw+0.029
 Other+0.027
 particularly+0.026
 middle+0.026
+0.026
Neg logit contributions
ro-0.017
uzz-0.014
keeping-0.014
emo-0.013
 cheek-0.012
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.007
keeping-0.007
onna-0.007
Token From
Token ID3574
Feature activation+5.73e-01

Pos logit contributions
aw+0.054
 particularly+0.049
 middle+0.049
ifying+0.048
 Other+0.048
Neg logit contributions
ro-0.052
uzz-0.046
emo-0.046
keeping-0.044
onna-0.041
Token that
Token ID326
Feature activation-4.80e-02

Pos logit contributions
 Other+0.018
aw+0.018
 middle+0.016
 particularly+0.016
+0.015
Neg logit contributions
ro-0.030
uzz-0.027
emo-0.027
keeping-0.026
 cheek-0.025
Token day
Token ID1110
Feature activation+3.40e+00

Pos logit contributions
ro+0.002
uzz+0.002
keeping+0.002
emo+0.002
onna+0.001
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token on
Token ID319
Feature activation+1.58e+00

Pos logit contributions
aw+0.149
 Other+0.136
 pasture+0.135
 particularly+0.127
ement+0.127
Neg logit contributions
ro-0.109
 moment-0.102
 cheek-0.099
uzz-0.099
 seek-0.098
Token,
Token ID11
Feature activation+1.35e+00

Pos logit contributions
aw+0.082
 Other+0.073
ifying+0.072
 middle+0.069
 particularly+0.068
Neg logit contributions
ro-0.055
emo-0.044
uzz-0.042
onna-0.041
keeping-0.041
Token Nut
Token ID11959
Feature activation-2.13e-01

Pos logit contributions
aw+0.063
ifying+0.057
 Other+0.057
 middle+0.057
ement+0.055
Neg logit contributions
ro-0.050
emo-0.045
uzz-0.041
keeping-0.040
 seek-0.038
Tokenty
Token ID774
Feature activation+5.61e-01

Pos logit contributions
ro+0.009
uzz+0.008
keeping+0.008
emo+0.008
onna+0.007
Neg logit contributions
aw-0.009
 Other-0.008
 particularly-0.008
 middle-0.008
opl-0.007
Token always
Token ID1464
Feature activation-3.69e-01

Pos logit contributions
aw+0.022
 Other+0.021
 middle+0.020
 particularly+0.019
+0.019
Neg logit contributions
ro-0.026
uzz-0.022
emo-0.022
keeping-0.022
onna-0.021
Token of
Token ID286
Feature activation-1.91e-02

Pos logit contributions
aw+0.031
 Other+0.031
 middle+0.030
 particularly+0.029
ifying+0.029
Neg logit contributions
ro-0.048
emo-0.040
uzz-0.040
keeping-0.038
 cheek-0.038
Token fun
Token ID1257
Feature activation+8.91e-01

Pos logit contributions
ro+0.001
uzz+0.001
keeping+0.001
emo+0.001
onna+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
aw+0.042
 Other+0.040
 middle+0.039
 particularly+0.039
ifying+0.037
Neg logit contributions
ro-0.034
uzz-0.028
keeping-0.027
emo-0.027
 cheek-0.024
Token From
Token ID3574
Feature activation+5.18e-01

Pos logit contributions
aw+0.078
 particularly+0.073
ement+0.070
 middle+0.070
ifying+0.070
Neg logit contributions
ro-0.050
emo-0.039
uzz-0.039
icated-0.036
onna-0.033
Token that
Token ID326
Feature activation+1.24e-01

Pos logit contributions
aw+0.016
 Other+0.016
 particularly+0.014
 middle+0.014
+0.014
Neg logit contributions
ro-0.028
uzz-0.025
emo-0.024
keeping-0.023
 cheek-0.023
Token day
Token ID1110
Feature activation+3.40e+00

Pos logit contributions
aw+0.006
 Other+0.006
 middle+0.006
 particularly+0.005
+0.005
Neg logit contributions
ro-0.005
uzz-0.004
emo-0.004
keeping-0.004
onna-0.003
Token on
Token ID319
Feature activation+1.73e+00

Pos logit contributions
aw+0.146
 particularly+0.135
pieces+0.135
 pasture+0.133
ement+0.129
Neg logit contributions
ro-0.113
uzz-0.102
 seek-0.098
 cheek-0.097
 moment-0.096
Token,
Token ID11
Feature activation+1.38e+00

Pos logit contributions
aw+0.094
ifying+0.086
 particularly+0.082
pieces+0.082
 Other+0.080
Neg logit contributions
ro-0.054
emo-0.040
 common-0.039
 L-0.039
uzz-0.038
Token Max
Token ID5436
Feature activation+2.06e-01

Pos logit contributions
aw+0.077
ement+0.068
ifying+0.068
 middle+0.068
 particularly+0.067
Neg logit contributions
ro-0.041
emo-0.032
uzz-0.030
icated-0.029
 seek-0.028
Token and
Token ID290
Feature activation+2.69e-01

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
onna-0.007
Token Luna
Token ID23694
Feature activation+4.06e-01

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.011
 particularly+0.011
+0.010
Neg logit contributions
ro-0.011
uzz-0.010
emo-0.009
keeping-0.009
onna-0.009
Token fun
Token ID1257
Feature activation+1.07e+00

Pos logit contributions
aw+0.036
 Other+0.033
 middle+0.031
opl+0.031
ifying+0.029
Neg logit contributions
ro-0.050
emo-0.044
uzz-0.043
 cheek-0.042
 seek-0.041
Token day
Token ID1110
Feature activation+1.56e+00

Pos logit contributions
aw+0.054
 Other+0.052
ifying+0.050
 middle+0.049
 particularly+0.049
Neg logit contributions
ro-0.037
uzz-0.030
emo-0.029
keeping-0.027
 seek-0.027
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
 Other+0.060
aw+0.058
 middle+0.055
 particularly+0.055
Eventually+0.054
Neg logit contributions
ro-0.066
uzz-0.059
keeping-0.058
emo-0.053
 seek-0.053
Token From
Token ID3574
Feature activation+4.23e-01

Pos logit contributions
aw+0.049
 particularly+0.047
 Other+0.044
 middle+0.044
ifying+0.043
Neg logit contributions
ro-0.040
uzz-0.035
emo-0.035
keeping-0.031
onna-0.030
Token that
Token ID326
Feature activation+3.86e-01

Pos logit contributions
aw+0.013
 Other+0.012
 particularly+0.011
 middle+0.011
+0.011
Neg logit contributions
ro-0.023
uzz-0.021
emo-0.020
keeping-0.020
 cheek-0.019
Token day
Token ID1110
Feature activation+3.39e+00

Pos logit contributions
aw+0.020
 Other+0.019
 middle+0.018
 particularly+0.018
+0.018
Neg logit contributions
ro-0.013
uzz-0.011
emo-0.010
keeping-0.010
 cheek-0.009
Token on
Token ID319
Feature activation+1.68e+00

Pos logit contributions
 pasture+0.154
 particularly+0.152
aw+0.148
pieces+0.146
 Other+0.146
Neg logit contributions
ro-0.091
uzz-0.088
oys-0.084
 L-0.077
 seek-0.076
Token,
Token ID11
Feature activation+8.52e-01

Pos logit contributions
aw+0.089
 particularly+0.086
ifying+0.085
 Other+0.085
pieces+0.083
Neg logit contributions
ro-0.050
emo-0.039
 common-0.037
uzz-0.035
 L-0.034
Token they
Token ID484
Feature activation+1.30e+00

Pos logit contributions
aw+0.047
 Other+0.044
 particularly+0.044
 middle+0.044
ifying+0.043
Neg logit contributions
ro-0.025
emo-0.020
uzz-0.019
keeping-0.016
onna-0.016
Token became
Token ID2627
Feature activation+1.63e+00

Pos logit contributions
 Other+0.050
aw+0.050
 middle+0.046
ement+0.045
ifying+0.044
Neg logit contributions
ro-0.059
emo-0.051
uzz-0.051
keeping-0.048
onna-0.046
Token good
Token ID922
Feature activation+1.45e+00

Pos logit contributions
aw+0.070
ifying+0.068
ement+0.067
 Other+0.067
+0.066
Neg logit contributions
ro-0.061
emo-0.059
uzz-0.054
 seek-0.051
 common-0.049
Token each
Token ID1123
Feature activation+1.39e+00

Pos logit contributions
aw+0.054
 middle+0.048
ement+0.047
+0.047
 particularly+0.047
Neg logit contributions
ro-0.036
uzz-0.032
emo-0.030
keeping-0.030
onna-0.028
Token other
Token ID584
Feature activation-4.03e-01

Pos logit contributions
aw+0.061
+0.058
ifying+0.054
ement+0.052
 Other+0.051
Neg logit contributions
ro-0.054
uzz-0.046
emo-0.045
keeping-0.045
 cheek-0.044
Token.
Token ID13
Feature activation+5.05e-02

Pos logit contributions
ro+0.019
uzz+0.017
emo+0.017
keeping+0.016
onna+0.016
Neg logit contributions
aw-0.015
 Other-0.014
 middle-0.014
 particularly-0.013
-0.013
Token From
Token ID3574
Feature activation+4.72e-01

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.001
Neg logit contributions
ro-0.003
uzz-0.002
keeping-0.002
emo-0.002
 cheek-0.002
Token that
Token ID326
Feature activation+2.49e-01

Pos logit contributions
aw+0.014
 Other+0.014
 particularly+0.013
 middle+0.013
+0.012
Neg logit contributions
ro-0.026
uzz-0.023
emo-0.023
keeping-0.022
 cheek-0.021
Token day
Token ID1110
Feature activation+3.39e+00

Pos logit contributions
aw+0.013
 Other+0.012
 middle+0.011
 particularly+0.011
+0.011
Neg logit contributions
ro-0.009
uzz-0.007
emo-0.007
keeping-0.007
 cheek-0.006
Token on
Token ID319
Feature activation+1.35e+00

Pos logit contributions
aw+0.157
 pasture+0.151
ement+0.145
opl+0.145
 particularly+0.144
Neg logit contributions
ro-0.092
uzz-0.082
 moment-0.080
 cheek-0.077
emo-0.075
Token,
Token ID11
Feature activation+1.35e-01

Pos logit contributions
aw+0.070
ifying+0.061
 Other+0.061
opl+0.060
 particularly+0.060
Neg logit contributions
ro-0.048
emo-0.037
uzz-0.037
onna-0.036
 moment-0.035
Token Lily
Token ID20037
Feature activation+5.46e-01

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.004
uzz-0.003
keeping-0.003
emo-0.003
onna-0.002
Token and
Token ID290
Feature activation-9.90e-01

Pos logit contributions
aw+0.020
 Other+0.019
 middle+0.018
 particularly+0.017
+0.017
Neg logit contributions
ro-0.027
uzz-0.023
emo-0.023
keeping-0.022
onna-0.021
Token Rose
Token ID8049
Feature activation+5.99e-01

Pos logit contributions
ro+0.052
uzz+0.046
emo+0.045
keeping+0.044
onna+0.044
Neg logit contributions
 Other-0.032
aw-0.030
 course-0.029
 particularly-0.028
 middle-0.027
Token with
Token ID351
Feature activation+8.10e-01

Pos logit contributions
ro+0.020
uzz+0.018
emo+0.018
keeping+0.017
onna+0.016
Neg logit contributions
aw-0.017
 Other-0.015
 middle-0.015
 particularly-0.015
-0.014
Token Fred
Token ID8559
Feature activation-1.04e-01

Pos logit contributions
aw+0.034
 Other+0.032
 middle+0.030
 particularly+0.030
+0.030
Neg logit contributions
ro-0.035
uzz-0.031
keeping-0.029
emo-0.029
onna-0.027
Token.
Token ID13
Feature activation+8.61e-01

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
onna+0.004
Neg logit contributions
aw-0.004
 Other-0.004
 middle-0.004
 particularly-0.004
-0.003
Token From
Token ID3574
Feature activation+4.76e-01

Pos logit contributions
aw+0.035
 Other+0.031
 middle+0.031
 particularly+0.031
ifying+0.029
Neg logit contributions
ro-0.039
uzz-0.034
emo-0.033
keeping-0.033
onna-0.031
Token that
Token ID326
Feature activation-2.30e-01

Pos logit contributions
aw+0.014
 Other+0.014
 middle+0.013
 particularly+0.012
+0.012
Neg logit contributions
ro-0.026
uzz-0.023
emo-0.023
keeping-0.023
 cheek-0.022
Token day
Token ID1110
Feature activation+3.39e+00

Pos logit contributions
ro+0.010
uzz+0.008
keeping+0.008
emo+0.008
onna+0.007
Neg logit contributions
aw-0.010
 Other-0.010
 middle-0.010
 particularly-0.010
opl-0.009
Token on
Token ID319
Feature activation+2.01e+00

Pos logit contributions
 pasture+0.152
aw+0.146
chairs+0.140
ifying+0.138
 Other+0.138
Neg logit contributions
ro-0.100
 seek-0.095
 moment-0.093
 itself-0.090
emo-0.088
Token,
Token ID11
Feature activation+1.38e+00

Pos logit contributions
aw+0.104
ifying+0.092
 Other+0.092
 particularly+0.089
pieces+0.087
Neg logit contributions
ro-0.068
emo-0.057
 seek-0.055
 moment-0.054
 common-0.053
Token Tim
Token ID5045
Feature activation+3.57e-01

Pos logit contributions
aw+0.071
 Other+0.064
ifying+0.064
 middle+0.064
opl+0.063
Neg logit contributions
ro-0.047
emo-0.041
uzz-0.039
 seek-0.035
keeping-0.035
Token felt
Token ID2936
Feature activation+1.53e+00

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.013
 particularly+0.013
+0.013
Neg logit contributions
ro-0.016
uzz-0.013
emo-0.013
keeping-0.013
onna-0.012
Token strong
Token ID1913
Feature activation+1.03e+00

Pos logit contributions
aw+0.069
+0.066
 Other+0.065
ifying+0.063
opl+0.062
Neg logit contributions
ro-0.057
emo-0.051
uzz-0.047
onna-0.043
 itself-0.043
Token together
Token ID1978
Feature activation+3.75e-01

Pos logit contributions
aw+0.064
 particularly+0.063
ifying+0.063
opl+0.062
 Other+0.062
Neg logit contributions
ro-0.042
keeping-0.035
uzz-0.034
emo-0.033
 itself-0.032
Token happily
Token ID18177
Feature activation+6.66e-01

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.013
 particularly+0.013
+0.013
Neg logit contributions
ro-0.017
uzz-0.015
keeping-0.014
emo-0.014
onna-0.014
Token.
Token ID13
Feature activation-7.11e-03

Pos logit contributions
aw+0.033
 Other+0.031
 particularly+0.030
 middle+0.030
opl+0.028
Neg logit contributions
ro-0.025
uzz-0.021
keeping-0.020
emo-0.019
 cheek-0.018
Token From
Token ID3574
Feature activation+5.93e-01

Pos logit contributions
ro+0.000
uzz+0.000
keeping+0.000
emo+0.000
 cheek+0.000
Neg logit contributions
aw-0.000
 Other-0.000
 middle-0.000
 particularly-0.000
-0.000
Token that
Token ID326
Feature activation+5.67e-01

Pos logit contributions
 Other+0.019
aw+0.019
 middle+0.017
 particularly+0.017
+0.017
Neg logit contributions
ro-0.031
uzz-0.027
emo-0.027
keeping-0.026
 cheek-0.026
Token day
Token ID1110
Feature activation+3.37e+00

Pos logit contributions
aw+0.032
 Other+0.029
 middle+0.028
ifying+0.027
 particularly+0.027
Neg logit contributions
ro-0.017
uzz-0.014
emo-0.014
keeping-0.013
 cheek-0.013
Token on
Token ID319
Feature activation+1.52e+00

Pos logit contributions
aw+0.150
 pasture+0.142
opl+0.138
pieces+0.136
ement+0.135
Neg logit contributions
ro-0.095
 moment-0.086
uzz-0.085
 cheek-0.083
oys-0.078
Token,
Token ID11
Feature activation+9.86e-02

Pos logit contributions
aw+0.084
ifying+0.074
opl+0.071
 Other+0.071
 middle+0.069
Neg logit contributions
ro-0.049
emo-0.037
 moment-0.037
onna-0.037
 seek-0.035
Token Lily
Token ID20037
Feature activation+1.07e+00

Pos logit contributions
aw+0.006
 Other+0.006
 middle+0.006
 particularly+0.005
+0.005
Neg logit contributions
ro-0.003
uzz-0.002
keeping-0.002
emo-0.002
 cheek-0.002
Token always
Token ID1464
Feature activation-3.36e-01

Pos logit contributions
aw+0.040
 Other+0.038
 middle+0.037
ifying+0.037
+0.035
Neg logit contributions
ro-0.051
emo-0.044
uzz-0.042
keeping-0.042
onna-0.040
Token remembered
Token ID12086
Feature activation+4.65e-01

Pos logit contributions
ro+0.018
uzz+0.015
keeping+0.015
emo+0.015
onna+0.014
Neg logit contributions
aw-0.011
 Other-0.010
 particularly-0.010
 middle-0.010
-0.009
Token little
Token ID1310
Feature activation-4.31e-01

Pos logit contributions
ro+0.009
uzz+0.008
keeping+0.008
emo+0.008
onna+0.007
Neg logit contributions
aw-0.008
 Other-0.007
 middle-0.007
 particularly-0.007
-0.007
Token boy
Token ID2933
Feature activation+7.21e-01

Pos logit contributions
ro+0.018
uzz+0.017
emo+0.016
keeping+0.015
onna+0.015
Neg logit contributions
aw-0.019
 Other-0.018
 middle-0.018
 particularly-0.017
-0.016
Token carrying
Token ID6872
Feature activation-6.12e-02

Pos logit contributions
aw+0.025
 Other+0.024
 middle+0.022
 particularly+0.022
+0.022
Neg logit contributions
ro-0.036
uzz-0.031
keeping-0.031
emo-0.030
 cheek-0.028
Token a
Token ID257
Feature activation-1.31e-01

Pos logit contributions
ro+0.003
uzz+0.003
emo+0.003
keeping+0.003
onna+0.002
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token box
Token ID3091
Feature activation-2.80e-01

Pos logit contributions
ro+0.006
uzz+0.005
keeping+0.005
emo+0.005
onna+0.004
Neg logit contributions
aw-0.006
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token filled
Token ID5901
Feature activation-3.09e+00

Pos logit contributions
ro+0.013
emo+0.012
uzz+0.012
keeping+0.012
onna+0.011
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.009
 particularly-0.009
-0.009
Token with
Token ID351
Feature activation-9.10e-01

Pos logit contributions
ro+0.178
onna+0.167
 outage+0.167
 cheek+0.159
icated+0.156
Neg logit contributions
 middle-0.077
aw-0.072
 course-0.054
 Other-0.050
 all-0.049
Token empty
Token ID6565
Feature activation-7.08e-01

Pos logit contributions
ro+0.041
uzz+0.035
emo+0.035
onna+0.034
keeping+0.033
Neg logit contributions
aw-0.038
 middle-0.036
 Other-0.035
 particularly-0.034
ifying-0.031
Token boxes
Token ID10559
Feature activation-6.53e-01

Pos logit contributions
ro+0.033
uzz+0.029
emo+0.029
onna+0.026
keeping+0.026
Neg logit contributions
aw-0.029
 middle-0.027
 Other-0.027
 particularly-0.024
ifying-0.023
Token,
Token ID11
Feature activation-6.75e-01

Pos logit contributions
ro+0.032
emo+0.029
uzz+0.028
keeping+0.027
onna+0.026
Neg logit contributions
aw-0.024
 middle-0.021
 Other-0.021
 particularly-0.021
-0.020
Token wood
Token ID4898
Feature activation-1.00e+00

Pos logit contributions
ro+0.029
uzz+0.027
emo+0.026
keeping+0.024
onna+0.024
Neg logit contributions
aw-0.028
 middle-0.026
 Other-0.026
 particularly-0.025
ifying-0.023
Token she
Token ID673
Feature activation-7.49e-01

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.007
uzz-0.006
emo-0.006
keeping-0.006
onna-0.006
Token found
Token ID1043
Feature activation+1.03e+00

Pos logit contributions
ro+0.039
uzz+0.035
keeping+0.034
 cheek+0.034
onna+0.033
Neg logit contributions
aw-0.026
 particularly-0.023
 middle-0.022
 Other-0.022
 Leave-0.018
Token a
Token ID257
Feature activation-6.57e-01

Pos logit contributions
aw+0.047
 Other+0.046
+0.045
opl+0.045
 particularly+0.042
Neg logit contributions
ro-0.037
uzz-0.033
emo-0.031
keeping-0.030
 itself-0.028
Token huge
Token ID3236
Feature activation-6.79e-01

Pos logit contributions
ro+0.030
uzz+0.027
keeping+0.027
emo+0.025
onna+0.023
Neg logit contributions
aw-0.026
 particularly-0.025
 middle-0.024
 Other-0.023
-0.022
Token box
Token ID3091
Feature activation+4.17e-01

Pos logit contributions
ro+0.029
uzz+0.026
keeping+0.025
emo+0.025
onna+0.022
Neg logit contributions
aw-0.029
 middle-0.027
 particularly-0.026
 Other-0.026
-0.025
Token filled
Token ID5901
Feature activation-2.93e+00

Pos logit contributions
aw+0.017
 Other+0.016
 middle+0.015
 particularly+0.014
+0.014
Neg logit contributions
ro-0.019
uzz-0.017
keeping-0.016
emo-0.016
 cheek-0.015
Token with
Token ID351
Feature activation-1.35e+00

Pos logit contributions
ro+0.160
emo+0.148
onna+0.148
 cheek+0.144
uzz+0.143
Neg logit contributions
aw-0.081
 middle-0.080
 course-0.065
date-0.061
 particularly-0.061
Token costumes
Token ID24138
Feature activation-1.43e-01

Pos logit contributions
ro+0.061
emo+0.054
uzz+0.053
onna+0.051
 cheek+0.050
Neg logit contributions
aw-0.057
 middle-0.053
 particularly-0.051
 Other-0.050
ifying-0.045
Token.
Token ID13
Feature activation-5.02e-01

Pos logit contributions
ro+0.007
uzz+0.006
emo+0.006
keeping+0.006
onna+0.005
Neg logit contributions
aw-0.006
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token She
Token ID1375
Feature activation-1.44e+00

Pos logit contributions
ro+0.030
uzz+0.028
keeping+0.027
emo+0.027
ge+0.026
Neg logit contributions
 Other-0.013
aw-0.013
 middle-0.012
-0.011
 particularly-0.010
Token tried
Token ID3088
Feature activation-4.23e-01

Pos logit contributions
ro+0.071
keeping+0.068
 cheek+0.062
uzz+0.060
onna+0.059
Neg logit contributions
aw-0.050
 particularly-0.049
 middle-0.044
 Other-0.044
Eventually-0.039
Token I
Token ID314
Feature activation-3.07e-01

Pos logit contributions
ro+0.040
keeping+0.035
uzz+0.035
emo+0.033
ge+0.032
Neg logit contributions
aw-0.034
 Other-0.030
 middle-0.030
 course-0.028
 particularly-0.028
Token found
Token ID1043
Feature activation-3.32e-01

Pos logit contributions
ro+0.015
keeping+0.013
uzz+0.013
emo+0.013
 cheek+0.012
Neg logit contributions
aw-0.012
 Other-0.010
 particularly-0.010
 middle-0.010
-0.010
Token a
Token ID257
Feature activation-1.73e+00

Pos logit contributions
ro+0.017
uzz+0.014
emo+0.014
keeping+0.014
onna+0.013
Neg logit contributions
aw-0.013
 Other-0.012
 middle-0.011
 particularly-0.011
 course-0.010
Token shell
Token ID7582
Feature activation-1.87e-03

Pos logit contributions
ro+0.075
keeping+0.074
uzz+0.071
ge+0.063
and+0.062
Neg logit contributions
aw-0.068
 middle-0.068
 particularly-0.067
 best-0.059
 Other-0.057
Token!"
Token ID2474
Feature activation-9.44e-01

Pos logit contributions
ro+0.000
uzz+0.000
emo+0.000
keeping+0.000
onna+0.000
Neg logit contributions
aw-0.000
 Other-0.000
 middle-0.000
 particularly-0.000
-0.000
Token she
Token ID673
Feature activation-2.91e+00

Pos logit contributions
ro+0.049
uzz+0.046
keeping+0.043
emo+0.043
ge+0.042
Neg logit contributions
 middle-0.031
aw-0.030
 Other-0.030
 particularly-0.026
-0.025
Token said
Token ID531
Feature activation-1.05e+00

Pos logit contributions
ro+0.170
keeping+0.165
uzz+0.162
ge+0.152
onna+0.146
Neg logit contributions
aw-0.077
 middle-0.063
 particularly-0.054
 named-0.054
 sudden-0.053
Token.
Token ID13
Feature activation-9.44e-01

Pos logit contributions
ro+0.060
emo+0.054
keeping+0.054
uzz+0.052
icated+0.052
Neg logit contributions
aw-0.028
 Other-0.026
 middle-0.025
-0.024
 forward-0.023
Token Her
Token ID2332
Feature activation-9.58e-01

Pos logit contributions
ro+0.055
uzz+0.052
keeping+0.052
ge+0.051
emo+0.050
Neg logit contributions
 Other-0.024
-0.021
 middle-0.021
aw-0.021
 particularly-0.018
Token mom
Token ID1995
Feature activation-5.21e-01

Pos logit contributions
ro+0.052
keeping+0.046
uzz+0.045
ge+0.043
onna+0.043
Neg logit contributions
 middle-0.029
aw-0.028
 Other-0.028
-0.026
 best-0.025
Token smiled
Token ID13541
Feature activation-5.86e-02

Pos logit contributions
ro+0.023
keeping+0.020
uzz+0.020
emo+0.019
onna+0.018
Neg logit contributions
aw-0.023
 Other-0.020
 middle-0.019
 particularly-0.019
 course-0.018
Tokenmy
Token ID1820
Feature activation-6.58e-01

Pos logit contributions
ro+0.022
uzz+0.020
emo+0.019
keeping+0.019
onna+0.018
Neg logit contributions
aw-0.017
 Other-0.015
 middle-0.014
 particularly-0.013
opl-0.013
Token!
Token ID0
Feature activation-4.39e-01

Pos logit contributions
ro+0.037
uzz+0.034
emo+0.032
keeping+0.032
onna+0.030
Neg logit contributions
aw-0.021
 Other-0.018
 middle-0.017
-0.016
 particularly-0.016
Token A
Token ID317
Feature activation-3.35e-01

Pos logit contributions
ro+0.022
uzz+0.020
keeping+0.020
emo+0.019
ge+0.018
Neg logit contributions
aw-0.015
 Other-0.015
 middle-0.014
-0.013
 particularly-0.013
Token flower
Token ID15061
Feature activation+4.36e-01

Pos logit contributions
ro+0.014
uzz+0.012
keeping+0.012
emo+0.011
onna+0.010
Neg logit contributions
aw-0.015
 Other-0.014
 middle-0.014
 particularly-0.014
-0.013
Token!"
Token ID2474
Feature activation-1.05e+00

Pos logit contributions
aw+0.020
 Other+0.019
 middle+0.018
 particularly+0.018
+0.017
Neg logit contributions
ro-0.018
uzz-0.015
keeping-0.014
emo-0.014
onna-0.014
Token she
Token ID673
Feature activation-2.87e+00

Pos logit contributions
ro+0.054
uzz+0.050
keeping+0.048
emo+0.046
ge+0.046
Neg logit contributions
 middle-0.035
aw-0.035
 Other-0.033
 particularly-0.030
-0.029
Token said
Token ID531
Feature activation-7.97e-01

Pos logit contributions
ro+0.167
keeping+0.160
ge+0.150
uzz+0.149
onna+0.141
Neg logit contributions
aw-0.079
 middle-0.063
 particularly-0.062
 named-0.061
 sudden-0.061
Token.
Token ID13
Feature activation-5.61e-01

Pos logit contributions
ro+0.043
emo+0.038
keeping+0.038
uzz+0.038
icated+0.037
Neg logit contributions
aw-0.024
 Other-0.022
 middle-0.022
-0.021
 particularly-0.021
Token
Token ID220
Feature activation-3.09e-01

Pos logit contributions
ro+0.030
uzz+0.027
keeping+0.027
emo+0.026
ge+0.025
Neg logit contributions
 Other-0.017
aw-0.017
 middle-0.016
-0.016
 particularly-0.015
Token
Token ID198
Feature activation-5.67e-01

Pos logit contributions
ro+0.016
uzz+0.013
keeping+0.013
emo+0.013
 cheek+0.013
Neg logit contributions
aw-0.011
 Other-0.010
 middle-0.010
 particularly-0.009
-0.009
Token
Token ID198
Feature activation-7.06e-01

Pos logit contributions
ro+0.030
keeping+0.026
uzz+0.026
emo+0.025
 cheek+0.024
Neg logit contributions
aw-0.018
 Other-0.018
 middle-0.017
 particularly-0.016
-0.015
Token this
Token ID428
Feature activation-3.81e-01

Pos logit contributions
ro+0.028
keeping+0.024
uzz+0.023
emo+0.023
onna+0.022
Neg logit contributions
aw-0.024
 Other-0.022
 middle-0.022
 particularly-0.021
 course-0.021
Token is
Token ID318
Feature activation-1.12e+00

Pos logit contributions
ro+0.014
uzz+0.012
keeping+0.012
emo+0.012
onna+0.011
Neg logit contributions
aw-0.019
 middle-0.017
 Other-0.017
 particularly-0.016
 course-0.016
Token so
Token ID523
Feature activation-1.42e+00

Pos logit contributions
ro+0.065
keeping+0.056
uzz+0.056
emo+0.054
ge+0.052
Neg logit contributions
aw-0.034
 middle-0.032
 particularly-0.031
 Other-0.028
 course-0.027
Token pretty
Token ID2495
Feature activation-8.70e-01

Pos logit contributions
ro+0.066
keeping+0.060
uzz+0.056
ge+0.056
emo+0.051
Neg logit contributions
aw-0.057
 middle-0.057
 particularly-0.051
 Other-0.050
Eventually-0.045
Token!"
Token ID2474
Feature activation-1.33e+00

Pos logit contributions
ro+0.038
uzz+0.035
emo+0.033
keeping+0.032
 cheek+0.029
Neg logit contributions
aw-0.038
 Other-0.033
 middle-0.032
 particularly-0.032
 course-0.030
Token she
Token ID673
Feature activation-2.86e+00

Pos logit contributions
ro+0.073
uzz+0.071
emo+0.068
ge+0.065
keeping+0.064
Neg logit contributions
aw-0.039
 middle-0.038
 Other-0.035
 particularly-0.031
 Everyone-0.030
Token said
Token ID531
Feature activation-1.14e+00

Pos logit contributions
ro+0.164
keeping+0.153
uzz+0.146
onna+0.141
emo+0.135
Neg logit contributions
aw-0.080
 middle-0.069
 particularly-0.063
 sudden-0.057
 named-0.057
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
ro+0.064
emo+0.057
keeping+0.056
icated+0.055
uzz+0.054
Neg logit contributions
aw-0.033
 middle-0.030
 Other-0.030
 forward-0.028
 particularly-0.028
Token But
Token ID887
Feature activation-5.01e-01

Pos logit contributions
ro+0.065
uzz+0.062
keeping+0.061
ge+0.061
emo+0.060
Neg logit contributions
 Other-0.028
-0.026
aw-0.025
 middle-0.023
 Eventually-0.021
Token then
Token ID788
Feature activation+5.83e-01

Pos logit contributions
ro+0.025
keeping+0.022
emo+0.022
uzz+0.021
onna+0.021
Neg logit contributions
aw-0.018
 middle-0.017
 Other-0.017
 particularly-0.016
-0.015
Token she
Token ID673
Feature activation+7.14e-01

Pos logit contributions
aw+0.023
 Other+0.022
 middle+0.020
 particularly+0.020
opl+0.020
Neg logit contributions
ro-0.027
uzz-0.024
emo-0.022
keeping-0.022
 cheek-0.021
Token found
Token ID1043
Feature activation-3.60e-01

Pos logit contributions
ro+0.038
keeping+0.035
uzz+0.033
 cheek+0.032
emo+0.032
Neg logit contributions
aw-0.028
 particularly-0.027
 Other-0.026
 middle-0.026
-0.026
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
ro+0.016
uzz+0.014
emo+0.013
keeping+0.013
onna+0.013
Neg logit contributions
aw-0.016
 Other-0.014
 middle-0.014
 particularly-0.013
 course-0.013
Token bath
Token ID7837
Feature activation-1.33e+00

Pos logit contributions
ro+0.048
keeping+0.044
uzz+0.043
emo+0.040
and+0.038
Neg logit contributions
 particularly-0.042
 middle-0.041
-0.039
 Other-0.039
aw-0.039
Tokenrobe
Token ID25481
Feature activation-2.24e-01

Pos logit contributions
ro+0.053
uzz+0.049
emo+0.048
keeping+0.044
onna+0.044
Neg logit contributions
aw-0.054
 middle-0.051
 Other-0.049
-0.048
 particularly-0.048
Token!"
Token ID2474
Feature activation-1.42e+00

Pos logit contributions
ro+0.009
uzz+0.008
emo+0.008
keeping+0.008
onna+0.007
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.009
 particularly-0.009
-0.008
Token she
Token ID673
Feature activation-2.85e+00

Pos logit contributions
ro+0.071
uzz+0.064
keeping+0.064
emo+0.063
ge+0.062
Neg logit contributions
 middle-0.050
aw-0.050
 Other-0.045
 particularly-0.043
 Everybody-0.038
Token said
Token ID531
Feature activation-1.05e+00

Pos logit contributions
ro+0.160
keeping+0.154
uzz+0.147
emo+0.143
ge+0.137
Neg logit contributions
aw-0.066
 particularly-0.065
 middle-0.063
 named-0.057
 sudden-0.057
Token happily
Token ID18177
Feature activation-1.56e+00

Pos logit contributions
ro+0.057
emo+0.052
keeping+0.050
icated+0.050
uzz+0.049
Neg logit contributions
aw-0.031
 middle-0.028
 Other-0.028
 forward-0.027
 particularly-0.027
Token.
Token ID13
Feature activation-8.35e-01

Pos logit contributions
ro+0.092
emo+0.088
onna+0.079
icated+0.079
uzz+0.079
Neg logit contributions
aw-0.039
 middle-0.033
 forward-0.033
 particularly-0.030
 Other-0.028
Token
Token ID220
Feature activation-7.74e-01

Pos logit contributions
ro+0.049
keeping+0.045
emo+0.044
uzz+0.044
ge+0.043
Neg logit contributions
aw-0.021
 Other-0.021
 middle-0.021
 particularly-0.018
-0.017
Token
Token ID198
Feature activation-1.26e+00

Pos logit contributions
ro+0.042
keeping+0.037
uzz+0.036
emo+0.036
 cheek+0.035
Neg logit contributions
aw-0.024
 Other-0.024
 middle-0.022
 particularly-0.020
-0.020
Token it
Token ID340
Feature activation+4.36e-01

Pos logit contributions
ro+0.027
uzz+0.022
keeping+0.021
emo+0.020
onna+0.020
Neg logit contributions
aw-0.029
 Other-0.027
 middle-0.027
 course-0.026
 particularly-0.026
Token too
Token ID1165
Feature activation-6.59e-02

Pos logit contributions
aw+0.018
 Other+0.017
 middle+0.016
 particularly+0.016
+0.016
Neg logit contributions
ro-0.019
uzz-0.016
emo-0.016
keeping-0.015
onna-0.015
Token?"
Token ID1701
Feature activation+3.81e-01

Pos logit contributions
ro+0.003
uzz+0.003
keeping+0.003
emo+0.003
 cheek+0.003
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token Tim
Token ID5045
Feature activation-1.69e+00

Pos logit contributions
aw+0.021
 Other+0.020
 middle+0.019
 particularly+0.019
ifying+0.019
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.009
keeping-0.009
onna-0.008
Token shared
Token ID4888
Feature activation-7.75e-01

Pos logit contributions
keeping+0.082
ro+0.078
uzz+0.071
emo+0.068
onna+0.068
Neg logit contributions
aw-0.065
 particularly-0.056
 Other-0.054
 middle-0.050
 course-0.050
Token the
Token ID262
Feature activation-2.84e+00

Pos logit contributions
ro+0.040
keeping+0.035
uzz+0.034
emo+0.034
onna+0.033
Neg logit contributions
aw-0.028
 middle-0.025
 Other-0.024
 course-0.023
 particularly-0.022
Token marble
Token ID30623
Feature activation-1.66e+00

Pos logit contributions
keeping+0.119
ro+0.116
uzz+0.115
nes+0.108
onna+0.104
Neg logit contributions
 middle-0.126
 course-0.110
 best-0.107
 Other-0.105
 particularly-0.103
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
ro+0.080
keeping+0.071
uzz+0.071
emo+0.069
onna+0.065
Neg logit contributions
aw-0.061
 middle-0.058
 Other-0.052
 particularly-0.052
 forward-0.050
Token Sue
Token ID26089
Feature activation-5.27e-01

Pos logit contributions
ifying+0.080
 Other+0.080
aw+0.079
 particularly+0.079
opl+0.077
Neg logit contributions
ro-0.028
uzz-0.025
emo-0.023
keeping-0.020
 itself-0.019
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
ro+0.027
keeping+0.024
uzz+0.023
emo+0.023
onna+0.022
Neg logit contributions
aw-0.019
 Other-0.017
 middle-0.017
 particularly-0.017
-0.015
Token They
Token ID1119
Feature activation-8.85e-02

Pos logit contributions
ro+0.001
uzz+0.001
emo+0.001
keeping+0.001
 cheek+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token and
Token ID290
Feature activation-3.61e-01

Pos logit contributions
ro+0.010
uzz+0.009
emo+0.009
keeping+0.008
onna+0.008
Neg logit contributions
aw-0.009
 Other-0.008
 middle-0.008
 particularly-0.008
-0.007
Token content
Token ID2695
Feature activation-1.88e-01

Pos logit contributions
ro+0.017
uzz+0.015
emo+0.014
keeping+0.014
onna+0.013
Neg logit contributions
aw-0.014
 middle-0.013
 Other-0.013
 particularly-0.013
opl-0.012
Token.
Token ID13
Feature activation+4.25e-01

Pos logit contributions
ro+0.009
uzz+0.008
emo+0.007
keeping+0.007
onna+0.007
Neg logit contributions
aw-0.008
 Other-0.007
 middle-0.007
 particularly-0.007
-0.006
Token
Token ID198
Feature activation+2.95e-01

Pos logit contributions
aw+0.017
 Other+0.016
 middle+0.015
 particularly+0.015
+0.014
Neg logit contributions
ro-0.019
uzz-0.017
emo-0.016
keeping-0.016
onna-0.015
Token
Token ID198
Feature activation+5.62e-01

Pos logit contributions
aw+0.011
 Other+0.010
 middle+0.010
 particularly+0.010
+0.009
Neg logit contributions
ro-0.014
uzz-0.012
emo-0.012
keeping-0.012
onna-0.011
TokenB
Token ID33
Feature activation-2.84e+00

Pos logit contributions
aw+0.020
 Other+0.018
 middle+0.017
 particularly+0.017
+0.016
Neg logit contributions
ro-0.028
uzz-0.025
emo-0.025
keeping-0.024
onna-0.023
Tokenobby
Token ID11369
Feature activation-3.70e-01

Pos logit contributions
ro+0.095
 seek+0.095
keeping+0.094
 cheek+0.091
 common+0.088
Neg logit contributions
 Other-0.117
-0.113
aw-0.112
 Eventually-0.105
ement-0.101
Token and
Token ID290
Feature activation-1.29e+00

Pos logit contributions
ro+0.018
uzz+0.015
keeping+0.015
emo+0.015
onna+0.014
Neg logit contributions
aw-0.014
 Other-0.013
 particularly-0.013
 middle-0.012
-0.011
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
ro+0.071
onna+0.062
emo+0.060
uzz+0.059
 cheek+0.059
Neg logit contributions
 Other-0.036
aw-0.036
 middle-0.035
 particularly-0.033
 course-0.033
Token star
Token ID3491
Feature activation+2.14e-01

Pos logit contributions
ro+0.073
keeping+0.068
uzz+0.066
onna+0.062
emo+0.061
Neg logit contributions
 middle-0.066
aw-0.057
 particularly-0.057
 course-0.053
 Other-0.052
Token were
Token ID547
Feature activation+1.18e+00

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.009
keeping-0.008
onna-0.008
TokenYes
Token ID5297
Feature activation-4.83e-01

Pos logit contributions
ro+0.011
uzz+0.009
keeping+0.009
emo+0.009
onna+0.008
Neg logit contributions
aw-0.013
 Other-0.012
 particularly-0.012
 middle-0.012
-0.011
Token,
Token ID11
Feature activation-2.24e+00

Pos logit contributions
ro+0.028
keeping+0.026
emo+0.025
uzz+0.025
onna+0.023
Neg logit contributions
aw-0.014
 Other-0.013
 middle-0.012
 particularly-0.011
-0.011
Token you
Token ID345
Feature activation+6.02e-01

Pos logit contributions
keeping+0.112
ro+0.103
emo+0.096
uzz+0.094
ge+0.084
Neg logit contributions
aw-0.088
 course-0.073
 Other-0.073
 particularly-0.069
 middle-0.068
Token can
Token ID460
Feature activation+6.72e-01

Pos logit contributions
aw+0.028
 Other+0.026
 middle+0.024
 particularly+0.024
opl+0.024
Neg logit contributions
ro-0.024
uzz-0.020
emo-0.020
keeping-0.019
 cheek-0.019
Token go
Token ID467
Feature activation+4.91e-01

Pos logit contributions
aw+0.031
 Other+0.028
 particularly+0.027
 middle+0.027
opl+0.026
Neg logit contributions
ro-0.027
uzz-0.023
emo-0.022
keeping-0.022
 cheek-0.021
Token,
Token ID11
Feature activation-2.82e+00

Pos logit contributions
aw+0.022
 Other+0.021
 middle+0.020
 particularly+0.019
+0.019
Neg logit contributions
ro-0.020
uzz-0.017
emo-0.017
keeping-0.017
onna-0.016
Token but
Token ID475
Feature activation-1.27e+00

Pos logit contributions
keeping+0.130
emo+0.114
ro+0.114
uzz+0.111
onna+0.098
Neg logit contributions
aw-0.111
 Other-0.102
 middle-0.098
 best-0.097
 particularly-0.096
Token you
Token ID345
Feature activation+2.57e-01

Pos logit contributions
ro+0.056
keeping+0.052
uzz+0.051
emo+0.048
onna+0.044
Neg logit contributions
 middle-0.048
aw-0.048
 course-0.047
 particularly-0.046
 Other-0.045
Token need
Token ID761
Feature activation-5.10e-01

Pos logit contributions
aw+0.010
 Other+0.009
 middle+0.009
 particularly+0.009
+0.008
Neg logit contributions
ro-0.012
uzz-0.011
emo-0.010
keeping-0.010
onna-0.010
Token to
Token ID284
Feature activation-9.22e-01

Pos logit contributions
ro+0.022
uzz+0.019
emo+0.019
keeping+0.018
 cheek+0.017
Neg logit contributions
aw-0.022
 Other-0.020
 middle-0.020
 particularly-0.020
 course-0.018
Token be
Token ID307
Feature activation+1.62e-01

Pos logit contributions
ro+0.040
uzz+0.036
emo+0.035
keeping+0.034
and+0.032
Neg logit contributions
 middle-0.036
aw-0.036
 Other-0.034
 particularly-0.033
-0.032
Token shoe
Token ID17292
Feature activation+2.87e-01

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.005
 particularly+0.004
+0.004
Neg logit contributions
ro-0.005
uzz-0.005
emo-0.004
keeping-0.004
onna-0.004
Token is
Token ID318
Feature activation-1.00e+00

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.010
 particularly+0.010
+0.010
Neg logit contributions
ro-0.013
uzz-0.011
keeping-0.011
emo-0.011
 cheek-0.010
Token so
Token ID523
Feature activation-1.31e+00

Pos logit contributions
ro+0.064
uzz+0.059
emo+0.056
keeping+0.056
ge+0.053
Neg logit contributions
aw-0.023
 particularly-0.022
 middle-0.022
 Other-0.018
 best-0.017
Token pretty
Token ID2495
Feature activation-6.36e-01

Pos logit contributions
ro+0.060
keeping+0.054
uzz+0.052
ge+0.049
emo+0.048
Neg logit contributions
aw-0.052
 middle-0.051
 particularly-0.049
 Other-0.047
opl-0.042
Token!"
Token ID2474
Feature activation-1.06e+00

Pos logit contributions
ro+0.028
uzz+0.024
emo+0.023
keeping+0.022
 cheek+0.021
Neg logit contributions
aw-0.029
 Other-0.025
 middle-0.024
 particularly-0.024
-0.023
Token she
Token ID673
Feature activation-2.81e+00

Pos logit contributions
ro+0.056
uzz+0.053
keeping+0.050
emo+0.049
ge+0.047
Neg logit contributions
aw-0.034
 middle-0.033
 Other-0.032
 particularly-0.028
-0.027
Token said
Token ID531
Feature activation-6.53e-01

Pos logit contributions
ro+0.162
keeping+0.157
ge+0.141
uzz+0.141
onna+0.137
Neg logit contributions
aw-0.079
 particularly-0.066
 middle-0.063
 named-0.058
 Other-0.057
Token to
Token ID284
Feature activation-6.06e-01

Pos logit contributions
ro+0.034
keeping+0.030
emo+0.030
uzz+0.029
icated+0.028
Neg logit contributions
aw-0.022
 Other-0.020
 middle-0.020
 particularly-0.019
-0.019
Token herself
Token ID5223
Feature activation-9.40e-01

Pos logit contributions
ro+0.029
uzz+0.025
keeping+0.025
emo+0.024
onna+0.024
Neg logit contributions
aw-0.022
 Other-0.022
 middle-0.022
 particularly-0.020
 course-0.020
Token.
Token ID13
Feature activation-7.10e-01

Pos logit contributions
ro+0.053
emo+0.046
keeping+0.046
uzz+0.045
onna+0.043
Neg logit contributions
aw-0.028
 Other-0.025
 middle-0.024
 particularly-0.023
 forward-0.022
Token
Token ID198
Feature activation-3.92e-01

Pos logit contributions
ro+0.039
uzz+0.036
keeping+0.036
emo+0.034
ge+0.034
Neg logit contributions
 Other-0.021
aw-0.020
-0.019
 middle-0.019
 particularly-0.017
Token and
Token ID290
Feature activation-6.32e-01

Pos logit contributions
ro+0.028
uzz+0.025
keeping+0.025
emo+0.024
onna+0.022
Neg logit contributions
aw-0.027
 particularly-0.026
 middle-0.025
 Other-0.025
-0.023
Token gave
Token ID2921
Feature activation-4.55e-01

Pos logit contributions
ro+0.029
uzz+0.026
keeping+0.024
emo+0.024
 cheek+0.023
Neg logit contributions
aw-0.026
 middle-0.023
 particularly-0.023
 Other-0.023
-0.020
Token it
Token ID340
Feature activation-1.21e+00

Pos logit contributions
ro+0.019
uzz+0.015
keeping+0.015
onna+0.015
emo+0.015
Neg logit contributions
aw-0.022
 middle-0.020
 Other-0.020
 particularly-0.019
 course-0.018
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
ro+0.065
keeping+0.057
uzz+0.056
emo+0.056
onna+0.054
Neg logit contributions
aw-0.038
 middle-0.036
 Other-0.035
 particularly-0.034
 forward-0.030
Token good
Token ID922
Feature activation-2.33e+00

Pos logit contributions
keeping+0.052
ro+0.052
uzz+0.049
emo+0.046
ge+0.043
Neg logit contributions
 particularly-0.042
 middle-0.042
 Other-0.041
-0.039
aw-0.039
Token tidy
Token ID43044
Feature activation-2.80e+00

Pos logit contributions
keeping+0.101
ro+0.096
uzz+0.095
ge+0.090
nes+0.090
Neg logit contributions
 middle-0.083
 Other-0.080
 particularly-0.078
aw-0.074
 course-0.071
Token.
Token ID13
Feature activation-4.50e-01

Pos logit contributions
uzz+0.120
keeping+0.115
emo+0.115
ge+0.111
icated+0.110
Neg logit contributions
 middle-0.105
 course-0.090
 Other-0.089
 meantime-0.088
 particularly-0.086
Token
Token ID198
Feature activation-6.80e-01

Pos logit contributions
ro+0.026
uzz+0.023
keeping+0.023
emo+0.022
 cheek+0.022
Neg logit contributions
aw-0.013
 Other-0.012
 middle-0.011
-0.011
 particularly-0.010
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
ro+0.037
keeping+0.032
uzz+0.032
 cheek+0.031
emo+0.030
Neg logit contributions
aw-0.022
 Other-0.021
 middle-0.019
 particularly-0.019
Eventually-0.018
TokenThe
Token ID464
Feature activation-9.14e-01

Pos logit contributions
ro+0.019
uzz+0.017
keeping+0.017
emo+0.016
 cheek+0.016
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.008
 particularly-0.008
Eventually-0.008
Token nap
Token ID25422
Feature activation-7.36e-01

Pos logit contributions
ro+0.040
uzz+0.040
keeping+0.038
emo+0.036
onna+0.034
Neg logit contributions
 middle-0.037
aw-0.034
 particularly-0.033
 Other-0.033
-0.031
Token!
Token ID0
Feature activation-3.22e-01

Pos logit contributions
ro+0.015
uzz+0.014
keeping+0.013
emo+0.013
onna+0.012
Neg logit contributions
aw-0.009
 Other-0.008
 middle-0.008
 particularly-0.007
opl-0.007
Token A
Token ID317
Feature activation+8.13e-01

Pos logit contributions
ro+0.016
uzz+0.014
keeping+0.014
emo+0.014
 cheek+0.013
Neg logit contributions
aw-0.012
 Other-0.011
 middle-0.010
-0.010
 particularly-0.010
Token fire
Token ID2046
Feature activation+6.18e-01

Pos logit contributions
aw+0.034
 Other+0.031
 particularly+0.029
 middle+0.029
+0.029
Neg logit contributions
ro-0.035
emo-0.030
uzz-0.029
 cheek-0.029
keeping-0.028
Tokenwork
Token ID1818
Feature activation+4.27e-01

Pos logit contributions
aw+0.034
 Other+0.032
 particularly+0.031
 middle+0.031
+0.030
Neg logit contributions
ro-0.019
uzz-0.016
keeping-0.015
emo-0.015
 cheek-0.014
Token!"
Token ID2474
Feature activation-1.28e+00

Pos logit contributions
aw+0.019
 Other+0.018
 middle+0.017
 particularly+0.017
+0.017
Neg logit contributions
ro-0.017
uzz-0.014
emo-0.014
keeping-0.014
 cheek-0.013
Token she
Token ID673
Feature activation-2.78e+00

Pos logit contributions
ro+0.066
uzz+0.061
keeping+0.059
emo+0.058
ge+0.058
Neg logit contributions
aw-0.043
 Other-0.041
 middle-0.041
 Everyone-0.036
 particularly-0.036
Token said
Token ID531
Feature activation-7.49e-01

Pos logit contributions
ro+0.165
keeping+0.155
uzz+0.151
ge+0.147
onna+0.139
Neg logit contributions
aw-0.077
 Other-0.057
 middle-0.057
 particularly-0.054
 brothers-0.053
Token.
Token ID13
Feature activation-7.00e-01

Pos logit contributions
ro+0.041
keeping+0.036
emo+0.036
uzz+0.035
icated+0.034
Neg logit contributions
aw-0.024
 Other-0.022
 middle-0.020
-0.019
 particularly-0.019
Token Tim
Token ID5045
Feature activation-1.64e+00

Pos logit contributions
ro+0.036
uzz+0.033
keeping+0.033
emo+0.032
ge+0.032
Neg logit contributions
 Other-0.023
aw-0.022
-0.022
 middle-0.020
 particularly-0.019
Token looked
Token ID3114
Feature activation+1.46e-01

Pos logit contributions
ro+0.082
keeping+0.082
uzz+0.076
onna+0.070
ge+0.069
Neg logit contributions
aw-0.060
 Other-0.048
 particularly-0.047
 middle-0.044
 continued-0.043
Token up
Token ID510
Feature activation+1.88e-01

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.006
keeping-0.006
onna-0.006
Token their
Token ID511
Feature activation-2.17e+00

Pos logit contributions
ro+0.045
uzz+0.039
onna+0.038
emo+0.038
keeping+0.038
Neg logit contributions
aw-0.032
 middle-0.029
 Other-0.029
 particularly-0.026
 course-0.026
Token piles
Token ID30935
Feature activation-1.25e+00

Pos logit contributions
ro+0.106
uzz+0.100
onna+0.092
nes+0.090
keeping+0.090
Neg logit contributions
 middle-0.080
 best-0.071
aw-0.068
 course-0.067
 particularly-0.063
Token together
Token ID1978
Feature activation-8.69e-01

Pos logit contributions
ro+0.062
emo+0.056
uzz+0.055
keeping+0.055
 cheek+0.052
Neg logit contributions
aw-0.044
 middle-0.038
 Other-0.038
 particularly-0.036
 forward-0.036
Token and
Token ID290
Feature activation-6.97e-01

Pos logit contributions
ro+0.044
emo+0.040
uzz+0.040
keeping+0.038
 cheek+0.037
Neg logit contributions
aw-0.029
 middle-0.027
 Other-0.026
 particularly-0.026
-0.024
Token share
Token ID2648
Feature activation-1.33e+00

Pos logit contributions
ro+0.028
uzz+0.026
emo+0.025
keeping+0.024
onna+0.023
Neg logit contributions
aw-0.029
 middle-0.028
 Other-0.027
 particularly-0.027
-0.025
Token their
Token ID511
Feature activation-2.78e+00

Pos logit contributions
ro+0.071
keeping+0.062
uzz+0.061
onna+0.060
emo+0.059
Neg logit contributions
aw-0.047
 middle-0.038
 Other-0.036
 particularly-0.033
 course-0.032
Token finds
Token ID7228
Feature activation-6.33e-01

Pos logit contributions
ro+0.135
uzz+0.128
nes+0.125
ge+0.125
 outage+0.123
Neg logit contributions
 middle-0.098
 course-0.092
 best-0.085
aw-0.079
 Other-0.073
Token.
Token ID13
Feature activation-6.00e-01

Pos logit contributions
ro+0.033
uzz+0.029
keeping+0.028
emo+0.028
onna+0.027
Neg logit contributions
aw-0.023
 Other-0.020
 middle-0.020
 particularly-0.018
 course-0.017
Token They
Token ID1119
Feature activation-4.58e-01

Pos logit contributions
ro+0.031
keeping+0.029
uzz+0.028
 cheek+0.028
ge+0.027
Neg logit contributions
 Other-0.019
aw-0.019
 middle-0.017
-0.016
 Eventually-0.016
Token had
Token ID550
Feature activation-3.32e-01

Pos logit contributions
ro+0.023
uzz+0.020
keeping+0.020
emo+0.020
 cheek+0.019
Neg logit contributions
aw-0.017
 middle-0.015
 Other-0.015
 particularly-0.015
-0.014
Token more
Token ID517
Feature activation-1.04e+00

Pos logit contributions
ro+0.016
uzz+0.014
emo+0.014
keeping+0.014
 cheek+0.013
Neg logit contributions
aw-0.013
 Other-0.012
 middle-0.011
 particularly-0.011
-0.010
Token Kate
Token ID16693
Feature activation+3.63e-01

Pos logit contributions
ro+0.087
 cheek+0.076
keeping+0.075
ge+0.072
uzz+0.072
Neg logit contributions
 Other-0.070
-0.060
aw-0.057
 middle-0.056
 Everyone-0.056
Token and
Token ID290
Feature activation-1.68e-01

Pos logit contributions
aw+0.016
 Other+0.015
 middle+0.014
 particularly+0.014
+0.014
Neg logit contributions
ro-0.015
uzz-0.013
emo-0.013
keeping-0.012
 cheek-0.012
Token the
Token ID262
Feature activation-2.88e-01

Pos logit contributions
ro+0.007
uzz+0.005
keeping+0.005
emo+0.005
onna+0.005
Neg logit contributions
aw-0.008
 Other-0.007
 middle-0.007
 particularly-0.007
-0.007
Token other
Token ID584
Feature activation+8.86e-02

Pos logit contributions
ro+0.016
uzz+0.014
keeping+0.014
emo+0.013
onna+0.013
Neg logit contributions
aw-0.009
 Other-0.009
 middle-0.008
 particularly-0.008
-0.007
Token was
Token ID373
Feature activation+1.92e-01

Pos logit contributions
aw+0.004
 Other+0.003
 middle+0.003
 particularly+0.003
+0.003
Neg logit contributions
ro-0.004
uzz-0.003
keeping-0.003
emo-0.003
onna-0.003
Token named
Token ID3706
Feature activation-2.77e+00

Pos logit contributions
aw+0.006
 Other+0.006
 middle+0.006
 particularly+0.005
+0.005
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.009
keeping-0.009
onna-0.008
Token Ben
Token ID3932
Feature activation-4.38e-01

Pos logit contributions
ro+0.116
keeping+0.107
 cheek+0.107
ge+0.100
uzz+0.099
Neg logit contributions
 Other-0.104
-0.090
 Everyone-0.087
 Everybody-0.085
 All-0.083
Token.
Token ID13
Feature activation-4.82e-01

Pos logit contributions
ro+0.022
keeping+0.019
emo+0.019
uzz+0.019
onna+0.018
Neg logit contributions
aw-0.016
 Other-0.014
 particularly-0.014
 middle-0.014
-0.012
Token They
Token ID1119
Feature activation+3.07e-01

Pos logit contributions
ro+0.023
keeping+0.020
uzz+0.020
 cheek+0.020
emo+0.020
Neg logit contributions
aw-0.018
 Other-0.018
 middle-0.017
 particularly-0.016
-0.015
Token were
Token ID547
Feature activation+8.00e-01

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.010
 particularly+0.010
+0.010
Neg logit contributions
ro-0.015
uzz-0.013
emo-0.013
keeping-0.012
 cheek-0.012
Token playing
Token ID2712
Feature activation+7.79e-01

Pos logit contributions
aw+0.031
 Other+0.028
+0.027
 middle+0.027
ifying+0.026
Neg logit contributions
ro-0.036
uzz-0.034
emo-0.032
keeping-0.031
 cheek-0.029
Token said
Token ID531
Feature activation+8.31e-02

Pos logit contributions
ro+0.003
uzz+0.003
keeping+0.003
emo+0.002
onna+0.002
Neg logit contributions
aw-0.004
 Other-0.004
 middle-0.003
 particularly-0.003
-0.003
Token it
Token ID340
Feature activation-9.39e-01

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.003
 particularly+0.003
+0.002
Neg logit contributions
ro-0.004
uzz-0.004
keeping-0.004
emo-0.004
onna-0.003
Token was
Token ID373
Feature activation-6.43e-01

Pos logit contributions
ro+0.050
keeping+0.046
emo+0.045
uzz+0.044
onna+0.043
Neg logit contributions
aw-0.030
 particularly-0.027
 Other-0.027
 middle-0.026
-0.024
Token a
Token ID257
Feature activation-1.20e+00

Pos logit contributions
ro+0.036
keeping+0.031
uzz+0.030
emo+0.029
 cheek+0.029
Neg logit contributions
aw-0.020
 particularly-0.020
 middle-0.019
 Other-0.018
 forward-0.016
Token very
Token ID845
Feature activation-1.48e+00

Pos logit contributions
ro+0.060
keeping+0.056
uzz+0.055
emo+0.050
ge+0.049
Neg logit contributions
 particularly-0.042
aw-0.041
 middle-0.039
 Other-0.036
-0.036
Token special
Token ID2041
Feature activation-2.77e+00

Pos logit contributions
ro+0.076
keeping+0.075
uzz+0.067
icated+0.064
ge+0.063
Neg logit contributions
aw-0.049
 particularly-0.049
 middle-0.047
 Other-0.042
 best-0.040
Token shell
Token ID7582
Feature activation-4.78e-01

Pos logit contributions
keeping+0.118
ro+0.115
uzz+0.112
ge+0.109
nes+0.105
Neg logit contributions
 particularly-0.105
 middle-0.105
aw-0.098
 best-0.096
 course-0.091
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
ro+0.024
uzz+0.021
emo+0.021
keeping+0.020
onna+0.019
Neg logit contributions
aw-0.017
 Other-0.015
 middle-0.015
 particularly-0.015
-0.014
Token The
Token ID383
Feature activation-2.06e+00

Pos logit contributions
ro+0.006
uzz+0.005
emo+0.005
keeping+0.005
onna+0.005
Neg logit contributions
aw-0.004
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Token little
Token ID1310
Feature activation-2.20e+00

Pos logit contributions
keeping+0.098
ro+0.093
uzz+0.092
onna+0.091
ge+0.088
Neg logit contributions
 middle-0.088
aw-0.080
 best-0.077
 particularly-0.074
 happiest-0.065
Token girl
Token ID2576
Feature activation-6.68e-01

Pos logit contributions
keeping+0.109
ro+0.109
uzz+0.108
onna+0.106
ge+0.102
Neg logit contributions
 middle-0.073
aw-0.073
 best-0.064
 particularly-0.057
 course-0.056
Token dress
Token ID6576
Feature activation+5.85e-01

Pos logit contributions
aw+0.024
 Other+0.023
 middle+0.022
 particularly+0.021
+0.021
Neg logit contributions
ro-0.018
uzz-0.015
emo-0.014
keeping-0.014
 cheek-0.013
Token,
Token ID11
Feature activation+1.06e-02

Pos logit contributions
aw+0.025
 Other+0.024
 middle+0.022
 particularly+0.022
opl+0.021
Neg logit contributions
ro-0.026
uzz-0.021
keeping-0.021
emo-0.020
 cheek-0.020
Token but
Token ID475
Feature activation-8.40e-02

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
keeping-0.000
emo-0.000
 cheek-0.000
Token it
Token ID340
Feature activation-1.43e-01

Pos logit contributions
ro+0.004
uzz+0.003
keeping+0.003
emo+0.003
onna+0.003
Neg logit contributions
aw-0.003
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Token didn
Token ID1422
Feature activation+6.62e-01

Pos logit contributions
ro+0.007
uzz+0.006
keeping+0.006
emo+0.006
onna+0.005
Neg logit contributions
aw-0.006
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token't
Token ID470
Feature activation-2.77e+00

Pos logit contributions
aw+0.025
 Other+0.022
 middle+0.021
 particularly+0.021
opl+0.020
Neg logit contributions
ro-0.032
uzz-0.028
emo-0.027
keeping-0.026
onna-0.026
Token fit
Token ID4197
Feature activation-1.07e+00

Pos logit contributions
keeping+0.149
ro+0.143
uzz+0.140
emo+0.133
ert+0.122
Neg logit contributions
 middle-0.078
 particularly-0.077
 forward-0.073
 meantime-0.068
 end-0.067
Token her
Token ID607
Feature activation-6.91e-01

Pos logit contributions
ro+0.060
emo+0.053
keeping+0.052
uzz+0.052
onna+0.051
Neg logit contributions
aw-0.031
 forward-0.029
 particularly-0.028
 Other-0.027
 middle-0.026
Token anymore
Token ID7471
Feature activation-6.01e-01

Pos logit contributions
ro+0.034
emo+0.031
keeping+0.030
uzz+0.030
onna+0.029
Neg logit contributions
aw-0.023
 middle-0.022
 particularly-0.021
 Other-0.021
ifying-0.020
Token.
Token ID13
Feature activation+8.56e-01

Pos logit contributions
ro+0.031
emo+0.027
uzz+0.027
keeping+0.026
onna+0.025
Neg logit contributions
aw-0.021
 Other-0.019
 particularly-0.018
 middle-0.018
-0.017
Token She
Token ID1375
Feature activation-2.44e-01

Pos logit contributions
aw+0.035
 particularly+0.031
 Other+0.031
 middle+0.030
opl+0.030
Neg logit contributions
ro-0.039
emo-0.032
uzz-0.032
keeping-0.031
 cheek-0.031
Token Joy
Token ID14087
Feature activation+1.73e-01

Pos logit contributions
ro+0.065
 cheek+0.059
keeping+0.057
ge+0.054
emo+0.052
Neg logit contributions
 Other-0.061
-0.054
aw-0.052
 middle-0.052
 particularly-0.049
Token and
Token ID290
Feature activation+3.54e-01

Pos logit contributions
aw+0.008
 Other+0.007
 middle+0.007
 particularly+0.007
+0.006
Neg logit contributions
ro-0.007
uzz-0.006
emo-0.006
keeping-0.006
onna-0.006
Token the
Token ID262
Feature activation-3.61e-01

Pos logit contributions
aw+0.017
 Other+0.016
 middle+0.016
 particularly+0.016
+0.015
Neg logit contributions
ro-0.013
uzz-0.011
emo-0.011
keeping-0.010
onna-0.010
Token other
Token ID584
Feature activation+2.79e-01

Pos logit contributions
ro+0.020
uzz+0.018
keeping+0.018
emo+0.017
onna+0.017
Neg logit contributions
aw-0.011
 Other-0.010
 middle-0.010
 particularly-0.010
-0.009
Token was
Token ID373
Feature activation+1.67e-01

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.011
 particularly+0.011
+0.010
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.010
onna-0.009
Token named
Token ID3706
Feature activation-2.77e+00

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
onna-0.008
Token Jan
Token ID2365
Feature activation+4.90e-01

Pos logit contributions
ro+0.117
 cheek+0.106
keeping+0.105
ge+0.103
uzz+0.099
Neg logit contributions
 Other-0.102
-0.098
 Everyone-0.086
 Everybody-0.084
 All-0.083
Token.
Token ID13
Feature activation+8.17e-02

Pos logit contributions
aw+0.022
 Other+0.021
 middle+0.020
 particularly+0.019
+0.019
Neg logit contributions
ro-0.020
uzz-0.017
emo-0.016
keeping-0.016
onna-0.015
Token Jan
Token ID2365
Feature activation-2.33e-02

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.003
 particularly+0.003
+0.003
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token was
Token ID373
Feature activation+4.14e-01

Pos logit contributions
ro+0.001
uzz+0.001
keeping+0.001
emo+0.001
 cheek+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token very
Token ID845
Feature activation-1.02e+00

Pos logit contributions
aw+0.016
 Other+0.015
 middle+0.014
 particularly+0.014
+0.014
Neg logit contributions
ro-0.019
uzz-0.017
emo-0.017
keeping-0.016
onna-0.015
Token get
Token ID651
Feature activation+3.00e-01

Pos logit contributions
aw+0.102
 Other+0.092
ement+0.090
opl+0.089
 middle+0.086
Neg logit contributions
ro-0.052
onna-0.037
uzz-0.037
 seek-0.034
 cheek-0.034
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
aw+0.013
 Other+0.013
 middle+0.012
 particularly+0.012
+0.012
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.010
onna-0.009
Token big
Token ID1263
Feature activation+5.20e-01

Pos logit contributions
aw+0.050
 Other+0.046
 particularly+0.043
opl+0.042
ifying+0.042
Neg logit contributions
ro-0.045
 cheek-0.038
uzz-0.037
keeping-0.036
emo-0.036
Token pastry
Token ID41738
Feature activation+6.01e-01

Pos logit contributions
aw+0.021
 Other+0.020
 particularly+0.019
 middle+0.019
+0.018
Neg logit contributions
ro-0.024
uzz-0.020
keeping-0.019
emo-0.019
 cheek-0.019
Token?"
Token ID1701
Feature activation-1.26e-01

Pos logit contributions
aw+0.025
 Other+0.024
 particularly+0.023
 middle+0.022
+0.021
Neg logit contributions
ro-0.026
uzz-0.022
keeping-0.022
emo-0.021
 cheek-0.021
Token she
Token ID673
Feature activation-2.77e+00

Pos logit contributions
ro+0.005
uzz+0.004
keeping+0.004
emo+0.004
onna+0.004
Neg logit contributions
aw-0.006
 Other-0.006
 middle-0.006
 particularly-0.005
-0.005
Token asked
Token ID1965
Feature activation-4.86e-01

Pos logit contributions
ro+0.175
keeping+0.166
uzz+0.165
onna+0.154
and+0.149
Neg logit contributions
 middle-0.057
aw-0.053
 particularly-0.051
 Other-0.046
 sudden-0.044
Token.
Token ID13
Feature activation-4.88e-01

Pos logit contributions
ro+0.025
emo+0.021
keeping+0.021
uzz+0.021
onna+0.020
Neg logit contributions
aw-0.017
 Other-0.017
 middle-0.016
 particularly-0.015
-0.014
Token Her
Token ID2332
Feature activation+2.49e-01

Pos logit contributions
ro+0.027
keeping+0.026
uzz+0.025
emo+0.025
ge+0.024
Neg logit contributions
 Other-0.013
aw-0.013
 middle-0.012
-0.012
 particularly-0.011
Token mom
Token ID1995
Feature activation-5.06e-01

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.010
 particularly+0.010
+0.010
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.008
Token said
Token ID531
Feature activation+4.26e-01

Pos logit contributions
ro+0.021
keeping+0.019
uzz+0.017
emo+0.017
onna+0.016
Neg logit contributions
aw-0.024
 Other-0.021
 particularly-0.021
 middle-0.020
-0.020
Token are
Token ID389
Feature activation-2.72e-01

Pos logit contributions
aw+0.002
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.002
uzz-0.002
keeping-0.002
emo-0.002
onna-0.002
Token special
Token ID2041
Feature activation-4.71e-03

Pos logit contributions
ro+0.013
uzz+0.012
keeping+0.011
emo+0.011
onna+0.010
Neg logit contributions
aw-0.010
 Other-0.009
 particularly-0.009
 middle-0.009
ifying-0.008
Token."
Token ID526
Feature activation-1.38e+00

Pos logit contributions
ro+0.000
uzz+0.000
emo+0.000
keeping+0.000
onna+0.000
Neg logit contributions
aw-0.000
 Other-0.000
 middle-0.000
 particularly-0.000
-0.000
Token
Token ID198
Feature activation-7.00e-01

Pos logit contributions
ro+0.081
ge+0.073
keeping+0.073
uzz+0.071
 common+0.071
Neg logit contributions
 Other-0.038
aw-0.034
 Eventually-0.031
 middle-0.030
-0.030
Token
Token ID198
Feature activation-9.54e-01

Pos logit contributions
ro+0.038
keeping+0.034
uzz+0.033
 cheek+0.032
emo+0.032
Neg logit contributions
aw-0.022
 Other-0.021
 middle-0.020
 particularly-0.019
-0.018
TokenB
Token ID33
Feature activation-2.76e+00

Pos logit contributions
ro+0.043
keeping+0.041
uzz+0.039
emo+0.036
onna+0.036
Neg logit contributions
aw-0.038
 particularly-0.036
 Other-0.036
 middle-0.036
Eventually-0.034
Tokenenny
Token ID11870
Feature activation-8.81e-01

Pos logit contributions
 cheek+0.063
keeping+0.059
ro+0.058
 seek+0.058
 common+0.055
Neg logit contributions
aw-0.150
 Other-0.146
-0.136
 Eventually-0.132
ifying-0.132
Token promised
Token ID8072
Feature activation-4.99e-01

Pos logit contributions
ro+0.041
keeping+0.037
uzz+0.037
emo+0.035
onna+0.034
Neg logit contributions
aw-0.036
 Other-0.031
 particularly-0.030
 middle-0.029
-0.027
Token to
Token ID284
Feature activation-6.31e-01

Pos logit contributions
ro+0.022
uzz+0.019
emo+0.018
keeping+0.018
icated+0.017
Neg logit contributions
aw-0.020
 Other-0.019
 middle-0.019
 particularly-0.018
-0.017
Token show
Token ID905
Feature activation-8.19e-01

Pos logit contributions
ro+0.028
uzz+0.024
keeping+0.024
emo+0.023
ge+0.022
Neg logit contributions
aw-0.024
 Other-0.024
 middle-0.024
 particularly-0.023
 course-0.023
Token Ell
Token ID7122
Feature activation-1.09e+00

Pos logit contributions
ro+0.039
uzz+0.032
emo+0.031
keeping+0.031
onna+0.031
Neg logit contributions
 Other-0.031
aw-0.031
 middle-0.030
 particularly-0.028
 course-0.027
TokenFrom
Token ID4863
Feature activation-4.10e-01

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.005
 particularly+0.005
+0.004
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
onna-0.008
Token that
Token ID326
Feature activation-7.45e-01

Pos logit contributions
ro+0.026
uzz+0.024
keeping+0.023
emo+0.023
onna+0.023
Neg logit contributions
aw-0.010
 middle-0.008
 Other-0.008
 particularly-0.007
 course-0.007
Token day
Token ID1110
Feature activation+2.04e+00

Pos logit contributions
ro+0.035
uzz+0.030
keeping+0.030
emo+0.030
onna+0.027
Neg logit contributions
 middle-0.028
 Other-0.028
 particularly-0.028
aw-0.027
 course-0.026
Token forward
Token ID2651
Feature activation-8.82e-01

Pos logit contributions
aw+0.067
 Other+0.058
opl+0.054
 particularly+0.053
 pasture+0.053
Neg logit contributions
ro-0.100
uzz-0.094
 cheek-0.088
keeping-0.086
 moment-0.085
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
ro+0.052
emo+0.047
uzz+0.046
keeping+0.046
 cheek+0.045
Neg logit contributions
aw-0.024
 Other-0.021
 middle-0.019
 particularly-0.017
 course-0.017
Token little
Token ID1310
Feature activation-2.76e+00

Pos logit contributions
ro+0.082
uzz+0.077
keeping+0.075
onna+0.072
emo+0.071
Neg logit contributions
 middle-0.051
 particularly-0.048
aw-0.046
 course-0.046
 best-0.044
Token girl
Token ID2576
Feature activation+1.52e+00

Pos logit contributions
onna+0.151
ro+0.144
ge+0.142
emo+0.137
and+0.135
Neg logit contributions
 middle-0.077
 best-0.072
 forward-0.063
 course-0.063
where-0.062
Token was
Token ID373
Feature activation+1.36e+00

Pos logit contributions
aw+0.065
 Other+0.064
 middle+0.060
+0.059
ifying+0.057
Neg logit contributions
ro-0.063
uzz-0.053
keeping-0.052
emo-0.050
onna-0.049
Token never
Token ID1239
Feature activation-3.71e-01

Pos logit contributions
aw+0.054
 Other+0.050
+0.049
ifying+0.046
 middle+0.046
Neg logit contributions
ro-0.059
uzz-0.056
emo-0.053
keeping-0.052
onna-0.050
Token without
Token ID1231
Feature activation-8.90e-01

Pos logit contributions
ro+0.019
uzz+0.017
emo+0.016
keeping+0.016
 cheek+0.015
Neg logit contributions
aw-0.013
 Other-0.012
 particularly-0.012
 middle-0.011
ifying-0.010
Token her
Token ID607
Feature activation-2.53e-01

Pos logit contributions
ro+0.048
emo+0.041
uzz+0.041
onna+0.039
keeping+0.038
Neg logit contributions
aw-0.028
 particularly-0.027
 Other-0.027
 middle-0.025
 course-0.023
Token other
Token ID584
Feature activation-7.90e-01

Pos logit contributions
aw+0.052
 Other+0.047
+0.044
opl+0.044
ifying+0.044
Neg logit contributions
ro-0.051
emo-0.043
uzz-0.043
 cheek-0.043
keeping-0.042
Token again
Token ID757
Feature activation+1.18e-01

Pos logit contributions
ro+0.040
emo+0.035
uzz+0.034
keeping+0.033
onna+0.033
Neg logit contributions
aw-0.029
 Other-0.026
 middle-0.026
 particularly-0.025
 forward-0.023
Token.
Token ID13
Feature activation+5.99e-01

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.005
uzz-0.005
emo-0.004
keeping-0.004
onna-0.004
Token From
Token ID3574
Feature activation+5.06e-01

Pos logit contributions
aw+0.025
 Other+0.023
 particularly+0.022
 middle+0.022
opl+0.021
Neg logit contributions
ro-0.026
uzz-0.023
emo-0.022
keeping-0.022
onna-0.021
Token that
Token ID326
Feature activation-4.26e-01

Pos logit contributions
aw+0.016
 Other+0.015
 particularly+0.014
 middle+0.014
+0.013
Neg logit contributions
ro-0.027
uzz-0.025
emo-0.024
keeping-0.024
 cheek-0.023
Token day
Token ID1110
Feature activation+2.68e+00

Pos logit contributions
ro+0.019
keeping+0.017
uzz+0.017
emo+0.016
onna+0.015
Neg logit contributions
 Other-0.017
aw-0.017
 middle-0.017
 particularly-0.017
 course-0.016
Token on
Token ID319
Feature activation+1.43e+00

Pos logit contributions
aw+0.109
 particularly+0.098
 Other+0.096
pieces+0.096
 pasture+0.095
Neg logit contributions
uzz-0.102
ro-0.102
emo-0.090
 seek-0.087
 cheek-0.087
Token,
Token ID11
Feature activation+5.73e-01

Pos logit contributions
aw+0.075
 Other+0.065
pieces+0.064
ifying+0.063
 particularly+0.063
Neg logit contributions
ro-0.049
emo-0.042
uzz-0.041
onna-0.039
 moment-0.037
Token Tim
Token ID5045
Feature activation-1.71e-01

Pos logit contributions
aw+0.029
 Other+0.026
 middle+0.026
 particularly+0.025
ifying+0.024
Neg logit contributions
ro-0.021
uzz-0.018
emo-0.017
keeping-0.016
onna-0.015
Token and
Token ID290
Feature activation+3.37e-02

Pos logit contributions
ro+0.008
uzz+0.007
keeping+0.007
emo+0.007
 cheek+0.006
Neg logit contributions
aw-0.007
 Other-0.006
 particularly-0.006
 middle-0.006
opl-0.006
Token Mo
Token ID4270
Feature activation+1.90e-01

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.002
uzz-0.001
emo-0.001
keeping-0.001
onna-0.001
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
aw+0.039
 Other+0.038
 middle+0.035
 particularly+0.035
+0.034
Neg logit contributions
ro-0.043
uzz-0.038
keeping-0.037
emo-0.035
 cheek-0.034
Token<|endoftext|>
Token ID50256
Feature activation-2.47e-01

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.013
 particularly+0.013
+0.012
Neg logit contributions
ro-0.017
uzz-0.015
emo-0.015
keeping-0.015
onna-0.014
TokenSam
Token ID16305
Feature activation-3.76e-01

Pos logit contributions
ro+0.013
uzz+0.012
emo+0.011
keeping+0.011
 cheek+0.011
Neg logit contributions
aw-0.008
 Other-0.007
 middle-0.007
 particularly-0.007
-0.007
Token and
Token ID290
Feature activation+3.61e-01

Pos logit contributions
ro+0.018
uzz+0.016
keeping+0.015
emo+0.015
 cheek+0.014
Neg logit contributions
aw-0.015
 Other-0.013
 middle-0.013
 particularly-0.013
opl-0.012
Token Alice
Token ID14862
Feature activation+4.64e-01

Pos logit contributions
aw+0.016
 Other+0.016
 middle+0.015
 particularly+0.015
+0.014
Neg logit contributions
ro-0.015
uzz-0.013
emo-0.012
keeping-0.012
 cheek-0.011
Token were
Token ID547
Feature activation+2.01e+00

Pos logit contributions
aw+0.018
 Other+0.017
 middle+0.016
 particularly+0.015
+0.015
Neg logit contributions
ro-0.022
uzz-0.019
emo-0.019
keeping-0.019
onna-0.017
Token at
Token ID379
Feature activation-1.32e+00

Pos logit contributions
aw+0.070
+0.065
opl+0.065
 Other+0.064
ifying+0.062
Neg logit contributions
uzz-0.083
ro-0.078
emo-0.072
 cheek-0.071
 L-0.068
Token the
Token ID262
Feature activation-3.13e-03

Pos logit contributions
ro+0.072
emo+0.064
uzz+0.063
 cheek+0.062
onna+0.059
Neg logit contributions
 Other-0.039
 middle-0.037
aw-0.036
 particularly-0.035
-0.034
Token stadium
Token ID10308
Feature activation+3.09e-02

Pos logit contributions
ro+0.000
uzz+0.000
emo+0.000
keeping+0.000
onna+0.000
Neg logit contributions
aw-0.000
 Other-0.000
 middle-0.000
 particularly-0.000
-0.000
Token.
Token ID13
Feature activation+5.14e-01

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.001
emo-0.001
keeping-0.001
onna-0.001
Token It
Token ID632
Feature activation-2.83e-01

Pos logit contributions
aw+0.020
 Other+0.018
 particularly+0.018
 middle+0.018
opl+0.017
Neg logit contributions
ro-0.024
uzz-0.021
keeping-0.020
emo-0.019
onna-0.019
Token as
Token ID355
Feature activation+1.06e-01

Pos logit contributions
aw+0.011
 Other+0.010
 middle+0.010
 particularly+0.010
+0.009
Neg logit contributions
ro-0.011
uzz-0.010
emo-0.010
keeping-0.010
onna-0.009
Token can
Token ID460
Feature activation+3.22e-01

Pos logit contributions
aw+0.005
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.004
uzz-0.004
emo-0.004
keeping-0.004
 cheek-0.003
Token be
Token ID307
Feature activation-1.36e-01

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.012
 particularly+0.012
+0.012
Neg logit contributions
ro-0.014
uzz-0.012
emo-0.011
keeping-0.011
onna-0.011
Token.
Token ID13
Feature activation+6.89e-01

Pos logit contributions
ro+0.006
uzz+0.005
emo+0.005
keeping+0.005
 cheek+0.005
Neg logit contributions
aw-0.006
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token From
Token ID3574
Feature activation-1.34e-01

Pos logit contributions
aw+0.025
 Other+0.022
 middle+0.022
 particularly+0.021
opl+0.020
Neg logit contributions
ro-0.034
uzz-0.031
emo-0.029
keeping-0.029
onna-0.028
Token then
Token ID788
Feature activation+2.13e+00

Pos logit contributions
ro+0.008
uzz+0.007
keeping+0.007
emo+0.007
onna+0.007
Neg logit contributions
aw-0.004
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Token on
Token ID319
Feature activation+9.63e-01

Pos logit contributions
aw+0.083
 Other+0.075
 pasture+0.071
ifying+0.067
ns+0.066
Neg logit contributions
ro-0.091
uzz-0.076
 moment-0.076
 seek-0.072
emo-0.071
Token,
Token ID11
Feature activation+8.39e-01

Pos logit contributions
aw+0.040
 Other+0.037
 middle+0.034
opl+0.034
 particularly+0.034
Neg logit contributions
ro-0.043
uzz-0.038
emo-0.035
keeping-0.035
onna-0.034
Token when
Token ID618
Feature activation+8.86e-02

Pos logit contributions
aw+0.040
 Other+0.037
 middle+0.035
 particularly+0.035
+0.034
Neg logit contributions
ro-0.032
uzz-0.028
emo-0.026
keeping-0.026
onna-0.024
Token Andy
Token ID12382
Feature activation+5.01e-02

Pos logit contributions
aw+0.004
 Other+0.004
 middle+0.004
 particularly+0.004
+0.003
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.002
Token in
Token ID287
Feature activation-5.73e-01

Pos logit contributions
ro+0.012
uzz+0.011
emo+0.010
keeping+0.010
onna+0.010
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.009
 particularly-0.009
-0.008
Token the
Token ID262
Feature activation+5.91e-02

Pos logit contributions
ro+0.027
uzz+0.023
emo+0.023
 cheek+0.022
keeping+0.022
Neg logit contributions
aw-0.022
 middle-0.021
 Other-0.021
 particularly-0.020
 course-0.020
Token rain
Token ID6290
Feature activation-8.71e-01

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.003
uzz-0.002
emo-0.002
keeping-0.002
onna-0.002
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
ro+0.040
emo+0.037
uzz+0.037
keeping+0.033
onna+0.033
Neg logit contributions
aw-0.034
 Other-0.029
 middle-0.028
 particularly-0.028
-0.028
Token From
Token ID3574
Feature activation+9.43e-02

Pos logit contributions
ro+0.008
uzz+0.007
keeping+0.007
emo+0.007
 cheek+0.006
Neg logit contributions
aw-0.005
 Other-0.004
 middle-0.004
 particularly-0.004
-0.004
Token then
Token ID788
Feature activation+2.25e+00

Pos logit contributions
aw+0.003
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.006
uzz-0.005
keeping-0.005
emo-0.005
onna-0.005
Token on
Token ID319
Feature activation+1.02e+00

Pos logit contributions
aw+0.095
 Other+0.084
 pasture+0.080
ement+0.079
ns+0.078
Neg logit contributions
ro-0.093
 moment-0.076
uzz-0.074
emo-0.074
 cheek-0.072
Token,
Token ID11
Feature activation+2.40e-01

Pos logit contributions
aw+0.048
 Other+0.044
opl+0.042
ifying+0.041
 particularly+0.041
Neg logit contributions
ro-0.042
uzz-0.033
emo-0.032
keeping-0.032
 cheek-0.031
Token she
Token ID673
Feature activation+2.60e-01

Pos logit contributions
aw+0.013
 Other+0.012
 middle+0.012
 particularly+0.012
+0.011
Neg logit contributions
ro-0.008
uzz-0.006
emo-0.006
keeping-0.006
onna-0.006
Token always
Token ID1464
Feature activation-6.98e-01

Pos logit contributions
aw+0.009
 Other+0.009
 middle+0.008
 particularly+0.008
+0.008
Neg logit contributions
ro-0.013
uzz-0.012
emo-0.011
keeping-0.011
onna-0.011
Token looked
Token ID3114
Feature activation+1.25e+00

Pos logit contributions
ro+0.036
uzz+0.031
keeping+0.031
emo+0.030
onna+0.029
Neg logit contributions
aw-0.024
 particularly-0.022
 Other-0.022
 middle-0.021
-0.019
Token.
Token ID13
Feature activation-6.36e-01

Pos logit contributions
aw+0.028
 Other+0.026
 middle+0.025
 particularly+0.024
+0.024
Neg logit contributions
ro-0.030
uzz-0.025
keeping-0.024
emo-0.024
onna-0.023
Token<|endoftext|>
Token ID50256
Feature activation+3.08e-01

Pos logit contributions
ro+0.034
keeping+0.029
uzz+0.029
 cheek+0.029
emo+0.028
Neg logit contributions
 Other-0.020
aw-0.020
 middle-0.019
 particularly-0.017
-0.017
TokenTom
Token ID13787
Feature activation-6.75e-01

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.006
Neg logit contributions
ro-0.018
uzz-0.016
emo-0.016
keeping-0.016
onna-0.015
Token and
Token ID290
Feature activation+3.08e-01

Pos logit contributions
ro+0.033
uzz+0.029
keeping+0.029
 cheek+0.028
emo+0.028
Neg logit contributions
aw-0.025
 Other-0.022
 middle-0.021
 particularly-0.021
opl-0.020
Token Lily
Token ID20037
Feature activation-5.36e-01

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.012
 particularly+0.012
+0.012
Neg logit contributions
ro-0.013
uzz-0.011
emo-0.011
keeping-0.011
 cheek-0.010
Token were
Token ID547
Feature activation+2.05e+00

Pos logit contributions
ro+0.028
uzz+0.025
emo+0.024
keeping+0.024
 cheek+0.023
Neg logit contributions
aw-0.019
 Other-0.016
 middle-0.016
 particularly-0.015
-0.015
Token twins
Token ID20345
Feature activation+1.43e+00

Pos logit contributions
aw+0.085
opl+0.081
+0.079
ifying+0.079
 Other+0.078
Neg logit contributions
uzz-0.074
emo-0.063
 cheek-0.062
ro-0.060
 hiding-0.058
Token who
Token ID508
Feature activation+1.23e-01

Pos logit contributions
aw+0.065
 Other+0.062
opl+0.059
 middle+0.058
 particularly+0.056
Neg logit contributions
ro-0.056
uzz-0.049
 cheek-0.047
onna-0.044
keeping-0.044
Token liked
Token ID8288
Feature activation-6.20e-01

Pos logit contributions
aw+0.005
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.005
 cheek-0.005
Token to
Token ID284
Feature activation+7.81e-01

Pos logit contributions
ro+0.028
uzz+0.024
emo+0.023
keeping+0.023
 cheek+0.022
Neg logit contributions
aw-0.026
 Other-0.024
 particularly-0.024
 middle-0.023
-0.021
Token play
Token ID711
Feature activation+1.07e+00

Pos logit contributions
aw+0.041
 Other+0.037
 middle+0.036
 particularly+0.035
opl+0.035
Neg logit contributions
ro-0.028
uzz-0.023
emo-0.022
keeping-0.021
onna-0.020
Token friend
Token ID1545
Feature activation+9.21e-01

Pos logit contributions
ro+0.023
uzz+0.020
keeping+0.020
emo+0.020
onna+0.019
Neg logit contributions
aw-0.018
 Other-0.017
 middle-0.017
 particularly-0.016
-0.015
Token the
Token ID262
Feature activation-6.41e-01

Pos logit contributions
aw+0.040
 Other+0.038
 middle+0.037
+0.036
 particularly+0.036
Neg logit contributions
ro-0.038
uzz-0.032
keeping-0.031
emo-0.030
 cheek-0.029
Token fox
Token ID21831
Feature activation+8.54e-01

Pos logit contributions
ro+0.029
uzz+0.026
keeping+0.024
emo+0.024
onna+0.023
Neg logit contributions
aw-0.026
 middle-0.025
 Other-0.024
 particularly-0.024
-0.022
Token.
Token ID13
Feature activation+7.76e-01

Pos logit contributions
aw+0.036
 Other+0.034
 middle+0.033
 particularly+0.032
+0.031
Neg logit contributions
ro-0.037
uzz-0.031
keeping-0.030
emo-0.030
 cheek-0.029
Token They
Token ID1119
Feature activation+1.21e+00

Pos logit contributions
aw+0.032
 middle+0.029
 Other+0.028
 particularly+0.028
ifying+0.026
Neg logit contributions
ro-0.035
uzz-0.031
emo-0.029
keeping-0.029
onna-0.028
Token would
Token ID561
Feature activation+1.02e+00

Pos logit contributions
aw+0.043
 Other+0.041
 middle+0.038
+0.037
ement+0.036
Neg logit contributions
ro-0.058
uzz-0.053
emo-0.049
keeping-0.048
onna-0.046
Token go
Token ID467
Feature activation+1.76e+00

Pos logit contributions
aw+0.042
 Other+0.039
 middle+0.037
 particularly+0.036
+0.035
Neg logit contributions
ro-0.047
uzz-0.041
keeping-0.038
emo-0.038
onna-0.037
Token to
Token ID284
Feature activation+1.93e-01

Pos logit contributions
aw+0.087
 Other+0.080
ement+0.079
opl+0.076
ifying+0.076
Neg logit contributions
ro-0.063
uzz-0.055
keeping-0.051
emo-0.050
 seek-0.049
Token the
Token ID262
Feature activation+3.60e-01

Pos logit contributions
aw+0.009
 Other+0.008
 middle+0.008
 particularly+0.008
+0.008
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.007
keeping-0.006
onna-0.006
Token station
Token ID4429
Feature activation+4.98e-01

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.012
 particularly+0.012
+0.012
Neg logit contributions
ro-0.017
uzz-0.015
emo-0.014
keeping-0.014
onna-0.013
Token and
Token ID290
Feature activation+4.79e-01

Pos logit contributions
aw+0.022
 Other+0.021
 middle+0.020
 particularly+0.020
+0.019
Neg logit contributions
ro-0.021
uzz-0.017
keeping-0.017
emo-0.017
 cheek-0.016
Token girl
Token ID2576
Feature activation+5.94e-01

Pos logit contributions
ro+0.015
uzz+0.013
emo+0.012
keeping+0.012
onna+0.011
Neg logit contributions
aw-0.024
 Other-0.022
 middle-0.022
 particularly-0.021
-0.020
Token playing
Token ID2712
Feature activation-2.53e-02

Pos logit contributions
aw+0.021
 Other+0.020
 middle+0.019
 particularly+0.019
+0.018
Neg logit contributions
ro-0.029
uzz-0.026
keeping-0.025
emo-0.024
 cheek-0.024
Token with
Token ID351
Feature activation+1.74e-01

Pos logit contributions
ro+0.001
uzz+0.001
emo+0.001
keeping+0.001
onna+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token a
Token ID257
Feature activation+2.00e-01

Pos logit contributions
aw+0.007
 Other+0.007
 middle+0.007
 particularly+0.007
+0.006
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.006
keeping-0.006
onna-0.006
Token pink
Token ID11398
Feature activation+8.11e-01

Pos logit contributions
aw+0.010
 Other+0.009
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.007
uzz-0.006
emo-0.006
keeping-0.006
 cheek-0.006
Token ball
Token ID2613
Feature activation+1.01e+00

Pos logit contributions
aw+0.045
 Other+0.043
 particularly+0.040
 middle+0.040
opl+0.040
Neg logit contributions
ro-0.026
uzz-0.019
emo-0.019
 cheek-0.018
keeping-0.017
Token too
Token ID1165
Feature activation-5.85e-01

Pos logit contributions
aw+0.046
 Other+0.045
 middle+0.042
 particularly+0.042
Eventually+0.042
Neg logit contributions
ro-0.040
uzz-0.032
 cheek-0.029
keeping-0.029
emo-0.029
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
ro+0.031
keeping+0.027
emo+0.027
uzz+0.027
 cheek+0.025
Neg logit contributions
aw-0.021
 Other-0.018
 particularly-0.017
 middle-0.017
-0.017
Token Lily
Token ID20037
Feature activation-6.81e-01

Pos logit contributions
aw+0.007
 Other+0.006
 middle+0.006
 particularly+0.006
+0.005
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
onna-0.007
Token wanted
Token ID2227
Feature activation-3.91e-01

Pos logit contributions
ro+0.031
keeping+0.028
uzz+0.028
emo+0.026
onna+0.025
Neg logit contributions
aw-0.029
 particularly-0.025
 Other-0.025
 middle-0.024
-0.022
Token to
Token ID284
Feature activation-2.59e-01

Pos logit contributions
ro+0.018
uzz+0.015
keeping+0.015
emo+0.015
 cheek+0.014
Neg logit contributions
aw-0.016
 Other-0.015
 particularly-0.015
 middle-0.015
 course-0.014
Token Mia
Token ID32189
Feature activation+5.73e-01

Pos logit contributions
ro+0.012
uzz+0.010
emo+0.010
keeping+0.010
 cheek+0.010
Neg logit contributions
aw-0.008
 Other-0.008
 middle-0.007
-0.007
 particularly-0.007
Token lost
Token ID2626
Feature activation-4.32e-01

Pos logit contributions
aw+0.025
 Other+0.024
 middle+0.023
 particularly+0.023
opl+0.022
Neg logit contributions
ro-0.024
uzz-0.020
keeping-0.019
emo-0.019
 cheek-0.018
Token her
Token ID607
Feature activation+1.05e-01

Pos logit contributions
ro+0.020
emo+0.018
uzz+0.017
keeping+0.016
onna+0.016
Neg logit contributions
aw-0.017
 Other-0.015
 middle-0.015
 particularly-0.014
-0.014
Token best
Token ID1266
Feature activation+6.18e-01

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.004
uzz-0.004
emo-0.003
keeping-0.003
onna-0.003
Token friend
Token ID1545
Feature activation+6.87e-01

Pos logit contributions
aw+0.029
 Other+0.027
 middle+0.025
 particularly+0.025
+0.024
Neg logit contributions
ro-0.026
uzz-0.021
keeping-0.021
emo-0.021
 cheek-0.019
Token because
Token ID780
Feature activation+4.00e-01

Pos logit contributions
aw+0.030
 Other+0.029
 middle+0.027
 particularly+0.027
+0.027
Neg logit contributions
ro-0.029
uzz-0.024
keeping-0.024
emo-0.023
 cheek-0.022
Token of
Token ID286
Feature activation+3.63e-01

Pos logit contributions
aw+0.017
 Other+0.016
 middle+0.015
 particularly+0.015
+0.015
Neg logit contributions
ro-0.017
uzz-0.015
emo-0.014
keeping-0.014
onna-0.013
Token a
Token ID257
Feature activation+5.64e-01

Pos logit contributions
aw+0.017
 Other+0.016
 middle+0.015
 particularly+0.015
+0.014
Neg logit contributions
ro-0.015
uzz-0.012
keeping-0.012
emo-0.012
onna-0.011
Token bad
Token ID2089
Feature activation+2.22e-01

Pos logit contributions
aw+0.025
 Other+0.023
 middle+0.022
 particularly+0.022
+0.021
Neg logit contributions
ro-0.024
uzz-0.020
emo-0.020
keeping-0.019
 cheek-0.019
Token promise
Token ID6991
Feature activation-3.78e-01

Pos logit contributions
aw+0.009
 Other+0.008
 middle+0.008
 particularly+0.008
+0.008
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token and
Token ID290
Feature activation+2.06e-01

Pos logit contributions
ro+0.017
uzz+0.015
emo+0.015
keeping+0.015
 cheek+0.014
Neg logit contributions
aw-0.015
 Other-0.014
 middle-0.013
 particularly-0.013
-0.012
Token things
Token ID1243
Feature activation+1.20e-02

Pos logit contributions
aw+0.040
+0.040
 Other+0.039
 middle+0.038
 particularly+0.037
Neg logit contributions
ro-0.039
uzz-0.033
keeping-0.032
emo-0.031
 textures-0.030
Token in
Token ID287
Feature activation+9.38e-02

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.001
uzz-0.000
emo-0.000
keeping-0.000
 cheek-0.000
Token nature
Token ID3450
Feature activation+1.57e-01

Pos logit contributions
aw+0.004
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
aw+0.007
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.007
uzz-0.006
emo-0.006
keeping-0.006
onna-0.005
Token Whenever
Token ID21326
Feature activation+9.42e-01

Pos logit contributions
aw+0.039
 particularly+0.036
 middle+0.035
ifying+0.035
+0.034
Neg logit contributions
ro-0.050
uzz-0.043
emo-0.041
keeping-0.039
onna-0.039
Token he
Token ID339
Feature activation+7.24e-01

Pos logit contributions
aw+0.045
 Other+0.044
 particularly+0.043
+0.043
 middle+0.042
Neg logit contributions
ro-0.033
emo-0.027
uzz-0.027
keeping-0.026
 seek-0.024
Token saw
Token ID2497
Feature activation+1.30e+00

Pos logit contributions
aw+0.030
 Other+0.029
 middle+0.028
 particularly+0.026
+0.026
Neg logit contributions
ro-0.032
uzz-0.026
emo-0.026
keeping-0.025
onna-0.024
Token something
Token ID1223
Feature activation+2.14e-01

Pos logit contributions
+0.053
aw+0.053
 Other+0.051
 middle+0.050
 particularly+0.049
Neg logit contributions
ro-0.054
uzz-0.048
emo-0.046
keeping-0.045
icated-0.041
Token he
Token ID339
Feature activation+1.23e+00

Pos logit contributions
aw+0.009
 Other+0.008
 middle+0.008
 particularly+0.008
+0.007
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.008
onna-0.008
Token knew
Token ID2993
Feature activation+4.21e-01

Pos logit contributions
aw+0.054
 Other+0.054
 middle+0.051
+0.050
opl+0.049
Neg logit contributions
ro-0.049
uzz-0.040
emo-0.039
keeping-0.036
onna-0.036
Token,
Token ID11
Feature activation+7.40e-01

Pos logit contributions
aw+0.017
 Other+0.016
 middle+0.015
 particularly+0.015
+0.015
Neg logit contributions
ro-0.019
uzz-0.016
emo-0.016
keeping-0.016
onna-0.015
Token.
Token ID13
Feature activation+2.77e-01

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
keeping-0.000
emo-0.000
 cheek-0.000
Token
Token ID220
Feature activation+2.62e-01

Pos logit contributions
aw+0.009
 Other+0.009
 middle+0.008
 particularly+0.008
+0.008
Neg logit contributions
ro-0.015
uzz-0.013
emo-0.012
keeping-0.012
onna-0.012
Token
Token ID198
Feature activation-5.54e-02

Pos logit contributions
aw+0.010
 Other+0.010
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.012
uzz-0.011
emo-0.010
keeping-0.010
onna-0.010
Token
Token ID198
Feature activation+6.98e-01

Pos logit contributions
ro+0.003
uzz+0.002
keeping+0.002
emo+0.002
 cheek+0.002
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
TokenThe
Token ID464
Feature activation+1.24e+00

Pos logit contributions
aw+0.025
 Other+0.023
 middle+0.021
+0.021
 particularly+0.021
Neg logit contributions
ro-0.035
uzz-0.030
emo-0.030
keeping-0.029
 seek-0.028
Token friend
Token ID1545
Feature activation+4.68e-01

Pos logit contributions
aw+0.051
 Other+0.046
+0.045
ifying+0.045
opl+0.043
Neg logit contributions
ro-0.055
emo-0.048
 cheek-0.047
 moment-0.045
 seek-0.044
Token said
Token ID531
Feature activation+1.11e+00

Pos logit contributions
aw+0.020
 Other+0.019
 middle+0.018
 particularly+0.018
+0.018
Neg logit contributions
ro-0.020
uzz-0.017
emo-0.016
keeping-0.016
 cheek-0.015
Token he
Token ID339
Feature activation+1.19e+00

Pos logit contributions
aw+0.047
opl+0.045
 Other+0.045
 middle+0.044
+0.043
Neg logit contributions
ro-0.046
uzz-0.040
emo-0.038
 cheek-0.037
onna-0.037
Token wasn
Token ID2492
Feature activation+1.85e+00

Pos logit contributions
aw+0.058
 Other+0.057
 middle+0.056
opl+0.055
+0.053
Neg logit contributions
ro-0.041
uzz-0.036
emo-0.034
keeping-0.031
onna-0.030
Token't
Token ID470
Feature activation-5.22e-01

Pos logit contributions
aw+0.095
 Other+0.084
 middle+0.077
opl+0.077
 pasture+0.076
Neg logit contributions
ro-0.062
uzz-0.056
onna-0.053
nes-0.052
emo-0.050
Token feeling
Token ID4203
Feature activation-3.32e-01

Pos logit contributions
ro+0.026
uzz+0.022
keeping+0.021
emo+0.021
onna+0.020
Neg logit contributions
aw-0.019
 particularly-0.018
 Other-0.018
 middle-0.017
 forward-0.016
Token,"
Token ID553
Feature activation-4.73e-01

Pos logit contributions
ro+0.045
uzz+0.042
emo+0.042
keeping+0.039
 cheek+0.036
Neg logit contributions
aw-0.036
 Other-0.032
 middle-0.032
 particularly-0.030
opl-0.028
Token Sara
Token ID24799
Feature activation-1.26e+00

Pos logit contributions
ro+0.015
keeping+0.012
uzz+0.012
emo+0.011
 cheek+0.010
Neg logit contributions
aw-0.026
 Other-0.025
 middle-0.025
 particularly-0.023
 course-0.023
Token said
Token ID531
Feature activation-7.42e-01

Pos logit contributions
keeping+0.055
ro+0.054
uzz+0.050
emo+0.048
ge+0.045
Neg logit contributions
aw-0.055
 Other-0.047
 particularly-0.044
 middle-0.043
-0.041
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
ro+0.039
keeping+0.034
emo+0.034
uzz+0.034
 seek+0.032
Neg logit contributions
aw-0.024
 Other-0.023
 middle-0.021
 forward-0.020
 particularly-0.020
Token
Token ID198
Feature activation-4.77e-01

Pos logit contributions
aw+0.021
 Other+0.019
 middle+0.019
 particularly+0.019
opl+0.018
Neg logit contributions
ro-0.022
uzz-0.019
emo-0.018
keeping-0.018
onna-0.017
Token
Token ID198
Feature activation-3.32e-01

Pos logit contributions
ro+0.025
keeping+0.022
uzz+0.022
emo+0.021
 cheek+0.021
Neg logit contributions
aw-0.016
 Other-0.015
 middle-0.014
 particularly-0.013
-0.013
TokenThey
Token ID2990
Feature activation+7.37e-01

Pos logit contributions
ro+0.018
uzz+0.017
keeping+0.016
emo+0.016
 cheek+0.016
Neg logit contributions
aw-0.011
 Other-0.009
 middle-0.009
 particularly-0.008
Eventually-0.008
Token went
Token ID1816
Feature activation+6.73e-02

Pos logit contributions
aw+0.034
 Other+0.033
opl+0.032
 middle+0.030
 particularly+0.030
Neg logit contributions
ro-0.029
uzz-0.025
emo-0.023
keeping-0.023
onna-0.022
Token to
Token ID284
Feature activation-9.04e-01

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.003
 particularly+0.003
+0.003
Neg logit contributions
ro-0.003
uzz-0.002
emo-0.002
keeping-0.002
onna-0.002
Token the
Token ID262
Feature activation+2.48e-01

Pos logit contributions
ro+0.047
emo+0.041
keeping+0.041
uzz+0.041
 cheek+0.038
Neg logit contributions
 Other-0.029
 middle-0.029
aw-0.026
 particularly-0.026
-0.025
Token kitchen
Token ID9592
Feature activation-8.81e-01

Pos logit contributions
aw+0.010
 Other+0.009
 middle+0.009
 particularly+0.009
+0.008
Neg logit contributions
ro-0.012
uzz-0.010
keeping-0.010
emo-0.010
 cheek-0.009
Token;
Token ID26
Feature activation+3.38e-02

Pos logit contributions
ro+0.004
uzz+0.003
emo+0.003
keeping+0.003
onna+0.003
Neg logit contributions
aw-0.003
 Other-0.003
 middle-0.003
 particularly-0.003
-0.002
Token maybe
Token ID3863
Feature activation-1.15e-01

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.002
uzz-0.001
keeping-0.001
emo-0.001
 cheek-0.001
Token he
Token ID339
Feature activation+1.96e-01

Pos logit contributions
ro+0.005
uzz+0.005
keeping+0.004
emo+0.004
 cheek+0.004
Neg logit contributions
aw-0.005
 Other-0.004
 middle-0.004
 particularly-0.004
-0.004
Token could
Token ID714
Feature activation+4.69e-01

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.007
keeping-0.007
onna-0.007
Token help
Token ID1037
Feature activation+9.17e-03

Pos logit contributions
aw+0.023
 Other+0.021
 middle+0.020
 particularly+0.020
opl+0.020
Neg logit contributions
ro-0.018
uzz-0.015
emo-0.014
keeping-0.014
onna-0.013
Token clean
Token ID3424
Feature activation-2.13e-01

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
emo-0.000
keeping-0.000
onna-0.000
Token up
Token ID510
Feature activation-8.81e-02

Pos logit contributions
ro+0.009
uzz+0.008
emo+0.007
keeping+0.007
onna+0.007
Neg logit contributions
aw-0.009
 Other-0.009
 middle-0.009
 particularly-0.008
-0.008
Token the
Token ID262
Feature activation-9.35e-01

Pos logit contributions
ro+0.004
uzz+0.003
emo+0.003
keeping+0.003
 cheek+0.003
Neg logit contributions
aw-0.004
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Token waste
Token ID7030
Feature activation-1.68e-01

Pos logit contributions
ro+0.038
uzz+0.036
emo+0.033
keeping+0.032
onna+0.031
Neg logit contributions
 middle-0.040
aw-0.039
 Other-0.038
 particularly-0.036
-0.036
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
ro+0.008
uzz+0.007
emo+0.006
keeping+0.006
 cheek+0.006
Neg logit contributions
aw-0.007
 Other-0.006
 middle-0.006
 particularly-0.006
-0.006
Token
Token ID198
Feature activation+2.30e-01

Pos logit contributions
aw+0.006
 Other+0.006
 middle+0.006
 particularly+0.005
+0.005
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.009
keeping-0.009
onna-0.009
Token thanked
Token ID26280
Feature activation+1.36e-03

Pos logit contributions
ro+0.005
uzz+0.004
keeping+0.004
emo+0.004
onna+0.004
Neg logit contributions
aw-0.005
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token them
Token ID606
Feature activation-1.13e-01

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
keeping-0.000
emo-0.000
onna-0.000
Token.
Token ID13
Feature activation+2.56e-01

Pos logit contributions
ro+0.005
uzz+0.005
emo+0.004
keeping+0.004
onna+0.004
Neg logit contributions
aw-0.005
 Other-0.004
 middle-0.004
 particularly-0.004
-0.004
Token They
Token ID1119
Feature activation+4.33e-01

Pos logit contributions
aw+0.010
 Other+0.010
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.010
onna-0.009
Token started
Token ID2067
Feature activation-8.07e-01

Pos logit contributions
aw+0.020
 Other+0.019
 middle+0.019
 particularly+0.018
+0.018
Neg logit contributions
ro-0.017
uzz-0.014
emo-0.014
keeping-0.013
onna-0.013
Token to
Token ID284
Feature activation-6.94e-01

Pos logit contributions
ro+0.036
emo+0.032
uzz+0.031
keeping+0.031
 cheek+0.029
Neg logit contributions
aw-0.031
 Other-0.031
 particularly-0.028
 middle-0.028
-0.027
Token build
Token ID1382
Feature activation+4.48e-01

Pos logit contributions
ro+0.030
uzz+0.027
emo+0.026
keeping+0.026
and+0.024
Neg logit contributions
 Other-0.027
aw-0.027
 middle-0.026
 particularly-0.025
 course-0.025
Token towers
Token ID18028
Feature activation+5.76e-01

Pos logit contributions
aw+0.017
 Other+0.016
 particularly+0.015
 middle+0.015
+0.015
Neg logit contributions
ro-0.021
uzz-0.018
emo-0.018
keeping-0.018
onna-0.017
Token and
Token ID290
Feature activation+1.82e-01

Pos logit contributions
aw+0.026
 Other+0.025
 particularly+0.024
 middle+0.024
+0.023
Neg logit contributions
ro-0.023
uzz-0.020
emo-0.019
keeping-0.018
onna-0.018
Token houses
Token ID7777
Feature activation+1.82e-02

Pos logit contributions
aw+0.007
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.007
keeping-0.007
onna-0.007
Token and
Token ID290
Feature activation+3.74e-01

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.001
emo-0.001
keeping-0.001
 cheek-0.001
Token Today
Token ID6288
Feature activation-5.45e-01

Pos logit contributions
ro+0.032
uzz+0.028
emo+0.028
keeping+0.027
 cheek+0.027
Neg logit contributions
aw-0.019
 Other-0.019
 middle-0.017
 particularly-0.017
-0.016
Token was
Token ID373
Feature activation+2.61e-02

Pos logit contributions
ro+0.028
uzz+0.025
emo+0.024
keeping+0.024
 cheek+0.023
Neg logit contributions
aw-0.019
 Other-0.018
 middle-0.017
 particularly-0.017
-0.016
Token different
Token ID1180
Feature activation-6.44e-01

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.001
emo-0.001
keeping-0.001
 cheek-0.001
Token,
Token ID11
Feature activation-7.44e-01

Pos logit contributions
ro+0.027
emo+0.024
uzz+0.024
keeping+0.022
 cheek+0.022
Neg logit contributions
aw-0.028
 Other-0.026
 middle-0.025
 particularly-0.025
opl-0.023
Token as
Token ID355
Feature activation-4.83e-01

Pos logit contributions
ro+0.033
uzz+0.029
emo+0.029
keeping+0.028
onna+0.026
Neg logit contributions
aw-0.031
 Other-0.028
 particularly-0.027
 middle-0.027
-0.025
Token today
Token ID1909
Feature activation-3.88e-01

Pos logit contributions
ro+0.022
uzz+0.019
keeping+0.018
emo+0.018
 cheek+0.017
Neg logit contributions
aw-0.020
 Other-0.018
 middle-0.018
 particularly-0.018
-0.016
Token James
Token ID3700
Feature activation-5.13e-01

Pos logit contributions
ro+0.020
uzz+0.018
emo+0.018
keeping+0.018
 cheek+0.017
Neg logit contributions
aw-0.013
 Other-0.012
 middle-0.011
 particularly-0.011
-0.011
Token came
Token ID1625
Feature activation+2.93e-02

Pos logit contributions
ro+0.024
uzz+0.021
emo+0.021
keeping+0.020
 cheek+0.020
Neg logit contributions
aw-0.020
 Other-0.018
 particularly-0.018
 middle-0.018
-0.016
Token with
Token ID351
Feature activation+1.47e-04

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.001
emo-0.001
keeping-0.001
onna-0.001
Token his
Token ID465
Feature activation+2.63e-01

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
emo-0.000
keeping-0.000
onna-0.000
Token mum
Token ID25682
Feature activation+3.53e-01

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.011
 particularly+0.011
+0.010
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token loved
Token ID6151
Feature activation-9.02e-01

Pos logit contributions
ro+0.029
keeping+0.024
uzz+0.024
emo+0.023
onna+0.023
Neg logit contributions
aw-0.023
 particularly-0.020
 Other-0.020
 middle-0.019
 course-0.018
Token the
Token ID262
Feature activation-1.49e+00

Pos logit contributions
ro+0.045
uzz+0.037
 cheek+0.037
keeping+0.036
emo+0.035
Neg logit contributions
aw-0.035
 Other-0.032
 particularly-0.031
 course-0.030
-0.027
Token crystal
Token ID15121
Feature activation-1.58e-01

Pos logit contributions
ro+0.078
uzz+0.068
keeping+0.064
onna+0.060
nes+0.060
Neg logit contributions
 course-0.053
 particularly-0.052
ement-0.051
 middle-0.051
aw-0.051
Token and
Token ID290
Feature activation-2.67e-01

Pos logit contributions
ro+0.007
uzz+0.006
emo+0.006
keeping+0.006
onna+0.006
Neg logit contributions
aw-0.007
 Other-0.006
 middle-0.006
 particularly-0.006
-0.006
Token they
Token ID484
Feature activation+3.96e-01

Pos logit contributions
ro+0.012
uzz+0.010
keeping+0.010
emo+0.010
 cheek+0.010
Neg logit contributions
aw-0.011
 Other-0.010
 particularly-0.010
 middle-0.010
-0.009
Token all
Token ID477
Feature activation-9.52e-01

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.012
 particularly+0.012
+0.011
Neg logit contributions
ro-0.020
uzz-0.018
emo-0.017
keeping-0.017
 cheek-0.016
Token played
Token ID2826
Feature activation-2.08e-01

Pos logit contributions
ro+0.051
keeping+0.044
uzz+0.043
emo+0.042
ge+0.040
Neg logit contributions
aw-0.034
 particularly-0.029
 Other-0.029
 middle-0.028
 course-0.025
Token together
Token ID1978
Feature activation-1.75e-01

Pos logit contributions
ro+0.009
uzz+0.008
emo+0.008
keeping+0.008
onna+0.007
Neg logit contributions
aw-0.009
 Other-0.008
 middle-0.008
 particularly-0.008
-0.007
Token in
Token ID287
Feature activation-1.32e-01

Pos logit contributions
ro+0.008
uzz+0.007
emo+0.007
keeping+0.007
 cheek+0.007
Neg logit contributions
aw-0.007
 Other-0.006
 middle-0.006
 particularly-0.006
-0.005
Token the
Token ID262
Feature activation+2.46e-01

Pos logit contributions
ro+0.006
uzz+0.005
emo+0.005
keeping+0.005
 cheek+0.005
Neg logit contributions
aw-0.005
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token wide
Token ID3094
Feature activation-7.10e-01

Pos logit contributions
aw+0.010
 Other+0.009
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.011
uzz-0.010
emo-0.009
keeping-0.009
 cheek-0.009
Token!
Token ID0
Feature activation-5.37e-01

Pos logit contributions
ro+0.021
uzz+0.019
emo+0.019
keeping+0.018
onna+0.017
Neg logit contributions
aw-0.018
 Other-0.016
 middle-0.016
 particularly-0.016
-0.015
Token She
Token ID1375
Feature activation-8.07e-02

Pos logit contributions
ro+0.027
uzz+0.024
emo+0.024
keeping+0.024
 cheek+0.023
Neg logit contributions
aw-0.018
 Other-0.018
 middle-0.016
 particularly-0.016
-0.015
Token went
Token ID1816
Feature activation-2.32e-01

Pos logit contributions
ro+0.003
uzz+0.003
keeping+0.003
emo+0.003
 cheek+0.003
Neg logit contributions
aw-0.003
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Token to
Token ID284
Feature activation-1.47e+00

Pos logit contributions
ro+0.010
emo+0.009
uzz+0.009
keeping+0.009
onna+0.008
Neg logit contributions
aw-0.009
 Other-0.009
 middle-0.009
 particularly-0.009
-0.008
Token get
Token ID651
Feature activation-7.13e-01

Pos logit contributions
ro+0.082
emo+0.078
uzz+0.073
keeping+0.072
 cheek+0.070
Neg logit contributions
 middle-0.044
 Other-0.042
 particularly-0.039
 course-0.034
aw-0.033
Token it
Token ID340
Feature activation-1.47e+00

Pos logit contributions
ro+0.038
emo+0.032
uzz+0.031
keeping+0.031
onna+0.031
Neg logit contributions
 middle-0.024
aw-0.024
 particularly-0.023
 Other-0.023
 forward-0.021
Token,
Token ID11
Feature activation-7.41e-01

Pos logit contributions
ro+0.078
emo+0.074
uzz+0.071
keeping+0.068
icated+0.065
Neg logit contributions
 middle-0.042
aw-0.041
 particularly-0.040
 Other-0.037
 forward-0.037
Token and
Token ID290
Feature activation-4.38e-01

Pos logit contributions
ro+0.032
emo+0.029
uzz+0.028
keeping+0.027
onna+0.025
Neg logit contributions
aw-0.031
 particularly-0.029
 middle-0.029
 Other-0.028
 forward-0.026
Token when
Token ID618
Feature activation+5.24e-01

Pos logit contributions
ro+0.020
uzz+0.017
emo+0.017
keeping+0.017
 cheek+0.016
Neg logit contributions
aw-0.018
 middle-0.016
 Other-0.016
 particularly-0.016
-0.015
Token she
Token ID673
Feature activation-6.98e-02

Pos logit contributions
aw+0.023
 Other+0.021
 particularly+0.020
 middle+0.020
opl+0.019
Neg logit contributions
ro-0.023
uzz-0.019
keeping-0.019
emo-0.018
onna-0.018
Token came
Token ID1625
Feature activation-2.18e-01

Pos logit contributions
ro+0.003
uzz+0.003
keeping+0.003
emo+0.003
 cheek+0.003
Neg logit contributions
aw-0.003
 Other-0.003
 middle-0.002
 particularly-0.002
-0.002
Token,
Token ID11
Feature activation+5.09e-01

Pos logit contributions
aw+0.034
 Other+0.031
opl+0.030
 particularly+0.029
ifying+0.028
Neg logit contributions
ro-0.034
uzz-0.029
keeping-0.027
emo-0.027
 seek-0.026
Token her
Token ID607
Feature activation-3.82e-02

Pos logit contributions
aw+0.023
 Other+0.021
opl+0.020
 particularly+0.020
 middle+0.020
Neg logit contributions
ro-0.021
uzz-0.018
keeping-0.017
emo-0.017
 cheek-0.016
Token mom
Token ID1995
Feature activation-2.40e-02

Pos logit contributions
ro+0.002
uzz+0.001
keeping+0.001
emo+0.001
onna+0.001
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.001
 particularly-0.001
-0.001
Token told
Token ID1297
Feature activation-4.63e-01

Pos logit contributions
ro+0.001
uzz+0.001
keeping+0.001
emo+0.001
onna+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 particularly-0.001
 middle-0.001
-0.001
Token her
Token ID607
Feature activation-5.03e-01

Pos logit contributions
ro+0.023
emo+0.021
uzz+0.021
keeping+0.020
onna+0.020
Neg logit contributions
aw-0.017
 Other-0.016
 middle-0.014
 particularly-0.014
 course-0.014
Token that
Token ID326
Feature activation-1.41e+00

Pos logit contributions
ro+0.025
uzz+0.021
emo+0.021
keeping+0.021
 cheek+0.020
Neg logit contributions
aw-0.018
 Other-0.017
 middle-0.016
 particularly-0.016
-0.015
Token if
Token ID611
Feature activation-1.24e+00

Pos logit contributions
ro+0.075
uzz+0.066
emo+0.065
 cheek+0.064
keeping+0.064
Neg logit contributions
aw-0.045
 particularly-0.043
 Other-0.041
 middle-0.039
 course-0.038
Token she
Token ID673
Feature activation-1.10e+00

Pos logit contributions
ro+0.068
uzz+0.063
keeping+0.060
emo+0.059
 cheek+0.059
Neg logit contributions
aw-0.037
 particularly-0.033
 Other-0.033
 middle-0.032
-0.030
Token didn
Token ID1422
Feature activation+8.61e-01

Pos logit contributions
ro+0.053
keeping+0.051
uzz+0.049
emo+0.046
 cheek+0.046
Neg logit contributions
aw-0.037
 particularly-0.036
-0.034
 Other-0.034
 middle-0.032
Token't
Token ID470
Feature activation-1.24e+00

Pos logit contributions
aw+0.035
 Other+0.030
 middle+0.029
 particularly+0.028
opl+0.027
Neg logit contributions
ro-0.041
uzz-0.035
emo-0.034
onna-0.033
keeping-0.033
Token clean
Token ID3424
Feature activation-2.74e-01

Pos logit contributions
ro+0.058
keeping+0.053
uzz+0.052
emo+0.051
onna+0.047
Neg logit contributions
 Other-0.043
 particularly-0.042
 middle-0.042
aw-0.040
 forward-0.039
Token's
Token ID338
Feature activation-1.94e-02

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.003
uzz-0.003
emo-0.003
keeping-0.003
onna-0.003
Token what
Token ID644
Feature activation-7.67e-01

Pos logit contributions
ro+0.001
uzz+0.001
keeping+0.001
emo+0.001
onna+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token makes
Token ID1838
Feature activation-6.02e-01

Pos logit contributions
ro+0.036
uzz+0.031
keeping+0.030
emo+0.028
 cheek+0.027
Neg logit contributions
aw-0.031
 Other-0.029
 particularly-0.028
 middle-0.027
 course-0.025
Token you
Token ID345
Feature activation-1.04e+00

Pos logit contributions
ro+0.027
uzz+0.023
keeping+0.022
emo+0.022
onna+0.021
Neg logit contributions
aw-0.026
 Other-0.025
 middle-0.023
 particularly-0.023
 course-0.023
Token happy
Token ID3772
Feature activation-3.09e-01

Pos logit contributions
ro+0.049
uzz+0.043
keeping+0.040
emo+0.038
onna+0.038
Neg logit contributions
aw-0.041
 middle-0.039
 Other-0.039
 particularly-0.039
opl-0.034
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
ro+0.013
uzz+0.012
keeping+0.011
emo+0.011
onna+0.011
Neg logit contributions
aw-0.013
 Other-0.013
 middle-0.012
 particularly-0.012
opl-0.011
Token makes
Token ID1838
Feature activation-1.03e+00

Pos logit contributions
ro+0.080
uzz+0.071
emo+0.066
keeping+0.066
ge+0.065
Neg logit contributions
aw-0.051
 middle-0.047
 Other-0.047
 particularly-0.046
 best-0.046
Token others
Token ID1854
Feature activation-1.04e+00

Pos logit contributions
ro+0.060
uzz+0.051
keeping+0.049
onna+0.049
emo+0.047
Neg logit contributions
aw-0.035
 Other-0.032
 middle-0.028
 particularly-0.027
 course-0.026
Token happy
Token ID3772
Feature activation-9.39e-01

Pos logit contributions
ro+0.045
uzz+0.039
keeping+0.037
emo+0.036
onna+0.035
Neg logit contributions
aw-0.046
 middle-0.042
 Other-0.042
 particularly-0.040
 forward-0.038
Token too
Token ID1165
Feature activation-8.21e-01

Pos logit contributions
ro+0.045
uzz+0.041
keeping+0.038
emo+0.038
onna+0.037
Neg logit contributions
aw-0.036
 Other-0.034
 middle-0.032
 particularly-0.029
opl-0.028
Token."
Token ID526
Feature activation-1.57e-01

Pos logit contributions
ro+0.042
uzz+0.037
keeping+0.036
emo+0.035
onna+0.033
Neg logit contributions
aw-0.030
 Other-0.029
 middle-0.026
 particularly-0.025
opl-0.023
Token water
Token ID1660
Feature activation-1.66e-01

Pos logit contributions
 Other+0.062
aw+0.061
+0.060
ement+0.060
opl+0.058
Neg logit contributions
ro-0.039
emo-0.033
keeping-0.030
 cheek-0.028
 seek-0.027
Token.
Token ID13
Feature activation-4.63e-01

Pos logit contributions
ro+0.008
uzz+0.007
emo+0.007
keeping+0.007
onna+0.006
Neg logit contributions
aw-0.007
 Other-0.006
 middle-0.006
 particularly-0.006
-0.005
Token The
Token ID383
Feature activation-1.13e-01

Pos logit contributions
ro+0.026
uzz+0.024
keeping+0.023
emo+0.023
 cheek+0.022
Neg logit contributions
aw-0.014
 Other-0.013
 middle-0.012
-0.011
 particularly-0.011
Token net
Token ID2010
Feature activation-3.97e-01

Pos logit contributions
ro+0.006
uzz+0.005
keeping+0.005
emo+0.005
onna+0.005
Neg logit contributions
aw-0.004
 Other-0.004
 middle-0.004
 particularly-0.004
-0.003
Token was
Token ID373
Feature activation-9.04e-01

Pos logit contributions
ro+0.019
uzz+0.017
keeping+0.016
emo+0.016
onna+0.015
Neg logit contributions
aw-0.015
 middle-0.014
 Other-0.014
 particularly-0.014
-0.013
Token so
Token ID523
Feature activation-2.01e+00

Pos logit contributions
ro+0.050
keeping+0.044
uzz+0.043
emo+0.042
and+0.040
Neg logit contributions
aw-0.027
 particularly-0.026
 middle-0.026
 Other-0.024
ifying-0.022
Token big
Token ID1263
Feature activation+3.91e-02

Pos logit contributions
ro+0.132
keeping+0.130
uzz+0.119
ge+0.118
nes+0.111
Neg logit contributions
aw-0.043
 middle-0.037
 Other-0.032
 particularly-0.030
 best-0.027
Token that
Token ID326
Feature activation-4.73e-02

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.002
Token it
Token ID340
Feature activation-1.39e+00

Pos logit contributions
ro+0.002
uzz+0.002
keeping+0.002
emo+0.002
 cheek+0.001
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token could
Token ID714
Feature activation-7.74e-01

Pos logit contributions
ro+0.074
keeping+0.070
uzz+0.069
and+0.062
 cheek+0.062
Neg logit contributions
aw-0.044
 particularly-0.041
 middle-0.036
 Other-0.035
-0.033
Token catch
Token ID4929
Feature activation-6.00e-01

Pos logit contributions
ro+0.032
keeping+0.029
uzz+0.028
emo+0.026
and+0.025
Neg logit contributions
 middle-0.032
aw-0.032
 particularly-0.031
 Other-0.030
-0.029
Token was
Token ID373
Feature activation+4.72e-01

Pos logit contributions
aw+0.032
 Other+0.031
 middle+0.029
opl+0.028
+0.028
Neg logit contributions
ro-0.028
uzz-0.023
emo-0.022
onna-0.021
keeping-0.021
Token so
Token ID523
Feature activation-3.37e-01

Pos logit contributions
aw+0.018
 Other+0.017
 middle+0.016
 particularly+0.016
+0.015
Neg logit contributions
ro-0.022
uzz-0.019
emo-0.019
keeping-0.019
onna-0.018
Token delicious
Token ID12625
Feature activation-7.48e-02

Pos logit contributions
ro+0.018
keeping+0.016
uzz+0.016
emo+0.015
 cheek+0.014
Neg logit contributions
aw-0.012
 Other-0.011
 middle-0.010
 particularly-0.010
-0.010
Token!
Token ID0
Feature activation-1.36e-01

Pos logit contributions
ro+0.004
uzz+0.003
emo+0.003
keeping+0.003
 cheek+0.003
Neg logit contributions
aw-0.003
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token The
Token ID383
Feature activation-6.44e-01

Pos logit contributions
ro+0.007
uzz+0.007
emo+0.006
keeping+0.006
 cheek+0.006
Neg logit contributions
aw-0.004
 Other-0.004
 middle-0.004
 particularly-0.004
-0.004
Token little
Token ID1310
Feature activation-1.51e+00

Pos logit contributions
ro+0.028
uzz+0.025
keeping+0.025
onna+0.024
emo+0.024
Neg logit contributions
aw-0.027
 middle-0.027
 particularly-0.025
 Other-0.025
-0.024
Token girl
Token ID2576
Feature activation+1.42e-01

Pos logit contributions
ro+0.069
uzz+0.067
onna+0.066
emo+0.065
keeping+0.064
Neg logit contributions
aw-0.056
 middle-0.055
 Other-0.050
 particularly-0.049
 course-0.049
Token was
Token ID373
Feature activation+6.38e-01

Pos logit contributions
aw+0.006
 Other+0.006
 middle+0.006
 particularly+0.005
+0.005
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.005
onna-0.005
Token so
Token ID523
Feature activation-6.06e-01

Pos logit contributions
aw+0.024
 Other+0.023
 middle+0.021
+0.021
 particularly+0.020
Neg logit contributions
ro-0.030
uzz-0.027
emo-0.026
keeping-0.026
onna-0.025
Token happy
Token ID3772
Feature activation-3.10e-01

Pos logit contributions
ro+0.026
keeping+0.022
uzz+0.021
emo+0.021
ge+0.019
Neg logit contributions
aw-0.028
 middle-0.025
 Other-0.025
 particularly-0.025
-0.023
Token that
Token ID326
Feature activation-9.64e-02

Pos logit contributions
ro+0.016
uzz+0.015
emo+0.014
keeping+0.014
 cheek+0.013
Neg logit contributions
aw-0.011
 Other-0.009
 particularly-0.009
 middle-0.009
-0.008