TOP ACTIVATIONS
MAX = 0.076

little boy carrying a box filled with empty boxes , wood
she found a huge box filled with costumes . She tried
I found a shell !" she said . Her mom smiled
my ! A flower !" she said .
this is so pretty !" she said . But then she
found a bath robe !" she said happily .
it too ?" Tim shared the marble with Sue . They
and content . B obby and the star were
Yes , you can go , but you need to be
shoe is so pretty !" she said to herself .
and gave it a good tidy . The nap
From that day forward the little girl was never without her
get the big pastry ?" she asked . Her mom said
said it was a very special shell . The little girl
! A fire work !" she said . Tim looked up
Kate and the other was named Ben . They were playing
are special ." B enny promised to show Ell
Joy and the other was named Jan . Jan was very
" What 's this ?" she said . The girl picked
their piles together and share their finds . They had more

MIN ACTIVATIONS
MIN = -0.091

pretty kitten . From that day on , Lily made sure
hard work . From that day on , Jerry became known
helped him . From that day on , Billy and Harry
. From that day on , Nut ty made
motorcycles . He had a toy motorcycle that he would ride
. From that day on , Jolly and his
helicopters . He had a toy helicopter that he played with
nut together . From that day on , Nut ty knew
good friends . From that day on , Max always replied
very happy . From that day on , Sue knew she
helicopters . He had a toy helicopter that he played with
gray cloth . From that day onwards , Sandra always enjoyed
of fun . From that day on , Max and Luna
right thing . From that day on , Nut ty always
. From that day on , Mitt ens and
river bank . From that day on , Gig i knew
fun day . From that day on , they became good
each other . From that day on , Lily and Rose
with Fred . From that day on , Tim felt strong
of fun . From that day on , Lily always drew

INTERVAL 0.035 - 0.076
CONTAINS 0.964%

He wants to make it go away . He looks for
happy and whist ling a fun song . Tom wanted to
. They pretended they were on Mars , and they wore
excited that he ran to it and kicked it with all
The ticket was for a fun ride at the carn ival

INTERVAL -0.007 - 0.035
CONTAINS 68.202%

in the to aster and pushed the lever down . She
" Yes , you can take some water ."
cried . " This cake is bad ! It hurts !"
" Stop it , kids !" the man said
B un followed the squirrel to a big tree . The

INTERVAL -0.049 - -0.007
CONTAINS 30.612%

and they walked back inside . <|endoftext|> Once upon a time
had ruined his chance to see the surprise . He felt
Sara and Ben became friends . She was happy that they
tasted y ummy . Finally , Lucy added some crunch y
L ily loved her hairy kitten , Spot . She liked

INTERVAL -0.091 - -0.049
CONTAINS 0.223%

big hug . From that day on , Sarah hugged her
machine work . From that day on , Tim my loved
her hive blew away in the wind . B
, it caused the ash tr ay to scatter all over
ber . He worked in a small shop with a big

PER-TOKEN DETAILS (TOP ACTIVATIONS)
Token little
Token ID1310
Feature activation+1.04e-02

Pos logit contributions
ro+0.009
uzz+0.008
emo+0.008
keeping+0.008
onna+0.007
Neg logit contributions
aw-0.008
 Other-0.007
 middle-0.007
 particularly-0.007
-0.007
Token boy
Token ID2933
Feature activation-1.77e-02

Pos logit contributions
ro+0.017
uzz+0.016
emo+0.016
keeping+0.015
onna+0.014
Neg logit contributions
aw-0.019
 Other-0.017
 middle-0.017
 particularly-0.017
-0.016
Token carrying
Token ID6872
Feature activation+1.28e-03

Pos logit contributions
aw+0.025
 Other+0.024
+0.023
 particularly+0.022
opl+0.022
Neg logit contributions
ro-0.036
uzz-0.031
keeping-0.030
emo-0.030
 cheek-0.028
Token a
Token ID257
Feature activation+2.78e-03

Pos logit contributions
ro+0.003
uzz+0.002
emo+0.002
keeping+0.002
onna+0.002
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token box
Token ID3091
Feature activation+6.78e-03

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
onna+0.004
Neg logit contributions
aw-0.005
 Other-0.004
 particularly-0.004
 middle-0.004
-0.004
Token filled
Token ID5901
Feature activation+7.63e-02

Pos logit contributions
ro+0.013
emo+0.012
uzz+0.012
keeping+0.011
 cheek+0.011
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.009
 particularly-0.009
-0.009
Token with
Token ID351
Feature activation+2.25e-02

Pos logit contributions
ro+0.179
 outage+0.167
onna+0.166
 cheek+0.159
ge+0.156
Neg logit contributions
 middle-0.076
aw-0.074
 course-0.055
 Other-0.050
 all-0.049
Token empty
Token ID6565
Feature activation+1.74e-02

Pos logit contributions
ro+0.041
emo+0.035
uzz+0.035
onna+0.033
keeping+0.032
Neg logit contributions
aw-0.039
 middle-0.037
 Other-0.036
 particularly-0.034
ifying-0.032
Token boxes
Token ID10559
Feature activation+1.60e-02

Pos logit contributions
ro+0.032
emo+0.029
uzz+0.029
keeping+0.026
onna+0.026
Neg logit contributions
aw-0.029
 Other-0.026
 middle-0.026
 particularly-0.024
ifying-0.023
Token,
Token ID11
Feature activation+1.66e-02

Pos logit contributions
ro+0.032
emo+0.029
uzz+0.028
keeping+0.026
onna+0.025
Neg logit contributions
aw-0.024
 middle-0.021
 Other-0.021
 particularly-0.021
-0.020
Token wood
Token ID4898
Feature activation+2.50e-02

Pos logit contributions
ro+0.029
uzz+0.026
emo+0.026
keeping+0.024
onna+0.023
Neg logit contributions
aw-0.028
 middle-0.026
 particularly-0.025
 Other-0.025
ifying-0.023
Token she
Token ID673
Feature activation+1.89e-02

Pos logit contributions
aw+0.007
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.005
 cheek-0.005
Token found
Token ID1043
Feature activation-2.50e-02

Pos logit contributions
ro+0.040
uzz+0.036
 cheek+0.035
keeping+0.035
emo+0.034
Neg logit contributions
aw-0.027
 particularly-0.023
 middle-0.023
 Other-0.023
-0.019
Token a
Token ID257
Feature activation+1.61e-02

Pos logit contributions
aw+0.046
+0.045
 Other+0.045
opl+0.044
 particularly+0.042
Neg logit contributions
ro-0.037
uzz-0.033
emo-0.031
keeping-0.030
 itself-0.028
Token huge
Token ID3236
Feature activation+1.62e-02

Pos logit contributions
ro+0.030
uzz+0.026
keeping+0.026
emo+0.025
onna+0.022
Neg logit contributions
aw-0.026
 particularly-0.025
 middle-0.024
 Other-0.023
-0.023
Token box
Token ID3091
Feature activation-1.01e-02

Pos logit contributions
ro+0.028
uzz+0.025
emo+0.024
keeping+0.024
and+0.021
Neg logit contributions
aw-0.028
 middle-0.026
 particularly-0.026
 Other-0.025
-0.025
Token filled
Token ID5901
Feature activation+7.27e-02

Pos logit contributions
aw+0.016
 Other+0.015
 particularly+0.014
 middle+0.014
+0.014
Neg logit contributions
ro-0.019
uzz-0.016
emo-0.016
keeping-0.016
 cheek-0.015
Token with
Token ID351
Feature activation+3.34e-02

Pos logit contributions
ro+0.162
emo+0.150
onna+0.147
 cheek+0.145
uzz+0.143
Neg logit contributions
aw-0.082
 middle-0.080
 course-0.066
date-0.062
 particularly-0.062
Token costumes
Token ID24138
Feature activation+3.37e-03

Pos logit contributions
ro+0.061
emo+0.054
uzz+0.053
onna+0.051
 cheek+0.050
Neg logit contributions
aw-0.058
 middle-0.053
 particularly-0.052
 Other-0.050
ifying-0.046
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
ro+0.006
uzz+0.006
emo+0.006
keeping+0.005
 cheek+0.005
Neg logit contributions
aw-0.005
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token She
Token ID1375
Feature activation+3.59e-02

Pos logit contributions
ro+0.030
uzz+0.028
emo+0.027
keeping+0.027
ge+0.026
Neg logit contributions
aw-0.013
 Other-0.013
 middle-0.012
-0.011
 particularly-0.011
Token tried
Token ID3088
Feature activation+1.00e-02

Pos logit contributions
ro+0.072
keeping+0.069
 cheek+0.063
uzz+0.061
emo+0.059
Neg logit contributions
aw-0.051
 particularly-0.050
 middle-0.045
 Other-0.044
Eventually-0.039
Token I
Token ID314
Feature activation+7.25e-03

Pos logit contributions
ro+0.040
keeping+0.035
uzz+0.034
emo+0.033
ge+0.031
Neg logit contributions
aw-0.034
 Other-0.030
 middle-0.029
 course-0.028
 particularly-0.028
Token found
Token ID1043
Feature activation+8.26e-03

Pos logit contributions
ro+0.014
uzz+0.013
keeping+0.013
emo+0.012
 cheek+0.012
Neg logit contributions
aw-0.011
 Other-0.010
 particularly-0.010
 middle-0.010
-0.010
Token a
Token ID257
Feature activation+4.27e-02

Pos logit contributions
ro+0.017
uzz+0.014
emo+0.014
keeping+0.014
onna+0.013
Neg logit contributions
aw-0.014
 Other-0.012
 middle-0.011
 particularly-0.011
 course-0.011
Token shell
Token ID7582
Feature activation-3.38e-04

Pos logit contributions
ro+0.075
keeping+0.074
uzz+0.070
ge+0.063
and+0.061
Neg logit contributions
aw-0.069
 particularly-0.068
 middle-0.068
 best-0.059
-0.058
Token!"
Token ID2474
Feature activation+2.35e-02

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.000
emo-0.000
keeping-0.000
 cheek-0.000
Token she
Token ID673
Feature activation+7.16e-02

Pos logit contributions
ro+0.049
uzz+0.046
emo+0.044
keeping+0.044
ge+0.043
Neg logit contributions
aw-0.031
 middle-0.030
 Other-0.030
 particularly-0.027
-0.026
Token said
Token ID531
Feature activation+2.56e-02

Pos logit contributions
ro+0.170
keeping+0.163
uzz+0.161
ge+0.152
onna+0.144
Neg logit contributions
aw-0.078
 middle-0.062
 particularly-0.055
 named-0.054
 sudden-0.052
Token.
Token ID13
Feature activation+2.38e-02

Pos logit contributions
ro+0.059
emo+0.053
keeping+0.053
uzz+0.052
icated+0.051
Neg logit contributions
aw-0.028
 Other-0.025
 middle-0.025
-0.024
 forward-0.023
Token Her
Token ID2332
Feature activation+2.35e-02

Pos logit contributions
ro+0.056
uzz+0.053
keeping+0.053
ge+0.053
emo+0.051
Neg logit contributions
 Other-0.024
-0.022
aw-0.021
 middle-0.021
 particularly-0.018
Token mom
Token ID1995
Feature activation+1.26e-02

Pos logit contributions
ro+0.051
keeping+0.046
uzz+0.045
ge+0.043
icated+0.043
Neg logit contributions
 middle-0.029
aw-0.029
 Other-0.027
-0.026
 best-0.025
Token smiled
Token ID13541
Feature activation+1.34e-03

Pos logit contributions
ro+0.023
keeping+0.020
uzz+0.019
emo+0.019
onna+0.018
Neg logit contributions
aw-0.022
 Other-0.020
 particularly-0.019
 middle-0.019
 course-0.018
Tokenmy
Token ID1820
Feature activation+1.59e-02

Pos logit contributions
ro+0.021
uzz+0.019
emo+0.018
keeping+0.018
onna+0.017
Neg logit contributions
aw-0.016
 Other-0.014
 middle-0.013
 particularly-0.013
opl-0.013
Token!
Token ID0
Feature activation+1.05e-02

Pos logit contributions
ro+0.036
uzz+0.033
emo+0.032
keeping+0.031
onna+0.029
Neg logit contributions
aw-0.021
 Other-0.017
 middle-0.016
-0.016
 particularly-0.015
Token A
Token ID317
Feature activation+8.20e-03

Pos logit contributions
ro+0.022
uzz+0.019
keeping+0.019
emo+0.019
ge+0.018
Neg logit contributions
aw-0.015
 Other-0.014
-0.013
 middle-0.013
 particularly-0.013
Token flower
Token ID15061
Feature activation-1.08e-02

Pos logit contributions
ro+0.014
uzz+0.012
keeping+0.012
emo+0.011
 cheek+0.010
Neg logit contributions
aw-0.015
 Other-0.014
 particularly-0.014
 middle-0.014
-0.013
Token!"
Token ID2474
Feature activation+2.63e-02

Pos logit contributions
aw+0.020
 Other+0.019
 particularly+0.018
 middle+0.018
+0.018
Neg logit contributions
ro-0.018
uzz-0.015
emo-0.014
keeping-0.014
 cheek-0.014
Token she
Token ID673
Feature activation+7.06e-02

Pos logit contributions
ro+0.055
uzz+0.051
keeping+0.048
emo+0.047
ge+0.047
Neg logit contributions
aw-0.035
 middle-0.035
 Other-0.033
 particularly-0.031
-0.029
Token said
Token ID531
Feature activation+1.93e-02

Pos logit contributions
ro+0.167
keeping+0.159
ge+0.150
uzz+0.148
onna+0.139
Neg logit contributions
aw-0.080
 particularly-0.063
 middle-0.062
 named-0.062
 sudden-0.061
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
ro+0.043
emo+0.038
keeping+0.037
uzz+0.037
icated+0.036
Neg logit contributions
aw-0.024
 Other-0.022
-0.021
 middle-0.021
 particularly-0.020
Token
Token ID220
Feature activation+7.81e-03

Pos logit contributions
ro+0.031
uzz+0.028
keeping+0.028
emo+0.027
ge+0.027
Neg logit contributions
aw-0.017
 Other-0.017
-0.017
 middle-0.016
 particularly-0.015
Token
Token ID198
Feature activation+1.43e-02

Pos logit contributions
ro+0.016
uzz+0.014
keeping+0.014
emo+0.013
 cheek+0.013
Neg logit contributions
aw-0.011
 Other-0.011
 middle-0.010
 particularly-0.010
-0.010
Token
Token ID198
Feature activation+1.81e-02

Pos logit contributions
ro+0.031
uzz+0.027
keeping+0.027
emo+0.026
 cheek+0.025
Neg logit contributions
aw-0.019
 Other-0.018
 middle-0.017
 particularly-0.017
-0.016
Token this
Token ID428
Feature activation+1.02e-02

Pos logit contributions
ro+0.028
emo+0.023
keeping+0.023
uzz+0.023
ge+0.021
Neg logit contributions
aw-0.024
 Other-0.022
 middle-0.022
 particularly-0.021
 course-0.021
Token is
Token ID318
Feature activation+2.77e-02

Pos logit contributions
ro+0.016
uzz+0.014
keeping+0.013
emo+0.013
 cheek+0.011
Neg logit contributions
aw-0.021
 middle-0.019
 Other-0.018
 particularly-0.018
 course-0.017
Token so
Token ID523
Feature activation+3.50e-02

Pos logit contributions
ro+0.065
uzz+0.056
keeping+0.056
emo+0.055
ge+0.052
Neg logit contributions
aw-0.035
 middle-0.032
 particularly-0.031
 course-0.028
 Other-0.027
Token pretty
Token ID2495
Feature activation+2.12e-02

Pos logit contributions
ro+0.066
keeping+0.060
uzz+0.056
ge+0.056
emo+0.052
Neg logit contributions
aw-0.057
 middle-0.057
 particularly-0.052
 Other-0.050
Eventually-0.045
Token!"
Token ID2474
Feature activation+3.29e-02

Pos logit contributions
ro+0.038
uzz+0.034
emo+0.033
keeping+0.031
 cheek+0.028
Neg logit contributions
aw-0.038
 Other-0.032
 middle-0.032
 particularly-0.032
 course-0.030
Token she
Token ID673
Feature activation+7.05e-02

Pos logit contributions
ro+0.074
uzz+0.071
emo+0.068
ge+0.065
keeping+0.064
Neg logit contributions
aw-0.040
 middle-0.038
 Other-0.035
 particularly-0.031
 Everyone-0.029
Token said
Token ID531
Feature activation+2.79e-02

Pos logit contributions
ro+0.165
keeping+0.153
uzz+0.146
onna+0.140
emo+0.136
Neg logit contributions
aw-0.081
 middle-0.067
 particularly-0.064
 named-0.057
 sudden-0.056
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
ro+0.063
emo+0.057
keeping+0.055
icated+0.054
uzz+0.053
Neg logit contributions
aw-0.033
 middle-0.030
 Other-0.030
-0.028
 particularly-0.028
Token But
Token ID887
Feature activation+1.25e-02

Pos logit contributions
ro+0.066
uzz+0.063
ge+0.062
keeping+0.062
emo+0.061
Neg logit contributions
 Other-0.029
-0.027
aw-0.026
 middle-0.024
 Eventually-0.022
Token then
Token ID788
Feature activation-1.48e-02

Pos logit contributions
ro+0.026
keeping+0.022
emo+0.022
uzz+0.022
onna+0.021
Neg logit contributions
aw-0.018
 middle-0.017
 Other-0.017
 particularly-0.017
-0.016
Token she
Token ID673
Feature activation-1.83e-02

Pos logit contributions
aw+0.024
 Other+0.022
opl+0.021
 particularly+0.021
 middle+0.021
Neg logit contributions
ro-0.027
uzz-0.024
emo-0.023
keeping-0.022
 cheek-0.022
Token found
Token ID1043
Feature activation+8.95e-03

Pos logit contributions
ro+0.038
keeping+0.035
uzz+0.033
emo+0.032
 cheek+0.032
Neg logit contributions
aw-0.029
 particularly-0.027
 Other-0.026
-0.026
 middle-0.025
Token a
Token ID257
Feature activation+2.63e-02

Pos logit contributions
ro+0.017
uzz+0.014
emo+0.014
keeping+0.013
onna+0.013
Neg logit contributions
aw-0.016
 Other-0.014
 middle-0.014
 particularly-0.013
 course-0.013
Token bath
Token ID7837
Feature activation+3.25e-02

Pos logit contributions
ro+0.048
keeping+0.044
uzz+0.043
emo+0.040
and+0.037
Neg logit contributions
 particularly-0.043
 middle-0.041
-0.040
aw-0.039
 Other-0.039
Tokenrobe
Token ID25481
Feature activation+5.12e-03

Pos logit contributions
ro+0.052
uzz+0.049
emo+0.048
keeping+0.044
ge+0.043
Neg logit contributions
aw-0.054
 middle-0.050
-0.049
 Other-0.048
 particularly-0.048
Token!"
Token ID2474
Feature activation+3.57e-02

Pos logit contributions
ro+0.009
uzz+0.008
emo+0.007
keeping+0.007
 cheek+0.007
Neg logit contributions
aw-0.009
 Other-0.008
 middle-0.008
 particularly-0.008
-0.008
Token she
Token ID673
Feature activation+7.00e-02

Pos logit contributions
ro+0.073
uzz+0.066
keeping+0.065
emo+0.065
ge+0.063
Neg logit contributions
aw-0.051
 middle-0.051
 Other-0.045
 particularly-0.044
-0.039
Token said
Token ID531
Feature activation+2.57e-02

Pos logit contributions
ro+0.160
keeping+0.153
uzz+0.146
emo+0.143
ge+0.137
Neg logit contributions
aw-0.067
 particularly-0.066
 middle-0.063
 named-0.057
 sudden-0.056
Token happily
Token ID18177
Feature activation+3.81e-02

Pos logit contributions
ro+0.057
emo+0.051
keeping+0.049
icated+0.049
uzz+0.048
Neg logit contributions
aw-0.031
 middle-0.028
 Other-0.028
 forward-0.027
-0.027
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
ro+0.091
emo+0.088
uzz+0.078
icated+0.078
onna+0.078
Neg logit contributions
aw-0.039
 forward-0.033
 middle-0.032
 particularly-0.030
-0.028
Token
Token ID220
Feature activation+1.96e-02

Pos logit contributions
ro+0.052
keeping+0.047
emo+0.046
uzz+0.045
ge+0.045
Neg logit contributions
aw-0.022
 Other-0.021
 middle-0.021
 particularly-0.019
-0.018
Token
Token ID198
Feature activation+3.15e-02

Pos logit contributions
ro+0.043
keeping+0.038
uzz+0.037
emo+0.037
 cheek+0.036
Neg logit contributions
aw-0.025
 Other-0.024
 middle-0.022
-0.021
 particularly-0.021
Token it
Token ID340
Feature activation-1.09e-02

Pos logit contributions
ro+0.027
uzz+0.022
keeping+0.021
emo+0.021
 cheek+0.019
Neg logit contributions
aw-0.030
 Other-0.027
 middle-0.027
 course-0.026
 particularly-0.026
Token too
Token ID1165
Feature activation+1.38e-03

Pos logit contributions
aw+0.019
 Other+0.017
 particularly+0.016
 middle+0.016
+0.016
Neg logit contributions
ro-0.019
emo-0.016
uzz-0.016
keeping-0.016
 cheek-0.015
Token?"
Token ID1701
Feature activation-9.49e-03

Pos logit contributions
ro+0.003
uzz+0.002
keeping+0.002
emo+0.002
 cheek+0.002
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token Tim
Token ID5045
Feature activation+4.15e-02

Pos logit contributions
aw+0.021
 Other+0.020
 particularly+0.019
 middle+0.019
ifying+0.019
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.009
keeping-0.009
 cheek-0.008
Token shared
Token ID4888
Feature activation+1.90e-02

Pos logit contributions
keeping+0.081
ro+0.078
uzz+0.071
emo+0.069
onna+0.067
Neg logit contributions
aw-0.066
 particularly-0.056
 Other-0.054
 course-0.050
 middle-0.049
Token the
Token ID262
Feature activation+6.99e-02

Pos logit contributions
ro+0.040
keeping+0.035
uzz+0.033
emo+0.033
onna+0.032
Neg logit contributions
aw-0.028
 middle-0.025
 Other-0.024
 course-0.023
 particularly-0.023
Token marble
Token ID30623
Feature activation+4.06e-02

Pos logit contributions
keeping+0.119
ro+0.117
uzz+0.116
nes+0.108
onna+0.104
Neg logit contributions
 middle-0.124
 course-0.110
 best-0.105
 Other-0.104
 particularly-0.103
Token with
Token ID351
Feature activation-3.26e-02

Pos logit contributions
ro+0.079
keeping+0.070
uzz+0.070
emo+0.069
onna+0.064
Neg logit contributions
aw-0.061
 middle-0.057
 particularly-0.052
 Other-0.051
 forward-0.050
Token Sue
Token ID26089
Feature activation+1.27e-02

Pos logit contributions
ifying+0.080
aw+0.079
 particularly+0.078
 Other+0.078
opl+0.076
Neg logit contributions
ro-0.029
uzz-0.025
emo-0.024
keeping-0.020
 itself-0.019
Token.
Token ID13
Feature activation+9.28e-04

Pos logit contributions
ro+0.026
keeping+0.023
emo+0.023
uzz+0.022
onna+0.021
Neg logit contributions
aw-0.019
 Other-0.017
 particularly-0.016
 middle-0.016
-0.015
Token They
Token ID1119
Feature activation+1.82e-03

Pos logit contributions
ro+0.002
uzz+0.002
emo+0.002
keeping+0.001
 cheek+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token and
Token ID290
Feature activation+9.01e-03

Pos logit contributions
ro+0.010
uzz+0.009
emo+0.009
keeping+0.008
onna+0.008
Neg logit contributions
aw-0.009
 Other-0.008
 particularly-0.008
 middle-0.008
-0.008
Token content
Token ID2695
Feature activation+4.73e-03

Pos logit contributions
ro+0.017
uzz+0.015
emo+0.014
keeping+0.014
onna+0.013
Neg logit contributions
aw-0.014
 middle-0.013
 Other-0.013
 particularly-0.013
opl-0.012
Token.
Token ID13
Feature activation-1.07e-02

Pos logit contributions
ro+0.009
uzz+0.008
emo+0.008
keeping+0.007
onna+0.007
Neg logit contributions
aw-0.008
 Other-0.007
 middle-0.007
 particularly-0.007
-0.006
Token
Token ID198
Feature activation-7.80e-03

Pos logit contributions
aw+0.018
 Other+0.016
 middle+0.016
 particularly+0.016
+0.015
Neg logit contributions
ro-0.020
uzz-0.017
emo-0.017
keeping-0.016
onna-0.015
Token
Token ID198
Feature activation-1.40e-02

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.010
 particularly+0.010
+0.010
Neg logit contributions
ro-0.015
uzz-0.013
emo-0.013
keeping-0.013
 cheek-0.012
TokenB
Token ID33
Feature activation+6.97e-02

Pos logit contributions
aw+0.021
 Other+0.019
 middle+0.017
+0.017
 particularly+0.017
Neg logit contributions
ro-0.028
uzz-0.025
emo-0.025
keeping-0.024
onna-0.023
Tokenobby
Token ID11369
Feature activation+9.30e-03

Pos logit contributions
ro+0.096
 seek+0.094
keeping+0.093
 cheek+0.091
 common+0.089
Neg logit contributions
 Other-0.115
-0.114
aw-0.112
 Eventually-0.104
ement-0.101
Token and
Token ID290
Feature activation+3.19e-02

Pos logit contributions
ro+0.018
uzz+0.016
emo+0.015
keeping+0.015
 cheek+0.014
Neg logit contributions
aw-0.015
 Other-0.013
 particularly-0.013
 middle-0.013
-0.012
Token the
Token ID262
Feature activation+3.79e-02

Pos logit contributions
ro+0.071
onna+0.061
emo+0.060
uzz+0.059
 cheek+0.058
Neg logit contributions
aw-0.037
 Other-0.036
 middle-0.035
 particularly-0.034
 course-0.033
Token star
Token ID3491
Feature activation-5.16e-03

Pos logit contributions
ro+0.074
keeping+0.067
uzz+0.066
emo+0.062
onna+0.061
Neg logit contributions
 middle-0.066
aw-0.058
 particularly-0.058
 course-0.054
 Other-0.052
Token were
Token ID547
Feature activation-2.88e-02

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.008
keeping-0.008
 cheek-0.008
TokenYes
Token ID5297
Feature activation+1.20e-02

Pos logit contributions
ro+0.011
uzz+0.009
emo+0.009
keeping+0.009
 cheek+0.008
Neg logit contributions
aw-0.013
 Other-0.012
 particularly-0.012
 middle-0.012
-0.011
Token,
Token ID11
Feature activation+5.52e-02

Pos logit contributions
ro+0.028
keeping+0.026
emo+0.025
uzz+0.025
onna+0.023
Neg logit contributions
aw-0.014
 Other-0.013
 middle-0.012
 particularly-0.011
-0.011
Token you
Token ID345
Feature activation-1.49e-02

Pos logit contributions
keeping+0.111
ro+0.104
emo+0.097
uzz+0.094
ge+0.084
Neg logit contributions
aw-0.088
 course-0.073
 Other-0.072
 particularly-0.070
 middle-0.067
Token can
Token ID460
Feature activation-1.61e-02

Pos logit contributions
aw+0.028
 Other+0.026
 particularly+0.024
 middle+0.024
+0.024
Neg logit contributions
ro-0.025
uzz-0.020
emo-0.020
keeping-0.019
 cheek-0.019
Token go
Token ID467
Feature activation-1.21e-02

Pos logit contributions
aw+0.031
 Other+0.028
 particularly+0.027
 middle+0.026
ifying+0.026
Neg logit contributions
ro-0.027
uzz-0.022
emo-0.022
keeping-0.021
 cheek-0.020
Token,
Token ID11
Feature activation+6.97e-02

Pos logit contributions
aw+0.022
 Other+0.021
 middle+0.019
 particularly+0.019
+0.019
Neg logit contributions
ro-0.020
uzz-0.017
emo-0.017
keeping-0.017
 cheek-0.016
Token but
Token ID475
Feature activation+3.15e-02

Pos logit contributions
keeping+0.129
emo+0.115
ro+0.115
uzz+0.111
 common+0.098
Neg logit contributions
aw-0.112
 Other-0.101
 middle-0.098
 particularly-0.096
 best-0.096
Token you
Token ID345
Feature activation-6.94e-03

Pos logit contributions
ro+0.056
keeping+0.052
uzz+0.051
emo+0.049
onna+0.044
Neg logit contributions
aw-0.049
 middle-0.049
 course-0.047
 particularly-0.047
 Other-0.045
Token need
Token ID761
Feature activation+1.26e-02

Pos logit contributions
aw+0.011
 Other+0.010
 middle+0.010
 particularly+0.010
+0.009
Neg logit contributions
ro-0.013
uzz-0.012
emo-0.011
keeping-0.011
 cheek-0.011
Token to
Token ID284
Feature activation+2.30e-02

Pos logit contributions
ro+0.022
uzz+0.019
emo+0.019
keeping+0.018
 cheek+0.017
Neg logit contributions
aw-0.022
 Other-0.020
 middle-0.020
 particularly-0.020
-0.019
Token be
Token ID307
Feature activation-4.21e-03

Pos logit contributions
ro+0.040
uzz+0.036
emo+0.035
keeping+0.034
and+0.031
Neg logit contributions
aw-0.037
 middle-0.037
 Other-0.035
 particularly-0.034
-0.033
Token shoe
Token ID17292
Feature activation-7.16e-03

Pos logit contributions
aw+0.004
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.004
uzz-0.004
emo-0.004
keeping-0.004
 cheek-0.003
Token is
Token ID318
Feature activation+2.47e-02

Pos logit contributions
aw+0.012
 Other+0.011
 particularly+0.011
 middle+0.010
+0.010
Neg logit contributions
ro-0.013
uzz-0.011
emo-0.011
keeping-0.011
 cheek-0.010
Token so
Token ID523
Feature activation+3.23e-02

Pos logit contributions
ro+0.065
uzz+0.059
emo+0.056
keeping+0.056
ge+0.053
Neg logit contributions
aw-0.023
 particularly-0.022
 middle-0.021
 Other-0.018
 best-0.016
Token pretty
Token ID2495
Feature activation+1.52e-02

Pos logit contributions
ro+0.060
keeping+0.054
uzz+0.052
ge+0.049
emo+0.048
Neg logit contributions
aw-0.053
 middle-0.050
 particularly-0.049
 Other-0.047
opl-0.042
Token!"
Token ID2474
Feature activation+2.63e-02

Pos logit contributions
ro+0.027
uzz+0.023
emo+0.022
keeping+0.022
 cheek+0.020
Neg logit contributions
aw-0.028
 Other-0.024
 particularly-0.024
 middle-0.024
-0.022
Token she
Token ID673
Feature activation+6.94e-02

Pos logit contributions
ro+0.057
uzz+0.054
emo+0.050
keeping+0.050
ge+0.048
Neg logit contributions
aw-0.035
 middle-0.033
 Other-0.032
 particularly-0.029
-0.027
Token said
Token ID531
Feature activation+1.58e-02

Pos logit contributions
ro+0.162
keeping+0.156
ge+0.141
uzz+0.141
onna+0.136
Neg logit contributions
aw-0.080
 particularly-0.067
 middle-0.062
 named-0.058
 Other-0.056
Token to
Token ID284
Feature activation+1.54e-02

Pos logit contributions
ro+0.033
emo+0.029
keeping+0.029
uzz+0.029
icated+0.027
Neg logit contributions
aw-0.022
 Other-0.020
 middle-0.019
 particularly-0.019
-0.019
Token herself
Token ID5223
Feature activation+2.32e-02

Pos logit contributions
ro+0.030
uzz+0.026
keeping+0.026
emo+0.025
onna+0.024
Neg logit contributions
aw-0.023
 Other-0.022
 middle-0.022
 particularly-0.021
 course-0.021
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
ro+0.054
emo+0.047
keeping+0.046
uzz+0.045
 common+0.043
Neg logit contributions
aw-0.028
 Other-0.025
 middle-0.024
 particularly-0.023
 forward-0.022
Token
Token ID198
Feature activation+9.94e-03

Pos logit contributions
ro+0.040
uzz+0.037
keeping+0.037
emo+0.036
ge+0.035
Neg logit contributions
 Other-0.021
aw-0.021
-0.020
 middle-0.019
 particularly-0.018
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
ro+0.028
uzz+0.025
keeping+0.024
emo+0.024
icated+0.021
Neg logit contributions
aw-0.027
 particularly-0.026
 middle-0.025
 Other-0.024
-0.023
Token gave
Token ID2921
Feature activation+1.15e-02

Pos logit contributions
ro+0.028
uzz+0.025
emo+0.024
keeping+0.024
 cheek+0.022
Neg logit contributions
aw-0.026
 particularly-0.023
 middle-0.023
 Other-0.023
-0.021
Token it
Token ID340
Feature activation+2.94e-02

Pos logit contributions
ro+0.019
uzz+0.016
keeping+0.015
emo+0.015
onna+0.015
Neg logit contributions
aw-0.022
 middle-0.020
 Other-0.020
 particularly-0.019
 course-0.018
Token a
Token ID257
Feature activation+2.82e-02

Pos logit contributions
ro+0.064
keeping+0.056
emo+0.055
uzz+0.055
ge+0.053
Neg logit contributions
aw-0.038
 middle-0.035
 Other-0.035
 particularly-0.034
 forward-0.030
Token good
Token ID922
Feature activation+5.76e-02

Pos logit contributions
ro+0.052
keeping+0.052
uzz+0.049
emo+0.046
ge+0.042
Neg logit contributions
 particularly-0.043
 middle-0.041
 Other-0.040
-0.040
aw-0.040
Token tidy
Token ID43044
Feature activation+6.91e-02

Pos logit contributions
keeping+0.101
ro+0.097
uzz+0.095
ge+0.091
nes+0.090
Neg logit contributions
 middle-0.082
 Other-0.080
 particularly-0.079
aw-0.075
 course-0.072
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
uzz+0.119
emo+0.115
keeping+0.114
ge+0.111
icated+0.109
Neg logit contributions
 middle-0.104
 course-0.091
 Other-0.088
 meantime-0.088
 particularly-0.087
Token
Token ID198
Feature activation+1.66e-02

Pos logit contributions
ro+0.025
uzz+0.023
keeping+0.022
emo+0.022
 cheek+0.022
Neg logit contributions
aw-0.013
 Other-0.012
-0.011
 middle-0.011
 particularly-0.010
Token
Token ID198
Feature activation+8.13e-03

Pos logit contributions
ro+0.036
keeping+0.031
uzz+0.031
 cheek+0.031
emo+0.030
Neg logit contributions
aw-0.022
 Other-0.021
 middle-0.019
 particularly-0.019
-0.018
TokenThe
Token ID464
Feature activation+2.27e-02

Pos logit contributions
ro+0.019
uzz+0.017
keeping+0.017
emo+0.016
 cheek+0.016
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.008
 particularly-0.008
Eventually-0.008
Token nap
Token ID25422
Feature activation+1.81e-02

Pos logit contributions
ro+0.041
uzz+0.040
keeping+0.038
emo+0.036
onna+0.034
Neg logit contributions
 middle-0.037
aw-0.035
 particularly-0.034
 Other-0.033
-0.032
TokenFrom
Token ID4863
Feature activation+1.03e-02

Pos logit contributions
aw+0.006
 Other+0.005
 middle+0.005
 particularly+0.005
+0.005
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.008
Token that
Token ID326
Feature activation+1.85e-02

Pos logit contributions
ro+0.027
uzz+0.024
emo+0.024
keeping+0.023
onna+0.023
Neg logit contributions
aw-0.010
 middle-0.008
 particularly-0.008
 Other-0.008
 course-0.007
Token day
Token ID1110
Feature activation-5.03e-02

Pos logit contributions
ro+0.035
uzz+0.030
keeping+0.030
emo+0.030
onna+0.027
Neg logit contributions
 Other-0.029
 middle-0.029
 particularly-0.029
aw-0.028
 course-0.027
Token forward
Token ID2651
Feature activation+2.23e-02

Pos logit contributions
aw+0.067
 Other+0.058
opl+0.054
 particularly+0.053
 pasture+0.053
Neg logit contributions
ro-0.101
uzz-0.095
 cheek-0.089
 moment-0.086
keeping-0.086
Token the
Token ID262
Feature activation+3.80e-02

Pos logit contributions
ro+0.054
emo+0.049
uzz+0.047
keeping+0.047
 cheek+0.046
Neg logit contributions
aw-0.025
 Other-0.021
 middle-0.019
 particularly-0.018
 course-0.018
Token little
Token ID1310
Feature activation+6.86e-02

Pos logit contributions
ro+0.082
uzz+0.076
keeping+0.075
onna+0.070
emo+0.070
Neg logit contributions
 middle-0.052
 particularly-0.050
aw-0.048
 course-0.047
 best-0.045
Token girl
Token ID2576
Feature activation-3.70e-02

Pos logit contributions
onna+0.150
ro+0.144
ge+0.142
emo+0.137
icated+0.134
Neg logit contributions
 middle-0.078
 best-0.073
 course-0.065
 forward-0.065
aw-0.064
Token was
Token ID373
Feature activation-3.36e-02

Pos logit contributions
aw+0.065
 Other+0.062
 middle+0.059
+0.059
ifying+0.057
Neg logit contributions
ro-0.063
uzz-0.052
keeping-0.051
emo-0.050
onna-0.047
Token never
Token ID1239
Feature activation+8.70e-03

Pos logit contributions
aw+0.054
 Other+0.050
+0.050
ifying+0.047
opl+0.046
Neg logit contributions
ro-0.059
uzz-0.056
emo-0.053
keeping-0.051
 seek-0.049
Token without
Token ID1231
Feature activation+2.21e-02

Pos logit contributions
ro+0.018
uzz+0.016
emo+0.015
keeping+0.015
 cheek+0.014
Neg logit contributions
aw-0.012
 particularly-0.011
 Other-0.011
 middle-0.011
ifying-0.010
Token her
Token ID607
Feature activation+6.26e-03

Pos logit contributions
ro+0.049
emo+0.042
uzz+0.041
onna+0.039
keeping+0.039
Neg logit contributions
aw-0.029
 particularly-0.027
 Other-0.027
 middle-0.025
 course-0.024
Token get
Token ID651
Feature activation-7.42e-03

Pos logit contributions
aw+0.102
 Other+0.092
ement+0.091
opl+0.090
 particularly+0.086
Neg logit contributions
ro-0.051
uzz-0.036
onna-0.036
emo-0.034
 cheek-0.033
Token the
Token ID262
Feature activation-2.65e-02

Pos logit contributions
aw+0.014
 Other+0.013
 particularly+0.012
 middle+0.012
+0.012
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.010
 cheek-0.009
Token big
Token ID1263
Feature activation-1.26e-02

Pos logit contributions
aw+0.049
 Other+0.043
 particularly+0.041
opl+0.041
ifying+0.041
Neg logit contributions
ro-0.044
 cheek-0.037
uzz-0.036
emo-0.035
keeping-0.035
Token pastry
Token ID41738
Feature activation-1.55e-02

Pos logit contributions
aw+0.021
 Other+0.019
 particularly+0.018
 middle+0.018
+0.018
Neg logit contributions
ro-0.023
uzz-0.019
emo-0.019
keeping-0.019
 cheek-0.018
Token?"
Token ID1701
Feature activation+3.41e-03

Pos logit contributions
aw+0.027
 Other+0.025
 particularly+0.024
 middle+0.023
+0.023
Neg logit contributions
ro-0.027
uzz-0.023
emo-0.022
keeping-0.022
 cheek-0.022
Token she
Token ID673
Feature activation+6.85e-02

Pos logit contributions
ro+0.005
uzz+0.005
emo+0.004
keeping+0.004
onna+0.004
Neg logit contributions
aw-0.007
 Other-0.006
 middle-0.006
 particularly-0.006
-0.006
Token asked
Token ID1965
Feature activation+1.19e-02

Pos logit contributions
ro+0.176
keeping+0.166
uzz+0.166
onna+0.154
emo+0.149
Neg logit contributions
 middle-0.057
aw-0.055
 particularly-0.052
 Other-0.046
 sudden-0.044
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
ro+0.024
emo+0.021
keeping+0.021
uzz+0.020
onna+0.020
Neg logit contributions
aw-0.017
 Other-0.016
 middle-0.016
 particularly-0.015
-0.015
Token Her
Token ID2332
Feature activation-6.04e-03

Pos logit contributions
ro+0.029
keeping+0.027
uzz+0.026
emo+0.026
ge+0.025
Neg logit contributions
 Other-0.014
aw-0.014
-0.013
 middle-0.013
 particularly-0.012
Token mom
Token ID1995
Feature activation+1.22e-02

Pos logit contributions
aw+0.012
 Other+0.011
 particularly+0.010
 middle+0.010
+0.010
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.007
Token said
Token ID531
Feature activation-1.05e-02

Pos logit contributions
ro+0.021
keeping+0.018
emo+0.017
uzz+0.017
onna+0.015
Neg logit contributions
aw-0.023
 Other-0.020
 particularly-0.020
 middle-0.020
-0.019
Token said
Token ID531
Feature activation-2.06e-03

Pos logit contributions
ro+0.003
uzz+0.002
keeping+0.002
emo+0.002
 cheek+0.002
Neg logit contributions
aw-0.004
 Other-0.003
 particularly-0.003
 middle-0.003
-0.003
Token it
Token ID340
Feature activation+2.35e-02

Pos logit contributions
aw+0.003
 Other+0.003
 particularly+0.003
 middle+0.003
+0.002
Neg logit contributions
ro-0.004
uzz-0.004
emo-0.004
keeping-0.004
 cheek-0.003
Token was
Token ID373
Feature activation+1.58e-02

Pos logit contributions
ro+0.051
emo+0.046
keeping+0.046
uzz+0.045
onna+0.042
Neg logit contributions
aw-0.031
 particularly-0.028
 Other-0.028
 middle-0.027
-0.025
Token a
Token ID257
Feature activation+2.96e-02

Pos logit contributions
ro+0.036
keeping+0.030
uzz+0.030
emo+0.029
 cheek+0.028
Neg logit contributions
aw-0.021
 particularly-0.020
 middle-0.019
 Other-0.018
ifying-0.017
Token very
Token ID845
Feature activation+3.67e-02

Pos logit contributions
ro+0.060
keeping+0.055
uzz+0.054
emo+0.050
ge+0.049
Neg logit contributions
 particularly-0.043
aw-0.042
 middle-0.039
-0.037
 Other-0.036
Token special
Token ID2041
Feature activation+6.85e-02

Pos logit contributions
ro+0.076
keeping+0.075
uzz+0.067
icated+0.064
ge+0.064
Neg logit contributions
aw-0.050
 particularly-0.049
 middle-0.047
 Other-0.042
 best-0.040
Token shell
Token ID7582
Feature activation+1.15e-02

Pos logit contributions
keeping+0.118
ro+0.116
uzz+0.112
ge+0.109
ried+0.105
Neg logit contributions
 particularly-0.106
 middle-0.105
aw-0.100
 best-0.095
 course-0.092
Token.
Token ID13
Feature activation+3.07e-03

Pos logit contributions
ro+0.023
uzz+0.020
emo+0.020
keeping+0.019
 cheek+0.018
Neg logit contributions
aw-0.017
 particularly-0.015
 Other-0.015
 middle-0.015
-0.014
Token The
Token ID383
Feature activation+5.14e-02

Pos logit contributions
ro+0.007
uzz+0.006
emo+0.006
keeping+0.006
 cheek+0.006
Neg logit contributions
aw-0.004
 Other-0.004
 middle-0.004
 particularly-0.003
-0.003
Token little
Token ID1310
Feature activation+5.48e-02

Pos logit contributions
keeping+0.099
ro+0.095
uzz+0.093
onna+0.091
ge+0.089
Neg logit contributions
 middle-0.089
aw-0.081
 best-0.078
 particularly-0.075
 course-0.066
Token girl
Token ID2576
Feature activation+1.65e-02

Pos logit contributions
keeping+0.109
ro+0.109
uzz+0.109
onna+0.105
emo+0.103
Neg logit contributions
aw-0.075
 middle-0.074
 best-0.065
 particularly-0.059
 course-0.058
Token!
Token ID0
Feature activation+7.60e-03

Pos logit contributions
ro+0.015
uzz+0.013
keeping+0.013
emo+0.013
onna+0.012
Neg logit contributions
aw-0.009
 Other-0.008
 middle-0.007
 particularly-0.007
-0.007
Token A
Token ID317
Feature activation-1.98e-02

Pos logit contributions
ro+0.015
uzz+0.013
keeping+0.013
emo+0.013
 cheek+0.012
Neg logit contributions
aw-0.011
 Other-0.011
 middle-0.010
-0.010
ifying-0.009
Token fire
Token ID2046
Feature activation-1.50e-02

Pos logit contributions
aw+0.034
 Other+0.031
 particularly+0.029
+0.029
 middle+0.029
Neg logit contributions
ro-0.034
emo-0.030
uzz-0.029
 cheek-0.029
keeping-0.028
Tokenwork
Token ID1818
Feature activation-1.03e-02

Pos logit contributions
aw+0.033
 Other+0.032
 particularly+0.031
 middle+0.031
+0.030
Neg logit contributions
ro-0.019
uzz-0.015
emo-0.015
keeping-0.014
 cheek-0.013
Token!"
Token ID2474
Feature activation+3.15e-02

Pos logit contributions
aw+0.019
 Other+0.018
 particularly+0.017
 middle+0.017
+0.017
Neg logit contributions
ro-0.017
uzz-0.014
emo-0.014
keeping-0.013
 cheek-0.013
Token she
Token ID673
Feature activation+6.83e-02

Pos logit contributions
ro+0.066
uzz+0.061
emo+0.059
keeping+0.058
ge+0.058
Neg logit contributions
aw-0.043
 Other-0.041
 middle-0.040
 particularly-0.036
-0.036
Token said
Token ID531
Feature activation+1.81e-02

Pos logit contributions
ro+0.165
keeping+0.153
uzz+0.150
ge+0.147
onna+0.137
Neg logit contributions
aw-0.077
 Other-0.056
 middle-0.056
 particularly-0.055
 brothers-0.053
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
ro+0.040
keeping+0.035
emo+0.035
uzz+0.034
icated+0.033
Neg logit contributions
aw-0.024
 Other-0.021
 middle-0.020
-0.020
 particularly-0.019
Token Tim
Token ID5045
Feature activation+4.04e-02

Pos logit contributions
ro+0.036
uzz+0.033
keeping+0.033
emo+0.032
ge+0.032
Neg logit contributions
 Other-0.023
aw-0.023
-0.023
 middle-0.020
 particularly-0.020
Token looked
Token ID3114
Feature activation-3.76e-03

Pos logit contributions
ro+0.082
keeping+0.081
uzz+0.075
emo+0.069
onna+0.069
Neg logit contributions
aw-0.060
 particularly-0.048
 Other-0.048
 middle-0.044
 continued-0.043
Token up
Token ID510
Feature activation-4.59e-03

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.005
 particularly+0.005
+0.004
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.007
keeping-0.007
 cheek-0.006
Token Kate
Token ID16693
Feature activation-8.59e-03

Pos logit contributions
ro+0.087
 cheek+0.076
keeping+0.075
ge+0.073
uzz+0.072
Neg logit contributions
 Other-0.069
-0.060
aw-0.057
 middle-0.055
 particularly-0.055
Token and
Token ID290
Feature activation+4.30e-03

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.014
 particularly+0.014
+0.013
Neg logit contributions
ro-0.015
uzz-0.013
emo-0.012
keeping-0.012
 cheek-0.011
Token the
Token ID262
Feature activation+7.46e-03

Pos logit contributions
ro+0.007
uzz+0.006
keeping+0.006
emo+0.006
 cheek+0.005
Neg logit contributions
aw-0.008
 Other-0.008
 particularly-0.007
 middle-0.007
-0.007
Token other
Token ID584
Feature activation-1.65e-03

Pos logit contributions
ro+0.016
uzz+0.015
keeping+0.014
emo+0.014
onna+0.013
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.009
 particularly-0.008
-0.008
Token was
Token ID373
Feature activation-4.31e-03

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.003
 particularly+0.003
+0.002
Neg logit contributions
ro-0.003
uzz-0.003
emo-0.002
keeping-0.002
 cheek-0.002
Token named
Token ID3706
Feature activation+6.81e-02

Pos logit contributions
aw+0.006
 Other+0.005
 middle+0.005
 particularly+0.005
+0.005
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.008
Token Ben
Token ID3932
Feature activation+1.09e-02

Pos logit contributions
ro+0.117
 cheek+0.107
keeping+0.106
ge+0.101
emo+0.099
Neg logit contributions
 Other-0.102
-0.090
 Everyone-0.085
 Everybody-0.084
 All-0.081
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
ro+0.023
emo+0.019
keeping+0.019
uzz+0.019
onna+0.018
Neg logit contributions
aw-0.016
 Other-0.014
 particularly-0.014
 middle-0.014
-0.013
Token They
Token ID1119
Feature activation-7.34e-03

Pos logit contributions
ro+0.023
keeping+0.020
uzz+0.020
emo+0.019
 cheek+0.019
Neg logit contributions
aw-0.018
 Other-0.018
 middle-0.016
 particularly-0.016
-0.016
Token were
Token ID547
Feature activation-1.91e-02

Pos logit contributions
aw+0.011
 Other+0.010
 middle+0.010
 particularly+0.010
+0.010
Neg logit contributions
ro-0.014
uzz-0.013
emo-0.012
keeping-0.012
 cheek-0.011
Token playing
Token ID2712
Feature activation-1.93e-02

Pos logit contributions
aw+0.030
 Other+0.027
+0.026
 middle+0.026
ifying+0.025
Neg logit contributions
ro-0.035
uzz-0.033
emo-0.032
keeping-0.030
 cheek-0.028
Token are
Token ID389
Feature activation+6.69e-03

Pos logit contributions
aw+0.002
 Other+0.001
 particularly+0.001
 middle+0.001
+0.001
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.002
Token special
Token ID2041
Feature activation+4.25e-04

Pos logit contributions
ro+0.013
uzz+0.011
keeping+0.011
emo+0.011
 cheek+0.010
Neg logit contributions
aw-0.010
 Other-0.009
 particularly-0.009
 middle-0.009
-0.008
Token."
Token ID526
Feature activation+3.41e-02

Pos logit contributions
ro+0.001
uzz+0.001
emo+0.001
keeping+0.001
 cheek+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 particularly-0.001
 middle-0.001
-0.001
Token
Token ID198
Feature activation+1.75e-02

Pos logit contributions
ro+0.081
ge+0.073
keeping+0.072
uzz+0.071
 common+0.071
Neg logit contributions
 Other-0.037
aw-0.034
 Eventually-0.031
-0.031
 middle-0.030
Token
Token ID198
Feature activation+2.41e-02

Pos logit contributions
ro+0.039
keeping+0.034
uzz+0.034
emo+0.032
 cheek+0.032
Neg logit contributions
aw-0.022
 Other-0.021
 middle-0.020
 particularly-0.020
-0.018
TokenB
Token ID33
Feature activation+6.81e-02

Pos logit contributions
ro+0.044
keeping+0.042
uzz+0.039
emo+0.038
 cheek+0.037
Neg logit contributions
aw-0.039
 particularly-0.037
 Other-0.037
 middle-0.036
Eventually-0.035
Tokenenny
Token ID11870
Feature activation+2.16e-02

Pos logit contributions
 cheek+0.063
ro+0.059
keeping+0.059
 seek+0.057
 common+0.056
Neg logit contributions
aw-0.150
 Other-0.145
-0.136
ifying-0.132
 Eventually-0.132
Token promised
Token ID8072
Feature activation+1.25e-02

Pos logit contributions
ro+0.041
uzz+0.037
keeping+0.037
emo+0.035
onna+0.033
Neg logit contributions
aw-0.036
 Other-0.031
 particularly-0.030
 middle-0.029
-0.028
Token to
Token ID284
Feature activation+1.58e-02

Pos logit contributions
ro+0.022
uzz+0.019
emo+0.019
keeping+0.018
icated+0.017
Neg logit contributions
aw-0.021
 Other-0.020
 particularly-0.019
 middle-0.019
-0.018
Token show
Token ID905
Feature activation+2.07e-02

Pos logit contributions
ro+0.028
uzz+0.024
keeping+0.024
emo+0.023
ge+0.022
Neg logit contributions
aw-0.025
 Other-0.024
 middle-0.024
 particularly-0.024
 course-0.023
Token Ell
Token ID7122
Feature activation+2.70e-02

Pos logit contributions
ro+0.040
emo+0.032
uzz+0.032
keeping+0.032
onna+0.031
Neg logit contributions
 Other-0.032
aw-0.032
 middle-0.030
 particularly-0.029
 course-0.028
Token Joy
Token ID14087
Feature activation-3.63e-03

Pos logit contributions
ro+0.066
 cheek+0.059
keeping+0.057
ge+0.055
emo+0.053
Neg logit contributions
 Other-0.060
-0.054
aw-0.052
 middle-0.051
 particularly-0.049
Token and
Token ID290
Feature activation-8.33e-03

Pos logit contributions
aw+0.006
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.005
 cheek-0.005
Token the
Token ID262
Feature activation+9.30e-03

Pos logit contributions
aw+0.017
 Other+0.015
 middle+0.015
 particularly+0.015
+0.015
Neg logit contributions
ro-0.013
uzz-0.011
emo-0.010
keeping-0.010
 cheek-0.009
Token other
Token ID584
Feature activation-6.30e-03

Pos logit contributions
ro+0.021
uzz+0.018
keeping+0.018
emo+0.018
ge+0.017
Neg logit contributions
aw-0.012
 Other-0.011
 middle-0.011
 particularly-0.010
-0.009
Token was
Token ID373
Feature activation-3.66e-03

Pos logit contributions
aw+0.011
 Other+0.010
 particularly+0.010
 middle+0.010
+0.010
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token named
Token ID3706
Feature activation+6.80e-02

Pos logit contributions
aw+0.005
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.007
keeping-0.007
 cheek-0.007
Token Jan
Token ID2365
Feature activation-1.19e-02

Pos logit contributions
ro+0.117
 cheek+0.107
keeping+0.104
ge+0.103
uzz+0.099
Neg logit contributions
 Other-0.101
-0.098
 Everyone-0.084
 Everybody-0.083
 particularly-0.082
Token.
Token ID13
Feature activation-1.40e-03

Pos logit contributions
aw+0.022
 Other+0.020
 middle+0.020
 particularly+0.019
+0.019
Neg logit contributions
ro-0.020
uzz-0.017
emo-0.016
keeping-0.016
 cheek-0.015
Token Jan
Token ID2365
Feature activation+3.78e-04

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.003
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.002
Token was
Token ID373
Feature activation-9.54e-03

Pos logit contributions
ro+0.001
uzz+0.001
emo+0.001
keeping+0.001
 cheek+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 particularly-0.001
 middle-0.001
-0.001
Token very
Token ID845
Feature activation+2.57e-02

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.013
+0.013
 particularly+0.013
Neg logit contributions
ro-0.018
uzz-0.016
emo-0.016
keeping-0.015
 cheek-0.014
Token "
Token ID366
Feature activation+1.13e-02

Pos logit contributions
ro+0.039
emo+0.037
uzz+0.037
keeping+0.035
ge+0.035
Neg logit contributions
 Other-0.016
aw-0.016
 middle-0.015
-0.013
 particularly-0.012
TokenWhat
Token ID2061
Feature activation+1.14e-02

Pos logit contributions
ro+0.024
uzz+0.021
emo+0.021
onna+0.020
keeping+0.020
Neg logit contributions
aw-0.017
 middle-0.015
 Other-0.015
 particularly-0.014
-0.013
Token's
Token ID338
Feature activation+2.28e-02

Pos logit contributions
ro+0.025
uzz+0.022
keeping+0.022
emo+0.021
 cheek+0.020
Neg logit contributions
aw-0.016
 Other-0.014
 middle-0.013
 particularly-0.013
-0.012
Token this
Token ID428
Feature activation-1.11e-03

Pos logit contributions
ro+0.057
uzz+0.048
emo+0.048
keeping+0.047
onna+0.046
Neg logit contributions
aw-0.028
 middle-0.024
 Other-0.021
 particularly-0.021
 course-0.020
Token?"
Token ID1701
Feature activation+1.45e-02

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.001
Token she
Token ID673
Feature activation+6.80e-02

Pos logit contributions
ro+0.030
uzz+0.027
emo+0.027
keeping+0.026
onna+0.025
Neg logit contributions
aw-0.021
 middle-0.020
 Other-0.019
 particularly-0.018
-0.017
Token said
Token ID531
Feature activation+2.53e-02

Pos logit contributions
ro+0.170
keeping+0.163
uzz+0.151
onna+0.150
and+0.143
Neg logit contributions
aw-0.067
 particularly-0.060
 middle-0.060
 named-0.049
 Other-0.047
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
ro+0.058
keeping+0.052
emo+0.052
uzz+0.050
icated+0.049
Neg logit contributions
aw-0.030
 Other-0.027
 middle-0.027
 particularly-0.025
-0.025
Token The
Token ID383
Feature activation+1.17e-02

Pos logit contributions
ro+0.035
uzz+0.032
emo+0.032
keeping+0.032
ge+0.031
Neg logit contributions
aw-0.019
 Other-0.019
-0.017
 middle-0.017
 particularly-0.016
Token girl
Token ID2576
Feature activation+2.75e-02

Pos logit contributions
ro+0.023
uzz+0.020
emo+0.019
onna+0.019
keeping+0.019
Neg logit contributions
aw-0.019
 middle-0.018
 Other-0.017
 particularly-0.016
-0.016
Token picked
Token ID6497
Feature activation+1.48e-02

Pos logit contributions
ro+0.054
keeping+0.051
emo+0.046
uzz+0.044
icated+0.044
Neg logit contributions
aw-0.044
 particularly-0.039
 Other-0.037
 middle-0.037
ement-0.033
Token their
Token ID511
Feature activation+5.30e-02

Pos logit contributions
ro+0.045
uzz+0.038
emo+0.038
onna+0.038
keeping+0.037
Neg logit contributions
aw-0.032
 middle-0.029
 Other-0.029
 particularly-0.026
 course-0.026
Token piles
Token ID30935
Feature activation+3.05e-02

Pos logit contributions
ro+0.105
uzz+0.099
onna+0.090
nes+0.089
keeping+0.088
Neg logit contributions
 middle-0.079
 best-0.070
aw-0.069
 course-0.067
 particularly-0.064
Token together
Token ID1978
Feature activation+2.16e-02

Pos logit contributions
ro+0.062
emo+0.056
uzz+0.054
keeping+0.054
 cheek+0.052
Neg logit contributions
aw-0.044
 middle-0.038
 Other-0.038
 particularly-0.036
 forward-0.036
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
ro+0.044
emo+0.041
uzz+0.040
keeping+0.038
 cheek+0.037
Neg logit contributions
aw-0.030
 middle-0.027
 Other-0.026
 particularly-0.026
-0.025
Token share
Token ID2648
Feature activation+3.27e-02

Pos logit contributions
ro+0.028
uzz+0.025
emo+0.025
keeping+0.023
 cheek+0.022
Neg logit contributions
aw-0.029
 middle-0.028
 Other-0.027
 particularly-0.027
-0.025
Token their
Token ID511
Feature activation+6.80e-02

Pos logit contributions
ro+0.071
keeping+0.061
uzz+0.061
emo+0.059
onna+0.059
Neg logit contributions
aw-0.047
 middle-0.038
 Other-0.036
 particularly-0.033
 course-0.033
Token finds
Token ID7228
Feature activation+1.57e-02

Pos logit contributions
ro+0.134
uzz+0.126
ge+0.123
nes+0.123
 outage+0.121
Neg logit contributions
 middle-0.097
 course-0.092
 best-0.084
aw-0.080
 Other-0.072
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
ro+0.033
uzz+0.029
emo+0.028
keeping+0.028
onna+0.027
Neg logit contributions
aw-0.024
 Other-0.020
 middle-0.020
 particularly-0.018
 course-0.017
Token They
Token ID1119
Feature activation+1.10e-02

Pos logit contributions
ro+0.032
keeping+0.029
uzz+0.028
 cheek+0.028
ge+0.028
Neg logit contributions
aw-0.020
 Other-0.019
 middle-0.017
-0.017
 Eventually-0.016
Token had
Token ID550
Feature activation+7.89e-03

Pos logit contributions
ro+0.022
emo+0.019
uzz+0.019
keeping+0.019
 cheek+0.018
Neg logit contributions
aw-0.016
 particularly-0.015
 middle-0.015
 Other-0.014
-0.014
Token more
Token ID517
Feature activation+2.53e-02

Pos logit contributions
ro+0.016
uzz+0.014
emo+0.013
keeping+0.013
 cheek+0.013
Neg logit contributions
aw-0.013
 Other-0.011
 particularly-0.011
 middle-0.011
-0.010
Token pretty
Token ID2495
Feature activation+1.86e-02

Pos logit contributions
ro+0.031
uzz+0.027
emo+0.027
keeping+0.026
onna+0.024
Neg logit contributions
aw-0.023
 Other-0.023
 particularly-0.022
 middle-0.022
-0.021
Token kitten
Token ID42143
Feature activation-1.90e-02

Pos logit contributions
ro+0.031
uzz+0.028
emo+0.027
keeping+0.027
onna+0.025
Neg logit contributions
aw-0.032
 Other-0.030
 particularly-0.030
 middle-0.029
-0.028
Token.
Token ID13
Feature activation-2.26e-02

Pos logit contributions
aw+0.033
 Other+0.030
 middle+0.029
 particularly+0.028
+0.028
Neg logit contributions
ro-0.033
uzz-0.028
emo-0.027
keeping-0.027
 cheek-0.026
Token From
Token ID3574
Feature activation-1.44e-02

Pos logit contributions
aw+0.039
 particularly+0.035
 middle+0.034
 Other+0.034
ifying+0.033
Neg logit contributions
ro-0.040
uzz-0.035
emo-0.034
onna-0.032
keeping-0.032
Token that
Token ID326
Feature activation-6.39e-03

Pos logit contributions
aw+0.018
 Other+0.018
 particularly+0.016
+0.016
 middle+0.016
Neg logit contributions
ro-0.031
uzz-0.028
emo-0.027
keeping-0.026
 cheek-0.026
Token day
Token ID1110
Feature activation-9.05e-02

Pos logit contributions
aw+0.013
 Other+0.012
 particularly+0.012
 middle+0.012
+0.012
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.007
keeping-0.007
 cheek-0.007
Token on
Token ID319
Feature activation-3.69e-02

Pos logit contributions
aw+0.181
ement+0.180
pieces+0.171
 Other+0.168
 pasture+0.167
Neg logit contributions
ro-0.088
 cheek-0.078
 moment-0.078
 from-0.071
emo-0.071
Token,
Token ID11
Feature activation-3.25e-02

Pos logit contributions
aw+0.081
ifying+0.071
 Other+0.069
bridge+0.067
pieces+0.067
Neg logit contributions
ro-0.053
emo-0.040
uzz-0.038
 moment-0.038
onna-0.038
Token Lily
Token ID20037
Feature activation-1.20e-02

Pos logit contributions
aw+0.062
ement+0.055
ifying+0.054
 middle+0.053
 Other+0.052
Neg logit contributions
ro-0.054
emo-0.046
uzz-0.041
onna-0.041
 L-0.040
Token made
Token ID925
Feature activation-3.45e-03

Pos logit contributions
aw+0.018
 Other+0.017
 middle+0.016
+0.016
 particularly+0.016
Neg logit contributions
ro-0.024
emo-0.020
uzz-0.020
keeping-0.019
 cheek-0.019
Token sure
Token ID1654
Feature activation+8.86e-03

Pos logit contributions
aw+0.007
 Other+0.006
 particularly+0.006
 middle+0.006
+0.006
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.004
 cheek-0.004
Token hard
Token ID1327
Feature activation+1.16e-02

Pos logit contributions
ro+0.021
uzz+0.018
emo+0.018
keeping+0.017
onna+0.016
Neg logit contributions
aw-0.020
 Other-0.019
 middle-0.019
 particularly-0.019
-0.018
Token work
Token ID670
Feature activation+3.02e-02

Pos logit contributions
ro+0.013
uzz+0.011
emo+0.010
keeping+0.009
 cheek+0.009
Neg logit contributions
aw-0.027
 Other-0.026
 particularly-0.025
 middle-0.025
-0.024
Token.
Token ID13
Feature activation-3.80e-02

Pos logit contributions
ro+0.061
emo+0.056
uzz+0.056
 cheek+0.051
keeping+0.050
Neg logit contributions
aw-0.043
 particularly-0.036
 Other-0.035
 middle-0.035
opl-0.034
Token From
Token ID3574
Feature activation-2.57e-03

Pos logit contributions
aw+0.065
 particularly+0.064
ifying+0.064
 middle+0.057
 Other+0.056
Neg logit contributions
ro-0.062
uzz-0.055
emo-0.054
keeping-0.052
 cheek-0.047
Token that
Token ID326
Feature activation+7.18e-03

Pos logit contributions
aw+0.003
 Other+0.003
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.005
 cheek-0.005
Token day
Token ID1110
Feature activation-9.02e-02

Pos logit contributions
ro+0.012
uzz+0.011
keeping+0.010
emo+0.010
onna+0.009
Neg logit contributions
aw-0.013
 Other-0.012
 middle-0.012
 particularly-0.012
-0.011
Token on
Token ID319
Feature activation-3.77e-02

Pos logit contributions
aw+0.161
 particularly+0.158
pieces+0.156
 pasture+0.156
 Other+0.153
Neg logit contributions
ro-0.106
uzz-0.102
 moment-0.093
 seek-0.092
oys-0.092
Token,
Token ID11
Feature activation-5.32e-02

Pos logit contributions
aw+0.078
 particularly+0.072
 Other+0.071
ifying+0.071
pieces+0.069
Neg logit contributions
ro-0.053
emo-0.044
uzz-0.041
 common-0.041
 seek-0.040
Token Jerry
Token ID13075
Feature activation-1.31e-02

Pos logit contributions
ifying+0.112
aw+0.109
ridge+0.105
 particularly+0.104
 pasture+0.102
Neg logit contributions
ro-0.059
emo-0.055
 seek-0.049
 moment-0.048
 common-0.045
Token became
Token ID2627
Feature activation-3.26e-02

Pos logit contributions
aw+0.022
 Other+0.020
 middle+0.019
 particularly+0.019
+0.019
Neg logit contributions
ro-0.024
uzz-0.021
emo-0.020
keeping-0.020
onna-0.019
Token known
Token ID1900
Feature activation+5.48e-03

Pos logit contributions
aw+0.060
+0.056
ifying+0.055
 Other+0.055
opl+0.054
Neg logit contributions
ro-0.051
emo-0.047
uzz-0.045
 seek-0.041
keeping-0.039
Token helped
Token ID4193
Feature activation-1.94e-02

Pos logit contributions
aw+0.010
 Other+0.010
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.012
uzz-0.011
emo-0.010
keeping-0.010
 cheek-0.010
Token him
Token ID683
Feature activation+8.69e-04

Pos logit contributions
aw+0.036
 Other+0.032
 middle+0.031
 particularly+0.030
+0.030
Neg logit contributions
ro-0.033
uzz-0.028
keeping-0.028
emo-0.028
 cheek-0.025
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
ro+0.001
uzz+0.001
emo+0.001
keeping+0.001
 cheek+0.001
Neg logit contributions
aw-0.002
 Other-0.002
 particularly-0.001
 middle-0.001
-0.001
Token From
Token ID3574
Feature activation-1.21e-02

Pos logit contributions
aw+0.017
 Other+0.016
 particularly+0.015
 middle+0.015
+0.014
Neg logit contributions
ro-0.022
uzz-0.019
emo-0.019
keeping-0.018
onna-0.017
Token that
Token ID326
Feature activation-1.35e-04

Pos logit contributions
aw+0.015
 Other+0.014
 particularly+0.013
 middle+0.013
+0.013
Neg logit contributions
ro-0.027
uzz-0.024
emo-0.023
keeping-0.023
 cheek-0.022
Token day
Token ID1110
Feature activation-9.00e-02

Pos logit contributions
aw+0.000
 Other+0.000
 particularly+0.000
 middle+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
emo-0.000
keeping-0.000
 cheek-0.000
Token on
Token ID319
Feature activation-5.07e-02

Pos logit contributions
aw+0.158
 pasture+0.156
 Other+0.146
pieces+0.140
chairs+0.139
Neg logit contributions
ro-0.110
 moment-0.103
emo-0.102
 itself-0.096
 don-0.095
Token,
Token ID11
Feature activation-3.68e-02

Pos logit contributions
aw+0.112
 Other+0.098
ifying+0.094
pieces+0.094
 particularly+0.092
Neg logit contributions
ro-0.065
emo-0.055
 common-0.054
 moment-0.052
 seek-0.051
Token Billy
Token ID15890
Feature activation-1.74e-03

Pos logit contributions
aw+0.081
 Other+0.072
 middle+0.072
ifying+0.071
 particularly+0.070
Neg logit contributions
ro-0.045
emo-0.042
keeping-0.035
uzz-0.034
 seek-0.033
Token and
Token ID290
Feature activation+2.00e-03

Pos logit contributions
aw+0.003
 Other+0.002
 particularly+0.002
 middle+0.002
+0.002
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token Harry
Token ID5850
Feature activation-1.67e-02

Pos logit contributions
ro+0.004
uzz+0.003
emo+0.003
keeping+0.003
 cheek+0.003
Neg logit contributions
aw-0.003
 Other-0.003
 particularly-0.003
 middle-0.003
-0.003
Token.
Token ID13
Feature activation-3.72e-02

Feature not active on prev token; no predictions were affected.
Token
Token ID198
Feature activation-2.89e-02

Pos logit contributions
aw+0.070
 particularly+0.065
ifying+0.063
 middle+0.061
 Other+0.060
Neg logit contributions
ro-0.056
emo-0.052
uzz-0.050
keeping-0.047
 itself-0.044
Token
Token ID198
Feature activation-5.43e-02

Pos logit contributions
aw+0.053
 Other+0.048
ifying+0.047
 particularly+0.047
 middle+0.046
Neg logit contributions
ro-0.046
emo-0.042
uzz-0.041
keeping-0.038
onna-0.036
TokenFrom
Token ID4863
Feature activation-7.51e-03

Pos logit contributions
ement+0.094
aw+0.094
 Other+0.089
ifying+0.087
+0.081
Neg logit contributions
ro-0.080
emo-0.079
 itself-0.066
uzz-0.065
ge-0.063
Token that
Token ID326
Feature activation-1.41e-02

Pos logit contributions
aw+0.009
 Other+0.008
 particularly+0.008
 middle+0.008
+0.007
Neg logit contributions
ro-0.017
uzz-0.015
emo-0.015
keeping-0.015
 cheek-0.014
Token day
Token ID1110
Feature activation-8.78e-02

Pos logit contributions
aw+0.032
 Other+0.029
ifying+0.028
 middle+0.028
 particularly+0.028
Neg logit contributions
ro-0.018
emo-0.014
uzz-0.014
keeping-0.013
 cheek-0.013
Token on
Token ID319
Feature activation-5.59e-02

Pos logit contributions
aw+0.151
pieces+0.143
ement+0.139
 particularly+0.137
 Other+0.136
Neg logit contributions
ro-0.116
uzz-0.104
emo-0.101
 moment-0.100
 itself-0.098
Token,
Token ID11
Feature activation-4.00e-02

Pos logit contributions
aw+0.135
ifying+0.123
pieces+0.123
bridge+0.119
 Other+0.116
Neg logit contributions
ro-0.054
emo-0.047
 moment-0.042
 itself-0.041
 common-0.039
Token Nut
Token ID11959
Feature activation-1.34e-03

Pos logit contributions
aw+0.080
ifying+0.072
ement+0.071
 middle+0.070
 particularly+0.069
Neg logit contributions
ro-0.055
emo-0.052
 itself-0.043
uzz-0.042
 moment-0.041
Tokenty
Token ID774
Feature activation-2.78e-02

Pos logit contributions
aw+0.002
 Other+0.002
 particularly+0.002
 middle+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.002
Token made
Token ID925
Feature activation-1.45e-02

Pos logit contributions
aw+0.045
 Other+0.042
ifying+0.041
 middle+0.041
+0.040
Neg logit contributions
ro-0.052
emo-0.043
keeping-0.042
uzz-0.042
 cheek-0.039
Token motorcycles
Token ID39404
Feature activation-2.96e-02

Pos logit contributions
ro+0.019
uzz+0.016
emo+0.015
keeping+0.015
 cheek+0.014
Neg logit contributions
aw-0.018
 Other-0.017
 particularly-0.017
 middle-0.016
-0.015
Token.
Token ID13
Feature activation-5.40e-03

Pos logit contributions
aw+0.056
 Other+0.054
 middle+0.052
+0.051
ifying+0.051
Neg logit contributions
ro-0.045
uzz-0.038
keeping-0.035
emo-0.034
 cheek-0.032
Token He
Token ID679
Feature activation-3.50e-02

Pos logit contributions
aw+0.007
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.010
 cheek-0.010
Token had
Token ID550
Feature activation-5.36e-02

Pos logit contributions
aw+0.066
+0.062
 Other+0.061
ifying+0.060
 middle+0.059
Neg logit contributions
ro-0.056
emo-0.046
uzz-0.044
ge-0.039
 seek-0.039
Token a
Token ID257
Feature activation-6.28e-02

Pos logit contributions
+0.111
ement+0.103
aw+0.101
ishly+0.101
 Other+0.096
Neg logit contributions
ro-0.069
emo-0.057
uzz-0.057
keeping-0.054
icated-0.052
Token toy
Token ID13373
Feature activation-8.70e-02

Pos logit contributions
aw+0.145
+0.123
 Other+0.123
agged+0.122
pped+0.121
Neg logit contributions
ro-0.071
 common-0.065
 moment-0.062
emo-0.059
 cheek-0.059
Token motorcycle
Token ID18757
Feature activation-4.56e-02

Pos logit contributions
aw+0.171
Eventually+0.169
Hours+0.161
 Other+0.160
 Eventually+0.158
Neg logit contributions
ro-0.118
 cheek-0.089
 ante-0.084
guard-0.082
emo-0.080
Token that
Token ID326
Feature activation-3.09e-03

Pos logit contributions
 Other+0.075
aw+0.074
+0.069
Eventually+0.069
 middle+0.068
Neg logit contributions
ro-0.083
uzz-0.065
 cheek-0.060
keeping-0.060
emo-0.060
Token he
Token ID339
Feature activation-2.15e-02

Pos logit contributions
aw+0.005
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.005
 cheek-0.005
Token would
Token ID561
Feature activation-2.83e-02

Pos logit contributions
aw+0.037
 Other+0.034
 middle+0.033
+0.032
opl+0.031
Neg logit contributions
ro-0.039
emo-0.033
uzz-0.033
keeping-0.031
 cheek-0.029
Token ride
Token ID6594
Feature activation-6.24e-03

Pos logit contributions
aw+0.059
 Other+0.053
 middle+0.050
+0.050
ement+0.049
Neg logit contributions
ro-0.043
uzz-0.034
emo-0.033
 cheek-0.032
onna-0.032
Token.
Token ID13
Feature activation+7.92e-03

Pos logit contributions
ro+0.022
uzz+0.019
emo+0.019
keeping+0.018
onna+0.017
Neg logit contributions
aw-0.017
 Other-0.016
 middle-0.015
 particularly-0.015
-0.014
Token
Token ID198
Feature activation+2.28e-02

Pos logit contributions
ro+0.017
uzz+0.015
keeping+0.015
emo+0.015
 cheek+0.015
Neg logit contributions
aw-0.010
 Other-0.010
 middle-0.009
 particularly-0.008
-0.008
Token
Token ID198
Feature activation-2.50e-03

Pos logit contributions
ro+0.051
keeping+0.044
uzz+0.044
 cheek+0.043
 textures+0.042
Neg logit contributions
aw-0.029
 Other-0.027
 middle-0.026
 particularly-0.024
Eventually-0.023
TokenFrom
Token ID4863
Feature activation-1.23e-03

Pos logit contributions
aw+0.004
 Other+0.003
 middle+0.003
 particularly+0.003
+0.003
Neg logit contributions
ro-0.005
uzz-0.005
emo-0.004
keeping-0.004
 cheek-0.004
Token that
Token ID326
Feature activation-7.77e-03

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.003
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.002
Token day
Token ID1110
Feature activation-8.67e-02

Pos logit contributions
aw+0.017
 Other+0.015
 particularly+0.015
 middle+0.015
+0.014
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.008
 cheek-0.008
Token on
Token ID319
Feature activation-2.31e-02

Pos logit contributions
aw+0.160
pieces+0.148
 pasture+0.143
 Other+0.142
ement+0.141
Neg logit contributions
ro-0.115
uzz-0.099
emo-0.097
 moment-0.095
 cheek-0.093
Token,
Token ID11
Feature activation-2.20e-02

Pos logit contributions
aw+0.043
 Other+0.038
 particularly+0.037
ifying+0.037
 middle+0.036
Neg logit contributions
ro-0.040
emo-0.034
uzz-0.033
onna-0.030
keeping-0.030
Token Jolly
Token ID40276
Feature activation-1.17e-02

Pos logit contributions
aw+0.044
 Other+0.041
 particularly+0.040
 middle+0.039
+0.039
Neg logit contributions
ro-0.032
emo-0.027
uzz-0.026
keeping-0.023
onna-0.023
Token and
Token ID290
Feature activation+6.88e-03

Pos logit contributions
aw+0.021
 Other+0.019
 middle+0.019
 particularly+0.019
+0.018
Neg logit contributions
ro-0.020
emo-0.017
uzz-0.017
keeping-0.016
 cheek-0.015
Token his
Token ID465
Feature activation-3.22e-02

Pos logit contributions
ro+0.013
uzz+0.011
keeping+0.011
emo+0.011
 cheek+0.011
Neg logit contributions
aw-0.011
 Other-0.010
 middle-0.010
 particularly-0.010
-0.009
Token helicopters
Token ID23890
Feature activation-2.91e-02

Pos logit contributions
ro+0.010
uzz+0.008
emo+0.008
keeping+0.008
 cheek+0.007
Neg logit contributions
aw-0.010
 Other-0.010
 particularly-0.009
 middle-0.009
-0.009
Token.
Token ID13
Feature activation+7.15e-03

Pos logit contributions
aw+0.056
 Other+0.054
+0.052
ifying+0.052
Eventually+0.050
Neg logit contributions
ro-0.045
uzz-0.036
emo-0.034
keeping-0.033
 cheek-0.032
Token He
Token ID679
Feature activation-2.30e-02

Pos logit contributions
ro+0.016
keeping+0.015
uzz+0.015
emo+0.014
 cheek+0.014
Neg logit contributions
aw-0.008
 Other-0.008
 middle-0.007
 particularly-0.007
-0.007
Token had
Token ID550
Feature activation-3.97e-02

Pos logit contributions
aw+0.039
 Other+0.036
+0.035
ifying+0.035
 middle+0.034
Neg logit contributions
ro-0.042
emo-0.036
uzz-0.035
keeping-0.032
onna-0.032
Token a
Token ID257
Feature activation-5.65e-02

Pos logit contributions
+0.079
ement+0.077
aw+0.076
 Other+0.071
ishly+0.071
Neg logit contributions
ro-0.054
emo-0.046
uzz-0.045
keeping-0.044
 textures-0.041
Token toy
Token ID13373
Feature activation-8.67e-02

Pos logit contributions
aw+0.128
ement+0.109
 Other+0.109
ifying+0.106
opl+0.106
Neg logit contributions
ro-0.068
 common-0.060
emo-0.060
 cheek-0.059
 moment-0.056
Token helicopter
Token ID15392
Feature activation-3.67e-02

Pos logit contributions
aw+0.135
Eventually+0.128
oken+0.121
 Eventually+0.120
 Other+0.119
Neg logit contributions
ro-0.155
 cheek-0.134
guard-0.123
uzz-0.122
emo-0.119
Token that
Token ID326
Feature activation+5.88e-03

Pos logit contributions
 Other+0.065
aw+0.063
+0.059
Eventually+0.059
 middle+0.057
Neg logit contributions
ro-0.066
uzz-0.052
keeping-0.049
emo-0.049
 cheek-0.048
Token he
Token ID339
Feature activation-8.61e-03

Pos logit contributions
ro+0.012
uzz+0.011
keeping+0.011
emo+0.010
 cheek+0.010
Neg logit contributions
aw-0.008
 Other-0.008
 particularly-0.007
 middle-0.007
-0.007
Token played
Token ID2826
Feature activation-4.48e-04

Pos logit contributions
aw+0.013
 Other+0.012
 middle+0.012
 particularly+0.011
+0.011
Neg logit contributions
ro-0.017
uzz-0.015
emo-0.015
keeping-0.014
 cheek-0.013
Token with
Token ID351
Feature activation-2.26e-02

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.001
emo-0.001
keeping-0.001
 cheek-0.000
Token nut
Token ID6701
Feature activation+2.50e-02

Pos logit contributions
ro+0.040
uzz+0.037
keeping+0.035
emo+0.033
nes+0.032
Neg logit contributions
aw-0.030
 Other-0.028
 middle-0.028
 course-0.026
 particularly-0.026
Token together
Token ID1978
Feature activation-2.07e-03

Pos logit contributions
ro+0.049
uzz+0.041
emo+0.040
keeping+0.039
onna+0.037
Neg logit contributions
aw-0.038
 Other-0.035
 middle-0.035
 particularly-0.033
-0.032
Token.
Token ID13
Feature activation-4.32e-02

Pos logit contributions
aw+0.004
 Other+0.003
 particularly+0.003
 middle+0.003
+0.003
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token From
Token ID3574
Feature activation-1.40e-02

Pos logit contributions
aw+0.077
 particularly+0.070
ifying+0.069
 middle+0.066
ement+0.066
Neg logit contributions
ro-0.068
emo-0.063
uzz-0.060
keeping-0.056
 itself-0.055
Token that
Token ID326
Feature activation+8.93e-04

Pos logit contributions
aw+0.018
 Other+0.017
 particularly+0.016
+0.016
 middle+0.016
Neg logit contributions
ro-0.030
uzz-0.027
emo-0.027
keeping-0.026
 cheek-0.025
Token day
Token ID1110
Feature activation-8.66e-02

Pos logit contributions
ro+0.001
uzz+0.001
keeping+0.001
emo+0.001
 cheek+0.001
Neg logit contributions
aw-0.002
 Other-0.002
 particularly-0.002
 middle-0.002
-0.002
Token on
Token ID319
Feature activation-4.14e-02

Pos logit contributions
aw+0.151
pieces+0.140
 pasture+0.136
ement+0.135
 particularly+0.134
Neg logit contributions
ro-0.115
uzz-0.107
emo-0.102
 moment-0.101
 seek-0.100
Token,
Token ID11
Feature activation-3.89e-02

Pos logit contributions
aw+0.092
ifying+0.081
pieces+0.081
 Other+0.079
 particularly+0.077
Neg logit contributions
ro-0.053
emo-0.046
uzz-0.040
 moment-0.039
 itself-0.039
Token Nut
Token ID11959
Feature activation+2.95e-03

Pos logit contributions
aw+0.078
ifying+0.069
ement+0.068
 middle+0.067
 Other+0.067
Neg logit contributions
ro-0.054
emo-0.053
uzz-0.045
 itself-0.043
 moment-0.041
Tokenty
Token ID774
Feature activation-2.08e-02

Pos logit contributions
ro+0.005
uzz+0.005
keeping+0.004
emo+0.004
 cheek+0.004
Neg logit contributions
aw-0.005
 Other-0.005
 particularly-0.005
 middle-0.004
-0.004
Token knew
Token ID2993
Feature activation-1.12e-02

Pos logit contributions
aw+0.035
 Other+0.032
 middle+0.031
ifying+0.031
 particularly+0.030
Neg logit contributions
ro-0.038
emo-0.032
uzz-0.031
keeping-0.031
onna-0.029
Token good
Token ID922
Feature activation-1.56e-02

Pos logit contributions
aw+0.032
 Other+0.030
+0.029
ifying+0.028
 particularly+0.028
Neg logit contributions
ro-0.037
emo-0.034
uzz-0.034
keeping-0.032
 seek-0.030
Token friends
Token ID2460
Feature activation-2.55e-02

Pos logit contributions
aw+0.022
 Other+0.021
 particularly+0.021
 middle+0.020
+0.020
Neg logit contributions
ro-0.032
uzz-0.028
keeping-0.027
emo-0.027
 cheek-0.025
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
aw+0.049
 Other+0.048
 middle+0.046
 particularly+0.045
+0.044
Neg logit contributions
ro-0.038
uzz-0.033
keeping-0.032
 cheek-0.030
emo-0.029
Token From
Token ID3574
Feature activation-1.07e-02

Pos logit contributions
aw+0.033
 particularly+0.030
 middle+0.029
 Other+0.029
ifying+0.028
Neg logit contributions
ro-0.037
uzz-0.033
emo-0.032
keeping-0.031
onna-0.029
Token that
Token ID326
Feature activation-6.80e-03

Pos logit contributions
aw+0.013
 Other+0.013
 particularly+0.011
+0.011
 middle+0.011
Neg logit contributions
ro-0.024
uzz-0.021
emo-0.021
keeping-0.020
 cheek-0.020
Token day
Token ID1110
Feature activation-8.62e-02

Pos logit contributions
aw+0.014
 Other+0.013
 particularly+0.013
 middle+0.013
+0.013
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.007
 cheek-0.007
Token on
Token ID319
Feature activation-4.20e-02

Pos logit contributions
aw+0.170
ement+0.168
 pasture+0.167
pieces+0.160
ifying+0.159
Neg logit contributions
ro-0.080
 seek-0.077
 itself-0.074
uzz-0.073
emo-0.073
Token,
Token ID11
Feature activation-2.55e-02

Pos logit contributions
aw+0.092
ifying+0.082
 Other+0.077
 particularly+0.077
 middle+0.075
Neg logit contributions
ro-0.056
emo-0.045
 moment-0.044
 seek-0.044
uzz-0.043
Token Max
Token ID5436
Feature activation-7.82e-03

Pos logit contributions
aw+0.057
ifying+0.051
 middle+0.051
 Other+0.050
 particularly+0.050
Neg logit contributions
ro-0.033
emo-0.027
uzz-0.027
 seek-0.023
keeping-0.023
Token always
Token ID1464
Feature activation+2.52e-03

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.011
 particularly+0.010
+0.010
Neg logit contributions
ro-0.015
uzz-0.013
emo-0.013
keeping-0.013
 cheek-0.012
Token replied
Token ID8712
Feature activation-2.24e-02

Pos logit contributions
ro+0.005
uzz+0.005
emo+0.005
keeping+0.005
 cheek+0.004
Neg logit contributions
aw-0.004
 Other-0.003
 particularly-0.003
 middle-0.003
-0.003
Token very
Token ID845
Feature activation+8.49e-03

Pos logit contributions
aw+0.007
 Other+0.007
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.007
Token happy
Token ID3772
Feature activation+4.21e-03

Pos logit contributions
ro+0.016
keeping+0.013
uzz+0.013
emo+0.013
onna+0.012
Neg logit contributions
aw-0.015
 Other-0.013
 particularly-0.013
 middle-0.013
-0.012
Token.
Token ID13
Feature activation-3.85e-02

Pos logit contributions
ro+0.008
uzz+0.007
emo+0.007
keeping+0.006
 cheek+0.006
Neg logit contributions
aw-0.007
 Other-0.006
 particularly-0.006
 middle-0.006
-0.006
Token From
Token ID3574
Feature activation-8.37e-03

Pos logit contributions
 particularly+0.074
aw+0.073
ifying+0.070
 middle+0.066
+0.065
Neg logit contributions
ro-0.056
uzz-0.051
emo-0.044
keeping-0.044
 itself-0.040
Token that
Token ID326
Feature activation-6.53e-03

Pos logit contributions
aw+0.010
 Other+0.009
 particularly+0.009
 middle+0.009
+0.008
Neg logit contributions
ro-0.019
uzz-0.017
emo-0.017
keeping-0.016
 cheek-0.016
Token day
Token ID1110
Feature activation-8.61e-02

Pos logit contributions
aw+0.014
 Other+0.013
 middle+0.012
 particularly+0.012
+0.012
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.007
keeping-0.007
 cheek-0.007
Token on
Token ID319
Feature activation-4.53e-02

Pos logit contributions
aw+0.169
 pasture+0.169
opl+0.166
 particularly+0.166
ns+0.165
Neg logit contributions
 moment-0.073
uzz-0.072
oys-0.069
ro-0.068
 from-0.067
Token,
Token ID11
Feature activation-4.30e-02

Pos logit contributions
aw+0.100
ifying+0.090
 Other+0.090
 particularly+0.089
opl+0.087
Neg logit contributions
ro-0.057
 moment-0.045
 common-0.043
 seek-0.043
uzz-0.042
Token Sue
Token ID26089
Feature activation-1.60e-02

Pos logit contributions
aw+0.110
ifying+0.109
opl+0.104
+0.104
 particularly+0.103
Neg logit contributions
ro-0.034
uzz-0.028
emo-0.023
 moment-0.021
 seek-0.020
Token knew
Token ID2993
Feature activation-1.10e-02

Pos logit contributions
aw+0.026
 Other+0.024
 middle+0.024
+0.023
 particularly+0.023
Neg logit contributions
ro-0.030
uzz-0.025
emo-0.025
keeping-0.024
onna-0.023
Token she
Token ID673
Feature activation-1.29e-02

Pos logit contributions
aw+0.015
 Other+0.013
 middle+0.013
 particularly+0.012
+0.012
Neg logit contributions
ro-0.024
uzz-0.021
emo-0.021
keeping-0.020
onna-0.019
Token helicopters
Token ID23890
Feature activation-3.26e-02

Pos logit contributions
ro+0.011
uzz+0.009
emo+0.009
keeping+0.009
 cheek+0.008
Neg logit contributions
aw-0.012
 Other-0.011
 particularly-0.010
 middle-0.010
-0.010
Token.
Token ID13
Feature activation+4.95e-03

Pos logit contributions
aw+0.066
 Other+0.063
+0.061
Eventually+0.061
ifying+0.060
Neg logit contributions
ro-0.046
uzz-0.037
emo-0.035
 cheek-0.034
keeping-0.034
Token He
Token ID679
Feature activation-2.04e-02

Pos logit contributions
ro+0.011
uzz+0.010
keeping+0.010
emo+0.010
 cheek+0.010
Neg logit contributions
aw-0.006
 Other-0.006
 middle-0.005
 particularly-0.005
-0.005
Token had
Token ID550
Feature activation-3.44e-02

Pos logit contributions
aw+0.035
 Other+0.032
+0.031
ifying+0.031
 middle+0.030
Neg logit contributions
ro-0.037
emo-0.031
uzz-0.031
keeping-0.028
 cheek-0.028
Token a
Token ID257
Feature activation-4.80e-02

Pos logit contributions
+0.068
aw+0.066
ement+0.065
 Other+0.063
opl+0.061
Neg logit contributions
ro-0.047
uzz-0.040
emo-0.040
keeping-0.038
 textures-0.036
Token toy
Token ID13373
Feature activation-8.57e-02

Pos logit contributions
aw+0.105
 Other+0.092
opl+0.089
ement+0.088
 Eventually+0.086
Neg logit contributions
ro-0.063
 cheek-0.054
emo-0.052
 common-0.052
 moment-0.049
Token helicopter
Token ID15392
Feature activation-3.07e-02

Pos logit contributions
aw+0.126
Eventually+0.118
oken+0.114
 Other+0.111
 Eventually+0.111
Neg logit contributions
ro-0.158
 cheek-0.141
uzz-0.127
guard-0.127
emo-0.121
Token that
Token ID326
Feature activation+3.74e-03

Pos logit contributions
 Other+0.051
aw+0.051
+0.046
Eventually+0.046
 middle+0.046
Neg logit contributions
ro-0.058
uzz-0.047
keeping-0.045
emo-0.044
 cheek-0.043
Token he
Token ID339
Feature activation-9.58e-03

Pos logit contributions
ro+0.008
uzz+0.007
keeping+0.007
emo+0.006
 cheek+0.006
Neg logit contributions
aw-0.005
 Other-0.005
 particularly-0.005
 middle-0.005
-0.005
Token played
Token ID2826
Feature activation-3.25e-03

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.013
 particularly+0.013
+0.013
Neg logit contributions
ro-0.019
uzz-0.016
emo-0.016
keeping-0.015
 cheek-0.015
Token with
Token ID351
Feature activation-1.83e-02

Pos logit contributions
aw+0.007
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.005
uzz-0.004
emo-0.004
keeping-0.004
 cheek-0.003
Token gray
Token ID12768
Feature activation+1.65e-02

Pos logit contributions
ro+0.044
emo+0.041
uzz+0.040
keeping+0.038
onna+0.036
Neg logit contributions
 middle-0.035
 Other-0.034
 particularly-0.034
aw-0.032
 course-0.031
Token cloth
Token ID16270
Feature activation+1.78e-02

Pos logit contributions
ro+0.027
emo+0.025
uzz+0.025
keeping+0.024
onna+0.022
Neg logit contributions
aw-0.028
 Other-0.027
 middle-0.027
 particularly-0.027
-0.026
Token.
Token ID13
Feature activation-1.87e-02

Pos logit contributions
ro+0.033
emo+0.029
uzz+0.029
keeping+0.028
 cheek+0.026
Neg logit contributions
aw-0.028
 particularly-0.025
 Other-0.025
 middle-0.025
-0.024
Token From
Token ID3574
Feature activation-1.25e-03

Pos logit contributions
aw+0.027
 particularly+0.024
 Other+0.024
 middle+0.023
ifying+0.022
Neg logit contributions
ro-0.038
uzz-0.034
emo-0.033
keeping-0.032
onna-0.030
Token that
Token ID326
Feature activation+1.50e-02

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.003
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token day
Token ID1110
Feature activation-8.52e-02

Pos logit contributions
ro+0.028
keeping+0.024
uzz+0.024
emo+0.024
onna+0.021
Neg logit contributions
 Other-0.024
 particularly-0.024
 middle-0.024
aw-0.023
 course-0.022
Token onwards
Token ID31719
Feature activation-2.95e-02

Pos logit contributions
 Other+0.155
aw+0.152
 pasture+0.145
opl+0.141
ns+0.141
Neg logit contributions
ro-0.102
uzz-0.091
 moment-0.091
 cheek-0.086
 from-0.084
Token,
Token ID11
Feature activation-6.02e-02

Pos logit contributions
aw+0.050
 Other+0.050
 particularly+0.047
ifying+0.046
opl+0.046
Neg logit contributions
ro-0.050
keeping-0.042
uzz-0.041
emo-0.041
 cheek-0.038
Token Sandra
Token ID32464
Feature activation-9.28e-03

Pos logit contributions
 Other+0.127
ifying+0.126
aw+0.125
lers+0.124
+0.121
Neg logit contributions
ro-0.063
 moment-0.054
emo-0.054
 common-0.051
uzz-0.050
Token always
Token ID1464
Feature activation+5.38e-03

Pos logit contributions
aw+0.015
 Other+0.013
 middle+0.013
 particularly+0.013
+0.012
Neg logit contributions
ro-0.018
uzz-0.015
emo-0.015
keeping-0.015
 cheek-0.014
Token enjoyed
Token ID8359
Feature activation-2.18e-03

Pos logit contributions
ro+0.011
uzz+0.009
emo+0.009
keeping+0.009
 cheek+0.009
Neg logit contributions
aw-0.008
 Other-0.007
 particularly-0.007
 middle-0.007
-0.007
Token of
Token ID286
Feature activation+2.67e-04

Pos logit contributions
aw+0.032
 Other+0.031
 middle+0.031
ifying+0.030
 particularly+0.029
Neg logit contributions
ro-0.048
emo-0.040
uzz-0.039
 cheek-0.038
keeping-0.038
Token fun
Token ID1257
Feature activation-2.22e-02

Pos logit contributions
ro+0.000
uzz+0.000
emo+0.000
keeping+0.000
onna+0.000
Neg logit contributions
aw-0.001
 Other-0.000
 middle-0.000
 particularly-0.000
-0.000
Token.
Token ID13
Feature activation-3.72e-02

Pos logit contributions
aw+0.042
 Other+0.041
 middle+0.040
 particularly+0.039
ifying+0.038
Neg logit contributions
ro-0.034
uzz-0.028
keeping-0.027
emo-0.027
 cheek-0.025
Token From
Token ID3574
Feature activation-1.28e-02

Pos logit contributions
aw+0.077
 particularly+0.073
ement+0.070
ifying+0.070
 middle+0.069
Neg logit contributions
ro-0.050
emo-0.039
uzz-0.039
icated-0.036
 cheek-0.033
Token that
Token ID326
Feature activation-2.67e-03

Pos logit contributions
aw+0.016
 Other+0.016
 particularly+0.014
+0.014
 middle+0.014
Neg logit contributions
ro-0.028
uzz-0.025
emo-0.024
keeping-0.023
 cheek-0.023
Token day
Token ID1110
Feature activation-8.39e-02

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.005
 particularly+0.005
+0.005
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token on
Token ID319
Feature activation-4.22e-02

Pos logit contributions
aw+0.146
pieces+0.135
 particularly+0.135
 pasture+0.132
ement+0.130
Neg logit contributions
ro-0.114
uzz-0.103
 cheek-0.098
 moment-0.097
 seek-0.097
Token,
Token ID11
Feature activation-3.36e-02

Pos logit contributions
aw+0.094
ifying+0.086
 particularly+0.081
pieces+0.081
 Other+0.079
Neg logit contributions
ro-0.054
emo-0.040
 common-0.039
 L-0.039
uzz-0.038
Token Max
Token ID5436
Feature activation-4.73e-03

Pos logit contributions
aw+0.077
ement+0.068
ifying+0.068
 middle+0.067
+0.066
Neg logit contributions
ro-0.042
emo-0.032
uzz-0.030
icated-0.028
 seek-0.028
Token and
Token ID290
Feature activation-6.42e-03

Pos logit contributions
aw+0.008
 Other+0.007
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.007
keeping-0.007
 cheek-0.007
Token Luna
Token ID23694
Feature activation-9.91e-03

Pos logit contributions
aw+0.012
 Other+0.011
 particularly+0.010
 middle+0.010
+0.010
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token right
Token ID826
Feature activation-1.27e-02

Pos logit contributions
ro+0.007
uzz+0.006
emo+0.006
keeping+0.006
onna+0.005
Neg logit contributions
aw-0.006
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token thing
Token ID1517
Feature activation-5.41e-03

Pos logit contributions
aw+0.029
 Other+0.027
 particularly+0.026
+0.026
 middle+0.026
Neg logit contributions
ro-0.017
uzz-0.013
emo-0.013
keeping-0.013
 cheek-0.011
Token.
Token ID13
Feature activation-3.08e-02

Pos logit contributions
aw+0.009
 Other+0.008
 middle+0.008
 particularly+0.008
+0.008
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.008
Token From
Token ID3574
Feature activation-1.42e-02

Pos logit contributions
aw+0.054
 particularly+0.050
 middle+0.048
ifying+0.048
 Other+0.047
Neg logit contributions
ro-0.052
emo-0.046
uzz-0.046
keeping-0.044
 itself-0.040
Token that
Token ID326
Feature activation+1.22e-03

Pos logit contributions
aw+0.018
 Other+0.018
 particularly+0.016
+0.016
 middle+0.016
Neg logit contributions
ro-0.030
uzz-0.027
emo-0.027
keeping-0.026
 cheek-0.025
Token day
Token ID1110
Feature activation-8.39e-02

Pos logit contributions
ro+0.002
uzz+0.002
keeping+0.002
emo+0.002
onna+0.001
Neg logit contributions
aw-0.002
 Other-0.002
 particularly-0.002
 middle-0.002
-0.002
Token on
Token ID319
Feature activation-3.91e-02

Pos logit contributions
aw+0.150
 Other+0.135
 pasture+0.134
ement+0.128
 particularly+0.128
Neg logit contributions
ro-0.110
 moment-0.103
 cheek-0.100
uzz-0.099
 seek-0.097
Token,
Token ID11
Feature activation-3.29e-02

Pos logit contributions
aw+0.084
 Other+0.073
ifying+0.073
 middle+0.070
 particularly+0.069
Neg logit contributions
ro-0.055
emo-0.044
uzz-0.041
 moment-0.041
 itself-0.040
Token Nut
Token ID11959
Feature activation+5.36e-03

Pos logit contributions
aw+0.063
ifying+0.057
 Other+0.056
 middle+0.055
ement+0.054
Neg logit contributions
ro-0.050
emo-0.045
uzz-0.041
keeping-0.039
 itself-0.038
Tokenty
Token ID774
Feature activation-1.37e-02

Pos logit contributions
ro+0.010
uzz+0.008
keeping+0.008
emo+0.008
 cheek+0.008
Neg logit contributions
aw-0.009
 Other-0.008
 particularly-0.008
 middle-0.008
opl-0.008
Token always
Token ID1464
Feature activation+9.07e-03

Pos logit contributions
aw+0.022
 Other+0.020
 middle+0.020
 particularly+0.019
+0.019
Neg logit contributions
ro-0.026
emo-0.022
uzz-0.022
keeping-0.021
 cheek-0.020
Token.
Token ID13
Feature activation-3.36e-02

Pos logit contributions
aw+0.009
 Other+0.008
 middle+0.008
 particularly+0.008
+0.008
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token\n
Token ID198
Feature activation-1.59e-02

Pos logit contributions
aw+0.057
 particularly+0.052
ifying+0.051
ement+0.049
 middle+0.048
Neg logit contributions
ro-0.058
emo-0.051
uzz-0.051
keeping-0.045
 cheek-0.044
Token\n
Token ID198
Feature activation-2.67e-02

Pos logit contributions
aw+0.027
 Other+0.024
 particularly+0.023
 middle+0.023
+0.023
Neg logit contributions
ro-0.029
uzz-0.025
emo-0.025
keeping-0.023
onna-0.023
TokenFrom
Token ID4863
Feature activation-8.49e-03

Pos logit contributions
aw+0.046
 Other+0.043
ement+0.042
opl+0.040
 middle+0.040
Neg logit contributions
ro-0.047
emo-0.039
uzz-0.037
 seek-0.035
onna-0.034
Token that
Token ID326
Feature activation-7.76e-03

Pos logit contributions
aw+0.010
 Other+0.009
 particularly+0.009
 middle+0.009
+0.008
Neg logit contributions
ro-0.019
uzz-0.017
emo-0.017
keeping-0.017
 cheek-0.016
Token day
Token ID1110
Feature activation-8.39e-02

Pos logit contributions
aw+0.017
 Other+0.015
 middle+0.015
 particularly+0.015
+0.014
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.008
 cheek-0.008
Token on
Token ID319
Feature activation-3.12e-02

Pos logit contributions
aw+0.154
pieces+0.140
ement+0.139
 particularly+0.132
 Other+0.132
Neg logit contributions
ro-0.116
uzz-0.101
 cheek-0.094
 seek-0.094
 moment-0.092
Token,
Token ID11
Feature activation-3.27e-02

Pos logit contributions
aw+0.066
ifying+0.057
 Other+0.055
opl+0.055
 particularly+0.054
Neg logit contributions
ro-0.048
emo-0.038
uzz-0.038
onna-0.035
 seek-0.034
Token Mitt
Token ID16627
Feature activation-5.16e-03

Pos logit contributions
aw+0.060
ement+0.054
ifying+0.052
opl+0.052
 middle+0.051
Neg logit contributions
ro-0.055
emo-0.046
uzz-0.041
 L-0.041
 seek-0.040
Tokenens
Token ID641
Feature activation-1.54e-02

Pos logit contributions
aw+0.010
 Other+0.009
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.007
keeping-0.007
 cheek-0.006
Token and
Token ID290
Feature activation-6.78e-04

Pos logit contributions
aw+0.026
 Other+0.025
 middle+0.024
 particularly+0.023
ifying+0.023
Neg logit contributions
ro-0.027
uzz-0.023
emo-0.022
keeping-0.022
onna-0.021
Token river
Token ID7850
Feature activation-2.07e-03

Pos logit contributions
aw+0.037
 Other+0.034
+0.033
 particularly+0.032
ifying+0.031
Neg logit contributions
ro-0.033
 cheek-0.027
emo-0.027
keeping-0.026
uzz-0.025
Tokenbank
Token ID17796
Feature activation-6.65e-03

Pos logit contributions
aw+0.004
 Other+0.003
 middle+0.003
 particularly+0.003
+0.003
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
aw+0.012
 Other+0.011
 particularly+0.010
 middle+0.010
+0.010
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.010
 cheek-0.009
Token From
Token ID3574
Feature activation-8.90e-03

Pos logit contributions
aw+0.035
 particularly+0.032
 Other+0.031
 middle+0.031
ifying+0.030
Neg logit contributions
ro-0.036
uzz-0.031
keeping-0.030
emo-0.030
 cheek-0.027
Token that
Token ID326
Feature activation-1.03e-03

Pos logit contributions
aw+0.011
 Other+0.010
 particularly+0.009
 middle+0.009
+0.009
Neg logit contributions
ro-0.020
uzz-0.018
emo-0.018
keeping-0.017
 cheek-0.017
Token day
Token ID1110
Feature activation-8.37e-02

Pos logit contributions
aw+0.002
 Other+0.002
 particularly+0.002
 middle+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.001
emo-0.001
keeping-0.001
 cheek-0.001
Token on
Token ID319
Feature activation-4.21e-02

Pos logit contributions
aw+0.145
pieces+0.135
 Other+0.135
opl+0.128
ement+0.127
Neg logit contributions
ro-0.114
 moment-0.109
uzz-0.109
 cheek-0.100
 seek-0.097
Token,
Token ID11
Feature activation-2.45e-02

Pos logit contributions
aw+0.085
 Other+0.077
pieces+0.074
ifying+0.073
opl+0.073
Neg logit contributions
ro-0.061
uzz-0.051
keeping-0.048
 moment-0.048
emo-0.047
Token Gig
Token ID19525
Feature activation+7.43e-04

Pos logit contributions
aw+0.049
 Other+0.045
 middle+0.044
 particularly+0.044
opl+0.043
Neg logit contributions
ro-0.035
uzz-0.030
emo-0.029
keeping-0.028
 cheek-0.025
Tokeni
Token ID72
Feature activation-2.43e-02

Pos logit contributions
ro+0.001
uzz+0.001
emo+0.001
keeping+0.001
 cheek+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 particularly-0.001
 middle-0.001
-0.001
Token knew
Token ID2993
Feature activation-1.64e-02

Pos logit contributions
aw+0.044
 Other+0.042
 middle+0.041
ifying+0.040
 particularly+0.039
Neg logit contributions
ro-0.041
uzz-0.034
emo-0.034
keeping-0.032
onna-0.030
Token fun
Token ID1257
Feature activation-2.62e-02

Pos logit contributions
aw+0.036
 Other+0.033
opl+0.031
 middle+0.031
ifying+0.030
Neg logit contributions
ro-0.050
emo-0.044
uzz-0.043
 cheek-0.041
 moment-0.041
Token day
Token ID1110
Feature activation-3.92e-02

Pos logit contributions
aw+0.055
 Other+0.052
ifying+0.051
 middle+0.049
 particularly+0.049
Neg logit contributions
ro-0.037
uzz-0.029
emo-0.029
keeping-0.027
 moment-0.026
Token.
Token ID13
Feature activation-2.64e-02

Pos logit contributions
 Other+0.062
aw+0.060
 particularly+0.057
 middle+0.056
opl+0.055
Neg logit contributions
ro-0.067
uzz-0.059
keeping-0.058
emo-0.054
and-0.053
Token From
Token ID3574
Feature activation-1.04e-02

Pos logit contributions
aw+0.049
 particularly+0.047
 Other+0.045
ifying+0.044
 middle+0.044
Neg logit contributions
ro-0.041
uzz-0.035
emo-0.035
keeping-0.031
onna-0.030
Token that
Token ID326
Feature activation-9.11e-03

Pos logit contributions
aw+0.013
 Other+0.012
 particularly+0.011
+0.011
 middle+0.011
Neg logit contributions
ro-0.023
uzz-0.021
emo-0.020
keeping-0.020
 cheek-0.019
Token day
Token ID1110
Feature activation-8.37e-02

Pos logit contributions
aw+0.020
 Other+0.018
 particularly+0.017
 middle+0.017
+0.017
Neg logit contributions
ro-0.013
uzz-0.010
emo-0.010
keeping-0.009
 cheek-0.009
Token on
Token ID319
Feature activation-4.11e-02

Pos logit contributions
 pasture+0.153
 particularly+0.153
aw+0.149
pieces+0.147
 Other+0.145
Neg logit contributions
ro-0.092
uzz-0.088
oys-0.086
 moment-0.077
 L-0.077
Token,
Token ID11
Feature activation-2.08e-02

Pos logit contributions
aw+0.089
 particularly+0.086
ifying+0.086
 Other+0.084
pieces+0.083
Neg logit contributions
ro-0.050
emo-0.039
 common-0.037
uzz-0.034
 L-0.034
Token they
Token ID484
Feature activation-3.21e-02

Pos logit contributions
aw+0.046
 particularly+0.043
 Other+0.043
 middle+0.043
ifying+0.043
Neg logit contributions
ro-0.025
emo-0.020
uzz-0.019
keeping-0.016
icated-0.016
Token became
Token ID2627
Feature activation-3.96e-02

Pos logit contributions
aw+0.051
 Other+0.050
ement+0.046
 middle+0.046
ifying+0.045
Neg logit contributions
ro-0.059
emo-0.051
uzz-0.051
keeping-0.047
onna-0.046
Token good
Token ID922
Feature activation-3.58e-02

Pos logit contributions
aw+0.070
ifying+0.068
ement+0.067
+0.067
 Other+0.066
Neg logit contributions
ro-0.060
emo-0.058
uzz-0.053
 seek-0.049
 common-0.048
Token each
Token ID1123
Feature activation-3.28e-02

Pos logit contributions
aw+0.054
ement+0.048
 middle+0.048
+0.048
 particularly+0.048
Neg logit contributions
ro-0.036
uzz-0.032
emo-0.030
keeping-0.029
onna-0.027
Token other
Token ID584
Feature activation+9.66e-03

Pos logit contributions
aw+0.059
+0.056
ifying+0.053
ement+0.050
 Other+0.049
Neg logit contributions
ro-0.052
uzz-0.045
emo-0.044
keeping-0.043
 cheek-0.043
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
ro+0.019
uzz+0.017
emo+0.017
keeping+0.016
onna+0.015
Neg logit contributions
aw-0.015
 Other-0.014
 middle-0.013
 particularly-0.013
-0.012
Token From
Token ID3574
Feature activation-1.17e-02

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.003
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.002
Token that
Token ID326
Feature activation-5.70e-03

Pos logit contributions
aw+0.015
 Other+0.014
 particularly+0.013
 middle+0.012
+0.012
Neg logit contributions
ro-0.026
uzz-0.023
emo-0.023
keeping-0.022
 cheek-0.021
Token day
Token ID1110
Feature activation-8.34e-02

Pos logit contributions
aw+0.012
 Other+0.011
 particularly+0.011
 middle+0.011
+0.010
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.007
keeping-0.006
 cheek-0.006
Token on
Token ID319
Feature activation-3.31e-02

Pos logit contributions
aw+0.157
 pasture+0.150
ement+0.145
opl+0.145
 particularly+0.144
Neg logit contributions
ro-0.094
uzz-0.082
 moment-0.082
 cheek-0.077
emo-0.077
Token,
Token ID11
Feature activation-2.96e-03

Pos logit contributions
aw+0.070
ifying+0.061
 Other+0.060
opl+0.060
 particularly+0.060
Neg logit contributions
ro-0.048
emo-0.037
uzz-0.037
 moment-0.036
onna-0.035
Token Lily
Token ID20037
Feature activation-1.32e-02

Pos logit contributions
aw+0.007
 Other+0.007
 particularly+0.006
 middle+0.006
+0.006
Neg logit contributions
ro-0.003
uzz-0.003
emo-0.003
keeping-0.002
 cheek-0.002
Token and
Token ID290
Feature activation+2.40e-02

Pos logit contributions
aw+0.020
 Other+0.018
 middle+0.018
 particularly+0.017
+0.017
Neg logit contributions
ro-0.026
emo-0.022
uzz-0.022
keeping-0.022
onna-0.020
Token Rose
Token ID8049
Feature activation-1.49e-02

Pos logit contributions
ro+0.052
uzz+0.045
emo+0.044
keeping+0.044
 cheek+0.043
Neg logit contributions
 Other-0.032
aw-0.030
 course-0.029
 particularly-0.028
-0.027
Token with
Token ID351
Feature activation-2.01e-02

Pos logit contributions
ro+0.019
uzz+0.017
emo+0.017
keeping+0.017
 cheek+0.016
Neg logit contributions
aw-0.017
 Other-0.015
 particularly-0.015
 middle-0.015
-0.014
Token Fred
Token ID8559
Feature activation+2.35e-03

Pos logit contributions
aw+0.035
 Other+0.032
+0.031
 particularly+0.030
 middle+0.030
Neg logit contributions
ro-0.035
uzz-0.030
keeping-0.029
emo-0.029
 cheek-0.027
Token.
Token ID13
Feature activation-2.13e-02

Pos logit contributions
ro+0.004
uzz+0.004
emo+0.004
keeping+0.004
 cheek+0.004
Neg logit contributions
aw-0.004
 Other-0.003
 particularly-0.003
 middle-0.003
-0.003
Token From
Token ID3574
Feature activation-1.18e-02

Pos logit contributions
aw+0.035
 particularly+0.031
 Other+0.031
 middle+0.031
ifying+0.030
Neg logit contributions
ro-0.039
uzz-0.034
emo-0.034
keeping-0.032
onna-0.030
Token that
Token ID326
Feature activation+5.59e-03

Pos logit contributions
aw+0.015
 Other+0.014
 particularly+0.013
 middle+0.013
+0.013
Neg logit contributions
ro-0.026
uzz-0.023
emo-0.023
keeping-0.022
 cheek-0.022
Token day
Token ID1110
Feature activation-8.34e-02

Pos logit contributions
ro+0.009
uzz+0.008
keeping+0.008
emo+0.008
 cheek+0.007
Neg logit contributions
aw-0.010
 Other-0.010
 particularly-0.010
 middle-0.010
-0.009
Token on
Token ID319
Feature activation-4.94e-02

Pos logit contributions
 pasture+0.152
aw+0.146
chairs+0.141
ifying+0.139
 particularly+0.138
Neg logit contributions
ro-0.100
 seek-0.094
 moment-0.094
 itself-0.090
emo-0.088
Token,
Token ID11
Feature activation-3.38e-02

Pos logit contributions
aw+0.105
ifying+0.093
 Other+0.091
 particularly+0.090
pieces+0.088
Neg logit contributions
ro-0.068
emo-0.057
 moment-0.054
 seek-0.054
 common-0.053
Token Tim
Token ID5045
Feature activation-8.74e-03

Pos logit contributions
aw+0.070
ifying+0.064
 Other+0.063
 middle+0.063
opl+0.062
Neg logit contributions
ro-0.047
emo-0.041
uzz-0.038
 seek-0.035
keeping-0.035
Token felt
Token ID2936
Feature activation-3.76e-02

Pos logit contributions
aw+0.015
 Other+0.014
 middle+0.013
 particularly+0.013
+0.013
Neg logit contributions
ro-0.016
uzz-0.013
emo-0.013
keeping-0.013
 cheek-0.012
Token strong
Token ID1913
Feature activation-2.53e-02

Pos logit contributions
aw+0.069
+0.067
 Other+0.064
ifying+0.063
ement+0.063
Neg logit contributions
ro-0.057
emo-0.051
uzz-0.047
 itself-0.043
 cheek-0.043
Token of
Token ID286
Feature activation-4.41e-03

Pos logit contributions
aw+0.043
ifying+0.043
 middle+0.042
 Other+0.042
+0.042
Neg logit contributions
ro-0.057
and-0.047
emo-0.047
uzz-0.046
icated-0.044
Token fun
Token ID1257
Feature activation-3.19e-02

Pos logit contributions
aw+0.009
 Other+0.008
 middle+0.008
 particularly+0.008
+0.008
Neg logit contributions
ro-0.007
uzz-0.006
emo-0.006
keeping-0.005
 cheek-0.005
Token.
Token ID13
Feature activation-4.30e-03

Pos logit contributions
aw+0.063
 Other+0.060
ifying+0.060
 particularly+0.059
 middle+0.058
Neg logit contributions
ro-0.045
keeping-0.036
uzz-0.035
emo-0.035
 textures-0.032
Token From
Token ID3574
Feature activation-5.70e-04

Pos logit contributions
aw+0.006
 Other+0.005
 particularly+0.005
 middle+0.005
+0.005
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.008
Token that
Token ID326
Feature activation+3.63e-03

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.000
Neg logit contributions
ro-0.001
uzz-0.001
emo-0.001
keeping-0.001
 cheek-0.001
Token day
Token ID1110
Feature activation-8.31e-02

Pos logit contributions
ro+0.006
uzz+0.005
keeping+0.005
emo+0.005
onna+0.004
Neg logit contributions
aw-0.007
 Other-0.006
 middle-0.006
 particularly-0.006
-0.006
Token on
Token ID319
Feature activation-2.60e-02

Pos logit contributions
aw+0.150
pieces+0.142
ement+0.142
 pasture+0.139
 particularly+0.139
Neg logit contributions
ro-0.102
uzz-0.092
 moment-0.091
 cheek-0.086
oys-0.086
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
aw+0.052
 Other+0.046
ifying+0.045
 particularly+0.045
opl+0.044
Neg logit contributions
ro-0.041
emo-0.033
uzz-0.032
onna-0.030
keeping-0.030
Token Lily
Token ID20037
Feature activation-1.63e-03

Pos logit contributions
aw+0.034
 Other+0.031
 middle+0.030
 particularly+0.030
opl+0.030
Neg logit contributions
ro-0.011
uzz-0.008
emo-0.008
keeping-0.006
onna-0.006
Token always
Token ID1464
Feature activation+2.39e-02

Pos logit contributions
aw+0.003
 Other+0.002
 particularly+0.002
 middle+0.002
+0.002
Neg logit contributions
ro-0.003
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token drew
Token ID9859
Feature activation-4.56e-02

Pos logit contributions
ro+0.052
uzz+0.046
keeping+0.044
emo+0.044
onna+0.043
Neg logit contributions
aw-0.031
 particularly-0.028
 Other-0.028
 middle-0.026
 course-0.024
Token He
Token ID679
Feature activation-2.73e-02

Pos logit contributions
ro+0.021
keeping+0.019
uzz+0.019
emo+0.019
 cheek+0.018
Neg logit contributions
aw-0.012
 Other-0.011
 middle-0.010
 particularly-0.009
-0.009
Token wants
Token ID3382
Feature activation+1.03e-02

Pos logit contributions
aw+0.052
 Other+0.051
+0.050
 middle+0.049
 particularly+0.048
Neg logit contributions
ro-0.042
uzz-0.032
emo-0.031
onna-0.030
 cheek-0.029
Token to
Token ID284
Feature activation+1.23e-02

Pos logit contributions
ro+0.018
uzz+0.015
emo+0.015
keeping+0.015
 cheek+0.014
Neg logit contributions
aw-0.019
 Other-0.017
 middle-0.016
 particularly-0.016
 course-0.015
Token make
Token ID787
Feature activation+1.83e-02

Pos logit contributions
ro+0.022
uzz+0.019
emo+0.019
keeping+0.018
 cheek+0.017
Neg logit contributions
aw-0.021
 middle-0.019
 Other-0.019
 particularly-0.018
opl-0.018
Token it
Token ID340
Feature activation+3.09e-02

Pos logit contributions
ro+0.037
uzz+0.032
keeping+0.031
emo+0.030
onna+0.029
Neg logit contributions
aw-0.029
 Other-0.026
 middle-0.025
 particularly-0.023
opl-0.023
Token go
Token ID467
Feature activation+3.69e-02

Pos logit contributions
ro+0.065
keeping+0.059
uzz+0.056
emo+0.055
 cheek+0.049
Neg logit contributions
aw-0.042
 middle-0.040
 Other-0.039
 particularly-0.037
opl-0.034
Token away
Token ID1497
Feature activation+2.72e-02

Pos logit contributions
ro+0.074
emo+0.071
uzz+0.070
keeping+0.068
 textures+0.065
Neg logit contributions
aw-0.049
 forward-0.045
 particularly-0.044
 middle-0.043
 Other-0.041
Token.
Token ID13
Feature activation+6.41e-03

Pos logit contributions
ro+0.056
emo+0.050
uzz+0.049
keeping+0.047
 cheek+0.046
Neg logit contributions
aw-0.036
 Other-0.034
 middle-0.033
 particularly-0.031
 forward-0.030
Token He
Token ID679
Feature activation-1.67e-02

Pos logit contributions
ro+0.014
uzz+0.012
keeping+0.012
emo+0.012
 cheek+0.011
Neg logit contributions
aw-0.009
 Other-0.008
 middle-0.008
 particularly-0.007
-0.007
Token looks
Token ID3073
Feature activation-1.98e-02

Pos logit contributions
aw+0.033
 Other+0.032
 middle+0.031
+0.031
 particularly+0.031
Neg logit contributions
ro-0.025
uzz-0.020
emo-0.019
onna-0.018
 cheek-0.017
Token for
Token ID329
Feature activation+1.32e-02

Pos logit contributions
aw+0.028
+0.027
 Other+0.026
opl+0.025
 middle+0.024
Neg logit contributions
ro-0.040
uzz-0.035
emo-0.034
 cheek-0.033
onna-0.032
Token happy
Token ID3772
Feature activation+1.78e-02

Pos logit contributions
ro+0.024
keeping+0.020
uzz+0.020
emo+0.020
onna+0.018
Neg logit contributions
aw-0.017
 particularly-0.015
 Other-0.015
 middle-0.015
opl-0.014
Token and
Token ID290
Feature activation+2.27e-02

Pos logit contributions
ro+0.037
uzz+0.032
emo+0.031
keeping+0.030
 cheek+0.029
Neg logit contributions
aw-0.027
 Other-0.022
 particularly-0.022
 course-0.021
 middle-0.021
Token whist
Token ID32262
Feature activation-4.88e-03

Pos logit contributions
ro+0.044
uzz+0.038
keeping+0.036
emo+0.035
onna+0.034
Neg logit contributions
aw-0.038
 particularly-0.033
 middle-0.032
 Other-0.032
 course-0.028
Tokenling
Token ID1359
Feature activation-1.57e-02

Pos logit contributions
aw+0.007
 Other+0.006
 particularly+0.006
 middle+0.006
+0.006
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token a
Token ID257
Feature activation+5.38e-04

Pos logit contributions
aw+0.028
 Other+0.026
+0.025
 particularly+0.025
 middle+0.025
Neg logit contributions
ro-0.027
uzz-0.023
emo-0.022
keeping-0.021
 cheek-0.020
Token fun
Token ID1257
Feature activation+3.71e-02

Pos logit contributions
ro+0.001
uzz+0.001
keeping+0.001
emo+0.001
onna+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token song
Token ID3496
Feature activation-4.87e-03

Pos logit contributions
ro+0.057
uzz+0.054
keeping+0.050
emo+0.050
ge+0.048
Neg logit contributions
aw-0.063
 middle-0.061
 course-0.059
 particularly-0.059
 Other-0.058
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.009
uzz-0.007
emo-0.007
keeping-0.007
 cheek-0.007
Token Tom
Token ID4186
Feature activation+3.88e-02

Pos logit contributions
aw+0.028
 particularly+0.027
 Other+0.026
 middle+0.026
ifying+0.025
Neg logit contributions
ro-0.019
uzz-0.015
emo-0.015
keeping-0.014
onna-0.013
Token wanted
Token ID2227
Feature activation+1.63e-02

Pos logit contributions
ro+0.080
keeping+0.076
uzz+0.074
emo+0.067
onna+0.067
Neg logit contributions
aw-0.060
 particularly-0.048
 Other-0.045
 middle-0.044
 course-0.042
Token to
Token ID284
Feature activation+5.73e-03

Pos logit contributions
ro+0.032
uzz+0.028
keeping+0.027
emo+0.026
ge+0.026
Neg logit contributions
aw-0.026
 Other-0.024
 particularly-0.023
 middle-0.022
 course-0.021
Token.
Token ID13
Feature activation-2.15e-03

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
emo-0.000
keeping-0.000
 cheek-0.000
Token They
Token ID1119
Feature activation+2.01e-02

Pos logit contributions
aw+0.004
 Other+0.004
 middle+0.003
 particularly+0.003
+0.003
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token pretended
Token ID36766
Feature activation+1.25e-02

Pos logit contributions
ro+0.038
keeping+0.034
emo+0.034
uzz+0.033
 cheek+0.032
Neg logit contributions
aw-0.032
 middle-0.029
 particularly-0.029
 Other-0.028
ifying-0.025
Token they
Token ID484
Feature activation+1.06e-02

Pos logit contributions
ro+0.022
uzz+0.019
emo+0.018
keeping+0.018
 cheek+0.017
Neg logit contributions
aw-0.023
 Other-0.020
 particularly-0.020
 middle-0.020
-0.018
Token were
Token ID547
Feature activation+4.21e-03

Pos logit contributions
ro+0.025
uzz+0.022
emo+0.022
keeping+0.022
 cheek+0.021
Neg logit contributions
aw-0.012
 particularly-0.011
 middle-0.011
 Other-0.011
-0.010
Token on
Token ID319
Feature activation+3.53e-02

Pos logit contributions
ro+0.008
uzz+0.007
emo+0.007
keeping+0.007
 cheek+0.007
Neg logit contributions
aw-0.007
 Other-0.006
 middle-0.006
 particularly-0.006
-0.005
Token Mars
Token ID8706
Feature activation-2.38e-03

Pos logit contributions
ro+0.075
emo+0.072
uzz+0.070
 cheek+0.067
keeping+0.066
Neg logit contributions
aw-0.047
 Other-0.042
 middle-0.041
 particularly-0.040
 course-0.038
Token,
Token ID11
Feature activation+2.84e-02

Pos logit contributions
aw+0.004
 Other+0.004
 particularly+0.004
 middle+0.004
+0.003
Neg logit contributions
ro-0.004
uzz-0.004
emo-0.004
keeping-0.004
 cheek-0.003
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
ro+0.053
emo+0.049
uzz+0.047
keeping+0.047
 cheek+0.043
Neg logit contributions
aw-0.045
 Other-0.040
 particularly-0.040
 middle-0.039
ifying-0.036
Token they
Token ID484
Feature activation+8.68e-03

Pos logit contributions
ro+0.020
uzz+0.017
emo+0.017
keeping+0.017
 cheek+0.016
Neg logit contributions
aw-0.020
 Other-0.018
 middle-0.017
 particularly-0.017
ifying-0.016
Token wore
Token ID12408
Feature activation+2.28e-02

Pos logit contributions
ro+0.016
uzz+0.014
emo+0.014
keeping+0.014
 cheek+0.013
Neg logit contributions
aw-0.014
 middle-0.013
 Other-0.013
 particularly-0.013
ifying-0.011
Token excited
Token ID6568
Feature activation+1.07e-02

Pos logit contributions
keeping+0.080
ro+0.077
ge+0.062
uzz+0.062
icated+0.061
Neg logit contributions
aw-0.061
 middle-0.053
 Other-0.052
 particularly-0.050
Eventually-0.046
Token that
Token ID326
Feature activation-4.80e-04

Pos logit contributions
ro+0.022
uzz+0.020
emo+0.019
keeping+0.019
 cheek+0.018
Neg logit contributions
aw-0.017
 Other-0.014
 particularly-0.013
 middle-0.013
-0.012
Token he
Token ID339
Feature activation+2.74e-02

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.001
keeping-0.001
emo-0.001
 cheek-0.001
Token ran
Token ID4966
Feature activation+3.87e-02

Pos logit contributions
ro+0.055
keeping+0.053
uzz+0.051
ge+0.046
emo+0.046
Neg logit contributions
aw-0.041
 particularly-0.037
 middle-0.032
 Other-0.032
-0.029
Token to
Token ID284
Feature activation+1.83e-02

Pos logit contributions
ro+0.078
emo+0.074
uzz+0.071
keeping+0.069
icated+0.069
Neg logit contributions
aw-0.048
 forward-0.047
 particularly-0.047
 middle-0.046
 Other-0.044
Token it
Token ID340
Feature activation+3.66e-02

Pos logit contributions
ro+0.039
uzz+0.034
keeping+0.033
emo+0.033
onna+0.031
Neg logit contributions
 middle-0.025
 Other-0.025
aw-0.024
 course-0.022
 particularly-0.021
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
ro+0.085
emo+0.085
keeping+0.083
uzz+0.082
icated+0.077
Neg logit contributions
 middle-0.034
 Other-0.033
aw-0.032
 particularly-0.032
-0.030
Token kicked
Token ID12165
Feature activation+2.08e-02

Pos logit contributions
ro+0.031
uzz+0.028
keeping+0.028
emo+0.027
 cheek+0.025
Neg logit contributions
aw-0.022
 particularly-0.020
 middle-0.020
 Other-0.019
-0.018
Token it
Token ID340
Feature activation+2.50e-02

Pos logit contributions
ro+0.038
uzz+0.034
keeping+0.033
emo+0.033
onna+0.031
Neg logit contributions
aw-0.037
 middle-0.034
 Other-0.033
 forward-0.031
 particularly-0.030
Token with
Token ID351
Feature activation+2.36e-03

Pos logit contributions
ro+0.050
keeping+0.049
emo+0.047
uzz+0.046
icated+0.042
Neg logit contributions
aw-0.035
 Other-0.033
 middle-0.032
 particularly-0.031
 forward-0.030
Token all
Token ID477
Feature activation+3.10e-02

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
 cheek+0.004
Neg logit contributions
aw-0.004
 Other-0.004
 middle-0.003
 particularly-0.003
-0.003
Token The
Token ID383
Feature activation+2.77e-02

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token ticket
Token ID7846
Feature activation+1.18e-02

Pos logit contributions
ro+0.047
keeping+0.047
uzz+0.045
onna+0.042
ge+0.040
Neg logit contributions
aw-0.052
 middle-0.048
 Other-0.043
 course-0.041
 particularly-0.038
Token was
Token ID373
Feature activation+1.50e-02

Pos logit contributions
ro+0.022
uzz+0.020
emo+0.019
keeping+0.019
onna+0.018
Neg logit contributions
aw-0.019
 middle-0.017
 Other-0.016
 particularly-0.016
-0.016
Token for
Token ID329
Feature activation+4.11e-03

Pos logit contributions
ro+0.031
keeping+0.027
emo+0.026
uzz+0.026
 cheek+0.024
Neg logit contributions
aw-0.022
 particularly-0.021
 middle-0.020
 Other-0.019
ifying-0.018
Token a
Token ID257
Feature activation+4.41e-02

Pos logit contributions
ro+0.006
uzz+0.005
emo+0.005
keeping+0.005
onna+0.005
Neg logit contributions
aw-0.008
 Other-0.008
 particularly-0.007
 middle-0.007
-0.007
Token fun
Token ID1257
Feature activation+4.26e-02

Pos logit contributions
keeping+0.083
ro+0.083
uzz+0.080
emo+0.075
 itself+0.070
Neg logit contributions
 particularly-0.063
 middle-0.057
 Other-0.057
aw-0.056
-0.056
Token ride
Token ID6594
Feature activation+2.37e-03

Pos logit contributions
ro+0.075
emo+0.073
uzz+0.073
keeping+0.071
icated+0.069
Neg logit contributions
aw-0.062
 course-0.056
 middle-0.055
date-0.054
 Other-0.052
Token at
Token ID379
Feature activation+2.07e-02

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
 cheek+0.004
Neg logit contributions
aw-0.003
 Other-0.003
 particularly-0.003
 middle-0.003
-0.003
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
ro+0.040
uzz+0.035
 cheek+0.034
emo+0.034
keeping+0.032
Neg logit contributions
aw-0.031
 Other-0.031
 middle-0.030
opl-0.029
 particularly-0.028
Token carn
Token ID18466
Feature activation-7.83e-03

Pos logit contributions
ro+0.030
uzz+0.029
keeping+0.028
emo+0.027
onna+0.026
Neg logit contributions
aw-0.018
 middle-0.015
 Other-0.015
 particularly-0.014
 course-0.013
Tokenival
Token ID2473
Feature activation+2.10e-02

Pos logit contributions
aw+0.008
 Other+0.007
 particularly+0.007
 middle+0.007
+0.007
Neg logit contributions
ro-0.019
uzz-0.017
emo-0.017
keeping-0.016
 cheek-0.016
Token in
Token ID287
Feature activation+1.51e-03

Pos logit contributions
ro+0.042
emo+0.038
uzz+0.037
keeping+0.037
icated+0.035
Neg logit contributions
aw-0.033
 middle-0.028
 particularly-0.027
 Other-0.027
-0.026
Token the
Token ID262
Feature activation-4.72e-02

Pos logit contributions
ro+0.003
uzz+0.002
emo+0.002
keeping+0.002
 cheek+0.002
Neg logit contributions
aw-0.003
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token to
Token ID284
Feature activation-3.28e-02

Pos logit contributions
 Other+0.086
aw+0.085
+0.084
opl+0.082
ement+0.082
Neg logit contributions
ro-0.072
 cheek-0.067
 moment-0.052
keeping-0.051
onna-0.051
Tokenaster
Token ID1603
Feature activation-1.04e-03

Pos logit contributions
aw+0.046
 Other+0.042
 particularly+0.038
ement+0.038
opl+0.038
Neg logit contributions
ro-0.071
onna-0.059
 cheek-0.058
emo-0.057
uzz-0.056
Token and
Token ID290
Feature activation-8.72e-03

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.001
Token pushed
Token ID7121
Feature activation-1.39e-03

Pos logit contributions
aw+0.016
 Other+0.015
 particularly+0.015
 middle+0.014
+0.014
Neg logit contributions
ro-0.014
uzz-0.012
emo-0.012
keeping-0.012
 cheek-0.011
Token the
Token ID262
Feature activation-1.07e-02

Pos logit contributions
aw+0.003
 Other+0.002
 middle+0.002
 particularly+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.002
Token lever
Token ID17124
Feature activation+9.25e-03

Pos logit contributions
aw+0.018
 Other+0.017
 particularly+0.016
+0.016
 middle+0.016
Neg logit contributions
ro-0.019
uzz-0.016
emo-0.016
keeping-0.016
 cheek-0.016
Token down
Token ID866
Feature activation-2.81e-04

Pos logit contributions
ro+0.017
emo+0.015
uzz+0.015
keeping+0.015
 cheek+0.014
Neg logit contributions
aw-0.015
 Other-0.014
 middle-0.013
 particularly-0.013
-0.012
Token.
Token ID13
Feature activation-4.14e-03

Pos logit contributions
aw+0.000
 Other+0.000
 middle+0.000
 particularly+0.000
+0.000
Neg logit contributions
ro-0.000
uzz-0.000
emo-0.000
keeping-0.000
 cheek-0.000
Token She
Token ID1375
Feature activation-5.33e-03

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.005
 particularly+0.005
+0.004
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.008
Token "
Token ID366
Feature activation+1.36e-02

Pos logit contributions
ro+0.011
uzz+0.010
keeping+0.010
emo+0.009
onna+0.009
Neg logit contributions
aw-0.008
 Other-0.007
 particularly-0.007
 middle-0.007
-0.007
TokenYes
Token ID5297
Feature activation-8.23e-03

Pos logit contributions
ro+0.024
keeping+0.021
uzz+0.021
emo+0.021
onna+0.019
Neg logit contributions
aw-0.023
 particularly-0.021
 Other-0.021
 middle-0.021
-0.020
Token,
Token ID11
Feature activation+2.00e-02

Pos logit contributions
aw+0.012
 Other+0.011
 particularly+0.011
 middle+0.011
+0.010
Neg logit contributions
ro-0.016
uzz-0.014
emo-0.014
keeping-0.014
 cheek-0.013
Token you
Token ID345
Feature activation-1.46e-02

Pos logit contributions
ro+0.037
keeping+0.033
uzz+0.033
emo+0.032
ge+0.029
Neg logit contributions
aw-0.034
 Other-0.029
 particularly-0.029
 middle-0.029
 course-0.028
Token can
Token ID460
Feature activation-1.54e-02

Pos logit contributions
aw+0.027
 Other+0.025
 particularly+0.024
 middle+0.024
opl+0.024
Neg logit contributions
ro-0.024
uzz-0.020
emo-0.020
 cheek-0.018
onna-0.018
Token take
Token ID1011
Feature activation-6.42e-03

Pos logit contributions
aw+0.031
 Other+0.028
 particularly+0.026
 middle+0.026
opl+0.026
Neg logit contributions
ro-0.024
uzz-0.020
emo-0.020
onna-0.018
keeping-0.018
Token some
Token ID617
Feature activation+7.08e-04

Pos logit contributions
aw+0.011
 Other+0.010
 particularly+0.010
 middle+0.010
+0.010
Neg logit contributions
ro-0.011
uzz-0.010
emo-0.009
keeping-0.009
 cheek-0.009
Token water
Token ID1660
Feature activation+1.84e-02

Pos logit contributions
ro+0.001
uzz+0.001
emo+0.001
keeping+0.001
 cheek+0.001
Neg logit contributions
aw-0.001
 Other-0.001
 particularly-0.001
 middle-0.001
-0.001
Token."
Token ID526
Feature activation+1.89e-02

Pos logit contributions
ro+0.031
uzz+0.028
emo+0.027
keeping+0.026
 cheek+0.025
Neg logit contributions
aw-0.032
 Other-0.029
 middle-0.028
 particularly-0.028
-0.027
Token
Token ID198
Feature activation+9.38e-03

Pos logit contributions
ro+0.044
uzz+0.041
emo+0.040
keeping+0.039
ge+0.039
Neg logit contributions
aw-0.023
 Other-0.021
-0.018
 middle-0.018
 Eventually-0.016
Token
Token ID198
Feature activation+5.76e-03

Pos logit contributions
ro+0.020
uzz+0.017
keeping+0.017
emo+0.017
 cheek+0.016
Neg logit contributions
aw-0.014
 Other-0.012
 middle-0.012
 particularly-0.011
-0.011
Token cried
Token ID16896
Feature activation-4.67e-03

Pos logit contributions
ro+0.041
uzz+0.038
keeping+0.038
emo+0.036
onna+0.033
Neg logit contributions
aw-0.042
 middle-0.036
 particularly-0.035
 Other-0.035
lers-0.031
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
aw+0.008
 Other+0.007
 particularly+0.007
 middle+0.007
+0.007
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.007
keeping-0.007
 cheek-0.006
Token "
Token ID366
Feature activation+1.99e-02

Pos logit contributions
aw+0.021
 Other+0.019
 particularly+0.018
 middle+0.018
opl+0.018
Neg logit contributions
ro-0.023
uzz-0.019
emo-0.019
keeping-0.018
 cheek-0.018
TokenThis
Token ID1212
Feature activation-1.62e-02

Pos logit contributions
ro+0.043
uzz+0.039
keeping+0.038
emo+0.037
onna+0.036
Neg logit contributions
aw-0.027
 middle-0.024
 Other-0.023
 particularly-0.022
-0.022
Token cake
Token ID12187
Feature activation-7.02e-03

Pos logit contributions
aw+0.029
 Other+0.027
 particularly+0.026
 middle+0.025
+0.025
Neg logit contributions
ro-0.028
uzz-0.023
emo-0.023
keeping-0.022
onna-0.022
Token is
Token ID318
Feature activation+5.91e-03

Pos logit contributions
aw+0.012
 Other+0.011
 particularly+0.010
 middle+0.010
+0.010
Neg logit contributions
ro-0.013
uzz-0.011
emo-0.011
keeping-0.011
 cheek-0.010
Token bad
Token ID2089
Feature activation+3.01e-04

Pos logit contributions
ro+0.011
uzz+0.010
keeping+0.010
emo+0.010
 cheek+0.009
Neg logit contributions
aw-0.009
 particularly-0.008
 Other-0.008
 middle-0.008
-0.008
Token!
Token ID0
Feature activation+6.11e-03

Pos logit contributions
ro+0.001
uzz+0.000
emo+0.000
keeping+0.000
 cheek+0.000
Neg logit contributions
aw-0.001
 Other-0.000
 middle-0.000
 particularly-0.000
-0.000
Token It
Token ID632
Feature activation+1.37e-04

Pos logit contributions
ro+0.012
uzz+0.011
emo+0.010
keeping+0.010
ge+0.010
Neg logit contributions
aw-0.009
 Other-0.009
 middle-0.008
-0.008
 particularly-0.008
Token hurts
Token ID20406
Feature activation+5.28e-03

Pos logit contributions
ro+0.000
uzz+0.000
emo+0.000
keeping+0.000
 cheek+0.000
Neg logit contributions
aw-0.000
 Other-0.000
 particularly-0.000
 middle-0.000
-0.000
Token!"
Token ID2474
Feature activation+1.50e-03

Pos logit contributions
ro+0.009
uzz+0.007
emo+0.007
keeping+0.007
onna+0.007
Neg logit contributions
aw-0.010
 Other-0.009
 middle-0.009
 particularly-0.009
ifying-0.008
Token
Token ID198
Feature activation+1.67e-02

Pos logit contributions
ro+0.022
keeping+0.019
uzz+0.019
emo+0.019
 cheek+0.019
Neg logit contributions
aw-0.015
 Other-0.014
 middle-0.013
-0.013
 particularly-0.012
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
ro+0.037
keeping+0.033
 cheek+0.032
uzz+0.032
emo+0.031
Neg logit contributions
aw-0.022
 Other-0.020
 middle-0.019
 particularly-0.018
Eventually-0.017
Token"
Token ID1
Feature activation+1.01e-02

Pos logit contributions
ro+0.024
keeping+0.022
uzz+0.021
emo+0.021
 cheek+0.020
Neg logit contributions
aw-0.017
 middle-0.015
 Other-0.015
 particularly-0.014
Eventually-0.014
TokenStop
Token ID19485
Feature activation+1.32e-02

Pos logit contributions
ro+0.021
uzz+0.019
keeping+0.019
emo+0.018
onna+0.017
Neg logit contributions
aw-0.015
 middle-0.013
 Other-0.013
-0.012
 particularly-0.012
Token it
Token ID340
Feature activation-2.81e-02

Pos logit contributions
ro+0.027
uzz+0.023
keeping+0.023
emo+0.023
onna+0.021
Neg logit contributions
aw-0.021
 Other-0.018
 middle-0.017
-0.017
 particularly-0.017
Token,
Token ID11
Feature activation+2.92e-03

Pos logit contributions
aw+0.050
 Other+0.047
 particularly+0.047
ifying+0.046
 middle+0.045
Neg logit contributions
ro-0.046
 cheek-0.038
uzz-0.037
emo-0.036
 seek-0.036
Token kids
Token ID3988
Feature activation-6.43e-03

Pos logit contributions
ro+0.005
uzz+0.004
keeping+0.004
emo+0.004
onna+0.004
Neg logit contributions
aw-0.005
 Other-0.005
 middle-0.005
 particularly-0.004
-0.004
Token!"
Token ID2474
Feature activation+8.07e-03

Pos logit contributions
aw+0.011
 Other+0.010
 particularly+0.010
 middle+0.010
+0.010
Neg logit contributions
ro-0.012
uzz-0.010
emo-0.010
keeping-0.009
 cheek-0.009
Token the
Token ID262
Feature activation-6.46e-03

Pos logit contributions
ro+0.012
uzz+0.010
keeping+0.010
emo+0.010
 cheek+0.009
Neg logit contributions
aw-0.016
 Other-0.015
 middle-0.015
 particularly-0.014
-0.014
Token man
Token ID582
Feature activation-1.41e-02

Pos logit contributions
aw+0.012
 Other+0.011
 particularly+0.011
 middle+0.011
+0.011
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.008
 cheek-0.008
Token said
Token ID531
Feature activation+9.18e-04

Pos logit contributions
aw+0.031
 Other+0.030
 particularly+0.029
 middle+0.029
+0.029
Neg logit contributions
ro-0.018
uzz-0.014
emo-0.013
keeping-0.013
 cheek-0.012
TokenB
Token ID33
Feature activation+2.70e-02

Pos logit contributions
ro+0.025
emo+0.021
uzz+0.020
keeping+0.020
onna+0.020
Neg logit contributions
aw-0.029
 Other-0.026
 particularly-0.026
 middle-0.025
ifying-0.024
Tokenun
Token ID403
Feature activation+3.02e-02

Pos logit contributions
ro+0.055
keeping+0.050
emo+0.049
onna+0.048
 cheek+0.048
Neg logit contributions
aw-0.035
 Other-0.034
-0.031
 particularly-0.030
 Leave-0.029
Token followed
Token ID3940
Feature activation+2.69e-03

Pos logit contributions
ro+0.055
uzz+0.051
keeping+0.049
emo+0.049
onna+0.048
Neg logit contributions
aw-0.049
 Other-0.042
 particularly-0.041
 course-0.039
 middle-0.039
Token the
Token ID262
Feature activation-3.70e-03

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
 cheek+0.004
Neg logit contributions
aw-0.005
 Other-0.005
 particularly-0.004
 middle-0.004
-0.004
Token squirrel
Token ID33039
Feature activation+2.91e-03

Pos logit contributions
aw+0.007
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.007
uzz-0.006
emo-0.005
keeping-0.005
 cheek-0.005
Token to
Token ID284
Feature activation+1.40e-02

Pos logit contributions
ro+0.005
uzz+0.005
emo+0.005
keeping+0.004
 cheek+0.004
Neg logit contributions
aw-0.005
 Other-0.004
 middle-0.004
 particularly-0.004
-0.004
Token a
Token ID257
Feature activation+1.54e-03

Pos logit contributions
ro+0.028
uzz+0.025
emo+0.024
keeping+0.023
onna+0.022
Neg logit contributions
 Other-0.020
aw-0.019
 middle-0.019
 particularly-0.018
-0.018
Token big
Token ID1263
Feature activation-1.00e-02

Pos logit contributions
ro+0.003
uzz+0.003
emo+0.003
keeping+0.002
 cheek+0.002
Neg logit contributions
aw-0.002
 Other-0.002
 particularly-0.002
 middle-0.002
-0.002
Token tree
Token ID5509
Feature activation-5.46e-03

Pos logit contributions
aw+0.017
 Other+0.015
 particularly+0.015
 middle+0.015
+0.015
Neg logit contributions
ro-0.018
uzz-0.016
emo-0.015
keeping-0.015
 cheek-0.014
Token.
Token ID13
Feature activation-1.02e-03

Pos logit contributions
aw+0.010
 Other+0.009
 particularly+0.009
 middle+0.009
+0.008
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.007
Token The
Token ID383
Feature activation-1.48e-02

Pos logit contributions
aw+0.001
 Other+0.001
 middle+0.001
 particularly+0.001
+0.001
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.002
Token and
Token ID290
Feature activation-5.86e-03

Pos logit contributions
aw+0.016
 Other+0.014
 particularly+0.014
 middle+0.014
+0.013
Neg logit contributions
ro-0.015
uzz-0.012
emo-0.012
keeping-0.012
 cheek-0.011
Token they
Token ID484
Feature activation-2.23e-02

Pos logit contributions
aw+0.011
 Other+0.010
 middle+0.010
 particularly+0.010
+0.010
Neg logit contributions
ro-0.010
uzz-0.008
emo-0.008
keeping-0.008
 cheek-0.007
Token walked
Token ID6807
Feature activation-1.03e-02

Pos logit contributions
aw+0.031
 Other+0.030
opl+0.027
 particularly+0.027
 middle+0.026
Neg logit contributions
ro-0.046
uzz-0.041
keeping-0.038
emo-0.038
 cheek-0.037
Token back
Token ID736
Feature activation+1.49e-03

Pos logit contributions
aw+0.020
 Other+0.018
 particularly+0.018
+0.018
opl+0.017
Neg logit contributions
ro-0.017
uzz-0.014
keeping-0.013
emo-0.013
 cheek-0.012
Token inside
Token ID2641
Feature activation+6.32e-03

Pos logit contributions
ro+0.002
uzz+0.002
emo+0.002
keeping+0.002
 cheek+0.002
Neg logit contributions
aw-0.003
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
ro+0.012
uzz+0.010
emo+0.010
keeping+0.010
 cheek+0.009
Neg logit contributions
aw-0.010
 Other-0.010
 particularly-0.009
 middle-0.009
-0.009
Token<|endoftext|>
Token ID50256
Feature activation+2.51e-03

Pos logit contributions
aw+0.022
 Other+0.020
 particularly+0.020
 middle+0.019
opl+0.018
Neg logit contributions
ro-0.024
uzz-0.021
emo-0.021
keeping-0.020
onna-0.019
TokenOnce
Token ID7454
Feature activation+3.38e-03

Pos logit contributions
ro+0.006
uzz+0.005
emo+0.005
keeping+0.005
 cheek+0.005
Neg logit contributions
aw-0.003
 Other-0.003
 middle-0.003
 particularly-0.002
-0.002
Token upon
Token ID2402
Feature activation+9.55e-03

Pos logit contributions
ro+0.004
uzz+0.003
keeping+0.003
emo+0.003
 cheek+0.003
Neg logit contributions
aw-0.008
 Other-0.007
 middle-0.007
 particularly-0.007
-0.007
Token a
Token ID257
Feature activation-2.66e-02

Pos logit contributions
ro+0.020
uzz+0.017
keeping+0.017
emo+0.017
onna+0.016
Neg logit contributions
aw-0.014
 Other-0.012
 middle-0.012
 particularly-0.012
-0.011
Token time
Token ID640
Feature activation+1.77e-02

Pos logit contributions
aw+0.045
 Other+0.041
 middle+0.040
ifying+0.038
 particularly+0.038
Neg logit contributions
ro-0.048
uzz-0.041
emo-0.040
onna-0.037
 cheek-0.037
Token had
Token ID550
Feature activation-1.97e-03

Pos logit contributions
 Other+0.053
aw+0.052
+0.051
opl+0.051
 middle+0.049
Neg logit contributions
ro-0.040
emo-0.032
uzz-0.031
keeping-0.028
 cheek-0.028
Token ruined
Token ID20484
Feature activation+1.63e-02

Pos logit contributions
aw+0.003
 Other+0.003
 particularly+0.003
 middle+0.003
+0.003
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token his
Token ID465
Feature activation+1.55e-02

Pos logit contributions
ro+0.031
emo+0.027
uzz+0.026
keeping+0.025
 cheek+0.025
Neg logit contributions
aw-0.026
 Other-0.024
 middle-0.023
 particularly-0.022
opl-0.022
Token chance
Token ID2863
Feature activation+2.50e-03

Pos logit contributions
ro+0.028
uzz+0.025
emo+0.024
keeping+0.024
onna+0.022
Neg logit contributions
aw-0.025
 middle-0.024
 Other-0.024
-0.023
 particularly-0.023
Token to
Token ID284
Feature activation+1.16e-02

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
 cheek+0.004
Neg logit contributions
aw-0.004
 Other-0.004
 particularly-0.004
 middle-0.004
-0.004
Token see
Token ID766
Feature activation-2.27e-02

Pos logit contributions
ro+0.020
uzz+0.018
emo+0.017
keeping+0.017
 cheek+0.016
Neg logit contributions
aw-0.020
 Other-0.018
 middle-0.018
-0.018
 particularly-0.017
Token the
Token ID262
Feature activation+2.72e-03

Pos logit contributions
aw+0.042
 Other+0.040
+0.040
 particularly+0.039
 middle+0.038
Neg logit contributions
ro-0.036
uzz-0.030
emo-0.029
keeping-0.029
 cheek-0.027
Token surprise
Token ID5975
Feature activation-2.83e-03

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
onna+0.004
Neg logit contributions
aw-0.005
 Other-0.004
 middle-0.004
 particularly-0.004
-0.004
Token.
Token ID13
Feature activation+9.83e-04

Pos logit contributions
aw+0.005
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.005
uzz-0.004
emo-0.004
keeping-0.004
 cheek-0.004
Token He
Token ID679
Feature activation-1.34e-02

Pos logit contributions
ro+0.002
uzz+0.002
emo+0.002
keeping+0.002
 cheek+0.002
Neg logit contributions
aw-0.001
 Other-0.001
 middle-0.001
 particularly-0.001
-0.001
Token felt
Token ID2936
Feature activation-4.12e-02

Pos logit contributions
aw+0.025
 Other+0.024
+0.023
 middle+0.023
 particularly+0.023
Neg logit contributions
ro-0.022
uzz-0.018
emo-0.018
keeping-0.017
 cheek-0.016
Token Sara
Token ID24799
Feature activation+2.48e-02

Pos logit contributions
ro+0.021
keeping+0.018
emo+0.017
uzz+0.017
 cheek+0.016
Neg logit contributions
aw-0.027
 particularly-0.025
 Other-0.025
 middle-0.025
-0.023
Token and
Token ID290
Feature activation+2.68e-03

Pos logit contributions
ro+0.049
uzz+0.044
keeping+0.044
emo+0.043
onna+0.041
Neg logit contributions
aw-0.038
 particularly-0.032
 Other-0.031
 middle-0.030
-0.030
Token Ben
Token ID3932
Feature activation+8.74e-03

Pos logit contributions
ro+0.004
uzz+0.003
emo+0.003
keeping+0.003
onna+0.003
Neg logit contributions
aw-0.005
 Other-0.005
 middle-0.005
 particularly-0.005
-0.005
Token became
Token ID2627
Feature activation-4.12e-03

Pos logit contributions
ro+0.018
uzz+0.016
emo+0.015
keeping+0.015
onna+0.015
Neg logit contributions
aw-0.013
 Other-0.011
 particularly-0.011
 middle-0.011
-0.011
Token friends
Token ID2460
Feature activation-1.88e-02

Pos logit contributions
aw+0.006
 Other+0.005
 middle+0.005
 particularly+0.005
+0.005
Neg logit contributions
ro-0.009
uzz-0.008
emo-0.007
keeping-0.007
 cheek-0.007
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
aw+0.035
 Other+0.034
 particularly+0.033
 middle+0.033
+0.032
Neg logit contributions
ro-0.029
uzz-0.025
keeping-0.024
emo-0.023
 cheek-0.022
Token She
Token ID1375
Feature activation-1.06e-03

Pos logit contributions
aw+0.020
 Other+0.018
 particularly+0.018
 middle+0.017
+0.017
Neg logit contributions
ro-0.026
uzz-0.023
emo-0.022
keeping-0.021
onna-0.020
Token was
Token ID373
Feature activation-1.61e-02

Pos logit contributions
aw+0.002
 Other+0.002
 particularly+0.002
 middle+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.001
Token happy
Token ID3772
Feature activation+1.42e-02

Pos logit contributions
aw+0.030
 Other+0.029
+0.028
 middle+0.027
 particularly+0.027
Neg logit contributions
ro-0.024
uzz-0.022
emo-0.021
keeping-0.020
 cheek-0.020
Token that
Token ID326
Feature activation+1.44e-02

Pos logit contributions
ro+0.029
emo+0.025
keeping+0.025
uzz+0.024
ge+0.023
Neg logit contributions
aw-0.021
 Other-0.019
 particularly-0.018
 middle-0.018
-0.017
Token they
Token ID484
Feature activation+5.99e-04

Pos logit contributions
ro+0.023
keeping+0.020
emo+0.019
uzz+0.018
 cheek+0.018
Neg logit contributions
aw-0.027
 Other-0.025
 particularly-0.025
 middle-0.025
-0.024
Token tasted
Token ID29187
Feature activation-1.86e-02

Pos logit contributions
aw+0.003
 Other+0.003
 particularly+0.003
 middle+0.003
+0.003
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token y
Token ID331
Feature activation-5.14e-03

Pos logit contributions
aw+0.030
 Other+0.028
opl+0.027
 middle+0.027
+0.026
Neg logit contributions
ro-0.034
emo-0.029
uzz-0.028
keeping-0.027
onna-0.027
Tokenummy
Token ID13513
Feature activation-4.71e-04

Pos logit contributions
aw+0.008
 Other+0.007
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.008
keeping-0.008
 cheek-0.008
Token.
Token ID13
Feature activation-2.30e-03

Pos logit contributions
aw+0.001
 Other+0.001
 particularly+0.001
 middle+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.001
emo-0.001
keeping-0.001
 cheek-0.001
Token Finally
Token ID9461
Feature activation-2.32e-02

Pos logit contributions
aw+0.004
 Other+0.004
 middle+0.004
 particularly+0.004
+0.004
Neg logit contributions
ro-0.004
uzz-0.003
emo-0.003
keeping-0.003
 cheek-0.003
Token,
Token ID11
Feature activation-9.51e-03

Pos logit contributions
aw+0.041
 Other+0.037
opl+0.037
 particularly+0.036
 middle+0.035
Neg logit contributions
ro-0.040
uzz-0.034
emo-0.032
keeping-0.032
 cheek-0.031
Token Lucy
Token ID22162
Feature activation+2.49e-04

Pos logit contributions
aw+0.020
 Other+0.018
 particularly+0.017
 middle+0.017
opl+0.017
Neg logit contributions
ro-0.014
uzz-0.012
emo-0.011
keeping-0.011
 cheek-0.010
Token added
Token ID2087
Feature activation-2.00e-02

Pos logit contributions
ro+0.000
uzz+0.000
emo+0.000
keeping+0.000
 cheek+0.000
Neg logit contributions
aw-0.000
 Other-0.000
 particularly-0.000
 middle-0.000
-0.000
Token some
Token ID617
Feature activation-2.39e-02

Pos logit contributions
aw+0.033
 Other+0.031
opl+0.029
+0.029
 particularly+0.029
Neg logit contributions
ro-0.036
emo-0.031
uzz-0.030
keeping-0.029
onna-0.028
Token crunch
Token ID23527
Feature activation-4.31e-04

Pos logit contributions
aw+0.042
 Other+0.039
opl+0.038
+0.037
 middle+0.036
Neg logit contributions
ro-0.041
emo-0.034
uzz-0.033
onna-0.032
 cheek-0.031
Tokeny
Token ID88
Feature activation+4.33e-03

Pos logit contributions
aw+0.001
 Other+0.001
 particularly+0.001
 middle+0.001
+0.001
Neg logit contributions
ro-0.001
uzz-0.000
emo-0.000
keeping-0.000
 cheek-0.000
TokenL
Token ID43
Feature activation+1.03e-02

Pos logit contributions
aw+0.006
 Other+0.006
 particularly+0.005
 middle+0.005
+0.005
Neg logit contributions
ro-0.013
uzz-0.011
emo-0.011
keeping-0.011
 cheek-0.010
Tokenily
Token ID813
Feature activation+2.76e-02

Pos logit contributions
ro+0.020
uzz+0.017
keeping+0.017
emo+0.017
 cheek+0.016
Neg logit contributions
aw-0.017
 Other-0.015
 middle-0.015
 particularly-0.014
 course-0.014
Token loved
Token ID6151
Feature activation+1.64e-02

Pos logit contributions
ro+0.053
uzz+0.048
emo+0.046
keeping+0.046
 cheek+0.045
Neg logit contributions
aw-0.044
 Other-0.037
 particularly-0.036
 middle-0.036
opl-0.034
Token her
Token ID607
Feature activation+1.04e-03

Pos logit contributions
ro+0.031
uzz+0.026
emo+0.025
keeping+0.024
 cheek+0.023
Neg logit contributions
aw-0.027
 Other-0.025
 particularly-0.025
 middle-0.024
-0.023
Token hairy
Token ID43873
Feature activation+1.06e-02

Pos logit contributions
ro+0.002
uzz+0.001
emo+0.001
keeping+0.001
onna+0.001
Neg logit contributions
aw-0.002
 Other-0.002
 middle-0.002
 particularly-0.002
-0.002
Token kitten
Token ID42143
Feature activation-3.97e-02

Pos logit contributions
ro+0.020
uzz+0.018
emo+0.017
keeping+0.017
onna+0.016
Neg logit contributions
aw-0.017
 Other-0.015
 middle-0.015
 particularly-0.015
-0.014
Token,
Token ID11
Feature activation-1.10e-03

Pos logit contributions
aw+0.078
 Other+0.072
opl+0.071
 middle+0.070
Eventually+0.070
Neg logit contributions
ro-0.059
uzz-0.049
 cheek-0.045
emo-0.043
keeping-0.042
Token Spot
Token ID15899
Feature activation-9.09e-03

Pos logit contributions
aw+0.002
 Other+0.002
 particularly+0.002
 middle+0.002
+0.002
Neg logit contributions
ro-0.002
uzz-0.001
emo-0.001
keeping-0.001
 cheek-0.001
Token.
Token ID13
Feature activation-2.48e-02

Pos logit contributions
aw+0.016
 Other+0.015
 middle+0.014
 particularly+0.014
+0.014
Neg logit contributions
ro-0.016
uzz-0.014
emo-0.013
keeping-0.013
 cheek-0.012
Token She
Token ID1375
Feature activation+8.62e-03

Pos logit contributions
aw+0.043
 particularly+0.038
 middle+0.037
 Other+0.036
+0.036
Neg logit contributions
ro-0.046
uzz-0.038
emo-0.037
onna-0.035
keeping-0.034
Token liked
Token ID8288
Feature activation+2.64e-03

Pos logit contributions
ro+0.017
keeping+0.015
uzz+0.015
emo+0.015
 cheek+0.014
Neg logit contributions
aw-0.013
 particularly-0.012
 Other-0.011
 middle-0.011
-0.011
Token big
Token ID1263
Feature activation+3.27e-02

Pos logit contributions
ro+0.023
keeping+0.020
uzz+0.020
emo+0.020
ge+0.018
Neg logit contributions
aw-0.018
 particularly-0.017
 Other-0.017
 middle-0.017
-0.016
Token hug
Token ID16225
Feature activation-5.96e-03

Pos logit contributions
ro+0.064
keeping+0.062
emo+0.059
uzz+0.056
ge+0.053
Neg logit contributions
aw-0.046
 Other-0.046
 particularly-0.043
 course-0.043
 middle-0.043
Token.
Token ID13
Feature activation-4.91e-03

Pos logit contributions
aw+0.010
 Other+0.010
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token From
Token ID3574
Feature activation-5.46e-03

Pos logit contributions
aw+0.007
 Other+0.007
 particularly+0.006
 middle+0.006
+0.006
Neg logit contributions
ro-0.010
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token that
Token ID326
Feature activation+1.41e-02

Pos logit contributions
aw+0.006
 Other+0.006
 particularly+0.005
 middle+0.005
+0.005
Neg logit contributions
ro-0.013
uzz-0.011
emo-0.011
keeping-0.011
 cheek-0.011
Token day
Token ID1110
Feature activation-6.30e-02

Pos logit contributions
ro+0.026
keeping+0.022
uzz+0.022
emo+0.022
onna+0.020
Neg logit contributions
 particularly-0.023
aw-0.023
 Other-0.023
 middle-0.022
 course-0.021
Token on
Token ID319
Feature activation-2.67e-02

Pos logit contributions
aw+0.102
 pasture+0.091
 Other+0.089
opl+0.088
ns+0.087
Neg logit contributions
ro-0.103
uzz-0.095
 moment-0.089
 seek-0.085
 cheek-0.085
Token,
Token ID11
Feature activation-1.65e-02

Pos logit contributions
aw+0.048
 Other+0.043
 particularly+0.041
 middle+0.041
opl+0.040
Neg logit contributions
ro-0.047
uzz-0.039
keeping-0.038
emo-0.037
 seek-0.035
Token Sarah
Token ID10490
Feature activation-1.31e-02

Pos logit contributions
aw+0.034
 Other+0.030
 middle+0.030
 particularly+0.029
+0.029
Neg logit contributions
ro-0.025
uzz-0.020
keeping-0.019
emo-0.019
 cheek-0.018
Token hugged
Token ID41130
Feature activation-6.55e-03

Pos logit contributions
aw+0.020
 Other+0.018
 middle+0.017
 particularly+0.017
+0.017
Neg logit contributions
ro-0.026
uzz-0.023
emo-0.022
keeping-0.022
onna-0.021
Token her
Token ID607
Feature activation+6.85e-03

Pos logit contributions
aw+0.012
 Other+0.011
 middle+0.011
 particularly+0.011
+0.010
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token machine
Token ID4572
Feature activation-4.86e-03

Pos logit contributions
aw+0.018
 Other+0.017
 particularly+0.016
 middle+0.016
+0.015
Neg logit contributions
ro-0.018
emo-0.015
uzz-0.015
keeping-0.015
 cheek-0.014
Token work
Token ID670
Feature activation+1.57e-02

Pos logit contributions
aw+0.008
 Other+0.008
 middle+0.007
 particularly+0.007
+0.007
Neg logit contributions
ro-0.009
uzz-0.007
emo-0.007
keeping-0.007
 cheek-0.007
Token.
Token ID13
Feature activation-2.45e-03

Pos logit contributions
ro+0.030
uzz+0.027
emo+0.027
keeping+0.026
 cheek+0.024
Neg logit contributions
aw-0.024
 Other-0.021
 particularly-0.021
 middle-0.021
-0.020
Token From
Token ID3574
Feature activation-6.48e-03

Pos logit contributions
aw+0.004
 Other+0.003
 middle+0.003
 particularly+0.003
+0.003
Neg logit contributions
ro-0.005
uzz-0.004
emo-0.004
keeping-0.004
 cheek-0.004
Token that
Token ID326
Feature activation+2.34e-03

Pos logit contributions
aw+0.008
 Other+0.007
 particularly+0.006
 middle+0.006
+0.006
Neg logit contributions
ro-0.015
uzz-0.013
emo-0.013
keeping-0.013
 cheek-0.012
Token day
Token ID1110
Feature activation-6.80e-02

Pos logit contributions
ro+0.004
uzz+0.003
keeping+0.003
emo+0.003
onna+0.003
Neg logit contributions
aw-0.004
 Other-0.004
 particularly-0.004
 middle-0.004
-0.004
Token on
Token ID319
Feature activation-2.35e-02

Pos logit contributions
aw+0.127
 pasture+0.122
 Other+0.116
ns+0.114
 particularly+0.112
Neg logit contributions
ro-0.082
uzz-0.076
 moment-0.070
 cheek-0.069
emo-0.069
Token,
Token ID11
Feature activation-2.27e-02

Pos logit contributions
aw+0.044
 Other+0.039
 particularly+0.038
ifying+0.037
 middle+0.037
Neg logit contributions
ro-0.040
emo-0.033
uzz-0.032
keeping-0.031
onna-0.030
Token Tim
Token ID5045
Feature activation-4.29e-03

Pos logit contributions
aw+0.058
ifying+0.052
 Other+0.051
 middle+0.051
 particularly+0.050
Neg logit contributions
ro-0.022
emo-0.018
uzz-0.016
keeping-0.013
 textures-0.013
Tokenmy
Token ID1820
Feature activation-1.56e-02

Pos logit contributions
aw+0.007
 Other+0.007
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.008
uzz-0.007
emo-0.006
keeping-0.006
 cheek-0.006
Token loved
Token ID6151
Feature activation+3.71e-03

Pos logit contributions
aw+0.025
 Other+0.023
 middle+0.022
+0.022
 particularly+0.021
Neg logit contributions
ro-0.030
emo-0.025
uzz-0.025
keeping-0.024
 cheek-0.023
Token her
Token ID607
Feature activation-1.27e-02

Pos logit contributions
aw+0.035
 Other+0.032
opl+0.030
 middle+0.029
ifying+0.029
Neg logit contributions
ro-0.030
uzz-0.026
emo-0.025
keeping-0.024
 cheek-0.023
Token hive
Token ID35881
Feature activation-4.02e-02

Pos logit contributions
aw+0.023
 Other+0.021
 particularly+0.020
 middle+0.020
+0.019
Neg logit contributions
ro-0.022
uzz-0.019
emo-0.018
keeping-0.018
 cheek-0.017
Token blew
Token ID17948
Feature activation+9.45e-03

Pos logit contributions
 Other+0.073
aw+0.072
opl+0.069
+0.066
pieces+0.066
Neg logit contributions
ro-0.068
 cheek-0.050
keeping-0.049
emo-0.048
 seek-0.048
Token away
Token ID1497
Feature activation-5.82e-03

Pos logit contributions
ro+0.019
uzz+0.017
emo+0.017
keeping+0.017
onna+0.016
Neg logit contributions
aw-0.014
 Other-0.013
 middle-0.013
 particularly-0.013
 forward-0.011
Token in
Token ID287
Feature activation+2.29e-02

Pos logit contributions
aw+0.010
 Other+0.009
 particularly+0.009
 middle+0.009
+0.009
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token the
Token ID262
Feature activation-5.30e-02

Pos logit contributions
ro+0.048
emo+0.043
uzz+0.041
 cheek+0.039
keeping+0.039
Neg logit contributions
 Other-0.030
 middle-0.030
aw-0.030
 particularly-0.029
-0.028
Token wind
Token ID2344
Feature activation+1.23e-02

Pos logit contributions
+0.103
aw+0.100
opl+0.099
 Other+0.097
pieces+0.095
Neg logit contributions
ro-0.070
 cheek-0.064
 moment-0.056
 textures-0.054
keeping-0.052
Token.
Token ID13
Feature activation+2.51e-03

Pos logit contributions
ro+0.023
uzz+0.021
emo+0.021
keeping+0.020
 cheek+0.018
Neg logit contributions
aw-0.019
 Other-0.017
 particularly-0.017
 middle-0.017
-0.016
Token
Token ID198
Feature activation-3.29e-03

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
 cheek+0.004
Neg logit contributions
aw-0.004
 Other-0.004
 particularly-0.003
 middle-0.003
-0.003
Token
Token ID198
Feature activation+2.21e-03

Pos logit contributions
aw+0.005
 Other+0.005
 middle+0.005
 particularly+0.005
+0.004
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.005
 cheek-0.005
TokenB
Token ID33
Feature activation+5.16e-02

Pos logit contributions
ro+0.004
uzz+0.003
keeping+0.003
emo+0.003
 cheek+0.003
Neg logit contributions
aw-0.004
 Other-0.004
 particularly-0.004
 middle-0.004
-0.004
Token,
Token ID11
Feature activation-3.50e-03

Pos logit contributions
aw+0.010
 Other+0.009
 particularly+0.009
 middle+0.009
+0.008
Neg logit contributions
ro-0.011
uzz-0.010
emo-0.009
keeping-0.009
 cheek-0.009
Token it
Token ID340
Feature activation+1.46e-03

Pos logit contributions
aw+0.006
 Other+0.006
 middle+0.006
 particularly+0.006
+0.006
Neg logit contributions
ro-0.006
uzz-0.005
emo-0.005
keeping-0.005
 cheek-0.004
Token caused
Token ID4073
Feature activation+1.76e-02

Pos logit contributions
ro+0.003
uzz+0.002
emo+0.002
keeping+0.002
 cheek+0.002
Neg logit contributions
aw-0.002
 Other-0.002
 particularly-0.002
 middle-0.002
-0.002
Token the
Token ID262
Feature activation+2.02e-03

Pos logit contributions
ro+0.035
emo+0.031
uzz+0.031
 cheek+0.029
keeping+0.029
Neg logit contributions
aw-0.027
 Other-0.025
 middle-0.023
 particularly-0.023
-0.022
Token ash
Token ID12530
Feature activation-2.67e-02

Pos logit contributions
ro+0.004
uzz+0.003
emo+0.003
keeping+0.003
onna+0.003
Neg logit contributions
aw-0.004
 Other-0.003
 middle-0.003
 particularly-0.003
-0.003
Tokentr
Token ID2213
Feature activation-6.05e-02

Pos logit contributions
aw+0.047
 Other+0.045
 particularly+0.042
+0.041
 middle+0.041
Neg logit contributions
ro-0.048
uzz-0.038
keeping-0.038
 cheek-0.037
emo-0.036
Tokenay
Token ID323
Feature activation+2.45e-03

Pos logit contributions
aw+0.079
 pasture+0.077
 particularly+0.077
 Other+0.070
 middle+0.067
Neg logit contributions
ro-0.124
uzz-0.111
onna-0.107
 cheek-0.103
keeping-0.098
Token to
Token ID284
Feature activation+1.68e-02

Pos logit contributions
ro+0.005
uzz+0.004
emo+0.004
keeping+0.004
 cheek+0.004
Neg logit contributions
aw-0.004
 Other-0.003
 particularly-0.003
 middle-0.003
-0.003
Token scatter
Token ID41058
Feature activation+5.53e-03

Pos logit contributions
ro+0.031
uzz+0.028
emo+0.027
keeping+0.026
 cheek+0.025
Neg logit contributions
aw-0.027
 middle-0.025
 Other-0.025
-0.024
 particularly-0.023
Token all
Token ID477
Feature activation+1.76e-02

Pos logit contributions
ro+0.011
uzz+0.009
emo+0.009
keeping+0.009
 cheek+0.009
Neg logit contributions
aw-0.009
 Other-0.008
 middle-0.008
 particularly-0.008
-0.007
Token over
Token ID625
Feature activation+6.86e-04

Pos logit contributions
ro+0.036
uzz+0.032
emo+0.032
keeping+0.030
 cheek+0.029
Neg logit contributions
aw-0.026
 Other-0.023
 middle-0.023
 particularly-0.022
 course-0.022
Tokenber
Token ID527
Feature activation-2.32e-02

Pos logit contributions
 Other+0.083
aw+0.082
+0.080
opl+0.079
 middle+0.079
Neg logit contributions
ro-0.044
uzz-0.035
emo-0.029
 cheek-0.027
onna-0.027
Token.
Token ID13
Feature activation+3.50e-03

Pos logit contributions
aw+0.041
 Other+0.039
 middle+0.037
+0.037
 particularly+0.036
Neg logit contributions
ro-0.041
uzz-0.034
keeping-0.031
emo-0.031
 cheek-0.030
Token He
Token ID679
Feature activation-5.97e-03

Pos logit contributions
ro+0.008
uzz+0.007
keeping+0.007
emo+0.007
 cheek+0.007
Neg logit contributions
aw-0.004
 Other-0.004
 middle-0.003
 particularly-0.003
-0.003
Token worked
Token ID3111
Feature activation-1.99e-02

Pos logit contributions
aw+0.010
 Other+0.010
 middle+0.009
 particularly+0.009
+0.009
Neg logit contributions
ro-0.011
uzz-0.009
emo-0.009
keeping-0.009
 cheek-0.008
Token in
Token ID287
Feature activation+6.03e-03

Pos logit contributions
aw+0.036
 Other+0.034
+0.032
 middle+0.032
opl+0.031
Neg logit contributions
ro-0.034
uzz-0.028
emo-0.028
keeping-0.026
onna-0.026
Token a
Token ID257
Feature activation-5.31e-02

Pos logit contributions
ro+0.012
uzz+0.010
emo+0.010
keeping+0.010
 cheek+0.009
Neg logit contributions
aw-0.009
 Other-0.009
 middle-0.008
 particularly-0.008
-0.008
Token small
Token ID1402
Feature activation-2.97e-02

Pos logit contributions
aw+0.105
ement+0.099
opl+0.097
 Other+0.097
+0.096
Neg logit contributions
ro-0.077
 cheek-0.060
 moment-0.056
 seek-0.056
onna-0.055
Token shop
Token ID6128
Feature activation-1.02e-03

Pos logit contributions
aw+0.055
 Other+0.051
+0.051
ifying+0.049
 particularly+0.049
Neg logit contributions
ro-0.049
keeping-0.037
uzz-0.036
 cheek-0.035
emo-0.034
Token with
Token ID351
Feature activation-2.77e-02

Pos logit contributions
aw+0.002
 Other+0.002
 middle+0.002
 particularly+0.002
+0.001
Neg logit contributions
ro-0.002
uzz-0.002
emo-0.002
keeping-0.002
 cheek-0.001
Token a
Token ID257
Feature activation-3.71e-02

Pos logit contributions
aw+0.049
+0.046
 Other+0.045
 particularly+0.044
opl+0.042
Neg logit contributions
ro-0.046
uzz-0.039
emo-0.038
keeping-0.037
 textures-0.035
Token big
Token ID1263
Feature activation-2.82e-02

Pos logit contributions
aw+0.070
 Other+0.066
opl+0.061
+0.059
 particularly+0.058
Neg logit contributions
ro-0.061
 cheek-0.050
emo-0.046
uzz-0.046
onna-0.046