TOP ACTIVATIONS
MAX = 5.827

the garden , having lots of fun . <|endoftext|> Once upon
saw dark clouds and flashes of lightning . They were scared
different roles and having lots of fun . Eventually
. They were having lots of fun together .
. They were having lots of fun ! <|endoftext|> Once upon
the sky , having lots of fun . One day ,
her brother were having lots of fun with the animals .
the park , having lots of fun . Suddenly , they
park and was having lots of fun . Suddenly
park together , having lots of fun . One day ,
loud noises and saw flashes of light . They were glad
the forest , having lots of fun . As
soon they were having lots of fun . After a while
. He was having lots of fun , and was almost
park and had a lot of fun . They swung and
. Tom was having lots of fun , playing with the
. He felt a lot of pain and blood . He
. They were having lots of fun . Then
. They were having lots of fun until Lily 's friend
dark clouds and the flashes of light . He saw the

MIN ACTIVATIONS
MIN = -5.449

playing with his magnet in Cole 's room when he accidentally
. You can call me Crystal . â
long time . They put True in a big box with
the way she solved the math equations .
was able to solve the math problems and even started to
was a silly cat called Edward . Edward loved to sleep
and Vince helped to wrap Luke up in a blanket .
would fill her notebook with maths problems and try to find
Mystery of the Bur ied Bone ". She showed it to
should remember to stay close by ," she said .
but they had to sell True . She said she would
would fill her room with maths books . She would add
a plan . She took Justin to the store and bought
the festival . He played tug - of - war and
but you have to give Coco a chance . He is
there was an elephant named Edward . Edward was very troubled
elt down and solved a math problem in her head .
the cheese and add the topp ings . Lily and Ben
garden trying to solve the maths problems . In no time
, Sara 's mom took True to a yard sale .

INTERVAL 3.008 - 5.827
CONTAINS 0.219%

cl apping , silly ," Mom said . She made John
. They had a lot of fun . One
glad that she had spent the day exploring the mud ,
was foolish ." Mom smiled and kissed his cheek
park . It was a sunny day and it seemed like

INTERVAL 0.189 - 3.008
CONTAINS 40.456%

They go to play again . They are happy .
, little worm . I 'm happy to help ." <|endoftext|>
" OK , but be careful ," Lily said .
the ball and Rex catches it . They have fun .
She told her that it 's okay to feel sad ,

INTERVAL -2.630 - 0.189
CONTAINS 59.143%

bumped into the table , where the dessert was . The
. " Why are you fighting ?" " L
back to her room to finish cleaning . Even though she
They drew many pictures together with the ink . And they
inside . " Let 's get the box !"

INTERVAL -5.449 - -2.630
CONTAINS 0.181%

said . " We can look for cheese in the kitchen
pepper and said , " Hello , Pepper oni . You
. They swung back and forth . They laughed and talked
. He swung the stick again and again . He made
that he used to clean himself . One day while he
Token the
Token ID262
Feature activation+2.28e+00

Pos logit contributions
 Kim+0.005
 Su+0.004
 Johnny+0.004
 cho+0.004
 Liz+0.004
Neg logit contributions
 conclusions-0.008
-0.008
ede-0.008
',"-0.008
growth-0.008
Token garden
Token ID11376
Feature activation+5.06e-01

Pos logit contributions
 Kim+0.051
 Su+0.046
 cho+0.045
 Johnny+0.044
Kim+0.043
Neg logit contributions
 conclusions-0.057
growth-0.052
-0.051
ede-0.051
',"-0.051
Token,
Token ID11
Feature activation-7.08e-01

Pos logit contributions
 Kim+0.015
 Su+0.014
 Johnny+0.014
Kim+0.014
 cho+0.014
Neg logit contributions
 conclusions-0.008
ede-0.008
-0.007
growth-0.007
',"-0.007
Token having
Token ID1719
Feature activation+5.05e-01

Pos logit contributions
ede+0.013
 vegetable+0.013
 conclusions+0.013
 mushrooms+0.012
+0.012
Neg logit contributions
 Kim-0.018
ug-0.017
Kim-0.017
 Su-0.016
rol-0.016
Token lots
Token ID6041
Feature activation+9.39e-01

Pos logit contributions
 Kim+0.013
 Su+0.011
ug+0.011
 cho+0.011
Kim+0.011
Neg logit contributions
 conclusions-0.011
growth-0.010
ede-0.010
 mushrooms-0.010
-0.009
Token of
Token ID286
Feature activation+5.83e+00

Pos logit contributions
 Kim+0.018
 Johnny+0.014
 Su+0.014
Kim+0.014
 Liz+0.013
Neg logit contributions
 conclusions-0.028
 mushrooms-0.026
ede-0.025
growth-0.025
 moral-0.025
Token fun
Token ID1257
Feature activation+1.13e+00

Pos logit contributions
 Kim+0.103
 Su+0.089
 cho+0.087
 Johnny+0.086
ug+0.081
Neg logit contributions
 conclusions-0.175
ede-0.162
growth-0.160
-0.157
',"-0.155
Token.
Token ID13
Feature activation+8.33e-01

Pos logit contributions
 Kim+0.018
 driving+0.018
 cho+0.017
ug+0.017
 Su+0.017
Neg logit contributions
 conclusions-0.032
',"-0.031
growth-0.031
-0.030
 tripod-0.030
Token<|endoftext|>
Token ID50256
Feature activation+9.49e-01

Pos logit contributions
 Kim+0.020
 Su+0.017
 Johnny+0.017
 cho+0.017
 Tim+0.016
Neg logit contributions
 conclusions-0.022
ede-0.020
growth-0.019
',"-0.019
-0.019
TokenOnce
Token ID7454
Feature activation+5.25e-02

Pos logit contributions
 Kim+0.023
 Su+0.020
 Johnny+0.020
 cho+0.020
Kim+0.019
Neg logit contributions
 conclusions-0.023
ede-0.021
-0.021
growth-0.020
',"-0.020
Token upon
Token ID2402
Feature activation+1.19e+00

Pos logit contributions
 Kim+0.001
 Su+0.001
 cho+0.001
 Johnny+0.001
Kim+0.001
Neg logit contributions
 conclusions-0.001
ede-0.001
growth-0.001
-0.001
',"-0.001
Token saw
Token ID2497
Feature activation+1.12e+00

Pos logit contributions
 Kim+0.015
 Su+0.014
 cho+0.014
 Johnny+0.013
ug+0.013
Neg logit contributions
 conclusions-0.015
growth-0.014
ede-0.014
-0.013
',"-0.013
Token dark
Token ID3223
Feature activation+2.96e+00

Pos logit contributions
 Kim+0.026
coat+0.025
 Su+0.024
Kim+0.024
ug+0.024
Neg logit contributions
 conclusions-0.025
growth-0.023
 mushrooms-0.021
',"-0.020
 vegetable-0.020
Token clouds
Token ID15114
Feature activation+2.90e+00

Pos logit contributions
 Kim+0.066
 Su+0.060
 Johnny+0.057
coat+0.056
Kim+0.056
Neg logit contributions
 conclusions-0.072
growth-0.068
-0.063
',"-0.061
ede-0.060
Token and
Token ID290
Feature activation+1.56e+00

Pos logit contributions
 Kim+0.085
 Su+0.078
Kim+0.077
 Johnny+0.076
 Liz+0.073
Neg logit contributions
 conclusions-0.050
ede-0.045
growth-0.043
-0.043
 closely-0.042
Token flashes
Token ID29227
Feature activation+2.54e+00

Pos logit contributions
 Kim+0.030
 Su+0.027
ug+0.027
 Johnny+0.026
 cho+0.025
Neg logit contributions
 conclusions-0.044
growth-0.040
ede-0.038
-0.037
 mushrooms-0.037
Token of
Token ID286
Feature activation+5.73e+00

Pos logit contributions
 Kim+0.076
 Su+0.071
coat+0.070
Kim+0.070
 Johnny+0.069
Neg logit contributions
 conclusions-0.041
growth-0.036
 closely-0.032
 mushrooms-0.032
ils-0.031
Token lightning
Token ID14357
Feature activation+2.47e+00

Pos logit contributions
 Kim+0.121
coat+0.114
ug+0.112
 Su+0.111
icking+0.110
Neg logit contributions
 conclusions-0.150
growth-0.134
 mushrooms-0.125
 moral-0.117
 vegetable-0.117
Token.
Token ID13
Feature activation-2.53e-01

Pos logit contributions
 Kim+0.076
 Su+0.071
 Johnny+0.068
Kim+0.066
 Liz+0.065
Neg logit contributions
 conclusions-0.041
growth-0.037
ede-0.037
-0.035
',"-0.034
Token They
Token ID1119
Feature activation+2.47e-01

Pos logit contributions
 conclusions+0.006
ede+0.006
growth+0.006
+0.005
',"+0.005
Neg logit contributions
 Kim-0.006
 Su-0.005
 cho-0.005
 Johnny-0.005
Kim-0.005
Token were
Token ID547
Feature activation+7.44e-02

Pos logit contributions
 Kim+0.006
 cho+0.006
 Su+0.006
ug+0.006
coat+0.006
Neg logit contributions
 conclusions-0.005
growth-0.005
ede-0.004
-0.004
',"-0.004
Token scared
Token ID12008
Feature activation+9.25e-01

Pos logit contributions
 Kim+0.002
Kim+0.002
 Su+0.002
coat+0.002
Johnny+0.002
Neg logit contributions
 conclusions-0.001
 closely-0.001
 moral-0.001
 mushrooms-0.001
 wrists-0.001
Token different
Token ID1180
Feature activation+9.95e-01

Pos logit contributions
 Kim+0.019
 Su+0.017
 cho+0.016
 Johnny+0.016
Kim+0.015
Neg logit contributions
 conclusions-0.019
growth-0.017
ede-0.017
-0.017
',"-0.017
Token roles
Token ID9176
Feature activation-2.38e+00

Pos logit contributions
 Kim+0.026
 Su+0.024
 cho+0.023
 Johnny+0.023
Kim+0.022
Neg logit contributions
 conclusions-0.022
growth-0.019
ede-0.019
-0.019
',"-0.019
Token and
Token ID290
Feature activation+9.66e-01

Pos logit contributions
 conclusions+0.046
growth+0.042
ede+0.042
+0.040
',"+0.039
Neg logit contributions
 Kim-0.067
 Su-0.062
Kim-0.059
 Johnny-0.058
 cho-0.058
Token having
Token ID1719
Feature activation+9.11e-01

Pos logit contributions
 Kim+0.021
 Su+0.020
ug+0.020
coat+0.020
 Johnny+0.019
Neg logit contributions
 conclusions-0.022
growth-0.020
 mushrooms-0.020
ede-0.019
ils-0.019
Token lots
Token ID6041
Feature activation+1.52e+00

Pos logit contributions
 Kim+0.023
ug+0.021
icking+0.020
 Su+0.020
Kim+0.020
Neg logit contributions
 conclusions-0.020
growth-0.019
 mushrooms-0.017
ede-0.017
cious-0.016
Token of
Token ID286
Feature activation+5.68e+00

Pos logit contributions
 Kim+0.029
 Su+0.025
Kim+0.024
 Johnny+0.024
icking+0.023
Neg logit contributions
 conclusions-0.043
 mushrooms-0.040
ils-0.038
growth-0.038
ede-0.037
Token fun
Token ID1257
Feature activation+2.04e+00

Pos logit contributions
 Kim+0.115
 Su+0.105
 Johnny+0.101
 cho+0.099
ug+0.097
Neg logit contributions
 conclusions-0.154
ede-0.138
growth-0.137
 mushrooms-0.136
',"-0.133
Token.
Token ID13
Feature activation+2.74e-01

Pos logit contributions
 Kim+0.045
 driving+0.041
 cho+0.040
 Su+0.040
 speeding+0.039
Neg logit contributions
 conclusions-0.051
-0.047
growth-0.047
ede-0.046
',"-0.045
Token
Token ID198
Feature activation+8.99e-01

Pos logit contributions
 Kim+0.007
 Su+0.006
 Johnny+0.006
 cho+0.006
Kim+0.006
Neg logit contributions
 conclusions-0.006
ede-0.006
growth-0.006
-0.006
',"-0.006
Token
Token ID198
Feature activation-1.45e-01

Pos logit contributions
 Kim+0.023
 Su+0.021
 cho+0.020
 Johnny+0.020
Kim+0.020
Neg logit contributions
 conclusions-0.020
ede-0.018
growth-0.018
-0.018
',"-0.017
TokenEventually
Token ID28724
Feature activation+4.95e-01

Pos logit contributions
 conclusions+0.004
ede+0.004
growth+0.004
+0.004
',"+0.004
Neg logit contributions
 Kim-0.003
 Su-0.003
 cho-0.002
 Johnny-0.002
Kim-0.002
Token.
Token ID13
Feature activation-3.76e-01

Pos logit contributions
 Kim+0.016
 Su+0.014
Kim+0.014
 Johnny+0.014
ug+0.014
Neg logit contributions
 conclusions-0.012
ede-0.011
-0.010
',"-0.010
'."-0.010
Token They
Token ID1119
Feature activation-9.73e-04

Pos logit contributions
 conclusions+0.007
growth+0.007
+0.007
',"+0.006
'."+0.006
Neg logit contributions
 Kim-0.010
 cho-0.010
ug-0.009
 Su-0.009
coat-0.009
Token were
Token ID547
Feature activation-3.74e-01

Pos logit contributions
 conclusions+0.000
ede+0.000
growth+0.000
ils+0.000
+0.000
Neg logit contributions
 Kim-0.000
 Su-0.000
Kim-0.000
 cho-0.000
reat-0.000
Token having
Token ID1719
Feature activation+3.90e-01

Pos logit contributions
 wrists+0.005
cious+0.005
ils+0.005
 closely+0.004
 Fran+0.004
Neg logit contributions
loo-0.011
OM-0.011
Kim-0.010
EEP-0.010
elt-0.010
Token lots
Token ID6041
Feature activation+1.17e+00

Pos logit contributions
 Kim+0.010
Kim+0.009
ug+0.009
reat+0.009
coat+0.008
Neg logit contributions
 conclusions-0.008
growth-0.007
 mushrooms-0.007
ils-0.007
',"-0.007
Token of
Token ID286
Feature activation+5.59e+00

Pos logit contributions
 Kim+0.022
Kim+0.018
 Johnny+0.018
 Su+0.018
 Liz+0.017
Neg logit contributions
 conclusions-0.033
leness-0.030
ede-0.030
 mushrooms-0.030
ils-0.030
Token fun
Token ID1257
Feature activation+1.59e+00

Pos logit contributions
 Kim+0.093
 Su+0.080
 cho+0.077
 Johnny+0.076
Kim+0.072
Neg logit contributions
 conclusions-0.173
ede-0.160
growth-0.160
-0.158
',"-0.156
Token together
Token ID1978
Feature activation-6.42e-01

Pos logit contributions
 Kim+0.031
 Su+0.028
 driving+0.028
 cho+0.028
 speeding+0.027
Neg logit contributions
 conclusions-0.044
growth-0.041
-0.040
ede-0.039
',"-0.039
Token.
Token ID13
Feature activation-3.68e-01

Pos logit contributions
 conclusions+0.015
growth+0.013
ede+0.013
+0.013
',"+0.013
Neg logit contributions
 Kim-0.016
 Su-0.014
 cho-0.014
 Johnny-0.014
 driving-0.013
Token
Token ID220
Feature activation+1.27e+00

Pos logit contributions
 conclusions+0.008
growth+0.007
ede+0.007
+0.007
',"+0.007
Neg logit contributions
 Kim-0.010
 Su-0.009
 cho-0.009
 Johnny-0.009
Kim-0.008
Token
Token ID198
Feature activation+1.33e-01

Pos logit contributions
 cho+0.041
 Kim+0.040
coat+0.040
 Su+0.038
ug+0.038
Neg logit contributions
',"-0.014
 conclusions-0.014
'."-0.013
-0.013
growth-0.013
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 conclusions+0.029
ede+0.028
',"+0.027
+0.027
growth+0.027
Neg logit contributions
 Kim-0.046
 cho-0.044
Kim-0.043
ug-0.043
 Su-0.042
Token They
Token ID1119
Feature activation+7.38e-01

Pos logit contributions
 conclusions+0.000
growth+0.000
+0.000
ede+0.000
',"+0.000
Neg logit contributions
 Kim-0.000
 cho-0.000
 Su-0.000
ug-0.000
 driving-0.000
Token were
Token ID547
Feature activation-8.26e-02

Pos logit contributions
 Kim+0.020
reat+0.019
ug+0.019
Kim+0.019
 cho+0.018
Neg logit contributions
ede-0.012
 conclusions-0.011
 wrists-0.011
ils-0.010
growth-0.010
Token having
Token ID1719
Feature activation+7.38e-01

Pos logit contributions
 wrists+0.001
 closely+0.001
cious+0.001
pieces+0.001
 Fran+0.001
Neg logit contributions
ancer-0.003
ug-0.003
loo-0.003
EEP-0.002
OM-0.002
Token lots
Token ID6041
Feature activation+1.42e+00

Pos logit contributions
 Kim+0.018
ug+0.017
Kim+0.017
reat+0.016
icking+0.016
Neg logit contributions
 conclusions-0.015
growth-0.013
 mushrooms-0.013
ede-0.013
',"-0.013
Token of
Token ID286
Feature activation+5.54e+00

Pos logit contributions
 Kim+0.028
Kim+0.023
 Johnny+0.023
 Liz+0.022
 Su+0.022
Neg logit contributions
 conclusions-0.039
ede-0.034
leness-0.034
 mushrooms-0.034
growth-0.033
Token fun
Token ID1257
Feature activation+1.81e+00

Pos logit contributions
 Kim+0.103
 cho+0.090
 Su+0.089
 Johnny+0.088
ug+0.086
Neg logit contributions
 conclusions-0.159
ede-0.146
growth-0.141
',"-0.141
-0.140
Token!
Token ID0
Feature activation-5.37e-01

Pos logit contributions
 Kim+0.040
 Su+0.035
 cho+0.034
 Johnny+0.034
 driving+0.032
Neg logit contributions
 conclusions-0.047
growth-0.042
ede-0.042
-0.042
',"-0.041
Token<|endoftext|>
Token ID50256
Feature activation+1.11e+00

Pos logit contributions
 conclusions+0.013
ede+0.012
+0.011
growth+0.011
',"+0.011
Neg logit contributions
 Kim-0.014
 Su-0.012
 Johnny-0.012
 Liz-0.011
 cho-0.011
TokenOnce
Token ID7454
Feature activation+7.68e-01

Pos logit contributions
 Kim+0.032
 Su+0.029
 cho+0.029
 Johnny+0.028
ug+0.027
Neg logit contributions
 conclusions-0.021
growth-0.019
-0.018
ede-0.018
',"-0.018
Token upon
Token ID2402
Feature activation+1.01e+00

Pos logit contributions
 Kim+0.015
 cho+0.013
 Su+0.013
Kim+0.012
 speeding+0.012
Neg logit contributions
 conclusions-0.021
ede-0.019
-0.019
growth-0.019
',"-0.019
Token the
Token ID262
Feature activation+2.58e+00

Pos logit contributions
 Kim+0.007
 Su+0.006
 cho+0.006
 Johnny+0.006
ug+0.006
Neg logit contributions
 conclusions-0.009
growth-0.008
ede-0.008
-0.008
',"-0.008
Token sky
Token ID6766
Feature activation+7.08e-01

Pos logit contributions
 Kim+0.065
 Su+0.058
 Johnny+0.058
 Liz+0.056
ug+0.055
Neg logit contributions
 conclusions-0.059
growth-0.053
-0.053
ede-0.052
',"-0.051
Token,
Token ID11
Feature activation-1.48e-01

Pos logit contributions
 Kim+0.021
Kim+0.020
 Su+0.020
 Johnny+0.019
Johnny+0.018
Neg logit contributions
 conclusions-0.012
ede-0.011
growth-0.011
-0.010
',"-0.010
Token having
Token ID1719
Feature activation+8.42e-01

Pos logit contributions
 conclusions+0.003
 closely+0.003
ede+0.003
 mushrooms+0.003
 vegetable+0.003
Neg logit contributions
 Kim-0.004
Kim-0.004
 Johnny-0.004
ug-0.004
 Su-0.004
Token lots
Token ID6041
Feature activation+1.40e+00

Pos logit contributions
 Kim+0.021
ug+0.019
Kim+0.019
icking+0.019
 Johnny+0.019
Neg logit contributions
 conclusions-0.018
growth-0.017
 mushrooms-0.017
ede-0.015
cious-0.015
Token of
Token ID286
Feature activation+5.52e+00

Pos logit contributions
 Kim+0.031
 Johnny+0.028
Kim+0.027
 Matt+0.026
 Su+0.026
Neg logit contributions
 conclusions-0.037
 mushrooms-0.034
growth-0.032
 moral-0.031
leness-0.031
Token fun
Token ID1257
Feature activation+1.85e+00

Pos logit contributions
 Kim+0.117
 Johnny+0.108
 Su+0.105
Kim+0.099
 cho+0.098
Neg logit contributions
 conclusions-0.144
 mushrooms-0.129
ede-0.128
growth-0.127
',"-0.126
Token.
Token ID13
Feature activation-9.17e-01

Pos logit contributions
 Kim+0.042
 Su+0.038
 cho+0.038
 driving+0.037
ug+0.036
Neg logit contributions
 conclusions-0.045
growth-0.042
-0.042
ede-0.041
',"-0.041
Token One
Token ID1881
Feature activation+6.91e-01

Pos logit contributions
 conclusions+0.019
ede+0.018
ils+0.017
growth+0.016
+0.016
Neg logit contributions
 Kim-0.025
 Su-0.023
 Johnny-0.022
 Tim-0.022
 cho-0.021
Token day
Token ID1110
Feature activation-1.73e+00

Pos logit contributions
 Kim+0.017
rol+0.016
 Johnny+0.016
 cho+0.015
 Su+0.015
Neg logit contributions
 conclusions-0.014
 moral-0.014
 vegetable-0.014
ede-0.013
growth-0.013
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
 conclusions+0.039
+0.036
growth+0.035
ede+0.035
',"+0.035
Neg logit contributions
 Kim-0.043
 Su-0.038
Kim-0.037
 cho-0.037
 Johnny-0.037
Token her
Token ID607
Feature activation-2.96e-01

Pos logit contributions
 conclusions+0.033
ede+0.030
growth+0.030
+0.030
',"+0.029
Neg logit contributions
 Kim-0.032
 Su-0.029
 Johnny-0.028
 cho-0.027
 driving-0.027
Token brother
Token ID3956
Feature activation-5.97e-02

Pos logit contributions
 conclusions+0.008
ede+0.007
growth+0.007
+0.007
',"+0.007
Neg logit contributions
 Kim-0.006
 Su-0.005
 cho-0.005
 Johnny-0.005
 driving-0.005
Token were
Token ID547
Feature activation-2.64e-01

Pos logit contributions
growth+0.001
'+0.001
ils+0.001
cious+0.001
 conclusions+0.001
Neg logit contributions
 Kim-0.002
ug-0.002
reat-0.002
 cho-0.001
iring-0.001
Token having
Token ID1719
Feature activation+1.03e+00

Pos logit contributions
cious+0.005
'+0.005
 Fran+0.005
 closely+0.004
ils+0.004
Neg logit contributions
 Kim-0.006
Kim-0.006
EEP-0.006
 Liz-0.006
reat-0.006
Token lots
Token ID6041
Feature activation+1.61e+00

Pos logit contributions
 Kim+0.025
ug+0.022
Kim+0.022
 Su+0.022
coat+0.021
Neg logit contributions
 conclusions-0.023
growth-0.021
 mushrooms-0.020
ede-0.020
ils-0.019
Token of
Token ID286
Feature activation+5.49e+00

Pos logit contributions
 Kim+0.026
 Su+0.021
 Johnny+0.021
Kim+0.021
 cho+0.020
Neg logit contributions
 conclusions-0.050
 mushrooms-0.046
ede-0.046
growth-0.046
ils-0.044
Token fun
Token ID1257
Feature activation+2.20e+00

Pos logit contributions
 Kim+0.086
 Su+0.072
 cho+0.069
 Johnny+0.067
 driving+0.064
Neg logit contributions
 conclusions-0.178
growth-0.166
-0.165
ede-0.165
',"-0.161
Token with
Token ID351
Feature activation+5.92e-01

Pos logit contributions
 Kim+0.044
 Su+0.040
 driving+0.039
 cho+0.038
 speeding+0.038
Neg logit contributions
 conclusions-0.061
growth-0.056
-0.055
',"-0.055
ede-0.055
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
 Kim+0.014
 Su+0.013
 cho+0.012
Kim+0.012
 Johnny+0.012
Neg logit contributions
 conclusions-0.014
growth-0.013
ede-0.012
-0.012
 mushrooms-0.012
Token animals
Token ID4695
Feature activation+1.05e+00

Pos logit contributions
 Kim+0.028
 Su+0.025
 cho+0.024
 Johnny+0.024
ug+0.023
Neg logit contributions
 conclusions-0.026
ede-0.024
growth-0.023
-0.023
',"-0.023
Token.
Token ID13
Feature activation-8.52e-01

Pos logit contributions
 Kim+0.030
Kim+0.028
 Su+0.027
 cho+0.026
ug+0.026
Neg logit contributions
 conclusions-0.019
ede-0.017
 mushrooms-0.017
ils-0.016
 vegetable-0.016
Token the
Token ID262
Feature activation+2.84e-01

Pos logit contributions
 conclusions+0.000
growth+0.000
ede+0.000
+0.000
',"+0.000
Neg logit contributions
 Kim-0.000
 Su-0.000
 cho-0.000
 Johnny-0.000
 driving-0.000
Token park
Token ID3952
Feature activation+4.14e-01

Pos logit contributions
 Kim+0.007
 Su+0.007
ug+0.006
 cho+0.006
rol+0.006
Neg logit contributions
 conclusions-0.007
ede-0.006
growth-0.006
-0.006
 moral-0.005
Token,
Token ID11
Feature activation-1.50e-01

Pos logit contributions
 Kim+0.009
 Su+0.008
 cho+0.008
 Johnny+0.008
Kim+0.008
Neg logit contributions
 conclusions-0.010
ede-0.010
growth-0.010
-0.009
',"-0.009
Token having
Token ID1719
Feature activation+9.35e-01

Pos logit contributions
ede+0.003
 closely+0.003
 Fran+0.003
+0.003
cious+0.003
Neg logit contributions
 Kim-0.003
ug-0.003
 Su-0.003
Kim-0.003
reat-0.003
Token lots
Token ID6041
Feature activation+1.59e+00

Pos logit contributions
 Kim+0.022
ug+0.020
icking+0.019
reat+0.019
 Su+0.019
Neg logit contributions
 conclusions-0.021
growth-0.019
 mushrooms-0.018
ede-0.018
ils-0.017
Token of
Token ID286
Feature activation+5.48e+00

Pos logit contributions
 Kim+0.026
 Su+0.022
 Johnny+0.022
Kim+0.020
 cho+0.020
Neg logit contributions
 conclusions-0.048
ede-0.044
 mushrooms-0.043
growth-0.043
leness-0.042
Token fun
Token ID1257
Feature activation+1.92e+00

Pos logit contributions
 Kim+0.106
 Su+0.096
 cho+0.095
 Johnny+0.093
ug+0.091
Neg logit contributions
 conclusions-0.149
ede-0.138
growth-0.131
 mushrooms-0.129
 moral-0.129
Token.
Token ID13
Feature activation-3.20e-01

Pos logit contributions
 Kim+0.039
 driving+0.035
 Su+0.034
 speeding+0.034
 cho+0.034
Neg logit contributions
 conclusions-0.053
growth-0.050
-0.049
',"-0.048
 vegetable-0.046
Token Suddenly
Token ID24975
Feature activation+1.98e-01

Pos logit contributions
 conclusions+0.008
ede+0.007
growth+0.007
+0.007
',"+0.007
Neg logit contributions
 Kim-0.008
 Su-0.007
 cho-0.007
 Johnny-0.007
Kim-0.006
Token,
Token ID11
Feature activation+4.01e-02

Pos logit contributions
rol+0.005
reat+0.005
 Kim+0.005
umbling+0.005
coat+0.005
Neg logit contributions
 conclusions-0.003
-0.003
 mushrooms-0.003
growth-0.003
ede-0.003
Token they
Token ID484
Feature activation+2.46e-02

Pos logit contributions
reat+0.001
 Kim+0.001
ug+0.001
rol+0.001
coat+0.001
Neg logit contributions
 conclusions-0.001
growth-0.001
 mushrooms-0.001
-0.001
ede-0.001
Token park
Token ID3952
Feature activation+3.83e-01

Pos logit contributions
 Kim+0.016
ug+0.015
 Su+0.014
 Johnny+0.014
 cho+0.014
Neg logit contributions
 conclusions-0.013
growth-0.012
 vegetable-0.012
ede-0.011
-0.011
Token and
Token ID290
Feature activation+2.86e-01

Pos logit contributions
 Kim+0.008
 cho+0.007
 Su+0.007
 Johnny+0.007
ug+0.007
Neg logit contributions
 conclusions-0.010
ede-0.009
growth-0.009
-0.009
',"-0.009
Token was
Token ID373
Feature activation-6.96e-01

Pos logit contributions
ug+0.008
 Kim+0.008
 cho+0.008
reat+0.007
Kim+0.007
Neg logit contributions
 conclusions-0.005
ede-0.004
growth-0.004
-0.004
 Fran-0.004
Token having
Token ID1719
Feature activation+9.63e-01

Pos logit contributions
cious+0.008
 closely+0.008
 ear+0.007
 Fran+0.007
 moral+0.007
Neg logit contributions
ug-0.020
eddy-0.019
ister-0.018
Johnny-0.018
ay-0.018
Token lots
Token ID6041
Feature activation+1.69e+00

Pos logit contributions
 Kim+0.022
ug+0.021
 cho+0.020
Kim+0.019
 Su+0.019
Neg logit contributions
 conclusions-0.022
growth-0.020
 mushrooms-0.020
ede-0.019
',"-0.019
Token of
Token ID286
Feature activation+5.48e+00

Pos logit contributions
 Kim+0.027
 cho+0.023
 Johnny+0.023
Kim+0.022
 Su+0.022
Neg logit contributions
 conclusions-0.051
 mushrooms-0.047
ede-0.046
 moral-0.045
growth-0.044
Token fun
Token ID1257
Feature activation+1.91e+00

Pos logit contributions
 Kim+0.112
 cho+0.105
ug+0.102
 Johnny+0.102
 Su+0.099
Neg logit contributions
 conclusions-0.135
ede-0.125
 mushrooms-0.121
 moral-0.120
 vegetable-0.116
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
 Kim+0.047
 Su+0.043
 cho+0.041
 driving+0.040
 Johnny+0.040
Neg logit contributions
 conclusions-0.044
growth-0.041
-0.040
ede-0.040
',"-0.039
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
 conclusions+0.004
+0.004
growth+0.004
ede+0.004
',"+0.004
Neg logit contributions
 Kim-0.006
 cho-0.005
 Su-0.005
ug-0.005
 Johnny-0.005
Token
Token ID198
Feature activation-2.02e+00

Pos logit contributions
 conclusions+0.004
ede+0.003
+0.003
growth+0.003
 mushrooms+0.003
Neg logit contributions
 Kim-0.004
 Su-0.003
 Johnny-0.003
Kim-0.003
 cho-0.003
TokenSuddenly
Token ID38582
Feature activation-1.25e-01

Pos logit contributions
 conclusions+0.043
ede+0.038
growth+0.038
+0.038
',"+0.037
Neg logit contributions
 Kim-0.053
 Su-0.048
 cho-0.048
 Johnny-0.047
ug-0.046
Token park
Token ID3952
Feature activation-2.78e-01

Pos logit contributions
 Kim+0.048
 Su+0.043
ug+0.043
 cho+0.041
 Johnny+0.041
Neg logit contributions
 conclusions-0.043
growth-0.039
ede-0.038
-0.037
 vegetable-0.036
Token together
Token ID1978
Feature activation-2.17e+00

Pos logit contributions
 conclusions+0.007
growth+0.006
ede+0.006
+0.006
',"+0.006
Neg logit contributions
 Kim-0.006
 Su-0.006
 cho-0.006
 Johnny-0.006
 driving-0.005
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
 conclusions+0.057
growth+0.052
ede+0.051
+0.050
',"+0.049
Neg logit contributions
 Kim-0.045
 Su-0.041
 cho-0.041
 driving-0.039
 Johnny-0.039
Token having
Token ID1719
Feature activation+9.17e-01

Pos logit contributions
 conclusions+0.005
ede+0.005
 Fran+0.005
+0.005
 vegetable+0.005
Neg logit contributions
 Kim-0.005
ug-0.005
Kim-0.005
 cho-0.005
 Su-0.004
Token lots
Token ID6041
Feature activation+1.30e+00

Pos logit contributions
 Kim+0.021
ug+0.019
 Su+0.018
 cho+0.018
 Johnny+0.018
Neg logit contributions
 conclusions-0.021
growth-0.020
ede-0.019
 mushrooms-0.019
cious-0.018
Token of
Token ID286
Feature activation+5.46e+00

Pos logit contributions
 Kim+0.022
 Johnny+0.019
 Su+0.018
Kim+0.018
 cho+0.017
Neg logit contributions
 conclusions-0.039
 mushrooms-0.035
ede-0.035
growth-0.034
 moral-0.034
Token fun
Token ID1257
Feature activation+1.68e+00

Pos logit contributions
 Kim+0.107
 cho+0.096
 Johnny+0.095
 Su+0.094
ug+0.092
Neg logit contributions
 conclusions-0.146
ede-0.135
growth-0.128
 moral-0.127
 mushrooms-0.126
Token.
Token ID13
Feature activation-8.27e-03

Pos logit contributions
 Kim+0.033
 driving+0.030
 Su+0.030
 speeding+0.029
ug+0.029
Neg logit contributions
 conclusions-0.047
growth-0.045
-0.043
',"-0.042
 tripod-0.041
Token One
Token ID1881
Feature activation+8.84e-01

Pos logit contributions
 conclusions+0.000
ede+0.000
ils+0.000
+0.000
growth+0.000
Neg logit contributions
 Kim-0.000
 Su-0.000
 Tim-0.000
 Johnny-0.000
 Liz-0.000
Token day
Token ID1110
Feature activation-1.09e+00

Pos logit contributions
 Kim+0.022
rol+0.021
 cho+0.020
 driving+0.020
ug+0.020
Neg logit contributions
 moral-0.017
 conclusions-0.017
 end-0.016
 vegetable-0.016
ede-0.016
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
 conclusions+0.032
ede+0.029
growth+0.029
+0.028
',"+0.028
Neg logit contributions
 Kim-0.021
 Su-0.019
 cho-0.018
 Johnny-0.017
 driving-0.017
Token loud
Token ID7812
Feature activation+6.08e-01

Pos logit contributions
 Kim+0.029
rol+0.028
 Su+0.028
Kim+0.027
 Johnny+0.027
Neg logit contributions
 conclusions-0.026
ede-0.022
 moral-0.020
growth-0.020
 closely-0.020
Token noises
Token ID26782
Feature activation+5.54e-01

Pos logit contributions
 Kim+0.016
 Su+0.014
 Johnny+0.014
Kim+0.014
coat+0.013
Neg logit contributions
 conclusions-0.013
growth-0.012
ede-0.012
-0.011
',"-0.011
Token and
Token ID290
Feature activation+2.02e+00

Pos logit contributions
 Kim+0.016
 Su+0.015
Kim+0.015
 Johnny+0.015
 Liz+0.014
Neg logit contributions
 conclusions-0.010
growth-0.009
ede-0.009
'-0.008
-0.008
Token saw
Token ID2497
Feature activation+1.16e+00

Pos logit contributions
 Kim+0.050
 Su+0.047
coat+0.046
Kim+0.046
 Johnny+0.046
Neg logit contributions
 conclusions-0.046
growth-0.039
ede-0.036
 moral-0.036
 mushrooms-0.036
Token flashes
Token ID29227
Feature activation+1.92e+00

Pos logit contributions
coat+0.032
 Kim+0.029
Kim+0.029
reat+0.029
rol+0.028
Neg logit contributions
 conclusions-0.024
growth-0.021
 mushrooms-0.019
 moral-0.017
der-0.016
Token of
Token ID286
Feature activation+5.44e+00

Pos logit contributions
 Kim+0.048
 Su+0.045
 Johnny+0.042
Kim+0.042
coat+0.042
Neg logit contributions
 conclusions-0.043
growth-0.039
 closely-0.036
 moral-0.035
 mushrooms-0.035
Token light
Token ID1657
Feature activation+8.06e-01

Pos logit contributions
coat+0.114
 Kim+0.112
icking+0.110
rol+0.109
ug+0.108
Neg logit contributions
 conclusions-0.145
growth-0.126
 mushrooms-0.119
 moral-0.109
 vegetable-0.109
Token.
Token ID13
Feature activation-2.87e-02

Pos logit contributions
 Kim+0.025
 Su+0.023
Kim+0.023
 Johnny+0.022
Su+0.022
Neg logit contributions
 conclusions-0.013
growth-0.012
ede-0.010
-0.010
 mushrooms-0.010
Token They
Token ID1119
Feature activation-4.81e-01

Pos logit contributions
 conclusions+0.001
growth+0.000
+0.000
',"+0.000
ede+0.000
Neg logit contributions
 Kim-0.001
 cho-0.001
 Su-0.001
 driving-0.001
 speeding-0.001
Token were
Token ID547
Feature activation-4.21e-01

Pos logit contributions
 conclusions+0.010
growth+0.009
ede+0.009
+0.008
',"+0.008
Neg logit contributions
 Kim-0.012
 Su-0.011
 cho-0.011
 Johnny-0.011
ug-0.011
Token glad
Token ID9675
Feature activation+5.23e-01

Pos logit contributions
 conclusions+0.008
 closely+0.008
 wrists+0.007
 moral+0.007
 stems+0.007
Neg logit contributions
Kim-0.010
 Su-0.010
 Kim-0.010
coat-0.009
Johnny-0.009
Token the
Token ID262
Feature activation+1.96e+00

Pos logit contributions
 conclusions+0.003
growth+0.003
ede+0.003
+0.003
',"+0.003
Neg logit contributions
 Kim-0.002
 Su-0.002
 cho-0.002
 Johnny-0.002
Kim-0.002
Token forest
Token ID8222
Feature activation-5.24e-01

Pos logit contributions
 Kim+0.044
 Su+0.039
 cho+0.039
 Johnny+0.038
 speeding+0.036
Neg logit contributions
 conclusions-0.049
-0.044
growth-0.044
ede-0.044
',"-0.044
Token,
Token ID11
Feature activation-1.14e-01

Pos logit contributions
 conclusions+0.013
ede+0.011
growth+0.011
+0.011
',"+0.011
Neg logit contributions
 Kim-0.012
 Su-0.011
 cho-0.011
 Johnny-0.011
Kim-0.010
Token having
Token ID1719
Feature activation+5.80e-01

Pos logit contributions
ede+0.002
 closely+0.002
growth+0.002
 conclusions+0.002
cious+0.002
Neg logit contributions
 Kim-0.003
Kim-0.003
ug-0.003
 Su-0.003
 Johnny-0.003
Token lots
Token ID6041
Feature activation+1.21e+00

Pos logit contributions
 Kim+0.015
ug+0.013
Kim+0.013
 Su+0.013
icking+0.013
Neg logit contributions
 conclusions-0.012
growth-0.012
 mushrooms-0.011
ede-0.011
cious-0.010
Token of
Token ID286
Feature activation+5.39e+00

Pos logit contributions
 Kim+0.025
Kim+0.021
 Johnny+0.021
 Su+0.021
 Matt+0.020
Neg logit contributions
 conclusions-0.034
 mushrooms-0.032
growth-0.030
ede-0.029
ils-0.029
Token fun
Token ID1257
Feature activation+1.69e+00

Pos logit contributions
 Kim+0.105
 Su+0.092
 cho+0.090
 Johnny+0.089
Kim+0.086
Neg logit contributions
 conclusions-0.151
ede-0.140
growth-0.137
 mushrooms-0.134
 moral-0.132
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
 Kim+0.036
 Su+0.032
 driving+0.032
 cho+0.032
ug+0.031
Neg logit contributions
 conclusions-0.044
-0.041
growth-0.040
',"-0.040
ede-0.039
Token
Token ID198
Feature activation-5.55e-01

Pos logit contributions
 conclusions+0.030
ede+0.027
growth+0.026
+0.026
',"+0.025
Neg logit contributions
 Kim-0.025
 Johnny-0.022
 Su-0.022
 cho-0.021
 Tim-0.021
Token
Token ID198
Feature activation-2.20e+00

Pos logit contributions
 conclusions+0.015
+0.014
ede+0.014
growth+0.013
ils+0.013
Neg logit contributions
 Kim-0.012
 Su-0.010
 Johnny-0.010
 cho-0.010
Kim-0.009
TokenAs
Token ID1722
Feature activation-1.24e+00

Pos logit contributions
 conclusions+0.060
growth+0.054
',"+0.051
 wrists+0.050
 mushrooms+0.050
Neg logit contributions
 Kim-0.038
 cho-0.038
Kim-0.036
 speeding-0.035
 driving-0.034
Token soon
Token ID2582
Feature activation+4.73e-03

Pos logit contributions
ug+0.001
ick+0.001
reat+0.001
 cho+0.001
umbling+0.001
Neg logit contributions
 conclusions-0.001
 wrists-0.001
cious-0.001
ils-0.001
-0.001
Token they
Token ID484
Feature activation+5.90e-01

Pos logit contributions
ug+0.000
Kim+0.000
 Kim+0.000
reat+0.000
umbling+0.000
Neg logit contributions
 conclusions-0.000
ede-0.000
growth-0.000
 Fran-0.000
ils-0.000
Token were
Token ID547
Feature activation-1.25e-01

Pos logit contributions
 Kim+0.016
Kim+0.015
ug+0.015
reat+0.014
 cho+0.014
Neg logit contributions
ede-0.009
'-0.009
ils-0.008
 conclusions-0.008
growth-0.008
Token having
Token ID1719
Feature activation+6.13e-01

Pos logit contributions
 closely+0.002
cious+0.002
ils+0.002
pieces+0.002
 wrists+0.002
Neg logit contributions
Kim-0.003
ug-0.003
elt-0.003
eddy-0.003
 Kim-0.003
Token lots
Token ID6041
Feature activation+1.49e+00

Pos logit contributions
 Kim+0.015
ug+0.013
Kim+0.013
reat+0.012
 cho+0.012
Neg logit contributions
 conclusions-0.014
growth-0.012
',"-0.012
 mushrooms-0.012
ede-0.012
Token of
Token ID286
Feature activation+5.38e+00

Pos logit contributions
 Kim+0.029
Kim+0.025
 Johnny+0.023
 cho+0.022
 Liz+0.022
Neg logit contributions
 conclusions-0.042
 mushrooms-0.036
ede-0.036
leness-0.036
ils-0.035
Token fun
Token ID1257
Feature activation+1.44e+00

Pos logit contributions
 Kim+0.102
 cho+0.088
 Su+0.087
 Johnny+0.086
ug+0.086
Neg logit contributions
 conclusions-0.152
ede-0.137
growth-0.135
',"-0.134
 moral-0.132
Token.
Token ID13
Feature activation-6.06e-01

Pos logit contributions
 Kim+0.037
 Su+0.033
 cho+0.033
 Johnny+0.032
Kim+0.031
Neg logit contributions
 conclusions-0.032
growth-0.028
ede-0.028
-0.028
',"-0.027
Token After
Token ID2293
Feature activation+1.77e+00

Pos logit contributions
+0.008
 conclusions+0.008
growth+0.007
',"+0.007
'."+0.007
Neg logit contributions
 cho-0.019
ug-0.018
 Kim-0.017
coat-0.017
reat-0.017
Token a
Token ID257
Feature activation+3.63e-01

Pos logit contributions
 Kim+0.046
coat+0.045
ug+0.044
reat+0.044
umbling+0.043
Neg logit contributions
 conclusions-0.032
growth-0.029
-0.028
ede-0.027
 moral-0.027
Token while
Token ID981
Feature activation-3.19e-01

Pos logit contributions
 Kim+0.008
 Su+0.007
 cho+0.007
 Johnny+0.007
Kim+0.007
Neg logit contributions
 conclusions-0.009
growth-0.008
ede-0.008
-0.008
',"-0.008
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 Kim+0.045
 Su+0.041
 Johnny+0.040
Kim+0.039
 cho+0.039
Neg logit contributions
 conclusions-0.035
ede-0.033
growth-0.031
-0.031
',"-0.030
Token He
Token ID679
Feature activation-9.99e-01

Pos logit contributions
growth+0.018
 conclusions+0.018
+0.016
 One+0.016
 vegetable+0.016
Neg logit contributions
ug-0.027
 cho-0.026
 Kim-0.025
rol-0.024
icking-0.024
Token was
Token ID373
Feature activation-1.01e+00

Pos logit contributions
ede+0.019
 conclusions+0.019
ils+0.017
growth+0.016
 wrists+0.016
Neg logit contributions
 Kim-0.027
 Su-0.024
Kim-0.024
reat-0.023
ug-0.023
Token having
Token ID1719
Feature activation+4.60e-01

Pos logit contributions
 closely+0.012
 conclusions+0.011
cious+0.010
 wrists+0.010
 moral+0.010
Neg logit contributions
loo-0.029
ister-0.028
ait-0.028
inner-0.028
ay-0.028
Token lots
Token ID6041
Feature activation+1.28e+00

Pos logit contributions
 Kim+0.012
ug+0.011
icking+0.011
Kim+0.011
ait+0.010
Neg logit contributions
 conclusions-0.009
 mushrooms-0.008
growth-0.008
ede-0.008
 moral-0.008
Token of
Token ID286
Feature activation+5.38e+00

Pos logit contributions
 Kim+0.023
 Johnny+0.019
Kim+0.019
 Su+0.018
 Matt+0.018
Neg logit contributions
 conclusions-0.037
ede-0.033
growth-0.032
 mushrooms-0.032
leness-0.031
Token fun
Token ID1257
Feature activation+2.04e+00

Pos logit contributions
 Kim+0.114
 cho+0.101
 Su+0.098
ug+0.098
 Johnny+0.098
Neg logit contributions
 conclusions-0.137
ede-0.124
 moral-0.118
 mushrooms-0.115
growth-0.112
Token,
Token ID11
Feature activation+3.32e-01

Pos logit contributions
 Kim+0.046
 Su+0.041
 cho+0.040
 Johnny+0.040
 driving+0.039
Neg logit contributions
 conclusions-0.051
growth-0.047
-0.046
ede-0.046
',"-0.046
Token and
Token ID290
Feature activation-2.68e-02

Pos logit contributions
 Kim+0.010
ug+0.009
ick+0.009
Kim+0.009
eddy+0.009
Neg logit contributions
 conclusions-0.006
ede-0.005
 moral-0.005
 closely-0.004
 mushrooms-0.004
Token was
Token ID373
Feature activation-6.94e-01

Pos logit contributions
 conclusions+0.001
ede+0.000
 closely+0.000
 vegetable+0.000
 foods+0.000
Neg logit contributions
ug-0.001
ick-0.001
un-0.001
ait-0.001
 Kim-0.001
Token almost
Token ID2048
Feature activation+3.86e-01

Pos logit contributions
 closely+0.010
 conclusions+0.009
 moral+0.009
 vegetable+0.008
 one+0.008
Neg logit contributions
ug-0.018
ister-0.018
ait-0.017
ay-0.017
loo-0.017
Token park
Token ID3952
Feature activation+1.14e-01

Pos logit contributions
 Kim+0.023
ug+0.022
 Su+0.021
 Johnny+0.021
 Liz+0.020
Neg logit contributions
 conclusions-0.020
ede-0.017
growth-0.017
 moral-0.017
-0.017
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
 Kim+0.003
 Su+0.002
 Johnny+0.002
 cho+0.002
ug+0.002
Neg logit contributions
 conclusions-0.003
ede-0.002
growth-0.002
-0.002
',"-0.002
Token had
Token ID550
Feature activation-3.07e-01

Pos logit contributions
 Kim+0.022
 Su+0.019
 cho+0.019
 Johnny+0.018
ug+0.018
Neg logit contributions
 conclusions-0.030
ede-0.027
growth-0.027
-0.027
',"-0.026
Token a
Token ID257
Feature activation+2.53e+00

Pos logit contributions
 conclusions+0.006
growth+0.006
 mushrooms+0.006
',"+0.006
ede+0.006
Neg logit contributions
 Kim-0.008
ug-0.007
Kim-0.007
 Su-0.007
 Johnny-0.007
Token lot
Token ID1256
Feature activation+2.78e+00

Pos logit contributions
 Kim+0.056
 Su+0.050
 cho+0.048
 Johnny+0.048
ug+0.047
Neg logit contributions
 conclusions-0.064
ede-0.058
growth-0.058
-0.058
',"-0.056
Token of
Token ID286
Feature activation+5.37e+00

Pos logit contributions
 Kim+0.037
 Johnny+0.030
 Su+0.030
 cho+0.027
ug+0.026
Neg logit contributions
 conclusions-0.096
growth-0.088
',"-0.087
ede-0.085
 moral-0.085
Token fun
Token ID1257
Feature activation+1.61e+00

Pos logit contributions
 Kim+0.100
 Su+0.088
 cho+0.085
 Johnny+0.084
ug+0.083
Neg logit contributions
 conclusions-0.153
ede-0.138
growth-0.138
',"-0.136
 moral-0.135
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
 Kim+0.042
 Su+0.038
 cho+0.038
 Johnny+0.037
 driving+0.036
Neg logit contributions
 conclusions-0.035
growth-0.031
ede-0.031
-0.031
',"-0.030
Token They
Token ID1119
Feature activation-1.35e-01

Pos logit contributions
 Kim+0.004
 Su+0.003
 cho+0.003
 Johnny+0.003
Kim+0.003
Neg logit contributions
 conclusions-0.003
ede-0.003
growth-0.003
-0.003
',"-0.003
Token swung
Token ID28507
Feature activation+3.94e-01

Pos logit contributions
 conclusions+0.003
ede+0.003
growth+0.003
+0.002
 moral+0.002
Neg logit contributions
 Kim-0.003
 Su-0.003
 cho-0.003
ug-0.003
 Johnny-0.003
Token and
Token ID290
Feature activation+1.49e+00

Pos logit contributions
 Kim+0.010
 Su+0.009
 Johnny+0.009
Kim+0.009
ug+0.009
Neg logit contributions
 conclusions-0.008
growth-0.007
ede-0.007
-0.007
',"-0.007
Token.
Token ID13
Feature activation-9.90e-01

Pos logit contributions
 Kim+0.014
Kim+0.013
 Su+0.013
 Johnny+0.013
Johnny+0.013
Neg logit contributions
 conclusions-0.008
ede-0.008
 mushrooms-0.007
growth-0.007
-0.007
Token Tom
Token ID4186
Feature activation-3.59e-01

Pos logit contributions
 conclusions+0.021
growth+0.020
+0.019
',"+0.019
'."+0.018
Neg logit contributions
 Kim-0.022
ug-0.022
 cho-0.021
reat-0.021
 Su-0.021
Token was
Token ID373
Feature activation-8.06e-01

Pos logit contributions
ede+0.006
'+0.005
ils+0.005
ience+0.005
+0.005
Neg logit contributions
 Kim-0.011
 Su-0.010
Kim-0.010
reat-0.010
Su-0.010
Token having
Token ID1719
Feature activation+4.54e-01

Pos logit contributions
 moral+0.011
 conclusions+0.011
cious+0.011
 Fran+0.011
 closely+0.010
Neg logit contributions
loo-0.021
ug-0.021
eddy-0.020
 Kim-0.020
reat-0.020
Token lots
Token ID6041
Feature activation+1.25e+00

Pos logit contributions
 Kim+0.011
ug+0.011
Kim+0.011
icking+0.011
reat+0.011
Neg logit contributions
 conclusions-0.009
 mushrooms-0.008
growth-0.008
 moral-0.008
ede-0.007
Token of
Token ID286
Feature activation+5.36e+00

Pos logit contributions
 Kim+0.023
 Su+0.020
 Johnny+0.019
Kim+0.019
 Matt+0.019
Neg logit contributions
 conclusions-0.036
ede-0.033
 mushrooms-0.032
leness-0.032
growth-0.032
Token fun
Token ID1257
Feature activation+1.82e+00

Pos logit contributions
 Kim+0.099
 Su+0.088
 cho+0.086
 Johnny+0.083
Kim+0.079
Neg logit contributions
 conclusions-0.155
ede-0.143
growth-0.140
-0.138
',"-0.137
Token,
Token ID11
Feature activation+2.75e-01

Pos logit contributions
 Kim+0.032
 driving+0.029
 cho+0.028
 speeding+0.028
 Su+0.028
Neg logit contributions
 conclusions-0.053
growth-0.050
',"-0.049
-0.049
ede-0.048
Token playing
Token ID2712
Feature activation+4.02e-01

Pos logit contributions
 Kim+0.008
ug+0.007
ick+0.007
 Su+0.007
Kim+0.007
Neg logit contributions
 conclusions-0.004
ede-0.004
 moral-0.004
 mushrooms-0.004
 closely-0.004
Token with
Token ID351
Feature activation+6.96e-02

Pos logit contributions
 Kim+0.008
 Su+0.007
 cho+0.007
 Johnny+0.007
 driving+0.007
Neg logit contributions
 conclusions-0.011
growth-0.010
-0.010
',"-0.009
ede-0.009
Token the
Token ID262
Feature activation+2.06e-01

Pos logit contributions
 Kim+0.002
 Su+0.002
coat+0.002
ug+0.002
Kim+0.002
Neg logit contributions
 conclusions-0.002
growth-0.001
 mushrooms-0.001
ede-0.001
 vegetable-0.001
Token.
Token ID13
Feature activation+2.52e-01

Pos logit contributions
'+0.009
 closely+0.008
ede+0.008
 conclusions+0.008
der+0.008
Neg logit contributions
 Kim-0.031
 Su-0.030
Kim-0.029
 Johnny-0.029
Su-0.029
Token He
Token ID679
Feature activation-8.35e-01

Pos logit contributions
icking+0.006
 cho+0.006
ug+0.006
rol+0.006
coat+0.006
Neg logit contributions
growth-0.005
 conclusions-0.005
 mushrooms-0.004
 Fran-0.004
ede-0.004
Token felt
Token ID2936
Feature activation-3.05e-01

Pos logit contributions
 conclusions+0.018
growth+0.017
ede+0.017
ils+0.015
 wrists+0.015
Neg logit contributions
 Kim-0.020
 Su-0.019
 Johnny-0.018
ug-0.018
reat-0.018
Token a
Token ID257
Feature activation+1.63e+00

Pos logit contributions
 conclusions+0.007
growth+0.006
 mushrooms+0.006
 vegetable+0.006
',"+0.005
Neg logit contributions
 Kim-0.007
coat-0.007
Kim-0.007
 Su-0.007
ug-0.007
Token lot
Token ID1256
Feature activation+2.74e+00

Pos logit contributions
 Kim+0.037
 Su+0.033
 cho+0.032
 Johnny+0.032
Kim+0.031
Neg logit contributions
 conclusions-0.041
growth-0.037
ede-0.037
-0.036
',"-0.035
Token of
Token ID286
Feature activation+5.28e+00

Pos logit contributions
 Kim+0.049
 Johnny+0.047
 Su+0.044
Johnny+0.043
 Matt+0.042
Neg logit contributions
 conclusions-0.083
growth-0.073
ede-0.072
 moral-0.069
 sip-0.069
Token pain
Token ID2356
Feature activation+1.39e-01

Pos logit contributions
 Kim+0.140
 Johnny+0.135
 cho+0.132
 Su+0.132
icking+0.131
Neg logit contributions
 conclusions-0.111
 moral-0.090
growth-0.090
 mushrooms-0.088
ede-0.086
Token and
Token ID290
Feature activation+1.54e+00

Pos logit contributions
 Kim+0.005
 Su+0.005
Kim+0.005
Su+0.005
 Liz+0.004
Neg logit contributions
der-0.001
 conclusions-0.001
'-0.001
ede-0.001
 closely-0.001
Token blood
Token ID2910
Feature activation+1.83e-01

Pos logit contributions
coat+0.040
rol+0.040
Su+0.039
Kim+0.039
 Su+0.039
Neg logit contributions
growth-0.030
 conclusions-0.029
 mushrooms-0.025
 vegetable-0.022
 stems-0.022
Token.
Token ID13
Feature activation+3.62e-02

Pos logit contributions
 Kim+0.006
 Su+0.005
Kim+0.005
Su+0.005
omo+0.005
Neg logit contributions
 conclusions-0.002
growth-0.002
ede-0.002
ils-0.002
le-0.002
Token He
Token ID679
Feature activation-1.10e+00

Pos logit contributions
 cho+0.001
 Kim+0.001
ug+0.001
coat+0.001
rol+0.001
Neg logit contributions
 conclusions-0.001
growth-0.001
-0.001
 mushrooms-0.001
',"-0.001
Token.
Token ID13
Feature activation+5.93e-01

Pos logit contributions
 Kim+0.040
 Su+0.039
Kim+0.037
 Johnny+0.036
Su+0.036
Neg logit contributions
 conclusions-0.016
ede-0.016
',"-0.014
growth-0.014
'."-0.014
Token They
Token ID1119
Feature activation+4.57e-02

Pos logit contributions
 Kim+0.015
 Su+0.014
 cho+0.014
 Johnny+0.014
ug+0.013
Neg logit contributions
 conclusions-0.012
growth-0.011
ede-0.011
-0.011
',"-0.011
Token were
Token ID547
Feature activation-3.86e-01

Pos logit contributions
 Kim+0.001
 Su+0.001
 cho+0.001
 Johnny+0.001
Kim+0.001
Neg logit contributions
 conclusions-0.001
ede-0.001
growth-0.001
-0.001
',"-0.001
Token having
Token ID1719
Feature activation+3.88e-01

Pos logit contributions
 mushrooms+0.006
 moral+0.006
 wrists+0.006
 Fran+0.006
cious+0.005
Neg logit contributions
ug-0.010
Kim-0.009
 Su-0.009
ister-0.009
Johnny-0.009
Token lots
Token ID6041
Feature activation+8.73e-01

Pos logit contributions
 Kim+0.008
 Su+0.007
ug+0.007
Kim+0.007
 cho+0.007
Neg logit contributions
 conclusions-0.009
growth-0.009
 mushrooms-0.009
ede-0.008
',"-0.008
Token of
Token ID286
Feature activation+5.26e+00

Pos logit contributions
 Kim+0.014
 Su+0.012
 Johnny+0.011
Kim+0.011
 cho+0.011
Neg logit contributions
 conclusions-0.027
ede-0.024
 mushrooms-0.024
growth-0.024
ils-0.023
Token fun
Token ID1257
Feature activation+1.49e+00

Pos logit contributions
 Kim+0.075
 Su+0.059
 Johnny+0.056
 cho+0.056
 driving+0.054
Neg logit contributions
 conclusions-0.181
growth-0.171
-0.170
ede-0.168
',"-0.165
Token.
Token ID13
Feature activation+2.82e-01

Pos logit contributions
 Kim+0.027
 driving+0.024
 speeding+0.022
 cho+0.022
 Tim+0.022
Neg logit contributions
 conclusions-0.046
growth-0.044
',"-0.042
-0.042
sie-0.041
Token
Token ID198
Feature activation+8.32e-03

Pos logit contributions
 Kim+0.006
 Su+0.006
 Tim+0.006
 Johnny+0.005
 cho+0.005
Neg logit contributions
 conclusions-0.008
ede-0.007
growth-0.007
ils-0.007
-0.007
Token
Token ID198
Feature activation-1.36e+00

Pos logit contributions
 Kim+0.000
 Su+0.000
 Tim+0.000
Kim+0.000
 Johnny+0.000
Neg logit contributions
 conclusions-0.000
ede-0.000
-0.000
growth-0.000
 wrists-0.000
TokenThen
Token ID6423
Feature activation+4.18e-01

Pos logit contributions
 conclusions+0.031
growth+0.027
ede+0.027
',"+0.026
+0.026
Neg logit contributions
 Kim-0.036
 Su-0.032
 cho-0.032
Kim-0.032
 driving-0.030
Token.
Token ID13
Feature activation-4.38e-01

Pos logit contributions
 conclusions+0.019
growth+0.017
ede+0.017
+0.017
',"+0.016
Neg logit contributions
 Kim-0.024
 Su-0.022
 cho-0.021
 Johnny-0.021
 driving-0.020
Token They
Token ID1119
Feature activation+1.38e-01

Pos logit contributions
 conclusions+0.010
ede+0.009
growth+0.009
+0.009
',"+0.009
Neg logit contributions
 Kim-0.011
 Su-0.010
 Johnny-0.009
 cho-0.009
 Liz-0.009
Token were
Token ID547
Feature activation-2.81e-01

Pos logit contributions
 Kim+0.003
ug+0.003
 Su+0.003
 cho+0.003
Kim+0.003
Neg logit contributions
ede-0.003
 conclusions-0.003
-0.003
 tripod-0.003
ils-0.003
Token having
Token ID1719
Feature activation+2.50e-01

Pos logit contributions
cious+0.003
 Fran+0.003
 wrists+0.003
 closely+0.003
 moral+0.003
Neg logit contributions
loo-0.007
ister-0.007
less-0.007
ug-0.007
inner-0.007
Token lots
Token ID6041
Feature activation+8.95e-01

Pos logit contributions
 Kim+0.006
ug+0.006
Kim+0.005
icking+0.005
reat+0.005
Neg logit contributions
 conclusions-0.005
 mushrooms-0.005
growth-0.005
ede-0.005
cious-0.004
Token of
Token ID286
Feature activation+5.25e+00

Pos logit contributions
 Kim+0.015
 Johnny+0.012
 Su+0.012
 cho+0.012
Kim+0.012
Neg logit contributions
 conclusions-0.026
ede-0.024
 mushrooms-0.024
growth-0.023
ils-0.023
Token fun
Token ID1257
Feature activation+1.43e+00

Pos logit contributions
 Kim+0.095
 Su+0.084
 cho+0.084
 Johnny+0.081
ug+0.079
Neg logit contributions
 conclusions-0.149
ede-0.138
growth-0.134
',"-0.132
-0.131
Token until
Token ID1566
Feature activation+5.03e-01

Pos logit contributions
 Kim+0.028
 Su+0.024
 driving+0.024
 cho+0.023
 speeding+0.023
Neg logit contributions
 conclusions-0.042
growth-0.040
',"-0.038
-0.038
ede-0.037
Token Lily
Token ID20037
Feature activation-4.57e-02

Pos logit contributions
 Kim+0.010
 Su+0.009
 Johnny+0.009
 cho+0.009
 Tim+0.008
Neg logit contributions
 conclusions-0.015
',"-0.013
ede-0.013
growth-0.013
-0.013
Token's
Token ID338
Feature activation+1.33e+00

Pos logit contributions
ede+0.001
'+0.001
cious+0.000
+0.000
vine+0.000
Neg logit contributions
 Kim-0.002
ug-0.001
rol-0.001
reat-0.001
 cho-0.001
Token friend
Token ID1545
Feature activation+2.10e-01

Pos logit contributions
 Kim+0.039
 Su+0.039
rol+0.038
Kim+0.038
ug+0.037
Neg logit contributions
 moral-0.021
 conclusions-0.020
 vegetable-0.020
 goal-0.019
 end-0.019
Token dark
Token ID3223
Feature activation+1.80e+00

Pos logit contributions
 Kim+0.011
ug+0.010
 Su+0.010
rol+0.010
 Liz+0.009
Neg logit contributions
 conclusions-0.010
 moral-0.009
 end-0.008
 mushrooms-0.008
 vegetable-0.008
Token clouds
Token ID15114
Feature activation+2.62e+00

Pos logit contributions
 Kim+0.040
 Su+0.036
rol+0.036
ug+0.035
 cho+0.035
Neg logit contributions
 conclusions-0.044
growth-0.039
-0.037
',"-0.036
ede-0.035
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
 Kim+0.085
Kim+0.077
 Su+0.077
 Johnny+0.074
ug+0.073
Neg logit contributions
 conclusions-0.039
ede-0.034
'-0.032
growth-0.032
-0.031
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 Kim+0.027
ug+0.026
coat+0.025
Kim+0.025
 Su+0.025
Neg logit contributions
 conclusions-0.027
growth-0.024
 mushrooms-0.023
 vegetable-0.021
 moral-0.021
Token flashes
Token ID29227
Feature activation+2.27e+00

Pos logit contributions
 Kim+0.031
ug+0.030
 Su+0.029
rol+0.028
oo+0.027
Neg logit contributions
 conclusions-0.028
 moral-0.023
growth-0.022
 mushrooms-0.022
 stems-0.022
Token of
Token ID286
Feature activation+5.24e+00

Pos logit contributions
 Kim+0.056
Kim+0.050
 Su+0.050
coat+0.048
 Johnny+0.047
Neg logit contributions
 conclusions-0.053
growth-0.046
 closely-0.045
 sip-0.043
 moral-0.043
Token light
Token ID1657
Feature activation+1.43e+00

Pos logit contributions
 Kim+0.108
coat+0.105
rol+0.102
ug+0.101
icking+0.100
Neg logit contributions
 conclusions-0.140
growth-0.124
 mushrooms-0.117
 moral-0.112
 vegetable-0.109
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
 Kim+0.049
Kim+0.046
 Su+0.045
 Johnny+0.044
Su+0.043
Neg logit contributions
 conclusions-0.018
growth-0.015
ede-0.014
',"-0.013
-0.013
Token He
Token ID679
Feature activation-4.77e-01

Pos logit contributions
 conclusions+0.005
growth+0.005
+0.005
 moral+0.005
',"+0.005
Neg logit contributions
ug-0.006
 Kim-0.006
 cho-0.006
coat-0.006
rol-0.006
Token saw
Token ID2497
Feature activation+6.58e-01

Pos logit contributions
 conclusions+0.009
ede+0.009
growth+0.008
+0.008
 moral+0.008
Neg logit contributions
 Kim-0.013
 Su-0.011
 cho-0.011
ug-0.011
Kim-0.011
Token the
Token ID262
Feature activation+7.49e-01

Pos logit contributions
coat+0.019
 Kim+0.017
Kim+0.017
ug+0.017
akin+0.017
Neg logit contributions
 conclusions-0.012
growth-0.011
 mushrooms-0.010
 moral-0.009
 coral-0.009
Token playing
Token ID2712
Feature activation+7.47e-01

Pos logit contributions
 conclusions+0.019
ede+0.019
 closely+0.018
 moral+0.018
 Fran+0.018
Neg logit contributions
 Kim-0.019
Kim-0.017
 Su-0.016
 cho-0.016
 Liz-0.015
Token with
Token ID351
Feature activation-2.07e-01

Pos logit contributions
 Kim+0.017
 Su+0.015
 cho+0.014
 Johnny+0.014
Kim+0.014
Neg logit contributions
 conclusions-0.019
ede-0.018
-0.017
growth-0.017
',"-0.016
Token his
Token ID465
Feature activation+2.91e-01

Pos logit contributions
 conclusions+0.005
growth+0.004
ede+0.004
 mushrooms+0.004
 vegetable+0.004
Neg logit contributions
 Kim-0.005
 Su-0.004
 cho-0.004
coat-0.004
ug-0.004
Token magnet
Token ID19972
Feature activation-2.10e+00

Pos logit contributions
 Kim+0.006
 Johnny+0.005
 driving+0.005
 Su+0.005
 cho+0.005
Neg logit contributions
 conclusions-0.008
growth-0.007
ede-0.007
',"-0.007
-0.007
Token in
Token ID287
Feature activation-7.07e-01

Pos logit contributions
 conclusions+0.052
ede+0.050
+0.048
growth+0.048
',"+0.047
Neg logit contributions
 Kim-0.047
 Su-0.043
 cho-0.041
Kim-0.039
 Johnny-0.039
Token Cole
Token ID11768
Feature activation-5.45e+00

Pos logit contributions
 conclusions+0.019
growth+0.018
ede+0.017
+0.017
',"+0.017
Neg logit contributions
 Kim-0.015
 Su-0.013
 cho-0.013
 Johnny-0.012
ug-0.012
Token's
Token ID338
Feature activation+6.95e-01

Pos logit contributions
 conclusions+0.107
growth+0.094
ede+0.092
+0.091
',"+0.088
Neg logit contributions
 Kim-0.150
 Su-0.137
 Johnny-0.136
 cho-0.135
 driving-0.132
Token room
Token ID2119
Feature activation-8.46e-01

Pos logit contributions
 Kim+0.014
 Johnny+0.012
 Su+0.012
 cho+0.012
 driving+0.012
Neg logit contributions
 conclusions-0.019
growth-0.017
ede-0.017
-0.017
',"-0.017
Token when
Token ID618
Feature activation+4.72e-01

Pos logit contributions
 conclusions+0.013
ede+0.013
+0.012
growth+0.012
 vegetable+0.012
Neg logit contributions
 Kim-0.026
 Su-0.024
Kim-0.024
 cho-0.024
 Liz-0.023
Token he
Token ID339
Feature activation-2.73e-01

Pos logit contributions
 Kim+0.010
 Su+0.009
 cho+0.009
ug+0.009
coat+0.009
Neg logit contributions
 conclusions-0.012
-0.011
growth-0.011
ede-0.011
 moral-0.011
Token accidentally
Token ID14716
Feature activation+1.04e+00

Pos logit contributions
ede+0.005
 conclusions+0.004
growth+0.004
+0.004
ils+0.004
Neg logit contributions
 Kim-0.008
 Su-0.008
ug-0.007
Kim-0.007
 cho-0.007
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
 conclusions+0.033
growth+0.030
ede+0.029
+0.029
',"+0.028
Neg logit contributions
 Kim-0.029
 Su-0.026
 cho-0.025
 Johnny-0.025
 driving-0.023
Token You
Token ID921
Feature activation-1.61e+00

Pos logit contributions
 Kim+0.006
 cho+0.006
ug+0.006
 Su+0.006
coat+0.006
Neg logit contributions
 conclusions-0.005
-0.005
',"-0.005
growth-0.005
'."-0.005
Token can
Token ID460
Feature activation-1.30e-01

Pos logit contributions
 conclusions+0.029
ede+0.027
growth+0.027
',"+0.027
+0.027
Neg logit contributions
 Kim-0.046
 Su-0.043
 cho-0.042
Kim-0.041
 Johnny-0.040
Token call
Token ID869
Feature activation-1.03e+00

Pos logit contributions
 conclusions+0.003
ede+0.003
growth+0.003
+0.003
',"+0.003
Neg logit contributions
 Kim-0.003
 Su-0.003
 cho-0.003
 Johnny-0.003
ug-0.003
Token me
Token ID502
Feature activation-2.12e+00

Pos logit contributions
 conclusions+0.024
',"+0.021
growth+0.021
+0.021
ede+0.020
Neg logit contributions
 Kim-0.025
 Su-0.022
coat-0.022
Kim-0.022
 cho-0.022
Token Crystal
Token ID12969
Feature activation-5.38e+00

Pos logit contributions
 conclusions+0.048
',"+0.045
+0.044
ede+0.044
growth+0.043
Neg logit contributions
 Kim-0.051
 Su-0.046
 cho-0.045
Kim-0.045
 Johnny-0.043
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
 conclusions+0.128
growth+0.116
ede+0.115
+0.113
',"+0.111
Neg logit contributions
 Kim-0.127
 Su-0.115
 cho-0.112
 Johnny-0.110
 driving-0.106
Tokenâ
Token ID22940
Feature activation+5.99e-01

Pos logit contributions
 Kim+0.009
 Su+0.008
 cho+0.008
 Johnny+0.007
ug+0.007
Neg logit contributions
 conclusions-0.008
growth-0.007
-0.007
ede-0.007
',"-0.007
Token
Token ID26391
Feature activation-1.42e+00

Pos logit contributions
 Kim+0.016
 Su+0.014
 cho+0.014
 Johnny+0.014
Kim+0.013
Neg logit contributions
 conclusions-0.013
growth-0.012
ede-0.011
-0.011
',"-0.011
Token
Token ID198
Feature activation+2.66e-01

Pos logit contributions
 conclusions+0.028
+0.026
ede+0.025
growth+0.025
',"+0.025
Neg logit contributions
 Kim-0.038
 Su-0.035
 cho-0.035
 Johnny-0.034
ug-0.033
Token
Token ID198
Feature activation-7.94e-01

Pos logit contributions
 Kim+0.007
 Su+0.006
 cho+0.006
 Johnny+0.006
Kim+0.006
Neg logit contributions
 conclusions-0.006
ede-0.005
growth-0.005
-0.005
',"-0.005
Token long
Token ID890
Feature activation+5.56e-01

Pos logit contributions
 Kim+0.017
 Su+0.015
 cho+0.015
ug+0.015
 Johnny+0.015
Neg logit contributions
 conclusions-0.014
ede-0.013
growth-0.012
-0.012
',"-0.012
Token time
Token ID640
Feature activation-5.06e-02

Pos logit contributions
 Kim+0.013
 Su+0.012
 cho+0.011
 Johnny+0.011
Kim+0.011
Neg logit contributions
 conclusions-0.013
ede-0.012
growth-0.012
-0.012
',"-0.012
Token.
Token ID13
Feature activation-9.96e-01

Pos logit contributions
 conclusions+0.001
',"+0.001
ede+0.001
+0.001
'."+0.001
Neg logit contributions
 Kim-0.002
Kim-0.002
 cho-0.002
 Su-0.002
 Johnny-0.001
Token They
Token ID1119
Feature activation-7.19e-02

Pos logit contributions
 conclusions+0.029
ede+0.027
growth+0.025
ils+0.025
+0.025
Neg logit contributions
 Kim-0.019
 Tim-0.017
 Tommy-0.017
 Tom-0.016
 Johnny-0.016
Token put
Token ID1234
Feature activation+1.05e-01

Pos logit contributions
 conclusions+0.001
ils+0.001
ede+0.001
+0.001
growth+0.001
Neg logit contributions
 Kim-0.002
 cho-0.002
ug-0.002
Kim-0.002
 speeding-0.002
Token True
Token ID6407
Feature activation-5.21e+00

Pos logit contributions
 Kim+0.002
 Su+0.002
 cho+0.002
Kim+0.002
ug+0.002
Neg logit contributions
 conclusions-0.002
growth-0.002
-0.002
ede-0.002
 mushrooms-0.002
Token in
Token ID287
Feature activation-1.09e+00

Pos logit contributions
 conclusions+0.138
growth+0.126
ede+0.126
+0.125
',"+0.122
Neg logit contributions
 Kim-0.109
 Su-0.097
 cho-0.094
 Johnny-0.093
Kim-0.089
Token a
Token ID257
Feature activation-3.93e-01

Pos logit contributions
 conclusions+0.026
growth+0.024
ede+0.023
+0.023
',"+0.023
Neg logit contributions
 Kim-0.024
 cho-0.022
 Su-0.022
ug-0.022
 Johnny-0.021
Token big
Token ID1263
Feature activation-9.91e-02

Pos logit contributions
 conclusions+0.010
growth+0.009
ede+0.009
+0.009
',"+0.008
Neg logit contributions
 Kim-0.009
 Su-0.008
 cho-0.008
 Johnny-0.008
Kim-0.008
Token box
Token ID3091
Feature activation-1.73e-01

Pos logit contributions
 conclusions+0.002
ede+0.002
growth+0.002
+0.002
',"+0.002
Neg logit contributions
 Kim-0.003
 Su-0.002
 cho-0.002
 Johnny-0.002
Kim-0.002
Token with
Token ID351
Feature activation-5.82e-01

Pos logit contributions
 closely+0.002
'+0.002
',"+0.002
growth+0.002
ede+0.002
Neg logit contributions
 Kim-0.005
 Su-0.005
Kim-0.005
 cho-0.005
 Johnny-0.005
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
 conclusions+0.011
growth+0.010
ede+0.010
+0.010
',"+0.009
Neg logit contributions
 Kim-0.013
 Su-0.012
 cho-0.012
ug-0.011
 Johnny-0.011
Token way
Token ID835
Feature activation-4.27e-02

Pos logit contributions
 conclusions+0.021
ede+0.019
+0.019
growth+0.019
',"+0.018
Neg logit contributions
 Kim-0.028
 Su-0.026
 cho-0.025
ug-0.025
 Johnny-0.025
Token she
Token ID673
Feature activation-1.07e+00

Pos logit contributions
 conclusions+0.001
+0.001
',"+0.001
ede+0.001
'."+0.001
Neg logit contributions
 Kim-0.001
 cho-0.001
 Su-0.001
ug-0.001
coat-0.001
Token solved
Token ID16019
Feature activation-1.80e+00

Pos logit contributions
 conclusions+0.021
ede+0.021
+0.021
ils+0.020
growth+0.020
Neg logit contributions
 Kim-0.028
 Su-0.026
 cho-0.025
ug-0.024
Kim-0.024
Token the
Token ID262
Feature activation-9.30e-02

Pos logit contributions
 conclusions+0.043
ede+0.039
growth+0.039
+0.038
',"+0.037
Neg logit contributions
 Kim-0.043
 Su-0.038
 cho-0.037
 Johnny-0.037
Kim-0.035
Token math
Token ID10688
Feature activation-5.10e+00

Pos logit contributions
 conclusions+0.002
growth+0.002
ede+0.002
+0.002
',"+0.002
Neg logit contributions
 Kim-0.002
 Su-0.002
 Johnny-0.002
 cho-0.002
 driving-0.002
Token equations
Token ID27490
Feature activation-2.86e+00

Pos logit contributions
 conclusions+0.121
growth+0.111
+0.103
cious+0.101
',"+0.100
Neg logit contributions
 Kim-0.113
 driving-0.102
 cho-0.100
 speeding-0.096
 Johnny-0.096
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
 conclusions+0.048
ede+0.046
+0.045
',"+0.045
growth+0.043
Neg logit contributions
 Kim-0.083
 Su-0.077
 cho-0.076
 Johnny-0.075
Kim-0.074
Token
Token ID220
Feature activation+9.91e-01

Pos logit contributions
 conclusions+0.027
ede+0.024
growth+0.024
+0.023
',"+0.023
Neg logit contributions
 Kim-0.031
 Su-0.028
 Johnny-0.027
 cho-0.026
 Liz-0.026
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
 Kim+0.027
 cho+0.025
 Su+0.025
 Johnny+0.024
ug+0.024
Neg logit contributions
 conclusions-0.019
-0.017
growth-0.017
',"-0.017
ede-0.016
Token
Token ID198
Feature activation-2.20e+00

Pos logit contributions
 conclusions+0.029
ede+0.026
growth+0.026
+0.025
',"+0.024
Neg logit contributions
 Kim-0.030
 Su-0.027
 Johnny-0.027
 cho-0.026
Kim-0.026
Token was
Token ID373
Feature activation-1.29e+00

Pos logit contributions
 conclusions+0.016
ede+0.015
growth+0.015
+0.014
',"+0.014
Neg logit contributions
 Kim-0.018
 Su-0.017
 cho-0.016
ug-0.016
Kim-0.016
Token able
Token ID1498
Feature activation+6.83e-01

Pos logit contributions
 moral+0.021
 closely+0.020
'+0.019
ils+0.019
pieces+0.018
Neg logit contributions
ug-0.033
 Kim-0.032
Kim-0.031
 cho-0.031
 Su-0.030
Token to
Token ID284
Feature activation+6.60e-02

Pos logit contributions
ug+0.019
 Kim+0.018
Kim+0.018
 cho+0.018
rol+0.018
Neg logit contributions
 conclusions-0.011
 mushrooms-0.011
 vegetable-0.010
 Fran-0.010
 moral-0.010
Token solve
Token ID8494
Feature activation-1.98e+00

Pos logit contributions
 Kim+0.002
 Su+0.002
 cho+0.002
 Johnny+0.002
Kim+0.001
Neg logit contributions
 conclusions-0.001
growth-0.001
ede-0.001
-0.001
',"-0.001
Token the
Token ID262
Feature activation+5.46e-01

Pos logit contributions
 conclusions+0.052
growth+0.047
ede+0.047
+0.047
',"+0.046
Neg logit contributions
 Kim-0.043
 Su-0.037
 Johnny-0.037
 Liz-0.036
 cho-0.036
Token math
Token ID10688
Feature activation-4.90e+00

Pos logit contributions
 Kim+0.013
 Su+0.012
 Johnny+0.012
 cho+0.011
 driving+0.011
Neg logit contributions
 conclusions-0.013
growth-0.012
-0.011
ede-0.011
',"-0.011
Token problems
Token ID2761
Feature activation-1.54e+00

Pos logit contributions
 conclusions+0.112
growth+0.101
+0.096
',"+0.094
ede+0.094
Neg logit contributions
 Kim-0.119
 cho-0.104
 driving-0.104
 Su-0.103
 Johnny-0.102
Token and
Token ID290
Feature activation+5.04e-01

Pos logit contributions
 conclusions+0.027
ede+0.027
+0.027
',"+0.026
'."+0.025
Neg logit contributions
 Kim-0.041
 Su-0.039
 cho-0.039
Kim-0.037
ug-0.037
Token even
Token ID772
Feature activation-6.29e-02

Pos logit contributions
 Kim+0.012
 Su+0.011
 Johnny+0.011
 cho+0.011
 driving+0.010
Neg logit contributions
 conclusions-0.012
growth-0.010
ede-0.010
-0.010
',"-0.010
Token started
Token ID2067
Feature activation+1.63e+00

Pos logit contributions
 conclusions+0.002
ede+0.001
growth+0.001
+0.001
',"+0.001
Neg logit contributions
 Kim-0.001
 Su-0.001
 cho-0.001
 Johnny-0.001
Kim-0.001
Token to
Token ID284
Feature activation+8.37e-01

Pos logit contributions
 Kim+0.037
ug+0.034
 Su+0.034
 cho+0.034
Kim+0.033
Neg logit contributions
 conclusions-0.038
ede-0.035
-0.034
growth-0.034
',"-0.033
Token was
Token ID373
Feature activation+2.57e-01

Pos logit contributions
 Kim+0.030
 cho+0.028
ug+0.028
 speeding+0.027
coat+0.027
Neg logit contributions
 conclusions-0.020
growth-0.018
-0.018
ede-0.017
 moral-0.016
Token a
Token ID257
Feature activation-1.14e+00

Pos logit contributions
 Kim+0.006
 cho+0.006
 Su+0.006
ug+0.005
Kim+0.005
Neg logit contributions
 conclusions-0.006
growth-0.005
ede-0.005
-0.005
',"-0.005
Token silly
Token ID14397
Feature activation-8.81e-01

Pos logit contributions
 conclusions+0.030
+0.028
ede+0.028
 vegetable+0.027
growth+0.027
Neg logit contributions
 Kim-0.024
 Su-0.021
 Johnny-0.019
ug-0.019
 cho-0.019
Token cat
Token ID3797
Feature activation+4.85e-01

Pos logit contributions
 conclusions+0.020
ede+0.018
 vegetable+0.018
+0.018
 mushrooms+0.018
Neg logit contributions
 Kim-0.021
Kim-0.019
 Su-0.019
rol-0.019
ug-0.018
Token called
Token ID1444
Feature activation+1.04e+00

Pos logit contributions
 Kim+0.010
 Su+0.009
 cho+0.009
 Johnny+0.009
 driving+0.008
Neg logit contributions
 conclusions-0.013
growth-0.012
ede-0.012
-0.012
',"-0.011
Token Edward
Token ID10443
Feature activation-4.87e+00

Pos logit contributions
rol+0.018
 cho+0.017
ug+0.017
 driving+0.017
 Kim+0.017
Neg logit contributions
 conclusions-0.027
growth-0.026
 mushrooms-0.026
 Fran-0.025
',"-0.025
Token.
Token ID13
Feature activation-2.17e+00

Pos logit contributions
 conclusions+0.160
ede+0.145
growth+0.143
+0.140
 media+0.130
Neg logit contributions
 Kim-0.075
 cho-0.071
o-0.070
 driving-0.064
ug-0.062
Token Edward
Token ID10443
Feature activation-3.26e+00

Pos logit contributions
 conclusions+0.066
ede+0.058
pieces+0.054
growth+0.054
ils+0.054
Neg logit contributions
 Tom-0.037
 Tommy-0.036
 Kim-0.036
 Tim-0.035
 driving-0.031
Token loved
Token ID6151
Feature activation+3.19e-01

Pos logit contributions
 conclusions+0.096
growth+0.087
ede+0.084
',"+0.082
+0.081
Neg logit contributions
 Kim-0.061
 cho-0.055
 Su-0.053
 Johnny-0.053
 driving-0.050
Token to
Token ID284
Feature activation+2.03e-01

Pos logit contributions
 Kim+0.008
ug+0.008
rol+0.007
icking+0.007
Kim+0.007
Neg logit contributions
 mushrooms-0.006
 vegetable-0.006
 conclusions-0.006
 herbs-0.006
ede-0.006
Token sleep
Token ID3993
Feature activation-9.62e-01

Pos logit contributions
 Kim+0.005
 cho+0.005
 Su+0.005
 Johnny+0.005
Kim+0.005
Neg logit contributions
 conclusions-0.004
ede-0.004
growth-0.004
',"-0.004
-0.004
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
 conclusions+0.007
growth+0.007
',"+0.007
'+0.007
 closely+0.007
Neg logit contributions
 Kim-0.011
 cho-0.010
 Su-0.010
ug-0.010
coat-0.009
Token Vince
Token ID22376
Feature activation-1.75e+00

Pos logit contributions
+0.003
ede+0.003
growth+0.003
 conclusions+0.003
ils+0.003
Neg logit contributions
 Kim-0.003
ug-0.003
 Su-0.003
 cho-0.003
coat-0.003
Token helped
Token ID4193
Feature activation-5.43e-01

Pos logit contributions
ede+0.032
+0.032
ils+0.030
'+0.030
 conclusions+0.029
Neg logit contributions
 Kim-0.051
 Su-0.046
Kim-0.044
 cho-0.044
 Liz-0.043
Token to
Token ID284
Feature activation+3.31e-01

Pos logit contributions
 conclusions+0.012
ede+0.011
growth+0.010
',"+0.010
ils+0.010
Neg logit contributions
 Kim-0.013
 cho-0.013
 Su-0.012
rol-0.012
coat-0.012
Token wrap
Token ID14441
Feature activation-4.80e-01

Pos logit contributions
 Kim+0.008
 cho+0.008
rol+0.008
 Su+0.008
ug+0.008
Neg logit contributions
 conclusions-0.007
growth-0.006
ede-0.006
',"-0.006
-0.006
Token Luke
Token ID11336
Feature activation-4.86e+00

Pos logit contributions
 conclusions+0.011
growth+0.010
ede+0.010
',"+0.010
+0.010
Neg logit contributions
 Kim-0.011
 Su-0.010
 cho-0.010
Kim-0.010
 speeding-0.010
Token up
Token ID510
Feature activation-1.62e+00

Pos logit contributions
 conclusions+0.114
growth+0.102
ede+0.100
+0.099
',"+0.095
Neg logit contributions
 Kim-0.114
 Su-0.102
 Johnny-0.101
 driving-0.099
 cho-0.097
Token in
Token ID287
Feature activation-5.00e-01

Pos logit contributions
 conclusions+0.034
growth+0.031
ede+0.031
',"+0.030
 mushrooms+0.030
Neg logit contributions
 Kim-0.040
 cho-0.037
 Su-0.037
Kim-0.037
ug-0.036
Token a
Token ID257
Feature activation-5.50e-01

Pos logit contributions
 conclusions+0.012
growth+0.011
ede+0.011
+0.011
',"+0.011
Neg logit contributions
 Kim-0.011
 Su-0.010
 cho-0.010
ug-0.010
Kim-0.010
Token blanket
Token ID18447
Feature activation+1.09e+00

Pos logit contributions
 conclusions+0.013
ede+0.012
+0.012
growth+0.012
',"+0.012
Neg logit contributions
 Kim-0.013
 Su-0.012
 cho-0.011
 Johnny-0.011
ug-0.011
Token.
Token ID13
Feature activation-7.42e-01

Pos logit contributions
 Kim+0.036
 Su+0.034
Kim+0.034
 cho+0.033
ug+0.032
Neg logit contributions
 conclusions-0.012
',"-0.012
ede-0.012
'."-0.011
.'"-0.011
Token would
Token ID561
Feature activation-2.00e+00

Pos logit contributions
 conclusions+0.035
growth+0.031
ede+0.031
+0.030
',"+0.030
Neg logit contributions
 Kim-0.028
 Su-0.024
 cho-0.024
 Johnny-0.024
 driving-0.023
Token fill
Token ID6070
Feature activation+4.60e-02

Pos logit contributions
 conclusions+0.046
growth+0.041
ede+0.041
+0.041
',"+0.040
Neg logit contributions
 Kim-0.049
 Su-0.044
 cho-0.043
 Johnny-0.043
 driving-0.041
Token her
Token ID607
Feature activation-1.43e+00

Pos logit contributions
 Kim+0.001
 Su+0.001
 Johnny+0.001
 Liz+0.001
 cho+0.001
Neg logit contributions
 conclusions-0.001
-0.001
ede-0.001
growth-0.001
',"-0.001
Token notebook
Token ID20922
Feature activation-2.37e+00

Pos logit contributions
 conclusions+0.037
growth+0.033
ede+0.033
+0.033
',"+0.032
Neg logit contributions
 Kim-0.032
 Su-0.027
 Johnny-0.027
 cho-0.026
 driving-0.026
Token with
Token ID351
Feature activation-1.13e+00

Pos logit contributions
 conclusions+0.064
growth+0.059
ede+0.059
+0.058
',"+0.057
Neg logit contributions
 Kim-0.048
 Su-0.042
 cho-0.041
 Johnny-0.040
Kim-0.039
Token maths
Token ID47761
Feature activation-4.82e+00

Pos logit contributions
 conclusions+0.026
growth+0.023
ede+0.023
+0.022
',"+0.022
Neg logit contributions
 Kim-0.028
 Su-0.026
 cho-0.025
 Johnny-0.025
ug-0.024
Token problems
Token ID2761
Feature activation-1.44e+00

Pos logit contributions
 conclusions+0.125
growth+0.113
ede+0.107
+0.106
',"+0.104
Neg logit contributions
 Kim-0.104
 cho-0.090
 driving-0.090
 Su-0.090
 Johnny-0.089
Token and
Token ID290
Feature activation-4.19e-01

Pos logit contributions
 conclusions+0.026
+0.025
',"+0.025
ede+0.025
'."+0.023
Neg logit contributions
 Kim-0.039
 Su-0.037
 cho-0.036
 Johnny-0.035
Kim-0.034
Token try
Token ID1949
Feature activation-1.66e+00

Pos logit contributions
 conclusions+0.009
growth+0.008
ede+0.008
+0.008
',"+0.008
Neg logit contributions
 Kim-0.011
 Su-0.010
 Johnny-0.009
 cho-0.009
 driving-0.009
Token to
Token ID284
Feature activation-9.34e-01

Pos logit contributions
 conclusions+0.049
growth+0.048
+0.046
ede+0.046
',"+0.044
Neg logit contributions
 Kim-0.030
 Johnny-0.025
 Su-0.025
 driving-0.025
 cho-0.025
Token find
Token ID1064
Feature activation-2.38e-01

Pos logit contributions
 conclusions+0.022
growth+0.021
+0.021
ede+0.020
',"+0.020
Neg logit contributions
 Kim-0.022
 Johnny-0.019
 Su-0.019
 driving-0.019
Kim-0.018
Token Mystery
Token ID18208
Feature activation-1.94e+00

Pos logit contributions
 conclusions+0.011
ede+0.009
+0.009
growth+0.009
',"+0.009
Neg logit contributions
 Kim-0.014
 Su-0.013
 Johnny-0.012
ug-0.012
 cho-0.012
Token of
Token ID286
Feature activation+1.71e-01

Pos logit contributions
 conclusions+0.038
+0.035
growth+0.034
ede+0.034
',"+0.034
Neg logit contributions
 Kim-0.054
 Su-0.050
 Johnny-0.048
 cho-0.047
Kim-0.047
Token the
Token ID262
Feature activation-5.17e-01

Pos logit contributions
 Kim+0.004
 Su+0.004
 cho+0.004
 Johnny+0.003
Kim+0.003
Neg logit contributions
 conclusions-0.004
growth-0.004
ede-0.004
-0.004
',"-0.004
Token Bur
Token ID5481
Feature activation-2.28e+00

Pos logit contributions
 conclusions+0.011
ede+0.010
growth+0.009
+0.009
',"+0.009
Neg logit contributions
 Kim-0.014
 Su-0.013
 cho-0.012
 Johnny-0.012
ug-0.012
Tokenied
Token ID798
Feature activation-1.27e+00

Pos logit contributions
 conclusions+0.047
+0.043
',"+0.043
ede+0.043
growth+0.042
Neg logit contributions
 Kim-0.061
 Su-0.055
Kim-0.053
 Johnny-0.053
 cho-0.052
Token Bone
Token ID21717
Feature activation-4.78e+00

Pos logit contributions
 conclusions+0.027
',"+0.024
+0.024
growth+0.024
ede+0.024
Neg logit contributions
 Kim-0.034
 Su-0.031
 Johnny-0.030
Kim-0.030
 cho-0.029
Token".
Token ID1911
Feature activation-6.80e-01

Pos logit contributions
 conclusions+0.089
growth+0.077
ede+0.076
+0.073
',"+0.071
Neg logit contributions
 Kim-0.136
 cho-0.124
 Su-0.124
 Johnny-0.121
 driving-0.119
Token She
Token ID1375
Feature activation-4.18e-01

Pos logit contributions
 conclusions+0.011
',"+0.010
growth+0.010
+0.010
'."+0.010
Neg logit contributions
 Kim-0.019
 cho-0.019
ug-0.018
 speeding-0.018
reat-0.018
Token showed
Token ID3751
Feature activation-9.23e-01

Pos logit contributions
 conclusions+0.009
growth+0.009
ede+0.009
+0.009
',"+0.008
Neg logit contributions
 Kim-0.010
 Su-0.009
 cho-0.009
 Johnny-0.009
ug-0.009
Token it
Token ID340
Feature activation-4.79e-01

Pos logit contributions
 conclusions+0.024
growth+0.023
',"+0.022
ede+0.021
 Fran+0.021
Neg logit contributions
 Kim-0.018
 Su-0.016
 cho-0.016
ug-0.016
coat-0.016
Token to
Token ID284
Feature activation-1.24e+00

Pos logit contributions
 conclusions+0.015
ede+0.013
+0.013
growth+0.013
',"+0.013
Neg logit contributions
 Kim-0.008
 Su-0.007
 cho-0.007
 Johnny-0.007
 Liz-0.006
Token should
Token ID815
Feature activation-6.00e-01

Pos logit contributions
 conclusions+0.024
growth+0.022
+0.022
ede+0.022
',"+0.022
Neg logit contributions
 Kim-0.036
 Su-0.033
 cho-0.032
Kim-0.032
 Johnny-0.031
Token remember
Token ID3505
Feature activation-2.30e-01

Pos logit contributions
 conclusions+0.015
ede+0.014
+0.013
growth+0.013
',"+0.013
Neg logit contributions
 Kim-0.013
 Su-0.012
 cho-0.012
 Johnny-0.011
ug-0.011
Token to
Token ID284
Feature activation+4.69e-01

Pos logit contributions
 conclusions+0.006
+0.005
',"+0.005
growth+0.005
ede+0.005
Neg logit contributions
 Kim-0.005
 cho-0.005
 Su-0.005
Kim-0.005
ug-0.004
Token stay
Token ID2652
Feature activation-2.00e-01

Pos logit contributions
 Kim+0.011
 Su+0.010
 cho+0.010
 Johnny+0.010
ug+0.010
Neg logit contributions
 conclusions-0.011
ede-0.010
growth-0.010
-0.010
',"-0.009
Token close
Token ID1969
Feature activation-6.40e-01

Pos logit contributions
 conclusions+0.005
+0.005
',"+0.005
ede+0.005
growth+0.005
Neg logit contributions
 Kim-0.004
 Su-0.004
 cho-0.004
Kim-0.003
 Johnny-0.003
Tokenby
Token ID1525
Feature activation-4.76e+00

Pos logit contributions
',"+0.013
 conclusions+0.013
'."+0.012
.'"+0.012
+0.011
Neg logit contributions
 Kim-0.017
Kim-0.016
 Su-0.015
 Liz-0.015
 cho-0.015
Token,"
Token ID553
Feature activation+4.25e-01

Pos logit contributions
 conclusions+0.108
ede+0.097
growth+0.097
+0.096
',"+0.095
Neg logit contributions
 Kim-0.118
 Su-0.107
 cho-0.104
 Johnny-0.103
Kim-0.100
Token she
Token ID673
Feature activation+1.59e+00

Pos logit contributions
 Kim+0.009
 Su+0.008
 cho+0.008
coat+0.007
 Johnny+0.007
Neg logit contributions
 conclusions-0.011
-0.010
ede-0.010
growth-0.010
',"-0.010
Token said
Token ID531
Feature activation+9.96e-01

Pos logit contributions
 Kim+0.031
 Su+0.027
 cho+0.026
 Johnny+0.026
Kim+0.025
Neg logit contributions
 conclusions-0.045
ede-0.041
growth-0.041
-0.041
',"-0.040
Token.
Token ID13
Feature activation+5.61e-01

Pos logit contributions
 Kim+0.029
 Su+0.026
 cho+0.026
Kim+0.025
 Johnny+0.025
Neg logit contributions
 conclusions-0.017
',"-0.016
-0.016
ede-0.016
growth-0.015
Token
Token ID198
Feature activation+8.86e-01

Pos logit contributions
 Kim+0.016
 cho+0.015
 Su+0.015
coat+0.015
 driving+0.015
Neg logit contributions
 conclusions-0.009
-0.009
',"-0.009
growth-0.009
'."-0.008
Token but
Token ID475
Feature activation+1.05e+00

Pos logit contributions
ug+0.025
rol+0.023
ick+0.023
 Kim+0.023
loo+0.022
Neg logit contributions
 conclusions-0.012
 moral-0.011
 mushrooms-0.010
der-0.010
 wrists-0.009
Token they
Token ID484
Feature activation+1.03e+00

Pos logit contributions
 Kim+0.022
 Su+0.020
 cho+0.019
 Johnny+0.019
ug+0.019
Neg logit contributions
 conclusions-0.027
ede-0.025
-0.025
growth-0.025
',"-0.024
Token had
Token ID550
Feature activation-5.73e-01

Pos logit contributions
 Kim+0.024
 Su+0.022
 cho+0.021
 Johnny+0.021
Kim+0.020
Neg logit contributions
 conclusions-0.025
growth-0.023
ede-0.023
-0.023
',"-0.022
Token to
Token ID284
Feature activation-3.05e-01

Pos logit contributions
 conclusions+0.013
growth+0.012
',"+0.012
 moral+0.012
ede+0.012
Neg logit contributions
 Kim-0.013
ug-0.012
Kim-0.012
 Su-0.012
 cho-0.011
Token sell
Token ID3677
Feature activation-5.93e-01

Pos logit contributions
 conclusions+0.007
ede+0.006
growth+0.006
',"+0.006
+0.006
Neg logit contributions
 Kim-0.007
 Su-0.007
 cho-0.007
 Johnny-0.007
ug-0.006
Token True
Token ID6407
Feature activation-4.72e+00

Pos logit contributions
 conclusions+0.016
ede+0.016
+0.016
growth+0.016
cious+0.014
Neg logit contributions
 Kim-0.012
 Tim-0.010
 Johnny-0.010
 Su-0.010
 cho-0.010
Token.
Token ID13
Feature activation-8.28e-01

Pos logit contributions
 conclusions+0.129
growth+0.120
ede+0.115
+0.115
 tripod+0.111
Neg logit contributions
 Kim-0.088
 driving-0.075
 Tim-0.074
 cho-0.073
 Johnny-0.072
Token She
Token ID1375
Feature activation+1.07e-01

Pos logit contributions
 conclusions+0.025
ede+0.024
growth+0.023
+0.023
',"+0.023
Neg logit contributions
 Kim-0.015
 Su-0.013
 Johnny-0.012
 Liz-0.011
 Tim-0.011
Token said
Token ID531
Feature activation+1.19e+00

Pos logit contributions
 Kim+0.002
 Su+0.002
ug+0.002
 cho+0.002
reat+0.002
Neg logit contributions
 conclusions-0.002
ede-0.002
 moral-0.002
growth-0.002
-0.002
Token she
Token ID673
Feature activation-5.60e-02

Pos logit contributions
 Kim+0.025
 Su+0.022
 cho+0.022
 Johnny+0.021
 driving+0.020
Neg logit contributions
 conclusions-0.032
growth-0.030
ede-0.030
-0.029
',"-0.028
Token would
Token ID561
Feature activation-9.93e-01

Pos logit contributions
 conclusions+0.001
ede+0.001
+0.001
growth+0.001
',"+0.001
Neg logit contributions
 Kim-0.001
 Su-0.001
 cho-0.001
 Johnny-0.001
ug-0.001
Token would
Token ID561
Feature activation-1.50e+00

Pos logit contributions
 conclusions+0.016
ede+0.015
growth+0.014
+0.014
',"+0.014
Neg logit contributions
 Kim-0.016
 Su-0.015
 cho-0.014
 Johnny-0.014
Kim-0.014
Token fill
Token ID6070
Feature activation+3.42e-01

Pos logit contributions
 conclusions+0.033
growth+0.030
ede+0.029
+0.029
',"+0.028
Neg logit contributions
 Kim-0.038
 Su-0.034
 Johnny-0.034
 cho-0.033
 driving-0.033
Token her
Token ID607
Feature activation-1.36e+00

Pos logit contributions
 Kim+0.007
 Johnny+0.006
 Su+0.006
 Liz+0.006
 Tim+0.005
Neg logit contributions
 conclusions-0.010
-0.009
ede-0.009
growth-0.009
',"-0.009
Token room
Token ID2119
Feature activation-1.24e+00

Pos logit contributions
 conclusions+0.038
growth+0.035
ede+0.034
',"+0.034
+0.033
Neg logit contributions
 Kim-0.027
 Johnny-0.023
 Su-0.023
 driving-0.022
 Liz-0.022
Token with
Token ID351
Feature activation-9.31e-01

Pos logit contributions
 conclusions+0.030
ede+0.028
+0.027
growth+0.027
',"+0.027
Neg logit contributions
 Kim-0.028
 Su-0.026
 cho-0.025
 Johnny-0.024
Kim-0.024
Token maths
Token ID47761
Feature activation-4.72e+00

Pos logit contributions
 conclusions+0.022
growth+0.020
ede+0.020
+0.019
',"+0.019
Neg logit contributions
 Kim-0.023
 Su-0.020
 cho-0.020
 Johnny-0.020
Kim-0.019
Token books
Token ID3835
Feature activation-1.09e+00

Pos logit contributions
 conclusions+0.121
growth+0.110
ede+0.101
+0.101
',"+0.099
Neg logit contributions
 Kim-0.102
 driving-0.091
 cho-0.089
 Johnny-0.088
 Su-0.087
Token.
Token ID13
Feature activation+3.61e-02

Pos logit contributions
 conclusions+0.021
',"+0.021
ede+0.021
+0.020
'."+0.020
Neg logit contributions
 Kim-0.029
 Su-0.026
 cho-0.026
 Johnny-0.025
Kim-0.025
Token She
Token ID1375
Feature activation-9.37e-01

Pos logit contributions
 Kim+0.001
 Liz+0.001
 Johnny+0.001
 Tim+0.001
Kim+0.001
Neg logit contributions
 conclusions-0.001
sie-0.001
beans-0.001
ils-0.001
ede-0.001
Token would
Token ID561
Feature activation-1.77e+00

Pos logit contributions
 conclusions+0.022
ede+0.020
growth+0.020
+0.020
',"+0.020
Neg logit contributions
 Kim-0.022
 Su-0.020
 cho-0.019
 Johnny-0.019
Kim-0.018
Token add
Token ID751
Feature activation-1.68e+00

Pos logit contributions
 conclusions+0.039
growth+0.035
ede+0.034
+0.034
',"+0.034
Neg logit contributions
 Kim-0.045
 Su-0.041
 cho-0.040
 Johnny-0.040
 driving-0.039
Token a
Token ID257
Feature activation+5.52e-01

Pos logit contributions
 Kim+0.025
ug+0.022
 Liz+0.022
Kim+0.022
 cho+0.021
Neg logit contributions
'-0.015
',"-0.014
'."-0.014
growth-0.014
ede-0.014
Token plan
Token ID1410
Feature activation+8.14e-01

Pos logit contributions
 Kim+0.013
 Su+0.012
ug+0.012
Kim+0.011
 cho+0.011
Neg logit contributions
 conclusions-0.013
ede-0.012
 moral-0.011
-0.011
 vegetable-0.011
Token.
Token ID13
Feature activation-5.71e-01

Pos logit contributions
 Kim+0.027
 cho+0.025
Kim+0.025
 Su+0.025
ug+0.024
Neg logit contributions
',"-0.009
'."-0.009
'-0.009
ede-0.008
 conclusions-0.008
Token She
Token ID1375
Feature activation-3.76e-01

Pos logit contributions
 conclusions+0.013
growth+0.013
+0.012
',"+0.011
 vegetable+0.011
Neg logit contributions
 cho-0.013
 Kim-0.013
ug-0.012
 speeding-0.012
 driving-0.012
Token took
Token ID1718
Feature activation-1.47e+00

Pos logit contributions
ede+0.006
+0.006
',"+0.006
ils+0.006
 moral+0.006
Neg logit contributions
 Kim-0.011
Kim-0.010
 Su-0.010
 cho-0.010
rol-0.010
Token Justin
Token ID10799
Feature activation-4.72e+00

Pos logit contributions
growth+0.027
 conclusions+0.026
 mushrooms+0.025
ils+0.024
 Fran+0.023
Neg logit contributions
 Kim-0.038
Kim-0.037
 cho-0.037
ug-0.036
 speeding-0.036
Token to
Token ID284
Feature activation-9.25e-01

Pos logit contributions
 conclusions+0.100
growth+0.091
ede+0.091
+0.090
',"+0.088
Neg logit contributions
 Kim-0.123
 Su-0.112
 cho-0.109
 Johnny-0.107
Kim-0.105
Token the
Token ID262
Feature activation-9.96e-02

Pos logit contributions
 conclusions+0.021
growth+0.019
ede+0.017
 vegetable+0.017
',"+0.017
Neg logit contributions
 Kim-0.022
 cho-0.022
ug-0.022
rol-0.021
Kim-0.021
Token store
Token ID3650
Feature activation+3.31e-01

Pos logit contributions
 conclusions+0.002
growth+0.002
+0.002
 vegetable+0.002
ede+0.002
Neg logit contributions
 Kim-0.003
 Su-0.002
ug-0.002
Kim-0.002
 Johnny-0.002
Token and
Token ID290
Feature activation+2.13e-01

Pos logit contributions
 Kim+0.009
 cho+0.008
 Su+0.008
ug+0.008
Kim+0.008
Neg logit contributions
 conclusions-0.005
',"-0.005
-0.005
'."-0.005
'-0.005
Token bought
Token ID5839
Feature activation-3.53e-01

Pos logit contributions
ug+0.005
 cho+0.005
 Kim+0.005
 speeding+0.005
umbling+0.005
Neg logit contributions
-0.004
 vegetable-0.004
ede-0.004
 conclusions-0.004
growth-0.004
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
 Kim+0.027
 Su+0.023
 Johnny+0.022
 cho+0.022
 driving+0.021
Neg logit contributions
 conclusions-0.044
-0.042
ede-0.042
growth-0.041
',"-0.040
Token festival
Token ID10876
Feature activation-1.84e+00

Pos logit contributions
 Kim+0.005
 Su+0.005
 cho+0.005
 Johnny+0.005
ug+0.004
Neg logit contributions
 conclusions-0.005
ede-0.005
growth-0.005
-0.004
',"-0.004
Token.
Token ID13
Feature activation-6.45e-01

Pos logit contributions
 conclusions+0.031
ede+0.028
',"+0.027
+0.027
growth+0.027
Neg logit contributions
 Kim-0.055
 Su-0.051
Kim-0.050
 cho-0.049
 Johnny-0.049
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
growth+0.013
 conclusions+0.013
+0.012
 mushrooms+0.012
',"+0.012
Neg logit contributions
 cho-0.016
 Kim-0.016
 Su-0.015
coat-0.015
ug-0.015
Token played
Token ID2826
Feature activation-1.02e-01

Pos logit contributions
ede+0.017
ils+0.016
+0.016
 moral+0.016
 wrists+0.015
Neg logit contributions
 Kim-0.029
Kim-0.026
 Su-0.025
reat-0.025
ug-0.025
Token tug
Token ID27762
Feature activation-4.65e+00

Pos logit contributions
 conclusions+0.002
ede+0.002
growth+0.002
+0.002
',"+0.002
Neg logit contributions
 Kim-0.003
 Su-0.002
 Johnny-0.002
 cho-0.002
Kim-0.002
Token-
Token ID12
Feature activation+1.58e-04

Pos logit contributions
 conclusions+0.102
ede+0.096
ils+0.093
+0.092
',"+0.090
Neg logit contributions
 Kim-0.117
 Su-0.105
Kim-0.102
 Johnny-0.099
 Liz-0.098
Tokenof
Token ID1659
Feature activation-3.98e+00

Pos logit contributions
 Kim+0.000
 Su+0.000
 Johnny+0.000
 Liz+0.000
 driving+0.000
Neg logit contributions
 conclusions-0.000
growth-0.000
ede-0.000
-0.000
',"-0.000
Token-
Token ID12
Feature activation-5.13e-05

Pos logit contributions
 conclusions+0.075
ils+0.073
ede+0.071
+0.070
'+0.069
Neg logit contributions
 Kim-0.115
 Su-0.104
Kim-0.103
 Johnny-0.099
 Liz-0.098
Tokenwar
Token ID5767
Feature activation-1.87e+00

Pos logit contributions
 conclusions+0.000
ede+0.000
growth+0.000
+0.000
',"+0.000
Neg logit contributions
 Kim-0.000
 Su-0.000
 Johnny-0.000
 Liz-0.000
 driving-0.000
Token and
Token ID290
Feature activation+1.78e-01

Pos logit contributions
ede+0.032
 closely+0.030
 conclusions+0.029
growth+0.028
'+0.028
Neg logit contributions
 Kim-0.056
 Su-0.054
 Johnny-0.048
Kim-0.048
Su-0.047
Token but
Token ID475
Feature activation+1.08e+00

Pos logit contributions
 Kim+0.004
ug+0.004
 Su+0.004
 cho+0.004
rol+0.004
Neg logit contributions
 conclusions-0.004
 moral-0.003
ede-0.003
 mushrooms-0.003
 vegetable-0.003
Token you
Token ID345
Feature activation-7.72e-01

Pos logit contributions
 Kim+0.022
coat+0.022
 Su+0.022
ug+0.021
rol+0.021
Neg logit contributions
 conclusions-0.026
 mushrooms-0.024
 moral-0.024
 vegetable-0.024
ede-0.023
Token have
Token ID423
Feature activation-1.79e+00

Pos logit contributions
 conclusions+0.018
growth+0.016
+0.016
ede+0.015
',"+0.015
Neg logit contributions
 Kim-0.019
 Su-0.017
 Johnny-0.017
 cho-0.017
 driving-0.016
Token to
Token ID284
Feature activation-8.64e-01

Pos logit contributions
 conclusions+0.041
 mushrooms+0.037
',"+0.037
growth+0.037
+0.036
Neg logit contributions
 Kim-0.041
 Su-0.038
Kim-0.038
ug-0.038
rol-0.036
Token give
Token ID1577
Feature activation-7.64e-01

Pos logit contributions
 conclusions+0.018
ede+0.016
growth+0.016
+0.016
',"+0.016
Neg logit contributions
 Kim-0.023
 Su-0.021
 cho-0.020
 Johnny-0.020
ug-0.020
Token Coco
Token ID48222
Feature activation-4.64e+00

Pos logit contributions
 conclusions+0.021
growth+0.019
ede+0.019
+0.019
',"+0.019
Neg logit contributions
 Kim-0.015
 Su-0.013
 cho-0.013
 Johnny-0.013
Kim-0.012
Token a
Token ID257
Feature activation+2.14e-01

Pos logit contributions
 conclusions+0.126
growth+0.118
+0.115
ede+0.115
',"+0.107
Neg logit contributions
 Kim-0.090
 cho-0.079
 Johnny-0.078
 Su-0.076
reat-0.074
Token chance
Token ID2863
Feature activation-2.99e-01

Pos logit contributions
 Kim+0.006
 Su+0.005
 cho+0.005
ug+0.005
 Johnny+0.005
Neg logit contributions
 conclusions-0.005
ede-0.004
growth-0.004
',"-0.004
-0.004
Token.
Token ID13
Feature activation+5.13e-01

Pos logit contributions
 conclusions+0.006
',"+0.005
ede+0.005
'."+0.005
+0.005
Neg logit contributions
 Kim-0.008
 Su-0.008
 cho-0.008
Kim-0.007
 Johnny-0.007
Token He
Token ID679
Feature activation-1.45e+00

Pos logit contributions
 Kim+0.011
 Su+0.009
 Johnny+0.009
 cho+0.009
Kim+0.008
Neg logit contributions
 conclusions-0.014
ede-0.013
growth-0.013
-0.013
',"-0.012
Token is
Token ID318
Feature activation-1.07e+00

Pos logit contributions
ede+0.022
 conclusions+0.021
+0.020
 moral+0.020
',"+0.020
Neg logit contributions
 Kim-0.046
 Su-0.045
Kim-0.042
Su-0.042
ug-0.041
Token there
Token ID612
Feature activation+1.26e+00

Pos logit contributions
 Kim+0.035
 Su+0.032
 cho+0.032
Kim+0.031
 Johnny+0.031
Neg logit contributions
 conclusions-0.030
-0.029
ede-0.028
',"-0.028
growth-0.027
Token was
Token ID373
Feature activation+9.33e-01

Pos logit contributions
 Kim+0.037
 cho+0.036
ug+0.036
 speeding+0.036
coat+0.035
Neg logit contributions
 conclusions-0.017
-0.016
growth-0.016
ede-0.015
'-0.014
Token an
Token ID281
Feature activation+5.82e-01

Pos logit contributions
 Kim+0.023
 cho+0.021
 Su+0.021
Kim+0.020
 Johnny+0.020
Neg logit contributions
 conclusions-0.019
growth-0.019
ede-0.017
 mushrooms-0.017
',"-0.017
Token elephant
Token ID20950
Feature activation+1.16e-01

Pos logit contributions
 Kim+0.013
 cho+0.011
Kim+0.011
 Su+0.011
rol+0.011
Neg logit contributions
 conclusions-0.014
 vegetable-0.013
 moral-0.013
growth-0.013
ede-0.013
Token named
Token ID3706
Feature activation+1.61e+00

Pos logit contributions
 Kim+0.003
 Su+0.003
 cho+0.003
 Johnny+0.003
Kim+0.003
Neg logit contributions
 conclusions-0.002
ede-0.002
',"-0.002
growth-0.002
-0.002
Token Edward
Token ID10443
Feature activation-4.62e+00

Pos logit contributions
 Kim+0.020
 Su+0.016
 cho+0.016
 Johnny+0.015
ug+0.015
Neg logit contributions
 conclusions-0.055
growth-0.052
ede-0.052
-0.051
',"-0.051
Token.
Token ID13
Feature activation-2.01e+00

Pos logit contributions
 conclusions+0.143
ede+0.125
growth+0.124
+0.122
 wrists+0.118
Neg logit contributions
 Kim-0.080
 cho-0.072
o-0.070
 driving-0.068
 Su-0.067
Token Edward
Token ID10443
Feature activation-3.09e+00

Pos logit contributions
 conclusions+0.061
ede+0.050
pieces+0.049
sie+0.047
growth+0.047
Neg logit contributions
 Tom-0.037
 Kim-0.035
 Tommy-0.034
 Tim-0.033
 driving-0.030
Token was
Token ID373
Feature activation-4.32e-01

Pos logit contributions
 conclusions+0.089
growth+0.080
ede+0.078
+0.075
',"+0.075
Neg logit contributions
 Kim-0.061
 cho-0.054
 Su-0.053
 Johnny-0.053
 driving-0.051
Token very
Token ID845
Feature activation-7.85e-01

Pos logit contributions
 Fran+0.005
 vegetable+0.005
 mushrooms+0.005
cious+0.005
 Blossom+0.005
Neg logit contributions
coat-0.012
ug-0.012
EEP-0.012
icking-0.012
less-0.012
Token troubled
Token ID17840
Feature activation+6.70e-01

Pos logit contributions
 vegetable+0.014
cious+0.014
 Fran+0.014
 mushrooms+0.013
 conclusions+0.013
Neg logit contributions
 Kim-0.019
Kim-0.018
 Su-0.018
rol-0.018
coat-0.018
Tokenelt
Token ID2120
Feature activation+3.12e-01

Pos logit contributions
 cho+0.020
 Kim+0.020
 Su+0.020
 speeding+0.019
 driving+0.018
Neg logit contributions
-0.036
',"-0.034
growth-0.033
 wrists-0.033
 conclusions-0.032
Token down
Token ID866
Feature activation-9.75e-01

Pos logit contributions
 Kim+0.007
 Su+0.006
 cho+0.006
 Johnny+0.006
Kim+0.005
Neg logit contributions
 conclusions-0.008
ede-0.008
-0.007
growth-0.007
',"-0.007
Token and
Token ID290
Feature activation-2.02e-02

Pos logit contributions
+0.018
 moral+0.018
',"+0.018
 conclusions+0.018
ede+0.017
Neg logit contributions
 Kim-0.027
 Su-0.024
Kim-0.023
 Liz-0.023
 cho-0.023
Token solved
Token ID16019
Feature activation-1.19e+00

Pos logit contributions
 conclusions+0.000
ede+0.000
+0.000
growth+0.000
',"+0.000
Neg logit contributions
 Kim-0.000
 Su-0.000
 cho-0.000
ug-0.000
Kim-0.000
Token a
Token ID257
Feature activation-3.90e-01

Pos logit contributions
 conclusions+0.033
ede+0.032
+0.031
growth+0.030
',"+0.029
Neg logit contributions
 Kim-0.024
 Su-0.021
 Johnny-0.020
 cho-0.020
 Liz-0.019
Token math
Token ID10688
Feature activation-4.61e+00

Pos logit contributions
 conclusions+0.009
growth+0.008
ede+0.008
+0.008
',"+0.008
Neg logit contributions
 Kim-0.009
 cho-0.009
 Su-0.008
 Johnny-0.008
 driving-0.008
Token problem
Token ID1917
Feature activation-1.13e-01

Pos logit contributions
 conclusions+0.110
growth+0.104
+0.095
cious+0.094
',"+0.091
Neg logit contributions
 Kim-0.099
 driving-0.094
 cho-0.093
 speeding-0.088
coat-0.084
Token in
Token ID287
Feature activation-2.44e-01

Pos logit contributions
 conclusions+0.002
ede+0.002
+0.002
',"+0.002
growth+0.002
Neg logit contributions
 Kim-0.003
 Su-0.003
 cho-0.003
 Johnny-0.003
Kim-0.003
Token her
Token ID607
Feature activation+1.77e-01

Pos logit contributions
 conclusions+0.006
growth+0.005
+0.005
ede+0.005
',"+0.005
Neg logit contributions
 Kim-0.006
 Su-0.005
 cho-0.005
ug-0.005
Kim-0.005
Token head
Token ID1182
Feature activation-1.10e+00

Pos logit contributions
 Kim+0.004
 Su+0.003
 Johnny+0.003
 cho+0.003
 driving+0.003
Neg logit contributions
 conclusions-0.005
growth-0.004
ede-0.004
-0.004
',"-0.004
Token.
Token ID13
Feature activation-5.14e-01

Pos logit contributions
 conclusions+0.015
+0.015
',"+0.014
ede+0.014
'."+0.014
Neg logit contributions
 Kim-0.034
 Su-0.032
ug-0.032
 cho-0.032
Kim-0.031
Token the
Token ID262
Feature activation-5.33e-01

Pos logit contributions
 conclusions+0.023
ede+0.021
growth+0.021
+0.021
',"+0.020
Neg logit contributions
 Kim-0.027
 Su-0.024
 Johnny-0.023
Kim-0.023
 cho-0.023
Token cheese
Token ID9891
Feature activation-1.74e+00

Pos logit contributions
 conclusions+0.011
ede+0.010
growth+0.009
',"+0.009
+0.009
Neg logit contributions
 Kim-0.015
 Su-0.014
 Johnny-0.013
Kim-0.013
 cho-0.013
Token and
Token ID290
Feature activation+3.42e-01

Pos logit contributions
 conclusions+0.042
ede+0.039
growth+0.038
+0.038
',"+0.037
Neg logit contributions
 Kim-0.040
 Su-0.036
 Johnny-0.035
 cho-0.035
Kim-0.034
Token add
Token ID751
Feature activation-1.72e+00

Pos logit contributions
 Kim+0.009
 Su+0.008
 cho+0.008
 Johnny+0.008
 driving+0.007
Neg logit contributions
 conclusions-0.008
ede-0.007
growth-0.007
-0.007
',"-0.007
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
 conclusions+0.036
growth+0.033
ede+0.032
+0.032
',"+0.032
Neg logit contributions
 Kim-0.045
 Su-0.041
 cho-0.040
 Johnny-0.039
Kim-0.039
Token topp
Token ID23126
Feature activation-4.60e+00

Pos logit contributions
 conclusions+0.025
ede+0.023
growth+0.022
+0.022
',"+0.022
Neg logit contributions
 Kim-0.032
 Su-0.029
 cho-0.028
 Johnny-0.028
Kim-0.027
Tokenings
Token ID654
Feature activation-7.38e-01

Pos logit contributions
 conclusions+0.128
growth+0.117
+0.113
ede+0.112
',"+0.107
Neg logit contributions
 Kim-0.089
 cho-0.083
 Su-0.077
ug-0.075
reat-0.075
Token.
Token ID13
Feature activation-7.35e-01

Pos logit contributions
 conclusions+0.014
ede+0.013
+0.013
',"+0.013
growth+0.013
Neg logit contributions
 Kim-0.020
 Su-0.018
 cho-0.018
Kim-0.017
 Johnny-0.017
Token Lily
Token ID20037
Feature activation-3.79e-01

Pos logit contributions
 conclusions+0.017
+0.016
growth+0.015
',"+0.015
ede+0.015
Neg logit contributions
 Kim-0.017
 cho-0.015
ug-0.015
 speeding-0.015
 driving-0.015
Token and
Token ID290
Feature activation-1.86e+00

Pos logit contributions
ede+0.008
 conclusions+0.007
+0.007
growth+0.007
',"+0.007
Neg logit contributions
 Kim-0.010
 Su-0.009
 cho-0.009
Kim-0.009
ug-0.009
Token Ben
Token ID3932
Feature activation-4.64e-01

Pos logit contributions
 conclusions+0.047
ede+0.044
+0.044
growth+0.043
',"+0.042
Neg logit contributions
 Kim-0.038
 cho-0.036
 Su-0.035
ug-0.034
 speeding-0.034
Token garden
Token ID11376
Feature activation+2.47e-01

Pos logit contributions
 conclusions+0.023
growth+0.021
ede+0.021
+0.021
',"+0.021
Neg logit contributions
 Kim-0.017
 Su-0.015
 cho-0.015
 Johnny-0.015
 driving-0.014
Token trying
Token ID2111
Feature activation-1.79e+00

Pos logit contributions
 Kim+0.007
 Su+0.006
 cho+0.006
ug+0.006
 Johnny+0.006
Neg logit contributions
 conclusions-0.005
ede-0.005
-0.004
growth-0.004
 mushrooms-0.004
Token to
Token ID284
Feature activation-7.77e-01

Pos logit contributions
 conclusions+0.056
growth+0.055
+0.055
ede+0.052
',"+0.052
Neg logit contributions
 Kim-0.028
 Su-0.024
 driving-0.023
 Johnny-0.023
 cho-0.023
Token solve
Token ID8494
Feature activation-1.95e+00

Pos logit contributions
 conclusions+0.017
+0.016
growth+0.016
',"+0.016
ede+0.016
Neg logit contributions
 Kim-0.019
 Su-0.017
 Johnny-0.016
 driving-0.016
 speeding-0.016
Token the
Token ID262
Feature activation+6.68e-01

Pos logit contributions
 conclusions+0.050
ede+0.046
growth+0.046
+0.045
',"+0.044
Neg logit contributions
 Kim-0.043
 Su-0.038
 Johnny-0.037
 cho-0.036
 Liz-0.036
Token maths
Token ID47761
Feature activation-4.56e+00

Pos logit contributions
 Kim+0.015
 Johnny+0.013
 Su+0.013
 cho+0.013
 speeding+0.013
Neg logit contributions
 conclusions-0.016
growth-0.016
-0.014
',"-0.014
ede-0.014
Token problems
Token ID2761
Feature activation-1.28e+00

Pos logit contributions
 conclusions+0.102
growth+0.092
+0.087
',"+0.086
ede+0.085
Neg logit contributions
 Kim-0.110
 cho-0.098
 driving-0.098
 Su-0.097
 Johnny-0.096
Token.
Token ID13
Feature activation+5.01e-01

Pos logit contributions
 conclusions+0.021
ede+0.019
+0.019
',"+0.019
growth+0.018
Neg logit contributions
 Kim-0.038
 Su-0.035
 cho-0.035
 Johnny-0.034
Kim-0.034
Token In
Token ID554
Feature activation+1.96e-01

Pos logit contributions
 Kim+0.011
 Liz+0.009
 Johnny+0.009
Kim+0.009
 Su+0.009
Neg logit contributions
 conclusions-0.014
ede-0.013
growth-0.012
ils-0.012
sie-0.012
Token no
Token ID645
Feature activation-1.14e+00

Pos logit contributions
 Kim+0.004
 Su+0.004
 cho+0.004
 Johnny+0.004
 Liz+0.003
Neg logit contributions
 conclusions-0.005
ede-0.005
growth-0.005
-0.005
',"-0.005
Token time
Token ID640
Feature activation-6.00e-01

Pos logit contributions
 conclusions+0.034
+0.031
ede+0.031
growth+0.031
',"+0.031
Neg logit contributions
 Kim-0.020
 Su-0.017
 Johnny-0.017
 cho-0.016
 driving-0.016
Token,
Token ID11
Feature activation-6.99e-01

Pos logit contributions
+0.027
 conclusions+0.026
',"+0.025
growth+0.024
ede+0.024
Neg logit contributions
 Kim-0.036
 Su-0.032
 Johnny-0.032
ug-0.032
 speeding-0.032
Token Sara
Token ID24799
Feature activation+2.05e-01

Pos logit contributions
 conclusions+0.018
+0.017
growth+0.016
ede+0.016
',"+0.016
Neg logit contributions
 Kim-0.014
ug-0.013
 cho-0.013
 Johnny-0.013
 driving-0.012
Token's
Token ID338
Feature activation+5.01e-01

Pos logit contributions
 Kim+0.006
 Su+0.006
rol+0.006
Kim+0.006
ug+0.006
Neg logit contributions
ede-0.003
'-0.003
ils-0.003
-0.003
 moral-0.003
Token mom
Token ID1995
Feature activation+1.42e+00

Pos logit contributions
 Kim+0.014
ug+0.014
rol+0.013
 Su+0.013
Kim+0.013
Neg logit contributions
 moral-0.009
 conclusions-0.009
 vegetable-0.009
 mushrooms-0.009
 mushroom-0.008
Token took
Token ID1718
Feature activation-1.38e+00

Pos logit contributions
ug+0.045
rol+0.043
 Kim+0.042
 Su+0.042
reat+0.041
Neg logit contributions
'-0.018
ils-0.018
ede-0.016
 moral-0.016
 decide-0.015
Token True
Token ID6407
Feature activation-4.54e+00

Pos logit contributions
 conclusions+0.032
growth+0.031
',"+0.029
ede+0.029
+0.029
Neg logit contributions
 Kim-0.032
 Su-0.029
 cho-0.028
ug-0.028
 Johnny-0.028
Token to
Token ID284
Feature activation-2.62e-01

Pos logit contributions
 conclusions+0.132
growth+0.121
ede+0.121
+0.121
',"+0.116
Neg logit contributions
 Kim-0.082
 Su-0.070
 Johnny-0.068
 cho-0.068
 driving-0.068
Token a
Token ID257
Feature activation-1.32e+00

Pos logit contributions
 conclusions+0.008
+0.008
ede+0.008
cious+0.008
ils+0.008
Neg logit contributions
 Kim-0.004
 Tim-0.003
 Su-0.003
 Johnny-0.003
 Tommy-0.003
Token yard
Token ID12699
Feature activation+1.19e+00

Pos logit contributions
 conclusions+0.034
growth+0.032
ede+0.031
+0.031
',"+0.030
Neg logit contributions
 Kim-0.028
 Su-0.025
 cho-0.025
 Johnny-0.024
 driving-0.023
Token sale
Token ID5466
Feature activation-1.67e+00

Pos logit contributions
 Kim+0.032
 Su+0.029
 Johnny+0.029
 cho+0.028
ug+0.028
Neg logit contributions
 conclusions-0.024
ede-0.022
growth-0.021
',"-0.021
-0.021
Token.
Token ID13
Feature activation-2.77e-01

Pos logit contributions
 conclusions+0.036
ede+0.033
+0.032
growth+0.032
',"+0.032
Neg logit contributions
 Kim-0.043
 Su-0.039
 cho-0.038
 Johnny-0.037
ug-0.036
Token cl
Token ID537
Feature activation+6.06e-01

Pos logit contributions
Kim+0.021
eddy+0.020
coat+0.019
rol+0.019
ug+0.019
Neg logit contributions
 conclusions-0.017
 closely-0.017
 Fran-0.016
 vegetable-0.016
 mushrooms-0.016
Tokenapping
Token ID5912
Feature activation-9.40e-01

Pos logit contributions
 Kim+0.011
 cho+0.009
 Su+0.009
Kim+0.009
 driving+0.008
Neg logit contributions
 conclusions-0.018
-0.017
',"-0.016
'."-0.016
ede-0.016
Token,
Token ID11
Feature activation+9.36e-01

Pos logit contributions
 conclusions+0.019
',"+0.019
'."+0.018
.'"+0.018
+0.017
Neg logit contributions
 Kim-0.023
Kim-0.021
 Su-0.021
 cho-0.020
 driving-0.020
Token silly
Token ID14397
Feature activation-5.13e-01

Pos logit contributions
ug+0.025
rol+0.022
 cho+0.022
ick+0.022
un+0.021
Neg logit contributions
 moral-0.017
 conclusions-0.016
 vegetable-0.015
-0.015
 closely-0.014
Token,"
Token ID553
Feature activation+2.19e-01

Pos logit contributions
 conclusions+0.011
',"+0.010
+0.010
'."+0.009
 moral+0.009
Neg logit contributions
 Kim-0.013
rol-0.011
 Liz-0.011
Kim-0.011
coat-0.011
Token Mom
Token ID11254
Feature activation+3.34e+00

Pos logit contributions
ug+0.005
 cho+0.005
coat+0.005
rol+0.005
rou+0.005
Neg logit contributions
',"-0.004
 moral-0.004
 vegetable-0.004
 conclusions-0.004
-0.004
Token said
Token ID531
Feature activation+1.89e+00

Pos logit contributions
ug+0.085
 cho+0.085
 Kim+0.083
ained+0.081
 speeding+0.080
Neg logit contributions
',"-0.049
'-0.049
:-0.048
 closely-0.047
ede-0.046
Token.
Token ID13
Feature activation+9.18e-01

Pos logit contributions
Kim+0.062
ained+0.062
rol+0.062
 cho+0.060
 Kim+0.059
Neg logit contributions
'-0.017
',"-0.016
'."-0.014
cious-0.014
'.-0.013
Token She
Token ID1375
Feature activation+9.23e-01

Pos logit contributions
ug+0.028
 cho+0.027
rol+0.027
 Kim+0.027
reat+0.026
Neg logit contributions
-0.013
',"-0.011
'."-0.011
growth-0.010
.'"-0.010
Token made
Token ID925
Feature activation+5.60e-01

Pos logit contributions
 Kim+0.022
 Su+0.020
ug+0.019
Kim+0.019
reat+0.019
Neg logit contributions
ede-0.019
-0.018
 conclusions-0.018
 moral-0.018
growth-0.018
Token John
Token ID1757
Feature activation-7.42e-01

Pos logit contributions
coat+0.016
ug+0.016
Kim+0.015
 cho+0.015
Su+0.014
Neg logit contributions
 conclusions-0.008
 closely-0.008
growth-0.008
',"-0.008
 mushrooms-0.007
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
 Kim+0.018
 Su+0.016
 cho+0.016
Kim+0.016
 Johnny+0.016
Neg logit contributions
 conclusions-0.012
',"-0.010
ede-0.010
growth-0.010
-0.010
Token They
Token ID1119
Feature activation-8.13e-01

Pos logit contributions
 conclusions+0.009
+0.009
growth+0.009
',"+0.008
'."+0.008
Neg logit contributions
 Kim-0.013
 cho-0.013
ug-0.012
 Su-0.012
coat-0.012
Token had
Token ID550
Feature activation-2.40e-01

Pos logit contributions
 conclusions+0.019
growth+0.017
ede+0.017
+0.017
',"+0.016
Neg logit contributions
 Kim-0.020
 Su-0.018
 cho-0.018
 Johnny-0.017
Kim-0.017
Token a
Token ID257
Feature activation+1.25e+00

Pos logit contributions
growth+0.004
 conclusions+0.004
 mushrooms+0.004
ils+0.004
cious+0.004
Neg logit contributions
 Kim-0.007
ug-0.006
Kim-0.006
 Johnny-0.006
 Su-0.006
Token lot
Token ID1256
Feature activation+1.92e+00

Pos logit contributions
 Kim+0.031
 Su+0.028
 cho+0.027
 Johnny+0.027
Kim+0.026
Neg logit contributions
 conclusions-0.028
ede-0.025
growth-0.025
-0.025
',"-0.024
Token of
Token ID286
Feature activation+3.84e+00

Pos logit contributions
 Kim+0.028
 Su+0.024
 Johnny+0.023
Kim+0.022
 cho+0.022
Neg logit contributions
 conclusions-0.063
growth-0.060
 mushrooms-0.059
',"-0.056
ede-0.056
Token fun
Token ID1257
Feature activation+1.15e+00

Pos logit contributions
 Kim+0.077
 Su+0.069
 cho+0.066
Kim+0.066
 Johnny+0.065
Neg logit contributions
 conclusions-0.105
 mushrooms-0.094
growth-0.094
ede-0.093
 moral-0.091
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
 Kim+0.025
 Su+0.022
 Johnny+0.021
 cho+0.021
Kim+0.021
Neg logit contributions
 conclusions-0.030
ede-0.027
growth-0.027
-0.027
',"-0.026
Token
Token ID198
Feature activation-2.52e-01

Pos logit contributions
 Kim+0.005
 Su+0.004
 Johnny+0.004
 Liz+0.004
 cho+0.004
Neg logit contributions
 conclusions-0.004
ede-0.004
growth-0.004
-0.004
',"-0.003
Token
Token ID198
Feature activation-9.43e-01

Pos logit contributions
 conclusions+0.006
ede+0.005
+0.005
growth+0.005
',"+0.005
Neg logit contributions
 Kim-0.006
 Su-0.006
 cho-0.005
Kim-0.005
 Johnny-0.005
TokenOne
Token ID3198
Feature activation+1.06e+00

Pos logit contributions
 conclusions+0.018
growth+0.017
ede+0.016
',"+0.015
+0.015
Neg logit contributions
 Kim-0.027
 Su-0.024
Kim-0.024
 cho-0.024
 Johnny-0.023
Token glad
Token ID9675
Feature activation+1.64e-01

Pos logit contributions
 conclusions+0.010
 closely+0.010
 moral+0.010
cious+0.010
+0.010
Neg logit contributions
 Kim-0.016
 cho-0.015
Kim-0.015
ug-0.015
coat-0.015
Token that
Token ID326
Feature activation+1.88e-01

Pos logit contributions
 Kim+0.004
ug+0.004
Kim+0.004
coat+0.004
 cho+0.004
Neg logit contributions
 conclusions-0.003
 mushrooms-0.003
growth-0.003
',"-0.003
-0.003
Token she
Token ID673
Feature activation-2.65e-01

Pos logit contributions
 Kim+0.004
ug+0.004
 cho+0.004
coat+0.004
 Su+0.004
Neg logit contributions
 conclusions-0.004
-0.004
ede-0.004
growth-0.004
 moral-0.004
Token had
Token ID550
Feature activation-4.03e-01

Pos logit contributions
ede+0.004
+0.004
 conclusions+0.004
ils+0.004
growth+0.004
Neg logit contributions
 Kim-0.008
 Su-0.007
ug-0.007
 cho-0.007
Kim-0.007
Token spent
Token ID3377
Feature activation-1.85e-01

Pos logit contributions
 conclusions+0.008
growth+0.008
ede+0.008
ils+0.008
+0.007
Neg logit contributions
 Kim-0.010
ug-0.009
 cho-0.009
 Su-0.009
Kim-0.009
Token the
Token ID262
Feature activation+3.51e+00

Pos logit contributions
 conclusions+0.006
growth+0.005
+0.005
ede+0.005
',"+0.005
Neg logit contributions
 Kim-0.003
 Su-0.003
 Johnny-0.003
 driving-0.003
 cho-0.003
Token day
Token ID1110
Feature activation-4.98e-01

Pos logit contributions
 Kim+0.104
ug+0.096
 cho+0.094
oses+0.092
 Liz+0.090
Neg logit contributions
 conclusions-0.065
 moral-0.065
ede-0.064
growth-0.054
 vegetable-0.054
Token exploring
Token ID13504
Feature activation-3.76e-01

Pos logit contributions
 conclusions+0.011
'+0.010
cious+0.010
ede+0.010
ils+0.010
Neg logit contributions
 Kim-0.013
Kim-0.011
 Su-0.011
 cho-0.010
 Liz-0.010
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
 conclusions+0.010
growth+0.009
ede+0.009
+0.009
',"+0.008
Neg logit contributions
 Kim-0.008
 Su-0.007
 cho-0.007
 Johnny-0.007
Kim-0.007
Token mud
Token ID17492
Feature activation+1.64e+00

Pos logit contributions
 Kim+0.028
ug+0.026
 cho+0.025
 Su+0.025
rol+0.024
Neg logit contributions
 conclusions-0.023
ede-0.020
 moral-0.020
 vegetable-0.019
-0.019
Token,
Token ID11
Feature activation-1.38e-01

Pos logit contributions
 Kim+0.048
Kim+0.043
 Johnny+0.043
 Su+0.043
 cho+0.042
Neg logit contributions
ede-0.029
 conclusions-0.028
 mushrooms-0.024
ils-0.024
growth-0.024
Token was
Token ID373
Feature activation-8.55e-01

Pos logit contributions
 conclusions+0.047
growth+0.042
ede+0.042
+0.042
',"+0.041
Neg logit contributions
 Kim-0.042
 Su-0.038
 Johnny-0.037
 cho-0.037
 Liz-0.035
Token foolish
Token ID19538
Feature activation-5.21e-01

Pos logit contributions
 conclusions+0.020
 vegetable+0.018
 mushrooms+0.018
ede+0.018
growth+0.018
Neg logit contributions
 Kim-0.019
Kim-0.018
 Su-0.018
ug-0.017
 Johnny-0.017
Token."
Token ID526
Feature activation+3.96e-01

Pos logit contributions
 conclusions+0.009
',"+0.008
 mushrooms+0.008
.'"+0.008
'."+0.008
Neg logit contributions
 Kim-0.015
Kim-0.015
 Su-0.015
Su-0.014
 cho-0.014
Token
Token ID198
Feature activation+5.47e-01

Pos logit contributions
 Kim+0.010
 Su+0.009
 Johnny+0.008
 cho+0.008
 Liz+0.008
Neg logit contributions
 conclusions-0.010
ede-0.009
growth-0.009
-0.009
ils-0.008
Token
Token ID198
Feature activation-5.84e-01

Pos logit contributions
 Kim+0.013
 Su+0.012
 cho+0.011
 Johnny+0.011
Kim+0.011
Neg logit contributions
 conclusions-0.013
ede-0.012
-0.012
growth-0.012
ils-0.011
TokenMom
Token ID29252
Feature activation+3.10e+00

Pos logit contributions
 conclusions+0.013
growth+0.012
ede+0.012
ils+0.011
+0.011
Neg logit contributions
 Kim-0.015
Kim-0.013
 cho-0.013
 Su-0.013
 Johnny-0.013
Token smiled
Token ID13541
Feature activation+5.68e-01

Pos logit contributions
 Kim+0.083
 Su+0.078
ug+0.076
 cho+0.075
 speeding+0.073
Neg logit contributions
ede-0.055
 moral-0.052
 conclusions-0.052
growth-0.051
-0.050
Token and
Token ID290
Feature activation+6.46e-01

Pos logit contributions
 Kim+0.009
 Johnny+0.007
 cho+0.007
 driving+0.007
 speeding+0.007
Neg logit contributions
 conclusions-0.019
ede-0.018
growth-0.018
-0.017
ils-0.017
Token kissed
Token ID27946
Feature activation-6.59e-02

Pos logit contributions
 Kim+0.015
 Su+0.013
ug+0.013
 cho+0.013
Kim+0.013
Neg logit contributions
 conclusions-0.015
ede-0.014
growth-0.014
-0.014
',"-0.013
Token his
Token ID465
Feature activation-4.98e-01

Pos logit contributions
 conclusions+0.002
growth+0.001
ede+0.001
+0.001
',"+0.001
Neg logit contributions
 Kim-0.001
 Su-0.001
 speeding-0.001
 driving-0.001
 cho-0.001
Token cheek
Token ID19353
Feature activation-2.91e-01

Pos logit contributions
 conclusions+0.012
growth+0.011
ede+0.011
+0.011
',"+0.011
Neg logit contributions
 Kim-0.012
 Su-0.011
 cho-0.010
 Johnny-0.010
 driving-0.010
Token park
Token ID3952
Feature activation+8.24e-02

Pos logit contributions
 conclusions+0.012
growth+0.011
ede+0.011
+0.011
',"+0.010
Neg logit contributions
 Kim-0.011
 Su-0.010
 cho-0.010
 Johnny-0.010
ug-0.010
Token.
Token ID13
Feature activation-4.17e-01

Pos logit contributions
 Kim+0.002
 Su+0.002
 cho+0.002
 Johnny+0.002
Kim+0.002
Neg logit contributions
 conclusions-0.002
ede-0.002
growth-0.002
-0.001
',"-0.001
Token It
Token ID632
Feature activation+1.49e+00

Pos logit contributions
 conclusions+0.007
+0.007
growth+0.006
',"+0.006
ede+0.006
Neg logit contributions
 Kim-0.012
 cho-0.011
ug-0.011
reat-0.011
 Su-0.011
Token was
Token ID373
Feature activation+3.53e-01

Pos logit contributions
 Kim+0.040
ug+0.037
 Su+0.036
reat+0.036
 Johnny+0.035
Neg logit contributions
ede-0.026
-0.025
 conclusions-0.025
growth-0.024
cious-0.023
Token a
Token ID257
Feature activation+6.67e-01

Pos logit contributions
ug+0.012
ancer+0.011
icking+0.011
ended+0.011
ister+0.011
Neg logit contributions
One-0.004
 moral-0.003
 vegetable-0.003
 mushrooms-0.003
 herbs-0.003
Token sunny
Token ID27737
Feature activation+3.15e+00

Pos logit contributions
 Kim+0.017
ug+0.015
 Su+0.015
rol+0.014
reat+0.014
Neg logit contributions
 conclusions-0.014
 vegetable-0.014
-0.014
ede-0.013
 tripod-0.013
Token day
Token ID1110
Feature activation-2.76e-01

Pos logit contributions
 Kim+0.076
ug+0.066
 Su+0.066
 cho+0.065
 Johnny+0.064
Neg logit contributions
 conclusions-0.067
-0.067
ede-0.064
 vegetable-0.064
growth-0.061
Token and
Token ID290
Feature activation+6.70e-01

Pos logit contributions
'+0.004
cious+0.004
',"+0.004
'."+0.004
+0.004
Neg logit contributions
 Kim-0.008
Kim-0.008
 cho-0.007
rol-0.007
ug-0.007
Token it
Token ID340
Feature activation+1.52e+00

Pos logit contributions
ug+0.015
 Kim+0.015
reat+0.014
 cho+0.013
Kim+0.013
Neg logit contributions
-0.014
 conclusions-0.014
 vegetable-0.013
 mushrooms-0.013
growth-0.013
Token seemed
Token ID3947
Feature activation+2.80e-01

Pos logit contributions
 Kim+0.048
ug+0.044
 Su+0.042
reat+0.042
 Johnny+0.041
Neg logit contributions
ede-0.023
-0.020
ils-0.020
 moral-0.019
 conclusions-0.019
Token like
Token ID588
Feature activation+7.74e-01

Pos logit contributions
 Kim+0.008
Kim+0.008
ug+0.008
Johnny+0.007
oo+0.007
Neg logit contributions
 vegetable-0.004
 mushrooms-0.004
 herbs-0.004
 closely-0.004
 conclusions-0.004
Token They
Token ID1119
Feature activation-6.10e-01

Pos logit contributions
 Kim+0.015
 Su+0.013
 cho+0.013
 Johnny+0.013
 Liz+0.013
Neg logit contributions
 conclusions-0.013
ede-0.012
growth-0.012
-0.012
',"-0.011
Token go
Token ID467
Feature activation+7.09e-01

Pos logit contributions
 conclusions+0.014
growth+0.013
+0.013
ede+0.013
',"+0.013
Neg logit contributions
 Kim-0.015
 Su-0.013
 cho-0.013
 Johnny-0.013
 driving-0.012
Token to
Token ID284
Feature activation+3.31e-02

Pos logit contributions
 Kim+0.020
 Su+0.019
 Johnny+0.019
ug+0.018
Kim+0.018
Neg logit contributions
 conclusions-0.011
ede-0.010
-0.010
',"-0.010
'-0.010
Token play
Token ID711
Feature activation+1.75e-01

Pos logit contributions
 Kim+0.001
 Su+0.001
 Johnny+0.001
 Tim+0.001
 driving+0.001
Neg logit contributions
-0.001
 conclusions-0.001
ede-0.001
growth-0.001
',"-0.001
Token again
Token ID757
Feature activation-2.22e+00

Pos logit contributions
 Kim+0.003
 cho+0.003
 Su+0.003
 driving+0.003
 Tim+0.003
Neg logit contributions
 conclusions-0.005
growth-0.005
-0.005
',"-0.004
ede-0.004
Token.
Token ID13
Feature activation+4.42e-01

Pos logit contributions
growth+0.064
 conclusions+0.062
ede+0.061
 tripod+0.060
ils+0.059
Neg logit contributions
 Kim-0.040
 Su-0.033
 Tim-0.032
 driving-0.032
 cho-0.031
Token They
Token ID1119
Feature activation-7.62e-01

Pos logit contributions
 Kim+0.011
 Su+0.010
 Johnny+0.010
 cho+0.009
 Liz+0.009
Neg logit contributions
 conclusions-0.010
ede-0.010
-0.009
growth-0.009
',"-0.009
Token are
Token ID389
Feature activation-5.45e-01

Pos logit contributions
 conclusions+0.018
growth+0.016
+0.016
',"+0.016
ede+0.016
Neg logit contributions
 Kim-0.018
 Su-0.017
 cho-0.016
 Johnny-0.016
 driving-0.015
Token happy
Token ID3772
Feature activation+7.42e-01

Pos logit contributions
 moral+0.009
 wrists+0.007
 vegetable+0.007
ils+0.007
 end+0.007
Neg logit contributions
 Su-0.015
 Kim-0.015
ug-0.014
rol-0.014
 Johnny-0.014
Token.
Token ID13
Feature activation+4.02e-01

Pos logit contributions
 Kim+0.023
 Su+0.021
Kim+0.021
coat+0.020
 cho+0.020
Neg logit contributions
 conclusions-0.013
ede-0.010
growth-0.010
-0.010
 moral-0.010
Token
Token ID198
Feature activation+9.96e-01

Pos logit contributions
 Kim+0.008
 Johnny+0.007
 Tim+0.007
 Liz+0.007
 Su+0.007
Neg logit contributions
ede-0.012
 conclusions-0.011
ils-0.010
-0.010
growth-0.010
Token,
Token ID11
Feature activation+8.79e-01

Pos logit contributions
 Kim+0.015
 Su+0.014
ug+0.013
Kim+0.013
 cho+0.013
Neg logit contributions
 conclusions-0.012
',"-0.012
-0.011
'."-0.011
ede-0.011
Token little
Token ID1310
Feature activation+9.05e-02

Pos logit contributions
 Kim+0.023
rol+0.022
 driving+0.022
ug+0.022
Kim+0.021
Neg logit contributions
 conclusions-0.017
ede-0.015
 mushrooms-0.015
 moral-0.015
 vegetable-0.015
Token worm
Token ID23849
Feature activation+1.93e+00

Pos logit contributions
 Kim+0.002
 Su+0.002
 cho+0.002
 Johnny+0.002
 Liz+0.002
Neg logit contributions
 conclusions-0.002
growth-0.002
ede-0.002
-0.002
',"-0.002
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
 Kim+0.054
 Su+0.050
Kim+0.049
 Johnny+0.048
 driving+0.048
Neg logit contributions
 conclusions-0.036
ede-0.035
',"-0.033
'."-0.032
 vegetable-0.031
Token I
Token ID314
Feature activation-1.16e+00

Pos logit contributions
 Kim+0.044
 Su+0.040
 cho+0.040
 driving+0.040
coat+0.040
Neg logit contributions
-0.041
 conclusions-0.038
growth-0.037
'."-0.035
',"-0.035
Token'm
Token ID1101
Feature activation+7.54e-01

Pos logit contributions
 conclusions+0.025
ede+0.023
growth+0.023
+0.023
',"+0.022
Neg logit contributions
 Kim-0.030
 Su-0.027
 cho-0.026
 Johnny-0.026
Kim-0.025
Token happy
Token ID3772
Feature activation+2.10e-01

Pos logit contributions
coat+0.018
Kim+0.018
 Kim+0.017
bars+0.017
ug+0.017
Neg logit contributions
 Fran-0.013
 vegetable-0.012
 Blossom-0.012
 moral-0.012
 closely-0.011
Token to
Token ID284
Feature activation+1.56e+00

Pos logit contributions
 Kim+0.005
Kim+0.005
 Su+0.005
coat+0.005
ug+0.005
Neg logit contributions
 conclusions-0.004
 mushrooms-0.004
-0.004
 vegetable-0.004
',"-0.004
Token help
Token ID1037
Feature activation-5.19e-01

Pos logit contributions
 Kim+0.032
 Su+0.028
 cho+0.027
 Johnny+0.026
Kim+0.026
Neg logit contributions
 conclusions-0.042
growth-0.039
 mushrooms-0.039
ils-0.039
-0.039
Token."
Token ID526
Feature activation+4.40e-01

Pos logit contributions
 conclusions+0.014
growth+0.013
ede+0.013
+0.013
',"+0.013
Neg logit contributions
 Kim-0.011
 Su-0.009
 cho-0.009
 Johnny-0.009
ug-0.009
Token<|endoftext|>
Token ID50256
Feature activation+9.97e-01

Pos logit contributions
 Kim+0.012
 cho+0.011
 Su+0.011
ug+0.011
 driving+0.011
Neg logit contributions
 conclusions-0.008
growth-0.007
-0.007
ede-0.007
',"-0.007
Token
Token ID198
Feature activation-6.94e-01

Pos logit contributions
 Kim+0.003
 cho+0.003
 Su+0.003
 driving+0.003
 speeding+0.003
Neg logit contributions
 conclusions-0.002
',"-0.002
growth-0.002
-0.002
'."-0.002
Token"
Token ID1
Feature activation+4.79e-01

Pos logit contributions
 conclusions+0.010
+0.009
',"+0.008
ede+0.008
growth+0.008
Neg logit contributions
 Kim-0.023
 Su-0.021
 cho-0.021
 Johnny-0.021
 driving-0.021
TokenOK
Token ID11380
Feature activation-1.19e+00

Pos logit contributions
 Kim+0.011
 Su+0.009
 cho+0.009
Kim+0.009
 Johnny+0.009
Neg logit contributions
 conclusions-0.012
growth-0.011
ede-0.011
',"-0.011
-0.011
Token,
Token ID11
Feature activation+9.07e-01

Pos logit contributions
',"+0.022
 conclusions+0.021
+0.021
'."+0.020
ede+0.019
Neg logit contributions
 Kim-0.032
ug-0.030
 cho-0.030
 Su-0.029
rol-0.029
Token but
Token ID475
Feature activation+1.37e+00

Pos logit contributions
ained+0.023
umbled+0.022
rol+0.022
ug+0.022
coat+0.021
Neg logit contributions
 conclusions-0.018
 moral-0.015
 mushrooms-0.015
',"-0.015
ede-0.015
Token be
Token ID307
Feature activation+6.32e-01

Pos logit contributions
 Kim+0.032
 Su+0.030
rol+0.030
 cho+0.029
ug+0.029
Neg logit contributions
 conclusions-0.031
-0.030
ede-0.028
 moral-0.027
',"-0.027
Token careful
Token ID8161
Feature activation+8.81e-01

Pos logit contributions
 Kim+0.010
 Su+0.010
 Johnny+0.010
ug+0.009
rol+0.009
Neg logit contributions
 conclusions-0.017
',"-0.016
 moral-0.016
'."-0.016
-0.016
Token,"
Token ID553
Feature activation+5.25e-01

Pos logit contributions
 Kim+0.026
 Su+0.025
 Johnny+0.024
 cho+0.024
rol+0.023
Neg logit contributions
',"-0.014
 conclusions-0.013
'."-0.013
-0.012
.'"-0.012
Token Lily
Token ID20037
Feature activation+1.64e+00

Pos logit contributions
 speeding+0.010
ug+0.010
reat+0.009
rol+0.009
 cho+0.009
Neg logit contributions
 conclusions-0.013
',"-0.012
growth-0.012
 moral-0.012
 Fran-0.012
Token said
Token ID531
Feature activation+1.64e+00

Pos logit contributions
 Kim+0.035
 cho+0.031
 Su+0.031
ug+0.029
Kim+0.029
Neg logit contributions
 conclusions-0.040
ede-0.039
-0.038
',"-0.038
growth-0.038
Token.
Token ID13
Feature activation+6.95e-01

Pos logit contributions
 Kim+0.052
 Su+0.050
Kim+0.049
 cho+0.048
rol+0.048
Neg logit contributions
',"-0.020
 conclusions-0.019
'."-0.019
-0.018
ede-0.018
Token the
Token ID262
Feature activation-1.90e-02

Pos logit contributions
 conclusions+0.021
ede+0.019
growth+0.019
+0.019
',"+0.019
Neg logit contributions
 Kim-0.020
 Su-0.018
 cho-0.017
 Johnny-0.017
ug-0.016
Token ball
Token ID2613
Feature activation+7.78e-01

Pos logit contributions
 conclusions+0.000
ede+0.000
growth+0.000
+0.000
',"+0.000
Neg logit contributions
 Kim-0.000
 Su-0.000
 Johnny-0.000
 cho-0.000
ug-0.000
Token and
Token ID290
Feature activation+5.06e-01

Pos logit contributions
 Kim+0.020
 Su+0.018
 Johnny+0.018
Kim+0.017
 cho+0.017
Neg logit contributions
 conclusions-0.017
ede-0.016
growth-0.015
-0.015
',"-0.015
Token Rex
Token ID17853
Feature activation-1.11e+00

Pos logit contributions
 Kim+0.012
 Su+0.011
 cho+0.010
 Johnny+0.010
ug+0.010
Neg logit contributions
 conclusions-0.012
ede-0.011
growth-0.011
-0.011
',"-0.010
Token catches
Token ID17591
Feature activation+2.19e-01

Pos logit contributions
 conclusions+0.022
ede+0.021
+0.020
',"+0.019
growth+0.019
Neg logit contributions
 Kim-0.030
 Su-0.029
Kim-0.028
 Johnny-0.028
Su-0.027
Token it
Token ID340
Feature activation+6.80e-01

Pos logit contributions
 Kim+0.006
 Su+0.006
coat+0.006
Kim+0.006
Su+0.005
Neg logit contributions
 conclusions-0.004
growth-0.004
ede-0.003
',"-0.003
 mushrooms-0.003
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 Kim+0.021
 Su+0.021
 Johnny+0.020
Kim+0.020
Su+0.019
Neg logit contributions
 conclusions-0.011
ede-0.009
growth-0.009
',"-0.009
 mushrooms-0.008
Token They
Token ID1119
Feature activation-7.77e-01

Pos logit contributions
icking+0.037
coat+0.036
reat+0.035
rol+0.035
rou+0.035
Neg logit contributions
 conclusions-0.013
growth-0.013
 One-0.011
',"-0.010
-0.010
Token have
Token ID423
Feature activation-9.08e-01

Pos logit contributions
 conclusions+0.017
ede+0.016
growth+0.016
+0.015
',"+0.015
Neg logit contributions
 Kim-0.020
 Su-0.018
 cho-0.017
 Johnny-0.017
Kim-0.017
Token fun
Token ID1257
Feature activation+1.14e+00

Pos logit contributions
 conclusions+0.021
growth+0.020
 mushrooms+0.019
 wrists+0.019
ede+0.019
Neg logit contributions
 Kim-0.020
ug-0.019
 Su-0.019
Kim-0.018
 Johnny-0.018
Token.
Token ID13
Feature activation+9.62e-01

Pos logit contributions
 Kim+0.027
 Su+0.025
 cho+0.024
 Johnny+0.024
 driving+0.023
Neg logit contributions
 conclusions-0.027
growth-0.025
ede-0.024
-0.024
',"-0.024
Token She
Token ID1375
Feature activation-4.21e-01

Pos logit contributions
 conclusions+0.018
+0.017
growth+0.017
ede+0.016
',"+0.016
Neg logit contributions
 Kim-0.019
 cho-0.017
 Su-0.017
ug-0.017
 driving-0.017
Token told
Token ID1297
Feature activation+2.99e-02

Pos logit contributions
 conclusions+0.010
ede+0.009
growth+0.009
+0.009
',"+0.009
Neg logit contributions
 Kim-0.010
 Su-0.009
 cho-0.008
 Johnny-0.008
ug-0.008
Token her
Token ID607
Feature activation-1.79e+00

Pos logit contributions
 Kim+0.001
 cho+0.001
 Su+0.001
Kim+0.001
coat+0.001
Neg logit contributions
 conclusions-0.001
growth-0.001
ede-0.001
 mushrooms-0.001
-0.001
Token that
Token ID326
Feature activation-3.70e-01

Pos logit contributions
 conclusions+0.042
 moral+0.040
growth+0.039
+0.039
',"+0.038
Neg logit contributions
 Kim-0.038
 Su-0.035
rol-0.035
 cho-0.034
Kim-0.034
Token it
Token ID340
Feature activation+2.91e-01

Pos logit contributions
 conclusions+0.008
 mushrooms+0.007
 moral+0.007
ede+0.006
+0.006
Neg logit contributions
 Kim-0.009
coat-0.009
ug-0.009
rol-0.009
Kim-0.009
Token's
Token ID338
Feature activation+9.49e-01

Pos logit contributions
 Kim+0.008
 Su+0.008
 cho+0.008
Kim+0.007
ug+0.007
Neg logit contributions
 conclusions-0.005
ede-0.005
-0.005
 moral-0.004
growth-0.004
Token okay
Token ID8788
Feature activation+1.21e-01

Pos logit contributions
 Kim+0.020
 Su+0.018
 cho+0.017
 Johnny+0.017
Kim+0.016
Neg logit contributions
 conclusions-0.025
ede-0.023
growth-0.023
-0.023
',"-0.023
Token to
Token ID284
Feature activation+9.42e-01

Pos logit contributions
rol+0.003
 Kim+0.003
 Su+0.003
ug+0.003
Kim+0.003
Neg logit contributions
 conclusions-0.002
-0.002
 mushrooms-0.002
 moral-0.002
ede-0.002
Token feel
Token ID1254
Feature activation+3.47e-02

Pos logit contributions
 Kim+0.023
 Su+0.021
 cho+0.021
 Johnny+0.020
ug+0.020
Neg logit contributions
 conclusions-0.022
ede-0.019
growth-0.019
-0.019
',"-0.019
Token sad
Token ID6507
Feature activation-7.36e-01

Pos logit contributions
 Kim+0.001
Kim+0.001
 Su+0.001
coat+0.001
rol+0.001
Neg logit contributions
 conclusions-0.001
 moral-0.001
 mushrooms-0.001
',"-0.001
-0.001
Token,
Token ID11
Feature activation-2.50e-01

Pos logit contributions
 conclusions+0.015
+0.012
 mushrooms+0.012
',"+0.012
ede+0.012
Neg logit contributions
 Kim-0.020
 Su-0.019
Kim-0.019
 cho-0.018
Su-0.018
Token bumped
Token ID38034
Feature activation+2.48e-01

Pos logit contributions
 conclusions+0.023
ede+0.022
growth+0.022
+0.021
',"+0.020
Neg logit contributions
 Kim-0.023
 Su-0.021
 cho-0.020
 Johnny-0.020
ug-0.020
Token into
Token ID656
Feature activation+3.12e-01

Pos logit contributions
 Kim+0.007
Kim+0.006
 cho+0.006
coat+0.006
 Su+0.006
Neg logit contributions
 conclusions-0.005
growth-0.004
',"-0.004
 mushrooms-0.004
 wrists-0.004
Token the
Token ID262
Feature activation+1.29e-01

Pos logit contributions
 Kim+0.007
 Su+0.006
 cho+0.006
 Johnny+0.006
ug+0.006
Neg logit contributions
 conclusions-0.008
growth-0.007
ede-0.007
-0.007
',"-0.007
Token table
Token ID3084
Feature activation-8.42e-01

Pos logit contributions
 Kim+0.003
 Su+0.003
 Johnny+0.003
ug+0.003
Kim+0.003
Neg logit contributions
 conclusions-0.003
growth-0.003
ede-0.003
 vegetable-0.002
',"-0.002
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
 conclusions+0.015
ede+0.014
growth+0.013
+0.013
',"+0.012
Neg logit contributions
 Kim-0.024
 Su-0.023
 Johnny-0.022
Kim-0.022
ug-0.021
Token where
Token ID810
Feature activation-2.74e-01

Pos logit contributions
ug+0.000
 Kim+0.000
 Su+0.000
 cho+0.000
Kim+0.000
Neg logit contributions
 conclusions-0.000
ede-0.000
growth-0.000
 wrists-0.000
-0.000
Token the
Token ID262
Feature activation-9.38e-01

Pos logit contributions
 conclusions+0.006
 mushrooms+0.005
growth+0.005
 moral+0.005
ils+0.005
Neg logit contributions
 Kim-0.006
coat-0.006
ug-0.006
 speeding-0.006
Kim-0.006
Token dessert
Token ID23084
Feature activation-1.76e+00

Pos logit contributions
 conclusions+0.019
 mushrooms+0.017
 vegetable+0.017
 moral+0.017
ede+0.017
Neg logit contributions
 Kim-0.026
 Su-0.024
ug-0.023
Kim-0.023
 Johnny-0.022
Token was
Token ID373
Feature activation+4.27e-01

Pos logit contributions
ede+0.030
ils+0.029
 conclusions+0.028
 vegetable+0.028
 mushrooms+0.027
Neg logit contributions
 Kim-0.049
ug-0.047
 Su-0.047
Kim-0.045
 speeding-0.045
Token.
Token ID13
Feature activation-5.28e-01

Pos logit contributions
 Kim+0.010
ug+0.009
 Su+0.009
Kim+0.009
 Liz+0.008
Neg logit contributions
 moral-0.009
ede-0.009
 conclusions-0.009
 vegetable-0.009
',"-0.009
Token The
Token ID383
Feature activation-1.05e+00

Pos logit contributions
 conclusions+0.011
growth+0.010
+0.009
',"+0.009
'."+0.009
Neg logit contributions
 Kim-0.013
 cho-0.013
 speeding-0.013
 driving-0.013
coat-0.012
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
 Kim+0.023
 Su+0.022
 cho+0.022
Kim+0.021
ug+0.021
Neg logit contributions
 conclusions-0.013
ede-0.012
',"-0.012
-0.012
growth-0.012
Token "
Token ID366
Feature activation-7.10e-01

Pos logit contributions
 Kim+0.008
 Su+0.007
 Johnny+0.007
Kim+0.007
 cho+0.007
Neg logit contributions
 conclusions-0.010
ede-0.009
growth-0.009
ils-0.009
',"-0.009
TokenWhy
Token ID5195
Feature activation-3.11e+00

Pos logit contributions
 conclusions+0.016
growth+0.014
ede+0.014
+0.014
',"+0.014
Neg logit contributions
 Kim-0.018
 Su-0.016
 cho-0.016
 Johnny-0.016
Kim-0.015
Token are
Token ID389
Feature activation+6.33e-01

Pos logit contributions
 conclusions+0.061
ede+0.055
+0.055
',"+0.054
 moral+0.053
Neg logit contributions
 Kim-0.079
 Su-0.075
 cho-0.075
ug-0.074
rol-0.072
Token you
Token ID345
Feature activation-7.31e-01

Pos logit contributions
 Kim+0.012
coat+0.011
 Su+0.011
ug+0.011
rol+0.011
Neg logit contributions
 conclusions-0.016
-0.015
ede-0.015
 mushrooms-0.015
',"-0.015
Token fighting
Token ID4330
Feature activation-8.49e-01

Pos logit contributions
 conclusions+0.012
 moral+0.012
ede+0.012
'+0.011
+0.011
Neg logit contributions
 Kim-0.019
Kim-0.017
 Su-0.017
rol-0.017
 cho-0.016
Token?"
Token ID1701
Feature activation-3.11e-02

Pos logit contributions
 conclusions+0.017
',"+0.015
ede+0.015
+0.015
'."+0.014
Neg logit contributions
 Kim-0.022
 Su-0.021
 cho-0.020
ug-0.020
Kim-0.020
Token
Token ID198
Feature activation+4.99e-01

Pos logit contributions
 conclusions+0.001
ede+0.001
growth+0.001
ils+0.001
+0.001
Neg logit contributions
 Kim-0.001
 Su-0.001
 Johnny-0.001
 Liz-0.001
Kim-0.001
Token
Token ID198
Feature activation-6.36e-01

Pos logit contributions
 Kim+0.012
 Su+0.010
 Johnny+0.010
Kim+0.010
 cho+0.010
Neg logit contributions
 conclusions-0.012
ede-0.012
growth-0.011
-0.011
ils-0.011
Token"
Token ID1
Feature activation-5.25e-01

Pos logit contributions
 conclusions+0.017
growth+0.016
ede+0.015
+0.015
',"+0.015
Neg logit contributions
 Kim-0.014
 Su-0.012
Kim-0.012
 Johnny-0.012
 cho-0.012
TokenL
Token ID43
Feature activation-2.78e-01

Pos logit contributions
 conclusions+0.012
+0.011
ede+0.011
growth+0.011
',"+0.011
Neg logit contributions
 Kim-0.012
 Su-0.011
 cho-0.011
 Johnny-0.011
ug-0.010
Token back
Token ID736
Feature activation-1.09e+00

Pos logit contributions
 Kim+0.001
 Su+0.001
 cho+0.001
 Johnny+0.001
ug+0.001
Neg logit contributions
 conclusions-0.002
ede-0.002
growth-0.002
-0.002
',"-0.002
Token to
Token ID284
Feature activation+4.42e-01

Pos logit contributions
 conclusions+0.027
growth+0.026
ede+0.025
',"+0.025
+0.025
Neg logit contributions
 Kim-0.023
 cho-0.021
 Su-0.020
ug-0.020
Kim-0.020
Token her
Token ID607
Feature activation-8.32e-01

Pos logit contributions
 Kim+0.007
 Su+0.006
 cho+0.006
ug+0.006
Kim+0.006
Neg logit contributions
 conclusions-0.013
growth-0.012
ede-0.012
-0.012
',"-0.012
Token room
Token ID2119
Feature activation+7.95e-01

Pos logit contributions
 conclusions+0.020
ede+0.019
growth+0.018
 moral+0.018
 vegetable+0.018
Neg logit contributions
 Kim-0.018
 Su-0.017
 cho-0.017
ug-0.017
rol-0.016
Token to
Token ID284
Feature activation+4.09e-01

Pos logit contributions
 Kim+0.026
Kim+0.025
 cho+0.024
 Su+0.024
ug+0.024
Neg logit contributions
 conclusions-0.009
ede-0.009
',"-0.008
growth-0.008
 wrists-0.008
Token finish
Token ID5461
Feature activation-2.52e-01

Pos logit contributions
 Kim+0.010
 Su+0.009
 cho+0.009
 Johnny+0.009
ug+0.008
Neg logit contributions
 conclusions-0.010
growth-0.009
ede-0.009
-0.008
',"-0.008
Token cleaning
Token ID12724
Feature activation-7.83e-02

Pos logit contributions
 conclusions+0.006
growth+0.006
ede+0.006
+0.006
',"+0.005
Neg logit contributions
 Kim-0.006
 Su-0.005
 cho-0.005
 Johnny-0.005
Kim-0.005
Token.
Token ID13
Feature activation+7.10e-02

Pos logit contributions
 conclusions+0.002
ede+0.002
growth+0.002
',"+0.001
+0.001
Neg logit contributions
 Kim-0.002
 cho-0.002
 Su-0.002
ug-0.002
Kim-0.002
Token Even
Token ID3412
Feature activation-3.74e-01

Pos logit contributions
 Kim+0.002
 cho+0.002
ug+0.002
 Su+0.002
 driving+0.002
Neg logit contributions
 conclusions-0.001
-0.001
growth-0.001
',"-0.001
ede-0.001
Token though
Token ID996
Feature activation-1.16e-01

Pos logit contributions
 conclusions+0.007
 mushrooms+0.007
 closely+0.007
 vegetable+0.007
 moral+0.006
Neg logit contributions
 Kim-0.009
 cho-0.009
 Su-0.009
Kim-0.009
ug-0.009
Token she
Token ID673
Feature activation-4.71e-01

Pos logit contributions
 conclusions+0.003
ede+0.002
+0.002
growth+0.002
 mushrooms+0.002
Neg logit contributions
 Kim-0.002
rol-0.002
 cho-0.002
ug-0.002
 Su-0.002
Token They
Token ID1119
Feature activation-1.79e-01

Pos logit contributions
 Kim+0.011
 cho+0.010
 Su+0.010
ug+0.010
coat+0.010
Neg logit contributions
 conclusions-0.008
growth-0.007
-0.007
ede-0.007
',"-0.007
Token drew
Token ID9859
Feature activation-1.03e+00

Pos logit contributions
 conclusions+0.004
ede+0.004
growth+0.004
+0.004
',"+0.004
Neg logit contributions
 Kim-0.004
 Su-0.004
 cho-0.004
 Johnny-0.004
Kim-0.004
Token many
Token ID867
Feature activation-1.22e+00

Pos logit contributions
 conclusions+0.019
 mushrooms+0.018
growth+0.017
ils+0.016
ede+0.016
Neg logit contributions
 Kim-0.028
 Su-0.027
coat-0.027
Kim-0.026
 speeding-0.026
Token pictures
Token ID5986
Feature activation+4.28e-01

Pos logit contributions
 conclusions+0.032
growth+0.030
+0.030
ede+0.029
',"+0.029
Neg logit contributions
 Kim-0.026
 Su-0.023
 Johnny-0.023
 cho-0.022
 Liz-0.021
Token together
Token ID1978
Feature activation-2.08e+00

Pos logit contributions
 Kim+0.011
 Su+0.011
Kim+0.010
Su+0.010
ug+0.009
Neg logit contributions
 mushrooms-0.008
 conclusions-0.008
ede-0.007
ils-0.007
'-0.007
Token with
Token ID351
Feature activation-5.91e-01

Pos logit contributions
 conclusions+0.048
ede+0.043
growth+0.043
+0.043
',"+0.042
Neg logit contributions
 Kim-0.051
 Su-0.046
 cho-0.044
 Johnny-0.044
Kim-0.043
Token the
Token ID262
Feature activation-4.05e-01

Pos logit contributions
 conclusions+0.013
growth+0.012
 mushrooms+0.011
ede+0.010
 moral+0.010
Neg logit contributions
 Kim-0.015
 Su-0.014
coat-0.014
Kim-0.013
 cho-0.013
Token ink
Token ID16882
Feature activation+6.30e-02

Pos logit contributions
 conclusions+0.008
 moral+0.007
ede+0.007
+0.007
 mushrooms+0.007
Neg logit contributions
 Kim-0.011
 Su-0.010
rol-0.010
ug-0.010
 cho-0.010
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
 Kim+0.002
 Su+0.002
Kim+0.002
ug+0.002
Su+0.002
Neg logit contributions
ede-0.001
 conclusions-0.001
 mushrooms-0.001
-0.001
ils-0.001
Token And
Token ID843
Feature activation+2.48e-01

Pos logit contributions
 Kim+0.006
 Su+0.006
 cho+0.006
 Johnny+0.006
ug+0.006
Neg logit contributions
 conclusions-0.005
growth-0.005
-0.005
ede-0.004
',"-0.004
Token they
Token ID484
Feature activation-5.33e-01

Pos logit contributions
coat+0.007
 Kim+0.006
Kim+0.006
rol+0.006
reat+0.006
Neg logit contributions
 conclusions-0.004
 mushrooms-0.004
 Fran-0.004
 moral-0.004
 vegetable-0.004
Token inside
Token ID2641
Feature activation-1.31e-01

Pos logit contributions
 Kim+0.019
 Su+0.018
Kim+0.017
 Johnny+0.017
 cho+0.017
Neg logit contributions
 conclusions-0.013
',"-0.012
ils-0.012
ede-0.012
-0.012
Token.
Token ID13
Feature activation-7.76e-02

Pos logit contributions
 conclusions+0.003
ede+0.003
growth+0.003
+0.003
',"+0.002
Neg logit contributions
 Kim-0.003
 Su-0.003
 cho-0.003
 Johnny-0.003
 driving-0.003
Token
Token ID198
Feature activation-7.58e-01

Pos logit contributions
 conclusions+0.002
ede+0.002
growth+0.002
+0.002
',"+0.002
Neg logit contributions
 Kim-0.002
 Su-0.002
 Johnny-0.002
 cho-0.002
Kim-0.002
Token
Token ID198
Feature activation-1.67e+00

Pos logit contributions
 conclusions+0.018
ede+0.016
growth+0.016
+0.016
',"+0.015
Neg logit contributions
 Kim-0.019
 Su-0.017
 cho-0.016
 Johnny-0.016
Kim-0.016
Token"
Token ID1
Feature activation-2.18e-01

Pos logit contributions
 conclusions+0.035
growth+0.033
ede+0.031
 mushrooms+0.029
+0.029
Neg logit contributions
 Kim-0.046
 Su-0.042
Kim-0.041
 cho-0.041
 Johnny-0.040
TokenLet
Token ID5756
Feature activation-1.20e+00

Pos logit contributions
 conclusions+0.005
ede+0.005
growth+0.005
',"+0.005
 mushrooms+0.005
Neg logit contributions
 Kim-0.005
Kim-0.005
 Su-0.005
 cho-0.004
 Johnny-0.004
Token's
Token ID338
Feature activation+1.75e+00

Pos logit contributions
 conclusions+0.024
growth+0.021
ede+0.020
+0.020
',"+0.019
Neg logit contributions
 Kim-0.034
 Su-0.032
 cho-0.031
 Johnny-0.031
Kim-0.029
Token get
Token ID651
Feature activation+5.04e-01

Pos logit contributions
 Kim+0.043
 cho+0.040
 Su+0.040
ug+0.038
 Johnny+0.037
Neg logit contributions
 conclusions-0.039
-0.035
growth-0.035
ede-0.035
',"-0.034
Token the
Token ID262
Feature activation+9.96e-01

Pos logit contributions
 Kim+0.013
 Su+0.012
 cho+0.012
Kim+0.012
ug+0.012
Neg logit contributions
 conclusions-0.010
growth-0.009
-0.009
ede-0.009
',"-0.009
Token box
Token ID3091
Feature activation+2.76e-01

Pos logit contributions
 Kim+0.026
 Su+0.026
ug+0.025
Su+0.024
Kim+0.024
Neg logit contributions
 conclusions-0.019
 mushrooms-0.018
ede-0.018
 moral-0.017
 vegetable-0.017
Token!"
Token ID2474
Feature activation-3.57e-01

Pos logit contributions
 Kim+0.008
 Su+0.008
Kim+0.008
 cho+0.008
ug+0.007
Neg logit contributions
',"-0.004
'."-0.004
.'"-0.004
 conclusions-0.004
!'"-0.003
Token said
Token ID531
Feature activation+1.19e+00

Pos logit contributions
 Kim+0.023
 Su+0.020
 Johnny+0.019
 cho+0.019
 Liz+0.018
Neg logit contributions
 conclusions-0.038
growth-0.035
-0.035
ede-0.035
',"-0.033
Token.
Token ID13
Feature activation+8.86e-02

Pos logit contributions
 Kim+0.033
 Su+0.031
 cho+0.030
Kim+0.030
 Johnny+0.030
Neg logit contributions
 conclusions-0.022
ede-0.020
',"-0.020
-0.020
growth-0.020
Token "
Token ID366
Feature activation+2.66e-01

Pos logit contributions
 Kim+0.002
 Su+0.002
 cho+0.002
 Johnny+0.002
Kim+0.002
Neg logit contributions
 conclusions-0.002
ede-0.002
growth-0.002
-0.002
',"-0.002
TokenWe
Token ID1135
Feature activation-1.29e+00

Pos logit contributions
 Kim+0.005
Kim+0.005
 cho+0.005
 Su+0.005
 driving+0.004
Neg logit contributions
 conclusions-0.007
ede-0.006
growth-0.006
 mushrooms-0.006
ils-0.006
Token can
Token ID460
Feature activation+3.83e-01

Pos logit contributions
 conclusions+0.028
ede+0.025
growth+0.025
+0.025
',"+0.024
Neg logit contributions
 Kim-0.033
 Su-0.030
 cho-0.029
 Johnny-0.029
Kim-0.028
Token look
Token ID804
Feature activation-2.72e+00

Pos logit contributions
 Kim+0.010
 Su+0.009
 cho+0.009
 Johnny+0.009
ug+0.008
Neg logit contributions
 conclusions-0.008
ede-0.008
growth-0.007
-0.007
',"-0.007
Token for
Token ID329
Feature activation+9.48e-01

Pos logit contributions
 conclusions+0.074
ede+0.068
growth+0.068
+0.067
',"+0.065
Neg logit contributions
 Kim-0.056
 Su-0.049
 cho-0.048
 Johnny-0.047
 driving-0.045
Token cheese
Token ID9891
Feature activation-8.88e-01

Pos logit contributions
 Kim+0.020
 Tim+0.019
 cho+0.018
 driving+0.018
 Johnny+0.017
Neg logit contributions
ede-0.024
-0.024
 conclusions-0.023
 narrative-0.022
 tripod-0.022
Token in
Token ID287
Feature activation+4.70e-01

Pos logit contributions
 conclusions+0.022
growth+0.021
+0.020
ede+0.020
',"+0.019
Neg logit contributions
 Kim-0.020
 cho-0.018
 Su-0.017
 Johnny-0.017
 driving-0.017
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
 Kim+0.007
 cho+0.006
 Tim+0.006
 Johnny+0.006
 driving+0.006
Neg logit contributions
ede-0.015
-0.014
 conclusions-0.014
cious-0.014
growth-0.014
Token kitchen
Token ID9592
Feature activation-1.12e+00

Pos logit contributions
 Kim+0.028
 Su+0.026
 cho+0.025
 Johnny+0.025
ug+0.024
Neg logit contributions
 conclusions-0.027
growth-0.024
ede-0.024
-0.024
',"-0.024
Token pepper
Token ID13385
Feature activation-2.71e-01

Pos logit contributions
 conclusions+0.011
growth+0.010
+0.009
ede+0.009
',"+0.009
Neg logit contributions
 Kim-0.011
 cho-0.010
 Su-0.010
 Johnny-0.010
 driving-0.010
Token and
Token ID290
Feature activation+5.36e-01

Pos logit contributions
 conclusions+0.007
growth+0.007
+0.007
ede+0.007
',"+0.006
Neg logit contributions
 Kim-0.006
 Su-0.005
 cho-0.005
 Johnny-0.005
 driving-0.005
Token said
Token ID531
Feature activation+8.33e-02

Pos logit contributions
 Kim+0.012
 Su+0.011
 cho+0.010
ug+0.010
 Johnny+0.010
Neg logit contributions
 conclusions-0.013
ede-0.012
growth-0.012
-0.012
',"-0.012
Token,
Token ID11
Feature activation+8.57e-02

Pos logit contributions
 Kim+0.002
 Su+0.002
 cho+0.002
 Johnny+0.002
Kim+0.002
Neg logit contributions
 conclusions-0.002
-0.002
ede-0.002
growth-0.002
',"-0.002
Token "
Token ID366
Feature activation-7.46e-01

Pos logit contributions
 Kim+0.002
 Su+0.002
 cho+0.002
 Johnny+0.002
 driving+0.002
Neg logit contributions
 conclusions-0.002
growth-0.002
ede-0.002
-0.002
',"-0.002
TokenHello
Token ID15496
Feature activation-2.73e+00

Pos logit contributions
 conclusions+0.022
growth+0.020
ede+0.020
 mushrooms+0.020
ils+0.020
Neg logit contributions
 Kim-0.013
Kim-0.012
 Su-0.011
 cho-0.011
 Johnny-0.010
Token,
Token ID11
Feature activation+7.80e-01

Pos logit contributions
+0.053
 conclusions+0.052
 mushrooms+0.049
',"+0.049
ede+0.048
Neg logit contributions
 Kim-0.069
 Su-0.065
rol-0.064
ug-0.063
Kim-0.063
Token Pepper
Token ID24346
Feature activation+5.03e-01

Pos logit contributions
ug+0.015
 Kim+0.015
rol+0.015
coat+0.015
Kim+0.014
Neg logit contributions
 conclusions-0.019
 vegetable-0.018
 mushrooms-0.018
ede-0.017
-0.017
Tokenoni
Token ID14651
Feature activation-7.11e-01

Pos logit contributions
 Kim+0.014
 Su+0.013
 cho+0.012
 Johnny+0.012
ug+0.012
Neg logit contributions
 conclusions-0.010
ede-0.009
-0.009
growth-0.009
',"-0.009
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
 conclusions+0.015
growth+0.014
ede+0.014
+0.014
',"+0.013
Neg logit contributions
 Kim-0.018
 Su-0.017
 cho-0.016
 Johnny-0.016
 driving-0.015
Token You
Token ID921
Feature activation-1.09e+00

Pos logit contributions
 Kim+0.031
 Su+0.028
 cho+0.027
 Johnny+0.027
Kim+0.026
Neg logit contributions
 conclusions-0.034
growth-0.031
ede-0.031
-0.031
',"-0.030
Token.
Token ID13
Feature activation+5.91e-02

Pos logit contributions
 Kim+0.014
 Su+0.013
 Johnny+0.013
Kim+0.013
 cho+0.012
Neg logit contributions
 conclusions-0.008
ede-0.007
',"-0.007
'."-0.006
-0.006
Token They
Token ID1119
Feature activation-4.50e-02

Pos logit contributions
 Kim+0.001
 Su+0.001
 cho+0.001
 Johnny+0.001
ug+0.001
Neg logit contributions
 conclusions-0.001
growth-0.001
ede-0.001
-0.001
',"-0.001
Token swung
Token ID28507
Feature activation+4.89e-01

Pos logit contributions
 conclusions+0.001
ede+0.001
growth+0.001
+0.001
',"+0.001
Neg logit contributions
 Kim-0.001
 Su-0.001
 cho-0.001
 Johnny-0.001
ug-0.001
Token back
Token ID736
Feature activation-1.78e+00

Pos logit contributions
 Kim+0.012
 Su+0.011
 Johnny+0.011
Kim+0.010
ug+0.010
Neg logit contributions
 conclusions-0.011
growth-0.009
',"-0.009
ede-0.009
-0.009
Token and
Token ID290
Feature activation+7.28e-01

Pos logit contributions
 conclusions+0.043
ede+0.039
growth+0.039
+0.039
',"+0.038
Neg logit contributions
 Kim-0.041
 Su-0.037
 Johnny-0.036
 cho-0.036
Kim-0.035
Token forth
Token ID6071
Feature activation-2.72e+00

Pos logit contributions
 Kim+0.015
 Su+0.014
 Johnny+0.013
ug+0.013
 cho+0.013
Neg logit contributions
 conclusions-0.018
ede-0.017
-0.017
growth-0.016
',"-0.016
Token.
Token ID13
Feature activation-1.67e-03

Pos logit contributions
 conclusions+0.051
ede+0.048
+0.047
',"+0.046
growth+0.045
Neg logit contributions
 Kim-0.075
 Su-0.069
 Johnny-0.067
Kim-0.066
 cho-0.065
Token They
Token ID1119
Feature activation-2.43e-01

Pos logit contributions
 conclusions+0.000
growth+0.000
+0.000
ede+0.000
',"+0.000
Neg logit contributions
 Kim-0.000
 Su-0.000
 cho-0.000
ug-0.000
 Johnny-0.000
Token laughed
Token ID13818
Feature activation-7.79e-01

Pos logit contributions
 conclusions+0.005
ede+0.005
growth+0.005
+0.005
',"+0.004
Neg logit contributions
 Kim-0.006
 Su-0.005
 Johnny-0.005
ug-0.005
 cho-0.005
Token and
Token ID290
Feature activation+7.42e-01

Pos logit contributions
 conclusions+0.013
 moral+0.012
',"+0.012
+0.012
'."+0.012
Neg logit contributions
 Kim-0.022
 Su-0.021
Kim-0.020
Su-0.020
 Liz-0.019
Token talked
Token ID6619
Feature activation-8.52e-01

Pos logit contributions
 Kim+0.016
 Su+0.015
reat+0.015
ug+0.015
 Johnny+0.014
Neg logit contributions
 conclusions-0.017
ede-0.016
-0.015
growth-0.015
 wrists-0.015
Token.
Token ID13
Feature activation+8.58e-01

Pos logit contributions
 Kim+0.001
 Su+0.000
 cho+0.000
 Johnny+0.000
Kim+0.000
Neg logit contributions
 conclusions-0.000
ede-0.000
growth-0.000
-0.000
',"-0.000
Token He
Token ID679
Feature activation-4.66e-01

Pos logit contributions
 Kim+0.018
 cho+0.017
 Su+0.017
ug+0.016
 speeding+0.016
Neg logit contributions
 conclusions-0.022
growth-0.020
-0.019
ede-0.019
',"-0.019
Token swung
Token ID28507
Feature activation+6.74e-01

Pos logit contributions
ede+0.009
 conclusions+0.009
growth+0.009
ils+0.009
 moral+0.008
Neg logit contributions
 Kim-0.012
 Su-0.011
 Johnny-0.010
reat-0.010
Kim-0.010
Token the
Token ID262
Feature activation-1.40e-01

Pos logit contributions
 Kim+0.017
 Su+0.015
 Johnny+0.015
 driving+0.015
ug+0.015
Neg logit contributions
 conclusions-0.014
ede-0.013
growth-0.012
 moral-0.012
-0.012
Token stick
Token ID4859
Feature activation-8.91e-01

Pos logit contributions
 conclusions+0.003
ede+0.003
growth+0.002
+0.002
',"+0.002
Neg logit contributions
 Kim-0.004
 Su-0.004
 Johnny-0.004
 Liz-0.004
Kim-0.003
Token again
Token ID757
Feature activation-2.64e+00

Pos logit contributions
 conclusions+0.018
ede+0.017
 closely+0.016
growth+0.016
 moral+0.016
Neg logit contributions
 Kim-0.024
 Su-0.023
 Johnny-0.022
Kim-0.021
Su-0.020
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
 conclusions+0.069
growth+0.063
ede+0.063
+0.063
',"+0.061
Neg logit contributions
 Kim-0.057
 Su-0.050
 cho-0.049
 Johnny-0.048
ug-0.046
Token again
Token ID757
Feature activation-1.72e+00

Pos logit contributions
 Kim+0.038
reat+0.036
 Su+0.036
ug+0.035
Kim+0.035
Neg logit contributions
 conclusions-0.024
ede-0.023
growth-0.019
 moral-0.018
 mushrooms-0.018
Token.
Token ID13
Feature activation+5.57e-01

Pos logit contributions
 conclusions+0.031
ede+0.027
growth+0.026
+0.026
',"+0.025
Neg logit contributions
 Kim-0.050
 Su-0.047
 cho-0.046
 Johnny-0.045
Kim-0.045
Token He
Token ID679
Feature activation-8.34e-01

Pos logit contributions
 Kim+0.013
 cho+0.013
ug+0.013
 Su+0.013
 speeding+0.012
Neg logit contributions
 conclusions-0.012
growth-0.011
ede-0.010
 mushrooms-0.010
-0.010
Token made
Token ID925
Feature activation+7.97e-02

Pos logit contributions
ede+0.017
 conclusions+0.016
growth+0.015
 wrists+0.015
ils+0.015
Neg logit contributions
 Kim-0.021
 Su-0.020
reat-0.019
 Johnny-0.019
Kim-0.019
Token that
Token ID326
Feature activation+6.35e-01

Pos logit contributions
 Kim+0.006
Kim+0.005
 Su+0.005
 Johnny+0.005
Su+0.005
Neg logit contributions
',"-0.002
 conclusions-0.002
ede-0.002
'-0.002
'."-0.002
Token he
Token ID339
Feature activation-1.25e+00

Pos logit contributions
ug+0.016
 speeding+0.015
 Kim+0.015
coat+0.015
 driving+0.014
Neg logit contributions
 vegetable-0.012
 conclusions-0.012
 mushrooms-0.011
ede-0.011
-0.011
Token used
Token ID973
Feature activation-1.37e-01

Pos logit contributions
 conclusions+0.023
ede+0.022
+0.022
growth+0.022
',"+0.021
Neg logit contributions
 Kim-0.034
 Su-0.031
Kim-0.031
 cho-0.030
ug-0.030
Token to
Token ID284
Feature activation-1.29e-01

Pos logit contributions
 conclusions+0.002
 mushrooms+0.002
growth+0.002
 vegetable+0.002
ils+0.002
Neg logit contributions
 Kim-0.003
Kim-0.003
ug-0.003
 Su-0.003
Su-0.003
Token clean
Token ID3424
Feature activation-4.72e-01

Pos logit contributions
 conclusions+0.003
ede+0.002
growth+0.002
',"+0.002
+0.002
Neg logit contributions
 Kim-0.003
 cho-0.003
 Su-0.003
rol-0.003
ug-0.003
Token himself
Token ID2241
Feature activation-3.19e+00

Pos logit contributions
 conclusions+0.011
ede+0.010
growth+0.010
',"+0.010
+0.010
Neg logit contributions
 Kim-0.011
 Su-0.010
 cho-0.010
Kim-0.009
ug-0.009
Token.
Token ID13
Feature activation-2.47e-01

Pos logit contributions
 conclusions+0.069
ede+0.063
+0.063
',"+0.062
growth+0.060
Neg logit contributions
 Kim-0.083
 Su-0.075
 cho-0.073
Kim-0.072
 Johnny-0.071
Token One
Token ID1881
Feature activation+3.97e-01

Pos logit contributions
 conclusions+0.005
growth+0.004
+0.004
',"+0.004
'."+0.004
Neg logit contributions
 cho-0.006
 Kim-0.006
ug-0.006
coat-0.006
 speeding-0.006
Token day
Token ID1110
Feature activation-1.24e+00

Pos logit contributions
 Kim+0.011
 cho+0.011
rol+0.010
 speeding+0.010
 driving+0.010
Neg logit contributions
 vegetable-0.007
 moral-0.007
 conclusions-0.006
ede-0.006
 end-0.006
Token while
Token ID981
Feature activation-3.29e-01

Pos logit contributions
+0.018
growth+0.018
 mushrooms+0.018
 vegetable+0.017
 conclusions+0.017
Neg logit contributions
rou-0.035
umbling-0.035
 Kim-0.034
coat-0.033
ug-0.033
Token he
Token ID339
Feature activation-2.77e-02

Pos logit contributions
 conclusions+0.007
growth+0.007
+0.007
 vegetable+0.007
 mushrooms+0.007
Neg logit contributions
coat-0.007
reat-0.007
rol-0.007
umbling-0.007
 Kim-0.007