TOP ACTIVATIONS
MAX = 0.264

still both go in different directions , but we must take
happily scattered off in different directions . As he headed home
, following the squirrel 's directions . But when he got
flexible and moved in many directions . He tried to reach
and they all followed her directions . Everyone said they had
two brothers scattered in different directions and ran after the butterfly
. He hopped in different directions , using his strong legs
boat moved fast and changed directions . The sail was red
to them and asked for directions . The stranger had a
me and yield to my directions ." The mouse
was just lost and needed directions . L
up and ran in different directions . After
that stuck out in all directions and an old straw hat
lady â s directions and soon he found his
shone the flashlight in all directions and saw something shiny .
the cricket ball in all directions . His friends would cheer
brush his hair in different directions with both com bs .
heard lots of rules and directions . This made him more
pulled the sack in different directions . The children changed the
and went in the different directions . The little

MIN ACTIVATIONS
MIN = -0.296

big salad with carrots , cuc umbers , and tomatoes .
house . They wash the cuc umber and cut it into
thank you . We love cuc umbers . And we love
the bowl . Lucy put cuc umbers , tomatoes and some
tomato , and then a cuc umber . Lily and Tom
. And this is a cuc umber . It is green
and the peppers from the cuc umbers . It took a
some y ummy carrots and cuc umbers from the farmer 's
They were filled with fresh cuc umber and cheese . It
chopped up some tomatoes , cuc umbers , and lettuce .
cuc umber . Mama puts cuc umbers in our salad .
I think it is a cuc umber . Mama puts cuc
happy to see our big cuc umber ." Lily says .
cut the lettuce and the cuc umber . She said he
saw carrots , potatoes , cuc umbers and tomatoes . He
like lettuce , tomatoes and cuc umbers . He smiled as
garden . They cut the cuc umber and put it in
. You found a nice cuc umber . It will be
John chose some carrots and cuc umbers to take home .
has lettuce and tomatoes and cuc umbers . And it has

INTERVAL 0.124 - 0.264
CONTAINS 0.524%

and play music on the street . They wear colorful clothes
that they had made it themselves and they felt safe knowing
and play with it that way !" Bob o
, " Max led the way and we found a switch
have an idea ," said Billy . " Let 's save

INTERVAL -0.016 - 0.124
CONTAINS 69.105%

that he took with him everywhere . One day , his
It just moved slowly and gently . " Maybe
. Mary quickly ran to the window to see what had
The dream maker smiled , and began telling Amy all the
for an adventure . One day , Sam went for a

INTERVAL -0.156 - -0.016
CONTAINS 30.085%

but you must do something for me first ". The k
to help . He said , " Come ,
went over to him and asked what was wrong . The
there was a fish named Nem o . Nem o loved
boy . They had an idea . They said , "

INTERVAL -0.296 - -0.156
CONTAINS 0.287%

says . He knows the v ase is not a toy
dream about luxury . She dreamed of having a huge house
button . The tooth brush starts to buzz and
She wanted to see the soup . " NO
deaf man selling l oll ip ops . Jane thought it
Token still
Token ID991
Feature activation-3.12e-02

Pos logit contributions
urance+0.073
inger+0.069
ady+0.069
 property+0.068
keeping+0.066
Neg logit contributions
 Sp-0.051
 cuc-0.046
 Chick-0.044
ie-0.040
 o-0.040
Token both
Token ID1111
Feature activation+8.53e-02

Pos logit contributions
 Sp+0.043
 cuc+0.041
 o+0.038
 Chick+0.038
ie+0.037
Neg logit contributions
urance-0.058
Jennifer-0.055
Greg-0.054
inger-0.054
ady-0.053
Token go
Token ID467
Feature activation-2.46e-02

Pos logit contributions
urance+0.155
Jennifer+0.145
Greg+0.145
ady+0.142
inger+0.142
Neg logit contributions
 Sp-0.121
 cuc-0.115
 Chick-0.108
 o-0.107
ie-0.104
Token in
Token ID287
Feature activation+4.35e-02

Pos logit contributions
 Sp+0.030
 cuc+0.028
 o+0.027
ie+0.026
 Chick+0.026
Neg logit contributions
urance-0.050
Greg-0.047
Jennifer-0.047
inger-0.046
ady-0.046
Token different
Token ID1180
Feature activation+6.60e-02

Pos logit contributions
urance+0.093
Greg+0.089
Jennifer+0.089
inger+0.087
ady+0.086
Neg logit contributions
 Sp-0.047
 cuc-0.045
 o-0.042
ie-0.041
 Chick-0.040
Token directions
Token ID11678
Feature activation+2.62e-01

Pos logit contributions
urance+0.091
Greg+0.085
Jennifer+0.085
inger+0.083
Jerry+0.081
Neg logit contributions
 Sp-0.123
 cuc-0.119
 o-0.117
ie-0.115
 if-0.112
Token,
Token ID11
Feature activation+1.83e-02

Pos logit contributions
urance+0.555
Greg+0.529
Jennifer+0.527
inger+0.520
ady+0.515
Neg logit contributions
 Sp-0.290
 cuc-0.274
 o-0.258
ie-0.248
 Chick-0.248
Token but
Token ID475
Feature activation+1.32e-02

Pos logit contributions
urance+0.036
Jennifer+0.035
Greg+0.035
ady+0.033
inger+0.033
Neg logit contributions
 Sp-0.022
 cuc-0.022
ie-0.020
 o-0.020
 Chick-0.020
Token we
Token ID356
Feature activation+5.81e-03

Pos logit contributions
urance+0.026
Greg+0.026
Jennifer+0.026
Jerry+0.025
inger+0.025
Neg logit contributions
 cuc-0.015
 Sp-0.015
 Chick-0.013
 o-0.013
ie-0.013
Token must
Token ID1276
Feature activation-1.15e-02

Pos logit contributions
urance+0.010
Greg+0.009
Jennifer+0.009
inger+0.009
ady+0.009
Neg logit contributions
 Sp-0.009
 cuc-0.009
 o-0.008
 Chick-0.008
ie-0.008
Token take
Token ID1011
Feature activation+3.89e-02

Pos logit contributions
 o+0.015
 please+0.015
 if+0.015
 Sp+0.015
?"+0.014
Neg logit contributions
Greg-0.021
Jennifer-0.021
urance-0.021
Jerry-0.020
ady-0.020
Token happily
Token ID18177
Feature activation-2.95e-02

Pos logit contributions
urance+0.024
Greg+0.023
Jennifer+0.023
inger+0.022
Jerry+0.022
Neg logit contributions
 Sp-0.020
 cuc-0.020
 Chick-0.018
 o-0.018
ie-0.018
Token scattered
Token ID16830
Feature activation+4.46e-02

Pos logit contributions
 Sp+0.044
 cuc+0.042
 o+0.041
ie+0.040
 Chick+0.039
Neg logit contributions
urance-0.052
Jennifer-0.049
Greg-0.049
inger-0.047
ady-0.047
Token off
Token ID572
Feature activation+1.35e-01

Pos logit contributions
urance+0.093
Greg+0.088
Jennifer+0.088
inger+0.087
ady+0.086
Neg logit contributions
 Sp-0.052
 cuc-0.049
 o-0.047
ie-0.045
 Chick-0.044
Token in
Token ID287
Feature activation+4.10e-02

Pos logit contributions
urance+0.296
Greg+0.284
Jennifer+0.282
inger+0.278
ady+0.275
Neg logit contributions
 Sp-0.139
 cuc-0.130
 o-0.122
ie-0.118
 Chick-0.116
Token different
Token ID1180
Feature activation+5.93e-02

Pos logit contributions
urance+0.097
Greg+0.093
inger+0.092
Jennifer+0.092
ady+0.090
Neg logit contributions
 Sp-0.037
 o-0.033
 cuc-0.033
ie-0.030
 if-0.030
Token directions
Token ID11678
Feature activation+2.57e-01

Pos logit contributions
urance+0.080
Greg+0.076
Jennifer+0.075
inger+0.074
Jerry+0.072
Neg logit contributions
 Sp-0.112
 cuc-0.109
 o-0.109
ie-0.107
 if-0.103
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
urance+0.573
Greg+0.553
Jennifer+0.552
inger+0.542
ady+0.534
Neg logit contributions
 Sp-0.255
 cuc-0.234
 o-0.232
ie-0.227
 if-0.219
Token As
Token ID1081
Feature activation+7.39e-02

Pos logit contributions
urance+0.020
Greg+0.019
Jennifer+0.019
inger+0.018
ady+0.018
Neg logit contributions
 Sp-0.014
 cuc-0.013
 o-0.013
ie-0.012
 Chick-0.012
Token he
Token ID339
Feature activation-9.60e-02

Pos logit contributions
urance+0.170
Greg+0.169
Jennifer+0.167
Jerry+0.162
inger+0.160
Neg logit contributions
 cuc-0.063
 Sp-0.061
 o-0.052
ie-0.050
ware-0.050
Token headed
Token ID9153
Feature activation-1.76e-02

Pos logit contributions
 Sp+0.140
 cuc+0.139
 Chick+0.126
 if+0.122
 o+0.122
Neg logit contributions
urance-0.163
Greg-0.160
Jennifer-0.156
ady-0.152
inger-0.152
Token home
Token ID1363
Feature activation+2.71e-02

Pos logit contributions
 Sp+0.024
 cuc+0.023
 o+0.022
ie+0.022
 Chick+0.021
Neg logit contributions
urance-0.033
Greg-0.031
Jennifer-0.031
inger-0.031
ady-0.031
Token,
Token ID11
Feature activation+1.31e-02

Pos logit contributions
urance+0.082
Jennifer+0.077
inger+0.077
ady+0.076
Greg+0.076
Neg logit contributions
 Sp-0.053
 cuc-0.050
 Chick-0.048
 o-0.046
ie-0.044
Token following
Token ID1708
Feature activation+7.41e-02

Pos logit contributions
urance+0.026
Jennifer+0.025
Greg+0.025
Jerry+0.024
ady+0.024
Neg logit contributions
 Sp-0.015
 cuc-0.015
ie-0.013
 Chick-0.013
ware-0.013
Token the
Token ID262
Feature activation+1.63e-03

Pos logit contributions
urance+0.166
Greg+0.159
Jennifer+0.158
inger+0.156
ady+0.154
Neg logit contributions
 Sp-0.075
 cuc-0.069
 o-0.066
ie-0.064
 Chick-0.062
Token squirrel
Token ID33039
Feature activation+4.15e-02

Pos logit contributions
urance+0.003
Greg+0.002
Jennifer+0.002
inger+0.002
ady+0.002
Neg logit contributions
 Sp-0.003
 cuc-0.002
 Chick-0.002
ie-0.002
 o-0.002
Token's
Token ID338
Feature activation+2.77e-02

Pos logit contributions
urance+0.087
Greg+0.083
Jennifer+0.083
inger+0.083
ady+0.082
Neg logit contributions
 Sp-0.046
 cuc-0.044
 o-0.041
 Chick-0.039
ie-0.039
Token directions
Token ID11678
Feature activation+2.57e-01

Pos logit contributions
urance+0.037
Greg+0.035
Jennifer+0.034
inger+0.033
Jerry+0.033
Neg logit contributions
 Sp-0.053
 cuc-0.051
 o-0.051
ie-0.049
 Chick-0.048
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
urance+0.559
Greg+0.535
Jennifer+0.533
inger+0.525
ady+0.519
Neg logit contributions
 Sp-0.271
 cuc-0.254
 o-0.241
ie-0.233
 Chick-0.228
Token But
Token ID887
Feature activation+1.37e-02

Pos logit contributions
urance+0.027
Greg+0.026
Jennifer+0.026
Jerry+0.025
ady+0.025
Neg logit contributions
 Sp-0.026
 cuc-0.026
 o-0.026
ie-0.026
 Chick-0.024
Token when
Token ID618
Feature activation-2.07e-02

Pos logit contributions
urance+0.027
Greg+0.026
Jennifer+0.026
Jerry+0.025
inger+0.025
Neg logit contributions
 cuc-0.015
 Sp-0.014
ware-0.013
 o-0.013
ie-0.013
Token he
Token ID339
Feature activation-9.89e-02

Pos logit contributions
 cuc+0.021
ware+0.019
 Sp+0.019
ie+0.019
 o+0.018
Neg logit contributions
urance-0.042
Greg-0.042
Jennifer-0.041
Jerry-0.040
 way-0.039
Token got
Token ID1392
Feature activation-1.33e-02

Pos logit contributions
 Sp+0.158
 cuc+0.157
 Chick+0.153
ectar+0.147
ware+0.141
Neg logit contributions
Greg-0.145
urance-0.145
Jennifer-0.141
Jerry-0.130
ady-0.129
Token flexible
Token ID12846
Feature activation+2.03e-02

Pos logit contributions
 Sp+0.043
 cuc+0.040
ie+0.039
ware+0.038
 Chick+0.038
Neg logit contributions
urance-0.039
ady-0.039
inger-0.037
Greg-0.037
Jennifer-0.036
Token and
Token ID290
Feature activation+2.11e-02

Pos logit contributions
urance+0.039
Greg+0.037
Jennifer+0.036
inger+0.036
ady+0.036
Neg logit contributions
 Sp-0.025
 cuc-0.024
 Chick-0.022
 o-0.021
ie-0.021
Token moved
Token ID3888
Feature activation-1.36e-02

Pos logit contributions
urance+0.034
Greg+0.032
Jennifer+0.032
inger+0.032
Jerry+0.032
Neg logit contributions
 Sp-0.030
 cuc-0.030
ie-0.029
 Chick-0.027
 o-0.027
Token in
Token ID287
Feature activation+4.29e-02

Pos logit contributions
 Sp+0.017
 cuc+0.017
 Chick+0.015
ie+0.015
 o+0.015
Neg logit contributions
urance-0.025
Greg-0.024
inger-0.024
ady-0.024
Jennifer-0.024
Token many
Token ID867
Feature activation+3.53e-02

Pos logit contributions
urance+0.090
Greg+0.086
Jennifer+0.086
inger+0.085
ady+0.084
Neg logit contributions
 Sp-0.049
 cuc-0.046
 o-0.044
ie-0.042
 Chick-0.042
Token directions
Token ID11678
Feature activation+2.57e-01

Pos logit contributions
urance+0.057
Greg+0.054
Jennifer+0.053
inger+0.053
Jerry+0.051
Neg logit contributions
 Sp-0.058
 cuc-0.056
 o-0.055
ie-0.054
 if-0.052
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
urance+0.543
Greg+0.517
Jennifer+0.515
inger+0.508
ady+0.503
Neg logit contributions
 Sp-0.287
 cuc-0.271
 o-0.256
ie-0.247
 Chick-0.245
Token He
Token ID679
Feature activation-8.01e-03

Pos logit contributions
urance+0.037
Greg+0.035
Jennifer+0.035
ady+0.034
inger+0.034
Neg logit contributions
 Sp-0.033
 cuc-0.033
 o-0.031
ie-0.031
 Chick-0.030
Token tried
Token ID3088
Feature activation+5.74e-02

Pos logit contributions
 Sp+0.012
 cuc+0.012
 Chick+0.011
ie+0.011
 o+0.011
Neg logit contributions
urance-0.012
Greg-0.012
ady-0.012
Jennifer-0.012
inger-0.012
Token to
Token ID284
Feature activation-1.16e-02

Pos logit contributions
urance+0.129
Greg+0.124
Jennifer+0.123
inger+0.121
ady+0.120
Neg logit contributions
 Sp-0.056
 cuc-0.052
 o-0.049
ie-0.048
 Chick-0.047
Token reach
Token ID3151
Feature activation-3.71e-02

Pos logit contributions
 Sp+0.017
 cuc+0.017
 o+0.016
ie+0.016
 Chick+0.015
Neg logit contributions
urance-0.020
Greg-0.019
Jennifer-0.019
inger-0.019
ady-0.018
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
 cuc+0.001
 Chick+0.001
 Sp+0.001
 ju+0.001
 Jos+0.001
Neg logit contributions
agging-0.002
ags-0.002
Jennifer-0.001
urance-0.001
Jerry-0.001
Token they
Token ID484
Feature activation-6.97e-03

Pos logit contributions
urance+0.015
Jennifer+0.014
Jerry+0.014
Greg+0.014
inger+0.013
Neg logit contributions
 Sp-0.018
 cuc-0.018
 o-0.017
ware-0.017
 Chick-0.017
Token all
Token ID477
Feature activation+1.20e-02

Pos logit contributions
 Sp+0.011
 cuc+0.011
ware+0.010
 Chick+0.010
 o+0.010
Neg logit contributions
urance-0.010
Greg-0.010
Jennifer-0.010
Jerry-0.009
inger-0.009
Token followed
Token ID3940
Feature activation+4.44e-02

Pos logit contributions
urance+0.021
Greg+0.020
Jennifer+0.020
inger+0.020
ady+0.019
Neg logit contributions
 Sp-0.018
 cuc-0.017
 o-0.017
ie-0.017
 Chick-0.016
Token her
Token ID607
Feature activation+4.47e-03

Pos logit contributions
urance+0.096
inger+0.091
Greg+0.091
Jennifer+0.091
ady+0.090
Neg logit contributions
 Sp-0.050
 cuc-0.044
 o-0.044
ie-0.043
 Chick-0.042
Token directions
Token ID11678
Feature activation+2.57e-01

Pos logit contributions
urance+0.008
Jennifer+0.008
Greg+0.008
inger+0.008
ady+0.008
Neg logit contributions
 Sp-0.006
 cuc-0.006
 o-0.006
ie-0.005
 Chick-0.005
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
urance+0.560
Greg+0.537
Jennifer+0.535
inger+0.527
ady+0.521
Neg logit contributions
 Sp-0.268
 cuc-0.250
 o-0.239
ie-0.232
 Chick-0.225
Token Everyone
Token ID11075
Feature activation+7.45e-02

Pos logit contributions
urance+0.032
Greg+0.030
Jennifer+0.030
inger+0.029
ady+0.029
Neg logit contributions
 Sp-0.026
 cuc-0.025
 o-0.024
ie-0.023
 Chick-0.023
Token said
Token ID531
Feature activation-2.86e-02

Pos logit contributions
urance+0.154
Jennifer+0.146
Greg+0.146
inger+0.144
ady+0.141
Neg logit contributions
 Sp-0.089
 cuc-0.083
 o-0.081
ie-0.079
 Chick-0.075
Token they
Token ID484
Feature activation-2.41e-03

Pos logit contributions
 Sp+0.029
 cuc+0.028
 o+0.025
ie+0.025
 Chick+0.025
Neg logit contributions
urance-0.063
Greg-0.060
Jennifer-0.060
inger-0.057
Jerry-0.057
Token had
Token ID550
Feature activation+1.03e-01

Pos logit contributions
 Sp+0.003
 cuc+0.003
 o+0.003
ie+0.003
 Chick+0.003
Neg logit contributions
urance-0.005
Greg-0.005
Jennifer-0.005
inger-0.004
ady-0.004
Token two
Token ID734
Feature activation+1.43e-01

Pos logit contributions
urance+0.027
Greg+0.026
Jennifer+0.026
inger+0.025
ady+0.025
Neg logit contributions
 Sp-0.022
 cuc-0.021
 o-0.020
ie-0.020
 Chick-0.019
Token brothers
Token ID9397
Feature activation+7.30e-03

Pos logit contributions
urance+0.273
Greg+0.260
Jennifer+0.259
inger+0.254
ady+0.252
Neg logit contributions
 Sp-0.188
 cuc-0.178
 o-0.170
 Chick-0.165
ie-0.165
Token scattered
Token ID16830
Feature activation+4.58e-02

Pos logit contributions
urance+0.014
Jennifer+0.014
inger+0.014
Greg+0.014
Jerry+0.013
Neg logit contributions
 Sp-0.009
 cuc-0.009
 o-0.009
ie-0.009
 if-0.008
Token in
Token ID287
Feature activation+4.39e-02

Pos logit contributions
urance+0.100
Jennifer+0.095
Greg+0.094
inger+0.094
ady+0.092
Neg logit contributions
 Sp-0.051
 cuc-0.048
 o-0.046
ie-0.044
 if-0.042
Token different
Token ID1180
Feature activation+5.51e-02

Pos logit contributions
urance+0.111
inger+0.107
Jennifer+0.106
Greg+0.103
ady+0.103
Neg logit contributions
 Sp-0.037
 cuc-0.031
 o-0.031
 if-0.029
 Chick-0.027
Token directions
Token ID11678
Feature activation+2.54e-01

Pos logit contributions
urance+0.094
Greg+0.091
Jennifer+0.090
inger+0.089
Jerry+0.088
Neg logit contributions
 Sp-0.087
 cuc-0.086
 o-0.084
ie-0.083
 if-0.079
Token and
Token ID290
Feature activation+2.11e-02

Pos logit contributions
urance+0.583
Jennifer+0.573
Greg+0.570
inger+0.561
Jerry+0.554
Neg logit contributions
 Sp-0.235
 cuc-0.212
 o-0.211
ie-0.208
 if-0.207
Token ran
Token ID4966
Feature activation+6.50e-02

Pos logit contributions
urance+0.037
Greg+0.035
Jennifer+0.034
inger+0.034
ady+0.033
Neg logit contributions
 Sp-0.032
 cuc-0.030
 o-0.029
ie-0.029
 Chick-0.028
Token after
Token ID706
Feature activation-3.52e-02

Pos logit contributions
urance+0.159
Jennifer+0.156
inger+0.152
Jerry+0.150
ady+0.148
Neg logit contributions
 Sp-0.052
 o-0.049
 if-0.047
 even-0.046
 he-0.045
Token the
Token ID262
Feature activation+5.35e-03

Pos logit contributions
 Sp+0.037
 cuc+0.035
 o+0.033
ie+0.032
 Chick+0.031
Neg logit contributions
urance-0.077
Greg-0.074
Jennifer-0.073
inger-0.072
ady-0.071
Token butterfly
Token ID35113
Feature activation+3.73e-03

Pos logit contributions
urance+0.009
Jennifer+0.009
Greg+0.009
inger+0.009
ady+0.008
Neg logit contributions
 Sp-0.008
 cuc-0.008
 o-0.008
ie-0.007
 Chick-0.007
Token.
Token ID13
Feature activation+4.50e-03

Pos logit contributions
 Sp+0.023
 cuc+0.022
 o+0.021
ie+0.021
 Chick+0.020
Neg logit contributions
urance-0.033
Greg-0.031
Jennifer-0.031
inger-0.031
ady-0.030
Token He
Token ID679
Feature activation-1.49e-02

Pos logit contributions
urance+0.008
Greg+0.008
Jennifer+0.008
inger+0.008
ady+0.008
Neg logit contributions
 Sp-0.006
 cuc-0.006
 o-0.006
ie-0.005
 Chick-0.005
Token hopped
Token ID45230
Feature activation-5.45e-02

Pos logit contributions
 Sp+0.021
 cuc+0.021
 Chick+0.019
ie+0.019
 o+0.019
Neg logit contributions
urance-0.025
Greg-0.025
Jennifer-0.025
Jerry-0.024
ady-0.024
Token in
Token ID287
Feature activation+3.59e-02

Pos logit contributions
 Sp+0.072
 cuc+0.069
 o+0.065
ie+0.064
 Chick+0.064
Neg logit contributions
urance-0.103
Greg-0.100
Jennifer-0.099
inger-0.097
ady-0.096
Token different
Token ID1180
Feature activation+6.06e-02

Pos logit contributions
urance+0.082
Greg+0.078
Jennifer+0.078
inger+0.077
ady+0.076
Neg logit contributions
 Sp-0.035
 cuc-0.032
 o-0.031
ie-0.029
 Chick-0.028
Token directions
Token ID11678
Feature activation+2.53e-01

Pos logit contributions
urance+0.074
Greg+0.068
Jennifer+0.067
inger+0.065
Jerry+0.063
Neg logit contributions
 Sp-0.124
 cuc-0.121
 o-0.119
ie-0.117
 if-0.113
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
urance+0.566
Greg+0.547
Jennifer+0.547
inger+0.537
Jerry+0.528
Neg logit contributions
 Sp-0.252
 cuc-0.232
 o-0.225
ie-0.219
 if-0.215
Token using
Token ID1262
Feature activation+2.92e-02

Pos logit contributions
urance+0.030
Greg+0.028
Jennifer+0.028
inger+0.028
ady+0.027
Neg logit contributions
 Sp-0.022
 cuc-0.021
 o-0.020
ie-0.020
 Chick-0.020
Token his
Token ID465
Feature activation+7.24e-03

Pos logit contributions
urance+0.067
Jennifer+0.064
Greg+0.064
inger+0.063
ady+0.062
Neg logit contributions
 Sp-0.028
 cuc-0.026
 o-0.025
ie-0.023
 Chick-0.023
Token strong
Token ID1913
Feature activation-1.61e-02

Pos logit contributions
urance+0.011
Greg+0.011
Jennifer+0.011
inger+0.010
Jerry+0.010
Neg logit contributions
 Sp-0.012
 cuc-0.012
 o-0.012
ie-0.011
 if-0.011
Token legs
Token ID7405
Feature activation+8.08e-02

Pos logit contributions
 cuc+0.027
 Sp+0.027
 o+0.027
ie+0.025
 cel+0.025
Neg logit contributions
Greg-0.024
urance-0.024
Jennifer-0.023
Jerry-0.022
inger-0.022
Token boat
Token ID8848
Feature activation-5.41e-02

Pos logit contributions
urance+0.035
Greg+0.033
Jennifer+0.033
inger+0.032
ady+0.032
Neg logit contributions
 Sp-0.031
 cuc-0.029
 Chick-0.028
ie-0.028
 o-0.028
Token moved
Token ID3888
Feature activation-2.01e-02

Pos logit contributions
 Sp+0.078
 cuc+0.074
 Chick+0.070
 o+0.067
ie+0.067
Neg logit contributions
urance-0.095
Greg-0.090
inger-0.089
Jennifer-0.088
ady-0.087
Token fast
Token ID3049
Feature activation+9.23e-02

Pos logit contributions
 Sp+0.025
 cuc+0.024
 o+0.023
ie+0.022
 Chick+0.022
Neg logit contributions
urance-0.039
Greg-0.038
Jennifer-0.037
inger-0.037
ady-0.037
Token and
Token ID290
Feature activation+1.37e-02

Pos logit contributions
urance+0.196
Greg+0.187
Jennifer+0.187
inger+0.184
ady+0.182
Neg logit contributions
 Sp-0.103
 cuc-0.097
 o-0.092
ie-0.089
 Chick-0.087
Token changed
Token ID3421
Feature activation+5.23e-02

Pos logit contributions
Greg+0.021
urance+0.021
Jennifer+0.020
Jerry+0.020
inger+0.020
Neg logit contributions
 cuc-0.021
 Sp-0.020
ie-0.020
 o-0.019
ware-0.019
Token directions
Token ID11678
Feature activation+2.50e-01

Pos logit contributions
urance+0.103
Greg+0.098
Jennifer+0.098
inger+0.097
ady+0.096
Neg logit contributions
 Sp-0.065
 cuc-0.062
 o-0.058
ie-0.057
 Chick-0.056
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
urance+0.524
Greg+0.496
Jennifer+0.494
inger+0.489
ady+0.485
Neg logit contributions
 Sp-0.283
 cuc-0.271
 o-0.251
 Chick-0.243
ie-0.241
Token The
Token ID383
Feature activation+1.46e-02

Pos logit contributions
urance+0.023
Greg+0.022
Jennifer+0.022
Jerry+0.021
ady+0.021
Neg logit contributions
 cuc-0.021
 Sp-0.021
ie-0.020
 o-0.020
 Chick-0.019
Token sail
Token ID14936
Feature activation-3.30e-02

Pos logit contributions
urance+0.029
Greg+0.028
Jennifer+0.028
inger+0.027
ady+0.027
Neg logit contributions
 Sp-0.018
 cuc-0.017
 o-0.017
ie-0.016
 Chick-0.016
Token was
Token ID373
Feature activation+2.42e-02

Pos logit contributions
 Sp+0.046
 cuc+0.043
 Chick+0.041
 o+0.040
ie+0.038
Neg logit contributions
urance-0.059
Greg-0.057
inger-0.057
Jennifer-0.056
ady-0.055
Token red
Token ID2266
Feature activation+3.70e-02

Pos logit contributions
urance+0.047
Jennifer+0.045
Greg+0.045
inger+0.044
Jerry+0.043
Neg logit contributions
 Sp-0.031
 o-0.029
 cuc-0.029
 if-0.028
ie-0.027
Token to
Token ID284
Feature activation-1.84e-02

Pos logit contributions
urance+0.196
Greg+0.192
Jennifer+0.191
inger+0.188
ady+0.186
Neg logit contributions
 Sp-0.098
 cuc-0.094
 o-0.086
 Chick-0.084
ie-0.084
Token them
Token ID606
Feature activation+8.10e-02

Pos logit contributions
 Sp+0.022
 cuc+0.021
 o+0.019
ie+0.019
 Chick+0.019
Neg logit contributions
urance-0.037
Jennifer-0.036
Greg-0.036
Jerry-0.034
inger-0.034
Token and
Token ID290
Feature activation+1.17e-02

Pos logit contributions
urance+0.175
Greg+0.168
Jennifer+0.168
inger+0.165
ady+0.162
Neg logit contributions
 Sp-0.087
 cuc-0.081
 o-0.079
ie-0.076
 Chick-0.074
Token asked
Token ID1965
Feature activation-2.26e-02

Pos logit contributions
Greg+0.020
Jennifer+0.020
urance+0.020
Jerry+0.020
aren+0.019
Neg logit contributions
 Sp-0.016
 cuc-0.016
ectar-0.014
ie-0.014
 o-0.014
Token for
Token ID329
Feature activation-1.01e-01

Pos logit contributions
 cuc+0.024
 Sp+0.024
 o+0.022
 Chick+0.021
ware+0.020
Neg logit contributions
urance-0.046
Greg-0.045
Jennifer-0.044
Jerry-0.043
ady-0.042
Token directions
Token ID11678
Feature activation+2.48e-01

Pos logit contributions
 Sp+0.130
 cuc+0.124
 o+0.118
ie+0.115
 Chick+0.113
Neg logit contributions
urance-0.198
Greg-0.189
Jennifer-0.188
inger-0.184
ady-0.182
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
urance+0.471
Greg+0.446
Jennifer+0.443
ady+0.436
inger+0.433
Neg logit contributions
 cuc-0.321
 Sp-0.319
 o-0.281
 Chick-0.281
ie-0.260
Token The
Token ID383
Feature activation+1.57e-02

Pos logit contributions
urance+0.026
Jennifer+0.025
Greg+0.025
Jerry+0.025
inger+0.023
Neg logit contributions
 cuc-0.020
ie-0.020
 o-0.019
 Sp-0.019
ware-0.017
Token stranger
Token ID16195
Feature activation+6.33e-02

Pos logit contributions
urance+0.030
inger+0.028
ady+0.027
Jennifer+0.026
Greg+0.026
Neg logit contributions
 o-0.023
 he-0.022
 cuc-0.021
 if-0.021
ie-0.021
Token had
Token ID550
Feature activation+1.01e-01

Pos logit contributions
urance+0.118
Greg+0.115
Jennifer+0.113
Jerry+0.110
ady+0.109
Neg logit contributions
 Sp-0.083
 cuc-0.081
 o-0.075
 Chick-0.073
ie-0.073
Token a
Token ID257
Feature activation-1.91e-02

Pos logit contributions
urance+0.221
inger+0.208
Greg+0.208
Jennifer+0.207
ady+0.202
Neg logit contributions
 Sp-0.108
 o-0.103
 cuc-0.100
 Chick-0.092
 if-0.090
Token me
Token ID502
Feature activation-1.16e-03

Pos logit contributions
urance+0.169
Jennifer+0.164
Greg+0.163
inger+0.160
ady+0.156
Neg logit contributions
 cuc-0.118
 Sp-0.117
 o-0.106
ie-0.102
 Chick-0.102
Token and
Token ID290
Feature activation+1.55e-02

Pos logit contributions
 Sp+0.001
 cuc+0.001
 o+0.001
ie+0.001
 Chick+0.001
Neg logit contributions
urance-0.002
Greg-0.002
inger-0.002
Jennifer-0.002
ady-0.002
Token yield
Token ID7800
Feature activation+5.49e-02

Pos logit contributions
urance+0.025
Jennifer+0.024
Greg+0.024
Jerry+0.022
inger+0.022
Neg logit contributions
 Sp-0.024
 cuc-0.024
ie-0.023
 o-0.022
ware-0.022
Token to
Token ID284
Feature activation-1.36e-02

Pos logit contributions
urance+0.112
Greg+0.108
Jennifer+0.107
inger+0.105
ady+0.105
Neg logit contributions
 Sp-0.062
 cuc-0.060
 o-0.055
 Chick-0.055
ie-0.054
Token my
Token ID616
Feature activation+5.07e-02

Pos logit contributions
 Sp+0.017
 cuc+0.017
ie+0.015
 o+0.015
 Chick+0.015
Neg logit contributions
urance-0.026
Jennifer-0.025
Greg-0.025
inger-0.024
ady-0.024
Token directions
Token ID11678
Feature activation+2.46e-01

Pos logit contributions
urance+0.075
Jennifer+0.068
Greg+0.068
inger+0.067
ady+0.066
Neg logit contributions
 Sp-0.089
 cuc-0.085
ie-0.082
 Chick-0.082
 o-0.080
Token."
Token ID526
Feature activation-2.23e-02

Pos logit contributions
urance+0.489
inger+0.455
ady+0.454
Jennifer+0.453
Greg+0.452
Neg logit contributions
 Sp-0.309
 cuc-0.297
 Chick-0.272
 o-0.270
ie-0.258
Token
Token ID198
Feature activation+1.57e-03

Pos logit contributions
 Sp+0.033
 cuc+0.033
 o+0.032
ie+0.031
 Chick+0.029
Neg logit contributions
urance-0.038
Jennifer-0.036
Greg-0.036
Jerry-0.035
inger-0.035
Token
Token ID198
Feature activation+1.65e-03

Pos logit contributions
urance+0.003
Jennifer+0.003
Greg+0.003
Jerry+0.003
inger+0.003
Neg logit contributions
 cuc-0.002
 Sp-0.002
 o-0.002
ie-0.002
 Chick-0.002
TokenThe
Token ID464
Feature activation+3.16e-02

Pos logit contributions
urance+0.002
Greg+0.002
Jennifer+0.002
inger+0.002
Jerry+0.002
Neg logit contributions
 Sp-0.003
 cuc-0.003
 o-0.003
ie-0.003
 Chick-0.003
Token mouse
Token ID10211
Feature activation-7.91e-02

Pos logit contributions
urance+0.057
Greg+0.054
Jennifer+0.053
inger+0.053
ady+0.052
Neg logit contributions
 Sp-0.046
 cuc-0.044
 o-0.043
ie-0.041
 Chick-0.040
Token was
Token ID373
Feature activation+2.04e-02

Pos logit contributions
 Sp+0.115
 cuc+0.112
 o+0.104
ie+0.100
 Chick+0.099
Neg logit contributions
urance-0.204
Greg-0.196
Jennifer-0.194
ady-0.190
inger-0.190
Token just
Token ID655
Feature activation-6.76e-02

Pos logit contributions
urance+0.037
Jennifer+0.035
Greg+0.035
inger+0.034
ady+0.033
Neg logit contributions
 Sp-0.030
 cuc-0.028
 o-0.028
ie-0.027
 Chick-0.026
Token lost
Token ID2626
Feature activation+4.30e-02

Pos logit contributions
 Sp+0.099
 cuc+0.094
 o+0.091
ie+0.089
 Chick+0.087
Neg logit contributions
urance-0.121
Greg-0.114
Jennifer-0.114
inger-0.112
ady-0.110
Token and
Token ID290
Feature activation+7.84e-03

Pos logit contributions
urance+0.093
Greg+0.089
Jennifer+0.089
inger+0.088
ady+0.087
Neg logit contributions
 Sp-0.046
 cuc-0.043
 o-0.042
ie-0.040
 Chick-0.039
Token needed
Token ID2622
Feature activation-2.27e-02

Pos logit contributions
urance+0.015
Jerry+0.015
Greg+0.015
Jennifer+0.015
inger+0.014
Neg logit contributions
 cuc-0.009
 Sp-0.009
ware-0.008
 o-0.008
berry-0.007
Token directions
Token ID11678
Feature activation+2.46e-01

Pos logit contributions
 Sp+0.027
 cuc+0.026
 o+0.025
ie+0.024
 Chick+0.024
Neg logit contributions
urance-0.047
Greg-0.044
Jennifer-0.044
inger-0.044
ady-0.043
Token.
Token ID13
Feature activation+6.00e-03

Pos logit contributions
urance+0.533
Greg+0.509
Jennifer+0.507
inger+0.499
ady+0.495
Neg logit contributions
 Sp-0.261
 cuc-0.248
 o-0.231
ie-0.222
 Chick-0.221
Token
Token ID220
Feature activation+2.25e-02

Pos logit contributions
urance+0.010
Jennifer+0.009
Greg+0.009
Jerry+0.009
aren+0.009
Neg logit contributions
 cuc-0.009
ie-0.009
 Sp-0.009
 o-0.009
 Chick-0.008
Token
Token ID198
Feature activation-1.03e-03

Pos logit contributions
urance+0.047
Greg+0.044
Jennifer+0.044
inger+0.043
ady+0.043
Neg logit contributions
 Sp-0.027
 cuc-0.024
 o-0.023
ie-0.023
 Chick-0.023
Token
Token ID198
Feature activation-8.40e-04

Pos logit contributions
 Sp+0.001
 cuc+0.001
 o+0.001
ie+0.001
 Chick+0.001
Neg logit contributions
urance-0.002
Greg-0.002
Jennifer-0.002
Jerry-0.002
inger-0.002
TokenL
Token ID43
Feature activation-8.02e-02

Pos logit contributions
 Sp+0.001
 cuc+0.001
 o+0.001
ie+0.001
 Chick+0.001
Neg logit contributions
urance-0.002
Greg-0.002
Jennifer-0.002
inger-0.002
ady-0.002
Token up
Token ID510
Feature activation+9.48e-02

Pos logit contributions
urance+0.131
Greg+0.126
Jennifer+0.125
inger+0.123
ady+0.122
Neg logit contributions
 Sp-0.063
 cuc-0.060
 o-0.056
ie-0.055
 Chick-0.054
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
urance+0.200
Greg+0.191
Jennifer+0.190
inger+0.187
ady+0.185
Neg logit contributions
 Sp-0.107
 cuc-0.101
 o-0.095
ie-0.093
 Chick-0.091
Token ran
Token ID4966
Feature activation+6.03e-02

Pos logit contributions
urance+0.026
Greg+0.025
Jennifer+0.025
Jerry+0.024
inger+0.024
Neg logit contributions
 Sp-0.022
 cuc-0.021
ie-0.020
 o-0.020
 Chick-0.020
Token in
Token ID287
Feature activation+3.78e-02

Pos logit contributions
urance+0.148
Jennifer+0.147
ady+0.141
inger+0.140
Greg+0.138
Neg logit contributions
 Sp-0.050
 if-0.047
 o-0.045
 he-0.042
 even-0.042
Token different
Token ID1180
Feature activation+5.52e-02

Pos logit contributions
urance+0.094
inger+0.092
Jennifer+0.091
Greg+0.089
agging+0.088
Neg logit contributions
 Sp-0.032
 if-0.029
 o-0.028
 cuc-0.027
 later-0.023
Token directions
Token ID11678
Feature activation+2.46e-01

Pos logit contributions
urance+0.070
Greg+0.066
Jennifer+0.065
inger+0.063
Jerry+0.062
Neg logit contributions
 Sp-0.110
 cuc-0.108
 o-0.107
ie-0.106
 if-0.103
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
urance+0.593
Jennifer+0.592
Greg+0.586
inger+0.577
Jerry+0.572
Neg logit contributions
 Sp-0.199
 if-0.183
 o-0.172
ie-0.171
 cuc-0.163
Token
Token ID220
Feature activation+2.48e-02

Pos logit contributions
urance+0.032
inger+0.031
Jennifer+0.030
agging+0.030
Greg+0.030
Neg logit contributions
 Sp-0.026
 cuc-0.022
 Chick-0.022
 o-0.021
 if-0.021
Token
Token ID198
Feature activation+2.09e-03

Pos logit contributions
urance+0.053
Jennifer+0.050
Greg+0.050
inger+0.049
ady+0.048
Neg logit contributions
 Sp-0.028
 cuc-0.025
 o-0.024
 if-0.024
 Chick-0.023
Token
Token ID198
Feature activation+2.59e-03

Pos logit contributions
urance+0.004
Jennifer+0.004
Greg+0.004
inger+0.004
ady+0.004
Neg logit contributions
 Sp-0.003
 cuc-0.002
 o-0.002
 Chick-0.002
ie-0.002
TokenAfter
Token ID3260
Feature activation+4.27e-02

Pos logit contributions
urance+0.004
inger+0.004
Jennifer+0.004
Greg+0.004
ady+0.004
Neg logit contributions
 Sp-0.004
 cuc-0.004
 o-0.004
 Chick-0.004
ie-0.004
Token that
Token ID326
Feature activation+1.25e-02

Pos logit contributions
urance+0.065
Greg+0.062
ady+0.061
inger+0.061
agging+0.060
Neg logit contributions
 Sp-0.048
 cuc-0.047
 Chick-0.042
 o-0.042
ie-0.041
Token stuck
Token ID7819
Feature activation-7.72e-02

Pos logit contributions
Greg+0.025
urance+0.025
Jennifer+0.024
Jerry+0.024
ady+0.023
Neg logit contributions
 cuc-0.013
 Sp-0.012
ie-0.012
 Chick-0.011
 o-0.011
Token out
Token ID503
Feature activation+3.06e-02

Pos logit contributions
 Sp+0.078
 cuc+0.073
 o+0.069
ie+0.066
 Chick+0.065
Neg logit contributions
urance-0.172
Greg-0.164
Jennifer-0.164
inger-0.161
ady-0.160
Token in
Token ID287
Feature activation+3.80e-02

Pos logit contributions
urance+0.069
Greg+0.066
ady+0.065
Jennifer+0.064
inger+0.064
Neg logit contributions
 cuc-0.028
 Sp-0.028
 o-0.024
 Chick-0.024
ie-0.023
Token all
Token ID477
Feature activation+3.74e-03

Pos logit contributions
urance+0.085
Greg+0.081
Jennifer+0.081
inger+0.080
ady+0.079
Neg logit contributions
 Sp-0.039
 cuc-0.036
 o-0.034
ie-0.033
 Chick-0.032
Token directions
Token ID11678
Feature activation+2.43e-01

Pos logit contributions
urance+0.008
Greg+0.008
Jennifer+0.008
inger+0.008
ady+0.008
Neg logit contributions
 Sp-0.004
 cuc-0.004
 o-0.004
ie-0.003
 Chick-0.003
Token and
Token ID290
Feature activation+7.32e-03

Pos logit contributions
urance+0.515
Greg+0.490
Jennifer+0.487
inger+0.481
ady+0.478
Neg logit contributions
 Sp-0.272
 cuc-0.257
 o-0.242
ie-0.234
 Chick-0.232
Token an
Token ID281
Feature activation-2.53e-02

Pos logit contributions
urance+0.014
Greg+0.013
Jennifer+0.013
ady+0.013
Jerry+0.013
Neg logit contributions
 cuc-0.009
 Sp-0.008
 Chick-0.008
 o-0.008
ie-0.007
Token old
Token ID1468
Feature activation+9.11e-02

Pos logit contributions
 Sp+0.044
 cuc+0.043
 o+0.040
 Chick+0.040
ie+0.040
Neg logit contributions
urance-0.038
Greg-0.035
Jennifer-0.035
ady-0.034
inger-0.034
Token straw
Token ID14787
Feature activation-1.63e-02

Pos logit contributions
urance+0.145
Jennifer+0.137
Greg+0.137
inger+0.134
ady+0.133
Neg logit contributions
 Sp-0.148
 cuc-0.138
 Chick-0.134
ie-0.132
 o-0.129
Token hat
Token ID6877
Feature activation-5.28e-02

Pos logit contributions
 Sp+0.020
 cuc+0.019
 o+0.017
 Chick+0.017
ie+0.017
Neg logit contributions
urance-0.033
Greg-0.031
inger-0.031
ady-0.030
Jennifer-0.030
Token lady
Token ID10846
Feature activation+3.70e-02

Pos logit contributions
urance+0.156
Greg+0.147
Jennifer+0.147
inger+0.144
ady+0.142
Neg logit contributions
 Sp-0.144
 cuc-0.138
 o-0.133
ie-0.130
 Chick-0.129
Tokenâ
Token ID22940
Feature activation-8.26e-02

Pos logit contributions
urance+0.078
Greg+0.074
Jennifer+0.074
inger+0.073
Jerry+0.072
Neg logit contributions
 Sp-0.042
 cuc-0.040
 o-0.038
ie-0.037
 Chick-0.036
Token
Token ID26391
Feature activation+1.31e-02

Pos logit contributions
 Sp+0.130
ie+0.127
 o+0.125
 please+0.118
 later+0.116
Neg logit contributions
 fists-0.118
 appearances-0.112
Greg-0.110
Jennifer-0.110
 supposed-0.108
Token
Token ID8151
Feature activation-6.22e-02

Pos logit contributions
urance+0.022
ags+0.021
agging+0.021
ady+0.021
Greg+0.021
Neg logit contributions
 Sp-0.020
 Chick-0.017
 if-0.017
 o-0.016
 he-0.016
Tokens
Token ID82
Feature activation+1.49e-02

Pos logit contributions
 Sp+0.067
ie+0.061
 if+0.060
 o+0.057
 cuc+0.056
Neg logit contributions
urance-0.136
Jennifer-0.128
Greg-0.126
inger-0.124
ady-0.123
Token directions
Token ID11678
Feature activation+2.43e-01

Pos logit contributions
urance+0.032
Jennifer+0.032
Greg+0.031
Jerry+0.031
ady+0.030
Neg logit contributions
 Sp-0.016
 o-0.015
 if-0.015
 cuc-0.014
ie-0.014
Token and
Token ID290
Feature activation+1.44e-02

Pos logit contributions
urance+0.568
Jennifer+0.562
Greg+0.561
inger+0.549
Jerry+0.548
Neg logit contributions
 Sp-0.208
 if-0.190
 o-0.188
ie-0.184
 cuc-0.184
Token soon
Token ID2582
Feature activation-7.64e-02

Pos logit contributions
Greg+0.026
Jennifer+0.025
urance+0.025
inger+0.024
Jerry+0.024
Neg logit contributions
 Sp-0.019
 cuc-0.018
ie-0.017
 o-0.017
ware-0.016
Token he
Token ID339
Feature activation-9.69e-02

Pos logit contributions
 Sp+0.074
 cuc+0.070
 o+0.065
ie+0.063
 Chick+0.062
Neg logit contributions
urance-0.172
Greg-0.166
Jennifer-0.165
inger-0.162
Jerry-0.160
Token found
Token ID1043
Feature activation-6.57e-02

Pos logit contributions
 Sp+0.130
 cuc+0.125
 o+0.117
 Chick+0.115
ie+0.114
Neg logit contributions
urance-0.182
Greg-0.176
Jennifer-0.174
inger-0.171
ady-0.169
Token his
Token ID465
Feature activation+1.54e-03

Pos logit contributions
 Sp+0.067
 cuc+0.063
 o+0.059
ie+0.058
 Chick+0.056
Neg logit contributions
urance-0.144
Greg-0.139
Jennifer-0.138
inger-0.136
ady-0.135
Token shone
Token ID44193
Feature activation-1.27e-01

Pos logit contributions
ady+0.014
Jennifer+0.013
Greg+0.013
urance+0.013
inger+0.013
Neg logit contributions
 cuc-0.015
 Sp-0.015
ectar-0.014
 Chick-0.013
 please-0.013
Token the
Token ID262
Feature activation+4.09e-04

Pos logit contributions
 cuc+0.211
ware+0.197
 Sp+0.192
 chef+0.183
atmeal+0.178
Neg logit contributions
Jennifer-0.168
ags-0.167
 directions-0.167
agging-0.163
Greg-0.160
Token flashlight
Token ID38371
Feature activation+4.09e-02

Pos logit contributions
urance+0.001
Greg+0.001
Jennifer+0.001
inger+0.001
ady+0.001
Neg logit contributions
 Sp-0.001
 cuc-0.001
 o-0.001
ie-0.001
 Chick-0.001
Token in
Token ID287
Feature activation+3.97e-02

Pos logit contributions
urance+0.077
Jennifer+0.072
ady+0.071
Greg+0.071
inger+0.071
Neg logit contributions
 Sp-0.055
 cuc-0.052
 Chick-0.047
 o-0.047
ie-0.045
Token all
Token ID477
Feature activation+7.21e-03

Pos logit contributions
urance+0.087
Jennifer+0.083
Greg+0.083
inger+0.082
ady+0.081
Neg logit contributions
 Sp-0.041
 cuc-0.039
 o-0.036
ie-0.036
 Chick-0.035
Token directions
Token ID11678
Feature activation+2.43e-01

Pos logit contributions
urance+0.014
Jennifer+0.014
Greg+0.014
inger+0.014
ady+0.014
Neg logit contributions
 Sp-0.009
 cuc-0.008
 o-0.008
 Chick-0.008
ie-0.007
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
urance+0.538
Greg+0.521
Jennifer+0.518
inger+0.508
Jerry+0.503
Neg logit contributions
 Sp-0.248
 cuc-0.228
 o-0.222
ie-0.216
 if-0.208
Token saw
Token ID2497
Feature activation-2.87e-02

Pos logit contributions
Jennifer+0.019
urance+0.019
 way+0.018
Greg+0.018
 directions+0.018
Neg logit contributions
 cuc-0.016
 Sp-0.015
 please-0.014
 Chick-0.014
?"-0.014
Token something
Token ID1223
Feature activation-4.01e-02

Pos logit contributions
 cuc+0.041
 Sp+0.038
 please+0.037
ie+0.036
ware+0.036
Neg logit contributions
Greg-0.044
urance-0.044
Jennifer-0.044
ady-0.043
 courage-0.042
Token shiny
Token ID22441
Feature activation+1.86e-02

Pos logit contributions
 Sp+0.059
 cuc+0.056
 Chick+0.053
ie+0.052
 o+0.051
Neg logit contributions
urance-0.067
Greg-0.066
Jennifer-0.064
ady-0.063
Jerry-0.062
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
urance+0.035
Greg+0.034
Jennifer+0.033
inger+0.033
ady+0.033
Neg logit contributions
 Sp-0.024
 cuc-0.022
 Chick-0.021
 o-0.020
ie-0.020
Token the
Token ID262
Feature activation+2.20e-03

Pos logit contributions
urance+0.035
Greg+0.034
Jennifer+0.034
Jerry+0.033
ady+0.032
Neg logit contributions
 cuc-0.019
 Sp-0.019
 Chick-0.017
 o-0.017
ie-0.017
Token cricket
Token ID18836
Feature activation-1.06e-01

Pos logit contributions
urance+0.004
Greg+0.003
Jennifer+0.003
inger+0.003
ady+0.003
Neg logit contributions
 Sp-0.004
 cuc-0.003
 o-0.003
ie-0.003
 Chick-0.003
Token ball
Token ID2613
Feature activation+2.04e-02

Pos logit contributions
 Sp+0.139
 Chick+0.134
 cuc+0.133
ie+0.121
 o+0.118
Neg logit contributions
urance-0.188
ady-0.173
 pace-0.173
agging-0.171
inger-0.170
Token in
Token ID287
Feature activation+3.66e-02

Pos logit contributions
urance+0.035
agging+0.033
inger+0.033
 pace+0.033
Greg+0.033
Neg logit contributions
 Sp-0.027
 cuc-0.027
 Chick-0.026
 o-0.024
ie-0.023
Token all
Token ID477
Feature activation+5.06e-03

Pos logit contributions
urance+0.088
Jennifer+0.084
inger+0.084
Greg+0.084
ady+0.081
Neg logit contributions
 Sp-0.033
 o-0.029
 cuc-0.029
 if-0.027
ie-0.026
Token directions
Token ID11678
Feature activation+2.43e-01

Pos logit contributions
urance+0.011
Jennifer+0.010
Greg+0.010
inger+0.010
Jerry+0.010
Neg logit contributions
 Sp-0.006
ie-0.005
 o-0.005
 cuc-0.005
 if-0.005
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
urance+0.527
Jennifer+0.506
Greg+0.506
inger+0.497
ady+0.491
Neg logit contributions
 Sp-0.257
 cuc-0.239
 o-0.229
ie-0.223
 Chick-0.215
Token His
Token ID2399
Feature activation+6.24e-02

Pos logit contributions
urance+0.022
Greg+0.021
Jennifer+0.021
Jerry+0.020
ady+0.020
Neg logit contributions
 Sp-0.019
 cuc-0.018
 o-0.018
ie-0.018
 Chick-0.017
Token friends
Token ID2460
Feature activation+3.70e-02

Pos logit contributions
urance+0.129
Greg+0.124
Jennifer+0.124
inger+0.121
ady+0.121
Neg logit contributions
 Sp-0.074
 o-0.069
 cuc-0.064
ie-0.063
 fro-0.063
Token would
Token ID561
Feature activation-5.15e-02

Pos logit contributions
urance+0.073
inger+0.070
Jennifer+0.070
Greg+0.069
ady+0.068
Neg logit contributions
 Sp-0.046
 cuc-0.043
 o-0.043
ie-0.043
 if-0.042
Token cheer
Token ID14042
Feature activation+1.94e-02

Pos logit contributions
 Sp+0.072
 o+0.071
 cuc+0.069
 if+0.066
ie+0.065
Neg logit contributions
urance-0.092
Jennifer-0.089
Greg-0.089
inger-0.088
ady-0.087
Token brush
Token ID14093
Feature activation+2.71e-02

Pos logit contributions
 Sp+0.025
 cuc+0.024
 o+0.023
 Chick+0.022
ie+0.022
Neg logit contributions
urance-0.028
Greg-0.026
Jennifer-0.026
ady-0.025
inger-0.025
Token his
Token ID465
Feature activation+1.92e-03

Pos logit contributions
Greg+0.053
urance+0.052
Jennifer+0.050
ady+0.050
inger+0.048
Neg logit contributions
 cuc-0.030
 Sp-0.030
ie-0.028
 o-0.027
 Chick-0.027
Token hair
Token ID4190
Feature activation+3.48e-02

Pos logit contributions
urance+0.002
ady+0.002
Jennifer+0.002
inger+0.002
Greg+0.002
Neg logit contributions
 Sp-0.004
 cuc-0.004
ie-0.004
 Chick-0.004
 if-0.004
Token in
Token ID287
Feature activation+3.97e-02

Pos logit contributions
urance+0.060
inger+0.056
ady+0.056
Greg+0.056
agging+0.055
Neg logit contributions
 Sp-0.050
 cuc-0.047
 Chick-0.046
ie-0.043
 o-0.041
Token different
Token ID1180
Feature activation+5.05e-02

Pos logit contributions
urance+0.090
inger+0.088
Jennifer+0.087
Greg+0.085
Jerry+0.085
Neg logit contributions
 Sp-0.039
 cuc-0.037
 o-0.036
 if-0.035
ie-0.032
Token directions
Token ID11678
Feature activation+2.41e-01

Pos logit contributions
urance+0.069
Greg+0.064
Jennifer+0.063
inger+0.062
ady+0.061
Neg logit contributions
 Sp-0.095
 cuc-0.092
 o-0.089
ie-0.087
 Chick-0.087
Token with
Token ID351
Feature activation+2.98e-02

Pos logit contributions
urance+0.546
Jennifer+0.539
Greg+0.535
inger+0.525
Jerry+0.521
Neg logit contributions
 Sp-0.226
 cuc-0.209
 o-0.205
ie-0.201
 if-0.197
Token both
Token ID1111
Feature activation+7.93e-02

Pos logit contributions
urance+0.066
Jennifer+0.063
Greg+0.063
inger+0.063
ady+0.062
Neg logit contributions
 Sp-0.030
 cuc-0.028
 o-0.027
ie-0.026
 Chick-0.025
Token com
Token ID401
Feature activation-1.16e-01

Pos logit contributions
urance+0.149
Greg+0.141
Jennifer+0.141
inger+0.138
ady+0.137
Neg logit contributions
 Sp-0.108
 cuc-0.103
 o-0.099
ie-0.096
 Chick-0.095
Tokenbs
Token ID1443
Feature activation-5.64e-02

Pos logit contributions
 Sp+0.153
 cuc+0.151
ie+0.147
 o+0.145
 Chick+0.136
Neg logit contributions
urance-0.219
Greg-0.212
Jennifer-0.210
Jerry-0.203
inger-0.198
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
 Sp+0.063
 cuc+0.060
 o+0.057
ie+0.055
 Chick+0.054
Neg logit contributions
urance-0.119
Greg-0.113
Jennifer-0.112
inger-0.111
ady-0.110
Token heard
Token ID2982
Feature activation+9.63e-03

Pos logit contributions
 Sp+0.021
 cuc+0.020
 Chick+0.018
ie+0.018
 o+0.018
Neg logit contributions
urance-0.025
Greg-0.025
Jennifer-0.025
ady-0.024
inger-0.024
Token lots
Token ID6041
Feature activation-1.54e-02

Pos logit contributions
urance+0.019
Greg+0.019
Jennifer+0.018
Jerry+0.018
ady+0.018
Neg logit contributions
 Sp-0.011
 cuc-0.011
ie-0.010
 Chick-0.010
 o-0.009
Token of
Token ID286
Feature activation-5.78e-02

Pos logit contributions
 Sp+0.011
 cuc+0.010
 o+0.009
ie+0.008
 Chick+0.008
Neg logit contributions
urance-0.039
Greg-0.038
Jennifer-0.038
inger-0.037
ady-0.037
Token rules
Token ID3173
Feature activation+1.57e-01

Pos logit contributions
 Sp+0.086
 cuc+0.081
 o+0.081
ie+0.077
 if+0.075
Neg logit contributions
urance-0.102
inger-0.097
Jennifer-0.096
Greg-0.096
Jerry-0.093
Token and
Token ID290
Feature activation+1.42e-02

Pos logit contributions
urance+0.362
Jennifer+0.353
inger+0.351
Greg+0.348
Jerry+0.345
Neg logit contributions
 Sp-0.146
 o-0.132
 if-0.131
 cuc-0.125
ie-0.125
Token directions
Token ID11678
Feature activation+2.40e-01

Pos logit contributions
urance+0.026
Greg+0.025
Jennifer+0.025
Jerry+0.024
ady+0.024
Neg logit contributions
 Sp-0.020
 cuc-0.019
 o-0.018
 Chick-0.017
ie-0.017
Token.
Token ID13
Feature activation+8.22e-03

Pos logit contributions
urance+0.511
Greg+0.487
Jennifer+0.484
inger+0.477
ady+0.473
Neg logit contributions
 Sp-0.265
 cuc-0.252
 o-0.236
ie-0.228
 Chick-0.226
Token This
Token ID770
Feature activation+1.74e-02

Pos logit contributions
urance+0.016
Jennifer+0.016
inger+0.016
Greg+0.015
agging+0.015
Neg logit contributions
 Sp-0.011
 cuc-0.010
 o-0.009
 if-0.009
 Chick-0.009
Token made
Token ID925
Feature activation-7.47e-02

Pos logit contributions
urance+0.030
inger+0.029
Greg+0.028
Jennifer+0.028
Jerry+0.028
Neg logit contributions
 Sp-0.026
 o-0.025
 cuc-0.025
ie-0.024
 if-0.024
Token him
Token ID683
Feature activation-2.00e-02

Pos logit contributions
 Sp+0.102
 cuc+0.098
 o+0.093
ie+0.091
 Chick+0.090
Neg logit contributions
urance-0.140
Greg-0.133
Jennifer-0.132
inger-0.130
ady-0.128
Token more
Token ID517
Feature activation-3.90e-02

Pos logit contributions
 Sp+0.028
 cuc+0.028
 Chick+0.025
ie+0.025
 o+0.025
Neg logit contributions
urance-0.037
Greg-0.034
Jennifer-0.034
ady-0.033
inger-0.033
Token pulled
Token ID5954
Feature activation+3.30e-02

Pos logit contributions
 Sp+0.010
 cuc+0.010
 Chick+0.009
 o+0.009
ie+0.009
Neg logit contributions
Greg-0.009
urance-0.009
Jennifer-0.009
Jerry-0.009
ady-0.009
Token the
Token ID262
Feature activation-2.63e-03

Pos logit contributions
urance+0.077
Jennifer+0.074
inger+0.073
Greg+0.073
Jerry+0.072
Neg logit contributions
 Sp-0.032
 cuc-0.028
 o-0.028
 if-0.027
ie-0.027
Token sack
Token ID23704
Feature activation+3.28e-02

Pos logit contributions
 cuc+0.003
 cel+0.003
 he+0.003
 o+0.003
 if+0.003
Neg logit contributions
urance-0.006
Alice-0.005
Elizabeth-0.005
aren-0.005
Greg-0.005
Token in
Token ID287
Feature activation+3.68e-02

Pos logit contributions
urance+0.073
Greg+0.070
Jennifer+0.070
inger+0.068
ady+0.068
Neg logit contributions
 Sp-0.033
 cuc-0.031
 o-0.030
ie-0.030
 if-0.029
Token different
Token ID1180
Feature activation+4.84e-02

Pos logit contributions
urance+0.092
inger+0.091
agging+0.091
Jennifer+0.088
Jerry+0.087
Neg logit contributions
 Sp-0.030
 if-0.030
 later-0.025
 o-0.024
 M-0.024
Token directions
Token ID11678
Feature activation+2.40e-01

Pos logit contributions
urance+0.090
Greg+0.089
inger+0.086
Elizabeth+0.085
Jerry+0.084
Neg logit contributions
ie-0.070
 cuc-0.067
 o-0.067
 Sp-0.065
 if-0.065
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
urance+0.557
Jennifer+0.554
Greg+0.551
inger+0.541
Jerry+0.536
Neg logit contributions
 Sp-0.215
 if-0.189
 o-0.189
ie-0.188
 cuc-0.187
Token The
Token ID383
Feature activation+1.07e-02

Pos logit contributions
urance+0.025
inger+0.024
Jennifer+0.024
ady+0.024
Greg+0.024
Neg logit contributions
 Sp-0.015
 cuc-0.013
 Chick-0.012
 o-0.012
 if-0.012
Token children
Token ID1751
Feature activation+8.85e-02

Pos logit contributions
urance+0.023
inger+0.023
ady+0.022
Elizabeth+0.022
anne+0.022
Neg logit contributions
 cuc-0.012
 cel-0.012
 o-0.011
 se-0.011
 he-0.011
Token changed
Token ID3421
Feature activation+4.86e-02

Pos logit contributions
urance+0.174
Jennifer+0.165
Greg+0.165
inger+0.164
ady+0.160
Neg logit contributions
 Sp-0.113
 cuc-0.107
 o-0.104
ie-0.102
 Chick-0.098
Token the
Token ID262
Feature activation-2.09e-03

Pos logit contributions
urance+0.110
Greg+0.106
Jennifer+0.106
inger+0.105
Jerry+0.104
Neg logit contributions
 Sp-0.050
 cuc-0.045
 o-0.044
ie-0.042
 if-0.041
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
 Sp+0.048
 if+0.044
ie+0.042
 o+0.042
 cuc+0.040
Neg logit contributions
urance-0.125
Greg-0.121
Jennifer-0.121
inger-0.121
ady-0.119
Token went
Token ID1816
Feature activation-4.79e-02

Pos logit contributions
urance+0.021
Greg+0.020
Jennifer+0.020
inger+0.020
ady+0.020
Neg logit contributions
 Sp-0.013
 cuc-0.012
 o-0.011
ie-0.011
 if-0.011
Token in
Token ID287
Feature activation+3.96e-02

Pos logit contributions
 o+0.037
 Sp+0.034
 if+0.033
 he+0.033
 for+0.032
Neg logit contributions
urance-0.121
Jennifer-0.115
Jerry-0.112
ady-0.110
inger-0.109
Token the
Token ID262
Feature activation+1.19e-03

Pos logit contributions
urance+0.089
inger+0.088
Jennifer+0.086
Greg+0.085
Jerry+0.085
Neg logit contributions
 Sp-0.041
 if-0.036
 o-0.036
 cuc-0.034
 Chick-0.031
Token different
Token ID1180
Feature activation+4.97e-02

Pos logit contributions
urance+0.002
inger+0.002
Greg+0.002
oys+0.002
lynn+0.002
Neg logit contributions
 o-0.002
 longer-0.001
 even-0.001
 he-0.001
ie-0.001
Token directions
Token ID11678
Feature activation+2.40e-01

Pos logit contributions
urance+0.069
Greg+0.068
Jennifer+0.066
Jerry+0.065
inger+0.063
Neg logit contributions
ie-0.095
 o-0.094
 if-0.092
 even-0.087
 later-0.085
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
urance+0.563
Jennifer+0.553
Greg+0.548
inger+0.539
Jerry+0.535
Neg logit contributions
 Sp-0.209
ie-0.188
 if-0.187
 o-0.187
 cuc-0.180
Token
Token ID198
Feature activation+6.85e-04

Pos logit contributions
urance+0.027
inger+0.026
agging+0.026
Jennifer+0.025
ady+0.025
Neg logit contributions
 Sp-0.017
 Chick-0.014
 cuc-0.014
 if-0.013
 Will-0.013
Token
Token ID198
Feature activation+9.00e-04

Pos logit contributions
urance+0.001
Jennifer+0.001
Greg+0.001
inger+0.001
ady+0.001
Neg logit contributions
 Sp-0.001
 cuc-0.001
 o-0.001
ie-0.001
 Chick-0.001
TokenThe
Token ID464
Feature activation+2.52e-02

Pos logit contributions
urance+0.001
inger+0.001
Jennifer+0.001
Greg+0.001
ady+0.001
Neg logit contributions
 Sp-0.001
 cuc-0.001
 o-0.001
 Chick-0.001
ie-0.001
Token little
Token ID1310
Feature activation+6.23e-02

Pos logit contributions
urance+0.053
inger+0.052
Jennifer+0.050
ady+0.050
Greg+0.050
Neg logit contributions
 Sp-0.030
 o-0.029
 cuc-0.028
ie-0.026
 if-0.026
Token big
Token ID1263
Feature activation+1.67e-02

Pos logit contributions
 Sp+0.037
 Chick+0.031
 Jos+0.031
 Isabel+0.030
ie+0.030
Neg logit contributions
 way-0.029
 lookout-0.028
 leash-0.026
Jennifer-0.025
urance-0.024
Token salad
Token ID20698
Feature activation-1.33e-01

Pos logit contributions
urance+0.027
 way+0.027
 lookout+0.025
 pace+0.025
 leash+0.025
Neg logit contributions
 Sp-0.024
bel-0.023
 Chick-0.021
ba-0.020
 Isabel-0.020
Token with
Token ID351
Feature activation+2.29e-02

Pos logit contributions
 Sp+0.211
 Does+0.187
 Chick+0.187
 Will+0.185
 fro+0.177
Neg logit contributions
 way-0.190
inger-0.184
urance-0.177
 leash-0.174
oys-0.174
Token carrots
Token ID34397
Feature activation-7.48e-02

Pos logit contributions
urance+0.038
ady+0.036
Jennifer+0.035
Jerry+0.035
 supposed+0.034
Neg logit contributions
 Sp-0.031
ie-0.027
 fro-0.027
bel-0.026
 Chick-0.026
Token,
Token ID11
Feature activation+2.95e-04

Pos logit contributions
 Sp+0.112
 cuc+0.096
 Chick+0.095
 fro+0.092
 o+0.090
Neg logit contributions
urance-0.125
inger-0.122
ady-0.120
Jennifer-0.115
Greg-0.114
Token cuc
Token ID38421
Feature activation-2.96e-01

Pos logit contributions
urance+0.000
Jennifer+0.000
Jerry+0.000
ady+0.000
 way+0.000
Neg logit contributions
 Sp-0.000
ie-0.000
bel-0.000
ectar-0.000
 Isabel-0.000
Tokenumbers
Token ID17024
Feature activation-1.73e-01

Pos logit contributions
 Sp+0.467
 o+0.391
 Chick+0.385
 M+0.380
ie+0.368
Neg logit contributions
ady-0.541
inger-0.526
agging-0.516
Greg-0.503
ags-0.490
Token,
Token ID11
Feature activation-3.34e-04

Pos logit contributions
 Sp+0.225
 cuc+0.202
 Chick+0.190
 o+0.188
ie+0.183
Neg logit contributions
urance-0.326
inger-0.314
ady-0.308
Greg-0.299
Jennifer-0.298
Token and
Token ID290
Feature activation+1.80e-03

Pos logit contributions
 Sp+0.001
ie+0.000
 Chick+0.000
 Isabel+0.000
bel+0.000
Neg logit contributions
urance-0.000
ady-0.000
Jennifer-0.000
 fore-0.000
 way-0.000
Token tomatoes
Token ID23972
Feature activation-8.53e-02

Pos logit contributions
 way+0.003
 respect+0.002
 courage+0.002
urance+0.002
Jennifer+0.002
Neg logit contributions
 Sp-0.002
ie-0.002
ectar-0.002
ry-0.002
 Isabel-0.002
Token.
Token ID13
Feature activation+3.84e-03

Pos logit contributions
 Sp+0.127
 Chick+0.108
 fro+0.105
 cuc+0.104
ie+0.102
Neg logit contributions
urance-0.142
inger-0.137
ady-0.133
oys-0.127
agging-0.127
Token house
Token ID2156
Feature activation-3.44e-02

Pos logit contributions
 Sp+0.011
 Chick+0.011
ie+0.011
 if+0.010
 cuc+0.010
Neg logit contributions
urance-0.008
Jennifer-0.007
Greg-0.007
 courtroom-0.007
 street-0.007
Token.
Token ID13
Feature activation+3.99e-04

Pos logit contributions
 Sp+0.046
 cuc+0.046
 Chick+0.043
ie+0.041
 o+0.041
Neg logit contributions
urance-0.063
ady-0.056
inger-0.056
Jennifer-0.056
Greg-0.055
Token They
Token ID1119
Feature activation+6.20e-02

Pos logit contributions
urance+0.001
Greg+0.001
Jennifer+0.001
inger+0.001
ady+0.001
Neg logit contributions
 Sp-0.001
 cuc-0.001
 o-0.001
 Chick-0.001
 if-0.001
Token wash
Token ID13502
Feature activation-2.23e-02

Pos logit contributions
urance+0.093
Jennifer+0.086
Greg+0.085
ady+0.081
inger+0.081
Neg logit contributions
 Sp-0.103
ie-0.098
 cuc-0.097
 Chick-0.095
ware-0.091
Token the
Token ID262
Feature activation-5.61e-03

Pos logit contributions
 Sp+0.031
 cuc+0.030
 o+0.028
ie+0.027
 Chick+0.026
Neg logit contributions
urance-0.041
Greg-0.039
Jennifer-0.038
ady-0.038
inger-0.038
Token cuc
Token ID38421
Feature activation-2.95e-01

Pos logit contributions
 Sp+0.007
 Chick+0.006
ie+0.006
 if+0.006
 Will+0.006
Neg logit contributions
urance-0.010
ady-0.010
inger-0.010
Jennifer-0.010
Greg-0.010
Tokenumber
Token ID4494
Feature activation-1.65e-01

Pos logit contributions
 Sp+0.552
 Chick+0.483
 M+0.481
 Will+0.479
ie+0.463
Neg logit contributions
inger-0.436
ady-0.433
tted-0.408
ags-0.406
agging-0.400
Token and
Token ID290
Feature activation+1.86e-03

Pos logit contributions
 Sp+0.207
 cuc+0.196
 o+0.176
 Chick+0.174
ie+0.174
Neg logit contributions
urance-0.321
inger-0.305
ady-0.303
Greg-0.297
Jennifer-0.296
Token cut
Token ID2005
Feature activation-1.02e-01

Pos logit contributions
urance+0.003
Jennifer+0.002
 street+0.002
 respect+0.002
Greg+0.002
Neg logit contributions
ie-0.003
 Sp-0.003
ectar-0.003
 cuc-0.003
ware-0.003
Token it
Token ID340
Feature activation-1.56e-02

Pos logit contributions
 Sp+0.120
 cuc+0.118
ie+0.108
 please+0.107
 o+0.105
Neg logit contributions
inger-0.184
urance-0.184
ady-0.183
Greg-0.183
oys-0.173
Token into
Token ID656
Feature activation-9.89e-03

Pos logit contributions
 Sp+0.022
 cuc+0.022
 fro+0.020
ie+0.020
 Chick+0.020
Neg logit contributions
urance-0.024
tted-0.022
inger-0.022
 way-0.022
ady-0.021
Token thank
Token ID5875
Feature activation-3.11e-02

Pos logit contributions
urance+0.004
Jennifer+0.004
oys+0.004
 respect+0.003
 fore+0.003
Neg logit contributions
ie-0.003
bel-0.003
 Sp-0.003
ware-0.002
 cuc-0.002
Token you
Token ID345
Feature activation+6.66e-03

Pos logit contributions
 cuc+0.041
ware+0.041
 Sp+0.038
 o+0.035
bel+0.035
Neg logit contributions
Jennifer-0.056
urance-0.055
Greg-0.053
Jerry-0.053
inger-0.051
Token.
Token ID13
Feature activation+3.53e-03

Pos logit contributions
urance+0.011
oys+0.010
ady+0.009
inger+0.009
Jennifer+0.009
Neg logit contributions
ware-0.011
 cuc-0.010
 Sp-0.010
 Chick-0.009
bel-0.009
Token We
Token ID775
Feature activation+1.83e-03

Pos logit contributions
urance+0.006
Greg+0.006
Jennifer+0.006
inger+0.005
ady+0.005
Neg logit contributions
 Sp-0.005
 cuc-0.005
 o-0.005
ie-0.005
 Chick-0.005
Token love
Token ID1842
Feature activation+3.50e-02

Pos logit contributions
urance+0.003
Greg+0.003
Jennifer+0.003
inger+0.003
ady+0.003
Neg logit contributions
 Sp-0.003
 cuc-0.003
 o-0.003
ie-0.003
 Chick-0.003
Token cuc
Token ID38421
Feature activation-2.95e-01

Pos logit contributions
 supposed+0.047
 respect+0.046
urance+0.046
oys+0.046
Jerry+0.045
Neg logit contributions
bel-0.052
ware-0.049
 Isabel-0.046
 Sp-0.045
ie-0.045
Tokenumbers
Token ID17024
Feature activation-1.73e-01

Pos logit contributions
 Sp+0.502
 Chick+0.447
 M+0.437
 o+0.427
 Will+0.412
Neg logit contributions
ady-0.481
inger-0.475
Greg-0.468
agging-0.461
ags-0.449
Token.
Token ID13
Feature activation+1.83e-03

Pos logit contributions
 Sp+0.257
 cuc+0.249
 Chick+0.235
 fro+0.220
ware+0.217
Neg logit contributions
urance-0.264
ady-0.262
oys-0.254
inger-0.253
agging-0.249
Token And
Token ID843
Feature activation-4.74e-02

Pos logit contributions
urance+0.003
Greg+0.003
Jennifer+0.003
ady+0.002
Jerry+0.002
Neg logit contributions
ie-0.003
 cuc-0.003
 o-0.003
 Sp-0.003
ware-0.003
Token we
Token ID356
Feature activation+6.37e-04

Pos logit contributions
ie+0.070
ware+0.062
berry+0.056
 Sp+0.054
 cuc+0.054
Neg logit contributions
urance-0.074
oys-0.074
 respect-0.072
 courage-0.069
 way-0.068
Token love
Token ID1842
Feature activation+3.47e-02

Pos logit contributions
urance+0.001
Greg+0.001
Jennifer+0.001
ady+0.001
inger+0.001
Neg logit contributions
 Sp-0.001
 cuc-0.001
ie-0.001
 Chick-0.001
 o-0.001
Token the
Token ID262
Feature activation-6.55e-03

Pos logit contributions
urance+0.073
Jennifer+0.070
Greg+0.069
Jerry+0.068
ady+0.067
Neg logit contributions
 Sp-0.043
 cuc-0.039
ie-0.039
 Chick-0.037
 o-0.037
Token bowl
Token ID9396
Feature activation-1.53e-01

Pos logit contributions
 Sp+0.009
 Chick+0.008
ie+0.008
 if+0.007
 Will+0.007
Neg logit contributions
urance-0.012
Jennifer-0.011
ady-0.011
inger-0.011
 lookout-0.011
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
 Sp+0.238
 cuc+0.216
 Chick+0.214
 Will+0.203
 fro+0.192
Neg logit contributions
ady-0.249
inger-0.238
urance-0.228
oys-0.222
keeping-0.219
Token Lucy
Token ID22162
Feature activation+7.44e-02

Pos logit contributions
urance+0.019
Greg+0.018
Jennifer+0.017
inger+0.017
ady+0.017
Neg logit contributions
 Sp-0.016
 cuc-0.016
 o-0.015
 Chick-0.015
ie-0.015
Token put
Token ID1234
Feature activation-5.06e-02

Pos logit contributions
urance+0.117
ady+0.116
Jennifer+0.113
Greg+0.112
inger+0.109
Neg logit contributions
 Sp-0.107
 cuc-0.101
 Chick-0.096
 please-0.096
 o-0.095
Token cuc
Token ID38421
Feature activation-2.95e-01

Pos logit contributions
 Sp+0.064
 cuc+0.054
 Chick+0.054
 o+0.053
bel+0.053
Neg logit contributions
Jennifer-0.090
Greg-0.089
urance-0.088
ags-0.088
Jerry-0.087
Tokenumbers
Token ID17024
Feature activation-1.73e-01

Pos logit contributions
 Sp+0.492
 o+0.430
 Chick+0.411
 M+0.403
 L+0.398
Neg logit contributions
ady-0.522
inger-0.494
tted-0.490
ags-0.471
agging-0.459
Token,
Token ID11
Feature activation+2.98e-03

Pos logit contributions
 Sp+0.247
 Chick+0.218
 cuc+0.217
 Will+0.198
 o+0.194
Neg logit contributions
inger-0.284
ady-0.276
Jennifer-0.262
urance-0.260
ags-0.260
Token tomatoes
Token ID23972
Feature activation-8.38e-02

Pos logit contributions
urance+0.005
Jennifer+0.005
Jerry+0.005
ady+0.005
Greg+0.005
Neg logit contributions
 Sp-0.004
ie-0.003
 Chick-0.003
ectar-0.003
 please-0.003
Token and
Token ID290
Feature activation+4.10e-03

Pos logit contributions
 Sp+0.116
 cuc+0.100
 Chick+0.100
ie+0.096
 o+0.096
Neg logit contributions
urance-0.146
inger-0.143
ady-0.141
Greg-0.134
Jennifer-0.133
Token some
Token ID617
Feature activation-4.98e-02

Pos logit contributions
Jennifer+0.007
urance+0.007
Jerry+0.007
ady+0.007
Greg+0.007
Neg logit contributions
 Sp-0.005
ectar-0.004
ie-0.004
bel-0.004
 Chick-0.004
Token tomato
Token ID24240
Feature activation-9.59e-02

Pos logit contributions
 Sp+0.034
 cuc+0.032
 o+0.031
ie+0.031
 Chick+0.030
Neg logit contributions
urance-0.042
Greg-0.040
Jennifer-0.040
inger-0.039
ady-0.038
Token,
Token ID11
Feature activation+2.69e-03

Pos logit contributions
 Sp+0.137
 Chick+0.124
ie+0.118
 cuc+0.116
 Jos+0.113
Neg logit contributions
urance-0.162
inger-0.152
agging-0.148
ady-0.147
tted-0.146
Token and
Token ID290
Feature activation+1.76e-03

Pos logit contributions
urance+0.006
Greg+0.005
Jennifer+0.005
inger+0.005
ady+0.005
Neg logit contributions
 Sp-0.003
 cuc-0.003
 o-0.003
ie-0.003
 Chick-0.003
Token then
Token ID788
Feature activation-3.56e-02

Pos logit contributions
Jennifer+0.003
Greg+0.003
Jerry+0.003
urance+0.003
 fore+0.003
Neg logit contributions
ie-0.002
 Sp-0.002
ectar-0.002
 cuc-0.002
 o-0.002
Token a
Token ID257
Feature activation-2.34e-02

Pos logit contributions
ie+0.047
 Sp+0.045
 cuc+0.044
 Chick+0.041
ectar+0.040
Neg logit contributions
Greg-0.061
Jennifer-0.060
Jerry-0.058
urance-0.055
Kyle-0.055
Token cuc
Token ID38421
Feature activation-2.94e-01

Pos logit contributions
 Sp+0.034
ie+0.031
 cuc+0.030
 Chick+0.030
 if+0.030
Neg logit contributions
urance-0.041
Greg-0.039
Jennifer-0.039
inger-0.038
Jerry-0.038
Tokenumber
Token ID4494
Feature activation-1.63e-01

Pos logit contributions
 Sp+0.506
 Chick+0.449
 o+0.419
ie+0.418
 M+0.417
Neg logit contributions
ady-0.484
inger-0.473
agging-0.461
tted-0.444
ags-0.442
Token.
Token ID13
Feature activation+1.21e-03

Pos logit contributions
 Sp+0.200
 cuc+0.185
 Chick+0.174
ie+0.168
 o+0.167
Neg logit contributions
urance-0.320
Greg-0.306
inger-0.306
ady-0.301
Jennifer-0.299
Token Lily
Token ID20037
Feature activation-6.84e-03

Pos logit contributions
urance+0.002
Jennifer+0.002
Greg+0.002
Jerry+0.001
ady+0.001
Neg logit contributions
ie-0.002
 cuc-0.002
 o-0.002
 Sp-0.002
ware-0.002
Token and
Token ID290
Feature activation+1.52e-03

Pos logit contributions
 Sp+0.008
 cuc+0.008
 o+0.007
ie+0.007
 Chick+0.007
Neg logit contributions
urance-0.014
Greg-0.013
Jennifer-0.013
inger-0.013
ady-0.013
Token Tom
Token ID4186
Feature activation+6.67e-02

Pos logit contributions
urance+0.002
Greg+0.002
inger+0.002
Jennifer+0.002
ady+0.002
Neg logit contributions
 Sp-0.003
 cuc-0.002
 o-0.002
 Chick-0.002
ie-0.002
Token.
Token ID13
Feature activation+1.37e-03

Pos logit contributions
 Sp+0.069
bel+0.062
 Chick+0.062
 Jos+0.060
 Manny+0.056
Neg logit contributions
ady-0.052
agging-0.049
keeping-0.049
inger-0.049
 property-0.048
Token And
Token ID843
Feature activation-4.75e-02

Pos logit contributions
urance+0.002
Jennifer+0.002
ady+0.002
Greg+0.002
Jerry+0.002
Neg logit contributions
ie-0.002
 cuc-0.002
 o-0.002
ware-0.002
bel-0.002
Token this
Token ID428
Feature activation+1.23e-02

Pos logit contributions
ie+0.060
 Sp+0.057
ware+0.053
 cuc+0.052
bel+0.051
Neg logit contributions
urance-0.081
Jennifer-0.077
 respect-0.077
ady-0.077
 strengths-0.076
Token is
Token ID318
Feature activation+3.87e-02

Pos logit contributions
urance+0.018
ady+0.018
inger+0.017
Jennifer+0.017
 way+0.017
Neg logit contributions
 Sp-0.021
 Chick-0.019
bel-0.018
 cuc-0.017
 Jos-0.017
Token a
Token ID257
Feature activation-2.25e-02

Pos logit contributions
 supposed+0.058
ady+0.057
urance+0.055
inger+0.055
 way+0.053
Neg logit contributions
bel-0.056
 Sp-0.056
ie-0.054
ware-0.050
berry-0.050
Token cuc
Token ID38421
Feature activation-2.94e-01

Pos logit contributions
 Sp+0.036
ie+0.034
 Chick+0.032
 if+0.030
 cuc+0.030
Neg logit contributions
urance-0.034
Jennifer-0.033
Jerry-0.032
Greg-0.032
ady-0.032
Tokenumber
Token ID4494
Feature activation-1.63e-01

Pos logit contributions
 Sp+0.558
 Chick+0.483
 M+0.482
ie+0.461
 o+0.459
Neg logit contributions
ady-0.451
inger-0.447
agging-0.422
tted-0.404
Greg-0.391
Token.
Token ID13
Feature activation+1.25e-03

Pos logit contributions
 Sp+0.229
 cuc+0.213
 Chick+0.190
 fro+0.189
ware+0.177
Neg logit contributions
ady-0.285
inger-0.280
urance-0.274
 supposed-0.268
oys-0.262
Token It
Token ID632
Feature activation-2.13e-02

Pos logit contributions
urance+0.002
ady+0.002
Jennifer+0.002
Greg+0.002
Jerry+0.002
Neg logit contributions
ie-0.002
 cuc-0.002
 o-0.002
 Sp-0.002
ware-0.002
Token is
Token ID318
Feature activation+3.90e-02

Pos logit contributions
bel+0.041
 Sp+0.040
ie+0.039
 cuc+0.038
 Jos+0.037
Neg logit contributions
ady-0.020
inger-0.019
urance-0.018
 fore-0.018
 way-0.017
Token green
Token ID4077
Feature activation+8.29e-03

Pos logit contributions
ady+0.063
urance+0.061
 supposed+0.061
inger+0.060
Jennifer+0.058
Neg logit contributions
 Sp-0.056
ie-0.054
bel-0.051
ware-0.049
 cuc-0.048
Token and
Token ID290
Feature activation+4.83e-03

Pos logit contributions
urance+0.005
Jennifer+0.005
Greg+0.005
ady+0.004
inger+0.004
Neg logit contributions
 Sp-0.003
ie-0.003
 cuc-0.003
 Chick-0.003
 o-0.003
Token the
Token ID262
Feature activation-5.21e-03

Pos logit contributions
urance+0.009
Jennifer+0.008
Jerry+0.008
Greg+0.008
 way+0.008
Neg logit contributions
 Sp-0.006
ie-0.005
 Chick-0.005
 o-0.005
 cuc-0.005
Token peppers
Token ID31804
Feature activation-7.92e-02

Pos logit contributions
 Sp+0.007
ie+0.006
 cuc+0.006
 Chick+0.006
 o+0.006
Neg logit contributions
urance-0.010
Jennifer-0.009
Greg-0.009
inger-0.009
ady-0.009
Token from
Token ID422
Feature activation+1.98e-02

Pos logit contributions
 Sp+0.095
 cuc+0.088
 o+0.083
 Chick+0.082
ie+0.081
Neg logit contributions
urance-0.160
Greg-0.152
Jennifer-0.150
inger-0.150
ady-0.149
Token the
Token ID262
Feature activation-5.34e-03

Pos logit contributions
urance+0.044
Greg+0.042
Jennifer+0.042
inger+0.041
ady+0.040
Neg logit contributions
 Sp-0.020
 cuc-0.019
 o-0.018
ie-0.017
 Chick-0.017
Token cuc
Token ID38421
Feature activation-2.94e-01

Pos logit contributions
 Sp+0.009
ie+0.008
 cuc+0.008
 Chick+0.008
 o+0.008
Neg logit contributions
urance-0.008
Jennifer-0.008
Greg-0.008
inger-0.008
ady-0.008
Tokenumbers
Token ID17024
Feature activation-1.71e-01

Pos logit contributions
 Sp+0.492
 Chick+0.421
 o+0.413
ie+0.407
 M+0.407
Neg logit contributions
inger-0.505
ady-0.502
agging-0.468
Greg-0.468
tted-0.461
Token.
Token ID13
Feature activation+5.14e-03

Pos logit contributions
 Sp+0.201
 cuc+0.189
 o+0.177
 Chick+0.174
ie+0.170
Neg logit contributions
urance-0.352
Greg-0.333
Jennifer-0.332
inger-0.331
ady-0.328
Token It
Token ID632
Feature activation-1.97e-02

Pos logit contributions
urance+0.009
Greg+0.008
Jennifer+0.008
inger+0.008
ady+0.008
Neg logit contributions
 Sp-0.008
 cuc-0.008
 o-0.007
ie-0.007
 Chick-0.007
Token took
Token ID1718
Feature activation-9.94e-03

Pos logit contributions
ectar+0.030
 Sp+0.029
 please+0.029
 cuc+0.029
 Chick+0.027
Neg logit contributions
Greg-0.025
tted-0.025
 fore-0.025
 way-0.025
urance-0.024
Token a
Token ID257
Feature activation-2.18e-02

Pos logit contributions
 Sp+0.012
 cuc+0.011
 o+0.011
ie+0.011
ware+0.010
Neg logit contributions
urance-0.019
Greg-0.018
Jennifer-0.018
inger-0.018
ady-0.018
Token some
Token ID617
Feature activation-4.69e-02

Pos logit contributions
Greg+0.067
ady+0.065
urance+0.064
Jennifer+0.064
inger+0.064
Neg logit contributions
 Sp-0.052
 cuc-0.049
 please-0.047
 Chick-0.047
bel-0.047
Token y
Token ID331
Feature activation-1.24e-01

Pos logit contributions
 Sp+0.073
 Chick+0.065
 cuc+0.065
ie+0.061
 Will+0.059
Neg logit contributions
ady-0.072
urance-0.071
inger-0.068
Greg-0.067
Jennifer-0.066
Tokenummy
Token ID13513
Feature activation+1.25e-02

Pos logit contributions
 Sp+0.157
 cuc+0.148
 o+0.141
ie+0.136
 Chick+0.136
Neg logit contributions
urance-0.244
Greg-0.231
Jennifer-0.230
inger-0.228
ady-0.225
Token carrots
Token ID34397
Feature activation-7.23e-02

Pos logit contributions
ady+0.017
urance+0.017
Greg+0.016
inger+0.016
Jerry+0.016
Neg logit contributions
 Sp-0.020
 please-0.018
 Chick-0.018
bel-0.018
 cuc-0.017
Token and
Token ID290
Feature activation+8.29e-03

Pos logit contributions
 Sp+0.107
 Chick+0.095
 cuc+0.089
 fro+0.085
 Jos+0.085
Neg logit contributions
inger-0.114
urance-0.113
ady-0.112
Greg-0.108
Jerry-0.108
Token cuc
Token ID38421
Feature activation-2.94e-01

Pos logit contributions
urance+0.013
Jerry+0.013
Jennifer+0.013
 way+0.012
Greg+0.012
Neg logit contributions
 Sp-0.011
ie-0.010
ectar-0.010
ware-0.010
 Isabel-0.009
Tokenumbers
Token ID17024
Feature activation-1.71e-01

Pos logit contributions
 Sp+0.450
 o+0.387
 Chick+0.366
 M+0.358
?"+0.346
Neg logit contributions
ady-0.553
inger-0.537
agging-0.518
Greg-0.515
tted-0.505
Token from
Token ID422
Feature activation+2.15e-02

Pos logit contributions
 Sp+0.257
 Chick+0.228
 cuc+0.225
 fro+0.207
 o+0.204
Neg logit contributions
urance-0.273
inger-0.272
ady-0.267
Greg-0.252
oys-0.252
Token the
Token ID262
Feature activation-2.65e-03

Pos logit contributions
urance+0.041
Greg+0.041
Jennifer+0.040
inger+0.039
Jerry+0.039
Neg logit contributions
 Sp-0.025
 cuc-0.023
 Chick-0.022
 o-0.021
 please-0.021
Token farmer
Token ID18739
Feature activation-3.93e-02

Pos logit contributions
 Sp+0.004
 cuc+0.004
 o+0.004
ie+0.004
 Chick+0.004
Neg logit contributions
urance-0.004
Greg-0.004
Jennifer-0.004
inger-0.004
ady-0.004
Token's
Token ID338
Feature activation+2.18e-02

Pos logit contributions
 Sp+0.061
 cuc+0.056
 o+0.055
 Chick+0.054
ie+0.052
Neg logit contributions
urance-0.062
Jennifer-0.059
ady-0.059
Greg-0.058
inger-0.058
Token They
Token ID1119
Feature activation+6.99e-02

Pos logit contributions
urance+0.011
Jennifer+0.011
inger+0.011
Greg+0.011
ady+0.010
Neg logit contributions
 Sp-0.008
 cuc-0.007
 o-0.007
 Chick-0.007
 if-0.007
Token were
Token ID547
Feature activation+9.34e-03

Pos logit contributions
urance+0.083
Jerry+0.080
ady+0.076
Jennifer+0.076
atha+0.075
Neg logit contributions
 Sp-0.116
bel-0.108
ware-0.106
 please-0.105
 Does-0.104
Token filled
Token ID5901
Feature activation-1.43e-01

Pos logit contributions
 supposed+0.015
urance+0.014
ady+0.014
Jerry+0.013
Jennifer+0.013
Neg logit contributions
ware-0.015
 Sp-0.015
 cuc-0.013
ie-0.013
 Chick-0.013
Token with
Token ID351
Feature activation+2.55e-02

Pos logit contributions
 Sp+0.163
 cuc+0.152
 Chick+0.146
 o+0.142
ware+0.136
Neg logit contributions
urance-0.279
Jennifer-0.278
Greg-0.273
Jerry-0.271
inger-0.270
Token fresh
Token ID4713
Feature activation-1.75e-04

Pos logit contributions
urance+0.040
ady+0.038
Jennifer+0.038
Greg+0.037
Jerry+0.037
Neg logit contributions
 Sp-0.039
 Chick-0.035
ie-0.035
ware-0.034
 cuc-0.032
Token cuc
Token ID38421
Feature activation-2.94e-01

Pos logit contributions
 Sp+0.000
 Chick+0.000
 cuc+0.000
ie+0.000
 if+0.000
Neg logit contributions
urance-0.000
ady-0.000
Greg-0.000
Jennifer-0.000
inger-0.000
Tokenumber
Token ID4494
Feature activation-1.62e-01

Pos logit contributions
 Sp+0.479
 Chick+0.398
 o+0.394
 if+0.389
?"+0.388
Neg logit contributions
ady-0.535
inger-0.522
agging-0.493
tted-0.482
Greg-0.465
Token and
Token ID290
Feature activation+4.91e-03

Pos logit contributions
 Sp+0.193
 cuc+0.183
 o+0.176
ie+0.171
 Chick+0.167
Neg logit contributions
urance-0.331
Greg-0.317
Jennifer-0.315
inger-0.309
ady-0.305
Token cheese
Token ID9891
Feature activation-1.25e-01

Pos logit contributions
 way+0.006
Jerry+0.006
 fore+0.006
 courage+0.006
 directions+0.006
Neg logit contributions
 Sp-0.007
ware-0.007
ie-0.007
 Chick-0.007
 Isabel-0.007
Token.
Token ID13
Feature activation+5.93e-03

Pos logit contributions
 Sp+0.179
 Chick+0.156
 cuc+0.149
ie+0.146
 please+0.144
Neg logit contributions
urance-0.212
inger-0.206
ady-0.202
Jennifer-0.194
Jerry-0.193
Token It
Token ID632
Feature activation-1.86e-02

Pos logit contributions
urance+0.011
Jennifer+0.011
inger+0.011
Greg+0.011
ady+0.011
Neg logit contributions
 Sp-0.009
 cuc-0.007
 o-0.007
 Chick-0.007
 if-0.007
Token chopped
Token ID20720
Feature activation-1.38e-01

Pos logit contributions
 Sp+0.003
 cuc+0.003
 o+0.003
 Chick+0.003
ware+0.003
Neg logit contributions
urance-0.004
ady-0.004
Greg-0.004
Jennifer-0.004
Jerry-0.004
Token up
Token ID510
Feature activation+9.19e-02

Pos logit contributions
 Sp+0.192
 please+0.182
 fro+0.173
 Chick+0.168
 cuc+0.167
Neg logit contributions
ags-0.210
Greg-0.206
inger-0.203
Jennifer-0.202
ady-0.202
Token some
Token ID617
Feature activation-4.78e-02

Pos logit contributions
urance+0.150
Jennifer+0.150
ady+0.143
Jerry+0.141
Greg+0.141
Neg logit contributions
 Sp-0.120
 please-0.112
 Chick-0.108
ie-0.107
 fro-0.103
Token tomatoes
Token ID23972
Feature activation-8.30e-02

Pos logit contributions
 Sp+0.075
 Chick+0.068
 Isabel+0.066
 fro+0.063
 Stri+0.062
Neg logit contributions
ady-0.072
 way-0.067
 directions-0.067
ags-0.067
urance-0.065
Token,
Token ID11
Feature activation+2.88e-03

Pos logit contributions
 Sp+0.120
 Chick+0.106
 fro+0.104
 o+0.100
ie+0.100
Neg logit contributions
urance-0.134
ady-0.134
inger-0.133
oys-0.125
tted-0.122
Token cuc
Token ID38421
Feature activation-2.94e-01

Pos logit contributions
urance+0.005
Jennifer+0.005
Jerry+0.005
ady+0.005
Greg+0.005
Neg logit contributions
 Sp-0.004
ie-0.003
 Chick-0.003
 o-0.003
 fro-0.003
Tokenumbers
Token ID17024
Feature activation-1.72e-01

Pos logit contributions
 Sp+0.430
 o+0.381
 Chick+0.368
ie+0.344
 cuc+0.342
Neg logit contributions
ady-0.554
inger-0.537
Greg-0.524
Jennifer-0.515
agging-0.511
Token,
Token ID11
Feature activation+1.89e-03

Pos logit contributions
 Sp+0.235
 cuc+0.209
 Chick+0.206
 o+0.203
 fro+0.197
Neg logit contributions
urance-0.302
inger-0.291
ady-0.289
oys-0.277
Greg-0.275
Token and
Token ID290
Feature activation+6.21e-03

Pos logit contributions
urance+0.003
ady+0.003
Jerry+0.003
Jennifer+0.003
Greg+0.003
Neg logit contributions
 Sp-0.002
ie-0.002
 Chick-0.002
 fro-0.002
ectar-0.002
Token lettuce
Token ID39406
Feature activation-6.09e-02

Pos logit contributions
Jerry+0.010
Jennifer+0.010
urance+0.010
 way+0.009
Greg+0.009
Neg logit contributions
 Sp-0.007
ectar-0.007
ie-0.007
bel-0.006
 fro-0.006
Token.
Token ID13
Feature activation+4.05e-03

Pos logit contributions
 Sp+0.089
 Chick+0.077
 fro+0.074
 cuc+0.073
 Will+0.072
Neg logit contributions
urance-0.100
inger-0.098
ady-0.095
oys-0.091
agging-0.088
Token cuc
Token ID38421
Feature activation-2.93e-01

Pos logit contributions
 Sp+0.036
ie+0.033
 Chick+0.032
 cuc+0.031
 o+0.030
Neg logit contributions
urance-0.040
Greg-0.038
Jennifer-0.037
Jerry-0.037
inger-0.036
Tokenumber
Token ID4494
Feature activation-1.63e-01

Pos logit contributions
 Sp+0.523
 Chick+0.465
 M+0.443
 o+0.433
ie+0.426
Neg logit contributions
ady-0.455
inger-0.455
agging-0.433
Greg-0.417
tted-0.407
Token.
Token ID13
Feature activation-4.70e-04

Pos logit contributions
 Sp+0.225
 cuc+0.210
 Chick+0.194
 o+0.185
 fro+0.183
Neg logit contributions
urance-0.284
ady-0.280
inger-0.277
oys-0.267
Jennifer-0.264
Token Mama
Token ID40084
Feature activation-4.69e-02

Pos logit contributions
ie+0.001
 o+0.001
 cuc+0.001
bel+0.001
ware+0.001
Neg logit contributions
urance-0.001
oys-0.001
ady-0.001
 Fighting-0.001
Greg-0.001
Token puts
Token ID7584
Feature activation-2.90e-02

Pos logit contributions
 Sp+0.071
 cuc+0.068
 Chick+0.063
 o+0.062
ie+0.062
Neg logit contributions
urance-0.075
Greg-0.072
Jennifer-0.072
inger-0.071
ady-0.070
Token cuc
Token ID38421
Feature activation-2.93e-01

Pos logit contributions
 Sp+0.033
 cuc+0.029
ie+0.028
 Chick+0.027
bel+0.026
Neg logit contributions
urance-0.053
ady-0.053
Jennifer-0.053
inger-0.053
Greg-0.053
Tokenumbers
Token ID17024
Feature activation-1.73e-01

Pos logit contributions
 Sp+0.528
 Chick+0.462
 M+0.452
 o+0.438
 L+0.429
Neg logit contributions
ady-0.472
inger-0.456
ags-0.426
agging-0.423
Greg-0.416
Token in
Token ID287
Feature activation+3.63e-02

Pos logit contributions
 Sp+0.238
 cuc+0.222
 Chick+0.213
 fro+0.194
ie+0.190
Neg logit contributions
ady-0.294
inger-0.292
ags-0.278
urance-0.274
oys-0.273
Token our
Token ID674
Feature activation+9.37e-02

Pos logit contributions
ady+0.066
urance+0.066
Jennifer+0.065
inger+0.064
Greg+0.063
Neg logit contributions
 Sp-0.043
 cuc-0.038
ie-0.038
bel-0.037
 Chick-0.036
Token salad
Token ID20698
Feature activation-1.33e-01

Pos logit contributions
urance+0.157
ags+0.145
inger+0.144
ady+0.144
Jennifer+0.142
Neg logit contributions
 Sp-0.134
 Chick-0.122
 if-0.122
ie-0.121
 Will-0.111
Token.
Token ID13
Feature activation-8.25e-05

Pos logit contributions
 Sp+0.207
 Chick+0.184
 cuc+0.171
 Will+0.168
 Jos+0.167
Neg logit contributions
ady-0.214
inger-0.210
urance-0.196
ags-0.190
 property-0.189
TokenI
Token ID40
Feature activation-2.72e-02

Pos logit contributions
urance+0.078
Greg+0.077
Jennifer+0.076
inger+0.074
Jerry+0.074
Neg logit contributions
 Sp-0.086
ie-0.082
 cuc-0.082
 o-0.081
 Chick-0.078
Token think
Token ID892
Feature activation+3.25e-03

Pos logit contributions
 cuc+0.040
 Sp+0.040
ie+0.037
 o+0.036
 Chick+0.036
Neg logit contributions
urance-0.046
Greg-0.045
Jennifer-0.045
ady-0.043
inger-0.043
Token it
Token ID340
Feature activation-1.54e-02

Pos logit contributions
urance+0.005
 courage+0.005
 appearances+0.005
 Jerry+0.005
 way+0.005
Neg logit contributions
bel-0.005
ie-0.004
 cuc-0.004
ba-0.003
 fro-0.003
Token is
Token ID318
Feature activation+3.81e-02

Pos logit contributions
 cuc+0.027
 Sp+0.027
bel+0.026
 Chick+0.025
ie+0.025
Neg logit contributions
urance-0.019
ady-0.017
Greg-0.017
Jennifer-0.017
inger-0.017
Token a
Token ID257
Feature activation-2.42e-02

Pos logit contributions
urance+0.062
ady+0.058
 supposed+0.057
inger+0.057
Jennifer+0.056
Neg logit contributions
 Sp-0.054
ie-0.051
bel-0.050
 cuc-0.048
 Chick-0.048
Token cuc
Token ID38421
Feature activation-2.93e-01

Pos logit contributions
 Sp+0.036
ie+0.033
 Chick+0.032
 cuc+0.031
 o+0.030
Neg logit contributions
urance-0.040
Greg-0.038
Jennifer-0.037
Jerry-0.037
inger-0.036
Tokenumber
Token ID4494
Feature activation-1.63e-01

Pos logit contributions
 Sp+0.523
 Chick+0.465
 M+0.443
 o+0.433
ie+0.426
Neg logit contributions
ady-0.455
inger-0.455
agging-0.433
Greg-0.417
tted-0.407
Token.
Token ID13
Feature activation-4.70e-04

Pos logit contributions
 Sp+0.225
 cuc+0.210
 Chick+0.194
 o+0.185
 fro+0.183
Neg logit contributions
urance-0.284
ady-0.280
inger-0.277
oys-0.267
Jennifer-0.264
Token Mama
Token ID40084
Feature activation-4.69e-02

Pos logit contributions
ie+0.001
 o+0.001
 cuc+0.001
bel+0.001
ware+0.001
Neg logit contributions
urance-0.001
oys-0.001
ady-0.001
 Fighting-0.001
Greg-0.001
Token puts
Token ID7584
Feature activation-2.90e-02

Pos logit contributions
 Sp+0.071
 cuc+0.068
 Chick+0.063
 o+0.062
ie+0.062
Neg logit contributions
urance-0.075
Greg-0.072
Jennifer-0.072
inger-0.071
ady-0.070
Token cuc
Token ID38421
Feature activation-2.93e-01

Pos logit contributions
 Sp+0.033
 cuc+0.029
ie+0.028
 Chick+0.027
bel+0.026
Neg logit contributions
urance-0.053
ady-0.053
Jennifer-0.053
inger-0.053
Greg-0.053
Token happy
Token ID3772
Feature activation-3.04e-02

Pos logit contributions
urance+0.045
ady+0.044
Jennifer+0.044
Greg+0.043
inger+0.043
Neg logit contributions
 Sp-0.054
 cuc-0.051
ie-0.049
 Chick-0.048
ware-0.047
Token to
Token ID284
Feature activation-2.44e-02

Pos logit contributions
 cuc+0.046
 Sp+0.045
 Chick+0.042
 Jos+0.041
ware+0.041
Neg logit contributions
 property-0.042
 respect-0.041
 strengths-0.040
 way-0.040
 supposed-0.040
Token see
Token ID766
Feature activation-4.43e-02

Pos logit contributions
 Sp+0.037
 cuc+0.034
ie+0.033
 Chick+0.032
 o+0.032
Neg logit contributions
urance-0.042
Jennifer-0.040
Greg-0.039
inger-0.039
Jerry-0.038
Token our
Token ID674
Feature activation+9.49e-02

Pos logit contributions
bel+0.066
 cuc+0.061
 Sp+0.060
ware+0.058
 fro+0.054
Neg logit contributions
 respect-0.064
Jennifer-0.062
 authority-0.060
Susan-0.059
Greg-0.058
Token big
Token ID1263
Feature activation+1.64e-02

Pos logit contributions
urance+0.172
inger+0.160
 property+0.159
ady+0.158
Jennifer+0.157
Neg logit contributions
 Sp-0.127
 Chick-0.116
 cuc-0.105
 if-0.105
ie-0.104
Token cuc
Token ID38421
Feature activation-2.93e-01

Pos logit contributions
urance+0.031
inger+0.029
ady+0.028
Jennifer+0.028
Greg+0.028
Neg logit contributions
 Sp-0.021
 Chick-0.019
bel-0.018
ie-0.017
 cuc-0.017
Tokenumber
Token ID4494
Feature activation-1.62e-01

Pos logit contributions
 Sp+0.529
 Chick+0.470
 M+0.455
 L+0.446
 Will+0.439
Neg logit contributions
inger-0.466
ady-0.451
agging-0.428
Greg-0.399
ags-0.396
Token."
Token ID526
Feature activation-3.14e-02

Pos logit contributions
 Sp+0.232
 cuc+0.210
 Chick+0.196
 o+0.179
 fro+0.178
Neg logit contributions
urance-0.290
inger-0.285
ady-0.281
Jennifer-0.265
 property-0.263
Token Lily
Token ID20037
Feature activation-4.81e-03

Pos logit contributions
ware+0.055
ie+0.052
bel+0.052
 cuc+0.048
fish+0.048
Neg logit contributions
Jennifer-0.045
Jerry-0.040
 Jennifer-0.039
Greg-0.038
 Tammy-0.037
Token says
Token ID1139
Feature activation-3.53e-02

Pos logit contributions
 Sp+0.007
 cuc+0.006
ie+0.006
 o+0.006
 Chick+0.006
Neg logit contributions
urance-0.009
Jennifer-0.008
Greg-0.008
ady-0.008
inger-0.008
Token.
Token ID13
Feature activation-2.46e-04

Pos logit contributions
 Sp+0.034
 cuc+0.032
 o+0.030
ie+0.030
 if+0.030
Neg logit contributions
urance-0.080
Greg-0.079
Jennifer-0.078
inger-0.076
Jerry-0.076
Token cut
Token ID2005
Feature activation-1.01e-01

Pos logit contributions
 Sp+0.031
 cuc+0.029
ie+0.028
 Chick+0.027
 if+0.026
Neg logit contributions
urance-0.044
ady-0.040
inger-0.039
Greg-0.039
Jennifer-0.039
Token the
Token ID262
Feature activation-5.04e-03

Pos logit contributions
 Sp+0.120
 cuc+0.117
bel+0.113
 o+0.111
ie+0.108
Neg logit contributions
urance-0.171
ady-0.169
Greg-0.168
inger-0.166
ags-0.165
Token lettuce
Token ID39406
Feature activation-6.02e-02

Pos logit contributions
 Sp+0.007
 Chick+0.006
ie+0.006
 if+0.005
 Will+0.005
Neg logit contributions
urance-0.010
ady-0.009
inger-0.009
Jennifer-0.009
agging-0.009
Token and
Token ID290
Feature activation+1.38e-03

Pos logit contributions
 Sp+0.093
 Chick+0.085
ie+0.079
bel+0.077
 Will+0.077
Neg logit contributions
urance-0.086
inger-0.085
 way-0.083
ady-0.082
 pace-0.077
Token the
Token ID262
Feature activation-5.24e-03

Pos logit contributions
urance+0.002
 way+0.002
 fists+0.002
Jennifer+0.002
Jerry+0.002
Neg logit contributions
 Sp-0.002
ie-0.002
ectar-0.002
 Chick-0.002
ba-0.002
Token cuc
Token ID38421
Feature activation-2.93e-01

Pos logit contributions
 Sp+0.007
 Chick+0.006
ie+0.006
 o+0.006
 if+0.006
Neg logit contributions
urance-0.010
Jennifer-0.009
ady-0.009
Jerry-0.009
inger-0.009
Tokenumber
Token ID4494
Feature activation-1.62e-01

Pos logit contributions
 Sp+0.513
 Chick+0.443
 M+0.434
 o+0.434
ie+0.427
Neg logit contributions
inger-0.488
ady-0.483
tted-0.457
ags-0.450
agging-0.444
Token.
Token ID13
Feature activation+3.44e-03

Pos logit contributions
 Sp+0.241
 cuc+0.220
 Chick+0.215
 fro+0.208
ectar+0.197
Neg logit contributions
urance-0.261
ady-0.246
inger-0.241
tted-0.229
 supposed-0.227
Token She
Token ID1375
Feature activation-5.68e-04

Pos logit contributions
urance+0.006
inger+0.006
Jennifer+0.006
ady+0.006
Greg+0.006
Neg logit contributions
 Sp-0.005
 cuc-0.005
 o-0.005
 if-0.004
 Chick-0.004
Token said
Token ID531
Feature activation-3.16e-02

Pos logit contributions
ectar+0.001
 Sp+0.001
 please+0.001
 fro+0.001
ie+0.001
Neg logit contributions
ady-0.001
Greg-0.001
Jennifer-0.001
Jerry-0.001
 fore-0.001
Token he
Token ID339
Feature activation-1.02e-01

Pos logit contributions
ie+0.042
light+0.038
ectar+0.037
bel+0.036
ry+0.035
Neg logit contributions
 Jerry-0.049
urance-0.048
Greg-0.046
Jerry-0.046
 Heidi-0.045
Token saw
Token ID2497
Feature activation-2.43e-02

Pos logit contributions
 Sp+0.028
 cuc+0.026
 Chick+0.026
ie+0.026
 if+0.025
Neg logit contributions
Jennifer-0.023
urance-0.023
Greg-0.023
Jerry-0.023
ady-0.022
Token carrots
Token ID34397
Feature activation-7.17e-02

Pos logit contributions
 Sp+0.035
ie+0.033
 please+0.032
 Chick+0.032
ry+0.032
Neg logit contributions
Jerry-0.036
Greg-0.035
Jennifer-0.034
urance-0.034
Susan-0.033
Token,
Token ID11
Feature activation+5.79e-03

Pos logit contributions
 Sp+0.097
 cuc+0.086
 Chick+0.086
 o+0.079
ie+0.078
Neg logit contributions
urance-0.129
inger-0.123
Greg-0.123
ady-0.122
Jerry-0.120
Token potatoes
Token ID18821
Feature activation-1.25e-01

Pos logit contributions
Jerry+0.010
urance+0.010
Jennifer+0.010
ady+0.009
Greg+0.009
Neg logit contributions
 Sp-0.007
ie-0.006
 Chick-0.006
 please-0.006
 Isabel-0.006
Token,
Token ID11
Feature activation+5.65e-03

Pos logit contributions
 Sp+0.146
 cuc+0.133
 o+0.125
 Chick+0.124
ie+0.124
Neg logit contributions
urance-0.257
Greg-0.244
Jennifer-0.241
inger-0.240
ady-0.239
Token cuc
Token ID38421
Feature activation-2.92e-01

Pos logit contributions
urance+0.010
Jerry+0.010
Jennifer+0.010
ady+0.010
Greg+0.009
Neg logit contributions
 Sp-0.007
 Chick-0.006
ie-0.006
 please-0.006
 Isabel-0.006
Tokenumbers
Token ID17024
Feature activation-1.70e-01

Pos logit contributions
 Sp+0.379
 o+0.327
 Chick+0.312
ie+0.312
 if+0.299
Neg logit contributions
ady-0.581
Greg-0.571
inger-0.566
urance-0.565
Jennifer-0.553
Token and
Token ID290
Feature activation+5.91e-03

Pos logit contributions
 Sp+0.181
 cuc+0.169
 o+0.160
ie+0.155
 Chick+0.153
Neg logit contributions
urance-0.368
Greg-0.352
Jennifer-0.350
inger-0.346
ady-0.343
Token tomatoes
Token ID23972
Feature activation-8.18e-02

Pos logit contributions
urance+0.010
ady+0.010
Jerry+0.010
Jennifer+0.010
Greg+0.010
Neg logit contributions
 Sp-0.007
ie-0.007
ectar-0.006
 Chick-0.006
bel-0.006
Token.
Token ID13
Feature activation+9.85e-03

Pos logit contributions
 Sp+0.117
 Chick+0.104
 cuc+0.101
ie+0.098
 o+0.095
Neg logit contributions
urance-0.141
ady-0.137
inger-0.134
Greg-0.130
agging-0.130
Token He
Token ID679
Feature activation-1.70e-02

Pos logit contributions
urance+0.018
Jennifer+0.017
Greg+0.017
Jerry+0.016
ady+0.016
Neg logit contributions
 Sp-0.014
 cuc-0.013
ie-0.013
 o-0.013
 Chick-0.013
Token like
Token ID588
Feature activation-5.93e-02

Pos logit contributions
urance+0.140
Greg+0.134
Jennifer+0.134
inger+0.131
ady+0.131
Neg logit contributions
 Sp-0.074
 cuc-0.069
 Chick-0.064
 o-0.063
ie-0.061
Token lettuce
Token ID39406
Feature activation-5.82e-02

Pos logit contributions
 Sp+0.078
 cuc+0.074
ie+0.070
 Chick+0.069
 o+0.069
Neg logit contributions
urance-0.109
Greg-0.104
Jennifer-0.103
ady-0.101
inger-0.100
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 Sp+0.091
 Chick+0.080
 Jos+0.076
 Isabel+0.073
 cuc+0.071
Neg logit contributions
urance-0.089
inger-0.086
ady-0.084
Jerry-0.083
 way-0.082
Token tomatoes
Token ID23972
Feature activation-8.11e-02

Pos logit contributions
Jerry+0.018
urance+0.018
Jennifer+0.017
ady+0.017
 strengths+0.017
Neg logit contributions
 Sp-0.015
ie-0.014
 Chick-0.013
 Isabel-0.013
bel-0.013
Token and
Token ID290
Feature activation+8.06e-03

Pos logit contributions
 Sp+0.123
 Chick+0.107
 cuc+0.100
 Jos+0.098
ie+0.098
Neg logit contributions
urance-0.129
inger-0.125
ady-0.124
Jerry-0.120
Jennifer-0.119
Token cuc
Token ID38421
Feature activation-2.91e-01

Pos logit contributions
Jerry+0.013
urance+0.012
Jennifer+0.012
 strengths+0.012
ady+0.012
Neg logit contributions
 Sp-0.010
ie-0.010
 Isabel-0.009
ry-0.009
bel-0.009
Tokenumbers
Token ID17024
Feature activation-1.67e-01

Pos logit contributions
 Sp+0.531
 Chick+0.465
 M+0.448
 ig+0.416
 o+0.407
Neg logit contributions
ady-0.482
inger-0.469
agging-0.438
ags-0.433
issors-0.426
Token.
Token ID13
Feature activation+4.82e-03

Pos logit contributions
 Sp+0.247
 Chick+0.218
 cuc+0.217
 fro+0.200
 o+0.197
Neg logit contributions
urance-0.273
inger-0.273
ady-0.269
oys-0.253
Greg-0.251
Token He
Token ID679
Feature activation-1.76e-02

Pos logit contributions
urance+0.007
Jennifer+0.006
Greg+0.006
Jerry+0.006
ady+0.006
Neg logit contributions
ie-0.008
 o-0.008
 cuc-0.008
 Sp-0.008
ware-0.008
Token smiled
Token ID13541
Feature activation+4.65e-02

Pos logit contributions
 Sp+0.030
 Chick+0.028
ectar+0.026
ie+0.025
bel+0.025
Neg logit contributions
urance-0.023
ady-0.023
Jennifer-0.022
Greg-0.022
Jerry-0.021
Token as
Token ID355
Feature activation-7.25e-03

Pos logit contributions
ady+0.062
urance+0.057
Jennifer+0.055
inger+0.053
agging+0.053
Neg logit contributions
 Sp-0.081
 Chick-0.077
ware-0.073
ie-0.070
atmeal-0.068
Token garden
Token ID11376
Feature activation+3.96e-02

Pos logit contributions
 Sp+0.003
 cuc+0.002
ie+0.002
 o+0.002
 Chick+0.002
Neg logit contributions
urance-0.002
Jennifer-0.002
Greg-0.002
inger-0.002
ady-0.002
Token.
Token ID13
Feature activation+1.68e-03

Pos logit contributions
urance+0.069
inger+0.065
Jennifer+0.063
ady+0.063
Greg+0.060
Neg logit contributions
 Sp-0.055
 Chick-0.051
 cuc-0.051
ie-0.050
 o-0.048
Token They
Token ID1119
Feature activation+6.09e-02

Pos logit contributions
urance+0.003
Jennifer+0.002
Greg+0.002
ady+0.002
Jerry+0.002
Neg logit contributions
 cuc-0.003
 Sp-0.003
ie-0.003
 o-0.003
ware-0.002
Token cut
Token ID2005
Feature activation-9.72e-02

Pos logit contributions
urance+0.092
Jennifer+0.088
Greg+0.087
inger+0.084
ady+0.082
Neg logit contributions
 Sp-0.101
ie-0.094
 cuc-0.093
 Chick-0.092
 o-0.088
Token the
Token ID262
Feature activation-2.18e-04

Pos logit contributions
 cuc+0.106
 Sp+0.105
 o+0.097
ie+0.095
 please+0.090
Neg logit contributions
urance-0.193
inger-0.188
Greg-0.187
ady-0.186
Jennifer-0.183
Token cuc
Token ID38421
Feature activation-2.91e-01

Pos logit contributions
 Sp+0.000
 Chick+0.000
ie+0.000
 if+0.000
 cuc+0.000
Neg logit contributions
urance-0.000
inger-0.000
ady-0.000
Jennifer-0.000
Greg-0.000
Tokenumber
Token ID4494
Feature activation-1.61e-01

Pos logit contributions
 Sp+0.523
 Chick+0.454
ie+0.445
 o+0.444
 M+0.443
Neg logit contributions
inger-0.455
ady-0.446
tted-0.414
ags-0.409
agging-0.402
Token and
Token ID290
Feature activation+1.35e-03

Pos logit contributions
 Sp+0.215
 cuc+0.202
 Chick+0.184
 o+0.179
ie+0.178
Neg logit contributions
urance-0.294
inger-0.287
ady-0.280
Greg-0.271
Jennifer-0.267
Token put
Token ID1234
Feature activation-4.29e-02

Pos logit contributions
Jennifer+0.002
urance+0.002
Greg+0.002
Jerry+0.002
 fists+0.002
Neg logit contributions
ie-0.002
 Sp-0.002
ectar-0.002
 Chick-0.002
 cuc-0.002
Token it
Token ID340
Feature activation-1.23e-02

Pos logit contributions
 Sp+0.043
 cuc+0.040
ie+0.039
 o+0.036
 Chick+0.036
Neg logit contributions
Greg-0.087
urance-0.086
inger-0.084
Jennifer-0.084
ady-0.083
Token in
Token ID287
Feature activation+3.70e-02

Pos logit contributions
 Sp+0.017
 cuc+0.015
 Chick+0.015
ie+0.014
 Will+0.014
Neg logit contributions
ags-0.021
inger-0.021
urance-0.021
agging-0.020
ady-0.020
Token.
Token ID13
Feature activation-5.52e-03

Pos logit contributions
 Sp+0.040
 cuc+0.038
 Chick+0.036
 o+0.035
ie+0.035
Neg logit contributions
urance-0.047
Greg-0.045
Jennifer-0.044
ady-0.042
inger-0.042
Token You
Token ID921
Feature activation+2.25e-02

Pos logit contributions
 Sp+0.009
 cuc+0.009
 Chick+0.008
 o+0.008
 if+0.008
Neg logit contributions
urance-0.009
inger-0.008
Jennifer-0.008
Greg-0.008
ady-0.008
Token found
Token ID1043
Feature activation-6.62e-02

Pos logit contributions
urance+0.036
Greg+0.034
Jennifer+0.034
inger+0.033
ady+0.033
Neg logit contributions
 Sp-0.036
 cuc-0.035
 o-0.034
ie-0.033
 Chick-0.033
Token a
Token ID257
Feature activation-1.90e-02

Pos logit contributions
 Sp+0.081
 cuc+0.080
ie+0.075
 o+0.072
 Chick+0.072
Neg logit contributions
urance-0.121
Greg-0.120
Jennifer-0.120
Jerry-0.115
ady-0.115
Token nice
Token ID3621
Feature activation+7.74e-02

Pos logit contributions
 Sp+0.028
 cuc+0.028
 o+0.027
ie+0.025
 Chick+0.025
Neg logit contributions
urance-0.034
Greg-0.031
Jennifer-0.031
inger-0.031
ady-0.031
Token cuc
Token ID38421
Feature activation-2.91e-01

Pos logit contributions
urance+0.139
Greg+0.133
Jennifer+0.130
inger+0.128
ady+0.128
Neg logit contributions
 cuc-0.111
 Sp-0.111
 o-0.108
ie-0.103
 if-0.100
Tokenumber
Token ID4494
Feature activation-1.58e-01

Pos logit contributions
 Sp+0.521
 Chick+0.490
 M+0.452
 o+0.429
 Will+0.419
Neg logit contributions
inger-0.445
ady-0.445
agging-0.438
tted-0.426
Greg-0.423
Token.
Token ID13
Feature activation-5.90e-03

Pos logit contributions
 Sp+0.178
 cuc+0.164
 Chick+0.152
 o+0.152
ie+0.148
Neg logit contributions
urance-0.329
Greg-0.314
Jennifer-0.314
inger-0.314
ady-0.311
Token It
Token ID632
Feature activation-1.55e-02

Pos logit contributions
 Sp+0.009
 cuc+0.009
 o+0.009
ie+0.008
 Chick+0.008
Neg logit contributions
urance-0.010
Greg-0.009
Jennifer-0.009
inger-0.009
ady-0.009
Token will
Token ID481
Feature activation-2.48e-02

Pos logit contributions
 Sp+0.025
 cuc+0.024
ie+0.023
 Chick+0.023
ectar+0.023
Neg logit contributions
urance-0.021
Greg-0.020
Jerry-0.020
Jennifer-0.020
ady-0.020
Token be
Token ID307
Feature activation+3.12e-02

Pos logit contributions
 Sp+0.042
ie+0.039
 cuc+0.039
 Chick+0.038
 o+0.036
Neg logit contributions
urance-0.036
inger-0.033
Jerry-0.033
Greg-0.033
Jennifer-0.033
Token John
Token ID1757
Feature activation+1.62e-01

Pos logit contributions
urance+0.030
Jennifer+0.029
Greg+0.029
inger+0.029
ady+0.028
Neg logit contributions
 Sp-0.019
 cuc-0.018
 o-0.018
ie-0.017
 if-0.016
Token chose
Token ID7690
Feature activation-5.17e-03

Pos logit contributions
urance+0.309
Greg+0.293
Jennifer+0.292
inger+0.286
ady+0.284
Neg logit contributions
 Sp-0.216
 cuc-0.205
 o-0.196
ie-0.190
 Chick-0.189
Token some
Token ID617
Feature activation-2.94e-02

Pos logit contributions
 Sp+0.006
 cuc+0.005
 Chick+0.005
ie+0.005
 o+0.005
Neg logit contributions
urance-0.010
Greg-0.010
Jennifer-0.010
ady-0.010
Jerry-0.010
Token carrots
Token ID34397
Feature activation-6.96e-02

Pos logit contributions
 Sp+0.042
 cuc+0.039
 Chick+0.038
ie+0.037
 o+0.036
Neg logit contributions
urance-0.050
ady-0.048
Greg-0.048
Jennifer-0.047
inger-0.046
Token and
Token ID290
Feature activation+1.47e-02

Pos logit contributions
 Sp+0.103
 Chick+0.092
 cuc+0.086
 Jos+0.082
ie+0.081
Neg logit contributions
ady-0.111
inger-0.107
urance-0.105
oys-0.102
Greg-0.101
Token cuc
Token ID38421
Feature activation-2.91e-01

Pos logit contributions
urance+0.026
Jennifer+0.025
Jerry+0.025
ady+0.025
Greg+0.025
Neg logit contributions
 Sp-0.018
ectar-0.015
 Chick-0.015
ie-0.015
ware-0.014
Tokenumbers
Token ID17024
Feature activation-1.64e-01

Pos logit contributions
 Sp+0.505
 Chick+0.444
 M+0.416
 o+0.410
 ig+0.405
Neg logit contributions
ady-0.500
inger-0.466
agging-0.438
Greg-0.433
ags-0.433
Token to
Token ID284
Feature activation-1.38e-02

Pos logit contributions
 Sp+0.230
 Chick+0.205
 cuc+0.199
 fro+0.185
 o+0.179
Neg logit contributions
inger-0.265
urance-0.264
ady-0.261
oys-0.246
tted-0.245
Token take
Token ID1011
Feature activation+3.39e-02

Pos logit contributions
 Sp+0.020
 Chick+0.018
ectar+0.018
ie+0.018
 if+0.017
Neg logit contributions
urance-0.022
Jerry-0.020
Greg-0.020
Jennifer-0.020
ady-0.019
Token home
Token ID1363
Feature activation+2.41e-02

Pos logit contributions
urance+0.070
Greg+0.067
Jennifer+0.067
ady+0.066
inger+0.065
Neg logit contributions
 Sp-0.038
 cuc-0.035
 o-0.033
ie-0.033
 Chick-0.032
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
urance+0.044
ady+0.043
inger+0.042
Greg+0.042
Jennifer+0.041
Neg logit contributions
 Sp-0.031
 cuc-0.027
 Chick-0.027
 o-0.026
ie-0.026
Token has
Token ID468
Feature activation+9.77e-02

Pos logit contributions
bel+0.028
 Chick+0.027
 Sp+0.026
ectar+0.026
ie+0.025
Neg logit contributions
 way-0.008
 hates-0.007
urance-0.007
Jerry-0.007
 fore-0.007
Token lettuce
Token ID39406
Feature activation-5.70e-02

Pos logit contributions
urance+0.153
ady+0.146
Jennifer+0.143
Jerry+0.141
inger+0.140
Neg logit contributions
 Sp-0.145
ie-0.145
 Chick-0.134
 if-0.128
 please-0.127
Token and
Token ID290
Feature activation+7.24e-03

Pos logit contributions
 Sp+0.088
 Chick+0.076
 cuc+0.072
ie+0.071
 o+0.069
Neg logit contributions
urance-0.095
inger-0.093
ady-0.092
agging-0.087
Jennifer-0.087
Token tomatoes
Token ID23972
Feature activation-7.86e-02

Pos logit contributions
 tails+0.010
urance+0.010
 fists+0.010
Jerry+0.010
Jennifer+0.010
Neg logit contributions
ie-0.010
 Sp-0.010
bel-0.010
ectar-0.010
ry-0.009
Token and
Token ID290
Feature activation+6.34e-03

Pos logit contributions
 Sp+0.120
 Chick+0.105
ie+0.100
 cuc+0.098
 fro+0.095
Neg logit contributions
urance-0.132
inger-0.128
ady-0.127
agging-0.122
Jennifer-0.121
Token cuc
Token ID38421
Feature activation-2.90e-01

Pos logit contributions
urance+0.009
 tails+0.009
Jerry+0.009
 fists+0.009
Jennifer+0.009
Neg logit contributions
ie-0.009
 Sp-0.009
ectar-0.008
 Chick-0.008
ry-0.008
Tokenumbers
Token ID17024
Feature activation-1.70e-01

Pos logit contributions
 Sp+0.501
 Chick+0.433
 M+0.417
 o+0.394
 Will+0.376
Neg logit contributions
ady-0.509
inger-0.493
ags-0.482
agging-0.475
Greg-0.448
Token.
Token ID13
Feature activation+6.37e-03

Pos logit contributions
 Sp+0.247
 Chick+0.217
 cuc+0.217
ie+0.203
 o+0.200
Neg logit contributions
urance-0.297
inger-0.293
ady-0.287
Jennifer-0.273
Greg-0.273
Token And
Token ID843
Feature activation-4.44e-02

Pos logit contributions
urance+0.009
Greg+0.008
Jennifer+0.008
Jerry+0.008
ady+0.008
Neg logit contributions
ie-0.012
 Sp-0.011
 cuc-0.011
 o-0.011
 Chick-0.011
Token it
Token ID340
Feature activation-1.23e-02

Pos logit contributions
ie+0.050
 Sp+0.049
 Chick+0.047
 cuc+0.046
 o+0.043
Neg logit contributions
urance-0.083
Jennifer-0.080
Greg-0.078
Jerry-0.077
ady-0.076
Token has
Token ID468
Feature activation+9.42e-02

Pos logit contributions
 Sp+0.024
bel+0.024
 Chick+0.024
ectar+0.023
 please+0.023
Neg logit contributions
urance-0.011
ady-0.010
Greg-0.010
Jennifer-0.010
 way-0.010
Token and
Token ID290
Feature activation+1.76e-03

Pos logit contributions
 cuc+0.054
 Sp+0.053
ie+0.049
 Chick+0.048
 o+0.046
Neg logit contributions
urance-0.073
Greg-0.070
ady-0.069
Jennifer-0.069
inger-0.068
Token play
Token ID711
Feature activation-9.53e-03

Pos logit contributions
urance+0.003
Jennifer+0.003
Greg+0.003
Jerry+0.002
ady+0.002
Neg logit contributions
 Sp-0.003
 cuc-0.003
ie-0.003
 Chick-0.003
 o-0.003
Token music
Token ID2647
Feature activation+2.39e-02

Pos logit contributions
 cuc+0.012
 Sp+0.012
ie+0.011
 Chick+0.011
 o+0.011
Neg logit contributions
urance-0.018
Greg-0.018
Jennifer-0.017
agging-0.017
ady-0.017
Token on
Token ID319
Feature activation+3.91e-02

Pos logit contributions
urance+0.041
Greg+0.039
Jennifer+0.039
ady+0.039
agging+0.037
Neg logit contributions
 cuc-0.034
 Sp-0.033
 Chick-0.031
ie-0.030
 o-0.030
Token the
Token ID262
Feature activation-3.89e-03

Pos logit contributions
urance+0.082
Greg+0.079
Jennifer+0.079
inger+0.077
ady+0.076
Neg logit contributions
 Sp-0.043
 cuc-0.042
 o-0.038
ie-0.038
 Chick-0.037
Token street
Token ID4675
Feature activation+2.00e-01

Pos logit contributions
 o+0.006
?"+0.005
 cuc+0.005
 v+0.005
 Sp+0.005
Neg logit contributions
urance-0.007
Greg-0.007
inger-0.007
ady-0.007
Jennifer-0.007
Token.
Token ID13
Feature activation-1.98e-03

Pos logit contributions
urance+0.383
Greg+0.361
Jennifer+0.359
ady+0.355
inger+0.354
Neg logit contributions
 cuc-0.252
 Sp-0.250
 Chick-0.227
 o-0.221
ie-0.217
Token They
Token ID1119
Feature activation+5.97e-02

Pos logit contributions
 Sp+0.003
 cuc+0.003
 o+0.003
 Chick+0.003
 if+0.003
Neg logit contributions
urance-0.003
Jennifer-0.003
Greg-0.003
inger-0.003
ady-0.003
Token wear
Token ID5806
Feature activation+2.31e-02

Pos logit contributions
urance+0.104
Greg+0.098
Jennifer+0.098
inger+0.096
ady+0.095
Neg logit contributions
 Sp-0.089
 cuc-0.086
 o-0.082
ie-0.080
 Chick-0.079
Token colorful
Token ID20239
Feature activation-1.19e-03

Pos logit contributions
urance+0.037
Jennifer+0.036
Greg+0.035
ady+0.034
inger+0.034
Neg logit contributions
 Sp-0.036
 cuc-0.035
ie-0.034
 o-0.033
 Chick-0.033
Token clothes
Token ID8242
Feature activation+3.15e-02

Pos logit contributions
 Sp+0.002
 cuc+0.002
 Chick+0.002
ie+0.002
 o+0.002
Neg logit contributions
urance-0.002
ady-0.002
Jennifer-0.002
Greg-0.002
inger-0.001
Token that
Token ID326
Feature activation+1.07e-02

Pos logit contributions
 Sp+0.041
 cuc+0.040
 Chick+0.040
 o+0.035
ware+0.035
Neg logit contributions
urance-0.061
ady-0.061
Jennifer-0.059
agging-0.059
Greg-0.059
Token they
Token ID484
Feature activation-5.98e-03

Pos logit contributions
urance+0.024
Greg+0.023
Jennifer+0.023
Jerry+0.022
ady+0.022
Neg logit contributions
 Sp-0.010
 cuc-0.010
 o-0.009
ie-0.008
 Chick-0.008
Token had
Token ID550
Feature activation+1.01e-01

Pos logit contributions
 Sp+0.008
 cuc+0.008
 o+0.008
 Chick+0.007
ie+0.007
Neg logit contributions
urance-0.011
Greg-0.011
Jennifer-0.010
inger-0.010
ady-0.010
Token made
Token ID925
Feature activation-7.63e-02

Pos logit contributions
urance+0.205
Greg+0.193
inger+0.191
Jennifer+0.190
Jerry+0.185
Neg logit contributions
 Sp-0.127
 cuc-0.119
 o-0.118
 if-0.111
ie-0.107
Token it
Token ID340
Feature activation-1.43e-02

Pos logit contributions
 cuc+0.103
 Sp+0.101
 Chick+0.092
ie+0.090
 o+0.087
Neg logit contributions
urance-0.136
Jennifer-0.131
Greg-0.129
ady-0.128
inger-0.125
Token themselves
Token ID2405
Feature activation+1.52e-01

Pos logit contributions
 Sp+0.020
 cuc+0.019
 Chick+0.018
 o+0.016
ware+0.016
Neg logit contributions
urance-0.026
inger-0.024
Jennifer-0.024
ady-0.024
 supposed-0.023
Token and
Token ID290
Feature activation+6.93e-03

Pos logit contributions
urance+0.286
ady+0.267
Jennifer+0.257
 pace+0.256
agging+0.255
Neg logit contributions
 Sp-0.203
 Chick-0.184
 cuc-0.183
 o-0.160
ie-0.157
Token they
Token ID484
Feature activation-6.26e-03

Pos logit contributions
urance+0.013
Jennifer+0.013
Greg+0.013
Jerry+0.012
inger+0.012
Neg logit contributions
 Sp-0.008
 cuc-0.008
 o-0.007
ie-0.007
 Chick-0.007
Token felt
Token ID2936
Feature activation-5.31e-02

Pos logit contributions
 Sp+0.010
 cuc+0.009
 o+0.008
 Chick+0.008
ware+0.008
Neg logit contributions
urance-0.010
Greg-0.010
Jennifer-0.010
ady-0.009
inger-0.009
Token safe
Token ID3338
Feature activation+5.63e-02

Pos logit contributions
 Sp+0.074
 cuc+0.069
ie+0.069
 Chick+0.067
 o+0.066
Neg logit contributions
urance-0.093
Greg-0.089
Jennifer-0.089
ady-0.088
inger-0.085
Token knowing
Token ID6970
Feature activation-2.87e-02

Pos logit contributions
urance+0.118
Greg+0.111
Jennifer+0.110
inger+0.109
ady+0.108
Neg logit contributions
 Sp-0.064
 cuc-0.061
 Chick-0.056
 o-0.055
ie-0.054
Token and
Token ID290
Feature activation+5.11e-03

Pos logit contributions
 cuc+0.018
 Sp+0.017
 Chick+0.016
 o+0.015
ware+0.015
Neg logit contributions
urance-0.020
inger-0.020
agging-0.019
 pace-0.019
 way-0.019
Token play
Token ID711
Feature activation-8.34e-03

Pos logit contributions
urance+0.007
Greg+0.007
Jerry+0.007
Jennifer+0.007
inger+0.006
Neg logit contributions
 cuc-0.008
 Sp-0.008
ie-0.008
ware-0.008
 o-0.008
Token with
Token ID351
Feature activation+2.45e-02

Pos logit contributions
 cuc+0.011
ie+0.011
 Chick+0.010
 Sp+0.010
 o+0.010
Neg logit contributions
ags-0.014
agging-0.014
urance-0.014
Greg-0.014
Jerry-0.014
Token it
Token ID340
Feature activation-1.33e-02

Pos logit contributions
urance+0.051
Greg+0.050
Jennifer+0.050
inger+0.049
Jerry+0.049
Neg logit contributions
 cuc-0.024
 Sp-0.021
 o-0.021
bel-0.020
ware-0.019
Token that
Token ID326
Feature activation+1.29e-02

Pos logit contributions
 cuc+0.019
 Sp+0.018
 Chick+0.017
 Jos+0.017
 o+0.017
Neg logit contributions
urance-0.021
agging-0.021
inger-0.020
 way-0.020
 pace-0.020
Token way
Token ID835
Feature activation+2.21e-01

Pos logit contributions
urance+0.023
Greg+0.022
Jennifer+0.022
Jerry+0.021
inger+0.021
Neg logit contributions
 Sp-0.018
 cuc-0.017
 Chick-0.016
 o-0.016
ie-0.016
Token!"
Token ID2474
Feature activation-1.28e-02

Pos logit contributions
urance+0.458
Jennifer+0.438
Greg+0.437
inger+0.435
ady+0.429
Neg logit contributions
 Sp-0.247
 cuc-0.242
 o-0.219
 Chick-0.213
ie-0.201
Token
Token ID198
Feature activation-8.41e-03

Pos logit contributions
 cuc+0.020
 o+0.018
ware+0.018
 Sp+0.018
ie+0.018
Neg logit contributions
urance-0.021
Greg-0.020
Jerry-0.020
Jennifer-0.020
inger-0.019
Token
Token ID198
Feature activation-7.94e-03

Pos logit contributions
 cuc+0.013
 Sp+0.012
 o+0.012
ware+0.012
ie+0.012
Neg logit contributions
Greg-0.014
urance-0.013
Jennifer-0.013
Jerry-0.013
inger-0.013
TokenBob
Token ID18861
Feature activation+5.42e-02

Pos logit contributions
 Sp+0.015
 cuc+0.015
 o+0.014
ie+0.014
 Chick+0.014
Neg logit contributions
urance-0.010
Greg-0.010
Jennifer-0.010
Jerry-0.010
inger-0.010
Tokeno
Token ID78
Feature activation-1.08e-01

Pos logit contributions
urance+0.121
Greg+0.116
Jennifer+0.116
inger+0.114
ady+0.112
Neg logit contributions
 Sp-0.055
 cuc-0.050
 o-0.048
ie-0.047
 Chick-0.045
Token,
Token ID11
Feature activation+3.31e-03

Pos logit contributions
 Sp+0.039
 cuc+0.038
 o+0.036
ware+0.035
ie+0.034
Neg logit contributions
urance-0.054
Greg-0.052
Jennifer-0.052
Jerry-0.051
agging-0.048
Token "
Token ID366
Feature activation+6.35e-03

Pos logit contributions
urance+0.006
Greg+0.005
Jennifer+0.005
Jerry+0.005
inger+0.005
Neg logit contributions
 Sp-0.005
ie-0.005
 cuc-0.005
 o-0.004
 Chick-0.004
TokenMax
Token ID11518
Feature activation+8.24e-02

Pos logit contributions
urance+0.008
Greg+0.008
Jennifer+0.007
inger+0.007
ady+0.007
Neg logit contributions
 Sp-0.012
 cuc-0.012
 o-0.012
ie-0.011
 Chick-0.011
Token led
Token ID2957
Feature activation+4.90e-02

Pos logit contributions
urance+0.189
Greg+0.181
Jennifer+0.180
inger+0.178
ady+0.177
Neg logit contributions
 Sp-0.080
 cuc-0.071
 o-0.070
ie-0.069
 Chick-0.066
Token the
Token ID262
Feature activation-1.01e-03

Pos logit contributions
urance+0.104
Greg+0.099
Jennifer+0.098
inger+0.097
ady+0.096
Neg logit contributions
 Sp-0.056
 cuc-0.052
 o-0.050
ie-0.049
 Chick-0.048
Token way
Token ID835
Feature activation+2.23e-01

Pos logit contributions
 o+0.002
 Sp+0.001
 please+0.001
 if+0.001
 v+0.001
Neg logit contributions
ady-0.002
Jennifer-0.002
urance-0.002
inger-0.002
Greg-0.002
Token and
Token ID290
Feature activation+5.39e-03

Pos logit contributions
urance+0.452
Greg+0.432
Jennifer+0.430
inger+0.426
ady+0.419
Neg logit contributions
 Sp-0.268
 cuc-0.260
 o-0.239
 Chick-0.231
ie-0.229
Token we
Token ID356
Feature activation+4.63e-03

Pos logit contributions
urance+0.010
Jennifer+0.010
Greg+0.010
Jerry+0.010
 property+0.009
Neg logit contributions
 cuc-0.007
 Sp-0.006
 o-0.006
ware-0.006
ie-0.006
Token found
Token ID1043
Feature activation-6.69e-02

Pos logit contributions
urance+0.008
Greg+0.008
inger+0.008
Jennifer+0.008
ady+0.007
Neg logit contributions
 Sp-0.007
 cuc-0.006
ware-0.006
 Chick-0.006
 o-0.006
Token a
Token ID257
Feature activation-2.14e-02

Pos logit contributions
 cuc+0.081
 Sp+0.080
ie+0.076
 o+0.074
ware+0.070
Neg logit contributions
urance-0.129
Greg-0.126
Jennifer-0.125
Jerry-0.121
ady-0.119
Token switch
Token ID5078
Feature activation+8.63e-02

Pos logit contributions
 Sp+0.033
 o+0.033
 cuc+0.032
 if+0.031
 Chick+0.031
Neg logit contributions
urance-0.037
ady-0.033
Jennifer-0.033
inger-0.033
Greg-0.032
Token have
Token ID423
Feature activation+3.67e-02

Pos logit contributions
 Sp+0.037
 cuc+0.037
 o+0.034
ie+0.033
 Chick+0.033
Neg logit contributions
urance-0.046
Greg-0.044
Jennifer-0.043
ady-0.042
inger-0.042
Token an
Token ID281
Feature activation-2.55e-02

Pos logit contributions
urance+0.073
Jennifer+0.071
Greg+0.070
ady+0.069
inger+0.068
Neg logit contributions
 Sp-0.044
 cuc-0.042
ie-0.040
 o-0.038
 Chick-0.037
Token idea
Token ID2126
Feature activation-5.43e-02

Pos logit contributions
 Sp+0.037
 cuc+0.036
 o+0.035
ie+0.034
 Chick+0.033
Neg logit contributions
urance-0.045
Greg-0.043
Jennifer-0.043
inger-0.042
Jerry-0.041
Token,"
Token ID553
Feature activation-6.40e-02

Pos logit contributions
 cuc+0.061
 Sp+0.061
 Chick+0.053
 o+0.052
ie+0.050
Neg logit contributions
urance-0.112
Greg-0.105
Jennifer-0.105
ady-0.104
Jerry-0.103
Token said
Token ID531
Feature activation-3.02e-02

Pos logit contributions
 Sp+0.101
 cuc+0.098
 o+0.095
ie+0.092
 Chick+0.091
Neg logit contributions
urance-0.103
Greg-0.100
Jennifer-0.099
Jerry-0.097
inger-0.093
Token Billy
Token ID15890
Feature activation+1.38e-01

Pos logit contributions
 Sp+0.058
 cuc+0.056
 o+0.054
ie+0.054
 Chick+0.053
Neg logit contributions
urance-0.040
inger-0.037
Jennifer-0.037
Greg-0.037
ady-0.036
Token.
Token ID13
Feature activation+3.25e-03

Pos logit contributions
urance+0.289
Greg+0.275
Jennifer+0.274
inger+0.270
ady+0.268
Neg logit contributions
 Sp-0.156
 cuc-0.148
 o-0.141
ie-0.135
 Chick-0.134
Token "
Token ID366
Feature activation+6.04e-03

Pos logit contributions
urance+0.007
Jennifer+0.006
Greg+0.006
inger+0.006
ady+0.006
Neg logit contributions
 Sp-0.004
 cuc-0.004
 o-0.003
 Chick-0.003
ie-0.003
TokenLet
Token ID5756
Feature activation+2.53e-02

Pos logit contributions
urance+0.009
Greg+0.009
Jennifer+0.009
inger+0.008
ady+0.008
Neg logit contributions
 Sp-0.010
 cuc-0.010
 o-0.010
ie-0.010
 Chick-0.009
Token's
Token ID338
Feature activation+2.44e-02

Pos logit contributions
urance+0.048
Greg+0.045
Jennifer+0.045
inger+0.045
ady+0.044
Neg logit contributions
 Sp-0.034
 cuc-0.032
 o-0.032
ie-0.032
 Chick-0.030
Token save
Token ID3613
Feature activation+2.69e-03

Pos logit contributions
urance+0.042
Greg+0.040
Jennifer+0.040
inger+0.039
ady+0.039
Neg logit contributions
 Sp-0.037
 cuc-0.035
 o-0.033
ie-0.032
 Chick-0.032
Token that
Token ID326
Feature activation+1.20e-02

Pos logit contributions
 Sp+0.044
 cuc+0.041
 o+0.037
 Chick+0.036
ie+0.035
Neg logit contributions
urance-0.111
Greg-0.106
Jennifer-0.106
inger-0.105
ady-0.105
Token he
Token ID339
Feature activation-9.72e-02

Pos logit contributions
Jerry+0.026
Greg+0.026
urance+0.026
Jennifer+0.025
 way+0.025
Neg logit contributions
 cuc-0.009
ie-0.008
 Sp-0.008
berry-0.008
 o-0.007
Token took
Token ID1718
Feature activation-5.28e-03

Pos logit contributions
 Sp+0.170
 cuc+0.168
 o+0.160
ie+0.157
 Chick+0.156
Neg logit contributions
urance-0.135
Greg-0.130
Jennifer-0.127
Jerry-0.125
ady-0.125
Token with
Token ID351
Feature activation+2.64e-02

Pos logit contributions
 Sp+0.006
 cuc+0.005
 o+0.005
ie+0.005
 Chick+0.005
Neg logit contributions
urance-0.011
Jennifer-0.011
Greg-0.011
inger-0.011
ady-0.011
Token him
Token ID683
Feature activation-2.27e-02

Pos logit contributions
urance+0.058
Greg+0.056
Jennifer+0.055
Jerry+0.054
ady+0.053
Neg logit contributions
 cuc-0.026
 Sp-0.025
ie-0.022
 o-0.022
 Chick-0.022
Token everywhere
Token ID8347
Feature activation+5.44e-02

Pos logit contributions
 Sp+0.022
 cuc+0.021
 o+0.020
ie+0.019
 Chick+0.019
Neg logit contributions
urance-0.050
Greg-0.048
Jennifer-0.047
inger-0.047
ady-0.047
Token.
Token ID13
Feature activation+6.60e-03

Pos logit contributions
urance+0.101
Greg+0.096
Jennifer+0.094
agging+0.093
Jerry+0.093
Neg logit contributions
 cuc-0.067
 Sp-0.066
 o-0.061
 Chick-0.059
ie-0.059
Token One
Token ID1881
Feature activation-1.69e-02

Pos logit contributions
urance+0.012
Jennifer+0.012
Greg+0.012
inger+0.012
ady+0.011
Neg logit contributions
 Sp-0.009
 cuc-0.009
 o-0.008
ie-0.008
 if-0.008
Token day
Token ID1110
Feature activation-1.43e-02

Pos logit contributions
 Sp+0.026
 cuc+0.025
 o+0.024
 Chick+0.023
ie+0.023
Neg logit contributions
urance-0.028
Greg-0.027
Jennifer-0.027
inger-0.026
ady-0.026
Token,
Token ID11
Feature activation+5.08e-03

Pos logit contributions
 Sp+0.012
 cuc+0.011
 if+0.010
 o+0.010
ie+0.010
Neg logit contributions
urance-0.035
Jennifer-0.034
inger-0.033
Greg-0.033
ady-0.033
Token his
Token ID465
Feature activation+4.14e-03

Pos logit contributions
Greg+0.010
urance+0.009
Jerry+0.009
Jennifer+0.009
Kyle+0.009
Neg logit contributions
 Sp-0.006
 cuc-0.006
 Chick-0.006
ie-0.006
 o-0.006
Token It
Token ID632
Feature activation-1.91e-02

Pos logit contributions
urance+0.005
Greg+0.005
Jennifer+0.005
ags+0.004
inger+0.004
Neg logit contributions
ie-0.007
 cuc-0.006
 o-0.006
 Sp-0.006
 Chick-0.006
Token just
Token ID655
Feature activation-6.60e-02

Pos logit contributions
 cuc+0.030
 Sp+0.030
 Chick+0.027
ie+0.027
 cel+0.026
Neg logit contributions
urance-0.027
Greg-0.026
inger-0.026
Jennifer-0.026
 fore-0.026
Token moved
Token ID3888
Feature activation-1.14e-02

Pos logit contributions
 Sp+0.092
 cuc+0.088
 o+0.084
ie+0.082
 Chick+0.081
Neg logit contributions
urance-0.122
Greg-0.116
Jennifer-0.115
inger-0.113
ady-0.112
Token slowly
Token ID6364
Feature activation+1.41e-02

Pos logit contributions
 Sp+0.014
 cuc+0.014
 o+0.013
ie+0.012
 Chick+0.012
Neg logit contributions
urance-0.022
Greg-0.021
Jennifer-0.021
inger-0.021
ady-0.021
Token and
Token ID290
Feature activation+2.84e-03

Pos logit contributions
urance+0.029
Greg+0.027
Jennifer+0.027
inger+0.027
ady+0.026
Neg logit contributions
 Sp-0.017
 cuc-0.016
 o-0.015
ie-0.015
 Chick-0.015
Token gently
Token ID15165
Feature activation+1.25e-02

Pos logit contributions
Greg+0.004
Jennifer+0.004
urance+0.004
Jerry+0.004
 fore+0.004
Neg logit contributions
ie-0.005
 cuc-0.004
 Sp-0.004
berry-0.004
ectar-0.004
Token.
Token ID13
Feature activation+3.73e-03

Pos logit contributions
urance+0.026
Greg+0.025
Jennifer+0.025
inger+0.024
ady+0.024
Neg logit contributions
 Sp-0.014
 cuc-0.013
 o-0.013
ie-0.013
 Chick-0.012
Token
Token ID198
Feature activation-6.54e-03

Pos logit contributions
urance+0.006
Jennifer+0.005
Greg+0.005
inger+0.005
Jerry+0.005
Neg logit contributions
ie-0.007
 cuc-0.006
 o-0.006
 Sp-0.006
 Chick-0.005
Token
Token ID198
Feature activation-6.53e-03

Pos logit contributions
 cuc+0.009
 Sp+0.009
 o+0.008
ie+0.008
 Chick+0.008
Neg logit contributions
urance-0.012
Greg-0.012
Jennifer-0.012
inger-0.011
Jerry-0.011
Token"
Token ID1
Feature activation+5.26e-02

Pos logit contributions
 Sp+0.009
 cuc+0.009
 o+0.008
 Chick+0.008
 if+0.008
Neg logit contributions
urance-0.013
Greg-0.011
Jennifer-0.011
inger-0.011
ady-0.011
TokenMaybe
Token ID13300
Feature activation-6.17e-02

Pos logit contributions
urance+0.083
Greg+0.077
Jennifer+0.077
inger+0.076
ady+0.074
Neg logit contributions
 Sp-0.088
 cuc-0.085
 o-0.082
ie-0.080
 Chick-0.079
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
urance+0.322
Greg+0.309
Jennifer+0.308
inger+0.302
ady+0.298
Neg logit contributions
 Sp-0.157
 cuc-0.147
 o-0.142
ie-0.137
 Chick-0.132
Token Mary
Token ID5335
Feature activation+7.57e-02

Pos logit contributions
urance+0.023
Greg+0.022
inger+0.022
Jennifer+0.022
agging+0.022
Neg logit contributions
 Sp-0.013
 o-0.011
 cuc-0.011
 if-0.011
 Chick-0.011
Token quickly
Token ID2952
Feature activation+5.73e-02

Pos logit contributions
urance+0.128
Jennifer+0.123
Greg+0.122
ady+0.121
inger+0.118
Neg logit contributions
 cuc-0.112
 Sp-0.111
 o-0.099
 Chick-0.097
ware-0.096
Token ran
Token ID4966
Feature activation+6.02e-02

Pos logit contributions
urance+0.102
Greg+0.097
Jennifer+0.096
inger+0.095
Jerry+0.092
Neg logit contributions
 Sp-0.085
 o-0.083
 cuc-0.082
ie-0.079
 Chick-0.077
Token to
Token ID284
Feature activation-2.20e-02

Pos logit contributions
urance+0.145
ady+0.135
Jennifer+0.134
inger+0.133
Greg+0.132
Neg logit contributions
 Sp-0.058
 if-0.052
 o-0.050
ie-0.047
 he-0.045
Token the
Token ID262
Feature activation-3.86e-03

Pos logit contributions
 Sp+0.020
 o+0.018
 if+0.018
 please+0.016
 M+0.015
Neg logit contributions
urance-0.053
ady-0.052
inger-0.052
Greg-0.051
agging-0.051
Token window
Token ID4324
Feature activation+6.37e-02

Pos logit contributions
 Sp+0.006
 cuc+0.006
 o+0.006
ie+0.005
 Chick+0.005
Neg logit contributions
urance-0.006
Greg-0.006
inger-0.006
ady-0.006
Jennifer-0.006
Token to
Token ID284
Feature activation-2.22e-02

Pos logit contributions
urance+0.120
Jennifer+0.112
ady+0.111
inger+0.110
Greg+0.108
Neg logit contributions
 cuc-0.087
 Sp-0.086
 Chick-0.077
 o-0.075
ie-0.072
Token see
Token ID766
Feature activation-4.52e-02

Pos logit contributions
 Sp+0.030
 o+0.028
 cuc+0.027
ie+0.026
 if+0.026
Neg logit contributions
urance-0.043
Greg-0.041
inger-0.041
ady-0.040
Jennifer-0.040
Token what
Token ID644
Feature activation+7.96e-02

Pos logit contributions
 cuc+0.062
 Sp+0.059
ie+0.054
 o+0.053
ware+0.053
Neg logit contributions
Greg-0.080
urance-0.080
Jennifer-0.079
ady-0.076
Jerry-0.074
Token had
Token ID550
Feature activation+1.02e-01

Pos logit contributions
urance+0.153
Jennifer+0.153
Greg+0.151
Jerry+0.147
inger+0.144
Neg logit contributions
 cuc-0.099
 Sp-0.095
 o-0.088
ie-0.085
 Chick-0.084
TokenThe
Token ID464
Feature activation+2.28e-02

Pos logit contributions
 Sp+0.009
 cuc+0.009
 o+0.009
ie+0.009
 if+0.009
Neg logit contributions
urance-0.008
inger-0.007
Greg-0.007
ady-0.007
agging-0.007
Token dream
Token ID4320
Feature activation-1.81e-01

Pos logit contributions
urance+0.036
Jennifer+0.034
Greg+0.034
ady+0.033
inger+0.033
Neg logit contributions
 Sp-0.037
 cuc-0.036
 Chick-0.034
ie-0.034
 o-0.033
Tokenmaker
Token ID10297
Feature activation-7.96e-02

Pos logit contributions
 Sp+0.226
 cuc+0.220
 o+0.202
 Chick+0.200
ie+0.191
Neg logit contributions
urance-0.362
Jennifer-0.344
Greg-0.341
inger-0.339
ady-0.329
Token smiled
Token ID13541
Feature activation+4.77e-02

Pos logit contributions
 Sp+0.110
 cuc+0.106
 Chick+0.098
 o+0.098
ie+0.094
Neg logit contributions
urance-0.146
Jennifer-0.139
Greg-0.138
inger-0.134
ady-0.134
Token,
Token ID11
Feature activation+3.40e-03

Pos logit contributions
urance+0.080
ady+0.076
Jennifer+0.075
agging+0.074
Greg+0.074
Neg logit contributions
 Sp-0.073
 cuc-0.069
 Chick-0.065
 o-0.063
ware-0.062
Token and
Token ID290
Feature activation+5.20e-03

Pos logit contributions
urance+0.007
Jennifer+0.006
Greg+0.006
Jerry+0.006
inger+0.006
Neg logit contributions
 Sp-0.004
 cuc-0.004
ie-0.004
 o-0.004
 Chick-0.004
Token began
Token ID2540
Feature activation-6.95e-04

Pos logit contributions
urance+0.009
Jennifer+0.009
Greg+0.008
Jerry+0.008
inger+0.008
Neg logit contributions
 Sp-0.008
 cuc-0.008
 o-0.007
ie-0.007
 Chick-0.007
Token telling
Token ID5149
Feature activation+7.05e-02

Pos logit contributions
 Sp+0.001
 cuc+0.001
ie+0.001
 o+0.001
 Chick+0.001
Neg logit contributions
urance-0.001
Greg-0.001
Jennifer-0.001
ady-0.001
inger-0.001
Token Amy
Token ID14235
Feature activation+7.17e-02

Pos logit contributions
urance+0.132
Greg+0.125
Jennifer+0.125
inger+0.122
ady+0.120
Neg logit contributions
 Sp-0.097
 cuc-0.092
 o-0.088
ie-0.086
 Chick-0.085
Token all
Token ID477
Feature activation+6.08e-03

Pos logit contributions
urance+0.162
Greg+0.155
Jennifer+0.154
inger+0.152
ady+0.151
Neg logit contributions
 Sp-0.070
 cuc-0.065
 o-0.062
ie-0.059
 Chick-0.058
Token the
Token ID262
Feature activation-2.86e-03

Pos logit contributions
urance+0.014
Greg+0.013
inger+0.013
Jennifer+0.013
Jerry+0.013
Neg logit contributions
 Sp-0.006
 cuc-0.006
 o-0.006
ie-0.005
 if-0.005
Token for
Token ID329
Feature activation-1.05e-01

Pos logit contributions
agging+0.123
ags+0.117
ady+0.115
urance+0.115
 supposed+0.109
Neg logit contributions
 Sp-0.067
 cuc-0.063
ware-0.062
 Chick-0.062
light-0.057
Token an
Token ID281
Feature activation-2.70e-02

Pos logit contributions
 Sp+0.127
 cuc+0.122
 o+0.114
ie+0.114
 Chick+0.111
Neg logit contributions
urance-0.209
Greg-0.199
Jennifer-0.198
inger-0.194
ady-0.193
Token adventure
Token ID8855
Feature activation+9.30e-03

Pos logit contributions
 Sp+0.046
 cuc+0.043
 Chick+0.041
ie+0.039
 o+0.039
Neg logit contributions
urance-0.042
ady-0.038
Greg-0.038
Jennifer-0.038
inger-0.037
Token.
Token ID13
Feature activation+8.80e-03

Pos logit contributions
urance+0.021
Greg+0.020
Jennifer+0.020
inger+0.020
ady+0.019
Neg logit contributions
 Sp-0.009
 cuc-0.009
 o-0.008
ie-0.008
 Chick-0.008
Token One
Token ID1881
Feature activation-1.65e-02

Pos logit contributions
urance+0.018
Greg+0.017
Jennifer+0.017
inger+0.017
ady+0.016
Neg logit contributions
 Sp-0.011
 cuc-0.010
 o-0.010
ie-0.009
 if-0.009
Token day
Token ID1110
Feature activation-1.51e-02

Pos logit contributions
 Sp+0.025
 cuc+0.025
 o+0.023
 Chick+0.023
ie+0.023
Neg logit contributions
urance-0.028
Greg-0.026
Jennifer-0.026
inger-0.026
ady-0.026
Token,
Token ID11
Feature activation+4.60e-03

Pos logit contributions
 Sp+0.014
 cuc+0.012
 o+0.012
ie+0.012
 if+0.012
Neg logit contributions
urance-0.036
Jennifer-0.034
inger-0.034
Greg-0.034
ady-0.033
Token Sam
Token ID3409
Feature activation+3.16e-02

Pos logit contributions
Greg+0.008
Jerry+0.007
urance+0.007
Jennifer+0.007
Kyle+0.007
Neg logit contributions
 cuc-0.007
 o-0.007
ie-0.006
 Chick-0.006
 Sp-0.006
Token went
Token ID1816
Feature activation-4.61e-02

Pos logit contributions
urance+0.058
Jennifer+0.056
Greg+0.056
inger+0.055
ady+0.055
Neg logit contributions
 Sp-0.043
 cuc-0.041
 o-0.038
 Chick-0.037
ie-0.037
Token for
Token ID329
Feature activation-1.05e-01

Pos logit contributions
 Sp+0.044
 cuc+0.041
 o+0.037
ie+0.037
 Chick+0.037
Neg logit contributions
urance-0.105
Greg-0.101
Jennifer-0.100
inger-0.099
ady-0.098
Token a
Token ID257
Feature activation-1.98e-02

Pos logit contributions
 Sp+0.107
 cuc+0.095
 o+0.095
 Chick+0.089
ie+0.087
Neg logit contributions
urance-0.236
Greg-0.225
Jennifer-0.225
inger-0.223
ady-0.220
Token but
Token ID475
Feature activation+7.57e-03

Pos logit contributions
urance+0.011
Jennifer+0.011
Jerry+0.011
Greg+0.010
lynn+0.010
Neg logit contributions
 Sp-0.008
 cuc-0.008
ie-0.008
bel-0.008
ware-0.008
Token you
Token ID345
Feature activation+1.08e-02

Pos logit contributions
urance+0.012
Jerry+0.012
Greg+0.012
 property+0.012
 way+0.012
Neg logit contributions
 Sp-0.010
 cuc-0.010
 Chick-0.009
bel-0.009
ie-0.009
Token must
Token ID1276
Feature activation-1.73e-02

Pos logit contributions
urance+0.016
inger+0.015
Greg+0.015
Jennifer+0.015
Jerry+0.014
Neg logit contributions
 Sp-0.018
 cuc-0.018
 o-0.016
ie-0.016
 Chick-0.016
Token do
Token ID466
Feature activation-4.38e-02

Pos logit contributions
 o+0.024
 please+0.022
 if+0.022
 cuc+0.022
 Sp+0.022
Neg logit contributions
Greg-0.032
urance-0.031
Jennifer-0.031
ady-0.030
Jerry-0.029
Token something
Token ID1223
Feature activation-4.05e-02

Pos logit contributions
 Sp+0.049
 cuc+0.044
 Chick+0.044
ie+0.044
bel+0.040
Neg logit contributions
urance-0.086
Greg-0.083
Jennifer-0.083
inger-0.082
Jerry-0.081
Token for
Token ID329
Feature activation-1.03e-01

Pos logit contributions
 Sp+0.062
 Chick+0.058
 cuc+0.056
ie+0.054
 fro+0.053
Neg logit contributions
urance-0.062
 supposed-0.061
Greg-0.059
 property-0.059
inger-0.058
Token me
Token ID502
Feature activation-5.15e-03

Pos logit contributions
 Sp+0.142
 cuc+0.135
 o+0.129
ie+0.126
 Chick+0.125
Neg logit contributions
urance-0.192
Greg-0.181
Jennifer-0.180
inger-0.178
ady-0.175
Token first
Token ID717
Feature activation+6.64e-02

Pos logit contributions
 Sp+0.006
 cuc+0.006
 Chick+0.005
 o+0.005
ie+0.005
Neg logit contributions
urance-0.011
inger-0.010
Jennifer-0.010
ady-0.010
Greg-0.010
Token".
Token ID1911
Feature activation-4.63e-03

Pos logit contributions
urance+0.127
inger+0.118
Jennifer+0.117
ady+0.116
Greg+0.116
Neg logit contributions
 Sp-0.091
 cuc-0.085
 Chick-0.081
 o-0.079
ie-0.075
Token The
Token ID383
Feature activation+1.14e-02

Pos logit contributions
 Sp+0.006
 cuc+0.005
 Chick+0.005
 o+0.005
ie+0.005
Neg logit contributions
urance-0.009
inger-0.009
Jennifer-0.009
ady-0.009
Greg-0.009
Token k
Token ID479
Feature activation+4.39e-02

Pos logit contributions
urance+0.020
Jennifer+0.018
Greg+0.018
inger+0.018
ady+0.018
Neg logit contributions
 Sp-0.018
 cuc-0.017
 o-0.017
ie-0.016
 Chick-0.016
Token to
Token ID284
Feature activation-2.13e-02

Pos logit contributions
 Sp+0.085
 cuc+0.083
 o+0.075
ie+0.074
 Chick+0.073
Neg logit contributions
urance-0.178
Greg-0.173
Jennifer-0.172
inger-0.169
Jerry-0.168
Token help
Token ID1037
Feature activation-2.69e-03

Pos logit contributions
 Sp+0.032
 cuc+0.031
 o+0.029
ie+0.029
 Chick+0.028
Neg logit contributions
urance-0.037
Greg-0.035
Jennifer-0.035
inger-0.034
Jerry-0.034
Token.
Token ID13
Feature activation+6.49e-03

Pos logit contributions
 cuc+0.003
 Sp+0.003
 o+0.003
ware+0.003
 Chick+0.003
Neg logit contributions
urance-0.005
Greg-0.005
inger-0.005
Jennifer-0.005
Jerry-0.005
Token
Token ID198
Feature activation-1.19e-04

Pos logit contributions
urance+0.011
Greg+0.010
Jennifer+0.010
inger+0.010
Jerry+0.010
Neg logit contributions
 Sp-0.010
 cuc-0.010
ie-0.010
 o-0.010
 Chick-0.009
Token
Token ID198
Feature activation-3.76e-05

Pos logit contributions
 Sp+0.000
 cuc+0.000
 o+0.000
ie+0.000
 Chick+0.000
Neg logit contributions
urance-0.000
Greg-0.000
Jennifer-0.000
inger-0.000
ady-0.000
TokenHe
Token ID1544
Feature activation-1.69e-02

Pos logit contributions
 Sp+0.000
 cuc+0.000
 o+0.000
 Chick+0.000
 if+0.000
Neg logit contributions
urance-0.000
Jennifer-0.000
inger-0.000
Greg-0.000
ady-0.000
Token said
Token ID531
Feature activation-2.92e-02

Pos logit contributions
 Sp+0.026
 cuc+0.025
ie+0.024
 o+0.024
 Chick+0.024
Neg logit contributions
urance-0.025
Greg-0.025
inger-0.024
Jennifer-0.024
Jerry-0.024
Token,
Token ID11
Feature activation+5.28e-03

Pos logit contributions
 Sp+0.041
ie+0.040
 Chick+0.038
 cuc+0.038
 o+0.038
Neg logit contributions
urance-0.049
Greg-0.045
agging-0.043
Jerry-0.043
Jennifer-0.042
Token "
Token ID366
Feature activation+6.94e-03

Pos logit contributions
urance+0.009
Greg+0.009
Jennifer+0.009
inger+0.008
Jerry+0.008
Neg logit contributions
 Sp-0.007
ie-0.007
 cuc-0.007
 o-0.007
 Chick-0.007
TokenCome
Token ID16773
Feature activation-3.91e-02

Pos logit contributions
urance+0.012
Greg+0.011
Jennifer+0.011
inger+0.010
ady+0.010
Neg logit contributions
 Sp-0.011
 cuc-0.011
 o-0.010
ie-0.010
 Chick-0.010
Token,
Token ID11
Feature activation+5.42e-03

Pos logit contributions
 cuc+0.050
 Sp+0.049
 o+0.045
ie+0.045
ware+0.044
Neg logit contributions
urance-0.074
Greg-0.073
Jennifer-0.071
inger-0.069
ady-0.069
Token went
Token ID1816
Feature activation-4.68e-02

Pos logit contributions
urance+0.013
Greg+0.012
Jennifer+0.012
inger+0.012
ady+0.012
Neg logit contributions
 Sp-0.009
 cuc-0.009
 o-0.008
ie-0.008
 Chick-0.008
Token over
Token ID625
Feature activation+1.10e-01

Pos logit contributions
 Sp+0.037
 o+0.034
 if+0.033
ie+0.030
 cuc+0.029
Neg logit contributions
urance-0.121
ady-0.110
Jennifer-0.110
Greg-0.109
inger-0.109
Token to
Token ID284
Feature activation-2.17e-02

Pos logit contributions
urance+0.232
Greg+0.229
Jennifer+0.229
Jerry+0.221
inger+0.219
Neg logit contributions
 cuc-0.115
 Sp-0.112
 o-0.103
ie-0.101
 Chick-0.100
Token him
Token ID683
Feature activation-2.22e-02

Pos logit contributions
 cuc+0.022
 Sp+0.021
 o+0.019
ie+0.019
ware+0.018
Neg logit contributions
urance-0.047
Greg-0.046
Jennifer-0.045
Jerry-0.043
inger-0.042
Token and
Token ID290
Feature activation+5.34e-03

Pos logit contributions
 Sp+0.023
 cuc+0.021
 o+0.020
ie+0.020
 if+0.020
Neg logit contributions
urance-0.049
Greg-0.048
Jennifer-0.048
inger-0.047
ady-0.046
Token asked
Token ID1965
Feature activation-1.93e-02

Pos logit contributions
Jennifer+0.009
Greg+0.009
Jerry+0.009
urance+0.008
Susan+0.008
Neg logit contributions
 Sp-0.007
 cuc-0.007
ware-0.007
ectar-0.007
 o-0.007
Token what
Token ID644
Feature activation+7.94e-02

Pos logit contributions
 cuc+0.025
bel+0.022
ware+0.021
 flakes+0.021
 o+0.021
Neg logit contributions
Greg-0.035
urance-0.034
Jennifer-0.033
Jerry-0.032
lynn-0.031
Token was
Token ID373
Feature activation+1.96e-02

Pos logit contributions
urance+0.154
Greg+0.153
Jennifer+0.152
Jerry+0.148
inger+0.140
Neg logit contributions
 Sp-0.094
 cuc-0.093
 o-0.087
ie-0.085
 Chick-0.084
Token wrong
Token ID2642
Feature activation+1.03e-01

Pos logit contributions
urance+0.024
Jennifer+0.022
Greg+0.022
inger+0.022
ady+0.022
Neg logit contributions
 Sp-0.040
 cuc-0.038
 o-0.038
ie-0.038
 Chick-0.037
Token.
Token ID13
Feature activation+8.87e-03

Pos logit contributions
urance+0.194
Greg+0.182
Jennifer+0.178
inger+0.178
ady+0.177
Neg logit contributions
 cuc-0.136
 Sp-0.135
 o-0.123
ie-0.119
 Chick-0.119
Token The
Token ID383
Feature activation+1.18e-02

Pos logit contributions
urance+0.015
Greg+0.014
Jennifer+0.014
Jerry+0.013
 courtroom+0.012
Neg logit contributions
 cuc-0.014
ie-0.013
 o-0.013
 Sp-0.012
ectar-0.012
Token there
Token ID612
Feature activation+4.37e-02

Pos logit contributions
Greg+0.005
urance+0.005
Jennifer+0.005
ady+0.005
Jerry+0.005
Neg logit contributions
 cuc-0.003
 Sp-0.003
 o-0.003
ware-0.002
 Chick-0.002
Token was
Token ID373
Feature activation+1.83e-02

Pos logit contributions
urance+0.079
Greg+0.077
Jennifer+0.075
Jerry+0.074
inger+0.072
Neg logit contributions
 cuc-0.060
 Sp-0.056
 o-0.053
 Chick-0.052
ie-0.050
Token a
Token ID257
Feature activation-2.17e-02

Pos logit contributions
urance+0.036
Jennifer+0.035
Greg+0.034
ady+0.034
Jerry+0.034
Neg logit contributions
 Sp-0.022
 cuc-0.022
ie-0.021
 o-0.020
 Chick-0.019
Token fish
Token ID5916
Feature activation-7.87e-02

Pos logit contributions
 Sp+0.036
 cuc+0.035
ie+0.034
ware+0.033
 Chick+0.033
Neg logit contributions
Greg-0.030
Jennifer-0.030
urance-0.030
Jerry-0.029
inger-0.027
Token named
Token ID3706
Feature activation-1.76e-02

Pos logit contributions
 Sp+0.142
 cuc+0.141
ware+0.133
 Chick+0.129
 o+0.127
Neg logit contributions
ady-0.098
agging-0.091
inger-0.090
urance-0.090
Jerry-0.089
Token Nem
Token ID22547
Feature activation-4.64e-02

Pos logit contributions
ie+0.022
 Sp+0.022
 cuc+0.022
 o+0.021
ware+0.020
Neg logit contributions
urance-0.032
Greg-0.030
Jerry-0.030
Jennifer-0.030
inger-0.029
Tokeno
Token ID78
Feature activation-1.07e-01

Pos logit contributions
 Sp+0.052
 cuc+0.050
 o+0.047
 Chick+0.045
ie+0.044
Neg logit contributions
urance-0.096
Greg-0.093
inger-0.092
Jennifer-0.092
ady-0.092
Token.
Token ID13
Feature activation+6.00e-03

Pos logit contributions
 Sp+0.145
 cuc+0.142
 o+0.129
 Chick+0.127
ware+0.124
Neg logit contributions
urance-0.191
ady-0.182
Greg-0.181
inger-0.180
agging-0.176
Token Nem
Token ID22547
Feature activation-4.61e-02

Pos logit contributions
urance+0.013
Greg+0.012
Jerry+0.012
Jennifer+0.012
inger+0.011
Neg logit contributions
 Sp-0.006
 cuc-0.006
ie-0.006
 o-0.006
 Chick-0.006
Tokeno
Token ID78
Feature activation-1.08e-01

Pos logit contributions
 Sp+0.049
 cuc+0.046
 o+0.044
ie+0.042
 Chick+0.041
Neg logit contributions
urance-0.099
Greg-0.096
Jennifer-0.095
inger-0.094
ady-0.094
Token loved
Token ID6151
Feature activation+4.85e-03

Pos logit contributions
 Sp+0.169
 cuc+0.167
 o+0.157
 Chick+0.153
ie+0.152
Neg logit contributions
urance-0.170
Greg-0.163
Jennifer-0.159
ady-0.159
inger-0.157
Token boy
Token ID2933
Feature activation-5.34e-02

Pos logit contributions
 Sp+0.034
 cuc+0.033
 o+0.032
ie+0.031
 Chick+0.030
Neg logit contributions
urance-0.044
Greg-0.042
Jennifer-0.041
inger-0.041
ady-0.040
Token.
Token ID13
Feature activation-3.02e-03

Pos logit contributions
 Sp+0.058
 cuc+0.055
 o+0.052
ie+0.051
 Chick+0.049
Neg logit contributions
urance-0.115
Greg-0.110
Jennifer-0.110
inger-0.108
ady-0.107
Token They
Token ID1119
Feature activation+6.20e-02

Pos logit contributions
 Sp+0.005
 cuc+0.004
 Chick+0.004
 o+0.004
 if+0.004
Neg logit contributions
urance-0.005
inger-0.005
Jennifer-0.005
Greg-0.005
ady-0.005
Token had
Token ID550
Feature activation+9.68e-02

Pos logit contributions
urance+0.130
inger+0.125
Jennifer+0.124
Greg+0.122
ady+0.121
Neg logit contributions
 Sp-0.073
 o-0.071
 cuc-0.067
ie-0.065
 if-0.064
Token an
Token ID281
Feature activation-2.33e-02

Pos logit contributions
urance+0.226
inger+0.220
Jennifer+0.217
Greg+0.216
ady+0.214
Neg logit contributions
 Sp-0.092
 o-0.087
 if-0.081
 cuc-0.079
 please-0.073
Token idea
Token ID2126
Feature activation-5.35e-02

Pos logit contributions
 o+0.027
 cuc+0.024
ie+0.023
 Sp+0.023
 if+0.022
Neg logit contributions
Greg-0.049
urance-0.048
Jennifer-0.048
Jerry-0.046
inger-0.046
Token.
Token ID13
Feature activation-9.93e-04

Pos logit contributions
 if+0.040
 Sp+0.039
 o+0.035
ie+0.032
 later+0.032
Neg logit contributions
urance-0.136
inger-0.133
Jennifer-0.133
Greg-0.131
ady-0.129
Token They
Token ID1119
Feature activation+6.26e-02

Pos logit contributions
 Sp+0.001
 cuc+0.001
 Chick+0.001
 if+0.001
 o+0.001
Neg logit contributions
urance-0.002
inger-0.002
Jennifer-0.002
agging-0.002
Greg-0.002
Token said
Token ID531
Feature activation-3.51e-02

Pos logit contributions
urance+0.121
Greg+0.113
inger+0.113
Jennifer+0.113
ady+0.111
Neg logit contributions
 Sp-0.084
 cuc-0.080
 o-0.079
ie-0.075
 if-0.073
Token,
Token ID11
Feature activation+3.75e-03

Pos logit contributions
 Sp+0.044
 cuc+0.044
ie+0.043
 o+0.040
 Chick+0.040
Neg logit contributions
urance-0.067
Greg-0.063
Jennifer-0.062
Jerry-0.060
inger-0.059
Token "
Token ID366
Feature activation+2.79e-03

Pos logit contributions
urance+0.007
Greg+0.007
Jennifer+0.006
Jerry+0.006
inger+0.006
Neg logit contributions
 Sp-0.005
ie-0.005
 cuc-0.005
 o-0.005
 Chick-0.005
Token says
Token ID1139
Feature activation-3.68e-02

Pos logit contributions
 Sp+0.038
 cuc+0.036
 o+0.035
ie+0.033
 Chick+0.033
Neg logit contributions
urance-0.054
Greg-0.051
Jennifer-0.051
inger-0.050
ady-0.049
Token.
Token ID13
Feature activation-3.19e-03

Pos logit contributions
 Sp+0.043
 cuc+0.042
 o+0.039
ie+0.038
 Chick+0.038
Neg logit contributions
urance-0.076
Greg-0.072
Jennifer-0.071
inger-0.070
ady-0.070
Token He
Token ID679
Feature activation-2.63e-02

Pos logit contributions
ie+0.005
 cuc+0.005
 Sp+0.005
 o+0.005
 Chick+0.005
Neg logit contributions
urance-0.005
Greg-0.005
Jennifer-0.004
Jerry-0.004
ady-0.004
Token knows
Token ID4206
Feature activation+1.89e-02

Pos logit contributions
 Sp+0.047
 cuc+0.045
 o+0.044
ie+0.042
 Chick+0.042
Neg logit contributions
urance-0.039
Jennifer-0.036
Greg-0.036
inger-0.036
ady-0.035
Token the
Token ID262
Feature activation-7.53e-03

Pos logit contributions
Greg+0.033
urance+0.032
Jennifer+0.032
 appearances+0.032
Jerry+0.031
Neg logit contributions
 cuc-0.023
 Sp-0.021
ie-0.021
bel-0.021
 Chick-0.020
Token v
Token ID410
Feature activation-1.63e-01

Pos logit contributions
 Sp+0.006
 cuc+0.006
ie+0.005
 Chick+0.005
 o+0.005
Neg logit contributions
urance-0.018
inger-0.017
Greg-0.017
Jennifer-0.017
ady-0.016
Tokenase
Token ID589
Feature activation-1.60e-01

Pos logit contributions
 cuc+0.283
 Sp+0.271
 o+0.266
 Chick+0.259
 if+0.256
Neg logit contributions
inger-0.246
urance-0.237
Greg-0.226
oys-0.226
ags-0.222
Token is
Token ID318
Feature activation+3.85e-02

Pos logit contributions
 Sp+0.240
 cuc+0.240
 Chick+0.221
 o+0.212
ie+0.206
Neg logit contributions
urance-0.269
inger-0.253
Jennifer-0.251
Greg-0.250
Jerry-0.243
Token not
Token ID407
Feature activation+7.97e-02

Pos logit contributions
urance+0.068
Greg+0.064
Jennifer+0.064
inger+0.062
ady+0.061
Neg logit contributions
 Sp-0.058
 o-0.055
 cuc-0.055
 if-0.052
ie-0.052
Token a
Token ID257
Feature activation-2.53e-02

Pos logit contributions
urance+0.147
Greg+0.139
Jennifer+0.139
inger+0.139
ady+0.137
Neg logit contributions
 Sp-0.111
 cuc-0.104
 o-0.097
ie-0.097
 Chick-0.096
Token toy
Token ID13373
Feature activation+3.55e-02

Pos logit contributions
 Sp+0.041
 cuc+0.038
ie+0.037
 Chick+0.036
 o+0.036
Neg logit contributions
urance-0.040
Greg-0.038
Jennifer-0.037
inger-0.037
Jerry-0.036
Token dream
Token ID4320
Feature activation-1.80e-01

Pos logit contributions
 Sp+0.079
 cuc+0.077
 Chick+0.070
ware+0.068
ie+0.068
Neg logit contributions
urance-0.074
Jennifer-0.063
 lookout-0.062
agging-0.061
 pace-0.061
Token about
Token ID546
Feature activation+7.81e-03

Pos logit contributions
 cuc+0.284
 Sp+0.267
 Chick+0.251
 o+0.236
 fro+0.223
Neg logit contributions
urance-0.315
agging-0.296
Jennifer-0.291
inger-0.287
Greg-0.277
Token luxury
Token ID13064
Feature activation-5.27e-03

Pos logit contributions
urance+0.011
 way+0.011
 supposed+0.011
Jennifer+0.011
 appearances+0.010
Neg logit contributions
 cuc-0.012
ie-0.012
 Sp-0.011
bel-0.011
 please-0.011
Token.
Token ID13
Feature activation+6.41e-03

Pos logit contributions
 cuc+0.009
 Chick+0.009
 Sp+0.009
 please+0.008
 fro+0.007
Neg logit contributions
 street-0.007
agging-0.007
 property-0.006
 pace-0.006
 way-0.006
Token She
Token ID1375
Feature activation+7.32e-03

Pos logit contributions
urance+0.011
Jennifer+0.010
Greg+0.010
inger+0.010
Jerry+0.010
Neg logit contributions
 Sp-0.010
 cuc-0.009
 o-0.009
ie-0.009
 Chick-0.009
Token dreamed
Token ID27947
Feature activation-1.59e-01

Pos logit contributions
urance+0.010
ady+0.009
 hates+0.009
 way+0.009
 fore+0.009
Neg logit contributions
 cuc-0.012
 Sp-0.011
ectar-0.011
 please-0.011
berry-0.010
Token of
Token ID286
Feature activation-6.04e-02

Pos logit contributions
 cuc+0.234
 Sp+0.222
 Chick+0.210
 o+0.191
 fro+0.189
Neg logit contributions
agging-0.278
urance-0.274
Jennifer-0.270
Greg-0.253
 fore-0.248
Token having
Token ID1719
Feature activation+6.48e-02

Pos logit contributions
 cuc+0.092
ie+0.091
 Sp+0.090
 Chick+0.084
ware+0.081
Neg logit contributions
urance-0.092
Greg-0.086
 supposed-0.085
Jennifer-0.085
agging-0.082
Token a
Token ID257
Feature activation-1.73e-02

Pos logit contributions
urance+0.112
Jennifer+0.110
Greg+0.104
Jerry+0.102
ady+0.101
Neg logit contributions
 cuc-0.093
ie-0.085
 Sp-0.085
ware-0.082
ry-0.078
Token huge
Token ID3236
Feature activation-1.90e-02

Pos logit contributions
 Sp+0.028
 cuc+0.028
ie+0.027
 if+0.025
 Chick+0.025
Neg logit contributions
Greg-0.024
urance-0.024
Jennifer-0.024
 way-0.022
Jerry-0.022
Token house
Token ID2156
Feature activation-2.89e-02

Pos logit contributions
 Sp+0.027
 cuc+0.026
 Chick+0.024
 if+0.023
ie+0.023
Neg logit contributions
urance-0.034
Jennifer-0.031
Greg-0.031
ady-0.030
agging-0.030
Token button
Token ID4936
Feature activation+2.62e-02

Pos logit contributions
 Sp+0.004
ie+0.004
 cuc+0.004
 Chick+0.004
 if+0.004
Neg logit contributions
urance-0.004
inger-0.004
 fists-0.004
Jennifer-0.004
 way-0.004
Token.
Token ID13
Feature activation+4.09e-03

Pos logit contributions
urance+0.045
Greg+0.043
Jennifer+0.042
inger+0.041
ady+0.041
Neg logit contributions
 cuc-0.038
 Sp-0.036
 Chick-0.034
 o-0.032
ie-0.032
Token
Token ID198
Feature activation-2.41e-03

Pos logit contributions
urance+0.007
Greg+0.007
Jennifer+0.007
inger+0.007
ady+0.006
Neg logit contributions
 Sp-0.006
 cuc-0.006
ie-0.006
 o-0.006
 Chick-0.005
Token
Token ID198
Feature activation-2.07e-03

Pos logit contributions
 Sp+0.003
 cuc+0.003
 o+0.003
ie+0.003
 Chick+0.003
Neg logit contributions
urance-0.005
Greg-0.004
Jennifer-0.004
inger-0.004
ady-0.004
TokenThe
Token ID464
Feature activation+2.50e-02

Pos logit contributions
 Sp+0.003
 cuc+0.003
 o+0.003
 if+0.002
 Chick+0.002
Neg logit contributions
urance-0.004
Jennifer-0.004
Greg-0.004
inger-0.004
ady-0.004
Token tooth
Token ID16162
Feature activation-1.78e-01

Pos logit contributions
urance+0.040
Jennifer+0.038
 way+0.037
Greg+0.037
inger+0.036
Neg logit contributions
 Sp-0.036
ie-0.034
 cuc-0.034
 Chick-0.033
 if-0.032
Tokenbrush
Token ID32680
Feature activation-1.60e-01

Pos logit contributions
 Sp+0.299
 cuc+0.296
 Chick+0.280
ie+0.276
ectar+0.271
Neg logit contributions
inger-0.238
Jennifer-0.237
 fore-0.225
urance-0.224
ady-0.224
Token starts
Token ID4940
Feature activation+2.44e-02

Pos logit contributions
 Sp+0.272
 cuc+0.269
 Chick+0.248
 o+0.230
 cel+0.225
Neg logit contributions
urance-0.251
inger-0.227
ady-0.225
Jennifer-0.224
agging-0.222
Token to
Token ID284
Feature activation-2.27e-02

Pos logit contributions
urance+0.048
ady+0.045
inger+0.044
Jennifer+0.044
Greg+0.042
Neg logit contributions
 cuc-0.028
 Sp-0.027
ie-0.027
 o-0.025
 Chick-0.025
Token buzz
Token ID14713
Feature activation-9.80e-02

Pos logit contributions
 Sp+0.036
 cuc+0.035
ie+0.034
ectar+0.033
 Chick+0.032
Neg logit contributions
urance-0.034
Jennifer-0.031
Greg-0.030
inger-0.030
 lookout-0.029
Token and
Token ID290
Feature activation+3.04e-03

Pos logit contributions
 cuc+0.134
 Sp+0.127
ie+0.120
 Chick+0.119
 o+0.119
Neg logit contributions
urance-0.175
ady-0.167
inger-0.166
Greg-0.165
Jennifer-0.164
Token She
Token ID1375
Feature activation-9.69e-04

Pos logit contributions
urance+0.003
Jennifer+0.003
Greg+0.003
inger+0.002
Jerry+0.002
Neg logit contributions
ie-0.003
 cuc-0.003
 o-0.003
 Sp-0.003
 Chick-0.003
Token wanted
Token ID2227
Feature activation+2.66e-02

Pos logit contributions
 Sp+0.002
 cuc+0.002
ie+0.002
ectar+0.001
 Chick+0.001
Neg logit contributions
Jennifer-0.001
ady-0.001
Greg-0.001
urance-0.001
Jerry-0.001
Token to
Token ID284
Feature activation-2.36e-02

Pos logit contributions
urance+0.046
Jennifer+0.044
Greg+0.043
Jerry+0.042
 directions+0.042
Neg logit contributions
 cuc-0.034
ie-0.031
 Sp-0.031
bel-0.030
 Chick-0.028
Token see
Token ID766
Feature activation-4.19e-02

Pos logit contributions
 Sp+0.037
 cuc+0.036
ie+0.034
ware+0.033
 Chick+0.033
Neg logit contributions
urance-0.039
Jennifer-0.035
Greg-0.034
Jerry-0.034
inger-0.034
Token the
Token ID262
Feature activation-3.11e-03

Pos logit contributions
 cuc+0.066
bel+0.063
 Sp+0.059
ware+0.056
ie+0.055
Neg logit contributions
Greg-0.058
Jennifer-0.057
urance-0.057
 authority-0.056
ags-0.054
Token soup
Token ID17141
Feature activation-2.06e-01

Pos logit contributions
 Sp+0.004
ie+0.003
 Chick+0.003
 cuc+0.003
 if+0.003
Neg logit contributions
urance-0.006
ady-0.005
 way-0.005
Jennifer-0.005
 lookout-0.005
Token.
Token ID13
Feature activation+2.16e-03

Pos logit contributions
 Sp+0.318
 cuc+0.307
 Chick+0.296
ie+0.271
 Will+0.262
Neg logit contributions
urance-0.317
 way-0.290
inger-0.284
agging-0.278
Greg-0.273
Token
Token ID198
Feature activation-9.04e-03

Pos logit contributions
urance+0.003
Jennifer+0.003
Greg+0.003
Jerry+0.003
inger+0.003
Neg logit contributions
 cuc-0.004
ie-0.004
 o-0.004
 Sp-0.004
 Chick-0.003
Token
Token ID198
Feature activation-9.03e-03

Pos logit contributions
 Sp+0.012
 cuc+0.012
 o+0.011
ie+0.011
 Chick+0.010
Neg logit contributions
urance-0.017
Greg-0.017
Jennifer-0.017
inger-0.016
ady-0.016
Token"
Token ID1
Feature activation+5.05e-02

Pos logit contributions
 Sp+0.013
 cuc+0.012
 o+0.012
ie+0.011
 Chick+0.011
Neg logit contributions
urance-0.017
Greg-0.016
Jennifer-0.015
inger-0.015
ady-0.015
TokenNO
Token ID15285
Feature activation-5.89e-02

Pos logit contributions
urance+0.080
Greg+0.076
Jennifer+0.076
inger+0.074
ady+0.073
Neg logit contributions
 Sp-0.084
 cuc-0.081
 o-0.078
ie-0.077
 Chick-0.076
Token deaf
Token ID21815
Feature activation+5.91e-03

Pos logit contributions
 Sp+0.029
 cuc+0.028
 o+0.027
ie+0.025
 Chick+0.025
Neg logit contributions
urance-0.036
Jennifer-0.033
Greg-0.033
inger-0.033
ady-0.033
Token man
Token ID582
Feature activation+8.65e-03

Pos logit contributions
urance+0.011
Greg+0.010
Jennifer+0.010
inger+0.010
ady+0.010
Neg logit contributions
 Sp-0.008
 cuc-0.008
 o-0.008
ie-0.007
 if-0.007
Token selling
Token ID6301
Feature activation+5.41e-02

Pos logit contributions
urance+0.018
Jennifer+0.017
Greg+0.017
inger+0.017
ady+0.017
Neg logit contributions
 Sp-0.010
 cuc-0.009
 o-0.009
ie-0.009
 if-0.008
Token l
Token ID300
Feature activation-7.98e-02

Pos logit contributions
urance+0.103
Greg+0.098
Jennifer+0.098
inger+0.096
ady+0.095
Neg logit contributions
 Sp-0.072
 cuc-0.068
 o-0.065
ie-0.064
 Chick-0.063
Tokenoll
Token ID692
Feature activation-2.11e-01

Pos logit contributions
 Sp+0.082
 cuc+0.077
 o+0.074
 if+0.068
ie+0.068
Neg logit contributions
urance-0.181
Greg-0.169
Jennifer-0.168
Jerry-0.165
inger-0.163
Tokenip
Token ID541
Feature activation-1.61e-01

Pos logit contributions
 Sp+0.248
 cuc+0.230
 Chick+0.212
 o+0.210
ie+0.188
Neg logit contributions
urance-0.417
inger-0.412
Greg-0.409
ady-0.406
Jennifer-0.400
Tokenops
Token ID2840
Feature activation-4.44e-02

Pos logit contributions
 Sp+0.224
 cuc+0.213
 o+0.206
ie+0.201
 Chick+0.196
Neg logit contributions
urance-0.297
Greg-0.282
Jennifer-0.281
inger-0.275
ady-0.272
Token.
Token ID13
Feature activation+8.17e-03

Pos logit contributions
 Sp+0.048
 cuc+0.045
 o+0.043
ie+0.041
 Chick+0.040
Neg logit contributions
urance-0.096
Greg-0.092
Jennifer-0.092
inger-0.090
ady-0.089
Token Jane
Token ID12091
Feature activation+2.99e-02

Pos logit contributions
urance+0.015
Greg+0.014
Jennifer+0.014
inger+0.014
ady+0.013
Neg logit contributions
 Sp-0.012
 cuc-0.011
 o-0.011
ie-0.011
 Chick-0.010
Token thought
Token ID1807
Feature activation+1.33e-02

Pos logit contributions
urance+0.063
Greg+0.059
Jennifer+0.058
inger+0.058
ags+0.056
Neg logit contributions
 Sp-0.035
 cuc-0.033
 o-0.033
ie-0.032
 Chick-0.030
Token it
Token ID340
Feature activation-1.08e-02

Pos logit contributions
urance+0.032
Jennifer+0.031
Greg+0.031
inger+0.030
Jerry+0.030
Neg logit contributions
 Sp-0.010
 cuc-0.010
 Chick-0.008
ie-0.008
 o-0.008