TOP ACTIVATIONS
MAX = 10.142

would also ask Him to bless the people who were sick
. Mom helped John re hang the shelf and it was
Cat ie feel much bra ver . She was no longer
he is a good sol ver . He ro ars again
, feeling a little bra ver , wandered happily off into
the world outside . Where ver he went , Jack always
words slowly , " God bless Mom my , Daddy ,
. Where ver the old man went ,
Night night everyone , God bless you all ," he said
pin is magical ! Where ver you go , it will
round tum my . Where ver he went , his tail
to be heard . Where ver the wish stood , everyone
she would be even bra ver . The end . <|endoftext|>
higher , he felt bra ver and bra ver . Soon
felt a little bit bra ver . She slowly stepped closer
felt bra ver and bra ver . Soon enough , he
many new friends . Where ver he went , people admired
fears and become a bra ver boy . Tim was happy
have sweet dreams . Please bless our family and our friends
He was much bra ver than before and eventually he

MIN ACTIVATIONS
MIN = -12.502

. They had stopped qu arre ling . To celebrate ,
on , they never qu arre led again over clothes .
but today they were qu arre ling . Jenna wanted to
together . They never qu arre led again . <|endoftext|> Once
lively sound . The qu arre ling stopped and they heard
and saw two birds qu arre ling ! They were squ
the parents were also qu arre ling . The first family
friends didn 't stop qu arre ling and were getting louder
and their children , qu arre ling .
told them to stop qu arre lling and to share the
on , they never qu arre led again and played together
Lily and Jack never qu arre led again . They learned
. The bears stopped qu arre ling and asked him for
mum and understood that qu arre lling was not a good
Tom and Mia stop qu arre ling . They see the
walking by saw them qu arre ling . She was very
he found a rabbit qu arre lling with a squirrel .
she was sorry for qu arre ling . From that day
Finally , after much qu arre ling , one of the
sharing is important and qu arre lling is not good .

INTERVAL 4.481 - 10.142
CONTAINS 2.751%

, the queen decided to bake a y ummy cake for
intimidated and asked Why not ?
watching TV today , why don 't you send your friend
, " It 's Match Day ! Every Sunday , we
He asked one more time if he could keep the pants

INTERVAL -1.180 - 4.481
CONTAINS 73.280%

us . We are brave and strong . We can manage
that the man was a hunter and he was hunting for
to join them and show them his copper coins .
worry so she kept walking . She kept looking back at
Tom and Lily smiled and said , " We love

INTERVAL -6.841 - -1.180
CONTAINS 23.538%

ask the birds who flew overhead what hay was , but
When Lily got to the building , she
3 year old held on tight . He managed to support
was so fine . The end . <|endoftext|> Mary was visiting
At the park , Tim saw another little boy named Sam

INTERVAL -12.502 - -6.841
CONTAINS 0.430%

the jewel and the ball all day long . They learned
having lots of fun when all of sudden , she heard
Amy was right , running all the time wasn 't nice
a few jokes together . â Then
children in town . They all liked him very much and
Token would
Token ID561
Feature activation+2.97e+00

Pos logit contributions
aid+0.033
 nod+0.028
 yes+0.027
Of+0.027
 forgive+0.026
Neg logit contributions
 anth-0.033
amp-0.032
ines-0.030
attered-0.030
ained-0.029
Token also
Token ID635
Feature activation-2.06e+00

Pos logit contributions
aid+0.130
Of+0.104
 yes+0.101
 nod+0.100
 forgiveness+0.096
Neg logit contributions
 anth-0.154
amp-0.149
ines-0.146
attered-0.141
ained-0.135
Token ask
Token ID1265
Feature activation+5.33e+00

Pos logit contributions
 anth+0.108
amp+0.104
ines+0.103
attered+0.099
 problem+0.096
Neg logit contributions
aid-0.088
Of-0.070
 yes-0.067
 nod-0.066
 forgiveness-0.064
Token Him
Token ID10978
Feature activation+7.22e+00

Pos logit contributions
aid+0.268
Of+0.214
 nod+0.202
 yes+0.197
 forgiveness+0.190
Neg logit contributions
 anth-0.246
ines-0.231
amp-0.231
 problem-0.215
 argument-0.210
Token to
Token ID284
Feature activation+3.80e-01

Pos logit contributions
aid+0.429
Of+0.359
 nod+0.334
 forgiveness+0.332
 yes+0.332
Neg logit contributions
ines-0.253
 anth-0.252
amp-0.234
 problem-0.229
 ago-0.225
Token bless
Token ID12012
Feature activation+1.01e+01

Pos logit contributions
aid+0.017
Of+0.014
 yes+0.012
 stro+0.012
 nod+0.012
Neg logit contributions
 anth-0.018
ines-0.017
amp-0.017
 argument-0.017
 problem-0.016
Token the
Token ID262
Feature activation+5.72e-01

Pos logit contributions
aid+0.544
Of+0.451
 nod+0.402
 stro+0.373
 yes+0.367
Neg logit contributions
ines-0.392
 problem-0.377
 anth-0.375
 everywhere-0.370
 ago-0.367
Token people
Token ID661
Feature activation+2.12e+00

Pos logit contributions
aid+0.029
Of+0.023
 yes+0.023
 nod+0.022
 forgiveness+0.022
Neg logit contributions
 anth-0.026
amp-0.025
ines-0.025
attered-0.024
 problem-0.023
Token who
Token ID508
Feature activation+3.15e+00

Pos logit contributions
aid+0.117
Of+0.098
 yes+0.091
 forgiveness+0.091
 nod+0.091
Neg logit contributions
 anth-0.084
ines-0.082
amp-0.081
attered-0.075
 problem-0.074
Token were
Token ID547
Feature activation-3.40e-01

Pos logit contributions
aid+0.173
Of+0.146
 yes+0.140
 nod+0.138
 forgiveness+0.136
Neg logit contributions
 anth-0.124
ines-0.119
amp-0.119
attered-0.111
 problem-0.108
Token sick
Token ID6639
Feature activation-2.47e+00

Pos logit contributions
 anth+0.016
amp+0.015
ines+0.014
attered+0.014
ained+0.013
Neg logit contributions
aid-0.017
 yes-0.014
 nod-0.014
Of-0.013
 forgiveness-0.013
Token.
Token ID13
Feature activation+3.54e-01

Pos logit contributions
aid+0.054
 yes+0.045
Of+0.043
 nod+0.043
 forgiveness+0.041
Neg logit contributions
 anth-0.052
amp-0.048
ines-0.047
attered-0.045
ained-0.044
Token Mom
Token ID11254
Feature activation-1.86e+00

Pos logit contributions
aid+0.017
Of+0.013
 yes+0.013
 nod+0.013
 forgiveness+0.012
Neg logit contributions
 anth-0.017
amp-0.017
ines-0.016
attered-0.016
ained-0.015
Token helped
Token ID4193
Feature activation+4.06e+00

Pos logit contributions
 anth+0.071
amp+0.067
ines+0.065
attered+0.062
ained+0.060
Neg logit contributions
aid-0.107
Of-0.090
 yes-0.089
 nod-0.088
 forgiveness-0.085
Token John
Token ID1757
Feature activation-1.07e+00

Pos logit contributions
aid+0.201
Of+0.164
 yes+0.156
 nod+0.155
 forgiveness+0.150
Neg logit contributions
 anth-0.187
amp-0.182
ines-0.178
attered-0.170
 problem-0.163
Token re
Token ID302
Feature activation-1.68e+00

Pos logit contributions
ines+0.046
 anth+0.046
amp+0.045
attered+0.044
 problem+0.040
Neg logit contributions
aid-0.053
Of-0.046
 yes-0.043
 forgiveness-0.043
 nod-0.042
Tokenhang
Token ID33255
Feature activation+1.01e+01

Pos logit contributions
 anth+0.052
amp+0.047
ines+0.044
ained+0.043
 earthquake+0.043
Neg logit contributions
aid-0.112
Of-0.092
 forgiveness-0.092
 yes-0.090
 nod-0.089
Token the
Token ID262
Feature activation+6.78e-01

Pos logit contributions
aid+0.578
Of+0.491
 yes+0.470
 nod+0.470
 forgiveness+0.460
Neg logit contributions
 anth-0.383
amp-0.371
ines-0.363
attered-0.342
 problem-0.326
Token shelf
Token ID18316
Feature activation-7.64e-01

Pos logit contributions
aid+0.035
Of+0.029
 yes+0.029
 nod+0.028
 forgiveness+0.027
Neg logit contributions
 anth-0.030
amp-0.029
ines-0.028
attered-0.027
ained-0.026
Token and
Token ID290
Feature activation+5.66e-01

Pos logit contributions
 anth+0.039
amp+0.037
ines+0.036
attered+0.035
ained+0.034
Neg logit contributions
aid-0.033
 yes-0.029
 nod-0.027
Of-0.026
 forgive-0.025
Token it
Token ID340
Feature activation+2.16e-01

Pos logit contributions
aid+0.030
Of+0.025
 nod+0.024
 yes+0.023
 forgiveness+0.023
Neg logit contributions
 anth-0.023
amp-0.022
ines-0.021
attered-0.021
 problem-0.020
Token was
Token ID373
Feature activation-2.07e+00

Pos logit contributions
aid+0.012
Of+0.010
 yes+0.010
 nod+0.009
 forgiveness+0.009
Neg logit contributions
 anth-0.009
amp-0.009
ines-0.009
attered-0.008
ained-0.008
Token Cat
Token ID5181
Feature activation-3.77e-01

Pos logit contributions
 anth+0.127
ines+0.126
amp+0.125
 problem+0.117
ained+0.116
Neg logit contributions
aid-0.133
Of-0.112
 yes-0.108
 nod-0.102
 forgiveness-0.101
Tokenie
Token ID494
Feature activation+7.04e-01

Pos logit contributions
ines+0.007
amp+0.006
 anth+0.006
attered+0.005
 problem+0.005
Neg logit contributions
aid-0.028
Of-0.026
 yes-0.025
 nod-0.024
 forgiveness-0.024
Token feel
Token ID1254
Feature activation+2.36e+00

Pos logit contributions
aid+0.032
Of+0.027
 yes+0.025
 forgiveness+0.024
 sew+0.022
Neg logit contributions
ines-0.031
 anth-0.031
 problem-0.029
 argument-0.029
amp-0.028
Token much
Token ID881
Feature activation-4.63e+00

Pos logit contributions
aid+0.121
Of+0.103
 yes+0.095
 sew+0.094
 forgiveness+0.092
Neg logit contributions
 anth-0.085
 problem-0.082
 argument-0.082
 cloudy-0.081
ines-0.080
Token bra
Token ID8290
Feature activation+3.59e+00

Pos logit contributions
 problem+0.217
 argument+0.198
 mess+0.192
 flood+0.184
 cloudy+0.182
Neg logit contributions
aid-0.203
Of-0.168
Elsa-0.162
cher-0.161
 yes-0.158
Tokenver
Token ID332
Feature activation+1.00e+01

Pos logit contributions
aid+0.163
Of+0.140
 forgiveness+0.125
Yes+0.123
 nod+0.122
Neg logit contributions
amp-0.171
ines-0.166
 ago-0.162
 anth-0.162
attered-0.159
Token.
Token ID13
Feature activation-9.65e-02

Pos logit contributions
aid+0.470
Of+0.382
 yes+0.378
 nod+0.374
 forgiveness+0.360
Neg logit contributions
 anth-0.487
amp-0.470
ines-0.449
attered-0.443
ained-0.423
Token She
Token ID1375
Feature activation+5.56e-01

Pos logit contributions
 anth+0.004
amp+0.004
ines+0.004
attered+0.004
ained+0.004
Neg logit contributions
aid-0.005
 yes-0.004
 forgiveness-0.004
Of-0.004
 nod-0.004
Token was
Token ID373
Feature activation-1.89e+00

Pos logit contributions
aid+0.028
Of+0.023
 yes+0.023
 nod+0.022
 forgiveness+0.022
Neg logit contributions
 anth-0.025
amp-0.024
ines-0.024
attered-0.023
 problem-0.022
Token no
Token ID645
Feature activation-3.03e+00

Pos logit contributions
 anth+0.081
ines+0.077
amp+0.076
ained+0.072
attered+0.070
Neg logit contributions
aid-0.099
Of-0.082
 forgiveness-0.076
 nod-0.076
 yes-0.076
Token longer
Token ID2392
Feature activation-1.49e+00

Pos logit contributions
 anth+0.096
ines+0.092
 problem+0.089
amp+0.088
 argument+0.083
Neg logit contributions
aid-0.192
Of-0.166
 yes-0.160
 nod-0.154
 forgiveness-0.152
Token he
Token ID339
Feature activation-1.39e+00

Pos logit contributions
 anth+0.035
ines+0.035
amp+0.034
 problem+0.033
attered+0.031
Neg logit contributions
aid-0.041
Of-0.034
 nod-0.032
 yes-0.031
 forgiveness-0.030
Token is
Token ID318
Feature activation-1.77e+00

Pos logit contributions
 anth+0.051
ines+0.051
amp+0.051
attered+0.046
 problem+0.046
Neg logit contributions
aid-0.080
Of-0.069
 yes-0.067
 nod-0.065
 forgiveness-0.065
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
 anth+0.060
 all+0.055
 everywhere+0.055
 problem+0.054
 supposed+0.053
Neg logit contributions
aid-0.106
Of-0.089
 nod-0.086
 yes-0.079
 stro-0.078
Token good
Token ID922
Feature activation+2.80e-01

Pos logit contributions
aid+0.057
Of+0.045
 nod+0.045
 stro+0.044
 yes+0.043
Neg logit contributions
 anth-0.045
 problem-0.042
amp-0.039
ines-0.039
 argument-0.038
Token sol
Token ID1540
Feature activation-4.50e+00

Pos logit contributions
aid+0.014
Of+0.011
 yes+0.011
 nod+0.011
 forgiveness+0.010
Neg logit contributions
 anth-0.013
amp-0.012
ines-0.012
 problem-0.012
attered-0.012
Tokenver
Token ID332
Feature activation+9.93e+00

Pos logit contributions
amp+0.245
 anth+0.243
ines+0.241
attered+0.227
 ago+0.225
Neg logit contributions
aid-0.181
Of-0.146
 nod-0.135
 yes-0.130
 forgiveness-0.127
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
aid+0.505
Of+0.416
 yes+0.412
 nod+0.402
 forgiveness+0.394
Neg logit contributions
 anth-0.448
amp-0.431
ines-0.414
attered-0.403
ained-0.387
Token He
Token ID679
Feature activation+7.54e-01

Pos logit contributions
 anth+0.006
amp+0.006
ines+0.006
 ago+0.006
ained+0.006
Neg logit contributions
aid-0.009
 yes-0.007
 forgiveness-0.007
 nod-0.007
 heal-0.007
Token ro
Token ID686
Feature activation-1.99e+00

Pos logit contributions
aid+0.040
Of+0.033
 yes+0.032
 nod+0.031
 forgiveness+0.031
Neg logit contributions
 anth-0.032
ines-0.031
amp-0.031
attered-0.028
 problem-0.028
Tokenars
Token ID945
Feature activation+3.70e+00

Pos logit contributions
ines+0.089
amp+0.089
 anth+0.085
attered+0.080
ained+0.079
Neg logit contributions
aid-0.093
Of-0.080
 yes-0.080
 forgiveness-0.077
 nod-0.076
Token again
Token ID757
Feature activation+2.79e+00

Pos logit contributions
aid+0.191
Of+0.158
 yes+0.151
 nod+0.150
 forgiveness+0.147
Neg logit contributions
 anth-0.160
amp-0.155
ines-0.154
attered-0.144
 problem-0.140
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
aid+0.014
 forgiveness+0.011
Of+0.011
 yes+0.011
 forgive+0.010
Neg logit contributions
 anth-0.011
attered-0.010
ines-0.010
 cloudy-0.010
 all-0.010
Token feeling
Token ID4203
Feature activation-2.09e+00

Pos logit contributions
 anth+0.008
ines+0.008
amp+0.007
 everywhere+0.007
 all+0.007
Neg logit contributions
aid-0.011
Of-0.009
 forgiveness-0.008
 nod-0.008
 yes-0.008
Token a
Token ID257
Feature activation+9.81e-01

Pos logit contributions
 anth+0.073
 cloudy+0.072
ines+0.072
 everywhere+0.070
attered+0.070
Neg logit contributions
aid-0.111
Of-0.094
 forgiveness-0.086
 yes-0.085
 nod-0.085
Token little
Token ID1310
Feature activation+8.35e-01

Pos logit contributions
aid+0.052
Of+0.041
 forgiveness+0.040
 yes+0.039
 nod+0.039
Neg logit contributions
 anth-0.040
amp-0.037
 problem-0.037
 cloudy-0.036
ines-0.036
Token bra
Token ID8290
Feature activation+3.38e+00

Pos logit contributions
aid+0.042
Of+0.031
 forgiveness+0.031
 yes+0.029
 sew+0.029
Neg logit contributions
 problem-0.035
 anth-0.035
 cloudy-0.034
attered-0.033
 argument-0.033
Tokenver
Token ID332
Feature activation+9.88e+00

Pos logit contributions
aid+0.152
Of+0.126
 forgiveness+0.118
 nod+0.114
Yes+0.112
Neg logit contributions
ines-0.161
amp-0.159
 ago-0.157
 anth-0.148
attered-0.145
Token,
Token ID11
Feature activation-2.01e-01

Pos logit contributions
aid+0.490
 yes+0.413
Of+0.408
 nod+0.401
 forgiveness+0.385
Neg logit contributions
 anth-0.454
amp-0.436
ines-0.413
attered-0.406
ained-0.390
Token wandered
Token ID36036
Feature activation-2.06e+00

Pos logit contributions
 anth+0.008
ines+0.007
amp+0.007
 ago+0.007
attered+0.007
Neg logit contributions
aid-0.011
Of-0.009
 forgiveness-0.008
 nod-0.008
 yes-0.008
Token happily
Token ID18177
Feature activation+2.70e-01

Pos logit contributions
 anth+0.078
amp+0.076
ines+0.075
attered+0.070
ained+0.067
Neg logit contributions
aid-0.118
Of-0.099
 yes-0.097
 nod-0.096
 forgiveness-0.095
Token off
Token ID572
Feature activation+1.24e+00

Pos logit contributions
aid+0.015
Of+0.012
 yes+0.012
 nod+0.012
 forgiveness+0.011
Neg logit contributions
 anth-0.011
amp-0.011
ines-0.011
attered-0.010
 problem-0.010
Token into
Token ID656
Feature activation-2.62e+00

Pos logit contributions
aid+0.057
 yes+0.047
Of+0.046
 nod+0.046
 forgiveness+0.044
Neg logit contributions
 anth-0.060
amp-0.058
attered-0.056
ines-0.056
ained-0.052
Token the
Token ID262
Feature activation+5.82e-01

Pos logit contributions
aid+0.015
Of+0.012
 nod+0.011
 yes+0.011
 forgiveness+0.010
Neg logit contributions
 anth-0.011
ines-0.010
amp-0.010
 problem-0.009
 cloudy-0.009
Token world
Token ID995
Feature activation-1.30e+00

Pos logit contributions
aid+0.040
 yes+0.034
Of+0.034
 nod+0.033
 forgiveness+0.032
Neg logit contributions
 anth-0.017
amp-0.017
ines-0.015
 problem-0.015
 argument-0.013
Token outside
Token ID2354
Feature activation+2.34e+00

Pos logit contributions
 anth+0.052
amp+0.050
ines+0.049
attered+0.046
 problem+0.044
Neg logit contributions
aid-0.072
Of-0.061
 yes-0.059
 nod-0.058
 forgiveness-0.057
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
aid+0.109
 yes+0.091
Of+0.089
 nod+0.088
 forgiveness+0.087
Neg logit contributions
 anth-0.111
amp-0.105
ines-0.102
attered-0.099
 problem-0.095
Token Where
Token ID6350
Feature activation+3.14e+00

Pos logit contributions
 anth+0.010
amp+0.009
ines+0.009
attered+0.009
 problem+0.009
Neg logit contributions
aid-0.006
Of-0.005
 yes-0.005
 nod-0.005
 forgiveness-0.005
Tokenver
Token ID332
Feature activation+9.88e+00

Pos logit contributions
aid+0.146
Of+0.117
 yes+0.117
 nod+0.114
 forgiveness+0.110
Neg logit contributions
 anth-0.157
amp-0.150
ines-0.146
attered-0.142
ained-0.135
Token he
Token ID339
Feature activation-1.22e+00

Pos logit contributions
aid+0.388
 yes+0.312
Of+0.309
 nod+0.300
 forgiveness+0.296
Neg logit contributions
 anth-0.542
amp-0.537
ines-0.528
attered-0.504
ained-0.487
Token went
Token ID1816
Feature activation-1.44e+00

Pos logit contributions
 anth+0.049
amp+0.047
ines+0.045
attered+0.044
ained+0.041
Neg logit contributions
aid-0.068
Of-0.057
 yes-0.056
 nod-0.056
 forgiveness-0.054
Token,
Token ID11
Feature activation-1.55e-01

Pos logit contributions
 anth+0.057
amp+0.055
ines+0.054
attered+0.050
 problem+0.049
Neg logit contributions
aid-0.082
Of-0.069
 yes-0.066
 nod-0.065
 forgiveness-0.064
Token Jack
Token ID3619
Feature activation-2.02e+00

Pos logit contributions
 anth+0.008
amp+0.007
ines+0.007
attered+0.007
 problem+0.007
Neg logit contributions
aid-0.007
Of-0.005
 yes-0.005
 nod-0.005
 forgiveness-0.005
Token always
Token ID1464
Feature activation-9.17e-01

Pos logit contributions
 anth+0.086
amp+0.082
ines+0.079
attered+0.077
ained+0.073
Neg logit contributions
aid-0.106
Of-0.089
 yes-0.088
 nod-0.087
 forgive-0.082
Token words
Token ID2456
Feature activation+2.60e+00

Pos logit contributions
aid+0.034
Of+0.027
 yes+0.026
 forgiveness+0.025
 nod+0.025
Neg logit contributions
 anth-0.031
amp-0.029
ines-0.029
 problem-0.029
attered-0.028
Token slowly
Token ID6364
Feature activation+3.84e-01

Pos logit contributions
aid+0.141
Of+0.118
 yes+0.115
 nod+0.114
 forgiveness+0.111
Neg logit contributions
 anth-0.108
amp-0.103
ines-0.100
attered-0.096
 problem-0.091
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
aid+0.021
Of+0.018
 yes+0.017
 forgiveness+0.016
 nod+0.016
Neg logit contributions
 anth-0.015
ines-0.015
amp-0.015
attered-0.014
 problem-0.014
Token "
Token ID366
Feature activation-1.35e+00

Pos logit contributions
 cloudy+0.007
 anth+0.007
 ago+0.007
 raining+0.007
ines+0.006
Neg logit contributions
aid-0.013
 forgiveness-0.010
Of-0.010
 forgive-0.010
 nod-0.009
TokenGod
Token ID13482
Feature activation+7.74e-01

Pos logit contributions
amp+0.065
attered+0.061
ained+0.060
ines+0.060
 anth+0.059
Neg logit contributions
aid-0.056
 forgiveness-0.051
 nod-0.049
 yes-0.049
 forgive-0.046
Token bless
Token ID12012
Feature activation+9.86e+00

Pos logit contributions
aid+0.040
Of+0.033
 yes+0.031
 forgiveness+0.030
 nod+0.030
Neg logit contributions
 anth-0.033
ines-0.033
amp-0.032
 problem-0.031
 ago-0.030
Token Mom
Token ID11254
Feature activation-1.77e+00

Pos logit contributions
aid+0.512
Of+0.437
 nod+0.397
 yes+0.386
 stro+0.374
Neg logit contributions
 anth-0.405
ines-0.391
amp-0.388
 problem-0.379
 ago-0.363
Tokenmy
Token ID1820
Feature activation+4.19e-01

Pos logit contributions
 anth+0.057
amp+0.055
ines+0.054
attered+0.049
 problem+0.048
Neg logit contributions
aid-0.111
Of-0.096
 yes-0.092
 nod-0.091
 forgiveness-0.090
Token,
Token ID11
Feature activation-1.93e-01

Pos logit contributions
aid+0.022
Of+0.019
 yes+0.018
 nod+0.018
 forgiveness+0.018
Neg logit contributions
 anth-0.017
amp-0.017
ines-0.016
attered-0.016
 problem-0.015
Token Daddy
Token ID26926
Feature activation-5.10e-01

Pos logit contributions
 anth+0.009
amp+0.009
ines+0.009
 problem+0.008
 cloudy+0.008
Neg logit contributions
aid-0.009
Of-0.007
 nod-0.007
 forgiveness-0.006
 yes-0.006
Token,
Token ID11
Feature activation-1.89e-01

Pos logit contributions
 anth+0.019
amp+0.018
ines+0.018
attered+0.016
 problem+0.016
Neg logit contributions
aid-0.030
Of-0.025
 yes-0.024
 nod-0.024
 forgiveness-0.024
Token.
Token ID13
Feature activation+6.40e-02

Pos logit contributions
 anth+0.070
amp+0.067
ines+0.067
attered+0.062
 problem+0.060
Neg logit contributions
aid-0.092
Of-0.076
 yes-0.074
 nod-0.073
 forgiveness-0.071
Token
Token ID220
Feature activation-2.44e-01

Pos logit contributions
aid+0.003
 yes+0.002
Of+0.002
 forgiveness+0.002
 nod+0.002
Neg logit contributions
 anth-0.003
amp-0.003
ines-0.003
attered-0.003
ained-0.003
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
amp+0.012
 anth+0.012
ines+0.012
ained+0.011
attered+0.011
Neg logit contributions
aid-0.011
Of-0.009
 yes-0.009
 forgiveness-0.008
 nod-0.008
Token
Token ID198
Feature activation+1.19e+00

Pos logit contributions
aid+0.055
 forgiveness+0.044
 yes+0.042
 nod+0.042
Of+0.042
Neg logit contributions
amp-0.057
 anth-0.055
ines-0.055
>-0.050
ained-0.049
TokenWhere
Token ID8496
Feature activation+4.02e+00

Pos logit contributions
aid+0.042
Of+0.030
 yes+0.030
 nod+0.029
 forgiveness+0.028
Neg logit contributions
 anth-0.072
amp-0.070
ines-0.069
attered-0.066
ained-0.065
Tokenver
Token ID332
Feature activation+9.84e+00

Pos logit contributions
aid+0.185
 yes+0.160
 nod+0.150
Of+0.148
 forgive+0.144
Neg logit contributions
 anth-0.205
amp-0.191
attered-0.188
ines-0.185
ained-0.171
Token the
Token ID262
Feature activation+7.58e-01

Pos logit contributions
aid+0.454
Of+0.364
 yes+0.349
 nod+0.347
 forgiveness+0.335
Neg logit contributions
 anth-0.492
amp-0.473
ines-0.465
attered-0.447
 problem-0.434
Token old
Token ID1468
Feature activation-8.43e-01

Pos logit contributions
aid+0.041
Of+0.033
 yes+0.031
 nod+0.030
 forgiveness+0.030
Neg logit contributions
 anth-0.033
amp-0.031
 problem-0.030
ines-0.030
 argument-0.029
Token man
Token ID582
Feature activation-4.83e-01

Pos logit contributions
 anth+0.045
amp+0.043
 problem+0.042
ines+0.042
 argument+0.041
Neg logit contributions
aid-0.036
Of-0.027
 yes-0.025
 forgiveness-0.024
 nod-0.024
Token went
Token ID1816
Feature activation-1.43e+00

Pos logit contributions
 anth+0.022
amp+0.021
ines+0.020
attered+0.020
ained+0.019
Neg logit contributions
aid-0.024
 yes-0.020
 nod-0.020
Of-0.020
 forgive-0.019
Token,
Token ID11
Feature activation-5.12e-03

Pos logit contributions
 anth+0.065
amp+0.062
ines+0.060
attered+0.059
ained+0.055
Neg logit contributions
aid-0.070
 yes-0.059
Of-0.058
 nod-0.058
 forgiveness-0.056
TokenNight
Token ID24732
Feature activation-7.42e-01

Pos logit contributions
amp+0.025
 anth+0.025
 month+0.025
ained+0.024
 cloudy+0.024
Neg logit contributions
aid-0.026
 heal-0.022
 nod-0.021
 forgiveness-0.021
 sew-0.020
Token night
Token ID1755
Feature activation+3.58e-02

Pos logit contributions
 anth+0.031
amp+0.029
ines+0.028
attered+0.027
ained+0.026
Neg logit contributions
aid-0.040
Of-0.034
 yes-0.033
 nod-0.033
 forgiveness-0.032
Token everyone
Token ID2506
Feature activation-1.98e+00

Pos logit contributions
aid+0.002
Of+0.002
 yes+0.002
 nod+0.001
 forgiveness+0.001
Neg logit contributions
 anth-0.002
amp-0.002
ines-0.001
attered-0.001
ained-0.001
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
 anth+0.084
amp+0.080
ines+0.078
attered+0.075
ained+0.071
Neg logit contributions
aid-0.105
Of-0.087
 yes-0.086
 nod-0.085
 forgiveness-0.082
Token God
Token ID1793
Feature activation+6.40e+00

Pos logit contributions
 anth+0.006
amp+0.005
ines+0.005
 problem+0.005
attered+0.005
Neg logit contributions
aid-0.006
Of-0.005
 nod-0.005
 yes-0.005
 forgiveness-0.005
Token bless
Token ID12012
Feature activation+9.84e+00

Pos logit contributions
aid+0.374
Of+0.318
 yes+0.307
 nod+0.304
 forgiveness+0.298
Neg logit contributions
 anth-0.234
amp-0.224
ines-0.223
attered-0.207
 problem-0.202
Token you
Token ID345
Feature activation+1.49e+00

Pos logit contributions
aid+0.535
Of+0.457
 nod+0.419
 yes+0.418
 forgiveness+0.397
Neg logit contributions
 anth-0.400
ines-0.382
amp-0.379
 problem-0.363
 ago-0.346
Token all
Token ID477
Feature activation-9.54e+00

Pos logit contributions
aid+0.070
 yes+0.058
Of+0.057
 nod+0.057
 forgiveness+0.054
Neg logit contributions
 anth-0.072
amp-0.070
ines-0.067
attered-0.066
ained-0.062
Token,"
Token ID553
Feature activation+1.35e+00

Pos logit contributions
 anth+0.354
amp+0.336
ines+0.315
attered+0.311
ained+0.283
Neg logit contributions
aid-0.566
Of-0.477
 nod-0.474
 yes-0.473
 forgiveness-0.459
Token he
Token ID339
Feature activation-1.29e+00

Pos logit contributions
aid+0.071
Of+0.058
 yes+0.055
 forgive+0.054
 forgiveness+0.053
Neg logit contributions
 anth-0.055
 problem-0.048
ines-0.047
 cloudy-0.046
 argument-0.045
Token said
Token ID531
Feature activation-7.32e-01

Pos logit contributions
 anth+0.054
ines+0.051
amp+0.050
 problem+0.048
 argument+0.047
Neg logit contributions
aid-0.067
Of-0.055
 forgiveness-0.053
 nod-0.052
 yes-0.052
Token pin
Token ID6757
Feature activation-7.37e-02

Pos logit contributions
 anth+0.006
amp+0.006
ines+0.006
attered+0.005
 problem+0.005
Neg logit contributions
aid-0.008
Of-0.007
 yes-0.007
 nod-0.007
 forgiveness-0.007
Token is
Token ID318
Feature activation-1.77e+00

Pos logit contributions
 anth+0.003
amp+0.003
ines+0.003
attered+0.003
 problem+0.003
Neg logit contributions
aid-0.004
Of-0.003
 yes-0.003
 nod-0.003
 forgiveness-0.003
Token magical
Token ID10883
Feature activation-1.39e+00

Pos logit contributions
 anth+0.086
amp+0.084
ines+0.083
attered+0.080
 ago+0.073
Neg logit contributions
aid-0.082
 yes-0.070
Of-0.070
 forgiveness-0.066
 forgive-0.065
Token!
Token ID0
Feature activation-1.45e+00

Pos logit contributions
 anth+0.049
amp+0.046
ines+0.046
 problem+0.042
attered+0.042
Neg logit contributions
aid-0.085
Of-0.072
 nod-0.069
 yes-0.068
 forgiveness-0.066
Token Where
Token ID6350
Feature activation+3.18e+00

Pos logit contributions
 anth+0.059
amp+0.059
 ago+0.056
attered+0.053
ained+0.052
Neg logit contributions
aid-0.080
 yes-0.059
 forgiveness-0.059
 nod-0.058
 band-0.057
Tokenver
Token ID332
Feature activation+9.83e+00

Pos logit contributions
aid+0.156
Of+0.127
 yes+0.123
 nod+0.121
 forgiveness+0.118
Neg logit contributions
 anth-0.148
amp-0.143
ines-0.141
attered-0.134
 problem-0.130
Token you
Token ID345
Feature activation+1.58e+00

Pos logit contributions
aid+0.340
 yes+0.262
Of+0.261
 nod+0.250
 forgiveness+0.249
Neg logit contributions
 anth-0.585
amp-0.580
ines-0.573
attered-0.552
ained-0.534
Token go
Token ID467
Feature activation+2.21e+00

Pos logit contributions
aid+0.071
 nod+0.059
 yes+0.059
Of+0.057
 forgive+0.057
Neg logit contributions
 anth-0.079
amp-0.076
ines-0.074
attered-0.073
ained-0.069
Token,
Token ID11
Feature activation-1.04e-01

Pos logit contributions
aid+0.121
Of+0.101
 yes+0.099
 nod+0.098
 forgiveness+0.095
Neg logit contributions
 anth-0.091
amp-0.087
ines-0.085
attered-0.081
 problem-0.077
Token it
Token ID340
Feature activation+2.09e-01

Pos logit contributions
 anth+0.005
amp+0.005
ines+0.005
 problem+0.005
attered+0.004
Neg logit contributions
aid-0.005
Of-0.004
 yes-0.004
 nod-0.003
 forgiveness-0.003
Token will
Token ID481
Feature activation+4.97e+00

Pos logit contributions
aid+0.010
 nod+0.008
Of+0.008
 yes+0.008
 forgive+0.008
Neg logit contributions
 anth-0.010
amp-0.010
ines-0.009
attered-0.009
ained-0.008
Token round
Token ID2835
Feature activation-2.60e+00

Pos logit contributions
aid+0.059
Of+0.049
 yes+0.048
 nod+0.047
 forgiveness+0.046
Neg logit contributions
 anth-0.052
amp-0.050
ines-0.049
attered-0.047
 problem-0.045
Token tum
Token ID11814
Feature activation-5.03e+00

Pos logit contributions
 anth+0.136
amp+0.134
ines+0.131
attered+0.124
ained+0.123
Neg logit contributions
aid-0.110
 nod-0.089
 yes-0.087
Of-0.086
 forgiveness-0.081
Tokenmy
Token ID1820
Feature activation+8.35e-02

Pos logit contributions
amp+0.181
 anth+0.181
ines+0.180
attered+0.165
ained+0.164
Neg logit contributions
aid-0.292
Of-0.254
 yes-0.246
 nod-0.243
 forgiveness-0.236
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
aid+0.004
Of+0.004
 yes+0.004
 forgiveness+0.003
 nod+0.003
Neg logit contributions
 anth-0.003
ines-0.003
amp-0.003
attered-0.003
 problem-0.003
Token Where
Token ID6350
Feature activation+3.17e+00

Pos logit contributions
aid+0.012
Of+0.009
 forgiveness+0.009
 yes+0.009
 nod+0.009
Neg logit contributions
 anth-0.010
amp-0.010
attered-0.009
ines-0.009
ained-0.009
Tokenver
Token ID332
Feature activation+9.82e+00

Pos logit contributions
aid+0.165
Of+0.137
 yes+0.133
 nod+0.132
 forgiveness+0.128
Neg logit contributions
 anth-0.138
amp-0.133
ines-0.130
attered-0.124
 problem-0.119
Token he
Token ID339
Feature activation-1.02e+00

Pos logit contributions
aid+0.471
Of+0.371
 nod+0.357
 yes+0.346
 forgiveness+0.345
Neg logit contributions
 anth-0.474
amp-0.453
ines-0.443
attered-0.430
 problem-0.429
Token went
Token ID1816
Feature activation-1.32e+00

Pos logit contributions
 anth+0.040
amp+0.039
ines+0.037
attered+0.036
ained+0.034
Neg logit contributions
aid-0.057
Of-0.048
 yes-0.047
 nod-0.047
 forgiveness-0.045
Token,
Token ID11
Feature activation+6.00e-02

Pos logit contributions
 anth+0.056
amp+0.053
ines+0.052
attered+0.050
 problem+0.047
Neg logit contributions
aid-0.070
Of-0.058
 yes-0.057
 nod-0.056
 forgiveness-0.055
Token his
Token ID465
Feature activation+1.14e+00

Pos logit contributions
aid+0.003
Of+0.002
 nod+0.002
 yes+0.002
 forgiveness+0.002
Neg logit contributions
 anth-0.003
amp-0.003
ines-0.003
 problem-0.003
 cloudy-0.003
Token tail
Token ID7894
Feature activation+5.86e-01

Pos logit contributions
aid+0.054
Of+0.043
 yes+0.042
 nod+0.041
 forgiveness+0.040
Neg logit contributions
 anth-0.055
amp-0.053
ines-0.052
attered-0.050
 problem-0.048
Token to
Token ID284
Feature activation-2.93e-01

Pos logit contributions
 anth+0.118
amp+0.113
ines+0.112
 problem+0.107
attered+0.105
Neg logit contributions
aid-0.151
Of-0.125
 nod-0.119
 yes-0.117
 forgiveness-0.111
Token be
Token ID307
Feature activation+1.33e+00

Pos logit contributions
 anth+0.013
ines+0.013
amp+0.012
 problem+0.012
 argument+0.011
Neg logit contributions
aid-0.015
Of-0.012
 yes-0.011
 nod-0.011
 stro-0.011
Token heard
Token ID2982
Feature activation-3.66e-01

Pos logit contributions
aid+0.065
Of+0.055
 nod+0.051
 yes+0.050
 forgiveness+0.047
Neg logit contributions
 anth-0.061
ines-0.059
amp-0.057
 problem-0.056
 ago-0.054
Token.
Token ID13
Feature activation-2.04e-01

Pos logit contributions
 anth+0.013
amp+0.012
ines+0.012
attered+0.011
 problem+0.011
Neg logit contributions
aid-0.022
Of-0.019
 yes-0.018
 nod-0.018
 forgiveness-0.018
Token Where
Token ID6350
Feature activation+3.16e+00

Pos logit contributions
 anth+0.007
amp+0.007
 ago+0.006
ained+0.006
>+0.006
Neg logit contributions
aid-0.013
 forgiveness-0.010
Of-0.010
 yes-0.010
cher-0.010
Tokenver
Token ID332
Feature activation+9.82e+00

Pos logit contributions
aid+0.167
Of+0.140
 nod+0.133
 yes+0.132
 forgiveness+0.130
Neg logit contributions
 anth-0.132
amp-0.129
ines-0.128
attered-0.120
 problem-0.117
Token the
Token ID262
Feature activation+7.27e-01

Pos logit contributions
aid+0.452
Of+0.364
 yes+0.354
 nod+0.349
 forgiveness+0.338
Neg logit contributions
 anth-0.490
amp-0.473
ines-0.464
attered-0.446
ained-0.428
Token wish
Token ID4601
Feature activation+2.04e+00

Pos logit contributions
aid+0.044
Of+0.037
 nod+0.035
 yes+0.035
 stro+0.034
Neg logit contributions
 anth-0.025
 problem-0.024
ines-0.023
amp-0.022
 earthquake-0.022
Token stood
Token ID6204
Feature activation+1.44e+00

Pos logit contributions
aid+0.116
Of+0.098
 yes+0.095
 nod+0.094
 forgiveness+0.092
Neg logit contributions
 anth-0.078
amp-0.074
ines-0.073
attered-0.069
 problem-0.067
Token,
Token ID11
Feature activation-1.69e-01

Pos logit contributions
aid+0.079
Of+0.067
 nod+0.063
 yes+0.062
 forgiveness+0.060
Neg logit contributions
 anth-0.060
amp-0.058
ines-0.057
 problem-0.053
attered-0.053
Token everyone
Token ID2506
Feature activation-1.83e+00

Pos logit contributions
 anth+0.007
 cloudy+0.006
 problem+0.006
amp+0.006
 earthquake+0.006
Neg logit contributions
aid-0.009
Of-0.007
 nod-0.007
 forgiveness-0.007
 yes-0.007
Token she
Token ID673
Feature activation-9.31e-01

Pos logit contributions
 anth+0.009
 problem+0.009
 mess+0.008
 all+0.008
 earthquake+0.008
Neg logit contributions
aid-0.013
Of-0.010
 nod-0.010
cher-0.009
 yes-0.009
Token would
Token ID561
Feature activation+3.45e+00

Pos logit contributions
 anth+0.037
amp+0.035
ines+0.034
attered+0.032
 problem+0.031
Neg logit contributions
aid-0.052
Of-0.044
 yes-0.043
 nod-0.043
 forgiveness-0.042
Token be
Token ID307
Feature activation+1.42e+00

Pos logit contributions
aid+0.182
Of+0.151
 yes+0.139
 forgiveness+0.135
 nod+0.131
Neg logit contributions
 anth-0.148
ines-0.140
amp-0.138
 problem-0.129
 argument-0.126
Token even
Token ID772
Feature activation-8.20e+00

Pos logit contributions
aid+0.067
Of+0.060
 yes+0.057
 nod+0.057
 Something+0.054
Neg logit contributions
 anth-0.058
ained-0.054
 problem-0.053
 all-0.051
 argument-0.051
Token bra
Token ID8290
Feature activation+3.34e+00

Pos logit contributions
 anth+0.351
 problem+0.340
 cloudy+0.318
 argument+0.317
ines+0.313
Neg logit contributions
aid-0.392
Of-0.324
 yes-0.319
 nod-0.311
 forgiveness-0.294
Tokenver
Token ID332
Feature activation+9.82e+00

Pos logit contributions
aid+0.155
Of+0.124
 forgiveness+0.120
 nod+0.117
Yes+0.113
Neg logit contributions
amp-0.158
ines-0.150
 ago-0.147
 anth-0.147
attered-0.145
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
aid+0.465
Of+0.388
 yes+0.379
 forgiveness+0.370
 nod+0.366
Neg logit contributions
 anth-0.457
amp-0.451
ines-0.427
attered-0.421
ained-0.400
Token The
Token ID383
Feature activation+1.46e+00

Pos logit contributions
amp+0.008
 anth+0.008
ines+0.008
attered+0.008
ained+0.007
Neg logit contributions
aid-0.008
 yes-0.007
 nod-0.006
 forgiveness-0.006
Of-0.006
Token end
Token ID886
Feature activation-1.12e+00

Pos logit contributions
aid+0.091
Of+0.078
 yes+0.078
 nod+0.077
 forgiveness+0.075
Neg logit contributions
amp-0.052
 anth-0.050
ines-0.049
attered-0.047
ained-0.042
Token.
Token ID13
Feature activation-2.02e-01

Pos logit contributions
 anth+0.046
amp+0.044
ines+0.043
attered+0.041
 problem+0.039
Neg logit contributions
aid-0.062
Of-0.052
 yes-0.051
 nod-0.050
 forgiveness-0.048
Token<|endoftext|>
Token ID50256
Feature activation-6.40e+00

Pos logit contributions
amp+0.009
 anth+0.009
ines+0.009
ained+0.008
attered+0.008
Neg logit contributions
aid-0.009
 yes-0.008
 nod-0.008
 forgiveness-0.007
Of-0.007
Token higher
Token ID2440
Feature activation+4.60e+00

Pos logit contributions
 anth+0.044
amp+0.042
ines+0.042
attered+0.039
ained+0.037
Neg logit contributions
aid-0.068
Of-0.057
 yes-0.055
 nod-0.055
 forgiveness-0.054
Token,
Token ID11
Feature activation-8.57e-02

Pos logit contributions
aid+0.225
Of+0.185
 yes+0.182
 nod+0.180
 forgiveness+0.172
Neg logit contributions
 anth-0.213
amp-0.205
ines-0.200
attered-0.194
ained-0.185
Token he
Token ID339
Feature activation-1.29e+00

Pos logit contributions
 anth+0.004
amp+0.004
 problem+0.003
ines+0.003
 cloudy+0.003
Neg logit contributions
aid-0.004
Of-0.004
 nod-0.003
 forgiveness-0.003
 forgive-0.003
Token felt
Token ID2936
Feature activation-3.25e-01

Pos logit contributions
 anth+0.053
amp+0.051
ines+0.050
attered+0.048
ained+0.045
Neg logit contributions
aid-0.070
Of-0.059
 yes-0.058
 nod-0.057
 forgiveness-0.055
Token bra
Token ID8290
Feature activation+3.58e+00

Pos logit contributions
 anth+0.014
ines+0.013
amp+0.012
ained+0.012
 cloudy+0.012
Neg logit contributions
aid-0.017
Of-0.014
 yes-0.013
 forgiveness-0.013
 nod-0.013
Tokenver
Token ID332
Feature activation+9.81e+00

Pos logit contributions
aid+0.150
Of+0.119
 forgiveness+0.116
 nod+0.111
Yes+0.102
Neg logit contributions
amp-0.186
ines-0.183
 anth-0.179
attered-0.174
 ago-0.170
Token and
Token ID290
Feature activation+5.20e-01

Pos logit contributions
aid+0.442
 yes+0.360
Of+0.355
 nod+0.352
 forgiveness+0.332
Neg logit contributions
 anth-0.490
amp-0.472
ines-0.453
attered-0.449
ained-0.431
Token bra
Token ID8290
Feature activation+3.57e+00

Pos logit contributions
aid+0.027
Of+0.022
 forgiveness+0.021
 nod+0.021
 yes+0.021
Neg logit contributions
 anth-0.021
ines-0.020
 problem-0.020
amp-0.020
attered-0.019
Tokenver
Token ID332
Feature activation+9.81e+00

Pos logit contributions
aid+0.157
Of+0.128
 nod+0.121
 forgiveness+0.120
 yes+0.116
Neg logit contributions
 anth-0.177
amp-0.176
ines-0.175
attered-0.168
ained-0.160
Token.
Token ID13
Feature activation-2.11e-02

Pos logit contributions
aid+0.504
Of+0.417
 yes+0.410
 nod+0.405
 forgiveness+0.391
Neg logit contributions
 anth-0.434
amp-0.417
ines-0.406
attered-0.390
ained-0.373
Token Soon
Token ID15894
Feature activation+4.32e-01

Pos logit contributions
 anth+0.001
amp+0.001
ines+0.001
ained+0.001
attered+0.001
Neg logit contributions
aid-0.001
 yes-0.001
 forgiveness-0.001
Of-0.001
 nod-0.001
Token felt
Token ID2936
Feature activation-3.77e-01

Pos logit contributions
 anth+0.041
amp+0.041
ines+0.039
attered+0.038
ained+0.037
Neg logit contributions
aid-0.045
 yes-0.038
Of-0.038
 nod-0.037
 stro-0.036
Token a
Token ID257
Feature activation+9.47e-01

Pos logit contributions
 anth+0.014
ines+0.013
ained+0.013
 cloudy+0.013
attered+0.013
Neg logit contributions
aid-0.020
Of-0.017
 yes-0.016
 forgiveness-0.016
 nod-0.016
Token little
Token ID1310
Feature activation+8.36e-01

Pos logit contributions
aid+0.051
Of+0.042
 yes+0.041
 forgiveness+0.040
 nod+0.040
Neg logit contributions
 anth-0.039
amp-0.037
ines-0.036
attered-0.035
 problem-0.034
Token bit
Token ID1643
Feature activation-3.16e+00

Pos logit contributions
aid+0.048
Of+0.040
 yes+0.038
 forgiveness+0.038
 nod+0.036
Neg logit contributions
 anth-0.032
 problem-0.029
attered-0.028
amp-0.028
ines-0.028
Token bra
Token ID8290
Feature activation+3.48e+00

Pos logit contributions
 anth+0.123
ines+0.118
amp+0.113
 problem+0.111
attered+0.111
Neg logit contributions
aid-0.173
Of-0.148
 yes-0.145
 forgiveness-0.141
 nod-0.137
Tokenver
Token ID332
Feature activation+9.81e+00

Pos logit contributions
aid+0.150
Of+0.122
 forgiveness+0.117
 nod+0.109
Yes+0.106
Neg logit contributions
amp-0.176
ines-0.173
 anth-0.168
 ago-0.162
attered-0.160
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
aid+0.440
 yes+0.368
Of+0.360
 nod+0.354
 forgiveness+0.338
Neg logit contributions
 anth-0.485
amp-0.461
attered-0.440
ines-0.438
ained-0.417
Token She
Token ID1375
Feature activation+5.23e-01

Pos logit contributions
 anth+0.015
amp+0.014
attered+0.013
ines+0.013
ained+0.013
Neg logit contributions
aid-0.015
 yes-0.012
Of-0.011
 forgiveness-0.011
 nod-0.011
Token slowly
Token ID6364
Feature activation+4.05e-01

Pos logit contributions
aid+0.029
Of+0.024
 yes+0.023
 nod+0.023
 forgiveness+0.023
Neg logit contributions
 anth-0.021
amp-0.020
ines-0.020
attered-0.019
 problem-0.018
Token stepped
Token ID10764
Feature activation-1.52e+00

Pos logit contributions
aid+0.024
Of+0.021
 forgiveness+0.020
 yes+0.019
 forgive+0.019
Neg logit contributions
 anth-0.014
ines-0.012
amp-0.012
 problem-0.012
 argument-0.012
Token closer
Token ID5699
Feature activation+3.08e+00

Pos logit contributions
 anth+0.060
amp+0.057
ines+0.056
attered+0.052
 everywhere+0.051
Neg logit contributions
aid-0.083
 forgiveness-0.071
Of-0.068
 nod-0.066
 yes-0.065
Token felt
Token ID2936
Feature activation-3.25e-01

Pos logit contributions
 anth+0.053
amp+0.051
ines+0.050
attered+0.048
ained+0.045
Neg logit contributions
aid-0.070
Of-0.059
 yes-0.058
 nod-0.057
 forgiveness-0.055
Token bra
Token ID8290
Feature activation+3.58e+00

Pos logit contributions
 anth+0.014
ines+0.013
amp+0.012
ained+0.012
 cloudy+0.012
Neg logit contributions
aid-0.017
Of-0.014
 yes-0.013
 forgiveness-0.013
 nod-0.013
Tokenver
Token ID332
Feature activation+9.81e+00

Pos logit contributions
aid+0.150
Of+0.119
 forgiveness+0.116
 nod+0.111
Yes+0.102
Neg logit contributions
amp-0.186
ines-0.183
 anth-0.179
attered-0.174
 ago-0.170
Token and
Token ID290
Feature activation+5.20e-01

Pos logit contributions
aid+0.442
 yes+0.360
Of+0.355
 nod+0.352
 forgiveness+0.332
Neg logit contributions
 anth-0.490
amp-0.472
ines-0.453
attered-0.449
ained-0.431
Token bra
Token ID8290
Feature activation+3.57e+00

Pos logit contributions
aid+0.027
Of+0.022
 forgiveness+0.021
 nod+0.021
 yes+0.021
Neg logit contributions
 anth-0.021
ines-0.020
 problem-0.020
amp-0.020
attered-0.019
Tokenver
Token ID332
Feature activation+9.81e+00

Pos logit contributions
aid+0.157
Of+0.128
 nod+0.121
 forgiveness+0.120
 yes+0.116
Neg logit contributions
 anth-0.177
amp-0.176
ines-0.175
attered-0.168
ained-0.160
Token.
Token ID13
Feature activation-2.11e-02

Pos logit contributions
aid+0.504
Of+0.417
 yes+0.410
 nod+0.405
 forgiveness+0.391
Neg logit contributions
 anth-0.434
amp-0.417
ines-0.406
attered-0.390
ained-0.373
Token Soon
Token ID15894
Feature activation+4.32e-01

Pos logit contributions
 anth+0.001
amp+0.001
ines+0.001
ained+0.001
attered+0.001
Neg logit contributions
aid-0.001
 yes-0.001
 forgiveness-0.001
Of-0.001
 nod-0.001
Token enough
Token ID1576
Feature activation-2.03e+00

Pos logit contributions
aid+0.023
Of+0.019
 yes+0.019
 nod+0.019
 forgiveness+0.018
Neg logit contributions
 anth-0.018
amp-0.017
ines-0.017
attered-0.016
 problem-0.016
Token,
Token ID11
Feature activation-7.53e-02

Pos logit contributions
 anth+0.079
amp+0.073
ines+0.073
 problem+0.069
attered+0.066
Neg logit contributions
aid-0.118
Of-0.100
 yes-0.094
 nod-0.094
 forgiveness-0.092
Token he
Token ID339
Feature activation-1.24e+00

Pos logit contributions
 anth+0.003
amp+0.003
 problem+0.003
ines+0.003
 cloudy+0.003
Neg logit contributions
aid-0.004
Of-0.003
 nod-0.003
 yes-0.003
 forgiveness-0.003
Token many
Token ID867
Feature activation-3.90e+00

Pos logit contributions
 anth+0.131
amp+0.127
ines+0.125
attered+0.118
 problem+0.116
Neg logit contributions
aid-0.142
Of-0.116
 nod-0.111
 yes-0.110
 forgiveness-0.107
Token new
Token ID649
Feature activation+4.38e+00

Pos logit contributions
 anth+0.175
ines+0.170
amp+0.163
 problem+0.160
 argument+0.150
Neg logit contributions
aid-0.199
Of-0.165
 yes-0.158
 nod-0.152
 forgiveness-0.148
Token friends
Token ID2460
Feature activation+2.47e+00

Pos logit contributions
aid+0.229
Of+0.190
 yes+0.183
 nod+0.175
 forgiveness+0.174
Neg logit contributions
 anth-0.194
ines-0.185
amp-0.181
 problem-0.172
attered-0.170
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
aid+0.150
Of+0.127
 nod+0.119
 forgiveness+0.118
 yes+0.117
Neg logit contributions
ines-0.084
 anth-0.084
amp-0.081
attered-0.074
 problem-0.073
Token Where
Token ID6350
Feature activation+3.11e+00

Pos logit contributions
 anth+0.007
amp+0.007
ines+0.007
attered+0.007
 problem+0.006
Neg logit contributions
aid-0.006
Of-0.005
 yes-0.005
 nod-0.005
 forgiveness-0.004
Tokenver
Token ID332
Feature activation+9.81e+00

Pos logit contributions
aid+0.141
 yes+0.115
Of+0.111
 nod+0.110
 forgive+0.105
Neg logit contributions
 anth-0.161
amp-0.154
ines-0.149
attered-0.147
ained-0.138
Token he
Token ID339
Feature activation-1.23e+00

Pos logit contributions
aid+0.469
Of+0.377
 yes+0.362
 nod+0.361
 forgiveness+0.347
Neg logit contributions
 anth-0.476
amp-0.456
ines-0.449
attered-0.430
 problem-0.418
Token went
Token ID1816
Feature activation-1.48e+00

Pos logit contributions
 anth+0.047
amp+0.045
ines+0.044
attered+0.042
ained+0.040
Neg logit contributions
aid-0.070
Of-0.060
 yes-0.059
 nod-0.058
 forgiveness-0.056
Token,
Token ID11
Feature activation-1.96e-01

Pos logit contributions
 anth+0.063
amp+0.061
ines+0.059
attered+0.057
ained+0.054
Neg logit contributions
aid-0.078
Of-0.065
 yes-0.064
 nod-0.063
 forgiveness-0.061
Token people
Token ID661
Feature activation+1.88e+00

Pos logit contributions
 anth+0.009
amp+0.009
ines+0.008
 problem+0.008
attered+0.008
Neg logit contributions
aid-0.010
Of-0.008
 nod-0.007
 yes-0.007
 forgiveness-0.007
Token admired
Token ID29382
Feature activation+2.84e+00

Pos logit contributions
aid+0.102
Of+0.085
 yes+0.085
 nod+0.084
 forgive+0.081
Neg logit contributions
 anth-0.078
amp-0.074
ines-0.071
attered-0.069
ained-0.065
Token fears
Token ID10251
Feature activation+1.13e+00

Pos logit contributions
aid+0.055
Of+0.048
 yes+0.044
 forgiveness+0.043
 forgive+0.042
Neg logit contributions
 problem-0.033
 anth-0.032
 argument-0.031
ines-0.031
attered-0.029
Token and
Token ID290
Feature activation+3.32e-01

Pos logit contributions
aid+0.057
Of+0.047
 yes+0.046
 nod+0.045
 forgiveness+0.044
Neg logit contributions
 anth-0.050
amp-0.049
ines-0.048
attered-0.045
 problem-0.044
Token become
Token ID1716
Feature activation+2.42e+00

Pos logit contributions
aid+0.018
Of+0.015
 yes+0.014
 nod+0.013
 forgiveness+0.013
Neg logit contributions
 anth-0.013
ines-0.012
 problem-0.012
 argument-0.011
 cloudy-0.011
Token a
Token ID257
Feature activation+9.11e-01

Pos logit contributions
aid+0.125
Of+0.111
 yes+0.099
 forgiveness+0.096
 nod+0.095
Neg logit contributions
 anth-0.106
 cloudy-0.094
 problem-0.092
ained-0.091
 raining-0.088
Token bra
Token ID8290
Feature activation+3.27e+00

Pos logit contributions
aid+0.047
Of+0.038
 nod+0.036
 yes+0.036
 forgiveness+0.036
Neg logit contributions
 anth-0.040
 problem-0.038
amp-0.035
 cloudy-0.034
ines-0.034
Tokenver
Token ID332
Feature activation+9.81e+00

Pos logit contributions
aid+0.148
Of+0.125
 forgiveness+0.119
 mass+0.114
 stro+0.112
Neg logit contributions
ines-0.154
amp-0.153
 problem-0.143
 ago-0.139
 anth-0.138
Token boy
Token ID2933
Feature activation+5.92e-01

Pos logit contributions
aid+0.418
 yes+0.347
 nod+0.330
Of+0.329
 forgiveness+0.324
Neg logit contributions
 anth-0.500
amp-0.486
attered-0.453
ines-0.452
ained-0.436
Token.
Token ID13
Feature activation-3.43e-01

Pos logit contributions
aid+0.035
Of+0.031
 nod+0.028
 forgiveness+0.028
 yes+0.028
Neg logit contributions
ines-0.020
 anth-0.020
amp-0.019
 problem-0.018
 ago-0.018
Token Tim
Token ID5045
Feature activation-2.79e+00

Pos logit contributions
 anth+0.017
amp+0.016
ines+0.016
attered+0.015
 problem+0.015
Neg logit contributions
aid-0.016
Of-0.013
 nod-0.012
 yes-0.012
 forgive-0.012
Token was
Token ID373
Feature activation-2.02e+00

Pos logit contributions
 anth+0.096
ines+0.095
amp+0.091
 problem+0.084
 argument+0.081
Neg logit contributions
aid-0.164
Of-0.142
 forgiveness-0.136
 yes-0.135
 nod-0.131
Token happy
Token ID3772
Feature activation+2.70e+00

Pos logit contributions
 anth+0.079
ained+0.075
amp+0.073
ines+0.072
 cloudy+0.070
Neg logit contributions
aid-0.108
Of-0.094
 forgiveness-0.087
 nod-0.086
 yes-0.086
Token have
Token ID423
Feature activation-8.38e-01

Pos logit contributions
aid+0.017
Of+0.014
 yes+0.013
 nod+0.012
 forgiveness+0.012
Neg logit contributions
 anth-0.017
ines-0.016
 problem-0.016
 argument-0.016
amp-0.016
Token sweet
Token ID6029
Feature activation+1.24e+00

Pos logit contributions
 anth+0.036
amp+0.034
ines+0.034
 problem+0.034
 argument+0.033
Neg logit contributions
aid-0.041
Of-0.034
 nod-0.033
 yes-0.032
 forgive-0.030
Token dreams
Token ID10625
Feature activation+3.49e+00

Pos logit contributions
aid+0.066
Of+0.058
 yes+0.053
 nod+0.050
 forgive+0.050
Neg logit contributions
 anth-0.049
ines-0.049
 problem-0.048
 argument-0.045
amp-0.045
Token.
Token ID13
Feature activation-3.23e-01

Pos logit contributions
aid+0.199
Of+0.174
 nod+0.162
 forgiveness+0.162
 yes+0.158
Neg logit contributions
ines-0.124
amp-0.121
 anth-0.117
 problem-0.110
ained-0.110
Token Please
Token ID4222
Feature activation-1.28e+00

Pos logit contributions
 ago+0.011
 anth+0.011
amp+0.010
>+0.010
ained+0.010
Neg logit contributions
aid-0.018
 forgiveness-0.013
 band-0.013
 yes-0.013
cher-0.013
Token bless
Token ID12012
Feature activation+9.80e+00

Pos logit contributions
 anth+0.043
 problem+0.042
ines+0.042
amp+0.041
>+0.041
Neg logit contributions
aid-0.072
Of-0.062
 forgiveness-0.054
 stro-0.054
My-0.054
Token our
Token ID674
Feature activation+6.33e-01

Pos logit contributions
aid+0.532
Of+0.453
 nod+0.429
 yes+0.404
 stro+0.399
Neg logit contributions
ines-0.345
 problem-0.339
 anth-0.330
amp-0.330
>-0.324
Token family
Token ID1641
Feature activation-1.23e+00

Pos logit contributions
aid+0.031
Of+0.026
 yes+0.025
 nod+0.025
 forgiveness+0.025
Neg logit contributions
 anth-0.029
amp-0.028
ines-0.027
attered-0.026
ained-0.025
Token and
Token ID290
Feature activation+3.75e-01

Pos logit contributions
ines+0.044
 anth+0.043
amp+0.043
 problem+0.039
 ago+0.038
Neg logit contributions
aid-0.071
Of-0.062
 nod-0.057
 forgiveness-0.056
 yes-0.056
Token our
Token ID674
Feature activation+6.44e-01

Pos logit contributions
aid+0.019
Of+0.015
 yes+0.014
 nod+0.014
 forgiveness+0.013
Neg logit contributions
 anth-0.017
ines-0.015
 problem-0.015
 argument-0.015
amp-0.015
Token friends
Token ID2460
Feature activation+2.48e+00

Pos logit contributions
aid+0.032
Of+0.027
 yes+0.026
 nod+0.025
 forgiveness+0.024
Neg logit contributions
 anth-0.029
amp-0.028
ines-0.028
attered-0.026
 problem-0.026
Token
Token ID198
Feature activation+1.37e+00

Pos logit contributions
aid+0.061
Of+0.049
 yes+0.048
 nod+0.047
 forgiveness+0.046
Neg logit contributions
 anth-0.068
amp-0.067
ines-0.067
attered-0.061
ained-0.060
TokenHe
Token ID1544
Feature activation+1.15e+00

Pos logit contributions
aid+0.059
 nod+0.046
 yes+0.046
Of+0.046
 forgiveness+0.045
Neg logit contributions
 anth-0.071
amp-0.070
ines-0.067
attered-0.063
ained-0.062
Token was
Token ID373
Feature activation-1.92e+00

Pos logit contributions
aid+0.059
Of+0.050
 yes+0.049
 nod+0.048
 forgive+0.046
Neg logit contributions
 anth-0.050
amp-0.049
ines-0.047
attered-0.046
ained-0.044
Token much
Token ID881
Feature activation-4.53e+00

Pos logit contributions
 anth+0.087
amp+0.084
ines+0.082
attered+0.079
 problem+0.075
Neg logit contributions
aid-0.096
Of-0.079
 yes-0.078
 nod-0.077
 forgiveness-0.074
Token bra
Token ID8290
Feature activation+3.84e+00

Pos logit contributions
 anth+0.226
 problem+0.216
 argument+0.204
ines+0.202
 cloudy+0.200
Neg logit contributions
aid-0.195
Of-0.157
 yes-0.154
 nod-0.141
 hello-0.140
Tokenver
Token ID332
Feature activation+9.80e+00

Pos logit contributions
aid+0.163
Of+0.129
 forgiveness+0.123
 nod+0.119
 mass+0.111
Neg logit contributions
amp-0.200
ines-0.194
 anth-0.192
attered-0.186
 ago-0.181
Token than
Token ID621
Feature activation-4.28e+00

Pos logit contributions
aid+0.447
Of+0.371
 yes+0.370
 nod+0.356
 forgiveness+0.345
Neg logit contributions
 anth-0.473
amp-0.461
ines-0.438
attered-0.433
ained-0.415
Token before
Token ID878
Feature activation+3.35e+00

Pos logit contributions
 anth+0.166
ines+0.158
amp+0.158
 ago+0.148
 problem+0.146
Neg logit contributions
aid-0.238
Of-0.200
 nod-0.192
 yes-0.181
 forgiveness-0.179
Token and
Token ID290
Feature activation+6.29e-01

Pos logit contributions
aid+0.148
 yes+0.130
 nod+0.123
 forgiveness+0.122
Of+0.119
Neg logit contributions
 anth-0.159
amp-0.156
attered-0.154
ines-0.148
ained-0.139
Token eventually
Token ID4191
Feature activation-1.36e-01

Pos logit contributions
aid+0.029
Of+0.024
 yes+0.023
 nod+0.023
 forgiveness+0.022
Neg logit contributions
 anth-0.031
amp-0.030
ines-0.029
attered-0.028
ained-0.027
Token he
Token ID339
Feature activation-1.29e+00

Pos logit contributions
 anth+0.007
amp+0.007
ines+0.007
attered+0.007
ained+0.006
Neg logit contributions
aid-0.006
 yes-0.005
Of-0.005
 nod-0.005
 forgiveness-0.004
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
aid+0.127
Of+0.105
 yes+0.103
 nod+0.102
 forgiveness+0.100
Neg logit contributions
 anth-0.117
amp-0.114
ines-0.109
attered-0.106
ained-0.102
Token They
Token ID1119
Feature activation-4.25e-01

Pos logit contributions
aid+0.019
 yes+0.015
Of+0.015
 forgiveness+0.015
 nod+0.014
Neg logit contributions
 anth-0.016
amp-0.015
ines-0.014
attered-0.014
 ago-0.014
Token had
Token ID550
Feature activation-4.92e+00

Pos logit contributions
 anth+0.019
amp+0.019
ines+0.018
attered+0.018
ained+0.017
Neg logit contributions
aid-0.021
Of-0.018
 yes-0.018
 nod-0.017
 forgive-0.017
Token stopped
Token ID5025
Feature activation+1.91e+00

Pos logit contributions
 anth+0.225
amp+0.222
ines+0.217
attered+0.204
 problem+0.194
Neg logit contributions
aid-0.248
Of-0.202
 yes-0.200
 nod-0.196
 forgiveness-0.191
Token qu
Token ID627
Feature activation-1.20e+00

Pos logit contributions
aid+0.110
Of+0.093
 yes+0.091
 nod+0.090
 forgiveness+0.089
Neg logit contributions
 anth-0.073
amp-0.070
ines-0.068
attered-0.065
ained-0.061
Tokenarre
Token ID9624
Feature activation-1.18e+01

Pos logit contributions
 anth+0.033
 ago+0.028
ines+0.028
 problem+0.027
amp+0.025
Neg logit contributions
aid-0.082
 nod-0.068
Of-0.066
 forgive-0.066
 forgiveness-0.064
Tokenling
Token ID1359
Feature activation-1.24e+00

Pos logit contributions
 anth+0.474
amp+0.418
ines+0.382
attered+0.373
 ago+0.368
Neg logit contributions
aid-0.685
 nod-0.560
 yes-0.551
Of-0.547
 forgiveness-0.543
Token.
Token ID13
Feature activation+3.63e-01

Pos logit contributions
 anth+0.057
amp+0.055
ines+0.054
attered+0.051
 problem+0.049
Neg logit contributions
aid-0.062
Of-0.051
 yes-0.049
 nod-0.049
 forgiveness-0.047
Token To
Token ID1675
Feature activation+1.67e+00

Pos logit contributions
aid+0.019
 yes+0.014
Of+0.014
 nod+0.014
 forgiveness+0.014
Neg logit contributions
 anth-0.017
amp-0.016
ines-0.015
attered-0.015
 ago-0.015
Token celebrate
Token ID10648
Feature activation+4.58e+00

Pos logit contributions
aid+0.079
Of+0.064
 yes+0.062
 nod+0.062
 forgiveness+0.060
Neg logit contributions
 anth-0.082
amp-0.079
ines-0.078
attered-0.076
ained-0.072
Token,
Token ID11
Feature activation-3.71e-03

Pos logit contributions
aid+0.269
Of+0.223
 yes+0.210
 nod+0.210
 forgiveness+0.204
Neg logit contributions
 anth-0.176
amp-0.169
ines-0.165
 problem-0.152
attered-0.151
Token on
Token ID319
Feature activation-1.52e+00

Pos logit contributions
 anth+0.114
amp+0.110
ines+0.108
attered+0.102
 problem+0.098
Neg logit contributions
aid-0.145
Of-0.121
 yes-0.117
 nod-0.116
 forgiveness-0.113
Token,
Token ID11
Feature activation-3.43e-01

Pos logit contributions
 anth+0.071
amp+0.068
ines+0.066
attered+0.066
ained+0.062
Neg logit contributions
aid-0.072
 yes-0.061
 nod-0.059
Of-0.059
 forgiveness-0.059
Token they
Token ID484
Feature activation-2.68e+00

Pos logit contributions
 anth+0.015
amp+0.014
ines+0.014
attered+0.013
 problem+0.013
Neg logit contributions
aid-0.018
Of-0.014
 nod-0.014
 yes-0.014
 forgiveness-0.013
Token never
Token ID1239
Feature activation-2.79e+00

Pos logit contributions
 anth+0.098
amp+0.094
ines+0.091
attered+0.087
 problem+0.082
Neg logit contributions
aid-0.159
Of-0.135
 yes-0.132
 nod-0.131
 forgiveness-0.127
Token qu
Token ID627
Feature activation-1.24e+00

Pos logit contributions
 anth+0.102
amp+0.094
ines+0.093
attered+0.090
 problem+0.087
Neg logit contributions
aid-0.165
Of-0.140
 yes-0.137
 nod-0.133
 forgiveness-0.131
Tokenarre
Token ID9624
Feature activation-1.16e+01

Pos logit contributions
 anth+0.035
 ago+0.029
 problem+0.029
ines+0.029
 earthquake+0.027
Neg logit contributions
aid-0.084
Of-0.068
 nod-0.067
 forgive-0.067
 forgiveness-0.064
Tokenled
Token ID992
Feature activation+1.89e+00

Pos logit contributions
 anth+0.455
amp+0.416
ines+0.395
attered+0.368
 ago+0.354
Neg logit contributions
aid-0.686
Of-0.565
 yes-0.554
 nod-0.549
 forgiveness-0.544
Token again
Token ID757
Feature activation+2.51e+00

Pos logit contributions
aid+0.095
Of+0.078
 yes+0.076
 nod+0.075
 forgiveness+0.073
Neg logit contributions
 anth-0.086
amp-0.083
ines-0.081
attered-0.078
 problem-0.074
Token over
Token ID625
Feature activation-1.23e+00

Pos logit contributions
aid+0.134
Of+0.111
 yes+0.108
 nod+0.107
 forgiveness+0.104
Neg logit contributions
 anth-0.107
amp-0.102
ines-0.100
attered-0.095
 problem-0.091
Token clothes
Token ID8242
Feature activation+2.99e+00

Pos logit contributions
 anth+0.060
amp+0.058
ines+0.057
attered+0.055
ained+0.052
Neg logit contributions
aid-0.057
Of-0.046
 yes-0.046
 nod-0.045
 forgiveness-0.045
Token.
Token ID13
Feature activation-5.27e-02

Pos logit contributions
aid+0.155
Of+0.128
 nod+0.122
 yes+0.121
 forgiveness+0.118
Neg logit contributions
 anth-0.130
amp-0.126
ines-0.124
attered-0.118
 problem-0.113
Token but
Token ID475
Feature activation-4.04e-01

Pos logit contributions
 anth+0.004
amp+0.004
ines+0.004
attered+0.004
 problem+0.004
Neg logit contributions
aid-0.004
Of-0.003
 yes-0.003
 nod-0.003
 forgiveness-0.003
Token today
Token ID1909
Feature activation-6.54e-01

Pos logit contributions
amp+0.020
ines+0.019
 anth+0.019
attered+0.018
ooky+0.018
Neg logit contributions
aid-0.017
 yes-0.016
 nod-0.016
Of-0.016
 forgive-0.015
Token they
Token ID484
Feature activation-2.78e+00

Pos logit contributions
 anth+0.037
amp+0.036
ines+0.035
attered+0.034
icer+0.032
Neg logit contributions
aid-0.023
 yes-0.021
 forgiveness-0.020
 nod-0.019
Of-0.019
Token were
Token ID547
Feature activation-8.56e-01

Pos logit contributions
 anth+0.136
amp+0.135
ines+0.128
attered+0.121
icer+0.118
Neg logit contributions
aid-0.121
 yes-0.115
 nod-0.111
 forgive-0.110
 nodded-0.108
Token qu
Token ID627
Feature activation-1.36e+00

Pos logit contributions
amp+0.037
 anth+0.037
ines+0.035
attered+0.034
icer+0.031
Neg logit contributions
aid-0.040
 yes-0.037
 nod-0.035
 forgive-0.034
 hello-0.034
Tokenarre
Token ID9624
Feature activation-1.16e+01

Pos logit contributions
 ago+0.044
 problem+0.042
 anth+0.042
ines+0.037
 earthquake+0.036
Neg logit contributions
aid-0.080
 nod-0.064
Of-0.063
 forgive-0.062
And-0.061
Tokenling
Token ID1359
Feature activation-9.94e-01

Pos logit contributions
 anth+0.532
amp+0.468
 ago+0.430
ines+0.423
 colony+0.409
Neg logit contributions
aid-0.590
 yes-0.472
 nod-0.459
 forgiveness-0.455
 mass-0.452
Token.
Token ID13
Feature activation+3.24e-01

Pos logit contributions
 anth+0.048
amp+0.045
ines+0.044
attered+0.041
 problem+0.040
Neg logit contributions
aid-0.046
 yes-0.040
Of-0.038
 nod-0.038
 forgiveness-0.037
Token Jenna
Token ID45377
Feature activation+1.96e-02

Pos logit contributions
aid+0.011
Of+0.010
 nod+0.010
 yes+0.009
 forgive+0.009
Neg logit contributions
ines-0.018
 anth-0.018
amp-0.018
attered-0.017
 problem-0.017
Token wanted
Token ID2227
Feature activation-2.66e+00

Pos logit contributions
aid+0.001
 yes+0.001
 nod+0.001
Of+0.001
 nodded+0.001
Neg logit contributions
 anth-0.001
amp-0.001
ines-0.001
ained-0.001
attered-0.001
Token to
Token ID284
Feature activation-9.55e-02

Pos logit contributions
 anth+0.119
amp+0.114
ines+0.112
attered+0.107
 problem+0.102
Neg logit contributions
aid-0.136
Of-0.112
 yes-0.109
 nod-0.108
 forgiveness-0.105
Token together
Token ID1978
Feature activation+5.19e-01

Pos logit contributions
aid+0.255
Of+0.212
 nod+0.202
 yes+0.202
 forgiveness+0.195
Neg logit contributions
 anth-0.196
amp-0.190
ines-0.185
attered-0.175
 problem-0.170
Token.
Token ID13
Feature activation+3.00e-01

Pos logit contributions
aid+0.024
 yes+0.021
 nod+0.020
Of+0.020
 forgive+0.020
Neg logit contributions
 anth-0.025
amp-0.024
attered-0.023
ines-0.023
ained-0.021
Token They
Token ID1119
Feature activation-3.31e-01

Pos logit contributions
aid+0.012
Of+0.011
 nod+0.011
 Something+0.010
 forgive+0.010
Neg logit contributions
ines-0.015
amp-0.014
 anth-0.014
 problem-0.014
ooky-0.014
Token never
Token ID1239
Feature activation-2.59e+00

Pos logit contributions
amp+0.015
 anth+0.015
ines+0.014
attered+0.014
ays+0.014
Neg logit contributions
aid-0.017
 yes-0.014
Of-0.014
 nod-0.014
 forgive-0.014
Token qu
Token ID627
Feature activation-1.13e+00

Pos logit contributions
amp+0.107
 anth+0.106
ines+0.103
attered+0.096
ained+0.093
Neg logit contributions
aid-0.140
 nod-0.118
Of-0.118
 yes-0.115
 forgive-0.115
Tokenarre
Token ID9624
Feature activation-1.15e+01

Pos logit contributions
 anth+0.036
 ago+0.032
 problem+0.031
 cloudy+0.029
 earthquake+0.029
Neg logit contributions
aid-0.071
 nod-0.056
 forgive-0.055
Of-0.055
And-0.053
Tokenled
Token ID992
Feature activation+2.09e+00

Pos logit contributions
 anth+0.506
amp+0.450
ines+0.413
 ago+0.395
attered+0.391
Neg logit contributions
aid-0.642
 yes-0.506
Of-0.504
 nod-0.500
 forgiveness-0.497
Token again
Token ID757
Feature activation+2.69e+00

Pos logit contributions
aid+0.099
 yes+0.082
Of+0.081
 nod+0.080
 forgiveness+0.077
Neg logit contributions
 anth-0.100
amp-0.096
ines-0.094
attered-0.090
ained-0.087
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
aid+0.136
Of+0.112
 yes+0.111
 nod+0.109
 forgiveness+0.106
Neg logit contributions
 anth-0.121
amp-0.117
ines-0.113
attered-0.110
ained-0.104
Token<|endoftext|>
Token ID50256
Feature activation-6.29e+00

Pos logit contributions
aid+0.009
Of+0.008
 nod+0.008
 Something+0.007
 forgive+0.007
Neg logit contributions
ines-0.014
 anth-0.014
amp-0.014
attered-0.014
 problem-0.014
TokenOnce
Token ID7454
Feature activation-2.87e+00

Pos logit contributions
amp+0.285
 anth+0.272
ines+0.268
ained+0.251
attered+0.242
Neg logit contributions
aid-0.333
Of-0.273
 yes-0.262
 forgiveness-0.257
 nod-0.255
Token lively
Token ID29696
Feature activation-8.46e-01

Pos logit contributions
 anth+0.000
amp+0.000
ines+0.000
attered+0.000
 problem+0.000
Neg logit contributions
aid-0.000
Of-0.000
 yes-0.000
 nod-0.000
 forgiveness-0.000
Token sound
Token ID2128
Feature activation+2.57e+00

Pos logit contributions
 anth+0.043
amp+0.042
ines+0.042
attered+0.041
ained+0.039
Neg logit contributions
aid-0.036
Of-0.029
 nod-0.029
 yes-0.029
 forgiveness-0.028
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
aid+0.132
 yes+0.110
Of+0.108
 nod+0.107
 forgiveness+0.103
Neg logit contributions
 anth-0.116
amp-0.108
ines-0.105
attered-0.101
ained-0.097
Token The
Token ID383
Feature activation+1.38e+00

Pos logit contributions
aid+0.010
Of+0.008
 yes+0.008
 nod+0.008
 forgiveness+0.008
Neg logit contributions
 anth-0.009
amp-0.009
ines-0.009
attered-0.008
ained-0.008
Token qu
Token ID627
Feature activation-1.19e+00

Pos logit contributions
aid+0.066
Of+0.054
 yes+0.053
 nod+0.053
 forgiveness+0.051
Neg logit contributions
 anth-0.066
amp-0.064
ines-0.063
attered-0.060
ained-0.057
Tokenarre
Token ID9624
Feature activation-1.15e+01

Pos logit contributions
 anth+0.042
 ago+0.036
 camps+0.035
 problem+0.033
 earthquake+0.031
Neg logit contributions
aid-0.072
 nod-0.057
 band-0.056
Of-0.055
 forgive-0.055
Tokenling
Token ID1359
Feature activation-7.85e-01

Pos logit contributions
 anth+0.450
amp+0.403
ines+0.364
attered+0.355
 ago+0.347
Neg logit contributions
aid-0.683
 yes-0.562
 nod-0.560
Of-0.557
 forgiveness-0.549
Token stopped
Token ID5025
Feature activation+1.67e+00

Pos logit contributions
 anth+0.035
amp+0.033
ines+0.032
attered+0.031
ained+0.029
Neg logit contributions
aid-0.040
Of-0.033
 yes-0.033
 nod-0.033
 forgiveness-0.032
Token and
Token ID290
Feature activation+8.68e-01

Pos logit contributions
aid+0.096
Of+0.081
 yes+0.078
 nod+0.077
 forgiveness+0.075
Neg logit contributions
 anth-0.065
amp-0.062
ines-0.061
attered-0.058
 problem-0.056
Token they
Token ID484
Feature activation-2.58e+00

Pos logit contributions
aid+0.045
Of+0.037
 yes+0.036
 nod+0.035
 forgiveness+0.034
Neg logit contributions
 anth-0.038
amp-0.037
ines-0.036
attered-0.034
 problem-0.033
Token heard
Token ID2982
Feature activation-2.94e-01

Pos logit contributions
 anth+0.114
amp+0.111
ines+0.106
attered+0.102
ained+0.100
Neg logit contributions
aid-0.137
 yes-0.116
 nod-0.114
Of-0.113
 forgive-0.110
Token and
Token ID290
Feature activation+8.51e-01

Pos logit contributions
aid+0.087
 yes+0.073
Of+0.073
 nod+0.072
 forgive+0.067
Neg logit contributions
 anth-0.091
amp-0.088
ines-0.085
attered-0.082
ained-0.079
Token saw
Token ID2497
Feature activation-2.98e+00

Pos logit contributions
aid+0.058
Of+0.049
 yes+0.047
 forgiveness+0.047
 nod+0.047
Neg logit contributions
 anth-0.024
amp-0.022
ines-0.022
attered-0.020
 problem-0.019
Token two
Token ID734
Feature activation-1.90e+00

Pos logit contributions
 anth+0.149
amp+0.141
ines+0.140
attered+0.134
 problem+0.132
Neg logit contributions
aid-0.134
Of-0.109
 nod-0.104
 yes-0.103
 forgiveness-0.097
Token birds
Token ID10087
Feature activation+2.77e+00

Pos logit contributions
amp+0.092
 anth+0.090
ained+0.086
attered+0.086
ines+0.085
Neg logit contributions
aid-0.085
 yes-0.073
 nod-0.073
Of-0.068
 hello-0.068
Token qu
Token ID627
Feature activation-1.28e+00

Pos logit contributions
aid+0.141
 yes+0.120
 nod+0.119
Of+0.114
 forgive+0.112
Neg logit contributions
 anth-0.121
amp-0.116
attered-0.109
ines-0.107
ained-0.101
Tokenarre
Token ID9624
Feature activation-1.15e+01

Pos logit contributions
 problem+0.039
 anth+0.038
 ago+0.038
 earthquake+0.035
 camps+0.035
Neg logit contributions
aid-0.074
 nod-0.059
 stro-0.055
And-0.054
 forgive-0.054
Tokenling
Token ID1359
Feature activation-7.64e-01

Pos logit contributions
 anth+0.524
amp+0.468
attered+0.416
 colony+0.411
 ago+0.403
Neg logit contributions
aid-0.593
 nod-0.476
 yes-0.463
 heal-0.451
 mass-0.446
Token!
Token ID0
Feature activation-1.01e+00

Pos logit contributions
 anth+0.035
amp+0.033
ines+0.032
attered+0.031
ained+0.030
Neg logit contributions
aid-0.038
 yes-0.031
 nod-0.031
Of-0.031
 forgiveness-0.030
Token They
Token ID1119
Feature activation-7.22e-02

Pos logit contributions
 anth+0.050
amp+0.049
ines+0.048
attered+0.046
 problem+0.044
Neg logit contributions
aid-0.047
Of-0.038
 yes-0.037
 nod-0.036
 forgiveness-0.035
Token were
Token ID547
Feature activation-8.43e-01

Pos logit contributions
amp+0.003
 anth+0.003
ines+0.003
attered+0.003
ained+0.003
Neg logit contributions
aid-0.004
 yes-0.003
 nod-0.003
Of-0.003
 forgive-0.003
Token squ
Token ID2809
Feature activation-2.02e+00

Pos logit contributions
amp+0.039
 anth+0.038
attered+0.037
ines+0.036
icer+0.034
Neg logit contributions
aid-0.038
 yes-0.033
 nod-0.033
 hello-0.031
Of-0.031
Token the
Token ID262
Feature activation+5.56e-01

Pos logit contributions
aid+0.134
Of+0.108
 nod+0.105
 yes+0.103
 forgiveness+0.101
Neg logit contributions
 anth-0.095
ines-0.088
amp-0.088
attered-0.085
 problem-0.082
Token parents
Token ID3397
Feature activation+1.71e+00

Pos logit contributions
aid+0.025
 yes+0.023
Of+0.021
 nod+0.020
 forgiveness+0.020
Neg logit contributions
amp-0.028
 anth-0.027
ines-0.027
attered-0.026
ained-0.025
Token were
Token ID547
Feature activation-9.85e-01

Pos logit contributions
aid+0.090
 yes+0.074
Of+0.074
 nod+0.073
 forgive+0.070
Neg logit contributions
 anth-0.075
amp-0.071
ines-0.068
attered-0.066
ained-0.064
Token also
Token ID635
Feature activation-1.95e+00

Pos logit contributions
 anth+0.048
amp+0.047
ines+0.045
attered+0.044
ained+0.042
Neg logit contributions
aid-0.046
 yes-0.037
Of-0.037
 nod-0.036
 forgiveness-0.035
Token qu
Token ID627
Feature activation-1.21e+00

Pos logit contributions
 anth+0.095
amp+0.092
ines+0.090
attered+0.086
ained+0.083
Neg logit contributions
aid-0.092
Of-0.074
 yes-0.073
 nod-0.071
 forgiveness-0.069
Tokenarre
Token ID9624
Feature activation-1.14e+01

Pos logit contributions
 anth+0.040
 ago+0.038
 problem+0.036
 earthquake+0.034
 camps+0.033
Neg logit contributions
aid-0.073
 nod-0.056
 forgive-0.056
Of-0.055
And-0.053
Tokenling
Token ID1359
Feature activation-8.06e-01

Pos logit contributions
 anth+0.444
amp+0.395
ines+0.358
attered+0.343
ained+0.337
Neg logit contributions
aid-0.678
 yes-0.556
 nod-0.552
Of-0.545
 forgiveness-0.544
Token.
Token ID13
Feature activation+1.09e-01

Pos logit contributions
 anth+0.036
amp+0.034
ines+0.034
attered+0.032
ained+0.031
Neg logit contributions
aid-0.041
Of-0.034
 yes-0.033
 nod-0.033
 forgiveness-0.032
Token The
Token ID383
Feature activation+1.36e+00

Pos logit contributions
aid+0.005
Of+0.004
 yes+0.004
 nod+0.004
 forgiveness+0.004
Neg logit contributions
 anth-0.005
amp-0.005
ines-0.005
attered-0.005
ained-0.005
Token first
Token ID717
Feature activation+1.91e+00

Pos logit contributions
aid+0.056
 yes+0.049
Of+0.047
 nod+0.046
 forgiveness+0.044
Neg logit contributions
amp-0.073
ines-0.070
 anth-0.070
attered-0.068
ained-0.067
Token family
Token ID1641
Feature activation-1.34e+00

Pos logit contributions
aid+0.079
 yes+0.076
 nod+0.072
Of+0.069
 hello+0.066
Neg logit contributions
amp-0.096
 anth-0.096
ines-0.090
attered-0.090
ained-0.090
Token friends
Token ID2460
Feature activation+2.19e+00

Pos logit contributions
amp+0.101
 anth+0.099
ines+0.093
ained+0.092
attered+0.091
Neg logit contributions
aid-0.077
 yes-0.063
 nod-0.062
Of-0.060
 forgiveness-0.057
Token didn
Token ID1422
Feature activation+1.50e+00

Pos logit contributions
aid+0.107
 yes+0.093
 nod+0.093
 forgive+0.090
Of+0.088
Neg logit contributions
 anth-0.102
amp-0.101
ained-0.093
ines-0.092
attered-0.090
Token't
Token ID470
Feature activation-2.18e+00

Pos logit contributions
aid+0.071
 yes+0.059
Of+0.058
 nod+0.057
 forgive+0.055
Neg logit contributions
 anth-0.072
ines-0.067
attered-0.066
amp-0.066
 problem-0.061
Token stop
Token ID2245
Feature activation+4.34e+00

Pos logit contributions
 anth+0.092
ines+0.089
amp+0.085
 argument+0.082
 problem+0.082
Neg logit contributions
aid-0.109
Of-0.092
 yes-0.089
 forgiveness-0.087
 nod-0.084
Token qu
Token ID627
Feature activation-1.15e+00

Pos logit contributions
aid+0.272
Of+0.231
 yes+0.223
 nod+0.219
 forgiveness+0.216
Neg logit contributions
 anth-0.148
amp-0.138
ines-0.135
attered-0.126
 problem-0.122
Tokenarre
Token ID9624
Feature activation-1.13e+01

Pos logit contributions
 ago+0.038
 anth+0.035
 camps+0.032
 earthquake+0.031
 problem+0.031
Neg logit contributions
aid-0.066
 nod-0.056
 forgive-0.053
 band-0.051
Of-0.050
Tokenling
Token ID1359
Feature activation-6.38e-01

Pos logit contributions
 anth+0.488
amp+0.424
 ago+0.387
ines+0.383
 colony+0.382
Neg logit contributions
aid-0.619
 nod-0.504
 yes-0.501
Of-0.483
 forgiveness-0.482
Token and
Token ID290
Feature activation+6.64e-01

Pos logit contributions
 anth+0.028
amp+0.027
ines+0.027
attered+0.025
ained+0.024
Neg logit contributions
aid-0.032
Of-0.027
 yes-0.027
 nod-0.026
 forgiveness-0.025
Token were
Token ID547
Feature activation-9.71e-01

Pos logit contributions
aid+0.034
Of+0.029
 nod+0.029
 yes+0.029
 forgive+0.027
Neg logit contributions
 anth-0.029
amp-0.029
ines-0.027
attered-0.026
ained-0.025
Token getting
Token ID1972
Feature activation-1.31e+00

Pos logit contributions
amp+0.049
 anth+0.047
icer+0.045
ines+0.044
attered+0.043
Neg logit contributions
aid-0.045
 yes-0.037
 nod-0.037
 forgive-0.037
Of-0.036
Token louder
Token ID27089
Feature activation+5.12e+00

Pos logit contributions
 anth+0.054
amp+0.053
ines+0.051
attered+0.047
 problem+0.045
Neg logit contributions
aid-0.072
Of-0.060
 yes-0.059
 nod-0.058
 forgiveness-0.057
Token and
Token ID290
Feature activation+7.56e-01

Pos logit contributions
aid+0.139
 yes+0.121
 nod+0.120
Of+0.115
 forgive+0.115
Neg logit contributions
 anth-0.116
amp-0.112
ines-0.104
attered-0.103
ained-0.100
Token their
Token ID511
Feature activation+9.65e-01

Pos logit contributions
aid+0.036
Of+0.029
 yes+0.029
 nod+0.028
 forgiveness+0.028
Neg logit contributions
amp-0.036
 anth-0.036
ines-0.035
attered-0.033
ained-0.032
Token children
Token ID1751
Feature activation+3.07e+00

Pos logit contributions
aid+0.042
 nod+0.037
 yes+0.036
Of+0.036
 forgiveness+0.036
Neg logit contributions
amp-0.046
ained-0.044
ines-0.044
 anth-0.043
attered-0.041
Token,
Token ID11
Feature activation-8.54e-02

Pos logit contributions
aid+0.136
 nod+0.122
 yes+0.121
 forgive+0.116
 forgiveness+0.111
Neg logit contributions
 anth-0.144
amp-0.141
ines-0.130
attered-0.130
ained-0.125
Token qu
Token ID627
Feature activation-1.11e+00

Pos logit contributions
 anth+0.004
amp+0.004
ines+0.004
attered+0.004
 problem+0.003
Neg logit contributions
aid-0.004
Of-0.003
 yes-0.003
 nod-0.003
 forgiveness-0.003
Tokenarre
Token ID9624
Feature activation-1.13e+01

Pos logit contributions
 ago+0.041
 problem+0.037
 camps+0.035
 steep+0.034
 earthquake+0.034
Neg logit contributions
aid-0.057
 nod-0.047
 band-0.045
And-0.044
 forgive-0.043
Tokenling
Token ID1359
Feature activation-6.87e-01

Pos logit contributions
 anth+0.520
amp+0.465
 ago+0.429
 colony+0.415
 camps+0.414
Neg logit contributions
aid-0.583
 nod-0.461
 yes-0.446
 band-0.442
 heal-0.441
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
 anth+0.032
amp+0.031
ines+0.030
attered+0.028
ained+0.028
Neg logit contributions
aid-0.033
 nod-0.027
 yes-0.027
Of-0.026
 forgiveness-0.026
Token
Token ID220
Feature activation-3.64e-01

Pos logit contributions
aid+0.011
Of+0.009
 yes+0.009
 nod+0.009
 forgive+0.008
Neg logit contributions
 anth-0.016
ines-0.016
amp-0.016
attered-0.015
 problem-0.015
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
 anth+0.023
amp+0.022
ines+0.022
attered+0.021
 problem+0.021
Neg logit contributions
aid-0.012
Of-0.009
 yes-0.008
 nod-0.008
 forgive-0.008
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
aid+0.056
 forgiveness+0.042
Of+0.042
 yes+0.042
 nod+0.042
Neg logit contributions
 anth-0.064
amp-0.064
ines-0.062
attered-0.057
ained-0.056
Token told
Token ID1297
Feature activation-2.12e+00

Pos logit contributions
aid+0.050
Of+0.041
 nod+0.041
 yes+0.041
 forgiveness+0.040
Neg logit contributions
 anth-0.032
amp-0.030
ines-0.030
attered-0.028
 problem-0.028
Token them
Token ID606
Feature activation+2.00e+00

Pos logit contributions
 anth+0.096
amp+0.092
ines+0.090
attered+0.086
 problem+0.082
Neg logit contributions
aid-0.108
Of-0.089
 yes-0.086
 nod-0.085
 forgiveness-0.083
Token to
Token ID284
Feature activation-1.46e-01

Pos logit contributions
aid+0.096
 yes+0.082
 nod+0.079
Of+0.079
 forgiveness+0.078
Neg logit contributions
 anth-0.095
amp-0.093
ines-0.088
attered-0.085
ained-0.083
Token stop
Token ID2245
Feature activation+4.44e+00

Pos logit contributions
 anth+0.007
amp+0.007
ines+0.007
attered+0.007
 argument+0.007
Neg logit contributions
aid-0.006
Of-0.005
 yes-0.005
 nod-0.005
 forgiveness-0.005
Token qu
Token ID627
Feature activation-1.09e+00

Pos logit contributions
aid+0.249
Of+0.209
 yes+0.204
 nod+0.202
 forgiveness+0.197
Neg logit contributions
 anth-0.177
amp-0.169
ines-0.165
attered-0.157
 problem-0.149
Tokenarre
Token ID9624
Feature activation-1.13e+01

Pos logit contributions
 ago+0.031
 anth+0.029
 camps+0.027
 problem+0.026
ines+0.026
Neg logit contributions
aid-0.071
 nod-0.058
 forgive-0.057
Of-0.055
 forgiveness-0.054
Tokenlling
Token ID2680
Feature activation-1.30e+00

Pos logit contributions
 anth+0.439
amp+0.392
ines+0.357
 ago+0.356
attered+0.354
Neg logit contributions
aid-0.668
 nod-0.540
Of-0.536
 yes-0.532
 forgiveness-0.527
Token and
Token ID290
Feature activation+8.87e-01

Pos logit contributions
 anth+0.068
amp+0.064
ines+0.061
attered+0.059
ained+0.057
Neg logit contributions
aid-0.056
 yes-0.048
 nod-0.047
 forgive-0.045
 forgiveness-0.045
Token to
Token ID284
Feature activation-1.04e-01

Pos logit contributions
aid+0.040
Of+0.032
 yes+0.031
 nod+0.031
 forgiveness+0.030
Neg logit contributions
 anth-0.044
amp-0.043
ines-0.042
attered-0.040
 problem-0.039
Token share
Token ID2648
Feature activation+6.67e+00

Pos logit contributions
 anth+0.005
amp+0.005
ines+0.005
attered+0.005
 argument+0.005
Neg logit contributions
aid-0.005
Of-0.004
 yes-0.004
 nod-0.003
 forgiveness-0.003
Token the
Token ID262
Feature activation+6.91e-01

Pos logit contributions
aid+0.369
Of+0.311
 yes+0.297
 nod+0.294
 forgiveness+0.279
Neg logit contributions
 anth-0.261
ines-0.250
amp-0.250
 problem-0.224
attered-0.222
Token on
Token ID319
Feature activation-1.44e+00

Pos logit contributions
 anth+0.121
amp+0.115
ines+0.113
attered+0.109
 argument+0.101
Neg logit contributions
aid-0.123
 yes-0.100
Of-0.100
 nod-0.096
 forgiveness-0.092
Token,
Token ID11
Feature activation-3.97e-01

Pos logit contributions
 anth+0.066
amp+0.064
ines+0.062
attered+0.060
ained+0.057
Neg logit contributions
aid-0.071
 yes-0.060
Of-0.059
 nod-0.058
 forgiveness-0.057
Token they
Token ID484
Feature activation-2.54e+00

Pos logit contributions
 anth+0.017
amp+0.016
ines+0.016
attered+0.015
ained+0.015
Neg logit contributions
aid-0.021
Of-0.017
 nod-0.017
 yes-0.017
 forgiveness-0.016
Token never
Token ID1239
Feature activation-2.69e+00

Pos logit contributions
 anth+0.093
amp+0.089
ines+0.087
attered+0.082
 problem+0.078
Neg logit contributions
aid-0.150
Of-0.128
 yes-0.125
 nod-0.124
 forgive-0.120
Token qu
Token ID627
Feature activation-1.28e+00

Pos logit contributions
 anth+0.108
amp+0.104
ines+0.102
attered+0.096
ained+0.091
Neg logit contributions
aid-0.150
Of-0.126
 nod-0.123
 yes-0.123
 forgiveness-0.119
Tokenarre
Token ID9624
Feature activation-1.13e+01

Pos logit contributions
 anth+0.038
 ago+0.035
 problem+0.033
 earthquake+0.031
ines+0.031
Neg logit contributions
aid-0.082
 nod-0.066
Of-0.065
 forgive-0.065
 forgiveness-0.062
Tokenled
Token ID992
Feature activation+2.22e+00

Pos logit contributions
 anth+0.460
amp+0.419
ines+0.395
attered+0.368
 ago+0.364
Neg logit contributions
aid-0.655
Of-0.535
 yes-0.522
 nod-0.518
 forgiveness-0.516
Token again
Token ID757
Feature activation+2.52e+00

Pos logit contributions
aid+0.113
Of+0.093
 yes+0.091
 nod+0.090
 forgiveness+0.087
Neg logit contributions
 anth-0.099
amp-0.095
ines-0.093
attered-0.089
 problem-0.086
Token and
Token ID290
Feature activation+3.58e-01

Pos logit contributions
aid+0.130
Of+0.107
 yes+0.105
 nod+0.104
 forgiveness+0.101
Neg logit contributions
 anth-0.111
amp-0.107
ines-0.105
attered-0.100
ained-0.096
Token played
Token ID2826
Feature activation+2.82e+00

Pos logit contributions
aid+0.020
Of+0.017
 yes+0.016
 nod+0.016
 forgiveness+0.015
Neg logit contributions
 anth-0.014
amp-0.014
ines-0.013
attered-0.013
 problem-0.012
Token together
Token ID1978
Feature activation+3.51e-01

Pos logit contributions
aid+0.170
Of+0.145
 yes+0.132
 nod+0.131
 stro+0.130
Neg logit contributions
 anth-0.096
amp-0.095
ines-0.090
 argument-0.087
 problem-0.083
Token Lily
Token ID20037
Feature activation-3.16e-01

Pos logit contributions
 anth+0.018
ines+0.017
amp+0.017
attered+0.017
ained+0.016
Neg logit contributions
aid-0.013
Of-0.011
 yes-0.011
 nod-0.010
 forgiveness-0.010
Token and
Token ID290
Feature activation+3.43e-01

Pos logit contributions
 anth+0.014
amp+0.014
ines+0.014
attered+0.013
 problem+0.012
Neg logit contributions
aid-0.016
Of-0.013
 yes-0.013
 nod-0.013
 forgiveness-0.012
Token Jack
Token ID3619
Feature activation-2.31e+00

Pos logit contributions
aid+0.021
Of+0.018
 yes+0.017
 nod+0.017
 forgiveness+0.016
Neg logit contributions
 anth-0.013
amp-0.012
ines-0.011
attered-0.011
 problem-0.010
Token never
Token ID1239
Feature activation-2.68e+00

Pos logit contributions
 anth+0.092
amp+0.088
ines+0.084
attered+0.081
 problem+0.078
Neg logit contributions
aid-0.129
Of-0.108
 nod-0.108
 yes-0.107
 forgive-0.103
Token qu
Token ID627
Feature activation-1.19e+00

Pos logit contributions
 anth+0.109
amp+0.107
ines+0.104
attered+0.097
ained+0.094
Neg logit contributions
aid-0.148
Of-0.124
 nod-0.122
 yes-0.120
 forgive-0.117
Tokenarre
Token ID9624
Feature activation-1.13e+01

Pos logit contributions
 anth+0.037
 ago+0.033
 problem+0.030
ines+0.030
 camps+0.030
Neg logit contributions
aid-0.077
Of-0.061
 nod-0.061
 forgive-0.060
 forgiveness-0.058
Tokenled
Token ID992
Feature activation+2.18e+00

Pos logit contributions
 anth+0.476
amp+0.431
ines+0.406
 ago+0.378
attered+0.373
Neg logit contributions
aid-0.645
Of-0.520
 yes-0.508
 forgiveness-0.504
 nod-0.502
Token again
Token ID757
Feature activation+2.60e+00

Pos logit contributions
aid+0.116
Of+0.096
 yes+0.093
 nod+0.092
 forgiveness+0.089
Neg logit contributions
 anth-0.093
amp-0.089
ines-0.088
attered-0.084
 problem-0.081
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
aid+0.127
Of+0.104
 yes+0.103
 nod+0.103
 forgiveness+0.100
Neg logit contributions
 anth-0.119
amp-0.116
ines-0.112
attered-0.109
ained-0.103
Token They
Token ID1119
Feature activation-5.14e-01

Pos logit contributions
 anth+0.008
amp+0.008
ines+0.007
attered+0.007
ained+0.007
Neg logit contributions
aid-0.009
Of-0.008
 yes-0.008
 nod-0.007
 forgiveness-0.007
Token learned
Token ID4499
Feature activation+2.69e+00

Pos logit contributions
 anth+0.022
amp+0.021
ines+0.020
attered+0.020
 problem+0.018
Neg logit contributions
aid-0.028
Of-0.023
 nod-0.023
 yes-0.023
 forgive-0.023
Token.
Token ID13
Feature activation+2.14e-01

Pos logit contributions
aid+0.086
 yes+0.071
Of+0.066
 nod+0.066
 forgiveness+0.063
Neg logit contributions
 anth-0.088
amp-0.082
ines-0.079
 argument-0.075
 problem-0.074
Token The
Token ID383
Feature activation+1.40e+00

Pos logit contributions
aid+0.010
Of+0.008
 yes+0.007
 nod+0.007
 forgiveness+0.007
Neg logit contributions
 anth-0.011
amp-0.011
ines-0.010
attered-0.010
ained-0.009
Token bears
Token ID13062
Feature activation+1.88e+00

Pos logit contributions
aid+0.073
Of+0.061
 yes+0.061
 nod+0.059
 forgiveness+0.057
Neg logit contributions
amp-0.061
 anth-0.060
ines-0.058
attered-0.057
ained-0.055
Token stopped
Token ID5025
Feature activation+1.71e+00

Pos logit contributions
aid+0.100
 yes+0.084
 nod+0.084
Of+0.082
 forgive+0.079
Neg logit contributions
 anth-0.082
amp-0.080
ines-0.074
attered-0.072
ained-0.071
Token qu
Token ID627
Feature activation-1.12e+00

Pos logit contributions
aid+0.101
Of+0.084
 yes+0.082
 nod+0.080
 forgiveness+0.079
Neg logit contributions
 anth-0.065
amp-0.061
ines-0.060
attered-0.056
 problem-0.054
Tokenarre
Token ID9624
Feature activation-1.13e+01

Pos logit contributions
 anth+0.032
 ago+0.030
 problem+0.029
ines+0.028
 earthquake+0.027
Neg logit contributions
aid-0.073
 nod-0.060
Of-0.057
 forgive-0.056
 band-0.056
Tokenling
Token ID1359
Feature activation-7.21e-01

Pos logit contributions
 anth+0.464
amp+0.409
ines+0.376
 ago+0.368
attered+0.368
Neg logit contributions
aid-0.631
 nod-0.522
 yes-0.515
Of-0.500
 heal-0.494
Token and
Token ID290
Feature activation+7.95e-01

Pos logit contributions
 anth+0.031
amp+0.030
ines+0.030
attered+0.028
 problem+0.027
Neg logit contributions
aid-0.038
Of-0.031
 yes-0.029
 nod-0.029
 forgiveness-0.029
Token asked
Token ID1965
Feature activation-1.23e+00

Pos logit contributions
aid+0.040
Of+0.033
 yes+0.032
 nod+0.031
 forgiveness+0.031
Neg logit contributions
 anth-0.036
amp-0.034
ines-0.033
attered-0.032
 problem-0.031
Token him
Token ID683
Feature activation+3.19e+00

Pos logit contributions
 anth+0.054
ines+0.051
amp+0.051
 problem+0.047
attered+0.047
Neg logit contributions
aid-0.064
Of-0.052
 nod-0.049
 yes-0.048
 forgiveness-0.047
Token for
Token ID329
Feature activation-4.33e+00

Pos logit contributions
aid+0.182
Of+0.145
 forgiveness+0.134
 nod+0.132
Sleep+0.129
Neg logit contributions
 anth-0.123
ines-0.120
 problem-0.111
amp-0.111
 ago-0.104
Token mum
Token ID25682
Feature activation-7.74e-01

Pos logit contributions
aid+0.046
Of+0.038
 yes+0.037
 nod+0.037
 forgiveness+0.037
Neg logit contributions
amp-0.060
 anth-0.060
ines-0.059
attered-0.058
ained-0.056
Token and
Token ID290
Feature activation+6.52e-01

Pos logit contributions
 anth+0.029
ines+0.028
amp+0.027
 problem+0.026
 argument+0.025
Neg logit contributions
aid-0.045
Of-0.038
 yes-0.035
 forgiveness-0.034
 nod-0.034
Token understood
Token ID7247
Feature activation+4.02e+00

Pos logit contributions
aid+0.036
Of+0.030
 yes+0.028
 nod+0.027
 forgiveness+0.027
Neg logit contributions
 anth-0.027
ines-0.023
attered-0.023
amp-0.023
 problem-0.022
Token that
Token ID326
Feature activation-2.53e+00

Pos logit contributions
aid+0.220
Of+0.183
 yes+0.173
 nod+0.172
 forgiveness+0.162
Neg logit contributions
 anth-0.164
ines-0.156
amp-0.154
 problem-0.144
attered-0.142
Token qu
Token ID627
Feature activation-1.05e+00

Pos logit contributions
 anth+0.115
amp+0.109
ines+0.108
attered+0.103
 problem+0.101
Neg logit contributions
aid-0.127
Of-0.104
 yes-0.100
 nod-0.099
 forgiveness-0.095
Tokenarre
Token ID9624
Feature activation-1.13e+01

Pos logit contributions
 anth+0.033
 camps+0.027
 ago+0.026
 problem+0.024
ines+0.024
Neg logit contributions
aid-0.069
 forgive-0.056
 nod-0.055
 forgiveness-0.054
Of-0.053
Tokenlling
Token ID2680
Feature activation-1.26e+00

Pos logit contributions
 anth+0.401
amp+0.368
ines+0.341
attered+0.325
ained+0.305
Neg logit contributions
aid-0.700
Of-0.585
 nod-0.576
 yes-0.576
 forgiveness-0.567
Token was
Token ID373
Feature activation-1.83e+00

Pos logit contributions
 anth+0.058
amp+0.055
ines+0.052
attered+0.050
ained+0.048
Neg logit contributions
aid-0.064
 yes-0.053
 nod-0.053
Of-0.052
 forgiveness-0.051
Token not
Token ID407
Feature activation-1.58e+00

Pos logit contributions
 anth+0.087
amp+0.082
ines+0.082
attered+0.078
 problem+0.077
Neg logit contributions
aid-0.088
Of-0.072
 yes-0.069
 nod-0.069
 forgiveness-0.065
Token a
Token ID257
Feature activation+1.19e+00

Pos logit contributions
 anth+0.068
amp+0.065
ines+0.064
attered+0.061
 problem+0.059
Neg logit contributions
aid-0.082
Of-0.068
 yes-0.066
 nod-0.066
 forgiveness-0.064
Token good
Token ID922
Feature activation+4.34e-01

Pos logit contributions
aid+0.061
Of+0.051
 yes+0.050
 nod+0.050
 forgiveness+0.049
Neg logit contributions
 anth-0.052
amp-0.051
ines-0.050
attered-0.048
ained-0.046
Token Tom
Token ID4186
Feature activation-1.23e+00

Pos logit contributions
 anth+0.002
amp+0.002
ines+0.002
 ago+0.002
attered+0.002
Neg logit contributions
aid-0.002
Of-0.002
 forgiveness-0.002
 yes-0.002
 nod-0.002
Token and
Token ID290
Feature activation+7.28e-01

Pos logit contributions
 anth+0.057
amp+0.055
ines+0.054
attered+0.051
 problem+0.049
Neg logit contributions
aid-0.061
Of-0.050
 yes-0.049
 nod-0.048
 forgiveness-0.047
Token Mia
Token ID32189
Feature activation-2.17e+00

Pos logit contributions
aid+0.038
Of+0.031
 yes+0.031
 nod+0.030
 forgiveness+0.029
Neg logit contributions
 anth-0.032
amp-0.031
ines-0.030
attered-0.029
 problem-0.027
Token stop
Token ID2245
Feature activation+4.18e+00

Pos logit contributions
 anth+0.098
ines+0.096
amp+0.095
attered+0.087
 problem+0.087
Neg logit contributions
aid-0.108
Of-0.090
 forgiveness-0.084
 yes-0.083
 nod-0.077
Token qu
Token ID627
Feature activation-1.05e+00

Pos logit contributions
aid+0.233
Of+0.196
 yes+0.192
 nod+0.191
 forgiveness+0.185
Neg logit contributions
 anth-0.167
amp-0.161
ines-0.156
attered-0.149
ained-0.142
Tokenarre
Token ID9624
Feature activation-1.13e+01

Pos logit contributions
 ago+0.036
 anth+0.034
storm+0.032
 camps+0.031
 cloudy+0.031
Neg logit contributions
aid-0.061
 nod-0.051
 forgive-0.049
 band-0.047
And-0.045
Tokenling
Token ID1359
Feature activation-6.16e-01

Pos logit contributions
 anth+0.453
amp+0.402
attered+0.368
 ago+0.363
ines+0.360
Neg logit contributions
aid-0.660
 nod-0.528
Of-0.512
 yes-0.511
 forgiveness-0.509
Token.
Token ID13
Feature activation+1.57e-03

Pos logit contributions
 anth+0.024
amp+0.023
ines+0.022
attered+0.021
 problem+0.020
Neg logit contributions
aid-0.035
Of-0.030
 yes-0.028
 nod-0.028
 forgiveness-0.027
Token They
Token ID1119
Feature activation-2.27e-01

Pos logit contributions
aid+0.000
Of+0.000
 yes+0.000
 forgiveness+0.000
 nod+0.000
Neg logit contributions
 anth-0.000
amp-0.000
ines-0.000
attered-0.000
ained-0.000
Token see
Token ID766
Feature activation+1.21e+00

Pos logit contributions
 anth+0.012
amp+0.011
ines+0.011
attered+0.011
 problem+0.010
Neg logit contributions
aid-0.010
Of-0.008
 yes-0.008
 nod-0.008
 forgiveness-0.007
Token the
Token ID262
Feature activation+6.71e-01

Pos logit contributions
aid+0.054
Of+0.042
 yes+0.041
 nod+0.041
 forgiveness+0.039
Neg logit contributions
 anth-0.063
amp-0.061
ines-0.060
attered-0.058
ained-0.056
Token walking
Token ID6155
Feature activation-8.95e-01

Pos logit contributions
 anth+0.079
amp+0.076
ines+0.073
attered+0.072
ained+0.067
Neg logit contributions
aid-0.105
Of-0.088
 yes-0.087
 nod-0.085
 forgiveness-0.083
Token by
Token ID416
Feature activation+4.63e+00

Pos logit contributions
 anth+0.042
amp+0.039
ines+0.038
attered+0.037
ained+0.035
Neg logit contributions
aid-0.044
Of-0.037
 yes-0.036
 nod-0.036
 forgiveness-0.034
Token saw
Token ID2497
Feature activation-2.58e+00

Pos logit contributions
aid+0.228
Of+0.186
 yes+0.184
 nod+0.181
 forgiveness+0.177
Neg logit contributions
 anth-0.216
amp-0.206
ines-0.201
attered-0.196
ained-0.186
Token them
Token ID606
Feature activation+1.98e+00

Pos logit contributions
 anth+0.116
amp+0.111
ines+0.108
attered+0.105
ained+0.100
Neg logit contributions
aid-0.133
Of-0.109
 yes-0.108
 nod-0.105
 forgiveness-0.105
Token qu
Token ID627
Feature activation-1.21e+00

Pos logit contributions
aid+0.090
 yes+0.075
 nod+0.075
Of+0.072
 forgive+0.072
Neg logit contributions
 anth-0.098
amp-0.092
attered-0.088
ines-0.088
ained-0.083
Tokenarre
Token ID9624
Feature activation-1.12e+01

Pos logit contributions
 ago+0.038
 anth+0.037
 problem+0.033
 cloudy+0.032
storm+0.032
Neg logit contributions
aid-0.073
 nod-0.056
 band-0.056
 forgive-0.055
 mass-0.053
Tokenling
Token ID1359
Feature activation-7.00e-01

Pos logit contributions
 anth+0.468
amp+0.410
attered+0.378
 ago+0.375
 colony+0.368
Neg logit contributions
aid-0.631
 nod-0.511
 forgiveness-0.492
 yes-0.491
Of-0.486
Token.
Token ID13
Feature activation-5.98e-02

Pos logit contributions
 anth+0.033
amp+0.031
ines+0.031
attered+0.029
ained+0.028
Neg logit contributions
aid-0.034
Of-0.028
 yes-0.028
 nod-0.027
 forgiveness-0.026
Token She
Token ID1375
Feature activation+6.94e-01

Pos logit contributions
 anth+0.003
amp+0.003
ines+0.003
attered+0.003
 problem+0.003
Neg logit contributions
aid-0.003
Of-0.002
 yes-0.002
 nod-0.002
 forgiveness-0.002
Token was
Token ID373
Feature activation-1.88e+00

Pos logit contributions
aid+0.038
Of+0.032
 yes+0.031
 nod+0.031
 forgiveness+0.030
Neg logit contributions
 anth-0.028
amp-0.027
ines-0.026
attered-0.025
ained-0.024
Token very
Token ID845
Feature activation-1.99e+00

Pos logit contributions
 anth+0.082
amp+0.079
ines+0.076
attered+0.075
ained+0.069
Neg logit contributions
aid-0.098
Of-0.081
 yes-0.080
 nod-0.079
 forgiveness-0.077
Token he
Token ID339
Feature activation-1.28e+00

Pos logit contributions
 anth+0.012
ines+0.012
amp+0.011
attered+0.011
 problem+0.011
Neg logit contributions
aid-0.010
Of-0.008
 nod-0.007
 forgiveness-0.007
 yes-0.007
Token found
Token ID1043
Feature activation-2.06e+00

Pos logit contributions
amp+0.050
 anth+0.049
ines+0.047
attered+0.046
ained+0.044
Neg logit contributions
aid-0.070
 yes-0.062
Of-0.061
 nod-0.060
 forgive-0.058
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
 anth+0.101
ines+0.095
amp+0.094
attered+0.091
 problem+0.089
Neg logit contributions
aid-0.093
Of-0.077
 nod-0.075
 yes-0.072
 forgiveness-0.069
Token rabbit
Token ID22746
Feature activation+3.61e-01

Pos logit contributions
aid+0.060
Of+0.049
 yes+0.048
 nod+0.048
 forgiveness+0.046
Neg logit contributions
 anth-0.047
amp-0.044
ines-0.043
attered-0.041
 problem-0.040
Token qu
Token ID627
Feature activation-1.25e+00

Pos logit contributions
aid+0.018
Of+0.015
 yes+0.015
 nod+0.015
 forgiveness+0.014
Neg logit contributions
 anth-0.016
amp-0.015
ines-0.015
attered-0.014
ained-0.014
Tokenarre
Token ID9624
Feature activation-1.12e+01

Pos logit contributions
 ago+0.041
 anth+0.041
 problem+0.038
 earthquake+0.036
ines+0.035
Neg logit contributions
aid-0.076
 nod-0.058
Of-0.058
 band-0.057
 forgive-0.057
Tokenlling
Token ID2680
Feature activation-1.15e+00

Pos logit contributions
 anth+0.467
amp+0.421
attered+0.373
ines+0.366
 ago+0.364
Neg logit contributions
aid-0.647
 yes-0.509
 nod-0.507
 heal-0.494
Of-0.492
Token with
Token ID351
Feature activation-7.28e-01

Pos logit contributions
 anth+0.058
amp+0.056
ines+0.052
icer+0.050
attered+0.050
Neg logit contributions
aid-0.047
 yes-0.046
 nod-0.042
 hello-0.041
 forgive-0.040
Token a
Token ID257
Feature activation+1.13e+00

Pos logit contributions
 anth+0.036
amp+0.034
ines+0.033
attered+0.032
 problem+0.031
Neg logit contributions
aid-0.034
Of-0.028
 yes-0.026
 nod-0.026
 forgiveness-0.025
Token squirrel
Token ID33039
Feature activation+3.55e-01

Pos logit contributions
aid+0.059
Of+0.049
 yes+0.048
 nod+0.047
 forgiveness+0.046
Neg logit contributions
 anth-0.049
amp-0.047
ines-0.046
attered-0.044
ained-0.042
Token.
Token ID13
Feature activation+9.92e-02

Pos logit contributions
aid+0.017
 yes+0.014
 stro+0.013
 nod+0.013
Of+0.013
Neg logit contributions
 anth-0.017
amp-0.016
ines-0.015
attered-0.015
ained-0.015
Token she
Token ID673
Feature activation-1.11e+00

Pos logit contributions
aid+0.173
Of+0.144
 nod+0.136
 yes+0.136
 forgiveness+0.132
Neg logit contributions
 anth-0.126
ines-0.121
amp-0.120
attered-0.113
 problem-0.112
Token was
Token ID373
Feature activation-1.84e+00

Pos logit contributions
 anth+0.045
amp+0.043
ines+0.042
attered+0.040
ained+0.038
Neg logit contributions
aid-0.062
Of-0.052
 yes-0.050
 nod-0.050
 forgiveness-0.049
Token sorry
Token ID7926
Feature activation+2.83e+00

Pos logit contributions
 anth+0.091
amp+0.088
ines+0.086
attered+0.083
 problem+0.080
Neg logit contributions
aid-0.086
Of-0.069
 yes-0.067
 nod-0.066
 forgiveness-0.064
Token for
Token ID329
Feature activation-4.47e+00

Pos logit contributions
aid+0.154
Of+0.129
 yes+0.125
 nod+0.124
 forgiveness+0.121
Neg logit contributions
 anth-0.117
amp-0.112
ines-0.110
attered-0.104
 problem-0.100
Token qu
Token ID627
Feature activation-1.14e+00

Pos logit contributions
 anth+0.159
ines+0.146
amp+0.143
 argument+0.140
 problem+0.140
Neg logit contributions
aid-0.268
Of-0.226
 nod-0.213
 yes-0.204
 stro-0.201
Tokenarre
Token ID9624
Feature activation-1.12e+01

Pos logit contributions
 anth+0.034
 ago+0.030
 camps+0.029
ines+0.028
 problem+0.027
Neg logit contributions
aid-0.074
 forgive-0.061
 nod-0.060
Of-0.059
 forgiveness-0.058
Tokenling
Token ID1359
Feature activation-5.95e-01

Pos logit contributions
 anth+0.422
amp+0.379
ines+0.348
attered+0.329
 camps+0.322
Neg logit contributions
aid-0.685
Of-0.561
 nod-0.556
 yes-0.553
 forgiveness-0.552
Token.
Token ID13
Feature activation+3.31e-02

Pos logit contributions
 anth+0.026
amp+0.025
ines+0.024
attered+0.023
 problem+0.022
Neg logit contributions
aid-0.031
Of-0.026
 yes-0.025
 nod-0.025
 forgiveness-0.024
Token From
Token ID3574
Feature activation+1.68e+00

Pos logit contributions
aid+0.002
Of+0.001
 yes+0.001
 nod+0.001
 forgiveness+0.001
Neg logit contributions
 anth-0.002
amp-0.002
ines-0.002
attered-0.001
ained-0.001
Token that
Token ID326
Feature activation-2.67e+00

Pos logit contributions
aid+0.093
Of+0.078
 yes+0.076
 nod+0.076
 forgiveness+0.073
Neg logit contributions
 anth-0.067
amp-0.064
ines-0.063
attered-0.059
 problem-0.058
Token day
Token ID1110
Feature activation-2.49e+00

Pos logit contributions
 anth+0.093
attered+0.092
ines+0.087
amp+0.087
ays+0.078
Neg logit contributions
aid-0.154
 nod-0.145
Of-0.144
 yes-0.142
 forgive-0.142
Token Finally
Token ID9461
Feature activation+7.27e-01

Pos logit contributions
aid+0.007
Of+0.005
 yes+0.005
 nod+0.005
 forgiveness+0.005
Neg logit contributions
 anth-0.006
amp-0.006
ines-0.006
attered-0.006
ained-0.006
Token,
Token ID11
Feature activation-6.10e-02

Pos logit contributions
aid+0.034
 yes+0.030
 nod+0.028
Of+0.028
 forgiveness+0.027
Neg logit contributions
 anth-0.035
amp-0.035
ines-0.034
attered-0.033
ained-0.031
Token after
Token ID706
Feature activation-1.27e+00

Pos logit contributions
 anth+0.003
amp+0.003
ines+0.003
attered+0.003
ained+0.003
Neg logit contributions
aid-0.003
Of-0.002
 yes-0.002
 nod-0.002
 forgiveness-0.002
Token much
Token ID881
Feature activation-4.62e+00

Pos logit contributions
amp+0.075
 anth+0.075
attered+0.073
ines+0.071
ained+0.067
Neg logit contributions
aid-0.046
 yes-0.037
 forgiveness-0.036
 forgive-0.032
Of-0.032
Token qu
Token ID627
Feature activation-1.07e+00

Pos logit contributions
amp+0.213
 anth+0.207
attered+0.204
ines+0.191
ained+0.191
Neg logit contributions
aid-0.229
 nod-0.192
 yes-0.189
 forgiveness-0.189
 forgive-0.179
Tokenarre
Token ID9624
Feature activation-1.12e+01

Pos logit contributions
 ago+0.032
 anth+0.031
 camps+0.029
 earthquake+0.027
ines+0.027
Neg logit contributions
aid-0.064
 nod-0.055
 forgive-0.051
Of-0.049
 band-0.049
Tokenling
Token ID1359
Feature activation-5.93e-01

Pos logit contributions
 anth+0.463
amp+0.409
attered+0.370
ines+0.365
 camps+0.356
Neg logit contributions
aid-0.632
 nod-0.526
 yes-0.515
Of-0.505
 forgiveness-0.504
Token,
Token ID11
Feature activation-2.19e-02

Pos logit contributions
 anth+0.027
amp+0.026
ines+0.026
attered+0.024
ained+0.024
Neg logit contributions
aid-0.029
 yes-0.024
Of-0.024
 nod-0.024
 forgiveness-0.023
Token one
Token ID530
Feature activation-4.65e+00

Pos logit contributions
 anth+0.001
amp+0.001
ines+0.001
attered+0.001
 problem+0.001
Neg logit contributions
aid-0.001
Of-0.001
 nod-0.001
 yes-0.001
 forgiveness-0.001
Token of
Token ID286
Feature activation+1.12e+00

Pos logit contributions
 anth+0.185
amp+0.178
ines+0.172
attered+0.164
ained+0.157
Neg logit contributions
aid-0.263
Of-0.222
 yes-0.219
 nod-0.217
 forgiveness-0.210
Token the
Token ID262
Feature activation+8.71e-01

Pos logit contributions
aid+0.045
Of+0.035
 yes+0.034
 forgiveness+0.033
 nod+0.033
Neg logit contributions
 anth-0.063
amp-0.062
ines-0.060
attered-0.059
ooky-0.056
Token sharing
Token ID7373
Feature activation+3.03e+00

Pos logit contributions
 anth+0.124
amp+0.121
ines+0.120
attered+0.114
ained+0.107
Neg logit contributions
aid-0.137
Of-0.114
 yes-0.112
 nod-0.110
 forgiveness-0.109
Token is
Token ID318
Feature activation-1.62e+00

Pos logit contributions
aid+0.173
Of+0.146
 yes+0.142
 nod+0.140
 forgiveness+0.134
Neg logit contributions
 anth-0.115
amp-0.110
ines-0.109
attered-0.102
 problem-0.100
Token important
Token ID1593
Feature activation-4.27e+00

Pos logit contributions
 anth+0.063
 problem+0.060
ines+0.059
ained+0.058
amp+0.058
Neg logit contributions
aid-0.087
Of-0.071
 nod-0.071
 yes-0.069
 stro-0.066
Token and
Token ID290
Feature activation+3.76e-01

Pos logit contributions
 anth+0.185
amp+0.177
ines+0.174
attered+0.166
 problem+0.159
Neg logit contributions
aid-0.225
Of-0.187
 yes-0.182
 nod-0.180
 forgiveness-0.175
Token qu
Token ID627
Feature activation-1.21e+00

Pos logit contributions
aid+0.020
Of+0.016
 yes+0.016
 nod+0.016
 stro+0.015
Neg logit contributions
 anth-0.015
ines-0.014
 problem-0.014
attered-0.014
amp-0.014
Tokenarre
Token ID9624
Feature activation-1.12e+01

Pos logit contributions
 anth+0.036
 ago+0.032
 camps+0.030
 problem+0.030
ines+0.028
Neg logit contributions
aid-0.079
 forgive-0.065
 nod-0.063
Of-0.062
 forgiveness-0.061
Tokenlling
Token ID2680
Feature activation-1.31e+00

Pos logit contributions
 anth+0.421
amp+0.381
ines+0.352
attered+0.332
 ago+0.322
Neg logit contributions
aid-0.680
Of-0.561
 nod-0.552
 yes-0.548
 forgiveness-0.547
Token is
Token ID318
Feature activation-1.64e+00

Pos logit contributions
 anth+0.062
amp+0.058
ines+0.055
attered+0.053
ained+0.051
Neg logit contributions
aid-0.064
 yes-0.053
 nod-0.053
Of-0.052
 forgiveness-0.051
Token not
Token ID407
Feature activation-1.77e+00

Pos logit contributions
 anth+0.068
ines+0.065
 problem+0.064
ained+0.062
amp+0.062
Neg logit contributions
aid-0.083
Of-0.069
 nod-0.067
 yes-0.066
 stro-0.061
Token good
Token ID922
Feature activation+1.44e-01

Pos logit contributions
 anth+0.080
amp+0.079
ines+0.076
attered+0.072
ained+0.068
Neg logit contributions
aid-0.090
Of-0.074
 yes-0.073
 nod-0.072
 forgiveness-0.071
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
aid+0.007
 forgiveness+0.006
 yes+0.006
 nod+0.006
 forgive+0.006
Neg logit contributions
 anth-0.007
amp-0.007
ines-0.006
attered-0.006
icer-0.006
Token,
Token ID11
Feature activation-2.84e-01

Pos logit contributions
 anth+0.095
amp+0.091
ines+0.089
attered+0.085
 problem+0.081
Neg logit contributions
aid-0.129
Of-0.108
 yes-0.106
 nod-0.105
 forgiveness-0.102
Token the
Token ID262
Feature activation+5.72e-01

Pos logit contributions
 cloudy+0.008
 anth+0.008
 flood+0.007
 everywhere+0.007
 all+0.007
Neg logit contributions
aid-0.019
Of-0.015
 forgiveness-0.014
cher-0.014
 nod-0.014
Token queen
Token ID16599
Feature activation-1.09e+00

Pos logit contributions
aid+0.024
 yes+0.020
Of+0.020
 nod+0.019
 forgiveness+0.018
Neg logit contributions
amp-0.030
 anth-0.029
ines-0.029
attered-0.028
ained-0.027
Token decided
Token ID3066
Feature activation+4.28e-01

Pos logit contributions
amp+0.055
 anth+0.054
attered+0.049
ained+0.048
ines+0.048
Neg logit contributions
aid-0.048
 nodded-0.039
 yes-0.038
 forgive-0.038
 nod-0.038
Token to
Token ID284
Feature activation-4.47e-01

Pos logit contributions
aid+0.025
Of+0.020
 nod+0.019
 yes+0.019
 forgiveness+0.019
Neg logit contributions
 anth-0.017
amp-0.015
ines-0.015
 problem-0.015
attered-0.015
Token bake
Token ID28450
Feature activation+7.48e+00

Pos logit contributions
 anth+0.026
amp+0.026
ines+0.025
attered+0.025
ained+0.024
Neg logit contributions
aid-0.016
Of-0.012
 yes-0.012
 nod-0.012
 forgiveness-0.011
Token a
Token ID257
Feature activation+9.11e-01

Pos logit contributions
aid+0.398
Of+0.350
 nod+0.318
 forgiveness+0.307
 stro+0.300
Neg logit contributions
 anth-0.293
amp-0.281
ines-0.281
ained-0.264
 problem-0.260
Token y
Token ID331
Feature activation-1.18e+00

Pos logit contributions
aid+0.042
Of+0.036
 yes+0.035
 nod+0.034
 forgiveness+0.033
Neg logit contributions
amp-0.043
 anth-0.042
ines-0.042
attered-0.042
ained-0.041
Tokenummy
Token ID13513
Feature activation-2.39e+00

Pos logit contributions
 anth+0.033
 ago+0.032
amp+0.032
ines+0.031
ained+0.031
Neg logit contributions
aid-0.074
 nod-0.062
 forgiveness-0.061
Of-0.060
 yes-0.059
Token cake
Token ID12187
Feature activation+8.18e-01

Pos logit contributions
 anth+0.109
amp+0.105
ines+0.103
attered+0.098
 problem+0.095
Neg logit contributions
aid-0.119
Of-0.098
 yes-0.095
 nod-0.094
 forgiveness-0.091
Token for
Token ID329
Feature activation-4.92e+00

Pos logit contributions
aid+0.045
Of+0.037
 yes+0.037
 nod+0.036
 forgiveness+0.035
Neg logit contributions
 anth-0.034
amp-0.032
ines-0.031
attered-0.030
ained-0.028
Token intimidated
Token ID36601
Feature activation-3.15e+00

Pos logit contributions
 anth+0.037
amp+0.036
ines+0.035
attered+0.033
ained+0.032
Neg logit contributions
aid-0.038
Of-0.032
 yes-0.031
 nod-0.031
 forgive-0.030
Token and
Token ID290
Feature activation+5.65e-01

Pos logit contributions
 anth+0.131
amp+0.127
ines+0.125
attered+0.117
 problem+0.115
Neg logit contributions
aid-0.171
Of-0.141
 yes-0.138
 nod-0.136
 forgiveness-0.133
Token asked
Token ID1965
Feature activation-1.17e+00

Pos logit contributions
aid+0.029
Of+0.024
 yes+0.024
 nod+0.023
 forgiveness+0.022
Neg logit contributions
 anth-0.025
amp-0.025
ines-0.024
attered-0.023
ained-0.022
Token
Token ID6184
Feature activation-8.09e-01

Pos logit contributions
 anth+0.047
ines+0.044
amp+0.043
 problem+0.043
 ago+0.040
Neg logit contributions
aid-0.064
Of-0.050
 nod-0.048
 forgiveness-0.047
 yes-0.046
Token
Token ID95
Feature activation-1.77e+00

Pos logit contributions
amp+0.037
 anth+0.037
ines+0.031
 earthquake+0.029
 problem+0.029
Neg logit contributions
aid-0.043
Of-0.034
 forgiveness-0.034
 yes-0.033
 nod-0.033
Token
Token ID26391
Feature activation+4.51e+00

Pos logit contributions
amp+0.080
 anth+0.076
ines+0.069
attered+0.065
 ago+0.065
Neg logit contributions
aid-0.096
 yes-0.079
 forgiveness-0.075
Of-0.075
 nod-0.073
Token
Token ID129
Feature activation+2.55e+00

Pos logit contributions
aid+0.223
Of+0.177
 forgiveness+0.171
 yes+0.170
 nod+0.168
Neg logit contributions
 anth-0.216
amp-0.211
ines-0.201
attered-0.190
 problem-0.186
Token
Token ID241
Feature activation-3.18e+00

Pos logit contributions
aid+0.131
 nod+0.111
Of+0.111
 yes+0.107
 stro+0.104
Neg logit contributions
 anth-0.100
ines-0.097
attered-0.095
amp-0.095
 problem-0.091
TokenWhy
Token ID5195
Feature activation+3.05e+00

Pos logit contributions
amp+0.167
 anth+0.163
ines+0.157
 ago+0.151
attered+0.151
Neg logit contributions
aid-0.137
 forgiveness-0.106
 nod-0.098
 heal-0.097
 yes-0.095
Token not
Token ID407
Feature activation-1.63e+00

Pos logit contributions
aid+0.144
Of+0.116
 forgiveness+0.105
 yes+0.103
 nod+0.102
Neg logit contributions
 anth-0.143
amp-0.142
ines-0.140
 problem-0.135
ained-0.131
Token?
Token ID30
Feature activation-4.39e-01

Pos logit contributions
 anth+0.066
amp+0.064
ines+0.062
attered+0.059
 problem+0.056
Neg logit contributions
aid-0.089
Of-0.075
 yes-0.073
 nod-0.072
 forgiveness-0.070
Token watching
Token ID4964
Feature activation+3.13e+00

Pos logit contributions
aid+0.043
Of+0.035
 yes+0.035
 forgiveness+0.034
 nod+0.034
Neg logit contributions
 anth-0.039
amp-0.038
ines-0.037
attered-0.036
ained-0.034
Token TV
Token ID3195
Feature activation+3.46e-01

Pos logit contributions
aid+0.163
Of+0.134
 yes+0.129
 nod+0.128
 forgiveness+0.124
Neg logit contributions
 anth-0.137
amp-0.132
ines-0.128
attered-0.121
 problem-0.119
Token today
Token ID1909
Feature activation-6.56e-01

Pos logit contributions
aid+0.018
Of+0.015
 yes+0.015
 nod+0.014
 forgiveness+0.014
Neg logit contributions
 anth-0.015
amp-0.014
ines-0.014
attered-0.013
 problem-0.013
Token,
Token ID11
Feature activation-2.67e-01

Pos logit contributions
 anth+0.026
amp+0.025
ines+0.025
attered+0.023
 problem+0.023
Neg logit contributions
aid-0.037
Of-0.031
 yes-0.029
 nod-0.029
 forgiveness-0.029
Token why
Token ID1521
Feature activation+3.03e+00

Pos logit contributions
 anth+0.012
amp+0.011
ines+0.011
 problem+0.011
 ago+0.011
Neg logit contributions
aid-0.014
Of-0.011
 nod-0.010
 yes-0.010
 forgiveness-0.010
Token don
Token ID836
Feature activation+5.38e+00

Pos logit contributions
aid+0.136
Of+0.108
 yes+0.099
 nod+0.099
 forgiveness+0.096
Neg logit contributions
 anth-0.152
amp-0.146
ines-0.146
 problem-0.141
attered-0.135
Token't
Token ID470
Feature activation-2.15e+00

Pos logit contributions
aid+0.268
Of+0.220
 yes+0.220
 nod+0.215
 forgiveness+0.209
Neg logit contributions
 anth-0.246
amp-0.230
ines-0.230
attered-0.226
ained-0.206
Token you
Token ID345
Feature activation+1.70e+00

Pos logit contributions
 anth+0.125
amp+0.123
ines+0.121
attered+0.118
ained+0.114
Neg logit contributions
aid-0.079
Of-0.061
 yes-0.061
 nod-0.060
 forgiveness-0.058
Token send
Token ID3758
Feature activation+4.86e+00

Pos logit contributions
aid+0.074
Of+0.059
 yes+0.058
 nod+0.058
 forgive+0.055
Neg logit contributions
 anth-0.088
amp-0.085
ines-0.083
attered-0.081
ained-0.077
Token your
Token ID534
Feature activation+1.31e+00

Pos logit contributions
aid+0.251
Of+0.208
 yes+0.201
 nod+0.200
 forgiveness+0.192
Neg logit contributions
 anth-0.210
amp-0.202
ines-0.199
attered-0.188
 problem-0.185
Token friend
Token ID1545
Feature activation+1.84e+00

Pos logit contributions
aid+0.066
Of+0.054
 yes+0.052
 nod+0.052
 forgiveness+0.050
Neg logit contributions
 anth-0.059
amp-0.057
ines-0.056
attered-0.053
 problem-0.052
Token,
Token ID11
Feature activation-1.70e-01

Pos logit contributions
 anth+0.025
 problem+0.024
ines+0.023
 ago+0.022
amp+0.021
Neg logit contributions
aid-0.049
Of-0.041
 nod-0.038
 sew-0.038
Sleep-0.037
Token "
Token ID366
Feature activation-1.20e+00

Pos logit contributions
 anth+0.007
amp+0.006
 ago+0.006
ines+0.006
 problem+0.006
Neg logit contributions
aid-0.010
Of-0.007
 forgiveness-0.007
 nod-0.007
 sew-0.007
TokenIt
Token ID1026
Feature activation+1.28e+00

Pos logit contributions
amp+0.063
 anth+0.060
attered+0.060
ained+0.059
ines+0.058
Neg logit contributions
aid-0.049
 nod-0.039
 forgiveness-0.038
 yes-0.038
 forgive-0.035
Token's
Token ID338
Feature activation-5.65e-02

Pos logit contributions
aid+0.070
Of+0.059
 yes+0.057
 nod+0.057
 forgiveness+0.055
Neg logit contributions
 anth-0.053
amp-0.050
ines-0.049
attered-0.047
 problem-0.045
Token Match
Token ID13225
Feature activation-6.24e-01

Pos logit contributions
 anth+0.003
amp+0.003
ines+0.003
attered+0.003
 argument+0.002
Neg logit contributions
aid-0.002
 yes-0.002
 forgiveness-0.002
 nod-0.002
Of-0.002
Token Day
Token ID3596
Feature activation+5.03e+00

Pos logit contributions
 anth+0.024
amp+0.023
ines+0.022
attered+0.021
 problem+0.020
Neg logit contributions
aid-0.036
Of-0.030
 yes-0.030
 nod-0.029
 forgiveness-0.029
Token!
Token ID0
Feature activation-1.42e+00

Pos logit contributions
aid+0.271
 yes+0.231
Of+0.231
 nod+0.224
 forgiveness+0.217
Neg logit contributions
 anth-0.205
amp-0.193
ines-0.188
attered-0.185
ained-0.174
Token Every
Token ID3887
Feature activation-8.24e-01

Pos logit contributions
 anth+0.058
amp+0.058
 ago+0.057
ained+0.053
ines+0.050
Neg logit contributions
aid-0.075
 yes-0.055
 forgiveness-0.055
 nod-0.054
Of-0.053
Token Sunday
Token ID3502
Feature activation+1.53e+00

Pos logit contributions
 anth+0.029
amp+0.028
ines+0.026
attered+0.026
ained+0.024
Neg logit contributions
aid-0.050
 yes-0.043
Of-0.043
 nod-0.042
 forgiveness-0.042
Token,
Token ID11
Feature activation-2.10e-01

Pos logit contributions
aid+0.074
Of+0.060
 yes+0.058
 nod+0.057
 forgiveness+0.056
Neg logit contributions
 anth-0.072
amp-0.070
ines-0.068
attered-0.065
 problem-0.064
Token we
Token ID356
Feature activation-5.77e-01

Pos logit contributions
 anth+0.011
amp+0.010
ines+0.010
attered+0.009
 problem+0.009
Neg logit contributions
aid-0.010
Of-0.008
 yes-0.007
 nod-0.007
 forgiveness-0.007
Token He
Token ID679
Feature activation+8.43e-01

Pos logit contributions
 anth+0.003
ines+0.003
amp+0.003
attered+0.003
 problem+0.003
Neg logit contributions
aid-0.002
Of-0.001
 nod-0.001
 yes-0.001
 forgiveness-0.001
Token asked
Token ID1965
Feature activation-1.13e+00

Pos logit contributions
aid+0.043
Of+0.036
 yes+0.035
 nod+0.034
 forgive+0.032
Neg logit contributions
 anth-0.038
amp-0.037
ines-0.036
attered-0.035
ained-0.034
Token one
Token ID530
Feature activation-4.70e+00

Pos logit contributions
 anth+0.054
ines+0.051
amp+0.050
 problem+0.048
attered+0.047
Neg logit contributions
aid-0.054
Of-0.044
 nod-0.041
 yes-0.041
 forgiveness-0.039
Token more
Token ID517
Feature activation+3.58e+00

Pos logit contributions
 anth+0.167
ines+0.156
amp+0.154
 problem+0.145
attered+0.140
Neg logit contributions
aid-0.282
Of-0.235
 yes-0.229
 nod-0.227
 forgiveness-0.224
Token time
Token ID640
Feature activation-1.75e+00

Pos logit contributions
aid+0.178
Of+0.146
 yes+0.142
 nod+0.140
 forgiveness+0.136
Neg logit contributions
 anth-0.166
amp-0.159
ines-0.156
attered-0.149
 problem-0.144
Token if
Token ID611
Feature activation+5.09e+00

Pos logit contributions
 anth+0.058
ines+0.057
 ago+0.055
amp+0.054
 problem+0.052
Neg logit contributions
aid-0.106
Of-0.089
 nod-0.083
 forgiveness-0.081
 stro-0.078
Token he
Token ID339
Feature activation-1.30e+00

Pos logit contributions
aid+0.244
Of+0.191
 nod+0.187
 stro+0.168
 yes+0.168
Neg logit contributions
 anth-0.240
 problem-0.224
amp-0.217
ines-0.216
 cloudy-0.212
Token could
Token ID714
Feature activation+4.78e+00

Pos logit contributions
 anth+0.049
amp+0.047
ines+0.047
attered+0.044
 problem+0.042
Neg logit contributions
aid-0.073
Of-0.062
 nod-0.060
 forgiveness-0.060
 yes-0.059
Token keep
Token ID1394
Feature activation+3.84e+00

Pos logit contributions
aid+0.235
Of+0.195
 forgiveness+0.180
 yes+0.175
 nod+0.171
Neg logit contributions
 anth-0.219
amp-0.215
ines-0.214
attered-0.193
 problem-0.192
Token the
Token ID262
Feature activation+5.76e-01

Pos logit contributions
aid+0.214
Of+0.179
 nod+0.172
 yes+0.166
 forgive+0.163
Neg logit contributions
 anth-0.158
amp-0.150
ines-0.148
 problem-0.139
attered-0.137
Token pants
Token ID12581
Feature activation+5.60e-01

Pos logit contributions
aid+0.028
Of+0.023
 yes+0.022
 nod+0.022
 forgiveness+0.021
Neg logit contributions
 anth-0.027
amp-0.026
ines-0.026
attered-0.025
 problem-0.024
Token us
Token ID514
Feature activation+9.54e-01

Pos logit contributions
 anth+0.092
amp+0.089
ines+0.088
attered+0.086
ained+0.082
Neg logit contributions
aid-0.066
Of-0.052
 yes-0.050
 nod-0.049
 forgiveness-0.048
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
aid+0.047
 yes+0.038
Of+0.038
 nod+0.038
 forgive+0.036
Neg logit contributions
 anth-0.044
amp-0.042
ines-0.041
attered-0.040
 problem-0.038
Token We
Token ID775
Feature activation+5.68e-01

Pos logit contributions
 ago+0.001
amp+0.001
attered+0.001
 anth+0.000
ines+0.000
Neg logit contributions
aid-0.001
Of-0.001
 forgiveness-0.001
 yes-0.001
 band-0.001
Token are
Token ID389
Feature activation+1.01e+00

Pos logit contributions
aid+0.026
 forgive+0.020
 nod+0.020
 yes+0.020
Of+0.020
Neg logit contributions
 anth-0.030
amp-0.029
attered-0.027
ines-0.027
ained-0.026
Token brave
Token ID14802
Feature activation+1.46e+00

Pos logit contributions
aid+0.049
 yes+0.040
Of+0.040
 nod+0.039
 forgiveness+0.039
Neg logit contributions
 anth-0.047
amp-0.047
ines-0.044
attered-0.043
ained-0.041
Token and
Token ID290
Feature activation+7.26e-01

Pos logit contributions
aid+0.066
 yes+0.053
 nod+0.053
Of+0.053
 forgiveness+0.051
Neg logit contributions
 anth-0.072
amp-0.071
ines-0.067
attered-0.066
ained-0.064
Token strong
Token ID1913
Feature activation-6.33e-01

Pos logit contributions
aid+0.034
Of+0.027
 forgiveness+0.026
 nod+0.025
 yes+0.024
Neg logit contributions
 anth-0.031
 problem-0.030
 ago-0.030
 cloudy-0.029
attered-0.029
Token.
Token ID13
Feature activation-7.73e-02

Pos logit contributions
 anth+0.028
amp+0.027
ines+0.026
attered+0.025
ained+0.024
Neg logit contributions
aid-0.032
Of-0.026
 yes-0.026
 nod-0.026
 forgiveness-0.025
Token We
Token ID775
Feature activation+5.29e-01

Pos logit contributions
 ago+0.002
 anth+0.002
attered+0.002
amp+0.002
>+0.002
Neg logit contributions
aid-0.005
 forgiveness-0.004
Of-0.004
 yes-0.004
 band-0.004
Token can
Token ID460
Feature activation+4.36e+00

Pos logit contributions
aid+0.024
Of+0.019
 nod+0.019
 yes+0.019
 forgive+0.018
Neg logit contributions
 anth-0.027
amp-0.026
ines-0.025
attered-0.024
ained-0.023
Token manage
Token ID6687
Feature activation+1.73e+00

Pos logit contributions
aid+0.194
Of+0.156
 yes+0.145
 forgiveness+0.141
 nod+0.138
Neg logit contributions
 anth-0.222
ines-0.214
amp-0.214
attered-0.197
 problem-0.197
Token that
Token ID326
Feature activation-2.68e+00

Pos logit contributions
 anth+0.038
amp+0.036
ines+0.035
attered+0.034
ained+0.033
Neg logit contributions
aid-0.042
Of-0.035
 yes-0.034
 nod-0.034
 forgiveness-0.033
Token the
Token ID262
Feature activation+6.62e-01

Pos logit contributions
 anth+0.131
amp+0.128
ines+0.126
attered+0.120
ained+0.113
Neg logit contributions
aid-0.126
Of-0.102
 yes-0.101
 nod-0.098
 forgiveness-0.097
Token man
Token ID582
Feature activation-7.32e-01

Pos logit contributions
aid+0.033
Of+0.027
 yes+0.026
 nod+0.025
 forgiveness+0.025
Neg logit contributions
 anth-0.030
 problem-0.028
amp-0.028
ines-0.027
 argument-0.027
Token was
Token ID373
Feature activation-2.00e+00

Pos logit contributions
 anth+0.027
ines+0.026
amp+0.026
attered+0.024
 problem+0.024
Neg logit contributions
aid-0.042
Of-0.035
 nod-0.034
 yes-0.034
 forgiveness-0.033
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
 anth+0.089
amp+0.087
ines+0.083
attered+0.081
 problem+0.075
Neg logit contributions
aid-0.101
 yes-0.084
Of-0.082
 nod-0.082
 forgiveness-0.080
Token hunter
Token ID19177
Feature activation+3.70e-01

Pos logit contributions
aid+0.051
Of+0.041
 yes+0.040
 nod+0.039
 forgiveness+0.038
Neg logit contributions
 anth-0.048
amp-0.046
ines-0.045
 problem-0.043
attered-0.043
Token and
Token ID290
Feature activation+4.87e-01

Pos logit contributions
aid+0.020
Of+0.016
 nod+0.016
 yes+0.016
 forgiveness+0.015
Neg logit contributions
 anth-0.015
amp-0.015
ines-0.015
attered-0.014
 problem-0.013
Token he
Token ID339
Feature activation-1.57e+00

Pos logit contributions
aid+0.023
Of+0.019
 yes+0.018
 nod+0.018
 forgiveness+0.018
Neg logit contributions
 anth-0.023
amp-0.022
ines-0.022
attered-0.021
 problem-0.020
Token was
Token ID373
Feature activation-2.00e+00

Pos logit contributions
 anth+0.070
amp+0.067
ines+0.065
attered+0.063
ained+0.060
Neg logit contributions
aid-0.081
Of-0.067
 yes-0.067
 nod-0.065
 forgive-0.063
Token hunting
Token ID10988
Feature activation-9.91e-01

Pos logit contributions
 anth+0.089
amp+0.086
ines+0.084
attered+0.080
 problem+0.076
Neg logit contributions
aid-0.103
Of-0.085
 yes-0.083
 nod-0.082
 forgiveness-0.080
Token for
Token ID329
Feature activation-4.62e+00

Pos logit contributions
 anth+0.039
amp+0.038
ines+0.036
attered+0.035
ained+0.033
Neg logit contributions
aid-0.055
 yes-0.047
Of-0.046
 nod-0.046
 forgiveness-0.045
Token to
Token ID284
Feature activation-1.60e-01

Pos logit contributions
aid+0.004
Of+0.004
 nod+0.003
 yes+0.003
 forgiveness+0.003
Neg logit contributions
 anth-0.003
amp-0.003
ines-0.003
attered-0.003
 problem-0.003
Token join
Token ID4654
Feature activation+4.80e+00

Pos logit contributions
 anth+0.009
amp+0.009
ines+0.009
attered+0.008
 problem+0.008
Neg logit contributions
aid-0.006
Of-0.005
 yes-0.005
 nod-0.005
 forgiveness-0.004
Token them
Token ID606
Feature activation+2.04e+00

Pos logit contributions
aid+0.279
Of+0.232
 nod+0.221
 yes+0.218
 forgiveness+0.217
Neg logit contributions
 anth-0.182
amp-0.177
ines-0.174
attered-0.158
 argument-0.154
Token and
Token ID290
Feature activation+9.03e-01

Pos logit contributions
aid+0.100
 yes+0.084
Of+0.083
 nod+0.081
 forgiveness+0.078
Neg logit contributions
 anth-0.093
amp-0.089
attered-0.086
ines-0.086
ained-0.080
Token show
Token ID905
Feature activation+8.63e-01

Pos logit contributions
aid+0.040
Of+0.032
 yes+0.031
 nod+0.030
 forgiveness+0.029
Neg logit contributions
 anth-0.046
amp-0.044
ines-0.043
attered-0.041
 problem-0.040
Token them
Token ID606
Feature activation+2.01e+00

Pos logit contributions
aid+0.049
Of+0.041
 yes+0.040
 nod+0.040
 forgiveness+0.039
Neg logit contributions
 anth-0.033
amp-0.031
ines-0.031
attered-0.029
 problem-0.028
Token his
Token ID465
Feature activation+6.18e-01

Pos logit contributions
aid+0.101
Of+0.083
 yes+0.082
 nod+0.081
 forgiveness+0.079
Neg logit contributions
 anth-0.091
amp-0.088
ines-0.084
attered-0.083
ained-0.078
Token copper
Token ID15317
Feature activation-7.47e-01

Pos logit contributions
aid+0.029
 yes+0.024
 nod+0.024
Of+0.024
 forgiveness+0.023
Neg logit contributions
 anth-0.029
amp-0.028
ines-0.027
attered-0.027
ained-0.026
Token coins
Token ID10796
Feature activation+3.83e-01

Pos logit contributions
 anth+0.034
amp+0.033
ines+0.032
attered+0.031
 problem+0.030
Neg logit contributions
aid-0.037
Of-0.031
 yes-0.030
 nod-0.029
 forgiveness-0.028
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
aid+0.018
 yes+0.015
 nod+0.014
Of+0.014
 forgive+0.014
Neg logit contributions
 anth-0.019
amp-0.018
ines-0.016
attered-0.016
ained-0.016
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
aid+0.009
Of+0.007
 yes+0.007
 nod+0.007
 forgiveness+0.007
Neg logit contributions
 anth-0.010
amp-0.010
ines-0.010
attered-0.009
ained-0.009
Token worry
Token ID5490
Feature activation+4.17e+00

Pos logit contributions
 anth+0.014
ines+0.013
 argument+0.013
 problem+0.013
amp+0.013
Neg logit contributions
aid-0.019
Of-0.015
 forgiveness-0.013
 yes-0.013
 stro-0.013
Token so
Token ID523
Feature activation-3.78e-01

Pos logit contributions
aid+0.228
Of+0.190
 yes+0.184
 nod+0.183
 forgiveness+0.178
Neg logit contributions
 anth-0.170
amp-0.163
ines-0.160
attered-0.152
 problem-0.146
Token she
Token ID673
Feature activation-9.85e-01

Pos logit contributions
 anth+0.020
amp+0.019
ines+0.019
attered+0.018
 problem+0.018
Neg logit contributions
aid-0.017
Of-0.013
 nod-0.012
 yes-0.012
 forgiveness-0.012
Token kept
Token ID4030
Feature activation+8.99e-01

Pos logit contributions
 anth+0.038
amp+0.036
ines+0.035
attered+0.034
 problem+0.032
Neg logit contributions
aid-0.057
Of-0.047
 yes-0.046
 nod-0.046
 forgiveness-0.045
Token walking
Token ID6155
Feature activation-8.18e-01

Pos logit contributions
aid+0.057
Of+0.048
 yes+0.045
 nod+0.044
 forgiveness+0.043
Neg logit contributions
 anth-0.031
amp-0.028
ines-0.028
 problem-0.026
 argument-0.026
Token.
Token ID13
Feature activation-2.64e-01

Pos logit contributions
 anth+0.034
amp+0.033
ines+0.032
attered+0.031
 problem+0.029
Neg logit contributions
aid-0.044
Of-0.037
 yes-0.036
 nod-0.035
 forgiveness-0.035
Token She
Token ID1375
Feature activation+4.42e-01

Pos logit contributions
 anth+0.013
amp+0.012
ines+0.012
attered+0.012
ained+0.011
Neg logit contributions
aid-0.013
Of-0.010
 yes-0.010
 nod-0.010
 forgiveness-0.009
Token kept
Token ID4030
Feature activation+8.89e-01

Pos logit contributions
aid+0.023
Of+0.019
 yes+0.019
 nod+0.019
 forgive+0.018
Neg logit contributions
 anth-0.019
amp-0.019
ines-0.018
attered-0.017
ained-0.017
Token looking
Token ID2045
Feature activation-4.02e-01

Pos logit contributions
aid+0.061
Of+0.051
 yes+0.049
 forgiveness+0.048
 nod+0.048
Neg logit contributions
 anth-0.026
amp-0.024
ines-0.023
 problem-0.021
 argument-0.021
Token back
Token ID736
Feature activation+4.65e-01

Pos logit contributions
 anth+0.015
amp+0.014
ines+0.013
attered+0.012
ained+0.012
Neg logit contributions
aid-0.024
 yes-0.021
 nod-0.021
Of-0.021
 forgive-0.020
Token at
Token ID379
Feature activation-2.58e+00

Pos logit contributions
aid+0.023
Of+0.019
 yes+0.019
 nod+0.019
 forgiveness+0.018
Neg logit contributions
 anth-0.021
amp-0.020
ines-0.019
attered-0.019
 argument-0.018
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
aid+0.047
 yes+0.038
Of+0.038
 nod+0.036
 forgiveness+0.035
Neg logit contributions
 anth-0.052
amp-0.052
ines-0.051
>-0.047
ained-0.046
TokenTom
Token ID13787
Feature activation-8.62e-01

Pos logit contributions
aid+0.051
 yes+0.044
 nod+0.041
Of+0.041
 forgiveness+0.040
Neg logit contributions
amp-0.054
 anth-0.054
ines-0.052
>-0.048
ained-0.046
Token and
Token ID290
Feature activation+6.34e-01

Pos logit contributions
 anth+0.031
ines+0.030
amp+0.030
 problem+0.026
attered+0.026
Neg logit contributions
aid-0.050
Of-0.043
 yes-0.043
 forgiveness-0.042
 heal-0.039
Token Lily
Token ID20037
Feature activation-1.79e-01

Pos logit contributions
aid+0.032
Of+0.025
 yes+0.025
 nod+0.024
 forgiveness+0.024
Neg logit contributions
 anth-0.029
amp-0.027
ines-0.026
 problem-0.025
attered-0.025
Token smiled
Token ID13541
Feature activation+3.60e+00

Pos logit contributions
 anth+0.008
 all+0.007
 argument+0.007
amp+0.007
 scattered+0.007
Neg logit contributions
aid-0.008
Of-0.007
 forgiveness-0.007
Sweet-0.006
Sleep-0.006
Token and
Token ID290
Feature activation+5.86e-01

Pos logit contributions
aid+0.197
Of+0.163
 yes+0.157
 forgiveness+0.152
 nod+0.150
Neg logit contributions
 anth-0.148
amp-0.140
ines-0.139
attered-0.134
 problem-0.130
Token said
Token ID531
Feature activation-7.69e-01

Pos logit contributions
aid+0.030
 forgiveness+0.027
 yes+0.026
Will+0.026
Of+0.025
Neg logit contributions
 scattered-0.019
 anth-0.018
 all-0.018
ashed-0.017
 collapsed-0.017
Token,
Token ID11
Feature activation-4.10e-01

Pos logit contributions
 problem+0.020
ines+0.019
 ago+0.016
 everywhere+0.016
 all+0.016
Neg logit contributions
aid-0.048
Of-0.044
 sew-0.041
My-0.039
Elsa-0.039
Token "
Token ID366
Feature activation-1.55e+00

Pos logit contributions
storm+0.009
 ago+0.009
 anth+0.008
 everywhere+0.008
>+0.008
Neg logit contributions
aid-0.028
 forgiveness-0.023
 stitches-0.021
Of-0.021
ieve-0.020
TokenWe
Token ID1135
Feature activation-1.31e-01

Pos logit contributions
attered+0.054
amp+0.047
y+0.045
 anth+0.044
ained+0.042
Neg logit contributions
aid-0.074
 yes-0.073
 forgiveness-0.071
 hello-0.069
 hi-0.066
Token love
Token ID1842
Feature activation+2.63e+00

Pos logit contributions
 anth+0.006
ines+0.005
amp+0.005
 problem+0.005
attered+0.005
Neg logit contributions
aid-0.007
Of-0.006
 yes-0.005
 forgiveness-0.005
 nod-0.005
Token ask
Token ID1265
Feature activation+5.17e+00

Pos logit contributions
aid+0.144
Of+0.115
 yes+0.108
 forgiveness+0.106
 nod+0.105
Neg logit contributions
 anth-0.166
amp-0.159
ines-0.157
attered-0.149
 problem-0.145
Token the
Token ID262
Feature activation+7.15e-01

Pos logit contributions
aid+0.264
Of+0.216
 nod+0.203
 yes+0.203
 forgiveness+0.193
Neg logit contributions
 anth-0.235
ines-0.220
amp-0.219
 problem-0.204
attered-0.202
Token birds
Token ID10087
Feature activation+2.65e+00

Pos logit contributions
aid+0.038
Of+0.030
 yes+0.030
 nod+0.029
 forgiveness+0.029
Neg logit contributions
 anth-0.032
amp-0.029
ines-0.029
 problem-0.028
attered-0.027
Token who
Token ID508
Feature activation+3.10e+00

Pos logit contributions
aid+0.152
Of+0.128
 yes+0.119
 forgiveness+0.119
 nod+0.118
Neg logit contributions
 anth-0.101
ines-0.098
amp-0.095
attered-0.087
 problem-0.086
Token flew
Token ID13112
Feature activation-1.76e-01

Pos logit contributions
aid+0.182
Of+0.155
 yes+0.151
 nod+0.149
 forgiveness+0.146
Neg logit contributions
 anth-0.114
amp-0.109
ines-0.106
attered-0.100
 problem-0.095
Token overhead
Token ID16965
Feature activation-1.83e+00

Pos logit contributions
 anth+0.007
amp+0.007
ines+0.007
attered+0.006
 problem+0.006
Neg logit contributions
aid-0.010
Of-0.008
 yes-0.008
 nod-0.008
 forgive-0.008
Token what
Token ID644
Feature activation+4.99e+00

Pos logit contributions
 anth+0.078
amp+0.075
ines+0.073
attered+0.070
 problem+0.067
Neg logit contributions
aid-0.097
Of-0.080
 yes-0.079
 nod-0.078
 forgiveness-0.075
Token hay
Token ID27678
Feature activation+3.12e-02

Pos logit contributions
aid+0.263
Of+0.217
 yes+0.210
 nod+0.207
 forgiveness+0.201
Neg logit contributions
 anth-0.216
amp-0.203
ines-0.201
attered-0.191
 problem-0.187
Token was
Token ID373
Feature activation-2.02e+00

Pos logit contributions
aid+0.002
 yes+0.001
 nod+0.001
Of+0.001
 forgiveness+0.001
Neg logit contributions
 anth-0.001
amp-0.001
ines-0.001
attered-0.001
ained-0.001
Token,
Token ID11
Feature activation-1.74e-01

Pos logit contributions
 anth+0.092
amp+0.090
attered+0.085
ines+0.081
ained+0.076
Neg logit contributions
aid-0.100
 yes-0.083
 nod-0.081
 forgive-0.079
Of-0.078
Token but
Token ID475
Feature activation-5.34e-01

Pos logit contributions
 anth+0.007
ines+0.007
amp+0.007
attered+0.006
ained+0.006
Neg logit contributions
aid-0.009
Of-0.008
 nod-0.007
 yes-0.007
 forgiveness-0.007
Token
Token ID220
Feature activation-4.47e-01

Pos logit contributions
 anth+0.006
amp+0.006
>+0.006
 Later+0.006
 everywhere+0.006
Neg logit contributions
aid-0.013
 forgiveness-0.011
 heal-0.010
 bless-0.009
 forgiving-0.009
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
 anth+0.026
amp+0.026
ines+0.025
attered+0.024
 problem+0.024
Neg logit contributions
aid-0.016
Of-0.012
 yes-0.012
 nod-0.012
 forgiveness-0.011
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
aid+0.042
Of+0.032
 yes+0.031
 nod+0.031
 forgiveness+0.031
Neg logit contributions
 anth-0.055
amp-0.055
ines-0.054
attered-0.050
ained-0.049
TokenWhen
Token ID2215
Feature activation+4.22e+00

Pos logit contributions
aid+0.048
 forgiveness+0.038
 nod+0.037
 yes+0.037
Of+0.037
Neg logit contributions
 anth-0.051
amp-0.049
ines-0.047
attered-0.046
ained-0.044
Token Lily
Token ID20037
Feature activation-3.38e-01

Pos logit contributions
aid+0.212
Of+0.170
 nod+0.164
 yes+0.163
 forgiveness+0.159
Neg logit contributions
 anth-0.196
amp-0.185
ines-0.182
attered-0.174
 problem-0.173
Token got
Token ID1392
Feature activation-2.47e+00

Pos logit contributions
 anth+0.012
amp+0.011
ines+0.011
attered+0.010
 problem+0.010
Neg logit contributions
aid-0.021
Of-0.018
 yes-0.017
 nod-0.017
 forgiveness-0.017
Token to
Token ID284
Feature activation-4.01e-01

Pos logit contributions
 anth+0.105
amp+0.101
ines+0.099
attered+0.094
 problem+0.090
Neg logit contributions
aid-0.133
Of-0.110
 yes-0.107
 nod-0.106
 forgiveness-0.104
Token the
Token ID262
Feature activation+6.53e-01

Pos logit contributions
 anth+0.017
amp+0.015
 flood+0.014
 argument+0.014
 earthquake+0.014
Neg logit contributions
aid-0.023
Of-0.019
 yes-0.018
 forgiveness-0.017
 nod-0.017
Token building
Token ID2615
Feature activation+1.58e-01

Pos logit contributions
aid+0.038
 nod+0.033
Of+0.033
 yes+0.032
 forgive+0.032
Neg logit contributions
 anth-0.022
amp-0.022
ines-0.021
attered-0.021
ained-0.019
Token,
Token ID11
Feature activation-3.04e-01

Pos logit contributions
aid+0.006
 yes+0.006
 nod+0.005
 forgiveness+0.005
Of+0.005
Neg logit contributions
 anth-0.008
amp-0.008
attered-0.008
ines-0.007
ained-0.007
Token she
Token ID673
Feature activation-9.08e-01

Pos logit contributions
 anth+0.015
amp+0.014
 problem+0.014
 cloudy+0.013
ines+0.013
Neg logit contributions
aid-0.014
Of-0.011
 nod-0.011
 forgiveness-0.010
 forgive-0.010
Token 3
Token ID513
Feature activation-3.49e-02

Pos logit contributions
aid+0.033
Of+0.027
 yes+0.026
 nod+0.026
 forgiveness+0.025
Neg logit contributions
 anth-0.027
amp-0.025
ines-0.024
 problem-0.024
attered-0.024
Token year
Token ID614
Feature activation-3.79e+00

Pos logit contributions
 anth+0.001
amp+0.001
attered+0.001
ines+0.001
ained+0.001
Neg logit contributions
aid-0.002
 yes-0.002
Of-0.002
 nod-0.002
 forgiveness-0.002
Token old
Token ID1468
Feature activation-9.99e-01

Pos logit contributions
 anth+0.155
amp+0.149
ines+0.145
attered+0.138
ained+0.131
Neg logit contributions
aid-0.208
Of-0.174
 yes-0.169
 nod-0.168
 forgiveness-0.164
Token held
Token ID2714
Feature activation+1.43e+00

Pos logit contributions
 anth+0.036
 problem+0.033
attered+0.032
amp+0.031
 argument+0.031
Neg logit contributions
aid-0.058
Of-0.049
 yes-0.046
 forgiveness-0.046
 nod-0.045
Token on
Token ID319
Feature activation-1.41e+00

Pos logit contributions
aid+0.085
Of+0.073
 nod+0.067
 forgiveness+0.067
 forgive+0.066
Neg logit contributions
 anth-0.048
amp-0.047
attered-0.044
ines-0.044
 everywhere-0.044
Token tight
Token ID5381
Feature activation-1.75e+00

Pos logit contributions
 anth+0.058
amp+0.056
ines+0.055
attered+0.052
 problem+0.050
Neg logit contributions
aid-0.077
Of-0.064
 yes-0.062
 nod-0.061
 forgiveness-0.060
Token.
Token ID13
Feature activation+1.53e-02

Pos logit contributions
 anth+0.071
amp+0.067
ines+0.066
attered+0.063
 problem+0.060
Neg logit contributions
aid-0.097
Of-0.082
 yes-0.078
 nod-0.078
 forgiveness-0.076
Token He
Token ID679
Feature activation+7.98e-01

Pos logit contributions
aid+0.001
 yes+0.001
 forgiveness+0.001
Of+0.001
 nod+0.001
Neg logit contributions
 anth-0.001
amp-0.001
attered-0.001
 ago-0.001
ained-0.001
Token managed
Token ID5257
Feature activation+1.53e+00

Pos logit contributions
aid+0.043
Of+0.036
 yes+0.035
 nod+0.035
 forgiveness+0.034
Neg logit contributions
 anth-0.033
amp-0.031
ines-0.031
attered-0.029
 problem-0.028
Token to
Token ID284
Feature activation-3.44e-01

Pos logit contributions
aid+0.086
Of+0.072
 yes+0.068
 nod+0.068
 forgiveness+0.067
Neg logit contributions
 anth-0.061
amp-0.058
ines-0.057
 problem-0.053
attered-0.053
Token support
Token ID1104
Feature activation+5.23e+00

Pos logit contributions
 anth+0.014
 flood+0.014
amp+0.013
 all+0.013
 argument+0.013
Neg logit contributions
aid-0.017
Of-0.015
 yes-0.013
 forgiveness-0.013
Sweet-0.013
Token was
Token ID373
Feature activation-2.02e+00

Pos logit contributions
aid+0.014
Of+0.012
 yes+0.012
 nod+0.012
 forgiveness+0.011
Neg logit contributions
 anth-0.009
amp-0.009
ines-0.009
attered-0.008
 problem-0.008
Token so
Token ID523
Feature activation-3.29e-01

Pos logit contributions
 anth+0.076
ines+0.073
amp+0.072
 problem+0.069
ained+0.068
Neg logit contributions
aid-0.115
Of-0.100
 nod-0.093
 yes-0.091
 forgiveness-0.089
Token fine
Token ID3734
Feature activation-1.12e+00

Pos logit contributions
 anth+0.015
ines+0.014
attered+0.014
amp+0.014
 problem+0.013
Neg logit contributions
aid-0.016
Of-0.013
 yes-0.013
 forgiveness-0.012
 nod-0.012
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
 anth+0.048
amp+0.046
ines+0.044
attered+0.043
ained+0.041
Neg logit contributions
aid-0.060
Of-0.050
 yes-0.049
 nod-0.048
 forgiveness-0.047
Token The
Token ID383
Feature activation+1.39e+00

Pos logit contributions
 anth+0.007
amp+0.007
attered+0.007
ines+0.007
ained+0.006
Neg logit contributions
aid-0.007
 yes-0.006
 forgiveness-0.006
Of-0.006
 nod-0.006
Token end
Token ID886
Feature activation-1.25e+00

Pos logit contributions
aid+0.079
Of+0.066
 yes+0.064
 nod+0.063
 forgiveness+0.062
Neg logit contributions
 anth-0.056
amp-0.052
ines-0.051
attered-0.049
 problem-0.047
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
 anth+0.056
amp+0.053
ines+0.052
attered+0.051
ained+0.049
Neg logit contributions
aid-0.064
Of-0.053
 nod-0.051
 yes-0.051
 forgiveness-0.050
Token<|endoftext|>
Token ID50256
Feature activation-6.44e+00

Pos logit contributions
 anth+0.008
amp+0.008
ines+0.008
attered+0.008
ained+0.008
Neg logit contributions
aid-0.008
 yes-0.007
 nod-0.006
 forgiveness-0.006
Of-0.006
TokenMary
Token ID24119
Feature activation+1.21e-01

Pos logit contributions
amp+0.324
 anth+0.312
ines+0.309
ained+0.291
>+0.281
Neg logit contributions
aid-0.301
Of-0.242
 yes-0.232
 nod-0.223
 forgiveness-0.223
Token was
Token ID373
Feature activation-2.06e+00

Pos logit contributions
aid+0.007
Of+0.006
 yes+0.005
 nod+0.005
 forgiveness+0.005
Neg logit contributions
 anth-0.005
amp-0.005
ines-0.005
attered-0.004
ained-0.004
Token visiting
Token ID10013
Feature activation+1.58e+00

Pos logit contributions
 anth+0.100
amp+0.098
ines+0.097
attered+0.091
 problem+0.085
Neg logit contributions
aid-0.091
 yes-0.080
 nod-0.075
Of-0.074
 forgive-0.070
TokenAt
Token ID2953
Feature activation-1.29e+00

Pos logit contributions
aid+0.037
Of+0.030
 yes+0.027
 nod+0.026
 forgive+0.026
Neg logit contributions
 anth-0.054
ines-0.051
 problem-0.051
amp-0.051
 argument-0.049
Token the
Token ID262
Feature activation+6.86e-01

Pos logit contributions
attered+0.078
 anth+0.076
amp+0.075
ines+0.075
ooky+0.070
Neg logit contributions
aid-0.043
Of-0.033
 yes-0.033
 nod-0.032
 forgiveness-0.031
Token park
Token ID3952
Feature activation+1.02e+00

Pos logit contributions
aid+0.031
Of+0.027
 nod+0.026
 yes+0.026
 forgive+0.025
Neg logit contributions
 anth-0.032
ines-0.031
attered-0.031
amp-0.030
 ago-0.029
Token,
Token ID11
Feature activation-3.45e-01

Pos logit contributions
aid+0.039
 nod+0.034
 yes+0.034
Of+0.033
 forgive+0.032
Neg logit contributions
 anth-0.055
amp-0.054
attered-0.051
ines-0.051
ained-0.048
Token Tim
Token ID5045
Feature activation-2.94e+00

Pos logit contributions
 anth+0.014
amp+0.013
ines+0.013
attered+0.012
 problem+0.012
Neg logit contributions
aid-0.019
Of-0.016
 yes-0.016
 nod-0.016
 forgiveness-0.015
Token saw
Token ID2497
Feature activation-2.86e+00

Pos logit contributions
 anth+0.126
amp+0.125
ines+0.122
attered+0.121
 earthquake+0.117
Neg logit contributions
aid-0.134
 nod-0.117
 yes-0.115
Of-0.113
 nodded-0.113
Token another
Token ID1194
Feature activation+2.45e-01

Pos logit contributions
 anth+0.155
amp+0.148
ines+0.146
attered+0.142
ained+0.138
Neg logit contributions
aid-0.119
Of-0.094
 yes-0.090
 nod-0.090
 forgiveness-0.085
Token little
Token ID1310
Feature activation+7.89e-01

Pos logit contributions
aid+0.007
 nod+0.006
Of+0.006
 yes+0.006
 forgive+0.005
Neg logit contributions
amp-0.015
attered-0.015
ained-0.014
ines-0.014
 anth-0.014
Token boy
Token ID2933
Feature activation+5.14e-01

Pos logit contributions
aid+0.032
Of+0.025
 nod+0.024
 yes+0.024
 forgiveness+0.023
Neg logit contributions
 anth-0.043
amp-0.043
ines-0.042
attered-0.040
ained-0.039
Token named
Token ID3706
Feature activation-1.14e+00

Pos logit contributions
aid+0.029
Of+0.025
 yes+0.024
 nod+0.024
 forgiveness+0.023
Neg logit contributions
 anth-0.020
amp-0.019
ines-0.019
attered-0.018
 problem-0.017
Token Sam
Token ID3409
Feature activation-2.10e+00

Pos logit contributions
 anth+0.041
amp+0.037
ines+0.033
 problem+0.031
 earthquake+0.031
Neg logit contributions
aid-0.073
Of-0.059
 yes-0.058
 nod-0.057
 forgiveness-0.055
Token the
Token ID262
Feature activation+6.73e-01

Pos logit contributions
 anth+0.033
amp+0.032
ines+0.031
attered+0.030
 problem+0.029
Neg logit contributions
aid-0.034
Of-0.028
 yes-0.026
 nod-0.026
 forgiveness-0.025
Token jewel
Token ID16927
Feature activation+1.65e+00

Pos logit contributions
aid+0.035
Of+0.029
 yes+0.029
 nod+0.028
 forgiveness+0.027
Neg logit contributions
 anth-0.030
amp-0.029
ines-0.028
attered-0.027
ained-0.026
Token and
Token ID290
Feature activation+6.86e-01

Pos logit contributions
aid+0.084
 yes+0.074
 nod+0.069
Of+0.069
 forgive+0.066
Neg logit contributions
 anth-0.073
amp-0.070
ines-0.065
attered-0.065
ained-0.062
Token the
Token ID262
Feature activation+6.56e-01

Pos logit contributions
aid+0.039
Of+0.033
 yes+0.033
 nod+0.032
 forgiveness+0.031
Neg logit contributions
 anth-0.027
amp-0.027
ines-0.026
attered-0.024
 problem-0.023
Token ball
Token ID2613
Feature activation-7.70e-01

Pos logit contributions
aid+0.033
Of+0.026
 nod+0.025
 yes+0.025
 forgiveness+0.024
Neg logit contributions
 anth-0.030
amp-0.027
 problem-0.027
ines-0.027
 argument-0.026
Token all
Token ID477
Feature activation-9.54e+00

Pos logit contributions
 anth+0.030
amp+0.028
ines+0.026
attered+0.025
 problem+0.024
Neg logit contributions
aid-0.044
 yes-0.039
 nod-0.037
Of-0.037
 forgive-0.036
Token day
Token ID1110
Feature activation-2.41e+00

Pos logit contributions
 anth+0.366
amp+0.316
ines+0.297
attered+0.284
 problem+0.280
Neg logit contributions
aid-0.580
Of-0.490
 nod-0.484
 yes-0.480
 forgive-0.473
Token long
Token ID890
Feature activation-4.90e+00

Pos logit contributions
 anth+0.107
amp+0.103
ines+0.098
attered+0.095
ained+0.093
Neg logit contributions
aid-0.116
 yes-0.109
 nod-0.099
Of-0.097
 forgive-0.093
Token.
Token ID13
Feature activation-3.40e-02

Pos logit contributions
 anth+0.216
amp+0.207
ines+0.200
attered+0.194
 problem+0.184
Neg logit contributions
aid-0.252
 yes-0.208
Of-0.207
 nod-0.201
 forgiveness-0.197
Token They
Token ID1119
Feature activation-2.03e-01

Pos logit contributions
 anth+0.002
amp+0.002
ines+0.002
attered+0.002
 problem+0.002
Neg logit contributions
aid-0.001
Of-0.001
 yes-0.001
 nod-0.001
 forgiveness-0.001
Token learned
Token ID4499
Feature activation+2.86e+00

Pos logit contributions
 anth+0.008
amp+0.008
attered+0.008
ines+0.008
 problem+0.007
Neg logit contributions
aid-0.011
 yes-0.009
Of-0.009
 forgive-0.009
 nod-0.009
Token having
Token ID1719
Feature activation-7.21e-01

Pos logit contributions
 anth+0.092
amp+0.090
ines+0.088
attered+0.082
 problem+0.078
Neg logit contributions
aid-0.122
Of-0.103
 yes-0.101
 nod-0.099
 forgiveness-0.096
Token lots
Token ID6041
Feature activation-6.28e+00

Pos logit contributions
 anth+0.035
amp+0.034
ines+0.033
attered+0.031
ained+0.031
Neg logit contributions
aid-0.034
Of-0.028
 nod-0.027
 yes-0.026
 forgiveness-0.025
Token of
Token ID286
Feature activation+8.69e-01

Pos logit contributions
 anth+0.272
amp+0.256
attered+0.245
ines+0.240
 problem+0.230
Neg logit contributions
aid-0.343
Of-0.274
 yes-0.260
 nod-0.260
 forgiveness-0.254
Token fun
Token ID1257
Feature activation-6.48e-01

Pos logit contributions
aid+0.049
Of+0.041
 yes+0.040
 nod+0.039
 forgiveness+0.038
Neg logit contributions
 anth-0.035
amp-0.033
ines-0.033
attered-0.031
 problem-0.030
Token when
Token ID618
Feature activation+4.48e+00

Pos logit contributions
 anth+0.033
amp+0.032
attered+0.032
 flood+0.030
icer+0.030
Neg logit contributions
aid-0.025
 yes-0.019
 band-0.018
 nod-0.018
Of-0.018
Token all
Token ID477
Feature activation-9.48e+00

Pos logit contributions
aid+0.120
 yes+0.095
Of+0.092
 forgiveness+0.086
 nod+0.086
Neg logit contributions
amp-0.280
 anth-0.280
ines-0.277
attered-0.275
 argument-0.261
Token of
Token ID286
Feature activation+9.09e-01

Pos logit contributions
 anth+0.614
OO+0.604
amp+0.603
attered+0.598
ines+0.580
Neg logit contributions
aid-0.252
 nod-0.181
 forgiveness-0.168
Of-0.167
 comfort-0.160
Token sudden
Token ID4802
Feature activation-7.29e-01

Pos logit contributions
aid+0.045
Of+0.038
 nod+0.036
 yes+0.035
 forgiveness+0.032
Neg logit contributions
 anth-0.043
amp-0.041
ines-0.040
 ago-0.037
 problem-0.037
Token,
Token ID11
Feature activation-3.16e-01

Pos logit contributions
 anth+0.035
amp+0.035
ines+0.034
attered+0.033
ained+0.031
Neg logit contributions
aid-0.034
 yes-0.027
Of-0.027
 nod-0.027
 forgiveness-0.026
Token she
Token ID673
Feature activation-8.93e-01

Pos logit contributions
 anth+0.017
amp+0.017
ines+0.016
attered+0.016
 problem+0.015
Neg logit contributions
aid-0.013
Of-0.010
 nod-0.009
 yes-0.009
 forgiveness-0.009
Token heard
Token ID2982
Feature activation-4.35e-01

Pos logit contributions
amp+0.040
ines+0.038
 anth+0.037
ained+0.035
ash+0.035
Neg logit contributions
aid-0.040
 band-0.035
My-0.033
 yes-0.033
 nodded-0.033
Token Amy
Token ID14235
Feature activation-1.57e+00

Pos logit contributions
 anth+0.118
amp+0.114
ines+0.112
attered+0.107
 problem+0.103
Neg logit contributions
aid-0.118
Of-0.096
 yes-0.094
 nod-0.092
 forgiveness-0.090
Token was
Token ID373
Feature activation-2.02e+00

Pos logit contributions
 anth+0.062
amp+0.059
ines+0.057
attered+0.054
ained+0.052
Neg logit contributions
aid-0.088
Of-0.073
 yes-0.073
 nod-0.073
 forgiveness-0.070
Token right
Token ID826
Feature activation-2.43e+00

Pos logit contributions
 anth+0.087
amp+0.083
ines+0.081
attered+0.077
 problem+0.075
Neg logit contributions
aid-0.107
Of-0.089
 yes-0.085
 nod-0.085
 forgiveness-0.082
Token,
Token ID11
Feature activation-2.27e-01

Pos logit contributions
 anth+0.105
amp+0.100
ines+0.098
attered+0.093
ained+0.089
Neg logit contributions
aid-0.128
Of-0.107
 yes-0.105
 nod-0.103
 forgiveness-0.101
Token running
Token ID2491
Feature activation-1.54e+00

Pos logit contributions
 anth+0.010
amp+0.010
ines+0.009
attered+0.009
 problem+0.009
Neg logit contributions
aid-0.012
Of-0.009
 yes-0.009
 nod-0.009
 forgiveness-0.009
Token all
Token ID477
Feature activation-9.38e+00

Pos logit contributions
 anth+0.071
amp+0.067
ines+0.066
attered+0.061
icer+0.060
Neg logit contributions
aid-0.074
Of-0.063
 yes-0.063
 nod-0.061
 forgive-0.059
Token the
Token ID262
Feature activation+6.80e-01

Pos logit contributions
 anth+0.528
attered+0.494
amp+0.491
OO+0.469
ines+0.466
Neg logit contributions
aid-0.353
Of-0.278
 nod-0.277
 forgiveness-0.264
 forgive-0.261
Token time
Token ID640
Feature activation-2.28e+00

Pos logit contributions
aid+0.040
Of+0.035
 nod+0.034
 yes+0.033
 forgiveness+0.033
Neg logit contributions
attered-0.023
 anth-0.023
amp-0.023
ines-0.021
ays-0.020
Token wasn
Token ID2492
Feature activation-1.37e+00

Pos logit contributions
 anth+0.108
amp+0.101
ines+0.097
attered+0.096
ained+0.092
Neg logit contributions
aid-0.107
 yes-0.092
Of-0.088
 nod-0.088
 forgive-0.085
Token't
Token ID470
Feature activation-2.01e+00

Pos logit contributions
 anth+0.070
ines+0.065
attered+0.064
amp+0.063
 problem+0.059
Neg logit contributions
aid-0.063
 yes-0.053
Of-0.052
 nod-0.049
 forgive-0.048
Token nice
Token ID3621
Feature activation+2.06e+00

Pos logit contributions
 anth+0.082
amp+0.077
ines+0.076
attered+0.072
 problem+0.071
Neg logit contributions
aid-0.111
Of-0.093
 yes-0.090
 nod-0.089
 forgiveness-0.086
Token a
Token ID257
Feature activation+1.15e+00

Pos logit contributions
aid+0.332
Of+0.273
 yes+0.260
 nod+0.260
 forgiveness+0.248
Neg logit contributions
 anth-0.286
amp-0.277
ines-0.271
 problem-0.247
attered-0.247
Token few
Token ID1178
Feature activation-5.93e+00

Pos logit contributions
aid+0.057
Of+0.047
 yes+0.046
 nod+0.045
 forgiveness+0.044
Neg logit contributions
 anth-0.053
amp-0.051
ines-0.051
attered-0.049
ained-0.047
Token jokes
Token ID14532
Feature activation+4.28e+00

Pos logit contributions
 anth+0.248
amp+0.243
attered+0.234
ines+0.229
ained+0.226
Neg logit contributions
aid-0.319
Of-0.262
 yes-0.261
 nod-0.256
 forgiveness-0.254
Token together
Token ID1978
Feature activation+5.56e-01

Pos logit contributions
aid+0.225
Of+0.186
 yes+0.181
 nod+0.180
 forgiveness+0.175
Neg logit contributions
 anth-0.183
amp-0.176
ines-0.172
attered-0.164
 problem-0.157
Token.
Token ID13
Feature activation+5.28e-03

Pos logit contributions
aid+0.028
 yes+0.024
Of+0.024
 nod+0.023
 forgiveness+0.022
Neg logit contributions
 anth-0.024
amp-0.023
ines-0.023
attered-0.022
ained-0.021
Tokenâ
Token ID22940
Feature activation-8.55e+00

Pos logit contributions
aid+0.000
 forgiveness+0.000
 yes+0.000
 heal+0.000
 band+0.000
Neg logit contributions
 anth-0.000
amp-0.000
 ago-0.000
ines-0.000
>-0.000
Token
Token ID26391
Feature activation+4.61e+00

Pos logit contributions
amp+0.365
ines+0.323
 anth+0.319
ained+0.315
 argument+0.312
Neg logit contributions
aid-0.447
 yes-0.370
 forgiveness-0.365
Of-0.358
 stro-0.352
Token
Token ID220
Feature activation-3.15e-01

Pos logit contributions
aid+0.197
Of+0.155
 yes+0.150
 nod+0.147
 forgiveness+0.143
Neg logit contributions
 anth-0.246
amp-0.238
ines-0.233
attered-0.224
 problem-0.216
Token
Token ID198
Feature activation+1.14e+00

Pos logit contributions
amp+0.016
 anth+0.015
ines+0.015
ained+0.014
attered+0.014
Neg logit contributions
aid-0.015
 yes-0.011
Of-0.011
 forgiveness-0.011
 nod-0.011
Token
Token ID198
Feature activation+1.14e+00

Pos logit contributions
aid+0.051
 yes+0.040
 forgiveness+0.039
 nod+0.039
Of+0.039
Neg logit contributions
amp-0.057
 anth-0.056
ines-0.055
>-0.051
ained-0.049
TokenThen
Token ID6423
Feature activation+2.91e+00

Pos logit contributions
aid+0.041
Of+0.030
 yes+0.029
 nod+0.029
 forgiveness+0.028
Neg logit contributions
 anth-0.068
amp-0.066
ines-0.065
attered-0.062
ained-0.061
Token children
Token ID1751
Feature activation+3.06e+00

Pos logit contributions
aid+0.036
Of+0.029
 nod+0.028
 yes+0.028
 forgiveness+0.027
Neg logit contributions
 anth-0.031
amp-0.029
ines-0.029
 problem-0.028
attered-0.027
Token in
Token ID287
Feature activation+3.20e-01

Pos logit contributions
aid+0.155
Of+0.127
 yes+0.126
 nod+0.124
 forgiveness+0.120
Neg logit contributions
 anth-0.138
amp-0.132
ines-0.128
attered-0.123
ained-0.118
Token town
Token ID3240
Feature activation+1.10e+00

Pos logit contributions
aid+0.013
 yes+0.010
 forgiveness+0.010
 nod+0.010
Of+0.010
Neg logit contributions
 anth-0.017
amp-0.017
attered-0.017
ines-0.016
ained-0.015
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
aid+0.056
Of+0.046
 yes+0.046
 nod+0.045
 forgiveness+0.044
Neg logit contributions
 anth-0.049
amp-0.047
ines-0.045
attered-0.044
ained-0.042
Token They
Token ID1119
Feature activation-4.81e-01

Pos logit contributions
aid+0.009
 yes+0.007
Of+0.007
 nod+0.007
 forgiveness+0.007
Neg logit contributions
 anth-0.008
amp-0.008
ines-0.008
attered-0.007
ained-0.007
Token all
Token ID477
Feature activation-9.52e+00

Pos logit contributions
 anth+0.018
amp+0.018
ines+0.017
attered+0.016
ained+0.015
Neg logit contributions
aid-0.028
 yes-0.024
Of-0.024
 nod-0.024
 forgive-0.023
Token liked
Token ID8288
Feature activation-4.13e-01

Pos logit contributions
 anth+0.411
amp+0.394
ines+0.378
attered+0.363
 problem+0.348
Neg logit contributions
aid-0.510
Of-0.421
 nod-0.413
 yes-0.413
 forgiveness-0.394
Token him
Token ID683
Feature activation+3.18e+00

Pos logit contributions
 anth+0.017
ines+0.016
amp+0.016
 problem+0.015
 argument+0.015
Neg logit contributions
aid-0.023
Of-0.018
 nod-0.018
 yes-0.017
 forgive-0.017
Token very
Token ID845
Feature activation-2.00e+00

Pos logit contributions
aid+0.176
Of+0.147
 yes+0.143
 nod+0.142
 forgiveness+0.138
Neg logit contributions
 anth-0.129
amp-0.123
ines-0.120
attered-0.114
 problem-0.109
Token much
Token ID881
Feature activation-4.72e+00

Pos logit contributions
 anth+0.078
amp+0.073
ines+0.067
attered+0.066
ained+0.064
Neg logit contributions
aid-0.115
 yes-0.098
 nod-0.097
Of-0.096
 forgiveness-0.093
Token and
Token ID290
Feature activation+6.01e-01

Pos logit contributions
ines+0.168
 problem+0.165
 anth+0.159
 argument+0.149
amp+0.147
Neg logit contributions
aid-0.274
Of-0.232
 nod-0.212
 yes-0.211
My-0.210