TOP ACTIVATIONS
MAX = 0.282

day , the little girl conf ided in the man that
full of colorful balloons , conf etti and stream ers .
air , releasing brightly colored conf etti . Lu
and throwing around mint green conf etti . He
and Zoe always kept their passports decorated and they enjoyed showing
had never seen such beautiful passports ! From that
looked at their tickets and passports and smiled . " You
to show their tickets and passports . " Hello
their bags and bring their passports . Pass ports are small
and Ben do not have passports . They are too little
they wanted to show their passports to their M ums .
understand . He takes their passports and rub s them with
it . They lost their passports and couldn 't go on
. He looks at their passports . He looks at Sara
going to teach you about passports today ."
We have to show our passports to the lady there .
Dad have to show their passports to a man . A
were successful in getting their passports . Lily was so happy
. After the passports were decorated , the two
to many places with their passports , but this was their

MIN ACTIVATIONS
MIN = -0.287

her toys , but she accidentally knocked over a v ase
, but one day he accidentally broke it . He was
while , but then she accidentally threw it too hard and
her friend said that she accidentally spilled juice on it .
while , but then he accidentally kicked it too hard and
he was playing , he accidentally lost a special button from
with her ball when it accidentally hit her mom 's ash
an apple , but he accidentally knocked the bowl over .
hair . Suddenly , he accidentally cut his finger and started
were playing , Tim my accidentally broke his new toy soldier
. He had accidentally fallen into the creek !
pepper in it when she accidentally spilled some on her hand
were too sharp and she accidentally cut off too much hair
with his toys when he accidentally knocked over a cup of
One day , Tim my accidentally spilled some water on his
. One day , he accidentally broke his motorcycle into two
his toy car when he accidentally knocked it into the sink
Suddenly , Lily accidentally knocked over the te ap
toys , but sometimes she accidentally broke them because they were
on the wall . Tim accidentally hit the ornament , and

INTERVAL 0.140 - 0.282
CONTAINS 2.024%

smiled and said , " Yes , Ben . We can
was time to relax . So he sat down on the
. They say , " Yes , Mom , let 's
to come over and play games with her . When they
can take you to any story you want . But you

INTERVAL -0.002 - 0.140
CONTAINS 47.549%

a small village near the forest . They liked to play
she couldn 't find him . She felt foolish for not
not for your mouth . OK ?" Mom says .
until her grandma came back . Days went
, Bob arrived at the beach . His heart was pounding

INTERVAL -0.144 - -0.002
CONTAINS 48.985%

" Can I try it ?" Ben asked .
it ." Anna and Ben felt sorry . They
They were happy again . <|endoftext|> Anna and Ben are twins
and took her home . Lily 's knee hurt so much
They did not know what to do . They heard Mom

INTERVAL -0.287 - -0.144
CONTAINS 1.441%

now ?" Mom my smiled and replied , "
prevented Laura and Alice from having an argument , and they
flew away . Lily was sad to see it go ,
smiled and wasn 't scared anymore . She was so happy
nuts ." Sam my used his strong teeth to
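
The four INTERVAL ranges above look like an equal-width split of the span between MIN = -0.287 and MAX = 0.282, and the CONTAINS figures look like the share of token activations in each bin. This is inferred from the printed numbers, not stated by the dashboard. A minimal Python sketch under that assumption follows; the function names `equal_width_bins` and `bin_shares` are hypothetical and do not come from the tool that produced this page.

```python
from typing import List, Sequence, Tuple


def equal_width_bins(lo: float, hi: float, n_bins: int = 4) -> List[Tuple[float, float]]:
    """Split [lo, hi] into n_bins intervals of equal width (assumed binning scheme)."""
    width = (hi - lo) / n_bins
    return [(lo + i * width, lo + (i + 1) * width) for i in range(n_bins)]


def bin_shares(activations: Sequence[float], lo: float, hi: float, n_bins: int = 4) -> List[float]:
    """Return the percentage of activations that fall into each equal-width bin."""
    if not activations:
        return [0.0] * n_bins
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for a in activations:
        # Clamp the index so values at or beyond the edges land in the outer bins.
        idx = int((a - lo) / width)
        idx = max(0, min(idx, n_bins - 1))
        counts[idx] += 1
    return [100.0 * c / len(activations) for c in counts]


# Bin edges implied by the MIN and MAX values printed in this dump.
bins = equal_width_bins(-0.287, 0.282)
for low, high in bins:
    print(f"INTERVAL {low:.3f} - {high:.3f}")

# Toy activations (made up for illustration, not taken from the dashboard).
toy = [-0.287, -0.1, 0.0, 0.05, 0.282]
print(bin_shares(toy, -0.287, 0.282))
```

With the printed MIN and MAX, the computed inner edges are approximately -0.145, -0.003 and 0.140, which agree with the boundaries shown to within rounding of the displayed values.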
Token day
Token ID1110
Feature activation+3.07e-02

Pos logit contributions
years+0.146
Acc+0.143
eds+0.137
pped+0.132
umper+0.132
Neg logit contributions
 So-0.141
 tickets-0.125
 message-0.125
 says-0.124
 sounds-0.121
Token,
Token ID11
Feature activation-3.97e-02

Pos logit contributions
 So+0.044
 message+0.043
 idea+0.041
 funn+0.041
 tickets+0.040
Neg logit contributions
years-0.047
pped-0.046
Acc-0.045
 accidentally-0.044
pping-0.043
Token the
Token ID262
Feature activation+1.68e-02

Pos logit contributions
years+0.056
Acc+0.055
pped+0.051
eds+0.049
pping+0.048
Neg logit contributions
 So-0.066
 message-0.062
 says-0.060
 tickets-0.060
 idea-0.059
Token little
Token ID1310
Feature activation-4.47e-02

Pos logit contributions
 So+0.019
 message+0.015
 tickets+0.015
 says+0.015
 treats+0.014
Neg logit contributions
years-0.033
Acc-0.033
pped-0.031
eds-0.031
pping-0.031
Token girl
Token ID2576
Feature activation+1.26e-01

Pos logit contributions
years+0.071
umper+0.071
Acc+0.070
grown+0.069
mo+0.066
Neg logit contributions
 So-0.060
 tickets-0.052
 message-0.052
 shouts-0.051
 tale-0.049
Token conf
Token ID1013
Feature activation+2.82e-01

Pos logit contributions
 So+0.168
 says+0.150
 message+0.141
 sounds+0.139
 y+0.138
Neg logit contributions
years-0.233
Acc-0.230
eds-0.220
arter-0.214
pped-0.211
Tokenided
Token ID1384
Feature activation-5.02e-02

Pos logit contributions
 So+0.477
 message+0.427
 tickets+0.425
 says+0.416
 sounds+0.413
Neg logit contributions
Acc-0.401
years-0.392
eds-0.374
umper-0.354
pping-0.351
Token in
Token ID287
Feature activation+2.03e-02

Pos logit contributions
years+0.082
Acc+0.080
eds+0.078
pped+0.076
umper+0.075
Neg logit contributions
 So-0.073
 message-0.066
 says-0.065
 tickets-0.064
 idea-0.064
Token the
Token ID262
Feature activation+1.71e-02

Pos logit contributions
 So+0.031
 message+0.028
 says+0.028
 tickets+0.027
 sounds+0.027
Neg logit contributions
years-0.031
Acc-0.030
eds-0.029
pped-0.029
pping-0.028
Token man
Token ID582
Feature activation+6.53e-02

Pos logit contributions
 So+0.030
 message+0.028
 prayer+0.028
 idea+0.028
 says+0.027
Neg logit contributions
years-0.020
eds-0.019
Acc-0.019
pping-0.018
pped-0.018
Token that
Token ID326
Feature activation-5.97e-02

Pos logit contributions
 So+0.046
 message+0.041
 idea+0.040
 prayer+0.039
 tickets+0.039
Neg logit contributions
years-0.150
eds-0.146
Acc-0.145
pping-0.144
pped-0.142
Token full
Token ID1336
Feature activation-5.32e-03

Pos logit contributions
Acc+0.094
years+0.093
eds+0.091
umper+0.089
pping+0.087
Neg logit contributions
 So-0.068
 message-0.062
 idea-0.060
 y-0.060
 says-0.060
Token of
Token ID286
Feature activation-4.16e-02

Pos logit contributions
years+0.006
pped+0.006
pping+0.006
Acc+0.006
rou+0.006
Neg logit contributions
 So-0.009
 tickets-0.009
 Sorry-0.009
 says-0.009
 Fer-0.009
Token colorful
Token ID20239
Feature activation-3.05e-02

Pos logit contributions
years+0.080
Acc+0.078
eds+0.076
pped+0.075
pping+0.073
Neg logit contributions
 So-0.049
 message-0.041
 says-0.041
 tickets-0.040
 idea-0.040
Token balloons
Token ID35485
Feature activation+1.29e-01

Pos logit contributions
years+0.058
Acc+0.055
eds+0.055
pped+0.054
pping+0.053
Neg logit contributions
 So-0.036
 message-0.031
 says-0.030
 idea-0.030
 tickets-0.029
Token,
Token ID11
Feature activation-3.63e-02

Pos logit contributions
 So+0.189
 says+0.162
 message+0.161
 tickets+0.161
 prayer+0.161
Neg logit contributions
years-0.201
pped-0.197
pping-0.196
eds-0.195
Acc-0.192
Token conf
Token ID1013
Feature activation+2.82e-01

Pos logit contributions
years+0.056
Acc+0.055
pped+0.053
eds+0.052
rou+0.052
Neg logit contributions
 So-0.055
 message-0.047
 idea-0.047
 says-0.047
 prayer-0.046
Tokenetti
Token ID24851
Feature activation-8.01e-02

Pos logit contributions
 So+0.509
 message+0.463
 tickets+0.460
 sounds+0.454
 says+0.451
Neg logit contributions
Acc-0.370
years-0.361
eds-0.336
umper-0.329
pping-0.318
Token and
Token ID290
Feature activation-7.70e-03

Pos logit contributions
eds+0.123
pping+0.121
pped+0.120
years+0.118
mo+0.115
Neg logit contributions
 So-0.119
 prayer-0.102
 message-0.102
 says-0.100
 tickets-0.100
Token stream
Token ID4269
Feature activation+1.05e-02

Pos logit contributions
years+0.013
Acc+0.012
pped+0.012
eds+0.012
rou+0.012
Neg logit contributions
 So-0.011
 idea-0.009
 message-0.009
 prayer-0.009
 list-0.008
Tokeners
Token ID364
Feature activation+7.33e-02

Pos logit contributions
 So+0.013
 prayer+0.011
 message+0.011
 says+0.011
 idea+0.011
Neg logit contributions
years-0.019
eds-0.018
pped-0.018
pping-0.018
Acc-0.018
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
 So+0.101
 message+0.087
 prayer+0.086
 idea+0.086
 says+0.085
Neg logit contributions
years-0.124
eds-0.120
pped-0.120
pping-0.119
Acc-0.117
Token air
Token ID1633
Feature activation+1.38e-01

Pos logit contributions
 So+0.020
 idea+0.017
 message+0.017
 tickets+0.017
 says+0.016
Neg logit contributions
years-0.032
eds-0.032
Acc-0.032
pped-0.031
mo-0.031
Token,
Token ID11
Feature activation-3.79e-02

Pos logit contributions
 So+0.210
 message+0.186
 tickets+0.185
 says+0.184
 prayer+0.182
Neg logit contributions
years-0.208
Acc-0.204
eds-0.203
pping-0.202
pped-0.199
Token releasing
Token ID13011
Feature activation+6.43e-03

Pos logit contributions
years+0.062
Acc+0.060
eds+0.059
umper+0.058
pped+0.057
Neg logit contributions
 So-0.055
 y-0.050
 says-0.050
 tickets-0.050
 message-0.049
Token brightly
Token ID35254
Feature activation+5.86e-02

Pos logit contributions
 So+0.010
 message+0.009
 prayer+0.009
 says+0.009
 tale+0.009
Neg logit contributions
years-0.009
Acc-0.009
pped-0.008
eds-0.008
mo-0.008
Token colored
Token ID16396
Feature activation-3.34e-02

Pos logit contributions
 So+0.090
 message+0.080
 says+0.078
 prayer+0.077
 idea+0.077
Neg logit contributions
years-0.090
eds-0.089
Acc-0.087
pped-0.087
pping-0.085
Token conf
Token ID1013
Feature activation+2.82e-01

Pos logit contributions
years+0.055
pped+0.053
eds+0.053
Acc+0.052
pping+0.051
Neg logit contributions
 So-0.046
 message-0.039
 prayer-0.039
 says-0.038
 idea-0.038
Tokenetti
Token ID24851
Feature activation-8.28e-02

Pos logit contributions
 So+0.533
 message+0.487
 tickets+0.483
 says+0.481
 sounds+0.479
Neg logit contributions
years-0.341
Acc-0.340
eds-0.316
pped-0.305
umper-0.305
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
eds+0.114
pping+0.112
pped+0.109
mo+0.105
years+0.104
Neg logit contributions
 So-0.136
 message-0.118
 prayer-0.117
 Sorry-0.117
 says-0.117
Token
Token ID198
Feature activation+1.90e-02

Pos logit contributions
 So+0.020
 says+0.016
 y+0.015
 prayer+0.015
 tickets+0.015
Neg logit contributions
years-0.033
Acc-0.033
eds-0.032
pping-0.032
pped-0.032
Token
Token ID198
Feature activation+1.85e-02

Pos logit contributions
 So+0.016
So+0.010
 says+0.010
 y+0.008
 exclaimed+0.007
Neg logit contributions
years-0.049
arter-0.048
pping-0.048
mo-0.047
umper-0.047
TokenLu
Token ID25596
Feature activation+1.13e-02

Pos logit contributions
 So+0.025
So+0.022
 y+0.020
 treats+0.019
 message+0.019
Neg logit contributions
arter-0.034
eds-0.033
umper-0.033
years-0.033
mo-0.032
Token and
Token ID290
Feature activation-7.54e-03

Pos logit contributions
years+0.079
Acc+0.078
eds+0.076
pped+0.075
pping+0.074
Neg logit contributions
 So-0.068
 message-0.061
 idea-0.060
 tickets-0.059
 says-0.059
Token throwing
Token ID9644
Feature activation-1.32e-01

Pos logit contributions
years+0.012
Acc+0.012
eds+0.011
rou+0.011
pped+0.010
Neg logit contributions
 So-0.011
 tickets-0.010
 message-0.010
 idea-0.010
 prayer-0.010
Token around
Token ID1088
Feature activation-1.08e-01

Pos logit contributions
years+0.190
eds+0.188
pped+0.182
Acc+0.178
mo+0.172
Neg logit contributions
 So-0.197
 prayer-0.189
 message-0.180
 tale-0.178
 list-0.177
Token mint
Token ID21043
Feature activation-3.88e-02

Pos logit contributions
years+0.207
Acc+0.206
eds+0.199
umper+0.197
pped+0.194
Neg logit contributions
 So-0.128
 y-0.110
 says-0.108
 sounds-0.108
 treats-0.107
Token green
Token ID4077
Feature activation-1.15e-01

Pos logit contributions
eds+0.070
years+0.068
Acc+0.067
pping+0.066
pped+0.066
Neg logit contributions
 So-0.048
 prayer-0.044
 message-0.043
 tickets-0.042
 hello-0.042
Token conf
Token ID1013
Feature activation+2.82e-01

Pos logit contributions
years+0.211
Acc+0.208
eds+0.202
pped+0.199
pping+0.196
Neg logit contributions
 So-0.145
 message-0.128
 says-0.125
 tickets-0.125
 idea-0.124
Tokenetti
Token ID24851
Feature activation-8.03e-02

Pos logit contributions
 So+0.499
 message+0.444
 sounds+0.444
 says+0.433
 tickets+0.430
Neg logit contributions
Acc-0.382
years-0.375
umper-0.343
eds-0.340
mo-0.332
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
pping+0.130
eds+0.130
pped+0.129
years+0.127
mo+0.122
Neg logit contributions
 So-0.112
 prayer-0.101
 message-0.100
 tickets-0.097
 says-0.097
Token
Token ID198
Feature activation+2.33e-02

Pos logit contributions
 So+0.027
 message+0.025
 tickets+0.024
 idea+0.024
 says+0.024
Neg logit contributions
years-0.041
Acc-0.040
eds-0.039
pped-0.038
umper-0.038
Token
Token ID198
Feature activation+2.35e-02

Pos logit contributions
 So+0.025
 says+0.020
 message+0.018
 idea+0.018
 tickets+0.018
Neg logit contributions
years-0.052
Acc-0.050
pping-0.048
mo-0.048
eds-0.048
TokenHe
Token ID1544
Feature activation-4.62e-03

Pos logit contributions
 So+0.033
 sounds+0.028
 message+0.028
So+0.028
 y+0.027
Neg logit contributions
years-0.041
Acc-0.041
eds-0.041
arter-0.040
mo-0.039
Token and
Token ID290
Feature activation-5.96e-03

Pos logit contributions
years+0.070
Acc+0.069
eds+0.066
arter+0.066
pped+0.064
Neg logit contributions
 So-0.049
 says-0.043
 sounds-0.042
 message-0.042
 idea-0.041
Token Zoe
Token ID32280
Feature activation+5.72e-02

Pos logit contributions
years+0.008
Acc+0.008
eds+0.007
umper+0.007
mo+0.007
Neg logit contributions
 So-0.010
 message-0.010
 tickets-0.009
 says-0.009
 sounds-0.009
Token always
Token ID1464
Feature activation-6.86e-02

Pos logit contributions
 So+0.075
 says+0.065
 message+0.063
 sounds+0.063
 tickets+0.063
Neg logit contributions
Acc-0.104
years-0.100
eds-0.099
arter-0.098
pped-0.097
Token kept
Token ID4030
Feature activation-2.96e-02

Pos logit contributions
years+0.104
Acc+0.097
eds+0.094
pping+0.092
pped+0.092
Neg logit contributions
 So-0.111
 tickets-0.102
 message-0.101
 idea-0.097
 says-0.097
Token their
Token ID511
Feature activation+1.31e-01

Pos logit contributions
years+0.042
eds+0.039
Acc+0.038
pped+0.037
pping+0.037
Neg logit contributions
 So-0.052
 prayer-0.045
 says-0.045
 idea-0.045
 message-0.044
Token passports
Token ID33052
Feature activation+2.78e-01

Pos logit contributions
 So+0.181
 um+0.150
 click+0.145
br+0.144
So+0.143
Neg logit contributions
years-0.227
eds-0.218
ind-0.215
 flooding-0.199
grown-0.199
Token decorated
Token ID24789
Feature activation-4.92e-02

Pos logit contributions
 So+0.419
 says+0.380
 replies+0.364
 shouts+0.360
 sounds+0.354
Neg logit contributions
years-0.426
eds-0.420
pping-0.418
Acc-0.415
pped-0.405
Token and
Token ID290
Feature activation-7.15e-03

Pos logit contributions
ind+0.071
eds+0.071
years+0.065
ini+0.061
pping+0.061
Neg logit contributions
 So-0.082
 Sorry-0.073
 prayer-0.071
 hello-0.071
 says-0.070
Token they
Token ID484
Feature activation+8.52e-02

Pos logit contributions
years+0.012
Acc+0.011
eds+0.011
pped+0.011
pping+0.010
Neg logit contributions
 So-0.011
 message-0.009
 says-0.009
 idea-0.009
 replies-0.009
Token enjoyed
Token ID8359
Feature activation-4.81e-02

Pos logit contributions
 So+0.124
 message+0.109
 says+0.108
 tickets+0.108
 idea+0.106
Neg logit contributions
years-0.141
Acc-0.138
eds-0.133
pped-0.130
pping-0.129
Token showing
Token ID4478
Feature activation+8.18e-02

Pos logit contributions
years+0.072
Acc+0.069
eds+0.066
pped+0.064
pping+0.063
Neg logit contributions
 So-0.080
 says-0.070
 message-0.070
 replies-0.068
 prayer-0.068
Token had
Token ID550
Feature activation-1.01e-01

Pos logit contributions
 So+0.107
 says+0.095
 message+0.095
 idea+0.093
 y+0.093
Neg logit contributions
Acc-0.143
years-0.143
eds-0.139
pped-0.134
umper-0.133
Token never
Token ID1239
Feature activation-8.72e-02

Pos logit contributions
years+0.156
Acc+0.153
eds+0.148
pped+0.145
pping+0.143
Neg logit contributions
 So-0.156
 message-0.138
 says-0.137
 idea-0.136
 tickets-0.135
Token seen
Token ID1775
Feature activation-9.32e-02

Pos logit contributions
years+0.121
Acc+0.119
eds+0.113
pped+0.111
pping+0.109
Neg logit contributions
 So-0.149
 message-0.135
 tickets-0.134
 says-0.133
 idea-0.132
Token such
Token ID884
Feature activation+1.10e-01

Pos logit contributions
Acc+0.185
years+0.183
eds+0.178
pped+0.172
pping+0.172
Neg logit contributions
 So-0.102
 y-0.092
 tickets-0.092
 treats-0.091
 message-0.091
Token beautiful
Token ID4950
Feature activation-3.17e-02

Pos logit contributions
 So+0.123
 message+0.107
 tickets+0.104
 says+0.103
 idea+0.103
Neg logit contributions
years-0.217
Acc-0.215
eds-0.210
pped-0.205
pping-0.204
Token passports
Token ID33052
Feature activation+2.78e-01

Pos logit contributions
years+0.058
Acc+0.058
eds+0.054
pped+0.054
umper+0.054
Neg logit contributions
 So-0.040
 message-0.037
 sounds-0.036
 treats-0.036
 y-0.036
Token!
Token ID0
Feature activation-5.71e-02

Pos logit contributions
 So+0.389
 message+0.343
 says+0.340
 tickets+0.338
 idea+0.335
Neg logit contributions
years-0.469
Acc-0.462
eds-0.446
pped-0.440
pping-0.437
Token
Token ID198
Feature activation+1.91e-02

Pos logit contributions
years+0.109
Acc+0.108
eds+0.105
pped+0.104
pping+0.103
Neg logit contributions
 So-0.070
 says-0.059
 message-0.058
 tickets-0.057
 idea-0.056
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
 So+0.024
 says+0.020
So+0.020
 message+0.018
 y+0.018
Neg logit contributions
pping-0.038
arter-0.038
years-0.037
Acc-0.037
pped-0.037
TokenFrom
Token ID4863
Feature activation+5.23e-02

Pos logit contributions
 So+0.027
So+0.023
 message+0.021
 says+0.020
 sounds+0.020
Neg logit contributions
arter-0.035
eds-0.034
years-0.033
pping-0.033
Acc-0.033
Token that
Token ID326
Feature activation-5.48e-02

Pos logit contributions
 So+0.084
 says+0.078
 message+0.076
 y+0.074
 idea+0.074
Neg logit contributions
years-0.080
Acc-0.080
eds-0.077
umper-0.073
pping-0.073
Token looked
Token ID3114
Feature activation+3.47e-02

Pos logit contributions
years+0.050
Acc+0.049
eds+0.048
pped+0.047
pping+0.047
Neg logit contributions
 So-0.024
 says-0.021
 message-0.021
 tickets-0.020
 sounds-0.020
Token at
Token ID379
Feature activation+7.35e-02

Pos logit contributions
 So+0.048
 message+0.044
 prayer+0.042
 tickets+0.042
 sounds+0.042
Neg logit contributions
years-0.059
Acc-0.057
eds-0.054
pped-0.053
mo-0.053
Token their
Token ID511
Feature activation+1.25e-01

Pos logit contributions
 So+0.118
 tickets+0.113
 says+0.111
 sounds+0.108
 treats+0.105
Neg logit contributions
Acc-0.118
years-0.114
eds-0.110
pping-0.106
umper-0.104
Token tickets
Token ID8587
Feature activation+2.52e-01

Pos logit contributions
 So+0.114
ww+0.090
br+0.089
sc+0.081
 um+0.080
Neg logit contributions
ind-0.259
eds-0.258
years-0.253
rou-0.247
 flooding-0.241
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 So+0.425
ww+0.385
 Sorry+0.370
 message+0.368
 y+0.367
Neg logit contributions
years-0.326
pped-0.321
pping-0.320
eds-0.306
Acc-0.303
Token passports
Token ID33052
Feature activation+2.77e-01

Pos logit contributions
years+0.027
Acc+0.025
pped+0.025
rou+0.024
eds+0.024
Neg logit contributions
 So-0.016
 idea-0.013
 message-0.013
 tale-0.012
 list-0.012
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 So+0.402
ww+0.335
 message+0.331
 fer+0.330
 prayer+0.327
Neg logit contributions
pped-0.430
years-0.425
pping-0.415
Acc-0.405
eds-0.400
Token smiled
Token ID13541
Feature activation-9.93e-04

Pos logit contributions
years+0.025
Acc+0.023
pped+0.023
mo+0.022
eds+0.022
Neg logit contributions
 So-0.018
 message-0.016
 idea-0.015
 tale-0.015
 prayer-0.015
Token.
Token ID13
Feature activation+6.64e-03

Pos logit contributions
years+0.002
Acc+0.002
rou+0.002
pped+0.002
eds+0.002
Neg logit contributions
 So-0.001
 message-0.001
 tickets-0.001
 says-0.001
 fer-0.001
Token "
Token ID366
Feature activation-9.47e-02

Pos logit contributions
 So+0.009
 says+0.008
 tickets+0.008
 y+0.008
 message+0.008
Neg logit contributions
years-0.012
Acc-0.012
eds-0.012
pped-0.011
pping-0.011
TokenYou
Token ID1639
Feature activation+1.25e-01

Pos logit contributions
years+0.146
Acc+0.142
umper+0.129
pped+0.129
eds+0.125
Neg logit contributions
 So-0.149
 idea-0.141
 message-0.139
 says-0.136
 prayer-0.134
Token to
Token ID284
Feature activation-4.23e-03

Pos logit contributions
eds+0.136
pped+0.135
years+0.134
 flooding+0.129
grown+0.129
Neg logit contributions
 So-0.158
 idea-0.141
 prayer-0.137
 replies-0.133
 funn-0.130
Token show
Token ID905
Feature activation+1.31e-01

Pos logit contributions
pped+0.006
years+0.006
eds+0.006
pping+0.006
Acc+0.006
Neg logit contributions
 So-0.006
 says-0.005
 replies-0.005
 prayer-0.005
 funn-0.005
Token their
Token ID511
Feature activation+1.26e-01

Pos logit contributions
 So+0.232
 idea+0.212
 fer+0.209
 bisc+0.208
 Fer+0.207
Neg logit contributions
years-0.151
eds-0.146
ind-0.136
pped-0.134
pping-0.131
Token tickets
Token ID8587
Feature activation+2.51e-01

Pos logit contributions
 So+0.152
 fer+0.121
 funn+0.121
sc+0.120
 Fer+0.119
Neg logit contributions
eds-0.221
years-0.212
ind-0.210
rou-0.207
 flooding-0.207
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
 So+0.358
 funn+0.301
 replies+0.299
 idea+0.299
 prayer+0.297
Neg logit contributions
eds-0.387
pping-0.383
years-0.377
pped-0.376
 flooding-0.363
Token passports
Token ID33052
Feature activation+2.76e-01

Pos logit contributions
years+0.025
eds+0.023
 flooding+0.023
Acc+0.023
pped+0.022
Neg logit contributions
 So-0.019
 idea-0.015
 tale-0.015
 replies-0.015
ww-0.015
Token.
Token ID13
Feature activation+4.76e-03

Pos logit contributions
 So+0.409
 fer+0.342
ww+0.334
 message+0.331
 prayer+0.331
Neg logit contributions
eds-0.410
pped-0.407
pping-0.407
years-0.399
rou-0.390
Token
Token ID198
Feature activation+1.66e-02

Pos logit contributions
 So+0.006
 message+0.006
 idea+0.006
 prayer+0.006
 sounds+0.006
Neg logit contributions
years-0.008
Acc-0.008
eds-0.008
pped-0.007
umper-0.007
Token
Token ID198
Feature activation+1.67e-02

Pos logit contributions
 So+0.018
 says+0.015
 tickets+0.014
 y+0.013
 sounds+0.013
Neg logit contributions
years-0.037
Acc-0.036
umper-0.034
pping-0.034
eds-0.034
Token"
Token ID1
Feature activation-1.06e-01

Pos logit contributions
 So+0.024
 tickets+0.021
 message+0.021
 says+0.021
 sounds+0.021
Neg logit contributions
years-0.028
Acc-0.028
eds-0.027
umper-0.026
pping-0.026
TokenHello
Token ID15496
Feature activation+4.78e-02

Pos logit contributions
years+0.126
Acc+0.121
pped+0.117
umper+0.106
Sometimes+0.106
Neg logit contributions
 So-0.203
 idea-0.195
 message-0.189
 says-0.188
 prayer-0.185
Token their
Token ID511
Feature activation+1.23e-01

Pos logit contributions
 So+0.161
 tale+0.144
 message+0.143
 prayer+0.142
 fer+0.141
Neg logit contributions
years-0.140
eds-0.131
Acc-0.130
pped-0.127
grown-0.122
Token bags
Token ID11668
Feature activation+2.33e-01

Pos logit contributions
 So+0.150
 click+0.121
 Sorry+0.120
 replies+0.118
So+0.117
Neg logit contributions
eds-0.214
ind-0.210
 nowhere-0.205
 flooding-0.204
years-0.199
Token and
Token ID290
Feature activation-1.59e-02

Pos logit contributions
 So+0.294
 message+0.258
 says+0.254
 tickets+0.252
 idea+0.250
Neg logit contributions
years-0.426
Acc-0.420
eds-0.407
pped-0.400
pping-0.396
Token bring
Token ID2222
Feature activation+4.37e-02

Pos logit contributions
years+0.022
Acc+0.021
 flooding+0.021
eds+0.020
 raining+0.020
Neg logit contributions
 So-0.022
 tale-0.020
 fer-0.020
 MacDonald-0.020
 replies-0.020
Token their
Token ID511
Feature activation+1.23e-01

Pos logit contributions
 So+0.094
 prayer+0.086
 idea+0.085
 tale+0.085
 message+0.084
Neg logit contributions
years-0.039
eds-0.034
pped-0.032
mo-0.031
ind-0.030
Token passports
Token ID33052
Feature activation+2.76e-01

Pos logit contributions
 So+0.130
 click+0.108
 replies+0.107
ww+0.106
 Sorry+0.101
Neg logit contributions
eds-0.224
years-0.222
ind-0.216
 flooding-0.216
mo-0.215
Token.
Token ID13
Feature activation+2.18e-03

Pos logit contributions
 So+0.402
 says+0.349
 fer+0.349
 sounds+0.348
 replies+0.346
Neg logit contributions
eds-0.419
years-0.417
Acc-0.409
pped-0.401
pping-0.400
Token Pass
Token ID6251
Feature activation+9.16e-02

Pos logit contributions
 message+0.003
 So+0.003
 idea+0.003
 fer+0.003
 tale+0.003
Neg logit contributions
Acc-0.003
years-0.003
eds-0.003
pped-0.003
umper-0.003
Tokenports
Token ID3742
Feature activation+1.94e-01

Pos logit contributions
 So+0.129
 message+0.118
 idea+0.117
 prayer+0.114
 tale+0.111
Neg logit contributions
years-0.151
eds-0.148
pped-0.142
pping-0.140
mo-0.140
Token are
Token ID389
Feature activation+9.35e-02

Pos logit contributions
 So+0.212
 message+0.203
 idea+0.192
 funn+0.191
 tale+0.182
Neg logit contributions
pping-0.339
years-0.335
pped-0.328
eds-0.308
grown-0.297
Token small
Token ID1402
Feature activation-7.86e-02

Pos logit contributions
 So+0.122
 message+0.106
 says+0.103
 idea+0.102
 tickets+0.101
Neg logit contributions
years-0.167
Acc-0.163
eds-0.159
pped-0.155
pping-0.154
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
 So+0.048
 message+0.044
 idea+0.043
 tickets+0.043
 y+0.043
Neg logit contributions
years-0.052
Acc-0.051
eds-0.050
pped-0.049
pping-0.048
Token Ben
Token ID3932
Feature activation+1.92e-03

Pos logit contributions
years+0.018
Acc+0.017
umper+0.017
pped+0.017
pping+0.017
Neg logit contributions
 So-0.028
 message-0.026
 tale-0.026
 sounds-0.025
 idea-0.025
Token do
Token ID466
Feature activation+4.80e-02

Pos logit contributions
 So+0.002
 message+0.002
 treats+0.002
ww+0.002
 fer+0.002
Neg logit contributions
pping-0.003
Acc-0.003
pped-0.003
years-0.003
mo-0.003
Token not
Token ID407
Feature activation-7.64e-02

Pos logit contributions
 So+0.077
 message+0.068
 says+0.067
 tickets+0.067
 prayer+0.066
Neg logit contributions
years-0.072
Acc-0.071
eds-0.068
pped-0.067
pping-0.067
Token have
Token ID423
Feature activation-9.74e-02

Pos logit contributions
pping+0.100
years+0.098
umper+0.097
Acc+0.095
pped+0.094
Neg logit contributions
 So-0.131
 tickets-0.113
 fer-0.110
 Fer-0.110
 message-0.110
Token passports
Token ID33052
Feature activation+2.76e-01

Pos logit contributions
years+0.148
eds+0.141
pped+0.134
grown+0.132
pping+0.131
Neg logit contributions
 So-0.157
 idea-0.136
 prayer-0.133
 message-0.133
 click-0.132
Token.
Token ID13
Feature activation+4.17e-03

Pos logit contributions
 So+0.430
 message+0.364
 prayer+0.364
 says+0.360
 idea+0.357
Neg logit contributions
years-0.412
pped-0.407
pping-0.405
eds-0.397
Acc-0.387
Token They
Token ID1119
Feature activation+7.49e-02

Pos logit contributions
 message+0.007
 tale+0.006
 fer+0.006
 idea+0.006
 treats+0.006
Neg logit contributions
Acc-0.006
years-0.006
eds-0.005
grown-0.005
idget-0.005
Token are
Token ID389
Feature activation+9.49e-02

Pos logit contributions
 So+0.095
 message+0.085
 treats+0.083
 fer+0.081
 tickets+0.081
Neg logit contributions
years-0.124
pping-0.123
Acc-0.121
pped-0.120
mo-0.119
Token too
Token ID1165
Feature activation-7.32e-03

Pos logit contributions
 So+0.146
 message+0.126
 sounds+0.123
 says+0.123
 tickets+0.122
Neg logit contributions
years-0.151
Acc-0.142
eds-0.139
pping-0.136
pped-0.136
Token little
Token ID1310
Feature activation-4.28e-02

Pos logit contributions
years+0.013
Acc+0.013
eds+0.012
pped+0.012
umper+0.012
Neg logit contributions
 So-0.010
 message-0.009
 says-0.009
 tickets-0.009
 idea-0.009
Token they
Token ID484
Feature activation+8.62e-02

Pos logit contributions
years+0.104
Acc+0.099
eds+0.094
pped+0.093
pping+0.093
Neg logit contributions
 So-0.074
 message-0.065
 says-0.062
 idea-0.061
 tickets-0.061
Token wanted
Token ID2227
Feature activation-4.67e-02

Pos logit contributions
 So+0.128
 message+0.114
 tickets+0.113
 says+0.112
 idea+0.111
Neg logit contributions
years-0.141
Acc-0.137
eds-0.131
pped-0.131
pping-0.129
Token to
Token ID284
Feature activation-2.21e-03

Pos logit contributions
years+0.075
Acc+0.073
eds+0.070
pped+0.068
pping+0.067
Neg logit contributions
 So-0.070
 says-0.061
 message-0.061
 idea-0.061
 prayer-0.060
Token show
Token ID905
Feature activation+1.31e-01

Pos logit contributions
years+0.004
Acc+0.004
eds+0.004
pped+0.004
pping+0.004
Neg logit contributions
 So-0.003
 message-0.002
 says-0.002
 tickets-0.002
 idea-0.002
Token their
Token ID511
Feature activation+1.33e-01

Pos logit contributions
 So+0.246
 bisc+0.233
ww+0.225
 Fer+0.223
 chest+0.223
Neg logit contributions
years-0.130
idget-0.117
eds-0.116
mo-0.113
pping-0.108
Token passports
Token ID33052
Feature activation+2.75e-01

Pos logit contributions
 So+0.205
sc+0.167
 hello+0.163
 click+0.162
 Sorry+0.162
Neg logit contributions
years-0.202
eds-0.193
Acc-0.187
ind-0.181
rou-0.180
Token to
Token ID284
Feature activation-2.16e-03

Pos logit contributions
 So+0.422
ww+0.365
 fer+0.360
 Fer+0.357
 replies+0.356
Neg logit contributions
eds-0.384
years-0.382
pping-0.378
rou-0.377
Acc-0.374
Token their
Token ID511
Feature activation+1.33e-01

Pos logit contributions
years+0.003
Acc+0.003
eds+0.003
pping+0.003
pped+0.003
Neg logit contributions
 So-0.004
 message-0.003
 says-0.003
 sounds-0.003
 tickets-0.003
Token M
Token ID337
Feature activation+2.57e-02

Pos logit contributions
 So+0.217
 says+0.187
 idea+0.186
 message+0.184
 hello+0.182
Neg logit contributions
years-0.198
Acc-0.184
eds-0.182
ind-0.174
rou-0.173
Tokenums
Token ID5700
Feature activation-6.61e-02

Pos logit contributions
 So+0.052
 idea+0.048
 says+0.047
 tickets+0.047
 sounds+0.047
Neg logit contributions
years-0.029
Acc-0.028
pped-0.028
eds-0.025
umper-0.024
Token.
Token ID13
Feature activation+2.12e-02

Pos logit contributions
years+0.113
Acc+0.110
eds+0.108
umper+0.108
pping+0.107
Neg logit contributions
 So-0.088
 message-0.077
 tickets-0.075
 sounds-0.074
 prayer-0.074
Token understand
Token ID1833
Feature activation+9.21e-03

Pos logit contributions
years+0.110
umper+0.110
pping+0.109
eds+0.109
pped+0.108
Neg logit contributions
 So-0.128
 prayer-0.108
 tickets-0.105
 Fer-0.104
 fer-0.104
Token.
Token ID13
Feature activation-3.46e-05

Pos logit contributions
 So+0.011
 message+0.010
 idea+0.009
 says+0.009
 tickets+0.009
Neg logit contributions
years-0.017
Acc-0.017
eds-0.017
pped-0.016
pping-0.016
Token He
Token ID679
Feature activation-1.78e-02

Pos logit contributions
years+0.000
Acc+0.000
grown+0.000
pped+0.000
mo+0.000
Neg logit contributions
 message-0.000
 tale-0.000
 idea-0.000
 prayer-0.000
 sounds-0.000
Token takes
Token ID2753
Feature activation+4.05e-02

Pos logit contributions
mo+0.039
years+0.039
rou+0.038
pped+0.038
Acc+0.038
Neg logit contributions
 So-0.012
 message-0.010
 click-0.010
 treats-0.009
 prayer-0.009
Token their
Token ID511
Feature activation+1.21e-01

Pos logit contributions
 So+0.077
 tale+0.071
 prayer+0.070
 message+0.070
 list+0.068
Neg logit contributions
years-0.048
mo-0.044
pped-0.043
eds-0.041
idget-0.040
Token passports
Token ID33052
Feature activation+2.75e-01

Pos logit contributions
 So+0.149
So+0.113
 Sorry+0.109
 echoed+0.108
 click+0.108
Neg logit contributions
eds-0.219
years-0.214
ind-0.208
 nowhere-0.200
grown-0.194
Token and
Token ID290
Feature activation-1.70e-02

Pos logit contributions
 So+0.423
 prayer+0.369
 sounds+0.368
 tale+0.364
 message+0.363
Neg logit contributions
years-0.402
pped-0.398
eds-0.397
pping-0.393
Acc-0.391
Token rub
Token ID6437
Feature activation+1.18e-02

Pos logit contributions
pped+0.027
years+0.026
eds+0.026
mo+0.025
 accidentally+0.025
Neg logit contributions
 So-0.021
 tale-0.020
 message-0.019
 idea-0.018
So-0.018
Tokens
Token ID82
Feature activation-2.35e-02

Pos logit contributions
 So+0.016
 message+0.015
 idea+0.014
 prayer+0.014
 says+0.014
Neg logit contributions
years-0.020
Acc-0.019
eds-0.019
pped-0.019
umper-0.018
Token them
Token ID606
Feature activation-1.70e-02

Pos logit contributions
Acc+0.038
pped+0.038
eds+0.038
years+0.037
pping+0.037
Neg logit contributions
 So-0.033
 tale-0.030
 message-0.029
 prayer-0.029
 idea-0.029
Token with
Token ID351
Feature activation+5.14e-02

Pos logit contributions
pping+0.021
pped+0.021
eds+0.020
 flooding+0.020
idget+0.020
Neg logit contributions
 MacDonald-0.029
 tale-0.028
ww-0.026
 So-0.025
 prayer-0.025
Token it
Token ID340
Feature activation-2.06e-02

Pos logit contributions
 So+0.042
 fer+0.038
 prayer+0.038
 message+0.037
 hello+0.037
Neg logit contributions
pped-0.028
pping-0.027
Acc-0.027
eds-0.027
years-0.026
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
eds+0.032
Acc+0.032
pping+0.031
years+0.030
pped+0.030
Neg logit contributions
 So-0.031
 message-0.027
 tickets-0.027
 prayer-0.027
 says-0.027
Token They
Token ID1119
Feature activation+8.22e-02

Pos logit contributions
 So+0.029
 says+0.025
 message+0.024
 sounds+0.024
 y+0.024
Neg logit contributions
years-0.040
Acc-0.039
eds-0.038
pped-0.038
umper-0.038
Token lost
Token ID2626
Feature activation-1.72e-01

Pos logit contributions
 So+0.114
 message+0.101
 says+0.100
 tickets+0.099
 idea+0.098
Neg logit contributions
years-0.141
Acc-0.140
eds-0.135
pped-0.131
umper-0.130
Token their
Token ID511
Feature activation+1.35e-01

Pos logit contributions
years+0.283
Acc+0.279
eds+0.271
pped+0.264
pping+0.261
Neg logit contributions
 So-0.250
 message-0.221
 says-0.220
 tickets-0.217
 idea-0.216
Token passports
Token ID33052
Feature activation+2.75e-01

Pos logit contributions
 So+0.169
 says+0.141
 message+0.140
 hello+0.136
 idea+0.135
Neg logit contributions
years-0.246
Acc-0.242
eds-0.242
pping-0.230
pped-0.229
Token and
Token ID290
Feature activation-7.79e-03

Pos logit contributions
 So+0.367
 message+0.330
 tickets+0.322
 says+0.322
 idea+0.320
Neg logit contributions
years-0.484
Acc-0.477
eds-0.456
umper-0.447
pped-0.445
Token couldn
Token ID3521
Feature activation-1.47e-01

Pos logit contributions
years+0.012
Acc+0.012
eds+0.012
pped+0.011
pping+0.011
Neg logit contributions
 So-0.012
 message-0.011
 says-0.010
 tickets-0.010
 idea-0.010
Token't
Token ID470
Feature activation-1.57e-01

Pos logit contributions
years+0.231
Acc+0.212
eds+0.202
umper+0.201
mo+0.197
Neg logit contributions
 So-0.251
 says-0.225
 sounds-0.222
 message-0.215
 y-0.215
Token go
Token ID467
Feature activation+4.38e-02

Pos logit contributions
years+0.299
Acc+0.291
rou+0.275
eds+0.272
umper+0.266
Neg logit contributions
 So-0.199
 message-0.182
 idea-0.178
 sounds-0.175
 says-0.175
Token on
Token ID319
Feature activation+8.91e-02

Pos logit contributions
 So+0.056
 tickets+0.053
 goodbye+0.050
 message+0.050
 treats+0.049
Neg logit contributions
Acc-0.081
years-0.080
umper-0.073
pped-0.072
rou-0.072
Token.
Token ID13
Feature activation+4.15e-03

Pos logit contributions
 So+0.132
ww+0.114
 treats+0.111
 y+0.110
 prayer+0.110
Neg logit contributions
pped-0.135
pping-0.127
years-0.123
eds-0.121
rou-0.119
Token He
Token ID679
Feature activation-1.78e-02

Pos logit contributions
 message+0.008
 tale+0.008
 idea+0.007
ww+0.007
 route+0.007
Neg logit contributions
years-0.004
grown-0.004
Acc-0.004
mo-0.004
ores-0.004
Token looks
Token ID3073
Feature activation+6.74e-02

Pos logit contributions
years+0.035
Acc+0.034
eds+0.034
pped+0.033
mo+0.033
Neg logit contributions
 So-0.019
 message-0.016
 idea-0.016
 treats-0.015
 y-0.015
Token at
Token ID379
Feature activation+7.11e-02

Pos logit contributions
 So+0.105
 message+0.094
 prayer+0.091
 sounds+0.090
 tickets+0.089
Neg logit contributions
years-0.106
Acc-0.100
mo-0.094
rou-0.094
pped-0.093
Token their
Token ID511
Feature activation+1.21e-01

Pos logit contributions
 So+0.114
 tickets+0.110
 says+0.107
 y+0.103
 sounds+0.103
Neg logit contributions
Acc-0.110
years-0.108
eds-0.103
pping-0.101
rou-0.099
Token passports
Token ID33052
Feature activation+2.74e-01

Pos logit contributions
 So+0.156
ww+0.129
So+0.122
br+0.115
 tale+0.114
Neg logit contributions
years-0.212
eds-0.208
ind-0.201
rou-0.200
 flooding-0.192
Token.
Token ID13
Feature activation+3.37e-03

Pos logit contributions
 So+0.449
ww+0.393
 message+0.387
 prayer+0.382
 Sorry+0.379
Neg logit contributions
pping-0.370
pped-0.362
eds-0.362
years-0.358
Acc-0.347
Token He
Token ID679
Feature activation-1.78e-02

Pos logit contributions
 message+0.005
 tale+0.005
 idea+0.005
 sounds+0.005
 So+0.005
Neg logit contributions
years-0.005
Acc-0.004
grown-0.004
mo-0.004
umper-0.004
Token looks
Token ID3073
Feature activation+6.72e-02

Pos logit contributions
years+0.037
Acc+0.036
eds+0.035
pped+0.035
pping+0.034
Neg logit contributions
 So-0.018
 message-0.015
 idea-0.015
 tickets-0.014
 treats-0.014
Token at
Token ID379
Feature activation+7.15e-02

Pos logit contributions
 So+0.105
 message+0.095
 prayer+0.092
 sounds+0.090
 tickets+0.090
Neg logit contributions
years-0.104
Acc-0.097
rou-0.092
mo-0.092
pped-0.091
Token Sara
Token ID24799
Feature activation+6.22e-03

Pos logit contributions
 So+0.109
 tickets+0.107
 says+0.103
 hello+0.099
 sounds+0.099
Neg logit contributions
Acc-0.119
years-0.116
eds-0.111
pping-0.110
rou-0.108
Token going
Token ID1016
Feature activation+2.29e-02

Pos logit contributions
 So+0.086
 y+0.079
 says+0.079
 idea+0.078
 message+0.078
Neg logit contributions
years-0.089
Acc-0.088
eds-0.086
pped-0.083
umper-0.083
Token to
Token ID284
Feature activation-5.57e-03

Pos logit contributions
 So+0.029
 tickets+0.026
 says+0.025
 message+0.025
 treats+0.024
Neg logit contributions
Acc-0.043
years-0.043
eds-0.040
umper-0.040
pped-0.040
Token teach
Token ID4545
Feature activation+7.73e-02

Pos logit contributions
years+0.010
Acc+0.010
eds+0.010
pped+0.009
umper+0.009
Neg logit contributions
 So-0.007
 message-0.007
 tickets-0.006
 says-0.006
 idea-0.006
Token you
Token ID345
Feature activation+9.79e-02

Pos logit contributions
 So+0.091
 message+0.079
 idea+0.078
 says+0.077
 tickets+0.077
Neg logit contributions
years-0.147
Acc-0.145
eds-0.141
pped-0.138
pping-0.137
Token about
Token ID546
Feature activation-3.49e-02

Pos logit contributions
 So+0.131
 message+0.116
 says+0.115
 tickets+0.113
 idea+0.113
Neg logit contributions
years-0.170
Acc-0.168
eds-0.163
pped-0.159
pping-0.158
Token passports
Token ID33052
Feature activation+2.74e-01

Pos logit contributions
years+0.066
Acc+0.064
pped+0.063
umper+0.062
idget+0.061
Neg logit contributions
 So-0.043
 sounds-0.043
 y-0.042
 tickets-0.041
 treats-0.041
Token today
Token ID1909
Feature activation+7.47e-02

Pos logit contributions
 So+0.363
 message+0.319
 prayer+0.307
 idea+0.306
 says+0.306
Neg logit contributions
years-0.464
eds-0.456
Acc-0.451
pping-0.445
pped-0.437
Token."
Token ID526
Feature activation-7.23e-03

Pos logit contributions
 So+0.093
 message+0.085
 says+0.082
 fer+0.082
 idea+0.081
Neg logit contributions
years-0.127
eds-0.126
Acc-0.125
pping-0.123
pped-0.121
Token (space)
Token ID220
Feature activation+5.85e-02

Pos logit contributions
Acc+0.012
years+0.012
eds+0.012
umper+0.011
mo+0.011
Neg logit contributions
 message-0.009
 So-0.009
 tickets-0.009
 idea-0.009
 fer-0.009
Token (newline)
Token ID198
Feature activation+2.25e-02

Pos logit contributions
 So+0.067
 says+0.055
 tickets+0.048
 idea+0.047
 sounds+0.047
Neg logit contributions
years-0.128
Acc-0.122
pped-0.119
eds-0.118
pping-0.117
Token (newline)
Token ID198
Feature activation+2.16e-02

Pos logit contributions
 So+0.026
 says+0.021
 y+0.019
 message+0.018
 sounds+0.018
Neg logit contributions
years-0.047
Acc-0.045
pping-0.045
pped-0.045
eds-0.045
Token We
Token ID775
Feature activation+5.32e-02

Pos logit contributions
Acc+0.013
years+0.013
pped+0.012
mo+0.011
pping+0.011
Neg logit contributions
 message-0.011
 idea-0.011
 fer-0.011
 y-0.011
 treats-0.011
Token have
Token ID423
Feature activation-1.02e-01

Pos logit contributions
 So+0.063
 message+0.055
 idea+0.055
 says+0.055
 tickets+0.054
Neg logit contributions
years-0.103
Acc-0.103
eds-0.099
umper-0.095
pped-0.095
Token to
Token ID284
Feature activation-6.80e-03

Pos logit contributions
years+0.168
Acc+0.164
eds+0.160
pped+0.157
pping+0.152
Neg logit contributions
 So-0.146
 idea-0.131
 message-0.125
 prayer-0.125
 replies-0.124
Token show
Token ID905
Feature activation+1.28e-01

Pos logit contributions
pped+0.011
pping+0.011
years+0.011
Acc+0.011
eds+0.011
Neg logit contributions
 So-0.009
 says-0.008
 sounds-0.008
 replies-0.008
 prayer-0.008
Token our
Token ID674
Feature activation+8.64e-02

Pos logit contributions
 So+0.212
 idea+0.193
 message+0.187
 fer+0.185
 tale+0.183
Neg logit contributions
years-0.180
pped-0.169
eds-0.169
Acc-0.166
pping-0.161
Token passports
Token ID33052
Feature activation+2.73e-01

Pos logit contributions
 So+0.130
 replies+0.099
 idea+0.099
 says+0.098
 hello+0.098
Neg logit contributions
eds-0.137
years-0.135
Acc-0.130
ind-0.128
rou-0.125
Token to
Token ID284
Feature activation-5.63e-03

Pos logit contributions
 So+0.388
 fer+0.333
 message+0.329
 prayer+0.326
 idea+0.324
Neg logit contributions
eds-0.431
years-0.427
Acc-0.421
pping-0.414
pped-0.412
Token the
Token ID262
Feature activation+1.25e-02

Pos logit contributions
years+0.010
Acc+0.010
eds+0.009
pped+0.009
pping+0.009
Neg logit contributions
 So-0.008
 message-0.007
 says-0.007
 tickets-0.007
 y-0.007
Token lady
Token ID10846
Feature activation+5.43e-02

Pos logit contributions
 So+0.018
 says+0.015
 idea+0.015
 prayer+0.014
 message+0.014
Neg logit contributions
years-0.021
Acc-0.021
eds-0.020
pped-0.020
pping-0.019
Token there
Token ID612
Feature activation+6.57e-02

Pos logit contributions
 So+0.068
 message+0.061
 idea+0.059
 tickets+0.059
 prayer+0.058
Neg logit contributions
years-0.097
Acc-0.095
eds-0.094
pping-0.091
pped-0.091
Token.
Token ID13
Feature activation-6.34e-03

Pos logit contributions
 So+0.079
 message+0.070
 fer+0.070
 prayer+0.068
 tickets+0.067
Neg logit contributions
years-0.115
eds-0.115
Acc-0.113
pping-0.110
pped-0.109
Token Dad
Token ID17415
Feature activation-1.42e-02

Pos logit contributions
years+0.014
Acc+0.013
pped+0.012
umper+0.012
pping+0.011
Neg logit contributions
 So-0.030
 tale-0.030
 message-0.030
 idea-0.029
 fer-0.028
Token have
Token ID423
Feature activation-9.81e-02

Pos logit contributions
pping+0.024
pped+0.023
years+0.023
Acc+0.023
mo+0.023
Neg logit contributions
 So-0.019
 message-0.016
 sounds-0.016
 treats-0.016
 fer-0.015
Token to
Token ID284
Feature activation-4.22e-03

Pos logit contributions
years+0.155
Acc+0.148
eds+0.144
pped+0.142
grown+0.139
Neg logit contributions
 So-0.148
 idea-0.129
 message-0.127
 says-0.126
 replies-0.125
Token show
Token ID905
Feature activation+1.29e-01

Pos logit contributions
pping+0.006
pped+0.006
 flooding+0.006
years+0.006
Acc+0.006
Neg logit contributions
 So-0.006
 says-0.005
 replies-0.005
 sounds-0.005
 prayer-0.005
Token their
Token ID511
Feature activation+1.25e-01

Pos logit contributions
 So+0.236
 message+0.220
 idea+0.218
 fer+0.218
 tale+0.216
Neg logit contributions
years-0.150
eds-0.137
idget-0.134
pped-0.133
mo-0.133
Token passports
Token ID33052
Feature activation+2.73e-01

Pos logit contributions
 So+0.123
sc+0.095
br+0.094
ww+0.091
 replies+0.091
Neg logit contributions
eds-0.233
years-0.232
rou-0.231
ind-0.231
 flooding-0.231
Token to
Token ID284
Feature activation-3.97e-03

Pos logit contributions
 So+0.416
 fer+0.361
 Fer+0.351
 replies+0.350
 message+0.349
Neg logit contributions
eds-0.393
pping-0.382
years-0.382
pped-0.375
Acc-0.374
Token a
Token ID257
Feature activation+7.88e-02

Pos logit contributions
years+0.006
Acc+0.006
eds+0.006
pped+0.006
pping+0.005
Neg logit contributions
 So-0.006
 message-0.006
 says-0.005
 idea-0.005
 sounds-0.005
Token man
Token ID582
Feature activation+6.18e-02

Pos logit contributions
 So+0.113
 says+0.095
 replies+0.092
 shouts+0.090
 idea+0.089
Neg logit contributions
years-0.123
Acc-0.122
eds-0.119
rou-0.117
ind-0.116
Token.
Token ID13
Feature activation+7.89e-03

Pos logit contributions
 So+0.087
 message+0.079
 idea+0.078
 sounds+0.076
 prayer+0.076
Neg logit contributions
years-0.101
Acc-0.097
eds-0.097
pping-0.096
pped-0.095
Token A
Token ID317
Feature activation+6.12e-02

Pos logit contributions
 message+0.014
 tale+0.013
 idea+0.013
 fer+0.013
 treats+0.013
Neg logit contributions
years-0.010
Acc-0.009
mo-0.008
arter-0.008
eds-0.008
Token were
Token ID547
Feature activation+3.89e-02

Pos logit contributions
 So+0.096
 says+0.082
 y+0.081
 idea+0.080
 sounds+0.080
Neg logit contributions
eds-0.167
Acc-0.166
years-0.163
umper-0.160
arter-0.157
Token successful
Token ID4388
Feature activation-9.82e-03

Pos logit contributions
 So+0.052
 message+0.046
 says+0.045
 tickets+0.045
 idea+0.045
Neg logit contributions
years-0.069
Acc-0.068
eds-0.066
pped-0.065
umper-0.064
Token in
Token ID287
Feature activation+2.11e-02

Pos logit contributions
years+0.016
pping+0.015
grown+0.014
rou+0.014
Acc+0.014
Neg logit contributions
 So-0.014
 message-0.013
 prayer-0.012
 hello-0.012
 replies-0.012
Token getting
Token ID1972
Feature activation-6.44e-02

Pos logit contributions
 So+0.029
 tickets+0.027
 message+0.027
 idea+0.026
 says+0.026
Neg logit contributions
Acc-0.037
years-0.036
eds-0.034
umper-0.034
pped-0.033
Token their
Token ID511
Feature activation+1.30e-01

Pos logit contributions
years+0.112
Acc+0.112
eds+0.108
pped+0.106
pping+0.105
Neg logit contributions
 So-0.086
 message-0.076
 tickets-0.076
 says-0.076
 y-0.074
Token passports
Token ID33052
Feature activation+2.73e-01

Pos logit contributions
 So+0.175
 hello+0.145
 says+0.143
 message+0.142
 replies+0.140
Neg logit contributions
years-0.223
eds-0.211
Acc-0.211
pping-0.202
rou-0.200
Token.
Token ID13
Feature activation+1.45e-02

Pos logit contributions
 So+0.352
 message+0.300
 says+0.297
 prayer+0.296
 sounds+0.295
Neg logit contributions
years-0.475
Acc-0.468
eds-0.467
pping-0.462
pped-0.460
Token Lily
Token ID20037
Feature activation-7.94e-02

Pos logit contributions
 message+0.019
 So+0.018
 idea+0.018
 sounds+0.018
 y+0.018
Neg logit contributions
years-0.024
Acc-0.023
umper-0.022
mo-0.022
eds-0.022
Token was
Token ID373
Feature activation-5.50e-02

Pos logit contributions
years+0.139
Acc+0.136
eds+0.131
pped+0.130
pping+0.129
Neg logit contributions
 So-0.103
 message-0.094
 tickets-0.093
 idea-0.091
 prayer-0.091
Token so
Token ID523
Feature activation+1.15e-01

Pos logit contributions
years+0.105
Acc+0.103
eds+0.100
pped+0.098
pping+0.097
Neg logit contributions
 So-0.066
 message-0.057
 says-0.057
 tickets-0.056
 idea-0.056
Token happy
Token ID3772
Feature activation-2.63e-02

Pos logit contributions
 So+0.171
 message+0.155
 tickets+0.153
 sounds+0.151
 treats+0.150
Neg logit contributions
years-0.189
Acc-0.180
pped-0.167
rou-0.166
pping-0.164
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
 So+0.189
 tickets+0.171
 hello+0.171
ww+0.166
 prayer+0.166
Neg logit contributions
eds-0.198
pping-0.195
pped-0.190
years-0.189
rou-0.183
Token (newline)
Token ID198
Feature activation+2.08e-02

Pos logit contributions
 So+0.026
 message+0.024
 tickets+0.023
 idea+0.023
 prayer+0.023
Neg logit contributions
years-0.036
Acc-0.036
eds-0.034
pped-0.034
pping-0.033
Token (newline)
Token ID198
Feature activation+2.03e-02

Pos logit contributions
 So+0.026
 says+0.022
 message+0.022
 tickets+0.022
 sounds+0.022
Neg logit contributions
years-0.039
Acc-0.039
eds-0.037
pped-0.037
pping-0.037
TokenAfter
Token ID3260
Feature activation+2.60e-02

Pos logit contributions
 So+0.030
 sounds+0.026
 says+0.026
 tickets+0.026
 message+0.026
Neg logit contributions
Acc-0.033
years-0.033
eds-0.032
umper-0.032
arter-0.031
Token the
Token ID262
Feature activation+2.10e-02

Pos logit contributions
 So+0.040
 message+0.036
 says+0.035
 idea+0.035
 sounds+0.035
Neg logit contributions
years-0.041
Acc-0.039
pped-0.038
eds-0.037
pping-0.036
Token passports
Token ID33052
Feature activation+2.73e-01

Pos logit contributions
 So+0.037
ows+0.031
sc+0.030
 Sorry+0.029
 hi+0.028
Neg logit contributions
years-0.026
 End-0.025
Acc-0.024
pped-0.024
ini-0.024
Token were
Token ID547
Feature activation+4.26e-02

Pos logit contributions
 So+0.379
 message+0.318
 sounds+0.317
 tickets+0.311
 fer+0.310
Neg logit contributions
years-0.447
pped-0.440
Acc-0.432
pping-0.430
eds-0.422
Token decorated
Token ID24789
Feature activation-5.62e-02

Pos logit contributions
 So+0.063
 message+0.056
 says+0.055
 sounds+0.054
 tickets+0.054
Neg logit contributions
years-0.070
Acc-0.068
pped-0.065
eds-0.065
pping-0.064
Token,
Token ID11
Feature activation-3.60e-02

Pos logit contributions
years+0.077
eds+0.076
pping+0.075
pped+0.074
Acc+0.073
Neg logit contributions
 So-0.086
 shouts-0.077
 says-0.075
 replies-0.074
 prayer-0.074
Token the
Token ID262
Feature activation+2.10e-02

Pos logit contributions
years+0.050
Acc+0.044
pped+0.043
 flooding+0.042
grown+0.042
Neg logit contributions
 So-0.059
 tale-0.056
 message-0.056
 prayer-0.054
 list-0.053
Token two
Token ID734
Feature activation+7.26e-02

Pos logit contributions
 So+0.033
 Sorry+0.026
 hello+0.025
 says+0.025
 hi+0.024
Neg logit contributions
years-0.033
ind-0.031
pped-0.030
eds-0.030
 nowhere-0.030
Token to
Token ID284
Feature activation-9.21e-03

Pos logit contributions
years+0.042
Acc+0.041
eds+0.040
pped+0.039
pping+0.039
Neg logit contributions
 So-0.035
 message-0.030
 says-0.030
 idea-0.029
 prayer-0.029
Token many
Token ID867
Feature activation+3.66e-02

Pos logit contributions
years+0.017
Acc+0.016
eds+0.016
pped+0.016
pping+0.016
Neg logit contributions
 So-0.012
 says-0.010
 message-0.010
 tickets-0.010
 sounds-0.010
Token places
Token ID4113
Feature activation+1.40e-01

Pos logit contributions
 So+0.040
 message+0.034
 says+0.034
 tickets+0.033
 idea+0.033
Neg logit contributions
years-0.074
Acc-0.073
eds-0.071
pped-0.070
pping-0.069
Token with
Token ID351
Feature activation+4.67e-02

Pos logit contributions
 So+0.161
 message+0.140
 tickets+0.138
 says+0.138
 y+0.135
Neg logit contributions
years-0.274
Acc-0.272
eds-0.263
umper-0.257
pped-0.257
Token their
Token ID511
Feature activation+1.16e-01

Pos logit contributions
 So+0.058
 tickets+0.056
 y+0.054
 says+0.053
 message+0.053
Neg logit contributions
Acc-0.088
years-0.085
umper-0.083
pping-0.082
pped-0.082
Token passports
Token ID33052
Feature activation+2.73e-01

Pos logit contributions
 So+0.139
 tickets+0.127
 message+0.126
 says+0.124
 y+0.122
Neg logit contributions
years-0.224
Acc-0.222
pped-0.214
pping-0.213
umper-0.212
Token,
Token ID11
Feature activation-5.24e-02

Pos logit contributions
 So+0.331
 message+0.285
 says+0.280
 sounds+0.276
 prayer+0.276
Neg logit contributions
years-0.502
Acc-0.494
eds-0.487
pping-0.478
pped-0.477
Token but
Token ID475
Feature activation+4.24e-02

Pos logit contributions
years+0.088
Acc+0.087
eds+0.084
pped+0.082
pping+0.082
Neg logit contributions
 So-0.072
 message-0.063
 says-0.062
 idea-0.061
 sounds-0.060
Token this
Token ID428
Feature activation+1.54e-02

Pos logit contributions
 So+0.053
 message+0.046
 says+0.044
 idea+0.044
 tickets+0.044
Neg logit contributions
years-0.079
Acc-0.077
eds-0.074
pped-0.073
pping-0.072
Token was
Token ID373
Feature activation-6.02e-02

Pos logit contributions
 So+0.023
 click+0.019
 Sorry+0.019
 fer+0.018
 hello+0.018
Neg logit contributions
 nowhere-0.022
eds-0.022
pped-0.021
Acc-0.021
grown-0.021
Token their
Token ID511
Feature activation+1.18e-01

Pos logit contributions
years+0.111
Acc+0.111
eds+0.108
umper+0.107
pped+0.106
Neg logit contributions
 So-0.074
 y-0.071
 tickets-0.069
 message-0.069
 says-0.069
Token her
Token ID607
Feature activation+6.67e-03

Pos logit contributions
 So+0.122
 says+0.113
 tickets+0.112
 y+0.111
 sounds+0.108
Neg logit contributions
Acc-0.135
eds-0.126
years-0.126
arter-0.125
rou-0.121
Token toys
Token ID14958
Feature activation+4.90e-02

Pos logit contributions
 So+0.009
 message+0.008
 says+0.008
 tickets+0.008
 sounds+0.008
Neg logit contributions
years-0.011
Acc-0.011
eds-0.011
pped-0.011
pping-0.010
Token,
Token ID11
Feature activation-6.76e-02

Pos logit contributions
 So+0.058
 message+0.051
 says+0.050
 tickets+0.050
 idea+0.049
Neg logit contributions
years-0.093
Acc-0.092
eds-0.089
pped-0.087
pping-0.086
Token but
Token ID475
Feature activation+3.04e-02

Pos logit contributions
years+0.114
Acc+0.113
eds+0.108
pped+0.106
pping+0.105
Neg logit contributions
 So-0.093
 message-0.083
 says-0.082
 tickets-0.082
 idea-0.081
Token she
Token ID673
Feature activation-2.35e-02

Pos logit contributions
 So+0.041
 message+0.037
 says+0.036
 tickets+0.036
 idea+0.036
Neg logit contributions
years-0.052
Acc-0.052
eds-0.050
pped-0.049
pping-0.048
Token accidentally
Token ID14716
Feature activation-2.84e-01

Pos logit contributions
eds+0.043
years+0.043
Acc+0.043
umper+0.041
pped+0.041
Neg logit contributions
 So-0.032
 says-0.029
 y-0.028
 idea-0.027
 sounds-0.027
Token knocked
Token ID13642
Feature activation-1.21e-01

Pos logit contributions
years+0.486
Acc+0.470
eds+0.433
umper+0.432
arter+0.427
Neg logit contributions
 So-0.399
 says-0.383
 y-0.379
 message-0.377
 tickets-0.374
Token over
Token ID625
Feature activation-1.11e-01

Pos logit contributions
Acc+0.229
years+0.228
arter+0.222
ini+0.217
rou+0.213
Neg logit contributions
 So-0.147
 says-0.136
 y-0.131
 treats-0.130
 tickets-0.128
Token a
Token ID257
Feature activation+7.18e-02

Pos logit contributions
Acc+0.199
years+0.199
eds+0.193
pping+0.188
pped+0.187
Neg logit contributions
 So-0.148
 tickets-0.131
 message-0.129
 y-0.128
 says-0.128
Token v
Token ID410
Feature activation-7.54e-02

Pos logit contributions
 So+0.088
 message+0.082
 y+0.080
 says+0.078
 idea+0.078
Neg logit contributions
years-0.136
Acc-0.134
pped-0.127
eds-0.127
pping-0.127
Tokenase
Token ID589
Feature activation-1.51e-01

Pos logit contributions
years+0.082
Acc+0.081
eds+0.071
mo+0.071
pping+0.070
Neg logit contributions
 So-0.155
 message-0.141
 tickets-0.138
 says-0.137
 idea-0.137
Token,
Token ID11
Feature activation-7.30e-02

Pos logit contributions
 So+0.062
 message+0.054
 says+0.053
 tickets+0.053
 idea+0.053
Neg logit contributions
years-0.095
Acc-0.094
eds-0.091
pped-0.089
umper-0.088
Token but
Token ID475
Feature activation+2.92e-02

Pos logit contributions
years+0.126
Acc+0.124
eds+0.120
pped+0.117
pping+0.116
Neg logit contributions
 So-0.099
 message-0.088
 says-0.086
 tickets-0.086
 idea-0.085
Token one
Token ID530
Feature activation-4.43e-02

Pos logit contributions
 So+0.039
 says+0.035
 message+0.035
 tickets+0.034
 idea+0.034
Neg logit contributions
years-0.051
Acc-0.050
eds-0.049
pped-0.048
pping-0.047
Token day
Token ID1110
Feature activation+6.29e-03

Pos logit contributions
years+0.072
Acc+0.071
mo+0.067
eds+0.067
rou+0.065
Neg logit contributions
 So-0.063
 tickets-0.056
 message-0.056
 fer-0.054
 list-0.054
Token he
Token ID339
Feature activation-3.69e-02

Pos logit contributions
 So+0.008
 message+0.007
 idea+0.007
 tickets+0.007
 prayer+0.007
Neg logit contributions
years-0.011
Acc-0.011
pped-0.010
pping-0.010
eds-0.010
Token accidentally
Token ID14716
Feature activation-2.80e-01

Pos logit contributions
years+0.063
Acc+0.062
eds+0.060
pped+0.058
pping+0.058
Neg logit contributions
 So-0.052
 says-0.047
 message-0.046
 y-0.046
 idea-0.045
Token broke
Token ID6265
Feature activation-1.01e-01

Pos logit contributions
years+0.476
Acc+0.462
eds+0.430
umper+0.420
mo+0.413
Neg logit contributions
 So-0.405
 says-0.375
 message-0.372
 y-0.367
 tickets-0.366
Token it
Token ID340
Feature activation-3.27e-02

Pos logit contributions
years+0.203
Acc+0.203
umper+0.186
eds+0.185
arter+0.182
Neg logit contributions
 So-0.118
 y-0.106
 treats-0.103
 tickets-0.102
 sounds-0.101
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
years+0.060
Acc+0.060
eds+0.059
pped+0.058
pping+0.058
Neg logit contributions
 So-0.039
 message-0.034
 tickets-0.034
 says-0.034
 prayer-0.033
Token He
Token ID679
Feature activation-1.58e-02

Pos logit contributions
 So+0.019
 message+0.017
 tickets+0.017
 says+0.017
 idea+0.017
Neg logit contributions
years-0.025
Acc-0.025
eds-0.024
pped-0.023
umper-0.023
Token was
Token ID373
Feature activation-5.80e-02

Pos logit contributions
Acc+0.030
years+0.030
eds+0.029
arter+0.028
umper+0.028
Neg logit contributions
 So-0.020
 says-0.018
 y-0.017
 message-0.017
 sounds-0.017
Token while
Token ID981
Feature activation+2.98e-02

Pos logit contributions
years+0.103
Acc+0.102
eds+0.099
pped+0.098
pping+0.097
Neg logit contributions
 So-0.057
 message-0.049
 says-0.049
 idea-0.048
 tickets-0.048
Token,
Token ID11
Feature activation-5.78e-02

Pos logit contributions
 So+0.038
 says+0.033
 message+0.033
 tickets+0.033
 idea+0.032
Neg logit contributions
years-0.054
Acc-0.054
eds-0.052
pped-0.051
pping-0.051
Token but
Token ID475
Feature activation+3.47e-02

Pos logit contributions
years+0.099
Acc+0.098
eds+0.095
pped+0.093
pping+0.092
Neg logit contributions
 So-0.080
 message-0.070
 says-0.070
 tickets-0.069
 idea-0.069
Token then
Token ID788
Feature activation-9.17e-02

Pos logit contributions
 So+0.049
 message+0.043
 says+0.043
 idea+0.043
 tickets+0.043
Neg logit contributions
years-0.059
Acc-0.058
eds-0.057
pped-0.055
pping-0.054
Token she
Token ID673
Feature activation-1.66e-02

Pos logit contributions
years+0.152
Acc+0.147
pped+0.144
mo+0.141
umper+0.140
Neg logit contributions
 So-0.124
 message-0.113
 tickets-0.110
 idea-0.109
 says-0.108
Token accidentally
Token ID14716
Feature activation-2.79e-01

Pos logit contributions
Acc+0.027
years+0.027
arter+0.026
eds+0.026
umper+0.025
Neg logit contributions
 So-0.025
 says-0.023
 sounds-0.022
 y-0.022
 message-0.021
Token threw
Token ID9617
Feature activation-6.45e-02

Pos logit contributions
years+0.490
Acc+0.475
eds+0.446
umper+0.435
arter+0.435
Neg logit contributions
 So-0.381
 says-0.358
 message-0.356
 y-0.352
 sounds-0.347
Token it
Token ID340
Feature activation-3.19e-02

Pos logit contributions
years+0.111
Acc+0.110
eds+0.106
pping+0.103
pped+0.102
Neg logit contributions
 So-0.089
 says-0.079
 message-0.079
 tickets-0.078
 y-0.078
Token too
Token ID1165
Feature activation-1.37e-02

Pos logit contributions
years+0.058
Acc+0.058
eds+0.055
umper+0.054
pped+0.054
Neg logit contributions
 So-0.041
 message-0.036
 says-0.035
 idea-0.035
 y-0.035
Token hard
Token ID1327
Feature activation-6.01e-02

Pos logit contributions
years+0.026
Acc+0.025
eds+0.025
umper+0.024
pped+0.024
Neg logit contributions
 So-0.017
 says-0.015
 y-0.015
 message-0.015
 idea-0.015
Token and
Token ID290
Feature activation-1.24e-02

Pos logit contributions
years+0.112
Acc+0.110
eds+0.107
pped+0.105
pping+0.104
Neg logit contributions
 So-0.074
 message-0.065
 says-0.064
 tickets-0.064
 idea-0.063
Token her
Token ID607
Feature activation+2.66e-02

Pos logit contributions
years+0.046
Acc+0.045
eds+0.043
pped+0.042
pping+0.042
Neg logit contributions
 So-0.036
 message-0.032
 tickets-0.031
 says-0.031
 idea-0.031
Token friend
Token ID1545
Feature activation-7.89e-02

Pos logit contributions
 So+0.040
 message+0.036
 says+0.036
 tickets+0.035
 idea+0.035
Neg logit contributions
years-0.043
Acc-0.042
eds-0.041
pped-0.040
pping-0.039
Token said
Token ID531
Feature activation+1.58e-01

Pos logit contributions
years+0.142
Acc+0.139
eds+0.135
pped+0.134
umper+0.132
Neg logit contributions
 So-0.099
 message-0.088
 tickets-0.087
 idea-0.085
 prayer-0.085
Token that
Token ID326
Feature activation-6.60e-02

Pos logit contributions
 So+0.202
 message+0.179
 tickets+0.178
 says+0.173
 idea+0.172
Neg logit contributions
years-0.280
Acc-0.277
pped-0.264
pping-0.259
eds-0.253
Token she
Token ID673
Feature activation-1.79e-02

Pos logit contributions
Acc+0.107
years+0.107
eds+0.101
pped+0.100
pping+0.099
Neg logit contributions
 So-0.095
 message-0.082
 tickets-0.082
 idea-0.081
 says-0.081
Token accidentally
Token ID14716
Feature activation-2.76e-01

Pos logit contributions
Acc+0.032
years+0.032
eds+0.031
pping+0.030
pped+0.030
Neg logit contributions
 So-0.024
 says-0.022
 sounds-0.021
 message-0.021
 y-0.021
Token spilled
Token ID31896
Feature activation-1.38e-01

Pos logit contributions
years+0.475
Acc+0.463
eds+0.432
umper+0.424
mo+0.416
Neg logit contributions
 So-0.388
 says-0.361
 message-0.360
 tickets-0.351
 y-0.351
Token juice
Token ID13135
Feature activation+4.95e-02

Pos logit contributions
years+0.262
Acc+0.259
eds+0.247
umper+0.242
pping+0.241
Neg logit contributions
 So-0.168
 y-0.155
 treats-0.152
 says-0.151
 sounds-0.149
Token on
Token ID319
Feature activation+8.18e-02

Pos logit contributions
 So+0.054
 message+0.046
 says+0.046
 tickets+0.045
 idea+0.045
Neg logit contributions
years-0.100
Acc-0.098
eds-0.095
umper-0.092
pped-0.092
Token it
Token ID340
Feature activation-3.40e-02

Pos logit contributions
 So+0.099
 y+0.091
 treats+0.088
 sounds+0.086
 says+0.084
Neg logit contributions
years-0.164
Acc-0.161
rou-0.154
eds-0.150
umper-0.150
Token.
Token ID13
Feature activation+7.66e-03

Pos logit contributions
years+0.064
Acc+0.063
eds+0.062
pped+0.061
pping+0.061
Neg logit contributions
 So-0.040
 message-0.035
 tickets-0.035
 says-0.034
 prayer-0.034
Token while
Token ID981
Feature activation+2.72e-02

Pos logit contributions
 So+0.076
 says+0.066
 tickets+0.063
 sounds+0.062
 message+0.061
Neg logit contributions
Acc-0.132
years-0.129
eds-0.125
mo-0.121
pping-0.121
Token,
Token ID11
Feature activation-5.47e-02

Pos logit contributions
 So+0.031
 message+0.026
 says+0.026
 tickets+0.026
 idea+0.026
Neg logit contributions
years-0.053
Acc-0.053
eds-0.051
pped-0.050
umper-0.050
Token but
Token ID475
Feature activation+3.70e-02

Pos logit contributions
years+0.097
Acc+0.096
eds+0.092
pped+0.091
pping+0.090
Neg logit contributions
 So-0.072
 message-0.064
 says-0.063
 tickets-0.062
 idea-0.062
Token then
Token ID788
Feature activation-9.22e-02

Pos logit contributions
 So+0.058
 message+0.052
 tickets+0.051
 says+0.051
 idea+0.051
Neg logit contributions
years-0.056
Acc-0.055
eds-0.053
pped-0.052
pping-0.051
Token he
Token ID339
Feature activation-2.60e-02

Pos logit contributions
years+0.144
pped+0.139
Acc+0.137
mo+0.134
pping+0.133
Neg logit contributions
 So-0.129
 message-0.118
 tickets-0.115
 idea-0.115
 prayer-0.111
Token accidentally
Token ID14716
Feature activation-2.76e-01

Pos logit contributions
Acc+0.043
years+0.042
eds+0.041
pped+0.039
pping+0.039
Neg logit contributions
 So-0.039
 says-0.035
 message-0.034
 sounds-0.034
 tickets-0.034
Token kicked
Token ID12165
Feature activation-1.16e-01

Pos logit contributions
years+0.461
Acc+0.453
eds+0.423
umper+0.413
mo+0.405
Neg logit contributions
 So-0.398
 message-0.369
 says-0.369
 tickets-0.364
 y-0.360
Token it
Token ID340
Feature activation-3.00e-02

Pos logit contributions
Acc+0.221
years+0.219
eds+0.206
umper+0.202
ini+0.200
Neg logit contributions
 So-0.147
 y-0.130
 says-0.130
 sounds-0.128
 message-0.127
Token too
Token ID1165
Feature activation-1.37e-02

Pos logit contributions
years+0.053
Acc+0.052
eds+0.051
pped+0.050
pping+0.050
Neg logit contributions
 So-0.039
 message-0.034
 tickets-0.034
 says-0.034
 prayer-0.033
Token hard
Token ID1327
Feature activation-5.90e-02

Pos logit contributions
years+0.026
Acc+0.025
eds+0.024
umper+0.024
pped+0.024
Neg logit contributions
 So-0.018
 says-0.015
 message-0.015
 y-0.015
 idea-0.015
Token and
Token ID290
Feature activation-9.85e-03

Pos logit contributions
years+0.104
Acc+0.102
eds+0.101
pping+0.100
pped+0.100
Neg logit contributions
 So-0.075
 message-0.068
 tickets-0.066
 prayer-0.065
 says-0.065
Token he
Token ID339
Feature activation-3.59e-02

Pos logit contributions
 So+0.030
 says+0.026
 tickets+0.026
 y+0.026
 message+0.026
Neg logit contributions
Acc-0.052
years-0.052
eds-0.050
umper-0.049
pping-0.049
Token was
Token ID373
Feature activation-5.95e-02

Pos logit contributions
years+0.064
Acc+0.064
eds+0.061
pped+0.060
pping+0.060
Neg logit contributions
 So-0.048
 says-0.042
 message-0.041
 tickets-0.041
 sounds-0.040
Token playing
Token ID2712
Feature activation-1.29e-01

Pos logit contributions
years+0.091
Acc+0.090
ini+0.085
eds+0.085
umper+0.085
Neg logit contributions
 So-0.091
 y-0.091
 idea-0.086
 goodbye-0.085
 treats-0.084
Token,
Token ID11
Feature activation-5.18e-02

Pos logit contributions
years+0.265
Acc+0.262
eds+0.255
umper+0.249
rou+0.245
Neg logit contributions
 So-0.141
 idea-0.122
 y-0.120
 treats-0.118
 message-0.117
Token he
Token ID339
Feature activation-2.54e-02

Pos logit contributions
years+0.103
Acc+0.102
eds+0.099
pped+0.097
pping+0.097
Neg logit contributions
 So-0.057
 message-0.049
 says-0.048
 tickets-0.048
 sounds-0.048
Token accidentally
Token ID14716
Feature activation-2.75e-01

Pos logit contributions
Acc+0.043
arter+0.042
years+0.042
eds+0.041
umper+0.040
Neg logit contributions
 So-0.037
 says-0.033
 sounds-0.033
 y-0.033
 message-0.031
Token lost
Token ID2626
Feature activation-1.84e-01

Pos logit contributions
years+0.495
Acc+0.481
eds+0.439
ini+0.436
umper+0.435
Neg logit contributions
 So-0.367
 says-0.348
 y-0.343
 message-0.341
 sounds-0.341
Token a
Token ID257
Feature activation+7.22e-02

Pos logit contributions
years+0.385
Acc+0.385
umper+0.354
eds+0.353
pping+0.351
Neg logit contributions
 So-0.189
 treats-0.176
 y-0.176
 message-0.172
 sounds-0.170
Token special
Token ID2041
Feature activation-1.87e-02

Pos logit contributions
 message+0.078
 y+0.075
 So+0.074
 chest+0.070
 idea+0.070
Neg logit contributions
years-0.156
Acc-0.145
pped-0.141
pping-0.141
umper-0.141
Token button
Token ID4936
Feature activation+4.77e-02

Pos logit contributions
years+0.035
Acc+0.035
eds+0.034
pped+0.033
pping+0.033
Neg logit contributions
 So-0.023
 message-0.020
 says-0.019
 tickets-0.019
 idea-0.019
Token from
Token ID422
Feature activation+1.29e-01

Pos logit contributions
 So+0.057
 message+0.049
 says+0.049
 tickets+0.048
 idea+0.048
Neg logit contributions
years-0.090
Acc-0.089
eds-0.087
pped-0.085
pping-0.085
Token with
Token ID351
Feature activation+4.65e-02

Pos logit contributions
years+0.257
Acc+0.252
eds+0.247
umper+0.243
pped+0.237
Neg logit contributions
 So-0.148
 idea-0.129
 y-0.129
 sounds-0.128
 message-0.127
Token her
Token ID607
Feature activation+3.37e-02

Pos logit contributions
 y+0.048
 treats+0.046
 snacks+0.044
 sounds+0.040
 tickets+0.039
Neg logit contributions
Acc-0.101
pping-0.098
umper-0.098
arter-0.097
ini-0.096
Token ball
Token ID2613
Feature activation-3.37e-02

Pos logit contributions
 So+0.040
 y+0.038
 treats+0.037
 idea+0.036
 message+0.036
Neg logit contributions
years-0.066
Acc-0.066
pped-0.064
pping-0.063
umper-0.062
Token when
Token ID618
Feature activation+2.40e-02

Pos logit contributions
years+0.072
Acc+0.071
eds+0.067
umper+0.065
arter+0.065
Neg logit contributions
 So-0.037
 y-0.030
 idea-0.030
 sounds-0.029
 treats-0.029
Token it
Token ID340
Feature activation-3.35e-02

Pos logit contributions
 So+0.023
 y+0.021
 goodbye+0.020
 treats+0.020
 hello+0.020
Neg logit contributions
eds-0.050
years-0.050
Acc-0.050
umper-0.049
pping-0.048
Token accidentally
Token ID14716
Feature activation-2.75e-01

Pos logit contributions
years+0.059
Acc+0.059
eds+0.057
pped+0.055
umper+0.055
Neg logit contributions
 So-0.045
 message-0.039
 says-0.039
 tickets-0.038
 sounds-0.038
Token hit
Token ID2277
Feature activation-1.61e-01

Pos logit contributions
years+0.485
Acc+0.468
umper+0.432
eds+0.427
ini+0.423
Neg logit contributions
 So-0.376
 says-0.355
 message-0.355
 y-0.354
 sounds-0.348
Token her
Token ID607
Feature activation+3.85e-02

Pos logit contributions
Acc+0.300
years+0.297
eds+0.280
umper+0.277
pping+0.276
Neg logit contributions
 So-0.203
 sounds-0.185
 says-0.184
 y-0.183
 treats-0.179
Token mom
Token ID1995
Feature activation-9.08e-02

Pos logit contributions
 So+0.057
 message+0.051
 says+0.050
 idea+0.050
 sounds+0.049
Neg logit contributions
years-0.065
Acc-0.063
eds-0.060
pped-0.059
umper-0.059
Token's
Token ID338
Feature activation+2.86e-02

Pos logit contributions
years+0.150
Acc+0.148
eds+0.138
umper+0.134
pped+0.133
Neg logit contributions
 So-0.135
 says-0.119
 message-0.119
 y-0.116
 sounds-0.116
Token ash
Token ID12530
Feature activation-7.56e-02

Pos logit contributions
 So+0.040
 message+0.037
 idea+0.036
 sounds+0.035
 says+0.035
Neg logit contributions
years-0.050
Acc-0.049
umper-0.046
eds-0.046
pped-0.046
Token an
Token ID281
Feature activation+6.13e-02

Pos logit contributions
years+0.032
Acc+0.031
eds+0.030
pped+0.030
pping+0.029
Neg logit contributions
 So-0.023
 message-0.020
 says-0.020
 tickets-0.020
 idea-0.020
Token apple
Token ID17180
Feature activation-6.91e-02

Pos logit contributions
 So+0.085
 message+0.078
 idea+0.077
 y+0.076
 tickets+0.076
Neg logit contributions
Acc-0.106
years-0.105
eds-0.102
pped-0.099
pping-0.099
Token,
Token ID11
Feature activation-5.01e-02

Pos logit contributions
years+0.123
Acc+0.122
eds+0.120
pped+0.117
pping+0.117
Neg logit contributions
 So-0.086
 message-0.076
 tickets-0.075
 says-0.074
 prayer-0.074
Token but
Token ID475
Feature activation+3.95e-02

Pos logit contributions
years+0.101
Acc+0.101
pping+0.096
umper+0.096
eds+0.096
Neg logit contributions
 So-0.048
 message-0.045
 prayer-0.043
 tickets-0.043
 idea-0.042
Token he
Token ID339
Feature activation-2.50e-02

Pos logit contributions
 So+0.043
 says+0.036
 y+0.035
 idea+0.034
 message+0.034
Neg logit contributions
years-0.082
Acc-0.082
eds-0.081
pping-0.078
pped-0.077
Token accidentally
Token ID14716
Feature activation-2.75e-01

Pos logit contributions
Acc+0.047
years+0.047
eds+0.046
arter+0.044
pping+0.044
Neg logit contributions
 So-0.033
 says-0.029
 y-0.029
 idea-0.027
 message-0.026
Token knocked
Token ID13642
Feature activation-1.19e-01

Pos logit contributions
years+0.487
Acc+0.467
eds+0.434
umper+0.430
ini+0.426
Neg logit contributions
 So-0.380
 says-0.360
 y-0.358
 message-0.348
 sounds-0.340
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
years+0.241
Acc+0.236
arter+0.235
rou+0.223
ini+0.223
Neg logit contributions
 So-0.140
 says-0.129
 y-0.123
 treats-0.113
 said-0.112
Token bowl
Token ID9396
Feature activation+3.26e-02

Pos logit contributions
 So+0.019
 y+0.017
 message+0.017
 idea+0.017
 treats+0.017
Neg logit contributions
years-0.030
Acc-0.029
eds-0.028
umper-0.028
pping-0.028
Token over
Token ID625
Feature activation-1.11e-01

Pos logit contributions
 So+0.043
 message+0.037
 says+0.037
 idea+0.037
 y+0.037
Neg logit contributions
years-0.060
Acc-0.059
eds-0.056
umper-0.054
pped-0.054
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
years+0.248
Acc+0.244
eds+0.234
mo+0.233
arter+0.233
Neg logit contributions
 So-0.110
 y-0.095
 treats-0.089
 says-0.087
 tickets-0.083
Token hair
Token ID4190
Feature activation+4.29e-02

Pos logit contributions
years+0.160
Acc+0.158
rou+0.148
eds+0.147
pping+0.146
Neg logit contributions
 prayer-0.119
 message-0.112
 So-0.111
 Sorry-0.109
 replies-0.105
Token.
Token ID13
Feature activation+4.06e-03

Pos logit contributions
 So+0.065
 message+0.059
 prayer+0.057
 tickets+0.056
 fer+0.056
Neg logit contributions
pped-0.063
Acc-0.062
eds-0.061
pping-0.061
years-0.059
Token Suddenly
Token ID24975
Feature activation-6.14e-02

Pos logit contributions
 So+0.005
 says+0.004
 hello+0.004
 tickets+0.004
 prayer+0.004
Neg logit contributions
years-0.008
Acc-0.008
eds-0.008
pped-0.007
pping-0.007
Token,
Token ID11
Feature activation-5.24e-02

Pos logit contributions
years+0.108
Acc+0.104
pped+0.104
eds+0.103
mo+0.102
Neg logit contributions
 So-0.076
 message-0.067
 idea-0.064
 tickets-0.064
 prayer-0.064
Token he
Token ID339
Feature activation-2.15e-02

Pos logit contributions
years+0.087
Acc+0.086
eds+0.082
pped+0.082
umper+0.080
Neg logit contributions
 So-0.074
 message-0.066
 says-0.064
 tickets-0.064
 idea-0.064
Token accidentally
Token ID14716
Feature activation-2.74e-01

Pos logit contributions
Acc+0.038
years+0.036
umper+0.034
arter+0.034
eds+0.034
Neg logit contributions
 So-0.030
 says-0.029
 sounds-0.028
 message-0.027
 shouts-0.027
Token cut
Token ID2005
Feature activation-7.34e-02

Pos logit contributions
years+0.464
Acc+0.454
eds+0.422
umper+0.416
arter+0.406
Neg logit contributions
 So-0.390
 says-0.362
 message-0.360
 y-0.356
 tickets-0.353
Token his
Token ID465
Feature activation+3.21e-02

Pos logit contributions
Acc+0.135
years+0.135
eds+0.127
umper+0.124
pping+0.124
Neg logit contributions
 So-0.094
 says-0.082
 sounds-0.081
 message-0.081
 tickets-0.081
Token finger
Token ID7660
Feature activation-4.04e-02

Pos logit contributions
 So+0.043
 says+0.036
 tickets+0.036
 message+0.036
 prayer+0.036
Neg logit contributions
years-0.055
eds-0.054
Acc-0.054
pped-0.052
mo-0.051
Token and
Token ID290
Feature activation-7.86e-03

Pos logit contributions
years+0.077
Acc+0.076
eds+0.074
pped+0.073
pping+0.072
Neg logit contributions
 So-0.047
 message-0.041
 tickets-0.041
 says-0.041
 prayer-0.040
Token started
Token ID2067
Feature activation-6.07e-02

Pos logit contributions
years+0.014
Acc+0.014
eds+0.013
umper+0.013
pping+0.013
Neg logit contributions
 So-0.011
 says-0.010
 message-0.010
 y-0.010
 sounds-0.009
Token were
Token ID547
Feature activation+3.77e-02

Pos logit contributions
 So+0.117
 message+0.107
 says+0.104
 idea+0.104
 tickets+0.104
Neg logit contributions
years-0.128
Acc-0.125
eds-0.120
pped-0.120
pping-0.118
Token playing
Token ID2712
Feature activation-1.32e-01

Pos logit contributions
 So+0.069
 tickets+0.063
 message+0.063
 says+0.063
 y+0.062
Neg logit contributions
years-0.048
Acc-0.048
eds-0.045
umper-0.043
pped-0.043
Token,
Token ID11
Feature activation-4.27e-02

Pos logit contributions
years+0.252
Acc+0.250
eds+0.242
pped+0.238
pping+0.236
Neg logit contributions
 So-0.157
 message-0.136
 says-0.134
 tickets-0.134
 idea-0.131
Token Tim
Token ID5045
Feature activation-1.19e-01

Pos logit contributions
years+0.060
eds+0.059
Acc+0.058
pping+0.055
umper+0.055
Neg logit contributions
 So-0.075
 y-0.067
 treats-0.067
 idea-0.066
 sounds-0.066
Tokenmy
Token ID1820
Feature activation-2.10e-01

Pos logit contributions
years+0.123
arter+0.114
Acc+0.112
eds+0.103
 flooding+0.098
Neg logit contributions
 So-0.264
 says-0.241
 sounds-0.234
 idea-0.232
 said-0.229
Token accidentally
Token ID14716
Feature activation-2.73e-01

Pos logit contributions
years+0.389
Acc+0.382
eds+0.369
arter+0.360
pping+0.356
Neg logit contributions
 So-0.275
 says-0.240
 message-0.235
 sounds-0.233
 y-0.230
Token broke
Token ID6265
Feature activation-1.00e-01

Pos logit contributions
years+0.470
Acc+0.451
eds+0.420
umper+0.412
arter+0.410
Neg logit contributions
 So-0.388
 says-0.365
 y-0.361
 message-0.361
 sounds-0.358
Token his
Token ID465
Feature activation+3.10e-02

Pos logit contributions
years+0.219
Acc+0.218
arter+0.200
eds+0.199
pping+0.198
Neg logit contributions
 So-0.099
 y-0.089
 treats-0.086
 sounds-0.083
 fer-0.082
Token new
Token ID649
Feature activation-3.81e-02

Pos logit contributions
 So+0.040
 message+0.036
 says+0.035
 idea+0.035
 tickets+0.035
Neg logit contributions
years-0.056
Acc-0.056
eds-0.053
pped-0.053
pping-0.052
Token toy
Token ID13373
Feature activation-4.49e-02

Pos logit contributions
years+0.077
Acc+0.075
pped+0.072
umper+0.072
pping+0.071
Neg logit contributions
 So-0.044
 message-0.041
 idea-0.040
 tickets-0.039
 treats-0.039
Token soldier
Token ID10686
Feature activation+2.35e-02

Pos logit contributions
years+0.089
Acc+0.088
eds+0.085
pped+0.083
umper+0.083
Neg logit contributions
 So-0.051
 message-0.044
 says-0.043
 tickets-0.043
 idea-0.042
Token.
Token ID13
Feature activation+3.81e-04

Pos logit contributions
 So+0.068
 message+0.060
 says+0.059
 tickets+0.058
 idea+0.058
Neg logit contributions
years-0.106
Acc-0.104
eds-0.101
pped-0.099
pping-0.098
Token
Token ID198
Feature activation+3.61e-03

Pos logit contributions
 So+0.001
 says+0.000
 hello+0.000
 tickets+0.000
 prayer+0.000
Neg logit contributions
eds-0.001
years-0.001
Acc-0.001
pping-0.001
pped-0.001
Token
Token ID198
Feature activation+3.92e-03

Pos logit contributions
 So+0.004
So+0.003
 says+0.003
 shouts+0.002
 tickets+0.002
Neg logit contributions
pping-0.008
years-0.008
arter-0.008
Acc-0.008
pped-0.008
TokenHe
Token ID1544
Feature activation-2.40e-02

Pos logit contributions
 So+0.006
So+0.005
 message+0.005
 says+0.005
 sounds+0.005
Neg logit contributions
years-0.006
eds-0.006
Acc-0.006
arter-0.006
pping-0.006
Token had
Token ID550
Feature activation-1.03e-01

Pos logit contributions
years+0.043
Acc+0.043
eds+0.041
umper+0.040
pped+0.040
Neg logit contributions
 So-0.032
 says-0.028
 message-0.027
 tickets-0.027
 sounds-0.027
Token accidentally
Token ID14716
Feature activation-2.73e-01

Pos logit contributions
years+0.187
Acc+0.185
eds+0.180
umper+0.175
pped+0.174
Neg logit contributions
 So-0.132
 message-0.117
 tickets-0.116
 y-0.115
 says-0.115
Token fallen
Token ID9292
Feature activation-1.54e-01

Pos logit contributions
years+0.459
Acc+0.443
umper+0.406
eds+0.406
mo+0.398
Neg logit contributions
 So-0.392
 says-0.371
 message-0.371
 sounds-0.366
 y-0.365
Token into
Token ID656
Feature activation-6.04e-03

Pos logit contributions
years+0.345
Acc+0.333
umper+0.308
eds+0.307
rou+0.301
Neg logit contributions
 So-0.148
 y-0.126
 sounds-0.125
 idea-0.124
 message-0.123
Token the
Token ID262
Feature activation+1.33e-02

Pos logit contributions
Acc+0.014
years+0.013
pping+0.012
umper+0.012
mo+0.012
Neg logit contributions
 y-0.006
 So-0.006
 tickets-0.005
 treats-0.005
 sounds-0.005
Token creek
Token ID39459
Feature activation+1.51e-02

Pos logit contributions
 So+0.014
 y+0.014
 message+0.014
 sounds+0.014
 treats+0.013
Neg logit contributions
years-0.028
Acc-0.027
pping-0.026
pped-0.025
umper-0.025
Token!
Token ID0
Feature activation-5.28e-02

Pos logit contributions
 So+0.018
 message+0.015
 says+0.015
 tickets+0.015
 idea+0.015
Neg logit contributions
years-0.029
Acc-0.029
eds-0.028
pped-0.028
umper-0.027
Token pepper
Token ID13385
Feature activation-4.68e-02

Pos logit contributions
 y+0.052
 So+0.049
 treats+0.048
 sounds+0.046
 says+0.044
Neg logit contributions
Acc-0.085
eds-0.083
years-0.083
pping-0.082
mo-0.081
Token in
Token ID287
Feature activation+1.40e-02

Pos logit contributions
years+0.081
Acc+0.080
eds+0.079
pped+0.077
pping+0.076
Neg logit contributions
 So-0.063
 message-0.055
 tickets-0.054
 says-0.054
 prayer-0.054
Token it
Token ID340
Feature activation-3.32e-02

Pos logit contributions
 So+0.018
 y+0.016
 message+0.016
 sounds+0.015
 idea+0.015
Neg logit contributions
years-0.027
Acc-0.027
eds-0.025
mo-0.025
umper-0.025
Token when
Token ID618
Feature activation+2.60e-02

Pos logit contributions
years+0.063
Acc+0.062
eds+0.061
pped+0.060
pping+0.059
Neg logit contributions
 So-0.039
 tickets-0.034
 message-0.034
 says-0.033
 idea-0.033
Token she
Token ID673
Feature activation-1.64e-02

Pos logit contributions
 So+0.026
 y+0.024
 goodbye+0.024
 hello+0.023
 treats+0.023
Neg logit contributions
eds-0.054
Acc-0.052
years-0.052
umper-0.052
ini-0.052
Token accidentally
Token ID14716
Feature activation-2.73e-01

Pos logit contributions
years+0.030
arter+0.029
Acc+0.029
eds+0.029
idget+0.029
Neg logit contributions
 So-0.021
 y-0.020
 says-0.019
 said-0.018
 sounds-0.018
Token spilled
Token ID31896
Feature activation-1.40e-01

Pos logit contributions
years+0.493
Acc+0.469
ini+0.433
umper+0.433
eds+0.426
Neg logit contributions
 So-0.360
 y-0.358
 says-0.353
 message-0.344
 sounds-0.338
Token some
Token ID617
Feature activation+3.00e-02

Pos logit contributions
years+0.265
Acc+0.260
eds+0.249
umper+0.243
pping+0.242
Neg logit contributions
 So-0.173
 y-0.158
 says-0.153
 treats-0.153
 message-0.153
Token on
Token ID319
Feature activation+8.14e-02

Pos logit contributions
 So+0.037
 y+0.034
 treats+0.033
 message+0.033
 says+0.033
Neg logit contributions
years-0.057
Acc-0.056
eds-0.053
umper-0.052
pping-0.052
Token her
Token ID607
Feature activation+3.65e-02

Pos logit contributions
 So+0.085
 y+0.082
 treats+0.074
 sounds+0.072
 says+0.069
Neg logit contributions
years-0.179
Acc-0.170
rou-0.167
umper-0.166
eds-0.165
Token hand
Token ID1021
Feature activation+3.78e-02

Pos logit contributions
 So+0.045
 message+0.040
 says+0.039
 tickets+0.039
 idea+0.038
Neg logit contributions
years-0.068
Acc-0.067
eds-0.065
pped-0.064
pping-0.063
Token were
Token ID547
Feature activation+2.92e-02

Pos logit contributions
 So+0.019
 message+0.016
 tickets+0.015
 says+0.015
 idea+0.015
Neg logit contributions
years-0.039
Acc-0.038
eds-0.037
pped-0.037
pping-0.036
Token too
Token ID1165
Feature activation-1.52e-02

Pos logit contributions
 y+0.033
 So+0.030
 says+0.029
 message+0.029
 tickets+0.029
Neg logit contributions
Acc-0.059
years-0.056
eds-0.056
umper-0.055
ini-0.054
Token sharp
Token ID7786
Feature activation-8.99e-02

Pos logit contributions
years+0.033
Acc+0.033
umper+0.032
eds+0.031
mo+0.031
Neg logit contributions
 y-0.015
 So-0.014
 idea-0.013
 says-0.013
 message-0.013
Token and
Token ID290
Feature activation-1.98e-02

Pos logit contributions
years+0.158
Acc+0.156
eds+0.152
pped+0.151
pping+0.150
Neg logit contributions
 So-0.117
 message-0.105
 tickets-0.102
 prayer-0.101
 idea-0.100
Token she
Token ID673
Feature activation-1.49e-02

Pos logit contributions
Acc+0.036
years+0.035
eds+0.034
umper+0.034
pping+0.034
Neg logit contributions
 So-0.026
 y-0.024
 says-0.024
 sounds-0.023
 message-0.023
Token accidentally
Token ID14716
Feature activation-2.73e-01

Pos logit contributions
Acc+0.029
years+0.028
umper+0.026
arter+0.026
eds+0.026
Neg logit contributions
 So-0.018
 says-0.017
 y-0.017
 message-0.016
 sounds-0.016
Token cut
Token ID2005
Feature activation-7.46e-02

Pos logit contributions
years+0.456
Acc+0.442
eds+0.413
umper+0.409
mo+0.397
Neg logit contributions
 So-0.399
 says-0.372
 message-0.366
 y-0.363
 tickets-0.362
Token off
Token ID572
Feature activation+1.14e-01

Pos logit contributions
Acc+0.139
years+0.136
eds+0.127
umper+0.125
arter+0.124
Neg logit contributions
 So-0.098
 says-0.086
 sounds-0.086
 y-0.085
 tickets-0.084
Token too
Token ID1165
Feature activation-1.07e-02

Pos logit contributions
 So+0.160
 message+0.140
 tickets+0.139
 sounds+0.139
 y+0.139
Neg logit contributions
years-0.195
Acc-0.195
eds-0.184
umper-0.181
pped-0.181
Token much
Token ID881
Feature activation-4.61e-02

Pos logit contributions
years+0.022
Acc+0.022
eds+0.021
umper+0.020
mo+0.020
Neg logit contributions
 So-0.012
 y-0.010
 message-0.010
 says-0.010
 idea-0.010
Token hair
Token ID4190
Feature activation+4.20e-02

Pos logit contributions
years+0.096
Acc+0.094
eds+0.091
umper+0.089
pped+0.089
Neg logit contributions
 So-0.049
 says-0.042
 message-0.041
 sounds-0.041
 tickets-0.040
Token with
Token ID351
Feature activation+4.80e-02

Pos logit contributions
years+0.235
Acc+0.230
eds+0.224
umper+0.218
mo+0.215
Neg logit contributions
 So-0.171
 idea-0.150
 y-0.150
 message-0.149
 sounds-0.148
Token his
Token ID465
Feature activation+3.00e-02

Pos logit contributions
 y+0.032
 snacks+0.030
 treats+0.028
 sounds+0.022
 tickets+0.022
Neg logit contributions
Acc-0.127
pping-0.124
arter-0.121
years-0.120
rou-0.119
Token toys
Token ID14958
Feature activation+5.87e-02

Pos logit contributions
 y+0.030
 treats+0.028
 snacks+0.027
 idea+0.026
 So+0.026
Neg logit contributions
years-0.068
Acc-0.067
pped-0.067
pping-0.067
umper-0.064
Token when
Token ID618
Feature activation+2.62e-02

Pos logit contributions
 So+0.071
 y+0.060
 message+0.060
 idea+0.060
 says+0.059
Neg logit contributions
years-0.113
Acc-0.112
eds-0.108
umper-0.104
pped-0.104
Token he
Token ID339
Feature activation-2.54e-02

Pos logit contributions
 y+0.017
 So+0.016
 treats+0.016
 snacks+0.016
 goodbye+0.016
Neg logit contributions
eds-0.063
years-0.062
umper-0.061
ini-0.061
rou-0.061
Token accidentally
Token ID14716
Feature activation-2.72e-01

Pos logit contributions
arter+0.041
Acc+0.041
years+0.040
eds+0.039
ini+0.038
Neg logit contributions
 So-0.037
 y-0.035
 says-0.035
 sounds-0.034
 said-0.034
Token knocked
Token ID13642
Feature activation-1.17e-01

Pos logit contributions
years+0.461
Acc+0.450
eds+0.409
umper+0.407
ini+0.402
Neg logit contributions
 So-0.387
 says-0.368
 message-0.365
 y-0.364
 sounds-0.363
Token over
Token ID625
Feature activation-1.09e-01

Pos logit contributions
Acc+0.213
years+0.209
arter+0.198
ini+0.194
rou+0.193
Neg logit contributions
 So-0.155
 says-0.144
 y-0.141
 treats-0.141
 tickets-0.141
Token a
Token ID257
Feature activation+7.53e-02

Pos logit contributions
Acc+0.226
years+0.224
eds+0.219
pping+0.215
umper+0.211
Neg logit contributions
 So-0.118
 y-0.104
 tickets-0.104
 treats-0.104
 message-0.099
Token cup
Token ID6508
Feature activation+4.76e-02

Pos logit contributions
 So+0.090
 message+0.086
 y+0.084
 treats+0.080
 sounds+0.080
Neg logit contributions
years-0.147
Acc-0.144
pped-0.137
pping-0.136
eds-0.135
Token of
Token ID286
Feature activation-4.56e-02

Pos logit contributions
 So+0.057
 message+0.050
 tickets+0.050
 says+0.049
 idea+0.049
Neg logit contributions
years-0.088
Acc-0.087
eds-0.085
pped-0.085
pping-0.084
TokenOne
Token ID3198
Feature activation-9.89e-02

Pos logit contributions
 So+0.024
 message+0.020
So+0.020
 says+0.020
 tickets+0.020
Neg logit contributions
years-0.025
Acc-0.024
eds-0.024
arter-0.024
pping-0.024
Token day
Token ID1110
Feature activation+9.85e-03

Pos logit contributions
years+0.169
Acc+0.166
eds+0.160
pped+0.156
pping+0.155
Neg logit contributions
 So-0.138
 message-0.122
 says-0.120
 tickets-0.120
 idea-0.118
Token,
Token ID11
Feature activation-4.98e-02

Pos logit contributions
 So+0.014
 message+0.013
 sounds+0.013
 funn+0.013
 idea+0.013
Neg logit contributions
years-0.015
pped-0.015
 accidentally-0.014
mo-0.014
grown-0.013
Token Tim
Token ID5045
Feature activation-1.19e-01

Pos logit contributions
years+0.065
Acc+0.061
pped+0.059
umper+0.058
mo+0.057
Neg logit contributions
 So-0.085
 message-0.079
 says-0.078
 idea-0.078
 sounds-0.078
Tokenmy
Token ID1820
Feature activation-2.16e-01

Pos logit contributions
years+0.111
Acc+0.105
eds+0.090
pped+0.089
arter+0.089
Neg logit contributions
 So-0.275
 says-0.259
 sounds-0.249
 idea-0.246
 message-0.242
Token accidentally
Token ID14716
Feature activation-2.72e-01

Pos logit contributions
years+0.450
Acc+0.443
eds+0.427
pped+0.420
pping+0.418
Neg logit contributions
 So-0.230
 says-0.195
 message-0.189
 sounds-0.187
 idea-0.184
Token spilled
Token ID31896
Feature activation-1.40e-01

Pos logit contributions
years+0.464
Acc+0.448
eds+0.420
umper+0.408
arter+0.405
Neg logit contributions
 So-0.388
 says-0.363
 message-0.360
 y-0.357
 sounds-0.351
Token some
Token ID617
Feature activation+2.87e-02

Pos logit contributions
years+0.275
Acc+0.272
eds+0.261
umper+0.253
pping+0.253
Neg logit contributions
 So-0.165
 y-0.152
 treats-0.147
 says-0.145
 sounds-0.143
Token water
Token ID1660
Feature activation+5.81e-02

Pos logit contributions
 So+0.034
 y+0.033
 treats+0.032
 idea+0.030
 says+0.030
Neg logit contributions
years-0.057
Acc-0.056
eds-0.053
umper-0.053
pping-0.053
Token on
Token ID319
Feature activation+8.03e-02

Pos logit contributions
 So+0.069
 says+0.060
 message+0.059
 y+0.059
 sounds+0.058
Neg logit contributions
years-0.113
Acc-0.111
eds-0.105
umper-0.103
pped-0.102
Token his
Token ID465
Feature activation+2.99e-02

Pos logit contributions
 So+0.080
 y+0.064
 sounds+0.060
 idea+0.056
 treats+0.056
Neg logit contributions
years-0.198
Acc-0.188
rou-0.186
umper-0.174
eds-0.173
Token.
Token ID13
Feature activation+8.33e-03

Pos logit contributions
years+0.026
Acc+0.025
eds+0.025
pping+0.024
pped+0.024
Neg logit contributions
 So-0.018
 message-0.016
 says-0.016
 tickets-0.016
 prayer-0.015
Token One
Token ID1881
Feature activation-7.56e-02

Pos logit contributions
 So+0.011
 says+0.007
 idea+0.007
 y+0.007
 sounds+0.006
Neg logit contributions
eds-0.019
years-0.019
pped-0.019
pping-0.018
Acc-0.018
Token day
Token ID1110
Feature activation+2.33e-02

Pos logit contributions
years+0.126
Acc+0.124
eds+0.120
pped+0.117
pping+0.116
Neg logit contributions
 So-0.108
 message-0.096
 says-0.095
 tickets-0.095
 sounds-0.093
Token,
Token ID11
Feature activation-4.43e-02

Pos logit contributions
 So+0.027
 message+0.024
 tickets+0.023
 says+0.023
 prayer+0.023
Neg logit contributions
years-0.045
Acc-0.044
pped-0.043
eds-0.042
pping-0.042
Token he
Token ID339
Feature activation-1.97e-02

Pos logit contributions
years+0.066
eds+0.065
Acc+0.065
pping+0.061
umper+0.061
Neg logit contributions
 So-0.075
 y-0.068
 treats-0.066
 idea-0.066
 says-0.065
Token accidentally
Token ID14716
Feature activation-2.72e-01

Pos logit contributions
Acc+0.036
years+0.036
eds+0.034
arter+0.033
pping+0.033
Neg logit contributions
 So-0.027
 says-0.024
 sounds-0.023
 message-0.023
 y-0.023
Token broke
Token ID6265
Feature activation-9.70e-02

Pos logit contributions
years+0.442
Acc+0.433
eds+0.408
umper+0.393
mo+0.391
Neg logit contributions
 So-0.407
 says-0.376
 message-0.376
 tickets-0.371
 y-0.366
Token his
Token ID465
Feature activation+3.19e-02

Pos logit contributions
Acc+0.206
years+0.199
eds+0.186
arter+0.185
umper+0.183
Neg logit contributions
 So-0.105
 tickets-0.093
 y-0.093
 fer-0.092
 sounds-0.090
Token motorcycle
Token ID18757
Feature activation-3.33e-02

Pos logit contributions
 So+0.047
 message+0.041
 says+0.041
 prayer+0.040
 tickets+0.040
Neg logit contributions
years-0.050
Acc-0.049
eds-0.049
pped-0.047
pping-0.047
Token into
Token ID656
Feature activation-1.36e-02

Pos logit contributions
pping+0.051
pped+0.051
eds+0.050
years+0.049
Acc+0.048
Neg logit contributions
 So-0.046
 tickets-0.045
 hello-0.044
 prayer-0.043
 says-0.043
Token two
Token ID734
Feature activation+6.59e-02

Pos logit contributions
Acc+0.032
mo+0.030
pping+0.030
years+0.030
arter+0.029
Neg logit contributions
 So-0.012
 y-0.011
 sounds-0.010
 words-0.010
 message-0.010
Token his
Token ID465
Feature activation+3.00e-02

Pos logit contributions
 y+0.032
 snacks+0.030
 treats+0.028
 sounds+0.022
 tickets+0.022
Neg logit contributions
Acc-0.127
pping-0.124
arter-0.121
years-0.120
rou-0.119
Token toy
Token ID13373
Feature activation-4.30e-02

Pos logit contributions
 y+0.030
 treats+0.028
 snacks+0.027
 idea+0.026
 So+0.026
Neg logit contributions
years-0.068
Acc-0.067
pped-0.067
pping-0.067
umper-0.064
Token car
Token ID1097
Feature activation-2.86e-02

Pos logit contributions
Acc+0.093
years+0.092
pped+0.085
pping+0.083
izz+0.083
Neg logit contributions
 So-0.047
 sounds-0.039
 chest-0.039
 message-0.039
 y-0.038
Token when
Token ID618
Feature activation+2.64e-02

Pos logit contributions
years+0.052
Acc+0.051
eds+0.049
pped+0.049
pping+0.048
Neg logit contributions
 So-0.037
 message-0.032
 says-0.032
 tickets-0.032
 idea-0.031
Token he
Token ID339
Feature activation-2.46e-02

Pos logit contributions
 So+0.019
 y+0.018
 treats+0.017
 goodbye+0.017
 sounds+0.017
Neg logit contributions
years-0.061
eds-0.061
ini-0.060
Acc-0.060
umper-0.060
Token accidentally
Token ID14716
Feature activation-2.72e-01

Pos logit contributions
Acc+0.038
arter+0.038
years+0.037
eds+0.036
ini+0.035
Neg logit contributions
 So-0.039
 says-0.035
 y-0.035
 sounds-0.034
 said-0.033
Token knocked
Token ID13642
Feature activation-1.18e-01

Pos logit contributions
years+0.451
Acc+0.438
eds+0.402
umper+0.398
mo+0.391
Neg logit contributions
 So-0.401
 says-0.379
 message-0.374
 sounds-0.370
 tickets-0.370
Token it
Token ID340
Feature activation-2.74e-02

Pos logit contributions
Acc+0.238
years+0.234
arter+0.224
ini+0.219
rou+0.215
Neg logit contributions
 So-0.136
 says-0.129
 tickets-0.121
 y-0.120
 treats-0.119
Token into
Token ID656
Feature activation-7.50e-03

Pos logit contributions
years+0.048
Acc+0.047
eds+0.046
pped+0.045
umper+0.044
Neg logit contributions
 So-0.037
 message-0.033
 says-0.032
 idea-0.032
 tickets-0.032
Token the
Token ID262
Feature activation+1.55e-02

Pos logit contributions
Acc+0.016
years+0.016
mo+0.016
pping+0.016
eds+0.016
Neg logit contributions
 So-0.007
 y-0.006
 message-0.006
 tickets-0.006
 treats-0.006
Token sink
Token ID14595
Feature activation-1.17e-02

Pos logit contributions
 So+0.020
 message+0.019
 sounds+0.018
 y+0.018
 says+0.018
Neg logit contributions
years-0.029
Acc-0.028
pped-0.027
pping-0.027
eds-0.026
Token
Token ID198
Feature activation+1.23e-02

Pos logit contributions
 So+0.010
 says+0.008
 sounds+0.007
 hello+0.007
 tickets+0.007
Neg logit contributions
years-0.019
eds-0.019
pped-0.019
pping-0.019
Acc-0.019
Token
Token ID198
Feature activation+1.19e-02

Pos logit contributions
 So+0.011
 says+0.009
So+0.008
 sounds+0.006
 tickets+0.006
Neg logit contributions
years-0.031
arter-0.031
pping-0.030
Acc-0.030
umper-0.029
TokenSuddenly
Token ID38582
Feature activation-5.38e-02

Pos logit contributions
 So+0.014
So+0.014
ww+0.011
 sounds+0.011
 says+0.011
Neg logit contributions
arter-0.024
years-0.023
umper-0.023
mo-0.022
eds-0.022
Token,
Token ID11
Feature activation-4.21e-02

Pos logit contributions
years+0.088
pped+0.086
Acc+0.084
mo+0.084
eds+0.083
Neg logit contributions
 So-0.071
 message-0.063
 idea-0.061
 funn-0.061
 list-0.060
Token Lily
Token ID20037
Feature activation-8.11e-02

Pos logit contributions
years+0.069
Acc+0.067
pped+0.065
eds+0.063
mo+0.063
Neg logit contributions
 So-0.060
 message-0.052
 says-0.051
 idea-0.051
 sounds-0.050
Token accidentally
Token ID14716
Feature activation-2.72e-01

Pos logit contributions
Acc+0.164
years+0.162
arter+0.152
eds+0.151
pping+0.148
Neg logit contributions
 So-0.095
 says-0.085
 sounds-0.081
 message-0.079
 y-0.076
Token knocked
Token ID13642
Feature activation-1.19e-01

Pos logit contributions
years+0.415
Acc+0.406
eds+0.380
umper+0.373
pping+0.364
Neg logit contributions
 So-0.427
 message-0.399
 says-0.398
 tickets-0.395
 y-0.394
Token over
Token ID625
Feature activation-1.10e-01

Pos logit contributions
Acc+0.215
years+0.211
arter+0.204
ini+0.201
rou+0.196
Neg logit contributions
 So-0.160
 says-0.150
 tickets-0.145
 y-0.140
 treats-0.138
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
Acc+0.216
years+0.212
eds+0.209
pping+0.206
pped+0.202
Neg logit contributions
 So-0.131
 tickets-0.116
 y-0.114
 treats-0.113
 message-0.113
Token te
Token ID573
Feature activation+1.05e-01

Pos logit contributions
 So+0.015
 message+0.014
 tickets+0.013
 y+0.013
 says+0.013
Neg logit contributions
years-0.022
Acc-0.021
eds-0.020
pped-0.020
pping-0.020
Tokenap
Token ID499
Feature activation+7.00e-02

Pos logit contributions
 So+0.156
 message+0.139
 says+0.138
 tickets+0.137
 y+0.136
Neg logit contributions
years-0.170
Acc-0.165
eds-0.163
pped-0.156
pping-0.156
Token toys
Token ID14958
Feature activation+5.92e-02

Pos logit contributions
 So+0.041
 y+0.039
 treats+0.037
 idea+0.037
 message+0.036
Neg logit contributions
years-0.081
Acc-0.080
pped-0.079
pping-0.078
umper-0.077
Token,
Token ID11
Feature activation-4.65e-02

Pos logit contributions
 So+0.079
 message+0.070
 says+0.069
 tickets+0.069
 prayer+0.068
Neg logit contributions
years-0.104
Acc-0.103
eds-0.099
pped-0.098
pping-0.097
Token but
Token ID475
Feature activation+4.10e-02

Pos logit contributions
years+0.077
eds+0.074
Acc+0.073
rou+0.070
umper+0.070
Neg logit contributions
 So-0.069
 y-0.066
 treats-0.063
 sounds-0.063
 hello-0.061
Token sometimes
Token ID3360
Feature activation-6.35e-02

Pos logit contributions
 So+0.049
 says+0.045
 y+0.045
 sounds+0.043
 idea+0.043
Neg logit contributions
eds-0.079
years-0.077
Acc-0.076
umper-0.075
pping-0.074
Token she
Token ID673
Feature activation-1.45e-02

Pos logit contributions
years+0.100
Acc+0.099
eds+0.095
pping+0.094
pped+0.093
Neg logit contributions
 So-0.090
 tickets-0.082
 message-0.082
 replies-0.078
 idea-0.077
Token accidentally
Token ID14716
Feature activation-2.72e-01

Pos logit contributions
years+0.029
Acc+0.029
eds+0.028
arter+0.027
arted+0.027
Neg logit contributions
 So-0.016
 says-0.015
 said-0.014
's-0.014
 idea-0.014
Token broke
Token ID6265
Feature activation-9.66e-02

Pos logit contributions
years+0.478
Acc+0.465
eds+0.428
umper+0.426
ini+0.423
Neg logit contributions
 So-0.368
 sounds-0.352
 says-0.352
 message-0.349
 tickets-0.343
Token them
Token ID606
Feature activation-1.41e-02

Pos logit contributions
Acc+0.196
years+0.187
umper+0.178
pping+0.178
arter+0.178
Neg logit contributions
 So-0.106
 y-0.103
 treats-0.101
 tickets-0.099
 sounds-0.098
Token because
Token ID780
Feature activation-6.94e-02

Pos logit contributions
years+0.025
Acc+0.025
eds+0.025
pping+0.025
pped+0.024
Neg logit contributions
 So-0.017
 message-0.015
 tickets-0.015
 prayer-0.015
 says-0.015
Token they
Token ID484
Feature activation+7.58e-02

Pos logit contributions
Acc+0.164
pping+0.159
eds+0.159
umper+0.158
years+0.154
Neg logit contributions
 tickets-0.052
 y-0.052
 treats-0.048
 So-0.048
 sounds-0.046
Token were
Token ID547
Feature activation+3.16e-02

Pos logit contributions
 So+0.077
 y+0.070
 sounds+0.068
 says+0.068
 idea+0.067
Neg logit contributions
Acc-0.159
years-0.156
eds-0.150
umper-0.147
rou-0.147
Token on
Token ID319
Feature activation+8.41e-02

Pos logit contributions
years+0.142
Acc+0.141
eds+0.138
pped+0.135
pping+0.135
Neg logit contributions
 So-0.115
 message-0.100
 tickets-0.099
 says-0.099
 prayer-0.097
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
 So+0.116
 message+0.103
 tickets+0.103
 sounds+0.102
 idea+0.101
Neg logit contributions
years-0.152
Acc-0.148
eds-0.141
umper-0.138
rou-0.136
Token wall
Token ID3355
Feature activation-1.52e-02

Pos logit contributions
 So+0.021
 message+0.019
 tickets+0.018
 says+0.018
 sounds+0.018
Neg logit contributions
years-0.028
Acc-0.027
eds-0.026
pped-0.026
umper-0.025
Token.
Token ID13
Feature activation+7.28e-03

Pos logit contributions
years+0.030
Acc+0.030
eds+0.028
umper+0.028
pped+0.028
Neg logit contributions
 So-0.018
 says-0.015
 message-0.015
 idea-0.015
 sounds-0.015
Token Tim
Token ID5045
Feature activation-1.19e-01

Pos logit contributions
 So+0.009
 says+0.007
 tickets+0.007
 message+0.007
 sounds+0.007
Neg logit contributions
years-0.015
Acc-0.014
eds-0.014
pped-0.014
pping-0.014
Token accidentally
Token ID14716
Feature activation-2.71e-01

Pos logit contributions
years+0.185
Acc+0.180
arter+0.174
eds+0.164
pped+0.160
Neg logit contributions
 So-0.200
 says-0.187
 sounds-0.173
 replies-0.169
 shouts-0.168
Token hit
Token ID2277
Feature activation-1.59e-01

Pos logit contributions
years+0.435
Acc+0.419
umper+0.390
eds+0.389
mo+0.384
Neg logit contributions
 So-0.412
 says-0.387
 message-0.385
 y-0.380
 tickets-0.378
Token the
Token ID262
Feature activation+1.39e-02

Pos logit contributions
Acc+0.349
years+0.346
rou+0.318
pping+0.317
ini+0.316
Neg logit contributions
 So-0.162
 sounds-0.154
 says-0.153
 y-0.148
 words-0.142
Token ornament
Token ID45618
Feature activation-8.38e-02

Pos logit contributions
 So+0.019
 message+0.018
 y+0.018
 idea+0.017
 sounds+0.017
Neg logit contributions
years-0.025
Acc-0.024
eds-0.023
umper-0.023
pping-0.022
Token,
Token ID11
Feature activation-4.33e-02

Pos logit contributions
Acc+0.134
eds+0.132
years+0.132
pping+0.131
pped+0.129
Neg logit contributions
 So-0.117
 tickets-0.107
 message-0.104
 says-0.104
 prayer-0.102
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
years+0.088
eds+0.084
umper+0.082
Acc+0.082
arter+0.081
Neg logit contributions
 So-0.048
 y-0.044
 says-0.043
 sounds-0.042
 hello-0.040
Token smiled
Token ID13541
Feature activation-3.54e-03

Pos logit contributions
Acc+0.129
years+0.123
pped+0.121
umper+0.117
pping+0.117
Neg logit contributions
 tickets-0.079
 message-0.075
 idea-0.074
 passports-0.074
 So-0.074
Token and
Token ID290
Feature activation-1.98e-02

Pos logit contributions
Acc+0.006
years+0.006
eds+0.005
pped+0.005
umper+0.005
Neg logit contributions
 So-0.005
 message-0.005
 tickets-0.005
 says-0.005
 idea-0.005
Token said
Token ID531
Feature activation+1.61e-01

Pos logit contributions
Acc+0.036
pped+0.033
 accidentally+0.032
umper+0.032
years+0.031
Neg logit contributions
 preparations-0.023
 route-0.023
 tickets-0.023
 idea-0.023
 list-0.022
Token,
Token ID11
Feature activation-4.33e-02

Pos logit contributions
 So+0.199
 message+0.189
 tickets+0.184
 sounds+0.182
 fer+0.181
Neg logit contributions
Acc-0.283
years-0.278
pped-0.269
pping-0.263
rou-0.256
Token "
Token ID366
Feature activation-9.87e-02

Pos logit contributions
years+0.086
eds+0.086
umper+0.081
Acc+0.080
pped+0.080
Neg logit contributions
 So-0.051
 hello-0.044
 says-0.043
 prayer-0.042
 goodbye-0.042
TokenYes
Token ID5297
Feature activation+2.18e-01

Pos logit contributions
Acc+0.165
years+0.148
Sometimes+0.142
umper+0.133
Playing+0.129
Neg logit contributions
 tickets-0.144
 idea-0.142
 passports-0.137
 list-0.137
 message-0.134
Token,
Token ID11
Feature activation-4.35e-02

Pos logit contributions
 So+0.218
 says+0.180
 message+0.178
 tickets+0.175
 idea+0.172
Neg logit contributions
years-0.468
Acc-0.461
eds-0.448
pped-0.438
umper-0.437
Token Ben
Token ID3932
Feature activation-3.48e-03

Pos logit contributions
Acc+0.069
years+0.067
umper+0.065
pping+0.064
pped+0.064
Neg logit contributions
 So-0.061
 idea-0.054
 sounds-0.053
 list-0.052
 replies-0.052
Token.
Token ID13
Feature activation+1.72e-03

Pos logit contributions
Acc+0.005
eds+0.005
umper+0.005
pping+0.005
pped+0.005
Neg logit contributions
 tickets-0.005
 So-0.005
 Fer-0.005
 fer-0.005
 passports-0.005
Token We
Token ID775
Feature activation+5.60e-02

Pos logit contributions
 So+0.002
 message+0.002
 tickets+0.002
 idea+0.002
 y+0.002
Neg logit contributions
Acc-0.003
years-0.003
eds-0.003
pped-0.003
umper-0.003
Token can
Token ID460
Feature activation-2.59e-02

Pos logit contributions
 So+0.068
 says+0.059
 message+0.058
 idea+0.058
 tickets+0.056
Neg logit contributions
years-0.109
Acc-0.107
eds-0.105
pped-0.101
umper-0.100
Token was
Token ID373
Feature activation-5.02e-02

Pos logit contributions
years+0.034
Acc+0.033
eds+0.032
pped+0.032
pping+0.031
Neg logit contributions
 So-0.029
 tickets-0.026
 message-0.026
 says-0.026
 idea-0.025
Token time
Token ID640
Feature activation+8.66e-02

Pos logit contributions
years+0.096
Acc+0.095
eds+0.092
pped+0.090
umper+0.089
Neg logit contributions
 So-0.060
 message-0.053
 idea-0.052
 says-0.051
 y-0.051
Token to
Token ID284
Feature activation-1.89e-03

Pos logit contributions
 So+0.124
 message+0.110
 says+0.108
 idea+0.108
 sounds+0.107
Neg logit contributions
years-0.142
Acc-0.140
eds-0.136
pping-0.132
pped-0.132
Token relax
Token ID8960
Feature activation-3.68e-03

Pos logit contributions
pping+0.004
pped+0.004
Acc+0.003
years+0.003
eds+0.003
Neg logit contributions
 So-0.002
 says-0.002
 tickets-0.002
 message-0.002
 sounds-0.002
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
years+0.007
Acc+0.007
eds+0.007
pped+0.006
pping+0.006
Neg logit contributions
 So-0.005
 message-0.004
 says-0.004
 tickets-0.004
 prayer-0.004
Token So
Token ID1406
Feature activation+2.27e-01

Pos logit contributions
 message+0.037
 tickets+0.037
 fer+0.036
 y+0.036
 So+0.036
Neg logit contributions
years-0.027
Acc-0.026
eds-0.023
umper-0.023
pping-0.023
Token he
Token ID339
Feature activation-9.47e-03

Pos logit contributions
 tickets+0.431
 bisc+0.426
 Fer+0.407
 fer+0.400
 cand+0.400
Neg logit contributions
 accidentally-0.217
years-0.203
pped-0.190
pping-0.189
 flooding-0.189
Token sat
Token ID3332
Feature activation-5.61e-02

Pos logit contributions
years+0.015
Acc+0.014
pped+0.014
mo+0.014
pping+0.014
Neg logit contributions
 So-0.014
 message-0.012
 tickets-0.012
 idea-0.012
 prayer-0.012
Token down
Token ID866
Feature activation+2.22e-02

Pos logit contributions
years+0.112
Acc+0.109
arter+0.107
eds+0.107
umper+0.106
Neg logit contributions
 So-0.062
 says-0.053
 goodbye-0.051
 y-0.051
 hello-0.051
Token on
Token ID319
Feature activation+8.93e-02

Pos logit contributions
 So+0.028
 says+0.024
 tickets+0.024
 message+0.024
 idea+0.024
Neg logit contributions
years-0.041
Acc-0.040
eds-0.040
umper-0.039
pped-0.038
Token the
Token ID262
Feature activation+1.88e-02

Pos logit contributions
 So+0.113
 tickets+0.099
 message+0.098
 y+0.097
 sounds+0.096
Neg logit contributions
years-0.166
Acc-0.164
eds-0.160
umper-0.155
pped-0.154
Token.
Token ID13
Feature activation-5.01e-03

Pos logit contributions
 Fer+0.085
 fer+0.084
 AW+0.081
ww+0.080
br+0.079
Neg logit contributions
eds-0.053
Acc-0.053
 raining-0.052
 weeping-0.050
my-0.049
Token They
Token ID1119
Feature activation+6.97e-02

Pos logit contributions
Acc+0.005
 raining+0.005
ores+0.005
 flooding+0.005
years+0.005
Neg logit contributions
 fer-0.009
 message-0.009
 medal-0.009
 tale-0.009
 Fer-0.008
Token say
Token ID910
Feature activation+1.84e-01

Pos logit contributions
 So+0.087
 message+0.079
 tickets+0.076
 treats+0.075
 sounds+0.075
Neg logit contributions
Acc-0.118
years-0.117
pping-0.115
pped-0.114
umper-0.113
Token,
Token ID11
Feature activation-4.86e-02

Pos logit contributions
 So+0.293
 funn+0.285
sc+0.283
 bisc+0.281
 Fer+0.275
Neg logit contributions
years-0.205
Acc-0.202
 weeping-0.186
ful-0.183
lee-0.176
Token "
Token ID366
Feature activation-1.01e-01

Pos logit contributions
years+0.079
Acc+0.079
eds+0.075
pped+0.073
pping+0.073
Neg logit contributions
 So-0.071
 message-0.064
 tickets-0.063
 says-0.062
 idea-0.062
TokenYes
Token ID5297
Feature activation+2.17e-01

Pos logit contributions
Acc+0.139
years+0.127
Sometimes+0.121
Playing+0.110
pped+0.107
Neg logit contributions
 idea-0.177
 list-0.170
 chest-0.168
 message-0.166
 tickets-0.162
Token,
Token ID11
Feature activation-4.86e-02

Pos logit contributions
 So+0.254
 message+0.229
 idea+0.219
 tickets+0.218
 says+0.218
Neg logit contributions
years-0.399
Acc-0.394
eds-0.388
pped-0.386
pping-0.384
Token Mom
Token ID11254
Feature activation-6.09e-02

Pos logit contributions
Acc+0.066
years+0.065
pped+0.061
pping+0.060
rou+0.059
Neg logit contributions
 So-0.076
 idea-0.066
 tale-0.066
 replies-0.065
 sounds-0.065
Token,
Token ID11
Feature activation-4.89e-02

Pos logit contributions
eds+0.093
rou+0.090
years+0.089
pping+0.089
Acc+0.088
Neg logit contributions
 So-0.083
 message-0.080
 fer-0.076
 list-0.076
 tickets-0.075
Token let
Token ID1309
Feature activation+5.78e-02

Pos logit contributions
Acc+0.072
years+0.072
pping+0.069
pped+0.068
rou+0.065
Neg logit contributions
 So-0.072
 idea-0.065
 replies-0.063
 message-0.062
 tale-0.060
Token's
Token ID338
Feature activation+2.76e-02

Pos logit contributions
 So+0.050
 prayer+0.047
 message+0.044
 fer+0.043
 tale+0.043
Neg logit contributions
pped-0.119
eds-0.119
pping-0.117
Acc-0.115
umper-0.114
Token to
Token ID284
Feature activation-3.88e-03

Pos logit contributions
 So+0.040
 message+0.036
 idea+0.036
 sounds+0.036
 says+0.035
Neg logit contributions
years-0.044
Acc-0.043
eds-0.042
pping-0.041
pped-0.041
Token come
Token ID1282
Feature activation+7.36e-02

Pos logit contributions
years+0.007
Acc+0.007
eds+0.007
pped+0.007
pping+0.007
Neg logit contributions
 So-0.005
 message-0.004
 says-0.004
 tickets-0.004
 idea-0.004
Token over
Token ID625
Feature activation-1.07e-01

Pos logit contributions
 So+0.070
 look+0.068
 hello+0.064
 message+0.063
 goodbye+0.061
Neg logit contributions
years-0.164
Acc-0.161
pped-0.150
umper-0.149
pping-0.149
Token and
Token ID290
Feature activation-8.87e-03

Pos logit contributions
Acc+0.218
years+0.216
eds+0.213
umper+0.211
mo+0.210
Neg logit contributions
 So-0.117
 tickets-0.101
 y-0.100
 says-0.099
 message-0.096
Token play
Token ID711
Feature activation-5.40e-02

Pos logit contributions
years+0.016
Acc+0.015
eds+0.015
pped+0.015
umper+0.015
Neg logit contributions
 So-0.012
 says-0.011
 message-0.011
 y-0.011
 treats-0.011
Token games
Token ID1830
Feature activation+2.06e-01

Pos logit contributions
years+0.088
Acc+0.088
pped+0.085
pping+0.084
eds+0.084
Neg logit contributions
 So-0.076
 prayer-0.070
 says-0.069
 tickets-0.067
 message-0.067
Token with
Token ID351
Feature activation+5.12e-02

Pos logit contributions
 So+0.276
 message+0.241
 says+0.239
 prayer+0.238
 sounds+0.233
Neg logit contributions
years-0.359
Acc-0.351
eds-0.340
pped-0.339
pping-0.339
Token her
Token ID607
Feature activation+4.17e-02

Pos logit contributions
 So+0.060
 y+0.058
 treats+0.057
 tickets+0.055
 snacks+0.054
Neg logit contributions
Acc-0.101
years-0.098
pping-0.097
umper-0.095
eds-0.095
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
 So+0.053
 message+0.047
 says+0.046
 y+0.046
 idea+0.046
Neg logit contributions
years-0.078
Acc-0.077
eds-0.074
pped-0.074
umper-0.073
Token When
Token ID1649
Feature activation+5.93e-02

Pos logit contributions
 So+0.022
 says+0.018
 sounds+0.017
 idea+0.016
 y+0.016
Neg logit contributions
years-0.032
eds-0.032
Acc-0.031
pped-0.031
pping-0.031
Token they
Token ID484
Feature activation+8.50e-02

Pos logit contributions
 So+0.090
 message+0.080
 idea+0.077
 says+0.077
 replies+0.076
Neg logit contributions
years-0.092
Acc-0.091
pped-0.088
eds-0.086
pping-0.084
Token can
Token ID460
Feature activation-2.28e-02

Pos logit contributions
eds+0.114
mo+0.113
Acc+0.111
years+0.111
pping+0.110
Neg logit contributions
 So-0.081
 list-0.072
 tickets-0.071
 idea-0.070
 goodbye-0.069
Token take
Token ID1011
Feature activation+3.30e-02

Pos logit contributions
years+0.046
Acc+0.046
eds+0.044
pped+0.042
umper+0.042
Neg logit contributions
 So-0.025
 message-0.023
 says-0.022
 sounds-0.022
 tickets-0.022
Token you
Token ID345
Feature activation+9.50e-02

Pos logit contributions
 So+0.048
 prayer+0.042
 idea+0.042
 list+0.042
 Fer+0.042
Neg logit contributions
years-0.051
eds-0.050
pped-0.048
umper-0.046
Acc-0.045
Token to
Token ID284
Feature activation-7.12e-03

Pos logit contributions
 So+0.135
 idea+0.117
 says+0.116
 prayer+0.115
 message+0.115
Neg logit contributions
years-0.154
eds-0.154
Acc-0.150
pped-0.146
pping-0.146
Token any
Token ID597
Feature activation-4.99e-02

Pos logit contributions
years+0.012
Acc+0.012
eds+0.012
pped+0.011
pping+0.011
Neg logit contributions
 So-0.010
 message-0.009
 says-0.009
 tickets-0.009
 idea-0.009
Token story
Token ID1621
Feature activation+1.59e-01

Pos logit contributions
years+0.083
eds+0.081
Acc+0.078
pped+0.077
umper+0.076
Neg logit contributions
 So-0.070
 says-0.060
 tickets-0.060
 message-0.058
 hello-0.058
Token you
Token ID345
Feature activation+9.51e-02

Pos logit contributions
 So+0.211
 fer+0.197
 funn+0.192
 prayer+0.188
 Fer+0.186
Neg logit contributions
eds-0.241
years-0.239
pped-0.234
pping-0.232
ind-0.226
Token want
Token ID765
Feature activation-1.09e-02

Pos logit contributions
 So+0.128
 message+0.113
 tickets+0.111
 says+0.111
 idea+0.110
Neg logit contributions
years-0.166
Acc-0.163
eds-0.158
pped-0.155
pping-0.153
Token.
Token ID13
Feature activation+5.44e-03

Pos logit contributions
years+0.021
Acc+0.021
eds+0.021
pped+0.020
pping+0.020
Neg logit contributions
 So-0.012
 message-0.010
 idea-0.010
 says-0.010
 tickets-0.010
Token But
Token ID887
Feature activation+5.42e-02

Pos logit contributions
 So+0.006
 message+0.006
 tickets+0.006
 idea+0.006
 says+0.006
Neg logit contributions
years-0.010
Acc-0.010
eds-0.010
pped-0.010
umper-0.010
Token you
Token ID345
Feature activation+9.54e-02

Pos logit contributions
 So+0.069
 message+0.060
 tickets+0.059
 says+0.059
 idea+0.059
Neg logit contributions
years-0.098
Acc-0.096
eds-0.093
pped-0.092
pping-0.092
Token a
Token ID257
Feature activation+8.00e-02

Pos logit contributions
 So+0.026
 message+0.022
 tickets+0.022
 says+0.022
 sounds+0.022
Neg logit contributions
years-0.046
Acc-0.046
eds-0.045
pped-0.044
pping-0.044
Token small
Token ID1402
Feature activation-8.23e-02

Pos logit contributions
 So+0.080
 y+0.076
 sounds+0.071
 message+0.069
 says+0.068
Neg logit contributions
pped-0.177
pping-0.174
years-0.174
umper-0.170
Acc-0.168
Token village
Token ID7404
Feature activation+9.20e-02

Pos logit contributions
years+0.133
Acc+0.133
pped+0.130
pping+0.127
umper+0.122
Neg logit contributions
 So-0.130
 message-0.115
 sounds-0.115
 says-0.113
 idea-0.113
Token near
Token ID1474
Feature activation-2.60e-02

Pos logit contributions
 So+0.092
 message+0.077
 says+0.076
 tickets+0.075
 sounds+0.075
Neg logit contributions
years-0.194
Acc-0.194
umper-0.187
pped-0.187
eds-0.186
Token the
Token ID262
Feature activation+1.75e-02

Pos logit contributions
years+0.043
Acc+0.042
eds+0.041
pped+0.040
pping+0.040
Neg logit contributions
 So-0.037
 message-0.033
 says-0.033
 tickets-0.032
 idea-0.032
Token forest
Token ID8222
Feature activation+5.31e-03

Pos logit contributions
 So+0.023
 sounds+0.022
 message+0.022
 y+0.022
 tickets+0.022
Neg logit contributions
years-0.031
Acc-0.031
pped-0.031
pping-0.030
eds-0.029
Token.
Token ID13
Feature activation+1.16e-03

Pos logit contributions
 So+0.006
 message+0.005
 says+0.005
 tickets+0.005
 idea+0.005
Neg logit contributions
years-0.011
Acc-0.011
eds-0.010
pped-0.010
umper-0.010
Token They
Token ID1119
Feature activation+7.54e-02

Pos logit contributions
 So+0.002
 message+0.002
 tickets+0.001
 says+0.001
 idea+0.001
Neg logit contributions
years-0.002
Acc-0.002
eds-0.002
pped-0.002
umper-0.002
Token liked
Token ID8288
Feature activation-8.50e-02

Pos logit contributions
 So+0.095
 says+0.088
 y+0.084
 sounds+0.082
 idea+0.082
Neg logit contributions
eds-0.147
Acc-0.143
years-0.141
pped-0.140
pping-0.138
Token to
Token ID284
Feature activation-5.39e-03

Pos logit contributions
years+0.158
Acc+0.156
pped+0.152
eds+0.151
pping+0.150
Neg logit contributions
 So-0.105
 tickets-0.095
 message-0.094
 y-0.094
 sounds-0.093
Token play
Token ID711
Feature activation-5.23e-02

Pos logit contributions
years+0.010
Acc+0.009
eds+0.009
pped+0.009
umper+0.009
Neg logit contributions
 So-0.007
 message-0.006
 says-0.006
 tickets-0.006
 idea-0.006
Token she
Token ID673
Feature activation-1.19e-02

Pos logit contributions
 So+0.046
 y+0.044
 says+0.042
 goodbye+0.040
 hello+0.039
Neg logit contributions
eds-0.104
ini-0.102
Acc-0.101
umper-0.099
mo-0.099
Token couldn
Token ID3521
Feature activation-1.52e-01

Pos logit contributions
Acc+0.023
eds+0.023
arter+0.023
 flooding+0.022
years+0.022
Neg logit contributions
 So-0.014
 said-0.014
 says-0.013
 y-0.013
 replied-0.012
Token't
Token ID470
Feature activation-1.62e-01

Pos logit contributions
years+0.264
Acc+0.243
eds+0.239
mo+0.234
rou+0.232
Neg logit contributions
 So-0.240
 says-0.209
 sounds-0.198
 y-0.194
 shouts-0.192
Token find
Token ID1064
Feature activation-1.03e-01

Pos logit contributions
years+0.261
Acc+0.257
rou+0.253
eds+0.252
mo+0.231
Neg logit contributions
 So-0.253
 message-0.227
 says-0.221
 idea-0.220
 look-0.217
Token him
Token ID683
Feature activation-1.41e-02

Pos logit contributions
Acc+0.182
years+0.176
rou+0.170
eds+0.168
pping+0.165
Neg logit contributions
 So-0.145
 says-0.130
 y-0.128
 message-0.127
 treats-0.127
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
years+0.025
Acc+0.025
eds+0.025
pping+0.024
pped+0.024
Neg logit contributions
 So-0.018
 prayer-0.016
 message-0.016
 says-0.015
 tickets-0.015
Token She
Token ID1375
Feature activation-2.02e-02

Pos logit contributions
 So+0.018
 message+0.017
 tickets+0.017
 idea+0.016
 says+0.016
Neg logit contributions
years-0.018
Acc-0.018
eds-0.017
pped-0.017
umper-0.016
Token felt
Token ID2936
Feature activation-7.94e-02

Pos logit contributions
Acc+0.039
years+0.038
eds+0.038
arter+0.037
pping+0.037
Neg logit contributions
 So-0.025
 says-0.022
 y-0.021
 sounds-0.021
 message-0.020
Token foolish
Token ID19538
Feature activation-4.89e-02

Pos logit contributions
eds+0.153
years+0.146
Acc+0.146
ini+0.143
pping+0.141
Neg logit contributions
 So-0.099
 y-0.093
 hello-0.087
 idea-0.087
 message-0.086
Token for
Token ID329
Feature activation-5.63e-03

Pos logit contributions
years+0.074
Acc+0.071
eds+0.070
pping+0.070
pped+0.070
Neg logit contributions
 So-0.075
 message-0.068
 tickets-0.067
 prayer-0.067
 list-0.066
Token not
Token ID407
Feature activation-7.86e-02

Pos logit contributions
ini+0.011
eds+0.010
Acc+0.010
rou+0.010
years+0.010
Neg logit contributions
 So-0.007
 y-0.006
 hello-0.006
 sounds-0.006
 words-0.006
Token not
Token ID407
Feature activation-8.04e-02

Pos logit contributions
Acc+0.079
years+0.078
eds+0.076
pped+0.074
pping+0.074
Neg logit contributions
 So-0.067
 message-0.058
 tickets-0.058
 says-0.057
 idea-0.056
Token for
Token ID329
Feature activation-4.70e-03

Pos logit contributions
years+0.138
Acc+0.136
eds+0.132
pped+0.129
pping+0.128
Neg logit contributions
 So-0.111
 message-0.097
 says-0.096
 tickets-0.096
 idea-0.095
Token your
Token ID534
Feature activation+2.66e-02

Pos logit contributions
Acc+0.009
years+0.008
umper+0.008
ini+0.008
idget+0.008
Neg logit contributions
 So-0.006
 tickets-0.006
 y-0.006
 sounds-0.006
 words-0.006
Token mouth
Token ID5422
Feature activation+1.72e-02

Pos logit contributions
 So+0.036
 message+0.032
 says+0.032
 tickets+0.031
 idea+0.031
Neg logit contributions
years-0.046
Acc-0.045
eds-0.044
pped-0.043
pping-0.042
Token.
Token ID13
Feature activation-7.51e-03

Pos logit contributions
 So+0.017
 says+0.014
 sounds+0.014
 y+0.014
 message+0.014
Neg logit contributions
years-0.039
Acc-0.038
umper-0.035
pped-0.035
eds-0.035
Token OK
Token ID7477
Feature activation+4.00e-02

Pos logit contributions
years+0.013
Acc+0.013
pped+0.012
eds+0.012
umper+0.012
Neg logit contributions
 So-0.010
 message-0.009
 idea-0.009
 tickets-0.009
 treats-0.009
Token?"
Token ID1701
Feature activation-5.66e-02

Pos logit contributions
 So+0.064
 says+0.058
 message+0.056
 sounds+0.056
 prayer+0.056
Neg logit contributions
years-0.066
Acc-0.064
pped-0.060
pping-0.059
umper-0.059
Token Mom
Token ID11254
Feature activation-5.97e-02

Pos logit contributions
years+0.101
Acc+0.099
eds+0.096
pped+0.094
pping+0.094
Neg logit contributions
 So-0.075
 message-0.066
 says-0.066
 tickets-0.065
 idea-0.064
Token says
Token ID1139
Feature activation+1.93e-01

Pos logit contributions
years+0.147
Acc+0.147
eds+0.142
pped+0.141
pping+0.140
Neg logit contributions
 So-0.034
 message-0.029
 tickets-0.027
 idea-0.025
 prayer-0.024
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
 So+0.233
 message+0.213
 tickets+0.210
 idea+0.207
 sounds+0.202
Neg logit contributions
Acc-0.359
years-0.356
pped-0.338
pping-0.336
eds-0.333
Token\n
Token ID198
Feature activation+9.54e-03

Pos logit contributions
years+0.023
Acc+0.023
eds+0.022
pped+0.022
pping+0.022
Neg logit contributions
 So-0.017
 says-0.015
 message-0.015
 tickets-0.014
 idea-0.014
Token until
Token ID1566
Feature activation-3.71e-02

Pos logit contributions
 So+0.089
 message+0.079
 says+0.078
 tickets+0.077
 sounds+0.076
Neg logit contributions
years-0.121
Acc-0.119
eds-0.115
pped-0.113
pping-0.112
Token her
Token ID607
Feature activation+3.86e-02

Pos logit contributions
years+0.064
Acc+0.063
eds+0.062
umper+0.060
pping+0.059
Neg logit contributions
 So-0.051
 message-0.046
 tickets-0.045
 says-0.045
 y-0.045
Token grandma
Token ID49890
Feature activation-6.82e-03

Pos logit contributions
 So+0.070
 says+0.063
 tickets+0.063
 fer+0.063
 replies+0.063
Neg logit contributions
years-0.045
Acc-0.044
rou-0.040
eds-0.039
pped-0.039
Token came
Token ID1625
Feature activation-3.80e-02

Pos logit contributions
years+0.011
Acc+0.011
pped+0.011
eds+0.011
pping+0.011
Neg logit contributions
 So-0.009
 message-0.008
 tickets-0.008
 prayer-0.008
 idea-0.008
Token back
Token ID736
Feature activation-1.11e-01

Pos logit contributions
arter+0.084
Acc+0.084
years+0.082
mo+0.081
umper+0.078
Neg logit contributions
 So-0.034
 message-0.031
 look-0.028
 chest-0.026
 treats-0.026
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
Acc+0.243
years+0.242
umper+0.235
pping+0.231
eds+0.231
Neg logit contributions
 So-0.112
 message-0.091
 says-0.090
 idea-0.088
 y-0.087
Token
Token ID220
Feature activation+5.80e-02

Pos logit contributions
 So+0.020
 message+0.018
 tickets+0.018
 idea+0.018
 says+0.018
Neg logit contributions
years-0.027
Acc-0.027
eds-0.026
pped-0.025
umper-0.025
Token\n
Token ID198
Feature activation+2.16e-02

Pos logit contributions
 So+0.063
 says+0.047
 idea+0.037
 tickets+0.037
 sounds+0.037
Neg logit contributions
years-0.134
Acc-0.126
eds-0.126
mo-0.124
pping-0.124
Token\n
Token ID198
Feature activation+2.15e-02

Pos logit contributions
 So+0.024
 says+0.018
 y+0.016
So+0.016
 sounds+0.015
Neg logit contributions
years-0.048
mo-0.046
arter-0.046
pping-0.046
Acc-0.046
TokenDays
Token ID38770
Feature activation-1.05e-01

Pos logit contributions
 So+0.031
So+0.026
 y+0.025
 message+0.025
 sounds+0.025
Neg logit contributions
years-0.038
arter-0.038
eds-0.038
mo-0.038
umper-0.036
Token went
Token ID1816
Feature activation+2.80e-02

Pos logit contributions
years+0.163
pped+0.162
Acc+0.159
eds+0.157
pping+0.152
Neg logit contributions
 So-0.156
 message-0.137
 hello-0.137
 prayer-0.136
 idea-0.134
Token,
Token ID11
Feature activation-3.56e-02

Pos logit contributions
 funn+0.031
 So+0.030
 message+0.030
 replies+0.029
 sounds+0.029
Neg logit contributions
pped-0.025
 accidentally-0.022
pping-0.022
years-0.022
grown-0.021
Token Bob
Token ID5811
Feature activation-4.30e-03

Pos logit contributions
years+0.047
pped+0.046
 flooding+0.044
pping+0.044
 accidentally+0.043
Neg logit contributions
 So-0.058
 says-0.054
 idea-0.054
 replies-0.054
 sounds-0.054
Token arrived
Token ID5284
Feature activation+3.37e-02

Pos logit contributions
years+0.007
Acc+0.007
eds+0.007
pped+0.007
pping+0.007
Neg logit contributions
 So-0.006
 message-0.005
 idea-0.005
 tickets-0.005
 y-0.005
Token at
Token ID379
Feature activation+7.40e-02

Pos logit contributions
 So+0.048
 message+0.044
 says+0.043
 prayer+0.043
 sounds+0.043
Neg logit contributions
years-0.054
Acc-0.052
eds-0.052
pped-0.051
pping-0.051
Token the
Token ID262
Feature activation+2.04e-02

Pos logit contributions
 So+0.086
 tickets+0.083
 message+0.081
 says+0.079
 idea+0.079
Neg logit contributions
Acc-0.144
years-0.142
pping-0.137
pped-0.137
eds-0.135
Token beach
Token ID10481
Feature activation+3.52e-02

Pos logit contributions
 So+0.029
 message+0.027
 tickets+0.026
 sounds+0.026
 says+0.026
Neg logit contributions
years-0.034
Acc-0.034
eds-0.033
pped-0.032
umper-0.032
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
 So+0.048
 message+0.042
 says+0.042
 idea+0.041
 prayer+0.041
Neg logit contributions
years-0.058
eds-0.057
Acc-0.057
pping-0.056
pped-0.055
Token His
Token ID2399
Feature activation-1.31e-02

Pos logit contributions
 So+0.022
 says+0.019
 message+0.018
 tickets+0.018
 prayer+0.018
Neg logit contributions
years-0.036
Acc-0.036
eds-0.035
pped-0.035
pping-0.034
Token heart
Token ID2612
Feature activation+1.75e-02

Pos logit contributions
years+0.020
Acc+0.019
eds+0.019
umper+0.018
rou+0.018
Neg logit contributions
 So-0.020
 says-0.018
 tickets-0.018
 message-0.018
 y-0.017
Token was
Token ID373
Feature activation-4.97e-02

Pos logit contributions
 So+0.024
 message+0.021
 tickets+0.021
 says+0.021
 idea+0.021
Neg logit contributions
years-0.029
Acc-0.029
eds-0.028
pped-0.028
pping-0.028
Token pounding
Token ID33919
Feature activation-1.30e-01

Pos logit contributions
Acc+0.098
years+0.097
eds+0.094
umper+0.093
mo+0.090
Neg logit contributions
 So-0.056
 y-0.051
 message-0.050
 idea-0.049
 sounds-0.048
Token"
Token ID1
Feature activation-1.14e-01

Pos logit contributions
 So+0.006
So+0.005
 says+0.005
 sounds+0.004
 tickets+0.004
Neg logit contributions
mo-0.011
years-0.011
arter-0.010
Acc-0.010
idget-0.010
TokenCan
Token ID6090
Feature activation-3.17e-02

Pos logit contributions
Acc+0.190
years+0.188
pped+0.181
umper+0.174
pping+0.171
Neg logit contributions
 So-0.166
 message-0.151
 idea-0.150
 tickets-0.148
 prayer-0.146
Token I
Token ID314
Feature activation-4.18e-02

Pos logit contributions
years+0.068
Acc+0.067
eds+0.066
pped+0.064
umper+0.064
Neg logit contributions
 So-0.032
 says-0.027
 message-0.026
 tickets-0.026
 sounds-0.025
Token try
Token ID1949
Feature activation-1.90e-02

Pos logit contributions
Acc+0.085
years+0.085
eds+0.079
pping+0.076
rou+0.076
Neg logit contributions
 So-0.049
 says-0.043
 message-0.043
 tickets-0.042
 y-0.041
Token it
Token ID340
Feature activation-3.09e-02

Pos logit contributions
years+0.044
Acc+0.044
arted+0.040
ini+0.040
umper+0.038
Neg logit contributions
 says-0.018
 tickets-0.018
 look-0.017
 hello-0.017
 So-0.017
Token?"
Token ID1701
Feature activation-5.88e-02

Pos logit contributions
years+0.044
Acc+0.043
eds+0.042
pped+0.042
pping+0.041
Neg logit contributions
 So-0.051
 message-0.046
 tickets-0.045
 prayer-0.045
 says-0.045
Token Ben
Token ID3932
Feature activation-6.88e-03

Pos logit contributions
years+0.080
Acc+0.077
pped+0.072
umper+0.072
eds+0.070
Neg logit contributions
 So-0.098
 message-0.094
 tickets-0.091
 prayer-0.091
 sounds-0.090
Token asked
Token ID1965
Feature activation+5.75e-03

Pos logit contributions
years+0.008
umper+0.008
 accidentally+0.007
pping+0.007
Acc+0.007
Neg logit contributions
 tickets-0.012
 goodbye-0.012
 treats-0.012
 message-0.012
 prayer-0.012
Token.
Token ID13
Feature activation-7.84e-03

Pos logit contributions
 So+0.007
 message+0.006
 says+0.006
 tickets+0.006
 idea+0.005
Neg logit contributions
years-0.011
Acc-0.011
eds-0.011
pped-0.011
pping-0.011
Token\n
Token ID198
Feature activation+3.52e-03

Pos logit contributions
years+0.015
Acc+0.015
eds+0.015
pping+0.014
pped+0.014
Neg logit contributions
 So-0.010
 says-0.008
 message-0.008
 idea-0.008
 prayer-0.008
Token\n
Token ID198
Feature activation+3.68e-03

Pos logit contributions
 So+0.003
 says+0.002
So+0.002
 sounds+0.002
 replies+0.001
Neg logit contributions
years-0.009
pping-0.009
Acc-0.009
arter-0.009
mo-0.008
Token it
Token ID340
Feature activation-3.06e-02

Pos logit contributions
years+0.071
Acc+0.070
eds+0.067
pped+0.066
pping+0.066
Neg logit contributions
 So-0.051
 message-0.045
 says-0.045
 tickets-0.044
 sounds-0.044
Token."
Token ID526
Feature activation-1.37e-02

Pos logit contributions
years+0.053
eds+0.053
pping+0.053
Acc+0.053
pped+0.052
Neg logit contributions
 So-0.038
 message-0.034
 prayer-0.034
 tickets-0.033
 says-0.033
Token\n
Token ID198
Feature activation+6.39e-03

Pos logit contributions
years+0.025
Acc+0.025
eds+0.024
pped+0.023
pping+0.023
Neg logit contributions
 So-0.018
 message-0.015
 says-0.015
 tickets-0.015
 idea-0.015
Token\n
Token ID198
Feature activation+6.27e-03

Pos logit contributions
 So+0.006
 says+0.005
 sounds+0.004
 shouts+0.004
 hello+0.004
Neg logit contributions
years-0.016
pping-0.015
mo-0.015
Acc-0.015
pped-0.015
TokenAnna
Token ID31160
Feature activation+3.66e-03

Pos logit contributions
 So+0.009
 says+0.008
 sounds+0.008
 message+0.008
 replies+0.008
Neg logit contributions
years-0.011
Acc-0.010
eds-0.010
pping-0.010
mo-0.010
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
 So+0.005
 tickets+0.004
 message+0.004
 idea+0.004
 prayer+0.004
Neg logit contributions
Acc-0.006
years-0.006
pped-0.006
eds-0.006
pping-0.006
Token Ben
Token ID3932
Feature activation-8.68e-03

Pos logit contributions
years+0.035
Acc+0.034
eds+0.032
pped+0.032
umper+0.032
Neg logit contributions
 So-0.041
 message-0.037
 tickets-0.037
 idea-0.036
 says-0.036
Token felt
Token ID2936
Feature activation-8.33e-02

Pos logit contributions
pped+0.014
years+0.014
Acc+0.014
pping+0.013
umper+0.012
Neg logit contributions
 So-0.011
 tickets-0.011
 prayer-0.011
 message-0.011
ww-0.011
Token sorry
Token ID7926
Feature activation+1.37e-02

Pos logit contributions
years+0.123
Acc+0.123
pped+0.115
eds+0.111
pping+0.111
Neg logit contributions
 So-0.130
 tickets-0.119
 message-0.118
 prayer-0.115
 fer-0.115
Token.
Token ID13
Feature activation-1.48e-03

Pos logit contributions
 So+0.020
 fer+0.019
 Fer+0.019
 tickets+0.019
 message+0.019
Neg logit contributions
Acc-0.021
years-0.020
eds-0.020
rou-0.019
pping-0.019
Token They
Token ID1119
Feature activation+6.20e-02

Pos logit contributions
Acc+0.002
years+0.002
ges+0.001
 flooding+0.001
Sometimes+0.001
Neg logit contributions
 fer-0.003
 message-0.003
 medal-0.003
 Fer-0.003
 treats-0.003
Token They
Token ID1119
Feature activation+6.96e-02

Pos logit contributions
 So+0.008
 says+0.006
 tickets+0.006
 prayer+0.006
 sounds+0.006
Neg logit contributions
years-0.015
eds-0.015
Acc-0.015
pping-0.015
pped-0.015
Token were
Token ID547
Feature activation+3.34e-02

Pos logit contributions
 So+0.085
 y+0.078
 says+0.077
 tickets+0.075
 idea+0.073
Neg logit contributions
Acc-0.130
years-0.129
eds-0.128
arter-0.124
umper-0.122
Token happy
Token ID3772
Feature activation-2.50e-02

Pos logit contributions
 So+0.042
 y+0.037
 says+0.037
 idea+0.037
 message+0.037
Neg logit contributions
years-0.061
Acc-0.061
eds-0.059
umper-0.058
pped-0.058
Token again
Token ID757
Feature activation-1.28e-01

Pos logit contributions
years+0.044
Acc+0.044
eds+0.042
pped+0.041
pping+0.041
Neg logit contributions
 So-0.034
 message-0.030
 says-0.030
 tickets-0.030
 idea-0.029
Token.
Token ID13
Feature activation+6.16e-03

Pos logit contributions
years+0.237
Acc+0.235
eds+0.231
pping+0.224
pped+0.224
Neg logit contributions
 So-0.154
 message-0.135
 tickets-0.134
 says-0.133
 idea-0.131
Token<|endoftext|>
Token ID50256
Feature activation-4.59e-02

Pos logit contributions
 So+0.008
 says+0.006
 message+0.006
 tickets+0.006
 sounds+0.006
Neg logit contributions
years-0.012
Acc-0.012
eds-0.011
pped-0.011
pping-0.011
TokenAnna
Token ID31160
Feature activation+5.73e-03

Pos logit contributions
years+0.074
Acc+0.073
eds+0.071
pped+0.070
umper+0.069
Neg logit contributions
 So-0.066
 message-0.060
 tickets-0.059
 says-0.058
 idea-0.058
Token and
Token ID290
Feature activation-1.36e-02

Pos logit contributions
 So+0.007
 says+0.006
 sounds+0.006
 y+0.006
 message+0.006
Neg logit contributions
years-0.011
Acc-0.011
eds-0.011
pping-0.010
umper-0.010
Token Ben
Token ID3932
Feature activation-1.43e-04

Pos logit contributions
years+0.020
Acc+0.019
pped+0.018
eds+0.018
umper+0.018
Neg logit contributions
 So-0.022
 message-0.020
 tickets-0.020
 sounds-0.020
 idea-0.020
Token are
Token ID389
Feature activation+9.31e-02

Pos logit contributions
years+0.000
Acc+0.000
eds+0.000
pping+0.000
rou+0.000
Neg logit contributions
 So-0.000
 says-0.000
 sounds-0.000
 y-0.000
 tickets-0.000
Token twins
Token ID20345
Feature activation+7.49e-02

Pos logit contributions
 So+0.139
 y+0.131
 says+0.127
 tickets+0.125
 idea+0.124
Neg logit contributions
eds-0.152
Acc-0.148
pped-0.148
pping-0.145
umper-0.144
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
 So+0.068
 message+0.059
 says+0.058
 tickets+0.057
 sounds+0.057
Neg logit contributions
years-0.091
Acc-0.089
eds-0.087
umper-0.087
pped-0.086
Token took
Token ID1718
Feature activation+1.81e-02

Pos logit contributions
years+0.017
eds+0.017
arter+0.017
pping+0.017
pped+0.016
Neg logit contributions
 So-0.015
 says-0.014
 y-0.013
 tickets-0.013
 message-0.013
Token her
Token ID607
Feature activation+3.97e-02

Pos logit contributions
 So+0.031
 prayer+0.029
 message+0.029
 tale+0.028
 idea+0.028
Neg logit contributions
years-0.024
pped-0.023
Acc-0.022
umper-0.022
eds-0.021
Token home
Token ID1363
Feature activation-5.54e-02

Pos logit contributions
 So+0.056
 message+0.050
 says+0.049
 tickets+0.049
 idea+0.048
Neg logit contributions
years-0.068
Acc-0.067
eds-0.064
pped-0.063
pping-0.063
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
years+0.104
Acc+0.103
eds+0.099
pped+0.097
pping+0.096
Neg logit contributions
 So-0.066
 says-0.058
 message-0.058
 tickets-0.057
 sounds-0.057
Token Lily
Token ID20037
Feature activation-7.57e-02

Pos logit contributions
 So+0.022
 says+0.018
 sounds+0.016
 prayer+0.016
 y+0.016
Neg logit contributions
eds-0.032
pped-0.032
pping-0.032
years-0.032
umper-0.032
Token's
Token ID338
Feature activation+3.28e-02

Pos logit contributions
arter+0.152
years+0.152
Acc+0.146
eds+0.146
umper+0.143
Neg logit contributions
 So-0.091
 says-0.080
 y-0.075
 sounds-0.073
 message-0.070
Token knee
Token ID10329
Feature activation-6.84e-02

Pos logit contributions
 So+0.047
 message+0.045
 idea+0.043
 prayer+0.043
 y+0.043
Neg logit contributions
years-0.055
Acc-0.055
eds-0.053
umper-0.052
pped-0.052
Token hurt
Token ID5938
Feature activation-1.17e-01

Pos logit contributions
Acc+0.111
years+0.110
eds+0.106
pped+0.106
pping+0.104
Neg logit contributions
 So-0.098
 tickets-0.088
 message-0.088
 says-0.086
 prayer-0.086
Token so
Token ID523
Feature activation+1.15e-01

Pos logit contributions
years+0.223
Acc+0.217
eds+0.213
umper+0.209
mo+0.207
Neg logit contributions
 So-0.146
 sounds-0.122
 message-0.121
 y-0.120
 idea-0.120
Token much
Token ID881
Feature activation-4.37e-02

Pos logit contributions
 So+0.111
 y+0.103
 sounds+0.096
 goodbye+0.092
 words+0.088
Neg logit contributions
eds-0.247
pping-0.244
umper-0.244
years-0.242
mo-0.240
Token They
Token ID1119
Feature activation+6.75e-02

Pos logit contributions
Acc+0.000
years+0.000
eds+0.000
umper+0.000
arter+0.000
Neg logit contributions
 So-0.000
 message-0.000
 tickets-0.000
 fer-0.000
 y-0.000
Token did
Token ID750
Feature activation+1.05e-01

Pos logit contributions
 So+0.085
 message+0.076
 tickets+0.076
 prayer+0.074
 idea+0.073
Neg logit contributions
years-0.123
Acc-0.122
pped-0.116
eds-0.115
pping-0.114
Token not
Token ID407
Feature activation-7.85e-02

Pos logit contributions
 So+0.138
 idea+0.126
 y+0.126
 sounds+0.124
 says+0.124
Neg logit contributions
years-0.200
umper-0.187
eds-0.185
mo-0.182
Acc-0.180
Token know
Token ID760
Feature activation+1.48e-02

Pos logit contributions
years+0.136
Acc+0.134
eds+0.130
pped+0.127
pping+0.126
Neg logit contributions
 So-0.107
 message-0.095
 says-0.094
 tickets-0.093
 idea-0.092
Token what
Token ID644
Feature activation-1.71e-02

Pos logit contributions
 So+0.018
 y+0.018
 says+0.018
 tickets+0.017
 sounds+0.017
Neg logit contributions
years-0.029
umper-0.028
eds-0.027
Acc-0.027
ini-0.026
Token to
Token ID284
Feature activation-5.69e-03

Pos logit contributions
years+0.032
Acc+0.031
eds+0.031
pped+0.031
umper+0.030
Neg logit contributions
 So-0.022
 says-0.020
 y-0.019
 message-0.019
 sounds-0.019
Token do
Token ID466
Feature activation+4.91e-02

Pos logit contributions
eds+0.010
pped+0.010
arter+0.010
years+0.010
pping+0.010
Neg logit contributions
 So-0.006
 goodbye-0.006
 fer-0.006
 prayer-0.005
 tickets-0.005
Token.
Token ID13
Feature activation+4.71e-04

Pos logit contributions
 fer+0.075
 So+0.074
 Fer+0.073
 tickets+0.072
 prayer+0.072
Neg logit contributions
Acc-0.067
pping-0.063
years-0.062
pped-0.062
eds-0.060
Token They
Token ID1119
Feature activation+6.87e-02

Pos logit contributions
 So+0.001
 message+0.000
 tickets+0.000
 idea+0.000
 fer+0.000
Neg logit contributions
Acc-0.001
years-0.001
eds-0.001
umper-0.001
pped-0.001
Token heard
Token ID2982
Feature activation-4.22e-02

Pos logit contributions
 So+0.088
 message+0.079
 tickets+0.078
 prayer+0.076
 idea+0.076
Neg logit contributions
years-0.124
Acc-0.122
pped-0.116
eds-0.115
pping-0.114
Token Mom
Token ID11254
Feature activation-5.94e-02

Pos logit contributions
years+0.077
Acc+0.076
eds+0.074
pping+0.073
umper+0.072
Neg logit contributions
 So-0.056
 says-0.051
 message-0.051
 tickets-0.051
 sounds-0.050
Token now
Token ID783
Feature activation-1.22e-01

Pos logit contributions
 So+0.081
 message+0.073
 idea+0.072
 says+0.071
 tickets+0.071
Neg logit contributions
years-0.080
Acc-0.079
eds-0.075
pped-0.073
umper-0.073
Token?"
Token ID1701
Feature activation-4.34e-02

Pos logit contributions
years+0.145
Acc+0.143
eds+0.140
pped+0.140
pping+0.140
Neg logit contributions
 So-0.226
 message-0.207
 tickets-0.206
 says-0.203
 sounds-0.202
Token
Token ID198
Feature activation+2.28e-02

Pos logit contributions
Acc+0.074
years+0.072
eds+0.067
umper+0.067
pped+0.066
Neg logit contributions
 So-0.057
 message-0.057
 tickets-0.056
 idea-0.054
 fer-0.053
Token
Token ID198
Feature activation+2.25e-02

Pos logit contributions
 So+0.031
So+0.024
 says+0.024
 message+0.023
 y+0.022
Neg logit contributions
years-0.045
arter-0.043
mo-0.043
pping-0.042
eds-0.042
TokenMom
Token ID29252
Feature activation-5.62e-02

Pos logit contributions
 So+0.045
 message+0.040
 sounds+0.039
 prayer+0.039
 says+0.039
Neg logit contributions
years-0.026
eds-0.025
Acc-0.024
mo-0.024
arter-0.024
Tokenmy
Token ID1820
Feature activation-2.05e-01

Pos logit contributions
years+0.045
Acc+0.044
eds+0.041
arter+0.039
pped+0.038
Neg logit contributions
 So-0.134
 says-0.125
 prayer-0.121
 sounds-0.120
 message-0.120
Token smiled
Token ID13541
Feature activation-3.74e-03

Pos logit contributions
years+0.399
Acc+0.394
pped+0.380
eds+0.376
pping+0.375
Neg logit contributions
 So-0.222
 message-0.195
 tickets-0.194
 idea-0.191
 sounds-0.188
Token and
Token ID290
Feature activation-7.66e-03

Pos logit contributions
Acc+0.005
years+0.005
pped+0.005
rou+0.005
umper+0.004
Neg logit contributions
 tickets-0.006
 So-0.006
 message-0.006
 Fer-0.006
 passports-0.006
Token replied
Token ID8712
Feature activation+1.13e-01

Pos logit contributions
Acc+0.015
years+0.015
pped+0.014
rou+0.014
 accidentally+0.014
Neg logit contributions
 idea-0.007
 tale-0.007
 message-0.007
 price-0.007
 So-0.007
Token,
Token ID11
Feature activation-3.45e-02

Pos logit contributions
 So+0.138
 message+0.132
 tickets+0.130
 says+0.126
 sounds+0.124
Neg logit contributions
years-0.202
Acc-0.200
pped-0.191
pping-0.187
eds-0.183
Token "
Token ID366
Feature activation-9.06e-02

Pos logit contributions
Acc+0.050
years+0.049
pping+0.045
pped+0.045
eds+0.043
Neg logit contributions
 So-0.054
 message-0.053
 tickets-0.053
 idea-0.051
 says-0.051
Token prevented
Token ID13351
Feature activation+5.12e-02

Pos logit contributions
 So+0.181
 tickets+0.155
 hello+0.155
 message+0.153
 tale+0.152
Neg logit contributions
years-0.175
pped-0.169
Acc-0.164
pping-0.161
eds-0.159
Token Laura
Token ID16753
Feature activation+7.04e-02

Pos logit contributions
 So+0.074
 fer+0.066
 Fer+0.066
 bisc+0.065
 funn+0.065
Neg logit contributions
pping-0.078
pped-0.076
years-0.076
eds-0.075
Acc-0.074
Token and
Token ID290
Feature activation-9.95e-03

Pos logit contributions
 So+0.102
 Fer+0.097
 hello+0.092
 MacDonald+0.091
 fer+0.091
Neg logit contributions
pping-0.105
eds-0.104
years-0.103
pped-0.097
 flooding-0.097
Token Alice
Token ID14862
Feature activation+7.79e-02

Pos logit contributions
years+0.015
Acc+0.014
rou+0.013
 flooding+0.013
eds+0.013
Neg logit contributions
 So-0.015
 message-0.013
 idea-0.013
 tale-0.013
 fer-0.013
Token from
Token ID422
Feature activation+1.31e-01

Pos logit contributions
 So+0.108
 message+0.099
 idea+0.097
 tickets+0.097
 hello+0.094
Neg logit contributions
years-0.128
Acc-0.124
eds-0.122
pping-0.120
pped-0.120
Token having
Token ID1719
Feature activation-1.46e-01

Pos logit contributions
 tickets+0.180
 So+0.179
 message+0.177
 y+0.176
 idea+0.175
Neg logit contributions
Acc-0.211
umper-0.206
eds-0.206
years-0.203
ini-0.201
Token an
Token ID281
Feature activation+6.86e-02

Pos logit contributions
years+0.250
Acc+0.231
pped+0.223
eds+0.222
grown+0.221
Neg logit contributions
 So-0.198
 message-0.173
 says-0.172
 prayer-0.172
 replies-0.171
Token argument
Token ID4578
Feature activation+4.49e-03

Pos logit contributions
 So+0.086
 sounds+0.074
 says+0.074
 message+0.074
 prayer+0.072
Neg logit contributions
years-0.127
Acc-0.121
pping-0.116
pped-0.115
eds-0.115
Token,
Token ID11
Feature activation-3.51e-02

Pos logit contributions
 So+0.006
 message+0.005
 says+0.005
 tickets+0.005
 prayer+0.005
Neg logit contributions
years-0.008
Acc-0.008
pped-0.008
eds-0.008
pping-0.008
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
years+0.056
Acc+0.056
eds+0.053
pped+0.052
pping+0.052
Neg logit contributions
 So-0.052
 message-0.047
 tickets-0.046
 says-0.046
 idea-0.045
Token they
Token ID484
Feature activation+8.84e-02

Pos logit contributions
years+0.017
Acc+0.016
pped+0.015
eds+0.015
pping+0.015
Neg logit contributions
 So-0.015
 message-0.013
 idea-0.013
 tickets-0.012
 fer-0.012
Token flew
Token ID13112
Feature activation+2.46e-02

Pos logit contributions
years+0.018
Acc+0.018
eds+0.018
pping+0.017
umper+0.017
Neg logit contributions
 So-0.012
 message-0.011
 says-0.011
 y-0.011
 sounds-0.011
Token away
Token ID1497
Feature activation+1.03e-01

Pos logit contributions
 So+0.028
 message+0.024
 says+0.023
 tickets+0.023
 idea+0.023
Neg logit contributions
years-0.049
Acc-0.048
eds-0.047
pped-0.046
pping-0.046
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
 So+0.129
 tickets+0.112
 says+0.111
 message+0.111
 idea+0.107
Neg logit contributions
years-0.188
Acc-0.186
eds-0.181
pping-0.177
pped-0.176
Token Lily
Token ID20037
Feature activation-7.69e-02

Pos logit contributions
 So+0.028
 message+0.027
 tickets+0.027
 prayer+0.026
 fer+0.026
Neg logit contributions
Acc-0.021
years-0.021
eds-0.018
pped-0.018
umper-0.018
Token was
Token ID373
Feature activation-5.32e-02

Pos logit contributions
years+0.148
Acc+0.145
arter+0.141
eds+0.140
umper+0.137
Neg logit contributions
 So-0.095
 says-0.083
 sounds-0.081
 message-0.080
 y-0.079
Token sad
Token ID6507
Feature activation-1.45e-01

Pos logit contributions
years+0.117
Acc+0.116
eds+0.116
umper+0.113
pped+0.112
Neg logit contributions
 So-0.046
 y-0.039
 message-0.039
 idea-0.038
 says-0.037
Token to
Token ID284
Feature activation-4.49e-03

Pos logit contributions
years+0.264
Acc+0.256
eds+0.251
pped+0.246
umper+0.243
Neg logit contributions
 So-0.197
 says-0.170
 message-0.169
 idea-0.168
 sounds-0.166
Token see
Token ID766
Feature activation-7.78e-02

Pos logit contributions
years+0.009
Acc+0.009
rou+0.008
eds+0.008
mo+0.008
Neg logit contributions
 So-0.006
 message-0.005
 says-0.005
 thank-0.005
 idea-0.005
Token it
Token ID340
Feature activation-2.45e-02

Pos logit contributions
eds+0.159
Acc+0.156
years+0.154
pping+0.153
pped+0.152
Neg logit contributions
 says-0.082
 y-0.081
 hello-0.079
 message-0.078
 So-0.077
Token go
Token ID467
Feature activation+4.03e-02

Pos logit contributions
years+0.041
Acc+0.041
eds+0.040
pped+0.039
pping+0.039
Neg logit contributions
 So-0.034
 message-0.030
 tickets-0.030
 says-0.030
 idea-0.029
Token,
Token ID11
Feature activation-4.01e-02

Pos logit contributions
 So+0.043
 message+0.039
 idea+0.036
 says+0.036
 sounds+0.036
Neg logit contributions
years-0.083
Acc-0.082
eds-0.078
pped-0.077
umper-0.076
Token smiled
Token ID13541
Feature activation-1.53e-03

Pos logit contributions
 So+0.159
 says+0.138
 message+0.136
 sounds+0.134
 tickets+0.133
Neg logit contributions
years-0.235
Acc-0.232
eds-0.225
pped-0.220
umper-0.219
Token and
Token ID290
Feature activation-9.17e-03

Pos logit contributions
years+0.002
Acc+0.002
eds+0.002
pped+0.002
pping+0.002
Neg logit contributions
 So-0.002
 tickets-0.002
 message-0.002
 says-0.002
 treats-0.002
Token wasn
Token ID2492
Feature activation-8.96e-02

Pos logit contributions
Acc+0.016
years+0.015
pped+0.015
 accidentally+0.014
rou+0.014
Neg logit contributions
 So-0.011
 tickets-0.011
 message-0.011
 list-0.011
 idea-0.010
Token't
Token ID470
Feature activation-1.59e-01

Pos logit contributions
Acc+0.130
eds+0.128
years+0.127
ini+0.117
umper+0.117
Neg logit contributions
 So-0.153
 y-0.140
 message-0.136
 tickets-0.135
 says-0.134
Token scared
Token ID12008
Feature activation-1.88e-02

Pos logit contributions
years+0.233
Acc+0.228
eds+0.216
pping+0.214
umper+0.213
Neg logit contributions
 So-0.259
 tickets-0.236
 message-0.234
 prayer-0.229
 says-0.228
Token anymore
Token ID7471
Feature activation-1.81e-01

Pos logit contributions
years+0.027
Acc+0.027
eds+0.026
pped+0.025
umper+0.025
Neg logit contributions
 So-0.032
 idea-0.028
 says-0.028
 message-0.028
 y-0.028
Token.
Token ID13
Feature activation+1.88e-02

Pos logit contributions
years+0.334
Acc+0.327
eds+0.316
pping+0.313
pped+0.310
Neg logit contributions
 So-0.217
 tickets-0.197
 message-0.195
 says-0.192
 fer-0.188
Token She
Token ID1375
Feature activation-1.81e-02

Pos logit contributions
 So+0.027
 says+0.023
 prayer+0.022
 sounds+0.022
 message+0.022
Neg logit contributions
years-0.033
eds-0.032
Acc-0.032
pped-0.031
pping-0.031
Token was
Token ID373
Feature activation-5.05e-02

Pos logit contributions
years+0.034
Acc+0.033
eds+0.032
umper+0.031
pped+0.031
Neg logit contributions
 So-0.024
 says-0.021
 y-0.020
 message-0.020
 sounds-0.020
Token so
Token ID523
Feature activation+1.14e-01

Pos logit contributions
eds+0.099
years+0.098
Acc+0.097
umper+0.096
pped+0.094
Neg logit contributions
 So-0.055
 y-0.051
 idea-0.049
 message-0.048
 goodbye-0.047
Token happy
Token ID3772
Feature activation-2.20e-02

Pos logit contributions
 So+0.135
 y+0.123
 says+0.118
 message+0.117
 idea+0.116
Neg logit contributions
eds-0.219
years-0.216
Acc-0.214
pping-0.211
pped-0.211
Token nuts
Token ID14380
Feature activation+2.34e-02

Pos logit contributions
years+0.219
Acc+0.218
eds+0.202
pped+0.202
umper+0.199
Neg logit contributions
 So-0.158
 says-0.138
 y-0.138
 sounds-0.136
 message-0.136
Token."
Token ID526
Feature activation+1.63e-03

Pos logit contributions
 So+0.026
 message+0.022
 says+0.022
 tickets+0.022
 idea+0.022
Neg logit contributions
years-0.047
Acc-0.046
eds-0.044
pped-0.044
umper-0.043
Token
Token ID198
Feature activation+2.11e-02

Pos logit contributions
 So+0.002
 tickets+0.002
 message+0.002
 prayer+0.002
 fer+0.002
Neg logit contributions
Acc-0.003
years-0.003
umper-0.003
eds-0.003
pped-0.002
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
 So+0.022
 says+0.016
So+0.014
 message+0.013
 replied+0.013
Neg logit contributions
years-0.050
pping-0.049
mo-0.048
arter-0.048
pped-0.048
TokenSam
Token ID16305
Feature activation-8.06e-02

Pos logit contributions
 So+0.036
So+0.031
 message+0.030
 sounds+0.029
 says+0.028
Neg logit contributions
arter-0.033
mo-0.032
years-0.031
pping-0.030
eds-0.030
Tokenmy
Token ID1820
Feature activation-2.07e-01

Pos logit contributions
arter+0.086
years+0.086
Acc+0.082
 flooding+0.079
pped+0.076
Neg logit contributions
 So-0.175
 says-0.156
 replied-0.154
 sounds-0.154
 idea-0.149
Token used
Token ID973
Feature activation-8.47e-02

Pos logit contributions
years+0.397
Acc+0.392
eds+0.388
umper+0.375
pped+0.372
Neg logit contributions
 So-0.253
 says-0.215
 message-0.212
 sounds-0.208
 y-0.208
Token his
Token ID465
Feature activation+3.46e-02

Pos logit contributions
eds+0.170
Acc+0.170
years+0.166
pping+0.161
ind+0.158
Neg logit contributions
 So-0.100
 y-0.090
 idea-0.088
 treats-0.085
 tickets-0.084
Token strong
Token ID1913
Feature activation-4.70e-02

Pos logit contributions
 So+0.043
 message+0.038
 idea+0.038
 y+0.037
 says+0.037
Neg logit contributions
years-0.066
Acc-0.065
eds-0.063
pped-0.062
umper-0.061
Token teeth
Token ID9941
Feature activation-1.82e-02

Pos logit contributions
years+0.088
Acc+0.086
eds+0.084
pped+0.082
pping+0.082
Neg logit contributions
 So-0.056
 message-0.049
 tickets-0.049
 says-0.049
 sounds-0.047
Token to
Token ID284
Feature activation-7.22e-03

Pos logit contributions
years+0.034
Acc+0.033
eds+0.033
umper+0.032
pped+0.031
Neg logit contributions
 So-0.023
 message-0.020
 idea-0.019
 tickets-0.019
 y-0.019