TOP ACTIVATIONS
MAX = 0.290

he went higher and higher up the mountain .
knew he could come back up again .
to go higher and higher up in the sky , looking
knew she wouldn 't stay up past her bed time again
it made her smile come back . <|endoftext|> Once there was
sun had started to come up . She suddenly could feel
the sun started to come up . Then , it flew
animals and they would come back soon . Sam
on tight as they soared up into the sky . Ho
as the yacht bob bed up and down .
it still wouldn 't stay up . One
next time he could come back to play . <|endoftext|> Once
and watch it soar high up in the sky .
and promised they would come back to the dentist soon .
the storm wouldn 't come back again . <|endoftext|> Once upon
good boy they would come back another day and see what
up in the branches farther up you will find a banana
to see the sun come up each morning . Every day
could feel the joy rising up in her heart .
and promised he would come back soon . <|endoftext|> One day

MIN ACTIVATIONS
MIN = -0.277

" L em ons ! Lem ons ! Beautiful juicy le
and y ucky . " Sal ad ? That 's all
Larry . " Will you play with me ?"
You learned something new . Lem ons are good for some
dad . " Will the storm go away ?"
him . " Will you help me , Wh
said . " Will everyone be okay ?" Ben
people . " Will the earthquake stop , mom
said . " Will she be happy to see
says . " Will they be loud ?" Ben
He asked , " Will you play with me ?"
screen . " Was n 't that fun ?"
honey . " Will it be great ?" Sally
see . " Will I see cows , mum
water . " Will the egg be warm now
excited . She asked " Will I get to help you
network . " Will you be my friend ?"
, " Will ie ! Did you steal
John : " Will you help me build a
scared . " Will the dentist hurt me ?"

INTERVAL 0.148 - 0.290
CONTAINS 1.035%

grab one of the sticks lying nearby . He slowly reached
heard them . They ran back to the je ep .
and play in the snow again ?" " Of
more . As she was cleaning up , Lily
L ily and Max continued to be best friends and

INTERVAL 0.007 - 0.148
CONTAINS 43.113%

it . She wanted to go home . But
someone say , " Close the hole !" He jumped
Tom felt happy knowing that he could share his love for
put the rock back on the ground . She gave the
Lu cy 's dad nodded , " Yes , but they

INTERVAL -0.135 - 0.007
CONTAINS 54.847%

playing with his toy airplane in the backyard . Suddenly ,
I 'm a brown spider who lives in this shed and
tree , I 'm helping you !" The little
sang loud and happy . They sang , " Pot ato
RE VERS E , huh ? That 's what you get

INTERVAL -0.277 - -0.135
CONTAINS 1.005%

! She looked around , but she couldn 't see anyone
you can ride Star , but be gentle ."
kind and generous . You should apologize to each other and
I was wrong . I should have listened . The grass
put them on and gave Grand ad a playful punch .
Token he
Token ID339
Feature activation+1.68e-01

Pos logit contributions
uddle+0.101
s+0.083
 big+0.072
udd+0.069
 sad+0.067
Neg logit contributions
 matter-0.227
 occasions-0.214
icians-0.208
bet-0.208
illa-0.208
Token went
Token ID1816
Feature activation+1.28e-01

Pos logit contributions
uddle+0.262
s+0.218
udd+0.199
+0.178
 She+0.169
Neg logit contributions
 matter-0.357
 yours-0.308
 remind-0.304
 avail-0.300
icians-0.299
Token higher
Token ID2440
Feature activation+2.16e-01

Pos logit contributions
uddle+0.203
udd+0.159
s+0.155
+0.125
icker+0.122
Neg logit contributions
 matter-0.263
 remind-0.227
 letting-0.226
 yours-0.222
 protection-0.217
Token and
Token ID290
Feature activation+3.58e-02

Pos logit contributions
uddle+0.294
s+0.230
udd+0.211
+0.179
 She+0.175
Neg logit contributions
 matter-0.502
 remind-0.432
icians-0.425
 yours-0.425
 letting-0.422
Token higher
Token ID2440
Feature activation+1.87e-01

Pos logit contributions
uddle+0.061
s+0.045
udd+0.044
eth+0.039
uffy+0.038
Neg logit contributions
 matter-0.075
 remind-0.061
 letting-0.061
 sizes-0.058
 avail-0.057
Token up
Token ID510
Feature activation+2.52e-01

Pos logit contributions
uddle+0.250
s+0.194
udd+0.178
+0.150
 She+0.146
Neg logit contributions
 matter-0.437
 remind-0.376
 yours-0.370
icians-0.370
 letting-0.369
Token the
Token ID262
Feature activation+1.44e-01

Pos logit contributions
uddle+0.429
udd+0.338
s+0.314
 gym+0.255
+0.245
Neg logit contributions
 matter-0.516
 letting-0.441
 remind-0.428
 yours-0.421
 avail-0.421
Token mountain
Token ID8598
Feature activation+7.04e-02

Pos logit contributions
uddle+0.277
s+0.233
udd+0.228
 She+0.198
+0.192
Neg logit contributions
 matter-0.254
 remind-0.198
pa-0.191
icated-0.190
 letting-0.190
Token.
Token ID13
Feature activation+3.82e-02

Pos logit contributions
uddle+0.068
s+0.056
udd+0.040
+0.034
 She+0.033
Neg logit contributions
 matter-0.190
 remind-0.167
 yours-0.167
bet-0.166
illa-0.166
Token
Token ID220
Feature activation+3.67e-02

Pos logit contributions
uddle+0.051
udd+0.038
s+0.035
 sad+0.026
obb+0.025
Neg logit contributions
 matter-0.084
 letting-0.076
bons-0.074
 remind-0.074
 lending-0.073
Token
Token ID198
Feature activation-3.89e-03

Pos logit contributions
uddle+0.044
udd+0.028
s+0.026
 gym+0.023
 messy+0.022
Neg logit contributions
 matter-0.075
 yours-0.074
illa-0.073
 letting-0.073
fulness-0.072
Token knew
Token ID2993
Feature activation+4.87e-02

Pos logit contributions
uddle+0.120
s+0.090
udd+0.083
+0.069
 She+0.066
Neg logit contributions
 matter-0.227
 remind-0.198
 letting-0.196
icians-0.194
 yours-0.194
Token he
Token ID339
Feature activation+1.25e-01

Pos logit contributions
uddle+0.057
s+0.049
 She+0.043
+0.043
 sad+0.041
Neg logit contributions
 matter-0.115
bet-0.103
icians-0.101
igh-0.100
bons-0.098
Token could
Token ID714
Feature activation-2.69e-02

Pos logit contributions
uddle+0.177
s+0.160
 sad+0.131
 He+0.130
udd+0.129
Neg logit contributions
 matter-0.265
 avail-0.237
bet-0.237
icians-0.235
icated-0.232
Token come
Token ID1282
Feature activation+1.04e-01

Pos logit contributions
 matter+0.055
icated+0.050
icians+0.050
bet+0.048
ocket+0.048
Neg logit contributions
uddle-0.040
s-0.034
-0.033
 She-0.032
 nicely-0.031
Token back
Token ID736
Feature activation+2.54e-01

Pos logit contributions
uddle+0.117
s+0.096
udd+0.077
+0.071
+0.069
Neg logit contributions
 matter-0.261
icians-0.226
bet-0.224
 occasions-0.223
icated-0.222
Token up
Token ID510
Feature activation+2.50e-01

Pos logit contributions
uddle+0.290
s+0.237
udd+0.194
+0.181
+0.179
Neg logit contributions
 matter-0.635
icians-0.552
bet-0.550
 remind-0.540
icated-0.540
Token again
Token ID757
Feature activation+2.30e-01

Pos logit contributions
uddle+0.273
s+0.224
udd+0.175
+0.171
+0.169
Neg logit contributions
 matter-0.626
bet-0.557
icians-0.553
illa-0.546
ocket-0.544
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
uddle+0.211
s+0.176
+0.133
+0.129
udd+0.128
Neg logit contributions
 matter-0.608
bet-0.545
icians-0.538
ocket-0.535
icated-0.532
Token
Token ID220
Feature activation+4.92e-02

Pos logit contributions
uddle+0.033
s+0.021
udd+0.017
+0.011
 She+0.010
Neg logit contributions
 matter-0.114
 remind-0.102
 yours-0.100
icians-0.100
illa-0.100
Token
Token ID198
Feature activation-3.34e-03

Pos logit contributions
uddle+0.050
udd+0.031
s+0.029
 sad+0.021
 gym+0.021
Neg logit contributions
 matter-0.115
 yours-0.113
illa-0.110
 letting-0.109
pa-0.109
Token
Token ID198
Feature activation-1.32e-02

Pos logit contributions
 matter+0.010
 remind+0.009
icians+0.009
illa+0.009
 letting+0.009
Neg logit contributions
uddle-0.002
s-0.001
udd-0.001
 sad-0.000
-0.000
Token to
Token ID284
Feature activation-3.31e-02

Pos logit contributions
uddle+0.163
s+0.136
udd+0.123
+0.108
 She+0.108
Neg logit contributions
 matter-0.236
icians-0.202
 remind-0.202
pa-0.201
illa-0.201
Token go
Token ID467
Feature activation+1.23e-01

Pos logit contributions
ocket+0.066
icians+0.065
illa+0.064
 matter+0.063
keeping+0.063
Neg logit contributions
uddle-0.041
s-0.038
 sad-0.036
-0.036
 big-0.035
Token higher
Token ID2440
Feature activation+2.12e-01

Pos logit contributions
uddle+0.173
s+0.135
udd+0.125
+0.107
 She+0.104
Neg logit contributions
 matter-0.281
 remind-0.242
 yours-0.238
icians-0.237
 letting-0.236
Token and
Token ID290
Feature activation+2.78e-02

Pos logit contributions
uddle+0.253
s+0.207
udd+0.176
+0.149
 She+0.144
Neg logit contributions
 matter-0.519
 remind-0.455
bet-0.450
ocket-0.449
icians-0.449
Token higher
Token ID2440
Feature activation+1.86e-01

Pos logit contributions
uddle+0.038
s+0.028
udd+0.026
+0.022
 She+0.021
Neg logit contributions
 matter-0.066
 remind-0.057
 yours-0.055
 letting-0.055
icians-0.055
Token up
Token ID510
Feature activation+2.50e-01

Pos logit contributions
uddle+0.239
s+0.198
udd+0.171
+0.148
 She+0.143
Neg logit contributions
 matter-0.440
 remind-0.383
illa-0.378
icians-0.378
bet-0.378
Token in
Token ID287
Feature activation+3.39e-02

Pos logit contributions
uddle+0.391
s+0.299
udd+0.297
+0.238
 gym+0.233
Neg logit contributions
 matter-0.536
 remind-0.452
 letting-0.450
 yours-0.446
 avail-0.438
Token the
Token ID262
Feature activation+1.02e-01

Pos logit contributions
+0.025
 big+0.023
+0.022
uddle+0.022
 The+0.021
Neg logit contributions
ocket-0.087
pper-0.086
 matter-0.085
illa-0.085
icians-0.083
Token sky
Token ID6766
Feature activation+4.67e-02

Pos logit contributions
uddle+0.180
s+0.150
udd+0.140
uddled+0.122
 nicely+0.120
Neg logit contributions
 matter-0.196
bons-0.153
 remind-0.150
pa-0.149
 sizes-0.146
Token,
Token ID11
Feature activation+8.73e-02

Pos logit contributions
uddle+0.044
s+0.036
udd+0.028
+0.023
+0.023
Neg logit contributions
 matter-0.125
 remind-0.110
icians-0.110
ocket-0.109
 yours-0.109
Token looking
Token ID2045
Feature activation+1.09e-01

Pos logit contributions
uddle+0.107
s+0.098
 sad+0.091
+0.087
 back+0.086
Neg logit contributions
 matter-0.198
 occasions-0.185
ocket-0.179
illa-0.179
pa-0.177
Token knew
Token ID2993
Feature activation+5.07e-03

Pos logit contributions
 matter+0.101
pper+0.091
bet+0.091
icians+0.090
ocket+0.086
Neg logit contributions
s-0.076
uddle-0.074
 She-0.068
 well-0.063
-0.063
Token she
Token ID673
Feature activation+1.17e-01

Pos logit contributions
uddle+0.006
s+0.005
 She+0.004
+0.004
udd+0.004
Neg logit contributions
 matter-0.012
bet-0.011
pper-0.011
icians-0.011
igh-0.011
Token wouldn
Token ID3636
Feature activation+5.93e-03

Pos logit contributions
s+0.138
uddle+0.132
+0.097
udd+0.097
 She+0.095
Neg logit contributions
 matter-0.274
 avail-0.265
bet-0.258
weet-0.256
icated-0.253
Token't
Token ID470
Feature activation-1.46e-02

Pos logit contributions
uddle+0.011
udd+0.009
s+0.009
+0.008
 nicely+0.008
Neg logit contributions
 matter-0.010
 remind-0.009
igh-0.008
 letting-0.008
icians-0.008
Token stay
Token ID2652
Feature activation+9.86e-02

Pos logit contributions
 matter+0.027
icians+0.026
ocket+0.025
bet+0.025
illa+0.025
Neg logit contributions
uddle-0.023
s-0.021
 sad-0.018
 She-0.018
udd-0.018
Token up
Token ID510
Feature activation+2.48e-01

Pos logit contributions
uddle+0.133
s+0.106
udd+0.096
 sad+0.084
+0.082
Neg logit contributions
 matter-0.228
icians-0.194
 remind-0.193
bet-0.193
icated-0.193
Token past
Token ID1613
Feature activation-3.35e-02

Pos logit contributions
uddle+0.290
s+0.245
udd+0.209
 sad+0.195
+0.190
Neg logit contributions
 matter-0.591
bet-0.528
illa-0.524
icians-0.522
pper-0.515
Token her
Token ID607
Feature activation+1.03e-01

Pos logit contributions
 matter+0.071
 remind+0.062
bet+0.061
 bid+0.060
epad+0.060
Neg logit contributions
uddle-0.050
s-0.045
udd-0.037
-0.037
 She-0.035
Token bed
Token ID3996
Feature activation+2.58e-02

Pos logit contributions
uddle+0.120
s+0.091
udd+0.078
+0.064
 sad+0.063
Neg logit contributions
 matter-0.257
icians-0.229
 remind-0.225
bet-0.224
 yours-0.224
Tokentime
Token ID2435
Feature activation+1.22e-01

Pos logit contributions
uddle+0.032
s+0.026
udd+0.023
 She+0.019
+0.018
Neg logit contributions
 matter-0.062
 remind-0.053
icians-0.053
bet-0.053
 yours-0.053
Token again
Token ID757
Feature activation+2.21e-01

Pos logit contributions
uddle+0.096
s+0.081
+0.060
 nicely+0.059
,+0.057
Neg logit contributions
 matter-0.329
icians-0.307
bet-0.305
igh-0.302
illa-0.302
Token it
Token ID340
Feature activation+1.53e-01

Pos logit contributions
uddle+0.106
s+0.080
udd+0.076
+0.068
 gym+0.065
Neg logit contributions
 matter-0.158
 yours-0.136
 remind-0.135
 letting-0.133
 protection-0.131
Token made
Token ID925
Feature activation+1.36e-02

Pos logit contributions
uddle+0.204
s+0.190
udd+0.154
 sad+0.141
 She+0.137
Neg logit contributions
 matter-0.330
bet-0.317
 avail-0.308
 yours-0.306
illa-0.304
Token her
Token ID607
Feature activation+7.81e-02

Pos logit contributions
uddle+0.013
s+0.012
 She+0.011
+0.011
 big+0.011
Neg logit contributions
 matter-0.031
pper-0.031
icians-0.031
igh-0.030
ocket-0.030
Token smile
Token ID8212
Feature activation+6.79e-02

Pos logit contributions
uddle+0.081
 sad+0.072
s+0.067
 big+0.062
 back+0.058
Neg logit contributions
icians-0.194
 matter-0.193
bet-0.188
pper-0.184
icated-0.181
Token come
Token ID1282
Feature activation+1.14e-01

Pos logit contributions
uddle+0.066
s+0.058
udd+0.043
 She+0.039
+0.036
Neg logit contributions
 matter-0.177
icians-0.158
bet-0.157
 occasions-0.155
illa-0.155
Token back
Token ID736
Feature activation+2.46e-01

Pos logit contributions
uddle+0.177
s+0.164
udd+0.138
 She+0.136
 sad+0.133
Neg logit contributions
 matter-0.228
icians-0.200
 occasions-0.194
bet-0.193
illa-0.192
Token.
Token ID13
Feature activation+5.06e-02

Pos logit contributions
uddle+0.239
s+0.202
udd+0.146
 She+0.135
+0.130
Neg logit contributions
 matter-0.662
icians-0.588
bet-0.581
icated-0.570
 remind-0.570
Token<|endoftext|>
Token ID50256
Feature activation-2.27e-02

Pos logit contributions
uddle+0.036
s+0.020
udd+0.016
+0.007
 sad+0.006
Neg logit contributions
 matter-0.150
 remind-0.134
 yours-0.133
illa-0.132
icians-0.132
TokenOnce
Token ID7454
Feature activation-6.09e-02

Pos logit contributions
 matter+0.060
 letting+0.054
igh+0.054
 remind+0.054
illa+0.054
Neg logit contributions
uddle-0.021
udd-0.013
s-0.012
 sad-0.008
eeper-0.008
Token there
Token ID612
Feature activation-6.09e-03

Pos logit contributions
 matter+0.112
pa+0.099
 yours+0.098
fulness+0.092
enna+0.091
Neg logit contributions
uddle-0.102
udd-0.080
eeper-0.074
ashes-0.072
 gym-0.067
Token was
Token ID373
Feature activation+2.51e-03

Pos logit contributions
 matter+0.013
 remind+0.011
pa+0.011
illa+0.011
 yours+0.011
Neg logit contributions
uddle-0.009
udd-0.007
s-0.007
eeper-0.005
ashes-0.005
Token sun
Token ID4252
Feature activation+3.37e-02

Pos logit contributions
uddle+0.081
 big+0.064
 sad+0.062
s+0.057
 gym+0.056
Neg logit contributions
 matter-0.137
icians-0.134
pper-0.130
 yours-0.127
illa-0.127
Token had
Token ID550
Feature activation-1.48e-02

Pos logit contributions
uddle+0.035
s+0.035
 She+0.024
+0.022
+0.022
Neg logit contributions
 matter-0.085
icated-0.082
 yours-0.081
icians-0.079
isdom-0.078
Token started
Token ID2067
Feature activation+1.58e-01

Pos logit contributions
 matter+0.028
illa+0.026
 avail+0.026
icians+0.026
pper+0.026
Neg logit contributions
uddle-0.020
s-0.020
 big-0.019
 She-0.017
udd-0.017
Token to
Token ID284
Feature activation-3.37e-02

Pos logit contributions
s+0.113
uddle+0.095
 She+0.079
 big+0.075
 sad+0.070
Neg logit contributions
 matter-0.425
icians-0.420
ocket-0.415
pper-0.415
bet-0.412
Token come
Token ID1282
Feature activation+9.06e-02

Pos logit contributions
icians+0.059
 matter+0.057
ocket+0.057
keeping+0.056
illa+0.055
Neg logit contributions
uddle-0.048
s-0.045
 big-0.045
-0.044
 She-0.043
Token up
Token ID510
Feature activation+2.44e-01

Pos logit contributions
uddle+0.127
s+0.104
udd+0.092
+0.081
 She+0.081
Neg logit contributions
 matter-0.203
 remind-0.175
icians-0.174
bet-0.172
illa-0.171
Token.
Token ID13
Feature activation+4.50e-02

Pos logit contributions
uddle+0.271
s+0.223
udd+0.181
+0.165
 She+0.163
Neg logit contributions
 matter-0.611
icians-0.542
bet-0.540
illa-0.535
ocket-0.533
Token She
Token ID1375
Feature activation-1.90e-03

Pos logit contributions
uddle+0.058
udd+0.042
s+0.040
 sad+0.027
eeper+0.026
Neg logit contributions
 matter-0.104
 letting-0.093
 remind-0.092
 occasions-0.092
 protection-0.092
Token suddenly
Token ID6451
Feature activation+8.01e-02

Pos logit contributions
 matter+0.004
 avail+0.004
 yours+0.004
epad+0.004
bet+0.004
Neg logit contributions
uddle-0.002
s-0.002
-0.002
udd-0.002
 big-0.002
Token could
Token ID714
Feature activation-4.37e-02

Pos logit contributions
uddle+0.114
s+0.101
udd+0.088
 She+0.078
 sad+0.077
Neg logit contributions
 matter-0.177
 yours-0.153
bet-0.152
 remind-0.151
 occasions-0.150
Token feel
Token ID1254
Feature activation+8.83e-02

Pos logit contributions
 matter+0.094
 yours+0.079
 remind+0.079
icians+0.079
bet+0.079
Neg logit contributions
uddle-0.065
s-0.054
udd-0.049
 She-0.044
-0.043
Token the
Token ID262
Feature activation+5.39e-02

Pos logit contributions
uddle+0.003
s+0.002
+0.002
+0.002
 She+0.002
Neg logit contributions
 matter-0.003
icians-0.002
illa-0.002
bet-0.002
pper-0.002
Token sun
Token ID4252
Feature activation+3.62e-02

Pos logit contributions
uddle+0.060
 big+0.045
s+0.043
 sad+0.041
 small+0.039
Neg logit contributions
 matter-0.130
illa-0.124
icians-0.122
pper-0.121
 yours-0.121
Token started
Token ID2067
Feature activation+1.63e-01

Pos logit contributions
uddle+0.050
s+0.047
udd+0.036
+0.035
+0.034
Neg logit contributions
 matter-0.082
 yours-0.075
icated-0.073
 remind-0.072
 occasions-0.072
Token to
Token ID284
Feature activation-2.45e-02

Pos logit contributions
uddle+0.152
s+0.129
udd+0.098
 She+0.085
+0.081
Neg logit contributions
 matter-0.433
icians-0.396
ocket-0.391
illa-0.391
 occasions-0.389
Token come
Token ID1282
Feature activation+9.13e-02

Pos logit contributions
 matter+0.040
ocket+0.036
icians+0.036
illa+0.035
 occasions+0.034
Neg logit contributions
uddle-0.045
s-0.040
udd-0.037
-0.036
-0.035
Token up
Token ID510
Feature activation+2.44e-01

Pos logit contributions
uddle+0.135
udd+0.098
s+0.096
 gym+0.083
+0.078
Neg logit contributions
 matter-0.207
 remind-0.176
 yours-0.174
 letting-0.173
 protection-0.169
Token.
Token ID13
Feature activation+3.76e-02

Pos logit contributions
uddle+0.346
s+0.254
udd+0.247
 gym+0.200
+0.196
Neg logit contributions
 matter-0.560
 remind-0.480
 letting-0.471
 yours-0.469
 avail-0.464
Token Then
Token ID3244
Feature activation-6.10e-03

Pos logit contributions
uddle+0.040
s+0.030
udd+0.026
+0.022
 She+0.021
Neg logit contributions
 matter-0.098
 remind-0.086
icians-0.085
 yours-0.085
illa-0.084
Token,
Token ID11
Feature activation+7.21e-02

Pos logit contributions
 matter+0.015
bet+0.013
illa+0.013
icians+0.013
ocket+0.013
Neg logit contributions
uddle-0.007
s-0.006
-0.005
-0.005
 She-0.005
Token it
Token ID340
Feature activation+9.62e-02

Pos logit contributions
uddle+0.101
s+0.092
+0.082
+0.078
 She+0.076
Neg logit contributions
 matter-0.155
 occasions-0.140
bet-0.140
icians-0.137
ocket-0.137
Token flew
Token ID13112
Feature activation+7.76e-02

Pos logit contributions
uddle+0.168
s+0.149
udd+0.132
+0.121
+0.119
Neg logit contributions
 matter-0.184
 yours-0.158
 avail-0.154
 occasions-0.154
bet-0.153
Token animals
Token ID4695
Feature activation+2.87e-02

Pos logit contributions
uddle+0.087
s+0.075
udd+0.061
+0.057
+0.056
Neg logit contributions
 matter-0.153
ocket-0.136
 avail-0.136
icians-0.135
bet-0.134
Token and
Token ID290
Feature activation-6.09e-02

Pos logit contributions
uddle+0.034
s+0.030
udd+0.023
+0.022
+0.021
Neg logit contributions
 matter-0.071
 avail-0.064
bet-0.063
illa-0.061
ocket-0.061
Token they
Token ID484
Feature activation+5.75e-02

Pos logit contributions
 matter+0.122
illa+0.117
ocket+0.113
pper+0.112
bet+0.112
Neg logit contributions
uddle-0.081
s-0.079
 sad-0.071
 big-0.069
 She-0.068
Token would
Token ID561
Feature activation+9.33e-03

Pos logit contributions
uddle+0.087
s+0.081
udd+0.068
 sad+0.066
 She+0.065
Neg logit contributions
 matter-0.117
 avail-0.104
bet-0.102
 occasions-0.100
illa-0.099
Token come
Token ID1282
Feature activation+9.47e-02

Pos logit contributions
uddle+0.014
s+0.012
+0.012
 She+0.012
 big+0.011
Neg logit contributions
 matter-0.018
ocket-0.016
icians-0.016
 avail-0.016
illa-0.016
Token back
Token ID736
Feature activation+2.44e-01

Pos logit contributions
uddle+0.105
s+0.080
udd+0.069
+0.057
 She+0.055
Neg logit contributions
 matter-0.242
 remind-0.211
icians-0.209
 yours-0.208
illa-0.207
Token soon
Token ID2582
Feature activation+1.47e-01

Pos logit contributions
uddle+0.313
s+0.262
udd+0.225
+0.202
+0.199
Neg logit contributions
 matter-0.575
icians-0.491
bet-0.490
 remind-0.488
ocket-0.486
Token.
Token ID13
Feature activation+3.03e-02

Pos logit contributions
uddle+0.107
s+0.092
udd+0.053
+0.051
 sad+0.047
Neg logit contributions
 matter-0.417
bet-0.385
illa-0.383
icians-0.381
icated-0.378
Token
Token ID198
Feature activation-2.01e-02

Pos logit contributions
uddle+0.022
s+0.015
udd+0.010
 She+0.009
+0.009
Neg logit contributions
 matter-0.089
 remind-0.079
icians-0.078
ocket-0.078
 avail-0.078
Token
Token ID198
Feature activation-2.85e-02

Pos logit contributions
 matter+0.053
 remind+0.047
 yours+0.046
icians+0.046
illa+0.046
Neg logit contributions
uddle-0.021
s-0.015
udd-0.013
-0.009
 She-0.009
TokenSam
Token ID16305
Feature activation-1.03e-01

Pos logit contributions
 matter+0.069
 avail+0.059
 remind+0.059
ocket+0.059
 yours+0.058
Neg logit contributions
uddle-0.035
s-0.028
udd-0.024
-0.023
-0.022
Token on
Token ID319
Feature activation-6.75e-02

Pos logit contributions
 matter+0.028
 occasions+0.024
epad+0.024
icated+0.024
weet+0.024
Neg logit contributions
uddle-0.013
s-0.013
-0.011
 back-0.010
 He-0.010
Token tight
Token ID5381
Feature activation-1.91e-02

Pos logit contributions
 matter+0.148
 remind+0.127
 yours+0.125
 letting+0.124
icians+0.123
Neg logit contributions
uddle-0.101
s-0.077
udd-0.075
-0.064
 She-0.060
Token as
Token ID355
Feature activation+7.12e-02

Pos logit contributions
 matter+0.053
 remind+0.047
illa+0.047
icians+0.046
icated+0.046
Neg logit contributions
uddle-0.017
s-0.014
udd-0.010
 She-0.008
-0.008
Token they
Token ID484
Feature activation+1.21e-01

Pos logit contributions
uddle+0.085
s+0.073
udd+0.058
 She+0.056
+0.056
Neg logit contributions
 matter-0.171
 occasions-0.149
 remind-0.148
bet-0.148
illa-0.147
Token soared
Token ID35292
Feature activation+1.04e-01

Pos logit contributions
uddle+0.183
s+0.176
udd+0.141
+0.132
+0.129
Neg logit contributions
 matter-0.257
 yours-0.227
 avail-0.224
 occasions-0.223
illa-0.218
Token up
Token ID510
Feature activation+2.43e-01

Pos logit contributions
uddle+0.151
s+0.113
udd+0.110
+0.094
 gym+0.091
Neg logit contributions
 matter-0.232
 remind-0.197
 yours-0.197
icians-0.194
 letting-0.194
Token into
Token ID656
Feature activation+7.46e-02

Pos logit contributions
uddle+0.341
s+0.254
udd+0.251
+0.199
 She+0.193
Neg logit contributions
 matter-0.554
 remind-0.473
 yours-0.470
 letting-0.470
icians-0.463
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
uddle+0.075
s+0.064
+0.059
 big+0.058
+0.055
Neg logit contributions
 matter-0.182
pper-0.169
illa-0.167
ocket-0.165
 remind-0.162
Token sky
Token ID6766
Feature activation+7.49e-02

Pos logit contributions
uddle+0.174
s+0.144
udd+0.136
+0.115
 She+0.113
Neg logit contributions
 matter-0.227
 remind-0.183
bons-0.183
 yours-0.181
 protection-0.178
Token.
Token ID13
Feature activation+3.45e-02

Pos logit contributions
uddle+0.064
s+0.055
udd+0.038
+0.031
 She+0.030
Neg logit contributions
 matter-0.210
 remind-0.185
ocket-0.183
icians-0.183
 yours-0.183
Token Ho
Token ID9544
Feature activation-8.81e-02

Pos logit contributions
uddle+0.046
udd+0.033
s+0.032
 sad+0.025
 gym+0.024
Neg logit contributions
 matter-0.076
 remind-0.069
 letting-0.068
bet-0.068
icians-0.068
Token as
Token ID355
Feature activation+7.69e-02

Pos logit contributions
uddle+0.019
s+0.016
udd+0.012
+0.009
 She+0.009
Neg logit contributions
 matter-0.065
icians-0.058
 yours-0.057
 remind-0.057
 avail-0.057
Token the
Token ID262
Feature activation+9.84e-02

Pos logit contributions
uddle+0.078
s+0.078
 big+0.070
 well+0.069
+0.064
Neg logit contributions
 matter-0.177
bet-0.171
 occasions-0.168
pper-0.166
ocket-0.164
Token yacht
Token ID43207
Feature activation+4.70e-02

Pos logit contributions
uddle+0.132
s+0.105
 big+0.103
 sad+0.097
 back+0.094
Neg logit contributions
 matter-0.213
illa-0.203
icians-0.197
pper-0.196
ocket-0.192
Token bob
Token ID29202
Feature activation+5.28e-02

Pos logit contributions
uddle+0.068
s+0.063
udd+0.050
+0.046
+0.045
Neg logit contributions
 matter-0.104
 yours-0.092
 avail-0.091
icated-0.090
bet-0.089
Tokenbed
Token ID3077
Feature activation+6.56e-02

Pos logit contributions
uddle+0.109
udd+0.090
s+0.088
 gym+0.080
 She+0.079
Neg logit contributions
 matter-0.083
pper-0.070
bet-0.069
illa-0.069
 remind-0.068
Token up
Token ID510
Feature activation+2.43e-01

Pos logit contributions
uddle+0.076
s+0.066
udd+0.051
 She+0.045
+0.044
Neg logit contributions
 matter-0.163
icians-0.142
 remind-0.141
illa-0.141
 avail-0.140
Token and
Token ID290
Feature activation-6.56e-03

Pos logit contributions
uddle+0.280
s+0.232
udd+0.184
 She+0.166
+0.164
Neg logit contributions
 matter-0.605
icians-0.530
illa-0.528
 remind-0.522
bet-0.522
Token down
Token ID866
Feature activation+1.59e-01

Pos logit contributions
 matter+0.019
 occasions+0.017
ocket+0.017
illa+0.017
icians+0.017
Neg logit contributions
uddle-0.005
s-0.004
udd-0.003
-0.002
 She-0.002
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
uddle+0.194
s+0.154
udd+0.133
+0.115
 She+0.112
Neg logit contributions
 matter-0.389
 remind-0.337
icians-0.334
illa-0.333
 yours-0.333
Token
Token ID198
Feature activation-9.57e-03

Pos logit contributions
uddle+0.030
s+0.021
udd+0.018
+0.012
 sad+0.011
Neg logit contributions
 matter-0.088
 remind-0.078
 yours-0.077
icians-0.077
 letting-0.077
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
 matter+0.028
 remind+0.025
icians+0.025
illa+0.025
 letting+0.024
Neg logit contributions
uddle-0.007
s-0.004
udd-0.003
 sad-0.002
 She-0.001
Token it
Token ID340
Feature activation+8.85e-02

Pos logit contributions
 matter+0.319
bet+0.285
 occasions+0.282
icians+0.281
illa+0.280
Neg logit contributions
uddle-0.199
s-0.184
 sad-0.143
udd-0.139
-0.138
Token still
Token ID991
Feature activation-1.53e-02

Pos logit contributions
uddle+0.125
s+0.107
udd+0.089
 sad+0.081
 She+0.079
Neg logit contributions
 matter-0.194
icians-0.177
 yours-0.174
ocket-0.174
illa-0.173
Token wouldn
Token ID3636
Feature activation-1.13e-03

Pos logit contributions
 matter+0.027
icians+0.023
bet+0.023
illa+0.023
 occasions+0.022
Neg logit contributions
uddle-0.029
s-0.026
udd-0.023
 sad-0.021
 She-0.021
Token't
Token ID470
Feature activation-1.91e-02

Pos logit contributions
 matter+0.002
 remind+0.002
 letting+0.001
pper+0.001
 yours+0.001
Neg logit contributions
uddle-0.002
udd-0.002
s-0.002
 gym-0.002
-0.002
Token stay
Token ID2652
Feature activation+9.18e-02

Pos logit contributions
 matter+0.035
icians+0.029
illa+0.029
bet+0.029
 remind+0.029
Neg logit contributions
uddle-0.035
s-0.030
udd-0.027
 She-0.025
-0.025
Token up
Token ID510
Feature activation+2.43e-01

Pos logit contributions
uddle+0.119
s+0.095
udd+0.084
+0.071
 sad+0.070
Neg logit contributions
 matter-0.216
 remind-0.187
icians-0.187
illa-0.185
bet-0.185
Token.
Token ID13
Feature activation+4.30e-02

Pos logit contributions
uddle+0.218
s+0.172
udd+0.130
 sad+0.116
 She+0.112
Neg logit contributions
 matter-0.649
illa-0.588
bet-0.588
icians-0.584
ocket-0.578
Token
Token ID220
Feature activation+3.89e-02

Pos logit contributions
uddle+0.054
udd+0.040
s+0.031
 sad+0.028
obb+0.028
Neg logit contributions
 matter-0.098
 sizes-0.085
bons-0.085
 letting-0.085
 occasions-0.084
Token
Token ID198
Feature activation-4.74e-04

Pos logit contributions
uddle+0.041
udd+0.025
s+0.022
 sad+0.018
 gym+0.018
Neg logit contributions
 matter-0.091
 yours-0.085
illa-0.085
 letting-0.084
fulness-0.084
Token
Token ID198
Feature activation-8.63e-03

Pos logit contributions
 matter+0.001
 remind+0.001
illa+0.001
pa+0.001
icians+0.001
Neg logit contributions
uddle-0.001
udd-0.000
s-0.000
 gym-0.000
 sad-0.000
TokenOne
Token ID3198
Feature activation-4.61e-02

Pos logit contributions
 matter+0.017
 remind+0.014
icians+0.013
 letting+0.013
 occasions+0.013
Neg logit contributions
uddle-0.016
udd-0.012
s-0.012
-0.010
 sad-0.010
Token next
Token ID1306
Feature activation+5.24e-02

Pos logit contributions
uddle+0.127
 big+0.098
s+0.093
 gym+0.092
+0.087
Neg logit contributions
 matter-0.208
isdom-0.197
igh-0.197
icians-0.196
ocket-0.196
Token time
Token ID640
Feature activation+1.91e-02

Pos logit contributions
uddle+0.096
s+0.079
 big+0.075
+0.075
+0.074
Neg logit contributions
 matter-0.090
ocket-0.082
icians-0.082
pper-0.080
 yours-0.080
Token he
Token ID339
Feature activation+1.40e-01

Pos logit contributions
uddle+0.022
s+0.018
udd+0.015
+0.013
 She+0.013
Neg logit contributions
 matter-0.048
 remind-0.042
bet-0.041
 yours-0.041
illa-0.041
Token could
Token ID714
Feature activation-1.72e-02

Pos logit contributions
uddle+0.202
s+0.179
udd+0.153
+0.150
+0.144
Neg logit contributions
 matter-0.313
icated-0.270
 avail-0.270
 yours-0.269
bet-0.264
Token come
Token ID1282
Feature activation+9.66e-02

Pos logit contributions
 matter+0.037
icians+0.031
icated+0.031
 yours+0.031
bet+0.031
Neg logit contributions
uddle-0.026
s-0.022
udd-0.019
 She-0.018
-0.018
Token back
Token ID736
Feature activation+2.43e-01

Pos logit contributions
uddle+0.106
s+0.070
udd+0.068
+0.047
 gym+0.047
Neg logit contributions
 matter-0.247
 remind-0.224
 yours-0.222
 letting-0.220
illa-0.216
Token to
Token ID284
Feature activation-4.18e-03

Pos logit contributions
uddle+0.299
s+0.233
udd+0.206
+0.175
 She+0.170
Neg logit contributions
 matter-0.594
 remind-0.514
icians-0.508
 yours-0.506
bet-0.504
Token play
Token ID711
Feature activation+8.63e-02

Pos logit contributions
 matter+0.010
pper+0.009
icians+0.009
illa+0.008
ocket+0.008
Neg logit contributions
uddle-0.005
s-0.004
-0.004
-0.004
 She-0.004
Token.
Token ID13
Feature activation+4.41e-02

Pos logit contributions
uddle+0.101
s+0.077
udd+0.067
+0.058
 She+0.056
Neg logit contributions
 matter-0.216
 remind-0.186
icians-0.185
illa-0.185
 yours-0.184
Token<|endoftext|>
Token ID50256
Feature activation-2.27e-02

Pos logit contributions
uddle+0.036
s+0.022
udd+0.019
 sad+0.010
+0.010
Neg logit contributions
 matter-0.125
 remind-0.111
 yours-0.110
 letting-0.110
illa-0.110
TokenOnce
Token ID7454
Feature activation-6.29e-02

Pos logit contributions
 matter+0.060
 letting+0.055
igh+0.055
illa+0.054
icians+0.054
Neg logit contributions
uddle-0.020
udd-0.012
s-0.011
 sad-0.008
eeper-0.007
Token and
Token ID290
Feature activation-4.70e-02

Pos logit contributions
 matter+0.054
 remind+0.046
 yours+0.045
 letting+0.044
 lending+0.043
Neg logit contributions
uddle-0.039
udd-0.030
s-0.028
-0.026
 She-0.023
Token watch
Token ID2342
Feature activation+1.16e-02

Pos logit contributions
 matter+0.088
illa+0.087
ocket+0.084
igh+0.081
pper+0.080
Neg logit contributions
uddle-0.076
s-0.069
 sad-0.060
 big-0.057
 He-0.057
Token it
Token ID340
Feature activation+1.51e-01

Pos logit contributions
uddle+0.022
s+0.019
udd+0.018
 sad+0.016
+0.016
Neg logit contributions
 matter-0.020
illa-0.017
 remind-0.017
ocket-0.017
icians-0.017
Token soar
Token ID48701
Feature activation+9.04e-02

Pos logit contributions
uddle+0.281
udd+0.228
s+0.224
+0.203
 She+0.192
Neg logit contributions
 matter-0.281
 remind-0.233
 letting-0.225
 yours-0.223
icated-0.213
Token high
Token ID1029
Feature activation+1.03e-01

Pos logit contributions
uddle+0.134
udd+0.098
s+0.095
+0.082
 She+0.080
Neg logit contributions
 matter-0.204
 yours-0.173
 remind-0.168
 letting-0.166
icated-0.163
Token up
Token ID510
Feature activation+2.43e-01

Pos logit contributions
uddle+0.175
udd+0.133
s+0.125
+0.109
 She+0.109
Neg logit contributions
 matter-0.202
 yours-0.172
 letting-0.170
 remind-0.165
bons-0.161
Token in
Token ID287
Feature activation+2.44e-02

Pos logit contributions
uddle+0.410
udd+0.324
s+0.296
 gym+0.238
uddled+0.237
Neg logit contributions
 matter-0.506
 letting-0.423
 yours-0.414
 avail-0.407
 remind-0.404
Token the
Token ID262
Feature activation+1.05e-01

Pos logit contributions
uddle+0.030
s+0.025
+0.022
+0.022
 big+0.021
Neg logit contributions
 matter-0.056
ocket-0.051
pper-0.050
icians-0.050
illa-0.050
Token sky
Token ID6766
Feature activation+5.10e-02

Pos logit contributions
uddle+0.205
uddled+0.185
s+0.180
 nicely+0.175
eeper+0.171
Neg logit contributions
 matter-0.147
bons-0.111
weet-0.107
icated-0.101
 colours-0.101
Token.
Token ID13
Feature activation+3.23e-02

Pos logit contributions
uddle+0.053
s+0.040
udd+0.033
+0.027
 She+0.026
Neg logit contributions
 matter-0.134
 remind-0.118
icians-0.117
 yours-0.116
bet-0.115
Token
Token ID220
Feature activation+3.91e-02

Pos logit contributions
uddle+0.042
udd+0.030
s+0.028
 sad+0.021
obb+0.020
Neg logit contributions
 matter-0.076
 yours-0.066
 remind-0.066
icated-0.065
 letting-0.065
Token and
Token ID290
Feature activation-4.50e-02

Pos logit contributions
uddle+0.159
s+0.132
udd+0.113
 She+0.100
+0.098
Neg logit contributions
 matter-0.346
bet-0.303
 yours-0.302
icated-0.301
icians-0.300
Token promised
Token ID8072
Feature activation-9.73e-03

Pos logit contributions
 matter+0.088
icians+0.086
pper+0.086
igh+0.085
bet+0.084
Neg logit contributions
uddle-0.061
s-0.059
 She-0.053
 sad-0.052
-0.051
Token they
Token ID484
Feature activation+9.67e-02

Pos logit contributions
 matter+0.023
icians+0.020
 remind+0.020
igh+0.020
ocket+0.020
Neg logit contributions
uddle-0.012
s-0.010
udd-0.009
 She-0.008
-0.007
Token would
Token ID561
Feature activation+2.26e-02

Pos logit contributions
uddle+0.151
s+0.147
udd+0.120
 She+0.119
+0.115
Neg logit contributions
 matter-0.191
bet-0.175
 avail-0.173
icians-0.172
pper-0.165
Token come
Token ID1282
Feature activation+9.95e-02

Pos logit contributions
uddle+0.035
+0.031
s+0.030
 She+0.030
udd+0.029
Neg logit contributions
 matter-0.043
icians-0.039
icated-0.038
 avail-0.037
bet-0.037
Token back
Token ID736
Feature activation+2.43e-01

Pos logit contributions
uddle+0.089
s+0.059
udd+0.051
+0.036
 She+0.032
Neg logit contributions
 matter-0.276
 remind-0.248
 yours-0.243
icians-0.242
 letting-0.242
Token to
Token ID284
Feature activation-4.88e-03

Pos logit contributions
uddle+0.368
s+0.309
udd+0.279
+0.250
 She+0.247
Neg logit contributions
 matter-0.518
icians-0.436
 remind-0.434
bet-0.431
 yours-0.430
Token the
Token ID262
Feature activation+1.11e-01

Pos logit contributions
 matter+0.011
pper+0.010
icians+0.010
 sizes+0.010
bet+0.010
Neg logit contributions
uddle-0.005
-0.005
s-0.005
 She-0.004
 big-0.004
Token dentist
Token ID38408
Feature activation+5.88e-02

Pos logit contributions
uddle+0.186
s+0.156
udd+0.143
+0.125
 She+0.125
Neg logit contributions
 matter-0.226
 remind-0.189
pa-0.184
 yours-0.183
 letting-0.183
Token soon
Token ID2582
Feature activation+1.26e-01

Pos logit contributions
uddle+0.054
s+0.045
udd+0.034
+0.030
 She+0.029
Neg logit contributions
 matter-0.162
bet-0.141
 avail-0.140
icated-0.139
 yours-0.139
Token.
Token ID13
Feature activation+4.32e-02

Pos logit contributions
uddle+0.092
s+0.082
udd+0.054
+0.048
 She+0.046
Neg logit contributions
 matter-0.359
icated-0.329
bet-0.328
icians-0.328
illa-0.324
Token the
Token ID262
Feature activation+9.33e-02

Pos logit contributions
s+0.022
uddle+0.021
+0.021
 big+0.021
 back+0.021
Neg logit contributions
 matter-0.076
bet-0.075
icians-0.075
pper-0.074
igh-0.074
Token storm
Token ID6388
Feature activation+7.92e-02

Pos logit contributions
uddle+0.129
s+0.095
 sad+0.094
 big+0.093
udd+0.088
Neg logit contributions
 matter-0.201
icians-0.193
illa-0.191
pper-0.189
ocket-0.186
Token wouldn
Token ID3636
Feature activation+3.31e-02

Pos logit contributions
s+0.117
uddle+0.109
 back+0.088
+0.085
 sad+0.083
Neg logit contributions
 matter-0.158
 avail-0.154
icated-0.153
ocket-0.150
isdom-0.149
Token't
Token ID470
Feature activation+3.74e-03

Pos logit contributions
uddle+0.061
udd+0.048
s+0.047
 gym+0.041
+0.040
Neg logit contributions
 matter-0.062
 remind-0.053
 letting-0.051
igh-0.050
 yours-0.050
Token come
Token ID1282
Feature activation+1.07e-01

Pos logit contributions
uddle+0.007
s+0.006
udd+0.005
 sad+0.005
+0.005
Neg logit contributions
 matter-0.006
icians-0.006
ocket-0.006
bet-0.006
illa-0.006
Token back
Token ID736
Feature activation+2.42e-01

Pos logit contributions
uddle+0.119
s+0.088
udd+0.078
+0.063
 She+0.061
Neg logit contributions
 matter-0.273
 remind-0.239
icians-0.235
 yours-0.235
illa-0.233
Token again
Token ID757
Feature activation+2.25e-01

Pos logit contributions
uddle+0.236
s+0.188
udd+0.148
+0.126
 She+0.125
Neg logit contributions
 matter-0.647
icians-0.567
bet-0.564
 remind-0.562
ocket-0.557
Token.
Token ID13
Feature activation+5.23e-02

Pos logit contributions
uddle+0.132
s+0.121
+0.080
udd+0.062
 She+0.062
Neg logit contributions
 matter-0.647
bet-0.598
icians-0.596
icated-0.591
illa-0.591
Token<|endoftext|>
Token ID50256
Feature activation-2.73e-02

Pos logit contributions
uddle+0.030
s+0.016
udd+0.010
+0.005
 She+0.005
Neg logit contributions
 matter-0.163
 remind-0.146
icians-0.145
 yours-0.144
bet-0.143
TokenOnce
Token ID7454
Feature activation-6.26e-02

Pos logit contributions
 matter+0.072
 letting+0.065
igh+0.065
illa+0.064
icians+0.064
Neg logit contributions
uddle-0.025
udd-0.016
s-0.014
 sad-0.010
eeper-0.009
Token upon
Token ID2402
Feature activation-4.73e-02

Pos logit contributions
 matter+0.121
pa+0.107
 yours+0.104
fulness+0.099
igh+0.097
Neg logit contributions
uddle-0.100
udd-0.078
ashes-0.071
eeper-0.070
 gym-0.066
Token good
Token ID922
Feature activation+3.23e-02

Pos logit contributions
uddle+0.012
s+0.010
udd+0.009
+0.008
 She+0.008
Neg logit contributions
 matter-0.017
icians-0.015
 remind-0.015
 yours-0.015
pa-0.015
Token boy
Token ID2933
Feature activation+3.90e-02

Pos logit contributions
uddle+0.053
s+0.042
udd+0.040
 sad+0.039
 big+0.037
Neg logit contributions
 matter-0.062
icians-0.060
icated-0.056
illa-0.055
pper-0.055
Token they
Token ID484
Feature activation+9.34e-02

Pos logit contributions
s+0.028
uddle+0.025
+0.018
,+0.017
 She+0.017
Neg logit contributions
 matter-0.112
 avail-0.102
icians-0.101
bet-0.100
icated-0.100
Token would
Token ID561
Feature activation+2.61e-02

Pos logit contributions
s+0.122
uddle+0.121
 sad+0.098
+0.096
 back+0.094
Neg logit contributions
 matter-0.203
 avail-0.197
bet-0.192
icians-0.188
 occasions-0.186
Token come
Token ID1282
Feature activation+9.66e-02

Pos logit contributions
uddle+0.039
+0.034
s+0.034
 She+0.032
,+0.032
Neg logit contributions
 matter-0.050
icians-0.047
icated-0.046
 avail-0.044
ocket-0.044
Token back
Token ID736
Feature activation+2.42e-01

Pos logit contributions
uddle+0.100
s+0.078
udd+0.063
+0.054
+0.053
Neg logit contributions
 matter-0.254
icians-0.219
icated-0.217
 avail-0.217
 occasions-0.217
Token another
Token ID1194
Feature activation+8.40e-02

Pos logit contributions
uddle+0.330
s+0.273
udd+0.240
+0.215
+0.210
Neg logit contributions
 matter-0.554
icians-0.472
 remind-0.468
bet-0.467
 yours-0.462
Token day
Token ID1110
Feature activation+1.20e-01

Pos logit contributions
uddle+0.135
s+0.113
udd+0.101
+0.099
 sad+0.098
Neg logit contributions
 matter-0.156
icians-0.152
ocket-0.146
bet-0.143
igh-0.143
Token and
Token ID290
Feature activation-3.13e-02

Pos logit contributions
uddle+0.110
s+0.094
udd+0.068
+0.064
+0.062
Neg logit contributions
 matter-0.327
bet-0.290
illa-0.283
 avail-0.282
icians-0.282
Token see
Token ID766
Feature activation+1.42e-02

Pos logit contributions
 matter+0.069
icians+0.063
bet+0.061
igh+0.060
illa+0.060
Neg logit contributions
uddle-0.043
s-0.038
-0.032
 She-0.031
 He-0.031
Token what
Token ID644
Feature activation+2.92e-02

Pos logit contributions
uddle+0.019
s+0.016
 nicely+0.014
+0.014
udd+0.014
Neg logit contributions
 matter-0.032
pper-0.028
icians-0.028
bet-0.028
epad-0.028
Token up
Token ID510
Feature activation+1.89e-01

Pos logit contributions
uddle+0.074
s+0.060
udd+0.055
+0.048
 She+0.047
Neg logit contributions
 matter-0.111
 remind-0.095
 yours-0.093
icians-0.093
 letting-0.092
Token in
Token ID287
Feature activation-2.13e-02

Pos logit contributions
uddle+0.201
s+0.154
udd+0.128
+0.109
 She+0.105
Neg logit contributions
 matter-0.490
bet-0.429
icians-0.428
 remind-0.426
illa-0.425
Token the
Token ID262
Feature activation+6.61e-02

Pos logit contributions
 matter+0.053
icated+0.050
epad+0.049
bet+0.049
pper+0.049
Neg logit contributions
uddle-0.019
 big-0.018
,-0.018
s-0.017
-0.016
Token branches
Token ID13737
Feature activation-2.94e-02

Pos logit contributions
uddle+0.112
s+0.093
udd+0.087
+0.076
 She+0.075
Neg logit contributions
 matter-0.132
 remind-0.110
 yours-0.108
icians-0.107
pa-0.107
Token farther
Token ID18485
Feature activation+1.02e-01

Pos logit contributions
 matter+0.078
bet+0.069
icians+0.068
 remind+0.068
 avail+0.067
Neg logit contributions
uddle-0.031
s-0.024
udd-0.019
-0.016
 She-0.016
Token up
Token ID510
Feature activation+2.42e-01

Pos logit contributions
uddle+0.103
s+0.094
udd+0.069
+0.058
 She+0.057
Neg logit contributions
 matter-0.262
bet-0.240
icians-0.238
 lending-0.236
illa-0.232
Token you
Token ID345
Feature activation+5.05e-02

Pos logit contributions
uddle+0.303
s+0.247
udd+0.209
+0.194
 She+0.188
Neg logit contributions
 matter-0.576
bet-0.504
icians-0.501
illa-0.500
 remind-0.499
Token will
Token ID481
Feature activation+3.04e-04

Pos logit contributions
uddle+0.075
s+0.062
udd+0.056
+0.050
 She+0.048
Neg logit contributions
 matter-0.111
 remind-0.094
icians-0.092
bet-0.092
 avail-0.092
Token find
Token ID1064
Feature activation+2.06e-02

Pos logit contributions
uddle+0.000
s+0.000
+0.000
 big+0.000
 She+0.000
Neg logit contributions
 matter-0.001
bet-0.001
icians-0.001
pa-0.001
 avail-0.001
Token a
Token ID257
Feature activation-1.80e-02

Pos logit contributions
uddle+0.026
s+0.023
+0.020
+0.019
 back+0.019
Neg logit contributions
 matter-0.047
epad-0.041
icians-0.040
bet-0.040
icated-0.040
Token banana
Token ID25996
Feature activation-2.08e-02

Pos logit contributions
 matter+0.035
icians+0.031
pa+0.030
bet+0.030
igh+0.030
Neg logit contributions
uddle-0.030
s-0.025
udd-0.022
 big-0.021
-0.021
Token to
Token ID284
Feature activation-5.84e-02

Pos logit contributions
uddle+0.030
s+0.025
+0.018
 She+0.018
 He+0.017
Neg logit contributions
 matter-0.094
icians-0.084
bet-0.084
pper-0.084
igh-0.083
Token see
Token ID766
Feature activation+2.12e-02

Pos logit contributions
 matter+0.110
 remind+0.091
 yours+0.089
icians+0.088
 letting+0.088
Neg logit contributions
uddle-0.107
s-0.089
udd-0.084
-0.075
 She-0.073
Token the
Token ID262
Feature activation+8.06e-02

Pos logit contributions
uddle+0.020
s+0.019
+0.018
 big+0.018
 new+0.017
Neg logit contributions
 matter-0.054
bek-0.050
pper-0.049
epad-0.048
 occasions-0.048
Token sun
Token ID4252
Feature activation+4.88e-02

Pos logit contributions
uddle+0.101
s+0.078
udd+0.068
+0.059
 She+0.056
Neg logit contributions
 matter-0.195
icians-0.172
 remind-0.172
illa-0.169
pa-0.169
Token come
Token ID1282
Feature activation+9.30e-02

Pos logit contributions
uddle+0.060
s+0.059
 She+0.042
 He+0.041
+0.040
Neg logit contributions
 matter-0.116
icians-0.105
ocket-0.103
 occasions-0.102
bet-0.101
Token up
Token ID510
Feature activation+2.42e-01

Pos logit contributions
uddle+0.127
s+0.091
udd+0.091
 gym+0.072
+0.072
Neg logit contributions
 matter-0.217
 remind-0.188
 yours-0.186
 letting-0.184
igh-0.182
Token each
Token ID1123
Feature activation+7.47e-02

Pos logit contributions
uddle+0.357
s+0.266
udd+0.261
 gym+0.211
+0.209
Neg logit contributions
 matter-0.539
 remind-0.460
 yours-0.455
 letting-0.454
 avail-0.442
Token morning
Token ID3329
Feature activation+1.65e-01

Pos logit contributions
uddle+0.101
s+0.098
+0.081
+0.081
 big+0.079
Neg logit contributions
 matter-0.157
icians-0.152
isdom-0.141
epad-0.140
 avail-0.140
Token.
Token ID13
Feature activation+3.55e-02

Pos logit contributions
uddle+0.142
s+0.117
udd+0.077
 She+0.073
+0.072
Neg logit contributions
 matter-0.455
icians-0.412
illa-0.405
bet-0.402
 remind-0.402
Token Every
Token ID3887
Feature activation-6.44e-02

Pos logit contributions
uddle+0.048
udd+0.035
s+0.032
uddled+0.025
obb+0.023
Neg logit contributions
 matter-0.082
 yours-0.072
 remind-0.072
 letting-0.072
icated-0.071
Token day
Token ID1110
Feature activation+5.01e-02

Pos logit contributions
 matter+0.125
icians+0.115
 remind+0.108
epad+0.108
ocket+0.107
Neg logit contributions
uddle-0.109
s-0.093
udd-0.081
-0.080
-0.079
Token could
Token ID714
Feature activation-3.46e-02

Pos logit contributions
uddle+0.163
s+0.145
udd+0.123
 sad+0.111
+0.107
Neg logit contributions
 matter-0.234
 avail-0.208
 yours-0.206
icated-0.201
 lending-0.200
Token feel
Token ID1254
Feature activation+1.03e-01

Pos logit contributions
 matter+0.070
icians+0.059
bet+0.058
icated+0.058
 yours+0.058
Neg logit contributions
uddle-0.057
s-0.049
udd-0.043
 She-0.041
-0.040
Token the
Token ID262
Feature activation+1.25e-01

Pos logit contributions
uddle+0.150
s+0.123
udd+0.111
+0.099
 She+0.097
Neg logit contributions
 matter-0.225
icians-0.191
illa-0.191
 remind-0.190
bet-0.189
Token joy
Token ID8716
Feature activation+6.57e-03

Pos logit contributions
uddle+0.189
s+0.144
udd+0.139
 big+0.137
 sad+0.135
Neg logit contributions
 matter-0.254
icians-0.238
illa-0.237
isdom-0.233
igh-0.230
Token rising
Token ID7396
Feature activation+1.23e-01

Pos logit contributions
uddle+0.007
s+0.007
 She+0.005
udd+0.005
 sad+0.005
Neg logit contributions
 matter-0.016
icians-0.015
bet-0.014
illa-0.014
isdom-0.014
Token up
Token ID510
Feature activation+2.41e-01

Pos logit contributions
uddle+0.139
s+0.118
udd+0.092
 big+0.082
 She+0.082
Neg logit contributions
 matter-0.303
icians-0.277
illa-0.275
ocket-0.272
igh-0.270
Token in
Token ID287
Feature activation+2.70e-02

Pos logit contributions
uddle+0.256
s+0.217
udd+0.165
 She+0.159
+0.154
Neg logit contributions
 matter-0.607
icians-0.547
bet-0.544
illa-0.541
ocket-0.537
Token her
Token ID607
Feature activation+8.85e-02

Pos logit contributions
+0.019
 big+0.019
 She+0.017
,+0.016
 small+0.015
Neg logit contributions
pper-0.072
icians-0.070
ocket-0.069
illa-0.068
ems-0.068
Token heart
Token ID2612
Feature activation+1.57e-02

Pos logit contributions
uddle+0.152
s+0.125
udd+0.114
 sad+0.101
+0.101
Neg logit contributions
 matter-0.180
icians-0.153
 remind-0.150
illa-0.150
 letting-0.149
Token.
Token ID13
Feature activation+3.68e-02

Pos logit contributions
uddle+0.015
s+0.012
udd+0.009
 She+0.007
+0.007
Neg logit contributions
 matter-0.043
icians-0.038
bet-0.038
 remind-0.038
 yours-0.038
Token
Token ID198
Feature activation-9.19e-03

Pos logit contributions
uddle+0.032
s+0.021
udd+0.018
+0.012
 She+0.011
Neg logit contributions
 matter-0.103
 remind-0.091
 yours-0.090
icians-0.090
illa-0.090
Token and
Token ID290
Feature activation-5.53e-02

Pos logit contributions
uddle+0.049
s+0.039
udd+0.032
 gym+0.026
+0.026
Neg logit contributions
 matter-0.141
icians-0.124
bet-0.123
 avail-0.123
icated-0.122
Token promised
Token ID8072
Feature activation-1.95e-02

Pos logit contributions
 matter+0.125
icians+0.108
ities+0.107
ocket+0.107
 avail+0.106
Neg logit contributions
uddle-0.075
s-0.068
-0.058
udd-0.057
 back-0.055
Token he
Token ID339
Feature activation+1.32e-01

Pos logit contributions
 matter+0.048
 remind+0.042
icians+0.042
 yours+0.041
igh+0.041
Neg logit contributions
uddle-0.023
s-0.018
udd-0.016
 She-0.013
-0.013
Token would
Token ID561
Feature activation+1.78e-02

Pos logit contributions
uddle+0.216
s+0.208
udd+0.172
 She+0.166
+0.164
Neg logit contributions
 matter-0.257
 avail-0.225
bet-0.224
icians-0.224
 occasions-0.217
Token come
Token ID1282
Feature activation+9.74e-02

Pos logit contributions
uddle+0.028
+0.024
s+0.024
 She+0.023
udd+0.023
Neg logit contributions
 matter-0.035
icians-0.031
ocket-0.030
 avail-0.030
icated-0.030
Token back
Token ID736
Feature activation+2.41e-01

Pos logit contributions
uddle+0.089
s+0.057
udd+0.052
+0.036
 gym+0.032
Neg logit contributions
 matter-0.266
 remind-0.242
 yours-0.238
 letting-0.237
icians-0.234
Token soon
Token ID2582
Feature activation+1.50e-01

Pos logit contributions
uddle+0.334
s+0.270
udd+0.243
+0.212
 She+0.206
Neg logit contributions
 matter-0.552
 remind-0.470
icians-0.467
 yours-0.463
bet-0.462
Token.
Token ID13
Feature activation+5.58e-02

Pos logit contributions
uddle+0.125
s+0.102
udd+0.074
+0.063
 She+0.058
Neg logit contributions
 matter-0.417
bet-0.374
icians-0.373
illa-0.372
icated-0.368
Token<|endoftext|>
Token ID50256
Feature activation-2.36e-02

Pos logit contributions
uddle+0.066
udd+0.044
s+0.043
 sad+0.032
obb+0.031
Neg logit contributions
 matter-0.131
 letting-0.120
 yours-0.119
 lending-0.119
 remind-0.118
TokenOne
Token ID3198
Feature activation-3.95e-02

Pos logit contributions
 matter+0.061
 letting+0.055
igh+0.055
illa+0.055
pa+0.055
Neg logit contributions
uddle-0.022
udd-0.014
s-0.012
 sad-0.010
eeper-0.009
Token day
Token ID1110
Feature activation+3.23e-02

Pos logit contributions
 matter+0.071
icians+0.062
 remind+0.060
bet+0.059
illa+0.059
Neg logit contributions
uddle-0.073
s-0.063
udd-0.057
-0.053
-0.052
Token"
Token ID1
Feature activation-5.52e-02

Pos logit contributions
 matter+0.020
 remind+0.018
 letting+0.017
icians+0.017
 lending+0.017
Neg logit contributions
uddle-0.011
udd-0.008
s-0.008
 sad-0.006
 gym-0.006
TokenL
Token ID43
Feature activation-1.33e-01

Pos logit contributions
 matter+0.120
 yours+0.105
ocket+0.104
icians+0.103
 remind+0.103
Neg logit contributions
uddle-0.077
s-0.067
-0.057
udd-0.056
-0.055
Tokenem
Token ID368
Feature activation-1.82e-01

Pos logit contributions
 matter+0.272
 remind+0.229
icians+0.225
 yours+0.224
 letting+0.222
Neg logit contributions
uddle-0.217
s-0.179
udd-0.166
-0.148
 She-0.145
Tokenons
Token ID684
Feature activation-1.33e-01

Pos logit contributions
 matter+0.478
 occasions+0.411
 protection+0.411
 remind+0.410
bet+0.410
Neg logit contributions
uddle-0.199
s-0.166
udd-0.127
-0.120
-0.105
Token!
Token ID0
Feature activation-1.40e-02

Pos logit contributions
 matter+0.283
bet+0.250
 avail+0.247
 occasions+0.246
icians+0.245
Neg logit contributions
uddle-0.206
s-0.179
udd-0.154
-0.142
 sad-0.139
Token Lem
Token ID20607
Feature activation-2.77e-01

Pos logit contributions
 matter+0.029
 remind+0.025
 yours+0.024
illa+0.024
 letting+0.024
Neg logit contributions
uddle-0.022
s-0.018
udd-0.017
-0.014
 sad-0.014
Tokenons
Token ID684
Feature activation-1.41e-01

Pos logit contributions
 matter+0.350
 avail+0.258
bet+0.252
epad+0.252
 protection+0.246
Neg logit contributions
uddle-0.687
s-0.638
udd-0.598
-0.566
-0.536
Token!
Token ID0
Feature activation-2.12e-02

Pos logit contributions
 matter+0.316
 avail+0.294
bet+0.288
icians+0.281
 occasions+0.279
Neg logit contributions
uddle-0.196
s-0.184
udd-0.146
 big-0.131
 She-0.131
Token Beautiful
Token ID23762
Feature activation-1.21e-01

Pos logit contributions
 matter+0.047
 remind+0.040
icians+0.039
 avail+0.039
pa+0.039
Neg logit contributions
uddle-0.031
s-0.025
udd-0.023
 She-0.021
-0.021
Token juicy
Token ID35075
Feature activation-1.09e-01

Pos logit contributions
 matter+0.247
epad+0.222
ocket+0.221
 avail+0.219
pper+0.213
Neg logit contributions
uddle-0.197
s-0.181
udd-0.151
 big-0.142
-0.139
Token le
Token ID443
Feature activation-6.70e-02

Pos logit contributions
 matter+0.205
icians+0.177
 remind+0.176
epad+0.176
ocket+0.173
Neg logit contributions
uddle-0.199
s-0.177
udd-0.150
-0.143
 big-0.140
Token and
Token ID290
Feature activation-3.44e-02

Pos logit contributions
uddle+0.043
s+0.037
udd+0.028
 She+0.023
+0.021
Neg logit contributions
 matter-0.138
bet-0.124
icians-0.124
 avail-0.124
illa-0.123
Token y
Token ID331
Feature activation-9.05e-02

Pos logit contributions
 matter+0.082
icians+0.079
illa+0.079
igh+0.078
bet+0.077
Neg logit contributions
uddle-0.038
s-0.034
 sad-0.029
 big-0.027
udd-0.026
Tokenucky
Token ID5309
Feature activation+4.12e-02

Pos logit contributions
 matter+0.161
 yours+0.141
 avail+0.135
ities+0.134
pa+0.134
Neg logit contributions
uddle-0.147
s-0.120
 sad-0.119
-0.117
udd-0.116
Token.
Token ID13
Feature activation+2.60e-02

Pos logit contributions
uddle+0.037
s+0.030
udd+0.021
 sad+0.015
 She+0.015
Neg logit contributions
 matter-0.116
 avail-0.104
icians-0.103
 remind-0.102
illa-0.101
Token "
Token ID366
Feature activation-5.55e-02

Pos logit contributions
uddle+0.032
s+0.019
udd+0.018
uddled+0.012
iced+0.012
Neg logit contributions
 matter-0.063
 letting-0.060
 yours-0.057
 remind-0.056
weet-0.056
TokenSal
Token ID19221
Feature activation-2.55e-01

Pos logit contributions
 matter+0.121
 remind+0.104
 yours+0.103
icians+0.103
pa+0.103
Neg logit contributions
uddle-0.081
s-0.068
udd-0.061
-0.055
 She-0.054
Tokenad
Token ID324
Feature activation-7.15e-02

Pos logit contributions
 matter+0.349
 avail+0.308
epad+0.306
icians+0.291
 letting+0.290
Neg logit contributions
uddle-0.533
s-0.496
udd-0.475
-0.456
-0.447
Token?
Token ID30
Feature activation-4.78e-02

Pos logit contributions
 matter+0.143
 avail+0.134
epad+0.117
bet+0.117
 occasions+0.117
Neg logit contributions
uddle-0.124
s-0.116
udd-0.103
-0.090
 sad-0.087
Token That
Token ID1320
Feature activation-7.69e-02

Pos logit contributions
 matter+0.097
 remind+0.082
icians+0.080
 yours+0.080
illa+0.080
Neg logit contributions
uddle-0.078
s-0.064
udd-0.060
-0.053
 She-0.052
Token's
Token ID338
Feature activation-7.75e-02

Pos logit contributions
 matter+0.178
icians+0.160
ocket+0.157
bet+0.157
illa+0.156
Neg logit contributions
uddle-0.102
s-0.086
udd-0.069
-0.063
-0.061
Token all
Token ID477
Feature activation-2.77e-02

Pos logit contributions
keeping+0.170
 occasions+0.168
icians+0.166
epad+0.165
enna+0.163
Neg logit contributions
uddle-0.092
 sad-0.079
 big-0.079
 nicely-0.077
s-0.074
Token Larry
Token ID13633
Feature activation-3.65e-02

Pos logit contributions
uddle+0.046
s+0.038
+0.035
 She+0.033
+0.032
Neg logit contributions
 matter-0.132
 occasions-0.116
bons-0.116
 avail-0.115
bet-0.114
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
 matter+0.108
bons+0.097
 avail+0.097
bet+0.096
icians+0.096
Neg logit contributions
uddle-0.023
s-0.019
udd-0.010
-0.009
-0.009
Token
Token ID198
Feature activation-1.15e-02

Pos logit contributions
uddle+0.030
udd+0.022
s+0.021
 sad+0.016
 gym+0.015
Neg logit contributions
 matter-0.049
 remind-0.043
 letting-0.043
illa-0.043
icians-0.043
Token
Token ID198
Feature activation-1.63e-02

Pos logit contributions
 matter+0.032
icians+0.029
 remind+0.029
illa+0.028
 letting+0.028
Neg logit contributions
uddle-0.010
udd-0.006
s-0.006
 sad-0.003
 gym-0.003
Token"
Token ID1
Feature activation-6.05e-02

Pos logit contributions
 matter+0.036
 remind+0.030
 yours+0.030
icians+0.030
bet+0.030
Neg logit contributions
uddle-0.024
s-0.020
udd-0.018
-0.016
 She-0.016
TokenWill
Token ID8743
Feature activation-2.53e-01

Pos logit contributions
 matter+0.139
igh+0.134
ocket+0.132
 yours+0.131
pa+0.126
Neg logit contributions
uddle-0.063
s-0.062
-0.054
-0.053
 She-0.053
Token you
Token ID345
Feature activation-5.27e-02

Pos logit contributions
 matter+0.519
epad+0.459
bet+0.451
icians+0.445
 occasions+0.443
Neg logit contributions
uddle-0.397
s-0.365
 sad-0.291
 She-0.283
udd-0.283
Token play
Token ID711
Feature activation+1.89e-02

Pos logit contributions
 matter+0.101
keeping+0.095
 occasions+0.093
icians+0.093
 facts+0.092
Neg logit contributions
uddle-0.080
s-0.066
 big-0.062
 sad-0.061
udd-0.060
Token with
Token ID351
Feature activation-9.21e-02

Pos logit contributions
uddle+0.031
s+0.026
udd+0.024
+0.021
 She+0.021
Neg logit contributions
 matter-0.038
 remind-0.032
 yours-0.032
icians-0.032
 letting-0.031
Token me
Token ID502
Feature activation-2.76e-02

Pos logit contributions
 matter+0.209
epad+0.184
 bid+0.181
pper+0.179
ocket+0.178
Neg logit contributions
uddle-0.124
s-0.101
 big-0.091
 nicely-0.084
udd-0.084
Token?"
Token ID1701
Feature activation+4.70e-02

Pos logit contributions
 matter+0.062
bet+0.052
 remind+0.052
 avail+0.052
icians+0.052
Neg logit contributions
uddle-0.040
s-0.032
udd-0.029
-0.026
-0.025
Token You
Token ID921
Feature activation-1.23e-01

Pos logit contributions
uddle+0.030
s+0.025
udd+0.023
 She+0.021
+0.020
Neg logit contributions
 matter-0.042
 remind-0.035
icians-0.035
illa-0.034
 avail-0.034
Token learned
Token ID4499
Feature activation-2.31e-02

Pos logit contributions
 matter+0.241
 avail+0.206
bet+0.204
 occasions+0.203
icians+0.202
Neg logit contributions
uddle-0.211
s-0.181
udd-0.169
-0.151
-0.149
Token something
Token ID1223
Feature activation+4.79e-02

Pos logit contributions
 matter+0.040
 yours+0.036
 remind+0.034
 protection+0.033
 letting+0.032
Neg logit contributions
uddle-0.045
s-0.036
udd-0.035
 gym-0.032
-0.032
Token new
Token ID649
Feature activation+4.78e-02

Pos logit contributions
uddle+0.065
udd+0.046
 Suddenly+0.044
 gym+0.043
 While+0.042
Neg logit contributions
 matter-0.104
 yours-0.095
 remind-0.091
icated-0.090
 generosity-0.090
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
uddle+0.046
s+0.041
+0.029
udd+0.029
+0.027
Neg logit contributions
 matter-0.131
 avail-0.117
bet-0.115
icians-0.115
igh-0.113
Token Lem
Token ID20607
Feature activation-2.51e-01

Pos logit contributions
uddle+0.017
s+0.014
 She+0.014
+0.013
 He+0.013
Neg logit contributions
 matter-0.029
icians-0.024
 remind-0.024
 avail-0.024
pa-0.024
Tokenons
Token ID684
Feature activation-1.14e-01

Pos logit contributions
 matter+0.332
 avail+0.256
epad+0.252
bet+0.244
 letting+0.238
Neg logit contributions
uddle-0.612
s-0.569
udd-0.535
-0.495
-0.469
Token are
Token ID389
Feature activation-2.49e-02

Pos logit contributions
 matter+0.256
 avail+0.233
bet+0.223
icians+0.221
illa+0.219
Neg logit contributions
uddle-0.165
s-0.156
udd-0.129
-0.113
 She-0.112
Token good
Token ID922
Feature activation+1.18e-02

Pos logit contributions
keeping+0.065
igh+0.065
eman+0.065
 matter+0.065
 avail+0.064
Neg logit contributions
 big-0.019
uddle-0.018
 sad-0.016
 small-0.016
 new-0.016
Token for
Token ID329
Feature activation-7.27e-02

Pos logit contributions
uddle+0.011
s+0.011
udd+0.008
,+0.008
+0.007
Neg logit contributions
 matter-0.031
 avail-0.029
illa-0.028
icians-0.028
bet-0.028
Token some
Token ID617
Feature activation-6.02e-02

Pos logit contributions
 matter+0.146
 bid+0.136
epad+0.134
enna+0.134
igh+0.133
Neg logit contributions
uddle-0.089
 big-0.088
 nicely-0.084
-0.083
 small-0.083
Token dad
Token ID9955
Feature activation-5.27e-02

Pos logit contributions
 matter+0.199
icians+0.175
bet+0.168
 occasions+0.167
icated+0.166
Neg logit contributions
uddle-0.137
s-0.119
udd-0.103
 She-0.099
 sad-0.098
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
 matter+0.154
 avail+0.139
icians+0.137
bet+0.136
 yours+0.134
Neg logit contributions
uddle-0.036
s-0.033
-0.019
udd-0.018
 sad-0.016
Token
Token ID198
Feature activation-1.40e-02

Pos logit contributions
uddle+0.017
udd+0.014
s+0.010
obb+0.009
ooming+0.009
Neg logit contributions
 matter-0.023
 letting-0.022
 meaning-0.020
keeping-0.020
 bid-0.020
Token
Token ID198
Feature activation-1.75e-02

Pos logit contributions
 matter+0.031
 letting+0.029
 remind+0.028
illa+0.028
 yours+0.028
Neg logit contributions
uddle-0.019
udd-0.013
s-0.010
eeper-0.009
obb-0.009
Token"
Token ID1
Feature activation-5.89e-02

Pos logit contributions
 matter+0.035
 letting+0.030
 remind+0.029
illa+0.029
pper+0.029
Neg logit contributions
uddle-0.029
udd-0.023
s-0.020
 sad-0.018
 gym-0.017
TokenWill
Token ID8743
Feature activation-2.51e-01

Pos logit contributions
 matter+0.120
 remind+0.102
 yours+0.101
icians+0.100
ocket+0.099
Neg logit contributions
uddle-0.096
s-0.080
udd-0.073
-0.067
 She-0.065
Token the
Token ID262
Feature activation+2.48e-02

Pos logit contributions
 matter+0.464
 remind+0.384
 yours+0.381
pa+0.379
 letting+0.373
Neg logit contributions
uddle-0.461
udd-0.370
s-0.366
-0.321
 gym-0.318
Token storm
Token ID6388
Feature activation+2.46e-02

Pos logit contributions
uddle+0.047
s+0.043
udd+0.039
 She+0.035
+0.035
Neg logit contributions
 matter-0.042
 remind-0.033
pa-0.033
 protection-0.033
 letting-0.032
Token go
Token ID467
Feature activation+9.99e-02

Pos logit contributions
uddle+0.043
s+0.036
udd+0.034
+0.030
 She+0.030
Neg logit contributions
 matter-0.048
 remind-0.040
 yours-0.039
 letting-0.038
icians-0.038
Token away
Token ID1497
Feature activation+8.18e-02

Pos logit contributions
uddle+0.183
udd+0.147
s+0.142
eeper+0.122
 gym+0.122
Neg logit contributions
 matter-0.190
 remind-0.162
 yours-0.154
pa-0.152
 letting-0.147
Token?"
Token ID1701
Feature activation+5.12e-02

Pos logit contributions
uddle+0.157
s+0.129
udd+0.127
 She+0.110
 gym+0.108
Neg logit contributions
 matter-0.143
 remind-0.118
 yours-0.118
illa-0.116
pa-0.115
Token him
Token ID683
Feature activation+9.82e-02

Pos logit contributions
 matter+0.083
ocket+0.074
icians+0.073
icated+0.073
epad+0.072
Neg logit contributions
uddle-0.052
s-0.049
-0.044
 She-0.043
 He-0.042
Token.
Token ID13
Feature activation+3.15e-02

Pos logit contributions
uddle+0.141
s+0.113
udd+0.104
+0.090
 She+0.087
Neg logit contributions
 matter-0.219
 remind-0.188
 yours-0.185
icians-0.184
 letting-0.183
Token
Token ID198
Feature activation-1.02e-02

Pos logit contributions
uddle+0.039
udd+0.029
s+0.025
 sad+0.019
 gym+0.019
Neg logit contributions
 matter-0.073
 letting-0.065
 remind-0.065
 lending-0.064
bet-0.064
Token
Token ID198
Feature activation-1.74e-02

Pos logit contributions
 matter+0.027
 remind+0.025
 letting+0.025
illa+0.025
icians+0.025
Neg logit contributions
uddle-0.009
udd-0.005
s-0.005
 sad-0.003
 gym-0.003
Token"
Token ID1
Feature activation-5.83e-02

Pos logit contributions
 matter+0.033
 remind+0.028
icians+0.027
 letting+0.027
bet+0.027
Neg logit contributions
uddle-0.031
s-0.025
udd-0.024
-0.022
 She-0.021
TokenWill
Token ID8743
Feature activation-2.50e-01

Pos logit contributions
 matter+0.137
ocket+0.135
igh+0.130
epad+0.129
 avail+0.129
Neg logit contributions
uddle-0.056
s-0.054
 She-0.049
-0.049
 He-0.047
Token you
Token ID345
Feature activation-4.87e-02

Pos logit contributions
 matter+0.464
epad+0.418
 occasions+0.397
ocket+0.397
 avail+0.395
Neg logit contributions
uddle-0.436
s-0.418
 sad-0.335
-0.333
 She-0.328
Token help
Token ID1037
Feature activation-6.70e-02

Pos logit contributions
 matter+0.090
 avail+0.079
icians+0.079
keeping+0.079
 occasions+0.077
Neg logit contributions
uddle-0.082
s-0.070
udd-0.064
 sad-0.063
 big-0.062
Token me
Token ID502
Feature activation+2.12e-03

Pos logit contributions
 matter+0.151
epad+0.140
ocket+0.139
 generosity+0.133
icians+0.132
Neg logit contributions
uddle-0.086
s-0.074
-0.060
udd-0.060
 She-0.058
Token,
Token ID11
Feature activation+4.39e-02

Pos logit contributions
uddle+0.004
s+0.003
udd+0.003
+0.003
 She+0.002
Neg logit contributions
 matter-0.004
icians-0.003
 remind-0.003
bet-0.003
 yours-0.003
Token Wh
Token ID854
Feature activation-1.34e-01

Pos logit contributions
uddle+0.054
s+0.045
+0.044
 He+0.042
 big+0.042
Neg logit contributions
epad-0.101
keeping-0.097
 matter-0.096
icians-0.094
 occasions-0.094
Token said
Token ID531
Feature activation-1.93e-02

Pos logit contributions
uddle+0.027
s+0.023
udd+0.022
+0.020
 She+0.019
Neg logit contributions
 matter-0.026
 remind-0.021
icians-0.021
 yours-0.021
illa-0.021
Token.
Token ID13
Feature activation+3.24e-02

Pos logit contributions
 matter+0.043
 remind+0.039
 letting+0.039
pa+0.038
epad+0.038
Neg logit contributions
uddle-0.027
udd-0.019
s-0.018
-0.016
 gym-0.016
Token
Token ID198
Feature activation+5.24e-04

Pos logit contributions
uddle+0.042
udd+0.029
s+0.023
ashes+0.018
 gym+0.017
Neg logit contributions
 matter-0.073
 letting-0.072
 lending-0.070
 Steven-0.067
 written-0.067
Token
Token ID198
Feature activation-7.84e-03

Pos logit contributions
uddle+0.001
udd+0.001
obb+0.001
 gym+0.001
ashes+0.001
Neg logit contributions
pa-0.001
weet-0.001
 letting-0.001
 Steven-0.001
 yours-0.001
Token"
Token ID1
Feature activation-5.33e-02

Pos logit contributions
 letting+0.012
 matter+0.012
cle+0.011
pper+0.011
pa+0.011
Neg logit contributions
uddle-0.015
udd-0.012
s-0.010
 sad-0.010
 gym-0.010
TokenWill
Token ID8743
Feature activation-2.50e-01

Pos logit contributions
 matter+0.114
 remind+0.097
pa+0.096
igh+0.096
 yours+0.096
Neg logit contributions
uddle-0.081
s-0.068
udd-0.061
 She-0.056
-0.056
Token everyone
Token ID2506
Feature activation+5.11e-02

Pos logit contributions
 matter+0.457
epad+0.388
icians+0.382
bet+0.381
 occasions+0.378
Neg logit contributions
uddle-0.450
s-0.420
udd-0.343
 sad-0.335
-0.334
Token be
Token ID307
Feature activation-7.70e-03

Pos logit contributions
uddle+0.075
s+0.066
udd+0.056
 sad+0.053
+0.052
Neg logit contributions
 matter-0.110
 avail-0.095
bet-0.095
icians-0.093
 occasions-0.092
Token okay
Token ID8788
Feature activation-6.11e-03

Pos logit contributions
 matter+0.018
 occasions+0.016
 avail+0.015
icated+0.015
illa+0.015
Neg logit contributions
uddle-0.010
s-0.008
udd-0.008
 sad-0.008
 big-0.007
Token?"
Token ID1701
Feature activation+5.43e-02

Pos logit contributions
 matter+0.013
icians+0.011
 remind+0.011
bet+0.011
ocket+0.011
Neg logit contributions
uddle-0.010
s-0.008
udd-0.007
-0.007
 She-0.007
Token Ben
Token ID3932
Feature activation+3.89e-02

Pos logit contributions
uddle+0.112
udd+0.091
s+0.085
eeper+0.073
+0.071
Neg logit contributions
 matter-0.095
pa-0.081
 remind-0.079
 avail-0.077
 letting-0.075
Token people
Token ID661
Feature activation+6.63e-04

Pos logit contributions
 matter+0.166
icians+0.150
ocket+0.147
igh+0.147
icated+0.146
Neg logit contributions
uddle-0.096
s-0.091
 nicely-0.076
-0.074
 back-0.073
Token.
Token ID13
Feature activation+2.28e-02

Pos logit contributions
uddle+0.001
s+0.001
udd+0.000
+0.000
 She+0.000
Neg logit contributions
 matter-0.002
 remind-0.001
icians-0.001
 yours-0.001
bet-0.001
Token
Token ID198
Feature activation-1.51e-02

Pos logit contributions
uddle+0.029
udd+0.019
s+0.018
 sad+0.011
eeper+0.011
Neg logit contributions
 matter-0.055
 letting-0.050
 yours-0.049
 lending-0.049
 remind-0.048
Token
Token ID198
Feature activation-2.07e-02

Pos logit contributions
 matter+0.031
 letting+0.030
pa+0.030
 remind+0.029
 lending+0.029
Neg logit contributions
uddle-0.020
udd-0.014
s-0.011
obb-0.011
eeper-0.010
Token"
Token ID1
Feature activation-5.36e-02

Pos logit contributions
 matter+0.035
 letting+0.034
 lending+0.032
 yours+0.032
icians+0.032
Neg logit contributions
uddle-0.039
udd-0.028
s-0.026
tub-0.024
 sad-0.024
TokenWill
Token ID8743
Feature activation-2.49e-01

Pos logit contributions
 matter+0.113
ocket+0.102
 yours+0.102
igh+0.101
 remind+0.100
Neg logit contributions
uddle-0.073
s-0.063
-0.057
udd-0.055
-0.054
Token the
Token ID262
Feature activation+2.66e-02

Pos logit contributions
 matter+0.460
 remind+0.379
icians+0.373
bet+0.369
 yours+0.369
Neg logit contributions
uddle-0.456
s-0.394
udd-0.358
-0.328
 She-0.323
Token earthquake
Token ID16295
Feature activation+2.41e-02

Pos logit contributions
uddle+0.045
s+0.038
udd+0.035
+0.030
 She+0.030
Neg logit contributions
 matter-0.054
 remind-0.045
pa-0.043
 yours-0.043
 letting-0.043
Token stop
Token ID2245
Feature activation+1.01e-01

Pos logit contributions
uddle+0.037
s+0.029
udd+0.028
+0.024
 She+0.024
Neg logit contributions
 matter-0.052
 remind-0.044
 yours-0.043
 letting-0.043
pa-0.042
Token,
Token ID11
Feature activation+7.73e-02

Pos logit contributions
uddle+0.145
s+0.122
udd+0.108
+0.097
 sad+0.095
Neg logit contributions
 matter-0.224
ocket-0.191
bet-0.191
icians-0.190
 occasions-0.189
Token mom
Token ID1995
Feature activation-1.38e-02

Pos logit contributions
uddle+0.104
s+0.080
 sad+0.076
 big+0.073
udd+0.072
Neg logit contributions
 matter-0.169
 occasions-0.152
ocket-0.152
icians-0.151
bet-0.150
Token said
Token ID531
Feature activation-2.44e-02

Pos logit contributions
 matter+0.022
 remind+0.018
pa+0.018
illa+0.017
 letting+0.017
Neg logit contributions
uddle-0.026
s-0.022
udd-0.021
-0.019
 gym-0.019
Token.
Token ID13
Feature activation+2.81e-02

Pos logit contributions
 matter+0.060
 remind+0.053
 letting+0.053
 yours+0.052
pa+0.052
Neg logit contributions
uddle-0.029
s-0.021
udd-0.020
-0.016
 gym-0.015
Token
Token ID198
Feature activation-1.86e-02

Pos logit contributions
uddle+0.030
udd+0.019
s+0.018
eeper+0.011
 sad+0.011
Neg logit contributions
 matter-0.070
 letting-0.066
 lending-0.065
 yours-0.063
 remind-0.063
Token
Token ID198
Feature activation-2.57e-02

Pos logit contributions
 matter+0.044
 letting+0.041
pa+0.041
weet+0.040
illa+0.040
Neg logit contributions
uddle-0.021
udd-0.013
s-0.013
eeper-0.010
 sad-0.010
Token"
Token ID1
Feature activation-6.95e-02

Pos logit contributions
 matter+0.056
 remind+0.049
 letting+0.049
icians+0.049
 occasions+0.048
Neg logit contributions
uddle-0.038
s-0.028
udd-0.027
 sad-0.022
eeper-0.021
TokenWill
Token ID8743
Feature activation-2.49e-01

Pos logit contributions
 matter+0.146
 remind+0.123
igh+0.123
pa+0.123
 yours+0.122
Neg logit contributions
uddle-0.107
s-0.091
udd-0.082
 She-0.077
-0.076
Token she
Token ID673
Feature activation+8.58e-02

Pos logit contributions
 matter+0.426
 remind+0.347
icians+0.343
bet+0.341
epad+0.338
Neg logit contributions
uddle-0.486
s-0.434
udd-0.386
-0.363
 She-0.358
Token be
Token ID307
Feature activation-1.98e-02

Pos logit contributions
uddle+0.142
s+0.125
udd+0.110
 sad+0.108
+0.108
Neg logit contributions
 matter-0.159
icians-0.138
keeping-0.137
bet-0.136
 avail-0.136
Token happy
Token ID3772
Feature activation+7.15e-02

Pos logit contributions
 matter+0.046
 occasions+0.041
icated+0.039
fulness+0.039
 facts+0.039
Neg logit contributions
uddle-0.023
s-0.019
 sad-0.019
 big-0.019
 back-0.018
Token to
Token ID284
Feature activation-4.99e-02

Pos logit contributions
uddle+0.091
s+0.074
udd+0.064
 She+0.058
+0.057
Neg logit contributions
 matter-0.168
bet-0.147
icians-0.147
ocket-0.145
illa-0.144
Token see
Token ID766
Feature activation+2.08e-02

Pos logit contributions
 matter+0.103
icians+0.089
bet+0.088
 sizes+0.088
illa+0.088
Neg logit contributions
uddle-0.072
s-0.063
-0.055
 She-0.054
udd-0.053
Token says
Token ID1139
Feature activation-1.96e-02

Pos logit contributions
uddle+0.011
s+0.009
udd+0.008
+0.008
 She+0.008
Neg logit contributions
 matter-0.012
 remind-0.010
 yours-0.010
icians-0.010
bet-0.010
Token.
Token ID13
Feature activation+3.36e-02

Pos logit contributions
 matter+0.048
 remind+0.042
 letting+0.042
 yours+0.041
illa+0.041
Neg logit contributions
uddle-0.024
s-0.017
udd-0.016
-0.013
 gym-0.013
Token
Token ID198
Feature activation-3.49e-03

Pos logit contributions
uddle+0.049
udd+0.034
s+0.026
eeper+0.026
obb+0.025
Neg logit contributions
 letting-0.070
 matter-0.068
 lending-0.065
 remind-0.063
 saying-0.063
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
 matter+0.007
 letting+0.007
pa+0.007
 remind+0.007
weet+0.007
Neg logit contributions
uddle-0.005
udd-0.003
obb-0.003
eeper-0.003
s-0.003
Token"
Token ID1
Feature activation-5.86e-02

Pos logit contributions
 matter+0.025
 letting+0.022
 remind+0.022
 lending+0.022
 occasions+0.021
Neg logit contributions
uddle-0.019
udd-0.014
s-0.014
 sad-0.012
eeper-0.012
TokenWill
Token ID8743
Feature activation-2.48e-01

Pos logit contributions
 matter+0.128
igh+0.116
pa+0.113
ocket+0.113
 yours+0.112
Neg logit contributions
uddle-0.081
s-0.069
udd-0.061
-0.060
 She-0.059
Token they
Token ID484
Feature activation+4.65e-02

Pos logit contributions
 matter+0.452
 remind+0.371
icians+0.369
bet+0.365
epad+0.365
Neg logit contributions
uddle-0.458
s-0.405
udd-0.357
-0.336
 She-0.330
Token be
Token ID307
Feature activation-2.39e-02

Pos logit contributions
uddle+0.072
s+0.062
udd+0.055
+0.055
 sad+0.055
Neg logit contributions
 matter-0.091
 avail-0.080
 occasions-0.079
bet-0.079
icians-0.079
Token loud
Token ID7812
Feature activation-9.60e-03

Pos logit contributions
 matter+0.055
 occasions+0.047
bet+0.046
ocket+0.046
icated+0.046
Neg logit contributions
uddle-0.032
s-0.026
udd-0.024
 sad-0.022
-0.021
Token?"
Token ID1701
Feature activation+4.57e-02

Pos logit contributions
 matter+0.023
 remind+0.019
icians+0.019
 yours+0.019
bet+0.019
Neg logit contributions
uddle-0.013
s-0.010
udd-0.009
-0.008
 She-0.007
Token Ben
Token ID3932
Feature activation+3.50e-02

Pos logit contributions
uddle+0.080
udd+0.063
s+0.060
eeper+0.050
+0.047
Neg logit contributions
 matter-0.093
 remind-0.079
pa-0.077
 avail-0.076
 letting-0.076
Token
Token ID198
Feature activation-6.54e-03

Pos logit contributions
 matter+0.002
 remind+0.002
 letting+0.002
icians+0.002
illa+0.002
Neg logit contributions
uddle-0.001
udd-0.001
s-0.001
 sad-0.000
 gym-0.000
TokenHe
Token ID1544
Feature activation-1.25e-02

Pos logit contributions
 matter+0.015
 remind+0.013
icians+0.013
 letting+0.013
bet+0.013
Neg logit contributions
uddle-0.009
s-0.006
udd-0.006
-0.005
 sad-0.005
Token asked
Token ID1965
Feature activation-8.45e-02

Pos logit contributions
 matter+0.027
 yours+0.024
 avail+0.023
 occasions+0.023
 remind+0.023
Neg logit contributions
uddle-0.019
s-0.017
udd-0.014
-0.013
-0.013
Token,
Token ID11
Feature activation+5.21e-02

Pos logit contributions
 matter+0.204
icians+0.177
bet+0.174
pper+0.174
ocket+0.173
Neg logit contributions
uddle-0.096
s-0.086
-0.075
-0.072
 He-0.070
Token "
Token ID366
Feature activation-5.64e-02

Pos logit contributions
uddle+0.046
s+0.035
udd+0.026
+0.024
+0.024
Neg logit contributions
 matter-0.143
icians-0.125
 occasions-0.125
 remind-0.124
 yours-0.124
TokenWill
Token ID8743
Feature activation-2.48e-01

Pos logit contributions
 matter+0.130
 sizes+0.122
 facts+0.120
 yours+0.119
 generosity+0.118
Neg logit contributions
-0.055
uddle-0.055
-0.050
s-0.048
�-0.043
Token you
Token ID345
Feature activation-5.14e-02

Pos logit contributions
 matter+0.489
epad+0.439
 sizes+0.430
 occasions+0.426
icians+0.423
Neg logit contributions
uddle-0.399
s-0.379
 sad-0.307
-0.306
-0.297
Token play
Token ID711
Feature activation+1.41e-02

Pos logit contributions
 matter+0.097
keeping+0.091
 occasions+0.089
 sizes+0.089
icians+0.087
Neg logit contributions
uddle-0.079
s-0.070
 big-0.068
-0.065
 sad-0.065
Token with
Token ID351
Feature activation-9.42e-02

Pos logit contributions
uddle+0.020
s+0.016
udd+0.014
+0.013
+0.012
Neg logit contributions
 matter-0.032
icians-0.027
 remind-0.027
illa-0.027
bet-0.027
Token me
Token ID502
Feature activation-3.15e-02

Pos logit contributions
 matter+0.231
epad+0.205
 sizes+0.202
 cherish+0.199
 avail+0.197
Neg logit contributions
uddle-0.108
s-0.089
 big-0.082
-0.074
-0.073
Token?"
Token ID1701
Feature activation+4.90e-02

Pos logit contributions
 matter+0.071
bet+0.060
 avail+0.060
icians+0.060
 occasions+0.059
Neg logit contributions
uddle-0.045
s-0.037
udd-0.033
-0.030
-0.029
Token screen
Token ID3159
Feature activation-4.96e-02

Pos logit contributions
uddle+0.125
s+0.105
udd+0.099
+0.089
 She+0.087
Neg logit contributions
 matter-0.128
 remind-0.106
 yours-0.103
icians-0.103
 letting-0.102
Token.
Token ID13
Feature activation+2.88e-02

Pos logit contributions
 matter+0.138
bet+0.124
icians+0.123
 avail+0.122
 remind+0.121
Neg logit contributions
uddle-0.042
s-0.036
 She-0.023
udd-0.022
-0.022
Token
Token ID198
Feature activation-6.67e-03

Pos logit contributions
uddle+0.032
udd+0.021
s+0.019
 sad+0.013
 gym+0.012
Neg logit contributions
 matter-0.069
 letting-0.065
 remind-0.064
 lending-0.063
icated-0.063
Token
Token ID198
Feature activation-1.21e-02

Pos logit contributions
 matter+0.014
 letting+0.014
 remind+0.013
 lending+0.013
icians+0.013
Neg logit contributions
uddle-0.009
udd-0.006
s-0.005
 sad-0.004
eeper-0.004
Token"
Token ID1
Feature activation-5.23e-02

Pos logit contributions
 matter+0.025
 letting+0.022
icians+0.022
 remind+0.022
 lending+0.021
Neg logit contributions
uddle-0.020
udd-0.015
s-0.015
 sad-0.012
 gym-0.012
TokenWas
Token ID16973
Feature activation-2.48e-01

Pos logit contributions
 yours+0.122
 supervision+0.122
pa+0.122
ocket+0.119
 matter+0.119
Neg logit contributions
-0.045
-0.044
She-0.041
L-0.040
 She-0.038
Tokenn
Token ID77
Feature activation-7.00e-02

Pos logit contributions
 matter+0.396
bet+0.328
 occasions+0.327
epad+0.322
 remind+0.318
Neg logit contributions
uddle-0.474
s-0.438
udd-0.393
-0.389
 She-0.381
Token't
Token ID470
Feature activation-8.19e-02

Pos logit contributions
 matter+0.148
bet+0.123
 remind+0.123
 occasions+0.123
 avail+0.122
Neg logit contributions
uddle-0.109
s-0.093
udd-0.082
-0.074
 She-0.074
Token that
Token ID326
Feature activation+1.28e-02

Pos logit contributions
bet+0.177
 matter+0.174
 occasions+0.174
icians+0.166
bons+0.165
Neg logit contributions
uddle-0.091
s-0.085
 sad-0.085
 big-0.085
-0.077
Token fun
Token ID1257
Feature activation+2.86e-02

Pos logit contributions
uddle+0.016
s+0.013
 big+0.012
 sad+0.012
+0.011
Neg logit contributions
bet-0.028
 matter-0.027
pper-0.027
icians-0.027
illa-0.027
Token?"
Token ID1701
Feature activation+5.11e-02

Pos logit contributions
uddle+0.033
s+0.028
 big+0.022
,+0.022
+0.021
Neg logit contributions
 matter-0.068
bet-0.065
pper-0.064
illa-0.062
ocket-0.061
Token honey
Token ID12498
Feature activation-6.49e-02

Pos logit contributions
 matter+0.123
igh+0.111
illa+0.109
pper+0.108
ocket+0.108
Neg logit contributions
uddle-0.091
s-0.088
 big-0.076
-0.071
 She-0.071
Token.
Token ID13
Feature activation+3.18e-02

Pos logit contributions
 matter+0.165
 remind+0.142
icians+0.141
 avail+0.140
pa+0.140
Neg logit contributions
uddle-0.076
s-0.060
udd-0.051
 She-0.042
-0.041
Token
Token ID198
Feature activation-5.41e-03

Pos logit contributions
uddle+0.034
s+0.023
udd+0.022
 sad+0.015
 gym+0.014
Neg logit contributions
 matter-0.076
 letting-0.071
 lending-0.069
 yours-0.069
 remind-0.069
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
 matter+0.011
 letting+0.011
 lending+0.011
 Judy+0.011
illa+0.011
Neg logit contributions
uddle-0.007
udd-0.005
s-0.004
 sad-0.004
 gym-0.004
Token"
Token ID1
Feature activation-5.19e-02

Pos logit contributions
 matter+0.022
 letting+0.020
 lending+0.019
 remind+0.019
 yours+0.019
Neg logit contributions
uddle-0.018
udd-0.013
s-0.013
 sad-0.011
 gym-0.011
TokenWill
Token ID8743
Feature activation-2.47e-01

Pos logit contributions
 matter+0.122
pa+0.119
igh+0.115
 protection+0.114
 avail+0.113
Neg logit contributions
uddle-0.046
-0.044
-0.043
 She-0.041
She-0.041
Token it
Token ID340
Feature activation+8.86e-02

Pos logit contributions
 matter+0.484
epad+0.456
icians+0.433
 avail+0.432
 occasions+0.432
Neg logit contributions
uddle-0.386
s-0.385
 sad-0.307
,-0.302
-0.301
Token be
Token ID307
Feature activation-1.01e-02

Pos logit contributions
uddle+0.159
s+0.134
udd+0.125
+0.113
 She+0.111
Neg logit contributions
 matter-0.166
 remind-0.138
icians-0.135
 yours-0.135
illa-0.133
Token great
Token ID1049
Feature activation+2.38e-02

Pos logit contributions
 matter+0.023
keeping+0.022
 occasions+0.021
illa+0.021
ocket+0.021
Neg logit contributions
uddle-0.011
 big-0.009
s-0.008
-0.008
udd-0.008
Token?"
Token ID1701
Feature activation+5.47e-02

Pos logit contributions
uddle+0.028
s+0.023
udd+0.019
+0.018
+0.017
Neg logit contributions
 matter-0.057
icians-0.052
bet-0.052
pper-0.051
ocket-0.051
Token Sally
Token ID25737
Feature activation+3.61e-02

Pos logit contributions
uddle+0.099
s+0.082
udd+0.078
+0.069
 She+0.067
Neg logit contributions
 matter-0.103
 remind-0.085
 yours-0.083
icians-0.082
 letting-0.082
Token see
Token ID766
Feature activation+1.27e-02

Pos logit contributions
 matter+0.158
icians+0.139
epad+0.137
illa+0.136
ocket+0.135
Neg logit contributions
uddle-0.124
s-0.104
 gym-0.099
-0.097
 big-0.097
Token.
Token ID13
Feature activation+3.35e-02

Pos logit contributions
s+0.013
uddle+0.012
+0.010
+0.009
 She+0.009
Neg logit contributions
 matter-0.034
 avail-0.030
bet-0.030
epad-0.029
pper-0.029
Token
Token ID198
Feature activation-4.72e-03

Pos logit contributions
uddle+0.043
udd+0.032
s+0.030
 sad+0.024
 big+0.021
Neg logit contributions
 matter-0.076
illa-0.068
 remind-0.067
 letting-0.067
pa-0.067
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
 matter+0.012
illa+0.011
icians+0.011
 remind+0.011
 letting+0.011
Neg logit contributions
uddle-0.004
udd-0.003
s-0.002
 sad-0.002
 gym-0.001
Token"
Token ID1
Feature activation-5.64e-02

Pos logit contributions
 matter+0.025
 remind+0.022
icians+0.022
illa+0.021
 letting+0.021
Neg logit contributions
uddle-0.018
s-0.014
udd-0.014
 sad-0.012
-0.011
TokenWill
Token ID8743
Feature activation-2.47e-01

Pos logit contributions
 matter+0.128
ocket+0.125
 yours+0.122
igh+0.121
isdom+0.120
Neg logit contributions
s-0.054
-0.053
-0.053
uddle-0.051
 She-0.048
Token I
Token ID314
Feature activation-6.13e-02

Pos logit contributions
 matter+0.524
epad+0.455
bet+0.452
 avail+0.448
ocket+0.446
Neg logit contributions
uddle-0.373
s-0.349
udd-0.270
 sad-0.268
-0.265
Token see
Token ID766
Feature activation-1.70e-02

Pos logit contributions
 matter+0.127
 avail+0.108
icians+0.107
icated+0.107
 occasions+0.106
Neg logit contributions
uddle-0.098
s-0.085
udd-0.073
-0.071
-0.070
Token cows
Token ID22575
Feature activation-6.76e-02

Pos logit contributions
 matter+0.039
epad+0.035
keeping+0.035
bet+0.035
bek+0.034
Neg logit contributions
s-0.020
uddle-0.019
-0.017
-0.017
 big-0.017
Token,
Token ID11
Feature activation+4.71e-02

Pos logit contributions
 matter+0.178
 occasions+0.158
bet+0.157
ocket+0.155
 avail+0.154
Neg logit contributions
uddle-0.068
s-0.056
udd-0.042
 sad-0.039
-0.038
Token mum
Token ID25682
Feature activation-2.94e-02

Pos logit contributions
uddle+0.066
s+0.059
+0.053
 big+0.053
 sad+0.051
Neg logit contributions
 matter-0.100
keeping-0.092
 avail-0.091
epad-0.091
 occasions-0.090
Token water
Token ID1660
Feature activation+1.13e-02

Pos logit contributions
 matter+0.061
icians+0.053
 remind+0.052
illa+0.051
 yours+0.051
Neg logit contributions
uddle-0.039
s-0.033
udd-0.029
-0.026
 She-0.026
Token.
Token ID13
Feature activation+3.29e-02

Pos logit contributions
uddle+0.009
s+0.008
udd+0.005
 She+0.004
+0.004
Neg logit contributions
 matter-0.034
icians-0.030
 avail-0.030
illa-0.029
bet-0.029
Token
Token ID198
Feature activation-5.09e-03

Pos logit contributions
uddle+0.038
s+0.025
udd+0.024
 sad+0.017
 gym+0.017
Neg logit contributions
 matter-0.080
 letting-0.072
 remind-0.072
 yours-0.071
 lending-0.071
Token
Token ID198
Feature activation-9.90e-03

Pos logit contributions
 matter+0.011
 letting+0.011
 remind+0.011
 lending+0.011
illa+0.011
Neg logit contributions
uddle-0.006
udd-0.004
s-0.003
 gym-0.003
ashes-0.003
Token"
Token ID1
Feature activation-5.05e-02

Pos logit contributions
 matter+0.021
 remind+0.018
 letting+0.018
icians+0.018
 lending+0.018
Neg logit contributions
uddle-0.015
udd-0.011
s-0.011
 gym-0.009
-0.009
TokenWill
Token ID8743
Feature activation-2.47e-01

Pos logit contributions
 matter+0.125
pa+0.122
ocket+0.121
 yours+0.121
igh+0.121
Neg logit contributions
-0.037
-0.036
uddle-0.036
L-0.033
She-0.033
Token the
Token ID262
Feature activation+2.47e-02

Pos logit contributions
 matter+0.479
epad+0.442
icians+0.422
 occasions+0.421
 sizes+0.417
Neg logit contributions
uddle-0.409
s-0.401
 sad-0.320
-0.315
-0.306
Token egg
Token ID5935
Feature activation-8.56e-02

Pos logit contributions
uddle+0.052
s+0.045
 big+0.044
 small+0.041
 sad+0.041
Neg logit contributions
icians-0.037
 matter-0.037
enna-0.035
epad-0.034
icated-0.034
Token be
Token ID307
Feature activation-1.54e-02

Pos logit contributions
 matter+0.178
icians+0.151
bet+0.149
 avail+0.148
 letting+0.147
Neg logit contributions
uddle-0.137
s-0.114
udd-0.102
-0.092
 sad-0.090
Token warm
Token ID5814
Feature activation+8.45e-03

Pos logit contributions
 matter+0.038
 avail+0.033
 facts+0.033
 occasions+0.033
icians+0.033
Neg logit contributions
uddle-0.017
s-0.014
 big-0.013
 sad-0.013
udd-0.012
Token now
Token ID783
Feature activation+8.74e-02

Pos logit contributions
uddle+0.013
s+0.011
udd+0.010
+0.009
 She+0.009
Neg logit contributions
 matter-0.018
icians-0.015
 remind-0.015
bet-0.015
illa-0.015
Token excited
Token ID6568
Feature activation+4.19e-02

Pos logit contributions
uddle+0.096
s+0.076
udd+0.075
+0.070
 gym+0.062
Neg logit contributions
 matter-0.099
 remind-0.087
 expected-0.081
 generosity-0.080
 yours-0.079
Token.
Token ID13
Feature activation+3.12e-02

Pos logit contributions
uddle+0.034
s+0.031
udd+0.022
 She+0.022
+0.020
Neg logit contributions
 matter-0.113
icians-0.109
bet-0.107
igh-0.107
pper-0.106
Token She
Token ID1375
Feature activation-1.51e-02

Pos logit contributions
uddle+0.027
s+0.015
udd+0.014
+0.008
 sad+0.006
Neg logit contributions
 matter-0.088
 remind-0.078
 letting-0.078
epad-0.077
 occasions-0.077
Token asked
Token ID1965
Feature activation-8.38e-02

Pos logit contributions
 matter+0.030
 yours+0.025
 avail+0.025
icians+0.024
 remind+0.024
Neg logit contributions
uddle-0.026
s-0.022
udd-0.020
 She-0.018
-0.018
Token "
Token ID366
Feature activation-5.30e-02

Pos logit contributions
 matter+0.186
pper+0.162
icians+0.162
bet+0.161
 avail+0.158
Neg logit contributions
uddle-0.111
s-0.099
 She-0.086
 nicely-0.084
-0.084
TokenWill
Token ID8743
Feature activation-2.47e-01

Pos logit contributions
 matter+0.116
 yours+0.105
 generosity+0.104
 protection+0.104
ocket+0.103
Neg logit contributions
uddle-0.064
s-0.055
-0.055
-0.051
 She-0.047
Token I
Token ID314
Feature activation-6.31e-02

Pos logit contributions
 matter+0.491
epad+0.410
bet+0.407
 remind+0.404
icians+0.404
Neg logit contributions
uddle-0.415
s-0.378
udd-0.309
 She-0.301
-0.298
Token get
Token ID651
Feature activation+7.55e-02

Pos logit contributions
 matter+0.126
icians+0.105
 avail+0.105
bet+0.105
icated+0.104
Neg logit contributions
uddle-0.106
s-0.092
udd-0.081
-0.076
-0.076
Token to
Token ID284
Feature activation-5.98e-02

Pos logit contributions
uddle+0.092
s+0.080
udd+0.063
 big+0.063
+0.062
Neg logit contributions
 matter-0.181
pper-0.155
 sizes-0.155
bet-0.154
illa-0.154
Token help
Token ID1037
Feature activation-3.30e-02

Pos logit contributions
 matter+0.123
ocket+0.108
 sizes+0.107
igh+0.107
illa+0.107
Neg logit contributions
uddle-0.087
s-0.072
-0.064
 big-0.064
 gym-0.063
Token you
Token ID345
Feature activation+1.64e-03

Pos logit contributions
 matter+0.070
bet+0.062
ocket+0.061
 sizes+0.061
keeping+0.061
Neg logit contributions
uddle-0.047
s-0.042
-0.034
udd-0.033
 big-0.033
Token network
Token ID3127
Feature activation-2.56e-02

Pos logit contributions
uddle+0.085
 big+0.064
s+0.064
 gym+0.062
 sad+0.059
Neg logit contributions
 matter-0.113
pper-0.101
icians-0.101
 avail-0.100
igh-0.100
Token.
Token ID13
Feature activation+2.46e-02

Pos logit contributions
 matter+0.067
 avail+0.061
bet+0.060
illa+0.059
icians+0.059
Neg logit contributions
uddle-0.026
s-0.022
udd-0.017
 She-0.016
-0.016
Token
Token ID198
Feature activation-2.38e-02

Pos logit contributions
uddle+0.022
s+0.015
udd+0.012
+0.009
 She+0.008
Neg logit contributions
 matter-0.068
 remind-0.061
icians-0.060
 yours-0.060
 letting-0.059
Token
Token ID198
Feature activation-3.10e-02

Pos logit contributions
 matter+0.067
 remind+0.060
icians+0.060
 letting+0.059
illa+0.059
Neg logit contributions
uddle-0.020
s-0.012
udd-0.010
 sad-0.005
-0.005
Token"
Token ID1
Feature activation-6.95e-02

Pos logit contributions
 matter+0.075
 remind+0.065
 yours+0.064
icians+0.064
pa+0.063
Neg logit contributions
uddle-0.039
s-0.030
udd-0.027
-0.023
 She-0.022
TokenWill
Token ID8743
Feature activation-2.47e-01

Pos logit contributions
 matter+0.161
ocket+0.160
igh+0.156
pa+0.154
 yours+0.154
Neg logit contributions
-0.060
-0.059
L-0.059
 She-0.058
She-0.056
Token you
Token ID345
Feature activation-4.69e-02

Pos logit contributions
 matter+0.443
epad+0.377
bet+0.371
ocket+0.367
icians+0.365
Neg logit contributions
uddle-0.448
s-0.420
-0.342
udd-0.342
 She-0.342
Token be
Token ID307
Feature activation-2.01e-02

Pos logit contributions
 matter+0.090
keeping+0.078
ocket+0.077
 occasions+0.077
 sizes+0.076
Neg logit contributions
uddle-0.074
s-0.067
-0.060
 big-0.059
udd-0.057
Token my
Token ID616
Feature activation-3.87e-02

Pos logit contributions
 matter+0.046
 occasions+0.041
fulness+0.039
icated+0.039
ocket+0.039
Neg logit contributions
uddle-0.025
s-0.020
 sad-0.019
 big-0.019
udd-0.019
Token friend
Token ID1545
Feature activation-4.04e-02

Pos logit contributions
 matter+0.090
 occasions+0.080
 avail+0.080
bet+0.079
keeping+0.079
Neg logit contributions
uddle-0.051
s-0.039
udd-0.035
 sad-0.032
 big-0.032
Token?"
Token ID1701
Feature activation+3.76e-02

Pos logit contributions
 matter+0.093
bet+0.087
keeping+0.085
 occasions+0.085
epad+0.084
Neg logit contributions
uddle-0.048
s-0.042
-0.034
udd-0.034
-0.033
Token,
Token ID11
Feature activation+5.77e-02

Pos logit contributions
uddle+0.034
s+0.024
udd+0.018
 She+0.016
+0.015
Neg logit contributions
 matter-0.129
 remind-0.114
icians-0.113
 yours-0.112
bet-0.112
Token
Token ID220
Feature activation+6.23e-02

Pos logit contributions
uddle+0.071
s+0.060
 She+0.054
+0.054
 sad+0.052
Neg logit contributions
 matter-0.134
ocket-0.119
bet-0.119
illa-0.118
 avail-0.118
Token
Token ID198
Feature activation-7.26e-03

Pos logit contributions
uddle+0.057
s+0.034
udd+0.033
 sad+0.021
 gym+0.018
Neg logit contributions
 matter-0.156
 yours-0.149
 letting-0.145
 remind-0.144
illa-0.144
Token
Token ID198
Feature activation-1.89e-02

Pos logit contributions
 matter+0.021
 remind+0.018
icians+0.018
 yours+0.018
 letting+0.018
Neg logit contributions
uddle-0.006
s-0.004
udd-0.003
-0.002
 She-0.002
Token"
Token ID1
Feature activation-6.34e-02

Pos logit contributions
 matter+0.046
 remind+0.040
 yours+0.039
icians+0.039
pa+0.039
Neg logit contributions
uddle-0.024
s-0.018
udd-0.016
-0.014
 She-0.014
TokenWill
Token ID8743
Feature activation-2.46e-01

Pos logit contributions
 matter+0.154
pa+0.151
 avail+0.150
igh+0.149
ocket+0.148
Neg logit contributions
-0.053
uddle-0.050
L-0.047
-0.047
s-0.046
Tokenie
Token ID494
Feature activation-1.82e-03

Pos logit contributions
 matter+0.459
epad+0.403
bet+0.396
ocket+0.392
 occasions+0.388
Neg logit contributions
uddle-0.424
s-0.403
-0.321
 sad-0.321
udd-0.319
Token!
Token ID0
Feature activation-6.16e-03

Pos logit contributions
 matter+0.003
 avail+0.003
icians+0.003
 sizes+0.003
bet+0.003
Neg logit contributions
uddle-0.003
s-0.003
 sad-0.002
udd-0.002
-0.002
Token Did
Token ID7731
Feature activation-1.56e-01

Pos logit contributions
 matter+0.013
 avail+0.011
icians+0.011
pa+0.011
epad+0.011
Neg logit contributions
uddle-0.009
s-0.008
 She-0.008
 He-0.008
-0.008
Token you
Token ID345
Feature activation-3.17e-02

Pos logit contributions
 matter+0.300
keeping+0.288
epad+0.287
bet+0.287
ocket+0.273
Neg logit contributions
uddle-0.230
s-0.224
 She-0.191
-0.189
 big-0.187
Token steal
Token ID8711
Feature activation-4.55e-03

Pos logit contributions
 matter+0.060
bet+0.056
keeping+0.055
 occasions+0.054
icians+0.054
Neg logit contributions
uddle-0.047
s-0.044
 sad-0.039
 big-0.039
-0.038
Token
Token ID198
Feature activation-1.17e-02

Pos logit contributions
uddle+0.044
udd+0.033
s+0.033
 sad+0.026
+0.026
Neg logit contributions
 matter-0.063
 remind-0.053
 letting-0.053
pa-0.053
icians-0.052
Token
Token ID198
Feature activation-1.77e-02

Pos logit contributions
 matter+0.032
pa+0.029
 remind+0.029
illa+0.029
icians+0.028
Neg logit contributions
uddle-0.011
udd-0.006
s-0.005
 sad-0.004
 gym-0.003
TokenJohn
Token ID7554
Feature activation-4.94e-02

Pos logit contributions
 matter+0.036
 remind+0.030
icians+0.030
pa+0.029
illa+0.029
Neg logit contributions
uddle-0.030
s-0.024
udd-0.023
-0.020
 sad-0.020
Token:
Token ID25
Feature activation+2.94e-02

Pos logit contributions
 matter+0.102
 yours+0.090
 avail+0.089
 occasions+0.087
epad+0.086
Neg logit contributions
uddle-0.078
s-0.072
udd-0.060
-0.056
-0.055
Token "
Token ID366
Feature activation-7.23e-02

Pos logit contributions
uddle+0.031
s+0.024
udd+0.020
+0.018
 She+0.017
Neg logit contributions
 matter-0.075
 remind-0.066
icians-0.065
 yours-0.065
bet-0.065
TokenWill
Token ID8743
Feature activation-2.45e-01

Pos logit contributions
 matter+0.167
ocket+0.167
 yours+0.163
 generosity+0.161
 avail+0.160
Neg logit contributions
-0.060
-0.059
s-0.058
uddle-0.057
 She-0.054
Token you
Token ID345
Feature activation-5.20e-02

Pos logit contributions
 matter+0.426
epad+0.351
 remind+0.351
 occasions+0.346
ocket+0.345
Neg logit contributions
uddle-0.474
s-0.428
udd-0.371
-0.353
-0.350
Token help
Token ID1037
Feature activation-6.56e-02

Pos logit contributions
 matter+0.097
keeping+0.090
 avail+0.090
ocket+0.089
 occasions+0.088
Neg logit contributions
uddle-0.084
s-0.071
 big-0.066
udd-0.064
-0.064
Token me
Token ID502
Feature activation-4.80e-04

Pos logit contributions
 matter+0.152
ocket+0.145
epad+0.141
 generosity+0.136
bet+0.136
Neg logit contributions
uddle-0.077
s-0.066
-0.054
 big-0.053
udd-0.051
Token build
Token ID1382
Feature activation-1.84e-02

Pos logit contributions
 matter+0.001
 remind+0.001
icians+0.001
 yours+0.001
bet+0.001
Neg logit contributions
uddle-0.001
s-0.001
udd-0.001
-0.001
 She-0.001
Token a
Token ID257
Feature activation+7.62e-03

Pos logit contributions
 matter+0.041
 remind+0.035
illa+0.035
ocket+0.034
icians+0.034
Neg logit contributions
uddle-0.026
s-0.021
udd-0.019
-0.017
-0.017
Token scared
Token ID12008
Feature activation+8.06e-03

Pos logit contributions
 matter+0.062
bet+0.055
 occasions+0.054
icians+0.054
 avail+0.054
Neg logit contributions
uddle-0.032
s-0.028
udd-0.024
 sad-0.023
 She-0.022
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
uddle+0.009
s+0.007
udd+0.006
 She+0.005
+0.005
Neg logit contributions
 matter-0.021
icians-0.018
 remind-0.018
bet-0.018
illa-0.018
Token
Token ID198
Feature activation-8.75e-03

Pos logit contributions
uddle+0.032
s+0.021
udd+0.018
 sad+0.012
uddled+0.012
Neg logit contributions
 matter-0.067
 letting-0.062
enna-0.061
epad-0.060
 yours-0.060
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
pa+0.017
illa+0.017
 matter+0.017
 letting+0.016
weet+0.016
Neg logit contributions
uddle-0.012
udd-0.008
ashes-0.007
s-0.007
eeper-0.007
Token"
Token ID1
Feature activation-5.21e-02

Pos logit contributions
 matter+0.028
illa+0.026
 remind+0.025
 letting+0.025
pper+0.025
Neg logit contributions
uddle-0.024
udd-0.018
s-0.017
 sad-0.016
ashes-0.015
TokenWill
Token ID8743
Feature activation-2.45e-01

Pos logit contributions
 matter+0.117
 yours+0.107
igh+0.106
ocket+0.105
 remind+0.104
Neg logit contributions
uddle-0.066
s-0.056
udd-0.051
-0.049
 She-0.049
Token the
Token ID262
Feature activation+3.31e-02

Pos logit contributions
 matter+0.437
 remind+0.358
icians+0.356
bet+0.352
 yours+0.348
Neg logit contributions
uddle-0.462
s-0.406
udd-0.365
-0.339
 She-0.335
Token dentist
Token ID38408
Feature activation-7.36e-03

Pos logit contributions
uddle+0.050
s+0.040
udd+0.038
 sad+0.035
+0.035
Neg logit contributions
 matter-0.068
icians-0.061
 yours-0.060
illa-0.059
 avail-0.059
Token hurt
Token ID5938
Feature activation+1.41e-02

Pos logit contributions
 matter+0.014
icians+0.012
bet+0.011
 avail+0.011
 remind+0.011
Neg logit contributions
uddle-0.013
s-0.011
udd-0.010
-0.009
 She-0.009
Token me
Token ID502
Feature activation+1.63e-02

Pos logit contributions
uddle+0.018
s+0.016
 She+0.014
+0.014
 back+0.013
Neg logit contributions
 matter-0.031
icians-0.028
epad-0.027
 sizes-0.027
 generosity-0.027
Token?"
Token ID1701
Feature activation+5.04e-02

Pos logit contributions
uddle+0.023
s+0.019
udd+0.018
 sad+0.016
+0.015
Neg logit contributions
 matter-0.037
 avail-0.031
 occasions-0.031
bet-0.031
icians-0.031
Token grab
Token ID5552
Feature activation-2.47e-02

Pos logit contributions
icians+0.173
 matter+0.173
ocket+0.165
icated+0.164
 sizes+0.164
Neg logit contributions
uddle-0.102
 big-0.082
 She-0.080
-0.079
 gym-0.078
Token one
Token ID530
Feature activation+6.84e-02

Pos logit contributions
 matter+0.054
epad+0.048
icians+0.048
 occasions+0.047
illa+0.047
Neg logit contributions
uddle-0.035
s-0.031
-0.026
-0.026
udd-0.025
Token of
Token ID286
Feature activation-6.61e-02

Pos logit contributions
uddle+0.095
s+0.079
udd+0.061
+0.059
 big+0.058
Neg logit contributions
 matter-0.155
icians-0.148
pa-0.138
 lending-0.137
 occasions-0.137
Token the
Token ID262
Feature activation+6.47e-02

Pos logit contributions
epad+0.201
ities+0.200
icians+0.199
 matter+0.199
 meanings+0.198
Neg logit contributions
-0.026
 The-0.024
 big-0.018
 He-0.018
,-0.015
Token sticks
Token ID16461
Feature activation-3.06e-02

Pos logit contributions
uddle+0.092
udd+0.079
s+0.077
 She+0.061
eth+0.057
Neg logit contributions
 matter-0.145
 remind-0.118
 yours-0.115
 protection-0.114
 letting-0.112
Token lying
Token ID9105
Feature activation+1.52e-01

Pos logit contributions
 matter+0.085
icians+0.076
 occasions+0.076
bet+0.075
 avail+0.074
Neg logit contributions
uddle-0.027
s-0.025
udd-0.016
 She-0.015
-0.015
Token nearby
Token ID6716
Feature activation+6.14e-02

Pos logit contributions
uddle+0.150
s+0.130
 back+0.101
,+0.093
+0.087
Neg logit contributions
 matter-0.411
isdom-0.383
icians-0.380
 avail-0.376
 generosity-0.371
Token.
Token ID13
Feature activation+3.36e-02

Pos logit contributions
uddle+0.043
s+0.034
 She+0.020
+0.019
 He+0.019
Neg logit contributions
 matter-0.183
icians-0.164
 avail-0.164
 remind-0.160
illa-0.160
Token He
Token ID679
Feature activation-6.14e-03

Pos logit contributions
uddle+0.040
udd+0.029
s+0.025
 sad+0.021
obb+0.019
Neg logit contributions
 matter-0.077
 letting-0.071
keeping-0.069
bons-0.069
bet-0.069
Token slowly
Token ID6364
Feature activation+2.28e-02

Pos logit contributions
 matter+0.012
 yours+0.011
 avail+0.011
icated+0.011
epad+0.011
Neg logit contributions
uddle-0.010
s-0.009
udd-0.008
-0.007
-0.007
Token reached
Token ID4251
Feature activation-5.85e-03

Pos logit contributions
uddle+0.042
s+0.036
udd+0.033
+0.030
 She+0.030
Neg logit contributions
 matter-0.042
 yours-0.035
 remind-0.035
icians-0.034
illa-0.034
Token heard
Token ID2982
Feature activation+3.43e-02

Pos logit contributions
 matter+0.099
 yours+0.084
 avail+0.082
 remind+0.082
icians+0.081
Neg logit contributions
uddle-0.080
s-0.068
udd-0.062
-0.055
 She-0.055
Token them
Token ID606
Feature activation+1.07e-01

Pos logit contributions
uddle+0.061
s+0.049
udd+0.048
+0.040
eeper+0.040
Neg logit contributions
 matter-0.066
 remind-0.055
 letting-0.055
 yours-0.054
pa-0.054
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
uddle+0.155
udd+0.113
s+0.111
+0.095
eeper+0.089
Neg logit contributions
 matter-0.235
 remind-0.208
 letting-0.205
pa-0.200
 protection-0.199
Token They
Token ID1119
Feature activation-3.27e-02

Pos logit contributions
uddle+0.040
udd+0.033
ooming+0.028
eeper+0.025
uddled+0.024
Neg logit contributions
 matter-0.036
 sizes-0.034
 facts-0.034
 letting-0.033
 messages-0.033
Token ran
Token ID4966
Feature activation+5.26e-02

Pos logit contributions
 matter+0.065
 remind+0.055
icians+0.054
 yours+0.054
illa+0.053
Neg logit contributions
uddle-0.055
s-0.046
udd-0.042
-0.038
 She-0.037
Token back
Token ID736
Feature activation+1.80e-01

Pos logit contributions
uddle+0.109
udd+0.086
s+0.075
eeper+0.072
icker+0.070
Neg logit contributions
 matter-0.084
 letting-0.079
keeping-0.077
 remind-0.073
 protection-0.073
Token to
Token ID284
Feature activation-5.33e-02

Pos logit contributions
uddle+0.287
udd+0.216
s+0.198
eeper+0.175
 sad+0.169
Neg logit contributions
 matter-0.366
 letting-0.325
 remind-0.321
keeping-0.320
 yours-0.319
Token the
Token ID262
Feature activation+8.36e-02

Pos logit contributions
 matter+0.110
 remind+0.095
 letting+0.092
 yours+0.092
icians+0.091
Neg logit contributions
uddle-0.087
s-0.071
udd-0.067
-0.056
 She-0.055
Token je
Token ID11223
Feature activation-5.47e-02

Pos logit contributions
uddle+0.127
s+0.122
udd+0.109
 She+0.096
eeper+0.093
Neg logit contributions
 matter-0.152
 letting-0.140
 remind-0.137
 lending-0.137
keeping-0.135
Tokenep
Token ID538
Feature activation-4.38e-02

Pos logit contributions
 matter+0.044
icians+0.041
pa+0.035
illa+0.034
 letting+0.033
Neg logit contributions
uddle-0.140
s-0.128
 She-0.126
 He-0.120
 Suddenly-0.120
Token.
Token ID13
Feature activation+2.32e-02

Pos logit contributions
 matter+0.111
 remind+0.097
icians+0.095
 yours+0.095
 letting+0.095
Neg logit contributions
uddle-0.050
s-0.037
udd-0.033
-0.027
 She-0.026
Token and
Token ID290
Feature activation-3.50e-02

Pos logit contributions
uddle+0.197
s+0.163
udd+0.152
+0.134
 She+0.134
Neg logit contributions
 matter-0.222
 remind-0.185
 yours-0.182
 letting-0.180
pa-0.179
Token play
Token ID711
Feature activation+4.13e-02

Pos logit contributions
 matter+0.072
icians+0.062
illa+0.061
bet+0.061
ocket+0.061
Neg logit contributions
uddle-0.054
s-0.045
udd-0.040
 sad-0.038
-0.038
Token in
Token ID287
Feature activation-1.87e-02

Pos logit contributions
uddle+0.065
s+0.055
udd+0.051
 She+0.044
eeper+0.043
Neg logit contributions
 matter-0.084
 remind-0.074
 yours-0.072
icians-0.070
 letting-0.070
Token the
Token ID262
Feature activation+5.73e-02

Pos logit contributions
 matter+0.037
 remind+0.031
 yours+0.031
 letting+0.030
pa+0.030
Neg logit contributions
uddle-0.032
s-0.027
udd-0.025
-0.021
 She-0.021
Token snow
Token ID6729
Feature activation+1.60e-03

Pos logit contributions
uddle+0.112
s+0.109
udd+0.091
appy+0.086
eeper+0.085
Neg logit contributions
 matter-0.088
 yours-0.071
 remind-0.071
pa-0.070
 protection-0.068
Token again
Token ID757
Feature activation+1.86e-01

Pos logit contributions
uddle+0.003
s+0.002
udd+0.002
 She+0.002
+0.002
Neg logit contributions
 matter-0.003
 remind-0.003
 yours-0.003
pa-0.003
icians-0.003
Token?"
Token ID1701
Feature activation+6.26e-02

Pos logit contributions
uddle+0.315
s+0.264
udd+0.245
+0.221
 She+0.214
Neg logit contributions
 matter-0.365
 remind-0.306
icians-0.303
bet-0.300
 yours-0.299
Token
Token ID198
Feature activation+2.79e-02

Pos logit contributions
uddle+0.087
udd+0.061
s+0.053
eeper+0.052
umpy+0.050
Neg logit contributions
 matter-0.136
 remind-0.129
pa-0.126
 letting-0.126
 lending-0.123
Token
Token ID198
Feature activation+1.19e-02

Pos logit contributions
uddle+0.050
udd+0.038
eeper+0.038
ashes+0.037
obb+0.036
Neg logit contributions
 remind-0.044
pa-0.043
 cherish-0.042
 matter-0.042
 bid-0.042
Token"
Token ID1
Feature activation-2.95e-02

Pos logit contributions
uddle+0.020
udd+0.015
s+0.015
eeper+0.013
+0.013
Neg logit contributions
 matter-0.023
 remind-0.021
 cherish-0.021
 letting-0.020
 occasions-0.020
TokenOf
Token ID5189
Feature activation-2.10e-01

Pos logit contributions
 matter+0.065
igh+0.065
pa+0.064
ocket+0.061
 avail+0.061
Neg logit contributions
uddle-0.038
s-0.033
-0.028
udd-0.028
 She-0.027
Token more
Token ID517
Feature activation+1.39e-01

Pos logit contributions
uddle+0.088
s+0.074
udd+0.070
eeper+0.062
+0.059
Neg logit contributions
 matter-0.096
 remind-0.087
 occasions-0.079
 letting-0.079
 yours-0.078
Token.
Token ID13
Feature activation+4.46e-02

Pos logit contributions
uddle+0.178
s+0.136
udd+0.124
+0.108
 She+0.102
Neg logit contributions
 matter-0.336
 remind-0.290
 yours-0.285
 occasions-0.284
 letting-0.281
Token
Token ID198
Feature activation+1.01e-03

Pos logit contributions
uddle+0.068
obb+0.052
udd+0.050
eeper+0.045
umpy+0.045
Neg logit contributions
 matter-0.085
 Janet-0.075
pa-0.074
 letting-0.073
enna-0.072
Token
Token ID198
Feature activation-6.78e-03

Pos logit contributions
uddle+0.001
udd+0.001
eeper+0.001
obb+0.001
ashes+0.001
Neg logit contributions
 matter-0.002
pa-0.002
 remind-0.002
illa-0.002
 letting-0.002
TokenAs
Token ID1722
Feature activation+2.14e-02

Pos logit contributions
 matter+0.014
 remind+0.011
 letting+0.011
 occasions+0.011
 lending+0.011
Neg logit contributions
uddle-0.011
udd-0.008
eeper-0.008
ashes-0.007
obb-0.007
Token she
Token ID673
Feature activation+1.51e-01

Pos logit contributions
uddle+0.035
udd+0.026
s+0.026
+0.022
eeper+0.021
Neg logit contributions
 matter-0.046
 letting-0.039
 remind-0.038
 lending-0.038
 yours-0.037
Token was
Token ID373
Feature activation+5.48e-02

Pos logit contributions
uddle+0.237
udd+0.186
s+0.185
eeper+0.166
 She+0.165
Neg logit contributions
 matter-0.313
 remind-0.258
 occasions-0.255
 letting-0.251
bet-0.249
Token cleaning
Token ID12724
Feature activation-1.02e-02

Pos logit contributions
uddle+0.125
s+0.113
udd+0.101
+0.097
 He+0.095
Neg logit contributions
 matter-0.075
 letting-0.066
 remind-0.065
 lending-0.065
 bringing-0.060
Token up
Token ID510
Feature activation+1.85e-01

Pos logit contributions
 matter+0.023
pa+0.021
 avail+0.020
 remind+0.020
 yours+0.019
Neg logit contributions
uddle-0.013
udd-0.010
s-0.010
-0.008
ashes-0.008
Token,
Token ID11
Feature activation+1.01e-01

Pos logit contributions
uddle+0.265
s+0.199
udd+0.193
+0.159
eeper+0.149
Neg logit contributions
 matter-0.430
 remind-0.365
pa-0.355
 letting-0.350
 protection-0.348
Token Lily
Token ID20037
Feature activation+8.46e-03

Pos logit contributions
uddle+0.190
udd+0.159
s+0.149
umpy+0.131
eeper+0.126
Neg logit contributions
 matter-0.197
 remind-0.159
 avail-0.157
 letting-0.155
 lending-0.149
Token
Token ID198
Feature activation-1.10e-03

Pos logit contributions
uddle+0.016
udd+0.010
s+0.009
 sad+0.007
eeper+0.007
Neg logit contributions
 matter-0.037
 letting-0.035
icians-0.035
 lending-0.035
 remind-0.034
TokenL
Token ID43
Feature activation-1.06e-01

Pos logit contributions
 matter+0.002
 remind+0.002
 yours+0.002
icians+0.002
illa+0.002
Neg logit contributions
uddle-0.002
s-0.001
udd-0.001
-0.001
 She-0.001
Tokenily
Token ID813
Feature activation-1.51e-02

Pos logit contributions
 matter+0.219
 yours+0.186
 remind+0.185
illa+0.185
icians+0.184
Neg logit contributions
uddle-0.175
s-0.146
udd-0.129
-0.115
 sad-0.114
Token and
Token ID290
Feature activation-9.68e-02

Pos logit contributions
 matter+0.030
 remind+0.026
icians+0.025
 yours+0.025
illa+0.025
Neg logit contributions
uddle-0.025
s-0.021
udd-0.019
-0.017
 She-0.017
Token Max
Token ID5436
Feature activation-5.52e-02

Pos logit contributions
 matter+0.228
ocket+0.205
 occasions+0.203
icians+0.203
 sizes+0.200
Neg logit contributions
uddle-0.100
s-0.096
 She-0.086
-0.081
 The-0.078
Token continued
Token ID3767
Feature activation+1.53e-01

Pos logit contributions
 matter+0.107
 yours+0.089
 remind+0.088
icians+0.088
 avail+0.087
Neg logit contributions
uddle-0.095
s-0.081
udd-0.075
 She-0.067
-0.067
Token to
Token ID284
Feature activation-4.13e-02

Pos logit contributions
uddle+0.190
s+0.149
udd+0.132
+0.111
 She+0.109
Neg logit contributions
 matter-0.374
 remind-0.323
icians-0.319
 yours-0.319
illa-0.317
Token be
Token ID307
Feature activation+3.13e-02

Pos logit contributions
 matter+0.094
ocket+0.084
 yours+0.084
icians+0.083
fulness+0.082
Neg logit contributions
uddle-0.051
s-0.043
udd-0.036
 She-0.036
-0.035
Token best
Token ID1266
Feature activation+5.41e-02

Pos logit contributions
uddle+0.047
s+0.038
udd+0.035
+0.030
 She+0.030
Neg logit contributions
 matter-0.069
 remind-0.058
 yours-0.057
icians-0.057
illa-0.057
Token friends
Token ID2460
Feature activation+1.88e-02

Pos logit contributions
uddle+0.087
s+0.072
udd+0.066
+0.058
 She+0.057
Neg logit contributions
 matter-0.114
 remind-0.096
 yours-0.094
icians-0.094
bet-0.094
Token and
Token ID290
Feature activation-4.09e-02

Pos logit contributions
uddle+0.017
s+0.013
udd+0.010
+0.009
 She+0.008
Neg logit contributions
 matter-0.053
bet-0.046
 avail-0.046
icians-0.046
 yours-0.046
Token it
Token ID340
Feature activation+1.14e-01

Pos logit contributions
uddle+0.000
s+0.000
udd+0.000
+0.000
 She+0.000
Neg logit contributions
 matter-0.000
 remind-0.000
icians-0.000
icated-0.000
bet-0.000
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
uddle+0.140
s+0.101
udd+0.099
+0.078
 gym+0.074
Neg logit contributions
 matter-0.275
 remind-0.241
 yours-0.236
 letting-0.234
illa-0.232
Token She
Token ID1375
Feature activation-2.27e-02

Pos logit contributions
uddle+0.022
s+0.015
udd+0.014
 sad+0.009
+0.008
Neg logit contributions
 matter-0.057
 remind-0.051
 letting-0.051
 yours-0.051
 lending-0.050
Token wanted
Token ID2227
Feature activation-4.25e-02

Pos logit contributions
 matter+0.054
 yours+0.047
 avail+0.047
bet+0.046
 remind+0.046
Neg logit contributions
uddle-0.030
s-0.025
udd-0.022
-0.019
-0.019
Token to
Token ID284
Feature activation-8.42e-02

Pos logit contributions
 matter+0.126
pper+0.123
igh+0.118
bet+0.117
icians+0.116
Neg logit contributions
uddle-0.020
s-0.018
 big-0.015
-0.013
 back-0.012
Token go
Token ID467
Feature activation+9.15e-02

Pos logit contributions
 matter+0.176
icians+0.168
 sizes+0.161
igh+0.160
bet+0.160
Neg logit contributions
uddle-0.110
s-0.088
 big-0.087
-0.086
 She-0.086
Token home
Token ID1363
Feature activation+1.28e-01

Pos logit contributions
uddle+0.128
s+0.097
udd+0.094
+0.073
eeper+0.072
Neg logit contributions
 matter-0.207
 remind-0.187
 yours-0.181
 letting-0.179
pa-0.178
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
uddle+0.132
s+0.103
udd+0.086
 She+0.071
+0.069
Neg logit contributions
 matter-0.338
bet-0.293
icians-0.291
 remind-0.291
 yours-0.291
Token
Token ID198
Feature activation-2.24e-02

Pos logit contributions
uddle+0.020
udd+0.011
s+0.011
 sad+0.006
eeper+0.005
Neg logit contributions
 matter-0.062
 remind-0.056
 letting-0.055
 yours-0.055
illa-0.055
Token
Token ID198
Feature activation-2.91e-02

Pos logit contributions
 matter+0.059
 remind+0.054
pa+0.054
illa+0.053
 letting+0.053
Neg logit contributions
uddle-0.021
udd-0.012
s-0.012
 sad-0.007
eeper-0.007
TokenBut
Token ID1537
Feature activation-1.30e-01

Pos logit contributions
 matter+0.069
 remind+0.061
icians+0.060
 letting+0.060
illa+0.060
Neg logit contributions
uddle-0.038
s-0.028
udd-0.026
-0.020
 sad-0.020
Token someone
Token ID2130
Feature activation+2.38e-02

Pos logit contributions
uddle+0.050
s+0.041
udd+0.036
+0.034
 She+0.033
Neg logit contributions
 matter-0.090
 remind-0.078
icians-0.078
 yours-0.077
 occasions-0.076
Token say
Token ID910
Feature activation-6.16e-02

Pos logit contributions
uddle+0.051
s+0.046
udd+0.041
 She+0.040
 sad+0.039
Neg logit contributions
 matter-0.036
 yours-0.029
icians-0.029
bet-0.029
 avail-0.028
Token,
Token ID11
Feature activation+7.12e-02

Pos logit contributions
 matter+0.164
 remind+0.145
icians+0.142
 yours+0.142
 letting+0.141
Neg logit contributions
uddle-0.062
s-0.045
udd-0.039
-0.030
 She-0.028
Token "
Token ID366
Feature activation-4.60e-02

Pos logit contributions
uddle+0.060
s+0.047
 She+0.034
udd+0.034
+0.032
Neg logit contributions
 matter-0.198
 occasions-0.174
icians-0.173
 avail-0.173
 remind-0.173
TokenClose
Token ID26125
Feature activation-7.80e-02

Pos logit contributions
 matter+0.104
ocket+0.102
 yours+0.100
 facts+0.099
epad+0.099
Neg logit contributions
s-0.045
uddle-0.045
-0.043
-0.039
 She-0.037
Token the
Token ID262
Feature activation+4.85e-02

Pos logit contributions
 matter+0.178
icians+0.154
epad+0.154
bet+0.152
ocket+0.151
Neg logit contributions
uddle-0.102
s-0.092
,-0.077
-0.077
 big-0.075
Token hole
Token ID7604
Feature activation-3.15e-03

Pos logit contributions
uddle+0.066
s+0.050
udd+0.044
+0.040
 She+0.039
Neg logit contributions
 matter-0.113
icians-0.100
illa-0.098
 remind-0.098
pa-0.097
Token!"
Token ID2474
Feature activation+5.22e-02

Pos logit contributions
 matter+0.008
 avail+0.007
bet+0.007
icians+0.007
 remind+0.007
Neg logit contributions
uddle-0.004
s-0.003
udd-0.003
 She-0.002
-0.002
Token
Token ID220
Feature activation+6.43e-02

Pos logit contributions
uddle+0.040
s+0.030
udd+0.020
 She+0.019
+0.018
Neg logit contributions
 matter-0.152
 yours-0.134
 remind-0.133
icians-0.133
bet-0.132
Token He
Token ID679
Feature activation+1.60e-02

Pos logit contributions
uddle+0.069
udd+0.043
s+0.039
 gym+0.029
 sad+0.027
Neg logit contributions
 matter-0.149
 letting-0.141
 yours-0.140
illa-0.140
igh-0.139
Token jumped
Token ID11687
Feature activation+5.89e-02

Pos logit contributions
uddle+0.024
s+0.022
+0.018
udd+0.017
+0.017
Neg logit contributions
 matter-0.034
 yours-0.032
icated-0.030
 avail-0.030
 occasions-0.030
Token Tom
Token ID4186
Feature activation-2.55e-02

Pos logit contributions
uddle+0.036
s+0.024
udd+0.022
+0.015
 sad+0.014
Neg logit contributions
 matter-0.100
 remind-0.089
 yours-0.088
 letting-0.088
illa-0.088
Token felt
Token ID2936
Feature activation+8.26e-02

Pos logit contributions
 matter+0.059
 avail+0.051
 yours+0.051
 remind+0.050
 occasions+0.050
Neg logit contributions
uddle-0.036
s-0.030
udd-0.027
-0.023
-0.023
Token happy
Token ID3772
Feature activation+9.82e-02

Pos logit contributions
uddle+0.137
s+0.102
udd+0.096
 gym+0.084
+0.082
Neg logit contributions
 matter-0.175
 remind-0.160
 protection-0.151
 yours-0.150
 letting-0.150
Token knowing
Token ID6970
Feature activation+1.91e-02

Pos logit contributions
uddle+0.102
s+0.085
udd+0.069
+0.065
 He+0.061
Neg logit contributions
 matter-0.260
icians-0.230
bet-0.228
igh-0.227
illa-0.227
Token that
Token ID326
Feature activation+2.53e-02

Pos logit contributions
+0.012
uddle+0.012
s+0.011
 He+0.010
 big+0.010
Neg logit contributions
 matter-0.054
bet-0.050
igh-0.049
icians-0.049
bons-0.048
Token he
Token ID339
Feature activation+1.15e-01

Pos logit contributions
+0.012
 He+0.012
 new+0.011
,+0.011
 sad+0.011
Neg logit contributions
pper-0.075
bet-0.071
illa-0.071
icians-0.071
ocket-0.069
Token could
Token ID714
Feature activation-3.81e-02

Pos logit contributions
s+0.138
uddle+0.134
udd+0.109
+0.103
+0.102
Neg logit contributions
 matter-0.260
 avail-0.245
 occasions-0.241
bet-0.236
 yours-0.233
Token share
Token ID2648
Feature activation+4.24e-02

Pos logit contributions
 matter+0.079
ocket+0.067
icated+0.066
icians+0.066
illa+0.065
Neg logit contributions
uddle-0.060
s-0.051
udd-0.046
-0.043
 She-0.042
Token his
Token ID465
Feature activation+6.11e-02

Pos logit contributions
uddle+0.042
s+0.038
+0.033
 big+0.032
 nicely+0.031
Neg logit contributions
 matter-0.108
pper-0.098
illa-0.097
igh-0.095
bet-0.095
Token love
Token ID1842
Feature activation-3.10e-02

Pos logit contributions
uddle+0.098
s+0.074
udd+0.069
 big+0.068
 sad+0.067
Neg logit contributions
 matter-0.126
illa-0.119
pper-0.119
ocket-0.114
icians-0.114
Token for
Token ID329
Feature activation-4.57e-02

Pos logit contributions
 matter+0.089
 avail+0.085
pper+0.084
illa+0.083
bet+0.082
Neg logit contributions
s-0.019
uddle-0.017
,-0.013
 nicely-0.011
-0.011
Token put
Token ID1234
Feature activation-1.25e-02

Pos logit contributions
 matter+0.091
 occasions+0.079
epad+0.079
icians+0.078
igh+0.078
Neg logit contributions
uddle-0.065
s-0.060
 She-0.051
-0.050
udd-0.049
Token the
Token ID262
Feature activation+7.42e-02

Pos logit contributions
 matter+0.034
icated+0.032
icians+0.031
epad+0.031
 occasions+0.030
Neg logit contributions
uddle-0.010
 back-0.010
s-0.009
-0.009
 She-0.009
Token rock
Token ID3881
Feature activation+1.91e-02

Pos logit contributions
uddle+0.130
s+0.104
udd+0.094
 sad+0.091
+0.090
Neg logit contributions
 matter-0.144
illa-0.125
icians-0.124
pa-0.123
 remind-0.122
Token back
Token ID736
Feature activation+1.81e-01

Pos logit contributions
uddle+0.023
s+0.018
udd+0.016
+0.013
 She+0.013
Neg logit contributions
 matter-0.047
 remind-0.040
icians-0.040
 yours-0.040
illa-0.040
Token on
Token ID319
Feature activation-6.87e-02

Pos logit contributions
uddle+0.205
s+0.176
udd+0.130
 sad+0.120
 She+0.119
Neg logit contributions
 matter-0.467
icians-0.406
 avail-0.405
bet-0.403
illa-0.400
Token the
Token ID262
Feature activation+7.22e-02

Pos logit contributions
 matter+0.168
icians+0.155
icated+0.153
igh+0.151
ocket+0.149
Neg logit contributions
uddle-0.073
s-0.062
 She-0.056
-0.055
-0.049
Token ground
Token ID2323
Feature activation+1.82e-02

Pos logit contributions
uddle+0.128
s+0.105
udd+0.098
+0.089
 sad+0.087
Neg logit contributions
 matter-0.138
icians-0.116
 remind-0.115
illa-0.114
 yours-0.113
Token.
Token ID13
Feature activation+3.84e-02

Pos logit contributions
uddle+0.015
s+0.012
udd+0.008
 She+0.007
+0.007
Neg logit contributions
 matter-0.052
icians-0.047
icated-0.046
illa-0.046
bet-0.046
Token She
Token ID1375
Feature activation-9.96e-03

Pos logit contributions
uddle+0.046
s+0.032
udd+0.032
 sad+0.021
 gym+0.020
Neg logit contributions
 matter-0.091
 letting-0.084
 yours-0.082
 remind-0.081
 lending-0.081
Token gave
Token ID2921
Feature activation-8.43e-02

Pos logit contributions
 matter+0.024
 avail+0.023
 yours+0.022
epad+0.022
icated+0.021
Neg logit contributions
uddle-0.014
s-0.012
udd-0.010
-0.010
 big-0.009
Token the
Token ID262
Feature activation+3.83e-02

Pos logit contributions
 matter+0.186
icians+0.173
epad+0.165
 sizes+0.165
pper+0.164
Neg logit contributions
uddle-0.103
s-0.098
-0.088
 back-0.087
 She-0.085
TokenLu
Token ID25596
Feature activation-9.97e-02

Pos logit contributions
 matter+0.005
 remind+0.005
 occasions+0.005
 letting+0.004
 granted+0.004
Neg logit contributions
uddle-0.007
udd-0.006
s-0.005
tub-0.005
ashes-0.005
Tokency
Token ID948
Feature activation-2.60e-02

Pos logit contributions
 matter+0.229
 remind+0.199
 yours+0.196
icians+0.196
 letting+0.193
Neg logit contributions
uddle-0.138
s-0.113
udd-0.100
-0.087
 She-0.085
Token's
Token ID338
Feature activation-5.53e-02

Pos logit contributions
 matter+0.047
 remind+0.039
 yours+0.038
icians+0.038
illa+0.037
Neg logit contributions
uddle-0.048
s-0.041
udd-0.038
-0.035
 She-0.034
Token dad
Token ID9955
Feature activation-4.56e-02

Pos logit contributions
 matter+0.109
 remind+0.091
icians+0.090
 yours+0.090
 letting+0.088
Neg logit contributions
uddle-0.097
s-0.080
udd-0.074
-0.066
 sad-0.065
Token nodded
Token ID14464
Feature activation+5.35e-02

Pos logit contributions
 matter+0.093
 yours+0.085
keeping+0.082
 occasions+0.082
 avail+0.081
Neg logit contributions
uddle-0.073
s-0.068
udd-0.057
-0.054
 sad-0.054
Token,
Token ID11
Feature activation+8.74e-02

Pos logit contributions
uddle+0.069
s+0.053
udd+0.049
+0.041
 She+0.039
Neg logit contributions
 matter-0.128
 remind-0.111
 yours-0.108
icians-0.108
 letting-0.108
Token "
Token ID366
Feature activation-3.24e-02

Pos logit contributions
uddle+0.092
s+0.080
+0.077
 She+0.076
 sad+0.075
Neg logit contributions
 matter-0.217
 avail-0.198
 occasions-0.195
igh-0.195
icians-0.195
TokenYes
Token ID5297
Feature activation-1.86e-01

Pos logit contributions
 matter+0.067
igh+0.066
ocket+0.066
 avail+0.065
pa+0.064
Neg logit contributions
s-0.040
uddle-0.039
 She-0.038
-0.036
udd-0.034
Token,
Token ID11
Feature activation+3.52e-02

Pos logit contributions
 matter+0.476
 yours+0.420
 remind+0.414
 letting+0.406
fulness+0.405
Neg logit contributions
uddle-0.211
udd-0.141
s-0.140
-0.117
 gym-0.109
Token but
Token ID475
Feature activation-1.80e-01

Pos logit contributions
uddle+0.037
s+0.037
 little+0.036
,+0.036
 She+0.036
Neg logit contributions
bet-0.076
igh-0.076
icians-0.076
 avail-0.075
ocket-0.074
Token they
Token ID484
Feature activation+2.33e-02

Pos logit contributions
keeping+0.420
icians+0.409
ocket+0.405
bons+0.397
igh+0.386
Neg logit contributions
,-0.191
uddle-0.175
 nicely-0.174
 sad-0.172
s-0.169
Token playing
Token ID2712
Feature activation+8.91e-02

Pos logit contributions
uddle+0.030
s+0.025
udd+0.023
+0.021
 She+0.020
Neg logit contributions
 matter-0.033
 remind-0.028
icians-0.027
 yours-0.027
 letting-0.027
Token with
Token ID351
Feature activation-6.60e-02

Pos logit contributions
uddle+0.134
s+0.110
udd+0.102
+0.085
 She+0.085
Neg logit contributions
 matter-0.193
 remind-0.167
 letting-0.162
icians-0.162
 yours-0.160
Token his
Token ID465
Feature activation+6.20e-03

Pos logit contributions
 matter+0.161
pper+0.147
illa+0.146
ocket+0.144
 remind+0.143
Neg logit contributions
uddle-0.072
s-0.057
 big-0.053
 gym-0.048
-0.048
Token toy
Token ID13373
Feature activation-7.67e-02

Pos logit contributions
uddle+0.009
s+0.007
udd+0.006
+0.005
 gym+0.005
Neg logit contributions
 matter-0.014
 remind-0.012
icians-0.012
illa-0.012
 letting-0.012
Token airplane
Token ID19401
Feature activation-2.79e-02

Pos logit contributions
 matter+0.182
 remind+0.157
icians+0.155
 yours+0.154
illa+0.153
Neg logit contributions
uddle-0.101
s-0.079
udd-0.070
-0.060
 She-0.058
Token in
Token ID287
Feature activation-5.55e-02

Pos logit contributions
 matter+0.075
illa+0.068
 avail+0.068
bet+0.067
ocket+0.067
Neg logit contributions
uddle-0.024
s-0.020
,-0.015
 back-0.014
 gym-0.014
Token the
Token ID262
Feature activation+3.53e-02

Pos logit contributions
 matter+0.142
pper+0.139
ocket+0.134
isdom+0.132
illa+0.131
Neg logit contributions
-0.045
 big-0.042
uddle-0.039
-0.039
 He-0.038
Token backyard
Token ID24296
Feature activation+4.54e-02

Pos logit contributions
uddle+0.053
s+0.044
udd+0.039
 She+0.032
+0.032
Neg logit contributions
 matter-0.078
 remind-0.066
pa-0.065
 yours-0.065
bet-0.065
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
uddle+0.048
s+0.037
udd+0.031
+0.026
 She+0.025
Neg logit contributions
 matter-0.118
 remind-0.103
icians-0.102
 yours-0.102
ocket-0.102
Token Suddenly
Token ID24975
Feature activation-4.91e-03

Pos logit contributions
uddle+0.023
udd+0.017
s+0.017
+0.010
uffy+0.010
Neg logit contributions
 matter-0.047
 lending-0.041
 letting-0.041
 yours-0.041
 remind-0.041
Token,
Token ID11
Feature activation+6.70e-02

Pos logit contributions
 matter+0.013
 remind+0.011
 avail+0.011
 yours+0.011
pa+0.011
Neg logit contributions
uddle-0.005
s-0.003
udd-0.003
-0.002
 She-0.002
TokenI
Token ID40
Feature activation-9.15e-02

Pos logit contributions
 matter+0.127
 avail+0.119
pa+0.117
ocket+0.115
igh+0.113
Neg logit contributions
s-0.042
-0.039
 She-0.037
uddle-0.035
-0.035
Token'm
Token ID1101
Feature activation-5.23e-02

Pos logit contributions
 matter+0.226
 avail+0.206
bet+0.201
icians+0.201
bons+0.200
Neg logit contributions
uddle-0.100
s-0.091
-0.069
udd-0.068
-0.068
Token a
Token ID257
Feature activation-2.53e-02

Pos logit contributions
 matter+0.111
 occasions+0.100
bet+0.094
bons+0.093
 hospitality+0.090
Neg logit contributions
uddle-0.070
 sad-0.068
 big-0.064
 She-0.063
-0.062
Token brown
Token ID7586
Feature activation-4.28e-02

Pos logit contributions
 matter+0.054
icians+0.048
pa+0.047
 avail+0.047
 occasions+0.047
Neg logit contributions
uddle-0.037
s-0.029
udd-0.026
 big-0.026
 sad-0.025
Token spider
Token ID19230
Feature activation-1.67e-02

Pos logit contributions
 matter+0.085
icians+0.076
 remind+0.073
 yours+0.072
 avail+0.071
Neg logit contributions
uddle-0.071
s-0.057
udd-0.053
 She-0.049
-0.048
Token who
Token ID508
Feature activation-1.05e-01

Pos logit contributions
 matter+0.043
 avail+0.038
bet+0.037
icated+0.036
epad+0.035
Neg logit contributions
uddle-0.018
s-0.016
,-0.013
udd-0.012
 She-0.012
Token lives
Token ID3160
Feature activation-1.34e-02

Pos logit contributions
 matter+0.235
 avail+0.230
bet+0.218
keeping+0.210
 occasions+0.208
Neg logit contributions
s-0.138
uddle-0.125
 She-0.107
-0.106
 big-0.103
Token in
Token ID287
Feature activation-5.50e-02

Pos logit contributions
 matter+0.034
 avail+0.029
bet+0.029
 generosity+0.028
icated+0.028
Neg logit contributions
uddle-0.015
s-0.013
udd-0.011
-0.011
 nicely-0.010
Token this
Token ID428
Feature activation+1.82e-03

Pos logit contributions
 matter+0.144
pper+0.134
bet+0.132
epad+0.130
icians+0.129
Neg logit contributions
 big-0.046
-0.043
-0.040
s-0.039
 She-0.038
Token shed
Token ID14999
Feature activation-2.89e-02

Pos logit contributions
uddle+0.003
s+0.003
udd+0.002
ashes+0.002
uddled+0.002
Neg logit contributions
 matter-0.004
 remind-0.003
 yours-0.003
pa-0.003
 disguise-0.003
Token and
Token ID290
Feature activation-7.78e-02

Pos logit contributions
 matter+0.073
 avail+0.064
bet+0.063
 occasions+0.062
icians+0.061
Neg logit contributions
uddle-0.034
s-0.028
udd-0.024
 She-0.022
-0.021
Token tree
Token ID5509
Feature activation+3.16e-02

Pos logit contributions
uddle+0.102
s+0.080
udd+0.075
+0.063
 She+0.062
Neg logit contributions
 matter-0.157
 remind-0.136
 yours-0.134
icians-0.132
 letting-0.132
Token,
Token ID11
Feature activation+4.93e-02

Pos logit contributions
uddle+0.045
s+0.037
udd+0.033
+0.029
 She+0.028
Neg logit contributions
 matter-0.071
 remind-0.060
icians-0.060
bet-0.059
 avail-0.059
Token I
Token ID314
Feature activation-5.37e-02

Pos logit contributions
uddle+0.065
 big+0.053
+0.051
 sad+0.050
 She+0.049
Neg logit contributions
 matter-0.105
 occasions-0.102
 avail-0.099
icians-0.099
ocket-0.097
Token'm
Token ID1101
Feature activation-2.87e-02

Pos logit contributions
 matter+0.116
 avail+0.104
icians+0.100
icated+0.100
pa+0.099
Neg logit contributions
uddle-0.081
s-0.068
udd-0.062
-0.056
-0.054
Token helping
Token ID5742
Feature activation-6.94e-02

Pos logit contributions
 matter+0.065
 occasions+0.057
bet+0.055
icians+0.055
 avail+0.055
Neg logit contributions
uddle-0.038
s-0.031
 sad-0.030
udd-0.028
 big-0.028
Token you
Token ID345
Feature activation-1.26e-02

Pos logit contributions
 matter+0.141
ocket+0.120
epad+0.118
 avail+0.117
bet+0.116
Neg logit contributions
uddle-0.109
s-0.097
-0.085
 nicely-0.084
udd-0.081
Token!"
Token ID2474
Feature activation+6.51e-02

Pos logit contributions
 matter+0.025
 remind+0.021
icians+0.021
pa+0.020
 yours+0.020
Neg logit contributions
uddle-0.021
s-0.018
udd-0.016
-0.015
 She-0.015
Token
Token ID198
Feature activation+2.12e-02

Pos logit contributions
uddle+0.048
s+0.029
udd+0.023
+0.014
 She+0.012
Neg logit contributions
 matter-0.192
 remind-0.171
icians-0.169
 yours-0.169
 letting-0.168
Token
Token ID198
Feature activation+6.22e-03

Pos logit contributions
uddle+0.025
udd+0.017
s+0.015
 sad+0.013
eeper+0.013
Neg logit contributions
 matter-0.047
 letting-0.044
 remind-0.044
icated-0.044
illa-0.044
TokenThe
Token ID464
Feature activation-3.71e-02

Pos logit contributions
uddle+0.009
s+0.007
udd+0.007
+0.006
 sad+0.006
Neg logit contributions
 matter-0.014
 remind-0.012
 letting-0.012
icians-0.012
icated-0.012
Token little
Token ID1310
Feature activation-3.56e-02

Pos logit contributions
 matter+0.076
 remind+0.061
 letting+0.061
 protection+0.060
 generosity+0.058
Neg logit contributions
uddle-0.059
s-0.053
udd-0.050
ashes-0.039
eeper-0.039
Token sang
Token ID25889
Feature activation-2.66e-02

Pos logit contributions
 matter+0.056
 yours+0.051
 occasions+0.050
bet+0.050
 avail+0.049
Neg logit contributions
uddle-0.053
s-0.051
udd-0.044
-0.042
 sad-0.042
Token loud
Token ID7812
Feature activation-4.60e-03

Pos logit contributions
 matter+0.060
 remind+0.051
icians+0.050
 yours+0.050
 letting+0.050
Neg logit contributions
uddle-0.038
s-0.031
udd-0.028
-0.024
 She-0.024
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
 matter+0.011
 remind+0.010
icians+0.009
illa+0.009
 yours+0.009
Neg logit contributions
uddle-0.006
s-0.005
udd-0.004
-0.004
 She-0.003
Token happy
Token ID3772
Feature activation+7.15e-02

Pos logit contributions
 matter+0.055
illa+0.052
igh+0.051
pa+0.050
bet+0.049
Neg logit contributions
uddle-0.031
s-0.028
 sad-0.024
udd-0.023
 big-0.022
Token.
Token ID13
Feature activation+3.13e-02

Pos logit contributions
uddle+0.097
s+0.077
udd+0.069
+0.059
 She+0.058
Neg logit contributions
 matter-0.166
 remind-0.143
icians-0.141
 yours-0.141
illa-0.140
Token They
Token ID1119
Feature activation-2.81e-02

Pos logit contributions
uddle+0.034
s+0.023
udd+0.021
 sad+0.014
+0.014
Neg logit contributions
 matter-0.082
 letting-0.072
 yours-0.071
 remind-0.071
icated-0.071
Token sang
Token ID25889
Feature activation-2.71e-02

Pos logit contributions
 matter+0.053
 yours+0.048
 occasions+0.048
 avail+0.048
bet+0.048
Neg logit contributions
uddle-0.046
s-0.043
udd-0.039
-0.035
 sad-0.035
Token,
Token ID11
Feature activation+7.69e-02

Pos logit contributions
 matter+0.057
 remind+0.049
 yours+0.048
 letting+0.048
icians+0.048
Neg logit contributions
uddle-0.043
s-0.034
udd-0.032
-0.028
 She-0.027
Token "
Token ID366
Feature activation-3.22e-02

Pos logit contributions
s+0.110
+0.110
 sad+0.104
 She+0.104
uddle+0.103
Neg logit contributions
 matter-0.160
illa-0.150
 avail-0.148
pa-0.141
 occasions-0.138
TokenPot
Token ID25396
Feature activation-1.55e-01

Pos logit contributions
 matter+0.070
pa+0.063
 avail+0.062
 yours+0.061
ocket+0.061
Neg logit contributions
uddle-0.043
s-0.040
udd-0.033
-0.033
-0.032
Tokenato
Token ID5549
Feature activation-6.04e-02

Pos logit contributions
 matter+0.265
ocket+0.227
keeping+0.222
 remind+0.219
epad+0.214
Neg logit contributions
uddle-0.298
s-0.281
udd-0.259
-0.238
-0.231
TokenRE
Token ID2200
Feature activation-5.40e-02

Pos logit contributions
 matter+0.107
 remind+0.089
icians+0.088
 letting+0.087
bet+0.087
Neg logit contributions
uddle-0.090
s-0.074
udd-0.070
-0.060
 She-0.060
TokenVERS
Token ID28884
Feature activation-1.86e-03

Pos logit contributions
 matter+0.084
bet+0.074
pper+0.073
icians+0.072
ocket+0.071
Neg logit contributions
uddle-0.101
s-0.094
udd-0.080
-0.079
 She-0.078
TokenE
Token ID36
Feature activation-8.21e-02

Pos logit contributions
 matter+0.004
illa+0.003
icians+0.003
pper+0.003
 yours+0.003
Neg logit contributions
uddle-0.003
s-0.002
udd-0.002
-0.002
-0.002
Token,
Token ID11
Feature activation+6.35e-02

Pos logit contributions
 matter+0.169
pper+0.148
illa+0.146
pa+0.145
 yours+0.145
Neg logit contributions
uddle-0.127
s-0.099
udd-0.095
 She-0.085
 gym-0.085
Token huh
Token ID24926
Feature activation-4.63e-02

Pos logit contributions
uddle+0.078
 sad+0.066
s+0.062
 big+0.061
 He+0.061
Neg logit contributions
 matter-0.145
 occasions-0.141
 avail-0.138
epad-0.136
bons-0.135
Token?
Token ID30
Feature activation-3.57e-02

Pos logit contributions
 matter+0.090
 yours+0.076
illa+0.075
pa+0.075
 remind+0.074
Neg logit contributions
uddle-0.080
s-0.066
udd-0.063
 gym-0.055
-0.054
Token That
Token ID1320
Feature activation-7.24e-02

Pos logit contributions
 matter+0.066
 remind+0.057
 letting+0.056
illa+0.056
igh+0.056
Neg logit contributions
uddle-0.063
s-0.050
udd-0.050
 gym-0.040
 sad-0.040
Token's
Token ID338
Feature activation-7.80e-02

Pos logit contributions
 matter+0.177
icians+0.160
 avail+0.158
 occasions+0.158
bet+0.158
Neg logit contributions
uddle-0.086
s-0.072
udd-0.057
-0.051
 sad-0.049
Token what
Token ID644
Feature activation-5.93e-03

Pos logit contributions
 occasions+0.177
keeping+0.169
epad+0.168
icians+0.167
 avail+0.167
Neg logit contributions
uddle-0.085
 sad-0.083
 big-0.076
 nicely-0.075
 well-0.070
Token you
Token ID345
Feature activation-3.20e-02

Pos logit contributions
 matter+0.012
bet+0.012
pper+0.011
 avail+0.011
illa+0.011
Neg logit contributions
uddle-0.009
s-0.008
-0.007
-0.007
 big-0.007
Token get
Token ID651
Feature activation+7.53e-02

Pos logit contributions
 matter+0.064
 avail+0.061
 occasions+0.060
bet+0.058
icians+0.057
Neg logit contributions
uddle-0.049
s-0.043
udd-0.039
 sad-0.035
-0.035
Token!
Token ID0
Feature activation+2.53e-02

Pos logit contributions
uddle+0.013
s+0.012
,+0.009
udd+0.009
 She+0.009
Neg logit contributions
 matter-0.025
icians-0.023
bet-0.023
ocket-0.023
 avail-0.023
Token She
Token ID1375
Feature activation-1.90e-02

Pos logit contributions
uddle+0.032
s+0.023
udd+0.023
 sad+0.016
 gym+0.015
Neg logit contributions
 matter-0.060
 remind-0.052
 letting-0.052
illa-0.052
 yours-0.052
Token looked
Token ID3114
Feature activation+7.22e-02

Pos logit contributions
 matter+0.044
 yours+0.040
 avail+0.039
 remind+0.038
epad+0.038
Neg logit contributions
uddle-0.026
s-0.023
udd-0.018
-0.017
 She-0.017
Token around
Token ID1088
Feature activation+6.76e-02

Pos logit contributions
uddle+0.107
s+0.081
udd+0.078
+0.068
 gym+0.065
Neg logit contributions
 matter-0.159
 remind-0.138
 yours-0.131
 letting-0.131
pa-0.129
Token,
Token ID11
Feature activation+7.78e-02

Pos logit contributions
uddle+0.065
s+0.054
 She+0.041
udd+0.040
+0.040
Neg logit contributions
 matter-0.178
icians-0.159
bet-0.158
pper-0.157
 yours-0.157
Token but
Token ID475
Feature activation-1.66e-01

Pos logit contributions
uddle+0.104
s+0.096
 sad+0.086
 She+0.085
+0.084
Neg logit contributions
 matter-0.174
 occasions-0.155
 avail-0.152
igh-0.151
 yours-0.149
Token she
Token ID673
Feature activation+6.82e-02

Pos logit contributions
 matter+0.358
 occasions+0.312
bet+0.312
icians+0.307
bons+0.305
Neg logit contributions
uddle-0.245
s-0.217
 She-0.185
 sad-0.181
-0.178
Token couldn
Token ID3521
Feature activation-2.38e-02

Pos logit contributions
uddle+0.116
s+0.098
udd+0.089
+0.080
 She+0.080
Neg logit contributions
 matter-0.137
 yours-0.118
 occasions-0.116
 remind-0.115
icated-0.114
Token't
Token ID470
Feature activation-5.32e-02

Pos logit contributions
 matter+0.030
 remind+0.026
 yours+0.025
e+0.024
 letting+0.024
Neg logit contributions
uddle-0.055
udd-0.048
s-0.040
ashing-0.040
ashes-0.039
Token see
Token ID766
Feature activation+2.11e-02

Pos logit contributions
 matter+0.100
 remind+0.085
illa+0.080
ocket+0.080
 letting+0.079
Neg logit contributions
uddle-0.099
s-0.081
udd-0.079
-0.068
 gym-0.065
Token anyone
Token ID2687
Feature activation+4.75e-02

Pos logit contributions
uddle+0.027
s+0.027
+0.025
 big+0.025
 back+0.025
Neg logit contributions
 matter-0.042
 occasions-0.039
 generosity-0.039
 hospitality-0.039
bek-0.038
Token you
Token ID345
Feature activation-5.10e-02

Pos logit contributions
s+0.043
uddle+0.040
+0.039
 big+0.037
 She+0.036
Neg logit contributions
bet-0.066
keeping-0.065
igh-0.063
epad-0.062
bons-0.062
Token can
Token ID460
Feature activation-8.29e-02

Pos logit contributions
 matter+0.119
 occasions+0.108
keeping+0.107
 avail+0.107
icians+0.106
Neg logit contributions
uddle-0.067
s-0.055
udd-0.047
 big-0.042
-0.042
Token ride
Token ID6594
Feature activation-3.91e-02

Pos logit contributions
 matter+0.151
 remind+0.123
icians+0.122
 yours+0.121
ocket+0.120
Neg logit contributions
uddle-0.154
s-0.133
udd-0.122
-0.112
 She-0.110
Token Star
Token ID2907
Feature activation-2.67e-02

Pos logit contributions
 matter+0.084
epad+0.071
bet+0.070
 remind+0.070
ocket+0.069
Neg logit contributions
uddle-0.057
s-0.056
-0.045
-0.044
 She-0.043
Token,
Token ID11
Feature activation+4.79e-02

Pos logit contributions
 matter+0.064
 remind+0.055
icians+0.054
bet+0.054
 avail+0.054
Neg logit contributions
uddle-0.034
s-0.028
udd-0.024
-0.021
 She-0.020
Token but
Token ID475
Feature activation-1.75e-01

Pos logit contributions
s+0.055
uddle+0.053
+0.050
 She+0.048
 big+0.046
Neg logit contributions
bet-0.109
keeping-0.106
 matter-0.106
bons-0.103
 avail-0.102
Token be
Token ID307
Feature activation-3.83e-02

Pos logit contributions
icians+0.368
bet+0.367
keeping+0.366
igh+0.362
ples+0.362
Neg logit contributions
s-0.215
uddle-0.205
-0.205
 She-0.187
 sad-0.182
Token gentle
Token ID10296
Feature activation-3.98e-02

Pos logit contributions
 matter+0.089
 occasions+0.082
fulness+0.079
icated+0.079
igh+0.079
Neg logit contributions
uddle-0.046
 sad-0.038
-0.038
s-0.037
 big-0.037
Token."
Token ID526
Feature activation+4.58e-02

Pos logit contributions
 matter+0.109
 avail+0.094
 remind+0.092
bet+0.092
icians+0.092
Neg logit contributions
uddle-0.039
s-0.033
udd-0.027
 She-0.026
-0.026
Token
Token ID198
Feature activation+1.08e-02

Pos logit contributions
uddle+0.058
udd+0.041
obb+0.036
eeper+0.036
icker+0.035
Neg logit contributions
 matter-0.099
 occasions-0.092
bons-0.092
 letting-0.091
 bid-0.090
Token
Token ID198
Feature activation-1.64e-03

Pos logit contributions
uddle+0.015
udd+0.010
eeper+0.009
obb+0.009
 gym+0.008
Neg logit contributions
 matter-0.023
 remind-0.020
 occasions-0.020
 letting-0.020
 granted-0.020
Token kind
Token ID1611
Feature activation-1.23e-02

Pos logit contributions
 matter+0.042
 letting+0.036
 remind+0.036
 yours+0.035
 lending+0.034
Neg logit contributions
uddle-0.043
s-0.035
udd-0.032
 gym-0.030
-0.029
Token and
Token ID290
Feature activation-4.93e-02

Pos logit contributions
 matter+0.034
 avail+0.031
icians+0.030
 remind+0.030
bet+0.029
Neg logit contributions
uddle-0.011
s-0.010
udd-0.007
-0.006
 She-0.006
Token generous
Token ID14431
Feature activation-2.66e-02

Pos logit contributions
 matter+0.096
igh+0.091
illa+0.089
icians+0.089
 avail+0.086
Neg logit contributions
uddle-0.073
s-0.065
 nicely-0.063
 sad-0.059
 big-0.058
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
 matter+0.073
 avail+0.066
icians+0.065
bet+0.064
 remind+0.064
Neg logit contributions
uddle-0.024
s-0.020
udd-0.014
-0.012
 nicely-0.012
Token You
Token ID921
Feature activation-1.17e-01

Pos logit contributions
uddle+0.024
s+0.021
 She+0.020
+0.019
 He+0.019
Neg logit contributions
 matter-0.033
 remind-0.028
icians-0.028
 avail-0.027
epad-0.027
Token should
Token ID815
Feature activation-1.63e-01

Pos logit contributions
 matter+0.199
 avail+0.169
bet+0.164
 occasions+0.161
 yours+0.160
Neg logit contributions
uddle-0.224
s-0.204
udd-0.182
-0.173
-0.172
Token apologize
Token ID16521
Feature activation+1.51e-02

Pos logit contributions
 matter+0.296
icians+0.267
ocket+0.263
igh+0.256
 avail+0.250
Neg logit contributions
uddle-0.276
s-0.237
udd-0.223
 She-0.220
-0.219
Token to
Token ID284
Feature activation-6.38e-02

Pos logit contributions
uddle+0.017
s+0.014
udd+0.012
+0.009
 She+0.009
Neg logit contributions
 matter-0.038
icians-0.033
 remind-0.033
bet-0.033
 avail-0.033
Token each
Token ID1123
Feature activation+9.26e-03

Pos logit contributions
 matter+0.141
ocket+0.129
epad+0.128
igh+0.126
bet+0.126
Neg logit contributions
uddle-0.075
s-0.065
 nicely-0.064
-0.063
udd-0.058
Token other
Token ID584
Feature activation+3.25e-02

Pos logit contributions
uddle+0.010
s+0.008
udd+0.006
+0.006
+0.006
Neg logit contributions
 matter-0.023
icians-0.021
 avail-0.021
ocket-0.021
bet-0.021
Token and
Token ID290
Feature activation-5.94e-02

Pos logit contributions
uddle+0.038
s+0.030
udd+0.026
+0.021
 She+0.021
Neg logit contributions
 matter-0.081
 remind-0.070
icians-0.070
 yours-0.069
bet-0.069
Token I
Token ID314
Feature activation-6.39e-02

Pos logit contributions
uddle+0.021
s+0.017
 She+0.016
+0.016
+0.015
Neg logit contributions
 matter-0.038
icians-0.031
 avail-0.031
 remind-0.031
pa-0.031
Token was
Token ID373
Feature activation-1.58e-03

Pos logit contributions
 matter+0.129
 avail+0.113
icians+0.112
 yours+0.111
bet+0.109
Neg logit contributions
uddle-0.104
s-0.091
udd-0.082
-0.072
-0.072
Token wrong
Token ID2642
Feature activation+1.30e-01

Pos logit contributions
 matter+0.004
 occasions+0.003
 avail+0.003
bet+0.003
bons+0.003
Neg logit contributions
uddle-0.002
 sad-0.002
s-0.002
udd-0.001
 big-0.001
Token.
Token ID13
Feature activation+1.90e-02

Pos logit contributions
uddle+0.123
s+0.101
udd+0.076
+0.068
+0.066
Neg logit contributions
 matter-0.355
 avail-0.313
bet-0.310
 remind-0.309
icians-0.308
Token I
Token ID314
Feature activation-6.02e-02

Pos logit contributions
uddle+0.026
 She+0.023
 The+0.023
+0.022
s+0.022
Neg logit contributions
 matter-0.042
 avail-0.035
icians-0.035
 remind-0.034
pa-0.034
Token should
Token ID815
Feature activation-1.60e-01

Pos logit contributions
 matter+0.120
icians+0.103
 avail+0.103
 yours+0.103
bet+0.101
Neg logit contributions
uddle-0.100
s-0.086
udd-0.078
-0.069
-0.068
Token have
Token ID423
Feature activation-5.76e-02

Pos logit contributions
 matter+0.330
icians+0.301
ocket+0.295
bet+0.294
 sizes+0.292
Neg logit contributions
uddle-0.220
s-0.182
 She-0.180
 sad-0.177
-0.174
Token listened
Token ID16399
Feature activation+1.02e-01

Pos logit contributions
 matter+0.103
ocket+0.099
igh+0.099
 avail+0.097
fulness+0.095
Neg logit contributions
uddle-0.086
s-0.078
-0.074
 nicely-0.072
udd-0.071
Token.
Token ID13
Feature activation+2.30e-02

Pos logit contributions
uddle+0.106
s+0.086
udd+0.071
+0.057
 She+0.057
Neg logit contributions
 matter-0.265
icians-0.232
 remind-0.231
 avail-0.231
bet-0.229
Token The
Token ID383
Feature activation-5.53e-02

Pos logit contributions
uddle+0.034
 She+0.029
s+0.029
+0.028
 He+0.028
Neg logit contributions
 matter-0.049
icians-0.041
 avail-0.040
 remind-0.040
bet-0.040
Token grass
Token ID8701
Feature activation-2.46e-02

Pos logit contributions
 matter+0.116
 avail+0.102
icians+0.102
pper+0.100
illa+0.100
Neg logit contributions
uddle-0.089
s-0.068
udd-0.063
 gym-0.062
 big-0.062
Token put
Token ID1234
Feature activation-2.26e-02

Pos logit contributions
 matter+0.035
 yours+0.032
 avail+0.031
keeping+0.031
icated+0.030
Neg logit contributions
uddle-0.024
s-0.022
udd-0.018
-0.017
 big-0.016
Token them
Token ID606
Feature activation+8.02e-02

Pos logit contributions
 matter+0.058
epad+0.052
isdom+0.052
icated+0.051
 occasions+0.051
Neg logit contributions
s-0.023
uddle-0.023
-0.019
 back-0.019
 big-0.019
Token on
Token ID319
Feature activation-8.75e-02

Pos logit contributions
uddle+0.097
s+0.082
udd+0.064
 back+0.060
 sad+0.056
Neg logit contributions
 matter-0.202
 occasions-0.173
 avail-0.173
icians-0.173
icated-0.171
Token and
Token ID290
Feature activation-6.65e-02

Pos logit contributions
 matter+0.218
icians+0.198
ocket+0.195
pper+0.193
igh+0.192
Neg logit contributions
uddle-0.092
s-0.079
 She-0.064
-0.064
udd-0.061
Token gave
Token ID2921
Feature activation-7.79e-02

Pos logit contributions
 matter+0.143
 yours+0.127
icians+0.124
 occasions+0.124
icated+0.124
Neg logit contributions
uddle-0.094
s-0.091
udd-0.072
 She-0.072
 big-0.070
Token Grand
Token ID5675
Feature activation-1.44e-01

Pos logit contributions
 matter+0.175
icians+0.157
 cherish+0.156
ocket+0.155
epad+0.154
Neg logit contributions
uddle-0.095
s-0.093
-0.075
udd-0.073
 She-0.072
Tokenad
Token ID324
Feature activation-2.48e-02

Pos logit contributions
 matter+0.207
epad+0.195
ocket+0.178
 yours+0.174
 letting+0.172
Neg logit contributions
uddle-0.317
s-0.286
udd-0.284
-0.235
-0.224
Token a
Token ID257
Feature activation-3.28e-02

Pos logit contributions
 matter+0.055
 avail+0.047
 remind+0.046
icians+0.046
 lending+0.045
Neg logit contributions
uddle-0.036
s-0.031
udd-0.027
-0.024
 She-0.024
Token playful
Token ID34264
Feature activation-5.62e-02

Pos logit contributions
 matter+0.078
 remind+0.068
icians+0.067
 yours+0.067
 letting+0.066
Neg logit contributions
uddle-0.043
s-0.033
udd-0.029
-0.025
 She-0.024
Token punch
Token ID10862
Feature activation-6.17e-02

Pos logit contributions
 matter+0.109
icians+0.106
illa+0.097
pper+0.096
pa+0.095
Neg logit contributions
uddle-0.095
s-0.074
 big-0.069
 sad-0.067
udd-0.065
Token.
Token ID13
Feature activation+3.22e-02

Pos logit contributions
 matter+0.179
icians+0.162
 avail+0.161
 remind+0.155
illa+0.155
Neg logit contributions
uddle-0.051
s-0.043
udd-0.025
 She-0.024
 He-0.021