TOP ACTIVATIONS
MAX = 21.931

smiled . " Looks like the frog likes you
table . " Looks like mom has some mail
laughed and said , " Looks like that apple made you
's mom my . " Looks like the wave destroyed all
smiled and said , " Looks like you like the wardrobe
the owner smiled . " Looks like you two are friends
should be compassionate . Comp assion ate means you care about
- three , forty - four , forty - five ,
garden . He was immediately aw est ruck .
the sky . Lily felt aw ed , with her eyes
truck . He was so eng ross ed that he didn
up at the stars , aw est ruck that the night
- three , twenty - four , twenty - five ,
- three , thirty - four , thirty - five ,
mom saw Jane looking en viously at the big kids and
- three , thirty - four , thirty - five ,
made her spark le and gle am . She felt so
gross that it made Jack grim ace . He wanted to
Go on , John . Beh ave like a brave boy
for the toast and smiled gle efully . His siblings were

MIN ACTIVATIONS
MIN = -19.171

You have to keep trying and learning and having fun .
to build towers and houses and cars with the blocks .
, she searched the maze and finally found the way out
't wait to tell mummy and daddy what aunt had whispered
liked to play with cars and robots . He saw Angel
walk home , holding hands and signing . Lily is happy
They pretend to be astronauts and explore the universe . They
I want to just sit and rest ." Charlie
They pretended to be knights and shot the arrow at imaginary
jet tried to turn around and go back home , but
to keep trying and learning and having fun . That 's
have to choose to respect and learn . You have to
my saw what had happened and told him that he should
just need to be careful and not scratch it . It
important to ask for help and that peaceful acts of kindness
mum saw what had happened and she rushed to Jem ma
matter . You should share and be kind to each other
You have to be careful and ask me first . Do
wanted to keep it clean and warm always . The end
She lly thought and thought and then she had an idea

INTERVAL 11.655 - 21.931
CONTAINS 0.312%

rocked him , Tim my felt better and fell asleep holding
first , but then he replied " I was just looking
o . L ily felt better and they
than yours !" L ily felt hurt by Ben
washing it ." L ily smiled and put the

INTERVAL 1.380 - 11.655
CONTAINS 48.889%

tri pped over her mom 's foot .
return . They are a happy family . <|endoftext|> It was
chew on the fence to be strong , Max . Just
the different things he could see . John
. He was going to take a trip with his mom

INTERVAL -8.895 - 1.380
CONTAINS 38.982%

up and down hills , over bumps and around corners .
But I want to know what you write ," Lily said
to start to eat the snow and begin licking the lake
Suddenly , the man with the dog walked over
give him a treat and he would be her friend .

INTERVAL -19.171 - -8.895
CONTAINS 11.817%

checked all the usual places , but she could not find
she arrived at a clearing , and there she saw a
. They stick the stamps and stickers on the envelope carefully
Lisa continued on , determined to find the source of the
, they cut it open and enjoyed a delicious snack together
Token smiled
Token ID13541
Feature activation+1.07e+01

Pos logit contributions
 spine+0.076
 to+0.070
cone+0.070
round+0.070
driver+0.068
Neg logit contributions
Looks-0.066
have-0.057
V-0.056
 shin-0.056
 funn-0.055
Token.
Token ID13
Feature activation+2.61e+00

Pos logit contributions
Looks+0.486
 shin+0.452
have+0.450
 funn+0.445
 prett+0.428
Neg logit contributions
round-0.061
cone-0.055
ipop-0.042
driver-0.042
 to-0.042
Token
Token ID198
Feature activation+5.72e+00

Pos logit contributions
Looks+0.076
have+0.066
 shin+0.063
V+0.062
 funn+0.062
Neg logit contributions
 and-0.062
 to-0.061
round-0.057
cone-0.056
,-0.055
Token
Token ID198
Feature activation+5.74e+00

Pos logit contributions
Looks+0.221
 funn+0.204
have+0.199
 shin+0.196
icious+0.195
Neg logit contributions
 and-0.117
 to-0.109
,-0.099
 Bob-0.086
 but-0.085
Token"
Token ID1
Feature activation+1.74e+00

Pos logit contributions
 funn+0.208
Looks+0.205
 fullest+0.192
 shin+0.191
 prett+0.191
Neg logit contributions
 and-0.113
 to-0.094
,-0.094
 but-0.080
 after-0.075
TokenLooks
Token ID41102
Feature activation+2.19e+01

Pos logit contributions
have+0.042
its+0.042
 rightful+0.041
 mistakes+0.041
Looks+0.041
Neg logit contributions
 and-0.045
 to-0.044
round-0.040
 but-0.040
cone-0.038
Token like
Token ID588
Feature activation+1.16e+00

Pos logit contributions
Looks+0.894
icious+0.790
have+0.783
V+0.773
Gr+0.753
Neg logit contributions
 and-0.376
 to-0.337
,-0.308
 but-0.294
 because-0.261
Token the
Token ID262
Feature activation+6.48e+00

Pos logit contributions
Looks+0.030
have+0.027
 shin+0.025
 funn+0.024
icious+0.024
Neg logit contributions
 and-0.032
 to-0.032
,-0.029
round-0.029
cone-0.028
Token frog
Token ID21264
Feature activation-2.36e+00

Pos logit contributions
Looks+0.265
have+0.242
icious+0.235
V+0.226
gr+0.225
Neg logit contributions
 and-0.101
 to-0.094
,-0.086
 but-0.082
 Bob-0.079
Token likes
Token ID7832
Feature activation+8.12e+00

Pos logit contributions
 and+0.038
 to+0.037
round+0.033
cone+0.031
 Bob+0.031
Neg logit contributions
Looks-0.086
have-0.079
 shin-0.075
 funn-0.075
icious-0.074
Token you
Token ID345
Feature activation+1.55e+00

Pos logit contributions
Looks+0.271
have+0.247
 funn+0.235
 shin+0.233
icious+0.227
Neg logit contributions
 and-0.150
 to-0.145
round-0.139
cone-0.134
driver-0.131
Token table
Token ID3084
Feature activation-2.73e+00

Pos logit contributions
Looks+0.187
have+0.166
 funn+0.163
 shin+0.161
icious+0.156
Neg logit contributions
 to-0.125
 and-0.123
round-0.115
 Bob-0.115
ipop-0.111
Token.
Token ID13
Feature activation+2.16e+00

Pos logit contributions
cone+0.066
round+0.065
 to+0.062
 Bob+0.061
ipop+0.061
Neg logit contributions
Looks-0.071
have-0.065
 shin-0.062
 funn-0.059
its-0.057
Token
Token ID198
Feature activation+5.34e+00

Pos logit contributions
Looks+0.069
 shin+0.060
have+0.060
V+0.060
 funn+0.057
Neg logit contributions
round-0.042
 to-0.040
cone-0.040
 and-0.039
ipop-0.039
Token
Token ID198
Feature activation+5.37e+00

Pos logit contributions
Looks+0.194
 funn+0.176
have+0.176
icious+0.172
 shin+0.168
Neg logit contributions
 and-0.113
 to-0.109
,-0.096
 Bob-0.089
 but-0.087
Token"
Token ID1
Feature activation+1.56e+00

Pos logit contributions
Looks+0.166
 funn+0.156
have+0.150
 fullest+0.148
icious+0.148
Neg logit contributions
 and-0.129
 to-0.122
,-0.114
 but-0.104
driver-0.100
TokenLooks
Token ID41102
Feature activation+2.18e+01

Pos logit contributions
Looks+0.031
have+0.028
icious+0.027
 fullest+0.026
 shin+0.025
Neg logit contributions
 and-0.050
 to-0.048
 but-0.045
round-0.044
 Bob-0.044
Token like
Token ID588
Feature activation+8.08e-01

Pos logit contributions
Looks+0.832
have+0.731
icious+0.721
V+0.714
 shin+0.699
Neg logit contributions
 and-0.397
 to-0.373
,-0.330
 but-0.319
 because-0.292
Token mom
Token ID1995
Feature activation-8.04e+00

Pos logit contributions
Looks+0.020
have+0.019
 shin+0.017
 funn+0.017
icious+0.017
Neg logit contributions
round-0.021
cone-0.020
 and-0.020
 spine-0.020
ipop-0.019
Token has
Token ID468
Feature activation+6.16e+00

Pos logit contributions
cone+0.120
 spine+0.115
 to+0.114
round+0.112
ipop+0.108
Neg logit contributions
Looks-0.282
have-0.278
 funn-0.252
 shin-0.251
icious-0.248
Token some
Token ID617
Feature activation+6.13e+00

Pos logit contributions
Looks+0.182
have+0.158
icious+0.150
 shin+0.149
 funn+0.148
Neg logit contributions
 and-0.156
 to-0.156
,-0.139
 but-0.133
 Bob-0.132
Token mail
Token ID6920
Feature activation-1.29e+00

Pos logit contributions
Looks+0.181
have+0.163
 shin+0.152
 funn+0.152
icious+0.150
Neg logit contributions
 and-0.143
 to-0.142
round-0.130
 Bob-0.128
cone-0.127
Token laughed
Token ID13818
Feature activation+9.63e+00

Pos logit contributions
Looks+0.012
have+0.011
 funn+0.010
 shin+0.010
V+0.010
Neg logit contributions
 to-0.010
round-0.010
 spine-0.010
cone-0.010
 and-0.010
Token and
Token ID290
Feature activation-1.44e+01

Pos logit contributions
Looks+0.437
have+0.409
 funn+0.405
 shin+0.403
icious+0.390
Neg logit contributions
round-0.055
cone-0.053
ipop-0.049
 spine-0.049
driver-0.040
Token said
Token ID531
Feature activation+9.76e+00

Pos logit contributions
 and+0.342
 to+0.331
,+0.304
 Bob+0.294
 but+0.291
Neg logit contributions
Looks-0.450
have-0.414
icious-0.395
 shin-0.374
 fullest-0.374
Token,
Token ID11
Feature activation-1.04e+01

Pos logit contributions
Looks+0.407
have+0.384
 shin+0.369
 funn+0.365
V+0.362
Neg logit contributions
cone-0.084
round-0.082
 spine-0.082
 and-0.070
stack-0.065
Token "
Token ID366
Feature activation+3.62e+00

Pos logit contributions
round+0.214
 to+0.209
cone+0.206
 and+0.205
ipop+0.203
Neg logit contributions
Looks-0.315
have-0.278
 shin-0.269
V-0.264
icious-0.262
TokenLooks
Token ID41102
Feature activation+2.16e+01

Pos logit contributions
 rightful+0.078
its+0.072
 expiration+0.072
inkle+0.071
 assumptions+0.071
Neg logit contributions
 and-0.101
round-0.095
 but-0.094
 to-0.093
 while-0.087
Token like
Token ID588
Feature activation+6.78e-01

Pos logit contributions
Looks+0.868
have+0.774
icious+0.752
V+0.748
 shin+0.729
Neg logit contributions
 and-0.357
 to-0.327
,-0.299
 but-0.288
 because-0.263
Token that
Token ID326
Feature activation-3.60e+00

Pos logit contributions
Looks+0.015
have+0.014
 funn+0.013
 shin+0.013
icious+0.012
Neg logit contributions
 and-0.019
round-0.019
cone-0.019
 to-0.019
ipop-0.018
Token apple
Token ID17180
Feature activation-7.46e+00

Pos logit contributions
 and+0.063
 to+0.063
round+0.057
cone+0.055
 Bob+0.054
Neg logit contributions
Looks-0.125
have-0.114
 funn-0.109
 shin-0.108
icious-0.108
Token made
Token ID925
Feature activation+7.69e+00

Pos logit contributions
 to+0.150
round+0.150
 spine+0.149
 and+0.148
cone+0.142
Neg logit contributions
Looks-0.216
have-0.201
 funn-0.195
 shin-0.190
icious-0.190
Token you
Token ID345
Feature activation+1.31e+00

Pos logit contributions
Looks+0.242
 shin+0.199
have+0.198
V+0.188
 rightful+0.188
Neg logit contributions
 and-0.206
 to-0.201
,-0.191
 Bob-0.173
 because-0.173
Token's
Token ID338
Feature activation+5.42e+00

Pos logit contributions
Looks+0.085
 shin+0.077
have+0.077
 funn+0.075
gr+0.071
Neg logit contributions
round-0.052
cone-0.050
 to-0.048
 and-0.048
 barrel-0.047
Token mom
Token ID1995
Feature activation-8.07e+00

Pos logit contributions
Looks+0.194
have+0.175
 shin+0.166
 funn+0.165
icious+0.165
Neg logit contributions
 and-0.100
 to-0.099
,-0.086
round-0.083
 Bob-0.082
Tokenmy
Token ID1820
Feature activation+6.10e-02

Pos logit contributions
 and+0.214
 to+0.213
round+0.203
cone+0.199
 Bob+0.196
Neg logit contributions
Looks-0.205
have-0.180
 shin-0.170
 funn-0.167
icious-0.164
Token.
Token ID13
Feature activation+2.06e+00

Pos logit contributions
Looks+0.002
have+0.002
 shin+0.002
 funn+0.002
icious+0.002
Neg logit contributions
round-0.001
 to-0.001
 and-0.001
cone-0.001
ipop-0.001
Token "
Token ID366
Feature activation+3.37e+00

Pos logit contributions
Looks+0.063
have+0.053
V+0.052
 shin+0.052
icious+0.050
Neg logit contributions
 and-0.043
round-0.042
 to-0.042
cone-0.042
ipop-0.040
TokenLooks
Token ID41102
Feature activation+2.13e+01

Pos logit contributions
Looks+0.065
have+0.056
 shin+0.055
 funn+0.054
icious+0.054
Neg logit contributions
 and-0.112
 to-0.107
 but-0.101
round-0.099
 because-0.096
Token like
Token ID588
Feature activation+4.68e-01

Pos logit contributions
Looks+0.774
have+0.679
icious+0.672
V+0.667
 shin+0.643
Neg logit contributions
 and-0.425
 to-0.402
,-0.359
 but-0.357
 because-0.332
Token the
Token ID262
Feature activation+6.13e+00

Pos logit contributions
Looks+0.012
have+0.011
 funn+0.011
 shin+0.010
 prett+0.010
Neg logit contributions
round-0.012
cone-0.012
ipop-0.011
 and-0.011
 spine-0.011
Token wave
Token ID6769
Feature activation+4.25e+00

Pos logit contributions
Looks+0.215
have+0.192
icious+0.186
V+0.186
gr+0.179
Neg logit contributions
 and-0.126
 to-0.122
,-0.108
 but-0.105
 Bob-0.101
Token destroyed
Token ID6572
Feature activation+5.62e+00

Pos logit contributions
Looks+0.136
have+0.127
 funn+0.120
 shin+0.118
icious+0.116
Neg logit contributions
 to-0.078
 and-0.077
cone-0.073
round-0.071
 Bob-0.071
Token all
Token ID477
Feature activation-1.62e+00

Pos logit contributions
Looks+0.156
have+0.141
 shin+0.133
 funn+0.133
icious+0.129
Neg logit contributions
 to-0.131
 and-0.130
cone-0.125
round-0.124
ipop-0.122
Token smiled
Token ID13541
Feature activation+9.51e+00

Pos logit contributions
 to+0.242
 spine+0.240
round+0.236
cone+0.233
 and+0.228
Neg logit contributions
Looks-0.224
have-0.188
 shin-0.185
 funn-0.183
icious-0.176
Token and
Token ID290
Feature activation-1.50e+01

Pos logit contributions
Looks+0.429
 shin+0.396
have+0.396
 funn+0.390
icious+0.377
Neg logit contributions
round-0.058
cone-0.055
 to-0.050
driver-0.043
 and-0.042
Token said
Token ID531
Feature activation+8.92e+00

Pos logit contributions
 and+0.383
 to+0.382
round+0.353
cone+0.345
,+0.344
Neg logit contributions
Looks-0.414
have-0.355
 shin-0.342
 funn-0.339
icious-0.327
Token,
Token ID11
Feature activation-1.08e+01

Pos logit contributions
Looks+0.406
have+0.380
 shin+0.368
 funn+0.363
icious+0.358
Neg logit contributions
cone-0.044
round-0.043
 and-0.041
 spine-0.036
 to-0.033
Token "
Token ID366
Feature activation+2.92e+00

Pos logit contributions
 and+0.211
 to+0.211
round+0.205
cone+0.203
ipop+0.191
Neg logit contributions
Looks-0.341
have-0.302
 shin-0.291
V-0.285
icious-0.284
TokenLooks
Token ID41102
Feature activation+2.09e+01

Pos logit contributions
have+0.059
Looks+0.058
its+0.052
 assumptions+0.051
 shin+0.051
Neg logit contributions
 and-0.088
 to-0.086
round-0.081
 but-0.080
 Bob-0.078
Token like
Token ID588
Feature activation-6.56e-02

Pos logit contributions
Looks+0.819
have+0.732
icious+0.708
V+0.707
 shin+0.690
Neg logit contributions
 and-0.352
 to-0.330
,-0.294
 but-0.281
 because-0.258
Token you
Token ID345
Feature activation+6.71e-01

Pos logit contributions
 and+0.002
round+0.002
 to+0.002
cone+0.002
 spine+0.002
Neg logit contributions
Looks-0.002
have-0.001
 funn-0.001
 shin-0.001
icious-0.001
Token like
Token ID588
Feature activation-1.15e-01

Pos logit contributions
Looks+0.024
have+0.021
 shin+0.020
V+0.020
icious+0.020
Neg logit contributions
 and-0.013
 to-0.013
,-0.012
 but-0.011
 Bob-0.011
Token the
Token ID262
Feature activation+5.51e+00

Pos logit contributions
round+0.003
 spine+0.003
cone+0.003
driver+0.003
ipop+0.003
Neg logit contributions
Looks-0.003
have-0.003
 funn-0.003
 shin-0.002
 fullest-0.002
Token wardrobe
Token ID35593
Feature activation-7.64e+00

Pos logit contributions
Looks+0.142
have+0.122
icious+0.113
 shin+0.112
V+0.111
Neg logit contributions
 and-0.158
 to-0.156
,-0.143
 but-0.141
 Bob-0.138
Token the
Token ID262
Feature activation+5.54e+00

Pos logit contributions
 and+0.340
 to+0.334
,+0.299
 Bob+0.292
 but+0.289
Neg logit contributions
Looks-0.482
have-0.433
icious-0.407
 shin-0.404
 funn-0.402
Token owner
Token ID4870
Feature activation-5.67e+00

Pos logit contributions
Looks+0.271
have+0.250
icious+0.249
gr+0.244
 shin+0.242
Neg logit contributions
 and-0.046
 to-0.036
,-0.021
 but-0.019
 Bob-0.018
Token smiled
Token ID13541
Feature activation+9.37e+00

Pos logit contributions
 to+0.136
 and+0.131
round+0.125
cone+0.125
 spine+0.125
Neg logit contributions
Looks-0.161
have-0.141
 shin-0.136
 funn-0.135
V-0.130
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
Looks+0.340
have+0.308
icious+0.291
 funn+0.287
 shin+0.287
Neg logit contributions
 and-0.178
 to-0.173
,-0.152
 but-0.144
 Bob-0.140
Token "
Token ID366
Feature activation+2.73e+00

Pos logit contributions
Looks+0.045
have+0.040
 shin+0.038
 funn+0.038
V+0.037
Neg logit contributions
 and-0.029
 to-0.029
round-0.026
cone-0.025
,-0.025
TokenLooks
Token ID41102
Feature activation+2.06e+01

Pos logit contributions
its+0.053
have+0.052
 mistakes+0.052
 assumptions+0.052
Looks+0.051
Neg logit contributions
 and-0.080
round-0.074
 to-0.074
 but-0.073
cone-0.070
Token like
Token ID588
Feature activation-4.09e-01

Pos logit contributions
Looks+0.843
have+0.748
icious+0.742
V+0.731
Gr+0.712
Neg logit contributions
 and-0.335
 to-0.293
,-0.272
 but-0.265
 because-0.238
Token you
Token ID345
Feature activation+3.09e-01

Pos logit contributions
 and+0.011
round+0.011
 to+0.010
ipop+0.010
cone+0.010
Neg logit contributions
Looks-0.010
have-0.009
 funn-0.009
 shin-0.008
icious-0.008
Token two
Token ID734
Feature activation+5.01e+00

Pos logit contributions
Looks+0.011
have+0.009
 shin+0.009
Und+0.009
icious+0.009
Neg logit contributions
 and-0.007
 to-0.007
,-0.006
 but-0.006
 because-0.006
Token are
Token ID389
Feature activation+6.98e+00

Pos logit contributions
Looks+0.159
have+0.138
 shin+0.133
icious+0.132
V+0.131
Neg logit contributions
 and-0.117
 to-0.112
,-0.104
 but-0.100
 Bob-0.098
Token friends
Token ID2460
Feature activation-5.07e+00

Pos logit contributions
Looks+0.274
have+0.246
 shin+0.237
icious+0.235
V+0.233
Neg logit contributions
 and-0.114
 to-0.110
,-0.099
 Bob-0.091
 but-0.089
Token should
Token ID815
Feature activation-3.34e+00

Pos logit contributions
Looks+0.169
have+0.147
 funn+0.133
 shin+0.131
icious+0.127
Neg logit contributions
 and-0.234
 to-0.233
round-0.221
cone-0.217
,-0.216
Token be
Token ID307
Feature activation+6.53e+00

Pos logit contributions
 and+0.083
 to+0.082
,+0.075
 Bob+0.072
 but+0.071
Neg logit contributions
Looks-0.099
have-0.086
 shin-0.082
 funn-0.081
icious-0.081
Token compassionate
Token ID32533
Feature activation+4.29e+00

Pos logit contributions
Looks+0.188
icious+0.173
 fullest+0.167
V+0.148
 shin+0.147
Neg logit contributions
 to-0.206
 and-0.197
,-0.184
 because-0.163
 but-0.158
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
Looks+0.107
have+0.105
gr+0.097
 funn+0.096
V+0.091
Neg logit contributions
cone-0.106
round-0.100
ipop-0.100
 spine-0.096
stack-0.096
Token Comp
Token ID3082
Feature activation+4.19e+00

Pos logit contributions
Looks+0.035
have+0.030
 shin+0.028
icious+0.028
 funn+0.028
Neg logit contributions
 and-0.047
 to-0.046
,-0.043
 Bob-0.042
round-0.042
Tokenassion
Token ID11857
Feature activation+1.91e+01

Pos logit contributions
Looks+0.108
 faire+0.086
 funn+0.085
have+0.084
icious+0.079
Neg logit contributions
 to-0.125
 and-0.125
round-0.118
driver-0.115
,-0.109
Tokenate
Token ID378
Feature activation+4.12e+00

Pos logit contributions
Looks+0.539
icious+0.420
have+0.413
 shin+0.398
 funn+0.395
Neg logit contributions
 and-0.652
,-0.565
 to-0.564
 or-0.485
 but-0.485
Token means
Token ID1724
Feature activation+7.96e+00

Pos logit contributions
Looks+0.093
have+0.079
 shin+0.072
 funn+0.072
icious+0.071
Neg logit contributions
 and-0.126
 to-0.125
round-0.116
,-0.115
 Bob-0.114
Token you
Token ID345
Feature activation+9.86e-01

Pos logit contributions
Looks+0.231
have+0.202
icious+0.192
 shin+0.191
V+0.191
Neg logit contributions
 to-0.209
 and-0.207
,-0.188
 but-0.180
 Bob-0.180
Token care
Token ID1337
Feature activation-2.57e+00

Pos logit contributions
Looks+0.029
have+0.024
 shin+0.024
icious+0.024
V+0.023
Neg logit contributions
 and-0.027
 to-0.027
,-0.025
 but-0.023
 because-0.023
Token about
Token ID546
Feature activation-3.37e+00

Pos logit contributions
 and+0.044
 to+0.042
,+0.037
 but+0.032
 because+0.032
Neg logit contributions
Looks-0.111
have-0.095
icious-0.094
 shin-0.093
 fullest-0.092
Token-
Token ID12
Feature activation-4.23e+00

Pos logit contributions
Looks+0.401
have+0.375
 shin+0.357
icious+0.353
 funn+0.351
Neg logit contributions
 to-0.181
 and-0.176
 Bob-0.157
round-0.156
cone-0.153
Tokenthree
Token ID15542
Feature activation+1.44e+01

Pos logit contributions
 to+0.122
 and+0.120
round+0.113
,+0.107
cone+0.107
Neg logit contributions
Looks-0.110
have-0.093
icious-0.088
 funn-0.087
 shin-0.085
Token,
Token ID11
Feature activation-9.55e+00

Pos logit contributions
Looks+0.664
have+0.619
 shin+0.596
icious+0.593
 funn+0.591
Neg logit contributions
 to-0.096
 and-0.095
round-0.062
 Bob-0.058
cone-0.056
Token forty
Token ID16571
Feature activation+1.14e+01

Pos logit contributions
 and+0.270
 to+0.270
round+0.249
 Bob+0.246
,+0.246
Neg logit contributions
Looks-0.235
have-0.203
 shin-0.192
 funn-0.188
icious-0.185
Token-
Token ID12
Feature activation-4.11e+00

Pos logit contributions
Looks+0.405
have+0.379
 shin+0.361
icious+0.357
 funn+0.355
Neg logit contributions
 to-0.183
 and-0.178
 Bob-0.159
round-0.158
cone-0.155
Tokenfour
Token ID14337
Feature activation+1.87e+01

Pos logit contributions
 to+0.122
 and+0.119
round+0.112
cone+0.107
,+0.107
Neg logit contributions
Looks-0.104
have-0.088
icious-0.082
 funn-0.081
 shin-0.080
Token,
Token ID11
Feature activation-9.41e+00

Pos logit contributions
Looks+0.982
have+0.895
 shin+0.883
icious+0.874
V+0.874
Neg logit contributions
 and-0.082
,-0.048
 to-0.046
 but+0.010
 because+0.023
Token forty
Token ID16571
Feature activation+1.14e+01

Pos logit contributions
 and+0.258
 to+0.252
,+0.232
 but+0.223
round+0.219
Neg logit contributions
Looks-0.257
have-0.227
icious-0.209
 funn-0.204
 shin-0.204
Token-
Token ID12
Feature activation-4.11e+00

Pos logit contributions
Looks+0.407
have+0.379
 shin+0.358
icious+0.356
 funn+0.354
Neg logit contributions
 to-0.183
 and-0.178
 Bob-0.158
round-0.155
cone-0.155
Tokenfive
Token ID13261
Feature activation+5.63e+00

Pos logit contributions
 to+0.097
 and+0.094
round+0.089
,+0.083
 Bob+0.081
Neg logit contributions
Looks-0.129
have-0.113
icious-0.109
 funn-0.107
 fullest-0.107
Token,
Token ID11
Feature activation-9.36e+00

Pos logit contributions
Looks+0.246
have+0.232
 funn+0.220
 shin+0.220
icious+0.217
Neg logit contributions
 to-0.039
cone-0.038
 and-0.036
round-0.035
 spine-0.031
Token garden
Token ID11376
Feature activation-1.01e+01

Pos logit contributions
 shin+0.232
Looks+0.229
 funn+0.222
 fullest+0.219
 prett+0.216
Neg logit contributions
round-0.172
driver-0.169
ipop-0.161
 to-0.157
 Bob-0.155
Token.
Token ID13
Feature activation+1.91e+00

Pos logit contributions
round+0.205
ipop+0.203
 inspection+0.192
 Bob+0.191
 Lee+0.187
Neg logit contributions
Looks-0.289
 shin-0.280
 funn-0.255
 prett-0.248
 exact-0.247
Token He
Token ID679
Feature activation+6.82e-01

Pos logit contributions
Looks+0.059
V+0.051
 shin+0.050
have+0.050
 funn+0.047
Neg logit contributions
 and-0.039
 to-0.037
round-0.037
ipop-0.035
cone-0.035
Token was
Token ID373
Feature activation+7.26e+00

Pos logit contributions
Looks+0.022
have+0.020
 shin+0.019
 funn+0.019
icious+0.019
Neg logit contributions
 and-0.015
 to-0.014
,-0.013
round-0.013
 Bob-0.012
Token immediately
Token ID3393
Feature activation+2.33e+00

Pos logit contributions
Looks+0.198
have+0.175
 funn+0.169
 shin+0.168
 prett+0.160
Neg logit contributions
round-0.180
 and-0.173
cone-0.171
 to-0.169
ipop-0.167
Token aw
Token ID3253
Feature activation+1.82e+01

Pos logit contributions
Looks+0.062
have+0.055
 shin+0.053
 funn+0.052
 fullest+0.050
Neg logit contributions
round-0.058
 and-0.056
 to-0.055
cone-0.055
driver-0.054
Tokenest
Token ID395
Feature activation+6.71e+00

Pos logit contributions
Looks+0.608
have+0.547
 shin+0.521
 funn+0.517
icious+0.513
Neg logit contributions
 and-0.359
 to-0.357
round-0.316
,-0.311
cone-0.310
Tokenruck
Token ID30915
Feature activation-4.25e+00

Pos logit contributions
Looks+0.156
have+0.139
 sounds+0.132
 funn+0.129
 shin+0.128
Neg logit contributions
 to-0.176
round-0.175
ipop-0.173
driver-0.172
 Bob-0.171
Token.
Token ID13
Feature activation+1.94e+00

Pos logit contributions
ipop+0.068
 Lee+0.066
round+0.065
driver+0.063
uddy+0.059
Neg logit contributions
Looks-0.133
 shin-0.132
 sounds-0.130
 funn-0.128
 prett-0.126
Token
Token ID198
Feature activation+4.67e+00

Pos logit contributions
Looks+0.058
have+0.050
V+0.049
 shin+0.048
 funn+0.047
Neg logit contributions
 and-0.044
 to-0.043
round-0.040
cone-0.039
ipop-0.039
Token
Token ID198
Feature activation+4.70e+00

Pos logit contributions
Looks+0.155
have+0.137
 funn+0.135
 shin+0.134
icious+0.134
Neg logit contributions
 and-0.107
 to-0.104
,-0.091
 Bob-0.089
 but-0.088
Token the
Token ID262
Feature activation+5.78e+00

Pos logit contributions
 and+0.115
 to+0.111
,+0.099
 Bob+0.094
 but+0.094
Neg logit contributions
Looks-0.190
have-0.173
icious-0.164
 funn-0.160
 shin-0.159
Token sky
Token ID6766
Feature activation-6.14e+00

Pos logit contributions
Looks+0.217
have+0.193
icious+0.189
its+0.177
 shin+0.175
Neg logit contributions
 and-0.116
 to-0.109
,-0.095
round-0.089
 but-0.085
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
 and+0.127
 to+0.123
,+0.110
 but+0.104
 Bob+0.103
Neg logit contributions
Looks-0.213
have-0.191
icious-0.179
 shin-0.179
 funn-0.177
Token Lily
Token ID20037
Feature activation+5.95e+00

Pos logit contributions
Looks+0.058
have+0.052
 shin+0.049
 funn+0.049
icious+0.048
Neg logit contributions
 and-0.038
 to-0.037
,-0.033
round-0.033
 Bob-0.032
Token felt
Token ID2936
Feature activation+1.14e+01

Pos logit contributions
Looks+0.171
have+0.154
 shin+0.144
 funn+0.142
V+0.140
Neg logit contributions
 to-0.139
 and-0.136
round-0.128
cone-0.127
 Bob-0.125
Token aw
Token ID3253
Feature activation+1.81e+01

Pos logit contributions
Looks+0.428
icious+0.391
have+0.367
V+0.357
its+0.348
Neg logit contributions
 and-0.250
 to-0.233
,-0.225
 but-0.204
 because-0.200
Tokened
Token ID276
Feature activation+3.43e+00

Pos logit contributions
Looks+0.624
have+0.556
icious+0.521
 shin+0.521
V+0.519
Neg logit contributions
 and-0.388
 to-0.372
,-0.336
 but-0.308
 Bob-0.308
Token,
Token ID11
Feature activation-1.07e+01

Pos logit contributions
Looks+0.117
have+0.109
 shin+0.108
 funn+0.105
 prett+0.101
Neg logit contributions
cone-0.056
ipop-0.056
round-0.053
driver-0.050
 to-0.047
Token with
Token ID351
Feature activation+1.57e+00

Pos logit contributions
 to+0.211
 and+0.207
round+0.200
ipop+0.198
cone+0.195
Neg logit contributions
Looks-0.348
have-0.306
 shin-0.300
 funn-0.298
 prett-0.281
Token her
Token ID607
Feature activation+4.49e+00

Pos logit contributions
Looks+0.045
have+0.039
 funn+0.038
 shin+0.038
V+0.036
Neg logit contributions
 and-0.038
 to-0.038
round-0.036
cone-0.035
,-0.034
Token eyes
Token ID2951
Feature activation-1.53e+00

Pos logit contributions
Looks+0.136
have+0.122
 shin+0.116
 funn+0.115
icious+0.112
Neg logit contributions
 and-0.101
 to-0.100
round-0.091
,-0.089
 Bob-0.089
Token truck
Token ID7779
Feature activation-7.83e+00

Pos logit contributions
round+0.069
 inspection+0.068
 Jim+0.061
 Bob+0.061
ipop+0.061
Neg logit contributions
Looks-0.074
 shin-0.073
have-0.072
 funn-0.071
 prett-0.070
Token.
Token ID13
Feature activation+1.79e+00

Pos logit contributions
 inspection+0.207
round+0.191
ipop+0.188
 spine+0.185
cone+0.181
Neg logit contributions
Looks-0.164
 shin-0.163
have-0.162
 funn-0.157
 prett-0.147
Token He
Token ID679
Feature activation+9.22e-01

Pos logit contributions
Looks+0.053
have+0.046
V+0.044
 shin+0.043
 fullest+0.042
Neg logit contributions
ipop-0.039
round-0.038
 and-0.037
 to-0.037
cone-0.036
Token was
Token ID373
Feature activation+7.48e+00

Pos logit contributions
Looks+0.026
have+0.024
 shin+0.022
V+0.021
 funn+0.021
Neg logit contributions
 to-0.021
 and-0.021
cone-0.021
round-0.021
 spine-0.020
Token so
Token ID523
Feature activation-6.17e+00

Pos logit contributions
Looks+0.219
have+0.200
 funn+0.188
 shin+0.186
V+0.180
Neg logit contributions
 and-0.169
 to-0.165
cone-0.164
round-0.163
ipop-0.159
Token eng
Token ID1786
Feature activation+1.81e+01

Pos logit contributions
 and+0.144
 to+0.138
,+0.129
 Bob+0.116
 but+0.115
Neg logit contributions
Looks-0.203
have-0.174
icious-0.173
 funn-0.169
V-0.167
Tokenross
Token ID1214
Feature activation+5.55e+00

Pos logit contributions
Looks+0.348
have+0.301
 shin+0.266
icious+0.265
 funn+0.256
Neg logit contributions
 to-0.588
 and-0.585
cone-0.567
 Bob-0.553
round-0.552
Tokened
Token ID276
Feature activation+3.82e+00

Pos logit contributions
Looks+0.132
have+0.119
gr+0.111
 shin+0.104
V+0.099
Neg logit contributions
round-0.138
cone-0.135
 grill-0.133
 but-0.130
 and-0.129
Token that
Token ID326
Feature activation-3.40e+00

Pos logit contributions
Looks+0.124
have+0.117
 shin+0.115
 funn+0.112
 prett+0.108
Neg logit contributions
cone-0.063
round-0.063
ipop-0.061
 Lee-0.056
driver-0.054
Token he
Token ID339
Feature activation-2.15e+00

Pos logit contributions
round+0.059
cone+0.057
 and+0.050
driver+0.047
 spine+0.047
Neg logit contributions
Looks-0.117
have-0.108
 shin-0.108
 funn-0.106
 faire-0.101
Token didn
Token ID1422
Feature activation+8.02e+00

Pos logit contributions
 to+0.049
 and+0.049
round+0.045
cone+0.045
 Bob+0.044
Neg logit contributions
Looks-0.064
have-0.057
 shin-0.053
 funn-0.053
V-0.051
Token up
Token ID510
Feature activation+6.25e-01

Pos logit contributions
Looks+0.139
have+0.127
 shin+0.125
 funn+0.124
 prett+0.119
Neg logit contributions
 to-0.057
 and-0.056
round-0.055
cone-0.054
driver-0.051
Token at
Token ID379
Feature activation-4.70e+00

Pos logit contributions
Looks+0.023
have+0.020
 shin+0.020
 funn+0.020
 fullest+0.019
Neg logit contributions
round-0.009
cone-0.009
 and-0.009
driver-0.009
 to-0.009
Token the
Token ID262
Feature activation+6.04e+00

Pos logit contributions
round+0.109
 to+0.108
 and+0.107
cone+0.102
ipop+0.101
Neg logit contributions
Looks-0.135
have-0.117
 shin-0.114
 funn-0.112
V-0.110
Token stars
Token ID5788
Feature activation-2.78e+00

Pos logit contributions
Looks+0.208
have+0.186
icious+0.171
gr+0.168
V+0.168
Neg logit contributions
 and-0.136
 to-0.129
,-0.114
 but-0.110
 Bob-0.107
Token,
Token ID11
Feature activation-1.07e+01

Pos logit contributions
round+0.058
cone+0.055
ipop+0.055
 Bob+0.054
 Lee+0.053
Neg logit contributions
Looks-0.080
 shin-0.074
have-0.071
 funn-0.069
 prett-0.068
Token aw
Token ID3253
Feature activation+1.80e+01

Pos logit contributions
round+0.187
ipop+0.179
cone+0.179
 to+0.175
driver+0.173
Neg logit contributions
Looks-0.373
 shin-0.329
have-0.328
 funn-0.327
 prett-0.314
Tokenest
Token ID395
Feature activation+6.47e+00

Pos logit contributions
Looks+0.552
have+0.489
V+0.451
 shin+0.451
icious+0.449
Neg logit contributions
 and-0.449
 to-0.431
,-0.396
 but-0.371
 Bob-0.369
Tokenruck
Token ID30915
Feature activation-4.52e+00

Pos logit contributions
Looks+0.179
have+0.158
 shin+0.149
 funn+0.147
icious+0.146
Neg logit contributions
 to-0.163
 and-0.163
round-0.149
 Bob-0.147
,-0.146
Token that
Token ID326
Feature activation-3.92e+00

Pos logit contributions
 Lee+0.075
round+0.074
 Bob+0.073
ipop+0.073
driver+0.071
Neg logit contributions
Looks-0.146
 shin-0.138
 funn-0.134
 sounds-0.131
 prett-0.130
Token the
Token ID262
Feature activation+5.90e+00

Pos logit contributions
 and+0.101
 to+0.101
round+0.096
cone+0.093
 Bob+0.091
Neg logit contributions
Looks-0.105
have-0.091
 shin-0.088
 funn-0.088
icious-0.083
Token night
Token ID1755
Feature activation-5.61e+00

Pos logit contributions
Looks+0.183
have+0.162
icious+0.151
 shin+0.150
 funn+0.149
Neg logit contributions
 and-0.140
 to-0.136
,-0.122
 Bob-0.119
 but-0.119
Token-
Token ID12
Feature activation-4.92e+00

Pos logit contributions
Looks+0.270
have+0.242
 shin+0.229
 funn+0.227
icious+0.226
Neg logit contributions
 and-0.176
 to-0.174
,-0.154
round-0.152
 Bob-0.150
Tokenthree
Token ID15542
Feature activation+1.35e+01

Pos logit contributions
 to+0.145
 and+0.139
round+0.133
,+0.125
cone+0.124
Neg logit contributions
Looks-0.127
have-0.112
icious-0.105
 fullest-0.100
 funn-0.100
Token,
Token ID11
Feature activation-1.03e+01

Pos logit contributions
Looks+0.622
have+0.574
 shin+0.552
 funn+0.551
icious+0.549
Neg logit contributions
 and-0.108
 to-0.103
,-0.073
round-0.066
 Bob-0.063
Token twenty
Token ID8208
Feature activation+8.37e+00

Pos logit contributions
 to+0.284
 and+0.282
round+0.267
cone+0.262
 Bob+0.260
Neg logit contributions
Looks-0.255
have-0.220
 shin-0.212
 funn-0.208
icious-0.202
Token-
Token ID12
Feature activation-4.87e+00

Pos logit contributions
Looks+0.275
have+0.248
 shin+0.234
 funn+0.232
icious+0.232
Neg logit contributions
 and-0.172
 to-0.170
,-0.149
round-0.149
 Bob-0.147
Tokenfour
Token ID14337
Feature activation+1.78e+01

Pos logit contributions
 to+0.140
 and+0.134
round+0.127
,+0.120
cone+0.119
Neg logit contributions
Looks-0.129
have-0.114
icious-0.106
 fullest-0.102
 shin-0.101
Token,
Token ID11
Feature activation-1.03e+01

Pos logit contributions
Looks+0.972
have+0.892
V+0.874
 shin+0.873
icious+0.872
Neg logit contributions
 and-0.043
,-0.015
 to-0.010
 but+0.040
 Bob+0.055
Token twenty
Token ID8208
Feature activation+8.34e+00

Pos logit contributions
 and+0.281
 to+0.275
,+0.253
 but+0.246
 Bob+0.243
Neg logit contributions
Looks-0.277
have-0.244
icious-0.224
 shin-0.223
 funn-0.221
Token-
Token ID12
Feature activation-4.92e+00

Pos logit contributions
Looks+0.304
have+0.280
 shin+0.265
 funn+0.263
icious+0.262
Neg logit contributions
 to-0.133
 and-0.132
round-0.114
 Bob-0.113
cone-0.112
Tokenfive
Token ID13261
Feature activation+4.61e+00

Pos logit contributions
 to+0.094
 and+0.088
round+0.084
,+0.076
ipop+0.074
Neg logit contributions
Looks-0.176
have-0.161
icious-0.156
 fullest-0.154
 shin-0.152
Token,
Token ID11
Feature activation-1.03e+01

Pos logit contributions
Looks+0.210
have+0.197
 funn+0.189
 shin+0.188
icious+0.186
Neg logit contributions
 to-0.026
 and-0.024
cone-0.023
round-0.022
 spine-0.016
Token-
Token ID12
Feature activation-5.35e+00

Pos logit contributions
Looks+0.307
have+0.279
 shin+0.266
 funn+0.265
icious+0.264
Neg logit contributions
 and-0.122
 to-0.120
,-0.101
round-0.099
 Bob-0.097
Tokenthree
Token ID15542
Feature activation+1.34e+01

Pos logit contributions
 to+0.137
 and+0.135
round+0.124
 Bob+0.119
,+0.118
Neg logit contributions
Looks-0.154
have-0.134
icious-0.129
 funn-0.128
 shin-0.127
Token,
Token ID11
Feature activation-1.07e+01

Pos logit contributions
Looks+0.623
have+0.577
 shin+0.556
 funn+0.553
icious+0.552
Neg logit contributions
 and-0.097
 to-0.093
,-0.062
round-0.057
 Bob-0.054
Token thirty
Token ID12277
Feature activation+8.40e+00

Pos logit contributions
 and+0.245
 to+0.236
,+0.221
 Bob+0.209
 but+0.207
Neg logit contributions
Looks-0.340
have-0.308
icious-0.290
 shin-0.283
 funn-0.281
Token-
Token ID12
Feature activation-4.96e+00

Pos logit contributions
Looks+0.320
have+0.292
 shin+0.279
 funn+0.277
icious+0.277
Neg logit contributions
 and-0.129
 to-0.127
,-0.106
round-0.105
 Bob-0.103
Tokenfour
Token ID14337
Feature activation+1.77e+01

Pos logit contributions
 to+0.132
 and+0.129
round+0.120
,+0.115
 Bob+0.115
Neg logit contributions
Looks-0.139
have-0.120
icious-0.114
 funn-0.113
 shin-0.112
Token,
Token ID11
Feature activation-1.04e+01

Pos logit contributions
Looks+0.806
have+0.731
 shin+0.717
icious+0.708
 funn+0.706
Neg logit contributions
 and-0.188
 to-0.164
,-0.152
 but-0.111
 Bob-0.104
Token thirty
Token ID12277
Feature activation+8.56e+00

Pos logit contributions
 and+0.277
 to+0.266
,+0.247
 but+0.239
 Bob+0.234
Neg logit contributions
Looks-0.295
have-0.266
icious-0.247
 shin-0.239
 funn-0.238
Token-
Token ID12
Feature activation-4.84e+00

Pos logit contributions
Looks+0.308
have+0.286
 shin+0.269
icious+0.267
 funn+0.267
Neg logit contributions
 to-0.136
 and-0.134
round-0.117
cone-0.117
 Bob-0.116
Tokenfive
Token ID13261
Feature activation+5.03e+00

Pos logit contributions
 to+0.127
 and+0.124
round+0.116
,+0.110
 Bob+0.110
Neg logit contributions
Looks-0.138
have-0.119
icious-0.115
 funn-0.113
 shin-0.113
Token,
Token ID11
Feature activation-1.02e+01

Pos logit contributions
Looks+0.216
have+0.202
 funn+0.192
 shin+0.191
icious+0.190
Neg logit contributions
 to-0.044
 and-0.042
cone-0.039
round-0.038
ipop-0.032
Token mom
Token ID1995
Feature activation-7.83e+00

Pos logit contributions
Looks+0.268
have+0.245
 funn+0.236
 shin+0.235
V+0.229
Neg logit contributions
 and-0.170
 to-0.165
round-0.164
cone-0.161
 Bob-0.148
Token saw
Token ID2497
Feature activation+8.41e+00

Pos logit contributions
 spine+0.218
round+0.214
 to+0.210
cone+0.208
ipop+0.196
Neg logit contributions
Looks-0.188
have-0.168
 shin-0.154
 funn-0.151
V-0.141
Token Jane
Token ID12091
Feature activation+1.94e+00

Pos logit contributions
Looks+0.220
have+0.196
 funn+0.181
 shin+0.176
End+0.169
Neg logit contributions
round-0.223
 spine-0.222
cone-0.217
driver-0.199
ipop-0.197
Token looking
Token ID2045
Feature activation+2.19e+00

Pos logit contributions
Looks+0.063
have+0.061
 shin+0.058
 funn+0.057
 prett+0.054
Neg logit contributions
round-0.034
cone-0.032
 spine-0.031
driver-0.030
 to-0.029
Token en
Token ID551
Feature activation+1.08e+01

Pos logit contributions
Looks+0.079
have+0.074
 funn+0.071
 shin+0.071
 prett+0.068
Neg logit contributions
round-0.033
cone-0.032
 to-0.031
 and-0.030
driver-0.030
Tokenviously
Token ID8647
Feature activation+1.76e+01

Pos logit contributions
Looks+0.290
V+0.240
have+0.240
icious+0.240
 fullest+0.228
Neg logit contributions
 and-0.319
 to-0.310
,-0.282
 but-0.271
 because-0.265
Token at
Token ID379
Feature activation-4.64e+00

Pos logit contributions
Looks+0.582
 shin+0.543
have+0.542
 funn+0.528
 prett+0.521
Neg logit contributions
cone-0.313
round-0.309
 spine-0.266
stack-0.266
driver-0.263
Token the
Token ID262
Feature activation+5.95e+00

Pos logit contributions
round+0.128
cone+0.127
driver+0.120
 spine+0.120
ipop+0.114
Neg logit contributions
Looks-0.116
have-0.098
 funn-0.094
 shin-0.089
 prett-0.088
Token big
Token ID1263
Feature activation+4.16e+00

Pos logit contributions
Looks+0.188
icious+0.167
have+0.166
gr+0.156
ceed+0.154
Neg logit contributions
 and-0.155
 to-0.146
,-0.135
 but-0.133
 Bob-0.126
Token kids
Token ID3988
Feature activation-2.73e+00

Pos logit contributions
Looks+0.119
have+0.106
 funn+0.105
 shin+0.104
V+0.099
Neg logit contributions
 to-0.092
cone-0.089
 and-0.087
driver-0.086
round-0.086
Token and
Token ID290
Feature activation-1.47e+01

Pos logit contributions
cone+0.054
round+0.052
driver+0.049
ipop+0.047
 spine+0.046
Neg logit contributions
Looks-0.077
have-0.077
 funn-0.071
 shin-0.069
 prett-0.068
Token-
Token ID12
Feature activation-5.16e+00

Pos logit contributions
Looks+0.294
have+0.274
 shin+0.261
icious+0.258
 funn+0.258
Neg logit contributions
 to-0.131
 and-0.128
round-0.115
 Bob-0.114
cone-0.111
Tokenthree
Token ID15542
Feature activation+1.33e+01

Pos logit contributions
 to+0.156
 and+0.150
round+0.144
,+0.136
cone+0.134
Neg logit contributions
Looks-0.126
have-0.110
icious-0.104
 funn-0.100
 fullest-0.100
Token,
Token ID11
Feature activation-1.05e+01

Pos logit contributions
Looks+0.655
have+0.613
 shin+0.592
icious+0.589
 funn+0.588
Neg logit contributions
 to-0.049
 and-0.049
round-0.018
 Bob-0.013
cone-0.011
Token thirty
Token ID12277
Feature activation+8.26e+00

Pos logit contributions
 to+0.305
 and+0.304
round+0.292
cone+0.286
 spine+0.285
Neg logit contributions
Looks-0.241
have-0.205
 shin-0.198
 funn-0.192
icious-0.186
Token-
Token ID12
Feature activation-5.14e+00

Pos logit contributions
Looks+0.300
have+0.280
 shin+0.268
 funn+0.264
icious+0.264
Neg logit contributions
 to-0.125
 and-0.122
round-0.109
 Bob-0.108
cone-0.106
Tokenfour
Token ID14337
Feature activation+1.75e+01

Pos logit contributions
 to+0.156
 and+0.150
round+0.143
,+0.136
cone+0.134
Neg logit contributions
Looks-0.127
have-0.110
icious-0.103
 fullest-0.100
 funn-0.099
Token,
Token ID11
Feature activation-1.05e+01

Pos logit contributions
Looks+0.953
have+0.875
 shin+0.856
icious+0.855
V+0.853
Neg logit contributions
 and-0.040
 to-0.010
,-0.010
 but+0.039
 Bob+0.055
Token thirty
Token ID12277
Feature activation+8.17e+00

Pos logit contributions
 and+0.295
 to+0.287
,+0.266
 but+0.257
 Bob+0.253
Neg logit contributions
Looks-0.277
have-0.246
icious-0.224
 shin-0.219
 funn-0.219
Token-
Token ID12
Feature activation-5.24e+00

Pos logit contributions
Looks+0.290
have+0.270
 shin+0.256
icious+0.253
 funn+0.252
Neg logit contributions
 to-0.131
 and-0.125
 Bob-0.114
round-0.112
cone-0.112
Tokenfive
Token ID13261
Feature activation+4.26e+00

Pos logit contributions
 to+0.097
 and+0.092
round+0.088
,+0.080
ipop+0.076
Neg logit contributions
Looks-0.189
have-0.172
icious-0.167
 fullest-0.165
 funn-0.163
Token,
Token ID11
Feature activation-1.05e+01

Pos logit contributions
Looks+0.193
have+0.183
 funn+0.175
 shin+0.174
icious+0.172
Neg logit contributions
cone-0.022
 to-0.021
round-0.020
 and-0.018
 spine-0.015
Token made
Token ID925
Feature activation+7.78e+00

Pos logit contributions
 and+0.039
 to+0.038
,+0.034
 but+0.032
 Bob+0.031
Neg logit contributions
Looks-0.052
have-0.047
icious-0.045
V-0.043
 funn-0.042
Token her
Token ID607
Feature activation+5.06e+00

Pos logit contributions
Looks+0.279
have+0.255
icious+0.241
V+0.237
gr+0.230
Neg logit contributions
 and-0.162
 to-0.158
,-0.138
 Bob-0.130
 but-0.125
Token spark
Token ID9009
Feature activation+7.03e+00

Pos logit contributions
Looks+0.146
have+0.129
 shin+0.121
 funn+0.120
icious+0.119
Neg logit contributions
 and-0.126
 to-0.125
,-0.113
round-0.111
 Bob-0.110
Tokenle
Token ID293
Feature activation-7.94e-01

Pos logit contributions
Looks+0.227
V+0.196
have+0.193
icious+0.184
gr+0.173
Neg logit contributions
 and-0.197
 to-0.182
,-0.174
 but-0.156
 because-0.150
Token and
Token ID290
Feature activation-1.42e+01

Pos logit contributions
 and+0.018
 to+0.017
,+0.015
 but+0.014
 because+0.013
Neg logit contributions
Looks-0.027
have-0.024
V-0.024
icious-0.023
 shin-0.023
Token gle
Token ID26852
Feature activation+1.74e+01

Pos logit contributions
 and+0.357
 to+0.352
,+0.318
 Bob+0.304
 but+0.303
Neg logit contributions
Looks-0.424
have-0.384
icious-0.365
V-0.353
gr-0.340
Tokenam
Token ID321
Feature activation+4.94e+00

Pos logit contributions
Looks+0.412
have+0.359
V+0.341
 shin+0.305
 funn+0.304
Neg logit contributions
 and-0.554
,-0.499
 to-0.483
 because-0.447
 Bob-0.435
Token.
Token ID13
Feature activation+2.53e+00

Pos logit contributions
Looks+0.198
have+0.180
icious+0.177
V+0.169
 funn+0.168
Neg logit contributions
 to-0.086
 and-0.085
,-0.070
 but-0.062
 Bob-0.057
Token She
Token ID1375
Feature activation+1.53e+00

Pos logit contributions
Looks+0.081
have+0.069
V+0.067
 shin+0.066
 funn+0.066
Neg logit contributions
 and-0.053
 to-0.053
round-0.048
cone-0.047
,-0.047
Token felt
Token ID2936
Feature activation+1.19e+01

Pos logit contributions
Looks+0.043
have+0.037
 shin+0.036
 funn+0.035
V+0.034
Neg logit contributions
 to-0.038
 and-0.037
round-0.035
cone-0.035
 spine-0.034
Token so
Token ID523
Feature activation-5.95e+00

Pos logit contributions
Looks+0.468
have+0.423
icious+0.417
V+0.411
 shin+0.392
Neg logit contributions
 and-0.215
 to-0.204
,-0.183
 but-0.164
 because-0.161
Token gross
Token ID10319
Feature activation+9.06e+00

Pos logit contributions
 and+0.195
 to+0.193
,+0.177
round+0.176
 Bob+0.174
Neg logit contributions
Looks-0.150
have-0.127
 shin-0.118
 funn-0.116
icious-0.115
Token that
Token ID326
Feature activation-3.65e+00

Pos logit contributions
Looks+0.227
 sounds+0.206
 shin+0.195
 yourself+0.193
have+0.185
Neg logit contributions
 spine-0.218
round-0.209
ipop-0.198
 lawyer-0.187
cone-0.186
Token it
Token ID340
Feature activation-1.83e+00

Pos logit contributions
 and+0.043
 to+0.042
round+0.039
cone+0.036
ipop+0.032
Neg logit contributions
Looks-0.151
have-0.137
 shin-0.133
 funn-0.133
icious-0.130
Token made
Token ID925
Feature activation+7.59e+00

Pos logit contributions
round+0.052
 to+0.047
 Lee+0.047
 and+0.047
 spine+0.047
Neg logit contributions
Looks-0.046
 shin-0.037
 funn-0.037
have-0.037
 prett-0.033
Token Jack
Token ID3619
Feature activation+2.03e-03

Pos logit contributions
Looks+0.275
have+0.249
 shin+0.237
 funn+0.236
icious+0.235
Neg logit contributions
 and-0.131
 to-0.129
round-0.113
,-0.111
cone-0.109
Token grim
Token ID18288
Feature activation+1.73e+01

Pos logit contributions
Looks+0.000
have+0.000
 shin+0.000
 funn+0.000
icious+0.000
Neg logit contributions
 and-0.000
 to-0.000
round-0.000
cone-0.000
 Bob-0.000
Tokenace
Token ID558
Feature activation-2.53e+00

Pos logit contributions
Looks+0.506
have+0.446
 shin+0.419
 funn+0.416
icious+0.415
Neg logit contributions
 and-0.425
 to-0.419
,-0.378
round-0.372
 Bob-0.369
Token.
Token ID13
Feature activation+1.91e+00

Pos logit contributions
round+0.049
cone+0.049
 Lee+0.046
 Bob+0.045
 to+0.045
Neg logit contributions
Looks-0.075
have-0.070
 shin-0.067
 funn-0.066
 fullest-0.062
Token He
Token ID679
Feature activation+8.06e-01

Pos logit contributions
Looks+0.061
have+0.053
 shin+0.051
 funn+0.050
V+0.050
Neg logit contributions
 and-0.039
round-0.038
 to-0.038
cone-0.038
ipop-0.035
Token wanted
Token ID2227
Feature activation+9.98e+00

Pos logit contributions
Looks+0.022
have+0.018
 shin+0.017
 funn+0.017
V+0.016
Neg logit contributions
round-0.020
 to-0.020
 spine-0.019
 and-0.019
 Lee-0.019
Token to
Token ID284
Feature activation-1.41e+01

Pos logit contributions
 better+0.280
 other+0.279
 wonderful+0.278
 new+0.277
 nice+0.270
Neg logit contributions
round-0.226
amb-0.204
 spine-0.200
ipop-0.182
storm-0.174
TokenGo
Token ID5247
Feature activation+3.53e+00

Pos logit contributions
Looks+0.179
icious+0.160
 funn+0.158
 shin+0.158
have+0.157
Neg logit contributions
 and-0.226
 to-0.221
 but-0.202
round-0.200
 Bob-0.198
Token on
Token ID319
Feature activation-6.23e+00

Pos logit contributions
Looks+0.110
have+0.099
V+0.091
 shin+0.091
icious+0.091
Neg logit contributions
 and-0.077
 to-0.076
 Bob-0.068
cone-0.067
,-0.067
Token,
Token ID11
Feature activation-9.99e+00

Pos logit contributions
round+0.103
cone+0.099
ipop+0.095
 spine+0.095
 and+0.094
Neg logit contributions
Looks-0.196
have-0.182
V-0.174
 shin-0.172
 funn-0.171
Token John
Token ID1757
Feature activation-1.71e-01

Pos logit contributions
 to+0.216
 and+0.216
cone+0.195
round+0.194
,+0.190
Neg logit contributions
Looks-0.312
have-0.281
 shin-0.263
 funn-0.261
icious-0.255
Token.
Token ID13
Feature activation+2.48e+00

Pos logit contributions
 and+0.003
 to+0.003
round+0.003
cone+0.003
 Bob+0.003
Neg logit contributions
Looks-0.006
have-0.005
 shin-0.005
 funn-0.005
icious-0.005
Token Beh
Token ID10407
Feature activation+1.73e+01

Pos logit contributions
Looks+0.068
have+0.060
 shin+0.057
 funn+0.056
icious+0.056
Neg logit contributions
 and-0.065
 to-0.064
,-0.058
 Bob-0.058
round-0.057
Tokenave
Token ID1015
Feature activation+5.85e+00

Pos logit contributions
Looks+0.528
 shin+0.449
 fullest+0.444
icious+0.444
 funn+0.433
Neg logit contributions
 to-0.431
 and-0.430
round-0.371
,-0.365
driver-0.351
Token like
Token ID588
Feature activation+1.02e+00

Pos logit contributions
Looks+0.176
have+0.159
 shin+0.147
 funn+0.147
icious+0.146
Neg logit contributions
 and-0.133
 to-0.132
round-0.119
cone-0.118
,-0.117
Token a
Token ID257
Feature activation+7.91e+00

Pos logit contributions
Looks+0.028
have+0.025
 shin+0.023
 funn+0.023
icious+0.023
Neg logit contributions
 and-0.026
 to-0.026
round-0.024
,-0.023
cone-0.023
Token brave
Token ID14802
Feature activation+3.23e+00

Pos logit contributions
Looks+0.204
have+0.176
 shin+0.162
icious+0.162
 funn+0.160
Neg logit contributions
 and-0.225
 to-0.223
,-0.204
round-0.199
 but-0.198
Token boy
Token ID2933
Feature activation-1.32e+00

Pos logit contributions
Looks+0.088
have+0.082
 shin+0.079
 funn+0.079
V+0.074
Neg logit contributions
 to-0.074
cone-0.071
stack-0.068
 and-0.067
 Bob-0.066
Token for
Token ID329
Feature activation+2.22e+00

Pos logit contributions
Looks+0.017
have+0.016
 shin+0.015
 funn+0.015
 moral+0.015
Neg logit contributions
cone-0.005
round-0.005
 spine-0.005
 inspection-0.004
driver-0.004
Token the
Token ID262
Feature activation+6.54e+00

Pos logit contributions
Looks+0.070
have+0.063
 shin+0.059
 funn+0.059
icious+0.058
Neg logit contributions
 and-0.049
 to-0.049
,-0.043
round-0.043
 Bob-0.042
Token toast
Token ID27805
Feature activation-5.82e+00

Pos logit contributions
Looks+0.185
have+0.164
 funn+0.154
 shin+0.154
icious+0.151
Neg logit contributions
 and-0.164
 to-0.162
round-0.148
,-0.148
 Bob-0.145
Token and
Token ID290
Feature activation-1.42e+01

Pos logit contributions
round+0.089
driver+0.088
cone+0.086
 inspection+0.079
 spine+0.079
Neg logit contributions
Looks-0.182
 funn-0.176
have-0.172
 shin-0.172
icious-0.171
Token smiled
Token ID13541
Feature activation+1.04e+01

Pos logit contributions
 spine+0.329
round+0.319
 to+0.312
cone+0.311
 and+0.307
Neg logit contributions
Looks-0.411
have-0.354
 funn-0.345
 shin-0.343
 prett-0.309
Token gle
Token ID26852
Feature activation+1.72e+01

Pos logit contributions
Looks+0.382
have+0.346
 shin+0.328
icious+0.327
 funn+0.327
Neg logit contributions
 and-0.184
 to-0.180
,-0.155
round-0.149
 but-0.148
Tokenefully
Token ID7549
Feature activation+2.36e+00

Pos logit contributions
Looks+0.387
V+0.313
have+0.309
 shin+0.285
icious+0.284
Neg logit contributions
 and-0.586
 to-0.536
,-0.531
 because-0.485
round-0.484
Token.
Token ID13
Feature activation+2.65e+00

Pos logit contributions
Looks+0.072
have+0.066
 funn+0.062
 shin+0.061
icious+0.060
Neg logit contributions
 and-0.046
 to-0.046
cone-0.046
 Bob-0.044
round-0.044
Token His
Token ID2399
Feature activation+9.39e+00

Pos logit contributions
Looks+0.099
have+0.089
icious+0.083
V+0.083
 funn+0.083
Neg logit contributions
 to-0.041
 and-0.041
cone-0.037
ipop-0.036
round-0.036
Token siblings
Token ID20569
Feature activation-1.62e+00

Pos logit contributions
Looks+0.284
have+0.263
 funn+0.251
 shin+0.246
icious+0.244
Neg logit contributions
round-0.196
 to-0.196
 and-0.195
driver-0.180
cone-0.179
Token were
Token ID547
Feature activation+8.11e+00

Pos logit contributions
 to+0.037
 and+0.037
cone+0.034
round+0.034
 Bob+0.033
Neg logit contributions
Looks-0.048
have-0.043
 shin-0.040
 funn-0.040
icious-0.040
Token You
Token ID921
Feature activation+5.05e+00

Pos logit contributions
Looks+0.171
have+0.158
icious+0.149
 shin+0.144
V+0.143
Neg logit contributions
 to-0.069
 and-0.069
,-0.058
 but-0.053
cone-0.049
Token have
Token ID423
Feature activation+1.19e+00

Pos logit contributions
Looks+0.154
have+0.131
icious+0.130
 shin+0.127
V+0.126
Neg logit contributions
 and-0.131
 to-0.130
,-0.119
 but-0.111
 Bob-0.108
Token to
Token ID284
Feature activation-1.73e+01

Pos logit contributions
Looks+0.058
have+0.053
icious+0.052
 shin+0.051
V+0.051
Neg logit contributions
 to-0.008
 and-0.008
,-0.004
 Bob-0.003
 but-0.003
Token keep
Token ID1394
Feature activation+2.34e+00

Pos logit contributions
 and+0.419
 to+0.411
,+0.378
 Bob+0.361
 but+0.358
Neg logit contributions
Looks-0.528
have-0.454
 shin-0.433
 funn-0.430
icious-0.426
Token trying
Token ID2111
Feature activation-2.69e+00

Pos logit contributions
Looks+0.073
have+0.066
 shin+0.062
 funn+0.061
icious+0.061
Neg logit contributions
 and-0.052
 to-0.051
round-0.046
,-0.045
cone-0.045
Token and
Token ID290
Feature activation-1.81e+01

Pos logit contributions
 and+0.041
 to+0.040
round+0.036
cone+0.036
 Bob+0.035
Neg logit contributions
Looks-0.099
have-0.092
 shin-0.087
 funn-0.086
icious-0.086
Token learning
Token ID4673
Feature activation-1.80e+00

Pos logit contributions
 to+0.422
 and+0.413
,+0.371
 but+0.356
 Bob+0.346
Neg logit contributions
Looks-0.584
have-0.501
icious-0.490
V-0.484
 funn-0.478
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
 and+0.041
 to+0.041
,+0.036
 but+0.034
 Bob+0.032
Neg logit contributions
Looks-0.062
have-0.054
icious-0.052
V-0.051
 shin-0.050
Token having
Token ID1719
Feature activation+4.46e+00

Pos logit contributions
 to+0.457
 and+0.447
,+0.403
 but+0.387
 Bob+0.380
Neg logit contributions
Looks-0.529
have-0.455
icious-0.440
V-0.435
 funn-0.429
Token fun
Token ID1257
Feature activation-2.81e-01

Pos logit contributions
Looks+0.117
have+0.100
icious+0.097
V+0.092
gr+0.090
Neg logit contributions
 to-0.138
 and-0.136
,-0.125
 Bob-0.119
 but-0.118
Token.
Token ID13
Feature activation-6.75e-01

Pos logit contributions
 and+0.006
 to+0.006
round+0.005
,+0.005
 Bob+0.005
Neg logit contributions
Looks-0.009
have-0.008
 shin-0.008
 funn-0.008
icious-0.008
Token to
Token ID284
Feature activation-1.74e+01

Pos logit contributions
Looks+0.172
have+0.168
 funn+0.164
 shin+0.156
 sounds+0.156
Neg logit contributions
 spine-0.049
driver-0.047
cone-0.046
round-0.045
ipop-0.039
Token build
Token ID1382
Feature activation-5.42e-01

Pos logit contributions
 and+0.399
 to+0.394
round+0.355
,+0.351
cone+0.350
Neg logit contributions
Looks-0.522
have-0.469
 shin-0.439
 funn-0.437
icious-0.432
Token towers
Token ID18028
Feature activation-8.76e+00

Pos logit contributions
 and+0.013
 to+0.013
,+0.012
 but+0.011
 Bob+0.011
Neg logit contributions
Looks-0.017
have-0.014
icious-0.014
V-0.014
 funn-0.013
Token and
Token ID290
Feature activation-1.84e+01

Pos logit contributions
ipop+0.110
cone+0.107
round+0.100
 spine+0.099
 Bob+0.094
Neg logit contributions
Looks-0.313
 shin-0.309
have-0.307
 funn-0.297
 prett-0.296
Token houses
Token ID7777
Feature activation-1.16e+01

Pos logit contributions
 and+0.443
 to+0.439
,+0.389
 but+0.380
round+0.372
Neg logit contributions
Looks-0.557
have-0.492
icious-0.475
 shin-0.458
 funn-0.457
Token and
Token ID290
Feature activation-1.80e+01

Pos logit contributions
ipop+0.293
round+0.283
cone+0.282
 spine+0.273
 Bob+0.266
Neg logit contributions
Looks-0.282
 shin-0.263
have-0.260
 funn-0.242
 prett-0.232
Token cars
Token ID5006
Feature activation-6.29e+00

Pos logit contributions
 and+0.455
 to+0.448
round+0.426
ipop+0.420
cone+0.418
Neg logit contributions
Looks-0.495
have-0.439
 shin-0.414
 funn-0.410
V-0.390
Token with
Token ID351
Feature activation-1.31e+00

Pos logit contributions
ipop+0.137
cone+0.134
 Bob+0.133
 spine+0.132
 to+0.131
Neg logit contributions
Looks-0.174
have-0.165
 shin-0.156
 funn-0.149
V-0.147
Token the
Token ID262
Feature activation+2.97e+00

Pos logit contributions
round+0.029
 and+0.028
cone+0.028
 to+0.028
 spine+0.027
Neg logit contributions
Looks-0.038
have-0.034
 shin-0.034
 funn-0.033
V-0.032
Token blocks
Token ID7021
Feature activation-9.29e+00

Pos logit contributions
Looks+0.098
have+0.088
 shin+0.083
icious+0.082
 funn+0.082
Neg logit contributions
 and-0.062
 to-0.061
,-0.054
round-0.053
 but-0.052
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
round+0.241
cone+0.240
 Bob+0.234
 spine+0.232
ipop+0.230
Neg logit contributions
Looks-0.208
 shin-0.199
have-0.189
 prett-0.184
 funn-0.184
Token,
Token ID11
Feature activation-1.40e+01

Pos logit contributions
Looks+0.187
have+0.167
 shin+0.158
 funn+0.157
icious+0.156
Neg logit contributions
 and-0.130
 to-0.128
,-0.114
round-0.113
 Bob-0.111
Token she
Token ID673
Feature activation-6.09e+00

Pos logit contributions
round+0.221
cone+0.219
 to+0.218
 and+0.216
ipop+0.205
Neg logit contributions
Looks-0.482
have-0.448
 shin-0.423
 funn-0.422
icious-0.401
Token searched
Token ID16499
Feature activation-1.62e+00

Pos logit contributions
 and+0.135
 to+0.131
,+0.119
 Bob+0.109
 but+0.108
Neg logit contributions
Looks-0.203
have-0.178
icious-0.174
 shin-0.171
 funn-0.170
Token the
Token ID262
Feature activation+2.88e+00

Pos logit contributions
round+0.036
cone+0.033
driver+0.032
ipop+0.032
 Lee+0.032
Neg logit contributions
Looks-0.049
 prett-0.044
 shin-0.044
 wonderful-0.042
 funn-0.042
Token maze
Token ID31237
Feature activation-6.37e+00

Pos logit contributions
Looks+0.086
have+0.077
 shin+0.076
 funn+0.076
 prett+0.073
Neg logit contributions
 to-0.062
 and-0.062
round-0.058
 Bob-0.057
ipop-0.056
Token and
Token ID290
Feature activation-1.79e+01

Pos logit contributions
ipop+0.083
round+0.082
cone+0.079
ule+0.078
olph+0.077
Neg logit contributions
Looks-0.228
 shin-0.215
 prett-0.210
 funn-0.209
 faire-0.203
Token finally
Token ID3443
Feature activation-6.38e+00

Pos logit contributions
 and+0.351
 to+0.347
,+0.303
 but+0.293
 Bob+0.293
Neg logit contributions
Looks-0.617
have-0.557
icious-0.527
 shin-0.526
 funn-0.524
Token found
Token ID1043
Feature activation+4.91e+00

Pos logit contributions
 and+0.100
 to+0.099
,+0.086
 but+0.079
 Bob+0.078
Neg logit contributions
Looks-0.249
have-0.227
icious-0.217
 funn-0.216
 shin-0.216
Token the
Token ID262
Feature activation+4.04e+00

Pos logit contributions
Looks+0.163
have+0.148
icious+0.140
 funn+0.137
 shin+0.136
Neg logit contributions
 and-0.107
 to-0.107
,-0.096
 Bob-0.093
 but-0.091
Token way
Token ID835
Feature activation-7.11e+00

Pos logit contributions
Looks+0.117
have+0.103
icious+0.096
 shin+0.094
V+0.094
Neg logit contributions
 and-0.104
 to-0.102
,-0.092
 but-0.091
 Bob-0.089
Token out
Token ID503
Feature activation-4.86e+00

Pos logit contributions
 and+0.101
round+0.099
cone+0.096
 to+0.093
ipop+0.091
Neg logit contributions
Looks-0.261
have-0.235
 shin-0.230
 funn-0.224
 prett-0.218
Token't
Token ID470
Feature activation-4.47e+00

Pos logit contributions
round+0.071
cone+0.068
ipop+0.068
 spine+0.062
 Frank+0.059
Neg logit contributions
 shin-0.105
Looks-0.105
 sounds-0.100
have-0.099
 funn-0.096
Token wait
Token ID4043
Feature activation-3.97e+00

Pos logit contributions
 and+0.110
 to+0.109
,+0.098
round+0.097
 Bob+0.096
Neg logit contributions
Looks-0.130
have-0.115
 shin-0.108
 funn-0.107
icious-0.107
Token to
Token ID284
Feature activation-1.72e+01

Pos logit contributions
cone+0.052
 and+0.052
round+0.050
 to+0.048
driver+0.046
Neg logit contributions
Looks-0.148
have-0.140
 shin-0.131
 funn-0.131
icious-0.127
Token tell
Token ID1560
Feature activation+3.08e+00

Pos logit contributions
 and+0.418
 to+0.407
round+0.404
cone+0.402
ipop+0.391
Neg logit contributions
Looks-0.461
have-0.424
 funn-0.396
 shin-0.389
icious-0.378
Token mummy
Token ID38567
Feature activation-1.02e+01

Pos logit contributions
Looks+0.097
have+0.088
 funn+0.083
 shin+0.082
icious+0.081
Neg logit contributions
 and-0.064
 to-0.062
round-0.058
cone-0.058
,-0.055
Token and
Token ID290
Feature activation-1.79e+01

Pos logit contributions
 to+0.151
 and+0.151
cone+0.144
round+0.142
 spine+0.135
Neg logit contributions
Looks-0.371
have-0.348
 funn-0.323
 shin-0.322
icious-0.319
Token daddy
Token ID35822
Feature activation-8.30e+00

Pos logit contributions
round+0.361
cone+0.354
 and+0.351
 to+0.336
 spine+0.316
Neg logit contributions
Looks-0.567
have-0.523
 funn-0.490
 shin-0.487
icious-0.456
Token what
Token ID644
Feature activation-1.73e+00

Pos logit contributions
 and+0.168
 to+0.165
cone+0.153
round+0.153
 Bob+0.146
Neg logit contributions
Looks-0.266
have-0.243
 funn-0.226
 shin-0.225
icious-0.222
Token aunt
Token ID25949
Feature activation-9.31e+00

Pos logit contributions
 and+0.028
 to+0.027
round+0.025
cone+0.024
,+0.023
Neg logit contributions
Looks-0.064
have-0.059
 funn-0.056
 shin-0.056
icious-0.055
Token had
Token ID550
Feature activation+3.28e+00

Pos logit contributions
 to+0.202
 and+0.200
round+0.198
cone+0.197
 spine+0.190
Neg logit contributions
Looks-0.277
have-0.254
 shin-0.230
 funn-0.229
icious-0.226
Token whispered
Token ID25029
Feature activation+5.67e+00

Pos logit contributions
Looks+0.094
have+0.082
 shin+0.078
 funn+0.077
icious+0.077
Neg logit contributions
 and-0.084
 to-0.083
,-0.075
 but-0.073
 Bob-0.072
Token liked
Token ID8288
Feature activation+4.49e+00

Pos logit contributions
 and+0.030
 to+0.028
,+0.025
 but+0.024
 Bob+0.023
Neg logit contributions
Looks-0.038
have-0.035
icious-0.034
 fullest-0.033
 funn-0.033
Token to
Token ID284
Feature activation-1.74e+01

Pos logit contributions
Looks+0.172
have+0.168
 funn+0.164
 shin+0.156
 sounds+0.156
Neg logit contributions
 spine-0.049
driver-0.047
cone-0.046
round-0.045
ipop-0.039
Token play
Token ID711
Feature activation+9.16e-01

Pos logit contributions
 and+0.399
 to+0.394
round+0.355
,+0.351
cone+0.350
Neg logit contributions
Looks-0.522
have-0.469
 shin-0.439
 funn-0.437
icious-0.432
Token with
Token ID351
Feature activation-2.11e+00

Pos logit contributions
Looks+0.033
have+0.031
V+0.029
 funn+0.029
 shin+0.029
Neg logit contributions
 to-0.013
 and-0.013
round-0.013
ipop-0.012
 spine-0.012
Token cars
Token ID5006
Feature activation-6.70e+00

Pos logit contributions
 spine+0.046
 to+0.046
 and+0.046
round+0.045
cone+0.044
Neg logit contributions
Looks-0.061
have-0.055
 shin-0.052
 funn-0.051
V-0.050
Token and
Token ID290
Feature activation-1.79e+01

Pos logit contributions
 spine+0.054
 inspection+0.051
 Bob+0.050
ipop+0.049
cone+0.048
Neg logit contributions
Looks-0.270
have-0.268
 shin-0.253
 funn-0.252
V-0.250
Token robots
Token ID14193
Feature activation-7.79e+00

Pos logit contributions
ipop+0.372
round+0.364
 spine+0.345
 and+0.344
cone+0.340
Neg logit contributions
Looks-0.540
have-0.503
 funn-0.480
 shin-0.474
V-0.473
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 to+0.169
 Bob+0.167
 and+0.167
cone+0.165
ipop+0.164
Neg logit contributions
Looks-0.215
have-0.201
 funn-0.188
 shin-0.187
V-0.181
Token He
Token ID679
Feature activation-1.60e+00

Pos logit contributions
 and+0.024
 to+0.023
,+0.021
round+0.020
 Bob+0.020
Neg logit contributions
Looks-0.034
have-0.031
 shin-0.029
 funn-0.029
icious-0.029
Token saw
Token ID2497
Feature activation+6.04e+00

Pos logit contributions
 and+0.037
 to+0.036
,+0.033
round+0.032
 but+0.031
Neg logit contributions
Looks-0.050
have-0.044
icious-0.042
 shin-0.041
 funn-0.041
Token Angel
Token ID3905
Feature activation-2.78e+00

Pos logit contributions
Looks+0.184
have+0.161
icious+0.159
 fullest+0.148
 shin+0.148
Neg logit contributions
 and-0.151
 to-0.150
,-0.132
 but-0.129
 Bob-0.128
Token walk
Token ID2513
Feature activation-3.97e+00

Pos logit contributions
Looks+0.162
have+0.146
 shin+0.140
 funn+0.139
icious+0.137
Neg logit contributions
 and-0.086
 to-0.084
round-0.073
,-0.073
 Bob-0.071
Token home
Token ID1363
Feature activation-1.12e+01

Pos logit contributions
 and+0.047
 to+0.047
cone+0.043
round+0.042
driver+0.041
Neg logit contributions
Looks-0.155
have-0.143
 shin-0.136
 funn-0.135
V-0.133
Token,
Token ID11
Feature activation-1.45e+01

Pos logit contributions
cone+0.142
round+0.141
 and+0.136
 to+0.136
driver+0.130
Neg logit contributions
Looks-0.412
have-0.388
 shin-0.373
 funn-0.361
 fullest-0.357
Token holding
Token ID4769
Feature activation+2.84e+00

Pos logit contributions
round+0.215
driver+0.215
cone+0.214
ipop+0.213
 to+0.201
Neg logit contributions
Looks-0.526
have-0.482
 shin-0.464
 funn-0.453
 fullest-0.446
Token hands
Token ID2832
Feature activation-6.32e+00

Pos logit contributions
Looks+0.089
have+0.081
 funn+0.076
 shin+0.076
V+0.073
Neg logit contributions
 and-0.057
 to-0.057
round-0.054
cone-0.053
driver-0.050
Token and
Token ID290
Feature activation-1.79e+01

Pos logit contributions
cone+0.058
round+0.048
ipop+0.042
 Lee+0.041
 barrel+0.039
Neg logit contributions
Looks-0.247
have-0.235
 shin-0.234
 prett-0.224
 funn-0.221
Token signing
Token ID8415
Feature activation-7.38e-01

Pos logit contributions
 and+0.370
 to+0.367
round+0.332
,+0.322
cone+0.320
Neg logit contributions
Looks-0.582
have-0.522
 shin-0.494
 funn-0.489
icious-0.480
Token.
Token ID13
Feature activation-7.94e-01

Pos logit contributions
 and+0.015
 to+0.015
,+0.013
 Bob+0.013
 but+0.013
Neg logit contributions
Looks-0.025
have-0.023
icious-0.021
 shin-0.021
 funn-0.021
Token Lily
Token ID20037
Feature activation+3.89e+00

Pos logit contributions
 and+0.017
 to+0.017
round+0.015
,+0.015
cone+0.015
Neg logit contributions
Looks-0.026
have-0.023
 shin-0.021
V-0.021
 funn-0.021
Token is
Token ID318
Feature activation+3.71e+00

Pos logit contributions
Looks+0.111
have+0.101
 shin+0.093
 funn+0.092
V+0.091
Neg logit contributions
 to-0.092
 and-0.089
cone-0.086
round-0.085
ipop-0.083
Token happy
Token ID3772
Feature activation+3.64e+00

Pos logit contributions
Looks+0.112
have+0.097
icious+0.093
 shin+0.091
V+0.089
Neg logit contributions
 and-0.094
 to-0.093
,-0.083
 but-0.080
 Bob-0.080
Token They
Token ID1119
Feature activation-1.43e+00

Pos logit contributions
Looks+0.056
have+0.052
icious+0.044
V+0.044
 funn+0.043
Neg logit contributions
cone-0.086
round-0.085
 to-0.082
 and-0.082
 spine-0.082
Token pretend
Token ID16614
Feature activation-4.59e+00

Pos logit contributions
 and+0.033
 to+0.033
,+0.029
round+0.029
 Bob+0.029
Neg logit contributions
Looks-0.043
have-0.039
 shin-0.036
 funn-0.036
icious-0.036
Token to
Token ID284
Feature activation-1.71e+01

Pos logit contributions
cone+0.063
round+0.060
 spine+0.058
driver+0.053
ipop+0.050
Neg logit contributions
Looks-0.157
have-0.153
 sounds-0.152
 funn-0.150
 shin-0.144
Token be
Token ID307
Feature activation+3.41e+00

Pos logit contributions
 and+0.397
 to+0.393
,+0.354
 Bob+0.339
 but+0.334
Neg logit contributions
Looks-0.556
have-0.483
icious-0.465
 shin-0.451
 funn-0.450
Token astronauts
Token ID26835
Feature activation-1.19e+01

Pos logit contributions
Looks+0.108
have+0.094
icious+0.090
 shin+0.089
 funn+0.087
Neg logit contributions
 and-0.081
 to-0.080
,-0.072
 Bob-0.069
 but-0.068
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
cone+0.163
ipop+0.156
round+0.146
 spine+0.135
 Bob+0.133
Neg logit contributions
Looks-0.404
have-0.400
 funn-0.377
 shin-0.375
 sounds-0.367
Token explore
Token ID7301
Feature activation-3.44e+00

Pos logit contributions
 to+0.419
 and+0.418
,+0.371
 but+0.357
 Bob+0.352
Neg logit contributions
Looks-0.554
have-0.489
icious-0.473
 shin-0.460
 funn-0.455
Token the
Token ID262
Feature activation+3.01e+00

Pos logit contributions
 and+0.080
 to+0.077
,+0.071
 but+0.064
 Bob+0.061
Neg logit contributions
Looks-0.121
icious-0.106
have-0.105
 funn-0.100
 shin-0.099
Token universe
Token ID6881
Feature activation-9.19e+00

Pos logit contributions
Looks+0.087
have+0.077
 shin+0.072
 funn+0.072
icious+0.071
Neg logit contributions
 and-0.075
 to-0.074
,-0.067
round-0.065
 Bob-0.065
Token.
Token ID13
Feature activation-6.69e-01

Pos logit contributions
 and+0.186
 to+0.180
,+0.162
 but+0.150
round+0.143
Neg logit contributions
Looks-0.325
have-0.290
icious-0.279
 funn-0.274
 shin-0.273
Token They
Token ID1119
Feature activation-1.63e+00

Pos logit contributions
 and+0.014
 to+0.013
 Bob+0.012
,+0.012
 but+0.012
Neg logit contributions
Looks-0.023
have-0.021
icious-0.020
 shin-0.020
 funn-0.020
Token I
Token ID314
Feature activation-3.04e-01

Pos logit contributions
 and+0.036
 to+0.035
round+0.034
cone+0.033
 spine+0.033
Neg logit contributions
Looks-0.048
have-0.044
 shin-0.041
 funn-0.041
icious-0.039
Token want
Token ID765
Feature activation+2.76e+00

Pos logit contributions
 to+0.008
 and+0.008
,+0.007
 but+0.006
 Bob+0.006
Neg logit contributions
Looks-0.010
icious-0.008
 shin-0.008
have-0.008
V-0.008
Token to
Token ID284
Feature activation-1.67e+01

Pos logit contributions
Looks+0.120
have+0.112
 funn+0.108
 shin+0.107
icious+0.105
Neg logit contributions
 and-0.024
 to-0.021
round-0.019
cone-0.018
,-0.016
Token just
Token ID655
Feature activation-5.25e+00

Pos logit contributions
 and+0.404
 to+0.400
,+0.361
 Bob+0.351
round+0.348
Neg logit contributions
Looks-0.508
have-0.443
 shin-0.417
 funn-0.414
icious-0.414
Token sit
Token ID1650
Feature activation+1.47e+00

Pos logit contributions
 and+0.127
 to+0.125
,+0.113
round+0.112
 Bob+0.110
Neg logit contributions
Looks-0.155
have-0.137
 shin-0.129
 funn-0.128
icious-0.127
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
Looks+0.055
have+0.050
 shin+0.047
 funn+0.047
icious+0.047
Neg logit contributions
 and-0.024
 to-0.024
,-0.020
round-0.020
 Bob-0.020
Token rest
Token ID1334
Feature activation+1.98e+00

Pos logit contributions
 and+0.402
 to+0.401
,+0.355
 Bob+0.344
 but+0.342
Neg logit contributions
Looks-0.572
have-0.504
icious-0.478
 shin-0.473
 funn-0.470
Token."
Token ID526
Feature activation-1.19e+00

Pos logit contributions
Looks+0.063
have+0.056
 shin+0.053
icious+0.053
 funn+0.053
Neg logit contributions
 and-0.045
 to-0.045
,-0.040
 but-0.038
 Bob-0.038
Token
Token ID198
Feature activation+2.94e+00

Pos logit contributions
 to+0.030
 and+0.029
cone+0.028
driver+0.028
round+0.028
Neg logit contributions
Looks-0.032
have-0.028
V-0.025
 shin-0.025
 funn-0.025
Token
Token ID198
Feature activation+3.26e+00

Pos logit contributions
Looks+0.104
 funn+0.094
 shin+0.092
have+0.092
 prett+0.091
Neg logit contributions
 and-0.062
 to-0.060
 Bob-0.054
,-0.052
 but-0.048
TokenCharlie
Token ID37136
Feature activation-8.56e+00

Pos logit contributions
Looks+0.078
 funn+0.069
 shin+0.068
have+0.066
 prett+0.066
Neg logit contributions
 and-0.095
 to-0.092
round-0.087
 Bob-0.085
,-0.084
Token They
Token ID1119
Feature activation-1.87e+00

Pos logit contributions
 and+0.021
 to+0.021
cone+0.019
round+0.019
,+0.018
Neg logit contributions
Looks-0.032
have-0.028
V-0.026
 shin-0.026
 funn-0.026
Token pretended
Token ID36766
Feature activation-1.36e+00

Pos logit contributions
 and+0.044
 to+0.043
,+0.039
 Bob+0.038
round+0.038
Neg logit contributions
Looks-0.057
have-0.051
 shin-0.048
icious-0.048
 funn-0.047
Token to
Token ID284
Feature activation-1.70e+01

Pos logit contributions
 spine+0.020
cone+0.019
round+0.018
ipop+0.016
iker+0.016
Neg logit contributions
Looks-0.046
 sounds-0.045
have-0.044
 funn-0.043
 wonderful-0.042
Token be
Token ID307
Feature activation+3.83e+00

Pos logit contributions
 and+0.408
 to+0.404
,+0.364
 Bob+0.351
 but+0.349
Neg logit contributions
Looks-0.525
have-0.457
icious-0.434
 shin-0.430
 funn-0.426
Token knights
Token ID27536
Feature activation-9.51e+00

Pos logit contributions
Looks+0.118
have+0.105
 shin+0.099
icious+0.098
 funn+0.098
Neg logit contributions
 and-0.089
 to-0.088
,-0.079
 Bob-0.077
round-0.077
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
cone+0.078
round+0.077
ipop+0.075
 spine+0.064
 inspection+0.061
Neg logit contributions
have-0.360
Looks-0.345
 sounds-0.341
 shin-0.331
 funn-0.328
Token shot
Token ID2823
Feature activation+4.03e+00

Pos logit contributions
 and+0.391
 to+0.386
,+0.343
round+0.339
 Bob+0.335
Neg logit contributions
Looks-0.565
have-0.504
 shin-0.476
 funn-0.473
icious-0.471
Token the
Token ID262
Feature activation+3.27e+00

Pos logit contributions
Looks+0.105
have+0.095
 funn+0.092
 shin+0.090
 sounds+0.086
Neg logit contributions
round-0.098
ipop-0.095
cone-0.095
 and-0.093
 to-0.092
Token arrow
Token ID15452
Feature activation-7.05e+00

Pos logit contributions
Looks+0.105
have+0.095
 funn+0.093
 shin+0.092
icious+0.088
Neg logit contributions
 and-0.064
 to-0.064
round-0.060
 Bob-0.058
,-0.056
Token at
Token ID379
Feature activation-6.93e+00

Pos logit contributions
 Bob+0.132
 Lee+0.132
ipop+0.131
round+0.130
cone+0.124
Neg logit contributions
Looks-0.198
 shin-0.194
have-0.192
 funn-0.188
 prett-0.185
Token imaginary
Token ID26726
Feature activation+1.52e+00

Pos logit contributions
 and+0.154
round+0.153
 to+0.152
cone+0.150
ipop+0.149
Neg logit contributions
Looks-0.194
have-0.173
 shin-0.165
 funn-0.165
V-0.163
Token jet
Token ID12644
Feature activation-8.16e+00

Pos logit contributions
round+0.150
driver+0.148
 spine+0.148
ipop+0.147
cone+0.143
Neg logit contributions
Looks-0.205
have-0.189
 shin-0.178
 funn-0.170
V-0.168
Token tried
Token ID3088
Feature activation-8.57e-02

Pos logit contributions
 and+0.152
 to+0.149
,+0.130
round+0.124
 but+0.123
Neg logit contributions
Looks-0.292
have-0.262
icious-0.250
 shin-0.250
 funn-0.248
Token to
Token ID284
Feature activation-1.76e+01

Pos logit contributions
round+0.000
cone+0.000
ipop+0.000
 and+0.000
 Lee+0.000
Neg logit contributions
Looks-0.004
have-0.004
 shin-0.004
 funn-0.004
 fullest-0.003
Token turn
Token ID1210
Feature activation-1.18e+00

Pos logit contributions
 and+0.408
 to+0.402
,+0.361
 Bob+0.353
 but+0.348
Neg logit contributions
Looks-0.558
have-0.489
icious-0.465
 shin-0.459
 funn-0.457
Token around
Token ID1088
Feature activation-8.90e+00

Pos logit contributions
 and+0.019
 to+0.019
round+0.018
cone+0.018
 Bob+0.017
Neg logit contributions
Looks-0.041
have-0.038
 shin-0.037
 funn-0.036
V-0.035
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
ipop+0.136
round+0.135
cone+0.128
ule+0.109
 spine+0.106
Neg logit contributions
Looks-0.300
 shin-0.280
have-0.279
 funn-0.266
 faire-0.266
Token go
Token ID467
Feature activation-3.10e+00

Pos logit contributions
 and+0.367
 to+0.359
round+0.322
,+0.319
cone+0.313
Neg logit contributions
Looks-0.586
have-0.525
 shin-0.499
 funn-0.497
icious-0.486
Token back
Token ID736
Feature activation-8.23e+00

Pos logit contributions
 and+0.052
 to+0.050
cone+0.046
round+0.046
ipop+0.046
Neg logit contributions
Looks-0.108
have-0.099
 shin-0.095
 funn-0.093
V-0.091
Token home
Token ID1363
Feature activation-9.70e+00

Pos logit contributions
cone+0.087
ipop+0.086
round+0.084
 and+0.077
driver+0.071
Neg logit contributions
Looks-0.317
have-0.301
 shin-0.290
 funn-0.283
 prett-0.278
Token,
Token ID11
Feature activation-1.31e+01

Pos logit contributions
 and+0.195
 to+0.192
round+0.174
cone+0.172
 Bob+0.169
Neg logit contributions
Looks-0.316
have-0.285
 shin-0.271
 funn-0.267
icious-0.264
Token but
Token ID475
Feature activation-9.75e+00

Pos logit contributions
ipop+0.210
round+0.209
cone+0.206
driver+0.199
 to+0.182
Neg logit contributions
Looks-0.469
have-0.421
 shin-0.413
 funn-0.411
 prett-0.400
Token to
Token ID284
Feature activation-1.73e+01

Pos logit contributions
Looks+0.058
have+0.053
icious+0.052
 shin+0.051
V+0.051
Neg logit contributions
 to-0.008
 and-0.008
,-0.004
 Bob-0.003
 but-0.003
Token keep
Token ID1394
Feature activation+2.34e+00

Pos logit contributions
 and+0.419
 to+0.411
,+0.378
 Bob+0.361
 but+0.358
Neg logit contributions
Looks-0.528
have-0.454
 shin-0.433
 funn-0.430
icious-0.426
Token trying
Token ID2111
Feature activation-2.69e+00

Pos logit contributions
Looks+0.073
have+0.066
 shin+0.062
 funn+0.061
icious+0.061
Neg logit contributions
 and-0.052
 to-0.051
round-0.046
,-0.045
cone-0.045
Token and
Token ID290
Feature activation-1.81e+01

Pos logit contributions
 and+0.041
 to+0.040
round+0.036
cone+0.036
 Bob+0.035
Neg logit contributions
Looks-0.099
have-0.092
 shin-0.087
 funn-0.086
icious-0.086
Token learning
Token ID4673
Feature activation-1.80e+00

Pos logit contributions
 to+0.422
 and+0.413
,+0.371
 but+0.356
 Bob+0.346
Neg logit contributions
Looks-0.584
have-0.501
icious-0.490
V-0.484
 funn-0.478
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
 and+0.041
 to+0.041
,+0.036
 but+0.034
 Bob+0.032
Neg logit contributions
Looks-0.062
have-0.054
icious-0.052
V-0.051
 shin-0.050
Token having
Token ID1719
Feature activation+4.46e+00

Pos logit contributions
 to+0.457
 and+0.447
,+0.403
 but+0.387
 Bob+0.380
Neg logit contributions
Looks-0.529
have-0.455
icious-0.440
V-0.435
 funn-0.429
Token fun
Token ID1257
Feature activation-2.81e-01

Pos logit contributions
Looks+0.117
have+0.100
icious+0.097
V+0.092
gr+0.090
Neg logit contributions
 to-0.138
 and-0.136
,-0.125
 Bob-0.119
 but-0.118
Token.
Token ID13
Feature activation-6.75e-01

Pos logit contributions
 and+0.006
 to+0.006
round+0.005
,+0.005
 Bob+0.005
Neg logit contributions
Looks-0.009
have-0.008
 shin-0.008
 funn-0.008
icious-0.008
Token That
Token ID1320
Feature activation+8.23e+00

Pos logit contributions
 and+0.017
 to+0.016
,+0.015
 Bob+0.015
 but+0.015
Neg logit contributions
Looks-0.020
have-0.018
 shin-0.017
 funn-0.017
icious-0.016
Token's
Token ID338
Feature activation+3.22e+00

Pos logit contributions
Looks+0.229
have+0.197
icious+0.184
 shin+0.183
V+0.180
Neg logit contributions
 and-0.224
 to-0.219
,-0.202
 but-0.194
 Bob-0.193
Token have
Token ID423
Feature activation+6.44e-01

Pos logit contributions
Looks+0.091
have+0.087
 funn+0.081
 shin+0.081
V+0.077
Neg logit contributions
cone-0.054
round-0.052
ipop-0.049
 to-0.049
 spine-0.047
Token to
Token ID284
Feature activation-1.75e+01

Pos logit contributions
Looks+0.022
have+0.019
icious+0.018
 shin+0.018
 funn+0.018
Neg logit contributions
 to-0.013
 and-0.013
,-0.012
 Bob-0.011
 but-0.011
Token choose
Token ID3853
Feature activation-3.72e+00

Pos logit contributions
 and+0.420
 to+0.414
,+0.377
 Bob+0.363
 but+0.359
Neg logit contributions
Looks-0.536
have-0.465
 shin-0.441
 funn-0.438
icious-0.437
Token to
Token ID284
Feature activation-1.74e+01

Pos logit contributions
 and+0.080
 to+0.078
,+0.070
round+0.069
 Bob+0.068
Neg logit contributions
Looks-0.119
have-0.107
 shin-0.101
 funn-0.100
icious-0.100
Token respect
Token ID2461
Feature activation+2.17e+00

Pos logit contributions
 and+0.414
 to+0.407
,+0.368
 Bob+0.356
 but+0.354
Neg logit contributions
Looks-0.539
have-0.470
 shin-0.445
 funn-0.441
icious-0.439
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
Looks+0.063
have+0.060
 funn+0.055
 shin+0.055
icious+0.053
Neg logit contributions
 spine-0.044
 to-0.043
cone-0.043
round-0.043
 and-0.043
Token learn
Token ID2193
Feature activation+7.68e-01

Pos logit contributions
 and+0.444
 to+0.427
round+0.417
cone+0.404
ipop+0.397
Neg logit contributions
Looks-0.492
have-0.442
 shin-0.412
 funn-0.410
icious-0.395
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
Looks+0.028
have+0.025
icious+0.024
V+0.024
 shin+0.024
Neg logit contributions
 and-0.015
 to-0.015
,-0.013
 but-0.012
 Bob-0.011
Token You
Token ID921
Feature activation+5.41e+00

Pos logit contributions
 and+0.025
 to+0.024
,+0.022
 Bob+0.021
round+0.021
Neg logit contributions
Looks-0.034
have-0.031
 shin-0.029
 funn-0.029
icious-0.029
Token have
Token ID423
Feature activation+2.81e+00

Pos logit contributions
Looks+0.133
have+0.116
 funn+0.107
 shin+0.107
icious+0.105
Neg logit contributions
 and-0.153
 to-0.153
round-0.143
cone-0.140
,-0.140
Token to
Token ID284
Feature activation-1.65e+01

Pos logit contributions
Looks+0.137
have+0.127
 shin+0.123
icious+0.123
 funn+0.122
Neg logit contributions
 and-0.016
 to-0.016
,-0.008
 Bob-0.006
round-0.006
Tokenmy
Token ID1820
Feature activation-8.60e-01

Pos logit contributions
Looks+0.152
have+0.134
 shin+0.127
 funn+0.125
V+0.124
Neg logit contributions
 to-0.143
 and-0.142
round-0.129
,-0.127
 Bob-0.127
Token saw
Token ID2497
Feature activation+5.17e+00

Pos logit contributions
 and+0.018
 to+0.017
,+0.015
round+0.015
 but+0.015
Neg logit contributions
Looks-0.029
have-0.026
 shin-0.024
icious-0.024
 funn-0.024
Token what
Token ID644
Feature activation-2.53e+00

Pos logit contributions
Looks+0.154
have+0.136
 shin+0.128
icious+0.128
 funn+0.127
Neg logit contributions
 and-0.125
 to-0.124
,-0.111
 Bob-0.108
round-0.108
Token had
Token ID550
Feature activation+1.93e+00

Pos logit contributions
 and+0.058
 to+0.057
round+0.055
 spine+0.053
cone+0.053
Neg logit contributions
Looks-0.075
have-0.067
 funn-0.063
 shin-0.063
V-0.060
Token happened
Token ID3022
Feature activation+3.16e+00

Pos logit contributions
Looks+0.058
have+0.050
icious+0.049
V+0.049
 shin+0.047
Neg logit contributions
 to-0.051
 and-0.050
,-0.045
 but-0.042
 Bob-0.041
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
Looks+0.115
have+0.107
 shin+0.100
 funn+0.100
icious+0.098
Neg logit contributions
round-0.046
 and-0.046
 to-0.046
cone-0.045
ipop-0.043
Token told
Token ID1297
Feature activation+5.35e+00

Pos logit contributions
 and+0.380
 to+0.376
,+0.334
 Bob+0.321
 but+0.319
Neg logit contributions
Looks-0.585
have-0.525
icious-0.499
 shin-0.494
 funn-0.492
Token him
Token ID683
Feature activation-2.28e+00

Pos logit contributions
Looks+0.180
have+0.162
 funn+0.155
 shin+0.155
icious+0.150
Neg logit contributions
 and-0.100
 to-0.096
round-0.093
cone-0.090
ipop-0.086
Token that
Token ID326
Feature activation-6.41e+00

Pos logit contributions
round+0.031
cone+0.030
 and+0.030
 spine+0.028
 to+0.026
Neg logit contributions
Looks-0.085
have-0.080
 shin-0.075
 funn-0.075
gr-0.070
Token he
Token ID339
Feature activation-4.69e+00

Pos logit contributions
round+0.103
 spine+0.094
cone+0.091
 and+0.089
ipop+0.088
Neg logit contributions
Looks-0.232
have-0.216
 shin-0.210
 funn-0.206
 faire-0.194
Token should
Token ID815
Feature activation-5.07e+00

Pos logit contributions
 and+0.109
 to+0.106
,+0.094
 Bob+0.089
 but+0.088
Neg logit contributions
Looks-0.151
have-0.133
icious-0.129
 shin-0.126
V-0.125
Token just
Token ID655
Feature activation-5.75e+00

Pos logit contributions
Looks+0.284
have+0.255
icious+0.243
V+0.237
 funn+0.237
Neg logit contributions
 and-0.145
 to-0.138
,-0.122
 but-0.113
 because-0.107
Token need
Token ID761
Feature activation+3.18e+00

Pos logit contributions
 and+0.119
 to+0.118
,+0.104
 Bob+0.100
 but+0.099
Neg logit contributions
Looks-0.194
have-0.173
icious-0.165
 shin-0.165
V-0.164
Token to
Token ID284
Feature activation-1.71e+01

Pos logit contributions
Looks+0.109
have+0.100
 shin+0.095
 funn+0.094
icious+0.092
Neg logit contributions
 and-0.057
 to-0.055
round-0.053
cone-0.050
,-0.048
Token be
Token ID307
Feature activation+3.67e+00

Pos logit contributions
 and+0.413
 to+0.407
,+0.370
 Bob+0.356
 but+0.354
Neg logit contributions
Looks-0.521
have-0.452
 shin-0.429
 funn-0.427
icious-0.424
Token careful
Token ID8161
Feature activation-3.07e+00

Pos logit contributions
Looks+0.102
V+0.086
icious+0.084
have+0.080
 funn+0.078
Neg logit contributions
 and-0.107
 to-0.107
,-0.098
 but-0.091
 because-0.089
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
 to+0.049
 and+0.049
cone+0.049
round+0.048
ipop+0.046
Neg logit contributions
Looks-0.105
have-0.100
 shin-0.093
 funn-0.091
icious-0.091
Token not
Token ID407
Feature activation-1.31e+00

Pos logit contributions
 and+0.456
 to+0.452
,+0.408
round+0.399
 Bob+0.398
Neg logit contributions
Looks-0.504
have-0.441
 shin-0.413
 funn-0.411
icious-0.411
Token scratch
Token ID12692
Feature activation+1.40e+00

Pos logit contributions
 and+0.028
round+0.027
cone+0.027
 to+0.027
ipop+0.026
Neg logit contributions
Looks-0.038
have-0.036
 shin-0.034
 funn-0.033
icious-0.032
Token it
Token ID340
Feature activation-4.37e+00

Pos logit contributions
Looks+0.047
have+0.043
 shin+0.040
 funn+0.040
icious+0.039
Neg logit contributions
 and-0.027
 to-0.027
round-0.024
cone-0.024
,-0.023
Token.
Token ID13
Feature activation-4.51e-01

Pos logit contributions
 and+0.086
 to+0.084
,+0.074
 but+0.071
 Bob+0.069
Neg logit contributions
Looks-0.154
have-0.137
icious-0.130
 shin-0.130
 funn-0.129
Token It
Token ID632
Feature activation-8.60e-01

Pos logit contributions
 and+0.010
 to+0.010
,+0.009
round+0.009
cone+0.008
Neg logit contributions
Looks-0.014
have-0.013
 shin-0.012
 funn-0.012
icious-0.012
Token important
Token ID1593
Feature activation+3.44e+00

Pos logit contributions
Looks+0.355
have+0.325
icious+0.309
 shin+0.306
 funn+0.306
Neg logit contributions
 and-0.118
 to-0.107
,-0.094
 but-0.087
cone-0.083
Token to
Token ID284
Feature activation-1.61e+01

Pos logit contributions
Looks+0.102
have+0.098
 shin+0.090
 funn+0.088
 fullest+0.086
Neg logit contributions
round-0.066
cone-0.065
 and-0.065
 to-0.062
 Bob-0.062
Token ask
Token ID1265
Feature activation+8.26e-01

Pos logit contributions
 and+0.403
 to+0.399
,+0.361
 Bob+0.352
 but+0.348
Neg logit contributions
Looks-0.474
have-0.410
 shin-0.389
icious-0.385
 funn-0.385
Token for
Token ID329
Feature activation-1.65e+00

Pos logit contributions
Looks+0.024
have+0.023
 shin+0.020
 funn+0.020
icious+0.020
Neg logit contributions
round-0.018
cone-0.017
 and-0.017
 to-0.017
ipop-0.016
Token help
Token ID1037
Feature activation-4.21e+00

Pos logit contributions
 to+0.038
 and+0.038
,+0.034
 Bob+0.032
 but+0.032
Neg logit contributions
Looks-0.053
have-0.046
icious-0.045
V-0.045
 funn-0.044
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
 and+0.085
round+0.085
 to+0.084
cone+0.083
ipop+0.083
Neg logit contributions
Looks-0.127
have-0.121
 shin-0.110
icious-0.109
 funn-0.107
Token that
Token ID326
Feature activation-6.61e+00

Pos logit contributions
 and+0.426
 to+0.411
round+0.407
cone+0.387
ipop+0.381
Neg logit contributions
Looks-0.510
have-0.460
 shin-0.428
 funn-0.422
icious-0.410
Token peaceful
Token ID12309
Feature activation+3.96e+00

Pos logit contributions
 and+0.124
 to+0.121
round+0.110
,+0.105
cone+0.105
Neg logit contributions
Looks-0.229
have-0.208
 shin-0.197
 funn-0.196
icious-0.193
Token acts
Token ID6529
Feature activation+9.87e+00

Pos logit contributions
Looks+0.107
have+0.103
 funn+0.095
 sounds+0.093
 shin+0.093
Neg logit contributions
round-0.090
cone-0.087
ipop-0.087
driver-0.083
 to-0.081
Token of
Token ID286
Feature activation-7.51e+00

Pos logit contributions
Looks+0.377
have+0.338
icious+0.325
 shin+0.324
 funn+0.323
Neg logit contributions
 and-0.167
 to-0.164
,-0.139
 but-0.130
 Bob-0.125
Token kindness
Token ID23887
Feature activation+1.85e+00

Pos logit contributions
 and+0.223
 to+0.220
,+0.203
 but+0.197
 Bob+0.196
Neg logit contributions
Looks-0.191
have-0.161
icious-0.152
V-0.151
 shin-0.150
Token mum
Token ID25682
Feature activation-1.03e+01

Pos logit contributions
 and+0.221
 to+0.219
round+0.196
,+0.189
cone+0.188
Neg logit contributions
Looks-0.423
have-0.380
 shin-0.362
 funn-0.360
icious-0.355
Token saw
Token ID2497
Feature activation+4.63e+00

Pos logit contributions
 to+0.179
 and+0.178
round+0.162
cone+0.159
 Bob+0.152
Neg logit contributions
Looks-0.361
have-0.331
 shin-0.310
 funn-0.309
icious-0.306
Token what
Token ID644
Feature activation-2.70e+00

Pos logit contributions
Looks+0.129
have+0.114
 funn+0.107
 shin+0.106
icious+0.102
Neg logit contributions
 and-0.111
 to-0.110
round-0.108
cone-0.105
 spine-0.101
Token had
Token ID550
Feature activation+1.88e+00

Pos logit contributions
 and+0.055
round+0.054
 to+0.053
 spine+0.051
cone+0.051
Neg logit contributions
Looks-0.086
have-0.076
 funn-0.074
 shin-0.072
icious-0.069
Token happened
Token ID3022
Feature activation+3.34e+00

Pos logit contributions
Looks+0.054
have+0.047
V+0.045
icious+0.045
 shin+0.044
Neg logit contributions
 to-0.049
 and-0.049
,-0.044
 but-0.041
 Bob-0.041
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
Looks+0.122
have+0.115
 funn+0.106
 shin+0.106
icious+0.104
Neg logit contributions
round-0.049
cone-0.047
 and-0.045
 to-0.044
ipop-0.044
Token she
Token ID673
Feature activation-4.75e+00

Pos logit contributions
 and+0.385
 to+0.379
round+0.341
,+0.336
cone+0.333
Neg logit contributions
Looks-0.567
have-0.505
 shin-0.478
 funn-0.475
icious-0.467
Token rushed
Token ID13999
Feature activation+2.94e+00

Pos logit contributions
 and+0.108
 to+0.107
,+0.096
round+0.095
 Bob+0.093
Neg logit contributions
Looks-0.147
have-0.131
 shin-0.123
 funn-0.122
icious-0.122
Token to
Token ID284
Feature activation-1.63e+01

Pos logit contributions
Looks+0.080
have+0.065
gr+0.063
V+0.063
 funn+0.061
Neg logit contributions
cone-0.068
ipop-0.065
round-0.065
 spine-0.064
ule-0.064
Token Jem
Token ID48199
Feature activation+2.92e+00

Pos logit contributions
round+0.447
cone+0.433
ipop+0.421
driver+0.395
 spine+0.393
Neg logit contributions
Looks-0.418
have-0.366
 funn-0.331
 shin-0.326
V-0.325
Tokenma
Token ID2611
Feature activation-7.03e+00

Pos logit contributions
Looks+0.083
have+0.075
 shin+0.069
icious+0.068
 funn+0.068
Neg logit contributions
 and-0.070
 to-0.070
round-0.066
 Bob-0.065
cone-0.065
Token matter
Token ID2300
Feature activation+1.79e+00

Pos logit contributions
 to+0.027
 and+0.026
round+0.026
 spine+0.025
cone+0.025
Neg logit contributions
Looks-0.027
have-0.024
 shin-0.023
V-0.023
 funn-0.022
Token.
Token ID13
Feature activation-1.88e+00

Pos logit contributions
Looks+0.075
have+0.068
 shin+0.066
icious+0.065
 funn+0.065
Neg logit contributions
 and-0.022
 to-0.022
,-0.018
 but-0.016
 Bob-0.016
Token You
Token ID921
Feature activation+4.97e+00

Pos logit contributions
 and+0.045
 to+0.044
round+0.040
,+0.040
cone+0.039
Neg logit contributions
Looks-0.055
have-0.049
 shin-0.046
 funn-0.046
icious-0.045
Token should
Token ID815
Feature activation-6.59e+00

Pos logit contributions
Looks+0.145
have+0.126
 shin+0.119
icious+0.119
 funn+0.117
Neg logit contributions
 and-0.127
 to-0.125
,-0.114
 but-0.110
 Bob-0.109
Token share
Token ID2648
Feature activation+6.82e-01

Pos logit contributions
 and+0.167
 to+0.165
,+0.150
round+0.148
 Bob+0.146
Neg logit contributions
Looks-0.187
have-0.164
 shin-0.154
 funn-0.153
icious-0.152
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
Looks+0.021
have+0.019
 shin+0.018
 funn+0.018
icious+0.018
Neg logit contributions
 to-0.014
 and-0.014
round-0.013
cone-0.013
 Bob-0.013
Token be
Token ID307
Feature activation+3.95e+00

Pos logit contributions
 to+0.473
 and+0.471
,+0.425
 but+0.408
 Bob+0.402
Neg logit contributions
Looks-0.502
have-0.432
icious-0.416
 shin-0.412
V-0.410
Token kind
Token ID1611
Feature activation+5.40e+00

Pos logit contributions
Looks+0.107
icious+0.089
V+0.085
its+0.081
 fullest+0.080
Neg logit contributions
 and-0.127
 to-0.126
,-0.114
 but-0.103
 because-0.101
Token to
Token ID284
Feature activation-1.62e+01

Pos logit contributions
have+0.187
Looks+0.187
 funn+0.174
gr+0.173
 shin+0.172
Neg logit contributions
cone-0.075
round-0.067
ipop-0.067
 spine-0.064
stack-0.063
Token each
Token ID1123
Feature activation-5.19e-01

Pos logit contributions
 and+0.535
 to+0.531
,+0.495
 Bob+0.474
 but+0.466
Neg logit contributions
Looks-0.392
 shin-0.305
have-0.301
icious-0.291
V-0.287
Token other
Token ID584
Feature activation+7.59e-01

Pos logit contributions
 and+0.019
 to+0.019
,+0.017
 but+0.017
round+0.017
Neg logit contributions
Looks-0.010
have-0.008
 shin-0.007
icious-0.007
 funn-0.007
Token You
Token ID921
Feature activation+5.05e+00

Pos logit contributions
cone+0.316
round+0.314
 spine+0.307
 and+0.300
driver+0.291
Neg logit contributions
Looks-0.307
have-0.277
 funn-0.267
 shin-0.257
V-0.253
Token have
Token ID423
Feature activation+1.19e+00

Pos logit contributions
Looks+0.154
have+0.131
icious+0.130
 shin+0.127
V+0.126
Neg logit contributions
 and-0.131
 to-0.130
,-0.119
 but-0.111
 Bob-0.108
Token to
Token ID284
Feature activation-1.73e+01

Pos logit contributions
Looks+0.058
have+0.053
icious+0.052
 shin+0.051
V+0.051
Neg logit contributions
 to-0.008
 and-0.008
,-0.004
 Bob-0.003
 but-0.003
Token be
Token ID307
Feature activation+3.41e+00

Pos logit contributions
 and+0.419
 to+0.411
,+0.378
 Bob+0.361
 but+0.358
Neg logit contributions
Looks-0.528
have-0.454
 shin-0.433
 funn-0.430
icious-0.426
Token careful
Token ID8161
Feature activation-3.24e+00

Pos logit contributions
Looks+0.094
icious+0.076
have+0.074
V+0.073
its+0.072
Neg logit contributions
 to-0.101
 and-0.101
,-0.092
 but-0.086
 because-0.084
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
 to+0.050
 and+0.050
cone+0.047
round+0.046
ipop+0.045
Neg logit contributions
Looks-0.116
have-0.109
 shin-0.101
icious-0.101
 funn-0.100
Token ask
Token ID1265
Feature activation+1.20e+00

Pos logit contributions
 to+0.447
 and+0.446
,+0.399
 Bob+0.386
 but+0.386
Neg logit contributions
Looks-0.521
have-0.453
icious-0.426
 shin-0.425
 funn-0.423
Token me
Token ID502
Feature activation-9.61e-01

Pos logit contributions
Looks+0.033
have+0.032
 shin+0.029
icious+0.029
 funn+0.029
Neg logit contributions
round-0.026
 and-0.026
cone-0.026
 to-0.026
ipop-0.025
Token first
Token ID717
Feature activation-1.97e+00

Pos logit contributions
 and+0.017
 to+0.017
round+0.016
cone+0.016
 Bob+0.015
Neg logit contributions
Looks-0.032
have-0.030
 shin-0.028
 funn-0.028
icious-0.028
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
 and+0.043
 to+0.042
,+0.038
 but+0.036
 Bob+0.036
Neg logit contributions
Looks-0.065
have-0.057
 shin-0.054
 funn-0.054
icious-0.054
Token Do
Token ID2141
Feature activation+4.68e+00

Pos logit contributions
 to+0.010
 and+0.010
,+0.009
round+0.009
cone+0.009
Neg logit contributions
Looks-0.012
have-0.011
V-0.010
 shin-0.010
icious-0.010
Token wanted
Token ID2227
Feature activation+4.95e+00

Pos logit contributions
Looks+0.114
have+0.108
 shin+0.106
 funn+0.106
 fullest+0.104
Neg logit contributions
round-0.087
cone-0.082
ipop-0.082
driver-0.079
 and-0.078
Token to
Token ID284
Feature activation-1.79e+01

Pos logit contributions
Looks+0.203
have+0.198
 funn+0.194
 shin+0.185
 sounds+0.183
Neg logit contributions
round-0.035
cone-0.034
 spine-0.032
driver-0.030
 and-0.026
Token keep
Token ID1394
Feature activation+2.12e+00

Pos logit contributions
 and+0.420
 to+0.413
round+0.372
,+0.370
cone+0.366
Neg logit contributions
Looks-0.534
have-0.477
 shin-0.447
 funn-0.445
icious-0.441
Token it
Token ID340
Feature activation-5.49e+00

Pos logit contributions
Looks+0.070
have+0.064
 shin+0.060
 funn+0.059
icious+0.057
Neg logit contributions
 and-0.041
 to-0.040
round-0.039
cone-0.037
ipop-0.036
Token clean
Token ID3424
Feature activation-4.19e+00

Pos logit contributions
 and+0.116
 to+0.116
round+0.113
cone+0.110
 Bob+0.108
Neg logit contributions
Looks-0.162
have-0.150
 shin-0.143
 funn-0.139
icious-0.136
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
 to+0.067
round+0.066
cone+0.066
ipop+0.065
 and+0.063
Neg logit contributions
Looks-0.143
have-0.134
 shin-0.129
 funn-0.126
icious-0.122
Token warm
Token ID5814
Feature activation+1.68e+00

Pos logit contributions
 to+0.458
 and+0.448
,+0.403
 but+0.400
 because+0.377
Neg logit contributions
Looks-0.542
V-0.473
icious-0.467
have-0.454
Tr-0.439
Token always
Token ID1464
Feature activation-4.46e+00

Pos logit contributions
Looks+0.046
have+0.043
 shin+0.040
 funn+0.039
 prett+0.038
Neg logit contributions
cone-0.037
round-0.036
ipop-0.036
driver-0.034
 and-0.034
Token.
Token ID13
Feature activation-6.01e-01

Pos logit contributions
 and+0.105
 to+0.102
,+0.096
 but+0.088
 because+0.087
Neg logit contributions
Looks-0.152
V-0.135
icious-0.134
 fullest-0.124
have-0.123
Token The
Token ID383
Feature activation+8.72e+00

Pos logit contributions
 and+0.014
 to+0.014
round+0.013
,+0.012
cone+0.012
Neg logit contributions
Looks-0.018
have-0.016
 shin-0.015
 funn-0.015
V-0.015
Token end
Token ID886
Feature activation+3.90e-01

Pos logit contributions
Looks+0.272
have+0.239
icious+0.226
 shin+0.224
 funn+0.223
Neg logit contributions
 and-0.204
 to-0.198
,-0.179
 but-0.173
 Bob-0.172
TokenShe
Token ID3347
Feature activation-1.51e+00

Pos logit contributions
 to+0.087
 and+0.086
round+0.084
cone+0.084
 spine+0.081
Neg logit contributions
Looks-0.111
have-0.101
 shin-0.095
 funn-0.094
V-0.090
Tokenlly
Token ID12810
Feature activation+3.90e+00

Pos logit contributions
 and+0.036
 to+0.035
,+0.032
 Bob+0.031
 but+0.030
Neg logit contributions
Looks-0.046
have-0.041
icious-0.039
 shin-0.039
 funn-0.038
Token thought
Token ID1807
Feature activation+4.18e+00

Pos logit contributions
Looks+0.117
have+0.103
 shin+0.097
 funn+0.096
icious+0.096
Neg logit contributions
 and-0.092
 to-0.091
,-0.082
round-0.081
 Bob-0.080
Token and
Token ID290
Feature activation-1.79e+01

Pos logit contributions
Looks+0.129
have+0.121
 shin+0.119
 funn+0.114
 wonderful+0.112
Neg logit contributions
round-0.081
cone-0.073
 spine-0.068
 Lee-0.062
stack-0.062
Token thought
Token ID1807
Feature activation+4.76e+00

Pos logit contributions
 and+0.354
 to+0.352
round+0.318
cone+0.310
,+0.305
Neg logit contributions
Looks-0.599
have-0.538
 shin-0.512
 funn-0.505
icious-0.495
Token and
Token ID290
Feature activation-1.78e+01

Pos logit contributions
Looks+0.156
 shin+0.147
have+0.144
 wonderful+0.138
 mistakes+0.137
Neg logit contributions
round-0.083
cone-0.068
 vendor-0.065
 spine-0.063
 barrel-0.062
Token then
Token ID788
Feature activation-8.48e+00

Pos logit contributions
 and+0.336
 to+0.332
round+0.295
,+0.288
cone+0.284
Neg logit contributions
Looks-0.617
have-0.556
 shin-0.528
 funn-0.523
icious-0.517
Token she
Token ID673
Feature activation-4.61e+00

Pos logit contributions
round+0.137
 and+0.127
 to+0.121
cone+0.120
 spine+0.116
Neg logit contributions
Looks-0.307
have-0.278
 shin-0.275
 funn-0.265
V-0.254
Token had
Token ID550
Feature activation+3.18e+00

Pos logit contributions
 and+0.098
 to+0.096
,+0.085
 but+0.079
 Bob+0.078
Neg logit contributions
Looks-0.156
have-0.140
icious-0.135
 funn-0.131
 shin-0.130
Token an
Token ID281
Feature activation+7.44e+00

Pos logit contributions
Looks+0.100
have+0.087
icious+0.086
V+0.082
 shin+0.082
Neg logit contributions
 to-0.079
 and-0.076
,-0.067
 Bob-0.063
 but-0.063
Token idea
Token ID2126
Feature activation-1.04e+00

Pos logit contributions
Looks+0.149
have+0.131
 funn+0.118
 shin+0.118
icious+0.113
Neg logit contributions
 to-0.243
 and-0.242
round-0.228
 Bob-0.224
cone-0.223
Token rocked
Token ID36872
Feature activation+5.90e+00

Pos logit contributions
 and+0.051
 to+0.051
,+0.045
round+0.045
 Bob+0.044
Neg logit contributions
Looks-0.067
have-0.060
 shin-0.056
 funn-0.056
icious-0.055
Token him
Token ID683
Feature activation+7.09e-02

Pos logit contributions
Looks+0.185
have+0.176
 shin+0.173
V+0.171
 funn+0.167
Neg logit contributions
round-0.098
 spine-0.097
cone-0.093
 grill-0.091
 to-0.090
Token,
Token ID11
Feature activation-1.04e+01

Pos logit contributions
have+0.002
 shin+0.002
 prett+0.002
Looks+0.002
 funn+0.002
Neg logit contributions
round-0.001
 Jim-0.001
 vendor-0.001
 bull-0.001
 Lee-0.001
Token Tim
Token ID5045
Feature activation+3.13e+00

Pos logit contributions
round+0.211
 spine+0.192
 and+0.189
 to+0.179
cone+0.178
Neg logit contributions
Looks-0.352
 shin-0.303
have-0.300
 funn-0.290
V-0.280
Tokenmy
Token ID1820
Feature activation+2.12e-01

Pos logit contributions
Looks+0.088
have+0.077
 shin+0.072
icious+0.072
 funn+0.072
Neg logit contributions
 and-0.081
 to-0.080
,-0.073
round-0.071
 but-0.071
Token felt
Token ID2936
Feature activation+1.18e+01

Pos logit contributions
Looks+0.006
have+0.006
 shin+0.005
 funn+0.005
V+0.005
Neg logit contributions
 to-0.005
 and-0.005
round-0.005
 Bob-0.005
,-0.004
Token better
Token ID1365
Feature activation+2.10e+00

Pos logit contributions
Looks+0.406
icious+0.355
have+0.340
V+0.329
its+0.321
Neg logit contributions
 and-0.295
 to-0.284
,-0.264
 because-0.249
 but-0.238
Token and
Token ID290
Feature activation-1.47e+01

Pos logit contributions
Looks+0.083
have+0.081
 shin+0.078
 funn+0.074
 fullest+0.074
Neg logit contributions
 Bob-0.021
round-0.020
cone-0.020
 barrel-0.018
ipop-0.018
Token fell
Token ID3214
Feature activation+2.46e+00

Pos logit contributions
 and+0.300
 to+0.294
,+0.259
 but+0.240
 Bob+0.230
Neg logit contributions
Looks-0.515
icious-0.474
have-0.459
V-0.451
gr-0.447
Token asleep
Token ID16039
Feature activation-1.73e+00

Pos logit contributions
Looks+0.083
have+0.075
 shin+0.074
 funn+0.072
V+0.071
Neg logit contributions
 and-0.044
 to-0.043
round-0.041
 Bob-0.040
cone-0.040
Token holding
Token ID4769
Feature activation+6.63e+00

Pos logit contributions
 and+0.033
 to+0.033
round+0.030
 Bob+0.029
cone+0.029
Neg logit contributions
Looks-0.058
have-0.052
 shin-0.050
 funn-0.049
icious-0.049
Token first
Token ID717
Feature activation+6.57e-01

Pos logit contributions
round+0.146
cone+0.145
ipop+0.139
chair+0.133
driver+0.132
Neg logit contributions
Looks-0.082
have-0.072
 funn-0.066
 shin-0.062
 yourself-0.061
Token,
Token ID11
Feature activation-1.03e+01

Pos logit contributions
Looks+0.022
have+0.022
 shin+0.020
 funn+0.020
 sounds+0.019
Neg logit contributions
cone-0.010
round-0.009
ipop-0.008
driver-0.008
 bull-0.007
Token but
Token ID475
Feature activation-7.54e+00

Pos logit contributions
ipop+0.143
round+0.128
driver+0.122
cone+0.121
op+0.105
Neg logit contributions
Looks-0.400
 shin-0.359
 funn-0.355
have-0.355
 prett-0.333
Token then
Token ID788
Feature activation-5.95e+00

Pos logit contributions
round+0.094
 to+0.091
 and+0.090
driver+0.086
cone+0.086
Neg logit contributions
Looks-0.300
have-0.269
 shin-0.267
 funn-0.261
 faire-0.248
Token he
Token ID339
Feature activation-2.45e+00

Pos logit contributions
 and+0.063
 to+0.061
round+0.050
cone+0.046
,+0.045
Neg logit contributions
Looks-0.255
have-0.234
 shin-0.226
 funn-0.224
icious-0.220
Token replied
Token ID8712
Feature activation+1.21e+01

Pos logit contributions
 and+0.056
 to+0.053
,+0.049
 but+0.045
 Bob+0.044
Neg logit contributions
Looks-0.081
have-0.073
icious-0.071
V-0.068
 fullest-0.068
Token "
Token ID366
Feature activation+3.47e+00

Pos logit contributions
Looks+0.543
have+0.513
 shin+0.498
 funn+0.490
V+0.478
Neg logit contributions
round-0.069
 and-0.063
cone-0.059
 to-0.053
 spine-0.047
TokenI
Token ID40
Feature activation+4.63e+00

Pos logit contributions
Looks+0.079
icious+0.074
 expiration+0.072
 shin+0.070
 funn+0.070
Neg logit contributions
 to-0.094
 and-0.093
round-0.088
 but-0.087
 Bob-0.084
Token was
Token ID373
Feature activation+7.43e+00

Pos logit contributions
Looks+0.118
have+0.103
 shin+0.095
 funn+0.094
icious+0.093
Neg logit contributions
 and-0.129
 to-0.128
round-0.118
,-0.117
cone-0.116
Token just
Token ID655
Feature activation-1.83e+00

Pos logit contributions
Looks+0.227
have+0.205
 funn+0.192
 shin+0.192
icious+0.187
Neg logit contributions
 and-0.164
 to-0.161
round-0.151
cone-0.148
ipop-0.143
Token looking
Token ID2045
Feature activation+2.19e+00

Pos logit contributions
 and+0.037
 to+0.037
round+0.033
cone+0.033
,+0.032
Neg logit contributions
Looks-0.060
have-0.054
 shin-0.051
 funn-0.051
icious-0.050
Tokeno
Token ID78
Feature activation-2.27e+00

Pos logit contributions
Looks+0.031
icious+0.026
have+0.026
 shin+0.025
 funn+0.025
Neg logit contributions
 and-0.025
 to-0.024
,-0.022
round-0.021
ipop-0.021
Token.
Token ID13
Feature activation+8.18e-01

Pos logit contributions
 and+0.049
 to+0.046
,+0.042
 but+0.038
round+0.038
Neg logit contributions
Looks-0.082
have-0.070
icious-0.070
 funn-0.068
 shin-0.067
Token
Token ID220
Feature activation+9.83e+00

Pos logit contributions
Looks+0.027
have+0.024
 shin+0.023
 funn+0.022
V+0.022
Neg logit contributions
 and-0.017
 to-0.017
round-0.015
,-0.015
cone-0.015
Token
Token ID198
Feature activation+4.53e+00

Pos logit contributions
Looks+0.389
 funn+0.344
have+0.339
icious+0.336
 shin+0.335
Neg logit contributions
 and-0.189
 to-0.188
,-0.155
 Bob-0.149
 but-0.137
Token
Token ID198
Feature activation+4.51e+00

Pos logit contributions
Looks+0.157
 funn+0.148
 shin+0.145
have+0.144
 prett+0.143
Neg logit contributions
 and-0.089
 to-0.083
 Bob-0.078
,-0.072
 but-0.071
TokenL
Token ID43
Feature activation+1.23e+01

Pos logit contributions
Looks+0.102
 funn+0.093
 shin+0.091
 prett+0.087
have+0.087
Neg logit contributions
 and-0.135
 to-0.130
round-0.123
 Bob-0.120
 but-0.120
Tokenily
Token ID813
Feature activation+3.07e+00

Pos logit contributions
Looks+0.357
 funn+0.318
 shin+0.304
 prett+0.296
 fullest+0.294
Neg logit contributions
 and-0.330
 to-0.285
,-0.271
 but-0.262
round-0.260
Token felt
Token ID2936
Feature activation+1.11e+01

Pos logit contributions
Looks+0.084
have+0.076
 shin+0.070
 funn+0.070
V+0.068
Neg logit contributions
 to-0.076
round-0.072
 and-0.072
cone-0.070
 Bob-0.068
Token better
Token ID1365
Feature activation+1.40e+00

Pos logit contributions
Looks+0.339
icious+0.295
V+0.286
have+0.281
ymes+0.268
Neg logit contributions
 and-0.322
 to-0.299
,-0.295
 because-0.272
 but-0.261
Token and
Token ID290
Feature activation-1.54e+01

Pos logit contributions
Looks+0.051
have+0.050
gr+0.046
 shin+0.046
 fullest+0.045
Neg logit contributions
cone-0.019
round-0.018
 barrel-0.017
ipop-0.016
driver-0.016
Token they
Token ID484
Feature activation-3.66e+00

Pos logit contributions
 and+0.351
 to+0.346
,+0.308
 Bob+0.295
 but+0.294
Neg logit contributions
Looks-0.485
have-0.431
icious-0.413
 funn-0.403
 shin-0.403
Token than
Token ID621
Feature activation-5.85e+00

Pos logit contributions
have+0.050
Looks+0.048
 shin+0.046
 yourself+0.046
 sounds+0.044
Neg logit contributions
ipop-0.042
round-0.041
stack-0.041
cone-0.041
 Bob-0.038
Token yours
Token ID12431
Feature activation+6.92e+00

Pos logit contributions
 and+0.172
 to+0.166
round+0.166
cone+0.165
ipop+0.159
Neg logit contributions
Looks-0.129
have-0.112
 shin-0.105
 funn-0.100
icious-0.097
Token!"
Token ID2474
Feature activation+1.17e+00

Pos logit contributions
Looks+0.165
have+0.159
 shin+0.145
 funn+0.144
icious+0.139
Neg logit contributions
round-0.166
 to-0.165
cone-0.164
 and-0.163
ipop-0.158
Token
Token ID198
Feature activation+5.29e+00

Pos logit contributions
Looks+0.033
have+0.029
 shin+0.027
 funn+0.027
V+0.027
Neg logit contributions
 and-0.029
 to-0.029
round-0.027
,-0.026
cone-0.026
Token
Token ID198
Feature activation+5.32e+00

Pos logit contributions
Looks+0.244
 funn+0.227
icious+0.227
 prett+0.219
 fullest+0.217
Neg logit contributions
 and-0.093
 to-0.083
,-0.072
 but-0.051
 Bob-0.045
TokenL
Token ID43
Feature activation+1.29e+01

Pos logit contributions
Looks+0.132
 funn+0.127
 fullest+0.117
 prett+0.117
 faire+0.117
Neg logit contributions
 and-0.167
 to-0.148
,-0.143
 but-0.136
 while-0.130
Tokenily
Token ID813
Feature activation+3.58e+00

Pos logit contributions
Looks+0.353
 funn+0.293
have+0.289
 shin+0.286
icious+0.285
Neg logit contributions
 and-0.378
 to-0.352
,-0.332
 but-0.316
ipop-0.306
Token felt
Token ID2936
Feature activation+1.18e+01

Pos logit contributions
Looks+0.088
have+0.079
 funn+0.075
 shin+0.075
V+0.073
Neg logit contributions
cone-0.094
round-0.094
 to-0.093
 spine-0.092
 Bob-0.090
Token hurt
Token ID5938
Feature activation+6.35e+00

Pos logit contributions
Looks+0.375
icious+0.342
have+0.305
 fullest+0.302
V+0.291
Neg logit contributions
 and-0.329
 to-0.308
,-0.306
 but-0.269
 because-0.268
Token by
Token ID416
Feature activation-5.26e+00

Pos logit contributions
Looks+0.280
have+0.265
 shin+0.254
V+0.250
 funn+0.250
Neg logit contributions
cone-0.043
ipop-0.043
stack-0.041
round-0.040
 Bob-0.034
Token Ben
Token ID3932
Feature activation+2.42e+00

Pos logit contributions
 to+0.124
 and+0.121
round+0.116
cone+0.111
ipop+0.110
Neg logit contributions
Looks-0.156
have-0.136
 shin-0.131
 funn-0.128
V-0.126
Token washing
Token ID20518
Feature activation-1.36e+00

Pos logit contributions
cone+0.155
 and+0.152
 to+0.152
round+0.147
ipop+0.143
Neg logit contributions
Looks-0.215
have-0.204
 funn-0.185
 shin-0.184
icious-0.184
Token it
Token ID340
Feature activation-2.41e+00

Pos logit contributions
 and+0.032
 to+0.032
round+0.029
,+0.029
 Bob+0.028
Neg logit contributions
Looks-0.041
have-0.036
 shin-0.034
 funn-0.034
icious-0.033
Token."
Token ID526
Feature activation+1.87e+00

Pos logit contributions
 and+0.054
 to+0.053
,+0.048
round+0.047
 Bob+0.047
Neg logit contributions
Looks-0.075
have-0.067
 shin-0.063
 funn-0.063
icious-0.062
Token
Token ID198
Feature activation+4.94e+00

Pos logit contributions
Looks+0.051
have+0.044
 shin+0.041
V+0.041
 funn+0.040
Neg logit contributions
 to-0.046
 and-0.045
cone-0.044
round-0.042
driver-0.042
Token
Token ID198
Feature activation+5.02e+00

Pos logit contributions
Looks+0.188
 funn+0.171
 prett+0.166
 shin+0.163
have+0.163
Neg logit contributions
 and-0.101
 to-0.093
,-0.080
 Bob-0.080
 but-0.073
TokenL
Token ID43
Feature activation+1.26e+01

Pos logit contributions
Looks+0.150
 funn+0.135
 prett+0.131
 shin+0.130
have+0.127
Neg logit contributions
 and-0.122
 to-0.112
round-0.109
,-0.101
 but-0.100
Tokenily
Token ID813
Feature activation+3.59e+00

Pos logit contributions
Looks+0.300
have+0.245
 funn+0.239
 shin+0.238
icious+0.227
Neg logit contributions
 and-0.394
 to-0.381
,-0.354
round-0.352
 Bob-0.344
Token smiled
Token ID13541
Feature activation+1.02e+01

Pos logit contributions
Looks+0.097
have+0.087
 shin+0.081
 funn+0.080
V+0.079
Neg logit contributions
 to-0.090
 and-0.085
round-0.085
cone-0.085
 spine-0.083
Token and
Token ID290
Feature activation-1.45e+01

Pos logit contributions
Looks+0.447
 shin+0.420
have+0.417
 funn+0.413
 prett+0.399
Neg logit contributions
round-0.067
cone-0.065
 to-0.052
ipop-0.052
driver-0.050
Token put
Token ID1234
Feature activation+6.29e+00

Pos logit contributions
 and+0.350
 to+0.343
,+0.311
 Bob+0.302
round+0.298
Neg logit contributions
Looks-0.437
have-0.389
icious-0.366
 shin-0.362
 funn-0.360
Token the
Token ID262
Feature activation+5.96e+00

Pos logit contributions
Looks+0.215
have+0.194
 shin+0.187
 funn+0.184
V+0.181
Neg logit contributions
 and-0.115
 to-0.115
round-0.106
cone-0.100
 spine-0.099
Token tri
Token ID1333
Feature activation+7.83e+00

Pos logit contributions
round+0.348
cone+0.333
 and+0.331
ipop+0.326
 to+0.325
Neg logit contributions
Looks-0.470
have-0.408
 shin-0.397
 funn-0.386
 prett-0.363
Tokenpped
Token ID1496
Feature activation-2.10e+00

Pos logit contributions
Looks+0.218
have+0.190
 shin+0.179
 funn+0.177
icious+0.176
Neg logit contributions
 and-0.206
 to-0.203
,-0.185
round-0.182
 Bob-0.180
Token over
Token ID625
Feature activation-3.80e+00

Pos logit contributions
ipop+0.043
round+0.042
 spine+0.038
cone+0.037
 vendor+0.037
Neg logit contributions
Looks-0.060
V-0.055
 shin-0.053
have-0.052
End-0.050
Token her
Token ID607
Feature activation+3.47e+00

Pos logit contributions
round+0.107
cone+0.099
ipop+0.098
 spine+0.094
driver+0.093
Neg logit contributions
Looks-0.098
have-0.078
 shin-0.077
 funn-0.074
V-0.073
Token mom
Token ID1995
Feature activation-9.18e+00

Pos logit contributions
Looks+0.101
 shin+0.098
have+0.095
 funn+0.093
 prett+0.092
Neg logit contributions
round-0.064
 to-0.063
cone-0.062
 and-0.061
 Bob-0.061
Token's
Token ID338
Feature activation+4.26e+00

Pos logit contributions
 inspection+0.292
ipop+0.292
 vendor+0.287
round+0.285
stack+0.279
Neg logit contributions
Looks-0.125
have-0.123
 shin-0.106
 funn-0.099
 lo-0.092
Token foot
Token ID2366
Feature activation-3.25e+00

Pos logit contributions
Looks+0.118
have+0.115
 shin+0.113
 funn+0.107
 prett+0.103
Neg logit contributions
round-0.086
ipop-0.083
 Bob-0.083
cone-0.082
 to-0.081
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
round+0.074
cone+0.073
 Bob+0.073
ipop+0.072
 Lee+0.072
Neg logit contributions
Looks-0.081
have-0.075
 shin-0.072
 funn-0.066
gr-0.065
Token
Token ID220
Feature activation+9.15e+00

Pos logit contributions
Looks+0.035
have+0.031
 shin+0.029
V+0.029
 funn+0.029
Neg logit contributions
 and-0.023
 to-0.022
round-0.020
cone-0.020
,-0.020
Token
Token ID198
Feature activation+3.74e+00

Pos logit contributions
Looks+0.333
 funn+0.297
have+0.295
 shin+0.291
icious+0.286
Neg logit contributions
 and-0.192
 to-0.188
,-0.164
 Bob-0.160
 but-0.153
Token
Token ID198
Feature activation+3.72e+00

Pos logit contributions
Looks+0.127
 funn+0.113
have+0.112
 shin+0.111
icious+0.109
Neg logit contributions
 and-0.084
 to-0.082
,-0.073
 Bob-0.069
 but-0.068
Token return
Token ID1441
Feature activation+1.93e+00

Pos logit contributions
 and+0.087
 to+0.086
round+0.077
,+0.077
 Bob+0.076
Neg logit contributions
Looks-0.104
have-0.092
 shin-0.086
 funn-0.086
icious-0.085
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
Looks+0.054
have+0.050
 shin+0.048
 yourself+0.047
 funn+0.047
Neg logit contributions
cone-0.040
round-0.039
ipop-0.038
driver-0.037
 spine-0.036
Token They
Token ID1119
Feature activation+1.26e-01

Pos logit contributions
Looks+0.042
have+0.036
V+0.034
 shin+0.033
 funn+0.033
Neg logit contributions
 and-0.036
 to-0.035
round-0.032
,-0.032
cone-0.032
Token are
Token ID389
Feature activation+7.19e+00

Pos logit contributions
Looks+0.003
have+0.003
 funn+0.003
 shin+0.003
V+0.003
Neg logit contributions
 to-0.003
 and-0.003
round-0.003
cone-0.003
ipop-0.003
Token a
Token ID257
Feature activation+6.93e+00

Pos logit contributions
Looks+0.187
have+0.165
 shin+0.154
 funn+0.154
icious+0.148
Neg logit contributions
 and-0.193
 to-0.191
round-0.181
cone-0.179
,-0.174
Token happy
Token ID3772
Feature activation+5.18e+00

Pos logit contributions
Looks+0.151
have+0.128
 shin+0.122
 funn+0.122
V+0.117
Neg logit contributions
 to-0.212
 and-0.211
round-0.199
cone-0.196
driver-0.195
Token family
Token ID1641
Feature activation-6.85e+00

Pos logit contributions
Looks+0.126
 shin+0.120
have+0.119
 wonderful+0.118
 sounds+0.117
Neg logit contributions
cone-0.118
driver-0.117
ipop-0.109
round-0.108
stack-0.102
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
cone+0.125
driver+0.116
 inspection+0.113
ipop+0.111
round+0.111
Neg logit contributions
Looks-0.194
have-0.189
 funn-0.180
 shin-0.177
 wonderful-0.175
Token<|endoftext|>
Token ID50256
Feature activation+5.35e+00

Pos logit contributions
Looks+0.056
have+0.049
 shin+0.046
 funn+0.046
icious+0.046
Neg logit contributions
 and-0.044
 to-0.044
,-0.039
round-0.039
cone-0.038
TokenIt
Token ID1026
Feature activation+3.33e+00

Pos logit contributions
Looks+0.149
have+0.130
 shin+0.124
icious+0.124
 funn+0.123
Neg logit contributions
 and-0.150
 to-0.146
,-0.135
 Bob-0.130
 but-0.126
Token was
Token ID373
Feature activation+7.21e+00

Pos logit contributions
Looks+0.093
icious+0.082
have+0.077
 shin+0.077
 funn+0.077
Neg logit contributions
 and-0.101
,-0.093
 to-0.093
 Bob-0.081
 but-0.079
Token chew
Token ID34722
Feature activation+5.00e+00

Pos logit contributions
 and+0.332
 to+0.330
round+0.296
,+0.290
cone+0.290
Neg logit contributions
Looks-0.422
have-0.380
 shin-0.356
icious-0.353
 funn-0.352
Token on
Token ID319
Feature activation-6.84e+00

Pos logit contributions
Looks+0.157
have+0.147
 shin+0.140
 funn+0.136
V+0.134
Neg logit contributions
 to-0.093
round-0.090
 and-0.090
 spine-0.089
ipop-0.085
Token the
Token ID262
Feature activation+6.04e+00

Pos logit contributions
 and+0.156
 to+0.155
round+0.142
,+0.138
cone+0.136
Neg logit contributions
Looks-0.205
have-0.183
 shin-0.174
 funn-0.171
icious-0.170
Token fence
Token ID13990
Feature activation-6.25e+00

Pos logit contributions
Looks+0.229
have+0.209
 shin+0.200
 funn+0.200
icious+0.199
Neg logit contributions
 and-0.091
 to-0.090
round-0.076
,-0.076
 Bob-0.075
Token to
Token ID284
Feature activation-1.41e+01

Pos logit contributions
cone+0.127
ipop+0.124
round+0.120
 inspection+0.120
 spine+0.119
Neg logit contributions
Looks-0.170
have-0.158
 funn-0.157
 shin-0.157
icious-0.150
Token be
Token ID307
Feature activation+6.96e+00

Pos logit contributions
 and+0.316
 to+0.308
,+0.280
 Bob+0.273
 but+0.269
Neg logit contributions
Looks-0.463
have-0.408
 shin-0.382
 funn-0.381
icious-0.378
Token strong
Token ID1913
Feature activation+4.37e+00

Pos logit contributions
Looks+0.188
have+0.155
icious+0.149
V+0.148
 shin+0.145
Neg logit contributions
 and-0.202
 to-0.199
,-0.184
 but-0.175
 Bob-0.173
Token,
Token ID11
Feature activation-1.07e+01

Pos logit contributions
Looks+0.124
have+0.122
 shin+0.116
 funn+0.114
gr+0.106
Neg logit contributions
ipop-0.091
cone-0.087
round-0.085
 inspection-0.082
 spine-0.080
Token Max
Token ID5436
Feature activation-2.10e-01

Pos logit contributions
round+0.155
ipop+0.148
cone+0.140
 to+0.139
 inspection+0.134
Neg logit contributions
Looks-0.386
 shin-0.361
 funn-0.358
have-0.357
 faire-0.341
Token.
Token ID13
Feature activation+1.85e+00

Pos logit contributions
 and+0.005
 to+0.005
,+0.004
round+0.004
 Bob+0.004
Neg logit contributions
Looks-0.007
have-0.006
 shin-0.005
icious-0.005
 funn-0.005
Token Just
Token ID2329
Feature activation+1.03e+00

Pos logit contributions
Looks+0.051
have+0.044
V+0.042
icious+0.042
 shin+0.042
Neg logit contributions
 to-0.048
 and-0.047
,-0.043
round-0.042
ipop-0.042
Token the
Token ID262
Feature activation+5.33e+00

Pos logit contributions
 and+0.072
 to+0.071
round+0.068
cone+0.066
ipop+0.064
Neg logit contributions
Looks-0.120
have-0.107
 shin-0.103
 funn-0.102
icious-0.098
Token different
Token ID1180
Feature activation+6.91e+00

Pos logit contributions
Looks+0.149
have+0.131
icious+0.121
V+0.121
 shin+0.117
Neg logit contributions
 and-0.145
 to-0.142
,-0.129
 but-0.126
 Bob-0.123
Token things
Token ID1243
Feature activation-6.27e+00

Pos logit contributions
Looks+0.185
have+0.161
 shin+0.149
icious+0.149
 funn+0.148
Neg logit contributions
 and-0.188
 to-0.186
,-0.170
round-0.166
 Bob-0.165
Token he
Token ID339
Feature activation-2.84e+00

Pos logit contributions
round+0.114
cone+0.107
ipop+0.104
 spine+0.104
 Lee+0.102
Neg logit contributions
Looks-0.199
 shin-0.183
have-0.175
 funn-0.174
 prett-0.173
Token could
Token ID714
Feature activation-1.48e+00

Pos logit contributions
 and+0.059
 to+0.057
,+0.050
 but+0.048
 Bob+0.047
Neg logit contributions
Looks-0.098
have-0.088
icious-0.086
V-0.084
 funn-0.083
Token see
Token ID766
Feature activation+2.23e+00

Pos logit contributions
 and+0.033
 to+0.033
round+0.031
cone+0.031
ipop+0.030
Neg logit contributions
Looks-0.044
have-0.039
 shin-0.036
 funn-0.036
icious-0.036
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
Looks+0.076
have+0.068
 shin+0.064
icious+0.064
 funn+0.064
Neg logit contributions
 and-0.045
 to-0.044
,-0.039
round-0.037
 Bob-0.037
Token
Token ID220
Feature activation+1.02e+01

Pos logit contributions
Looks+0.046
have+0.038
V+0.037
Everyone+0.035
icious+0.035
Neg logit contributions
 and-0.038
 to-0.037
cone-0.036
round-0.036
ipop-0.036
Token
Token ID198
Feature activation+4.64e+00

Pos logit contributions
Looks+0.328
have+0.291
 funn+0.283
 shin+0.279
icious+0.276
Neg logit contributions
 and-0.235
 to-0.231
,-0.206
 Bob-0.205
 but-0.195
Token
Token ID198
Feature activation+4.70e+00

Pos logit contributions
Looks+0.148
have+0.132
 funn+0.130
 shin+0.130
icious+0.126
Neg logit contributions
 and-0.107
 to-0.105
,-0.092
 Bob-0.092
 but-0.089
TokenJohn
Token ID7554
Feature activation-1.09e+00

Pos logit contributions
Looks+0.123
have+0.107
 shin+0.103
 funn+0.103
icious+0.100
Neg logit contributions
 and-0.130
 to-0.128
,-0.115
round-0.115
 Bob-0.114
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
cone+0.090
round+0.083
 Bob+0.082
 bull+0.081
 inspection+0.080
Neg logit contributions
Looks-0.142
 shin-0.138
have-0.136
 prett-0.133
 funn-0.131
Token He
Token ID679
Feature activation+9.16e-01

Pos logit contributions
Looks+0.069
have+0.063
 shin+0.061
V+0.060
 funn+0.060
Neg logit contributions
 and-0.032
 to-0.031
cone-0.030
round-0.028
,-0.028
Token was
Token ID373
Feature activation+7.47e+00

Pos logit contributions
Looks+0.027
have+0.025
 shin+0.023
 funn+0.022
V+0.022
Neg logit contributions
 to-0.020
 and-0.020
round-0.019
cone-0.019
 Bob-0.019
Token going
Token ID1016
Feature activation+9.49e-01

Pos logit contributions
Looks+0.237
have+0.216
 shin+0.204
 funn+0.201
icious+0.193
Neg logit contributions
 and-0.155
 to-0.150
cone-0.147
round-0.146
ipop-0.137
Token to
Token ID284
Feature activation-1.40e+01

Pos logit contributions
Looks+0.036
have+0.036
 shin+0.034
 prett+0.034
 funn+0.032
Neg logit contributions
 Bob-0.009
comb-0.009
cone-0.008
round-0.008
stack-0.008
Token take
Token ID1011
Feature activation+1.01e+01

Pos logit contributions
 and+0.294
 to+0.292
,+0.255
 Bob+0.244
 but+0.243
Neg logit contributions
Looks-0.485
have-0.427
icious-0.414
 funn-0.407
 shin-0.403
Token a
Token ID257
Feature activation+7.26e+00

Pos logit contributions
Looks+0.279
have+0.251
 shin+0.233
 funn+0.229
V+0.225
Neg logit contributions
 and-0.252
 to-0.248
round-0.234
cone-0.232
 but-0.226
Token trip
Token ID5296
Feature activation-2.84e+00

Pos logit contributions
Looks+0.208
have+0.188
 shin+0.181
 funn+0.179
V+0.173
Neg logit contributions
 and-0.166
 to-0.156
 Bob-0.154
round-0.149
 Lee-0.147
Token with
Token ID351
Feature activation+1.80e+00

Pos logit contributions
ipop+0.035
cone+0.035
 bull+0.031
stack+0.030
 Bob+0.029
Neg logit contributions
Looks-0.101
have-0.100
 shin-0.095
 funn-0.092
gr-0.091
Token his
Token ID465
Feature activation+5.04e+00

Pos logit contributions
Looks+0.067
have+0.060
icious+0.058
 shin+0.057
 funn+0.057
Neg logit contributions
 and-0.032
 to-0.031
,-0.027
 Bob-0.026
 but-0.025
Token mom
Token ID1995
Feature activation-7.86e+00

Pos logit contributions
Looks+0.143
have+0.129
 shin+0.123
 funn+0.122
icious+0.117
Neg logit contributions
 and-0.117
 to-0.116
round-0.113
cone-0.107
driver-0.105
Token up
Token ID510
Feature activation+4.66e-01

Pos logit contributions
Looks+0.019
have+0.018
 shin+0.018
 prett+0.017
V+0.017
Neg logit contributions
comb-0.010
cone-0.010
ipop-0.010
 knife-0.010
storm-0.009
Token and
Token ID290
Feature activation-1.49e+01

Pos logit contributions
Looks+0.014
 shin+0.013
V+0.013
 craz+0.012
 lo+0.012
Neg logit contributions
cone-0.008
ipop-0.008
round-0.008
driver-0.008
affe-0.008
Token down
Token ID866
Feature activation-9.28e-01

Pos logit contributions
 and+0.236
 to+0.225
round+0.206
cone+0.205
driver+0.200
Neg logit contributions
Looks-0.540
have-0.491
 shin-0.478
 funn-0.472
V-0.463
Token hills
Token ID18639
Feature activation-5.96e+00

Pos logit contributions
 and+0.013
 to+0.013
cone+0.013
round+0.013
ipop+0.012
Neg logit contributions
Looks-0.034
have-0.031
 shin-0.030
 funn-0.029
V-0.029
Token,
Token ID11
Feature activation-1.09e+01

Pos logit contributions
ipop+0.100
cone+0.092
 barrel+0.083
round+0.082
 grill+0.082
Neg logit contributions
Looks-0.191
 shin-0.186
have-0.172
 funn-0.171
 prett-0.168
Token over
Token ID625
Feature activation-2.78e+00

Pos logit contributions
ipop+0.214
round+0.198
cone+0.191
driver+0.191
 to+0.189
Neg logit contributions
Looks-0.356
 shin-0.320
have-0.317
 funn-0.311
 prett-0.311
Token bumps
Token ID34029
Feature activation+2.93e+00

Pos logit contributions
ipop+0.062
round+0.061
cone+0.060
 spine+0.055
driver+0.055
Neg logit contributions
Looks-0.082
 shin-0.071
have-0.071
 funn-0.068
V-0.068
Token and
Token ID290
Feature activation-1.47e+01

Pos logit contributions
Looks+0.109
 shin+0.107
have+0.103
 sounds+0.099
 funn+0.099
Neg logit contributions
cone-0.032
ipop-0.031
round-0.030
 barrel-0.026
 grill-0.025
Token around
Token ID1088
Feature activation-5.72e+00

Pos logit contributions
 and+0.352
 to+0.345
round+0.320
,+0.313
cone+0.310
Neg logit contributions
Looks-0.432
have-0.377
 shin-0.364
 funn-0.361
V-0.346
Token corners
Token ID14371
Feature activation+8.05e-01

Pos logit contributions
 and+0.117
 to+0.117
round+0.115
cone+0.111
ipop+0.110
Neg logit contributions
Looks-0.178
have-0.158
 shin-0.152
 funn-0.150
V-0.146
Token.
Token ID13
Feature activation+1.92e+00

Pos logit contributions
Looks+0.023
have+0.020
 shin+0.020
 funn+0.019
icious+0.018
Neg logit contributions
cone-0.018
round-0.018
 to-0.017
ipop-0.017
 and-0.017
TokenBut
Token ID1537
Feature activation-3.04e+00

Pos logit contributions
Looks+0.022
have+0.020
 shin+0.020
icious+0.020
 funn+0.019
Neg logit contributions
 and-0.022
 to-0.022
 but-0.021
round-0.020
 Bob-0.020
Token I
Token ID314
Feature activation+1.70e+00

Pos logit contributions
round+0.069
 to+0.068
 and+0.068
cone+0.065
 spine+0.062
Neg logit contributions
Looks-0.087
have-0.078
 shin-0.075
 funn-0.074
icious-0.072
Token want
Token ID765
Feature activation+6.31e+00

Pos logit contributions
Looks+0.047
have+0.042
 shin+0.038
 funn+0.038
icious+0.038
Neg logit contributions
 and-0.043
 to-0.043
round-0.040
cone-0.039
 Bob-0.039
Token to
Token ID284
Feature activation-1.47e+01

Pos logit contributions
Looks+0.252
have+0.237
 funn+0.235
 shin+0.233
V+0.227
Neg logit contributions
round-0.065
 spine-0.058
 and-0.057
cone-0.056
ipop-0.055
Token know
Token ID760
Feature activation+5.44e+00

Pos logit contributions
 and+0.305
 to+0.296
round+0.280
cone+0.273
ipop+0.269
Neg logit contributions
Looks-0.451
have-0.414
 shin-0.392
 funn-0.391
icious-0.388
Token what
Token ID644
Feature activation+1.61e-01

Pos logit contributions
Looks+0.168
have+0.150
 shin+0.141
 funn+0.140
icious+0.139
Neg logit contributions
 and-0.124
 to-0.122
,-0.109
round-0.108
 Bob-0.107
Token you
Token ID345
Feature activation+4.60e-01

Pos logit contributions
Looks+0.005
have+0.004
icious+0.004
 shin+0.004
 funn+0.004
Neg logit contributions
 and-0.004
 to-0.004
,-0.004
 Bob-0.004
 but-0.004
Token write
Token ID3551
Feature activation+2.52e+00

Pos logit contributions
Looks+0.016
have+0.014
 shin+0.014
icious+0.013
 funn+0.013
Neg logit contributions
 and-0.010
 to-0.009
,-0.009
 but-0.008
 because-0.008
Token,"
Token ID553
Feature activation-3.31e-01

Pos logit contributions
Looks+0.074
have+0.069
 funn+0.064
 shin+0.064
icious+0.063
Neg logit contributions
 and-0.053
 to-0.053
round-0.051
cone-0.050
 spine-0.049
Token Lily
Token ID20037
Feature activation+5.14e+00

Pos logit contributions
 and+0.006
 to+0.006
,+0.005
 but+0.005
 Bob+0.005
Neg logit contributions
Looks-0.013
icious-0.012
have-0.011
 fullest-0.011
 shin-0.011
Token said
Token ID531
Feature activation+8.61e+00

Pos logit contributions
Looks+0.132
have+0.118
 funn+0.110
 shin+0.110
V+0.105
Neg logit contributions
 to-0.134
 and-0.132
round-0.130
cone-0.128
 Bob-0.123
Token to
Token ID284
Feature activation-1.38e+01

Pos logit contributions
 and+0.026
 to+0.025
round+0.024
cone+0.024
ipop+0.023
Neg logit contributions
Looks-0.042
have-0.038
 shin-0.035
 funn-0.035
icious-0.034
Token start
Token ID923
Feature activation+5.23e+00

Pos logit contributions
 and+0.336
 to+0.332
round+0.303
,+0.298
cone+0.297
Neg logit contributions
Looks-0.399
have-0.353
 shin-0.331
 funn-0.328
icious-0.327
Token to
Token ID284
Feature activation-1.38e+01

Pos logit contributions
Looks+0.216
have+0.200
 shin+0.193
 funn+0.190
icious+0.186
Neg logit contributions
 and-0.058
 to-0.055
round-0.052
cone-0.049
ipop-0.046
Token eat
Token ID4483
Feature activation+4.53e+00

Pos logit contributions
 and+0.305
 to+0.298
,+0.270
 but+0.256
 Bob+0.255
Neg logit contributions
Looks-0.465
have-0.406
icious-0.393
V-0.392
 funn-0.385
Token the
Token ID262
Feature activation+6.33e+00

Pos logit contributions
Looks+0.140
have+0.126
 shin+0.121
 funn+0.119
icious+0.116
Neg logit contributions
 to-0.094
 and-0.094
round-0.090
cone-0.087
ipop-0.085
Token snow
Token ID6729
Feature activation-6.03e+00

Pos logit contributions
Looks+0.209
have+0.181
V+0.180
icious+0.175
gr+0.172
Neg logit contributions
 and-0.148
 to-0.143
,-0.130
 but-0.126
 Bob-0.121
Token and
Token ID290
Feature activation-1.43e+01

Pos logit contributions
round+0.141
ipop+0.138
 to+0.135
cone+0.135
 inspection+0.134
Neg logit contributions
Looks-0.141
 shin-0.131
 funn-0.128
have-0.126
 sounds-0.125
Token begin
Token ID2221
Feature activation+6.15e+00

Pos logit contributions
round+0.319
 and+0.313
driver+0.307
cone+0.306
ipop+0.305
Neg logit contributions
Looks-0.415
have-0.371
 shin-0.360
 funn-0.355
 fullest-0.343
Token licking
Token ID43390
Feature activation+2.01e+00

Pos logit contributions
Looks+0.271
have+0.251
 shin+0.243
 funn+0.240
icious+0.237
Neg logit contributions
 and-0.053
 to-0.048
round-0.044
cone-0.039
,-0.036
Token the
Token ID262
Feature activation+6.36e+00

Pos logit contributions
Looks+0.058
have+0.051
 shin+0.051
 funn+0.050
icious+0.047
Neg logit contributions
round-0.044
 to-0.043
 and-0.043
cone-0.043
ipop-0.041
Token lake
Token ID13546
Feature activation-5.49e+00

Pos logit contributions
Looks+0.210
have+0.183
V+0.178
icious+0.175
gr+0.170
Neg logit contributions
 and-0.149
 to-0.138
,-0.125
 but-0.121
 Bob-0.119
Token
Token ID198
Feature activation+5.21e+00

Pos logit contributions
Looks+0.088
have+0.078
icious+0.074
 funn+0.073
 shin+0.073
Neg logit contributions
 to-0.041
 and-0.040
round-0.039
ipop-0.038
cone-0.037
Token
Token ID198
Feature activation+5.21e+00

Pos logit contributions
Looks+0.186
 funn+0.167
 shin+0.167
have+0.163
 prett+0.162
Neg logit contributions
 and-0.116
 to-0.106
,-0.096
 but-0.091
 Bob-0.089
TokenSuddenly
Token ID38582
Feature activation+2.10e+00

Pos logit contributions
Looks+0.156
 funn+0.139
 shin+0.136
have+0.135
 faire+0.132
Neg logit contributions
 and-0.128
 to-0.117
 but-0.109
,-0.109
round-0.106
Token,
Token ID11
Feature activation-1.06e+01

Pos logit contributions
Looks+0.078
 shin+0.071
have+0.069
Everyone+0.068
 wonderful+0.067
Neg logit contributions
round-0.032
 spine-0.027
cone-0.027
keeping-0.024
ipop-0.023
Token the
Token ID262
Feature activation+6.02e+00

Pos logit contributions
round+0.222
 and+0.219
 to+0.215
cone+0.211
 spine+0.207
Neg logit contributions
Looks-0.333
have-0.284
 shin-0.279
 funn-0.276
V-0.261
Token man
Token ID582
Feature activation-6.59e+00

Pos logit contributions
Looks+0.259
have+0.236
icious+0.228
 shin+0.226
 funn+0.224
Neg logit contributions
 and-0.074
 to-0.070
,-0.056
 but-0.052
round-0.052
Token with
Token ID351
Feature activation+1.70e+00

Pos logit contributions
round+0.140
 to+0.139
 and+0.136
cone+0.134
ipop+0.131
Neg logit contributions
Looks-0.195
have-0.173
 shin-0.166
 funn-0.163
V-0.159
Token the
Token ID262
Feature activation+6.01e+00

Pos logit contributions
Looks+0.048
have+0.042
 shin+0.039
 funn+0.039
icious+0.038
Neg logit contributions
 and-0.043
 to-0.043
round-0.039
,-0.039
 but-0.038
Token dog
Token ID3290
Feature activation-7.11e+00

Pos logit contributions
Looks+0.242
have+0.215
icious+0.213
 shin+0.205
V+0.204
Neg logit contributions
 and-0.094
 to-0.091
,-0.075
 but-0.074
 Bob-0.068
Token walked
Token ID6807
Feature activation+5.38e+00

Pos logit contributions
round+0.157
ipop+0.143
cone+0.143
 to+0.141
 spine+0.141
Neg logit contributions
Looks-0.205
have-0.186
 shin-0.175
 funn-0.173
V-0.168
Token over
Token ID625
Feature activation-2.80e+00

Pos logit contributions
Looks+0.163
have+0.140
 funn+0.137
 shin+0.137
V+0.136
Neg logit contributions
ipop-0.117
cone-0.111
 spine-0.107
round-0.105
driver-0.105
Token give
Token ID1577
Feature activation+6.26e+00

Pos logit contributions
 and+0.021
 to+0.020
round+0.019
cone+0.018
,+0.018
Neg logit contributions
Looks-0.023
have-0.021
 shin-0.019
 funn-0.019
icious-0.019
Token him
Token ID683
Feature activation+2.44e-01

Pos logit contributions
Looks+0.223
have+0.202
 shin+0.192
 funn+0.191
icious+0.189
Neg logit contributions
 and-0.113
 to-0.110
,-0.096
round-0.095
 Bob-0.092
Token a
Token ID257
Feature activation+7.59e+00

Pos logit contributions
Looks+0.007
have+0.006
icious+0.006
 shin+0.006
V+0.006
Neg logit contributions
 and-0.006
 to-0.006
,-0.006
 Bob-0.005
 but-0.005
Token treat
Token ID2190
Feature activation-2.44e+00

Pos logit contributions
Looks+0.219
have+0.193
 shin+0.182
 funn+0.180
icious+0.179
Neg logit contributions
 and-0.188
 to-0.185
,-0.167
round-0.166
 Bob-0.164
Token and
Token ID290
Feature activation-1.44e+01

Pos logit contributions
 inspection+0.049
 spine+0.047
cone+0.044
round+0.044
driver+0.040
Neg logit contributions
Looks-0.061
 funn-0.061
 shin-0.060
have-0.059
gr-0.056
Token he
Token ID339
Feature activation-2.18e+00

Pos logit contributions
 and+0.332
 to+0.323
round+0.319
cone+0.303
,+0.295
Neg logit contributions
Looks-0.419
have-0.373
 shin-0.354
 funn-0.353
icious-0.329
Token would
Token ID561
Feature activation-2.72e+00

Pos logit contributions
 to+0.053
 and+0.052
round+0.051
 spine+0.050
cone+0.049
Neg logit contributions
Looks-0.059
have-0.053
 shin-0.050
 funn-0.050
V-0.047
Token be
Token ID307
Feature activation+7.25e+00

Pos logit contributions
 to+0.067
 and+0.066
round+0.063
cone+0.062
ipop+0.061
Neg logit contributions
Looks-0.074
have-0.065
 shin-0.062
 funn-0.061
icious-0.061
Token her
Token ID607
Feature activation+4.98e+00

Pos logit contributions
Looks+0.205
icious+0.174
V+0.169
its+0.162
have+0.161
Neg logit contributions
 and-0.211
 to-0.208
,-0.189
 but-0.177
 Bob-0.175
Token friend
Token ID1545
Feature activation-4.39e+00

Pos logit contributions
Looks+0.104
 funn+0.096
have+0.096
 shin+0.093
 prett+0.091
Neg logit contributions
round-0.143
cone-0.137
driver-0.136
 and-0.135
 to-0.133
Token.
Token ID13
Feature activation+2.15e+00

Pos logit contributions
round+0.088
cone+0.086
 spine+0.086
 to+0.085
driver+0.085
Neg logit contributions
Looks-0.128
have-0.119
 funn-0.114
 shin-0.112
icious-0.107
Token checked
Token ID10667
Feature activation+3.37e+00

Pos logit contributions
Looks+0.025
have+0.022
 shin+0.022
 funn+0.021
V+0.021
Neg logit contributions
 to-0.017
 and-0.017
round-0.017
cone-0.016
 Bob-0.015
Token all
Token ID477
Feature activation-1.47e+00

Pos logit contributions
Looks+0.087
 yourself+0.072
Tr+0.070
 funn+0.070
have+0.070
Neg logit contributions
round-0.087
storm-0.083
cone-0.077
 spine-0.070
 cone-0.068
Token the
Token ID262
Feature activation+6.17e+00

Pos logit contributions
cone+0.036
round+0.036
ipop+0.034
 spine+0.034
driver+0.033
Neg logit contributions
Looks-0.038
 shin-0.032
 funn-0.032
have-0.032
V-0.031
Token usual
Token ID6678
Feature activation+6.45e+00

Pos logit contributions
Looks+0.183
have+0.163
 shin+0.153
 funn+0.153
icious+0.148
Neg logit contributions
 and-0.146
 to-0.145
round-0.135
 Bob-0.131
,-0.130
Token places
Token ID4113
Feature activation-7.22e+00

Pos logit contributions
Looks+0.162
have+0.156
 shin+0.147
 sounds+0.144
V+0.141
Neg logit contributions
round-0.152
 Bob-0.144
 Frank-0.143
cone-0.143
ipop-0.141
Token,
Token ID11
Feature activation-1.07e+01

Pos logit contributions
round+0.139
cone+0.119
ule+0.118
 bull+0.116
ipop+0.114
Neg logit contributions
Looks-0.203
 shin-0.201
 better-0.190
 lo-0.189
 prett-0.188
Token but
Token ID475
Feature activation-7.47e+00

Pos logit contributions
round+0.198
ule+0.184
ipop+0.173
cone+0.165
uddy+0.163
Neg logit contributions
Looks-0.381
 shin-0.353
 prett-0.352
 funn-0.349
 craz-0.329
Token she
Token ID673
Feature activation-2.24e+00

Pos logit contributions
round+0.172
cone+0.152
 and+0.142
 spine+0.138
 to+0.134
Neg logit contributions
Looks-0.236
 shin-0.197
have-0.196
 funn-0.196
V-0.187
Token could
Token ID714
Feature activation-9.48e-01

Pos logit contributions
round+0.060
 to+0.056
cone+0.054
 and+0.054
 spine+0.053
Neg logit contributions
Looks-0.057
have-0.049
 shin-0.048
 funn-0.046
V-0.045
Token not
Token ID407
Feature activation+1.76e+00

Pos logit contributions
round+0.028
cone+0.027
ipop+0.026
 and+0.026
 barrel+0.025
Neg logit contributions
Looks-0.020
have-0.018
 shin-0.017
 funn-0.017
V-0.016
Token find
Token ID1064
Feature activation+5.25e+00

Pos logit contributions
Looks+0.043
have+0.039
 shin+0.037
 funn+0.036
icious+0.035
Neg logit contributions
round-0.046
cone-0.044
ipop-0.044
 to-0.043
 and-0.043
Token she
Token ID673
Feature activation-3.03e+00

Pos logit contributions
round+0.210
 and+0.196
cone+0.196
 to+0.193
ipop+0.186
Neg logit contributions
Looks-0.378
have-0.332
 funn-0.320
 shin-0.315
V-0.314
Token arrived
Token ID5284
Feature activation+2.50e+00

Pos logit contributions
round+0.079
cone+0.074
ipop+0.073
driver+0.072
 to+0.072
Neg logit contributions
Looks-0.079
have-0.071
 shin-0.069
 funn-0.065
V-0.062
Token at
Token ID379
Feature activation-5.40e+00

Pos logit contributions
Looks+0.085
have+0.077
 shin+0.075
 prett+0.074
 fullest+0.073
Neg logit contributions
round-0.045
cone-0.044
driver-0.040
ipop-0.038
storm-0.037
Token a
Token ID257
Feature activation+6.41e+00

Pos logit contributions
 and+0.119
 to+0.118
,+0.104
 Bob+0.102
 but+0.099
Neg logit contributions
Looks-0.178
have-0.162
icious-0.156
 shin-0.149
 funn-0.149
Token clearing
Token ID17304
Feature activation-1.85e-01

Pos logit contributions
Looks+0.176
have+0.154
 shin+0.142
 funn+0.142
icious+0.142
Neg logit contributions
 and-0.170
 to-0.168
,-0.153
round-0.150
 but-0.149
Token,
Token ID11
Feature activation-1.13e+01

Pos logit contributions
round+0.004
ipop+0.003
cone+0.003
driver+0.003
 Lee+0.003
Neg logit contributions
Looks-0.006
have-0.005
 shin-0.005
End-0.005
 funn-0.005
Token and
Token ID290
Feature activation-1.56e+01

Pos logit contributions
round+0.221
ipop+0.213
cone+0.208
 to+0.205
driver+0.204
Neg logit contributions
Looks-0.382
have-0.333
 shin-0.324
 funn-0.318
 prett-0.307
Token there
Token ID612
Feature activation-2.33e+00

Pos logit contributions
 and+0.313
 to+0.310
round+0.305
cone+0.296
ipop+0.283
Neg logit contributions
Looks-0.508
have-0.450
 funn-0.427
 shin-0.426
V-0.412
Token she
Token ID673
Feature activation-3.25e+00

Pos logit contributions
 and+0.058
 to+0.057
,+0.052
 Bob+0.050
 but+0.049
Neg logit contributions
Looks-0.070
have-0.062
icious-0.058
 shin-0.058
 funn-0.057
Token saw
Token ID2497
Feature activation+7.22e+00

Pos logit contributions
 and+0.068
 to+0.067
,+0.059
round+0.058
 Bob+0.058
Neg logit contributions
Looks-0.107
have-0.096
 shin-0.091
 funn-0.090
icious-0.090
Token a
Token ID257
Feature activation+6.21e+00

Pos logit contributions
Looks+0.224
have+0.203
icious+0.200
 shin+0.183
V+0.183
Neg logit contributions
 and-0.181
 to-0.178
,-0.162
 Bob-0.157
 but-0.149
Token.
Token ID13
Feature activation+1.79e+00

Pos logit contributions
 sounds+0.143
Looks+0.136
have+0.133
 funn+0.130
gr+0.130
Neg logit contributions
cone-0.122
ipop-0.120
round-0.112
stack-0.108
 spine-0.104
Token They
Token ID1119
Feature activation+4.67e-02

Pos logit contributions
Looks+0.044
V+0.037
have+0.036
 funn+0.036
 shin+0.035
Neg logit contributions
cone-0.049
ipop-0.047
driver-0.047
round-0.046
 to-0.045
Token stick
Token ID4859
Feature activation-5.13e+00

Pos logit contributions
Looks+0.001
have+0.001
 shin+0.001
 funn+0.001
icious+0.001
Neg logit contributions
 to-0.001
 and-0.001
round-0.001
,-0.001
 Bob-0.001
Token the
Token ID262
Feature activation+5.42e+00

Pos logit contributions
round+0.124
cone+0.121
 spine+0.119
 and+0.114
ipop+0.114
Neg logit contributions
Looks-0.134
 shin-0.122
have-0.119
 funn-0.115
V-0.114
Token stamps
Token ID25560
Feature activation-7.02e+00

Pos logit contributions
Looks+0.200
have+0.179
icious+0.170
 shin+0.169
 funn+0.169
Neg logit contributions
 and-0.097
 to-0.095
,-0.081
 but-0.077
round-0.077
Token and
Token ID290
Feature activation-1.50e+01

Pos logit contributions
round+0.116
cone+0.112
 Bob+0.104
ipop+0.100
 Lee+0.099
Neg logit contributions
Looks-0.226
 shin-0.221
 prett-0.214
have-0.214
 funn-0.212
Token stickers
Token ID28568
Feature activation-2.31e+00

Pos logit contributions
round+0.382
 to+0.362
 and+0.357
cone+0.354
ipop+0.354
Neg logit contributions
Looks-0.384
 shin-0.348
have-0.337
 funn-0.332
 prett-0.324
Token on
Token ID319
Feature activation-7.06e+00

Pos logit contributions
cone+0.033
round+0.032
 spine+0.031
 Bob+0.030
 barrel+0.029
Neg logit contributions
Looks-0.077
 shin-0.076
have-0.073
 prett-0.070
 funn-0.070
Token the
Token ID262
Feature activation+5.37e+00

Pos logit contributions
round+0.170
 to+0.168
cone+0.164
 and+0.164
ipop+0.163
Neg logit contributions
Looks-0.190
 shin-0.164
have-0.163
V-0.158
 funn-0.155
Token envelope
Token ID22878
Feature activation-6.72e+00

Pos logit contributions
Looks+0.171
have+0.155
 shin+0.149
 funn+0.148
icious+0.144
Neg logit contributions
 and-0.109
 to-0.109
 Bob-0.100
round-0.099
,-0.096
Token carefully
Token ID7773
Feature activation-2.01e+00

Pos logit contributions
cone+0.162
round+0.160
 Frank+0.158
 Bob+0.157
 bull+0.152
Neg logit contributions
 shin-0.152
Looks-0.151
 funn-0.142
 prett-0.141
have-0.140
Token Lisa
Token ID15378
Feature activation+1.13e+00

Pos logit contributions
 and+0.142
 to+0.140
round+0.128
cone+0.122
,+0.119
Neg logit contributions
Looks-0.291
have-0.261
 shin-0.250
 funn-0.249
icious-0.245
Token continued
Token ID3767
Feature activation-1.60e+00

Pos logit contributions
Looks+0.026
have+0.025
 shin+0.023
 funn+0.021
Und+0.021
Neg logit contributions
round-0.031
 to-0.030
cone-0.030
driver-0.029
 spine-0.029
Token on
Token ID319
Feature activation-7.10e+00

Pos logit contributions
round+0.021
cone+0.019
ipop+0.019
driver+0.016
 Bob+0.016
Neg logit contributions
Looks-0.062
have-0.058
 shin-0.055
 funn-0.054
 fullest-0.053
Token,
Token ID11
Feature activation-1.10e+01

Pos logit contributions
round+0.185
ipop+0.172
cone+0.165
 and+0.165
driver+0.162
Neg logit contributions
Looks-0.196
have-0.163
 shin-0.163
 funn-0.157
 fullest-0.157
Token determined
Token ID5295
Feature activation-7.49e-01

Pos logit contributions
round+0.207
ipop+0.205
cone+0.189
driver+0.189
 cone+0.163
Neg logit contributions
Looks-0.377
have-0.333
 shin-0.323
 funn-0.320
 prett-0.316
Token to
Token ID284
Feature activation-1.46e+01

Pos logit contributions
 and+0.001
round+0.001
 to+0.001
cone+0.000
ipop-0.000
Neg logit contributions
Looks-0.038
have-0.035
 shin-0.034
 funn-0.034
icious-0.033
Token find
Token ID1064
Feature activation+4.63e+00

Pos logit contributions
 and+0.332
 to+0.327
,+0.292
round+0.290
 Bob+0.286
Neg logit contributions
Looks-0.456
have-0.406
 shin-0.383
 funn-0.380
icious-0.378
Token the
Token ID262
Feature activation+5.39e+00

Pos logit contributions
Looks+0.150
have+0.133
icious+0.128
 shin+0.123
V+0.123
Neg logit contributions
 and-0.107
 to-0.106
,-0.094
 but-0.091
 Bob-0.090
Token source
Token ID2723
Feature activation+5.15e+00

Pos logit contributions
Looks+0.158
have+0.129
icious+0.126
V+0.120
gr+0.118
Neg logit contributions
 and-0.155
 to-0.151
 but-0.138
,-0.136
 Bob-0.124
Token of
Token ID286
Feature activation-5.80e+00

Pos logit contributions
Looks+0.202
have+0.183
icious+0.175
 funn+0.174
 shin+0.174
Neg logit contributions
 and-0.082
 to-0.079
,-0.068
 but-0.063
 Bob-0.061
Token the
Token ID262
Feature activation+5.37e+00

Pos logit contributions
 and+0.124
 to+0.120
,+0.108
 Bob+0.104
 but+0.104
Neg logit contributions
Looks-0.196
have-0.175
icious-0.168
V-0.164
 funn-0.163
Token,
Token ID11
Feature activation-1.02e+01

Pos logit contributions
Looks+0.066
have+0.057
 shin+0.057
 wonderful+0.056
Everyone+0.055
Neg logit contributions
round-0.028
cone-0.024
 spine-0.024
ule-0.020
driver-0.019
Token they
Token ID484
Feature activation-2.65e+00

Pos logit contributions
 spine+0.259
round+0.249
driver+0.248
cone+0.234
ule+0.234
Neg logit contributions
Looks-0.281
 funn-0.220
 shin-0.219
Everyone-0.218
have-0.217
Token cut
Token ID2005
Feature activation+3.76e+00

Pos logit contributions
 to+0.059
 and+0.058
cone+0.058
round+0.058
 spine+0.057
Neg logit contributions
Looks-0.078
have-0.068
 funn-0.065
 shin-0.065
V-0.063
Token it
Token ID340
Feature activation-1.88e+00

Pos logit contributions
Looks+0.129
have+0.116
 shin+0.110
 funn+0.109
icious+0.109
Neg logit contributions
 and-0.074
 to-0.072
,-0.063
round-0.062
 Bob-0.062
Token open
Token ID1280
Feature activation-7.16e-01

Pos logit contributions
round+0.043
ipop+0.040
 Lee+0.040
cone+0.040
ule+0.039
Neg logit contributions
Looks-0.051
 shin-0.050
 funn-0.048
 prett-0.048
have-0.046
Token and
Token ID290
Feature activation-1.45e+01

Pos logit contributions
round+0.010
ipop+0.010
cone+0.009
ule+0.009
 spine+0.008
Neg logit contributions
 shin-0.026
Looks-0.025
 prett-0.024
 funn-0.024
 fullest-0.023
Token enjoyed
Token ID8359
Feature activation+6.80e+00

Pos logit contributions
 and+0.338
 to+0.334
,+0.299
round+0.297
 Bob+0.292
Neg logit contributions
Looks-0.442
have-0.392
 shin-0.370
 funn-0.367
icious-0.365
Token a
Token ID257
Feature activation+7.54e+00

Pos logit contributions
Looks+0.232
have+0.209
icious+0.196
 funn+0.195
 shin+0.195
Neg logit contributions
 and-0.139
 to-0.137
,-0.121
 Bob-0.117
 but-0.115
Token delicious
Token ID12625
Feature activation+3.79e+00

Pos logit contributions
Looks+0.172
have+0.145
 shin+0.139
 funn+0.137
icious+0.136
Neg logit contributions
 and-0.225
 to-0.225
 Bob-0.209
round-0.208
 but-0.203
Token snack
Token ID26906
Feature activation-3.80e+00

Pos logit contributions
 lovely+0.092
 wonderful+0.092
 shin+0.091
 beautiful+0.089
 sounds+0.089
Neg logit contributions
driver-0.094
 Lee-0.087
stack-0.080
round-0.077
uddy-0.076
Token together
Token ID1978
Feature activation-1.47e-01

Pos logit contributions
 inspection+0.100
 spine+0.096
cone+0.094
round+0.093
ipop+0.093
Neg logit contributions
Looks-0.075
 funn-0.073
 shin-0.072
icious-0.070
 prett-0.069