TOP ACTIVATIONS
MAX = 4.389

. The dog was acting very funny , running around in
<|endoftext|> Once there was a j olly bull walking in the
pillar loved to move around very quickly . One day ,
, there was an old j agu ar that lived in
texture - just perfect for sn ugg ling . The baby
forest , there lived a j olly little squirrel named Bob
His little heart was beating very fast and he felt very
time , there was a j olly boy named Tim my
. He had a pet ant named Andy . Andy lived
tried so hard and worked very hard . But no matter
time , there was a j olly little boy named Tim
time , there was a j olly little girl who lived
very small but could crawl very fast . She loved to
time , there was a j olly bear named Benny .
time , there was a j olly turkey . The turkey
a time there was a j olly little girl called Lucy
time , there was a j olly old man named Jack
time , there was a j olly boy named Jack .
upon a time , a j olly angel flew down from
a time there was a j olly monster . He liked

MIN ACTIVATIONS
MIN = -4.160

hungry and wanted to eat the cake .
and Ben want to try the cream . They
him and called out . " What do you have in
some popcorn . Lily saw the same clown eating a hot
you want to have a closer look ?" Tim
joke . " Why did the chicken cross the road ?"
The friend without the mustache was so excited to
" Please let me go " said Jim .
. Sam wanted to eat the ice cream cone . He
The man took the boy to a special pizza
sandwich . Tim my loved the sandwich so much that he
Mom and Dad say , " Tom , slow down !
bird flew by and thought the mustache was a nest .
Jack and Jill , wanted the flag too . So they
" Yes , we liked the clown . He was very
" What is that ?" Sam asks .
very angry and he wanted the steak for himself . He
. Lily laughed and thought the clown was so funny .
and went to talk to the boy . The boy said
nice hat . He wanted the hat for himself and picked

INTERVAL 2.252 - 4.389
CONTAINS 0.515%

her doll . She was very worried and sad .
, there was a little boy named Tim my . Tim
, there was a little boy named Tim my . Tim
her best and feel proud of herself . Chloe thought about
The moral of the story is that being kind and sharing

INTERVAL 0.115 - 2.252
CONTAINS 45.026%

was so proud of himself for being successful with his plan
rabbit ran away . The wolf had improved ! It was
none was more special than her last request .
that even when we stumble , we have friends to help
always safe . From that day on , she made sure

INTERVAL -2.022 - 0.115
CONTAINS 53.319%

cross the road because there were too many big cars driving
Sally both laughed and laughed . From that day
of you ." Ben shook his head . He
. Later that day , Tim my 's mom
and said , " That 's a nice car . Can

INTERVAL -4.160 - -2.022
CONTAINS 1.140%

Look , a dog !" Tim said . " Can we
he squ eal ed , " I did it ! I
thinking independently !" <|endoftext|> Once upon a time , there was
rolling maze . <|endoftext|> Once upon a time , there was
Jimmy and said , " Let your shadow go on a
Token.
Token ID13
Feature activation+4.65e-01

Pos logit contributions
igs+3.060
oppy+3.055
 hedge+2.689
ind+2.671
 gut+2.650
Neg logit contributions
 written-2.304
ax-2.130
 funn-2.103
 Adam-2.088
 vanilla-1.989
Token The
Token ID383
Feature activation-6.30e-01

Pos logit contributions
oppy+0.789
igs+0.781
ind+0.728
 web+0.727
 hedge+0.726
Neg logit contributions
 Adam-0.641
ax-0.633
 funn-0.626
 written-0.618
andra-0.616
Token dog
Token ID3290
Feature activation+1.24e+00

Pos logit contributions
ax+0.872
 Adam+0.862
 funn+0.862
 written+0.852
andra+0.846
Neg logit contributions
oppy-1.067
 hedge-1.027
igs-1.022
 web-0.995
ind-0.946
Token was
Token ID373
Feature activation+3.97e-01

Pos logit contributions
igs+2.657
oppy+2.636
 hedge+2.426
ind+2.414
 web+2.406
Neg logit contributions
 funn-1.197
 written-1.195
 Adam-1.124
ax-1.101
andra-1.076
Token acting
Token ID7205
Feature activation+6.69e-01

Pos logit contributions
oppy+0.652
igs+0.635
 hedge+0.598
 web+0.581
ind+0.562
Neg logit contributions
 funn-0.575
 written-0.565
ax-0.559
 Adam-0.556
 vanilla-0.532
Token very
Token ID845
Feature activation+4.39e+00

Pos logit contributions
oppy+1.278
igs+1.239
 hedge+1.189
ind+1.160
 web+1.147
Neg logit contributions
 funn-0.812
 written-0.801
ax-0.774
 Adam-0.757
 flavor-0.710
Token funny
Token ID8258
Feature activation+5.52e-01

Pos logit contributions
 web+10.410
 cellar+10.276
 hedge+10.213
igs+9.989
oppy+9.858
Neg logit contributions
 funn-4.545
ax-3.873
 flavor-3.670
 Tom-3.603
 vanilla-3.393
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
oppy+0.966
igs+0.925
 hedge+0.887
ind+0.857
 web+0.856
Neg logit contributions
 funn-0.734
 written-0.733
ax-0.723
 Adam-0.722
 flavor-0.678
Token running
Token ID2491
Feature activation+1.28e+00

Pos logit contributions
andra+1.340
ax+1.337
 Adam+1.319
 funn+1.294
 written+1.251
Neg logit contributions
oppy-2.297
 hedge-2.278
igs-2.271
 web-2.252
 webs-2.141
Token around
Token ID1088
Feature activation+1.70e+00

Pos logit contributions
oppy+2.367
igs+2.222
ind+2.022
 hedge+2.011
 web+1.988
Neg logit contributions
 funn-1.716
 written-1.675
ax-1.630
 Adam-1.610
 nineteen-1.497
Token in
Token ID287
Feature activation+8.57e-01

Pos logit contributions
oppy+3.744
igs+3.609
ind+3.355
 hedge+3.198
edient+3.088
Neg logit contributions
 Adam-1.686
 written-1.654
 funn-1.620
ax-1.532
 nineteen-1.447
Token<|endoftext|>
Token ID50256
Feature activation-6.05e-02

Pos logit contributions
oppy+1.363
igs+1.349
 web+1.274
 hedge+1.256
ind+1.234
Neg logit contributions
 Adam-0.993
ax-0.969
 funn-0.957
 written-0.941
andra-0.933
TokenOnce
Token ID7454
Feature activation+5.05e-01

Pos logit contributions
ax+0.072
 Adam+0.071
 funn+0.071
 written+0.070
andra+0.069
Neg logit contributions
oppy-0.112
igs-0.109
 hedge-0.106
 web-0.104
ind-0.101
Token there
Token ID612
Feature activation-5.06e-01

Pos logit contributions
igs+1.028
oppy+1.027
 hedge+1.010
 web+1.000
ind+0.970
Neg logit contributions
ax-0.484
 Adam-0.482
 written-0.458
 funn-0.454
andra-0.451
Token was
Token ID373
Feature activation-5.78e-01

Pos logit contributions
 funn+0.686
ax+0.680
 written+0.670
 Adam+0.669
andra+0.660
Neg logit contributions
oppy-0.866
igs-0.826
 hedge-0.811
 web-0.789
ind-0.756
Token a
Token ID257
Feature activation+8.46e-01

Pos logit contributions
 funn+0.560
ax+0.557
 Adam+0.548
 written+0.543
andra+0.538
Neg logit contributions
oppy-1.211
igs-1.175
 hedge-1.148
 web-1.131
ind-1.094
Token j
Token ID474
Feature activation+4.11e+00

Pos logit contributions
igs+1.664
oppy+1.648
 hedge+1.567
 web+1.555
 webs+1.538
Neg logit contributions
 funn-0.886
ax-0.878
 Adam-0.861
andra-0.837
 vanilla-0.825
Tokenolly
Token ID5098
Feature activation+8.36e-01

Pos logit contributions
 webs+7.174
 web+6.994
edient+6.556
igs+6.531
 gut+6.371
Neg logit contributions
ax-6.992
oph-5.887
 flavor-5.630
cake-5.346
 Adam-5.329
Token bull
Token ID6473
Feature activation+1.49e+00

Pos logit contributions
igs+1.575
oppy+1.571
 hedge+1.466
 web+1.448
 webs+1.427
Neg logit contributions
ax-0.953
 Adam-0.943
 funn-0.923
 written-0.914
andra-0.910
Token walking
Token ID6155
Feature activation+1.02e+00

Pos logit contributions
igs+2.394
oppy+2.357
 hedge+2.113
 web+2.057
ind+2.014
Neg logit contributions
 written-2.280
 Adam-2.207
 funn-2.179
ax-2.157
 flavor-2.124
Token in
Token ID287
Feature activation+6.67e-01

Pos logit contributions
oppy+1.807
igs+1.739
 web+1.622
 hedge+1.597
ind+1.546
Neg logit contributions
 funn-1.357
 Adam-1.341
ax-1.326
 written-1.323
 flavor-1.270
Token the
Token ID262
Feature activation+6.06e-01

Pos logit contributions
oppy+1.249
igs+1.192
 hedge+1.106
 web+1.091
ind+1.088
Neg logit contributions
 Adam-0.839
 funn-0.823
 written-0.807
ax-0.796
andra-0.764
Tokenpillar
Token ID41643
Feature activation+2.59e+00

Pos logit contributions
oppy+3.394
igs+3.353
 gut+3.098
ind+3.079
 hedge+3.064
Neg logit contributions
ax-3.276
 written-3.076
 funn-3.020
oph-2.845
andra-2.790
Token loved
Token ID6151
Feature activation+1.22e+00

Pos logit contributions
igs+5.852
oppy+5.729
ind+5.515
 hedge+5.406
ands+5.380
Neg logit contributions
 written-2.378
 funn-2.236
 flavor-2.187
 Adam-2.127
ax-2.018
Token to
Token ID284
Feature activation+1.58e+00

Pos logit contributions
oppy+2.745
igs+2.596
ind+2.485
 hedge+2.462
edient+2.453
Neg logit contributions
 written-1.069
 funn-1.062
ax-0.992
 flavor-0.979
 Adam-0.979
Token move
Token ID1445
Feature activation+2.43e+00

Pos logit contributions
oppy+3.061
ind+2.961
igs+2.961
edient+2.755
 hedge+2.723
Neg logit contributions
 funn-1.797
 flavor-1.793
ax-1.692
 written-1.676
 taste-1.642
Token around
Token ID1088
Feature activation+2.39e+00

Pos logit contributions
oppy+5.221
igs+4.752
ind+4.500
 hedge+4.355
edient+4.341
Neg logit contributions
 funn-2.924
 written-2.819
 Adam-2.581
ax-2.504
 flavor-2.495
Token very
Token ID845
Feature activation+3.98e+00

Pos logit contributions
oppy+5.911
igs+5.430
ind+5.189
edient+5.115
 hedge+4.924
Neg logit contributions
 written-1.876
 Adam-1.840
 funn-1.735
ax-1.600
 flavor-1.544
Token quickly
Token ID2952
Feature activation+1.68e+00

Pos logit contributions
oppy+7.784
igs+7.047
 hedge+6.973
ind+6.944
edient+6.874
Neg logit contributions
 Adam-4.913
 ding-4.699
ax-4.564
 funn-4.446
 flavor-4.369
Token.
Token ID13
Feature activation-2.11e-02

Pos logit contributions
oppy+3.218
igs+2.951
ind+2.813
 hedge+2.801
 web+2.687
Neg logit contributions
 written-2.131
 Adam-1.999
 funn-1.993
ax-1.931
 nineteen-1.907
Token One
Token ID1881
Feature activation+5.94e-01

Pos logit contributions
 funn+0.032
ax+0.032
 Adam+0.032
 written+0.031
andra+0.031
Neg logit contributions
oppy-0.033
igs-0.032
 hedge-0.030
 web-0.030
ind-0.029
Token day
Token ID1110
Feature activation+5.28e-01

Pos logit contributions
igs+0.963
oppy+0.959
 hedge+0.887
 web+0.872
ind+0.864
Neg logit contributions
 funn-0.854
ax-0.840
 written-0.824
 Adam-0.823
 flavor-0.812
Token,
Token ID11
Feature activation-2.15e-01

Pos logit contributions
oppy+0.826
igs+0.799
 web+0.768
ind+0.762
 hedge+0.761
Neg logit contributions
 Adam-0.786
 written-0.779
ax-0.771
 funn-0.765
 flavor-0.747
Token,
Token ID11
Feature activation+1.70e-01

Pos logit contributions
oppy+0.064
igs+0.062
 hedge+0.060
 web+0.059
ind+0.057
Neg logit contributions
 Adam-0.071
ax-0.071
 funn-0.070
 written-0.070
andra-0.068
Token there
Token ID612
Feature activation-1.25e-01

Pos logit contributions
oppy+0.341
igs+0.335
 hedge+0.323
 web+0.319
ind+0.312
Neg logit contributions
 Adam-0.174
 funn-0.174
ax-0.172
 written-0.170
andra-0.163
Token was
Token ID373
Feature activation-1.11e+00

Pos logit contributions
 funn+0.156
ax+0.156
 Adam+0.155
 written+0.154
andra+0.151
Neg logit contributions
oppy-0.225
igs-0.219
 hedge-0.212
 web-0.208
ind-0.202
Token an
Token ID281
Feature activation+1.18e+00

Pos logit contributions
ax+1.162
 funn+1.144
andra+1.122
 Adam+1.105
 written+1.100
Neg logit contributions
oppy-2.229
 hedge-2.145
igs-2.141
 web-2.094
ind-2.008
Token old
Token ID1468
Feature activation+1.10e+00

Pos logit contributions
oppy+2.111
igs+2.002
 web+1.874
 hedge+1.832
 webs+1.796
Neg logit contributions
ax-1.640
 funn-1.618
 Adam-1.539
 written-1.528
andra-1.495
Token j
Token ID474
Feature activation+3.97e+00

Pos logit contributions
oppy+2.438
igs+2.399
 web+2.245
 webs+2.245
ind+2.207
Neg logit contributions
 Adam-0.899
ax-0.894
 written-0.885
 funn-0.867
andra-0.864
Tokenagu
Token ID11433
Feature activation+1.86e+00

Pos logit contributions
 web+6.650
 webs+6.536
igs+6.341
edient+6.292
 gut+6.016
Neg logit contributions
ax-6.993
oph-5.965
 Adam-5.425
andra-5.381
cake-5.298
Tokenar
Token ID283
Feature activation+5.48e-01

Pos logit contributions
igs+2.760
oppy+2.659
 hedge+2.590
 web+2.558
 webs+2.549
Neg logit contributions
ax-2.995
 Adam-2.944
 written-2.938
andra-2.854
 Sandra-2.757
Token that
Token ID326
Feature activation+1.96e+00

Pos logit contributions
oppy+0.830
igs+0.801
 hedge+0.776
 web+0.758
 webs+0.710
Neg logit contributions
ax-0.849
 written-0.842
 Adam-0.832
 funn-0.821
andra-0.817
Token lived
Token ID5615
Feature activation+1.64e+00

Pos logit contributions
oppy+3.842
igs+3.837
 web+3.527
ind+3.420
 webs+3.382
Neg logit contributions
 funn-2.506
 written-2.465
 Adam-2.424
ax-2.279
 flavor-2.227
Token in
Token ID287
Feature activation+5.37e-01

Pos logit contributions
oppy+2.991
igs+2.939
 web+2.747
 hedge+2.736
 webs+2.674
Neg logit contributions
 written-2.135
 funn-2.112
ax-2.085
 Adam-2.015
 flavor-1.873
Token texture
Token ID11743
Feature activation+7.91e-01

Pos logit contributions
oppy+1.804
igs+1.757
 hedge+1.645
 web+1.597
ind+1.589
Neg logit contributions
 funn-1.145
 written-1.135
ax-1.132
 flavor-1.122
 Adam-1.117
Token -
Token ID532
Feature activation+7.82e-01

Pos logit contributions
oppy+1.307
igs+1.267
 hedge+1.179
ind+1.157
 web+1.129
Neg logit contributions
 written-1.137
 funn-1.122
ax-1.102
 flavor-1.084
 Adam-1.072
Token just
Token ID655
Feature activation+1.66e-03

Pos logit contributions
oppy+1.588
igs+1.506
 hedge+1.427
ind+1.425
 web+1.417
Neg logit contributions
 funn-0.851
 written-0.839
 vanilla-0.819
 Adam-0.818
ax-0.810
Token perfect
Token ID2818
Feature activation+1.53e+00

Pos logit contributions
oppy+0.003
igs+0.003
 hedge+0.003
 web+0.003
ind+0.003
Neg logit contributions
 funn-0.002
 written-0.002
ax-0.002
 Adam-0.002
andra-0.002
Token for
Token ID329
Feature activation+2.21e+00

Pos logit contributions
oppy+2.801
igs+2.624
ind+2.496
 hedge+2.490
edient+2.375
Neg logit contributions
 written-1.983
 funn-1.959
 flavor-1.919
 vanilla-1.908
 Birthday-1.840
Token sn
Token ID3013
Feature activation+3.94e+00

Pos logit contributions
oppy+4.360
igs+4.042
 hedge+3.992
 web+3.952
ind+3.890
Neg logit contributions
 vanilla-2.547
 flavor-2.524
 Adam-2.519
 written-2.493
 funn-2.434
Tokenugg
Token ID6837
Feature activation+1.01e+00

Pos logit contributions
 hedge+6.771
 web+6.744
oppy+6.519
 swamp+6.402
 webs+6.256
Neg logit contributions
ax-6.313
acket-5.526
 vanilla-5.262
 topping-5.032
 written-5.008
Tokenling
Token ID1359
Feature activation+1.65e+00

Pos logit contributions
oppy+1.742
igs+1.649
 hedge+1.636
 web+1.602
 swamp+1.492
Neg logit contributions
ax-1.365
 funn-1.349
 Adam-1.337
 written-1.336
andra-1.317
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
oppy+3.166
igs+2.939
 hedge+2.769
ind+2.678
 web+2.664
Neg logit contributions
 funn-2.082
 written-2.040
 Adam-1.948
ax-1.934
 vanilla-1.898
Token The
Token ID383
Feature activation+4.44e-01

Pos logit contributions
oppy+0.309
igs+0.298
 hedge+0.284
 web+0.282
ind+0.274
Neg logit contributions
 Adam-0.239
ax-0.237
 funn-0.235
 written-0.231
andra-0.229
Token baby
Token ID5156
Feature activation+1.07e+00

Pos logit contributions
oppy+0.770
igs+0.745
 hedge+0.695
 web+0.692
 webs+0.660
Neg logit contributions
 funn-0.608
ax-0.589
 written-0.582
 Adam-0.579
andra-0.576
Token forest
Token ID8222
Feature activation+1.95e+00

Pos logit contributions
oppy+2.597
igs+2.567
edient+2.302
 webs+2.278
 web+2.273
Neg logit contributions
 flavor-2.133
 Adam-2.131
 funn-2.106
andra-2.101
ax-2.084
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
oppy+3.554
igs+3.428
ind+3.220
 web+3.210
 hedge+3.172
Neg logit contributions
 written-2.433
 Adam-2.410
 flavor-2.403
 Sandra-2.315
andra-2.258
Token there
Token ID612
Feature activation+5.97e-02

Pos logit contributions
oppy+3.090
igs+3.040
 web+2.903
ind+2.888
 hedge+2.794
Neg logit contributions
 Adam-1.104
 funn-1.081
 written-1.054
ax-0.987
 Sandra-0.984
Token lived
Token ID5615
Feature activation+9.38e-01

Pos logit contributions
oppy+0.081
igs+0.078
 hedge+0.074
 web+0.072
ind+0.070
Neg logit contributions
 funn-0.101
 Adam-0.101
ax-0.101
 written-0.100
andra-0.098
Token a
Token ID257
Feature activation+1.80e+00

Pos logit contributions
oppy+2.279
igs+2.258
 hedge+2.149
 web+2.126
ind+2.123
Neg logit contributions
 funn-0.604
 written-0.600
 Adam-0.582
ax-0.567
andra-0.486
Token j
Token ID474
Feature activation+3.88e+00

Pos logit contributions
igs+3.733
oppy+3.583
 webs+3.341
 web+3.339
ands+3.295
Neg logit contributions
 funn-1.875
 Adam-1.740
ax-1.682
 flavor-1.678
andra-1.658
Tokenolly
Token ID5098
Feature activation+9.85e-01

Pos logit contributions
 webs+5.989
 web+5.945
igs+5.943
 gut+5.790
edient+5.718
Neg logit contributions
ax-6.644
oph-5.693
cake-5.388
 flavor-5.245
andra-5.245
Token little
Token ID1310
Feature activation+2.00e+00

Pos logit contributions
igs+1.764
oppy+1.762
 hedge+1.607
 web+1.593
ind+1.584
Neg logit contributions
 funn-1.187
ax-1.187
 Adam-1.187
 written-1.164
andra-1.148
Token squirrel
Token ID33039
Feature activation+2.60e+00

Pos logit contributions
igs+4.482
oppy+4.363
edient+4.093
 web+4.087
 poorly+4.075
Neg logit contributions
 funn-1.761
ax-1.562
 flavor-1.548
 son-1.536
 vanilla-1.491
Token named
Token ID3706
Feature activation-9.37e-02

Pos logit contributions
igs+4.550
oppy+4.478
 gut+4.238
 hedge+4.150
 web+4.051
Neg logit contributions
 written-3.662
ax-3.389
 flavor-3.305
 funn-3.182
 Adam-3.159
Token Bob
Token ID5811
Feature activation+7.30e-01

Pos logit contributions
ax+0.132
 funn+0.132
 Adam+0.131
 written+0.129
andra+0.127
Neg logit contributions
oppy-0.154
igs-0.149
 hedge-0.144
 web-0.141
ind-0.136
Token His
Token ID2399
Feature activation+7.63e-01

Pos logit contributions
oppy+1.606
igs+1.574
 hedge+1.464
ind+1.463
 web+1.448
Neg logit contributions
 Adam-1.196
 funn-1.189
 written-1.150
ax-1.144
andra-1.107
Token little
Token ID1310
Feature activation+8.67e-01

Pos logit contributions
oppy+1.268
igs+1.255
 hedge+1.200
ind+1.159
 web+1.153
Neg logit contributions
 funn-1.043
 written-0.991
 flavor-0.979
ax-0.978
 Adam-0.976
Token heart
Token ID2612
Feature activation+8.51e-01

Pos logit contributions
oppy+1.494
igs+1.488
ind+1.398
 hedge+1.395
 web+1.370
Neg logit contributions
 funn-1.138
ax-1.057
 written-1.039
 flavor-1.038
 Adam-1.024
Token was
Token ID373
Feature activation+5.36e-01

Pos logit contributions
oppy+1.663
igs+1.618
 hedge+1.577
ind+1.532
 web+1.513
Neg logit contributions
 written-0.933
 funn-0.912
andra-0.889
ax-0.883
 Adam-0.861
Token beating
Token ID11226
Feature activation+2.04e+00

Pos logit contributions
oppy+0.953
igs+0.909
 hedge+0.886
 web+0.845
ind+0.834
Neg logit contributions
 funn-0.712
 written-0.710
ax-0.684
 Adam-0.675
andra-0.657
Token very
Token ID845
Feature activation+3.84e+00

Pos logit contributions
oppy+3.758
igs+3.428
 hedge+3.262
 web+3.213
 swamp+3.108
Neg logit contributions
 written-2.856
 funn-2.840
ax-2.673
 Adam-2.570
 vanilla-2.519
Token fast
Token ID3049
Feature activation+1.14e+00

Pos logit contributions
 hedge+6.818
oppy+6.371
 web+6.145
 swamp+6.136
 cellar+6.063
Neg logit contributions
 funn-5.342
ax-5.230
 ding-5.215
 flavor-5.042
 vanilla-4.990
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
oppy+2.339
igs+2.248
 hedge+2.139
ind+2.088
 web+2.056
Neg logit contributions
 written-1.220
 funn-1.178
 Adam-1.164
ax-1.150
 vanilla-1.038
Token he
Token ID339
Feature activation+1.87e+00

Pos logit contributions
oppy+2.078
igs+1.993
 hedge+1.923
ind+1.903
 web+1.878
Neg logit contributions
 funn-1.159
 written-1.124
 Adam-1.102
ax-1.044
 vanilla-1.032
Token felt
Token ID2936
Feature activation+1.77e+00

Pos logit contributions
igs+3.968
oppy+3.945
 hedge+3.817
ind+3.676
 web+3.549
Neg logit contributions
 funn-1.850
 written-1.786
 flavor-1.622
ax-1.596
andra-1.595
Token very
Token ID845
Feature activation+2.72e+00

Pos logit contributions
oppy+3.384
igs+3.278
 hedge+3.127
ind+3.106
 web+2.981
Neg logit contributions
 funn-2.328
 written-2.196
ax-2.089
 vanilla-2.030
 Adam-1.970
Token time
Token ID640
Feature activation+3.61e-02

Pos logit contributions
 funn+0.469
ax+0.466
 Adam+0.461
 written+0.459
andra+0.446
Neg logit contributions
oppy-0.859
igs-0.830
 hedge-0.808
 web-0.797
ind-0.772
Token,
Token ID11
Feature activation+5.07e-01

Pos logit contributions
oppy+0.052
igs+0.050
 hedge+0.048
 web+0.048
ind+0.046
Neg logit contributions
ax-0.058
 Adam-0.057
 funn-0.057
 written-0.056
andra-0.055
Token there
Token ID612
Feature activation-1.20e-01

Pos logit contributions
oppy+1.028
igs+1.016
 hedge+0.980
 web+0.967
ind+0.945
Neg logit contributions
 Adam-0.499
 funn-0.499
ax-0.492
 written-0.491
andra-0.460
Token was
Token ID373
Feature activation-5.27e-01

Pos logit contributions
 funn+0.150
ax+0.150
 Adam+0.149
 written+0.147
andra+0.145
Neg logit contributions
oppy-0.216
igs-0.210
 hedge-0.203
 web-0.200
ind-0.194
Token a
Token ID257
Feature activation+9.30e-01

Pos logit contributions
ax+0.489
 funn+0.489
 Adam+0.478
 written+0.473
andra+0.470
Neg logit contributions
oppy-1.122
igs-1.089
 hedge-1.069
 web-1.051
ind-1.017
Token j
Token ID474
Feature activation+3.84e+00

Pos logit contributions
igs+1.855
oppy+1.835
 web+1.729
 hedge+1.717
 webs+1.702
Neg logit contributions
 funn-0.975
ax-0.936
 Adam-0.934
andra-0.896
 written-0.895
Tokenolly
Token ID5098
Feature activation+1.07e+00

Pos logit contributions
 webs+6.329
 web+6.288
igs+5.946
 hedge+5.706
edient+5.666
Neg logit contributions
ax-6.787
oph-5.628
 flavor-5.336
 funn-5.303
 Adam-5.192
Token boy
Token ID2933
Feature activation+2.52e+00

Pos logit contributions
igs+2.050
oppy+2.016
 hedge+1.888
 web+1.863
 webs+1.844
Neg logit contributions
ax-1.190
 Adam-1.180
 funn-1.158
 written-1.139
andra-1.132
Token named
Token ID3706
Feature activation-2.10e-01

Pos logit contributions
igs+3.081
oppy+2.938
 hedge+2.714
 webs+2.663
 web+2.591
Neg logit contributions
 written-4.843
 flavor-4.769
 Adam-4.742
ax-4.648
 Birthday-4.578
Token Tim
Token ID5045
Feature activation-4.44e-01

Pos logit contributions
 funn+0.242
ax+0.242
 Adam+0.240
 written+0.236
andra+0.232
Neg logit contributions
oppy-0.404
igs-0.392
 hedge-0.383
 web-0.376
ind-0.363
Tokenmy
Token ID1820
Feature activation+8.92e-01

Pos logit contributions
 funn+0.404
ax+0.398
 Adam+0.395
 written+0.389
andra+0.381
Neg logit contributions
oppy-0.958
igs-0.927
 hedge-0.910
 web-0.894
ind-0.873
Token.
Token ID13
Feature activation-4.80e-01

Pos logit contributions
oppy+3.943
igs+3.732
 hedge+3.474
 Bubble+3.396
 web+3.390
Neg logit contributions
 written-2.360
 Adam-2.270
 flavor-2.226
 funn-2.219
 topping-2.148
Token He
Token ID679
Feature activation+1.79e+00

Pos logit contributions
 funn+0.664
ax+0.660
 Adam+0.653
 written+0.651
andra+0.644
Neg logit contributions
oppy-0.804
igs-0.778
 hedge-0.762
 web-0.739
ind-0.706
Token had
Token ID550
Feature activation-1.10e+00

Pos logit contributions
igs+3.501
oppy+3.447
 hedge+3.220
 web+3.154
ind+3.048
Neg logit contributions
 written-2.199
 Adam-2.144
 funn-2.096
 flavor-2.012
andra-1.993
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
ax+1.266
andra+1.222
 Adam+1.191
 funn+1.189
 written+1.154
Neg logit contributions
oppy-2.104
igs-2.058
 hedge-2.054
 web-2.019
 webs-1.934
Token pet
Token ID4273
Feature activation+1.42e+00

Pos logit contributions
igs+1.776
oppy+1.750
 web+1.593
 hedge+1.590
ind+1.585
Neg logit contributions
 funn-1.390
 Adam-1.299
 flavor-1.296
 written-1.293
 vanilla-1.272
Token ant
Token ID1885
Feature activation+3.84e+00

Pos logit contributions
igs+2.847
oppy+2.791
 web+2.587
ind+2.553
edient+2.511
Neg logit contributions
 written-1.552
ax-1.549
 funn-1.539
 Adam-1.532
 vanilla-1.485
Token named
Token ID3706
Feature activation+4.35e-01

Pos logit contributions
igs+6.920
oppy+6.892
ands+6.615
ind+6.260
iona+6.258
Neg logit contributions
 flavor-5.355
 written-5.240
 funn-5.041
 vanilla-4.988
 chocolate-4.764
Token Andy
Token ID12382
Feature activation+2.04e+00

Pos logit contributions
oppy+0.785
igs+0.771
 hedge+0.732
 web+0.725
ind+0.707
Neg logit contributions
 Adam-0.548
ax-0.542
 funn-0.538
 written-0.528
andra-0.514
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
igs+3.725
oppy+3.700
 web+3.460
 hedge+3.419
 webs+3.293
Neg logit contributions
 Adam-2.669
 written-2.636
ax-2.581
 flavor-2.549
 vanilla-2.546
Token Andy
Token ID12382
Feature activation+2.27e+00

Pos logit contributions
oppy+0.248
igs+0.238
 hedge+0.229
 web+0.227
ind+0.223
Neg logit contributions
 Adam-0.155
 funn-0.154
ax-0.153
 written-0.150
andra-0.147
Token lived
Token ID5615
Feature activation+1.50e+00

Pos logit contributions
igs+4.660
oppy+4.626
 web+4.369
 hedge+4.346
 gut+4.121
Neg logit contributions
 written-2.592
 Adam-2.527
 funn-2.391
 flavor-2.370
andra-2.332
Token tried
Token ID3088
Feature activation+2.40e-01

Pos logit contributions
oppy+2.372
igs+2.312
 hedge+2.216
 web+2.146
ind+2.138
Neg logit contributions
 written-1.269
 funn-1.223
andra-1.199
 flavor-1.155
ax-1.153
Token so
Token ID523
Feature activation+3.33e+00

Pos logit contributions
oppy+0.486
igs+0.468
 hedge+0.455
 web+0.445
ind+0.438
Neg logit contributions
 funn-0.251
 written-0.251
ax-0.247
 Adam-0.245
andra-0.235
Token hard
Token ID1327
Feature activation+2.97e-01

Pos logit contributions
 hedge+5.347
ind+5.332
hog+5.318
 swamp+5.297
edient+5.262
Neg logit contributions
ax-4.828
 written-4.826
 flavor-4.754
 Sandra-4.657
 Adam-4.621
Token and
Token ID290
Feature activation+7.98e-01

Pos logit contributions
oppy+0.511
igs+0.484
 hedge+0.478
 web+0.466
ind+0.456
Neg logit contributions
 written-0.397
ax-0.395
 Adam-0.391
 funn-0.390
andra-0.375
Token worked
Token ID3111
Feature activation+1.19e+00

Pos logit contributions
oppy+1.485
igs+1.419
 hedge+1.373
ind+1.363
 web+1.332
Neg logit contributions
 funn-0.996
 written-0.989
ax-0.954
 Adam-0.948
andra-0.922
Token very
Token ID845
Feature activation+3.83e+00

Pos logit contributions
oppy+1.910
 hedge+1.774
igs+1.726
 web+1.681
ind+1.676
Neg logit contributions
 written-1.871
 funn-1.828
 Adam-1.783
ax-1.762
 flavor-1.687
Token hard
Token ID1327
Feature activation+8.70e-01

Pos logit contributions
 hedge+6.690
 swamp+6.206
 web+6.175
oppy+6.151
hog+6.118
Neg logit contributions
 funn-5.506
ax-5.467
 written-5.453
 flavor-5.355
 Adam-5.323
Token.
Token ID13
Feature activation+2.12e-01

Pos logit contributions
oppy+1.546
igs+1.444
 hedge+1.444
 web+1.393
ind+1.374
Neg logit contributions
 written-1.160
 Adam-1.121
ax-1.117
 funn-1.110
andra-1.041
Token But
Token ID887
Feature activation+1.71e+00

Pos logit contributions
oppy+0.346
igs+0.337
 hedge+0.318
 web+0.315
ind+0.311
Neg logit contributions
 Adam-0.305
ax-0.303
 funn-0.301
 written-0.300
andra-0.294
Token no
Token ID645
Feature activation-5.44e-01

Pos logit contributions
oppy+3.380
igs+3.283
ind+3.229
 web+3.098
 hedge+3.060
Neg logit contributions
 written-1.963
 Adam-1.941
 funn-1.857
 vanilla-1.857
 Sandra-1.822
Token matter
Token ID2300
Feature activation+1.32e+00

Pos logit contributions
ax+0.436
 funn+0.430
 Adam+0.428
 written+0.416
andra+0.409
Neg logit contributions
oppy-1.233
igs-1.203
 hedge-1.182
 web-1.164
ind-1.128
Token time
Token ID640
Feature activation-8.53e-02

Pos logit contributions
 funn+0.226
ax+0.225
 Adam+0.223
 written+0.220
andra+0.216
Neg logit contributions
oppy-0.436
igs-0.423
 hedge-0.414
 web-0.405
ind-0.393
Token,
Token ID11
Feature activation-6.81e-01

Pos logit contributions
ax+0.138
 Adam+0.138
 funn+0.137
 written+0.136
andra+0.134
Neg logit contributions
oppy-0.121
igs-0.117
 hedge-0.112
 web-0.110
ind-0.106
Token there
Token ID612
Feature activation-7.21e-01

Pos logit contributions
 funn+0.851
ax+0.843
 Adam+0.827
 written+0.823
andra+0.817
Neg logit contributions
oppy-1.250
igs-1.201
 hedge-1.174
 web-1.148
ind-1.114
Token was
Token ID373
Feature activation-1.27e+00

Pos logit contributions
 funn+1.009
ax+1.002
 written+0.979
 Adam+0.977
andra+0.964
Neg logit contributions
oppy-1.218
igs-1.154
 hedge-1.141
 web-1.111
ind-1.058
Token a
Token ID257
Feature activation+8.95e-01

Pos logit contributions
ax+1.343
 funn+1.342
andra+1.300
 Adam+1.283
 written+1.275
Neg logit contributions
oppy-2.556
igs-2.452
 hedge-2.423
 web-2.405
 webs-2.301
Token j
Token ID474
Feature activation+3.80e+00

Pos logit contributions
igs+1.783
oppy+1.760
 web+1.664
 hedge+1.656
 webs+1.638
Neg logit contributions
 funn-0.934
 Adam-0.914
ax-0.910
andra-0.876
 written-0.864
Tokenolly
Token ID5098
Feature activation+8.02e-01

Pos logit contributions
 webs+6.117
 web+5.974
igs+5.794
edient+5.760
 gut+5.643
Neg logit contributions
ax-6.568
oph-5.626
 flavor-5.393
 Adam-5.239
cake-5.179
Token little
Token ID1310
Feature activation+1.45e+00

Pos logit contributions
igs+1.512
oppy+1.498
 hedge+1.405
 web+1.382
 webs+1.358
Neg logit contributions
ax-0.913
 Adam-0.913
 funn-0.899
 written-0.882
andra-0.877
Token boy
Token ID2933
Feature activation+2.09e+00

Pos logit contributions
igs+3.410
oppy+3.339
 web+3.198
 webs+3.143
edient+3.099
Neg logit contributions
 funn-1.088
ax-1.034
 vanilla-0.952
 flavor-0.948
 Adam-0.942
Token named
Token ID3706
Feature activation-4.80e-02

Pos logit contributions
igs+2.424
oppy+2.298
 hedge+2.176
 web+2.053
 webs+2.035
Neg logit contributions
 written-4.179
 Adam-4.083
 flavor-4.064
ax-3.978
 Birthday-3.964
Token Tim
Token ID5045
Feature activation-5.66e-01

Pos logit contributions
ax+0.055
 Adam+0.054
 funn+0.054
 written+0.053
andra+0.052
Neg logit contributions
oppy-0.093
igs-0.090
 hedge-0.088
 web-0.087
ind-0.084
Token time
Token ID640
Feature activation-4.35e-01

Pos logit contributions
 funn+1.064
ax+1.057
 written+1.055
 Adam+1.036
andra+1.003
Neg logit contributions
oppy-1.754
igs-1.676
 web-1.624
 hedge-1.608
ind-1.552
Token,
Token ID11
Feature activation-3.03e-01

Pos logit contributions
 funn+0.740
ax+0.733
 Adam+0.725
 written+0.725
andra+0.718
Neg logit contributions
oppy-0.593
igs-0.563
 hedge-0.546
 web-0.528
ind-0.501
Token there
Token ID612
Feature activation-3.76e-01

Pos logit contributions
 funn+0.354
ax+0.353
 Adam+0.349
 written+0.346
andra+0.340
Neg logit contributions
oppy-0.571
igs-0.553
 hedge-0.540
 web-0.529
ind-0.512
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
 funn+0.494
ax+0.492
 Adam+0.485
 written+0.482
andra+0.475
Neg logit contributions
oppy-0.656
igs-0.632
 hedge-0.618
 web-0.603
ind-0.581
Token a
Token ID257
Feature activation+8.30e-01

Pos logit contributions
ax+1.092
 funn+1.086
andra+1.058
 Adam+1.048
 written+1.039
Neg logit contributions
oppy-2.163
igs-2.085
 hedge-2.064
 web-2.036
ind-1.951
Token j
Token ID474
Feature activation+3.78e+00

Pos logit contributions
igs+1.653
oppy+1.638
 web+1.540
 hedge+1.536
 webs+1.513
Neg logit contributions
 funn-0.866
ax-0.844
 Adam-0.841
andra-0.814
 written-0.803
Tokenolly
Token ID5098
Feature activation+7.71e-01

Pos logit contributions
 webs+6.128
 web+6.087
igs+5.838
edient+5.650
 gut+5.611
Neg logit contributions
ax-6.630
oph-5.602
 Adam-5.134
 funn-5.133
cake-5.130
Token little
Token ID1310
Feature activation+1.41e+00

Pos logit contributions
igs+1.455
oppy+1.446
 hedge+1.355
 web+1.334
 webs+1.307
Neg logit contributions
ax-0.882
 Adam-0.876
 funn-0.861
 written-0.847
andra-0.844
Token girl
Token ID2576
Feature activation+2.43e+00

Pos logit contributions
igs+3.325
oppy+3.283
 web+3.125
 webs+3.063
edient+3.025
Neg logit contributions
 funn-1.056
ax-0.986
 vanilla-0.924
 flavor-0.903
 Adam-0.887
Token who
Token ID508
Feature activation+2.53e+00

Pos logit contributions
igs+3.078
oppy+2.856
 hedge+2.439
 webs+2.363
 gut+2.310
Neg logit contributions
 written-4.851
 Birthday-4.665
 Adam-4.639
 Sandra-4.538
andra-4.525
Token lived
Token ID5615
Feature activation+1.60e+00

Pos logit contributions
igs+5.078
oppy+4.602
aro+4.321
 hedge+4.309
 web+4.286
Neg logit contributions
 written-3.580
 funn-3.486
 flavor-3.294
 Adam-3.218
 come-3.166
Token very
Token ID845
Feature activation+1.89e+00

Pos logit contributions
 funn+0.065
ax+0.064
 Adam+0.064
 written+0.063
andra+0.062
Neg logit contributions
oppy-0.061
igs-0.059
 hedge-0.056
 web-0.055
ind-0.053
Token small
Token ID1402
Feature activation+9.32e-01

Pos logit contributions
igs+3.432
 hedge+3.370
 web+3.341
oppy+3.300
 webs+3.217
Neg logit contributions
 funn-2.425
ax-2.367
 vanilla-2.342
 Adam-2.317
 written-2.222
Token but
Token ID475
Feature activation+1.50e+00

Pos logit contributions
oppy+1.859
igs+1.824
 hedge+1.690
 web+1.657
ind+1.636
Neg logit contributions
 funn-1.029
 written-1.024
 Adam-0.993
ax-0.987
 flavor-0.938
Token could
Token ID714
Feature activation-2.23e-01

Pos logit contributions
oppy+2.810
igs+2.767
 web+2.557
ind+2.513
 hedge+2.505
Neg logit contributions
 written-1.992
 Adam-1.991
 funn-1.972
 vanilla-1.954
 flavor-1.936
Token crawl
Token ID27318
Feature activation+2.29e+00

Pos logit contributions
 funn+0.271
ax+0.269
 Adam+0.267
 written+0.265
andra+0.260
Neg logit contributions
oppy-0.412
igs-0.399
 hedge-0.389
 web-0.381
ind-0.368
Token very
Token ID845
Feature activation+3.77e+00

Pos logit contributions
oppy+5.216
igs+4.654
ind+4.565
edient+4.458
 web+4.358
Neg logit contributions
 written-2.334
 funn-2.286
ax-2.104
 Adam-1.991
 come-1.930
Token fast
Token ID3049
Feature activation+1.86e+00

Pos logit contributions
oppy+6.691
 hedge+6.539
 web+6.295
igs+6.184
 cellar+6.081
Neg logit contributions
 funn-4.951
ax-4.771
 Adam-4.680
 flavor-4.664
 ding-4.536
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
oppy+3.562
igs+3.251
 hedge+3.082
ind+3.006
 web+2.994
Neg logit contributions
 written-2.440
 funn-2.362
 Adam-2.317
ax-2.238
 flavor-2.064
Token She
Token ID1375
Feature activation+2.05e+00

Pos logit contributions
oppy+0.146
igs+0.141
 hedge+0.133
 web+0.131
ind+0.127
Neg logit contributions
 Adam-0.165
 funn-0.164
ax-0.164
 written-0.162
andra-0.159
Token loved
Token ID6151
Feature activation+7.31e-01

Pos logit contributions
oppy+4.349
igs+4.251
ind+4.066
 hedge+4.027
edient+3.952
Neg logit contributions
 funn-2.060
 written-2.055
 Adam-1.934
 flavor-1.880
ax-1.848
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
oppy+1.573
igs+1.475
 hedge+1.421
 web+1.401
ind+1.392
Neg logit contributions
 funn-0.708
 written-0.702
 Adam-0.688
ax-0.670
 vanilla-0.644
Token time
Token ID640
Feature activation-2.04e-01

Pos logit contributions
 funn+1.010
ax+1.004
 written+0.996
 Adam+0.989
andra+0.954
Neg logit contributions
oppy-1.665
igs-1.593
 web-1.542
 hedge-1.538
ind-1.484
Token,
Token ID11
Feature activation-3.48e-01

Pos logit contributions
 funn+0.335
ax+0.335
 Adam+0.333
 written+0.330
andra+0.326
Neg logit contributions
oppy-0.286
igs-0.274
 hedge-0.265
 web-0.258
ind-0.247
Token there
Token ID612
Feature activation-4.58e-01

Pos logit contributions
 funn+0.414
ax+0.413
 Adam+0.408
 written+0.404
andra+0.399
Neg logit contributions
oppy-0.651
igs-0.629
 hedge-0.615
 web-0.602
ind-0.582
Token was
Token ID373
Feature activation-1.13e+00

Pos logit contributions
 funn+0.612
ax+0.609
 Adam+0.599
 written+0.597
andra+0.589
Neg logit contributions
oppy-0.792
igs-0.759
 hedge-0.745
 web-0.727
ind-0.696
Token a
Token ID257
Feature activation+8.24e-01

Pos logit contributions
ax+1.189
 funn+1.171
andra+1.158
 Adam+1.138
 written+1.128
Neg logit contributions
oppy-2.284
igs-2.206
 hedge-2.183
 web-2.144
ind-2.059
Token j
Token ID474
Feature activation+3.74e+00

Pos logit contributions
igs+1.621
oppy+1.614
 web+1.511
 hedge+1.507
 webs+1.486
Neg logit contributions
 funn-0.883
 Adam-0.847
ax-0.846
 written-0.815
andra-0.814
Tokenolly
Token ID5098
Feature activation+1.01e+00

Pos logit contributions
 webs+5.943
 web+5.858
igs+5.682
edient+5.467
 gut+5.395
Neg logit contributions
ax-6.540
oph-5.471
 flavor-5.265
 funn-5.190
 Adam-5.164
Token bear
Token ID6842
Feature activation+1.94e+00

Pos logit contributions
igs+1.958
oppy+1.932
 hedge+1.804
 web+1.781
 webs+1.766
Neg logit contributions
ax-1.104
 Adam-1.097
 funn-1.082
 written-1.056
andra-1.050
Token named
Token ID3706
Feature activation+3.55e-01

Pos logit contributions
igs+2.291
oppy+2.179
 hedge+1.936
 webs+1.871
 web+1.858
Neg logit contributions
 written-3.785
 flavor-3.637
ax-3.629
 funn-3.611
 Birthday-3.603
Token Benny
Token ID44275
Feature activation+1.55e+00

Pos logit contributions
oppy+0.628
igs+0.617
 hedge+0.596
 web+0.593
ind+0.566
Neg logit contributions
ax-0.456
 Adam-0.455
 funn-0.447
 written-0.432
andra-0.425
Token.
Token ID13
Feature activation+2.04e+00

Pos logit contributions
igs+2.751
oppy+2.709
 hedge+2.562
 web+2.474
 webs+2.444
Neg logit contributions
 written-2.066
ax-1.995
 funn-1.979
 Adam-1.930
 flavor-1.901
Token time
Token ID640
Feature activation-7.10e-02

Pos logit contributions
 funn+0.546
ax+0.543
 Adam+0.535
 written+0.535
andra+0.519
Neg logit contributions
oppy-0.986
igs-0.949
 hedge-0.924
 web-0.915
ind-0.885
Token,
Token ID11
Feature activation-1.43e-01

Pos logit contributions
ax+0.115
 Adam+0.114
 funn+0.114
 written+0.113
andra+0.111
Neg logit contributions
oppy-0.101
igs-0.097
 hedge-0.094
 web-0.092
ind-0.088
Token there
Token ID612
Feature activation-2.97e-01

Pos logit contributions
 funn+0.166
ax+0.166
 Adam+0.165
 written+0.163
andra+0.159
Neg logit contributions
oppy-0.269
igs-0.261
 hedge-0.254
 web-0.249
ind-0.241
Token was
Token ID373
Feature activation-9.80e-01

Pos logit contributions
 funn+0.384
ax+0.382
 Adam+0.378
 written+0.376
andra+0.370
Neg logit contributions
oppy-0.523
igs-0.504
 hedge-0.492
 web-0.481
ind-0.464
Token a
Token ID257
Feature activation+8.47e-01

Pos logit contributions
ax+1.003
 funn+0.991
andra+0.969
 Adam+0.966
 written+0.955
Neg logit contributions
oppy-2.002
igs-1.933
 hedge-1.913
 web-1.881
ind-1.810
Token j
Token ID474
Feature activation+3.74e+00

Pos logit contributions
igs+1.686
oppy+1.671
 web+1.570
 hedge+1.562
 webs+1.545
Neg logit contributions
 funn-0.893
ax-0.859
 Adam-0.856
 written-0.822
andra-0.821
Tokenolly
Token ID5098
Feature activation+8.55e-01

Pos logit contributions
 webs+5.954
 web+5.888
igs+5.731
edient+5.582
 gut+5.405
Neg logit contributions
ax-6.617
oph-5.605
 flavor-5.252
 Adam-5.213
 funn-5.180
Token turkey
Token ID26876
Feature activation+2.18e+00

Pos logit contributions
igs+1.632
oppy+1.616
 hedge+1.514
 web+1.493
 webs+1.471
Neg logit contributions
ax-0.957
 Adam-0.950
 funn-0.940
 written-0.917
andra-0.911
Token.
Token ID13
Feature activation+9.19e-01

Pos logit contributions
igs+3.338
oppy+3.187
 hedge+2.788
 webs+2.733
 gut+2.694
Neg logit contributions
 written-3.678
 funn-3.489
 Birthday-3.458
 flavor-3.452
 Adam-3.361
Token The
Token ID383
Feature activation-1.53e+00

Pos logit contributions
igs+1.501
oppy+1.491
ind+1.384
 web+1.369
 hedge+1.368
Neg logit contributions
 Adam-1.371
ax-1.305
 written-1.295
 funn-1.282
 Sandra-1.278
Token turkey
Token ID26876
Feature activation+1.92e+00

Pos logit contributions
ax+2.229
 Adam+2.145
 written+2.124
 funn+2.115
andra+2.087
Neg logit contributions
oppy-2.525
 hedge-2.466
igs-2.397
 web-2.374
ind-2.295
Token a
Token ID257
Feature activation-1.12e-01

Pos logit contributions
ax+2.780
andra+2.773
 funn+2.670
 nineteen+2.635
 Adam+2.609
Neg logit contributions
oppy-3.801
 web-3.665
 hedge-3.609
igs-3.564
 webs-3.484
Token time
Token ID640
Feature activation+1.31e-01

Pos logit contributions
 funn+0.115
ax+0.115
 Adam+0.113
 written+0.111
andra+0.110
Neg logit contributions
oppy-0.226
igs-0.220
 hedge-0.216
 web-0.210
ind-0.204
Token there
Token ID612
Feature activation-6.65e-01

Pos logit contributions
oppy+0.189
igs+0.184
 hedge+0.177
 web+0.174
ind+0.168
Neg logit contributions
ax-0.207
 Adam-0.206
 funn-0.204
 written-0.203
andra-0.199
Token was
Token ID373
Feature activation-7.83e-01

Pos logit contributions
 funn+0.918
ax+0.910
 written+0.892
 Adam+0.891
andra+0.880
Neg logit contributions
oppy-1.130
igs-1.071
 hedge-1.062
 web-1.028
ind-0.983
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
ax+0.766
 funn+0.764
 Adam+0.742
 written+0.734
andra+0.730
Neg logit contributions
oppy-1.633
igs-1.581
 hedge-1.555
 web-1.532
ind-1.479
Token j
Token ID474
Feature activation+3.74e+00

Pos logit contributions
igs+2.513
oppy+2.452
 web+2.324
 webs+2.310
 hedge+2.283
Neg logit contributions
 funn-1.248
 Adam-1.195
ax-1.185
andra-1.158
 vanilla-1.130
Tokenolly
Token ID5098
Feature activation+7.33e-01

Pos logit contributions
 webs+6.065
 web+5.888
igs+5.844
edient+5.551
 gut+5.530
Neg logit contributions
ax-6.554
oph-5.675
cake-5.297
 flavor-5.236
andra-5.154
Token little
Token ID1310
Feature activation+1.58e+00

Pos logit contributions
igs+1.371
oppy+1.364
 hedge+1.271
 web+1.252
ind+1.236
Neg logit contributions
ax-0.846
 Adam-0.844
 funn-0.836
 written-0.822
andra-0.816
Token girl
Token ID2576
Feature activation+2.48e+00

Pos logit contributions
igs+3.795
oppy+3.713
 web+3.554
 webs+3.500
ind+3.452
Neg logit contributions
 funn-1.125
ax-1.049
 vanilla-0.963
 flavor-0.945
 Adam-0.927
Token called
Token ID1444
Feature activation+1.02e+00

Pos logit contributions
igs+3.195
oppy+2.962
 hedge+2.533
 webs+2.475
 gut+2.450
Neg logit contributions
 written-4.963
 Birthday-4.689
 Adam-4.653
 flavor-4.576
 Sandra-4.575
Token Lucy
Token ID22162
Feature activation+1.15e+00

Pos logit contributions
igs+1.852
oppy+1.804
ind+1.740
 web+1.710
 hedge+1.686
Neg logit contributions
 Adam-1.328
ax-1.266
 Sandra-1.247
 written-1.227
andra-1.211
Token time
Token ID640
Feature activation+6.27e-02

Pos logit contributions
 funn+0.712
ax+0.707
 written+0.696
 Adam+0.695
andra+0.673
Neg logit contributions
oppy-1.237
igs-1.188
 hedge-1.155
 web-1.148
ind-1.109
Token,
Token ID11
Feature activation+1.51e-02

Pos logit contributions
oppy+0.090
igs+0.088
 hedge+0.084
 web+0.083
ind+0.080
Neg logit contributions
 Adam-0.100
ax-0.100
 funn-0.098
 written-0.098
andra-0.096
Token there
Token ID612
Feature activation-4.59e-01

Pos logit contributions
oppy+0.030
igs+0.029
 hedge+0.028
 web+0.028
ind+0.027
Neg logit contributions
 funn-0.016
 Adam-0.016
ax-0.016
 written-0.016
andra-0.015
Token was
Token ID373
Feature activation-1.05e+00

Pos logit contributions
 funn+0.612
ax+0.609
 Adam+0.598
 written+0.597
andra+0.588
Neg logit contributions
oppy-0.793
igs-0.760
 hedge-0.747
 web-0.728
ind-0.698
Token a
Token ID257
Feature activation+9.87e-01

Pos logit contributions
ax+1.098
 funn+1.081
andra+1.051
 Adam+1.045
 written+1.033
Neg logit contributions
oppy-2.124
igs-2.050
 hedge-2.041
 web-1.994
ind-1.922
Token j
Token ID474
Feature activation+3.73e+00

Pos logit contributions
igs+1.994
oppy+1.972
 web+1.855
 webs+1.830
 hedge+1.829
Neg logit contributions
 funn-1.021
 Adam-0.978
ax-0.976
andra-0.940
 written-0.936
Tokenolly
Token ID5098
Feature activation+1.18e+00

Pos logit contributions
 webs+5.935
 web+5.920
igs+5.658
edient+5.368
 gut+5.296
Neg logit contributions
ax-6.645
oph-5.676
 funn-5.239
andra-5.233
 Adam-5.201
Token old
Token ID1468
Feature activation+1.10e+00

Pos logit contributions
igs+2.337
oppy+2.285
 hedge+2.121
 web+2.111
 webs+2.097
Neg logit contributions
ax-1.277
 Adam-1.267
 funn-1.243
 written-1.218
andra-1.215
Token man
Token ID582
Feature activation+1.25e+00

Pos logit contributions
oppy+2.310
igs+2.281
 web+2.124
 webs+2.124
ind+2.083
Neg logit contributions
ax-1.023
 funn-1.015
 Adam-1.004
 written-0.990
 flavor-0.978
Token named
Token ID3706
Feature activation-1.41e+00

Pos logit contributions
oppy+1.624
igs+1.612
 hedge+1.477
 web+1.433
 webs+1.384
Neg logit contributions
 written-2.299
 Adam-2.210
ax-2.205
 funn-2.173
 flavor-2.137
Token Jack
Token ID3619
Feature activation+1.25e+00

Pos logit contributions
 funn+1.939
 written+1.892
andra+1.849
ax+1.828
 Adam+1.755
Neg logit contributions
oppy-2.429
igs-2.321
 hedge-2.282
ind-2.165
 web-2.152
Token time
Token ID640
Feature activation-2.01e-02

Pos logit contributions
 funn+0.378
ax+0.375
 Adam+0.371
 written+0.369
andra+0.360
Neg logit contributions
oppy-0.706
igs-0.684
 hedge-0.667
 web-0.656
ind-0.636
Token,
Token ID11
Feature activation-6.73e-01

Pos logit contributions
ax+0.032
 Adam+0.032
 funn+0.032
 written+0.032
andra+0.031
Neg logit contributions
oppy-0.029
igs-0.028
 hedge-0.027
 web-0.026
ind-0.025
Token there
Token ID612
Feature activation-6.53e-01

Pos logit contributions
 funn+0.825
ax+0.819
 Adam+0.802
 written+0.798
andra+0.793
Neg logit contributions
oppy-1.250
igs-1.203
 hedge-1.179
 web-1.150
ind-1.117
Token was
Token ID373
Feature activation-1.33e+00

Pos logit contributions
 funn+0.905
ax+0.899
 Adam+0.879
 written+0.879
andra+0.867
Neg logit contributions
oppy-1.107
igs-1.052
 hedge-1.040
 web-1.012
ind-0.965
Token a
Token ID257
Feature activation+8.93e-01

Pos logit contributions
ax+1.423
 funn+1.420
andra+1.371
 Adam+1.354
 written+1.352
Neg logit contributions
oppy-2.647
igs-2.539
 hedge-2.520
 web-2.486
ind-2.381
Token j
Token ID474
Feature activation+3.73e+00

Pos logit contributions
igs+1.787
oppy+1.768
 web+1.666
 hedge+1.654
 webs+1.641
Neg logit contributions
 funn-0.930
 Adam-0.901
ax-0.898
andra-0.865
 written-0.857
Tokenolly
Token ID5098
Feature activation+7.44e-01

Pos logit contributions
 webs+5.945
 web+5.807
igs+5.671
edient+5.522
 gut+5.466
Neg logit contributions
ax-6.525
oph-5.616
 flavor-5.366
 Adam-5.217
cake-5.183
Token boy
Token ID2933
Feature activation+1.90e+00

Pos logit contributions
igs+1.397
oppy+1.392
 hedge+1.300
 web+1.282
 webs+1.257
Neg logit contributions
ax-0.849
 Adam-0.846
 funn-0.840
 written-0.821
andra-0.814
Token named
Token ID3706
Feature activation-2.26e-01

Pos logit contributions
igs+2.113
oppy+2.052
 hedge+1.882
 web+1.767
 webs+1.755
Neg logit contributions
 written-3.886
 Adam-3.776
 flavor-3.755
ax-3.727
 funn-3.683
Token Jack
Token ID3619
Feature activation+6.37e-01

Pos logit contributions
 funn+0.260
ax+0.260
 Adam+0.257
 written+0.254
andra+0.249
Neg logit contributions
oppy-0.435
igs-0.422
 hedge-0.411
 web-0.403
ind-0.390
Token.
Token ID13
Feature activation-1.26e-01

Pos logit contributions
oppy+1.004
igs+0.993
 hedge+0.949
 web+0.923
 webs+0.887
Neg logit contributions
 written-0.955
ax-0.950
 Adam-0.938
 funn-0.935
andra-0.917
Token upon
Token ID2402
Feature activation-2.34e+00

Pos logit contributions
ax+0.066
 Adam+0.066
 funn+0.065
 written+0.065
andra+0.064
Neg logit contributions
oppy-0.122
igs-0.119
 hedge-0.116
 web-0.115
ind-0.111
Token a
Token ID257
Feature activation-7.59e-01

Pos logit contributions
ax+3.233
andra+3.171
 funn+3.131
 nineteen+3.025
 Adam+2.951
Neg logit contributions
oppy-4.032
 web-3.885
 hedge-3.872
igs-3.761
 webs-3.651
Token time
Token ID640
Feature activation+6.54e-02

Pos logit contributions
 funn+0.873
ax+0.868
 written+0.850
 Adam+0.847
andra+0.824
Neg logit contributions
oppy-1.462
igs-1.398
 hedge-1.363
 web-1.356
ind-1.312
Token,
Token ID11
Feature activation-3.06e-01

Pos logit contributions
oppy+0.094
igs+0.092
 hedge+0.088
 web+0.087
ind+0.084
Neg logit contributions
 Adam-0.104
ax-0.103
 funn-0.102
 written-0.102
andra-0.100
Token a
Token ID257
Feature activation+5.24e-01

Pos logit contributions
 funn+0.345
ax+0.344
 Adam+0.340
 written+0.337
andra+0.331
Neg logit contributions
oppy-0.591
igs-0.573
 hedge-0.560
 web-0.548
ind-0.531
Token j
Token ID474
Feature activation+3.72e+00

Pos logit contributions
igs+1.007
oppy+1.003
 hedge+0.944
 web+0.944
 webs+0.916
Neg logit contributions
 funn-0.580
 Adam-0.575
ax-0.573
 written-0.556
andra-0.555
Tokenolly
Token ID5098
Feature activation+1.15e+00

Pos logit contributions
 web+5.946
 webs+5.881
igs+5.669
edient+5.553
 Chick+5.302
Neg logit contributions
ax-6.539
oph-5.707
andra-5.403
 Adam-5.327
 funn-5.056
Token angel
Token ID18304
Feature activation+6.80e-01

Pos logit contributions
igs+2.341
oppy+2.265
 web+2.120
 hedge+2.108
 webs+2.091
Neg logit contributions
 Adam-1.194
ax-1.181
andra-1.163
 funn-1.140
 written-1.128
Token flew
Token ID13112
Feature activation+2.55e-01

Pos logit contributions
oppy+0.584
igs+0.564
 hedge+0.492
 web+0.466
ind+0.431
Neg logit contributions
 written-1.489
 funn-1.463
 Adam-1.458
ax-1.455
andra-1.452
Token down
Token ID866
Feature activation+7.37e-01

Pos logit contributions
oppy+0.398
igs+0.378
 hedge+0.358
 web+0.353
ind+0.339
Neg logit contributions
 funn-0.392
 written-0.386
ax-0.385
 Adam-0.379
andra-0.369
Token from
Token ID422
Feature activation+1.15e+00

Pos logit contributions
oppy+1.403
igs+1.338
 web+1.239
 hedge+1.236
ind+1.224
Neg logit contributions
 written-0.904
 Adam-0.894
 funn-0.880
ax-0.865
 vanilla-0.820
Token a
Token ID257
Feature activation-7.98e-01

Pos logit contributions
andra+3.031
ax+3.027
 funn+2.900
 nineteen+2.871
oph+2.807
Neg logit contributions
oppy-3.926
 web-3.771
 hedge-3.716
igs-3.706
 webs-3.566
Token time
Token ID640
Feature activation+1.20e-02

Pos logit contributions
 funn+0.917
ax+0.909
 written+0.904
 Adam+0.893
andra+0.865
Neg logit contributions
oppy-1.540
igs-1.473
 web-1.425
 hedge-1.424
ind-1.379
Token there
Token ID612
Feature activation-3.27e-01

Pos logit contributions
oppy+0.017
igs+0.017
 hedge+0.016
 web+0.016
ind+0.015
Neg logit contributions
ax-0.019
 Adam-0.019
 funn-0.019
 written-0.019
andra-0.018
Token was
Token ID373
Feature activation-6.08e-01

Pos logit contributions
 funn+0.424
ax+0.422
 Adam+0.417
 written+0.414
andra+0.408
Neg logit contributions
oppy-0.573
igs-0.552
 hedge-0.539
 web-0.527
ind-0.508
Token a
Token ID257
Feature activation+8.56e-01

Pos logit contributions
 funn+0.574
ax+0.572
 Adam+0.560
 written+0.553
andra+0.549
Neg logit contributions
oppy-1.287
igs-1.247
 hedge-1.223
 web-1.204
ind-1.166
Token j
Token ID474
Feature activation+3.72e+00

Pos logit contributions
igs+1.698
oppy+1.676
 web+1.583
 hedge+1.578
 webs+1.556
Neg logit contributions
 funn-0.904
ax-0.883
 Adam-0.877
andra-0.849
 written-0.834
Tokenolly
Token ID5098
Feature activation+9.46e-01

Pos logit contributions
 webs+5.933
 web+5.891
igs+5.649
edient+5.461
 gut+5.397
Neg logit contributions
ax-6.625
oph-5.629
 Adam-5.265
andra-5.220
 flavor-5.171
Token monster
Token ID9234
Feature activation+1.56e+00

Pos logit contributions
igs+1.811
oppy+1.786
 hedge+1.667
 web+1.647
 webs+1.624
Neg logit contributions
ax-1.068
 Adam-1.064
 funn-1.039
 written-1.026
andra-1.024
Token.
Token ID13
Feature activation+1.19e+00

Pos logit contributions
igs+2.231
oppy+2.191
 hedge+1.934
 web+1.855
ind+1.849
Neg logit contributions
 written-2.622
 Adam-2.519
 funn-2.507
ax-2.500
 Birthday-2.432
Token He
Token ID679
Feature activation+1.67e+00

Pos logit contributions
igs+2.027
oppy+1.978
ind+1.859
 hedge+1.815
 web+1.793
Neg logit contributions
 Adam-1.733
ax-1.654
 funn-1.631
 written-1.629
 Sandra-1.577
Token liked
Token ID8288
Feature activation-5.21e-01

Pos logit contributions
oppy+3.308
igs+3.287
 hedge+3.053
 web+2.969
ind+2.904
Neg logit contributions
 written-1.939
 funn-1.886
 Adam-1.799
 flavor-1.756
ax-1.713
Token hungry
Token ID14720
Feature activation-1.10e+00

Pos logit contributions
oppy+2.249
igs+2.203
 hedge+2.189
 web+2.169
 webs+2.076
Neg logit contributions
ax-1.655
 funn-1.620
 written-1.576
 Adam-1.569
 vanilla-1.496
Token and
Token ID290
Feature activation+9.95e-01

Pos logit contributions
 funn+1.571
andra+1.545
 Adam+1.515
ax+1.515
 written+1.468
Neg logit contributions
oppy-1.773
igs-1.723
 hedge-1.720
 web-1.684
ind-1.581
Token wanted
Token ID2227
Feature activation-4.86e-01

Pos logit contributions
oppy+2.086
igs+2.050
 web+1.977
 hedge+1.953
ind+1.915
Neg logit contributions
 funn-1.008
 written-0.975
ax-0.975
 Adam-0.973
andra-0.900
Token to
Token ID284
Feature activation-4.71e-01

Pos logit contributions
 funn+0.570
ax+0.568
 Adam+0.558
 written+0.551
andra+0.549
Neg logit contributions
oppy-0.915
igs-0.891
 hedge-0.870
 web-0.850
ind-0.823
Token eat
Token ID4483
Feature activation-6.37e-01

Pos logit contributions
 funn+0.563
ax+0.561
 Adam+0.554
 written+0.552
andra+0.541
Neg logit contributions
oppy-0.884
igs-0.854
 hedge-0.838
 web-0.818
ind-0.787
Token the
Token ID262
Feature activation-4.16e+00

Pos logit contributions
 funn+0.812
ax+0.808
 Adam+0.799
andra+0.797
 written+0.789
Neg logit contributions
oppy-1.123
igs-1.100
 hedge-1.067
 web-1.034
ind-1.008
Token cake
Token ID12187
Feature activation-1.60e+00

Pos logit contributions
ax+6.535
David+6.030
andra+6.022
acket+5.969
 Adam+5.541
Neg logit contributions
 web-7.585
 hedge-7.393
 hive-7.364
igs-7.343
 swamp-7.155
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
andra+2.982
 funn+2.940
ax+2.907
 Adam+2.893
Whe+2.857
Neg logit contributions
oppy-1.945
igs-1.821
 hedge-1.785
 web-1.726
 webs-1.560
Token
Token ID220
Feature activation-1.15e+00

Pos logit contributions
 funn+1.566
ax+1.550
 written+1.523
 Adam+1.509
andra+1.498
Neg logit contributions
oppy-1.877
 hedge-1.797
igs-1.794
 web-1.675
ind-1.637
Token
Token ID198
Feature activation-1.09e+00

Pos logit contributions
 funn+1.658
ax+1.583
 written+1.575
andra+1.568
 Adam+1.560
Neg logit contributions
oppy-1.931
igs-1.839
 hedge-1.769
 web-1.720
ind-1.666
Token
Token ID198
Feature activation+6.46e-01

Pos logit contributions
 funn+1.556
ax+1.494
 written+1.487
andra+1.467
 Adam+1.466
Neg logit contributions
oppy-1.826
igs-1.737
 hedge-1.694
 web-1.631
ind-1.601
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
 funn+0.273
ax+0.272
 Adam+0.269
 written+0.267
andra+0.261
Neg logit contributions
oppy-0.480
igs-0.465
 hedge-0.454
 web-0.445
ind-0.431
Token Ben
Token ID3932
Feature activation-5.89e-01

Pos logit contributions
 funn+1.844
 written+1.830
andra+1.753
ax+1.751
 Birthday+1.739
Neg logit contributions
oppy-2.345
igs-2.182
 hedge-2.149
 web-2.031
 Bubble-1.956
Token want
Token ID765
Feature activation-1.61e+00

Pos logit contributions
ax+0.588
 funn+0.585
 Adam+0.581
 written+0.573
andra+0.573
Neg logit contributions
oppy-1.195
igs-1.160
 hedge-1.146
 web-1.120
ind-1.081
Token to
Token ID284
Feature activation-1.26e+00

Pos logit contributions
andra+2.314
ax+2.305
 funn+2.280
 Adam+2.179
oph+2.159
Neg logit contributions
 hedge-2.532
igs-2.517
oppy-2.513
 web-2.377
ind-2.270
Token try
Token ID1949
Feature activation-1.74e+00

Pos logit contributions
 funn+1.635
ax+1.609
 written+1.591
 Adam+1.585
andra+1.576
Neg logit contributions
oppy-2.207
 hedge-2.135
igs-2.126
 web-2.038
ind-1.954
Token the
Token ID262
Feature activation-4.13e+00

Pos logit contributions
 funn+2.595
andra+2.518
ax+2.495
 written+2.428
 Adam+2.421
Neg logit contributions
igs-2.808
oppy-2.743
 hedge-2.658
ind-2.528
 web-2.526
Token cream
Token ID8566
Feature activation-2.08e+00

Pos logit contributions
andra+7.596
oph+7.596
ax+7.516
 Adam+7.182
 Sandra+7.055
Neg logit contributions
 hedge-6.619
igs-5.722
 case-5.617
 swamp-5.597
 web-5.505
Token.
Token ID13
Feature activation-7.85e-01

Pos logit contributions
andra+3.855
 funn+3.739
 Adam+3.713
ax+3.681
 written+3.658
Neg logit contributions
oppy-2.487
 hedge-2.458
igs-2.289
 web-2.176
ind-2.080
Token
Token ID198
Feature activation-2.92e-01

Pos logit contributions
 funn+1.159
 written+1.152
ax+1.143
 Adam+1.123
andra+1.118
Neg logit contributions
oppy-1.254
igs-1.198
 hedge-1.174
 web-1.125
ind-1.076
Token
Token ID198
Feature activation+6.32e-01

Pos logit contributions
 funn+0.356
ax+0.354
 Adam+0.351
 written+0.348
andra+0.342
Neg logit contributions
oppy-0.534
igs-0.517
 hedge-0.504
 web-0.493
ind-0.477
TokenThey
Token ID2990
Feature activation+5.96e-01

Pos logit contributions
oppy+1.073
igs+1.019
 hedge+0.992
 web+0.991
 webs+0.953
Neg logit contributions
ax-0.903
 funn-0.889
 Adam-0.876
 written-0.861
andra-0.852
Token him
Token ID683
Feature activation-1.94e-01

Pos logit contributions
 funn+2.546
 nineteen+2.388
ax+2.375
andra+2.373
 Birthday+2.342
Neg logit contributions
oppy-3.031
 hedge-2.986
igs-2.918
ind-2.771
 web-2.729
Token and
Token ID290
Feature activation-6.70e-01

Pos logit contributions
 funn+0.262
ax+0.262
 Adam+0.259
 written+0.258
andra+0.253
Neg logit contributions
oppy-0.330
igs-0.319
 hedge-0.309
 web-0.302
ind-0.292
Token called
Token ID1444
Feature activation-4.19e-01

Pos logit contributions
 funn+0.706
ax+0.702
 Adam+0.687
 written+0.680
andra+0.673
Neg logit contributions
oppy-1.331
igs-1.284
 hedge-1.273
 web-1.233
ind-1.191
Token out
Token ID503
Feature activation+8.64e-01

Pos logit contributions
 funn+0.526
ax+0.521
 Adam+0.514
 written+0.512
andra+0.506
Neg logit contributions
oppy-0.752
igs-0.725
 hedge-0.712
 web-0.693
ind-0.668
Token.
Token ID13
Feature activation-2.94e+00

Pos logit contributions
oppy+1.634
igs+1.619
 web+1.502
 hedge+1.502
ind+1.457
Neg logit contributions
 Adam-1.052
 written-1.049
 funn-1.012
ax-0.994
 vanilla-0.944
Token "
Token ID366
Feature activation-4.13e+00

Pos logit contributions
 funn+5.771
 topp+5.195
 Birthday+5.082
 flavor+5.073
 written+5.007
Neg logit contributions
 hedge-3.511
oppy-3.462
igs-3.074
 swamp-2.899
 cellar-2.899
TokenWhat
Token ID2061
Feature activation-1.98e+00

Pos logit contributions
 funn+6.849
 topp+6.607
 Birthday+6.575
 timer+6.276
 written+6.141
Neg logit contributions
ind-6.053
F-5.677
oppy-5.455
 hedge-5.453
Sp-5.285
Token do
Token ID466
Feature activation-6.14e-01

Pos logit contributions
andra+2.852
ax+2.828
 funn+2.707
 Adam+2.631
 Sandra+2.620
Neg logit contributions
oppy-3.322
 hedge-3.119
igs-3.051
 web-3.009
 swamp-2.837
Token you
Token ID345
Feature activation-1.23e+00

Pos logit contributions
 funn+0.933
ax+0.931
 written+0.915
 Adam+0.910
andra+0.905
Neg logit contributions
oppy-0.950
igs-0.911
 hedge-0.879
 web-0.854
ind-0.819
Token have
Token ID423
Feature activation-6.37e-01

Pos logit contributions
 Adam+1.750
andra+1.736
 funn+1.734
ax+1.721
 written+1.680
Neg logit contributions
oppy-2.045
 hedge-1.923
igs-1.913
 web-1.830
 swamp-1.779
Token in
Token ID287
Feature activation+7.00e-01

Pos logit contributions
 funn+0.847
ax+0.836
 Adam+0.824
andra+0.816
 written+0.804
Neg logit contributions
oppy-1.117
igs-1.084
 hedge-1.055
 web-1.025
ind-0.991
Token some
Token ID617
Feature activation-1.74e+00

Pos logit contributions
andra+1.390
ax+1.386
 funn+1.361
 Adam+1.350
 written+1.333
Neg logit contributions
oppy-1.786
igs-1.697
 hedge-1.671
ind-1.596
 web-1.577
Token popcorn
Token ID38914
Feature activation-1.63e+00

Pos logit contributions
andra+2.230
 Adam+2.156
ax+2.106
 Sandra+2.081
 written+2.019
Neg logit contributions
oppy-3.126
igs-3.089
 hedge-3.031
ind-2.887
 web-2.882
Token.
Token ID13
Feature activation+3.73e-01

Pos logit contributions
andra+2.965
 funn+2.943
 Adam+2.912
ax+2.870
Whe+2.817
Neg logit contributions
oppy-2.037
igs-1.900
 hedge-1.880
 web-1.794
ind-1.718
Token Lily
Token ID20037
Feature activation-2.25e-01

Pos logit contributions
oppy+0.597
igs+0.582
 hedge+0.543
 web+0.539
ind+0.533
Neg logit contributions
 Adam-0.555
 funn-0.552
ax-0.543
 written-0.540
andra-0.528
Token saw
Token ID2497
Feature activation-1.97e+00

Pos logit contributions
 funn+0.246
ax+0.245
 Adam+0.243
 written+0.240
andra+0.236
Neg logit contributions
oppy-0.438
igs-0.426
 hedge-0.415
 web-0.407
ind-0.395
Token the
Token ID262
Feature activation-4.12e+00

Pos logit contributions
andra+2.796
ax+2.755
 funn+2.713
OM+2.649
 written+2.643
Neg logit contributions
 hedge-3.150
oppy-3.135
igs-3.026
 web-2.903
ind-2.859
Token same
Token ID976
Feature activation-2.75e+00

Pos logit contributions
andra+7.174
 Sandra+7.146
ax+6.732
ill+6.576
oph+6.534
Neg logit contributions
ind-7.145
 swamp-6.222
 hedge-6.176
igs-5.999
 poorly-5.915
Token clown
Token ID25573
Feature activation-1.43e+00

Pos logit contributions
andra+3.926
ax+3.893
 Sandra+3.823
 Adam+3.740
 written+3.718
Neg logit contributions
oppy-4.567
ind-4.544
igs-4.523
 hedge-4.486
 web-4.480
Token eating
Token ID6600
Feature activation-1.08e+00

Pos logit contributions
 funn+2.290
andra+2.259
ax+2.225
 Adam+2.216
 written+2.190
Neg logit contributions
oppy-2.136
igs-1.949
 hedge-1.917
 web-1.844
ind-1.840
Token a
Token ID257
Feature activation-2.07e+00

Pos logit contributions
andra+1.312
ax+1.295
 funn+1.277
 Adam+1.270
 written+1.252
Neg logit contributions
oppy-1.934
igs-1.921
 hedge-1.868
 web-1.826
ind-1.777
Token hot
Token ID3024
Feature activation-2.17e+00

Pos logit contributions
andra+3.581
ax+3.553
 Sandra+3.521
 Adam+3.517
 written+3.384
Neg logit contributions
oppy-2.936
igs-2.797
 web-2.762
 hedge-2.749
ind-2.587
Token you
Token ID345
Feature activation-7.87e-01

Pos logit contributions
 funn+2.257
ax+2.250
 written+2.248
andra+2.203
 Adam+2.202
Neg logit contributions
oppy-2.011
igs-1.925
 hedge-1.807
 web-1.719
ind-1.662
Token want
Token ID765
Feature activation-1.05e+00

Pos logit contributions
 funn+0.849
 Adam+0.847
ax+0.842
andra+0.829
 written+0.820
Neg logit contributions
oppy-1.562
igs-1.499
 hedge-1.466
 web-1.446
ind-1.384
Token to
Token ID284
Feature activation-1.78e+00

Pos logit contributions
ax+1.335
andra+1.322
 funn+1.315
 Adam+1.307
 written+1.258
Neg logit contributions
oppy-1.859
igs-1.836
 hedge-1.811
 web-1.754
ind-1.676
Token have
Token ID423
Feature activation-2.13e+00

Pos logit contributions
ax+2.229
 funn+2.214
andra+2.203
 Adam+2.197
 written+2.175
Neg logit contributions
oppy-3.281
 hedge-3.166
igs-3.143
 web-3.011
 swamp-2.923
Token a
Token ID257
Feature activation-2.27e+00

Pos logit contributions
ax+2.878
andra+2.838
 Adam+2.718
 funn+2.710
OM+2.664
Neg logit contributions
igs-3.744
 hedge-3.585
oppy-3.561
 web-3.446
 webs-3.365
Token closer
Token ID5699
Feature activation-4.10e+00

Pos logit contributions
ax+3.467
 Adam+3.417
andra+3.332
 written+3.303
 funn+3.290
Neg logit contributions
 hedge-3.668
oppy-3.633
igs-3.451
 web-3.395
 swamp-3.311
Token look
Token ID804
Feature activation-7.10e-01

Pos logit contributions
 Adam+6.915
Adam+6.770
ax+6.764
andra+6.594
oph+6.398
Neg logit contributions
 hedge-7.423
 swamp-6.992
 web-6.985
oppy-6.973
igs-6.805
Token?"
Token ID1701
Feature activation-6.93e-01

Pos logit contributions
 funn+0.873
ax+0.861
 Adam+0.858
andra+0.853
 written+0.845
Neg logit contributions
oppy-1.290
igs-1.254
 hedge-1.211
 web-1.180
ind-1.163
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
 funn+0.914
ax+0.895
 Adam+0.887
 written+0.885
andra+0.873
Neg logit contributions
oppy-1.218
igs-1.174
 hedge-1.158
 web-1.115
ind-1.057
Token
Token ID198
Feature activation+6.24e-01

Pos logit contributions
oppy+0.293
igs+0.283
 hedge+0.276
 web+0.273
ind+0.265
Neg logit contributions
ax-0.176
 Adam-0.173
 funn-0.171
 written-0.169
andra-0.167
TokenTim
Token ID14967
Feature activation-1.16e+00

Pos logit contributions
oppy+1.197
igs+1.132
 web+1.098
 hedge+1.096
ind+1.056
Neg logit contributions
ax-0.746
 funn-0.728
 Adam-0.719
 written-0.707
andra-0.702
Token joke
Token ID9707
Feature activation-1.01e+00

Pos logit contributions
ax+0.816
 funn+0.806
 Adam+0.799
 written+0.794
andra+0.787
Neg logit contributions
oppy-1.194
igs-1.139
 hedge-1.137
 web-1.109
ind-1.049
Token.
Token ID13
Feature activation-4.12e-01

Pos logit contributions
 funn+1.452
ax+1.439
andra+1.436
 Adam+1.418
 written+1.373
Neg logit contributions
oppy-1.605
 hedge-1.533
igs-1.525
 web-1.501
 webs-1.405
Token "
Token ID366
Feature activation-1.53e+00

Pos logit contributions
 funn+0.570
ax+0.568
 written+0.557
 Adam+0.557
andra+0.548
Neg logit contributions
oppy-0.689
igs-0.662
 hedge-0.648
 web-0.631
ind-0.605
TokenWhy
Token ID5195
Feature activation-1.62e+00

Pos logit contributions
ax+2.238
 Adam+2.199
 funn+2.143
andra+2.140
 written+2.127
Neg logit contributions
oppy-2.379
 hedge-2.267
 web-2.239
igs-2.229
ind-2.199
Token did
Token ID750
Feature activation+1.19e-02

Pos logit contributions
ax+2.264
andra+2.231
 funn+2.229
 written+2.145
 nineteen+2.121
Neg logit contributions
oppy-2.843
igs-2.606
 hedge-2.577
 web-2.494
 webs-2.357
Token the
Token ID262
Feature activation-4.09e+00

Pos logit contributions
oppy+0.019
igs+0.019
 hedge+0.018
 web+0.018
ind+0.017
Neg logit contributions
 funn-0.017
 Adam-0.017
ax-0.017
 written-0.016
andra-0.016
Token chicken
Token ID9015
Feature activation-1.28e+00

Pos logit contributions
andra+6.963
ax+6.862
 Sandra+6.500
oph+6.399
 Adam+6.378
Neg logit contributions
 hedge-7.289
 web-6.913
 swamp-6.853
oppy-6.254
 webs-6.123
Token cross
Token ID3272
Feature activation-1.03e+00

Pos logit contributions
 Adam+1.664
ax+1.651
andra+1.624
 funn+1.620
 written+1.613
Neg logit contributions
oppy-2.292
igs-2.173
 hedge-2.150
 web-2.101
 swamp-2.005
Token the
Token ID262
Feature activation-2.68e-01

Pos logit contributions
ax+1.353
 Adam+1.345
 funn+1.344
andra+1.330
 written+1.321
Neg logit contributions
oppy-1.785
igs-1.747
 hedge-1.714
 web-1.662
ind-1.600
Token road
Token ID2975
Feature activation-1.31e+00

Pos logit contributions
 funn+0.407
ax+0.406
 Adam+0.402
 written+0.400
andra+0.394
Neg logit contributions
oppy-0.414
igs-0.398
 hedge-0.386
 web-0.376
ind-0.361
Token?"
Token ID1701
Feature activation-5.93e-01

Pos logit contributions
 funn+1.740
ax+1.732
 Adam+1.724
andra+1.720
 written+1.701
Neg logit contributions
oppy-2.229
igs-2.165
 hedge-2.141
 web-2.056
ind-1.994
Token
Token ID198
Feature activation-4.87e-01

Pos logit contributions
oppy+0.207
igs+0.203
 hedge+0.195
 web+0.194
ind+0.189
Neg logit contributions
 Adam-0.132
ax-0.131
 funn-0.130
 written-0.127
andra-0.125
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
 funn+0.619
ax+0.612
 Adam+0.603
 written+0.602
andra+0.592
Neg logit contributions
oppy-0.874
igs-0.842
 hedge-0.822
 web-0.804
ind-0.776
TokenThe
Token ID464
Feature activation-9.79e-01

Pos logit contributions
oppy+2.333
igs+2.205
 web+2.060
 hedge+2.052
 webs+1.981
Neg logit contributions
 funn-1.612
 Adam-1.576
 written-1.565
ax-1.565
Adam-1.526
Token friend
Token ID1545
Feature activation+2.02e-01

Pos logit contributions
ax+1.477
 Adam+1.465
 written+1.443
 funn+1.434
andra+1.432
Neg logit contributions
oppy-1.539
igs-1.476
 hedge-1.474
 web-1.410
ind-1.352
Token without
Token ID1231
Feature activation-8.38e-01

Pos logit contributions
oppy+0.397
igs+0.391
 hedge+0.375
 web+0.370
ind+0.361
Neg logit contributions
ax-0.219
 Adam-0.219
 funn-0.217
 written-0.215
andra-0.211
Token the
Token ID262
Feature activation-4.02e+00

Pos logit contributions
ax+1.215
 funn+1.206
andra+1.189
 written+1.186
 Adam+1.178
Neg logit contributions
oppy-1.343
igs-1.294
 hedge-1.260
 web-1.221
ind-1.167
Token mustache
Token ID49303
Feature activation-7.44e-01

Pos logit contributions
Whe+5.350
 Adam+5.275
oph+5.147
OM+5.133
ax+4.983
Neg logit contributions
oppy-7.318
 hedge-7.289
igs-7.025
 web-6.996
 gut-6.971
Token was
Token ID373
Feature activation-1.05e+00

Pos logit contributions
 funn+1.128
ax+1.126
 Adam+1.112
andra+1.097
 written+1.083
Neg logit contributions
oppy-1.142
igs-1.083
 hedge-1.070
 web-1.043
ind-0.974
Token so
Token ID523
Feature activation+3.12e-02

Pos logit contributions
ax+1.534
 funn+1.522
 Adam+1.517
andra+1.512
 written+1.453
Neg logit contributions
oppy-1.646
igs-1.595
 hedge-1.556
 web-1.534
ind-1.461
Token excited
Token ID6568
Feature activation-1.62e+00

Pos logit contributions
oppy+0.053
igs+0.051
 hedge+0.050
 web+0.049
ind+0.047
Neg logit contributions
 funn-0.044
ax-0.043
 written-0.043
 Adam-0.043
andra-0.042
Token to
Token ID284
Feature activation-8.68e-01

Pos logit contributions
 funn+2.832
ax+2.712
andra+2.708
Whe+2.668
 Adam+2.665
Neg logit contributions
oppy-2.188
igs-2.094
 hedge-2.050
 web-1.961
 webs-1.875
Token"
Token ID1
Feature activation-1.32e+00

Pos logit contributions
 funn+1.368
ax+1.339
 written+1.332
 Adam+1.323
andra+1.303
Neg logit contributions
oppy-0.972
igs-0.925
 hedge-0.898
 web-0.869
ind-0.829
TokenPlease
Token ID5492
Feature activation+8.13e-01

Pos logit contributions
ax+1.638
 funn+1.627
 Adam+1.620
 written+1.590
andra+1.576
Neg logit contributions
oppy-2.319
 hedge-2.263
igs-2.194
 web-2.166
ind-2.113
Token let
Token ID1309
Feature activation-1.41e+00

Pos logit contributions
igs+1.388
oppy+1.365
 web+1.297
ind+1.291
 hedge+1.288
Neg logit contributions
 Adam-1.090
ax-1.082
 funn-1.053
 written-1.044
andra-1.023
Token me
Token ID502
Feature activation-9.25e-01

Pos logit contributions
 funn+2.387
 written+2.353
ax+2.285
 Adam+2.258
andra+2.245
Neg logit contributions
oppy-2.039
 hedge-1.882
igs-1.873
 web-1.773
ind-1.702
Token go
Token ID467
Feature activation-3.16e-01

Pos logit contributions
 funn+1.221
 Adam+1.206
ax+1.195
 written+1.190
andra+1.190
Neg logit contributions
oppy-1.638
igs-1.570
 hedge-1.544
 web-1.494
ind-1.429
Token"
Token ID1
Feature activation-4.00e+00

Pos logit contributions
 funn+0.364
ax+0.362
 Adam+0.358
 written+0.355
andra+0.349
Neg logit contributions
oppy-0.601
igs-0.582
 hedge-0.569
 web-0.557
ind-0.539
Token said
Token ID531
Feature activation-9.02e-01

Pos logit contributions
 Birthday+6.634
 funn+6.588
acket+6.264
andra+6.230
 nineteen+6.217
Neg logit contributions
oppy-5.395
 hedge-5.340
igs-5.340
 alone-5.084
hog-4.972
Token Jim
Token ID5395
Feature activation+1.50e-01

Pos logit contributions
 funn+1.313
ax+1.296
andra+1.283
 written+1.274
 Adam+1.249
Neg logit contributions
oppy-1.440
igs-1.381
 hedge-1.351
 web-1.327
 webs-1.249
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
oppy+0.220
igs+0.215
 hedge+0.208
 web+0.202
ind+0.195
Neg logit contributions
ax-0.237
 Adam-0.236
 funn-0.234
 written-0.233
andra-0.228
Token
Token ID198
Feature activation-9.68e-01

Pos logit contributions
 funn+1.777
ax+1.740
 written+1.716
andra+1.692
 Adam+1.616
Neg logit contributions
oppy-2.050
 hedge-1.960
igs-1.927
 web-1.872
 swamp-1.768
Token
Token ID198
Feature activation-1.08e+00

Pos logit contributions
 funn+1.327
 written+1.280
ax+1.274
 Adam+1.238
andra+1.228
Neg logit contributions
oppy-1.657
igs-1.594
 hedge-1.563
 web-1.520
ind-1.464
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
oppy+0.270
igs+0.257
 hedge+0.246
 web+0.239
ind+0.232
Neg logit contributions
 written-0.283
ax-0.282
 funn-0.281
 Adam-0.280
andra-0.272
Token Sam
Token ID3409
Feature activation+7.99e-01

Pos logit contributions
 funn+2.435
ax+2.420
 written+2.360
andra+2.352
 Adam+2.332
Neg logit contributions
oppy-2.451
 hedge-2.366
igs-2.339
 web-2.255
 swamp-2.105
Token wanted
Token ID2227
Feature activation-1.01e+00

Pos logit contributions
oppy+1.706
igs+1.674
 hedge+1.595
 web+1.587
ind+1.555
Neg logit contributions
 written-0.754
 funn-0.745
 Adam-0.742
ax-0.737
andra-0.708
Token to
Token ID284
Feature activation-5.86e-01

Pos logit contributions
 funn+1.306
andra+1.296
ax+1.290
 Adam+1.273
 written+1.247
Neg logit contributions
oppy-1.760
igs-1.736
 hedge-1.704
 web-1.644
ind-1.570
Token eat
Token ID4483
Feature activation-6.10e-01

Pos logit contributions
 funn+0.721
ax+0.716
 Adam+0.715
 written+0.709
andra+0.698
Neg logit contributions
oppy-1.071
igs-1.035
 hedge-1.019
 web-0.991
ind-0.948
Token the
Token ID262
Feature activation-3.99e+00

Pos logit contributions
 funn+0.815
ax+0.810
 Adam+0.808
andra+0.805
 written+0.798
Neg logit contributions
oppy-1.041
igs-1.020
 hedge-0.988
 web-0.961
ind-0.922
Token ice
Token ID4771
Feature activation-1.37e+00

Pos logit contributions
andra+7.201
 Adam+6.692
 Sandra+6.541
oph+6.347
 written+6.332
Neg logit contributions
igs-6.212
 hedge-6.116
 swamp-5.825
ind-5.679
 case-5.639
Token cream
Token ID8566
Feature activation-1.28e+00

Pos logit contributions
andra+1.495
 funn+1.457
 Adam+1.453
 written+1.453
ax+1.388
Neg logit contributions
oppy-2.725
igs-2.686
 hedge-2.567
 web-2.492
ind-2.433
Token cone
Token ID27763
Feature activation-4.80e-01

Pos logit contributions
andra+2.190
 funn+2.155
 Adam+2.136
ax+2.119
 written+2.112
Neg logit contributions
oppy-1.717
 hedge-1.622
igs-1.614
 web-1.537
ind-1.433
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
 funn+0.777
ax+0.772
 Adam+0.768
 written+0.762
andra+0.762
Neg logit contributions
oppy-0.686
igs-0.655
 hedge-0.635
 web-0.618
ind-0.587
Token He
Token ID679
Feature activation+3.32e-01

Pos logit contributions
 funn+2.163
 written+2.140
ax+2.137
andra+2.086
 Adam+2.083
Neg logit contributions
oppy-2.315
 hedge-2.201
igs-2.197
 web-2.076
 swamp-1.961
Token
Token ID198
Feature activation-3.59e-01

Pos logit contributions
ax+0.053
 funn+0.053
 Adam+0.053
 written+0.052
andra+0.051
Neg logit contributions
oppy-0.069
igs-0.067
 hedge-0.065
 web-0.064
ind-0.062
Token
Token ID198
Feature activation+5.57e-01

Pos logit contributions
 funn+0.483
ax+0.480
 Adam+0.475
 written+0.472
andra+0.465
Neg logit contributions
oppy-0.612
igs-0.590
 hedge-0.574
 web-0.561
ind-0.541
TokenThe
Token ID464
Feature activation-2.38e+00

Pos logit contributions
oppy+0.909
igs+0.860
 hedge+0.830
 web+0.819
 webs+0.774
Neg logit contributions
 funn-0.842
ax-0.829
 Adam-0.820
 written-0.815
andra-0.801
Token man
Token ID582
Feature activation-6.69e-01

Pos logit contributions
ax+3.594
 Adam+3.479
 written+3.440
andra+3.406
oph+3.375
Neg logit contributions
 hedge-3.814
oppy-3.730
igs-3.537
 swamp-3.441
ind-3.389
Token took
Token ID1718
Feature activation-1.23e+00

Pos logit contributions
 funn+0.763
ax+0.758
 Adam+0.750
 written+0.737
andra+0.725
Neg logit contributions
oppy-1.271
igs-1.226
 hedge-1.206
 web-1.174
ind-1.142
Token the
Token ID262
Feature activation-3.98e+00

Pos logit contributions
ax+1.952
 funn+1.946
 written+1.942
andra+1.917
 Adam+1.908
Neg logit contributions
oppy-1.775
igs-1.713
 hedge-1.665
 web-1.565
ind-1.528
Token boy
Token ID2933
Feature activation+8.03e-02

Pos logit contributions
ax+6.610
Whe+6.355
oph+6.339
 written+6.329
andra+6.320
Neg logit contributions
 hedge-6.085
oppy-6.048
 swamp-5.877
 poorly-5.723
 underground-5.524
Token to
Token ID284
Feature activation-1.52e+00

Pos logit contributions
oppy+0.151
igs+0.147
 hedge+0.142
 web+0.140
ind+0.135
Neg logit contributions
 funn-0.094
ax-0.094
 Adam-0.093
 written-0.093
andra-0.090
Token a
Token ID257
Feature activation+1.17e-01

Pos logit contributions
ax+2.527
 funn+2.525
 written+2.479
andra+2.440
 Adam+2.367
Neg logit contributions
oppy-2.091
 hedge-2.033
igs-1.986
 web-1.936
 swamp-1.822
Token special
Token ID2041
Feature activation+1.31e-01

Pos logit contributions
oppy+0.190
igs+0.184
 hedge+0.174
 web+0.171
ind+0.165
Neg logit contributions
 funn-0.168
 Adam-0.167
ax-0.166
 written-0.165
andra-0.161
Token pizza
Token ID14256
Feature activation-4.84e-01

Pos logit contributions
oppy+0.217
igs+0.210
 hedge+0.198
 web+0.195
 webs+0.188
Neg logit contributions
 funn-0.183
 Adam-0.182
ax-0.181
 written-0.180
andra-0.177
Token sandwich
Token ID20433
Feature activation-6.92e-01

Pos logit contributions
ax+1.346
 funn+1.274
andra+1.266
 Adam+1.256
 written+1.255
Neg logit contributions
oppy-2.941
 hedge-2.844
igs-2.844
 web-2.798
ind-2.697
Token.
Token ID13
Feature activation+5.06e-01

Pos logit contributions
 funn+1.111
ax+1.108
 Adam+1.102
andra+1.094
 written+1.076
Neg logit contributions
oppy-0.998
igs-0.954
 hedge-0.925
 web-0.912
ind-0.851
Token Tim
Token ID5045
Feature activation-8.91e-01

Pos logit contributions
oppy+0.940
igs+0.903
 web+0.855
ind+0.852
 hedge+0.845
Neg logit contributions
 Adam-0.628
 funn-0.621
ax-0.619
 written-0.614
 vanilla-0.602
Tokenmy
Token ID1820
Feature activation-3.52e-01

Pos logit contributions
 funn+0.591
ax+0.581
 Adam+0.577
 written+0.567
andra+0.542
Neg logit contributions
oppy-2.160
igs-2.081
 hedge-2.034
 web-2.018
ind-1.978
Token loved
Token ID6151
Feature activation-1.04e+00

Pos logit contributions
 funn+0.439
ax+0.438
 Adam+0.432
 written+0.428
andra+0.422
Neg logit contributions
oppy-0.638
igs-0.616
 hedge-0.599
 web-0.588
ind-0.567
Token the
Token ID262
Feature activation-3.98e+00

Pos logit contributions
ax+1.290
 funn+1.270
andra+1.266
 Adam+1.233
 written+1.226
Neg logit contributions
oppy-1.881
igs-1.863
 hedge-1.786
 web-1.761
ind-1.692
Token sandwich
Token ID20433
Feature activation-3.40e-01

Pos logit contributions
ax+6.902
andra+6.813
 Sandra+6.667
 Adam+6.545
David+6.373
Neg logit contributions
 hedge-6.253
 swamp-6.122
 web-6.099
oppy-6.035
 hive-5.707
Token so
Token ID523
Feature activation+9.05e-01

Pos logit contributions
 funn+0.470
ax+0.470
 Adam+0.465
 written+0.461
andra+0.457
Neg logit contributions
oppy-0.566
igs-0.545
 hedge-0.530
 web-0.519
ind-0.498
Token much
Token ID881
Feature activation-6.26e-01

Pos logit contributions
oppy+1.130
 hedge+1.099
igs+1.092
 web+1.065
ind+0.993
Neg logit contributions
 Adam-1.639
ax-1.624
 funn-1.622
 flavor-1.611
 written-1.595
Token that
Token ID326
Feature activation-5.36e-01

Pos logit contributions
 funn+1.026
ax+1.018
 Adam+1.006
andra+1.005
 written+0.996
Neg logit contributions
oppy-0.884
igs-0.852
 hedge-0.828
 web-0.805
ind-0.756
Token he
Token ID339
Feature activation+3.10e-01

Pos logit contributions
ax+0.681
 funn+0.674
 Adam+0.664
andra+0.659
 written+0.658
Neg logit contributions
oppy-0.960
igs-0.926
 hedge-0.908
 web-0.883
ind-0.847
Token Mom
Token ID11254
Feature activation+2.29e-01

Pos logit contributions
 funn+2.786
andra+2.722
 written+2.692
ax+2.674
 Birthday+2.588
Neg logit contributions
oppy-2.222
 hedge-2.169
igs-2.143
 web-2.020
 swamp-1.971
Token and
Token ID290
Feature activation-2.11e+00

Pos logit contributions
oppy+0.460
igs+0.444
 hedge+0.433
 web+0.426
ind+0.412
Neg logit contributions
ax-0.243
 Adam-0.242
 funn-0.240
 written-0.236
andra-0.229
Token Dad
Token ID17415
Feature activation-2.12e-01

Pos logit contributions
 funn+2.454
 written+2.315
andra+2.268
 Birthday+2.172
 nineteen+2.172
Neg logit contributions
oppy-4.215
igs-3.985
 hedge-3.942
ind-3.687
 swamp-3.587
Token say
Token ID910
Feature activation-5.82e-01

Pos logit contributions
 funn+0.222
ax+0.222
 Adam+0.219
 written+0.217
andra+0.212
Neg logit contributions
oppy-0.426
igs-0.414
 hedge-0.404
 web-0.397
ind-0.385
Token,
Token ID11
Feature activation-7.57e-01

Pos logit contributions
 funn+0.973
ax+0.966
 written+0.952
andra+0.951
 Adam+0.950
Neg logit contributions
oppy-0.810
igs-0.776
 hedge-0.755
 web-0.724
ind-0.680
Token "
Token ID366
Feature activation-3.94e+00

Pos logit contributions
 funn+0.988
ax+0.970
andra+0.950
 Adam+0.949
 written+0.949
Neg logit contributions
oppy-1.323
igs-1.282
 hedge-1.272
 web-1.222
ind-1.166
TokenTom
Token ID13787
Feature activation-1.07e+00

Pos logit contributions
 written+6.066
urger+5.923
 Birthday+5.917
 Adam+5.866
 funn+5.851
Neg logit contributions
F-5.726
ind-5.679
 hedge-5.665
Fl-5.621
B-5.386
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
 funn+1.963
 written+1.898
ax+1.894
 Adam+1.878
andra+1.855
Neg logit contributions
oppy-1.364
igs-1.282
 hedge-1.238
 web-1.176
ind-1.123
Token slow
Token ID3105
Feature activation-5.81e-01

Pos logit contributions
andra+3.402
 written+3.282
 funn+3.173
ax+3.170
 announced+3.085
Neg logit contributions
 hedge-2.972
oppy-2.966
igs-2.849
 web-2.712
 swamp-2.665
Token down
Token ID866
Feature activation+3.48e-01

Pos logit contributions
 funn+0.822
ax+0.810
 written+0.809
 Adam+0.804
andra+0.795
Neg logit contributions
oppy-0.957
igs-0.928
 hedge-0.906
 web-0.874
ind-0.844
Token!
Token ID0
Feature activation-1.10e+00

Pos logit contributions
oppy+0.683
igs+0.657
 hedge+0.631
 web+0.624
ind+0.606
Neg logit contributions
 Adam-0.388
ax-0.386
 funn-0.383
 written-0.377
andra-0.359
Token bird
Token ID6512
Feature activation-3.25e-01

Pos logit contributions
ax+0.904
 funn+0.894
 Adam+0.892
 written+0.885
andra+0.870
Neg logit contributions
oppy-1.188
igs-1.128
 hedge-1.127
 web-1.085
ind-1.044
Token flew
Token ID13112
Feature activation-5.65e-01

Pos logit contributions
 funn+0.369
ax+0.367
 Adam+0.364
 written+0.360
andra+0.353
Neg logit contributions
oppy-0.622
igs-0.603
 hedge-0.589
 web-0.577
ind-0.559
Token by
Token ID416
Feature activation-3.59e-01

Pos logit contributions
ax+0.794
 Adam+0.792
 funn+0.790
andra+0.779
 written+0.773
Neg logit contributions
oppy-0.927
igs-0.896
 hedge-0.880
 web-0.855
ind-0.826
Token and
Token ID290
Feature activation-2.46e-01

Pos logit contributions
 funn+0.427
ax+0.425
 Adam+0.419
 written+0.416
andra+0.410
Neg logit contributions
oppy-0.673
igs-0.653
 hedge-0.637
 web-0.625
ind-0.603
Token thought
Token ID1807
Feature activation+4.11e-01

Pos logit contributions
 funn+0.273
ax+0.272
 Adam+0.269
 written+0.267
andra+0.261
Neg logit contributions
oppy-0.479
igs-0.464
 hedge-0.453
 web-0.444
ind-0.431
Token the
Token ID262
Feature activation-3.92e+00

Pos logit contributions
oppy+0.786
igs+0.743
 hedge+0.741
 web+0.721
ind+0.716
Neg logit contributions
 Adam-0.479
ax-0.474
 written-0.472
 funn-0.460
andra-0.444
Token mustache
Token ID49303
Feature activation-3.14e-01

Pos logit contributions
 Adam+7.819
ax+7.733
oph+7.405
Adam+7.304
Whe+7.243
Neg logit contributions
 hedge-5.647
 web-5.047
 hive-4.675
 swamp-4.598
 alone-4.508
Token was
Token ID373
Feature activation-5.79e-01

Pos logit contributions
 funn+0.406
ax+0.406
 Adam+0.402
 written+0.397
andra+0.392
Neg logit contributions
oppy-0.549
igs-0.530
 hedge-0.517
 web-0.506
ind-0.487
Token a
Token ID257
Feature activation-3.53e-01

Pos logit contributions
ax+0.882
 Adam+0.875
 funn+0.869
andra+0.864
 written+0.855
Neg logit contributions
oppy-0.884
igs-0.855
 hedge-0.830
 web-0.814
ind-0.775
Token nest
Token ID16343
Feature activation-2.77e-01

Pos logit contributions
ax+0.523
 funn+0.522
 Adam+0.518
 written+0.514
andra+0.508
Neg logit contributions
oppy-0.555
igs-0.533
 hedge-0.520
 web-0.507
ind-0.486
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
 funn+0.440
ax+0.439
 Adam+0.436
 written+0.433
andra+0.427
Neg logit contributions
oppy-0.404
igs-0.388
 hedge-0.375
 web-0.365
ind-0.350
Token Jack
Token ID3619
Feature activation+3.03e-01

Pos logit contributions
ax+1.524
andra+1.500
 funn+1.497
 written+1.469
 Adam+1.452
Neg logit contributions
oppy-2.189
 hedge-2.144
igs-2.072
 web-2.001
 Bubble-1.932
Token and
Token ID290
Feature activation-3.31e-01

Pos logit contributions
oppy+0.491
igs+0.479
 hedge+0.459
 web+0.453
ind+0.437
Neg logit contributions
 Adam-0.434
ax-0.433
 written-0.429
 funn-0.422
andra-0.422
Token Jill
Token ID22772
Feature activation+5.68e-01

Pos logit contributions
 funn+0.432
ax+0.432
 Adam+0.426
 written+0.424
andra+0.418
Neg logit contributions
oppy-0.580
igs-0.559
 hedge-0.547
 web-0.533
ind-0.514
Token,
Token ID11
Feature activation-1.75e+00

Pos logit contributions
oppy+0.885
igs+0.868
 hedge+0.815
 web+0.815
ind+0.785
Neg logit contributions
ax-0.859
 Adam-0.854
 written-0.851
 funn-0.847
andra-0.826
Token wanted
Token ID2227
Feature activation-1.20e+00

Pos logit contributions
 funn+2.331
ax+2.322
andra+2.263
 Adam+2.237
 nineteen+2.200
Neg logit contributions
oppy-2.934
 hedge-2.896
igs-2.728
 web-2.713
 webs-2.592
Token the
Token ID262
Feature activation-3.91e+00

Pos logit contributions
ax+1.610
 funn+1.576
andra+1.561
 Adam+1.511
 written+1.483
Neg logit contributions
oppy-2.032
 hedge-2.006
igs-1.996
 web-1.912
ind-1.823
Token flag
Token ID6056
Feature activation-1.27e+00

Pos logit contributions
ax+7.272
oph+6.720
 Adam+6.684
Whe+6.604
acket+6.498
Neg logit contributions
 hedge-6.051
igs-5.348
 swamp-5.332
 web-5.142
 underground-5.115
Token too
Token ID1165
Feature activation+5.77e-01

Pos logit contributions
andra+2.134
ax+2.132
 Adam+2.122
 funn+2.089
 written+2.056
Neg logit contributions
oppy-1.757
 hedge-1.668
igs-1.653
 web-1.574
ind-1.469
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
oppy+0.910
igs+0.881
 hedge+0.849
 web+0.843
 webs+0.801
Neg logit contributions
ax-0.876
 Adam-0.870
 written-0.864
 funn-0.860
 flavor-0.830
Token So
Token ID1406
Feature activation-3.09e-01

Pos logit contributions
 funn+2.980
 written+2.976
ax+2.968
andra+2.908
 vanilla+2.888
Neg logit contributions
oppy-2.108
 hedge-2.015
igs-1.961
 web-1.776
 swamp-1.771
Token they
Token ID484
Feature activation-1.75e-01

Pos logit contributions
 funn+0.514
ax+0.513
 Adam+0.508
 written+0.506
andra+0.500
Neg logit contributions
oppy-0.432
igs-0.413
 hedge-0.400
 web-0.388
ind-0.370
Token"
Token ID1
Feature activation-1.81e+00

Pos logit contributions
oppy+3.067
igs+2.931
 web+2.853
 webs+2.758
 hedge+2.722
Neg logit contributions
 Adam-2.738
Adam-2.586
ax-2.575
andra-2.563
 Sandra-2.517
TokenYes
Token ID5297
Feature activation-1.16e+00

Pos logit contributions
andra+2.192
ax+2.122
 Adam+2.097
 written+2.021
 Sandra+1.996
Neg logit contributions
 hedge-3.303
oppy-3.300
igs-3.182
 web-3.091
ind-3.047
Token,
Token ID11
Feature activation-2.18e+00

Pos logit contributions
 funn+2.166
 written+2.143
andra+2.101
ax+2.083
 Adam+2.056
Neg logit contributions
oppy-1.455
igs-1.376
 hedge-1.312
 web-1.231
ind-1.169
Token we
Token ID356
Feature activation-2.31e+00

Pos logit contributions
andra+3.385
 funn+3.159
ax+3.148
 written+3.062
Whe+3.060
Neg logit contributions
oppy-3.318
 hedge-3.311
igs-3.179
 web-3.031
 poorly-2.910
Token liked
Token ID8288
Feature activation-1.11e+00

Pos logit contributions
 Adam+2.965
andra+2.943
ax+2.862
 Sandra+2.787
 funn+2.767
Neg logit contributions
oppy-4.147
 hedge-3.852
igs-3.767
 web-3.752
 swamp-3.643
Token the
Token ID262
Feature activation-3.89e+00

Pos logit contributions
ax+1.558
andra+1.552
 funn+1.548
 written+1.503
 Adam+1.503
Neg logit contributions
oppy-1.798
igs-1.788
 hedge-1.727
 web-1.678
ind-1.656
Token clown
Token ID25573
Feature activation-1.54e+00

Pos logit contributions
andra+6.255
ax+6.008
oph+5.845
 Sandra+5.816
 written+5.724
Neg logit contributions
 hedge-6.905
 swamp-6.879
 web-6.051
 case-5.989
oppy-5.908
Token.
Token ID13
Feature activation-9.00e-01

Pos logit contributions
andra+2.408
ax+2.364
 funn+2.358
 Adam+2.341
 written+2.298
Neg logit contributions
oppy-2.365
igs-2.220
 hedge-2.125
 web-2.117
ind-2.028
Token He
Token ID679
Feature activation-8.49e-01

Pos logit contributions
ax+1.385
andra+1.370
 funn+1.369
 written+1.352
 Adam+1.340
Neg logit contributions
oppy-1.339
igs-1.266
 hedge-1.262
 web-1.209
ind-1.174
Token was
Token ID373
Feature activation-1.21e+00

Pos logit contributions
ax+1.170
 Adam+1.150
 funn+1.147
andra+1.134
 written+1.121
Neg logit contributions
oppy-1.417
igs-1.347
 hedge-1.330
 web-1.297
ind-1.253
Token very
Token ID845
Feature activation+1.72e-01

Pos logit contributions
ax+1.921
andra+1.918
 funn+1.853
 Adam+1.852
 written+1.843
Neg logit contributions
oppy-1.739
igs-1.676
 hedge-1.627
 web-1.609
ind-1.579
Token
Token ID198
Feature activation-3.29e-01

Pos logit contributions
 funn+2.295
 written+2.270
ax+2.234
 Adam+2.224
andra+2.188
Neg logit contributions
oppy-2.351
 hedge-2.260
igs-2.259
 web-2.094
 Bubble-2.005
Token
Token ID198
Feature activation+7.55e-01

Pos logit contributions
 funn+0.405
ax+0.403
 Adam+0.398
 written+0.395
andra+0.389
Neg logit contributions
oppy-0.601
igs-0.581
 hedge-0.566
 web-0.554
ind-0.535
Token"
Token ID1
Feature activation-2.20e+00

Pos logit contributions
oppy+1.168
igs+1.102
 web+1.078
 hedge+1.054
ind+1.029
Neg logit contributions
ax-1.160
 funn-1.141
andra-1.136
 Adam-1.128
 written-1.126
TokenWhat
Token ID2061
Feature activation-1.95e+00

Pos logit contributions
ax+2.467
 Adam+2.330
andra+2.255
 written+2.235
 funn+2.207
Neg logit contributions
 hedge-4.352
oppy-4.046
 web-4.033
ind-3.969
igs-3.931
Token is
Token ID318
Feature activation-1.00e+00

Pos logit contributions
ax+3.391
andra+3.323
 funn+3.230
 Adam+3.111
 Sandra+3.077
Neg logit contributions
oppy-2.741
 hedge-2.564
 web-2.518
igs-2.498
 webs-2.285
Token that
Token ID326
Feature activation-3.89e+00

Pos logit contributions
ax+1.615
 funn+1.589
andra+1.588
 Adam+1.546
 written+1.535
Neg logit contributions
oppy-1.435
igs-1.434
 hedge-1.390
 web-1.321
ind-1.278
Token?"
Token ID1701
Feature activation-2.84e-01

Pos logit contributions
ax+7.249
andra+6.986
oph+6.920
 funn+6.492
Whe+6.477
Neg logit contributions
 hedge-5.495
 web-5.370
igs-5.031
oppy-4.992
 swamp-4.768
Token Sam
Token ID3409
Feature activation-3.43e-01

Pos logit contributions
 funn+0.299
ax+0.298
 Adam+0.294
 written+0.291
andra+0.286
Neg logit contributions
oppy-0.568
igs-0.552
 hedge-0.539
 web-0.529
ind-0.512
Token asks
Token ID7893
Feature activation+5.16e-01

Pos logit contributions
 funn+0.401
ax+0.399
 Adam+0.395
 written+0.391
andra+0.385
Neg logit contributions
oppy-0.644
igs-0.623
 hedge-0.609
 web-0.596
ind-0.577
Token.
Token ID13
Feature activation+9.15e-02

Pos logit contributions
oppy+0.797
igs+0.788
 hedge+0.753
 web+0.745
ind+0.726
Neg logit contributions
 Adam-0.783
ax-0.775
 funn-0.770
 written-0.770
andra-0.742
Token
Token ID198
Feature activation+6.28e-01

Pos logit contributions
oppy+0.162
igs+0.159
 hedge+0.153
 web+0.151
ind+0.149
Neg logit contributions
ax-0.116
 Adam-0.116
 funn-0.116
 written-0.113
andra-0.111
Token very
Token ID845
Feature activation+6.50e-01

Pos logit contributions
ax+0.912
 funn+0.910
 Adam+0.901
andra+0.900
 written+0.889
Neg logit contributions
oppy-0.966
igs-0.927
 hedge-0.913
 web-0.892
ind-0.855
Token angry
Token ID7954
Feature activation-5.03e-01

Pos logit contributions
oppy+1.144
igs+1.125
 hedge+1.086
 web+1.073
 webs+1.025
Neg logit contributions
 funn-0.856
ax-0.845
 Adam-0.826
 written-0.826
andra-0.783
Token and
Token ID290
Feature activation+6.66e-01

Pos logit contributions
 funn+0.737
ax+0.729
 Adam+0.724
andra+0.719
 written+0.716
Neg logit contributions
oppy-0.799
igs-0.772
 hedge-0.753
 web-0.736
ind-0.705
Token he
Token ID339
Feature activation+3.10e-01

Pos logit contributions
oppy+1.347
igs+1.311
 web+1.253
 hedge+1.242
ind+1.211
Neg logit contributions
 funn-0.720
ax-0.708
 Adam-0.707
 written-0.704
andra-0.660
Token wanted
Token ID2227
Feature activation-3.03e-01

Pos logit contributions
oppy+0.610
igs+0.596
 hedge+0.572
 web+0.565
ind+0.546
Neg logit contributions
 funn-0.341
ax-0.335
 written-0.335
 Adam-0.331
andra-0.320
Token the
Token ID262
Feature activation-3.89e+00

Pos logit contributions
 funn+0.370
ax+0.369
 Adam+0.365
 written+0.362
andra+0.356
Neg logit contributions
oppy-0.560
igs-0.542
 hedge-0.529
 web-0.517
ind-0.500
Token steak
Token ID26320
Feature activation-8.35e-01

Pos logit contributions
andra+7.010
ax+6.960
 Sandra+6.784
David+6.433
Whe+6.413
Neg logit contributions
 web-5.788
 hedge-5.765
oppy-5.621
 hive-5.395
 swamp-5.360
Token for
Token ID329
Feature activation+3.58e-01

Pos logit contributions
andra+1.219
ax+1.215
 funn+1.212
 Adam+1.212
 written+1.189
Neg logit contributions
oppy-1.320
igs-1.255
 hedge-1.238
 web-1.210
 webs-1.130
Token himself
Token ID2241
Feature activation-2.13e-01

Pos logit contributions
oppy+0.705
igs+0.677
 hedge+0.665
 web+0.653
ind+0.638
Neg logit contributions
 Adam-0.399
ax-0.392
 funn-0.390
 written-0.382
andra-0.369
Token.
Token ID13
Feature activation+4.89e-01

Pos logit contributions
 funn+0.347
ax+0.347
 Adam+0.344
 written+0.342
andra+0.336
Neg logit contributions
oppy-0.304
igs-0.292
 hedge-0.282
 web-0.274
ind-0.263
Token He
Token ID679
Feature activation+4.77e-01

Pos logit contributions
oppy+0.768
igs+0.742
 web+0.691
 hedge+0.690
ind+0.689
Neg logit contributions
 Adam-0.749
 funn-0.744
ax-0.736
 written-0.725
andra-0.709
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
ax+1.983
 funn+1.975
 Adam+1.974
andra+1.966
 Sandra+1.875
Neg logit contributions
oppy-1.451
 hedge-1.393
igs-1.381
 web-1.361
 webs-1.266
Token Lily
Token ID20037
Feature activation+1.16e-01

Pos logit contributions
ax+1.889
 written+1.885
 funn+1.882
andra+1.854
 Adam+1.852
Neg logit contributions
oppy-1.994
igs-1.895
 hedge-1.892
 web-1.812
 webs-1.707
Token laughed
Token ID13818
Feature activation-1.52e+00

Pos logit contributions
oppy+0.221
igs+0.216
 hedge+0.208
 web+0.204
ind+0.197
Neg logit contributions
 funn-0.135
ax-0.133
 Adam-0.132
 written-0.132
andra-0.129
Token and
Token ID290
Feature activation-5.82e-01

Pos logit contributions
andra+2.136
ax+2.116
 Adam+2.072
 funn+2.044
 topp+1.999
Neg logit contributions
oppy-2.465
 hedge-2.424
igs-2.385
 web-2.347
 webs-2.284
Token thought
Token ID1807
Feature activation-6.63e-02

Pos logit contributions
ax+0.640
 funn+0.635
 Adam+0.627
 written+0.620
andra+0.616
Neg logit contributions
oppy-1.133
igs-1.096
 hedge-1.081
 web-1.053
ind-1.010
Token the
Token ID262
Feature activation-3.88e+00

Pos logit contributions
ax+0.088
 Adam+0.088
 funn+0.088
 written+0.087
andra+0.084
Neg logit contributions
oppy-0.115
igs-0.110
 hedge-0.108
 web-0.105
ind-0.103
Token clown
Token ID25573
Feature activation+1.02e-01

Pos logit contributions
andra+6.257
ax+6.077
 Sandra+5.868
oph+5.771
David+5.368
Neg logit contributions
 hedge-7.062
 swamp-6.890
 web-6.328
 alone-6.137
ind-6.097
Token was
Token ID373
Feature activation-7.11e-01

Pos logit contributions
oppy+0.186
igs+0.182
 hedge+0.175
 web+0.171
ind+0.166
Neg logit contributions
 funn-0.127
 written-0.124
ax-0.124
 Adam-0.123
andra-0.119
Token so
Token ID523
Feature activation-5.73e-02

Pos logit contributions
ax+0.948
andra+0.933
 Adam+0.931
 funn+0.924
 written+0.917
Neg logit contributions
oppy-1.213
igs-1.167
 hedge-1.155
 web-1.127
ind-1.082
Token funny
Token ID8258
Feature activation-4.78e-01

Pos logit contributions
 funn+0.066
ax+0.065
 Adam+0.064
 written+0.064
andra+0.062
Neg logit contributions
oppy-0.110
igs-0.108
 hedge-0.104
 web-0.103
ind-0.099
Token.
Token ID13
Feature activation-7.03e-01

Pos logit contributions
 funn+0.772
ax+0.771
 Adam+0.763
andra+0.757
 written+0.754
Neg logit contributions
oppy-0.684
igs-0.658
 hedge-0.640
 web-0.624
ind-0.592
Token and
Token ID290
Feature activation+1.20e-01

Pos logit contributions
 funn+2.308
andra+2.263
ax+2.242
Whe+2.229
 Adam+2.184
Neg logit contributions
oppy-2.278
 hedge-2.149
igs-2.143
 web-2.105
ind-2.035
Token went
Token ID1816
Feature activation+5.03e-01

Pos logit contributions
oppy+0.252
igs+0.245
 hedge+0.238
 web+0.237
ind+0.230
Neg logit contributions
 funn-0.118
ax-0.117
 Adam-0.116
 written-0.114
andra-0.111
Token to
Token ID284
Feature activation-1.48e+00

Pos logit contributions
oppy+1.008
igs+0.974
 web+0.934
 hedge+0.933
ind+0.891
Neg logit contributions
ax-0.549
 written-0.548
 funn-0.545
 Adam-0.545
andra-0.511
Token talk
Token ID1561
Feature activation-6.08e-01

Pos logit contributions
 funn+2.350
 written+2.331
ax+2.325
andra+2.261
 Birthday+2.210
Neg logit contributions
oppy-2.203
 hedge-2.121
igs-2.070
 web-1.970
 swamp-1.896
Token to
Token ID284
Feature activation-1.33e+00

Pos logit contributions
ax+0.731
 funn+0.728
andra+0.718
 Adam+0.716
 written+0.700
Neg logit contributions
oppy-1.114
igs-1.077
 hedge-1.063
 web-1.040
ind-1.001
Token the
Token ID262
Feature activation-3.87e+00

Pos logit contributions
ax+1.850
 funn+1.842
andra+1.827
 written+1.820
 Adam+1.791
Neg logit contributions
oppy-2.175
 hedge-2.159
igs-2.142
 web-1.984
ind-1.968
Token boy
Token ID2933
Feature activation-6.69e-01

Pos logit contributions
andra+6.092
 nineteen+6.085
ax+6.057
 Sandra+6.017
oph+5.912
Neg logit contributions
 hedge-6.609
ind-6.004
 alone-5.913
 underground-5.867
 swamp-5.854
Token.
Token ID13
Feature activation-5.42e-02

Pos logit contributions
 funn+1.145
ax+1.136
 Adam+1.127
andra+1.123
 written+1.116
Neg logit contributions
oppy-0.903
igs-0.846
 hedge-0.828
 web-0.803
ind-0.765
Token The
Token ID383
Feature activation-3.21e+00

Pos logit contributions
 funn+0.069
ax+0.068
 Adam+0.068
 written+0.067
andra+0.066
Neg logit contributions
oppy-0.098
igs-0.095
 hedge-0.091
 web-0.090
ind-0.087
Token boy
Token ID2933
Feature activation-3.92e-01

Pos logit contributions
ax+4.389
andra+4.309
 Sandra+4.185
oph+4.182
 written+4.127
Neg logit contributions
 hedge-5.803
oppy-5.297
ind-5.240
 swamp-5.171
 poorly-5.149
Token said
Token ID531
Feature activation+2.35e-01

Pos logit contributions
 funn+0.409
ax+0.409
 Adam+0.402
 written+0.397
andra+0.390
Neg logit contributions
oppy-0.786
igs-0.761
 hedge-0.746
 web-0.730
ind-0.709
Token nice
Token ID3621
Feature activation+1.52e-01

Pos logit contributions
oppy+0.136
igs+0.132
 hedge+0.125
 web+0.124
ind+0.120
Neg logit contributions
 funn-0.113
ax-0.111
 Adam-0.110
 written-0.110
andra-0.107
Token hat
Token ID6877
Feature activation-9.40e-01

Pos logit contributions
oppy+0.267
igs+0.261
 hedge+0.247
 web+0.245
ind+0.238
Neg logit contributions
ax-0.192
 funn-0.191
 Adam-0.191
 written-0.190
andra-0.186
Token.
Token ID13
Feature activation-4.80e-01

Pos logit contributions
 funn+1.459
ax+1.447
 Adam+1.441
andra+1.421
 written+1.369
Neg logit contributions
oppy-1.413
 hedge-1.356
igs-1.351
 web-1.317
ind-1.251
Token He
Token ID679
Feature activation+2.74e-01

Pos logit contributions
 funn+0.671
ax+0.670
 Adam+0.654
 written+0.653
andra+0.645
Neg logit contributions
oppy-0.799
igs-0.768
 hedge-0.755
 web-0.732
ind-0.699
Token wanted
Token ID2227
Feature activation-7.95e-01

Pos logit contributions
oppy+0.541
igs+0.528
 hedge+0.511
 web+0.504
ind+0.484
Neg logit contributions
 funn-0.296
 written-0.296
ax-0.292
 Adam-0.292
andra-0.287
Token the
Token ID262
Feature activation-3.86e+00

Pos logit contributions
 funn+0.988
ax+0.986
andra+0.967
 Adam+0.957
 written+0.941
Neg logit contributions
oppy-1.434
igs-1.403
 hedge-1.379
 web-1.335
ind-1.290
Token hat
Token ID6877
Feature activation-1.65e+00

Pos logit contributions
ax+5.278
 Adam+5.234
oph+5.107
andra+4.873
 Birthday+4.866
Neg logit contributions
 hedge-7.295
igs-7.242
 web-7.152
oppy-7.082
 webs-6.495
Token for
Token ID329
Feature activation+3.78e-01

Pos logit contributions
andra+2.798
 Adam+2.744
ax+2.743
 funn+2.718
Whe+2.668
Neg logit contributions
oppy-2.308
 hedge-2.175
igs-2.110
 web-2.049
 webs-1.919
Token himself
Token ID2241
Feature activation-3.64e-01

Pos logit contributions
oppy+0.741
igs+0.712
 hedge+0.706
 web+0.692
ind+0.669
Neg logit contributions
 Adam-0.429
ax-0.418
 funn-0.416
 written-0.415
andra-0.397
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
 funn+0.529
ax+0.526
 Adam+0.522
 written+0.517
andra+0.513
Neg logit contributions
oppy-0.583
igs-0.560
 hedge-0.545
 web-0.532
ind-0.510
Token picked
Token ID6497
Feature activation-1.24e+00

Pos logit contributions
ax+1.692
andra+1.637
 funn+1.620
 Adam+1.613
 written+1.584
Neg logit contributions
oppy-2.177
 hedge-2.132
igs-2.117
 web-2.030
 webs-1.933
Token her
Token ID607
Feature activation+6.82e-02

Pos logit contributions
oppy+1.362
igs+1.291
 hedge+1.198
 web+1.179
ind+1.177
Neg logit contributions
 Adam-1.177
 funn-1.161
 written-1.142
ax-1.138
 vanilla-1.106
Token doll
Token ID3654
Feature activation+4.81e-01

Pos logit contributions
oppy+0.119
igs+0.115
 hedge+0.111
 web+0.108
ind+0.106
Neg logit contributions
 funn-0.090
ax-0.088
 Adam-0.088
 written-0.088
andra-0.085
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
oppy+0.856
igs+0.841
 hedge+0.797
 web+0.764
ind+0.763
Neg logit contributions
 funn-0.607
 written-0.605
ax-0.600
 Adam-0.595
andra-0.573
Token She
Token ID1375
Feature activation+1.06e+00

Pos logit contributions
igs+2.064
oppy+2.053
ind+1.922
 hedge+1.897
 web+1.859
Neg logit contributions
 Adam-1.345
 written-1.279
ax-1.225
 Sandra-1.220
 funn-1.216
Token was
Token ID373
Feature activation+1.03e+00

Pos logit contributions
oppy+2.099
igs+2.074
 hedge+1.963
ind+1.874
 web+1.871
Neg logit contributions
 written-1.216
 funn-1.205
 Adam-1.168
ax-1.133
andra-1.102
Token very
Token ID845
Feature activation+2.35e+00

Pos logit contributions
oppy+1.771
igs+1.711
 hedge+1.624
ind+1.520
 web+1.506
Neg logit contributions
 funn-1.467
 written-1.464
 Adam-1.393
ax-1.392
 vanilla-1.317
Token worried
Token ID7960
Feature activation-4.07e-02

Pos logit contributions
igs+4.703
oppy+4.568
 hedge+4.539
ind+4.233
 web+4.204
Neg logit contributions
 funn-2.745
 written-2.579
ax-2.423
 Adam-2.339
 ding-2.320
Token and
Token ID290
Feature activation+1.56e+00

Pos logit contributions
ax+0.053
 funn+0.053
 Adam+0.053
 written+0.052
andra+0.051
Neg logit contributions
oppy-0.072
igs-0.069
 hedge-0.067
 web-0.065
ind-0.063
Token sad
Token ID6507
Feature activation+7.68e-01

Pos logit contributions
oppy+3.048
igs+3.003
 hedge+2.795
ind+2.775
 web+2.722
Neg logit contributions
 written-1.896
 funn-1.854
 Adam-1.794
ax-1.767
 ding-1.657
Token.
Token ID13
Feature activation+1.30e+00

Pos logit contributions
oppy+1.270
igs+1.219
 hedge+1.160
ind+1.132
 web+1.106
Neg logit contributions
 written-1.124
 funn-1.114
 Adam-1.095
ax-1.084
 vanilla-1.036
Token
Token ID198
Feature activation+1.24e+00

Pos logit contributions
igs+2.612
oppy+2.563
ind+2.402
 hedge+2.356
 web+2.341
Neg logit contributions
 Adam-1.527
 written-1.444
 Sandra-1.380
ax-1.368
 funn-1.367
Token,
Token ID11
Feature activation-1.70e-01

Pos logit contributions
 funn+0.444
ax+0.443
 Adam+0.440
 written+0.437
andra+0.432
Neg logit contributions
oppy-0.372
igs-0.357
 hedge-0.345
 web-0.335
ind-0.320
Token there
Token ID612
Feature activation-4.04e-01

Pos logit contributions
 funn+0.193
ax+0.192
 Adam+0.190
 written+0.188
andra+0.184
Neg logit contributions
oppy-0.325
igs-0.316
 hedge-0.308
 web-0.302
ind-0.293
Token was
Token ID373
Feature activation-1.26e+00

Pos logit contributions
 funn+0.533
ax+0.531
 Adam+0.523
 written+0.520
andra+0.513
Neg logit contributions
oppy-0.702
igs-0.674
 hedge-0.660
 web-0.645
ind-0.620
Token a
Token ID257
Feature activation+6.52e-01

Pos logit contributions
ax+1.382
 funn+1.351
andra+1.325
 Adam+1.306
 written+1.286
Neg logit contributions
oppy-2.500
 hedge-2.405
igs-2.401
 web-2.360
 webs-2.251
Token little
Token ID1310
Feature activation+1.17e+00

Pos logit contributions
oppy+1.247
igs+1.246
 web+1.164
 hedge+1.163
 webs+1.138
Neg logit contributions
 funn-0.729
ax-0.706
 Adam-0.705
 written-0.680
andra-0.679
Token boy
Token ID2933
Feature activation+2.31e+00

Pos logit contributions
igs+2.720
oppy+2.709
 web+2.565
 webs+2.524
 hedge+2.501
Neg logit contributions
 funn-0.835
ax-0.804
 vanilla-0.765
 Adam-0.755
 flavor-0.746
Token named
Token ID3706
Feature activation-5.78e-01

Pos logit contributions
igs+2.605
oppy+2.475
 hedge+2.287
 webs+2.219
 web+2.178
Neg logit contributions
 written-4.562
 flavor-4.516
 Adam-4.496
ax-4.408
 funn-4.396
Token Tim
Token ID5045
Feature activation-6.13e-01

Pos logit contributions
 funn+0.715
ax+0.705
 written+0.701
andra+0.694
 Adam+0.690
Neg logit contributions
oppy-1.066
igs-1.023
 hedge-1.003
 web-0.973
ind-0.943
Tokenmy
Token ID1820
Feature activation+7.78e-01

Pos logit contributions
 funn+0.431
ax+0.423
 Adam+0.421
 written+0.409
andra+0.400
Neg logit contributions
oppy-1.445
igs-1.401
 hedge-1.378
 web-1.358
ind-1.335
Token.
Token ID13
Feature activation-2.06e-01

Pos logit contributions
oppy+1.239
igs+1.221
 hedge+1.148
 web+1.114
 webs+1.074
Neg logit contributions
 written-1.148
 funn-1.134
ax-1.123
 Adam-1.117
andra-1.087
Token Tim
Token ID5045
Feature activation-3.52e-01

Pos logit contributions
 funn+0.244
ax+0.243
 Adam+0.240
 written+0.238
andra+0.233
Neg logit contributions
oppy-0.386
igs-0.374
 hedge-0.364
 web-0.357
ind-0.346
Token,
Token ID11
Feature activation-3.59e-01

Pos logit contributions
 funn+0.364
ax+0.364
 Adam+0.361
 written+0.358
andra+0.354
Neg logit contributions
oppy-0.308
igs-0.296
 hedge-0.286
 web-0.278
ind-0.267
Token there
Token ID612
Feature activation-2.99e-01

Pos logit contributions
 funn+0.411
ax+0.410
 Adam+0.404
 written+0.401
andra+0.395
Neg logit contributions
oppy-0.687
igs-0.664
 hedge-0.650
 web-0.636
ind-0.614
Token was
Token ID373
Feature activation-1.19e+00

Pos logit contributions
 funn+0.387
ax+0.386
 Adam+0.382
 written+0.379
andra+0.373
Neg logit contributions
oppy-0.526
igs-0.507
 hedge-0.494
 web-0.484
ind-0.466
Token a
Token ID257
Feature activation+6.51e-01

Pos logit contributions
ax+1.284
 funn+1.261
andra+1.232
 Adam+1.217
 written+1.204
Neg logit contributions
oppy-2.378
igs-2.297
 hedge-2.288
 web-2.236
ind-2.142
Token little
Token ID1310
Feature activation+1.21e+00

Pos logit contributions
igs+1.255
oppy+1.255
 web+1.175
 hedge+1.172
 webs+1.148
Neg logit contributions
 funn-0.714
ax-0.693
 Adam-0.692
andra-0.668
 written-0.668
Token boy
Token ID2933
Feature activation+2.34e+00

Pos logit contributions
igs+2.825
oppy+2.816
 web+2.676
 webs+2.631
 hedge+2.601
Neg logit contributions
 funn-0.861
ax-0.826
 vanilla-0.792
 Adam-0.780
 flavor-0.771
Token named
Token ID3706
Feature activation-6.30e-01

Pos logit contributions
igs+2.630
oppy+2.474
 hedge+2.330
 webs+2.291
 web+2.260
Neg logit contributions
 written-4.591
 Adam-4.556
 flavor-4.553
ax-4.449
 Birthday-4.426
Token Tim
Token ID5045
Feature activation-7.24e-01

Pos logit contributions
 funn+0.791
ax+0.780
 written+0.775
andra+0.771
 Adam+0.762
Neg logit contributions
oppy-1.157
igs-1.108
 hedge-1.087
 web-1.050
ind-1.020
Tokenmy
Token ID1820
Feature activation+8.94e-01

Pos logit contributions
 funn+0.466
 Adam+0.455
ax+0.452
 written+0.442
andra+0.429
Neg logit contributions
oppy-1.751
igs-1.698
 hedge-1.670
 web-1.645
ind-1.626
Token.
Token ID13
Feature activation-1.05e-01

Pos logit contributions
oppy+1.443
igs+1.427
 hedge+1.339
 web+1.305
 webs+1.261
Neg logit contributions
 written-1.300
 funn-1.275
ax-1.269
 Adam-1.265
andra-1.228
Token Tim
Token ID5045
Feature activation-4.06e-01

Pos logit contributions
 funn+0.123
ax+0.122
 Adam+0.121
 written+0.120
andra+0.117
Neg logit contributions
oppy-0.198
igs-0.191
 hedge-0.186
 web-0.183
ind-0.178
Token her
Token ID607
Feature activation-1.06e+00

Pos logit contributions
oppy+0.479
igs+0.461
 hedge+0.454
 web+0.445
ind+0.437
Neg logit contributions
 funn-0.234
 Adam-0.229
 written-0.228
ax-0.226
andra-0.216
Token best
Token ID1266
Feature activation-8.48e-01

Pos logit contributions
ax+1.452
andra+1.406
 Adam+1.400
 written+1.399
 funn+1.395
Neg logit contributions
oppy-1.819
igs-1.744
 hedge-1.709
 web-1.666
ind-1.576
Token and
Token ID290
Feature activation+4.48e-01

Pos logit contributions
ax+1.290
 funn+1.285
 Adam+1.268
andra+1.256
 written+1.251
Neg logit contributions
oppy-1.314
igs-1.264
 hedge-1.226
 web-1.197
ind-1.131
Token feel
Token ID1254
Feature activation+2.06e+00

Pos logit contributions
oppy+0.791
igs+0.768
 hedge+0.734
ind+0.723
 web+0.719
Neg logit contributions
 funn-0.596
 Adam-0.584
 written-0.580
ax-0.571
andra-0.555
Token proud
Token ID6613
Feature activation+1.23e-01

Pos logit contributions
oppy+4.615
igs+4.546
ind+4.287
 hedge+4.228
 web+4.203
Neg logit contributions
 funn-2.241
 written-2.147
 Adam-1.950
ax-1.922
 vanilla-1.794
Token of
Token ID286
Feature activation+2.44e+00

Pos logit contributions
oppy+0.196
igs+0.189
 hedge+0.182
 web+0.177
ind+0.173
Neg logit contributions
 funn-0.183
 written-0.181
ax-0.180
 Adam-0.180
andra-0.173
Token herself
Token ID5223
Feature activation+4.26e-01

Pos logit contributions
oppy+4.873
igs+4.668
ind+4.337
 hedge+4.323
 web+4.244
Neg logit contributions
 Adam-3.329
 written-3.081
 funn-2.983
ax-2.875
 Sandra-2.800
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
oppy+0.670
igs+0.651
 hedge+0.620
 web+0.597
ind+0.592
Neg logit contributions
 written-0.642
 funn-0.637
 Adam-0.637
ax-0.629
andra-0.603
Token Chloe
Token ID29476
Feature activation+5.47e-01

Pos logit contributions
igs+3.332
oppy+3.239
ind+3.105
 hedge+3.077
 swamp+3.033
Neg logit contributions
 Adam-1.799
 written-1.655
 funn-1.591
ax-1.562
andra-1.560
Token thought
Token ID1807
Feature activation+6.44e-01

Pos logit contributions
oppy+1.086
igs+1.075
 hedge+1.033
 web+1.015
ind+0.991
Neg logit contributions
 Adam-0.594
ax-0.590
 written-0.587
 funn-0.586
andra-0.579
Token about
Token ID546
Feature activation+7.47e-01

Pos logit contributions
oppy+1.265
igs+1.212
 hedge+1.196
ind+1.162
 web+1.157
Neg logit contributions
 Adam-0.737
 written-0.720
ax-0.711
 funn-0.709
andra-0.661
TokenThe
Token ID464
Feature activation+7.65e-02

Pos logit contributions
oppy+1.665
igs+1.592
 hedge+1.542
 web+1.508
 webs+1.446
Neg logit contributions
ax-1.401
 Adam-1.397
 funn-1.381
 written-1.365
andra-1.347
Token moral
Token ID6573
Feature activation+1.09e+00

Pos logit contributions
oppy+0.130
igs+0.125
 hedge+0.120
 web+0.118
ind+0.113
Neg logit contributions
 funn-0.106
ax-0.104
 Adam-0.103
 written-0.103
andra-0.101
Token of
Token ID286
Feature activation+4.90e-01

Pos logit contributions
oppy+2.064
igs+1.951
 web+1.854
 hedge+1.853
 webs+1.785
Neg logit contributions
 written-1.416
 funn-1.372
 Adam-1.345
ax-1.340
 flavor-1.301
Token the
Token ID262
Feature activation+1.50e-01

Pos logit contributions
oppy+0.829
igs+0.806
 hedge+0.774
 web+0.755
ind+0.730
Neg logit contributions
 Adam-0.679
 funn-0.661
ax-0.661
 written-0.654
andra-0.645
Token story
Token ID1621
Feature activation+1.39e+00

Pos logit contributions
oppy+0.293
igs+0.283
 hedge+0.273
 web+0.271
ind+0.261
Neg logit contributions
 funn-0.169
ax-0.164
 Adam-0.164
 written-0.163
andra-0.159
Token is
Token ID318
Feature activation+2.69e+00

Pos logit contributions
oppy+2.677
igs+2.630
 hedge+2.463
ind+2.438
 web+2.421
Neg logit contributions
 written-1.665
 Adam-1.662
 funn-1.562
ax-1.512
 flavor-1.485
Token that
Token ID326
Feature activation+1.39e+00

Pos logit contributions
igs+5.526
oppy+5.429
 web+5.388
 webs+5.373
ind+5.161
Neg logit contributions
 written-2.924
 Adam-2.763
ax-2.752
 funn-2.613
 Sandra-2.591
Token being
Token ID852
Feature activation+6.15e-01

Pos logit contributions
oppy+2.802
igs+2.689
 hedge+2.529
ind+2.524
 web+2.507
Neg logit contributions
 funn-1.548
 written-1.521
 Adam-1.503
 flavor-1.434
 vanilla-1.366
Token kind
Token ID1611
Feature activation+4.54e-01

Pos logit contributions
oppy+0.989
igs+0.946
 hedge+0.902
 web+0.878
ind+0.854
Neg logit contributions
 funn-0.950
 written-0.919
 Adam-0.901
ax-0.900
andra-0.864
Token and
Token ID290
Feature activation+8.18e-01

Pos logit contributions
oppy+0.828
igs+0.812
 hedge+0.771
 web+0.756
ind+0.749
Neg logit contributions
 written-0.557
 funn-0.554
 Adam-0.551
ax-0.546
andra-0.529
Token sharing
Token ID7373
Feature activation+5.44e-01

Pos logit contributions
oppy+1.364
igs+1.290
 web+1.227
 hedge+1.227
ind+1.177
Neg logit contributions
 funn-1.208
 written-1.184
 Adam-1.154
ax-1.135
andra-1.093
Token was
Token ID373
Feature activation-8.67e-01

Pos logit contributions
oppy+0.338
igs+0.329
 hedge+0.320
 web+0.312
ind+0.302
Neg logit contributions
ax-0.217
 funn-0.217
 Adam-0.217
 written-0.214
andra-0.211
Token so
Token ID523
Feature activation-6.21e-01

Pos logit contributions
ax+1.042
 funn+1.034
andra+1.026
 Adam+1.017
 written+1.004
Neg logit contributions
oppy-1.573
igs-1.519
 hedge-1.494
 web-1.481
ind-1.407
Token proud
Token ID6613
Feature activation-5.74e-01

Pos logit contributions
ax+0.856
 funn+0.856
 Adam+0.848
andra+0.842
 written+0.840
Neg logit contributions
oppy-1.036
igs-0.989
 hedge-0.965
 web-0.943
ind-0.905
Token of
Token ID286
Feature activation+1.45e+00

Pos logit contributions
 funn+0.863
ax+0.856
 Adam+0.848
andra+0.841
 written+0.837
Neg logit contributions
oppy-0.888
igs-0.854
 hedge-0.829
 web-0.811
ind-0.772
Token himself
Token ID2241
Feature activation+2.42e-01

Pos logit contributions
oppy+3.101
igs+2.889
 hedge+2.850
ind+2.824
 web+2.822
Neg logit contributions
 Adam-1.606
ax-1.482
 written-1.477
 funn-1.476
 Sandra-1.362
Token for
Token ID329
Feature activation+5.62e-01

Pos logit contributions
oppy+0.413
igs+0.403
 hedge+0.390
 web+0.379
ind+0.372
Neg logit contributions
 Adam-0.325
ax-0.325
 funn-0.324
 written-0.323
andra-0.310
Token being
Token ID852
Feature activation-1.21e-01

Pos logit contributions
oppy+1.071
igs+1.017
 hedge+0.994
 web+0.969
ind+0.955
Neg logit contributions
 Adam-0.698
 written-0.681
 funn-0.677
ax-0.665
andra-0.630
Token successful
Token ID4388
Feature activation-3.60e-02

Pos logit contributions
 funn+0.150
ax+0.149
 Adam+0.148
 written+0.147
andra+0.143
Neg logit contributions
oppy-0.221
igs-0.214
 hedge-0.208
 web-0.204
ind-0.197
Token with
Token ID351
Feature activation+1.40e+00

Pos logit contributions
 funn+0.054
ax+0.054
 Adam+0.054
 written+0.053
andra+0.052
Neg logit contributions
oppy-0.056
igs-0.054
 hedge-0.052
 web-0.051
ind-0.049
Token his
Token ID465
Feature activation+9.26e-01

Pos logit contributions
oppy+2.782
igs+2.610
 hedge+2.594
 web+2.554
ind+2.479
Neg logit contributions
 Adam-1.715
 written-1.629
ax-1.613
 funn-1.584
 Sandra-1.482
Token plan
Token ID1410
Feature activation+1.33e-01

Pos logit contributions
oppy+1.676
igs+1.620
 hedge+1.571
 web+1.542
ind+1.504
Neg logit contributions
 Adam-1.187
 funn-1.163
 written-1.158
ax-1.124
andra-1.116
Token rabbit
Token ID22746
Feature activation+2.13e-01

Pos logit contributions
oppy+0.200
igs+0.192
 hedge+0.179
 web+0.177
ind+0.172
Neg logit contributions
 funn-0.221
ax-0.216
 written-0.214
 Adam-0.213
andra-0.211
Token ran
Token ID4966
Feature activation+7.62e-01

Pos logit contributions
oppy+0.390
igs+0.378
 hedge+0.363
 web+0.352
ind+0.350
Neg logit contributions
 funn-0.262
ax-0.258
 written-0.257
 Adam-0.254
andra-0.248
Token away
Token ID1497
Feature activation+1.57e+00

Pos logit contributions
oppy+1.375
igs+1.305
 hedge+1.225
 web+1.205
ind+1.190
Neg logit contributions
ax-1.016
 funn-1.012
 written-1.010
 Adam-1.002
andra-0.924
Token.
Token ID13
Feature activation+6.75e-01

Pos logit contributions
oppy+2.867
igs+2.744
ind+2.538
 hedge+2.518
 web+2.409
Neg logit contributions
 funn-2.086
 written-2.081
 Adam-2.022
ax-2.010
 announced-1.870
Token The
Token ID383
Feature activation+1.22e+00

Pos logit contributions
oppy+1.171
igs+1.154
 hedge+1.093
 web+1.090
ind+1.080
Neg logit contributions
ax-0.922
 Adam-0.920
 funn-0.884
 written-0.873
andra-0.863
Token wolf
Token ID17481
Feature activation+4.98e-01

Pos logit contributions
oppy+1.995
igs+1.944
 web+1.754
ind+1.673
 hedge+1.664
Neg logit contributions
 funn-1.883
 written-1.740
ax-1.735
andra-1.715
 son-1.688
Token had
Token ID550
Feature activation-5.08e-01

Pos logit contributions
oppy+0.959
igs+0.935
 hedge+0.884
 web+0.863
ind+0.860
Neg logit contributions
 funn-0.572
ax-0.567
 written-0.566
 Adam-0.559
andra-0.546
Token improved
Token ID6596
Feature activation+7.28e-01

Pos logit contributions
ax+0.564
 funn+0.563
 Adam+0.558
 written+0.546
andra+0.546
Neg logit contributions
oppy-0.979
igs-0.951
 hedge-0.931
 web-0.915
ind-0.876
Token!
Token ID0
Feature activation+7.74e-01

Pos logit contributions
oppy+1.351
igs+1.282
 hedge+1.239
 web+1.206
ind+1.197
Neg logit contributions
 funn-0.911
 Adam-0.911
 written-0.906
ax-0.882
andra-0.843
Token It
Token ID632
Feature activation+7.22e-01

Pos logit contributions
oppy+1.496
igs+1.479
 hedge+1.415
 web+1.406
ind+1.398
Neg logit contributions
ax-0.908
 Adam-0.901
 funn-0.850
 written-0.842
andra-0.830
Token was
Token ID373
Feature activation-1.43e-01

Pos logit contributions
oppy+1.381
igs+1.343
 hedge+1.272
ind+1.251
 web+1.233
Neg logit contributions
 funn-0.876
 written-0.853
ax-0.829
 Adam-0.822
andra-0.800
Token none
Token ID4844
Feature activation+1.04e-01

Pos logit contributions
oppy+1.417
igs+1.392
 hedge+1.315
 web+1.315
ind+1.275
Neg logit contributions
 Adam-0.968
 written-0.956
 funn-0.935
ax-0.921
 vanilla-0.887
Token was
Token ID373
Feature activation-4.87e-02

Pos logit contributions
oppy+0.174
igs+0.170
 hedge+0.163
 web+0.159
ind+0.155
Neg logit contributions
ax-0.146
 written-0.145
 funn-0.145
 Adam-0.144
andra-0.140
Token more
Token ID517
Feature activation+6.76e-01

Pos logit contributions
 funn+0.057
ax+0.057
 Adam+0.057
 written+0.056
andra+0.054
Neg logit contributions
oppy-0.092
igs-0.089
 hedge-0.087
 web-0.085
ind-0.082
Token special
Token ID2041
Feature activation+1.70e-01

Pos logit contributions
oppy+1.107
igs+1.045
 hedge+1.009
 web+0.970
ind+0.934
Neg logit contributions
 written-1.036
 funn-1.017
 Adam-1.003
ax-1.003
andra-0.951
Token than
Token ID621
Feature activation+2.24e-01

Pos logit contributions
oppy+0.277
igs+0.267
 hedge+0.257
 web+0.250
ind+0.242
Neg logit contributions
 written-0.246
 funn-0.245
 Adam-0.244
ax-0.242
andra-0.233
Token her
Token ID607
Feature activation+5.15e-01

Pos logit contributions
oppy+0.406
igs+0.393
 hedge+0.383
 web+0.376
ind+0.362
Neg logit contributions
 Adam-0.283
ax-0.281
 written-0.281
 funn-0.279
andra-0.268
Token last
Token ID938
Feature activation-2.99e-01

Pos logit contributions
oppy+0.897
igs+0.890
 hedge+0.836
 web+0.820
ind+0.797
Neg logit contributions
 Adam-0.676
 funn-0.675
 written-0.670
ax-0.658
andra-0.638
Token request
Token ID2581
Feature activation+3.00e-01

Pos logit contributions
 funn+0.369
ax+0.368
 Adam+0.364
 written+0.361
andra+0.355
Neg logit contributions
oppy-0.542
igs-0.524
 hedge-0.511
 web-0.501
ind-0.484
Token.
Token ID13
Feature activation+6.64e-01

Pos logit contributions
oppy+0.451
igs+0.438
 hedge+0.418
 web+0.408
ind+0.394
Neg logit contributions
 written-0.471
 Adam-0.468
ax-0.466
 funn-0.461
andra-0.450
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
oppy+1.257
igs+1.250
 web+1.190
 hedge+1.186
ind+1.149
Neg logit contributions
 Adam-0.795
ax-0.778
 funn-0.757
 written-0.751
andra-0.730
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
oppy+0.293
igs+0.287
 hedge+0.279
 web+0.275
ind+0.263
Neg logit contributions
 Adam-0.174
ax-0.174
 funn-0.170
 written-0.170
andra-0.166
Token that
Token ID326
Feature activation+1.48e+00

Pos logit contributions
igs+7.344
 webs+7.033
oppy+7.012
 web+6.974
ind+6.788
Neg logit contributions
 written-3.168
 Adam-3.092
 come-2.876
 Sandra-2.814
 Birthday-2.811
Token even
Token ID772
Feature activation+3.20e-01

Pos logit contributions
oppy+3.049
igs+2.947
ind+2.765
 hedge+2.707
 web+2.676
Neg logit contributions
 written-1.632
 funn-1.624
 Adam-1.618
 flavor-1.467
 son-1.408
Token when
Token ID618
Feature activation+8.18e-01

Pos logit contributions
oppy+0.598
igs+0.577
 hedge+0.559
 web+0.551
ind+0.536
Neg logit contributions
 funn-0.395
 written-0.379
 Adam-0.374
ax-0.372
andra-0.355
Token we
Token ID356
Feature activation+1.03e+00

Pos logit contributions
oppy+1.486
igs+1.421
 web+1.392
 hedge+1.391
ind+1.336
Neg logit contributions
 Adam-1.014
 funn-1.000
 written-0.989
ax-0.958
 flavor-0.950
Token stumble
Token ID35174
Feature activation+8.01e-01

Pos logit contributions
igs+2.031
oppy+2.019
 hedge+1.905
 web+1.903
ind+1.896
Neg logit contributions
 funn-1.160
 written-1.150
ax-1.105
 Adam-1.074
andra-1.034
Token,
Token ID11
Feature activation+2.55e-01

Pos logit contributions
oppy+1.470
igs+1.408
 hedge+1.349
 web+1.347
ind+1.313
Neg logit contributions
 funn-0.982
 written-0.968
 Adam-0.967
ax-0.967
andra-0.902
Token we
Token ID356
Feature activation+2.04e+00

Pos logit contributions
oppy+0.466
igs+0.450
 hedge+0.432
 web+0.426
ind+0.418
Neg logit contributions
 funn-0.314
 Adam-0.309
 written-0.306
ax-0.305
andra-0.293
Token have
Token ID423
Feature activation+7.26e-01

Pos logit contributions
igs+3.758
oppy+3.739
ind+3.618
 web+3.448
 hedge+3.447
Neg logit contributions
 funn-2.606
 written-2.601
ax-2.368
 Adam-2.305
 announced-2.280
Token friends
Token ID2460
Feature activation+1.19e+00

Pos logit contributions
oppy+1.502
igs+1.441
 hedge+1.399
ind+1.369
 web+1.366
Neg logit contributions
 funn-0.761
 written-0.745
 Adam-0.721
ax-0.711
andra-0.677
Token to
Token ID284
Feature activation+9.07e-01

Pos logit contributions
oppy+2.201
igs+2.147
 hedge+2.055
 web+1.998
ind+1.957
Neg logit contributions
 written-1.504
 funn-1.455
 Adam-1.454
ax-1.439
 Birthday-1.347
Token help
Token ID1037
Feature activation+3.86e-01

Pos logit contributions
oppy+1.805
igs+1.754
 hedge+1.688
 web+1.665
ind+1.650
Neg logit contributions
 funn-0.978
ax-0.944
 Adam-0.936
 written-0.935
andra-0.891
Token always
Token ID1464
Feature activation-2.09e-01

Pos logit contributions
 Adam+1.109
ax+1.106
 funn+1.091
andra+1.091
 written+1.071
Neg logit contributions
oppy-1.278
igs-1.260
 hedge-1.230
 web-1.205
ind-1.133
Token safe
Token ID3338
Feature activation+4.58e-01

Pos logit contributions
 funn+0.276
ax+0.275
 Adam+0.272
 written+0.271
andra+0.265
Neg logit contributions
oppy-0.361
igs-0.348
 hedge-0.339
 web-0.331
ind-0.320
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
oppy+0.823
igs+0.789
 hedge+0.758
 web+0.744
ind+0.721
Neg logit contributions
 written-0.580
 funn-0.579
ax-0.576
 Adam-0.574
andra-0.545
Token From
Token ID3574
Feature activation+9.97e-01

Pos logit contributions
igs+1.951
oppy+1.900
 web+1.797
ind+1.791
 hedge+1.773
Neg logit contributions
 Adam-1.819
ax-1.770
andra-1.717
 Sandra-1.712
 written-1.688
Token that
Token ID326
Feature activation+9.14e-01

Pos logit contributions
oppy+1.754
igs+1.666
 web+1.611
 hedge+1.579
 webs+1.524
Neg logit contributions
ax-1.366
 written-1.349
 funn-1.336
 Adam-1.324
andra-1.303
Token day
Token ID1110
Feature activation+8.99e-01

Pos logit contributions
oppy+1.578
igs+1.552
 hedge+1.438
 web+1.429
ind+1.424
Neg logit contributions
 funn-1.232
 written-1.223
 Adam-1.204
ax-1.189
andra-1.170
Token on
Token ID319
Feature activation+2.53e+00

Pos logit contributions
oppy+1.812
igs+1.771
 hedge+1.753
 web+1.734
ind+1.704
Neg logit contributions
 written-0.921
ax-0.920
 Adam-0.906
 funn-0.870
andra-0.846
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
ind+5.081
oppy+4.799
 web+4.773
igs+4.760
 webs+4.613
Neg logit contributions
 Adam-2.984
 Sandra-2.686
andra-2.631
 flavor-2.588
ax-2.557
Token she
Token ID673
Feature activation+8.88e-01

Pos logit contributions
oppy+1.921
igs+1.919
 web+1.836
ind+1.832
 hedge+1.807
Neg logit contributions
 funn-1.486
 Adam-1.482
ax-1.434
 written-1.423
 Sandra-1.389
Token made
Token ID925
Feature activation-1.58e+00

Pos logit contributions
oppy+1.622
igs+1.561
 hedge+1.512
 web+1.484
ind+1.468
Neg logit contributions
 written-1.109
 funn-1.108
ax-1.092
andra-1.077
 Adam-1.062
Token sure
Token ID1654
Feature activation+3.09e-01

Pos logit contributions
ax+1.643
andra+1.630
 written+1.617
 Adam+1.613
 funn+1.568
Neg logit contributions
igs-3.215
oppy-3.174
 hedge-3.068
 web-3.065
 webs-2.963
Token cross
Token ID3272
Feature activation+5.21e-01

Pos logit contributions
 funn+0.261
ax+0.260
 Adam+0.257
 written+0.254
andra+0.249
Neg logit contributions
oppy-0.538
igs-0.522
 hedge-0.510
 web-0.501
ind-0.486
Token the
Token ID262
Feature activation+5.41e-01

Pos logit contributions
oppy+0.942
igs+0.879
 hedge+0.841
 web+0.831
ind+0.804
Neg logit contributions
 written-0.684
 Adam-0.682
 funn-0.679
ax-0.677
andra-0.641
Token road
Token ID2975
Feature activation+1.47e-01

Pos logit contributions
oppy+0.904
igs+0.848
 hedge+0.773
 web+0.768
ind+0.749
Neg logit contributions
 funn-0.806
 Adam-0.779
ax-0.772
 written-0.771
andra-0.736
Token because
Token ID780
Feature activation+1.05e+00

Pos logit contributions
oppy+0.238
igs+0.228
 hedge+0.218
 web+0.213
ind+0.206
Neg logit contributions
ax-0.213
 funn-0.212
 Adam-0.212
 written-0.212
andra-0.204
Token there
Token ID612
Feature activation+2.83e-01

Pos logit contributions
oppy+2.110
igs+1.959
 web+1.883
ind+1.855
 hedge+1.852
Neg logit contributions
 Adam-1.212
 written-1.188
 funn-1.151
ax-1.148
 vanilla-1.123
Token were
Token ID547
Feature activation-3.47e-01

Pos logit contributions
oppy+0.572
igs+0.557
 hedge+0.529
 web+0.524
ind+0.522
Neg logit contributions
 funn-0.296
 written-0.296
 Adam-0.293
ax-0.291
andra-0.280
Token too
Token ID1165
Feature activation+1.27e+00

Pos logit contributions
ax+0.453
 funn+0.453
 Adam+0.448
 written+0.444
andra+0.439
Neg logit contributions
oppy-0.602
igs-0.583
 hedge-0.569
 web-0.556
ind-0.536
Token many
Token ID867
Feature activation-2.74e-02

Pos logit contributions
oppy+1.920
igs+1.706
 web+1.660
 hedge+1.645
 Bubble+1.559
Neg logit contributions
 written-2.142
 Adam-2.089
ax-2.065
 funn-2.020
 flavor-2.010
Token big
Token ID1263
Feature activation-5.20e-01

Pos logit contributions
 funn+0.037
ax+0.037
 written+0.037
 Adam+0.037
andra+0.035
Neg logit contributions
oppy-0.047
igs-0.045
 hedge-0.043
 web-0.043
ind-0.041
Token cars
Token ID5006
Feature activation-3.48e-01

Pos logit contributions
ax+0.729
 funn+0.727
 Adam+0.720
andra+0.713
 written+0.708
Neg logit contributions
oppy-0.849
igs-0.829
 hedge-0.809
 web-0.789
ind-0.755
Token driving
Token ID5059
Feature activation+1.67e-01

Pos logit contributions
 funn+0.544
ax+0.541
 Adam+0.538
 written+0.532
andra+0.528
Neg logit contributions
oppy-0.516
igs-0.496
 hedge-0.482
 web-0.469
ind-0.449
Token Sally
Token ID25737
Feature activation-1.04e-01

Pos logit contributions
 funn+0.816
ax+0.806
 written+0.802
 Adam+0.798
andra+0.772
Neg logit contributions
oppy-1.219
igs-1.164
 hedge-1.158
 web-1.114
ind-1.076
Token both
Token ID1111
Feature activation+2.49e-01

Pos logit contributions
 funn+0.120
ax+0.119
 Adam+0.118
 written+0.117
andra+0.114
Neg logit contributions
oppy-0.199
igs-0.193
 hedge-0.188
 web-0.184
ind-0.179
Token laughed
Token ID13818
Feature activation-6.23e-01

Pos logit contributions
oppy+0.497
igs+0.488
 hedge+0.471
 web+0.459
ind+0.455
Neg logit contributions
 funn-0.269
ax-0.267
 Adam-0.265
 written-0.262
andra-0.253
Token and
Token ID290
Feature activation-4.61e-01

Pos logit contributions
ax+0.814
 funn+0.807
 Adam+0.796
andra+0.794
 written+0.786
Neg logit contributions
oppy-1.080
igs-1.043
 hedge-1.027
 web-1.002
ind-0.960
Token laughed
Token ID13818
Feature activation-3.14e-01

Pos logit contributions
ax+0.489
 funn+0.489
 Adam+0.480
 written+0.476
andra+0.470
Neg logit contributions
oppy-0.918
igs-0.888
 hedge-0.874
 web-0.855
ind-0.822
Token.
Token ID13
Feature activation+3.27e-02

Pos logit contributions
 funn+0.460
ax+0.459
 Adam+0.455
 written+0.451
andra+0.446
Neg logit contributions
oppy-0.499
igs-0.480
 hedge-0.467
 web-0.455
ind-0.437
Token
Token ID198
Feature activation+7.03e-01

Pos logit contributions
oppy+0.049
igs+0.047
 hedge+0.045
 web+0.045
ind+0.043
Neg logit contributions
ax-0.051
 Adam-0.051
 funn-0.051
 written-0.050
andra-0.050
Token
Token ID198
Feature activation+5.28e-01

Pos logit contributions
oppy+1.405
igs+1.366
 hedge+1.341
 web+1.321
ind+1.278
Neg logit contributions
 Adam-0.746
ax-0.744
 written-0.716
andra-0.707
 funn-0.693
TokenFrom
Token ID4863
Feature activation+2.59e-01

Pos logit contributions
oppy+0.901
igs+0.854
 hedge+0.832
 web+0.822
 webs+0.783
Neg logit contributions
ax-0.735
 funn-0.719
 Adam-0.717
 written-0.715
andra-0.714
Token that
Token ID326
Feature activation+6.19e-02

Pos logit contributions
oppy+0.400
igs+0.381
 hedge+0.365
 web+0.363
ind+0.343
Neg logit contributions
 funn-0.398
ax-0.397
 Adam-0.395
 written-0.393
andra-0.383
Token day
Token ID1110
Feature activation+8.11e-01

Pos logit contributions
oppy+0.095
igs+0.092
 hedge+0.087
 web+0.085
ind+0.083
Neg logit contributions
 funn-0.095
 Adam-0.094
ax-0.094
 written-0.093
andra-0.091
Token of
Token ID286
Feature activation+1.04e+00

Pos logit contributions
oppy+0.958
igs+0.912
 hedge+0.905
 web+0.871
 swamp+0.825
Neg logit contributions
 written-0.727
ax-0.726
 funn-0.720
 Adam-0.714
 vanilla-0.669
Token you
Token ID345
Feature activation+1.49e-02

Pos logit contributions
oppy+1.914
igs+1.820
 hedge+1.782
 web+1.740
 swamp+1.648
Neg logit contributions
 Adam-1.350
ax-1.292
 funn-1.290
 written-1.270
 vanilla-1.235
Token."
Token ID526
Feature activation+5.18e-02

Pos logit contributions
oppy+0.025
igs+0.024
 hedge+0.024
 web+0.023
ind+0.022
Neg logit contributions
 funn-0.020
ax-0.020
 Adam-0.020
 written-0.020
andra-0.019
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
oppy+0.097
igs+0.095
 hedge+0.092
 web+0.090
ind+0.088
Neg logit contributions
 Adam-0.062
ax-0.061
 funn-0.060
 written-0.059
andra-0.059
Token
Token ID198
Feature activation+7.06e-01

Pos logit contributions
oppy+0.251
igs+0.244
 hedge+0.238
 web+0.233
ind+0.226
Neg logit contributions
ax-0.162
 Adam-0.162
 funn-0.159
 written-0.157
andra-0.156
TokenBen
Token ID11696
Feature activation-3.64e-02

Pos logit contributions
oppy+1.264
igs+1.217
 hedge+1.178
 web+1.156
 webs+1.114
Neg logit contributions
 Adam-0.936
ax-0.934
 funn-0.917
andra-0.911
 written-0.890
Token shook
Token ID14682
Feature activation-4.71e-01

Pos logit contributions
 funn+0.038
ax+0.038
 Adam+0.038
 written+0.037
andra+0.037
Neg logit contributions
oppy-0.073
igs-0.071
 hedge-0.069
 web-0.068
ind-0.066
Token his
Token ID465
Feature activation-2.26e-01

Pos logit contributions
 funn+0.671
ax+0.671
 written+0.658
 Adam+0.657
andra+0.650
Neg logit contributions
oppy-0.772
igs-0.744
 hedge-0.727
 web-0.710
ind-0.684
Token head
Token ID1182
Feature activation-5.85e-01

Pos logit contributions
 funn+0.317
ax+0.315
 Adam+0.313
 written+0.310
andra+0.306
Neg logit contributions
oppy-0.376
igs-0.362
 hedge-0.352
 web-0.343
ind-0.331
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
 funn+0.817
ax+0.811
 Adam+0.805
andra+0.793
 written+0.789
Neg logit contributions
oppy-0.960
igs-0.923
 hedge-0.904
 web-0.884
ind-0.851
Token He
Token ID679
Feature activation+1.86e-01

Pos logit contributions
oppy+0.675
igs+0.653
 hedge+0.626
 web+0.616
ind+0.611
Neg logit contributions
 Adam-0.468
 funn-0.460
ax-0.458
 written-0.450
andra-0.439
Token.
Token ID13
Feature activation+6.44e-01

Pos logit contributions
ax+2.846
andra+2.820
 funn+2.762
 Adam+2.749
 Sandra+2.702
Neg logit contributions
oppy-2.141
 hedge-1.983
 web-1.933
igs-1.908
ind-1.849
Token
Token ID198
Feature activation-1.21e-01

Pos logit contributions
oppy+1.129
igs+1.117
 web+1.062
 hedge+1.053
ind+1.029
Neg logit contributions
 Adam-0.857
ax-0.842
 funn-0.815
 written-0.812
andra-0.810
Token
Token ID198
Feature activation+5.04e-01

Pos logit contributions
ax+0.167
 funn+0.166
 Adam+0.165
 written+0.164
andra+0.161
Neg logit contributions
oppy-0.202
igs-0.196
 hedge-0.190
 web-0.186
ind-0.179
TokenLater
Token ID18602
Feature activation-1.01e+00

Pos logit contributions
oppy+0.806
igs+0.757
 hedge+0.729
 web+0.721
 webs+0.685
Neg logit contributions
 funn-0.758
ax-0.756
 written-0.749
 Adam-0.745
andra-0.733
Token that
Token ID326
Feature activation-1.11e+00

Pos logit contributions
 funn+1.880
ax+1.836
 written+1.811
 Adam+1.809
andra+1.791
Neg logit contributions
oppy-1.264
igs-1.180
 hedge-1.179
 web-1.100
ind-1.052
Token day
Token ID1110
Feature activation-2.94e-01

Pos logit contributions
ax+2.004
 funn+1.998
 Adam+1.942
 written+1.924
andra+1.913
Neg logit contributions
oppy-1.430
 hedge-1.358
igs-1.311
 web-1.267
ind-1.221
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
 funn+0.490
ax+0.489
 Adam+0.485
 written+0.482
andra+0.476
Neg logit contributions
oppy-0.408
igs-0.390
 hedge-0.377
 web-0.366
ind-0.350
Token Tim
Token ID5045
Feature activation-1.70e+00

Pos logit contributions
ax+1.545
 funn+1.516
andra+1.512
 written+1.490
 Adam+1.483
Neg logit contributions
oppy-1.856
igs-1.787
 hedge-1.781
 web-1.668
 swamp-1.614
Tokenmy
Token ID1820
Feature activation-5.65e-01

Pos logit contributions
 funn+1.425
 Adam+1.366
 written+1.341
 nineteen+1.314
ax+1.307
Neg logit contributions
oppy-3.945
igs-3.650
 hedge-3.636
ind-3.589
 web-3.576
Token's
Token ID338
Feature activation-3.34e-01

Pos logit contributions
 funn+0.643
ax+0.635
 Adam+0.625
 written+0.618
andra+0.605
Neg logit contributions
oppy-1.088
igs-1.047
 hedge-1.026
 web-1.006
ind-0.975
Token mom
Token ID1995
Feature activation-1.69e+00

Pos logit contributions
 funn+0.432
ax+0.431
 Adam+0.427
 written+0.423
andra+0.417
Neg logit contributions
oppy-0.591
igs-0.570
 hedge-0.556
 web-0.543
ind-0.525
Token and
Token ID290
Feature activation-5.16e-01

Pos logit contributions
oppy+0.795
igs+0.788
 hedge+0.733
 web+0.715
ind+0.699
Neg logit contributions
 funn-0.665
 Adam-0.660
 written-0.658
ax-0.657
andra-0.631
Token said
Token ID531
Feature activation+2.97e-01

Pos logit contributions
ax+0.477
 funn+0.473
 Adam+0.467
 written+0.456
andra+0.450
Neg logit contributions
oppy-1.098
igs-1.062
 hedge-1.050
 web-1.026
ind-0.994
Token,
Token ID11
Feature activation-6.12e-01

Pos logit contributions
oppy+0.450
igs+0.443
 hedge+0.418
 web+0.415
ind+0.408
Neg logit contributions
 Adam-0.455
 written-0.451
ax-0.451
 funn-0.450
andra-0.433
Token "
Token ID366
Feature activation-2.09e+00

Pos logit contributions
 funn+0.777
ax+0.769
 Adam+0.753
andra+0.747
 written+0.743
Neg logit contributions
oppy-1.091
igs-1.046
 hedge-1.041
 web-1.009
ind-0.968
TokenThat
Token ID2504
Feature activation-2.20e+00

Pos logit contributions
 Adam+3.170
ax+3.153
 funn+3.127
 topp+3.099
 written+3.082
Neg logit contributions
 hedge-3.158
oppy-3.147
ind-3.011
 web-2.934
 Bubble-2.785
Token's
Token ID338
Feature activation-7.55e-01

Pos logit contributions
ax+3.598
andra+3.520
 Adam+3.358
 Sandra+3.353
 written+3.307
Neg logit contributions
 hedge-3.519
oppy-3.251
 web-3.187
igs-3.126
 swamp-3.042
Token a
Token ID257
Feature activation-5.51e-01

Pos logit contributions
ax+1.115
 Adam+1.094
 funn+1.091
andra+1.087
 written+1.082
Neg logit contributions
oppy-1.205
igs-1.143
 hedge-1.130
 web-1.100
ind-1.055
Token nice
Token ID3621
Feature activation-9.01e-01

Pos logit contributions
ax+0.808
 Adam+0.799
 funn+0.796
 written+0.793
andra+0.788
Neg logit contributions
oppy-0.876
igs-0.836
 hedge-0.830
 web-0.804
ind-0.775
Token car
Token ID1097
Feature activation-5.04e-01

Pos logit contributions
ax+1.347
 funn+1.328
 written+1.321
andra+1.318
 Adam+1.316
Neg logit contributions
oppy-1.426
 hedge-1.386
igs-1.372
 web-1.320
ind-1.244
Token.
Token ID13
Feature activation+8.65e-02

Pos logit contributions
ax+0.764
 funn+0.763
 Adam+0.754
 written+0.747
andra+0.747
Neg logit contributions
oppy-0.772
igs-0.737
 hedge-0.723
 web-0.704
ind-0.672
Token Can
Token ID1680
Feature activation+7.00e-01

Pos logit contributions
oppy+0.166
igs+0.162
 hedge+0.155
 web+0.154
ind+0.150
Neg logit contributions
 funn-0.099
 Adam-0.098
ax-0.097
 written-0.097
andra-0.093
TokenLook
Token ID8567
Feature activation-1.92e+00

Pos logit contributions
ax+2.369
 Adam+2.311
 funn+2.275
 written+2.257
andra+2.244
Neg logit contributions
oppy-3.052
 hedge-2.998
igs-2.851
 web-2.813
 Bubble-2.771
Token,
Token ID11
Feature activation-1.46e+00

Pos logit contributions
 funn+3.551
andra+3.474
 written+3.422
ax+3.328
 topp+3.308
Neg logit contributions
oppy-2.595
igs-2.296
 hedge-2.188
 web-2.090
 webs-2.014
Token a
Token ID257
Feature activation-3.27e-01

Pos logit contributions
 funn+2.009
andra+1.970
 written+1.959
ax+1.957
 Adam+1.873
Neg logit contributions
oppy-2.504
 hedge-2.402
igs-2.303
 web-2.265
 swamp-2.152
Token dog
Token ID3290
Feature activation-8.54e-02

Pos logit contributions
 funn+0.453
ax+0.453
 Adam+0.449
 written+0.445
andra+0.439
Neg logit contributions
oppy-0.540
igs-0.520
 hedge-0.507
 web-0.495
ind-0.476
Token!"
Token ID2474
Feature activation-5.25e-01

Pos logit contributions
 funn+0.093
ax+0.093
 Adam+0.092
 written+0.092
andra+0.089
Neg logit contributions
oppy-0.167
igs-0.162
 hedge-0.158
 web-0.154
ind-0.151
Token Tim
Token ID5045
Feature activation-2.10e+00

Pos logit contributions
 funn+0.670
ax+0.664
 written+0.653
 Adam+0.651
andra+0.643
Neg logit contributions
oppy-0.932
igs-0.897
 hedge-0.879
 web-0.858
ind-0.819
Token said
Token ID531
Feature activation-4.54e-01

Pos logit contributions
 funn+2.199
 written+2.136
andra+2.125
 topping+2.117
 vanilla+2.115
Neg logit contributions
oppy-4.246
 hedge-3.977
 web-3.973
igs-3.873
ind-3.705
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 funn+0.759
ax+0.755
 Adam+0.748
 written+0.744
andra+0.739
Neg logit contributions
oppy-0.629
igs-0.598
 hedge-0.580
 web-0.564
ind-0.534
Token "
Token ID366
Feature activation-1.39e+00

Pos logit contributions
oppy+0.024
igs+0.023
 hedge+0.022
 web+0.022
ind+0.021
Neg logit contributions
 Adam-0.016
ax-0.016
 funn-0.016
 written-0.016
andra-0.016
TokenCan
Token ID6090
Feature activation+6.63e-02

Pos logit contributions
ax+1.843
 Adam+1.839
andra+1.828
 funn+1.811
 written+1.781
Neg logit contributions
oppy-2.338
 hedge-2.293
igs-2.217
 web-2.197
ind-2.160
Token we
Token ID356
Feature activation+3.49e-01

Pos logit contributions
oppy+0.120
igs+0.117
 hedge+0.112
 web+0.111
ind+0.108
Neg logit contributions
 funn-0.083
 Adam-0.082
ax-0.082
 written-0.080
andra-0.078
Token he
Token ID339
Feature activation-3.43e-01

Pos logit contributions
oppy+0.293
igs+0.281
 hedge+0.272
 web+0.265
ind+0.259
Neg logit contributions
ax-0.303
 Adam-0.301
 funn-0.301
 written-0.300
andra-0.291
Token squ
Token ID2809
Feature activation-2.62e-01

Pos logit contributions
 funn+0.419
ax+0.418
 Adam+0.413
 written+0.409
andra+0.403
Neg logit contributions
oppy-0.626
igs-0.604
 hedge-0.589
 web-0.577
ind-0.558
Tokeneal
Token ID2287
Feature activation-1.03e-01

Pos logit contributions
 funn+0.292
ax+0.291
 Adam+0.288
 written+0.285
andra+0.280
Neg logit contributions
oppy-0.512
igs-0.496
 hedge-0.485
 web-0.475
ind-0.460
Tokened
Token ID276
Feature activation+1.81e-01

Pos logit contributions
ax+0.127
 funn+0.126
 Adam+0.126
 written+0.125
andra+0.121
Neg logit contributions
oppy-0.189
igs-0.183
 hedge-0.178
 web-0.174
ind-0.169
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
oppy+0.307
igs+0.297
 hedge+0.286
 web+0.279
ind+0.269
Neg logit contributions
 funn-0.248
ax-0.247
 Adam-0.247
 written-0.246
andra-0.237
Token "
Token ID366
Feature activation-2.53e+00

Pos logit contributions
ax+1.455
 funn+1.451
andra+1.413
 Adam+1.405
 written+1.401
Neg logit contributions
oppy-1.997
igs-1.951
 hedge-1.937
 web-1.885
ind-1.767
TokenI
Token ID40
Feature activation-6.48e-01

Pos logit contributions
 Adam+3.657
ax+3.517
 funn+3.464
andra+3.418
 written+3.388
Neg logit contributions
ind-3.938
oppy-3.869
 hedge-3.740
igs-3.735
 Bubble-3.536
Token did
Token ID750
Feature activation-6.84e-01

Pos logit contributions
 funn+0.780
 Adam+0.777
ax+0.777
andra+0.758
 written+0.758
Neg logit contributions
oppy-1.204
igs-1.151
 hedge-1.124
 web-1.106
ind-1.059
Token it
Token ID340
Feature activation-1.37e+00

Pos logit contributions
 funn+0.850
ax+0.850
andra+0.831
 Adam+0.831
 written+0.822
Neg logit contributions
oppy-1.232
igs-1.201
 hedge-1.163
 web-1.143
ind-1.093
Token!
Token ID0
Feature activation-6.14e-01

Pos logit contributions
andra+1.722
 funn+1.717
ax+1.703
 Adam+1.697
 written+1.678
Neg logit contributions
oppy-2.464
igs-2.352
 hedge-2.299
 web-2.258
ind-2.162
Token I
Token ID314
Feature activation-1.60e-01

Pos logit contributions
 funn+0.820
ax+0.814
 written+0.801
 Adam+0.799
andra+0.797
Neg logit contributions
oppy-1.054
igs-1.006
 hedge-0.989
 web-0.962
ind-0.922
Token thinking
Token ID3612
Feature activation+9.34e-01

Pos logit contributions
oppy+1.165
igs+1.129
 hedge+1.088
 web+1.065
ind+1.032
Neg logit contributions
 funn-0.795
 written-0.784
 Adam-0.777
ax-0.755
andra-0.724
Token independently
Token ID14799
Feature activation+1.25e+00

Pos logit contributions
oppy+1.962
igs+1.884
 hedge+1.832
 web+1.799
ind+1.756
Neg logit contributions
 Adam-0.946
 funn-0.937
 written-0.933
ax-0.915
 vanilla-0.857
Token!"
Token ID2474
Feature activation-1.81e-01

Pos logit contributions
oppy+2.154
igs+2.094
 hedge+2.025
 web+1.982
ind+1.911
Neg logit contributions
 Adam-1.696
 funn-1.674
 written-1.672
ax-1.616
 flavor-1.556
Token<|endoftext|>
Token ID50256
Feature activation-3.20e-01

Pos logit contributions
ax+0.226
 funn+0.226
 Adam+0.223
 written+0.221
andra+0.217
Neg logit contributions
oppy-0.329
igs-0.319
 hedge-0.311
 web-0.304
ind-0.295
TokenOnce
Token ID7454
Feature activation-1.85e-01

Pos logit contributions
 funn+0.433
ax+0.431
 Adam+0.426
 written+0.423
andra+0.417
Neg logit contributions
oppy-0.544
igs-0.524
 hedge-0.511
 web-0.499
ind-0.480
Token upon
Token ID2402
Feature activation-2.37e+00

Pos logit contributions
ax+0.199
 funn+0.198
 Adam+0.196
 written+0.194
andra+0.190
Neg logit contributions
oppy-0.364
igs-0.354
 hedge-0.346
 web-0.339
ind-0.329
Token a
Token ID257
Feature activation-1.18e+00

Pos logit contributions
ax+3.287
andra+3.281
 funn+3.175
 nineteen+3.089
 Adam+3.047
Neg logit contributions
oppy-4.084
 web-3.943
 hedge-3.888
igs-3.806
 webs-3.707
Token time
Token ID640
Feature activation-6.87e-02

Pos logit contributions
 funn+1.451
ax+1.431
 written+1.429
 Adam+1.397
andra+1.363
Neg logit contributions
oppy-2.222
igs-2.098
 web-2.056
 hedge-2.018
ind-1.963
Token,
Token ID11
Feature activation-9.19e-02

Pos logit contributions
ax+0.111
 Adam+0.111
 funn+0.110
 written+0.109
andra+0.107
Neg logit contributions
oppy-0.097
igs-0.094
 hedge-0.091
 web-0.089
ind-0.086
Token there
Token ID612
Feature activation-3.57e-01

Pos logit contributions
 funn+0.101
ax+0.100
 Adam+0.100
 written+0.099
andra+0.096
Neg logit contributions
oppy-0.179
igs-0.174
 hedge-0.169
 web-0.166
ind-0.162
Token was
Token ID373
Feature activation-1.24e+00

Pos logit contributions
 funn+0.467
ax+0.465
 Adam+0.459
 written+0.457
andra+0.450
Neg logit contributions
oppy-0.625
igs-0.601
 hedge-0.588
 web-0.574
ind-0.553
Token rolling
Token ID10708
Feature activation+2.00e-01

Pos logit contributions
ax+1.361
 funn+1.344
andra+1.318
 written+1.315
 Adam+1.309
Neg logit contributions
oppy-1.505
igs-1.431
 hedge-1.416
 web-1.368
ind-1.299
Token maze
Token ID31237
Feature activation-1.72e-01

Pos logit contributions
oppy+0.341
igs+0.324
 hedge+0.311
 web+0.306
ind+0.298
Neg logit contributions
 funn-0.274
ax-0.270
 Adam-0.270
 written-0.266
andra-0.258
Token.
Token ID13
Feature activation+5.74e-02

Pos logit contributions
 funn+0.257
ax+0.256
 Adam+0.254
 written+0.253
andra+0.248
Neg logit contributions
oppy-0.266
igs-0.255
 hedge-0.247
 web-0.241
ind-0.232
Token<|endoftext|>
Token ID50256
Feature activation-6.07e-01

Pos logit contributions
oppy+0.090
igs+0.087
 hedge+0.083
 web+0.082
ind+0.079
Neg logit contributions
 Adam-0.085
ax-0.085
 funn-0.085
 written-0.084
andra-0.083
TokenOnce
Token ID7454
Feature activation+7.08e-02

Pos logit contributions
 funn+0.798
ax+0.787
 Adam+0.772
 written+0.770
andra+0.763
Neg logit contributions
oppy-1.065
igs-1.020
 hedge-1.003
 web-0.978
ind-0.937
Token upon
Token ID2402
Feature activation-2.35e+00

Pos logit contributions
oppy+0.141
igs+0.139
 hedge+0.137
 web+0.134
ind+0.130
Neg logit contributions
ax-0.072
 Adam-0.072
 funn-0.071
 written-0.070
andra-0.069
Token a
Token ID257
Feature activation-7.36e-01

Pos logit contributions
ax+3.246
andra+3.242
 funn+3.077
 Adam+3.033
 nineteen+3.029
Neg logit contributions
oppy-4.098
 web-3.952
 hedge-3.851
igs-3.840
 webs-3.720
Token time
Token ID640
Feature activation-1.93e-01

Pos logit contributions
 funn+0.837
 written+0.826
ax+0.825
 Adam+0.816
andra+0.788
Neg logit contributions
oppy-1.432
igs-1.375
 hedge-1.328
 web-1.326
ind-1.279
Token,
Token ID11
Feature activation-5.30e-01

Pos logit contributions
ax+0.317
 funn+0.317
 Adam+0.315
 written+0.313
andra+0.309
Neg logit contributions
oppy-0.271
igs-0.260
 hedge-0.251
 web-0.245
ind-0.234
Token there
Token ID612
Feature activation-6.98e-01

Pos logit contributions
 funn+0.646
ax+0.644
 Adam+0.633
 written+0.628
andra+0.623
Neg logit contributions
oppy-0.982
igs-0.947
 hedge-0.925
 web-0.905
ind-0.876
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
 funn+0.977
ax+0.970
 written+0.947
 Adam+0.946
andra+0.936
Neg logit contributions
oppy-1.176
igs-1.117
 hedge-1.103
 web-1.075
ind-1.022
Token Jimmy
Token ID12963
Feature activation+1.45e-01

Pos logit contributions
 funn+1.060
 written+1.024
ax+1.019
andra+0.983
 Adam+0.962
Neg logit contributions
oppy-1.790
igs-1.704
 hedge-1.673
ind-1.619
 web-1.619
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
oppy+0.267
igs+0.262
 hedge+0.253
 web+0.247
ind+0.241
Neg logit contributions
ax-0.176
 Adam-0.175
 funn-0.174
 written-0.173
andra-0.168
Token said
Token ID531
Feature activation+6.22e-01

Pos logit contributions
 funn+0.959
ax+0.935
 written+0.912
andra+0.897
 Adam+0.885
Neg logit contributions
oppy-2.842
 hedge-2.696
igs-2.686
 web-2.607
 Bubble-2.532
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
oppy+0.966
igs+0.941
 hedge+0.916
 web+0.902
ind+0.886
Neg logit contributions
 Adam-0.945
ax-0.929
 written-0.920
 funn-0.914
 vanilla-0.885
Token "
Token ID366
Feature activation-2.09e+00

Pos logit contributions
 funn+1.478
ax+1.449
andra+1.416
 written+1.409
 Adam+1.404
Neg logit contributions
oppy-1.796
igs-1.734
 hedge-1.727
 web-1.650
ind-1.592
TokenLet
Token ID5756
Feature activation-2.16e+00

Pos logit contributions
ax+2.681
 Adam+2.604
 written+2.559
andra+2.528
 funn+2.504
Neg logit contributions
oppy-3.501
 hedge-3.457
igs-3.378
ind-3.362
 web-3.298
Token your
Token ID534
Feature activation-5.48e-01

Pos logit contributions
 written+3.022
 funn+2.935
ax+2.934
 Adam+2.879
 topp+2.877
Neg logit contributions
oppy-3.661
 hedge-3.469
igs-3.337
 web-3.306
ind-3.281
Token shadow
Token ID9082
Feature activation-6.24e-01

Pos logit contributions
ax+0.882
 funn+0.869
 Adam+0.864
 written+0.862
andra+0.857
Neg logit contributions
oppy-0.812
igs-0.773
 hedge-0.762
 web-0.745
 webs-0.698
Token go
Token ID467
Feature activation+2.13e-01

Pos logit contributions
ax+0.778
 funn+0.774
 Adam+0.770
 written+0.758
andra+0.754
Neg logit contributions
oppy-1.126
igs-1.090
 hedge-1.069
 web-1.045
ind-0.998
Token on
Token ID319
Feature activation+7.21e-01

Pos logit contributions
oppy+0.395
igs+0.380
 hedge+0.367
 web+0.365
ind+0.349
Neg logit contributions
ax-0.257
 funn-0.255
 Adam-0.253
 written-0.250
andra-0.243
Token a
Token ID257
Feature activation+6.17e-01

Pos logit contributions
oppy+1.458
igs+1.388
 hedge+1.340
 web+1.319
ind+1.292
Neg logit contributions
 Adam-0.798
 funn-0.758
ax-0.753
 written-0.753
andra-0.721