TOP ACTIVATIONS
MAX = 4.189

much and he decided to memor ize it . Every day
more curious . " What is a zero ?" she asked
so beautiful and everyone was mesmer ized . They all listened
had two long arms . Fasc inated , she picked it
and beautiful and Joe was mesmer ised . He watched it
before and he stood there mesmer ised . He began to
beautiful flower . Lucy was mesmer ised by the flower 's
the floor , everyone was mesmer ized by her charming moves
with a golden frame . Fasc inated , Jerry and his
sound and he tried to memor ize every note . After
! " What is this ?" she asked herself
her face . She was mesmer ised . When
the air . Jane was mesmer ized . The old man
the wheat and found himself mesmer ised by its beauty .
and graceful that everyone was mesmer ized . When the song
little girl kept walking towards the rainbow and soon she realized
ly and the bunny was mesmer ised . The
Mom my , " What is that ? Is it a
music playing . She was mesmer ized . She
Everyone around the table was mesmer ized by its beauty .

MIN ACTIVATIONS
MIN = -3.802

. " In a box under the tree ."
He thinks about what Tom and Sue said . He realizes
happy to help . Tom and Lily were happy to be
. In the middle of the night , Tim
. Max asked , " L ily ,
. " Spot , come here !" Lily called
toy . Max asked Luna , " What are
talks to the dog in a soft voice . He gives
fell asleep . In the middle of the night , she
. " Spot , sit !" Ben says .
" Rex , Rex , come here !" the man
sees Tim hurt . She calls for help and takes Tim
. Max asked Lily , " What are
cow boys . They run , jump and shoot . They
eyes and sleeps . He dreams of his ball and the
Max . Max asked Lily , " Can we
Max was excited . He asked Lily to show him how
their hats and coats and sne aked out of the house
the dog and said in a loud voice , " Hey
them . He lives in a house near the woods .

INTERVAL 2.191 - 4.189
CONTAINS 0.306%

to go away . Then one day , her
. One day , in the middle of spring
She got very happy and opened it . She saw that
walking down the street , adm iring the tidy gardens .
scared . When she woke up , she was confused

INTERVAL 0.193 - 2.191
CONTAINS 38.600%

pointed the arrow at the sky . He took a deep
other mushroom . Inside the mushroom was a secret room full
that the things that are truly valuable are the ones that
when she reached the top , she saw a boy who
at you and see how beautiful you are ," she said

INTERVAL -1.804 - 0.193
CONTAINS 60.432%

fluffy and grey . One day , she was walking through
see it . They sat on the floor and
books . She had many books in her room . Some
happy again and fell asleep with a smile on her face
was a boy named Tim . Tim loved cookies and he

INTERVAL -3.802 - -1.804
CONTAINS 0.663%

, " Mom , mom , I 'm sorry . I
clean up the mess and went to bed early . They
her the ring . He thought she would like it .
. Her mom hears her and comes outside . She sees
the house . They are hungry and happy . They had
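
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX. This is inferred from the printed numbers; the dashboard does not state it. A minimal sketch that recomputes the bin edges under that assumption (the helper name `equal_width_edges` is hypothetical, and all values are copied from the lines above):

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (inclusive)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed in this dump; four bins is an assumption
# based on the four INTERVAL headings.
edges = equal_width_edges(-3.802, 4.189, 4)

# Interval boundaries as printed in the INTERVAL headings, lowest first.
printed = [-3.802, -1.804, 0.193, 2.191, 4.189]

# Shares from the CONTAINS lines, lowest interval first.
shares = [0.663, 60.432, 38.600, 0.306]

for computed, shown in zip(edges, printed):
    print(f"computed {computed:+.4f}   printed {shown:+.3f}")
print(f"total share: {sum(shares):.3f}%")
```

The recomputed edges agree with the printed ones to within the three-decimal rounding of the dump, and the shares sum to roughly 100%, which is consistent with the equal-width reading.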

PER-TOKEN DETAILS

Token much
Token ID881
Feature activation+1.87e-01

Pos logit contributions
inated+1.480
work+1.432
ised+1.400
ized+1.310
el+1.261
Neg logit contributions
isk-0.874
ens-0.734
arl-0.722
 bisc-0.676
 shady-0.661
Token and
Token ID290
Feature activation-1.32e-01

Pos logit contributions
inated+0.288
work+0.273
ised+0.271
ized+0.246
el+0.229
Neg logit contributions
isk-0.380
arl-0.339
ens-0.338
 bisc-0.325
coat-0.319
Token he
Token ID339
Feature activation+9.97e-01

Pos logit contributions
isk+0.234
arl+0.207
ens+0.202
 bisc+0.194
coat+0.191
Neg logit contributions
inated-0.239
work-0.229
ised-0.228
ized-0.211
el-0.197
Token decided
Token ID3066
Feature activation+6.57e-01

Pos logit contributions
inated+1.937
work+1.851
ised+1.844
ized+1.692
izing+1.627
Neg logit contributions
isk-1.638
ens-1.415
arl-1.341
 bisc-1.326
coat-1.307
Token to
Token ID284
Feature activation+7.93e-01

Pos logit contributions
inated+1.378
work+1.332
ised+1.328
ized+1.227
el+1.188
Neg logit contributions
isk-0.958
ens-0.815
arl-0.814
 bisc-0.771
 shady-0.753
Token memor
Token ID16181
Feature activation+4.19e+00

Pos logit contributions
inated+1.833
ised+1.789
work+1.772
ized+1.680
ement+1.602
Neg logit contributions
isk-1.006
arl-0.812
ens-0.810
 bisc-0.758
coat-0.732
Tokenize
Token ID1096
Feature activation+3.88e-01

Pos logit contributions
work+6.038
inated+5.358
inct+4.127
 horizon+3.688
 sunset+3.639
Neg logit contributions
isk-9.706
ens-9.010
aid-8.996
smart-8.504
arl-8.472
Token it
Token ID340
Feature activation+3.56e-01

Pos logit contributions
inated+0.687
work+0.655
ised+0.650
ized+0.597
el+0.560
Neg logit contributions
isk-0.701
ens-0.613
arl-0.612
 bisc-0.583
coat-0.568
Token.
Token ID13
Feature activation+9.55e-01

Pos logit contributions
inated+0.569
work+0.539
ised+0.535
ized+0.486
el+0.455
Neg logit contributions
isk-0.704
arl-0.624
ens-0.623
 bisc-0.598
coat-0.586
Token Every
Token ID3887
Feature activation+3.75e-02

Pos logit contributions
inated+2.055
work+2.002
ised+1.970
ized+1.877
izing+1.778
Neg logit contributions
isk-1.347
ens-1.141
arl-1.130
 bisc-1.073
ives-1.058
Token day
Token ID1110
Feature activation+1.35e-01

Pos logit contributions
inated+0.070
work+0.067
ised+0.067
ized+0.062
el+0.058
Neg logit contributions
isk-0.064
arl-0.056
ens-0.055
 bisc-0.053
coat-0.052
Token more
Token ID517
Feature activation+1.78e+00

Pos logit contributions
isk+1.283
arl+1.161
ens+1.091
 bisc+1.035
ives+1.024
Neg logit contributions
inated-1.331
ised-1.315
work-1.291
ized-1.212
used-1.130
Token curious
Token ID11040
Feature activation+5.21e-02

Pos logit contributions
inated+3.744
work+3.743
ised+3.493
ized+3.226
el+3.215
Neg logit contributions
isk-2.440
ens-2.236
arl-2.202
 bisc-2.167
 shady-2.150
Token.
Token ID13
Feature activation+1.69e+00

Pos logit contributions
inated+0.090
work+0.086
ised+0.085
ized+0.078
el+0.074
Neg logit contributions
isk-0.096
arl-0.085
ens-0.084
 bisc-0.081
coat-0.079
Token "
Token ID366
Feature activation+1.76e+00

Pos logit contributions
inated+3.802
work+3.795
ised+3.761
ized+3.641
ement+3.417
Neg logit contributions
isk-2.065
arl-1.771
ens-1.737
 comes-1.626
 shady-1.576
TokenWhat
Token ID2061
Feature activation+5.97e-01

Pos logit contributions
inated+3.299
ised+3.146
work+3.136
ized+2.936
ement+2.793
Neg logit contributions
isk-2.984
arl-2.628
ens-2.537
 bisc-2.488
Rex-2.484
Token is
Token ID318
Feature activation+4.15e+00

Pos logit contributions
inated+1.285
ised+1.235
work+1.226
ized+1.154
izing+1.086
Neg logit contributions
isk-0.848
ens-0.726
arl-0.711
 bisc-0.686
ives-0.661
Token a
Token ID257
Feature activation+1.82e+00

Pos logit contributions
ised+9.584
inated+9.040
ized+8.837
ement+8.793
work+8.644
Neg logit contributions
 comes-5.660
 fetch-5.452
 Luna-5.391
 backup-5.287
 hay-5.242
Token zero
Token ID6632
Feature activation+1.36e+00

Pos logit contributions
inated+3.241
ised+3.174
work+3.027
ized+2.900
el+2.637
Neg logit contributions
isk-3.223
 bisc-2.874
arl-2.836
 shady-2.805
 robber-2.734
Token?"
Token ID1701
Feature activation+2.62e+00

Pos logit contributions
inated+2.340
ised+2.155
work+2.138
ized+1.957
el+1.846
Neg logit contributions
isk-2.500
ens-2.237
arl-2.215
 bisc-2.150
 comes-2.121
Token she
Token ID673
Feature activation+2.68e+00

Pos logit contributions
inated+5.640
work+5.627
ised+5.429
ized+5.287
izing+5.119
Neg logit contributions
ens-3.364
isk-3.310
 comes-3.301
 Luna-3.241
 wake-3.031
Token asked
Token ID1965
Feature activation+1.52e+00

Pos logit contributions
work+5.272
inated+5.146
ised+4.773
ized+4.568
ement+4.467
Neg logit contributions
isk-4.376
ens-3.905
 Luna-3.737
 comes-3.698
 shady-3.581
Token so
Token ID523
Feature activation+1.13e+00

Pos logit contributions
inated+1.155
ised+1.095
work+1.094
ized+0.997
el+0.939
Neg logit contributions
isk-1.293
arl-1.138
ens-1.127
 bisc-1.088
coat-1.075
Token beautiful
Token ID4950
Feature activation+1.59e+00

Pos logit contributions
inated+2.334
work+2.262
ised+2.216
ized+2.050
el+2.002
Neg logit contributions
isk-1.643
arl-1.466
ens-1.413
 shady-1.365
 bisc-1.325
Token and
Token ID290
Feature activation+1.78e-01

Pos logit contributions
inated+2.766
work+2.569
ised+2.565
ized+2.301
el+2.287
Neg logit contributions
isk-2.843
ens-2.588
 bisc-2.460
arl-2.452
 shady-2.442
Token everyone
Token ID2506
Feature activation+1.30e+00

Pos logit contributions
inated+0.279
work+0.265
ised+0.263
ized+0.239
el+0.222
Neg logit contributions
isk-0.356
arl-0.317
ens-0.315
 bisc-0.304
coat-0.297
Token was
Token ID373
Feature activation+1.25e+00

Pos logit contributions
inated+2.642
ised+2.520
work+2.480
ized+2.308
ement+2.236
Neg logit contributions
isk-1.929
ens-1.700
arl-1.617
 comes-1.579
coat-1.558
Token mesmer
Token ID46814
Feature activation+3.99e+00

Pos logit contributions
inated+2.296
ised+2.191
work+2.164
ized+1.990
izing+1.912
Neg logit contributions
isk-2.135
arl-1.847
ens-1.801
coat-1.771
 shady-1.763
Tokenized
Token ID1143
Feature activation+1.75e+00

Pos logit contributions
inated+3.293
work+3.273
inct+2.262
ble+2.168
used+2.039
Neg logit contributions
isk-10.720
arl-10.072
ens-10.025
 shady-9.602
 laps-9.441
Token.
Token ID13
Feature activation+1.75e+00

Pos logit contributions
inated+3.130
work+2.832
ised+2.785
ble+2.620
ized+2.515
Neg logit contributions
isk-3.172
ens-2.929
arl-2.743
 comes-2.656
 shady-2.611
Token They
Token ID1119
Feature activation+1.43e+00

Pos logit contributions
inated+3.547
work+3.468
ised+3.436
ized+3.279
ble+3.177
Neg logit contributions
isk-2.677
ens-2.290
arl-2.277
 bisc-2.125
ives-2.125
Token all
Token ID477
Feature activation+6.09e-01

Pos logit contributions
inated+2.873
ised+2.724
work+2.704
ized+2.497
ement+2.444
Neg logit contributions
isk-2.214
ens-1.941
arl-1.842
coat-1.769
 comes-1.765
Token listened
Token ID16399
Feature activation+1.35e+00

Pos logit contributions
inated+1.185
work+1.131
ised+1.126
ized+1.044
izing+0.993
Neg logit contributions
isk-0.979
ens-0.852
arl-0.841
 bisc-0.796
coat-0.784
Token had
Token ID550
Feature activation+2.93e-01

Pos logit contributions
inated+2.245
work+2.164
ised+2.142
ized+1.951
used+1.927
Neg logit contributions
isk-1.742
ens-1.488
 bisc-1.409
coat-1.380
 shady-1.374
Token two
Token ID734
Feature activation+2.59e-01

Pos logit contributions
inated+0.501
work+0.479
ised+0.477
ized+0.436
el+0.409
Neg logit contributions
isk-0.545
arl-0.479
ens-0.478
 bisc-0.460
coat-0.449
Token long
Token ID890
Feature activation+5.39e-01

Pos logit contributions
inated+0.464
work+0.444
ised+0.440
ized+0.404
el+0.380
Neg logit contributions
isk-0.462
arl-0.405
ens-0.403
 bisc-0.387
coat-0.377
Token arms
Token ID5101
Feature activation+1.15e+00

Pos logit contributions
inated+1.002
work+0.972
ised+0.955
ized+0.886
izing+0.832
Neg logit contributions
isk-0.913
ens-0.797
arl-0.796
 bisc-0.770
coat-0.751
Token.
Token ID13
Feature activation+3.80e-01

Pos logit contributions
inated+1.925
ised+1.872
work+1.837
ized+1.693
ement+1.589
Neg logit contributions
isk-2.128
arl-1.919
ens-1.899
coat-1.830
 bisc-1.815
Token Fasc
Token ID41467
Feature activation+3.96e+00

Pos logit contributions
inated+0.633
work+0.610
ised+0.603
ized+0.552
izing+0.513
Neg logit contributions
isk-0.726
ens-0.642
arl-0.640
 bisc-0.614
coat-0.601
Tokeninated
Token ID3898
Feature activation-3.43e-01

Pos logit contributions
work+4.242
ement+3.817
ised+3.729
ized+3.047
used+2.912
Neg logit contributions
isk-10.848
 fetch-9.447
arl-9.341
 Luna-9.200
 shady-9.092
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
isk+0.613
arl+0.537
ens+0.519
 bisc+0.512
coat+0.496
Neg logit contributions
inated-0.621
work-0.595
ised-0.593
ized-0.547
el-0.524
Token she
Token ID673
Feature activation+1.43e+00

Pos logit contributions
inated+2.019
work+1.971
ised+1.915
ized+1.754
ement+1.672
Neg logit contributions
isk-1.968
ens-1.779
arl-1.697
 bisc-1.651
 shady-1.615
Token picked
Token ID6497
Feature activation+1.55e+00

Pos logit contributions
work+3.093
inated+3.080
ised+2.960
ized+2.743
ement+2.739
Neg logit contributions
isk-1.988
ens-1.667
 bisc-1.570
 shady-1.556
 comes-1.553
Token it
Token ID340
Feature activation+1.78e+00

Pos logit contributions
inated+2.294
ised+2.200
work+2.164
ized+1.959
izing+1.863
Neg logit contributions
isk-3.210
ens-3.048
 bisc-2.857
 comes-2.833
 Luna-2.821
Token and
Token ID290
Feature activation+2.13e-01

Pos logit contributions
inated+3.172
work+2.994
ised+2.968
ized+2.744
used+2.709
Neg logit contributions
isk-2.192
ens-1.972
 shady-1.948
arl-1.942
 bisc-1.886
Token beautiful
Token ID4950
Feature activation+2.17e+00

Pos logit contributions
inated+0.362
work+0.345
ised+0.343
ized+0.314
el+0.295
Neg logit contributions
isk-0.399
arl-0.352
ens-0.351
 bisc-0.337
coat-0.329
Token and
Token ID290
Feature activation+1.30e-01

Pos logit contributions
inated+4.037
ised+3.739
work+3.703
used+3.414
ized+3.388
Neg logit contributions
isk-3.536
 shady-3.219
ens-3.210
 bisc-3.155
coat-3.068
Token Joe
Token ID5689
Feature activation+1.43e+00

Pos logit contributions
inated+0.215
work+0.205
ised+0.203
ized+0.186
el+0.174
Neg logit contributions
isk-0.249
arl-0.221
ens-0.220
 bisc-0.211
coat-0.207
Token was
Token ID373
Feature activation+1.24e+00

Pos logit contributions
inated+2.844
work+2.736
ised+2.689
ized+2.453
izing+2.401
Neg logit contributions
isk-2.248
ens-1.954
 bisc-1.811
coat-1.804
 comes-1.791
Token mesmer
Token ID46814
Feature activation+3.94e+00

Pos logit contributions
inated+2.156
ised+2.078
work+2.052
ized+1.891
izing+1.804
Neg logit contributions
isk-2.229
arl-1.928
 shady-1.901
ens-1.888
coat-1.878
Tokenised
Token ID1417
Feature activation+1.26e+00

Pos logit contributions
work+4.085
inated+3.623
used+2.596
inct+2.562
ble+2.174
Neg logit contributions
isk-10.135
arl-9.641
ens-9.436
 shady-9.183
aks-8.680
Token.
Token ID13
Feature activation+8.90e-01

Pos logit contributions
inated+2.103
work+1.925
ised+1.904
ized+1.747
ble+1.664
Neg logit contributions
isk-2.404
ens-2.139
arl-2.109
 shady-2.016
 bisc-1.997
Token He
Token ID679
Feature activation+1.52e+00

Pos logit contributions
inated+2.075
work+2.030
ised+2.025
ized+1.922
ble+1.833
Neg logit contributions
isk-1.079
ens-0.879
arl-0.866
 bisc-0.806
 shady-0.781
Token watched
Token ID7342
Feature activation+1.10e+00

Pos logit contributions
inated+3.141
work+3.085
ised+2.981
ized+2.757
ement+2.730
Neg logit contributions
isk-2.272
ens-1.929
 bisc-1.828
 shady-1.825
coat-1.792
Token it
Token ID340
Feature activation+3.21e-01

Pos logit contributions
inated+1.948
ised+1.851
work+1.842
ized+1.686
el+1.595
Neg logit contributions
isk-1.923
ens-1.696
arl-1.696
 bisc-1.594
 shady-1.589
Token before
Token ID878
Feature activation+1.31e+00

Pos logit contributions
inated+1.164
work+1.107
ised+1.094
ized+0.978
el+0.909
Neg logit contributions
isk-1.593
arl-1.442
ens-1.437
 bisc-1.374
coat-1.349
Token and
Token ID290
Feature activation-5.81e-02

Pos logit contributions
inated+2.324
work+2.208
ised+2.206
ized+1.992
izing+1.870
Neg logit contributions
isk-2.350
ens-2.113
arl-2.053
 bisc-2.005
 shady-2.001
Token he
Token ID339
Feature activation+1.66e+00

Pos logit contributions
isk+0.106
arl+0.094
ens+0.092
 bisc+0.089
coat+0.087
Neg logit contributions
inated-0.103
work-0.098
ised-0.097
ized-0.090
el-0.084
Token stood
Token ID6204
Feature activation+2.76e+00

Pos logit contributions
inated+3.386
work+3.236
ised+3.216
ized+2.940
izing+2.901
Neg logit contributions
isk-2.652
ens-2.166
 comes-2.066
 bisc-2.060
 shady-2.046
Token there
Token ID612
Feature activation+6.00e-01

Pos logit contributions
inated+5.324
work+5.164
ised+4.900
izing+4.404
ized+4.346
Neg logit contributions
isk-4.448
 shady-4.173
arl-4.111
aks-3.829
 welcome-3.823
Token mesmer
Token ID46814
Feature activation+3.91e+00

Pos logit contributions
inated+1.109
ised+1.063
work+1.055
ized+0.972
el+0.919
Neg logit contributions
isk-1.029
arl-0.889
ens-0.888
 bisc-0.854
 shady-0.838
Tokenised
Token ID1417
Feature activation+1.15e+00

Pos logit contributions
work+5.004
inated+4.560
used+3.682
inct+3.380
astically+3.266
Neg logit contributions
isk-8.858
arl-8.478
 shady-8.153
ens-8.020
aks-7.686
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
inated+1.923
work+1.768
ised+1.748
ized+1.587
ble+1.495
Neg logit contributions
isk-2.195
ens-1.964
arl-1.916
 shady-1.842
 bisc-1.841
Token He
Token ID679
Feature activation+2.18e+00

Pos logit contributions
inated+0.277
work+0.266
ised+0.264
ized+0.245
el+0.233
Neg logit contributions
isk-0.210
arl-0.180
ens-0.179
 bisc-0.170
coat-0.165
Token began
Token ID2540
Feature activation+3.17e-01

Pos logit contributions
inated+4.599
work+4.465
ised+4.345
ement+4.071
izing+4.026
Neg logit contributions
isk-3.150
ens-2.583
 comes-2.581
 shady-2.527
 bisc-2.422
Token to
Token ID284
Feature activation+1.32e+00

Pos logit contributions
inated+0.653
work+0.629
ised+0.627
ized+0.583
el+0.553
Neg logit contributions
isk-0.480
arl-0.409
ens-0.407
 bisc-0.386
coat-0.375
Token beautiful
Token ID4950
Feature activation+1.48e+00

Pos logit contributions
inated+0.944
ised+0.902
work+0.902
ized+0.837
el+0.796
Neg logit contributions
isk-0.725
arl-0.625
ens-0.620
 bisc-0.599
coat-0.579
Token flower
Token ID15061
Feature activation+2.12e+00

Pos logit contributions
inated+2.781
ised+2.677
work+2.595
ized+2.437
used+2.361
Neg logit contributions
isk-2.470
 bisc-2.198
ens-2.153
 shady-2.151
arl-2.136
Token.
Token ID13
Feature activation+3.61e-01

Pos logit contributions
inated+4.032
ised+3.974
work+3.743
ized+3.498
used+3.316
Neg logit contributions
isk-3.484
 bisc-3.150
ens-3.098
 shady-3.060
arl-3.054
Token Lucy
Token ID22162
Feature activation+2.02e+00

Pos logit contributions
inated+0.688
work+0.666
ised+0.659
ized+0.612
el+0.577
Neg logit contributions
isk-0.598
ens-0.519
arl-0.519
 bisc-0.494
coat-0.483
Token was
Token ID373
Feature activation+1.40e+00

Pos logit contributions
work+4.217
inated+4.153
ised+4.077
ized+3.799
izing+3.755
Neg logit contributions
isk-3.017
ens-2.557
 comes-2.405
 bisc-2.372
 shady-2.331
Token mesmer
Token ID46814
Feature activation+3.86e+00

Pos logit contributions
inated+2.559
ised+2.518
work+2.499
ized+2.308
izing+2.216
Neg logit contributions
isk-2.423
arl-2.052
 shady-2.008
ens-1.996
coat-1.968
Tokenised
Token ID1417
Feature activation+1.42e+00

Pos logit contributions
work+4.292
inated+3.475
inct+2.533
used+2.414
astically+2.126
Neg logit contributions
isk-10.307
ens-9.811
arl-9.605
 shady-8.967
coat-8.908
Token by
Token ID416
Feature activation+1.51e+00

Pos logit contributions
inated+2.580
work+2.410
ised+2.332
ized+2.151
ble+2.086
Neg logit contributions
isk-2.536
ens-2.260
arl-2.185
coat-2.115
 shady-2.100
Token the
Token ID262
Feature activation+2.19e+00

Pos logit contributions
inated+2.841
ised+2.752
work+2.736
ized+2.527
used+2.377
Neg logit contributions
isk-2.536
ens-2.163
arl-2.123
 shady-2.119
 bisc-2.022
Token flower
Token ID15061
Feature activation+1.47e+00

Pos logit contributions
inated+4.186
ised+4.113
work+3.956
used+3.638
ized+3.635
Neg logit contributions
isk-3.464
 shady-3.276
 bisc-3.220
ens-3.169
arl-3.087
Token's
Token ID338
Feature activation+2.05e+00

Pos logit contributions
inated+2.758
ised+2.654
work+2.570
ized+2.392
used+2.257
Neg logit contributions
isk-2.465
ens-2.192
 bisc-2.121
 shady-2.082
coat-2.046
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
inated+0.115
work+0.110
ised+0.110
ized+0.101
el+0.095
Neg logit contributions
isk-0.115
arl-0.101
ens-0.100
 bisc-0.096
coat-0.094
Token floor
Token ID4314
Feature activation-6.55e-01

Pos logit contributions
inated+2.091
ised+1.964
work+1.922
ized+1.791
el+1.717
Neg logit contributions
isk-1.826
arl-1.653
ens-1.632
 bisc-1.586
coat-1.568
Token,
Token ID11
Feature activation-2.54e-01

Pos logit contributions
isk+1.256
arl+1.113
ens+1.098
 bisc+1.073
coat+1.015
Neg logit contributions
inated-1.097
work-1.056
ised-1.045
ized-0.964
el-0.902
Token everyone
Token ID2506
Feature activation+6.02e-01

Pos logit contributions
isk+0.420
arl+0.364
ens+0.356
 bisc+0.345
coat+0.332
Neg logit contributions
inated-0.493
work-0.473
ised-0.470
ized-0.436
izing-0.412
Token was
Token ID373
Feature activation+6.44e-01

Pos logit contributions
inated+1.158
ised+1.101
work+1.099
ized+1.014
el+0.969
Neg logit contributions
isk-0.985
ens-0.855
arl-0.845
 bisc-0.801
coat-0.793
Token mesmer
Token ID46814
Feature activation+3.84e+00

Pos logit contributions
inated+1.109
work+1.052
ised+1.046
ized+0.956
el+0.910
Neg logit contributions
isk-1.191
arl-1.042
ens-1.032
 bisc-0.995
coat-0.995
Tokenized
Token ID1143
Feature activation+1.11e+00

Pos logit contributions
work+4.671
inated+4.341
used+3.281
inct+3.202
ble+2.906
Neg logit contributions
isk-9.442
ens-8.784
arl-8.738
 shady-8.129
aks-7.968
Token by
Token ID416
Feature activation+8.48e-01

Pos logit contributions
inated+1.898
ised+1.763
work+1.746
ized+1.582
ble+1.541
Neg logit contributions
isk-2.136
ens-1.928
arl-1.859
 bisc-1.778
 comes-1.769
Token her
Token ID607
Feature activation+1.94e+00

Pos logit contributions
inated+1.541
ised+1.465
work+1.463
ized+1.346
el+1.256
Neg logit contributions
isk-1.507
arl-1.299
ens-1.299
 shady-1.211
 bisc-1.211
Token charming
Token ID23332
Feature activation+5.00e-01

Pos logit contributions
inated+3.795
ised+3.578
work+3.498
ized+3.280
el+3.121
Neg logit contributions
isk-3.124
ens-2.709
arl-2.685
 bisc-2.680
 shady-2.637
Token moves
Token ID6100
Feature activation+4.42e-01

Pos logit contributions
inated+0.926
work+0.882
ised+0.881
ized+0.813
el+0.765
Neg logit contributions
isk-0.861
arl-0.749
ens-0.746
 bisc-0.717
coat-0.703
Token with
Token ID351
Feature activation-3.23e-02

Pos logit contributions
inated+3.850
ised+3.561
work+3.541
used+3.263
ized+3.229
Neg logit contributions
isk-3.164
ens-2.865
 shady-2.815
 bisc-2.785
coat-2.755
Token a
Token ID257
Feature activation+8.17e-01

Pos logit contributions
isk+0.063
arl+0.056
ens+0.055
 bisc+0.053
coat+0.052
Neg logit contributions
inated-0.053
work-0.050
ised-0.050
ized-0.046
el-0.043
Token golden
Token ID10861
Feature activation+5.47e-01

Pos logit contributions
inated+1.545
ised+1.481
work+1.477
ized+1.369
el+1.290
Neg logit contributions
isk-1.341
arl-1.194
 bisc-1.170
ens-1.165
 shady-1.138
Token frame
Token ID5739
Feature activation+7.08e-01

Pos logit contributions
inated+1.120
work+1.076
ised+1.071
ized+0.997
izing+0.942
Neg logit contributions
isk-0.824
arl-0.716
ens-0.704
 bisc-0.685
coat-0.668
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
inated+1.169
ised+1.110
work+1.091
ized+0.996
el+0.933
Neg logit contributions
isk-1.350
ens-1.197
arl-1.188
 bisc-1.171
coat-1.150
Token Fasc
Token ID41467
Feature activation+3.83e+00

Pos logit contributions
inated+0.550
work+0.527
ised+0.524
ized+0.483
el+0.455
Neg logit contributions
isk-0.540
arl-0.472
ens-0.471
 bisc-0.450
coat-0.439
Tokeninated
Token ID3898
Feature activation-1.79e-01

Pos logit contributions
work+4.013
ised+3.723
ement+3.564
ized+2.930
used+2.929
Neg logit contributions
isk-10.204
arl-8.934
 fetch-8.844
 shady-8.839
coat-8.704
Token,
Token ID11
Feature activation+8.30e-01

Pos logit contributions
isk+0.319
arl+0.279
ens+0.273
 bisc+0.265
coat+0.258
Neg logit contributions
inated-0.326
work-0.313
ised-0.312
ized-0.287
el-0.273
Token Jerry
Token ID13075
Feature activation+4.58e-01

Pos logit contributions
inated+1.524
work+1.446
ised+1.431
ized+1.315
el+1.257
Neg logit contributions
isk-1.437
ens-1.277
arl-1.238
 bisc-1.202
 shady-1.180
Token and
Token ID290
Feature activation-9.78e-01

Pos logit contributions
inated+0.886
work+0.853
ised+0.847
ized+0.782
izing+0.740
Neg logit contributions
isk-0.747
ens-0.644
arl-0.634
 bisc-0.614
coat-0.599
Token his
Token ID465
Feature activation+1.21e+00

Pos logit contributions
isk+1.669
arl+1.533
ens+1.413
 bisc+1.411
ives+1.397
Neg logit contributions
inated-1.811
work-1.780
ised-1.748
ized-1.597
el-1.528
Token sound
Token ID2128
Feature activation-1.46e-01

Pos logit contributions
inated+1.735
work+1.671
ised+1.664
ized+1.532
ement+1.464
Neg logit contributions
isk-1.547
ens-1.356
arl-1.334
 bisc-1.280
coat-1.257
Token and
Token ID290
Feature activation-3.43e-01

Pos logit contributions
isk+0.246
arl+0.213
ens+0.210
 bisc+0.202
coat+0.197
Neg logit contributions
inated-0.279
work-0.271
ised-0.266
ized-0.248
el-0.235
Token he
Token ID339
Feature activation+8.68e-02

Pos logit contributions
isk+0.625
arl+0.554
ens+0.544
 bisc+0.520
coat+0.515
Neg logit contributions
inated-0.615
work-0.590
ised-0.582
ized-0.538
izing-0.500
Token tried
Token ID3088
Feature activation+1.53e-01

Pos logit contributions
inated+0.172
work+0.165
ised+0.164
ized+0.153
el+0.145
Neg logit contributions
isk-0.138
arl-0.120
ens-0.118
 bisc-0.113
coat-0.110
Token to
Token ID284
Feature activation-1.25e-01

Pos logit contributions
inated+0.300
work+0.287
ised+0.286
ized+0.265
el+0.251
Neg logit contributions
isk-0.248
arl-0.214
ens-0.213
 bisc-0.203
coat-0.197
Token memor
Token ID16181
Feature activation+3.79e+00

Pos logit contributions
isk+0.178
arl+0.152
ens+0.151
 bisc+0.142
coat+0.138
Neg logit contributions
inated-0.270
work-0.259
ised-0.258
ized-0.241
el-0.231
Tokenize
Token ID1096
Feature activation+2.64e-03

Pos logit contributions
work+5.576
inated+4.978
inct+3.861
 horizon+3.645
ised+3.616
Neg logit contributions
isk-8.702
ens-8.133
aid-7.783
 fetch-7.688
Rex-7.625
Token every
Token ID790
Feature activation+2.59e-01

Pos logit contributions
inated+0.005
work+0.004
ised+0.004
ized+0.004
el+0.004
Neg logit contributions
isk-0.005
arl-0.004
ens-0.004
 bisc-0.004
coat-0.004
Token note
Token ID3465
Feature activation-5.09e-02

Pos logit contributions
inated+0.489
work+0.468
ised+0.466
ized+0.431
el+0.406
Neg logit contributions
isk-0.439
arl-0.383
ens-0.381
 bisc-0.364
coat-0.355
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
isk+0.099
arl+0.088
ens+0.087
 bisc+0.084
coat+0.082
Neg logit contributions
inated-0.083
work-0.080
ised-0.078
ized-0.072
el-0.067
Token After
Token ID2293
Feature activation-3.14e-01

Pos logit contributions
inated+0.433
work+0.415
ised+0.413
ized+0.382
el+0.361
Neg logit contributions
isk-0.380
arl-0.329
ens-0.328
 bisc-0.313
coat-0.305
Token!
Token ID0
Feature activation+9.93e-01

Pos logit contributions
inated+1.902
work+1.807
ised+1.800
ized+1.588
ble+1.501
Neg logit contributions
isk-2.267
ens-2.011
arl-1.998
 bisc-1.953
 shady-1.929
Token\n
Token ID198
Feature activation+1.46e-01

Pos logit contributions
inated+1.991
work+1.964
ised+1.924
ized+1.813
izing+1.719
Neg logit contributions
isk-1.561
ens-1.346
arl-1.327
 bisc-1.266
ives-1.224
Token\n
Token ID198
Feature activation+4.86e-02

Pos logit contributions
inated+0.269
work+0.257
ised+0.256
ized+0.236
el+0.223
Neg logit contributions
isk-0.252
arl-0.220
ens-0.219
 bisc-0.209
coat-0.204
Token"
Token ID1
Feature activation+1.54e+00

Pos logit contributions
inated+0.087
ised+0.082
work+0.082
ized+0.076
el+0.071
Neg logit contributions
isk-0.088
arl-0.077
ens-0.077
 bisc-0.073
coat-0.072
TokenWhat
Token ID2061
Feature activation+3.62e-01

Pos logit contributions
inated+2.779
work+2.696
ised+2.639
ized+2.430
ement+2.330
Neg logit contributions
isk-2.798
ens-2.429
 bisc-2.402
arl-2.388
Rex-2.299
Token is
Token ID318
Feature activation+3.79e+00

Pos logit contributions
inated+0.799
work+0.767
ised+0.767
ized+0.718
el+0.681
Neg logit contributions
isk-0.497
ens-0.417
arl-0.413
 bisc-0.393
coat-0.378
Token this
Token ID428
Feature activation+1.90e+00

Pos logit contributions
ised+8.503
inated+8.279
work+8.127
ement+7.999
ized+7.738
Neg logit contributions
ens-4.538
 fetch-4.494
 comes-4.452
 robber-4.427
isk-4.388
Token?"
Token ID1701
Feature activation+1.81e+00

Pos logit contributions
inated+3.278
ised+3.028
work+3.011
ized+2.801
el+2.671
Neg logit contributions
isk-3.431
arl-3.113
 shady-3.047
 bisc-3.017
ens-3.012
Token she
Token ID673
Feature activation+2.16e+00

Pos logit contributions
work+3.739
inated+3.713
ised+3.489
ized+3.272
izing+3.197
Neg logit contributions
isk-2.662
ens-2.519
 Luna-2.332
 bisc-2.264
 comes-2.168
Token asked
Token ID1965
Feature activation+1.64e+00

Pos logit contributions
work+4.007
inated+3.910
ised+3.638
ized+3.450
ement+3.321
Neg logit contributions
isk-3.719
ens-3.445
 Luna-3.234
 bisc-3.223
 comes-3.174
Token herself
Token ID5223
Feature activation+1.54e+00

Pos logit contributions
inated+3.002
work+2.855
ised+2.797
ized+2.562
izing+2.444
Neg logit contributions
isk-2.925
ens-2.641
arl-2.418
 Luna-2.403
 bisc-2.401
Token her
Token ID607
Feature activation+8.21e-01

Pos logit contributions
inated+2.068
work+1.981
ised+1.958
ized+1.814
ble+1.722
Neg logit contributions
isk-1.675
ens-1.467
arl-1.409
 bisc-1.363
 shady-1.330
Token face
Token ID1986
Feature activation+7.33e-01

Pos logit contributions
inated+1.490
work+1.419
ised+1.415
ized+1.301
izing+1.218
Neg logit contributions
isk-1.435
arl-1.273
ens-1.270
 bisc-1.222
coat-1.200
Token.
Token ID13
Feature activation+8.68e-01

Pos logit contributions
inated+1.147
work+1.082
ised+1.082
ized+0.977
izing+0.897
Neg logit contributions
isk-1.462
ens-1.307
arl-1.304
 bisc-1.263
coat-1.241
Token She
Token ID1375
Feature activation+8.25e-01

Pos logit contributions
inated+1.703
work+1.656
ised+1.630
ized+1.529
ble+1.460
Neg logit contributions
isk-1.413
ens-1.226
arl-1.209
 bisc-1.135
ives-1.124
Token was
Token ID373
Feature activation+3.91e-01

Pos logit contributions
inated+1.615
work+1.559
ised+1.523
ized+1.409
izing+1.350
Neg logit contributions
isk-1.331
ens-1.158
arl-1.104
 bisc-1.098
coat-1.085
Token mesmer
Token ID46814
Feature activation+3.78e+00

Pos logit contributions
inated+0.659
work+0.629
ised+0.625
ized+0.571
el+0.537
Neg logit contributions
isk-0.741
arl-0.651
ens-0.650
 bisc-0.625
coat-0.616
Tokenised
Token ID1417
Feature activation+6.32e-01

Pos logit contributions
work+3.744
inated+3.127
inct+2.260
used+1.922
ble+1.853
Neg logit contributions
isk-10.092
ens-9.674
arl-9.606
 shady-9.065
coat-8.905
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
inated+1.054
work+0.992
ised+0.977
ized+0.894
ble+0.839
Neg logit contributions
isk-1.209
ens-1.073
arl-1.067
 bisc-1.015
coat-1.006
Token\n
Token ID198
Feature activation+3.22e-01

Pos logit contributions
inated+2.173
work+2.118
ised+2.077
ized+1.967
ble+1.874
Neg logit contributions
isk-1.717
ens-1.488
arl-1.473
ives-1.368
 bisc-1.367
Token\n
Token ID198
Feature activation+9.25e-01

Pos logit contributions
inated+0.605
work+0.582
ised+0.575
ized+0.532
el+0.503
Neg logit contributions
isk-0.546
arl-0.475
ens-0.472
 bisc-0.449
coat-0.439
TokenWhen
Token ID2215
Feature activation+1.15e+00

Pos logit contributions
inated+1.673
work+1.646
ised+1.558
ized+1.437
el+1.398
Neg logit contributions
isk-1.661
arl-1.437
ens-1.418
 bisc-1.345
ives-1.341
Token the
Token ID262
Feature activation+9.86e-01

Pos logit contributions
inated+2.460
work+2.342
ised+2.294
ized+2.116
ble+2.070
Neg logit contributions
isk-1.999
ens-1.750
arl-1.714
 shady-1.690
 bisc-1.649
Token air
Token ID1633
Feature activation+1.85e-01

Pos logit contributions
inated+1.803
work+1.720
ised+1.690
ized+1.564
ble+1.506
Neg logit contributions
isk-1.692
arl-1.528
ens-1.514
 bisc-1.473
 shady-1.457
Token.
Token ID13
Feature activation+8.73e-01

Pos logit contributions
inated+0.281
work+0.266
ised+0.264
ized+0.239
el+0.222
Neg logit contributions
isk-0.378
arl-0.337
ens-0.336
 bisc-0.324
coat-0.317
Token Jane
Token ID12091
Feature activation+9.37e-01

Pos logit contributions
inated+1.648
work+1.594
ised+1.591
ized+1.494
ble+1.411
Neg logit contributions
isk-1.458
arl-1.264
ens-1.258
 bisc-1.191
ives-1.176
Token was
Token ID373
Feature activation+6.42e-01

Pos logit contributions
inated+1.799
work+1.710
ised+1.702
ized+1.570
izing+1.506
Neg logit contributions
isk-1.537
ens-1.333
arl-1.273
 bisc-1.256
coat-1.241
Token mesmer
Token ID46814
Feature activation+3.78e+00

Pos logit contributions
inated+1.076
work+1.022
ised+1.016
ized+0.928
el+0.875
Neg logit contributions
isk-1.214
arl-1.060
ens-1.056
 bisc-1.022
coat-1.013
Tokenized
Token ID1143
Feature activation+1.48e+00

Pos logit contributions
work+3.275
inated+2.942
inct+1.956
ble+1.925
used+1.740
Neg logit contributions
isk-9.975
arl-9.611
ens-9.454
 shady-9.308
 laps-8.884
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
inated+2.756
work+2.577
ised+2.495
ble+2.295
ized+2.286
Neg logit contributions
isk-2.552
ens-2.325
arl-2.205
 shady-2.120
 comes-2.099
Token The
Token ID383
Feature activation+1.27e+00

Pos logit contributions
inated+2.466
work+2.409
ised+2.397
ized+2.276
ble+2.159
Neg logit contributions
isk-1.840
arl-1.590
ens-1.575
ives-1.479
 bisc-1.448
Token old
Token ID1468
Feature activation+5.67e-01

Pos logit contributions
inated+2.690
work+2.561
ised+2.499
ized+2.303
el+2.267
Neg logit contributions
isk-1.841
ens-1.668
arl-1.618
 bisc-1.574
 shady-1.537
Token man
Token ID582
Feature activation+1.34e+00

Pos logit contributions
inated+1.272
work+1.223
ised+1.215
ized+1.131
el+1.089
Neg logit contributions
isk-0.760
ens-0.649
arl-0.632
 bisc-0.610
coat-0.589
Token the
Token ID262
Feature activation+1.27e+00

Pos logit contributions
inated+3.411
work+3.247
ised+3.157
ized+2.986
izing+2.909
Neg logit contributions
isk-2.769
ens-2.407
arl-2.273
 shady-2.152
 bisc-2.135
Token wheat
Token ID17135
Feature activation+1.03e+00

Pos logit contributions
inated+2.390
work+2.259
ised+2.254
ized+2.058
used+1.990
Neg logit contributions
isk-2.154
ens-1.913
 bisc-1.882
arl-1.874
 shady-1.820
Token and
Token ID290
Feature activation+1.63e+00

Pos logit contributions
inated+1.992
ised+1.908
work+1.888
ized+1.744
used+1.665
Neg logit contributions
isk-1.651
ens-1.436
arl-1.395
 bisc-1.394
 shady-1.362
Token found
Token ID1043
Feature activation+1.16e+00

Pos logit contributions
inated+3.368
work+3.331
ised+3.161
ement+2.963
el+2.938
Neg logit contributions
isk-2.380
ens-2.110
 shady-1.959
 bisc-1.946
 comes-1.908
Token himself
Token ID2241
Feature activation+1.73e+00

Pos logit contributions
inated+1.966
work+1.853
ised+1.843
ized+1.695
izing+1.610
Neg logit contributions
isk-2.176
ens-1.928
arl-1.903
 bisc-1.866
 shady-1.802
Token mesmer
Token ID46814
Feature activation+3.77e+00

Pos logit contributions
inated+3.007
work+2.939
ised+2.898
ized+2.603
ement+2.540
Neg logit contributions
isk-3.136
ens-2.667
 shady-2.666
 bisc-2.662
arl-2.621
Tokenised
Token ID1417
Feature activation+1.62e+00

Pos logit contributions
work+4.844
inated+4.122
used+3.120
inct+2.933
astically+2.837
Neg logit contributions
isk-9.220
arl-8.690
ens-8.239
 shady-8.155
aks-7.800
Token by
Token ID416
Feature activation+8.79e-01

Pos logit contributions
inated+2.786
work+2.563
ised+2.491
ized+2.277
used+2.161
Neg logit contributions
isk-3.030
ens-2.724
arl-2.619
 shady-2.576
 bisc-2.561
Token its
Token ID663
Feature activation+2.12e+00

Pos logit contributions
inated+1.616
work+1.553
ised+1.532
ized+1.411
ement+1.324
Neg logit contributions
isk-1.529
ens-1.311
arl-1.310
 bisc-1.249
 shady-1.243
Token beauty
Token ID8737
Feature activation+1.23e+00

Pos logit contributions
inated+3.840
work+3.735
ised+3.679
ized+3.350
used+3.260
Neg logit contributions
isk-3.717
 shady-3.349
 bisc-3.268
arl-3.144
ens-3.105
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
inated+2.036
ised+1.898
work+1.876
ized+1.706
used+1.655
Neg logit contributions
isk-2.298
ens-2.107
arl-2.076
 bisc-2.019
 shady-2.015
Token and
Token ID290
Feature activation-8.00e-01

Pos logit contributions
inated+0.268
work+0.256
ised+0.254
ized+0.232
el+0.218
Neg logit contributions
isk-0.304
arl-0.268
ens-0.267
 bisc-0.257
coat-0.251
Token graceful
Token ID44363
Feature activation+6.83e-03

Pos logit contributions
isk+1.638
arl+1.479
ens+1.419
 bisc+1.385
ives+1.376
Neg logit contributions
inated-1.256
ised-1.219
work-1.217
ized-1.102
izing-1.028
Token that
Token ID326
Feature activation-4.98e-01

Pos logit contributions
inated+0.011
work+0.010
ised+0.010
ized+0.009
el+0.009
Neg logit contributions
isk-0.014
arl-0.012
ens-0.012
 bisc-0.012
coat-0.011
Token everyone
Token ID2506
Feature activation+9.98e-01

Pos logit contributions
isk+0.994
arl+0.895
ens+0.855
 bisc+0.834
coat+0.820
Neg logit contributions
inated-0.798
work-0.775
ised-0.760
ized-0.696
el-0.662
Token was
Token ID373
Feature activation+8.02e-01

Pos logit contributions
inated+1.959
ised+1.868
work+1.845
ized+1.718
ement+1.647
Neg logit contributions
isk-1.561
ens-1.384
arl-1.328
 comes-1.284
coat-1.270
Token mesmer
Token ID46814
Feature activation+3.77e+00

Pos logit contributions
inated+1.422
ised+1.344
work+1.343
ized+1.231
el+1.172
Neg logit contributions
isk-1.434
arl-1.260
ens-1.238
coat-1.202
 bisc-1.188
Tokenized
Token ID1143
Feature activation+1.07e+00

Pos logit contributions
work+3.940
inated+3.748
used+2.715
inct+2.591
ble+2.356
Neg logit contributions
isk-9.649
ens-9.129
arl-9.124
aks-8.348
 shady-8.264
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
inated+1.778
ised+1.640
work+1.632
ized+1.482
ble+1.431
Neg logit contributions
isk-2.075
ens-1.894
arl-1.834
 comes-1.737
 bisc-1.733
Token When
Token ID1649
Feature activation+1.46e+00

Pos logit contributions
inated+2.844
work+2.825
ised+2.765
ized+2.641
ble+2.564
Neg logit contributions
isk-2.163
ens-1.900
arl-1.832
 bisc-1.730
 Luna-1.721
Token the
Token ID262
Feature activation+4.91e-01

Pos logit contributions
inated+2.974
work+2.799
ised+2.774
ized+2.582
ement+2.475
Neg logit contributions
isk-2.193
ens-2.077
 shady-1.914
 bisc-1.893
 comes-1.889
Token song
Token ID3496
Feature activation+5.07e-01

Pos logit contributions
inated+0.924
work+0.876
ised+0.873
ized+0.805
el+0.762
Neg logit contributions
isk-0.833
arl-0.735
ens-0.735
 bisc-0.704
coat-0.687
Token little
Token ID1310
Feature activation+1.15e+00

Pos logit contributions
inated+2.745
ised+2.634
work+2.598
ized+2.450
used+2.368
Neg logit contributions
isk-1.820
arl-1.592
ens-1.561
 bisc-1.506
 shady-1.468
Token girl
Token ID2576
Feature activation+9.79e-01

Pos logit contributions
inated+2.689
work+2.562
ised+2.561
ized+2.425
ement+2.331
Neg logit contributions
isk-1.390
ens-1.273
arl-1.157
 bisc-1.131
coat-1.122
Token kept
Token ID4030
Feature activation+9.20e-01

Pos logit contributions
inated+2.084
work+2.021
ised+2.008
ized+1.862
izing+1.820
Neg logit contributions
isk-1.393
ens-1.196
 bisc-1.135
coat-1.093
arl-1.090
Token walking
Token ID6155
Feature activation+3.32e-01

Pos logit contributions
inated+1.770
ised+1.703
work+1.694
ized+1.564
ement+1.488
Neg logit contributions
isk-1.518
ens-1.311
arl-1.301
 bisc-1.264
 shady-1.237
Token towards
Token ID3371
Feature activation+4.11e-01

Pos logit contributions
inated+0.612
work+0.585
ised+0.580
ized+0.534
el+0.504
Neg logit contributions
isk-0.582
ens-0.509
arl-0.507
 bisc-0.485
coat-0.474
Token the
Token ID262
Feature activation+3.74e+00

Pos logit contributions
inated+0.738
work+0.703
ised+0.700
ized+0.643
izing+0.604
Neg logit contributions
isk-0.737
ens-0.644
arl-0.639
 bisc-0.612
coat-0.598
Token rainbow
Token ID27223
Feature activation+5.81e-01

Pos logit contributions
inated+7.721
ised+7.543
used+7.402
work+6.930
ement+6.851
Neg logit contributions
 shady-5.588
 wake-5.110
 welcome-4.868
isk-4.850
 bisc-4.703
Token and
Token ID290
Feature activation+1.94e-01

Pos logit contributions
inated+0.969
ised+0.918
work+0.911
ized+0.832
ble+0.775
Neg logit contributions
isk-1.103
ens-0.978
arl-0.964
 bisc-0.936
coat-0.923
Token soon
Token ID2582
Feature activation-5.33e-01

Pos logit contributions
inated+0.361
work+0.345
ised+0.343
ized+0.317
el+0.299
Neg logit contributions
isk-0.335
arl-0.291
ens-0.290
 bisc-0.277
coat-0.271
Token she
Token ID673
Feature activation+5.59e-01

Pos logit contributions
isk+0.985
arl+0.889
ens+0.834
 bisc+0.817
coat+0.805
Neg logit contributions
inated-0.932
ised-0.889
work-0.882
ized-0.826
el-0.768
Token realized
Token ID6939
Feature activation+1.71e+00

Pos logit contributions
inated+1.257
work+1.211
ised+1.204
ized+1.120
izing+1.078
Neg logit contributions
isk-0.751
ens-0.620
arl-0.607
 bisc-0.585
coat-0.568
Tokenly
Token ID306
Feature activation+4.72e-01

Pos logit contributions
inated+0.882
work+0.843
ised+0.829
ized+0.761
used+0.721
Neg logit contributions
isk-0.825
ens-0.729
arl-0.716
 bisc-0.687
coat-0.672
Token and
Token ID290
Feature activation-8.09e-02

Pos logit contributions
inated+0.835
work+0.795
ised+0.794
ized+0.728
el+0.685
Neg logit contributions
isk-0.842
ens-0.741
arl-0.737
 bisc-0.714
coat-0.696
Token the
Token ID262
Feature activation+7.94e-01

Pos logit contributions
isk+0.155
arl+0.138
ens+0.136
 bisc+0.130
coat+0.128
Neg logit contributions
inated-0.136
work-0.130
ised-0.129
ized-0.119
el-0.111
Token bunny
Token ID44915
Feature activation+1.10e+00

Pos logit contributions
inated+1.495
work+1.431
ised+1.419
ized+1.319
ement+1.246
Neg logit contributions
isk-1.338
ens-1.191
arl-1.164
 bisc-1.137
 shady-1.116
Token was
Token ID373
Feature activation+2.90e-01

Pos logit contributions
inated+2.074
work+2.015
ised+1.974
ized+1.814
ement+1.749
Neg logit contributions
isk-1.839
ens-1.586
 bisc-1.500
coat-1.465
 shady-1.464
Token mesmer
Token ID46814
Feature activation+3.74e+00

Pos logit contributions
inated+0.499
work+0.477
ised+0.473
ized+0.434
el+0.408
Neg logit contributions
isk-0.540
arl-0.475
ens-0.472
 bisc-0.454
coat-0.445
Tokenised
Token ID1417
Feature activation+1.45e+00

Pos logit contributions
work+4.345
inated+3.646
used+2.909
inct+2.576
astically+2.299
Neg logit contributions
isk-9.677
ens-8.983
arl-8.946
 shady-8.523
aks-8.120
Token.
Token ID13
Feature activation+7.23e-01

Pos logit contributions
inated+2.464
work+2.272
ised+2.238
ized+2.059
ble+1.973
Neg logit contributions
isk-2.760
ens-2.471
arl-2.391
 shady-2.316
coat-2.260
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
inated+1.386
work+1.371
ised+1.337
ized+1.253
ble+1.186
Neg logit contributions
isk-1.191
ens-1.035
arl-1.013
 bisc-0.971
 shady-0.943
Token
Token ID198
Feature activation+3.40e-01

Pos logit contributions
inated+0.282
work+0.270
ised+0.268
ized+0.247
el+0.234
Neg logit contributions
isk-0.265
arl-0.231
ens-0.230
 bisc-0.220
coat-0.214
TokenThe
Token ID464
Feature activation+3.34e-01

Pos logit contributions
inated+0.589
work+0.569
ised+0.559
ized+0.513
el+0.485
Neg logit contributions
isk-0.636
arl-0.558
ens-0.556
 bisc-0.533
coat-0.522
Token Mom
Token ID11254
Feature activation-6.52e-01

Pos logit contributions
isk+0.983
arl+0.877
ens+0.841
 bisc+0.830
coat+0.816
Neg logit contributions
inated-0.917
work-0.889
ised-0.875
ized-0.798
el-0.769
Tokenmy
Token ID1820
Feature activation-2.63e-01

Pos logit contributions
isk+0.760
arl+0.637
ens+0.604
 bisc+0.585
coat+0.560
Neg logit contributions
inated-1.545
work-1.537
ised-1.480
ized-1.390
el-1.379
Token,
Token ID11
Feature activation+5.18e-01

Pos logit contributions
isk+0.438
arl+0.382
ens+0.376
 bisc+0.365
coat+0.355
Neg logit contributions
inated-0.502
work-0.485
ised-0.475
ized-0.439
el-0.420
Token "
Token ID366
Feature activation+1.74e+00

Pos logit contributions
inated+1.151
work+1.109
ised+1.107
ized+1.040
el+0.989
Neg logit contributions
isk-0.697
arl-0.590
ens-0.583
 bisc-0.547
coat-0.528
TokenWhat
Token ID2061
Feature activation+1.87e-01

Pos logit contributions
inated+3.529
ised+3.364
work+3.285
ized+3.137
izing+2.959
Neg logit contributions
isk-2.755
 bisc-2.378
arl-2.337
ens-2.315
coat-2.254
Token is
Token ID318
Feature activation+3.73e+00

Pos logit contributions
inated+0.401
work+0.386
ised+0.384
ized+0.359
el+0.342
Neg logit contributions
isk-0.267
arl-0.226
ens-0.224
 bisc-0.212
coat-0.206
Token that
Token ID326
Feature activation+1.64e+00

Pos logit contributions
ised+8.098
inated+7.743
ized+7.521
ement+7.331
work+7.137
Neg logit contributions
 comes-5.037
 Luna-4.753
 shady-4.746
ch-4.733
arl-4.703
Token?
Token ID30
Feature activation+1.92e+00

Pos logit contributions
inated+2.823
ised+2.677
work+2.589
ized+2.488
izing+2.249
Neg logit contributions
isk-2.908
arl-2.688
ens-2.626
 shady-2.596
 bisc-2.575
Token Is
Token ID1148
Feature activation+2.57e+00

Pos logit contributions
ised+3.948
inated+3.873
ized+3.787
work+3.643
izing+3.520
Neg logit contributions
isk-2.769
arl-2.464
ens-2.339
 bisc-2.304
ives-2.304
Token it
Token ID340
Feature activation+1.42e+00

Pos logit contributions
inated+4.480
ised+4.431
work+4.148
ized+3.985
izing+3.788
Neg logit contributions
isk-4.400
 comes-4.262
ens-4.231
 Luna-4.049
ives-3.999
Token a
Token ID257
Feature activation+1.86e+00

Pos logit contributions
inated+2.477
ised+2.401
work+2.272
ized+2.196
ement+2.051
Neg logit contributions
isk-2.535
arl-2.244
ens-2.217
 shady-2.180
 bisc-2.154
Token music
Token ID2647
Feature activation+3.15e-01

Pos logit contributions
inated+1.416
ised+1.329
work+1.327
ized+1.218
used+1.159
Neg logit contributions
isk-1.426
ens-1.254
arl-1.246
 bisc-1.224
coat-1.186
Token playing
Token ID2712
Feature activation-7.35e-01

Pos logit contributions
inated+0.559
work+0.531
ised+0.531
ized+0.487
el+0.459
Neg logit contributions
isk-0.564
arl-0.494
ens-0.493
 bisc-0.472
coat-0.462
Token.
Token ID13
Feature activation+4.16e-01

Pos logit contributions
isk+1.505
arl+1.367
ens+1.337
 bisc+1.292
ives+1.250
Neg logit contributions
inated-1.118
work-1.083
ised-1.079
ized-0.992
izing-0.918
Token She
Token ID1375
Feature activation+9.09e-01

Pos logit contributions
inated+0.785
work+0.757
ised+0.751
ized+0.699
el+0.656
Neg logit contributions
isk-0.703
ens-0.611
arl-0.611
 bisc-0.579
coat-0.567
Token was
Token ID373
Feature activation+6.54e-01

Pos logit contributions
inated+1.814
work+1.740
ised+1.714
ized+1.597
izing+1.533
Neg logit contributions
isk-1.434
ens-1.218
arl-1.187
 bisc-1.153
coat-1.141
Token mesmer
Token ID46814
Feature activation+3.71e+00

Pos logit contributions
inated+1.075
ised+1.024
work+1.019
ized+0.929
el+0.874
Neg logit contributions
isk-1.265
arl-1.106
ens-1.102
 bisc-1.063
coat-1.060
Tokenized
Token ID1143
Feature activation+1.45e+00

Pos logit contributions
work+3.552
inated+3.432
inct+2.391
used+2.288
ble+2.222
Neg logit contributions
isk-9.709
ens-9.054
arl-8.969
 shady-8.538
 laps-8.409
Token.
Token ID13
Feature activation+9.05e-01

Pos logit contributions
inated+2.585
work+2.382
ised+2.350
ized+2.128
ble+2.127
Neg logit contributions
isk-2.652
ens-2.403
arl-2.273
 shady-2.187
 comes-2.174
Token
Token ID198
Feature activation+3.34e-01

Pos logit contributions
inated+1.786
work+1.743
ised+1.719
ized+1.624
izing+1.528
Neg logit contributions
isk-1.451
ens-1.259
arl-1.249
 bisc-1.173
ives-1.167
Token
Token ID198
Feature activation+2.10e-01

Pos logit contributions
inated+0.629
work+0.606
ised+0.597
ized+0.553
el+0.523
Neg logit contributions
isk-0.565
arl-0.491
ens-0.488
 bisc-0.465
coat-0.454
TokenShe
Token ID3347
Feature activation+1.08e+00

Pos logit contributions
inated+0.361
work+0.345
ised+0.342
ized+0.313
el+0.295
Neg logit contributions
isk-0.393
arl-0.346
ens-0.345
 bisc-0.331
coat-0.324
Token Everyone
Token ID11075
Feature activation+1.31e+00

Pos logit contributions
inated+1.527
work+1.504
ised+1.478
ized+1.391
ble+1.304
Neg logit contributions
isk-1.425
ens-1.258
arl-1.245
 bisc-1.199
 shady-1.181
Token around
Token ID1088
Feature activation+1.40e-01

Pos logit contributions
inated+2.600
ised+2.492
work+2.454
ized+2.310
ement+2.222
Neg logit contributions
isk-1.992
ens-1.817
 comes-1.713
arl-1.680
 bisc-1.673
Token the
Token ID262
Feature activation-4.42e-01

Pos logit contributions
inated+0.266
work+0.255
ised+0.254
ized+0.235
el+0.222
Neg logit contributions
isk-0.235
arl-0.204
ens-0.202
 bisc-0.193
coat-0.188
Token table
Token ID3084
Feature activation-1.46e-02

Pos logit contributions
isk+0.731
arl+0.632
ens+0.612
 bisc+0.584
ives+0.572
Neg logit contributions
inated-0.868
ised-0.831
work-0.827
ized-0.774
el-0.735
Token was
Token ID373
Feature activation+1.77e-01

Pos logit contributions
isk+0.026
arl+0.023
ens+0.022
 bisc+0.021
coat+0.021
Neg logit contributions
inated-0.026
work-0.025
ised-0.025
ized-0.023
el-0.022
Token mesmer
Token ID46814
Feature activation+3.70e+00

Pos logit contributions
inated+0.319
work+0.305
ised+0.303
ized+0.279
el+0.263
Neg logit contributions
isk-0.314
arl-0.275
ens-0.274
 bisc-0.262
coat-0.256
Tokenized
Token ID1143
Feature activation+1.45e+00

Pos logit contributions
work+4.426
inated+4.214
used+3.511
inct+3.262
ble+2.757
Neg logit contributions
isk-8.834
ens-8.272
arl-8.165
 shady-7.780
aks-7.563
Token by
Token ID416
Feature activation+7.75e-01

Pos logit contributions
inated+2.564
ised+2.359
work+2.323
ized+2.140
ble+2.109
Neg logit contributions
isk-2.641
ens-2.437
arl-2.309
 comes-2.226
 shady-2.196
Token its
Token ID663
Feature activation+7.12e-01

Pos logit contributions
inated+1.369
ised+1.308
work+1.299
ized+1.200
ement+1.122
Neg logit contributions
isk-1.408
arl-1.222
ens-1.221
 bisc-1.158
 shady-1.148
Token beauty
Token ID8737
Feature activation+8.10e-01

Pos logit contributions
inated+1.388
work+1.332
ised+1.324
ized+1.230
used+1.160
Neg logit contributions
isk-1.164
ens-0.994
arl-0.989
 bisc-0.962
 shady-0.940
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
inated+1.287
ised+1.210
work+1.191
ized+1.093
used+1.037
Neg logit contributions
isk-1.584
ens-1.437
arl-1.421
 bisc-1.368
coat-1.353
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
isk+0.272
arl+0.243
ens+0.240
 bisc+0.234
coat+0.230
Neg logit contributions
inated-0.176
work-0.168
ised-0.166
ized-0.149
el-0.140
Token
Token ID198
Feature activation-5.02e-01

Pos logit contributions
isk+0.569
ens+0.486
arl+0.484
 bisc+0.471
coat+0.459
Neg logit contributions
inated-0.568
ised-0.536
work-0.532
ized-0.487
el-0.463
Token
Token ID198
Feature activation-7.49e-01

Pos logit contributions
isk+0.944
ens+0.819
arl+0.804
 bisc+0.795
coat+0.788
Neg logit contributions
inated-0.856
ised-0.816
work-0.795
ized-0.737
el-0.706
Token"
Token ID1
Feature activation+6.86e-01

Pos logit contributions
isk+1.243
ens+1.038
arl+1.019
 bisc+1.005
coat+0.987
Neg logit contributions
inated-1.483
ised-1.440
work-1.383
ized-1.315
el-1.258
TokenIn
Token ID818
Feature activation-1.44e-01

Pos logit contributions
inated+1.170
work+1.114
ised+1.106
ized+1.009
el+0.972
Neg logit contributions
isk-1.284
arl-1.153
ens-1.142
 bisc-1.091
coat-1.076
Token a
Token ID257
Feature activation-3.80e+00

Pos logit contributions
isk+0.266
arl+0.233
ens+0.233
 bisc+0.224
coat+0.220
Neg logit contributions
inated-0.249
work-0.240
ised-0.238
ized-0.217
el-0.208
Token box
Token ID3091
Feature activation-7.58e-01

Pos logit contributions
isk+8.105
ens+7.158
ives+7.111
arl+6.999
coat+6.663
Neg logit contributions
 top-6.305
inated-6.099
ised-6.056
 sunset-5.568
work-5.545
Token under
Token ID739
Feature activation+1.84e-01

Pos logit contributions
isk+1.508
arl+1.324
ens+1.304
 bisc+1.282
coat+1.237
Neg logit contributions
inated-1.208
work-1.175
ised-1.166
ized-1.084
el-1.027
Token the
Token ID262
Feature activation-1.33e+00

Pos logit contributions
inated+0.318
work+0.304
ised+0.301
ized+0.276
el+0.260
Neg logit contributions
isk-0.342
arl-0.301
ens-0.300
 bisc-0.287
coat-0.281
Token tree
Token ID5509
Feature activation-7.18e-01

Pos logit contributions
isk+2.393
arl+1.962
ens+1.956
ives+1.896
coat+1.837
Neg logit contributions
inated-2.428
work-2.399
ised-2.215
el-2.119
ized-2.113
Token."
Token ID526
Feature activation+5.81e-01

Pos logit contributions
isk+1.472
arl+1.327
ens+1.304
 bisc+1.265
coat+1.230
Neg logit contributions
inated-1.099
work-1.050
ised-1.034
ized-0.940
el-0.900
Token He
Token ID679
Feature activation+3.70e-01

Pos logit contributions
isk+3.231
coat+2.754
ens+2.694
 bisc+2.641
arl+2.565
Neg logit contributions
inated-3.925
ised-3.700
work-3.504
ized-3.190
el-3.177
Token thinks
Token ID6834
Feature activation-1.86e+00

Pos logit contributions
inated+0.776
work+0.750
ised+0.745
ized+0.694
el+0.660
Neg logit contributions
isk-0.549
ens-0.466
arl-0.465
 bisc-0.441
coat-0.425
Token about
Token ID546
Feature activation-1.60e-01

Pos logit contributions
isk+4.001
arl+3.440
 bisc+3.407
coat+3.405
ens+3.284
Neg logit contributions
work-2.692
inated-2.692
ised-2.592
ized-2.368
ble-2.342
Token what
Token ID644
Feature activation-1.27e+00

Pos logit contributions
isk+0.274
arl+0.242
ens+0.239
 bisc+0.230
coat+0.227
Neg logit contributions
inated-0.296
work-0.285
ised-0.281
ized-0.259
el-0.248
Token Tom
Token ID4186
Feature activation-1.24e+00

Pos logit contributions
isk+2.264
arl+2.005
coat+1.841
ens+1.827
 bisc+1.816
Neg logit contributions
work-2.308
inated-2.300
ised-2.135
el-1.964
ized-1.956
Token and
Token ID290
Feature activation-3.65e+00

Pos logit contributions
isk+2.341
arl+2.145
 bisc+1.976
 shady+1.971
coat+1.970
Neg logit contributions
inated-2.022
work-2.007
ised-1.933
ized-1.803
el-1.787
Token Sue
Token ID26089
Feature activation-1.76e-03

Pos logit contributions
ives+5.754
isk+5.625
coat+5.564
arl+5.456
aks+5.229
Neg logit contributions
inated-6.782
el-6.537
work-6.423
ised-6.247
 sunset-5.766
Token said
Token ID531
Feature activation+5.31e-02

Pos logit contributions
isk+0.003
arl+0.003
ens+0.003
 bisc+0.003
coat+0.003
Neg logit contributions
inated-0.003
work-0.003
ised-0.003
ized-0.002
el-0.002
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
inated+0.079
work+0.075
ised+0.074
ized+0.067
el+0.062
Neg logit contributions
isk-0.111
arl-0.099
ens-0.099
 bisc-0.095
coat-0.094
Token He
Token ID679
Feature activation+4.69e-01

Pos logit contributions
isk+1.863
ens+1.556
coat+1.525
arl+1.502
 bisc+1.492
Neg logit contributions
inated-2.499
ised-2.370
work-2.313
ized-2.115
ble-2.060
Token realizes
Token ID23660
Feature activation+8.42e-01

Pos logit contributions
inated+0.987
work+0.953
ised+0.947
ized+0.882
izing+0.840
Neg logit contributions
isk-0.697
ens-0.592
arl-0.588
 bisc-0.560
coat-0.537
Token happy
Token ID3772
Feature activation-4.36e-01

Pos logit contributions
isk+2.031
ens+1.815
arl+1.805
ives+1.732
 bisc+1.706
Neg logit contributions
inated-2.106
work-2.021
ised-1.997
ized-1.871
ble-1.836
Token to
Token ID284
Feature activation-1.18e+00

Pos logit contributions
isk+0.881
arl+0.785
ens+0.769
 bisc+0.747
coat+0.739
Neg logit contributions
inated-0.690
work-0.669
ised-0.651
ized-0.597
el-0.557
Token help
Token ID1037
Feature activation-2.57e+00

Pos logit contributions
isk+1.597
ens+1.361
arl+1.360
coat+1.316
ives+1.292
Neg logit contributions
inated-2.557
work-2.498
ised-2.398
el-2.260
ble-2.257
Token.
Token ID13
Feature activation-2.02e+00

Pos logit contributions
isk+4.177
arl+4.026
ives+3.823
 bisc+3.702
coat+3.587
Neg logit contributions
el-4.565
work-4.541
inated-4.509
ised-4.428
ble-4.395
Token Tom
Token ID4186
Feature activation-1.36e+00

Pos logit contributions
isk+3.879
coat+3.307
ens+3.278
 bisc+3.161
arl+3.133
Neg logit contributions
inated-3.451
ised-3.242
work-3.021
ble-2.725
ized-2.713
Token and
Token ID290
Feature activation-3.65e+00

Pos logit contributions
isk+2.725
arl+2.603
ens+2.357
 bisc+2.302
 germ+2.246
Neg logit contributions
inated-2.134
work-2.028
ised-1.986
ized-1.853
ble-1.707
Token Lily
Token ID20037
Feature activation-2.37e-01

Pos logit contributions
isk+5.867
ives+5.786
coat+5.701
arl+5.551
aks+5.435
Neg logit contributions
inated-6.854
ised-6.508
work-6.356
 top-5.930
 sunset-5.906
Token were
Token ID547
Feature activation-6.23e-01

Pos logit contributions
isk+0.357
arl+0.311
ens+0.298
 bisc+0.284
coat+0.276
Neg logit contributions
inated-0.489
work-0.472
ised-0.466
ized-0.438
el-0.413
Token happy
Token ID3772
Feature activation+3.81e-01

Pos logit contributions
isk+1.024
arl+0.895
ens+0.888
 bisc+0.849
ives+0.837
Neg logit contributions
inated-1.216
work-1.166
ised-1.160
ized-1.087
ble-1.055
Token to
Token ID284
Feature activation-4.14e-01

Pos logit contributions
inated+0.623
ised+0.588
work+0.588
ized+0.535
el+0.500
Neg logit contributions
isk-0.747
ens-0.663
arl-0.663
 bisc-0.636
coat-0.620
Token be
Token ID307
Feature activation-1.32e+00

Pos logit contributions
isk+0.509
arl+0.419
ens+0.418
 bisc+0.386
coat+0.386
Neg logit contributions
inated-0.970
work-0.942
ised-0.923
ized-0.866
el-0.848
Token.
Token ID13
Feature activation+4.15e-01

Pos logit contributions
isk+0.849
arl+0.765
ens+0.761
 bisc+0.739
ives+0.719
Neg logit contributions
inated-0.564
work-0.538
ised-0.538
ized-0.487
ble-0.450
Token
Token ID198
Feature activation-3.77e-01

Pos logit contributions
inated+0.793
work+0.768
ised+0.760
ized+0.706
el+0.668
Neg logit contributions
isk-0.695
ens-0.600
arl-0.598
 bisc-0.573
coat-0.557
Token
Token ID198
Feature activation-5.29e-01

Pos logit contributions
isk+0.694
arl+0.614
ens+0.607
 bisc+0.586
coat+0.573
Neg logit contributions
inated-0.649
ised-0.620
work-0.611
ized-0.563
el-0.534
TokenIn
Token ID818
Feature activation-3.72e-01

Pos logit contributions
isk+0.858
arl+0.760
ens+0.757
 bisc+0.714
ives+0.693
Neg logit contributions
inated-1.049
ised-0.997
work-0.966
ized-0.923
izing-0.867
Token the
Token ID262
Feature activation-3.00e+00

Pos logit contributions
isk+0.713
arl+0.633
ens+0.623
 bisc+0.606
ives+0.604
Neg logit contributions
inated-0.624
ised-0.599
work-0.599
ized-0.541
el-0.520
Token middle
Token ID3504
Feature activation-3.61e+00

Pos logit contributions
isk+6.829
ives+6.312
arl+6.245
coat+6.167
aid+6.112
Neg logit contributions
 top-4.195
ised-4.075
inated-3.995
 sunset-3.882
work-3.835
Token of
Token ID286
Feature activation-8.85e-01

Pos logit contributions
isk+7.811
arl+6.973
coat+6.684
 robber+6.499
 bisc+6.467
Neg logit contributions
ised-6.452
el-6.190
ement-5.797
work-5.675
ized-5.652
Token the
Token ID262
Feature activation-2.14e+00

Pos logit contributions
isk+1.776
arl+1.626
ives+1.600
ens+1.596
 bisc+1.570
Neg logit contributions
inated-1.376
ised-1.330
work-1.328
ized-1.174
el-1.154
Token night
Token ID1755
Feature activation-1.07e+00

Pos logit contributions
isk+4.195
arl+3.678
ives+3.577
aid+3.576
coat+3.525
Neg logit contributions
inated-3.663
ised-3.599
work-3.480
 top-3.283
ized-3.228
Token,
Token ID11
Feature activation+1.19e+00

Pos logit contributions
isk+2.112
arl+1.852
ens+1.789
 bisc+1.730
coat+1.714
Neg logit contributions
inated-1.803
work-1.767
ised-1.744
ized-1.581
el-1.500
Token Tim
Token ID5045
Feature activation-1.93e+00

Pos logit contributions
inated+2.655
work+2.561
ised+2.495
ized+2.341
ement+2.293
Neg logit contributions
isk-1.595
ens-1.420
arl-1.309
 shady-1.265
 bisc-1.263
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
inated+1.314
work+1.246
ised+1.209
ized+1.094
ble+1.014
Neg logit contributions
isk-1.768
ens-1.621
arl-1.585
 bisc-1.530
coat-1.508
Token
Token ID220
Feature activation-3.59e-01

Pos logit contributions
isk+3.484
arl+3.129
ives+2.960
 bisc+2.944
aks+2.936
Neg logit contributions
inated-2.892
ised-2.733
work-2.479
ized-2.357
izing-2.202
Token
Token ID198
Feature activation-1.03e+00

Pos logit contributions
isk+0.652
ens+0.573
arl+0.572
 bisc+0.559
coat+0.544
Neg logit contributions
inated-0.620
ised-0.592
work-0.586
ized-0.539
el-0.512
Token
Token ID198
Feature activation-2.11e+00

Pos logit contributions
isk+2.056
arl+1.817
ens+1.789
 bisc+1.769
coat+1.728
Neg logit contributions
inated-1.608
ised-1.569
work-1.474
ized-1.389
el-1.312
TokenMax
Token ID11518
Feature activation+8.44e-01

Pos logit contributions
isk+2.863
arl+2.453
 bisc+2.415
ens+2.355
ives+2.325
Neg logit contributions
ised-4.714
inated-4.681
ized-4.285
izing-4.187
work-4.067
Token asked
Token ID1965
Feature activation-3.58e+00

Pos logit contributions
inated+1.599
work+1.565
ised+1.529
ized+1.420
ement+1.374
Neg logit contributions
isk-1.427
ens-1.245
arl-1.168
 bisc-1.159
coat-1.134
Token,
Token ID11
Feature activation-3.19e-01

Pos logit contributions
 bisc+7.885
isk+7.780
coat+7.710
arl+7.628
ives+7.410
Neg logit contributions
ised-4.785
inated-4.510
 curiously-4.399
work-4.263
el-4.034
Token "
Token ID366
Feature activation-1.59e+00

Pos logit contributions
isk+0.473
arl+0.400
ens+0.399
 bisc+0.382
ives+0.370
Neg logit contributions
inated-0.670
work-0.644
ised-0.638
ized-0.591
ble-0.565
TokenL
Token ID43
Feature activation-2.06e+00

Pos logit contributions
isk+3.363
arl+3.180
ens+3.009
ives+3.000
 bisc+2.963
Neg logit contributions
inated-2.208
ised-2.122
work-2.071
ized-1.912
izing-1.742
Tokenily
Token ID813
Feature activation-1.69e+00

Pos logit contributions
isk+2.504
 bisc+2.237
 shady+2.191
ens+2.050
arl+2.038
Neg logit contributions
inated-4.518
ised-4.491
work-4.426
ized-4.303
el-4.271
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
isk+3.420
arl+3.101
 bisc+3.064
ives+2.977
 shady+2.945
Neg logit contributions
inated-2.590
work-2.487
ised-2.388
el-2.227
used-2.152
Token.
Token ID13
Feature activation-2.05e+00

Pos logit contributions
isk+2.488
arl+2.275
ens+2.216
 bisc+2.123
coat+2.073
Neg logit contributions
inated-1.666
work-1.594
ised-1.548
ized-1.416
ement-1.304
Token
Token ID198
Feature activation-1.53e+00

Pos logit contributions
isk+4.364
arl+3.820
ives+3.781
 germ+3.708
coat+3.690
Neg logit contributions
inated-3.047
ised-2.928
ized-2.398
work-2.360
 top-2.359
Token
Token ID198
Feature activation-1.88e+00

Pos logit contributions
isk+3.250
ens+2.859
arl+2.804
 bisc+2.785
 germ+2.758
Neg logit contributions
inated-2.232
ised-2.183
work-1.963
ized-1.855
el-1.801
Token"
Token ID1
Feature activation-1.96e+00

Pos logit contributions
isk+3.671
arl+3.199
ens+3.197
ives+3.164
 germ+3.085
Neg logit contributions
ised-3.100
inated-3.075
ized-2.701
work-2.606
izing-2.550
TokenSpot
Token ID32565
Feature activation-1.27e+00

Pos logit contributions
isk+3.423
arl+3.078
ens+2.815
ives+2.762
 germ+2.714
Neg logit contributions
inated-3.657
ised-3.416
work-3.346
ized-3.083
el-2.968
Token,
Token ID11
Feature activation-3.56e+00

Pos logit contributions
isk+2.395
arl+2.240
ives+2.079
ens+2.062
 bisc+2.015
Neg logit contributions
inated-2.163
work-2.066
ised-1.988
ized-1.816
el-1.785
Token come
Token ID1282
Feature activation-5.02e-01

Pos logit contributions
isk+7.877
arl+7.304
ives+7.276
aks+7.137
coat+7.095
Neg logit contributions
inated-4.972
work-4.713
 top-4.326
ised-4.264
ble-4.195
Token here
Token ID994
Feature activation+2.52e-01

Pos logit contributions
isk+0.912
ens+0.813
arl+0.804
 bisc+0.772
ives+0.762
Neg logit contributions
inated-0.882
work-0.854
ised-0.824
ized-0.754
el-0.743
Token!"
Token ID2474
Feature activation-6.23e-01

Pos logit contributions
inated+0.443
work+0.422
ised+0.420
ized+0.386
el+0.363
Neg logit contributions
isk-0.458
arl-0.402
ens-0.400
 bisc-0.384
coat-0.375
Token Lily
Token ID20037
Feature activation+7.31e-01

Pos logit contributions
isk+0.726
arl+0.576
ens+0.510
ives+0.506
coat+0.505
Neg logit contributions
inated-1.529
ised-1.498
work-1.436
ized-1.395
el-1.341
Token called
Token ID1444
Feature activation-5.80e-01

Pos logit contributions
inated+1.333
work+1.293
ised+1.272
ized+1.179
ble+1.111
Neg logit contributions
isk-1.287
ens-1.145
 bisc-1.088
arl-1.087
coat-1.069
Token toy
Token ID13373
Feature activation+1.48e+00

Pos logit contributions
isk+0.582
ens+0.489
arl+0.488
ives+0.455
 bisc+0.450
Neg logit contributions
inated-0.744
work-0.707
ised-0.705
ized-0.655
el-0.622
Token.
Token ID13
Feature activation-2.07e+00

Pos logit contributions
inated+3.166
ised+3.074
work+2.973
ized+2.829
used+2.730
Neg logit contributions
isk-2.158
ens-1.899
arl-1.887
 bisc-1.781
coat-1.729
Token
Token ID198
Feature activation-8.04e-01

Pos logit contributions
isk+3.894
arl+3.600
aid+3.350
coat+3.316
ives+3.282
Neg logit contributions
inated-3.644
ised-3.197
work-3.060
el-2.788
ized-2.723
Token
Token ID198
Feature activation-2.15e+00

Pos logit contributions
isk+1.553
arl+1.383
ens+1.357
 bisc+1.330
coat+1.311
Neg logit contributions
inated-1.304
ised-1.245
work-1.206
ized-1.109
el-1.073
TokenMax
Token ID11518
Feature activation+4.77e-01

Pos logit contributions
isk+3.519
arl+3.164
 bisc+2.916
ens+2.889
ives+2.840
Neg logit contributions
inated-4.288
ised-4.100
ized-3.658
work-3.599
izing-3.546
Token asked
Token ID1965
Feature activation-3.55e+00

Pos logit contributions
inated+0.893
work+0.862
ised+0.853
ized+0.789
ement+0.750
Neg logit contributions
isk-0.814
ens-0.708
arl-0.690
 bisc-0.669
coat-0.653
Token Luna
Token ID23694
Feature activation-1.33e+00

Pos logit contributions
arl+7.123
isk+7.056
 bisc+6.950
ives+6.943
coat+6.938
Neg logit contributions
inated-5.020
el-4.897
ised-4.773
work-4.632
 curiously-4.624
Token,
Token ID11
Feature activation-2.20e-01

Pos logit contributions
isk+2.499
arl+2.329
 bisc+2.254
coat+2.218
ens+2.181
Neg logit contributions
inated-2.211
work-2.158
ised-2.012
ized-1.839
ble-1.794
Token "
Token ID366
Feature activation-1.40e+00

Pos logit contributions
isk+0.321
arl+0.272
ens+0.270
 bisc+0.258
coat+0.250
Neg logit contributions
inated-0.466
work-0.449
ised-0.443
ized-0.411
el-0.394
TokenWhat
Token ID2061
Feature activation-6.74e-01

Pos logit contributions
isk+2.629
arl+2.523
ens+2.352
ives+2.305
coat+2.252
Neg logit contributions
inated-2.240
work-2.144
ised-2.075
ized-1.908
el-1.848
Token are
Token ID389
Feature activation+8.35e-02

Pos logit contributions
isk+1.053
arl+0.924
ens+0.877
 bisc+0.832
coat+0.829
Neg logit contributions
inated-1.362
work-1.332
ised-1.271
ized-1.183
el-1.170
Token talks
Token ID6130
Feature activation-1.97e+00

Pos logit contributions
isk+1.127
arl+0.990
ens+0.923
 bisc+0.899
coat+0.888
Neg logit contributions
inated-1.830
ised-1.716
work-1.709
ized-1.618
el-1.577
Token to
Token ID284
Feature activation-4.46e-01

Pos logit contributions
isk+3.903
 bisc+3.425
ens+3.381
arl+3.378
coat+3.197
Neg logit contributions
inated-3.259
work-3.199
ised-3.078
ized-2.816
ble-2.693
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
isk+0.824
arl+0.749
ens+0.720
 bisc+0.708
coat+0.707
Neg logit contributions
inated-0.757
work-0.731
ised-0.717
ized-0.663
el-0.631
Token dog
Token ID3290
Feature activation-9.92e-01

Pos logit contributions
isk+1.839
arl+1.476
ens+1.416
ives+1.372
 bisc+1.324
Neg logit contributions
inated-2.718
ised-2.603
work-2.584
ized-2.465
ble-2.356
Token in
Token ID287
Feature activation-1.68e+00

Pos logit contributions
isk+2.192
arl+2.049
ens+1.965
 bisc+1.950
coat+1.918
Neg logit contributions
inated-1.338
work-1.259
ised-1.167
ized-1.058
el-1.033
Token a
Token ID257
Feature activation-3.53e+00

Pos logit contributions
isk+3.463
arl+3.086
ives+3.043
coat+3.041
ens+3.022
Neg logit contributions
inated-2.520
ised-2.457
work-2.364
el-2.328
ized-2.163
Token soft
Token ID2705
Feature activation-2.64e+00

Pos logit contributions
isk+7.800
ens+6.586
ives+6.543
arl+6.272
coat+5.900
Neg logit contributions
inated-5.878
ised-5.686
 top-5.400
el-5.267
ized-5.142
Token voice
Token ID3809
Feature activation-4.77e-01

Pos logit contributions
isk+5.547
ives+5.152
arl+4.990
ens+4.794
 backup+4.573
Neg logit contributions
inated-4.346
ised-4.088
el-3.919
work-3.637
ized-3.484
Token.
Token ID13
Feature activation-9.62e-01

Pos logit contributions
isk+1.071
arl+0.952
ens+0.949
 bisc+0.922
coat+0.909
Neg logit contributions
inated-0.642
work-0.619
ised-0.594
ized-0.533
el-0.503
Token He
Token ID679
Feature activation+3.42e-01

Pos logit contributions
isk+1.786
ens+1.516
arl+1.515
coat+1.501
 bisc+1.469
Neg logit contributions
inated-1.718
ised-1.594
work-1.585
ized-1.435
el-1.389
Token gives
Token ID3607
Feature activation-9.17e-01

Pos logit contributions
inated+0.759
work+0.734
ised+0.731
ized+0.684
el+0.651
Neg logit contributions
isk-0.462
ens-0.384
arl-0.384
 bisc-0.361
coat-0.347
Token fell
Token ID3214
Feature activation-5.21e-01

Pos logit contributions
isk+0.511
arl+0.455
ens+0.437
 bisc+0.419
ives+0.417
Neg logit contributions
inated-0.502
ised-0.479
work-0.478
ized-0.441
el-0.414
Token asleep
Token ID16039
Feature activation-1.36e-01

Pos logit contributions
isk+0.670
ens+0.536
arl+0.535
 bisc+0.516
ives+0.493
Neg logit contributions
inated-1.208
work-1.167
ised-1.160
ized-1.088
el-1.051
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
isk+0.265
arl+0.236
ens+0.234
 bisc+0.225
ives+0.219
Neg logit contributions
inated-0.223
work-0.213
ised-0.212
ized-0.194
izing-0.179
Token In
Token ID554
Feature activation+4.23e-02

Pos logit contributions
isk+0.570
arl+0.504
ens+0.503
 bisc+0.488
coat+0.474
Neg logit contributions
inated-0.522
work-0.486
ised-0.484
ized-0.439
el-0.416
Token the
Token ID262
Feature activation-1.75e+00

Pos logit contributions
inated+0.075
work+0.072
ised+0.071
ized+0.065
el+0.062
Neg logit contributions
isk-0.077
arl-0.068
ens-0.067
 bisc-0.065
coat-0.063
Token middle
Token ID3504
Feature activation-3.51e+00

Pos logit contributions
isk+3.687
arl+3.256
ives+3.240
ens+3.197
coat+3.081
Neg logit contributions
inated-2.491
work-2.457
ised-2.419
el-2.237
 sunset-2.186
Token of
Token ID286
Feature activation+3.61e-02

Pos logit contributions
isk+7.339
arl+6.503
 robber+6.217
 robbers+6.099
 bisc+5.872
Neg logit contributions
ised-6.170
el-6.147
ement-6.058
ized-5.634
work-5.605
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
inated+0.064
work+0.061
ised+0.060
ized+0.055
el+0.053
Neg logit contributions
isk-0.066
arl-0.058
ens-0.058
 bisc-0.056
coat-0.054
Token night
Token ID1755
Feature activation-1.31e+00

Pos logit contributions
isk+2.145
arl+1.886
ens+1.871
ives+1.838
coat+1.755
Neg logit contributions
inated-1.673
work-1.620
ised-1.582
ized-1.440
el-1.424
Token,
Token ID11
Feature activation-1.34e-01

Pos logit contributions
isk+2.632
arl+2.324
ens+2.229
 bisc+2.145
coat+2.106
Neg logit contributions
inated-2.167
work-2.136
ised-2.119
ized-1.938
el-1.845
Token she
Token ID673
Feature activation-1.01e+00

Pos logit contributions
isk+0.215
arl+0.186
ens+0.182
 bisc+0.176
ives+0.171
Neg logit contributions
inated-0.266
work-0.256
ised-0.256
ized-0.238
el-0.224
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
isk+0.069
arl+0.062
ens+0.061
 bisc+0.060
coat+0.058
Neg logit contributions
inated-0.050
work-0.047
ised-0.047
ized-0.042
el-0.039
Token
Token ID198
Feature activation-5.25e-01

Pos logit contributions
isk+1.925
arl+1.674
ens+1.601
coat+1.588
 bisc+1.561
Neg logit contributions
inated-1.938
ised-1.818
work-1.718
ized-1.610
el-1.574
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
isk+0.994
arl+0.867
ens+0.863
 bisc+0.839
coat+0.831
Neg logit contributions
inated-0.886
ised-0.843
work-0.820
ized-0.758
el-0.724
Token"
Token ID1
Feature activation-1.96e+00

Pos logit contributions
isk+1.948
arl+1.676
ens+1.632
 bisc+1.561
coat+1.559
Neg logit contributions
inated-1.967
ised-1.918
work-1.760
ized-1.735
izing-1.645
TokenSpot
Token ID32565
Feature activation-1.49e+00

Pos logit contributions
isk+3.240
arl+2.962
ens+2.612
ives+2.526
 germ+2.477
Neg logit contributions
inated-3.762
ised-3.552
work-3.487
ized-3.263
used-3.109
Token,
Token ID11
Feature activation-3.49e+00

Pos logit contributions
isk+2.928
arl+2.729
ives+2.534
ens+2.511
and+2.490
Neg logit contributions
inated-2.397
work-2.290
ised-2.199
ized-2.007
el-1.983
Token sit
Token ID1650
Feature activation-4.23e-01

Pos logit contributions
isk+7.088
arl+6.695
ives+6.332
coat+6.306
aks+6.302
Neg logit contributions
inated-5.363
work-5.232
 top-4.764
ble-4.712
el-4.533
Token!"
Token ID2474
Feature activation-3.19e-01

Pos logit contributions
isk+0.814
ens+0.714
arl+0.707
 bisc+0.699
ives+0.684
Neg logit contributions
inated-0.700
ised-0.672
work-0.670
ized-0.617
el-0.583
Token Ben
Token ID3932
Feature activation-1.73e-02

Pos logit contributions
isk+0.323
arl+0.250
ens+0.230
 bisc+0.216
coat+0.215
Neg logit contributions
inated-0.832
ised-0.806
work-0.795
ized-0.757
el-0.734
Token says
Token ID1139
Feature activation-2.85e-01

Pos logit contributions
isk+0.028
arl+0.024
ens+0.024
 bisc+0.023
coat+0.022
Neg logit contributions
inated-0.034
work-0.033
ised-0.033
ized-0.030
el-0.029
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
isk+0.628
arl+0.560
ens+0.556
 bisc+0.541
coat+0.529
Neg logit contributions
inated-0.397
work-0.378
ised-0.367
ized-0.330
el-0.307
Token
Token ID198
Feature activation-1.84e+00

Pos logit contributions
isk+2.113
ens+1.857
arl+1.841
coat+1.838
 bisc+1.816
Neg logit contributions
inated-1.655
ised-1.571
work-1.503
ized-1.401
ble-1.350
Token"
Token ID1
Feature activation-1.45e+00

Pos logit contributions
isk+3.481
coat+3.003
ens+2.966
 bisc+2.913
arl+2.904
Neg logit contributions
ised-3.184
inated-3.177
ized-2.898
work-2.803
izing-2.692
TokenRex
Token ID47389
Feature activation-1.73e+00

Pos logit contributions
isk+2.608
arl+2.301
ens+2.182
coat+2.145
ives+2.091
Neg logit contributions
inated-2.593
ised-2.500
work-2.481
ized-2.338
ble-2.128
Token,
Token ID11
Feature activation-3.40e+00

Pos logit contributions
isk+3.367
arl+3.110
and+2.910
coat+2.898
 shady+2.896
Neg logit contributions
inated-2.791
work-2.595
ised-2.504
ized-2.369
el-2.305
Token Rex
Token ID17853
Feature activation-1.53e+00

Pos logit contributions
isk+6.824
coat+6.516
arl+6.333
ives+6.277
aks+6.165
Neg logit contributions
work-5.243
inated-4.863
 top-4.538
el-4.469
ble-4.384
Token,
Token ID11
Feature activation-3.48e+00

Pos logit contributions
isk+3.425
arl+3.171
ives+3.013
coat+3.005
ens+2.996
Neg logit contributions
inated-1.993
work-1.837
ised-1.803
ized-1.663
el-1.598
Token come
Token ID1282
Feature activation-8.93e-01

Pos logit contributions
isk+7.158
coat+6.779
arl+6.509
ives+6.452
aks+6.363
Neg logit contributions
work-5.145
inated-5.005
el-4.554
ised-4.380
used-4.354
Token here
Token ID994
Feature activation-3.11e-01

Pos logit contributions
isk+1.659
ens+1.474
arl+1.456
 bisc+1.423
coat+1.412
Neg logit contributions
inated-1.508
work-1.446
ised-1.397
el-1.291
ized-1.283
Token!"
Token ID2474
Feature activation-1.68e+00

Pos logit contributions
isk+0.661
arl+0.591
ens+0.586
 bisc+0.565
coat+0.560
Neg logit contributions
inated-0.448
work-0.426
ised-0.419
ized-0.379
el-0.353
Token the
Token ID262
Feature activation-1.46e+00

Pos logit contributions
isk+2.758
arl+2.291
coat+2.180
ives+2.148
 bisc+2.058
Neg logit contributions
inated-3.383
ised-3.307
work-3.146
ized-3.058
el-2.856
Token man
Token ID582
Feature activation-3.55e-01

Pos logit contributions
isk+2.403
ens+1.976
ives+1.972
arl+1.952
and+1.856
Neg logit contributions
inated-2.856
ised-2.793
work-2.727
ized-2.596
ble-2.547
Token sees
Token ID7224
Feature activation-1.22e+00

Pos logit contributions
isk+3.248
arl+2.881
coat+2.831
 bisc+2.658
ives+2.624
Neg logit contributions
inated-4.422
work-4.268
ised-4.227
ized-4.036
el-3.992
Token Tim
Token ID5045
Feature activation-1.21e+00

Pos logit contributions
isk+2.394
coat+2.174
arl+2.118
 bisc+2.116
ives+2.056
Neg logit contributions
inated-1.928
ised-1.909
work-1.845
ized-1.749
el-1.695
Token hurt
Token ID5938
Feature activation-5.32e-01

Pos logit contributions
isk+2.470
arl+2.271
 bisc+2.211
ens+2.183
coat+2.125
Neg logit contributions
inated-1.814
work-1.732
ised-1.664
ized-1.562
el-1.559
Token.
Token ID13
Feature activation-3.95e-01

Pos logit contributions
isk+1.005
arl+0.886
ens+0.863
 bisc+0.861
ives+0.844
Neg logit contributions
inated-0.891
ised-0.862
work-0.859
ized-0.791
el-0.772
Token She
Token ID1375
Feature activation-1.94e-01

Pos logit contributions
isk+0.682
arl+0.586
ens+0.575
 bisc+0.571
coat+0.556
Neg logit contributions
inated-0.739
ised-0.695
work-0.693
ized-0.635
el-0.617
Token calls
Token ID3848
Feature activation-3.47e+00

Pos logit contributions
isk+0.264
arl+0.220
ens+0.218
 bisc+0.208
coat+0.206
Neg logit contributions
inated-0.429
work-0.412
ised-0.411
ized-0.385
el-0.373
Token for
Token ID329
Feature activation-2.45e+00

Pos logit contributions
isk+7.676
 bisc+7.411
coat+7.188
ives+7.099
arl+6.806
Neg logit contributions
work-4.315
inated-4.295
el-4.293
ised-4.282
 top-4.217
Token help
Token ID1037
Feature activation-2.02e+00

Pos logit contributions
isk+4.470
ives+4.107
arl+4.078
coat+4.022
aks+3.831
Neg logit contributions
ised-4.108
inated-4.050
work-3.967
ized-3.880
el-3.759
Token and
Token ID290
Feature activation-1.82e+00

Pos logit contributions
isk+4.969
arl+4.675
 bisc+4.629
 shady+4.431
coat+4.359
Neg logit contributions
inated-2.073
ised-2.039
work-2.015
el-1.798
ized-1.780
Token takes
Token ID2753
Feature activation-1.34e+00

Pos logit contributions
isk+3.739
arl+3.360
coat+3.232
ens+3.214
 bisc+3.206
Neg logit contributions
inated-2.783
work-2.726
ised-2.690
ized-2.502
used-2.275
Token Tim
Token ID5045
Feature activation-1.20e+00

Pos logit contributions
isk+2.624
arl+2.384
coat+2.273
ives+2.202
 bisc+2.197
Neg logit contributions
inated-2.114
ised-2.089
work-2.061
el-1.948
ized-1.890
Token.
Token ID13
Feature activation-1.71e+00

Pos logit contributions
inated+1.421
ised+1.343
work+1.333
ized+1.199
ble+1.145
Neg logit contributions
isk-1.767
ens-1.554
arl-1.527
 bisc-1.493
coat-1.466
Token
Token ID220
Feature activation-4.95e-01

Pos logit contributions
isk+3.423
arl+3.044
 bisc+2.833
ives+2.830
 germ+2.829
Neg logit contributions
inated-2.753
ised-2.562
work-2.459
ized-2.219
el-2.135
Token
Token ID198
Feature activation-1.19e+00

Pos logit contributions
isk+0.910
arl+0.800
ens+0.799
 bisc+0.781
coat+0.760
Neg logit contributions
inated-0.842
ised-0.801
work-0.793
ized-0.728
el-0.696
Token
Token ID198
Feature activation-1.94e+00

Pos logit contributions
isk+2.449
arl+2.157
ens+2.097
 bisc+2.071
 germ+2.028
Neg logit contributions
inated-1.808
ised-1.764
work-1.652
ized-1.558
el-1.472
TokenMax
Token ID11518
Feature activation+7.27e-01

Pos logit contributions
isk+3.134
arl+2.724
 bisc+2.564
ens+2.543
 germ+2.540
Neg logit contributions
inated-3.881
ised-3.876
ized-3.556
work-3.408
izing-3.363
Token asked
Token ID1965
Feature activation-3.47e+00

Pos logit contributions
inated+1.352
work+1.304
ised+1.297
ized+1.197
ement+1.142
Neg logit contributions
isk-1.257
ens-1.095
arl-1.038
 bisc-1.030
coat-1.003
Token Lily
Token ID20037
Feature activation-1.10e+00

Pos logit contributions
isk+7.968
arl+7.849
 bisc+7.789
coat+7.644
ives+7.481
Neg logit contributions
ised-4.460
inated-3.947
work-3.812
 curiously-3.784
el-3.712
Token,
Token ID11
Feature activation-1.89e-01

Pos logit contributions
isk+1.992
arl+1.835
 bisc+1.787
coat+1.732
ens+1.722
Neg logit contributions
inated-1.926
work-1.867
ised-1.773
ized-1.631
el-1.587
Token "
Token ID366
Feature activation-8.36e-01

Pos logit contributions
isk+0.275
arl+0.232
ens+0.232
 bisc+0.221
coat+0.213
Neg logit contributions
inated-0.401
work-0.387
ised-0.382
ized-0.356
el-0.339
TokenWhat
Token ID2061
Feature activation-3.29e-01

Pos logit contributions
isk+1.514
arl+1.389
ens+1.339
 bisc+1.280
ives+1.271
Neg logit contributions
inated-1.440
work-1.397
ised-1.366
ized-1.265
el-1.187
Token are
Token ID389
Feature activation+6.86e-01

Pos logit contributions
isk+0.490
arl+0.422
ens+0.408
 bisc+0.384
coat+0.379
Neg logit contributions
inated-0.687
work-0.668
ised-0.650
ized-0.606
el-0.591
Token cow
Token ID9875
Feature activation-1.45e+00

Pos logit contributions
isk+3.096
arl+2.846
ens+2.791
coat+2.656
 bisc+2.648
Neg logit contributions
work-2.724
inated-2.673
ised-2.571
ized-2.368
ble-2.212
Tokenboys
Token ID13202
Feature activation-1.06e+00

Pos logit contributions
isk+2.696
arl+2.595
ens+2.351
 shady+2.299
coat+2.290
Neg logit contributions
work-2.403
inated-2.324
ised-2.171
ized-2.057
el-1.936
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
isk+2.063
arl+1.902
ens+1.807
 bisc+1.806
 shady+1.762
Neg logit contributions
work-1.692
inated-1.685
ised-1.576
ized-1.461
el-1.364
Token They
Token ID1119
Feature activation-1.18e+00

Pos logit contributions
isk+2.139
arl+1.808
ens+1.802
coat+1.757
 bisc+1.701
Neg logit contributions
inated-2.162
work-2.041
ised-1.998
ized-1.767
el-1.699
Token run
Token ID1057
Feature activation-2.16e+00

Pos logit contributions
isk+1.563
arl+1.373
ens+1.240
 bisc+1.223
coat+1.196
Neg logit contributions
inated-2.587
work-2.539
ised-2.414
ized-2.268
el-2.225
Token,
Token ID11
Feature activation-3.47e+00

Pos logit contributions
isk+4.617
arl+4.177
 bisc+4.005
ens+3.911
 germ+3.773
Neg logit contributions
inated-3.295
work-3.152
ised-3.024
ized-2.704
 top-2.589
Token jump
Token ID4391
Feature activation-1.62e+00

Pos logit contributions
isk+6.373
arl+5.540
and+4.926
ives+4.841
aid+4.736
Neg logit contributions
inated-6.738
work-6.578
ised-5.651
 top-5.595
ized-5.454
Token and
Token ID290
Feature activation-2.15e+00

Pos logit contributions
isk+3.417
arl+2.956
ens+2.950
 bisc+2.929
 germ+2.830
Neg logit contributions
inated-2.518
work-2.397
ised-2.339
ized-2.116
ble-2.025
Token shoot
Token ID2686
Feature activation-1.25e+00

Pos logit contributions
isk+3.236
arl+2.831
ens+2.423
ives+2.411
coat+2.299
Neg logit contributions
inated-4.695
work-4.672
ised-4.219
ized-3.982
 top-3.925
Token.
Token ID13
Feature activation-9.75e-01

Pos logit contributions
isk+2.430
arl+2.105
coat+2.048
ens+2.004
 bisc+1.969
Neg logit contributions
inated-2.123
ised-2.064
work-2.035
ized-1.869
el-1.838
Token They
Token ID1119
Feature activation-1.35e+00

Pos logit contributions
isk+1.717
ens+1.460
arl+1.459
 bisc+1.402
coat+1.394
Neg logit contributions
inated-1.824
work-1.715
ised-1.683
ized-1.507
el-1.444
Token eyes
Token ID2951
Feature activation-4.33e-02

Pos logit contributions
isk+0.558
ens+0.493
arl+0.486
coat+0.467
 bisc+0.464
Neg logit contributions
inated-0.442
work-0.416
ised-0.413
ized-0.373
el-0.361
Token and
Token ID290
Feature activation-5.32e-01

Pos logit contributions
isk+0.077
arl+0.067
ens+0.067
 bisc+0.064
coat+0.063
Neg logit contributions
inated-0.078
work-0.074
ised-0.074
ized-0.068
el-0.064
Token sleeps
Token ID44263
Feature activation-1.56e+00

Pos logit contributions
isk+0.758
arl+0.644
ens+0.614
coat+0.585
 bisc+0.572
Neg logit contributions
inated-1.159
work-1.104
ised-1.096
ized-1.027
el-0.993
Token.
Token ID13
Feature activation-1.45e+00

Pos logit contributions
isk+3.705
ens+3.312
arl+3.255
 bisc+3.187
coat+3.129
Neg logit contributions
ised-1.898
inated-1.891
work-1.872
ized-1.718
ble-1.608
Token He
Token ID679
Feature activation+6.11e-03

Pos logit contributions
isk+2.711
coat+2.312
ens+2.290
arl+2.239
 bisc+2.212
Neg logit contributions
inated-2.560
ised-2.368
work-2.315
ble-2.098
ized-2.059
Token dreams
Token ID10625
Feature activation-3.46e+00

Pos logit contributions
inated+0.013
work+0.012
ised+0.012
ized+0.011
el+0.011
Neg logit contributions
isk-0.009
arl-0.008
ens-0.008
 bisc-0.007
coat-0.007
Token of
Token ID286
Feature activation-1.43e+00

Pos logit contributions
isk+6.966
 bisc+6.251
 robbers+5.730
ens+5.597
coat+5.512
Neg logit contributions
work-6.025
ised-5.767
ble-5.554
inated-5.543
ized-5.457
Token his
Token ID465
Feature activation+3.95e-01

Pos logit contributions
isk+2.698
arl+2.493
ens+2.407
ives+2.396
 bisc+2.333
Neg logit contributions
inated-2.362
ised-2.247
work-2.234
ized-2.092
ble-2.001
Token ball
Token ID2613
Feature activation-1.30e+00

Pos logit contributions
inated+0.791
work+0.757
ised+0.754
ized+0.700
el+0.665
Neg logit contributions
isk-0.628
arl-0.540
ens-0.539
 bisc-0.514
coat-0.499
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
isk+2.634
arl+2.428
ens+2.311
 bisc+2.305
coat+2.204
Neg logit contributions
inated-1.960
work-1.929
ised-1.825
ized-1.692
el-1.635
Token the
Token ID262
Feature activation-7.76e-01

Pos logit contributions
isk+2.076
arl+1.898
coat+1.746
ives+1.719
ens+1.703
Neg logit contributions
inated-2.467
work-2.378
ised-2.301
ized-2.143
el-2.097
Token Max
Token ID5436
Feature activation+1.16e-01

Pos logit contributions
isk+0.455
arl+0.393
ens+0.376
 bisc+0.362
coat+0.352
Neg logit contributions
inated-0.727
work-0.702
ised-0.684
ized-0.640
el-0.616
Token.
Token ID13
Feature activation-1.92e+00

Pos logit contributions
inated+0.169
work+0.161
ised+0.159
ized+0.143
el+0.133
Neg logit contributions
isk-0.246
arl-0.221
ens-0.219
 bisc-0.212
coat-0.208
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
isk+3.699
arl+3.265
aid+3.116
 bisc+3.058
aks+3.053
Neg logit contributions
inated-3.285
ised-3.051
work-2.883
ized-2.603
 sunset-2.508
Token
Token ID198
Feature activation-2.14e+00

Pos logit contributions
isk+2.322
arl+2.055
ens+2.008
 bisc+1.978
coat+1.924
Neg logit contributions
inated-1.769
ised-1.705
work-1.617
ized-1.509
el-1.433
TokenMax
Token ID11518
Feature activation+1.31e-01

Pos logit contributions
isk+3.661
arl+3.228
ens+3.095
 bisc+3.087
 germ+3.035
Neg logit contributions
inated-4.073
ised-4.010
ized-3.646
izing-3.463
work-3.445
Token asked
Token ID1965
Feature activation-3.43e+00

Pos logit contributions
inated+0.242
work+0.231
ised+0.230
ized+0.212
el+0.200
Neg logit contributions
isk-0.227
arl-0.199
ens-0.197
 bisc-0.188
coat-0.184
Token Lily
Token ID20037
Feature activation-1.36e+00

Pos logit contributions
isk+7.524
arl+7.389
 bisc+7.375
coat+7.111
ives+6.922
Neg logit contributions
ised-4.613
inated-4.282
 curiously-4.079
work-4.039
el-3.977
Token,
Token ID11
Feature activation-4.17e-01

Pos logit contributions
isk+2.485
arl+2.314
 bisc+2.262
coat+2.179
ens+2.146
Neg logit contributions
inated-2.355
work-2.294
ised-2.149
ized-1.976
used-1.929
Token "
Token ID366
Feature activation-1.17e+00

Pos logit contributions
isk+0.628
arl+0.532
ens+0.530
 bisc+0.509
ives+0.495
Neg logit contributions
inated-0.866
work-0.833
ised-0.822
ized-0.762
used-0.731
TokenCan
Token ID6090
Feature activation-1.01e-01

Pos logit contributions
isk+2.121
arl+1.988
ens+1.881
ives+1.809
 bisc+1.789
Neg logit contributions
inated-1.982
work-1.926
ised-1.880
ized-1.736
el-1.636
Token we
Token ID356
Feature activation+3.52e-01

Pos logit contributions
isk+0.190
arl+0.168
ens+0.165
 bisc+0.160
coat+0.156
Neg logit contributions
inated-0.173
work-0.165
ised-0.163
ized-0.150
el-0.141
TokenMax
Token ID11518
Feature activation+5.30e-01

Pos logit contributions
isk+0.806
arl+0.679
ens+0.657
 bisc+0.636
 shady+0.597
Neg logit contributions
inated-1.439
ised-1.405
work-1.350
ized-1.314
izing-1.260
Token was
Token ID373
Feature activation-2.14e-01

Pos logit contributions
inated+1.001
work+0.957
ised+0.955
ized+0.886
izing+0.839
Neg logit contributions
isk-0.902
ens-0.780
arl-0.757
 bisc-0.740
coat-0.724
Token excited
Token ID6568
Feature activation+1.37e-01

Pos logit contributions
isk+0.419
arl+0.377
ens+0.373
 bisc+0.360
ives+0.350
Neg logit contributions
inated-0.348
work-0.332
ised-0.327
ized-0.298
el-0.279
Token.
Token ID13
Feature activation-1.85e+00

Pos logit contributions
inated+0.265
work+0.254
ised+0.253
ized+0.234
el+0.222
Neg logit contributions
isk-0.227
arl-0.197
ens-0.195
 bisc-0.187
coat-0.182
Token He
Token ID679
Feature activation+4.96e-01

Pos logit contributions
isk+2.900
arl+2.538
 bisc+2.438
ens+2.405
aks+2.387
Neg logit contributions
inated-3.785
ised-3.580
work-3.454
ized-3.125
ement-3.085
Token asked
Token ID1965
Feature activation-3.43e+00

Pos logit contributions
inated+1.000
work+0.962
ised+0.959
ized+0.890
izing+0.848
Neg logit contributions
isk-0.779
ens-0.667
arl-0.654
 bisc-0.633
coat-0.618
Token Lily
Token ID20037
Feature activation-1.03e+00

Pos logit contributions
 bisc+6.582
arl+6.315
isk+6.239
coat+6.082
ives+5.854
Neg logit contributions
ised-5.683
work-5.305
inated-5.243
 curiously-5.239
el-4.832
Token to
Token ID284
Feature activation-4.52e-01

Pos logit contributions
isk+1.786
arl+1.623
 bisc+1.574
ens+1.548
coat+1.512
Neg logit contributions
inated-1.881
work-1.845
ised-1.748
ized-1.602
used-1.562
Token show
Token ID905
Feature activation-5.35e-01

Pos logit contributions
isk+0.624
arl+0.534
ens+0.530
 bisc+0.500
ives+0.477
Neg logit contributions
inated-0.980
work-0.957
ised-0.933
ized-0.875
el-0.851
Token him
Token ID683
Feature activation-1.53e-01

Pos logit contributions
isk+0.944
arl+0.846
 bisc+0.809
ens+0.805
coat+0.791
Neg logit contributions
inated-0.954
work-0.926
ised-0.914
ized-0.845
el-0.811
Token how
Token ID703
Feature activation+7.32e-02

Pos logit contributions
isk+0.288
arl+0.258
ens+0.252
 bisc+0.245
coat+0.239
Neg logit contributions
inated-0.259
work-0.249
ised-0.245
ized-0.225
el-0.213
Token their
Token ID511
Feature activation-1.00e+00

Pos logit contributions
inated+0.767
work+0.736
ised+0.733
ized+0.681
izing+0.645
Neg logit contributions
isk-0.596
ens-0.510
arl-0.507
 bisc-0.486
coat-0.470
Token hats
Token ID23910
Feature activation-6.08e-01

Pos logit contributions
isk+1.405
arl+1.185
ens+1.141
ives+1.092
coat+1.025
Neg logit contributions
inated-2.190
work-2.136
ised-2.091
ized-1.969
el-1.915
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
isk+1.133
arl+0.998
ens+0.978
 bisc+0.943
 shady+0.917
Neg logit contributions
inated-1.037
work-1.021
ised-0.982
ized-0.909
el-0.864
Token coats
Token ID30720
Feature activation-3.33e-01

Pos logit contributions
isk+2.716
arl+2.326
ives+2.121
and+2.066
ens+2.037
Neg logit contributions
inated-3.634
work-3.587
ised-3.468
el-3.197
ized-3.186
Token and
Token ID290
Feature activation-1.92e+00

Pos logit contributions
isk+0.620
arl+0.543
ens+0.539
 bisc+0.518
ives+0.501
Neg logit contributions
inated-0.573
work-0.558
ised-0.544
ized-0.500
el-0.472
Token sne
Token ID10505
Feature activation-3.41e+00

Pos logit contributions
isk+3.051
arl+2.597
ives+2.310
and+2.294
ens+2.258
Neg logit contributions
inated-3.905
work-3.847
ised-3.729
ized-3.418
el-3.383
Tokenaked
Token ID4335
Feature activation-1.18e+00

Pos logit contributions
isk+4.857
 shady+4.709
arl+4.226
ens+3.976
 bisc+3.919
Neg logit contributions
ised-7.305
work-7.193
el-7.103
inated-6.981
ized-6.254
Token out
Token ID503
Feature activation-1.07e+00

Pos logit contributions
isk+2.401
arl+2.114
ens+2.108
 bisc+1.992
ives+1.949
Neg logit contributions
inated-1.835
work-1.792
ised-1.753
ized-1.595
el-1.489
Token of
Token ID286
Feature activation-1.08e+00

Pos logit contributions
isk+2.115
ens+1.841
arl+1.796
 bisc+1.773
coat+1.737
Neg logit contributions
inated-1.745
work-1.742
ised-1.660
ized-1.526
el-1.502
Token the
Token ID262
Feature activation-1.57e+00

Pos logit contributions
isk+2.141
arl+1.876
ens+1.857
ives+1.856
coat+1.783
Neg logit contributions
inated-1.699
work-1.668
ised-1.662
ized-1.506
el-1.490
Token house
Token ID2156
Feature activation-9.92e-01

Pos logit contributions
isk+2.744
ens+2.276
arl+2.206
ives+2.196
 bisc+2.051
Neg logit contributions
work-2.941
inated-2.941
ised-2.821
ized-2.613
el-2.589
Token the
Token ID262
Feature activation-2.16e+00

Pos logit contributions
isk+1.175
arl+1.001
 bisc+0.942
coat+0.926
ens+0.911
Neg logit contributions
inated-1.894
work-1.829
ised-1.800
ized-1.688
el-1.679
Token dog
Token ID3290
Feature activation-1.40e+00

Pos logit contributions
isk+3.989
ens+3.244
ives+3.171
arl+3.021
aks+2.924
Neg logit contributions
inated-3.959
work-3.852
ised-3.776
 top-3.613
ble-3.543
Token and
Token ID290
Feature activation-1.94e+00

Pos logit contributions
isk+2.719
arl+2.608
ens+2.400
 bisc+2.378
coat+2.318
Neg logit contributions
inated-2.284
work-2.157
ised-2.028
ized-1.909
el-1.786
Token said
Token ID531
Feature activation-1.97e+00

Pos logit contributions
isk+3.316
arl+3.305
ives+2.886
coat+2.810
ens+2.761
Neg logit contributions
inated-3.616
work-3.444
ised-3.401
ized-3.199
 curiously-2.942
Token in
Token ID287
Feature activation-1.19e+00

Pos logit contributions
isk+4.165
 bisc+3.457
ens+3.437
coat+3.424
arl+3.381
Neg logit contributions
inated-3.243
work-3.020
ised-2.926
ble-2.760
used-2.718
Token a
Token ID257
Feature activation-3.41e+00

Pos logit contributions
isk+2.533
ens+2.293
arl+2.261
coat+2.255
ives+2.199
Neg logit contributions
inated-1.682
ised-1.644
work-1.573
ized-1.433
el-1.412
Token loud
Token ID7812
Feature activation-2.86e+00

Pos logit contributions
isk+7.342
ives+6.522
ens+6.434
arl+6.080
coat+5.931
Neg logit contributions
ised-5.550
inated-5.470
 top-5.244
ized-4.891
 curiously-4.787
Token voice
Token ID3809
Feature activation-7.73e-01

Pos logit contributions
isk+6.650
ives+6.143
arl+6.043
ens+5.844
coat+5.756
Neg logit contributions
inated-4.006
ised-3.663
work-3.574
el-3.485
ized-3.111
Token,
Token ID11
Feature activation-7.23e-01

Pos logit contributions
isk+1.759
arl+1.566
ens+1.565
 bisc+1.540
coat+1.514
Neg logit contributions
inated-1.007
work-0.998
ised-0.934
ized-0.834
ement-0.789
Token "
Token ID366
Feature activation+9.07e-02

Pos logit contributions
isk+1.124
ens+0.969
arl+0.961
 bisc+0.929
coat+0.905
Neg logit contributions
inated-1.484
work-1.405
ised-1.385
ized-1.294
used-1.246
TokenHey
Token ID10814
Feature activation-1.34e+00

Pos logit contributions
inated+0.176
work+0.169
ised+0.168
ized+0.156
el+0.147
Neg logit contributions
isk-0.150
arl-0.130
ens-0.129
 bisc-0.123
coat-0.120
Token them
Token ID606
Feature activation-4.21e-01

Pos logit contributions
isk+1.044
arl+0.908
ens+0.890
coat+0.888
 bisc+0.880
Neg logit contributions
inated-1.016
ised-0.980
work-0.978
ized-0.901
el-0.870
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
isk+0.862
arl+0.765
ens+0.757
 bisc+0.745
coat+0.730
Neg logit contributions
inated-0.646
work-0.610
ised-0.599
ized-0.544
el-0.526
Token He
Token ID679
Feature activation-1.75e+00

Pos logit contributions
isk+1.884
arl+1.578
coat+1.492
ens+1.478
 bisc+1.457
Neg logit contributions
inated-2.665
work-2.566
ised-2.516
ized-2.317
el-2.312
Token lives
Token ID3160
Feature activation-1.57e+00

Pos logit contributions
isk+2.090
arl+1.821
coat+1.764
ens+1.753
 bisc+1.720
Neg logit contributions
inated-4.032
work-3.873
ised-3.841
el-3.707
ized-3.628
Token in
Token ID287
Feature activation-8.09e-01

Pos logit contributions
isk+3.758
 bisc+3.441
ens+3.395
arl+3.337
coat+3.288
Neg logit contributions
work-1.856
ised-1.842
inated-1.828
ized-1.725
ble-1.510
Token a
Token ID257
Feature activation-3.41e+00

Pos logit contributions
isk+1.621
arl+1.462
ens+1.438
coat+1.411
ives+1.395
Neg logit contributions
inated-1.246
work-1.220
ised-1.215
ized-1.089
el-1.075
Token house
Token ID2156
Feature activation-1.40e+00

Pos logit contributions
isk+7.743
ives+7.058
ens+6.902
coat+6.584
arl+6.584
Neg logit contributions
inated-5.014
 top-4.959
ised-4.918
 sunset-4.578
ized-4.464
Token near
Token ID1474
Feature activation-4.78e-01

Pos logit contributions
isk+3.295
arl+3.000
ens+2.988
 bisc+2.947
coat+2.840
Neg logit contributions
inated-1.706
work-1.688
ised-1.652
ized-1.538
izing-1.365
Token the
Token ID262
Feature activation-2.50e-01

Pos logit contributions
isk+0.909
arl+0.817
ens+0.803
coat+0.782
 bisc+0.775
Neg logit contributions
inated-0.789
work-0.768
ised-0.758
ized-0.688
el-0.671
Token woods
Token ID16479
Feature activation-5.63e-01

Pos logit contributions
isk+0.456
ens+0.396
arl+0.396
 bisc+0.379
coat+0.373
Neg logit contributions
inated-0.443
work-0.425
ised-0.419
ized-0.385
el-0.362
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
isk+1.267
arl+1.149
ens+1.140
 bisc+1.113
coat+1.096
Neg logit contributions
inated-0.739
work-0.712
ised-0.699
ized-0.633
el-0.578
Token to
Token ID284
Feature activation-2.13e-01

Pos logit contributions
isk+0.044
arl+0.039
ens+0.038
 bisc+0.037
coat+0.036
Neg logit contributions
inated-0.036
work-0.034
ised-0.034
ized-0.031
el-0.029
Token go
Token ID467
Feature activation+4.25e-01

Pos logit contributions
isk+0.292
arl+0.249
ens+0.246
 bisc+0.232
ives+0.225
Neg logit contributions
inated-0.465
work-0.447
ised-0.446
ized-0.419
el-0.400
Token away
Token ID1497
Feature activation+8.81e-01

Pos logit contributions
inated+0.834
work+0.798
ised+0.791
ized+0.731
el+0.694
Neg logit contributions
isk-0.695
ens-0.600
arl-0.597
 bisc-0.565
coat-0.560
Token.
Token ID13
Feature activation+2.50e+00

Pos logit contributions
inated+1.448
work+1.352
ised+1.348
ized+1.224
el+1.159
Neg logit contributions
isk-1.716
ens-1.517
arl-1.505
 bisc-1.438
 shady-1.438
Token
Token ID198
Feature activation+1.55e+00

Pos logit contributions
work+5.758
inated+5.544
ised+5.348
ble+5.272
ized+5.159
Neg logit contributions
isk-3.346
ens-2.925
arl-2.692
ives-2.629
and-2.628
Token
Token ID198
Feature activation+2.62e+00

Pos logit contributions
inated+3.351
work+3.336
ised+3.135
ized+2.941
izing+2.851
Neg logit contributions
isk-2.231
arl-1.905
ens-1.897
ives-1.759
and-1.744
TokenThen
Token ID6423
Feature activation+2.69e+00

Pos logit contributions
work+5.406
inated+5.134
ised+4.614
el+4.511
ble+4.417
Neg logit contributions
isk-4.255
and-3.741
arl-3.631
ives-3.608
ens-3.502
Token one
Token ID530
Feature activation-1.84e-01

Pos logit contributions
inated+6.352
work+6.018
ised+5.756
ble+5.588
ized+5.544
Neg logit contributions
ens-3.125
isk-3.061
 comes-3.019
 wake-2.725
 welcome-2.651
Token day
Token ID1110
Feature activation+5.22e-01

Pos logit contributions
isk+0.323
arl+0.281
ens+0.275
 bisc+0.263
coat+0.259
Neg logit contributions
inated-0.339
work-0.329
ised-0.323
ized-0.297
el-0.285
Token,
Token ID11
Feature activation+3.02e+00

Pos logit contributions
inated+1.032
work+0.988
ised+0.976
ized+0.907
el+0.859
Neg logit contributions
isk-0.832
ens-0.728
arl-0.715
 bisc-0.690
coat-0.670
Token her
Token ID607
Feature activation+1.65e+00

Pos logit contributions
inated+6.462
work+5.897
ised+5.788
el+5.411
ized+5.348
Neg logit contributions
isk-4.322
ens-4.131
 comes-3.895
 welcome-3.845
 wake-3.840
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
inated+0.670
work+0.634
ised+0.634
ized+0.571
el+0.530
Neg logit contributions
isk-0.889
arl-0.791
ens-0.790
 bisc-0.762
coat-0.752
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
inated+3.050
work+2.999
ised+2.928
ized+2.786
izing+2.662
Neg logit contributions
isk-2.223
arl-1.936
ens-1.920
coat-1.800
 bisc-1.785
Token
Token ID198
Feature activation+1.67e+00

Pos logit contributions
inated+2.293
work+2.241
ised+2.159
ized+2.034
izing+1.948
Neg logit contributions
isk-1.663
arl-1.432
ens-1.431
 shady-1.327
 bisc-1.326
TokenOne
Token ID3198
Feature activation-5.54e-03

Pos logit contributions
work+3.269
inated+3.229
ised+2.977
ized+2.785
ble+2.763
Neg logit contributions
isk-2.772
arl-2.398
ens-2.331
coat-2.291
 shady-2.275
Token day
Token ID1110
Feature activation+1.11e+00

Pos logit contributions
isk+0.009
arl+0.008
ens+0.008
 bisc+0.008
coat+0.008
Neg logit contributions
inated-0.010
work-0.010
ised-0.010
ized-0.009
el-0.009
Token,
Token ID11
Feature activation+2.39e+00

Pos logit contributions
inated+2.341
work+2.263
ised+2.210
ized+2.085
izing+1.982
Neg logit contributions
isk-1.586
ens-1.413
arl-1.387
 bisc-1.299
coat-1.295
Token in
Token ID287
Feature activation+7.38e-02

Pos logit contributions
inated+4.907
work+4.685
ised+4.407
ized+4.128
used+4.118
Neg logit contributions
isk-3.611
ens-3.360
arl-3.165
coat-3.132
 shady-2.989
Token the
Token ID262
Feature activation-4.04e-01

Pos logit contributions
inated+0.131
work+0.126
ised+0.125
ized+0.115
el+0.108
Neg logit contributions
isk-0.134
arl-0.118
ens-0.117
 bisc-0.113
coat-0.110
Token middle
Token ID3504
Feature activation-1.34e+00

Pos logit contributions
isk+0.711
arl+0.617
ens+0.613
 bisc+0.586
ives+0.578
Neg logit contributions
inated-0.745
work-0.710
ised-0.707
ized-0.649
el-0.627
Token of
Token ID286
Feature activation-3.65e-02

Pos logit contributions
isk+1.998
arl+1.727
 bisc+1.554
ens+1.551
 robber+1.517
Neg logit contributions
inated-2.850
ised-2.842
work-2.760
el-2.682
ized-2.646
Token spring
Token ID6076
Feature activation-3.21e-01

Pos logit contributions
isk+0.067
arl+0.059
ens+0.058
 bisc+0.056
coat+0.055
Neg logit contributions
inated-0.064
work-0.061
ised-0.061
ized-0.056
el-0.053
Token She
Token ID1375
Feature activation+1.02e+00

Pos logit contributions
inated+0.856
work+0.830
ised+0.818
ized+0.761
el+0.717
Neg logit contributions
isk-0.753
ens-0.658
arl-0.657
 bisc-0.625
coat-0.610
Token got
Token ID1392
Feature activation+1.22e+00

Pos logit contributions
inated+1.989
work+1.933
ised+1.903
ized+1.765
ement+1.697
Neg logit contributions
isk-1.706
ens-1.445
arl-1.384
 bisc-1.363
coat-1.362
Token very
Token ID845
Feature activation+1.16e+00

Pos logit contributions
inated+2.307
ised+2.176
work+2.155
ized+1.995
izing+1.949
Neg logit contributions
isk-2.070
ens-1.859
arl-1.807
 bisc-1.767
 shady-1.720
Token happy
Token ID3772
Feature activation+6.25e-01

Pos logit contributions
inated+2.277
work+2.218
ised+2.137
ized+1.967
izing+1.928
Neg logit contributions
isk-1.842
arl-1.662
ens-1.645
 bisc-1.557
 shady-1.542
Token and
Token ID290
Feature activation+1.22e-01

Pos logit contributions
inated+1.197
ised+1.139
work+1.134
ized+1.047
el+0.994
Neg logit contributions
isk-1.038
ens-0.905
arl-0.892
 bisc-0.869
coat-0.850
Token opened
Token ID4721
Feature activation+2.87e+00

Pos logit contributions
inated+0.233
work+0.223
ised+0.222
ized+0.205
el+0.194
Neg logit contributions
isk-0.206
arl-0.179
ens-0.177
 bisc-0.169
coat-0.165
Token it
Token ID340
Feature activation+1.88e+00

Pos logit contributions
work+6.288
inated+6.159
ised+5.889
ble+5.453
ized+5.450
Neg logit contributions
isk-3.921
ens-3.641
 shady-3.607
 Luna-3.568
 comes-3.420
Token.
Token ID13
Feature activation+8.08e-01

Pos logit contributions
inated+3.401
work+3.306
ised+3.234
ized+2.976
used+2.900
Neg logit contributions
isk-3.248
 shady-2.865
ens-2.863
 bisc-2.818
coat-2.784
Token She
Token ID1375
Feature activation+5.52e-01

Pos logit contributions
inated+1.585
work+1.563
ised+1.530
ized+1.442
ble+1.364
Neg logit contributions
isk-1.276
ens-1.116
arl-1.111
 bisc-1.050
coat-1.028
Token saw
Token ID2497
Feature activation+5.28e-01

Pos logit contributions
inated+1.149
work+1.107
ised+1.101
ized+1.022
izing+0.979
Neg logit contributions
isk-0.828
ens-0.698
arl-0.684
 bisc-0.660
coat-0.649
Token that
Token ID326
Feature activation+6.01e-01

Pos logit contributions
inated+0.915
work+0.866
ised+0.864
ized+0.788
izing+0.744
Neg logit contributions
isk-0.977
ens-0.863
arl-0.854
 bisc-0.825
coat-0.803
Token walking
Token ID6155
Feature activation+1.22e+00

Pos logit contributions
inated+0.360
work+0.343
ised+0.341
ized+0.313
el+0.294
Neg logit contributions
isk-0.378
arl-0.332
ens-0.331
 bisc-0.317
coat-0.310
Token down
Token ID866
Feature activation+1.26e+00

Pos logit contributions
inated+2.150
ised+2.026
work+1.998
ized+1.869
izing+1.833
Neg logit contributions
isk-2.202
ens-1.982
arl-1.932
coat-1.861
 shady-1.845
Token the
Token ID262
Feature activation+2.03e+00

Pos logit contributions
inated+2.574
ised+2.419
work+2.386
ized+2.265
ement+2.186
Neg logit contributions
isk-1.938
ens-1.712
arl-1.697
 shady-1.633
 bisc-1.584
Token street
Token ID4675
Feature activation+1.60e+00

Pos logit contributions
inated+4.945
ised+4.869
work+4.765
ement+4.652
ized+4.548
Neg logit contributions
isk-2.253
ens-1.908
arl-1.874
 shady-1.853
coat-1.742
Token,
Token ID11
Feature activation+8.22e-01

Pos logit contributions
inated+2.824
ised+2.682
work+2.638
ized+2.406
ble+2.386
Neg logit contributions
isk-2.865
ens-2.487
coat-2.452
 shady-2.422
arl-2.416
Token adm
Token ID6178
Feature activation+2.27e+00

Pos logit contributions
inated+1.545
work+1.488
ised+1.480
ized+1.349
el+1.299
Neg logit contributions
isk-1.391
ens-1.212
arl-1.189
coat-1.148
 bisc-1.132
Tokeniring
Token ID3428
Feature activation+3.59e-01

Pos logit contributions
inated+2.413
work+2.293
ised+1.874
used+1.758
ized+1.545
Neg logit contributions
isk-5.688
arl-5.592
ens-5.178
 shady-5.153
aks-5.053
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
inated+0.636
work+0.607
ised+0.602
ized+0.554
el+0.520
Neg logit contributions
isk-0.647
ens-0.567
arl-0.567
 bisc-0.540
coat-0.528
Token tidy
Token ID43044
Feature activation+1.08e-01

Pos logit contributions
inated+2.564
ised+2.416
work+2.397
ized+2.208
used+2.154
Neg logit contributions
isk-2.101
ens-1.872
arl-1.864
 bisc-1.812
 shady-1.778
Token gardens
Token ID25476
Feature activation+7.37e-01

Pos logit contributions
inated+0.219
work+0.211
ised+0.209
ized+0.195
el+0.185
Neg logit contributions
isk-0.166
arl-0.142
ens-0.141
 bisc-0.134
coat-0.130
Token.
Token ID13
Feature activation+6.88e-01

Pos logit contributions
inated+1.231
ised+1.159
work+1.157
ized+1.045
el+0.990
Neg logit contributions
isk-1.389
ens-1.235
arl-1.222
 bisc-1.181
coat-1.167
Token scared
Token ID12008
Feature activation-7.42e-01

Pos logit contributions
inated+0.954
work+0.915
ised+0.912
ized+0.851
izing+0.807
Neg logit contributions
isk-0.706
ens-0.607
arl-0.600
 bisc-0.571
coat-0.560
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
isk+1.683
arl+1.498
 bisc+1.470
ens+1.458
ives+1.425
Neg logit contributions
inated-0.987
work-0.949
ised-0.936
ized-0.826
el-0.813
Token
Token ID220
Feature activation+2.16e-01

Pos logit contributions
inated+2.316
work+2.284
ised+2.208
ized+2.087
izing+1.958
Neg logit contributions
isk-1.940
ens-1.686
arl-1.673
 bisc-1.566
coat-1.522
Token When
Token ID1649
Feature activation+1.38e+00

Pos logit contributions
inated+0.400
work+0.383
ised+0.380
ized+0.351
el+0.332
Neg logit contributions
isk-0.370
arl-0.323
ens-0.321
 bisc-0.306
coat-0.299
Token she
Token ID673
Feature activation-6.00e-01

Pos logit contributions
inated+2.801
work+2.667
ised+2.622
ized+2.428
ement+2.301
Neg logit contributions
isk-2.168
ens-1.974
arl-1.882
 bisc-1.847
 shady-1.786
Token woke
Token ID19092
Feature activation+2.38e+00

Pos logit contributions
isk+0.970
arl+0.867
ens+0.822
 bisc+0.785
ives+0.777
Neg logit contributions
inated-1.181
ised-1.136
work-1.129
ized-1.047
el-1.022
Token up
Token ID510
Feature activation+1.51e-01

Pos logit contributions
work+3.474
inated+3.450
ised+3.293
ized+2.890
ement+2.687
Neg logit contributions
isk-4.921
ens-4.692
arl-4.469
 Luna-4.424
 stret-4.349
Token,
Token ID11
Feature activation+1.84e+00

Pos logit contributions
inated+0.286
work+0.274
ised+0.272
ized+0.252
el+0.238
Neg logit contributions
isk-0.255
arl-0.222
ens-0.220
 bisc-0.210
coat-0.205
Token she
Token ID673
Feature activation-1.55e-01

Pos logit contributions
inated+3.923
work+3.777
ised+3.632
ized+3.356
ement+3.283
Neg logit contributions
isk-2.743
ens-2.477
arl-2.247
 shady-2.211
 germ-2.210
Token was
Token ID373
Feature activation+1.59e-01

Pos logit contributions
isk+0.248
arl+0.219
ens+0.213
 bisc+0.203
coat+0.197
Neg logit contributions
inated-0.306
ised-0.293
work-0.293
ized-0.272
el-0.259
Token confused
Token ID10416
Feature activation+1.22e+00

Pos logit contributions
inated+0.268
work+0.256
ised+0.254
ized+0.232
el+0.218
Neg logit contributions
isk-0.303
arl-0.267
ens-0.266
 bisc-0.256
coat-0.250
Token pointed
Token ID6235
Feature activation+1.06e+00

Pos logit contributions
inated+0.093
work+0.090
ised+0.089
ized+0.082
el+0.077
Neg logit contributions
isk-0.087
arl-0.077
ens-0.076
 bisc-0.072
coat-0.071
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
inated+1.914
work+1.853
ised+1.816
ized+1.655
izing+1.579
Neg logit contributions
isk-1.810
ens-1.647
arl-1.638
 bisc-1.583
 shady-1.555
Token arrow
Token ID15452
Feature activation+1.55e+00

Pos logit contributions
inated+2.008
ised+1.892
work+1.876
ized+1.712
used+1.578
Neg logit contributions
isk-2.530
arl-2.320
ens-2.306
 bisc-2.290
 shady-2.189
Token at
Token ID379
Feature activation+1.08e+00

Pos logit contributions
inated+2.750
ised+2.681
work+2.610
ized+2.376
ement+2.259
Neg logit contributions
isk-2.669
ens-2.407
arl-2.355
 bisc-2.351
 shady-2.309
Token the
Token ID262
Feature activation+1.32e+00

Pos logit contributions
inated+2.145
ised+2.007
work+2.003
ized+1.848
izing+1.769
Neg logit contributions
isk-1.772
ens-1.533
arl-1.472
 bisc-1.425
 shady-1.392
Token sky
Token ID6766
Feature activation+5.54e-01

Pos logit contributions
inated+2.362
ised+2.279
work+2.188
ized+2.076
ement+1.955
Neg logit contributions
isk-2.371
ens-2.081
arl-2.081
 bisc-2.064
 shady-2.013
Token.
Token ID13
Feature activation+1.83e-01

Pos logit contributions
inated+0.842
ised+0.793
work+0.788
ized+0.714
el+0.662
Neg logit contributions
isk-1.127
ens-1.005
arl-1.002
 bisc-0.974
coat-0.953
Token He
Token ID679
Feature activation+1.49e+00

Pos logit contributions
inated+0.359
work+0.344
ised+0.342
ized+0.317
el+0.301
Neg logit contributions
isk-0.298
arl-0.258
ens-0.256
 bisc-0.244
coat-0.238
Token took
Token ID1718
Feature activation+9.47e-03

Pos logit contributions
inated+3.047
work+2.881
ised+2.850
ized+2.638
izing+2.580
Neg logit contributions
isk-2.281
ens-1.930
 bisc-1.918
 shady-1.838
 comes-1.814
Token a
Token ID257
Feature activation-1.17e-01

Pos logit contributions
inated+0.015
work+0.014
ised+0.014
ized+0.012
el+0.012
Neg logit contributions
isk-0.019
arl-0.017
ens-0.017
 bisc-0.017
coat-0.016
Token deep
Token ID2769
Feature activation+5.35e-01

Pos logit contributions
isk+0.185
arl+0.157
ens+0.157
 bisc+0.146
coat+0.144
Neg logit contributions
inated-0.236
work-0.227
ised-0.224
ized-0.208
el-0.198
Token other
Token ID584
Feature activation+4.57e-02

Pos logit contributions
isk+0.162
arl+0.140
ens+0.138
 bisc+0.131
coat+0.129
Neg logit contributions
inated-0.198
work-0.190
ised-0.189
ized-0.176
el-0.167
Token mushroom
Token ID28520
Feature activation-2.37e-01

Pos logit contributions
inated+0.089
work+0.085
ised+0.085
ized+0.079
el+0.075
Neg logit contributions
isk-0.075
arl-0.065
ens-0.064
 bisc-0.061
coat-0.060
Token.
Token ID13
Feature activation+4.70e-01

Pos logit contributions
isk+0.502
arl+0.454
ens+0.443
 bisc+0.429
coat+0.423
Neg logit contributions
inated-0.344
work-0.331
ised-0.321
ized-0.294
el-0.272
Token Inside
Token ID14384
Feature activation+3.40e-02

Pos logit contributions
inated+0.970
work+0.943
ised+0.931
ized+0.872
el+0.826
Neg logit contributions
isk-0.709
ens-0.603
arl-0.601
 bisc-0.572
coat-0.548
Token the
Token ID262
Feature activation+6.43e-01

Pos logit contributions
inated+0.062
work+0.060
ised+0.059
ized+0.055
el+0.052
Neg logit contributions
isk-0.059
arl-0.052
ens-0.051
 bisc-0.049
coat-0.048
Token mushroom
Token ID28520
Feature activation+3.19e-01

Pos logit contributions
inated+1.232
work+1.173
ised+1.171
ized+1.081
izing+1.019
Neg logit contributions
isk-1.077
ens-0.940
arl-0.929
 bisc-0.898
 shady-0.866
Token was
Token ID373
Feature activation-8.73e-02

Pos logit contributions
inated+0.592
work+0.565
ised+0.562
ized+0.518
el+0.489
Neg logit contributions
isk-0.548
ens-0.476
arl-0.473
 bisc-0.456
coat-0.443
Token a
Token ID257
Feature activation+4.01e-02

Pos logit contributions
isk+0.171
arl+0.153
ens+0.151
 bisc+0.145
coat+0.142
Neg logit contributions
inated-0.141
work-0.134
ised-0.133
ized-0.122
el-0.113
Token secret
Token ID3200
Feature activation+3.73e-01

Pos logit contributions
inated+0.075
work+0.072
ised+0.071
ized+0.066
el+0.062
Neg logit contributions
isk-0.068
arl-0.059
ens-0.059
 bisc-0.056
coat-0.055
Token room
Token ID2119
Feature activation-6.32e-03

Pos logit contributions
inated+0.684
work+0.651
ised+0.650
ized+0.599
el+0.565
Neg logit contributions
isk-0.645
arl-0.561
ens-0.560
 bisc-0.541
coat-0.526
Token full
Token ID1336
Feature activation-6.42e-01

Pos logit contributions
isk+0.013
arl+0.011
ens+0.011
 bisc+0.011
coat+0.011
Neg logit contributions
inated-0.010
work-0.009
ised-0.009
ized-0.008
el-0.008
Token that
Token ID326
Feature activation+1.22e-01

Pos logit contributions
inated+1.898
work+1.772
ised+1.764
ized+1.608
used+1.517
Neg logit contributions
isk-1.985
ens-1.752
arl-1.746
 bisc-1.649
 shady-1.638
Token the
Token ID262
Feature activation+4.48e-01

Pos logit contributions
inated+0.228
work+0.218
ised+0.217
ized+0.200
el+0.189
Neg logit contributions
isk-0.210
arl-0.183
ens-0.181
 bisc-0.173
coat-0.169
Token things
Token ID1243
Feature activation+5.89e-01

Pos logit contributions
inated+0.908
work+0.871
ised+0.863
ized+0.803
el+0.761
Neg logit contributions
isk-0.698
arl-0.605
ens-0.602
 bisc-0.574
coat-0.556
Token that
Token ID326
Feature activation-5.84e-01

Pos logit contributions
inated+1.128
ised+1.069
work+1.064
ized+0.982
el+0.933
Neg logit contributions
isk-0.970
ens-0.855
arl-0.841
 bisc-0.797
coat-0.782
Token are
Token ID389
Feature activation+5.66e-01

Pos logit contributions
isk+1.017
arl+0.925
ens+0.863
 bisc+0.854
coat+0.838
Neg logit contributions
inated-1.054
work-1.038
ised-1.003
ized-0.936
el-0.894
Token truly
Token ID4988
Feature activation+3.76e-01

Pos logit contributions
inated+1.042
ised+0.986
work+0.984
ized+0.906
el+0.860
Neg logit contributions
isk-0.997
arl-0.865
ens-0.863
 bisc-0.818
 shady-0.800
Token valuable
Token ID8119
Feature activation+5.02e-01

Pos logit contributions
inated+0.699
work+0.665
ised+0.665
ized+0.612
el+0.584
Neg logit contributions
isk-0.644
arl-0.559
ens-0.558
 bisc-0.533
coat-0.518
Token are
Token ID389
Feature activation+6.17e-01

Pos logit contributions
inated+0.931
work+0.876
ised+0.876
ized+0.809
el+0.765
Neg logit contributions
isk-0.870
ens-0.766
arl-0.757
 bisc-0.723
ives-0.707
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
inated+1.123
ised+1.058
work+1.056
ized+0.967
el+0.920
Neg logit contributions
isk-1.094
ens-0.952
arl-0.951
 bisc-0.906
 shady-0.882
Token ones
Token ID3392
Feature activation+4.49e-01

Pos logit contributions
inated+1.867
ised+1.753
work+1.714
ized+1.612
used+1.512
Neg logit contributions
isk-1.931
ens-1.723
arl-1.710
 bisc-1.664
coat-1.610
Token that
Token ID326
Feature activation-3.31e-01

Pos logit contributions
inated+0.801
ised+0.757
work+0.756
ized+0.692
el+0.655
Neg logit contributions
isk-0.798
ens-0.706
arl-0.701
 bisc-0.671
coat-0.654
Token when
Token ID618
Feature activation+5.66e-01

Pos logit contributions
inated+0.814
work+0.783
ised+0.766
ized+0.701
ement+0.656
Neg logit contributions
isk-0.939
ens-0.838
arl-0.826
 bisc-0.793
coat-0.782
Token she
Token ID673
Feature activation-1.47e-01

Pos logit contributions
inated+1.129
work+1.093
ised+1.079
ized+1.000
ement+0.945
Neg logit contributions
isk-0.896
ens-0.787
arl-0.773
 bisc-0.738
coat-0.719
Token reached
Token ID4251
Feature activation-1.29e-01

Pos logit contributions
isk+0.216
arl+0.186
ens+0.180
 bisc+0.171
coat+0.166
Neg logit contributions
inated-0.309
ised-0.297
work-0.296
ized-0.277
el-0.267
Token the
Token ID262
Feature activation+2.69e-01

Pos logit contributions
isk+0.239
arl+0.210
ens+0.207
 bisc+0.202
ives+0.196
Neg logit contributions
inated-0.222
work-0.212
ised-0.211
ized-0.192
el-0.184
Token top
Token ID1353
Feature activation-7.95e-02

Pos logit contributions
inated+0.241
work+0.220
ised+0.217
ized+0.180
el+0.155
Neg logit contributions
isk-0.724
arl-0.666
ens-0.663
 bisc-0.646
coat-0.637
Token,
Token ID11
Feature activation+5.71e-01

Pos logit contributions
isk+0.135
arl+0.117
ens+0.115
 bisc+0.111
coat+0.107
Neg logit contributions
inated-0.150
work-0.144
ised-0.143
ized-0.132
el-0.126
Token she
Token ID673
Feature activation+9.92e-02

Pos logit contributions
inated+1.182
work+1.139
ised+1.125
ized+1.047
el+0.995
Neg logit contributions
isk-0.861
ens-0.754
arl-0.731
 bisc-0.698
coat-0.681
Token saw
Token ID2497
Feature activation+6.45e-01

Pos logit contributions
inated+0.204
work+0.196
ised+0.195
ized+0.182
el+0.173
Neg logit contributions
isk-0.151
arl-0.130
ens-0.128
 bisc-0.122
coat-0.118
Token a
Token ID257
Feature activation-4.36e-01

Pos logit contributions
inated+1.058
work+1.008
ised+0.989
ized+0.897
el+0.845
Neg logit contributions
isk-1.265
ens-1.129
arl-1.114
 bisc-1.075
coat-1.052
Token boy
Token ID2933
Feature activation-1.27e-01

Pos logit contributions
isk+0.722
ens+0.616
arl+0.614
ives+0.577
 bisc+0.567
Neg logit contributions
inated-0.839
work-0.809
ised-0.800
ized-0.741
ble-0.711
Token who
Token ID508
Feature activation+1.18e+00

Pos logit contributions
isk+0.236
arl+0.210
ens+0.206
 bisc+0.197
coat+0.193
Neg logit contributions
inated-0.220
work-0.211
ised-0.207
ized-0.192
el-0.182
Token at
Token ID379
Feature activation+1.12e+00

Pos logit contributions
inated+0.861
work+0.820
ised+0.814
ized+0.749
el+0.712
Neg logit contributions
isk-0.822
arl-0.724
ens-0.720
 bisc-0.687
 shady-0.670
Token you
Token ID345
Feature activation+5.82e-01

Pos logit contributions
inated+2.088
ised+1.949
work+1.935
ized+1.791
ement+1.709
Neg logit contributions
isk-1.944
ens-1.702
arl-1.678
 shady-1.583
 bisc-1.579
Token and
Token ID290
Feature activation-2.37e-01

Pos logit contributions
inated+0.988
ised+0.933
work+0.930
ized+0.853
el+0.803
Neg logit contributions
isk-1.095
ens-0.958
arl-0.957
 bisc-0.914
coat-0.899
Token see
Token ID766
Feature activation+3.42e-01

Pos logit contributions
isk+0.330
arl+0.280
ens+0.273
 bisc+0.259
coat+0.253
Neg logit contributions
inated-0.516
work-0.501
ised-0.494
ized-0.462
el-0.440
Token how
Token ID703
Feature activation+7.13e-01

Pos logit contributions
inated+0.643
work+0.615
ised+0.611
ized+0.564
el+0.533
Neg logit contributions
isk-0.582
arl-0.505
ens-0.504
 bisc-0.480
coat-0.467
Token beautiful
Token ID4950
Feature activation+1.04e+00

Pos logit contributions
inated+1.461
work+1.399
ised+1.390
ized+1.287
el+1.235
Neg logit contributions
isk-1.062
arl-0.936
ens-0.927
 shady-0.861
 bisc-0.861
Token you
Token ID345
Feature activation+2.48e-01

Pos logit contributions
inated+1.700
work+1.586
ised+1.582
ized+1.436
used+1.353
Neg logit contributions
isk-1.964
ens-1.789
arl-1.747
 bisc-1.693
 shady-1.672
Token are
Token ID389
Feature activation+9.25e-01

Pos logit contributions
inated+0.576
work+0.555
ised+0.554
ized+0.520
el+0.497
Neg logit contributions
isk-0.310
arl-0.254
ens-0.253
 bisc-0.236
coat-0.228
Token,"
Token ID553
Feature activation+1.24e+00

Pos logit contributions
inated+1.608
ised+1.536
work+1.516
ized+1.398
el+1.314
Neg logit contributions
isk-1.685
arl-1.495
ens-1.478
 bisc-1.418
coat-1.404
Token she
Token ID673
Feature activation+4.49e-01

Pos logit contributions
inated+2.649
work+2.494
ised+2.491
ized+2.309
izing+2.235
Neg logit contributions
isk-1.843
ens-1.627
arl-1.508
 bisc-1.485
 Luna-1.433
Token said
Token ID531
Feature activation+3.58e-01

Pos logit contributions
inated+0.794
work+0.756
ised+0.752
ized+0.691
el+0.649
Neg logit contributions
isk-0.817
ens-0.717
arl-0.712
 bisc-0.685
coat-0.666
Token fluffy
Token ID39145
Feature activation+4.89e-03

Pos logit contributions
isk+0.923
arl+0.837
ens+0.823
 bisc+0.799
ives+0.789
Neg logit contributions
inated-0.678
work-0.640
ised-0.626
ized-0.563
el-0.547
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
inated+0.009
work+0.009
ised+0.008
ized+0.008
el+0.007
Neg logit contributions
isk-0.009
arl-0.008
ens-0.007
 bisc-0.007
coat-0.007
Token grey
Token ID13791
Feature activation+5.56e-01

Pos logit contributions
isk+1.893
arl+1.718
ives+1.635
ens+1.617
 bisc+1.592
Neg logit contributions
inated-1.762
work-1.658
ised-1.646
ized-1.505
el-1.452
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
inated+0.920
ised+0.874
work+0.873
ized+0.804
izing+0.746
Neg logit contributions
isk-1.071
ens-0.951
arl-0.940
 bisc-0.906
coat-0.896
Token One
Token ID1881
Feature activation-7.95e-01

Pos logit contributions
inated+0.305
work+0.292
ised+0.290
ized+0.267
el+0.252
Neg logit contributions
isk-0.300
arl-0.262
ens-0.261
 bisc-0.250
coat-0.244
Token day
Token ID1110
Feature activation+1.16e-01

Pos logit contributions
isk+1.520
arl+1.366
ives+1.273
ens+1.272
 bisc+1.265
Neg logit contributions
inated-1.344
work-1.276
ised-1.265
ized-1.143
el-1.130
Token,
Token ID11
Feature activation+8.76e-01

Pos logit contributions
inated+0.219
work+0.210
ised+0.209
ized+0.193
el+0.183
Neg logit contributions
isk-0.195
arl-0.169
ens-0.168
 bisc-0.161
coat-0.156
Token she
Token ID673
Feature activation-6.21e-01

Pos logit contributions
inated+1.713
work+1.672
ised+1.642
ized+1.522
el+1.444
Neg logit contributions
isk-1.386
ens-1.240
arl-1.212
 shady-1.149
 bisc-1.148
Token was
Token ID373
Feature activation-3.18e-01

Pos logit contributions
isk+1.035
arl+0.944
ens+0.879
 bisc+0.849
ives+0.837
Neg logit contributions
inated-1.193
ised-1.132
work-1.125
ized-1.055
el-1.013
Token walking
Token ID6155
Feature activation+1.45e-01

Pos logit contributions
isk+0.549
arl+0.482
ens+0.478
 bisc+0.461
ives+0.446
Neg logit contributions
inated-0.597
work-0.565
ised-0.561
ized-0.518
el-0.495
Token through
Token ID832
Feature activation+2.80e-01

Pos logit contributions
inated+0.232
work+0.220
ised+0.219
ized+0.199
el+0.186
Neg logit contributions
isk-0.289
arl-0.257
ens-0.255
 bisc-0.246
coat-0.241
Token see
Token ID766
Feature activation+7.19e-03

Pos logit contributions
isk+1.045
arl+0.864
ens+0.855
 bisc+0.832
ives+0.795
Neg logit contributions
inated-1.674
work-1.619
ised-1.571
ized-1.486
ble-1.451
Token it
Token ID340
Feature activation-3.78e-01

Pos logit contributions
inated+0.012
work+0.012
ised+0.012
ized+0.011
el+0.010
Neg logit contributions
isk-0.013
arl-0.012
ens-0.012
 bisc-0.011
coat-0.011
Token.
Token ID13
Feature activation+3.87e-01

Pos logit contributions
isk+0.797
arl+0.723
ens+0.713
 bisc+0.695
coat+0.681
Neg logit contributions
inated-0.548
work-0.527
ised-0.509
ized-0.468
el-0.441
Token
Token ID198
Feature activation-6.87e-02

Pos logit contributions
inated+0.745
work+0.721
ised+0.714
ized+0.665
el+0.626
Neg logit contributions
isk-0.639
arl-0.557
ens-0.551
 bisc-0.524
coat-0.509
Token
Token ID198
Feature activation+2.36e-02

Pos logit contributions
isk+0.122
ens+0.106
arl+0.106
 bisc+0.102
coat+0.100
Neg logit contributions
inated-0.124
ised-0.118
work-0.117
ized-0.108
el-0.102
TokenThey
Token ID2990
Feature activation-8.36e-02

Pos logit contributions
inated+0.046
ised+0.044
work+0.044
ized+0.040
el+0.038
Neg logit contributions
isk-0.039
ens-0.034
arl-0.034
 bisc-0.032
coat-0.031
Token sat
Token ID3332
Feature activation+7.00e-02

Pos logit contributions
isk+0.135
arl+0.118
ens+0.115
 bisc+0.110
coat+0.108
Neg logit contributions
inated-0.163
work-0.157
ised-0.156
ized-0.145
el-0.138
Token on
Token ID319
Feature activation+1.49e+00

Pos logit contributions
inated+0.121
work+0.115
ised+0.115
ized+0.105
el+0.099
Neg logit contributions
isk-0.131
arl-0.115
ens-0.115
 bisc-0.110
coat-0.108
Token the
Token ID262
Feature activation-3.01e-01

Pos logit contributions
inated+3.041
ised+2.897
work+2.871
ized+2.689
used+2.618
Neg logit contributions
isk-2.334
ens-2.043
 bisc-1.936
arl-1.936
 Luna-1.916
Token floor
Token ID4314
Feature activation-7.55e-01

Pos logit contributions
isk+0.370
arl+0.299
ens+0.296
 bisc+0.275
ives+0.265
Neg logit contributions
inated-0.712
work-0.689
ised-0.676
ized-0.641
el-0.614
Token and
Token ID290
Feature activation+1.63e-01

Pos logit contributions
isk+1.446
arl+1.294
ens+1.287
 bisc+1.250
coat+1.180
Neg logit contributions
inated-1.261
work-1.219
ised-1.207
ized-1.122
ement-1.038
Token books
Token ID3835
Feature activation+7.50e-01

Pos logit contributions
inated+1.620
work+1.539
ised+1.523
ized+1.403
used+1.346
Neg logit contributions
isk-1.484
ens-1.318
arl-1.275
 bisc-1.264
 shady-1.218
Token.
Token ID13
Feature activation-3.99e-01

Pos logit contributions
inated+1.210
ised+1.129
work+1.119
ized+1.023
used+0.959
Neg logit contributions
isk-1.484
ens-1.328
arl-1.311
 bisc-1.258
coat-1.249
Token She
Token ID1375
Feature activation-5.32e-01

Pos logit contributions
isk+0.748
arl+0.658
ens+0.646
 bisc+0.630
coat+0.612
Neg logit contributions
inated-0.689
work-0.652
ised-0.643
ized-0.581
el-0.567
Token had
Token ID550
Feature activation-6.69e-01

Pos logit contributions
isk+0.888
arl+0.823
ens+0.760
 bisc+0.738
ives+0.712
Neg logit contributions
inated-0.995
work-0.957
ised-0.939
ized-0.873
el-0.837
Token many
Token ID867
Feature activation-7.71e-01

Pos logit contributions
isk+1.411
arl+1.283
ens+1.234
 bisc+1.175
ives+1.165
Neg logit contributions
inated-0.998
work-0.942
ised-0.926
ized-0.838
el-0.799
Token books
Token ID3835
Feature activation-6.19e-01

Pos logit contributions
isk+1.217
arl+1.065
ens+0.994
 bisc+0.947
coat+0.931
Neg logit contributions
inated-1.539
ised-1.527
work-1.467
ized-1.417
el-1.368
Token in
Token ID287
Feature activation+2.72e-02

Pos logit contributions
isk+1.125
arl+0.986
ens+0.973
 bisc+0.950
coat+0.901
Neg logit contributions
inated-1.076
work-1.064
ised-1.033
ized-0.960
el-0.907
Token her
Token ID607
Feature activation-1.00e+00

Pos logit contributions
inated+0.049
work+0.047
ised+0.047
ized+0.043
el+0.041
Neg logit contributions
isk-0.048
arl-0.042
ens-0.042
 bisc-0.040
coat-0.039
Token room
Token ID2119
Feature activation-5.79e-01

Pos logit contributions
isk+1.826
arl+1.636
ens+1.594
ives+1.525
 bisc+1.506
Neg logit contributions
inated-1.770
work-1.682
ised-1.674
ized-1.520
el-1.516
Token.
Token ID13
Feature activation-6.33e-01

Pos logit contributions
isk+1.266
arl+1.159
ens+1.151
 bisc+1.114
coat+1.074
Neg logit contributions
inated-0.785
work-0.759
ised-0.736
ized-0.671
ement-0.619
Token Some
Token ID2773
Feature activation-1.60e+00

Pos logit contributions
isk+1.260
arl+1.100
ens+1.083
 bisc+1.063
coat+1.040
Neg logit contributions
inated-1.021
work-0.954
ised-0.947
ized-0.843
el-0.839
Token happy
Token ID3772
Feature activation-1.98e-01

Pos logit contributions
isk+1.238
arl+1.075
ens+1.055
 bisc+1.009
coat+0.998
Neg logit contributions
inated-1.192
ised-1.146
work-1.118
ized-1.067
ble-0.983
Token again
Token ID757
Feature activation+7.98e-01

Pos logit contributions
isk+0.382
arl+0.341
ens+0.333
 bisc+0.320
coat+0.316
Neg logit contributions
inated-0.328
work-0.316
ised-0.311
ized-0.286
el-0.267
Token and
Token ID290
Feature activation-6.12e-01

Pos logit contributions
inated+1.494
work+1.414
ised+1.407
ized+1.293
el+1.220
Neg logit contributions
isk-1.386
ens-1.199
arl-1.183
 bisc-1.147
 shady-1.121
Token fell
Token ID3214
Feature activation-8.38e-01

Pos logit contributions
isk+1.125
arl+1.023
ens+0.966
 bisc+0.936
coat+0.930
Neg logit contributions
inated-1.073
work-1.026
ised-1.019
ized-0.953
el-0.878
Token asleep
Token ID16039
Feature activation-1.64e-01

Pos logit contributions
isk+1.060
ens+0.856
arl+0.852
 bisc+0.827
ives+0.781
Neg logit contributions
inated-1.948
work-1.890
ised-1.862
ized-1.754
el-1.705
Token with
Token ID351
Feature activation+3.28e-02

Pos logit contributions
isk+0.324
arl+0.290
ens+0.288
 bisc+0.277
ives+0.269
Neg logit contributions
inated-0.260
work-0.249
ised-0.247
ized-0.226
izing-0.208
Token a
Token ID257
Feature activation-1.31e+00

Pos logit contributions
inated+0.057
work+0.054
ised+0.054
ized+0.050
el+0.047
Neg logit contributions
isk-0.061
arl-0.053
ens-0.053
 bisc-0.051
coat-0.050
Token smile
Token ID8212
Feature activation-1.49e-01

Pos logit contributions
isk+2.248
ens+1.858
arl+1.832
ives+1.717
aks+1.648
Neg logit contributions
inated-2.508
work-2.442
ised-2.427
ized-2.277
ement-2.185
Token on
Token ID319
Feature activation+9.61e-01

Pos logit contributions
isk+0.271
arl+0.236
ens+0.235
 bisc+0.225
coat+0.218
Neg logit contributions
inated-0.264
work-0.255
ised-0.249
ized-0.230
el-0.219
Token her
Token ID607
Feature activation+2.94e-01

Pos logit contributions
inated+1.920
work+1.907
ised+1.877
ized+1.735
izing+1.646
Neg logit contributions
isk-1.510
ens-1.320
arl-1.318
 bisc-1.239
 shady-1.196
Token face
Token ID1986
Feature activation+5.54e-01

Pos logit contributions
inated+0.622
work+0.599
ised+0.596
ized+0.555
el+0.528
Neg logit contributions
isk-0.431
arl-0.366
ens-0.364
 bisc-0.346
coat-0.334
Token was
Token ID373
Feature activation+8.87e-01

Pos logit contributions
isk+0.842
arl+0.774
ens+0.743
 bisc+0.720
coat+0.710
Neg logit contributions
inated-0.660
ised-0.628
work-0.622
ized-0.566
el-0.547
Token a
Token ID257
Feature activation-5.49e-01

Pos logit contributions
inated+1.540
work+1.480
ised+1.459
ized+1.345
izing+1.283
Neg logit contributions
isk-1.635
ens-1.459
arl-1.427
 bisc-1.410
 shady-1.362
Token boy
Token ID2933
Feature activation-1.85e-02

Pos logit contributions
isk+0.948
arl+0.818
ens+0.812
ives+0.783
 bisc+0.767
Neg logit contributions
inated-1.016
work-0.972
ised-0.961
ized-0.892
ble-0.851
Token named
Token ID3706
Feature activation-1.24e+00

Pos logit contributions
isk+0.025
arl+0.021
ens+0.021
 bisc+0.020
coat+0.019
Neg logit contributions
inated-0.041
work-0.040
ised-0.040
ized-0.037
el-0.036
Token Tim
Token ID5045
Feature activation-4.79e-01

Pos logit contributions
isk+2.286
arl+2.025
ives+1.960
ens+1.946
coat+1.913
Neg logit contributions
inated-2.210
work-2.091
ised-1.990
ized-1.828
el-1.797
Token .
Token ID13
Feature activation-1.29e+00

Pos logit contributions
isk+0.543
arl+0.457
ens+0.428
 bisc+0.420
ives+0.405
Neg logit contributions
inated-1.146
work-1.118
ised-1.105
ized-1.040
el-1.029
Token Tim
Token ID5045
Feature activation-1.85e-01

Pos logit contributions
isk+2.279
arl+2.007
ens+1.921
ives+1.920
 bisc+1.892
Neg logit contributions
inated-2.372
work-2.204
ised-2.184
el-1.991
ized-1.965
Token loved
Token ID6151
Feature activation-5.51e-01

Pos logit contributions
isk+0.280
arl+0.246
ens+0.235
 bisc+0.227
ives+0.220
Neg logit contributions
inated-0.380
work-0.366
ised-0.363
ized-0.338
el-0.326
Token cookies
Token ID14746
Feature activation-1.55e-01

Pos logit contributions
isk+0.924
arl+0.806
ens+0.781
 bisc+0.743
ives+0.738
Neg logit contributions
inated-1.063
work-1.023
ised-0.995
ized-0.929
el-0.896
Token and
Token ID290
Feature activation-1.95e+00

Pos logit contributions
isk+0.326
arl+0.295
ens+0.289
 bisc+0.280
coat+0.274
Neg logit contributions
inated-0.224
work-0.215
ised-0.212
ized-0.192
el-0.179
Token he
Token ID339
Feature activation-6.43e-01

Pos logit contributions
isk+3.531
arl+3.521
ives+3.140
aks+2.865
and+2.853
Neg logit contributions
inated-3.439
work-3.260
ised-3.206
ized-2.986
el-2.931
Token ,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
isk+1.670
arl+1.397
ens+1.369
 bisc+1.368
coat+1.341
Neg logit contributions
inated-1.536
work-1.462
ised-1.436
ized-1.318
ble-1.274
Token "
Token ID366
Feature activation+4.00e-01

Pos logit contributions
isk+2.033
ens+1.706
arl+1.679
 bisc+1.669
coat+1.613
Neg logit contributions
inated-2.341
work-2.207
ised-2.145
ized-1.961
ble-1.944
Token Mom
Token ID29252
Feature activation-1.59e+00

Pos logit contributions
inated+0.859
work+0.831
ised+0.818
ized+0.764
el+0.728
Neg logit contributions
isk-0.586
ens-0.501
arl-0.499
 bisc-0.471
coat-0.456
Token ,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
isk+2.998
arl+2.690
 bisc+2.617
 shady+2.561
ens+2.536
Neg logit contributions
work-2.638
inated-2.557
ised-2.439
el-2.371
used-2.211
Token mom
Token ID1995
Feature activation-1.37e+00

Pos logit contributions
isk+3.596
arl+3.124
aks+3.008
coat+2.984
ives+2.984
Neg logit contributions
work-2.667
inated-2.619
ised-2.314
el-2.230
ble-2.141
Token ,
Token ID11
Feature activation-1.92e+00

Pos logit contributions
isk+2.781
arl+2.535
 bisc+2.388
ens+2.342
ives+2.339
Neg logit contributions
work-2.100
inated-2.077
ised-1.996
ized-1.827
el-1.826
Token I
Token ID314
Feature activation-5.98e-01

Pos logit contributions
isk+4.033
arl+3.504
aks+3.358
ives+3.334
coat+3.330
Neg logit contributions
work-3.005
inated-2.895
ised-2.573
el-2.470
ble-2.383
Token 'm
Token ID1101
Feature activation+3.38e-01

Pos logit contributions
isk+0.953
arl+0.843
ens+0.805
 bisc+0.759
 shady+0.754
Neg logit contributions
inated-1.168
work-1.140
ised-1.120
ized-1.034
el-0.995
Token sorry
Token ID7926
Feature activation-2.00e-01

Pos logit contributions
inated+0.799
work+0.771
ised+0.769
ized+0.722
el+0.691
Neg logit contributions
isk-0.416
arl-0.343
ens-0.338
 bisc-0.314
coat-0.304
Token .
Token ID13
Feature activation-3.04e-01

Pos logit contributions
isk+0.418
arl+0.368
ens+0.365
 bisc+0.358
coat+0.351
Neg logit contributions
inated-0.300
work-0.285
ised-0.281
ized-0.254
el-0.241
Token I
Token ID314
Feature activation-8.88e-02

Pos logit contributions
isk+0.572
arl+0.493
ens+0.488
 bisc+0.475
coat+0.467
Neg logit contributions
inated-0.530
work-0.502
ised-0.493
ized-0.448
el-0.433
Token clean
Token ID3424
Feature activation+2.13e-01

Pos logit contributions
isk+4.577
arl+4.168
 bisc+3.817
ens+3.709
 robber+3.701
Neg logit contributions
work-4.226
inated-3.979
ised-3.875
ized-3.570
ble-3.393
Token up
Token ID510
Feature activation-4.81e-01

Pos logit contributions
inated+0.279
work+0.262
ised+0.260
ized+0.231
el+0.211
Neg logit contributions
isk-0.483
arl-0.436
ens-0.435
 bisc-0.420
coat-0.413
Token the
Token ID262
Feature activation-7.37e-01

Pos logit contributions
isk+0.912
arl+0.806
ens+0.787
 bisc+0.771
coat+0.761
Neg logit contributions
inated-0.803
work-0.789
ised-0.776
ized-0.713
el-0.674
Token mess
Token ID2085
Feature activation-1.02e+00

Pos logit contributions
isk+1.041
ens+0.841
arl+0.828
 bisc+0.773
ives+0.762
Neg logit contributions
inated-1.635
work-1.590
ised-1.580
ized-1.484
el-1.443
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
isk+2.256
arl+1.981
 bisc+1.972
ens+1.954
 shady+1.900
Neg logit contributions
work-1.437
inated-1.392
ised-1.366
ized-1.257
ble-1.164
Token went
Token ID1816
Feature activation-1.97e+00

Pos logit contributions
isk+2.710
arl+2.472
ens+2.246
ives+2.212
coat+2.181
Neg logit contributions
work-2.780
inated-2.727
ised-2.636
ized-2.455
el-2.263
Token to
Token ID284
Feature activation-6.22e-01

Pos logit contributions
isk+4.061
ens+3.469
 bisc+3.387
arl+3.322
 robber+3.192
Neg logit contributions
inated-3.124
work-3.111
ised-2.846
ized-2.645
el-2.506
Token bed
Token ID3996
Feature activation-3.97e-01

Pos logit contributions
isk+1.009
arl+0.880
ens+0.858
 bisc+0.831
ives+0.825
Neg logit contributions
inated-1.223
work-1.178
ised-1.147
ized-1.061
el-1.028
Token early
Token ID1903
Feature activation+3.74e-01

Pos logit contributions
isk+0.839
ens+0.749
arl+0.743
 bisc+0.727
coat+0.707
Neg logit contributions
inated-0.585
ised-0.555
work-0.554
ized-0.507
ble-0.463
Token .
Token ID13
Feature activation+7.16e-01

Pos logit contributions
inated+0.578
work+0.546
ised+0.545
ized+0.492
el+0.461
Neg logit contributions
isk-0.757
arl-0.674
ens-0.672
 bisc-0.648
coat-0.636
Token They
Token ID1119
Feature activation-6.75e-01

Pos logit contributions
inated+1.517
work+1.480
ised+1.479
ized+1.397
izing+1.321
Neg logit contributions
isk-1.036
arl-0.907
ens-0.888
 bisc-0.830
 shady-0.807
Token her
Token ID607
Feature activation+1.18e-01

Pos logit contributions
isk+0.785
arl+0.702
ens+0.669
 bisc+0.658
coat+0.646
Neg logit contributions
inated-0.744
work-0.719
ised-0.706
ized-0.657
el-0.630
Token the
Token ID262
Feature activation-1.77e-01

Pos logit contributions
inated+0.198
work+0.190
ised+0.188
ized+0.172
el+0.162
Neg logit contributions
isk-0.223
arl-0.197
ens-0.196
 bisc-0.188
coat-0.184
Token ring
Token ID5858
Feature activation+6.67e-01

Pos logit contributions
isk+0.283
arl+0.241
ens+0.240
 bisc+0.226
coat+0.221
Neg logit contributions
inated-0.347
work-0.338
ised-0.330
ized-0.307
el-0.294
Token .
Token ID13
Feature activation-1.43e+00

Pos logit contributions
inated+1.108
ised+1.060
work+1.031
ized+0.962
izing+0.901
Neg logit contributions
isk-1.281
ens-1.144
arl-1.135
 bisc-1.095
coat-1.075
Token He
Token ID679
Feature activation+3.63e-01

Pos logit contributions
isk+2.258
ens+1.923
 bisc+1.858
arl+1.845
coat+1.825
Neg logit contributions
inated-2.891
ised-2.755
work-2.636
ized-2.452
ement-2.410
Token thought
Token ID1807
Feature activation-1.93e+00

Pos logit contributions
inated+0.692
work+0.664
ised+0.659
ized+0.608
el+0.576
Neg logit contributions
isk-0.611
ens-0.528
arl-0.523
 bisc-0.503
coat-0.490
Token she
Token ID673
Feature activation+1.46e-01

Pos logit contributions
isk+3.975
arl+3.445
 bisc+3.329
coat+3.218
ens+3.143
Neg logit contributions
inated-2.981
work-2.975
ised-2.842
ized-2.697
ement-2.663
Token would
Token ID561
Feature activation-5.19e-01

Pos logit contributions
inated+0.293
work+0.281
ised+0.279
ized+0.260
el+0.247
Neg logit contributions
isk-0.229
arl-0.198
ens-0.196
 bisc-0.186
coat-0.181
Token like
Token ID588
Feature activation-6.45e-01

Pos logit contributions
isk+0.805
ens+0.691
arl+0.688
 bisc+0.654
coat+0.633
Neg logit contributions
inated-1.038
work-1.011
ised-0.978
ized-0.917
ble-0.883
Token it
Token ID340
Feature activation-2.23e-01

Pos logit contributions
isk+1.189
arl+1.060
ens+1.019
 bisc+1.002
coat+0.979
Neg logit contributions
inated-1.108
work-1.078
ised-1.032
ized-0.966
el-0.930
Token .
Token ID13
Feature activation-1.56e+00

Pos logit contributions
isk+0.451
arl+0.407
ens+0.398
 bisc+0.388
coat+0.379
Neg logit contributions
inated-0.340
work-0.328
ised-0.319
ized-0.291
el-0.274
Token .
Token ID13
Feature activation-1.15e-01

Pos logit contributions
isk+4.713
arl+4.434
 bisc+4.283
 shady+4.225
coat+4.190
Neg logit contributions
work-2.372
inated-2.176
el-1.999
ised-1.910
ized-1.693
Token Her
Token ID2332
Feature activation+2.15e-01

Pos logit contributions
isk+0.201
arl+0.174
ens+0.173
 bisc+0.168
coat+0.163
Neg logit contributions
inated-0.212
ised-0.200
work-0.200
ized-0.183
el-0.175
Token mom
Token ID1995
Feature activation-2.41e+00

Pos logit contributions
inated+0.342
work+0.325
ised+0.323
ized+0.294
el+0.274
Neg logit contributions
isk-0.431
arl-0.383
ens-0.382
 bisc-0.367
coat-0.360
Token hears
Token ID23180
Feature activation-1.47e+00

Pos logit contributions
isk+3.532
arl+3.035
 bisc+2.987
coat+2.971
 shady+2.892
Neg logit contributions
inated-4.934
el-4.883
ised-4.874
work-4.734
used-4.608
Token her
Token ID607
Feature activation-6.38e-01

Pos logit contributions
isk+2.988
coat+2.701
 bisc+2.688
arl+2.634
 shady+2.573
Neg logit contributions
inated-2.207
ised-2.166
work-2.056
ized-1.973
el-1.955
Token and
Token ID290
Feature activation-1.82e+00

Pos logit contributions
isk+1.112
arl+0.962
ens+0.951
 bisc+0.934
coat+0.915
Neg logit contributions
inated-1.169
work-1.115
ised-1.098
ized-1.010
el-0.985
Token comes
Token ID2058
Feature activation-1.18e+00

Pos logit contributions
isk+1.856
arl+1.516
 bisc+1.451
coat+1.438
ives+1.305
Neg logit contributions
inated-4.700
work-4.431
ised-4.428
el-4.269
used-4.229
Token outside
Token ID2354
Feature activation-1.37e-01

Pos logit contributions
isk+2.222
arl+1.955
 bisc+1.934
ens+1.930
coat+1.850
Neg logit contributions
inated-1.993
work-1.898
ised-1.893
el-1.714
ized-1.685
Token .
Token ID13
Feature activation-1.04e+00

Pos logit contributions
isk+0.298
arl+0.267
ens+0.265
 bisc+0.258
coat+0.253
Neg logit contributions
inated-0.193
work-0.182
ised-0.181
ized-0.163
el-0.152
Token She
Token ID1375
Feature activation-1.70e+00

Pos logit contributions
isk+1.936
arl+1.651
 bisc+1.650
coat+1.620
ens+1.606
Neg logit contributions
inated-1.830
ised-1.695
work-1.658
ized-1.503
el-1.479
Token sees
Token ID7224
Feature activation-9.43e-01

Pos logit contributions
isk+2.250
coat+1.950
arl+1.929
 bisc+1.884
ens+1.845
Neg logit contributions
inated-3.706
ised-3.604
work-3.525
el-3.406
ized-3.357
Token the
Token ID262
Feature activation+6.01e-02

Pos logit contributions
inated+0.503
work+0.482
ised+0.479
ized+0.442
el+0.417
Neg logit contributions
isk-0.471
arl-0.409
ens-0.408
 bisc-0.390
coat-0.379
Token house
Token ID2156
Feature activation-1.07e+00

Pos logit contributions
inated+0.119
work+0.115
ised+0.113
ized+0.105
el+0.100
Neg logit contributions
isk-0.096
arl-0.082
ens-0.082
 bisc-0.077
coat-0.076
Token .
Token ID13
Feature activation+1.26e-01

Pos logit contributions
isk+2.073
arl+1.877
ens+1.852
 bisc+1.795
ives+1.709
Neg logit contributions
inated-1.765
work-1.744
ised-1.682
ized-1.570
el-1.424
Token They
Token ID1119
Feature activation-7.17e-01

Pos logit contributions
inated+0.247
work+0.237
ised+0.235
ized+0.218
el+0.207
Neg logit contributions
isk-0.206
arl-0.178
ens-0.177
 bisc-0.169
coat-0.164
Token are
Token ID389
Feature activation-1.20e+00

Pos logit contributions
isk+0.990
arl+0.859
ens+0.810
 bisc+0.784
coat+0.752
Neg logit contributions
inated-1.557
work-1.509
ised-1.476
ized-1.396
el-1.336
Token hungry
Token ID14720
Feature activation-2.01e+00

Pos logit contributions
isk+2.169
arl+1.889
ens+1.888
ives+1.819
 bisc+1.761
Neg logit contributions
inated-2.187
work-2.104
ised-2.067
ized-1.931
ble-1.901
Token and
Token ID290
Feature activation-1.37e+00

Pos logit contributions
isk+4.911
arl+4.407
ens+4.147
coat+4.098
Rex+3.998
Neg logit contributions
work-2.364
inated-2.330
ised-2.177
ized-2.012
el-1.989
Token happy
Token ID3772
Feature activation+3.27e-01

Pos logit contributions
isk+2.511
arl+2.133
ens+2.057
ives+2.014
coat+2.006
Neg logit contributions
inated-2.542
work-2.400
ised-2.373
ized-2.240
used-2.079
Token .
Token ID13
Feature activation+5.93e-01

Pos logit contributions
inated+0.496
work+0.466
ised+0.466
ized+0.421
el+0.391
Neg logit contributions
isk-0.680
arl-0.607
ens-0.606
 bisc-0.585
coat-0.572
Token They
Token ID1119
Feature activation-8.10e-01

Pos logit contributions
inated+1.187
work+1.156
ised+1.141
ized+1.073
izing+1.011
Neg logit contributions
isk-0.928
arl-0.816
ens-0.809
 bisc-0.766
 shady-0.738
Token had
Token ID550
Feature activation-8.87e-01

Pos logit contributions
isk+1.102
arl+0.941
ens+0.881
 bisc+0.845
coat+0.845
Neg logit contributions
inated-1.778
work-1.705
ised-1.670
ized-1.584
el-1.515