TOP ACTIVATIONS
MAX = 543.553

I am sorry , m ister . That is very sad
said , " I am sorry , m ister . That
no family . Lily felt sorry for him .
, " Hello , m ister . Do you want to
Oh ! I 'm so sorry ," he said . "
The two cr anes were always loyal to each
." Tom felt sorry . He said , "
said , " I 'm sorry , mom . I will
a moment . He felt sorry . He liked his truck
very kind . I 'm sorry I was selfish . You
Tom crying . She felt sorry for him . She ran
feel better . I 'm sorry you hurt your knee ,"
med . From that day onwards , the grape learned to
" Then you should say sorry to Sara and give her
reply , " I 'm sorry , I didn 't know
. Lee . We are sorry for the mess . We
said , " I 'm sorry . I won 't be
The puppy then looked very sorry and bowed his head .
" We are sorry . We did not know
on some cozy p aj amas and sn ugg led up

MIN ACTIVATIONS
MIN = 543.553

upon a time , there was a kind person named Tom
. <|endoftext|> Once upon a time , there was a kind
, and he never had a good night 's sleep again
again . <|endoftext|> Once upon a time , there was a
scared . Tom went to his bed and hid
say anything either . Tom was scared . Tom
Tom went to his bed and hid under the covers .
under the covers . He heard the sound again . It
you every night !" Tom was very scared and cried .
Tom and said , " I will scare you every night
big monster . The monster looked at Tom and said ,
The monster looked at Tom and said , " I will
that day on , Tom was always scared in his room
!" Tom was very scared and cried . From that day
day on , Tom was always scared in his room ,
room , and he never had a good night 's sleep
opened the closet and saw a big monster . The monster
Tom opened the closet and saw a big monster . The
the sound again . It was coming from his closet !
! Tom opened the closet and saw a big monster .

INTERVAL 543.553 - 543.553
CONTAINS 0.000%

TokenI
Token ID40
Feature activation+5.44e+02

Pos logit contributions
 and+5.139
,+5.136
 the+4.909
.+4.876
 a+4.691
Neg logit contributions
lor-4.377
umm-4.255
arians-4.158
ocol-4.147
rol-4.083
Token am
Token ID716
Feature activation+5.44e+02

Pos logit contributions
 and+4.614
,+4.534
 the+4.478
.+4.341
 a+4.203
Neg logit contributions
lor-4.650
 former-4.510
ocol-4.488
umm-4.463
arians-4.454
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.968
,+4.855
.+4.667
 the+4.643
 a+4.422
Neg logit contributions
lor-4.384
ocol-4.301
rol-4.210
umm-4.205
arians-4.185
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.885
,+0.756
 the+0.698
.+0.559
 a+0.505
Neg logit contributions
lor-8.298
umm-8.242
ocol-8.211
arians-8.156
osity-8.109
Token m
Token ID285
Feature activation+5.44e+02

Pos logit contributions
 and+5.689
,+5.669
.+5.510
 the+5.469
 a+5.314
Neg logit contributions
lor-3.522
ocol-3.478
umm-3.472
arians-3.421
osity-3.323
Tokenister
Token ID1694
Feature activation+5.44e+02

Pos logit contributions
 and+7.251
,+7.153
 the+7.048
.+6.953
 a+6.857
Neg logit contributions
lor-2.217
umm-2.080
ocol-2.034
arians-2.012
rol-1.943
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.975
,+0.869
 the+0.806
 a+0.649
.+0.598
Neg logit contributions
lor-8.236
arians-8.131
umm-8.118
ocol-8.054
osity-8.017
Token That
Token ID1320
Feature activation+5.44e+02

Pos logit contributions
 and+4.436
,+4.338
 the+4.165
.+4.132
 a+3.962
Neg logit contributions
lor-5.011
umm-4.953
ocol-4.848
osity-4.807
arians-4.798
Token is
Token ID318
Feature activation+5.44e+02

Pos logit contributions
 and+2.623
,+2.586
 the+2.429
.+2.361
 a+2.180
Neg logit contributions
lor-6.714
umm-6.526
arians-6.469
rol-6.392
 former-6.375
Token very
Token ID845
Feature activation+5.44e+02

Pos logit contributions
 and+3.400
,+3.328
.+3.093
 the+3.046
 a+2.819
Neg logit contributions
lor-5.959
ocol-5.835
umm-5.823
arians-5.785
 former-5.746
Token sad
Token ID6507
Feature activation+5.44e+02

Pos logit contributions
 and+4.022
,+3.908
 the+3.724
.+3.666
 a+3.458
Neg logit contributions
lor-5.398
ocol-5.290
 former-5.240
umm-5.232
osity-5.151
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
,+3.025
 and+2.991
 the+2.845
.+2.797
 a+2.590
Neg logit contributions
lor-6.255
arians-6.043
 former-5.985
umm-5.964
rol-5.938
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.518
,+0.331
 the+0.266
.+0.241
 a+0.079
Neg logit contributions
lor-8.805
ocol-8.683
umm-8.637
arians-8.567
osity-8.553
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+2.417
,+2.359
 the+2.226
.+2.177
 a+2.041
Neg logit contributions
lor-6.817
arians-6.633
ocol-6.625
osity-6.625
umm-6.600
TokenI
Token ID40
Feature activation+5.44e+02

Pos logit contributions
 and+5.139
,+5.136
 the+4.909
.+4.876
 a+4.691
Neg logit contributions
lor-4.377
umm-4.255
arians-4.158
ocol-4.147
rol-4.083
Token am
Token ID716
Feature activation+5.44e+02

Pos logit contributions
 and+4.614
,+4.534
 the+4.478
.+4.341
 a+4.203
Neg logit contributions
lor-4.650
 former-4.510
ocol-4.488
umm-4.463
arians-4.454
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.968
,+4.855
.+4.667
 the+4.643
 a+4.422
Neg logit contributions
lor-4.384
ocol-4.301
rol-4.210
umm-4.205
arians-4.185
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.885
,+0.756
 the+0.698
.+0.559
 a+0.505
Neg logit contributions
lor-8.298
umm-8.242
ocol-8.211
arians-8.156
osity-8.109
Token m
Token ID285
Feature activation+5.44e+02

Pos logit contributions
 and+5.689
,+5.669
.+5.510
 the+5.469
 a+5.314
Neg logit contributions
lor-3.522
ocol-3.478
umm-3.472
arians-3.421
osity-3.323
Tokenister
Token ID1694
Feature activation+5.44e+02

Pos logit contributions
 and+7.251
,+7.153
 the+7.048
.+6.953
 a+6.857
Neg logit contributions
lor-2.217
umm-2.080
ocol-2.034
arians-2.012
rol-1.943
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.975
,+0.869
 the+0.806
 a+0.649
.+0.598
Neg logit contributions
lor-8.236
arians-8.131
umm-8.118
ocol-8.054
osity-8.017
Token That
Token ID1320
Feature activation+5.44e+02

Pos logit contributions
 and+4.436
,+4.338
 the+4.165
.+4.132
 a+3.962
Neg logit contributions
lor-5.011
umm-4.953
ocol-4.848
osity-4.807
arians-4.798
Token no
Token ID645
Feature activation+5.44e+02

Pos logit contributions
,+2.747
 and+2.700
.+2.536
 the+2.524
 a+2.278
Neg logit contributions
lor-6.446
ocol-6.300
umm-6.292
arians-6.289
rol-6.286
Token family
Token ID1641
Feature activation+5.44e+02

Pos logit contributions
 and+4.466
,+4.365
 the+4.223
.+4.115
 a+3.988
Neg logit contributions
umm-4.846
ocol-4.805
lor-4.760
 former-4.746
arians-4.714
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.361
,+2.306
 the+2.204
.+2.050
 a+2.045
Neg logit contributions
lor-6.823
ocol-6.659
arians-6.657
umm-6.650
osity-6.600
Token Lily
Token ID20037
Feature activation+5.44e+02

Pos logit contributions
 and+3.231
,+3.192
 the+3.004
.+2.941
 a+2.781
Neg logit contributions
lor-6.162
umm-6.086
ocol-6.009
arians-5.960
 former-5.944
Token felt
Token ID2936
Feature activation+5.44e+02

Pos logit contributions
 and+3.324
,+3.321
 the+3.223
.+3.139
 a+3.019
Neg logit contributions
lor-5.878
arians-5.735
umm-5.604
rol-5.586
 former-5.580
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.273
,+4.176
.+3.961
 the+3.948
 a+3.699
Neg logit contributions
lor-4.980
ocol-4.949
umm-4.874
 former-4.840
rol-4.786
Token for
Token ID329
Feature activation+5.44e+02

Pos logit contributions
 and+1.840
,+1.800
 the+1.750
.+1.578
 a+1.522
Neg logit contributions
lor-7.153
umm-7.109
osity-7.036
ocol-6.997
 former-6.975
Token him
Token ID683
Feature activation+5.44e+02

Pos logit contributions
 and+3.097
,+2.973
.+2.746
 the+2.672
 a+2.546
Neg logit contributions
ocol-6.182
arians-6.114
lor-6.044
umm-5.978
osity-5.929
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.010
 the+0.968
,+0.967
.+0.728
 a+0.719
Neg logit contributions
lor-8.046
arians-7.886
umm-7.867
ides-7.848
 former-7.830
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+2.880
,+2.813
 the+2.647
.+2.579
 a+2.442
Neg logit contributions
lor-6.490
umm-6.398
arians-6.281
osity-6.263
ocol-6.259
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.451
,+1.404
 the+1.198
.+1.150
 a+1.011
Neg logit contributions
lor-7.910
ocol-7.684
arians-7.636
umm-7.635
rol-7.626
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.384
,+1.174
 the+1.126
.+1.083
 a+0.923
Neg logit contributions
lor-7.944
ocol-7.828
umm-7.737
osity-7.717
arians-7.686
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+2.433
,+2.374
 the+2.277
.+2.191
 a+2.071
Neg logit contributions
lor-6.729
osity-6.596
arians-6.584
ocol-6.554
umm-6.519
TokenHello
Token ID15496
Feature activation+5.44e+02

Pos logit contributions
 and+5.184
,+5.152
 the+4.937
.+4.910
 a+4.737
Neg logit contributions
lor-4.333
umm-4.188
ocol-4.121
arians-4.088
rol-4.032
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.644
,+0.464
 the+0.389
.+0.305
 a+0.204
Neg logit contributions
lor-8.657
umm-8.567
ocol-8.541
arians-8.474
rol-8.363
Token m
Token ID285
Feature activation+5.44e+02

Pos logit contributions
 and+4.762
,+4.687
.+4.542
 the+4.469
 a+4.296
Neg logit contributions
lor-4.558
ocol-4.482
umm-4.447
arians-4.365
osity-4.359
Tokenister
Token ID1694
Feature activation+5.44e+02

Pos logit contributions
 and+7.297
,+7.182
 the+7.096
.+6.988
 a+6.900
Neg logit contributions
lor-2.207
umm-2.041
ocol-2.013
arians-1.978
rol-1.927
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.630
,+1.513
 the+1.440
 a+1.284
.+1.266
Neg logit contributions
lor-7.614
umm-7.492
arians-7.483
ocol-7.475
osity-7.382
Token Do
Token ID2141
Feature activation+5.44e+02

Pos logit contributions
 and+5.202
,+5.102
 the+4.925
.+4.915
 a+4.715
Neg logit contributions
lor-4.222
umm-4.136
ocol-4.114
osity-4.028
arians-4.018
Token you
Token ID345
Feature activation+5.44e+02

Pos logit contributions
 and+3.242
,+3.112
.+2.931
 the+2.929
 a+2.721
Neg logit contributions
lor-6.164
ocol-5.991
umm-5.977
arians-5.943
osity-5.875
Token want
Token ID765
Feature activation+5.44e+02

Pos logit contributions
 and+4.161
,+4.095
 the+3.969
.+3.874
 a+3.724
Neg logit contributions
lor-5.117
ocol-5.084
umm-5.036
ides-4.970
rol-4.953
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+1.446
,+1.366
 the+1.154
.+1.147
 a+0.919
Neg logit contributions
lor-7.884
umm-7.740
ocol-7.738
arians-7.681
rol-7.642
TokenOh
Token ID5812
Feature activation+5.44e+02

Pos logit contributions
 and+5.677
,+5.653
 the+5.427
.+5.410
 a+5.210
Neg logit contributions
lor-3.773
umm-3.648
ocol-3.609
rol-3.550
arians-3.537
Token!
Token ID0
Feature activation+5.44e+02

Pos logit contributions
 and+4.099
,+3.866
 the+3.787
.+3.766
 a+3.529
Neg logit contributions
lor-5.310
ocol-5.181
umm-5.169
rol-5.073
arians-5.032
Token I
Token ID314
Feature activation+5.44e+02

Pos logit contributions
 and+5.331
,+5.218
 the+5.069
.+5.054
 a+4.844
Neg logit contributions
lor-4.126
umm-3.989
ocol-3.955
osity-3.901
arians-3.901
Token'm
Token ID1101
Feature activation+5.44e+02

Pos logit contributions
 and+4.885
,+4.811
 the+4.735
.+4.648
 a+4.476
Neg logit contributions
lor-4.446
ocol-4.295
rol-4.247
umm-4.223
 former-4.201
Token so
Token ID523
Feature activation+5.44e+02

Pos logit contributions
 and+4.143
,+4.058
.+3.868
 the+3.844
 a+3.602
Neg logit contributions
lor-5.248
ocol-5.151
umm-5.062
rol-5.049
arians-4.999
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.326
,+4.183
 the+4.049
.+4.013
 a+3.824
Neg logit contributions
lor-5.032
ocol-5.006
umm-4.964
rol-4.880
 former-4.862
Token,"
Token ID553
Feature activation+5.44e+02

Pos logit contributions
 and+2.946
,+2.834
 the+2.715
.+2.631
 a+2.502
Neg logit contributions
lor-6.337
umm-6.282
ocol-6.220
arians-6.172
rol-6.119
Token he
Token ID339
Feature activation+5.44e+02

Pos logit contributions
 and+2.416
,+2.319
.+2.107
 the+2.064
 a+1.921
Neg logit contributions
lor-6.762
umm-6.707
ocol-6.697
rol-6.573
arians-6.561
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
 and+2.912
,+2.824
 the+2.658
.+2.632
 a+2.432
Neg logit contributions
lor-6.392
arians-6.215
umm-6.163
rol-6.161
ocol-6.147
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.070
,+0.930
 the+0.825
.+0.644
 a+0.611
Neg logit contributions
lor-8.117
umm-8.007
ocol-8.002
rol-7.920
arians-7.902
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+3.179
,+3.091
 the+2.897
.+2.881
 a+2.715
Neg logit contributions
lor-6.059
umm-5.969
ocol-5.899
rol-5.868
arians-5.857
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+3.788
,+3.752
 the+3.538
.+3.514
 a+3.337
Neg logit contributions
lor-5.674
arians-5.442
umm-5.438
rol-5.368
ocol-5.353
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+3.130
,+3.097
 the+2.877
.+2.858
 a+2.667
Neg logit contributions
lor-6.304
ocol-6.044
rol-6.027
umm-6.022
arians-6.020
TokenThe
Token ID464
Feature activation+5.44e+02

Pos logit contributions
 and+6.044
,+6.008
 the+5.811
.+5.789
 a+5.595
Neg logit contributions
lor-3.408
umm-3.207
arians-3.176
ocol-3.169
rol-3.134
Token two
Token ID734
Feature activation+5.44e+02

Pos logit contributions
 and+5.002
,+4.954
 the+4.762
.+4.731
 a+4.512
Neg logit contributions
lor-4.357
umm-4.330
arians-4.255
 former-4.238
ocol-4.231
Token cr
Token ID1067
Feature activation+5.44e+02

Pos logit contributions
 and+4.125
,+4.057
 the+3.909
.+3.904
 a+3.746
Neg logit contributions
lor-5.141
umm-5.093
arians-5.067
ocol-5.059
rol-4.981
Tokenanes
Token ID7305
Feature activation+5.44e+02

Pos logit contributions
 and+7.764
,+7.716
 the+7.571
.+7.509
 a+7.344
Neg logit contributions
lor-1.673
umm-1.564
ocol-1.520
rol-1.464
arians-1.454
Token were
Token ID547
Feature activation+5.44e+02

Pos logit contributions
 and+4.305
,+4.246
 the+4.110
.+4.074
 a+3.955
Neg logit contributions
lor-4.951
ocol-4.795
arians-4.790
rol-4.774
umm-4.771
Token always
Token ID1464
Feature activation+5.44e+02

Pos logit contributions
 and+4.370
,+4.282
.+4.117
 the+4.059
 a+3.888
Neg logit contributions
lor-4.891
rol-4.784
umm-4.773
arians-4.759
ocol-4.756
Token loyal
Token ID9112
Feature activation+5.44e+02

Pos logit contributions
 and+3.957
,+3.870
.+3.688
 the+3.657
 a+3.450
Neg logit contributions
lor-5.249
umm-5.192
ocol-5.179
arians-5.162
rol-5.141
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+2.892
,+2.839
 the+2.771
.+2.649
 a+2.548
Neg logit contributions
lor-6.306
umm-6.270
arians-6.209
ocol-6.150
osity-6.116
Token each
Token ID1123
Feature activation+5.44e+02

Pos logit contributions
 and+5.587
,+5.500
.+5.272
 the+5.217
 a+5.057
Neg logit contributions
lor-3.793
arians-3.756
ocol-3.614
umm-3.596
 former-3.594
Token."
Token ID526
Feature activation+5.44e+02

Pos logit contributions
 and+2.219
,+2.199
 the+2.126
.+1.939
 a+1.918
Neg logit contributions
lor-6.937
umm-6.775
arians-6.758
 former-6.726
rol-6.707
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+2.265
,+2.209
 the+2.029
.+1.980
 a+1.849
Neg logit contributions
lor-6.982
umm-6.922
ocol-6.831
arians-6.794
rol-6.758
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.745
,+1.702
 the+1.481
.+1.444
 a+1.297
Neg logit contributions
lor-7.591
ocol-7.376
umm-7.363
arians-7.306
rol-7.296
TokenTom
Token ID13787
Feature activation+5.44e+02

Pos logit contributions
 and+4.569
,+4.523
 the+4.335
.+4.294
 a+4.138
Neg logit contributions
lor-4.837
umm-4.695
ocol-4.660
arians-4.630
rol-4.571
Token felt
Token ID2936
Feature activation+5.44e+02

Pos logit contributions
,+3.908
 and+3.885
 the+3.806
.+3.730
 a+3.563
Neg logit contributions
lor-5.312
umm-5.121
arians-5.117
ides-5.034
rol-5.009
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.654
,+4.574
 the+4.338
.+4.332
 a+4.080
Neg logit contributions
lor-4.562
ocol-4.523
umm-4.443
 former-4.384
rol-4.346
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.952
,+0.929
 the+0.878
 a+0.630
.+0.616
Neg logit contributions
lor-8.091
umm-8.081
ocol-7.980
ides-7.954
osity-7.940
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+3.396
,+3.352
 the+3.185
.+3.086
 a+2.947
Neg logit contributions
umm-5.974
lor-5.915
ocol-5.779
arians-5.751
 former-5.738
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
,+3.132
 and+3.127
 the+2.972
.+2.899
 a+2.713
Neg logit contributions
lor-6.149
umm-5.970
arians-5.935
ocol-5.897
 former-5.883
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.921
,+1.716
 the+1.662
.+1.602
 a+1.469
Neg logit contributions
lor-7.383
ocol-7.297
umm-7.266
osity-7.186
arians-7.155
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+2.435
,+2.388
 the+2.246
.+2.209
 a+2.067
Neg logit contributions
lor-6.746
arians-6.600
umm-6.590
osity-6.590
ocol-6.560
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
,+3.132
 and+3.127
 the+2.972
.+2.899
 a+2.713
Neg logit contributions
lor-6.149
umm-5.970
arians-5.935
ocol-5.897
 former-5.883
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.921
,+1.716
 the+1.662
.+1.602
 a+1.469
Neg logit contributions
lor-7.383
ocol-7.297
umm-7.266
osity-7.186
arians-7.155
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+2.435
,+2.388
 the+2.246
.+2.209
 a+2.067
Neg logit contributions
lor-6.746
arians-6.600
umm-6.590
osity-6.590
ocol-6.560
TokenI
Token ID40
Feature activation+5.44e+02

Pos logit contributions
 and+5.238
,+5.224
 the+5.029
.+4.949
 a+4.818
Neg logit contributions
lor-4.210
umm-4.193
arians-4.014
ocol-4.003
 former-3.937
Token'm
Token ID1101
Feature activation+5.44e+02

Pos logit contributions
 and+4.987
,+4.895
 the+4.859
.+4.679
 a+4.580
Neg logit contributions
lor-4.268
umm-4.124
 former-4.121
ocol-4.091
ides-4.064
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+5.160
,+5.040
.+4.862
 the+4.828
 a+4.604
Neg logit contributions
lor-4.219
ocol-4.155
umm-4.110
rol-4.022
arians-4.017
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.559
,+0.406
 the+0.346
.+0.219
 a+0.150
Neg logit contributions
lor-8.685
umm-8.641
ocol-8.590
arians-8.526
osity-8.486
Token mom
Token ID1995
Feature activation+5.44e+02

Pos logit contributions
 and+2.696
,+2.653
.+2.465
 the+2.435
 a+2.247
Neg logit contributions
umm-6.524
lor-6.508
ocol-6.461
arians-6.392
 former-6.329
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.540
,+0.420
 the+0.391
 a+0.208
.+0.140
Neg logit contributions
lor-8.569
umm-8.547
ocol-8.457
arians-8.439
osity-8.364
Token I
Token ID314
Feature activation+5.44e+02

Pos logit contributions
 and+4.153
,+4.031
 the+3.867
.+3.836
 a+3.658
Neg logit contributions
lor-5.273
umm-5.254
ocol-5.145
arians-5.072
osity-5.059
Token will
Token ID481
Feature activation+5.44e+02

Pos logit contributions
 and+3.903
,+3.867
 the+3.779
.+3.627
 a+3.498
Neg logit contributions
lor-5.363
umm-5.234
 former-5.207
ocol-5.165
ides-5.148
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+0.776
,+0.775
.+0.453
 the+0.440
 a+0.086
Neg logit contributions
lor-8.450
ocol-8.339
umm-8.323
 former-8.321
arians-8.258
Token moment
Token ID2589
Feature activation+5.44e+02

Pos logit contributions
 and+4.754
,+4.698
 the+4.584
.+4.489
 a+4.311
Neg logit contributions
lor-4.560
umm-4.418
ocol-4.379
arians-4.337
 former-4.319
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.386
,+0.374
 the+0.352
 a+0.124
.+0.072
Neg logit contributions
lor-8.614
ocol-8.481
umm-8.462
ides-8.390
arians-8.366
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+3.415
,+3.395
 the+3.238
.+3.119
 a+2.951
Neg logit contributions
umm-5.862
lor-5.843
 former-5.683
ocol-5.681
arians-5.649
Token felt
Token ID2936
Feature activation+5.44e+02

Pos logit contributions
,+3.861
 and+3.823
 the+3.688
.+3.639
 a+3.413
Neg logit contributions
lor-5.415
arians-5.242
umm-5.223
 former-5.196
ocol-5.163
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.421
,+4.358
.+4.103
 the+4.100
 a+3.826
Neg logit contributions
ocol-4.823
lor-4.819
umm-4.727
 former-4.670
rol-4.631
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.807
,+1.788
 the+1.740
.+1.539
 a+1.511
Neg logit contributions
lor-7.186
umm-7.157
ocol-7.098
osity-7.067
ides-7.024
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+3.363
,+3.328
 the+3.150
.+3.050
 a+2.861
Neg logit contributions
lor-5.926
umm-5.896
ocol-5.759
 former-5.723
arians-5.697
Token liked
Token ID8288
Feature activation+5.44e+02

Pos logit contributions
,+3.857
 and+3.824
 the+3.685
.+3.622
 a+3.422
Neg logit contributions
lor-5.432
arians-5.236
umm-5.228
 former-5.192
ocol-5.178
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+3.594
,+3.583
.+3.324
 the+3.242
 a+3.089
Neg logit contributions
ocol-5.639
lor-5.616
umm-5.547
arians-5.482
 former-5.474
Token truck
Token ID7779
Feature activation+5.44e+02

Pos logit contributions
 and+4.575
,+4.525
 the+4.369
.+4.296
 a+4.133
Neg logit contributions
lor-4.662
ocol-4.645
umm-4.643
arians-4.595
 former-4.556
Token very
Token ID845
Feature activation+5.44e+02

Pos logit contributions
 and+2.797
,+2.713
.+2.490
 the+2.486
 a+2.243
Neg logit contributions
lor-6.578
umm-6.459
ocol-6.435
arians-6.394
rol-6.332
Token kind
Token ID1611
Feature activation+5.44e+02

Pos logit contributions
 and+3.339
,+3.204
 the+3.064
.+2.984
 a+2.788
Neg logit contributions
lor-6.092
ocol-5.975
umm-5.952
arians-5.851
osity-5.837
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.034
,+1.000
 the+0.921
.+0.731
 a+0.678
Neg logit contributions
lor-8.159
umm-8.092
ocol-8.024
arians-7.982
rol-7.925
Token I
Token ID314
Feature activation+5.44e+02

Pos logit contributions
 and+5.009
,+4.932
 the+4.760
.+4.715
 a+4.552
Neg logit contributions
lor-4.383
umm-4.329
ocol-4.307
arians-4.201
osity-4.175
Token'm
Token ID1101
Feature activation+5.44e+02

Pos logit contributions
 and+4.463
,+4.433
 the+4.331
.+4.177
 a+4.046
Neg logit contributions
lor-4.807
 former-4.663
umm-4.643
ocol-4.638
arians-4.604
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+5.118
,+5.029
.+4.818
 the+4.778
 a+4.540
Neg logit contributions
lor-4.245
ocol-4.162
umm-4.063
rol-4.052
arians-4.040
Token I
Token ID314
Feature activation+5.44e+02

Pos logit contributions
 and+2.671
,+2.568
 the+2.459
.+2.333
 a+2.257
Neg logit contributions
lor-6.527
umm-6.484
ocol-6.467
arians-6.396
osity-6.339
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+4.043
,+4.018
 the+3.871
.+3.730
 a+3.615
Neg logit contributions
lor-5.239
rol-5.080
ocol-5.063
 former-5.051
arians-5.049
Token selfish
Token ID20363
Feature activation+5.44e+02

Pos logit contributions
 and+3.949
,+3.905
 the+3.649
.+3.628
 a+3.373
Neg logit contributions
lor-5.286
ocol-5.252
umm-5.122
 former-5.102
arians-5.100
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.077
,+1.033
 the+0.979
.+0.763
 a+0.726
Neg logit contributions
lor-8.081
umm-8.049
ocol-7.968
arians-7.948
ides-7.887
Token You
Token ID921
Feature activation+5.44e+02

Pos logit contributions
 and+4.856
,+4.784
 the+4.602
.+4.563
 a+4.377
Neg logit contributions
lor-4.531
umm-4.490
ocol-4.445
arians-4.359
osity-4.338
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+3.070
,+3.010
.+2.786
 the+2.716
 a+2.519
Neg logit contributions
ocol-6.198
lor-6.193
umm-6.058
arians-5.992
osity-5.987
Token crying
Token ID13774
Feature activation+5.44e+02

Pos logit contributions
,+3.762
 and+3.756
 the+3.609
.+3.497
 a+3.409
Neg logit contributions
lor-5.424
ocol-5.253
arians-5.208
rol-5.202
umm-5.200
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
,+0.274
 and+0.217
 the+0.213
 a-0.021
.-0.060
Neg logit contributions
lor-8.711
umm-8.683
ocol-8.607
ides-8.599
arians-8.584
Token She
Token ID1375
Feature activation+5.44e+02

Pos logit contributions
 and+3.202
,+3.142
 the+2.990
.+2.936
 a+2.805
Neg logit contributions
lor-6.010
umm-5.957
ocol-5.887
osity-5.837
arians-5.804
Token felt
Token ID2936
Feature activation+5.44e+02

Pos logit contributions
,+3.440
 and+3.436
 the+3.229
.+3.211
 a+3.010
Neg logit contributions
lor-5.822
ocol-5.654
arians-5.619
umm-5.599
rol-5.560
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.837
,+4.770
 the+4.530
.+4.518
 a+4.277
Neg logit contributions
lor-4.424
ocol-4.413
umm-4.313
rol-4.230
 former-4.196
Token for
Token ID329
Feature activation+5.44e+02

Pos logit contributions
 and+1.631
,+1.613
 the+1.536
.+1.340
 a+1.301
Neg logit contributions
lor-7.412
umm-7.348
ocol-7.322
osity-7.262
ides-7.238
Token him
Token ID683
Feature activation+5.44e+02

Pos logit contributions
 and+3.104
,+3.002
.+2.763
 the+2.741
 a+2.599
Neg logit contributions
ocol-6.160
arians-6.038
lor-6.035
umm-5.971
osity-5.901
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.556
,+0.529
 the+0.492
 a+0.246
.+0.228
Neg logit contributions
lor-8.518
arians-8.361
umm-8.356
ides-8.351
ocol-8.301
Token She
Token ID1375
Feature activation+5.44e+02

Pos logit contributions
 and+3.182
,+3.135
 the+2.998
.+2.909
 a+2.770
Neg logit contributions
lor-6.046
umm-5.967
arians-5.874
ocol-5.852
osity-5.840
Token ran
Token ID4966
Feature activation+5.44e+02

Pos logit contributions
,+3.869
 and+3.864
 the+3.684
.+3.633
 a+3.440
Neg logit contributions
lor-5.414
umm-5.219
ocol-5.216
arians-5.211
rol-5.145
Token feel
Token ID1254
Feature activation+5.44e+02

Pos logit contributions
 and+3.576
,+3.529
 the+3.345
.+3.289
 a+3.088
Neg logit contributions
lor-5.697
ocol-5.571
umm-5.553
 former-5.542
rol-5.500
Token better
Token ID1365
Feature activation+5.44e+02

Pos logit contributions
 and+3.031
,+2.971
 the+2.774
.+2.715
 a+2.546
Neg logit contributions
lor-6.241
ocol-6.146
umm-6.123
 former-6.046
rol-6.025
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.450
,+2.402
 the+2.371
 a+2.126
.+2.099
Neg logit contributions
lor-6.712
umm-6.609
arians-6.547
 former-6.539
ocol-6.538
Token I
Token ID314
Feature activation+5.44e+02

Pos logit contributions
 and+5.230
,+5.176
 the+5.005
.+4.964
 a+4.766
Neg logit contributions
lor-4.185
umm-4.064
ocol-4.045
arians-4.007
 former-3.924
Token'm
Token ID1101
Feature activation+5.44e+02

Pos logit contributions
 and+4.074
,+4.050
 the+3.937
.+3.815
 a+3.674
Neg logit contributions
lor-5.203
ocol-5.075
 former-5.065
umm-5.062
arians-5.005
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+5.024
,+4.944
.+4.727
 the+4.718
 a+4.472
Neg logit contributions
lor-4.317
ocol-4.252
umm-4.143
rol-4.137
arians-4.116
Token you
Token ID345
Feature activation+5.44e+02

Pos logit contributions
 and+2.668
,+2.554
 the+2.466
.+2.343
 a+2.271
Neg logit contributions
lor-6.530
umm-6.485
ocol-6.448
arians-6.414
osity-6.321
Token hurt
Token ID5938
Feature activation+5.44e+02

Pos logit contributions
 and+4.008
,+3.947
 the+3.840
.+3.700
 a+3.575
Neg logit contributions
lor-5.202
arians-5.071
rol-5.055
ides-5.046
ocol-5.044
Token your
Token ID534
Feature activation+5.44e+02

Pos logit contributions
 and+3.738
,+3.680
 the+3.505
.+3.412
 a+3.324
Neg logit contributions
lor-5.440
ocol-5.358
umm-5.336
arians-5.297
rol-5.228
Token knee
Token ID10329
Feature activation+5.44e+02

Pos logit contributions
 and+4.309
,+4.265
 the+4.131
.+4.020
 a+3.897
Neg logit contributions
lor-5.045
umm-4.944
ocol-4.893
 former-4.859
arians-4.859
Token,"
Token ID553
Feature activation+5.44e+02

Pos logit contributions
 and+1.792
 the+1.780
,+1.762
 a+1.503
.+1.452
Neg logit contributions
lor-7.298
arians-7.144
umm-7.132
 former-7.115
ides-7.095
Tokenmed
Token ID1150
Feature activation+5.44e+02

Pos logit contributions
 and+7.135
,+7.079
 the+6.911
.+6.815
 a+6.716
Neg logit contributions
lor-2.370
ocol-2.225
umm-2.175
arians-2.149
rol-2.139
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.197
,+1.149
 the+1.051
.+0.890
 a+0.823
Neg logit contributions
lor-7.956
umm-7.819
rol-7.812
arians-7.784
ocol-7.776
Token From
Token ID3574
Feature activation+5.44e+02

Pos logit contributions
 and+3.711
,+3.647
 the+3.420
.+3.419
 a+3.216
Neg logit contributions
lor-5.758
umm-5.654
ocol-5.548
arians-5.537
rol-5.502
Token that
Token ID326
Feature activation+5.44e+02

Pos logit contributions
 and+2.537
,+2.472
.+2.303
 the+2.241
 a+2.066
Neg logit contributions
lor-6.771
umm-6.607
ocol-6.600
arians-6.591
 former-6.510
Token day
Token ID1110
Feature activation+5.44e+02

Pos logit contributions
 and+2.974
,+2.853
.+2.759
 the+2.738
 a+2.560
Neg logit contributions
lor-6.347
umm-6.259
ocol-6.129
arians-6.102
rol-6.098
Token onwards
Token ID31719
Feature activation+5.44e+02

Pos logit contributions
 and+4.050
,+3.921
 the+3.804
.+3.778
 a+3.607
Neg logit contributions
lor-5.303
umm-5.124
ocol-5.114
rol-5.063
arians-5.061
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.994
,+0.808
.+0.743
 the+0.675
 a+0.562
Neg logit contributions
lor-8.319
ocol-8.200
umm-8.194
arians-8.147
osity-8.017
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.066
,+1.982
.+1.822
 the+1.649
 a+1.560
Neg logit contributions
umm-7.163
lor-7.138
ocol-7.109
arians-7.074
 former-7.005
Token grape
Token ID30777
Feature activation+5.44e+02

Pos logit contributions
 and+4.937
,+4.874
 the+4.697
.+4.695
 a+4.416
Neg logit contributions
umm-4.514
lor-4.400
arians-4.303
ocol-4.300
 former-4.266
Token learned
Token ID4499
Feature activation+5.44e+02

Pos logit contributions
,+3.789
 and+3.788
 the+3.651
.+3.644
 a+3.485
Neg logit contributions
lor-5.485
umm-5.310
arians-5.278
rol-5.210
ocol-5.184
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+1.546
,+1.541
.+1.304
 the+1.304
 a+1.048
Neg logit contributions
lor-7.766
 former-7.532
arians-7.477
umm-7.465
osity-7.454
Token"
Token ID1
Feature activation+5.44e+02

Pos logit contributions
 and+3.946
,+3.891
 the+3.747
.+3.665
 a+3.529
Neg logit contributions
lor-5.473
umm-5.317
ocol-5.273
arians-5.232
 former-5.174
TokenThen
Token ID6423
Feature activation+5.44e+02

Pos logit contributions
 and+5.531
,+5.519
 the+5.302
.+5.268
 a+5.073
Neg logit contributions
lor-3.923
umm-3.828
arians-3.763
ocol-3.742
 former-3.667
Token you
Token ID345
Feature activation+5.44e+02

Pos logit contributions
 and+3.313
,+3.122
 the+2.977
.+2.973
 a+2.787
Neg logit contributions
lor-5.982
ocol-5.882
umm-5.857
arians-5.809
osity-5.704
Token should
Token ID815
Feature activation+5.44e+02

Pos logit contributions
 and+3.902
,+3.837
 the+3.694
.+3.645
 a+3.446
Neg logit contributions
lor-5.378
ocol-5.297
arians-5.264
umm-5.248
ides-5.206
Token say
Token ID910
Feature activation+5.44e+02

Pos logit contributions
 and+3.732
,+3.611
 the+3.494
.+3.403
 a+3.274
Neg logit contributions
lor-5.635
ocol-5.488
umm-5.458
arians-5.441
 former-5.414
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.973
,+4.827
 the+4.675
.+4.621
 a+4.458
Neg logit contributions
lor-4.310
ocol-4.255
umm-4.147
arians-4.146
 former-4.063
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+1.439
,+1.364
 the+1.308
.+1.155
 a+1.091
Neg logit contributions
lor-7.725
umm-7.648
ocol-7.618
arians-7.553
ides-7.529
Token Sara
Token ID24799
Feature activation+5.44e+02

Pos logit contributions
 and+3.321
,+3.273
.+3.008
 the+2.929
 a+2.799
Neg logit contributions
lor-5.987
ocol-5.966
arians-5.881
umm-5.859
 former-5.809
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+1.087
,+1.005
 the+0.942
.+0.770
 a+0.737
Neg logit contributions
lor-8.085
arians-7.922
ocol-7.918
umm-7.861
ides-7.850
Token give
Token ID1577
Feature activation+5.44e+02

Pos logit contributions
 and+3.469
,+3.395
.+3.170
 the+3.110
 a+2.959
Neg logit contributions
ocol-5.748
umm-5.746
lor-5.743
 former-5.672
osity-5.642
Token her
Token ID607
Feature activation+5.44e+02

Pos logit contributions
 and+2.469
,+2.416
.+2.149
 the+2.124
 a+1.949
Neg logit contributions
lor-6.906
ocol-6.770
umm-6.710
rol-6.650
arians-6.644
Token reply
Token ID10971
Feature activation+5.44e+02

Pos logit contributions
 and+4.185
,+4.073
.+3.857
 the+3.839
 a+3.671
Neg logit contributions
lor-5.059
ocol-5.009
rol-4.967
umm-4.959
arians-4.953
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+3.254
,+3.112
 the+3.039
.+2.880
 a+2.825
Neg logit contributions
lor-6.067
umm-5.920
arians-5.873
osity-5.873
ocol-5.865
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+3.110
,+3.110
 the+2.928
.+2.918
 a+2.743
Neg logit contributions
lor-6.101
umm-5.967
arians-5.966
ocol-5.916
rol-5.914
TokenI
Token ID40
Feature activation+5.44e+02

Pos logit contributions
,+5.788
 and+5.783
 the+5.552
.+5.520
 a+5.327
Neg logit contributions
lor-3.696
umm-3.594
ocol-3.512
arians-3.494
rol-3.480
Token'm
Token ID1101
Feature activation+5.44e+02

Pos logit contributions
 and+4.640
,+4.585
 the+4.496
.+4.365
 a+4.232
Neg logit contributions
lor-4.691
ocol-4.544
umm-4.486
 former-4.479
rol-4.475
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.027
,+3.944
.+3.729
 the+3.699
 a+3.455
Neg logit contributions
lor-5.295
ocol-5.282
umm-5.148
rol-5.144
arians-5.131
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+2.837
,+2.733
 the+2.625
.+2.544
 a+2.411
Neg logit contributions
lor-6.492
umm-6.421
ocol-6.337
arians-6.313
rol-6.236
Token I
Token ID314
Feature activation+5.44e+02

Pos logit contributions
 and+4.291
,+4.281
.+4.101
 the+4.065
 a+3.879
Neg logit contributions
lor-4.996
umm-4.906
ocol-4.880
arians-4.824
rol-4.736
Token didn
Token ID1422
Feature activation+5.44e+02

Pos logit contributions
 and+4.367
,+4.311
 the+4.230
.+4.092
 a+3.975
Neg logit contributions
lor-4.959
ocol-4.769
umm-4.763
 former-4.750
rol-4.738
Token't
Token ID470
Feature activation+5.44e+02

Pos logit contributions
 and+4.164
,+4.084
 the+3.894
.+3.861
 a+3.671
Neg logit contributions
lor-5.232
umm-5.060
ocol-5.030
arians-5.018
rol-4.941
Token know
Token ID760
Feature activation+5.44e+02

Pos logit contributions
 and+4.260
,+4.162
 the+4.025
.+3.925
 a+3.791
Neg logit contributions
lor-5.011
ocol-4.935
umm-4.930
arians-4.865
ides-4.862
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.021
,+0.935
 the+0.827
.+0.631
 a+0.601
Neg logit contributions
lor-8.342
umm-8.241
ocol-8.199
arians-8.187
ides-8.104
Token Lee
Token ID5741
Feature activation+5.44e+02

Pos logit contributions
 and+6.414
,+6.344
 the+6.231
.+6.125
 a+5.998
Neg logit contributions
lor-3.049
umm-2.969
ocol-2.855
arians-2.813
 former-2.763
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.891
,+0.823
 the+0.729
.+0.554
 a+0.522
Neg logit contributions
lor-8.336
arians-8.180
umm-8.178
ocol-8.163
ides-8.070
Token We
Token ID775
Feature activation+5.44e+02

Pos logit contributions
 and+5.197
,+5.124
 the+4.968
.+4.908
 a+4.751
Neg logit contributions
lor-4.244
umm-4.165
ocol-4.084
arians-4.033
rol-3.995
Token are
Token ID389
Feature activation+5.44e+02

Pos logit contributions
 and+3.939
,+3.882
 the+3.749
.+3.663
 a+3.506
Neg logit contributions
lor-5.401
umm-5.266
ocol-5.258
arians-5.210
rol-5.187
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.297
,+4.200
 the+3.992
.+3.982
 a+3.768
Neg logit contributions
lor-5.085
ocol-5.005
umm-4.979
arians-4.907
rol-4.885
Token for
Token ID329
Feature activation+5.44e+02

Pos logit contributions
 and+2.095
,+2.015
 the+1.919
.+1.777
 a+1.716
Neg logit contributions
lor-7.105
umm-7.071
ocol-6.988
arians-6.980
osity-6.911
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+4.397
,+4.305
.+4.068
 the+4.004
 a+3.833
Neg logit contributions
ocol-4.962
lor-4.936
umm-4.863
arians-4.863
 former-4.778
Token mess
Token ID2085
Feature activation+5.44e+02

Pos logit contributions
 and+4.741
,+4.692
 the+4.519
.+4.456
 a+4.273
Neg logit contributions
lor-4.620
umm-4.591
ocol-4.523
arians-4.413
 former-4.398
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.452
,+1.416
 the+1.358
 a+1.176
.+1.107
Neg logit contributions
lor-7.656
umm-7.577
arians-7.517
ocol-7.477
ides-7.450
Token We
Token ID775
Feature activation+5.44e+02

Pos logit contributions
 and+5.382
,+5.325
 the+5.148
.+5.100
 a+4.919
Neg logit contributions
lor-4.037
umm-3.978
ocol-3.880
arians-3.858
osity-3.790
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
 and+3.375
,+3.246
 the+3.097
.+3.083
 a+2.885
Neg logit contributions
lor-5.806
umm-5.711
ocol-5.673
arians-5.643
osity-5.637
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.028
,+0.775
 the+0.742
.+0.658
 a+0.513
Neg logit contributions
lor-8.296
ocol-8.156
umm-8.145
osity-8.094
arians-8.050
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+2.433
,+2.360
 the+2.228
.+2.165
 a+2.011
Neg logit contributions
lor-6.781
umm-6.638
arians-6.630
osity-6.609
ocol-6.561
TokenI
Token ID40
Feature activation+5.44e+02

Pos logit contributions
,+5.570
 and+5.550
 the+5.350
.+5.282
 a+5.118
Neg logit contributions
lor-3.891
umm-3.824
arians-3.701
 former-3.665
osity-3.662
Token'm
Token ID1101
Feature activation+5.44e+02

Pos logit contributions
 and+4.567
,+4.491
 the+4.437
.+4.311
 a+4.196
Neg logit contributions
lor-4.689
umm-4.582
ocol-4.580
 former-4.578
arians-4.552
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+5.047
,+4.944
.+4.763
 the+4.724
 a+4.497
Neg logit contributions
ocol-4.267
lor-4.240
umm-4.162
arians-4.145
rol-4.105
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.820
,+1.698
 the+1.615
.+1.510
 a+1.409
Neg logit contributions
lor-7.391
umm-7.391
ocol-7.295
arians-7.259
osity-7.251
Token I
Token ID314
Feature activation+5.44e+02

Pos logit contributions
 and+4.747
,+4.664
 the+4.498
.+4.480
 a+4.283
Neg logit contributions
lor-4.639
umm-4.622
arians-4.519
osity-4.518
ocol-4.465
Token won
Token ID1839
Feature activation+5.44e+02

Pos logit contributions
 and+4.323
,+4.283
 the+4.197
.+4.074
 a+3.959
Neg logit contributions
lor-4.931
umm-4.816
 former-4.813
ocol-4.780
rol-4.760
Token't
Token ID470
Feature activation+5.44e+02

Pos logit contributions
 and+4.164
,+4.087
 the+3.886
.+3.875
 a+3.678
Neg logit contributions
lor-5.133
arians-5.050
umm-4.946
ocol-4.922
osity-4.881
Token be
Token ID307
Feature activation+5.44e+02

Pos logit contributions
 and+4.336
,+4.268
 the+4.152
.+4.034
 a+3.954
Neg logit contributions
lor-4.849
umm-4.791
arians-4.783
ides-4.724
ocol-4.720
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+4.245
,+4.180
 the+4.002
.+3.964
 a+3.791
Neg logit contributions
lor-5.110
umm-5.003
 former-4.932
arians-4.927
ocol-4.888
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+4.668
,+4.639
 the+4.437
.+4.386
 a+4.181
Neg logit contributions
lor-4.650
umm-4.635
ocol-4.598
arians-4.536
 former-4.530
Token then
Token ID788
Feature activation+5.44e+02

Pos logit contributions
 and+3.523
,+3.491
 the+3.361
.+3.333
 a+3.170
Neg logit contributions
lor-5.701
arians-5.532
umm-5.458
rol-5.455
 former-5.422
Token looked
Token ID3114
Feature activation+5.44e+02

Pos logit contributions
 and+4.256
,+4.191
.+4.026
 the+4.004
 a+3.811
Neg logit contributions
lor-4.983
ocol-4.825
umm-4.772
rol-4.770
arians-4.767
Token very
Token ID845
Feature activation+5.44e+02

Pos logit contributions
 and+2.947
,+2.906
 the+2.746
.+2.712
 a+2.543
Neg logit contributions
lor-6.271
ocol-6.210
umm-6.187
rol-6.155
 former-6.095
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.310
,+4.206
 the+4.050
.+3.993
 a+3.851
Neg logit contributions
ocol-4.990
umm-4.929
lor-4.901
 former-4.874
rol-4.809
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+1.194
,+1.165
 the+1.122
.+0.941
 a+0.881
Neg logit contributions
lor-7.902
umm-7.779
osity-7.705
arians-7.682
rol-7.677
Token bowed
Token ID35737
Feature activation+5.44e+02

Pos logit contributions
 and+4.110
,+4.032
.+3.828
 the+3.749
 a+3.625
Neg logit contributions
lor-5.094
umm-5.052
ocol-5.043
rol-5.014
 former-4.991
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+2.551
,+2.469
 the+2.340
.+2.210
 a+2.150
Neg logit contributions
lor-6.599
osity-6.418
rol-6.415
umm-6.404
 former-6.393
Token head
Token ID1182
Feature activation+5.44e+02

Pos logit contributions
 and+4.745
,+4.736
 the+4.598
.+4.464
 a+4.329
Neg logit contributions
lor-4.569
umm-4.503
 former-4.459
arians-4.452
ocol-4.411
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.031
,+1.975
 the+1.924
.+1.675
 a+1.663
Neg logit contributions
lor-7.167
rol-7.028
umm-6.946
ocol-6.928
 former-6.914
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+2.558
,+2.518
 the+2.307
.+2.266
 a+2.127
Neg logit contributions
lor-6.750
umm-6.608
ocol-6.585
arians-6.514
rol-6.498
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.442
,+1.403
 the+1.174
.+1.139
 a+0.997
Neg logit contributions
lor-7.907
ocol-7.711
umm-7.672
rol-7.640
arians-7.638
Token"
Token ID1
Feature activation+5.44e+02

Pos logit contributions
 and+4.479
,+4.425
 the+4.233
.+4.197
 a+4.033
Neg logit contributions
lor-4.939
umm-4.798
ocol-4.789
arians-4.740
rol-4.695
TokenWe
Token ID1135
Feature activation+5.44e+02

Pos logit contributions
 and+5.715
,+5.713
 the+5.468
.+5.456
 a+5.250
Neg logit contributions
lor-3.751
umm-3.649
arians-3.596
ocol-3.579
 former-3.496
Token are
Token ID389
Feature activation+5.44e+02

Pos logit contributions
 and+3.958
,+3.885
 the+3.769
.+3.684
 a+3.528
Neg logit contributions
lor-5.360
ocol-5.239
umm-5.211
arians-5.194
rol-5.163
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.924
,+4.840
 the+4.648
.+4.620
 a+4.429
Neg logit contributions
lor-4.390
ocol-4.343
arians-4.285
umm-4.280
rol-4.216
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.057
,+0.950
 the+0.851
.+0.748
 a+0.648
Neg logit contributions
lor-8.236
umm-8.173
ocol-8.146
arians-8.094
osity-8.014
Token We
Token ID775
Feature activation+5.44e+02

Pos logit contributions
 and+5.387
,+5.306
 the+5.110
.+5.093
 a+4.913
Neg logit contributions
lor-4.025
umm-3.982
ocol-3.891
arians-3.874
osity-3.832
Token did
Token ID750
Feature activation+5.44e+02

Pos logit contributions
 and+3.733
,+3.670
 the+3.537
.+3.453
 a+3.302
Neg logit contributions
lor-5.591
umm-5.457
ocol-5.448
arians-5.417
rol-5.406
Token not
Token ID407
Feature activation+5.44e+02

Pos logit contributions
 and+2.555
,+2.467
 the+2.252
.+2.222
 a+2.024
Neg logit contributions
lor-6.780
ocol-6.631
arians-6.621
umm-6.585
 former-6.584
Token know
Token ID760
Feature activation+5.44e+02

Pos logit contributions
 and+4.056
,+3.965
 the+3.848
.+3.722
 a+3.590
Neg logit contributions
lor-5.187
ocol-5.131
umm-5.112
ides-5.062
arians-5.044
Token on
Token ID319
Feature activation+5.44e+02

Pos logit contributions
 and+1.799
,+1.786
.+1.551
 the+1.466
 a+1.254
Neg logit contributions
lor-7.429
ocol-7.387
rol-7.331
umm-7.314
isher-7.218
Token some
Token ID617
Feature activation+5.44e+02

Pos logit contributions
 and+1.918
,+1.885
.+1.665
 the+1.594
 a+1.396
Neg logit contributions
lor-7.264
ocol-7.201
 former-7.139
umm-7.138
rol-7.066
Token cozy
Token ID37438
Feature activation+5.44e+02

Pos logit contributions
 and+4.619
,+4.578
 the+4.413
.+4.344
 a+4.170
Neg logit contributions
ocol-4.632
lor-4.593
umm-4.540
rol-4.466
arians-4.428
Token p
Token ID279
Feature activation+5.44e+02

Pos logit contributions
 and+4.754
,+4.691
 the+4.585
.+4.506
 a+4.342
Neg logit contributions
lor-4.534
ocol-4.433
umm-4.409
rol-4.298
arians-4.276
Tokenaj
Token ID1228
Feature activation+5.44e+02

Pos logit contributions
 and+7.864
,+7.804
 the+7.708
.+7.587
 a+7.480
Neg logit contributions
lor-1.558
ocol-1.450
umm-1.442
rol-1.379
arians-1.349
Tokenamas
Token ID17485
Feature activation+5.44e+02

Pos logit contributions
 and+8.104
,+8.050
 the+8.004
.+7.847
 a+7.737
Neg logit contributions
lor-1.364
umm-1.285
arians-1.173
ides-1.115
ocol-1.091
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 the+0.651
,+0.617
 and+0.612
 a+0.383
.+0.375
Neg logit contributions
lor-8.514
umm-8.288
rol-8.284
arians-8.234
osity-8.215
Token sn
Token ID3013
Feature activation+5.44e+02

Pos logit contributions
 and+3.854
,+3.832
.+3.627
 the+3.582
 a+3.333
Neg logit contributions
lor-5.361
 former-5.211
umm-5.183
ocol-5.164
rol-5.135
Tokenugg
Token ID6837
Feature activation+5.44e+02

Pos logit contributions
 and+8.012
,+7.932
 the+7.815
.+7.719
 a+7.598
Neg logit contributions
lor-1.487
umm-1.333
ocol-1.316
rol-1.251
 former-1.208
Tokenled
Token ID992
Feature activation+5.44e+02

Pos logit contributions
 and+7.020
,+7.011
 the+6.839
.+6.734
 a+6.694
Neg logit contributions
lor-2.358
umm-2.246
ocol-2.140
arians-2.091
rol-2.048
Token up
Token ID510
Feature activation+5.44e+02

Pos logit contributions
 and+2.772
,+2.750
 the+2.559
.+2.490
 a+2.313
Neg logit contributions
lor-6.492
umm-6.371
ocol-6.362
rol-6.348
 former-6.315
Token upon
Token ID2402
Feature activation+5.44e+02

Pos logit contributions
 and+4.340
,+4.225
 the+4.086
.+4.041
 a+3.934
Neg logit contributions
lor-4.915
ocol-4.789
umm-4.746
rol-4.696
osity-4.678
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
,+0.658
 and+0.626
 the+0.368
.+0.333
 a+0.024
Neg logit contributions
lor-8.586
ocol-8.348
umm-8.312
rol-8.288
 former-8.272
Token time
Token ID640
Feature activation+5.44e+02

Pos logit contributions
 and+2.820
,+2.763
 the+2.654
.+2.553
 a+2.404
Neg logit contributions
lor-6.494
umm-6.362
ocol-6.316
arians-6.244
ides-6.240
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.219
,+0.020
 the+0.001
.-0.053
 a-0.206
Neg logit contributions
lor-9.045
ocol-8.908
umm-8.867
rol-8.813
arians-8.775
Token there
Token ID612
Feature activation+5.44e+02

Pos logit contributions
 and+2.004
,+1.891
.+1.733
 the+1.730
 a+1.495
Neg logit contributions
lor-7.191
ocol-7.121
umm-7.050
rol-7.021
osity-6.991
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+1.466
,+1.317
.+1.199
 the+1.185
 a+1.042
Neg logit contributions
lor-7.725
umm-7.546
rol-7.503
ocol-7.490
arians-7.370
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+0.639
,+0.607
.+0.392
 the+0.376
 a+0.083
Neg logit contributions
lor-8.625
umm-8.547
ocol-8.545
rol-8.431
 former-8.423
Token kind
Token ID1611
Feature activation+5.44e+02

Pos logit contributions
 and+2.954
,+2.888
 the+2.735
.+2.700
 a+2.518
Neg logit contributions
lor-6.259
umm-6.234
ocol-6.181
arians-6.129
 former-6.077
Token person
Token ID1048
Feature activation+5.44e+02

Pos logit contributions
 and+4.142
,+4.091
 the+3.992
.+3.917
 a+3.795
Neg logit contributions
lor-5.063
umm-5.031
ocol-4.951
arians-4.866
osity-4.837
Token named
Token ID3706
Feature activation+5.44e+02

Pos logit contributions
 and+3.606
,+3.542
 the+3.418
.+3.288
 a+3.231
Neg logit contributions
lor-5.626
ocol-5.490
umm-5.442
arians-5.424
rol-5.385
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+3.190
,+3.029
.+2.833
 the+2.767
 a+2.551
Neg logit contributions
lor-6.108
umm-6.084
ocol-6.074
arians-6.042
 former-5.946
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.550
,+0.494
 the+0.396
 a+0.221
.+0.091
Neg logit contributions
lor-8.633
 former-8.468
ocol-8.437
umm-8.431
arians-8.378
Token<|endoftext|>
Token ID50256
Feature activation+5.44e+02

Pos logit contributions
 and+1.624
,+1.605
 the+1.356
.+1.341
 a+1.146
Neg logit contributions
lor-7.744
umm-7.609
ocol-7.466
arians-7.449
rol-7.444
TokenOnce
Token ID7454
Feature activation+5.44e+02

Pos logit contributions
 and+2.041
,+2.011
 the+1.787
.+1.699
 a+1.587
Neg logit contributions
lor-7.310
umm-7.008
rol-6.984
ocol-6.948
arians-6.902
Token upon
Token ID2402
Feature activation+5.44e+02

Pos logit contributions
 and+4.340
,+4.225
 the+4.086
.+4.041
 a+3.934
Neg logit contributions
lor-4.915
ocol-4.789
umm-4.746
rol-4.696
osity-4.678
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
,+0.658
 and+0.626
 the+0.368
.+0.333
 a+0.024
Neg logit contributions
lor-8.586
ocol-8.348
umm-8.312
rol-8.288
 former-8.272
Token time
Token ID640
Feature activation+5.44e+02

Pos logit contributions
 and+2.820
,+2.763
 the+2.654
.+2.553
 a+2.404
Neg logit contributions
lor-6.494
umm-6.362
ocol-6.316
arians-6.244
ides-6.240
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.219
,+0.020
 the+0.001
.-0.053
 a-0.206
Neg logit contributions
lor-9.045
ocol-8.908
umm-8.867
rol-8.813
arians-8.775
Token there
Token ID612
Feature activation+5.44e+02

Pos logit contributions
 and+2.004
,+1.891
.+1.733
 the+1.730
 a+1.495
Neg logit contributions
lor-7.191
ocol-7.121
umm-7.050
rol-7.021
osity-6.991
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+1.466
,+1.317
.+1.199
 the+1.185
 a+1.042
Neg logit contributions
lor-7.725
umm-7.546
rol-7.503
ocol-7.490
arians-7.370
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+0.639
,+0.607
.+0.392
 the+0.376
 a+0.083
Neg logit contributions
lor-8.625
umm-8.547
ocol-8.545
rol-8.431
 former-8.423
Token kind
Token ID1611
Feature activation+5.44e+02

Pos logit contributions
 and+2.954
,+2.888
 the+2.735
.+2.700
 a+2.518
Neg logit contributions
lor-6.259
umm-6.234
ocol-6.181
arians-6.129
 former-6.077
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.489
,+1.452
 the+1.416
.+1.186
 a+1.179
Neg logit contributions
lor-7.639
arians-7.479
 former-7.475
umm-7.446
rol-7.374
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+2.617
 and+2.547
.+2.416
 the+2.390
 a+2.209
Neg logit contributions
lor-6.506
arians-6.395
umm-6.393
 former-6.383
ocol-6.364
Token he
Token ID339
Feature activation+5.44e+02

Pos logit contributions
 and+2.809
,+2.787
.+2.615
 the+2.474
 a+2.308
Neg logit contributions
lor-6.352
umm-6.211
 former-6.204
arians-6.202
osity-6.166
Token never
Token ID1239
Feature activation+5.44e+02

Pos logit contributions
,+3.695
 and+3.690
 the+3.530
.+3.486
 a+3.319
Neg logit contributions
lor-5.586
arians-5.414
 former-5.346
umm-5.329
rol-5.329
Token had
Token ID550
Feature activation+5.44e+02

Pos logit contributions
 and+4.201
,+4.097
 the+3.948
.+3.884
 a+3.736
Neg logit contributions
lor-5.041
umm-4.929
osity-4.920
 former-4.887
ocol-4.886
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+2.759
,+2.678
.+2.419
 the+2.413
 a+2.165
Neg logit contributions
lor-6.589
ocol-6.432
umm-6.400
 former-6.363
rol-6.360
Token good
Token ID922
Feature activation+5.44e+02

Pos logit contributions
 and+5.775
,+5.664
 the+5.593
.+5.507
 a+5.316
Neg logit contributions
lor-3.575
umm-3.440
 former-3.422
ocol-3.411
arians-3.363
Token night
Token ID1755
Feature activation+5.44e+02

Pos logit contributions
 and+4.878
,+4.819
 the+4.756
.+4.619
 a+4.466
Neg logit contributions
lor-4.403
umm-4.342
ocol-4.266
 former-4.245
ides-4.191
Token's
Token ID338
Feature activation+5.44e+02

Pos logit contributions
 and+3.207
,+3.148
 the+3.059
.+2.860
 a+2.856
Neg logit contributions
lor-6.063
umm-6.003
ocol-5.874
 former-5.801
arians-5.794
Token sleep
Token ID3993
Feature activation+5.44e+02

Pos logit contributions
 and+5.541
,+5.487
 the+5.369
.+5.210
 a+5.097
Neg logit contributions
lor-3.728
 former-3.676
umm-3.658
ocol-3.631
arians-3.548
Token again
Token ID757
Feature activation+5.44e+02

Pos logit contributions
 and+3.463
,+3.409
 the+3.318
 a+3.090
.+3.073
Neg logit contributions
lor-5.907
umm-5.689
arians-5.639
ides-5.569
ocol-5.568
Token again
Token ID757
Feature activation+5.44e+02

Pos logit contributions
 and+3.463
,+3.409
 the+3.318
 a+3.090
.+3.073
Neg logit contributions
lor-5.907
umm-5.689
arians-5.639
ides-5.569
ocol-5.568
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.550
,+0.494
 the+0.396
 a+0.221
.+0.091
Neg logit contributions
lor-8.633
 former-8.468
ocol-8.437
umm-8.431
arians-8.378
Token<|endoftext|>
Token ID50256
Feature activation+5.44e+02

Pos logit contributions
 and+1.624
,+1.605
 the+1.356
.+1.341
 a+1.146
Neg logit contributions
lor-7.744
umm-7.609
ocol-7.466
arians-7.449
rol-7.444
TokenOnce
Token ID7454
Feature activation+5.44e+02

Pos logit contributions
 and+2.041
,+2.011
 the+1.787
.+1.699
 a+1.587
Neg logit contributions
lor-7.310
umm-7.008
rol-6.984
ocol-6.948
arians-6.902
Token upon
Token ID2402
Feature activation+5.44e+02

Pos logit contributions
 and+4.340
,+4.225
 the+4.086
.+4.041
 a+3.934
Neg logit contributions
lor-4.915
ocol-4.789
umm-4.746
rol-4.696
osity-4.678
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
,+0.658
 and+0.626
 the+0.368
.+0.333
 a+0.024
Neg logit contributions
lor-8.586
ocol-8.348
umm-8.312
rol-8.288
 former-8.272
Token time
Token ID640
Feature activation+5.44e+02

Pos logit contributions
 and+2.820
,+2.763
 the+2.654
.+2.553
 a+2.404
Neg logit contributions
lor-6.494
umm-6.362
ocol-6.316
arians-6.244
ides-6.240
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.219
,+0.020
 the+0.001
.-0.053
 a-0.206
Neg logit contributions
lor-9.045
ocol-8.908
umm-8.867
rol-8.813
arians-8.775
Token there
Token ID612
Feature activation+5.44e+02

Pos logit contributions
 and+2.004
,+1.891
.+1.733
 the+1.730
 a+1.495
Neg logit contributions
lor-7.191
ocol-7.121
umm-7.050
rol-7.021
osity-6.991
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+1.466
,+1.317
.+1.199
 the+1.185
 a+1.042
Neg logit contributions
lor-7.725
umm-7.546
rol-7.503
ocol-7.490
arians-7.370
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+0.639
,+0.607
.+0.392
 the+0.376
 a+0.083
Neg logit contributions
lor-8.625
umm-8.547
ocol-8.545
rol-8.431
 former-8.423
Token scared
Token ID12008
Feature activation+5.44e+02

Pos logit contributions
 and+4.034
,+4.006
 the+3.766
.+3.760
 a+3.493
Neg logit contributions
lor-5.190
ocol-5.111
rol-5.054
 former-5.047
umm-5.036
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.575
,+0.530
 the+0.473
 a+0.312
.+0.275
Neg logit contributions
umm-8.438
ocol-8.416
lor-8.400
rol-8.265
arians-8.261
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+3.311
,+3.254
 the+3.038
.+2.984
 a+2.816
Neg logit contributions
lor-6.051
umm-5.944
ocol-5.860
arians-5.831
rol-5.799
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.470
,+1.413
 the+1.198
.+1.144
 a+1.002
Neg logit contributions
lor-7.909
ocol-7.676
umm-7.639
rol-7.630
arians-7.599
TokenTom
Token ID13787
Feature activation+5.44e+02

Pos logit contributions
 and+4.027
,+3.964
 the+3.788
.+3.735
 a+3.574
Neg logit contributions
lor-5.431
umm-5.246
ocol-5.225
arians-5.199
rol-5.155
Token went
Token ID1816
Feature activation+5.44e+02

Pos logit contributions
,+3.188
 and+3.183
 the+3.091
.+3.021
 a+2.873
Neg logit contributions
lor-6.002
arians-5.832
umm-5.747
ides-5.742
rol-5.740
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+2.099
,+2.062
 the+1.890
.+1.820
 a+1.690
Neg logit contributions
lor-7.136
rol-6.967
ocol-6.932
arians-6.931
umm-6.902
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+2.646
,+2.594
.+2.337
 the+2.229
 a+2.087
Neg logit contributions
lor-6.557
ocol-6.548
arians-6.421
 former-6.414
umm-6.411
Token bed
Token ID3996
Feature activation+5.44e+02

Pos logit contributions
 and+4.176
,+4.127
 the+3.974
.+3.903
 a+3.745
Neg logit contributions
lor-5.078
umm-5.023
 former-4.979
arians-4.977
ocol-4.917
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+1.394
 the+1.375
,+1.374
 a+1.130
.+1.119
Neg logit contributions
lor-7.813
umm-7.553
 former-7.525
arians-7.513
ides-7.458
Token hid
Token ID24519
Feature activation+5.44e+02

Pos logit contributions
 and+3.939
,+3.905
.+3.688
 the+3.647
 a+3.440
Neg logit contributions
lor-5.320
 former-5.106
arians-5.102
ocol-5.096
rol-5.047
Token say
Token ID910
Feature activation+5.44e+02

Pos logit contributions
 and+4.227
,+4.170
 the+4.021
.+3.933
 a+3.787
Neg logit contributions
lor-5.018
ocol-4.986
umm-4.970
ides-4.899
arians-4.883
Token anything
Token ID1997
Feature activation+5.44e+02

Pos logit contributions
 and+4.630
,+4.447
 the+4.300
.+4.254
 a+4.078
Neg logit contributions
lor-4.776
ocol-4.602
umm-4.522
arians-4.468
osity-4.467
Token either
Token ID2035
Feature activation+5.44e+02

Pos logit contributions
 and+2.149
,+2.081
 the+1.975
.+1.818
 a+1.792
Neg logit contributions
lor-7.034
ocol-6.882
umm-6.868
arians-6.862
osity-6.796
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.077
,+0.991
 the+0.953
 a+0.765
.+0.735
Neg logit contributions
lor-8.073
ocol-7.915
umm-7.914
ides-7.874
arians-7.824
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+3.219
,+3.171
 the+2.961
.+2.920
 a+2.745
Neg logit contributions
lor-6.167
umm-5.996
ocol-5.986
arians-5.932
rol-5.898
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
,+3.264
 and+3.251
 the+3.148
.+3.080
 a+2.930
Neg logit contributions
lor-5.925
arians-5.762
umm-5.708
ides-5.669
ocol-5.667
Token scared
Token ID12008
Feature activation+5.44e+02

Pos logit contributions
 and+4.034
,+4.006
 the+3.766
.+3.760
 a+3.493
Neg logit contributions
lor-5.190
ocol-5.111
rol-5.054
 former-5.047
umm-5.036
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.575
,+0.530
 the+0.473
 a+0.312
.+0.275
Neg logit contributions
umm-8.438
ocol-8.416
lor-8.400
rol-8.265
arians-8.261
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+3.311
,+3.254
 the+3.038
.+2.984
 a+2.816
Neg logit contributions
lor-6.051
umm-5.944
ocol-5.860
arians-5.831
rol-5.799
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.470
,+1.413
 the+1.198
.+1.144
 a+1.002
Neg logit contributions
lor-7.909
ocol-7.676
umm-7.639
rol-7.630
arians-7.599
TokenTom
Token ID13787
Feature activation+5.44e+02

Pos logit contributions
 and+4.027
,+3.964
 the+3.788
.+3.735
 a+3.574
Neg logit contributions
lor-5.431
umm-5.246
ocol-5.225
arians-5.199
rol-5.155
TokenTom
Token ID13787
Feature activation+5.44e+02

Pos logit contributions
 and+4.027
,+3.964
 the+3.788
.+3.735
 a+3.574
Neg logit contributions
lor-5.431
umm-5.246
ocol-5.225
arians-5.199
rol-5.155
Token went
Token ID1816
Feature activation+5.44e+02

Pos logit contributions
,+3.188
 and+3.183
 the+3.091
.+3.021
 a+2.873
Neg logit contributions
lor-6.002
arians-5.832
umm-5.747
ides-5.742
rol-5.740
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+2.099
,+2.062
 the+1.890
.+1.820
 a+1.690
Neg logit contributions
lor-7.136
rol-6.967
ocol-6.932
arians-6.931
umm-6.902
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+2.646
,+2.594
.+2.337
 the+2.229
 a+2.087
Neg logit contributions
lor-6.557
ocol-6.548
arians-6.421
 former-6.414
umm-6.411
Token bed
Token ID3996
Feature activation+5.44e+02

Pos logit contributions
 and+4.176
,+4.127
 the+3.974
.+3.903
 a+3.745
Neg logit contributions
lor-5.078
umm-5.023
 former-4.979
arians-4.977
ocol-4.917
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+1.394
 the+1.375
,+1.374
 a+1.130
.+1.119
Neg logit contributions
lor-7.813
umm-7.553
 former-7.525
arians-7.513
ides-7.458
Token hid
Token ID24519
Feature activation+5.44e+02

Pos logit contributions
 and+3.939
,+3.905
.+3.688
 the+3.647
 a+3.440
Neg logit contributions
lor-5.320
 former-5.106
arians-5.102
ocol-5.096
rol-5.047
Token under
Token ID739
Feature activation+5.44e+02

Pos logit contributions
 and+3.261
,+3.227
 the+2.977
.+2.911
 a+2.808
Neg logit contributions
lor-5.966
ocol-5.963
umm-5.870
arians-5.813
rol-5.793
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+1.727
,+1.670
.+1.401
 the+1.322
 a+1.122
Neg logit contributions
lor-7.531
ocol-7.520
umm-7.413
 former-7.313
arians-7.298
Token covers
Token ID8698
Feature activation+5.44e+02

Pos logit contributions
 and+4.833
,+4.799
 the+4.634
.+4.545
 a+4.438
Neg logit contributions
lor-4.468
umm-4.373
ocol-4.365
arians-4.236
rol-4.210
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.011
,+0.976
 the+0.816
 a+0.660
.+0.626
Neg logit contributions
lor-8.263
umm-8.082
ocol-8.048
rol-8.023
arians-7.936
Token under
Token ID739
Feature activation+5.44e+02

Pos logit contributions
 and+3.261
,+3.227
 the+2.977
.+2.911
 a+2.808
Neg logit contributions
lor-5.966
ocol-5.963
umm-5.870
arians-5.813
rol-5.793
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+1.727
,+1.670
.+1.401
 the+1.322
 a+1.122
Neg logit contributions
lor-7.531
ocol-7.520
umm-7.413
 former-7.313
arians-7.298
Token covers
Token ID8698
Feature activation+5.44e+02

Pos logit contributions
 and+4.833
,+4.799
 the+4.634
.+4.545
 a+4.438
Neg logit contributions
lor-4.468
umm-4.373
ocol-4.365
arians-4.236
rol-4.210
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.011
,+0.976
 the+0.816
 a+0.660
.+0.626
Neg logit contributions
lor-8.263
umm-8.082
ocol-8.048
rol-8.023
arians-7.936
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+3.554
,+3.516
 the+3.296
.+3.265
 a+3.039
Neg logit contributions
lor-5.774
umm-5.674
 former-5.607
ocol-5.596
osity-5.549
Token heard
Token ID2982
Feature activation+5.44e+02

Pos logit contributions
,+3.796
 and+3.784
 the+3.573
.+3.571
 a+3.356
Neg logit contributions
lor-5.492
arians-5.290
 former-5.258
ocol-5.241
umm-5.204
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.441
,+2.423
.+2.172
 the+2.088
 a+1.852
Neg logit contributions
lor-6.806
ocol-6.733
umm-6.670
rol-6.620
 former-6.615
Token sound
Token ID2128
Feature activation+5.44e+02

Pos logit contributions
 and+4.449
,+4.439
 the+4.248
.+4.154
 a+3.991
Neg logit contributions
lor-4.860
umm-4.806
ocol-4.755
arians-4.693
 former-4.681
Token again
Token ID757
Feature activation+5.44e+02

Pos logit contributions
,+1.130
 and+1.118
 the+1.008
.+0.873
 a+0.850
Neg logit contributions
lor-7.837
ocol-7.752
umm-7.715
rol-7.673
ides-7.638
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.405
,+0.379
 the+0.317
 a+0.174
.+0.094
Neg logit contributions
ocol-8.555
lor-8.555
umm-8.458
ides-8.422
 former-8.405
Token It
Token ID632
Feature activation+5.44e+02

Pos logit contributions
 and+3.461
,+3.419
 the+3.196
.+3.167
 a+2.945
Neg logit contributions
lor-5.924
umm-5.800
ocol-5.710
 former-5.678
arians-5.666
Token you
Token ID345
Feature activation+5.44e+02

Pos logit contributions
 and+3.163
,+3.101
.+2.874
 the+2.832
 a+2.683
Neg logit contributions
lor-6.126
ocol-6.040
umm-5.989
arians-5.939
rol-5.917
Token every
Token ID790
Feature activation+5.44e+02

Pos logit contributions
 and+3.071
,+3.005
 the+2.911
.+2.771
 a+2.670
Neg logit contributions
lor-6.220
ocol-6.010
umm-6.003
rol-5.997
arians-5.997
Token night
Token ID1755
Feature activation+5.44e+02

Pos logit contributions
 and+3.733
,+3.663
 the+3.512
.+3.434
 a+3.274
Neg logit contributions
lor-5.619
umm-5.490
ocol-5.437
arians-5.359
rol-5.354
Token!"
Token ID2474
Feature activation+5.44e+02

Pos logit contributions
 and+2.499
,+2.424
 the+2.356
 a+2.152
.+2.148
Neg logit contributions
lor-6.758
umm-6.654
ocol-6.567
arians-6.557
ides-6.511
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+2.220
,+2.211
 the+2.027
.+1.973
 a+1.846
Neg logit contributions
lor-6.985
umm-6.843
ocol-6.759
arians-6.758
rol-6.732
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+3.396
,+3.381
 the+3.277
.+3.215
 a+3.034
Neg logit contributions
lor-5.796
arians-5.616
rol-5.570
umm-5.557
ides-5.519
Token very
Token ID845
Feature activation+5.44e+02

Pos logit contributions
 and+3.671
,+3.625
.+3.390
 the+3.384
 a+3.127
Neg logit contributions
lor-5.568
ocol-5.414
rol-5.409
arians-5.376
 former-5.348
Token scared
Token ID12008
Feature activation+5.44e+02

Pos logit contributions
 and+3.763
,+3.644
 the+3.514
.+3.457
 a+3.296
Neg logit contributions
ocol-5.445
lor-5.443
umm-5.329
 former-5.323
rim-5.314
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+0.300
,+0.275
 the+0.226
 a+0.064
.+0.023
Neg logit contributions
lor-8.640
umm-8.631
ocol-8.618
arians-8.473
rol-8.465
Token cried
Token ID16896
Feature activation+5.44e+02

Pos logit contributions
 and+4.016
,+3.942
.+3.720
 the+3.710
 a+3.530
Neg logit contributions
lor-5.238
ocol-5.133
arians-5.080
rol-5.077
umm-5.055
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.719
,+0.630
 the+0.561
.+0.354
 a+0.279
Neg logit contributions
lor-8.310
umm-8.303
ocol-8.285
ides-8.185
 former-8.184
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+2.714
,+2.708
.+2.461
 the+2.341
 a+2.233
Neg logit contributions
lor-6.514
ocol-6.428
umm-6.390
arians-6.324
rol-6.278
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+0.570
 and+0.553
 the+0.499
.+0.311
 a+0.285
Neg logit contributions
lor-8.580
arians-8.364
ides-8.363
ocol-8.354
rol-8.326
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
 and+3.790
,+3.748
.+3.552
 the+3.416
 a+3.251
Neg logit contributions
lor-5.349
ocol-5.334
umm-5.277
arians-5.243
 former-5.198
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.151
,+0.947
 the+0.876
.+0.831
 a+0.679
Neg logit contributions
lor-8.185
ocol-8.068
umm-7.966
osity-7.930
rol-7.909
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+2.432
,+2.381
 the+2.230
.+2.183
 a+2.054
Neg logit contributions
lor-6.748
ocol-6.611
osity-6.578
arians-6.576
umm-6.528
TokenI
Token ID40
Feature activation+5.44e+02

Pos logit contributions
 and+6.191
,+6.154
 the+5.940
.+5.914
 a+5.732
Neg logit contributions
lor-3.269
umm-3.154
ocol-3.106
arians-3.057
rol-3.028
Token will
Token ID481
Feature activation+5.44e+02

Pos logit contributions
 and+4.555
,+4.459
 the+4.387
.+4.298
 a+4.117
Neg logit contributions
lor-4.748
ocol-4.624
rol-4.546
umm-4.540
 former-4.519
Token scare
Token ID19437
Feature activation+5.44e+02

Pos logit contributions
 and+3.919
,+3.812
 the+3.679
.+3.598
 a+3.449
Neg logit contributions
lor-5.405
arians-5.247
ocol-5.242
 former-5.205
umm-5.193
Token you
Token ID345
Feature activation+5.44e+02

Pos logit contributions
 and+3.163
,+3.101
.+2.874
 the+2.832
 a+2.683
Neg logit contributions
lor-6.126
ocol-6.040
umm-5.989
arians-5.939
rol-5.917
Token every
Token ID790
Feature activation+5.44e+02

Pos logit contributions
 and+3.071
,+3.005
 the+2.911
.+2.771
 a+2.670
Neg logit contributions
lor-6.220
ocol-6.010
umm-6.003
rol-5.997
arians-5.997
Token night
Token ID1755
Feature activation+5.44e+02

Pos logit contributions
 and+3.733
,+3.663
 the+3.512
.+3.434
 a+3.274
Neg logit contributions
lor-5.619
umm-5.490
ocol-5.437
arians-5.359
rol-5.354
Token big
Token ID1263
Feature activation+5.44e+02

Pos logit contributions
 and+3.999
,+3.919
 the+3.794
.+3.733
 a+3.559
Neg logit contributions
lor-5.328
umm-5.227
ocol-5.181
arians-5.118
 former-5.082
Token monster
Token ID9234
Feature activation+5.44e+02

Pos logit contributions
 and+4.606
,+4.520
 the+4.413
.+4.381
 a+4.225
Neg logit contributions
ocol-4.603
lor-4.593
umm-4.588
arians-4.440
 former-4.347
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.873
,+2.820
 the+2.700
.+2.543
 a+2.487
Neg logit contributions
lor-6.429
ocol-6.279
umm-6.250
arians-6.213
rol-6.183
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+3.252
,+3.214
.+2.951
 the+2.940
 a+2.733
Neg logit contributions
lor-6.081
umm-5.956
ocol-5.936
arians-5.787
osity-5.783
Token monster
Token ID9234
Feature activation+5.44e+02

Pos logit contributions
 and+5.192
,+5.119
 the+4.932
.+4.886
 a+4.681
Neg logit contributions
lor-4.209
umm-4.138
ocol-4.055
arians-4.002
 former-3.947
Token looked
Token ID3114
Feature activation+5.44e+02

Pos logit contributions
 and+3.077
,+3.040
 the+2.930
.+2.838
 a+2.722
Neg logit contributions
lor-6.152
arians-5.975
umm-5.959
ocol-5.951
rol-5.899
Token at
Token ID379
Feature activation+5.44e+02

Pos logit contributions
 and+3.555
,+3.513
 the+3.329
.+3.307
 a+3.098
Neg logit contributions
ocol-5.627
lor-5.577
rol-5.517
umm-5.513
arians-5.419
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+2.714
,+2.708
.+2.461
 the+2.341
 a+2.233
Neg logit contributions
lor-6.514
ocol-6.428
umm-6.390
arians-6.324
rol-6.278
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+0.570
 and+0.553
 the+0.499
.+0.311
 a+0.285
Neg logit contributions
lor-8.580
arians-8.364
ides-8.363
ocol-8.354
rol-8.326
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
 and+3.790
,+3.748
.+3.552
 the+3.416
 a+3.251
Neg logit contributions
lor-5.349
ocol-5.334
umm-5.277
arians-5.243
 former-5.198
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.151
,+0.947
 the+0.876
.+0.831
 a+0.679
Neg logit contributions
lor-8.185
ocol-8.068
umm-7.966
osity-7.930
rol-7.909
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+3.252
,+3.214
.+2.951
 the+2.940
 a+2.733
Neg logit contributions
lor-6.081
umm-5.956
ocol-5.936
arians-5.787
osity-5.783
Token monster
Token ID9234
Feature activation+5.44e+02

Pos logit contributions
 and+5.192
,+5.119
 the+4.932
.+4.886
 a+4.681
Neg logit contributions
lor-4.209
umm-4.138
ocol-4.055
arians-4.002
 former-3.947
Token looked
Token ID3114
Feature activation+5.44e+02

Pos logit contributions
 and+3.077
,+3.040
 the+2.930
.+2.838
 a+2.722
Neg logit contributions
lor-6.152
arians-5.975
umm-5.959
ocol-5.951
rol-5.899
Token at
Token ID379
Feature activation+5.44e+02

Pos logit contributions
 and+3.555
,+3.513
 the+3.329
.+3.307
 a+3.098
Neg logit contributions
ocol-5.627
lor-5.577
rol-5.517
umm-5.513
arians-5.419
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+2.714
,+2.708
.+2.461
 the+2.341
 a+2.233
Neg logit contributions
lor-6.514
ocol-6.428
umm-6.390
arians-6.324
rol-6.278
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+0.570
 and+0.553
 the+0.499
.+0.311
 a+0.285
Neg logit contributions
lor-8.580
arians-8.364
ides-8.363
ocol-8.354
rol-8.326
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
 and+3.790
,+3.748
.+3.552
 the+3.416
 a+3.251
Neg logit contributions
lor-5.349
ocol-5.334
umm-5.277
arians-5.243
 former-5.198
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.151
,+0.947
 the+0.876
.+0.831
 a+0.679
Neg logit contributions
lor-8.185
ocol-8.068
umm-7.966
osity-7.930
rol-7.909
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+2.432
,+2.381
 the+2.230
.+2.183
 a+2.054
Neg logit contributions
lor-6.748
ocol-6.611
osity-6.578
arians-6.576
umm-6.528
TokenI
Token ID40
Feature activation+5.44e+02

Pos logit contributions
 and+6.191
,+6.154
 the+5.940
.+5.914
 a+5.732
Neg logit contributions
lor-3.269
umm-3.154
ocol-3.106
arians-3.057
rol-3.028
Token will
Token ID481
Feature activation+5.44e+02

Pos logit contributions
 and+4.555
,+4.459
 the+4.387
.+4.298
 a+4.117
Neg logit contributions
lor-4.748
ocol-4.624
rol-4.546
umm-4.540
 former-4.519
Token that
Token ID326
Feature activation+5.44e+02

Pos logit contributions
 and+1.907
,+1.862
.+1.642
 the+1.626
 a+1.420
Neg logit contributions
lor-7.390
umm-7.263
ocol-7.251
arians-7.222
rol-7.140
Token day
Token ID1110
Feature activation+5.44e+02

Pos logit contributions
 and+2.995
,+2.895
 the+2.783
.+2.739
 a+2.586
Neg logit contributions
lor-6.287
umm-6.144
ocol-6.046
arians-6.000
osity-5.981
Token on
Token ID319
Feature activation+5.44e+02

Pos logit contributions
 and+2.094
,+1.957
 the+1.862
.+1.801
 a+1.668
Neg logit contributions
lor-7.236
umm-7.055
ocol-7.038
arians-7.004
rol-6.980
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.320
,+0.125
.+0.053
 the+0.034
 a-0.067
Neg logit contributions
lor-8.952
umm-8.769
ocol-8.749
arians-8.719
rol-8.565
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+2.796
,+2.721
.+2.552
 the+2.394
 a+2.249
Neg logit contributions
umm-6.293
ocol-6.247
arians-6.243
lor-6.238
osity-6.147
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+3.137
,+3.129
 the+3.011
.+2.975
 a+2.820
Neg logit contributions
lor-6.071
arians-5.869
umm-5.779
rol-5.764
osity-5.750
Token always
Token ID1464
Feature activation+5.44e+02

Pos logit contributions
 and+3.551
,+3.480
.+3.252
 the+3.198
 a+2.956
Neg logit contributions
lor-5.818
ocol-5.646
rol-5.589
arians-5.586
 former-5.585
Token scared
Token ID12008
Feature activation+5.44e+02

Pos logit contributions
 and+4.003
,+3.930
.+3.714
 the+3.682
 a+3.432
Neg logit contributions
lor-5.211
ocol-5.057
rol-5.047
 former-5.046
arians-5.039
Token in
Token ID287
Feature activation+5.44e+02

Pos logit contributions
 and+1.673
,+1.650
 the+1.530
.+1.421
 a+1.372
Neg logit contributions
umm-7.316
ocol-7.299
lor-7.283
rol-7.202
arians-7.170
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+3.828
,+3.744
.+3.494
 the+3.425
 a+3.251
Neg logit contributions
ocol-5.459
lor-5.431
 former-5.315
umm-5.304
arians-5.242
Token room
Token ID2119
Feature activation+5.44e+02

Pos logit contributions
 and+5.259
,+5.120
 the+5.024
.+4.922
 a+4.801
Neg logit contributions
 former-4.027
lor-4.018
umm-3.994
ocol-3.955
arians-3.843
Token!"
Token ID2474
Feature activation+5.44e+02

Pos logit contributions
 and+2.499
,+2.424
 the+2.356
 a+2.152
.+2.148
Neg logit contributions
lor-6.758
umm-6.654
ocol-6.567
arians-6.557
ides-6.511
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+2.220
,+2.211
 the+2.027
.+1.973
 a+1.846
Neg logit contributions
lor-6.985
umm-6.843
ocol-6.759
arians-6.758
rol-6.732
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+3.396
,+3.381
 the+3.277
.+3.215
 a+3.034
Neg logit contributions
lor-5.796
arians-5.616
rol-5.570
umm-5.557
ides-5.519
Token very
Token ID845
Feature activation+5.44e+02

Pos logit contributions
 and+3.671
,+3.625
.+3.390
 the+3.384
 a+3.127
Neg logit contributions
lor-5.568
ocol-5.414
rol-5.409
arians-5.376
 former-5.348
Token scared
Token ID12008
Feature activation+5.44e+02

Pos logit contributions
 and+3.763
,+3.644
 the+3.514
.+3.457
 a+3.296
Neg logit contributions
ocol-5.445
lor-5.443
umm-5.329
 former-5.323
rim-5.314
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+0.300
,+0.275
 the+0.226
 a+0.064
.+0.023
Neg logit contributions
lor-8.640
umm-8.631
ocol-8.618
arians-8.473
rol-8.465
Token cried
Token ID16896
Feature activation+5.44e+02

Pos logit contributions
 and+4.016
,+3.942
.+3.720
 the+3.710
 a+3.530
Neg logit contributions
lor-5.238
ocol-5.133
arians-5.080
rol-5.077
umm-5.055
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.719
,+0.630
 the+0.561
.+0.354
 a+0.279
Neg logit contributions
lor-8.310
umm-8.303
ocol-8.285
ides-8.185
 former-8.184
Token From
Token ID3574
Feature activation+5.44e+02

Pos logit contributions
 and+3.077
,+3.016
 the+2.786
.+2.771
 a+2.593
Neg logit contributions
lor-6.259
umm-6.159
arians-6.075
ocol-6.034
osity-5.998
Token that
Token ID326
Feature activation+5.44e+02

Pos logit contributions
 and+1.907
,+1.862
.+1.642
 the+1.626
 a+1.420
Neg logit contributions
lor-7.390
umm-7.263
ocol-7.251
arians-7.222
rol-7.140
Token day
Token ID1110
Feature activation+5.44e+02

Pos logit contributions
 and+2.995
,+2.895
 the+2.783
.+2.739
 a+2.586
Neg logit contributions
lor-6.287
umm-6.144
ocol-6.046
arians-6.000
osity-5.981
Token day
Token ID1110
Feature activation+5.44e+02

Pos logit contributions
 and+2.995
,+2.895
 the+2.783
.+2.739
 a+2.586
Neg logit contributions
lor-6.287
umm-6.144
ocol-6.046
arians-6.000
osity-5.981
Token on
Token ID319
Feature activation+5.44e+02

Pos logit contributions
 and+2.094
,+1.957
 the+1.862
.+1.801
 a+1.668
Neg logit contributions
lor-7.236
umm-7.055
ocol-7.038
arians-7.004
rol-6.980
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.320
,+0.125
.+0.053
 the+0.034
 a-0.067
Neg logit contributions
lor-8.952
umm-8.769
ocol-8.749
arians-8.719
rol-8.565
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+2.796
,+2.721
.+2.552
 the+2.394
 a+2.249
Neg logit contributions
umm-6.293
ocol-6.247
arians-6.243
lor-6.238
osity-6.147
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+3.137
,+3.129
 the+3.011
.+2.975
 a+2.820
Neg logit contributions
lor-6.071
arians-5.869
umm-5.779
rol-5.764
osity-5.750
Token always
Token ID1464
Feature activation+5.44e+02

Pos logit contributions
 and+3.551
,+3.480
.+3.252
 the+3.198
 a+2.956
Neg logit contributions
lor-5.818
ocol-5.646
rol-5.589
arians-5.586
 former-5.585
Token scared
Token ID12008
Feature activation+5.44e+02

Pos logit contributions
 and+4.003
,+3.930
.+3.714
 the+3.682
 a+3.432
Neg logit contributions
lor-5.211
ocol-5.057
rol-5.047
 former-5.046
arians-5.039
Token in
Token ID287
Feature activation+5.44e+02

Pos logit contributions
 and+1.673
,+1.650
 the+1.530
.+1.421
 a+1.372
Neg logit contributions
umm-7.316
ocol-7.299
lor-7.283
rol-7.202
arians-7.170
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+3.828
,+3.744
.+3.494
 the+3.425
 a+3.251
Neg logit contributions
ocol-5.459
lor-5.431
 former-5.315
umm-5.304
arians-5.242
Token room
Token ID2119
Feature activation+5.44e+02

Pos logit contributions
 and+5.259
,+5.120
 the+5.024
.+4.922
 a+4.801
Neg logit contributions
 former-4.027
lor-4.018
umm-3.994
ocol-3.955
arians-3.843
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.489
,+1.452
 the+1.416
.+1.186
 a+1.179
Neg logit contributions
lor-7.639
arians-7.479
 former-7.475
umm-7.446
rol-7.374
Token room
Token ID2119
Feature activation+5.44e+02

Pos logit contributions
 and+5.259
,+5.120
 the+5.024
.+4.922
 a+4.801
Neg logit contributions
 former-4.027
lor-4.018
umm-3.994
ocol-3.955
arians-3.843
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.489
,+1.452
 the+1.416
.+1.186
 a+1.179
Neg logit contributions
lor-7.639
arians-7.479
 former-7.475
umm-7.446
rol-7.374
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+2.617
 and+2.547
.+2.416
 the+2.390
 a+2.209
Neg logit contributions
lor-6.506
arians-6.395
umm-6.393
 former-6.383
ocol-6.364
Token he
Token ID339
Feature activation+5.44e+02

Pos logit contributions
 and+2.809
,+2.787
.+2.615
 the+2.474
 a+2.308
Neg logit contributions
lor-6.352
umm-6.211
 former-6.204
arians-6.202
osity-6.166
Token never
Token ID1239
Feature activation+5.44e+02

Pos logit contributions
,+3.695
 and+3.690
 the+3.530
.+3.486
 a+3.319
Neg logit contributions
lor-5.586
arians-5.414
 former-5.346
umm-5.329
rol-5.329
Token had
Token ID550
Feature activation+5.44e+02

Pos logit contributions
 and+4.201
,+4.097
 the+3.948
.+3.884
 a+3.736
Neg logit contributions
lor-5.041
umm-4.929
osity-4.920
 former-4.887
ocol-4.886
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+2.759
,+2.678
.+2.419
 the+2.413
 a+2.165
Neg logit contributions
lor-6.589
ocol-6.432
umm-6.400
 former-6.363
rol-6.360
Token good
Token ID922
Feature activation+5.44e+02

Pos logit contributions
 and+5.775
,+5.664
 the+5.593
.+5.507
 a+5.316
Neg logit contributions
lor-3.575
umm-3.440
 former-3.422
ocol-3.411
arians-3.363
Token night
Token ID1755
Feature activation+5.44e+02

Pos logit contributions
 and+4.878
,+4.819
 the+4.756
.+4.619
 a+4.466
Neg logit contributions
lor-4.403
umm-4.342
ocol-4.266
 former-4.245
ides-4.191
Token's
Token ID338
Feature activation+5.44e+02

Pos logit contributions
 and+3.207
,+3.148
 the+3.059
.+2.860
 a+2.856
Neg logit contributions
lor-6.063
umm-6.003
ocol-5.874
 former-5.801
arians-5.794
Token sleep
Token ID3993
Feature activation+5.44e+02

Pos logit contributions
 and+5.541
,+5.487
 the+5.369
.+5.210
 a+5.097
Neg logit contributions
lor-3.728
 former-3.676
umm-3.658
ocol-3.631
arians-3.548
Token opened
Token ID4721
Feature activation+5.44e+02

Pos logit contributions
 and+3.321
,+3.321
 the+3.212
.+3.163
 a+2.989
Neg logit contributions
lor-5.888
arians-5.686
rol-5.592
ides-5.580
umm-5.578
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+1.221
,+1.164
.+0.949
 the+0.913
 a+0.761
Neg logit contributions
lor-8.066
ocol-7.953
umm-7.907
rol-7.831
arians-7.821
Token closet
Token ID21615
Feature activation+5.44e+02

Pos logit contributions
 and+4.956
,+4.903
 the+4.751
.+4.657
 a+4.525
Neg logit contributions
lor-4.419
umm-4.277
ocol-4.275
arians-4.156
 former-4.147
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+2.587
 and+2.586
 the+2.571
 a+2.377
.+2.366
Neg logit contributions
lor-6.615
umm-6.381
arians-6.285
rol-6.269
osity-6.215
Token saw
Token ID2497
Feature activation+5.44e+02

Pos logit contributions
 and+3.164
,+3.134
.+2.942
 the+2.871
 a+2.685
Neg logit contributions
lor-6.118
ocol-5.942
rol-5.894
arians-5.881
umm-5.880
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+2.305
,+2.216
.+1.965
 the+1.888
 a+1.659
Neg logit contributions
lor-7.025
ocol-6.988
umm-6.956
rol-6.871
arians-6.817
Token big
Token ID1263
Feature activation+5.44e+02

Pos logit contributions
 and+3.999
,+3.919
 the+3.794
.+3.733
 a+3.559
Neg logit contributions
lor-5.328
umm-5.227
ocol-5.181
arians-5.118
 former-5.082
Token monster
Token ID9234
Feature activation+5.44e+02

Pos logit contributions
 and+4.606
,+4.520
 the+4.413
.+4.381
 a+4.225
Neg logit contributions
ocol-4.603
lor-4.593
umm-4.588
arians-4.440
 former-4.347
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.873
,+2.820
 the+2.700
.+2.543
 a+2.487
Neg logit contributions
lor-6.429
ocol-6.279
umm-6.250
arians-6.213
rol-6.183
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+3.252
,+3.214
.+2.951
 the+2.940
 a+2.733
Neg logit contributions
lor-6.081
umm-5.956
ocol-5.936
arians-5.787
osity-5.783
Token monster
Token ID9234
Feature activation+5.44e+02

Pos logit contributions
 and+5.192
,+5.119
 the+4.932
.+4.886
 a+4.681
Neg logit contributions
lor-4.209
umm-4.138
ocol-4.055
arians-4.002
 former-3.947
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+3.210
,+3.181
 the+2.934
.+2.933
 a+2.708
Neg logit contributions
lor-6.193
umm-6.040
ocol-5.987
rol-5.919
arians-5.908
Token opened
Token ID4721
Feature activation+5.44e+02

Pos logit contributions
 and+3.321
,+3.321
 the+3.212
.+3.163
 a+2.989
Neg logit contributions
lor-5.888
arians-5.686
rol-5.592
ides-5.580
umm-5.578
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+1.221
,+1.164
.+0.949
 the+0.913
 a+0.761
Neg logit contributions
lor-8.066
ocol-7.953
umm-7.907
rol-7.831
arians-7.821
Token closet
Token ID21615
Feature activation+5.44e+02

Pos logit contributions
 and+4.956
,+4.903
 the+4.751
.+4.657
 a+4.525
Neg logit contributions
lor-4.419
umm-4.277
ocol-4.275
arians-4.156
 former-4.147
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+2.587
 and+2.586
 the+2.571
 a+2.377
.+2.366
Neg logit contributions
lor-6.615
umm-6.381
arians-6.285
rol-6.269
osity-6.215
Token saw
Token ID2497
Feature activation+5.44e+02

Pos logit contributions
 and+3.164
,+3.134
.+2.942
 the+2.871
 a+2.685
Neg logit contributions
lor-6.118
ocol-5.942
rol-5.894
arians-5.881
umm-5.880
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+2.305
,+2.216
.+1.965
 the+1.888
 a+1.659
Neg logit contributions
lor-7.025
ocol-6.988
umm-6.956
rol-6.871
arians-6.817
Token big
Token ID1263
Feature activation+5.44e+02

Pos logit contributions
 and+3.999
,+3.919
 the+3.794
.+3.733
 a+3.559
Neg logit contributions
lor-5.328
umm-5.227
ocol-5.181
arians-5.118
 former-5.082
Token monster
Token ID9234
Feature activation+5.44e+02

Pos logit contributions
 and+4.606
,+4.520
 the+4.413
.+4.381
 a+4.225
Neg logit contributions
ocol-4.603
lor-4.593
umm-4.588
arians-4.440
 former-4.347
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.873
,+2.820
 the+2.700
.+2.543
 a+2.487
Neg logit contributions
lor-6.429
ocol-6.279
umm-6.250
arians-6.213
rol-6.183
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+3.252
,+3.214
.+2.951
 the+2.940
 a+2.733
Neg logit contributions
lor-6.081
umm-5.956
ocol-5.936
arians-5.787
osity-5.783
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.441
,+2.423
.+2.172
 the+2.088
 a+1.852
Neg logit contributions
lor-6.806
ocol-6.733
umm-6.670
rol-6.620
 former-6.615
Token sound
Token ID2128
Feature activation+5.44e+02

Pos logit contributions
 and+4.449
,+4.439
 the+4.248
.+4.154
 a+3.991
Neg logit contributions
lor-4.860
umm-4.806
ocol-4.755
arians-4.693
 former-4.681
Token again
Token ID757
Feature activation+5.44e+02

Pos logit contributions
,+1.130
 and+1.118
 the+1.008
.+0.873
 a+0.850
Neg logit contributions
lor-7.837
ocol-7.752
umm-7.715
rol-7.673
ides-7.638
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.405
,+0.379
 the+0.317
 a+0.174
.+0.094
Neg logit contributions
ocol-8.555
lor-8.555
umm-8.458
ides-8.422
 former-8.405
Token It
Token ID632
Feature activation+5.44e+02

Pos logit contributions
 and+3.461
,+3.419
 the+3.196
.+3.167
 a+2.945
Neg logit contributions
lor-5.924
umm-5.800
ocol-5.710
 former-5.678
arians-5.666
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+3.713
,+3.691
 the+3.456
.+3.452
 a+3.243
Neg logit contributions
lor-5.568
umm-5.400
rol-5.351
ocol-5.337
arians-5.328
Token coming
Token ID2406
Feature activation+5.44e+02

Pos logit contributions
 and+2.872
,+2.829
.+2.564
 the+2.553
 a+2.323
Neg logit contributions
ocol-6.355
lor-6.341
umm-6.276
 former-6.239
rol-6.193
Token from
Token ID422
Feature activation+5.44e+02

Pos logit contributions
 and+2.465
,+2.437
 the+2.281
.+2.183
 a+2.050
Neg logit contributions
lor-6.712
umm-6.689
ocol-6.589
rol-6.587
arians-6.548
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+2.775
,+2.765
.+2.473
 the+2.464
 a+2.265
Neg logit contributions
ocol-6.418
lor-6.386
umm-6.330
 former-6.265
arians-6.256
Token closet
Token ID21615
Feature activation+5.44e+02

Pos logit contributions
 and+4.567
,+4.501
 the+4.386
.+4.263
 a+4.138
Neg logit contributions
lor-4.710
umm-4.661
ocol-4.600
 former-4.598
arians-4.594
Token!
Token ID0
Feature activation+5.44e+02

Pos logit contributions
 and+2.156
 the+2.121
,+2.113
 a+1.882
.+1.821
Neg logit contributions
lor-7.148
umm-6.931
arians-6.887
rol-6.788
 former-6.785
Token!
Token ID0
Feature activation+5.44e+02

Pos logit contributions
 and+2.156
 the+2.121
,+2.113
 a+1.882
.+1.821
Neg logit contributions
lor-7.148
umm-6.931
arians-6.887
rol-6.788
 former-6.785
Token Tom
Token ID4186
Feature activation+5.44e+02

Pos logit contributions
 and+3.210
,+3.181
 the+2.934
.+2.933
 a+2.708
Neg logit contributions
lor-6.193
umm-6.040
ocol-5.987
rol-5.919
arians-5.908
Token opened
Token ID4721
Feature activation+5.44e+02

Pos logit contributions
 and+3.321
,+3.321
 the+3.212
.+3.163
 a+2.989
Neg logit contributions
lor-5.888
arians-5.686
rol-5.592
ides-5.580
umm-5.578
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+1.221
,+1.164
.+0.949
 the+0.913
 a+0.761
Neg logit contributions
lor-8.066
ocol-7.953
umm-7.907
rol-7.831
arians-7.821
Token closet
Token ID21615
Feature activation+5.44e+02

Pos logit contributions
 and+4.956
,+4.903
 the+4.751
.+4.657
 a+4.525
Neg logit contributions
lor-4.419
umm-4.277
ocol-4.275
arians-4.156
 former-4.147
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+2.587
 and+2.586
 the+2.571
 a+2.377
.+2.366
Neg logit contributions
lor-6.615
umm-6.381
arians-6.285
rol-6.269
osity-6.215
Token saw
Token ID2497
Feature activation+5.44e+02

Pos logit contributions
 and+3.164
,+3.134
.+2.942
 the+2.871
 a+2.685
Neg logit contributions
lor-6.118
ocol-5.942
rol-5.894
arians-5.881
umm-5.880
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+2.305
,+2.216
.+1.965
 the+1.888
 a+1.659
Neg logit contributions
lor-7.025
ocol-6.988
umm-6.956
rol-6.871
arians-6.817
Token big
Token ID1263
Feature activation+5.44e+02

Pos logit contributions
 and+3.999
,+3.919
 the+3.794
.+3.733
 a+3.559
Neg logit contributions
lor-5.328
umm-5.227
ocol-5.181
arians-5.118
 former-5.082
Token monster
Token ID9234
Feature activation+5.44e+02

Pos logit contributions
 and+4.606
,+4.520
 the+4.413
.+4.381
 a+4.225
Neg logit contributions
ocol-4.603
lor-4.593
umm-4.588
arians-4.440
 former-4.347
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.873
,+2.820
 the+2.700
.+2.543
 a+2.487
Neg logit contributions
lor-6.429
ocol-6.279
umm-6.250
arians-6.213
rol-6.183