TOP ACTIVATIONS
MAX = 1.180

- and - a - half - deep " source rock
Subscribe now for as little as $ 2 a month !
Arc < Board >> , // The previous board , if
to elect , at whatever cost , whichever Democrat is running
bills on Oct . 17 By law , the
: Option < Board >, // The previous board | ----------------
on Oct . 17 By law , the national
than through a nice Public Service Announce ment ? And what
is requesting three and a half year sentence for the blogger
--+ + -------------------- --+ < -- Start of system () '
. copy _ board (); // !! !!! self . prev
legal landscape . In that sense , ap ople ctic opponents
not optimistic given his track record of late . The Jaguars
outraged at these so - called supporters of Bernie .
point . Liquid ity , merchants , and user
u Part icle ) { // For each particle in the
out , and in what order the neurons activated .
Jordan Kal ish Surv iv or s
the game in as little as 2 weeks after the campaign
more than two and a half , because we want to

MIN ACTIVATIONS
MIN = -1.523

pur ak ushi Profile Joined August 2012 United States 3
Ko Do Profile Joined December 2010 United States 6
TB one - Profile Blog Joined November 2010 United States 21
heart l xp Profile Joined September 2010 United States 10
re i Profile Blog Joined October 2002 United States 3
ada ver BB Profile Blog Joined June 2009 Germany 247 34
K ier an Profile Blog Joined August 2011 United States 6
J aaaa asper Profile Blog Joined April 2012 United States 5
of Dal K Profile Blog Joined June 2012 Canada 16 10
Dark horse Profile Blog Joined December 2011 United States 228
PowerPoint slide Power Point slide PNG larger image
into a blender . Do le 's Orange - P ine
Big Fan Profile Blog Joined December 2010 TL AD T
and N INE DO LL ARS ? N INE DO
spokesman said : " Th ur rock Council is following up
, physicists have turned their sights to the phase that occurs
, and off the do le forever . The
The responses I receive are lad en with exp let ives
was being prepared at Cape Canaveral in Florida for an 18
Property ' H KC U :\ Control Panel \ Desktop '

INTERVAL 0.504 - 1.180
CONTAINS 0.587%

the temple referring to Pand ya Kings and Ray a (
and it strikes an excellent balance between speed , stability ,
the heart beat faster , reverber ating against the rib cage
once for their most recent video , Mir acles
to the so - called s avage

INTERVAL -0.171 - 0.504
CONTAINS 83.485%

their imaging , transparency , sound st aging , and surprising
tremb led . <|endoftext|> Al ain B ad i ou The
were set to enjoy a wonderful Easter Br unch at Ben
a ringing slap down against the conf lation of sovereign Israel
picked based more on who would be better fitted for the

INTERVAL -0.847 - -0.171
CONTAINS 15.913%

up all the remaining cap space . So ,
game the system . This is the first of the weekly
well . After the initial euph oria wears off
Our sub - 2 marathon page has everything you need to
but as a native Vietnamese person we strive hard because

INTERVAL -1.523 - -0.847
CONTAINS 0.014%

support the mission of New Advent and get the full contents
linear Dead Space and Bio Shock never captured . It
contains no dialog or on om at op o etic words
Police have also seized two dump ers and arrested d umper
true path , you are damned . I know
Token-
Token ID12
Feature activation+4.30e-02

Pos logit contributions
ertodd+0.035
rive+0.034
ointed+0.034
rived+0.034
ulously+0.034
Neg logit contributions
izabeth-0.053
-0.051
 awa-0.049
 Definition-0.046
sense-0.046
Tokenand
Token ID392
Feature activation+3.99e-01

Pos logit contributions
izabeth+0.154
+0.149
 awa+0.143
 Definition+0.134
sense+0.133
Neg logit contributions
ertodd-0.098
rive-0.095
ointed-0.095
rived-0.095
ulously-0.095
Token-
Token ID12
Feature activation+1.92e-01

Pos logit contributions
izabeth+1.336
+1.289
 awa+1.218
 Definition+1.147
sense+1.142
Neg logit contributions
ertodd-1.002
rived-0.971
rive-0.971
ointed-0.968
ulously-0.966
Tokena
Token ID64
Feature activation+6.15e-01

Pos logit contributions
izabeth+0.553
+0.529
 awa+0.497
sense+0.463
 Definition+0.458
Neg logit contributions
ertodd-0.578
ointed-0.563
rived-0.560
ulously-0.559
rive-0.558
Token-
Token ID12
Feature activation+1.19e-01

Pos logit contributions
izabeth+2.018
+1.933
 awa+1.779
sense+1.709
 Definition+1.708
Neg logit contributions
ertodd-1.597
rived-1.546
rive-1.543
ointed-1.542
ulously-1.540
Tokenhalf
Token ID13959
Feature activation+1.18e+00

Pos logit contributions
izabeth+0.346
+0.332
 awa+0.308
sense+0.291
 Definition+0.288
Neg logit contributions
ertodd-0.356
ointed-0.347
rived-0.344
ulously-0.344
rive-0.343
Token-
Token ID12
Feature activation+1.77e-01

Pos logit contributions
izabeth+4.051
+3.931
 awa+3.725
 Definition+3.510
sense+3.432
Neg logit contributions
ertodd-2.921
ointed-2.719
rived-2.704
rive-2.688
ulously-2.654
Tokendeep
Token ID22089
Feature activation+2.18e-01

Pos logit contributions
izabeth+0.596
+0.575
 awa+0.544
sense+0.512
 Definition+0.511
Neg logit contributions
ertodd-0.452
ointed-0.436
ulously-0.431
rived-0.430
rive-0.429
Token "
Token ID366
Feature activation+1.89e-01

Pos logit contributions
izabeth+0.816
+0.793
 awa+0.755
 Definition+0.717
sense+0.707
Neg logit contributions
ertodd-0.471
ointed-0.449
rive-0.448
rived-0.446
ulously-0.443
Tokensource
Token ID10459
Feature activation-2.98e-01

Pos logit contributions
izabeth+0.632
+0.615
 awa+0.578
sense+0.546
 Definition+0.544
Neg logit contributions
ertodd-0.480
ointed-0.464
ulously-0.459
rived-0.458
rive-0.456
Token rock
Token ID3881
Feature activation-2.53e-02

Pos logit contributions
ertodd+0.665
rive+0.649
ointed+0.648
ulously+0.646
rived+0.646
Neg logit contributions
izabeth-1.079
-1.040
 awa-0.998
 Definition-0.937
sense-0.932
TokenSubscribe
Token ID27125
Feature activation+1.36e-01

Pos logit contributions
izabeth+1.281
+1.242
 awa+1.157
 Definition+1.113
sense+1.107
Neg logit contributions
ertodd-1.048
rived-1.007
ulously-1.000
rive-1.000
ointed-0.995
Token now
Token ID783
Feature activation-1.77e-02

Pos logit contributions
izabeth+0.414
+0.400
 awa+0.376
 Definition+0.355
ッド+0.352
Neg logit contributions
ertodd-0.384
rived-0.367
rive-0.366
ointed-0.366
ulously-0.364
Token for
Token ID329
Feature activation+7.42e-03

Pos logit contributions
ertodd+0.032
ointed+0.031
ulously+0.030
rive+0.030
rived+0.030
Neg logit contributions
izabeth-0.072
-0.070
 awa-0.067
 Definition-0.064
sense-0.064
Token as
Token ID355
Feature activation-7.03e-02

Pos logit contributions
izabeth+0.029
+0.028
 awa+0.027
 Definition+0.025
ッド+0.025
Neg logit contributions
ertodd-0.015
rive-0.014
rived-0.014
ointed-0.014
ulously-0.014
Token little
Token ID1310
Feature activation-3.74e-01

Pos logit contributions
ertodd+0.195
rived+0.187
ulously+0.185
ointed+0.185
rive+0.185
Neg logit contributions
izabeth-0.219
-0.215
 awa-0.200
 Definition-0.189
sense-0.187
Token as
Token ID355
Feature activation+1.12e+00

Pos logit contributions
ertodd+0.714
rive+0.690
ointed+0.688
rived+0.684
ulously+0.684
Neg logit contributions
izabeth-1.483
-1.435
 awa-1.384
 Definition-1.305
sense-1.297
Token $
Token ID720
Feature activation+3.82e-01

Pos logit contributions
izabeth+3.343
+3.125
 awa+2.962
 Definition+2.860
sense+2.792
Neg logit contributions
ertodd-3.229
ulously-3.185
rived-3.140
rive-3.135
ointed-3.091
Token2
Token ID17
Feature activation+4.64e-01

Pos logit contributions
izabeth+1.063
+1.010
 awa+0.956
 Definition+0.900
sense+0.893
Neg logit contributions
ertodd-1.175
ulously-1.144
ointed-1.134
rived-1.132
rive-1.127
Token a
Token ID257
Feature activation+3.00e-01

Pos logit contributions
izabeth+1.333
+1.288
 awa+1.212
 Definition+1.137
sense+1.128
Neg logit contributions
ertodd-1.385
ulously-1.331
ointed-1.324
rived-1.324
rive-1.310
Token month
Token ID1227
Feature activation+2.55e-01

Pos logit contributions
izabeth+1.329
+1.290
 awa+1.249
 Definition+1.180
ッド+1.179
Neg logit contributions
ertodd-0.436
rive-0.419
ointed-0.418
ulously-0.415
rived-0.415
Token!
Token ID0
Feature activation-4.20e-02

Pos logit contributions
izabeth+0.867
+0.835
 awa+0.796
 Definition+0.749
sense+0.743
Neg logit contributions
ertodd-0.629
ointed-0.610
rive-0.610
rived-0.608
ulously-0.607
Token Arc
Token ID10173
Feature activation+1.67e-01

Pos logit contributions
izabeth+1.107
+1.068
 awa+1.022
 Definition+0.958
sense+0.939
Neg logit contributions
ertodd-0.986
rive-0.956
rived-0.956
ointed-0.956
ulously-0.951
Token <
Token ID1279
Feature activation+5.04e-02

Pos logit contributions
izabeth+0.483
+0.463
 awa+0.439
 Definition+0.406
sense+0.401
Neg logit contributions
ertodd-0.496
rive-0.484
ointed-0.484
ulously-0.483
rived-0.483
Token Board
Token ID5926
Feature activation+3.95e-01

Pos logit contributions
izabeth+0.166
+0.160
 awa+0.152
 Definition+0.142
sense+0.141
Neg logit contributions
ertodd-0.130
rive-0.127
ointed-0.127
rived-0.126
ulously-0.126
Token >>
Token ID9609
Feature activation+2.99e-01

Pos logit contributions
izabeth+1.162
+1.114
 awa+1.056
 Definition+0.982
sense+0.970
Neg logit contributions
ertodd-1.158
ointed-1.129
rive-1.129
ulously-1.127
rived-1.127
Token,
Token ID837
Feature activation+3.01e-01

Pos logit contributions
izabeth+0.999
+0.963
 awa+0.921
 Definition+0.862
sense+0.852
Neg logit contributions
ertodd-0.755
rive-0.733
ulously-0.732
ointed-0.731
rived-0.729
Token //
Token ID3373
Feature activation+1.10e+00

Pos logit contributions
izabeth+0.816
+0.778
 awa+0.734
 Definition+0.674
sense+0.668
Neg logit contributions
ertodd-0.954
rive-0.934
ointed-0.933
rived-0.931
ulously-0.931
Token The
Token ID383
Feature activation+4.86e-01

Pos logit contributions
izabeth+3.564
+3.450
 awa+3.355
 Definition+3.188
sense+3.092
Neg logit contributions
ertodd-2.815
rive-2.740
rived-2.734
ulously-2.718
ointed-2.706
Token previous
Token ID2180
Feature activation+1.99e-01

Pos logit contributions
izabeth+1.672
+1.616
 awa+1.558
 Definition+1.457
sense+1.441
Neg logit contributions
ertodd-1.175
rive-1.139
ointed-1.135
rived-1.134
ulously-1.133
Token board
Token ID3096
Feature activation+2.20e-01

Pos logit contributions
izabeth+0.575
+0.550
 awa+0.525
 Definition+0.483
sense+0.479
Neg logit contributions
ertodd-0.594
rive-0.578
ointed-0.578
rived-0.578
ulously-0.577
Token,
Token ID11
Feature activation+2.73e-02

Pos logit contributions
izabeth+0.637
+0.610
 awa+0.579
 Definition+0.535
sense+0.530
Neg logit contributions
ertodd-0.655
rive-0.641
ointed-0.640
rived-0.638
ulously-0.638
Token if
Token ID611
Feature activation+2.20e-01

Pos logit contributions
izabeth+0.091
+0.087
 awa+0.083
 Definition+0.078
ッド+0.077
Neg logit contributions
ertodd-0.069
rive-0.068
ointed-0.068
rived-0.067
ulously-0.067
Token to
Token ID284
Feature activation+3.65e-01

Pos logit contributions
izabeth+0.058
+0.056
 awa+0.054
 Definition+0.051
sense+0.050
Neg logit contributions
ertodd-0.033
rive-0.032
ointed-0.032
ulously-0.032
rived-0.032
Token elect
Token ID1742
Feature activation+2.41e-01

Pos logit contributions
izabeth+1.239
+1.186
 awa+1.147
 Definition+1.074
sense+1.058
Neg logit contributions
ertodd-0.906
rive-0.871
rived-0.870
ointed-0.868
ulously-0.866
Token,
Token ID11
Feature activation-9.01e-02

Pos logit contributions
izabeth+0.749
+0.717
 awa+0.683
 Definition+0.635
sense+0.630
Neg logit contributions
ertodd-0.668
rive-0.649
rived-0.647
ulously-0.647
ointed-0.647
Token at
Token ID379
Feature activation+8.19e-02

Pos logit contributions
ertodd+0.197
rive+0.191
ointed+0.190
rived+0.190
ulously+0.190
Neg logit contributions
izabeth-0.332
-0.321
 awa-0.308
 Definition-0.290
sense-0.288
Token whatever
Token ID4232
Feature activation+3.70e-02

Pos logit contributions
izabeth+0.269
+0.258
 awa+0.246
 Definition+0.231
sense+0.228
Neg logit contributions
ertodd-0.212
rive-0.206
ointed-0.206
rived-0.205
ulously-0.205
Token cost
Token ID1575
Feature activation+1.06e+00

Pos logit contributions
izabeth+0.100
+0.095
 awa+0.090
 Definition+0.083
sense+0.082
Neg logit contributions
ertodd-0.118
rive-0.115
ointed-0.115
rived-0.115
ulously-0.115
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
izabeth+3.474
+3.298
 awa+3.187
 Definition+3.006
sense+2.959
Neg logit contributions
ertodd-2.749
rive-2.590
rived-2.573
ulously-2.562
ointed-2.538
Token whichever
Token ID26204
Feature activation+5.47e-03

Pos logit contributions
izabeth+0.668
+0.639
 awa+0.609
 Definition+0.568
sense+0.562
Neg logit contributions
ertodd-0.587
rive-0.571
rived-0.569
ulously-0.568
ointed-0.567
Token Democrat
Token ID9755
Feature activation+3.49e-01

Pos logit contributions
izabeth+0.019
+0.018
 awa+0.017
 Definition+0.016
sense+0.016
Neg logit contributions
ertodd-0.013
ointed-0.013
rive-0.013
rived-0.013
ulously-0.013
Token is
Token ID318
Feature activation+1.43e-01

Pos logit contributions
izabeth+1.099
+1.050
 awa+1.004
 Definition+0.937
sense+0.927
Neg logit contributions
ertodd-0.953
rive-0.920
rived-0.918
ulously-0.917
ointed-0.916
Token running
Token ID2491
Feature activation+6.14e-02

Pos logit contributions
izabeth+0.488
+0.469
 awa+0.449
 Definition+0.420
sense+0.417
Neg logit contributions
ertodd-0.356
rive-0.344
ointed-0.343
ulously-0.343
rived-0.343
Token bills
Token ID9024
Feature activation-6.18e-02

Pos logit contributions
izabeth+0.334
+0.319
 awa+0.303
 Definition+0.280
sense+0.278
Neg logit contributions
ertodd-0.335
rive-0.328
ointed-0.327
rived-0.326
ulously-0.326
Token on
Token ID319
Feature activation-1.00e-01

Pos logit contributions
ertodd+0.135
rive+0.131
ointed+0.131
rived+0.130
ulously+0.130
Neg logit contributions
izabeth-0.228
-0.220
 awa-0.211
 Definition-0.199
sense-0.197
Token Oct
Token ID2556
Feature activation+2.69e-02

Pos logit contributions
ertodd+0.199
rive+0.192
ointed+0.191
rived+0.190
ulously+0.190
Neg logit contributions
izabeth-0.390
-0.377
 awa-0.363
 Definition-0.343
sense-0.341
Token.
Token ID13
Feature activation+6.16e-02

Pos logit contributions
izabeth+0.091
+0.087
 awa+0.083
 Definition+0.078
sense+0.077
Neg logit contributions
ertodd-0.067
rive-0.066
ointed-0.065
rived-0.065
ulously-0.065
Token 17
Token ID1596
Feature activation+4.63e-01

Pos logit contributions
izabeth+0.211
+0.203
 awa+0.194
sense+0.180
 Definition+0.180
Neg logit contributions
ertodd-0.151
rive-0.148
ointed-0.147
rived-0.147
ulously-0.146
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
izabeth+1.555
+1.492
 awa+1.423
 Definition+1.354
ッド+1.329
Neg logit contributions
ertodd-1.165
rive-1.130
ointed-1.130
ulously-1.127
rived-1.123
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
izabeth+3.055
+3.012
 Definition+2.629
sense+2.627
iq+2.593
Neg logit contributions
ertodd-3.121
ulously-3.014
rived-3.013
rive-2.993
ointed-2.970
TokenBy
Token ID3886
Feature activation+5.31e-01

Pos logit contributions
+3.368
izabeth+3.361
sense+2.980
 awa+2.964
iq+2.931
Neg logit contributions
ertodd-2.643
ointed-2.522
rived-2.506
ulously-2.500
rive-2.458
Token law
Token ID1099
Feature activation-7.24e-02

Pos logit contributions
izabeth+1.891
+1.809
 awa+1.740
 Definition+1.648
sense+1.630
Neg logit contributions
ertodd-1.222
rive-1.190
rived-1.185
ointed-1.173
ulously-1.173
Token,
Token ID11
Feature activation-1.76e-01

Pos logit contributions
ertodd+0.171
rive+0.166
ointed+0.165
rived+0.164
ulously+0.164
Neg logit contributions
izabeth-0.255
-0.245
 awa-0.235
 Definition-0.221
sense-0.219
Token the
Token ID262
Feature activation-6.75e-02

Pos logit contributions
ertodd+0.459
rive+0.447
ointed+0.446
rived+0.445
ulously+0.444
Neg logit contributions
izabeth-0.575
-0.553
 awa-0.527
 Definition-0.492
sense-0.489
Token:
Token ID25
Feature activation-3.47e-02

Pos logit contributions
ertodd+0.813
rive+0.797
ointed+0.792
ulously+0.791
rived+0.791
Neg logit contributions
izabeth-1.131
-1.080
 awa-1.027
ッド-0.965
sense-0.961
Token Option
Token ID16018
Feature activation-2.76e-01

Pos logit contributions
ertodd+0.096
rive+0.094
ointed+0.093
ulously+0.093
rived+0.093
Neg logit contributions
izabeth-0.108
-0.103
 awa-0.098
ッド-0.091
sense-0.091
Token<
Token ID27
Feature activation+7.12e-02

Pos logit contributions
ertodd+0.857
rive+0.838
ointed+0.837
rived+0.835
ulously+0.835
Neg logit contributions
izabeth-0.767
-0.731
 awa-0.691
 Definition-0.634
sense-0.631
TokenBoard
Token ID29828
Feature activation+1.10e-01

Pos logit contributions
izabeth+0.224
+0.215
 awa+0.205
 Definition+0.190
sense+0.189
Neg logit contributions
ertodd-0.194
rive-0.189
ointed-0.189
rived-0.189
ulously-0.188
Token>,
Token ID22330
Feature activation+9.53e-04

Pos logit contributions
izabeth+0.302
+0.288
 awa+0.272
 Definition+0.251
sense+0.248
Neg logit contributions
ertodd-0.346
ointed-0.339
rive-0.339
ulously-0.338
rived-0.338
Token //
Token ID3373
Feature activation+1.05e+00

Pos logit contributions
izabeth+0.003
+0.002
 awa+0.002
ッド+0.002
 Definition+0.002
Neg logit contributions
ertodd-0.003
ointed-0.003
rive-0.003
rived-0.003
ulously-0.003
Token The
Token ID383
Feature activation+3.87e-01

Pos logit contributions
izabeth+3.365
+3.270
 awa+3.157
 Definition+2.982
sense+2.911
Neg logit contributions
ertodd-2.768
rive-2.689
ulously-2.671
rived-2.668
ointed-2.668
Token previous
Token ID2180
Feature activation+2.55e-01

Pos logit contributions
izabeth+1.332
+1.285
 awa+1.232
 Definition+1.154
sense+1.144
Neg logit contributions
ertodd-0.940
rive-0.913
ointed-0.911
rived-0.908
ulously-0.908
Token board
Token ID3096
Feature activation+2.82e-01

Pos logit contributions
izabeth+0.752
+0.721
 awa+0.689
 Definition+0.636
sense+0.629
Neg logit contributions
ertodd-0.749
rive-0.729
ointed-0.728
rived-0.727
ulously-0.726
Token |
Token ID930
Feature activation+1.46e-01

Pos logit contributions
izabeth+0.984
+0.949
 awa+0.910
 Definition+0.855
sense+0.847
Neg logit contributions
ertodd-0.674
rive-0.655
ointed-0.654
rived-0.651
ulously-0.651
Token ----------------
Token ID34400
Feature activation+3.27e-01

Pos logit contributions
izabeth+0.439
+0.420
 awa+0.397
 Definition+0.368
sense+0.366
Neg logit contributions
ertodd-0.415
ointed-0.406
rive-0.406
ulously-0.405
rived-0.405
Token on
Token ID319
Feature activation-1.00e-01

Pos logit contributions
ertodd+0.135
rive+0.131
ointed+0.131
rived+0.130
ulously+0.130
Neg logit contributions
izabeth-0.228
-0.220
 awa-0.211
 Definition-0.199
sense-0.197
Token Oct
Token ID2556
Feature activation+2.69e-02

Pos logit contributions
ertodd+0.199
rive+0.192
ointed+0.191
rived+0.190
ulously+0.190
Neg logit contributions
izabeth-0.390
-0.377
 awa-0.363
 Definition-0.343
sense-0.341
Token.
Token ID13
Feature activation+6.16e-02

Pos logit contributions
izabeth+0.091
+0.087
 awa+0.083
 Definition+0.078
sense+0.077
Neg logit contributions
ertodd-0.067
rive-0.066
ointed-0.065
rived-0.065
ulously-0.065
Token 17
Token ID1596
Feature activation+4.63e-01

Pos logit contributions
izabeth+0.211
+0.203
 awa+0.194
sense+0.180
 Definition+0.180
Neg logit contributions
ertodd-0.151
rive-0.148
ointed-0.147
rived-0.147
ulously-0.146
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
izabeth+1.555
+1.492
 awa+1.423
 Definition+1.354
ッド+1.329
Neg logit contributions
ertodd-1.165
rive-1.130
ointed-1.130
ulously-1.127
rived-1.123
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
izabeth+3.055
+3.012
 Definition+2.629
sense+2.627
iq+2.593
Neg logit contributions
ertodd-3.121
ulously-3.014
rived-3.013
rive-2.993
ointed-2.970
TokenBy
Token ID3886
Feature activation+5.31e-01

Pos logit contributions
+3.368
izabeth+3.361
sense+2.980
 awa+2.964
iq+2.931
Neg logit contributions
ertodd-2.643
ointed-2.522
rived-2.506
ulously-2.500
rive-2.458
Token law
Token ID1099
Feature activation-7.24e-02

Pos logit contributions
izabeth+1.891
+1.809
 awa+1.740
 Definition+1.648
sense+1.630
Neg logit contributions
ertodd-1.222
rive-1.190
rived-1.185
ointed-1.173
ulously-1.173
Token,
Token ID11
Feature activation-1.76e-01

Pos logit contributions
ertodd+0.171
rive+0.166
ointed+0.165
rived+0.164
ulously+0.164
Neg logit contributions
izabeth-0.255
-0.245
 awa-0.235
 Definition-0.221
sense-0.219
Token the
Token ID262
Feature activation-6.75e-02

Pos logit contributions
ertodd+0.459
rive+0.447
ointed+0.446
rived+0.445
ulously+0.444
Neg logit contributions
izabeth-0.575
-0.553
 awa-0.527
 Definition-0.492
sense-0.489
Token national
Token ID2260
Feature activation+5.97e-02

Pos logit contributions
ertodd+0.154
rive+0.150
ointed+0.150
rived+0.149
ulously+0.149
Neg logit contributions
izabeth-0.242
-0.234
 awa-0.224
 Definition-0.210
sense-0.209
Token than
Token ID621
Feature activation-3.29e-02

Pos logit contributions
ertodd+0.919
ointed+0.902
rive+0.898
ulously+0.896
rived+0.894
Neg logit contributions
izabeth-1.550
-1.493
 awa-1.429
 Definition-1.344
ッド-1.340
Token through
Token ID832
Feature activation-1.12e-01

Pos logit contributions
ertodd+0.070
rived+0.067
rive+0.067
ulously+0.067
ointed+0.067
Neg logit contributions
izabeth-0.124
-0.120
 awa-0.115
 Definition-0.109
sense-0.108
Token a
Token ID257
Feature activation+2.35e-03

Pos logit contributions
ertodd+0.293
rive+0.284
ointed+0.283
rived+0.283
ulously+0.283
Neg logit contributions
izabeth-0.365
-0.352
 awa-0.335
 Definition-0.314
sense-0.311
Token nice
Token ID3621
Feature activation+1.41e-01

Pos logit contributions
izabeth+0.008
+0.008
 awa+0.007
 Definition+0.007
sense+0.007
Neg logit contributions
ertodd-0.006
rive-0.006
ointed-0.006
ulously-0.006
rived-0.006
Token Public
Token ID5094
Feature activation-2.35e-01

Pos logit contributions
izabeth+0.488
+0.472
 awa+0.451
 Definition+0.424
sense+0.420
Neg logit contributions
ertodd-0.339
rive-0.327
ointed-0.327
rived-0.326
ulously-0.326
Token Service
Token ID4809
Feature activation+1.02e+00

Pos logit contributions
ertodd+0.617
rive+0.603
ointed+0.602
rived+0.600
ulously+0.599
Neg logit contributions
izabeth-0.764
-0.735
 awa-0.700
 Definition-0.652
sense-0.648
Token Announce
Token ID43470
Feature activation+2.57e-01

Pos logit contributions
izabeth+2.538
+2.408
 awa+2.313
 Definition+2.181
sense+2.093
Neg logit contributions
ertodd-3.396
ulously-3.327
rived-3.291
rive-3.286
ointed-3.243
Tokenment
Token ID434
Feature activation+1.74e-01

Pos logit contributions
izabeth+1.051
+1.015
 awa+0.979
 Definition+0.940
sense+0.928
Neg logit contributions
ertodd-0.459
ulously-0.444
ointed-0.437
rive-0.434
rived-0.433
Token?
Token ID30
Feature activation-2.32e-02

Pos logit contributions
izabeth+0.615
+0.597
 awa+0.573
 Definition+0.544
sense+0.535
Neg logit contributions
ertodd-0.405
ulously-0.390
rive-0.386
rived-0.386
ointed-0.385
Token And
Token ID843
Feature activation+1.35e-01

Pos logit contributions
ertodd+0.059
rive+0.058
rived+0.058
ointed+0.058
ulously+0.058
Neg logit contributions
izabeth-0.077
-0.074
 awa-0.071
 Definition-0.066
sense-0.066
Token what
Token ID644
Feature activation-2.95e-01

Pos logit contributions
izabeth+0.454
+0.435
 awa+0.421
 Definition+0.396
sense+0.390
Neg logit contributions
ertodd-0.341
rive-0.328
ointed-0.327
rived-0.326
ulously-0.326
Token is
Token ID318
Feature activation+1.03e-01

Pos logit contributions
ertodd+0.286
rive+0.278
ointed+0.278
ulously+0.277
rived+0.277
Neg logit contributions
izabeth-0.409
-0.393
 awa-0.376
 Definition-0.352
sense-0.350
Token requesting
Token ID20623
Feature activation-1.63e-01

Pos logit contributions
izabeth+0.364
+0.351
 awa+0.336
 Definition+0.315
sense+0.313
Neg logit contributions
ertodd-0.245
rive-0.237
ointed-0.237
rived-0.236
ulously-0.236
Token three
Token ID1115
Feature activation+1.27e-01

Pos logit contributions
ertodd+0.400
rive+0.390
ointed+0.389
rived+0.388
ulously+0.387
Neg logit contributions
izabeth-0.556
-0.535
 awa-0.511
 Definition-0.478
sense-0.477
Token and
Token ID290
Feature activation+2.03e-01

Pos logit contributions
izabeth+0.422
+0.406
 awa+0.388
 Definition+0.362
sense+0.360
Neg logit contributions
ertodd-0.321
rive-0.313
ointed-0.313
rived-0.312
ulously-0.311
Token a
Token ID257
Feature activation+1.25e-01

Pos logit contributions
izabeth+0.631
+0.607
 awa+0.579
 Definition+0.538
sense+0.535
Neg logit contributions
ertodd-0.559
rive-0.540
rived-0.539
ulously-0.539
ointed-0.537
Token half
Token ID2063
Feature activation+1.01e+00

Pos logit contributions
izabeth+0.433
+0.417
 awa+0.393
 Definition+0.372
sense+0.372
Neg logit contributions
ertodd-0.303
rive-0.292
ulously-0.290
rived-0.290
ointed-0.289
Token year
Token ID614
Feature activation-6.85e-02

Pos logit contributions
izabeth+3.417
+3.285
 awa+3.136
 Definition+2.937
sense+2.899
Neg logit contributions
ertodd-2.496
ulously-2.401
rive-2.400
ointed-2.397
rived-2.386
Token sentence
Token ID6827
Feature activation+1.16e-01

Pos logit contributions
ertodd+0.163
rive+0.159
ointed+0.159
rived+0.158
ulously+0.158
Neg logit contributions
izabeth-0.239
-0.230
 awa-0.220
 Definition-0.207
sense-0.205
Token for
Token ID329
Feature activation+9.98e-02

Pos logit contributions
izabeth+0.446
+0.431
 awa+0.414
 Definition+0.392
sense+0.389
Neg logit contributions
ertodd-0.239
rive-0.230
ointed-0.230
ulously-0.229
rived-0.229
Token the
Token ID262
Feature activation-8.37e-02

Pos logit contributions
izabeth+0.327
+0.314
 awa+0.300
 Definition+0.280
sense+0.278
Neg logit contributions
ertodd-0.260
rive-0.253
ointed-0.252
ulously-0.252
rived-0.251
Token blogger
Token ID23530
Feature activation-1.28e-01

Pos logit contributions
ertodd+0.204
rive+0.199
ointed+0.199
ulously+0.198
rived+0.198
Neg logit contributions
izabeth-0.287
-0.276
 awa-0.264
 Definition-0.247
sense-0.246
Token--+
Token ID44785
Feature activation+4.30e-01

Pos logit contributions
izabeth+0.429
+0.416
 awa+0.399
 Definition+0.377
sense+0.374
Neg logit contributions
ertodd-0.227
rive-0.222
ointed-0.220
ulously-0.220
rived-0.220
Token +
Token ID1343
Feature activation+3.42e-01

Pos logit contributions
izabeth+1.137
+1.086
 awa+1.022
 Definition+0.941
sense+0.929
Neg logit contributions
ertodd-1.383
rive-1.358
ointed-1.355
ulously-1.352
rived-1.351
Token--------------------
Token ID19351
Feature activation+2.47e-01

Pos logit contributions
izabeth+1.080
+1.039
 awa+0.989
 Definition+0.922
sense+0.914
Neg logit contributions
ertodd-0.923
rive-0.902
ointed-0.901
ulously-0.898
rived-0.897
Token--+
Token ID44785
Feature activation+5.73e-01

Pos logit contributions
izabeth+0.937
+0.908
 awa+0.871
 Definition+0.825
sense+0.816
Neg logit contributions
ertodd-0.507
rive-0.498
ointed-0.494
rived-0.493
ulously-0.493
Token <
Token ID1279
Feature activation+2.12e-01

Pos logit contributions
izabeth+1.622
+1.559
 awa+1.476
 Definition+1.372
sense+1.350
Neg logit contributions
ertodd-1.725
rive-1.694
ointed-1.689
ulously-1.686
rived-1.684
Token--
Token ID438
Feature activation+1.00e+00

Pos logit contributions
izabeth+0.685
+0.657
 awa+0.627
 Definition+0.583
ッド+0.580
Neg logit contributions
ertodd-0.561
rive-0.547
ointed-0.546
rived-0.545
ulously-0.544
Token Start
Token ID7253
Feature activation+1.91e-01

Pos logit contributions
izabeth+3.271
+3.171
 awa+3.052
 Definition+2.877
sense+2.829
Neg logit contributions
ertodd-2.560
rive-2.481
ointed-2.479
ulously-2.470
rived-2.447
Token of
Token ID286
Feature activation+8.60e-02

Pos logit contributions
izabeth+0.643
+0.621
 awa+0.594
 Definition+0.557
sense+0.551
Neg logit contributions
ertodd-0.476
ointed-0.461
rive-0.460
ulously-0.459
rived-0.459
Token system
Token ID1080
Feature activation+7.15e-02

Pos logit contributions
izabeth+0.289
+0.278
 awa+0.265
 Definition+0.247
sense+0.246
Neg logit contributions
ertodd-0.216
rive-0.211
ointed-0.210
rived-0.210
ulously-0.210
Token()
Token ID3419
Feature activation+2.54e-02

Pos logit contributions
izabeth+0.219
+0.210
 awa+0.199
ッド+0.184
 Definition+0.184
Neg logit contributions
ertodd-0.201
rive-0.197
ointed-0.196
rived-0.196
ulously-0.196
Token'
Token ID6
Feature activation-7.99e-02

Pos logit contributions
izabeth+0.080
+0.076
 awa+0.073
 Definition+0.067
ッド+0.067
Neg logit contributions
ertodd-0.070
rive-0.068
ointed-0.068
rived-0.068
ulously-0.068
Token.
Token ID764
Feature activation+9.10e-02

Pos logit contributions
izabeth+0.644
+0.621
 awa+0.594
 Definition+0.558
sense+0.555
Neg logit contributions
ertodd-0.424
rive-0.412
ointed-0.411
rived-0.409
ulously-0.409
Tokencopy
Token ID30073
Feature activation-7.58e-02

Pos logit contributions
izabeth+0.282
+0.270
 awa+0.256
 Definition+0.238
ッド+0.237
Neg logit contributions
ertodd-0.252
rive-0.248
ointed-0.247
rived-0.247
ulously-0.246
Token_
Token ID62
Feature activation-2.07e-01

Pos logit contributions
ertodd+0.231
rive+0.226
ointed+0.226
ulously+0.225
rived+0.225
Neg logit contributions
izabeth-0.214
-0.205
 awa-0.193
 Definition-0.178
ッド-0.177
Tokenboard
Token ID3526
Feature activation+2.69e-02

Pos logit contributions
ertodd+0.638
rive+0.627
ointed+0.624
ulously+0.623
rived+0.623
Neg logit contributions
izabeth-0.573
-0.548
 awa-0.518
 Definition-0.477
ッド-0.472
Token ();
Token ID13979
Feature activation+3.10e-01

Pos logit contributions
izabeth+0.086
+0.083
 awa+0.078
 Definition+0.073
ッド+0.073
Neg logit contributions
ertodd-0.072
rive-0.070
ointed-0.070
rived-0.070
ulously-0.070
Token //
Token ID3373
Feature activation+1.00e+00

Pos logit contributions
izabeth+1.053
+1.015
 awa+0.974
 Definition+0.911
sense+0.904
Neg logit contributions
ertodd-0.765
rive-0.742
ointed-0.741
rived-0.739
ulously-0.739
Token!!
Token ID37867
Feature activation+4.68e-01

Pos logit contributions
izabeth+3.316
+3.220
 awa+3.128
 Definition+2.967
sense+2.902
Neg logit contributions
ertodd-2.523
ointed-2.412
rived-2.407
rive-2.401
ulously-2.383
Token!!!
Token ID10185
Feature activation+2.65e-01

Pos logit contributions
izabeth+1.532
+1.476
 awa+1.417
 Definition+1.322
sense+1.310
Neg logit contributions
ertodd-1.214
rive-1.176
ointed-1.175
rived-1.170
ulously-1.169
Token self
Token ID2116
Feature activation+1.06e-02

Pos logit contributions
izabeth+0.885
+0.853
 awa+0.816
 Definition+0.763
sense+0.757
Neg logit contributions
ertodd-0.673
rive-0.655
ointed-0.653
rived-0.651
ulously-0.651
Token.
Token ID764
Feature activation-9.51e-02

Pos logit contributions
izabeth+0.038
+0.036
 awa+0.035
 Definition+0.033
ッド+0.033
Neg logit contributions
ertodd-0.025
rive-0.024
ointed-0.024
rived-0.024
ulously-0.024
Tokenprev
Token ID47050
Feature activation-1.13e-01

Pos logit contributions
ertodd+0.258
rive+0.255
ointed+0.254
rived+0.253
ulously+0.253
Neg logit contributions
izabeth-0.299
-0.287
 awa-0.272
 Definition-0.254
ッド-0.252
Token legal
Token ID2742
Feature activation-9.17e-02

Pos logit contributions
ertodd+0.129
rive+0.125
ointed+0.125
rived+0.125
ulously+0.125
Neg logit contributions
izabeth-0.187
-0.181
 awa-0.173
 Definition-0.162
sense-0.161
Token landscape
Token ID10747
Feature activation-2.00e-01

Pos logit contributions
ertodd+0.223
rive+0.219
ointed+0.219
ulously+0.218
rived+0.218
Neg logit contributions
izabeth-0.313
-0.302
 awa-0.289
ッド-0.269
 Definition-0.269
Token.
Token ID13
Feature activation-4.29e-02

Pos logit contributions
ertodd+0.432
rive+0.419
ointed+0.418
rived+0.417
ulously+0.417
Neg logit contributions
izabeth-0.744
-0.719
 awa-0.689
 Definition-0.649
sense-0.645
Token In
Token ID554
Feature activation-5.31e-02

Pos logit contributions
ertodd+0.112
rive+0.110
ointed+0.109
rived+0.109
ulously+0.109
Neg logit contributions
izabeth-0.140
-0.134
 awa-0.128
 Definition-0.119
sense-0.119
Token that
Token ID326
Feature activation+5.37e-02

Pos logit contributions
ertodd+0.145
rive+0.142
ointed+0.142
ulously+0.141
rived+0.141
Neg logit contributions
izabeth-0.167
-0.160
 awa-0.153
 Definition-0.142
sense-0.141
Token sense
Token ID2565
Feature activation+9.83e-01

Pos logit contributions
izabeth+0.164
+0.158
 awa+0.150
 Definition+0.139
sense+0.138
Neg logit contributions
ertodd-0.151
rive-0.147
ointed-0.147
rived-0.147
ulously-0.147
Token,
Token ID11
Feature activation-2.75e-02

Pos logit contributions
izabeth+3.161
+3.060
 awa+2.917
 Definition+2.772
ッド+2.725
Neg logit contributions
ertodd-2.529
rive-2.478
rived-2.461
ointed-2.417
ulously-2.408
Token ap
Token ID2471
Feature activation+5.98e-02

Pos logit contributions
ertodd+0.069
rive+0.067
rived+0.067
ointed+0.067
ulously+0.067
Neg logit contributions
izabeth-0.092
-0.089
 awa-0.085
 Definition-0.080
sense-0.079
Tokenople
Token ID643
Feature activation-3.78e-01

Pos logit contributions
izabeth+0.191
+0.184
 awa+0.175
 Definition+0.163
sense+0.162
Neg logit contributions
ertodd-0.160
ointed-0.156
rive-0.156
rived-0.156
ulously-0.156
Tokenctic
Token ID11048
Feature activation-3.15e-01

Pos logit contributions
ertodd+0.993
rive+0.967
ointed+0.966
ulously+0.962
rived+0.962
Neg logit contributions
izabeth-1.226
-1.177
 awa-1.122
 Definition-1.048
sense-1.040
Token opponents
Token ID7691
Feature activation-3.33e-01

Pos logit contributions
ertodd+0.832
ointed+0.823
rive+0.820
ulously+0.819
rived+0.815
Neg logit contributions
izabeth-1.008
-0.974
 awa-0.923
ッド-0.860
 Definition-0.859
Token not
Token ID407
Feature activation+6.29e-02

Pos logit contributions
ertodd+0.219
rive+0.213
ointed+0.213
ulously+0.212
rived+0.212
Neg logit contributions
izabeth-0.327
-0.315
 awa-0.301
 Definition-0.283
sense-0.281
Token optimistic
Token ID16915
Feature activation-3.01e-01

Pos logit contributions
izabeth+0.213
+0.205
 awa+0.196
 Definition+0.184
sense+0.182
Neg logit contributions
ertodd-0.156
rive-0.152
ointed-0.152
rived-0.151
ulously-0.151
Token given
Token ID1813
Feature activation-2.67e-01

Pos logit contributions
ertodd+0.739
rive+0.715
ointed+0.712
rived+0.711
ulously+0.710
Neg logit contributions
izabeth-1.031
-0.993
 awa-0.951
 Definition-0.891
sense-0.886
Token his
Token ID465
Feature activation+3.07e-01

Pos logit contributions
ertodd+0.727
rive+0.708
ointed+0.707
rived+0.705
ulously+0.704
Neg logit contributions
izabeth-0.841
-0.807
 awa-0.769
 Definition-0.715
sense-0.710
Token track
Token ID2610
Feature activation-1.33e-01

Pos logit contributions
izabeth+1.026
+0.986
 awa+0.949
 Definition+0.886
sense+0.879
Neg logit contributions
ertodd-0.781
rive-0.752
rived-0.749
ointed-0.748
ulously-0.745
Token record
Token ID1700
Feature activation+9.80e-01

Pos logit contributions
ertodd+0.282
ointed+0.270
ulously+0.270
rived+0.269
rive+0.269
Neg logit contributions
izabeth-0.503
-0.485
 awa-0.466
 Definition-0.441
sense-0.441
Token of
Token ID286
Feature activation+5.84e-02

Pos logit contributions
izabeth+3.488
+3.359
 awa+3.275
sense+3.082
afety+3.055
Neg logit contributions
ertodd-2.204
rive-2.056
ulously-2.036
rived-2.035
ointed-1.976
Token late
Token ID2739
Feature activation+1.02e-01

Pos logit contributions
izabeth+0.201
+0.193
 awa+0.185
 Definition+0.173
sense+0.172
Neg logit contributions
ertodd-0.143
rive-0.138
ointed-0.138
rived-0.137
ulously-0.137
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
izabeth+0.353
+0.340
 awa+0.325
 Definition+0.306
sense+0.305
Neg logit contributions
ertodd-0.247
ointed-0.236
rive-0.235
ulously-0.235
rived-0.235
Token The
Token ID383
Feature activation-1.38e-01

Pos logit contributions
ertodd+0.290
rive+0.282
ointed+0.281
rived+0.281
ulously+0.281
Neg logit contributions
izabeth-0.356
-0.342
 awa-0.326
 Definition-0.305
sense-0.303
Token Jaguars
Token ID28392
Feature activation+1.86e-01

Pos logit contributions
ertodd+0.357
rive+0.350
ointed+0.350
rived+0.348
ulously+0.348
Neg logit contributions
izabeth-0.450
-0.433
 awa-0.412
 Definition-0.384
sense-0.382
Token outraged
Token ID27697
Feature activation+2.48e-01

Pos logit contributions
izabeth+0.105
+0.102
 awa+0.097
 Definition+0.091
sense+0.091
Neg logit contributions
ertodd-0.072
rive-0.070
ointed-0.070
rived-0.070
ulously-0.070
Token at
Token ID379
Feature activation+8.41e-02

Pos logit contributions
izabeth+0.892
+0.863
 awa+0.823
 Definition+0.774
sense+0.767
Neg logit contributions
ertodd-0.572
rive-0.549
ointed-0.547
rived-0.545
ulously-0.543
Token these
Token ID777
Feature activation+4.81e-02

Pos logit contributions
izabeth+0.269
+0.259
 awa+0.246
 Definition+0.230
sense+0.228
Neg logit contributions
ertodd-0.225
rive-0.219
ointed-0.219
rived-0.218
ulously-0.218
Token so
Token ID523
Feature activation+3.45e-02

Pos logit contributions
izabeth+0.156
+0.150
 awa+0.143
 Definition+0.134
sense+0.133
Neg logit contributions
ertodd-0.126
rive-0.123
ointed-0.122
rived-0.122
ulously-0.122
Token-
Token ID12
Feature activation-1.80e-03

Pos logit contributions
izabeth+0.115
+0.112
 awa+0.107
 Definition+0.099
ッド+0.099
Neg logit contributions
ertodd-0.087
rive-0.085
ointed-0.084
rived-0.084
ulously-0.084
Tokencalled
Token ID7174
Feature activation+9.66e-01

Pos logit contributions
ertodd+0.007
rive+0.007
ointed+0.007
ulously+0.007
rived+0.007
Neg logit contributions
izabeth-0.004
-0.003
 awa-0.003
 Definition-0.003
ッド-0.003
Token supporters
Token ID5941
Feature activation-8.75e-02

Pos logit contributions
izabeth+3.149
+3.029
 awa+2.885
 Definition+2.736
sense+2.675
Neg logit contributions
ertodd-2.552
rive-2.413
ointed-2.391
ulously-2.384
rived-2.380
Token of
Token ID286
Feature activation+1.57e-01

Pos logit contributions
ertodd+0.195
rive+0.189
ointed+0.189
rived+0.188
ulously+0.187
Neg logit contributions
izabeth-0.319
-0.308
 awa-0.296
 Definition-0.279
sense-0.277
Token Bernie
Token ID10477
Feature activation-8.08e-02

Pos logit contributions
izabeth+0.485
+0.466
 awa+0.443
 Definition+0.411
sense+0.408
Neg logit contributions
ertodd-0.441
rive-0.429
ointed-0.428
rived-0.427
ulously-0.427
Token.
Token ID13
Feature activation-1.69e-01

Pos logit contributions
ertodd+0.198
rive+0.192
ointed+0.192
rived+0.192
ulously+0.191
Neg logit contributions
izabeth-0.276
-0.267
 awa-0.255
 Definition-0.239
sense-0.237
Token
Token ID198
Feature activation-7.60e-02

Pos logit contributions
ertodd+0.419
ointed+0.410
rive+0.408
ulously+0.406
rived+0.406
Neg logit contributions
izabeth-0.568
-0.546
 awa-0.524
sense-0.485
ッド-0.484
Token point
Token ID966
Feature activation-3.07e-02

Pos logit contributions
ertodd+0.200
rive+0.196
ointed+0.196
ulously+0.195
rived+0.195
Neg logit contributions
izabeth-0.184
-0.176
 awa-0.166
 Definition-0.153
ッド-0.152
Token.
Token ID13
Feature activation-6.78e-02

Pos logit contributions
ertodd+0.069
rive+0.067
ointed+0.066
rived+0.066
ulously+0.066
Neg logit contributions
izabeth-0.112
-0.108
 awa-0.104
 Definition-0.098
sense-0.097
Token
Token ID564
Feature activation-3.44e-02

Pos logit contributions
ertodd+0.177
rive+0.173
ointed+0.172
rived+0.172
ulously+0.172
Neg logit contributions
izabeth-0.221
-0.213
 awa-0.202
 Definition-0.190
sense-0.188
Token
Token ID250
Feature activation-2.55e-02

Pos logit contributions
ertodd+0.081
ointed+0.080
ulously+0.079
rive+0.079
rived+0.079
Neg logit contributions
izabeth-0.120
-0.116
 awa-0.111
 Definition-0.104
ッド-0.104
TokenLiquid
Token ID41966
Feature activation-4.98e-02

Pos logit contributions
ertodd+0.062
rive+0.060
ointed+0.060
rived+0.060
ulously+0.060
Neg logit contributions
izabeth-0.088
-0.085
 awa-0.081
 Definition-0.076
sense-0.076
Tokenity
Token ID414
Feature activation+9.65e-01

Pos logit contributions
ertodd+0.165
rive+0.162
ulously+0.161
ointed+0.161
rived+0.161
Neg logit contributions
izabeth-0.128
-0.121
 awa-0.115
 Definition-0.104
ッド-0.103
Token,
Token ID11
Feature activation+3.07e-02

Pos logit contributions
izabeth+3.267
+3.184
 awa+3.029
 Definition+2.906
sense+2.856
Neg logit contributions
ertodd-2.308
ointed-2.271
rive-2.265
rived-2.247
ulously-2.227
Token merchants
Token ID21779
Feature activation+6.66e-02

Pos logit contributions
izabeth+0.101
+0.097
 awa+0.092
 Definition+0.086
sense+0.086
Neg logit contributions
ertodd-0.080
rive-0.078
ointed-0.078
rived-0.077
ulously-0.077
Token,
Token ID11
Feature activation+1.05e-01

Pos logit contributions
izabeth+0.236
+0.227
 awa+0.217
 Definition+0.204
ッド+0.203
Neg logit contributions
ertodd-0.156
rive-0.152
ointed-0.151
rived-0.151
ulously-0.150
Token and
Token ID290
Feature activation+2.29e-01

Pos logit contributions
izabeth+0.340
+0.326
 awa+0.311
 Definition+0.290
sense+0.288
Neg logit contributions
ertodd-0.274
rive-0.268
ointed-0.267
ulously-0.266
rived-0.266
Token user
Token ID2836
Feature activation-7.34e-02

Pos logit contributions
izabeth+0.705
+0.676
 awa+0.643
 Definition+0.597
sense+0.592
Neg logit contributions
ertodd-0.642
rive-0.627
ointed-0.626
rived-0.625
ulously-0.624
Token u
Token ID334
Feature activation+9.24e-02

Pos logit contributions
izabeth+0.849
+0.820
 awa+0.781
 Definition+0.733
sense+0.731
Neg logit contributions
ertodd-0.629
rive-0.609
ointed-0.609
rived-0.607
ulously-0.606
TokenPart
Token ID7841
Feature activation+3.74e-01

Pos logit contributions
izabeth+0.267
+0.255
 awa+0.241
 Definition+0.223
sense+0.222
Neg logit contributions
ertodd-0.276
rive-0.269
ointed-0.269
rived-0.269
ulously-0.268
Tokenicle
Token ID1548
Feature activation-1.12e-02

Pos logit contributions
izabeth+1.114
+1.063
 awa+0.972
 Definition+0.938
sense+0.936
Neg logit contributions
ertodd-1.077
ulously-1.052
ointed-1.049
rived-1.049
rive-1.043
Token )
Token ID1267
Feature activation-1.40e-01

Pos logit contributions
ertodd+0.028
rive+0.027
ointed+0.027
ulously+0.027
rived+0.027
Neg logit contributions
izabeth-0.038
-0.037
 awa-0.035
 Definition-0.033
sense-0.033
Token {
Token ID1391
Feature activation+3.00e-01

Pos logit contributions
ertodd+0.449
ointed+0.438
ulously+0.438
rive+0.438
rived+0.437
Neg logit contributions
izabeth-0.376
-0.359
 awa-0.337
 Definition-0.313
sense-0.309
Token //
Token ID3373
Feature activation+9.57e-01

Pos logit contributions
izabeth+0.916
+0.890
 awa+0.845
 Definition+0.804
sense+0.787
Neg logit contributions
ertodd-0.840
ulously-0.812
rived-0.805
ointed-0.804
rive-0.799
Token For
Token ID1114
Feature activation+4.06e-02

Pos logit contributions
izabeth+3.227
+3.150
 awa+3.002
 Definition+2.919
sense+2.842
Neg logit contributions
ertodd-2.355
ointed-2.243
rived-2.228
ulously-2.222
rive-2.192
Token each
Token ID1123
Feature activation-3.06e-01

Pos logit contributions
izabeth+0.123
+0.118
 awa+0.112
 Definition+0.104
sense+0.104
Neg logit contributions
ertodd-0.115
rive-0.112
ointed-0.112
rived-0.112
ulously-0.112
Token particle
Token ID18758
Feature activation+3.75e-01

Pos logit contributions
ertodd+0.772
rive+0.756
ointed+0.754
rived+0.751
ulously+0.750
Neg logit contributions
izabeth-1.023
-0.984
 awa-0.940
 Definition-0.877
sense-0.872
Token in
Token ID287
Feature activation+1.42e-03

Pos logit contributions
izabeth+1.308
+1.268
 awa+1.213
 Definition+1.145
sense+1.132
Neg logit contributions
ertodd-0.898
ointed-0.860
rived-0.858
ulously-0.856
rive-0.856
Token the
Token ID262
Feature activation+6.24e-02

Pos logit contributions
izabeth+0.005
+0.004
 awa+0.004
 Definition+0.004
sense+0.004
Neg logit contributions
ertodd-0.004
ointed-0.004
rived-0.004
rive-0.004
ulously-0.004
Token out
Token ID503
Feature activation+2.50e-01

Pos logit contributions
ertodd+0.405
rive+0.390
ointed+0.389
rived+0.389
ulously+0.388
Neg logit contributions
izabeth-0.710
-0.687
 awa-0.663
 Definition-0.621
sense-0.620
Token,
Token ID11
Feature activation-5.07e-02

Pos logit contributions
izabeth+0.931
+0.900
 awa+0.867
 Definition+0.820
sense+0.815
Neg logit contributions
ertodd-0.527
rived-0.509
ointed-0.509
rive-0.507
ulously-0.502
Token and
Token ID290
Feature activation-8.49e-02

Pos logit contributions
ertodd+0.130
ointed+0.127
rive+0.127
rived+0.126
ulously+0.126
Neg logit contributions
izabeth-0.168
-0.161
 awa-0.154
 Definition-0.143
sense-0.143
Token in
Token ID287
Feature activation-6.77e-02

Pos logit contributions
ertodd+0.245
ointed+0.239
rive+0.239
rived+0.238
ulously+0.238
Neg logit contributions
izabeth-0.254
-0.243
 awa-0.230
 Definition-0.213
ッド-0.212
Token what
Token ID644
Feature activation-3.47e-01

Pos logit contributions
ertodd+0.201
ointed+0.196
rive+0.196
rived+0.195
ulously+0.195
Neg logit contributions
izabeth-0.198
-0.189
 awa-0.179
 Definition-0.165
ッド-0.164
Token order
Token ID1502
Feature activation+9.56e-01

Pos logit contributions
ertodd+0.991
rive+0.977
ointed+0.977
rived+0.975
ulously+0.969
Neg logit contributions
izabeth-1.037
-0.996
 awa-0.947
 Definition-0.870
ッド-0.870
Token the
Token ID262
Feature activation-9.53e-02

Pos logit contributions
izabeth+2.995
+2.913
 awa+2.765
 Definition+2.617
sense+2.588
Neg logit contributions
ertodd-2.522
ulously-2.440
ointed-2.432
rive-2.429
rived-2.427
Token neurons
Token ID16890
Feature activation-1.80e-01

Pos logit contributions
ertodd+0.235
rive+0.229
ointed+0.228
ulously+0.228
rived+0.228
Neg logit contributions
izabeth-0.324
-0.312
 awa-0.298
 Definition-0.279
sense-0.278
Token activated
Token ID13906
Feature activation-1.32e-01

Pos logit contributions
ertodd+0.413
rive+0.401
ointed+0.400
rived+0.399
ulously+0.398
Neg logit contributions
izabeth-0.644
-0.621
 awa-0.595
 Definition-0.559
sense-0.555
Token.
Token ID13
Feature activation-1.49e-01

Pos logit contributions
ertodd+0.313
rive+0.304
ointed+0.303
rived+0.302
ulously+0.302
Neg logit contributions
izabeth-0.460
-0.443
 awa-0.424
 Definition-0.398
sense-0.395
Token
Token ID198
Feature activation-1.22e-01

Pos logit contributions
ertodd+0.370
rive+0.360
ointed+0.359
rived+0.359
ulously+0.359
Neg logit contributions
izabeth-0.507
-0.488
 awa-0.466
 Definition-0.438
sense-0.434
Token Jordan
Token ID8078
Feature activation+2.11e-01

Pos logit contributions
izabeth+1.011
+0.970
 awa+0.934
 Definition+0.874
sense+0.862
Neg logit contributions
ertodd-0.789
ointed-0.759
rived-0.759
rive-0.759
ulously-0.759
Token Kal
Token ID12612
Feature activation+3.70e-01

Pos logit contributions
izabeth+0.720
+0.692
 awa+0.662
 Definition+0.620
sense+0.615
Neg logit contributions
ertodd-0.522
ointed-0.504
ulously-0.503
rive-0.503
rived-0.502
Tokenish
Token ID680
Feature activation+4.35e-01

Pos logit contributions
izabeth+1.262
+1.216
 awa+1.158
sense+1.088
 Definition+1.084
Neg logit contributions
ertodd-0.913
ointed-0.880
ulously-0.873
rived-0.871
rive-0.866
Token
Token ID198
Feature activation+3.79e-01

Pos logit contributions
izabeth+1.419
+1.364
 awa+1.305
 Definition+1.218
iq+1.207
Neg logit contributions
ertodd-1.140
ointed-1.098
ulously-1.094
rived-1.089
rive-1.087
Token
Token ID198
Feature activation+3.39e-01

Pos logit contributions
izabeth+1.207
+1.171
 awa+1.065
 Definition+1.037
sense+1.036
Neg logit contributions
ertodd-1.020
rived-0.989
ulously-0.989
rive-0.987
ointed-0.986
TokenSurv
Token ID34652
Feature activation+9.56e-01

Pos logit contributions
izabeth+1.172
+1.143
 awa+1.073
sense+1.021
 Definition+1.016
Neg logit contributions
ertodd-0.825
ointed-0.795
ulously-0.791
rived-0.790
rive-0.787
Tokeniv
Token ID452
Feature activation+1.83e-01

Pos logit contributions
izabeth+2.219
+2.074
 awa+1.912
iq+1.786
sense+1.769
Neg logit contributions
ertodd-3.409
ointed-3.298
rived-3.245
rive-3.234
ulously-3.221
Tokenor
Token ID273
Feature activation+2.45e-01

Pos logit contributions
izabeth+0.571
+0.547
 awa+0.518
 Definition+0.487
sense+0.486
Neg logit contributions
ertodd-0.503
ulously-0.486
ointed-0.485
rived-0.482
rive-0.481
Token
Token ID447
Feature activation+1.50e-01

Pos logit contributions
izabeth+0.813
+0.784
 awa+0.752
 Definition+0.707
sense+0.699
Neg logit contributions
ertodd-0.626
rive-0.603
ulously-0.601
ointed-0.601
rived-0.600
Token
Token ID247
Feature activation+1.22e-01

Pos logit contributions
izabeth+0.443
+0.425
 awa+0.401
 Definition+0.373
ッド+0.371
Neg logit contributions
ertodd-0.441
ulously-0.427
rive-0.427
ointed-0.427
rived-0.425
Tokens
Token ID82
Feature activation+1.78e-01

Pos logit contributions
izabeth+0.437
+0.423
 awa+0.406
 Definition+0.383
sense+0.381
Neg logit contributions
ertodd-0.277
ointed-0.265
ulously-0.265
rive-0.265
rived-0.264
Token the
Token ID262
Feature activation+8.88e-02

Pos logit contributions
izabeth+0.151
+0.145
 awa+0.138
 Definition+0.128
sense+0.128
Neg logit contributions
ertodd-0.122
rive-0.120
ointed-0.119
rived-0.119
ulously-0.119
Token game
Token ID983
Feature activation+2.72e-02

Pos logit contributions
izabeth+0.336
+0.325
 awa+0.311
 Definition+0.293
sense+0.292
Neg logit contributions
ertodd-0.185
rive-0.180
ointed-0.180
rived-0.179
ulously-0.179
Token in
Token ID287
Feature activation-9.31e-03

Pos logit contributions
izabeth+0.097
+0.094
 awa+0.090
 Definition+0.085
sense+0.084
Neg logit contributions
ertodd-0.062
rive-0.061
ointed-0.061
rived-0.060
ulously-0.060
Token as
Token ID355
Feature activation-2.18e-01

Pos logit contributions
ertodd+0.024
rive+0.024
ointed+0.024
rived+0.024
ulously+0.024
Neg logit contributions
izabeth-0.030
-0.029
 awa-0.028
 Definition-0.026
sense-0.026
Token little
Token ID1310
Feature activation-2.50e-01

Pos logit contributions
ertodd+0.534
rive+0.528
ulously+0.526
ointed+0.525
rived+0.524
Neg logit contributions
izabeth-0.743
-0.712
 awa-0.680
 Definition-0.636
ッド-0.633
Token as
Token ID355
Feature activation+9.55e-01

Pos logit contributions
ertodd+0.497
ointed+0.478
rive+0.478
ulously+0.477
rived+0.476
Neg logit contributions
izabeth-0.970
-0.940
 awa-0.904
 Definition-0.854
sense-0.850
Token 2
Token ID362
Feature activation+3.06e-01

Pos logit contributions
izabeth+2.564
+2.436
 awa+2.329
 Definition+2.181
sense+2.144
Neg logit contributions
ertodd-3.001
rived-2.886
rive-2.880
ulously-2.867
ointed-2.859
Token weeks
Token ID2745
Feature activation+2.92e-01

Pos logit contributions
izabeth+1.087
+1.050
 awa+1.004
 Definition+0.944
sense+0.940
Neg logit contributions
ertodd-0.712
rive-0.688
ointed-0.687
rived-0.684
ulously-0.683
Token after
Token ID706
Feature activation-1.27e-01

Pos logit contributions
izabeth+1.011
+0.968
 awa+0.929
 Definition+0.875
sense+0.871
Neg logit contributions
ertodd-0.710
rive-0.680
rived-0.678
ointed-0.677
ulously-0.677
Token the
Token ID262
Feature activation-2.22e-02

Pos logit contributions
ertodd+0.329
rive+0.322
ointed+0.322
rived+0.321
ulously+0.320
Neg logit contributions
izabeth-0.416
-0.400
 awa-0.381
 Definition-0.355
ッド-0.354
Token campaign
Token ID1923
Feature activation+3.98e-02

Pos logit contributions
ertodd+0.047
rive+0.046
ointed+0.046
rived+0.046
ulously+0.046
Neg logit contributions
izabeth-0.083
-0.080
 awa-0.077
 Definition-0.072
ッド-0.072
Token more
Token ID517
Feature activation+4.15e-01

Pos logit contributions
ertodd+0.460
rive+0.452
ointed+0.452
ulously+0.449
rived+0.449
Neg logit contributions
izabeth-0.607
-0.584
 awa-0.558
 Definition-0.521
sense-0.517
Token than
Token ID621
Feature activation-3.73e-02

Pos logit contributions
izabeth+1.446
+1.393
 awa+1.335
 Definition+1.251
sense+1.243
Neg logit contributions
ertodd-0.994
rive-0.959
ointed-0.955
rived-0.954
ulously-0.954
Token two
Token ID734
Feature activation+2.40e-01

Pos logit contributions
ertodd+0.091
rive+0.089
ointed+0.089
rived+0.089
ulously+0.089
Neg logit contributions
izabeth-0.127
-0.123
 awa-0.117
 Definition-0.110
sense-0.109
Token and
Token ID290
Feature activation+1.45e-01

Pos logit contributions
izabeth+0.805
+0.775
 awa+0.741
 Definition+0.693
sense+0.687
Neg logit contributions
ertodd-0.606
rive-0.587
ointed-0.586
rived-0.584
ulously-0.584
Token a
Token ID257
Feature activation+4.29e-02

Pos logit contributions
izabeth+0.490
+0.472
 awa+0.451
 Definition+0.423
sense+0.419
Neg logit contributions
ertodd-0.362
rive-0.352
ointed-0.350
ulously-0.350
rived-0.350
Token half
Token ID2063
Feature activation+9.49e-01

Pos logit contributions
izabeth+0.153
+0.147
 awa+0.141
 Definition+0.132
sense+0.132
Neg logit contributions
ertodd-0.100
rive-0.097
ointed-0.096
ulously-0.096
rived-0.096
Token,
Token ID11
Feature activation+2.25e-01

Pos logit contributions
izabeth+3.154
+3.021
 awa+2.897
 Definition+2.720
sense+2.687
Neg logit contributions
ertodd-2.414
ulously-2.308
ointed-2.302
rived-2.296
rive-2.295
Token because
Token ID780
Feature activation-5.35e-02

Pos logit contributions
izabeth+0.783
+0.754
 awa+0.721
 Definition+0.678
sense+0.673
Neg logit contributions
ertodd-0.543
rive-0.525
ulously-0.524
ointed-0.524
rived-0.523
Token we
Token ID356
Feature activation+4.25e-02

Pos logit contributions
ertodd+0.127
rive+0.123
ointed+0.122
rived+0.122
ulously+0.122
Neg logit contributions
izabeth-0.188
-0.181
 awa-0.173
 Definition-0.163
sense-0.162
Token want
Token ID765
Feature activation-2.12e-02

Pos logit contributions
izabeth+0.164
+0.158
 awa+0.152
 Definition+0.144
sense+0.143
Neg logit contributions
ertodd-0.086
rive-0.083
ointed-0.083
ulously-0.083
rived-0.082
Token to
Token ID284
Feature activation+6.68e-02

Pos logit contributions
ertodd+0.042
rive+0.040
ointed+0.040
rived+0.040
ulously+0.040
Neg logit contributions
izabeth-0.083
-0.080
 awa-0.077
 Definition-0.073
sense-0.073
Token
Token ID198
Feature activation-7.93e-02

Pos logit contributions
ertodd+0.158
rive+0.154
ointed+0.154
ulously+0.153
rived+0.153
Neg logit contributions
izabeth-0.203
-0.196
 awa-0.184
 Definition-0.174
sense-0.173
Tokenpur
Token ID14225
Feature activation+8.85e-02

Pos logit contributions
ertodd+0.197
ointed+0.191
rive+0.190
ulously+0.190
rived+0.190
Neg logit contributions
izabeth-0.269
-0.261
 awa-0.247
 Definition-0.232
sense-0.232
Tokenak
Token ID461
Feature activation-1.89e-01

Pos logit contributions
izabeth+0.321
+0.310
 awa+0.297
 Definition+0.279
sense+0.278
Neg logit contributions
ertodd-0.198
ointed-0.192
rive-0.191
ulously-0.191
rived-0.191
Tokenushi
Token ID17731
Feature activation+2.11e-01

Pos logit contributions
ertodd+0.487
rive+0.474
ointed+0.473
rived+0.472
ulously+0.472
Neg logit contributions
izabeth-0.625
-0.601
 awa-0.574
 Definition-0.536
sense-0.533
Token Profile
Token ID13118
Feature activation-3.23e-01

Pos logit contributions
izabeth+0.721
+0.694
 awa+0.665
 Definition+0.625
sense+0.620
Neg logit contributions
ertodd-0.521
rive-0.504
rived-0.502
ointed-0.502
ulously-0.502
Token Joined
Token ID16709
Feature activation-1.52e+00

Pos logit contributions
ertodd+0.550
rive+0.529
ointed+0.527
rived+0.525
ulously+0.524
Neg logit contributions
izabeth-1.347
-1.307
 awa-1.260
 Definition-1.196
sense-1.189
Token August
Token ID2932
Feature activation-1.92e-01

Pos logit contributions
ointed+2.283
ertodd+2.241
rive+2.226
ulously+2.223
 Hollow+2.216
Neg logit contributions
izabeth-6.491
-6.180
 awa-6.115
ッド-5.754
afety-5.700
Token 2012
Token ID2321
Feature activation-4.57e-01

Pos logit contributions
ertodd+0.336
rive+0.323
ointed+0.322
rived+0.321
ulously+0.320
Neg logit contributions
izabeth-0.789
-0.765
 awa-0.737
 Definition-0.699
sense-0.695
Token United
Token ID1578
Feature activation-3.26e-01

Pos logit contributions
ertodd+1.181
ointed+1.180
rive+1.166
ulously+1.158
rived+1.156
Neg logit contributions
izabeth-1.471
-1.406
 awa-1.358
ッド-1.245
afety-1.245
Token States
Token ID1829
Feature activation-3.26e-01

Pos logit contributions
ertodd+1.099
rive+1.077
ointed+1.075
rived+1.072
ulously+1.071
Neg logit contributions
izabeth-0.815
-0.776
 awa-0.727
 Definition-0.661
ッド-0.657
Token 3
Token ID513
Feature activation-4.09e-01

Pos logit contributions
ertodd+0.922
ointed+0.904
rive+0.901
rived+0.897
ulously+0.897
Neg logit contributions
izabeth-0.987
-0.945
 awa-0.901
ッド-0.827
sense-0.826
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
izabeth+0.765
+0.736
 awa+0.701
 Definition+0.661
sense+0.652
Neg logit contributions
ertodd-0.611
rive-0.594
ulously-0.593
ointed-0.592
rived-0.591
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
izabeth+0.693
+0.672
 awa+0.607
 Definition+0.595
sense+0.593
Neg logit contributions
ertodd-0.601
ulously-0.582
rived-0.580
rive-0.580
ointed-0.577
TokenKo
Token ID48735
Feature activation+4.47e-02

Pos logit contributions
izabeth+0.437
+0.424
 awa+0.399
sense+0.379
 Definition+0.376
Neg logit contributions
ertodd-0.308
ointed-0.298
ulously-0.296
rived-0.296
rive-0.296
TokenDo
Token ID5211
Feature activation+6.01e-02

Pos logit contributions
izabeth+0.150
+0.144
 awa+0.138
 Definition+0.129
sense+0.128
Neg logit contributions
ertodd-0.113
rive-0.110
ointed-0.110
rived-0.109
ulously-0.109
Token Profile
Token ID13118
Feature activation-2.36e-01

Pos logit contributions
izabeth+0.204
+0.197
 awa+0.188
 Definition+0.176
sense+0.175
Neg logit contributions
ertodd-0.149
rive-0.145
ointed-0.144
ulously-0.144
rived-0.144
Token Joined
Token ID16709
Feature activation-1.35e+00

Pos logit contributions
ertodd+0.435
rive+0.420
ointed+0.417
rived+0.416
ulously+0.416
Neg logit contributions
izabeth-0.951
-0.921
 awa-0.887
 Definition-0.841
sense-0.835
Token December
Token ID3426
Feature activation-2.07e-01

Pos logit contributions
ointed+2.011
ertodd+1.964
 Hollow+1.964
ulously+1.952
rive+1.948
Neg logit contributions
izabeth-5.791
-5.538
 awa-5.487
ッド-5.147
afety-5.112
Token 2010
Token ID3050
Feature activation-3.24e-01

Pos logit contributions
ertodd+0.354
ointed+0.342
rive+0.342
ulously+0.339
rived+0.339
Neg logit contributions
izabeth-0.860
-0.834
 awa-0.805
 Definition-0.760
sense-0.759
Token United
Token ID1578
Feature activation-9.51e-02

Pos logit contributions
ertodd+0.857
ointed+0.850
rive+0.843
ulously+0.839
rived+0.839
Neg logit contributions
izabeth-1.033
-0.989
 awa-0.948
afety-0.871
sense-0.871
Token States
Token ID1829
Feature activation-2.01e-01

Pos logit contributions
ertodd+0.315
rive+0.309
ointed+0.308
rived+0.307
ulously+0.307
Neg logit contributions
izabeth-0.243
-0.231
 awa-0.219
 Definition-0.197
ッド-0.197
Token 6
Token ID718
Feature activation-2.55e-01

Pos logit contributions
ertodd+0.562
ointed+0.551
rive+0.550
ulously+0.547
rived+0.547
Neg logit contributions
izabeth-0.616
-0.590
 awa-0.562
 Definition-0.517
ッド-0.517
TokenTB
Token ID22737
Feature activation+1.84e-01

Pos logit contributions
ertodd+0.115
ointed+0.111
rive+0.111
ulously+0.111
rived+0.111
Neg logit contributions
izabeth-0.161
-0.156
 awa-0.148
sense-0.139
 Definition-0.139
Tokenone
Token ID505
Feature activation-3.51e-01

Pos logit contributions
izabeth+0.619
+0.596
 awa+0.567
 Definition+0.533
sense+0.532
Neg logit contributions
ertodd-0.460
ointed-0.445
rive-0.445
rived-0.444
ulously-0.443
Token-
Token ID12
Feature activation+2.36e-01

Pos logit contributions
ertodd+0.848
rive+0.828
ointed+0.826
rived+0.823
ulously+0.820
Neg logit contributions
izabeth-1.213
-1.168
 awa-1.118
 Definition-1.045
ッド-1.038
Token Profile
Token ID13118
Feature activation-2.00e-01

Pos logit contributions
izabeth+0.816
+0.789
 awa+0.748
sense+0.712
 Definition+0.707
Neg logit contributions
ertodd-0.571
ointed-0.550
rive-0.550
rived-0.549
ulously-0.549
Token Blog
Token ID14001
Feature activation-7.17e-01

Pos logit contributions
ertodd+0.371
rive+0.356
rived+0.353
ulously+0.353
ointed+0.352
Neg logit contributions
izabeth-0.802
-0.777
 awa-0.749
 Definition-0.712
sense-0.705
Token Joined
Token ID16709
Feature activation-1.32e+00

Pos logit contributions
ertodd+1.052
ointed+1.027
rive+1.020
rived+1.014
ulously+1.008
Neg logit contributions
izabeth-3.151
-3.038
 awa-2.969
ッド-2.789
 Definition-2.787
Token November
Token ID3389
Feature activation-2.47e-01

Pos logit contributions
ointed+1.981
 Hollow+1.937
rive+1.932
ertodd+1.926
ulously+1.917
Neg logit contributions
izabeth-5.612
-5.338
 awa-5.283
ッド-4.963
afety-4.943
Token 2010
Token ID3050
Feature activation-3.73e-01

Pos logit contributions
ertodd+0.416
ointed+0.405
rive+0.403
ulously+0.401
rived+0.400
Neg logit contributions
izabeth-1.030
-0.995
 awa-0.963
sense-0.906
ッド-0.905
Token United
Token ID1578
Feature activation-1.26e-01

Pos logit contributions
ertodd+0.948
ointed+0.939
rive+0.932
ulously+0.927
rived+0.926
Neg logit contributions
izabeth-1.224
-1.173
 awa-1.129
sense-1.040
afety-1.040
Token States
Token ID1829
Feature activation-2.19e-01

Pos logit contributions
ertodd+0.429
rive+0.421
ointed+0.421
ulously+0.420
rived+0.419
Neg logit contributions
izabeth-0.311
-0.295
 awa-0.277
 Definition-0.252
sense-0.249
Token 21
Token ID2310
Feature activation-3.73e-01

Pos logit contributions
ertodd+0.619
ointed+0.607
rive+0.606
ulously+0.604
rived+0.603
Neg logit contributions
izabeth-0.666
-0.638
 awa-0.608
 Definition-0.559
ッド-0.558
Token
Token ID198
Feature activation+2.57e-01

Pos logit contributions
izabeth+0.881
+0.855
 awa+0.769
 Definition+0.758
sense+0.755
Neg logit contributions
ertodd-0.769
ulously-0.744
rived-0.743
rive-0.742
ointed-0.737
Tokenheart
Token ID11499
Feature activation-8.44e-02

Pos logit contributions
izabeth+0.889
+0.863
 awa+0.811
sense+0.774
 Definition+0.767
Neg logit contributions
ertodd-0.617
ointed-0.597
ulously-0.595
rived-0.595
rive-0.593
Tokenl
Token ID75
Feature activation+3.01e-01

Pos logit contributions
ertodd+0.216
rive+0.210
ointed+0.210
ulously+0.209
rived+0.209
Neg logit contributions
izabeth-0.280
-0.269
 awa-0.257
 Definition-0.240
sense-0.239
Tokenxp
Token ID42372
Feature activation+4.53e-01

Pos logit contributions
izabeth+1.016
+0.967
 awa+0.916
 Definition+0.866
sense+0.863
Neg logit contributions
ertodd-0.757
rive-0.729
ulously-0.729
ointed-0.729
rived-0.728
Token Profile
Token ID13118
Feature activation-3.23e-01

Pos logit contributions
izabeth+1.514
+1.450
 awa+1.375
 Definition+1.300
sense+1.296
Neg logit contributions
ertodd-1.140
rive-1.094
ulously-1.090
rived-1.089
ointed-1.084
Token Joined
Token ID16709
Feature activation-1.28e+00

Pos logit contributions
ertodd+0.566
rive+0.544
ointed+0.541
rived+0.540
ulously+0.539
Neg logit contributions
izabeth-1.334
-1.294
 awa-1.247
 Definition-1.184
sense-1.176
Token September
Token ID2693
Feature activation-1.21e-01

Pos logit contributions
ointed+1.937
ertodd+1.893
rive+1.885
ulously+1.879
 Hollow+1.873
Neg logit contributions
izabeth-5.440
-5.186
 awa-5.128
ッド-4.808
afety-4.781
Token 2010
Token ID3050
Feature activation-3.57e-01

Pos logit contributions
ertodd+0.202
ointed+0.196
rive+0.195
rived+0.194
ulously+0.194
Neg logit contributions
izabeth-0.506
-0.490
 awa-0.474
 Definition-0.446
sense-0.446
Token United
Token ID1578
Feature activation-1.47e-01

Pos logit contributions
ertodd+0.930
ointed+0.927
rive+0.920
ulously+0.913
rived+0.913
Neg logit contributions
izabeth-1.144
-1.094
 awa-1.053
ッド-0.968
afety-0.968
Token States
Token ID1829
Feature activation-2.32e-01

Pos logit contributions
ertodd+0.500
rive+0.490
ointed+0.490
ulously+0.489
rived+0.489
Neg logit contributions
izabeth-0.364
-0.345
 awa-0.324
 Definition-0.295
sense-0.292
Token 10
Token ID838
Feature activation-3.38e-01

Pos logit contributions
ertodd+0.653
ointed+0.641
rive+0.639
ulously+0.637
rived+0.636
Neg logit contributions
izabeth-0.708
-0.678
 awa-0.647
ッド-0.594
 Definition-0.593
Token
Token ID198
Feature activation+2.31e-01

Pos logit contributions
izabeth+0.743
+0.721
 awa+0.647
 Definition+0.638
sense+0.636
Neg logit contributions
ertodd-0.647
ulously-0.627
rived-0.626
rive-0.625
ointed-0.623
Tokenre
Token ID260
Feature activation-1.29e-01

Pos logit contributions
izabeth+0.793
+0.772
 awa+0.721
sense+0.690
iq+0.683
Neg logit contributions
ertodd-0.560
ointed-0.543
ulously-0.540
rived-0.539
rive-0.537
Tokeni
Token ID72
Feature activation+3.14e-01

Pos logit contributions
ertodd+0.350
rive+0.341
ointed+0.341
rived+0.340
ulously+0.340
Neg logit contributions
izabeth-0.406
-0.390
 awa-0.371
 Definition-0.345
sense-0.343
Token Profile
Token ID13118
Feature activation-1.14e-01

Pos logit contributions
izabeth+1.048
+1.006
 awa+0.958
 Definition+0.900
sense+0.897
Neg logit contributions
ertodd-0.794
ointed-0.770
rive-0.769
ulously-0.767
rived-0.767
Token Blog
Token ID14001
Feature activation-6.28e-01

Pos logit contributions
ertodd+0.207
rive+0.199
rived+0.198
ulously+0.197
ointed+0.197
Neg logit contributions
izabeth-0.463
-0.449
 awa-0.432
 Definition-0.411
sense-0.407
Token Joined
Token ID16709
Feature activation-1.26e+00

Pos logit contributions
ertodd+0.943
ointed+0.911
rive+0.904
ulously+0.902
rived+0.900
Neg logit contributions
izabeth-2.749
-2.649
 awa-2.584
 Definition-2.434
ッド-2.424
Token October
Token ID3267
Feature activation-3.12e-01

Pos logit contributions
ointed+1.954
ertodd+1.931
rive+1.919
ulously+1.902
 Hollow+1.886
Neg logit contributions
izabeth-5.316
-5.074
 awa-5.009
ッド-4.697
afety-4.673
Token 2002
Token ID6244
Feature activation-5.82e-01

Pos logit contributions
ertodd+0.511
ointed+0.500
rive+0.497
ulously+0.494
rived+0.493
Neg logit contributions
izabeth-1.313
-1.269
 awa-1.228
sense-1.156
ッド-1.154
Token United
Token ID1578
Feature activation-1.66e-01

Pos logit contributions
ertodd+1.470
ointed+1.465
rive+1.446
ulously+1.439
rived+1.432
Neg logit contributions
izabeth-1.902
-1.825
 awa-1.760
afety-1.614
sense-1.614
Token States
Token ID1829
Feature activation-2.16e-01

Pos logit contributions
ertodd+0.568
rive+0.558
ointed+0.557
rived+0.556
ulously+0.556
Neg logit contributions
izabeth-0.408
-0.387
 awa-0.363
 Definition-0.330
sense-0.326
Token 3
Token ID513
Feature activation-3.37e-01

Pos logit contributions
ertodd+0.611
ointed+0.597
rive+0.597
ulously+0.594
rived+0.594
Neg logit contributions
izabeth-0.658
-0.631
 awa-0.600
 Definition-0.554
sense-0.552
Tokenada
Token ID4763
Feature activation+1.46e-01

Pos logit contributions
ertodd+0.349
rive+0.340
ointed+0.339
rived+0.338
ulously+0.338
Neg logit contributions
izabeth-0.440
-0.424
 awa-0.404
 Definition-0.377
sense-0.374
Tokenver
Token ID332
Feature activation+1.87e-01

Pos logit contributions
izabeth+0.487
+0.469
 awa+0.447
 Definition+0.417
sense+0.414
Neg logit contributions
ertodd-0.368
rive-0.359
ointed-0.359
rived-0.358
ulously-0.357
TokenBB
Token ID15199
Feature activation-1.04e-01

Pos logit contributions
izabeth+0.635
+0.613
 awa+0.586
 Definition+0.550
sense+0.547
Neg logit contributions
ertodd-0.464
ointed-0.449
rive-0.448
ulously-0.447
rived-0.446
Token Profile
Token ID13118
Feature activation-4.15e-01

Pos logit contributions
ertodd+0.251
rive+0.245
ointed+0.245
rived+0.244
ulously+0.244
Neg logit contributions
izabeth-0.361
-0.348
 awa-0.333
 Definition-0.312
ッド-0.309
Token Blog
Token ID14001
Feature activation-7.67e-01

Pos logit contributions
ertodd+0.758
rive+0.731
ointed+0.731
rived+0.726
ulously+0.726
Neg logit contributions
izabeth-1.680
-1.627
 awa-1.566
 Definition-1.482
sense-1.476
Token Joined
Token ID16709
Feature activation-1.23e+00

Pos logit contributions
ertodd+1.121
ointed+1.096
rive+1.088
rived+1.079
ulously+1.076
Neg logit contributions
izabeth-3.376
-3.254
 awa-3.181
ッド-2.985
 Definition-2.982
Token June
Token ID2795
Feature activation-8.22e-02

Pos logit contributions
ointed+1.897
ertodd+1.848
rive+1.846
 Hollow+1.839
ulously+1.824
Neg logit contributions
izabeth-5.234
-4.991
 awa-4.924
ッド-4.627
afety-4.605
Token 2009
Token ID3717
Feature activation-3.32e-01

Pos logit contributions
ertodd+0.141
rive+0.136
ointed+0.136
rived+0.135
ulously+0.135
Neg logit contributions
izabeth-0.341
-0.331
 awa-0.319
 Definition-0.302
sense-0.301
Token Germany
Token ID4486
Feature activation-9.74e-02

Pos logit contributions
ertodd+0.810
ointed+0.808
rive+0.799
rived+0.795
ulously+0.793
Neg logit contributions
izabeth-1.126
-1.079
 awa-1.036
ッド-0.958
sense-0.957
Token 247
Token ID30179
Feature activation-2.74e-01

Pos logit contributions
ertodd+0.272
ointed+0.267
rive+0.267
rived+0.265
ulously+0.265
Neg logit contributions
izabeth-0.298
-0.285
 awa-0.273
sense-0.250
 Definition-0.250
Token34
Token ID2682
Feature activation-3.71e-01

Pos logit contributions
ertodd+0.481
ointed+0.464
rive+0.462
rived+0.459
ulously+0.458
Neg logit contributions
izabeth-1.123
-1.087
 awa-1.052
 Definition-0.989
sense-0.988
TokenK
Token ID42
Feature activation+1.57e-01

Pos logit contributions
ertodd+0.062
rive+0.060
ointed+0.060
rived+0.060
ulously+0.060
Neg logit contributions
izabeth-0.085
-0.082
 awa-0.078
 Definition-0.074
sense-0.073
Tokenier
Token ID959
Feature activation+3.84e-01

Pos logit contributions
izabeth+0.522
+0.501
 awa+0.477
 Definition+0.448
sense+0.445
Neg logit contributions
ertodd-0.401
rive-0.388
ointed-0.388
rived-0.387
ulously-0.387
Tokenan
Token ID272
Feature activation+2.68e-01

Pos logit contributions
izabeth+1.284
+1.236
 awa+1.176
 Definition+1.108
sense+1.101
Neg logit contributions
ertodd-0.966
ointed-0.934
ulously-0.931
rive-0.929
rived-0.928
Token Profile
Token ID13118
Feature activation-2.11e-01

Pos logit contributions
izabeth+0.895
+0.863
 awa+0.822
 Definition+0.776
sense+0.769
Neg logit contributions
ertodd-0.675
ointed-0.651
rived-0.650
ulously-0.650
rive-0.650
Token Blog
Token ID14001
Feature activation-7.20e-01

Pos logit contributions
ertodd+0.403
rive+0.388
rived+0.385
ulously+0.384
ointed+0.384
Neg logit contributions
izabeth-0.838
-0.812
 awa-0.782
 Definition-0.743
sense-0.735
Token Joined
Token ID16709
Feature activation-1.22e+00

Pos logit contributions
ertodd+1.046
ointed+1.024
rive+1.016
rived+1.008
ulously+1.007
Neg logit contributions
izabeth-3.171
-3.054
 awa-2.991
 Definition-2.802
afety-2.802
Token August
Token ID2932
Feature activation+2.34e-03

Pos logit contributions
ointed+1.880
rive+1.828
ertodd+1.827
 Hollow+1.823
ulously+1.815
Neg logit contributions
izabeth-5.196
-4.951
 awa-4.893
ッド-4.595
afety-4.578
Token 2011
Token ID2813
Feature activation-3.14e-01

Pos logit contributions
izabeth+0.010
+0.009
 awa+0.009
 Definition+0.009
sense+0.008
Neg logit contributions
ertodd-0.004
rive-0.004
rived-0.004
ointed-0.004
ulously-0.004
Token United
Token ID1578
Feature activation-1.09e-01

Pos logit contributions
ertodd+0.808
ointed+0.805
rive+0.798
ulously+0.793
rived+0.792
Neg logit contributions
izabeth-1.018
-0.972
 awa-0.936
afety-0.861
ッド-0.861
Token States
Token ID1829
Feature activation-1.62e-01

Pos logit contributions
ertodd+0.369
rive+0.362
ointed+0.362
ulously+0.361
rived+0.361
Neg logit contributions
izabeth-0.271
-0.257
 awa-0.241
 Definition-0.220
sense-0.217
Token 6
Token ID718
Feature activation-2.58e-01

Pos logit contributions
ertodd+0.453
ointed+0.444
rive+0.443
ulously+0.442
rived+0.441
Neg logit contributions
izabeth-0.494
-0.473
 awa-0.451
 Definition-0.415
ッド-0.414
TokenJ
Token ID41
Feature activation-1.96e-01

Pos logit contributions
izabeth+0.503
+0.487
 awa+0.460
sense+0.435
 Definition+0.433
Neg logit contributions
ertodd-0.363
ointed-0.352
ulously-0.350
rived-0.350
rive-0.350
Tokenaaaa
Token ID24794
Feature activation+1.20e-01

Pos logit contributions
ertodd+0.506
rive+0.494
ointed+0.493
rived+0.491
ulously+0.491
Neg logit contributions
izabeth-0.646
-0.621
 awa-0.594
 Definition-0.554
sense-0.549
Tokenasper
Token ID32981
Feature activation+1.58e-01

Pos logit contributions
izabeth+0.394
+0.379
 awa+0.361
 Definition+0.337
sense+0.335
Neg logit contributions
ertodd-0.314
rive-0.305
ointed-0.305
rived-0.304
ulously-0.304
Token Profile
Token ID13118
Feature activation-2.11e-01

Pos logit contributions
izabeth+0.533
+0.513
 awa+0.489
 Definition+0.461
sense+0.458
Neg logit contributions
ertodd-0.393
rive-0.381
ointed-0.381
ulously-0.381
rived-0.380
Token Blog
Token ID14001
Feature activation-6.53e-01

Pos logit contributions
ertodd+0.378
rive+0.363
rived+0.360
ointed+0.360
ulously+0.360
Neg logit contributions
izabeth-0.863
-0.837
 awa-0.807
 Definition-0.766
sense-0.760
Token Joined
Token ID16709
Feature activation-1.21e+00

Pos logit contributions
ertodd+0.982
ointed+0.952
rive+0.947
rived+0.940
ulously+0.939
Neg logit contributions
izabeth-2.852
-2.752
 awa-2.683
 Definition-2.526
ッド-2.518
Token April
Token ID3035
Feature activation-1.55e-01

Pos logit contributions
ointed+1.871
ertodd+1.832
rive+1.828
 Hollow+1.814
ulously+1.803
Neg logit contributions
izabeth-5.153
-4.917
 awa-4.847
ッド-4.553
afety-4.526
Token 2012
Token ID2321
Feature activation-4.01e-01

Pos logit contributions
ertodd+0.263
ointed+0.255
rive+0.255
ulously+0.253
rived+0.253
Neg logit contributions
izabeth-0.645
-0.625
 awa-0.605
sense-0.568
 Definition-0.568
Token United
Token ID1578
Feature activation-1.61e-01

Pos logit contributions
ertodd+0.996
ointed+0.993
rive+0.982
ulously+0.976
rived+0.974
Neg logit contributions
izabeth-1.332
-1.275
 awa-1.230
ッド-1.134
afety-1.132
Token States
Token ID1829
Feature activation-2.08e-01

Pos logit contributions
ertodd+0.549
rive+0.538
ointed+0.537
rived+0.536
ulously+0.536
Neg logit contributions
izabeth-0.399
-0.379
 awa-0.355
 Definition-0.323
sense-0.320
Token 5
Token ID642
Feature activation-3.69e-01

Pos logit contributions
ertodd+0.587
ointed+0.575
rive+0.574
ulously+0.572
rived+0.572
Neg logit contributions
izabeth-0.629
-0.603
 awa-0.574
 Definition-0.528
ッド-0.527
Token of
Token ID286
Feature activation+1.80e-01

Pos logit contributions
ertodd+0.518
rive+0.503
ointed+0.502
ulously+0.500
rived+0.500
Neg logit contributions
izabeth-0.725
-0.698
 awa-0.667
 Definition-0.626
sense-0.622
Token Dal
Token ID12348
Feature activation+7.37e-02

Pos logit contributions
izabeth+0.598
+0.570
 awa+0.545
 Definition+0.520
ッド+0.507
Neg logit contributions
ertodd-0.458
rive-0.449
ointed-0.447
ulously-0.447
rived-0.446
TokenK
Token ID42
Feature activation+8.02e-02

Pos logit contributions
izabeth+0.218
+0.209
 awa+0.198
 Definition+0.183
sense+0.182
Neg logit contributions
ertodd-0.215
rive-0.210
ointed-0.210
rived-0.209
ulously-0.209
Token Profile
Token ID13118
Feature activation-1.90e-01

Pos logit contributions
izabeth+0.265
+0.255
 awa+0.243
 Definition+0.227
sense+0.226
Neg logit contributions
ertodd-0.206
rive-0.200
ointed-0.200
rived-0.200
ulously-0.199
Token Blog
Token ID14001
Feature activation-5.40e-01

Pos logit contributions
ertodd+0.358
rive+0.344
ointed+0.342
rived+0.342
ulously+0.341
Neg logit contributions
izabeth-0.760
-0.737
 awa-0.710
 Definition-0.673
sense-0.667
Token Joined
Token ID16709
Feature activation-1.19e+00

Pos logit contributions
ertodd+0.812
ointed+0.787
rive+0.784
ulously+0.778
rived+0.778
Neg logit contributions
izabeth-2.361
-2.278
 awa-2.221
 Definition-2.090
ッド-2.085
Token June
Token ID2795
Feature activation-2.28e-02

Pos logit contributions
ointed+1.838
ertodd+1.795
rive+1.794
 Hollow+1.787
ulously+1.776
Neg logit contributions
izabeth-5.070
-4.833
 awa-4.777
ッド-4.488
afety-4.459
Token 2012
Token ID2321
Feature activation-1.88e-01

Pos logit contributions
ertodd+0.039
rive+0.038
ointed+0.038
rived+0.037
ulously+0.037
Neg logit contributions
izabeth-0.094
-0.091
 awa-0.088
 Definition-0.083
sense-0.083
Token Canada
Token ID3340
Feature activation-1.56e-01

Pos logit contributions
ertodd+0.485
ointed+0.481
rive+0.477
ulously+0.475
rived+0.474
Neg logit contributions
izabeth-0.614
-0.588
 awa-0.565
ッド-0.521
sense-0.520
Token 16
Token ID1467
Feature activation-7.36e-02

Pos logit contributions
ertodd+0.441
ointed+0.431
rive+0.431
rived+0.429
ulously+0.429
Neg logit contributions
izabeth-0.477
-0.457
 awa-0.435
 Definition-0.401
sense-0.400
Token10
Token ID940
Feature activation-3.91e-01

Pos logit contributions
ertodd+0.169
ointed+0.164
rive+0.164
rived+0.163
ulously+0.163
Neg logit contributions
izabeth-0.262
-0.253
 awa-0.243
 Definition-0.227
sense-0.226
Token
Token ID198
Feature activation+2.00e-01

Pos logit contributions
izabeth+0.833
+0.808
 awa+0.730
 Definition+0.716
sense+0.714
Neg logit contributions
ertodd-0.723
ulously-0.701
rived-0.699
rive-0.698
ointed-0.695
TokenDark
Token ID17367
Feature activation+1.53e-01

Pos logit contributions
izabeth+0.684
+0.663
 awa+0.625
sense+0.592
 Definition+0.589
Neg logit contributions
ertodd-0.488
ointed-0.473
ulously-0.471
rived-0.471
rive-0.470
Tokenhorse
Token ID30527
Feature activation-3.68e-01

Pos logit contributions
izabeth+0.528
+0.509
 awa+0.486
 Definition+0.458
sense+0.458
Neg logit contributions
ertodd-0.369
ointed-0.358
ulously-0.358
rived-0.357
rive-0.355
Token Profile
Token ID13118
Feature activation-8.51e-02

Pos logit contributions
ertodd+0.895
rive+0.870
ointed+0.869
rived+0.866
ulously+0.866
Neg logit contributions
izabeth-1.267
-1.221
 awa-1.167
 Definition-1.094
sense-1.087
Token Blog
Token ID14001
Feature activation-5.72e-01

Pos logit contributions
ertodd+0.159
rive+0.153
rived+0.152
ulously+0.152
ointed+0.151
Neg logit contributions
izabeth-0.340
-0.329
 awa-0.318
 Definition-0.302
sense-0.298
Token Joined
Token ID16709
Feature activation-1.19e+00

Pos logit contributions
ertodd+0.858
ointed+0.833
rive+0.829
rived+0.822
ulously+0.822
Neg logit contributions
izabeth-2.495
-2.409
 awa-2.350
 Definition-2.211
ッド-2.206
Token December
Token ID3426
Feature activation-3.04e-01

Pos logit contributions
ointed+1.845
rive+1.800
ertodd+1.798
ulously+1.787
 Hollow+1.787
Neg logit contributions
izabeth-5.051
-4.819
 awa-4.758
ッド-4.467
afety-4.443
Token 2011
Token ID2813
Feature activation-4.56e-01

Pos logit contributions
ertodd+0.503
ointed+0.494
rive+0.491
ulously+0.487
rived+0.485
Neg logit contributions
izabeth-1.276
-1.233
 awa-1.198
sense-1.125
ッド-1.122
Token United
Token ID1578
Feature activation-5.11e-02

Pos logit contributions
ointed+1.164
ertodd+1.163
rive+1.150
ulously+1.142
rived+1.141
Neg logit contributions
izabeth-1.484
-1.417
 awa-1.367
afety-1.258
ッド-1.254
Token States
Token ID1829
Feature activation-1.26e-01

Pos logit contributions
ertodd+0.174
rive+0.170
ointed+0.170
ulously+0.170
rived+0.170
Neg logit contributions
izabeth-0.126
-0.120
 awa-0.112
 Definition-0.103
sense-0.101
Token 228
Token ID29041
Feature activation-8.72e-02

Pos logit contributions
ertodd+0.358
ointed+0.350
rive+0.350
ulously+0.348
rived+0.348
Neg logit contributions
izabeth-0.382
-0.366
 awa-0.348
 Definition-0.321
sense-0.320
Token PowerPoint
Token ID43563
Feature activation-5.47e-01

Pos logit contributions
ertodd+0.040
ointed+0.039
rive+0.039
ulously+0.039
rived+0.039
Neg logit contributions
izabeth-0.043
-0.041
 awa-0.039
 Definition-0.037
sense-0.036
Token slide
Token ID10649
Feature activation-5.00e-01

Pos logit contributions
ertodd+1.169
rive+1.157
ointed+1.140
rived+1.137
ulously+1.128
Neg logit contributions
izabeth-2.025
-1.960
 awa-1.887
ッド-1.759
afety-1.756
Token
Token ID198
Feature activation+2.80e-02

Pos logit contributions
ertodd+1.170
ointed+1.145
rive+1.142
ulously+1.135
rived+1.134
Neg logit contributions
izabeth-1.750
-1.684
 awa-1.653
ッド-1.515
 Definition-1.502
Token
Token ID198
Feature activation+1.97e-01

Pos logit contributions
izabeth+0.092
+0.088
 awa+0.083
 Definition+0.079
sense+0.078
Neg logit contributions
ertodd-0.073
rive-0.071
ulously-0.071
ointed-0.071
rived-0.071
TokenPower
Token ID13434
Feature activation-1.56e-01

Pos logit contributions
izabeth+0.475
+0.450
 awa+0.428
 Definition+0.384
ッド+0.378
Neg logit contributions
ertodd-0.677
rive-0.673
rived-0.667
ointed-0.666
ulously-0.666
TokenPoint
Token ID12727
Feature activation-1.16e+00

Pos logit contributions
ertodd+0.258
ointed+0.248
rive+0.247
rived+0.246
ulously+0.245
Neg logit contributions
izabeth-0.657
-0.638
 awa-0.613
 Definition-0.587
sense-0.583
Token slide
Token ID10649
Feature activation-4.55e-01

Pos logit contributions
ulously+2.332
ointed+2.327
rive+2.325
ertodd+2.318
rived+2.284
Neg logit contributions
izabeth-4.383
-4.231
 awa-4.125
afety-3.825
ッド-3.822
Token PNG
Token ID36182
Feature activation-2.90e-01

Pos logit contributions
ertodd+1.150
ulously+1.145
rive+1.140
rived+1.135
ointed+1.134
Neg logit contributions
izabeth-1.504
-1.440
 awa-1.381
ッド-1.281
 Definition-1.278
Token larger
Token ID4025
Feature activation-1.44e-02

Pos logit contributions
ertodd+0.813
rive+0.798
ointed+0.796
ulously+0.793
rived+0.791
Neg logit contributions
izabeth-0.887
-0.852
 awa-0.812
 Definition-0.750
sense-0.743
Token image
Token ID2939
Feature activation-1.90e-01

Pos logit contributions
ertodd+0.021
rived+0.020
ulously+0.020
rive+0.020
ointed+0.020
Neg logit contributions
izabeth-0.064
-0.062
 awa-0.059
 Definition-0.057
sense-0.057
Token
Token ID198
Feature activation-3.01e-01

Pos logit contributions
ertodd+0.466
rive+0.454
ointed+0.454
ulously+0.451
rived+0.449
Neg logit contributions
izabeth-0.648
-0.623
 awa-0.603
ッド-0.556
 Definition-0.555
Token into
Token ID656
Feature activation+2.73e-01

Pos logit contributions
izabeth+0.717
+0.693
 awa+0.666
 Definition+0.625
sense+0.622
Neg logit contributions
ertodd-0.454
rive-0.438
ointed-0.437
rived-0.436
ulously-0.435
Token a
Token ID257
Feature activation+6.53e-02

Pos logit contributions
izabeth+0.843
+0.808
 awa+0.769
 Definition+0.715
sense+0.708
Neg logit contributions
ertodd-0.762
rive-0.741
ointed-0.740
rived-0.739
ulously-0.738
Token blender
Token ID40047
Feature activation+1.89e-01

Pos logit contributions
izabeth+0.216
+0.208
 awa+0.198
 Definition+0.186
sense+0.184
Neg logit contributions
ertodd-0.168
ointed-0.163
rive-0.162
ulously-0.162
rived-0.162
Token.
Token ID13
Feature activation+3.47e-02

Pos logit contributions
izabeth+0.638
+0.616
 awa+0.591
 Definition+0.553
sense+0.548
Neg logit contributions
ertodd-0.472
ulously-0.451
rive-0.451
rived-0.450
ointed-0.449
Token Do
Token ID2141
Feature activation+1.36e-01

Pos logit contributions
izabeth+0.123
+0.119
 awa+0.114
 Definition+0.107
sense+0.106
Neg logit contributions
ertodd-0.080
rive-0.078
ointed-0.078
rived-0.078
ulously-0.078
Tokenle
Token ID293
Feature activation-1.13e+00

Pos logit contributions
izabeth+0.493
+0.476
 awa+0.459
 Definition+0.431
sense+0.428
Neg logit contributions
ertodd-0.303
rive-0.291
ointed-0.291
rived-0.290
ulously-0.290
Token's
Token ID338
Feature activation+3.09e-01

Pos logit contributions
rive+2.544
ertodd+2.536
ulously+2.535
ointed+2.532
rived+2.514
Neg logit contributions
izabeth-4.010
-3.861
 awa-3.676
afety-3.458
ッド-3.451
Token Orange
Token ID11942
Feature activation+3.74e-01

Pos logit contributions
izabeth+1.051
+1.012
 awa+0.969
 Definition+0.910
sense+0.903
Neg logit contributions
ertodd-0.763
rive-0.740
ointed-0.738
rived-0.737
ulously-0.736
Token-
Token ID12
Feature activation+3.43e-01

Pos logit contributions
izabeth+1.111
+1.065
 awa+1.011
 Definition+0.942
sense+0.932
Neg logit contributions
ertodd-1.088
rive-1.060
ointed-1.057
rived-1.057
ulously-1.055
TokenP
Token ID47
Feature activation+4.41e-01

Pos logit contributions
izabeth+1.223
+1.188
 awa+1.130
 Definition+1.069
sense+1.068
Neg logit contributions
ertodd-0.790
ointed-0.758
rive-0.757
ulously-0.755
rived-0.754
Tokenine
Token ID500
Feature activation+3.23e-01

Pos logit contributions
izabeth+1.512
+1.452
 awa+1.378
 Definition+1.303
sense+1.296
Neg logit contributions
ertodd-1.077
ointed-1.037
ulously-1.034
rive-1.028
rived-1.027
Token
Token ID198
Feature activation-2.75e-02

Pos logit contributions
izabeth+0.251
+0.243
 awa+0.223
 Definition+0.216
sense+0.215
Neg logit contributions
ertodd-0.211
ulously-0.204
rive-0.204
rived-0.203
ointed-0.203
TokenBig
Token ID12804
Feature activation-5.15e-02

Pos logit contributions
ertodd+0.067
ointed+0.065
rive+0.065
ulously+0.065
rived+0.065
Neg logit contributions
izabeth-0.094
-0.091
 awa-0.087
sense-0.082
 Definition-0.081
TokenFan
Token ID22480
Feature activation+6.40e-02

Pos logit contributions
ertodd+0.126
rive+0.123
ointed+0.123
rived+0.122
ulously+0.122
Neg logit contributions
izabeth-0.176
-0.170
 awa-0.162
 Definition-0.152
sense-0.151
Token Profile
Token ID13118
Feature activation+1.10e-02

Pos logit contributions
izabeth+0.220
+0.212
 awa+0.203
 Definition+0.191
sense+0.189
Neg logit contributions
ertodd-0.155
rive-0.151
ointed-0.150
rived-0.150
ulously-0.150
Token Blog
Token ID14001
Feature activation-5.93e-01

Pos logit contributions
izabeth+0.044
+0.043
 awa+0.041
 Definition+0.039
sense+0.039
Neg logit contributions
ertodd-0.021
rive-0.020
rived-0.020
ulously-0.020
ointed-0.020
Token Joined
Token ID16709
Feature activation-1.12e+00

Pos logit contributions
ertodd+0.861
ointed+0.846
rive+0.840
rived+0.832
ulously+0.832
Neg logit contributions
izabeth-2.613
-2.516
 awa-2.463
ッド-2.312
 Definition-2.309
Token December
Token ID3426
Feature activation-2.29e-01

Pos logit contributions
ointed+1.746
rive+1.708
ertodd+1.708
ulously+1.696
 Hollow+1.686
Neg logit contributions
izabeth-4.736
-4.523
 awa-4.461
ッド-4.186
afety-4.169
Token 2010
Token ID3050
Feature activation-3.04e-01

Pos logit contributions
ertodd+0.393
ointed+0.379
rive+0.379
ulously+0.376
rived+0.376
Neg logit contributions
izabeth-0.954
-0.924
 awa-0.892
 Definition-0.842
sense-0.840
Token TL
Token ID24811
Feature activation-4.13e-01

Pos logit contributions
ertodd+0.786
ointed+0.775
rive+0.771
ulously+0.767
rived+0.766
Neg logit contributions
izabeth-0.990
-0.949
 awa-0.911
sense-0.840
afety-0.839
TokenAD
Token ID2885
Feature activation-4.75e-02

Pos logit contributions
ertodd+0.980
ointed+0.959
rive+0.957
rived+0.952
ulously+0.951
Neg logit contributions
izabeth-1.437
-1.386
 awa-1.326
 Definition-1.234
ッド-1.231
TokenT
Token ID51
Feature activation+3.03e-02

Pos logit contributions
ertodd+0.111
ulously+0.104
ointed+0.104
rived+0.104
rive+0.103
Neg logit contributions
izabeth-0.169
-0.164
 awa-0.156
 Definition-0.149
sense-0.148
Token and
Token ID290
Feature activation+8.95e-03

Pos logit contributions
ertodd+0.292
ointed+0.284
rive+0.284
ulously+0.283
rived+0.283
Neg logit contributions
izabeth-0.435
-0.418
 awa-0.400
 Definition-0.375
ッド-0.374
Token
Token ID3926
Feature activation-5.39e-02

Pos logit contributions
izabeth+0.029
+0.028
 awa+0.027
 Definition+0.025
sense+0.025
Neg logit contributions
ertodd-0.023
rive-0.023
ointed-0.023
ulously-0.023
rived-0.023
Token N
Token ID399
Feature activation-5.26e-02

Pos logit contributions
ertodd+0.122
ointed+0.119
rive+0.119
rived+0.119
ulously+0.118
Neg logit contributions
izabeth-0.194
-0.187
 awa-0.178
 Definition-0.168
ッド-0.167
TokenINE
Token ID8881
Feature activation+1.41e-01

Pos logit contributions
ertodd+0.146
rive+0.143
ointed+0.143
ulously+0.142
rived+0.142
Neg logit contributions
izabeth-0.163
-0.156
 awa-0.148
 Definition-0.138
sense-0.137
Token DO
Token ID8410
Feature activation-5.13e-02

Pos logit contributions
izabeth+0.484
+0.466
 awa+0.445
 Definition+0.417
sense+0.414
Neg logit contributions
ertodd-0.347
rive-0.337
ointed-0.337
rived-0.336
ulously-0.336
TokenLL
Token ID3069
Feature activation-1.11e+00

Pos logit contributions
ertodd+0.144
rive+0.142
ointed+0.141
rived+0.140
ulously+0.140
Neg logit contributions
izabeth-0.157
-0.150
 awa-0.143
 Definition-0.131
sense-0.131
TokenARS
Token ID27415
Feature activation+7.13e-03

Pos logit contributions
ertodd+3.479
rive+3.428
ointed+3.425
ulously+3.398
rived+3.389
Neg logit contributions
izabeth-2.907
-2.711
 awa-2.666
sense-2.363
 Definition-2.359
Token?
Token ID30
Feature activation+6.69e-02

Pos logit contributions
izabeth+0.025
+0.025
 awa+0.024
 Definition+0.022
sense+0.022
Neg logit contributions
ertodd-0.016
rive-0.016
ointed-0.016
rived-0.016
ulously-0.016
Token N
Token ID399
Feature activation-1.15e-01

Pos logit contributions
izabeth+0.223
+0.215
 awa+0.205
 Definition+0.192
sense+0.190
Neg logit contributions
ertodd-0.170
rive-0.166
ointed-0.165
rived-0.165
ulously-0.165
TokenINE
Token ID8881
Feature activation-1.10e-01

Pos logit contributions
ertodd+0.283
rive+0.275
ointed+0.275
rived+0.274
ulously+0.273
Neg logit contributions
izabeth-0.393
-0.378
 awa-0.361
 Definition-0.338
sense-0.337
Token DO
Token ID8410
Feature activation-8.09e-02

Pos logit contributions
ertodd+0.232
rive+0.225
ointed+0.224
rived+0.224
ulously+0.223
Neg logit contributions
izabeth-0.413
-0.400
 awa-0.384
 Definition-0.362
sense-0.360
Token spokesman
Token ID6523
Feature activation+2.82e-01

Pos logit contributions
izabeth+0.660
+0.635
 awa+0.606
 Definition+0.567
sense+0.563
Neg logit contributions
ertodd-0.535
ointed-0.518
rive-0.516
ulously-0.515
rived-0.514
Token said
Token ID531
Feature activation-6.49e-02

Pos logit contributions
izabeth+1.052
+1.013
 awa+0.976
 Definition+0.921
sense+0.914
Neg logit contributions
ertodd-0.606
rive-0.584
ointed-0.582
rived-0.576
ulously-0.575
Token:
Token ID25
Feature activation+1.00e-01

Pos logit contributions
ertodd+0.157
ointed+0.151
rive+0.151
rived+0.150
ulously+0.150
Neg logit contributions
izabeth-0.225
-0.217
 awa-0.209
 Definition-0.195
sense-0.194
Token "
Token ID366
Feature activation+9.03e-02

Pos logit contributions
izabeth+0.335
+0.323
 awa+0.307
 Definition+0.289
sense+0.287
Neg logit contributions
ertodd-0.253
rive-0.246
ointed-0.246
rived-0.245
ulously-0.245
TokenTh
Token ID817
Feature activation-5.02e-01

Pos logit contributions
izabeth+0.314
+0.305
 awa+0.290
 Definition+0.273
sense+0.272
Neg logit contributions
ertodd-0.217
ointed-0.210
rive-0.209
rived-0.208
ulously-0.208
Tokenur
Token ID333
Feature activation-1.09e+00

Pos logit contributions
ertodd+1.081
ointed+1.046
rive+1.045
rived+1.040
ulously+1.040
Neg logit contributions
izabeth-1.870
-1.807
 awa-1.733
 Definition-1.633
sense-1.623
Tokenrock
Token ID10823
Feature activation+2.28e-02

Pos logit contributions
rive+2.867
ertodd+2.846
ulously+2.816
rived+2.800
ointed+2.798
Neg logit contributions
izabeth-3.518
-3.368
 awa-3.364
ッド-3.052
 Definition-3.046
Token Council
Token ID4281
Feature activation+9.60e-02

Pos logit contributions
izabeth+0.079
+0.076
 awa+0.073
 Definition+0.068
sense+0.068
Neg logit contributions
ertodd-0.055
rive-0.054
ointed-0.054
rived-0.053
ulously-0.053
Token is
Token ID318
Feature activation+1.98e-01

Pos logit contributions
izabeth+0.335
+0.323
 awa+0.309
 Definition+0.290
sense+0.288
Neg logit contributions
ertodd-0.229
rive-0.222
ointed-0.221
rived-0.221
ulously-0.221
Token following
Token ID1708
Feature activation-1.95e-01

Pos logit contributions
izabeth+0.697
+0.669
 awa+0.649
 Definition+0.606
sense+0.601
Neg logit contributions
ertodd-0.471
rived-0.449
rive-0.447
ointed-0.447
ulously-0.446
Token up
Token ID510
Feature activation-3.43e-01

Pos logit contributions
ertodd+0.468
rive+0.452
ointed+0.451
rived+0.450
ulously+0.449
Neg logit contributions
izabeth-0.682
-0.657
 awa-0.630
 Definition-0.592
sense-0.588
Token,
Token ID11
Feature activation-2.19e-01

Pos logit contributions
ertodd+0.351
rive+0.339
ointed+0.338
rived+0.337
ulously+0.335
Neg logit contributions
izabeth-0.637
-0.616
 awa-0.591
 Definition-0.558
sense-0.556
Token physicists
Token ID37898
Feature activation-1.15e-01

Pos logit contributions
ertodd+0.537
rive+0.522
ointed+0.521
rived+0.520
ulously+0.519
Neg logit contributions
izabeth-0.747
-0.719
 awa-0.688
 Definition-0.645
sense-0.640
Token have
Token ID423
Feature activation-2.05e-01

Pos logit contributions
ertodd+0.283
rive+0.276
ointed+0.276
rived+0.275
ulously+0.275
Neg logit contributions
izabeth-0.393
-0.379
 awa-0.361
 Definition-0.337
ッド-0.337
Token turned
Token ID2900
Feature activation-2.82e-01

Pos logit contributions
ertodd+0.507
rive+0.494
ointed+0.493
rived+0.491
ulously+0.491
Neg logit contributions
izabeth-0.695
-0.670
 awa-0.640
 Definition-0.599
sense-0.595
Token their
Token ID511
Feature activation-2.76e-01

Pos logit contributions
ertodd+0.532
ointed+0.509
rive+0.507
rived+0.507
ulously+0.504
Neg logit contributions
izabeth-1.128
-1.094
 awa-1.052
 Definition-0.992
sense-0.991
Token sights
Token ID21343
Feature activation-1.08e+00

Pos logit contributions
ertodd+0.681
rive+0.663
ointed+0.662
rived+0.660
ulously+0.659
Neg logit contributions
izabeth-0.937
-0.902
 awa-0.862
 Definition-0.807
sense-0.802
Token to
Token ID284
Feature activation-1.73e-01

Pos logit contributions
ertodd+1.918
rive+1.883
ointed+1.866
ulously+1.862
rived+1.855
Neg logit contributions
izabeth-4.397
-4.253
 awa-4.111
 Definition-3.907
sense-3.874
Token the
Token ID262
Feature activation-9.08e-02

Pos logit contributions
ertodd+0.458
rive+0.448
ointed+0.447
ulously+0.445
rived+0.445
Neg logit contributions
izabeth-0.556
-0.534
 awa-0.509
 Definition-0.474
ッド-0.472
Token phase
Token ID7108
Feature activation+1.94e-01

Pos logit contributions
ertodd+0.239
rive+0.234
ointed+0.234
rived+0.233
ulously+0.232
Neg logit contributions
izabeth-0.293
-0.281
 awa-0.268
 Definition-0.249
ッド-0.249
Token that
Token ID326
Feature activation+1.66e-01

Pos logit contributions
izabeth+0.638
+0.614
 awa+0.585
 Definition+0.546
sense+0.543
Neg logit contributions
ertodd-0.501
rive-0.489
ointed-0.488
ulously-0.487
rived-0.487
Token occurs
Token ID8833
Feature activation-1.36e-01

Pos logit contributions
izabeth+0.534
+0.513
 awa+0.489
 Definition+0.456
sense+0.452
Neg logit contributions
ertodd-0.443
rive-0.432
ointed-0.431
rived-0.430
ulously-0.429
Token,
Token ID11
Feature activation-4.38e-02

Pos logit contributions
izabeth+0.549
+0.527
 awa+0.509
 Definition+0.481
sense+0.478
Neg logit contributions
ertodd-0.350
ointed-0.339
rived-0.335
rive-0.335
ulously-0.335
Token and
Token ID290
Feature activation-4.08e-02

Pos logit contributions
ertodd+0.099
rive+0.095
ointed+0.095
rived+0.095
ulously+0.095
Neg logit contributions
izabeth-0.159
-0.153
 awa-0.147
 Definition-0.138
sense-0.137
Token off
Token ID572
Feature activation-1.90e-02

Pos logit contributions
ertodd+0.093
rive+0.090
rived+0.089
ointed+0.089
ulously+0.089
Neg logit contributions
izabeth-0.147
-0.142
 awa-0.136
 Definition-0.128
sense-0.127
Token the
Token ID262
Feature activation-2.01e-01

Pos logit contributions
ertodd+0.047
ointed+0.045
rive+0.045
rived+0.045
ulously+0.045
Neg logit contributions
izabeth-0.065
-0.062
 awa-0.060
 Definition-0.056
iq-0.056
Token do
Token ID466
Feature activation-2.19e-01

Pos logit contributions
ertodd+0.555
rive+0.542
ointed+0.540
rived+0.539
ulously+0.538
Neg logit contributions
izabeth-0.628
-0.602
 awa-0.573
 Definition-0.534
sense-0.529
Tokenle
Token ID293
Feature activation-1.05e+00

Pos logit contributions
ertodd+0.632
rive+0.624
ointed+0.620
ulously+0.619
rived+0.619
Neg logit contributions
izabeth-0.651
-0.625
 awa-0.592
 Definition-0.549
sense-0.544
Token forever
Token ID8097
Feature activation-1.42e-01

Pos logit contributions
ertodd+2.365
rive+2.327
rived+2.320
ointed+2.318
ulously+2.318
Neg logit contributions
izabeth-3.769
-3.656
 awa-3.484
ッド-3.266
 Definition-3.265
Token.
Token ID13
Feature activation+2.19e-02

Pos logit contributions
ertodd+0.354
ointed+0.343
rive+0.343
rived+0.343
ulously+0.342
Neg logit contributions
izabeth-0.481
-0.464
 awa-0.444
 Definition-0.415
sense-0.411
Token
Token ID198
Feature activation+1.67e-01

Pos logit contributions
izabeth+0.074
+0.071
 awa+0.068
 Definition+0.063
sense+0.063
Neg logit contributions
ertodd-0.055
rive-0.053
ointed-0.053
rived-0.053
ulously-0.053
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
izabeth+0.538
+0.521
 awa+0.479
 Definition+0.462
sense+0.460
Neg logit contributions
ertodd-0.447
rive-0.433
rived-0.433
ulously-0.433
ointed-0.431
TokenThe
Token ID464
Feature activation+1.03e-01

Pos logit contributions
izabeth+0.689
+0.668
 awa+0.632
 Definition+0.599
sense+0.597
Neg logit contributions
ertodd-0.459
rive-0.444
ointed-0.443
rived-0.443
ulously-0.442
TokenThe
Token ID464
Feature activation+1.64e-02

Pos logit contributions
ertodd+0.131
rive+0.128
ointed+0.128
rived+0.127
ulously+0.127
Neg logit contributions
izabeth-0.187
-0.180
 awa-0.173
 Definition-0.162
sense-0.160
Token responses
Token ID9109
Feature activation-1.82e-01

Pos logit contributions
izabeth+0.054
+0.052
 awa+0.050
 Definition+0.047
sense+0.046
Neg logit contributions
ertodd-0.042
rive-0.041
ointed-0.041
ulously-0.041
rived-0.041
Token I
Token ID314
Feature activation-7.47e-02

Pos logit contributions
ertodd+0.374
rive+0.360
ointed+0.360
ulously+0.358
rived+0.358
Neg logit contributions
izabeth-0.696
-0.673
 awa-0.647
 Definition-0.611
sense-0.607
Token receive
Token ID3328
Feature activation-3.67e-01

Pos logit contributions
ertodd+0.194
ointed+0.190
rive+0.190
rived+0.189
ulously+0.188
Neg logit contributions
izabeth-0.244
-0.235
 awa-0.223
 Definition-0.209
ッド-0.208
Token are
Token ID389
Feature activation-9.56e-02

Pos logit contributions
ertodd+0.757
rive+0.736
ointed+0.734
rived+0.731
ulously+0.731
Neg logit contributions
izabeth-1.399
-1.354
 awa-1.297
 Definition-1.225
ッド-1.219
Token lad
Token ID9717
Feature activation-1.05e+00

Pos logit contributions
ertodd+0.225
rive+0.219
ointed+0.219
rived+0.218
ulously+0.218
Neg logit contributions
izabeth-0.336
-0.324
 awa-0.310
 Definition-0.291
sense-0.289
Tokenen
Token ID268
Feature activation-4.27e-01

Pos logit contributions
ertodd+1.950
rived+1.932
rive+1.921
ulously+1.887
ointed+1.881
Neg logit contributions
izabeth-4.152
-4.027
 awa-3.850
 Definition-3.667
ッド-3.632
Token with
Token ID351
Feature activation-1.42e-01

Pos logit contributions
ertodd+0.890
rive+0.862
ointed+0.860
rived+0.855
ulously+0.855
Neg logit contributions
izabeth-1.621
-1.566
 awa-1.504
 Definition-1.418
ッド-1.409
Token exp
Token ID1033
Feature activation+1.95e-01

Pos logit contributions
ertodd+0.372
rive+0.364
ointed+0.364
rived+0.362
ulously+0.361
Neg logit contributions
izabeth-0.459
-0.442
 awa-0.420
 Definition-0.392
sense-0.389
Tokenlet
Token ID1616
Feature activation-1.33e-01

Pos logit contributions
izabeth+0.598
+0.574
 awa+0.545
 Definition+0.506
sense+0.503
Neg logit contributions
ertodd-0.549
rive-0.535
ointed-0.534
ulously-0.533
rived-0.532
Tokenives
Token ID1083
Feature activation+3.11e-01

Pos logit contributions
ertodd+0.372
ointed+0.368
rive+0.368
rived+0.366
ulously+0.365
Neg logit contributions
izabeth-0.404
-0.394
 awa-0.369
 Definition-0.343
sense-0.341
Token was
Token ID373
Feature activation+2.22e-02

Pos logit contributions
izabeth+0.314
+0.302
 awa+0.289
 Definition+0.271
sense+0.269
Neg logit contributions
ertodd-0.216
rive-0.211
ointed-0.211
ulously-0.210
rived-0.210
Token being
Token ID852
Feature activation+2.53e-01

Pos logit contributions
izabeth+0.076
+0.073
 awa+0.070
 Definition+0.065
sense+0.065
Neg logit contributions
ertodd-0.055
rive-0.053
ointed-0.053
rived-0.053
ulously-0.053
Token prepared
Token ID5597
Feature activation+1.36e-01

Pos logit contributions
izabeth+0.848
+0.814
 awa+0.780
 Definition+0.727
sense+0.722
Neg logit contributions
ertodd-0.642
rive-0.620
ointed-0.619
ulously-0.618
rived-0.618
Token at
Token ID379
Feature activation+6.99e-02

Pos logit contributions
izabeth+0.539
+0.518
 awa+0.499
 Definition+0.472
afety+0.471
Neg logit contributions
ertodd-0.260
ointed-0.249
rive-0.246
rived-0.246
ulously-0.246
Token Cape
Token ID15725
Feature activation-2.43e-01

Pos logit contributions
izabeth+0.218
+0.209
 awa+0.198
 Definition+0.185
sense+0.183
Neg logit contributions
ertodd-0.194
rive-0.188
ointed-0.188
rived-0.188
ulously-0.188
Token Canaveral
Token ID49433
Feature activation-1.03e+00

Pos logit contributions
ertodd+0.724
ointed+0.704
rive+0.703
ulously+0.702
rived+0.700
Neg logit contributions
izabeth-0.709
-0.681
 awa-0.642
 Definition-0.598
sense-0.590
Token in
Token ID287
Feature activation+6.25e-03

Pos logit contributions
ertodd+2.106
rive+2.084
ulously+2.073
rived+2.066
ointed+2.060
Neg logit contributions
izabeth-3.884
-3.753
 awa-3.634
ッド-3.396
sense-3.386
Token Florida
Token ID4744
Feature activation-6.34e-02

Pos logit contributions
izabeth+0.017
+0.017
 awa+0.016
 Definition+0.015
sense+0.014
Neg logit contributions
ertodd-0.019
ointed-0.019
rive-0.019
rived-0.019
ulously-0.019
Token for
Token ID329
Feature activation-1.51e-01

Pos logit contributions
ertodd+0.131
ointed+0.127
rive+0.127
rived+0.127
ulously+0.126
Neg logit contributions
izabeth-0.241
-0.233
 awa-0.224
 Definition-0.211
sense-0.210
Token an
Token ID281
Feature activation+1.03e-01

Pos logit contributions
ertodd+0.399
rive+0.390
ointed+0.390
rived+0.389
ulously+0.388
Neg logit contributions
izabeth-0.487
-0.468
 awa-0.446
 Definition-0.415
sense-0.413
Token 18
Token ID1248
Feature activation+4.05e-02

Pos logit contributions
izabeth+0.337
+0.324
 awa+0.310
 Definition+0.288
sense+0.286
Neg logit contributions
ertodd-0.270
rive-0.261
rived-0.260
ointed-0.260
ulously-0.260
TokenProperty
Token ID21746
Feature activation+9.91e-02

Pos logit contributions
ertodd+0.057
rive+0.056
ointed+0.056
ulously+0.056
rived+0.055
Neg logit contributions
izabeth-0.057
-0.054
 awa-0.052
sense-0.047
 Definition-0.047
Token '
Token ID705
Feature activation+2.25e-01

Pos logit contributions
izabeth+0.297
+0.284
 awa+0.269
 Definition+0.249
sense+0.248
Neg logit contributions
ertodd-0.285
rive-0.279
ointed-0.279
ulously-0.278
rived-0.278
TokenH
Token ID39
Feature activation-6.54e-02

Pos logit contributions
izabeth+0.748
+0.726
 awa+0.687
 Definition+0.646
sense+0.644
Neg logit contributions
ertodd-0.575
ointed-0.556
rived-0.553
rive-0.553
ulously-0.552
TokenKC
Token ID36222
Feature activation-9.23e-02

Pos logit contributions
ertodd+0.193
rive+0.189
ointed+0.188
rived+0.188
ulously+0.188
Neg logit contributions
izabeth-0.191
-0.183
 awa-0.174
 Definition-0.160
ッド-0.159
TokenU
Token ID52
Feature activation-1.71e-01

Pos logit contributions
ertodd+0.175
ointed+0.169
rive+0.168
rived+0.167
ulously+0.167
Neg logit contributions
izabeth-0.368
-0.356
 awa-0.342
 Definition-0.325
sense-0.323
Token:\
Token ID7479
Feature activation-1.02e+00

Pos logit contributions
ertodd+0.372
rive+0.366
ulously+0.364
ointed+0.364
rived+0.362
Neg logit contributions
izabeth-0.628
-0.607
 awa-0.583
 Definition-0.544
ッド-0.544
TokenControl
Token ID15988
Feature activation+3.07e-02

Pos logit contributions
ertodd+2.771
rive+2.756
rived+2.722
ointed+2.716
ulously+2.712
Neg logit contributions
izabeth-3.168
-3.027
 awa-2.892
 Definition-2.635
afety-2.628
Token Panel
Token ID18810
Feature activation+7.08e-01

Pos logit contributions
izabeth+0.094
+0.091
 awa+0.086
 Definition+0.081
sense+0.080
Neg logit contributions
ertodd-0.087
ointed-0.084
rive-0.084
ulously-0.084
rived-0.084
Token\
Token ID59
Feature activation-5.08e-01

Pos logit contributions
izabeth+2.424
+2.339
 awa+2.235
 Definition+2.128
ッド+2.087
Neg logit contributions
ertodd-1.744
ulously-1.672
rive-1.671
rived-1.669
ointed-1.665
TokenDesktop
Token ID36881
Feature activation+3.60e-01

Pos logit contributions
ertodd+1.249
rive+1.231
ointed+1.224
rived+1.220
ulously+1.217
Neg logit contributions
izabeth-1.725
-1.650
 awa-1.587
 Definition-1.470
sense-1.464
Token'
Token ID6
Feature activation+8.30e-02

Pos logit contributions
izabeth+1.221
+1.179
 awa+1.125
 Definition+1.059
sense+1.047
Neg logit contributions
ertodd-0.894
rive-0.866
rived-0.865
ointed-0.865
ulously-0.865
Token the
Token ID262
Feature activation+1.00e-01

Pos logit contributions
ertodd+0.054
rive+0.053
rived+0.053
ointed+0.053
ulously+0.053
Neg logit contributions
izabeth-0.066
-0.064
 awa-0.061
 Definition-0.057
sense-0.056
Token temple
Token ID12505
Feature activation+4.29e-02

Pos logit contributions
izabeth+0.350
+0.338
 awa+0.324
 Definition+0.304
sense+0.301
Neg logit contributions
ertodd-0.238
rive-0.231
ointed-0.230
rived-0.230
ulously-0.229
Token referring
Token ID9759
Feature activation+6.46e-02

Pos logit contributions
izabeth+0.150
+0.145
 awa+0.139
 Definition+0.131
sense+0.130
Neg logit contributions
ertodd-0.102
rive-0.099
rived-0.099
ointed-0.098
ulously-0.098
Token to
Token ID284
Feature activation+2.94e-02

Pos logit contributions
izabeth+0.260
+0.254
 awa+0.243
 Definition+0.230
ッド+0.230
Neg logit contributions
ertodd-0.120
rive-0.115
ointed-0.114
rived-0.114
ulously-0.113
Token Pand
Token ID16492
Feature activation+1.71e-01

Pos logit contributions
izabeth+0.099
+0.095
 awa+0.091
 Definition+0.085
sense+0.084
Neg logit contributions
ertodd-0.074
rive-0.072
rived-0.072
ointed-0.072
ulously-0.072
Tokenya
Token ID3972
Feature activation+5.75e-01

Pos logit contributions
izabeth+0.505
+0.484
 awa+0.459
 Definition+0.424
ッド+0.421
Neg logit contributions
ertodd-0.498
rive-0.488
ointed-0.487
rived-0.486
ulously-0.486
Token Kings
Token ID10578
Feature activation-1.81e-01

Pos logit contributions
izabeth+1.995
+1.934
 awa+1.849
 Definition+1.742
sense+1.721
Neg logit contributions
ertodd-1.379
rive-1.333
rived-1.329
ointed-1.325
ulously-1.317
Token and
Token ID290
Feature activation+6.27e-02

Pos logit contributions
ertodd+0.426
rive+0.415
ointed+0.415
rived+0.413
ulously+0.413
Neg logit contributions
izabeth-0.633
-0.610
 awa-0.584
 Definition-0.547
sense-0.545
Token Ray
Token ID7760
Feature activation+1.79e-01

Pos logit contributions
izabeth+0.209
+0.202
 awa+0.193
 Definition+0.181
sense+0.179
Neg logit contributions
ertodd-0.159
rive-0.154
rived-0.153
ointed-0.153
ulously-0.153
Tokena
Token ID64
Feature activation+1.23e-01

Pos logit contributions
izabeth+0.565
+0.542
 awa+0.516
 Definition+0.480
sense+0.477
Neg logit contributions
ertodd-0.486
rive-0.474
ointed-0.474
rived-0.472
ulously-0.472
Token (
Token ID357
Feature activation+2.28e-02

Pos logit contributions
izabeth+0.423
+0.408
 awa+0.390
 Definition+0.366
sense+0.363
Neg logit contributions
ertodd-0.300
rive-0.291
ointed-0.290
rived-0.290
ulously-0.289
Token and
Token ID290
Feature activation-1.22e-02

Pos logit contributions
ertodd+0.173
rive+0.168
ointed+0.168
rived+0.168
ulously+0.167
Neg logit contributions
izabeth-0.273
-0.264
 awa-0.253
 Definition-0.237
sense-0.236
Token it
Token ID340
Feature activation+1.63e-01

Pos logit contributions
ertodd+0.030
rive+0.029
ointed+0.029
ulously+0.029
rived+0.028
Neg logit contributions
izabeth-0.042
-0.041
 awa-0.039
 Definition-0.037
sense-0.036
Token strikes
Token ID8956
Feature activation-1.59e-01

Pos logit contributions
izabeth+0.562
+0.541
 awa+0.519
 Definition+0.485
sense+0.483
Neg logit contributions
ertodd-0.395
rive-0.381
ointed-0.381
rived-0.380
ulously-0.379
Token an
Token ID281
Feature activation+7.64e-03

Pos logit contributions
ertodd+0.469
rive+0.462
ointed+0.461
rived+0.460
ulously+0.459
Neg logit contributions
izabeth-0.465
-0.444
 awa-0.419
 Definition-0.388
ッド-0.387
Token excellent
Token ID6275
Feature activation-3.01e-02

Pos logit contributions
izabeth+0.028
+0.027
 awa+0.026
 Definition+0.025
ッド+0.025
Neg logit contributions
ertodd-0.016
ointed-0.016
rive-0.016
ulously-0.016
rived-0.016
Token balance
Token ID5236
Feature activation+5.19e-01

Pos logit contributions
ertodd+0.089
ointed+0.089
rive+0.089
rived+0.089
ulously+0.088
Neg logit contributions
izabeth-0.087
-0.082
 awa-0.078
ッド-0.072
 Definition-0.072
Token between
Token ID1022
Feature activation-1.28e-01

Pos logit contributions
izabeth+1.789
+1.717
 awa+1.650
 Definition+1.547
sense+1.545
Neg logit contributions
ertodd-1.265
rive-1.216
ointed-1.214
ulously-1.213
rived-1.204
Token speed
Token ID2866
Feature activation-2.21e-02

Pos logit contributions
ertodd+0.331
rive+0.325
ointed+0.325
rived+0.324
ulously+0.323
Neg logit contributions
izabeth-0.416
-0.402
 awa-0.381
 Definition-0.356
ッド-0.354
Token,
Token ID11
Feature activation+2.03e-01

Pos logit contributions
ertodd+0.053
rive+0.051
ointed+0.051
ulously+0.051
rived+0.051
Neg logit contributions
izabeth-0.077
-0.074
 awa-0.071
 Definition-0.067
sense-0.066
Token stability
Token ID10159
Feature activation+6.64e-02

Pos logit contributions
izabeth+0.606
+0.580
 awa+0.552
 Definition+0.511
sense+0.508
Neg logit contributions
ertodd-0.587
rive-0.571
ointed-0.570
ulously-0.570
rived-0.568
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
izabeth+0.228
+0.221
 awa+0.211
 Definition+0.197
sense+0.197
Neg logit contributions
ertodd-0.162
rive-0.157
ointed-0.157
ulously-0.156
rived-0.156
Token the
Token ID262
Feature activation+5.88e-02

Pos logit contributions
ertodd+0.289
rive+0.282
ointed+0.282
rived+0.281
ulously+0.281
Neg logit contributions
izabeth-0.359
-0.345
 awa-0.328
 Definition-0.307
sense-0.304
Token heart
Token ID2612
Feature activation-4.03e-01

Pos logit contributions
izabeth+0.198
+0.191
 awa+0.182
 Definition+0.170
sense+0.169
Neg logit contributions
ertodd-0.148
rive-0.144
ointed-0.143
rived-0.143
ulously-0.143
Token beat
Token ID4405
Feature activation-5.32e-01

Pos logit contributions
ertodd+1.015
rive+0.995
ointed+0.988
ulously+0.985
rived+0.983
Neg logit contributions
izabeth-1.349
-1.303
 awa-1.232
 Definition-1.163
ッド-1.153
Token faster
Token ID5443
Feature activation+2.01e-01

Pos logit contributions
ertodd+1.189
rive+1.165
ulously+1.156
rived+1.152
ointed+1.151
Neg logit contributions
izabeth-1.933
-1.870
 awa-1.774
 Definition-1.679
ッド-1.671
Token,
Token ID11
Feature activation-1.38e-01

Pos logit contributions
izabeth+0.696
+0.672
 awa+0.646
 Definition+0.604
sense+0.600
Neg logit contributions
ertodd-0.481
ointed-0.464
rived-0.464
rive-0.462
ulously-0.460
Token reverber
Token ID48134
Feature activation+5.82e-01

Pos logit contributions
ertodd+0.337
rive+0.328
ointed+0.328
rived+0.326
ulously+0.326
Neg logit contributions
izabeth-0.476
-0.459
 awa-0.437
 Definition-0.410
sense-0.408
Tokenating
Token ID803
Feature activation+5.19e-01

Pos logit contributions
izabeth+1.826
+1.730
 awa+1.657
 Definition+1.534
sense+1.532
Neg logit contributions
ertodd-1.600
ointed-1.551
rived-1.544
rive-1.541
ulously-1.522
Token against
Token ID1028
Feature activation-2.69e-01

Pos logit contributions
izabeth+1.876
+1.804
 awa+1.753
 Definition+1.637
sense+1.626
Neg logit contributions
ertodd-1.176
rive-1.122
ointed-1.121
rived-1.118
ulously-1.109
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
ertodd+0.707
rive+0.695
ointed+0.693
rived+0.691
ulously+0.689
Neg logit contributions
izabeth-0.868
-0.836
 awa-0.792
 Definition-0.740
sense-0.735
Token rib
Token ID12183
Feature activation+1.09e-01

Pos logit contributions
ertodd+0.425
rive+0.417
ointed+0.417
rived+0.415
ulously+0.414
Neg logit contributions
izabeth-0.522
-0.502
 awa-0.475
 Definition-0.444
ッド-0.442
Token cage
Token ID17186
Feature activation+5.77e-03

Pos logit contributions
izabeth+0.362
+0.349
 awa+0.333
 Definition+0.311
sense+0.309
Neg logit contributions
ertodd-0.280
rive-0.272
ointed-0.272
rived-0.271
ulously-0.271
Token once
Token ID1752
Feature activation+5.58e-01

Pos logit contributions
izabeth+0.860
+0.829
 awa+0.792
 Definition+0.743
sense+0.736
Neg logit contributions
ertodd-0.636
rive-0.618
ointed-0.615
ulously-0.615
rived-0.615
Token for
Token ID329
Feature activation+8.29e-02

Pos logit contributions
izabeth+1.771
+1.710
 awa+1.644
 Definition+1.524
sense+1.508
Neg logit contributions
ertodd-1.505
rive-1.450
ulously-1.448
rived-1.437
ointed-1.436
Token their
Token ID511
Feature activation+2.57e-02

Pos logit contributions
izabeth+0.269
+0.259
 awa+0.247
 Definition+0.231
sense+0.229
Neg logit contributions
ertodd-0.218
rive-0.212
ointed-0.211
ulously-0.211
rived-0.211
Token most
Token ID749
Feature activation+1.68e-01

Pos logit contributions
izabeth+0.087
+0.084
 awa+0.080
 Definition+0.075
sense+0.075
Neg logit contributions
ertodd-0.064
rive-0.062
ointed-0.062
rived-0.061
ulously-0.061
Token recent
Token ID2274
Feature activation+5.98e-02

Pos logit contributions
izabeth+0.580
+0.562
 awa+0.539
 Definition+0.503
sense+0.502
Neg logit contributions
ertodd-0.407
rive-0.391
ulously-0.390
rived-0.389
ointed-0.388
Token video
Token ID2008
Feature activation+5.61e-01

Pos logit contributions
izabeth+0.211
+0.204
 awa+0.196
 Definition+0.184
sense+0.183
Neg logit contributions
ertodd-0.141
rive-0.134
ointed-0.134
rived-0.134
ulously-0.134
Token,
Token ID11
Feature activation+7.11e-02

Pos logit contributions
izabeth+1.996
+1.951
 awa+1.870
 Definition+1.754
sense+1.741
Neg logit contributions
ertodd-1.295
rived-1.222
ulously-1.221
rive-1.220
ointed-1.213
Token
Token ID564
Feature activation-1.48e-02

Pos logit contributions
izabeth+0.240
+0.232
 awa+0.222
 Definition+0.208
sense+0.206
Neg logit contributions
ertodd-0.178
rive-0.172
rived-0.171
ulously-0.171
ointed-0.171
Token
Token ID250
Feature activation-5.21e-02

Pos logit contributions
ertodd+0.049
rive+0.048
ointed+0.048
ulously+0.048
rived+0.048
Neg logit contributions
izabeth-0.038
-0.036
 awa-0.034
 Definition-0.031
ッド-0.031
TokenMir
Token ID27453
Feature activation+1.45e-01

Pos logit contributions
ertodd+0.132
rive+0.129
ointed+0.129
rived+0.128
ulously+0.128
Neg logit contributions
izabeth-0.174
-0.168
 awa-0.160
 Definition-0.149
sense-0.148
Tokenacles
Token ID9928
Feature activation+1.01e-01

Pos logit contributions
izabeth+0.437
+0.418
 awa+0.397
 Definition+0.368
sense+0.366
Neg logit contributions
ertodd-0.414
rive-0.404
ointed-0.404
rived-0.403
ulously-0.402
Token
Token ID851
Feature activation+2.13e-01

Pos logit contributions
izabeth+0.513
+0.495
 awa+0.474
 Definition+0.447
sense+0.443
Neg logit contributions
ertodd-0.335
rive-0.325
ointed-0.324
ulously-0.322
rived-0.321
Token to
Token ID284
Feature activation+1.20e-02

Pos logit contributions
izabeth+0.816
+0.791
 awa+0.755
 Definition+0.715
sense+0.711
Neg logit contributions
ertodd-0.438
rive-0.419
ointed-0.417
ulously-0.415
rived-0.412
Token the
Token ID262
Feature activation-6.17e-02

Pos logit contributions
izabeth+0.040
+0.038
 awa+0.037
 Definition+0.034
sense+0.034
Neg logit contributions
ertodd-0.030
rive-0.029
ointed-0.029
rived-0.029
ulously-0.029
Token so
Token ID523
Feature activation-7.11e-02

Pos logit contributions
ertodd+0.157
rive+0.153
ointed+0.153
rived+0.153
ulously+0.153
Neg logit contributions
izabeth-0.204
-0.197
 awa-0.188
 Definition-0.175
sense-0.174
Token-
Token ID12
Feature activation+1.98e-01

Pos logit contributions
ertodd+0.175
rive+0.170
ointed+0.169
rived+0.169
ulously+0.169
Neg logit contributions
izabeth-0.243
-0.235
 awa-0.224
 Definition-0.210
sense-0.209
Tokencalled
Token ID7174
Feature activation+8.07e-01

Pos logit contributions
izabeth+0.394
+0.368
 awa+0.343
 Definition+0.300
ッド+0.295
Neg logit contributions
ertodd-0.772
rive-0.760
ointed-0.759
rived-0.758
ulously-0.758
Token
Token ID564
Feature activation+2.18e-01

Pos logit contributions
izabeth+2.563
+2.459
 awa+2.342
 Definition+2.220
ッド+2.172
Neg logit contributions
ertodd-2.203
rive-2.102
ulously-2.090
ointed-2.082
rived-2.072
Token
Token ID250
Feature activation+1.79e-01

Pos logit contributions
izabeth+0.751
+0.728
 awa+0.693
 Definition+0.652
ッド+0.649
Neg logit contributions
ertodd-0.533
ointed-0.510
ulously-0.510
rive-0.507
rived-0.505
Tokens
Token ID82
Feature activation-2.55e-02

Pos logit contributions
izabeth+0.606
+0.585
 awa+0.556
 Definition+0.522
sense+0.520
Neg logit contributions
ertodd-0.449
ointed-0.434
rive-0.434
ulously-0.433
rived-0.432
Tokenavage
Token ID33757
Feature activation-9.14e-02

Pos logit contributions
ertodd+0.067
rive+0.065
ointed+0.065
rived+0.065
ulously+0.065
Neg logit contributions
izabeth-0.082
-0.079
 awa-0.075
 Definition-0.070
sense-0.070
Token
Token ID447
Feature activation+1.66e-01

Pos logit contributions
ertodd+0.246
rive+0.241
ointed+0.240
rived+0.240
ulously+0.240
Neg logit contributions
izabeth-0.290
-0.279
 awa-0.265
 Definition-0.247
sense-0.245
Token their
Token ID511
Feature activation-1.13e-01

Pos logit contributions
izabeth+0.353
+0.339
 awa+0.322
 Definition+0.301
sense+0.298
Neg logit contributions
ertodd-0.303
rive-0.294
ointed-0.294
rived-0.293
ulously-0.293
Token imaging
Token ID19560
Feature activation+3.30e-01

Pos logit contributions
ertodd+0.302
rive+0.298
ointed+0.297
ulously+0.296
rived+0.296
Neg logit contributions
izabeth-0.357
-0.342
 awa-0.326
 Definition-0.302
ッド-0.301
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
izabeth+1.111
+1.074
 awa+1.025
 Definition+0.960
sense+0.954
Neg logit contributions
ertodd-0.824
rive-0.799
ointed-0.797
rived-0.795
ulously-0.794
Token transparency
Token ID13902
Feature activation-2.14e-02

Pos logit contributions
ertodd+0.319
rive+0.312
ointed+0.311
rived+0.310
ulously+0.310
Neg logit contributions
izabeth-0.386
-0.371
 awa-0.353
 Definition-0.329
sense-0.327
Token,
Token ID11
Feature activation-1.56e-01

Pos logit contributions
ertodd+0.049
rive+0.048
ointed+0.048
rived+0.048
ulously+0.048
Neg logit contributions
izabeth-0.076
-0.074
 awa-0.070
 Definition-0.066
sense-0.066
Token sound
Token ID2128
Feature activation-9.42e-02

Pos logit contributions
ertodd+0.393
rive+0.386
ointed+0.385
ulously+0.383
rived+0.383
Neg logit contributions
izabeth-0.519
-0.498
 awa-0.477
 Definition-0.444
ッド-0.443
Tokenst
Token ID301
Feature activation-1.38e-01

Pos logit contributions
ertodd+0.284
rive+0.279
ointed+0.278
rived+0.278
ulously+0.277
Neg logit contributions
izabeth-0.268
-0.256
 awa-0.242
 Definition-0.223
ッド-0.222
Tokenaging
Token ID3039
Feature activation+3.64e-02

Pos logit contributions
ertodd+0.456
rive+0.452
ointed+0.449
rived+0.448
ulously+0.441
Neg logit contributions
izabeth-0.357
-0.341
 awa-0.319
ッド-0.297
 Definition-0.290
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
izabeth+0.125
+0.121
 awa+0.116
 Definition+0.108
sense+0.108
Neg logit contributions
ertodd-0.088
rive-0.086
ointed-0.086
rived-0.086
ulously-0.086
Token and
Token ID290
Feature activation+8.78e-02

Pos logit contributions
ertodd+0.027
rive+0.027
ointed+0.027
rived+0.027
ulously+0.027
Neg logit contributions
izabeth-0.038
-0.037
 awa-0.035
 Definition-0.033
ッド-0.033
Token surprising
Token ID6452
Feature activation+7.03e-02

Pos logit contributions
izabeth+0.268
+0.256
 awa+0.244
 Definition+0.225
ッド+0.224
Neg logit contributions
ertodd-0.248
rive-0.243
ointed-0.242
rived-0.241
ulously-0.241
Token tremb
Token ID29868
Feature activation+2.60e-01

Pos logit contributions
ertodd+0.618
rive+0.605
ointed+0.602
rived+0.602
ulously+0.599
Neg logit contributions
izabeth-1.069
-1.031
 awa-0.990
 Definition-0.931
ッド-0.929
Tokenled
Token ID992
Feature activation+8.54e-02

Pos logit contributions
izabeth+0.766
+0.734
 awa+0.693
sense+0.643
 Definition+0.642
Neg logit contributions
ertodd-0.761
rive-0.734
ointed-0.732
rived-0.731
ulously-0.728
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
izabeth+0.303
+0.293
 awa+0.281
 Definition+0.263
sense+0.262
Neg logit contributions
ertodd-0.198
rive-0.192
ointed-0.192
rived-0.191
ulously-0.191
Token<|endoftext|>
Token ID50256
Feature activation+1.49e-01

Pos logit contributions
ertodd+0.593
rive+0.577
ointed+0.576
rived+0.574
ulously+0.573
Neg logit contributions
izabeth-0.793
-0.763
 awa-0.730
 Definition-0.680
ッド-0.676
TokenAl
Token ID2348
Feature activation-1.79e-01

Pos logit contributions
izabeth+0.505
+0.490
 awa+0.463
 Definition+0.436
sense+0.435
Neg logit contributions
ertodd-0.371
ointed-0.360
rive-0.359
ulously-0.358
rived-0.358
Tokenain
Token ID391
Feature activation+2.87e-01

Pos logit contributions
ertodd+0.507
rive+0.497
ointed+0.496
rived+0.495
ulously+0.494
Neg logit contributions
izabeth-0.541
-0.520
 awa-0.494
 Definition-0.457
sense-0.454
Token B
Token ID347
Feature activation-1.42e-01

Pos logit contributions
izabeth+1.004
+0.974
 awa+0.928
 Definition+0.880
ッド+0.870
Neg logit contributions
ertodd-0.678
ulously-0.654
rive-0.653
ointed-0.653
rived-0.652
Token ad
Token ID512
Feature activation+1.80e-01

Pos logit contributions
ertodd+0.360
rive+0.349
ointed+0.349
ulously+0.348
rived+0.348
Neg logit contributions
izabeth-0.474
-0.456
 awa-0.436
 Definition-0.407
sense-0.405
Tokeni
Token ID72
Feature activation-1.59e-01

Pos logit contributions
izabeth+0.611
+0.586
 awa+0.563
 Definition+0.526
sense+0.524
Neg logit contributions
ertodd-0.447
ointed-0.432
ulously-0.431
rive-0.431
rived-0.429
Tokenou
Token ID280
Feature activation-1.31e-01

Pos logit contributions
ertodd+0.380
rive+0.371
ointed+0.370
rived+0.369
ulously+0.369
Neg logit contributions
izabeth-0.552
-0.532
 awa-0.508
 Definition-0.476
sense-0.474
Token The
Token ID383
Feature activation-1.93e-01

Pos logit contributions
ertodd+0.324
rive+0.316
ointed+0.315
rived+0.315
ulously+0.314
Neg logit contributions
izabeth-0.445
-0.428
 awa-0.409
 Definition-0.382
ッド-0.380
Token were
Token ID547
Feature activation+1.07e-01

Pos logit contributions
ertodd+1.047
rived+1.038
ointed+1.037
ulously+1.035
rive+1.034
Neg logit contributions
izabeth-1.580
-1.542
 awa-1.460
ッド-1.383
 Definition-1.371
Token set
Token ID900
Feature activation+2.04e-02

Pos logit contributions
izabeth+0.370
+0.357
 awa+0.341
 Definition+0.320
sense+0.318
Neg logit contributions
ertodd-0.258
ointed-0.251
rive-0.251
ulously-0.250
rived-0.250
Token to
Token ID284
Feature activation+1.05e-01

Pos logit contributions
izabeth+0.080
+0.077
 awa+0.075
sense+0.070
 Definition+0.070
Neg logit contributions
ertodd-0.041
ointed-0.038
rive-0.038
rived-0.038
ulously-0.038
Token enjoy
Token ID2883
Feature activation-1.79e-01

Pos logit contributions
izabeth+0.378
+0.364
 awa+0.350
 Definition+0.329
sense+0.326
Neg logit contributions
ertodd-0.237
rive-0.228
ointed-0.228
rived-0.227
ulously-0.227
Token a
Token ID257
Feature activation+1.09e-02

Pos logit contributions
ertodd+0.477
rive+0.464
ointed+0.464
rived+0.462
ulously+0.462
Neg logit contributions
izabeth-0.576
-0.553
 awa-0.527
 Definition-0.491
sense-0.488
Token wonderful
Token ID7932
Feature activation+2.46e-02

Pos logit contributions
izabeth+0.039
+0.037
 awa+0.036
 Definition+0.034
sense+0.033
Neg logit contributions
ertodd-0.025
rive-0.024
ointed-0.024
rived-0.024
ulously-0.024
Token Easter
Token ID20111
Feature activation-3.72e-02

Pos logit contributions
izabeth+0.095
+0.092
 awa+0.088
 Definition+0.084
sense+0.083
Neg logit contributions
ertodd-0.049
rive-0.047
ointed-0.047
rived-0.047
ulously-0.047
Token Br
Token ID1709
Feature activation-2.53e-02

Pos logit contributions
ertodd+0.070
ointed+0.067
rive+0.067
ulously+0.067
rived+0.067
Neg logit contributions
izabeth-0.148
-0.144
 awa-0.139
 Definition-0.131
sense-0.130
Tokenunch
Token ID3316
Feature activation-4.24e-01

Pos logit contributions
ertodd+0.087
rive+0.086
ointed+0.086
rived+0.085
ulously+0.085
Neg logit contributions
izabeth-0.061
-0.058
 awa-0.054
 Definition-0.049
sense-0.049
Token at
Token ID379
Feature activation-1.07e-03

Pos logit contributions
ertodd+0.914
ointed+0.896
rived+0.895
rive+0.895
ulously+0.890
Neg logit contributions
izabeth-1.563
-1.510
 awa-1.443
ッド-1.359
sense-1.356
Token Ben
Token ID3932
Feature activation+9.35e-02

Pos logit contributions
ertodd+0.003
ointed+0.003
rive+0.003
ulously+0.003
rived+0.003
Neg logit contributions
izabeth-0.004
-0.003
 awa-0.003
 Definition-0.003
sense-0.003
Token a
Token ID257
Feature activation+9.53e-02

Pos logit contributions
ertodd+0.631
ointed+0.619
rive+0.617
ulously+0.616
rived+0.615
Neg logit contributions
izabeth-0.730
-0.701
 awa-0.666
 Definition-0.620
ッド-0.617
Token ringing
Token ID32333
Feature activation-1.61e-01

Pos logit contributions
izabeth+0.311
+0.299
 awa+0.285
 Definition+0.265
ッド+0.264
Neg logit contributions
ertodd-0.248
rive-0.243
ointed-0.242
rived-0.242
ulously-0.241
Token slap
Token ID23905
Feature activation-3.34e-01

Pos logit contributions
ertodd+0.458
rive+0.450
ointed+0.449
ulously+0.449
rived+0.446
Neg logit contributions
izabeth-0.481
-0.462
 awa-0.436
ッド-0.405
 Definition-0.399
Tokendown
Token ID2902
Feature activation+1.12e-02

Pos logit contributions
ertodd+0.663
rive+0.644
ointed+0.643
rived+0.640
ulously+0.640
Neg logit contributions
izabeth-1.297
-1.255
 awa-1.207
 Definition-1.138
ッド-1.134
Token against
Token ID1028
Feature activation-1.93e-01

Pos logit contributions
izabeth+0.044
+0.043
 awa+0.041
 Definition+0.039
sense+0.039
Neg logit contributions
ertodd-0.022
rive-0.021
ointed-0.021
ulously-0.021
rived-0.021
Token the
Token ID262
Feature activation-3.40e-02

Pos logit contributions
ertodd+0.509
ointed+0.500
ulously+0.497
rive+0.497
rived+0.497
Neg logit contributions
izabeth-0.619
-0.596
 awa-0.564
sense-0.526
 Definition-0.526
Token conf
Token ID1013
Feature activation+6.90e-02

Pos logit contributions
ertodd+0.089
ointed+0.087
ulously+0.087
rive+0.087
rived+0.087
Neg logit contributions
izabeth-0.110
-0.106
 awa-0.101
 Definition-0.093
ッド-0.093
Tokenlation
Token ID7592
Feature activation-1.85e-01

Pos logit contributions
izabeth+0.237
+0.229
 awa+0.219
 Definition+0.205
sense+0.203
Neg logit contributions
ertodd-0.168
rive-0.164
ointed-0.163
rived-0.163
ulously-0.163
Token of
Token ID286
Feature activation+1.73e-01

Pos logit contributions
ertodd+0.353
ointed+0.342
rive+0.342
rived+0.341
ulously+0.340
Neg logit contributions
izabeth-0.733
-0.710
 awa-0.682
 Definition-0.644
sense-0.642
Token sovereign
Token ID18901
Feature activation+1.11e-01

Pos logit contributions
izabeth+0.551
+0.529
 awa+0.503
 Definition+0.468
sense+0.466
Neg logit contributions
ertodd-0.463
ointed-0.454
rive-0.453
rived-0.452
ulously-0.452
Token Israel
Token ID2692
Feature activation+1.51e-01

Pos logit contributions
izabeth+0.352
+0.338
 awa+0.322
 Definition+0.299
sense+0.297
Neg logit contributions
ertodd-0.302
rive-0.295
ointed-0.295
rived-0.294
ulously-0.293
Token picked
Token ID6497
Feature activation-5.90e-02

Pos logit contributions
ertodd+0.538
ointed+0.524
rive+0.524
rived+0.521
ulously+0.521
Neg logit contributions
izabeth-0.778
-0.749
 awa-0.717
 Definition-0.672
ッド-0.668
Token based
Token ID1912
Feature activation-2.08e-01

Pos logit contributions
ertodd+0.148
rive+0.145
ointed+0.145
ulously+0.144
rived+0.144
Neg logit contributions
izabeth-0.198
-0.191
 awa-0.182
 Definition-0.170
sense-0.169
Token more
Token ID517
Feature activation+3.85e-02

Pos logit contributions
ertodd+0.391
ointed+0.376
rive+0.376
ulously+0.374
rived+0.373
Neg logit contributions
izabeth-0.832
-0.805
 awa-0.776
 Definition-0.733
sense-0.729
Token on
Token ID319
Feature activation-3.88e-02

Pos logit contributions
izabeth+0.154
+0.149
 awa+0.143
 Definition+0.135
sense+0.135
Neg logit contributions
ertodd-0.073
rive-0.070
ointed-0.069
rived-0.069
ulously-0.069
Token who
Token ID508
Feature activation-2.67e-01

Pos logit contributions
ertodd+0.106
ointed+0.104
rive+0.104
rived+0.104
ulously+0.104
Neg logit contributions
izabeth-0.121
-0.116
 awa-0.110
ッド-0.102
 Definition-0.102
Token would
Token ID561
Feature activation+2.12e-02

Pos logit contributions
ertodd+0.634
ointed+0.620
rive+0.620
rived+0.618
ulously+0.615
Neg logit contributions
izabeth-0.929
-0.896
 awa-0.856
 Definition-0.803
ッド-0.799
Token be
Token ID307
Feature activation+7.07e-02

Pos logit contributions
izabeth+0.079
+0.076
 awa+0.073
 Definition+0.069
sense+0.069
Neg logit contributions
ertodd-0.046
ointed-0.044
ulously-0.044
rive-0.044
rived-0.044
Token better
Token ID1365
Feature activation+1.40e-02

Pos logit contributions
izabeth+0.244
+0.235
 awa+0.225
 Definition+0.211
sense+0.210
Neg logit contributions
ertodd-0.172
rive-0.166
ulously-0.166
rived-0.165
ointed-0.165
Token fitted
Token ID18235
Feature activation-5.62e-02

Pos logit contributions
izabeth+0.050
+0.049
 awa+0.047
 Definition+0.044
sense+0.043
Neg logit contributions
ertodd-0.032
rive-0.031
ulously-0.031
rived-0.031
ointed-0.031
Token for
Token ID329
Feature activation-2.91e-01

Pos logit contributions
ertodd+0.113
rive+0.109
ointed+0.109
rived+0.108
ulously+0.108
Neg logit contributions
izabeth-0.217
-0.210
 awa-0.202
 Definition-0.191
sense-0.190
Token the
Token ID262
Feature activation-1.48e-01

Pos logit contributions
ertodd+0.765
ointed+0.752
rive+0.749
rived+0.748
ulously+0.746
Neg logit contributions
izabeth-0.936
-0.899
 awa-0.856
ッド-0.797
 Definition-0.794
Token up
Token ID510
Feature activation-4.80e-02

Pos logit contributions
izabeth+0.445
+0.429
 awa+0.414
 Definition+0.390
sense+0.388
Neg logit contributions
ertodd-0.253
ointed-0.243
rive-0.243
rived-0.241
ulously-0.240
Token all
Token ID477
Feature activation-2.31e-02

Pos logit contributions
ertodd+0.122
rive+0.118
ointed+0.118
rived+0.118
ulously+0.118
Neg logit contributions
izabeth-0.160
-0.154
 awa-0.147
 Definition-0.138
sense-0.137
Token the
Token ID262
Feature activation-2.61e-01

Pos logit contributions
ertodd+0.054
rive+0.052
rived+0.052
ointed+0.052
ulously+0.051
Neg logit contributions
izabeth-0.082
-0.079
 awa-0.076
sense-0.071
 Definition-0.071
Token remaining
Token ID5637
Feature activation+1.02e-01

Pos logit contributions
ertodd+0.683
rive+0.666
ointed+0.665
rived+0.663
ulously+0.663
Neg logit contributions
izabeth-0.848
-0.815
 awa-0.777
 Definition-0.725
sense-0.720
Token cap
Token ID1451
Feature activation-1.53e-02

Pos logit contributions
izabeth+0.324
+0.311
 awa+0.298
 Definition+0.277
sense+0.275
Neg logit contributions
ertodd-0.275
rive-0.266
rived-0.266
ointed-0.266
ulously-0.265
Token space
Token ID2272
Feature activation-4.07e-01

Pos logit contributions
ertodd+0.046
rive+0.044
ointed+0.044
rived+0.044
ulously+0.044
Neg logit contributions
izabeth-0.044
-0.042
 awa-0.040
 Definition-0.037
sense-0.037
Token.
Token ID13
Feature activation-1.52e-01

Pos logit contributions
ertodd+0.934
rive+0.906
ointed+0.905
rived+0.901
ulously+0.901
Neg logit contributions
izabeth-1.455
-1.404
 awa-1.345
 Definition-1.264
sense-1.256
Token
Token ID198
Feature activation-6.51e-02

Pos logit contributions
ertodd+0.381
rive+0.371
ointed+0.370
rived+0.370
ulously+0.369
Neg logit contributions
izabeth-0.512
-0.493
 awa-0.470
 Definition-0.442
sense-0.438
Token
Token ID198
Feature activation-5.16e-02

Pos logit contributions
ertodd+0.171
rive+0.166
rived+0.166
ulously+0.166
ointed+0.166
Neg logit contributions
izabeth-0.212
-0.205
 awa-0.190
 Definition-0.182
sense-0.181
TokenSo
Token ID2396
Feature activation-3.85e-02

Pos logit contributions
ertodd+0.123
ointed+0.119
rive+0.119
rived+0.119
ulously+0.119
Neg logit contributions
izabeth-0.180
-0.175
 awa-0.165
 Definition-0.156
sense-0.156
Token,
Token ID11
Feature activation-5.82e-02

Pos logit contributions
ertodd+0.096
rive+0.093
rived+0.093
ointed+0.093
ulously+0.093
Neg logit contributions
izabeth-0.130
-0.125
 awa-0.120
 Definition-0.113
sense-0.111
Token game
Token ID983
Feature activation+1.48e-01

Pos logit contributions
izabeth+0.647
+0.623
 awa+0.603
 Definition+0.569
sense+0.565
Neg logit contributions
ertodd-0.376
ointed-0.361
ulously-0.360
rive-0.360
rived-0.359
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
izabeth+0.523
+0.503
 awa+0.483
 Definition+0.455
sense+0.453
Neg logit contributions
ertodd-0.350
rive-0.337
ointed-0.337
rived-0.336
ulously-0.335
Token system
Token ID1080
Feature activation-1.40e-01

Pos logit contributions
ertodd+0.251
rive+0.243
ointed+0.242
ulously+0.241
rived+0.241
Neg logit contributions
izabeth-0.403
-0.389
 awa-0.373
 Definition-0.351
sense-0.349
Token.
Token ID13
Feature activation-5.18e-02

Pos logit contributions
ertodd+0.329
rive+0.318
ointed+0.318
rived+0.317
ulously+0.316
Neg logit contributions
izabeth-0.495
-0.476
 awa-0.457
 Definition-0.429
sense-0.426
Token This
Token ID770
Feature activation+4.61e-02

Pos logit contributions
ertodd+0.132
rive+0.128
ointed+0.128
rived+0.128
ulously+0.128
Neg logit contributions
izabeth-0.173
-0.166
 awa-0.159
 Definition-0.148
sense-0.147
Token is
Token ID318
Feature activation-2.66e-01

Pos logit contributions
izabeth+0.175
+0.170
 awa+0.164
 Definition+0.155
sense+0.153
Neg logit contributions
ertodd-0.096
rive-0.092
ointed-0.092
rived-0.092
ulously-0.092
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
ertodd+0.671
ointed+0.661
rive+0.660
rived+0.657
ulously+0.657
Neg logit contributions
izabeth-0.886
-0.853
 awa-0.809
 Definition-0.752
ッド-0.751
Token first
Token ID717
Feature activation-9.97e-02

Pos logit contributions
ertodd+0.304
rive+0.297
ointed+0.297
rived+0.296
ulously+0.295
Neg logit contributions
izabeth-0.407
-0.392
 awa-0.373
 Definition-0.348
ッド-0.347
Token of
Token ID286
Feature activation+3.44e-02

Pos logit contributions
ertodd+0.215
rive+0.205
ointed+0.205
ulously+0.205
rived+0.204
Neg logit contributions
izabeth-0.372
-0.360
 awa-0.347
 Definition-0.327
sense-0.325
Token the
Token ID262
Feature activation-9.59e-02

Pos logit contributions
izabeth+0.113
+0.109
 awa+0.103
 Definition+0.096
sense+0.096
Neg logit contributions
ertodd-0.089
rive-0.087
ointed-0.087
ulously-0.086
rived-0.086
Token weekly
Token ID10273
Feature activation-8.60e-02

Pos logit contributions
ertodd+0.222
rive+0.215
ointed+0.214
rived+0.214
ulously+0.213
Neg logit contributions
izabeth-0.342
-0.330
 awa-0.316
 Definition-0.297
sense-0.295
Token well
Token ID880
Feature activation-1.42e-01

Pos logit contributions
ertodd+0.237
rive+0.229
ointed+0.229
rived+0.228
ulously+0.228
Neg logit contributions
izabeth-0.352
-0.339
 awa-0.325
 Definition-0.305
sense-0.303
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
ertodd+0.322
rive+0.311
ointed+0.311
rived+0.310
ulously+0.310
Neg logit contributions
izabeth-0.515
-0.497
 awa-0.477
 Definition-0.449
sense-0.446
Token
Token ID198
Feature activation+1.74e-01

Pos logit contributions
izabeth+0.679
+0.654
 awa+0.622
 Definition+0.586
sense+0.579
Neg logit contributions
ertodd-0.527
rive-0.514
ointed-0.512
rived-0.512
ulously-0.512
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
izabeth+0.560
+0.542
 awa+0.499
 Definition+0.482
sense+0.479
Neg logit contributions
ertodd-0.464
rived-0.450
rive-0.450
ulously-0.450
ointed-0.448
TokenAfter
Token ID3260
Feature activation-2.26e-01

Pos logit contributions
izabeth+0.441
+0.426
 awa+0.405
 Definition+0.381
sense+0.379
Neg logit contributions
ertodd-0.303
rive-0.294
ointed-0.294
rived-0.293
ulously-0.293
Token the
Token ID262
Feature activation-2.50e-01

Pos logit contributions
ertodd+0.557
rive+0.543
ointed+0.542
rived+0.540
ulously+0.540
Neg logit contributions
izabeth-0.769
-0.741
 awa-0.708
 Definition-0.662
sense-0.659
Token initial
Token ID4238
Feature activation+9.86e-02

Pos logit contributions
ertodd+0.641
ointed+0.626
rive+0.626
ulously+0.624
rived+0.624
Neg logit contributions
izabeth-0.826
-0.796
 awa-0.761
 Definition-0.707
sense-0.705
Token euph
Token ID40381
Feature activation+5.68e-01

Pos logit contributions
izabeth+0.317
+0.305
 awa+0.290
 Definition+0.271
sense+0.269
Neg logit contributions
ertodd-0.262
rive-0.255
ointed-0.255
rived-0.254
ulously-0.254
Tokenoria
Token ID7661
Feature activation+4.09e-01

Pos logit contributions
izabeth+1.812
+1.707
 awa+1.634
 Definition+1.534
sense+1.527
Neg logit contributions
ertodd-1.520
ointed-1.493
ulously-1.479
rive-1.474
rived-1.460
Token wears
Token ID17326
Feature activation-3.64e-01

Pos logit contributions
izabeth+1.513
+1.451
 awa+1.398
 Definition+1.315
sense+1.310
Neg logit contributions
ertodd-0.893
ulously-0.857
rive-0.857
ointed-0.851
rived-0.849
Token off
Token ID572
Feature activation-1.46e-01

Pos logit contributions
ertodd+0.701
ulously+0.682
ointed+0.680
rive+0.679
rived+0.678
Neg logit contributions
izabeth-1.430
-1.387
 awa-1.323
 Definition-1.258
sense-1.250
Token Our
Token ID3954
Feature activation-6.39e-02

Pos logit contributions
ertodd+0.687
ointed+0.671
rive+0.671
ulously+0.668
rived+0.666
Neg logit contributions
izabeth-0.963
-0.931
 awa-0.886
sense-0.825
 Definition-0.825
Token sub
Token ID850
Feature activation-3.37e-01

Pos logit contributions
ertodd+0.157
rive+0.153
ointed+0.153
rived+0.152
ulously+0.152
Neg logit contributions
izabeth-0.218
-0.210
 awa-0.201
 Definition-0.188
sense-0.187
Token-
Token ID12
Feature activation-1.14e-01

Pos logit contributions
ertodd+0.823
rive+0.801
ointed+0.800
rived+0.797
ulously+0.796
Neg logit contributions
izabeth-1.154
-1.112
 awa-1.063
 Definition-0.995
sense-0.989
Token2
Token ID17
Feature activation-1.10e-01

Pos logit contributions
ertodd+0.322
rive+0.314
ointed+0.314
rived+0.313
ulously+0.312
Neg logit contributions
izabeth-0.349
-0.335
 awa-0.318
 Definition-0.296
sense-0.294
Token marathon
Token ID22336
Feature activation-1.98e-01

Pos logit contributions
ertodd+0.215
ointed+0.208
rive+0.206
rived+0.206
ulously+0.206
Neg logit contributions
izabeth-0.430
-0.416
 awa-0.399
 Definition-0.379
sense-0.376
Token page
Token ID2443
Feature activation-3.17e-01

Pos logit contributions
ertodd+0.487
rive+0.475
ointed+0.474
rived+0.472
ulously+0.472
Neg logit contributions
izabeth-0.674
-0.649
 awa-0.620
 Definition-0.580
sense-0.577
Token has
Token ID468
Feature activation-1.19e-01

Pos logit contributions
ertodd+0.727
rive+0.719
ointed+0.718
ulously+0.714
rived+0.707
Neg logit contributions
izabeth-1.123
-1.082
 awa-1.036
ッド-0.971
sense-0.969
Token everything
Token ID2279
Feature activation-1.26e-01

Pos logit contributions
ertodd+0.289
rive+0.282
ointed+0.282
ulously+0.280
rived+0.280
Neg logit contributions
izabeth-0.411
-0.396
 awa-0.378
 Definition-0.355
sense-0.353
Token you
Token ID345
Feature activation-1.50e-01

Pos logit contributions
ertodd+0.329
rive+0.322
ointed+0.322
ulously+0.320
rived+0.320
Neg logit contributions
izabeth-0.412
-0.397
 awa-0.378
 Definition-0.352
sense-0.350
Token need
Token ID761
Feature activation-5.50e-02

Pos logit contributions
ertodd+0.398
rive+0.389
ointed+0.389
rived+0.388
ulously+0.387
Neg logit contributions
izabeth-0.484
-0.465
 awa-0.444
 Definition-0.413
ッド-0.410
Token to
Token ID284
Feature activation-8.21e-02

Pos logit contributions
ertodd+0.101
ointed+0.096
rive+0.096
ulously+0.096
rived+0.095
Neg logit contributions
izabeth-0.223
-0.216
 awa-0.208
 Definition-0.197
sense-0.196
Token
Token ID851
Feature activation-1.27e-01

Pos logit contributions
izabeth+0.150
+0.144
 awa+0.140
 Definition+0.132
sense+0.131
Neg logit contributions
ertodd-0.079
ointed-0.076
rive-0.076
rived-0.076
ulously-0.075
Token but
Token ID475
Feature activation-1.72e-01

Pos logit contributions
ertodd+0.308
rive+0.299
ointed+0.298
rived+0.297
ulously+0.297
Neg logit contributions
izabeth-0.441
-0.425
 awa-0.406
 Definition-0.381
sense-0.379
Token as
Token ID355
Feature activation+1.83e-01

Pos logit contributions
ertodd+0.421
rive+0.410
ointed+0.409
rived+0.408
ulously+0.407
Neg logit contributions
izabeth-0.591
-0.570
 awa-0.545
 Definition-0.510
sense-0.507
Token a
Token ID257
Feature activation-9.86e-02

Pos logit contributions
izabeth+0.597
+0.574
 awa+0.548
 Definition+0.512
sense+0.509
Neg logit contributions
ertodd-0.478
ointed-0.463
rive-0.463
rived-0.462
ulously-0.461
Token native
Token ID6868
Feature activation+1.78e-01

Pos logit contributions
ertodd+0.236
rive+0.231
ointed+0.230
rived+0.230
ulously+0.230
Neg logit contributions
izabeth-0.342
-0.329
 awa-0.315
 Definition-0.295
ッド-0.293
Token Vietnamese
Token ID23618
Feature activation-2.70e-01

Pos logit contributions
izabeth+0.613
+0.591
 awa+0.565
 Definition+0.532
sense+0.529
Neg logit contributions
ertodd-0.435
ulously-0.415
rive-0.414
ointed-0.413
rived-0.411
Token person
Token ID1048
Feature activation-1.49e-01

Pos logit contributions
ertodd+0.656
rive+0.640
ointed+0.639
rived+0.637
ulously+0.635
Neg logit contributions
izabeth-0.926
-0.892
 awa-0.852
 Definition-0.798
sense-0.793
Token we
Token ID356
Feature activation+9.28e-02

Pos logit contributions
ertodd+0.355
rive+0.344
ointed+0.344
ulously+0.343
rived+0.342
Neg logit contributions
izabeth-0.519
-0.500
 awa-0.478
 Definition-0.449
sense-0.446
Token strive
Token ID22076
Feature activation+8.01e-02

Pos logit contributions
izabeth+0.340
+0.328
 awa+0.314
 Definition+0.296
sense+0.295
Neg logit contributions
ertodd-0.205
rive-0.199
ointed-0.199
ulously-0.198
rived-0.198
Token hard
Token ID1327
Feature activation-4.72e-03

Pos logit contributions
izabeth+0.320
+0.309
 awa+0.299
 Definition+0.283
sense+0.281
Neg logit contributions
ertodd-0.151
ointed-0.144
rived-0.143
rive-0.143
ulously-0.143
Token because
Token ID780
Feature activation-1.97e-01

Pos logit contributions
ertodd+0.010
ointed+0.009
rive+0.009
rived+0.009
ulously+0.009
Neg logit contributions
izabeth-0.018
-0.018
 awa-0.017
 Definition-0.016
sense-0.016
Token support
Token ID1104
Feature activation+2.68e-02

Pos logit contributions
izabeth+1.154
+1.112
 awa+1.063
 Definition+0.996
sense+0.990
Neg logit contributions
ertodd-0.801
rive-0.780
ointed-0.779
rived-0.776
ulously-0.775
Token the
Token ID262
Feature activation-1.26e-01

Pos logit contributions
izabeth+0.086
+0.082
 awa+0.079
 Definition+0.073
sense+0.073
Neg logit contributions
ertodd-0.071
ointed-0.069
rive-0.069
rived-0.069
ulously-0.069
Token mission
Token ID4365
Feature activation+2.10e-01

Pos logit contributions
rive+0.345
ulously+0.340
rived+0.331
 rig+0.328
ointed+0.328
Neg logit contributions
izabeth-0.400
-0.375
 awa-0.367
afety-0.339
ッド-0.336
Token of
Token ID286
Feature activation+1.03e-01

Pos logit contributions
izabeth+0.812
+0.780
 awa+0.741
 Definition+0.721
sense+0.720
Neg logit contributions
ertodd-0.431
ointed-0.407
ulously-0.404
rive-0.403
rived-0.399
Token New
Token ID968
Feature activation-8.03e-02

Pos logit contributions
izabeth+0.340
+0.328
 awa+0.313
ッド+0.291
 Definition+0.291
Neg logit contributions
ertodd-0.259
rive-0.255
ulously-0.254
ointed-0.254
rived-0.253
Token Advent
Token ID33732
Feature activation-9.98e-01

Pos logit contributions
ertodd+0.166
rive+0.163
ointed+0.161
rived+0.160
ulously+0.159
Neg logit contributions
izabeth-0.306
-0.296
 awa-0.285
ッド-0.266
sense-0.266
Token and
Token ID290
Feature activation+1.15e-01

Pos logit contributions
ertodd+2.141
rive+2.123
ointed+2.105
ulously+2.075
rived+2.067
Neg logit contributions
izabeth-3.738
-3.579
 awa-3.524
ッド-3.208
 Definition-3.189
Token get
Token ID651
Feature activation+7.68e-02

Pos logit contributions
izabeth+0.356
+0.334
 awa+0.314
 Definition+0.302
sense+0.298
Neg logit contributions
ertodd-0.328
rived-0.313
ulously-0.309
ointed-0.308
rive-0.306
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
izabeth+0.237
+0.229
 awa+0.216
 Definition+0.203
sense+0.201
Neg logit contributions
ertodd-0.214
rived-0.208
rive-0.207
ulously-0.207
ointed-0.206
Token full
Token ID1336
Feature activation-3.18e-03

Pos logit contributions
izabeth+0.828
+0.800
 awa+0.768
 Definition+0.724
sense+0.720
Neg logit contributions
ertodd-0.445
rive-0.431
ointed-0.431
ulously-0.429
rived-0.429
Token contents
Token ID10154
Feature activation-2.84e-01

Pos logit contributions
 rig+0.008
rive+0.008
ointed+0.008
ertodd+0.008
rived+0.008
Neg logit contributions
izabeth-0.010
-0.010
 awa-0.010
afety-0.009
ッド-0.009
Token linear
Token ID14174
Feature activation-9.31e-02

Pos logit contributions
ertodd+0.396
rive+0.386
ointed+0.385
rived+0.384
ulously+0.383
Neg logit contributions
izabeth-0.585
-0.564
 awa-0.540
 Definition-0.507
sense-0.503
Token Dead
Token ID5542
Feature activation-3.23e-01

Pos logit contributions
ertodd+0.223
rive+0.219
ointed+0.218
ulously+0.218
rived+0.217
Neg logit contributions
izabeth-0.323
-0.311
 awa-0.298
 Definition-0.278
ッド-0.276
Token Space
Token ID4687
Feature activation-4.38e-01

Pos logit contributions
ertodd+0.896
rive+0.887
ulously+0.885
rived+0.882
ointed+0.879
Neg logit contributions
izabeth-0.996
-0.952
 awa-0.912
 Definition-0.834
ッド-0.829
Token and
Token ID290
Feature activation-8.19e-02

Pos logit contributions
ertodd+1.089
rived+1.077
ointed+1.072
ulously+1.071
rive+1.071
Neg logit contributions
izabeth-1.459
-1.414
 awa-1.348
 Definition-1.241
sense-1.241
Token Bio
Token ID16024
Feature activation-4.36e-03

Pos logit contributions
ertodd+0.197
rive+0.191
ointed+0.191
rived+0.190
ulously+0.190
Neg logit contributions
izabeth-0.284
-0.274
 awa-0.262
 Definition-0.246
sense-0.244
TokenShock
Token ID31646
Feature activation-9.08e-01

Pos logit contributions
ertodd+0.008
ointed+0.007
rive+0.007
rived+0.007
ulously+0.007
Neg logit contributions
izabeth-0.018
-0.017
 awa-0.017
 Definition-0.016
sense-0.016
Token never
Token ID1239
Feature activation-3.71e-01

Pos logit contributions
rived+2.154
ulously+2.149
ertodd+2.148
ointed+2.132
rive+2.118
Neg logit contributions
izabeth-3.089
-2.978
 awa-2.854
 Definition-2.637
afety-2.628
Token captured
Token ID7907
Feature activation+6.75e-02

Pos logit contributions
ertodd+0.867
ointed+0.847
rive+0.846
rived+0.846
ulously+0.842
Neg logit contributions
izabeth-1.310
-1.263
 awa-1.204
 Definition-1.133
ッド-1.129
Token.
Token ID13
Feature activation-9.54e-02

Pos logit contributions
izabeth+0.244
+0.236
 awa+0.226
 Definition+0.213
sense+0.212
Neg logit contributions
ertodd-0.153
rive-0.147
ointed-0.147
ulously-0.147
rived-0.147
Token It
Token ID632
Feature activation-8.55e-02

Pos logit contributions
ertodd+0.237
ointed+0.233
rive+0.232
ulously+0.231
rived+0.231
Neg logit contributions
izabeth-0.321
-0.308
 awa-0.296
 Definition-0.274
sense-0.273
Token
Token ID447
Feature activation+5.32e-02

Pos logit contributions
ertodd+0.208
rive+0.202
ointed+0.202
rived+0.201
ulously+0.201
Neg logit contributions
izabeth-0.294
-0.283
 awa-0.271
 Definition-0.254
sense-0.252
Token contains
Token ID4909
Feature activation-2.03e-01

Pos logit contributions
ertodd+0.678
ointed+0.660
rive+0.659
rived+0.658
ulously+0.657
Neg logit contributions
izabeth-0.889
-0.854
 awa-0.814
 Definition-0.758
ッド-0.755
Token no
Token ID645
Feature activation-5.18e-02

Pos logit contributions
ertodd+0.499
rive+0.487
ointed+0.487
rived+0.485
ulously+0.485
Neg logit contributions
izabeth-0.692
-0.666
 awa-0.636
 Definition-0.595
sense-0.591
Token dialog
Token ID17310
Feature activation+4.42e-01

Pos logit contributions
ertodd+0.108
rive+0.105
ointed+0.104
rived+0.104
ulously+0.104
Neg logit contributions
izabeth-0.196
-0.190
 awa-0.182
 Definition-0.172
sense-0.171
Token or
Token ID393
Feature activation+2.39e-02

Pos logit contributions
izabeth+1.482
+1.429
 awa+1.364
sense+1.276
 Definition+1.275
Neg logit contributions
ertodd-1.118
rive-1.077
ointed-1.075
rived-1.069
ulously-1.064
Token on
Token ID319
Feature activation-4.53e-02

Pos logit contributions
izabeth+0.093
+0.090
 awa+0.086
 Definition+0.081
sense+0.081
Neg logit contributions
ertodd-0.047
rive-0.046
ointed-0.046
rived-0.046
ulously-0.046
Tokenom
Token ID296
Feature activation-8.52e-01

Pos logit contributions
ertodd+0.115
rive+0.112
ointed+0.112
rived+0.112
ulously+0.111
Neg logit contributions
izabeth-0.151
-0.145
 awa-0.139
 Definition-0.130
sense-0.129
Tokenat
Token ID265
Feature activation-8.97e-02

Pos logit contributions
ointed+2.207
ertodd+2.196
rive+2.177
rived+2.167
ulously+2.167
Neg logit contributions
izabeth-2.708
-2.679
 awa-2.535
ッド-2.369
sense-2.339
Tokenop
Token ID404
Feature activation-2.82e-01

Pos logit contributions
ertodd+0.224
ointed+0.219
rive+0.218
rived+0.217
ulously+0.215
Neg logit contributions
izabeth-0.301
-0.295
 awa-0.281
 Definition-0.262
ッド-0.260
Tokeno
Token ID78
Feature activation-3.24e-01

Pos logit contributions
ointed+0.869
rived+0.866
ertodd+0.862
rive+0.857
ulously+0.850
Neg logit contributions
izabeth-0.776
-0.749
 awa-0.700
ッド-0.649
sense-0.649
Tokenetic
Token ID5139
Feature activation-1.12e-01

Pos logit contributions
ertodd+0.550
ointed+0.539
rive+0.538
rived+0.536
ulously+0.533
Neg logit contributions
izabeth-1.339
-1.301
 awa-1.254
ッド-1.187
 Definition-1.183
Token words
Token ID2456
Feature activation+1.02e-01

Pos logit contributions
ertodd+0.264
rive+0.259
ointed+0.259
rived+0.258
ulously+0.258
Neg logit contributions
izabeth-0.392
-0.377
 awa-0.361
 Definition-0.338
ッド-0.337
TokenPolice
Token ID9039
Feature activation+3.85e-01

Pos logit contributions
izabeth+0.540
+0.524
 awa+0.495
 Definition+0.466
sense+0.465
Neg logit contributions
ertodd-0.396
ointed-0.385
rive-0.383
ulously-0.383
rived-0.383
Token have
Token ID423
Feature activation+6.84e-02

Pos logit contributions
izabeth+1.356
+1.309
 awa+1.265
 Definition+1.201
sense+1.174
Neg logit contributions
ertodd-0.906
rive-0.863
ulously-0.859
ointed-0.858
rived-0.858
Token also
Token ID635
Feature activation-4.07e-02

Pos logit contributions
izabeth+0.221
+0.212
 awa+0.202
 Definition+0.189
sense+0.187
Neg logit contributions
ertodd-0.181
rive-0.176
ointed-0.176
rived-0.175
ulously-0.175
Token seized
Token ID12000
Feature activation-4.82e-01

Pos logit contributions
ertodd+0.105
rive+0.103
ointed+0.102
rived+0.102
ulously+0.102
Neg logit contributions
izabeth-0.133
-0.128
 awa-0.123
 Definition-0.114
sense-0.114
Token two
Token ID734
Feature activation-1.08e-01

Pos logit contributions
ertodd+1.198
rive+1.181
ointed+1.178
ulously+1.169
rived+1.169
Neg logit contributions
izabeth-1.621
-1.561
 awa-1.478
sense-1.391
 Definition-1.390
Token dump
Token ID10285
Feature activation-9.08e-01

Pos logit contributions
ertodd+0.260
rive+0.253
ointed+0.253
rived+0.252
ulously+0.252
Neg logit contributions
izabeth-0.372
-0.358
 awa-0.343
 Definition-0.321
sense-0.319
Tokeners
Token ID364
Feature activation-1.34e-01

Pos logit contributions
rive+2.300
rived+2.269
ointed+2.268
 rig+2.253
ulously+2.252
Neg logit contributions
izabeth-2.982
-2.863
 awa-2.713
sense-2.555
ッド-2.547
Token and
Token ID290
Feature activation-1.60e-01

Pos logit contributions
ertodd+0.303
rive+0.293
ointed+0.292
ulously+0.291
rived+0.291
Neg logit contributions
izabeth-0.484
-0.467
 awa-0.448
 Definition-0.421
sense-0.418
Token arrested
Token ID5169
Feature activation-9.27e-02

Pos logit contributions
ertodd+0.398
rive+0.388
ointed+0.388
rived+0.386
ulously+0.386
Neg logit contributions
izabeth-0.541
-0.521
 awa-0.497
 Definition-0.466
sense-0.463
Token d
Token ID288
Feature activation-1.57e-01

Pos logit contributions
ertodd+0.231
rive+0.225
ointed+0.225
rived+0.224
ulously+0.224
Neg logit contributions
izabeth-0.313
-0.301
 awa-0.288
 Definition-0.269
sense-0.267
Tokenumper
Token ID15829
Feature activation-4.64e-01

Pos logit contributions
ertodd+0.395
rive+0.386
ointed+0.385
rived+0.385
ulously+0.383
Neg logit contributions
izabeth-0.527
-0.507
 awa-0.486
 Definition-0.454
ッド-0.453
Token true
Token ID2081
Feature activation+2.06e-01

Pos logit contributions
ertodd+0.327
rive+0.319
ointed+0.318
rived+0.317
ulously+0.317
Neg logit contributions
izabeth-0.397
-0.381
 awa-0.364
 Definition-0.339
sense-0.337
Token path
Token ID3108
Feature activation-1.70e-01

Pos logit contributions
izabeth+0.569
+0.542
 awa+0.513
 Definition+0.474
sense+0.470
Neg logit contributions
ertodd-0.640
rive-0.623
rived-0.622
ointed-0.622
ulously-0.621
Token,
Token ID11
Feature activation+6.31e-02

Pos logit contributions
ertodd+0.397
rive+0.385
rived+0.383
ointed+0.382
ulously+0.382
Neg logit contributions
izabeth-0.604
-0.582
 awa-0.558
 Definition-0.526
sense-0.522
Token you
Token ID345
Feature activation+8.09e-03

Pos logit contributions
izabeth+0.206
+0.197
 awa+0.189
 Definition+0.177
sense+0.175
Neg logit contributions
ertodd-0.166
rive-0.161
rived-0.160
ointed-0.160
ulously-0.160
Token are
Token ID389
Feature activation-2.98e-02

Pos logit contributions
izabeth+0.030
+0.029
 awa+0.028
 Definition+0.027
sense+0.026
Neg logit contributions
ertodd-0.017
rive-0.017
ulously-0.017
ointed-0.017
rived-0.017
Token damned
Token ID28911
Feature activation-8.54e-01

Pos logit contributions
ertodd+0.069
rive+0.067
ointed+0.067
rived+0.067
ulously+0.066
Neg logit contributions
izabeth-0.106
-0.103
 awa-0.098
 Definition-0.092
sense-0.092
Token.
Token ID13
Feature activation-1.97e-01

Pos logit contributions
ertodd+1.869
rive+1.858
ulously+1.856
ointed+1.850
rived+1.838
Neg logit contributions
izabeth-3.094
-3.015
 awa-2.869
sense-2.687
 Definition-2.686
Token
Token ID198
Feature activation-9.99e-02

Pos logit contributions
ertodd+0.486
ointed+0.475
rive+0.473
ulously+0.470
rived+0.470
Neg logit contributions
izabeth-0.667
-0.642
 awa-0.616
sense-0.570
ッド-0.569
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
ertodd+0.258
rive+0.250
ointed+0.250
rived+0.250
ulously+0.250
Neg logit contributions
izabeth-0.329
-0.317
 awa-0.299
 Definition-0.283
sense-0.281
TokenI
Token ID40
Feature activation-3.99e-02

Pos logit contributions
ertodd+0.381
rive+0.373
ointed+0.371
rived+0.369
ulously+0.368
Neg logit contributions
izabeth-0.580
-0.555
 awa-0.542
 Definition-0.501
ッド-0.497
Token know
Token ID760
Feature activation+3.42e-02

Pos logit contributions
ertodd+0.089
rive+0.086
ointed+0.086
rived+0.086
ulously+0.086
Neg logit contributions
izabeth-0.145
-0.140
 awa-0.135
 Definition-0.127
sense-0.126