TOP ACTIVATIONS
MAX = 3.883

Cl ad in a Toronto Argon auts football jersey , he
Richard Painter , the chief White House
Senate violated perjury laws . Painter also said Ross must rec
Casino Recap - Sports - Pain less Chick - An ore
sanctions regime , Painter said . That
their income was pretty much passive after the book launched ,
perhaps to buy a Chevy Volt from the bailed - out
itas ( h ull ed pumpkin seeds ) , Sun flower
girlfriend ." I fur row my eyebrows at the
back soon . <|endoftext|> Yet even these monuments can reinforce myths
below . <|endoftext|> V andals painted the word Islam in large
diseases as well . <|endoftext|> HA V ANA ( Reuters )
selling fetal tissue . With Republicans using
unto thy self . De uter onomy 20 : 12 .
null atomic pointer . pub fn null () -> Atomic <
ross . Carlton L . Guth rie . Marion T .
that Planned Parenthood is selling fetal tissue . In reality ,
as a vehicle for making passive investments unrelated to the business
evidence . <|endoftext|> The cookies baked quickly at 375 , so
- old gay son , Isaiah Henderson .

MIN ACTIVATIONS
MIN = -4.523

who worked on the CDC Guid eline itself . They see
who worked on the CDC Guid eline , signed a public
pl i is knocked by Mov i eline for mirror ing
rainfall ." Peter Mell gard / The World Post
s is one of the focal points of the 2017 Disney
Anyone who knows of his whereabouts is urged to call NYPD
With the words of Max ller , the Greek
of curses , injuries or kidn appings of non player characters
, with the advice of Gand alf , chooses companions for
You contributed money to Al Qaeda ) without revealing
've been the manager at Pop eyes right now , if
6 , 2010 Subject : R iff Tra x
9 , 2010 Subject : R ipe for M
potential side effects . The aim of this study is to
viation for duty status whereabouts unknown ) TIM EL INE
create a distributed algorithm that conver ged to this operation point
13 , 2010 Subject : boys beware This film
Mill iy et , a smugg ler named H id ir
. The combination of a mov able command center with a
guy ended up becoming my focal point for his outlandish offense

INTERVAL 1.782 - 3.883
CONTAINS 0.530%

48 . Pro verbs 49 . Drunk
election season and still take back the House next year .
bestowed the name Z hen jin ( True - gold )
and Tesla Motors Model S automobile , everything that
goal against Holland declared a Un es co World Heritage Site

INTERVAL -0.320 - 1.782
CONTAINS 76.046%

claimed , flat out , that Kam al had merely missed
." The email continues , via Mic report , " he
third overall pick in the 2015 NBA draft by the Philadelphia
top notch tools just for you . Here 's a quick
will have a strong sensation of what alcohol does , but

INTERVAL -2.421 - -0.320
CONTAINS 23.350%

Las Vegas . Ren aud Lav o ie (@ ren
as here , to ( 1 g ) 1
of this is a round about way of saying that the
to push an ability we don t often see
make no mistake these attacks are against all of us

INTERVAL -4.523 - -2.421
CONTAINS 0.073%

College career [ edit ] O ka
² ) and obesity ( 30 kg / m
. There they unexpectedly meet Gand alf . Gand alf explains
Ag 2 r - La Mond ial e ) s .
at oder ma with variable penet rance , characterized clinically by
Token Cl
Token ID1012
Feature activation-1.74e-01

Pos logit contributions
 restraint+0.336
���+0.330
otaur+0.327
────────+0.322
heed+0.311
Neg logit contributions
ially-0.335
ciation-0.297
ijn-0.285
-0.271
llo-0.270
Tokenad
Token ID324
Feature activation-2.24e-01

Pos logit contributions
ially+0.315
ciation+0.281
ijn+0.271
igious+0.257
+0.256
Neg logit contributions
 restraint-0.269
���-0.263
────────-0.259
otaur-0.259
 swe-0.248
Token in
Token ID287
Feature activation+6.75e-02

Pos logit contributions
ially+0.317
ciation+0.276
ijn+0.261
+0.244
igious+0.242
Neg logit contributions
 restraint-0.430
otaur-0.421
���-0.420
────────-0.419
 swe-0.404
Token a
Token ID257
Feature activation+4.65e-02

Pos logit contributions
 restraint+0.105
���+0.104
otaur+0.104
────────+0.103
heed+0.098
Neg logit contributions
ially-0.118
ciation-0.104
ijn-0.101
igious-0.096
-0.096
Token Toronto
Token ID6586
Feature activation+3.42e-02

Pos logit contributions
 restraint+0.072
���+0.071
otaur+0.071
────────+0.070
heed+0.067
Neg logit contributions
ially-0.082
ciation-0.073
ijn-0.071
-0.067
llo-0.067
Token Argon
Token ID49974
Feature activation+3.88e+00

Pos logit contributions
 restraint+0.055
otaur+0.055
���+0.055
────────+0.054
heed+0.051
Neg logit contributions
ially-0.058
ciation-0.051
ijn-0.049
-0.047
llo-0.046
Tokenauts
Token ID17712
Feature activation+1.06e+00

Pos logit contributions
 restraint+2.798
────────+2.656
otaur+2.638
 swe+2.413
heed+2.411
Neg logit contributions
ially-9.941
ciation-9.357
ijn-9.107
-8.822
igious-8.695
Token football
Token ID4346
Feature activation-4.87e-01

Pos logit contributions
 restraint+1.601
otaur+1.554
���+1.547
────────+1.545
 swe+1.478
Neg logit contributions
ially-1.925
ciation-1.732
ijn-1.661
-1.582
igious-1.573
Token jersey
Token ID22383
Feature activation+1.51e+00

Pos logit contributions
ially+0.832
ciation+0.727
ijn+0.706
llo+0.667
+0.666
Neg logit contributions
���-0.783
 restraint-0.774
otaur-0.772
────────-0.765
heed-0.729
Token,
Token ID11
Feature activation+3.11e-02

Pos logit contributions
 restraint+2.702
────────+2.612
otaur+2.611
���+2.596
 swe+2.540
Neg logit contributions
ially-2.309
ciation-2.038
ijn-1.935
-1.818
igious-1.816
Token he
Token ID339
Feature activation+4.42e-01

Pos logit contributions
 restraint+0.046
otaur+0.046
���+0.046
────────+0.045
heed+0.043
Neg logit contributions
ially-0.057
ciation-0.051
ijn-0.049
-0.046
llo-0.046
Token
Token ID447
Feature activation-3.85e-02

Pos logit contributions
ially+0.103
ciation+0.092
ijn+0.087
+0.084
llo+0.083
Neg logit contributions
 restraint-0.108
���-0.106
otaur-0.104
────────-0.103
heed-0.100
Token
Token ID251
Feature activation-2.06e-01

Pos logit contributions
ially+0.083
ciation+0.075
ijn+0.073
+0.070
llo+0.070
Neg logit contributions
 restraint-0.046
otaur-0.044
���-0.043
────────-0.043
 swe-0.041
Token
Token ID198
Feature activation+1.45e-01

Pos logit contributions
ially+0.342
ciation+0.304
ijn+0.289
+0.275
llo+0.274
Neg logit contributions
 restraint-0.347
���-0.339
otaur-0.337
────────-0.333
 swe-0.321
Token
Token ID198
Feature activation+6.79e-02

Pos logit contributions
 restraint+0.239
otaur+0.233
���+0.233
────────+0.232
 swe+0.222
Neg logit contributions
ially-0.244
ciation-0.217
ijn-0.208
-0.197
igious-0.196
TokenRichard
Token ID22245
Feature activation+2.67e-01

Pos logit contributions
 restraint+0.108
���+0.108
otaur+0.105
────────+0.103
 swe+0.101
Neg logit contributions
ially-0.119
ciation-0.106
ijn-0.101
-0.097
llo-0.097
Token Painter
Token ID49102
Feature activation+3.85e+00

Pos logit contributions
 restraint+0.389
���+0.378
otaur+0.378
────────+0.375
 swe+0.356
Neg logit contributions
ially-0.505
ciation-0.454
ijn-0.437
-0.418
llo-0.416
Token,
Token ID11
Feature activation-1.87e-01

Pos logit contributions
 restraint+6.516
────────+6.312
���+6.295
otaur+6.260
 swe+6.149
Neg logit contributions
ially-6.035
ciation-5.390
ijn-5.168
igious-4.829
ication-4.795
Token the
Token ID262
Feature activation+6.11e-02

Pos logit contributions
ially+0.319
ciation+0.279
ijn+0.266
+0.257
llo+0.255
Neg logit contributions
 restraint-0.302
otaur-0.301
���-0.300
────────-0.295
heed-0.285
Token chief
Token ID4039
Feature activation-9.81e-01

Pos logit contributions
���+0.090
otaur+0.089
 restraint+0.089
────────+0.087
heed+0.084
Neg logit contributions
ially-0.113
ciation-0.098
ijn-0.095
-0.092
igious-0.092
Token White
Token ID2635
Feature activation-4.98e-01

Pos logit contributions
ially+2.176
ciation+1.917
ijn+1.871
+1.834
ication+1.817
Neg logit contributions
���-1.080
otaur-1.060
 restraint-1.059
────────-1.057
heed-1.007
Token House
Token ID2097
Feature activation+3.08e-01

Pos logit contributions
ially+0.800
ciation+0.702
ijn+0.672
llo+0.639
+0.639
Neg logit contributions
���-0.861
 restraint-0.860
otaur-0.844
────────-0.834
 swe-0.802
Token Senate
Token ID3845
Feature activation+8.92e-01

Pos logit contributions
 restraint+0.311
���+0.311
otaur+0.308
────────+0.306
heed+0.292
Neg logit contributions
ially-0.367
ciation-0.323
ijn-0.312
llo-0.300
-0.297
Token violated
Token ID13998
Feature activation+8.84e-01

Pos logit contributions
 restraint+1.341
otaur+1.319
���+1.317
────────+1.305
heed+1.241
Neg logit contributions
ially-1.634
ciation-1.448
ijn-1.393
-1.333
llo-1.332
Token perjury
Token ID47816
Feature activation-1.07e+00

Pos logit contributions
 restraint+1.396
otaur+1.377
���+1.371
────────+1.367
heed+1.307
Neg logit contributions
ially-1.547
ciation-1.363
ijn-1.318
-1.252
igious-1.245
Token laws
Token ID3657
Feature activation+1.25e+00

Pos logit contributions
ially+2.058
ciation+1.800
ijn+1.725
+1.676
ication+1.658
Neg logit contributions
���-1.539
────────-1.505
otaur-1.502
 restraint-1.470
heed-1.405
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
 restraint+2.280
otaur+2.223
���+2.216
────────+2.213
 swe+2.132
Neg logit contributions
ially-1.892
ciation-1.664
ijn-1.581
-1.486
igious-1.475
Token Painter
Token ID49102
Feature activation+3.79e+00

Pos logit contributions
 restraint+0.198
���+0.195
otaur+0.193
────────+0.190
heed+0.185
Neg logit contributions
ially-0.189
ciation-0.168
ijn-0.159
llo-0.152
-0.151
Token also
Token ID635
Feature activation+1.45e+00

Pos logit contributions
 restraint+5.663
────────+5.446
otaur+5.406
���+5.394
 swe+5.308
Neg logit contributions
ially-6.695
ciation-6.117
ijn-5.890
-5.586
igious-5.560
Token said
Token ID531
Feature activation+2.55e-01

Pos logit contributions
 restraint+2.240
otaur+2.177
���+2.170
────────+2.163
 swe+2.067
Neg logit contributions
ially-2.594
ciation-2.326
ijn-2.231
-2.122
igious-2.110
Token Ross
Token ID9847
Feature activation-6.63e-01

Pos logit contributions
 restraint+0.425
���+0.421
otaur+0.419
────────+0.414
heed+0.398
Neg logit contributions
ially-0.423
ciation-0.370
ijn-0.354
llo-0.337
-0.336
Token must
Token ID1276
Feature activation+6.16e-01

Pos logit contributions
ially+1.127
ciation+0.980
ijn+0.936
llo+0.901
igious+0.888
Neg logit contributions
 restraint-1.087
otaur-1.077
���-1.068
────────-1.051
heed-1.003
Token rec
Token ID664
Feature activation+6.80e-01

Pos logit contributions
 restraint+1.048
otaur+1.020
���+1.016
────────+1.015
 swe+0.975
Neg logit contributions
ially-1.001
ciation-0.888
ijn-0.849
-0.803
igious-0.797
Token Casino
Token ID32802
Feature activation-3.50e-01

Pos logit contributions
 restraint+1.979
otaur+1.927
���+1.917
────────+1.911
 swe+1.838
Neg logit contributions
ially-2.035
ciation-1.811
ijn-1.731
-1.646
igious-1.629
Token Recap
Token ID46585
Feature activation+8.22e-01

Pos logit contributions
ially+0.527
ciation+0.463
ijn+0.440
+0.413
igious+0.411
Neg logit contributions
 restraint-0.640
otaur-0.625
���-0.623
────────-0.621
 swe-0.598
Token-
Token ID12
Feature activation+2.95e-02

Pos logit contributions
 restraint+1.534
otaur+1.503
���+1.497
────────+1.492
 swe+1.447
Neg logit contributions
ially-1.179
ciation-1.033
ijn-0.979
-0.922
igious-0.908
TokenSports
Token ID18153
Feature activation+6.22e-02

Pos logit contributions
 restraint+0.046
���+0.045
otaur+0.044
────────+0.044
 swe+0.042
Neg logit contributions
ially-0.053
ciation-0.047
ijn-0.045
-0.043
llo-0.043
Token-
Token ID12
Feature activation+4.14e-02

Pos logit contributions
 restraint+0.102
otaur+0.100
────────+0.100
���+0.099
 swe+0.095
Neg logit contributions
ially-0.105
ciation-0.093
ijn-0.089
igious-0.084
llo-0.084
TokenPain
Token ID38490
Feature activation+3.62e+00

Pos logit contributions
 restraint+0.064
���+0.062
otaur+0.062
────────+0.061
 swe+0.059
Neg logit contributions
ially-0.074
ciation-0.067
ijn-0.064
-0.061
llo-0.061
Tokenless
Token ID1203
Feature activation+1.54e+00

Pos logit contributions
 restraint+5.685
otaur+5.547
���+5.521
heed+5.402
────────+5.398
Neg logit contributions
ially-6.010
ciation-5.365
ijn-5.138
-4.992
igious-4.781
Token Chick
Token ID31939
Feature activation+9.37e-01

Pos logit contributions
 restraint+2.539
otaur+2.462
���+2.449
────────+2.441
 swe+2.360
Neg logit contributions
ially-2.552
ciation-2.278
ijn-2.178
-2.076
igious-2.044
Token-
Token ID12
Feature activation+8.48e-02

Pos logit contributions
 restraint+1.569
otaur+1.528
���+1.524
────────+1.517
 swe+1.458
Neg logit contributions
ially-1.548
ciation-1.375
ijn-1.314
-1.248
igious-1.237
TokenAn
Token ID2025
Feature activation+5.32e-01

Pos logit contributions
 restraint+0.130
���+0.127
otaur+0.127
────────+0.125
 swe+0.120
Neg logit contributions
ially-0.153
ciation-0.137
ijn-0.131
-0.125
llo-0.125
Tokenore
Token ID382
Feature activation-4.11e-01

Pos logit contributions
 restraint+0.843
────────+0.816
otaur+0.814
���+0.814
 swe+0.778
Neg logit contributions
ially-0.937
ciation-0.839
ijn-0.804
igious-0.761
llo-0.759
Token sanctions
Token ID9388
Feature activation+6.13e-01

Pos logit contributions
 restraint+1.278
otaur+1.263
���+1.262
────────+1.247
heed+1.195
Neg logit contributions
ially-1.409
ciation-1.253
ijn-1.194
-1.139
llo-1.135
Token regime
Token ID7142
Feature activation+9.02e-02

Pos logit contributions
 restraint+1.010
���+0.995
otaur+0.994
────────+0.983
 swe+0.940
Neg logit contributions
ially-1.029
ciation-0.913
ijn-0.867
-0.825
igious-0.820
Token,
Token ID11
Feature activation+1.53e-01

Pos logit contributions
 restraint+0.164
���+0.162
otaur+0.162
────────+0.160
 swe+0.154
Neg logit contributions
ially-0.136
ciation-0.119
ijn-0.112
-0.106
igious-0.106
Token
Token ID447
Feature activation+3.15e-02

Pos logit contributions
 restraint+0.227
���+0.226
otaur+0.224
────────+0.220
 swe+0.211
Neg logit contributions
ially-0.282
ciation-0.252
ijn-0.239
llo-0.231
-0.230
Token
Token ID251
Feature activation-3.73e-02

Pos logit contributions
 restraint+0.040
���+0.039
otaur+0.039
────────+0.038
 swe+0.036
Neg logit contributions
ially-0.065
ciation-0.059
ijn-0.057
-0.055
llo-0.055
Token Painter
Token ID49102
Feature activation+3.50e+00

Pos logit contributions
ially+0.068
ciation+0.061
ijn+0.058
llo+0.056
igious+0.055
Neg logit contributions
 restraint-0.056
���-0.056
otaur-0.055
────────-0.054
 swe-0.052
Token said
Token ID531
Feature activation-1.08e-01

Pos logit contributions
 restraint+4.920
────────+4.711
otaur+4.697
���+4.671
 swe+4.551
Neg logit contributions
ially-6.535
ciation-5.980
ijn-5.776
-5.467
igious-5.462
Token.
Token ID13
Feature activation-2.55e-01

Pos logit contributions
ially+0.158
ciation+0.137
ijn+0.130
+0.122
llo+0.122
Neg logit contributions
 restraint-0.203
���-0.198
otaur-0.198
────────-0.197
 swe-0.190
Token
Token ID564
Feature activation+2.74e-01

Pos logit contributions
ially+0.410
ciation+0.366
ijn+0.344
llo+0.331
+0.329
Neg logit contributions
 restraint-0.440
���-0.433
otaur-0.429
────────-0.422
heed-0.411
Token
Token ID250
Feature activation-3.56e-01

Pos logit contributions
 restraint+0.301
otaur+0.288
���+0.288
────────+0.279
 swe+0.269
Neg logit contributions
ially-0.612
ciation-0.559
ijn-0.541
-0.526
llo-0.524
TokenThat
Token ID2504
Feature activation+4.52e-01

Pos logit contributions
ially+0.615
ciation+0.546
ijn+0.516
+0.502
llo+0.500
Neg logit contributions
���-0.577
 restraint-0.574
otaur-0.561
────────-0.541
 swe-0.534
Token their
Token ID511
Feature activation-1.96e-01

Pos logit contributions
 restraint+0.682
otaur+0.667
���+0.666
────────+0.662
heed+0.635
Neg logit contributions
ially-0.710
ciation-0.627
ijn-0.600
llo-0.569
-0.568
Token income
Token ID3739
Feature activation+4.02e-01

Pos logit contributions
ially+0.371
ciation+0.329
ijn+0.317
llo+0.304
igious+0.303
Neg logit contributions
���-0.279
 restraint-0.278
otaur-0.275
────────-0.271
heed-0.260
Token was
Token ID373
Feature activation+1.60e-01

Pos logit contributions
���+0.624
 restraint+0.621
otaur+0.620
────────+0.611
heed+0.589
Neg logit contributions
ially-0.713
ciation-0.634
ijn-0.604
-0.576
llo-0.575
Token pretty
Token ID2495
Feature activation+1.21e-01

Pos logit contributions
otaur+0.249
 restraint+0.248
���+0.247
────────+0.243
heed+0.233
Neg logit contributions
ially-0.285
ciation-0.249
ijn-0.241
llo-0.228
ication-0.227
Token much
Token ID881
Feature activation+6.57e-01

Pos logit contributions
 restraint+0.178
otaur+0.178
���+0.175
────────+0.172
heed+0.166
Neg logit contributions
ially-0.224
ciation-0.198
ijn-0.192
llo-0.183
igious-0.181
Token passive
Token ID14513
Feature activation+3.29e+00

Pos logit contributions
 restraint+1.009
otaur+0.993
���+0.984
────────+0.979
heed+0.934
Neg logit contributions
ially-1.180
ciation-1.048
ijn-1.011
-0.958
llo-0.955
Token after
Token ID706
Feature activation+4.13e-01

Pos logit contributions
 restraint+5.543
────────+5.411
otaur+5.155
���+5.133
 swe+5.124
Neg logit contributions
ially-5.233
ciation-4.749
ijn-4.525
llo-4.268
-4.262
Token the
Token ID262
Feature activation+5.95e-02

Pos logit contributions
 restraint+0.686
otaur+0.679
���+0.676
────────+0.670
heed+0.643
Neg logit contributions
ially-0.689
ciation-0.608
ijn-0.582
igious-0.547
-0.546
Token book
Token ID1492
Feature activation+7.99e-01

Pos logit contributions
 restraint+0.089
���+0.089
otaur+0.088
────────+0.087
heed+0.084
Neg logit contributions
ially-0.108
ciation-0.096
ijn-0.092
llo-0.088
-0.087
Token launched
Token ID5611
Feature activation+8.62e-01

Pos logit contributions
 restraint+1.276
���+1.254
otaur+1.247
────────+1.241
heed+1.189
Neg logit contributions
ially-1.385
ciation-1.236
ijn-1.179
-1.120
llo-1.112
Token,
Token ID11
Feature activation-1.74e-02

Pos logit contributions
 restraint+1.588
otaur+1.550
────────+1.542
���+1.542
 swe+1.490
Neg logit contributions
ially-1.278
ciation-1.119
ijn-1.065
-1.000
igious-0.993
Token perhaps
Token ID3737
Feature activation+4.18e-01

Pos logit contributions
���+0.404
 restraint+0.402
otaur+0.400
────────+0.400
heed+0.380
Neg logit contributions
ially-0.418
ciation-0.368
ijn-0.348
llo-0.333
igious-0.331
Token to
Token ID284
Feature activation+1.29e-01

Pos logit contributions
 restraint+0.677
���+0.674
otaur+0.670
────────+0.666
heed+0.634
Neg logit contributions
ially-0.711
ciation-0.628
ijn-0.597
llo-0.567
-0.566
Token buy
Token ID2822
Feature activation-1.69e-01

Pos logit contributions
���+0.199
 restraint+0.198
otaur+0.197
────────+0.195
heed+0.184
Neg logit contributions
ially-0.231
ciation-0.205
ijn-0.196
igious-0.186
llo-0.186
Token a
Token ID257
Feature activation+2.30e-01

Pos logit contributions
ially+0.319
ciation+0.276
ijn+0.270
llo+0.257
ication+0.256
Neg logit contributions
���-0.245
otaur-0.244
 restraint-0.239
────────-0.239
heed-0.225
Token Chevy
Token ID40688
Feature activation+2.95e-01

Pos logit contributions
otaur+0.342
 restraint+0.342
���+0.341
────────+0.338
heed+0.320
Neg logit contributions
ially-0.418
ciation-0.370
ijn-0.358
llo-0.339
-0.339
Token Volt
Token ID22702
Feature activation+3.29e+00

Pos logit contributions
 restraint+0.426
���+0.426
otaur+0.424
────────+0.419
heed+0.393
Neg logit contributions
ially-0.551
ciation-0.494
ijn-0.475
llo-0.452
-0.449
Token from
Token ID422
Feature activation+8.71e-01

Pos logit contributions
 restraint+5.598
otaur+5.353
────────+5.322
 swe+5.280
���+5.191
Neg logit contributions
ially-5.220
ciation-4.718
ijn-4.464
-4.293
igious-4.215
Token the
Token ID262
Feature activation+8.81e-02

Pos logit contributions
 restraint+1.398
���+1.373
otaur+1.368
────────+1.362
 swe+1.293
Neg logit contributions
ially-1.501
ciation-1.332
ijn-1.277
-1.215
igious-1.208
Token bailed
Token ID47739
Feature activation-1.99e-01

Pos logit contributions
���+0.134
 restraint+0.134
otaur+0.132
────────+0.132
heed+0.125
Neg logit contributions
ially-0.158
ciation-0.137
ijn-0.133
llo-0.128
igious-0.127
Token-
Token ID12
Feature activation+1.07e-01

Pos logit contributions
ially+0.270
ciation+0.230
ijn+0.218
+0.203
igious+0.203
Neg logit contributions
 restraint-0.392
otaur-0.386
���-0.385
────────-0.382
 swe-0.369
Tokenout
Token ID448
Feature activation+2.64e-01

Pos logit contributions
 restraint+0.180
���+0.176
otaur+0.175
────────+0.174
 swe+0.167
Neg logit contributions
ially-0.176
ciation-0.156
ijn-0.149
-0.141
igious-0.140
Tokenitas
Token ID21416
Feature activation+8.20e-01

Pos logit contributions
ially+1.788
ciation+1.548
ijn+1.494
igious+1.373
ication+1.366
Neg logit contributions
 restraint-2.346
���-2.301
otaur-2.290
────────-2.282
 swe-2.199
Token (
Token ID357
Feature activation-4.00e-01

Pos logit contributions
 restraint+1.496
otaur+1.463
���+1.456
────────+1.455
 swe+1.407
Neg logit contributions
ially-1.228
ciation-1.079
ijn-1.021
-0.963
igious-0.955
Tokenh
Token ID71
Feature activation+7.17e-01

Pos logit contributions
ially+0.714
ciation+0.638
ijn+0.612
+0.588
igious+0.584
Neg logit contributions
 restraint-0.619
���-0.608
otaur-0.601
────────-0.592
heed-0.566
Tokenull
Token ID724
Feature activation+6.63e-01

Pos logit contributions
 restraint+1.094
���+1.066
otaur+1.057
────────+1.055
heed+1.000
Neg logit contributions
ially-1.305
ciation-1.165
ijn-1.128
igious-1.064
llo-1.061
Tokened
Token ID276
Feature activation-1.32e+00

Pos logit contributions
 restraint+1.180
otaur+1.151
���+1.148
────────+1.145
 swe+1.101
Neg logit contributions
ially-1.027
ciation-0.905
ijn-0.861
-0.812
igious-0.806
Token pumpkin
Token ID30089
Feature activation+3.26e+00

Pos logit contributions
ially+2.429
ciation+2.146
ijn+2.098
igious+2.012
llo+2.011
Neg logit contributions
���-1.926
 restraint-1.925
────────-1.882
otaur-1.878
heed-1.790
Token seeds
Token ID11904
Feature activation+3.18e-01

Pos logit contributions
 restraint+5.379
otaur+5.223
 swe+5.131
────────+5.127
���+5.061
Neg logit contributions
ially-5.278
ciation-4.756
ijn-4.486
-4.349
igious-4.275
Token)
Token ID8
Feature activation-6.88e-02

Pos logit contributions
 restraint+0.611
otaur+0.597
���+0.595
────────+0.594
 swe+0.573
Neg logit contributions
ially-0.449
ciation-0.390
ijn-0.370
-0.346
igious-0.343
Token,
Token ID837
Feature activation-4.53e-01

Pos logit contributions
ially+0.100
ciation+0.087
ijn+0.083
+0.078
igious+0.077
Neg logit contributions
 restraint-0.129
otaur-0.126
���-0.126
────────-0.125
heed-0.121
Token Sun
Token ID3825
Feature activation+7.14e-01

Pos logit contributions
ially+0.763
ciation+0.674
ijn+0.648
llo+0.616
+0.614
Neg logit contributions
 restraint-0.744
���-0.730
otaur-0.726
────────-0.721
heed-0.692
Tokenflower
Token ID25547
Feature activation+6.90e-01

Pos logit contributions
 restraint+0.821
otaur+0.792
���+0.789
────────+0.783
heed+0.732
Neg logit contributions
ially-1.563
ciation-1.425
ijn-1.378
igious-1.325
-1.325
Token girlfriend
Token ID11077
Feature activation-6.11e-01

Pos logit contributions
 restraint+0.478
���+0.471
otaur+0.468
────────+0.460
heed+0.444
Neg logit contributions
ially-0.577
ciation-0.512
ijn-0.492
-0.468
llo-0.468
Token."
Token ID526
Feature activation+4.08e-01

Pos logit contributions
ially+0.885
ciation+0.769
ijn+0.718
+0.678
llo+0.675
Neg logit contributions
 restraint-1.156
���-1.146
otaur-1.137
────────-1.121
heed-1.084
Token
Token ID198
Feature activation+4.56e-01

Pos logit contributions
 restraint+0.705
���+0.689
otaur+0.688
────────+0.685
heed+0.656
Neg logit contributions
ially-0.655
ciation-0.580
ijn-0.551
-0.521
llo-0.519
Token
Token ID198
Feature activation+4.09e-01

Pos logit contributions
 restraint+0.736
otaur+0.714
────────+0.714
���+0.697
 swe+0.683
Neg logit contributions
ially-0.781
ciation-0.698
ijn-0.669
-0.634
igious-0.632
TokenI
Token ID40
Feature activation-1.62e-01

Pos logit contributions
 restraint+0.729
���+0.721
otaur+0.710
────────+0.701
 swe+0.681
Neg logit contributions
ially-0.637
ciation-0.560
ijn-0.531
-0.503
llo-0.499
Token fur
Token ID9230
Feature activation+3.25e+00

Pos logit contributions
ially+0.277
ciation+0.246
ijn+0.235
llo+0.222
+0.222
Neg logit contributions
 restraint-0.262
���-0.257
otaur-0.257
────────-0.255
heed-0.242
Tokenrow
Token ID808
Feature activation+1.43e+00

Pos logit contributions
 restraint+4.571
otaur+4.397
────────+4.380
���+4.360
 swe+4.240
Neg logit contributions
ially-6.078
ciation-5.606
ijn-5.422
-5.164
igious-5.099
Token my
Token ID616
Feature activation+1.42e-01

Pos logit contributions
 restraint+2.471
otaur+2.396
────────+2.388
���+2.381
 swe+2.300
Neg logit contributions
ially-2.272
ciation-2.024
ijn-1.920
-1.823
igious-1.813
Token eyebrows
Token ID26928
Feature activation+2.36e+00

Pos logit contributions
 restraint+0.138
���+0.137
otaur+0.135
────────+0.132
heed+0.124
Neg logit contributions
ially-0.333
ciation-0.304
ijn-0.298
-0.287
llo-0.285
Token at
Token ID379
Feature activation+1.31e+00

Pos logit contributions
 restraint+4.080
otaur+3.913
────────+3.896
 swe+3.820
���+3.797
Neg logit contributions
ially-3.701
ciation-3.302
ijn-3.156
igious-2.998
-2.980
Token the
Token ID262
Feature activation+2.66e-01

Pos logit contributions
 restraint+2.255
otaur+2.197
���+2.190
────────+2.184
 swe+2.098
Neg logit contributions
ially-2.120
ciation-1.878
ijn-1.792
-1.693
igious-1.683
Token back
Token ID736
Feature activation+1.86e+00

Pos logit contributions
 restraint+0.542
���+0.531
otaur+0.529
────────+0.524
heed+0.503
Neg logit contributions
ially-0.528
ciation-0.466
ijn-0.447
-0.421
llo-0.419
Token soon
Token ID2582
Feature activation-1.38e-01

Pos logit contributions
 restraint+2.885
────────+2.794
otaur+2.793
���+2.756
 swe+2.673
Neg logit contributions
ially-3.290
ciation-2.966
ijn-2.838
-2.696
igious-2.686
Token.
Token ID13
Feature activation+5.00e-01

Pos logit contributions
ially+0.211
ciation+0.184
ijn+0.175
+0.165
llo+0.164
Neg logit contributions
 restraint-0.251
���-0.247
otaur-0.245
────────-0.243
heed-0.234
Token<|endoftext|>
Token ID50256
Feature activation+9.73e-02

Pos logit contributions
 restraint+0.827
���+0.806
otaur+0.804
────────+0.790
heed+0.766
Neg logit contributions
ially-0.842
ciation-0.751
ijn-0.718
-0.682
igious-0.675
TokenYet
Token ID11486
Feature activation+1.68e+00

Pos logit contributions
 restraint+0.154
���+0.152
otaur+0.150
────────+0.147
 swe+0.143
Neg logit contributions
ially-0.171
ciation-0.152
ijn-0.146
-0.139
igious-0.138
Token even
Token ID772
Feature activation+3.17e+00

Pos logit contributions
 restraint+2.891
otaur+2.798
────────+2.780
���+2.779
 swe+2.692
Neg logit contributions
ially-2.684
ciation-2.380
ijn-2.257
-2.145
igious-2.129
Token these
Token ID777
Feature activation+8.87e-01

Pos logit contributions
 restraint+5.174
otaur+4.929
────────+4.917
���+4.850
 swe+4.841
Neg logit contributions
ially-5.249
ciation-4.791
ijn-4.557
-4.324
igious-4.289
Token monuments
Token ID28814
Feature activation+8.58e-01

Pos logit contributions
 restraint+1.351
otaur+1.322
���+1.318
────────+1.310
heed+1.250
Neg logit contributions
ially-1.604
ciation-1.434
ijn-1.377
-1.311
igious-1.304
Token can
Token ID460
Feature activation+1.08e+00

Pos logit contributions
 restraint+1.457
otaur+1.416
────────+1.405
���+1.403
 swe+1.356
Neg logit contributions
ially-1.396
ciation-1.242
ijn-1.189
-1.126
igious-1.114
Token reinforce
Token ID19594
Feature activation+6.78e-01

Pos logit contributions
 restraint+1.850
otaur+1.786
────────+1.775
���+1.756
 swe+1.728
Neg logit contributions
ially-1.738
ciation-1.564
ijn-1.499
-1.422
llo-1.398
Token myths
Token ID22886
Feature activation+1.48e+00

Pos logit contributions
 restraint+1.052
���+1.033
otaur+1.030
────────+1.025
heed+0.975
Neg logit contributions
ially-1.203
ciation-1.072
ijn-1.028
-0.976
igious-0.975
Token below
Token ID2174
Feature activation-1.58e-01

Pos logit contributions
 restraint+1.878
otaur+1.831
────────+1.824
���+1.820
 swe+1.753
Neg logit contributions
ially-1.659
ciation-1.464
ijn-1.394
-1.312
igious-1.304
Token.
Token ID13
Feature activation+5.81e-02

Pos logit contributions
ially+0.233
ciation+0.204
ijn+0.194
+0.183
llo+0.181
Neg logit contributions
 restraint-0.293
���-0.288
otaur-0.287
────────-0.284
heed-0.274
Token<|endoftext|>
Token ID50256
Feature activation+2.37e-02

Pos logit contributions
 restraint+0.099
���+0.097
otaur+0.097
────────+0.095
heed+0.092
Neg logit contributions
ially-0.095
ciation-0.084
ijn-0.080
-0.076
llo-0.076
TokenV
Token ID53
Feature activation-3.80e-01

Pos logit contributions
 restraint+0.037
���+0.037
otaur+0.036
────────+0.036
 swe+0.035
Neg logit contributions
ially-0.042
ciation-0.037
ijn-0.036
-0.034
igious-0.034
Tokenandals
Token ID23819
Feature activation+1.23e+00

Pos logit contributions
ially+0.643
ciation+0.564
ijn+0.551
igious+0.516
llo+0.514
Neg logit contributions
 restraint-0.628
���-0.614
otaur-0.609
────────-0.605
 swe-0.581
Token painted
Token ID13055
Feature activation+3.15e+00

Pos logit contributions
 restraint+1.885
otaur+1.813
���+1.806
────────+1.804
 swe+1.748
Neg logit contributions
ially-2.187
ciation-1.994
ijn-1.901
-1.817
igious-1.798
Token the
Token ID262
Feature activation+5.05e-01

Pos logit contributions
 restraint+5.023
otaur+4.797
���+4.780
────────+4.776
 swe+4.725
Neg logit contributions
ially-5.336
ciation-4.875
ijn-4.644
-4.445
igious-4.348
Token word
Token ID1573
Feature activation+6.66e-01

Pos logit contributions
 restraint+0.765
���+0.754
otaur+0.751
────────+0.745
heed+0.713
Neg logit contributions
ially-0.915
ciation-0.816
ijn-0.782
llo-0.748
-0.746
Token Islam
Token ID3449
Feature activation-1.55e+00

Pos logit contributions
 restraint+1.237
otaur+1.207
���+1.205
────────+1.200
 swe+1.160
Neg logit contributions
ially-0.977
ciation-0.854
ijn-0.812
-0.761
igious-0.757
Token in
Token ID287
Feature activation+1.44e-01

Pos logit contributions
ially+2.299
ciation+1.956
ijn+1.857
ication+1.770
+1.758
Neg logit contributions
 restraint-2.878
────────-2.830
���-2.811
otaur-2.808
heed-2.670
Token large
Token ID1588
Feature activation-1.19e+00

Pos logit contributions
 restraint+0.230
otaur+0.226
���+0.226
────────+0.225
heed+0.215
Neg logit contributions
ially-0.249
ciation-0.220
ijn-0.213
llo-0.201
igious-0.201
Token diseases
Token ID10040
Feature activation-2.11e+00

Pos logit contributions
 restraint+0.011
���+0.011
otaur+0.011
────────+0.011
heed+0.010
Neg logit contributions
ially-0.011
ciation-0.010
ijn-0.009
-0.009
igious-0.009
Token as
Token ID355
Feature activation-1.56e-01

Pos logit contributions
ially+3.013
ciation+2.527
ijn+2.396
llo+2.254
igious+2.230
Neg logit contributions
���-4.093
 restraint-3.991
otaur-3.959
────────-3.925
heed-3.744
Token well
Token ID880
Feature activation-6.81e-01

Pos logit contributions
ially+0.258
ciation+0.226
ijn+0.215
llo+0.203
+0.203
Neg logit contributions
 restraint-0.265
otaur-0.260
���-0.260
────────-0.258
heed-0.249
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
ially+1.095
ciation+0.969
ijn+0.924
+0.873
igious+0.867
Neg logit contributions
 restraint-1.172
otaur-1.144
���-1.142
────────-1.136
 swe-1.091
Token<|endoftext|>
Token ID50256
Feature activation-6.71e-02

Pos logit contributions
 restraint+0.265
���+0.259
otaur+0.258
────────+0.256
heed+0.246
Neg logit contributions
ially-0.255
ciation-0.227
ijn-0.217
-0.204
llo-0.204
TokenHA
Token ID7801
Feature activation+3.13e+00

Pos logit contributions
ially+0.118
ciation+0.105
ijn+0.101
+0.096
igious+0.095
Neg logit contributions
 restraint-0.106
���-0.105
otaur-0.104
────────-0.101
 swe-0.099
TokenV
Token ID53
Feature activation-7.20e-01

Pos logit contributions
 restraint+4.145
────────+4.008
otaur+3.998
���+3.928
 swe+3.801
Neg logit contributions
ially-6.168
ciation-5.620
ijn-5.404
-5.165
ication-5.151
TokenANA
Token ID31574
Feature activation+6.71e-02

Pos logit contributions
ially+1.376
ciation+1.233
ijn+1.212
+1.141
llo+1.137
Neg logit contributions
 restraint-1.024
���-1.012
otaur-0.981
────────-0.969
heed-0.940
Token (
Token ID357
Feature activation+8.15e-01

Pos logit contributions
 restraint+0.131
otaur+0.128
���+0.128
────────+0.127
 swe+0.122
Neg logit contributions
ially-0.093
ciation-0.080
ijn-0.076
-0.071
llo-0.071
TokenReuters
Token ID12637
Feature activation+4.68e-01

Pos logit contributions
 restraint+0.926
���+0.902
otaur+0.889
────────+0.872
 swe+0.828
Neg logit contributions
ially-1.797
ciation-1.641
ijn-1.593
-1.538
igious-1.525
Token)
Token ID8
Feature activation+1.41e+00

Pos logit contributions
 restraint+0.939
���+0.927
otaur+0.919
────────+0.912
 swe+0.883
Neg logit contributions
ially-0.622
ciation-0.534
ijn-0.503
-0.469
llo-0.465
Token
Token ID564
Feature activation+1.55e-01

Pos logit contributions
ially+0.490
ciation+0.427
ijn+0.406
+0.383
igious+0.383
Neg logit contributions
 restraint-0.562
otaur-0.551
���-0.550
────────-0.543
heed-0.526
Token
Token ID250
Feature activation-4.38e-01

Pos logit contributions
 restraint+0.159
���+0.153
otaur+0.151
────────+0.148
 swe+0.140
Neg logit contributions
ially-0.360
ciation-0.329
ijn-0.320
-0.310
llo-0.308
Tokenselling
Token ID16473
Feature activation-3.93e-01

Pos logit contributions
ially+0.726
ciation+0.638
ijn+0.610
igious+0.580
+0.579
Neg logit contributions
 restraint-0.737
���-0.719
otaur-0.717
────────-0.707
 swe-0.682
Token
Token ID447
Feature activation+2.43e-02

Pos logit contributions
ially+0.755
ciation+0.678
ijn+0.646
+0.624
llo+0.620
Neg logit contributions
 restraint-0.549
���-0.542
otaur-0.539
────────-0.526
heed-0.507
Token
Token ID251
Feature activation-3.41e-01

Pos logit contributions
 restraint+0.035
otaur+0.034
���+0.033
────────+0.033
 swe+0.032
Neg logit contributions
ially-0.046
ciation-0.042
ijn-0.040
-0.038
igious-0.038
Token fetal
Token ID27652
Feature activation+3.13e+00

Pos logit contributions
ially+0.659
ciation+0.589
ijn+0.562
llo+0.539
+0.537
Neg logit contributions
 restraint-0.476
���-0.472
otaur-0.466
────────-0.461
heed-0.444
Token tissue
Token ID10712
Feature activation-1.01e-01

Pos logit contributions
 restraint+4.003
────────+3.772
otaur+3.713
���+3.615
 swe+3.594
Neg logit contributions
ially-6.308
ciation-5.811
ijn-5.683
-5.404
igious-5.388
Token.
Token ID13
Feature activation-3.57e-02

Pos logit contributions
ially+0.147
ciation+0.128
ijn+0.120
+0.114
llo+0.113
Neg logit contributions
 restraint-0.188
���-0.186
otaur-0.185
────────-0.183
heed-0.177
Token With
Token ID2080
Feature activation-4.44e-01

Pos logit contributions
ially+0.059
ciation+0.053
ijn+0.050
+0.048
llo+0.048
Neg logit contributions
 restraint-0.060
���-0.059
otaur-0.058
────────-0.057
heed-0.055
Token Republicans
Token ID4734
Feature activation-6.03e-01

Pos logit contributions
ially+0.757
ciation+0.668
ijn+0.637
llo+0.611
igious+0.608
Neg logit contributions
 restraint-0.709
otaur-0.701
���-0.700
────────-0.697
heed-0.666
Token using
Token ID1262
Feature activation+1.05e+00

Pos logit contributions
ially+0.964
ciation+0.839
ijn+0.803
llo+0.762
+0.759
Neg logit contributions
 restraint-1.040
otaur-1.029
���-1.027
────────-1.011
heed-0.968
Token unto
Token ID12722
Feature activation+7.47e-01

Pos logit contributions
 restraint+0.230
���+0.229
otaur+0.228
────────+0.225
heed+0.217
Neg logit contributions
ially-0.211
ciation-0.183
ijn-0.174
-0.165
llo-0.165
Token thy
Token ID11906
Feature activation+1.08e+00

Pos logit contributions
 restraint+1.157
otaur+1.125
���+1.121
────────+1.117
 swe+1.068
Neg logit contributions
ially-1.330
ciation-1.192
ijn-1.143
-1.087
igious-1.081
Tokenself
Token ID944
Feature activation+4.55e-01

Pos logit contributions
 restraint+1.519
���+1.489
otaur+1.486
────────+1.475
heed+1.395
Neg logit contributions
ially-2.065
ciation-1.853
ijn-1.789
-1.707
igious-1.706
Token.
Token ID13
Feature activation+4.09e-01

Pos logit contributions
 restraint+0.831
���+0.824
otaur+0.820
────────+0.814
heed+0.782
Neg logit contributions
ially-0.681
ciation-0.592
ijn-0.559
-0.528
llo-0.526
Token De
Token ID1024
Feature activation-3.60e-01

Pos logit contributions
 restraint+0.660
���+0.656
otaur+0.645
────────+0.634
heed+0.611
Neg logit contributions
ially-0.703
ciation-0.629
ijn-0.599
-0.572
llo-0.570
Tokenuter
Token ID11894
Feature activation+3.12e+00

Pos logit contributions
ially+0.750
ciation+0.679
ijn+0.657
llo+0.632
+0.630
Neg logit contributions
 restraint-0.446
���-0.440
otaur-0.437
────────-0.427
 swe-0.405
Tokenonomy
Token ID30565
Feature activation+1.61e+00

Pos logit contributions
 restraint+3.184
otaur+3.063
────────+3.027
���+2.957
 swe+2.825
Neg logit contributions
ially-6.999
ciation-6.469
ijn-6.336
-6.067
igious-6.040
Token 20
Token ID1160
Feature activation-7.62e-02

Pos logit contributions
 restraint+2.841
otaur+2.762
────────+2.756
���+2.749
 swe+2.650
Neg logit contributions
ially-2.502
ciation-2.210
ijn-2.103
-1.985
igious-1.974
Token:
Token ID25
Feature activation+2.10e-02

Pos logit contributions
ially+0.087
ciation+0.073
ijn+0.068
+0.062
llo+0.062
Neg logit contributions
 restraint-0.167
���-0.164
otaur-0.164
────────-0.162
heed-0.158
Token 12
Token ID1105
Feature activation-1.83e-01

Pos logit contributions
���+0.034
 restraint+0.034
otaur+0.033
────────+0.032
heed+0.031
Neg logit contributions
ially-0.037
ciation-0.033
ijn-0.031
-0.030
llo-0.030
Token.
Token ID13
Feature activation+4.13e-01

Pos logit contributions
ially+0.279
ciation+0.245
ijn+0.233
llo+0.219
+0.218
Neg logit contributions
 restraint-0.331
���-0.327
otaur-0.326
────────-0.320
heed-0.308
Token null
Token ID9242
Feature activation-5.89e-01

Pos logit contributions
ially+0.403
ciation+0.356
ijn+0.342
+0.323
llo+0.322
Neg logit contributions
 restraint-0.368
otaur-0.363
���-0.363
────────-0.358
heed-0.345
Token atomic
Token ID17226
Feature activation+1.07e+00

Pos logit contributions
ially+0.838
ciation+0.714
ijn+0.674
+0.635
igious+0.633
Neg logit contributions
 restraint-1.125
���-1.113
otaur-1.105
────────-1.096
 swe-1.059
Token pointer
Token ID17562
Feature activation-2.75e-01

Pos logit contributions
 restraint+1.781
otaur+1.731
────────+1.721
���+1.719
 swe+1.653
Neg logit contributions
ially-1.779
ciation-1.587
ijn-1.517
-1.438
igious-1.429
Token.
Token ID13
Feature activation-4.91e-02

Pos logit contributions
ially+0.442
ciation+0.390
ijn+0.370
+0.350
llo+0.350
Neg logit contributions
 restraint-0.468
���-0.467
otaur-0.459
────────-0.457
heed-0.441
Token pub
Token ID2240
Feature activation+6.80e-01

Pos logit contributions
ially+0.088
ciation+0.079
ijn+0.075
llo+0.071
+0.071
Neg logit contributions
 restraint-0.074
���-0.074
otaur-0.073
────────-0.072
heed-0.070
Token fn
Token ID24714
Feature activation+3.11e+00

Pos logit contributions
 restraint+0.885
���+0.863
otaur+0.860
────────+0.851
heed+0.807
Neg logit contributions
ially-1.380
ciation-1.245
ijn-1.202
-1.151
llo-1.146
Token null
Token ID9242
Feature activation-3.14e-01

Pos logit contributions
 restraint+4.754
otaur+4.573
────────+4.539
���+4.496
 swe+4.431
Neg logit contributions
ially-5.531
ciation-5.024
ijn-4.817
igious-4.588
-4.557
Token ()
Token ID7499
Feature activation+1.29e+00

Pos logit contributions
ially+0.428
ciation+0.366
ijn+0.343
+0.322
ication+0.322
Neg logit contributions
 restraint-0.622
���-0.613
otaur-0.604
────────-0.602
 swe-0.585
Token ->
Token ID4613
Feature activation+6.66e-01

Pos logit contributions
 restraint+2.165
���+2.110
otaur+2.108
────────+2.095
heed+2.010
Neg logit contributions
ially-2.116
ciation-1.876
ijn-1.794
-1.697
llo-1.686
Token Atomic
Token ID28976
Feature activation+1.67e+00

Pos logit contributions
 restraint+0.984
���+0.957
otaur+0.957
────────+0.949
heed+0.908
Neg logit contributions
ially-1.237
ciation-1.111
ijn-1.066
-1.016
igious-1.012
Token <
Token ID1279
Feature activation+6.47e-01

Pos logit contributions
 restraint+2.097
otaur+2.031
────────+2.019
���+2.001
 swe+1.905
Neg logit contributions
ially-3.450
ciation-3.159
ijn-3.039
-2.918
igious-2.908
Tokenross
Token ID1214
Feature activation+6.25e-01

Pos logit contributions
ially+0.225
ciation+0.199
ijn+0.192
igious+0.180
llo+0.180
Neg logit contributions
 restraint-0.216
���-0.212
otaur-0.209
────────-0.209
 swe-0.200
Token.
Token ID13
Feature activation-2.82e-01

Pos logit contributions
 restraint+1.170
���+1.144
otaur+1.143
────────+1.140
 swe+1.095
Neg logit contributions
ially-0.915
ciation-0.799
ijn-0.758
-0.709
igious-0.705
Token Carlton
Token ID44348
Feature activation-5.69e-01

Pos logit contributions
ially+0.466
ciation+0.414
ijn+0.395
llo+0.373
+0.373
Neg logit contributions
 restraint-0.476
���-0.468
otaur-0.465
────────-0.461
heed-0.441
Token L
Token ID406
Feature activation+2.91e-01

Pos logit contributions
ially+1.003
ciation+0.894
ijn+0.864
llo+0.817
+0.812
Neg logit contributions
 restraint-0.900
���-0.883
otaur-0.874
────────-0.868
heed-0.829
Token.
Token ID13
Feature activation-3.90e-01

Pos logit contributions
 restraint+0.521
otaur+0.511
���+0.507
────────+0.505
heed+0.489
Neg logit contributions
ially-0.445
ciation-0.392
ijn-0.371
-0.353
igious-0.348
Token Guth
Token ID44003
Feature activation+3.11e+00

Pos logit contributions
ially+0.690
ciation+0.619
ijn+0.595
+0.564
llo+0.563
Neg logit contributions
 restraint-0.611
���-0.598
otaur-0.595
────────-0.589
heed-0.562
Tokenrie
Token ID5034
Feature activation+5.12e-01

Pos logit contributions
 restraint+3.493
otaur+3.384
────────+3.339
���+3.331
heed+3.176
Neg logit contributions
ially-6.717
ciation-6.166
ijn-5.931
-5.770
igious-5.705
Token.
Token ID13
Feature activation-3.96e-01

Pos logit contributions
 restraint+0.936
otaur+0.912
���+0.911
────────+0.909
 swe+0.876
Neg logit contributions
ially-0.770
ciation-0.674
ijn-0.641
-0.603
igious-0.599
Token Marion
Token ID32473
Feature activation-9.20e-01

Pos logit contributions
ially+0.654
ciation+0.581
ijn+0.554
llo+0.523
+0.522
Neg logit contributions
 restraint-0.669
���-0.659
otaur-0.653
────────-0.647
heed-0.620
Token T
Token ID309
Feature activation-5.57e-02

Pos logit contributions
ially+1.624
ciation+1.449
ijn+1.398
llo+1.326
+1.310
Neg logit contributions
 restraint-1.452
���-1.419
otaur-1.408
────────-1.404
heed-1.332
Token.
Token ID13
Feature activation-4.88e-01

Pos logit contributions
ially+0.084
ciation+0.074
ijn+0.070
+0.066
igious+0.065
Neg logit contributions
 restraint-0.102
otaur-0.099
���-0.099
────────-0.099
 swe-0.095
Token that
Token ID326
Feature activation+2.00e-01

Pos logit contributions
ially+0.317
ciation+0.276
ijn+0.262
+0.248
llo+0.247
Neg logit contributions
 restraint-0.373
���-0.367
otaur-0.366
────────-0.363
heed-0.349
Token Planned
Token ID18451
Feature activation+1.12e+00

Pos logit contributions
 restraint+0.289
otaur+0.286
���+0.284
────────+0.282
heed+0.269
Neg logit contributions
ially-0.375
ciation-0.335
ijn-0.320
llo-0.307
-0.307
Token Parenthood
Token ID18461
Feature activation+1.04e+00

Pos logit contributions
 restraint+0.908
otaur+0.852
────────+0.849
���+0.833
 swe+0.773
Neg logit contributions
ially-2.830
ciation-2.624
ijn-2.553
-2.469
igious-2.461
Token is
Token ID318
Feature activation+4.24e-01

Pos logit contributions
 restraint+1.654
otaur+1.608
���+1.603
────────+1.598
 swe+1.529
Neg logit contributions
ially-1.826
ciation-1.633
ijn-1.565
-1.486
igious-1.478
Token selling
Token ID6301
Feature activation-4.61e-01

Pos logit contributions
 restraint+0.684
otaur+0.670
���+0.667
────────+0.663
heed+0.635
Neg logit contributions
ially-0.728
ciation-0.646
ijn-0.618
-0.585
igious-0.584
Token fetal
Token ID27652
Feature activation+3.10e+00

Pos logit contributions
ially+1.095
ciation+0.988
ijn+0.956
llo+0.930
ication+0.924
Neg logit contributions
���-0.445
otaur-0.436
────────-0.429
 restraint-0.429
heed-0.395
Token tissue
Token ID10712
Feature activation-2.94e-02

Pos logit contributions
 restraint+4.119
────────+3.815
otaur+3.806
 swe+3.712
���+3.705
Neg logit contributions
ially-6.177
ciation-5.712
ijn-5.558
-5.311
igious-5.253
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
ially+0.045
ciation+0.039
ijn+0.037
+0.035
llo+0.035
Neg logit contributions
 restraint-0.053
���-0.053
otaur-0.053
────────-0.052
heed-0.050
Token In
Token ID554
Feature activation-8.57e-02

Pos logit contributions
 restraint+0.223
���+0.219
otaur+0.217
────────+0.213
heed+0.207
Neg logit contributions
ially-0.223
ciation-0.199
ijn-0.189
-0.180
llo-0.180
Token reality
Token ID3950
Feature activation+7.40e-01

Pos logit contributions
ially+0.143
ciation+0.126
ijn+0.120
llo+0.114
+0.114
Neg logit contributions
 restraint-0.141
���-0.139
otaur-0.139
────────-0.138
heed-0.132
Token,
Token ID11
Feature activation+3.33e-01

Pos logit contributions
 restraint+1.333
���+1.300
otaur+1.293
────────+1.291
 swe+1.246
Neg logit contributions
ially-1.132
ciation-0.995
ijn-0.950
-0.892
igious-0.885
Token as
Token ID355
Feature activation-1.63e-01

Pos logit contributions
 restraint+1.634
otaur+1.596
���+1.594
────────+1.586
 swe+1.520
Neg logit contributions
ially-1.580
ciation-1.401
ijn-1.337
-1.264
igious-1.256
Token a
Token ID257
Feature activation+2.21e-02

Pos logit contributions
ially+0.280
ciation+0.247
ijn+0.236
ication+0.222
+0.221
Neg logit contributions
otaur-0.260
 restraint-0.260
���-0.260
────────-0.256
heed-0.246
Token vehicle
Token ID4038
Feature activation-3.31e-01

Pos logit contributions
otaur+0.034
 restraint+0.034
���+0.034
────────+0.034
heed+0.032
Neg logit contributions
ially-0.039
ciation-0.035
ijn-0.033
-0.031
llo-0.031
Token for
Token ID329
Feature activation+1.68e-01

Pos logit contributions
ially+0.503
ciation+0.440
ijn+0.416
+0.390
igious+0.388
Neg logit contributions
 restraint-0.597
���-0.596
otaur-0.590
────────-0.588
heed-0.563
Token making
Token ID1642
Feature activation+5.08e-01

Pos logit contributions
otaur+0.264
���+0.263
 restraint+0.263
────────+0.259
heed+0.247
Neg logit contributions
ially-0.292
ciation-0.257
ijn-0.248
-0.233
igious-0.232
Token passive
Token ID14513
Feature activation+3.09e+00

Pos logit contributions
 restraint+0.760
otaur+0.747
���+0.746
────────+0.739
heed+0.705
Neg logit contributions
ially-0.931
ciation-0.830
ijn-0.799
-0.759
igious-0.756
Token investments
Token ID11115
Feature activation+1.02e+00

Pos logit contributions
 restraint+4.217
────────+3.912
otaur+3.862
 swe+3.799
���+3.793
Neg logit contributions
ially-5.963
ciation-5.507
ijn-5.325
-5.100
igious-5.083
Token unrelated
Token ID19938
Feature activation+6.55e-01

Pos logit contributions
 restraint+1.856
otaur+1.800
────────+1.793
���+1.785
 swe+1.737
Neg logit contributions
ially-1.538
ciation-1.360
ijn-1.294
-1.220
igious-1.209
Token to
Token ID284
Feature activation+2.44e-01

Pos logit contributions
 restraint+1.100
otaur+1.064
────────+1.058
���+1.045
 swe+1.019
Neg logit contributions
ially-1.080
ciation-0.966
ijn-0.923
-0.877
igious-0.869
Token the
Token ID262
Feature activation+4.87e-02

Pos logit contributions
 restraint+0.384
���+0.381
otaur+0.381
────────+0.376
heed+0.358
Neg logit contributions
ially-0.427
ciation-0.377
ijn-0.363
-0.344
igious-0.343
Token business
Token ID1597
Feature activation+7.65e-01

Pos logit contributions
 restraint+0.073
���+0.072
otaur+0.072
────────+0.071
heed+0.068
Neg logit contributions
ially-0.089
ciation-0.079
ijn-0.076
-0.073
llo-0.072
Token evidence
Token ID2370
Feature activation+5.12e-01

Pos logit contributions
 restraint+0.684
���+0.674
otaur+0.672
────────+0.667
heed+0.639
Neg logit contributions
ially-0.754
ciation-0.670
ijn-0.642
-0.609
llo-0.606
Token.
Token ID13
Feature activation+2.90e-01

Pos logit contributions
 restraint+0.921
���+0.902
otaur+0.902
────────+0.897
 swe+0.860
Neg logit contributions
ially-0.785
ciation-0.688
ijn-0.654
-0.616
llo-0.612
Token<|endoftext|>
Token ID50256
Feature activation+1.01e-01

Pos logit contributions
 restraint+0.482
���+0.470
otaur+0.469
────────+0.466
heed+0.446
Neg logit contributions
ially-0.487
ciation-0.434
ijn-0.415
-0.393
llo-0.392
TokenThe
Token ID464
Feature activation+6.34e-01

Pos logit contributions
 restraint+0.160
���+0.158
otaur+0.156
────────+0.153
 swe+0.149
Neg logit contributions
ially-0.178
ciation-0.159
ijn-0.153
-0.146
igious-0.144
Token cookies
Token ID14746
Feature activation+2.45e-01

Pos logit contributions
 restraint+0.985
otaur+0.960
���+0.959
────────+0.955
heed+0.910
Neg logit contributions
ially-1.124
ciation-1.006
ijn-0.965
-0.917
igious-0.913
Token baked
Token ID22979
Feature activation+3.09e+00

Pos logit contributions
 restraint+0.424
otaur+0.412
���+0.411
────────+0.410
 swe+0.395
Neg logit contributions
ially-0.391
ciation-0.347
ijn-0.331
-0.312
igious-0.310
Token quickly
Token ID2952
Feature activation-6.38e-02

Pos logit contributions
 restraint+4.991
otaur+4.839
────────+4.804
���+4.733
 swe+4.689
Neg logit contributions
ially-5.116
ciation-4.651
ijn-4.369
igious-4.168
-4.154
Token at
Token ID379
Feature activation+1.40e+00

Pos logit contributions
ially+0.101
ciation+0.089
ijn+0.084
+0.080
llo+0.079
Neg logit contributions
 restraint-0.112
���-0.110
otaur-0.109
────────-0.109
heed-0.104
Token 375
Token ID29414
Feature activation+4.40e-01

Pos logit contributions
 restraint+2.255
otaur+2.197
���+2.191
────────+2.182
 swe+2.088
Neg logit contributions
ially-2.397
ciation-2.137
ijn-2.047
-1.941
igious-1.931
Token,
Token ID11
Feature activation+6.26e-01

Pos logit contributions
 restraint+0.752
otaur+0.733
���+0.731
────────+0.729
 swe+0.700
Neg logit contributions
ially-0.711
ciation-0.630
ijn-0.602
-0.568
igious-0.565
Token so
Token ID523
Feature activation+7.97e-01

Pos logit contributions
 restraint+1.015
���+0.993
otaur+0.991
────────+0.983
heed+0.941
Neg logit contributions
ially-1.072
ciation-0.951
ijn-0.913
-0.866
llo-0.861
Token-
Token ID12
Feature activation+3.05e-01

Pos logit contributions
 restraint+0.260
otaur+0.254
────────+0.253
���+0.252
 swe+0.244
Neg logit contributions
ially-0.186
ciation-0.162
ijn-0.154
-0.143
igious-0.142
Tokenold
Token ID727
Feature activation+2.78e-01

Pos logit contributions
 restraint+0.440
otaur+0.428
────────+0.425
���+0.422
 swe+0.404
Neg logit contributions
ially-0.573
ciation-0.518
ijn-0.498
-0.475
igious-0.473
Token gay
Token ID5650
Feature activation-1.67e+00

Pos logit contributions
 restraint+0.377
���+0.369
otaur+0.369
────────+0.364
heed+0.346
Neg logit contributions
ially-0.547
ciation-0.492
ijn-0.476
-0.454
llo-0.453
Token son
Token ID3367
Feature activation+1.35e+00

Pos logit contributions
ially+3.268
ciation+2.902
ijn+2.838
llo+2.697
+2.689
Neg logit contributions
���-2.269
 restraint-2.257
otaur-2.212
────────-2.191
heed-2.066
Token,
Token ID11
Feature activation+1.48e-01

Pos logit contributions
 restraint+2.426
otaur+2.365
────────+2.354
���+2.350
 swe+2.273
Neg logit contributions
ially-2.063
ciation-1.821
ijn-1.729
-1.628
igious-1.623
Token Isaiah
Token ID29756
Feature activation+3.07e+00

Pos logit contributions
 restraint+0.249
���+0.246
otaur+0.245
────────+0.243
heed+0.232
Neg logit contributions
ially-0.243
ciation-0.214
ijn-0.205
llo-0.193
-0.193
Token Henderson
Token ID22016
Feature activation-5.09e-01

Pos logit contributions
 restraint+5.191
otaur+5.050
────────+5.042
���+5.002
 swe+4.894
Neg logit contributions
ially-4.850
ciation-4.275
ijn-4.060
-3.864
igious-3.852
Token.
Token ID13
Feature activation-6.16e-02

Pos logit contributions
ially+0.743
ciation+0.650
ijn+0.620
llo+0.584
+0.578
Neg logit contributions
 restraint-0.951
otaur-0.931
���-0.929
────────-0.918
heed-0.884
Token
Token ID198
Feature activation-7.30e-02

Pos logit contributions
ially+0.101
ciation+0.090
ijn+0.086
+0.081
llo+0.081
Neg logit contributions
 restraint-0.105
���-0.102
otaur-0.102
────────-0.101
heed-0.097
Token
Token ID198
Feature activation-1.37e-01

Pos logit contributions
ially+0.121
ciation+0.108
ijn+0.103
+0.097
llo+0.097
Neg logit contributions
 restraint-0.122
���-0.120
otaur-0.119
────────-0.118
 swe-0.113
Token
Token ID447
Feature activation+1.40e-01

Pos logit contributions
ially+0.233
ciation+0.207
ijn+0.198
+0.188
llo+0.187
Neg logit contributions
 restraint-0.226
���-0.224
otaur-0.220
────────-0.214
 swe-0.210
Token who
Token ID508
Feature activation-6.80e-01

Pos logit contributions
ially+0.643
ciation+0.558
ijn+0.535
+0.508
igious+0.506
Neg logit contributions
 restraint-0.671
���-0.664
otaur-0.661
────────-0.657
heed-0.631
Token worked
Token ID3111
Feature activation+2.63e-01

Pos logit contributions
ially+1.162
ciation+1.023
ijn+0.980
llo+0.933
igious+0.924
Neg logit contributions
 restraint-1.091
���-1.086
otaur-1.078
────────-1.075
heed-1.010
Token on
Token ID319
Feature activation-1.95e-01

Pos logit contributions
 restraint+0.478
���+0.471
otaur+0.471
────────+0.469
 swe+0.448
Neg logit contributions
ially-0.397
ciation-0.344
ijn-0.326
igious-0.307
llo-0.307
Token the
Token ID262
Feature activation-2.38e-02

Pos logit contributions
ially+0.339
ciation+0.301
ijn+0.288
llo+0.274
+0.274
Neg logit contributions
 restraint-0.306
���-0.302
otaur-0.301
────────-0.300
heed-0.286
Token CDC
Token ID20434
Feature activation-5.28e-01

Pos logit contributions
ially+0.042
ciation+0.037
ijn+0.035
llo+0.034
+0.034
Neg logit contributions
 restraint-0.037
���-0.037
otaur-0.036
────────-0.036
heed-0.035
Token Guid
Token ID37026
Feature activation-4.52e+00

Pos logit contributions
ially+0.792
ciation+0.683
ijn+0.648
llo+0.617
+0.616
Neg logit contributions
 restraint-0.961
���-0.954
otaur-0.948
────────-0.939
heed-0.902
Tokeneline
Token ID4470
Feature activation-9.25e-01

Pos logit contributions
ially+4.270
ciation+3.333
ication+2.831
igious+2.773
ijn+2.759
Neg logit contributions
���-10.748
────────-10.492
 restraint-10.473
otaur-10.360
 swe-10.051
Token itself
Token ID2346
Feature activation-8.38e-02

Pos logit contributions
ially+1.369
ciation+1.197
ijn+1.124
llo+1.081
+1.059
Neg logit contributions
 restraint-1.690
���-1.674
otaur-1.671
────────-1.668
 swe-1.594
Token.
Token ID13
Feature activation+3.96e-02

Pos logit contributions
ially+0.123
ciation+0.107
ijn+0.101
+0.096
igious+0.095
Neg logit contributions
 restraint-0.156
���-0.154
otaur-0.153
────────-0.152
 swe-0.146
Token They
Token ID1119
Feature activation+1.28e-01

Pos logit contributions
 restraint+0.065
���+0.064
otaur+0.064
────────+0.063
heed+0.061
Neg logit contributions
ially-0.067
ciation-0.060
ijn-0.057
-0.054
llo-0.054
Token see
Token ID766
Feature activation+8.78e-01

Pos logit contributions
 restraint+0.211
���+0.208
otaur+0.206
────────+0.205
 swe+0.195
Neg logit contributions
ially-0.214
ciation-0.190
ijn-0.181
-0.171
llo-0.171
Token who
Token ID508
Feature activation-6.05e-01

Pos logit contributions
ially+0.199
ciation+0.176
ijn+0.168
+0.160
igious+0.159
Neg logit contributions
 restraint-0.180
otaur-0.178
���-0.177
────────-0.177
heed-0.168
Token worked
Token ID3111
Feature activation+2.09e-01

Pos logit contributions
ially+1.045
ciation+0.923
ijn+0.878
llo+0.839
+0.832
Neg logit contributions
 restraint-0.966
otaur-0.954
────────-0.954
���-0.951
heed-0.897
Token on
Token ID319
Feature activation-2.61e-01

Pos logit contributions
 restraint+0.390
���+0.383
otaur+0.383
────────+0.383
 swe+0.366
Neg logit contributions
ially-0.306
ciation-0.264
ijn-0.250
-0.234
llo-0.234
Token the
Token ID262
Feature activation-7.72e-02

Pos logit contributions
ially+0.438
ciation+0.388
ijn+0.372
+0.353
llo+0.351
Neg logit contributions
 restraint-0.425
���-0.418
otaur-0.417
────────-0.416
 swe-0.396
Token CDC
Token ID20434
Feature activation-2.71e-01

Pos logit contributions
ially+0.136
ciation+0.121
ijn+0.116
+0.111
llo+0.110
Neg logit contributions
 restraint-0.120
otaur-0.118
���-0.118
────────-0.117
heed-0.112
Token Guid
Token ID37026
Feature activation-4.38e+00

Pos logit contributions
ially+0.436
ciation+0.384
ijn+0.365
+0.347
igious+0.345
Neg logit contributions
 restraint-0.465
otaur-0.456
���-0.455
────────-0.452
 swe-0.434
Tokeneline
Token ID4470
Feature activation-8.24e-01

Pos logit contributions
ially+4.029
ciation+3.182
ication+2.739
ijn+2.670
llo+2.613
Neg logit contributions
���-11.140
 restraint-10.273
otaur-10.212
────────-10.079
 swe-10.058
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
ially+1.194
ciation+1.016
ijn+0.961
llo+0.916
igious+0.913
Neg logit contributions
 restraint-1.540
���-1.535
otaur-1.525
────────-1.514
 swe-1.451
Token signed
Token ID4488
Feature activation+8.81e-01

Pos logit contributions
ially+0.345
ciation+0.306
ijn+0.290
llo+0.278
ication+0.275
Neg logit contributions
 restraint-0.325
���-0.323
otaur-0.320
────────-0.319
heed-0.302
Token a
Token ID257
Feature activation-2.03e-01

Pos logit contributions
 restraint+1.445
otaur+1.411
���+1.406
────────+1.404
 swe+1.342
Neg logit contributions
ially-1.491
ciation-1.323
ijn-1.263
-1.202
igious-1.197
Token public
Token ID1171
Feature activation-1.66e-01

Pos logit contributions
ially+0.338
ciation+0.297
ijn+0.284
+0.271
llo+0.269
Neg logit contributions
 restraint-0.337
otaur-0.334
���-0.333
────────-0.331
heed-0.316
Tokenpl
Token ID489
Feature activation+5.64e-01

Pos logit contributions
ially+1.296
ciation+1.174
ijn+1.113
llo+1.076
+1.070
Neg logit contributions
 restraint-1.058
���-1.050
otaur-1.020
────────-1.020
 swe-0.974
Tokeni
Token ID72
Feature activation-2.09e-01

Pos logit contributions
 restraint+0.933
otaur+0.909
────────+0.901
���+0.892
heed+0.867
Neg logit contributions
ially-0.942
ciation-0.839
ijn-0.801
-0.764
igious-0.756
Token is
Token ID318
Feature activation+2.70e-01

Pos logit contributions
ially+0.336
ciation+0.297
ijn+0.283
+0.269
llo+0.268
Neg logit contributions
 restraint-0.358
���-0.354
otaur-0.352
────────-0.347
heed-0.334
Token knocked
Token ID13642
Feature activation-5.93e-01

Pos logit contributions
 restraint+0.418
otaur+0.414
���+0.413
────────+0.405
heed+0.388
Neg logit contributions
ially-0.481
ciation-0.425
ijn-0.411
llo-0.390
-0.390
Token by
Token ID416
Feature activation-2.82e-01

Pos logit contributions
ially+0.941
ciation+0.817
ijn+0.788
+0.744
llo+0.738
Neg logit contributions
 restraint-1.029
otaur-1.021
���-1.018
────────-1.002
heed-0.960
Token Mov
Token ID44795
Feature activation-3.72e+00

Pos logit contributions
ially+0.484
ciation+0.427
ijn+0.412
+0.392
igious+0.390
Neg logit contributions
���-0.450
 restraint-0.446
otaur-0.445
────────-0.440
heed-0.420
Tokeni
Token ID72
Feature activation-8.31e-01

Pos logit contributions
ially+6.100
ijn+5.405
ciation+5.397
ication+4.918
igious+4.833
Neg logit contributions
���-6.287
 restraint-6.005
────────-5.890
otaur-5.781
 swe-5.590
Tokeneline
Token ID4470
Feature activation-4.68e-01

Pos logit contributions
ially+1.217
ciation+1.054
ijn+1.000
igious+0.944
ication+0.936
Neg logit contributions
 restraint-1.547
���-1.539
otaur-1.515
────────-1.486
heed-1.442
Token for
Token ID329
Feature activation-5.20e-02

Pos logit contributions
ially+0.725
ciation+0.640
ijn+0.611
+0.579
llo+0.578
Neg logit contributions
 restraint-0.822
���-0.819
otaur-0.813
────────-0.806
heed-0.771
Token mirror
Token ID10162
Feature activation+9.19e-01

Pos logit contributions
ially+0.087
ciation+0.077
ijn+0.074
+0.070
llo+0.070
Neg logit contributions
 restraint-0.085
���-0.084
otaur-0.084
────────-0.083
heed-0.079
Tokening
Token ID278
Feature activation-9.25e-01

Pos logit contributions
 restraint+2.065
otaur+2.026
���+2.022
────────+2.017
 swe+1.956
Neg logit contributions
ially-0.997
ciation-0.826
ijn-0.766
-0.697
igious-0.690
Token rainfall
Token ID25807
Feature activation+4.48e-01

Pos logit contributions
ially+0.177
ciation+0.156
ijn+0.150
+0.142
igious+0.142
Neg logit contributions
���-0.159
 restraint-0.159
otaur-0.157
────────-0.156
heed-0.149
Token."
Token ID526
Feature activation-3.19e-01

Pos logit contributions
 restraint+0.803
���+0.791
otaur+0.789
────────+0.783
heed+0.753
Neg logit contributions
ially-0.687
ciation-0.603
ijn-0.572
-0.539
igious-0.535
Token
Token ID198
Feature activation-3.57e-01

Pos logit contributions
ially+0.530
ciation+0.473
ijn+0.451
+0.429
llo+0.425
Neg logit contributions
 restraint-0.534
���-0.520
otaur-0.519
────────-0.514
 swe-0.494
Token
Token ID198
Feature activation-3.64e-01

Pos logit contributions
ially+0.586
ciation+0.518
ijn+0.494
+0.467
llo+0.467
Neg logit contributions
���-0.607
 restraint-0.605
otaur-0.592
────────-0.584
 swe-0.562
TokenPeter
Token ID19727
Feature activation+6.10e-01

Pos logit contributions
ially+0.621
ciation+0.552
ijn+0.528
+0.507
llo+0.498
Neg logit contributions
���-0.599
 restraint-0.595
otaur-0.580
────────-0.563
 swe-0.556
Token Mell
Token ID32993
Feature activation-3.72e+00

Pos logit contributions
 restraint+0.963
otaur+0.937
���+0.933
────────+0.931
 swe+0.891
Neg logit contributions
ially-1.066
ciation-0.954
ijn-0.912
-0.868
igious-0.864
Tokengard
Token ID19977
Feature activation-4.44e-01

Pos logit contributions
ially+6.360
ijn+5.659
ciation+5.532
llo+5.123
ication+5.110
Neg logit contributions
���-5.871
 restraint-5.835
────────-5.615
otaur-5.389
 swe-5.242
Token/
Token ID14
Feature activation-5.11e-02

Pos logit contributions
ially+0.641
ciation+0.554
ijn+0.532
llo+0.499
+0.494
Neg logit contributions
 restraint-0.838
���-0.822
otaur-0.819
────────-0.814
 swe-0.780
TokenThe
Token ID464
Feature activation-2.94e-01

Pos logit contributions
ially+0.094
ciation+0.084
ijn+0.081
+0.077
igious+0.076
Neg logit contributions
 restraint-0.077
���-0.075
otaur-0.075
────────-0.074
 swe-0.071
Token World
Token ID2159
Feature activation-1.63e-01

Pos logit contributions
ially+0.479
ciation+0.424
ijn+0.406
igious+0.382
+0.382
Neg logit contributions
 restraint-0.500
���-0.491
────────-0.485
otaur-0.485
 swe-0.466
TokenPost
Token ID6307
Feature activation+1.39e+00

Pos logit contributions
ially+0.324
ciation+0.293
ijn+0.283
+0.270
igious+0.270
Neg logit contributions
 restraint-0.219
���-0.214
otaur-0.212
────────-0.210
 swe-0.201
Tokens
Token ID82
Feature activation-2.28e-01

Pos logit contributions
 restraint+0.636
otaur+0.621
���+0.620
────────+0.617
heed+0.593
Neg logit contributions
ially-0.568
ciation-0.500
ijn-0.476
-0.450
igious-0.446
Token is
Token ID318
Feature activation+2.49e-01

Pos logit contributions
ially+0.369
ciation+0.325
ijn+0.309
+0.293
llo+0.291
Neg logit contributions
 restraint-0.390
���-0.383
otaur-0.382
────────-0.380
heed-0.363
Token one
Token ID530
Feature activation+1.89e-01

Pos logit contributions
 restraint+0.412
otaur+0.403
���+0.402
────────+0.399
heed+0.383
Neg logit contributions
ially-0.417
ciation-0.369
ijn-0.353
-0.335
llo-0.333
Token of
Token ID286
Feature activation-2.55e-01

Pos logit contributions
 restraint+0.331
otaur+0.320
────────+0.319
���+0.319
 swe+0.309
Neg logit contributions
ially-0.297
ciation-0.264
ijn-0.253
-0.239
igious-0.236
Token the
Token ID262
Feature activation-1.03e-01

Pos logit contributions
ially+0.429
ciation+0.378
ijn+0.362
+0.342
llo+0.342
Neg logit contributions
 restraint-0.421
otaur-0.412
���-0.412
────────-0.411
heed-0.391
Token focal
Token ID25397
Feature activation-3.55e+00

Pos logit contributions
ially+0.184
ciation+0.164
ijn+0.156
llo+0.149
+0.149
Neg logit contributions
 restraint-0.160
���-0.158
otaur-0.157
────────-0.155
heed-0.149
Token points
Token ID2173
Feature activation-1.50e+00

Pos logit contributions
ially+5.667
ciation+5.050
ijn+4.665
+4.627
ication+4.606
Neg logit contributions
���-6.050
 restraint-5.761
otaur-5.649
────────-5.645
heed-5.493
Token of
Token ID286
Feature activation-1.75e-01

Pos logit contributions
ially+2.250
ciation+1.972
ijn+1.871
+1.761
igious+1.749
Neg logit contributions
 restraint-2.755
otaur-2.692
���-2.690
────────-2.676
heed-2.576
Token the
Token ID262
Feature activation-5.29e-02

Pos logit contributions
ially+0.297
ciation+0.264
ijn+0.254
igious+0.239
+0.239
Neg logit contributions
 restraint-0.284
otaur-0.279
���-0.278
────────-0.278
heed-0.264
Token 2017
Token ID2177
Feature activation+5.40e-01

Pos logit contributions
ially+0.090
ciation+0.081
ijn+0.077
llo+0.073
igious+0.073
Neg logit contributions
 restraint-0.085
���-0.084
otaur-0.083
────────-0.083
heed-0.079
Token Disney
Token ID8519
Feature activation-1.51e+00

Pos logit contributions
 restraint+0.854
otaur+0.831
���+0.830
────────+0.826
 swe+0.789
Neg logit contributions
ially-0.946
ciation-0.846
ijn-0.810
-0.770
igious-0.765
TokenAnyone
Token ID21129
Feature activation+1.34e-02

Pos logit contributions
ially+0.065
ciation+0.058
ijn+0.055
+0.053
llo+0.053
Neg logit contributions
 restraint-0.054
���-0.054
otaur-0.053
────────-0.052
 swe-0.050
Token who
Token ID508
Feature activation-1.21e-01

Pos logit contributions
 restraint+0.026
���+0.026
otaur+0.026
────────+0.026
heed+0.025
Neg logit contributions
ially-0.018
ciation-0.016
ijn-0.015
igious-0.014
-0.014
Token knows
Token ID4206
Feature activation+5.91e-01

Pos logit contributions
ially+0.232
ciation+0.209
ijn+0.200
llo+0.192
+0.191
Neg logit contributions
 restraint-0.171
���-0.167
otaur-0.167
────────-0.165
heed-0.157
Token of
Token ID286
Feature activation+1.58e-01

Pos logit contributions
 restraint+1.113
otaur+1.091
���+1.090
────────+1.081
heed+1.043
Neg logit contributions
ially-0.855
ciation-0.742
ijn-0.704
-0.661
llo-0.657
Token his
Token ID465
Feature activation+8.92e-02

Pos logit contributions
 restraint+0.298
otaur+0.294
���+0.293
────────+0.291
heed+0.280
Neg logit contributions
ially-0.229
ciation-0.199
ijn-0.189
-0.176
llo-0.176
Token whereabouts
Token ID33035
Feature activation-3.55e+00

Pos logit contributions
 restraint+0.167
���+0.166
otaur+0.165
────────+0.163
heed+0.158
Neg logit contributions
ially-0.129
ciation-0.112
ijn-0.106
-0.099
llo-0.099
Token is
Token ID318
Feature activation+4.14e-01

Pos logit contributions
ially+5.711
ciation+5.028
ijn+4.719
llo+4.633
igious+4.574
Neg logit contributions
���-5.906
otaur-5.748
 restraint-5.689
────────-5.462
heed-5.233
Token urged
Token ID11643
Feature activation-5.33e-01

Pos logit contributions
 restraint+0.728
otaur+0.709
────────+0.707
���+0.706
 swe+0.681
Neg logit contributions
ially-0.649
ciation-0.575
ijn-0.547
-0.517
igious-0.513
Token to
Token ID284
Feature activation-1.07e-01

Pos logit contributions
ially+0.879
ciation+0.780
ijn+0.749
+0.708
igious+0.703
Neg logit contributions
 restraint-0.898
otaur-0.869
────────-0.866
���-0.866
 swe-0.833
Token call
Token ID869
Feature activation+1.17e-01

Pos logit contributions
ially+0.165
ciation+0.145
ijn+0.137
llo+0.129
+0.129
Neg logit contributions
 restraint-0.189
���-0.188
otaur-0.188
────────-0.185
heed-0.177
Token NYPD
Token ID27615
Feature activation+3.54e-01

Pos logit contributions
 restraint+0.192
otaur+0.188
���+0.188
────────+0.186
heed+0.178
Neg logit contributions
ially-0.198
ciation-0.176
ijn-0.169
-0.160
llo-0.159
Token With
Token ID2080
Feature activation-4.43e-01

Pos logit contributions
ially+0.001
ciation+0.001
ijn+0.001
+0.001
llo+0.001
Neg logit contributions
 restraint-0.001
���-0.001
otaur-0.001
────────-0.001
heed-0.001
Token the
Token ID262
Feature activation-2.87e-02

Pos logit contributions
ially+0.744
ciation+0.653
ijn+0.627
igious+0.598
+0.598
Neg logit contributions
 restraint-0.716
otaur-0.710
���-0.709
────────-0.693
heed-0.677
Token words
Token ID2456
Feature activation-3.35e-01

Pos logit contributions
ially+0.049
ciation+0.044
ijn+0.042
+0.040
igious+0.040
Neg logit contributions
 restraint-0.046
otaur-0.045
���-0.045
────────-0.044
heed-0.043
Token of
Token ID286
Feature activation-5.03e-02

Pos logit contributions
ially+0.556
ciation+0.493
ijn+0.467
+0.441
igious+0.441
Neg logit contributions
 restraint-0.559
���-0.550
otaur-0.549
────────-0.542
heed-0.525
Token Max
Token ID5436
Feature activation-5.93e-02

Pos logit contributions
ially+0.088
ciation+0.078
ijn+0.075
+0.071
llo+0.071
Neg logit contributions
 restraint-0.079
otaur-0.077
���-0.077
────────-0.076
heed-0.074
Token
Token ID40790
Feature activation-3.49e+00

Pos logit contributions
ially+0.087
ciation+0.076
ijn+0.073
+0.068
llo+0.067
Neg logit contributions
 restraint-0.111
���-0.108
otaur-0.108
────────-0.107
 swe-0.103
Tokenller
Token ID6051
Feature activation-1.09e+00

Pos logit contributions
ially+3.606
ciation+3.082
ijn+2.975
llo+2.688
ication+2.530
Neg logit contributions
 restraint-7.844
���-7.802
────────-7.675
otaur-7.521
 swe-7.307
Token,
Token ID11
Feature activation-1.92e-01

Pos logit contributions
ially+1.550
ciation+1.346
ijn+1.295
+1.202
llo+1.198
Neg logit contributions
 restraint-2.056
otaur-2.024
���-2.015
────────-1.977
heed-1.929
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
ially+0.317
ciation+0.281
ijn+0.270
+0.256
ication+0.254
Neg logit contributions
 restraint-0.320
otaur-0.315
���-0.313
────────-0.308
heed-0.301
Token Greek
Token ID8312
Feature activation+1.19e+00

Pos logit contributions
ially+0.177
ciation+0.158
ijn+0.152
+0.145
llo+0.144
Neg logit contributions
 restraint-0.156
otaur-0.154
���-0.154
────────-0.150
heed-0.146
Token
Token ID564
Feature activation-4.63e-01

Pos logit contributions
 restraint+1.690
otaur+1.639
���+1.635
────────+1.626
 swe+1.547
Neg logit contributions
ially-2.279
ciation-2.059
ijn-1.980
-1.892
igious-1.883
Token of
Token ID286
Feature activation-1.69e-01

Pos logit contributions
 restraint+1.178
otaur+1.151
────────+1.141
���+1.135
 swe+1.099
Neg logit contributions
ially-1.062
ciation-0.939
ijn-0.897
-0.843
llo-0.836
Token curses
Token ID43878
Feature activation+8.23e-01

Pos logit contributions
ially+0.292
ciation+0.259
ijn+0.248
+0.237
igious+0.236
Neg logit contributions
���-0.268
 restraint-0.267
otaur-0.264
────────-0.261
heed-0.252
Token,
Token ID11
Feature activation-6.13e-02

Pos logit contributions
 restraint+1.527
otaur+1.491
���+1.487
────────+1.483
 swe+1.429
Neg logit contributions
ially-1.213
ciation-1.062
ijn-1.008
-0.946
igious-0.939
Token injuries
Token ID6821
Feature activation-8.21e-01

Pos logit contributions
ially+0.112
ciation+0.100
ijn+0.095
igious+0.091
+0.091
Neg logit contributions
���-0.091
 restraint-0.091
otaur-0.090
────────-0.089
heed-0.085
Token or
Token ID393
Feature activation-2.28e-01

Pos logit contributions
ially+1.155
ciation+1.004
ijn+0.948
+0.887
llo+0.885
Neg logit contributions
 restraint-1.567
���-1.550
otaur-1.541
────────-1.533
heed-1.474
Token kidn
Token ID12668
Feature activation-3.42e+00

Pos logit contributions
ially+0.416
ciation+0.371
ijn+0.355
igious+0.340
+0.340
Neg logit contributions
���-0.338
 restraint-0.335
otaur-0.332
────────-0.328
heed-0.315
Tokenappings
Token ID39242
Feature activation-8.26e-01

Pos logit contributions
ially+2.940
ciation+2.357
ijn+2.306
igious+2.064
ication+1.991
Neg logit contributions
���-8.062
 restraint-8.059
────────-8.034
otaur-7.971
 swe-7.649
Token of
Token ID286
Feature activation+1.68e-01

Pos logit contributions
ially+1.181
ciation+1.026
ijn+0.967
+0.909
llo+0.907
Neg logit contributions
 restraint-1.560
���-1.554
otaur-1.537
────────-1.526
heed-1.469
Token non
Token ID1729
Feature activation+1.10e+00

Pos logit contributions
 restraint+0.276
���+0.273
otaur+0.271
────────+0.269
heed+0.258
Neg logit contributions
ially-0.283
ciation-0.249
ijn-0.240
-0.227
llo-0.226
Token player
Token ID2137
Feature activation+8.84e-01

Pos logit contributions
 restraint+2.067
otaur+2.014
────────+2.005
���+2.002
 swe+1.931
Neg logit contributions
ially-1.597
ciation-1.397
ijn-1.327
-1.239
igious-1.232
Token characters
Token ID3435
Feature activation+6.90e-02

Pos logit contributions
 restraint+1.322
otaur+1.285
���+1.281
────────+1.276
 swe+1.217
Neg logit contributions
ially-1.622
ciation-1.458
ijn-1.400
-1.334
igious-1.327
Token,
Token ID11
Feature activation-7.39e-02

Pos logit contributions
ially+0.827
ciation+0.722
ijn+0.703
llo+0.668
+0.664
Neg logit contributions
 restraint-0.772
���-0.757
otaur-0.754
────────-0.748
heed-0.711
Token with
Token ID351
Feature activation-2.24e-01

Pos logit contributions
ially+0.121
ciation+0.105
ijn+0.102
+0.097
llo+0.096
Neg logit contributions
 restraint-0.125
otaur-0.123
���-0.122
────────-0.121
heed-0.116
Token the
Token ID262
Feature activation+1.07e-01

Pos logit contributions
ially+0.384
ciation+0.335
ijn+0.324
llo+0.309
+0.308
Neg logit contributions
 restraint-0.357
otaur-0.354
���-0.353
────────-0.351
 swe-0.333
Token advice
Token ID5608
Feature activation-2.94e-01

Pos logit contributions
 restraint+0.166
otaur+0.163
���+0.163
────────+0.162
heed+0.155
Neg logit contributions
ially-0.188
ciation-0.167
ijn-0.161
llo-0.153
-0.153
Token of
Token ID286
Feature activation-1.46e-01

Pos logit contributions
ially+0.444
ciation+0.388
ijn+0.369
+0.347
llo+0.346
Neg logit contributions
 restraint-0.534
otaur-0.522
���-0.522
────────-0.519
heed-0.499
Token Gand
Token ID17727
Feature activation-3.37e+00

Pos logit contributions
ially+0.251
ciation+0.222
ijn+0.214
+0.203
llo+0.202
Neg logit contributions
 restraint-0.234
otaur-0.229
���-0.228
────────-0.228
 swe-0.217
Tokenalf
Token ID1604
Feature activation+1.02e+00

Pos logit contributions
ially+7.137
ijn+6.148
ciation+6.129
llo+5.991
ication+5.836
Neg logit contributions
���-4.398
 restraint-4.067
────────-4.022
otaur-3.882
 swe-3.637
Token,
Token ID11
Feature activation-3.20e-01

Pos logit contributions
 restraint+1.857
otaur+1.810
���+1.803
────────+1.802
 swe+1.736
Neg logit contributions
ially-1.542
ciation-1.363
ijn-1.292
-1.214
igious-1.211
Token chooses
Token ID19769
Feature activation+5.43e-01

Pos logit contributions
ially+0.571
ciation+0.499
ijn+0.481
llo+0.465
+0.462
Neg logit contributions
 restraint-0.492
���-0.486
otaur-0.485
────────-0.481
heed-0.456
Token companions
Token ID19429
Feature activation-4.55e-01

Pos logit contributions
 restraint+0.911
otaur+0.890
���+0.888
────────+0.884
 swe+0.848
Neg logit contributions
ially-0.898
ciation-0.795
ijn-0.760
-0.720
llo-0.715
Token for
Token ID329
Feature activation-7.38e-03

Pos logit contributions
ially+0.725
ciation+0.630
ijn+0.599
llo+0.568
+0.566
Neg logit contributions
 restraint-0.797
���-0.787
otaur-0.778
────────-0.774
 swe-0.741
TokenYou
Token ID1639
Feature activation+2.75e-01

Pos logit contributions
ially+0.578
ciation+0.512
ijn+0.488
+0.467
igious+0.466
Neg logit contributions
 restraint-0.555
���-0.546
otaur-0.542
────────-0.536
 swe-0.516
Token contributed
Token ID8639
Feature activation+1.31e+00

Pos logit contributions
 restraint+0.448
���+0.439
otaur+0.436
────────+0.433
 swe+0.413
Neg logit contributions
ially-0.468
ciation-0.416
ijn-0.398
-0.378
llo-0.378
Token money
Token ID1637
Feature activation+7.18e-02

Pos logit contributions
 restraint+2.181
otaur+2.126
���+2.120
────────+2.112
 swe+2.025
Neg logit contributions
ially-2.195
ciation-1.951
ijn-1.864
-1.767
igious-1.756
Token to
Token ID284
Feature activation+1.99e-02

Pos logit contributions
 restraint+0.126
���+0.124
otaur+0.123
────────+0.122
heed+0.117
Neg logit contributions
ially-0.114
ciation-0.100
ijn-0.095
-0.090
llo-0.090
Token Al
Token ID978
Feature activation-5.58e-01

Pos logit contributions
 restraint+0.032
otaur+0.032
���+0.032
────────+0.032
 swe+0.030
Neg logit contributions
ially-0.034
ciation-0.030
ijn-0.028
-0.027
igious-0.027
Token Qaeda
Token ID17942
Feature activation-3.35e+00

Pos logit contributions
ially+0.777
ciation+0.663
ijn+0.634
igious+0.597
+0.594
Neg logit contributions
 restraint-1.082
otaur-1.058
���-1.056
────────-1.044
 swe-1.009
Token
Token ID447
Feature activation-3.96e-01

Pos logit contributions
ially+5.326
ciation+4.551
ijn+4.328
+4.216
roups+4.168
Neg logit contributions
���-5.657
otaur-5.531
────────-5.491
 restraint-5.482
 swe-5.103
Token
Token ID251
Feature activation-2.22e-01

Pos logit contributions
ially+0.791
ciation+0.714
ijn+0.690
+0.663
llo+0.659
Neg logit contributions
 restraint-0.528
otaur-0.510
────────-0.506
���-0.506
 swe-0.481
Token)
Token ID8
Feature activation+3.18e-01

Pos logit contributions
ially+0.345
ciation+0.303
ijn+0.288
+0.273
llo+0.271
Neg logit contributions
 restraint-0.394
���-0.387
otaur-0.385
────────-0.382
 swe-0.368
Token without
Token ID1231
Feature activation+4.55e-01

Pos logit contributions
 restraint+0.551
���+0.542
otaur+0.541
────────+0.535
heed+0.513
Neg logit contributions
ially-0.510
ciation-0.448
ijn-0.426
-0.402
llo-0.402
Token revealing
Token ID13477
Feature activation+8.69e-01

Pos logit contributions
 restraint+0.744
otaur+0.736
���+0.734
────────+0.726
heed+0.696
Neg logit contributions
ially-0.769
ciation-0.676
ijn-0.649
-0.616
llo-0.615
Token've
Token ID1053
Feature activation-2.45e-01

Pos logit contributions
ially+1.566
ciation+1.378
ijn+1.321
igious+1.253
llo+1.247
Neg logit contributions
 restraint-1.576
���-1.564
otaur-1.551
────────-1.529
 swe-1.459
Token been
Token ID587
Feature activation-8.31e-01

Pos logit contributions
ially+0.423
ciation+0.375
ijn+0.360
llo+0.341
+0.338
Neg logit contributions
 restraint-0.393
otaur-0.388
���-0.385
────────-0.381
heed-0.362
Token the
Token ID262
Feature activation-1.36e-01

Pos logit contributions
ially+1.445
ciation+1.259
ijn+1.220
llo+1.169
ication+1.150
Neg logit contributions
otaur-1.322
 restraint-1.302
���-1.294
────────-1.278
heed-1.224
Token manager
Token ID4706
Feature activation-7.72e-01

Pos logit contributions
ially+0.242
ciation+0.213
ijn+0.205
llo+0.198
+0.194
Neg logit contributions
 restraint-0.209
���-0.208
otaur-0.207
────────-0.205
heed-0.196
Token at
Token ID379
Feature activation+7.75e-01

Pos logit contributions
ially+1.155
ciation+0.992
ijn+0.941
+0.883
llo+0.881
Neg logit contributions
 restraint-1.418
���-1.413
otaur-1.391
────────-1.379
heed-1.325
Token Pop
Token ID8099
Feature activation-3.28e+00

Pos logit contributions
 restraint+1.280
otaur+1.256
���+1.255
────────+1.246
heed+1.190
Neg logit contributions
ially-1.295
ciation-1.149
ijn-1.103
-1.038
igious-1.037
Tokeneyes
Token ID48418
Feature activation-5.78e-01

Pos logit contributions
ially+5.657
ciation+5.104
ijn+5.041
igious+4.868
llo+4.792
Neg logit contributions
���-4.867
 restraint-4.858
────────-4.784
otaur-4.774
heed-4.476
Token right
Token ID826
Feature activation-1.56e+00

Pos logit contributions
ially+0.872
ciation+0.764
ijn+0.728
+0.685
igious+0.681
Neg logit contributions
 restraint-1.049
���-1.029
otaur-1.027
────────-1.017
heed-0.976
Token now
Token ID783
Feature activation-2.83e-01

Pos logit contributions
ially+3.113
ciation+2.774
ijn+2.688
llo+2.572
igious+2.568
Neg logit contributions
���-2.073
 restraint-2.064
otaur-2.025
────────-1.977
heed-1.889
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
ially+0.414
ciation+0.362
ijn+0.343
+0.321
llo+0.319
Neg logit contributions
 restraint-0.528
���-0.519
otaur-0.517
────────-0.511
heed-0.493
Token if
Token ID611
Feature activation-7.53e-01

Pos logit contributions
 restraint+0.148
���+0.144
otaur+0.144
────────+0.142
heed+0.136
Neg logit contributions
ially-0.193
ciation-0.174
ijn-0.168
-0.160
llo-0.160
Token 6
Token ID718
Feature activation+4.57e-01

Pos logit contributions
ially+1.409
ciation+1.219
ijn+1.173
llo+1.100
+1.092
Neg logit contributions
���-1.794
 restraint-1.757
otaur-1.726
────────-1.698
heed-1.652
Token,
Token ID11
Feature activation+3.55e-01

Pos logit contributions
 restraint+0.841
otaur+0.821
────────+0.818
���+0.806
 swe+0.787
Neg logit contributions
ially-0.679
ciation-0.596
ijn-0.566
-0.531
igious-0.529
Token 2010
Token ID3050
Feature activation-3.81e-01

Pos logit contributions
 restraint+0.676
���+0.659
otaur+0.655
────────+0.654
 swe+0.635
Neg logit contributions
ially-0.504
ciation-0.444
ijn-0.417
-0.394
llo-0.392
Token
Token ID198
Feature activation+3.67e-01

Pos logit contributions
ially+0.622
ciation+0.549
ijn+0.523
llo+0.497
+0.496
Neg logit contributions
���-0.651
 restraint-0.649
otaur-0.635
────────-0.625
 swe-0.602
Token
Token ID198
Feature activation+2.94e-01

Pos logit contributions
 restraint+0.592
otaur+0.575
────────+0.574
���+0.561
 swe+0.549
Neg logit contributions
ially-0.629
ciation-0.563
ijn-0.538
-0.511
igious-0.508
TokenSubject
Token ID19776
Feature activation-3.27e+00

Pos logit contributions
 restraint+0.468
���+0.465
otaur+0.457
────────+0.449
 swe+0.435
Neg logit contributions
ially-0.514
ciation-0.456
ijn-0.436
-0.419
llo-0.416
Token:
Token ID25
Feature activation-8.15e-01

Pos logit contributions
ially+3.136
ciation+2.470
ijn+2.228
llo+2.084
+2.039
Neg logit contributions
���-8.330
 restraint-7.665
otaur-7.588
 adolesc-7.309
────────-7.307
Token R
Token ID371
Feature activation+3.15e-01

Pos logit contributions
ially+1.391
ciation+1.228
ijn+1.176
llo+1.116
+1.115
Neg logit contributions
 restraint-1.299
���-1.298
otaur-1.294
────────-1.278
heed-1.219
Tokeniff
Token ID733
Feature activation+1.41e-01

Pos logit contributions
 restraint+0.507
���+0.497
otaur+0.495
────────+0.493
 swe+0.470
Neg logit contributions
ially-0.543
ciation-0.482
ijn-0.464
igious-0.438
llo-0.437
TokenTra
Token ID15721
Feature activation-3.28e-01

Pos logit contributions
 restraint+0.274
���+0.269
otaur+0.268
────────+0.268
 swe+0.257
Neg logit contributions
ially-0.196
ciation-0.170
ijn-0.161
-0.150
igious-0.149
Tokenx
Token ID87
Feature activation-1.27e+00

Pos logit contributions
ially+0.486
ciation+0.425
ijn+0.408
llo+0.383
+0.382
Neg logit contributions
 restraint-0.607
���-0.598
────────-0.593
otaur-0.592
 swe-0.568
Token 9
Token ID860
Feature activation+2.73e-01

Pos logit contributions
ially+1.910
ciation+1.644
ijn+1.594
llo+1.502
+1.480
Neg logit contributions
���-2.469
 restraint-2.395
otaur-2.353
────────-2.311
heed-2.259
Token,
Token ID11
Feature activation+3.07e-01

Pos logit contributions
 restraint+0.506
otaur+0.493
────────+0.491
���+0.489
 swe+0.473
Neg logit contributions
ially-0.402
ciation-0.352
ijn-0.334
-0.314
igious-0.312
Token 2010
Token ID3050
Feature activation-4.25e-01

Pos logit contributions
 restraint+0.582
���+0.568
otaur+0.565
────────+0.564
 swe+0.547
Neg logit contributions
ially-0.437
ciation-0.383
ijn-0.362
-0.341
llo-0.339
Token
Token ID198
Feature activation+4.10e-01

Pos logit contributions
ially+0.697
ciation+0.615
ijn+0.587
+0.556
llo+0.556
Neg logit contributions
 restraint-0.720
���-0.719
otaur-0.703
────────-0.694
 swe-0.667
Token
Token ID198
Feature activation+3.32e-01

Pos logit contributions
 restraint+0.658
otaur+0.639
────────+0.638
���+0.621
 swe+0.610
Neg logit contributions
ially-0.703
ciation-0.630
ijn-0.603
-0.572
igious-0.570
TokenSubject
Token ID19776
Feature activation-3.24e+00

Pos logit contributions
 restraint+0.525
���+0.521
otaur+0.512
────────+0.504
 swe+0.488
Neg logit contributions
ially-0.582
ciation-0.517
ijn-0.495
-0.474
llo-0.471
Token:
Token ID25
Feature activation-8.52e-01

Pos logit contributions
ially+3.110
ciation+2.451
ijn+2.216
llo+2.074
+2.023
Neg logit contributions
���-8.280
 restraint-7.619
otaur-7.534
 adolesc-7.270
────────-7.252
Token R
Token ID371
Feature activation+2.14e-01

Pos logit contributions
ially+1.447
ciation+1.277
ijn+1.226
llo+1.162
+1.161
Neg logit contributions
���-1.363
 restraint-1.363
otaur-1.360
────────-1.339
heed-1.277
Tokenipe
Token ID3757
Feature activation-5.25e-01

Pos logit contributions
 restraint+0.176
���+0.170
otaur+0.168
────────+0.166
 swe+0.151
Neg logit contributions
ially-0.539
ciation-0.497
ijn-0.485
igious-0.467
llo-0.466
Token for
Token ID329
Feature activation+2.82e-01

Pos logit contributions
ially+0.882
ciation+0.782
ijn+0.750
llo+0.716
igious+0.710
Neg logit contributions
 restraint-0.860
���-0.855
otaur-0.850
────────-0.844
heed-0.799
Token M
Token ID337
Feature activation-2.79e-01

Pos logit contributions
 restraint+0.461
���+0.455
otaur+0.454
────────+0.452
heed+0.429
Neg logit contributions
ially-0.473
ciation-0.422
ijn-0.403
-0.381
llo-0.380
Token potential
Token ID2785
Feature activation-8.88e-01

Pos logit contributions
ially+0.653
ciation+0.576
ijn+0.551
ication+0.525
+0.522
Neg logit contributions
otaur-0.600
 restraint-0.599
���-0.599
────────-0.592
heed-0.568
Token side
Token ID1735
Feature activation-5.72e-01

Pos logit contributions
ially+1.676
ciation+1.480
ijn+1.421
ication+1.367
+1.360
Neg logit contributions
���-1.280
 restraint-1.254
otaur-1.251
────────-1.236
heed-1.193
Token effects
Token ID3048
Feature activation-1.02e+00

Pos logit contributions
ially+0.913
ciation+0.802
ijn+0.767
+0.726
igious+0.724
Neg logit contributions
 restraint-0.991
otaur-0.974
���-0.974
────────-0.958
 swe-0.923
Token.
Token ID13
Feature activation-3.95e-01

Pos logit contributions
ially+1.443
ciation+1.240
ijn+1.174
llo+1.098
+1.096
Neg logit contributions
���-1.946
 restraint-1.939
otaur-1.929
────────-1.904
heed-1.836
Token The
Token ID383
Feature activation-3.18e-01

Pos logit contributions
ially+0.654
ciation+0.581
ijn+0.554
+0.528
llo+0.527
Neg logit contributions
 restraint-0.668
���-0.663
otaur-0.651
────────-0.635
heed-0.622
Token aim
Token ID4031
Feature activation-3.23e+00

Pos logit contributions
ially+0.438
ciation+0.376
ijn+0.357
llo+0.332
+0.331
Neg logit contributions
 restraint-0.616
���-0.609
otaur-0.608
────────-0.605
heed-0.583
Token of
Token ID286
Feature activation-6.96e-01

Pos logit contributions
ially+4.554
ciation+3.877
ijn+3.636
+3.430
ication+3.413
Neg logit contributions
 restraint-6.078
otaur-5.997
���-5.989
────────-5.950
heed-5.779
Token this
Token ID428
Feature activation-1.10e+00

Pos logit contributions
ially+1.148
ciation+1.010
ijn+0.976
ication+0.915
+0.909
Neg logit contributions
 restraint-1.155
otaur-1.144
────────-1.134
���-1.122
heed-1.082
Token study
Token ID2050
Feature activation-1.07e+00

Pos logit contributions
ially+1.772
ciation+1.550
ijn+1.500
ication+1.408
llo+1.404
Neg logit contributions
otaur-1.855
 restraint-1.838
���-1.835
────────-1.825
heed-1.743
Token is
Token ID318
Feature activation-2.46e-01

Pos logit contributions
ially+1.796
ciation+1.589
ijn+1.516
+1.431
igious+1.428
Neg logit contributions
 restraint-1.773
otaur-1.748
���-1.737
────────-1.734
heed-1.654
Token to
Token ID284
Feature activation-7.55e-01

Pos logit contributions
ially+0.406
ciation+0.360
ijn+0.346
+0.327
igious+0.326
Neg logit contributions
 restraint-0.413
���-0.401
otaur-0.401
────────-0.398
 swe-0.383
Tokenviation
Token ID47625
Feature activation-3.99e-01

Pos logit contributions
ially+0.350
ciation+0.289
ijn+0.263
+0.233
igious+0.231
Neg logit contributions
 restraint-0.849
otaur-0.832
���-0.830
────────-0.829
 swe-0.804
Token for
Token ID329
Feature activation+4.17e-02

Pos logit contributions
ially+0.642
ciation+0.567
ijn+0.536
+0.509
llo+0.505
Neg logit contributions
 restraint-0.687
���-0.678
otaur-0.675
────────-0.671
heed-0.645
Token duty
Token ID7077
Feature activation+1.06e+00

Pos logit contributions
 restraint+0.068
���+0.067
otaur+0.067
────────+0.066
heed+0.063
Neg logit contributions
ially-0.071
ciation-0.063
ijn-0.060
-0.057
llo-0.057
Token status
Token ID3722
Feature activation-1.58e-01

Pos logit contributions
 restraint+1.758
���+1.717
otaur+1.717
────────+1.708
 swe+1.634
Neg logit contributions
ially-1.774
ciation-1.577
ijn-1.506
-1.429
igious-1.418
Token
Token ID784
Feature activation+1.44e-01

Pos logit contributions
ially+0.239
ciation+0.210
ijn+0.198
+0.188
llo+0.186
Neg logit contributions
 restraint-0.286
���-0.283
otaur-0.282
────────-0.279
heed-0.269
Token whereabouts
Token ID33035
Feature activation-3.21e+00

Pos logit contributions
 restraint+0.230
���+0.228
otaur+0.227
────────+0.225
heed+0.215
Neg logit contributions
ially-0.249
ciation-0.222
ijn-0.211
-0.202
llo-0.200
Token unknown
Token ID6439
Feature activation-8.03e-01

Pos logit contributions
ially+4.666
ciation+3.972
ijn+3.847
+3.648
llo+3.612
Neg logit contributions
���-5.848
otaur-5.740
 restraint-5.653
────────-5.568
heed-5.366
Token)
Token ID8
Feature activation+3.36e-01

Pos logit contributions
ially+1.096
ciation+0.939
ijn+0.888
+0.832
llo+0.824
Neg logit contributions
 restraint-1.568
���-1.566
otaur-1.547
────────-1.526
heed-1.478
Token TIM
Token ID31742
Feature activation+7.33e-02

Pos logit contributions
 restraint+0.540
���+0.529
otaur+0.528
────────+0.522
heed+0.501
Neg logit contributions
ially-0.578
ciation-0.514
ijn-0.494
-0.467
llo-0.466
TokenEL
Token ID3698
Feature activation-1.17e+00

Pos logit contributions
 restraint+0.124
otaur+0.120
���+0.120
────────+0.120
 swe+0.115
Neg logit contributions
ially-0.121
ciation-0.107
ijn-0.102
-0.097
llo-0.096
TokenINE
Token ID8881
Feature activation-4.77e-01

Pos logit contributions
ially+2.138
ciation+1.886
ijn+1.850
+1.748
igious+1.724
Neg logit contributions
 restraint-1.767
���-1.751
otaur-1.729
────────-1.675
 swe-1.632
Token create
Token ID2251
Feature activation+2.57e-01

Pos logit contributions
ially+0.373
ciation+0.327
ijn+0.314
llo+0.298
+0.297
Neg logit contributions
 restraint-0.351
���-0.348
otaur-0.347
────────-0.340
heed-0.327
Token a
Token ID257
Feature activation-2.04e-01

Pos logit contributions
 restraint+0.405
���+0.398
otaur+0.397
────────+0.393
heed+0.378
Neg logit contributions
ially-0.452
ciation-0.401
ijn-0.385
-0.366
llo-0.365
Token distributed
Token ID9387
Feature activation-7.94e-01

Pos logit contributions
ially+0.341
ciation+0.301
ijn+0.288
llo+0.273
+0.273
Neg logit contributions
 restraint-0.335
otaur-0.330
���-0.330
────────-0.325
heed-0.314
Token algorithm
Token ID11862
Feature activation+4.05e-01

Pos logit contributions
ially+1.322
ciation+1.159
ijn+1.105
llo+1.050
+1.038
Neg logit contributions
 restraint-1.317
���-1.310
otaur-1.304
────────-1.290
heed-1.238
Token that
Token ID326
Feature activation-7.58e-02

Pos logit contributions
 restraint+0.708
���+0.693
otaur+0.692
────────+0.687
heed+0.661
Neg logit contributions
ially-0.642
ciation-0.566
ijn-0.538
-0.507
igious-0.504
Token conver
Token ID6718
Feature activation-3.18e+00

Pos logit contributions
ially+0.138
ciation+0.123
ijn+0.118
+0.113
llo+0.113
Neg logit contributions
 restraint-0.114
otaur-0.112
���-0.111
────────-0.110
heed-0.106
Tokenged
Token ID2004
Feature activation-1.37e+00

Pos logit contributions
ially+4.040
ciation+3.380
ijn+3.093
llo+2.958
igious+2.930
Neg logit contributions
 restraint-6.498
���-6.423
otaur-6.345
────────-6.256
heed-6.095
Token to
Token ID284
Feature activation-4.15e-02

Pos logit contributions
ially+2.240
ciation+1.936
ijn+1.846
llo+1.765
+1.728
Neg logit contributions
 restraint-2.305
���-2.303
otaur-2.272
────────-2.241
heed-2.187
Token this
Token ID428
Feature activation-6.01e-01

Pos logit contributions
ially+0.071
ciation+0.064
ijn+0.061
+0.058
llo+0.057
Neg logit contributions
 restraint-0.066
���-0.065
otaur-0.065
────────-0.064
heed-0.062
Token operation
Token ID4905
Feature activation-7.08e-01

Pos logit contributions
ially+1.036
ciation+0.916
ijn+0.879
llo+0.833
+0.830
Neg logit contributions
 restraint-0.958
���-0.950
otaur-0.949
────────-0.930
heed-0.899
Token point
Token ID966
Feature activation-1.20e+00

Pos logit contributions
ially+1.080
ciation+0.942
ijn+0.882
+0.832
igious+0.829
Neg logit contributions
���-1.279
 restraint-1.278
otaur-1.252
────────-1.247
heed-1.203
Token 13
Token ID1511
Feature activation+9.40e-02

Pos logit contributions
ially+0.864
ciation+0.744
ijn+0.718
llo+0.676
+0.671
Neg logit contributions
���-1.065
 restraint-1.050
otaur-1.031
────────-1.015
heed-0.986
Token,
Token ID11
Feature activation+3.60e-01

Pos logit contributions
 restraint+0.179
���+0.177
otaur+0.175
────────+0.174
heed+0.168
Neg logit contributions
ially-0.134
ciation-0.117
ijn-0.111
-0.104
llo-0.103
Token 2010
Token ID3050
Feature activation-4.49e-01

Pos logit contributions
 restraint+0.688
otaur+0.676
���+0.675
────────+0.673
heed+0.647
Neg logit contributions
ially-0.510
ciation-0.442
ijn-0.420
-0.392
igious-0.390
Token
Token ID198
Feature activation+3.86e-01

Pos logit contributions
ially+0.719
ciation+0.629
ijn+0.598
llo+0.574
+0.568
Neg logit contributions
���-0.799
 restraint-0.779
otaur-0.766
────────-0.751
 swe-0.723
Token
Token ID198
Feature activation+3.08e-01

Pos logit contributions
 restraint+0.624
otaur+0.606
────────+0.605
���+0.592
 swe+0.578
Neg logit contributions
ially-0.660
ciation-0.591
ijn-0.566
-0.537
igious-0.534
TokenSubject
Token ID19776
Feature activation-3.17e+00

Pos logit contributions
���+0.532
 restraint+0.521
otaur+0.513
────────+0.496
 swe+0.492
Neg logit contributions
ially-0.510
ciation-0.439
ijn-0.417
-0.412
igious-0.409
Token:
Token ID25
Feature activation-7.88e-01

Pos logit contributions
ially+3.033
ciation+2.378
ijn+2.147
llo+2.027
ication+1.972
Neg logit contributions
���-7.998
 restraint-7.401
otaur-7.371
────────-7.096
heed-7.031
Token boys
Token ID6510
Feature activation+1.25e+00

Pos logit contributions
ially+1.334
ciation+1.178
ijn+1.125
llo+1.070
+1.070
Neg logit contributions
 restraint-1.267
���-1.264
otaur-1.262
────────-1.242
heed-1.188
Token beware
Token ID40600
Feature activation+8.72e-02

Pos logit contributions
 restraint+2.076
otaur+2.020
���+2.009
────────+2.005
 swe+1.927
Neg logit contributions
ially-2.101
ciation-1.872
ijn-1.790
-1.696
igious-1.685
Token This
Token ID770
Feature activation-2.42e-01

Pos logit contributions
 restraint+0.151
���+0.149
otaur+0.148
────────+0.147
heed+0.141
Neg logit contributions
ially-0.139
ciation-0.122
ijn-0.116
-0.110
igious-0.110
Token film
Token ID2646
Feature activation-3.01e-01

Pos logit contributions
ially+0.399
ciation+0.353
ijn+0.337
+0.320
llo+0.319
Neg logit contributions
 restraint-0.401
otaur-0.398
���-0.398
────────-0.391
heed-0.375
Token Mill
Token ID9212
Feature activation-1.41e+00

Pos logit contributions
ially+0.102
ciation+0.090
ijn+0.087
igious+0.082
+0.082
Neg logit contributions
 restraint-0.093
otaur-0.091
���-0.091
────────-0.091
heed-0.086
Tokeniy
Token ID7745
Feature activation-1.53e-01

Pos logit contributions
ially+1.919
ciation+1.649
ijn+1.584
igious+1.468
ication+1.455
Neg logit contributions
 restraint-2.802
���-2.733
────────-2.727
otaur-2.710
 swe-2.608
Tokenet
Token ID316
Feature activation-1.28e-01

Pos logit contributions
ially+0.203
ciation+0.174
ijn+0.162
llo+0.153
igious+0.152
Neg logit contributions
 restraint-0.308
���-0.303
────────-0.301
otaur-0.299
 swe-0.291
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
ially+0.194
ciation+0.169
ijn+0.162
igious+0.151
llo+0.151
Neg logit contributions
 restraint-0.232
���-0.227
otaur-0.227
────────-0.226
 swe-0.217
Token a
Token ID257
Feature activation-7.69e-02

Pos logit contributions
ially+0.401
ciation+0.355
ijn+0.339
+0.322
ication+0.320
Neg logit contributions
 restraint-0.375
otaur-0.369
���-0.367
────────-0.366
heed-0.350
Token smugg
Token ID19176
Feature activation-3.15e+00

Pos logit contributions
ially+0.154
ciation+0.138
ijn+0.134
igious+0.127
llo+0.127
Neg logit contributions
 restraint-0.102
otaur-0.100
���-0.100
────────-0.099
heed-0.094
Tokenler
Token ID1754
Feature activation-5.27e-01

Pos logit contributions
ially+4.737
ciation+4.045
ijn+3.923
ication+3.706
llo+3.706
Neg logit contributions
���-5.906
 restraint-5.778
────────-5.650
otaur-5.555
 swe-5.319
Token named
Token ID3706
Feature activation-1.74e-01

Pos logit contributions
ially+0.841
ciation+0.731
ijn+0.699
llo+0.661
+0.655
Neg logit contributions
 restraint-0.916
���-0.910
otaur-0.901
────────-0.895
heed-0.857
Token H
Token ID367
Feature activation+9.84e-01

Pos logit contributions
ially+0.299
ciation+0.266
ijn+0.255
+0.241
llo+0.240
Neg logit contributions
 restraint-0.282
otaur-0.274
���-0.274
────────-0.273
heed-0.260
Tokenid
Token ID312
Feature activation-3.04e-01

Pos logit contributions
 restraint+1.739
otaur+1.700
────────+1.688
���+1.687
heed+1.628
Neg logit contributions
ially-1.528
ciation-1.352
ijn-1.280
-1.217
igious-1.201
Tokenir
Token ID343
Feature activation+6.61e-02

Pos logit contributions
ially+0.519
ciation+0.458
ijn+0.444
llo+0.418
igious+0.418
Neg logit contributions
 restraint-0.500
���-0.484
────────-0.483
otaur-0.481
 swe-0.457
Token.
Token ID13
Feature activation+9.02e-02

Pos logit contributions
ially+1.722
ciation+1.505
ijn+1.422
llo+1.338
+1.338
Neg logit contributions
���-1.920
 restraint-1.914
otaur-1.894
────────-1.869
heed-1.817
Token The
Token ID383
Feature activation-9.20e-02

Pos logit contributions
 restraint+0.143
���+0.140
otaur+0.138
────────+0.136
heed+0.132
Neg logit contributions
ially-0.159
ciation-0.142
ijn-0.136
-0.130
llo-0.129
Token combination
Token ID6087
Feature activation+4.78e-01

Pos logit contributions
ially+0.153
ciation+0.135
ijn+0.129
+0.122
llo+0.122
Neg logit contributions
 restraint-0.153
otaur-0.151
���-0.151
────────-0.150
heed-0.143
Token of
Token ID286
Feature activation-2.27e-01

Pos logit contributions
 restraint+0.836
otaur+0.813
���+0.811
────────+0.811
 swe+0.779
Neg logit contributions
ially-0.755
ciation-0.669
ijn-0.639
-0.601
igious-0.598
Token a
Token ID257
Feature activation-1.72e-01

Pos logit contributions
ially+0.392
ciation+0.345
ijn+0.333
llo+0.315
+0.314
Neg logit contributions
 restraint-0.359
���-0.356
otaur-0.356
────────-0.351
heed-0.338
Token mov
Token ID1409
Feature activation-3.14e+00

Pos logit contributions
ially+0.302
ciation+0.269
ijn+0.258
+0.245
llo+0.245
Neg logit contributions
 restraint-0.268
otaur-0.264
���-0.264
────────-0.262
heed-0.250
Tokenable
Token ID540
Feature activation+1.41e-01

Pos logit contributions
ially+4.205
ciation+3.566
ijn+3.433
igious+3.170
ication+3.143
Neg logit contributions
 restraint-6.199
���-6.181
────────-6.069
otaur-6.045
 swe-5.828
Token command
Token ID3141
Feature activation-5.11e-01

Pos logit contributions
 restraint+0.217
���+0.214
otaur+0.213
────────+0.212
heed+0.201
Neg logit contributions
ially-0.252
ciation-0.225
ijn-0.216
llo-0.204
-0.204
Token center
Token ID3641
Feature activation-1.81e+00

Pos logit contributions
ially+0.871
ciation+0.777
ijn+0.739
llo+0.705
+0.698
Neg logit contributions
���-0.819
 restraint-0.816
otaur-0.808
────────-0.803
heed-0.764
Token with
Token ID351
Feature activation-6.40e-01

Pos logit contributions
ially+2.588
ciation+2.251
ijn+2.127
llo+2.027
ication+1.989
Neg logit contributions
���-3.334
 restraint-3.317
otaur-3.306
────────-3.276
heed-3.159
Token a
Token ID257
Feature activation-2.16e-01

Pos logit contributions
ially+1.119
ciation+0.990
ijn+0.954
llo+0.905
+0.898
Neg logit contributions
 restraint-0.992
otaur-0.990
���-0.989
────────-0.976
heed-0.939
Token guy
Token ID3516
Feature activation+1.21e-01

Pos logit contributions
ially+1.346
ciation+1.183
ijn+1.138
llo+1.093
+1.088
Neg logit contributions
���-1.369
 restraint-1.355
otaur-1.349
────────-1.325
heed-1.283
Token ended
Token ID4444
Feature activation+8.30e-01

Pos logit contributions
 restraint+0.196
���+0.192
otaur+0.191
────────+0.190
heed+0.182
Neg logit contributions
ially-0.207
ciation-0.184
ijn-0.176
-0.168
llo-0.167
Token up
Token ID510
Feature activation-1.10e+00

Pos logit contributions
 restraint+1.470
otaur+1.436
���+1.433
────────+1.427
 swe+1.372
Neg logit contributions
ially-1.294
ciation-1.139
ijn-1.085
-1.024
igious-1.017
Token becoming
Token ID5033
Feature activation+5.40e-01

Pos logit contributions
ially+1.707
ciation+1.491
ijn+1.436
+1.352
llo+1.344
Neg logit contributions
 restraint-1.944
otaur-1.925
���-1.918
────────-1.882
heed-1.831
Token my
Token ID616
Feature activation-6.03e-01

Pos logit contributions
 restraint+0.904
���+0.884
otaur+0.884
────────+0.871
heed+0.840
Neg logit contributions
ially-0.895
ciation-0.790
ijn-0.756
-0.721
llo-0.714
Token focal
Token ID25397
Feature activation-3.12e+00

Pos logit contributions
ially+1.068
ciation+0.939
ijn+0.914
+0.876
llo+0.872
Neg logit contributions
 restraint-0.921
���-0.917
otaur-0.915
────────-0.885
heed-0.864
Token point
Token ID966
Feature activation-1.61e+00

Pos logit contributions
ially+4.124
ciation+3.556
ijn+3.290
+3.263
ication+3.214
Neg logit contributions
���-6.119
 restraint-5.778
otaur-5.686
────────-5.685
videos-5.562
Token for
Token ID329
Feature activation-5.76e-02

Pos logit contributions
ially+2.338
ciation+2.029
ijn+1.919
+1.816
llo+1.800
Neg logit contributions
���-3.025
 restraint-2.982
otaur-2.922
────────-2.906
heed-2.808
Token his
Token ID465
Feature activation-8.32e-02

Pos logit contributions
ially+0.096
ciation+0.084
ijn+0.081
llo+0.077
+0.077
Neg logit contributions
 restraint-0.094
���-0.094
otaur-0.093
────────-0.092
heed-0.089
Token outlandish
Token ID47284
Feature activation+7.37e-01

Pos logit contributions
ially+0.148
ciation+0.131
ijn+0.126
llo+0.121
+0.121
Neg logit contributions
���-0.127
 restraint-0.127
otaur-0.126
────────-0.124
heed-0.119
Token offense
Token ID6907
Feature activation+3.00e-01

Pos logit contributions
 restraint+1.132
���+1.115
otaur+1.108
────────+1.101
heed+1.054
Neg logit contributions
ially-1.322
ciation-1.181
ijn-1.133
-1.079
igious-1.075
Token
Token ID198
Feature activation+6.67e-02

Pos logit contributions
ially+0.022
ciation+0.020
ijn+0.019
+0.018
llo+0.018
Neg logit contributions
 restraint-0.022
otaur-0.021
���-0.021
────────-0.021
heed-0.020
Token
Token ID198
Feature activation+7.00e-02

Pos logit contributions
 restraint+0.111
���+0.109
otaur+0.108
────────+0.107
 swe+0.103
Neg logit contributions
ially-0.111
ciation-0.099
ijn-0.095
-0.090
llo-0.089
Token48
Token ID2780
Feature activation-4.12e-01

Pos logit contributions
 restraint+0.101
���+0.099
otaur+0.097
────────+0.096
 swe+0.092
Neg logit contributions
ially-0.134
ciation-0.120
ijn-0.116
-0.110
igious-0.110
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
ially+0.592
ciation+0.515
ijn+0.488
+0.458
llo+0.457
Neg logit contributions
���-0.785
 restraint-0.782
otaur-0.765
────────-0.753
 swe-0.731
Token Pro
Token ID1041
Feature activation-6.46e-01

Pos logit contributions
ially+0.286
ciation+0.257
ijn+0.247
+0.234
llo+0.234
Neg logit contributions
 restraint-0.249
otaur-0.243
���-0.242
────────-0.239
heed-0.230
Tokenverbs
Token ID46211
Feature activation+2.83e+00

Pos logit contributions
ially+1.009
ciation+0.895
ijn+0.849
+0.800
igious+0.795
Neg logit contributions
 restraint-1.139
otaur-1.119
────────-1.112
���-1.110
 swe-1.064
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
 restraint+4.867
────────+4.678
otaur+4.652
���+4.630
 swe+4.533
Neg logit contributions
ially-4.449
ciation-3.977
ijn-3.778
-3.628
igious-3.568
Token
Token ID198
Feature activation+1.09e-01

Pos logit contributions
 restraint+0.194
otaur+0.189
���+0.188
────────+0.188
 swe+0.180
Neg logit contributions
ially-0.198
ciation-0.177
ijn-0.169
-0.160
igious-0.159
Token49
Token ID2920
Feature activation+2.54e-01

Pos logit contributions
 restraint+0.139
���+0.136
otaur+0.135
────────+0.133
 swe+0.127
Neg logit contributions
ially-0.225
ciation-0.204
ijn-0.197
-0.189
igious-0.189
Token.
Token ID13
Feature activation-8.35e-02

Pos logit contributions
 restraint+0.470
otaur+0.459
���+0.458
────────+0.456
 swe+0.439
Neg logit contributions
ially-0.377
ciation-0.330
ijn-0.314
-0.295
igious-0.292
Token Drunk
Token ID42739
Feature activation-5.56e-01

Pos logit contributions
ially+0.147
ciation+0.133
ijn+0.127
llo+0.121
+0.120
Neg logit contributions
 restraint-0.131
otaur-0.128
���-0.128
────────-0.126
heed-0.122
Token election
Token ID3071
Feature activation-4.47e-01

Pos logit contributions
 restraint+0.177
���+0.174
otaur+0.174
────────+0.173
heed+0.167
Neg logit contributions
ially-0.134
ciation-0.116
ijn-0.110
-0.103
llo-0.102
Token season
Token ID1622
Feature activation+1.21e-02

Pos logit contributions
ially+0.812
ciation+0.714
ijn+0.685
+0.667
llo+0.661
Neg logit contributions
���-0.672
 restraint-0.669
otaur-0.664
────────-0.655
heed-0.630
Token and
Token ID290
Feature activation+2.72e-01

Pos logit contributions
 restraint+0.022
���+0.022
otaur+0.022
────────+0.021
heed+0.021
Neg logit contributions
ially-0.018
ciation-0.016
ijn-0.015
-0.014
llo-0.014
Token still
Token ID991
Feature activation-9.80e-01

Pos logit contributions
 restraint+0.425
���+0.421
otaur+0.420
────────+0.415
heed+0.395
Neg logit contributions
ially-0.480
ciation-0.423
ijn-0.407
llo-0.388
-0.388
Token take
Token ID1011
Feature activation-6.29e-02

Pos logit contributions
ially+1.599
ciation+1.388
ijn+1.331
llo+1.259
+1.258
Neg logit contributions
���-1.651
 restraint-1.644
otaur-1.635
────────-1.610
heed-1.536
Token back
Token ID736
Feature activation+1.92e+00

Pos logit contributions
ially+0.105
ciation+0.092
ijn+0.088
llo+0.084
+0.084
Neg logit contributions
 restraint-0.104
otaur-0.102
���-0.102
────────-0.101
heed-0.097
Token the
Token ID262
Feature activation+2.43e-01

Pos logit contributions
 restraint+3.194
otaur+3.079
────────+3.064
���+3.062
 swe+2.961
Neg logit contributions
ially-3.194
ciation-2.874
ijn-2.743
-2.585
igious-2.562
Token House
Token ID2097
Feature activation+1.30e+00

Pos logit contributions
 restraint+0.398
���+0.394
otaur+0.392
────────+0.388
heed+0.373
Neg logit contributions
ially-0.407
ciation-0.357
ijn-0.343
llo-0.327
-0.327
Token next
Token ID1306
Feature activation+9.76e-02

Pos logit contributions
 restraint+2.375
otaur+2.319
���+2.314
────────+2.306
 swe+2.220
Neg logit contributions
ially-1.961
ciation-1.721
ijn-1.635
-1.538
igious-1.527
Token year
Token ID614
Feature activation+4.87e-02

Pos logit contributions
 restraint+0.132
���+0.129
otaur+0.129
────────+0.128
heed+0.121
Neg logit contributions
ially-0.192
ciation-0.173
ijn-0.166
-0.160
llo-0.160
Token.
Token ID13
Feature activation+3.28e-01

Pos logit contributions
 restraint+0.091
���+0.089
otaur+0.089
────────+0.088
 swe+0.085
Neg logit contributions
ially-0.072
ciation-0.063
ijn-0.059
-0.056
igious-0.055
Token bestowed
Token ID40293
Feature activation+2.20e-01

Pos logit contributions
ially+0.499
ciation+0.443
ijn+0.420
+0.401
llo+0.397
Neg logit contributions
 restraint-0.499
otaur-0.490
���-0.487
────────-0.482
heed-0.463
Token the
Token ID262
Feature activation+5.15e-02

Pos logit contributions
 restraint+0.385
otaur+0.377
���+0.376
────────+0.375
 swe+0.359
Neg logit contributions
ially-0.350
ciation-0.308
ijn-0.293
-0.277
igious-0.276
Token name
Token ID1438
Feature activation-8.76e-01

Pos logit contributions
 restraint+0.101
���+0.099
otaur+0.099
────────+0.099
heed+0.095
Neg logit contributions
ially-0.071
ciation-0.061
ijn-0.058
-0.054
igious-0.054
Token Z
Token ID1168
Feature activation-9.81e-01

Pos logit contributions
ially+1.459
ciation+1.299
ijn+1.230
+1.169
llo+1.167
Neg logit contributions
 restraint-1.453
���-1.430
otaur-1.424
────────-1.422
heed-1.352
Tokenhen
Token ID831
Feature activation+1.15e+00

Pos logit contributions
ially+1.704
ciation+1.511
ijn+1.471
igious+1.375
llo+1.365
Neg logit contributions
 restraint-1.579
���-1.558
────────-1.527
otaur-1.520
 swe-1.456
Tokenjin
Token ID18594
Feature activation+2.54e+00

Pos logit contributions
 restraint+2.054
otaur+2.011
���+1.996
────────+1.991
 swe+1.925
Neg logit contributions
ially-1.759
ciation-1.546
ijn-1.473
-1.392
igious-1.377
Token (
Token ID357
Feature activation+1.71e-01

Pos logit contributions
 restraint+4.722
otaur+4.620
────────+4.567
���+4.553
 swe+4.463
Neg logit contributions
ially-3.614
ciation-3.144
ijn-2.989
-2.809
igious-2.805
TokenTrue
Token ID17821
Feature activation+7.33e-01

Pos logit contributions
 restraint+0.270
���+0.265
otaur+0.263
────────+0.260
 swe+0.249
Neg logit contributions
ially-0.301
ciation-0.271
ijn-0.258
-0.248
igious-0.245
Token-
Token ID12
Feature activation-2.85e-01

Pos logit contributions
 restraint+1.199
���+1.172
otaur+1.169
────────+1.165
 swe+1.111
Neg logit contributions
ially-1.246
ciation-1.109
ijn-1.062
-1.008
igious-1.001
Tokengold
Token ID24267
Feature activation-9.03e-02

Pos logit contributions
ially+0.508
ciation+0.454
ijn+0.436
+0.416
igious+0.414
Neg logit contributions
 restraint-0.442
���-0.436
otaur-0.430
────────-0.427
 swe-0.407
Token)
Token ID8
Feature activation+2.83e-01

Pos logit contributions
ially+0.129
ciation+0.113
ijn+0.107
+0.100
llo+0.100
Neg logit contributions
 restraint-0.170
���-0.168
otaur-0.166
────────-0.166
heed-0.159
Token and
Token ID290
Feature activation+2.07e-01

Pos logit contributions
 restraint+0.073
otaur+0.072
���+0.072
────────+0.071
heed+0.069
Neg logit contributions
ially-0.059
ciation-0.051
ijn-0.049
-0.046
igious-0.046
Token Tesla
Token ID11938
Feature activation+9.98e-01

Pos logit contributions
 restraint+0.292
���+0.286
otaur+0.285
────────+0.282
heed+0.268
Neg logit contributions
ially-0.397
ciation-0.356
ijn-0.344
-0.330
llo-0.329
Token Motors
Token ID19292
Feature activation+9.68e-01

Pos logit contributions
 restraint+1.545
otaur+1.506
���+1.501
────────+1.493
 swe+1.426
Neg logit contributions
ially-1.777
ciation-1.592
ijn-1.527
-1.452
igious-1.445
Token
Token ID447
Feature activation+2.57e-01

Pos logit contributions
 restraint+1.673
otaur+1.630
���+1.625
────────+1.622
 swe+1.559
Neg logit contributions
ially-1.552
ciation-1.373
ijn-1.310
-1.237
igious-1.228
Token
Token ID247
Feature activation+3.04e-01

Pos logit contributions
 restraint+0.306
otaur+0.297
���+0.297
────────+0.292
 swe+0.277
Neg logit contributions
ially-0.551
ciation-0.499
ijn-0.484
-0.466
llo-0.465
Token Model
Token ID9104
Feature activation+2.39e+00

Pos logit contributions
 restraint+0.583
otaur+0.571
���+0.570
────────+0.567
 swe+0.547
Neg logit contributions
ially-0.429
ciation-0.373
ijn-0.353
-0.330
igious-0.328
Token S
Token ID311
Feature activation+1.52e+00

Pos logit contributions
 restraint+3.166
────────+2.991
otaur+2.991
���+2.963
 swe+2.871
Neg logit contributions
ially-4.756
ciation-4.378
ijn-4.183
-4.015
igious-4.000
Token automobile
Token ID27930
Feature activation-3.05e-02

Pos logit contributions
 restraint+2.743
otaur+2.662
────────+2.657
���+2.650
 swe+2.564
Neg logit contributions
ially-2.322
ciation-2.041
ijn-1.949
-1.834
igious-1.821
Token,
Token ID11
Feature activation+4.34e-02

Pos logit contributions
ially+0.045
ciation+0.040
ijn+0.038
+0.035
llo+0.035
Neg logit contributions
 restraint-0.056
���-0.055
otaur-0.055
────────-0.055
heed-0.053
Token everything
Token ID2279
Feature activation+6.06e-02

Pos logit contributions
 restraint+0.071
���+0.071
otaur+0.071
────────+0.070
heed+0.067
Neg logit contributions
ially-0.072
ciation-0.064
ijn-0.061
-0.058
ication-0.057
Token that
Token ID326
Feature activation+2.88e-01

Pos logit contributions
 restraint+0.104
���+0.103
otaur+0.103
────────+0.101
heed+0.099
Neg logit contributions
ially-0.098
ciation-0.085
ijn-0.082
-0.078
llo-0.077
Token goal
Token ID3061
Feature activation-6.44e-01

Pos logit contributions
ially+0.488
ciation+0.434
ijn+0.414
+0.398
llo+0.397
Neg logit contributions
 restraint-0.416
otaur-0.412
���-0.411
────────-0.406
heed-0.388
Token against
Token ID1028
Feature activation-5.79e-02

Pos logit contributions
ially+1.017
ciation+0.897
ijn+0.851
+0.809
igious+0.798
Neg logit contributions
 restraint-1.130
otaur-1.109
���-1.109
────────-1.096
heed-1.058
Token Holland
Token ID16070
Feature activation-9.21e-01

Pos logit contributions
ially+0.100
ciation+0.089
ijn+0.085
+0.081
llo+0.081
Neg logit contributions
 restraint-0.092
otaur-0.092
���-0.091
────────-0.091
heed-0.086
Token declared
Token ID6875
Feature activation-8.02e-01

Pos logit contributions
ially+1.483
ciation+1.307
ijn+1.255
+1.186
igious+1.168
Neg logit contributions
 restraint-1.574
���-1.569
otaur-1.558
────────-1.528
heed-1.474
Token a
Token ID257
Feature activation-3.75e-04

Pos logit contributions
ially+1.284
ciation+1.106
ijn+1.042
+1.005
igious+0.997
Neg logit contributions
���-1.398
otaur-1.393
 restraint-1.391
────────-1.358
heed-1.307
Token Un
Token ID791
Feature activation+1.81e+00

Pos logit contributions
ially+0.001
ciation+0.001
ijn+0.001
+0.001
igious+0.001
Neg logit contributions
 restraint-0.001
otaur-0.001
���-0.001
────────-0.001
heed-0.001
Tokenes
Token ID274
Feature activation-6.86e-01

Pos logit contributions
 restraint+3.222
otaur+3.136
────────+3.130
���+3.103
heed+3.030
Neg logit contributions
ially-2.730
ciation-2.414
ijn-2.310
-2.164
llo-2.155
Tokenco
Token ID1073
Feature activation+1.35e+00

Pos logit contributions
ially+0.779
ciation+0.649
ijn+0.590
igious+0.547
+0.538
Neg logit contributions
 restraint-1.518
���-1.517
otaur-1.503
────────-1.455
 swe-1.435
Token World
Token ID2159
Feature activation+2.51e-01

Pos logit contributions
 restraint+2.371
otaur+2.305
────────+2.300
���+2.295
 swe+2.218
Neg logit contributions
ially-2.104
ciation-1.853
ijn-1.772
-1.661
llo-1.656
Token Heritage
Token ID18518
Feature activation+2.63e-01

Pos logit contributions
 restraint+0.437
otaur+0.427
���+0.425
────────+0.423
heed+0.406
Neg logit contributions
ially-0.400
ciation-0.355
ijn-0.338
-0.320
igious-0.318
Token Site
Token ID14413
Feature activation-6.19e-02

Pos logit contributions
 restraint+0.447
otaur+0.437
���+0.437
────────+0.434
heed+0.416
Neg logit contributions
ially-0.428
ciation-0.379
ijn-0.362
-0.343
igious-0.341
Token claimed
Token ID4752
Feature activation-3.33e-01

Pos logit contributions
 restraint+1.338
���+1.310
otaur+1.310
────────+1.300
heed+1.239
Neg logit contributions
ially-1.458
ciation-1.299
ijn-1.240
-1.180
llo-1.177
Token,
Token ID11
Feature activation-9.27e-02

Pos logit contributions
ially+0.535
ciation+0.464
ijn+0.444
llo+0.420
+0.419
Neg logit contributions
 restraint-0.575
���-0.571
otaur-0.568
────────-0.563
 swe-0.539
Token flat
Token ID6228
Feature activation+7.03e-01

Pos logit contributions
ially+0.146
ciation+0.127
ijn+0.121
+0.114
llo+0.114
Neg logit contributions
 restraint-0.163
���-0.161
otaur-0.160
────────-0.159
heed-0.152
Token out
Token ID503
Feature activation-2.55e-01

Pos logit contributions
 restraint+1.407
otaur+1.377
���+1.374
────────+1.370
 swe+1.324
Neg logit contributions
ially-0.933
ciation-0.804
ijn-0.758
-0.705
igious-0.699
Token,
Token ID11
Feature activation-4.32e-02

Pos logit contributions
ially+0.387
ciation+0.342
ijn+0.325
+0.306
igious+0.305
Neg logit contributions
 restraint-0.461
���-0.448
otaur-0.448
────────-0.447
 swe-0.432
Token that
Token ID326
Feature activation+6.82e-02

Pos logit contributions
ially+0.070
ciation+0.061
ijn+0.059
+0.055
llo+0.055
Neg logit contributions
 restraint-0.074
otaur-0.073
���-0.072
────────-0.072
 swe-0.069
Token Kam
Token ID12670
Feature activation-1.00e+00

Pos logit contributions
 restraint+0.115
���+0.114
otaur+0.114
────────+0.113
 swe+0.108
Neg logit contributions
ially-0.112
ciation-0.098
ijn-0.093
-0.088
llo-0.088
Tokenal
Token ID282
Feature activation+3.99e-01

Pos logit contributions
ially+2.080
ciation+1.872
ijn+1.820
llo+1.741
igious+1.729
Neg logit contributions
���-1.296
 restraint-1.286
otaur-1.231
────────-1.226
 swe-1.156
Token had
Token ID550
Feature activation-3.46e-01

Pos logit contributions
 restraint+0.636
otaur+0.620
���+0.618
────────+0.614
 swe+0.586
Neg logit contributions
ially-0.694
ciation-0.621
ijn-0.593
-0.563
llo-0.560
Token merely
Token ID6974
Feature activation-3.45e-01

Pos logit contributions
ially+0.587
ciation+0.518
ijn+0.493
llo+0.468
+0.466
Neg logit contributions
 restraint-0.567
otaur-0.559
���-0.557
────────-0.550
heed-0.526
Token missed
Token ID6825
Feature activation-6.30e-01

Pos logit contributions
ially+0.581
ciation+0.514
ijn+0.492
llo+0.464
+0.464
Neg logit contributions
 restraint-0.570
otaur-0.558
���-0.556
────────-0.552
heed-0.527
Token."
Token ID526
Feature activation-2.18e-01

Pos logit contributions
 restraint+0.605
���+0.596
otaur+0.592
────────+0.590
heed+0.567
Neg logit contributions
ially-0.485
ciation-0.421
ijn-0.399
-0.375
llo-0.374
Token The
Token ID383
Feature activation-4.49e-02

Pos logit contributions
ially+0.362
ciation+0.323
ijn+0.308
+0.291
llo+0.290
Neg logit contributions
 restraint-0.366
���-0.361
otaur-0.356
────────-0.353
heed-0.340
Token email
Token ID3053
Feature activation-1.18e+00

Pos logit contributions
ially+0.077
ciation+0.068
ijn+0.065
llo+0.062
+0.062
Neg logit contributions
 restraint-0.072
���-0.071
otaur-0.071
────────-0.071
heed-0.068
Token continues
Token ID4477
Feature activation-8.32e-01

Pos logit contributions
ially+2.087
ciation+1.845
ijn+1.749
llo+1.701
+1.660
Neg logit contributions
���-1.838
 restraint-1.803
────────-1.792
otaur-1.777
heed-1.694
Token,
Token ID11
Feature activation-1.74e-01

Pos logit contributions
ially+1.118
ciation+0.961
ijn+0.907
+0.842
llo+0.838
Neg logit contributions
 restraint-1.653
otaur-1.620
���-1.618
────────-1.610
heed-1.556
Token via
Token ID2884
Feature activation+1.58e-01

Pos logit contributions
ially+0.246
ciation+0.214
ijn+0.203
+0.189
igious+0.188
Neg logit contributions
 restraint-0.332
otaur-0.325
���-0.324
────────-0.323
 swe-0.312
Token Mic
Token ID7631
Feature activation-8.15e-01

Pos logit contributions
 restraint+0.255
���+0.249
otaur+0.249
────────+0.248
heed+0.237
Neg logit contributions
ially-0.271
ciation-0.242
ijn-0.231
-0.219
llo-0.218
Token report
Token ID989
Feature activation+1.06e+00

Pos logit contributions
ially+1.061
ciation+0.908
ijn+0.856
+0.792
llo+0.788
Neg logit contributions
 restraint-1.655
���-1.619
otaur-1.617
────────-1.615
heed-1.559
Token,
Token ID11
Feature activation-3.37e-01

Pos logit contributions
 restraint+2.127
otaur+2.081
���+2.070
────────+2.066
 swe+2.000
Neg logit contributions
ially-1.387
ciation-1.194
ijn-1.127
-1.047
igious-1.040
Token "
Token ID366
Feature activation+2.00e-01

Pos logit contributions
ially+0.497
ciation+0.435
ijn+0.413
+0.387
igious+0.384
Neg logit contributions
 restraint-0.625
otaur-0.610
���-0.608
────────-0.607
 swe-0.584
Tokenhe
Token ID258
Feature activation+7.15e-02

Pos logit contributions
 restraint+0.325
���+0.321
otaur+0.317
────────+0.312
 swe+0.302
Neg logit contributions
ially-0.345
ciation-0.307
ijn-0.293
-0.278
igious-0.278
Token third
Token ID2368
Feature activation-2.77e-01

Pos logit contributions
ially+0.288
ciation+0.256
ijn+0.246
llo+0.233
igious+0.232
Neg logit contributions
 restraint-0.247
���-0.243
otaur-0.241
────────-0.241
heed-0.229
Token overall
Token ID4045
Feature activation+3.56e-01

Pos logit contributions
ially+0.453
ciation+0.399
ijn+0.381
+0.358
igious+0.358
Neg logit contributions
 restraint-0.472
otaur-0.461
���-0.460
────────-0.460
 swe-0.439
Token pick
Token ID2298
Feature activation+2.98e-01

Pos logit contributions
 restraint+0.484
otaur+0.469
���+0.469
────────+0.466
 swe+0.442
Neg logit contributions
ially-0.704
ciation-0.636
ijn-0.612
-0.585
igious-0.583
Token in
Token ID287
Feature activation-1.96e-01

Pos logit contributions
 restraint+0.557
otaur+0.544
���+0.542
────────+0.541
 swe+0.521
Neg logit contributions
ially-0.435
ciation-0.380
ijn-0.360
-0.338
igious-0.336
Token the
Token ID262
Feature activation-4.46e-02

Pos logit contributions
ially+0.316
ciation+0.279
ijn+0.268
+0.252
igious+0.250
Neg logit contributions
 restraint-0.334
otaur-0.328
────────-0.327
���-0.326
 swe-0.312
Token 2015
Token ID1853
Feature activation-1.67e-01

Pos logit contributions
ially+0.067
ciation+0.059
ijn+0.056
llo+0.053
+0.053
Neg logit contributions
 restraint-0.081
���-0.080
otaur-0.080
────────-0.079
 swe-0.076
Token NBA
Token ID7403
Feature activation-8.68e-02

Pos logit contributions
ially+0.234
ciation+0.202
ijn+0.192
+0.178
llo+0.178
Neg logit contributions
 restraint-0.322
���-0.316
otaur-0.315
────────-0.313
 swe-0.301
Token draft
Token ID4538
Feature activation+8.78e-01

Pos logit contributions
ially+0.146
ciation+0.129
ijn+0.123
+0.117
igious+0.116
Neg logit contributions
 restraint-0.143
otaur-0.140
���-0.140
────────-0.139
 swe-0.133
Token by
Token ID416
Feature activation-4.93e-01

Pos logit contributions
 restraint+1.532
otaur+1.491
���+1.483
────────+1.483
 swe+1.428
Neg logit contributions
ially-1.388
ciation-1.230
ijn-1.172
-1.104
igious-1.099
Token the
Token ID262
Feature activation-4.19e-02

Pos logit contributions
ially+0.817
ciation+0.720
ijn+0.692
+0.652
igious+0.649
Neg logit contributions
 restraint-0.823
otaur-0.808
────────-0.804
���-0.803
 swe-0.765
Token Philadelphia
Token ID8857
Feature activation+6.03e-02

Pos logit contributions
ially+0.071
ciation+0.062
ijn+0.060
igious+0.057
+0.057
Neg logit contributions
 restraint-0.069
otaur-0.067
���-0.067
────────-0.067
 swe-0.064
Token top
Token ID1353
Feature activation-7.60e-01

Pos logit contributions
 restraint+0.404
���+0.399
otaur+0.396
────────+0.392
heed+0.378
Neg logit contributions
ially-0.433
ciation-0.382
ijn-0.367
-0.348
llo-0.346
Token notch
Token ID29838
Feature activation-1.36e-01

Pos logit contributions
ially+1.204
ciation+1.032
ijn+0.982
igious+0.932
+0.931
Neg logit contributions
 restraint-1.333
���-1.327
otaur-1.312
────────-1.303
heed-1.260
Token tools
Token ID4899
Feature activation+3.23e-01

Pos logit contributions
ially+0.230
ciation+0.202
ijn+0.194
+0.183
llo+0.183
Neg logit contributions
 restraint-0.225
���-0.222
otaur-0.221
────────-0.219
heed-0.210
Token just
Token ID655
Feature activation-5.68e-01

Pos logit contributions
 restraint+0.586
���+0.575
otaur+0.573
────────+0.569
heed+0.546
Neg logit contributions
ially-0.493
ciation-0.433
ijn-0.410
-0.386
igious-0.384
Token for
Token ID329
Feature activation+1.28e-01

Pos logit contributions
ially+0.840
ciation+0.732
ijn+0.702
llo+0.652
+0.650
Neg logit contributions
 restraint-1.050
���-1.040
otaur-1.031
────────-1.020
heed-0.981
Token you
Token ID345
Feature activation+8.27e-01

Pos logit contributions
 restraint+0.240
���+0.236
otaur+0.236
────────+0.235
heed+0.226
Neg logit contributions
ially-0.186
ciation-0.162
ijn-0.154
-0.145
llo-0.143
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
 restraint+1.499
otaur+1.463
���+1.461
────────+1.455
 swe+1.400
Neg logit contributions
ially-1.256
ciation-1.104
ijn-1.049
-0.988
igious-0.981
Token Here
Token ID3423
Feature activation+1.43e-01

Pos logit contributions
 restraint+0.284
���+0.281
otaur+0.275
────────+0.272
heed+0.264
Neg logit contributions
ially-0.280
ciation-0.250
ijn-0.239
-0.227
llo-0.225
Token's
Token ID338
Feature activation+2.75e-01

Pos logit contributions
 restraint+0.261
���+0.256
otaur+0.255
────────+0.254
heed+0.244
Neg logit contributions
ially-0.217
ciation-0.190
ijn-0.181
-0.170
llo-0.168
Token a
Token ID257
Feature activation+4.89e-02

Pos logit contributions
 restraint+0.440
���+0.434
otaur+0.433
────────+0.428
heed+0.410
Neg logit contributions
ially-0.474
ciation-0.422
ijn-0.405
llo-0.383
-0.382
Token quick
Token ID2068
Feature activation+5.81e-01

Pos logit contributions
 restraint+0.076
���+0.075
otaur+0.075
────────+0.074
heed+0.071
Neg logit contributions
ially-0.087
ciation-0.077
ijn-0.074
-0.070
llo-0.070
Token will
Token ID481
Feature activation+2.27e-01

Pos logit contributions
 restraint+1.158
otaur+1.126
���+1.122
────────+1.119
 swe+1.071
Neg logit contributions
ially-1.296
ciation-1.160
ijn-1.112
-1.057
igious-1.050
Token have
Token ID423
Feature activation+2.52e-01

Pos logit contributions
 restraint+0.359
���+0.353
otaur+0.351
────────+0.348
heed+0.332
Neg logit contributions
ially-0.395
ciation-0.352
ijn-0.338
igious-0.319
llo-0.319
Token a
Token ID257
Feature activation+4.43e-02

Pos logit contributions
 restraint+0.401
���+0.396
otaur+0.394
────────+0.391
heed+0.375
Neg logit contributions
ially-0.435
ciation-0.388
ijn-0.371
-0.351
llo-0.350
Token strong
Token ID1913
Feature activation+3.23e-01

Pos logit contributions
 restraint+0.066
���+0.066
otaur+0.065
────────+0.065
heed+0.062
Neg logit contributions
ially-0.080
ciation-0.072
ijn-0.069
-0.066
igious-0.066
Token sensation
Token ID18098
Feature activation+5.35e-01

Pos logit contributions
 restraint+0.488
���+0.485
otaur+0.480
────────+0.476
heed+0.454
Neg logit contributions
ially-0.584
ciation-0.520
ijn-0.504
igious-0.477
-0.475
Token of
Token ID286
Feature activation-2.74e-02

Pos logit contributions
 restraint+0.965
���+0.945
otaur+0.943
────────+0.938
heed+0.902
Neg logit contributions
ially-0.817
ciation-0.717
ijn-0.683
-0.642
igious-0.638
Token what
Token ID644
Feature activation+2.40e-01

Pos logit contributions
ially+0.050
ciation+0.044
ijn+0.043
ication+0.041
llo+0.040
Neg logit contributions
���-0.041
otaur-0.040
────────-0.040
 restraint-0.040
heed-0.038
Token alcohol
Token ID5548
Feature activation-1.33e+00

Pos logit contributions
 restraint+0.427
���+0.423
otaur+0.419
────────+0.418
heed+0.402
Neg logit contributions
ially-0.371
ciation-0.325
ijn-0.308
-0.290
llo-0.288
Token does
Token ID857
Feature activation-5.74e-01

Pos logit contributions
ially+2.254
ciation+2.002
ijn+1.894
ication+1.817
igious+1.795
Neg logit contributions
���-2.159
 restraint-2.130
────────-2.101
otaur-2.092
heed-2.024
Token,
Token ID11
Feature activation-7.64e-02

Pos logit contributions
ially+0.893
ciation+0.786
ijn+0.742
+0.702
igious+0.700
Neg logit contributions
 restraint-1.017
���-1.010
otaur-0.995
────────-0.992
heed-0.951
Token but
Token ID475
Feature activation-1.03e+00

Pos logit contributions
ially+0.115
ciation+0.101
ijn+0.095
ication+0.089
igious+0.089
Neg logit contributions
���-0.139
 restraint-0.138
otaur-0.137
────────-0.136
heed-0.132
Token Las
Token ID10123
Feature activation-1.76e+00

Pos logit contributions
 restraint+0.078
���+0.077
otaur+0.077
────────+0.076
heed+0.073
Neg logit contributions
ially-0.083
ciation-0.073
ijn-0.070
-0.067
llo-0.066
Token Vegas
Token ID9621
Feature activation-4.97e-01

Pos logit contributions
ially+2.933
ciation+2.598
ijn+2.474
llo+2.382
igious+2.341
Neg logit contributions
 restraint-2.898
���-2.879
otaur-2.841
────────-2.824
 swe-2.655
Token.
Token ID13
Feature activation-5.02e-03

Pos logit contributions
ially+0.813
ciation+0.714
ijn+0.683
igious+0.649
llo+0.645
Neg logit contributions
 restraint-0.832
otaur-0.830
���-0.828
────────-0.812
 swe-0.775
Token
Token ID851
Feature activation+4.46e-01

Pos logit contributions
ially+0.007
ciation+0.007
ijn+0.006
llo+0.006
+0.006
Neg logit contributions
 restraint-0.009
���-0.009
────────-0.009
otaur-0.009
 swe-0.009
Token Ren
Token ID7152
Feature activation+3.23e-01

Pos logit contributions
 restraint+0.880
otaur+0.862
���+0.861
────────+0.857
heed+0.826
Neg logit contributions
ially-0.609
ciation-0.524
ijn-0.496
-0.461
igious-0.459
Tokenaud
Token ID3885
Feature activation-4.38e-01

Pos logit contributions
 restraint+0.617
���+0.602
otaur+0.602
────────+0.600
 swe+0.578
Neg logit contributions
ially-0.459
ciation-0.398
ijn-0.379
-0.353
llo-0.351
Token Lav
Token ID21438
Feature activation-1.23e+00

Pos logit contributions
ially+0.777
ciation+0.697
ijn+0.672
+0.636
llo+0.631
Neg logit contributions
 restraint-0.682
���-0.662
otaur-0.661
────────-0.655
 swe-0.629
Tokeno
Token ID78
Feature activation-7.62e-01

Pos logit contributions
ially+1.859
ciation+1.618
ijn+1.580
llo+1.460
igious+1.456
Neg logit contributions
 restraint-2.259
���-2.245
────────-2.181
otaur-2.168
 swe-2.094
Tokenie
Token ID494
Feature activation-1.46e+00

Pos logit contributions
ially+1.042
ciation+0.868
ijn+0.850
ication+0.777
+0.766
Neg logit contributions
 restraint-1.517
otaur-1.506
���-1.490
────────-1.473
 swe-1.422
Token (@
Token ID4275
Feature activation+5.67e-01

Pos logit contributions
ially+2.267
ciation+1.985
ijn+1.935
llo+1.884
+1.803
Neg logit contributions
 restraint-2.553
otaur-2.506
���-2.488
────────-2.450
 swe-2.363
Tokenren
Token ID918
Feature activation+1.99e-01

Pos logit contributions
 restraint+0.766
otaur+0.742
���+0.737
────────+0.737
 swe+0.698
Neg logit contributions
ially-1.120
ciation-1.016
ijn-0.978
-0.937
igious-0.932
Token as
Token ID355
Feature activation-5.87e-01

Pos logit contributions
ially+1.269
ciation+1.094
ijn+1.040
+0.974
llo+0.971
Neg logit contributions
���-1.412
 restraint-1.412
otaur-1.400
────────-1.386
heed-1.336
Token here
Token ID994
Feature activation-4.79e-01

Pos logit contributions
ially+0.998
ciation+0.879
ijn+0.836
+0.790
llo+0.787
Neg logit contributions
 restraint-0.961
otaur-0.951
���-0.944
────────-0.932
heed-0.897
Token,
Token ID11
Feature activation-1.40e-01

Pos logit contributions
ially+0.707
ciation+0.616
ijn+0.584
+0.549
llo+0.545
Neg logit contributions
 restraint-0.889
���-0.874
otaur-0.872
────────-0.864
heed-0.834
Token to
Token ID284
Feature activation-2.99e-01

Pos logit contributions
ially+0.226
ciation+0.197
ijn+0.189
+0.177
llo+0.177
Neg logit contributions
 restraint-0.241
���-0.239
otaur-0.237
────────-0.234
heed-0.226
Token (
Token ID357
Feature activation+1.56e-02

Pos logit contributions
ially+0.455
ciation+0.394
ijn+0.375
+0.353
igious+0.351
Neg logit contributions
 restraint-0.538
otaur-0.532
���-0.531
────────-0.523
heed-0.506
Token1
Token ID16
Feature activation-1.17e+00

Pos logit contributions
 restraint+0.025
���+0.024
otaur+0.024
────────+0.023
 swe+0.023
Neg logit contributions
ially-0.028
ciation-0.025
ijn-0.024
-0.022
igious-0.022
Token
Token ID784
Feature activation+2.76e-01

Pos logit contributions
ially+1.641
ciation+1.411
ijn+1.344
+1.267
llo+1.258
Neg logit contributions
 restraint-2.256
���-2.210
otaur-2.205
────────-2.179
heed-2.112
Token g
Token ID308
Feature activation-7.18e-01

Pos logit contributions
 restraint+0.437
���+0.427
otaur+0.426
────────+0.422
heed+0.405
Neg logit contributions
ially-0.483
ciation-0.430
ijn-0.411
-0.390
llo-0.389
Token)
Token ID8
Feature activation+4.19e-02

Pos logit contributions
ially+1.114
ciation+0.974
ijn+0.927
+0.868
llo+0.868
Neg logit contributions
 restraint-1.282
���-1.260
otaur-1.253
────────-1.238
 swe-1.197
Token
Token ID1906
Feature activation+6.91e-01

Pos logit contributions
 restraint+0.069
���+0.068
otaur+0.067
────────+0.067
 swe+0.064
Neg logit contributions
ially-0.071
ciation-0.062
ijn-0.060
-0.057
igious-0.056
Token1
Token ID16
Feature activation-7.12e-01

Pos logit contributions
 restraint+1.185
otaur+1.155
���+1.155
────────+1.145
 swe+1.102
Neg logit contributions
ially-1.120
ciation-0.991
ijn-0.946
-0.895
igious-0.888
Token of
Token ID286
Feature activation-6.74e-02

Pos logit contributions
ially+1.589
ciation+1.360
ijn+1.325
llo+1.250
+1.249
Neg logit contributions
 restraint-1.784
���-1.768
otaur-1.766
────────-1.730
heed-1.673
Token this
Token ID428
Feature activation-5.51e-01

Pos logit contributions
ially+0.112
ciation+0.099
ijn+0.095
+0.090
llo+0.090
Neg logit contributions
 restraint-0.112
���-0.110
otaur-0.110
────────-0.108
heed-0.104
Token is
Token ID318
Feature activation+1.07e-01

Pos logit contributions
ially+0.955
ciation+0.850
ijn+0.817
+0.784
llo+0.775
Neg logit contributions
���-0.862
 restraint-0.858
otaur-0.854
────────-0.831
heed-0.804
Token a
Token ID257
Feature activation-3.12e-01

Pos logit contributions
 restraint+0.175
otaur+0.173
���+0.172
────────+0.170
heed+0.164
Neg logit contributions
ially-0.181
ciation-0.160
ijn-0.153
-0.145
llo-0.145
Token round
Token ID2835
Feature activation-1.88e+00

Pos logit contributions
ially+0.567
ciation+0.499
ijn+0.480
+0.462
igious+0.458
Neg logit contributions
otaur-0.468
���-0.466
 restraint-0.466
────────-0.453
heed-0.437
Tokenabout
Token ID10755
Feature activation-3.77e-01

Pos logit contributions
ially+2.489
ciation+2.006
ijn+1.933
igious+1.813
llo+1.808
Neg logit contributions
���-3.706
 restraint-3.706
otaur-3.665
────────-3.630
heed-3.535
Token way
Token ID835
Feature activation+3.61e-01

Pos logit contributions
ially+0.640
ciation+0.565
ijn+0.541
+0.514
llo+0.509
Neg logit contributions
 restraint-0.615
otaur-0.605
���-0.604
────────-0.596
heed-0.578
Token of
Token ID286
Feature activation-1.43e-01

Pos logit contributions
 restraint+0.631
otaur+0.614
────────+0.613
���+0.613
 swe+0.590
Neg logit contributions
ially-0.570
ciation-0.502
ijn-0.481
-0.453
igious-0.452
Token saying
Token ID2282
Feature activation-1.55e-01

Pos logit contributions
ially+0.243
ciation+0.214
ijn+0.206
llo+0.194
+0.194
Neg logit contributions
 restraint-0.232
otaur-0.230
���-0.229
────────-0.225
heed-0.217
Token that
Token ID326
Feature activation-1.09e-01

Pos logit contributions
ially+0.247
ciation+0.217
ijn+0.208
+0.196
llo+0.195
Neg logit contributions
 restraint-0.269
���-0.264
otaur-0.263
────────-0.261
heed-0.251
Token the
Token ID262
Feature activation-2.61e-01

Pos logit contributions
ially+0.181
ciation+0.160
ijn+0.153
+0.145
llo+0.144
Neg logit contributions
 restraint-0.182
���-0.180
otaur-0.179
────────-0.177
 swe-0.170
Token to
Token ID284
Feature activation+8.23e-02

Pos logit contributions
 restraint+0.114
���+0.113
otaur+0.112
────────+0.110
heed+0.106
Neg logit contributions
ially-0.097
ciation-0.085
ijn-0.081
-0.076
llo-0.076
Token push
Token ID4574
Feature activation+7.17e-02

Pos logit contributions
 restraint+0.139
���+0.138
otaur+0.138
────────+0.135
heed+0.131
Neg logit contributions
ially-0.133
ciation-0.117
ijn-0.112
-0.108
llo-0.106
Token an
Token ID281
Feature activation+1.20e-01

Pos logit contributions
 restraint+0.117
���+0.116
otaur+0.116
────────+0.115
heed+0.110
Neg logit contributions
ially-0.120
ciation-0.106
ijn-0.102
-0.097
igious-0.096
Token ability
Token ID2694
Feature activation+6.19e-01

Pos logit contributions
 restraint+0.181
otaur+0.179
────────+0.178
���+0.177
heed+0.171
Neg logit contributions
ially-0.217
ciation-0.192
ijn-0.185
igious-0.176
llo-0.175
Token we
Token ID356
Feature activation+2.44e-01

Pos logit contributions
 restraint+1.045
otaur+1.020
���+1.020
────────+1.015
heed+0.972
Neg logit contributions
ially-1.015
ciation-0.900
ijn-0.858
-0.812
igious-0.808
Token don
Token ID836
Feature activation-4.58e-01

Pos logit contributions
 restraint+0.380
otaur+0.370
���+0.370
────────+0.365
heed+0.349
Neg logit contributions
ially-0.433
ciation-0.386
ijn-0.372
llo-0.354
-0.353
Token
Token ID447
Feature activation+1.89e-01

Pos logit contributions
ially+0.868
ciation+0.778
ijn+0.746
llo+0.716
+0.715
Neg logit contributions
���-0.670
 restraint-0.664
otaur-0.644
────────-0.630
heed-0.611
Token
Token ID247
Feature activation+9.62e-02

Pos logit contributions
 restraint+0.221
���+0.213
otaur+0.212
────────+0.210
 swe+0.198
Neg logit contributions
ially-0.410
ciation-0.374
ijn-0.362
-0.349
llo-0.347
Tokent
Token ID83
Feature activation-8.73e-02

Pos logit contributions
 restraint+0.169
���+0.165
otaur+0.165
────────+0.163
 swe+0.158
Neg logit contributions
ially-0.152
ciation-0.134
ijn-0.128
llo-0.120
-0.120
Token often
Token ID1690
Feature activation-1.18e-01

Pos logit contributions
ially+0.149
ciation+0.132
ijn+0.127
llo+0.121
igious+0.120
Neg logit contributions
 restraint-0.141
���-0.140
otaur-0.139
────────-0.136
heed-0.132
Token see
Token ID766
Feature activation+7.49e-01

Pos logit contributions
ially+0.200
ciation+0.177
ijn+0.170
llo+0.161
igious+0.160
Neg logit contributions
 restraint-0.193
���-0.190
otaur-0.189
────────-0.187
heed-0.180
Tokenmake
Token ID15883
Feature activation+4.80e-01

Pos logit contributions
 restraint+1.282
���+1.262
otaur+1.251
────────+1.241
 swe+1.187
Neg logit contributions
ially-1.313
ciation-1.165
ijn-1.113
-1.060
igious-1.052
Token no
Token ID645
Feature activation+1.32e+00

Pos logit contributions
 restraint+0.611
���+0.595
otaur+0.593
────────+0.586
heed+0.551
Neg logit contributions
ially-0.989
ciation-0.897
ijn-0.864
-0.835
llo-0.829
Token mistake
Token ID7457
Feature activation-1.24e-01

Pos logit contributions
 restraint+2.480
���+2.448
otaur+2.435
────────+2.421
heed+2.332
Neg logit contributions
ially-1.897
ciation-1.647
ijn-1.559
-1.462
igious-1.452
Token
Token ID960
Feature activation+4.28e-01

Pos logit contributions
ially+0.177
ciation+0.155
ijn+0.146
+0.137
igious+0.136
Neg logit contributions
 restraint-0.235
otaur-0.230
���-0.229
────────-0.228
heed-0.220
Tokenthese
Token ID27218
Feature activation+3.73e-01

Pos logit contributions
 restraint+0.672
���+0.661
otaur+0.659
────────+0.649
 swe+0.621
Neg logit contributions
ially-0.754
ciation-0.667
ijn-0.639
-0.616
llo-0.608
Token attacks
Token ID3434
Feature activation-3.84e-01

Pos logit contributions
 restraint+0.574
���+0.570
otaur+0.570
────────+0.560
heed+0.537
Neg logit contributions
ially-0.662
ciation-0.585
ijn-0.559
-0.543
llo-0.536
Token are
Token ID389
Feature activation+5.15e-01

Pos logit contributions
ially+0.628
ciation+0.551
ijn+0.521
+0.500
llo+0.499
Neg logit contributions
 restraint-0.647
���-0.646
otaur-0.636
────────-0.635
 swe-0.600
Token against
Token ID1028
Feature activation-2.85e-01

Pos logit contributions
 restraint+0.799
���+0.791
otaur+0.788
────────+0.780
heed+0.745
Neg logit contributions
ially-0.913
ciation-0.810
ijn-0.772
-0.740
llo-0.737
Token all
Token ID477
Feature activation-5.58e-01

Pos logit contributions
ially+0.503
ciation+0.443
ijn+0.423
+0.409
igious+0.403
Neg logit contributions
 restraint-0.442
otaur-0.440
���-0.440
────────-0.434
 swe-0.413
Token of
Token ID286
Feature activation+1.43e-01

Pos logit contributions
ially+0.986
ciation+0.860
ijn+0.820
+0.798
llo+0.791
Neg logit contributions
���-0.866
otaur-0.863
 restraint-0.856
────────-0.844
heed-0.809
Token us
Token ID514
Feature activation+7.53e-01

Pos logit contributions
 restraint+0.255
���+0.252
otaur+0.252
────────+0.249
 swe+0.239
Neg logit contributions
ially-0.219
ciation-0.190
ijn-0.181
-0.173
igious-0.171
Token
Token ID198
Feature activation+3.13e-01

Pos logit contributions
 restraint+0.746
otaur+0.726
���+0.725
────────+0.721
 swe+0.691
Neg logit contributions
ially-0.772
ciation-0.688
ijn-0.658
-0.624
igious-0.620
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
 restraint+0.514
otaur+0.500
────────+0.498
���+0.496
 swe+0.477
Neg logit contributions
ially-0.529
ciation-0.472
ijn-0.451
-0.428
igious-0.425
TokenCollege
Token ID38951
Feature activation-8.13e-01

Pos logit contributions
 restraint+0.351
���+0.343
otaur+0.342
────────+0.337
 swe+0.326
Neg logit contributions
ially-0.401
ciation-0.359
ijn-0.344
-0.328
igious-0.326
Token career
Token ID3451
Feature activation+1.10e+00

Pos logit contributions
ially+1.341
ciation+1.181
ijn+1.127
igious+1.073
+1.070
Neg logit contributions
 restraint-1.349
otaur-1.338
���-1.329
────────-1.315
heed-1.265
Token [
Token ID685
Feature activation+7.80e-01

Pos logit contributions
 restraint+1.796
otaur+1.740
────────+1.734
���+1.733
 swe+1.660
Neg logit contributions
ially-1.872
ciation-1.668
ijn-1.599
-1.518
igious-1.506
Token edit
Token ID4370
Feature activation-2.60e+00

Pos logit contributions
 restraint+1.341
���+1.311
otaur+1.310
────────+1.300
 swe+1.250
Neg logit contributions
ially-1.256
ciation-1.111
ijn-1.061
-1.002
igious-0.996
Token ]
Token ID2361
Feature activation-1.72e+00

Pos logit contributions
ially+2.648
ciation+2.080
ijn+2.019
llo+1.858
+1.814
Neg logit contributions
���-6.187
 restraint-5.887
otaur-5.847
────────-5.672
heed-5.646
Token
Token ID198
Feature activation+2.41e-01

Pos logit contributions
ially+2.663
ciation+2.322
ijn+2.226
llo+2.110
+2.090
Neg logit contributions
���-3.203
 restraint-3.078
otaur-3.020
────────-2.931
heed-2.865
Token
Token ID198
Feature activation+2.08e-01

Pos logit contributions
 restraint+0.394
otaur+0.383
────────+0.382
���+0.379
 swe+0.365
Neg logit contributions
ially-0.408
ciation-0.364
ijn-0.348
-0.330
igious-0.329
TokenO
Token ID46
Feature activation-1.36e+00

Pos logit contributions
 restraint+0.319
���+0.311
otaur+0.310
────────+0.305
 swe+0.295
Neg logit contributions
ially-0.374
ciation-0.336
ijn-0.322
-0.307
igious-0.305
Tokenka
Token ID4914
Feature activation+1.48e+00

Pos logit contributions
ially+2.226
ciation+1.956
ijn+1.871
llo+1.791
igious+1.773
Neg logit contributions
 restraint-2.286
���-2.259
────────-2.234
otaur-2.233
 swe-2.134
Token²
Token ID31185
Feature activation-5.37e-01

Pos logit contributions
ially+0.512
ciation+0.440
ijn+0.422
llo+0.389
+0.386
Neg logit contributions
 restraint-0.719
���-0.717
otaur-0.702
────────-0.695
heed-0.673
Token)
Token ID8
Feature activation+1.18e-01

Pos logit contributions
ially+0.753
ciation+0.653
ijn+0.619
+0.577
igious+0.573
Neg logit contributions
 restraint-1.034
���-1.014
otaur-1.013
────────-1.008
 swe-0.971
Token and
Token ID290
Feature activation-1.18e-01

Pos logit contributions
 restraint+0.212
otaur+0.208
���+0.208
────────+0.207
 swe+0.199
Neg logit contributions
ially-0.181
ciation-0.159
ijn-0.151
-0.142
igious-0.141
Token obesity
Token ID13825
Feature activation-4.59e-02

Pos logit contributions
ially+0.206
ciation+0.182
ijn+0.175
+0.165
igious+0.165
Neg logit contributions
���-0.184
 restraint-0.184
otaur-0.184
────────-0.181
heed-0.173
Token (
Token ID357
Feature activation-9.06e-02

Pos logit contributions
ially+0.068
ciation+0.060
ijn+0.057
+0.053
igious+0.053
Neg logit contributions
 restraint-0.084
otaur-0.082
���-0.082
────────-0.082
 swe-0.079
Token
Token ID35705
Feature activation-2.53e+00

Pos logit contributions
ially+0.170
ciation+0.151
ijn+0.147
+0.140
igious+0.139
Neg logit contributions
 restraint-0.132
���-0.131
otaur-0.129
────────-0.127
 swe-0.122
Token
Token ID98
Feature activation-9.90e-01

Pos logit contributions
ially+4.206
ciation+3.669
ijn+3.574
llo+3.402
+3.396
Neg logit contributions
 restraint-4.043
otaur-3.972
────────-3.931
���-3.848
heed-3.779
Token30
Token ID1270
Feature activation-1.15e+00

Pos logit contributions
ially+1.720
ciation+1.499
ijn+1.466
+1.388
igious+1.384
Neg logit contributions
���-1.571
 restraint-1.553
otaur-1.552
────────-1.497
 swe-1.459
Token kg
Token ID14211
Feature activation-6.63e-01

Pos logit contributions
ially+1.774
ciation+1.545
ijn+1.490
+1.401
igious+1.391
Neg logit contributions
 restraint-2.025
���-2.017
otaur-2.000
────────-1.974
heed-1.910
Token/
Token ID14
Feature activation+3.87e-02

Pos logit contributions
ially+0.850
ciation+0.718
ijn+0.684
+0.630
igious+0.623
Neg logit contributions
 restraint-1.355
���-1.340
otaur-1.332
────────-1.324
heed-1.279
Tokenm
Token ID76
Feature activation-3.41e-01

Pos logit contributions
���+0.057
 restraint+0.057
otaur+0.055
────────+0.054
 swe+0.052
Neg logit contributions
ially-0.073
ciation-0.065
ijn-0.063
llo-0.059
-0.059
Token.
Token ID13
Feature activation-2.14e-02

Pos logit contributions
ially+0.298
ciation+0.263
ijn+0.252
llo+0.241
+0.238
Neg logit contributions
 restraint-0.288
���-0.288
otaur-0.282
────────-0.282
 swe-0.268
Token There
Token ID1318
Feature activation-4.62e-02

Pos logit contributions
ially+0.037
ciation+0.032
ijn+0.031
+0.030
llo+0.029
Neg logit contributions
 restraint-0.035
���-0.035
otaur-0.034
────────-0.034
 swe-0.033
Token they
Token ID484
Feature activation+2.18e-01

Pos logit contributions
ially+0.068
ciation+0.059
ijn+0.056
+0.053
llo+0.053
Neg logit contributions
 restraint-0.086
otaur-0.084
���-0.084
────────-0.083
 swe-0.080
Token unexpectedly
Token ID25884
Feature activation-2.33e-01

Pos logit contributions
 restraint+0.365
���+0.359
otaur+0.357
────────+0.356
 swe+0.339
Neg logit contributions
ially-0.359
ciation-0.317
ijn-0.303
-0.286
llo-0.286
Token meet
Token ID1826
Feature activation-1.16e+00

Pos logit contributions
ially+0.393
ciation+0.347
ijn+0.332
+0.314
llo+0.314
Neg logit contributions
 restraint-0.382
���-0.375
otaur-0.373
────────-0.371
 swe-0.353
Token Gand
Token ID17727
Feature activation-2.52e+00

Pos logit contributions
ially+1.949
ciation+1.690
ijn+1.635
+1.557
igious+1.556
Neg logit contributions
 restraint-1.883
���-1.877
otaur-1.858
────────-1.839
heed-1.757
Tokenalf
Token ID1604
Feature activation+1.26e+00

Pos logit contributions
ially+5.611
ciation+4.948
ijn+4.892
llo+4.754
ication+4.679
Neg logit contributions
���-2.991
 restraint-2.840
────────-2.766
otaur-2.689
 swe-2.504
Token.
Token ID13
Feature activation+1.56e-01

Pos logit contributions
 restraint+2.270
otaur+2.212
────────+2.201
���+2.194
 swe+2.122
Neg logit contributions
ially-1.919
ciation-1.697
ijn-1.611
-1.514
igious-1.510
Token Gand
Token ID17727
Feature activation-2.14e+00

Pos logit contributions
 restraint+0.261
���+0.255
otaur+0.253
────────+0.250
 swe+0.241
Neg logit contributions
ially-0.261
ciation-0.231
ijn-0.221
-0.210
llo-0.210
Tokenalf
Token ID1604
Feature activation+1.26e+00

Pos logit contributions
ially+4.765
ciation+4.193
ijn+4.121
llo+4.028
+3.957
Neg logit contributions
���-2.594
 restraint-2.411
otaur-2.304
────────-2.302
 swe-2.121
Token explains
Token ID6688
Feature activation-5.60e-02

Pos logit contributions
 restraint+1.978
otaur+1.918
���+1.907
────────+1.907
 swe+1.832
Neg logit contributions
ially-2.227
ciation-2.000
ijn-1.918
-1.824
igious-1.816
TokenAg
Token ID10262
Feature activation-1.05e+00

Pos logit contributions
ially+0.316
ciation+0.284
ijn+0.271
+0.262
llo+0.260
Neg logit contributions
���-0.280
 restraint-0.280
otaur-0.271
────────-0.262
 swe-0.258
Token2
Token ID17
Feature activation-1.21e+00

Pos logit contributions
ially+1.822
ciation+1.578
ijn+1.522
llo+1.488
+1.440
Neg logit contributions
���-1.721
 restraint-1.667
────────-1.651
otaur-1.603
 swe-1.555
Tokenr
Token ID81
Feature activation-5.13e-01

Pos logit contributions
ially+1.567
ciation+1.351
ijn+1.310
llo+1.207
igious+1.188
Neg logit contributions
���-2.440
 restraint-2.434
otaur-2.393
────────-2.375
 swe-2.285
Token-
Token ID12
Feature activation-5.21e-01

Pos logit contributions
ially+0.646
ciation+0.571
ijn+0.547
llo+0.499
igious+0.499
Neg logit contributions
 restraint-1.048
���-1.040
────────-1.029
otaur-1.028
 swe-0.971
TokenLa
Token ID14772
Feature activation-8.05e-01

Pos logit contributions
ially+0.815
ciation+0.727
ijn+0.681
+0.645
igious+0.641
Neg logit contributions
 restraint-0.922
���-0.915
otaur-0.901
────────-0.897
 swe-0.852
Token Mond
Token ID27328
Feature activation-2.57e+00

Pos logit contributions
ially+1.099
ciation+0.951
ijn+0.898
+0.838
igious+0.831
Neg logit contributions
 restraint-1.582
otaur-1.547
���-1.543
────────-1.539
 swe-1.486
Tokenial
Token ID498
Feature activation-4.51e-01

Pos logit contributions
ially+1.463
ciation+0.487
ication+0.382
llo+0.340
ijn+0.311
Neg logit contributions
���-7.730
────────-7.394
 restraint-7.298
 swe-7.171
otaur-7.149
Tokene
Token ID68
Feature activation-8.36e-01

Pos logit contributions
ially+0.497
ciation+0.408
ijn+0.383
llo+0.347
+0.345
Neg logit contributions
 restraint-1.005
otaur-0.989
���-0.987
────────-0.986
 swe-0.952
Token)
Token ID8
Feature activation+1.19e-01

Pos logit contributions
ially+1.083
ciation+0.934
ijn+0.887
llo+0.821
+0.804
Neg logit contributions
 restraint-1.695
otaur-1.667
���-1.661
────────-1.659
heed-1.588
Token s
Token ID264
Feature activation+2.80e-01

Pos logit contributions
 restraint+0.186
���+0.180
otaur+0.180
────────+0.180
 swe+0.173
Neg logit contributions
ially-0.210
ciation-0.188
ijn-0.179
-0.171
igious-0.171
Token.
Token ID13
Feature activation-3.01e-01

Pos logit contributions
 restraint+0.510
otaur+0.498
────────+0.497
���+0.489
 swe+0.477
Neg logit contributions
ially-0.421
ciation-0.370
ijn-0.351
-0.331
igious-0.329
Tokenat
Token ID265
Feature activation+1.28e-01

Pos logit contributions
 restraint+0.883
otaur+0.861
���+0.852
────────+0.851
heed+0.813
Neg logit contributions
ially-1.172
ciation-1.058
ijn-1.018
-0.976
igious-0.966
Tokenoder
Token ID12342
Feature activation-9.96e-01

Pos logit contributions
 restraint+0.198
���+0.196
────────+0.193
otaur+0.193
heed+0.183
Neg logit contributions
ially-0.229
ciation-0.203
ijn-0.197
igious-0.184
llo-0.183
Tokenma
Token ID2611
Feature activation-3.51e-01

Pos logit contributions
ially+1.593
ciation+1.408
ijn+1.304
igious+1.282
ication+1.245
Neg logit contributions
 restraint-1.720
────────-1.705
���-1.705
otaur-1.697
heed-1.622
Token with
Token ID351
Feature activation-1.18e+00

Pos logit contributions
ially+0.526
ciation+0.460
ijn+0.436
+0.409
llo+0.406
Neg logit contributions
 restraint-0.642
���-0.630
otaur-0.630
────────-0.627
heed-0.602
Token variable
Token ID7885
Feature activation-1.60e+00

Pos logit contributions
ially+2.072
ciation+1.826
ijn+1.738
llo+1.643
igious+1.630
Neg logit contributions
 restraint-1.835
otaur-1.825
���-1.821
────────-1.804
heed-1.735
Token penet
Token ID11958
Feature activation-2.59e+00

Pos logit contributions
ially+2.853
ciation+2.484
ijn+2.363
ication+2.230
igious+2.217
Neg logit contributions
���-2.500
otaur-2.460
 restraint-2.454
────────-2.428
heed-2.329
Tokenrance
Token ID8132
Feature activation-8.50e-01

Pos logit contributions
ially+3.713
ciation+3.297
ijn+3.013
igious+2.921
ication+2.866
Neg logit contributions
���-4.867
────────-4.811
 restraint-4.808
otaur-4.663
 swe-4.529
Token,
Token ID11
Feature activation-6.74e-01

Pos logit contributions
ially+1.272
ciation+1.104
ijn+1.041
+0.976
llo+0.974
Neg logit contributions
 restraint-1.550
���-1.547
otaur-1.531
────────-1.524
heed-1.466
Token characterized
Token ID16264
Feature activation+5.01e-01

Pos logit contributions
ially+1.083
ciation+0.946
ijn+0.898
ication+0.843
llo+0.838
Neg logit contributions
 restraint-1.157
otaur-1.148
���-1.147
────────-1.138
heed-1.097
Token clinically
Token ID36688
Feature activation-1.82e+00

Pos logit contributions
 restraint+0.872
otaur+0.852
���+0.850
────────+0.847
heed+0.812
Neg logit contributions
ially-0.797
ciation-0.703
ijn-0.670
-0.632
igious-0.629
Token by
Token ID416
Feature activation-1.07e+00

Pos logit contributions
ially+2.710
ciation+2.303
ijn+2.199
+2.051
igious+2.051
Neg logit contributions
���-3.338
otaur-3.328
 restraint-3.324
────────-3.283
heed-3.129