TOP ACTIVATIONS
MAX = 1.417

Minister Man ish S is od ia , and asked how
With some notable exceptions , public reaction was swift
weekend , Amazon quietly rolled out a series of changes to
san collapsed ?」 " Yes . At the moment
is it ?」 " I think it 's probably
can be done by switching off the monitor and leaving the
straw poll and huge turn outs at town meetings in New
. There is no " same - sex " and "
The film seemed every bit as sensational istic as its
Feb re has now rec anted , saying club managers coerced
, Eve Radio is rolling out a brand new schedule system
a " clear and convincing evidence " standard in assessing sexual
reader , but occasionally , Dungeons and Dragons players do not
Ol we us , D . ( 1978 ). Agg ression
D uck Go , B aid u and Y and ex
lee , W . C . ( 1934 ). The social
the shoulder and drew his attention to somebody running towards the
the central city of B aid oa . Earlier
. Pl ant ain Fl our Pl ant
all ad bin Su w aid was killed when a women

MIN ACTIVATIONS
MIN = -1.374

identification and mitigation ( See Figure 1 for OS I Attack
re upt ake inhibitor ( SS RI ), significantly shortened imm
the gains . Table 2 : Six Sigma Ph
Sh river Award for Dist inguished Human itarian Service
, eliminating size from the orphan race calculation . We already
non - governmental research institute based in London , wrote an
and he let it be known to some reporters that he
, motor ized wheelchair ( Figure 2 A , [ Action
Big Talk Productions had [ arr anged ] for distribution in
many other things . See | aut oc omm and |
by Facebook employees in Palo Alto , Calif ., and in
, synd icated col um ... Just saw this
the town Buy Photo Pokemon Go players make their
Winnipeg Jets (@ N HL J ets ) October 21
Gr um man . ( About 100 passengers injured in the
was designed as shown in Figure 1 . The maze was
there also exists self - righteous ness , racial superiority ,
+ Natural leader , Bon v ive ur , Char ism
- a ... <|endoftext|> Buy Photo F SU target Marvin Wilson
limited supply on the arch ipel ago , is likely to

INTERVAL 0.719 - 1.417
CONTAINS 0.392%

called her son who lives next door . Police say the
only big city in the nation where no one who needs
D poly gon arcade racing games from 1988 to 1992 and
Also , the way " hom osexual s " are described
. p one . 00 96 275 . g 001 Pre

INTERVAL 0.021 - 0.719
CONTAINS 45.733%

more time , I would ve made it a
turtles ." How much debris does it take to
help the masses . Based on this logic and my energy
looks . It s that
in the sax ophon ist 's elabor ations of a more

INTERVAL -0.677 - 0.021
CONTAINS 53.381%

urt said that he never liked literal things . He liked
that the investigator was detained in an upscale district of Sk
of proposed rule making would do away with the 20 -
! Here are some amazing real life photographs of a 5
7 { ( 2 s )} 53 . e 6 {

INTERVAL -1.374 - -0.677
CONTAINS 0.494%

a kick out of the surprise burning of Atlanta in Gone
the movie . if you live in Louisville be safe !
a boil . Once its boiling , reduce the heat and
and grid cell , as shown in Figure 3 .
/ album / cry stal - emb ry o <|endoftext|> Over
Token Minister
Token ID4139
Feature activation-4.67e-02

Pos logit contributions
 FN+0.750
 polic+0.738
 operative+0.699
 reconciliation+0.694
rost+0.685
Neg logit contributions
ł-1.483
cyl-1.396
peat-1.395
mes-1.393
enment-1.365
Token Man
Token ID1869
Feature activation+1.96e-01

Pos logit contributions
 FN+0.104
 polic+0.100
 reconciliation+0.099
rost+0.099
 operative+0.098
Neg logit contributions
ł-0.149
mes-0.139
cyl-0.139
peat-0.137
enment-0.136
Tokenish
Token ID680
Feature activation+4.21e-01

Pos logit contributions
ł+0.596
cyl+0.556
mes+0.555
enment+0.547
peat+0.542
Neg logit contributions
 FN-0.448
 polic-0.442
rost-0.438
 reconciliation-0.435
 Roche-0.425
Token S
Token ID311
Feature activation+1.52e-01

Pos logit contributions
ł+1.529
mes+1.448
cyl+1.430
peat+1.409
enment+1.404
Neg logit contributions
 FN-0.733
 polic-0.712
 reconciliation-0.710
rost-0.703
 corrid-0.694
Tokenis
Token ID271
Feature activation+7.98e-01

Pos logit contributions
ł+0.409
mes+0.379
cyl+0.376
peat+0.367
enment+0.364
Neg logit contributions
 FN-0.407
 reconciliation-0.399
rost-0.398
 polic-0.398
 operative-0.389
Tokenod
Token ID375
Feature activation+1.42e+00

Pos logit contributions
ł+1.892
mes+1.737
cyl+1.704
peat+1.665
enment+1.652
Neg logit contributions
 FN-2.396
 polic-2.354
 reconciliation-2.349
rost-2.339
 corrid-2.316
Tokenia
Token ID544
Feature activation+6.01e-01

Pos logit contributions
ł+3.128
mes+2.888
cyl+2.747
emo+2.710
peat+2.663
Neg logit contributions
 corrid-4.500
 FN-4.499
 reconciliation-4.435
rost-4.394
 polic-4.388
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
ł+1.681
mes+1.565
cyl+1.544
peat+1.513
enment+1.504
Neg logit contributions
 FN-1.553
 polic-1.522
 reconciliation-1.518
rost-1.509
 corrid-1.487
Token and
Token ID290
Feature activation+1.21e-01

Pos logit contributions
ł+0.418
mes+0.394
cyl+0.390
peat+0.384
enment+0.382
Neg logit contributions
 FN-0.242
 polic-0.236
 reconciliation-0.236
rost-0.233
 operative-0.228
Token asked
Token ID1965
Feature activation+3.52e-01

Pos logit contributions
ł+0.379
mes+0.355
cyl+0.351
peat+0.345
enment+0.343
Neg logit contributions
 FN-0.273
 reconciliation-0.267
 polic-0.267
rost-0.264
 operative-0.259
Token how
Token ID703
Feature activation-7.76e-03

Pos logit contributions
ł+1.086
mes+1.016
cyl+1.005
peat+0.987
enment+0.983
Neg logit contributions
 FN-0.812
 polic-0.796
 reconciliation-0.795
rost-0.786
 operative-0.772
Token
Token ID198
Feature activation-1.48e-01

Pos logit contributions
 FN+0.202
 polic+0.194
 reconciliation+0.194
rost+0.193
 Roche+0.188
Neg logit contributions
ł-0.268
mes-0.250
cyl-0.250
peat-0.247
emo-0.244
Token
Token ID198
Feature activation-3.76e-02

Pos logit contributions
 FN+0.326
 polic+0.317
 reconciliation+0.316
rost+0.316
 operative+0.308
Neg logit contributions
ł-0.467
mes-0.437
cyl-0.432
peat-0.425
enment-0.424
TokenWith
Token ID3152
Feature activation+1.56e-01

Pos logit contributions
 FN+0.096
rost+0.094
 reconciliation+0.093
 polic+0.092
 operative+0.091
Neg logit contributions
ł-0.105
mes-0.097
cyl-0.097
enment-0.096
peat-0.096
Token some
Token ID617
Feature activation+4.79e-01

Pos logit contributions
ł+0.490
mes+0.458
cyl+0.454
peat+0.448
enment+0.445
Neg logit contributions
 FN-0.353
 reconciliation-0.347
 polic-0.345
rost-0.342
 operative-0.335
Token notable
Token ID12411
Feature activation+1.49e-01

Pos logit contributions
ł+1.543
mes+1.450
cyl+1.431
peat+1.407
enment+1.399
Neg logit contributions
 FN-1.033
 polic-1.008
 reconciliation-1.004
rost-0.999
 corrid-0.986
Token exceptions
Token ID13269
Feature activation+1.39e+00

Pos logit contributions
ł+0.520
mes+0.490
cyl+0.484
peat+0.477
enment+0.475
Neg logit contributions
 FN-0.281
 polic-0.273
 reconciliation-0.273
rost-0.270
 corrid-0.266
Token,
Token ID11
Feature activation-1.72e-01

Pos logit contributions
ł+3.840
mes+3.586
cyl+3.481
peat+3.393
emo+3.385
Neg logit contributions
 FN-3.594
 reconciliation-3.528
 corrid-3.516
 polic-3.504
rost-3.501
Token public
Token ID1171
Feature activation-1.54e-02

Pos logit contributions
 FN+0.375
 reconciliation+0.368
 polic+0.367
rost+0.362
 operative+0.355
Neg logit contributions
ł-0.553
mes-0.516
cyl-0.514
peat-0.507
enment-0.504
Token reaction
Token ID6317
Feature activation-3.97e-01

Pos logit contributions
 FN+0.034
 reconciliation+0.034
 polic+0.034
rost+0.033
 operative+0.033
Neg logit contributions
ł-0.048
mes-0.045
cyl-0.045
peat-0.044
enment-0.044
Token was
Token ID373
Feature activation-1.00e-01

Pos logit contributions
 FN+1.073
 polic+1.068
rost+1.056
 reconciliation+1.053
 therapy+1.038
Neg logit contributions
ł-1.046
mes-0.976
cyl-0.965
peat-0.957
enment-0.941
Token swift
Token ID14622
Feature activation-1.72e-01

Pos logit contributions
 FN+0.214
 polic+0.212
 reconciliation+0.210
rost+0.209
 operative+0.205
Neg logit contributions
ł-0.324
mes-0.306
cyl-0.301
peat-0.298
enment-0.297
Token weekend
Token ID5041
Feature activation-2.47e-01

Pos logit contributions
 FN+0.376
 polic+0.369
 reconciliation+0.368
rost+0.363
 operative+0.356
Neg logit contributions
ł-0.481
mes-0.446
cyl-0.446
peat-0.436
enment-0.435
Token,
Token ID11
Feature activation+2.68e-01

Pos logit contributions
 FN+0.577
 reconciliation+0.561
 polic+0.559
rost+0.545
 operative+0.543
Neg logit contributions
ł-0.753
mes-0.708
cyl-0.704
peat-0.696
enment-0.687
Token Amazon
Token ID6186
Feature activation-2.03e-01

Pos logit contributions
ł+0.845
mes+0.792
cyl+0.784
peat+0.773
enment+0.768
Neg logit contributions
 FN-0.606
 polic-0.588
 reconciliation-0.588
rost-0.580
 operative-0.571
Token quietly
Token ID12703
Feature activation-3.15e-01

Pos logit contributions
 FN+0.502
 polic+0.492
 reconciliation+0.491
 broker+0.488
rost+0.483
Neg logit contributions
ł-0.583
mes-0.543
cyl-0.542
enment-0.535
peat-0.533
Token rolled
Token ID11686
Feature activation+6.77e-01

Pos logit contributions
 FN+0.805
 polic+0.805
 reconciliation+0.790
 operative+0.777
 broker+0.776
Neg logit contributions
ł-0.884
mes-0.823
cyl-0.819
enment-0.805
peat-0.804
Token out
Token ID503
Feature activation+1.38e+00

Pos logit contributions
ł+2.270
mes+2.138
cyl+2.114
peat+2.076
enment+2.072
Neg logit contributions
 FN-1.376
 reconciliation-1.340
 polic-1.337
rost-1.325
 corrid-1.303
Token a
Token ID257
Feature activation+3.45e-01

Pos logit contributions
ł+3.945
mes+3.693
cyl+3.620
peat+3.543
enment+3.526
Neg logit contributions
 FN-3.421
 polic-3.354
rost-3.332
 reconciliation-3.328
 corrid-3.313
Token series
Token ID2168
Feature activation+4.96e-01

Pos logit contributions
ł+1.111
mes+1.036
cyl+1.029
peat+1.012
enment+1.006
Neg logit contributions
 FN-0.754
 reconciliation-0.739
 polic-0.739
rost-0.724
 corrid-0.712
Token of
Token ID286
Feature activation+2.87e-01

Pos logit contributions
ł+1.047
mes+0.954
cyl+0.931
peat+0.911
emo+0.893
Neg logit contributions
 FN-1.610
 corrid-1.594
 polic-1.584
 reconciliation-1.581
rost-1.577
Token changes
Token ID2458
Feature activation+5.31e-01

Pos logit contributions
ł+0.904
mes+0.835
cyl+0.834
peat+0.820
enment+0.815
Neg logit contributions
 FN-0.651
 reconciliation-0.641
 polic-0.639
rost-0.626
 operative-0.619
Token to
Token ID284
Feature activation+4.77e-01

Pos logit contributions
ł+1.355
mes+1.250
cyl+1.232
peat+1.202
enment+1.199
Neg logit contributions
 FN-1.501
 polic-1.477
 reconciliation-1.472
rost-1.462
 corrid-1.444
Tokensan
Token ID12807
Feature activation+8.96e-01

Pos logit contributions
ł+0.171
mes+0.155
cyl+0.153
peat+0.150
enment+0.148
Neg logit contributions
 FN-0.229
 reconciliation-0.225
rost-0.225
 polic-0.225
 corrid-0.220
Token collapsed
Token ID14707
Feature activation+2.85e-01

Pos logit contributions
ł+2.447
mes+2.272
cyl+2.234
peat+2.189
enment+2.174
Neg logit contributions
 FN-2.369
 polic-2.318
 reconciliation-2.313
rost-2.302
 corrid-2.289
Token?」
Token ID42943
Feature activation-2.32e-01

Pos logit contributions
ł+0.827
mes+0.774
cyl+0.763
peat+0.747
enment+0.747
Neg logit contributions
 FN-0.702
 polic-0.689
 reconciliation-0.688
rost-0.684
 operative-0.672
Token
Token ID198
Feature activation+4.41e-01

Pos logit contributions
 FN+0.491
 reconciliation+0.475
 polic+0.470
rost+0.468
 Roche+0.462
Neg logit contributions
ł-0.757
cyl-0.709
mes-0.707
peat-0.702
emo-0.688
Token
Token ID198
Feature activation+4.60e-01

Pos logit contributions
ł+1.311
mes+1.228
cyl+1.200
peat+1.176
enment+1.168
Neg logit contributions
 corrid-1.105
ailability-1.059
 polic-1.056
 FN-1.055
 reconciliation-1.044
Token"
Token ID1
Feature activation+1.34e+00

Pos logit contributions
ł+1.271
mes+1.181
cyl+1.163
peat+1.143
enment+1.134
Neg logit contributions
 FN-1.200
 reconciliation-1.173
 polic-1.173
rost-1.169
 corrid-1.149
TokenYes
Token ID5297
Feature activation+1.13e+00

Pos logit contributions
ł+3.425
mes+3.199
cyl+3.099
emo+3.026
peat+3.015
Neg logit contributions
 polic-3.731
 corrid-3.730
 FN-3.722
 reconciliation-3.673
ailability-3.643
Token.
Token ID13
Feature activation+9.33e-01

Pos logit contributions
ł+3.017
mes+2.802
cyl+2.752
peat+2.687
emo+2.680
Neg logit contributions
 FN-3.021
 polic-2.964
 reconciliation-2.956
rost-2.939
 corrid-2.928
Token At
Token ID1629
Feature activation+6.76e-01

Pos logit contributions
ł+2.764
mes+2.589
cyl+2.527
enment+2.492
peat+2.472
Neg logit contributions
 FN-2.222
 corrid-2.204
 polic-2.204
 reconciliation-2.193
rost-2.183
Token the
Token ID262
Feature activation+3.88e-01

Pos logit contributions
ł+2.044
mes+1.911
cyl+1.885
peat+1.852
enment+1.841
Neg logit contributions
 FN-1.592
 polic-1.557
 reconciliation-1.553
rost-1.544
 corrid-1.525
Token moment
Token ID2589
Feature activation+4.13e-01

Pos logit contributions
ł+1.237
mes+1.159
cyl+1.147
peat+1.128
enment+1.121
Neg logit contributions
 FN-0.855
 polic-0.835
 reconciliation-0.833
rost-0.827
 corrid-0.813
Token is
Token ID318
Feature activation-2.17e-01

Pos logit contributions
ł+0.262
mes+0.245
cyl+0.241
peat+0.237
emo+0.235
Neg logit contributions
 FN-0.244
 reconciliation-0.241
 polic-0.241
rost-0.239
 therapy-0.234
Token it
Token ID340
Feature activation+4.12e-01

Pos logit contributions
 FN+0.474
 reconciliation+0.468
 polic+0.463
rost+0.463
 therapy+0.451
Neg logit contributions
ł-0.694
mes-0.650
cyl-0.646
peat-0.634
enment-0.627
Token?」
Token ID42943
Feature activation-2.60e-01

Pos logit contributions
ł+0.996
mes+0.916
cyl+0.900
peat+0.879
enment+0.872
Neg logit contributions
 FN-1.221
 polic-1.201
 reconciliation-1.199
rost-1.193
 corrid-1.178
Token
Token ID198
Feature activation+4.83e-01

Pos logit contributions
 FN+0.531
 reconciliation+0.512
rost+0.509
 polic+0.504
 Roche+0.497
Neg logit contributions
ł-0.864
mes-0.814
cyl-0.811
peat-0.803
emo-0.792
Token
Token ID198
Feature activation+4.84e-01

Pos logit contributions
ł+1.436
mes+1.346
cyl+1.317
peat+1.292
enment+1.281
Neg logit contributions
 corrid-1.211
ailability-1.161
 polic-1.157
 FN-1.156
 reconciliation-1.144
Token"
Token ID1
Feature activation+1.29e+00

Pos logit contributions
ł+1.367
mes+1.272
cyl+1.254
peat+1.233
enment+1.223
Neg logit contributions
 FN-1.234
 reconciliation-1.205
 polic-1.203
rost-1.202
 corrid-1.177
TokenI
Token ID40
Feature activation+7.80e-01

Pos logit contributions
ł+3.369
mes+3.139
cyl+3.055
emo+2.978
peat+2.976
Neg logit contributions
 corrid-3.564
 polic-3.548
 FN-3.528
 reconciliation-3.484
ailability-3.466
Token think
Token ID892
Feature activation+6.48e-01

Pos logit contributions
ł+2.331
mes+2.176
cyl+2.131
emo+2.086
peat+2.078
Neg logit contributions
 FN-1.875
 reconciliation-1.835
 corrid-1.831
 polic-1.822
rost-1.809
Token it
Token ID340
Feature activation+4.29e-01

Pos logit contributions
ł+2.066
mes+1.940
cyl+1.914
peat+1.882
enment+1.871
Neg logit contributions
 FN-1.421
 polic-1.387
 reconciliation-1.385
rost-1.376
 corrid-1.355
Token's
Token ID338
Feature activation+1.72e-01

Pos logit contributions
ł+1.164
mes+1.085
cyl+1.071
peat+1.049
enment+1.046
Neg logit contributions
 FN-1.134
 polic-1.119
 reconciliation-1.115
rost-1.111
 operative-1.088
Token probably
Token ID2192
Feature activation+2.97e-01

Pos logit contributions
ł+0.558
mes+0.526
cyl+0.519
peat+0.511
enment+0.509
Neg logit contributions
 FN-0.366
 polic-0.359
 reconciliation-0.359
rost-0.356
 operative-0.348
Token can
Token ID460
Feature activation+9.28e-02

Pos logit contributions
ł+0.224
mes+0.209
cyl+0.207
peat+0.203
enment+0.202
Neg logit contributions
 FN-0.184
 reconciliation-0.181
 polic-0.180
rost-0.178
 operative-0.176
Token be
Token ID307
Feature activation+1.58e-03

Pos logit contributions
ł+0.287
mes+0.269
cyl+0.266
peat+0.262
enment+0.261
Neg logit contributions
 FN-0.212
 polic-0.207
 reconciliation-0.206
rost-0.206
 broker-0.201
Token done
Token ID1760
Feature activation-1.44e-01

Pos logit contributions
ł+0.005
mes+0.005
cyl+0.005
peat+0.004
enment+0.004
Neg logit contributions
 FN-0.004
 polic-0.004
 reconciliation-0.004
rost-0.003
 operative-0.003
Token by
Token ID416
Feature activation+3.98e-02

Pos logit contributions
 FN+0.284
 reconciliation+0.277
 polic+0.276
rost+0.273
 operative+0.268
Neg logit contributions
ł-0.491
mes-0.464
cyl-0.459
enment-0.451
peat-0.450
Token switching
Token ID15430
Feature activation+1.98e-01

Pos logit contributions
ł+0.134
mes+0.126
cyl+0.124
peat+0.123
enment+0.123
Neg logit contributions
 FN-0.081
 reconciliation-0.079
 polic-0.079
rost-0.078
 operative-0.076
Token off
Token ID572
Feature activation+1.29e+00

Pos logit contributions
ł+0.575
mes+0.534
cyl+0.528
peat+0.523
enment+0.519
Neg logit contributions
 FN-0.492
 reconciliation-0.480
 polic-0.479
rost-0.476
 operative-0.471
Token the
Token ID262
Feature activation+1.93e-01

Pos logit contributions
ł+3.822
mes+3.579
cyl+3.519
peat+3.449
enment+3.432
Neg logit contributions
 FN-3.070
 polic-3.017
 reconciliation-2.997
rost-2.985
 corrid-2.959
Token monitor
Token ID5671
Feature activation+4.70e-01

Pos logit contributions
ł+0.605
mes+0.564
cyl+0.559
peat+0.551
enment+0.547
Neg logit contributions
 FN-0.436
 polic-0.425
 reconciliation-0.424
rost-0.421
 corrid-0.414
Token and
Token ID290
Feature activation+1.97e-01

Pos logit contributions
ł+1.395
mes+1.302
cyl+1.283
peat+1.261
enment+1.255
Neg logit contributions
 FN-1.136
 polic-1.110
 reconciliation-1.109
rost-1.101
 corrid-1.085
Token leaving
Token ID4305
Feature activation-5.67e-02

Pos logit contributions
ł+0.642
mes+0.600
cyl+0.595
peat+0.586
enment+0.583
Neg logit contributions
 FN-0.423
 reconciliation-0.411
 polic-0.411
rost-0.409
 corrid-0.400
Token the
Token ID262
Feature activation+1.19e-02

Pos logit contributions
 FN+0.121
 polic+0.117
 reconciliation+0.117
rost+0.116
 operative+0.115
Neg logit contributions
ł-0.185
mes-0.173
cyl-0.172
enment-0.169
peat-0.169
Token straw
Token ID14787
Feature activation+3.53e-01

Pos logit contributions
ł+1.615
mes+1.514
cyl+1.502
peat+1.477
enment+1.465
Neg logit contributions
 FN-1.079
 polic-1.051
 reconciliation-1.049
rost-1.036
 corrid-1.022
Token poll
Token ID3278
Feature activation+5.65e-01

Pos logit contributions
ł+1.159
mes+1.082
cyl+1.079
peat+1.055
enment+1.054
Neg logit contributions
 FN-0.750
 polic-0.741
 reconciliation-0.737
 operative-0.724
rost-0.721
Token and
Token ID290
Feature activation+1.80e-01

Pos logit contributions
ł+1.617
mes+1.505
cyl+1.485
peat+1.457
enment+1.447
Neg logit contributions
 FN-1.428
 polic-1.398
 reconciliation-1.395
rost-1.385
 corrid-1.368
Token huge
Token ID3236
Feature activation+6.83e-01

Pos logit contributions
ł+0.578
mes+0.536
cyl+0.535
peat+0.527
emo+0.521
Neg logit contributions
 FN-0.399
 reconciliation-0.390
 polic-0.386
rost-0.382
 operative-0.377
Token turn
Token ID1210
Feature activation+3.63e-01

Pos logit contributions
ł+2.034
mes+1.901
cyl+1.874
peat+1.840
enment+1.829
Neg logit contributions
 FN-1.637
 polic-1.602
 reconciliation-1.598
rost-1.589
 corrid-1.570
Tokenouts
Token ID5269
Feature activation+1.28e+00

Pos logit contributions
ł+1.031
mes+0.944
peat+0.943
cyl+0.938
enment+0.931
Neg logit contributions
 FN-0.936
 reconciliation-0.918
rost-0.910
 polic-0.904
 operative-0.896
Token at
Token ID379
Feature activation+6.88e-02

Pos logit contributions
ł+3.656
mes+3.452
cyl+3.349
peat+3.277
enment+3.262
Neg logit contributions
 FN-3.140
 corrid-3.082
 polic-3.063
rost-3.060
 reconciliation-3.056
Token town
Token ID3240
Feature activation+6.91e-01

Pos logit contributions
ł+0.217
cyl+0.202
mes+0.201
peat+0.199
enment+0.197
Neg logit contributions
 FN-0.157
 reconciliation-0.153
 polic-0.151
rost-0.149
 corrid-0.147
Token meetings
Token ID8292
Feature activation+7.38e-01

Pos logit contributions
ł+1.673
mes+1.531
cyl+1.505
peat+1.477
enment+1.466
Neg logit contributions
 FN-2.054
 polic-2.016
 reconciliation-2.013
rost-2.000
 corrid-1.985
Token in
Token ID287
Feature activation+1.78e-01

Pos logit contributions
ł+2.170
mes+2.039
cyl+1.997
peat+1.960
enment+1.947
Neg logit contributions
 FN-1.785
 polic-1.741
 reconciliation-1.740
rost-1.732
 corrid-1.727
Token New
Token ID968
Feature activation+1.87e-01

Pos logit contributions
ł+0.559
mes+0.519
cyl+0.518
peat+0.509
enment+0.506
Neg logit contributions
 FN-0.408
 reconciliation-0.396
 polic-0.395
rost-0.389
 operative-0.384
Token.
Token ID13
Feature activation+2.09e-01

Pos logit contributions
 FN+0.210
 polic+0.207
 reconciliation+0.206
rost+0.203
 therapy+0.203
Neg logit contributions
ł-0.217
mes-0.200
cyl-0.199
enment-0.194
peat-0.193
Token There
Token ID1318
Feature activation+2.49e-01

Pos logit contributions
ł+0.636
mes+0.595
cyl+0.588
peat+0.577
enment+0.572
Neg logit contributions
 FN-0.493
 polic-0.479
 reconciliation-0.478
rost-0.475
 corrid-0.466
Token is
Token ID318
Feature activation-2.32e-01

Pos logit contributions
ł+0.687
mes+0.640
cyl+0.631
peat+0.619
enment+0.616
Neg logit contributions
 FN-0.650
 reconciliation-0.636
 polic-0.636
rost-0.631
 operative-0.621
Token no
Token ID645
Feature activation-2.52e-01

Pos logit contributions
 FN+0.489
 reconciliation+0.480
 polic+0.477
rost+0.471
 operative+0.462
Neg logit contributions
ł-0.761
mes-0.714
cyl-0.709
peat-0.697
enment-0.692
Token "
Token ID366
Feature activation+2.00e-01

Pos logit contributions
 FN+0.511
 reconciliation+0.501
 polic+0.496
rost+0.489
 operative+0.483
Neg logit contributions
ł-0.853
mes-0.800
cyl-0.792
peat-0.777
enment-0.774
Tokensame
Token ID31642
Feature activation+1.27e+00

Pos logit contributions
ł+0.649
mes+0.606
cyl+0.599
enment+0.589
peat+0.587
Neg logit contributions
 FN-0.425
 reconciliation-0.416
rost-0.414
 polic-0.412
 operative-0.403
Token-
Token ID12
Feature activation+5.37e-01

Pos logit contributions
ł+3.814
mes+3.611
cyl+3.530
peat+3.478
enment+3.463
Neg logit contributions
 FN-2.942
 polic-2.868
rost-2.863
 reconciliation-2.859
 corrid-2.850
Tokensex
Token ID8044
Feature activation+3.39e-01

Pos logit contributions
ł+1.680
mes+1.555
cyl+1.542
peat+1.519
enment+1.510
Neg logit contributions
 FN-1.210
 polic-1.187
rost-1.185
 reconciliation-1.178
 operative-1.150
Token"
Token ID1
Feature activation+3.22e-01

Pos logit contributions
ł+1.218
mes+1.148
cyl+1.138
peat+1.120
enment+1.113
Neg logit contributions
 FN-0.615
 reconciliation-0.601
 polic-0.597
rost-0.586
 operative-0.578
Token and
Token ID290
Feature activation+6.59e-02

Pos logit contributions
ł+1.072
mes+1.010
cyl+0.997
peat+0.982
enment+0.977
Neg logit contributions
 FN-0.654
 polic-0.639
 reconciliation-0.635
rost-0.634
 corrid-0.626
Token "
Token ID366
Feature activation-7.76e-02

Pos logit contributions
ł+0.212
mes+0.199
cyl+0.197
peat+0.193
enment+0.192
Neg logit contributions
 FN-0.144
 reconciliation-0.142
 polic-0.140
rost-0.139
 operative-0.136
Token
Token ID198
Feature activation-1.87e-01

Pos logit contributions
 FN+0.525
rost+0.507
 reconciliation+0.506
 polic+0.505
 operative+0.491
Neg logit contributions
ł-0.770
mes-0.721
cyl-0.716
peat-0.703
enment-0.701
TokenThe
Token ID464
Feature activation-1.04e-01

Pos logit contributions
 FN+0.472
rost+0.464
 reconciliation+0.458
 polic+0.453
 Roche+0.450
Neg logit contributions
ł-0.526
mes-0.486
cyl-0.483
enment-0.478
peat-0.476
Token film
Token ID2646
Feature activation+3.03e-01

Pos logit contributions
 FN+0.220
 reconciliation+0.215
 polic+0.214
rost+0.211
 operative+0.210
Neg logit contributions
ł-0.342
cyl-0.319
mes-0.316
peat-0.313
enment-0.311
Token seemed
Token ID3947
Feature activation-2.00e-01

Pos logit contributions
ł+0.878
mes+0.818
cyl+0.808
peat+0.792
enment+0.788
Neg logit contributions
 FN-0.750
 polic-0.737
 reconciliation-0.735
rost-0.728
 corrid-0.718
Token every
Token ID790
Feature activation+2.71e-01

Pos logit contributions
 FN+0.499
 polic+0.495
 reconciliation+0.491
rost+0.486
 operative+0.483
Neg logit contributions
ł-0.580
mes-0.537
cyl-0.528
peat-0.521
enment-0.519
Token bit
Token ID1643
Feature activation+1.25e+00

Pos logit contributions
ł+0.763
mes+0.709
cyl+0.699
peat+0.685
enment+0.682
Neg logit contributions
 FN-0.693
 polic-0.679
 reconciliation-0.678
rost-0.673
 corrid-0.665
Token as
Token ID355
Feature activation+1.99e-01

Pos logit contributions
ł+3.519
mes+3.271
cyl+3.222
peat+3.158
enment+3.139
Neg logit contributions
 FN-3.233
 polic-3.170
 reconciliation-3.162
rost-3.144
 corrid-3.108
Token sensational
Token ID31826
Feature activation-1.24e-01

Pos logit contributions
ł+0.666
mes+0.623
cyl+0.617
peat+0.606
enment+0.604
Neg logit contributions
 FN-0.409
 polic-0.401
 reconciliation-0.398
rost-0.394
 corrid-0.389
Tokenistic
Token ID2569
Feature activation-4.22e-02

Pos logit contributions
 FN+0.305
 polic+0.300
 reconciliation+0.297
rost+0.297
 operative+0.292
Neg logit contributions
ł-0.359
mes-0.333
cyl-0.333
peat-0.325
enment-0.323
Token as
Token ID355
Feature activation+4.19e-02

Pos logit contributions
 FN+0.105
 polic+0.104
 reconciliation+0.102
rost+0.102
 operative+0.100
Neg logit contributions
ł-0.122
mes-0.114
cyl-0.113
peat-0.111
enment-0.110
Token its
Token ID663
Feature activation-1.43e-01

Pos logit contributions
ł+0.129
mes+0.121
cyl+0.120
peat+0.118
enment+0.117
Neg logit contributions
 FN-0.097
 polic-0.094
 reconciliation-0.094
rost-0.094
 Roche-0.092
TokenFeb
Token ID15146
Feature activation-2.24e-01

Pos logit contributions
 FN+0.052
 polic+0.051
 corrid+0.051
 reconciliation+0.051
rost+0.050
Neg logit contributions
ł-0.080
mes-0.076
cyl-0.075
peat-0.073
emo-0.073
Tokenre
Token ID260
Feature activation-5.87e-02

Pos logit contributions
 FN+0.608
rost+0.607
 reconciliation+0.593
 polic+0.593
 Roche+0.584
Neg logit contributions
ł-0.585
mes-0.539
cyl-0.533
enment-0.526
peat-0.524
Token has
Token ID468
Feature activation-6.97e-03

Pos logit contributions
 FN+0.149
 polic+0.146
 reconciliation+0.145
rost+0.144
 corrid+0.142
Neg logit contributions
ł-0.168
mes-0.156
cyl-0.154
peat-0.151
enment-0.150
Token now
Token ID783
Feature activation+3.90e-01

Pos logit contributions
 FN+0.017
 polic+0.016
 reconciliation+0.016
rost+0.016
 corrid+0.016
Neg logit contributions
ł-0.021
mes-0.020
cyl-0.019
peat-0.019
enment-0.019
Token rec
Token ID664
Feature activation+1.69e-01

Pos logit contributions
ł+1.172
mes+1.098
cyl+1.080
peat+1.061
enment+1.054
Neg logit contributions
 FN-0.922
 polic-0.899
 reconciliation-0.897
rost-0.895
 corrid-0.887
Tokenanted
Token ID4126
Feature activation+1.25e+00

Pos logit contributions
ł+0.498
mes+0.467
cyl+0.465
peat+0.456
enment+0.453
Neg logit contributions
 FN-0.410
 polic-0.404
rost-0.402
 reconciliation-0.397
 corrid-0.388
Token,
Token ID11
Feature activation+9.64e-02

Pos logit contributions
ł+3.663
mes+3.427
cyl+3.343
peat+3.285
emo+3.262
Neg logit contributions
 FN-3.019
 polic-2.962
 reconciliation-2.955
 corrid-2.931
rost-2.923
Token saying
Token ID2282
Feature activation-1.48e-02

Pos logit contributions
ł+0.329
mes+0.310
cyl+0.306
peat+0.302
enment+0.300
Neg logit contributions
 FN-0.190
 polic-0.185
 reconciliation-0.185
rost-0.183
 corrid-0.180
Token club
Token ID3430
Feature activation+2.63e-02

Pos logit contributions
 FN+0.033
 polic+0.032
 reconciliation+0.032
rost+0.032
 corrid+0.031
Neg logit contributions
ł-0.047
mes-0.044
cyl-0.043
peat-0.043
enment-0.042
Token managers
Token ID11663
Feature activation-3.79e-01

Pos logit contributions
ł+0.089
mes+0.083
cyl+0.083
peat+0.082
enment+0.081
Neg logit contributions
 FN-0.052
 polic-0.051
 reconciliation-0.051
rost-0.051
 operative-0.050
Token coerced
Token ID48474
Feature activation-2.93e-02

Pos logit contributions
 FN+0.843
 polic+0.832
 reconciliation+0.820
rost+0.818
 operative+0.801
Neg logit contributions
ł-1.194
mes-1.115
cyl-1.113
peat-1.095
enment-1.085
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
ł+0.582
mes+0.545
cyl+0.536
peat+0.529
enment+0.525
Neg logit contributions
 FN-0.452
 reconciliation-0.440
 polic-0.440
rost-0.437
 corrid-0.430
Token Eve
Token ID12882
Feature activation-2.75e-01

Pos logit contributions
ł+0.360
mes+0.336
cyl+0.332
peat+0.327
enment+0.325
Neg logit contributions
 FN-0.270
 reconciliation-0.263
 polic-0.263
rost-0.260
 corrid-0.256
Token Radio
Token ID8829
Feature activation-5.68e-01

Pos logit contributions
 FN+0.488
 polic+0.471
 reconciliation+0.465
rost+0.465
 operative+0.455
Neg logit contributions
ł-0.991
mes-0.941
cyl-0.930
peat-0.927
enment-0.921
Token is
Token ID318
Feature activation+2.64e-01

Pos logit contributions
 FN+1.355
 polic+1.315
 reconciliation+1.303
rost+1.293
 operative+1.288
Neg logit contributions
ł-1.689
cyl-1.587
mes-1.585
peat-1.579
enment-1.543
Token rolling
Token ID10708
Feature activation+3.81e-01

Pos logit contributions
ł+0.817
mes+0.765
cyl+0.755
peat+0.744
enment+0.740
Neg logit contributions
 FN-0.605
 polic-0.591
 reconciliation-0.589
rost-0.584
 corrid-0.576
Token out
Token ID503
Feature activation+1.24e+00

Pos logit contributions
ł+1.291
mes+1.219
cyl+1.203
enment+1.184
peat+1.177
Neg logit contributions
 FN-0.766
 reconciliation-0.746
 polic-0.742
rost-0.733
 operative-0.723
Token a
Token ID257
Feature activation+2.33e-01

Pos logit contributions
ł+3.648
mes+3.419
cyl+3.363
peat+3.291
emo+3.268
Neg logit contributions
 FN-2.952
 polic-2.907
rost-2.888
 reconciliation-2.882
 corrid-2.859
Token brand
Token ID4508
Feature activation+2.60e-01

Pos logit contributions
ł+0.756
mes+0.706
cyl+0.700
peat+0.689
enment+0.688
Neg logit contributions
 FN-0.502
 reconciliation-0.491
 polic-0.490
rost-0.482
 corrid-0.476
Token new
Token ID649
Feature activation-2.74e-01

Pos logit contributions
ł+0.854
mes+0.799
cyl+0.789
enment+0.777
peat+0.773
Neg logit contributions
 FN-0.554
 polic-0.545
 reconciliation-0.541
rost-0.530
 operative-0.529
Token schedule
Token ID7269
Feature activation-2.70e-01

Pos logit contributions
 FN+0.580
 reconciliation+0.565
 polic+0.563
rost+0.549
 operative+0.548
Neg logit contributions
ł-0.900
mes-0.836
cyl-0.833
enment-0.825
peat-0.824
Token system
Token ID1080
Feature activation-7.08e-01

Pos logit contributions
 FN+0.645
 polic+0.638
 reconciliation+0.638
rost+0.622
 operative+0.619
Neg logit contributions
ł-0.813
mes-0.750
cyl-0.742
enment-0.736
peat-0.732
Token a
Token ID257
Feature activation+3.35e-01

Pos logit contributions
 FN+0.322
 reconciliation+0.321
 polic+0.315
rost+0.308
 therapy+0.308
Neg logit contributions
ł-0.468
cyl-0.435
mes-0.430
peat-0.427
enment-0.427
Token "
Token ID366
Feature activation-5.01e-03

Pos logit contributions
ł+1.046
mes+0.972
cyl+0.965
peat+0.950
enment+0.947
Neg logit contributions
 FN-0.761
 reconciliation-0.750
 polic-0.746
rost-0.735
 broker-0.723
Tokenclear
Token ID20063
Feature activation+8.45e-02

Pos logit contributions
 FN+0.015
rost+0.014
 reconciliation+0.014
 polic+0.014
 operative+0.014
Neg logit contributions
ł-0.012
mes-0.011
enment-0.011
cyl-0.011
peat-0.011
Token and
Token ID290
Feature activation+2.24e-01

Pos logit contributions
ł+0.267
cyl+0.245
peat+0.244
mes+0.244
enment+0.241
Neg logit contributions
 FN-0.190
 reconciliation-0.189
 polic-0.187
rost-0.183
 broker-0.182
Token convincing
Token ID17101
Feature activation+4.11e-01

Pos logit contributions
ł+0.753
mes+0.709
cyl+0.700
peat+0.689
enment+0.685
Neg logit contributions
 FN-0.453
 polic-0.442
 reconciliation-0.441
rost-0.438
 corrid-0.431
Token evidence
Token ID2370
Feature activation+1.22e+00

Pos logit contributions
ł+1.227
mes+1.142
cyl+1.129
peat+1.111
enment+1.104
Neg logit contributions
 FN-0.990
 polic-0.970
 reconciliation-0.969
rost-0.958
 corrid-0.947
Token"
Token ID1
Feature activation+4.23e-01

Pos logit contributions
ł+3.261
mes+3.041
cyl+2.975
peat+2.914
emo+2.891
Neg logit contributions
 FN-3.252
 polic-3.177
rost-3.169
 corrid-3.152
 reconciliation-3.143
Token standard
Token ID3210
Feature activation+1.38e-01

Pos logit contributions
ł+1.159
mes+1.077
cyl+1.060
peat+1.039
enment+1.032
Neg logit contributions
 FN-1.114
 polic-1.092
 reconciliation-1.089
rost-1.085
 corrid-1.074
Token in
Token ID287
Feature activation+3.47e-01

Pos logit contributions
ł+0.391
mes+0.361
cyl+0.357
peat+0.350
enment+0.350
Neg logit contributions
 FN-0.355
 polic-0.351
 reconciliation-0.348
rost-0.345
 operative-0.341
Token assessing
Token ID24171
Feature activation+8.54e-02

Pos logit contributions
ł+1.120
mes+1.051
cyl+1.038
peat+1.021
enment+1.016
Neg logit contributions
 FN-0.747
 polic-0.730
 reconciliation-0.728
rost-0.722
 corrid-0.712
Token sexual
Token ID3206
Feature activation+1.89e-01

Pos logit contributions
ł+0.266
mes+0.248
cyl+0.246
peat+0.242
enment+0.241
Neg logit contributions
 FN-0.196
 reconciliation-0.192
 polic-0.191
rost-0.189
 operative-0.186
Token reader
Token ID9173
Feature activation+2.82e-01

Pos logit contributions
 FN+0.232
 reconciliation+0.223
 polic+0.222
rost+0.221
 operative+0.220
Neg logit contributions
ł-0.407
mes-0.380
peat-0.377
cyl-0.377
enment-0.372
Token,
Token ID11
Feature activation-9.50e-02

Pos logit contributions
ł+0.803
mes+0.747
cyl+0.740
peat+0.729
enment+0.722
Neg logit contributions
 FN-0.711
 polic-0.695
rost-0.693
 reconciliation-0.693
 operative-0.682
Token but
Token ID475
Feature activation-3.13e-04

Pos logit contributions
 FN+0.212
 reconciliation+0.206
 polic+0.206
rost+0.203
 operative+0.200
Neg logit contributions
ł-0.299
mes-0.281
cyl-0.278
peat-0.276
emo-0.273
Token occasionally
Token ID10491
Feature activation-4.42e-02

Pos logit contributions
 FN+0.001
 reconciliation+0.001
 polic+0.001
rost+0.001
 operative+0.001
Neg logit contributions
ł-0.001
mes-0.001
cyl-0.001
peat-0.001
emo-0.001
Token,
Token ID11
Feature activation-8.81e-03

Pos logit contributions
 FN+0.104
 reconciliation+0.102
 polic+0.100
rost+0.100
 therapy+0.099
Neg logit contributions
ł-0.135
mes-0.126
cyl-0.124
peat-0.124
enment-0.122
Token Dungeons
Token ID25461
Feature activation+1.21e+00

Pos logit contributions
 FN+0.020
 reconciliation+0.020
 polic+0.020
rost+0.020
 operative+0.019
Neg logit contributions
ł-0.027
mes-0.025
cyl-0.025
peat-0.025
emo-0.025
Token and
Token ID290
Feature activation-7.05e-02

Pos logit contributions
ł+3.560
mes+3.330
cyl+3.277
peat+3.186
emo+3.179
Neg logit contributions
 FN-2.922
 polic-2.869
 reconciliation-2.865
rost-2.845
 corrid-2.821
Token Dragons
Token ID19075
Feature activation+5.24e-01

Pos logit contributions
 FN+0.182
 polic+0.180
 reconciliation+0.177
rost+0.176
 corrid+0.175
Neg logit contributions
ł-0.197
peat-0.182
mes-0.182
cyl-0.181
enment-0.177
Token players
Token ID1938
Feature activation+1.67e-01

Pos logit contributions
ł+1.640
mes+1.535
cyl+1.518
peat+1.496
enment+1.487
Neg logit contributions
 FN-1.182
 polic-1.153
 reconciliation-1.150
rost-1.141
 corrid-1.126
Token do
Token ID466
Feature activation-9.32e-02

Pos logit contributions
ł+0.483
mes+0.449
cyl+0.445
peat+0.436
enment+0.433
Neg logit contributions
 FN-0.415
 polic-0.406
 reconciliation-0.405
rost-0.403
 operative-0.395
Token not
Token ID407
Feature activation+4.40e-02

Pos logit contributions
 FN+0.216
 reconciliation+0.210
 polic+0.208
rost+0.206
 therapy+0.204
Neg logit contributions
ł-0.287
cyl-0.268
mes-0.268
peat-0.262
enment-0.262
TokenOl
Token ID30098
Feature activation+2.11e-02

Pos logit contributions
ł+0.214
cyl+0.198
mes+0.197
enment+0.194
peat+0.191
Neg logit contributions
 FN-0.205
rost-0.203
 polic-0.200
 reconciliation-0.199
 Roche-0.196
Tokenwe
Token ID732
Feature activation-1.63e-01

Pos logit contributions
ł+0.049
cyl+0.044
peat+0.044
enment+0.043
mes+0.043
Neg logit contributions
 FN-0.063
rost-0.063
 polic-0.062
 reconciliation-0.061
 Roche-0.061
Tokenus
Token ID385
Feature activation-1.65e-01

Pos logit contributions
 FN+0.394
 polic+0.382
rost+0.381
 reconciliation+0.378
 operative+0.372
Neg logit contributions
ł-0.476
mes-0.449
cyl-0.442
peat-0.437
enment-0.435
Token,
Token ID11
Feature activation+2.17e-01

Pos logit contributions
 FN+0.398
 polic+0.387
rost+0.383
 reconciliation+0.382
 Roche+0.376
Neg logit contributions
ł-0.481
mes-0.453
cyl-0.452
peat-0.449
enment-0.439
Token D
Token ID360
Feature activation+6.88e-01

Pos logit contributions
ł+0.705
mes+0.663
cyl+0.651
peat+0.641
enment+0.639
Neg logit contributions
 FN-0.456
 polic-0.449
 reconciliation-0.448
rost-0.445
 corrid-0.445
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
ł+1.639
mes+1.508
cyl+1.480
peat+1.448
enment+1.437
Neg logit contributions
 FN-2.058
 polic-2.021
 reconciliation-2.018
rost-2.013
 corrid-1.987
Token (
Token ID357
Feature activation+9.61e-01

Pos logit contributions
ł+3.629
mes+3.399
cyl+3.292
enment+3.236
peat+3.235
Neg logit contributions
 corrid-2.845
 polic-2.775
 reconciliation-2.771
ailability-2.755
rost-2.753
Token 1978
Token ID15524
Feature activation+8.46e-01

Pos logit contributions
ł+2.587
mes+2.405
cyl+2.345
peat+2.295
emo+2.282
Neg logit contributions
 corrid-2.540
 FN-2.516
 polic-2.509
 reconciliation-2.502
rost-2.474
Token ).
Token ID6739
Feature activation+5.69e-01

Pos logit contributions
ł+2.327
mes+2.166
cyl+2.129
peat+2.086
enment+2.075
Neg logit contributions
 FN-2.210
 polic-2.166
 reconciliation-2.159
rost-2.153
 corrid-2.132
Token Agg
Token ID19015
Feature activation-5.72e-03

Pos logit contributions
ł+1.730
mes+1.618
cyl+1.601
peat+1.572
enment+1.559
Neg logit contributions
 FN-1.332
 polic-1.300
 reconciliation-1.295
rost-1.286
 corrid-1.265
Tokenression
Token ID2234
Feature activation-1.55e-01

Pos logit contributions
rost+0.013
 FN+0.013
 polic+0.013
 reconciliation+0.013
 therapy+0.013
Neg logit contributions
ł-0.017
mes-0.016
cyl-0.016
peat-0.016
enment-0.016
TokenD
Token ID35
Feature activation+6.16e-02

Pos logit contributions
ł+0.121
cyl+0.109
enment+0.108
mes+0.108
peat+0.105
Neg logit contributions
 FN-0.136
rost-0.133
 polic-0.132
 reconciliation-0.132
 broker-0.131
Tokenuck
Token ID1347
Feature activation-1.01e-01

Pos logit contributions
ł+0.208
mes+0.196
cyl+0.193
peat+0.190
enment+0.189
Neg logit contributions
 FN-0.124
 polic-0.121
 reconciliation-0.120
rost-0.119
 corrid-0.118
TokenGo
Token ID5247
Feature activation-1.16e-01

Pos logit contributions
 FN+0.259
 reconciliation+0.251
 polic+0.249
rost+0.249
 therapy+0.245
Neg logit contributions
ł-0.293
mes-0.265
cyl-0.265
enment-0.262
peat-0.260
Token,
Token ID11
Feature activation-2.99e-01

Pos logit contributions
 FN+0.265
 polic+0.257
 reconciliation+0.256
rost+0.254
 operative+0.252
Neg logit contributions
ł-0.362
mes-0.336
cyl-0.334
peat-0.330
enment-0.325
Token B
Token ID347
Feature activation+8.80e-02

Pos logit contributions
 FN+0.686
 polic+0.664
 reconciliation+0.662
rost+0.657
 broker+0.646
Neg logit contributions
ł-0.935
mes-0.862
cyl-0.858
peat-0.847
emo-0.844
Tokenaid
Token ID1698
Feature activation+1.20e+00

Pos logit contributions
ł+0.256
mes+0.240
cyl+0.238
peat+0.235
enment+0.233
Neg logit contributions
 FN-0.214
rost-0.208
 reconciliation-0.208
 polic-0.208
 corrid-0.202
Tokenu
Token ID84
Feature activation-1.16e-01

Pos logit contributions
ł+3.738
mes+3.526
cyl+3.502
peat+3.436
enment+3.417
Neg logit contributions
 FN-2.707
rost-2.671
 reconciliation-2.602
 polic-2.595
 operative-2.560
Token and
Token ID290
Feature activation-1.87e-01

Pos logit contributions
 FN+0.268
 polic+0.260
 reconciliation+0.258
rost+0.257
 broker+0.254
Neg logit contributions
ł-0.358
mes-0.332
cyl-0.331
peat-0.328
enment-0.321
Token Y
Token ID575
Feature activation+2.46e-01

Pos logit contributions
 FN+0.395
 polic+0.383
 reconciliation+0.381
rost+0.380
 broker+0.372
Neg logit contributions
ł-0.616
mes-0.573
cyl-0.568
peat-0.561
emo-0.560
Tokenand
Token ID392
Feature activation-1.08e-01

Pos logit contributions
ł+0.841
mes+0.792
cyl+0.781
peat+0.771
enment+0.767
Neg logit contributions
 FN-0.481
 polic-0.468
 reconciliation-0.466
rost-0.464
 corrid-0.455
Tokenex
Token ID1069
Feature activation+5.10e-01

Pos logit contributions
 FN+0.361
rost+0.360
 polic+0.351
 reconciliation+0.350
 broker+0.345
Neg logit contributions
ł-0.217
cyl-0.193
mes-0.193
enment-0.189
peat-0.189
Tokenlee
Token ID7197
Feature activation+3.13e-01

Pos logit contributions
ł+1.280
mes+1.167
cyl+1.162
peat+1.136
enment+1.134
Neg logit contributions
 FN-1.404
 polic-1.378
rost-1.376
 reconciliation-1.370
 operative-1.340
Token,
Token ID11
Feature activation+2.79e-01

Pos logit contributions
ł+0.912
mes+0.848
cyl+0.844
peat+0.826
enment+0.821
Neg logit contributions
 FN-0.772
 polic-0.753
rost-0.749
 reconciliation-0.747
 operative-0.730
Token W
Token ID370
Feature activation+5.06e-01

Pos logit contributions
ł+0.886
mes+0.833
cyl+0.812
peat+0.801
enment+0.799
Neg logit contributions
 corrid-0.616
 polic-0.601
 reconciliation-0.600
 FN-0.598
rost-0.598
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
ł+1.211
mes+1.113
cyl+1.092
peat+1.072
enment+1.062
Neg logit contributions
 FN-1.509
 polic-1.481
 reconciliation-1.478
rost-1.477
 corrid-1.454
Token C
Token ID327
Feature activation+5.11e-01

Pos logit contributions
ł+3.255
mes+3.053
cyl+2.950
enment+2.910
peat+2.905
Neg logit contributions
 corrid-2.469
 polic-2.387
 reconciliation-2.386
ailability-2.376
rost-2.371
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
ł+1.217
mes+1.118
cyl+1.097
peat+1.073
enment+1.064
Neg logit contributions
 FN-1.530
 polic-1.503
 reconciliation-1.500
rost-1.495
 corrid-1.478
Token (
Token ID357
Feature activation+9.30e-01

Pos logit contributions
ł+3.136
mes+2.910
cyl+2.796
enment+2.756
peat+2.745
Neg logit contributions
 corrid-3.350
 reconciliation-3.249
 polic-3.246
 FN-3.220
rost-3.200
Token 1934
Token ID29300
Feature activation+6.18e-01

Pos logit contributions
ł+2.420
mes+2.243
cyl+2.180
emo+2.121
peat+2.117
Neg logit contributions
 corrid-2.577
 reconciliation-2.499
 polic-2.498
rost-2.491
 FN-2.488
Token ).
Token ID6739
Feature activation+3.88e-01

Pos logit contributions
ł+1.720
mes+1.601
cyl+1.574
peat+1.543
enment+1.534
Neg logit contributions
 FN-1.598
 polic-1.568
 reconciliation-1.565
rost-1.557
 corrid-1.545
Token The
Token ID383
Feature activation+7.72e-02

Pos logit contributions
ł+1.180
mes+1.097
cyl+1.093
peat+1.075
emo+1.065
Neg logit contributions
 FN-0.917
 polic-0.888
 reconciliation-0.886
rost-0.880
 Roche-0.865
Token social
Token ID1919
Feature activation+4.82e-02

Pos logit contributions
ł+0.247
cyl+0.229
mes+0.228
peat+0.225
enment+0.224
Neg logit contributions
 FN-0.171
 reconciliation-0.168
 polic-0.167
rost-0.164
 operative-0.163
Token the
Token ID262
Feature activation+2.82e-01

Pos logit contributions
ł+0.722
mes+0.678
cyl+0.672
peat+0.657
enment+0.654
Neg logit contributions
 FN-0.542
 reconciliation-0.530
 polic-0.528
rost-0.523
 corrid-0.515
Token shoulder
Token ID8163
Feature activation+9.50e-02

Pos logit contributions
ł+0.984
mes+0.930
cyl+0.916
peat+0.903
enment+0.898
Neg logit contributions
 FN-0.529
 polic-0.516
 reconciliation-0.514
rost-0.511
 corrid-0.505
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
ł+0.270
mes+0.253
cyl+0.249
enment+0.244
peat+0.244
Neg logit contributions
 FN-0.241
 polic-0.236
 reconciliation-0.236
rost-0.233
 operative-0.230
Token drew
Token ID9859
Feature activation+5.12e-01

Pos logit contributions
ł+0.667
mes+0.625
cyl+0.617
peat+0.606
enment+0.603
Neg logit contributions
 FN-0.504
 polic-0.493
 reconciliation-0.491
rost-0.488
 corrid-0.481
Token his
Token ID465
Feature activation+5.01e-01

Pos logit contributions
ł+1.558
mes+1.459
cyl+1.438
peat+1.413
enment+1.408
Neg logit contributions
 FN-1.200
 reconciliation-1.169
 polic-1.169
rost-1.161
 corrid-1.142
Token attention
Token ID3241
Feature activation+1.20e+00

Pos logit contributions
ł+1.678
mes+1.585
cyl+1.562
peat+1.533
enment+1.525
Neg logit contributions
 FN-1.006
 polic-0.982
 reconciliation-0.980
rost-0.977
 corrid-0.970
Token to
Token ID284
Feature activation+4.23e-01

Pos logit contributions
ł+3.056
mes+2.819
cyl+2.778
peat+2.714
enment+2.690
Neg logit contributions
 FN-3.380
 polic-3.314
 reconciliation-3.309
rost-3.294
 corrid-3.281
Token somebody
Token ID8276
Feature activation-7.16e-02

Pos logit contributions
ł+1.304
mes+1.218
cyl+1.210
peat+1.188
enment+1.182
Neg logit contributions
 FN-0.977
 reconciliation-0.953
 polic-0.952
rost-0.944
 operative-0.927
Token running
Token ID2491
Feature activation+2.41e-01

Pos logit contributions
 FN+0.166
 polic+0.163
 reconciliation+0.162
rost+0.160
 operative+0.159
Neg logit contributions
ł-0.220
mes-0.207
cyl-0.203
enment-0.200
peat-0.199
Token towards
Token ID3371
Feature activation+6.49e-01

Pos logit contributions
ł+0.697
mes+0.655
cyl+0.644
enment+0.629
peat+0.629
Neg logit contributions
 FN-0.603
 polic-0.591
 reconciliation-0.590
rost-0.583
 operative-0.575
Token the
Token ID262
Feature activation+1.52e-01

Pos logit contributions
ł+2.148
mes+2.022
cyl+1.996
peat+1.964
enment+1.954
Neg logit contributions
 FN-1.343
 polic-1.309
 reconciliation-1.306
rost-1.297
 corrid-1.278
Token the
Token ID262
Feature activation+3.00e-01

Pos logit contributions
ł+1.075
mes+1.007
cyl+0.995
peat+0.978
enment+0.973
Neg logit contributions
 FN-0.764
 polic-0.746
 reconciliation-0.745
rost-0.738
 corrid-0.726
Token central
Token ID4318
Feature activation-3.11e-02

Pos logit contributions
ł+0.886
mes+0.832
cyl+0.816
peat+0.800
enment+0.795
Neg logit contributions
 FN-0.719
 polic-0.704
rost-0.703
 reconciliation-0.700
 corrid-0.697
Token city
Token ID1748
Feature activation+3.29e-01

Pos logit contributions
 FN+0.068
 polic+0.067
 reconciliation+0.067
rost+0.066
 corrid+0.065
Neg logit contributions
ł-0.100
mes-0.093
cyl-0.092
peat-0.091
enment-0.090
Token of
Token ID286
Feature activation+3.14e-01

Pos logit contributions
ł+0.773
mes+0.702
cyl+0.692
peat+0.679
enment+0.673
Neg logit contributions
 FN-1.006
 polic-0.990
 reconciliation-0.989
rost-0.980
 corrid-0.966
Token B
Token ID347
Feature activation+2.19e-01

Pos logit contributions
ł+1.038
mes+0.975
cyl+0.964
peat+0.949
enment+0.944
Neg logit contributions
 FN-0.655
 polic-0.638
 reconciliation-0.637
rost-0.631
 corrid-0.619
Tokenaid
Token ID1698
Feature activation+1.20e+00

Pos logit contributions
ł+0.651
mes+0.608
cyl+0.600
peat+0.591
enment+0.586
Neg logit contributions
 FN-0.526
 polic-0.515
 reconciliation-0.514
rost-0.512
 corrid-0.501
Tokenoa
Token ID12162
Feature activation-1.44e-01

Pos logit contributions
ł+3.094
mes+2.860
cyl+2.810
peat+2.750
enment+2.732
Neg logit contributions
 FN-3.344
 polic-3.282
 reconciliation-3.274
rost-3.257
 corrid-3.229
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
 FN+0.396
 polic+0.391
 reconciliation+0.389
rost+0.386
 operative+0.381
Neg logit contributions
ł-0.379
mes-0.350
cyl-0.347
peat-0.341
enment-0.337
Token
Token ID198
Feature activation+5.44e-02

Pos logit contributions
 FN+0.034
 polic+0.032
 reconciliation+0.032
rost+0.032
 Roche+0.032
Neg logit contributions
ł-0.050
mes-0.047
cyl-0.047
peat-0.047
emo-0.046
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
ł+0.167
mes+0.156
cyl+0.154
peat+0.151
enment+0.150
Neg logit contributions
 FN-0.125
 corrid-0.125
 polic-0.124
 reconciliation-0.123
rost-0.122
TokenEarlier
Token ID13689
Feature activation-7.89e-02

Pos logit contributions
ł+0.423
mes+0.394
cyl+0.389
peat+0.382
enment+0.381
Neg logit contributions
 FN-0.375
 reconciliation-0.366
rost-0.365
 polic-0.364
 operative-0.356
Token.
Token ID13
Feature activation+7.42e-02

Pos logit contributions
ł+0.964
mes+0.893
cyl+0.879
peat+0.862
enment+0.857
Neg logit contributions
 FN-0.999
 polic-0.980
 reconciliation-0.978
rost-0.974
 corrid-0.960
Token
Token ID198
Feature activation-1.70e-01

Pos logit contributions
ł+0.223
mes+0.208
cyl+0.206
peat+0.203
enment+0.201
Neg logit contributions
 FN-0.177
 polic-0.171
 reconciliation-0.171
rost-0.170
 corrid-0.166
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
 FN+0.363
rost+0.352
 reconciliation+0.346
 polic+0.344
 Roche+0.336
Neg logit contributions
ł-0.551
mes-0.517
cyl-0.513
peat-0.505
enment-0.504
TokenPl
Token ID3646
Feature activation+9.61e-02

Pos logit contributions
 FN+0.044
rost+0.043
 reconciliation+0.042
 polic+0.042
 Roche+0.041
Neg logit contributions
ł-0.045
mes-0.042
cyl-0.041
enment-0.041
peat-0.041
Tokenant
Token ID415
Feature activation-1.73e-01

Pos logit contributions
ł+0.283
cyl+0.261
mes+0.259
enment+0.253
peat+0.252
Neg logit contributions
rost-0.234
 FN-0.231
 reconciliation-0.227
 polic-0.224
 Roche-0.221
Tokenain
Token ID391
Feature activation+1.20e+00

Pos logit contributions
 FN+0.419
rost+0.408
 reconciliation+0.407
 polic+0.404
 therapy+0.397
Neg logit contributions
ł-0.514
mes-0.475
cyl-0.474
enment-0.461
peat-0.460
Token Fl
Token ID1610
Feature activation+4.80e-01

Pos logit contributions
ł+3.498
mes+3.298
cyl+3.244
peat+3.191
enment+3.148
Neg logit contributions
 FN-2.864
 polic-2.828
 corrid-2.806
 reconciliation-2.805
rost-2.765
Tokenour
Token ID454
Feature activation+6.53e-02

Pos logit contributions
ł+1.025
mes+0.922
cyl+0.903
peat+0.889
enment+0.882
Neg logit contributions
 FN-1.554
rost-1.539
 polic-1.524
 reconciliation-1.523
 operative-1.489
Token
Token ID960
Feature activation-6.12e-02

Pos logit contributions
ł+0.201
mes+0.187
cyl+0.185
peat+0.182
emo+0.181
Neg logit contributions
 FN-0.152
rost-0.147
 polic-0.147
 reconciliation-0.147
 operative-0.142
TokenPl
Token ID3646
Feature activation+1.70e-02

Pos logit contributions
 FN+0.165
rost+0.161
 reconciliation+0.160
 polic+0.160
 corrid+0.156
Neg logit contributions
ł-0.164
mes-0.151
cyl-0.149
enment-0.147
peat-0.146
Tokenant
Token ID415
Feature activation-2.12e-01

Pos logit contributions
ł+0.047
cyl+0.043
mes+0.043
enment+0.042
peat+0.041
Neg logit contributions
rost-0.045
 FN-0.044
 polic-0.043
 reconciliation-0.043
 Roche-0.042
Tokenall
Token ID439
Feature activation+9.42e-02

Pos logit contributions
 FN+0.510
rost+0.501
 polic+0.500
 reconciliation+0.499
 operative+0.486
Neg logit contributions
ł-0.516
mes-0.478
cyl-0.476
peat-0.467
enment-0.465
Tokenad
Token ID324
Feature activation+7.33e-01

Pos logit contributions
ł+0.248
mes+0.228
cyl+0.226
peat+0.221
enment+0.221
Neg logit contributions
 FN-0.259
 polic-0.254
 reconciliation-0.253
rost-0.253
 operative-0.247
Token bin
Token ID9874
Feature activation+4.79e-02

Pos logit contributions
ł+2.180
mes+2.037
cyl+2.004
peat+1.968
enment+1.956
Neg logit contributions
 FN-1.760
 polic-1.722
 reconciliation-1.716
rost-1.708
 corrid-1.691
Token Su
Token ID1778
Feature activation+1.21e-01

Pos logit contributions
ł+0.157
mes+0.147
cyl+0.146
peat+0.144
enment+0.143
Neg logit contributions
 FN-0.102
 polic-0.099
 reconciliation-0.098
rost-0.097
 operative-0.095
Tokenw
Token ID86
Feature activation+1.86e-01

Pos logit contributions
ł+0.294
mes+0.271
cyl+0.267
enment+0.265
peat+0.264
Neg logit contributions
 FN-0.349
 polic-0.344
 reconciliation-0.343
rost-0.342
 operative-0.334
Tokenaid
Token ID1698
Feature activation+1.19e+00

Pos logit contributions
ł+0.537
mes+0.501
cyl+0.498
enment+0.488
peat+0.488
Neg logit contributions
 FN-0.456
 polic-0.450
rost-0.446
 reconciliation-0.446
 operative-0.435
Token was
Token ID373
Feature activation+9.48e-02

Pos logit contributions
ł+3.453
mes+3.226
cyl+3.152
peat+3.092
emo+3.074
Neg logit contributions
 FN-2.952
 polic-2.883
 reconciliation-2.875
rost-2.872
 corrid-2.852
Token killed
Token ID2923
Feature activation-3.11e-01

Pos logit contributions
ł+0.297
mes+0.278
cyl+0.276
peat+0.271
enment+0.270
Neg logit contributions
 FN-0.213
 polic-0.209
 reconciliation-0.208
rost-0.206
 operative-0.203
Token when
Token ID618
Feature activation+1.41e-01

Pos logit contributions
 FN+0.718
 polic+0.706
 reconciliation+0.705
rost+0.696
 operative+0.685
Neg logit contributions
ł-0.944
mes-0.887
cyl-0.881
enment-0.864
peat-0.864
Token a
Token ID257
Feature activation-8.08e-02

Pos logit contributions
ł+0.439
mes+0.412
cyl+0.406
peat+0.399
enment+0.397
Neg logit contributions
 FN-0.319
 polic-0.311
 reconciliation-0.310
rost-0.308
 corrid-0.304
Token women
Token ID1466
Feature activation+9.98e-02

Pos logit contributions
 FN+0.160
 polic+0.157
 reconciliation+0.156
rost+0.153
 corrid+0.151
Neg logit contributions
ł-0.276
mes-0.259
cyl-0.256
peat-0.253
enment-0.252
Token identification
Token ID11795
Feature activation-5.98e-03

Pos logit contributions
 reconciliation+0.336
 FN+0.333
 polic+0.328
rost+0.323
 therapy+0.323
Neg logit contributions
ł-0.433
mes-0.401
cyl-0.399
enment-0.393
peat-0.392
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
 FN+0.015
 reconciliation+0.015
 polic+0.015
rost+0.015
 therapy+0.015
Neg logit contributions
ł-0.017
cyl-0.016
mes-0.016
peat-0.015
enment-0.015
Token mitigation
Token ID33585
Feature activation+1.35e-01

Pos logit contributions
ł+0.321
mes+0.301
cyl+0.298
peat+0.293
enment+0.291
Neg logit contributions
 FN-0.228
 reconciliation-0.225
 polic-0.224
rost-0.221
 operative-0.218
Token (
Token ID357
Feature activation+2.35e-01

Pos logit contributions
ł+0.367
cyl+0.338
mes+0.337
peat+0.330
enment+0.327
Neg logit contributions
 FN-0.359
 reconciliation-0.357
 polic-0.356
 operative-0.348
 therapy-0.348
TokenSee
Token ID6214
Feature activation-7.00e-01

Pos logit contributions
ł+0.641
mes+0.593
cyl+0.585
peat+0.573
enment+0.571
Neg logit contributions
 FN-0.623
 polic-0.611
 reconciliation-0.610
rost-0.607
 corrid-0.599
Token Figure
Token ID11291
Feature activation-1.37e+00

Pos logit contributions
 FN+1.580
 reconciliation+1.558
rost+1.533
 polic+1.531
 operative+1.498
Neg logit contributions
ł-2.172
cyl-2.030
mes-1.999
peat-1.995
enment-1.976
Token 1
Token ID352
Feature activation-7.82e-01

Pos logit contributions
 FN+2.870
 reconciliation+2.737
rost+2.696
 polic+2.674
 operative+2.639
Neg logit contributions
ł-4.491
cyl-4.167
peat-4.162
mes-4.141
enment-4.118
Token for
Token ID329
Feature activation-3.18e-01

Pos logit contributions
 FN+1.918
 reconciliation+1.875
 polic+1.842
rost+1.834
 operative+1.809
Neg logit contributions
ł-2.272
mes-2.087
cyl-2.079
peat-2.073
enment-2.057
Token OS
Token ID7294
Feature activation-5.03e-01

Pos logit contributions
 FN+0.759
 reconciliation+0.757
 polic+0.743
 operative+0.732
rost+0.729
Neg logit contributions
ł-0.954
cyl-0.883
mes-0.879
enment-0.872
peat-0.867
TokenI
Token ID40
Feature activation-1.12e-01

Pos logit contributions
 FN+1.352
 reconciliation+1.337
 polic+1.318
 operative+1.315
 corrid+1.309
Neg logit contributions
ł-1.332
cyl-1.226
mes-1.219
enment-1.205
peat-1.203
Token Attack
Token ID8307
Feature activation-9.97e-02

Pos logit contributions
 FN+0.262
 reconciliation+0.259
 polic+0.258
rost+0.253
 operative+0.252
Neg logit contributions
ł-0.343
cyl-0.319
mes-0.316
peat-0.313
enment-0.312
Token re
Token ID302
Feature activation-1.84e-01

Pos logit contributions
 FN+0.315
 reconciliation+0.306
rost+0.305
 polic+0.305
 operative+0.300
Neg logit contributions
ł-0.473
mes-0.443
cyl-0.435
peat-0.428
emo-0.427
Tokenupt
Token ID37623
Feature activation-1.02e+00

Pos logit contributions
 FN+0.644
rost+0.633
 reconciliation+0.632
 polic+0.629
 therapy+0.626
Neg logit contributions
ł-0.352
mes-0.310
cyl-0.298
enment-0.295
peat-0.295
Tokenake
Token ID539
Feature activation+6.35e-02

Pos logit contributions
rost+2.909
 FN+2.797
 polic+2.794
 reconciliation+2.794
resy+2.770
Neg logit contributions
ł-2.579
mes-2.378
cyl-2.303
peat-2.281
enment-2.271
Token inhibitor
Token ID41704
Feature activation-4.15e-01

Pos logit contributions
ł+0.220
mes+0.208
cyl+0.203
peat+0.201
enment+0.200
Neg logit contributions
 FN-0.125
 reconciliation-0.120
rost-0.119
 polic-0.118
 therapy-0.117
Token (
Token ID357
Feature activation+2.42e-01

Pos logit contributions
 FN+1.016
rost+0.985
 polic+0.981
 reconciliation+0.978
 therapy+0.973
Neg logit contributions
ł-1.235
mes-1.148
cyl-1.120
peat-1.113
emo-1.111
TokenSS
Token ID5432
Feature activation-1.37e+00

Pos logit contributions
ł+0.611
mes+0.561
cyl+0.552
peat+0.541
enment+0.539
Neg logit contributions
 FN-0.692
rost-0.679
 polic-0.677
 reconciliation-0.676
 corrid-0.663
TokenRI
Token ID7112
Feature activation-5.21e-01

Pos logit contributions
rost+2.672
 FN+2.643
 Roche+2.593
 operative+2.518
 reconciliation+2.515
Neg logit contributions
ł-4.608
mes-4.470
cyl-4.323
enment-4.310
emo-4.296
Token),
Token ID828
Feature activation-9.88e-03

Pos logit contributions
 FN+1.396
rost+1.364
 therapy+1.356
 reconciliation+1.353
 polic+1.344
Neg logit contributions
ł-1.407
mes-1.313
cyl-1.276
enment-1.269
emo-1.260
Token significantly
Token ID5566
Feature activation-2.06e-01

Pos logit contributions
 FN+0.024
 polic+0.024
 reconciliation+0.024
rost+0.023
 operative+0.023
Neg logit contributions
ł-0.029
mes-0.027
cyl-0.027
peat-0.026
enment-0.026
Token shortened
Token ID34464
Feature activation-9.78e-02

Pos logit contributions
 FN+0.511
 polic+0.500
 reconciliation+0.496
rost+0.495
 operative+0.486
Neg logit contributions
ł-0.604
mes-0.560
cyl-0.549
enment-0.545
peat-0.545
Token imm
Token ID2296
Feature activation+1.69e-01

Pos logit contributions
 FN+0.218
 reconciliation+0.210
rost+0.210
 polic+0.209
 operative+0.206
Neg logit contributions
ł-0.313
mes-0.292
cyl-0.288
peat-0.285
emo-0.283
Token the
Token ID262
Feature activation+8.38e-02

Pos logit contributions
 FN+0.616
 reconciliation+0.610
 polic+0.594
 broker+0.592
 operative+0.591
Neg logit contributions
ł-0.768
mes-0.723
cyl-0.708
enment-0.698
peat-0.698
Token gains
Token ID8810
Feature activation-2.37e-01

Pos logit contributions
ł+0.267
mes+0.250
cyl+0.248
peat+0.243
enment+0.242
Neg logit contributions
 FN-0.185
 reconciliation-0.181
 polic-0.180
rost-0.178
 operative-0.176
Token.
Token ID13
Feature activation+1.09e-01

Pos logit contributions
 FN+0.566
 reconciliation+0.560
 polic+0.555
 operative+0.547
rost+0.541
Neg logit contributions
ł-0.709
mes-0.665
cyl-0.647
enment-0.644
peat-0.639
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
ł+0.327
mes+0.304
cyl+0.302
peat+0.296
enment+0.294
Neg logit contributions
 FN-0.259
 reconciliation-0.252
 polic-0.251
rost-0.249
 operative-0.246
Token
Token ID198
Feature activation-1.01e-02

Pos logit contributions
 FN+0.243
 reconciliation+0.236
rost+0.235
 polic+0.235
 operative+0.229
Neg logit contributions
ł-0.352
mes-0.330
cyl-0.326
peat-0.321
enment-0.320
TokenTable
Token ID10962
Feature activation-1.29e+00

Pos logit contributions
 FN+0.026
 reconciliation+0.026
rost+0.026
 polic+0.025
 operative+0.025
Neg logit contributions
ł-0.028
mes-0.026
cyl-0.026
enment-0.025
peat-0.025
Token 2
Token ID362
Feature activation-2.98e-01

Pos logit contributions
 FN+2.960
 reconciliation+2.871
rost+2.793
 therapy+2.774
 Roche+2.744
Neg logit contributions
ł-3.965
cyl-3.665
mes-3.644
peat-3.626
emo-3.566
Token:
Token ID25
Feature activation-1.33e-01

Pos logit contributions
 FN+0.763
 reconciliation+0.744
 polic+0.730
rost+0.727
 operative+0.720
Neg logit contributions
ł-0.849
mes-0.778
cyl-0.773
peat-0.762
enment-0.758
Token Six
Token ID9699
Feature activation-2.46e-01

Pos logit contributions
 FN+0.327
 reconciliation+0.319
 polic+0.315
rost+0.313
 operative+0.311
Neg logit contributions
ł-0.393
mes-0.364
cyl-0.363
peat-0.355
enment-0.352
Token Sigma
Token ID31669
Feature activation+7.31e-02

Pos logit contributions
 FN+0.639
 reconciliation+0.627
 polic+0.621
 operative+0.617
rost+0.615
Neg logit contributions
ł-0.688
mes-0.622
cyl-0.622
enment-0.615
peat-0.607
Token Ph
Token ID1380
Feature activation-1.15e-01

Pos logit contributions
ł+0.222
mes+0.207
cyl+0.205
peat+0.201
enment+0.201
Neg logit contributions
 FN-0.173
 reconciliation-0.168
 polic-0.168
rost-0.166
 operative-0.165
Token Sh
Token ID911
Feature activation+1.15e-01

Pos logit contributions
ł+0.911
mes+0.855
cyl+0.849
peat+0.833
enment+0.829
Neg logit contributions
 FN-0.637
 polic-0.617
 reconciliation-0.617
rost-0.613
 operative-0.600
Tokenriver
Token ID38291
Feature activation+2.67e-01

Pos logit contributions
ł+0.309
mes+0.287
cyl+0.284
peat+0.278
enment+0.276
Neg logit contributions
 FN-0.306
rost-0.306
 polic-0.300
 reconciliation-0.299
 Roche-0.293
Token Award
Token ID11289
Feature activation-2.72e-01

Pos logit contributions
ł+0.767
mes+0.716
cyl+0.714
enment+0.697
peat+0.695
Neg logit contributions
 FN-0.666
 polic-0.649
 reconciliation-0.649
rost-0.645
 operative-0.633
Token for
Token ID329
Feature activation+1.88e-01

Pos logit contributions
 FN+0.601
 reconciliation+0.583
 polic+0.580
rost+0.577
 operative+0.567
Neg logit contributions
ł-0.859
cyl-0.806
mes-0.804
enment-0.787
peat-0.782
Token Dist
Token ID4307
Feature activation-4.00e-01

Pos logit contributions
ł+0.562
cyl+0.520
mes+0.519
peat+0.511
enment+0.508
Neg logit contributions
 FN-0.454
 reconciliation-0.449
 polic-0.443
rost-0.436
 operative-0.433
Tokeninguished
Token ID46709
Feature activation-1.28e+00

Pos logit contributions
rost+0.458
 FN+0.427
 reconciliation+0.420
 polic+0.407
isting+0.400
Neg logit contributions
ł-1.662
mes-1.590
cyl-1.574
peat-1.565
emo-1.546
Token Human
Token ID5524
Feature activation+4.63e-01

Pos logit contributions
 FN+2.625
 reconciliation+2.592
 polic+2.569
 operative+2.502
 Roche+2.500
Neg logit contributions
ł-4.252
cyl-3.956
peat-3.893
emo-3.881
enment-3.851
Tokenitarian
Token ID8353
Feature activation-3.10e-01

Pos logit contributions
ł+1.210
mes+1.110
cyl+1.102
peat+1.075
enment+1.068
Neg logit contributions
 FN-1.284
 reconciliation-1.257
 polic-1.256
rost-1.248
 operative-1.231
Token Service
Token ID4809
Feature activation-2.31e-01

Pos logit contributions
 FN+0.661
 polic+0.643
 reconciliation+0.642
 operative+0.636
rost+0.629
Neg logit contributions
ł-1.004
cyl-0.944
emo-0.932
peat-0.929
mes-0.917
Token
Token ID198
Feature activation+5.50e-02

Pos logit contributions
 FN+0.503
 reconciliation+0.491
 polic+0.491
 operative+0.487
rost+0.480
Neg logit contributions
ł-0.732
cyl-0.687
mes-0.681
peat-0.676
emo-0.673
Token
Token ID198
Feature activation+2.02e-01

Pos logit contributions
ł+0.170
mes+0.159
cyl+0.157
peat+0.154
enment+0.153
Neg logit contributions
 FN-0.126
 polic-0.124
 corrid-0.123
 reconciliation-0.123
rost-0.122
Token,
Token ID11
Feature activation+1.04e-01

Pos logit contributions
 FN+0.489
 reconciliation+0.486
 polic+0.483
rost+0.478
 operative+0.472
Neg logit contributions
ł-0.505
mes-0.468
cyl-0.463
peat-0.451
enment-0.450
Token eliminating
Token ID18591
Feature activation-2.23e-02

Pos logit contributions
ł+0.332
mes+0.311
cyl+0.308
peat+0.303
enment+0.302
Neg logit contributions
 FN-0.226
 polic-0.221
 reconciliation-0.221
rost-0.218
 operative-0.214
Token size
Token ID2546
Feature activation-2.94e-01

Pos logit contributions
 FN+0.049
 reconciliation+0.049
 polic+0.049
rost+0.048
 corrid+0.047
Neg logit contributions
ł-0.071
mes-0.066
cyl-0.065
enment-0.064
peat-0.064
Token from
Token ID422
Feature activation+5.43e-02

Pos logit contributions
 reconciliation+0.653
 polic+0.649
 FN+0.647
rost+0.635
 broker+0.630
Neg logit contributions
ł-0.925
mes-0.872
cyl-0.861
peat-0.846
enment-0.845
Token the
Token ID262
Feature activation+5.89e-02

Pos logit contributions
ł+0.171
mes+0.159
cyl+0.158
peat+0.155
enment+0.155
Neg logit contributions
 FN-0.122
 reconciliation-0.121
 polic-0.120
rost-0.118
 operative-0.117
Token orphan
Token ID26051
Feature activation-1.27e+00

Pos logit contributions
ł+0.196
mes+0.184
cyl+0.182
peat+0.179
enment+0.178
Neg logit contributions
 FN-0.122
 reconciliation-0.119
 polic-0.119
rost-0.117
 corrid-0.116
Token race
Token ID3234
Feature activation-5.97e-01

Pos logit contributions
 reconciliation+2.657
 polic+2.607
 broker+2.581
 FN+2.578
 rule+2.560
Neg logit contributions
ł-4.116
mes-3.846
cyl-3.827
peat-3.734
enment-3.709
Token calculation
Token ID17952
Feature activation-4.32e-01

Pos logit contributions
 reconciliation+1.532
 polic+1.530
 FN+1.527
 broker+1.497
 rule+1.487
Neg logit contributions
ł-1.660
mes-1.537
cyl-1.510
enment-1.495
peat-1.480
Token.
Token ID13
Feature activation-9.27e-02

Pos logit contributions
 reconciliation+1.111
 FN+1.104
 polic+1.102
 broker+1.073
 operative+1.073
Neg logit contributions
ł-1.218
mes-1.118
cyl-1.118
enment-1.101
peat-1.091
Token We
Token ID775
Feature activation+4.35e-02

Pos logit contributions
 FN+0.216
 polic+0.210
 reconciliation+0.209
rost+0.207
 Roche+0.204
Neg logit contributions
ł-0.284
mes-0.265
cyl-0.263
peat-0.259
enment-0.257
Token already
Token ID1541
Feature activation-1.26e-01

Pos logit contributions
ł+0.128
mes+0.119
cyl+0.118
peat+0.116
enment+0.115
Neg logit contributions
 FN-0.106
 polic-0.104
 reconciliation-0.103
rost-0.103
 operative-0.101
Token non
Token ID1729
Feature activation+1.65e-01

Pos logit contributions
ł+0.376
mes+0.352
cyl+0.350
peat+0.344
emo+0.342
Neg logit contributions
 FN-0.259
 reconciliation-0.252
 polic-0.251
rost-0.247
 broker-0.244
Token-
Token ID12
Feature activation-2.13e-01

Pos logit contributions
ł+0.450
mes+0.418
cyl+0.411
peat+0.403
enment+0.402
Neg logit contributions
 FN-0.438
 polic-0.427
 reconciliation-0.427
rost-0.424
 operative-0.419
Tokengovernmental
Token ID31353
Feature activation-9.22e-01

Pos logit contributions
 FN+0.587
 polic+0.570
rost+0.570
 reconciliation+0.569
 operative+0.564
Neg logit contributions
ł-0.555
mes-0.513
cyl-0.503
enment-0.499
peat-0.497
Token research
Token ID2267
Feature activation-9.83e-02

Pos logit contributions
 FN+1.588
 reconciliation+1.561
 polic+1.532
 operative+1.512
 broker+1.510
Neg logit contributions
ł-3.359
peat-3.191
cyl-3.176
mes-3.141
enment-3.120
Token institute
Token ID24224
Feature activation-5.79e-02

Pos logit contributions
 FN+0.178
 reconciliation+0.175
 polic+0.174
rost+0.170
 operative+0.169
Neg logit contributions
ł-0.352
cyl-0.330
mes-0.330
peat-0.327
enment-0.324
Token based
Token ID1912
Feature activation-1.26e+00

Pos logit contributions
 FN+0.133
 polic+0.130
 reconciliation+0.129
rost+0.128
 operative+0.127
Neg logit contributions
ł-0.179
mes-0.167
cyl-0.166
peat-0.164
emo-0.162
Token in
Token ID287
Feature activation+2.22e-01

Pos logit contributions
 FN+2.593
 polic+2.536
rost+2.508
 reconciliation+2.487
 operative+2.467
Neg logit contributions
ł-4.035
cyl-3.819
mes-3.807
peat-3.776
enment-3.739
Token London
Token ID3576
Feature activation-7.27e-02

Pos logit contributions
ł+0.686
mes+0.642
cyl+0.639
peat+0.631
emo+0.623
Neg logit contributions
 FN-0.511
 polic-0.496
 reconciliation-0.494
rost-0.488
 operative-0.480
Token,
Token ID11
Feature activation+8.79e-02

Pos logit contributions
 FN+0.162
 polic+0.160
 reconciliation+0.157
rost+0.155
 operative+0.154
Neg logit contributions
ł-0.230
mes-0.214
cyl-0.213
peat-0.210
emo-0.209
Token wrote
Token ID2630
Feature activation+2.00e-01

Pos logit contributions
ł+0.276
mes+0.259
cyl+0.257
peat+0.253
emo+0.251
Neg logit contributions
 FN-0.198
 polic-0.193
 reconciliation-0.192
rost-0.190
 operative-0.188
Token an
Token ID281
Feature activation-2.64e-01

Pos logit contributions
ł+0.606
mes+0.568
cyl+0.564
peat+0.553
enment+0.550
Neg logit contributions
 FN-0.472
 reconciliation-0.461
 polic-0.460
rost-0.458
 corrid-0.449
Token and
Token ID290
Feature activation+9.45e-02

Pos logit contributions
ł+0.080
mes+0.075
cyl+0.074
peat+0.073
enment+0.073
Neg logit contributions
 FN-0.055
 polic-0.054
 reconciliation-0.054
rost-0.053
 corrid-0.053
Token he
Token ID339
Feature activation+2.25e-02

Pos logit contributions
ł+0.280
mes+0.263
cyl+0.259
peat+0.254
enment+0.253
Neg logit contributions
 FN-0.228
 polic-0.224
 reconciliation-0.223
rost-0.221
 corrid-0.218
Token let
Token ID1309
Feature activation-2.94e-02

Pos logit contributions
ł+0.067
mes+0.063
cyl+0.062
peat+0.061
enment+0.061
Neg logit contributions
 FN-0.054
 polic-0.052
 reconciliation-0.052
rost-0.052
 corrid-0.052
Token it
Token ID340
Feature activation-1.85e-01

Pos logit contributions
 FN+0.066
 polic+0.064
 reconciliation+0.064
rost+0.063
 corrid+0.062
Neg logit contributions
ł-0.093
mes-0.087
cyl-0.086
peat-0.084
enment-0.084
Token be
Token ID307
Feature activation-3.56e-01

Pos logit contributions
 FN+0.450
 polic+0.447
rost+0.444
 reconciliation+0.444
 broker+0.432
Neg logit contributions
ł-0.541
mes-0.503
cyl-0.500
enment-0.489
peat-0.486
Token known
Token ID1900
Feature activation-1.25e+00

Pos logit contributions
 FN+0.311
 polic+0.306
rost+0.293
 reconciliation+0.289
 operative+0.278
Neg logit contributions
ł-1.599
mes-1.542
cyl-1.519
enment-1.506
peat-1.500
Token to
Token ID284
Feature activation-3.26e-01

Pos logit contributions
 FN+2.903
 polic+2.892
rost+2.879
 reconciliation+2.863
 operative+2.792
Neg logit contributions
ł-3.717
mes-3.536
cyl-3.506
enment-3.450
peat-3.431
Token some
Token ID617
Feature activation+2.43e-02

Pos logit contributions
 FN+0.712
 polic+0.694
 reconciliation+0.689
rost+0.684
 corrid+0.669
Neg logit contributions
ł-1.043
mes-0.976
cyl-0.975
peat-0.958
enment-0.958
Token reporters
Token ID7638
Feature activation-3.31e-01

Pos logit contributions
ł+0.071
mes+0.067
cyl+0.066
peat+0.065
enment+0.064
Neg logit contributions
 FN-0.059
 polic-0.058
 reconciliation-0.058
rost-0.058
 corrid-0.057
Token that
Token ID326
Feature activation+1.25e-01

Pos logit contributions
 FN+0.819
 polic+0.808
 reconciliation+0.802
rost+0.801
 operative+0.786
Neg logit contributions
ł-0.953
mes-0.893
cyl-0.884
peat-0.867
enment-0.865
Token he
Token ID339
Feature activation-8.79e-02

Pos logit contributions
ł+0.389
mes+0.366
cyl+0.361
peat+0.355
enment+0.353
Neg logit contributions
 FN-0.280
 polic-0.274
 reconciliation-0.273
rost-0.272
 corrid-0.268
Token,
Token ID11
Feature activation-1.70e-01

Pos logit contributions
 FN+0.495
 polic+0.483
 reconciliation+0.481
rost+0.478
 corrid+0.471
Neg logit contributions
ł-0.667
mes-0.625
cyl-0.614
peat-0.605
enment-0.602
Token motor
Token ID5584
Feature activation+3.76e-01

Pos logit contributions
 FN+0.376
 polic+0.365
 reconciliation+0.363
rost+0.361
 corrid+0.355
Neg logit contributions
ł-0.541
mes-0.507
cyl-0.499
peat-0.492
enment-0.488
Tokenized
Token ID1143
Feature activation-5.40e-02

Pos logit contributions
ł+1.050
mes+0.977
cyl+0.960
peat+0.943
enment+0.937
Neg logit contributions
 FN-0.973
 polic-0.953
 reconciliation-0.951
rost-0.946
 corrid-0.935
Token wheelchair
Token ID27181
Feature activation-3.43e-01

Pos logit contributions
 FN+0.123
 polic+0.120
 reconciliation+0.120
rost+0.119
 corrid+0.118
Neg logit contributions
ł-0.167
mes-0.157
cyl-0.154
peat-0.152
enment-0.151
Token (
Token ID357
Feature activation-2.28e-01

Pos logit contributions
 FN+0.871
 polic+0.850
 reconciliation+0.846
rost+0.839
 operative+0.835
Neg logit contributions
ł-0.979
mes-0.914
cyl-0.890
enment-0.878
peat-0.876
TokenFigure
Token ID11337
Feature activation-1.22e+00

Pos logit contributions
 FN+0.606
rost+0.592
 polic+0.589
 reconciliation+0.585
 operative+0.578
Neg logit contributions
ł-0.622
mes-0.572
cyl-0.563
enment-0.556
peat-0.555
Token 2
Token ID362
Feature activation-5.64e-01

Pos logit contributions
 FN+2.479
 reconciliation+2.340
 polic+2.321
rost+2.319
 therapy+2.282
Neg logit contributions
ł-4.112
mes-3.798
peat-3.795
cyl-3.787
enment-3.769
TokenA
Token ID32
Feature activation-8.76e-01

Pos logit contributions
 FN+1.488
 reconciliation+1.436
 polic+1.434
rost+1.415
 operative+1.402
Neg logit contributions
ł-1.557
mes-1.435
cyl-1.405
peat-1.398
enment-1.397
Token,
Token ID11
Feature activation-3.20e-01

Pos logit contributions
 FN+2.163
 polic+2.095
 reconciliation+2.093
rost+2.062
 operative+2.034
Neg logit contributions
ł-2.547
mes-2.354
enment-2.319
cyl-2.318
peat-2.316
Token [
Token ID685
Feature activation+7.75e-02

Pos logit contributions
 FN+0.719
 polic+0.694
 reconciliation+0.694
rost+0.687
 corrid+0.678
Neg logit contributions
ł-1.009
mes-0.943
cyl-0.930
peat-0.924
enment-0.914
TokenAction
Token ID12502
Feature activation-2.25e-01

Pos logit contributions
ł+0.223
mes+0.208
cyl+0.205
peat+0.201
enment+0.200
Neg logit contributions
 FN-0.194
 polic-0.190
 reconciliation-0.189
rost-0.188
 corrid-0.186
Token Big
Token ID4403
Feature activation+9.62e-02

Pos logit contributions
ł+0.342
mes+0.318
cyl+0.315
peat+0.308
enment+0.306
Neg logit contributions
 FN-0.285
 reconciliation-0.277
 polic-0.277
rost-0.275
 operative-0.270
Token Talk
Token ID12167
Feature activation-9.87e-01

Pos logit contributions
ł+0.304
mes+0.284
cyl+0.282
peat+0.277
enment+0.276
Neg logit contributions
 FN-0.215
 polic-0.209
 reconciliation-0.207
rost-0.206
 operative-0.202
Token Productions
Token ID25315
Feature activation-1.85e-01

Pos logit contributions
 FN+2.362
 polic+2.304
 reconciliation+2.300
 operative+2.294
rost+2.279
Neg logit contributions
ł-2.902
cyl-2.703
mes-2.675
peat-2.665
enment-2.628
Token had
Token ID550
Feature activation+5.61e-02

Pos logit contributions
 FN+0.457
 polic+0.447
 reconciliation+0.446
rost+0.439
 operative+0.437
Neg logit contributions
ł-0.543
mes-0.504
cyl-0.503
peat-0.492
enment-0.489
Token [
Token ID685
Feature activation-2.43e-01

Pos logit contributions
ł+0.169
mes+0.158
cyl+0.157
peat+0.153
enment+0.153
Neg logit contributions
 FN-0.133
 polic-0.130
 reconciliation-0.130
rost-0.129
 corrid-0.127
Tokenarr
Token ID3258
Feature activation-1.21e+00

Pos logit contributions
 FN+0.631
rost+0.620
 reconciliation+0.615
 polic+0.614
 operative+0.603
Neg logit contributions
ł-0.678
mes-0.620
cyl-0.611
enment-0.609
peat-0.598
Tokenanged
Token ID5102
Feature activation-4.14e-01

Pos logit contributions
rost+2.580
 FN+2.568
 polic+2.461
 operative+2.445
isting+2.442
Neg logit contributions
ł-3.881
peat-3.591
cyl-3.568
mes-3.530
enment-3.481
Token]
Token ID60
Feature activation-2.52e-01

Pos logit contributions
 FN+0.936
 reconciliation+0.921
 polic+0.907
rost+0.900
 operative+0.892
Neg logit contributions
ł-1.296
cyl-1.207
mes-1.204
enment-1.183
peat-1.178
Token for
Token ID329
Feature activation-1.16e-01

Pos logit contributions
 FN+0.597
 reconciliation+0.584
 polic+0.579
rost+0.574
 operative+0.567
Neg logit contributions
ł-0.766
mes-0.711
cyl-0.708
peat-0.692
enment-0.691
Token distribution
Token ID6082
Feature activation-1.90e-01

Pos logit contributions
 FN+0.250
 reconciliation+0.243
 polic+0.243
rost+0.241
 operative+0.236
Neg logit contributions
ł-0.376
mes-0.351
cyl-0.349
peat-0.343
enment-0.342
Token in
Token ID287
Feature activation+6.68e-02

Pos logit contributions
 FN+0.451
 reconciliation+0.444
 polic+0.437
rost+0.433
 operative+0.432
Neg logit contributions
ł-0.579
mes-0.536
cyl-0.535
enment-0.524
peat-0.523
Token many
Token ID867
Feature activation-6.16e-02

Pos logit contributions
ł+0.069
mes+0.064
cyl+0.064
peat+0.063
enment+0.062
Neg logit contributions
 FN-0.057
 reconciliation-0.056
rost-0.056
 polic-0.055
 operative-0.054
Token other
Token ID584
Feature activation-1.64e-01

Pos logit contributions
 FN+0.109
 reconciliation+0.104
 polic+0.103
rost+0.102
 operative+0.100
Neg logit contributions
ł-0.225
mes-0.212
cyl-0.211
peat-0.209
enment-0.207
Token things
Token ID1243
Feature activation+2.13e-01

Pos logit contributions
 FN+0.362
 reconciliation+0.350
 polic+0.348
rost+0.342
 operative+0.342
Neg logit contributions
ł-0.524
mes-0.489
cyl-0.489
peat-0.483
enment-0.478
Token.
Token ID13
Feature activation-8.42e-02

Pos logit contributions
ł+0.563
mes+0.520
cyl+0.515
peat+0.504
enment+0.501
Neg logit contributions
 FN-0.589
 polic-0.574
 reconciliation-0.574
rost-0.569
 operative-0.564
Token See
Token ID4091
Feature activation-5.27e-01

Pos logit contributions
 FN+0.203
 reconciliation+0.195
 polic+0.195
rost+0.193
 operative+0.191
Neg logit contributions
ł-0.253
mes-0.235
cyl-0.235
peat-0.231
enment-0.229
Token |
Token ID930
Feature activation-1.21e+00

Pos logit contributions
 FN+1.248
 reconciliation+1.204
rost+1.182
 polic+1.174
 operative+1.166
Neg logit contributions
ł-1.619
cyl-1.506
mes-1.481
peat-1.476
enment-1.458
Tokenaut
Token ID2306
Feature activation+2.52e-01

Pos logit contributions
 FN+2.726
rost+2.650
 reconciliation+2.623
 operative+2.571
 polic+2.539
Neg logit contributions
ł-3.705
peat-3.390
cyl-3.373
enment-3.366
mes-3.355
Tokenoc
Token ID420
Feature activation+1.43e-01

Pos logit contributions
ł+0.699
mes+0.648
cyl+0.642
peat+0.629
enment+0.623
Neg logit contributions
 FN-0.655
rost-0.639
 reconciliation-0.639
 polic-0.637
 operative-0.624
Tokenomm
Token ID2002
Feature activation+1.36e-01

Pos logit contributions
ł+0.326
cyl+0.297
mes+0.295
peat+0.285
enment+0.283
Neg logit contributions
 FN-0.438
rost-0.434
 reconciliation-0.429
 polic-0.425
 operative-0.420
Tokenand
Token ID392
Feature activation+2.45e-01

Pos logit contributions
ł+0.432
cyl+0.405
mes+0.400
peat+0.392
enment+0.390
Neg logit contributions
 FN-0.301
 reconciliation-0.292
rost-0.288
 polic-0.288
 operative-0.283
Token|
Token ID91
Feature activation-6.19e-01

Pos logit contributions
ł+0.664
cyl+0.603
mes+0.602
peat+0.593
emo+0.585
Neg logit contributions
 FN-0.660
 reconciliation-0.646
rost-0.641
 polic-0.637
 operative-0.628
Token by
Token ID416
Feature activation-1.46e-01

Pos logit contributions
 FN+0.337
 polic+0.333
 reconciliation+0.328
rost+0.327
 operative+0.318
Neg logit contributions
ł-0.555
mes-0.523
cyl-0.516
enment-0.511
peat-0.511
Token Facebook
Token ID3203
Feature activation+1.66e-01

Pos logit contributions
 FN+0.333
 polic+0.330
 reconciliation+0.326
rost+0.322
 broker+0.317
Neg logit contributions
ł-0.457
mes-0.423
cyl-0.420
peat-0.414
enment-0.412
Token employees
Token ID4409
Feature activation-9.80e-02

Pos logit contributions
ł+0.491
mes+0.454
cyl+0.450
peat+0.443
enment+0.441
Neg logit contributions
 FN-0.406
 polic-0.400
 reconciliation-0.397
rost-0.394
 corrid-0.387
Token in
Token ID287
Feature activation-1.86e-01

Pos logit contributions
 FN+0.230
 polic+0.227
 reconciliation+0.224
rost+0.224
 operative+0.218
Neg logit contributions
ł-0.298
mes-0.277
cyl-0.275
peat-0.270
enment-0.270
Token Palo
Token ID44878
Feature activation-9.11e-01

Pos logit contributions
 FN+0.410
 reconciliation+0.403
 polic+0.403
rost+0.398
 therapy+0.392
Neg logit contributions
ł-0.593
cyl-0.550
mes-0.550
peat-0.541
enment-0.541
Token Alto
Token ID34317
Feature activation-1.21e+00

Pos logit contributions
 FN+2.870
 Roche+2.800
 polic+2.797
 reconciliation+2.796
rost+2.777
Neg logit contributions
ł-2.055
cyl-1.826
mes-1.807
enment-1.753
emo-1.739
Token,
Token ID11
Feature activation-1.64e-01

Pos logit contributions
 polic+2.703
 FN+2.659
 therapy+2.610
rost+2.591
 reconciliation+2.590
Neg logit contributions
ł-3.784
cyl-3.509
peat-3.482
mes-3.448
enment-3.427
Token Calif
Token ID3192
Feature activation-1.60e-01

Pos logit contributions
 FN+0.301
 polic+0.296
 reconciliation+0.294
rost+0.289
 operative+0.286
Neg logit contributions
ł-0.583
mes-0.546
cyl-0.541
peat-0.535
enment-0.531
Token.,
Token ID1539
Feature activation+2.88e-01

Pos logit contributions
 FN+0.455
rost+0.448
 polic+0.447
 reconciliation+0.444
 corrid+0.439
Neg logit contributions
ł-0.400
mes-0.368
cyl-0.363
peat-0.362
enment-0.355
Token and
Token ID290
Feature activation+6.90e-02

Pos logit contributions
ł+0.878
mes+0.820
cyl+0.810
peat+0.797
enment+0.792
Neg logit contributions
 FN-0.673
 polic-0.659
 reconciliation-0.656
rost-0.652
 corrid-0.642
Token in
Token ID287
Feature activation+4.63e-02

Pos logit contributions
ł+0.215
mes+0.201
cyl+0.199
peat+0.195
enment+0.194
Neg logit contributions
 FN-0.157
 polic-0.154
 reconciliation-0.153
rost-0.152
 corrid-0.149
Token,
Token ID11
Feature activation+2.30e-01

Pos logit contributions
ł+0.121
mes+0.114
cyl+0.112
peat+0.111
enment+0.110
Neg logit contributions
 FN-0.091
 polic-0.089
 reconciliation-0.088
rost-0.087
 corrid-0.086
Token synd
Token ID11150
Feature activation-2.24e-01

Pos logit contributions
ł+0.726
mes+0.680
cyl+0.673
peat+0.662
enment+0.658
Neg logit contributions
 FN-0.513
 polic-0.500
 reconciliation-0.499
rost-0.493
 corrid-0.488
Tokenicated
Token ID3474
Feature activation-2.04e-01

Pos logit contributions
rost+0.474
 FN+0.471
 reconciliation+0.454
 polic+0.451
isting+0.444
Neg logit contributions
ł-0.705
mes-0.671
cyl-0.667
peat-0.656
enment-0.650
Token col
Token ID951
Feature activation-3.44e-01

Pos logit contributions
 FN+0.441
 polic+0.429
 reconciliation+0.428
rost+0.419
 corrid+0.418
Neg logit contributions
ł-0.661
mes-0.622
cyl-0.619
peat-0.612
enment-0.606
Tokenum
Token ID388
Feature activation+1.45e-02

Pos logit contributions
 FN+0.898
rost+0.880
 polic+0.869
 reconciliation+0.860
isting+0.859
Neg logit contributions
ł-0.934
cyl-0.864
enment-0.840
peat-0.835
mes-0.829
Token...
Token ID986
Feature activation-1.21e+00

Pos logit contributions
ł+0.038
mes+0.035
cyl+0.035
peat+0.034
enment+0.034
Neg logit contributions
 FN-0.040
 polic-0.039
 reconciliation-0.039
rost-0.038
 operative-0.038
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
 FN+2.959
 reconciliation+2.843
 polic+2.835
rost+2.810
 operative+2.750
Neg logit contributions
ł-3.531
mes-3.264
cyl-3.243
enment-3.208
peat-3.186
Token
Token ID198
Feature activation-4.52e-01

Pos logit contributions
 FN+1.157
rost+1.117
 reconciliation+1.113
 polic+1.106
 Roche+1.080
Neg logit contributions
ł-1.737
mes-1.625
cyl-1.616
peat-1.589
enment-1.588
TokenJust
Token ID5703
Feature activation-2.29e-02

Pos logit contributions
 FN+1.154
rost+1.135
 reconciliation+1.130
 polic+1.116
 operative+1.100
Neg logit contributions
ł-1.254
mes-1.150
enment-1.142
cyl-1.139
peat-1.127
Token saw
Token ID2497
Feature activation-1.29e-01

Pos logit contributions
 FN+0.054
 polic+0.053
 reconciliation+0.053
rost+0.053
 operative+0.052
Neg logit contributions
ł-0.069
mes-0.064
cyl-0.064
peat-0.063
enment-0.062
Token this
Token ID428
Feature activation-1.27e-01

Pos logit contributions
 FN+0.294
 reconciliation+0.285
 polic+0.285
rost+0.280
 operative+0.276
Neg logit contributions
ł-0.404
mes-0.377
cyl-0.375
peat-0.369
enment-0.366
Token the
Token ID262
Feature activation-1.82e-01

Pos logit contributions
 FN+0.108
 reconciliation+0.105
 polic+0.105
rost+0.103
 corrid+0.102
Neg logit contributions
ł-0.149
mes-0.139
cyl-0.138
peat-0.135
enment-0.135
Token town
Token ID3240
Feature activation+6.88e-01

Pos logit contributions
 FN+0.368
 reconciliation+0.357
 polic+0.356
rost+0.351
 corrid+0.346
Neg logit contributions
ł-0.615
mes-0.576
cyl-0.573
peat-0.563
enment-0.562
Token
Token ID198
Feature activation-2.11e-01

Pos logit contributions
ł+1.857
mes+1.722
cyl+1.695
peat+1.659
enment+1.654
Neg logit contributions
 FN-1.846
 reconciliation-1.811
 polic-1.811
rost-1.793
 corrid-1.764
Token
Token ID198
Feature activation-5.70e-02

Pos logit contributions
 FN+0.464
 reconciliation+0.451
rost+0.450
 polic+0.450
 operative+0.437
Neg logit contributions
ł-0.666
mes-0.625
cyl-0.618
peat-0.607
enment-0.605
TokenBuy
Token ID14518
Feature activation-2.44e-01

Pos logit contributions
 FN+0.147
rost+0.144
 reconciliation+0.144
 polic+0.143
 operative+0.140
Neg logit contributions
ł-0.157
mes-0.145
enment-0.143
cyl-0.143
peat-0.141
Token Photo
Token ID5555
Feature activation-1.20e+00

Pos logit contributions
 FN+0.702
 polic+0.681
 reconciliation+0.677
rost+0.673
 therapy+0.665
Neg logit contributions
ł-0.611
mes-0.560
cyl-0.559
enment-0.550
emo-0.540
Token Pokemon
Token ID14878
Feature activation+3.08e-01

Pos logit contributions
 FN+2.794
 Roche+2.626
 polic+2.620
rost+2.541
 reconciliation+2.529
Neg logit contributions
ł-3.670
cyl-3.480
peat-3.453
mes-3.440
enment-3.393
Token Go
Token ID1514
Feature activation-6.10e-02

Pos logit contributions
ł+0.978
mes+0.917
cyl+0.907
peat+0.891
enment+0.889
Neg logit contributions
 FN-0.684
 polic-0.667
 reconciliation-0.663
rost-0.658
 corrid-0.652
Token players
Token ID1938
Feature activation-3.61e-01

Pos logit contributions
 FN+0.146
 polic+0.144
 reconciliation+0.143
rost+0.141
 operative+0.141
Neg logit contributions
ł-0.182
mes-0.170
cyl-0.169
peat-0.166
enment-0.166
Token make
Token ID787
Feature activation-6.18e-02

Pos logit contributions
 FN+0.848
 polic+0.828
 reconciliation+0.815
rost+0.813
 broker+0.799
Neg logit contributions
ł-1.104
cyl-1.027
mes-1.025
peat-1.009
enment-1.003
Token their
Token ID511
Feature activation-2.19e-02

Pos logit contributions
 FN+0.118
 reconciliation+0.116
 polic+0.114
rost+0.112
 corrid+0.111
Neg logit contributions
ł-0.216
mes-0.203
cyl-0.201
enment-0.199
peat-0.198
Token
Token ID851
Feature activation+8.27e-02

Pos logit contributions
 FN+1.484
rost+1.437
 polic+1.437
 reconciliation+1.434
 operative+1.402
Neg logit contributions
ł-1.611
mes-1.494
cyl-1.485
peat-1.479
enment-1.459
Token Winnipeg
Token ID23434
Feature activation-6.87e-02

Pos logit contributions
ł+0.267
mes+0.251
cyl+0.249
peat+0.246
enment+0.244
Neg logit contributions
 FN-0.179
 polic-0.174
 reconciliation-0.173
rost-0.172
 corrid-0.168
Token Jets
Token ID14728
Feature activation+2.36e-01

Pos logit contributions
 FN+0.154
 polic+0.150
rost+0.148
 reconciliation+0.148
 operative+0.145
Neg logit contributions
ł-0.217
cyl-0.202
mes-0.202
peat-0.201
enment-0.199
Token (@
Token ID4275
Feature activation-3.35e-01

Pos logit contributions
ł+0.606
mes+0.560
cyl+0.554
peat+0.546
enment+0.541
Neg logit contributions
 FN-0.665
 polic-0.651
rost-0.647
 reconciliation-0.647
 corrid-0.636
TokenN
Token ID45
Feature activation-4.29e-01

Pos logit contributions
 FN+0.882
rost+0.870
 polic+0.858
 reconciliation+0.854
 operative+0.834
Neg logit contributions
ł-0.900
cyl-0.821
mes-0.816
enment-0.816
peat-0.808
TokenHL
Token ID6581
Feature activation-1.20e+00

Pos logit contributions
 FN+0.902
rost+0.883
 reconciliation+0.862
 polic+0.862
 operative+0.832
Neg logit contributions
ł-1.396
mes-1.301
cyl-1.296
peat-1.288
enment-1.278
TokenJ
Token ID41
Feature activation+8.99e-03

Pos logit contributions
 FN+2.899
rost+2.896
 polic+2.851
isting+2.819
 reconciliation+2.812
Neg logit contributions
ł-3.442
enment-3.166
cyl-3.161
peat-3.072
mes-3.032
Tokenets
Token ID1039
Feature activation-7.30e-02

Pos logit contributions
ł+0.021
cyl+0.019
mes+0.019
peat+0.019
enment+0.018
Neg logit contributions
 FN-0.027
rost-0.027
 polic-0.027
 reconciliation-0.027
 operative-0.026
Token)
Token ID8
Feature activation-1.46e-01

Pos logit contributions
 FN+0.182
 polic+0.178
rost+0.177
 reconciliation+0.176
 corrid+0.173
Neg logit contributions
ł-0.210
mes-0.195
cyl-0.193
peat-0.190
enment-0.189
Token October
Token ID3267
Feature activation-1.36e-01

Pos logit contributions
 FN+0.353
 polic+0.344
 reconciliation+0.344
rost+0.342
 corrid+0.336
Neg logit contributions
ł-0.434
mes-0.405
cyl-0.400
peat-0.393
enment-0.390
Token 21
Token ID2310
Feature activation-4.40e-01

Pos logit contributions
 FN+0.332
 polic+0.327
 reconciliation+0.326
 corrid+0.326
rost+0.324
Neg logit contributions
ł-0.399
mes-0.373
cyl-0.365
peat-0.358
enment-0.356
Token Gr
Token ID1902
Feature activation+1.70e-01

Pos logit contributions
ł+0.539
mes+0.493
cyl+0.491
peat+0.480
enment+0.475
Neg logit contributions
 FN-0.569
 reconciliation-0.554
rost-0.551
 therapy-0.541
 polic-0.538
Tokenum
Token ID388
Feature activation+2.39e-01

Pos logit contributions
ł+0.406
mes+0.374
cyl+0.365
peat+0.355
emo+0.353
Neg logit contributions
 FN-0.508
 polic-0.504
 reconciliation-0.500
rost-0.497
 corrid-0.497
Tokenman
Token ID805
Feature activation+1.46e-01

Pos logit contributions
ł+0.670
mes+0.615
cyl+0.611
peat+0.603
enment+0.601
Neg logit contributions
 FN-0.616
 polic-0.598
 reconciliation-0.598
rost-0.597
 operative-0.586
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
ł+0.402
mes+0.370
cyl+0.368
peat+0.359
enment+0.357
Neg logit contributions
 FN-0.389
 reconciliation-0.379
 polic-0.379
rost-0.376
 operative-0.371
Token (
Token ID357
Feature activation+7.25e-02

Pos logit contributions
 FN+0.444
 reconciliation+0.426
 polic+0.425
rost+0.423
 Roche+0.420
Neg logit contributions
ł-0.612
mes-0.570
cyl-0.570
peat-0.560
emo-0.554
TokenAbout
Token ID8585
Feature activation-1.20e+00

Pos logit contributions
ł+0.210
mes+0.195
cyl+0.192
peat+0.189
enment+0.188
Neg logit contributions
 FN-0.181
 polic-0.177
 reconciliation-0.176
rost-0.175
 corrid-0.172
Token 100
Token ID1802
Feature activation-5.35e-01

Pos logit contributions
 FN+2.876
 reconciliation+2.803
 polic+2.771
rost+2.742
 Roche+2.714
Neg logit contributions
ł-3.581
mes-3.294
peat-3.287
cyl-3.279
enment-3.279
Token passengers
Token ID10405
Feature activation-5.54e-01

Pos logit contributions
 FN+1.143
 reconciliation+1.115
 polic+1.113
rost+1.094
 broker+1.071
Neg logit contributions
ł-1.753
mes-1.622
cyl-1.613
peat-1.596
enment-1.591
Token injured
Token ID6686
Feature activation-3.46e-01

Pos logit contributions
 FN+1.231
 polic+1.217
rost+1.204
 reconciliation+1.199
 broker+1.174
Neg logit contributions
ł-1.740
cyl-1.628
mes-1.615
peat-1.596
enment-1.587
Token in
Token ID287
Feature activation-1.24e-01

Pos logit contributions
 FN+0.792
 polic+0.776
 reconciliation+0.773
rost+0.771
 operative+0.751
Neg logit contributions
ł-1.067
mes-0.992
cyl-0.987
peat-0.970
enment-0.965
Token the
Token ID262
Feature activation+3.56e-02

Pos logit contributions
 FN+0.288
 reconciliation+0.280
 polic+0.279
rost+0.277
 operative+0.272
Neg logit contributions
ł-0.384
mes-0.357
cyl-0.355
peat-0.349
enment-0.348
Token was
Token ID373
Feature activation-1.57e-01

Pos logit contributions
ł+0.908
mes+0.843
cyl+0.833
peat+0.822
enment+0.818
Neg logit contributions
 FN-0.731
 polic-0.719
 reconciliation-0.710
rost-0.704
 operative-0.697
Token designed
Token ID3562
Feature activation-2.92e-01

Pos logit contributions
 FN+0.325
 polic+0.323
rost+0.314
 reconciliation+0.313
 operative+0.312
Neg logit contributions
ł-0.519
mes-0.484
cyl-0.482
peat-0.477
enment-0.475
Token as
Token ID355
Feature activation-1.69e-01

Pos logit contributions
 FN+0.693
 polic+0.680
 reconciliation+0.670
 operative+0.665
rost+0.664
Neg logit contributions
ł-0.884
mes-0.818
cyl-0.812
enment-0.810
peat-0.802
Token shown
Token ID3402
Feature activation-6.40e-01

Pos logit contributions
 FN+0.420
 polic+0.407
 reconciliation+0.405
 operative+0.400
rost+0.397
Neg logit contributions
ł-0.501
mes-0.460
cyl-0.457
enment-0.453
peat-0.452
Token in
Token ID287
Feature activation+4.93e-02

Pos logit contributions
 FN+1.412
 reconciliation+1.367
 polic+1.361
rost+1.354
 operative+1.339
Neg logit contributions
ł-2.024
mes-1.891
enment-1.876
cyl-1.870
peat-1.858
Token Figure
Token ID11291
Feature activation-1.20e+00

Pos logit contributions
ł+0.157
mes+0.146
cyl+0.145
peat+0.143
enment+0.143
Neg logit contributions
 FN-0.110
 reconciliation-0.106
 polic-0.106
rost-0.105
 operative-0.103
Token 1
Token ID352
Feature activation-5.81e-01

Pos logit contributions
 FN+2.433
 reconciliation+2.304
rost+2.254
 polic+2.245
 operative+2.230
Neg logit contributions
ł-4.048
peat-3.767
cyl-3.742
mes-3.734
enment-3.706
Token.
Token ID13
Feature activation-5.82e-02

Pos logit contributions
 FN+1.549
 reconciliation+1.488
 polic+1.476
 operative+1.459
rost+1.457
Neg logit contributions
ł-1.604
mes-1.460
peat-1.445
cyl-1.441
enment-1.427
Token The
Token ID383
Feature activation+9.77e-02

Pos logit contributions
 FN+0.138
 polic+0.130
 Roche+0.129
 reconciliation+0.128
rost+0.128
Neg logit contributions
ł-0.179
cyl-0.166
peat-0.165
mes-0.164
enment-0.161
Token maze
Token ID31237
Feature activation+3.15e-01

Pos logit contributions
ł+0.296
cyl+0.273
mes+0.270
peat+0.269
enment+0.267
Neg logit contributions
 FN-0.234
 polic-0.228
 reconciliation-0.227
rost-0.224
 operative-0.224
Token was
Token ID373
Feature activation+8.37e-02

Pos logit contributions
ł+0.891
mes+0.816
cyl+0.811
peat+0.806
enment+0.800
Neg logit contributions
 FN-0.818
 polic-0.805
 reconciliation-0.795
rost-0.788
 operative-0.784
Token there
Token ID612
Feature activation-5.83e-02

Pos logit contributions
 FN+0.359
 reconciliation+0.354
 polic+0.351
rost+0.346
 operative+0.342
Neg logit contributions
ł-0.503
mes-0.467
cyl-0.465
peat-0.455
enment-0.453
Token also
Token ID635
Feature activation-3.24e-01

Pos logit contributions
 FN+0.147
 polic+0.145
 reconciliation+0.144
rost+0.143
 operative+0.141
Neg logit contributions
ł-0.166
mes-0.155
cyl-0.153
peat-0.150
enment-0.149
Token exists
Token ID7160
Feature activation-7.09e-01

Pos logit contributions
 FN+0.801
 polic+0.795
 reconciliation+0.794
rost+0.778
 operative+0.774
Neg logit contributions
ł-0.940
cyl-0.877
mes-0.876
peat-0.858
emo-0.847
Token self
Token ID2116
Feature activation-5.20e-01

Pos logit contributions
 reconciliation+1.569
 FN+1.562
 polic+1.554
rost+1.513
 therapy+1.508
Neg logit contributions
ł-2.243
cyl-2.112
mes-2.091
peat-2.066
emo-2.037
Token-
Token ID12
Feature activation-4.38e-01

Pos logit contributions
 reconciliation+1.347
 FN+1.338
 polic+1.336
rost+1.315
 therapy+1.297
Neg logit contributions
ł-1.445
mes-1.345
cyl-1.333
peat-1.296
emo-1.283
Tokenrighteous
Token ID49955
Feature activation-1.19e+00

Pos logit contributions
 reconciliation+1.128
 FN+1.127
 polic+1.126
rost+1.121
 therapy+1.098
Neg logit contributions
ł-1.196
mes-1.099
cyl-1.093
peat-1.074
enment-1.059
Tokenness
Token ID1108
Feature activation-6.15e-01

Pos logit contributions
 reconciliation+2.522
 polic+2.520
 FN+2.510
 therapy+2.464
 rule+2.452
Neg logit contributions
ł-3.853
cyl-3.618
mes-3.560
peat-3.539
oleon-3.471
Token,
Token ID11
Feature activation-2.20e-01

Pos logit contributions
 polic+1.459
 FN+1.446
 reconciliation+1.429
 operative+1.417
rost+1.406
Neg logit contributions
ł-1.854
mes-1.730
cyl-1.717
enment-1.667
peat-1.664
Token racial
Token ID8425
Feature activation-3.44e-01

Pos logit contributions
 FN+0.425
 polic+0.420
 reconciliation+0.419
rost+0.408
 operative+0.404
Neg logit contributions
ł-0.759
mes-0.713
cyl-0.708
peat-0.694
enment-0.690
Token superiority
Token ID26883
Feature activation-5.70e-01

Pos logit contributions
 FN+0.554
 reconciliation+0.549
 polic+0.548
rost+0.529
 operative+0.519
Neg logit contributions
ł-1.300
mes-1.226
cyl-1.219
peat-1.207
enment-1.194
Token,
Token ID11
Feature activation-3.19e-01

Pos logit contributions
 polic+1.254
 FN+1.240
 reconciliation+1.239
rost+1.220
 therapy+1.214
Neg logit contributions
ł-1.800
mes-1.678
cyl-1.664
peat-1.639
emo-1.624
Token+
Token ID10
Feature activation-1.98e-01

Pos logit contributions
 FN+0.131
rost+0.129
 reconciliation+0.129
 polic+0.128
 operative+0.126
Neg logit contributions
ł-0.077
mes-0.069
cyl-0.069
peat-0.067
enment-0.067
Token Natural
Token ID12068
Feature activation+4.75e-01

Pos logit contributions
 FN+0.457
 polic+0.441
 reconciliation+0.437
rost+0.435
 Roche+0.433
Neg logit contributions
ł-0.614
mes-0.570
cyl-0.565
peat-0.559
emo-0.549
Token leader
Token ID3554
Feature activation-1.24e-01

Pos logit contributions
ł+1.447
mes+1.343
cyl+1.329
peat+1.308
enment+1.298
Neg logit contributions
 FN-1.121
 polic-1.093
 reconciliation-1.093
rost-1.081
 operative-1.068
Token,
Token ID11
Feature activation-3.04e-01

Pos logit contributions
 FN+0.295
 polic+0.287
 reconciliation+0.284
 operative+0.282
rost+0.282
Neg logit contributions
ł-0.376
mes-0.346
cyl-0.343
peat-0.341
emo-0.333
Token Bon
Token ID7979
Feature activation+1.92e-02

Pos logit contributions
 FN+0.690
 polic+0.678
 reconciliation+0.673
 operative+0.665
rost+0.661
Neg logit contributions
ł-0.960
mes-0.884
cyl-0.877
peat-0.870
emo-0.856
Token v
Token ID410
Feature activation-1.19e+00

Pos logit contributions
ł+0.051
mes+0.047
cyl+0.046
peat+0.046
enment+0.046
Neg logit contributions
 FN-0.053
 polic-0.051
rost-0.051
 reconciliation-0.051
 therapy-0.050
Tokenive
Token ID425
Feature activation-7.80e-02

Pos logit contributions
rost+3.274
 FN+3.242
 Roche+3.208
 polic+3.208
 reconciliation+3.194
Neg logit contributions
ł-2.996
cyl-2.809
mes-2.741
peat-2.728
enment-2.725
Tokenur
Token ID333
Feature activation-3.55e-01

Pos logit contributions
 FN+0.194
 polic+0.192
 reconciliation+0.190
rost+0.189
 operative+0.186
Neg logit contributions
ł-0.227
cyl-0.209
mes-0.206
peat-0.204
enment-0.204
Token,
Token ID11
Feature activation-3.43e-01

Pos logit contributions
 FN+0.809
 polic+0.805
 reconciliation+0.791
 operative+0.788
rost+0.788
Neg logit contributions
ł-1.104
mes-1.021
cyl-1.013
peat-1.002
emo-0.990
Token Char
Token ID3178
Feature activation-1.42e-01

Pos logit contributions
 FN+0.783
 polic+0.767
 reconciliation+0.759
 operative+0.750
rost+0.746
Neg logit contributions
ł-1.076
mes-0.997
cyl-0.990
peat-0.978
emo-0.965
Tokenism
Token ID1042
Feature activation-2.95e-03

Pos logit contributions
 FN+0.351
 reconciliation+0.347
rost+0.346
 polic+0.346
 operative+0.337
Neg logit contributions
ł-0.409
cyl-0.380
enment-0.374
mes-0.374
peat-0.369
Token-
Token ID12
Feature activation+1.01e-01

Pos logit contributions
 FN+0.419
 reconciliation+0.414
rost+0.414
 polic+0.412
 therapy+0.407
Neg logit contributions
ł-0.319
cyl-0.288
mes-0.286
peat-0.283
enment-0.281
Tokena
Token ID64
Feature activation-3.48e-02

Pos logit contributions
ł+0.254
mes+0.231
cyl+0.229
peat+0.224
enment+0.224
Neg logit contributions
 FN-0.288
rost-0.284
 polic-0.283
 reconciliation-0.282
 operative-0.277
Token...
Token ID986
Feature activation-1.04e+00

Pos logit contributions
 FN+0.088
 polic+0.086
rost+0.086
 reconciliation+0.086
 broker+0.084
Neg logit contributions
ł-0.097
cyl-0.090
mes-0.090
enment-0.089
peat-0.089
Token<|endoftext|>
Token ID50256
Feature activation-9.17e-03

Pos logit contributions
 FN+2.161
 polic+2.084
 reconciliation+2.060
rost+2.055
 corrid+2.007
Neg logit contributions
ł-3.347
mes-3.178
cyl-3.147
peat-3.118
enment-3.093
TokenBuy
Token ID14518
Feature activation-3.89e-01

Pos logit contributions
 FN+0.024
rost+0.023
 reconciliation+0.023
 polic+0.023
 operative+0.022
Neg logit contributions
ł-0.025
mes-0.024
cyl-0.023
enment-0.023
peat-0.023
Token Photo
Token ID5555
Feature activation-1.18e+00

Pos logit contributions
 FN+1.118
 reconciliation+1.082
 polic+1.079
rost+1.069
 Roche+1.063
Neg logit contributions
ł-0.977
cyl-0.897
mes-0.896
enment-0.870
peat-0.867
Token F
Token ID376
Feature activation+2.26e-02

Pos logit contributions
 FN+2.764
 Roche+2.604
 polic+2.591
 reconciliation+2.520
rost+2.519
Neg logit contributions
ł-3.613
cyl-3.404
peat-3.377
mes-3.360
enment-3.319
TokenSU
Token ID12564
Feature activation-1.20e-03

Pos logit contributions
ł+0.058
mes+0.054
cyl+0.053
peat+0.053
enment+0.052
Neg logit contributions
 FN-0.063
rost-0.062
 polic-0.062
 reconciliation-0.061
 corrid-0.060
Token target
Token ID2496
Feature activation+3.59e-01

Pos logit contributions
 FN+0.002
 polic+0.002
 reconciliation+0.002
rost+0.002
 operative+0.002
Neg logit contributions
ł-0.004
mes-0.004
cyl-0.004
peat-0.004
enment-0.004
Token Marvin
Token ID35105
Feature activation-9.78e-03

Pos logit contributions
ł+1.174
mes+1.102
cyl+1.089
peat+1.072
enment+1.066
Neg logit contributions
 FN-0.760
 polic-0.739
 reconciliation-0.737
rost-0.732
 corrid-0.719
Token Wilson
Token ID8127
Feature activation-1.20e-01

Pos logit contributions
 FN+0.021
 polic+0.020
 reconciliation+0.020
rost+0.020
 Roche+0.020
Neg logit contributions
ł-0.032
mes-0.030
cyl-0.030
peat-0.029
enment-0.029
Token limited
Token ID3614
Feature activation-2.78e-01

Pos logit contributions
 FN+0.411
 reconciliation+0.407
rost+0.405
 polic+0.402
 operative+0.399
Neg logit contributions
ł-0.470
mes-0.438
cyl-0.434
enment-0.430
peat-0.426
Token supply
Token ID5127
Feature activation+1.96e-01

Pos logit contributions
 FN+0.512
 reconciliation+0.501
 polic+0.498
rost+0.494
 therapy+0.482
Neg logit contributions
ł-0.995
mes-0.936
cyl-0.930
peat-0.917
emo-0.915
Token on
Token ID319
Feature activation-8.14e-02

Pos logit contributions
ł+0.600
mes+0.560
cyl+0.553
peat+0.546
enment+0.543
Neg logit contributions
 FN-0.452
 polic-0.447
 reconciliation-0.443
rost-0.442
 therapy-0.431
Token the
Token ID262
Feature activation+4.56e-02

Pos logit contributions
 FN+0.187
 reconciliation+0.184
 polic+0.183
rost+0.181
 operative+0.177
Neg logit contributions
ł-0.252
mes-0.235
cyl-0.233
enment-0.229
peat-0.228
Token arch
Token ID3934
Feature activation-4.71e-01

Pos logit contributions
ł+0.147
mes+0.137
cyl+0.136
peat+0.133
enment+0.133
Neg logit contributions
 FN-0.099
 polic-0.097
 reconciliation-0.097
rost-0.096
 corrid-0.094
Tokenipel
Token ID40634
Feature activation-1.18e+00

Pos logit contributions
rost+0.919
 FN+0.917
 polic+0.908
 reconciliation+0.901
 broker+0.882
Neg logit contributions
ł-1.586
enment-1.468
cyl-1.461
mes-1.458
peat-1.433
Tokenago
Token ID3839
Feature activation-3.11e-01

Pos logit contributions
rost+3.599
 FN+3.568
 polic+3.498
isting+3.475
 reconciliation+3.464
Neg logit contributions
ł-2.710
mes-2.506
cyl-2.423
enment-2.286
peat-2.264
Token,
Token ID11
Feature activation-6.58e-02

Pos logit contributions
 FN+0.763
 polic+0.760
rost+0.755
 reconciliation+0.752
 therapy+0.734
Neg logit contributions
ł-0.903
mes-0.850
cyl-0.830
enment-0.820
peat-0.811
Token is
Token ID318
Feature activation+9.56e-02

Pos logit contributions
 FN+0.168
 polic+0.165
 reconciliation+0.165
rost+0.163
 operative+0.161
Neg logit contributions
ł-0.187
mes-0.173
cyl-0.172
enment-0.168
peat-0.167
Token likely
Token ID1884
Feature activation+2.42e-01

Pos logit contributions
ł+0.307
mes+0.288
cyl+0.284
peat+0.280
enment+0.279
Neg logit contributions
 FN-0.208
 polic-0.206
 reconciliation-0.203
rost-0.202
 operative-0.199
Token to
Token ID284
Feature activation+8.14e-02

Pos logit contributions
ł+0.627
mes+0.581
cyl+0.566
enment+0.560
peat+0.554
Neg logit contributions
 FN-0.676
 polic-0.676
 reconciliation-0.669
rost-0.664
 operative-0.658
Token called
Token ID1444
Feature activation-1.32e-01

Pos logit contributions
ł+0.134
mes+0.125
cyl+0.124
peat+0.122
enment+0.121
Neg logit contributions
 FN-0.105
 polic-0.103
 reconciliation-0.102
rost-0.102
 corrid-0.100
Token her
Token ID607
Feature activation+2.76e-02

Pos logit contributions
 FN+0.316
 polic+0.308
 reconciliation+0.306
rost+0.303
 corrid+0.298
Neg logit contributions
ł-0.400
mes-0.372
cyl-0.369
peat-0.364
enment-0.363
Token son
Token ID3367
Feature activation+5.38e-02

Pos logit contributions
ł+0.091
mes+0.086
cyl+0.085
peat+0.083
enment+0.083
Neg logit contributions
 FN-0.058
 polic-0.057
 reconciliation-0.056
rost-0.056
 corrid-0.055
Token who
Token ID508
Feature activation-1.80e-01

Pos logit contributions
ł+0.156
mes+0.145
cyl+0.144
peat+0.142
enment+0.141
Neg logit contributions
 FN-0.134
 polic-0.131
 reconciliation-0.131
rost-0.130
 corrid-0.127
Token lives
Token ID3160
Feature activation-1.56e-01

Pos logit contributions
 FN+0.430
 polic+0.423
 reconciliation+0.417
rost+0.416
 operative+0.407
Neg logit contributions
ł-0.540
mes-0.504
cyl-0.499
peat-0.490
enment-0.488
Token next
Token ID1306
Feature activation+7.57e-01

Pos logit contributions
 FN+0.363
 polic+0.356
 reconciliation+0.354
rost+0.352
 corrid+0.345
Neg logit contributions
ł-0.478
mes-0.448
cyl-0.442
peat-0.435
enment-0.434
Token door
Token ID3420
Feature activation+2.92e-02

Pos logit contributions
ł+2.117
mes+1.971
cyl+1.937
peat+1.903
enment+1.891
Neg logit contributions
 FN-1.955
 polic-1.917
 reconciliation-1.912
rost-1.901
 corrid-1.881
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
ł+0.084
mes+0.078
cyl+0.077
peat+0.076
enment+0.076
Neg logit contributions
 FN-0.074
 polic-0.072
 reconciliation-0.072
rost-0.071
 corrid-0.070
Token Police
Token ID4287
Feature activation-1.47e-03

Pos logit contributions
 FN+0.226
 polic+0.216
rost+0.213
 reconciliation+0.212
 Roche+0.212
Neg logit contributions
ł-0.329
cyl-0.309
mes-0.308
peat-0.306
enment-0.301
Token say
Token ID910
Feature activation+3.60e-02

Pos logit contributions
 FN+0.003
 polic+0.003
 reconciliation+0.003
rost+0.003
 corrid+0.003
Neg logit contributions
ł-0.004
mes-0.004
cyl-0.004
peat-0.004
enment-0.004
Token the
Token ID262
Feature activation-2.19e-01

Pos logit contributions
ł+0.117
mes+0.110
cyl+0.109
peat+0.107
enment+0.107
Neg logit contributions
 FN-0.077
 polic-0.075
 reconciliation-0.075
rost-0.074
 Roche-0.073
Token only
Token ID691
Feature activation-1.78e-01

Pos logit contributions
ł+0.550
mes+0.515
cyl+0.511
peat+0.502
enment+0.499
Neg logit contributions
 FN-0.383
 polic-0.375
 reconciliation-0.373
rost-0.369
 operative-0.364
Token big
Token ID1263
Feature activation+3.92e-01

Pos logit contributions
 FN+0.472
 polic+0.467
 reconciliation+0.464
rost+0.459
 operative+0.455
Neg logit contributions
ł-0.486
mes-0.450
cyl-0.445
peat-0.435
enment-0.432
Token city
Token ID1748
Feature activation-2.99e-01

Pos logit contributions
ł+1.224
mes+1.148
cyl+1.132
peat+1.113
enment+1.107
Neg logit contributions
 FN-0.886
 polic-0.866
 reconciliation-0.863
rost-0.858
 corrid-0.847
Token in
Token ID287
Feature activation-1.09e-01

Pos logit contributions
 polic+0.693
 FN+0.692
 reconciliation+0.675
rost+0.670
 operative+0.666
Neg logit contributions
ł-0.914
mes-0.856
cyl-0.845
enment-0.830
peat-0.827
Token the
Token ID262
Feature activation+1.71e-01

Pos logit contributions
 FN+0.227
 reconciliation+0.224
 polic+0.223
 operative+0.219
rost+0.218
Neg logit contributions
ł-0.358
mes-0.334
cyl-0.332
peat-0.326
enment-0.325
Token nation
Token ID3277
Feature activation+7.97e-01

Pos logit contributions
ł+0.627
mes+0.588
cyl+0.586
peat+0.579
enment+0.574
Neg logit contributions
 FN-0.297
 polic-0.288
 reconciliation-0.287
rost-0.283
 operative-0.279
Token where
Token ID810
Feature activation+2.42e-01

Pos logit contributions
ł+2.131
mes+1.976
cyl+1.943
peat+1.904
enment+1.891
Neg logit contributions
 FN-2.154
 polic-2.112
 reconciliation-2.108
rost-2.097
 corrid-2.077
Token no
Token ID645
Feature activation-1.80e-01

Pos logit contributions
ł+0.772
mes+0.729
cyl+0.723
enment+0.711
peat+0.709
Neg logit contributions
 FN-0.526
 polic-0.521
 reconciliation-0.517
rost-0.507
 operative-0.500
Token one
Token ID530
Feature activation+9.48e-02

Pos logit contributions
 FN+0.384
 polic+0.380
 reconciliation+0.378
rost+0.371
 broker+0.368
Neg logit contributions
ł-0.582
mes-0.550
cyl-0.549
enment-0.537
peat-0.536
Token who
Token ID508
Feature activation-7.62e-02

Pos logit contributions
ł+0.286
mes+0.267
cyl+0.265
peat+0.259
enment+0.259
Neg logit contributions
 FN-0.224
 polic-0.221
 reconciliation-0.219
rost-0.217
 operative-0.214
Token needs
Token ID2476
Feature activation+2.19e-01

Pos logit contributions
 FN+0.174
 polic+0.173
rost+0.170
 reconciliation+0.169
 operative+0.166
Neg logit contributions
ł-0.232
mes-0.218
cyl-0.217
peat-0.214
enment-0.212
TokenD
Token ID35
Feature activation+6.03e-01

Pos logit contributions
 FN+0.125
 polic+0.122
 reconciliation+0.121
rost+0.121
 corrid+0.120
Neg logit contributions
ł-0.117
mes-0.108
cyl-0.105
enment-0.104
peat-0.103
Token poly
Token ID7514
Feature activation+8.04e-01

Pos logit contributions
ł+2.087
mes+1.970
cyl+1.946
peat+1.915
enment+1.906
Neg logit contributions
 FN-1.152
 polic-1.121
 reconciliation-1.117
rost-1.109
 corrid-1.093
Tokengon
Token ID14520
Feature activation+4.87e-01

Pos logit contributions
ł+2.007
mes+1.837
cyl+1.800
peat+1.774
enment+1.768
Neg logit contributions
 FN-2.324
 polic-2.283
 reconciliation-2.275
rost-2.271
 corrid-2.244
Token arcade
Token ID27210
Feature activation-3.00e-02

Pos logit contributions
ł+1.536
mes+1.435
cyl+1.416
peat+1.396
enment+1.393
Neg logit contributions
 FN-1.089
 polic-1.066
 reconciliation-1.064
rost-1.055
 corrid-1.041
Token racing
Token ID11717
Feature activation+3.21e-01

Pos logit contributions
 FN+0.053
 polic+0.051
 reconciliation+0.051
rost+0.050
 operative+0.049
Neg logit contributions
ł-0.110
mes-0.103
cyl-0.102
peat-0.101
enment-0.101
Token games
Token ID1830
Feature activation+7.73e-01

Pos logit contributions
ł+1.176
mes+1.107
cyl+1.093
peat+1.085
enment+1.080
Neg logit contributions
 FN-0.560
 polic-0.543
 reconciliation-0.536
rost-0.530
 operative-0.522
Token from
Token ID422
Feature activation+3.20e-01

Pos logit contributions
ł+2.130
mes+1.982
cyl+1.949
peat+1.910
enment+1.897
Neg logit contributions
 FN-2.023
 polic-1.985
 reconciliation-1.981
rost-1.971
 corrid-1.953
Token 1988
Token ID12122
Feature activation+4.72e-01

Pos logit contributions
ł+0.971
mes+0.905
cyl+0.892
peat+0.881
enment+0.877
Neg logit contributions
 FN-0.757
 polic-0.735
 reconciliation-0.733
rost-0.728
 corrid-0.716
Token to
Token ID284
Feature activation+4.80e-01

Pos logit contributions
ł+1.246
mes+1.153
cyl+1.135
peat+1.113
enment+1.107
Neg logit contributions
 FN-1.297
 polic-1.269
 reconciliation-1.266
rost-1.260
 corrid-1.244
Token 1992
Token ID9768
Feature activation+1.39e-01

Pos logit contributions
ł+1.326
mes+1.231
cyl+1.216
peat+1.192
enment+1.185
Neg logit contributions
 FN-1.255
 polic-1.226
 reconciliation-1.225
rost-1.219
 corrid-1.201
Token and
Token ID290
Feature activation+1.24e-01

Pos logit contributions
ł+0.352
mes+0.324
cyl+0.320
peat+0.314
enment+0.312
Neg logit contributions
 FN-0.395
 polic-0.387
 reconciliation-0.386
rost-0.383
 operative-0.378
TokenAlso
Token ID7583
Feature activation-1.28e-01

Pos logit contributions
ł+0.218
mes+0.202
cyl+0.199
enment+0.196
peat+0.195
Neg logit contributions
 FN-0.203
 reconciliation-0.198
rost-0.198
 polic-0.198
 operative-0.193
Token,
Token ID11
Feature activation+1.42e-01

Pos logit contributions
 FN+0.303
 polic+0.295
 reconciliation+0.295
rost+0.291
 corrid+0.287
Neg logit contributions
ł-0.388
mes-0.361
cyl-0.358
enment-0.349
peat-0.349
Token the
Token ID262
Feature activation+6.43e-03

Pos logit contributions
ł+0.442
mes+0.413
cyl+0.409
peat+0.401
enment+0.399
Neg logit contributions
 FN-0.322
 polic-0.314
 reconciliation-0.313
rost-0.311
 corrid-0.306
Token way
Token ID835
Feature activation+5.78e-02

Pos logit contributions
ł+0.021
mes+0.020
cyl+0.019
peat+0.019
enment+0.019
Neg logit contributions
 FN-0.014
 polic-0.013
 reconciliation-0.013
rost-0.013
 corrid-0.013
Token "
Token ID366
Feature activation+1.77e-02

Pos logit contributions
ł+0.177
mes+0.165
cyl+0.164
enment+0.160
peat+0.160
Neg logit contributions
 FN-0.135
 polic-0.132
 reconciliation-0.131
rost-0.130
 corrid-0.128
Tokenhom
Token ID26452
Feature activation+7.41e-01

Pos logit contributions
ł+0.048
mes+0.044
cyl+0.044
enment+0.043
peat+0.043
Neg logit contributions
 FN-0.047
 polic-0.046
 reconciliation-0.046
rost-0.046
 corrid-0.045
Tokenosexual
Token ID8542
Feature activation-2.78e-02

Pos logit contributions
ł+2.016
mes+1.874
cyl+1.842
peat+1.806
enment+1.794
Neg logit contributions
 FN-1.968
 polic-1.930
 reconciliation-1.925
rost-1.915
 corrid-1.897
Tokens
Token ID82
Feature activation+2.20e-01

Pos logit contributions
 FN+0.064
 polic+0.063
 reconciliation+0.063
rost+0.062
 therapy+0.062
Neg logit contributions
ł-0.085
cyl-0.079
mes-0.079
peat-0.077
enment-0.077
Token"
Token ID1
Feature activation+3.09e-01

Pos logit contributions
ł+0.610
mes+0.567
cyl+0.559
peat+0.548
enment+0.544
Neg logit contributions
 FN-0.571
 polic-0.561
 reconciliation-0.559
rost-0.555
 corrid-0.549
Token are
Token ID389
Feature activation+3.90e-01

Pos logit contributions
ł+0.921
mes+0.861
cyl+0.845
peat+0.831
enment+0.826
Neg logit contributions
 FN-0.742
 polic-0.723
 reconciliation-0.723
rost-0.721
 corrid-0.714
Token described
Token ID3417
Feature activation+3.10e-01

Pos logit contributions
ł+1.272
mes+1.199
cyl+1.179
peat+1.162
emo+1.153
Neg logit contributions
 FN-0.819
 reconciliation-0.793
rost-0.792
 polic-0.789
 corrid-0.783
Token.
Token ID13
Feature activation+9.38e-02

Pos logit contributions
 FN+0.432
rost+0.423
 reconciliation+0.419
 polic+0.417
 Roche+0.411
Neg logit contributions
ł-0.369
mes-0.338
cyl-0.338
peat-0.327
emo-0.325
Tokenp
Token ID79
Feature activation-6.97e-03

Pos logit contributions
ł+0.246
mes+0.225
cyl+0.220
enment+0.219
emo+0.217
Neg logit contributions
rost-0.253
 FN-0.250
 reconciliation-0.248
 polic-0.246
 therapy-0.244
Tokenone
Token ID505
Feature activation-9.40e-02

Pos logit contributions
rost+0.015
 FN+0.015
 reconciliation+0.014
 polic+0.014
 therapy+0.014
Neg logit contributions
ł-0.022
mes-0.021
cyl-0.021
peat-0.020
enment-0.020
Token.
Token ID13
Feature activation+1.60e-01

Pos logit contributions
 FN+0.269
rost+0.264
 polic+0.260
 reconciliation+0.259
 Roche+0.255
Neg logit contributions
ł-0.235
mes-0.214
cyl-0.213
enment-0.209
peat-0.207
Token00
Token ID405
Feature activation+8.04e-01

Pos logit contributions
ł+0.385
mes+0.350
cyl+0.345
peat+0.338
enment+0.337
Neg logit contributions
 FN-0.475
rost-0.467
 reconciliation-0.463
 polic-0.463
 operative-0.455
Token96
Token ID4846
Feature activation+8.70e-01

Pos logit contributions
ł+2.207
mes+2.050
cyl+2.017
peat+1.967
enment+1.961
Neg logit contributions
 FN-2.108
 polic-2.091
 reconciliation-2.081
 corrid-2.076
rost-2.050
Token275
Token ID23195
Feature activation+5.16e-01

Pos logit contributions
ł+2.413
mes+2.247
cyl+2.205
peat+2.157
enment+2.147
Neg logit contributions
 FN-2.253
 polic-2.230
 reconciliation-2.226
 corrid-2.217
rost-2.193
Token.
Token ID13
Feature activation-2.74e-02

Pos logit contributions
ł+1.131
mes+1.040
cyl+1.014
enment+0.990
emo+0.974
Neg logit contributions
 corrid-1.625
 FN-1.623
 polic-1.610
 reconciliation-1.607
rost-1.597
Tokeng
Token ID70
Feature activation+7.05e-02

Pos logit contributions
 corrid+0.087
 FN+0.087
 polic+0.086
ailability+0.085
 reconciliation+0.084
Neg logit contributions
ł-0.061
mes-0.057
cyl-0.054
emo-0.053
peat-0.053
Token001
Token ID8298
Feature activation+7.01e-01

Pos logit contributions
ł+0.205
mes+0.196
cyl+0.192
peat+0.191
enment+0.185
Neg logit contributions
 corrid-0.172
 FN-0.170
 polic-0.168
 reconciliation-0.167
rost-0.165
Token Pre
Token ID3771
Feature activation+9.08e-02

Pos logit contributions
ł+2.117
mes+1.978
cyl+1.953
peat+1.919
enment+1.903
Neg logit contributions
 FN-1.658
 polic-1.615
 reconciliation-1.612
rost-1.602
 corrid-1.579
Token more
Token ID517
Feature activation+3.98e-02

Pos logit contributions
 FN+0.889
 polic+0.879
rost+0.873
 reconciliation+0.872
 operative+0.844
Neg logit contributions
ł-1.167
mes-1.096
cyl-1.086
enment-1.069
peat-1.066
Token time
Token ID640
Feature activation-1.02e-01

Pos logit contributions
ł+0.124
mes+0.116
cyl+0.115
peat+0.112
enment+0.112
Neg logit contributions
 FN-0.090
 polic-0.089
 reconciliation-0.088
rost-0.087
 corrid-0.086
Token,
Token ID11
Feature activation-3.14e-01

Pos logit contributions
 FN+0.256
 polic+0.252
rost+0.250
 reconciliation+0.250
 operative+0.245
Neg logit contributions
ł-0.292
mes-0.272
cyl-0.271
peat-0.265
emo-0.263
Token I
Token ID314
Feature activation-2.87e-01

Pos logit contributions
 FN+0.694
 reconciliation+0.683
 polic+0.677
rost+0.673
 therapy+0.660
Neg logit contributions
ł-0.989
cyl-0.925
mes-0.925
peat-0.907
emo-0.905
Token would
Token ID561
Feature activation-1.83e-01

Pos logit contributions
 FN+0.713
 polic+0.704
 reconciliation+0.702
rost+0.699
 corrid+0.682
Neg logit contributions
ł-0.814
mes-0.764
cyl-0.757
peat-0.745
emo-0.742
Token
Token ID447
Feature activation+2.53e-02

Pos logit contributions
 FN+0.464
 polic+0.457
 reconciliation+0.457
rost+0.455
 corrid+0.442
Neg logit contributions
ł-0.516
mes-0.480
cyl-0.477
enment-0.468
peat-0.467
Token
Token ID247
Feature activation-1.54e-01

Pos logit contributions
ł+0.075
mes+0.070
cyl+0.069
peat+0.068
emo+0.067
Neg logit contributions
 FN-0.061
 polic-0.060
 corrid-0.060
 reconciliation-0.060
rost-0.059
Tokenve
Token ID303
Feature activation+9.71e-02

Pos logit contributions
 FN+0.395
rost+0.387
 reconciliation+0.384
 polic+0.384
 operative+0.374
Neg logit contributions
ł-0.429
mes-0.393
cyl-0.392
enment-0.383
emo-0.382
Token made
Token ID925
Feature activation+8.15e-02

Pos logit contributions
ł+0.295
mes+0.276
cyl+0.272
peat+0.267
enment+0.266
Neg logit contributions
 FN-0.227
 polic-0.222
 reconciliation-0.221
rost-0.220
 corrid-0.217
Token it
Token ID340
Feature activation+8.10e-02

Pos logit contributions
ł+0.268
mes+0.251
cyl+0.249
peat+0.244
enment+0.244
Neg logit contributions
 FN-0.171
 reconciliation-0.167
 polic-0.167
rost-0.165
 corrid-0.163
Token a
Token ID257
Feature activation-5.48e-02

Pos logit contributions
ł+0.243
mes+0.227
cyl+0.225
enment+0.221
peat+0.221
Neg logit contributions
 FN-0.192
 reconciliation-0.189
 polic-0.188
rost-0.187
 corrid-0.183
Token turtles
Token ID36288
Feature activation+5.84e-02

Pos logit contributions
 FN+0.381
 reconciliation+0.371
 polic+0.371
rost+0.370
 operative+0.359
Neg logit contributions
ł-0.720
mes-0.680
cyl-0.672
enment-0.667
peat-0.662
Token."
Token ID526
Feature activation-2.07e-01

Pos logit contributions
ł+0.163
mes+0.150
cyl+0.148
enment+0.146
peat+0.145
Neg logit contributions
 FN-0.152
 polic-0.150
 reconciliation-0.149
rost-0.148
 operative-0.146
Token
Token ID198
Feature activation+2.05e-01

Pos logit contributions
 FN+0.446
 polic+0.430
rost+0.430
 reconciliation+0.430
 Roche+0.418
Neg logit contributions
ł-0.668
mes-0.625
cyl-0.619
peat-0.608
enment-0.603
Token
Token ID198
Feature activation+2.37e-01

Pos logit contributions
ł+0.620
mes+0.581
cyl+0.570
peat+0.561
emo+0.555
Neg logit contributions
 corrid-0.489
 FN-0.480
 polic-0.476
 reconciliation-0.473
ailability-0.468
TokenHow
Token ID2437
Feature activation-2.39e-01

Pos logit contributions
ł+0.674
mes+0.626
cyl+0.619
enment+0.607
peat+0.606
Neg logit contributions
 FN-0.598
rost-0.585
 reconciliation-0.583
 polic-0.581
 operative-0.567
Token much
Token ID881
Feature activation+1.57e-01

Pos logit contributions
 FN+0.534
 polic+0.525
 reconciliation+0.524
rost+0.520
 operative+0.507
Neg logit contributions
ł-0.752
mes-0.703
cyl-0.698
enment-0.690
peat-0.682
Token debris
Token ID16468
Feature activation+5.51e-03

Pos logit contributions
ł+0.471
mes+0.439
cyl+0.432
enment+0.426
peat+0.425
Neg logit contributions
 FN-0.375
 polic-0.369
 reconciliation-0.368
rost-0.365
 operative-0.357
Token does
Token ID857
Feature activation-3.55e-01

Pos logit contributions
ł+0.016
mes+0.015
cyl+0.015
peat+0.014
enment+0.014
Neg logit contributions
 FN-0.014
 polic-0.014
 reconciliation-0.013
rost-0.013
 therapy-0.013
Token it
Token ID340
Feature activation+1.70e-01

Pos logit contributions
 FN+0.804
 polic+0.787
 reconciliation+0.787
rost+0.782
 therapy+0.762
Neg logit contributions
ł-1.113
mes-1.036
cyl-1.025
enment-1.011
peat-1.010
Token take
Token ID1011
Feature activation+7.49e-01

Pos logit contributions
ł+0.478
mes+0.444
cyl+0.439
peat+0.431
enment+0.429
Neg logit contributions
 FN-0.433
 polic-0.425
 reconciliation-0.424
rost-0.422
 corrid-0.415
Token to
Token ID284
Feature activation+3.06e-01

Pos logit contributions
ł+1.838
mes+1.693
cyl+1.662
peat+1.625
enment+1.614
Neg logit contributions
 FN-2.190
 polic-2.151
 reconciliation-2.147
rost-2.138
 corrid-2.114
Token help
Token ID1037
Feature activation+2.95e-01

Pos logit contributions
 FN+0.050
 reconciliation+0.049
 polic+0.048
rost+0.048
 operative+0.047
Neg logit contributions
ł-0.062
mes-0.058
cyl-0.058
peat-0.057
enment-0.056
Token the
Token ID262
Feature activation-1.53e-01

Pos logit contributions
ł+0.999
mes+0.939
cyl+0.931
peat+0.917
enment+0.912
Neg logit contributions
 FN-0.590
 reconciliation-0.576
 polic-0.574
rost-0.567
 corrid-0.557
Token masses
Token ID14568
Feature activation-1.37e-01

Pos logit contributions
 FN+0.331
 reconciliation+0.326
 polic+0.322
rost+0.318
 operative+0.313
Neg logit contributions
ł-0.496
mes-0.463
cyl-0.461
peat-0.451
emo-0.449
Token.
Token ID13
Feature activation+3.72e-02

Pos logit contributions
 FN+0.337
 reconciliation+0.335
 polic+0.332
rost+0.329
 operative+0.324
Neg logit contributions
ł-0.397
mes-0.369
cyl-0.367
peat-0.360
emo-0.358
Token Based
Token ID13403
Feature activation-7.94e-01

Pos logit contributions
ł+0.115
mes+0.108
cyl+0.107
peat+0.106
emo+0.104
Neg logit contributions
 FN-0.086
 reconciliation-0.083
 polic-0.082
rost-0.082
 Roche-0.080
Token on
Token ID319
Feature activation+2.44e-01

Pos logit contributions
rost+1.531
 FN+1.523
 reconciliation+1.522
 therapy+1.491
 polic+1.484
Neg logit contributions
ł-2.639
mes-2.513
cyl-2.493
peat-2.463
emo-2.450
Token this
Token ID428
Feature activation+4.11e-02

Pos logit contributions
ł+0.765
mes+0.715
cyl+0.710
peat+0.699
enment+0.692
Neg logit contributions
 FN-0.552
 reconciliation-0.541
 polic-0.538
rost-0.534
 operative-0.524
Token logic
Token ID9156
Feature activation+3.16e-01

Pos logit contributions
ł+0.126
mes+0.118
cyl+0.117
peat+0.115
emo+0.114
Neg logit contributions
 FN-0.095
 reconciliation-0.094
 polic-0.093
rost-0.092
 operative-0.091
Token and
Token ID290
Feature activation+1.18e-01

Pos logit contributions
ł+0.950
mes+0.890
cyl+0.884
peat+0.867
emo+0.861
Neg logit contributions
 FN-0.751
 reconciliation-0.736
 polic-0.734
rost-0.724
 operative-0.715
Token my
Token ID616
Feature activation-6.49e-02

Pos logit contributions
ł+0.369
mes+0.345
cyl+0.341
peat+0.335
emo+0.333
Neg logit contributions
 FN-0.266
 reconciliation-0.261
 polic-0.260
rost-0.257
 operative-0.253
Token energy
Token ID2568
Feature activation+3.53e-02

Pos logit contributions
 FN+0.136
 reconciliation+0.133
 polic+0.132
rost+0.131
 operative+0.129
Neg logit contributions
ł-0.214
mes-0.200
cyl-0.199
peat-0.196
emo-0.194
Token looks
Token ID3073
Feature activation-1.37e-01

Pos logit contributions
ł+0.066
mes+0.063
cyl+0.062
peat+0.062
enment+0.061
Neg logit contributions
 FN-0.049
 polic-0.048
 reconciliation-0.048
rost-0.047
 operative-0.047
Token.
Token ID13
Feature activation+4.12e-02

Pos logit contributions
 FN+0.337
 polic+0.335
rost+0.329
 reconciliation+0.329
 operative+0.324
Neg logit contributions
ł-0.395
mes-0.370
cyl-0.366
peat-0.363
emo-0.358
Token
Token ID447
Feature activation+1.60e-02

Pos logit contributions
ł+0.124
mes+0.117
cyl+0.116
peat+0.115
emo+0.113
Neg logit contributions
 FN-0.097
 polic-0.093
 reconciliation-0.093
rost-0.092
 Roche-0.091
Token
Token ID251
Feature activation-6.40e-02

Pos logit contributions
ł+0.062
mes+0.059
cyl+0.058
peat+0.058
emo+0.057
Neg logit contributions
 FN-0.024
 reconciliation-0.023
 polic-0.023
rost-0.023
 corrid-0.022
Token
Token ID198
Feature activation+6.39e-02

Pos logit contributions
 FN+0.149
 reconciliation+0.143
 polic+0.143
rost+0.142
 Roche+0.140
Neg logit contributions
ł-0.194
mes-0.182
cyl-0.182
peat-0.180
emo-0.177
Token
Token ID198
Feature activation+1.43e-01

Pos logit contributions
ł+0.198
mes+0.186
cyl+0.183
peat+0.180
enment+0.178
Neg logit contributions
 FN-0.146
 polic-0.143
 reconciliation-0.142
 corrid-0.142
rost-0.141
TokenIt
Token ID1026
Feature activation+1.85e-01

Pos logit contributions
ł+0.396
mes+0.368
cyl+0.364
peat+0.359
enment+0.358
Neg logit contributions
 FN-0.369
 reconciliation-0.361
rost-0.360
 polic-0.358
 operative-0.350
Token
Token ID447
Feature activation-1.03e-02

Pos logit contributions
ł+0.523
mes+0.489
cyl+0.485
peat+0.478
enment+0.474
Neg logit contributions
 FN-0.466
 polic-0.457
 reconciliation-0.456
rost-0.455
 corrid-0.447
Token
Token ID247
Feature activation-1.53e-02

Pos logit contributions
 FN+0.024
 polic+0.024
 reconciliation+0.024
rost+0.024
 corrid+0.024
Neg logit contributions
ł-0.031
mes-0.029
cyl-0.029
peat-0.028
enment-0.028
Tokens
Token ID82
Feature activation-1.99e-01

Pos logit contributions
 FN+0.045
rost+0.044
 polic+0.044
 reconciliation+0.043
 operative+0.043
Neg logit contributions
ł-0.037
cyl-0.034
mes-0.033
peat-0.033
emo-0.033
Token that
Token ID326
Feature activation+3.66e-01

Pos logit contributions
 FN+0.434
 polic+0.426
 reconciliation+0.426
rost+0.422
 operative+0.416
Neg logit contributions
ł-0.635
mes-0.596
cyl-0.592
peat-0.583
enment-0.583
Token in
Token ID287
Feature activation+7.51e-02

Pos logit contributions
ł+0.415
mes+0.386
cyl+0.380
peat+0.374
enment+0.372
Neg logit contributions
 FN-0.403
 polic-0.396
 reconciliation-0.394
rost-0.392
 operative-0.387
Token the
Token ID262
Feature activation+2.22e-01

Pos logit contributions
ł+0.241
mes+0.227
cyl+0.224
peat+0.221
enment+0.219
Neg logit contributions
 FN-0.163
 reconciliation-0.158
 polic-0.158
rost-0.157
 corrid-0.155
Token sax
Token ID46909
Feature activation-2.82e-02

Pos logit contributions
ł+0.687
mes+0.644
cyl+0.634
peat+0.623
enment+0.620
Neg logit contributions
 FN-0.505
 polic-0.493
 reconciliation-0.492
rost-0.489
 corrid-0.483
Tokenophon
Token ID48982
Feature activation+2.97e-01

Pos logit contributions
 FN+0.064
 reconciliation+0.063
 polic+0.063
rost+0.063
 corrid+0.062
Neg logit contributions
ł-0.086
cyl-0.081
mes-0.081
peat-0.080
enment-0.079
Tokenist
Token ID396
Feature activation+1.42e-01

Pos logit contributions
ł+1.002
mes+0.944
cyl+0.939
peat+0.926
enment+0.916
Neg logit contributions
 FN-0.589
 reconciliation-0.577
 polic-0.576
rost-0.573
 operative-0.564
Token's
Token ID338
Feature activation+7.23e-02

Pos logit contributions
ł+0.377
mes+0.353
cyl+0.349
peat+0.343
emo+0.338
Neg logit contributions
 FN-0.381
 reconciliation-0.376
 polic-0.373
rost-0.372
 corrid-0.368
Token elabor
Token ID12628
Feature activation+2.86e-01

Pos logit contributions
ł+0.234
mes+0.220
cyl+0.218
peat+0.215
enment+0.213
Neg logit contributions
 FN-0.155
 reconciliation-0.152
 polic-0.152
rost-0.150
 corrid-0.148
Tokenations
Token ID602
Feature activation-3.09e-02

Pos logit contributions
ł+1.050
mes+0.994
cyl+0.983
peat+0.969
enment+0.964
Neg logit contributions
 FN-0.489
 polic-0.474
 reconciliation-0.473
rost-0.469
 corrid-0.460
Token of
Token ID286
Feature activation+2.12e-01

Pos logit contributions
 FN+0.080
 polic+0.079
 reconciliation+0.078
rost+0.078
 operative+0.077
Neg logit contributions
ł-0.086
mes-0.080
cyl-0.079
enment-0.078
peat-0.077
Token a
Token ID257
Feature activation+9.64e-02

Pos logit contributions
ł+0.658
mes+0.617
cyl+0.608
peat+0.597
enment+0.594
Neg logit contributions
 FN-0.480
 polic-0.469
 reconciliation-0.467
rost-0.465
 corrid-0.459
Token more
Token ID517
Feature activation+9.77e-02

Pos logit contributions
ł+0.304
mes+0.285
cyl+0.282
peat+0.277
enment+0.275
Neg logit contributions
 FN-0.215
 reconciliation-0.210
 polic-0.210
rost-0.208
 corrid-0.205
Tokenurt
Token ID3325
Feature activation+3.56e-02

Pos logit contributions
rost+0.088
 FN+0.087
 reconciliation+0.086
 polic+0.086
 operative+0.085
Neg logit contributions
ł-0.051
mes-0.046
cyl-0.046
peat-0.044
enment-0.044
Token said
Token ID531
Feature activation+8.30e-03

Pos logit contributions
ł+0.100
mes+0.093
cyl+0.092
peat+0.090
enment+0.090
Neg logit contributions
 FN-0.091
 polic-0.089
 reconciliation-0.089
rost-0.089
 corrid-0.087
Token that
Token ID326
Feature activation+8.91e-02

Pos logit contributions
ł+0.025
mes+0.023
cyl+0.023
peat+0.022
enment+0.022
Neg logit contributions
 FN-0.020
 reconciliation-0.020
 polic-0.019
rost-0.019
 therapy-0.019
Token he
Token ID339
Feature activation-4.57e-02

Pos logit contributions
ł+0.279
mes+0.260
cyl+0.258
peat+0.254
enment+0.252
Neg logit contributions
 FN-0.201
 reconciliation-0.198
 polic-0.196
rost-0.195
 corrid-0.192
Token never
Token ID1239
Feature activation-1.57e-01

Pos logit contributions
 FN+0.106
 polic+0.104
 reconciliation+0.103
rost+0.102
 corrid+0.100
Neg logit contributions
ł-0.140
mes-0.131
cyl-0.129
peat-0.127
enment-0.127
Token liked
Token ID8288
Feature activation+1.99e-02

Pos logit contributions
 FN+0.361
 polic+0.356
 reconciliation+0.353
rost+0.351
 operative+0.344
Neg logit contributions
ł-0.480
mes-0.449
cyl-0.445
peat-0.436
enment-0.435
Token literal
Token ID18875
Feature activation+9.59e-02

Pos logit contributions
ł+0.064
mes+0.059
cyl+0.059
peat+0.058
enment+0.058
Neg logit contributions
 FN-0.044
 reconciliation-0.043
 polic-0.043
rost-0.042
 corrid-0.042
Token things
Token ID1243
Feature activation-4.73e-02

Pos logit contributions
ł+0.309
mes+0.290
cyl+0.287
peat+0.282
enment+0.280
Neg logit contributions
 FN-0.206
 polic-0.202
 reconciliation-0.201
rost-0.200
 corrid-0.197
Token.
Token ID13
Feature activation+3.53e-02

Pos logit contributions
 FN+0.112
 polic+0.110
 reconciliation+0.110
rost+0.108
 operative+0.107
Neg logit contributions
ł-0.142
mes-0.133
cyl-0.131
peat-0.129
enment-0.129
Token He
Token ID679
Feature activation+3.23e-02

Pos logit contributions
ł+0.110
mes+0.103
cyl+0.102
peat+0.100
emo+0.099
Neg logit contributions
 FN-0.080
 reconciliation-0.078
 polic-0.078
rost-0.077
 Roche-0.076
Token liked
Token ID8288
Feature activation-2.05e-01

Pos logit contributions
ł+0.106
mes+0.099
cyl+0.098
peat+0.096
enment+0.096
Neg logit contributions
 FN-0.068
 polic-0.066
 reconciliation-0.066
rost-0.066
 corrid-0.065
Token that
Token ID326
Feature activation-1.00e-01

Pos logit contributions
 FN+1.195
 reconciliation+1.176
 polic+1.174
rost+1.152
 operative+1.140
Neg logit contributions
ł-1.408
mes-1.342
cyl-1.329
enment-1.308
peat-1.305
Token the
Token ID262
Feature activation-1.36e-01

Pos logit contributions
 FN+0.221
 polic+0.217
 reconciliation+0.216
rost+0.212
 operative+0.210
Neg logit contributions
ł-0.318
mes-0.301
cyl-0.298
peat-0.294
enment-0.292
Token investigator
Token ID20725
Feature activation-2.05e-01

Pos logit contributions
 FN+0.284
 polic+0.278
 reconciliation+0.275
 operative+0.270
rost+0.267
Neg logit contributions
ł-0.449
mes-0.424
cyl-0.422
peat-0.418
enment-0.415
Token was
Token ID373
Feature activation-2.76e-01

Pos logit contributions
 FN+0.506
 polic+0.497
 reconciliation+0.495
rost+0.488
 operative+0.484
Neg logit contributions
ł-0.597
mes-0.561
cyl-0.556
peat-0.546
emo-0.542
Token detained
Token ID14847
Feature activation-1.07e-02

Pos logit contributions
 FN+0.635
 polic+0.623
 reconciliation+0.612
rost+0.604
 operative+0.604
Neg logit contributions
ł-0.849
mes-0.807
cyl-0.797
peat-0.788
enment-0.783
Token in
Token ID287
Feature activation-7.13e-02

Pos logit contributions
 FN+0.025
 polic+0.024
 reconciliation+0.024
rost+0.024
 corrid+0.024
Neg logit contributions
ł-0.033
mes-0.031
cyl-0.030
peat-0.030
enment-0.030
Token an
Token ID281
Feature activation-1.27e-01

Pos logit contributions
 FN+0.158
 polic+0.155
 reconciliation+0.155
rost+0.151
 operative+0.150
Neg logit contributions
ł-0.226
mes-0.213
cyl-0.213
peat-0.210
enment-0.209
Token upscale
Token ID44918
Feature activation+2.13e-01

Pos logit contributions
 FN+0.263
 polic+0.257
 reconciliation+0.253
 operative+0.250
rost+0.249
Neg logit contributions
ł-0.423
mes-0.401
cyl-0.397
peat-0.394
enment-0.391
Token district
Token ID4783
Feature activation+2.13e-01

Pos logit contributions
ł+0.643
mes+0.602
cyl+0.595
peat+0.587
enment+0.583
Neg logit contributions
 FN-0.508
 polic-0.500
 reconciliation-0.495
rost-0.488
 corrid-0.482
Token of
Token ID286
Feature activation+2.07e-01

Pos logit contributions
ł+0.574
mes+0.533
cyl+0.527
peat+0.518
enment+0.516
Neg logit contributions
 FN-0.573
 polic-0.566
 reconciliation-0.560
rost-0.556
 operative-0.549
Token Sk
Token ID3661
Feature activation+1.78e-02

Pos logit contributions
ł+0.669
mes+0.629
cyl+0.626
peat+0.616
enment+0.613
Neg logit contributions
 FN-0.445
 polic-0.436
 reconciliation-0.432
rost-0.426
 operative-0.419
Token of
Token ID286
Feature activation+2.57e-01

Pos logit contributions
ł+1.312
mes+1.211
cyl+1.198
peat+1.175
emo+1.164
Neg logit contributions
 FN-1.415
 reconciliation-1.393
 polic-1.386
rost-1.369
 therapy-1.353
Token proposed
Token ID5150
Feature activation+1.70e-02

Pos logit contributions
ł+0.821
mes+0.772
cyl+0.764
peat+0.749
enment+0.747
Neg logit contributions
 FN-0.560
 reconciliation-0.547
 polic-0.546
rost-0.540
 operative-0.531
Token rule
Token ID3896
Feature activation+5.22e-02

Pos logit contributions
ł+0.089
mes+0.086
cyl+0.085
peat+0.084
enment+0.084
Neg logit contributions
 FN-0.003
 reconciliation-0.002
 polic-0.002
rost-0.001
 operative-0.001
Tokenmaking
Token ID8601
Feature activation+2.13e-01

Pos logit contributions
ł+0.134
mes+0.124
cyl+0.122
peat+0.120
enment+0.120
Neg logit contributions
 FN-0.148
 reconciliation-0.145
 polic-0.144
rost-0.143
 rule-0.140
Token would
Token ID561
Feature activation-1.43e-01

Pos logit contributions
ł+0.599
mes+0.556
cyl+0.552
peat+0.541
enment+0.538
Neg logit contributions
 FN-0.551
 reconciliation-0.541
 polic-0.539
rost-0.532
 corrid-0.527
Token do
Token ID466
Feature activation-3.72e-01

Pos logit contributions
 FN+0.349
 polic+0.344
 reconciliation+0.343
rost+0.339
 broker+0.336
Neg logit contributions
ł-0.419
cyl-0.390
mes-0.387
peat-0.386
enment-0.384
Token away
Token ID1497
Feature activation-5.11e-01

Pos logit contributions
 FN+0.774
 reconciliation+0.760
rost+0.746
 polic+0.743
 operative+0.727
Neg logit contributions
ł-1.205
mes-1.154
enment-1.145
peat-1.144
cyl-1.140
Token with
Token ID351
Feature activation+1.52e-01

Pos logit contributions
 FN+1.099
rost+1.073
 reconciliation+1.067
 polic+1.063
 operative+1.035
Neg logit contributions
ł-1.637
mes-1.549
cyl-1.527
enment-1.505
peat-1.504
Token the
Token ID262
Feature activation+2.03e-01

Pos logit contributions
ł+0.484
mes+0.453
cyl+0.449
peat+0.441
enment+0.439
Neg logit contributions
 FN-0.339
 reconciliation-0.331
 polic-0.330
rost-0.327
 operative-0.321
Token 20
Token ID1160
Feature activation+1.28e-01

Pos logit contributions
ł+0.663
mes+0.620
cyl+0.615
peat+0.606
enment+0.602
Neg logit contributions
 FN-0.434
 reconciliation-0.424
 polic-0.422
rost-0.419
 operative-0.411
Token-
Token ID12
Feature activation+2.07e-01

Pos logit contributions
ł+0.355
mes+0.330
cyl+0.324
peat+0.319
enment+0.318
Neg logit contributions
 FN-0.333
 reconciliation-0.326
 polic-0.326
rost-0.325
 operative-0.322
Token!
Token ID0
Feature activation+2.67e-02

Pos logit contributions
ł+1.850
mes+1.721
cyl+1.693
peat+1.658
enment+1.649
Neg logit contributions
 FN-1.699
 polic-1.666
 reconciliation-1.663
rost-1.651
 corrid-1.638
Token Here
Token ID3423
Feature activation+2.30e-01

Pos logit contributions
ł+0.078
mes+0.073
cyl+0.072
peat+0.071
emo+0.070
Neg logit contributions
 FN-0.066
 polic-0.064
 reconciliation-0.064
rost-0.064
 operative-0.062
Token are
Token ID389
Feature activation-2.20e-01

Pos logit contributions
ł+0.659
mes+0.616
cyl+0.610
peat+0.600
enment+0.597
Neg logit contributions
 FN-0.576
 reconciliation-0.562
 polic-0.561
rost-0.560
 corrid-0.549
Token some
Token ID617
Feature activation-1.30e-01

Pos logit contributions
 FN+0.445
 polic+0.433
 reconciliation+0.429
rost+0.427
 operative+0.419
Neg logit contributions
ł-0.734
mes-0.695
cyl-0.690
peat-0.681
enment-0.680
Token amazing
Token ID4998
Feature activation+2.33e-01

Pos logit contributions
 FN+0.338
 polic+0.330
 reconciliation+0.327
rost+0.325
 operative+0.322
Neg logit contributions
ł-0.364
mes-0.342
cyl-0.337
enment-0.331
peat-0.331
Token real
Token ID1103
Feature activation-1.16e-01

Pos logit contributions
ł+0.672
mes+0.627
cyl+0.621
peat+0.610
enment+0.608
Neg logit contributions
 FN-0.582
 polic-0.568
 reconciliation-0.564
rost-0.561
 corrid-0.555
Token life
Token ID1204
Feature activation+2.06e-01

Pos logit contributions
 FN+0.206
 polic+0.199
 reconciliation+0.195
rost+0.195
 corrid+0.193
Neg logit contributions
ł-0.418
cyl-0.394
mes-0.394
peat-0.388
enment-0.388
Token photographs
Token ID12566
Feature activation-3.93e-01

Pos logit contributions
ł+0.500
mes+0.462
cyl+0.457
peat+0.447
enment+0.444
Neg logit contributions
 FN-0.612
 polic-0.600
 reconciliation-0.595
rost-0.592
 corrid-0.587
Token of
Token ID286
Feature activation-1.16e-01

Pos logit contributions
 FN+1.054
 polic+1.043
rost+1.024
 reconciliation+1.022
 operative+1.011
Neg logit contributions
ł-1.040
mes-0.974
cyl-0.967
enment-0.955
peat-0.948
Token a
Token ID257
Feature activation-1.96e-01

Pos logit contributions
 FN+0.267
 polic+0.262
 reconciliation+0.259
rost+0.258
 corrid+0.255
Neg logit contributions
ł-0.356
mes-0.333
cyl-0.329
peat-0.325
emo-0.324
Token 5
Token ID642
Feature activation-1.01e-01

Pos logit contributions
 FN+0.467
 polic+0.459
 reconciliation+0.455
rost+0.452
 corrid+0.444
Neg logit contributions
ł-0.588
mes-0.552
cyl-0.539
peat-0.535
emo-0.533
Token7
Token ID22
Feature activation+4.94e-01

Pos logit contributions
ł+1.854
mes+1.722
cyl+1.693
peat+1.659
enment+1.649
Neg logit contributions
 FN-1.839
 polic-1.804
 reconciliation-1.799
rost-1.791
 corrid-1.770
Token {
Token ID1391
Feature activation-2.00e-01

Pos logit contributions
ł+1.524
mes+1.435
cyl+1.415
peat+1.393
enment+1.382
Neg logit contributions
 FN-1.132
 polic-1.105
 reconciliation-1.103
rost-1.093
 corrid-1.073
Token(
Token ID7
Feature activation+2.19e-01

Pos logit contributions
 FN+0.569
 reconciliation+0.559
rost+0.556
 polic+0.555
 operative+0.545
Neg logit contributions
ł-0.498
mes-0.461
cyl-0.450
enment-0.443
emo-0.440
Token2
Token ID17
Feature activation+2.32e-01

Pos logit contributions
ł+0.597
mes+0.553
cyl+0.546
peat+0.535
enment+0.533
Neg logit contributions
 FN-0.577
rost-0.563
 reconciliation-0.563
 polic-0.562
 corrid-0.549
Tokens
Token ID82
Feature activation-9.19e-02

Pos logit contributions
ł+0.550
mes+0.497
cyl+0.491
peat+0.483
enment+0.482
Neg logit contributions
 FN-0.696
 reconciliation-0.686
 polic-0.684
rost-0.681
 operative-0.669
Token)}
Token ID38165
Feature activation-7.62e-02

Pos logit contributions
 FN+0.311
rost+0.305
 reconciliation+0.305
 polic+0.304
 Roche+0.299
Neg logit contributions
ł-0.181
mes-0.163
cyl-0.160
peat-0.158
enment-0.157
Token 53
Token ID7192
Feature activation+1.62e-01

Pos logit contributions
 FN+0.174
 reconciliation+0.168
 polic+0.167
rost+0.165
 Roche+0.162
Neg logit contributions
ł-0.238
mes-0.224
cyl-0.223
peat-0.220
emo-0.217
Token.
Token ID13
Feature activation-1.55e-01

Pos logit contributions
ł+0.383
mes+0.352
cyl+0.346
peat+0.337
enment+0.335
Neg logit contributions
 FN-0.490
 polic-0.480
 reconciliation-0.480
rost-0.478
 corrid-0.471
Token e
Token ID304
Feature activation+7.50e-02

Pos logit contributions
 FN+0.317
 reconciliation+0.305
 polic+0.304
rost+0.301
 Roche+0.296
Neg logit contributions
ł-0.522
mes-0.492
cyl-0.485
peat-0.480
enment-0.478
Token6
Token ID21
Feature activation+1.99e-01

Pos logit contributions
ł+0.192
mes+0.178
cyl+0.175
enment+0.172
peat+0.170
Neg logit contributions
 FN-0.210
 polic-0.205
rost-0.205
 reconciliation-0.205
 Roche-0.200
Token {
Token ID1391
Feature activation-3.33e-01

Pos logit contributions
ł+0.613
mes+0.575
cyl+0.566
peat+0.559
enment+0.557
Neg logit contributions
 FN-0.461
 reconciliation-0.450
 polic-0.448
rost-0.444
 operative-0.435
Token a
Token ID257
Feature activation-1.13e-02

Pos logit contributions
 FN+0.455
 polic+0.444
 reconciliation+0.443
rost+0.442
 operative+0.436
Neg logit contributions
ł-0.615
mes-0.572
cyl-0.570
enment-0.561
peat-0.558
Tokenkick
Token ID24585
Feature activation-1.82e-01

Pos logit contributions
 FN+0.025
 reconciliation+0.025
 polic+0.025
rost+0.024
 operative+0.024
Neg logit contributions
ł-0.036
mes-0.033
cyl-0.033
enment-0.032
peat-0.032
Token out
Token ID503
Feature activation-1.14e-01

Pos logit contributions
 FN+0.418
 reconciliation+0.406
rost+0.405
 polic+0.404
 operative+0.399
Neg logit contributions
ł-0.558
mes-0.516
cyl-0.515
enment-0.507
peat-0.503
Token of
Token ID286
Feature activation-2.25e-01

Pos logit contributions
 FN+0.348
 reconciliation+0.342
 polic+0.341
rost+0.341
 corrid+0.337
Neg logit contributions
ł-0.264
mes-0.241
cyl-0.236
peat-0.231
enment-0.230
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
 FN+0.477
 reconciliation+0.465
 polic+0.461
rost+0.460
 operative+0.449
Neg logit contributions
ł-0.739
mes-0.688
cyl-0.684
peat-0.673
enment-0.670
Token surprise
Token ID5975
Feature activation-6.96e-01

Pos logit contributions
 FN+0.472
 reconciliation+0.464
 polic+0.463
 operative+0.453
rost+0.453
Neg logit contributions
ł-0.728
cyl-0.675
mes-0.673
peat-0.663
enment-0.662
Token burning
Token ID9482
Feature activation-3.60e-01

Pos logit contributions
 FN+1.621
 reconciliation+1.608
 polic+1.583
 therapy+1.562
 operative+1.550
Neg logit contributions
ł-2.120
mes-1.956
cyl-1.938
peat-1.902
emo-1.896
Token of
Token ID286
Feature activation+1.69e-01

Pos logit contributions
 FN+0.896
 reconciliation+0.878
 polic+0.869
 operative+0.858
 therapy+0.858
Neg logit contributions
ł-1.041
mes-0.962
cyl-0.957
enment-0.935
peat-0.928
Token Atlanta
Token ID9371
Feature activation-2.00e-01

Pos logit contributions
ł+0.517
mes+0.479
cyl+0.476
peat+0.466
enment+0.466
Neg logit contributions
 FN-0.394
 reconciliation-0.385
rost-0.382
 polic-0.381
 operative-0.373
Token in
Token ID287
Feature activation-1.66e-01

Pos logit contributions
 FN+0.497
 reconciliation+0.486
 polic+0.484
rost+0.479
 operative+0.475
Neg logit contributions
ł-0.582
mes-0.539
cyl-0.534
peat-0.527
enment-0.526
Token Gone
Token ID26893
Feature activation-1.28e-01

Pos logit contributions
 FN+0.388
 reconciliation+0.377
 polic+0.373
rost+0.372
 operative+0.368
Neg logit contributions
ł-0.508
mes-0.471
cyl-0.470
peat-0.460
enment-0.460
Token the
Token ID262
Feature activation+6.30e-02

Pos logit contributions
 FN+0.244
 polic+0.237
 reconciliation+0.236
rost+0.231
 corrid+0.230
Neg logit contributions
ł-0.341
mes-0.321
cyl-0.318
enment-0.311
peat-0.310
Token movie
Token ID3807
Feature activation+2.61e-01

Pos logit contributions
ł+0.209
mes+0.196
cyl+0.194
peat+0.190
enment+0.190
Neg logit contributions
 FN-0.131
 polic-0.127
 reconciliation-0.127
rost-0.126
 corrid-0.124
Token.
Token ID13
Feature activation-2.76e-02

Pos logit contributions
ł+0.717
mes+0.667
cyl+0.657
peat+0.644
enment+0.643
Neg logit contributions
 FN-0.687
 polic-0.674
 reconciliation-0.673
rost-0.667
 corrid-0.657
Token if
Token ID611
Feature activation+1.63e-02

Pos logit contributions
 FN+0.066
 polic+0.064
 reconciliation+0.064
rost+0.064
 corrid+0.063
Neg logit contributions
ł-0.082
mes-0.077
cyl-0.076
peat-0.075
enment-0.074
Token you
Token ID345
Feature activation-1.16e-01

Pos logit contributions
ł+0.048
mes+0.045
cyl+0.045
peat+0.044
enment+0.044
Neg logit contributions
 FN-0.040
 polic-0.039
 reconciliation-0.038
rost-0.038
 corrid-0.038
Token live
Token ID2107
Feature activation-7.63e-01

Pos logit contributions
 FN+0.265
 polic+0.259
 reconciliation+0.257
rost+0.257
 corrid+0.252
Neg logit contributions
ł-0.357
mes-0.335
cyl-0.331
peat-0.326
enment-0.325
Token in
Token ID287
Feature activation-4.04e-01

Pos logit contributions
 FN+1.672
 polic+1.637
 reconciliation+1.596
rost+1.594
 corrid+1.582
Neg logit contributions
ł-2.391
mes-2.268
cyl-2.242
peat-2.211
enment-2.211
Token Louisville
Token ID21525
Feature activation+6.46e-03

Pos logit contributions
 FN+0.952
 polic+0.918
 reconciliation+0.905
rost+0.898
 corrid+0.891
Neg logit contributions
ł-1.226
mes-1.154
cyl-1.146
peat-1.127
enment-1.120
Token be
Token ID307
Feature activation-5.37e-01

Pos logit contributions
ł+0.019
mes+0.018
cyl+0.018
peat+0.018
enment+0.018
Neg logit contributions
 FN-0.015
 polic-0.015
 reconciliation-0.015
rost-0.015
 corrid-0.015
Token safe
Token ID3338
Feature activation-4.74e-01

Pos logit contributions
 FN+1.080
 polic+1.061
rost+1.037
 reconciliation+1.022
 operative+1.015
Neg logit contributions
ł-1.792
mes-1.697
cyl-1.686
enment-1.670
peat-1.652
Token!
Token ID0
Feature activation-1.47e-01

Pos logit contributions
 FN+1.215
 polic+1.197
rost+1.172
 reconciliation+1.160
 operative+1.150
Neg logit contributions
ł-1.324
mes-1.252
cyl-1.230
peat-1.207
enment-1.204
Token a
Token ID257
Feature activation-1.26e-01

Pos logit contributions
ł+0.246
mes+0.228
cyl+0.228
peat+0.225
enment+0.224
Neg logit contributions
 FN-0.194
 reconciliation-0.193
rost-0.192
 polic-0.188
 operative-0.186
Token boil
Token ID20667
Feature activation-6.95e-01

Pos logit contributions
 reconciliation+0.222
 FN+0.220
rost+0.220
 polic+0.216
 operative+0.212
Neg logit contributions
ł-0.457
mes-0.428
cyl-0.426
emo-0.419
enment-0.419
Token.
Token ID13
Feature activation-6.42e-02

Pos logit contributions
 FN+1.745
 reconciliation+1.721
rost+1.720
 polic+1.696
 therapy+1.679
Neg logit contributions
ł-1.962
mes-1.818
cyl-1.791
enment-1.775
peat-1.753
Token Once
Token ID4874
Feature activation+8.99e-02

Pos logit contributions
 FN+0.164
rost+0.159
 reconciliation+0.158
 polic+0.157
 Roche+0.157
Neg logit contributions
ł-0.182
mes-0.170
cyl-0.169
peat-0.166
enment-0.164
Token its
Token ID663
Feature activation-4.31e-02

Pos logit contributions
ł+0.302
mes+0.287
cyl+0.283
peat+0.276
enment+0.276
Neg logit contributions
 FN-0.181
 reconciliation-0.178
rost-0.178
 polic-0.176
 therapy-0.171
Token boiling
Token ID24372
Feature activation-7.58e-01

Pos logit contributions
 FN+0.070
rost+0.070
 reconciliation+0.070
 polic+0.069
 therapy+0.067
Neg logit contributions
ł-0.160
mes-0.153
cyl-0.151
peat-0.149
enment-0.148
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
 FN+1.643
rost+1.621
 reconciliation+1.619
 foil+1.593
 polic+1.593
Neg logit contributions
ł-2.377
mes-2.229
cyl-2.224
enment-2.198
peat-2.172
Token reduce
Token ID4646
Feature activation-3.58e-01

Pos logit contributions
ł+0.732
mes+0.681
cyl+0.675
peat+0.663
enment+0.662
Neg logit contributions
 FN-0.627
 reconciliation-0.612
 polic-0.612
rost-0.611
 broker-0.597
Token the
Token ID262
Feature activation-4.62e-01

Pos logit contributions
 FN+0.781
rost+0.765
 reconciliation+0.762
 polic+0.753
 therapy+0.736
Neg logit contributions
ł-1.142
mes-1.080
cyl-1.059
enment-1.047
peat-1.042
Token heat
Token ID4894
Feature activation-5.67e-01

Pos logit contributions
 FN+0.949
rost+0.923
 reconciliation+0.917
 polic+0.908
 therapy+0.891
Neg logit contributions
ł-1.542
mes-1.444
cyl-1.423
peat-1.414
enment-1.409
Token and
Token ID290
Feature activation-3.05e-02

Pos logit contributions
 FN+1.506
rost+1.468
 polic+1.454
 reconciliation+1.453
 therapy+1.446
Neg logit contributions
ł-1.529
mes-1.434
peat-1.394
enment-1.389
cyl-1.379
Token and
Token ID290
Feature activation+5.06e-02

Pos logit contributions
ł+0.227
mes+0.211
cyl+0.206
peat+0.205
enment+0.204
Neg logit contributions
 FN-0.215
 polic-0.211
 reconciliation-0.209
rost-0.209
 operative-0.206
Token grid
Token ID10706
Feature activation+3.92e-01

Pos logit contributions
ł+0.147
mes+0.135
peat+0.133
enment+0.132
cyl+0.132
Neg logit contributions
 FN-0.128
 polic-0.125
rost-0.125
 reconciliation-0.124
 operative-0.121
Token cell
Token ID2685
Feature activation+7.12e-02

Pos logit contributions
ł+1.063
mes+0.985
cyl+0.964
peat+0.955
enment+0.948
Neg logit contributions
 FN-1.046
 polic-1.028
 reconciliation-1.024
rost-1.021
 operative-1.002
Token,
Token ID11
Feature activation-2.88e-02

Pos logit contributions
ł+0.202
mes+0.187
peat+0.183
cyl+0.183
enment+0.181
Neg logit contributions
 FN-0.182
 polic-0.178
 reconciliation-0.178
rost-0.177
 operative-0.173
Token as
Token ID355
Feature activation-6.53e-02

Pos logit contributions
 FN+0.064
 reconciliation+0.062
 polic+0.062
rost+0.061
 operative+0.060
Neg logit contributions
ł-0.092
mes-0.086
cyl-0.085
peat-0.084
enment-0.083
Token shown
Token ID3402
Feature activation-8.36e-01

Pos logit contributions
 FN+0.155
 reconciliation+0.150
 polic+0.150
rost+0.147
 operative+0.146
Neg logit contributions
ł-0.199
mes-0.184
cyl-0.182
peat-0.180
enment-0.180
Token in
Token ID287
Feature activation-2.02e-01

Pos logit contributions
 FN+1.796
 reconciliation+1.760
 polic+1.724
rost+1.724
 operative+1.709
Neg logit contributions
ł-2.675
mes-2.493
enment-2.476
cyl-2.457
peat-2.455
Token Figure
Token ID11291
Feature activation-1.02e+00

Pos logit contributions
 FN+0.377
 reconciliation+0.367
 polic+0.358
rost+0.357
 operative+0.351
Neg logit contributions
ł-0.716
mes-0.668
cyl-0.667
peat-0.663
enment-0.658
Token 3
Token ID513
Feature activation-4.99e-01

Pos logit contributions
 FN+2.227
 reconciliation+2.143
rost+2.117
 polic+2.085
 operative+2.084
Neg logit contributions
ł-3.253
peat-3.005
cyl-3.000
mes-2.993
enment-2.961
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
 FN+1.380
 reconciliation+1.347
 polic+1.326
rost+1.323
 operative+1.313
Neg logit contributions
ł-1.312
mes-1.202
cyl-1.175
peat-1.168
enment-1.164
Token
Token ID198
Feature activation-2.39e-01

Pos logit contributions
 FN+0.409
 reconciliation+0.390
rost+0.389
 polic+0.389
 Roche+0.382
Neg logit contributions
ł-0.537
mes-0.501
cyl-0.496
peat-0.494
enment-0.487
Token/
Token ID14
Feature activation+7.24e-02

Pos logit contributions
ł+1.103
mes+1.019
cyl+1.004
peat+0.983
emo+0.974
Neg logit contributions
 FN-1.210
 polic-1.185
 reconciliation-1.183
rost-1.180
 corrid-1.165
Tokenalbum
Token ID40916
Feature activation-5.16e-02

Pos logit contributions
ł+0.228
mes+0.213
cyl+0.211
peat+0.209
emo+0.206
Neg logit contributions
 FN-0.161
rost-0.156
 reconciliation-0.156
 polic-0.155
 corrid-0.152
Token/
Token ID14
Feature activation-3.51e-01

Pos logit contributions
 FN+0.144
rost+0.141
 reconciliation+0.141
 polic+0.140
 corrid+0.137
Neg logit contributions
ł-0.133
mes-0.122
cyl-0.122
peat-0.119
emo-0.118
Tokencry
Token ID20470
Feature activation+3.85e-02

Pos logit contributions
rost+0.908
 FN+0.905
 reconciliation+0.890
 polic+0.885
 operative+0.866
Neg logit contributions
ł-0.961
mes-0.870
cyl-0.861
enment-0.858
emo-0.846
Tokenstal
Token ID7757
Feature activation-3.65e-01

Pos logit contributions
ł+0.121
mes+0.111
cyl+0.111
peat+0.109
emo+0.109
Neg logit contributions
 FN-0.085
rost-0.084
 polic-0.083
 reconciliation-0.082
 operative-0.080
Token-
Token ID12
Feature activation-6.97e-01

Pos logit contributions
 FN+0.940
rost+0.929
 polic+0.918
 reconciliation+0.918
 broker+0.896
Neg logit contributions
ł-1.008
mes-0.925
cyl-0.919
enment-0.898
peat-0.889
Tokenemb
Token ID24419
Feature activation-3.49e-02

Pos logit contributions
rost+1.861
 FN+1.815
 reconciliation+1.773
 polic+1.766
isting+1.760
Neg logit contributions
ł-1.843
cyl-1.641
mes-1.631
enment-1.626
oleon-1.586
Tokenry
Token ID563
Feature activation-3.03e-02

Pos logit contributions
 FN+0.079
 polic+0.077
 reconciliation+0.077
rost+0.076
 corrid+0.076
Neg logit contributions
ł-0.108
mes-0.102
cyl-0.100
peat-0.099
enment-0.098
Tokeno
Token ID78
Feature activation-1.82e-01

Pos logit contributions
 FN+0.066
 polic+0.064
rost+0.064
 reconciliation+0.064
 operative+0.062
Neg logit contributions
ł-0.097
mes-0.091
cyl-0.090
peat-0.088
enment-0.088
Token<|endoftext|>
Token ID50256
Feature activation-8.52e-02

Pos logit contributions
 FN+0.463
rost+0.451
 reconciliation+0.450
 polic+0.449
 corrid+0.441
Neg logit contributions
ł-0.515
mes-0.479
cyl-0.475
peat-0.466
enment-0.461
TokenOver
Token ID5886
Feature activation-5.34e-01

Pos logit contributions
 FN+0.218
rost+0.214
 reconciliation+0.213
 polic+0.211
 operative+0.207
Neg logit contributions
ł-0.237
mes-0.218
cyl-0.216
enment-0.215
peat-0.213