TOP ACTIVATIONS
MAX = 3.551

well as more relaxed AMD Catalyst Control Center and Over Current
much larger company . The principals gained ownership in each other
isn 't to keep Red Hat or CentOS servers operating smoothly
With filming for Teen age Mutant Ninja Turtles II : Half
's President Nicolas Maduro ( cent re L ) walks with
: Developers from multiple Linux distributions and companies today announced collaboration
/ Me / 2000 / XP I Develop ed
a low body temperature . Lep ro sy lesions appear mainly
Chaos Charm Sc ourge of Fle ets vs Crypt
ation , fals ification or plagiar ism " but not "
John Nguyen is confident that Viet - C aj un craw
on - track session at Circuit Paul Ric ard .
read : " Bar ack Obama would be the
: Lic enses and acc reditation aren t required
CPUs and memory ( Intel VT - x , AMD -
. Likewise , any Linux distribution with the Flat pak software
to keep Red Hat or CentOS servers operating smoothly and you
claiming " multiple " Linux distributions are collaborating on supporting Snap
guard to stop Laura from overwhelming them quickly .
Goodman 1994 John Lar ro qu ette

MIN ACTIVATIONS
MIN = -3.041

. youtube . com / watch ? v = 2 g
aging in the Yuk on . H ordes will
Liberals will tax their life savings . " A
from him , and he masturb ated me at the end
Additional filtering is done get rid of some spurious results .
Rice , and Special Assistant Samantha Power ), has been a
with Eleanor Roosevelt to secure luxury travel buses for Eddie Durham
, accused of small insignificant offences , like breaking the window
'd go some place and masturb ate and use it .
it is hard to get rid of them .
(@ m f lynn JR ) March 4 ,
ed Out Atlanta pic . twitter . com / xx A
had with their checking or savings accounts , including complaints over
from the Harvard - Smith sonian Center for Ast roph ysics
all their former captain has ever known is the path to
you can watch porn and masturb ate for a long time
major dollars just to get rid of the proverbial al bat
the sentences on four remaining offences were to be served concurrently
we realize the subject 's masturbation has killed his dog ,
8 b ET pic . twitter . com / W 8

INTERVAL 1.903 - 3.551
CONTAINS 0.066%

high bandwidth Layer 3 & Layer 4 D DoS attacks as
by Facebook employees in Palo Alto , Calif ., and in
cue for the recently under whelming Can a , Campbell and
casualty could be the Mars Atmosp here and Vol atile Evolution
targeting Windows , Mac and Linux for its initial release .

INTERVAL 0.255 - 1.903
CONTAINS 23.187%

mans the border with North Korea , where reports have emerged
: 47 ] < l ucky star 19 > z fg
He 's scored at least 25 Fantasy points in his past
service , which brings you great games created for consoles such
of other OD IN based boot load ers , so when

INTERVAL -1.393 - 0.255
CONTAINS 76.143%

she s has some miscar riages in the past
only way to be sure is to ask the opinion of
to handle multiple spikes in the angle - of - attack
By Christmas 10 69 , he entered the ruined city and
afforded on both sides of the building , with a 28

INTERVAL -3.041 - -1.393
CONTAINS 0.604%

Win ners in each category are marked with a star
was the redacted Cleveland source responsible for the initial African slurs
time your HS when the asc ension will happen as he
. That s what I want to be known
you are pretty much a wreck . The whole
Token well
Token ID880
Feature activation+1.01e-01

Pos logit contributions
tips+0.038
inges+0.038
interrupted+0.035
verbs+0.034
ollar+0.034
Neg logit contributions
 irrad-0.031
 kinderg-0.030
 reactor-0.030
 rejection-0.029
ific-0.027
Token as
Token ID355
Feature activation-2.79e-02

Pos logit contributions
 irrad+0.160
 reactor+0.155
 kinderg+0.155
 rejection+0.151
 workload+0.139
Neg logit contributions
tips-0.180
inges-0.180
interrupted-0.168
verbs-0.161
ollar-0.159
Token more
Token ID517
Feature activation-3.48e-01

Pos logit contributions
tips+0.053
inges+0.053
interrupted+0.049
verbs+0.047
ollar+0.047
Neg logit contributions
 irrad-0.041
 kinderg-0.040
 reactor-0.039
 rejection-0.038
 workload-0.035
Token relaxed
Token ID18397
Feature activation-5.87e-01

Pos logit contributions
tips+0.712
inges+0.705
interrupted+0.667
verbs+0.641
ollar+0.636
Neg logit contributions
 irrad-0.460
 kinderg-0.446
 reactor-0.442
 rejection-0.430
ific-0.385
Token AMD
Token ID10324
Feature activation+1.28e-01

Pos logit contributions
tips+1.183
inges+1.168
interrupted+1.104
verbs+1.061
ollar+1.054
Neg logit contributions
 irrad-0.782
 kinderg-0.765
 reactor-0.748
 rejection-0.728
ific-0.659
Token Catalyst
Token ID48238
Feature activation+3.55e+00

Pos logit contributions
 irrad+0.187
 kinderg+0.181
 reactor+0.180
 rejection+0.176
 workload+0.159
Neg logit contributions
tips-0.244
inges-0.242
interrupted-0.228
verbs-0.219
ollar-0.217
Token Control
Token ID6779
Feature activation+5.86e-01

Pos logit contributions
 irrad+5.230
 reactor+5.072
 kinderg+4.966
 rejection+4.938
 workload+4.655
Neg logit contributions
inges-6.616
tips-6.578
interrupted-6.220
ollar-6.007
verbs-5.976
Token Center
Token ID3337
Feature activation+9.59e-01

Pos logit contributions
 irrad+0.909
 reactor+0.880
 kinderg+0.880
 rejection+0.857
 workload+0.781
Neg logit contributions
tips-1.071
inges-1.063
interrupted-0.994
verbs-0.954
ollar-0.944
Token and
Token ID290
Feature activation+1.56e-01

Pos logit contributions
 irrad+1.547
 reactor+1.501
 kinderg+1.494
 rejection+1.464
 workload+1.347
Neg logit contributions
tips-1.690
inges-1.685
interrupted-1.572
verbs-1.504
ollar-1.494
Token Over
Token ID3827
Feature activation+1.14e+00

Pos logit contributions
 irrad+0.233
 kinderg+0.226
 reactor+0.225
 rejection+0.220
 workload+0.199
Neg logit contributions
tips-0.294
inges-0.292
interrupted-0.274
verbs-0.263
ollar-0.261
TokenCurrent
Token ID11297
Feature activation+9.18e-01

Pos logit contributions
 irrad+1.697
 reactor+1.646
 kinderg+1.628
 rejection+1.599
 workload+1.472
Neg logit contributions
inges-2.127
tips-2.126
interrupted-1.978
verbs-1.911
ollar-1.898
Token much
Token ID881
Feature activation-3.32e-01

Pos logit contributions
 irrad+0.148
 reactor+0.141
 kinderg+0.141
 rejection+0.138
 workload+0.124
Neg logit contributions
tips-0.214
inges-0.213
interrupted-0.200
verbs-0.192
ollar-0.192
Token larger
Token ID4025
Feature activation-6.98e-01

Pos logit contributions
tips+0.534
inges+0.529
interrupted+0.494
verbs+0.468
ollar+0.466
Neg logit contributions
 irrad-0.579
 reactor-0.565
 kinderg-0.564
 rejection-0.554
 workload-0.510
Token company
Token ID1664
Feature activation+3.66e-01

Pos logit contributions
tips+1.371
inges+1.361
interrupted+1.282
ollar+1.229
ippers+1.228
Neg logit contributions
 irrad-0.951
 kinderg-0.911
 reactor-0.903
 rejection-0.891
ific-0.798
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
 irrad+0.546
 reactor+0.528
 kinderg+0.526
 rejection+0.514
 workload+0.469
Neg logit contributions
tips-0.689
inges-0.686
interrupted-0.643
verbs-0.618
ollar-0.613
Token The
Token ID383
Feature activation+1.33e-01

Pos logit contributions
 irrad+0.020
 reactor+0.020
 kinderg+0.020
 rejection+0.019
 workload+0.018
Neg logit contributions
tips-0.021
inges-0.020
interrupted-0.019
verbs-0.018
ollar-0.018
Token principals
Token ID44998
Feature activation+2.92e+00

Pos logit contributions
 irrad+0.185
 kinderg+0.178
 reactor+0.178
 rejection+0.173
 workload+0.156
Neg logit contributions
tips-0.263
inges-0.262
interrupted-0.246
verbs-0.235
ollar-0.235
Token gained
Token ID8618
Feature activation-1.52e-01

Pos logit contributions
 irrad+4.079
 reactor+3.950
 kinderg+3.899
 rejection+3.844
 workload+3.487
Neg logit contributions
tips-5.750
inges-5.708
interrupted-5.359
verbs-5.215
ollar-5.120
Token ownership
Token ID9238
Feature activation-2.81e-01

Pos logit contributions
tips+0.334
inges+0.333
interrupted+0.314
verbs+0.303
ollar+0.302
Neg logit contributions
 irrad-0.179
 kinderg-0.171
 reactor-0.171
 rejection-0.165
 workload-0.146
Token in
Token ID287
Feature activation-1.16e-01

Pos logit contributions
tips+0.536
inges+0.535
interrupted+0.502
verbs+0.481
ollar+0.479
Neg logit contributions
 irrad-0.410
 kinderg-0.396
 reactor-0.395
 rejection-0.385
 workload-0.350
Token each
Token ID1123
Feature activation+1.26e+00

Pos logit contributions
inges+0.225
tips+0.224
interrupted+0.209
verbs+0.202
ollar+0.201
Neg logit contributions
 irrad-0.164
 reactor-0.158
 kinderg-0.158
 rejection-0.154
 workload-0.139
Token other
Token ID584
Feature activation+6.30e-02

Pos logit contributions
 irrad+1.670
 reactor+1.613
 kinderg+1.600
 rejection+1.563
 workload+1.407
Neg logit contributions
tips-2.577
inges-2.564
interrupted-2.418
verbs-2.338
ollar-2.311
Token isn
Token ID2125
Feature activation-6.80e-02

Pos logit contributions
 irrad+0.403
 reactor+0.392
 kinderg+0.392
 rejection+0.382
 workload+0.352
Neg logit contributions
tips-0.378
inges-0.376
interrupted-0.347
verbs-0.332
ollar-0.329
Token't
Token ID470
Feature activation-3.79e-01

Pos logit contributions
tips+0.146
inges+0.145
interrupted+0.137
verbs+0.133
ollar+0.132
Neg logit contributions
 irrad-0.084
 reactor-0.080
 kinderg-0.080
 rejection-0.078
 workload-0.069
Token to
Token ID284
Feature activation+1.76e-01

Pos logit contributions
tips+0.701
inges+0.699
interrupted+0.654
verbs+0.626
ollar+0.623
Neg logit contributions
 irrad-0.571
 kinderg-0.554
 reactor-0.550
 rejection-0.539
 workload-0.486
Token keep
Token ID1394
Feature activation+1.54e-01

Pos logit contributions
 irrad+0.258
 reactor+0.250
 kinderg+0.249
 rejection+0.243
 workload+0.221
Neg logit contributions
tips-0.336
inges-0.335
interrupted-0.314
verbs-0.301
ollar-0.299
Token Red
Token ID2297
Feature activation+1.51e+00

Pos logit contributions
 irrad+0.227
 kinderg+0.220
 reactor+0.219
 rejection+0.214
 workload+0.194
Neg logit contributions
tips-0.290
inges-0.289
interrupted-0.270
verbs-0.260
ollar-0.259
Token Hat
Token ID10983
Feature activation+2.82e+00

Pos logit contributions
 irrad+2.317
 reactor+2.245
 kinderg+2.233
 rejection+2.187
 workload+2.000
Neg logit contributions
tips-2.793
inges-2.785
interrupted-2.609
verbs-2.498
ollar-2.477
Token or
Token ID393
Feature activation+6.39e-01

Pos logit contributions
 irrad+4.305
 reactor+4.204
 kinderg+4.128
 rejection+4.013
 workload+3.767
Neg logit contributions
inges-5.166
tips-5.110
interrupted-4.852
ollar-4.582
verbs-4.572
Token CentOS
Token ID32418
Feature activation+2.44e+00

Pos logit contributions
 irrad+0.964
 reactor+0.933
 kinderg+0.929
 rejection+0.909
 workload+0.828
Neg logit contributions
tips-1.190
inges-1.186
interrupted-1.109
verbs-1.065
ollar-1.057
Token servers
Token ID9597
Feature activation+1.45e+00

Pos logit contributions
 irrad+3.805
 reactor+3.708
 kinderg+3.666
 rejection+3.559
 workload+3.334
Neg logit contributions
inges-4.423
tips-4.400
interrupted-4.125
verbs-3.944
ollar-3.910
Token operating
Token ID5361
Feature activation+5.73e-01

Pos logit contributions
 irrad+2.359
 reactor+2.289
 kinderg+2.272
 rejection+2.220
 workload+2.063
Neg logit contributions
tips-2.543
inges-2.539
interrupted-2.365
verbs-2.264
ollar-2.243
Token smoothly
Token ID21461
Feature activation-4.28e-01

Pos logit contributions
 irrad+0.877
 reactor+0.850
 kinderg+0.848
 rejection+0.830
 workload+0.751
Neg logit contributions
tips-1.054
inges-1.047
interrupted-0.979
verbs-0.940
ollar-0.935
TokenWith
Token ID3152
Feature activation-3.18e-01

Pos logit contributions
 irrad+0.011
 reactor+0.011
 kinderg+0.011
 rejection+0.010
 workload+0.010
Neg logit contributions
tips-0.011
inges-0.011
interrupted-0.011
verbs-0.010
ollar-0.010
Token filming
Token ID17691
Feature activation-8.33e-01

Pos logit contributions
tips+0.606
inges+0.602
interrupted+0.561
verbs+0.542
ollar+0.540
Neg logit contributions
 irrad-0.459
 kinderg-0.446
 reactor-0.445
 rejection-0.431
 workload-0.395
Token for
Token ID329
Feature activation-1.30e-01

Pos logit contributions
tips+1.534
inges+1.508
interrupted+1.438
ollar+1.360
ippers+1.358
Neg logit contributions
 irrad-1.250
 reactor-1.213
 kinderg-1.210
 rejection-1.173
 teamwork-1.070
Token Teen
Token ID21432
Feature activation+5.93e-01

Pos logit contributions
tips+0.241
inges+0.240
interrupted+0.224
verbs+0.216
ollar+0.215
Neg logit contributions
 irrad-0.196
 reactor-0.190
 kinderg-0.190
 rejection-0.184
 workload-0.169
Tokenage
Token ID496
Feature activation+7.06e-01

Pos logit contributions
 irrad+0.734
 reactor+0.704
 kinderg+0.702
 rejection+0.680
 workload+0.610
Neg logit contributions
tips-1.268
inges-1.263
interrupted-1.195
verbs-1.155
ollar-1.148
Token Mutant
Token ID35319
Feature activation+2.79e+00

Pos logit contributions
 irrad+0.706
 reactor+0.669
 kinderg+0.668
 rejection+0.641
 workload+0.558
Neg logit contributions
tips-1.681
inges-1.669
interrupted-1.588
verbs-1.539
ollar-1.533
Token Ninja
Token ID14152
Feature activation+1.03e+00

Pos logit contributions
 irrad+3.104
 reactor+3.030
 kinderg+2.963
 rejection+2.877
 workload+2.525
Neg logit contributions
inges-6.217
tips-6.141
interrupted-5.856
verbs-5.693
ippers-5.662
Token Turtles
Token ID44356
Feature activation+2.19e+00

Pos logit contributions
 irrad+1.765
 reactor+1.713
 kinderg+1.710
 rejection+1.676
 workload+1.549
Neg logit contributions
tips-1.716
inges-1.708
interrupted-1.586
verbs-1.514
ollar-1.505
Token II
Token ID2873
Feature activation-5.57e-01

Pos logit contributions
 irrad+3.419
 reactor+3.303
 kinderg+3.293
 rejection+3.234
 workload+2.948
Neg logit contributions
inges-4.006
tips-3.962
interrupted-3.652
verbs-3.568
ollar-3.481
Token:
Token ID25
Feature activation+8.98e-02

Pos logit contributions
tips+1.032
inges+1.022
interrupted+0.971
ollar+0.921
verbs+0.921
Neg logit contributions
 irrad-0.832
 reactor-0.807
 kinderg-0.807
 rejection-0.787
 workload-0.719
Token Half
Token ID13139
Feature activation+8.76e-02

Pos logit contributions
 irrad+0.142
 kinderg+0.138
 reactor+0.137
 rejection+0.132
 workload+0.125
Neg logit contributions
tips-0.162
inges-0.158
interrupted-0.148
verbs-0.144
ollar-0.143
Token's
Token ID338
Feature activation+1.04e-01

Pos logit contributions
 irrad+0.533
 kinderg+0.515
 reactor+0.514
 rejection+0.502
 workload+0.463
Neg logit contributions
tips-0.559
inges-0.559
interrupted-0.519
verbs-0.498
ollar-0.494
Token President
Token ID1992
Feature activation+9.30e-01

Pos logit contributions
 irrad+0.110
 kinderg+0.105
 reactor+0.104
 rejection+0.099
 teamwork+0.087
Neg logit contributions
tips-0.238
inges-0.238
interrupted-0.223
ollar-0.217
verbs-0.217
Token Nicolas
Token ID29737
Feature activation+5.22e-01

Pos logit contributions
 irrad+1.461
 reactor+1.415
 kinderg+1.411
 rejection+1.377
 workload+1.264
Neg logit contributions
tips-1.681
inges-1.675
interrupted-1.563
verbs-1.504
ollar-1.494
Token Maduro
Token ID33007
Feature activation+1.79e+00

Pos logit contributions
 irrad+0.488
 reactor+0.462
 kinderg+0.459
 rejection+0.438
 workload+0.374
Neg logit contributions
inges-1.272
tips-1.270
interrupted-1.204
verbs-1.167
ollar-1.167
Token (
Token ID357
Feature activation+4.09e-01

Pos logit contributions
 irrad+3.241
 reactor+3.146
 kinderg+3.128
 rejection+3.073
 workload+2.860
Neg logit contributions
tips-2.821
inges-2.778
interrupted-2.594
verbs-2.481
ollar-2.444
Tokencent
Token ID1087
Feature activation+2.76e+00

Pos logit contributions
 irrad+0.677
 reactor+0.657
 kinderg+0.654
 rejection+0.639
 workload+0.590
Neg logit contributions
tips-0.708
inges-0.702
interrupted-0.657
verbs-0.626
ollar-0.621
Tokenre
Token ID260
Feature activation+4.67e-01

Pos logit contributions
 irrad+3.171
 reactor+3.050
 kinderg+3.007
 rejection+2.934
 workload+2.594
Neg logit contributions
tips-6.119
inges-6.091
interrupted-5.791
verbs-5.604
ollar-5.516
Token L
Token ID406
Feature activation+8.24e-01

Pos logit contributions
 irrad+0.656
 kinderg+0.636
 reactor+0.632
 rejection+0.615
 workload+0.558
Neg logit contributions
tips-0.921
inges-0.917
interrupted-0.860
ollar-0.823
verbs-0.823
Token)
Token ID8
Feature activation+3.14e-01

Pos logit contributions
 irrad+1.236
 reactor+1.197
 kinderg+1.192
 rejection+1.165
 workload+1.063
Neg logit contributions
tips-1.545
inges-1.540
interrupted-1.441
verbs-1.384
ollar-1.375
Token walks
Token ID11114
Feature activation-8.57e-02

Pos logit contributions
 irrad+0.552
 reactor+0.536
 kinderg+0.536
 rejection+0.525
 workload+0.486
Neg logit contributions
tips-0.506
inges-0.505
interrupted-0.467
verbs-0.444
ollar-0.441
Token with
Token ID351
Feature activation+3.42e-02

Pos logit contributions
tips+0.149
inges+0.148
interrupted+0.138
verbs+0.132
ollar+0.130
Neg logit contributions
 irrad-0.139
 kinderg-0.135
 reactor-0.134
 rejection-0.132
 workload-0.121
Token:
Token ID25
Feature activation+8.44e-02

Pos logit contributions
 irrad+0.159
 kinderg+0.153
 reactor+0.152
 rejection+0.148
 workload+0.135
Neg logit contributions
tips-0.204
inges-0.202
interrupted-0.188
verbs-0.182
ippers-0.180
Token Developers
Token ID34152
Feature activation+2.56e-01

Pos logit contributions
 irrad+0.144
 reactor+0.140
 kinderg+0.139
 rejection+0.136
 workload+0.126
Neg logit contributions
tips-0.141
inges-0.140
interrupted-0.130
verbs-0.125
ollar-0.124
Token from
Token ID422
Feature activation+4.69e-01

Pos logit contributions
 irrad+0.397
 reactor+0.385
 kinderg+0.384
 rejection+0.373
 workload+0.341
Neg logit contributions
tips-0.468
inges-0.465
interrupted-0.435
verbs-0.418
ollar-0.414
Token multiple
Token ID3294
Feature activation+3.81e-01

Pos logit contributions
 irrad+0.690
 reactor+0.667
 kinderg+0.661
 rejection+0.646
 workload+0.586
Neg logit contributions
tips-0.890
inges-0.889
interrupted-0.828
ollar-0.797
verbs-0.797
Token Linux
Token ID7020
Feature activation+1.46e+00

Pos logit contributions
 irrad+0.541
 reactor+0.523
 kinderg+0.523
 rejection+0.506
 teamwork+0.456
Neg logit contributions
inges-0.737
tips-0.735
interrupted-0.686
verbs-0.661
ippers-0.661
Token distributions
Token ID24570
Feature activation+2.73e+00

Pos logit contributions
 irrad+2.121
 reactor+2.051
 kinderg+2.045
 rejection+1.996
 workload+1.823
Neg logit contributions
tips-2.808
inges-2.798
interrupted-2.631
verbs-2.525
ollar-2.508
Token and
Token ID290
Feature activation-9.09e-02

Pos logit contributions
 irrad+3.915
 reactor+3.766
 kinderg+3.714
 rejection+3.701
 workload+3.393
Neg logit contributions
inges-5.279
tips-5.246
interrupted-4.929
verbs-4.742
ollar-4.692
Token companies
Token ID2706
Feature activation+7.96e-01

Pos logit contributions
tips+0.172
inges+0.171
interrupted+0.159
ippers+0.153
verbs+0.153
Neg logit contributions
 irrad-0.133
 reactor-0.129
 kinderg-0.128
 rejection-0.125
 teamwork-0.113
Token today
Token ID1909
Feature activation+1.10e+00

Pos logit contributions
 irrad+1.210
 reactor+1.173
 kinderg+1.169
 rejection+1.141
 workload+1.042
Neg logit contributions
tips-1.476
inges-1.470
interrupted-1.377
verbs-1.321
ollar-1.312
Token announced
Token ID3414
Feature activation+5.33e-01

Pos logit contributions
 irrad+1.664
 reactor+1.611
 kinderg+1.603
 rejection+1.570
 workload+1.435
Neg logit contributions
tips-2.034
inges-2.028
interrupted-1.898
verbs-1.822
ollar-1.807
Token collaboration
Token ID12438
Feature activation+5.12e-01

Pos logit contributions
 irrad+0.744
 reactor+0.718
 kinderg+0.714
 rejection+0.693
 workload+0.628
Neg logit contributions
tips-1.055
inges-1.049
interrupted-0.985
verbs-0.950
ollar-0.943
Token /
Token ID1220
Feature activation+1.09e-01

Pos logit contributions
 irrad+1.193
 reactor+1.157
 kinderg+1.152
 rejection+1.129
 workload+1.035
Neg logit contributions
tips-1.340
inges-1.335
interrupted-1.247
verbs-1.194
ollar-1.185
Token Me
Token ID2185
Feature activation+4.91e-01

Pos logit contributions
 irrad+0.156
 reactor+0.151
 kinderg+0.149
 rejection+0.147
 teamwork+0.133
Neg logit contributions
tips-0.211
inges-0.209
interrupted-0.195
verbs-0.188
ollar-0.187
Token /
Token ID1220
Feature activation+9.19e-02

Pos logit contributions
 irrad+0.784
 reactor+0.761
 kinderg+0.758
 rejection+0.742
 workload+0.681
Neg logit contributions
tips-0.871
inges-0.868
interrupted-0.810
verbs-0.775
ollar-0.770
Token 2000
Token ID4751
Feature activation+4.61e-01

Pos logit contributions
 irrad+0.131
 reactor+0.127
 kinderg+0.125
 rejection+0.122
 workload+0.111
Neg logit contributions
tips-0.180
inges-0.179
interrupted-0.167
verbs-0.161
ollar-0.160
Token /
Token ID1220
Feature activation+1.30e-01

Pos logit contributions
 irrad+0.698
 reactor+0.676
 kinderg+0.675
 rejection+0.657
 workload+0.600
Neg logit contributions
tips-0.857
inges-0.855
interrupted-0.800
verbs-0.767
ollar-0.763
Token XP
Token ID11961
Feature activation+2.71e+00

Pos logit contributions
 irrad+0.118
 reactor+0.113
 kinderg+0.111
 rejection+0.107
 workload+0.091
Neg logit contributions
tips-0.320
inges-0.319
interrupted-0.302
verbs-0.293
ollar-0.293
Token I
Token ID314
Feature activation-2.43e-01

Pos logit contributions
 irrad+3.902
 reactor+3.775
 kinderg+3.724
 rejection+3.695
 workload+3.358
Neg logit contributions
tips-5.193
inges-5.173
interrupted-4.879
verbs-4.677
ollar-4.615
Token
Token ID198
Feature activation+6.19e-02

Pos logit contributions
inges+0.409
tips+0.407
interrupted+0.379
verbs+0.364
ollar+0.361
Neg logit contributions
 irrad-0.412
 reactor-0.398
 kinderg-0.397
 rejection-0.386
 teamwork-0.356
Token
Token ID198
Feature activation+7.70e-02

Pos logit contributions
 irrad+0.105
 reactor+0.102
 kinderg+0.101
 rejection+0.099
 workload+0.092
Neg logit contributions
tips-0.105
inges-0.104
interrupted-0.096
verbs-0.092
ollar-0.092
TokenDevelop
Token ID19246
Feature activation+1.31e+00

Pos logit contributions
 irrad+0.122
 reactor+0.118
 kinderg+0.117
 rejection+0.115
 workload+0.105
Neg logit contributions
tips-0.139
inges-0.138
interrupted-0.129
verbs-0.124
ollar-0.123
Tokened
Token ID276
Feature activation+4.52e-02

Pos logit contributions
 irrad+2.363
 reactor+2.301
 kinderg+2.291
 rejection+2.251
 workload+2.092
Neg logit contributions
tips-2.044
inges-2.034
interrupted-1.881
verbs-1.798
ollar-1.777
Token a
Token ID257
Feature activation-7.87e-03

Pos logit contributions
 irrad+0.006
 reactor+0.006
 kinderg+0.005
 rejection+0.005
 workload+0.005
Neg logit contributions
tips-0.008
inges-0.008
interrupted-0.008
verbs-0.007
ollar-0.007
Token low
Token ID1877
Feature activation-2.32e-01

Pos logit contributions
tips+0.016
inges+0.016
interrupted+0.015
verbs+0.015
ollar+0.015
Neg logit contributions
 irrad-0.010
 reactor-0.010
 kinderg-0.010
 rejection-0.009
 workload-0.009
Token body
Token ID1767
Feature activation+7.21e-01

Pos logit contributions
tips+0.482
inges+0.480
interrupted+0.453
verbs+0.437
ollar+0.435
Neg logit contributions
 irrad-0.296
 kinderg-0.287
 reactor-0.286
 rejection-0.275
 teamwork-0.248
Token temperature
Token ID5951
Feature activation+2.22e-01

Pos logit contributions
 irrad+0.843
 reactor+0.813
 kinderg+0.813
 rejection+0.781
 teamwork+0.695
Neg logit contributions
tips-1.584
inges-1.575
interrupted-1.489
verbs-1.440
ollar-1.435
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
 irrad+0.342
 kinderg+0.332
 reactor+0.332
 rejection+0.322
 workload+0.296
Neg logit contributions
tips-0.407
inges-0.405
interrupted-0.379
verbs-0.362
ollar-0.361
Token Lep
Token ID42957
Feature activation+2.66e+00

Pos logit contributions
 irrad+0.300
 reactor+0.291
 kinderg+0.290
 rejection+0.284
 workload+0.262
Neg logit contributions
tips-0.300
inges-0.299
interrupted-0.278
verbs-0.266
ollar-0.263
Tokenro
Token ID305
Feature activation+1.82e-01

Pos logit contributions
 irrad+3.567
 reactor+3.415
 kinderg+3.358
 rejection+3.327
ific+2.986
Neg logit contributions
inges-5.334
tips-5.312
interrupted-5.026
verbs-4.801
ollar-4.769
Tokensy
Token ID1837
Feature activation-3.51e-01

Pos logit contributions
 irrad+0.329
 reactor+0.319
 kinderg+0.318
 rejection+0.313
 workload+0.290
Neg logit contributions
tips-0.285
inges-0.285
interrupted-0.264
verbs-0.250
ollar-0.248
Token lesions
Token ID35258
Feature activation-2.10e-02

Pos logit contributions
tips+0.587
inges+0.584
interrupted+0.542
verbs+0.517
ollar+0.513
Neg logit contributions
 irrad-0.593
 reactor-0.578
 kinderg-0.577
 rejection-0.563
 workload-0.521
Token appear
Token ID1656
Feature activation-5.64e-01

Pos logit contributions
tips+0.037
inges+0.037
interrupted+0.034
verbs+0.033
ollar+0.033
Neg logit contributions
 irrad-0.033
 kinderg-0.033
 reactor-0.033
 rejection-0.032
 workload-0.029
Token mainly
Token ID8384
Feature activation+9.95e-01

Pos logit contributions
inges+1.020
tips+1.019
interrupted+0.948
verbs+0.901
ollar+0.901
Neg logit contributions
 irrad-0.861
 reactor-0.850
 kinderg-0.847
 rejection-0.824
 workload-0.757
Token Chaos
Token ID13903
Feature activation+1.21e+00

Pos logit contributions
 irrad+0.930
 reactor+0.903
 kinderg+0.900
 rejection+0.881
 workload+0.812
Neg logit contributions
tips-0.967
inges-0.964
interrupted-0.897
verbs-0.858
ollar-0.851
Token Charm
Token ID30225
Feature activation+9.84e-01

Pos logit contributions
 irrad+1.960
 reactor+1.908
 kinderg+1.892
 rejection+1.859
 workload+1.705
Neg logit contributions
tips-2.119
inges-2.109
interrupted-1.967
verbs-1.883
ollar-1.863
Token
Token ID198
Feature activation+1.06e-01

Pos logit contributions
 irrad+1.589
 reactor+1.547
 kinderg+1.533
 rejection+1.506
 workload+1.379
Neg logit contributions
tips-1.730
inges-1.723
interrupted-1.607
verbs-1.540
ollar-1.521
Token
Token ID198
Feature activation+2.67e-02

Pos logit contributions
 irrad+0.176
 reactor+0.171
 kinderg+0.171
 rejection+0.167
 workload+0.154
Neg logit contributions
tips-0.182
inges-0.181
interrupted-0.168
verbs-0.161
ollar-0.160
TokenSc
Token ID3351
Feature activation+9.07e-01

Pos logit contributions
 irrad+0.044
 reactor+0.043
 kinderg+0.042
 rejection+0.042
 workload+0.038
Neg logit contributions
tips-0.047
inges-0.046
interrupted-0.043
verbs-0.041
ollar-0.041
Tokenourge
Token ID14501
Feature activation+2.63e+00

Pos logit contributions
 irrad+1.464
 reactor+1.418
 kinderg+1.406
 rejection+1.380
ific+1.273
Neg logit contributions
tips-1.588
inges-1.575
interrupted-1.476
verbs-1.412
ollar-1.390
Token of
Token ID286
Feature activation+1.31e-01

Pos logit contributions
 irrad+3.802
 reactor+3.708
 kinderg+3.657
 rejection+3.587
 workload+3.248
Neg logit contributions
tips-5.040
inges-5.008
interrupted-4.698
verbs-4.529
ollar-4.455
Token Fle
Token ID12005
Feature activation+1.23e+00

Pos logit contributions
 irrad+0.212
 reactor+0.206
 kinderg+0.205
 rejection+0.201
 workload+0.185
Neg logit contributions
tips-0.229
inges-0.229
interrupted-0.212
verbs-0.204
ollar-0.202
Tokenets
Token ID1039
Feature activation+2.62e-01

Pos logit contributions
 irrad+1.589
 reactor+1.530
 kinderg+1.520
 rejection+1.478
ific+1.325
Neg logit contributions
tips-2.558
inges-2.532
interrupted-2.409
verbs-2.315
ollar-2.294
Token vs
Token ID3691
Feature activation+5.77e-01

Pos logit contributions
 irrad+0.353
 kinderg+0.339
 reactor+0.338
 rejection+0.329
 workload+0.298
Neg logit contributions
tips-0.532
inges-0.530
interrupted-0.499
verbs-0.480
ollar-0.479
Token Crypt
Token ID15126
Feature activation+4.51e-01

Pos logit contributions
 irrad+0.954
 reactor+0.926
 kinderg+0.923
 rejection+0.904
 workload+0.833
Neg logit contributions
tips-0.994
inges-0.990
interrupted-0.922
verbs-0.881
ollar-0.875
Tokenation
Token ID341
Feature activation+2.61e-01

Pos logit contributions
 irrad+1.170
 reactor+1.137
 kinderg+1.133
 rejection+1.113
 workload+1.028
Neg logit contributions
tips-1.104
inges-1.100
interrupted-1.021
verbs-0.975
ollar-0.965
Token,
Token ID11
Feature activation+5.95e-02

Pos logit contributions
 irrad+0.408
 reactor+0.396
 kinderg+0.395
 rejection+0.385
 workload+0.353
Neg logit contributions
tips-0.471
inges-0.469
interrupted-0.439
verbs-0.419
ollar-0.418
Token fals
Token ID27807
Feature activation-4.41e-01

Pos logit contributions
 irrad+0.080
 kinderg+0.078
 reactor+0.078
 rejection+0.074
 workload+0.068
Neg logit contributions
tips-0.118
inges-0.117
interrupted-0.111
verbs-0.106
ippers-0.106
Tokenification
Token ID2649
Feature activation-8.51e-02

Pos logit contributions
inges+0.722
tips+0.720
interrupted+0.667
verbs+0.640
ollar+0.637
Neg logit contributions
 irrad-0.760
 kinderg-0.742
 reactor-0.740
 rejection-0.720
 workload-0.668
Token or
Token ID393
Feature activation+3.80e-01

Pos logit contributions
tips+0.155
inges+0.154
interrupted+0.144
verbs+0.138
ollar+0.137
Neg logit contributions
 irrad-0.132
 reactor-0.128
 kinderg-0.127
 rejection-0.124
 workload-0.114
Token plagiar
Token ID42054
Feature activation+2.60e+00

Pos logit contributions
 irrad+0.497
 reactor+0.481
 kinderg+0.479
 rejection+0.461
 workload+0.419
Neg logit contributions
tips-0.773
inges-0.770
interrupted-0.729
verbs-0.697
ollar-0.694
Tokenism
Token ID1042
Feature activation+1.93e-01

Pos logit contributions
 irrad+3.452
 reactor+3.321
 rejection+3.271
 kinderg+3.242
ific+3.019
Neg logit contributions
tips-5.262
inges-5.161
interrupted-4.911
verbs-4.752
ollar-4.683
Token"
Token ID1
Feature activation+9.14e-01

Pos logit contributions
 irrad+0.297
 reactor+0.289
 kinderg+0.288
 rejection+0.280
 workload+0.257
Neg logit contributions
tips-0.352
inges-0.351
interrupted-0.328
verbs-0.313
ollar-0.313
Token but
Token ID475
Feature activation-5.70e-01

Pos logit contributions
 irrad+1.420
 reactor+1.374
 kinderg+1.368
 rejection+1.343
 workload+1.228
Neg logit contributions
tips-1.665
inges-1.658
interrupted-1.551
verbs-1.491
ollar-1.475
Token not
Token ID407
Feature activation-7.03e-01

Pos logit contributions
inges+0.986
tips+0.986
interrupted+0.906
verbs+0.866
ollar+0.863
Neg logit contributions
 irrad-0.923
 reactor-0.898
 kinderg-0.898
 rejection-0.869
 teamwork-0.806
Token "
Token ID366
Feature activation-1.26e-02

Pos logit contributions
tips+1.333
inges+1.329
interrupted+1.231
ollar+1.179
verbs+1.179
Neg logit contributions
 irrad-1.009
 reactor-0.980
 kinderg-0.980
 rejection-0.942
 workload-0.863
TokenJohn
Token ID7554
Feature activation+4.70e-03

Pos logit contributions
 irrad+0.008
 reactor+0.008
 kinderg+0.007
 rejection+0.007
 workload+0.007
Neg logit contributions
tips-0.008
inges-0.007
interrupted-0.007
verbs-0.007
ollar-0.007
Token Nguyen
Token ID42379
Feature activation+6.12e-01

Pos logit contributions
 irrad+0.007
 reactor+0.007
 kinderg+0.007
 rejection+0.007
 workload+0.006
Neg logit contributions
tips-0.009
inges-0.009
interrupted-0.008
verbs-0.008
ollar-0.008
Token is
Token ID318
Feature activation+2.31e-01

Pos logit contributions
 irrad+1.018
 reactor+0.989
 kinderg+0.985
 rejection+0.965
 workload+0.890
Neg logit contributions
tips-1.047
inges-1.043
interrupted-0.971
verbs-0.929
ollar-0.921
Token confident
Token ID6563
Feature activation+9.49e-01

Pos logit contributions
 irrad+0.341
 reactor+0.331
 kinderg+0.330
 rejection+0.324
 workload+0.293
Neg logit contributions
tips-0.435
inges-0.432
interrupted-0.405
verbs-0.389
ollar-0.387
Token that
Token ID326
Feature activation+2.23e-01

Pos logit contributions
 irrad+1.370
 reactor+1.323
 kinderg+1.320
 rejection+1.279
 workload+1.166
Neg logit contributions
tips-1.835
inges-1.826
interrupted-1.707
verbs-1.646
ollar-1.639
Token Viet
Token ID8730
Feature activation+2.59e+00

Pos logit contributions
 irrad+0.354
 reactor+0.344
 kinderg+0.342
 rejection+0.332
 workload+0.306
Neg logit contributions
tips-0.398
inges-0.396
interrupted-0.368
verbs-0.353
ollar-0.353
Token-
Token ID12
Feature activation+8.98e-02

Pos logit contributions
 irrad+4.474
 kinderg+4.378
 reactor+4.371
 rejection+4.304
 workload+3.955
Neg logit contributions
inges-4.126
tips-4.091
interrupted-3.799
verbs-3.572
ollar-3.570
TokenC
Token ID34
Feature activation+2.48e-01

Pos logit contributions
 irrad+0.196
 reactor+0.192
 kinderg+0.191
 rejection+0.189
 workload+0.177
Neg logit contributions
tips-0.107
inges-0.106
interrupted-0.096
verbs-0.089
ollar-0.088
Tokenaj
Token ID1228
Feature activation+2.02e-01

Pos logit contributions
 irrad+0.469
 reactor+0.458
 kinderg+0.456
 rejection+0.446
 workload+0.416
Neg logit contributions
tips-0.370
inges-0.368
interrupted-0.336
ollar-0.324
verbs-0.322
Tokenun
Token ID403
Feature activation+4.70e-01

Pos logit contributions
 irrad+0.305
 reactor+0.296
 kinderg+0.294
 rejection+0.288
 workload+0.263
Neg logit contributions
tips-0.376
inges-0.375
interrupted-0.351
verbs-0.337
ollar-0.334
Token craw
Token ID27784
Feature activation+1.40e+00

Pos logit contributions
 irrad+0.739
 reactor+0.715
 kinderg+0.712
 rejection+0.697
 workload+0.639
Neg logit contributions
tips-0.847
inges-0.842
interrupted-0.787
verbs-0.756
ollar-0.750
Token on
Token ID319
Feature activation-2.37e-01

Pos logit contributions
tips+1.059
inges+1.055
interrupted+0.984
verbs+0.943
ollar+0.943
Neg logit contributions
 irrad-0.889
 kinderg-0.857
 reactor-0.854
 rejection-0.836
 workload-0.761
Token-
Token ID12
Feature activation-9.02e-02

Pos logit contributions
tips+0.443
inges+0.442
interrupted+0.412
verbs+0.396
ollar+0.394
Neg logit contributions
 irrad-0.356
 reactor-0.344
 kinderg-0.343
 rejection-0.335
 workload-0.305
Tokentrack
Token ID11659
Feature activation+5.53e-01

Pos logit contributions
tips+0.143
inges+0.142
interrupted+0.131
verbs+0.125
ollar+0.124
Neg logit contributions
 irrad-0.163
 kinderg-0.158
 reactor-0.158
 rejection-0.155
 workload-0.143
Token session
Token ID6246
Feature activation+7.53e-03

Pos logit contributions
 irrad+0.779
 reactor+0.752
 kinderg+0.749
 rejection+0.731
 workload+0.662
Neg logit contributions
tips-1.089
inges-1.085
interrupted-1.020
verbs-0.982
ollar-0.975
Token at
Token ID379
Feature activation-2.80e-01

Pos logit contributions
 irrad+0.012
 kinderg+0.012
 reactor+0.012
 rejection+0.011
 workload+0.011
Neg logit contributions
tips-0.013
inges-0.013
interrupted-0.012
verbs-0.012
ollar-0.012
Token Circuit
Token ID13588
Feature activation+2.57e+00

Pos logit contributions
tips+0.492
inges+0.490
interrupted+0.454
verbs+0.433
ollar+0.433
Neg logit contributions
 irrad-0.450
 reactor-0.433
 kinderg-0.432
 rejection-0.424
 workload-0.388
Token Paul
Token ID3362
Feature activation-2.85e-01

Pos logit contributions
 irrad+3.702
 reactor+3.598
 kinderg+3.590
 rejection+3.521
 workload+3.170
Neg logit contributions
inges-4.832
tips-4.829
interrupted-4.536
verbs-4.380
ippers-4.300
Token Ric
Token ID15868
Feature activation+3.45e-01

Pos logit contributions
tips+0.491
inges+0.489
interrupted+0.453
verbs+0.433
ollar+0.430
Neg logit contributions
 irrad-0.474
 reactor-0.459
 kinderg-0.458
 rejection-0.449
 workload-0.413
Tokenard
Token ID446
Feature activation+1.66e-01

Pos logit contributions
 irrad+0.581
 reactor+0.566
 kinderg+0.563
 rejection+0.552
 workload+0.509
Neg logit contributions
tips-0.579
inges-0.577
interrupted-0.538
verbs-0.514
ollar-0.507
Token.
Token ID13
Feature activation+7.41e-02

Pos logit contributions
 irrad+0.266
 reactor+0.257
 kinderg+0.256
 rejection+0.251
 workload+0.230
Neg logit contributions
tips-0.296
inges-0.294
interrupted-0.275
verbs-0.263
ollar-0.261
Token
Token ID198
Feature activation-3.63e-02

Pos logit contributions
 irrad+0.125
 reactor+0.121
 kinderg+0.121
 rejection+0.118
 workload+0.109
Neg logit contributions
tips-0.125
inges-0.125
interrupted-0.116
verbs-0.111
ollar-0.110
Token read
Token ID1100
Feature activation+4.60e-01

Pos logit contributions
 irrad+0.389
 reactor+0.377
 kinderg+0.375
 rejection+0.366
 workload+0.337
Neg logit contributions
tips-0.423
inges-0.420
interrupted-0.392
verbs-0.376
ollar-0.373
Token:
Token ID25
Feature activation-1.90e-01

Pos logit contributions
 irrad+0.677
 reactor+0.655
 kinderg+0.651
 rejection+0.637
 workload+0.581
Neg logit contributions
tips-0.877
inges-0.873
interrupted-0.819
verbs-0.789
ollar-0.781
Token
Token ID198
Feature activation+2.18e-01

Pos logit contributions
tips+0.304
inges+0.303
interrupted+0.280
verbs+0.267
ollar+0.265
Neg logit contributions
 irrad-0.336
 reactor-0.326
 kinderg-0.325
 rejection-0.319
 workload-0.296
Token
Token ID198
Feature activation+1.66e-01

Pos logit contributions
 irrad+0.362
 reactor+0.352
 kinderg+0.351
 rejection+0.343
 workload+0.316
Neg logit contributions
tips-0.374
inges-0.372
interrupted-0.346
verbs-0.331
ollar-0.329
Token"
Token ID1
Feature activation+3.93e-01

Pos logit contributions
 irrad+0.248
 reactor+0.240
 kinderg+0.238
 rejection+0.232
 workload+0.213
Neg logit contributions
tips-0.317
inges-0.314
interrupted-0.295
verbs-0.283
ollar-0.281
TokenBar
Token ID10374
Feature activation+2.50e+00

Pos logit contributions
 irrad+0.631
 reactor+0.613
 kinderg+0.609
 rejection+0.595
 workload+0.549
Neg logit contributions
tips-0.698
inges-0.693
interrupted-0.648
verbs-0.620
ollar-0.615
Tokenack
Token ID441
Feature activation-4.57e-02

Pos logit contributions
 irrad+4.441
 reactor+4.344
 kinderg+4.315
 rejection+4.261
 workload+3.932
Neg logit contributions
tips-3.926
inges-3.885
interrupted-3.601
verbs-3.377
ippers-3.332
Token Obama
Token ID2486
Feature activation+3.14e-01

Pos logit contributions
tips+0.114
inges+0.113
interrupted+0.108
ollar+0.105
verbs+0.104
Neg logit contributions
 irrad-0.041
 kinderg-0.038
 reactor-0.038
 rejection-0.036
 workload-0.031
Token would
Token ID561
Feature activation-8.28e-01

Pos logit contributions
 irrad+0.546
 reactor+0.531
 kinderg+0.529
 rejection+0.518
 workload+0.480
Neg logit contributions
tips-0.514
inges-0.512
interrupted-0.476
verbs-0.453
ollar-0.450
Token be
Token ID307
Feature activation+1.38e-01

Pos logit contributions
tips+1.551
inges+1.543
interrupted+1.447
verbs+1.382
ollar+1.382
Neg logit contributions
 irrad-1.220
 kinderg-1.194
 reactor-1.182
 rejection-1.148
 workload-1.053
Token the
Token ID262
Feature activation+9.86e-02

Pos logit contributions
 irrad+0.197
 reactor+0.191
 kinderg+0.191
 rejection+0.185
 workload+0.169
Neg logit contributions
tips-0.265
inges-0.264
interrupted-0.247
verbs-0.238
ollar-0.237
Token:
Token ID25
Feature activation-2.66e-01

Pos logit contributions
 irrad+0.215
 reactor+0.207
 kinderg+0.206
 rejection+0.201
 workload+0.183
Neg logit contributions
tips-0.301
inges-0.299
interrupted-0.280
verbs-0.271
ollar-0.270
TokenLic
Token ID26656
Feature activation+1.17e+00

Pos logit contributions
tips+0.488
inges+0.480
interrupted+0.444
verbs+0.431
ollar+0.427
Neg logit contributions
 irrad-0.415
 reactor-0.401
 kinderg-0.397
 rejection-0.388
 workload-0.356
Tokenenses
Token ID4541
Feature activation+4.41e-01

Pos logit contributions
 irrad+1.982
 reactor+1.926
 kinderg+1.919
 rejection+1.881
 workload+1.736
Neg logit contributions
tips-1.972
inges-1.963
interrupted-1.825
verbs-1.744
ollar-1.729
Token and
Token ID290
Feature activation-1.44e-01

Pos logit contributions
 irrad+0.641
 reactor+0.620
 kinderg+0.618
 rejection+0.601
 workload+0.547
Neg logit contributions
tips-0.843
inges-0.839
interrupted-0.787
verbs-0.755
ollar-0.752
Token acc
Token ID697
Feature activation+1.68e-01

Pos logit contributions
tips+0.304
inges+0.302
interrupted+0.283
verbs+0.274
ippers+0.273
Neg logit contributions
 irrad-0.178
 kinderg-0.171
 reactor-0.171
 rejection-0.164
ific-0.147
Tokenreditation
Token ID42845
Feature activation+2.48e+00

Pos logit contributions
 irrad+0.172
 kinderg+0.167
 reactor+0.164
 rejection+0.156
 teamwork+0.136
Neg logit contributions
inges-0.395
tips-0.391
interrupted-0.368
ippers-0.363
verbs-0.360
Token aren
Token ID3588
Feature activation+3.49e-02

Pos logit contributions
 irrad+3.531
 reactor+3.421
 kinderg+3.410
 rejection+3.351
 workload+3.049
Neg logit contributions
tips-4.813
inges-4.791
interrupted-4.517
verbs-4.341
ollar-4.309
Token
Token ID447
Feature activation-7.40e-02

Pos logit contributions
 irrad+0.047
 reactor+0.045
 kinderg+0.045
 rejection+0.044
 workload+0.040
Neg logit contributions
tips-0.071
inges-0.071
interrupted-0.066
verbs-0.064
ollar-0.064
Token
Token ID247
Feature activation+7.21e-02

Pos logit contributions
inges+0.163
tips+0.163
interrupted+0.154
verbs+0.149
ollar+0.148
Neg logit contributions
 irrad-0.086
 reactor-0.083
 kinderg-0.082
 rejection-0.080
 workload-0.071
Tokent
Token ID83
Feature activation-3.45e-01

Pos logit contributions
 irrad+0.141
 reactor+0.137
 kinderg+0.137
 rejection+0.134
 workload+0.126
Neg logit contributions
tips-0.103
inges-0.102
interrupted-0.094
verbs-0.088
ollar-0.088
Token required
Token ID2672
Feature activation-1.10e-01

Pos logit contributions
inges+0.671
tips+0.671
interrupted+0.629
verbs+0.602
ollar+0.602
Neg logit contributions
 irrad-0.487
 reactor-0.471
 kinderg-0.470
 rejection-0.455
 workload-0.412
Token CPUs
Token ID32340
Feature activation+1.54e+00

Pos logit contributions
 irrad+0.247
 reactor+0.238
 kinderg+0.238
 rejection+0.232
 teamwork+0.209
Neg logit contributions
tips-0.327
inges-0.327
interrupted-0.305
verbs-0.294
ollar-0.291
Token and
Token ID290
Feature activation+1.91e-01

Pos logit contributions
 irrad+2.583
 reactor+2.502
 kinderg+2.479
 rejection+2.445
 workload+2.281
Neg logit contributions
tips-2.602
inges-2.600
interrupted-2.420
verbs-2.318
ollar-2.297
Token memory
Token ID4088
Feature activation+7.35e-01

Pos logit contributions
 irrad+0.276
 kinderg+0.267
 reactor+0.266
 rejection+0.259
 teamwork+0.234
Neg logit contributions
tips-0.366
inges-0.366
interrupted-0.342
verbs-0.329
ippers-0.326
Token (
Token ID357
Feature activation+5.36e-01

Pos logit contributions
 irrad+1.182
 reactor+1.148
 kinderg+1.141
 rejection+1.120
 workload+1.034
Neg logit contributions
tips-1.297
inges-1.294
interrupted-1.205
verbs-1.156
ollar-1.148
TokenIntel
Token ID24123
Feature activation+2.12e+00

Pos logit contributions
 irrad+0.886
 reactor+0.859
 kinderg+0.857
 rejection+0.839
 workload+0.771
Neg logit contributions
tips-0.926
inges-0.920
interrupted-0.859
verbs-0.821
ollar-0.813
Token VT
Token ID32751
Feature activation+2.47e+00

Pos logit contributions
 irrad+3.338
 reactor+3.243
 kinderg+3.218
 rejection+3.175
 workload+2.939
Neg logit contributions
inges-3.750
tips-3.748
interrupted-3.496
ollar-3.373
verbs-3.311
Token-
Token ID12
Feature activation+9.66e-01

Pos logit contributions
 irrad+4.170
 kinderg+4.139
 reactor+4.100
 rejection+4.031
 workload+3.786
Neg logit contributions
tips-4.025
inges-4.021
interrupted-3.691
ollar-3.669
ippers-3.561
Tokenx
Token ID87
Feature activation+1.66e+00

Pos logit contributions
 irrad+1.591
 reactor+1.552
 kinderg+1.537
 rejection+1.505
 workload+1.382
Neg logit contributions
tips-1.703
inges-1.673
interrupted-1.562
verbs-1.486
ippers-1.465
Token,
Token ID11
Feature activation+7.32e-01

Pos logit contributions
 irrad+2.367
 reactor+2.286
 kinderg+2.279
 rejection+2.222
 teamwork+2.002
Neg logit contributions
tips-3.233
inges-3.217
interrupted-3.015
verbs-2.892
ollar-2.873
Token AMD
Token ID10324
Feature activation+1.10e+00

Pos logit contributions
 irrad+1.135
 reactor+1.102
 kinderg+1.095
 rejection+1.073
 workload+0.985
Neg logit contributions
tips-1.333
inges-1.331
interrupted-1.241
verbs-1.191
ollar-1.184
Token-
Token ID12
Feature activation+8.30e-01

Pos logit contributions
 irrad+1.682
 reactor+1.634
 kinderg+1.605
 rejection+1.595
 workload+1.470
Neg logit contributions
inges-2.046
tips-2.039
interrupted-1.901
ollar-1.818
verbs-1.809
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
 irrad+1.342
 reactor+1.300
 kinderg+1.296
 rejection+1.267
 workload+1.158
Neg logit contributions
tips-1.601
inges-1.596
interrupted-1.492
verbs-1.431
ollar-1.420
Token Likewise
Token ID22660
Feature activation-5.20e-01

Pos logit contributions
tips+0.022
inges+0.022
interrupted+0.021
verbs+0.020
ollar+0.019
Neg logit contributions
 irrad-0.024
 reactor-0.023
 kinderg-0.023
 rejection-0.022
 workload-0.021
Token,
Token ID11
Feature activation-2.49e-01

Pos logit contributions
tips+0.929
inges+0.919
interrupted+0.857
verbs+0.824
ippers+0.819
Neg logit contributions
 irrad-0.824
 reactor-0.796
 kinderg-0.795
 rejection-0.769
 workload-0.705
Token any
Token ID597
Feature activation+7.71e-02

Pos logit contributions
tips+0.462
inges+0.458
interrupted+0.427
verbs+0.411
ippers+0.409
Neg logit contributions
 irrad-0.377
 reactor-0.365
 kinderg-0.363
 rejection-0.352
 workload-0.321
Token Linux
Token ID7020
Feature activation+1.18e+00

Pos logit contributions
 irrad+0.102
 kinderg+0.098
 reactor+0.098
 rejection+0.094
 teamwork+0.084
Neg logit contributions
tips-0.157
inges-0.156
interrupted-0.147
verbs-0.142
ollar-0.141
Token distribution
Token ID6082
Feature activation+2.45e+00

Pos logit contributions
 irrad+1.528
 reactor+1.471
 kinderg+1.464
 rejection+1.426
 workload+1.280
Neg logit contributions
tips-2.469
inges-2.462
interrupted-2.321
verbs-2.239
ollar-2.224
Token with
Token ID351
Feature activation+3.27e-02

Pos logit contributions
 irrad+3.330
 reactor+3.216
 kinderg+3.173
 rejection+3.130
 workload+2.836
Neg logit contributions
inges-4.918
tips-4.915
interrupted-4.611
verbs-4.456
ollar-4.431
Token the
Token ID262
Feature activation-1.39e-02

Pos logit contributions
 irrad+0.049
 reactor+0.048
 kinderg+0.047
 rejection+0.046
 workload+0.042
Neg logit contributions
inges-0.061
tips-0.061
interrupted-0.057
verbs-0.054
ollar-0.054
Token Flat
Token ID21939
Feature activation+6.38e-01

Pos logit contributions
tips+0.025
inges+0.025
interrupted+0.024
verbs+0.023
ollar+0.022
Neg logit contributions
 irrad-0.022
 reactor-0.021
 kinderg-0.021
 rejection-0.020
 workload-0.019
Tokenpak
Token ID41091
Feature activation-3.04e-01

Pos logit contributions
 irrad+1.021
 reactor+0.990
 kinderg+0.987
 rejection+0.965
 workload+0.886
Neg logit contributions
tips-1.134
inges-1.128
interrupted-1.052
verbs-1.009
ollar-1.001
Token software
Token ID3788
Feature activation+1.68e-01

Pos logit contributions
tips+0.633
inges+0.632
interrupted+0.593
verbs+0.575
ippers+0.570
Neg logit contributions
 irrad-0.387
 kinderg-0.370
 reactor-0.367
 rejection-0.356
 workload-0.317
Token to
Token ID284
Feature activation+1.76e-01

Pos logit contributions
tips+0.701
inges+0.699
interrupted+0.654
verbs+0.626
ollar+0.623
Neg logit contributions
 irrad-0.571
 kinderg-0.554
 reactor-0.550
 rejection-0.539
 workload-0.486
Token keep
Token ID1394
Feature activation+1.54e-01

Pos logit contributions
 irrad+0.258
 reactor+0.250
 kinderg+0.249
 rejection+0.243
 workload+0.221
Neg logit contributions
tips-0.336
inges-0.335
interrupted-0.314
verbs-0.301
ollar-0.299
Token Red
Token ID2297
Feature activation+1.51e+00

Pos logit contributions
 irrad+0.227
 kinderg+0.220
 reactor+0.219
 rejection+0.214
 workload+0.194
Neg logit contributions
tips-0.290
inges-0.289
interrupted-0.270
verbs-0.260
ollar-0.259
Token Hat
Token ID10983
Feature activation+2.82e+00

Pos logit contributions
 irrad+2.317
 reactor+2.245
 kinderg+2.233
 rejection+2.187
 workload+2.000
Neg logit contributions
tips-2.793
inges-2.785
interrupted-2.609
verbs-2.498
ollar-2.477
Token or
Token ID393
Feature activation+6.39e-01

Pos logit contributions
 irrad+4.305
 reactor+4.204
 kinderg+4.128
 rejection+4.013
 workload+3.767
Neg logit contributions
inges-5.166
tips-5.110
interrupted-4.852
ollar-4.582
verbs-4.572
Token CentOS
Token ID32418
Feature activation+2.44e+00

Pos logit contributions
 irrad+0.964
 reactor+0.933
 kinderg+0.929
 rejection+0.909
 workload+0.828
Neg logit contributions
tips-1.190
inges-1.186
interrupted-1.109
verbs-1.065
ollar-1.057
Token servers
Token ID9597
Feature activation+1.45e+00

Pos logit contributions
 irrad+3.805
 reactor+3.708
 kinderg+3.666
 rejection+3.559
 workload+3.334
Neg logit contributions
inges-4.423
tips-4.400
interrupted-4.125
verbs-3.944
ollar-3.910
Token operating
Token ID5361
Feature activation+5.73e-01

Pos logit contributions
 irrad+2.359
 reactor+2.289
 kinderg+2.272
 rejection+2.220
 workload+2.063
Neg logit contributions
tips-2.543
inges-2.539
interrupted-2.365
verbs-2.264
ollar-2.243
Token smoothly
Token ID21461
Feature activation-4.28e-01

Pos logit contributions
 irrad+0.877
 reactor+0.850
 kinderg+0.848
 rejection+0.830
 workload+0.751
Neg logit contributions
tips-1.054
inges-1.047
interrupted-0.979
verbs-0.940
ollar-0.935
Token and
Token ID290
Feature activation+3.99e-02

Pos logit contributions
tips+0.762
inges+0.758
interrupted+0.708
verbs+0.676
ollar+0.673
Neg logit contributions
 irrad-0.675
 kinderg-0.656
 reactor-0.655
 rejection-0.640
 teamwork-0.581
Token you
Token ID345
Feature activation+3.35e-01

Pos logit contributions
 irrad+0.057
 kinderg+0.055
 reactor+0.055
 rejection+0.054
 workload+0.048
Neg logit contributions
tips-0.077
inges-0.077
interrupted-0.072
verbs-0.069
ollar-0.069
Token claiming
Token ID8512
Feature activation+6.86e-02

Pos logit contributions
 irrad+0.000
 reactor+0.000
 kinderg+0.000
 rejection+0.000
 workload+0.000
Neg logit contributions
tips-0.000
inges-0.000
interrupted-0.000
verbs-0.000
ollar-0.000
Token "
Token ID366
Feature activation+3.15e-03

Pos logit contributions
 irrad+0.102
 reactor+0.099
 kinderg+0.099
 rejection+0.095
 workload+0.088
Neg logit contributions
tips-0.129
inges-0.128
interrupted-0.120
verbs-0.115
ollar-0.115
Tokenmultiple
Token ID48101
Feature activation+4.85e-01

Pos logit contributions
 irrad+0.005
 reactor+0.005
 kinderg+0.005
 rejection+0.005
 workload+0.005
Neg logit contributions
tips-0.005
inges-0.005
interrupted-0.005
verbs-0.005
ollar-0.005
Token"
Token ID1
Feature activation+9.21e-01

Pos logit contributions
 irrad+0.646
 reactor+0.621
 kinderg+0.619
 rejection+0.599
 workload+0.539
Neg logit contributions
tips-0.987
inges-0.983
interrupted-0.922
verbs-0.895
ollar-0.885
Token Linux
Token ID7020
Feature activation+1.14e+00

Pos logit contributions
 irrad+1.228
 reactor+1.184
 kinderg+1.178
 rejection+1.147
 workload+1.033
Neg logit contributions
tips-1.878
inges-1.872
interrupted-1.762
verbs-1.699
ollar-1.687
Token distributions
Token ID24570
Feature activation+2.43e+00

Pos logit contributions
 irrad+1.628
 reactor+1.573
 kinderg+1.568
 rejection+1.531
 workload+1.391
Neg logit contributions
tips-2.229
inges-2.221
interrupted-2.089
verbs-2.007
ollar-1.994
Token are
Token ID389
Feature activation+1.92e-01

Pos logit contributions
 irrad+3.379
 reactor+3.241
 kinderg+3.209
 rejection+3.196
 workload+2.889
Neg logit contributions
inges-4.808
tips-4.799
interrupted-4.511
verbs-4.335
ollar-4.321
Token collaborating
Token ID38152
Feature activation+3.74e-01

Pos logit contributions
 irrad+0.295
 reactor+0.286
 kinderg+0.286
 rejection+0.278
 workload+0.254
Neg logit contributions
tips-0.350
inges-0.350
interrupted-0.327
verbs-0.313
ollar-0.311
Token on
Token ID319
Feature activation-1.69e-01

Pos logit contributions
 irrad+0.571
 reactor+0.553
 kinderg+0.552
 rejection+0.540
 workload+0.492
Neg logit contributions
tips-0.690
inges-0.687
interrupted-0.641
verbs-0.616
ollar-0.612
Token supporting
Token ID6493
Feature activation-1.39e-01

Pos logit contributions
tips+0.310
inges+0.308
interrupted+0.287
verbs+0.276
ollar+0.275
Neg logit contributions
 irrad-0.259
 reactor-0.250
 kinderg-0.250
 rejection-0.242
 workload-0.222
Token Snap
Token ID16026
Feature activation+4.61e-02

Pos logit contributions
tips+0.253
inges+0.252
interrupted+0.236
verbs+0.226
ollar+0.225
Neg logit contributions
 irrad-0.215
 reactor-0.209
 kinderg-0.208
 rejection-0.202
 workload-0.185
Token guard
Token ID4860
Feature activation+4.13e-02

Pos logit contributions
tips+0.040
inges+0.040
interrupted+0.037
verbs+0.036
ollar+0.036
Neg logit contributions
 irrad-0.028
 reactor-0.026
 kinderg-0.026
 rejection-0.026
 workload-0.023
Token to
Token ID284
Feature activation+1.83e-01

Pos logit contributions
 irrad+0.068
 reactor+0.065
 kinderg+0.065
 rejection+0.064
 workload+0.059
Neg logit contributions
tips-0.072
inges-0.072
interrupted-0.067
verbs-0.064
ollar-0.064
Token stop
Token ID2245
Feature activation-6.02e-01

Pos logit contributions
 irrad+0.251
 kinderg+0.242
 reactor+0.242
 rejection+0.235
 workload+0.213
Neg logit contributions
tips-0.366
inges-0.365
interrupted-0.343
verbs-0.330
ollar-0.329
Token Laura
Token ID16753
Feature activation-5.30e-01

Pos logit contributions
tips+1.107
inges+1.106
interrupted+1.035
verbs+0.986
ollar+0.986
Neg logit contributions
 irrad-0.912
 reactor-0.879
 kinderg-0.878
 rejection-0.852
 workload-0.777
Token from
Token ID422
Feature activation+8.71e-01

Pos logit contributions
tips+0.892
inges+0.890
interrupted+0.824
verbs+0.787
ollar+0.782
Neg logit contributions
 irrad-0.892
 kinderg-0.865
 reactor-0.865
 rejection-0.845
 workload-0.777
Token overwhelming
Token ID9721
Feature activation+2.42e+00

Pos logit contributions
 irrad+1.280
 reactor+1.238
 kinderg+1.231
 rejection+1.204
 workload+1.096
Neg logit contributions
tips-1.662
inges-1.655
interrupted-1.552
verbs-1.493
ollar-1.480
Token them
Token ID606
Feature activation+1.35e-01

Pos logit contributions
 irrad+3.023
 reactor+2.915
 kinderg+2.900
 rejection+2.833
 workload+2.534
Neg logit contributions
tips-5.143
inges-5.112
interrupted-4.847
verbs-4.679
ollar-4.630
Token quickly
Token ID2952
Feature activation-3.86e-01

Pos logit contributions
 irrad+0.220
 reactor+0.214
 kinderg+0.213
 rejection+0.209
 workload+0.191
Neg logit contributions
tips-0.236
inges-0.236
interrupted-0.219
verbs-0.210
ollar-0.209
Token.
Token ID13
Feature activation+2.22e-01

Pos logit contributions
tips+0.679
inges+0.676
interrupted+0.628
verbs+0.603
ollar+0.599
Neg logit contributions
 irrad-0.625
 kinderg-0.606
 reactor-0.605
 rejection-0.591
 workload-0.542
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
 irrad+0.381
 reactor+0.370
 kinderg+0.369
 rejection+0.361
 workload+0.334
Neg logit contributions
tips-0.368
inges-0.366
interrupted-0.339
verbs-0.324
ollar-0.322
Token
Token ID198
Feature activation+2.45e-01

Pos logit contributions
 irrad+0.190
 reactor+0.184
 kinderg+0.183
 rejection+0.180
 workload+0.166
Neg logit contributions
tips-0.195
inges-0.194
interrupted-0.180
verbs-0.172
ollar-0.171
Token Goodman
Token ID33318
Feature activation+3.18e-02

Pos logit contributions
inges+1.222
tips+1.217
interrupted+1.134
verbs+1.090
ollar+1.088
Neg logit contributions
 irrad-1.049
 kinderg-1.014
 reactor-1.009
 rejection-0.988
 workload-0.907
Token
Token ID198
Feature activation-1.43e-01

Pos logit contributions
 irrad+0.053
 kinderg+0.052
 reactor+0.052
 rejection+0.051
 workload+0.047
Neg logit contributions
tips-0.054
inges-0.054
interrupted-0.050
verbs-0.048
ollar-0.047
Token
Token ID198
Feature activation-1.87e-01

Pos logit contributions
tips+0.243
inges+0.240
interrupted+0.223
verbs+0.214
ollar+0.213
Neg logit contributions
 irrad-0.242
 reactor-0.235
 kinderg-0.235
 rejection-0.229
 workload-0.212
Token1994
Token ID22666
Feature activation-1.39e-01

Pos logit contributions
tips+0.428
inges+0.422
interrupted+0.406
ippers+0.388
verbs+0.388
Neg logit contributions
 irrad-0.212
 kinderg-0.203
 reactor-0.202
 rejection-0.195
 workload-0.173
Token John
Token ID1757
Feature activation-5.52e-01

Pos logit contributions
tips+0.237
inges+0.237
interrupted+0.219
verbs+0.210
ollar+0.209
Neg logit contributions
 irrad-0.232
 kinderg-0.225
 reactor-0.225
 rejection-0.219
 workload-0.203
Token Lar
Token ID25577
Feature activation+2.42e+00

Pos logit contributions
inges+1.009
tips+1.006
interrupted+0.938
verbs+0.901
ollar+0.899
Neg logit contributions
 irrad-0.850
 kinderg-0.821
 reactor-0.818
 rejection-0.798
 workload-0.733
Tokenro
Token ID305
Feature activation+7.83e-02

Pos logit contributions
 irrad+3.623
 reactor+3.541
 kinderg+3.522
 rejection+3.416
ific+3.132
Neg logit contributions
tips-4.499
inges-4.454
interrupted-4.182
verbs-3.999
ollar-3.949
Tokenqu
Token ID421
Feature activation-9.67e-01

Pos logit contributions
 irrad+0.116
 reactor+0.112
 kinderg+0.112
 rejection+0.109
 workload+0.100
Neg logit contributions
tips-0.147
inges-0.147
interrupted-0.138
verbs-0.132
ollar-0.131
Tokenette
Token ID5857
Feature activation-1.01e+00

Pos logit contributions
inges+1.134
tips+1.123
interrupted+1.018
verbs+0.948
ollar+0.945
Neg logit contributions
 irrad-2.141
 reactor-2.086
 kinderg-2.083
 rejection-2.047
 workload-1.937
Token
Token ID198
Feature activation-1.17e-01

Pos logit contributions
inges+1.811
tips+1.806
interrupted+1.712
verbs+1.613
ollar+1.613
Neg logit contributions
 irrad-1.565
 reactor-1.510
 kinderg-1.508
 rejection-1.474
 workload-1.363
Token
Token ID198
Feature activation-1.58e-01

Pos logit contributions
tips+0.197
inges+0.196
interrupted+0.182
verbs+0.174
ollar+0.173
Neg logit contributions
 irrad-0.197
 reactor-0.192
 kinderg-0.191
 rejection-0.187
 workload-0.173
Token.
Token ID13
Feature activation-4.96e-01

Pos logit contributions
tips+1.515
inges+1.486
interrupted+1.372
verbs+1.327
ollar+1.325
Neg logit contributions
 irrad-1.442
 reactor-1.399
 kinderg-1.392
 rejection-1.366
 workload-1.259
Tokenyoutube
Token ID11604
Feature activation-1.83e+00

Pos logit contributions
tips+0.704
inges+0.698
interrupted+0.643
verbs+0.606
ollar+0.600
Neg logit contributions
 irrad-0.976
 reactor-0.952
 kinderg-0.948
 rejection-0.933
 workload-0.871
Token.
Token ID13
Feature activation-4.53e-01

Pos logit contributions
tips+3.057
inges+2.982
interrupted+2.736
verbs+2.657
ollar+2.651
Neg logit contributions
 irrad-3.149
 reactor-3.060
 kinderg-3.040
 rejection-2.985
���-2.824
Tokencom
Token ID785
Feature activation-1.73e+00

Pos logit contributions
tips+0.653
inges+0.645
interrupted+0.590
verbs+0.562
ollar+0.557
Neg logit contributions
 irrad-0.885
 reactor-0.861
 kinderg-0.858
 rejection-0.844
 workload-0.789
Token/
Token ID14
Feature activation-8.99e-02

Pos logit contributions
tips+2.764
inges+2.721
interrupted+2.520
verbs+2.432
ollar+2.412
Neg logit contributions
 irrad-3.053
 reactor-2.949
 kinderg-2.943
 rejection-2.885
 workload-2.667
Tokenwatch
Token ID8340
Feature activation-3.04e+00

Pos logit contributions
tips+0.117
inges+0.116
interrupted+0.105
verbs+0.099
ollar+0.098
Neg logit contributions
 irrad-0.186
 reactor-0.182
 kinderg-0.181
 rejection-0.178
 workload-0.168
Token?
Token ID30
Feature activation-6.92e-01

Pos logit contributions
tips+5.185
inges+5.036
interrupted+4.702
ollar+4.548
verbs+4.536
Neg logit contributions
 irrad-4.978
 kinderg-4.796
 reactor-4.786
 rejection-4.688
���-4.668
Tokenv
Token ID85
Feature activation-3.62e-01

Pos logit contributions
tips+1.109
inges+1.103
interrupted+1.022
verbs+0.976
ollar+0.965
Neg logit contributions
 irrad-1.231
 reactor-1.195
 kinderg-1.191
 rejection-1.172
 workload-1.081
Token=
Token ID28
Feature activation-1.01e+00

Pos logit contributions
tips+0.477
inges+0.474
interrupted+0.427
verbs+0.406
ollar+0.404
Neg logit contributions
 irrad-0.747
 kinderg-0.729
 reactor-0.726
 rejection-0.716
 workload-0.667
Token2
Token ID17
Feature activation-4.29e-01

Pos logit contributions
tips+1.698
inges+1.685
interrupted+1.558
verbs+1.500
ollar+1.486
Neg logit contributions
 irrad-1.711
 reactor-1.660
 kinderg-1.656
 rejection-1.623
 workload-1.501
Tokeng
Token ID70
Feature activation-5.41e-02

Pos logit contributions
tips+0.707
inges+0.702
interrupted+0.650
verbs+0.623
ollar+0.617
Neg logit contributions
 irrad-0.744
 reactor-0.723
 kinderg-0.721
 rejection-0.707
 workload-0.655
Tokenaging
Token ID3039
Feature activation+5.62e-01

Pos logit contributions
inges+0.190
tips+0.188
interrupted+0.175
ippers+0.170
verbs+0.169
Neg logit contributions
 irrad-0.184
 reactor-0.179
 kinderg-0.178
 rejection-0.174
 workload-0.161
Token
Token ID447
Feature activation+1.45e-01

Pos logit contributions
 irrad+0.988
 reactor+0.961
 kinderg+0.958
 rejection+0.939
 workload+0.869
Neg logit contributions
tips-0.908
inges-0.905
interrupted-0.836
verbs-0.798
ollar-0.791
Token
Token ID251
Feature activation-3.09e-01

Pos logit contributions
 irrad+0.228
 reactor+0.221
 kinderg+0.220
 rejection+0.215
 workload+0.198
Neg logit contributions
tips-0.261
inges-0.260
interrupted-0.243
verbs-0.233
ollar-0.231
Token in
Token ID287
Feature activation+3.76e-02

Pos logit contributions
tips+0.507
inges+0.505
interrupted+0.470
verbs+0.446
ollar+0.442
Neg logit contributions
 irrad-0.537
 reactor-0.523
 kinderg-0.521
 rejection-0.511
 workload-0.472
Token the
Token ID262
Feature activation-2.32e-01

Pos logit contributions
 irrad+0.058
 reactor+0.056
 kinderg+0.056
 rejection+0.055
 workload+0.050
Neg logit contributions
tips-0.069
inges-0.068
interrupted-0.064
verbs-0.061
ollar-0.061
Token Yuk
Token ID19760
Feature activation-2.81e+00

Pos logit contributions
tips+0.429
inges+0.427
interrupted+0.397
verbs+0.381
ollar+0.379
Neg logit contributions
 irrad-0.353
 reactor-0.342
 kinderg-0.341
 rejection-0.333
 workload-0.306
Tokenon
Token ID261
Feature activation-9.83e-01

Pos logit contributions
tips+4.477
inges+4.395
interrupted+4.030
ippers+3.849
ollar+3.846
Neg logit contributions
 irrad-5.021
 kinderg-4.911
 reactor-4.851
 rejection-4.695
 workload-4.451
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
tips+1.694
inges+1.690
interrupted+1.566
ollar+1.491
verbs+1.481
Neg logit contributions
 irrad-1.605
 kinderg-1.565
 reactor-1.560
 rejection-1.523
 workload-1.410
Token H
Token ID367
Feature activation+1.62e-01

Pos logit contributions
 irrad+0.205
 reactor+0.199
 kinderg+0.198
 rejection+0.194
 workload+0.180
Neg logit contributions
tips-0.189
inges-0.188
interrupted-0.174
verbs-0.166
ollar-0.165
Tokenordes
Token ID35770
Feature activation+1.33e+00

Pos logit contributions
 irrad+0.308
 reactor+0.301
 kinderg+0.300
 rejection+0.294
 workload+0.274
Neg logit contributions
inges-0.238
tips-0.237
interrupted-0.217
verbs-0.206
ollar-0.205
Token will
Token ID481
Feature activation+2.09e-01

Pos logit contributions
 irrad+1.848
 reactor+1.783
 kinderg+1.772
 rejection+1.732
 workload+1.567
Neg logit contributions
tips-2.637
inges-2.630
interrupted-2.471
verbs-2.383
ollar-2.362
Token Liberals
Token ID20162
Feature activation-3.51e-02

Pos logit contributions
 irrad+0.120
 kinderg+0.116
 reactor+0.116
 rejection+0.112
 workload+0.101
Neg logit contributions
tips-0.170
inges-0.170
interrupted-0.159
verbs-0.153
ollar-0.152
Token will
Token ID481
Feature activation+2.11e-01

Pos logit contributions
tips+0.065
inges+0.065
interrupted+0.061
verbs+0.058
ollar+0.058
Neg logit contributions
 irrad-0.053
 reactor-0.052
 kinderg-0.051
 rejection-0.050
 workload-0.046
Token tax
Token ID1687
Feature activation-3.65e-01

Pos logit contributions
 irrad+0.307
 reactor+0.296
 kinderg+0.295
 rejection+0.289
 workload+0.263
Neg logit contributions
tips-0.405
inges-0.404
interrupted-0.379
verbs-0.364
ollar-0.361
Token their
Token ID511
Feature activation-3.04e-01

Pos logit contributions
tips+0.754
inges+0.753
interrupted+0.707
verbs+0.682
ollar+0.681
Neg logit contributions
 irrad-0.472
 reactor-0.455
 kinderg-0.453
 rejection-0.441
 workload-0.395
Token life
Token ID1204
Feature activation-5.04e-01

Pos logit contributions
tips+0.570
inges+0.568
interrupted+0.531
verbs+0.510
ollar+0.507
Neg logit contributions
 irrad-0.454
 reactor-0.440
 kinderg-0.438
 rejection-0.428
 workload-0.390
Token savings
Token ID10653
Feature activation-2.76e+00

Pos logit contributions
tips+0.842
inges+0.834
interrupted+0.773
verbs+0.739
ollar+0.731
Neg logit contributions
 irrad-0.858
 reactor-0.835
 kinderg-0.833
 rejection-0.817
 workload-0.753
Token.
Token ID13
Feature activation+7.38e-02

Pos logit contributions
tips+4.663
inges+4.585
interrupted+4.183
ollar+4.091
verbs+4.015
Neg logit contributions
 irrad-4.497
 kinderg-4.387
 reactor-4.345
 rejection-4.232
 workload-3.934
Token
Token ID198
Feature activation-5.50e-02

Pos logit contributions
 irrad+0.124
 reactor+0.121
 kinderg+0.120
 rejection+0.118
 workload+0.109
Neg logit contributions
tips-0.125
inges-0.124
interrupted-0.115
verbs-0.110
ollar-0.109
Token
Token ID198
Feature activation+7.25e-02

Pos logit contributions
tips+0.093
inges+0.093
interrupted+0.086
verbs+0.083
ollar+0.082
Neg logit contributions
 irrad-0.092
 reactor-0.090
 kinderg-0.089
 rejection-0.087
 workload-0.081
Token"
Token ID1
Feature activation+2.95e-01

Pos logit contributions
 irrad+0.105
 reactor+0.101
 kinderg+0.101
 rejection+0.098
 workload+0.090
Neg logit contributions
tips-0.140
inges-0.140
interrupted-0.131
verbs-0.126
ollar-0.125
TokenA
Token ID32
Feature activation+2.14e-01

Pos logit contributions
 irrad+0.497
 reactor+0.483
 kinderg+0.481
 rejection+0.471
 workload+0.435
Neg logit contributions
tips-0.499
inges-0.497
interrupted-0.462
verbs-0.442
ollar-0.438
Token from
Token ID422
Feature activation+3.47e-01

Pos logit contributions
 irrad+0.105
 kinderg+0.102
 reactor+0.102
 rejection+0.099
 workload+0.091
Neg logit contributions
tips-0.131
inges-0.131
interrupted-0.122
verbs-0.118
ollar-0.117
Token him
Token ID683
Feature activation-1.82e-01

Pos logit contributions
 irrad+0.555
 reactor+0.538
 kinderg+0.536
 rejection+0.525
 workload+0.482
Neg logit contributions
tips-0.615
inges-0.613
interrupted-0.571
verbs-0.547
ollar-0.543
Token,
Token ID11
Feature activation-1.61e-01

Pos logit contributions
tips+0.325
inges+0.324
interrupted+0.302
verbs+0.289
ollar+0.287
Neg logit contributions
 irrad-0.289
 reactor-0.281
 kinderg-0.280
 rejection-0.274
 workload-0.251
Token and
Token ID290
Feature activation-2.39e-01

Pos logit contributions
tips+0.281
inges+0.278
interrupted+0.260
verbs+0.248
ollar+0.247
Neg logit contributions
 irrad-0.262
 reactor-0.255
 kinderg-0.254
 rejection-0.249
 workload-0.229
Token he
Token ID339
Feature activation-3.01e-02

Pos logit contributions
tips+0.413
inges+0.410
interrupted+0.382
verbs+0.365
ollar+0.363
Neg logit contributions
 irrad-0.393
 reactor-0.382
 kinderg-0.381
 rejection-0.373
 workload-0.344
Token masturb
Token ID22938
Feature activation-2.58e+00

Pos logit contributions
tips+0.052
inges+0.052
interrupted+0.048
verbs+0.046
ollar+0.046
Neg logit contributions
 irrad-0.050
 reactor-0.048
 kinderg-0.048
 rejection-0.047
 workload-0.043
Tokenated
Token ID515
Feature activation-1.02e+00

Pos logit contributions
tips+3.567
inges+3.560
interrupted+3.221
ippers+3.053
ollar+3.035
Neg logit contributions
 irrad-5.058
 kinderg-4.998
 reactor-4.970
 rejection-4.911
 workload-4.603
Token me
Token ID502
Feature activation-7.00e-02

Pos logit contributions
tips+1.772
inges+1.757
interrupted+1.639
verbs+1.562
ollar+1.549
Neg logit contributions
 irrad-1.657
 reactor-1.608
 kinderg-1.607
 rejection-1.572
 workload-1.446
Token at
Token ID379
Feature activation-4.69e-01

Pos logit contributions
tips+0.124
inges+0.124
interrupted+0.115
verbs+0.110
ollar+0.110
Neg logit contributions
 irrad-0.112
 reactor-0.108
 kinderg-0.108
 rejection-0.106
 workload-0.097
Token the
Token ID262
Feature activation-8.83e-02

Pos logit contributions
tips+0.889
inges+0.883
interrupted+0.823
verbs+0.793
ippers+0.787
Neg logit contributions
 irrad-0.692
 reactor-0.666
 kinderg-0.664
 rejection-0.648
 workload-0.593
Token end
Token ID886
Feature activation-8.96e-01

Pos logit contributions
tips+0.170
inges+0.169
interrupted+0.158
verbs+0.152
ollar+0.151
Neg logit contributions
 irrad-0.128
 reactor-0.124
 kinderg-0.124
 rejection-0.120
 workload-0.109
Token Additional
Token ID15891
Feature activation-8.81e-01

Pos logit contributions
tips+0.525
inges+0.520
interrupted+0.481
verbs+0.464
ollar+0.459
Neg logit contributions
 irrad-0.531
 kinderg-0.514
 reactor-0.512
 rejection-0.502
 workload-0.461
Token filtering
Token ID25431
Feature activation-5.22e-01

Pos logit contributions
tips+1.731
inges+1.714
interrupted+1.592
verbs+1.561
ollar+1.530
Neg logit contributions
 irrad-1.231
 kinderg-1.191
 reactor-1.177
 rejection-1.143
 teamwork-1.029
Token is
Token ID318
Feature activation-1.95e-01

Pos logit contributions
tips+0.852
inges+0.837
interrupted+0.773
verbs+0.751
ippers+0.733
Neg logit contributions
 irrad-0.908
 kinderg-0.883
 reactor-0.881
 rejection-0.858
 teamwork-0.790
Token done
Token ID1760
Feature activation-4.41e-01

Pos logit contributions
tips+0.328
inges+0.326
interrupted+0.302
verbs+0.290
ollar+0.286
Neg logit contributions
 irrad-0.330
 kinderg-0.321
 reactor-0.320
 rejection-0.314
 workload-0.288
Token get
Token ID651
Feature activation-5.03e-01

Pos logit contributions
tips+0.778
inges+0.772
interrupted+0.718
verbs+0.691
ollar+0.684
Neg logit contributions
 irrad-0.708
 kinderg-0.691
 reactor-0.688
 rejection-0.672
 workload-0.614
Token rid
Token ID5755
Feature activation-2.55e+00

Pos logit contributions
tips+0.864
inges+0.861
interrupted+0.793
verbs+0.767
ollar+0.755
Neg logit contributions
 irrad-0.835
 kinderg-0.811
 reactor-0.808
 rejection-0.787
 workload-0.725
Token of
Token ID286
Feature activation-1.41e-01

Pos logit contributions
tips+4.511
inges+4.472
interrupted+4.181
verbs+4.049
ollar+4.041
Neg logit contributions
 irrad-3.954
 kinderg-3.866
 reactor-3.789
 rejection-3.638
 teamwork-3.379
Token some
Token ID617
Feature activation-4.82e-01

Pos logit contributions
tips+0.272
inges+0.271
interrupted+0.253
verbs+0.245
ollar+0.243
Neg logit contributions
 irrad-0.204
 kinderg-0.197
 reactor-0.196
 rejection-0.190
 teamwork-0.173
Token spurious
Token ID49062
Feature activation-2.72e-02

Pos logit contributions
tips+0.929
inges+0.927
interrupted+0.864
verbs+0.841
ollar+0.830
Neg logit contributions
 irrad-0.692
 kinderg-0.668
 reactor-0.666
 rejection-0.642
 teamwork-0.589
Token results
Token ID2482
Feature activation+5.70e-01

Pos logit contributions
tips+0.052
inges+0.052
interrupted+0.048
verbs+0.047
ollar+0.046
Neg logit contributions
 irrad-0.040
 kinderg-0.038
 reactor-0.038
 rejection-0.037
 workload-0.034
Token.
Token ID13
Feature activation-1.86e-01

Pos logit contributions
 irrad+0.898
 reactor+0.870
 kinderg+0.867
 rejection+0.848
 workload+0.778
Neg logit contributions
tips-1.026
inges-1.022
interrupted-0.954
verbs-0.915
ollar-0.908
Token Rice
Token ID13823
Feature activation-1.11e+00

Pos logit contributions
tips+1.392
inges+1.374
interrupted+1.263
ippers+1.219
ollar+1.209
Neg logit contributions
 irrad-1.349
 kinderg-1.305
 reactor-1.289
 rejection-1.287
 teamwork-1.195
Token,
Token ID11
Feature activation-1.46e-01

Pos logit contributions
tips+1.899
inges+1.896
interrupted+1.769
ollar+1.693
verbs+1.678
Neg logit contributions
 irrad-1.802
 kinderg-1.764
 reactor-1.757
 rejection-1.703
 workload-1.580
Token and
Token ID290
Feature activation-2.25e-01

Pos logit contributions
tips+0.259
inges+0.258
interrupted+0.240
ollar+0.229
verbs+0.229
Neg logit contributions
 irrad-0.233
 reactor-0.226
 kinderg-0.225
 rejection-0.220
 workload-0.203
Token Special
Token ID6093
Feature activation-1.34e+00

Pos logit contributions
tips+0.425
inges+0.423
interrupted+0.394
ollar+0.378
verbs+0.377
Neg logit contributions
 irrad-0.332
 reactor-0.320
 kinderg-0.319
 rejection-0.311
 workload-0.285
Token Assistant
Token ID15286
Feature activation-1.40e+00

Pos logit contributions
inges+2.435
tips+2.419
interrupted+2.273
ollar+2.178
verbs+2.161
Neg logit contributions
 irrad-2.074
 kinderg-2.015
 reactor-1.992
 rejection-1.950
 workload-1.786
Token Samantha
Token ID34778
Feature activation-2.53e+00

Pos logit contributions
tips+2.310
inges+2.290
interrupted+2.116
ollar+2.025
ippers+2.020
Neg logit contributions
 irrad-2.393
 kinderg-2.328
 reactor-2.321
 rejection-2.252
 workload-2.097
Token Power
Token ID4333
Feature activation+1.09e-01

Pos logit contributions
tips+4.658
inges+4.556
interrupted+4.283
ollar+4.175
ippers+4.166
Neg logit contributions
 irrad-3.746
 kinderg-3.620
 reactor-3.536
 rejection-3.490
 workload-3.201
Token),
Token ID828
Feature activation-6.08e-01

Pos logit contributions
 irrad+0.164
 reactor+0.159
 kinderg+0.158
 rejection+0.154
 workload+0.141
Neg logit contributions
tips-0.204
inges-0.203
interrupted-0.190
verbs-0.183
ollar-0.182
Token has
Token ID468
Feature activation+1.92e-01

Pos logit contributions
tips+0.935
inges+0.935
interrupted+0.864
ollar+0.812
verbs+0.811
Neg logit contributions
 irrad-1.102
 kinderg-1.079
 reactor-1.075
 rejection-1.055
 workload-0.982
Token been
Token ID587
Feature activation-2.10e-01

Pos logit contributions
 irrad+0.299
 kinderg+0.291
 reactor+0.290
 rejection+0.284
 workload+0.260
Neg logit contributions
tips-0.346
inges-0.346
interrupted-0.324
verbs-0.309
ollar-0.308
Token a
Token ID257
Feature activation+2.07e-01

Pos logit contributions
inges+0.393
tips+0.393
interrupted+0.370
verbs+0.352
ollar+0.352
Neg logit contributions
 irrad-0.309
 reactor-0.300
 kinderg-0.300
 rejection-0.294
 workload-0.268
Token with
Token ID351
Feature activation-2.98e-01

Pos logit contributions
tips+1.238
inges+1.232
interrupted+1.146
verbs+1.086
ollar+1.078
Neg logit contributions
 irrad-1.203
 kinderg-1.178
 reactor-1.170
 rejection-1.143
 workload-1.054
Token Eleanor
Token ID45566
Feature activation-1.45e+00

Pos logit contributions
inges+0.539
tips+0.538
interrupted+0.500
verbs+0.481
ollar+0.477
Neg logit contributions
 irrad-0.461
 reactor-0.446
 kinderg-0.445
 rejection-0.434
 workload-0.400
Token Roosevelt
Token ID19297
Feature activation-1.54e+00

Pos logit contributions
inges+2.362
tips+2.361
interrupted+2.173
ollar+2.077
verbs+2.074
Neg logit contributions
 irrad-2.488
 reactor-2.412
 kinderg-2.408
 rejection-2.356
 workload-2.178
Token to
Token ID284
Feature activation-1.10e-01

Pos logit contributions
inges+2.614
tips+2.600
interrupted+2.434
ollar+2.301
verbs+2.299
Neg logit contributions
 irrad-2.520
 kinderg-2.456
 reactor-2.448
 rejection-2.394
 workload-2.200
Token secure
Token ID5713
Feature activation+1.47e-02

Pos logit contributions
tips+0.207
inges+0.206
interrupted+0.193
verbs+0.185
ollar+0.185
Neg logit contributions
 irrad-0.162
 reactor-0.157
 kinderg-0.156
 rejection-0.152
 workload-0.139
Token luxury
Token ID13064
Feature activation-2.51e+00

Pos logit contributions
 irrad+0.022
 reactor+0.021
 kinderg+0.021
 rejection+0.020
 workload+0.019
Neg logit contributions
tips-0.028
inges-0.028
interrupted-0.026
verbs-0.025
ollar-0.025
Token travel
Token ID3067
Feature activation-1.62e+00

Pos logit contributions
tips+4.405
inges+4.367
interrupted+4.053
verbs+3.889
ippers+3.873
Neg logit contributions
 irrad-3.858
 reactor-3.684
 kinderg-3.663
 rejection-3.596
 workload-3.320
Token buses
Token ID16893
Feature activation-2.36e-01

Pos logit contributions
tips+2.769
inges+2.723
interrupted+2.535
ippers+2.441
verbs+2.407
Neg logit contributions
 irrad-2.665
 reactor-2.570
 kinderg-2.567
 rejection-2.490
 workload-2.306
Token for
Token ID329
Feature activation-2.58e-01

Pos logit contributions
tips+0.408
inges+0.406
interrupted+0.379
verbs+0.361
ippers+0.358
Neg logit contributions
 irrad-0.389
 reactor-0.378
 kinderg-0.377
 rejection-0.369
 workload-0.340
Token Eddie
Token ID19478
Feature activation+4.39e-02

Pos logit contributions
tips+0.480
inges+0.479
interrupted+0.447
verbs+0.428
ollar+0.427
Neg logit contributions
 irrad-0.386
 reactor-0.373
 kinderg-0.370
 rejection-0.363
 workload-0.331
Token Durham
Token ID30928
Feature activation-8.26e-01

Pos logit contributions
 irrad+0.073
 reactor+0.070
 kinderg+0.070
 rejection+0.069
 workload+0.063
Neg logit contributions
tips-0.075
inges-0.075
interrupted-0.070
verbs-0.067
ollar-0.066
Token,
Token ID11
Feature activation-3.31e-03

Pos logit contributions
 irrad+0.045
 reactor+0.043
 kinderg+0.043
 rejection+0.042
 workload+0.039
Neg logit contributions
tips-0.051
inges-0.051
interrupted-0.048
verbs-0.045
ollar-0.045
Token accused
Token ID5371
Feature activation-1.35e+00

Pos logit contributions
tips+0.006
inges+0.006
interrupted+0.006
ollar+0.005
verbs+0.005
Neg logit contributions
 irrad-0.005
 reactor-0.005
 kinderg-0.005
 rejection-0.005
 workload-0.004
Token of
Token ID286
Feature activation+1.12e-01

Pos logit contributions
inges+2.441
tips+2.431
interrupted+2.267
ollar+2.196
ippers+2.163
Neg logit contributions
 irrad-2.043
 reactor-1.977
 kinderg-1.960
 rejection-1.898
 teamwork-1.757
Token small
Token ID1402
Feature activation-3.57e-01

Pos logit contributions
 irrad+0.167
 reactor+0.162
 kinderg+0.161
 rejection+0.158
 workload+0.144
Neg logit contributions
tips-0.209
inges-0.208
interrupted-0.195
verbs-0.187
ollar-0.186
Token insignificant
Token ID32081
Feature activation+1.29e-01

Pos logit contributions
tips+0.548
inges+0.546
interrupted+0.503
verbs+0.479
ollar+0.474
Neg logit contributions
 irrad-0.656
 reactor-0.638
 kinderg-0.636
 rejection-0.625
 workload-0.580
Token offences
Token ID20819
Feature activation-2.51e+00

Pos logit contributions
 irrad+0.240
 reactor+0.234
 kinderg+0.233
 rejection+0.229
 workload+0.213
Neg logit contributions
tips-0.195
inges-0.195
interrupted-0.180
verbs-0.171
ollar-0.169
Token,
Token ID11
Feature activation+9.18e-02

Pos logit contributions
tips+4.271
inges+4.246
interrupted+3.919
ollar+3.769
verbs+3.698
Neg logit contributions
 irrad-3.984
 kinderg-3.881
 reactor-3.881
 rejection-3.771
 teamwork-3.486
Token like
Token ID588
Feature activation-3.17e-01

Pos logit contributions
 irrad+0.137
 reactor+0.133
 kinderg+0.132
 rejection+0.129
 workload+0.118
Neg logit contributions
tips-0.172
inges-0.172
interrupted-0.161
verbs-0.154
ollar-0.153
Token breaking
Token ID7163
Feature activation-5.37e-01

Pos logit contributions
tips+0.606
inges+0.602
interrupted+0.565
verbs+0.543
ippers+0.539
Neg logit contributions
 irrad-0.459
 reactor-0.445
 kinderg-0.440
 rejection-0.431
 workload-0.394
Token the
Token ID262
Feature activation+3.92e-02

Pos logit contributions
tips+0.997
inges+0.992
interrupted+0.923
verbs+0.891
ollar+0.884
Neg logit contributions
 irrad-0.811
 reactor-0.785
 kinderg-0.781
 rejection-0.765
 workload-0.701
Token window
Token ID4324
Feature activation-5.36e-01

Pos logit contributions
 irrad+0.053
 reactor+0.051
 kinderg+0.051
 rejection+0.050
 workload+0.045
Neg logit contributions
tips-0.079
inges-0.079
interrupted-0.074
verbs-0.072
ollar-0.071
Token'd
Token ID1549
Feature activation-2.63e-01

Pos logit contributions
tips+0.167
inges+0.166
interrupted+0.153
verbs+0.146
ollar+0.145
Neg logit contributions
 irrad-0.193
 reactor-0.188
 kinderg-0.188
 rejection-0.184
 workload-0.171
Token go
Token ID467
Feature activation+4.46e-01

Pos logit contributions
tips+0.497
inges+0.495
interrupted+0.464
verbs+0.445
ollar+0.444
Neg logit contributions
 irrad-0.387
 kinderg-0.377
 reactor-0.376
 rejection-0.367
 workload-0.334
Token some
Token ID617
Feature activation-5.01e-01

Pos logit contributions
 irrad+0.704
 reactor+0.683
 kinderg+0.679
 rejection+0.666
 workload+0.610
Neg logit contributions
tips-0.802
inges-0.800
interrupted-0.749
verbs-0.717
ollar-0.711
Tokenplace
Token ID5372
Feature activation+8.00e-02

Pos logit contributions
tips+0.898
inges+0.889
interrupted+0.825
ollar+0.793
verbs+0.791
Neg logit contributions
 irrad-0.787
 reactor-0.762
 kinderg-0.762
 rejection-0.744
 workload-0.683
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
 irrad+0.127
 reactor+0.123
 kinderg+0.123
 rejection+0.120
 workload+0.110
Neg logit contributions
tips-0.142
inges-0.141
interrupted-0.132
verbs-0.126
ollar-0.126
Token masturb
Token ID22938
Feature activation-2.43e+00

Pos logit contributions
tips+0.232
inges+0.231
interrupted+0.215
verbs+0.207
ollar+0.205
Neg logit contributions
 irrad-0.196
 reactor-0.190
 kinderg-0.190
 rejection-0.185
 workload-0.170
Tokenate
Token ID378
Feature activation-1.22e+00

Pos logit contributions
tips+2.130
inges+2.113
interrupted+1.814
ollar+1.711
verbs+1.648
Neg logit contributions
 irrad-5.946
 kinderg-5.885
 reactor-5.834
 rejection-5.774
 workload-5.487
Token and
Token ID290
Feature activation-1.00e-01

Pos logit contributions
tips+2.074
inges+2.065
interrupted+1.923
ollar+1.828
verbs+1.826
Neg logit contributions
 irrad-1.991
 reactor-1.929
 kinderg-1.927
 rejection-1.881
 workload-1.740
Token use
Token ID779
Feature activation-3.67e-01

Pos logit contributions
tips+0.184
inges+0.183
interrupted+0.171
verbs+0.164
ollar+0.163
Neg logit contributions
 irrad-0.155
 reactor-0.150
 kinderg-0.150
 rejection-0.146
 workload-0.134
Token it
Token ID340
Feature activation+1.92e-01

Pos logit contributions
tips+0.669
inges+0.665
interrupted+0.621
verbs+0.596
ollar+0.592
Neg logit contributions
 irrad-0.567
 reactor-0.550
 kinderg-0.549
 rejection-0.534
 workload-0.491
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
 irrad+0.306
 reactor+0.297
 kinderg+0.295
 rejection+0.289
 workload+0.266
Neg logit contributions
tips-0.341
inges-0.340
interrupted-0.318
verbs-0.304
ollar-0.302
Token it
Token ID340
Feature activation+1.99e-01

Pos logit contributions
tips+0.193
inges+0.192
interrupted+0.179
verbs+0.173
ollar+0.172
Neg logit contributions
 irrad-0.145
 reactor-0.140
 kinderg-0.140
 rejection-0.136
 workload-0.124
Token is
Token ID318
Feature activation-2.70e-01

Pos logit contributions
 irrad+0.336
 reactor+0.327
 kinderg+0.326
 rejection+0.319
 workload+0.294
Neg logit contributions
tips-0.336
inges-0.335
interrupted-0.311
verbs-0.297
ollar-0.295
Token hard
Token ID1327
Feature activation-3.15e-01

Pos logit contributions
tips+0.531
inges+0.525
interrupted+0.489
verbs+0.472
ollar+0.471
Neg logit contributions
 irrad-0.377
 reactor-0.365
 kinderg-0.365
 rejection-0.353
 workload-0.319
Token to
Token ID284
Feature activation+1.95e-01

Pos logit contributions
tips+0.536
inges+0.534
interrupted+0.497
verbs+0.475
ollar+0.471
Neg logit contributions
 irrad-0.526
 reactor-0.511
 kinderg-0.509
 rejection-0.499
 workload-0.460
Token get
Token ID651
Feature activation+2.64e-01

Pos logit contributions
 irrad+0.300
 reactor+0.290
 kinderg+0.288
 rejection+0.282
 workload+0.259
Neg logit contributions
tips-0.358
inges-0.357
interrupted-0.334
verbs-0.321
ollar-0.318
Token rid
Token ID5755
Feature activation-2.41e+00

Pos logit contributions
 irrad+0.382
 reactor+0.370
 kinderg+0.368
 rejection+0.357
 workload+0.326
Neg logit contributions
tips-0.508
inges-0.505
interrupted-0.474
verbs-0.455
ollar-0.453
Token of
Token ID286
Feature activation+2.29e-01

Pos logit contributions
tips+4.177
inges+4.109
interrupted+3.782
ollar+3.704
ippers+3.660
Neg logit contributions
 irrad-3.773
 kinderg-3.667
 reactor-3.655
 rejection-3.516
���-3.442
Token them
Token ID606
Feature activation-8.21e-02

Pos logit contributions
 irrad+0.329
 reactor+0.318
 kinderg+0.317
 rejection+0.308
 workload+0.280
Neg logit contributions
tips-0.443
inges-0.441
interrupted-0.414
verbs-0.398
ollar-0.396
Token.
Token ID13
Feature activation-5.69e-02

Pos logit contributions
tips+0.149
inges+0.149
interrupted+0.138
verbs+0.133
ollar+0.132
Neg logit contributions
 irrad-0.128
 reactor-0.124
 kinderg-0.124
 rejection-0.121
 workload-0.110
Token
Token ID198
Feature activation-2.06e-01

Pos logit contributions
tips+0.097
inges+0.096
interrupted+0.089
verbs+0.085
ollar+0.085
Neg logit contributions
 irrad-0.096
 reactor-0.093
 kinderg-0.093
 rejection-0.091
 workload-0.084
Token
Token ID198
Feature activation-1.09e-01

Pos logit contributions
tips+0.347
inges+0.343
interrupted+0.318
verbs+0.305
ollar+0.303
Neg logit contributions
 irrad-0.350
 reactor-0.340
 kinderg-0.339
 rejection-0.331
 workload-0.307
Token
Token ID229
Feature activation+3.08e-01

Pos logit contributions
 irrad+0.291
 reactor+0.281
 kinderg+0.279
 rejection+0.272
 workload+0.245
Neg logit contributions
tips-0.452
inges-0.449
interrupted-0.421
verbs-0.409
ollar-0.407
Token
Token ID116
Feature activation-4.68e-02

Pos logit contributions
 irrad+0.527
 reactor+0.511
 kinderg+0.509
 rejection+0.500
 workload+0.462
Neg logit contributions
tips-0.515
inges-0.513
interrupted-0.475
verbs-0.456
ollar-0.451
Token (@
Token ID4275
Feature activation-6.76e-01

Pos logit contributions
tips+0.079
inges+0.079
interrupted+0.073
verbs+0.070
ollar+0.069
Neg logit contributions
 irrad-0.079
 reactor-0.077
 kinderg-0.076
 rejection-0.075
 workload-0.069
Tokenm
Token ID76
Feature activation+7.73e-01

Pos logit contributions
tips+1.185
inges+1.170
interrupted+1.095
verbs+1.045
ollar+1.040
Neg logit contributions
 irrad-1.111
 reactor-1.077
 kinderg-1.074
 rejection-1.053
 workload-0.973
Tokenf
Token ID69
Feature activation-3.13e-01

Pos logit contributions
 irrad+1.184
 reactor+1.147
 kinderg+1.143
 rejection+1.117
 workload+1.022
Neg logit contributions
tips-1.427
inges-1.421
interrupted-1.330
verbs-1.276
ollar-1.267
Tokenlynn
Token ID12935
Feature activation-2.41e+00

Pos logit contributions
tips+0.384
inges+0.383
interrupted+0.345
verbs+0.324
ollar+0.322
Neg logit contributions
 irrad-0.672
 reactor-0.657
 kinderg-0.656
 rejection-0.646
 workload-0.608
TokenJR
Token ID44817
Feature activation-1.79e+00

Pos logit contributions
tips+3.643
inges+3.602
interrupted+3.346
verbs+3.178
ollar+3.162
Neg logit contributions
 irrad-4.437
 kinderg-4.297
 reactor-4.229
 rejection-4.176
 workload-3.925
Token)
Token ID8
Feature activation-4.56e-01

Pos logit contributions
tips+3.341
inges+3.300
interrupted+3.068
ollar+2.986
verbs+2.962
Neg logit contributions
 irrad-2.674
 kinderg-2.579
 reactor-2.576
 rejection-2.508
���-2.331
Token March
Token ID2805
Feature activation+4.43e-01

Pos logit contributions
tips+0.888
inges+0.879
interrupted+0.827
verbs+0.798
ollar+0.792
Neg logit contributions
 irrad-0.648
 reactor-0.624
 kinderg-0.619
 rejection-0.603
 workload-0.553
Token 4
Token ID604
Feature activation-4.37e-01

Pos logit contributions
 irrad+0.743
 reactor+0.722
 kinderg+0.719
 rejection+0.705
 workload+0.650
Neg logit contributions
tips-0.751
inges-0.749
interrupted-0.697
verbs-0.666
ollar-0.660
Token,
Token ID11
Feature activation+5.33e-02

Pos logit contributions
tips+0.781
inges+0.766
interrupted+0.712
verbs+0.690
ollar+0.688
Neg logit contributions
 irrad-0.698
 reactor-0.676
 kinderg-0.674
 rejection-0.658
 workload-0.608
Tokened
Token ID276
Feature activation-5.50e-03

Pos logit contributions
tips+0.200
inges+0.199
interrupted+0.187
verbs+0.177
ollar+0.176
Neg logit contributions
 irrad-0.185
 reactor-0.179
 kinderg-0.177
 rejection-0.174
 workload-0.160
TokenOut
Token ID7975
Feature activation-4.90e-02

Pos logit contributions
tips+0.010
inges+0.010
interrupted+0.009
verbs+0.009
ollar+0.009
Neg logit contributions
 irrad-0.009
 reactor-0.009
 kinderg-0.009
 rejection-0.008
 workload-0.008
TokenAtlanta
Token ID43482
Feature activation-7.31e-01

Pos logit contributions
tips+0.084
inges+0.083
interrupted+0.078
verbs+0.074
ollar+0.073
Neg logit contributions
 irrad-0.082
 reactor-0.080
 kinderg-0.079
 rejection-0.078
 workload-0.072
Token pic
Token ID8301
Feature activation-1.34e+00

Pos logit contributions
tips+1.226
inges+1.212
interrupted+1.139
verbs+1.077
ollar+1.070
Neg logit contributions
 irrad-1.246
 reactor-1.202
 kinderg-1.194
 rejection-1.175
 workload-1.082
Token.
Token ID13
Feature activation-2.00e-01

Pos logit contributions
tips+2.226
inges+2.177
interrupted+1.985
ollar+1.943
verbs+1.923
Neg logit contributions
 irrad-2.311
 reactor-2.244
 kinderg-2.227
 rejection-2.187
���-2.066
Tokentwitter
Token ID6956
Feature activation-2.41e+00

Pos logit contributions
tips+0.186
inges+0.185
interrupted+0.161
verbs+0.147
ollar+0.145
Neg logit contributions
 irrad-0.490
 reactor-0.480
 kinderg-0.479
 rejection-0.472
 workload-0.447
Token.
Token ID13
Feature activation-5.80e-01

Pos logit contributions
tips+3.795
inges+3.668
interrupted+3.298
ollar+3.288
verbs+3.226
Neg logit contributions
 irrad-4.342
 reactor-4.216
 kinderg-4.192
���-4.128
 rejection-4.095
Tokencom
Token ID785
Feature activation-2.14e+00

Pos logit contributions
tips+0.833
inges+0.825
interrupted+0.755
verbs+0.719
ollar+0.711
Neg logit contributions
 irrad-1.130
 reactor-1.101
 kinderg-1.096
 rejection-1.078
 workload-1.007
Token/
Token ID14
Feature activation-1.69e-01

Pos logit contributions
tips+3.267
inges+3.173
interrupted+2.882
verbs+2.827
ollar+2.825
Neg logit contributions
 irrad-3.948
 reactor-3.838
 kinderg-3.821
 rejection-3.739
���-3.558
Tokenxx
Token ID5324
Feature activation-1.12e+00

Pos logit contributions
tips+0.284
inges+0.283
interrupted+0.262
verbs+0.251
ollar+0.249
Neg logit contributions
 irrad-0.288
 reactor-0.280
 kinderg-0.279
 rejection-0.273
 workload-0.252
TokenA
Token ID32
Feature activation-5.18e-01

Pos logit contributions
tips+1.880
inges+1.868
interrupted+1.724
verbs+1.660
ollar+1.648
Neg logit contributions
 irrad-1.907
 reactor-1.851
 kinderg-1.850
 rejection-1.812
 workload-1.675
Token had
Token ID550
Feature activation+7.20e-02

Pos logit contributions
tips+0.048
inges+0.048
interrupted+0.045
verbs+0.043
ollar+0.042
Neg logit contributions
 irrad-0.042
 kinderg-0.041
 reactor-0.041
 rejection-0.040
 workload-0.036
Token with
Token ID351
Feature activation+1.29e-01

Pos logit contributions
 irrad+0.116
 kinderg+0.112
 reactor+0.112
 rejection+0.109
 workload+0.101
Neg logit contributions
tips-0.126
inges-0.125
interrupted-0.117
ollar-0.111
verbs-0.111
Token their
Token ID511
Feature activation-2.61e-01

Pos logit contributions
 irrad+0.200
 reactor+0.193
 kinderg+0.193
 rejection+0.188
 workload+0.171
Neg logit contributions
tips-0.236
inges-0.235
interrupted-0.219
ollar-0.210
verbs-0.210
Token checking
Token ID10627
Feature activation-7.62e-01

Pos logit contributions
tips+0.459
inges+0.459
interrupted+0.427
ollar+0.410
verbs+0.408
Neg logit contributions
 irrad-0.415
 reactor-0.403
 kinderg-0.401
 rejection-0.389
 workload-0.356
Token or
Token ID393
Feature activation+6.05e-01

Pos logit contributions
tips+1.044
inges+1.032
interrupted+0.944
verbs+0.889
ollar+0.887
Neg logit contributions
 irrad-1.525
 reactor-1.484
 kinderg-1.480
 rejection-1.453
 workload-1.360
Token savings
Token ID10653
Feature activation-2.40e+00

Pos logit contributions
 irrad+0.995
 reactor+0.966
 kinderg+0.963
 rejection+0.943
 workload+0.868
Neg logit contributions
tips-1.046
inges-1.042
interrupted-0.970
verbs-0.928
ollar-0.921
Token accounts
Token ID5504
Feature activation-1.25e+00

Pos logit contributions
tips+3.024
inges+2.953
interrupted+2.716
ollar+2.591
ippers+2.497
Neg logit contributions
 irrad-4.932
 kinderg-4.811
 reactor-4.793
 rejection-4.664
 teamwork-4.382
Token,
Token ID11
Feature activation+1.37e-01

Pos logit contributions
tips+2.132
inges+2.106
interrupted+1.966
ollar+1.882
verbs+1.869
Neg logit contributions
 irrad-2.057
 reactor-1.998
 kinderg-1.991
 rejection-1.943
 teamwork-1.793
Token including
Token ID1390
Feature activation+1.07e+00

Pos logit contributions
 irrad+0.221
 reactor+0.214
 kinderg+0.213
 rejection+0.208
 workload+0.191
Neg logit contributions
tips-0.242
inges-0.240
interrupted-0.224
verbs-0.214
ollar-0.214
Token complaints
Token ID9687
Feature activation+5.12e-01

Pos logit contributions
 irrad+1.585
 reactor+1.533
 kinderg+1.527
 rejection+1.491
 workload+1.359
Neg logit contributions
tips-2.033
inges-2.026
interrupted-1.899
verbs-1.823
ollar-1.811
Token over
Token ID625
Feature activation+7.23e-01

Pos logit contributions
 irrad+0.673
 reactor+0.648
 kinderg+0.646
 rejection+0.626
 workload+0.564
Neg logit contributions
tips-1.055
inges-1.052
interrupted-0.991
verbs-0.952
ollar-0.951
Token from
Token ID422
Feature activation+3.76e-01

Pos logit contributions
inges+1.185
tips+1.180
interrupted+1.106
ollar+1.058
verbs+1.054
Neg logit contributions
 irrad-1.038
 kinderg-1.015
 reactor-1.010
 rejection-0.984
 workload-0.909
Token the
Token ID262
Feature activation-1.32e-01

Pos logit contributions
 irrad+0.571
 reactor+0.552
 kinderg+0.550
 rejection+0.539
 workload+0.492
Neg logit contributions
tips-0.695
inges-0.693
interrupted-0.647
verbs-0.622
ollar-0.619
Token Harvard
Token ID11131
Feature activation+9.95e-01

Pos logit contributions
tips+0.246
inges+0.245
interrupted+0.228
verbs+0.219
ollar+0.219
Neg logit contributions
 irrad-0.197
 reactor-0.189
 kinderg-0.189
 rejection-0.185
 workload-0.170
Token-
Token ID12
Feature activation-1.52e-01

Pos logit contributions
 irrad+1.730
 reactor+1.687
 kinderg+1.669
 rejection+1.642
 workload+1.516
Neg logit contributions
tips-1.634
inges-1.629
interrupted-1.520
verbs-1.456
ollar-1.428
TokenSmith
Token ID17919
Feature activation-8.25e-01

Pos logit contributions
tips+0.245
inges+0.244
interrupted+0.225
verbs+0.214
ollar+0.214
Neg logit contributions
 irrad-0.267
 reactor-0.259
 kinderg-0.258
 rejection-0.254
 workload-0.235
Tokensonian
Token ID35202
Feature activation-2.40e+00

Pos logit contributions
tips+0.330
inges+0.322
interrupted+0.214
verbs+0.166
ollar+0.159
Neg logit contributions
 irrad-2.453
 reactor-2.412
 kinderg-2.409
 rejection-2.384
 workload-2.286
Token Center
Token ID3337
Feature activation+1.09e-01

Pos logit contributions
inges+4.350
tips+4.334
interrupted+4.031
ollar+3.890
ippers+3.816
Neg logit contributions
 irrad-3.613
 kinderg-3.477
 reactor-3.429
 rejection-3.427
 workload-3.100
Token for
Token ID329
Feature activation-1.55e-01

Pos logit contributions
 irrad+0.191
 kinderg+0.185
 reactor+0.185
 rejection+0.181
 workload+0.168
Neg logit contributions
tips-0.175
inges-0.174
interrupted-0.162
verbs-0.154
ollar-0.153
Token Ast
Token ID8304
Feature activation-1.80e-01

Pos logit contributions
tips+0.235
inges+0.233
interrupted+0.213
verbs+0.203
ollar+0.203
Neg logit contributions
 irrad-0.288
 reactor-0.281
 kinderg-0.280
 rejection-0.276
 workload-0.256
Tokenroph
Token ID10051
Feature activation-5.27e-01

Pos logit contributions
tips+0.342
inges+0.341
interrupted+0.317
verbs+0.306
ollar+0.306
Neg logit contributions
 irrad-0.264
 reactor-0.256
 kinderg-0.255
 rejection-0.250
 workload-0.226
Tokenysics
Token ID23154
Feature activation-2.16e-01

Pos logit contributions
tips+0.695
inges+0.692
interrupted+0.623
verbs+0.595
ollar+0.588
Neg logit contributions
 irrad-1.082
 reactor-1.059
 kinderg-1.057
 rejection-1.040
 workload-0.972
Token all
Token ID477
Feature activation-6.89e-02

Pos logit contributions
 irrad+0.202
 reactor+0.196
 kinderg+0.196
 rejection+0.191
 workload+0.176
Neg logit contributions
tips-0.224
inges-0.224
interrupted-0.209
verbs-0.200
ollar-0.199
Token their
Token ID511
Feature activation-4.57e-01

Pos logit contributions
tips+0.129
inges+0.128
interrupted+0.120
verbs+0.115
ollar+0.114
Neg logit contributions
 irrad-0.104
 reactor-0.101
 kinderg-0.100
 rejection-0.098
 workload-0.089
Token former
Token ID1966
Feature activation-1.29e+00

Pos logit contributions
tips+0.873
inges+0.870
interrupted+0.818
verbs+0.784
ollar+0.778
Neg logit contributions
 irrad-0.665
 reactor-0.641
 kinderg-0.641
 rejection-0.624
 workload-0.566
Token captain
Token ID10654
Feature activation-6.14e-01

Pos logit contributions
inges+2.409
tips+2.401
interrupted+2.227
ippers+2.147
verbs+2.134
Neg logit contributions
 irrad-1.903
 reactor-1.826
 kinderg-1.823
 rejection-1.795
ific-1.621
Token has
Token ID468
Feature activation-1.35e-02

Pos logit contributions
tips+1.036
inges+1.033
interrupted+0.956
verbs+0.915
ippers+0.908
Neg logit contributions
 irrad-1.029
 kinderg-0.998
 reactor-0.998
 rejection-0.975
 workload-0.894
Token ever
Token ID1683
Feature activation-2.39e+00

Pos logit contributions
tips+0.024
inges+0.024
interrupted+0.022
verbs+0.021
ollar+0.021
Neg logit contributions
 irrad-0.021
 kinderg-0.021
 reactor-0.021
 rejection-0.020
 workload-0.018
Token known
Token ID1900
Feature activation-1.28e+00

Pos logit contributions
inges+3.918
tips+3.889
interrupted+3.659
verbs+3.474
ippers+3.448
Neg logit contributions
 kinderg-3.895
 irrad-3.858
 reactor-3.792
 rejection-3.774
 antioxid-3.479
Token is
Token ID318
Feature activation+4.98e-02

Pos logit contributions
inges+1.980
tips+1.980
interrupted+1.804
ippers+1.725
verbs+1.724
Neg logit contributions
 irrad-2.303
 reactor-2.239
 kinderg-2.228
 rejection-2.165
 workload-2.009
Token the
Token ID262
Feature activation-1.68e-01

Pos logit contributions
 irrad+0.077
 reactor+0.075
 kinderg+0.075
 rejection+0.072
 workload+0.066
Neg logit contributions
tips-0.090
inges-0.090
interrupted-0.084
verbs-0.080
ollar-0.080
Token path
Token ID3108
Feature activation+6.19e-01

Pos logit contributions
tips+0.331
inges+0.330
interrupted+0.310
verbs+0.297
ollar+0.296
Neg logit contributions
 irrad-0.232
 reactor-0.225
 kinderg-0.224
 rejection-0.217
ific-0.196
Token to
Token ID284
Feature activation+4.77e-02

Pos logit contributions
 irrad+0.986
 reactor+0.956
 kinderg+0.952
 rejection+0.933
 workload+0.856
Neg logit contributions
tips-1.103
inges-1.100
interrupted-1.026
verbs-0.984
ollar-0.975
Token you
Token ID345
Feature activation+3.92e-01

Pos logit contributions
 irrad+0.368
 reactor+0.356
 kinderg+0.355
 rejection+0.346
 workload+0.316
Neg logit contributions
tips-0.460
inges-0.458
interrupted-0.429
verbs-0.412
ollar-0.408
Token can
Token ID460
Feature activation-3.81e-01

Pos logit contributions
 irrad+0.597
 reactor+0.579
 kinderg+0.579
 rejection+0.565
 workload+0.516
Neg logit contributions
tips-0.725
inges-0.721
interrupted-0.675
verbs-0.648
ollar-0.643
Token watch
Token ID2342
Feature activation-5.05e-01

Pos logit contributions
tips+0.725
inges+0.721
interrupted+0.679
verbs+0.649
ollar+0.648
Neg logit contributions
 irrad-0.550
 kinderg-0.540
 reactor-0.535
 rejection-0.524
 workload-0.474
Token porn
Token ID8483
Feature activation-4.03e-01

Pos logit contributions
tips+0.971
inges+0.970
interrupted+0.911
verbs+0.874
ollar+0.866
Neg logit contributions
 irrad-0.722
 reactor-0.696
 kinderg-0.695
 rejection-0.675
 workload-0.617
Token and
Token ID290
Feature activation-1.05e-01

Pos logit contributions
tips+0.710
inges+0.708
interrupted+0.659
verbs+0.631
ollar+0.625
Neg logit contributions
 irrad-0.649
 kinderg-0.629
 reactor-0.629
 rejection-0.615
 workload-0.565
Token masturb
Token ID22938
Feature activation-2.39e+00

Pos logit contributions
tips+0.180
inges+0.180
interrupted+0.167
verbs+0.160
ollar+0.158
Neg logit contributions
 irrad-0.173
 reactor-0.168
 kinderg-0.168
 rejection-0.164
 workload-0.151
Tokenate
Token ID378
Feature activation-1.19e+00

Pos logit contributions
inges+2.052
tips+2.035
interrupted+1.733
ollar+1.621
verbs+1.586
Neg logit contributions
 irrad-5.848
 kinderg-5.800
 reactor-5.745
 rejection-5.687
 teamwork-5.400
Token for
Token ID329
Feature activation-7.47e-02

Pos logit contributions
tips+2.062
inges+2.052
interrupted+1.917
verbs+1.824
ollar+1.802
Neg logit contributions
 irrad-1.904
 kinderg-1.846
 reactor-1.840
 rejection-1.794
 workload-1.652
Token a
Token ID257
Feature activation-1.90e-03

Pos logit contributions
tips+0.142
inges+0.141
interrupted+0.133
verbs+0.127
ollar+0.126
Neg logit contributions
 irrad-0.109
 reactor-0.106
 kinderg-0.105
 rejection-0.103
 workload-0.094
Token long
Token ID890
Feature activation-1.17e+00

Pos logit contributions
tips+0.003
inges+0.003
interrupted+0.003
verbs+0.003
ollar+0.003
Neg logit contributions
 irrad-0.003
 reactor-0.003
 kinderg-0.003
 rejection-0.003
 workload-0.003
Token time
Token ID640
Feature activation-7.53e-01

Pos logit contributions
tips+2.148
inges+2.147
interrupted+2.031
verbs+1.926
ollar+1.904
Neg logit contributions
 irrad-1.737
 kinderg-1.678
 reactor-1.669
 rejection-1.628
 teamwork-1.484
Token major
Token ID1688
Feature activation-2.07e-01

Pos logit contributions
 irrad+1.752
 reactor+1.697
 kinderg+1.690
 rejection+1.654
 workload+1.511
Neg logit contributions
tips-2.167
inges-2.160
interrupted-2.025
verbs-1.950
ollar-1.926
Token dollars
Token ID5054
Feature activation-2.60e-01

Pos logit contributions
tips+0.376
inges+0.374
interrupted+0.350
ollar+0.333
verbs+0.332
Neg logit contributions
 irrad-0.321
 reactor-0.310
 kinderg-0.310
 rejection-0.302
 teamwork-0.275
Token just
Token ID655
Feature activation-1.40e-01

Pos logit contributions
tips+0.457
inges+0.455
interrupted+0.424
verbs+0.406
ollar+0.404
Neg logit contributions
 irrad-0.419
 reactor-0.407
 kinderg-0.405
 rejection-0.397
 workload-0.364
Token to
Token ID284
Feature activation+1.81e-01

Pos logit contributions
tips+0.234
inges+0.234
interrupted+0.217
verbs+0.207
ollar+0.206
Neg logit contributions
 irrad-0.239
 reactor-0.232
 kinderg-0.231
 rejection-0.226
 workload-0.209
Token get
Token ID651
Feature activation+6.23e-02

Pos logit contributions
 irrad+0.274
 reactor+0.265
 kinderg+0.264
 rejection+0.258
 workload+0.236
Neg logit contributions
tips-0.339
inges-0.338
interrupted-0.316
verbs-0.304
ollar-0.301
Token rid
Token ID5755
Feature activation-2.38e+00

Pos logit contributions
 irrad+0.099
 reactor+0.096
 kinderg+0.095
 rejection+0.093
 workload+0.086
Neg logit contributions
tips-0.111
inges-0.110
interrupted-0.103
verbs-0.098
ollar-0.098
Token of
Token ID286
Feature activation+7.56e-02

Pos logit contributions
tips+4.083
inges+3.999
interrupted+3.707
ollar+3.624
ippers+3.595
Neg logit contributions
 irrad-3.785
 kinderg-3.661
 reactor-3.656
 rejection-3.524
���-3.457
Token the
Token ID262
Feature activation-2.27e-02

Pos logit contributions
 irrad+0.124
 reactor+0.120
 kinderg+0.119
 rejection+0.116
 workload+0.107
Neg logit contributions
inges-0.131
tips-0.131
interrupted-0.122
verbs-0.116
ollar-0.116
Token proverbial
Token ID47327
Feature activation-5.99e-01

Pos logit contributions
inges+0.045
tips+0.044
interrupted+0.042
verbs+0.040
ollar+0.040
Neg logit contributions
 irrad-0.032
 reactor-0.030
 kinderg-0.030
 rejection-0.030
 teamwork-0.027
Token al
Token ID435
Feature activation-2.04e-02

Pos logit contributions
inges+1.141
tips+1.139
interrupted+1.065
ollar+1.021
verbs+1.018
Neg logit contributions
 irrad-0.856
 reactor-0.826
 kinderg-0.821
 rejection-0.806
ific-0.729
Tokenbat
Token ID8664
Feature activation-1.50e-01

Pos logit contributions
tips+0.041
inges+0.041
interrupted+0.038
verbs+0.037
ollar+0.037
Neg logit contributions
 irrad-0.028
 reactor-0.027
 kinderg-0.027
 rejection-0.026
 workload-0.023
Token the
Token ID262
Feature activation+2.24e-01

Pos logit contributions
 irrad+0.173
 reactor+0.168
 kinderg+0.167
 rejection+0.163
 workload+0.148
Neg logit contributions
tips-0.218
inges-0.217
interrupted-0.204
verbs-0.195
ollar-0.194
Token sentences
Token ID13439
Feature activation-2.38e-01

Pos logit contributions
 irrad+0.307
 reactor+0.296
 kinderg+0.295
 rejection+0.285
 workload+0.258
Neg logit contributions
tips-0.445
inges-0.443
interrupted-0.418
verbs-0.400
ollar-0.400
Token on
Token ID319
Feature activation-1.26e-01

Pos logit contributions
inges+0.424
tips+0.424
interrupted+0.397
ollar+0.378
verbs+0.375
Neg logit contributions
 irrad-0.373
 reactor-0.362
 kinderg-0.360
 rejection-0.351
 teamwork-0.323
Token four
Token ID1440
Feature activation+1.05e-01

Pos logit contributions
tips+0.238
inges+0.237
interrupted+0.223
ollar+0.212
verbs+0.212
Neg logit contributions
 irrad-0.185
 reactor-0.178
 kinderg-0.178
 rejection-0.173
 workload-0.158
Token remaining
Token ID5637
Feature activation-6.10e-01

Pos logit contributions
 irrad+0.164
 reactor+0.159
 kinderg+0.158
 rejection+0.155
 workload+0.142
Neg logit contributions
tips-0.190
inges-0.190
interrupted-0.178
verbs-0.169
ollar-0.169
Token offences
Token ID20819
Feature activation-2.37e+00

Pos logit contributions
tips+1.041
inges+1.039
interrupted+0.967
verbs+0.915
ollar+0.914
Neg logit contributions
 irrad-1.000
 reactor-0.969
 kinderg-0.966
 rejection-0.948
 teamwork-0.871
Token were
Token ID547
Feature activation+2.09e-01

Pos logit contributions
inges+4.060
tips+4.055
interrupted+3.807
ollar+3.599
verbs+3.536
Neg logit contributions
 irrad-3.705
 kinderg-3.664
 reactor-3.611
 rejection-3.521
 teamwork-3.263
Token to
Token ID284
Feature activation+2.41e-01

Pos logit contributions
 irrad+0.311
 reactor+0.301
 kinderg+0.300
 rejection+0.292
 workload+0.266
Neg logit contributions
tips-0.393
inges-0.391
interrupted-0.368
verbs-0.351
ollar-0.350
Token be
Token ID307
Feature activation+4.32e-01

Pos logit contributions
 irrad+0.378
 reactor+0.367
 kinderg+0.366
 rejection+0.358
 workload+0.327
Neg logit contributions
tips-0.431
inges-0.430
interrupted-0.403
ollar-0.383
verbs-0.383
Token served
Token ID4983
Feature activation-8.42e-01

Pos logit contributions
 irrad+0.702
 reactor+0.681
 kinderg+0.679
 rejection+0.664
 workload+0.610
Neg logit contributions
tips-0.751
inges-0.749
interrupted-0.700
verbs-0.665
ollar-0.664
Token concurrently
Token ID47480
Feature activation+3.05e-01

Pos logit contributions
inges+1.260
tips+1.259
interrupted+1.169
ollar+1.097
verbs+1.089
Neg logit contributions
 irrad-1.543
 reactor-1.509
 kinderg-1.500
 rejection-1.473
ific-1.366
Token we
Token ID356
Feature activation-2.00e-01

Pos logit contributions
 irrad+0.564
 reactor+0.543
 kinderg+0.539
 rejection+0.523
 workload+0.472
Neg logit contributions
tips-0.900
inges-0.897
interrupted-0.847
verbs-0.816
ollar-0.810
Token realize
Token ID6537
Feature activation+9.54e-01

Pos logit contributions
tips+0.363
inges+0.362
interrupted+0.338
verbs+0.324
ollar+0.323
Neg logit contributions
 irrad-0.307
 kinderg-0.298
 reactor-0.298
 rejection-0.290
 workload-0.267
Token the
Token ID262
Feature activation-1.00e-01

Pos logit contributions
 irrad+1.310
 reactor+1.264
 kinderg+1.259
 rejection+1.227
 workload+1.109
Neg logit contributions
tips-1.908
inges-1.901
interrupted-1.789
verbs-1.722
ollar-1.711
Token subject
Token ID2426
Feature activation+2.72e-01

Pos logit contributions
tips+0.202
inges+0.202
interrupted+0.190
verbs+0.182
ollar+0.182
Neg logit contributions
 irrad-0.134
 reactor-0.129
 kinderg-0.128
 rejection-0.123
 workload-0.112
Token's
Token ID338
Feature activation-2.38e-01

Pos logit contributions
 irrad+0.460
 reactor+0.446
 kinderg+0.444
 rejection+0.433
 workload+0.401
Neg logit contributions
inges-0.459
tips-0.458
interrupted-0.423
verbs-0.405
ollar-0.402
Token masturbation
Token ID41054
Feature activation-2.37e+00

Pos logit contributions
tips+0.468
inges+0.465
interrupted+0.438
verbs+0.421
ollar+0.418
Neg logit contributions
 irrad-0.327
 reactor-0.315
 kinderg-0.315
 rejection-0.302
 teamwork-0.275
Token has
Token ID468
Feature activation-7.16e-02

Pos logit contributions
tips+4.075
inges+4.028
interrupted+3.752
verbs+3.586
ollar+3.558
Neg logit contributions
 irrad-3.727
 kinderg-3.654
 reactor-3.595
 rejection-3.470
 teamwork-3.232
Token killed
Token ID2923
Feature activation-6.61e-01

Pos logit contributions
tips+0.126
inges+0.126
interrupted+0.118
verbs+0.112
ollar+0.112
Neg logit contributions
 irrad-0.113
 kinderg-0.111
 reactor-0.110
 rejection-0.108
 workload-0.099
Token his
Token ID465
Feature activation+8.12e-02

Pos logit contributions
inges+1.287
tips+1.283
interrupted+1.201
verbs+1.161
ollar+1.152
Neg logit contributions
 irrad-0.928
 kinderg-0.892
 reactor-0.890
 rejection-0.862
 workload-0.785
Token dog
Token ID3290
Feature activation-1.40e+00

Pos logit contributions
 irrad+0.128
 reactor+0.123
 kinderg+0.123
 rejection+0.120
 workload+0.110
Neg logit contributions
tips-0.145
inges-0.145
interrupted-0.135
verbs-0.129
ollar-0.129
Token,
Token ID11
Feature activation-4.52e-02

Pos logit contributions
tips+2.390
inges+2.365
interrupted+2.201
verbs+2.121
ollar+2.097
Neg logit contributions
 irrad-2.305
 reactor-2.231
 kinderg-2.223
 rejection-2.175
 workload-2.003
Token8
Token ID23
Feature activation+3.85e-02

Pos logit contributions
tips+0.002
inges+0.002
interrupted+0.002
verbs+0.002
ollar+0.002
Neg logit contributions
 irrad-0.002
 reactor-0.002
 kinderg-0.002
 rejection-0.002
 workload-0.002
Tokenb
Token ID65
Feature activation-1.69e-01

Pos logit contributions
 irrad+0.067
 reactor+0.065
 kinderg+0.065
 rejection+0.064
 workload+0.059
Neg logit contributions
tips-0.063
inges-0.063
interrupted-0.058
verbs-0.056
ollar-0.055
TokenET
Token ID2767
Feature activation-7.19e-01

Pos logit contributions
tips+0.287
inges+0.285
interrupted+0.264
verbs+0.253
ollar+0.251
Neg logit contributions
 irrad-0.285
 reactor-0.276
 kinderg-0.275
 rejection-0.270
 workload-0.249
Token pic
Token ID8301
Feature activation-1.54e+00

Pos logit contributions
tips+1.146
inges+1.139
interrupted+1.057
verbs+1.007
ollar+1.001
Neg logit contributions
 irrad-1.280
 reactor-1.240
 kinderg-1.235
 rejection-1.213
 workload-1.125
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
tips+2.523
inges+2.459
interrupted+2.246
ollar+2.208
verbs+2.170
Neg logit contributions
 irrad-2.675
 reactor-2.601
 kinderg-2.588
 rejection-2.524
���-2.438
Tokentwitter
Token ID6956
Feature activation-2.37e+00

Pos logit contributions
tips+0.178
inges+0.177
interrupted+0.154
verbs+0.141
ollar+0.139
Neg logit contributions
 irrad-0.462
 reactor-0.453
 kinderg-0.452
 rejection-0.446
 workload-0.423
Token.
Token ID13
Feature activation-4.80e-01

Pos logit contributions
tips+3.766
inges+3.627
interrupted+3.286
ollar+3.260
verbs+3.201
Neg logit contributions
 irrad-4.253
 reactor-4.134
 kinderg-4.106
���-4.049
 rejection-3.998
Tokencom
Token ID785
Feature activation-1.98e+00

Pos logit contributions
tips+0.696
inges+0.689
interrupted+0.633
verbs+0.601
ollar+0.595
Neg logit contributions
 irrad-0.929
 reactor-0.905
 kinderg-0.902
 rejection-0.886
 workload-0.827
Token/
Token ID14
Feature activation-2.68e-03

Pos logit contributions
tips+3.050
inges+2.949
interrupted+2.693
verbs+2.633
ollar+2.632
Neg logit contributions
 irrad-3.634
 reactor-3.528
 kinderg-3.516
 rejection-3.435
���-3.299
TokenW
Token ID54
Feature activation-4.03e-02

Pos logit contributions
tips+0.004
inges+0.004
interrupted+0.004
verbs+0.004
ollar+0.004
Neg logit contributions
 irrad-0.005
 reactor-0.004
 kinderg-0.004
 rejection-0.004
 workload-0.004
Token8
Token ID23
Feature activation+9.22e-02

Pos logit contributions
tips+0.068
inges+0.068
interrupted+0.063
verbs+0.060
ollar+0.060
Neg logit contributions
 irrad-0.068
 reactor-0.066
 kinderg-0.066
 rejection-0.064
 workload-0.059
Token high
Token ID1029
Feature activation-6.19e-01

Pos logit contributions
 irrad+2.092
 reactor+2.028
 kinderg+2.019
 rejection+1.978
 workload+1.813
Neg logit contributions
tips-2.422
inges-2.412
interrupted-2.254
verbs-2.163
ollar-2.146
Token bandwidth
Token ID19484
Feature activation-6.24e-01

Pos logit contributions
tips+1.196
inges+1.194
interrupted+1.127
verbs+1.084
ollar+1.075
Neg logit contributions
 irrad-0.869
 kinderg-0.852
 reactor-0.841
 rejection-0.810
 teamwork-0.743
Token Layer
Token ID34398
Feature activation+2.27e+00

Pos logit contributions
tips+1.183
inges+1.180
interrupted+1.106
verbs+1.056
ippers+1.050
Neg logit contributions
 irrad-0.908
 kinderg-0.879
 reactor-0.878
 rejection-0.846
 teamwork-0.775
Token 3
Token ID513
Feature activation+3.52e-01

Pos logit contributions
 irrad+3.664
 reactor+3.599
 kinderg+3.590
 rejection+3.567
 workload+3.298
Neg logit contributions
inges-3.915
tips-3.870
interrupted-3.585
verbs-3.465
ollar-3.449
Token &
Token ID1222
Feature activation-1.43e-01

Pos logit contributions
 irrad+0.495
 kinderg+0.477
 reactor+0.476
 rejection+0.461
 teamwork+0.418
Neg logit contributions
tips-0.690
inges-0.689
interrupted-0.646
verbs-0.619
ollar-0.616
Token Layer
Token ID34398
Feature activation+2.04e+00

Pos logit contributions
tips+0.241
inges+0.241
interrupted+0.224
verbs+0.213
ollar+0.211
Neg logit contributions
 irrad-0.239
 reactor-0.232
 kinderg-0.231
 rejection-0.226
 teamwork-0.207
Token 4
Token ID604
Feature activation-1.38e-02

Pos logit contributions
 irrad+3.205
 reactor+3.126
 kinderg+3.097
 rejection+3.050
 workload+2.803
Neg logit contributions
inges-3.645
tips-3.630
interrupted-3.388
verbs-3.257
ollar-3.223
Token D
Token ID360
Feature activation-3.88e-01

Pos logit contributions
inges+0.027
tips+0.027
interrupted+0.025
verbs+0.024
ollar+0.024
Neg logit contributions
 irrad-0.019
 reactor-0.019
 kinderg-0.019
 rejection-0.018
 teamwork-0.016
TokenDoS
Token ID46498
Feature activation-3.66e-01

Pos logit contributions
inges+0.671
tips+0.667
interrupted+0.614
verbs+0.592
ollar+0.592
Neg logit contributions
 irrad-0.643
 reactor-0.626
 kinderg-0.624
 rejection-0.606
 teamwork-0.557
Token attacks
Token ID3434
Feature activation+2.83e-01

Pos logit contributions
inges+0.904
tips+0.902
interrupted+0.857
verbs+0.831
ollar+0.826
Neg logit contributions
 irrad-0.329
 reactor-0.312
 kinderg-0.309
 rejection-0.293
 teamwork-0.251
Token as
Token ID355
Feature activation-1.11e-01

Pos logit contributions
 irrad+0.449
 reactor+0.436
 kinderg+0.435
 rejection+0.424
 workload+0.387
Neg logit contributions
tips-0.505
inges-0.503
interrupted-0.468
verbs-0.449
ollar-0.447
Token by
Token ID416
Feature activation-8.84e-01

Pos logit contributions
tips+0.112
inges+0.111
interrupted+0.103
verbs+0.099
ollar+0.098
Neg logit contributions
 irrad-0.108
 reactor-0.105
 kinderg-0.105
 rejection-0.103
 workload-0.094
Token Facebook
Token ID3203
Feature activation-3.49e-01

Pos logit contributions
tips+1.679
inges+1.663
interrupted+1.541
ollar+1.488
verbs+1.486
Neg logit contributions
 irrad-1.289
 reactor-1.237
 kinderg-1.223
 rejection-1.196
ific-1.096
Token employees
Token ID4409
Feature activation+2.99e-01

Pos logit contributions
tips+0.627
inges+0.622
interrupted+0.577
verbs+0.554
ippers+0.549
Neg logit contributions
 irrad-0.552
 reactor-0.535
 kinderg-0.530
 rejection-0.519
 workload-0.475
Token in
Token ID287
Feature activation-4.10e-02

Pos logit contributions
 irrad+0.477
 reactor+0.463
 kinderg+0.461
 rejection+0.451
 workload+0.414
Neg logit contributions
tips-0.531
inges-0.529
interrupted-0.494
verbs-0.473
ollar-0.470
Token Palo
Token ID44878
Feature activation+1.77e+00

Pos logit contributions
inges+0.078
tips+0.078
interrupted+0.072
verbs+0.070
ollar+0.069
Neg logit contributions
 irrad-0.060
 reactor-0.058
 kinderg-0.057
 rejection-0.056
 workload-0.051
Token Alto
Token ID34317
Feature activation+2.19e+00

Pos logit contributions
 irrad+1.346
 reactor+1.260
 kinderg+1.254
 rejection+1.191
 workload+0.967
Neg logit contributions
tips-4.600
inges-4.582
interrupted-4.388
verbs-4.261
ollar-4.260
Token,
Token ID11
Feature activation+6.57e-02

Pos logit contributions
 irrad+3.242
 kinderg+3.150
 reactor+3.143
 rejection+3.052
 workload+2.797
Neg logit contributions
tips-4.102
inges-4.101
interrupted-3.851
verbs-3.716
ollar-3.685
Token Calif
Token ID3192
Feature activation+7.28e-01

Pos logit contributions
 irrad+0.091
 reactor+0.089
 kinderg+0.088
 rejection+0.086
 workload+0.078
Neg logit contributions
tips-0.130
inges-0.129
interrupted-0.121
verbs-0.116
ollar-0.116
Token.,
Token ID1539
Feature activation+7.72e-01

Pos logit contributions
 irrad+1.313
 reactor+1.279
 kinderg+1.272
 rejection+1.250
 workload+1.158
Neg logit contributions
tips-1.140
inges-1.131
interrupted-1.051
verbs-1.000
ollar-0.986
Token and
Token ID290
Feature activation-9.89e-02

Pos logit contributions
 irrad+1.234
 reactor+1.197
 kinderg+1.194
 rejection+1.168
 workload+1.072
Neg logit contributions
tips-1.370
inges-1.366
interrupted-1.275
verbs-1.224
ollar-1.211
Token in
Token ID287
Feature activation+1.53e-01

Pos logit contributions
tips+0.179
inges+0.177
interrupted+0.166
verbs+0.158
ollar+0.157
Neg logit contributions
 irrad-0.154
 reactor-0.150
 kinderg-0.149
 rejection-0.146
 workload-0.133
Token cue
Token ID28381
Feature activation+9.33e-02

Pos logit contributions
 irrad+0.146
 reactor+0.140
 kinderg+0.140
 rejection+0.136
 workload+0.123
Neg logit contributions
tips-0.210
inges-0.209
interrupted-0.196
verbs-0.189
ollar-0.188
Token for
Token ID329
Feature activation-1.02e-01

Pos logit contributions
 irrad+0.157
 kinderg+0.152
 reactor+0.152
 rejection+0.148
 workload+0.137
Neg logit contributions
tips-0.158
inges-0.157
interrupted-0.146
verbs-0.140
ollar-0.138
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
inges+0.174
tips+0.174
interrupted+0.162
verbs+0.154
ollar+0.153
Neg logit contributions
 irrad-0.168
 reactor-0.163
 kinderg-0.162
 rejection-0.158
 workload-0.146
Token recently
Token ID2904
Feature activation-1.33e-01

Pos logit contributions
 irrad+0.171
 reactor+0.165
 kinderg+0.165
 rejection+0.161
 workload+0.146
Neg logit contributions
tips-0.229
inges-0.228
interrupted-0.214
verbs-0.205
ollar-0.204
Token under
Token ID739
Feature activation+7.96e-01

Pos logit contributions
tips+0.227
inges+0.226
interrupted+0.210
verbs+0.201
ollar+0.199
Neg logit contributions
 irrad-0.222
 reactor-0.216
 kinderg-0.215
 rejection-0.210
 workload-0.194
Tokenwhelming
Token ID36433
Feature activation+2.11e+00

Pos logit contributions
 irrad+1.374
 reactor+1.338
 kinderg+1.330
 rejection+1.304
 workload+1.210
Neg logit contributions
tips-1.311
inges-1.306
interrupted-1.205
verbs-1.154
ollar-1.146
Token Can
Token ID1680
Feature activation+8.88e-02

Pos logit contributions
 irrad+3.472
 reactor+3.402
 kinderg+3.374
 rejection+3.337
 workload+3.080
Neg logit contributions
tips-3.634
inges-3.583
interrupted-3.366
verbs-3.239
ollar-3.187
Tokena
Token ID64
Feature activation+7.00e-01

Pos logit contributions
 irrad+0.125
 reactor+0.120
 kinderg+0.119
 rejection+0.117
 workload+0.106
Neg logit contributions
tips-0.175
inges-0.175
interrupted-0.164
verbs-0.158
ollar-0.158
Token,
Token ID11
Feature activation+2.54e-01

Pos logit contributions
 irrad+1.037
 reactor+1.002
 kinderg+0.999
 rejection+0.974
 workload+0.888
Neg logit contributions
tips-1.326
inges-1.322
interrupted-1.240
verbs-1.191
ollar-1.182
Token Campbell
Token ID14327
Feature activation-3.55e-01

Pos logit contributions
 irrad+0.397
 reactor+0.385
 kinderg+0.382
 rejection+0.375
 workload+0.343
Neg logit contributions
tips-0.462
inges-0.460
interrupted-0.430
verbs-0.413
ollar-0.409
Token and
Token ID290
Feature activation+8.65e-02

Pos logit contributions
tips+0.608
inges+0.606
interrupted+0.565
verbs+0.539
ollar+0.534
Neg logit contributions
 irrad-0.583
 kinderg-0.568
 reactor-0.566
 rejection-0.553
 workload-0.509
Token casualty
Token ID39068
Feature activation+2.80e-01

Pos logit contributions
 irrad+0.476
 kinderg+0.460
 reactor+0.456
 rejection+0.445
 workload+0.402
Neg logit contributions
tips-0.670
inges-0.668
interrupted-0.627
ollar-0.600
verbs-0.598
Token could
Token ID714
Feature activation-4.41e-01

Pos logit contributions
 irrad+0.442
 kinderg+0.429
 reactor+0.426
 rejection+0.416
 workload+0.382
Neg logit contributions
tips-0.503
inges-0.501
interrupted-0.468
verbs-0.446
ollar-0.445
Token be
Token ID307
Feature activation+4.52e-01

Pos logit contributions
inges+0.764
tips+0.763
interrupted+0.713
ollar+0.676
ippers+0.673
Neg logit contributions
 irrad-0.714
 kinderg-0.704
 reactor-0.695
 rejection-0.681
 workload-0.624
Token the
Token ID262
Feature activation+1.56e-01

Pos logit contributions
 irrad+0.651
 kinderg+0.629
 reactor+0.628
 rejection+0.612
 workload+0.555
Neg logit contributions
tips-0.871
inges-0.868
interrupted-0.815
verbs-0.778
ollar-0.776
Token Mars
Token ID8706
Feature activation+2.37e-01

Pos logit contributions
 irrad+0.216
 kinderg+0.209
 reactor+0.208
 rejection+0.202
ific+0.183
Neg logit contributions
tips-0.308
inges-0.307
interrupted-0.288
ollar-0.276
verbs-0.276
Token Atmosp
Token ID33276
Feature activation+2.01e+00

Pos logit contributions
 irrad+0.364
 kinderg+0.352
 reactor+0.352
 rejection+0.343
 workload+0.314
Neg logit contributions
tips-0.436
inges-0.434
interrupted-0.406
verbs-0.388
ollar-0.386
Tokenhere
Token ID1456
Feature activation-4.16e-01

Pos logit contributions
 irrad+3.926
 reactor+3.820
 kinderg+3.734
 rejection+3.732
ific+3.485
Neg logit contributions
tips-2.864
inges-2.775
interrupted-2.649
verbs-2.429
ollar-2.377
Token and
Token ID290
Feature activation+1.24e-01

Pos logit contributions
inges+0.674
tips+0.672
interrupted+0.621
verbs+0.588
ollar+0.586
Neg logit contributions
 irrad-0.719
 kinderg-0.703
 reactor-0.700
 rejection-0.683
 workload-0.637
Token Vol
Token ID4709
Feature activation+7.87e-01

Pos logit contributions
 irrad+0.156
 kinderg+0.156
 reactor+0.152
 rejection+0.147
 workload+0.135
Neg logit contributions
inges-0.259
tips-0.258
interrupted-0.240
verbs-0.234
ollar-0.234
Tokenatile
Token ID12610
Feature activation-5.74e-01

Pos logit contributions
 irrad+1.313
 reactor+1.286
 kinderg+1.243
 rejection+1.235
ific+1.141
Neg logit contributions
tips-1.305
inges-1.300
interrupted-1.245
ippers-1.174
verbs-1.171
Token Evolution
Token ID15815
Feature activation+4.59e-01

Pos logit contributions
inges+0.848
tips+0.846
interrupted+0.773
verbs+0.725
ollar+0.714
Neg logit contributions
 irrad-1.056
 kinderg-1.050
 reactor-1.033
 rejection-1.023
 workload-0.961
Token targeting
Token ID10822
Feature activation+7.06e-01

Pos logit contributions
 irrad+0.415
 kinderg+0.405
 reactor+0.404
 rejection+0.394
 workload+0.360
Neg logit contributions
tips-0.513
inges-0.508
interrupted-0.479
verbs-0.457
ollar-0.457
Token Windows
Token ID3964
Feature activation+7.40e-01

Pos logit contributions
 irrad+1.024
 reactor+0.991
 kinderg+0.985
 rejection+0.962
 workload+0.875
Neg logit contributions
tips-1.358
inges-1.353
interrupted-1.268
verbs-1.220
ollar-1.212
Token,
Token ID11
Feature activation+7.74e-02

Pos logit contributions
 irrad+1.136
 reactor+1.101
 kinderg+1.095
 rejection+1.070
 workload+0.978
Neg logit contributions
tips-1.362
inges-1.354
interrupted-1.267
verbs-1.216
ollar-1.208
Token Mac
Token ID4100
Feature activation+2.19e+00

Pos logit contributions
 irrad+0.122
 reactor+0.119
 kinderg+0.118
 rejection+0.115
 workload+0.106
Neg logit contributions
tips-0.139
inges-0.138
interrupted-0.129
verbs-0.124
ollar-0.123
Token and
Token ID290
Feature activation-1.63e-02

Pos logit contributions
 irrad+3.199
 kinderg+3.092
 reactor+3.076
 rejection+3.039
 workload+2.781
Neg logit contributions
inges-4.138
tips-4.108
interrupted-3.884
verbs-3.692
ollar-3.668
Token Linux
Token ID7020
Feature activation+1.96e+00

Pos logit contributions
tips+0.031
inges+0.031
interrupted+0.029
verbs+0.028
ollar+0.028
Neg logit contributions
 irrad-0.024
 reactor-0.023
 kinderg-0.023
 rejection-0.022
 workload-0.020
Token for
Token ID329
Feature activation-3.99e-02

Pos logit contributions
 irrad+2.884
 kinderg+2.800
 reactor+2.793
 rejection+2.734
 workload+2.520
Neg logit contributions
tips-3.686
inges-3.678
interrupted-3.479
verbs-3.335
ollar-3.300
Token its
Token ID663
Feature activation-1.79e-01

Pos logit contributions
inges+0.079
tips+0.079
interrupted+0.074
verbs+0.071
ippers+0.071
Neg logit contributions
 irrad-0.054
 reactor-0.052
 kinderg-0.052
 rejection-0.050
 workload-0.046
Token initial
Token ID4238
Feature activation+3.23e-01

Pos logit contributions
tips+0.360
inges+0.360
interrupted+0.337
verbs+0.324
ippers+0.323
Neg logit contributions
 irrad-0.242
 reactor-0.233
 kinderg-0.232
 rejection-0.224
 workload-0.203
Token release
Token ID2650
Feature activation+9.21e-01

Pos logit contributions
 irrad+0.363
 reactor+0.348
 kinderg+0.346
 rejection+0.335
 workload+0.295
Neg logit contributions
tips-0.725
inges-0.724
interrupted-0.684
verbs-0.662
ollar-0.658
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
 irrad+1.406
 reactor+1.364
 kinderg+1.356
 rejection+1.330
 workload+1.218
Neg logit contributions
tips-1.702
inges-1.696
interrupted-1.589
verbs-1.528
ollar-1.514
Token mans
Token ID25045
Feature activation-5.94e-01

Pos logit contributions
 irrad+1.163
 reactor+1.128
 kinderg+1.124
 rejection+1.101
 workload+1.016
Neg logit contributions
tips-1.193
inges-1.188
interrupted-1.107
verbs-1.059
ollar-1.050
Token the
Token ID262
Feature activation-4.25e-02

Pos logit contributions
tips+1.105
inges+1.095
interrupted+1.022
verbs+0.985
ippers+0.978
Neg logit contributions
 irrad-0.903
 kinderg-0.869
 reactor-0.866
 rejection-0.848
 workload-0.776
Token border
Token ID4865
Feature activation+1.98e+00

Pos logit contributions
tips+0.082
inges+0.082
interrupted+0.077
verbs+0.074
ollar+0.074
Neg logit contributions
 irrad-0.060
 kinderg-0.058
 reactor-0.058
 rejection-0.057
 workload-0.051
Token with
Token ID351
Feature activation+2.54e-01

Pos logit contributions
 irrad+2.882
 reactor+2.810
 kinderg+2.781
 rejection+2.721
 workload+2.469
Neg logit contributions
tips-3.772
inges-3.770
interrupted-3.533
verbs-3.408
ollar-3.371
Token North
Token ID2258
Feature activation-1.09e-01

Pos logit contributions
 irrad+0.383
 reactor+0.369
 kinderg+0.368
 rejection+0.361
 workload+0.331
Neg logit contributions
tips-0.470
inges-0.468
interrupted-0.436
verbs-0.420
ollar-0.417
Token Korea
Token ID4969
Feature activation+6.09e-01

Pos logit contributions
tips+0.227
inges+0.227
interrupted+0.214
verbs+0.207
ollar+0.205
Neg logit contributions
 irrad-0.140
 reactor-0.134
 kinderg-0.134
 rejection-0.130
 workload-0.117
Token,
Token ID11
Feature activation+2.35e-01

Pos logit contributions
 irrad+0.968
 reactor+0.938
 kinderg+0.935
 rejection+0.917
 workload+0.841
Neg logit contributions
tips-1.086
inges-1.082
interrupted-1.011
verbs-0.967
ollar-0.961
Token where
Token ID810
Feature activation+8.92e-01

Pos logit contributions
 irrad+0.352
 reactor+0.341
 kinderg+0.339
 rejection+0.332
 workload+0.303
Neg logit contributions
tips-0.439
inges-0.436
interrupted-0.409
verbs-0.392
ollar-0.390
Token reports
Token ID3136
Feature activation+9.66e-01

Pos logit contributions
 irrad+1.340
 reactor+1.296
 kinderg+1.291
 rejection+1.263
 workload+1.153
Neg logit contributions
tips-1.664
inges-1.659
interrupted-1.555
verbs-1.493
ollar-1.482
Token have
Token ID423
Feature activation-1.45e-03

Pos logit contributions
 irrad+1.411
 reactor+1.364
 kinderg+1.359
 rejection+1.327
 workload+1.208
Neg logit contributions
tips-1.850
inges-1.841
interrupted-1.730
verbs-1.657
ollar-1.653
Token emerged
Token ID9349
Feature activation-2.20e-01

Pos logit contributions
tips+0.002
inges+0.002
interrupted+0.002
ollar+0.002
verbs+0.002
Neg logit contributions
 irrad-0.002
 kinderg-0.002
 reactor-0.002
 rejection-0.002
 workload-0.002
Token:
Token ID25
Feature activation+5.75e-01

Pos logit contributions
 irrad+1.109
 reactor+1.069
 kinderg+1.063
 rejection+1.038
 workload+0.930
Neg logit contributions
inges-1.712
tips-1.707
interrupted-1.617
verbs-1.551
ollar-1.537
Token47
Token ID2857
Feature activation-2.15e-01

Pos logit contributions
 irrad+1.106
 reactor+1.078
 kinderg+1.073
 rejection+1.059
 workload+0.984
Neg logit contributions
tips-0.831
inges-0.829
interrupted-0.759
verbs-0.718
ollar-0.712
Token]
Token ID60
Feature activation-1.38e-01

Pos logit contributions
tips+0.406
inges+0.403
interrupted+0.375
verbs+0.362
ollar+0.361
Neg logit contributions
 irrad-0.321
 reactor-0.310
 kinderg-0.310
 rejection-0.302
 workload-0.277
Token <
Token ID1279
Feature activation+2.20e-01

Pos logit contributions
tips+0.198
inges+0.197
interrupted+0.181
verbs+0.171
ollar+0.170
Neg logit contributions
 irrad-0.267
 reactor-0.260
 kinderg-0.259
 rejection-0.255
 workload-0.238
Tokenl
Token ID75
Feature activation+9.16e-01

Pos logit contributions
 irrad+0.370
 reactor+0.359
 kinderg+0.357
 rejection+0.351
 workload+0.324
Neg logit contributions
tips-0.374
inges-0.371
interrupted-0.346
verbs-0.330
ollar-0.327
Tokenucky
Token ID5309
Feature activation+7.39e-01

Pos logit contributions
 irrad+1.558
 reactor+1.513
 kinderg+1.505
 rejection+1.477
 workload+1.363
Neg logit contributions
tips-1.528
inges-1.521
interrupted-1.417
verbs-1.351
ollar-1.333
Tokenstar
Token ID7364
Feature activation-4.79e-01

Pos logit contributions
 irrad+1.256
 reactor+1.221
 kinderg+1.217
 rejection+1.192
 workload+1.098
Neg logit contributions
tips-1.233
inges-1.230
interrupted-1.142
verbs-1.090
ollar-1.079
Token19
Token ID1129
Feature activation+6.75e-01

Pos logit contributions
tips+0.798
inges+0.786
interrupted+0.734
verbs+0.699
ollar+0.698
Neg logit contributions
 irrad-0.823
 kinderg-0.803
 reactor-0.799
 rejection-0.785
 workload-0.728
Token>
Token ID29
Feature activation-5.54e-02

Pos logit contributions
 irrad+1.204
 reactor+1.171
 kinderg+1.166
 rejection+1.144
 workload+1.060
Neg logit contributions
tips-1.074
inges-1.070
interrupted-0.991
verbs-0.944
ollar-0.935
Token z
Token ID1976
Feature activation+2.02e-01

Pos logit contributions
tips+0.099
inges+0.098
interrupted+0.091
verbs+0.087
ollar+0.087
Neg logit contributions
 irrad-0.088
 reactor-0.085
 kinderg-0.085
 rejection-0.083
 workload-0.077
Tokenfg
Token ID40616
Feature activation-9.67e-01

Pos logit contributions
 irrad+0.359
 reactor+0.348
 kinderg+0.348
 rejection+0.341
 workload+0.316
Neg logit contributions
tips-0.323
inges-0.322
interrupted-0.297
verbs-0.284
ollar-0.282
Token He
Token ID679
Feature activation+2.87e-01

Pos logit contributions
tips+0.086
inges+0.086
interrupted+0.079
verbs+0.076
ollar+0.075
Neg logit contributions
 irrad-0.088
 reactor-0.085
 kinderg-0.085
 rejection-0.083
 workload-0.077
Token's
Token ID338
Feature activation-9.82e-02

Pos logit contributions
 irrad+0.492
 reactor+0.477
 kinderg+0.475
 rejection+0.465
 workload+0.432
Neg logit contributions
tips-0.476
inges-0.474
interrupted-0.441
verbs-0.420
ollar-0.416
Token scored
Token ID7781
Feature activation-2.10e-03

Pos logit contributions
tips+0.173
inges+0.173
interrupted+0.161
verbs+0.154
ollar+0.154
Neg logit contributions
 irrad-0.156
 reactor-0.151
 kinderg-0.151
 rejection-0.148
 teamwork-0.135
Token at
Token ID379
Feature activation-3.99e-01

Pos logit contributions
tips+0.004
inges+0.004
interrupted+0.004
verbs+0.003
ollar+0.003
Neg logit contributions
 irrad-0.003
 reactor-0.003
 kinderg-0.003
 rejection-0.003
 workload-0.003
Token least
Token ID1551
Feature activation+2.85e-01

Pos logit contributions
inges+0.857
tips+0.856
interrupted+0.805
verbs+0.781
ollar+0.774
Neg logit contributions
 irrad-0.484
 reactor-0.463
 kinderg-0.462
 rejection-0.447
 workload-0.397
Token 25
Token ID1679
Feature activation+3.39e-01

Pos logit contributions
 irrad+0.482
 reactor+0.468
 kinderg+0.466
 rejection+0.457
 workload+0.422
Neg logit contributions
tips-0.481
inges-0.479
interrupted-0.446
verbs-0.426
ollar-0.422
Token Fantasy
Token ID10982
Feature activation-6.25e-01

Pos logit contributions
 irrad+0.637
 reactor+0.621
 kinderg+0.619
 rejection+0.608
 workload+0.566
Neg logit contributions
tips-0.508
inges-0.506
interrupted-0.466
verbs-0.442
ollar-0.438
Token points
Token ID2173
Feature activation+4.68e-01

Pos logit contributions
tips+1.253
inges+1.251
interrupted+1.168
verbs+1.138
ollar+1.127
Neg logit contributions
 irrad-0.850
 kinderg-0.818
 reactor-0.817
 rejection-0.795
 workload-0.711
Token in
Token ID287
Feature activation+8.95e-02

Pos logit contributions
 irrad+0.694
 reactor+0.672
 kinderg+0.669
 rejection+0.654
 workload+0.599
Neg logit contributions
tips-0.886
inges-0.882
interrupted-0.827
verbs-0.795
ollar-0.788
Token his
Token ID465
Feature activation+2.52e-01

Pos logit contributions
 irrad+0.138
 reactor+0.134
 kinderg+0.133
 rejection+0.130
 workload+0.119
Neg logit contributions
tips-0.164
inges-0.163
interrupted-0.152
verbs-0.146
ollar-0.145
Token past
Token ID1613
Feature activation+4.08e-01

Pos logit contributions
 irrad+0.358
 reactor+0.345
 kinderg+0.344
 rejection+0.335
 workload+0.304
Neg logit contributions
tips-0.494
inges-0.492
interrupted-0.462
verbs-0.445
ollar-0.441
Token service
Token ID2139
Feature activation-1.06e+00

Pos logit contributions
inges+0.741
tips+0.736
interrupted+0.691
verbs+0.663
ollar+0.657
Neg logit contributions
 irrad-0.566
 reactor-0.546
 kinderg-0.545
 rejection-0.534
 workload-0.483
Token,
Token ID11
Feature activation-6.12e-03

Pos logit contributions
tips+1.816
inges+1.789
interrupted+1.663
verbs+1.601
ollar+1.597
Neg logit contributions
 irrad-1.721
 kinderg-1.701
 rejection-1.655
 reactor-1.652
 workload-1.490
Token which
Token ID543
Feature activation+1.42e-01

Pos logit contributions
tips+0.012
inges+0.012
interrupted+0.012
verbs+0.011
ollar+0.011
Neg logit contributions
 irrad-0.008
 kinderg-0.008
 reactor-0.008
 rejection-0.008
 workload-0.007
Token brings
Token ID6774
Feature activation+8.34e-01

Pos logit contributions
 irrad+0.236
 reactor+0.228
 kinderg+0.227
 rejection+0.225
 workload+0.206
Neg logit contributions
tips-0.239
inges-0.238
interrupted-0.222
verbs-0.213
ippers-0.213
Token you
Token ID345
Feature activation+8.81e-02

Pos logit contributions
 irrad+0.893
 reactor+0.853
 kinderg+0.848
 rejection+0.821
 workload+0.718
Neg logit contributions
tips-1.921
inges-1.916
interrupted-1.817
verbs-1.759
ollar-1.749
Token great
Token ID1049
Feature activation+3.90e-01

Pos logit contributions
 irrad+0.131
 reactor+0.128
 kinderg+0.127
 rejection+0.123
 workload+0.112
Neg logit contributions
tips-0.165
inges-0.164
interrupted-0.154
verbs-0.148
ollar-0.146
Token games
Token ID1830
Feature activation+2.22e-01

Pos logit contributions
 irrad+0.496
 reactor+0.475
 kinderg+0.474
 rejection+0.460
 workload+0.417
Neg logit contributions
inges-0.817
tips-0.815
interrupted-0.771
verbs-0.746
ollar-0.743
Token created
Token ID2727
Feature activation-5.38e-01

Pos logit contributions
 irrad+0.439
 reactor+0.429
 kinderg+0.429
 rejection+0.421
 workload+0.393
Neg logit contributions
tips-0.310
inges-0.308
interrupted-0.282
verbs-0.265
ollar-0.263
Token for
Token ID329
Feature activation-3.03e-01

Pos logit contributions
tips+0.855
inges+0.852
interrupted+0.784
verbs+0.743
ippers+0.742
Neg logit contributions
 irrad-0.955
 reactor-0.927
 kinderg-0.924
 rejection-0.916
 workload-0.844
Token consoles
Token ID22629
Feature activation-3.18e-01

Pos logit contributions
inges+0.458
tips+0.458
interrupted+0.421
verbs+0.398
ollar+0.395
Neg logit contributions
 irrad-0.561
 reactor-0.546
 kinderg-0.544
 rejection-0.535
 workload-0.497
Token such
Token ID884
Feature activation-2.30e-01

Pos logit contributions
tips+0.538
inges+0.536
interrupted+0.498
verbs+0.476
ollar+0.472
Neg logit contributions
 irrad-0.534
 reactor-0.519
 kinderg-0.517
 rejection-0.507
 workload-0.467
Token of
Token ID286
Feature activation-1.28e-02

Pos logit contributions
tips+0.351
inges+0.349
interrupted+0.327
verbs+0.314
ollar+0.311
Neg logit contributions
 irrad-0.276
 reactor-0.265
 kinderg-0.265
 rejection-0.259
 workload-0.235
Token other
Token ID584
Feature activation-3.87e-02

Pos logit contributions
tips+0.024
inges+0.024
interrupted+0.022
verbs+0.022
ollar+0.021
Neg logit contributions
 irrad-0.019
 reactor-0.018
 kinderg-0.018
 rejection-0.018
 workload-0.016
Token OD
Token ID31245
Feature activation+1.10e-01

Pos logit contributions
tips+0.067
inges+0.067
interrupted+0.062
verbs+0.060
ollar+0.059
Neg logit contributions
 irrad-0.063
 reactor-0.061
 kinderg-0.061
 rejection-0.060
 workload-0.055
TokenIN
Token ID1268
Feature activation+5.53e-01

Pos logit contributions
 irrad+0.174
 reactor+0.169
 kinderg+0.168
 rejection+0.165
 workload+0.151
Neg logit contributions
tips-0.198
inges-0.197
interrupted-0.184
verbs-0.176
ollar-0.175
Token based
Token ID1912
Feature activation+7.39e-01

Pos logit contributions
 irrad+1.040
 reactor+1.020
 kinderg+1.012
 rejection+0.995
 workload+0.933
Neg logit contributions
tips-0.820
inges-0.820
interrupted-0.756
verbs-0.715
ollar-0.713
Token boot
Token ID6297
Feature activation+2.70e-01

Pos logit contributions
 irrad+1.437
 reactor+1.407
 kinderg+1.397
 rejection+1.375
 workload+1.290
Neg logit contributions
tips-1.056
inges-1.055
interrupted-0.967
verbs-0.916
ollar-0.909
Tokenload
Token ID2220
Feature activation-3.88e-02

Pos logit contributions
 irrad+0.369
 reactor+0.355
 kinderg+0.354
 rejection+0.345
 workload+0.310
Neg logit contributions
tips-0.545
inges-0.542
interrupted-0.509
verbs-0.491
ippers-0.486
Tokeners
Token ID364
Feature activation-4.84e-01

Pos logit contributions
tips+0.063
inges+0.062
interrupted+0.057
verbs+0.055
ollar+0.054
Neg logit contributions
 irrad-0.068
 kinderg-0.066
 reactor-0.066
 rejection-0.065
 workload-0.060
Token,
Token ID11
Feature activation-8.15e-02

Pos logit contributions
tips+0.853
inges+0.842
interrupted+0.782
verbs+0.754
ippers+0.744
Neg logit contributions
 irrad-0.783
 kinderg-0.760
 reactor-0.756
 rejection-0.734
 teamwork-0.676
Token so
Token ID523
Feature activation+2.57e-01

Pos logit contributions
tips+0.155
inges+0.154
interrupted+0.145
verbs+0.139
ollar+0.138
Neg logit contributions
 irrad-0.120
 reactor-0.116
 kinderg-0.116
 rejection-0.113
 workload-0.103
Token when
Token ID618
Feature activation+3.12e-01

Pos logit contributions
 irrad+0.381
 kinderg+0.368
 reactor+0.368
 rejection+0.358
 workload+0.325
Neg logit contributions
tips-0.487
inges-0.484
interrupted-0.454
verbs-0.436
ollar-0.432
Token she
Token ID673
Feature activation-7.50e-02

Pos logit contributions
 irrad+0.223
 reactor+0.215
 kinderg+0.213
 rejection+0.208
 workload+0.191
Neg logit contributions
tips-0.283
inges-0.282
interrupted-0.264
verbs-0.253
ollar-0.252
Token
Token ID447
Feature activation+2.74e-01

Pos logit contributions
tips+0.122
inges+0.122
interrupted+0.112
verbs+0.107
ollar+0.107
Neg logit contributions
 irrad-0.131
 reactor-0.127
 kinderg-0.127
 rejection-0.124
 workload-0.115
Token
Token ID247
Feature activation+7.32e-03

Pos logit contributions
 irrad+0.248
 reactor+0.235
 kinderg+0.233
 rejection+0.225
 workload+0.191
Neg logit contributions
tips-0.676
inges-0.676
interrupted-0.642
verbs-0.624
ollar-0.621
Tokens
Token ID82
Feature activation+2.96e-01

Pos logit contributions
 irrad+0.013
 reactor+0.013
 kinderg+0.013
 rejection+0.012
 workload+0.012
Neg logit contributions
tips-0.012
inges-0.012
interrupted-0.011
verbs-0.010
ollar-0.010
Token has
Token ID468
Feature activation+9.35e-02

Pos logit contributions
 irrad+0.437
 reactor+0.424
 kinderg+0.422
 rejection+0.412
 workload+0.376
Neg logit contributions
tips-0.558
inges-0.555
interrupted-0.521
verbs-0.501
ollar-0.497
Token some
Token ID617
Feature activation-4.59e-01

Pos logit contributions
 irrad+0.142
 reactor+0.137
 kinderg+0.136
 rejection+0.133
 workload+0.122
Neg logit contributions
tips-0.173
inges-0.173
interrupted-0.162
verbs-0.155
ollar-0.154
Token miscar
Token ID35976
Feature activation-6.82e-01

Pos logit contributions
tips+0.858
inges+0.850
interrupted+0.787
ollar+0.759
verbs+0.757
Neg logit contributions
 irrad-0.676
 reactor-0.653
 kinderg-0.643
 rejection-0.623
ific-0.578
Tokenriages
Token ID16451
Feature activation-1.16e+00

Pos logit contributions
inges+0.969
tips+0.967
interrupted+0.884
verbs+0.839
ollar+0.834
Neg logit contributions
 irrad-1.333
 reactor-1.302
 kinderg-1.297
 rejection-1.273
 workload-1.192
Token in
Token ID287
Feature activation+1.88e-01

Pos logit contributions
tips+2.024
inges+2.015
interrupted+1.874
ollar+1.795
verbs+1.785
Neg logit contributions
 irrad-1.867
 reactor-1.813
 kinderg-1.805
 rejection-1.768
 teamwork-1.621
Token the
Token ID262
Feature activation-2.12e-01

Pos logit contributions
 irrad+0.278
 reactor+0.268
 kinderg+0.266
 rejection+0.261
 workload+0.238
Neg logit contributions
tips-0.355
inges-0.354
interrupted-0.332
verbs-0.319
ollar-0.316
Token past
Token ID1613
Feature activation+5.72e-01

Pos logit contributions
tips+0.401
inges+0.398
interrupted+0.373
verbs+0.357
ippers+0.355
Neg logit contributions
 irrad-0.312
 reactor-0.300
 kinderg-0.298
 rejection-0.293
 teamwork-0.267
Token only
Token ID691
Feature activation+1.14e-01

Pos logit contributions
 irrad+0.031
 kinderg+0.030
 reactor+0.030
 rejection+0.029
ific+0.026
Neg logit contributions
tips-0.047
inges-0.047
interrupted-0.044
verbs-0.042
ippers-0.042
Token way
Token ID835
Feature activation+2.56e-01

Pos logit contributions
 irrad+0.154
 kinderg+0.149
 reactor+0.148
 rejection+0.143
 workload+0.129
Neg logit contributions
tips-0.227
inges-0.226
interrupted-0.212
verbs-0.204
ollar-0.203
Token to
Token ID284
Feature activation+2.29e-02

Pos logit contributions
 irrad+0.423
 reactor+0.410
 kinderg+0.410
 rejection+0.400
 workload+0.369
Neg logit contributions
tips-0.440
inges-0.437
interrupted-0.406
verbs-0.389
ollar-0.385
Token be
Token ID307
Feature activation+4.51e-02

Pos logit contributions
 irrad+0.032
 kinderg+0.031
 reactor+0.031
 rejection+0.030
 workload+0.027
Neg logit contributions
tips-0.045
inges-0.045
interrupted-0.042
verbs-0.040
ollar-0.040
Token sure
Token ID1654
Feature activation+3.91e-03

Pos logit contributions
 irrad+0.077
 kinderg+0.075
 reactor+0.075
 rejection+0.073
 workload+0.068
Neg logit contributions
tips-0.075
inges-0.074
interrupted-0.069
verbs-0.066
ollar-0.066
Token is
Token ID318
Feature activation+1.18e-01

Pos logit contributions
 irrad+0.007
 kinderg+0.006
 reactor+0.006
 rejection+0.006
 workload+0.006
Neg logit contributions
tips-0.007
inges-0.007
interrupted-0.006
verbs-0.006
ippers-0.006
Token to
Token ID284
Feature activation+7.54e-02

Pos logit contributions
 irrad+0.201
 kinderg+0.196
 reactor+0.196
 rejection+0.190
 workload+0.176
Neg logit contributions
tips-0.198
inges-0.196
interrupted-0.182
verbs-0.173
ollar-0.172
Token ask
Token ID1265
Feature activation-3.59e-01

Pos logit contributions
 irrad+0.107
 kinderg+0.104
 reactor+0.104
 rejection+0.101
 workload+0.092
Neg logit contributions
tips-0.145
inges-0.145
interrupted-0.136
verbs-0.130
ollar-0.130
Token the
Token ID262
Feature activation-1.67e-02

Pos logit contributions
tips+0.664
inges+0.661
interrupted+0.613
verbs+0.592
ippers+0.590
Neg logit contributions
 irrad-0.534
 reactor-0.517
 kinderg-0.515
 rejection-0.500
 workload-0.459
Token opinion
Token ID4459
Feature activation+1.07e+00

Pos logit contributions
tips+0.033
inges+0.033
interrupted+0.031
verbs+0.030
ippers+0.030
Neg logit contributions
 irrad-0.022
 kinderg-0.022
 reactor-0.021
 rejection-0.021
 teamwork-0.019
Token of
Token ID286
Feature activation-1.81e-01

Pos logit contributions
 irrad+1.488
 reactor+1.438
 kinderg+1.428
 rejection+1.397
 workload+1.263
Neg logit contributions
tips-2.139
inges-2.131
interrupted-2.008
verbs-1.932
ollar-1.916
Token to
Token ID284
Feature activation+3.52e-01

Pos logit contributions
 irrad+1.155
 reactor+1.122
 kinderg+1.117
 rejection+1.095
 workload+1.008
Neg logit contributions
tips-1.218
inges-1.213
interrupted-1.130
verbs-1.081
ollar-1.073
Token handle
Token ID5412
Feature activation+4.45e-01

Pos logit contributions
 irrad+0.473
 reactor+0.457
 kinderg+0.456
 rejection+0.444
 workload+0.399
Neg logit contributions
tips-0.714
inges-0.711
interrupted-0.669
verbs-0.645
ollar-0.641
Token multiple
Token ID3294
Feature activation+4.51e-01

Pos logit contributions
 irrad+0.646
 kinderg+0.624
 reactor+0.624
 rejection+0.604
 workload+0.548
Neg logit contributions
tips-0.854
inges-0.849
interrupted-0.795
verbs-0.765
ollar-0.759
Token spikes
Token ID27198
Feature activation-6.75e-01

Pos logit contributions
 irrad+0.645
 kinderg+0.622
 reactor+0.621
 rejection+0.602
 workload+0.545
Neg logit contributions
tips-0.876
inges-0.872
interrupted-0.817
verbs-0.787
ollar-0.780
Token in
Token ID287
Feature activation+1.54e-01

Pos logit contributions
inges+1.203
tips+1.198
interrupted+1.121
verbs+1.067
ollar+1.065
Neg logit contributions
 irrad-1.056
 kinderg-1.035
 reactor-1.027
 rejection-0.999
 teamwork-0.918
Token the
Token ID262
Feature activation-1.04e-01

Pos logit contributions
 irrad+0.217
 kinderg+0.211
 reactor+0.209
 rejection+0.202
ific+0.183
Neg logit contributions
tips-0.300
inges-0.299
interrupted-0.280
verbs-0.269
ollar-0.267
Token angle
Token ID9848
Feature activation+3.38e-01

Pos logit contributions
tips+0.210
inges+0.208
interrupted+0.196
verbs+0.189
ollar+0.187
Neg logit contributions
 irrad-0.141
 kinderg-0.137
 reactor-0.135
 rejection-0.130
ific-0.118
Token-
Token ID12
Feature activation+7.24e-03

Pos logit contributions
 irrad+0.546
 reactor+0.529
 kinderg+0.528
 rejection+0.516
 workload+0.475
Neg logit contributions
tips-0.594
inges-0.592
interrupted-0.552
verbs-0.528
ollar-0.524
Tokenof
Token ID1659
Feature activation-1.61e-01

Pos logit contributions
 irrad+0.012
 kinderg+0.012
 reactor+0.012
 rejection+0.011
 workload+0.010
Neg logit contributions
tips-0.013
inges-0.012
interrupted-0.012
verbs-0.011
ollar-0.011
Token-
Token ID12
Feature activation+1.48e-02

Pos logit contributions
tips+0.234
inges+0.232
interrupted+0.213
verbs+0.202
ollar+0.200
Neg logit contributions
 irrad-0.311
 reactor-0.303
 kinderg-0.303
 rejection-0.296
 workload-0.277
Tokenattack
Token ID20358
Feature activation-7.61e-01

Pos logit contributions
 irrad+0.018
 kinderg+0.018
 reactor+0.018
 rejection+0.017
 workload+0.015
Neg logit contributions
tips-0.032
inges-0.031
interrupted-0.030
verbs-0.029
ollar-0.029
Token By
Token ID2750
Feature activation-8.45e-01

Pos logit contributions
 irrad+0.406
 reactor+0.395
 kinderg+0.393
 rejection+0.386
 workload+0.355
Neg logit contributions
tips-0.413
inges-0.412
interrupted-0.383
verbs-0.366
ollar-0.363
Token Christmas
Token ID6786
Feature activation-6.06e-01

Pos logit contributions
tips+1.519
inges+1.510
interrupted+1.411
verbs+1.344
ollar+1.341
Neg logit contributions
 irrad-1.311
 reactor-1.275
 kinderg-1.274
 rejection-1.234
 workload-1.139
Token 10
Token ID838
Feature activation+2.89e-01

Pos logit contributions
tips+1.089
inges+1.085
interrupted+1.014
verbs+0.965
ollar+0.963
Neg logit contributions
 irrad-0.952
 reactor-0.923
 kinderg-0.921
 rejection-0.897
 workload-0.824
Token69
Token ID3388
Feature activation+1.08e-01

Pos logit contributions
 irrad+0.584
 reactor+0.570
 kinderg+0.568
 rejection+0.559
 workload+0.523
Neg logit contributions
tips-0.393
inges-0.390
interrupted-0.356
verbs-0.337
ollar-0.334
Token,
Token ID11
Feature activation+1.93e-01

Pos logit contributions
 irrad+0.169
 reactor+0.163
 kinderg+0.163
 rejection+0.159
 workload+0.146
Neg logit contributions
tips-0.197
inges-0.196
interrupted-0.184
verbs-0.176
ollar-0.175
Token he
Token ID339
Feature activation+1.98e-01

Pos logit contributions
 irrad+0.302
 reactor+0.292
 kinderg+0.292
 rejection+0.284
 workload+0.261
Neg logit contributions
tips-0.349
inges-0.348
interrupted-0.326
verbs-0.312
ollar-0.310
Token entered
Token ID5982
Feature activation+4.62e-01

Pos logit contributions
 irrad+0.345
 reactor+0.336
 kinderg+0.335
 rejection+0.329
 workload+0.304
Neg logit contributions
tips-0.323
inges-0.322
interrupted-0.299
verbs-0.285
ollar-0.283
Token the
Token ID262
Feature activation+7.95e-02

Pos logit contributions
 irrad+0.741
 reactor+0.719
 kinderg+0.716
 rejection+0.702
 workload+0.643
Neg logit contributions
tips-0.822
inges-0.817
interrupted-0.763
verbs-0.732
ollar-0.726
Token ruined
Token ID20484
Feature activation-5.91e-01

Pos logit contributions
 irrad+0.120
 reactor+0.116
 kinderg+0.116
 rejection+0.113
 workload+0.103
Neg logit contributions
tips-0.148
inges-0.147
interrupted-0.138
verbs-0.132
ollar-0.131
Token city
Token ID1748
Feature activation+4.59e-01

Pos logit contributions
inges+1.157
tips+1.151
interrupted+1.072
verbs+1.032
ollar+1.027
Neg logit contributions
 irrad-0.829
 kinderg-0.800
 reactor-0.794
 rejection-0.780
 workload-0.708
Token and
Token ID290
Feature activation+2.64e-01

Pos logit contributions
 irrad+0.714
 reactor+0.692
 kinderg+0.690
 rejection+0.675
 workload+0.619
Neg logit contributions
tips-0.834
inges-0.833
interrupted-0.778
verbs-0.744
ollar-0.739
Token afforded
Token ID28109
Feature activation+8.02e-02

Pos logit contributions
 irrad+0.453
 reactor+0.438
 kinderg+0.437
 rejection+0.426
 workload+0.388
Neg logit contributions
tips-0.589
inges-0.587
interrupted-0.550
verbs-0.528
ollar-0.524
Token on
Token ID319
Feature activation-8.02e-02

Pos logit contributions
 irrad+0.130
 reactor+0.126
 kinderg+0.126
 rejection+0.123
 workload+0.113
Neg logit contributions
tips-0.140
inges-0.140
interrupted-0.130
verbs-0.124
ollar-0.123
Token both
Token ID1111
Feature activation+1.18e+00

Pos logit contributions
tips+0.152
inges+0.152
interrupted+0.142
verbs+0.136
ollar+0.135
Neg logit contributions
 irrad-0.118
 reactor-0.114
 kinderg-0.114
 rejection-0.111
 workload-0.101
Token sides
Token ID5389
Feature activation-1.21e-01

Pos logit contributions
 irrad+1.336
 reactor+1.277
 kinderg+1.273
 rejection+1.234
 workload+1.092
Neg logit contributions
tips-2.640
inges-2.628
interrupted-2.504
verbs-2.425
ollar-2.410
Token of
Token ID286
Feature activation+7.32e-02

Pos logit contributions
tips+0.229
inges+0.228
interrupted+0.213
verbs+0.204
ollar+0.204
Neg logit contributions
 irrad-0.178
 kinderg-0.172
 reactor-0.172
 rejection-0.168
 workload-0.153
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
 irrad+0.109
 reactor+0.106
 kinderg+0.105
 rejection+0.103
 workload+0.094
Neg logit contributions
tips-0.138
inges-0.137
interrupted-0.129
verbs-0.123
ollar-0.122
Token building
Token ID2615
Feature activation-2.83e-01

Pos logit contributions
 irrad+0.203
 reactor+0.196
 kinderg+0.195
 rejection+0.191
 workload+0.173
Neg logit contributions
tips-0.278
inges-0.277
interrupted-0.260
verbs-0.250
ollar-0.248
Token,
Token ID11
Feature activation+1.86e-01

Pos logit contributions
tips+0.503
inges+0.500
interrupted+0.466
verbs+0.445
ollar+0.441
Neg logit contributions
 irrad-0.453
 kinderg-0.439
 reactor-0.437
 rejection-0.427
 workload-0.392
Token with
Token ID351
Feature activation+1.63e-01

Pos logit contributions
 irrad+0.279
 reactor+0.270
 kinderg+0.269
 rejection+0.263
 workload+0.240
Neg logit contributions
tips-0.347
inges-0.346
interrupted-0.324
verbs-0.311
ollar-0.309
Token a
Token ID257
Feature activation+2.05e-01

Pos logit contributions
 irrad+0.238
 reactor+0.230
 kinderg+0.229
 rejection+0.224
 workload+0.203
Neg logit contributions
tips-0.312
inges-0.312
interrupted-0.292
verbs-0.280
ollar-0.278
Token 28
Token ID2579
Feature activation+2.17e-01

Pos logit contributions
 irrad+0.300
 reactor+0.289
 kinderg+0.288
 rejection+0.281
 workload+0.255
Neg logit contributions
tips-0.392
inges-0.390
interrupted-0.366
verbs-0.352
ollar-0.349
Token
Token ID198
Feature activation+9.76e-02

Pos logit contributions
 irrad+0.224
 reactor+0.218
 kinderg+0.217
 rejection+0.213
 workload+0.196
Neg logit contributions
tips-0.229
inges-0.228
interrupted-0.212
verbs-0.203
ollar-0.201
TokenWin
Token ID16643
Feature activation+8.29e-01

Pos logit contributions
 irrad+0.157
 reactor+0.152
 kinderg+0.151
 rejection+0.148
 workload+0.136
Neg logit contributions
tips-0.174
inges-0.173
interrupted-0.161
verbs-0.155
ollar-0.153
Tokenners
Token ID2741
Feature activation-2.90e-01

Pos logit contributions
 irrad+1.445
 reactor+1.405
 kinderg+1.401
 rejection+1.374
 workload+1.271
Neg logit contributions
tips-1.351
inges-1.346
interrupted-1.248
verbs-1.190
ollar-1.180
Token in
Token ID287
Feature activation-1.18e-01

Pos logit contributions
tips+0.514
inges+0.513
interrupted+0.477
verbs+0.457
ollar+0.454
Neg logit contributions
 irrad-0.463
 reactor-0.449
 kinderg-0.446
 rejection-0.436
 workload-0.401
Token each
Token ID1123
Feature activation+7.76e-01

Pos logit contributions
tips+0.213
inges+0.213
interrupted+0.199
verbs+0.191
ollar+0.190
Neg logit contributions
 irrad-0.183
 reactor-0.178
 kinderg-0.177
 rejection-0.173
 workload-0.158
Token category
Token ID6536
Feature activation-1.45e+00

Pos logit contributions
 irrad+1.293
 reactor+1.256
 kinderg+1.250
 rejection+1.230
 workload+1.132
Neg logit contributions
tips-1.329
inges-1.324
interrupted-1.231
verbs-1.180
ollar-1.167
Token are
Token ID389
Feature activation-9.32e-02

Pos logit contributions
tips+2.502
inges+2.482
interrupted+2.304
verbs+2.202
ippers+2.199
Neg logit contributions
 irrad-2.334
 kinderg-2.266
 reactor-2.265
 rejection-2.189
 workload-2.027
Token marked
Token ID7498
Feature activation-1.32e+00

Pos logit contributions
tips+0.171
inges+0.170
interrupted+0.158
ollar+0.152
verbs+0.151
Neg logit contributions
 irrad-0.143
 reactor-0.139
 kinderg-0.138
 rejection-0.135
 workload-0.124
Token with
Token ID351
Feature activation-1.38e-02

Pos logit contributions
tips+2.179
inges+2.148
interrupted+1.976
ollar+1.901
verbs+1.895
Neg logit contributions
 irrad-2.230
 reactor-2.185
 kinderg-2.185
 rejection-2.133
 workload-1.965
Token a
Token ID257
Feature activation-2.72e-04

Pos logit contributions
tips+0.026
inges+0.026
interrupted+0.024
verbs+0.023
ollar+0.023
Neg logit contributions
 irrad-0.020
 reactor-0.020
 kinderg-0.020
 rejection-0.019
 workload-0.018
Token star
Token ID3491
Feature activation-1.04e+00

Pos logit contributions
tips+0.001
inges+0.001
interrupted+0.000
verbs+0.000
ollar+0.000
Neg logit contributions
 irrad-0.000
 reactor-0.000
 kinderg-0.000
 rejection-0.000
 workload-0.000
Token was
Token ID373
Feature activation+1.67e-01

Pos logit contributions
 irrad+0.607
 reactor+0.591
 kinderg+0.588
 rejection+0.577
 workload+0.533
Neg logit contributions
tips-0.579
inges-0.577
interrupted-0.535
verbs-0.511
ollar-0.507
Token the
Token ID262
Feature activation+7.29e-02

Pos logit contributions
 irrad+0.262
 reactor+0.255
 kinderg+0.253
 rejection+0.248
 workload+0.227
Neg logit contributions
tips-0.301
inges-0.299
interrupted-0.281
verbs-0.268
ollar-0.266
Token redacted
Token ID44740
Feature activation+1.03e+00

Pos logit contributions
 irrad+0.105
 reactor+0.102
 kinderg+0.101
 rejection+0.099
 workload+0.090
Neg logit contributions
tips-0.139
inges-0.139
interrupted-0.130
verbs-0.124
ollar-0.124
Token Cleveland
Token ID10306
Feature activation-4.33e-01

Pos logit contributions
 irrad+1.381
 reactor+1.328
 kinderg+1.322
 rejection+1.286
 workload+1.159
Neg logit contributions
tips-2.104
inges-2.099
interrupted-1.970
verbs-1.897
ollar-1.890
Token source
Token ID2723
Feature activation+7.54e-01

Pos logit contributions
inges+0.865
tips+0.863
interrupted+0.811
ippers+0.780
ollar+0.775
Neg logit contributions
 irrad-0.583
 reactor-0.562
 kinderg-0.556
 rejection-0.539
 workload-0.492
Token responsible
Token ID4497
Feature activation-1.50e+00

Pos logit contributions
 irrad+1.174
 reactor+1.137
 kinderg+1.133
 rejection+1.106
 workload+1.014
Neg logit contributions
tips-1.370
inges-1.366
interrupted-1.276
verbs-1.223
ollar-1.216
Token for
Token ID329
Feature activation-1.14e-01

Pos logit contributions
tips+2.314
inges+2.309
interrupted+2.140
verbs+2.011
ippers+2.011
Neg logit contributions
 irrad-2.703
 kinderg-2.626
 reactor-2.620
 rejection-2.569
 workload-2.393
Token the
Token ID262
Feature activation+4.30e-02

Pos logit contributions
tips+0.213
inges+0.212
interrupted+0.199
verbs+0.190
ollar+0.190
Neg logit contributions
 irrad-0.168
 reactor-0.164
 kinderg-0.163
 rejection-0.158
 workload-0.145
Token initial
Token ID4238
Feature activation+1.63e-02

Pos logit contributions
 irrad+0.062
 reactor+0.060
 kinderg+0.060
 rejection+0.058
 workload+0.052
Neg logit contributions
tips-0.083
inges-0.083
interrupted-0.078
verbs-0.074
ippers-0.074
Token African
Token ID5510
Feature activation-4.95e-02

Pos logit contributions
 irrad+0.023
 reactor+0.022
 kinderg+0.022
 rejection+0.021
 workload+0.019
Neg logit contributions
tips-0.032
inges-0.032
interrupted-0.030
verbs-0.028
ippers-0.028
Token slurs
Token ID44981
Feature activation-7.80e-01

Pos logit contributions
tips+0.086
inges+0.086
interrupted+0.080
verbs+0.076
ollar+0.076
Neg logit contributions
 irrad-0.081
 reactor-0.078
 kinderg-0.078
 rejection-0.076
 workload-0.070
Token time
Token ID640
Feature activation-9.44e-01

Pos logit contributions
tips+0.655
inges+0.652
interrupted+0.614
verbs+0.590
ollar+0.588
Neg logit contributions
 irrad-0.436
 kinderg-0.416
 reactor-0.415
 rejection-0.404
 workload-0.366
Token your
Token ID534
Feature activation-6.11e-02

Pos logit contributions
tips+1.714
inges+1.709
interrupted+1.595
verbs+1.524
ollar+1.515
Neg logit contributions
 irrad-1.452
 kinderg-1.399
 reactor-1.396
 rejection-1.368
 workload-1.249
Token HS
Token ID18070
Feature activation-1.81e-02

Pos logit contributions
tips+0.123
inges+0.123
interrupted+0.115
verbs+0.111
ollar+0.110
Neg logit contributions
 irrad-0.082
 reactor-0.078
 kinderg-0.078
 rejection-0.076
 workload-0.069
Token when
Token ID618
Feature activation+3.19e-01

Pos logit contributions
inges+0.034
tips+0.034
interrupted+0.032
verbs+0.031
ollar+0.031
Neg logit contributions
 irrad-0.027
 kinderg-0.025
 reactor-0.025
 rejection-0.025
 workload-0.023
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
 irrad+0.434
 reactor+0.419
 kinderg+0.417
 rejection+0.406
 workload+0.367
Neg logit contributions
tips-0.641
inges-0.639
interrupted-0.602
verbs-0.579
ollar-0.574
Token asc
Token ID10570
Feature activation-1.41e+00

Pos logit contributions
inges+0.229
tips+0.229
interrupted+0.215
verbs+0.206
ollar+0.204
Neg logit contributions
 irrad-0.163
 kinderg-0.157
 reactor-0.156
 rejection-0.152
 teamwork-0.137
Tokenension
Token ID3004
Feature activation-7.76e-01

Pos logit contributions
inges+2.770
tips+2.720
interrupted+2.554
ippers+2.513
verbs+2.467
Neg logit contributions
 irrad-1.959
 kinderg-1.928
 reactor-1.898
 rejection-1.807
 workload-1.674
Token will
Token ID481
Feature activation-3.47e-02

Pos logit contributions
tips+1.237
inges+1.234
interrupted+1.146
ippers+1.073
ollar+1.072
Neg logit contributions
 irrad-1.357
 kinderg-1.318
 reactor-1.312
 rejection-1.278
 workload-1.184
Token happen
Token ID1645
Feature activation+3.84e-01

Pos logit contributions
tips+0.058
inges+0.058
interrupted+0.054
verbs+0.051
ollar+0.051
Neg logit contributions
 irrad-0.059
 reactor-0.058
 kinderg-0.057
 rejection-0.056
 workload-0.052
Token as
Token ID355
Feature activation+7.33e-02

Pos logit contributions
 irrad+0.601
 reactor+0.582
 kinderg+0.580
 rejection+0.568
 workload+0.520
Neg logit contributions
tips-0.696
inges-0.693
interrupted-0.648
verbs-0.621
ollar-0.616
Token he
Token ID339
Feature activation-3.19e-01

Pos logit contributions
 irrad+0.111
 reactor+0.107
 kinderg+0.107
 rejection+0.104
 workload+0.095
Neg logit contributions
tips-0.136
inges-0.136
interrupted-0.127
verbs-0.122
ollar-0.121
Token.
Token ID13
Feature activation-6.76e-02

Pos logit contributions
tips+1.479
inges+1.466
interrupted+1.370
verbs+1.308
ollar+1.296
Neg logit contributions
 irrad-1.403
 reactor-1.362
 kinderg-1.361
 rejection-1.318
 workload-1.224
Token That
Token ID1320
Feature activation-2.07e-01

Pos logit contributions
tips+0.093
inges+0.092
interrupted+0.084
verbs+0.080
ollar+0.079
Neg logit contributions
 irrad-0.136
 reactor-0.132
 kinderg-0.132
 rejection-0.130
 workload-0.121
Token
Token ID447
Feature activation+6.90e-02

Pos logit contributions
tips+0.312
inges+0.311
interrupted+0.287
verbs+0.272
ollar+0.269
Neg logit contributions
 irrad-0.385
 reactor-0.374
 kinderg-0.373
 rejection-0.366
 workload-0.341
Token
Token ID247
Feature activation+8.58e-02

Pos logit contributions
 irrad+0.074
 reactor+0.071
 kinderg+0.071
 rejection+0.069
 workload+0.060
Neg logit contributions
inges-0.158
tips-0.158
interrupted-0.149
verbs-0.145
ollar-0.144
Tokens
Token ID82
Feature activation+2.20e-01

Pos logit contributions
 irrad+0.150
 reactor+0.146
 kinderg+0.145
 rejection+0.142
 workload+0.132
Neg logit contributions
tips-0.140
inges-0.139
interrupted-0.129
verbs-0.123
ollar-0.122
Token what
Token ID644
Feature activation-1.47e+00

Pos logit contributions
 irrad+0.328
 reactor+0.318
 kinderg+0.316
 rejection+0.307
 workload+0.282
Neg logit contributions
tips-0.413
inges-0.411
interrupted-0.384
verbs-0.370
ollar-0.367
Token I
Token ID314
Feature activation-3.39e-01

Pos logit contributions
tips+2.361
inges+2.359
interrupted+2.181
verbs+2.096
ollar+2.055
Neg logit contributions
 irrad-2.553
 reactor-2.481
 kinderg-2.430
 rejection-2.396
 workload-2.246
Token want
Token ID765
Feature activation-3.67e-01

Pos logit contributions
inges+0.584
tips+0.584
interrupted+0.545
verbs+0.524
ollar+0.520
Neg logit contributions
 irrad-0.550
 reactor-0.536
 kinderg-0.535
 rejection-0.521
 workload-0.481
Token to
Token ID284
Feature activation-4.92e-02

Pos logit contributions
tips+0.617
inges+0.614
interrupted+0.573
verbs+0.547
ollar+0.546
Neg logit contributions
 irrad-0.617
 reactor-0.595
 kinderg-0.590
 rejection-0.580
 workload-0.538
Token be
Token ID307
Feature activation-6.20e-02

Pos logit contributions
tips+0.087
inges+0.087
interrupted+0.081
verbs+0.078
ollar+0.077
Neg logit contributions
 irrad-0.078
 reactor-0.076
 kinderg-0.076
 rejection-0.074
 workload-0.068
Token known
Token ID1900
Feature activation-1.31e+00

Pos logit contributions
tips+0.108
inges+0.108
interrupted+0.101
verbs+0.097
ollar+0.096
Neg logit contributions
 irrad-0.100
 reactor-0.097
 kinderg-0.097
 rejection-0.095
 workload-0.088
Token you
Token ID345
Feature activation+7.62e-03

Pos logit contributions
tips+0.360
inges+0.360
interrupted+0.337
verbs+0.325
ollar+0.321
Neg logit contributions
 irrad-0.260
 reactor-0.252
 kinderg-0.251
 rejection-0.244
 workload-0.221
Token are
Token ID389
Feature activation-2.28e-01

Pos logit contributions
 irrad+0.011
 kinderg+0.011
 reactor+0.011
 rejection+0.011
 workload+0.010
Neg logit contributions
tips-0.014
inges-0.014
interrupted-0.013
verbs-0.013
ollar-0.013
Token pretty
Token ID2495
Feature activation+1.58e+00

Pos logit contributions
tips+0.431
inges+0.428
interrupted+0.403
verbs+0.385
ollar+0.385
Neg logit contributions
 irrad-0.332
 reactor-0.324
 kinderg-0.324
 rejection-0.316
 teamwork-0.286
Token much
Token ID881
Feature activation-2.42e-01

Pos logit contributions
 irrad+2.113
 reactor+2.026
 kinderg+2.016
 rejection+1.969
 workload+1.775
Neg logit contributions
tips-3.217
inges-3.210
interrupted-3.018
verbs-2.913
ollar-2.887
Token a
Token ID257
Feature activation+5.47e-02

Pos logit contributions
tips+0.449
inges+0.446
interrupted+0.421
ollar+0.402
verbs+0.402
Neg logit contributions
 irrad-0.359
 reactor-0.351
 kinderg-0.350
 rejection-0.341
 teamwork-0.312
Token wreck
Token ID15228
Feature activation-1.51e+00

Pos logit contributions
 irrad+0.077
 reactor+0.074
 kinderg+0.073
 rejection+0.071
 workload+0.065
Neg logit contributions
tips-0.107
inges-0.106
interrupted-0.101
verbs-0.097
ollar-0.096
Token.
Token ID13
Feature activation-2.42e-01

Pos logit contributions
tips+2.547
inges+2.543
interrupted+2.347
ollar+2.239
verbs+2.216
Neg logit contributions
 irrad-2.466
 reactor-2.406
 kinderg-2.402
 rejection-2.355
 workload-2.177
Token
Token ID198
Feature activation-2.18e-01

Pos logit contributions
tips+0.416
inges+0.411
interrupted+0.380
verbs+0.366
ollar+0.364
Neg logit contributions
 irrad-0.405
 reactor-0.392
 kinderg-0.391
 rejection-0.381
 workload-0.354
Token
Token ID198
Feature activation-4.70e-02

Pos logit contributions
tips+0.366
inges+0.362
interrupted+0.336
verbs+0.323
ollar+0.320
Neg logit contributions
 irrad-0.371
 reactor-0.360
 kinderg-0.359
 rejection-0.351
 workload-0.325
TokenThe
Token ID464
Feature activation-6.27e-02

Pos logit contributions
tips+0.083
inges+0.082
interrupted+0.076
verbs+0.073
ollar+0.073
Neg logit contributions
 irrad-0.077
 reactor-0.075
 kinderg-0.074
 rejection-0.072
 workload-0.067
Token whole
Token ID2187
Feature activation+6.60e-01

Pos logit contributions
tips+0.128
inges+0.128
interrupted+0.120
verbs+0.116
ollar+0.116
Neg logit contributions
 irrad-0.083
 reactor-0.080
 kinderg-0.079
 rejection-0.077
 teamwork-0.069