TOP ACTIVATIONS
MAX = 0.427

ans 12 19 / Inst agram Now that Jen
and flying from B ourn emouth on December 22 .
130 95 <|endoftext|> Donald Mir alle / Getty Images
of 62 mm , the Garg ant ua feels like it
Can Gabriel " F alle N " Toledo and company
Cup tie at B ourn emouth . We went
on the flip side the Garg ant ua is a really
the visit of B ourn emouth to The Hawth orns .
plastics listed here , the Garg ant ua is probably one
ulo . Tom Garg i ulo : I think
really want anti - m alle ability fixes ( which would
Under Mr . Ram í rez , PD V SA has
t be one dec ibel softer , he
cent ering The Garg ant ua is the first
that question to investor Tom Garg i ulo .
The dilemma is that the Garg ant ua s
Y oy ore creation Garg ant ua D
; Mr . Ram í rez also lost his job as
affairs , Rafael Ram í rez , with an army general
, like Dragon ic Di agram or Dec ode Talk er

MIN ACTIVATIONS
MIN = -0.516

some tweets sent by pee ved p oth oles in Panama
Test Kitchen - App ro ved Author Notes :
extr usion v . ver tex . xy z += v
V SA has not thri ved . Oil exports have fallen
an Arc , an As ynchronous Reference Counter . We use
es Ascend player . Sym tex in sc 2 , cs
. The company has stri ved to make the phone easier
for Snap . Meanwhile Flat pak has not enjoyed a widespread
is being treated in some quarters as mission failure . That
) Fl at pak , Snap and App Image
the loss of around three quarters of a million jobs ,
ully / Vict im Question naire [ M imeo ]. Ber
) { v . ver tex . xy z += v
". Fl at pak 's website offers the following
recommendation quickly became out of date . It 's all a
Institution report , Lead ing Beyond Limits : May
poll is the first to date of the proposed legislation ,
. Lead the lady through the quad
room - tem perature super conduct ivity has remained so elusive
new package formats , Flat pak and Snap ( aka Sn

INTERVAL 0.191 - 0.427
CONTAINS 0.727%

otherwise a her ky - jer ky , uneven debate .
is about 12 hours off winning a Stanley Cup and obviously
the legal standard required to effect an arrest ,
the administration s bold est experiment to date involves
green lighting its first min iser ies since 2009 s

INTERVAL -0.044 - 0.191
CONTAINS 81.706%

overall increase is down to a booming population : some 8
his second place finish as a success , whereas they are
remind us of the power of love , even if we
The proposed conversion to L NG has major drawbacks .
Next was a bit of gre enery . I started by

INTERVAL -0.280 - -0.044
CONTAINS 17.479%

adopted by the Ukrainian Parliament on 25 September . The Minister
. That is a real tank that has crashed into the
5 . O - O { ( 0 s )} c
nearly 50 %. But the congestion is also spreading out .
't I tell you I 'd burn your ( re ar

INTERVAL -0.516 - -0.280
CONTAINS 0.088%

San Francisco Board of Super visors San Francisco ,
2011 , the exp iry date is deemed to be the
and J erv is Bay Territory , a naval base and
Red Christmas album .[ c itation needed ] As
- flight operation , tele metry system modifications to increase failure
Tokenans
Token ID504
Feature activation+1.02e-01

Pos logit contributions
orf+0.179
 COURT+0.173
icho+0.165
 Garg+0.163
emouth+0.163
Neg logit contributions
PB-0.194
fters-0.192
 cancellation-0.188
Lead-0.186
Writer-0.180
Token12
Token ID1065
Feature activation-2.81e-02

Pos logit contributions
orf+0.065
 COURT+0.063
icho+0.060
emouth+0.059
 Garg+0.059
Neg logit contributions
PB-0.069
fters-0.067
 cancellation-0.066
Lead-0.065
Writer-0.063
Token19
Token ID1129
Feature activation-4.88e-02

Pos logit contributions
PB+0.018
fters+0.017
 cancellation+0.017
Lead+0.017
Writer+0.016
Neg logit contributions
orf-0.019
 COURT-0.018
icho-0.018
emouth-0.017
 Garg-0.017
Token/
Token ID14
Feature activation+8.92e-03

Pos logit contributions
PB+0.033
fters+0.032
 cancellation+0.031
Lead+0.031
Writer+0.030
Neg logit contributions
orf-0.031
 COURT-0.030
icho-0.029
emouth-0.028
 Garg-0.028
TokenInst
Token ID6310
Feature activation+1.95e-02

Pos logit contributions
orf+0.006
 COURT+0.006
icho+0.006
emouth+0.006
 Garg+0.006
Neg logit contributions
PB-0.006
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.005
Tokenagram
Token ID6713
Feature activation+4.27e-01

Pos logit contributions
orf+0.006
 COURT+0.005
icho+0.005
emouth+0.004
 Garg+0.004
Neg logit contributions
PB-0.020
fters-0.020
 cancellation-0.019
Lead-0.019
Writer-0.019
Token
Token ID198
Feature activation+6.32e-02

Pos logit contributions
orf+0.270
 COURT+0.259
icho+0.249
 Garg+0.245
emouth+0.245
Neg logit contributions
PB-0.287
fters-0.284
 cancellation-0.278
Lead-0.274
Writer-0.265
Token
Token ID198
Feature activation+5.39e-02

Pos logit contributions
orf+0.039
 COURT+0.037
icho+0.036
emouth+0.036
 Garg+0.036
Neg logit contributions
PB-0.044
fters-0.043
Lead-0.042
 cancellation-0.042
Writer-0.041
TokenNow
Token ID3844
Feature activation+9.83e-02

Pos logit contributions
orf+0.040
 COURT+0.038
icho+0.037
 Garg+0.036
emouth+0.036
Neg logit contributions
PB-0.031
fters-0.030
 cancellation-0.030
Lead-0.029
Writer-0.028
Token that
Token ID326
Feature activation+1.62e-02

Pos logit contributions
orf+0.063
 COURT+0.061
icho+0.058
emouth+0.057
 Garg+0.057
Neg logit contributions
PB-0.065
fters-0.064
 cancellation-0.063
Lead-0.062
Writer-0.060
Token Jen
Token ID13364
Feature activation+1.04e-01

Pos logit contributions
orf+0.010
 COURT+0.010
icho+0.009
emouth+0.009
 Garg+0.009
Neg logit contributions
PB-0.011
fters-0.011
 cancellation-0.011
Lead-0.011
Writer-0.010
Token and
Token ID290
Feature activation+1.09e-02

Pos logit contributions
orf+0.045
 COURT+0.044
icho+0.042
emouth+0.041
 Garg+0.041
Neg logit contributions
PB-0.046
fters-0.045
 cancellation-0.045
Lead-0.044
Writer-0.042
Token flying
Token ID7348
Feature activation+8.99e-02

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.006
emouth+0.006
 Garg+0.006
Neg logit contributions
PB-0.007
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.007
Token from
Token ID422
Feature activation-7.59e-04

Pos logit contributions
orf+0.059
 COURT+0.056
icho+0.054
emouth+0.053
 Garg+0.053
Neg logit contributions
PB-0.059
fters-0.058
 cancellation-0.057
Lead-0.056
Writer-0.054
Token B
Token ID347
Feature activation+5.57e-02

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
emouth-0.000
 Garg-0.000
Tokenourn
Token ID1798
Feature activation-7.79e-02

Pos logit contributions
orf+0.036
 COURT+0.035
 Garg+0.033
icho+0.033
emouth+0.033
Neg logit contributions
PB-0.036
fters-0.036
 cancellation-0.035
Lead-0.035
Writer-0.033
Tokenemouth
Token ID46880
Feature activation+4.22e-01

Pos logit contributions
PB+0.094
fters+0.093
 cancellation+0.093
Lead+0.092
Writer+0.090
Neg logit contributions
orf-0.007
 COURT-0.005
icho-0.003
 Garg-0.002
alle-0.001
Token on
Token ID319
Feature activation-7.37e-03

Pos logit contributions
orf+0.281
 COURT+0.270
icho+0.262
 Garg+0.255
emouth+0.252
Neg logit contributions
PB-0.269
fters-0.264
 cancellation-0.263
Lead-0.255
Writer-0.245
Token December
Token ID3426
Feature activation+2.24e-02

Pos logit contributions
PB+0.005
 cancellation+0.005
fters+0.005
Lead+0.004
Writer+0.004
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.005
emouth-0.005
 Garg-0.004
Token 22
Token ID2534
Feature activation-1.03e-01

Pos logit contributions
orf+0.017
 COURT+0.016
icho+0.016
emouth+0.016
 Garg+0.015
Neg logit contributions
PB-0.013
fters-0.012
 cancellation-0.012
Lead-0.012
Writer-0.011
Token.
Token ID13
Feature activation+3.40e-02

Pos logit contributions
PB+0.064
fters+0.063
 cancellation+0.062
Lead+0.061
Writer+0.058
Neg logit contributions
orf-0.070
 COURT-0.067
icho-0.065
 Garg-0.064
emouth-0.064
Token
Token ID198
Feature activation+2.25e-02

Pos logit contributions
orf+0.021
 COURT+0.020
icho+0.020
emouth+0.019
 Garg+0.019
Neg logit contributions
PB-0.023
fters-0.023
 cancellation-0.022
Lead-0.022
Writer-0.021
Token130
Token ID12952
Feature activation+1.27e-01

Pos logit contributions
orf+0.007
 COURT+0.006
icho+0.006
 Garg+0.006
emouth+0.006
Neg logit contributions
PB-0.006
fters-0.006
 cancellation-0.006
Lead-0.005
Writer-0.005
Token95
Token ID3865
Feature activation+9.15e-02

Pos logit contributions
orf+0.089
 COURT+0.085
icho+0.082
emouth+0.081
 Garg+0.081
Neg logit contributions
PB-0.078
fters-0.076
 cancellation-0.075
Lead-0.074
Writer-0.071
Token<|endoftext|>
Token ID50256
Feature activation+3.55e-02

Pos logit contributions
orf+0.059
 COURT+0.057
icho+0.054
 Garg+0.054
emouth+0.054
Neg logit contributions
PB-0.060
fters-0.060
 cancellation-0.058
Lead-0.058
Writer-0.056
TokenDonald
Token ID7371
Feature activation+6.55e-03

Pos logit contributions
orf+0.026
 COURT+0.025
icho+0.024
 Garg+0.024
emouth+0.024
Neg logit contributions
PB-0.020
fters-0.020
 cancellation-0.019
Lead-0.019
Writer-0.018
Token Mir
Token ID7381
Feature activation+1.92e-01

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.004
Tokenalle
Token ID6765
Feature activation+4.20e-01

Pos logit contributions
orf+0.125
 COURT+0.122
icho+0.116
emouth+0.115
 Garg+0.115
Neg logit contributions
PB-0.123
fters-0.121
 cancellation-0.120
Lead-0.118
Writer-0.113
Token/
Token ID14
Feature activation-1.88e-02

Pos logit contributions
orf+0.267
 COURT+0.257
icho+0.248
emouth+0.244
 Garg+0.242
Neg logit contributions
PB-0.279
fters-0.276
 cancellation-0.271
Lead-0.266
Writer-0.259
TokenGetty
Token ID6633
Feature activation-1.30e-01

Pos logit contributions
PB+0.012
fters+0.012
 cancellation+0.012
Lead+0.012
Writer+0.011
Neg logit contributions
orf-0.012
 COURT-0.012
icho-0.011
emouth-0.011
 Garg-0.011
Token Images
Token ID5382
Feature activation-2.65e-02

Pos logit contributions
PB+0.099
fters+0.098
 cancellation+0.096
Lead+0.095
Writer+0.092
Neg logit contributions
orf-0.070
 COURT-0.067
icho-0.064
 Garg-0.063
emouth-0.063
Token
Token ID198
Feature activation-3.40e-03

Pos logit contributions
PB+0.018
fters+0.018
 cancellation+0.018
Lead+0.018
Writer+0.017
Neg logit contributions
orf-0.016
 COURT-0.015
icho-0.015
emouth-0.015
 Garg-0.015
Token
Token ID198
Feature activation+2.96e-03

Pos logit contributions
PB+0.002
fters+0.002
Lead+0.002
 cancellation+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
emouth-0.002
 Garg-0.002
Token of
Token ID286
Feature activation-4.02e-03

Pos logit contributions
orf+0.013
 COURT+0.013
icho+0.012
 Garg+0.012
emouth+0.012
Neg logit contributions
PB-0.013
fters-0.013
 cancellation-0.012
Lead-0.012
Writer-0.012
Token 62
Token ID8190
Feature activation-2.52e-02

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.003
 Garg-0.003
emouth-0.003
Tokenmm
Token ID3020
Feature activation-4.36e-02

Pos logit contributions
PB+0.015
fters+0.015
 cancellation+0.015
Lead+0.015
Writer+0.014
Neg logit contributions
orf-0.018
 COURT-0.017
icho-0.016
 Garg-0.016
emouth-0.016
Token,
Token ID11
Feature activation+5.07e-02

Pos logit contributions
PB+0.029
fters+0.028
 cancellation+0.028
Lead+0.027
Writer+0.026
Neg logit contributions
orf-0.028
 COURT-0.027
icho-0.026
 Garg-0.026
emouth-0.026
Token the
Token ID262
Feature activation+2.95e-02

Pos logit contributions
orf+0.033
 COURT+0.031
icho+0.030
emouth+0.030
 Garg+0.030
Neg logit contributions
PB-0.034
fters-0.033
 cancellation-0.033
Lead-0.032
Writer-0.031
Token Garg
Token ID39013
Feature activation+4.19e-01

Pos logit contributions
orf+0.005
 COURT+0.005
icho+0.004
emouth+0.004
 Garg+0.003
Neg logit contributions
PB-0.033
fters-0.033
 cancellation-0.033
Lead-0.032
Writer-0.032
Tokenant
Token ID415
Feature activation-1.49e-01

Pos logit contributions
orf+0.304
 COURT+0.296
icho+0.285
emouth+0.281
 Garg+0.280
Neg logit contributions
PB-0.241
fters-0.236
 cancellation-0.231
Lead-0.228
Writer-0.219
Tokenua
Token ID6413
Feature activation-1.39e-01

Pos logit contributions
PB+0.071
fters+0.070
 cancellation+0.068
Lead+0.066
Writer+0.064
Neg logit contributions
orf-0.124
 COURT-0.121
icho-0.116
 Garg-0.115
emouth-0.115
Token feels
Token ID5300
Feature activation+6.20e-02

Pos logit contributions
PB+0.097
fters+0.095
 cancellation+0.094
Lead+0.093
Writer+0.090
Neg logit contributions
orf-0.085
 COURT-0.081
icho-0.078
emouth-0.077
 Garg-0.077
Token like
Token ID588
Feature activation+4.18e-02

Pos logit contributions
orf+0.040
 COURT+0.038
icho+0.037
emouth+0.036
 Garg+0.036
Neg logit contributions
PB-0.041
fters-0.041
 cancellation-0.040
Lead-0.039
Writer-0.038
Token it
Token ID340
Feature activation+1.70e-02

Pos logit contributions
orf+0.026
 COURT+0.024
icho+0.023
 Garg+0.023
emouth+0.023
Neg logit contributions
PB-0.029
fters-0.029
 cancellation-0.028
Lead-0.028
Writer-0.027
Token
Token ID198
Feature activation-5.52e-03

Pos logit contributions
orf+0.005
 COURT+0.005
icho+0.005
emouth+0.004
 Garg+0.004
Neg logit contributions
PB-0.005
fters-0.005
Lead-0.005
 cancellation-0.005
Writer-0.005
TokenCan
Token ID6090
Feature activation-9.78e-02

Pos logit contributions
PB+0.003
fters+0.003
 cancellation+0.003
Lead+0.003
Writer+0.003
Neg logit contributions
orf-0.004
 COURT-0.004
icho-0.004
 Garg-0.004
emouth-0.004
Token Gabriel
Token ID17371
Feature activation-1.04e-01

Pos logit contributions
PB+0.066
fters+0.065
 cancellation+0.064
Lead+0.063
Writer+0.061
Neg logit contributions
orf-0.061
 COURT-0.059
icho-0.057
emouth-0.056
 Garg-0.056
Token "
Token ID366
Feature activation-3.90e-02

Pos logit contributions
PB+0.064
fters+0.063
 cancellation+0.063
Lead+0.062
Writer+0.059
Neg logit contributions
orf-0.071
 COURT-0.069
icho-0.066
emouth-0.065
 Garg-0.065
TokenF
Token ID37
Feature activation+8.11e-02

Pos logit contributions
PB+0.025
fters+0.024
Lead+0.024
 cancellation+0.023
Writer+0.023
Neg logit contributions
orf-0.026
 COURT-0.026
icho-0.024
emouth-0.024
 Garg-0.024
Tokenalle
Token ID6765
Feature activation+4.18e-01

Pos logit contributions
orf+0.045
 COURT+0.043
icho+0.040
 Garg+0.040
emouth+0.040
Neg logit contributions
PB-0.061
fters-0.060
 cancellation-0.059
Lead-0.059
Writer-0.057
TokenN
Token ID45
Feature activation+3.69e-02

Pos logit contributions
orf+0.258
 COURT+0.249
icho+0.238
emouth+0.236
 Garg+0.236
Neg logit contributions
PB-0.286
fters-0.283
 cancellation-0.278
Lead-0.275
Writer-0.265
Token"
Token ID1
Feature activation-2.28e-02

Pos logit contributions
orf+0.028
 COURT+0.028
icho+0.027
 Garg+0.026
emouth+0.026
Neg logit contributions
PB-0.020
fters-0.019
 cancellation-0.019
Lead-0.019
Writer-0.018
Token Toledo
Token ID36026
Feature activation-1.24e-01

Pos logit contributions
PB+0.012
fters+0.012
 cancellation+0.011
Lead+0.011
Writer+0.011
Neg logit contributions
orf-0.018
 COURT-0.017
icho-0.017
 Garg-0.017
emouth-0.016
Token and
Token ID290
Feature activation-3.29e-02

Pos logit contributions
PB+0.084
fters+0.083
 cancellation+0.082
Lead+0.081
Writer+0.078
Neg logit contributions
orf-0.077
 COURT-0.075
icho-0.072
emouth-0.070
 Garg-0.070
Token company
Token ID1664
Feature activation+7.80e-02

Pos logit contributions
PB+0.021
fters+0.021
 cancellation+0.021
Lead+0.020
Writer+0.020
Neg logit contributions
orf-0.022
 COURT-0.021
icho-0.020
emouth-0.020
 Garg-0.020
Token Cup
Token ID5454
Feature activation+1.21e-01

Pos logit contributions
orf+0.012
 COURT+0.012
icho+0.011
 Garg+0.011
emouth+0.011
Neg logit contributions
PB-0.016
fters-0.015
 cancellation-0.015
Lead-0.015
Writer-0.015
Token tie
Token ID9839
Feature activation+1.31e-01

Pos logit contributions
orf+0.084
 COURT+0.081
icho+0.078
 Garg+0.077
emouth+0.077
Neg logit contributions
PB-0.074
fters-0.073
 cancellation-0.072
Lead-0.070
Writer-0.068
Token at
Token ID379
Feature activation+3.90e-02

Pos logit contributions
orf+0.084
 COURT+0.080
icho+0.077
 Garg+0.076
emouth+0.076
Neg logit contributions
PB-0.087
fters-0.086
 cancellation-0.084
Lead-0.083
Writer-0.080
Token B
Token ID347
Feature activation+4.83e-02

Pos logit contributions
orf+0.021
 COURT+0.021
icho+0.020
emouth+0.019
 Garg+0.019
Neg logit contributions
PB-0.029
fters-0.029
 cancellation-0.029
Lead-0.028
Writer-0.027
Tokenourn
Token ID1798
Feature activation-6.13e-02

Pos logit contributions
orf+0.038
 COURT+0.037
icho+0.036
 Garg+0.036
emouth+0.035
Neg logit contributions
PB-0.025
fters-0.024
 cancellation-0.024
Lead-0.023
Writer-0.022
Tokenemouth
Token ID46880
Feature activation+4.15e-01

Pos logit contributions
PB+0.072
fters+0.072
 cancellation+0.071
Lead+0.071
Animation+0.069
Neg logit contributions
orf-0.007
 COURT-0.006
icho-0.004
 Garg-0.003
oled-0.002
Token.
Token ID13
Feature activation-8.55e-03

Pos logit contributions
orf+0.276
 COURT+0.265
icho+0.255
 Garg+0.251
emouth+0.251
Neg logit contributions
PB-0.265
fters-0.262
 cancellation-0.257
Lead-0.253
Writer-0.244
Token
Token ID564
Feature activation-1.19e-02

Pos logit contributions
PB+0.006
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.005
Neg logit contributions
orf-0.006
 COURT-0.005
icho-0.005
emouth-0.005
 Garg-0.005
Token
Token ID250
Feature activation+2.97e-02

Pos logit contributions
PB+0.007
fters+0.007
 cancellation+0.007
Lead+0.007
Writer+0.007
Neg logit contributions
orf-0.008
 COURT-0.008
icho-0.008
 Garg-0.008
emouth-0.008
TokenWe
Token ID1135
Feature activation+1.44e-02

Pos logit contributions
orf+0.020
 COURT+0.019
icho+0.018
 Garg+0.018
emouth+0.018
Neg logit contributions
PB-0.019
fters-0.019
 cancellation-0.019
Lead-0.018
Writer-0.018
Token went
Token ID1816
Feature activation+2.92e-02

Pos logit contributions
orf+0.009
 COURT+0.009
icho+0.008
 Garg+0.008
emouth+0.008
Neg logit contributions
PB-0.010
fters-0.010
 cancellation-0.009
Lead-0.009
Writer-0.009
Token on
Token ID319
Feature activation-1.03e-02

Pos logit contributions
orf+0.029
 COURT+0.028
icho+0.026
 Garg+0.026
emouth+0.026
Neg logit contributions
PB-0.028
fters-0.027
 cancellation-0.027
Lead-0.027
Writer-0.026
Token the
Token ID262
Feature activation+1.77e-02

Pos logit contributions
PB+0.007
fters+0.007
 cancellation+0.007
Lead+0.007
Writer+0.006
Neg logit contributions
orf-0.007
 COURT-0.006
icho-0.006
 Garg-0.006
emouth-0.006
Token flip
Token ID14283
Feature activation-5.41e-02

Pos logit contributions
orf+0.012
 COURT+0.011
icho+0.011
emouth+0.011
 Garg+0.011
Neg logit contributions
PB-0.011
fters-0.011
 cancellation-0.011
Lead-0.011
Writer-0.010
Token side
Token ID1735
Feature activation+6.21e-02

Pos logit contributions
PB+0.037
fters+0.037
 cancellation+0.036
Lead+0.036
Writer+0.034
Neg logit contributions
orf-0.033
 COURT-0.032
icho-0.031
 Garg-0.030
emouth-0.030
Token the
Token ID262
Feature activation+3.48e-02

Pos logit contributions
orf+0.042
 COURT+0.040
icho+0.038
 Garg+0.038
emouth+0.038
Neg logit contributions
PB-0.039
fters-0.039
 cancellation-0.038
Lead-0.038
Writer-0.036
Token Garg
Token ID39013
Feature activation+4.14e-01

Pos logit contributions
orf+0.007
 COURT+0.006
icho+0.006
emouth+0.005
 Garg+0.005
Neg logit contributions
PB-0.038
fters-0.038
 cancellation-0.037
Lead-0.037
Writer-0.036
Tokenant
Token ID415
Feature activation-1.49e-01

Pos logit contributions
orf+0.303
 COURT+0.297
icho+0.285
emouth+0.280
 Garg+0.280
Neg logit contributions
PB-0.235
fters-0.230
 cancellation-0.225
Lead-0.223
Writer-0.213
Tokenua
Token ID6413
Feature activation-1.29e-01

Pos logit contributions
PB+0.070
fters+0.069
 cancellation+0.067
Lead+0.065
Writer+0.062
Neg logit contributions
orf-0.123
 COURT-0.121
icho-0.116
 Garg-0.115
emouth-0.115
Token is
Token ID318
Feature activation+3.67e-02

Pos logit contributions
PB+0.087
fters+0.086
 cancellation+0.085
Lead+0.083
Writer+0.081
Neg logit contributions
orf-0.081
 COURT-0.077
icho-0.074
emouth-0.073
 Garg-0.073
Token a
Token ID257
Feature activation+1.85e-02

Pos logit contributions
orf+0.024
 COURT+0.023
icho+0.022
emouth+0.022
 Garg+0.022
Neg logit contributions
PB-0.024
fters-0.024
 cancellation-0.023
Lead-0.023
Writer-0.022
Token really
Token ID1107
Feature activation+2.91e-02

Pos logit contributions
orf+0.012
 COURT+0.011
icho+0.011
emouth+0.011
 Garg+0.011
Neg logit contributions
PB-0.012
fters-0.012
 cancellation-0.012
Lead-0.012
Writer-0.011
Token the
Token ID262
Feature activation+7.00e-03

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.004
Token visit
Token ID3187
Feature activation+1.56e-01

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.004
emouth+0.004
 Garg+0.004
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.004
Writer-0.004
Token of
Token ID286
Feature activation-1.08e-02

Pos logit contributions
orf+0.101
 COURT+0.097
icho+0.093
 Garg+0.092
emouth+0.092
Neg logit contributions
PB-0.102
fters-0.101
 cancellation-0.098
Lead-0.097
Writer-0.094
Token B
Token ID347
Feature activation+5.38e-02

Pos logit contributions
PB+0.008
fters+0.008
 cancellation+0.008
Lead+0.008
Writer+0.008
Neg logit contributions
orf-0.006
 COURT-0.006
icho-0.005
 Garg-0.005
emouth-0.005
Tokenourn
Token ID1798
Feature activation-1.22e-01

Pos logit contributions
orf+0.043
 COURT+0.041
icho+0.040
 Garg+0.040
emouth+0.039
Neg logit contributions
PB-0.028
fters-0.027
 cancellation-0.026
Lead-0.026
Writer-0.025
Tokenemouth
Token ID46880
Feature activation+4.08e-01

Pos logit contributions
PB+0.145
fters+0.144
 cancellation+0.143
Lead+0.142
Animation+0.138
Neg logit contributions
orf-0.012
 COURT-0.011
icho-0.007
 Garg-0.005
oled-0.003
Token to
Token ID284
Feature activation-2.74e-03

Pos logit contributions
orf+0.257
 COURT+0.247
icho+0.237
 Garg+0.233
emouth+0.232
Neg logit contributions
PB-0.275
fters-0.271
 cancellation-0.267
Lead-0.263
Writer-0.254
Token The
Token ID383
Feature activation+4.65e-02

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
emouth-0.002
 Garg-0.002
Token Hawth
Token ID41457
Feature activation-1.79e-01

Pos logit contributions
orf+0.036
 COURT+0.035
icho+0.034
emouth+0.033
 Garg+0.033
Neg logit contributions
PB-0.025
fters-0.024
 cancellation-0.024
Lead-0.023
Writer-0.022
Tokenorns
Token ID19942
Feature activation+9.42e-02

Pos logit contributions
PB+0.146
fters+0.146
 cancellation+0.142
Lead+0.141
Writer+0.137
Neg logit contributions
orf-0.086
 COURT-0.082
icho-0.076
 Garg-0.075
emouth-0.074
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
orf+0.061
 COURT+0.059
icho+0.056
 Garg+0.055
emouth+0.055
Neg logit contributions
PB-0.062
fters-0.061
 cancellation-0.060
Lead-0.059
Writer-0.057
Token plastics
Token ID42257
Feature activation-7.37e-03

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.004
Token listed
Token ID5610
Feature activation-7.05e-02

Pos logit contributions
PB+0.005
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.005
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.004
emouth-0.004
 Garg-0.004
Token here
Token ID994
Feature activation+2.27e-02

Pos logit contributions
PB+0.047
fters+0.046
 cancellation+0.045
Lead+0.045
Writer+0.043
Neg logit contributions
orf-0.045
 COURT-0.043
icho-0.042
 Garg-0.041
emouth-0.041
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
orf+0.015
 COURT+0.015
icho+0.014
 Garg+0.014
emouth+0.014
Neg logit contributions
PB-0.015
fters-0.014
 cancellation-0.014
Lead-0.014
Writer-0.013
Token the
Token ID262
Feature activation-8.82e-03

Pos logit contributions
PB+0.012
fters+0.012
 cancellation+0.012
Lead+0.012
Writer+0.011
Neg logit contributions
orf-0.012
 COURT-0.011
icho-0.011
 Garg-0.010
emouth-0.010
Token Garg
Token ID39013
Feature activation+4.04e-01

Pos logit contributions
PB+0.009
fters+0.009
 cancellation+0.008
Lead+0.008
Writer+0.008
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.002
emouth-0.002
 Garg-0.002
Tokenant
Token ID415
Feature activation-1.57e-01

Pos logit contributions
orf+0.294
 COURT+0.287
icho+0.276
emouth+0.272
 Garg+0.272
Neg logit contributions
PB-0.231
fters-0.226
 cancellation-0.222
Lead-0.219
Writer-0.210
Tokenua
Token ID6413
Feature activation-1.50e-01

Pos logit contributions
PB+0.075
fters+0.073
 cancellation+0.071
Lead+0.069
Writer+0.067
Neg logit contributions
orf-0.130
 COURT-0.127
icho-0.122
 Garg-0.121
emouth-0.121
Token is
Token ID318
Feature activation-1.67e-03

Pos logit contributions
PB+0.105
fters+0.104
 cancellation+0.102
Lead+0.101
Writer+0.097
Neg logit contributions
orf-0.091
 COURT-0.087
icho-0.084
emouth-0.082
 Garg-0.082
Token probably
Token ID2192
Feature activation-1.53e-01

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
 Garg-0.001
emouth-0.001
Token one
Token ID530
Feature activation-1.51e-02

Pos logit contributions
PB+0.102
fters+0.101
 cancellation+0.099
Lead+0.098
Writer+0.094
Neg logit contributions
orf-0.098
 COURT-0.094
icho-0.090
 Garg-0.089
emouth-0.088
Tokenulo
Token ID43348
Feature activation-2.90e-02

Pos logit contributions
PB+0.090
fters+0.089
 cancellation+0.087
Lead+0.085
Writer+0.082
Neg logit contributions
orf-0.099
 COURT-0.095
icho-0.091
 Garg-0.091
emouth-0.090
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
PB+0.019
fters+0.018
 cancellation+0.018
Lead+0.018
Writer+0.017
Neg logit contributions
orf-0.019
 COURT-0.018
icho-0.018
emouth-0.017
 Garg-0.017
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
PB+0.018
fters+0.018
 cancellation+0.018
Lead+0.017
Writer+0.017
Neg logit contributions
orf-0.017
 COURT-0.016
icho-0.015
emouth-0.015
alle-0.015
Token
Token ID198
Feature activation-3.28e-03

Pos logit contributions
PB+0.008
fters+0.007
Lead+0.007
 cancellation+0.007
Writer+0.007
Neg logit contributions
orf-0.007
 COURT-0.007
icho-0.006
emouth-0.006
 Garg-0.006
TokenTom
Token ID13787
Feature activation+1.66e-02

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
 Garg-0.002
emouth-0.002
Token Garg
Token ID39013
Feature activation+4.04e-01

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.017
fters-0.017
 cancellation-0.017
Lead-0.017
Writer-0.016
Tokeni
Token ID72
Feature activation-1.43e-01

Pos logit contributions
orf+0.292
 COURT+0.286
icho+0.273
emouth+0.270
 Garg+0.269
Neg logit contributions
PB-0.232
fters-0.228
 cancellation-0.222
Lead-0.221
Writer-0.214
Tokenulo
Token ID43348
Feature activation-1.93e-02

Pos logit contributions
PB+0.084
fters+0.082
 cancellation+0.080
Lead+0.080
Writer+0.076
Neg logit contributions
orf-0.103
 COURT-0.100
icho-0.096
emouth-0.096
 Garg-0.095
Token:
Token ID25
Feature activation-1.01e-02

Pos logit contributions
PB+0.013
fters+0.013
 cancellation+0.013
Lead+0.013
Writer+0.012
Neg logit contributions
orf-0.012
 COURT-0.011
icho-0.011
 Garg-0.011
emouth-0.011
Token I
Token ID314
Feature activation-5.09e-02

Pos logit contributions
PB+0.007
fters+0.007
 cancellation+0.007
Lead+0.007
Writer+0.006
Neg logit contributions
orf-0.006
 COURT-0.006
icho-0.006
emouth-0.006
 Garg-0.006
Token think
Token ID892
Feature activation-1.99e-02

Pos logit contributions
PB+0.036
fters+0.035
 cancellation+0.035
Lead+0.034
Writer+0.033
Neg logit contributions
orf-0.031
 COURT-0.029
icho-0.028
emouth-0.028
 Garg-0.028
Token really
Token ID1107
Feature activation+3.52e-02

Pos logit contributions
orf+0.054
 COURT+0.051
icho+0.049
emouth+0.048
 Garg+0.048
Neg logit contributions
PB-0.061
fters-0.061
 cancellation-0.060
Lead-0.059
Writer-0.057
Token want
Token ID765
Feature activation+4.83e-02

Pos logit contributions
orf+0.023
 COURT+0.022
icho+0.021
emouth+0.021
 Garg+0.020
Neg logit contributions
PB-0.023
fters-0.023
 cancellation-0.023
Lead-0.022
Writer-0.021
Token anti
Token ID3098
Feature activation+5.56e-03

Pos logit contributions
orf+0.032
 COURT+0.030
icho+0.029
 Garg+0.029
emouth+0.029
Neg logit contributions
PB-0.031
fters-0.031
 cancellation-0.030
Lead-0.030
Writer-0.029
Token-
Token ID12
Feature activation+4.89e-02

Pos logit contributions
orf+0.004
 COURT+0.003
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.003
Tokenm
Token ID76
Feature activation-3.22e-02

Pos logit contributions
orf+0.035
 COURT+0.034
icho+0.032
 Garg+0.032
emouth+0.032
Neg logit contributions
PB-0.029
fters-0.028
 cancellation-0.028
Lead-0.027
Writer-0.026
Tokenalle
Token ID6765
Feature activation+3.99e-01

Pos logit contributions
PB+0.023
fters+0.023
 cancellation+0.022
Lead+0.022
Writer+0.021
Neg logit contributions
orf-0.019
 COURT-0.018
emouth-0.017
 Garg-0.017
icho-0.017
Tokenability
Token ID1799
Feature activation-1.53e-01

Pos logit contributions
orf+0.312
 COURT+0.302
icho+0.292
 Garg+0.289
emouth+0.289
Neg logit contributions
PB-0.209
fters-0.205
 cancellation-0.200
Lead-0.196
Writer-0.188
Token fixes
Token ID13040
Feature activation-8.87e-02

Pos logit contributions
PB+0.095
fters+0.094
 cancellation+0.093
Lead+0.090
Writer+0.087
Neg logit contributions
orf-0.104
 COURT-0.100
icho-0.096
emouth-0.095
 Garg-0.095
Token (
Token ID357
Feature activation+2.34e-02

Pos logit contributions
PB+0.057
fters+0.057
 cancellation+0.056
Lead+0.055
Writer+0.053
Neg logit contributions
orf-0.058
 COURT-0.056
icho-0.054
 Garg-0.053
emouth-0.053
Tokenwhich
Token ID4758
Feature activation+1.91e-02

Pos logit contributions
orf+0.015
 COURT+0.014
icho+0.014
 Garg+0.014
emouth+0.014
Neg logit contributions
PB-0.016
fters-0.015
 cancellation-0.015
Lead-0.015
Writer-0.014
Token would
Token ID561
Feature activation+1.43e-02

Pos logit contributions
orf+0.012
 COURT+0.011
icho+0.011
 Garg+0.010
emouth+0.010
Neg logit contributions
PB-0.013
fters-0.013
 cancellation-0.013
Lead-0.013
Writer-0.012
Token Under
Token ID4698
Feature activation-5.96e-02

Pos logit contributions
PB+0.021
fters+0.020
 cancellation+0.020
Lead+0.020
Writer+0.019
Neg logit contributions
orf-0.020
 COURT-0.019
icho-0.018
emouth-0.018
 Garg-0.018
Token Mr
Token ID1770
Feature activation-3.66e-02

Pos logit contributions
PB+0.040
fters+0.039
 cancellation+0.039
Lead+0.038
Writer+0.037
Neg logit contributions
orf-0.038
 COURT-0.036
icho-0.035
emouth-0.034
 Garg-0.034
Token.
Token ID13
Feature activation-2.02e-02

Pos logit contributions
PB+0.021
fters+0.021
Lead+0.020
 cancellation+0.020
Writer+0.020
Neg logit contributions
orf-0.026
 COURT-0.025
icho-0.025
emouth-0.025
 Garg-0.024
Token Ram
Token ID7431
Feature activation-9.36e-03

Pos logit contributions
PB+0.014
fters+0.013
 cancellation+0.013
Lead+0.013
Writer+0.012
Neg logit contributions
orf-0.013
 COURT-0.012
icho-0.012
emouth-0.012
 Garg-0.011
Tokení
Token ID8836
Feature activation+4.46e-02

Pos logit contributions
PB+0.006
fters+0.006
 cancellation+0.006
Lead+0.006
Writer+0.006
Neg logit contributions
orf-0.006
 COURT-0.006
icho-0.005
 Garg-0.005
emouth-0.005
Tokenrez
Token ID21107
Feature activation+3.97e-01

Pos logit contributions
orf+0.012
 COURT+0.012
icho+0.010
 Garg+0.010
emouth+0.010
Neg logit contributions
PB-0.045
fters-0.045
 cancellation-0.044
Lead-0.044
Writer-0.043
Token,
Token ID11
Feature activation+1.05e-02

Pos logit contributions
orf+0.255
 COURT+0.245
icho+0.235
 Garg+0.232
emouth+0.232
Neg logit contributions
PB-0.263
fters-0.259
 cancellation-0.254
Lead-0.251
Writer-0.242
Token PD
Token ID14340
Feature activation+3.89e-02

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.007
emouth+0.006
 Garg+0.006
Neg logit contributions
PB-0.007
fters-0.006
 cancellation-0.006
Lead-0.006
Writer-0.006
TokenV
Token ID53
Feature activation+7.49e-02

Pos logit contributions
orf+0.028
 COURT+0.027
icho+0.026
emouth+0.026
 Garg+0.026
Neg logit contributions
PB-0.023
fters-0.022
 cancellation-0.022
Lead-0.022
Writer-0.021
TokenSA
Token ID4090
Feature activation-1.22e-01

Pos logit contributions
orf+0.073
 COURT+0.071
icho+0.069
 Garg+0.069
emouth+0.069
Neg logit contributions
PB-0.026
fters-0.024
Lead-0.023
 cancellation-0.023
Writer-0.021
Token has
Token ID468
Feature activation+1.16e-04

Pos logit contributions
PB+0.083
fters+0.082
 cancellation+0.081
Lead+0.080
Writer+0.077
Neg logit contributions
orf-0.076
 COURT-0.073
icho-0.070
emouth-0.069
 Garg-0.068
Token
Token ID247
Feature activation+1.91e-02

Pos logit contributions
orf+0.015
 COURT+0.015
icho+0.015
emouth+0.014
 Garg+0.014
Neg logit contributions
PB-0.010
fters-0.010
 cancellation-0.009
Lead-0.009
Writer-0.009
Tokent
Token ID83
Feature activation+2.73e-03

Pos logit contributions
orf+0.013
 COURT+0.012
icho+0.012
emouth+0.012
 Garg+0.012
Neg logit contributions
PB-0.012
fters-0.012
 cancellation-0.012
Lead-0.012
Writer-0.011
Token be
Token ID307
Feature activation+8.14e-03

Pos logit contributions
orf+0.002
 COURT+0.002
icho+0.002
emouth+0.002
 Garg+0.002
Neg logit contributions
PB-0.002
fters-0.002
 cancellation-0.002
Lead-0.002
Writer-0.002
Token one
Token ID530
Feature activation+1.95e-02

Pos logit contributions
orf+0.005
 COURT+0.005
icho+0.005
 Garg+0.005
emouth+0.005
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.005
Token dec
Token ID875
Feature activation-2.00e-01

Pos logit contributions
orf+0.013
 COURT+0.012
icho+0.012
emouth+0.011
 Garg+0.011
Neg logit contributions
PB-0.013
fters-0.013
 cancellation-0.012
Lead-0.012
Writer-0.012
Tokenibel
Token ID43837
Feature activation+3.94e-01

Pos logit contributions
PB+0.132
fters+0.130
 cancellation+0.127
Lead+0.125
Writer+0.121
Neg logit contributions
orf-0.129
 COURT-0.124
icho-0.119
emouth-0.117
 Garg-0.117
Token softer
Token ID32359
Feature activation-8.61e-02

Pos logit contributions
orf+0.226
 COURT+0.216
icho+0.207
emouth+0.203
 Garg+0.203
Neg logit contributions
PB-0.288
fters-0.285
 cancellation-0.280
Lead-0.276
Writer-0.268
Token,
Token ID11
Feature activation+1.24e-02

Pos logit contributions
PB+0.055
fters+0.054
 cancellation+0.053
Lead+0.053
Writer+0.051
Neg logit contributions
orf-0.057
 COURT-0.055
icho-0.053
 Garg-0.052
emouth-0.052
Token
Token ID447
Feature activation+3.31e-02

Pos logit contributions
orf+0.009
 COURT+0.008
icho+0.008
emouth+0.008
 Garg+0.008
Neg logit contributions
PB-0.008
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.007
Token
Token ID251
Feature activation+3.04e-02

Pos logit contributions
orf+0.025
 COURT+0.024
icho+0.023
 Garg+0.023
emouth+0.023
Neg logit contributions
PB-0.018
fters-0.018
 cancellation-0.018
Lead-0.017
Writer-0.017
Token he
Token ID339
Feature activation+2.14e-02

Pos logit contributions
orf+0.021
 COURT+0.020
icho+0.019
 Garg+0.019
emouth+0.019
Neg logit contributions
PB-0.019
fters-0.019
 cancellation-0.019
Lead-0.018
Writer-0.018
Token cent
Token ID1247
Feature activation-6.12e-02

Pos logit contributions
PB+0.049
fters+0.048
 cancellation+0.047
Lead+0.047
Writer+0.045
Neg logit contributions
orf-0.047
 COURT-0.045
icho-0.044
 Garg-0.043
emouth-0.043
Tokenering
Token ID1586
Feature activation-8.25e-03

Pos logit contributions
PB+0.041
fters+0.040
 cancellation+0.039
Lead+0.039
Writer+0.038
Neg logit contributions
orf-0.039
 COURT-0.038
icho-0.036
emouth-0.036
 Garg-0.036
Token
Token ID198
Feature activation-2.42e-03

Pos logit contributions
PB+0.006
fters+0.006
 cancellation+0.006
Lead+0.006
Writer+0.005
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.005
 Garg-0.005
emouth-0.005
Token
Token ID198
Feature activation+1.13e-03

Pos logit contributions
PB+0.002
fters+0.002
Lead+0.002
 cancellation+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.001
icho-0.001
emouth-0.001
 Garg-0.001
TokenThe
Token ID464
Feature activation-3.91e-02

Pos logit contributions
orf+0.001
 COURT+0.001
icho+0.001
emouth+0.001
 Garg+0.001
Neg logit contributions
PB-0.001
fters-0.001
 cancellation-0.001
Lead-0.001
Writer-0.001
Token Garg
Token ID39013
Feature activation+3.92e-01

Pos logit contributions
PB+0.038
fters+0.038
 cancellation+0.037
Lead+0.037
Writer+0.036
Neg logit contributions
orf-0.013
 COURT-0.012
icho-0.011
emouth-0.011
 Garg-0.010
Tokenant
Token ID415
Feature activation-1.70e-01

Pos logit contributions
orf+0.285
 COURT+0.277
icho+0.266
emouth+0.262
 Garg+0.262
Neg logit contributions
PB-0.225
fters-0.222
 cancellation-0.217
Lead-0.214
Writer-0.205
Tokenua
Token ID6413
Feature activation-1.57e-01

Pos logit contributions
PB+0.082
fters+0.080
 cancellation+0.078
Lead+0.076
Writer+0.073
Neg logit contributions
orf-0.140
 COURT-0.136
icho-0.131
 Garg-0.130
emouth-0.130
Token is
Token ID318
Feature activation-6.18e-03

Pos logit contributions
PB+0.109
fters+0.107
 cancellation+0.105
Lead+0.104
Writer+0.100
Neg logit contributions
orf-0.097
 COURT-0.093
icho-0.089
emouth-0.088
 Garg-0.087
Token the
Token ID262
Feature activation-1.89e-02

Pos logit contributions
PB+0.004
fters+0.004
 cancellation+0.004
Lead+0.004
Writer+0.004
Neg logit contributions
orf-0.004
 COURT-0.004
icho-0.004
emouth-0.004
 Garg-0.004
Token first
Token ID717
Feature activation-2.29e-02

Pos logit contributions
PB+0.012
fters+0.012
 cancellation+0.012
Lead+0.012
Writer+0.011
Neg logit contributions
orf-0.012
 COURT-0.012
icho-0.011
emouth-0.011
 Garg-0.011
Token that
Token ID326
Feature activation+5.98e-03

Pos logit contributions
orf+0.020
 COURT+0.019
icho+0.018
emouth+0.018
 Garg+0.018
Neg logit contributions
PB-0.020
fters-0.019
 cancellation-0.019
Lead-0.019
Writer-0.018
Token question
Token ID1808
Feature activation+4.17e-02

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.004
Token to
Token ID284
Feature activation+5.99e-04

Pos logit contributions
orf+0.027
 COURT+0.026
icho+0.025
emouth+0.025
 Garg+0.025
Neg logit contributions
PB-0.027
fters-0.027
 cancellation-0.026
Lead-0.026
Writer-0.025
Token investor
Token ID15811
Feature activation-8.87e-02

Pos logit contributions
orf+0.000
 COURT+0.000
icho+0.000
emouth+0.000
 Garg+0.000
Neg logit contributions
PB-0.000
fters-0.000
 cancellation-0.000
Lead-0.000
Writer-0.000
Token Tom
Token ID4186
Feature activation-5.55e-02

Pos logit contributions
PB+0.057
fters+0.056
 cancellation+0.055
Lead+0.054
Writer+0.052
Neg logit contributions
orf-0.059
 COURT-0.056
icho-0.054
emouth-0.054
 Garg-0.053
Token Garg
Token ID39013
Feature activation+3.88e-01

Pos logit contributions
PB+0.036
fters+0.035
 cancellation+0.035
Lead+0.034
Writer+0.033
Neg logit contributions
orf-0.036
 COURT-0.035
icho-0.034
emouth-0.033
 Garg-0.033
Tokeni
Token ID72
Feature activation-1.44e-01

Pos logit contributions
orf+0.245
 COURT+0.238
icho+0.227
emouth+0.224
 Garg+0.223
Neg logit contributions
PB-0.260
fters-0.255
 cancellation-0.251
Lead-0.248
Writer-0.240
Tokenulo
Token ID43348
Feature activation-2.90e-02

Pos logit contributions
PB+0.090
fters+0.089
 cancellation+0.087
Lead+0.085
Writer+0.082
Neg logit contributions
orf-0.099
 COURT-0.095
icho-0.091
 Garg-0.091
emouth-0.090
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
PB+0.019
fters+0.018
 cancellation+0.018
Lead+0.018
Writer+0.017
Neg logit contributions
orf-0.019
 COURT-0.018
icho-0.018
emouth-0.017
 Garg-0.017
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
PB+0.018
fters+0.018
 cancellation+0.018
Lead+0.017
Writer+0.017
Neg logit contributions
orf-0.017
 COURT-0.016
icho-0.015
emouth-0.015
alle-0.015
Token
Token ID198
Feature activation-3.28e-03

Pos logit contributions
PB+0.008
fters+0.007
Lead+0.007
 cancellation+0.007
Writer+0.007
Neg logit contributions
orf-0.007
 COURT-0.007
icho-0.006
emouth-0.006
 Garg-0.006
TokenThe
Token ID464
Feature activation-1.42e-02

Pos logit contributions
orf+0.006
 COURT+0.006
icho+0.006
 Garg+0.005
emouth+0.005
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.004
Token dilemma
Token ID26566
Feature activation+8.34e-02

Pos logit contributions
PB+0.010
fters+0.010
 cancellation+0.009
Lead+0.009
Writer+0.009
Neg logit contributions
orf-0.009
 COURT-0.008
icho-0.008
emouth-0.008
 Garg-0.008
Token is
Token ID318
Feature activation+1.72e-02

Pos logit contributions
orf+0.052
 COURT+0.050
icho+0.048
emouth+0.047
 Garg+0.047
Neg logit contributions
PB-0.057
fters-0.056
 cancellation-0.055
Lead-0.054
Writer-0.052
Token that
Token ID326
Feature activation-1.12e-02

Pos logit contributions
orf+0.011
 COURT+0.011
icho+0.011
emouth+0.010
 Garg+0.010
Neg logit contributions
PB-0.011
fters-0.011
 cancellation-0.011
Lead-0.011
Writer-0.010
Token the
Token ID262
Feature activation+2.07e-03

Pos logit contributions
PB+0.007
fters+0.007
 cancellation+0.007
Lead+0.007
Writer+0.007
Neg logit contributions
orf-0.007
 COURT-0.007
icho-0.007
emouth-0.007
 Garg-0.007
Token Garg
Token ID39013
Feature activation+3.88e-01

Pos logit contributions
orf+0.001
 COURT+0.001
icho+0.001
emouth+0.001
 Garg+0.001
Neg logit contributions
PB-0.001
fters-0.001
 cancellation-0.001
Lead-0.001
Writer-0.001
Tokenant
Token ID415
Feature activation-1.73e-01

Pos logit contributions
orf+0.283
 COURT+0.277
icho+0.266
emouth+0.262
 Garg+0.261
Neg logit contributions
PB-0.221
fters-0.217
 cancellation-0.212
Lead-0.210
Writer-0.200
Tokenua
Token ID6413
Feature activation-1.66e-01

Pos logit contributions
PB+0.082
fters+0.080
 cancellation+0.078
Lead+0.076
Writer+0.073
Neg logit contributions
orf-0.143
 COURT-0.140
icho-0.135
 Garg-0.134
emouth-0.133
Token
Token ID447
Feature activation-5.79e-02

Pos logit contributions
PB+0.115
fters+0.113
 cancellation+0.111
Lead+0.110
Writer+0.106
Neg logit contributions
orf-0.103
 COURT-0.098
icho-0.094
emouth-0.093
 Garg-0.092
Token
Token ID247
Feature activation-6.03e-02

Pos logit contributions
PB+0.032
fters+0.031
 cancellation+0.031
Lead+0.030
Writer+0.029
Neg logit contributions
orf-0.044
 COURT-0.042
icho-0.041
emouth-0.040
 Garg-0.040
Tokens
Token ID82
Feature activation-1.82e-02

Pos logit contributions
PB+0.045
fters+0.044
 cancellation+0.043
Lead+0.043
Writer+0.041
Neg logit contributions
orf-0.034
 COURT-0.033
icho-0.031
 Garg-0.031
emouth-0.031
Token
Token ID198
Feature activation-1.48e-03

Pos logit contributions
PB+0.006
fters+0.006
Lead+0.006
 cancellation+0.006
Writer+0.006
Neg logit contributions
orf-0.006
 COURT-0.005
icho-0.005
emouth-0.005
 Garg-0.005
TokenY
Token ID56
Feature activation-6.38e-02

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
 Garg-0.001
emouth-0.001
Tokenoy
Token ID726
Feature activation-8.02e-03

Pos logit contributions
PB+0.044
fters+0.043
 cancellation+0.042
Lead+0.042
Writer+0.040
Neg logit contributions
orf-0.040
 COURT-0.038
icho-0.037
 Garg-0.036
emouth-0.036
Tokenore
Token ID382
Feature activation-1.71e-01

Pos logit contributions
PB+0.007
fters+0.006
 cancellation+0.006
Lead+0.006
Writer+0.006
Neg logit contributions
orf-0.004
 COURT-0.004
icho-0.004
 Garg-0.003
emouth-0.003
Tokencreation
Token ID38793
Feature activation+5.03e-02

Pos logit contributions
PB+0.113
fters+0.111
 cancellation+0.109
Lead+0.107
Writer+0.104
Neg logit contributions
orf-0.110
 COURT-0.106
icho-0.102
emouth-0.100
 Garg-0.100
Token Garg
Token ID39013
Feature activation+3.86e-01

Pos logit contributions
orf+0.032
 COURT+0.031
icho+0.030
emouth+0.030
 Garg+0.029
Neg logit contributions
PB-0.033
fters-0.033
 cancellation-0.032
Lead-0.032
Writer-0.031
Tokenant
Token ID415
Feature activation-1.82e-01

Pos logit contributions
orf+0.220
 COURT+0.210
icho+0.201
 Garg+0.198
emouth+0.197
Neg logit contributions
PB-0.284
fters-0.280
 cancellation-0.275
Lead-0.272
Writer-0.264
Tokenua
Token ID6413
Feature activation-1.84e-01

Pos logit contributions
PB+0.103
fters+0.101
 cancellation+0.099
Lead+0.097
Writer+0.093
Neg logit contributions
orf-0.135
 COURT-0.129
icho-0.125
 Garg-0.124
emouth-0.124
Token
Token ID198
Feature activation+1.11e-04

Pos logit contributions
PB+0.121
fters+0.119
 cancellation+0.117
Lead+0.115
Writer+0.111
Neg logit contributions
orf-0.120
 COURT-0.115
icho-0.111
emouth-0.109
 Garg-0.109
Token
Token ID198
Feature activation+4.85e-03

Pos logit contributions
orf+0.000
 COURT+0.000
icho+0.000
emouth+0.000
 Garg+0.000
Neg logit contributions
PB-0.000
fters-0.000
Lead-0.000
 cancellation-0.000
Writer-0.000
TokenD
Token ID35
Feature activation-3.48e-02

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.003
fters-0.003
 cancellation-0.002
Lead-0.002
Writer-0.002
Token;
Token ID26
Feature activation-3.20e-02

Pos logit contributions
PB+0.035
fters+0.035
 cancellation+0.034
Lead+0.034
Writer+0.032
Neg logit contributions
orf-0.035
 COURT-0.034
icho-0.032
emouth-0.032
 Garg-0.032
Token Mr
Token ID1770
Feature activation-3.24e-02

Pos logit contributions
PB+0.021
fters+0.021
 cancellation+0.021
Lead+0.020
Writer+0.020
Neg logit contributions
orf-0.020
 COURT-0.020
icho-0.019
emouth-0.019
 Garg-0.018
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
PB+0.019
fters+0.018
Lead+0.018
 cancellation+0.018
Writer+0.017
Neg logit contributions
orf-0.023
 COURT-0.023
icho-0.022
emouth-0.022
 Garg-0.021
Token Ram
Token ID7431
Feature activation-2.26e-02

Pos logit contributions
PB+0.016
fters+0.016
 cancellation+0.016
Lead+0.015
Writer+0.015
Neg logit contributions
orf-0.015
 COURT-0.014
icho-0.013
emouth-0.013
 Garg-0.013
Tokení
Token ID8836
Feature activation+4.38e-02

Pos logit contributions
PB+0.015
fters+0.015
 cancellation+0.014
Lead+0.014
Writer+0.014
Neg logit contributions
orf-0.014
 COURT-0.014
icho-0.013
 Garg-0.013
emouth-0.013
Tokenrez
Token ID21107
Feature activation+3.85e-01

Pos logit contributions
orf+0.012
 COURT+0.011
icho+0.010
 Garg+0.010
emouth+0.010
Neg logit contributions
PB-0.045
fters-0.044
 cancellation-0.044
Lead-0.043
Writer-0.043
Token also
Token ID635
Feature activation-4.42e-02

Pos logit contributions
orf+0.235
 COURT+0.226
icho+0.216
emouth+0.213
 Garg+0.211
Neg logit contributions
PB-0.268
fters-0.265
 cancellation-0.261
Lead-0.256
Writer-0.248
Token lost
Token ID2626
Feature activation-4.28e-02

Pos logit contributions
PB+0.031
fters+0.031
 cancellation+0.030
Lead+0.030
Writer+0.029
Neg logit contributions
orf-0.026
 COURT-0.025
icho-0.024
emouth-0.024
 Garg-0.024
Token his
Token ID465
Feature activation+1.31e-02

Pos logit contributions
PB+0.029
 cancellation+0.029
fters+0.028
Lead+0.028
Writer+0.027
Neg logit contributions
orf-0.027
 COURT-0.026
icho-0.025
emouth-0.024
 Garg-0.024
Token job
Token ID1693
Feature activation+1.57e-02

Pos logit contributions
orf+0.008
 COURT+0.008
icho+0.008
emouth+0.008
 Garg+0.008
Neg logit contributions
PB-0.009
fters-0.008
 cancellation-0.008
Lead-0.008
Writer-0.008
Token as
Token ID355
Feature activation-2.43e-02

Pos logit contributions
orf+0.010
 COURT+0.010
icho+0.010
emouth+0.009
 Garg+0.009
Neg logit contributions
PB-0.010
fters-0.010
 cancellation-0.010
Lead-0.010
Writer-0.009
Token affairs
Token ID9674
Feature activation-2.66e-01

Pos logit contributions
orf+0.045
 COURT+0.044
icho+0.043
 Garg+0.043
emouth+0.042
Neg logit contributions
PB-0.022
fters-0.021
 cancellation-0.020
Lead-0.020
Writer-0.019
Token,
Token ID11
Feature activation-1.05e-02

Pos logit contributions
PB+0.174
fters+0.171
 cancellation+0.169
Lead+0.165
Writer+0.160
Neg logit contributions
orf-0.174
 COURT-0.167
icho-0.160
emouth-0.160
 Garg-0.157
Token Rafael
Token ID31918
Feature activation+6.15e-02

Pos logit contributions
PB+0.007
fters+0.007
 cancellation+0.007
Lead+0.007
Writer+0.007
Neg logit contributions
orf-0.007
 COURT-0.006
icho-0.006
emouth-0.006
 Garg-0.006
Token Ram
Token ID7431
Feature activation-3.59e-03

Pos logit contributions
orf+0.040
 COURT+0.038
icho+0.037
emouth+0.037
 Garg+0.036
Neg logit contributions
PB-0.040
fters-0.040
 cancellation-0.039
Lead-0.038
Writer-0.037
Tokení
Token ID8836
Feature activation+3.54e-02

Pos logit contributions
PB+0.003
fters+0.003
 cancellation+0.003
Lead+0.003
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
emouth-0.002
 Garg-0.002
Tokenrez
Token ID21107
Feature activation+3.84e-01

Pos logit contributions
orf+0.009
 COURT+0.008
icho+0.007
 Garg+0.007
emouth+0.007
Neg logit contributions
PB-0.037
fters-0.037
 cancellation-0.036
Lead-0.036
Writer-0.035
Token,
Token ID11
Feature activation-1.26e-03

Pos logit contributions
orf+0.258
 COURT+0.248
icho+0.238
emouth+0.235
 Garg+0.235
Neg logit contributions
PB-0.243
fters-0.240
 cancellation-0.235
Lead-0.232
Writer-0.224
Token with
Token ID351
Feature activation-1.75e-02

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
emouth-0.001
 Garg-0.001
Token an
Token ID281
Feature activation+3.56e-02

Pos logit contributions
PB+0.012
fters+0.012
 cancellation+0.012
Lead+0.011
Writer+0.011
Neg logit contributions
orf-0.011
 COURT-0.011
icho-0.010
emouth-0.010
 Garg-0.010
Token army
Token ID5428
Feature activation+1.83e-01

Pos logit contributions
orf+0.024
 COURT+0.023
icho+0.022
emouth+0.022
 Garg+0.021
Neg logit contributions
PB-0.023
fters-0.022
 cancellation-0.022
Lead-0.022
Writer-0.021
Token general
Token ID2276
Feature activation-5.38e-02

Pos logit contributions
orf+0.135
 COURT+0.130
icho+0.126
emouth+0.124
 Garg+0.124
Neg logit contributions
PB-0.104
fters-0.102
 cancellation-0.100
Lead-0.098
Writer-0.094
Token,
Token ID11
Feature activation-1.83e-02

Pos logit contributions
orf+0.155
 COURT+0.149
icho+0.144
emouth+0.141
 Garg+0.141
Neg logit contributions
PB-0.152
fters-0.150
 cancellation-0.148
Lead-0.144
Writer-0.140
Token like
Token ID588
Feature activation+4.85e-03

Pos logit contributions
PB+0.012
fters+0.012
 cancellation+0.012
Lead+0.012
Writer+0.011
Neg logit contributions
orf-0.012
 COURT-0.011
icho-0.011
emouth-0.011
 Garg-0.010
Token Dragon
Token ID2851
Feature activation-8.75e-02

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.003
fters-0.003
 cancellation-0.003
Lead-0.003
Writer-0.003
Tokenic
Token ID291
Feature activation-3.56e-02

Pos logit contributions
PB+0.056
fters+0.055
 cancellation+0.054
Lead+0.054
Writer+0.052
Neg logit contributions
orf-0.058
 COURT-0.056
icho-0.054
emouth-0.053
 Garg-0.052
Token Di
Token ID6031
Feature activation-1.13e-01

Pos logit contributions
PB+0.022
fters+0.022
 cancellation+0.022
Lead+0.021
Writer+0.021
Neg logit contributions
orf-0.024
 COURT-0.023
icho-0.022
emouth-0.022
 Garg-0.022
Tokenagram
Token ID6713
Feature activation+3.83e-01

Pos logit contributions
PB+0.080
fters+0.079
 cancellation+0.078
Lead+0.077
Writer+0.074
Neg logit contributions
orf-0.067
 COURT-0.064
icho-0.061
emouth-0.060
 Garg-0.060
Token or
Token ID393
Feature activation-1.38e-02

Pos logit contributions
orf+0.250
 COURT+0.240
icho+0.231
emouth+0.228
 Garg+0.227
Neg logit contributions
PB-0.250
fters-0.247
 cancellation-0.242
Lead-0.238
Writer-0.231
Token Dec
Token ID4280
Feature activation-8.61e-02

Pos logit contributions
PB+0.009
fters+0.009
 cancellation+0.008
Lead+0.008
Writer+0.008
Neg logit contributions
orf-0.009
 COURT-0.009
icho-0.009
emouth-0.009
 Garg-0.008
Tokenode
Token ID1098
Feature activation-3.55e-02

Pos logit contributions
PB+0.051
fters+0.051
 cancellation+0.050
Lead+0.049
Writer+0.047
Neg logit contributions
orf-0.060
 COURT-0.059
icho-0.056
emouth-0.056
 Garg-0.055
Token Talk
Token ID12167
Feature activation-6.71e-02

Pos logit contributions
PB+0.022
fters+0.022
 cancellation+0.022
Lead+0.021
Writer+0.021
Neg logit contributions
orf-0.024
 COURT-0.023
icho-0.022
emouth-0.022
 Garg-0.022
Tokener
Token ID263
Feature activation-6.74e-02

Pos logit contributions
PB+0.043
fters+0.042
 cancellation+0.041
Lead+0.041
Writer+0.039
Neg logit contributions
orf-0.045
 COURT-0.043
icho-0.042
 Garg-0.041
emouth-0.041
Token some
Token ID617
Feature activation+8.44e-02

Pos logit contributions
orf+0.067
 COURT+0.064
icho+0.062
emouth+0.061
 Garg+0.061
Neg logit contributions
PB-0.071
fters-0.070
 cancellation-0.069
Lead-0.067
Writer-0.065
Token tweets
Token ID12665
Feature activation+4.37e-03

Pos logit contributions
orf+0.056
 COURT+0.054
icho+0.052
emouth+0.051
 Garg+0.051
Neg logit contributions
PB-0.054
fters-0.053
 cancellation-0.053
Lead-0.052
Writer-0.050
Token sent
Token ID1908
Feature activation+4.02e-02

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.003
fters-0.003
 cancellation-0.003
Lead-0.003
Writer-0.003
Token by
Token ID416
Feature activation+2.49e-02

Pos logit contributions
orf+0.026
 COURT+0.025
icho+0.024
emouth+0.024
 Garg+0.024
Neg logit contributions
PB-0.026
fters-0.026
 cancellation-0.026
Lead-0.025
Writer-0.024
Token pee
Token ID32638
Feature activation+5.25e-02

Pos logit contributions
orf+0.016
 COURT+0.015
icho+0.015
emouth+0.015
 Garg+0.014
Neg logit contributions
PB-0.017
fters-0.016
 cancellation-0.016
Lead-0.016
Writer-0.015
Tokenved
Token ID1079
Feature activation-5.16e-01

Pos logit contributions
orf+0.035
 COURT+0.034
icho+0.033
 Garg+0.032
emouth+0.032
Neg logit contributions
PB-0.033
fters-0.033
 cancellation-0.032
Lead-0.032
Writer-0.031
Token p
Token ID279
Feature activation+5.21e-03

Pos logit contributions
PB+0.348
fters+0.344
 cancellation+0.339
Lead+0.333
Writer+0.322
Neg logit contributions
orf-0.326
 COURT-0.313
icho-0.300
emouth-0.295
 Garg-0.294
Tokenoth
Token ID849
Feature activation+9.18e-02

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.004
Tokenoles
Token ID4316
Feature activation+2.36e-01

Pos logit contributions
orf+0.033
 COURT+0.032
icho+0.029
emouth+0.028
 Garg+0.028
Neg logit contributions
fters-0.086
PB-0.086
 cancellation-0.084
Lead-0.083
Writer-0.081
Token in
Token ID287
Feature activation+1.64e-02

Pos logit contributions
orf+0.155
 COURT+0.149
icho+0.143
emouth+0.141
 Garg+0.141
Neg logit contributions
PB-0.153
fters-0.151
 cancellation-0.148
Lead-0.146
Writer-0.141
Token Panama
Token ID23519
Feature activation-3.93e-02

Pos logit contributions
orf+0.011
 COURT+0.010
icho+0.010
emouth+0.010
 Garg+0.010
Neg logit contributions
PB-0.011
fters-0.011
 cancellation-0.010
Lead-0.010
Writer-0.010
TokenTest
Token ID14402
Feature activation-1.54e-03

Pos logit contributions
orf+0.002
 COURT+0.002
icho+0.002
 Garg+0.002
emouth+0.002
Neg logit contributions
PB-0.001
fters-0.001
 cancellation-0.001
Lead-0.001
Writer-0.001
Token Kitchen
Token ID22541
Feature activation-1.15e-01

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
emouth-0.001
 Garg-0.001
Token-
Token ID12
Feature activation-2.25e-02

Pos logit contributions
PB+0.074
fters+0.072
 cancellation+0.071
Lead+0.070
Writer+0.068
Neg logit contributions
orf-0.077
 COURT-0.074
icho-0.071
emouth-0.070
 Garg-0.070
TokenApp
Token ID4677
Feature activation+7.12e-02

Pos logit contributions
PB+0.014
fters+0.014
 cancellation+0.013
Lead+0.013
Writer+0.013
Neg logit contributions
orf-0.015
 COURT-0.015
icho-0.014
emouth-0.014
 Garg-0.014
Tokenro
Token ID305
Feature activation-1.82e-01

Pos logit contributions
orf+0.049
 COURT+0.047
icho+0.045
emouth+0.044
 Garg+0.044
Neg logit contributions
PB-0.044
fters-0.043
 cancellation-0.043
Lead-0.042
Writer-0.041
Tokenved
Token ID1079
Feature activation-4.91e-01

Pos logit contributions
PB+0.046
fters+0.045
 cancellation+0.042
Lead+0.041
Writer+0.037
Neg logit contributions
orf-0.191
 COURT-0.187
icho-0.182
 Garg-0.181
emouth-0.180
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
PB+0.319
fters+0.310
 cancellation+0.305
Lead+0.302
Writer+0.290
Neg logit contributions
orf-0.323
 COURT-0.311
icho-0.301
emouth-0.296
 Garg-0.293
Token
Token ID198
Feature activation-7.28e-03

Pos logit contributions
PB+0.010
fters+0.010
Lead+0.010
 cancellation+0.010
Writer+0.009
Neg logit contributions
orf-0.009
 COURT-0.009
icho-0.009
emouth-0.009
 Garg-0.008
TokenAuthor
Token ID13838
Feature activation-7.44e-02

Pos logit contributions
PB+0.004
fters+0.004
Lead+0.004
 cancellation+0.004
Writer+0.004
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.005
 Garg-0.005
emouth-0.005
Token Notes
Token ID11822
Feature activation-2.11e-01

Pos logit contributions
PB+0.050
fters+0.049
 cancellation+0.049
Lead+0.048
Writer+0.047
Neg logit contributions
orf-0.047
 COURT-0.045
icho-0.043
emouth-0.043
 Garg-0.042
Token:
Token ID25
Feature activation+1.35e-02

Pos logit contributions
PB+0.144
fters+0.142
 cancellation+0.139
Lead+0.138
Writer+0.133
Neg logit contributions
orf-0.132
 COURT-0.126
icho-0.121
emouth-0.120
 Garg-0.119
Token extr
Token ID22820
Feature activation-1.93e-01

Pos logit contributions
orf+0.120
 COURT+0.116
icho+0.111
 Garg+0.110
emouth+0.110
Neg logit contributions
PB-0.104
fters-0.102
 cancellation-0.100
Lead-0.098
Writer-0.095
Tokenusion
Token ID4241
Feature activation-6.26e-02

Pos logit contributions
PB+0.116
fters+0.114
 cancellation+0.112
Lead+0.110
Writer+0.106
Neg logit contributions
orf-0.137
 COURT-0.132
icho-0.127
 Garg-0.126
emouth-0.125
Token v
Token ID410
Feature activation+7.63e-03

Pos logit contributions
PB+0.039
fters+0.039
 cancellation+0.038
Lead+0.037
Writer+0.036
Neg logit contributions
orf-0.042
 COURT-0.041
icho-0.039
 Garg-0.039
emouth-0.039
Token.
Token ID13
Feature activation+8.98e-03

Pos logit contributions
orf+0.005
 COURT+0.005
icho+0.005
 Garg+0.005
alle+0.005
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.004
Lead-0.004
Writer-0.004
Tokenver
Token ID332
Feature activation-1.49e-01

Pos logit contributions
orf+0.006
 COURT+0.006
icho+0.006
 Garg+0.006
emouth+0.006
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.005
Tokentex
Token ID16886
Feature activation-4.35e-01

Pos logit contributions
PB+0.010
fters+0.010
 cancellation+0.007
Lead+0.006
Writer+0.003
Neg logit contributions
orf-0.184
 COURT-0.181
icho-0.177
 Garg-0.176
emouth-0.176
Token.
Token ID13
Feature activation+5.42e-03

Pos logit contributions
PB+0.265
fters+0.263
 cancellation+0.254
Lead+0.248
Writer+0.239
Neg logit contributions
orf-0.303
 COURT-0.292
icho-0.278
 Garg-0.277
emouth-0.274
Tokenxy
Token ID5431
Feature activation-2.30e-03

Pos logit contributions
orf+0.004
 COURT+0.003
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.003
fters-0.003
 cancellation-0.003
Lead-0.003
Writer-0.003
Tokenz
Token ID89
Feature activation-5.38e-02

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.002
 COURT-0.001
icho-0.001
 Garg-0.001
emouth-0.001
Token +=
Token ID15853
Feature activation-2.44e-01

Pos logit contributions
PB+0.021
fters+0.020
 cancellation+0.019
Lead+0.019
Writer+0.018
Neg logit contributions
orf-0.050
 COURT-0.048
icho-0.047
 Garg-0.046
emouth-0.046
Token v
Token ID410
Feature activation+7.72e-04

Pos logit contributions
PB+0.160
fters+0.158
 cancellation+0.154
Lead+0.153
Writer+0.147
Neg logit contributions
orf-0.159
 COURT-0.153
icho-0.147
 Garg-0.145
emouth-0.144
TokenV
Token ID53
Feature activation+7.49e-02

Pos logit contributions
orf+0.028
 COURT+0.027
icho+0.026
emouth+0.026
 Garg+0.026
Neg logit contributions
PB-0.023
fters-0.022
 cancellation-0.022
Lead-0.022
Writer-0.021
TokenSA
Token ID4090
Feature activation-1.22e-01

Pos logit contributions
orf+0.073
 COURT+0.071
icho+0.069
 Garg+0.069
emouth+0.069
Neg logit contributions
PB-0.026
fters-0.024
Lead-0.023
 cancellation-0.023
Writer-0.021
Token has
Token ID468
Feature activation+1.16e-04

Pos logit contributions
PB+0.083
fters+0.082
 cancellation+0.081
Lead+0.080
Writer+0.077
Neg logit contributions
orf-0.076
 COURT-0.073
icho-0.070
emouth-0.069
 Garg-0.068
Token not
Token ID407
Feature activation-2.89e-03

Pos logit contributions
orf+0.000
 COURT+0.000
icho+0.000
emouth+0.000
 Garg+0.000
Neg logit contributions
PB-0.000
fters-0.000
 cancellation-0.000
Lead-0.000
Writer-0.000
Token thri
Token ID19531
Feature activation-6.01e-02

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
emouth-0.002
 Garg-0.002
Tokenved
Token ID1079
Feature activation-4.24e-01

Pos logit contributions
PB+0.013
fters+0.012
 cancellation+0.011
Lead+0.011
Writer+0.010
Neg logit contributions
orf-0.066
 COURT-0.064
icho-0.063
 Garg-0.062
emouth-0.062
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
PB+0.271
fters+0.267
 cancellation+0.262
Lead+0.258
Writer+0.249
Neg logit contributions
orf-0.283
 COURT-0.272
icho-0.262
 Garg-0.258
emouth-0.258
Token Oil
Token ID11474
Feature activation+5.58e-02

Pos logit contributions
orf+0.009
 COURT+0.008
icho+0.008
emouth+0.008
 Garg+0.008
Neg logit contributions
PB-0.008
fters-0.008
 cancellation-0.008
Lead-0.008
Writer-0.008
Token exports
Token ID15319
Feature activation+8.97e-02

Pos logit contributions
orf+0.037
 COURT+0.035
icho+0.034
 Garg+0.034
emouth+0.033
Neg logit contributions
PB-0.036
fters-0.036
 cancellation-0.035
Lead-0.034
Writer-0.033
Token have
Token ID423
Feature activation+2.51e-02

Pos logit contributions
orf+0.053
 COURT+0.051
icho+0.049
 Garg+0.048
emouth+0.048
Neg logit contributions
PB-0.063
fters-0.063
 cancellation-0.062
Lead-0.061
Writer-0.059
Token fallen
Token ID9292
Feature activation-5.46e-02

Pos logit contributions
orf+0.016
 COURT+0.015
icho+0.015
emouth+0.014
 Garg+0.014
Neg logit contributions
PB-0.017
fters-0.017
 cancellation-0.017
Lead-0.016
Writer-0.016
Token an
Token ID281
Feature activation+3.49e-02

Pos logit contributions
orf+0.033
 COURT+0.032
icho+0.030
emouth+0.030
 Garg+0.030
Neg logit contributions
PB-0.036
fters-0.036
 cancellation-0.035
Lead-0.035
Writer-0.034
Token Arc
Token ID10173
Feature activation-1.25e-01

Pos logit contributions
orf+0.020
 COURT+0.019
icho+0.018
emouth+0.018
 Garg+0.018
Neg logit contributions
PB-0.026
fters-0.026
 cancellation-0.025
Lead-0.025
Writer-0.024
Token,
Token ID837
Feature activation-3.00e-02

Pos logit contributions
PB+0.076
fters+0.074
 cancellation+0.073
Lead+0.072
Writer+0.069
Neg logit contributions
orf-0.088
 COURT-0.084
icho-0.081
 Garg-0.080
emouth-0.080
Token an
Token ID281
Feature activation+3.78e-02

Pos logit contributions
PB+0.020
fters+0.019
 cancellation+0.019
Lead+0.019
Writer+0.018
Neg logit contributions
orf-0.020
 COURT-0.019
icho-0.018
emouth-0.018
 Garg-0.018
Token As
Token ID1081
Feature activation-1.66e-02

Pos logit contributions
orf+0.023
 COURT+0.022
icho+0.021
emouth+0.021
 Garg+0.021
Neg logit contributions
PB-0.026
fters-0.026
 cancellation-0.025
Lead-0.025
Writer-0.024
Tokenynchronous
Token ID31301
Feature activation-4.20e-01

Pos logit contributions
PB+0.010
fters+0.010
 cancellation+0.010
Lead+0.010
Writer+0.009
Neg logit contributions
orf-0.012
 COURT-0.011
icho-0.011
 Garg-0.011
emouth-0.011
Token Reference
Token ID20984
Feature activation+3.51e-02

Pos logit contributions
PB+0.251
fters+0.246
 cancellation+0.243
Lead+0.238
Writer+0.231
Neg logit contributions
orf-0.298
 COURT-0.288
icho-0.277
emouth-0.274
 Garg-0.273
Token Counter
Token ID15034
Feature activation-1.49e-01

Pos logit contributions
orf+0.024
 COURT+0.023
icho+0.022
 Garg+0.022
emouth+0.022
Neg logit contributions
PB-0.022
fters-0.022
 cancellation-0.021
Lead-0.021
Writer-0.020
Token.
Token ID764
Feature activation-1.84e-01

Pos logit contributions
PB+0.096
fters+0.094
 cancellation+0.092
Lead+0.091
Writer+0.088
Neg logit contributions
orf-0.099
 COURT-0.095
icho-0.092
 Garg-0.090
emouth-0.090
Token We
Token ID775
Feature activation+2.34e-02

Pos logit contributions
PB+0.126
fters+0.124
 cancellation+0.122
Lead+0.121
Writer+0.117
Neg logit contributions
orf-0.115
 COURT-0.109
icho-0.105
emouth-0.104
 Garg-0.103
Token use
Token ID779
Feature activation+2.82e-03

Pos logit contributions
orf+0.014
 COURT+0.014
icho+0.013
emouth+0.013
 Garg+0.013
Neg logit contributions
PB-0.016
fters-0.016
 cancellation-0.016
Lead-0.015
Writer-0.015
Tokenes
Token ID274
Feature activation-4.77e-03

Pos logit contributions
PB+0.190
fters+0.187
 cancellation+0.185
Lead+0.183
Writer+0.177
Neg logit contributions
orf-0.121
 COURT-0.117
icho-0.109
emouth-0.109
 Garg-0.109
Token Ascend
Token ID45387
Feature activation+1.52e-01

Pos logit contributions
PB+0.003
fters+0.003
 cancellation+0.003
Lead+0.003
Writer+0.003
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.003
emouth-0.003
 Garg-0.003
Token player
Token ID2137
Feature activation-1.10e-01

Pos logit contributions
orf+0.099
 COURT+0.095
icho+0.092
emouth+0.090
 Garg+0.090
Neg logit contributions
PB-0.099
fters-0.097
 cancellation-0.096
Lead-0.094
Writer-0.091
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
PB+0.072
fters+0.071
 cancellation+0.070
Lead+0.069
Writer+0.067
Neg logit contributions
orf-0.071
 COURT-0.068
icho-0.065
emouth-0.065
 Garg-0.064
Token Sym
Token ID15845
Feature activation+7.54e-02

Pos logit contributions
PB+0.017
fters+0.017
 cancellation+0.017
Lead+0.016
Writer+0.016
Neg logit contributions
orf-0.016
 COURT-0.015
icho-0.015
emouth-0.015
 Garg-0.014
Tokentex
Token ID16886
Feature activation-4.17e-01

Pos logit contributions
orf+0.048
 COURT+0.047
icho+0.044
emouth+0.044
 Garg+0.044
Neg logit contributions
PB-0.050
fters-0.050
 cancellation-0.049
Lead-0.048
Writer-0.046
Token in
Token ID287
Feature activation+1.37e-02

Pos logit contributions
PB+0.276
fters+0.272
 cancellation+0.267
Lead+0.263
Writer+0.256
Neg logit contributions
orf-0.267
 COURT-0.259
icho-0.248
emouth-0.245
 Garg-0.243
Token sc
Token ID629
Feature activation-3.14e-02

Pos logit contributions
orf+0.009
 COURT+0.009
icho+0.008
emouth+0.008
 Garg+0.008
Neg logit contributions
PB-0.009
fters-0.009
 cancellation-0.009
Lead-0.008
Writer-0.008
Token2
Token ID17
Feature activation-2.17e-02

Pos logit contributions
PB+0.018
fters+0.017
 cancellation+0.017
Lead+0.017
Writer+0.016
Neg logit contributions
orf-0.023
 COURT-0.023
icho-0.022
emouth-0.022
 Garg-0.021
Token,
Token ID11
Feature activation+1.84e-03

Pos logit contributions
PB+0.014
fters+0.014
 cancellation+0.014
Lead+0.014
Writer+0.013
Neg logit contributions
orf-0.014
 COURT-0.014
icho-0.013
emouth-0.013
 Garg-0.013
Token cs
Token ID50115
Feature activation-2.51e-02

Pos logit contributions
orf+0.001
 COURT+0.001
icho+0.001
emouth+0.001
 Garg+0.001
Neg logit contributions
PB-0.001
fters-0.001
 cancellation-0.001
Lead-0.001
Writer-0.001
Token.
Token ID13
Feature activation-1.06e-02

Pos logit contributions
PB+0.034
fters+0.034
 cancellation+0.033
Lead+0.033
Writer+0.032
Neg logit contributions
orf-0.034
 COURT-0.032
icho-0.031
emouth-0.031
 Garg-0.031
Token The
Token ID383
Feature activation+1.15e-02

Pos logit contributions
PB+0.007
fters+0.007
 cancellation+0.007
Lead+0.006
Writer+0.006
Neg logit contributions
orf-0.007
 COURT-0.007
icho-0.007
emouth-0.006
 Garg-0.006
Token company
Token ID1664
Feature activation+1.13e-01

Pos logit contributions
orf+0.008
 COURT+0.007
icho+0.007
emouth+0.007
 Garg+0.007
Neg logit contributions
PB-0.007
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.007
Token has
Token ID468
Feature activation-4.34e-02

Pos logit contributions
orf+0.069
 COURT+0.067
icho+0.064
emouth+0.063
 Garg+0.063
Neg logit contributions
PB-0.078
fters-0.077
 cancellation-0.076
Lead-0.075
Writer-0.072
Token stri
Token ID2338
Feature activation-6.31e-02

Pos logit contributions
PB+0.029
fters+0.028
 cancellation+0.028
Lead+0.027
Writer+0.026
Neg logit contributions
orf-0.028
 COURT-0.027
icho-0.026
emouth-0.026
 Garg-0.025
Tokenved
Token ID1079
Feature activation-4.12e-01

Pos logit contributions
PB+0.018
fters+0.017
 cancellation+0.016
Lead+0.016
Writer+0.015
Neg logit contributions
orf-0.064
 COURT-0.063
icho-0.061
 Garg-0.061
emouth-0.061
Token to
Token ID284
Feature activation-8.94e-03

Pos logit contributions
PB+0.273
fters+0.269
 cancellation+0.264
Lead+0.260
Writer+0.252
Neg logit contributions
orf-0.265
 COURT-0.254
icho-0.245
 Garg-0.241
emouth-0.241
Token make
Token ID787
Feature activation-3.52e-02

Pos logit contributions
PB+0.006
fters+0.006
 cancellation+0.005
Lead+0.005
Writer+0.005
Neg logit contributions
orf-0.006
 COURT-0.006
icho-0.006
 Garg-0.006
emouth-0.006
Token the
Token ID262
Feature activation+1.39e-02

Pos logit contributions
PB+0.023
fters+0.022
 cancellation+0.022
Lead+0.022
Writer+0.021
Neg logit contributions
orf-0.023
 COURT-0.022
icho-0.022
emouth-0.021
 Garg-0.021
Token phone
Token ID3072
Feature activation-8.19e-02

Pos logit contributions
orf+0.009
 COURT+0.009
icho+0.009
emouth+0.009
 Garg+0.009
Neg logit contributions
PB-0.009
fters-0.009
 cancellation-0.009
Lead-0.008
Writer-0.008
Token easier
Token ID4577
Feature activation+1.31e-01

Pos logit contributions
PB+0.054
fters+0.053
 cancellation+0.052
Lead+0.051
Writer+0.050
Neg logit contributions
orf-0.053
 COURT-0.051
icho-0.049
emouth-0.049
 Garg-0.048
Token for
Token ID329
Feature activation+6.26e-03

Pos logit contributions
PB+0.070
fters+0.069
 cancellation+0.068
Lead+0.067
Writer+0.065
Neg logit contributions
orf-0.066
 COURT-0.063
icho-0.060
 Garg-0.059
emouth-0.059
Token Snap
Token ID16026
Feature activation+1.56e-01

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.004
Token.
Token ID13
Feature activation-4.43e-02

Pos logit contributions
orf+0.105
 COURT+0.101
icho+0.097
 Garg+0.096
emouth+0.096
Neg logit contributions
PB-0.099
fters-0.097
 cancellation-0.095
Lead-0.094
Writer-0.091
Token Meanwhile
Token ID11214
Feature activation+1.07e-01

Pos logit contributions
PB+0.030
fters+0.029
 cancellation+0.029
Lead+0.029
Writer+0.028
Neg logit contributions
orf-0.028
 COURT-0.027
icho-0.026
emouth-0.025
 Garg-0.025
Token Flat
Token ID21939
Feature activation-1.56e-01

Pos logit contributions
orf+0.071
 COURT+0.068
icho+0.065
 Garg+0.064
emouth+0.064
Neg logit contributions
PB-0.069
fters-0.068
 cancellation-0.067
Lead-0.066
Writer-0.064
Tokenpak
Token ID41091
Feature activation-4.10e-01

Pos logit contributions
PB+0.018
fters+0.016
Lead+0.013
 cancellation+0.013
Writer+0.010
Neg logit contributions
orf-0.186
 COURT-0.183
 Garg-0.178
icho-0.178
emouth-0.178
Token has
Token ID468
Feature activation-4.02e-02

Pos logit contributions
PB+0.279
fters+0.275
 cancellation+0.270
Lead+0.267
Writer+0.257
Neg logit contributions
orf-0.256
 COURT-0.246
icho-0.236
emouth-0.232
 Garg-0.232
Token not
Token ID407
Feature activation-3.76e-02

Pos logit contributions
PB+0.028
fters+0.027
 cancellation+0.027
Lead+0.026
Writer+0.025
Neg logit contributions
orf-0.025
 COURT-0.024
icho-0.023
emouth-0.023
 Garg-0.022
Token enjoyed
Token ID8359
Feature activation+1.27e-01

Pos logit contributions
PB+0.025
fters+0.024
 cancellation+0.024
Lead+0.024
Writer+0.023
Neg logit contributions
orf-0.024
 COURT-0.023
icho-0.023
emouth-0.022
 Garg-0.022
Token a
Token ID257
Feature activation-2.74e-02

Pos logit contributions
orf+0.081
 COURT+0.078
icho+0.075
 Garg+0.074
emouth+0.073
Neg logit contributions
PB-0.085
fters-0.084
 cancellation-0.082
Lead-0.081
Writer-0.079
Token widespread
Token ID10095
Feature activation+7.22e-02

Pos logit contributions
PB+0.017
fters+0.016
 cancellation+0.016
Lead+0.016
Writer+0.015
Neg logit contributions
orf-0.019
 COURT-0.019
icho-0.018
 Garg-0.018
emouth-0.018
Token is
Token ID318
Feature activation-4.12e-03

Pos logit contributions
orf+0.017
 COURT+0.017
icho+0.016
emouth+0.016
 Garg+0.016
Neg logit contributions
PB-0.019
fters-0.018
 cancellation-0.018
Lead-0.018
Writer-0.017
Token being
Token ID852
Feature activation-6.85e-02

Pos logit contributions
PB+0.003
fters+0.003
 cancellation+0.003
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.003
 Garg-0.003
emouth-0.003
Token treated
Token ID5716
Feature activation-3.40e-02

Pos logit contributions
PB+0.044
fters+0.043
 cancellation+0.042
Lead+0.042
Writer+0.040
Neg logit contributions
orf-0.045
 COURT-0.044
icho-0.042
 Garg-0.041
emouth-0.041
Token in
Token ID287
Feature activation-2.94e-03

Pos logit contributions
PB+0.022
fters+0.022
 cancellation+0.021
Lead+0.021
Writer+0.020
Neg logit contributions
orf-0.022
 COURT-0.022
icho-0.021
 Garg-0.020
emouth-0.020
Token some
Token ID617
Feature activation+2.05e-02

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
emouth-0.002
 Garg-0.002
Token quarters
Token ID13620
Feature activation-4.07e-01

Pos logit contributions
orf+0.015
 COURT+0.014
icho+0.014
 Garg+0.014
emouth+0.014
Neg logit contributions
PB-0.012
fters-0.012
 cancellation-0.012
Lead-0.011
Writer-0.011
Token as
Token ID355
Feature activation-3.92e-02

Pos logit contributions
PB+0.267
fters+0.263
 cancellation+0.259
Lead+0.254
Writer+0.246
Neg logit contributions
orf-0.265
 COURT-0.253
icho-0.244
emouth-0.241
 Garg-0.240
Token mission
Token ID4365
Feature activation+1.17e-01

Pos logit contributions
PB+0.026
fters+0.026
 cancellation+0.025
Lead+0.025
Writer+0.024
Neg logit contributions
orf-0.025
 COURT-0.024
icho-0.023
 Garg-0.022
emouth-0.022
Token failure
Token ID5287
Feature activation-1.22e-01

Pos logit contributions
orf+0.081
 COURT+0.078
icho+0.075
 Garg+0.074
emouth+0.074
Neg logit contributions
PB-0.072
fters-0.071
 cancellation-0.068
Lead-0.068
Writer-0.066
Token.
Token ID13
Feature activation-2.80e-02

Pos logit contributions
PB+0.077
fters+0.076
 cancellation+0.075
Lead+0.073
Writer+0.071
Neg logit contributions
orf-0.083
 COURT-0.080
icho-0.077
emouth-0.076
 Garg-0.076
Token That
Token ID1320
Feature activation+2.58e-02

Pos logit contributions
PB+0.019
fters+0.018
 cancellation+0.018
Lead+0.018
Writer+0.017
Neg logit contributions
orf-0.018
 COURT-0.017
icho-0.017
emouth-0.016
 Garg-0.016
Token)
Token ID8
Feature activation+1.24e-02

Pos logit contributions
PB+0.091
fters+0.089
 cancellation+0.088
Lead+0.087
Writer+0.084
Neg logit contributions
orf-0.096
 COURT-0.092
icho-0.089
 Garg-0.087
emouth-0.087
Token
Token ID198
Feature activation+1.80e-02

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.007
 Garg+0.007
emouth+0.007
Neg logit contributions
PB-0.009
fters-0.009
 cancellation-0.009
Lead-0.008
Writer-0.008
Token
Token ID198
Feature activation+1.32e-02

Pos logit contributions
orf+0.011
 COURT+0.011
icho+0.010
emouth+0.010
 Garg+0.010
Neg logit contributions
PB-0.012
fters-0.012
Lead-0.012
 cancellation-0.012
Writer-0.011
TokenFl
Token ID7414
Feature activation-9.58e-02

Pos logit contributions
orf+0.010
 COURT+0.010
icho+0.009
 Garg+0.009
emouth+0.009
Neg logit contributions
PB-0.007
fters-0.007
Lead-0.007
 cancellation-0.007
Writer-0.007
Tokenat
Token ID265
Feature activation-2.66e-02

Pos logit contributions
PB+0.058
fters+0.057
 cancellation+0.056
Lead+0.055
Writer+0.054
Neg logit contributions
orf-0.066
 COURT-0.064
icho-0.061
 Garg-0.061
emouth-0.061
Tokenpak
Token ID41091
Feature activation-4.07e-01

Pos logit contributions
PB+0.016
fters+0.015
 cancellation+0.015
Lead+0.015
Writer+0.014
Neg logit contributions
orf-0.019
 COURT-0.018
icho-0.018
 Garg-0.018
emouth-0.017
Token,
Token ID11
Feature activation+2.61e-03

Pos logit contributions
PB+0.263
fters+0.258
 cancellation+0.254
Lead+0.250
Writer+0.243
Neg logit contributions
orf-0.269
 COURT-0.259
icho-0.249
emouth-0.246
 Garg-0.245
Token Snap
Token ID16026
Feature activation+1.58e-01

Pos logit contributions
orf+0.002
 COURT+0.002
icho+0.002
emouth+0.002
 Garg+0.002
Neg logit contributions
PB-0.002
fters-0.002
 cancellation-0.002
Lead-0.002
Writer-0.002
Token and
Token ID290
Feature activation+3.34e-02

Pos logit contributions
orf+0.089
 COURT+0.085
icho+0.081
 Garg+0.080
emouth+0.080
Neg logit contributions
PB-0.117
fters-0.115
 cancellation-0.114
Lead-0.112
Writer-0.109
Token App
Token ID2034
Feature activation+3.77e-02

Pos logit contributions
orf+0.022
 COURT+0.021
icho+0.021
emouth+0.020
 Garg+0.020
Neg logit contributions
PB-0.021
fters-0.021
 cancellation-0.021
Lead-0.020
Writer-0.020
TokenImage
Token ID5159
Feature activation+6.34e-02

Pos logit contributions
orf+0.028
 COURT+0.027
icho+0.026
emouth+0.025
 Garg+0.025
Neg logit contributions
PB-0.022
fters-0.021
 cancellation-0.021
Lead-0.020
Writer-0.020
Token the
Token ID262
Feature activation-9.14e-03

Pos logit contributions
orf+0.061
 COURT+0.058
icho+0.056
 Garg+0.055
emouth+0.055
Neg logit contributions
PB-0.066
fters-0.065
 cancellation-0.065
Lead-0.063
Writer-0.061
Token loss
Token ID2994
Feature activation-1.17e-01

Pos logit contributions
PB+0.006
fters+0.006
 cancellation+0.006
Lead+0.006
Writer+0.005
Neg logit contributions
orf-0.006
 COURT-0.006
icho-0.006
 Garg-0.005
emouth-0.005
Token of
Token ID286
Feature activation-2.80e-02

Pos logit contributions
PB+0.077
fters+0.076
 cancellation+0.074
Lead+0.073
Writer+0.071
Neg logit contributions
orf-0.076
 COURT-0.073
icho-0.070
emouth-0.069
 Garg-0.069
Token around
Token ID1088
Feature activation-3.96e-02

Pos logit contributions
PB+0.018
fters+0.018
 cancellation+0.017
Lead+0.017
Writer+0.016
Neg logit contributions
orf-0.019
 COURT-0.018
icho-0.017
emouth-0.017
 Garg-0.017
Token three
Token ID1115
Feature activation-7.33e-03

Pos logit contributions
PB+0.025
fters+0.025
 cancellation+0.024
Lead+0.024
Writer+0.023
Neg logit contributions
orf-0.027
 COURT-0.026
icho-0.025
emouth-0.024
 Garg-0.024
Token quarters
Token ID13620
Feature activation-4.06e-01

Pos logit contributions
PB+0.005
fters+0.005
 cancellation+0.005
Lead+0.004
Writer+0.004
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.005
 Garg-0.004
emouth-0.004
Token of
Token ID286
Feature activation-3.19e-02

Pos logit contributions
PB+0.262
fters+0.258
 cancellation+0.253
Lead+0.250
Writer+0.241
Neg logit contributions
orf-0.268
 COURT-0.257
icho-0.248
 Garg-0.244
emouth-0.244
Token a
Token ID257
Feature activation-3.16e-03

Pos logit contributions
PB+0.021
fters+0.021
 cancellation+0.021
Lead+0.020
Writer+0.020
Neg logit contributions
orf-0.021
 COURT-0.020
icho-0.019
 Garg-0.019
emouth-0.019
Token million
Token ID1510
Feature activation-3.33e-02

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
 Garg-0.002
emouth-0.002
Token jobs
Token ID3946
Feature activation+2.90e-02

Pos logit contributions
PB+0.022
fters+0.021
 cancellation+0.021
Lead+0.021
Writer+0.020
Neg logit contributions
orf-0.022
 COURT-0.021
icho-0.020
 Garg-0.020
emouth-0.020
Token,
Token ID11
Feature activation-4.35e-02

Pos logit contributions
orf+0.020
 COURT+0.019
icho+0.019
 Garg+0.018
emouth+0.018
Neg logit contributions
PB-0.018
fters-0.018
 cancellation-0.017
Lead-0.017
Writer-0.016
Tokenully
Token ID2132
Feature activation-1.05e-01

Pos logit contributions
orf+0.024
 COURT+0.023
icho+0.022
 Garg+0.022
emouth+0.022
Neg logit contributions
PB-0.023
fters-0.022
 cancellation-0.022
Lead-0.022
Writer-0.021
Token/
Token ID14
Feature activation-1.04e-02

Pos logit contributions
PB+0.065
fters+0.064
 cancellation+0.063
Lead+0.062
Writer+0.060
Neg logit contributions
orf-0.071
 COURT-0.069
icho-0.066
emouth-0.065
 Garg-0.065
TokenVict
Token ID21944
Feature activation-2.42e-02

Pos logit contributions
PB+0.006
fters+0.006
 cancellation+0.006
Lead+0.006
Writer+0.006
Neg logit contributions
orf-0.007
 COURT-0.007
icho-0.007
emouth-0.007
 Garg-0.007
Tokenim
Token ID320
Feature activation+1.08e-01

Pos logit contributions
PB+0.017
fters+0.017
 cancellation+0.016
Lead+0.016
Writer+0.016
Neg logit contributions
orf-0.015
 COURT-0.014
icho-0.013
 Garg-0.013
emouth-0.013
Token Question
Token ID18233
Feature activation+1.17e-01

Pos logit contributions
orf+0.073
 COURT+0.070
icho+0.067
emouth+0.066
 Garg+0.066
Neg logit contributions
PB-0.069
fters-0.067
 cancellation-0.066
Lead-0.065
Writer-0.063
Tokennaire
Token ID24042
Feature activation-4.06e-01

Pos logit contributions
orf+0.140
 COURT+0.137
icho+0.134
 Garg+0.134
emouth+0.133
Neg logit contributions
PB-0.012
fters-0.011
 cancellation-0.010
Lead-0.009
Writer-0.006
Token [
Token ID685
Feature activation-3.12e-02

Pos logit contributions
PB+0.250
fters+0.246
 cancellation+0.242
Lead+0.238
Writer+0.230
Neg logit contributions
orf-0.280
 COURT-0.269
icho-0.259
emouth-0.256
 Garg-0.256
TokenM
Token ID44
Feature activation+2.78e-02

Pos logit contributions
PB+0.018
fters+0.018
 cancellation+0.018
Lead+0.018
Writer+0.017
Neg logit contributions
orf-0.022
 COURT-0.021
icho-0.021
emouth-0.021
 Garg-0.020
Tokenimeo
Token ID47776
Feature activation+1.26e-01

Pos logit contributions
orf+0.019
 COURT+0.018
icho+0.017
 Garg+0.017
emouth+0.017
Neg logit contributions
PB-0.018
fters-0.017
 cancellation-0.017
Lead-0.017
Writer-0.016
Token].
Token ID4083
Feature activation-6.43e-02

Pos logit contributions
orf+0.091
 COURT+0.087
icho+0.084
 Garg+0.083
emouth+0.083
Neg logit contributions
PB-0.074
fters-0.072
 cancellation-0.071
Lead-0.070
Writer-0.067
Token Ber
Token ID4312
Feature activation-5.33e-02

Pos logit contributions
PB+0.045
fters+0.044
 cancellation+0.043
Lead+0.043
Writer+0.041
Neg logit contributions
orf-0.039
 COURT-0.037
icho-0.036
emouth-0.036
 Garg-0.035
Token)
Token ID8
Feature activation+2.05e-02

Pos logit contributions
orf+0.029
 COURT+0.028
icho+0.026
 Garg+0.026
emouth+0.026
Neg logit contributions
PB-0.028
fters-0.028
 cancellation-0.027
Lead-0.027
Writer-0.026
Token {
Token ID1391
Feature activation-7.04e-02

Pos logit contributions
orf+0.014
 COURT+0.014
icho+0.013
 Garg+0.013
emouth+0.013
Neg logit contributions
PB-0.012
fters-0.012
 cancellation-0.012
Lead-0.012
Writer-0.011
Token v
Token ID410
Feature activation+3.62e-02

Pos logit contributions
PB+0.045
fters+0.044
 cancellation+0.043
Lead+0.042
Writer+0.041
Neg logit contributions
orf-0.047
 COURT-0.046
icho-0.044
 Garg-0.043
emouth-0.043
Token.
Token ID13
Feature activation+2.42e-02

Pos logit contributions
orf+0.026
 COURT+0.025
icho+0.024
 Garg+0.024
emouth+0.023
Neg logit contributions
PB-0.021
fters-0.021
 cancellation-0.021
Lead-0.020
Writer-0.019
Tokenver
Token ID332
Feature activation-1.22e-01

Pos logit contributions
orf+0.017
 COURT+0.016
icho+0.016
 Garg+0.015
emouth+0.015
Neg logit contributions
PB-0.015
fters-0.015
 cancellation-0.014
Lead-0.014
Writer-0.014
Tokentex
Token ID16886
Feature activation-4.00e-01

Pos logit contributions
PB+0.013
fters+0.012
 cancellation+0.011
Lead+0.009
Writer+0.007
Neg logit contributions
orf-0.145
 COURT-0.142
icho-0.139
 Garg-0.138
emouth-0.138
Token.
Token ID13
Feature activation+1.99e-02

Pos logit contributions
PB+0.207
fters+0.204
 cancellation+0.198
Lead+0.194
Writer+0.185
Neg logit contributions
orf-0.315
 COURT-0.304
icho-0.294
 Garg-0.291
emouth-0.290
Tokenxy
Token ID5431
Feature activation+1.23e-02

Pos logit contributions
orf+0.013
 COURT+0.013
icho+0.012
 Garg+0.012
emouth+0.012
Neg logit contributions
PB-0.013
fters-0.012
 cancellation-0.012
Lead-0.012
Writer-0.011
Tokenz
Token ID89
Feature activation-2.81e-02

Pos logit contributions
orf+0.009
 COURT+0.009
icho+0.008
 Garg+0.008
emouth+0.008
Neg logit contributions
PB-0.007
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.007
Token +=
Token ID15853
Feature activation-2.33e-01

Pos logit contributions
PB+0.013
fters+0.013
 cancellation+0.013
Lead+0.012
Writer+0.012
Neg logit contributions
orf-0.023
 COURT-0.023
icho-0.022
 Garg-0.022
emouth-0.022
Token v
Token ID410
Feature activation+1.66e-02

Pos logit contributions
PB+0.153
fters+0.151
 cancellation+0.148
Lead+0.146
Writer+0.141
Neg logit contributions
orf-0.152
 COURT-0.146
icho-0.140
 Garg-0.138
emouth-0.138
Token".
Token ID1911
Feature activation+5.56e-02

Pos logit contributions
orf+0.019
 COURT+0.018
icho+0.017
 Garg+0.017
emouth+0.017
Neg logit contributions
PB-0.015
fters-0.015
 cancellation-0.015
Lead-0.015
Writer-0.014
Token
Token ID198
Feature activation+3.60e-02

Pos logit contributions
orf+0.035
 COURT+0.034
icho+0.032
 Garg+0.032
emouth+0.032
Neg logit contributions
PB-0.037
fters-0.037
 cancellation-0.036
Lead-0.036
Writer-0.034
Token
Token ID198
Feature activation+2.63e-02

Pos logit contributions
orf+0.023
 COURT+0.022
icho+0.021
emouth+0.021
 Garg+0.021
Neg logit contributions
PB-0.024
fters-0.024
Lead-0.023
 cancellation-0.023
Writer-0.023
TokenFl
Token ID7414
Feature activation-1.15e-01

Pos logit contributions
orf+0.020
 COURT+0.019
icho+0.018
 Garg+0.018
emouth+0.018
Neg logit contributions
PB-0.015
fters-0.014
 cancellation-0.014
Lead-0.014
Writer-0.013
Tokenat
Token ID265
Feature activation-3.41e-02

Pos logit contributions
PB+0.074
fters+0.074
 cancellation+0.071
Lead+0.071
Writer+0.068
Neg logit contributions
orf-0.075
 COURT-0.073
 Garg-0.070
icho-0.069
emouth-0.069
Tokenpak
Token ID41091
Feature activation-3.99e-01

Pos logit contributions
PB+0.004
fters+0.004
 cancellation+0.003
Lead+0.003
Writer+0.003
Neg logit contributions
orf-0.040
 COURT-0.039
icho-0.039
 Garg-0.038
emouth-0.038
Token's
Token ID338
Feature activation-6.69e-03

Pos logit contributions
PB+0.269
fters+0.265
 cancellation+0.261
Lead+0.257
Writer+0.249
Neg logit contributions
orf-0.252
 COURT-0.242
icho-0.232
emouth-0.229
 Garg-0.228
Token website
Token ID3052
Feature activation-3.55e-02

Pos logit contributions
PB+0.004
fters+0.004
 cancellation+0.004
Lead+0.004
Writer+0.004
Neg logit contributions
orf-0.004
 COURT-0.004
icho-0.004
emouth-0.004
 Garg-0.004
Token offers
Token ID4394
Feature activation-2.66e-02

Pos logit contributions
PB+0.024
fters+0.023
 cancellation+0.023
Lead+0.023
Writer+0.022
Neg logit contributions
orf-0.023
 COURT-0.022
icho-0.021
emouth-0.020
 Garg-0.020
Token the
Token ID262
Feature activation-1.23e-02

Pos logit contributions
PB+0.018
fters+0.018
 cancellation+0.018
Lead+0.017
Writer+0.017
Neg logit contributions
orf-0.017
 COURT-0.016
icho-0.015
emouth-0.015
 Garg-0.015
Token following
Token ID1708
Feature activation-2.97e-02

Pos logit contributions
PB+0.008
fters+0.008
 cancellation+0.007
Lead+0.007
Writer+0.007
Neg logit contributions
orf-0.008
 COURT-0.008
icho-0.008
 Garg-0.008
emouth-0.008
Token recommendation
Token ID15602
Feature activation-9.33e-02

Pos logit contributions
PB+0.035
fters+0.034
 cancellation+0.034
Lead+0.033
Writer+0.032
Neg logit contributions
orf-0.033
 COURT-0.031
icho-0.030
emouth-0.030
 Garg-0.029
Token quickly
Token ID2952
Feature activation-3.93e-02

Pos logit contributions
PB+0.063
fters+0.062
 cancellation+0.062
Lead+0.060
Writer+0.058
Neg logit contributions
orf-0.059
 COURT-0.056
icho-0.054
emouth-0.053
 Garg-0.053
Token became
Token ID2627
Feature activation+2.78e-02

Pos logit contributions
PB+0.028
fters+0.027
 cancellation+0.027
Lead+0.027
Writer+0.026
Neg logit contributions
orf-0.024
 COURT-0.023
icho-0.022
emouth-0.021
 Garg-0.021
Token out
Token ID503
Feature activation-8.39e-02

Pos logit contributions
orf+0.019
 COURT+0.019
icho+0.018
emouth+0.018
 Garg+0.018
Neg logit contributions
PB-0.017
fters-0.017
 cancellation-0.016
Lead-0.016
Writer-0.015
Token of
Token ID286
Feature activation-5.71e-02

Pos logit contributions
PB+0.053
fters+0.052
 cancellation+0.051
Lead+0.051
Writer+0.049
Neg logit contributions
orf-0.056
 COURT-0.054
icho-0.052
 Garg-0.051
emouth-0.051
Token date
Token ID3128
Feature activation-3.98e-01

Pos logit contributions
PB+0.034
fters+0.033
 cancellation+0.033
Lead+0.032
Writer+0.031
Neg logit contributions
orf-0.041
 COURT-0.039
icho-0.038
emouth-0.037
 Garg-0.037
Token.
Token ID13
Feature activation-7.02e-02

Pos logit contributions
PB+0.256
fters+0.252
 cancellation+0.247
Lead+0.243
Writer+0.235
Neg logit contributions
orf-0.264
 COURT-0.253
icho-0.244
emouth-0.241
 Garg-0.240
Token It
Token ID632
Feature activation-2.96e-02

Pos logit contributions
PB+0.047
fters+0.046
 cancellation+0.045
Lead+0.044
Writer+0.043
Neg logit contributions
orf-0.045
 COURT-0.043
icho-0.042
emouth-0.041
 Garg-0.041
Token's
Token ID338
Feature activation+4.98e-02

Pos logit contributions
PB+0.020
fters+0.019
 cancellation+0.019
Lead+0.019
Writer+0.018
Neg logit contributions
orf-0.019
 COURT-0.018
icho-0.017
emouth-0.017
 Garg-0.017
Token all
Token ID477
Feature activation-1.86e-02

Pos logit contributions
orf+0.033
 COURT+0.031
icho+0.030
emouth+0.030
 Garg+0.030
Neg logit contributions
PB-0.032
fters-0.032
 cancellation-0.031
Lead-0.031
Writer-0.030
Token a
Token ID257
Feature activation-2.79e-02

Pos logit contributions
PB+0.012
fters+0.012
 cancellation+0.012
Lead+0.012
Writer+0.011
Neg logit contributions
orf-0.012
 COURT-0.012
icho-0.011
emouth-0.011
 Garg-0.011
Token Institution
Token ID29426
Feature activation+4.39e-03

Pos logit contributions
orf+0.017
 COURT+0.017
icho+0.016
 Garg+0.016
emouth+0.016
Neg logit contributions
PB-0.018
fters-0.018
 cancellation-0.017
Lead-0.017
Writer-0.017
Token report
Token ID989
Feature activation+4.35e-02

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.003
fters-0.003
 cancellation-0.003
Lead-0.003
Writer-0.003
Token,
Token ID11
Feature activation-5.90e-03

Pos logit contributions
orf+0.029
 COURT+0.028
icho+0.027
 Garg+0.027
emouth+0.027
Neg logit contributions
PB-0.027
fters-0.027
 cancellation-0.027
Lead-0.026
Writer-0.025
Token
Token ID564
Feature activation-3.20e-02

Pos logit contributions
PB+0.004
fters+0.004
 cancellation+0.004
Lead+0.004
Writer+0.003
Neg logit contributions
orf-0.004
 COURT-0.004
icho-0.004
emouth-0.004
 Garg-0.004
Token
Token ID250
Feature activation+2.30e-02

Pos logit contributions
PB+0.019
fters+0.019
 cancellation+0.018
Lead+0.018
Writer+0.017
Neg logit contributions
orf-0.023
 COURT-0.022
icho-0.021
 Garg-0.021
emouth-0.021
TokenLead
Token ID20451
Feature activation-3.96e-01

Pos logit contributions
orf+0.015
 COURT+0.015
icho+0.014
 Garg+0.014
emouth+0.014
Neg logit contributions
PB-0.015
fters-0.014
 cancellation-0.014
Lead-0.014
Writer-0.013
Tokening
Token ID278
Feature activation+3.55e-02

Pos logit contributions
PB+0.249
fters+0.245
Lead+0.242
 cancellation+0.241
Writer+0.231
Neg logit contributions
orf-0.266
 COURT-0.257
icho-0.247
 Garg-0.242
emouth-0.241
Token Beyond
Token ID12197
Feature activation-1.48e-01

Pos logit contributions
orf+0.023
 COURT+0.022
icho+0.021
 Garg+0.021
emouth+0.020
Neg logit contributions
PB-0.024
fters-0.023
 cancellation-0.023
Lead-0.023
Writer-0.022
Token Limits
Token ID44943
Feature activation-1.20e-01

Pos logit contributions
PB+0.096
fters+0.094
 cancellation+0.092
Lead+0.092
Writer+0.088
Neg logit contributions
orf-0.099
 COURT-0.094
icho-0.091
emouth-0.090
 Garg-0.089
Token:
Token ID25
Feature activation+5.54e-03

Pos logit contributions
PB+0.078
fters+0.076
 cancellation+0.075
Lead+0.074
Writer+0.072
Neg logit contributions
orf-0.079
 COURT-0.076
icho-0.073
 Garg-0.072
emouth-0.072
Token May
Token ID1737
Feature activation-1.40e-01

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.004
fters-0.003
 cancellation-0.003
Lead-0.003
Writer-0.003
Token poll
Token ID3278
Feature activation+3.20e-02

Pos logit contributions
PB+0.007
fters+0.007
 cancellation+0.007
Lead+0.007
Writer+0.006
Neg logit contributions
orf-0.007
 COURT-0.006
icho-0.006
emouth-0.006
 Garg-0.006
Token is
Token ID318
Feature activation-1.60e-02

Pos logit contributions
orf+0.021
 COURT+0.020
icho+0.019
 Garg+0.019
emouth+0.019
Neg logit contributions
PB-0.021
fters-0.020
 cancellation-0.020
Lead-0.020
Writer-0.019
Token the
Token ID262
Feature activation-4.32e-02

Pos logit contributions
PB+0.010
fters+0.010
 cancellation+0.010
Lead+0.010
Writer+0.009
Neg logit contributions
orf-0.011
 COURT-0.010
icho-0.010
 Garg-0.010
emouth-0.010
Token first
Token ID717
Feature activation-4.40e-02

Pos logit contributions
PB+0.027
fters+0.027
 cancellation+0.027
Lead+0.026
Writer+0.025
Neg logit contributions
orf-0.029
 COURT-0.028
icho-0.027
emouth-0.027
 Garg-0.026
Token to
Token ID284
Feature activation-2.69e-02

Pos logit contributions
PB+0.028
fters+0.027
 cancellation+0.027
Lead+0.026
Writer+0.026
Neg logit contributions
orf-0.030
 COURT-0.028
icho-0.027
 Garg-0.027
emouth-0.027
Token date
Token ID3128
Feature activation-3.96e-01

Pos logit contributions
PB+0.017
fters+0.017
 cancellation+0.016
Lead+0.016
Writer+0.015
Neg logit contributions
orf-0.018
 COURT-0.018
icho-0.017
emouth-0.017
 Garg-0.017
Token of
Token ID286
Feature activation-1.94e-02

Pos logit contributions
PB+0.254
fters+0.250
 cancellation+0.245
Lead+0.241
Writer+0.233
Neg logit contributions
orf-0.263
 COURT-0.253
icho-0.243
 Garg-0.240
emouth-0.240
Token the
Token ID262
Feature activation-2.36e-02

Pos logit contributions
PB+0.013
fters+0.013
 cancellation+0.012
Lead+0.012
Writer+0.012
Neg logit contributions
orf-0.012
 COURT-0.012
icho-0.011
emouth-0.011
 Garg-0.011
Token proposed
Token ID5150
Feature activation+6.11e-02

Pos logit contributions
PB+0.015
fters+0.015
 cancellation+0.015
Lead+0.015
Writer+0.014
Neg logit contributions
orf-0.015
 COURT-0.015
icho-0.014
emouth-0.014
 Garg-0.014
Token legislation
Token ID5520
Feature activation+4.85e-03

Pos logit contributions
orf+0.039
 COURT+0.037
icho+0.036
emouth+0.035
 Garg+0.035
Neg logit contributions
PB-0.041
fters-0.040
 cancellation-0.040
Lead-0.039
Writer-0.038
Token,
Token ID11
Feature activation-3.32e-02

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.003
fters-0.003
 cancellation-0.003
Lead-0.003
Writer-0.003
Token.
Token ID13
Feature activation-4.00e-04

Pos logit contributions
orf+0.035
 COURT+0.034
icho+0.033
 Garg+0.032
emouth+0.032
Neg logit contributions
PB-0.037
fters-0.037
 cancellation-0.036
Lead-0.035
Writer-0.034
Token
Token ID198
Feature activation-4.53e-04

Pos logit contributions
PB+0.000
fters+0.000
 cancellation+0.000
Lead+0.000
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
 Garg-0.000
emouth-0.000
Token
Token ID198
Feature activation-1.12e-03

Pos logit contributions
PB+0.000
fters+0.000
Lead+0.000
 cancellation+0.000
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
emouth-0.000
 Garg-0.000
Token
Token ID447
Feature activation-2.31e-03

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
 Garg-0.001
emouth-0.001
Token
Token ID250
Feature activation-3.20e-02

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
emouth-0.002
 Garg-0.002
TokenLead
Token ID20451
Feature activation-3.93e-01

Pos logit contributions
PB+0.018
fters+0.018
 cancellation+0.018
Lead+0.018
Writer+0.017
Neg logit contributions
orf-0.023
 COURT-0.022
icho-0.022
 Garg-0.021
emouth-0.021
Token the
Token ID262
Feature activation+2.41e-03

Pos logit contributions
PB+0.261
fters+0.259
 cancellation+0.254
Lead+0.251
Writer+0.241
Neg logit contributions
orf-0.249
 COURT-0.239
icho-0.230
 Garg-0.226
emouth-0.224
Token lady
Token ID10846
Feature activation+1.34e-01

Pos logit contributions
orf+0.001
 COURT+0.001
icho+0.001
emouth+0.001
 Garg+0.001
Neg logit contributions
PB-0.002
fters-0.002
 cancellation-0.002
Lead-0.002
Writer-0.002
Token through
Token ID832
Feature activation+6.73e-02

Pos logit contributions
orf+0.087
 COURT+0.083
icho+0.080
emouth+0.079
 Garg+0.079
Neg logit contributions
PB-0.087
fters-0.086
 cancellation-0.085
Lead-0.084
Writer-0.081
Token the
Token ID262
Feature activation+5.93e-03

Pos logit contributions
orf+0.041
 COURT+0.039
icho+0.038
emouth+0.037
 Garg+0.037
Neg logit contributions
PB-0.046
fters-0.046
 cancellation-0.045
Lead-0.044
Writer-0.043
Token quad
Token ID15094
Feature activation-2.57e-01

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.004
emouth+0.004
 Garg+0.004
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.003
Writer-0.003
Token room
Token ID2119
Feature activation-2.20e-02

Pos logit contributions
PB+0.011
fters+0.011
 cancellation+0.011
Lead+0.011
Writer+0.010
Neg logit contributions
orf-0.012
 COURT-0.011
icho-0.011
emouth-0.011
 Garg-0.011
Token-
Token ID12
Feature activation-2.81e-02

Pos logit contributions
PB+0.016
fters+0.016
 cancellation+0.016
Lead+0.015
Writer+0.015
Neg logit contributions
orf-0.013
 COURT-0.012
icho-0.012
emouth-0.011
 Garg-0.011
Tokentem
Token ID11498
Feature activation+2.36e-01

Pos logit contributions
PB+0.023
fters+0.022
 cancellation+0.022
Lead+0.022
Writer+0.021
Neg logit contributions
orf-0.014
 COURT-0.013
icho-0.013
emouth-0.012
 Garg-0.012
Tokenperature
Token ID21069
Feature activation+1.98e-01

Pos logit contributions
orf+0.086
 COURT+0.080
icho+0.074
emouth+0.074
 Garg+0.072
Neg logit contributions
PB-0.222
fters-0.220
 cancellation-0.217
Lead-0.214
Writer-0.209
Token super
Token ID2208
Feature activation+8.94e-02

Pos logit contributions
orf+0.133
 COURT+0.128
icho+0.123
emouth+0.122
 Garg+0.120
Neg logit contributions
PB-0.126
 cancellation-0.124
fters-0.124
Lead-0.120
Writer-0.116
Tokenconduct
Token ID36495
Feature activation-3.92e-01

Pos logit contributions
orf+0.088
 COURT+0.086
icho+0.084
emouth+0.083
 Garg+0.083
Neg logit contributions
PB-0.028
fters-0.028
 cancellation-0.026
Lead-0.026
Writer-0.024
Tokenivity
Token ID3458
Feature activation+8.23e-02

Pos logit contributions
PB+0.293
fters+0.290
 cancellation+0.286
Lead+0.282
Writer+0.273
Neg logit contributions
orf-0.217
 COURT-0.208
icho-0.198
 Garg-0.196
emouth-0.195
Token has
Token ID468
Feature activation-2.45e-02

Pos logit contributions
orf+0.051
 COURT+0.049
icho+0.047
emouth+0.046
 Garg+0.046
Neg logit contributions
PB-0.056
fters-0.055
 cancellation-0.055
Lead-0.054
Writer-0.052
Token remained
Token ID6150
Feature activation+6.03e-03

Pos logit contributions
PB+0.017
fters+0.016
 cancellation+0.016
Lead+0.016
Writer+0.015
Neg logit contributions
orf-0.015
 COURT-0.015
icho-0.014
emouth-0.014
 Garg-0.014
Token so
Token ID523
Feature activation-1.07e-01

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.004
emouth+0.004
 Garg+0.004
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.003
Token elusive
Token ID29885
Feature activation-1.85e-01

Pos logit contributions
PB+0.066
fters+0.065
 cancellation+0.064
Lead+0.063
Writer+0.061
Neg logit contributions
orf-0.073
 COURT-0.070
icho-0.067
emouth-0.067
 Garg-0.066
Token new
Token ID649
Feature activation+1.26e-02

Pos logit contributions
PB+0.008
fters+0.008
 cancellation+0.008
Lead+0.008
Writer+0.007
Neg logit contributions
orf-0.009
 COURT-0.009
icho-0.008
emouth-0.008
 Garg-0.008
Token package
Token ID5301
Feature activation+2.47e-02

Pos logit contributions
orf+0.009
 COURT+0.009
icho+0.008
emouth+0.008
 Garg+0.008
Neg logit contributions
PB-0.008
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.007
Token formats
Token ID17519
Feature activation-2.05e-01

Pos logit contributions
orf+0.020
 COURT+0.020
icho+0.019
emouth+0.019
 Garg+0.019
Neg logit contributions
PB-0.012
fters-0.012
 cancellation-0.012
Lead-0.011
Writer-0.011
Token,
Token ID11
Feature activation+9.17e-03

Pos logit contributions
PB+0.132
fters+0.130
 cancellation+0.128
Lead+0.125
Writer+0.121
Neg logit contributions
orf-0.136
 COURT-0.130
icho-0.125
emouth-0.124
 Garg-0.123
Token Flat
Token ID21939
Feature activation-1.50e-01

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.007
emouth+0.007
 Garg+0.007
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.004
Tokenpak
Token ID41091
Feature activation-3.92e-01

Pos logit contributions
PB+0.020
fters+0.018
 cancellation+0.017
Lead+0.015
Writer+0.013
Neg logit contributions
orf-0.176
 COURT-0.173
icho-0.169
emouth-0.168
 Garg-0.168
Token and
Token ID290
Feature activation+8.92e-03

Pos logit contributions
PB+0.276
fters+0.271
 cancellation+0.266
Lead+0.262
Writer+0.256
Neg logit contributions
orf-0.237
 COURT-0.227
icho-0.217
emouth-0.215
 Garg-0.213
Token Snap
Token ID16026
Feature activation+1.74e-01

Pos logit contributions
orf+0.005
 COURT+0.005
icho+0.005
emouth+0.005
 Garg+0.005
Neg logit contributions
PB-0.006
fters-0.006
 cancellation-0.006
Lead-0.006
Writer-0.006
Token (
Token ID357
Feature activation+2.01e-02

Pos logit contributions
orf+0.119
 COURT+0.114
icho+0.110
 Garg+0.109
emouth+0.108
Neg logit contributions
PB-0.108
fters-0.107
 cancellation-0.104
Lead-0.103
Writer-0.099
Tokenaka
Token ID8130
Feature activation+1.54e-01

Pos logit contributions
orf+0.013
 COURT+0.013
icho+0.012
emouth+0.012
 Garg+0.012
Neg logit contributions
PB-0.013
fters-0.013
 cancellation-0.012
Lead-0.012
Writer-0.012
Token Sn
Token ID5489
Feature activation-4.18e-02

Pos logit contributions
orf+0.102
 COURT+0.098
icho+0.095
emouth+0.094
 Garg+0.093
Neg logit contributions
PB-0.099
fters-0.097
 cancellation-0.095
Lead-0.094
Writer-0.090
Token otherwise
Token ID4306
Feature activation-1.33e-01

Pos logit contributions
PB+0.006
fters+0.006
 cancellation+0.006
Lead+0.006
Writer+0.006
Neg logit contributions
orf-0.006
 COURT-0.006
icho-0.006
 Garg-0.006
emouth-0.006
Token a
Token ID257
Feature activation-8.15e-03

Pos logit contributions
PB+0.095
fters+0.094
 cancellation+0.093
Lead+0.091
Writer+0.089
Neg logit contributions
orf-0.079
 COURT-0.075
icho-0.072
emouth-0.071
 Garg-0.071
Token her
Token ID607
Feature activation-1.18e-02

Pos logit contributions
PB+0.005
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.005
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.005
 Garg-0.005
emouth-0.005
Tokenky
Token ID2584
Feature activation+1.42e-01

Pos logit contributions
PB+0.010
fters+0.010
 cancellation+0.010
Lead+0.010
Writer+0.009
Neg logit contributions
orf-0.006
 COURT-0.005
icho-0.005
 Garg-0.005
emouth-0.005
Token-
Token ID12
Feature activation-2.56e-02

Pos logit contributions
orf+0.092
 COURT+0.088
icho+0.085
emouth+0.084
 Garg+0.084
Neg logit contributions
PB-0.093
fters-0.092
 cancellation-0.091
Lead-0.089
Writer-0.086
Tokenjer
Token ID44009
Feature activation+2.06e-01

Pos logit contributions
PB+0.022
fters+0.021
Lead+0.021
 cancellation+0.021
Writer+0.020
Neg logit contributions
orf-0.012
 COURT-0.011
 Garg-0.010
icho-0.010
emouth-0.010
Tokenky
Token ID2584
Feature activation+2.32e-01

Pos logit contributions
orf+0.096
 COURT+0.089
 Garg+0.085
emouth+0.084
alle+0.078
Neg logit contributions
fters-0.177
PB-0.175
 cancellation-0.171
Lead-0.166
rette-0.164
Token,
Token ID11
Feature activation-9.31e-03

Pos logit contributions
orf+0.148
 COURT+0.142
icho+0.137
emouth+0.135
 Garg+0.134
Neg logit contributions
PB-0.155
fters-0.152
 cancellation-0.150
Lead-0.148
Writer-0.143
Token uneven
Token ID30690
Feature activation-8.26e-02

Pos logit contributions
PB+0.006
fters+0.006
 cancellation+0.006
Lead+0.006
Writer+0.006
Neg logit contributions
orf-0.006
 COURT-0.006
icho-0.006
emouth-0.006
 Garg-0.006
Token debate
Token ID4384
Feature activation+8.35e-02

Pos logit contributions
PB+0.053
fters+0.052
 cancellation+0.051
Lead+0.050
Writer+0.049
Neg logit contributions
orf-0.055
 COURT-0.053
icho-0.051
emouth-0.050
 Garg-0.050
Token.
Token ID13
Feature activation-3.02e-02

Pos logit contributions
orf+0.057
 COURT+0.055
icho+0.053
 Garg+0.053
emouth+0.052
Neg logit contributions
PB-0.052
fters-0.051
 cancellation-0.050
Lead-0.049
Writer-0.047
Token is
Token ID318
Feature activation+2.82e-02

Pos logit contributions
orf+0.030
 COURT+0.029
icho+0.028
 Garg+0.027
emouth+0.027
Neg logit contributions
PB-0.033
fters-0.033
 cancellation-0.032
Lead-0.032
Writer-0.031
Token about
Token ID546
Feature activation+5.88e-03

Pos logit contributions
orf+0.019
 COURT+0.018
icho+0.017
emouth+0.017
 Garg+0.017
Neg logit contributions
PB-0.018
fters-0.018
 cancellation-0.018
Lead-0.017
Writer-0.017
Token 12
Token ID1105
Feature activation-4.33e-02

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.004
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.004
Token hours
Token ID2250
Feature activation+4.85e-02

Pos logit contributions
PB+0.029
fters+0.028
 cancellation+0.028
Lead+0.027
Writer+0.026
Neg logit contributions
orf-0.028
 COURT-0.027
icho-0.026
emouth-0.026
 Garg-0.025
Token off
Token ID572
Feature activation-5.25e-02

Pos logit contributions
orf+0.031
 COURT+0.029
icho+0.028
emouth+0.028
 Garg+0.028
Neg logit contributions
PB-0.033
fters-0.032
 cancellation-0.032
Lead-0.031
Writer-0.030
Token winning
Token ID5442
Feature activation+1.96e-01

Pos logit contributions
PB+0.034
fters+0.034
 cancellation+0.033
Lead+0.033
Writer+0.032
Neg logit contributions
orf-0.034
 COURT-0.033
icho-0.032
 Garg-0.031
emouth-0.031
Token a
Token ID257
Feature activation-3.99e-03

Pos logit contributions
orf+0.126
 COURT+0.121
icho+0.116
 Garg+0.114
emouth+0.114
Neg logit contributions
PB-0.130
fters-0.129
 cancellation-0.127
Lead-0.124
Writer-0.120
Token Stanley
Token ID14167
Feature activation+2.62e-02

Pos logit contributions
PB+0.003
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.002
emouth-0.002
 Garg-0.002
Token Cup
Token ID5454
Feature activation+1.77e-01

Pos logit contributions
orf+0.015
 COURT+0.014
icho+0.014
emouth+0.014
 Garg+0.013
Neg logit contributions
PB-0.019
fters-0.019
 cancellation-0.019
Lead-0.018
Writer-0.018
Token and
Token ID290
Feature activation-8.41e-03

Pos logit contributions
orf+0.116
 COURT+0.111
icho+0.107
emouth+0.105
 Garg+0.105
Neg logit contributions
PB-0.115
fters-0.114
 cancellation-0.112
Lead-0.110
Writer-0.106
Token obviously
Token ID6189
Feature activation-5.38e-02

Pos logit contributions
PB+0.006
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.005
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.005
 Garg-0.005
emouth-0.005
Token the
Token ID262
Feature activation+1.68e-02

Pos logit contributions
orf+0.010
 COURT+0.010
icho+0.009
emouth+0.009
 Garg+0.009
Neg logit contributions
PB-0.010
fters-0.010
 cancellation-0.010
Lead-0.010
Writer-0.009
Token legal
Token ID2742
Feature activation+7.21e-02

Pos logit contributions
orf+0.011
 COURT+0.011
icho+0.010
emouth+0.010
 Garg+0.010
Neg logit contributions
PB-0.011
fters-0.011
 cancellation-0.011
Lead-0.010
Writer-0.010
Token standard
Token ID3210
Feature activation+9.56e-02

Pos logit contributions
orf+0.051
 COURT+0.049
icho+0.048
emouth+0.047
 Garg+0.047
Neg logit contributions
PB-0.043
fters-0.042
 cancellation-0.041
Lead-0.041
Writer-0.039
Token required
Token ID2672
Feature activation+2.32e-02

Pos logit contributions
orf+0.062
 COURT+0.060
icho+0.058
emouth+0.057
 Garg+0.057
Neg logit contributions
PB-0.062
fters-0.062
 cancellation-0.060
Lead-0.060
Writer-0.057
Token to
Token ID284
Feature activation+2.33e-02

Pos logit contributions
orf+0.015
 COURT+0.014
icho+0.014
 Garg+0.013
emouth+0.013
Neg logit contributions
PB-0.015
fters-0.015
 cancellation-0.015
Lead-0.015
Writer-0.014
Token effect
Token ID1245
Feature activation+2.67e-01

Pos logit contributions
orf+0.016
 COURT+0.015
icho+0.014
emouth+0.014
 Garg+0.014
Neg logit contributions
PB-0.015
fters-0.015
 cancellation-0.014
Lead-0.014
Writer-0.013
Token an
Token ID281
Feature activation+5.92e-02

Pos logit contributions
orf+0.166
 COURT+0.158
icho+0.152
 Garg+0.150
emouth+0.150
Neg logit contributions
PB-0.182
fters-0.179
 cancellation-0.178
Lead-0.174
Writer-0.168
Token arrest
Token ID3251
Feature activation-1.62e-01

Pos logit contributions
orf+0.051
 COURT+0.050
icho+0.049
emouth+0.048
 Garg+0.048
Neg logit contributions
PB-0.026
 cancellation-0.025
fters-0.025
Lead-0.024
Writer-0.023
Token,
Token ID11
Feature activation+1.07e-02

Pos logit contributions
PB+0.100
fters+0.098
 cancellation+0.097
Lead+0.095
Writer+0.091
Neg logit contributions
orf-0.112
 COURT-0.108
icho-0.104
emouth-0.102
 Garg-0.102
Token
Token ID447
Feature activation+3.48e-03

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.006
emouth+0.006
 Garg+0.006
Neg logit contributions
PB-0.007
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.006
Token
Token ID251
Feature activation+4.33e-02

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.002
fters-0.002
 cancellation-0.002
Lead-0.001
Writer-0.001
Token the
Token ID262
Feature activation-1.67e-02

Pos logit contributions
orf+0.002
 COURT+0.002
icho+0.002
emouth+0.002
 Garg+0.002
Neg logit contributions
PB-0.003
fters-0.003
 cancellation-0.002
Lead-0.002
Writer-0.002
Token administration
Token ID3662
Feature activation+2.26e-02

Pos logit contributions
PB+0.011
 cancellation+0.010
fters+0.010
Lead+0.010
Writer+0.010
Neg logit contributions
orf-0.011
 COURT-0.011
icho-0.010
emouth-0.010
 Garg-0.010
Token
Token ID447
Feature activation-1.21e-03

Pos logit contributions
orf+0.014
 COURT+0.014
icho+0.013
 Garg+0.013
emouth+0.013
Neg logit contributions
PB-0.015
fters-0.015
 cancellation-0.015
Lead-0.015
Writer-0.014
Token
Token ID247
Feature activation+4.87e-03

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
 Garg-0.001
emouth-0.001
Tokens
Token ID82
Feature activation+2.72e-02

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.002
 Garg+0.002
emouth+0.002
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.003
Writer-0.003
Token bold
Token ID10758
Feature activation+1.93e-01

Pos logit contributions
orf+0.019
 COURT+0.018
icho+0.017
emouth+0.017
 Garg+0.017
Neg logit contributions
PB-0.017
fters-0.016
 cancellation-0.016
Lead-0.016
Writer-0.015
Tokenest
Token ID395
Feature activation-5.04e-02

Pos logit contributions
orf+0.120
 COURT+0.115
icho+0.110
 Garg+0.108
emouth+0.108
Neg logit contributions
PB-0.132
fters-0.130
 cancellation-0.129
Lead-0.127
Writer-0.122
Token experiment
Token ID6306
Feature activation+1.10e-01

Pos logit contributions
PB+0.033
fters+0.032
 cancellation+0.032
Lead+0.031
Writer+0.030
Neg logit contributions
orf-0.033
 COURT-0.032
icho-0.030
emouth-0.030
 Garg-0.030
Token to
Token ID284
Feature activation+7.40e-04

Pos logit contributions
orf+0.072
 COURT+0.069
icho+0.067
 Garg+0.066
emouth+0.066
Neg logit contributions
PB-0.071
fters-0.070
 cancellation-0.068
Lead-0.067
Writer-0.065
Token date
Token ID3128
Feature activation-3.45e-01

Pos logit contributions
orf+0.001
 COURT+0.001
icho+0.001
 Garg+0.001
emouth+0.001
Neg logit contributions
PB-0.000
fters-0.000
 cancellation-0.000
Lead-0.000
Writer-0.000
Token involves
Token ID9018
Feature activation+1.01e-01

Pos logit contributions
PB+0.232
fters+0.229
 cancellation+0.225
Lead+0.222
Writer+0.215
Neg logit contributions
orf-0.218
 COURT-0.209
icho-0.201
 Garg-0.198
emouth-0.197
Token green
Token ID4077
Feature activation-2.52e-02

Pos logit contributions
orf+0.011
 COURT+0.011
icho+0.010
emouth+0.010
 Garg+0.010
Neg logit contributions
PB-0.012
fters-0.012
 cancellation-0.012
Lead-0.011
Writer-0.011
Tokenlighting
Token ID43351
Feature activation+1.33e-02

Pos logit contributions
PB+0.017
fters+0.016
 cancellation+0.016
Lead+0.016
Writer+0.015
Neg logit contributions
orf-0.016
 COURT-0.016
icho-0.015
emouth-0.015
 Garg-0.015
Token its
Token ID663
Feature activation-5.74e-03

Pos logit contributions
orf+0.009
 COURT+0.008
icho+0.008
emouth+0.008
 Garg+0.008
Neg logit contributions
PB-0.009
fters-0.009
 cancellation-0.009
Lead-0.008
Writer-0.008
Token first
Token ID717
Feature activation+2.43e-02

Pos logit contributions
PB+0.004
fters+0.004
 cancellation+0.004
Lead+0.003
Writer+0.003
Neg logit contributions
orf-0.004
 COURT-0.004
icho-0.004
emouth-0.003
 Garg-0.003
Token min
Token ID949
Feature activation+2.11e-02

Pos logit contributions
orf+0.017
 COURT+0.016
icho+0.015
emouth+0.015
 Garg+0.015
Neg logit contributions
PB-0.015
fters-0.015
 cancellation-0.015
Lead-0.014
Writer-0.014
Tokeniser
Token ID5847
Feature activation+3.64e-01

Pos logit contributions
orf+0.006
 COURT+0.006
icho+0.005
 Garg+0.005
emouth+0.005
Neg logit contributions
PB-0.022
fters-0.021
 cancellation-0.021
Lead-0.021
Writer-0.020
Tokenies
Token ID444
Feature activation+2.59e-01

Pos logit contributions
orf+0.165
 COURT+0.156
icho+0.147
 Garg+0.144
emouth+0.144
Neg logit contributions
PB-0.310
fters-0.307
 cancellation-0.302
Lead-0.299
Writer-0.292
Token since
Token ID1201
Feature activation-5.57e-02

Pos logit contributions
orf+0.171
 COURT+0.164
icho+0.158
 Garg+0.156
emouth+0.155
Neg logit contributions
PB-0.167
fters-0.165
 cancellation-0.161
Lead-0.159
Writer-0.154
Token 2009
Token ID3717
Feature activation+4.54e-02

Pos logit contributions
PB+0.037
fters+0.036
 cancellation+0.035
Lead+0.035
Writer+0.034
Neg logit contributions
orf-0.036
 COURT-0.035
icho-0.033
 Garg-0.033
emouth-0.033
Token
Token ID17478
Feature activation-1.13e-01

Pos logit contributions
orf+0.028
 COURT+0.027
icho+0.026
 Garg+0.025
emouth+0.025
Neg logit contributions
PB-0.031
fters-0.031
 cancellation-0.030
Lead-0.030
Writer-0.029
Tokens
Token ID82
Feature activation+9.80e-04

Pos logit contributions
PB+0.085
fters+0.084
 cancellation+0.083
Lead+0.082
Writer+0.080
Neg logit contributions
orf-0.062
 COURT-0.059
icho-0.056
 Garg-0.055
emouth-0.055
Token overall
Token ID4045
Feature activation+1.00e-01

Pos logit contributions
PB+0.014
fters+0.013
 cancellation+0.013
Lead+0.013
Writer+0.013
Neg logit contributions
orf-0.014
 COURT-0.013
icho-0.013
emouth-0.012
 Garg-0.012
Token increase
Token ID2620
Feature activation+7.49e-02

Pos logit contributions
orf+0.064
 COURT+0.062
icho+0.060
emouth+0.058
 Garg+0.058
Neg logit contributions
PB-0.067
 cancellation-0.066
fters-0.066
Lead-0.064
Writer-0.062
Token is
Token ID318
Feature activation+1.73e-02

Pos logit contributions
orf+0.047
 COURT+0.045
icho+0.043
 Garg+0.043
emouth+0.042
Neg logit contributions
PB-0.051
fters-0.050
 cancellation-0.049
Lead-0.049
Writer-0.047
Token down
Token ID866
Feature activation-2.27e-02

Pos logit contributions
orf+0.011
 COURT+0.011
icho+0.010
 Garg+0.010
emouth+0.010
Neg logit contributions
PB-0.011
fters-0.011
 cancellation-0.011
Lead-0.011
Writer-0.010
Token to
Token ID284
Feature activation+1.34e-03

Pos logit contributions
PB+0.015
fters+0.015
 cancellation+0.015
Lead+0.015
Writer+0.014
Neg logit contributions
orf-0.014
 COURT-0.014
icho-0.013
emouth-0.013
 Garg-0.013
Token a
Token ID257
Feature activation+1.75e-03

Pos logit contributions
orf+0.001
 COURT+0.001
icho+0.001
emouth+0.001
 Garg+0.001
Neg logit contributions
PB-0.001
fters-0.001
 cancellation-0.001
Lead-0.001
Writer-0.001
Token booming
Token ID32017
Feature activation-5.58e-02

Pos logit contributions
orf+0.001
 COURT+0.001
icho+0.001
emouth+0.001
 Garg+0.001
Neg logit contributions
PB-0.001
fters-0.001
 cancellation-0.001
Lead-0.001
Writer-0.001
Token population
Token ID3265
Feature activation+2.32e-02

Pos logit contributions
PB+0.035
fters+0.034
 cancellation+0.034
Lead+0.033
Writer+0.032
Neg logit contributions
orf-0.038
 COURT-0.036
icho-0.035
emouth-0.034
 Garg-0.034
Token:
Token ID25
Feature activation-7.37e-03

Pos logit contributions
orf+0.016
 COURT+0.015
icho+0.015
 Garg+0.014
emouth+0.014
Neg logit contributions
PB-0.014
fters-0.014
 cancellation-0.014
Lead-0.014
Writer-0.013
Token some
Token ID617
Feature activation+2.33e-03

Pos logit contributions
PB+0.005
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.004
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.005
 Garg-0.004
emouth-0.004
Token 8
Token ID807
Feature activation+4.31e-03

Pos logit contributions
orf+0.002
 COURT+0.002
icho+0.002
emouth+0.001
 Garg+0.001
Neg logit contributions
PB-0.001
fters-0.001
 cancellation-0.001
Lead-0.001
Writer-0.001
Token his
Token ID465
Feature activation+2.46e-02

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.007
emouth+0.007
 Garg+0.007
Neg logit contributions
PB-0.008
fters-0.008
 cancellation-0.008
Lead-0.007
Writer-0.007
Token second
Token ID1218
Feature activation-1.16e-01

Pos logit contributions
orf+0.017
 COURT+0.017
icho+0.016
emouth+0.016
 Garg+0.016
Neg logit contributions
PB-0.015
fters-0.014
 cancellation-0.014
Lead-0.014
Writer-0.013
Token place
Token ID1295
Feature activation-2.38e-02

Pos logit contributions
PB+0.073
fters+0.072
 cancellation+0.071
Lead+0.069
Writer+0.067
Neg logit contributions
orf-0.079
 COURT-0.075
icho-0.072
emouth-0.071
 Garg-0.071
Token finish
Token ID5461
Feature activation-1.30e-01

Pos logit contributions
PB+0.010
fters+0.010
 cancellation+0.010
Lead+0.010
Writer+0.009
Neg logit contributions
orf-0.021
 COURT-0.020
icho-0.020
emouth-0.019
 Garg-0.019
Token as
Token ID355
Feature activation-1.24e-02

Pos logit contributions
PB+0.085
fters+0.084
 cancellation+0.083
Lead+0.082
Writer+0.079
Neg logit contributions
orf-0.084
 COURT-0.081
icho-0.078
 Garg-0.077
emouth-0.076
Token a
Token ID257
Feature activation+1.03e-02

Pos logit contributions
PB+0.008
fters+0.008
 cancellation+0.008
Lead+0.008
Writer+0.008
Neg logit contributions
orf-0.008
 COURT-0.008
icho-0.007
 Garg-0.007
emouth-0.007
Token success
Token ID1943
Feature activation+1.43e-02

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.007
 Garg+0.006
emouth+0.006
Neg logit contributions
PB-0.006
fters-0.006
 cancellation-0.006
Lead-0.006
Writer-0.006
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
orf+0.010
 COURT+0.009
icho+0.009
 Garg+0.009
emouth+0.009
Neg logit contributions
PB-0.009
fters-0.009
 cancellation-0.009
Lead-0.009
Writer-0.008
Token whereas
Token ID9472
Feature activation-6.09e-02

Pos logit contributions
orf+0.006
 COURT+0.006
icho+0.006
emouth+0.006
 Garg+0.006
Neg logit contributions
PB-0.007
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.007
Token they
Token ID484
Feature activation+5.82e-04

Pos logit contributions
PB+0.042
fters+0.041
 cancellation+0.040
Lead+0.040
Writer+0.039
Neg logit contributions
orf-0.038
 COURT-0.036
icho-0.035
emouth-0.034
 Garg-0.034
Token are
Token ID389
Feature activation+2.65e-02

Pos logit contributions
orf+0.000
 COURT+0.000
icho+0.000
 Garg+0.000
emouth+0.000
Neg logit contributions
PB-0.000
fters-0.000
 cancellation-0.000
Lead-0.000
Writer-0.000
Token remind
Token ID7101
Feature activation-3.20e-02

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
emouth-0.001
 Garg-0.001
Token us
Token ID514
Feature activation-9.13e-04

Pos logit contributions
PB+0.023
fters+0.023
 cancellation+0.023
Lead+0.022
Writer+0.022
Neg logit contributions
orf-0.018
 COURT-0.018
icho-0.017
emouth-0.016
 Garg-0.016
Token of
Token ID286
Feature activation+1.09e-02

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
emouth-0.001
 Garg-0.001
Token the
Token ID262
Feature activation+1.32e-04

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.007
emouth+0.007
 Garg+0.007
Neg logit contributions
PB-0.007
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.006
Token power
Token ID1176
Feature activation+7.47e-02

Pos logit contributions
orf+0.000
 COURT+0.000
icho+0.000
emouth+0.000
 Garg+0.000
Neg logit contributions
PB-0.000
fters-0.000
 cancellation-0.000
Lead-0.000
Writer-0.000
Token of
Token ID286
Feature activation+2.49e-03

Pos logit contributions
orf+0.050
 COURT+0.048
icho+0.046
emouth+0.045
 Garg+0.045
Neg logit contributions
PB-0.048
fters-0.047
 cancellation-0.046
Lead-0.046
Writer-0.044
Token love
Token ID1842
Feature activation+1.21e-01

Pos logit contributions
orf+0.002
 COURT+0.002
icho+0.002
emouth+0.002
 Garg+0.002
Neg logit contributions
PB-0.002
fters-0.002
 cancellation-0.002
Lead-0.002
Writer-0.001
Token,
Token ID11
Feature activation-4.65e-03

Pos logit contributions
orf+0.081
 COURT+0.078
icho+0.075
 Garg+0.074
emouth+0.074
Neg logit contributions
PB-0.077
fters-0.076
 cancellation-0.075
Lead-0.074
Writer-0.071
Token even
Token ID772
Feature activation+3.48e-02

Pos logit contributions
PB+0.003
fters+0.003
 cancellation+0.003
Lead+0.003
Writer+0.003
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.003
emouth-0.003
 Garg-0.003
Token if
Token ID611
Feature activation-3.91e-02

Pos logit contributions
orf+0.023
 COURT+0.022
icho+0.021
emouth+0.020
 Garg+0.020
Neg logit contributions
PB-0.023
fters-0.023
 cancellation-0.022
Lead-0.022
Writer-0.021
Token we
Token ID356
Feature activation-9.21e-03

Pos logit contributions
PB+0.027
fters+0.026
 cancellation+0.026
Lead+0.025
Writer+0.025
Neg logit contributions
orf-0.025
 COURT-0.023
icho-0.023
 Garg-0.022
emouth-0.022
Token
Token ID198
Feature activation-9.44e-03

Pos logit contributions
PB+0.001
fters+0.001
Lead+0.001
 cancellation+0.000
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
emouth-0.000
 Garg-0.000
TokenThe
Token ID464
Feature activation-1.86e-02

Pos logit contributions
PB+0.005
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.005
Neg logit contributions
orf-0.007
 COURT-0.007
icho-0.007
emouth-0.006
 Garg-0.006
Token proposed
Token ID5150
Feature activation+8.89e-02

Pos logit contributions
PB+0.013
fters+0.013
 cancellation+0.013
Lead+0.012
Writer+0.012
Neg logit contributions
orf-0.011
 COURT-0.011
icho-0.011
emouth-0.010
 Garg-0.010
Token conversion
Token ID11315
Feature activation-1.77e-01

Pos logit contributions
orf+0.053
 COURT+0.051
icho+0.049
emouth+0.049
 Garg+0.047
Neg logit contributions
PB-0.063
fters-0.062
 cancellation-0.062
Lead-0.060
Writer-0.058
Token to
Token ID284
Feature activation-1.37e-02

Pos logit contributions
PB+0.111
fters+0.109
 cancellation+0.108
Lead+0.106
Writer+0.102
Neg logit contributions
orf-0.119
 COURT-0.115
icho-0.111
emouth-0.110
 Garg-0.109
Token L
Token ID406
Feature activation+3.26e-02

Pos logit contributions
PB+0.010
fters+0.009
 cancellation+0.009
Lead+0.009
Writer+0.009
Neg logit contributions
orf-0.008
 COURT-0.008
icho-0.008
emouth-0.008
 Garg-0.008
TokenNG
Token ID10503
Feature activation+1.15e-01

Pos logit contributions
orf+0.021
 COURT+0.020
icho+0.019
 Garg+0.019
emouth+0.019
Neg logit contributions
PB-0.022
fters-0.021
 cancellation-0.021
Lead-0.021
Writer-0.020
Token has
Token ID468
Feature activation-6.53e-03

Pos logit contributions
orf+0.072
 COURT+0.069
icho+0.067
emouth+0.065
 Garg+0.065
Neg logit contributions
PB-0.078
fters-0.077
 cancellation-0.076
Lead-0.075
Writer-0.072
Token major
Token ID1688
Feature activation-7.33e-02

Pos logit contributions
PB+0.004
fters+0.004
 cancellation+0.004
Lead+0.004
Writer+0.004
Neg logit contributions
orf-0.004
 COURT-0.004
icho-0.004
emouth-0.004
 Garg-0.004
Token drawbacks
Token ID42763
Feature activation+4.35e-02

Pos logit contributions
PB+0.045
fters+0.044
 cancellation+0.044
Lead+0.043
Writer+0.041
Neg logit contributions
orf-0.051
 COURT-0.049
icho-0.047
emouth-0.046
 Garg-0.046
Token.
Token ID13
Feature activation-2.85e-02

Pos logit contributions
orf+0.029
 COURT+0.028
icho+0.026
 Garg+0.026
emouth+0.026
Neg logit contributions
PB-0.028
fters-0.028
 cancellation-0.027
Lead-0.027
Writer-0.026
TokenNext
Token ID10019
Feature activation+1.69e-02

Pos logit contributions
PB+0.005
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.005
Neg logit contributions
orf-0.006
 COURT-0.006
icho-0.006
 Garg-0.006
emouth-0.006
Token was
Token ID373
Feature activation+4.85e-03

Pos logit contributions
orf+0.011
 COURT+0.010
icho+0.010
 Garg+0.010
emouth+0.010
Neg logit contributions
PB-0.011
fters-0.011
 cancellation-0.011
Lead-0.011
Writer-0.011
Token a
Token ID257
Feature activation-4.40e-03

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
 Garg+0.003
emouth+0.003
Neg logit contributions
PB-0.003
fters-0.003
 cancellation-0.003
Lead-0.003
Writer-0.003
Token bit
Token ID1643
Feature activation-6.82e-05

Pos logit contributions
PB+0.003
fters+0.003
 cancellation+0.003
Lead+0.003
Writer+0.003
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.003
 Garg-0.003
emouth-0.003
Token of
Token ID286
Feature activation-2.11e-02

Pos logit contributions
PB+0.000
fters+0.000
 cancellation+0.000
Lead+0.000
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
 Garg-0.000
emouth-0.000
Token gre
Token ID10536
Feature activation-1.15e-02

Pos logit contributions
PB+0.014
fters+0.014
 cancellation+0.014
Lead+0.013
Writer+0.013
Neg logit contributions
orf-0.013
 COURT-0.013
icho-0.012
 Garg-0.012
emouth-0.012
Tokenenery
Token ID24156
Feature activation-3.31e-03

Pos logit contributions
PB+0.008
fters+0.008
 cancellation+0.008
Lead+0.008
Writer+0.007
Neg logit contributions
orf-0.007
 COURT-0.007
icho-0.006
 Garg-0.006
emouth-0.006
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
 Garg-0.002
emouth-0.002
Token I
Token ID314
Feature activation-2.51e-02

Pos logit contributions
PB+0.009
fters+0.009
 cancellation+0.009
Lead+0.009
Writer+0.009
Neg logit contributions
orf-0.008
 COURT-0.008
icho-0.008
emouth-0.008
 Garg-0.008
Token started
Token ID2067
Feature activation+6.80e-03

Pos logit contributions
PB+0.017
fters+0.017
 cancellation+0.016
Lead+0.016
Writer+0.016
Neg logit contributions
orf-0.016
 COURT-0.015
icho-0.015
 Garg-0.014
emouth-0.014
Token by
Token ID416
Feature activation+5.88e-02

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.004
emouth+0.004
 Garg+0.004
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.004
Token adopted
Token ID8197
Feature activation+1.86e-01

Pos logit contributions
PB+0.108
fters+0.107
 cancellation+0.105
Lead+0.103
Writer+0.099
Neg logit contributions
orf-0.111
 COURT-0.107
icho-0.103
emouth-0.101
 Garg-0.101
Token by
Token ID416
Feature activation+1.11e-02

Pos logit contributions
orf+0.115
 COURT+0.110
icho+0.106
 Garg+0.104
emouth+0.104
Neg logit contributions
PB-0.127
fters-0.126
 cancellation-0.123
Lead-0.122
Writer-0.118
Token the
Token ID262
Feature activation-1.62e-02

Pos logit contributions
orf+0.007
 COURT+0.007
icho+0.007
emouth+0.006
 Garg+0.006
Neg logit contributions
PB-0.007
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.007
Token Ukrainian
Token ID13053
Feature activation-1.12e-01

Pos logit contributions
PB+0.011
fters+0.011
 cancellation+0.010
Lead+0.010
Writer+0.010
Neg logit contributions
orf-0.010
 COURT-0.010
icho-0.010
emouth-0.010
 Garg-0.010
Token Parliament
Token ID8411
Feature activation-5.98e-03

Pos logit contributions
PB+0.071
fters+0.070
 cancellation+0.069
Lead+0.067
Writer+0.065
Neg logit contributions
orf-0.075
 COURT-0.071
icho-0.069
emouth-0.069
 Garg-0.068
Token on
Token ID319
Feature activation-5.03e-02

Pos logit contributions
PB+0.004
fters+0.004
 cancellation+0.004
Lead+0.004
Writer+0.003
Neg logit contributions
orf-0.004
 COURT-0.004
icho-0.004
emouth-0.004
 Garg-0.004
Token 25
Token ID1679
Feature activation-1.37e-02

Pos logit contributions
PB+0.032
fters+0.032
 cancellation+0.031
Lead+0.031
Writer+0.030
Neg logit contributions
orf-0.033
 COURT-0.032
icho-0.031
emouth-0.030
 Garg-0.030
Token September
Token ID2693
Feature activation+3.88e-02

Pos logit contributions
PB+0.009
fters+0.009
 cancellation+0.009
Lead+0.009
Writer+0.008
Neg logit contributions
orf-0.009
 COURT-0.008
icho-0.008
emouth-0.008
 Garg-0.008
Token.
Token ID13
Feature activation-8.65e-03

Pos logit contributions
orf+0.026
 COURT+0.025
icho+0.024
emouth+0.024
 Garg+0.023
Neg logit contributions
PB-0.025
fters-0.025
 cancellation-0.024
Lead-0.024
Writer-0.023
Token The
Token ID383
Feature activation+1.75e-02

Pos logit contributions
PB+0.006
fters+0.006
 cancellation+0.006
Lead+0.006
Writer+0.005
Neg logit contributions
orf-0.006
 COURT-0.005
icho-0.005
emouth-0.005
 Garg-0.005
Token Minister
Token ID4139
Feature activation-1.55e-01

Pos logit contributions
orf+0.012
 COURT+0.012
icho+0.011
 Garg+0.011
emouth+0.011
Neg logit contributions
PB-0.011
fters-0.010
 cancellation-0.010
Lead-0.010
Writer-0.010
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
PB+0.083
fters+0.082
 cancellation+0.080
Lead+0.079
Writer+0.077
Neg logit contributions
orf-0.083
 COURT-0.080
icho-0.077
 Garg-0.076
emouth-0.076
Token That
Token ID1320
Feature activation+5.30e-02

Pos logit contributions
orf+0.009
 COURT+0.009
icho+0.009
emouth+0.009
 Garg+0.009
Neg logit contributions
PB-0.010
fters-0.009
 cancellation-0.009
Lead-0.009
Writer-0.009
Token is
Token ID318
Feature activation+8.57e-03

Pos logit contributions
orf+0.034
 COURT+0.033
icho+0.032
 Garg+0.031
emouth+0.031
Neg logit contributions
PB-0.035
fters-0.034
 cancellation-0.034
Lead-0.033
Writer-0.032
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
orf+0.006
 COURT+0.005
icho+0.005
emouth+0.005
 Garg+0.005
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.005
Token real
Token ID1103
Feature activation+8.26e-02

Pos logit contributions
PB+0.009
fters+0.009
 cancellation+0.009
Lead+0.009
Writer+0.008
Neg logit contributions
orf-0.009
 COURT-0.009
icho-0.009
emouth-0.009
 Garg-0.009
Token tank
Token ID6873
Feature activation-1.44e-01

Pos logit contributions
orf+0.052
 COURT+0.050
icho+0.048
emouth+0.047
 Garg+0.047
Neg logit contributions
PB-0.056
fters-0.055
 cancellation-0.054
Lead-0.053
Writer-0.051
Token that
Token ID326
Feature activation-3.06e-03

Pos logit contributions
PB+0.093
fters+0.092
 cancellation+0.090
Lead+0.088
Writer+0.085
Neg logit contributions
orf-0.095
 COURT-0.091
icho-0.087
 Garg-0.086
emouth-0.086
Token has
Token ID468
Feature activation-4.33e-02

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
emouth-0.002
 Garg-0.002
Token crashed
Token ID14997
Feature activation-8.33e-02

Pos logit contributions
PB+0.030
fters+0.029
 cancellation+0.029
Lead+0.028
Writer+0.027
Neg logit contributions
orf-0.027
 COURT-0.026
icho-0.025
 Garg-0.024
emouth-0.024
Token into
Token ID656
Feature activation-8.01e-02

Pos logit contributions
PB+0.055
fters+0.054
 cancellation+0.053
Lead+0.052
Writer+0.051
Neg logit contributions
orf-0.054
 COURT-0.052
icho-0.050
 Garg-0.049
emouth-0.049
Token the
Token ID262
Feature activation-4.48e-02

Pos logit contributions
PB+0.055
fters+0.054
 cancellation+0.053
Lead+0.052
Writer+0.050
Neg logit contributions
orf-0.050
 COURT-0.048
icho-0.046
emouth-0.045
 Garg-0.045
Token 5
Token ID642
Feature activation-3.04e-03

Pos logit contributions
orf+0.055
 COURT+0.052
icho+0.050
 Garg+0.050
emouth+0.050
Neg logit contributions
PB-0.061
fters-0.060
 cancellation-0.059
Lead-0.058
Writer-0.056
Token.
Token ID13
Feature activation+9.75e-03

Pos logit contributions
PB+0.002
fters+0.002
 cancellation+0.002
Lead+0.002
Writer+0.002
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
emouth-0.002
 Garg-0.002
Token O
Token ID440
Feature activation+2.84e-02

Pos logit contributions
orf+0.006
 COURT+0.006
icho+0.005
emouth+0.005
 Garg+0.005
Neg logit contributions
PB-0.007
fters-0.007
 cancellation-0.007
Lead-0.007
Writer-0.006
Token-
Token ID12
Feature activation+6.04e-03

Pos logit contributions
orf+0.019
 COURT+0.018
icho+0.017
emouth+0.017
 Garg+0.017
Neg logit contributions
PB-0.018
fters-0.018
 cancellation-0.018
Lead-0.018
Writer-0.017
TokenO
Token ID46
Feature activation-1.28e-02

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.004
 Garg+0.004
emouth+0.004
Neg logit contributions
PB-0.004
fters-0.004
Lead-0.004
 cancellation-0.004
Writer-0.003
Token {
Token ID1391
Feature activation-7.42e-02

Pos logit contributions
PB+0.008
fters+0.007
 cancellation+0.007
Lead+0.007
Writer+0.007
Neg logit contributions
orf-0.009
 COURT-0.009
icho-0.008
emouth-0.008
 Garg-0.008
Token(
Token ID7
Feature activation-5.84e-04

Pos logit contributions
PB+0.044
fters+0.043
Lead+0.042
 cancellation+0.042
Writer+0.041
Neg logit contributions
orf-0.053
 COURT-0.051
icho-0.050
emouth-0.049
 Garg-0.049
Token0
Token ID15
Feature activation-9.61e-02

Pos logit contributions
PB+0.000
fters+0.000
 cancellation+0.000
Lead+0.000
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
emouth-0.000
 Garg-0.000
Tokens
Token ID82
Feature activation-7.78e-03

Pos logit contributions
PB+0.071
fters+0.070
 cancellation+0.068
Lead+0.068
Writer+0.066
Neg logit contributions
orf-0.055
 COURT-0.053
icho-0.050
emouth-0.050
 Garg-0.050
Token)}
Token ID38165
Feature activation+8.86e-02

Pos logit contributions
PB+0.005
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.005
Neg logit contributions
orf-0.005
 COURT-0.005
icho-0.004
emouth-0.004
 Garg-0.004
Token c
Token ID269
Feature activation+1.79e-01

Pos logit contributions
orf+0.054
 COURT+0.052
icho+0.049
emouth+0.049
 Garg+0.049
Neg logit contributions
PB-0.062
fters-0.061
 cancellation-0.060
Lead-0.059
Writer-0.057
Token nearly
Token ID3016
Feature activation+8.53e-03

Pos logit contributions
orf+0.010
 COURT+0.010
icho+0.010
emouth+0.010
 Garg+0.009
Neg logit contributions
PB-0.009
fters-0.009
 cancellation-0.009
Lead-0.009
Writer-0.009
Token 50
Token ID2026
Feature activation-6.77e-02

Pos logit contributions
orf+0.006
 COURT+0.006
icho+0.005
emouth+0.005
 Garg+0.005
Neg logit contributions
PB-0.005
fters-0.005
 cancellation-0.005
Lead-0.005
Writer-0.005
Token%.
Token ID7225
Feature activation+6.58e-02

Pos logit contributions
PB+0.041
fters+0.040
 cancellation+0.039
Lead+0.039
Writer+0.037
Neg logit contributions
orf-0.048
 COURT-0.046
icho-0.044
 Garg-0.044
emouth-0.044
Token But
Token ID887
Feature activation+3.37e-02

Pos logit contributions
orf+0.043
 COURT+0.041
icho+0.040
emouth+0.039
 Garg+0.038
Neg logit contributions
PB-0.043
fters-0.042
 cancellation-0.042
Lead-0.041
Writer-0.040
Token the
Token ID262
Feature activation-3.20e-02

Pos logit contributions
orf+0.022
 COURT+0.021
icho+0.020
emouth+0.020
 Garg+0.020
Neg logit contributions
PB-0.022
fters-0.022
 cancellation-0.021
Lead-0.021
Writer-0.020
Token congestion
Token ID28014
Feature activation-4.56e-02

Pos logit contributions
PB+0.021
fters+0.021
 cancellation+0.021
Lead+0.020
Writer+0.020
Neg logit contributions
orf-0.021
 COURT-0.020
icho-0.019
emouth-0.019
 Garg-0.019
Token is
Token ID318
Feature activation-3.18e-02

Pos logit contributions
PB+0.030
 cancellation+0.030
fters+0.029
Lead+0.029
Writer+0.028
Neg logit contributions
orf-0.030
 COURT-0.028
icho-0.027
emouth-0.027
 Garg-0.027
Token also
Token ID635
Feature activation-1.13e-01

Pos logit contributions
PB+0.020
fters+0.020
 cancellation+0.020
Lead+0.019
Writer+0.019
Neg logit contributions
orf-0.021
 COURT-0.020
icho-0.020
emouth-0.019
 Garg-0.019
Token spreading
Token ID14342
Feature activation-4.38e-02

Pos logit contributions
PB+0.073
fters+0.072
 cancellation+0.072
Lead+0.070
Writer+0.067
Neg logit contributions
orf-0.075
 COURT-0.071
icho-0.069
emouth-0.067
 Garg-0.067
Token out
Token ID503
Feature activation-1.67e-03

Pos logit contributions
PB+0.027
fters+0.027
 cancellation+0.027
Lead+0.026
Writer+0.025
Neg logit contributions
orf-0.030
 COURT-0.029
icho-0.028
emouth-0.027
 Garg-0.027
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
 Garg-0.001
emouth-0.001
Token't
Token ID470
Feature activation-4.69e-03

Pos logit contributions
PB+0.040
fters+0.039
Lead+0.039
 cancellation+0.038
Writer+0.037
Neg logit contributions
orf-0.043
 COURT-0.041
emouth-0.040
icho-0.040
 Garg-0.039
Token I
Token ID314
Feature activation+4.28e-02

Pos logit contributions
PB+0.003
fters+0.003
 cancellation+0.003
Lead+0.003
Writer+0.003
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.002
emouth-0.002
 Garg-0.002
Token tell
Token ID1560
Feature activation-6.67e-02

Pos logit contributions
orf+0.027
 COURT+0.026
icho+0.025
emouth+0.025
 Garg+0.025
Neg logit contributions
PB-0.029
fters-0.028
 cancellation-0.028
Lead-0.027
Writer-0.026
Token you
Token ID345
Feature activation+2.42e-02

Pos logit contributions
PB+0.052
fters+0.051
 cancellation+0.050
Lead+0.050
Writer+0.048
Neg logit contributions
orf-0.036
 COURT-0.034
icho-0.032
emouth-0.032
 Garg-0.031
Token I
Token ID314
Feature activation+1.78e-02

Pos logit contributions
orf+0.015
 COURT+0.015
icho+0.014
 Garg+0.014
emouth+0.014
Neg logit contributions
PB-0.016
fters-0.016
 cancellation-0.016
Lead-0.015
Writer-0.015
Token'd
Token ID1549
Feature activation-8.34e-02

Pos logit contributions
orf+0.011
 COURT+0.011
icho+0.010
emouth+0.010
 Garg+0.010
Neg logit contributions
PB-0.012
fters-0.012
 cancellation-0.012
Lead-0.012
Writer-0.011
Token burn
Token ID4245
Feature activation-2.86e-02

Pos logit contributions
PB+0.058
fters+0.057
 cancellation+0.057
Lead+0.055
Writer+0.054
Neg logit contributions
orf-0.051
 COURT-0.048
icho-0.047
emouth-0.046
 Garg-0.046
Token your
Token ID534
Feature activation+2.79e-02

Pos logit contributions
PB+0.020
fters+0.019
 cancellation+0.019
Lead+0.019
Writer+0.018
Neg logit contributions
orf-0.018
 COURT-0.017
icho-0.016
emouth-0.016
 Garg-0.016
Token (
Token ID357
Feature activation+4.17e-02

Pos logit contributions
orf+0.017
 COURT+0.016
icho+0.016
emouth+0.015
 Garg+0.015
Neg logit contributions
PB-0.020
fters-0.019
 cancellation-0.019
Lead-0.019
Writer-0.018
Tokenre
Token ID260
Feature activation+8.19e-02

Pos logit contributions
orf+0.028
 COURT+0.027
icho+0.026
 Garg+0.026
emouth+0.026
Neg logit contributions
PB-0.027
fters-0.026
 cancellation-0.025
Lead-0.025
Writer-0.024
Tokenar
Token ID283
Feature activation-3.82e-02

Pos logit contributions
orf+0.052
 COURT+0.050
icho+0.048
 Garg+0.047
emouth+0.047
Neg logit contributions
PB-0.055
fters-0.054
 cancellation-0.053
Lead-0.052
Writer-0.051
TokenSan
Token ID15017
Feature activation+1.41e-03

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.001
 COURT-0.001
icho-0.001
 Garg-0.001
emouth-0.001
Token Francisco
Token ID6033
Feature activation+8.37e-02

Pos logit contributions
orf+0.001
 COURT+0.001
icho+0.001
emouth+0.001
 Garg+0.001
Neg logit contributions
PB-0.001
fters-0.001
 cancellation-0.001
Lead-0.001
Writer-0.001
Token Board
Token ID5926
Feature activation-1.25e-01

Pos logit contributions
orf+0.063
 COURT+0.061
icho+0.059
emouth+0.058
 Garg+0.058
Neg logit contributions
PB-0.046
fters-0.045
 cancellation-0.044
Lead-0.044
Writer-0.042
Token of
Token ID286
Feature activation-7.14e-03

Pos logit contributions
PB+0.079
fters+0.077
 cancellation+0.076
Lead+0.075
Writer+0.072
Neg logit contributions
orf-0.085
 COURT-0.081
icho-0.078
emouth-0.077
 Garg-0.077
Token Super
Token ID3115
Feature activation+4.77e-02

Pos logit contributions
PB+0.005
fters+0.005
 cancellation+0.005
Lead+0.005
Writer+0.004
Neg logit contributions
orf-0.005
 COURT-0.004
icho-0.004
emouth-0.004
 Garg-0.004
Tokenvisors
Token ID27681
Feature activation-2.82e-01

Pos logit contributions
orf+0.045
 COURT+0.044
icho+0.042
emouth+0.042
 Garg+0.042
Neg logit contributions
PB-0.017
fters-0.017
 cancellation-0.016
Lead-0.016
Writer-0.015
Token
Token ID198
Feature activation-5.30e-04

Pos logit contributions
PB+0.192
fters+0.188
 cancellation+0.186
Lead+0.183
Writer+0.177
Neg logit contributions
orf-0.176
 COURT-0.168
icho-0.162
emouth-0.160
 Garg-0.159
Token
Token ID198
Feature activation-2.09e-03

Pos logit contributions
PB+0.000
fters+0.000
Lead+0.000
 cancellation+0.000
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
emouth-0.000
 Garg-0.000
TokenSan
Token ID15017
Feature activation-5.13e-04

Pos logit contributions
PB+0.001
fters+0.001
 cancellation+0.001
Lead+0.001
Writer+0.001
Neg logit contributions
orf-0.002
 COURT-0.002
icho-0.002
 Garg-0.002
emouth-0.002
Token Francisco
Token ID6033
Feature activation+9.79e-02

Pos logit contributions
PB+0.000
fters+0.000
 cancellation+0.000
Lead+0.000
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
 Garg-0.000
emouth-0.000
Token,
Token ID11
Feature activation+1.64e-02

Pos logit contributions
orf+0.067
 COURT+0.064
icho+0.062
emouth+0.061
 Garg+0.061
Neg logit contributions
PB-0.061
fters-0.060
 cancellation-0.059
Lead-0.058
Writer-0.056
Token 2011
Token ID2813
Feature activation+5.81e-02

Pos logit contributions
orf+0.022
 COURT+0.021
icho+0.020
emouth+0.020
 Garg+0.019
Neg logit contributions
PB-0.022
fters-0.021
 cancellation-0.021
Lead-0.021
Writer-0.020
Token,
Token ID11
Feature activation-2.04e-02

Pos logit contributions
orf+0.039
 COURT+0.037
icho+0.036
 Garg+0.036
emouth+0.035
Neg logit contributions
PB-0.037
fters-0.036
 cancellation-0.036
Lead-0.035
Writer-0.034
Token the
Token ID262
Feature activation-4.54e-03

Pos logit contributions
PB+0.014
fters+0.014
 cancellation+0.013
Lead+0.013
Writer+0.013
Neg logit contributions
orf-0.013
 COURT-0.012
icho-0.012
emouth-0.012
 Garg-0.012
Token exp
Token ID1033
Feature activation+8.06e-02

Pos logit contributions
PB+0.003
fters+0.003
 cancellation+0.003
Lead+0.003
Writer+0.003
Neg logit contributions
orf-0.003
 COURT-0.003
icho-0.003
 Garg-0.003
emouth-0.003
Tokeniry
Token ID9045
Feature activation+5.72e-02

Pos logit contributions
orf+0.046
 COURT+0.043
icho+0.041
 Garg+0.041
emouth+0.041
Neg logit contributions
PB-0.060
fters-0.059
 cancellation-0.058
Lead-0.057
Writer-0.056
Token date
Token ID3128
Feature activation-3.51e-01

Pos logit contributions
orf+0.049
 COURT+0.047
icho+0.046
 Garg+0.046
emouth+0.045
Neg logit contributions
PB-0.026
fters-0.025
 cancellation-0.024
Lead-0.024
Writer-0.023
Token is
Token ID318
Feature activation+2.22e-02

Pos logit contributions
PB+0.229
fters+0.227
 cancellation+0.224
Lead+0.219
Writer+0.212
Neg logit contributions
orf-0.229
 COURT-0.219
icho-0.212
emouth-0.208
 Garg-0.207
Token deemed
Token ID10762
Feature activation-1.41e-01

Pos logit contributions
orf+0.014
 COURT+0.014
icho+0.013
 Garg+0.013
emouth+0.013
Neg logit contributions
PB-0.015
fters-0.014
 cancellation-0.014
Lead-0.014
Writer-0.013
Token to
Token ID284
Feature activation-2.63e-02

Pos logit contributions
PB+0.091
fters+0.089
 cancellation+0.087
Lead+0.086
Writer+0.083
Neg logit contributions
orf-0.094
 COURT-0.090
icho-0.087
 Garg-0.086
emouth-0.085
Token be
Token ID307
Feature activation-1.35e-02

Pos logit contributions
PB+0.019
fters+0.019
 cancellation+0.019
Lead+0.018
Writer+0.018
Neg logit contributions
orf-0.015
 COURT-0.015
icho-0.014
emouth-0.014
 Garg-0.014
Token the
Token ID262
Feature activation-1.03e-02

Pos logit contributions
PB+0.009
fters+0.009
 cancellation+0.009
Lead+0.008
Writer+0.008
Neg logit contributions
orf-0.009
 COURT-0.008
icho-0.008
 Garg-0.008
emouth-0.008
Token and
Token ID290
Feature activation+5.23e-02

Pos logit contributions
orf+0.005
 COURT+0.004
icho+0.004
emouth+0.004
 Garg+0.004
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.004
Writer-0.004
Token J
Token ID449
Feature activation+2.34e-02

Pos logit contributions
orf+0.035
 COURT+0.033
icho+0.032
emouth+0.032
 Garg+0.032
Neg logit contributions
PB-0.033
fters-0.033
 cancellation-0.032
Lead-0.032
Writer-0.031
Tokenerv
Token ID712
Feature activation+5.66e-02

Pos logit contributions
orf+0.016
 COURT+0.016
icho+0.015
 Garg+0.015
emouth+0.015
Neg logit contributions
PB-0.015
fters-0.014
 cancellation-0.014
Lead-0.014
Writer-0.013
Tokenis
Token ID271
Feature activation-1.80e-02

Pos logit contributions
orf+0.038
 COURT+0.037
 Garg+0.035
emouth+0.034
icho+0.034
Neg logit contributions
PB-0.035
fters-0.035
 cancellation-0.034
Lead-0.034
Writer-0.033
Token Bay
Token ID4696
Feature activation+7.76e-02

Pos logit contributions
PB+0.011
fters+0.011
 cancellation+0.010
Lead+0.010
Writer+0.010
Neg logit contributions
orf-0.013
 COURT-0.012
icho-0.012
 Garg-0.012
emouth-0.012
Token Territory
Token ID25219
Feature activation-2.84e-01

Pos logit contributions
orf+0.051
 COURT+0.049
icho+0.047
 Garg+0.047
emouth+0.047
Neg logit contributions
PB-0.050
fters-0.049
 cancellation-0.048
Lead-0.048
Writer-0.046
Token,
Token ID11
Feature activation+2.10e-02

Pos logit contributions
PB+0.178
fters+0.175
 cancellation+0.172
Lead+0.170
Writer+0.164
Neg logit contributions
orf-0.193
 COURT-0.185
icho-0.179
 Garg-0.176
emouth-0.176
Token a
Token ID257
Feature activation+5.78e-03

Pos logit contributions
orf+0.014
 COURT+0.013
icho+0.012
emouth+0.012
 Garg+0.012
Neg logit contributions
PB-0.014
fters-0.014
 cancellation-0.013
Lead-0.013
Writer-0.013
Token naval
Token ID19014
Feature activation+1.09e-01

Pos logit contributions
orf+0.004
 COURT+0.004
icho+0.004
 Garg+0.004
emouth+0.004
Neg logit contributions
PB-0.004
fters-0.004
 cancellation-0.004
Lead-0.003
Writer-0.003
Token base
Token ID2779
Feature activation-1.25e-01

Pos logit contributions
orf+0.077
 COURT+0.074
icho+0.072
emouth+0.070
 Garg+0.070
Neg logit contributions
PB-0.066
fters-0.065
 cancellation-0.064
Lead-0.062
Writer-0.060
Token and
Token ID290
Feature activation+1.85e-02

Pos logit contributions
PB+0.082
fters+0.080
 cancellation+0.079
Lead+0.078
Writer+0.075
Neg logit contributions
orf-0.082
 COURT-0.079
icho-0.076
 Garg-0.074
emouth-0.074
Token Red
Token ID2297
Feature activation+6.98e-02

Pos logit contributions
PB+0.000
fters+0.000
 cancellation+0.000
Lead+0.000
Writer+0.000
Neg logit contributions
orf-0.000
 COURT-0.000
icho-0.000
emouth-0.000
 Garg-0.000
Token Christmas
Token ID6786
Feature activation+5.57e-02

Pos logit contributions
orf+0.046
 COURT+0.045
icho+0.043
emouth+0.042
 Garg+0.042
Neg logit contributions
PB-0.045
fters-0.044
 cancellation-0.043
Lead-0.042
Writer-0.041
Token album
Token ID5062
Feature activation+7.43e-02

Pos logit contributions
orf+0.037
 COURT+0.036
icho+0.035
emouth+0.034
 Garg+0.034
Neg logit contributions
PB-0.036
fters-0.035
 cancellation-0.034
Lead-0.034
Writer-0.033
Token.[
Token ID3693
Feature activation+2.55e-02

Pos logit contributions
orf+0.050
 COURT+0.048
icho+0.046
 Garg+0.046
emouth+0.046
Neg logit contributions
PB-0.047
fters-0.046
 cancellation-0.045
Lead-0.045
Writer-0.043
Tokenc
Token ID66
Feature activation+1.17e-01

Pos logit contributions
orf+0.018
 COURT+0.017
icho+0.017
emouth+0.017
 Garg+0.016
Neg logit contributions
PB-0.015
fters-0.015
 cancellation-0.015
Lead-0.015
Writer-0.014
Tokenitation
Token ID3780
Feature activation-2.86e-01

Pos logit contributions
orf+0.112
 COURT+0.109
icho+0.106
 Garg+0.105
emouth+0.105
Neg logit contributions
PB-0.042
fters-0.041
 cancellation-0.039
Lead-0.038
Writer-0.036
Token needed
Token ID2622
Feature activation-3.90e-02

Pos logit contributions
PB+0.164
fters+0.160
 cancellation+0.157
Lead+0.154
Writer+0.149
Neg logit contributions
orf-0.210
 COURT-0.202
icho-0.196
emouth-0.196
 Garg-0.193
Token]
Token ID60
Feature activation-3.56e-02

Pos logit contributions
PB+0.026
fters+0.025
 cancellation+0.025
Lead+0.024
Writer+0.024
Neg logit contributions
orf-0.025
 COURT-0.024
icho-0.023
 Garg-0.023
emouth-0.023
Token
Token ID198
Feature activation+1.67e-02

Pos logit contributions
PB+0.024
fters+0.024
 cancellation+0.024
Lead+0.023
Writer+0.023
Neg logit contributions
orf-0.022
 COURT-0.021
icho-0.020
emouth-0.020
 Garg-0.020
Token
Token ID198
Feature activation+3.89e-03

Pos logit contributions
orf+0.010
 COURT+0.010
icho+0.010
emouth+0.010
 Garg+0.010
Neg logit contributions
PB-0.011
fters-0.011
 cancellation-0.011
Lead-0.011
Writer-0.011
TokenAs
Token ID1722
Feature activation-7.65e-03

Pos logit contributions
orf+0.003
 COURT+0.003
icho+0.003
emouth+0.003
 Garg+0.003
Neg logit contributions
PB-0.002
fters-0.002
 cancellation-0.002
Lead-0.002
Writer-0.002
Token-
Token ID12
Feature activation-1.23e-02

Pos logit contributions
orf+0.016
 COURT+0.016
icho+0.015
emouth+0.015
 Garg+0.015
Neg logit contributions
PB-0.016
fters-0.016
 cancellation-0.015
Lead-0.015
Writer-0.015
Tokenflight
Token ID22560
Feature activation+3.34e-02

Pos logit contributions
PB+0.009
fters+0.009
 cancellation+0.009
Lead+0.009
Writer+0.008
Neg logit contributions
orf-0.007
 COURT-0.007
icho-0.006
 Garg-0.006
emouth-0.006
Token operation
Token ID4905
Feature activation-6.86e-02

Pos logit contributions
orf+0.024
 COURT+0.023
icho+0.023
 Garg+0.022
emouth+0.022
Neg logit contributions
PB-0.019
fters-0.019
 cancellation-0.019
Lead-0.018
Writer-0.018
Token,
Token ID11
Feature activation+3.30e-03

Pos logit contributions
PB+0.044
fters+0.043
 cancellation+0.042
Lead+0.042
Writer+0.041
Neg logit contributions
orf-0.045
 COURT-0.044
icho-0.042
 Garg-0.041
emouth-0.041
Token tele
Token ID5735
Feature activation+6.76e-02

Pos logit contributions
orf+0.002
 COURT+0.002
icho+0.002
emouth+0.002
 Garg+0.002
Neg logit contributions
PB-0.002
fters-0.002
 cancellation-0.002
Lead-0.002
Writer-0.002
Tokenmetry
Token ID41935
Feature activation-3.13e-01

Pos logit contributions
orf+0.073
 COURT+0.071
icho+0.069
 Garg+0.069
emouth+0.069
Neg logit contributions
PB-0.016
fters-0.015
 cancellation-0.014
Lead-0.013
Writer-0.012
Token system
Token ID1080
Feature activation+6.78e-02

Pos logit contributions
PB+0.190
fters+0.187
 cancellation+0.185
Lead+0.180
Writer+0.174
Neg logit contributions
orf-0.219
 COURT-0.210
icho-0.203
emouth-0.200
 Garg-0.200
Token modifications
Token ID19008
Feature activation-5.08e-02

Pos logit contributions
orf+0.050
 COURT+0.048
icho+0.046
emouth+0.046
 Garg+0.046
Neg logit contributions
PB-0.039
fters-0.038
 cancellation-0.038
Lead-0.037
Writer-0.036
Token to
Token ID284
Feature activation+1.35e-02

Pos logit contributions
PB+0.033
fters+0.033
 cancellation+0.032
Lead+0.032
Writer+0.031
Neg logit contributions
orf-0.033
 COURT-0.032
icho-0.031
 Garg-0.030
emouth-0.030
Token increase
Token ID2620
Feature activation+1.07e-01

Pos logit contributions
orf+0.009
 COURT+0.009
icho+0.008
emouth+0.008
 Garg+0.008
Neg logit contributions
PB-0.009
fters-0.008
 cancellation-0.008
Lead-0.008
Writer-0.008
Token failure
Token ID5287
Feature activation-9.25e-02

Pos logit contributions
orf+0.073
 COURT+0.070
icho+0.068
emouth+0.067
 Garg+0.067
Neg logit contributions
PB-0.066
fters-0.065
 cancellation-0.065
Lead-0.063
Writer-0.061