TOP ACTIVATIONS
MAX = 3.413

went away . ]] But this wasn
suggested , " What about Duchess ? That 's a really
full of delicious red - and - white candy ! The
event was just a pre lude . Over the next few
the cars stopped and the pedestrian light turned green . Then
friends were playing hide - and - seek . Sammy was
special game of hide - and - seek , so she
decided to play hide - and - seek behind them .
said his dad , " and it 's unknown where they
friends were playing hide - and - seek . Tim my
, while playing hide - and - seek , Fl opsy
hill , played hide - and - seek in a me
backyard , played hide - and - seek and had a
a game of hide - and - seek . Bub ba
again while playing hide - and - seek with her friends
all started playing hide - and - seek and making jokes
had just played hide - and - seek with Tweet ie
decided to play hide - and - seek instead . They
and they played hide - and - seek . After a
. They played hide - and - seek until it was

MIN ACTIVATIONS
MIN = -4.273

A ww , okay .
at it said " ew ww !" One
He said , " E ww , this is the worst
and said , " A ww www , do I have
be bitter , like e ww . Do you want to
the ground . " E ww , what 's that ?"
She said , " E ww , disgusting !" She threw
friends said , " A ww , okay . But come
. " E ww !" she screamed .
and said , " E ww , Bruno . You are
!" " A ww !" Lily says . "
. " E ww , this fish is sm
. " A ww , it is so cute
He says , " A ww , this is from Anna
make it whole again . Amen ." Then ,
As the light ening f lic kered away , the man
. She used a m itt to take out the pan
They get a m itt and a spat ula .
put on an oven m itt and a a pron .
was seeing . P olly wanted to find out

INTERVAL 1.491 - 3.413
CONTAINS 0.596%

ordered one last apple for himself and then went on his
gift . Choose what you like ." She said .
to go home . The end . <|endoftext|> Once upon a
smell . Lily did not like it and ran away .
" Okay Max , I promise ". From that

INTERVAL -0.430 - 1.491
CONTAINS 84.631%

again , ok ?" Tom nodded and hugged her
The doctor smiled at her and said , " Because it
their mom and dad hug them and kiss them .
." Tom smiled and said , " You are
kind ." L ily smiled and said ,

INTERVAL -2.351 - -0.430
CONTAINS 14.725%

so big , it 's too hard to count !".
Tim my loved to build things with his blocks . He
, but my boat is too small and it 's not
. It is dark and black inside . "
. He had eaten some of her leaves when she was

INTERVAL -4.273 - -2.351
CONTAINS 0.048%

thought it was the prett iest coin that he had ever
. He was so eng ross ed that he didn 't
and said , " Y ay ! Can we get ice
his mind . He w agged his tail and bark ed
yellow flowers . " Spr ay these everyday , just like
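
The four INTERVAL ranges listed above are consistent with splitting the span from MIN (-4.273) to MAX (3.413) into four equal-width bins, with CONTAINS giving the share of activations in each bin (the four percentages sum to 100%). The following is a minimal sketch under that assumption; the function names `interval_edges` and `percent_per_bin` are hypothetical and are not taken from the dashboard that produced this dump.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equal-width bin edges from lo to hi (assumed binning rule)."""
    width = (hi - lo) / n_bins
    # Use hi itself as the final edge so the maximum value always lands in the last bin.
    return [lo + i * width for i in range(n_bins)] + [hi]


def percent_per_bin(activations, edges):
    """Percentage of activations in each bin; the last bin includes its upper edge."""
    counts = [0] * (len(edges) - 1)
    for a in activations:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= a < edges[i + 1] or (is_last and a == edges[-1]):
                counts[i] += 1
                break
    total = len(activations)
    if total == 0:
        return [0.0] * len(counts)
    return [100.0 * c / total for c in counts]


# MIN and MAX as reported in this dump.
edges = interval_edges(-4.273, 3.413)
for low, high in zip(edges[:-1], edges[1:]):
    print(f"INTERVAL {low:.3f} - {high:.3f}")
```

The sketch prints the bins from lowest to highest, whereas the listing above shows the highest interval first. The edges it computes agree with the listed boundaries (-2.351, -0.430, 1.491) to within rounding in the third decimal place.
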
Token went
Token ID1816
Feature activation-3.98e-01

Pos logit contributions
itting+0.005
olly+0.004
ubby+0.004
reetings+0.004
lic+0.004
Neg logit contributions
 bank-0.004
 banks-0.003
 find-0.003
 choke-0.003
 wounds-0.003
Token away
Token ID1497
Feature activation-1.39e-01

Pos logit contributions
itting+0.051
olly+0.047
 importantly+0.045
mber+0.043
ubby+0.042
Neg logit contributions
 bank-0.027
 banks-0.020
 find-0.016
 wounds-0.016
bank-0.015
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
itting+0.015
olly+0.014
apping+0.013
 importantly+0.013
mber+0.013
Neg logit contributions
 bank-0.011
 banks-0.009
 choke-0.008
 wounds-0.008
 necks-0.007
Token
Token ID198
Feature activation-1.30e-02

Pos logit contributions
 bank+0.009
 banks+0.007
bank+0.006
 find+0.006
 choke+0.006
Neg logit contributions
itting-0.012
olly-0.012
ubby-0.011
ister-0.010
lic-0.010
Token
Token ID198
Feature activation-1.32e-02

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
ubby+0.001
mber+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 wounds-0.001
 choke-0.001
bank-0.001
Token]]
Token ID11907
Feature activation+3.33e+00

Pos logit contributions
itting+0.001
olly+0.001
ubby+0.001
ancer+0.001
 importantly+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 wounds-0.001
 find-0.001
bank-0.001
Token
Token ID198
Feature activation-1.44e-02

Pos logit contributions
 bank+0.242
 banks+0.191
 choke+0.161
bank+0.159
 find+0.156
Neg logit contributions
itting-0.371
olly-0.357
 importantly-0.327
mber-0.317
lic-0.316
Token
Token ID198
Feature activation-7.39e-03

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
mber+0.001
lic+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 choke-0.001
 wounds-0.001
bank-0.001
TokenBut
Token ID1537
Feature activation+8.26e-01

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
mber+0.001
lic+0.001
Neg logit contributions
 bank-0.001
 banks-0.000
 wounds-0.000
 choke-0.000
bank-0.000
Token this
Token ID428
Feature activation+3.30e-01

Pos logit contributions
 bank+0.070
 banks+0.057
 choke+0.049
 find+0.048
 wounds+0.048
Neg logit contributions
itting-0.086
olly-0.082
lic-0.076
mber-0.071
omed-0.071
Token wasn
Token ID2492
Feature activation+1.32e-01

Pos logit contributions
 bank+0.026
 banks+0.020
 choke+0.017
 way+0.017
bank+0.016
Neg logit contributions
itting-0.037
olly-0.034
lic-0.032
year-0.032
apping-0.031
Token suggested
Token ID5220
Feature activation-7.63e-01

Pos logit contributions
 bank+0.071
 banks+0.053
 neck+0.049
 find+0.048
 choke+0.046
Neg logit contributions
itting-0.097
olly-0.091
arl-0.083
ubby-0.082
ay-0.079
Token,
Token ID11
Feature activation-2.51e-01

Pos logit contributions
itting+0.079
olly+0.075
ubby+0.066
 importantly+0.065
mber+0.064
Neg logit contributions
 bank-0.068
 banks-0.055
 choke-0.048
 wounds-0.047
bank-0.047
Token "
Token ID366
Feature activation+3.30e-01

Pos logit contributions
itting+0.026
olly+0.025
 importantly+0.021
ubby+0.021
ane+0.021
Neg logit contributions
 bank-0.023
 banks-0.019
 choke-0.016
 wounds-0.016
bank-0.016
TokenWhat
Token ID2061
Feature activation+4.27e-02

Pos logit contributions
 bank+0.031
 banks+0.025
 choke+0.022
 wounds+0.022
bank+0.022
Neg logit contributions
itting-0.031
olly-0.031
 importantly-0.026
ubby-0.026
apping-0.026
Token about
Token ID546
Feature activation-9.69e-01

Pos logit contributions
 bank+0.003
 find+0.002
 way+0.002
 banks+0.002
 choke+0.002
Neg logit contributions
itting-0.005
olly-0.005
arl-0.004
lic-0.004
omed-0.004
Token Duchess
Token ID40289
Feature activation+3.28e+00

Pos logit contributions
itting+0.106
olly+0.101
lic+0.090
year+0.088
apping+0.087
Neg logit contributions
 bank-0.083
 banks-0.067
 find-0.058
 wounds-0.057
 neck-0.057
Token?
Token ID30
Feature activation-1.36e+00

Pos logit contributions
 bank+0.336
 banks+0.281
 find+0.254
 neck+0.250
 choke+0.248
Neg logit contributions
itting-0.290
olly-0.270
 importantly-0.242
year-0.236
lic-0.232
Token That
Token ID1320
Feature activation-4.78e-01

Pos logit contributions
itting+0.141
olly+0.132
ying+0.116
ross+0.115
ubby+0.115
Neg logit contributions
 bank-0.108
 banks-0.087
 choke-0.081
 find-0.081
 himself-0.077
Token's
Token ID338
Feature activation+3.22e-01

Pos logit contributions
itting+0.062
olly+0.060
year+0.054
 importantly+0.054
lic+0.054
Neg logit contributions
 bank-0.031
 banks-0.023
 choke-0.017
bank-0.017
 wounds-0.017
Token a
Token ID257
Feature activation+5.30e-02

Pos logit contributions
 bank+0.026
 banks+0.021
 choke+0.018
 wounds+0.018
bank+0.017
Neg logit contributions
itting-0.035
olly-0.035
apping-0.030
lic-0.030
mber-0.030
Token really
Token ID1107
Feature activation+2.60e-01

Pos logit contributions
 bank+0.005
 banks+0.004
 wounds+0.004
 choke+0.004
 necks+0.003
Neg logit contributions
itting-0.005
olly-0.005
mber-0.004
 importantly-0.004
lic-0.004
Token full
Token ID1336
Feature activation+1.41e-01

Pos logit contributions
 bank+0.022
 banks+0.018
 wounds+0.015
 choke+0.015
 necks+0.014
Neg logit contributions
itting-0.029
olly-0.028
 importantly-0.025
lic-0.025
apping-0.024
Token of
Token ID286
Feature activation-1.27e+00

Pos logit contributions
 bank+0.019
 banks+0.017
 choke+0.015
 wounds+0.015
bank+0.015
Neg logit contributions
itting-0.008
olly-0.007
 importantly-0.006
mber-0.005
lic-0.005
Token delicious
Token ID12625
Feature activation+2.55e-01

Pos logit contributions
itting+0.130
olly+0.126
 importantly+0.114
year+0.110
mber+0.110
Neg logit contributions
 bank-0.107
 banks-0.089
 choke-0.076
 wounds-0.074
bank-0.072
Token red
Token ID2266
Feature activation-3.94e-01

Pos logit contributions
 bank+0.020
 banks+0.017
 choke+0.014
 wounds+0.014
bank+0.013
Neg logit contributions
itting-0.026
olly-0.026
 importantly-0.024
mber-0.023
lic-0.023
Token-
Token ID12
Feature activation-1.05e+00

Pos logit contributions
itting+0.042
olly+0.041
 importantly+0.036
ubby+0.035
lic+0.035
Neg logit contributions
 bank-0.030
 banks-0.025
 choke-0.022
 wounds-0.022
bank-0.020
Tokenand
Token ID392
Feature activation+3.18e+00

Pos logit contributions
itting+0.079
olly+0.074
 importantly+0.063
mber+0.061
apping+0.061
Neg logit contributions
 bank-0.120
 banks-0.103
 choke-0.092
 wounds-0.091
bank-0.091
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 bank+0.330
 banks+0.277
bank+0.247
 wounds+0.243
 choke+0.240
Neg logit contributions
itting-0.275
olly-0.254
 importantly-0.221
ubby-0.219
omed-0.217
Tokenwhite
Token ID11186
Feature activation-3.57e-02

Pos logit contributions
itting+0.088
olly+0.083
 importantly+0.073
apping+0.070
mber+0.070
Neg logit contributions
 bank-0.104
 banks-0.087
bank-0.077
 choke-0.076
 wounds-0.075
Token candy
Token ID18550
Feature activation+1.06e+00

Pos logit contributions
itting+0.004
olly+0.004
 importantly+0.004
mber+0.003
lic+0.003
Neg logit contributions
 bank-0.003
 banks-0.002
 choke-0.002
 wounds-0.002
bank-0.002
Token!
Token ID0
Feature activation-1.44e-01

Pos logit contributions
 bank+0.078
 banks+0.062
 choke+0.051
 wounds+0.050
bank+0.048
Neg logit contributions
itting-0.124
olly-0.118
 importantly-0.106
mber-0.105
lic-0.105
Token The
Token ID383
Feature activation+5.38e-01

Pos logit contributions
itting+0.016
olly+0.015
lic+0.013
 importantly+0.013
ubby+0.013
Neg logit contributions
 bank-0.012
 banks-0.010
bank-0.008
 choke-0.008
 find-0.007
Token event
Token ID1785
Feature activation+2.39e-02

Pos logit contributions
 bank+0.023
 banks+0.018
 way+0.015
 choke+0.014
 find+0.014
Neg logit contributions
itting-0.036
olly-0.035
apping-0.031
lic-0.031
year-0.031
Token was
Token ID373
Feature activation-5.34e-02

Pos logit contributions
 bank+0.001
 banks+0.001
 way+0.001
bank+0.001
 find+0.001
Neg logit contributions
itting-0.003
olly-0.003
ubby-0.003
arl-0.003
year-0.003
Token just
Token ID655
Feature activation-5.40e-02

Pos logit contributions
itting+0.006
olly+0.006
apping+0.005
mber+0.005
lic+0.005
Neg logit contributions
 bank-0.004
 banks-0.003
 choke-0.003
 wounds-0.003
 find-0.002
Token a
Token ID257
Feature activation+1.49e-01

Pos logit contributions
itting+0.006
olly+0.006
mber+0.006
lic+0.006
apping+0.006
Neg logit contributions
 bank-0.004
 banks-0.003
 choke-0.002
 wounds-0.002
 find-0.002
Token pre
Token ID662
Feature activation+3.63e-01

Pos logit contributions
 bank+0.013
 banks+0.010
 choke+0.009
 wounds+0.008
bank+0.008
Neg logit contributions
itting-0.016
olly-0.015
apping-0.013
 importantly-0.013
year-0.013
Tokenlude
Token ID38792
Feature activation+3.16e+00

Pos logit contributions
 bank+0.035
 banks+0.028
bank+0.026
 choke+0.024
 floor+0.023
Neg logit contributions
itting-0.033
olly-0.028
ancers-0.028
 importantly-0.028
uttered-0.027
Token.
Token ID13
Feature activation+7.89e-02

Pos logit contributions
 bank+0.216
 banks+0.166
bank+0.127
 choke+0.124
 find+0.122
Neg logit contributions
itting-0.388
olly-0.376
 importantly-0.336
year-0.332
lic-0.332
Token Over
Token ID3827
Feature activation+3.04e-01

Pos logit contributions
 bank+0.006
 banks+0.005
 choke+0.004
bank+0.004
 find+0.004
Neg logit contributions
itting-0.009
olly-0.008
lic-0.008
ister-0.008
ubby-0.008
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
 bank+0.027
 banks+0.022
 choke+0.019
 wounds+0.019
bank+0.019
Neg logit contributions
itting-0.030
olly-0.029
lic-0.026
mber-0.025
 importantly-0.025
Token next
Token ID1306
Feature activation+9.17e-01

Pos logit contributions
itting+0.025
olly+0.023
lic+0.021
apping+0.021
 importantly+0.021
Neg logit contributions
 bank-0.021
 banks-0.017
 choke-0.015
 wounds-0.015
 find-0.014
Token few
Token ID1178
Feature activation-8.75e-01

Pos logit contributions
 bank+0.075
 banks+0.059
 way+0.054
 find+0.051
bank+0.050
Neg logit contributions
itting-0.096
olly-0.088
otion-0.082
omed-0.082
year-0.081
Token the
Token ID262
Feature activation-1.14e-01

Pos logit contributions
 bank+0.043
 banks+0.033
 find+0.029
 choke+0.028
 wounds+0.026
Neg logit contributions
itting-0.066
olly-0.063
lic-0.060
 importantly-0.058
ubby-0.057
Token cars
Token ID5006
Feature activation+8.35e-01

Pos logit contributions
itting+0.013
olly+0.012
 importantly+0.011
lic+0.011
mber+0.011
Neg logit contributions
 bank-0.010
 banks-0.007
 wounds-0.006
 way-0.006
 find-0.006
Token stopped
Token ID5025
Feature activation-7.51e-01

Pos logit contributions
 bank+0.074
 banks+0.059
 find+0.047
 choke+0.047
bank+0.046
Neg logit contributions
itting-0.089
olly-0.089
 importantly-0.081
mber-0.080
ubby-0.077
Token and
Token ID290
Feature activation-2.30e-02

Pos logit contributions
itting+0.082
olly+0.077
 importantly+0.069
apping+0.069
lic+0.067
Neg logit contributions
 bank-0.061
 banks-0.050
 choke-0.042
 wounds-0.041
bank-0.040
Token the
Token ID262
Feature activation-9.43e-02

Pos logit contributions
itting+0.002
olly+0.002
 importantly+0.002
ubby+0.002
apping+0.002
Neg logit contributions
 bank-0.002
 banks-0.002
 choke-0.001
 wounds-0.001
bank-0.001
Token pedestrian
Token ID22382
Feature activation+3.12e+00

Pos logit contributions
itting+0.010
olly+0.010
 importantly+0.009
mber+0.009
lic+0.009
Neg logit contributions
 bank-0.008
 banks-0.006
 choke-0.005
 wounds-0.005
bank-0.005
Token light
Token ID1657
Feature activation-4.40e-01

Pos logit contributions
 bank+0.263
 banks+0.217
bank+0.177
 piles+0.173
 wounds+0.171
Neg logit contributions
itting-0.327
olly-0.319
mber-0.282
ister-0.278
 importantly-0.278
Token turned
Token ID2900
Feature activation+2.76e-01

Pos logit contributions
itting+0.050
olly+0.050
mber+0.044
year+0.044
ubby+0.043
Neg logit contributions
 bank-0.033
 banks-0.026
bank-0.022
 wounds-0.021
 choke-0.020
Token green
Token ID4077
Feature activation-4.62e-01

Pos logit contributions
 bank+0.022
 banks+0.018
 piles+0.015
 wounds+0.015
bank+0.015
Neg logit contributions
olly-0.030
itting-0.029
mber-0.027
 importantly-0.025
www-0.025
Token.
Token ID13
Feature activation+7.70e-02

Pos logit contributions
itting+0.053
olly+0.051
mber+0.046
 importantly+0.045
lic+0.045
Neg logit contributions
 bank-0.035
 banks-0.028
 choke-0.023
 wounds-0.023
 necks-0.022
Token Then
Token ID3244
Feature activation+1.75e-01

Pos logit contributions
 bank+0.006
 banks+0.005
bank+0.004
 choke+0.004
 piles+0.004
Neg logit contributions
itting-0.009
olly-0.008
ister-0.008
ubby-0.008
ane-0.007
Token friends
Token ID2460
Feature activation-1.12e+00

Pos logit contributions
itting+0.012
olly+0.011
 importantly+0.010
mber+0.010
lic+0.010
Neg logit contributions
 bank-0.012
 banks-0.010
 choke-0.009
 wounds-0.009
bank-0.008
Token were
Token ID547
Feature activation+2.69e-01

Pos logit contributions
itting+0.127
olly+0.122
 importantly+0.112
mber+0.109
lic+0.107
Neg logit contributions
 bank-0.092
 banks-0.073
 find-0.061
 choke-0.058
bank-0.056
Token playing
Token ID2712
Feature activation+2.54e-01

Pos logit contributions
 bank+0.022
 banks+0.018
 choke+0.015
 wounds+0.015
bank+0.014
Neg logit contributions
itting-0.029
olly-0.028
mber-0.025
 importantly-0.025
lic-0.025
Token hide
Token ID7808
Feature activation+9.58e-01

Pos logit contributions
 bank+0.025
 banks+0.021
 choke+0.019
 wounds+0.019
bank+0.018
Neg logit contributions
itting-0.023
olly-0.021
apping-0.019
 importantly-0.018
lic-0.018
Token-
Token ID12
Feature activation-1.01e+00

Pos logit contributions
 bank+0.083
 banks+0.067
 choke+0.064
bank+0.060
 wounds+0.057
Neg logit contributions
itting-0.093
olly-0.092
ubby-0.079
year-0.079
apping-0.079
Tokenand
Token ID392
Feature activation+3.03e+00

Pos logit contributions
itting+0.108
olly+0.105
year+0.093
lic+0.092
apping+0.091
Neg logit contributions
 bank-0.082
 banks-0.067
 choke-0.060
 wounds-0.060
 neck-0.055
Token-
Token ID12
Feature activation-9.96e-01

Pos logit contributions
 bank+0.347
 banks+0.301
 choke+0.271
 wounds+0.268
bank+0.261
Neg logit contributions
itting-0.230
olly-0.217
 importantly-0.180
mber-0.178
lic-0.177
Tokenseek
Token ID36163
Feature activation+9.59e-01

Pos logit contributions
itting+0.104
olly+0.100
lic+0.089
year+0.089
apping+0.088
Neg logit contributions
 bank-0.080
 banks-0.065
 wounds-0.061
 choke-0.060
 neck-0.056
Token.
Token ID13
Feature activation+1.14e-01

Pos logit contributions
 bank+0.081
 banks+0.066
 choke+0.058
 wounds+0.057
bank+0.054
Neg logit contributions
itting-0.101
olly-0.096
apping-0.087
 importantly-0.086
year-0.083
Token Sammy
Token ID44067
Feature activation-6.10e-01

Pos logit contributions
 bank+0.010
 banks+0.009
 choke+0.008
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.011
olly-0.011
 importantly-0.009
mber-0.009
lic-0.009
Token was
Token ID373
Feature activation-5.91e-02

Pos logit contributions
itting+0.062
olly+0.060
mber+0.052
 importantly+0.052
lic+0.052
Neg logit contributions
 bank-0.052
 banks-0.043
 wounds-0.038
 choke-0.038
bank-0.035
Token special
Token ID2041
Feature activation-2.82e-01

Pos logit contributions
itting+0.021
olly+0.020
 importantly+0.018
mber+0.018
lic+0.017
Neg logit contributions
 bank-0.015
 banks-0.012
 choke-0.010
 wounds-0.010
bank-0.010
Token game
Token ID983
Feature activation-1.89e-01

Pos logit contributions
itting+0.030
olly+0.029
mber+0.025
 importantly+0.025
lic+0.025
Neg logit contributions
 bank-0.023
 banks-0.019
 choke-0.017
 wounds-0.016
 necks-0.015
Token of
Token ID286
Feature activation-1.27e+00

Pos logit contributions
itting+0.023
olly+0.022
mber+0.020
 importantly+0.020
lic+0.020
Neg logit contributions
 bank-0.013
 banks-0.010
 choke-0.008
 wounds-0.007
bank-0.007
Token hide
Token ID7808
Feature activation+1.05e+00

Pos logit contributions
itting+0.137
olly+0.132
 importantly+0.117
mber+0.115
lic+0.115
Neg logit contributions
 bank-0.103
 banks-0.084
 choke-0.071
 wounds-0.070
bank-0.068
Token-
Token ID12
Feature activation-1.08e+00

Pos logit contributions
 bank+0.081
 banks+0.065
 choke+0.054
 wounds+0.053
bank+0.051
Neg logit contributions
itting-0.118
olly-0.114
 importantly-0.101
mber-0.100
lic-0.100
Tokenand
Token ID392
Feature activation+3.01e+00

Pos logit contributions
itting+0.121
olly+0.116
 importantly+0.104
mber+0.102
apping+0.102
Neg logit contributions
 bank-0.085
 banks-0.068
 choke-0.057
 wounds-0.056
bank-0.054
Token-
Token ID12
Feature activation-1.06e+00

Pos logit contributions
 bank+0.319
 banks+0.274
 wounds+0.243
 choke+0.236
bank+0.236
Neg logit contributions
itting-0.257
olly-0.236
 importantly-0.209
apping-0.199
ubby-0.199
Tokenseek
Token ID36163
Feature activation+1.00e+00

Pos logit contributions
itting+0.131
olly+0.127
 importantly+0.117
ubby+0.114
mber+0.113
Neg logit contributions
 bank-0.072
 banks-0.055
bank-0.043
 choke-0.042
 wounds-0.042
Token,
Token ID11
Feature activation-2.44e-01

Pos logit contributions
 bank+0.074
 banks+0.059
 choke+0.050
 wounds+0.048
bank+0.046
Neg logit contributions
itting-0.115
olly-0.111
 importantly-0.100
apping-0.099
mber-0.098
Token so
Token ID523
Feature activation+6.39e-01

Pos logit contributions
itting+0.028
olly+0.027
 importantly+0.025
apping+0.024
mber+0.024
Neg logit contributions
 bank-0.017
 banks-0.014
 choke-0.011
 wounds-0.011
bank-0.011
Token she
Token ID673
Feature activation+3.27e-01

Pos logit contributions
 bank+0.070
 banks+0.059
 find+0.052
 choke+0.052
 neck+0.052
Neg logit contributions
itting-0.054
olly-0.050
ubby-0.043
lic-0.043
apping-0.043
Token decided
Token ID3066
Feature activation+1.48e-01

Pos logit contributions
 bank+0.003
 banks+0.002
 choke+0.002
 wounds+0.002
bank+0.002
Neg logit contributions
itting-0.003
olly-0.003
 importantly-0.003
lic-0.003
mber-0.003
Token to
Token ID284
Feature activation+3.87e-01

Pos logit contributions
 bank+0.010
 banks+0.008
 choke+0.007
 wounds+0.007
bank+0.006
Neg logit contributions
itting-0.018
olly-0.017
 importantly-0.015
mber-0.015
apping-0.015
Token play
Token ID711
Feature activation-1.89e-01

Pos logit contributions
 bank+0.027
 banks+0.021
 choke+0.016
 find+0.016
 wounds+0.015
Neg logit contributions
itting-0.048
olly-0.046
 importantly-0.042
year-0.042
lic-0.041
Token hide
Token ID7808
Feature activation+9.48e-01

Pos logit contributions
itting+0.020
olly+0.019
apping+0.017
 importantly+0.017
lic+0.017
Neg logit contributions
 bank-0.016
 banks-0.013
 choke-0.011
 wounds-0.011
bank-0.010
Token-
Token ID12
Feature activation-1.03e+00

Pos logit contributions
 bank+0.079
 banks+0.064
 choke+0.060
bank+0.055
 wounds+0.053
Neg logit contributions
itting-0.096
olly-0.095
year-0.084
apping-0.082
ubby-0.081
Tokenand
Token ID392
Feature activation+2.99e+00

Pos logit contributions
itting+0.109
olly+0.107
year+0.095
lic+0.095
mber+0.092
Neg logit contributions
 bank-0.083
 banks-0.069
 choke-0.063
 wounds-0.061
 necks-0.055
Token-
Token ID12
Feature activation-1.02e+00

Pos logit contributions
 bank+0.341
 banks+0.296
 choke+0.266
 wounds+0.264
bank+0.256
Neg logit contributions
itting-0.228
olly-0.215
 importantly-0.179
mber-0.176
lic-0.176
Tokenseek
Token ID36163
Feature activation+9.87e-01

Pos logit contributions
itting+0.110
olly+0.106
lic+0.095
year+0.094
mber+0.092
Neg logit contributions
 bank-0.080
 banks-0.065
 choke-0.059
 wounds-0.059
 neck-0.053
Token behind
Token ID2157
Feature activation+6.05e-02

Pos logit contributions
 bank+0.076
 banks+0.061
 choke+0.052
 wounds+0.050
bank+0.048
Neg logit contributions
itting-0.111
olly-0.107
 importantly-0.095
apping-0.095
year-0.094
Token them
Token ID606
Feature activation+7.75e-01

Pos logit contributions
 bank+0.005
 banks+0.004
 choke+0.003
 wounds+0.003
bank+0.003
Neg logit contributions
itting-0.006
olly-0.006
 importantly-0.006
apping-0.005
lic-0.005
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
 bank+0.056
 banks+0.044
 choke+0.037
 wounds+0.037
bank+0.034
Neg logit contributions
itting-0.091
olly-0.088
 importantly-0.078
apping-0.077
lic-0.077
Token said
Token ID531
Feature activation-9.19e-02

Pos logit contributions
 bank+0.005
 banks+0.004
 choke+0.003
 wounds+0.003
bank+0.003
Neg logit contributions
itting-0.005
olly-0.005
apping-0.004
lic-0.004
 importantly-0.004
Token his
Token ID465
Feature activation-2.40e-02

Pos logit contributions
itting+0.011
olly+0.010
 importantly+0.009
reetings+0.009
lic+0.009
Neg logit contributions
 bank-0.007
 banks-0.005
 choke-0.004
 wounds-0.004
bank-0.004
Token dad
Token ID9955
Feature activation-1.96e-01

Pos logit contributions
itting+0.003
olly+0.002
 importantly+0.002
lic+0.002
mber+0.002
Neg logit contributions
 bank-0.002
 banks-0.002
 wounds-0.001
 choke-0.001
 necks-0.001
Token,
Token ID11
Feature activation-1.23e-01

Pos logit contributions
itting+0.024
olly+0.023
 importantly+0.020
mber+0.020
lic+0.020
Neg logit contributions
 bank-0.014
 banks-0.011
 choke-0.009
 wounds-0.009
bank-0.008
Token "
Token ID366
Feature activation+3.96e-01

Pos logit contributions
itting+0.013
olly+0.012
 importantly+0.011
apping+0.011
lic+0.011
Neg logit contributions
 bank-0.011
 banks-0.009
 choke-0.007
 wounds-0.007
bank-0.007
Tokenand
Token ID392
Feature activation+2.99e+00

Pos logit contributions
 bank+0.041
 banks+0.035
 choke+0.031
 wounds+0.031
bank+0.030
Neg logit contributions
itting-0.034
olly-0.032
 importantly-0.027
mber-0.027
lic-0.027
Token it
Token ID340
Feature activation-5.71e-01

Pos logit contributions
 bank+0.208
 banks+0.149
 find+0.149
 wounds+0.142
 himself+0.139
Neg logit contributions
itting-0.352
olly-0.349
 importantly-0.317
lic-0.310
apping-0.303
Token's
Token ID338
Feature activation+5.17e-01

Pos logit contributions
itting+0.067
olly+0.065
 importantly+0.059
mber+0.059
year+0.058
Neg logit contributions
 bank-0.043
 banks-0.034
 choke-0.027
bank-0.026
 find-0.026
Token unknown
Token ID6439
Feature activation+1.42e-01

Pos logit contributions
 bank+0.043
 banks+0.035
 choke+0.030
 wounds+0.029
bank+0.029
Neg logit contributions
itting-0.056
olly-0.055
mber-0.047
 importantly-0.047
lic-0.047
Token where
Token ID810
Feature activation-2.72e-01

Pos logit contributions
 bank+0.010
 banks+0.008
 wounds+0.006
 choke+0.006
bank+0.006
Neg logit contributions
itting-0.017
olly-0.017
 importantly-0.015
lic-0.015
mber-0.015
Token they
Token ID484
Feature activation+5.56e-01

Pos logit contributions
itting+0.029
olly+0.029
lic+0.025
mber+0.025
apping+0.024
Neg logit contributions
 bank-0.023
 banks-0.019
 find-0.016
 wounds-0.016
 choke-0.016
Token friends
Token ID2460
Feature activation-1.15e+00

Pos logit contributions
itting+0.006
olly+0.006
 importantly+0.005
mber+0.005
lic+0.005
Neg logit contributions
 bank-0.006
 banks-0.005
 choke-0.004
 wounds-0.004
 necks-0.004
Token were
Token ID547
Feature activation+2.69e-01

Pos logit contributions
itting+0.128
olly+0.125
 importantly+0.113
mber+0.112
lic+0.109
Neg logit contributions
 bank-0.094
 banks-0.074
 find-0.062
 choke-0.060
bank-0.057
Token playing
Token ID2712
Feature activation+2.69e-01

Pos logit contributions
 bank+0.023
 banks+0.018
 choke+0.016
 wounds+0.015
bank+0.015
Neg logit contributions
itting-0.029
olly-0.028
mber-0.024
 importantly-0.024
lic-0.024
Token hide
Token ID7808
Feature activation+9.57e-01

Pos logit contributions
 bank+0.026
 banks+0.022
 choke+0.020
 wounds+0.019
bank+0.019
Neg logit contributions
itting-0.025
olly-0.023
apping-0.020
 importantly-0.020
lic-0.020
Token-
Token ID12
Feature activation-9.81e-01

Pos logit contributions
 bank+0.091
 banks+0.075
 choke+0.072
bank+0.069
 wounds+0.064
Neg logit contributions
itting-0.084
olly-0.084
year-0.071
ubby-0.071
apping-0.070
Tokenand
Token ID392
Feature activation+2.99e+00

Pos logit contributions
itting+0.102
olly+0.099
year+0.089
lic+0.088
apping+0.086
Neg logit contributions
 bank-0.081
 banks-0.068
 choke-0.062
 wounds-0.061
 neck-0.056
Token-
Token ID12
Feature activation-9.77e-01

Pos logit contributions
 bank+0.353
 banks+0.307
 choke+0.280
 wounds+0.275
bank+0.267
Neg logit contributions
itting-0.214
olly-0.204
lic-0.164
 importantly-0.164
mber-0.164
Tokenseek
Token ID36163
Feature activation+8.97e-01

Pos logit contributions
itting+0.099
olly+0.095
lic+0.085
year+0.084
apping+0.083
Neg logit contributions
 bank-0.080
 banks-0.067
 wounds-0.064
 choke-0.062
 neck-0.059
Token.
Token ID13
Feature activation+2.50e-01

Pos logit contributions
 bank+0.078
 banks+0.064
 choke+0.057
 wounds+0.055
bank+0.054
Neg logit contributions
itting-0.093
olly-0.088
apping-0.080
 importantly-0.079
year-0.076
Token Tim
Token ID5045
Feature activation+4.80e-01

Pos logit contributions
 bank+0.024
 wounds+0.021
 banks+0.020
 choke+0.020
 necks+0.019
Neg logit contributions
itting-0.021
olly-0.019
lic-0.018
 importantly-0.017
omed-0.017
Tokenmy
Token ID1820
Feature activation-1.84e-01

Pos logit contributions
 bank+0.050
 banks+0.043
 wounds+0.040
 choke+0.039
bank+0.036
Neg logit contributions
itting-0.040
olly-0.039
mber-0.032
ubby-0.032
lic-0.032
Token,
Token ID11
Feature activation-2.62e-01

Pos logit contributions
itting+0.022
olly+0.020
 importantly+0.018
year+0.018
lic+0.018
Neg logit contributions
 bank-0.017
 banks-0.014
 choke-0.012
 wounds-0.011
bank-0.011
Token while
Token ID981
Feature activation+6.15e-01

Pos logit contributions
olly+0.028
itting+0.028
mber+0.025
 importantly+0.024
lic+0.024
Neg logit contributions
 bank-0.019
 banks-0.016
 wounds-0.015
 choke-0.013
 necks-0.013
Token playing
Token ID2712
Feature activation+2.15e-01

Pos logit contributions
 bank+0.053
 banks+0.043
 choke+0.038
 wounds+0.035
bank+0.035
Neg logit contributions
itting-0.065
olly-0.061
lic-0.055
year-0.054
mber-0.054
Token hide
Token ID7808
Feature activation+9.09e-01

Pos logit contributions
 bank+0.019
 banks+0.016
 choke+0.014
 wounds+0.013
bank+0.013
Neg logit contributions
itting-0.022
olly-0.021
 importantly-0.018
lic-0.018
mber-0.018
Token-
Token ID12
Feature activation-1.04e+00

Pos logit contributions
 bank+0.076
 banks+0.062
 wounds+0.052
 choke+0.051
bank+0.049
Neg logit contributions
itting-0.099
olly-0.093
 importantly-0.084
apping-0.082
lic-0.082
Tokenand
Token ID392
Feature activation+2.99e+00

Pos logit contributions
itting+0.116
olly+0.113
lic+0.099
mber+0.098
year+0.098
Neg logit contributions
 bank-0.081
 banks-0.065
 choke-0.057
 wounds-0.056
 necks-0.051
Token-
Token ID12
Feature activation-1.02e+00

Pos logit contributions
 bank+0.321
 banks+0.277
 wounds+0.243
 choke+0.243
bank+0.239
Neg logit contributions
itting-0.248
olly-0.229
 importantly-0.202
year-0.194
apping-0.194
Tokenseek
Token ID36163
Feature activation+9.05e-01

Pos logit contributions
itting+0.111
olly+0.108
lic+0.095
mber+0.094
apping+0.093
Neg logit contributions
 bank-0.080
 banks-0.065
 wounds-0.059
 choke-0.058
 necks-0.053
Token,
Token ID11
Feature activation-2.01e-01

Pos logit contributions
 bank+0.085
 banks+0.071
 choke+0.062
 wounds+0.062
bank+0.059
Neg logit contributions
itting-0.086
olly-0.083
 importantly-0.072
apping-0.071
mber-0.071
Token Fl
Token ID1610
Feature activation+1.94e+00

Pos logit contributions
itting+0.024
olly+0.024
mber+0.021
 importantly+0.021
lic+0.021
Neg logit contributions
 bank-0.013
 banks-0.010
 wounds-0.009
 choke-0.008
 necks-0.008
Tokenopsy
Token ID44522
Feature activation+1.96e-01

Pos logit contributions
 bank+0.214
 banks+0.188
 choke+0.169
 wounds+0.165
 necks+0.164
Neg logit contributions
itting-0.145
olly-0.136
year-0.123
lic-0.122
 importantly-0.121
Token hill
Token ID12788
Feature activation+2.56e-01

Pos logit contributions
 bank+0.006
 banks+0.005
 wounds+0.005
 choke+0.004
 necks+0.004
Neg logit contributions
olly-0.007
itting-0.007
mber-0.006
lic-0.006
unt-0.006
Token,
Token ID11
Feature activation-1.91e-01

Pos logit contributions
 bank+0.019
 banks+0.016
 choke+0.013
 wounds+0.013
 necks+0.012
Neg logit contributions
itting-0.029
olly-0.028
apping-0.025
 importantly-0.024
lic-0.024
Token played
Token ID2826
Feature activation+9.15e-02

Pos logit contributions
itting+0.021
olly+0.020
apping+0.018
year+0.017
 importantly+0.017
Neg logit contributions
 bank-0.015
 banks-0.012
 choke-0.010
 wounds-0.010
bank-0.010
Token hide
Token ID7808
Feature activation+8.72e-01

Pos logit contributions
 bank+0.007
 banks+0.006
 choke+0.005
 wounds+0.005
bank+0.005
Neg logit contributions
itting-0.010
olly-0.010
lic-0.008
apping-0.008
 importantly-0.008
Token-
Token ID12
Feature activation-1.09e+00

Pos logit contributions
 bank+0.069
 banks+0.055
 choke+0.049
 wounds+0.046
bank+0.045
Neg logit contributions
itting-0.095
olly-0.092
lic-0.080
apping-0.080
mber-0.080
Tokenand
Token ID392
Feature activation+2.97e+00

Pos logit contributions
itting+0.119
olly+0.116
lic+0.103
year+0.102
mber+0.101
Neg logit contributions
 bank-0.085
 banks-0.070
 choke-0.062
 wounds-0.061
 necks-0.055
Token-
Token ID12
Feature activation-1.06e+00

Pos logit contributions
 bank+0.329
 banks+0.285
 wounds+0.252
 choke+0.249
bank+0.246
Neg logit contributions
itting-0.237
olly-0.221
 importantly-0.191
apping-0.183
mber-0.183
Tokenseek
Token ID36163
Feature activation+9.54e-01

Pos logit contributions
itting+0.117
olly+0.113
lic+0.101
year+0.100
mber+0.099
Neg logit contributions
 bank-0.082
 banks-0.067
 choke-0.059
 wounds-0.059
 necks-0.053
Token in
Token ID287
Feature activation+6.36e-01

Pos logit contributions
 bank+0.078
 banks+0.063
 choke+0.058
 wounds+0.054
bank+0.051
Neg logit contributions
itting-0.101
olly-0.097
apping-0.088
 importantly-0.086
lic-0.084
Token a
Token ID257
Feature activation+9.83e-02

Pos logit contributions
 bank+0.059
 banks+0.050
 choke+0.046
 wounds+0.044
bank+0.042
Neg logit contributions
itting-0.054
olly-0.052
 importantly-0.049
year-0.046
ister-0.046
Token me
Token ID502
Feature activation-3.47e-01

Pos logit contributions
 bank+0.008
 banks+0.007
 choke+0.006
 wounds+0.006
bank+0.006
Neg logit contributions
olly-0.010
itting-0.010
mber-0.009
lic-0.009
 importantly-0.008
Token backyard
Token ID24296
Feature activation-4.50e-01

Pos logit contributions
itting+0.018
olly+0.018
mber+0.015
 importantly+0.015
lic+0.015
Neg logit contributions
 bank-0.014
 banks-0.011
 choke-0.010
 wounds-0.010
bank-0.010
Token,
Token ID11
Feature activation-2.30e-01

Pos logit contributions
itting+0.048
olly+0.045
apping+0.040
 importantly+0.040
year+0.039
Neg logit contributions
 bank-0.037
 banks-0.031
 choke-0.027
 wounds-0.027
 necks-0.025
Token played
Token ID2826
Feature activation+1.38e-01

Pos logit contributions
itting+0.025
olly+0.023
 importantly+0.021
apping+0.021
lic+0.021
Neg logit contributions
 bank-0.019
 banks-0.015
 choke-0.013
 wounds-0.013
bank-0.012
Token hide
Token ID7808
Feature activation+9.72e-01

Pos logit contributions
 bank+0.010
 banks+0.008
 choke+0.007
 wounds+0.007
bank+0.006
Neg logit contributions
itting-0.016
olly-0.015
 importantly-0.014
mber-0.013
lic-0.013
Token-
Token ID12
Feature activation-1.05e+00

Pos logit contributions
 bank+0.075
 banks+0.060
 choke+0.052
 wounds+0.049
bank+0.048
Neg logit contributions
itting-0.109
olly-0.105
 importantly-0.093
lic-0.092
apping-0.092
Tokenand
Token ID392
Feature activation+2.97e+00

Pos logit contributions
itting+0.118
olly+0.114
 importantly+0.102
mber+0.100
apping+0.100
Neg logit contributions
 bank-0.081
 banks-0.065
 choke-0.054
 wounds-0.053
bank-0.052
Token-
Token ID12
Feature activation-1.04e+00

Pos logit contributions
 bank+0.317
 banks+0.273
 wounds+0.242
bank+0.235
 choke+0.234
Neg logit contributions
itting-0.248
olly-0.233
 importantly-0.202
mber-0.194
ubby-0.194
Tokenseek
Token ID36163
Feature activation+9.42e-01

Pos logit contributions
itting+0.118
olly+0.113
 importantly+0.102
mber+0.100
apping+0.099
Neg logit contributions
 bank-0.080
 banks-0.064
 choke-0.053
 wounds-0.052
bank-0.051
Token and
Token ID290
Feature activation+3.06e-02

Pos logit contributions
 bank+0.078
 banks+0.064
 choke+0.058
 wounds+0.055
bank+0.052
Neg logit contributions
itting-0.099
olly-0.094
apping-0.085
 importantly-0.084
lic-0.082
Token had
Token ID550
Feature activation+1.37e-01

Pos logit contributions
 bank+0.003
 banks+0.002
 choke+0.002
 wounds+0.002
 necks+0.002
Neg logit contributions
itting-0.003
olly-0.003
 importantly-0.003
mber-0.003
lic-0.003
Token a
Token ID257
Feature activation+1.06e-01

Pos logit contributions
 bank+0.012
 banks+0.010
 choke+0.009
 wounds+0.009
bank+0.008
Neg logit contributions
itting-0.013
olly-0.012
lic-0.011
 importantly-0.011
mber-0.011
Token a
Token ID257
Feature activation+1.04e-01

Pos logit contributions
 bank+0.027
 banks+0.023
 choke+0.020
 wounds+0.020
bank+0.019
Neg logit contributions
itting-0.026
olly-0.024
apping-0.022
lic-0.021
 importantly-0.021
Token game
Token ID983
Feature activation-9.91e-02

Pos logit contributions
 bank+0.009
 banks+0.008
 wounds+0.008
 necks+0.007
 choke+0.007
Neg logit contributions
olly-0.010
itting-0.009
lic-0.008
mber-0.008
ww-0.008
Token of
Token ID286
Feature activation-1.09e+00

Pos logit contributions
itting+0.008
olly+0.008
mber+0.006
 importantly+0.006
lic+0.006
Neg logit contributions
 bank-0.011
 banks-0.009
 choke-0.008
 wounds-0.008
bank-0.008
Token hide
Token ID7808
Feature activation+9.37e-01

Pos logit contributions
itting+0.120
olly+0.115
 importantly+0.102
lic+0.102
apping+0.101
Neg logit contributions
 bank-0.085
 banks-0.069
 choke-0.059
 wounds-0.057
bank-0.055
Token-
Token ID12
Feature activation-9.75e-01

Pos logit contributions
 bank+0.086
 banks+0.071
 choke+0.065
 wounds+0.061
bank+0.061
Neg logit contributions
itting-0.089
olly-0.087
apping-0.074
year-0.074
ubby-0.073
Tokenand
Token ID392
Feature activation+2.96e+00

Pos logit contributions
itting+0.106
olly+0.104
lic+0.092
year+0.091
apping+0.090
Neg logit contributions
 bank-0.077
 banks-0.063
 choke-0.055
 wounds-0.055
 necks-0.050
Token-
Token ID12
Feature activation-9.64e-01

Pos logit contributions
 bank+0.341
 banks+0.296
 choke+0.267
 wounds+0.264
bank+0.257
Neg logit contributions
itting-0.222
olly-0.210
 importantly-0.173
mber-0.171
lic-0.171
Tokenseek
Token ID36163
Feature activation+8.64e-01

Pos logit contributions
itting+0.106
olly+0.103
lic+0.091
year+0.090
apping+0.090
Neg logit contributions
 bank-0.074
 banks-0.060
 wounds-0.053
 choke-0.053
 necks-0.048
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
 bank+0.073
 banks+0.059
 choke+0.053
 wounds+0.051
bank+0.049
Neg logit contributions
itting-0.090
olly-0.086
apping-0.079
 importantly-0.077
lic-0.073
Token Bub
Token ID19645
Feature activation+1.50e+00

Pos logit contributions
 bank+0.013
 wounds+0.011
 choke+0.011
 banks+0.011
 mistake+0.010
Neg logit contributions
itting-0.014
olly-0.013
lic-0.012
 importantly-0.011
apping-0.011
Tokenba
Token ID7012
Feature activation+4.97e-01

Pos logit contributions
 bank+0.167
 banks+0.144
 choke+0.133
 wounds+0.129
bank+0.125
Neg logit contributions
itting-0.118
olly-0.113
ubby-0.094
apping-0.094
mber-0.094
Token again
Token ID757
Feature activation+7.16e-01

Pos logit contributions
 bank+0.023
 banks+0.018
 choke+0.015
 wounds+0.014
bank+0.014
Neg logit contributions
itting-0.039
olly-0.038
 importantly-0.034
mber-0.033
apping-0.033
Token while
Token ID981
Feature activation+7.30e-01

Pos logit contributions
 bank+0.056
 banks+0.045
 choke+0.038
 wounds+0.038
bank+0.036
Neg logit contributions
itting-0.079
olly-0.076
 importantly-0.068
mber-0.067
apping-0.067
Token playing
Token ID2712
Feature activation+2.59e-01

Pos logit contributions
 bank+0.071
 banks+0.059
 choke+0.054
 find+0.053
 wounds+0.052
Neg logit contributions
itting-0.067
olly-0.063
lic-0.057
year-0.055
mber-0.055
Token hide
Token ID7808
Feature activation+9.64e-01

Pos logit contributions
 bank+0.021
 banks+0.017
 choke+0.015
 wounds+0.015
bank+0.014
Neg logit contributions
itting-0.028
olly-0.027
 importantly-0.023
mber-0.023
lic-0.023
Token-
Token ID12
Feature activation-1.04e+00

Pos logit contributions
 bank+0.076
 banks+0.061
 choke+0.053
 wounds+0.051
bank+0.049
Neg logit contributions
itting-0.106
olly-0.102
 importantly-0.090
apping-0.089
mber-0.089
Tokenand
Token ID392
Feature activation+2.96e+00

Pos logit contributions
itting+0.114
olly+0.109
lic+0.096
year+0.096
apping+0.095
Neg logit contributions
 bank-0.083
 banks-0.068
 choke-0.059
 wounds-0.058
 necks-0.054
Token-
Token ID12
Feature activation-1.03e+00

Pos logit contributions
 bank+0.302
 banks+0.257
 wounds+0.229
bank+0.221
 find+0.220
Neg logit contributions
itting-0.261
olly-0.245
 importantly-0.214
ubby-0.209
lic-0.207
Tokenseek
Token ID36163
Feature activation+9.05e-01

Pos logit contributions
itting+0.123
olly+0.119
 importantly+0.107
mber+0.105
apping+0.104
Neg logit contributions
 bank-0.074
 banks-0.058
 choke-0.047
 wounds-0.046
bank-0.045
Token with
Token ID351
Feature activation+2.52e-01

Pos logit contributions
 bank+0.072
 banks+0.059
 choke+0.050
 wounds+0.049
bank+0.047
Neg logit contributions
itting-0.099
olly-0.095
 importantly-0.085
apping-0.084
mber-0.083
Token her
Token ID607
Feature activation+1.11e-01

Pos logit contributions
 bank+0.020
 banks+0.016
 choke+0.014
 wounds+0.014
bank+0.013
Neg logit contributions
itting-0.027
olly-0.026
 importantly-0.024
mber-0.023
lic-0.023
Token friends
Token ID2460
Feature activation-1.11e+00

Pos logit contributions
 bank+0.009
 banks+0.008
 choke+0.006
 wounds+0.006
bank+0.006
Neg logit contributions
itting-0.012
olly-0.011
 importantly-0.010
mber-0.010
apping-0.010
Token all
Token ID477
Feature activation-2.07e-01

Pos logit contributions
 bank+0.040
 banks+0.030
 find+0.025
 neck+0.024
 choke+0.023
Neg logit contributions
itting-0.057
olly-0.054
 importantly-0.050
ubby-0.049
reetings-0.048
Token started
Token ID2067
Feature activation-6.27e-01

Pos logit contributions
itting+0.024
olly+0.023
 importantly+0.021
apping+0.020
ama+0.020
Neg logit contributions
 bank-0.017
 banks-0.013
bank-0.011
 find-0.010
 choke-0.010
Token playing
Token ID2712
Feature activation+2.81e-01

Pos logit contributions
itting+0.054
apping+0.046
olly+0.045
 importantly+0.042
lic+0.042
Neg logit contributions
 bank-0.058
 banks-0.051
 wounds-0.045
 choke-0.044
 necks-0.043
Token hide
Token ID7808
Feature activation+1.04e+00

Pos logit contributions
 bank+0.025
 banks+0.021
 choke+0.021
bank+0.018
 wounds+0.018
Neg logit contributions
itting-0.024
olly-0.023
apping-0.021
year-0.021
lic-0.020
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
 bank+0.084
 banks+0.068
 choke+0.064
bank+0.059
 wounds+0.056
Neg logit contributions
itting-0.108
olly-0.105
year-0.094
apping-0.093
lic-0.090
Tokenand
Token ID392
Feature activation+2.96e+00

Pos logit contributions
itting+0.110
olly+0.106
lic+0.094
year+0.094
apping+0.093
Neg logit contributions
 bank-0.080
 banks-0.065
 choke-0.057
 wounds-0.055
 necks-0.052
Token-
Token ID12
Feature activation-9.98e-01

Pos logit contributions
 bank+0.337
 banks+0.292
 choke+0.262
 wounds+0.260
bank+0.253
Neg logit contributions
itting-0.226
olly-0.213
 importantly-0.178
mber-0.175
lic-0.174
Tokenseek
Token ID36163
Feature activation+9.30e-01

Pos logit contributions
itting+0.113
olly+0.109
lic+0.097
year+0.096
apping+0.096
Neg logit contributions
 bank-0.076
 banks-0.061
 choke-0.052
 wounds-0.051
 necks-0.047
Token and
Token ID290
Feature activation-9.75e-03

Pos logit contributions
 bank+0.079
 banks+0.065
 choke+0.059
 wounds+0.056
bank+0.054
Neg logit contributions
itting-0.096
olly-0.090
apping-0.084
 importantly-0.081
year-0.080
Token making
Token ID1642
Feature activation+4.82e-01

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
lic+0.001
mber+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 wounds-0.001
 choke-0.001
 necks-0.001
Token jokes
Token ID14532
Feature activation+3.33e-01

Pos logit contributions
 bank+0.042
 banks+0.035
 choke+0.033
 necks+0.030
bank+0.029
Neg logit contributions
itting-0.041
olly-0.040
lic-0.037
 importantly-0.036
year-0.036
Token had
Token ID550
Feature activation+8.70e-02

Pos logit contributions
 bank+0.024
 banks+0.019
 find+0.017
 choke+0.016
 wounds+0.016
Neg logit contributions
itting-0.032
olly-0.031
apping-0.028
ubby-0.027
mber-0.027
Token just
Token ID655
Feature activation-6.40e-02

Pos logit contributions
 bank+0.008
 banks+0.007
 choke+0.006
 wounds+0.006
bank+0.006
Neg logit contributions
itting-0.008
olly-0.008
 importantly-0.007
mber-0.007
lic-0.007
Token played
Token ID2826
Feature activation+1.33e-01

Pos logit contributions
itting+0.007
olly+0.007
lic+0.006
mber+0.006
apping+0.006
Neg logit contributions
 bank-0.005
 banks-0.004
 choke-0.004
 wounds-0.004
 find-0.004
Token hide
Token ID7808
Feature activation+9.13e-01

Pos logit contributions
 bank+0.010
 banks+0.008
 choke+0.007
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.015
olly-0.014
 importantly-0.013
apping-0.012
lic-0.012
Token-
Token ID12
Feature activation-9.82e-01

Pos logit contributions
 bank+0.069
 banks+0.056
 choke+0.048
 wounds+0.045
bank+0.044
Neg logit contributions
itting-0.103
olly-0.099
 importantly-0.088
year-0.087
apping-0.087
Tokenand
Token ID392
Feature activation+2.96e+00

Pos logit contributions
itting+0.105
olly+0.101
 importantly+0.090
mber+0.088
lic+0.088
Neg logit contributions
 bank-0.081
 banks-0.066
 choke-0.056
 wounds-0.055
bank-0.053
Token-
Token ID12
Feature activation-9.79e-01

Pos logit contributions
 bank+0.321
 banks+0.274
 wounds+0.246
 choke+0.243
bank+0.239
Neg logit contributions
itting-0.244
olly-0.229
 importantly-0.195
mber-0.190
apping-0.190
Tokenseek
Token ID36163
Feature activation+8.44e-01

Pos logit contributions
itting+0.114
olly+0.110
 importantly+0.099
mber+0.098
apping+0.097
Neg logit contributions
 bank-0.072
 banks-0.057
 choke-0.047
 wounds-0.046
bank-0.045
Token with
Token ID351
Feature activation+2.44e-01

Pos logit contributions
 bank+0.071
 banks+0.059
 choke+0.051
 wounds+0.050
bank+0.048
Neg logit contributions
itting-0.087
olly-0.083
 importantly-0.075
apping-0.075
year-0.073
Token Tweet
Token ID18752
Feature activation+1.22e+00

Pos logit contributions
 bank+0.019
 banks+0.016
 choke+0.013
 necks+0.013
bank+0.013
Neg logit contributions
itting-0.024
olly-0.024
 importantly-0.023
ubby-0.022
year-0.021
Tokenie
Token ID494
Feature activation-2.01e-01

Pos logit contributions
 bank+0.147
 banks+0.129
 choke+0.117
 wounds+0.115
bank+0.112
Neg logit contributions
itting-0.086
olly-0.081
 importantly-0.063
ubby-0.062
omed-0.061
Token decided
Token ID3066
Feature activation+6.53e-02

Pos logit contributions
itting+0.159
olly+0.152
 importantly+0.136
mber+0.134
lic+0.134
Neg logit contributions
 bank-0.130
 banks-0.106
 choke-0.088
 wounds-0.087
bank-0.085
Token to
Token ID284
Feature activation+3.16e-01

Pos logit contributions
 bank+0.005
 banks+0.004
 wounds+0.003
 choke+0.003
bank+0.003
Neg logit contributions
itting-0.008
olly-0.008
 importantly-0.007
lic-0.007
ubby-0.007
Token play
Token ID711
Feature activation-1.55e-01

Pos logit contributions
 bank+0.023
 banks+0.018
 choke+0.014
 find+0.014
 wounds+0.014
Neg logit contributions
itting-0.039
olly-0.037
 importantly-0.034
lic-0.033
year-0.033
Token hide
Token ID7808
Feature activation+9.87e-01

Pos logit contributions
itting+0.016
olly+0.016
 importantly+0.014
apping+0.014
lic+0.014
Neg logit contributions
 bank-0.013
 banks-0.011
 choke-0.009
 wounds-0.009
bank-0.009
Token-
Token ID12
Feature activation-1.04e+00

Pos logit contributions
 bank+0.075
 banks+0.059
 choke+0.052
 wounds+0.049
bank+0.048
Neg logit contributions
itting-0.111
olly-0.107
 importantly-0.095
year-0.094
apping-0.094
Tokenand
Token ID392
Feature activation+2.96e+00

Pos logit contributions
itting+0.121
olly+0.117
 importantly+0.104
mber+0.103
lic+0.103
Neg logit contributions
 bank-0.077
 banks-0.061
 choke-0.050
 wounds-0.050
bank-0.047
Token-
Token ID12
Feature activation-1.03e+00

Pos logit contributions
 bank+0.317
 banks+0.273
 wounds+0.242
 choke+0.235
bank+0.235
Neg logit contributions
itting-0.247
olly-0.231
 importantly-0.200
mber-0.194
lic-0.194
Tokenseek
Token ID36163
Feature activation+9.35e-01

Pos logit contributions
itting+0.120
olly+0.115
 importantly+0.102
lic+0.102
mber+0.102
Neg logit contributions
 bank-0.076
 banks-0.060
 choke-0.050
 wounds-0.049
bank-0.046
Token instead
Token ID2427
Feature activation-6.70e-01

Pos logit contributions
 bank+0.077
 banks+0.062
 choke+0.054
 wounds+0.052
bank+0.051
Neg logit contributions
itting-0.100
olly-0.096
 importantly-0.086
apping-0.085
year-0.084
Token.
Token ID13
Feature activation+6.67e-02

Pos logit contributions
itting+0.080
olly+0.076
mber+0.070
lic+0.068
 importantly+0.067
Neg logit contributions
 bank-0.048
 banks-0.038
bank-0.029
 choke-0.029
 wounds-0.029
Token They
Token ID1119
Feature activation+7.24e-02

Pos logit contributions
 bank+0.006
 banks+0.005
 choke+0.004
 wounds+0.004
bank+0.004
Neg logit contributions
itting-0.007
olly-0.006
 importantly-0.006
mber-0.006
lic-0.006
Token and
Token ID290
Feature activation-6.54e-02

Pos logit contributions
olly+0.048
itting+0.048
mber+0.042
apping+0.041
ister+0.041
Neg logit contributions
 bank-0.043
 banks-0.037
 choke-0.033
 wounds-0.032
 necks-0.030
Token they
Token ID484
Feature activation+3.93e-01

Pos logit contributions
itting+0.007
olly+0.007
ubby+0.006
 importantly+0.006
omed+0.006
Neg logit contributions
 bank-0.006
 banks-0.005
 find-0.004
 choke-0.004
bank-0.004
Token played
Token ID2826
Feature activation+5.37e-02

Pos logit contributions
 bank+0.032
 banks+0.025
 find+0.021
 choke+0.020
 neck+0.019
Neg logit contributions
itting-0.045
olly-0.043
 importantly-0.040
apping-0.039
mber-0.038
Token hide
Token ID7808
Feature activation+8.47e-01

Pos logit contributions
 bank+0.005
 banks+0.004
 choke+0.004
 wounds+0.004
bank+0.004
Neg logit contributions
itting-0.005
olly-0.005
 importantly-0.004
apping-0.004
lic-0.004
Token-
Token ID12
Feature activation-1.08e+00

Pos logit contributions
 bank+0.070
 banks+0.057
 choke+0.053
bank+0.049
 wounds+0.046
Neg logit contributions
itting-0.086
olly-0.085
year-0.074
apping-0.074
 importantly-0.073
Tokenand
Token ID392
Feature activation+2.96e+00

Pos logit contributions
itting+0.115
olly+0.112
lic+0.100
year+0.100
apping+0.097
Neg logit contributions
 bank-0.088
 banks-0.073
 choke-0.065
 wounds-0.062
 neck-0.058
Token-
Token ID12
Feature activation-1.06e+00

Pos logit contributions
 bank+0.332
 banks+0.287
 choke+0.256
 wounds+0.255
bank+0.248
Neg logit contributions
itting-0.230
olly-0.217
 importantly-0.182
mber-0.179
lic-0.178
Tokenseek
Token ID36163
Feature activation+1.01e+00

Pos logit contributions
itting+0.116
olly+0.112
lic+0.100
year+0.099
mber+0.098
Neg logit contributions
 bank-0.082
 banks-0.067
 choke-0.059
 wounds-0.058
 neck-0.053
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
 bank+0.087
 banks+0.071
 choke+0.063
 wounds+0.061
bank+0.058
Neg logit contributions
itting-0.105
olly-0.100
apping-0.089
 importantly-0.089
lic-0.087
Token After
Token ID2293
Feature activation-8.90e-01

Pos logit contributions
 bank+0.020
 banks+0.016
bank+0.013
 choke+0.013
 necks+0.012
Neg logit contributions
itting-0.027
olly-0.025
ubby-0.023
 importantly-0.023
ister-0.023
Token a
Token ID257
Feature activation+1.02e-01

Pos logit contributions
itting+0.097
olly+0.092
 importantly+0.081
mber+0.081
apping+0.080
Neg logit contributions
 bank-0.074
 banks-0.060
 choke-0.051
 wounds-0.050
bank-0.048
Token.
Token ID13
Feature activation+7.52e-02

Pos logit contributions
itting+0.020
olly+0.019
 importantly+0.017
lic+0.017
apping+0.017
Neg logit contributions
 bank-0.015
 banks-0.012
 choke-0.010
 wounds-0.010
bank-0.009
Token They
Token ID1119
Feature activation+1.22e-01

Pos logit contributions
 bank+0.006
 banks+0.005
 find+0.005
bank+0.004
 choke+0.004
Neg logit contributions
itting-0.008
olly-0.007
ister-0.007
ubby-0.006
ying-0.006
Token played
Token ID2826
Feature activation+8.95e-02

Pos logit contributions
 bank+0.010
 banks+0.008
 choke+0.007
 wounds+0.007
 necks+0.007
Neg logit contributions
itting-0.013
olly-0.013
 importantly-0.011
mber-0.011
apping-0.011
Token hide
Token ID7808
Feature activation+9.73e-01

Pos logit contributions
 bank+0.009
 banks+0.007
 choke+0.007
 wounds+0.006
bank+0.006
Neg logit contributions
itting-0.008
olly-0.008
apping-0.007
lic-0.007
 importantly-0.007
Token-
Token ID12
Feature activation-1.05e+00

Pos logit contributions
 bank+0.072
 banks+0.057
 choke+0.049
 wounds+0.046
bank+0.045
Neg logit contributions
itting-0.112
olly-0.109
 importantly-0.096
lic-0.096
apping-0.095
Tokenand
Token ID392
Feature activation+2.95e+00

Pos logit contributions
itting+0.116
olly+0.113
lic+0.100
year+0.099
mber+0.098
Neg logit contributions
 bank-0.082
 banks-0.067
 choke-0.059
 wounds-0.057
 necks-0.052
Token-
Token ID12
Feature activation-1.03e+00

Pos logit contributions
 bank+0.334
 banks+0.289
 choke+0.258
 wounds+0.257
bank+0.250
Neg logit contributions
itting-0.228
olly-0.214
 importantly-0.180
mber-0.176
lic-0.175
Tokenseek
Token ID36163
Feature activation+9.50e-01

Pos logit contributions
itting+0.111
olly+0.107
lic+0.096
year+0.095
mber+0.093
Neg logit contributions
 bank-0.082
 banks-0.067
 choke-0.061
 wounds-0.060
 neck-0.054
Token until
Token ID1566
Feature activation+6.04e-01

Pos logit contributions
 bank+0.081
 banks+0.066
 choke+0.060
 wounds+0.058
bank+0.055
Neg logit contributions
itting-0.099
olly-0.094
apping-0.085
 importantly-0.084
lic-0.082
Token it
Token ID340
Feature activation-6.42e-01

Pos logit contributions
 bank+0.044
 banks+0.034
 find+0.029
 way+0.028
 tears+0.027
Neg logit contributions
itting-0.073
olly-0.068
lic-0.063
reetings-0.063
apping-0.061
Token was
Token ID373
Feature activation-1.23e-01

Pos logit contributions
itting+0.070
olly+0.067
lic+0.060
mber+0.059
 importantly+0.059
Neg logit contributions
 bank-0.051
 banks-0.041
 wounds-0.036
 choke-0.036
 necks-0.034
Token
Token ID95
Feature activation+1.04e+00

Pos logit contributions
 bank+0.007
 banks+0.006
bank+0.005
 choke+0.005
 wounds+0.005
Neg logit contributions
itting-0.005
 importantly-0.005
year-0.005
olly-0.005
enna-0.005
Token
Token ID26391
Feature activation-2.37e-01

Pos logit contributions
 bank+0.101
 banks+0.085
bank+0.076
 choke+0.073
 wounds+0.070
Neg logit contributions
itting-0.097
olly-0.091
 importantly-0.087
year-0.080
apping-0.079
Token
Token ID129
Feature activation+2.62e-01

Pos logit contributions
itting+0.023
olly+0.022
year+0.021
 importantly+0.020
ubby+0.019
Neg logit contributions
 bank-0.021
 banks-0.018
 choke-0.016
 neck-0.016
 necks-0.015
Token
Token ID241
Feature activation+6.18e-01

Pos logit contributions
 bank+0.025
 banks+0.021
 choke+0.018
 wounds+0.018
bank+0.018
Neg logit contributions
itting-0.025
olly-0.024
 importantly-0.021
mber-0.020
lic-0.020
TokenA
Token ID32
Feature activation+7.48e-02

Pos logit contributions
 bank+0.062
 banks+0.053
 choke+0.047
 wounds+0.046
bank+0.045
Neg logit contributions
itting-0.053
olly-0.051
 importantly-0.043
lic-0.043
apping-0.043
Tokenww
Token ID1383
Feature activation-4.27e+00

Pos logit contributions
 bank+0.007
 banks+0.005
 choke+0.005
 way+0.004
bank+0.004
Neg logit contributions
itting-0.008
olly-0.007
apping-0.007
 importantly-0.007
year-0.006
Token,
Token ID11
Feature activation-3.44e-01

Pos logit contributions
itting+0.546
year+0.518
apping+0.461
 importantly+0.458
ane+0.456
Neg logit contributions
 bank-0.259
 banks-0.192
 w-0.162
 way-0.162
 choke-0.152
Token okay
Token ID8788
Feature activation-4.62e-01

Pos logit contributions
itting+0.036
olly+0.033
apping+0.030
ane+0.030
lic+0.030
Neg logit contributions
 bank-0.029
 banks-0.023
 find-0.021
 choke-0.021
 wounds-0.020
Token.
Token ID13
Feature activation+8.74e-02

Pos logit contributions
itting+0.043
olly+0.042
year+0.038
 importantly+0.037
ival+0.037
Neg logit contributions
 bank-0.040
 banks-0.033
 choke-0.031
 mistake-0.031
 find-0.030
Tokenâ
Token ID22940
Feature activation-1.05e-01

Pos logit contributions
 bank+0.007
 banks+0.006
 choke+0.005
 find+0.005
bank+0.005
Neg logit contributions
itting-0.009
olly-0.008
lic-0.007
ross-0.007
year-0.007
Token
Token ID26391
Feature activation-2.35e-01

Pos logit contributions
itting+0.012
 importantly+0.011
olly+0.010
year+0.010
apping+0.010
Neg logit contributions
 bank-0.008
 banks-0.007
bank-0.006
 sav-0.005
 way-0.005
Token at
Token ID379
Feature activation-1.40e+00

Pos logit contributions
itting+0.021
olly+0.020
 importantly+0.018
year+0.017
apping+0.017
Neg logit contributions
 bank-0.018
 banks-0.015
 choke-0.013
 wounds-0.013
bank-0.013
Token it
Token ID340
Feature activation-5.80e-01

Pos logit contributions
itting+0.142
olly+0.135
 importantly+0.120
year+0.118
ane+0.117
Neg logit contributions
 bank-0.118
 banks-0.098
 choke-0.087
 wounds-0.083
bank-0.081
Token said
Token ID531
Feature activation-1.73e-01

Pos logit contributions
itting+0.065
olly+0.062
 importantly+0.055
apping+0.055
lic+0.054
Neg logit contributions
 bank-0.045
 banks-0.036
 choke-0.031
 wounds-0.030
bank-0.029
Token "
Token ID366
Feature activation+3.38e-01

Pos logit contributions
itting+0.019
olly+0.019
mber+0.017
ubby+0.017
lic+0.016
Neg logit contributions
 bank-0.013
 banks-0.010
 wounds-0.009
 neck-0.009
bank-0.008
Tokenew
Token ID413
Feature activation-3.46e-01

Pos logit contributions
 bank+0.032
 banks+0.027
 choke+0.023
 wounds+0.023
bank+0.022
Neg logit contributions
itting-0.032
olly-0.031
 importantly-0.027
mber-0.026
lic-0.026
Tokenww
Token ID1383
Feature activation-4.27e+00

Pos logit contributions
itting+0.032
olly+0.030
 importantly+0.029
year+0.027
apping+0.025
Neg logit contributions
 bank-0.034
 banks-0.029
bank-0.024
 necks-0.024
 neck-0.023
Token!"
Token ID2474
Feature activation-4.53e-01

Pos logit contributions
itting+0.406
year+0.388
 importantly+0.374
olly+0.352
itte+0.332
Neg logit contributions
 bank-0.391
 banks-0.337
 piles-0.269
 choke-0.263
 heads-0.260
Token
Token ID220
Feature activation+2.51e-01

Pos logit contributions
itting+0.046
olly+0.044
 importantly+0.039
mber+0.039
lic+0.039
Neg logit contributions
 bank-0.039
 banks-0.033
 choke-0.028
 wounds-0.028
bank-0.027
Token
Token ID198
Feature activation-8.92e-03

Pos logit contributions
 bank+0.019
 banks+0.015
 choke+0.013
 wounds+0.013
bank+0.012
Neg logit contributions
itting-0.028
olly-0.027
 importantly-0.024
mber-0.024
lic-0.024
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
apping+0.001
mber+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 choke-0.000
 wounds-0.000
bank-0.000
TokenOne
Token ID3198
Feature activation+4.47e-01

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
apping+0.001
mber+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 choke-0.001
bank-0.001
 wounds-0.001
Token He
Token ID679
Feature activation-2.50e-01

Pos logit contributions
 bank+0.003
 banks+0.002
 choke+0.002
 tears+0.002
 teeth+0.002
Neg logit contributions
itting-0.005
olly-0.005
ying-0.004
ane-0.004
reetings-0.004
Token said
Token ID531
Feature activation-4.59e-02

Pos logit contributions
itting+0.022
olly+0.021
 importantly+0.018
mber+0.018
lic+0.017
Neg logit contributions
 bank-0.026
 banks-0.022
 choke-0.019
 wounds-0.019
bank-0.019
Token,
Token ID11
Feature activation-1.19e-01

Pos logit contributions
itting+0.005
olly+0.004
 importantly+0.004
lic+0.004
mber+0.004
Neg logit contributions
 bank-0.004
 banks-0.003
 choke-0.003
 wounds-0.003
bank-0.003
Token "
Token ID366
Feature activation+3.69e-01

Pos logit contributions
itting+0.014
olly+0.013
ubby+0.012
 importantly+0.011
ancers+0.011
Neg logit contributions
 bank-0.009
 banks-0.007
 neck-0.006
 choke-0.006
bank-0.006
TokenE
Token ID36
Feature activation+5.84e-01

Pos logit contributions
 bank+0.040
 banks+0.034
 wounds+0.030
 choke+0.030
bank+0.029
Neg logit contributions
itting-0.031
olly-0.029
mber-0.025
 importantly-0.025
lic-0.025
Tokenww
Token ID1383
Feature activation-4.20e+00

Pos logit contributions
 bank+0.064
 banks+0.053
 choke+0.051
 necks+0.050
bank+0.049
Neg logit contributions
itting-0.041
 importantly-0.037
lic-0.035
year-0.034
omed-0.033
Token,
Token ID11
Feature activation-1.28e-01

Pos logit contributions
itting+0.521
year+0.487
 importantly+0.461
olly+0.446
itte+0.444
Neg logit contributions
 bank-0.267
 banks-0.198
 choke-0.175
 way-0.155
 find-0.149
Token this
Token ID428
Feature activation+2.22e-01

Pos logit contributions
itting+0.015
olly+0.013
apping+0.012
ane+0.012
ubby+0.012
Neg logit contributions
 bank-0.010
 banks-0.008
 choke-0.007
 wounds-0.007
 find-0.007
Token is
Token ID318
Feature activation-2.61e-01

Pos logit contributions
 bank+0.018
 banks+0.014
 choke+0.012
 wounds+0.012
bank+0.012
Neg logit contributions
itting-0.024
olly-0.023
 importantly-0.021
apping-0.021
year-0.021
Token the
Token ID262
Feature activation-2.04e-01

Pos logit contributions
itting+0.026
olly+0.026
 importantly+0.023
mber+0.022
lic+0.022
Neg logit contributions
 bank-0.023
 banks-0.019
 wounds-0.016
bank-0.016
 choke-0.015
Token worst
Token ID5290
Feature activation-1.06e-01

Pos logit contributions
itting+0.017
olly+0.017
ay+0.014
ister+0.014
 importantly+0.014
Neg logit contributions
 bank-0.020
 banks-0.018
bank-0.015
 wounds-0.015
 choke-0.015
Token and
Token ID290
Feature activation-3.73e-02

Pos logit contributions
 bank+0.010
 banks+0.008
 choke+0.007
 wounds+0.007
bank+0.006
Neg logit contributions
itting-0.016
olly-0.015
 importantly-0.013
lic-0.013
mber-0.013
Token said
Token ID531
Feature activation-1.31e-01

Pos logit contributions
itting+0.004
olly+0.004
ane+0.003
reetings+0.003
lic+0.003
Neg logit contributions
 bank-0.003
 banks-0.003
 choke-0.002
 wounds-0.002
 find-0.002
Token,
Token ID11
Feature activation-2.69e-01

Pos logit contributions
itting+0.016
olly+0.015
ubby+0.014
 importantly+0.014
lic+0.013
Neg logit contributions
 bank-0.009
 banks-0.006
bank-0.005
 wounds-0.005
 find-0.005
Token "
Token ID366
Feature activation+3.54e-01

Pos logit contributions
itting+0.026
olly+0.025
 importantly+0.022
ubby+0.021
year+0.021
Neg logit contributions
 bank-0.025
 banks-0.021
 choke-0.018
 wounds-0.018
bank-0.018
TokenA
Token ID32
Feature activation+1.82e-01

Pos logit contributions
 bank+0.036
 banks+0.031
 choke+0.027
 wounds+0.027
bank+0.026
Neg logit contributions
itting-0.031
olly-0.030
 importantly-0.025
lic-0.025
apping-0.025
Tokenww
Token ID1383
Feature activation-4.20e+00

Pos logit contributions
 bank+0.014
 banks+0.011
 way+0.010
 find+0.009
 choke+0.009
Neg logit contributions
itting-0.020
olly-0.019
 importantly-0.018
year-0.018
apping-0.018
Tokenwww
Token ID2503
Feature activation-2.24e+00

Pos logit contributions
year+0.527
itting+0.510
 importantly+0.472
itte+0.455
ane+0.449
Neg logit contributions
 bank-0.234
 banks-0.180
 way-0.163
 choke-0.152
 piles-0.143
Token,
Token ID11
Feature activation-2.83e-01

Pos logit contributions
itting+0.261
year+0.253
 importantly+0.246
olly+0.242
mber+0.231
Neg logit contributions
 bank-0.152
 banks-0.119
 find-0.101
 way-0.097
 choke-0.094
Token do
Token ID466
Feature activation-1.05e+00

Pos logit contributions
itting+0.030
olly+0.028
year+0.025
apping+0.025
 importantly+0.024
Neg logit contributions
 bank-0.024
 banks-0.019
 find-0.018
 way-0.017
 wounds-0.017
Token I
Token ID314
Feature activation-3.35e-01

Pos logit contributions
itting+0.111
olly+0.101
year+0.100
lic+0.092
ane+0.091
Neg logit contributions
 bank-0.091
 banks-0.076
 find-0.068
 choke-0.067
 wounds-0.066
Token have
Token ID423
Feature activation-2.59e-01

Pos logit contributions
itting+0.035
olly+0.034
 importantly+0.030
mber+0.030
apping+0.029
Neg logit contributions
 bank-0.029
 banks-0.023
 choke-0.020
 wounds-0.019
 find-0.019
Token be
Token ID307
Feature activation-2.61e-01

Pos logit contributions
itting+0.098
olly+0.095
ister+0.085
ous+0.085
 importantly+0.085
Neg logit contributions
 bank-0.064
 banks-0.053
 wounds-0.046
 necks-0.044
bank-0.043
Token bitter
Token ID12922
Feature activation+6.93e-01

Pos logit contributions
itting+0.027
olly+0.026
 importantly+0.023
mber+0.023
lic+0.023
Neg logit contributions
 bank-0.022
 banks-0.018
 choke-0.016
 wounds-0.015
bank-0.015
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
 bank+0.053
 banks+0.042
 choke+0.035
 wounds+0.035
bank+0.034
Neg logit contributions
itting-0.079
olly-0.076
 importantly-0.068
apping-0.067
lic-0.066
Token like
Token ID588
Feature activation+1.71e+00

Pos logit contributions
itting+0.013
olly+0.012
 importantly+0.011
lic+0.011
mber+0.011
Neg logit contributions
 bank-0.009
 banks-0.008
 choke-0.006
 wounds-0.006
bank-0.006
Token e
Token ID304
Feature activation-1.90e-01

Pos logit contributions
 bank+0.155
 banks+0.129
 choke+0.113
 wounds+0.112
bank+0.106
Neg logit contributions
itting-0.170
olly-0.162
lic-0.141
mber-0.141
 importantly-0.140
Tokenww
Token ID1383
Feature activation-4.19e+00

Pos logit contributions
itting+0.016
olly+0.015
 importantly+0.015
year+0.014
mber+0.014
Neg logit contributions
 bank-0.019
 banks-0.016
 wounds-0.015
 choke-0.015
 necks-0.014
Token.
Token ID13
Feature activation+6.01e-02

Pos logit contributions
itting+0.479
year+0.427
olly+0.423
mber+0.405
reetings+0.400
Neg logit contributions
 bank-0.326
 banks-0.265
 choke-0.227
 wounds-0.208
 piles-0.206
Token Do
Token ID2141
Feature activation-8.87e-01

Pos logit contributions
 bank+0.005
 banks+0.004
 choke+0.004
 find+0.004
 neck+0.004
Neg logit contributions
itting-0.006
olly-0.006
mber-0.005
ying-0.005
lic-0.005
Token you
Token ID345
Feature activation-5.92e-01

Pos logit contributions
itting+0.086
olly+0.077
year+0.075
lic+0.073
apping+0.071
Neg logit contributions
 bank-0.080
 banks-0.066
 wounds-0.064
 choke-0.062
 find-0.061
Token want
Token ID765
Feature activation+3.72e-01

Pos logit contributions
itting+0.063
olly+0.061
 importantly+0.054
mber+0.054
lic+0.054
Neg logit contributions
 bank-0.048
 banks-0.039
 choke-0.035
 wounds-0.034
 find-0.033
Token to
Token ID284
Feature activation+2.55e-01

Pos logit contributions
 bank+0.029
 banks+0.023
 choke+0.019
 wounds+0.019
bank+0.018
Neg logit contributions
itting-0.042
olly-0.041
 importantly-0.036
mber-0.036
lic-0.036
Token the
Token ID262
Feature activation-1.43e-01

Pos logit contributions
itting+0.041
olly+0.040
ister+0.035
ild+0.035
mber+0.035
Neg logit contributions
 bank-0.040
 banks-0.034
 wounds-0.030
 choke-0.029
bank-0.029
Token ground
Token ID2323
Feature activation+9.28e-02

Pos logit contributions
olly+0.012
itting+0.012
unt+0.012
mber+0.011
www+0.011
Neg logit contributions
 bank-0.010
 choke-0.009
 banks-0.009
 wounds-0.008
 teeth-0.008
Token.
Token ID13
Feature activation+7.35e-02

Pos logit contributions
 bank+0.006
 banks+0.005
 choke+0.004
 wounds+0.004
bank+0.004
Neg logit contributions
itting-0.011
olly-0.011
 importantly-0.010
mber-0.009
lic-0.009
Token "
Token ID366
Feature activation+3.25e-01

Pos logit contributions
 bank+0.007
 banks+0.005
bank+0.005
 choke+0.004
 neck+0.004
Neg logit contributions
itting-0.007
olly-0.007
 importantly-0.006
ubby-0.006
year-0.006
TokenE
Token ID36
Feature activation+5.21e-01

Pos logit contributions
 bank+0.031
 banks+0.025
 choke+0.022
 wounds+0.022
 necks+0.022
Neg logit contributions
itting-0.030
olly-0.030
 importantly-0.026
apping-0.025
lic-0.025
Tokenww
Token ID1383
Feature activation-4.18e+00

Pos logit contributions
 bank+0.062
 banks+0.053
 choke+0.048
 necks+0.048
bank+0.048
Neg logit contributions
itting-0.032
lic-0.030
 importantly-0.029
olly-0.027
year-0.027
Token,
Token ID11
Feature activation-2.91e-01

Pos logit contributions
itting+0.502
year+0.482
 importantly+0.449
apping+0.436
olly+0.435
Neg logit contributions
 bank-0.297
 banks-0.232
 choke-0.188
 way-0.178
 find-0.166
Token what
Token ID644
Feature activation-1.26e+00

Pos logit contributions
itting+0.031
olly+0.029
apping+0.026
lic+0.026
 importantly+0.025
Neg logit contributions
 bank-0.025
 banks-0.019
 find-0.018
 choke-0.018
 wounds-0.017
Token's
Token ID338
Feature activation+4.64e-01

Pos logit contributions
itting+0.147
olly+0.141
lic+0.130
arl+0.129
year+0.127
Neg logit contributions
 bank-0.087
 banks-0.063
 find-0.063
 choke-0.063
 way-0.061
Token that
Token ID326
Feature activation+8.61e-02

Pos logit contributions
 bank+0.041
 banks+0.032
 wounds+0.030
 choke+0.029
 find+0.028
Neg logit contributions
itting-0.049
olly-0.048
lic-0.043
apping-0.042
mber-0.042
Token?"
Token ID1701
Feature activation-1.71e+00

Pos logit contributions
 bank+0.008
 banks+0.007
 choke+0.006
bank+0.006
 way+0.006
Neg logit contributions
itting-0.008
olly-0.008
year-0.007
 importantly-0.007
apping-0.007
Token She
Token ID1375
Feature activation-3.70e-01

Pos logit contributions
 bank+0.010
 banks+0.009
 choke+0.008
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.010
olly-0.009
mber-0.008
 importantly-0.008
year-0.008
Token said
Token ID531
Feature activation-1.59e-02

Pos logit contributions
itting+0.032
olly+0.032
lic+0.028
mber+0.028
 importantly+0.027
Neg logit contributions
 bank-0.037
 banks-0.031
bank-0.027
 choke-0.027
 wounds-0.027
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
itting+0.002
olly+0.002
 importantly+0.001
lic+0.001
mber+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 choke-0.001
 wounds-0.001
bank-0.001
Token "
Token ID366
Feature activation+3.92e-01

Pos logit contributions
itting+0.012
olly+0.012
 importantly+0.010
mber+0.010
year+0.010
Neg logit contributions
 bank-0.010
 banks-0.008
 choke-0.007
 wounds-0.007
 neck-0.007
TokenE
Token ID36
Feature activation+6.01e-01

Pos logit contributions
 bank+0.045
 banks+0.039
 wounds+0.036
bank+0.034
 choke+0.034
Neg logit contributions
itting-0.030
olly-0.027
mber-0.023
ww-0.023
lic-0.023
Tokenww
Token ID1383
Feature activation-4.17e+00

Pos logit contributions
 bank+0.074
 banks+0.064
 choke+0.061
 necks+0.059
bank+0.058
Neg logit contributions
itting-0.037
 importantly-0.031
olly-0.030
lic-0.030
year-0.030
Token,
Token ID11
Feature activation-1.32e-01

Pos logit contributions
itting+0.501
year+0.490
 importantly+0.455
lic+0.440
olly+0.438
Neg logit contributions
 bank-0.260
 banks-0.197
 choke-0.180
 find-0.157
 way-0.154
Token disgusting
Token ID23374
Feature activation+1.53e-01

Pos logit contributions
itting+0.014
olly+0.013
 importantly+0.011
year+0.011
apping+0.011
Neg logit contributions
 bank-0.012
 banks-0.010
 choke-0.009
 wounds-0.008
bank-0.008
Token!"
Token ID2474
Feature activation-3.69e-01

Pos logit contributions
 bank+0.013
 banks+0.010
 choke+0.009
 wounds+0.009
bank+0.008
Neg logit contributions
itting-0.017
olly-0.016
 importantly-0.014
year-0.014
lic-0.014
Token She
Token ID1375
Feature activation-3.72e-01

Pos logit contributions
itting+0.031
olly+0.030
 importantly+0.026
apping+0.025
mber+0.025
Neg logit contributions
 bank-0.039
 banks-0.033
 choke-0.029
 wounds-0.029
bank-0.028
Token threw
Token ID9617
Feature activation+4.72e-01

Pos logit contributions
itting+0.035
olly+0.034
 importantly+0.029
lic+0.029
mber+0.029
Neg logit contributions
 bank-0.035
 banks-0.030
 choke-0.026
 wounds-0.026
bank-0.025
Token friends
Token ID2460
Feature activation-1.14e+00

Pos logit contributions
itting+0.020
olly+0.019
 importantly+0.017
mber+0.017
apping+0.016
Neg logit contributions
 bank-0.018
 banks-0.015
 choke-0.013
 wounds-0.013
bank-0.013
Token said
Token ID531
Feature activation-1.12e-01

Pos logit contributions
itting+0.116
olly+0.110
 importantly+0.097
ubby+0.094
lic+0.094
Neg logit contributions
 bank-0.104
 banks-0.085
 choke-0.072
bank-0.071
 wounds-0.071
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
itting+0.012
olly+0.012
mber+0.010
ubby+0.010
reetings+0.010
Neg logit contributions
 bank-0.009
 banks-0.007
bank-0.006
 wounds-0.006
 choke-0.006
Token "
Token ID366
Feature activation+4.00e-01

Pos logit contributions
itting+0.012
olly+0.011
 importantly+0.010
ubby+0.009
ane+0.009
Neg logit contributions
 bank-0.011
 banks-0.009
 choke-0.008
 wounds-0.008
bank-0.008
TokenA
Token ID32
Feature activation+1.24e-01

Pos logit contributions
 bank+0.036
 banks+0.029
bank+0.026
 necks+0.025
 wounds+0.025
Neg logit contributions
itting-0.037
olly-0.037
ubby-0.033
omed-0.032
 importantly-0.032
Tokenww
Token ID1383
Feature activation-4.17e+00

Pos logit contributions
 bank+0.011
 banks+0.008
 way+0.008
 choke+0.007
bank+0.007
Neg logit contributions
itting-0.014
olly-0.012
apping-0.011
otion-0.011
ying-0.011
Token,
Token ID11
Feature activation-2.13e-01

Pos logit contributions
itting+0.517
year+0.505
 importantly+0.464
lic+0.461
mber+0.459
Neg logit contributions
 bank-0.261
 banks-0.193
 choke-0.163
 find-0.153
 way-0.150
Token okay
Token ID8788
Feature activation-2.68e-01

Pos logit contributions
itting+0.023
olly+0.021
apping+0.019
ane+0.019
lic+0.019
Neg logit contributions
 bank-0.017
 banks-0.013
 find-0.013
 way-0.012
 wounds-0.012
Token.
Token ID13
Feature activation+7.86e-02

Pos logit contributions
itting+0.031
olly+0.030
year+0.026
lic+0.026
 importantly+0.026
Neg logit contributions
 bank-0.020
 banks-0.016
 choke-0.014
 wounds-0.013
 find-0.013
Token But
Token ID887
Feature activation+3.94e-01

Pos logit contributions
 bank+0.006
 banks+0.005
 find+0.004
bank+0.004
 Fl+0.004
Neg logit contributions
itting-0.008
olly-0.008
ier-0.007
lic-0.007
ying-0.007
Token come
Token ID1282
Feature activation+8.40e-02

Pos logit contributions
 bank+0.033
 banks+0.027
 find+0.025
 choke+0.024
 wounds+0.024
Neg logit contributions
itting-0.042
olly-0.041
lic-0.038
year-0.035
ane-0.035
Token.
Token ID13
Feature activation+9.19e-02

Pos logit contributions
 bank+0.057
 banks+0.046
 wounds+0.040
 choke+0.039
 necks+0.037
Neg logit contributions
itting-0.085
olly-0.081
apping-0.072
 importantly-0.071
lic-0.071
Token\n
Token ID198
Feature activation-5.72e-02

Pos logit contributions
 bank+0.008
 banks+0.007
bank+0.006
 choke+0.006
 find+0.006
Neg logit contributions
itting-0.009
olly-0.008
 importantly-0.007
ister-0.007
ubby-0.007
Token\n
Token ID198
Feature activation-5.81e-02

Pos logit contributions
itting+0.006
olly+0.006
 importantly+0.006
mber+0.005
lic+0.005
Neg logit contributions
 bank-0.004
 banks-0.003
 choke-0.003
 wounds-0.003
bank-0.003
Token"
Token ID1
Feature activation-4.96e-01

Pos logit contributions
itting+0.005
olly+0.005
 importantly+0.004
ous+0.004
ancer+0.004
Neg logit contributions
 bank-0.006
 banks-0.005
 find-0.004
 wounds-0.004
 choke-0.004
TokenE
Token ID36
Feature activation+5.77e-01

Pos logit contributions
itting+0.047
olly+0.045
apping+0.039
 importantly+0.039
mber+0.038
Neg logit contributions
 bank-0.047
 banks-0.039
 choke-0.035
 wounds-0.034
 necks-0.033
Tokenww
Token ID1383
Feature activation-4.14e+00

Pos logit contributions
 bank+0.070
 banks+0.061
 choke+0.056
 necks+0.055
bank+0.054
Neg logit contributions
itting-0.034
 importantly-0.031
olly-0.029
lic-0.029
year-0.028
Token!"
Token ID2474
Feature activation-4.29e-01

Pos logit contributions
itting+0.487
year+0.449
olly+0.424
 importantly+0.424
apping+0.412
Neg logit contributions
 bank-0.303
 banks-0.242
 choke-0.207
 way-0.185
 piles-0.184
Token she
Token ID673
Feature activation+4.13e-01

Pos logit contributions
itting+0.038
olly+0.035
 importantly+0.030
lic+0.030
apping+0.029
Neg logit contributions
 bank-0.045
 banks-0.038
 choke-0.034
 wounds-0.033
 find-0.032
Token screamed
Token ID25421
Feature activation+3.47e-01

Pos logit contributions
 bank+0.046
 banks+0.040
 wounds+0.035
 choke+0.035
bank+0.034
Neg logit contributions
itting-0.031
olly-0.031
mber-0.026
lic-0.026
 importantly-0.026
Token.
Token ID13
Feature activation+1.06e-01

Pos logit contributions
 bank+0.030
 banks+0.026
 necks+0.022
bank+0.021
 wounds+0.021
Neg logit contributions
itting-0.034
olly-0.033
apping-0.031
 importantly-0.029
year-0.029
Token\n
Token ID198
Feature activation-2.87e-02

Pos logit contributions
 bank+0.009
 banks+0.007
 choke+0.006
bank+0.006
 wounds+0.006
Neg logit contributions
itting-0.011
olly-0.011
 importantly-0.010
year-0.010
ubby-0.010
Token and
Token ID290
Feature activation+3.19e-02

Pos logit contributions
 bank+0.045
 banks+0.038
 wounds+0.034
 choke+0.033
bank+0.031
Neg logit contributions
itting-0.052
olly-0.049
apping-0.045
ous-0.043
lic-0.042
Token said
Token ID531
Feature activation-6.52e-02

Pos logit contributions
 bank+0.003
 banks+0.002
 wounds+0.002
 choke+0.002
bank+0.002
Neg logit contributions
itting-0.003
olly-0.003
 importantly-0.003
lic-0.003
mber-0.003
Token,
Token ID11
Feature activation-1.37e-01

Pos logit contributions
itting+0.007
olly+0.006
mber+0.006
 importantly+0.006
year+0.006
Neg logit contributions
 bank-0.006
 banks-0.005
 choke-0.004
 wounds-0.004
bank-0.004
Token "
Token ID366
Feature activation+3.74e-01

Pos logit contributions
itting+0.012
olly+0.011
 importantly+0.009
lic+0.009
apping+0.009
Neg logit contributions
 bank-0.014
 banks-0.012
 choke-0.011
 wounds-0.011
bank-0.010
TokenE
Token ID36
Feature activation+5.82e-01

Pos logit contributions
 bank+0.043
 banks+0.038
 way+0.035
 wounds+0.035
 find+0.034
Neg logit contributions
itting-0.025
olly-0.023
ww-0.021
www-0.020
mber-0.020
Tokenww
Token ID1383
Feature activation-4.14e+00

Pos logit contributions
 bank+0.062
 banks+0.053
 choke+0.049
bank+0.047
 wounds+0.046
Neg logit contributions
itting-0.047
olly-0.041
 importantly-0.040
lic-0.040
year-0.039
Token,
Token ID11
Feature activation-1.43e-01

Pos logit contributions
itting+0.476
year+0.438
olly+0.428
mber+0.416
 importantly+0.414
Neg logit contributions
 bank-0.308
 banks-0.249
 choke-0.214
 wounds-0.190
 necks-0.189
Token Bruno
Token ID31045
Feature activation+4.79e-02

Pos logit contributions
itting+0.014
olly+0.013
 importantly+0.011
mber+0.011
lic+0.011
Neg logit contributions
 bank-0.014
 banks-0.011
 choke-0.010
 wounds-0.010
bank-0.009
Token.
Token ID13
Feature activation+9.18e-02

Pos logit contributions
 bank+0.003
 banks+0.003
 choke+0.002
 wounds+0.002
bank+0.002
Neg logit contributions
itting-0.006
olly-0.005
 importantly-0.005
lic-0.005
mber-0.005
Token You
Token ID921
Feature activation-5.61e-01

Pos logit contributions
 bank+0.009
 banks+0.008
 choke+0.007
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.008
olly-0.008
mber-0.007
lic-0.007
 importantly-0.007
Token are
Token ID389
Feature activation+3.28e-01

Pos logit contributions
itting+0.056
olly+0.054
lic+0.047
 importantly+0.047
mber+0.046
Neg logit contributions
 bank-0.049
 banks-0.041
 wounds-0.035
bank-0.034
 choke-0.034
Token!"
Token ID2474
Feature activation-3.64e-01

Pos logit contributions
 bank+0.044
 banks+0.035
 choke+0.031
 find+0.030
 wounds+0.029
Neg logit contributions
itting-0.055
olly-0.052
 importantly-0.047
year-0.047
mber-0.046
Token\n
Token ID198
Feature activation-1.17e-03

Pos logit contributions
itting+0.047
olly+0.043
ane+0.039
reetings+0.039
ubby+0.038
Neg logit contributions
 bank-0.024
 banks-0.018
 wounds-0.015
 choke-0.015
 find-0.014
Token\n
Token ID198
Feature activation-4.49e-03

Pos logit contributions
itting+0.000
olly+0.000
 importantly+0.000
mber+0.000
year+0.000
Neg logit contributions
 bank-0.000
 banks-0.000
 wounds-0.000
 choke-0.000
 find-0.000
Token"
Token ID1
Feature activation-4.91e-01

Pos logit contributions
itting+0.000
olly+0.000
ous+0.000
 importantly+0.000
ane+0.000
Neg logit contributions
 bank-0.000
 banks-0.000
 wounds-0.000
 necks-0.000
 neck-0.000
TokenA
Token ID32
Feature activation+1.39e-01

Pos logit contributions
itting+0.044
olly+0.042
 importantly+0.036
apping+0.036
mber+0.035
Neg logit contributions
 bank-0.049
 banks-0.041
 choke-0.036
 wounds-0.036
 necks-0.035
Tokenww
Token ID1383
Feature activation-4.12e+00

Pos logit contributions
 bank+0.008
 way+0.006
 banks+0.006
 neck+0.006
 t+0.005
Neg logit contributions
itting-0.018
olly-0.016
year-0.016
omed-0.016
apping-0.015
Token!"
Token ID2474
Feature activation-3.61e-01

Pos logit contributions
itting+0.530
year+0.509
 importantly+0.481
itte+0.453
olly+0.452
Neg logit contributions
 bank-0.248
 banks-0.182
 way-0.156
 choke-0.154
 find-0.152
Token Lily
Token ID20037
Feature activation+2.86e-01

Pos logit contributions
itting+0.040
olly+0.036
reetings+0.032
apping+0.032
ane+0.032
Neg logit contributions
 bank-0.031
 banks-0.025
 choke-0.022
 wounds-0.021
 find-0.020
Token says
Token ID1139
Feature activation+3.54e-01

Pos logit contributions
 bank+0.033
 banks+0.028
 choke+0.025
 wounds+0.025
bank+0.025
Neg logit contributions
itting-0.021
olly-0.021
mber-0.017
apping-0.017
lic-0.016
Token.
Token ID13
Feature activation+7.91e-02

Pos logit contributions
 bank+0.021
 banks+0.016
 wounds+0.013
 choke+0.013
 find+0.012
Neg logit contributions
itting-0.046
olly-0.044
mber-0.040
 importantly-0.040
lic-0.040
Token "
Token ID366
Feature activation+3.74e-01

Pos logit contributions
 bank+0.006
 banks+0.005
 neck+0.004
 find+0.004
 choke+0.004
Neg logit contributions
itting-0.009
olly-0.008
 importantly-0.008
reetings-0.008
year-0.008
Token.
Token ID13
Feature activation+8.84e-02

Pos logit contributions
 bank+0.007
 banks+0.006
 choke+0.005
 wounds+0.005
 necks+0.005
Neg logit contributions
itting-0.012
olly-0.011
apping-0.010
 importantly-0.010
mber-0.010
Token\n
Token ID198
Feature activation-7.52e-02

Pos logit contributions
 bank+0.009
 banks+0.008
 choke+0.007
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.008
olly-0.007
 importantly-0.006
lic-0.006
apping-0.006
Token\n
Token ID198
Feature activation-7.43e-02

Pos logit contributions
itting+0.009
olly+0.008
 importantly+0.007
apping+0.007
mber+0.007
Neg logit contributions
 bank-0.006
 banks-0.005
 choke-0.004
 wounds-0.004
bank-0.004
Token"
Token ID1
Feature activation-5.05e-01

Pos logit contributions
itting+0.007
olly+0.006
 importantly+0.006
ous+0.006
ubby+0.006
Neg logit contributions
 bank-0.007
 banks-0.006
 find-0.005
 wounds-0.005
 choke-0.005
TokenE
Token ID36
Feature activation+5.48e-01

Pos logit contributions
itting+0.045
olly+0.043
 importantly+0.037
mber+0.036
lic+0.036
Neg logit contributions
 bank-0.051
 banks-0.044
 choke-0.039
 wounds-0.038
bank-0.037
Tokenww
Token ID1383
Feature activation-4.10e+00

Pos logit contributions
 bank+0.072
 banks+0.063
 choke+0.058
 necks+0.057
bank+0.057
Neg logit contributions
itting-0.031
olly-0.026
 importantly-0.025
lic-0.025
omed-0.023
Token,
Token ID11
Feature activation-2.52e-01

Pos logit contributions
itting+0.494
year+0.457
 importantly+0.434
olly+0.426
apping+0.417
Neg logit contributions
 bank-0.298
 banks-0.238
 choke-0.194
 find-0.180
 way-0.179
Token this
Token ID428
Feature activation+3.58e-01

Pos logit contributions
itting+0.025
olly+0.023
apping+0.021
 importantly+0.021
year+0.021
Neg logit contributions
 bank-0.023
 banks-0.019
 choke-0.017
 find-0.017
 wounds-0.016
Token fish
Token ID5916
Feature activation+5.20e-01

Pos logit contributions
 bank+0.029
 banks+0.023
 wounds+0.021
 choke+0.021
 necks+0.019
Neg logit contributions
itting-0.038
olly-0.037
mber-0.032
 importantly-0.032
lic-0.032
Token is
Token ID318
Feature activation-3.96e-01

Pos logit contributions
 bank+0.045
 banks+0.037
 choke+0.032
 wounds+0.031
bank+0.031
Neg logit contributions
itting-0.054
olly-0.051
 importantly-0.045
mber-0.045
lic-0.045
Token sm
Token ID895
Feature activation+3.35e-01

Pos logit contributions
itting+0.040
olly+0.038
 importantly+0.035
ubby+0.034
lic+0.034
Neg logit contributions
 bank-0.033
 banks-0.027
 wounds-0.024
 necks-0.023
 choke-0.022
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
 bank+0.036
 banks+0.028
 choke+0.023
 wounds+0.023
bank+0.022
Neg logit contributions
itting-0.060
olly-0.058
 importantly-0.052
mber-0.051
lic-0.051
Token\n
Token ID198
Feature activation+8.70e-02

Pos logit contributions
 bank+0.006
 banks+0.004
 choke+0.004
 neck+0.004
 find+0.004
Neg logit contributions
itting-0.014
olly-0.013
ross-0.012
ying-0.012
mber-0.012
Token\n
Token ID198
Feature activation+8.44e-02

Pos logit contributions
 bank+0.007
 banks+0.006
 choke+0.005
 wounds+0.005
bank+0.005
Neg logit contributions
itting-0.009
olly-0.009
 importantly-0.008
lic-0.008
apping-0.008
Token"
Token ID1
Feature activation-3.82e-01

Pos logit contributions
 bank+0.008
 banks+0.007
 necks+0.006
 wounds+0.006
 edge+0.006
Neg logit contributions
itting-0.008
olly-0.007
ous-0.007
 importantly-0.007
ancer-0.006
TokenA
Token ID32
Feature activation+1.29e-01

Pos logit contributions
itting+0.033
olly+0.031
 importantly+0.026
mber+0.026
apping+0.026
Neg logit contributions
 bank-0.040
 banks-0.034
 choke-0.030
 wounds-0.030
bank-0.029
Tokenww
Token ID1383
Feature activation-4.06e+00

Pos logit contributions
 bank+0.008
 banks+0.006
 way+0.005
 neck+0.005
 choke+0.005
Neg logit contributions
itting-0.017
year-0.015
olly-0.015
omed-0.015
apping-0.014
Token,
Token ID11
Feature activation-7.81e-02

Pos logit contributions
itting+0.512
year+0.489
 importantly+0.461
olly+0.447
itte+0.438
Neg logit contributions
 bank-0.256
 banks-0.195
 choke-0.167
 way-0.156
 find-0.154
Token it
Token ID340
Feature activation-4.89e-01

Pos logit contributions
itting+0.009
olly+0.008
apping+0.008
year+0.008
ane+0.008
Neg logit contributions
 bank-0.006
 banks-0.004
 wounds-0.004
 find-0.004
 t-0.004
Token is
Token ID318
Feature activation-2.76e-01

Pos logit contributions
itting+0.052
olly+0.050
 importantly+0.044
mber+0.044
apping+0.043
Neg logit contributions
 bank-0.041
 banks-0.033
 choke-0.028
 wounds-0.028
bank-0.027
Token so
Token ID523
Feature activation+3.23e-01

Pos logit contributions
itting+0.029
olly+0.028
 importantly+0.025
mber+0.024
lic+0.024
Neg logit contributions
 bank-0.023
 banks-0.019
 choke-0.016
 wounds-0.016
bank-0.015
Token cute
Token ID13779
Feature activation+1.59e-02

Pos logit contributions
 bank+0.021
 banks+0.015
 choke+0.013
 neck+0.013
 necks+0.012
Neg logit contributions
itting-0.041
olly-0.038
apping-0.035
omed-0.035
ane-0.035
Token He
Token ID679
Feature activation-1.95e-01

Pos logit contributions
 bank+0.002
 banks+0.002
 choke+0.001
bank+0.001
 wounds+0.001
Neg logit contributions
itting-0.003
olly-0.003
ubby-0.003
 importantly-0.003
year-0.003
Token says
Token ID1139
Feature activation+2.63e-01

Pos logit contributions
itting+0.017
olly+0.016
 importantly+0.014
mber+0.013
lic+0.013
Neg logit contributions
 bank-0.020
 banks-0.017
 choke-0.015
 wounds-0.015
bank-0.015
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
 bank+0.019
 banks+0.014
 wounds+0.013
 choke+0.013
bank+0.012
Neg logit contributions
itting-0.031
olly-0.028
 importantly-0.026
year-0.025
mber-0.025
Token "
Token ID366
Feature activation+3.64e-01

Pos logit contributions
itting+0.013
olly+0.013
 importantly+0.011
mber+0.011
ubby+0.011
Neg logit contributions
 bank-0.011
 banks-0.008
 choke-0.007
bank-0.007
 wounds-0.007
TokenA
Token ID32
Feature activation+9.36e-02

Pos logit contributions
 bank+0.039
 banks+0.034
 wounds+0.030
 choke+0.030
bank+0.029
Neg logit contributions
itting-0.031
olly-0.029
 importantly-0.024
mber-0.024
lic-0.024
Tokenww
Token ID1383
Feature activation-4.02e+00

Pos logit contributions
 bank+0.007
 way+0.005
 find+0.004
 neck+0.004
 bite+0.004
Neg logit contributions
itting-0.011
olly-0.010
year-0.010
ying-0.009
ancer-0.009
Token,
Token ID11
Feature activation-1.53e-01

Pos logit contributions
year+0.509
itting+0.507
 importantly+0.468
itte+0.457
mber+0.451
Neg logit contributions
 bank-0.228
 banks-0.155
 choke-0.149
 way-0.135
 bite-0.133
Token this
Token ID428
Feature activation+2.56e-01

Pos logit contributions
itting+0.016
olly+0.015
year+0.013
lic+0.013
apping+0.013
Neg logit contributions
 bank-0.013
 banks-0.010
 choke-0.010
 wounds-0.010
 find-0.009
Token is
Token ID318
Feature activation-4.54e-01

Pos logit contributions
 bank+0.019
 banks+0.014
 way+0.011
 find+0.011
 neck+0.011
Neg logit contributions
itting-0.030
olly-0.029
year-0.027
 importantly-0.027
lic-0.027
Token from
Token ID422
Feature activation+1.32e+00

Pos logit contributions
itting+0.051
olly+0.048
apping+0.042
mber+0.042
lic+0.042
Neg logit contributions
 bank-0.037
 banks-0.029
 choke-0.026
 wounds-0.025
 neck-0.024
Token Anna
Token ID11735
Feature activation+2.53e-01

Pos logit contributions
 bank+0.117
 banks+0.096
 choke+0.084
 wounds+0.083
 necks+0.079
Neg logit contributions
itting-0.136
olly-0.129
 importantly-0.113
lic-0.113
mber-0.112
Token make
Token ID787
Feature activation+9.19e-01

Pos logit contributions
 bank+0.003
 banks+0.002
 choke+0.002
 wounds+0.002
 find+0.002
Neg logit contributions
itting-0.004
olly-0.004
lic-0.003
apping-0.003
mber-0.003
Token it
Token ID340
Feature activation-5.98e-01

Pos logit contributions
 bank+0.089
 banks+0.075
 choke+0.065
bank+0.064
 necks+0.063
Neg logit contributions
itting-0.082
olly-0.080
 importantly-0.069
apping-0.068
ister-0.068
Token whole
Token ID2187
Feature activation+1.15e+00

Pos logit contributions
itting+0.057
olly+0.054
apping+0.048
lic+0.048
ous+0.047
Neg logit contributions
 bank-0.050
 banks-0.043
 necks-0.039
bank-0.037
 wounds-0.035
Token again
Token ID757
Feature activation+7.66e-01

Pos logit contributions
 bank+0.113
 banks+0.098
 necks+0.089
 wounds+0.087
bank+0.086
Neg logit contributions
itting-0.090
olly-0.086
ay-0.079
 importantly-0.077
apping-0.076
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
 bank+0.054
 banks+0.043
 choke+0.035
 wounds+0.035
bank+0.032
Neg logit contributions
itting-0.092
olly-0.088
lic-0.079
 importantly-0.079
mber-0.079
Token Amen
Token ID34717
Feature activation-3.85e+00

Pos logit contributions
 bank+0.012
 banks+0.010
 choke+0.009
 wounds+0.009
 find+0.009
Neg logit contributions
itting-0.015
olly-0.014
lic-0.012
mber-0.012
ying-0.012
Token."
Token ID526
Feature activation-3.72e-01

Pos logit contributions
itting+0.440
olly+0.416
lic+0.378
 importantly+0.375
mber+0.374
Neg logit contributions
 bank-0.290
 banks-0.228
 wounds-0.192
 choke-0.191
 find-0.185
Token\n
Token ID198
Feature activation+8.25e-02

Pos logit contributions
itting+0.037
olly+0.036
 importantly+0.032
mber+0.032
apping+0.032
Neg logit contributions
 bank-0.033
 banks-0.027
 choke-0.023
 wounds-0.023
bank-0.022
Token\n
Token ID198
Feature activation+7.42e-02

Pos logit contributions
 bank+0.007
 banks+0.005
 choke+0.005
 wounds+0.005
bank+0.004
Neg logit contributions
itting-0.009
olly-0.009
 importantly-0.008
mber-0.008
lic-0.008
TokenThen
Token ID6423
Feature activation+2.31e-01

Pos logit contributions
 bank+0.007
 banks+0.006
 wounds+0.005
 choke+0.005
 necks+0.005
Neg logit contributions
itting-0.007
olly-0.006
ous-0.006
omed-0.006
ane-0.006
Token,
Token ID11
Feature activation-1.37e-01

Pos logit contributions
 bank+0.019
 banks+0.015
 choke+0.013
 find+0.013
 way+0.012
Neg logit contributions
itting-0.026
olly-0.024
year-0.022
lic-0.022
 importantly-0.022
TokenAs
Token ID1722
Feature activation-5.42e-02

Pos logit contributions
itting+0.011
olly+0.010
 importantly+0.009
apping+0.009
ubby+0.009
Neg logit contributions
 bank-0.010
 banks-0.009
 choke-0.007
 wounds-0.007
bank-0.007
Token the
Token ID262
Feature activation-2.62e-01

Pos logit contributions
itting+0.007
olly+0.006
lic+0.006
apping+0.006
ane+0.005
Neg logit contributions
 bank-0.004
 banks-0.003
 choke-0.003
 way-0.002
bank-0.002
Token light
Token ID1657
Feature activation-4.61e-01

Pos logit contributions
itting+0.030
olly+0.028
apping+0.026
lic+0.026
reetings+0.025
Neg logit contributions
 bank-0.021
 banks-0.017
 way-0.015
 tears-0.014
 find-0.013
Tokenening
Token ID3101
Feature activation-2.68e-01

Pos logit contributions
itting+0.051
olly+0.050
arl+0.044
 importantly+0.043
year+0.043
Neg logit contributions
 bank-0.038
 banks-0.031
 way-0.024
bank-0.024
 find-0.024
Token f
Token ID277
Feature activation-1.86e-01

Pos logit contributions
itting+0.030
olly+0.029
lic+0.026
 importantly+0.025
year+0.025
Neg logit contributions
 bank-0.022
 banks-0.018
bank-0.014
 choke-0.014
 way-0.014
Tokenlic
Token ID677
Feature activation-3.83e+00

Pos logit contributions
mber+0.012
olly+0.011
www+0.011
ww+0.011
 importantly+0.011
Neg logit contributions
 bank-0.019
 banks-0.018
bank-0.017
 backs-0.014
 piles-0.014
Tokenkered
Token ID28970
Feature activation-5.37e-01

Pos logit contributions
itting+0.309
olly+0.286
mber+0.242
 importantly+0.242
reetings+0.234
Neg logit contributions
 bank-0.425
 banks-0.370
bank-0.318
 choke-0.313
 wounds-0.310
Token away
Token ID1497
Feature activation-6.05e-02

Pos logit contributions
itting+0.058
olly+0.056
 importantly+0.049
mber+0.049
year+0.049
Neg logit contributions
 bank-0.044
 banks-0.036
 choke-0.030
 wounds-0.029
bank-0.029
Token,
Token ID11
Feature activation-3.53e-01

Pos logit contributions
itting+0.007
olly+0.007
 importantly+0.006
lic+0.006
year+0.006
Neg logit contributions
 bank-0.005
 banks-0.004
 choke-0.003
 wounds-0.003
bank-0.003
Token the
Token ID262
Feature activation-2.47e-01

Pos logit contributions
itting+0.041
olly+0.040
lic+0.035
apping+0.035
ubby+0.035
Neg logit contributions
 bank-0.027
 banks-0.021
 choke-0.018
 wounds-0.017
bank-0.016
Token man
Token ID582
Feature activation+5.99e-01

Pos logit contributions
itting+0.027
olly+0.026
lic+0.023
apping+0.023
year+0.023
Neg logit contributions
 bank-0.021
 banks-0.017
 wounds-0.014
 neck-0.014
 choke-0.014
Token.
Token ID13
Feature activation+5.70e-02

Pos logit contributions
itting+0.106
olly+0.102
 importantly+0.092
mber+0.091
lic+0.090
Neg logit contributions
 bank-0.060
 banks-0.046
 choke-0.037
 wounds-0.037
bank-0.035
Token She
Token ID1375
Feature activation-3.49e-01

Pos logit contributions
 bank+0.004
 banks+0.003
 find+0.003
bank+0.003
 choke+0.003
Neg logit contributions
itting-0.006
olly-0.006
ubby-0.006
ross-0.005
ier-0.005
Token used
Token ID973
Feature activation+8.85e-01

Pos logit contributions
itting+0.036
olly+0.034
 importantly+0.030
mber+0.030
lic+0.029
Neg logit contributions
 bank-0.031
 banks-0.025
 choke-0.022
 wounds-0.022
bank-0.021
Token a
Token ID257
Feature activation+9.99e-02

Pos logit contributions
 bank+0.076
 banks+0.064
 choke+0.054
bank+0.053
 wounds+0.052
Neg logit contributions
itting-0.090
olly-0.085
 importantly-0.076
year-0.076
lic-0.076
Token m
Token ID285
Feature activation-6.31e-01

Pos logit contributions
 bank+0.008
 banks+0.006
 choke+0.005
 wounds+0.005
bank+0.005
Neg logit contributions
itting-0.011
olly-0.011
 importantly-0.009
mber-0.009
lic-0.009
Tokenitt
Token ID715
Feature activation-3.82e+00

Pos logit contributions
itting+0.031
olly+0.027
 importantly+0.021
mber+0.021
lic+0.020
Neg logit contributions
 bank-0.088
 banks-0.078
 choke-0.073
 wounds-0.073
 necks-0.070
Token to
Token ID284
Feature activation+3.41e-01

Pos logit contributions
itting+0.306
olly+0.291
 importantly+0.246
mber+0.243
lic+0.240
Neg logit contributions
 bank-0.416
 banks-0.356
 choke-0.319
 wounds-0.313
bank-0.307
Token take
Token ID1011
Feature activation-7.89e-01

Pos logit contributions
 bank+0.023
 banks+0.018
 choke+0.015
 wounds+0.014
 find+0.014
Neg logit contributions
itting-0.042
olly-0.041
 importantly-0.037
mber-0.037
apping-0.036
Token out
Token ID503
Feature activation+9.32e-01

Pos logit contributions
itting+0.091
olly+0.088
 importantly+0.079
mber+0.078
apping+0.078
Neg logit contributions
 bank-0.060
 banks-0.047
 choke-0.039
 wounds-0.039
 find-0.036
Token the
Token ID262
Feature activation-1.45e-01

Pos logit contributions
 bank+0.073
 banks+0.058
 wounds+0.049
 choke+0.048
 necks+0.046
Neg logit contributions
itting-0.105
olly-0.101
mber-0.089
 importantly-0.089
apping-0.088
Token pan
Token ID3425
Feature activation-4.17e-02

Pos logit contributions
itting+0.014
olly+0.013
 importantly+0.011
ubby+0.011
ay+0.011
Neg logit contributions
 bank-0.013
 banks-0.011
bank-0.010
 choke-0.010
 necks-0.009
Token\n
Token ID198
Feature activation+3.07e-02

Pos logit contributions
 bank+0.003
 banks+0.002
 choke+0.002
 wounds+0.002
bank+0.002
Neg logit contributions
itting-0.003
olly-0.003
 importantly-0.003
lic-0.003
apping-0.003
TokenThey
Token ID2990
Feature activation+2.55e-02

Pos logit contributions
 bank+0.003
 banks+0.002
 wounds+0.002
 necks+0.002
 find+0.002
Neg logit contributions
itting-0.003
olly-0.003
ous-0.003
ane-0.003
ancer-0.002
Token get
Token ID651
Feature activation-8.26e-03

Pos logit contributions
 bank+0.002
 banks+0.002
 choke+0.001
 wounds+0.001
bank+0.001
Neg logit contributions
itting-0.003
olly-0.003
 importantly-0.002
mber-0.002
lic-0.002
Token a
Token ID257
Feature activation+1.47e-01

Pos logit contributions
itting+0.001
olly+0.001
year+0.001
apping+0.001
apped+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
bank-0.001
 necks-0.001
 choke-0.001
Token m
Token ID285
Feature activation-5.72e-01

Pos logit contributions
 bank+0.012
 banks+0.010
 wounds+0.008
 necks+0.008
 choke+0.008
Neg logit contributions
itting-0.016
olly-0.015
lic-0.014
mber-0.013
apping-0.013
Tokenitt
Token ID715
Feature activation-3.78e+00

Pos logit contributions
itting+0.024
olly+0.021
 importantly+0.015
mber+0.015
lic+0.014
Neg logit contributions
 bank-0.084
 banks-0.075
 choke-0.071
 wounds-0.070
bank-0.068
Token and
Token ID290
Feature activation+2.40e-02

Pos logit contributions
itting+0.370
olly+0.353
 importantly+0.308
mber+0.304
lic+0.303
Neg logit contributions
 bank-0.348
 banks-0.290
 choke-0.252
 wounds-0.250
bank-0.241
Token a
Token ID257
Feature activation+1.51e-01

Pos logit contributions
 bank+0.002
 banks+0.001
 choke+0.001
 wounds+0.001
bank+0.001
Neg logit contributions
itting-0.003
olly-0.003
 importantly-0.002
mber-0.002
lic-0.002
Token spat
Token ID15246
Feature activation+4.45e-01

Pos logit contributions
 bank+0.012
 banks+0.010
 wounds+0.008
bank+0.008
 choke+0.008
Neg logit contributions
itting-0.016
olly-0.016
lic-0.014
mber-0.014
 importantly-0.014
Tokenula
Token ID4712
Feature activation+4.96e-01

Pos logit contributions
 bank+0.040
 banks+0.034
 choke+0.029
 wounds+0.029
bank+0.028
Neg logit contributions
itting-0.044
olly-0.043
apping-0.037
 importantly-0.037
lic-0.036
Token.
Token ID13
Feature activation+6.65e-02

Pos logit contributions
 bank+0.040
 banks+0.033
 wounds+0.030
 necks+0.028
 choke+0.028
Neg logit contributions
itting-0.052
olly-0.050
apping-0.044
lic-0.043
ay-0.042
Token put
Token ID1234
Feature activation-6.92e-02

Pos logit contributions
itting+0.035
olly+0.033
 importantly+0.029
mber+0.029
lic+0.029
Neg logit contributions
 bank-0.029
 banks-0.023
 choke-0.020
 wounds-0.020
bank-0.019
Token on
Token ID319
Feature activation-5.11e-01

Pos logit contributions
itting+0.008
olly+0.008
 importantly+0.007
apping+0.007
mber+0.007
Neg logit contributions
 bank-0.005
 banks-0.004
 wounds-0.004
 neck-0.003
 choke-0.003
Token an
Token ID281
Feature activation+6.33e-01

Pos logit contributions
itting+0.043
olly+0.043
ild+0.037
mber+0.035
year+0.035
Neg logit contributions
 bank-0.048
 banks-0.042
bank-0.037
 choke-0.037
 wounds-0.035
Token oven
Token ID14361
Feature activation-7.21e-01

Pos logit contributions
 bank+0.057
 banks+0.047
 choke+0.041
 wounds+0.041
bank+0.039
Neg logit contributions
itting-0.063
olly-0.060
 importantly-0.053
mber-0.052
lic-0.052
Token m
Token ID285
Feature activation-5.81e-01

Pos logit contributions
itting+0.089
olly+0.087
 importantly+0.079
mber+0.078
ubby+0.077
Neg logit contributions
 bank-0.049
 banks-0.037
 wounds-0.029
 choke-0.029
 necks-0.028
Tokenitt
Token ID715
Feature activation-3.78e+00

Pos logit contributions
itting+0.031
olly+0.029
 importantly+0.026
mber+0.026
lic+0.023
Neg logit contributions
 bank-0.075
 banks-0.067
 choke-0.063
 wounds-0.063
bank-0.061
Token and
Token ID290
Feature activation-9.03e-03

Pos logit contributions
itting+0.411
olly+0.396
 importantly+0.354
mber+0.349
lic+0.348
Neg logit contributions
 bank-0.307
 banks-0.249
 choke-0.209
 wounds-0.205
bank-0.200
Token a
Token ID257
Feature activation+1.07e-01

Pos logit contributions
itting+0.001
olly+0.001
apping+0.001
 importantly+0.001
ubby+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 choke-0.000
 wounds-0.000
 find-0.000
Token a
Token ID257
Feature activation+1.04e-01

Pos logit contributions
 bank+0.009
 banks+0.007
 choke+0.006
 wounds+0.006
bank+0.006
Neg logit contributions
itting-0.011
olly-0.011
 importantly-0.010
mber-0.010
lic-0.010
Tokenpron
Token ID31186
Feature activation-2.29e+00

Pos logit contributions
 bank+0.009
 banks+0.007
 choke+0.006
 wounds+0.006
bank+0.006
Neg logit contributions
itting-0.011
olly-0.011
 importantly-0.009
apping-0.009
mber-0.009
Token.
Token ID13
Feature activation+6.62e-02

Pos logit contributions
itting+0.260
olly+0.251
 importantly+0.224
mber+0.222
lic+0.220
Neg logit contributions
 bank-0.176
 banks-0.140
 choke-0.117
 wounds-0.115
bank-0.112
Token was
Token ID373
Feature activation-3.92e-01

Pos logit contributions
itting+0.107
olly+0.103
 importantly+0.091
lic+0.090
mber+0.090
Neg logit contributions
 bank-0.075
 banks-0.060
 choke-0.052
 wounds-0.050
bank-0.048
Token seeing
Token ID4379
Feature activation+2.37e-01

Pos logit contributions
itting+0.044
olly+0.042
 importantly+0.038
mber+0.037
lic+0.037
Neg logit contributions
 bank-0.030
 banks-0.024
 choke-0.020
 wounds-0.020
bank-0.019
Token.
Token ID13
Feature activation+2.63e-01

Pos logit contributions
 bank+0.019
 banks+0.016
 choke+0.013
 wounds+0.013
bank+0.013
Neg logit contributions
itting-0.026
olly-0.025
 importantly-0.022
lic-0.022
mber-0.022
Token
Token ID198
Feature activation+8.38e-02

Pos logit contributions
 bank+0.020
 banks+0.016
bank+0.014
 choke+0.013
 find+0.013
Neg logit contributions
itting-0.030
olly-0.028
ubby-0.025
 importantly-0.025
year-0.024
Token
Token ID198
Feature activation+4.86e-02

Pos logit contributions
 bank+0.005
 banks+0.004
 wounds+0.003
bank+0.003
 choke+0.003
Neg logit contributions
itting-0.010
olly-0.010
 importantly-0.009
ous-0.009
ubby-0.009
TokenP
Token ID47
Feature activation-3.76e+00

Pos logit contributions
 bank+0.004
 banks+0.004
 find+0.003
bank+0.003
 wounds+0.003
Neg logit contributions
itting-0.005
olly-0.004
ous-0.004
ancer-0.004
ane-0.004
Tokenolly
Token ID5098
Feature activation-1.44e+00

Pos logit contributions
itting+0.280
 importantly+0.246
olly+0.235
ous+0.233
year+0.230
Neg logit contributions
 bank-0.406
 banks-0.346
 choke-0.306
 wounds-0.302
 necks-0.298
Token wanted
Token ID2227
Feature activation-3.06e-01

Pos logit contributions
itting+0.165
olly+0.153
 importantly+0.142
year+0.141
ancer+0.138
Neg logit contributions
 bank-0.113
 banks-0.090
 necks-0.072
 neck-0.072
bank-0.070
Token to
Token ID284
Feature activation+3.31e-01

Pos logit contributions
itting+0.033
olly+0.032
mber+0.028
 importantly+0.028
lic+0.028
Neg logit contributions
 bank-0.024
 banks-0.020
 choke-0.017
 wounds-0.016
 necks-0.016
Token find
Token ID1064
Feature activation+1.23e+00

Pos logit contributions
 bank+0.026
 banks+0.021
 wounds+0.019
 necks+0.018
bank+0.018
Neg logit contributions
itting-0.034
olly-0.034
ister-0.030
mber-0.029
 importantly-0.029
Token out
Token ID503
Feature activation+9.53e-01

Pos logit contributions
 bank+0.094
 banks+0.075
 wounds+0.063
 choke+0.063
bank+0.059
Neg logit contributions
itting-0.141
olly-0.135
 importantly-0.120
apping-0.119
mber-0.119
Token ordered
Token ID6149
Feature activation+6.34e-01

Pos logit contributions
itting+0.017
olly+0.016
 importantly+0.014
mber+0.014
lic+0.014
Neg logit contributions
 bank-0.016
 banks-0.013
 choke-0.011
 wounds-0.011
bank-0.011
Token one
Token ID530
Feature activation+1.94e-01

Pos logit contributions
 bank+0.047
 banks+0.038
 choke+0.031
 wounds+0.031
bank+0.029
Neg logit contributions
itting-0.073
olly-0.070
 importantly-0.062
mber-0.062
lic-0.062
Token last
Token ID938
Feature activation+9.66e-01

Pos logit contributions
 bank+0.018
 banks+0.015
 choke+0.013
 wounds+0.013
bank+0.013
Neg logit contributions
itting-0.019
olly-0.018
 importantly-0.016
mber-0.015
apping-0.015
Token apple
Token ID17180
Feature activation+4.21e-01

Pos logit contributions
 bank+0.077
 banks+0.066
 wounds+0.056
 choke+0.054
 necks+0.053
Neg logit contributions
itting-0.102
olly-0.097
 importantly-0.085
mber-0.085
apping-0.084
Token for
Token ID329
Feature activation-6.49e-01

Pos logit contributions
 bank+0.034
 banks+0.029
 wounds+0.025
 choke+0.025
 necks+0.024
Neg logit contributions
itting-0.043
olly-0.042
 importantly-0.036
apping-0.036
mber-0.036
Token himself
Token ID2241
Feature activation+1.93e+00

Pos logit contributions
itting+0.075
olly+0.072
 importantly+0.064
mber+0.064
lic+0.063
Neg logit contributions
 bank-0.049
 banks-0.039
 choke-0.032
 wounds-0.032
bank-0.030
Token and
Token ID290
Feature activation-7.03e-03

Pos logit contributions
 bank+0.158
 banks+0.131
 wounds+0.112
 choke+0.112
bank+0.105
Neg logit contributions
itting-0.204
olly-0.196
 importantly-0.173
apping-0.171
mber-0.170
Token then
Token ID788
Feature activation+9.22e-01

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
lic+0.001
apping+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 choke-0.000
 wounds-0.000
bank-0.000
Token went
Token ID1816
Feature activation-3.94e-01

Pos logit contributions
 bank+0.081
 banks+0.063
 choke+0.054
bank+0.053
 find+0.052
Neg logit contributions
itting-0.102
olly-0.095
lic-0.085
ubby-0.084
apping-0.083
Token on
Token ID319
Feature activation-4.72e-01

Pos logit contributions
itting+0.052
olly+0.049
 importantly+0.046
mber+0.045
lic+0.044
Neg logit contributions
 bank-0.025
 banks-0.018
 find-0.014
 wounds-0.014
bank-0.013
Token his
Token ID465
Feature activation-4.09e-02

Pos logit contributions
itting+0.045
olly+0.044
 importantly+0.041
ild+0.040
year+0.039
Neg logit contributions
 bank-0.038
 banks-0.033
 choke-0.028
 wounds-0.028
bank-0.027
Token gift
Token ID6979
Feature activation-1.23e-03

Pos logit contributions
 bank+0.007
 banks+0.006
 choke+0.005
 wounds+0.005
bank+0.005
Neg logit contributions
itting-0.008
olly-0.008
mber-0.007
 importantly-0.007
apping-0.007
Token.
Token ID13
Feature activation+9.03e-02

Pos logit contributions
itting+0.000
olly+0.000
 importantly+0.000
lic+0.000
mber+0.000
Neg logit contributions
 bank-0.000
 banks-0.000
 choke-0.000
 wounds-0.000
 find-0.000
Token Choose
Token ID17489
Feature activation+6.29e-02

Pos logit contributions
 bank+0.008
 banks+0.007
 neck+0.006
 find+0.006
 choke+0.006
Neg logit contributions
itting-0.009
olly-0.008
lic-0.008
mber-0.007
ous-0.007
Token what
Token ID644
Feature activation-1.24e+00

Pos logit contributions
 bank+0.005
 banks+0.004
 wounds+0.004
 choke+0.003
 necks+0.003
Neg logit contributions
itting-0.007
olly-0.007
 importantly-0.006
apping-0.006
lic-0.006
Token you
Token ID345
Feature activation-5.79e-01

Pos logit contributions
itting+0.112
olly+0.103
lic+0.097
year+0.094
omed+0.091
Neg logit contributions
 bank-0.126
 banks-0.104
 find-0.096
 wounds-0.095
 choke-0.094
Token like
Token ID588
Feature activation+1.80e+00

Pos logit contributions
itting+0.066
olly+0.064
 importantly+0.059
apping+0.058
mber+0.057
Neg logit contributions
 bank-0.046
 banks-0.036
 find-0.031
 choke-0.030
 wounds-0.028
Token."
Token ID526
Feature activation-4.17e-01

Pos logit contributions
 bank+0.152
 banks+0.124
 choke+0.106
 wounds+0.105
bank+0.101
Neg logit contributions
itting-0.190
olly-0.182
 importantly-0.161
mber-0.159
lic-0.158
Token She
Token ID1375
Feature activation-3.25e-01

Pos logit contributions
itting+0.046
olly+0.044
 importantly+0.039
lic+0.039
mber+0.039
Neg logit contributions
 bank-0.033
 banks-0.027
 choke-0.023
 wounds-0.022
bank-0.021
Token said
Token ID531
Feature activation-9.40e-02

Pos logit contributions
itting+0.033
olly+0.032
 importantly+0.028
mber+0.028
lic+0.028
Neg logit contributions
 bank-0.028
 banks-0.023
 choke-0.020
 wounds-0.020
bank-0.019
Token.
Token ID13
Feature activation+7.25e-02

Pos logit contributions
itting+0.012
olly+0.011
mber+0.010
lic+0.010
 importantly+0.010
Neg logit contributions
 bank-0.006
 banks-0.005
 wounds-0.004
 choke-0.004
bank-0.004
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
 bank+0.005
 banks+0.003
 choke+0.003
bank+0.003
 neck+0.003
Neg logit contributions
itting-0.009
olly-0.008
year-0.008
ubby-0.008
 importantly-0.008
Token to
Token ID284
Feature activation+3.38e-01

Pos logit contributions
 bank+0.041
 banks+0.032
 choke+0.026
 wounds+0.025
 find+0.024
Neg logit contributions
itting-0.074
olly-0.072
 importantly-0.065
mber-0.065
lic-0.065
Token go
Token ID467
Feature activation-1.21e-01

Pos logit contributions
 bank+0.028
 banks+0.023
 wounds+0.020
 choke+0.020
bank+0.019
Neg logit contributions
itting-0.035
olly-0.034
ister-0.029
ubby-0.029
mber-0.029
Token home
Token ID1363
Feature activation+4.53e-01

Pos logit contributions
itting+0.015
olly+0.014
 importantly+0.013
mber+0.013
lic+0.013
Neg logit contributions
 bank-0.008
 banks-0.006
 choke-0.005
 wounds-0.005
bank-0.005
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
 bank+0.037
 banks+0.031
 necks+0.026
 choke+0.026
 wounds+0.026
Neg logit contributions
itting-0.047
olly-0.046
apping-0.042
ister-0.040
 importantly-0.040
Token The
Token ID383
Feature activation+3.43e-01

Pos logit contributions
 bank+0.011
 banks+0.009
 choke+0.007
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.015
olly-0.014
 importantly-0.013
apping-0.013
mber-0.013
Token end
Token ID886
Feature activation+1.60e+00

Pos logit contributions
 bank+0.026
 banks+0.020
 wounds+0.017
 choke+0.016
 find+0.016
Neg logit contributions
itting-0.040
olly-0.038
lic-0.035
 importantly-0.034
apping-0.034
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
 bank+0.062
 banks+0.038
 way+0.032
 find+0.032
 choke+0.031
Neg logit contributions
itting-0.226
olly-0.213
mber-0.203
year-0.202
ubby-0.201
Token<|endoftext|>
Token ID50256
Feature activation+2.12e-01

Pos logit contributions
 bank+0.010
 banks+0.008
 choke+0.007
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.013
olly-0.012
 importantly-0.011
lic-0.011
mber-0.011
TokenOnce
Token ID7454
Feature activation+3.30e-01

Pos logit contributions
 bank+0.018
 banks+0.015
 choke+0.013
 wounds+0.013
bank+0.012
Neg logit contributions
itting-0.022
olly-0.021
lic-0.019
 importantly-0.019
apping-0.019
Token upon
Token ID2402
Feature activation+2.88e-01

Pos logit contributions
 bank+0.030
 banks+0.025
 find+0.023
 choke+0.022
 wounds+0.022
Neg logit contributions
itting-0.030
olly-0.027
lic-0.025
 importantly-0.025
year-0.025
Token a
Token ID257
Feature activation+7.93e-02

Pos logit contributions
 bank+0.029
 banks+0.024
bank+0.021
 wounds+0.021
 necks+0.020
Neg logit contributions
itting-0.026
olly-0.025
lic-0.022
year-0.022
 importantly-0.022
Token smell
Token ID8508
Feature activation+8.74e-01

Pos logit contributions
 bank+0.046
 banks+0.039
 wounds+0.036
 choke+0.035
 necks+0.035
Neg logit contributions
olly-0.054
itting-0.054
ubby-0.047
ww-0.047
www-0.046
Token.
Token ID13
Feature activation+6.10e-02

Pos logit contributions
 bank+0.073
 banks+0.060
 choke+0.051
 wounds+0.050
bank+0.048
Neg logit contributions
itting-0.093
olly-0.089
 importantly-0.079
lic-0.078
mber-0.078
Token Lily
Token ID20037
Feature activation+2.76e-01

Pos logit contributions
 bank+0.006
 banks+0.005
 wounds+0.004
 choke+0.004
 necks+0.004
Neg logit contributions
itting-0.006
olly-0.006
mber-0.005
lic-0.005
 importantly-0.005
Token did
Token ID750
Feature activation-7.81e-01

Pos logit contributions
 bank+0.026
 banks+0.022
 choke+0.020
 wounds+0.020
bank+0.018
Neg logit contributions
itting-0.024
olly-0.024
lic-0.021
mber-0.021
apping-0.020
Token not
Token ID407
Feature activation-7.68e-02

Pos logit contributions
itting+0.067
olly+0.062
apping+0.055
lic+0.055
otion+0.054
Neg logit contributions
 bank-0.078
 banks-0.067
 necks-0.057
bank-0.056
 choke-0.056
Token like
Token ID588
Feature activation+1.75e+00

Pos logit contributions
itting+0.008
olly+0.008
lic+0.007
 importantly+0.007
mber+0.007
Neg logit contributions
 bank-0.007
 banks-0.006
 wounds-0.005
 choke-0.005
bank-0.005
Token it
Token ID340
Feature activation-5.50e-01

Pos logit contributions
 bank+0.170
 banks+0.147
 wounds+0.124
 choke+0.122
bank+0.120
Neg logit contributions
itting-0.154
olly-0.146
 importantly-0.132
mber-0.127
unt-0.126
Token and
Token ID290
Feature activation-2.17e-02

Pos logit contributions
itting+0.056
olly+0.053
 importantly+0.046
apping+0.046
lic+0.045
Neg logit contributions
 bank-0.048
 banks-0.040
 wounds-0.036
 necks-0.034
 choke-0.034
Token ran
Token ID4966
Feature activation-2.29e-01

Pos logit contributions
itting+0.002
olly+0.002
 importantly+0.002
apping+0.002
lic+0.002
Neg logit contributions
 bank-0.002
 banks-0.002
 choke-0.001
 wounds-0.001
bank-0.001
Token away
Token ID1497
Feature activation-2.25e-01

Pos logit contributions
itting+0.020
olly+0.020
lic+0.019
year+0.018
ha+0.017
Neg logit contributions
 bank-0.019
 banks-0.017
 choke-0.015
 wounds-0.015
 necks-0.015
Token.
Token ID13
Feature activation+5.87e-02

Pos logit contributions
itting+0.020
olly+0.019
apping+0.017
lic+0.017
ous+0.017
Neg logit contributions
 bank-0.021
 banks-0.018
 wounds-0.016
 necks-0.016
 choke-0.015
Token "
Token ID366
Feature activation+3.97e-01

Pos logit contributions
itting+0.010
olly+0.010
mber+0.009
ubby+0.009
lic+0.009
Neg logit contributions
 bank-0.008
 banks-0.006
 wounds-0.005
 choke-0.005
bank-0.005
TokenOkay
Token ID16454
Feature activation-2.29e-01

Pos logit contributions
 bank+0.042
 banks+0.035
 choke+0.032
 wounds+0.031
bank+0.031
Neg logit contributions
itting-0.033
olly-0.032
 importantly-0.027
lic-0.026
apping-0.026
Token Max
Token ID5436
Feature activation-3.57e-01

Pos logit contributions
itting+0.027
olly+0.025
year+0.023
lic+0.023
 importantly+0.023
Neg logit contributions
 bank-0.017
 banks-0.013
 find-0.011
 wounds-0.011
 way-0.011
Token,
Token ID11
Feature activation-2.00e-01

Pos logit contributions
itting+0.039
olly+0.037
 importantly+0.034
lic+0.034
year+0.033
Neg logit contributions
 bank-0.029
 banks-0.023
 choke-0.019
bank-0.019
 wounds-0.019
Token I
Token ID314
Feature activation-3.46e-01

Pos logit contributions
itting+0.021
olly+0.020
lic+0.018
year+0.018
mber+0.018
Neg logit contributions
 bank-0.017
 banks-0.014
 wounds-0.012
 choke-0.012
 find-0.012
Token promise
Token ID6991
Feature activation+1.57e+00

Pos logit contributions
itting+0.036
olly+0.035
 importantly+0.031
mber+0.030
apping+0.030
Neg logit contributions
 bank-0.030
 banks-0.025
 choke-0.021
 wounds-0.021
 find-0.021
Token".
Token ID1911
Feature activation-2.28e-01

Pos logit contributions
 bank+0.124
 banks+0.098
 wounds+0.084
 choke+0.084
 find+0.081
Neg logit contributions
itting-0.178
olly-0.171
lic-0.153
year-0.152
 importantly-0.151
Token
Token ID198
Feature activation+5.33e-02

Pos logit contributions
olly+0.020
itting+0.020
 importantly+0.017
lic+0.017
unt+0.017
Neg logit contributions
 bank-0.023
 banks-0.019
 wounds-0.017
 choke-0.016
 necks-0.016
Token
Token ID198
Feature activation+6.05e-02

Pos logit contributions
 bank+0.005
 banks+0.004
 choke+0.003
 wounds+0.003
bank+0.003
Neg logit contributions
itting-0.005
olly-0.005
lic-0.005
 importantly-0.005
mber-0.005
TokenFrom
Token ID4863
Feature activation+3.92e-01

Pos logit contributions
 bank+0.007
 banks+0.006
 choke+0.006
bank+0.006
 wounds+0.006
Neg logit contributions
itting-0.004
olly-0.004
 importantly-0.003
lic-0.003
mber-0.003
Token that
Token ID326
Feature activation+7.54e-02

Pos logit contributions
 bank+0.027
 banks+0.022
 wounds+0.020
 himself+0.020
 necks+0.016
Neg logit contributions
itting-0.049
olly-0.044
lic-0.042
apping-0.041
omed-0.041
Token again
Token ID757
Feature activation+7.45e-01

Pos logit contributions
itting+0.082
olly+0.079
 importantly+0.071
apping+0.071
lic+0.071
Neg logit contributions
 bank-0.038
 banks-0.029
 choke-0.022
 wounds-0.022
bank-0.020
Token,
Token ID11
Feature activation-2.29e-01

Pos logit contributions
 bank+0.051
 banks+0.040
 choke+0.033
 wounds+0.032
bank+0.030
Neg logit contributions
itting-0.090
olly-0.087
 importantly-0.078
mber-0.077
lic-0.077
Token ok
Token ID12876
Feature activation-7.47e-01

Pos logit contributions
itting+0.023
olly+0.022
 importantly+0.020
apping+0.019
mber+0.019
Neg logit contributions
 bank-0.020
 banks-0.017
 choke-0.014
 wounds-0.014
bank-0.014
Token?"
Token ID1701
Feature activation-1.73e+00

Pos logit contributions
itting+0.065
olly+0.063
year+0.053
lic+0.053
mber+0.053
Neg logit contributions
 bank-0.076
 banks-0.065
 wounds-0.060
 choke-0.059
 find-0.058
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
itting+0.169
olly+0.163
apping+0.142
 importantly+0.142
ister+0.139
Neg logit contributions
 bank-0.161
 banks-0.135
 choke-0.113
 wounds-0.113
bank-0.111
Token
Token ID198
Feature activation-1.90e-02

Pos logit contributions
itting+0.002
olly+0.002
ister+0.002
apping+0.002
 importantly+0.002
Neg logit contributions
 bank-0.002
 banks-0.002
 choke-0.001
bank-0.001
 wounds-0.001
TokenTom
Token ID13787
Feature activation-1.13e-01

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
mber+0.001
apping+0.001
Neg logit contributions
 bank-0.002
 banks-0.002
 choke-0.002
 wounds-0.002
bank-0.002
Token nodded
Token ID14464
Feature activation-1.37e+00

Pos logit contributions
itting+0.011
olly+0.011
 importantly+0.009
mber+0.009
lic+0.009
Neg logit contributions
 bank-0.010
 banks-0.008
 choke-0.007
 wounds-0.007
bank-0.007
Token and
Token ID290
Feature activation-1.69e-02

Pos logit contributions
itting+0.137
olly+0.132
apping+0.121
year+0.119
 importantly+0.117
Neg logit contributions
 bank-0.123
 banks-0.104
bank-0.086
 choke-0.086
 necks-0.081
Token hugged
Token ID41130
Feature activation-1.51e-02

Pos logit contributions
itting+0.002
olly+0.002
ubby+0.001
reetings+0.001
 importantly+0.001
Neg logit contributions
 bank-0.002
 banks-0.001
 wounds-0.001
 choke-0.001
bank-0.001
Token her
Token ID607
Feature activation+1.08e-01

Pos logit contributions
itting+0.001
olly+0.001
 importantly+0.001
ister+0.001
year+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 choke-0.001
bank-0.001
 necks-0.001
TokenThe
Token ID464
Feature activation+1.05e+00

Pos logit contributions
 bank+0.001
 banks+0.001
 choke+0.001
 wounds+0.001
 necks+0.001
Neg logit contributions
itting-0.001
olly-0.001
 importantly-0.001
lic-0.001
omed-0.001
Token doctor
Token ID6253
Feature activation-2.03e-03

Pos logit contributions
 bank+0.103
 banks+0.088
 choke+0.075
 wounds+0.075
bank+0.074
Neg logit contributions
itting-0.096
olly-0.093
 importantly-0.080
mber-0.078
lic-0.078
Token smiled
Token ID13541
Feature activation+1.59e-01

Pos logit contributions
itting+0.000
olly+0.000
 importantly+0.000
ubby+0.000
apping+0.000
Neg logit contributions
 bank-0.000
 banks-0.000
 choke-0.000
 wounds-0.000
 find-0.000
Token at
Token ID379
Feature activation-1.42e+00

Pos logit contributions
 bank+0.011
 banks+0.009
 choke+0.007
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.019
olly-0.018
 importantly-0.016
apping-0.016
mber-0.016
Token her
Token ID607
Feature activation+6.79e-02

Pos logit contributions
itting+0.147
olly+0.137
 importantly+0.122
lic+0.122
apping+0.121
Neg logit contributions
 bank-0.123
 banks-0.101
 wounds-0.089
 choke-0.089
 necks-0.084
Token and
Token ID290
Feature activation-6.10e-02

Pos logit contributions
 bank+0.006
 banks+0.005
bank+0.004
 choke+0.004
 necks+0.004
Neg logit contributions
olly-0.007
itting-0.007
 importantly-0.006
apping-0.006
ous-0.006
Token said
Token ID531
Feature activation-1.62e-01

Pos logit contributions
itting+0.007
olly+0.006
apping+0.005
ubby+0.005
ane+0.005
Neg logit contributions
 bank-0.006
 banks-0.004
 choke-0.004
 wounds-0.004
 tears-0.004
Token,
Token ID11
Feature activation-2.68e-01

Pos logit contributions
itting+0.018
olly+0.017
 importantly+0.015
mber+0.015
lic+0.015
Neg logit contributions
 bank-0.013
 banks-0.010
 wounds-0.008
 choke-0.008
bank-0.008
Token "
Token ID366
Feature activation+3.10e-01

Pos logit contributions
itting+0.027
olly+0.026
 importantly+0.022
ubby+0.022
mber+0.022
Neg logit contributions
 bank-0.024
 banks-0.020
 choke-0.017
 wounds-0.017
bank-0.017
TokenBecause
Token ID8128
Feature activation-4.82e-01

Pos logit contributions
 bank+0.031
 banks+0.026
 choke+0.023
 wounds+0.023
bank+0.022
Neg logit contributions
itting-0.027
olly-0.026
apping-0.023
ubby-0.023
omed-0.022
Token it
Token ID340
Feature activation-6.30e-01

Pos logit contributions
itting+0.049
olly+0.045
lic+0.043
omed+0.040
mber+0.040
Neg logit contributions
 bank-0.042
 banks-0.034
 choke-0.033
 wounds-0.032
 find-0.031
Token their
Token ID511
Feature activation-2.45e-01

Pos logit contributions
 bank+0.031
 banks+0.025
 choke+0.022
bank+0.021
 necks+0.021
Neg logit contributions
itting-0.035
olly-0.033
 importantly-0.031
apping-0.030
year-0.030
Token mom
Token ID1995
Feature activation+4.21e-01

Pos logit contributions
itting+0.027
olly+0.026
 importantly+0.023
apping+0.023
ubby+0.023
Neg logit contributions
 bank-0.019
 banks-0.015
 choke-0.013
bank-0.013
 wounds-0.012
Token and
Token ID290
Feature activation+4.43e-02

Pos logit contributions
 bank+0.035
 banks+0.029
 choke+0.025
 wounds+0.024
bank+0.024
Neg logit contributions
itting-0.044
olly-0.042
apping-0.038
 importantly-0.037
lic-0.037
Token dad
Token ID9955
Feature activation-1.73e-01

Pos logit contributions
 bank+0.004
 banks+0.004
 wounds+0.003
 choke+0.003
 necks+0.003
Neg logit contributions
itting-0.004
olly-0.004
 importantly-0.003
lic-0.003
mber-0.003
Token hug
Token ID16225
Feature activation+8.65e-01

Pos logit contributions
itting+0.017
olly+0.016
apping+0.015
 importantly+0.015
omed+0.014
Neg logit contributions
 bank-0.015
 banks-0.013
 wounds-0.011
 choke-0.011
bank-0.010
Token them
Token ID606
Feature activation+9.01e-01

Pos logit contributions
 bank+0.074
 banks+0.061
 choke+0.054
bank+0.050
 wounds+0.050
Neg logit contributions
itting-0.089
olly-0.085
apping-0.075
 importantly-0.074
year-0.074
Token and
Token ID290
Feature activation+5.63e-02

Pos logit contributions
 bank+0.074
 banks+0.061
 choke+0.056
bank+0.053
 wounds+0.053
Neg logit contributions
itting-0.090
olly-0.089
apping-0.081
year-0.076
ister-0.074
Token kiss
Token ID9245
Feature activation+6.51e-01

Pos logit contributions
 bank+0.005
 banks+0.004
 choke+0.004
 wounds+0.004
bank+0.004
Neg logit contributions
itting-0.005
olly-0.005
 importantly-0.005
mber-0.004
lic-0.004
Token them
Token ID606
Feature activation+9.05e-01

Pos logit contributions
 bank+0.049
 banks+0.039
 choke+0.033
 wounds+0.032
bank+0.031
Neg logit contributions
itting-0.075
olly-0.072
 importantly-0.064
mber-0.063
lic-0.063
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
 bank+0.069
 banks+0.055
 choke+0.048
 wounds+0.046
bank+0.045
Neg logit contributions
itting-0.100
olly-0.097
apping-0.087
year-0.085
 importantly-0.085
Token
Token ID198
Feature activation+2.78e-02

Pos logit contributions
 bank+0.010
 banks+0.008
 choke+0.007
 wounds+0.007
bank+0.007
Neg logit contributions
itting-0.012
olly-0.012
 importantly-0.010
mber-0.010
lic-0.010
Token."
Token ID526
Feature activation-3.21e-01

Pos logit contributions
itting+0.072
olly+0.067
lic+0.062
mber+0.061
omed+0.061
Neg logit contributions
 bank-0.032
 banks-0.023
 wounds-0.019
 necks-0.018
 neck-0.018
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
itting+0.037
olly+0.035
 importantly+0.031
mber+0.031
lic+0.031
Neg logit contributions
 bank-0.025
 banks-0.020
 choke-0.016
 wounds-0.016
bank-0.016
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
 bank+0.010
 banks+0.008
 choke+0.007
bank+0.007
 wounds+0.007
Neg logit contributions
itting-0.010
olly-0.010
lic-0.008
 importantly-0.008
apping-0.008
TokenTom
Token ID13787
Feature activation-5.34e-02

Pos logit contributions
 bank+0.011
 banks+0.009
 wounds+0.008
 choke+0.008
 necks+0.008
Neg logit contributions
itting-0.009
olly-0.008
 importantly-0.007
mber-0.007
lic-0.007
Token smiled
Token ID13541
Feature activation+2.68e-01

Pos logit contributions
itting+0.005
olly+0.005
mber+0.004
lic+0.004
 importantly+0.004
Neg logit contributions
 bank-0.005
 banks-0.004
 choke-0.004
 wounds-0.004
bank-0.004
Token and
Token ID290
Feature activation+6.08e-02

Pos logit contributions
 bank+0.020
 banks+0.016
 choke+0.013
bank+0.013
 wounds+0.013
Neg logit contributions
itting-0.031
olly-0.030
 importantly-0.027
apping-0.027
year-0.027
Token said
Token ID531
Feature activation-3.46e-02

Pos logit contributions
 bank+0.005
 w+0.004
 neck+0.004
 banks+0.003
 bark+0.003
Neg logit contributions
itting-0.007
olly-0.006
ubby-0.006
reetings-0.006
ane-0.006
Token,
Token ID11
Feature activation-8.62e-02

Pos logit contributions
itting+0.004
olly+0.004
aley+0.004
 importantly+0.004
mber+0.003
Neg logit contributions
 bank-0.002
 himself-0.001
 banks-0.001
 find-0.001
bank-0.001
Token "
Token ID366
Feature activation+4.02e-01

Pos logit contributions
itting+0.010
olly+0.009
ubby+0.008
 importantly+0.008
mber+0.008
Neg logit contributions
 bank-0.007
 banks-0.005
 neck-0.005
 choke-0.005
bank-0.005
TokenYou
Token ID1639
Feature activation+5.78e-01

Pos logit contributions
 bank+0.040
 banks+0.034
 choke+0.031
bank+0.030
 necks+0.030
Neg logit contributions
itting-0.035
olly-0.033
omed-0.028
lic-0.028
apping-0.028
Token are
Token ID389
Feature activation+3.42e-01

Pos logit contributions
 bank+0.041
 banks+0.033
 find+0.029
 choke+0.027
 neck+0.025
Neg logit contributions
itting-0.068
olly-0.064
 importantly-0.059
year-0.059
apping-0.058
Token kind
Token ID1611
Feature activation-4.32e-01

Pos logit contributions
 bank+0.023
 banks+0.019
 choke+0.017
 find+0.017
 neck+0.016
Neg logit contributions
itting-0.030
olly-0.027
omed-0.025
ane-0.025
apping-0.024
Token."
Token ID526
Feature activation-3.62e-01

Pos logit contributions
itting+0.055
olly+0.051
omed+0.047
lic+0.047
 importantly+0.047
Neg logit contributions
 bank-0.027
 banks-0.020
bank-0.015
 neck-0.015
 choke-0.015
Token
Token ID220
Feature activation+2.93e-01

Pos logit contributions
itting+0.042
olly+0.041
 importantly+0.036
mber+0.036
lic+0.036
Neg logit contributions
 bank-0.027
 banks-0.021
 choke-0.018
 wounds-0.017
bank-0.016
Token
Token ID198
Feature activation+2.92e-02

Pos logit contributions
 bank+0.022
 banks+0.018
 choke+0.015
 wounds+0.015
bank+0.014
Neg logit contributions
itting-0.033
olly-0.032
 importantly-0.028
mber-0.028
year-0.028
Token
Token ID198
Feature activation+2.78e-02

Pos logit contributions
 bank+0.003
 banks+0.002
 choke+0.002
 wounds+0.002
bank+0.002
Neg logit contributions
olly-0.003
itting-0.003
lic-0.002
mber-0.002
 importantly-0.002
TokenL
Token ID43
Feature activation+4.65e-01

Pos logit contributions
 bank+0.003
 banks+0.003
 choke+0.002
 wounds+0.002
bank+0.002
Neg logit contributions
itting-0.002
olly-0.002
 importantly-0.002
lic-0.002
mber-0.002
Tokenily
Token ID813
Feature activation+2.00e-01

Pos logit contributions
 bank+0.050
 banks+0.043
 wounds+0.042
 find+0.041
 mistake+0.040
Neg logit contributions
itting-0.031
olly-0.024
ister-0.024
unt-0.024
 importantly-0.024
Token smiled
Token ID13541
Feature activation+1.70e-01

Pos logit contributions
 bank+0.019
 banks+0.017
 choke+0.015
 wounds+0.015
bank+0.014
Neg logit contributions
olly-0.018
itting-0.017
mber-0.015
lic-0.014
apping-0.014
Token and
Token ID290
Feature activation-4.49e-02

Pos logit contributions
 bank+0.012
 banks+0.010
 choke+0.008
 wounds+0.008
bank+0.007
Neg logit contributions
itting-0.020
olly-0.020
 importantly-0.017
apping-0.017
mber-0.017
Token said
Token ID531
Feature activation-1.54e-01

Pos logit contributions
itting+0.005
olly+0.004
ubby+0.004
reetings+0.004
apping+0.004
Neg logit contributions
 bank-0.004
 banks-0.003
 choke-0.003
 neck-0.003
 wounds-0.003
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
itting+0.018
olly+0.017
lic+0.015
 importantly+0.015
mber+0.015
Neg logit contributions
 bank-0.011
 banks-0.008
 find-0.007
bank-0.006
 himself-0.006
Token so
Token ID523
Feature activation+4.89e-01

Pos logit contributions
itting+0.031
olly+0.030
mber+0.027
lic+0.026
apping+0.026
Neg logit contributions
 bank-0.019
 banks-0.015
 choke-0.013
 find-0.013
 wounds-0.012
Token big
Token ID1263
Feature activation-1.39e-01

Pos logit contributions
 bank+0.035
 banks+0.026
 find+0.025
 choke+0.023
 himself+0.022
Neg logit contributions
itting-0.056
olly-0.053
ancers-0.049
mber-0.048
ancer-0.048
Token,
Token ID11
Feature activation-3.37e-01

Pos logit contributions
itting+0.018
olly+0.017
lic+0.015
year+0.015
apping+0.015
Neg logit contributions
 bank-0.009
 banks-0.007
bank-0.005
 choke-0.005
 find-0.005
Token it
Token ID340
Feature activation-6.79e-01

Pos logit contributions
itting+0.037
olly+0.035
lic+0.031
apping+0.031
year+0.030
Neg logit contributions
 bank-0.027
 banks-0.022
 neck-0.020
 wounds-0.019
 choke-0.019
Token's
Token ID338
Feature activation+4.07e-01

Pos logit contributions
itting+0.079
olly+0.076
apping+0.069
year+0.069
 importantly+0.069
Neg logit contributions
 bank-0.051
 banks-0.039
 find-0.033
 choke-0.033
 way-0.032
Token too
Token ID1165
Feature activation-1.42e+00

Pos logit contributions
 bank+0.031
 banks+0.024
 choke+0.021
 wounds+0.020
 neck+0.019
Neg logit contributions
itting-0.047
olly-0.047
apping-0.041
mber-0.041
lic-0.040
Token hard
Token ID1327
Feature activation+1.93e-01

Pos logit contributions
itting+0.175
olly+0.169
ancers+0.157
apping+0.156
 importantly+0.154
Neg logit contributions
 bank-0.094
 banks-0.069
 way-0.056
 choke-0.055
 find-0.053
Token to
Token ID284
Feature activation+3.25e-01

Pos logit contributions
 bank+0.014
 banks+0.011
 choke+0.008
bank+0.008
 find+0.008
Neg logit contributions
itting-0.023
olly-0.023
mber-0.020
 importantly-0.020
lic-0.020
Token count
Token ID954
Feature activation-9.28e-02

Pos logit contributions
 bank+0.023
 banks+0.019
 wounds+0.015
 choke+0.015
bank+0.014
Neg logit contributions
itting-0.038
olly-0.037
 importantly-0.033
mber-0.032
lic-0.032
Token!".
Token ID48220
Feature activation-1.78e-01

Pos logit contributions
itting+0.011
olly+0.010
mber+0.009
 importantly+0.009
lic+0.009
Neg logit contributions
 bank-0.007
 banks-0.006
 wounds-0.005
 choke-0.004
 necks-0.004
Token\n
Token ID198
Feature activation-8.58e-02

Pos logit contributions
itting+0.019
olly+0.019
 importantly+0.017
mber+0.016
apping+0.016
Neg logit contributions
 bank-0.014
 banks-0.012
 wounds-0.010
 choke-0.010
 necks-0.009
Token Tim
Token ID5045
Feature activation+2.78e-01

Pos logit contributions
 bank+0.009
 banks+0.008
 wounds+0.007
 choke+0.007
 necks+0.007
Neg logit contributions
itting-0.008
olly-0.008
 importantly-0.007
lic-0.007
apping-0.007
Tokenmy
Token ID1820
Feature activation-4.67e-01

Pos logit contributions
 bank+0.032
 banks+0.028
 wounds+0.025
bank+0.024
 choke+0.024
Neg logit contributions
itting-0.020
olly-0.019
ubby-0.016
ay-0.016
ane-0.016
Token loved
Token ID6151
Feature activation-1.13e-01

Pos logit contributions
itting+0.032
olly+0.031
 importantly+0.025
unt+0.025
mber+0.025
Neg logit contributions
 bank-0.054
 banks-0.049
 wounds-0.045
 choke-0.042
bank-0.042
Token to
Token ID284
Feature activation+2.54e-01

Pos logit contributions
itting+0.010
olly+0.010
 importantly+0.009
ild+0.009
mber+0.009
Neg logit contributions
 bank-0.010
 banks-0.009
 wounds-0.008
bank-0.007
 choke-0.007
Token build
Token ID1382
Feature activation+4.54e-01

Pos logit contributions
 bank+0.018
 banks+0.014
 wounds+0.012
bank+0.011
 choke+0.011
Neg logit contributions
itting-0.030
olly-0.029
 importantly-0.026
lic-0.026
mber-0.026
Token things
Token ID1243
Feature activation-8.89e-01

Pos logit contributions
 bank+0.040
 banks+0.033
 choke+0.031
bank+0.031
 necks+0.028
Neg logit contributions
itting-0.040
olly-0.037
lic-0.036
apping-0.034
ay-0.034
Token with
Token ID351
Feature activation+1.82e-01

Pos logit contributions
itting+0.080
olly+0.076
apping+0.069
ying+0.066
 importantly+0.066
Neg logit contributions
 bank-0.083
 banks-0.070
 wounds-0.063
 necks-0.063
 choke-0.061
Token his
Token ID465
Feature activation-9.52e-02

Pos logit contributions
 bank+0.015
 banks+0.012
bank+0.012
 choke+0.010
 wounds+0.010
Neg logit contributions
itting-0.018
olly-0.016
ay-0.016
year-0.015
 importantly-0.015
Token blocks
Token ID7021
Feature activation+2.72e-01

Pos logit contributions
itting+0.010
olly+0.009
 importantly+0.008
ay+0.008
ubby+0.008
Neg logit contributions
 bank-0.008
 banks-0.007
bank-0.006
 choke-0.006
 wounds-0.006
Token.
Token ID13
Feature activation+1.06e-01

Pos logit contributions
 bank+0.024
 banks+0.020
 wounds+0.018
 choke+0.018
bank+0.018
Neg logit contributions
itting-0.027
olly-0.026
ay-0.023
apping-0.023
 importantly-0.022
Token He
Token ID679
Feature activation-2.48e-01

Pos logit contributions
 bank+0.010
 banks+0.009
 wounds+0.009
 choke+0.008
 necks+0.008
Neg logit contributions
itting-0.009
olly-0.009
ay-0.008
lic-0.007
apping-0.007
Token,
Token ID11
Feature activation-1.99e-01

Pos logit contributions
 bank+0.020
 banks+0.015
 choke+0.013
 wounds+0.012
bank+0.012
Neg logit contributions
itting-0.034
olly-0.033
 importantly-0.029
mber-0.029
lic-0.029
Token but
Token ID475
Feature activation+9.30e-01

Pos logit contributions
itting+0.018
olly+0.018
lic+0.015
mber+0.015
 importantly+0.015
Neg logit contributions
 bank-0.019
 banks-0.016
 choke-0.014
 wounds-0.014
 find-0.014
Token my
Token ID616
Feature activation-8.12e-01

Pos logit contributions
 bank+0.093
 banks+0.077
 find+0.074
 choke+0.070
 wounds+0.070
Neg logit contributions
itting-0.088
olly-0.082
lic-0.074
apping-0.072
omed-0.071
Token boat
Token ID8848
Feature activation-2.40e-01

Pos logit contributions
itting+0.089
olly+0.085
 importantly+0.076
lic+0.076
year+0.075
Neg logit contributions
 bank-0.066
 banks-0.054
 choke-0.045
 wounds-0.045
 necks-0.043
Token is
Token ID318
Feature activation-2.55e-01

Pos logit contributions
itting+0.024
olly+0.023
 importantly+0.020
lic+0.019
mber+0.019
Neg logit contributions
 bank-0.021
 banks-0.018
 choke-0.015
 wounds-0.015
 necks-0.015
Token too
Token ID1165
Feature activation-1.51e+00

Pos logit contributions
itting+0.028
olly+0.027
 importantly+0.024
mber+0.024
lic+0.024
Neg logit contributions
 bank-0.020
 banks-0.016
 choke-0.014
 wounds-0.014
bank-0.013
Token small
Token ID1402
Feature activation+3.31e-01

Pos logit contributions
itting+0.182
olly+0.174
 importantly+0.160
otion+0.158
ancers+0.156
Neg logit contributions
 bank-0.104
 banks-0.079
 choke-0.068
 wounds-0.060
 find-0.058
Token and
Token ID290
Feature activation+1.38e-02

Pos logit contributions
 bank+0.025
 banks+0.019
 choke+0.016
 wounds+0.016
bank+0.015
Neg logit contributions
itting-0.038
olly-0.037
 importantly-0.033
mber-0.033
lic-0.032
Token it
Token ID340
Feature activation-5.21e-01

Pos logit contributions
 bank+0.001
 banks+0.001
 choke+0.001
 wounds+0.001
 find+0.001
Neg logit contributions
itting-0.001
olly-0.001
apping-0.001
mber-0.001
lic-0.001
Token's
Token ID338
Feature activation+3.82e-01

Pos logit contributions
itting+0.055
olly+0.053
 importantly+0.046
lic+0.046
apping+0.045
Neg logit contributions
 bank-0.043
 banks-0.035
 wounds-0.030
 choke-0.030
 necks-0.029
Token not
Token ID407
Feature activation-6.86e-02

Pos logit contributions
 bank+0.030
 banks+0.024
 choke+0.021
 wounds+0.020
bank+0.019
Neg logit contributions
itting-0.043
olly-0.042
mber-0.037
lic-0.037
 importantly-0.036
Token.
Token ID13
Feature activation+3.77e-02

Pos logit contributions
itting+0.106
olly+0.103
mber+0.094
 importantly+0.093
lic+0.092
Neg logit contributions
 bank-0.053
 banks-0.040
 choke-0.031
 wounds-0.030
bank-0.030
Token It
Token ID632
Feature activation-1.68e-01

Pos logit contributions
 bank+0.003
 banks+0.003
 choke+0.002
 wounds+0.002
 necks+0.002
Neg logit contributions
itting-0.004
olly-0.004
 importantly-0.003
mber-0.003
ubby-0.003
Token is
Token ID318
Feature activation-3.26e-01

Pos logit contributions
itting+0.014
olly+0.014
lic+0.012
omed+0.012
ane+0.012
Neg logit contributions
 bank-0.017
 banks-0.014
 wounds-0.013
 choke-0.013
 necks-0.012
Token dark
Token ID3223
Feature activation-8.74e-02

Pos logit contributions
itting+0.030
olly+0.027
 importantly+0.026
lic+0.026
year+0.025
Neg logit contributions
 bank-0.030
 banks-0.025
 wounds-0.022
 choke-0.022
bank-0.022
Token and
Token ID290
Feature activation-5.55e-02

Pos logit contributions
itting+0.010
olly+0.009
 importantly+0.008
apping+0.008
lic+0.008
Neg logit contributions
 bank-0.007
 banks-0.005
 choke-0.005
 wounds-0.005
bank-0.004
Token black
Token ID2042
Feature activation-5.40e-01

Pos logit contributions
itting+0.006
olly+0.006
apping+0.005
mber+0.005
 importantly+0.005
Neg logit contributions
 bank-0.005
 banks-0.004
 wounds-0.003
 choke-0.003
 necks-0.003
Token inside
Token ID2641
Feature activation+4.46e-01

Pos logit contributions
itting+0.068
olly+0.065
mber+0.059
 importantly+0.059
lic+0.058
Neg logit contributions
 bank-0.035
 banks-0.027
 choke-0.021
 wounds-0.021
bank-0.020
Token.
Token ID13
Feature activation+3.46e-02

Pos logit contributions
 bank+0.027
 banks+0.021
 wounds+0.016
 choke+0.015
 piles+0.015
Neg logit contributions
itting-0.058
olly-0.056
mber-0.051
lic-0.051
 importantly-0.050
Token\n
Token ID198
Feature activation-1.72e-03

Pos logit contributions
 bank+0.003
 banks+0.003
 choke+0.002
 wounds+0.002
 necks+0.002
Neg logit contributions
itting-0.004
olly-0.003
 importantly-0.003
ubby-0.003
year-0.003
Token\n
Token ID198
Feature activation+3.28e-04

Pos logit contributions
itting+0.000
olly+0.000
 importantly+0.000
ubby+0.000
mber+0.000
Neg logit contributions
 bank-0.000
 banks-0.000
 choke-0.000
 wounds-0.000
bank-0.000
Token"
Token ID1
Feature activation-3.59e-01

Pos logit contributions
 bank+0.000
 banks+0.000
 necks+0.000
 piles+0.000
 find+0.000
Neg logit contributions
itting-0.000
olly-0.000
ancer-0.000
ous-0.000
 importantly-0.000
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
 bank+0.029
 banks+0.023
 choke+0.019
 wounds+0.019
bank+0.018
Neg logit contributions
itting-0.045
olly-0.043
 importantly-0.038
mber-0.038
apping-0.038
Token He
Token ID679
Feature activation-6.72e-02

Pos logit contributions
 bank+0.014
 banks+0.012
bank+0.010
 choke+0.010
 wounds+0.010
Neg logit contributions
itting-0.018
olly-0.017
year-0.015
ubby-0.014
lic-0.014
Token had
Token ID550
Feature activation+6.86e-02

Pos logit contributions
itting+0.006
olly+0.005
 importantly+0.005
lic+0.005
mber+0.005
Neg logit contributions
 bank-0.007
 banks-0.006
 choke-0.005
bank-0.005
 wounds-0.005
Token eaten
Token ID17065
Feature activation+1.22e+00

Pos logit contributions
 bank+0.006
 banks+0.005
bank+0.004
 choke+0.004
 wounds+0.004
Neg logit contributions
itting-0.006
olly-0.006
 importantly-0.006
ild-0.005
lic-0.005
Token some
Token ID617
Feature activation-3.05e-01

Pos logit contributions
 bank+0.102
 banks+0.083
 choke+0.071
 wounds+0.069
bank+0.068
Neg logit contributions
itting-0.128
olly-0.125
 importantly-0.110
mber-0.107
year-0.107
Token of
Token ID286
Feature activation-1.17e+00

Pos logit contributions
itting+0.029
olly+0.028
 importantly+0.025
year+0.024
lic+0.024
Neg logit contributions
 bank-0.028
 banks-0.024
 choke-0.020
 wounds-0.020
bank-0.020
Token her
Token ID607
Feature activation+5.94e-02

Pos logit contributions
itting+0.115
olly+0.114
 importantly+0.101
mber+0.098
ister+0.097
Neg logit contributions
 bank-0.099
 banks-0.082
 choke-0.068
bank-0.066
 wounds-0.066
Token leaves
Token ID5667
Feature activation+6.81e-01

Pos logit contributions
 bank+0.004
 banks+0.004
bank+0.004
 choke+0.003
 wounds+0.003
Neg logit contributions
olly-0.006
itting-0.006
 importantly-0.005
ubby-0.005
ama-0.005
Token when
Token ID618
Feature activation-2.03e-01

Pos logit contributions
 bank+0.055
 banks+0.046
 wounds+0.041
 choke+0.040
 necks+0.039
Neg logit contributions
itting-0.069
olly-0.068
apping-0.059
ister-0.058
ying-0.057
Token she
Token ID673
Feature activation+3.68e-01

Pos logit contributions
itting+0.020
olly+0.018
lic+0.016
apping+0.016
ubby+0.016
Neg logit contributions
 bank-0.020
 banks-0.016
 choke-0.015
 wounds-0.014
 find-0.014
Token was
Token ID373
Feature activation-1.11e-01

Pos logit contributions
 bank+0.032
 banks+0.026
 wounds+0.022
 choke+0.022
bank+0.022
Neg logit contributions
itting-0.038
olly-0.036
 importantly-0.032
lic-0.031
mber-0.031
Token thought
Token ID1807
Feature activation+2.92e-02

Pos logit contributions
itting+0.016
olly+0.015
mber+0.013
 importantly+0.013
year+0.013
Neg logit contributions
 bank-0.016
 banks-0.014
 choke-0.012
 wounds-0.012
bank-0.011
Token it
Token ID340
Feature activation-5.91e-01

Pos logit contributions
 bank+0.002
 banks+0.002
 choke+0.002
 wounds+0.002
 find+0.002
Neg logit contributions
itting-0.003
olly-0.003
mber-0.003
lic-0.003
apping-0.003
Token was
Token ID373
Feature activation-1.81e-01

Pos logit contributions
itting+0.058
olly+0.054
lic+0.049
 importantly+0.047
mber+0.047
Neg logit contributions
 bank-0.053
 banks-0.045
 wounds-0.040
 choke-0.040
 necks-0.037
Token the
Token ID262
Feature activation-9.58e-02

Pos logit contributions
itting+0.017
olly+0.016
 importantly+0.016
lic+0.014
mber+0.014
Neg logit contributions
 bank-0.016
 banks-0.014
 wounds-0.012
bank-0.012
 choke-0.011
Token prett
Token ID46442
Feature activation+2.66e-01

Pos logit contributions
itting+0.007
olly+0.007
 importantly+0.006
ancer+0.006
mber+0.006
Neg logit contributions
 bank-0.010
 banks-0.009
 wounds-0.008
bank-0.008
 choke-0.008
Tokeniest
Token ID6386
Feature activation-3.13e+00

Pos logit contributions
 bank+0.030
 banks+0.025
 himself+0.024
 choke+0.024
 find+0.023
Neg logit contributions
itting-0.015
unt-0.015
mber-0.014
lic-0.013
ild-0.013
Token coin
Token ID10752
Feature activation+5.35e-01

Pos logit contributions
itting+0.270
olly+0.269
 importantly+0.249
iest+0.229
mber+0.229
Neg logit contributions
 bank-0.284
 banks-0.260
bank-0.223
 choke-0.223
 wounds-0.222
Token that
Token ID326
Feature activation+1.30e-01

Pos logit contributions
 bank+0.047
 banks+0.039
 choke+0.034
 wounds+0.033
bank+0.032
Neg logit contributions
itting-0.054
olly-0.052
 importantly-0.045
mber-0.045
lic-0.045
Token he
Token ID339
Feature activation+2.73e-01

Pos logit contributions
 bank+0.012
 banks+0.010
 choke+0.009
 wounds+0.009
bank+0.008
Neg logit contributions
itting-0.013
olly-0.012
 importantly-0.011
apping-0.011
lic-0.011
Token had
Token ID550
Feature activation+5.77e-02

Pos logit contributions
 bank+0.027
 banks+0.023
 choke+0.020
 wounds+0.020
bank+0.019
Neg logit contributions
itting-0.025
olly-0.024
 importantly-0.021
mber-0.020
apping-0.020
Token ever
Token ID1683
Feature activation+5.92e-01

Pos logit contributions
 bank+0.006
 banks+0.005
 wounds+0.005
 choke+0.005
bank+0.005
Neg logit contributions
itting-0.005
olly-0.004
 importantly-0.004
year-0.004
ane-0.004
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
itting+0.051
olly+0.049
 importantly+0.044
mber+0.043
lic+0.043
Neg logit contributions
 bank-0.030
 banks-0.024
 choke-0.019
 wounds-0.019
bank-0.018
Token He
Token ID679
Feature activation-2.18e-01

Pos logit contributions
 bank+0.011
 banks+0.010
 wounds+0.008
 choke+0.008
 necks+0.008
Neg logit contributions
itting-0.013
olly-0.012
 importantly-0.011
mber-0.011
apping-0.010
Token was
Token ID373
Feature activation-1.67e-01

Pos logit contributions
itting+0.022
olly+0.021
 importantly+0.019
mber+0.018
lic+0.018
Neg logit contributions
 bank-0.019
 banks-0.016
 choke-0.014
 wounds-0.014
bank-0.013
Token so
Token ID523
Feature activation+4.97e-01

Pos logit contributions
itting+0.019
olly+0.018
 importantly+0.017
apping+0.016
lic+0.016
Neg logit contributions
 bank-0.012
 banks-0.010
 choke-0.008
 wounds-0.008
bank-0.008
Token eng
Token ID1786
Feature activation+4.68e-01

Pos logit contributions
 bank+0.033
 banks+0.026
 choke+0.023
 find+0.022
 piles+0.020
Neg logit contributions
itting-0.059
olly-0.056
mber-0.052
omed-0.051
ancers-0.051
Tokenross
Token ID1214
Feature activation-2.39e+00

Pos logit contributions
 bank+0.060
 banks+0.055
 himself+0.051
 find+0.049
 choke+0.049
Neg logit contributions
itting-0.020
mber-0.018
lic-0.018
unt-0.017
ha-0.017
Tokened
Token ID276
Feature activation+2.04e-02

Pos logit contributions
itting+0.185
olly+0.179
 importantly+0.159
mber+0.155
lic+0.152
Neg logit contributions
 bank-0.267
 banks-0.222
bank-0.198
 choke-0.194
 find-0.194
Token that
Token ID326
Feature activation+1.57e-01

Pos logit contributions
 bank+0.002
 banks+0.001
 choke+0.001
 wounds+0.001
bank+0.001
Neg logit contributions
itting-0.002
olly-0.002
mber-0.002
lic-0.002
 importantly-0.002
Token he
Token ID339
Feature activation+2.04e-01

Pos logit contributions
 bank+0.014
 banks+0.011
 choke+0.010
 wounds+0.010
bank+0.009
Neg logit contributions
itting-0.016
olly-0.015
lic-0.013
 importantly-0.013
mber-0.013
Token didn
Token ID1422
Feature activation+2.02e-01

Pos logit contributions
 bank+0.018
 banks+0.015
 choke+0.012
 wounds+0.012
bank+0.012
Neg logit contributions
itting-0.022
olly-0.020
apping-0.018
 importantly-0.018
mber-0.017
Token't
Token ID470
Feature activation-9.62e-01

Pos logit contributions
 bank+0.020
 banks+0.017
 choke+0.015
 wounds+0.015
bank+0.015
Neg logit contributions
itting-0.018
olly-0.017
 importantly-0.015
mber-0.015
lic-0.014
Token and
Token ID290
Feature activation+2.18e-02

Pos logit contributions
itting+0.048
olly+0.047
mber+0.040
apping+0.040
 importantly+0.040
Neg logit contributions
 bank-0.039
 banks-0.033
 choke-0.029
 wounds-0.028
 necks-0.027
Token said
Token ID531
Feature activation-8.97e-02

Pos logit contributions
 bank+0.002
 banks+0.001
 find+0.001
 choke+0.001
bank+0.001
Neg logit contributions
itting-0.003
olly-0.002
reetings-0.002
ubby-0.002
ane-0.002
Token,
Token ID11
Feature activation-1.35e-01

Pos logit contributions
itting+0.010
olly+0.010
 importantly+0.009
ubby+0.009
mber+0.009
Neg logit contributions
 bank-0.006
 banks-0.005
 find-0.004
 neck-0.004
bank-0.004
Token "
Token ID366
Feature activation+3.93e-01

Pos logit contributions
itting+0.014
olly+0.013
 importantly+0.011
ubby+0.011
year+0.011
Neg logit contributions
 bank-0.012
 banks-0.010
 choke-0.008
bank-0.008
 wounds-0.008
TokenY
Token ID56
Feature activation+3.79e-01

Pos logit contributions
 bank+0.039
 banks+0.033
 choke+0.029
 wounds+0.028
bank+0.028
Neg logit contributions
itting-0.035
olly-0.034
 importantly-0.030
apping-0.029
lic-0.029
Tokenay
Token ID323
Feature activation-2.72e+00

Pos logit contributions
 bank+0.050
 banks+0.045
 choke+0.040
 piles+0.040
 find+0.040
Neg logit contributions
itting-0.016
mber-0.015
 importantly-0.015
year-0.013
olly-0.013
Token!
Token ID0
Feature activation-2.07e-01

Pos logit contributions
itting+0.337
olly+0.306
 importantly+0.304
year+0.300
mber+0.294
Neg logit contributions
 bank-0.179
 banks-0.128
 find-0.108
 choke-0.108
 t-0.099
Token Can
Token ID1680
Feature activation-1.04e+00

Pos logit contributions
itting+0.022
olly+0.021
ier+0.019
ying+0.019
dd+0.019
Neg logit contributions
 bank-0.016
 banks-0.013
 find-0.011
 neck-0.011
 Fl-0.010
Token we
Token ID356
Feature activation+7.95e-01

Pos logit contributions
itting+0.094
olly+0.085
lic+0.079
year+0.078
mber+0.074
Neg logit contributions
 bank-0.106
 banks-0.090
 find-0.082
 wounds-0.079
 choke-0.078
Token get
Token ID651
Feature activation-1.23e-02

Pos logit contributions
 bank+0.053
 banks+0.040
 find+0.032
 choke+0.031
 wounds+0.027
Neg logit contributions
itting-0.102
olly-0.097
 importantly-0.090
mber-0.089
year-0.089
Token ice
Token ID4771
Feature activation+6.40e-03

Pos logit contributions
itting+0.001
olly+0.001
mber+0.001
apping+0.001
 importantly+0.001
Neg logit contributions
 bank-0.001
 banks-0.001
 wounds-0.001
 choke-0.001
 find-0.001
Token his
Token ID465
Feature activation-7.01e-02

Pos logit contributions
 bank+0.001
 banks+0.001
 choke+0.001
 wounds+0.001
bank+0.001
Neg logit contributions
itting-0.002
olly-0.002
 importantly-0.002
mber-0.002
lic-0.002
Token mind
Token ID2000
Feature activation+4.14e-01

Pos logit contributions
itting+0.007
olly+0.007
 importantly+0.006
mber+0.006
apping+0.006
Neg logit contributions
 bank-0.006
 banks-0.005
 choke-0.004
bank-0.004
 wounds-0.004
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
 bank+0.027
 banks+0.021
bank+0.016
 choke+0.015
 neck+0.015
Neg logit contributions
itting-0.051
olly-0.049
mber-0.045
 importantly-0.045
lic-0.044
Token He
Token ID679
Feature activation-2.24e-01

Pos logit contributions
 bank+0.009
 banks+0.007
bank+0.006
 choke+0.006
 wounds+0.006
Neg logit contributions
itting-0.012
olly-0.011
 importantly-0.010
ubby-0.010
year-0.010
Token w
Token ID266
Feature activation+2.10e-02

Pos logit contributions
itting+0.023
olly+0.022
 importantly+0.019
apping+0.019
lic+0.019
Neg logit contributions
 bank-0.020
 banks-0.016
 choke-0.014
 wounds-0.014
 necks-0.013
Tokenagged
Token ID14655
Feature activation-2.37e+00

Pos logit contributions
 bank+0.002
 banks+0.002
 wounds+0.002
 necks+0.002
 tears+0.002
Neg logit contributions
itting-0.001
ancer-0.001
olly-0.001
mber-0.001
unt-0.001
Token his
Token ID465
Feature activation-6.26e-02

Pos logit contributions
itting+0.298
olly+0.285
mber+0.266
 importantly+0.262
lic+0.261
Neg logit contributions
 bank-0.144
 banks-0.111
 choke-0.089
 wounds-0.084
bank-0.075
Token tail
Token ID7894
Feature activation+1.29e-01

Pos logit contributions
itting+0.008
olly+0.007
 importantly+0.007
apping+0.007
mber+0.007
Neg logit contributions
 bank-0.004
 banks-0.003
 choke-0.003
 wounds-0.003
bank-0.003
Token and
Token ID290
Feature activation-6.94e-02

Pos logit contributions
 bank+0.009
 banks+0.008
 choke+0.006
 wounds+0.006
bank+0.006
Neg logit contributions
itting-0.015
olly-0.014
 importantly-0.013
apping-0.013
mber-0.013
Token bark
Token ID21405
Feature activation+5.31e-01

Pos logit contributions
itting+0.007
olly+0.007
 importantly+0.006
apping+0.006
lic+0.006
Neg logit contributions
 bank-0.006
 banks-0.005
 choke-0.004
 wounds-0.004
bank-0.004
Tokened
Token ID276
Feature activation+1.71e-02

Pos logit contributions
 bank+0.064
 banks+0.056
 choke+0.050
 wounds+0.049
bank+0.049
Neg logit contributions
itting-0.037
olly-0.034
 importantly-0.028
mber-0.028
ister-0.027
Token yellow
Token ID7872
Feature activation-4.98e-01

Pos logit contributions
 bank+0.014
 banks+0.012
 choke+0.010
 wounds+0.010
bank+0.009
Neg logit contributions
itting-0.019
olly-0.018
 importantly-0.016
lic-0.016
mber-0.016
Token flowers
Token ID12734
Feature activation+5.43e-01

Pos logit contributions
itting+0.054
olly+0.052
 importantly+0.046
mber+0.046
lic+0.046
Neg logit contributions
 bank-0.040
 banks-0.032
 choke-0.027
 wounds-0.027
bank-0.026
Token.
Token ID13
Feature activation+8.72e-02

Pos logit contributions
 bank+0.042
 banks+0.033
 choke+0.027
bank+0.026
 find+0.026
Neg logit contributions
itting-0.061
olly-0.059
mber-0.053
year-0.053
 importantly-0.053
Token "
Token ID366
Feature activation+3.54e-01

Pos logit contributions
 bank+0.007
 banks+0.006
bank+0.004
 find+0.004
 choke+0.004
Neg logit contributions
itting-0.010
olly-0.009
ubby-0.008
 importantly-0.008
ister-0.008
TokenSpr
Token ID38454
Feature activation-1.33e-02

Pos logit contributions
 bank+0.031
 banks+0.025
 choke+0.021
 necks+0.021
 wounds+0.021
Neg logit contributions
itting-0.035
olly-0.035
apping-0.030
 importantly-0.030
ister-0.030
Tokenay
Token ID323
Feature activation-2.77e+00

Pos logit contributions
itting+0.001
 importantly+0.001
olly+0.001
year+0.001
mber+0.000
Neg logit contributions
 bank-0.002
 banks-0.002
 choke-0.001
bank-0.001
 find-0.001
Token these
Token ID777
Feature activation+6.96e-02

Pos logit contributions
itting+0.308
 importantly+0.287
olly+0.281
year+0.265
apping+0.257
Neg logit contributions
 bank-0.229
 banks-0.180
 find-0.153
bank-0.147
 choke-0.145
Token everyday
Token ID10908
Feature activation-1.48e-01

Pos logit contributions
 bank+0.005
 banks+0.004
 wounds+0.004
 choke+0.004
 necks+0.003
Neg logit contributions
itting-0.008
olly-0.008
 importantly-0.007
apping-0.007
lic-0.007
Token,
Token ID11
Feature activation-2.80e-01

Pos logit contributions
itting+0.016
olly+0.015
 importantly+0.014
lic+0.014
year+0.014
Neg logit contributions
 bank-0.012
 banks-0.010
 choke-0.008
 wounds-0.008
 find-0.008
Token just
Token ID655
Feature activation-7.59e-02

Pos logit contributions
itting+0.031
olly+0.030
ane+0.027
apping+0.026
year+0.026
Neg logit contributions
 bank-0.022
 banks-0.017
 find-0.016
 neck-0.015
 piles-0.015
Token like
Token ID588
Feature activation+1.75e+00

Pos logit contributions
itting+0.009
olly+0.009
lic+0.008
mber+0.008
ister+0.008
Neg logit contributions
 bank-0.006
 banks-0.004
 choke-0.004
 find-0.004
 wounds-0.004