TOP ACTIVATIONS
MAX = 0.353

ride to the apple or chard . The neighbor said yes
and go to the or chard . They pick some red
p ears in the or chard . As Lilly
Stay away from my or chard , young Fox ! â
go to the apple or chard and fill his wagon with
At the or chard , they picked as many
went to the closest or chard . At the
the owner of the or chard didn 't know who was
would sneak into the or chard at night and take the
a trap in the or chard . When the elephant came
and run around the or chard . One day
mice away from the or chard . The moral
traps all over the or chard , making sure to check
had been disturbing his or chard and he wanted to get
from disturbing the cherry or chard . Had he not been
Jill put on her swim suit and spl ashed in the
bug changed into its swim suit and jumped in , feeling
on his different colored swim suit and ran out to the
She put on her swim suit and grabbed her float y
She put on her swim suit and ran to the shore

MIN ACTIVATIONS
MIN = -0.332

Tim grabbed the hook and anx iously put it in the
are you ?" she shouted anx iously .
Both of them put their anx ieties aside . From then
! â Jim said anx iously . His
M ummy watched him anx iously as Ali waited for
the answer . She asked anx iously , " What is
smile . Mary anx iously opened up the pass
swept and scrub bed and mo pped and d usted until
daughter managed to find the tenth flower so quickly . Annie
We are going to the tenth floor , where I have
You could have caused an accident . Don 't you ever
to see the little boat mo ored there . The boat
They got into a car accident and Bob got hurt .
?" Ben asked for the tenth time . "
was surrounded with a wet mo at .
pushed the button for the tenth floor . The elevator started
had broken the notebook by accident . She was very sad
The little boy dug a mo at around the sand castle
. They d usted and mo pped , they put away
surrounded by a shimmer ing mo at .

INTERVAL 0.182 - 0.353
CONTAINS 0.597%

and she soon became known as the " pur ple l
in the garden . They chose a fragile shell and gently
but ran out as fast as he could .
everything glow . The all igator and his friends were amazed
standing there . He was wearing a suit and a hat

INTERVAL 0.011 - 0.182
CONTAINS 35.201%

together . <|endoftext|> Once upon a time , there was a
there more food available ?" They look at the
please ." Her mom said , " That fruit
was called " cham om ile " and it smelled very
. She knew she was ready to face anything ! Lucy

INTERVAL -0.161 - 0.011
CONTAINS 63.604%

here . Do you want to meet your new friends ?"
" I love you Julie ." Jul ie smiled
to wear . It was pink and had spark les on
around her , the sky was turning black . She was
. It tried to get the poison out of the lock

INTERVAL -0.332 - -0.161
CONTAINS 0.598%

O w ! Ow ! Ow !" he cries . He
had changed . It was no longer orange but gray .
had to help . They sw am over and pushed the
more apples to the barrel ," dad said .
the forest , and they never felt lonely again .
Token ride
Token ID6594
Feature activation+5.06e-02

Pos logit contributions
chard+0.016
suit+0.015
 Labrador+0.015
pper+0.014
mit+0.014
Neg logit contributions
 tenth-0.014
 bru-0.014
 maj-0.014
 aer-0.014
 mo-0.014
Token to
Token ID284
Feature activation-2.56e-02

Pos logit contributions
chard+0.070
suit+0.069
 Labrador+0.067
mit+0.066
 opener+0.066
Neg logit contributions
 accident-0.051
 tenth-0.050
 aer-0.050
 anx-0.050
und-0.048
Token the
Token ID262
Feature activation+3.22e-03

Pos logit contributions
 anx+0.027
 tenth+0.027
 accident+0.027
und+0.026
 maj+0.026
Neg logit contributions
suit-0.036
chard-0.034
 Labrador-0.033
mit-0.032
 opener-0.031
Token apple
Token ID17180
Feature activation+3.43e-02

Pos logit contributions
suit+0.004
chard+0.004
 Labrador+0.004
 opener+0.004
mit+0.004
Neg logit contributions
 tenth-0.004
 accident-0.004
 anx-0.004
 maj-0.004
 mo-0.004
Token or
Token ID393
Feature activation+7.41e-02

Pos logit contributions
suit+0.046
mit+0.045
 Labrador+0.044
chard+0.044
Same+0.042
Neg logit contributions
 accident-0.037
 aer-0.035
 tenth-0.035
 anx-0.034
und-0.034
Tokenchard
Token ID30215
Feature activation+3.39e-01

Pos logit contributions
suit+0.104
 Labrador+0.099
mit+0.094
 opener+0.093
chard+0.091
Neg logit contributions
 anx-0.076
 tenth-0.076
 accident-0.076
 bru-0.073
 aer-0.073
Token.
Token ID13
Feature activation+4.85e-03

Pos logit contributions
suit+0.532
 Labrador+0.493
chard+0.486
mit+0.470
icals+0.469
Neg logit contributions
 accident-0.312
 tenth-0.302
 anx-0.300
 aer-0.292
 mo-0.273
Token The
Token ID383
Feature activation-8.03e-02

Pos logit contributions
suit+0.007
chard+0.007
 Labrador+0.006
 opener+0.006
mit+0.006
Neg logit contributions
 tenth-0.005
 anx-0.005
 accident-0.005
 maj-0.005
und-0.005
Token neighbor
Token ID4780
Feature activation+1.15e-01

Pos logit contributions
 accident+0.108
 tenth+0.104
 anx+0.101
 maj+0.099
 mo+0.095
Neg logit contributions
suit-0.102
 opener-0.086
chard-0.086
resses-0.084
mit-0.084
Token said
Token ID531
Feature activation+9.30e-03

Pos logit contributions
suit+0.148
chard+0.146
 Labrador+0.139
mit+0.136
 opener+0.134
Neg logit contributions
 anx-0.129
 tenth-0.127
 accident-0.127
 maj-0.126
und-0.126
Token yes
Token ID3763
Feature activation-8.40e-02

Pos logit contributions
chard+0.011
 as+0.011
suit+0.010
 Coco+0.010
mit+0.010
Neg logit contributions
und-0.012
 tenth-0.011
 maj-0.011
xc-0.011
 anx-0.011
Token and
Token ID290
Feature activation+7.68e-03

Pos logit contributions
suit+0.156
 Labrador+0.155
chard+0.153
mit+0.148
 Cookie+0.141
Neg logit contributions
 anx-0.112
 accident-0.111
 tenth-0.111
 maj-0.108
 mo-0.108
Token go
Token ID467
Feature activation+1.57e-02

Pos logit contributions
suit+0.011
 Labrador+0.011
chard+0.010
mit+0.010
 opener+0.010
Neg logit contributions
 anx-0.008
 mo-0.008
 accident-0.008
 tenth-0.007
 maj-0.007
Token to
Token ID284
Feature activation-1.93e-02

Pos logit contributions
 Labrador+0.026
chard+0.025
suit+0.024
 Cookie+0.024
Same+0.024
Neg logit contributions
 mo-0.012
 tenth-0.012
 maj-0.012
 anx-0.011
 accident-0.010
Token the
Token ID262
Feature activation+7.09e-03

Pos logit contributions
 anx+0.021
 tenth+0.021
 accident+0.020
 maj+0.020
und+0.020
Neg logit contributions
suit-0.026
chard-0.025
 Labrador-0.024
mit-0.024
 opener-0.024
Token or
Token ID393
Feature activation+7.79e-02

Pos logit contributions
suit+0.009
chard+0.009
mit+0.009
 Labrador+0.009
 opener+0.009
Neg logit contributions
 tenth-0.008
 anx-0.008
 accident-0.008
 maj-0.007
 mo-0.007
Tokenchard
Token ID30215
Feature activation+3.37e-01

Pos logit contributions
 Labrador+0.047
suit+0.046
manship+0.037
mit+0.036
 opener+0.036
Neg logit contributions
 tenth-0.139
 anx-0.138
 bru-0.138
 mo-0.138
 maj-0.138
Token.
Token ID13
Feature activation+5.10e-03

Pos logit contributions
suit+0.572
 Labrador+0.550
mit+0.512
icals+0.512
chard+0.511
Neg logit contributions
 accident-0.275
 tenth-0.274
 anx-0.249
 mo-0.244
 aer-0.241
Token They
Token ID1119
Feature activation-6.44e-03

Pos logit contributions
suit+0.007
chard+0.007
mit+0.007
 Labrador+0.007
 opener+0.007
Neg logit contributions
 tenth-0.005
 anx-0.005
und-0.005
 maj-0.005
 accident-0.005
Token pick
Token ID2298
Feature activation-9.70e-03

Pos logit contributions
 anx+0.006
 mo+0.006
 tenth+0.006
 accident+0.006
 maj+0.006
Neg logit contributions
suit-0.009
 Labrador-0.009
chard-0.009
uts-0.009
Same-0.008
Token some
Token ID617
Feature activation-7.62e-02

Pos logit contributions
 maj+0.009
und+0.009
 anx+0.008
 accident+0.008
 tenth+0.008
Neg logit contributions
suit-0.015
chard-0.014
 Labrador-0.014
mit-0.014
 opener-0.014
Token red
Token ID2266
Feature activation-4.48e-02

Pos logit contributions
 aer+0.074
 many+0.074
und+0.072
 anx+0.071
 accident+0.071
Neg logit contributions
suit-0.111
 Labrador-0.103
chard-0.103
mit-0.103
 Tech-0.096
Token p
Token ID279
Feature activation-9.74e-02

Pos logit contributions
 aer+0.090
 mo+0.085
 tenth+0.085
 anx+0.084
 maj+0.083
Neg logit contributions
suit-0.112
chard-0.110
 Labrador-0.104
 opener-0.100
mit-0.099
Tokenears
Token ID4127
Feature activation+1.13e-01

Pos logit contributions
 maj+0.091
und+0.091
 mo+0.090
 tenth+0.083
xc+0.082
Neg logit contributions
chard-0.154
 Labrador-0.150
 Cookie-0.140
 opener-0.138
suit-0.137
Token in
Token ID287
Feature activation-4.29e-02

Pos logit contributions
 Labrador+0.191
suit+0.183
chard+0.174
mit+0.171
 licence+0.166
Neg logit contributions
 aer-0.087
 tenth-0.087
 accident-0.086
 mo-0.084
und-0.084
Token the
Token ID262
Feature activation+3.62e-03

Pos logit contributions
 tenth+0.034
 accident+0.034
 aer+0.032
 anx+0.031
 maj+0.031
Neg logit contributions
suit-0.077
mit-0.070
chard-0.070
 Labrador-0.069
pper-0.065
Token or
Token ID393
Feature activation+6.71e-02

Pos logit contributions
suit+0.005
chard+0.005
 Labrador+0.005
 stall+0.005
mit+0.005
Neg logit contributions
 anx-0.004
 maj-0.004
 tenth-0.004
 accident-0.004
und-0.004
Tokenchard
Token ID30215
Feature activation+3.34e-01

Pos logit contributions
suit+0.042
 Labrador+0.038
chard+0.037
mit+0.035
 opener+0.035
Neg logit contributions
 anx-0.119
 tenth-0.119
 accident-0.118
 maj-0.118
und-0.117
Token.
Token ID13
Feature activation-3.17e-03

Pos logit contributions
suit+0.528
 Labrador+0.497
chard+0.482
mit+0.470
 opener+0.457
Neg logit contributions
 tenth-0.307
 accident-0.302
 aer-0.294
 anx-0.291
 mo-0.286
Token
Token ID198
Feature activation+1.88e-02

Pos logit contributions
 maj+0.002
 anx+0.002
und+0.002
 Sad+0.002
 Other+0.002
Neg logit contributions
suit-0.006
pper-0.005
chard-0.005
mit-0.005
guard-0.005
Token
Token ID198
Feature activation+1.90e-02

Pos logit contributions
suit+0.021
chard+0.021
 Labrador+0.020
 opener+0.019
mit+0.019
Neg logit contributions
 anx-0.024
 tenth-0.024
 accident-0.024
 maj-0.024
und-0.023
TokenAs
Token ID1722
Feature activation+7.04e-02

Pos logit contributions
suit+0.026
chard+0.026
 Labrador+0.025
mit+0.024
 opener+0.024
Neg logit contributions
 anx-0.020
 tenth-0.019
und-0.019
 accident-0.019
 maj-0.019
Token Lilly
Token ID35134
Feature activation+1.33e-02

Pos logit contributions
chard+0.073
suit+0.072
 as+0.069
 Labrador+0.066
 opener+0.065
Neg logit contributions
 tenth-0.097
und-0.096
 anx-0.096
xc-0.096
 maj-0.095
TokenStay
Token ID25681
Feature activation+1.43e-01

Pos logit contributions
suit+0.222
chard+0.209
 Labrador+0.206
mit+0.198
 opener+0.197
Neg logit contributions
 anx-0.228
xc-0.224
 maj-0.223
 tenth-0.221
unct-0.219
Token away
Token ID1497
Feature activation+2.81e-02

Pos logit contributions
suit+0.152
chard+0.147
 Labrador+0.139
mit+0.136
 as+0.133
Neg logit contributions
 tenth-0.194
und-0.193
 anx-0.192
 maj-0.192
 accident-0.190
Token from
Token ID422
Feature activation+7.01e-02

Pos logit contributions
suit+0.025
 as+0.023
chard+0.022
mit+0.022
 opener+0.022
Neg logit contributions
Acc-0.043
 maj-0.043
xc-0.043
 tenth-0.043
 anx-0.042
Token my
Token ID616
Feature activation+3.33e-02

Pos logit contributions
suit+0.096
chard+0.092
 Labrador+0.089
 opener+0.088
mit+0.087
Neg logit contributions
 accident-0.075
 anx-0.075
 tenth-0.073
 maj-0.073
und-0.073
Token or
Token ID393
Feature activation+5.82e-02

Pos logit contributions
suit+0.042
chard+0.042
 Labrador+0.040
mit+0.039
 opener+0.038
Neg logit contributions
 anx-0.038
 tenth-0.038
 maj-0.038
 accident-0.038
und-0.037
Tokenchard
Token ID30215
Feature activation+3.34e-01

Pos logit contributions
suit+0.075
 Labrador+0.073
chard+0.071
 opener+0.070
mit+0.069
Neg logit contributions
 anx-0.064
 tenth-0.063
 accident-0.063
 maj-0.062
und-0.062
Token,
Token ID11
Feature activation-4.16e-04

Pos logit contributions
suit+0.483
chard+0.470
 Labrador+0.459
 opener+0.439
mit+0.433
Neg logit contributions
 accident-0.328
 tenth-0.324
 anx-0.321
und-0.314
 aer-0.309
Token young
Token ID1862
Feature activation+4.46e-02

Pos logit contributions
 anx+0.000
 accident+0.000
 tenth+0.000
und+0.000
 maj+0.000
Neg logit contributions
suit-0.001
chard-0.001
 Labrador-0.001
mit-0.000
 opener-0.000
Token Fox
Token ID5426
Feature activation+5.11e-02

Pos logit contributions
suit+0.052
chard+0.051
 Labrador+0.048
mit+0.047
 opener+0.047
Neg logit contributions
 anx-0.055
 accident-0.055
 tenth-0.055
 maj-0.054
und-0.054
Token!
Token ID0
Feature activation-1.37e-02

Pos logit contributions
chard+0.069
suit+0.068
 Labrador+0.065
 opener+0.063
mit+0.063
Neg logit contributions
und-0.054
 anx-0.054
 accident-0.053
 maj-0.053
 tenth-0.053
Tokenâ
Token ID22940
Feature activation-1.07e-01

Pos logit contributions
und+0.013
 anx+0.013
 maj+0.012
 tenth+0.012
 mo+0.012
Neg logit contributions
suit-0.019
chard-0.019
mit-0.018
 Labrador-0.018
 opener-0.017
Token go
Token ID467
Feature activation+8.65e-03

Pos logit contributions
suit+0.001
chard+0.001
 Labrador+0.001
mit+0.001
 opener+0.001
Neg logit contributions
 anx-0.001
 tenth-0.001
 maj-0.001
und-0.001
 accident-0.001
Token to
Token ID284
Feature activation-2.63e-02

Pos logit contributions
 Labrador+0.014
suit+0.014
mit+0.014
chard+0.014
Same+0.013
Neg logit contributions
 mo-0.007
 maj-0.007
 tenth-0.006
 aer-0.006
 anx-0.006
Token the
Token ID262
Feature activation+1.91e-03

Pos logit contributions
 anx+0.030
und+0.029
 tenth+0.029
 maj+0.029
 accident+0.029
Neg logit contributions
chard-0.033
suit-0.033
 Labrador-0.032
mit-0.031
 opener-0.031
Token apple
Token ID17180
Feature activation+3.14e-02

Pos logit contributions
suit+0.002
mit+0.002
 opener+0.002
chard+0.002
 Labrador+0.002
Neg logit contributions
 tenth-0.002
 accident-0.002
 anx-0.002
 maj-0.002
 aer-0.002
Token or
Token ID393
Feature activation+7.10e-02

Pos logit contributions
suit+0.041
mit+0.040
 Labrador+0.038
chard+0.036
Same+0.036
Neg logit contributions
 accident-0.036
 tenth-0.035
 aer-0.035
und-0.035
 mo-0.035
Tokenchard
Token ID30215
Feature activation+3.34e-01

Pos logit contributions
suit+0.044
 Labrador+0.042
mit+0.035
asket+0.033
uts+0.032
Neg logit contributions
 bru-0.129
 tenth-0.128
 mo-0.128
 anx-0.127
 accident-0.125
Token and
Token ID290
Feature activation+3.20e-03

Pos logit contributions
suit+0.538
 Labrador+0.501
mit+0.474
chard+0.464
icals+0.461
Neg logit contributions
 accident-0.316
 tenth-0.303
 anx-0.291
 aer-0.289
 many-0.282
Token fill
Token ID6070
Feature activation+8.49e-02

Pos logit contributions
suit+0.005
 Labrador+0.005
mit+0.004
 opener+0.004
asket+0.004
Neg logit contributions
 anx-0.003
 mo-0.003
 bru-0.003
 tenth-0.003
 accident-0.003
Token his
Token ID465
Feature activation-2.19e-02

Pos logit contributions
suit+0.119
 Labrador+0.114
chard+0.114
mit+0.111
 opener+0.108
Neg logit contributions
 anx-0.086
 maj-0.086
 tenth-0.086
 accident-0.085
und-0.084
Token wagon
Token ID31860
Feature activation+7.01e-02

Pos logit contributions
 maj+0.022
 bru+0.020
 many+0.020
 accident+0.020
 aer+0.020
Neg logit contributions
 opener-0.032
guard-0.031
mit-0.030
 Tech-0.029
pper-0.029
Token with
Token ID351
Feature activation+1.85e-02

Pos logit contributions
suit+0.085
chard+0.083
 Labrador+0.082
mit+0.079
 opener+0.076
Neg logit contributions
 anx-0.083
 tenth-0.082
 maj-0.081
 accident-0.081
und-0.080
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
und+0.001
 anx+0.001
 Other+0.001
 maj+0.001
 tenth+0.001
Neg logit contributions
suit-0.003
chard-0.002
mit-0.002
 Labrador-0.002
pper-0.002
Token
Token ID198
Feature activation+2.10e-02

Pos logit contributions
suit+0.021
chard+0.021
 opener+0.020
 Labrador+0.020
 stall+0.019
Neg logit contributions
 anx-0.028
 tenth-0.028
 maj-0.028
 accident-0.028
xc-0.028
TokenAt
Token ID2953
Feature activation+5.49e-02

Pos logit contributions
chard+0.027
suit+0.027
 Labrador+0.026
 opener+0.025
mit+0.025
Neg logit contributions
 anx-0.023
 maj-0.023
 accident-0.023
 tenth-0.022
und-0.022
Token the
Token ID262
Feature activation-3.28e-04

Pos logit contributions
chard+0.068
suit+0.067
 Labrador+0.063
mit+0.062
 opener+0.061
Neg logit contributions
 anx-0.064
und-0.064
 maj-0.064
 tenth-0.063
 accident-0.063
Token or
Token ID393
Feature activation+6.87e-02

Pos logit contributions
 tenth+0.000
 accident+0.000
 anx+0.000
 many+0.000
 maj+0.000
Neg logit contributions
uts-0.000
suit-0.000
 opener-0.000
resses-0.000
 Tech-0.000
Tokenchard
Token ID30215
Feature activation+3.34e-01

Pos logit contributions
suit+0.010
chard+0.005
 Labrador+0.005
mit+0.002
 opener+0.002
Neg logit contributions
 anx-0.156
 tenth-0.155
 accident-0.154
 maj-0.154
und-0.154
Token,
Token ID11
Feature activation+6.82e-03

Pos logit contributions
suit+0.501
 Labrador+0.466
chard+0.466
mit+0.444
 opener+0.442
Neg logit contributions
 accident-0.321
 anx-0.320
und-0.312
 tenth-0.306
 mo-0.296
Token they
Token ID484
Feature activation+1.00e-02

Pos logit contributions
suit+0.009
chard+0.009
 Labrador+0.009
mit+0.008
 opener+0.008
Neg logit contributions
 anx-0.007
 tenth-0.007
 maj-0.007
und-0.007
 accident-0.007
Token picked
Token ID6497
Feature activation+1.50e-02

Pos logit contributions
suit+0.015
 Labrador+0.015
 opener+0.014
resses+0.014
chard+0.013
Neg logit contributions
 anx-0.010
 mo-0.010
 accident-0.009
und-0.009
 maj-0.009
Token as
Token ID355
Feature activation+2.93e-01

Pos logit contributions
suit+0.022
chard+0.021
 opener+0.021
 Labrador+0.020
mit+0.020
Neg logit contributions
und-0.015
 maj-0.015
 anx-0.015
 accident-0.014
 tenth-0.014
Token many
Token ID867
Feature activation-2.25e-01

Pos logit contributions
suit+0.579
chard+0.563
 Labrador+0.556
mit+0.546
 opener+0.539
Neg logit contributions
 anx-0.131
 tenth-0.131
 maj-0.126
 accident-0.124
und-0.123
Token went
Token ID1816
Feature activation-8.21e-03

Pos logit contributions
suit+0.048
chard+0.047
 Labrador+0.046
 opener+0.044
mit+0.043
Neg logit contributions
 anx-0.040
 accident-0.038
und-0.038
 mo-0.038
 maj-0.037
Token to
Token ID284
Feature activation-2.89e-02

Pos logit contributions
 anx+0.008
 mo+0.007
 maj+0.007
und+0.007
 aer+0.007
Neg logit contributions
chard-0.012
suit-0.012
mit-0.011
 opener-0.011
 Labrador-0.011
Token the
Token ID262
Feature activation+1.17e-04

Pos logit contributions
 anx+0.033
und+0.033
 tenth+0.033
 maj+0.033
 accident+0.033
Neg logit contributions
chard-0.036
suit-0.035
 Labrador-0.034
mit-0.033
 opener-0.033
Token closest
Token ID11706
Feature activation+2.43e-01

Pos logit contributions
suit+0.000
 opener+0.000
mit+0.000
chard+0.000
 Labrador+0.000
Neg logit contributions
 tenth-0.000
 anx-0.000
 accident-0.000
 maj-0.000
 mo-0.000
Token or
Token ID393
Feature activation+6.76e-02

Pos logit contributions
suit+0.340
chard+0.319
mit+0.310
 Labrador+0.299
 Tech+0.297
Neg logit contributions
 accident-0.258
 aer-0.253
 mo-0.253
 anx-0.253
 tenth-0.244
Tokenchard
Token ID30215
Feature activation+3.34e-01

Pos logit contributions
 Labrador+0.036
suit+0.032
 opener+0.030
 Tech+0.024
uts+0.023
Neg logit contributions
 anx-0.130
 tenth-0.127
 maj-0.124
 accident-0.122
 mo-0.122
Token.
Token ID13
Feature activation-1.54e-03

Pos logit contributions
suit+0.514
 Labrador+0.485
chard+0.477
 opener+0.456
mit+0.454
Neg logit contributions
 anx-0.310
 accident-0.308
und-0.289
 tenth-0.287
 aer-0.280
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
und+0.001
 anx+0.001
 Other+0.001
 maj+0.001
 tenth+0.001
Neg logit contributions
suit-0.003
chard-0.002
mit-0.002
 Labrador-0.002
pper-0.002
Token
Token ID198
Feature activation+2.10e-02

Pos logit contributions
suit+0.021
chard+0.021
 opener+0.020
 Labrador+0.020
 stall+0.019
Neg logit contributions
 anx-0.028
 tenth-0.028
 maj-0.028
 accident-0.028
xc-0.028
TokenAt
Token ID2953
Feature activation+5.49e-02

Pos logit contributions
chard+0.027
suit+0.027
 Labrador+0.026
 opener+0.025
mit+0.025
Neg logit contributions
 anx-0.023
 maj-0.023
 accident-0.023
 tenth-0.022
und-0.022
Token the
Token ID262
Feature activation-3.28e-04

Pos logit contributions
chard+0.068
suit+0.067
 Labrador+0.063
mit+0.062
 opener+0.061
Neg logit contributions
 anx-0.064
und-0.064
 maj-0.064
 tenth-0.063
 accident-0.063
Token the
Token ID262
Feature activation-5.13e-03

Pos logit contributions
 anx+0.070
 tenth+0.070
und+0.069
 maj+0.069
 accident+0.068
Neg logit contributions
suit-0.087
chard-0.087
 Labrador-0.082
mit-0.081
 opener-0.080
Token owner
Token ID4870
Feature activation+1.63e-01

Pos logit contributions
 tenth+0.006
 accident+0.006
 maj+0.006
 anx+0.006
 mo+0.005
Neg logit contributions
suit-0.007
mit-0.006
chard-0.006
 opener-0.006
 Labrador-0.006
Token of
Token ID286
Feature activation+5.59e-03

Pos logit contributions
chard+0.197
suit+0.195
 Labrador+0.184
mit+0.182
 opener+0.179
Neg logit contributions
 tenth-0.194
 maj-0.192
 anx-0.192
und-0.190
xc-0.189
Token the
Token ID262
Feature activation-4.71e-03

Pos logit contributions
suit+0.010
mit+0.009
chard+0.009
 Labrador+0.008
 opener+0.008
Neg logit contributions
 accident-0.005
und-0.005
 aer-0.005
 mo-0.005
 maj-0.005
Token or
Token ID393
Feature activation+7.19e-02

Pos logit contributions
 accident+0.005
 tenth+0.005
 anx+0.005
 maj+0.005
 mo+0.005
Neg logit contributions
suit-0.006
chard-0.006
mit-0.006
 opener-0.006
 Labrador-0.006
Tokenchard
Token ID30215
Feature activation+3.32e-01

Pos logit contributions
suit+0.093
chard+0.092
 Labrador+0.087
mit+0.086
 opener+0.084
Neg logit contributions
 anx-0.080
 tenth-0.080
und-0.079
 maj-0.079
 accident-0.078
Token didn
Token ID1422
Feature activation+1.40e-02

Pos logit contributions
suit+0.501
 Labrador+0.468
chard+0.463
 opener+0.445
mit+0.444
Neg logit contributions
 accident-0.323
 tenth-0.315
und-0.305
 anx-0.305
 aer-0.301
Token't
Token ID470
Feature activation+4.70e-02

Pos logit contributions
suit+0.024
chard+0.023
mit+0.022
 Labrador+0.022
 pads+0.021
Neg logit contributions
 tenth-0.012
 accident-0.012
 anx-0.012
 aer-0.011
und-0.011
Token know
Token ID760
Feature activation+1.27e-02

Pos logit contributions
suit+0.059
chard+0.058
 Labrador+0.056
mit+0.054
 stall+0.052
Neg logit contributions
 accident-0.054
 tenth-0.054
 anx-0.053
 maj-0.052
 mo-0.051
Token who
Token ID508
Feature activation-6.00e-02

Pos logit contributions
suit+0.015
chard+0.014
 Labrador+0.013
mit+0.013
 opener+0.013
Neg logit contributions
 anx-0.016
 accident-0.016
 tenth-0.016
und-0.016
 maj-0.016
Token was
Token ID373
Feature activation+1.19e-02

Pos logit contributions
 accident+0.057
 aer+0.055
und+0.053
 tenth+0.053
 anx+0.052
Neg logit contributions
suit-0.094
 Labrador-0.087
mit-0.085
 opener-0.082
chard-0.081
Token would
Token ID561
Feature activation+1.40e-03

Pos logit contributions
 anx+0.025
 accident+0.024
 tenth+0.024
 maj+0.024
und+0.024
Neg logit contributions
suit-0.032
chard-0.031
 Labrador-0.030
mit-0.029
 opener-0.029
Token sneak
Token ID20528
Feature activation+1.29e-02

Pos logit contributions
chard+0.002
suit+0.002
 Labrador+0.002
mit+0.002
 opener+0.002
Neg logit contributions
und-0.002
xc-0.002
 maj-0.002
 tenth-0.002
 anx-0.002
Token into
Token ID656
Feature activation+9.50e-02

Pos logit contributions
 Labrador+0.016
suit+0.016
chard+0.016
mit+0.016
 opener+0.015
Neg logit contributions
 anx-0.015
 maj-0.014
 mo-0.014
 tenth-0.014
und-0.014
Token the
Token ID262
Feature activation-7.43e-03

Pos logit contributions
suit+0.127
chard+0.125
 Labrador+0.119
mit+0.117
 opener+0.115
Neg logit contributions
 anx-0.102
 tenth-0.102
 maj-0.101
und-0.101
 accident-0.100
Token or
Token ID393
Feature activation+7.07e-02

Pos logit contributions
 anx+0.010
 maj+0.009
und+0.009
 accident+0.009
 tenth+0.009
Neg logit contributions
chard-0.008
suit-0.008
 Labrador-0.008
 stall-0.008
mit-0.008
Tokenchard
Token ID30215
Feature activation+3.32e-01

Pos logit contributions
suit+0.066
 Labrador+0.062
mit+0.056
 opener+0.055
asket+0.053
Neg logit contributions
 anx-0.106
 tenth-0.105
 mo-0.104
 bru-0.104
 maj-0.102
Token at
Token ID379
Feature activation-7.82e-02

Pos logit contributions
suit+0.515
 Labrador+0.487
chard+0.468
mit+0.463
Same+0.440
Neg logit contributions
 tenth-0.312
 accident-0.311
 anx-0.299
 aer-0.296
 mo-0.290
Token night
Token ID1755
Feature activation-6.22e-02

Pos logit contributions
 accident+0.073
 tenth+0.071
 aer+0.068
und+0.067
 maj+0.067
Neg logit contributions
suit-0.128
 Labrador-0.119
mit-0.116
chard-0.110
 opener-0.110
Token and
Token ID290
Feature activation+3.64e-03

Pos logit contributions
 anx+0.074
 tenth+0.074
 maj+0.073
und+0.073
 accident+0.073
Neg logit contributions
suit-0.075
chard-0.074
 Labrador-0.070
mit-0.069
 opener-0.068
Token take
Token ID1011
Feature activation-8.60e-02

Pos logit contributions
suit+0.005
chard+0.005
 Labrador+0.004
mit+0.004
 opener+0.004
Neg logit contributions
 anx-0.004
 tenth-0.004
 accident-0.004
 maj-0.004
und-0.004
Token the
Token ID262
Feature activation-6.74e-03

Pos logit contributions
 accident+0.083
 anx+0.082
und+0.082
 maj+0.081
 tenth+0.080
Neg logit contributions
suit-0.126
chard-0.125
 Labrador-0.120
mit-0.117
 opener-0.116
Token a
Token ID257
Feature activation+1.37e-02

Pos logit contributions
 accident+0.012
 anx+0.012
und+0.012
 tenth+0.012
 maj+0.012
Neg logit contributions
suit-0.018
mit-0.017
chard-0.017
 Labrador-0.017
 opener-0.017
Token trap
Token ID12840
Feature activation+7.11e-02

Pos logit contributions
suit+0.017
chard+0.017
 Labrador+0.016
mit+0.016
 opener+0.016
Neg logit contributions
 anx-0.016
und-0.016
 tenth-0.016
 maj-0.015
 accident-0.015
Token in
Token ID287
Feature activation-4.27e-02

Pos logit contributions
suit+0.088
chard+0.087
 Labrador+0.082
 opener+0.080
mit+0.079
Neg logit contributions
 tenth-0.084
und-0.082
 anx-0.082
 maj-0.082
 accident-0.081
Token the
Token ID262
Feature activation-4.67e-03

Pos logit contributions
 anx+0.047
 tenth+0.047
 maj+0.047
und+0.046
 accident+0.046
Neg logit contributions
suit-0.055
chard-0.055
 Labrador-0.052
 opener-0.051
mit-0.050
Token or
Token ID393
Feature activation+6.95e-02

Pos logit contributions
und+0.006
 anx+0.006
xc+0.006
 maj+0.006
 tenth+0.006
Neg logit contributions
chard-0.005
suit-0.005
 Labrador-0.005
 stall-0.005
 as-0.004
Tokenchard
Token ID30215
Feature activation+3.31e-01

Pos logit contributions
chard+0.083
suit+0.079
 Labrador+0.071
mit+0.071
 opener+0.071
Neg logit contributions
und-0.089
 tenth-0.088
 anx-0.088
 maj-0.087
 accident-0.086
Token.
Token ID13
Feature activation-2.79e-03

Pos logit contributions
suit+0.437
chard+0.423
 Labrador+0.411
mit+0.402
 opener+0.390
Neg logit contributions
 anx-0.365
 accident-0.364
 tenth-0.362
und-0.358
 maj-0.354
Token When
Token ID1649
Feature activation+8.40e-02

Pos logit contributions
 tenth+0.003
 anx+0.003
xc+0.003
 accident+0.003
 maj+0.003
Neg logit contributions
suit-0.004
chard-0.004
 Labrador-0.004
 opener-0.004
 as-0.004
Token the
Token ID262
Feature activation-4.05e-03

Pos logit contributions
 as+0.067
chard+0.060
 Coco+0.057
 Cookie+0.051
 stall+0.050
Neg logit contributions
xc-0.158
und-0.149
Acc-0.141
itive-0.138
 maj-0.136
Token elephant
Token ID20950
Feature activation+1.07e-02

Pos logit contributions
 accident+0.005
 tenth+0.005
 anx+0.005
 maj+0.005
 mo+0.005
Neg logit contributions
suit-0.005
chard-0.005
mit-0.004
 opener-0.004
 Labrador-0.004
Token came
Token ID1625
Feature activation+1.93e-02

Pos logit contributions
suit+0.013
chard+0.013
 Labrador+0.012
 opener+0.012
mit+0.012
Neg logit contributions
 tenth-0.013
 maj-0.013
 anx-0.012
und-0.012
xc-0.012
Token and
Token ID290
Feature activation+1.43e-03

Pos logit contributions
suit+0.195
chard+0.192
 Labrador+0.185
mit+0.181
 opener+0.176
Neg logit contributions
 anx-0.169
 tenth-0.166
und-0.166
 accident-0.165
 maj-0.163
Token run
Token ID1057
Feature activation-6.80e-02

Pos logit contributions
suit+0.002
chard+0.002
 Labrador+0.002
mit+0.002
 opener+0.002
Neg logit contributions
 anx-0.002
 tenth-0.002
 maj-0.002
und-0.002
 accident-0.002
Token around
Token ID1088
Feature activation+7.70e-02

Pos logit contributions
und+0.078
 anx+0.077
 accident+0.073
 mo+0.073
 tenth+0.073
Neg logit contributions
chard-0.090
 Labrador-0.084
mit-0.082
 opener-0.079
suit-0.077
Token the
Token ID262
Feature activation+2.81e-03

Pos logit contributions
chard+0.085
suit+0.084
 as+0.084
 Labrador+0.079
 opener+0.078
Neg logit contributions
 tenth-0.098
xc-0.097
 maj-0.096
und-0.094
unct-0.094
Token or
Token ID393
Feature activation+6.84e-02

Pos logit contributions
suit+0.004
chard+0.003
 Labrador+0.003
mit+0.003
 opener+0.003
Neg logit contributions
 anx-0.003
 tenth-0.003
 maj-0.003
 accident-0.003
und-0.003
Tokenchard
Token ID30215
Feature activation+3.31e-01

Pos logit contributions
suit+0.086
 Labrador+0.082
 opener+0.079
mit+0.078
chard+0.078
Neg logit contributions
 anx-0.079
 tenth-0.077
 mo-0.076
 maj-0.076
 bru-0.076
Token.
Token ID13
Feature activation-2.78e-03

Pos logit contributions
suit+0.480
 Labrador+0.459
chard+0.450
mit+0.434
 opener+0.424
Neg logit contributions
 anx-0.327
 tenth-0.327
 accident-0.322
und-0.312
 aer-0.307
Token
Token ID198
Feature activation+1.90e-02

Pos logit contributions
 anx+0.003
 tenth+0.003
und+0.003
 accident+0.003
 maj+0.003
Neg logit contributions
suit-0.004
chard-0.004
mit-0.004
 Labrador-0.004
 stall-0.003
Token
Token ID198
Feature activation+1.90e-02

Pos logit contributions
suit+0.023
chard+0.023
 Labrador+0.022
mit+0.021
 opener+0.021
Neg logit contributions
 anx-0.022
 tenth-0.022
und-0.022
 maj-0.022
 accident-0.022
TokenOne
Token ID3198
Feature activation-3.70e-02

Pos logit contributions
suit+0.029
chard+0.028
mit+0.027
 Labrador+0.027
 opener+0.026
Neg logit contributions
 tenth-0.017
 anx-0.017
und-0.017
 accident-0.017
 maj-0.017
Token day
Token ID1110
Feature activation-4.37e-03

Pos logit contributions
 accident+0.033
 anx+0.032
 tenth+0.032
 maj+0.031
 aer+0.031
Neg logit contributions
suit-0.058
chard-0.057
 Labrador-0.053
 opener-0.053
mit-0.052
Token mice
Token ID10693
Feature activation+9.70e-02

Pos logit contributions
suit+0.007
mit+0.006
 Labrador+0.006
chard+0.006
 opener+0.006
Neg logit contributions
 accident-0.003
und-0.003
 anx-0.003
 mo-0.003
 aer-0.003
Token away
Token ID1497
Feature activation+3.21e-02

Pos logit contributions
suit+0.138
chard+0.135
 Labrador+0.135
mit+0.127
 opener+0.123
Neg logit contributions
 anx-0.097
und-0.097
 accident-0.095
 mo-0.091
 tenth-0.091
Token from
Token ID422
Feature activation+7.87e-02

Pos logit contributions
suit+0.043
chard+0.042
 Labrador+0.041
mit+0.039
 opener+0.038
Neg logit contributions
 anx-0.035
und-0.035
 accident-0.034
 tenth-0.033
 maj-0.033
Token the
Token ID262
Feature activation+6.54e-04

Pos logit contributions
suit+0.118
 Labrador+0.109
mit+0.108
 opener+0.107
chard+0.106
Neg logit contributions
 accident-0.079
 anx-0.078
und-0.075
 tenth-0.075
 maj-0.074
Token or
Token ID393
Feature activation+6.75e-02

Pos logit contributions
suit+0.001
chard+0.001
 Labrador+0.001
mit+0.001
 opener+0.001
Neg logit contributions
 anx-0.001
 tenth-0.001
 accident-0.001
 maj-0.001
und-0.001
Tokenchard
Token ID30215
Feature activation+3.30e-01

Pos logit contributions
suit+0.087
 Labrador+0.082
mit+0.079
 opener+0.078
chard+0.077
Neg logit contributions
 anx-0.077
 accident-0.076
 tenth-0.075
 maj-0.074
 mo-0.074
Token.
Token ID13
Feature activation-3.87e-03

Pos logit contributions
suit+0.479
chard+0.449
 Labrador+0.445
mit+0.430
 opener+0.420
Neg logit contributions
 accident-0.334
 anx-0.333
und-0.323
 tenth-0.317
 mo-0.310
Token
Token ID198
Feature activation+1.88e-02

Pos logit contributions
und+0.004
 anx+0.004
 tenth+0.004
 maj+0.004
 accident+0.004
Neg logit contributions
suit-0.005
chard-0.005
 Labrador-0.005
mit-0.005
 opener-0.005
Token
Token ID198
Feature activation+1.90e-02

Pos logit contributions
chard+0.019
suit+0.019
 opener+0.019
 Labrador+0.019
 stall+0.018
Neg logit contributions
 anx-0.025
 maj-0.025
 tenth-0.025
 accident-0.025
xc-0.025
TokenThe
Token ID464
Feature activation-6.89e-02

Pos logit contributions
chard+0.026
suit+0.026
 Labrador+0.025
 opener+0.025
mit+0.024
Neg logit contributions
 anx-0.020
 maj-0.019
 tenth-0.019
 accident-0.019
und-0.019
Token moral
Token ID6573
Feature activation-3.25e-02

Pos logit contributions
 accident+0.077
 anx+0.077
 tenth+0.074
 maj+0.072
 mo+0.070
Neg logit contributions
suit-0.094
chard-0.088
mit-0.084
 opener-0.083
 Labrador-0.082
Token traps
Token ID20348
Feature activation+5.35e-02

Pos logit contributions
suit+0.060
chard+0.059
 Labrador+0.057
mit+0.056
 opener+0.055
Neg logit contributions
 anx-0.049
 tenth-0.048
und-0.048
 maj-0.048
 accident-0.048
Token all
Token ID477
Feature activation-1.07e-01

Pos logit contributions
suit+0.072
chard+0.072
 Labrador+0.070
mit+0.069
 opener+0.065
Neg logit contributions
 anx-0.058
 accident-0.056
und-0.056
 maj-0.053
 tenth-0.053
Token over
Token ID625
Feature activation+1.01e-01

Pos logit contributions
 tenth+0.146
 anx+0.146
 maj+0.144
 accident+0.142
und+0.142
Neg logit contributions
suit-0.112
chard-0.110
 Labrador-0.102
mit-0.100
 opener-0.100
Token the
Token ID262
Feature activation-1.09e-06

Pos logit contributions
suit+0.138
chard+0.134
 Labrador+0.130
mit+0.128
 opener+0.124
Neg logit contributions
 anx-0.107
und-0.105
 accident-0.105
 tenth-0.105
 maj-0.104
Token or
Token ID393
Feature activation+6.81e-02

Feature not active on prev token;
no predictions were affected.
Tokenchard
Token ID30215
Feature activation+3.30e-01

Pos logit contributions
suit+0.072
chard+0.070
 Labrador+0.067
mit+0.065
 opener+0.064
Neg logit contributions
 anx-0.092
 tenth-0.091
 maj-0.090
 accident-0.090
und-0.090
Token,
Token ID11
Feature activation+8.62e-03

Pos logit contributions
suit+0.472
 Labrador+0.444
chard+0.441
mit+0.426
 opener+0.410
Neg logit contributions
 anx-0.341
 accident-0.337
 tenth-0.327
und-0.326
 mo-0.316
Token making
Token ID1642
Feature activation-6.41e-02

Pos logit contributions
chard+0.011
suit+0.011
 as+0.010
 Labrador+0.010
 stall+0.010
Neg logit contributions
xc-0.010
 tenth-0.010
und-0.010
 maj-0.010
 anx-0.009
Token sure
Token ID1654
Feature activation-1.03e-01

Pos logit contributions
 tenth+0.069
 anx+0.069
 maj+0.068
und+0.068
 accident+0.067
Neg logit contributions
suit-0.084
chard-0.083
 Labrador-0.079
mit-0.078
 opener-0.077
Token to
Token ID284
Feature activation-2.87e-02

Pos logit contributions
 anx+0.113
 tenth+0.112
 maj+0.111
und+0.111
 accident+0.110
Neg logit contributions
suit-0.135
chard-0.134
 Labrador-0.127
mit-0.125
 opener-0.123
Token check
Token ID2198
Feature activation-2.47e-02

Pos logit contributions
 anx+0.032
 accident+0.032
 tenth+0.032
und+0.032
 maj+0.031
Neg logit contributions
suit-0.038
chard-0.036
 Labrador-0.035
mit-0.034
 opener-0.033
Token had
Token ID550
Feature activation+9.11e-03

Pos logit contributions
suit+0.120
chard+0.118
 Labrador+0.113
mit+0.111
 opener+0.109
Neg logit contributions
 anx-0.106
 tenth-0.105
und-0.105
 accident-0.104
 maj-0.104
Token been
Token ID587
Feature activation+1.10e-02

Pos logit contributions
suit+0.012
chard+0.012
 Labrador+0.011
mit+0.011
 opener+0.011
Neg logit contributions
 anx-0.010
 tenth-0.010
 maj-0.010
und-0.010
 accident-0.010
Token disturbing
Token ID14851
Feature activation-3.46e-02

Pos logit contributions
suit+0.014
chard+0.014
 Labrador+0.013
mit+0.013
 opener+0.013
Neg logit contributions
 anx-0.012
 accident-0.012
 tenth-0.012
 maj-0.012
und-0.012
Token his
Token ID465
Feature activation-2.52e-02

Pos logit contributions
 anx+0.035
 accident+0.035
und+0.035
 tenth+0.034
 maj+0.033
Neg logit contributions
suit-0.050
chard-0.048
 Labrador-0.046
mit-0.046
 opener-0.045
Token or
Token ID393
Feature activation+6.51e-02

Pos logit contributions
 anx+0.029
 accident+0.029
 tenth+0.028
 maj+0.028
 aer+0.027
Neg logit contributions
suit-0.033
chard-0.032
 Labrador-0.030
 opener-0.030
mit-0.029
Tokenchard
Token ID30215
Feature activation+3.29e-01

Pos logit contributions
suit+0.091
 Labrador+0.086
 opener+0.083
mit+0.082
chard+0.080
Neg logit contributions
 accident-0.070
 anx-0.068
 mo-0.066
und-0.065
 tenth-0.065
Token and
Token ID290
Feature activation+2.36e-04

Pos logit contributions
suit+0.466
chard+0.429
 Labrador+0.428
 opener+0.416
mit+0.407
Neg logit contributions
 accident-0.347
 anx-0.340
 tenth-0.336
und-0.329
 aer-0.324
Token he
Token ID339
Feature activation-1.80e-02

Pos logit contributions
suit+0.000
chard+0.000
 Labrador+0.000
mit+0.000
 opener+0.000
Neg logit contributions
 accident-0.000
 anx-0.000
und-0.000
 tenth-0.000
 mo-0.000
Token wanted
Token ID2227
Feature activation-1.18e-02

Pos logit contributions
 anx+0.020
 tenth+0.020
und+0.020
 maj+0.020
 accident+0.020
Neg logit contributions
suit-0.023
chard-0.023
 Labrador-0.022
mit-0.021
 opener-0.021
Token to
Token ID284
Feature activation-2.98e-02

Pos logit contributions
 anx+0.011
 accident+0.011
 tenth+0.011
 maj+0.011
und+0.011
Neg logit contributions
suit-0.018
chard-0.017
 Labrador-0.017
mit-0.016
 opener-0.016
Token get
Token ID651
Feature activation+8.95e-02

Pos logit contributions
 maj+0.036
 anx+0.035
 tenth+0.035
und+0.035
xc+0.034
Neg logit contributions
chard-0.035
suit-0.035
 Labrador-0.034
mit-0.033
 opener-0.033
Token from
Token ID422
Feature activation+7.79e-02

Pos logit contributions
suit+0.119
chard+0.117
 Labrador+0.112
mit+0.109
 opener+0.107
Neg logit contributions
 anx-0.111
 tenth-0.110
und-0.109
 accident-0.109
 maj-0.108
Token disturbing
Token ID14851
Feature activation-2.91e-02

Pos logit contributions
chard+0.098
suit+0.097
 Labrador+0.092
mit+0.090
 opener+0.088
Neg logit contributions
 anx-0.089
 tenth-0.088
 maj-0.088
und-0.088
 accident-0.086
Token the
Token ID262
Feature activation-1.05e-03

Pos logit contributions
 anx+0.031
und+0.031
 accident+0.031
 tenth+0.029
 maj+0.029
Neg logit contributions
suit-0.041
chard-0.039
 Labrador-0.038
mit-0.037
 opener-0.037
Token cherry
Token ID23612
Feature activation+3.73e-02

Pos logit contributions
 anx+0.001
 tenth+0.001
und+0.001
 maj+0.001
 accident+0.001
Neg logit contributions
suit-0.001
chard-0.001
 Labrador-0.001
mit-0.001
 opener-0.001
Token or
Token ID393
Feature activation+6.54e-02

Pos logit contributions
suit+0.055
mit+0.053
chard+0.052
 Labrador+0.052
 opener+0.047
Neg logit contributions
und-0.037
 accident-0.037
 anx-0.035
 aer-0.035
 mo-0.034
Tokenchard
Token ID30215
Feature activation+3.29e-01

Pos logit contributions
suit+0.090
 Labrador+0.085
mit+0.081
chard+0.081
 opener+0.081
Neg logit contributions
 anx-0.070
 accident-0.070
 tenth-0.068
 mo-0.068
 maj-0.068
Token.
Token ID13
Feature activation-6.72e-03

Pos logit contributions
suit+0.473
chard+0.447
 Labrador+0.439
mit+0.425
 opener+0.416
Neg logit contributions
 accident-0.337
 anx-0.334
und-0.324
 tenth-0.323
 aer-0.313
Token Had
Token ID11161
Feature activation-1.70e-02

Pos logit contributions
 anx+0.007
und+0.007
 tenth+0.007
 maj+0.007
 accident+0.007
Neg logit contributions
suit-0.009
chard-0.009
 Labrador-0.008
mit-0.008
 opener-0.008
Token he
Token ID339
Feature activation-1.85e-02

Pos logit contributions
 anx+0.020
 tenth+0.020
 maj+0.019
und+0.019
 accident+0.019
Neg logit contributions
chard-0.021
suit-0.021
mit-0.019
 Labrador-0.019
 opener-0.019
Token not
Token ID407
Feature activation+1.60e-02

Pos logit contributions
 accident+0.019
 anx+0.019
und+0.019
 maj+0.018
 tenth+0.018
Neg logit contributions
suit-0.026
chard-0.026
 Labrador-0.025
 opener-0.023
 stall-0.023
Token been
Token ID587
Feature activation+1.78e-02

Pos logit contributions
suit+0.021
chard+0.020
 Labrador+0.019
mit+0.019
 opener+0.019
Neg logit contributions
 anx-0.018
 tenth-0.018
 accident-0.018
und-0.018
 maj-0.018
Token Jill
Token ID22772
Feature activation-2.43e-02

Pos logit contributions
 tenth+0.028
und+0.028
 maj+0.027
xc+0.027
 bru+0.026
Neg logit contributions
suit-0.029
mit-0.027
chard-0.027
 as-0.026
 Labrador-0.025
Token put
Token ID1234
Feature activation-3.08e-02

Pos logit contributions
 tenth+0.031
 bru+0.029
 maj+0.029
xc+0.029
und+0.029
Neg logit contributions
suit-0.028
mit-0.027
chard-0.026
 as-0.026
 Labrador-0.025
Token on
Token ID319
Feature activation-9.67e-02

Pos logit contributions
 anx+0.030
 maj+0.029
 tenth+0.029
 accident+0.029
und+0.029
Neg logit contributions
suit-0.044
chard-0.044
 Labrador-0.042
mit-0.041
 opener-0.041
Token her
Token ID607
Feature activation+4.13e-03

Pos logit contributions
 tenth+0.111
 anx+0.110
und+0.110
 maj+0.110
 accident+0.109
Neg logit contributions
suit-0.119
chard-0.118
 Labrador-0.112
mit-0.110
 opener-0.108
Token swim
Token ID9422
Feature activation+9.14e-03

Pos logit contributions
suit+0.005
chard+0.005
 Labrador+0.005
 opener+0.005
mit+0.004
Neg logit contributions
 anx-0.005
 tenth-0.005
 maj-0.005
 accident-0.005
 mo-0.005
Tokensuit
Token ID6063
Feature activation+3.24e-01

Pos logit contributions
chard+0.006
 Labrador+0.006
 opener+0.005
isi+0.005
 Cookie+0.005
Neg logit contributions
 maj-0.016
 anx-0.016
und-0.015
 mo-0.015
 accident-0.015
Token and
Token ID290
Feature activation-4.00e-03

Pos logit contributions
suit+0.406
chard+0.403
 Labrador+0.387
mit+0.376
 opener+0.371
Neg logit contributions
 anx-0.367
 tenth-0.363
 accident-0.361
und-0.361
 maj-0.360
Token spl
Token ID4328
Feature activation-1.67e-01

Pos logit contributions
 tenth+0.005
und+0.005
 maj+0.005
 accident+0.005
 anx+0.005
Neg logit contributions
suit-0.005
chard-0.005
mit-0.005
 as-0.004
 Labrador-0.004
Tokenashed
Token ID5263
Feature activation+5.65e-02

Pos logit contributions
 anx+0.217
und+0.217
 maj+0.215
 tenth+0.213
 accident+0.212
Neg logit contributions
chard-0.185
suit-0.178
 Labrador-0.175
 opener-0.169
 stall-0.166
Token in
Token ID287
Feature activation-4.99e-02

Pos logit contributions
suit+0.060
chard+0.055
 as+0.054
mit+0.052
 Labrador+0.052
Neg logit contributions
 tenth-0.079
 anx-0.077
 maj-0.077
und-0.076
 bru-0.076
Token the
Token ID262
Feature activation-4.65e-03

Pos logit contributions
 anx+0.054
 tenth+0.053
 accident+0.053
 maj+0.053
und+0.052
Neg logit contributions
suit-0.067
chard-0.066
 Labrador-0.063
mit-0.062
 opener-0.061
Token bug
Token ID5434
Feature activation+5.12e-03

Pos logit contributions
 accident+0.098
 anx+0.098
 tenth+0.097
 mo+0.096
 maj+0.095
Neg logit contributions
chard-0.113
suit-0.112
 Labrador-0.107
 opener-0.105
mit-0.102
Token changed
Token ID3421
Feature activation+6.61e-02

Pos logit contributions
suit+0.006
chard+0.006
 Labrador+0.006
mit+0.006
 opener+0.006
Neg logit contributions
 tenth-0.006
 anx-0.006
 maj-0.006
und-0.006
 accident-0.006
Token into
Token ID656
Feature activation+9.80e-02

Pos logit contributions
chard+0.085
suit+0.085
 Labrador+0.083
 opener+0.080
mit+0.078
Neg logit contributions
 anx-0.076
 maj-0.073
und-0.073
 mo-0.072
 accident-0.072
Token its
Token ID663
Feature activation-8.49e-02

Pos logit contributions
chard+0.118
suit+0.117
 Labrador+0.112
 opener+0.110
 as+0.109
Neg logit contributions
 tenth-0.115
 maj-0.114
und-0.113
xc-0.113
 anx-0.113
Token swim
Token ID9422
Feature activation+8.43e-03

Pos logit contributions
 anx+0.095
 maj+0.094
 mo+0.093
 accident+0.092
 aer+0.090
Neg logit contributions
chard-0.112
suit-0.105
 Labrador-0.105
 opener-0.104
 Cookie-0.097
Tokensuit
Token ID6063
Feature activation+3.21e-01

Pos logit contributions
chard+0.009
 Labrador+0.008
isi+0.007
 opener+0.007
Veter+0.007
Neg logit contributions
 anx-0.012
und-0.012
 mo-0.012
 maj-0.012
 accident-0.011
Token and
Token ID290
Feature activation-5.89e-03

Pos logit contributions
suit+0.414
chard+0.413
 Labrador+0.402
mit+0.387
 opener+0.384
Neg logit contributions
 anx-0.354
 accident-0.347
und-0.343
 tenth-0.340
 maj-0.339
Token jumped
Token ID11687
Feature activation-1.54e-02

Pos logit contributions
 tenth+0.007
 anx+0.007
 maj+0.006
und+0.006
 accident+0.006
Neg logit contributions
suit-0.008
chard-0.008
 Labrador-0.007
mit-0.007
 opener-0.007
Token in
Token ID287
Feature activation-4.87e-02

Pos logit contributions
 anx+0.019
 tenth+0.019
 maj+0.019
und+0.019
 accident+0.019
Neg logit contributions
suit-0.018
chard-0.018
 Labrador-0.017
mit-0.016
 opener-0.016
Token,
Token ID11
Feature activation+3.50e-03

Pos logit contributions
xc+0.065
 tenth+0.065
unct+0.065
 maj+0.064
und+0.063
Neg logit contributions
 as-0.050
suit-0.050
 opener-0.047
 Labrador-0.047
chard-0.047
Token feeling
Token ID4203
Feature activation-1.21e-01

Pos logit contributions
suit+0.004
chard+0.004
 Labrador+0.004
mit+0.004
 as+0.004
Neg logit contributions
 tenth-0.004
und-0.004
xc-0.004
 anx-0.004
 maj-0.004
Token on
Token ID319
Feature activation-8.71e-02

Pos logit contributions
 anx+0.010
 accident+0.010
 maj+0.010
 tenth+0.010
 mo+0.010
Neg logit contributions
chard-0.021
suit-0.020
 Labrador-0.020
mit-0.020
 opener-0.019
Token his
Token ID465
Feature activation-2.48e-02

Pos logit contributions
und+0.100
 tenth+0.099
 bru+0.098
 maj+0.097
 anx+0.097
Neg logit contributions
suit-0.107
chard-0.105
 as-0.101
 Labrador-0.099
mit-0.098
Token different
Token ID1180
Feature activation-2.41e-02

Pos logit contributions
 anx+0.027
 mo+0.027
 aer+0.027
 maj+0.026
 tenth+0.026
Neg logit contributions
chard-0.034
 opener-0.032
 Labrador-0.032
keepers-0.030
suit-0.030
Token colored
Token ID16396
Feature activation-1.89e-02

Pos logit contributions
 anx+0.032
 accident+0.031
 maj+0.031
 aer+0.031
 tenth+0.031
Neg logit contributions
chard-0.028
suit-0.026
 opener-0.025
 Labrador-0.025
mit-0.024
Token swim
Token ID9422
Feature activation+9.08e-03

Pos logit contributions
 aer+0.022
 mo+0.021
 maj+0.018
 anx+0.018
 tenth+0.018
Neg logit contributions
chard-0.027
mit-0.024
keepers-0.024
suit-0.024
pper-0.024
Tokensuit
Token ID6063
Feature activation+3.19e-01

Pos logit contributions
chard+0.007
 Labrador+0.006
isi+0.005
Veter+0.005
Same+0.005
Neg logit contributions
 mo-0.016
 anx-0.016
und-0.016
 maj-0.016
 sw-0.015
Token and
Token ID290
Feature activation-1.26e-03

Pos logit contributions
chard+0.452
 Labrador+0.439
suit+0.435
mit+0.418
 opener+0.416
Neg logit contributions
 anx-0.321
 accident-0.315
und-0.302
 mo-0.302
 maj-0.297
Token ran
Token ID4966
Feature activation-6.71e-02

Pos logit contributions
und+0.002
 tenth+0.002
xc+0.002
 bru+0.002
 maj+0.002
Neg logit contributions
suit-0.002
chard-0.001
mit-0.001
 as-0.001
 Labrador-0.001
Token out
Token ID503
Feature activation+1.50e-02

Pos logit contributions
 anx+0.071
 maj+0.069
 mo+0.068
 tenth+0.068
 accident+0.066
Neg logit contributions
chard-0.096
 Labrador-0.091
suit-0.087
 opener-0.086
mit-0.084
Token to
Token ID284
Feature activation-2.75e-02

Pos logit contributions
 as+0.016
suit+0.016
 opener+0.015
chard+0.015
mit+0.014
Neg logit contributions
 accident-0.020
und-0.020
 bru-0.019
 tenth-0.019
Acc-0.019
Token the
Token ID262
Feature activation-2.23e-03

Pos logit contributions
und+0.035
 tenth+0.034
 anx+0.034
 accident+0.034
 maj+0.034
Neg logit contributions
chard-0.031
suit-0.031
mit-0.029
 Labrador-0.029
 as-0.028
Token She
Token ID1375
Feature activation-3.91e-02

Pos logit contributions
 tenth+0.007
 accident+0.007
 anx+0.007
 maj+0.007
und+0.007
Neg logit contributions
suit-0.009
chard-0.008
 Labrador-0.008
mit-0.008
 opener-0.008
Token put
Token ID1234
Feature activation-1.46e-02

Pos logit contributions
 tenth+0.045
 maj+0.044
und+0.044
 anx+0.044
 accident+0.043
Neg logit contributions
suit-0.050
chard-0.049
 Labrador-0.046
mit-0.046
 opener-0.045
Token on
Token ID319
Feature activation-8.95e-02

Pos logit contributions
 anx+0.011
 accident+0.011
 maj+0.011
und+0.011
 mo+0.011
Neg logit contributions
suit-0.024
chard-0.024
 Labrador-0.023
mit-0.023
 opener-0.022
Token her
Token ID607
Feature activation+1.07e-02

Pos logit contributions
 tenth+0.103
 anx+0.103
und+0.102
 maj+0.102
 accident+0.101
Neg logit contributions
suit-0.111
chard-0.110
 Labrador-0.104
mit-0.102
 opener-0.100
Token swim
Token ID9422
Feature activation+7.69e-03

Pos logit contributions
chard+0.014
 Labrador+0.014
suit+0.014
 opener+0.014
keepers+0.013
Neg logit contributions
 anx-0.012
 mo-0.012
 maj-0.011
 aer-0.011
 accident-0.011
Tokensuit
Token ID6063
Feature activation+3.18e-01

Pos logit contributions
chard+0.005
 Labrador+0.004
isi+0.004
Veter+0.004
 Cookie+0.003
Neg logit contributions
 anx-0.014
 mo-0.014
 maj-0.014
und-0.014
 sw-0.013
Token and
Token ID290
Feature activation-4.35e-03

Pos logit contributions
chard+0.439
 Labrador+0.433
suit+0.430
 opener+0.414
mit+0.412
Neg logit contributions
 anx-0.328
 accident-0.317
und-0.306
 maj-0.303
 mo-0.302
Token grabbed
Token ID13176
Feature activation+1.91e-02

Pos logit contributions
 tenth+0.005
und+0.005
 maj+0.005
 anx+0.005
 accident+0.005
Neg logit contributions
suit-0.006
chard-0.005
 Labrador-0.005
mit-0.005
 opener-0.005
Token her
Token ID607
Feature activation+1.06e-02

Pos logit contributions
suit+0.026
chard+0.026
 Labrador+0.025
 opener+0.024
mit+0.024
Neg logit contributions
 anx-0.020
 accident-0.019
 maj-0.019
 tenth-0.019
und-0.019
Token float
Token ID12178
Feature activation+2.47e-02

Pos logit contributions
chard+0.015
 Labrador+0.014
 opener+0.014
isi+0.014
Veter+0.014
Neg logit contributions
 mo-0.011
 aer-0.011
 anx-0.011
 maj-0.010
 sne-0.010
Tokeny
Token ID88
Feature activation+3.43e-02

Pos logit contributions
chard+0.035
 Labrador+0.034
 opener+0.031
 Cookie+0.031
Veter+0.030
Neg logit contributions
 anx-0.025
xc-0.025
und-0.025
 aer-0.025
 maj-0.024
Token She
Token ID1375
Feature activation-2.80e-02

Pos logit contributions
suit+0.015
chard+0.015
mit+0.015
 Labrador+0.014
 opener+0.013
Neg logit contributions
 anx-0.010
 maj-0.010
und-0.010
 tenth-0.010
 mo-0.010
Token put
Token ID1234
Feature activation-1.09e-02

Pos logit contributions
 mo+0.026
 anx+0.026
 accident+0.024
 maj+0.023
 sne+0.023
Neg logit contributions
chard-0.045
 Labrador-0.042
 opener-0.040
suit-0.040
 stall-0.039
Token on
Token ID319
Feature activation-8.87e-02

Pos logit contributions
 accident+0.008
 mo+0.008
 anx+0.008
 maj+0.008
 aer+0.008
Neg logit contributions
suit-0.018
 Labrador-0.018
mit-0.018
chard-0.018
 opener-0.017
Token her
Token ID607
Feature activation+1.38e-02

Pos logit contributions
 anx+0.095
 tenth+0.093
 maj+0.093
 accident+0.093
und+0.091
Neg logit contributions
suit-0.122
chard-0.119
 Labrador-0.114
mit-0.112
 opener-0.111
Token swim
Token ID9422
Feature activation+9.98e-03

Pos logit contributions
chard+0.019
 Labrador+0.018
 opener+0.018
suit+0.018
keepers+0.017
Neg logit contributions
 mo-0.015
 aer-0.014
 anx-0.014
 maj-0.014
 accident-0.014
Tokensuit
Token ID6063
Feature activation+3.17e-01

Pos logit contributions
chard+0.007
 Labrador+0.007
isi+0.006
Veter+0.006
Same+0.006
Neg logit contributions
 mo-0.017
 maj-0.017
 sw-0.017
 anx-0.017
 aer-0.016
Token and
Token ID290
Feature activation+9.50e-03

Pos logit contributions
chard+0.430
suit+0.425
 Labrador+0.419
mit+0.406
 opener+0.398
Neg logit contributions
 anx-0.335
 accident-0.334
 tenth-0.319
und-0.317
 maj-0.315
Token ran
Token ID4966
Feature activation-6.19e-02

Pos logit contributions
suit+0.013
chard+0.013
 Labrador+0.012
mit+0.012
 opener+0.012
Neg logit contributions
 tenth-0.010
 anx-0.010
und-0.010
 maj-0.010
 accident-0.010
Token to
Token ID284
Feature activation-2.56e-02

Pos logit contributions
 anx+0.047
 mo+0.044
 maj+0.043
 tenth+0.042
 accident+0.039
Neg logit contributions
chard-0.108
 Labrador-0.106
pper-0.100
mit-0.098
 opener-0.097
Token the
Token ID262
Feature activation+1.02e-03

Pos logit contributions
 anx+0.024
 tenth+0.023
 mo+0.022
 accident+0.022
 maj+0.022
Neg logit contributions
suit-0.041
chard-0.038
 Labrador-0.038
mit-0.037
 opener-0.036
Token shore
Token ID15191
Feature activation-7.08e-02

Pos logit contributions
suit+0.001
chard+0.001
 Labrador+0.001
mit+0.001
 opener+0.001
Neg logit contributions
 anx-0.001
 tenth-0.001
 maj-0.001
und-0.001
 accident-0.001
TokenTim
Token ID14967
Feature activation-7.09e-02

Pos logit contributions
suit+0.021
chard+0.021
 Labrador+0.020
 opener+0.020
mit+0.019
Neg logit contributions
 maj-0.016
 anx-0.016
 tenth-0.016
 bru-0.016
 accident-0.016
Token grabbed
Token ID13176
Feature activation+2.14e-02

Pos logit contributions
 tenth+0.087
 maj+0.085
und+0.084
 anx+0.084
 mo+0.083
Neg logit contributions
suit-0.085
chard-0.080
 Labrador-0.079
mit-0.079
 as-0.075
Token the
Token ID262
Feature activation+8.92e-04

Pos logit contributions
suit+0.032
chard+0.031
 Labrador+0.030
mit+0.029
 opener+0.029
Neg logit contributions
 anx-0.021
 accident-0.020
 tenth-0.020
 maj-0.020
 mo-0.019
Token hook
Token ID8011
Feature activation+4.63e-02

Pos logit contributions
chard+0.001
 Labrador+0.001
suit+0.001
 Cookie+0.001
mit+0.001
Neg logit contributions
 tenth-0.001
 accident-0.001
 mo-0.001
 maj-0.001
 anx-0.001
Token and
Token ID290
Feature activation-3.26e-03

Pos logit contributions
chard+0.063
suit+0.062
 Labrador+0.061
mit+0.058
 opener+0.056
Neg logit contributions
 anx-0.049
 accident-0.047
 mo-0.046
 tenth-0.046
 maj-0.045
Token anx
Token ID7296
Feature activation-3.32e-01

Pos logit contributions
 anx+0.004
 mo+0.004
 accident+0.004
und+0.004
 maj+0.004
Neg logit contributions
suit-0.004
chard-0.004
 Labrador-0.004
mit-0.004
 opener-0.004
Tokeniously
Token ID6819
Feature activation-3.84e-02

Pos logit contributions
 tenth+0.452
und+0.450
 mo+0.435
 maj+0.431
 bru+0.417
Neg logit contributions
 as-0.358
mit-0.344
suit-0.327
chard-0.326
 Labrador-0.315
Token put
Token ID1234
Feature activation-1.28e-02

Pos logit contributions
 anx+0.050
 tenth+0.049
 accident+0.049
und+0.049
 maj+0.049
Neg logit contributions
suit-0.043
chard-0.042
 Labrador-0.040
mit-0.039
 opener-0.038
Token it
Token ID340
Feature activation+3.00e-03

Pos logit contributions
 anx+0.011
 accident+0.011
 mo+0.010
 maj+0.010
und+0.010
Neg logit contributions
chard-0.021
 Labrador-0.021
suit-0.020
mit-0.020
 stall-0.019
Token in
Token ID287
Feature activation-4.34e-02

Pos logit contributions
suit+0.004
 Labrador+0.004
chard+0.004
mit+0.004
 opener+0.004
Neg logit contributions
 anx-0.003
 maj-0.003
 accident-0.003
 tenth-0.003
und-0.003
Token the
Token ID262
Feature activation+9.18e-04

Pos logit contributions
 anx+0.041
 accident+0.041
 tenth+0.040
und+0.039
 maj+0.038
Neg logit contributions
suit-0.066
chard-0.065
mit-0.062
 Labrador-0.062
 opener-0.059
Token are
Token ID389
Feature activation+6.61e-03

Pos logit contributions
suit+0.162
chard+0.160
mit+0.152
 opener+0.151
 Labrador+0.150
Neg logit contributions
 anx-0.167
 tenth-0.164
und-0.163
 maj-0.163
 mo-0.161
Token you
Token ID345
Feature activation-1.61e-02

Pos logit contributions
chard+0.008
 as+0.007
suit+0.007
mit+0.007
 Coco+0.007
Neg logit contributions
 anx-0.009
und-0.009
 maj-0.008
 bru-0.008
 accident-0.008
Token?"
Token ID1701
Feature activation-4.14e-02

Pos logit contributions
 tenth+0.020
und+0.020
 anx+0.019
 maj+0.019
 mo+0.019
Neg logit contributions
chard-0.019
suit-0.018
mit-0.017
 Labrador-0.017
 as-0.017
Token she
Token ID673
Feature activation+2.26e-02

Pos logit contributions
 tenth+0.053
und+0.053
 maj+0.052
xc+0.051
 anx+0.051
Neg logit contributions
chard-0.048
suit-0.046
 Labrador-0.044
 as-0.043
 opener-0.043
Token shouted
Token ID17293
Feature activation-9.06e-03

Pos logit contributions
chard+0.027
suit+0.026
 Labrador+0.025
mit+0.024
 opener+0.024
Neg logit contributions
 tenth-0.028
und-0.028
 maj-0.028
 anx-0.027
 accident-0.027
Token anx
Token ID7296
Feature activation-3.32e-01

Pos logit contributions
und+0.012
 tenth+0.012
xc+0.012
 maj+0.012
 mo+0.012
Neg logit contributions
 as-0.010
suit-0.010
chard-0.010
 Labrador-0.009
 opener-0.009
Tokeniously
Token ID6819
Feature activation-3.91e-02

Pos logit contributions
und+0.436
 tenth+0.436
 maj+0.429
 mo+0.414
 anx+0.403
Neg logit contributions
 as-0.361
chard-0.361
mit-0.359
 Labrador-0.352
suit-0.350
Token.
Token ID13
Feature activation-6.92e-03

Pos logit contributions
und+0.054
xc+0.053
 tenth+0.053
 maj+0.051
 accident+0.051
Neg logit contributions
chard-0.041
 as-0.041
suit-0.040
mit-0.039
 Labrador-0.038
Token
Token ID220
Feature activation-2.63e-02

Pos logit contributions
 tenth+0.008
 accident+0.008
 anx+0.008
und+0.008
 maj+0.008
Neg logit contributions
chard-0.008
suit-0.008
 Labrador-0.008
mit-0.008
 opener-0.007
Token
Token ID198
Feature activation+1.42e-02

Pos logit contributions
 anx+0.028
und+0.028
 tenth+0.028
 maj+0.028
 accident+0.027
Neg logit contributions
suit-0.036
chard-0.035
 Labrador-0.034
mit-0.033
 stall-0.031
Token
Token ID198
Feature activation+1.45e-02

Pos logit contributions
chard+0.014
suit+0.014
 opener+0.014
 Labrador+0.014
 stall+0.013
Neg logit contributions
 anx-0.019
 accident-0.019
 tenth-0.019
 maj-0.019
xc-0.019
Token Both
Token ID5747
Feature activation+2.57e-02

Pos logit contributions
und+0.004
 anx+0.004
 maj+0.004
 Sad+0.004
Acc+0.004
Neg logit contributions
suit-0.009
chard-0.008
mit-0.008
pper-0.008
 Labrador-0.008
Token of
Token ID286
Feature activation+3.63e-03

Pos logit contributions
chard+0.044
suit+0.042
 Labrador+0.040
pper+0.039
mit+0.039
Neg logit contributions
 accident-0.022
 anx-0.022
 aer-0.021
 Sad-0.020
 tenth-0.020
Token them
Token ID606
Feature activation+3.83e-02

Pos logit contributions
suit+0.007
mit+0.006
chard+0.006
 pads+0.006
 Labrador+0.006
Neg logit contributions
 accident-0.003
 anx-0.003
und-0.003
 aer-0.003
 many-0.003
Token put
Token ID1234
Feature activation-1.18e-02

Pos logit contributions
suit+0.046
chard+0.046
mit+0.043
 Labrador+0.043
 opener+0.041
Neg logit contributions
 tenth-0.046
 maj-0.045
 anx-0.045
und-0.045
 accident-0.044
Token their
Token ID511
Feature activation+9.48e-05

Pos logit contributions
und+0.011
 accident+0.011
 tenth+0.011
 anx+0.011
 maj+0.011
Neg logit contributions
 Labrador-0.017
suit-0.017
chard-0.017
 opener-0.017
mit-0.016
Token anx
Token ID7296
Feature activation-3.31e-01

Pos logit contributions
suit+0.000
chard+0.000
 Labrador+0.000
 opener+0.000
mit+0.000
Neg logit contributions
 mo-0.000
 tenth-0.000
 anx-0.000
 maj-0.000
 accident-0.000
Tokenieties
Token ID9545
Feature activation-2.14e-02

Pos logit contributions
und+0.420
 tenth+0.417
 maj+0.407
 mo+0.394
 anx+0.392
Neg logit contributions
suit-0.370
mit-0.369
 Labrador-0.359
 as-0.357
chard-0.353
Token aside
Token ID7263
Feature activation+8.28e-02

Pos logit contributions
 anx+0.023
 accident+0.022
und+0.022
 tenth+0.022
 mo+0.022
Neg logit contributions
 Labrador-0.029
suit-0.028
chard-0.028
mit-0.027
 opener-0.027
Token.
Token ID13
Feature activation-5.98e-03

Pos logit contributions
suit+0.103
chard+0.101
 Labrador+0.096
mit+0.095
 opener+0.093
Neg logit contributions
 tenth-0.095
 anx-0.095
 maj-0.094
und-0.094
 accident-0.093
Token From
Token ID3574
Feature activation-8.10e-02

Pos logit contributions
und+0.005
 anx+0.005
 Sad+0.005
 maj+0.005
 tenth+0.004
Neg logit contributions
suit-0.010
chard-0.010
mit-0.010
pper-0.009
 Labrador-0.009
Token then
Token ID788
Feature activation+1.43e-02

Pos logit contributions
und+0.068
 accident+0.068
 aer+0.066
 anx+0.066
 maj+0.063
Neg logit contributions
suit-0.132
 opener-0.129
chard-0.123
 Labrador-0.123
mit-0.117
Token!
Token ID0
Feature activation-8.79e-03

Pos logit contributions
 as+0.141
chard+0.139
suit+0.136
 opener+0.128
resses+0.125
Neg logit contributions
xc-0.185
und-0.184
 accident-0.181
 anx-0.179
 bru-0.175
Tokenâ
Token ID22940
Feature activation-1.03e-01

Pos logit contributions
 anx+0.009
und+0.009
 tenth+0.009
 maj+0.009
 accident+0.009
Neg logit contributions
suit-0.012
chard-0.012
 Labrador-0.011
mit-0.011
 opener-0.011
Token
Token ID26391
Feature activation+2.76e-02

Pos logit contributions
 anx+0.124
 tenth+0.123
 accident+0.122
und+0.122
 maj+0.122
Neg logit contributions
suit-0.124
chard-0.123
 Labrador-0.117
mit-0.114
 opener-0.113
Token Jim
Token ID5395
Feature activation+8.08e-03

Pos logit contributions
chard+0.037
suit+0.036
 Labrador+0.035
mit+0.034
 opener+0.034
Neg logit contributions
und-0.030
 maj-0.029
 anx-0.029
 accident-0.029
 tenth-0.029
Token said
Token ID531
Feature activation-4.26e-03

Pos logit contributions
suit+0.010
chard+0.010
 Labrador+0.009
mit+0.009
 opener+0.009
Neg logit contributions
 tenth-0.009
 anx-0.009
 maj-0.009
und-0.009
 accident-0.009
Token anx
Token ID7296
Feature activation-3.31e-01

Pos logit contributions
 tenth+0.006
und+0.005
 maj+0.005
 accident+0.005
xc+0.005
Neg logit contributions
suit-0.005
 as-0.005
chard-0.005
mit-0.004
 Labrador-0.004
Tokeniously
Token ID6819
Feature activation-3.46e-02

Pos logit contributions
 tenth+0.413
und+0.408
 maj+0.401
 mo+0.398
 anx+0.384
Neg logit contributions
suit-0.385
mit-0.381
 Labrador-0.375
chard-0.366
 as-0.362
Token.
Token ID13
Feature activation-8.52e-03

Pos logit contributions
und+0.043
 tenth+0.043
 maj+0.043
xc+0.043
 anx+0.042
Neg logit contributions
suit-0.040
chard-0.037
mit-0.037
 as-0.037
 Labrador-0.036
Token
Token ID198
Feature activation+1.16e-02

Pos logit contributions
 anx+0.009
 tenth+0.009
und+0.009
 maj+0.009
 accident+0.009
Neg logit contributions
suit-0.011
chard-0.011
 Labrador-0.010
mit-0.010
 opener-0.010
Token
Token ID198
Feature activation+1.19e-02

Pos logit contributions
suit+0.013
chard+0.013
 Labrador+0.012
 opener+0.012
mit+0.012
Neg logit contributions
 anx-0.015
 tenth-0.015
 maj-0.015
 accident-0.014
und-0.014
TokenHis
Token ID6653
Feature activation+8.28e-03

Pos logit contributions
suit+0.016
chard+0.015
 Labrador+0.015
 opener+0.014
mit+0.014
Neg logit contributions
 maj-0.013
 anx-0.013
 tenth-0.013
 accident-0.013
und-0.012
Token
Token ID198
Feature activation+1.65e-02

Pos logit contributions
suit+0.016
chard+0.016
 opener+0.016
 Labrador+0.016
mit+0.015
Neg logit contributions
 accident-0.021
 tenth-0.021
 maj-0.021
 anx-0.021
 bru-0.021
TokenM
Token ID44
Feature activation-1.28e-01

Pos logit contributions
suit+0.021
 Labrador+0.020
chard+0.020
 opener+0.020
mit+0.019
Neg logit contributions
 accident-0.019
 maj-0.018
 anx-0.018
 bru-0.018
 tenth-0.018
Tokenummy
Token ID13513
Feature activation-3.75e-02

Pos logit contributions
 maj+0.146
 bru+0.143
 tenth+0.143
und+0.143
 accident+0.141
Neg logit contributions
suit-0.152
 Labrador-0.152
 opener-0.148
mit-0.148
 as-0.141
Token watched
Token ID7342
Feature activation-1.36e-01

Pos logit contributions
 tenth+0.048
 maj+0.046
und+0.046
 bru+0.045
xc+0.045
Neg logit contributions
suit-0.042
chard-0.041
 Labrador-0.040
mit-0.040
 as-0.039
Token him
Token ID683
Feature activation+1.27e-02

Pos logit contributions
 tenth+0.209
 bru+0.206
Acc+0.203
und+0.203
xc+0.201
Neg logit contributions
 as-0.134
suit-0.111
 opener-0.110
chard-0.106
mit-0.103
Token anx
Token ID7296
Feature activation-3.30e-01

Pos logit contributions
suit+0.016
chard+0.016
 Labrador+0.015
mit+0.014
 opener+0.014
Neg logit contributions
 anx-0.015
 tenth-0.015
und-0.015
 accident-0.015
 maj-0.015
Tokeniously
Token ID6819
Feature activation-3.55e-02

Pos logit contributions
 tenth+0.461
und+0.461
 maj+0.453
 mo+0.445
 aer+0.418
Neg logit contributions
 as-0.342
mit-0.340
 Labrador-0.323
chard-0.322
suit-0.307
Token as
Token ID355
Feature activation+2.92e-01

Pos logit contributions
 tenth+0.050
xc+0.050
und+0.050
 maj+0.049
 accident+0.049
Neg logit contributions
 as-0.035
chard-0.035
suit-0.034
mit-0.033
 stall-0.031
Token Ali
Token ID12104
Feature activation-7.76e-02

Pos logit contributions
 as+0.315
chard+0.314
suit+0.301
 opener+0.294
mit+0.288
Neg logit contributions
xc-0.377
und-0.375
 bru-0.372
 accident-0.370
 tenth-0.368
Token waited
Token ID13488
Feature activation-5.59e-02

Pos logit contributions
 tenth+0.097
 maj+0.096
und+0.093
xc+0.092
 anx+0.092
Neg logit contributions
suit-0.091
chard-0.087
mit-0.084
 Labrador-0.084
 as-0.083
Token for
Token ID329
Feature activation-8.75e-02

Pos logit contributions
xc+0.080
und+0.080
 tenth+0.078
 bru+0.076
Acc+0.074
Neg logit contributions
 as-0.063
suit-0.054
chard-0.053
 opener-0.051
mit-0.049
Token the
Token ID262
Feature activation-1.58e-03

Pos logit contributions
suit+0.011
chard+0.011
 Labrador+0.010
mit+0.010
 opener+0.010
Neg logit contributions
 anx-0.012
 tenth-0.012
 maj-0.011
und-0.011
 accident-0.011
Token answer
Token ID3280
Feature activation+2.55e-02

Pos logit contributions
und+0.002
 anx+0.002
 maj+0.002
xc+0.002
 tenth+0.002
Neg logit contributions
suit-0.002
chard-0.002
 Labrador-0.002
mit-0.002
 opener-0.002
Token.
Token ID13
Feature activation-5.94e-03

Pos logit contributions
chard+0.038
suit+0.037
 Labrador+0.035
 opener+0.034
mit+0.034
Neg logit contributions
 accident-0.024
 anx-0.024
 aer-0.023
und-0.023
 tenth-0.022
Token She
Token ID1375
Feature activation-3.54e-02

Pos logit contributions
 anx+0.006
 tenth+0.006
 maj+0.006
und+0.006
 accident+0.006
Neg logit contributions
suit-0.008
chard-0.008
 Labrador-0.008
mit-0.008
 opener-0.007
Token asked
Token ID1965
Feature activation+1.27e-03

Pos logit contributions
 tenth+0.043
 maj+0.042
und+0.042
 accident+0.041
 anx+0.040
Neg logit contributions
suit-0.043
chard-0.042
 Labrador-0.040
mit-0.040
 as-0.038
Token anx
Token ID7296
Feature activation-3.30e-01

Pos logit contributions
suit+0.002
chard+0.002
 Labrador+0.002
mit+0.001
 opener+0.001
Neg logit contributions
 tenth-0.001
 anx-0.001
und-0.001
 maj-0.001
 accident-0.001
Tokeniously
Token ID6819
Feature activation-3.74e-02

Pos logit contributions
und+0.436
 tenth+0.434
 maj+0.426
 mo+0.417
 bru+0.402
Neg logit contributions
mit-0.356
 as-0.350
 Labrador-0.347
chard-0.345
suit-0.334
Token,
Token ID11
Feature activation+3.47e-03

Pos logit contributions
und+0.047
 tenth+0.046
 maj+0.046
 accident+0.046
xc+0.046
Neg logit contributions
chard-0.043
suit-0.042
 as-0.041
mit-0.041
 Labrador-0.040
Token "
Token ID366
Feature activation+4.95e-02

Pos logit contributions
chard+0.003
suit+0.003
 as+0.003
 Labrador+0.003
mit+0.003
Neg logit contributions
 tenth-0.005
und-0.005
 accident-0.005
xc-0.005
 maj-0.005
TokenWhat
Token ID2061
Feature activation+8.68e-02

Pos logit contributions
suit+0.054
chard+0.052
 Labrador+0.049
mit+0.049
 opener+0.048
Neg logit contributions
 anx-0.066
 tenth-0.066
und-0.065
 accident-0.065
 maj-0.065
Token is
Token ID318
Feature activation-3.59e-03

Pos logit contributions
suit+0.120
chard+0.118
 Labrador+0.113
mit+0.110
 opener+0.109
Neg logit contributions
 anx-0.090
 tenth-0.090
 maj-0.088
 accident-0.088
und-0.088
Token smile
Token ID8212
Feature activation+5.54e-03

Pos logit contributions
und+0.047
 tenth+0.047
 anx+0.047
 maj+0.046
xc+0.046
Neg logit contributions
suit-0.039
chard-0.037
 Labrador-0.037
mit-0.035
 opener-0.035
Token.
Token ID13
Feature activation+1.90e-03

Pos logit contributions
suit+0.006
chard+0.006
 as+0.006
 Labrador+0.006
mit+0.006
Neg logit contributions
 tenth-0.007
 maj-0.007
und-0.007
 anx-0.007
xc-0.007
Token
Token ID198
Feature activation+2.11e-02

Pos logit contributions
suit+0.002
chard+0.002
 Labrador+0.002
mit+0.002
 opener+0.002
Neg logit contributions
 anx-0.002
und-0.002
 maj-0.002
 tenth-0.002
 accident-0.002
Token
Token ID198
Feature activation+2.18e-02

Pos logit contributions
suit+0.028
chard+0.027
 Labrador+0.026
mit+0.026
 opener+0.025
Neg logit contributions
 anx-0.023
und-0.023
 maj-0.023
 tenth-0.023
 accident-0.023
TokenMary
Token ID24119
Feature activation-5.36e-02

Pos logit contributions
suit+0.027
chard+0.027
 Labrador+0.025
mit+0.025
 opener+0.025
Neg logit contributions
 anx-0.026
 tenth-0.025
und-0.025
 maj-0.025
 accident-0.025
Token anx
Token ID7296
Feature activation-3.28e-01

Pos logit contributions
 tenth+0.065
und+0.063
 maj+0.063
 anx+0.062
 accident+0.061
Neg logit contributions
suit-0.065
chard-0.063
mit-0.061
 Labrador-0.060
 as-0.058
Tokeniously
Token ID6819
Feature activation-3.53e-02

Pos logit contributions
 tenth+0.444
und+0.432
 mo+0.432
 maj+0.429
 anx+0.403
Neg logit contributions
mit-0.344
 as-0.341
 Labrador-0.337
chard-0.330
suit-0.319
Token opened
Token ID4721
Feature activation+7.76e-02

Pos logit contributions
 tenth+0.049
 maj+0.047
und+0.046
xc+0.046
Acc+0.045
Neg logit contributions
chard-0.038
suit-0.038
 as-0.038
mit-0.036
 Labrador-0.034
Token up
Token ID510
Feature activation+4.88e-02

Pos logit contributions
suit+0.104
chard+0.101
 Labrador+0.097
mit+0.095
 opener+0.094
Neg logit contributions
 anx-0.085
 tenth-0.083
 maj-0.083
und-0.083
 accident-0.082
Token the
Token ID262
Feature activation+7.83e-03

Pos logit contributions
suit+0.064
chard+0.063
 Labrador+0.060
mit+0.059
 opener+0.058
Neg logit contributions
 anx-0.054
 tenth-0.054
und-0.053
 maj-0.053
 accident-0.053
Token pass
Token ID1208
Feature activation+9.87e-02

Pos logit contributions
chard+0.011
suit+0.010
 Labrador+0.010
 pads+0.010
pper+0.010
Neg logit contributions
 tenth-0.008
 aer-0.008
 mo-0.008
 maj-0.008
 accident-0.008
Token swept
Token ID17676
Feature activation-8.48e-02

Pos logit contributions
 tenth+0.022
 maj+0.022
und+0.022
 anx+0.022
xc+0.021
Neg logit contributions
suit-0.024
chard-0.024
mit-0.022
 opener-0.022
 Labrador-0.022
Token and
Token ID290
Feature activation-2.13e-03

Pos logit contributions
 tenth+0.097
 anx+0.097
 maj+0.097
 accident+0.096
und+0.096
Neg logit contributions
suit-0.106
chard-0.104
 Labrador-0.099
mit-0.097
 opener-0.096
Token scrub
Token ID27268
Feature activation-2.09e-01

Pos logit contributions
 maj+0.002
 tenth+0.002
und+0.002
xc+0.002
 anx+0.002
Neg logit contributions
suit-0.003
chard-0.003
 as-0.003
 opener-0.002
mit-0.002
Tokenbed
Token ID3077
Feature activation-8.06e-02

Pos logit contributions
 tenth+0.232
 maj+0.227
 anx+0.225
und+0.220
 accident+0.219
Neg logit contributions
suit-0.275
chard-0.262
 Labrador-0.252
 as-0.251
mit-0.248
Token and
Token ID290
Feature activation-2.68e-03

Pos logit contributions
 tenth+0.092
 maj+0.090
 anx+0.089
 accident+0.089
und+0.088
Neg logit contributions
suit-0.102
chard-0.098
 as-0.095
 Labrador-0.094
mit-0.093
Token mo
Token ID6941
Feature activation-3.19e-01

Pos logit contributions
 maj+0.003
xc+0.003
und+0.003
 tenth+0.003
 aer+0.003
Neg logit contributions
suit-0.004
chard-0.004
 as-0.004
 opener-0.004
mit-0.003
Tokenpped
Token ID1496
Feature activation+4.05e-02

Pos logit contributions
und+0.393
 anx+0.373
 accident+0.366
 tenth+0.361
 maj+0.360
Neg logit contributions
chard-0.407
suit-0.400
 Labrador-0.381
mit-0.372
 stall-0.369
Token and
Token ID290
Feature activation-2.45e-03

Pos logit contributions
suit+0.055
chard+0.054
mit+0.053
 Labrador+0.052
 opener+0.049
Neg logit contributions
 anx-0.045
 accident-0.044
und-0.043
 mo-0.043
 tenth-0.043
Token d
Token ID288
Feature activation-1.11e-01

Pos logit contributions
 maj+0.003
und+0.003
 tenth+0.003
xc+0.003
 anx+0.003
Neg logit contributions
chard-0.003
suit-0.003
 as-0.003
 opener-0.003
mit-0.003
Tokenusted
Token ID8459
Feature activation-1.03e-01

Pos logit contributions
 anx+0.102
und+0.101
 maj+0.096
 mo+0.095
 accident+0.095
Neg logit contributions
chard-0.173
suit-0.166
 Labrador-0.163
mit-0.159
 stall-0.158
Token until
Token ID1566
Feature activation-1.47e-01

Pos logit contributions
 tenth+0.114
 anx+0.114
 maj+0.113
und+0.112
 accident+0.111
Neg logit contributions
suit-0.133
chard-0.130
 Labrador-0.124
mit-0.122
 opener-0.120
Token daughter
Token ID4957
Feature activation-2.63e-02

Pos logit contributions
suit+0.015
chard+0.015
 Labrador+0.014
mit+0.014
 opener+0.014
Neg logit contributions
 anx-0.012
 tenth-0.012
 maj-0.012
 accident-0.012
und-0.012
Token managed
Token ID5257
Feature activation+2.22e-02

Pos logit contributions
 anx+0.028
 accident+0.027
 maj+0.027
 mo+0.027
und+0.027
Neg logit contributions
suit-0.036
chard-0.036
 Labrador-0.034
 opener-0.033
mit-0.032
Token to
Token ID284
Feature activation-2.90e-02

Pos logit contributions
suit+0.036
chard+0.034
 Labrador+0.034
mit+0.033
 opener+0.032
Neg logit contributions
 anx-0.019
 accident-0.019
 maj-0.019
 tenth-0.018
und-0.018
Token find
Token ID1064
Feature activation-2.65e-02

Pos logit contributions
 maj+0.040
und+0.039
xc+0.039
Acc+0.039
 anx+0.038
Neg logit contributions
chard-0.030
 as-0.028
 opener-0.028
mit-0.028
suit-0.027
Token the
Token ID262
Feature activation-2.56e-03

Pos logit contributions
 anx+0.028
 accident+0.027
 tenth+0.027
 maj+0.027
und+0.027
Neg logit contributions
suit-0.037
chard-0.036
 Labrador-0.035
mit-0.034
 opener-0.033
Token tenth
Token ID24395
Feature activation-3.18e-01

Pos logit contributions
und+0.003
 anx+0.003
 tenth+0.003
 maj+0.003
xc+0.003
Neg logit contributions
suit-0.003
chard-0.003
mit-0.003
 Labrador-0.003
 stall-0.003
Token flower
Token ID15061
Feature activation+1.60e-02

Pos logit contributions
 accident+0.349
 anx+0.346
 tenth+0.343
 maj+0.338
und+0.337
Neg logit contributions
chard-0.420
suit-0.415
 Labrador-0.398
 opener-0.386
mit-0.386
Token so
Token ID523
Feature activation+6.19e-03

Pos logit contributions
suit+0.022
chard+0.022
 Labrador+0.022
mit+0.021
 opener+0.020
Neg logit contributions
 anx-0.016
 accident-0.016
und-0.015
 tenth-0.015
 mo-0.015
Token quickly
Token ID2952
Feature activation-6.92e-02

Pos logit contributions
chard+0.007
suit+0.007
mit+0.007
 as+0.007
 Labrador+0.006
Neg logit contributions
 tenth-0.008
und-0.008
xc-0.008
 anx-0.008
 accident-0.008
Token.
Token ID13
Feature activation-5.50e-03

Pos logit contributions
 tenth+0.082
 anx+0.082
 maj+0.081
und+0.081
 accident+0.080
Neg logit contributions
suit-0.084
chard-0.082
 Labrador-0.078
mit-0.077
 opener-0.076
Token Annie
Token ID25860
Feature activation-1.06e-01

Pos logit contributions
 anx+0.006
und+0.005
 maj+0.005
 tenth+0.005
 accident+0.005
Neg logit contributions
suit-0.008
chard-0.008
 Labrador-0.007
mit-0.007
 opener-0.007
TokenWe
Token ID1135
Feature activation-5.69e-02

Pos logit contributions
suit+0.180
chard+0.178
 Labrador+0.173
mit+0.169
 opener+0.163
Neg logit contributions
 tenth-0.128
 anx-0.124
 maj-0.123
und-0.123
 accident-0.122
Token are
Token ID389
Feature activation+1.38e-02

Pos logit contributions
 anx+0.070
 tenth+0.070
 maj+0.070
und+0.069
 accident+0.069
Neg logit contributions
suit-0.067
chard-0.064
 Labrador-0.061
mit-0.061
 opener-0.060
Token going
Token ID1016
Feature activation+2.68e-03

Pos logit contributions
chard+0.017
suit+0.017
 Labrador+0.016
mit+0.015
 as+0.015
Neg logit contributions
und-0.016
 tenth-0.016
 anx-0.016
 maj-0.016
 accident-0.016
Token to
Token ID284
Feature activation-2.60e-02

Pos logit contributions
suit+0.003
chard+0.003
 as+0.003
mit+0.003
 opener+0.003
Neg logit contributions
und-0.003
 anx-0.003
Acc-0.003
 accident-0.003
 tenth-0.003
Token the
Token ID262
Feature activation-4.54e-03

Pos logit contributions
und+0.035
Acc+0.035
 anx+0.035
xc+0.035
 maj+0.035
Neg logit contributions
 as-0.026
chard-0.025
 opener-0.025
 Labrador-0.024
 Cookie-0.024
Token tenth
Token ID24395
Feature activation-3.16e-01

Pos logit contributions
und+0.006
xc+0.006
 bru+0.006
 anx+0.006
itive+0.005
Neg logit contributions
chard-0.005
 stall-0.005
suit-0.005
 Labrador-0.005
mit-0.004
Token floor
Token ID4314
Feature activation-2.76e-02

Pos logit contributions
 anx+0.381
 maj+0.381
und+0.381
Acc+0.378
xc+0.376
Neg logit contributions
suit-0.378
chard-0.351
 Labrador-0.339
 stall-0.338
mit-0.336
Token,
Token ID11
Feature activation+1.21e-02

Pos logit contributions
xc+0.035
Acc+0.035
 maj+0.034
und+0.034
 anx+0.034
Neg logit contributions
suit-0.031
chard-0.030
 as-0.030
 opener-0.028
 stall-0.027
Token where
Token ID810
Feature activation+4.23e-02

Pos logit contributions
suit+0.014
chard+0.014
 Labrador+0.013
mit+0.013
 opener+0.012
Neg logit contributions
 anx-0.015
 tenth-0.015
 accident-0.015
 maj-0.015
und-0.015
Token I
Token ID314
Feature activation+3.94e-02

Pos logit contributions
chard+0.042
 as+0.039
suit+0.037
mit+0.035
 Coco+0.035
Neg logit contributions
und-0.061
 anx-0.060
Acc-0.058
 bru-0.058
unct-0.057
Token have
Token ID423
Feature activation+1.03e-03

Pos logit contributions
chard+0.054
suit+0.052
 Labrador+0.051
mit+0.048
 stall+0.047
Neg logit contributions
 anx-0.043
 mo-0.043
 accident-0.043
und-0.042
 tenth-0.041
Token You
Token ID921
Feature activation-6.19e-02

Pos logit contributions
suit+0.017
chard+0.017
mit+0.016
 Labrador+0.016
 opener+0.016
Neg logit contributions
und-0.012
 anx-0.012
 maj-0.011
 tenth-0.011
 accident-0.011
Token could
Token ID714
Feature activation+1.31e-01

Pos logit contributions
 accident+0.068
 anx+0.067
 tenth+0.067
und+0.067
 maj+0.065
Neg logit contributions
suit-0.081
chard-0.080
 Labrador-0.078
mit-0.075
 stall-0.073
Token have
Token ID423
Feature activation-4.07e-03

Pos logit contributions
chard+0.158
suit+0.157
 Labrador+0.150
 opener+0.148
mit+0.147
Neg logit contributions
 anx-0.154
 maj-0.153
und-0.153
 tenth-0.152
xc-0.149
Token caused
Token ID4073
Feature activation-1.13e-01

Pos logit contributions
 anx+0.005
 tenth+0.005
 maj+0.005
und+0.005
xc+0.005
Neg logit contributions
chard-0.005
suit-0.005
 as-0.005
 Labrador-0.005
 opener-0.004
Token an
Token ID281
Feature activation-9.70e-02

Pos logit contributions
 anx+0.126
 tenth+0.125
 accident+0.124
und+0.124
 maj+0.123
Neg logit contributions
suit-0.147
chard-0.144
 Labrador-0.137
mit-0.135
 opener-0.133
Token accident
Token ID5778
Feature activation-3.16e-01

Pos logit contributions
und+0.119
 maj+0.114
 tenth+0.113
xc+0.111
 anx+0.110
Neg logit contributions
 Labrador-0.115
chard-0.113
suit-0.112
 as-0.112
mit-0.110
Token.
Token ID13
Feature activation-3.09e-03

Pos logit contributions
 maj+0.403
 tenth+0.398
Acc+0.391
 anx+0.391
 mo+0.389
Neg logit contributions
 as-0.364
suit-0.364
chard-0.333
 opener-0.330
 Labrador-0.323
Token Don
Token ID2094
Feature activation-2.56e-02

Pos logit contributions
 anx+0.003
 tenth+0.003
und+0.003
 maj+0.003
 accident+0.003
Neg logit contributions
suit-0.004
chard-0.004
 Labrador-0.004
mit-0.004
 opener-0.004
Token't
Token ID470
Feature activation+3.98e-02

Pos logit contributions
 accident+0.025
 aer+0.025
und+0.025
 tenth+0.025
xc+0.023
Neg logit contributions
suit-0.039
chard-0.038
mit-0.037
 Labrador-0.036
 stall-0.035
Token you
Token ID345
Feature activation-2.12e-02

Pos logit contributions
chard+0.043
 as+0.043
suit+0.041
 opener+0.040
mit+0.039
Neg logit contributions
 anx-0.054
 maj-0.054
und-0.052
xc-0.051
 tenth-0.051
Token ever
Token ID1683
Feature activation-1.21e-02

Pos logit contributions
 anx+0.024
 accident+0.024
 tenth+0.024
und+0.023
 maj+0.023
Neg logit contributions
suit-0.027
chard-0.027
 Labrador-0.026
mit-0.025
 opener-0.025
Token to
Token ID284
Feature activation-2.99e-02

Pos logit contributions
suit+0.098
chard+0.095
 as+0.092
mit+0.091
 Labrador+0.091
Neg logit contributions
 tenth-0.099
 anx-0.098
 accident-0.096
und-0.096
xc-0.095
Token see
Token ID766
Feature activation+6.30e-04

Pos logit contributions
xc+0.047
und+0.047
 tenth+0.045
 anx+0.045
 bru+0.045
Neg logit contributions
 as-0.027
mit-0.024
chard-0.023
 opener-0.023
 Labrador-0.021
Token the
Token ID262
Feature activation+6.57e-05

Pos logit contributions
suit+0.001
chard+0.001
 Labrador+0.001
mit+0.001
 opener+0.001
Neg logit contributions
 anx-0.001
 tenth-0.001
und-0.001
 maj-0.001
 accident-0.001
Token little
Token ID1310
Feature activation-8.21e-02

Pos logit contributions
chard+0.000
suit+0.000
 Labrador+0.000
 opener+0.000
mit+0.000
Neg logit contributions
 accident-0.000
 anx-0.000
 maj-0.000
 tenth-0.000
 mo-0.000
Token boat
Token ID8848
Feature activation-9.79e-02

Pos logit contributions
 anx+0.077
 tenth+0.077
 maj+0.077
 accident+0.076
und+0.075
Neg logit contributions
suit-0.119
chard-0.118
 Labrador-0.114
mit-0.111
 opener-0.111
Token mo
Token ID6941
Feature activation-3.16e-01

Pos logit contributions
 anx+0.112
 tenth+0.111
 maj+0.110
und+0.110
 accident+0.110
Neg logit contributions
suit-0.123
chard-0.121
 Labrador-0.116
mit-0.113
 opener-0.111
Tokenored
Token ID1850
Feature activation-4.21e-03

Pos logit contributions
 anx+0.386
 tenth+0.384
 maj+0.380
und+0.379
 accident+0.379
Neg logit contributions
suit-0.376
chard-0.370
 Labrador-0.351
mit-0.344
 opener-0.339
Token there
Token ID612
Feature activation-1.01e-02

Pos logit contributions
 anx+0.004
 accident+0.004
und+0.004
 maj+0.004
 tenth+0.004
Neg logit contributions
suit-0.006
chard-0.006
 Labrador-0.006
 opener-0.005
mit-0.005
Token.
Token ID13
Feature activation-4.02e-03

Pos logit contributions
 anx+0.011
 tenth+0.011
 maj+0.011
und+0.011
 accident+0.011
Neg logit contributions
suit-0.013
chard-0.013
 Labrador-0.012
mit-0.012
 opener-0.012
Token The
Token ID383
Feature activation-8.24e-02

Pos logit contributions
und+0.004
 anx+0.004
 maj+0.004
 tenth+0.004
 accident+0.004
Neg logit contributions
suit-0.006
chard-0.006
 Labrador-0.006
mit-0.005
 opener-0.005
Token boat
Token ID8848
Feature activation-9.79e-02

Pos logit contributions
 anx+0.081
 accident+0.080
 tenth+0.080
 maj+0.079
 mo+0.078
Neg logit contributions
suit-0.118
chard-0.118
 Labrador-0.113
 opener-0.109
mit-0.109
Token They
Token ID1119
Feature activation-1.01e-02

Pos logit contributions
und+0.026
 anx+0.026
 tenth+0.025
 accident+0.025
 maj+0.025
Neg logit contributions
suit-0.037
chard-0.036
mit-0.035
 Labrador-0.034
 opener-0.034
Token got
Token ID1392
Feature activation+1.14e-01

Pos logit contributions
 maj+0.011
 tenth+0.011
und+0.011
 anx+0.011
 accident+0.011
Neg logit contributions
suit-0.013
chard-0.013
mit-0.012
 Labrador-0.012
 as-0.011
Token into
Token ID656
Feature activation+8.99e-02

Pos logit contributions
suit+0.137
chard+0.134
 Labrador+0.128
mit+0.125
 opener+0.123
Neg logit contributions
 anx-0.138
 tenth-0.137
 maj-0.135
 accident-0.135
und-0.135
Token a
Token ID257
Feature activation+3.72e-03

Pos logit contributions
suit+0.093
 as+0.093
chard+0.092
 stall+0.091
 opener+0.090
Neg logit contributions
xc-0.116
 maj-0.114
und-0.113
unct-0.113
Acc-0.112
Token car
Token ID1097
Feature activation-4.73e-02

Pos logit contributions
suit+0.005
chard+0.005
 Labrador+0.004
 stall+0.004
 opener+0.004
Neg logit contributions
und-0.004
 maj-0.004
 anx-0.004
xc-0.004
 tenth-0.004
Token accident
Token ID5778
Feature activation-3.16e-01

Pos logit contributions
 maj+0.060
 tenth+0.060
 anx+0.060
xc+0.058
und+0.058
Neg logit contributions
suit-0.054
chard-0.051
 Labrador-0.049
 as-0.049
 opener-0.048
Token and
Token ID290
Feature activation+1.68e-03

Pos logit contributions
 tenth+0.447
 maj+0.433
 mo+0.431
xc+0.426
 bru+0.424
Neg logit contributions
suit-0.332
 as-0.325
chard-0.290
 Labrador-0.287
 opener-0.277
Token Bob
Token ID5811
Feature activation+5.74e-02

Pos logit contributions
suit+0.002
chard+0.002
 Labrador+0.002
mit+0.002
 opener+0.002
Neg logit contributions
 anx-0.002
 tenth-0.002
 accident-0.002
und-0.002
 maj-0.002
Token got
Token ID1392
Feature activation+1.18e-01

Pos logit contributions
suit+0.069
chard+0.065
 Labrador+0.063
mit+0.062
 as+0.060
Neg logit contributions
 tenth-0.071
 maj-0.070
 anx-0.069
und-0.068
 accident-0.067
Token hurt
Token ID5938
Feature activation-1.56e-02

Pos logit contributions
chard+0.126
suit+0.126
 as+0.116
 Labrador+0.115
mit+0.113
Neg logit contributions
und-0.155
 maj-0.155
 tenth-0.154
 anx-0.154
 accident-0.150
Token.
Token ID13
Feature activation-9.96e-03

Pos logit contributions
 maj+0.021
 tenth+0.021
 anx+0.021
xc+0.020
 mo+0.020
Neg logit contributions
suit-0.016
 as-0.016
chard-0.016
 Labrador-0.015
 opener-0.014
Token?"
Token ID1701
Feature activation-3.12e-02

Pos logit contributions
suit+0.067
chard+0.066
 Labrador+0.064
mit+0.062
 opener+0.061
Neg logit contributions
 anx-0.052
 tenth-0.051
 maj-0.051
und-0.051
 accident-0.051
Token Ben
Token ID3932
Feature activation+4.79e-02

Pos logit contributions
und+0.040
 tenth+0.040
 anx+0.040
 maj+0.040
 accident+0.039
Neg logit contributions
suit-0.035
chard-0.034
 Labrador-0.032
mit-0.032
 opener-0.031
Token asked
Token ID1965
Feature activation+7.81e-03

Pos logit contributions
suit+0.060
chard+0.059
 Labrador+0.056
mit+0.055
 opener+0.054
Neg logit contributions
 tenth-0.055
 anx-0.055
 maj-0.055
und-0.054
 accident-0.054
Token for
Token ID329
Feature activation-7.67e-02

Pos logit contributions
suit+0.010
chard+0.010
 Labrador+0.009
mit+0.009
 opener+0.009
Neg logit contributions
 tenth-0.009
 anx-0.009
und-0.009
 maj-0.009
 accident-0.009
Token the
Token ID262
Feature activation+3.10e-03

Pos logit contributions
und+0.093
 maj+0.090
 anx+0.088
xc+0.088
 tenth+0.087
Neg logit contributions
chard-0.093
suit-0.091
 as-0.087
 opener-0.086
 Labrador-0.086
Token tenth
Token ID24395
Feature activation-3.16e-01

Pos logit contributions
suit+0.003
 stall+0.003
 uniforms+0.003
 opener+0.003
 as+0.003
Neg logit contributions
und-0.005
xc-0.005
Acc-0.005
unct-0.004
 maj-0.004
Token time
Token ID640
Feature activation+4.06e-02

Pos logit contributions
 anx+0.380
und+0.378
 tenth+0.377
 maj+0.377
 accident+0.368
Neg logit contributions
suit-0.380
chard-0.367
 Labrador-0.353
mit-0.345
 opener-0.341
Token.
Token ID13
Feature activation+4.88e-03

Pos logit contributions
suit+0.047
chard+0.046
 as+0.043
 Labrador+0.043
 opener+0.042
Neg logit contributions
und-0.050
 maj-0.050
 tenth-0.050
xc-0.050
 anx-0.049
Token
Token ID198
Feature activation+2.59e-02

Pos logit contributions
suit+0.007
chard+0.007
mit+0.006
 Labrador+0.006
 opener+0.006
Neg logit contributions
 anx-0.005
 maj-0.005
 tenth-0.005
 accident-0.005
und-0.005
Token
Token ID198
Feature activation+2.57e-02

Pos logit contributions
suit+0.032
chard+0.032
 Labrador+0.030
mit+0.029
 opener+0.029
Neg logit contributions
 anx-0.031
 tenth-0.030
 maj-0.030
und-0.030
 accident-0.030
Token"
Token ID1
Feature activation+1.27e-01

Pos logit contributions
chard+0.027
suit+0.027
 Labrador+0.026
mit+0.025
 opener+0.024
Neg logit contributions
 anx-0.035
 tenth-0.035
 maj-0.034
 accident-0.034
und-0.034
Token was
Token ID373
Feature activation+1.13e-02

Pos logit contributions
 anx+0.067
 accident+0.067
 mo+0.065
und+0.065
 maj+0.064
Neg logit contributions
suit-0.086
chard-0.085
 Labrador-0.082
mit-0.079
 opener-0.077
Token surrounded
Token ID11191
Feature activation+7.27e-02

Pos logit contributions
suit+0.016
 Labrador+0.015
chard+0.015
mit+0.015
 opener+0.014
Neg logit contributions
 anx-0.013
 accident-0.013
 mo-0.013
 maj-0.012
 tenth-0.012
Token with
Token ID351
Feature activation+1.35e-02

Pos logit contributions
suit+0.100
chard+0.097
 Labrador+0.095
mit+0.094
 opener+0.092
Neg logit contributions
 anx-0.076
und-0.076
 accident-0.075
 maj-0.073
xc-0.073
Token a
Token ID257
Feature activation+1.32e-02

Pos logit contributions
suit+0.019
chard+0.018
 Labrador+0.018
mit+0.017
 opener+0.017
Neg logit contributions
 anx-0.014
 tenth-0.014
 accident-0.014
 maj-0.014
und-0.014
Token wet
Token ID9583
Feature activation-1.59e-02

Pos logit contributions
suit+0.017
chard+0.017
 Labrador+0.016
mit+0.016
 stall+0.015
Neg logit contributions
und-0.015
 anx-0.015
 tenth-0.015
 accident-0.014
 maj-0.014
Token mo
Token ID6941
Feature activation-3.16e-01

Pos logit contributions
 tenth+0.021
xc+0.021
 anx+0.020
und+0.020
Acc+0.020
Neg logit contributions
suit-0.019
chard-0.017
 Labrador-0.017
 as-0.017
mit-0.016
Tokenat
Token ID265
Feature activation+5.32e-02

Pos logit contributions
 maj+0.436
 tenth+0.435
 anx+0.433
 bru+0.431
und+0.425
Neg logit contributions
suit-0.322
chard-0.308
mit-0.293
 opener-0.292
 Labrador-0.288
Token.
Token ID13
Feature activation-4.08e-03

Pos logit contributions
suit+0.065
chard+0.062
mit+0.059
 Labrador+0.059
 opener+0.058
Neg logit contributions
 tenth-0.064
 maj-0.064
 anx-0.063
und-0.063
 accident-0.062
Token
Token ID220
Feature activation-2.74e-02

Pos logit contributions
 anx+0.004
und+0.004
 tenth+0.004
 maj+0.004
 mo+0.004
Neg logit contributions
suit-0.006
chard-0.006
 Labrador-0.006
mit-0.006
 opener-0.005
Token
Token ID198
Feature activation+1.68e-02

Pos logit contributions
und+0.031
 anx+0.031
 maj+0.031
 tenth+0.030
 accident+0.030
Neg logit contributions
suit-0.035
chard-0.035
 Labrador-0.033
mit-0.032
 opener-0.031
Token
Token ID198
Feature activation+1.65e-02

Pos logit contributions
suit+0.021
chard+0.020
 Labrador+0.019
mit+0.019
 opener+0.018
Neg logit contributions
 anx-0.020
 tenth-0.020
und-0.020
 maj-0.020
 accident-0.020
Token pushed
Token ID7121
Feature activation-3.95e-03

Pos logit contributions
 tenth+0.002
und+0.002
 maj+0.002
xc+0.002
 anx+0.002
Neg logit contributions
suit-0.002
chard-0.002
mit-0.002
 as-0.002
 Labrador-0.002
Token the
Token ID262
Feature activation+9.39e-04

Pos logit contributions
 anx+0.005
 tenth+0.005
 accident+0.004
und+0.004
 maj+0.004
Neg logit contributions
suit-0.005
chard-0.005
 Labrador-0.005
mit-0.005
 opener-0.005
Token button
Token ID4936
Feature activation-7.70e-02

Pos logit contributions
suit+0.001
chard+0.001
mit+0.001
 Labrador+0.001
 opener+0.001
Neg logit contributions
und-0.001
 anx-0.001
 maj-0.001
xc-0.001
Acc-0.001
Token for
Token ID329
Feature activation-8.07e-02

Pos logit contributions
 anx+0.092
 tenth+0.091
und+0.091
 maj+0.091
 accident+0.090
Neg logit contributions
suit-0.093
chard-0.091
 Labrador-0.086
mit-0.085
 opener-0.084
Token the
Token ID262
Feature activation+2.05e-03

Pos logit contributions
 anx+0.095
 tenth+0.094
 accident+0.093
 maj+0.093
und+0.092
Neg logit contributions
suit-0.101
chard-0.099
 Labrador-0.094
mit-0.093
 opener-0.090
Token tenth
Token ID24395
Feature activation-3.15e-01

Pos logit contributions
suit+0.002
chard+0.002
 Labrador+0.002
mit+0.002
 opener+0.002
Neg logit contributions
und-0.003
 anx-0.003
 maj-0.003
xc-0.003
Acc-0.002
Token floor
Token ID4314
Feature activation-2.61e-02

Pos logit contributions
 anx+0.375
 tenth+0.372
 accident+0.369
und+0.368
 maj+0.368
Neg logit contributions
suit-0.382
chard-0.378
 Labrador-0.359
mit-0.351
 opener-0.345
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
 accident+0.028
 anx+0.028
 tenth+0.028
 maj+0.027
und+0.027
Neg logit contributions
chard-0.035
suit-0.035
 Labrador-0.034
mit-0.032
 opener-0.031
Token The
Token ID383
Feature activation-8.07e-02

Pos logit contributions
suit+0.013
chard+0.013
 opener+0.012
 Coco+0.012
 Labrador+0.012
Neg logit contributions
xc-0.015
 anx-0.015
unct-0.015
 accident-0.015
 tenth-0.015
Token elevator
Token ID20932
Feature activation-1.75e-01

Pos logit contributions
und+0.112
xc+0.102
 anx+0.099
Acc+0.098
 maj+0.098
Neg logit contributions
suit-0.091
chard-0.087
 opener-0.084
 Labrador-0.083
 stall-0.081
Token started
Token ID2067
Feature activation+1.32e-01

Pos logit contributions
 anx+0.211
 tenth+0.210
und+0.209
 maj+0.208
 accident+0.205
Neg logit contributions
suit-0.209
chard-0.203
 Labrador-0.192
mit-0.189
 opener-0.188
Token had
Token ID550
Feature activation+5.37e-03

Pos logit contributions
 tenth+0.099
 maj+0.096
xc+0.093
und+0.091
 accident+0.091
Neg logit contributions
suit-0.083
chard-0.079
 Labrador-0.076
 as-0.076
mit-0.076
Token broken
Token ID5445
Feature activation-5.81e-02

Pos logit contributions
suit+0.009
chard+0.009
 Labrador+0.008
mit+0.008
 opener+0.008
Neg logit contributions
 anx-0.004
 tenth-0.004
 accident-0.004
und-0.004
 maj-0.004
Token the
Token ID262
Feature activation-1.06e-02

Pos logit contributions
 accident+0.054
 aer+0.054
 anx+0.054
und+0.052
 mo+0.050
Neg logit contributions
suit-0.092
chard-0.088
mit-0.087
 opener-0.085
 Labrador-0.084
Token notebook
Token ID20922
Feature activation+9.54e-02

Pos logit contributions
und+0.014
 anx+0.014
 maj+0.014
 tenth+0.014
xc+0.014
Neg logit contributions
suit-0.011
chard-0.011
 Labrador-0.010
mit-0.010
 opener-0.010
Token by
Token ID416
Feature activation-5.59e-03

Pos logit contributions
suit+0.123
chard+0.121
 Labrador+0.118
mit+0.115
 opener+0.111
Neg logit contributions
 anx-0.107
 accident-0.106
 tenth-0.104
und-0.103
 maj-0.101
Token accident
Token ID5778
Feature activation-3.15e-01

Pos logit contributions
 anx+0.006
 tenth+0.006
 maj+0.006
und+0.006
xc+0.006
Neg logit contributions
suit-0.007
chard-0.007
 Labrador-0.007
mit-0.007
 opener-0.006
Token.
Token ID13
Feature activation-6.59e-03

Pos logit contributions
 anx+0.355
 accident+0.350
und+0.348
 tenth+0.345
 maj+0.342
Neg logit contributions
chard-0.410
suit-0.406
 Labrador-0.385
mit-0.378
 opener-0.371
Token She
Token ID1375
Feature activation-3.45e-02

Pos logit contributions
 anx+0.007
 tenth+0.007
und+0.007
 maj+0.006
 accident+0.006
Neg logit contributions
suit-0.009
chard-0.009
 Labrador-0.009
mit-0.009
 opener-0.008
Token was
Token ID373
Feature activation+9.69e-03

Pos logit contributions
 tenth+0.043
 maj+0.043
 anx+0.041
und+0.041
xc+0.041
Neg logit contributions
suit-0.040
chard-0.039
 Labrador-0.037
mit-0.037
 as-0.036
Token very
Token ID845
Feature activation+1.05e-01

Pos logit contributions
suit+0.013
chard+0.012
 Labrador+0.012
 opener+0.011
mit+0.011
Neg logit contributions
 accident-0.011
 anx-0.011
 mo-0.011
 tenth-0.011
und-0.011
Token sad
Token ID6507
Feature activation-1.96e-02

Pos logit contributions
suit+0.134
chard+0.131
 Labrador+0.126
mit+0.123
 opener+0.121
Neg logit contributions
 anx-0.121
 tenth-0.119
 maj-0.118
 accident-0.118
und-0.118
TokenThe
Token ID464
Feature activation-6.87e-02

Pos logit contributions
suit+0.027
chard+0.026
 Labrador+0.025
mit+0.025
 opener+0.024
Neg logit contributions
und-0.015
 tenth-0.015
 anx-0.015
 maj-0.015
 accident-0.015
Token little
Token ID1310
Feature activation-8.07e-02

Pos logit contributions
und+0.069
 tenth+0.066
 anx+0.066
 maj+0.065
xc+0.064
Neg logit contributions
suit-0.100
chard-0.098
 Labrador-0.094
mit-0.094
 opener-0.091
Token boy
Token ID2933
Feature activation-6.88e-02

Pos logit contributions
und+0.087
 anx+0.086
 tenth+0.085
 maj+0.084
 accident+0.081
Neg logit contributions
suit-0.111
chard-0.108
 Labrador-0.105
mit-0.102
 opener-0.099
Token dug
Token ID18735
Feature activation-1.17e-01

Pos logit contributions
 tenth+0.098
 maj+0.093
und+0.092
xc+0.092
 bru+0.091
Neg logit contributions
suit-0.070
mit-0.068
 as-0.067
chard-0.063
 Labrador-0.062
Token a
Token ID257
Feature activation+1.37e-02

Pos logit contributions
 tenth+0.157
 bru+0.151
und+0.151
xc+0.150
unct+0.150
Neg logit contributions
suit-0.129
 as-0.128
chard-0.116
 opener-0.114
mit-0.112
Token mo
Token ID6941
Feature activation-3.15e-01

Pos logit contributions
suit+0.018
chard+0.017
mit+0.016
 stall+0.016
 Labrador+0.015
Neg logit contributions
und-0.016
xc-0.016
 accident-0.015
 tenth-0.015
 anx-0.015
Tokenat
Token ID265
Feature activation+5.27e-02

Pos logit contributions
 maj+0.472
 bru+0.471
 tenth+0.464
 anx+0.456
 aer+0.448
Neg logit contributions
suit-0.288
 opener-0.264
chard-0.264
mit-0.263
pper-0.258
Token around
Token ID1088
Feature activation+7.51e-02

Pos logit contributions
suit+0.059
 as+0.058
 opener+0.055
chard+0.054
mit+0.053
Neg logit contributions
 bru-0.069
 tenth-0.068
Acc-0.068
 maj-0.068
xc-0.067
Token the
Token ID262
Feature activation+2.96e-03

Pos logit contributions
 as+0.078
chard+0.070
suit+0.067
mit+0.064
 opener+0.064
Neg logit contributions
xc-0.109
 tenth-0.107
unct-0.107
und-0.103
 bru-0.103
Token sand
Token ID6450
Feature activation-9.45e-02

Pos logit contributions
suit+0.004
chard+0.004
 Labrador+0.003
mit+0.003
 opener+0.003
Neg logit contributions
 anx-0.003
und-0.003
 tenth-0.003
 maj-0.003
 accident-0.003
Tokencastle
Token ID18676
Feature activation-6.32e-02

Pos logit contributions
 tenth+0.113
 anx+0.112
 maj+0.111
und+0.111
xc+0.110
Neg logit contributions
suit-0.115
chard-0.111
mit-0.105
 Labrador-0.104
 opener-0.103
Token.
Token ID13
Feature activation-1.96e-03

Pos logit contributions
 anx+0.074
 tenth+0.074
und+0.073
 maj+0.073
 accident+0.073
Neg logit contributions
suit-0.080
chard-0.079
 Labrador-0.075
mit-0.074
 opener-0.073
Token They
Token ID1119
Feature activation-1.73e-02

Pos logit contributions
und+0.002
 anx+0.002
 maj+0.002
 tenth+0.002
 accident+0.002
Neg logit contributions
suit-0.003
chard-0.003
mit-0.003
 Labrador-0.003
 opener-0.003
Token d
Token ID288
Feature activation-1.09e-01

Pos logit contributions
und+0.021
 maj+0.021
 bru+0.021
xc+0.021
 tenth+0.021
Neg logit contributions
chard-0.020
suit-0.020
 as-0.019
 opener-0.019
mit-0.018
Tokenusted
Token ID8459
Feature activation-9.91e-02

Pos logit contributions
 anx+0.109
und+0.109
 maj+0.105
 accident+0.104
 tenth+0.104
Neg logit contributions
chard-0.157
suit-0.155
 Labrador-0.148
mit-0.145
 stall-0.143
Token and
Token ID290
Feature activation-8.19e-04

Pos logit contributions
 tenth+0.120
 maj+0.117
 anx+0.117
und+0.117
 accident+0.116
Neg logit contributions
suit-0.118
chard-0.113
 as-0.110
mit-0.107
 Labrador-0.107
Token mo
Token ID6941
Feature activation-3.15e-01

Pos logit contributions
xc+0.001
 maj+0.001
und+0.001
 tenth+0.001
 aer+0.001
Neg logit contributions
suit-0.001
 as-0.001
chard-0.001
 opener-0.001
mit-0.001
Tokenpped
Token ID1496
Feature activation+4.33e-02

Pos logit contributions
und+0.398
 anx+0.387
 accident+0.379
 tenth+0.377
 maj+0.375
Neg logit contributions
chard-0.380
suit-0.379
 Labrador-0.356
mit-0.347
 stall-0.342
Token,
Token ID11
Feature activation+6.38e-03

Pos logit contributions
suit+0.056
chard+0.055
mit+0.053
 Labrador+0.053
 opener+0.050
Neg logit contributions
 anx-0.050
 accident-0.049
und-0.048
 tenth-0.048
 maj-0.048
Token they
Token ID484
Feature activation+9.74e-03

Pos logit contributions
suit+0.007
 as+0.007
chard+0.007
 opener+0.007
mit+0.006
Neg logit contributions
xc-0.008
und-0.008
 tenth-0.008
 maj-0.008
unct-0.008
Token put
Token ID1234
Feature activation-1.27e-02

Pos logit contributions
suit+0.011
chard+0.011
 as+0.010
 opener+0.010
mit+0.010
Neg logit contributions
und-0.012
 tenth-0.012
 maj-0.012
xc-0.012
 accident-0.012
Token away
Token ID1497
Feature activation+3.19e-02

Pos logit contributions
 anx+0.013
 accident+0.013
und+0.013
 maj+0.013
 tenth+0.013
Neg logit contributions
chard-0.018
suit-0.018
 Labrador-0.017
mit-0.017
 opener-0.017
Token surrounded
Token ID11191
Feature activation+7.45e-02

Pos logit contributions
 anx+0.003
 mo+0.002
 tenth+0.002
 maj+0.002
 accident+0.002
Neg logit contributions
suit-0.003
 Labrador-0.003
mit-0.003
chard-0.003
 opener-0.003
Token by
Token ID416
Feature activation-3.11e-03

Pos logit contributions
suit+0.111
 Labrador+0.105
chard+0.104
mit+0.103
 opener+0.103
Neg logit contributions
und-0.071
 maj-0.070
 anx-0.069
 accident-0.069
xc-0.068
Token a
Token ID257
Feature activation+1.13e-02

Pos logit contributions
xc+0.004
 tenth+0.004
 anx+0.004
 bru+0.004
Acc+0.004
Neg logit contributions
suit-0.003
chard-0.003
 as-0.003
 stall-0.003
mit-0.003
Token shimmer
Token ID34493
Feature activation+3.19e-02

Pos logit contributions
suit+0.014
chard+0.014
 stall+0.013
 Labrador+0.013
mit+0.013
Neg logit contributions
xc-0.013
und-0.013
 anx-0.013
 accident-0.013
 maj-0.013
Tokening
Token ID278
Feature activation-9.96e-02

Pos logit contributions
chard+0.046
suit+0.045
 Labrador+0.044
mit+0.043
 opener+0.042
Neg logit contributions
 anx-0.031
und-0.030
 accident-0.030
 tenth-0.030
 maj-0.029
Token mo
Token ID6941
Feature activation-3.14e-01

Pos logit contributions
xc+0.132
Acc+0.132
 anx+0.127
 tenth+0.126
unct+0.125
Neg logit contributions
suit-0.116
chard-0.111
 as-0.107
 stall-0.106
 Labrador-0.103
Tokenat
Token ID265
Feature activation+5.44e-02

Pos logit contributions
 anx+0.422
 tenth+0.419
 maj+0.416
und+0.414
 accident+0.414
Neg logit contributions
suit-0.336
chard-0.330
 Labrador-0.312
mit-0.304
 opener-0.300
Token.
Token ID13
Feature activation-5.92e-03

Pos logit contributions
suit+0.064
chard+0.062
 Labrador+0.058
 opener+0.058
mit+0.058
Neg logit contributions
 maj-0.067
 tenth-0.066
 anx-0.066
 accident-0.066
und-0.065
Token
Token ID220
Feature activation-2.84e-02

Pos logit contributions
 tenth+0.006
 anx+0.006
 accident+0.006
und+0.006
 maj+0.006
Neg logit contributions
suit-0.008
chard-0.008
 Labrador-0.008
mit-0.007
 opener-0.007
Token
Token ID198
Feature activation+1.44e-02

Pos logit contributions
und+0.029
 maj+0.028
 anx+0.028
 tenth+0.027
 accident+0.027
Neg logit contributions
suit-0.041
chard-0.040
 Labrador-0.038
mit-0.038
 stall-0.036
Token
Token ID198
Feature activation+1.45e-02

Pos logit contributions
suit+0.015
chard+0.014
 opener+0.014
 Labrador+0.014
 stall+0.013
Neg logit contributions
 accident-0.020
 anx-0.020
 tenth-0.020
xc-0.019
 maj-0.019
Token and
Token ID290
Feature activation-1.87e-03

Pos logit contributions
suit+0.163
chard+0.161
 Labrador+0.154
mit+0.148
 opener+0.148
Neg logit contributions
 anx-0.123
 tenth-0.120
 maj-0.119
und-0.119
 accident-0.118
Token she
Token ID673
Feature activation+2.25e-02

Pos logit contributions
 tenth+0.002
 anx+0.002
 maj+0.002
und+0.002
 accident+0.002
Neg logit contributions
suit-0.002
chard-0.002
 Labrador-0.002
mit-0.002
 opener-0.002
Token soon
Token ID2582
Feature activation+3.64e-03

Pos logit contributions
suit+0.033
chard+0.033
 Labrador+0.032
 opener+0.031
 stall+0.030
Neg logit contributions
 anx-0.022
 mo-0.021
 accident-0.021
 maj-0.021
und-0.020
Token became
Token ID2627
Feature activation+9.64e-02

Pos logit contributions
suit+0.005
chard+0.005
 Labrador+0.005
 opener+0.005
mit+0.005
Neg logit contributions
 accident-0.004
 anx-0.004
und-0.004
 maj-0.004
 mo-0.003
Token known
Token ID1900
Feature activation-2.46e-02

Pos logit contributions
suit+0.139
 opener+0.133
 Labrador+0.131
chard+0.129
mit+0.124
Neg logit contributions
 mo-0.097
 anx-0.097
 maj-0.096
 aer-0.095
 accident-0.093
Token as
Token ID355
Feature activation+2.93e-01

Pos logit contributions
 anx+0.040
 tenth+0.039
 maj+0.039
 accident+0.039
und+0.038
Neg logit contributions
suit-0.020
chard-0.019
 Labrador-0.019
 opener-0.018
mit-0.017
Token the
Token ID262
Feature activation-9.39e-04

Pos logit contributions
suit+0.392
chard+0.375
 Labrador+0.370
 opener+0.355
mit+0.354
Neg logit contributions
 anx-0.320
 maj-0.318
 tenth-0.314
 accident-0.311
und-0.310
Token "
Token ID366
Feature activation+4.83e-02

Pos logit contributions
und+0.001
 anx+0.001
 maj+0.001
 tenth+0.001
xc+0.001
Neg logit contributions
suit-0.001
chard-0.001
 Labrador-0.001
mit-0.001
 stall-0.001
Tokenpur
Token ID14225
Feature activation-8.51e-02

Pos logit contributions
suit+0.061
chard+0.060
 Labrador+0.058
mit+0.058
 stall+0.056
Neg logit contributions
und-0.056
 accident-0.052
 aer-0.052
 tenth-0.052
xc-0.052
Tokenple
Token ID1154
Feature activation+1.04e-01

Pos logit contributions
und+0.100
xc+0.094
 maj+0.093
 accident+0.092
 anx+0.092
Neg logit contributions
chard-0.118
suit-0.116
 stall-0.103
 opener-0.101
mit-0.099
Token l
Token ID300
Feature activation+9.77e-02

Pos logit contributions
chard+0.155
suit+0.152
 opener+0.140
 Labrador+0.140
mit+0.137
Neg logit contributions
und-0.099
 accident-0.096
 maj-0.095
 tenth-0.095
 aer-0.095
Token in
Token ID287
Feature activation-4.74e-02

Pos logit contributions
 Labrador+0.128
chard+0.124
suit+0.123
 Cookie+0.118
mit+0.115
Neg logit contributions
 accident-0.068
 anx-0.066
und-0.063
 maj-0.062
 mo-0.060
Token the
Token ID262
Feature activation-4.97e-03

Pos logit contributions
 accident+0.040
 tenth+0.038
 mo+0.037
 aer+0.037
und+0.037
Neg logit contributions
suit-0.080
 Labrador-0.076
mit-0.073
chard-0.072
 pads-0.070
Token garden
Token ID11376
Feature activation-1.16e-02

Pos logit contributions
 anx+0.006
und+0.006
 tenth+0.006
 maj+0.006
 accident+0.006
Neg logit contributions
suit-0.006
chard-0.006
 Labrador-0.006
mit-0.005
 opener-0.005
Token.
Token ID13
Feature activation-4.03e-03

Pos logit contributions
 anx+0.012
 accident+0.012
und+0.012
 tenth+0.012
 maj+0.012
Neg logit contributions
suit-0.015
chard-0.015
 Labrador-0.015
mit-0.014
 opener-0.014
Token They
Token ID1119
Feature activation-1.60e-02

Pos logit contributions
und+0.003
 anx+0.003
 Other+0.003
 maj+0.003
 Sadly+0.003
Neg logit contributions
chard-0.006
pper-0.006
suit-0.006
guard-0.006
 Labrador-0.006
Token chose
Token ID7690
Feature activation+2.34e-01

Pos logit contributions
 tenth+0.018
 maj+0.017
und+0.017
 anx+0.017
 accident+0.017
Neg logit contributions
suit-0.021
chard-0.021
mit-0.019
 opener-0.019
 Labrador-0.019
Token a
Token ID257
Feature activation+1.19e-02

Pos logit contributions
suit+0.343
chard+0.326
 Labrador+0.326
mit+0.314
 opener+0.309
Neg logit contributions
 accident-0.227
 maj-0.225
 anx-0.225
 mo-0.222
und-0.222
Token fragile
Token ID21049
Feature activation+6.96e-02

Pos logit contributions
suit+0.015
chard+0.015
 Labrador+0.015
mit+0.014
 opener+0.014
Neg logit contributions
 mo-0.013
 anx-0.013
 tenth-0.013
 maj-0.013
 accident-0.012
Token shell
Token ID7582
Feature activation+6.38e-03

Pos logit contributions
suit+0.082
chard+0.081
 Labrador+0.077
mit+0.075
 opener+0.074
Neg logit contributions
 anx-0.084
 tenth-0.084
 accident-0.083
 maj-0.083
und-0.083
Token and
Token ID290
Feature activation+4.58e-04

Pos logit contributions
chard+0.009
suit+0.009
 Labrador+0.009
mit+0.008
 Cookie+0.008
Neg logit contributions
 anx-0.006
 accident-0.006
 mo-0.006
und-0.006
 maj-0.006
Token gently
Token ID15165
Feature activation-1.60e-02

Pos logit contributions
suit+0.001
 Labrador+0.001
chard+0.001
mit+0.001
 stall+0.001
Neg logit contributions
 anx-0.000
 mo-0.000
 accident-0.000
 maj-0.000
und-0.000
Token but
Token ID475
Feature activation-7.76e-02

Pos logit contributions
 as+0.016
suit+0.015
chard+0.014
 Labrador+0.014
mit+0.014
Neg logit contributions
 tenth-0.020
 maj-0.020
xc-0.020
und-0.019
 anx-0.019
Token ran
Token ID4966
Feature activation-6.97e-02

Pos logit contributions
 anx+0.067
 accident+0.062
 mo+0.057
 bru+0.054
Acc+0.054
Neg logit contributions
suit-0.124
resses-0.119
 opener-0.118
 Labrador-0.117
umes-0.113
Token out
Token ID503
Feature activation+1.07e-02

Pos logit contributions
 anx+0.080
 accident+0.076
 tenth+0.073
 maj+0.073
 mo+0.072
Neg logit contributions
chard-0.094
suit-0.088
 Labrador-0.086
mit-0.086
 opener-0.082
Token as
Token ID355
Feature activation+2.92e-01

Pos logit contributions
suit+0.011
 as+0.011
chard+0.011
 opener+0.011
 Labrador+0.011
Neg logit contributions
und-0.014
 tenth-0.014
 maj-0.014
 accident-0.014
xc-0.014
Token fast
Token ID3049
Feature activation+6.77e-02

Pos logit contributions
chard+0.284
suit+0.273
mit+0.248
 as+0.245
 opener+0.244
Neg logit contributions
und-0.433
 tenth-0.423
xc-0.422
 maj-0.422
 accident-0.420
Token as
Token ID355
Feature activation+2.92e-01

Pos logit contributions
chard+0.028
suit+0.026
 Labrador+0.022
mit+0.022
 opener+0.018
Neg logit contributions
 anx-0.138
 accident-0.136
und-0.133
 mo-0.132
 maj-0.132
Token he
Token ID339
Feature activation-2.05e-02

Pos logit contributions
suit+0.472
 Labrador+0.457
chard+0.433
mit+0.425
 stall+0.422
Neg logit contributions
 anx-0.248
 accident-0.227
 tenth-0.226
 maj-0.223
 mo-0.222
Token could
Token ID714
Feature activation+1.16e-01

Pos logit contributions
 anx+0.029
 mo+0.028
und+0.028
 maj+0.028
 tenth+0.028
Neg logit contributions
chard-0.021
suit-0.020
 Labrador-0.020
mit-0.019
 stall-0.018
Token.
Token ID13
Feature activation-9.11e-03

Pos logit contributions
suit+0.146
chard+0.144
 Labrador+0.137
mit+0.134
 opener+0.132
Neg logit contributions
 anx-0.134
 tenth-0.133
 maj-0.132
und-0.132
 accident-0.132
Token
Token ID220
Feature activation-2.85e-02

Pos logit contributions
 Sadly+0.006
und+0.005
 anx+0.005
 Other+0.005
 bru+0.005
Neg logit contributions
chard-0.016
pper-0.015
suit-0.015
 CDs-0.015
mit-0.015
Token
Token ID198
Feature activation+1.34e-02

Pos logit contributions
und+0.024
 anx+0.021
 maj+0.020
 bru+0.019
 Other+0.018
Neg logit contributions
chard-0.048
suit-0.048
mit-0.044
 Labrador-0.044
 stall-0.043
Token everything
Token ID2279
Feature activation-3.55e-02

Pos logit contributions
 anx+0.047
 accident+0.046
 tenth+0.046
 maj+0.046
und+0.046
Neg logit contributions
suit-0.058
chard-0.057
 Labrador-0.055
mit-0.053
 opener-0.052
Token glow
Token ID19634
Feature activation+3.97e-02

Pos logit contributions
und+0.041
 anx+0.041
 tenth+0.041
 accident+0.041
 maj+0.041
Neg logit contributions
suit-0.044
chard-0.043
mit-0.040
 Labrador-0.040
 opener-0.040
Token.
Token ID13
Feature activation+4.26e-04

Pos logit contributions
suit+0.053
chard+0.053
 Labrador+0.052
mit+0.049
 opener+0.048
Neg logit contributions
 anx-0.043
 maj-0.042
 accident-0.042
und-0.042
 tenth-0.042
Token The
Token ID383
Feature activation-8.40e-02

Pos logit contributions
suit+0.001
chard+0.001
 Labrador+0.001
mit+0.001
 opener+0.001
Neg logit contributions
 anx-0.000
und-0.000
 tenth-0.000
 maj-0.000
 accident-0.000
Token all
Token ID477
Feature activation-1.06e-01

Pos logit contributions
 accident+0.078
 anx+0.076
 maj+0.074
 mo+0.074
 tenth+0.073
Neg logit contributions
suit-0.129
 opener-0.125
 pads-0.121
pper-0.121
mit-0.120
Tokenigator
Token ID23823
Feature activation+2.44e-01

Pos logit contributions
 anx+0.187
 tenth+0.186
und+0.185
 maj+0.185
 accident+0.185
Neg logit contributions
suit-0.069
chard-0.067
 Labrador-0.061
mit-0.058
 opener-0.056
Token and
Token ID290
Feature activation-2.79e-04

Pos logit contributions
suit+0.287
chard+0.281
mit+0.269
 Labrador+0.264
 as+0.260
Neg logit contributions
 tenth-0.303
 maj-0.294
und-0.291
xc-0.289
 anx-0.289
Token his
Token ID465
Feature activation-2.44e-02

Pos logit contributions
 anx+0.000
 tenth+0.000
 accident+0.000
und+0.000
 maj+0.000
Neg logit contributions
suit-0.000
chard-0.000
 Labrador-0.000
mit-0.000
 opener-0.000
Token friends
Token ID2460
Feature activation+2.13e-02

Pos logit contributions
 anx+0.021
 maj+0.021
 mo+0.021
 aer+0.020
 accident+0.020
Neg logit contributions
chard-0.040
suit-0.039
 opener-0.039
 Labrador-0.036
och-0.036
Token were
Token ID547
Feature activation+4.88e-03

Pos logit contributions
suit+0.023
mit+0.022
chard+0.022
 as+0.022
 Labrador+0.020
Neg logit contributions
 bru-0.028
 tenth-0.028
 maj-0.028
xc-0.028
Acc-0.027
Token amazed
Token ID24562
Feature activation+2.87e-02

Pos logit contributions
suit+0.007
chard+0.007
 Labrador+0.007
 opener+0.006
mit+0.006
Neg logit contributions
 anx-0.005
 mo-0.005
 accident-0.005
 maj-0.005
 tenth-0.005
Token standing
Token ID5055
Feature activation-3.71e-02

Pos logit contributions
suit+0.027
 Labrador+0.026
mit+0.025
 opener+0.025
manship+0.025
Neg logit contributions
 mo-0.017
 accident-0.016
 maj-0.016
 anx-0.016
 tenth-0.016
Token there
Token ID612
Feature activation-8.65e-03

Pos logit contributions
und+0.043
 tenth+0.042
 anx+0.042
 accident+0.042
xc+0.042
Neg logit contributions
chard-0.047
suit-0.047
 stall-0.042
mit-0.042
 Labrador-0.042
Token.
Token ID13
Feature activation+7.95e-03

Pos logit contributions
 anx+0.009
 accident+0.009
 tenth+0.009
 maj+0.009
und+0.008
Neg logit contributions
suit-0.012
chard-0.012
 Labrador-0.012
mit-0.011
 opener-0.011
Token He
Token ID679
Feature activation-1.64e-02

Pos logit contributions
mit+0.013
suit+0.013
chard+0.013
 Labrador+0.012
pper+0.012
Neg logit contributions
 tenth-0.006
 anx-0.006
 maj-0.006
 bru-0.006
 accident-0.006
Token was
Token ID373
Feature activation+1.73e-02

Pos logit contributions
 mo+0.015
 anx+0.015
 maj+0.014
 accident+0.014
 aer+0.014
Neg logit contributions
suit-0.025
chard-0.024
 Labrador-0.024
mit-0.023
 opener-0.022
Token wearing
Token ID5762
Feature activation+1.97e-01

Pos logit contributions
suit+0.027
 Labrador+0.025
 pads+0.024
manship+0.024
mit+0.024
Neg logit contributions
 mo-0.018
 anx-0.017
 tenth-0.017
 maj-0.017
 accident-0.016
Token a
Token ID257
Feature activation+1.81e-02

Pos logit contributions
suit+0.301
mit+0.297
chard+0.296
 Labrador+0.292
 opener+0.286
Neg logit contributions
 aer-0.174
 accident-0.173
 anx-0.172
 tenth-0.172
 maj-0.172
Token suit
Token ID6050
Feature activation-2.15e-02

Pos logit contributions
suit+0.022
chard+0.022
 Labrador+0.021
mit+0.021
 opener+0.021
Neg logit contributions
 anx-0.021
 tenth-0.021
 maj-0.021
 accident-0.021
 mo-0.020
Token and
Token ID290
Feature activation+9.60e-03

Pos logit contributions
 anx+0.025
 tenth+0.025
 maj+0.024
und+0.024
 accident+0.024
Neg logit contributions
suit-0.027
chard-0.026
 Labrador-0.025
mit-0.025
 opener-0.024
Token a
Token ID257
Feature activation+1.87e-02

Pos logit contributions
suit+0.012
chard+0.012
 Labrador+0.011
mit+0.011
 opener+0.011
Neg logit contributions
 anx-0.011
 tenth-0.011
und-0.011
 maj-0.011
 accident-0.011
Token hat
Token ID6877
Feature activation+3.24e-03

Pos logit contributions
suit+0.021
chard+0.020
 Labrador+0.019
mit+0.018
 stall+0.018
Neg logit contributions
und-0.025
 anx-0.024
xc-0.024
 tenth-0.024
 accident-0.024
Token together
Token ID1978
Feature activation-7.27e-02

Pos logit contributions
suit+0.027
chard+0.027
 Labrador+0.026
mit+0.025
 opener+0.025
Neg logit contributions
 anx-0.019
 accident-0.019
und-0.019
 tenth-0.019
 maj-0.019
Token.
Token ID13
Feature activation-6.49e-03

Pos logit contributions
 anx+0.072
 accident+0.071
und+0.070
 tenth+0.069
 mo+0.068
Neg logit contributions
suit-0.105
chard-0.104
 Labrador-0.101
mit-0.096
 opener-0.096
Token<|endoftext|>
Token ID50256
Feature activation-4.89e-02

Pos logit contributions
 anx+0.007
und+0.007
 maj+0.007
 tenth+0.007
 accident+0.007
Neg logit contributions
suit-0.009
chard-0.009
 Labrador-0.008
mit-0.008
 opener-0.008
TokenOnce
Token ID7454
Feature activation+6.86e-03

Pos logit contributions
und+0.052
 maj+0.048
 anx+0.048
 accident+0.047
 tenth+0.047
Neg logit contributions
chard-0.069
suit-0.067
mit-0.065
 Labrador-0.065
 as-0.060
Token upon
Token ID2402
Feature activation+1.69e-02

Pos logit contributions
suit+0.008
chard+0.008
 Labrador+0.007
mit+0.007
 opener+0.007
Neg logit contributions
 anx-0.009
 tenth-0.009
und-0.009
 accident-0.009
 maj-0.009
Token a
Token ID257
Feature activation+1.07e-02

Pos logit contributions
chard+0.018
suit+0.017
 as+0.017
 opener+0.016
 Labrador+0.016
Neg logit contributions
und-0.023
 anx-0.022
 maj-0.022
xc-0.022
 tenth-0.022
Token time
Token ID640
Feature activation+3.57e-02

Pos logit contributions
suit+0.010
 Labrador+0.009
chard+0.009
 as+0.009
mit+0.009
Neg logit contributions
 tenth-0.016
 aer-0.015
und-0.015
xc-0.015
 mo-0.015
Token,
Token ID11
Feature activation+4.00e-03

Pos logit contributions
suit+0.048
chard+0.047
 Labrador+0.045
mit+0.044
 opener+0.043
Neg logit contributions
 anx-0.039
 maj-0.038
und-0.038
 tenth-0.038
 accident-0.037
Token there
Token ID612
Feature activation-1.06e-02

Pos logit contributions
chard+0.004
suit+0.004
 as+0.004
mit+0.003
 opener+0.003
Neg logit contributions
 tenth-0.006
unct-0.006
 aer-0.006
 anx-0.006
 mo-0.006
Token was
Token ID373
Feature activation+7.48e-03

Pos logit contributions
und+0.010
 accident+0.010
 anx+0.010
 maj+0.010
 tenth+0.010
Neg logit contributions
suit-0.016
chard-0.016
 Labrador-0.015
mit-0.015
 opener-0.014
Token a
Token ID257
Feature activation+1.03e-02

Pos logit contributions
suit+0.009
chard+0.009
 Labrador+0.009
mit+0.009
 opener+0.009
Neg logit contributions
 anx-0.009
 tenth-0.008
 maj-0.008
und-0.008
 accident-0.008
Token there
Token ID612
Feature activation-5.79e-03

Pos logit contributions
suit+0.068
 Labrador+0.064
chard+0.064
 opener+0.061
mit+0.061
Neg logit contributions
 anx-0.053
 tenth-0.053
 accident-0.052
 maj-0.052
und-0.051
Token more
Token ID517
Feature activation+1.79e-03

Pos logit contributions
 aer+0.005
 tenth+0.005
 accident+0.004
 maj+0.004
 many+0.004
Neg logit contributions
suit-0.010
 Labrador-0.010
mit-0.009
 opener-0.009
manship-0.009
Token food
Token ID2057
Feature activation+1.41e-01

Pos logit contributions
suit+0.002
chard+0.002
 Labrador+0.002
mit+0.002
 opener+0.002
Neg logit contributions
 accident-0.002
 anx-0.002
 mo-0.002
 maj-0.002
 tenth-0.002
Token available
Token ID1695
Feature activation+1.47e-01

Pos logit contributions
 Labrador+0.269
suit+0.261
mit+0.257
Same+0.246
asket+0.244
Neg logit contributions
 accident-0.078
 tenth-0.065
und-0.065
 mo-0.063
 woke-0.061
Token?"
Token ID1701
Feature activation-2.76e-02

Pos logit contributions
 Labrador+0.249
suit+0.249
mit+0.236
chard+0.234
 opener+0.223
Neg logit contributions
 accident-0.105
 anx-0.101
 tenth-0.100
 maj-0.099
 aer-0.096
Token
Token ID198
Feature activation+3.35e-02

Pos logit contributions
und+0.039
xc+0.039
 anx+0.038
 tenth+0.038
 accident+0.038
Neg logit contributions
suit-0.027
chard-0.027
 opener-0.027
 Labrador-0.026
 as-0.025
Token
Token ID198
Feature activation+3.37e-02

Pos logit contributions
suit+0.037
chard+0.036
 Labrador+0.035
 opener+0.034
mit+0.033
Neg logit contributions
 anx-0.043
 tenth-0.043
 accident-0.042
und-0.042
 maj-0.042
TokenThey
Token ID2990
Feature activation-1.63e-03

Pos logit contributions
suit+0.031
chard+0.030
 opener+0.029
 Labrador+0.028
 stall+0.028
Neg logit contributions
 anx-0.051
 accident-0.050
 tenth-0.050
und-0.049
 maj-0.049
Token look
Token ID804
Feature activation-7.22e-02

Pos logit contributions
und+0.002
xc+0.002
 maj+0.002
 accident+0.002
 tenth+0.002
Neg logit contributions
chard-0.002
suit-0.002
mit-0.002
 opener-0.002
 Labrador-0.002
Token at
Token ID379
Feature activation-7.76e-02

Pos logit contributions
 anx+0.074
 maj+0.071
 accident+0.069
 mo+0.069
 tenth+0.068
Neg logit contributions
suit-0.101
chard-0.101
 Labrador-0.100
 opener-0.098
mit-0.096
Token the
Token ID262
Feature activation+7.41e-03

Pos logit contributions
 anx+0.079
 tenth+0.079
 accident+0.079
 maj+0.079
und+0.077
Neg logit contributions
suit-0.109
chard-0.105
 Labrador-0.102
mit-0.100
 opener-0.099
Token please
Token ID3387
Feature activation+5.98e-02

Pos logit contributions
suit+0.016
chard+0.016
 Labrador+0.015
mit+0.015
 opener+0.014
Neg logit contributions
 accident-0.013
 tenth-0.013
 anx-0.013
 maj-0.013
 mo-0.013
Token."
Token ID526
Feature activation-5.71e-02

Pos logit contributions
suit+0.108
chard+0.105
 Labrador+0.104
mit+0.099
 opener+0.098
Neg logit contributions
 maj-0.040
 accident-0.039
 mo-0.036
 tenth-0.036
 anx-0.036
Token
Token ID198
Feature activation+2.31e-02

Pos logit contributions
 anx+0.064
 tenth+0.063
 maj+0.063
und+0.062
 accident+0.062
Neg logit contributions
suit-0.075
chard-0.074
 Labrador-0.070
mit-0.069
 opener-0.068
Token
Token ID198
Feature activation+2.31e-02

Pos logit contributions
suit+0.025
chard+0.024
 Labrador+0.023
 opener+0.023
mit+0.022
Neg logit contributions
 anx-0.030
 tenth-0.030
 accident-0.030
 maj-0.030
und-0.029
TokenHer
Token ID9360
Feature activation-2.36e-04

Pos logit contributions
suit+0.032
chard+0.032
 Labrador+0.030
mit+0.030
 opener+0.029
Neg logit contributions
 anx-0.024
 tenth-0.024
 accident-0.024
 maj-0.024
und-0.024
Token mom
Token ID1995
Feature activation+1.48e-02

Pos logit contributions
 anx+0.000
 accident+0.000
 tenth+0.000
 mo+0.000
 maj+0.000
Neg logit contributions
suit-0.000
chard-0.000
 Labrador-0.000
mit-0.000
 opener-0.000
Token said
Token ID531
Feature activation+7.49e-03

Pos logit contributions
chard+0.014
suit+0.014
 Labrador+0.014
 as+0.014
mit+0.013
Neg logit contributions
 tenth-0.021
und-0.021
 maj-0.020
 bru-0.020
xc-0.019
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
 as+0.007
chard+0.006
suit+0.006
mit+0.005
 opener+0.005
Neg logit contributions
und-0.013
 tenth-0.012
xc-0.012
unct-0.012
 anx-0.011
Token "
Token ID366
Feature activation+5.31e-02

Pos logit contributions
chard+0.009
suit+0.009
 as+0.009
 Labrador+0.008
 opener+0.008
Neg logit contributions
und-0.019
 tenth-0.019
xc-0.019
 accident-0.019
 anx-0.018
TokenThat
Token ID2504
Feature activation-2.30e-02

Pos logit contributions
suit+0.065
chard+0.064
 Labrador+0.061
 opener+0.059
mit+0.057
Neg logit contributions
 anx-0.066
 maj-0.063
und-0.063
 tenth-0.063
 mo-0.063
Token fruit
Token ID8234
Feature activation+8.83e-02

Pos logit contributions
 anx+0.028
und+0.027
 tenth+0.027
 maj+0.027
xc+0.026
Neg logit contributions
chard-0.027
suit-0.027
 Labrador-0.026
mit-0.026
 stall-0.025
Token was
Token ID373
Feature activation+1.76e-02

Pos logit contributions
suit+0.044
chard+0.042
mit+0.040
 opener+0.039
 as+0.039
Neg logit contributions
 tenth-0.052
und-0.051
 anx-0.050
 maj-0.050
xc-0.050
Token called
Token ID1444
Feature activation+1.25e-01

Pos logit contributions
suit+0.026
 Labrador+0.024
mit+0.024
chard+0.024
 opener+0.023
Neg logit contributions
 accident-0.018
 anx-0.018
 mo-0.017
 maj-0.017
 tenth-0.017
Token "
Token ID366
Feature activation+5.73e-02

Pos logit contributions
suit+0.165
chard+0.164
 Labrador+0.157
mit+0.152
 opener+0.150
Neg logit contributions
 anx-0.135
 maj-0.135
 tenth-0.132
 accident-0.132
und-0.129
Tokencham
Token ID49869
Feature activation+6.44e-03

Pos logit contributions
suit+0.074
chard+0.073
 Labrador+0.071
mit+0.069
 opener+0.066
Neg logit contributions
 tenth-0.063
und-0.063
 anx-0.062
 accident-0.062
 maj-0.061
Tokenom
Token ID296
Feature activation+1.53e-02

Pos logit contributions
chard+0.008
suit+0.008
 Labrador+0.008
 opener+0.007
mit+0.007
Neg logit contributions
und-0.007
 maj-0.007
 anx-0.007
 tenth-0.007
 accident-0.007
Tokenile
Token ID576
Feature activation+6.95e-02

Pos logit contributions
suit+0.019
chard+0.019
 Labrador+0.018
mit+0.017
 opener+0.017
Neg logit contributions
 anx-0.018
 tenth-0.018
 maj-0.017
und-0.017
 accident-0.017
Token"
Token ID1
Feature activation+1.28e-01

Pos logit contributions
chard+0.095
suit+0.095
 Labrador+0.091
 opener+0.088
mit+0.086
Neg logit contributions
und-0.071
 maj-0.071
 anx-0.070
 tenth-0.069
 accident-0.069
Token and
Token ID290
Feature activation+7.91e-03

Pos logit contributions
suit+0.155
chard+0.152
 Labrador+0.146
mit+0.144
 opener+0.139
Neg logit contributions
 anx-0.150
 tenth-0.149
und-0.149
 accident-0.148
 maj-0.147
Token it
Token ID340
Feature activation+2.41e-03

Pos logit contributions
suit+0.010
chard+0.010
 Labrador+0.009
mit+0.009
 opener+0.009
Neg logit contributions
 anx-0.009
 maj-0.009
 tenth-0.009
 accident-0.009
und-0.009
Token smelled
Token ID37609
Feature activation+2.62e-02

Pos logit contributions
suit+0.003
chard+0.003
 Labrador+0.003
mit+0.003
 opener+0.003
Neg logit contributions
 anx-0.003
 accident-0.003
und-0.003
 maj-0.002
 mo-0.002
Token very
Token ID845
Feature activation+1.09e-01

Pos logit contributions
suit+0.033
chard+0.033
 Labrador+0.031
mit+0.030
 opener+0.030
Neg logit contributions
 anx-0.030
 tenth-0.030
und-0.029
 maj-0.029
 accident-0.029
Token.
Token ID13
Feature activation-1.31e-03

Pos logit contributions
suit+0.014
 as+0.013
chard+0.013
 Labrador+0.013
mit+0.012
Neg logit contributions
 tenth-0.016
xc-0.016
 maj-0.016
und-0.016
 anx-0.016
Token She
Token ID1375
Feature activation-2.84e-02

Pos logit contributions
 anx+0.001
und+0.001
 maj+0.001
 tenth+0.001
 accident+0.001
Neg logit contributions
suit-0.002
chard-0.002
 Labrador-0.002
mit-0.002
 opener-0.002
Token knew
Token ID2993
Feature activation-5.59e-02

Pos logit contributions
 tenth+0.036
und+0.034
 maj+0.034
xc+0.034
 anx+0.033
Neg logit contributions
suit-0.033
chard-0.032
mit-0.031
 as-0.030
 Labrador-0.030
Token she
Token ID673
Feature activation+2.69e-02

Pos logit contributions
 tenth+0.073
 maj+0.072
und+0.071
xc+0.070
 anx+0.070
Neg logit contributions
chard-0.063
 as-0.060
mit-0.057
 Labrador-0.057
suit-0.056
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
suit+0.038
chard+0.038
 Labrador+0.037
 stall+0.036
mit+0.035
Neg logit contributions
 accident-0.027
 anx-0.027
 maj-0.025
 mo-0.025
und-0.025
Token ready
Token ID3492
Feature activation+6.14e-02

Pos logit contributions
suit+0.022
 Labrador+0.020
 opener+0.019
 stall+0.018
chard+0.018
Neg logit contributions
 accident-0.013
 anx-0.013
 mo-0.013
 maj-0.013
 tenth-0.012
Token to
Token ID284
Feature activation-2.72e-02

Pos logit contributions
suit+0.072
chard+0.071
 as+0.070
 opener+0.066
mit+0.066
Neg logit contributions
und-0.074
 tenth-0.074
xc-0.073
 maj-0.072
 anx-0.072
Token face
Token ID1986
Feature activation-1.27e-01

Pos logit contributions
xc+0.036
 maj+0.036
und+0.036
 tenth+0.035
 anx+0.035
Neg logit contributions
chard-0.028
 opener-0.027
 as-0.027
mit-0.027
suit-0.026
Token anything
Token ID1997
Feature activation+2.10e-02

Pos logit contributions
 anx+0.134
 accident+0.129
 maj+0.127
 tenth+0.126
 mo+0.124
Neg logit contributions
suit-0.181
chard-0.171
 Labrador-0.170
 opener-0.165
mit-0.163
Token!
Token ID0
Feature activation-2.21e-03

Pos logit contributions
suit+0.027
 Labrador+0.027
chard+0.026
mit+0.025
 opener+0.024
Neg logit contributions
 anx-0.024
 accident-0.024
 maj-0.023
 tenth-0.023
und-0.023
Token Lucy
Token ID22162
Feature activation+3.16e-03

Pos logit contributions
 anx+0.002
und+0.002
 maj+0.002
 tenth+0.002
 accident+0.002
Neg logit contributions
suit-0.003
chard-0.003
mit-0.003
 Labrador-0.003
 opener-0.003
Token here
Token ID994
Feature activation+9.68e-02

Pos logit contributions
suit+0.073
 as+0.072
chard+0.070
mit+0.067
 Labrador+0.067
Neg logit contributions
xc-0.072
 tenth-0.072
 anx-0.071
 bru-0.071
und-0.070
Token.
Token ID13
Feature activation+3.56e-03

Pos logit contributions
suit+0.144
chard+0.142
 Labrador+0.140
mit+0.134
 opener+0.132
Neg logit contributions
 anx-0.089
 maj-0.088
und-0.087
 accident-0.087
 tenth-0.087
Token Do
Token ID2141
Feature activation-4.41e-02

Pos logit contributions
suit+0.005
chard+0.005
 Labrador+0.005
mit+0.005
 opener+0.005
Neg logit contributions
 anx-0.003
 tenth-0.003
 maj-0.003
und-0.003
 accident-0.003
Token you
Token ID345
Feature activation-1.13e-02

Pos logit contributions
 anx+0.046
 tenth+0.046
und+0.046
 maj+0.046
 accident+0.045
Neg logit contributions
suit-0.060
chard-0.059
 Labrador-0.056
mit-0.055
 opener-0.054
Token want
Token ID765
Feature activation+6.73e-02

Pos logit contributions
 anx+0.015
 accident+0.014
 tenth+0.014
 maj+0.014
und+0.014
Neg logit contributions
suit-0.013
chard-0.013
 Labrador-0.012
mit-0.012
 opener-0.011
Token to
Token ID284
Feature activation-2.04e-02

Pos logit contributions
suit+0.089
chard+0.088
 Labrador+0.084
mit+0.082
 opener+0.082
Neg logit contributions
 anx-0.072
und-0.072
 tenth-0.071
 maj-0.071
 accident-0.070
Token meet
Token ID1826
Feature activation+6.16e-02

Pos logit contributions
 tenth+0.024
 anx+0.024
 accident+0.023
 maj+0.023
und+0.023
Neg logit contributions
suit-0.026
chard-0.025
 Labrador-0.024
mit-0.023
 opener-0.023
Token your
Token ID534
Feature activation-6.42e-02

Pos logit contributions
suit+0.084
chard+0.080
 Labrador+0.078
 opener+0.076
mit+0.076
Neg logit contributions
 anx-0.067
 maj-0.066
 tenth-0.066
 accident-0.065
und-0.065
Token new
Token ID649
Feature activation+2.93e-02

Pos logit contributions
 aer+0.070
 anx+0.070
 maj+0.069
 mo+0.069
 accident+0.069
Neg logit contributions
chard-0.089
suit-0.086
 opener-0.083
 Labrador-0.082
pper-0.078
Token friends
Token ID2460
Feature activation+2.40e-02

Pos logit contributions
chard+0.037
suit+0.036
 Labrador+0.034
 opener+0.034
mit+0.032
Neg logit contributions
 anx-0.034
 maj-0.034
 accident-0.034
 tenth-0.034
 aer-0.033
Token?"
Token ID1701
Feature activation-3.30e-02

Pos logit contributions
suit+0.042
 Labrador+0.041
chard+0.040
mit+0.039
 opener+0.038
Neg logit contributions
 accident-0.016
 tenth-0.015
 anx-0.015
 maj-0.015
 aer-0.015
Token "
Token ID366
Feature activation+4.69e-02

Pos logit contributions
chard+0.004
suit+0.004
 as+0.003
 opener+0.003
 Labrador+0.003
Neg logit contributions
und-0.007
xc-0.007
 tenth-0.007
 accident-0.007
 bru-0.007
TokenI
Token ID40
Feature activation+1.71e-02

Pos logit contributions
suit+0.050
chard+0.048
 Labrador+0.048
 opener+0.046
 Cookie+0.046
Neg logit contributions
 mo-0.062
 anx-0.062
 tenth-0.061
 accident-0.061
 bru-0.060
Token love
Token ID1842
Feature activation+3.79e-02

Pos logit contributions
suit+0.022
chard+0.022
 Labrador+0.021
mit+0.020
 opener+0.020
Neg logit contributions
 anx-0.019
 accident-0.019
 tenth-0.019
 maj-0.019
und-0.019
Token you
Token ID345
Feature activation-1.52e-02

Pos logit contributions
chard+0.046
suit+0.044
 as+0.043
 Labrador+0.043
mit+0.042
Neg logit contributions
und-0.045
 tenth-0.044
 anx-0.044
 maj-0.044
 accident-0.043
Token Julie
Token ID21946
Feature activation-9.30e-02

Pos logit contributions
 anx+0.019
 tenth+0.019
und+0.019
 maj+0.019
 accident+0.019
Neg logit contributions
suit-0.018
chard-0.017
 Labrador-0.016
mit-0.016
 opener-0.016
Token."
Token ID526
Feature activation-6.31e-02

Pos logit contributions
 accident+0.093
 anx+0.093
 tenth+0.092
 maj+0.091
 aer+0.089
Neg logit contributions
chard-0.130
suit-0.129
 Labrador-0.126
 opener-0.121
mit-0.119
Token
Token ID198
Feature activation+1.58e-02

Pos logit contributions
 accident+0.092
 bru+0.092
xc+0.090
 tenth+0.090
und+0.088
Neg logit contributions
 as-0.062
suit-0.061
 opener-0.060
mit-0.053
 Cookie-0.052
Token
Token ID198
Feature activation+1.60e-02

Pos logit contributions
 opener+0.010
suit+0.009
 as+0.008
 Coco+0.008
 stall+0.008
Neg logit contributions
 bru-0.028
 accident-0.028
xc-0.027
unct-0.026
itive-0.026
TokenJul
Token ID16980
Feature activation-1.59e-01

Pos logit contributions
 opener+0.019
suit+0.019
 as+0.018
mit+0.017
manship+0.017
Neg logit contributions
 bru-0.020
 accident-0.018
xc-0.017
und-0.017
unct-0.017
Tokenie
Token ID494
Feature activation-1.05e-01

Pos logit contributions
 bru+0.242
 tenth+0.230
und+0.218
 mo+0.215
Acc+0.211
Neg logit contributions
mit-0.152
 as-0.150
suit-0.144
kin-0.132
pper-0.127
Token smiled
Token ID13541
Feature activation-1.03e-02

Pos logit contributions
 tenth+0.135
und+0.132
 maj+0.127
 anx+0.125
xc+0.124
Neg logit contributions
suit-0.121
chard-0.116
mit-0.115
 Labrador-0.113
 as-0.110
Token to
Token ID284
Feature activation-2.69e-02

Pos logit contributions
chard+0.176
suit+0.172
mit+0.168
 Labrador+0.168
 stall+0.157
Neg logit contributions
 accident-0.101
 anx-0.098
 tenth-0.094
 maj-0.094
 aer-0.093
Token wear
Token ID5806
Feature activation+6.58e-02

Pos logit contributions
 accident+0.030
 aer+0.029
 anx+0.028
 mo+0.028
 tenth+0.028
Neg logit contributions
suit-0.037
chard-0.036
mit-0.034
 Labrador-0.034
keepers-0.034
Token.
Token ID13
Feature activation+1.35e-04

Pos logit contributions
chard+0.103
 Labrador+0.102
mit+0.101
suit+0.100
 stall+0.097
Neg logit contributions
 accident-0.058
 anx-0.055
 aer-0.054
und-0.053
 maj-0.053
Token It
Token ID632
Feature activation+3.05e-02

Pos logit contributions
mit+0.000
chard+0.000
suit+0.000
 Labrador+0.000
pper+0.000
Neg logit contributions
 maj-0.000
und-0.000
 anx-0.000
 accident-0.000
 tenth-0.000
Token was
Token ID373
Feature activation+1.12e-02

Pos logit contributions
suit+0.040
chard+0.039
 Labrador+0.038
mit+0.037
 opener+0.036
Neg logit contributions
 anx-0.034
 tenth-0.033
 accident-0.033
 maj-0.033
und-0.033
Token pink
Token ID11398
Feature activation-1.75e-03

Pos logit contributions
suit+0.018
mit+0.018
 Labrador+0.017
chard+0.017
 stall+0.017
Neg logit contributions
 accident-0.010
 aer-0.009
 mo-0.009
 anx-0.009
 maj-0.009
Token and
Token ID290
Feature activation+2.48e-03

Pos logit contributions
 accident+0.002
 anx+0.002
 tenth+0.002
und+0.002
 maj+0.002
Neg logit contributions
chard-0.002
suit-0.002
 Labrador-0.002
mit-0.002
 opener-0.002
Token had
Token ID550
Feature activation+1.10e-02

Pos logit contributions
suit+0.003
 Labrador+0.003
chard+0.003
mit+0.003
 stall+0.003
Neg logit contributions
 accident-0.002
 mo-0.002
 aer-0.002
 anx-0.002
 tenth-0.002
Token spark
Token ID9009
Feature activation-6.10e-02

Pos logit contributions
suit+0.012
 as+0.011
chard+0.011
 pads+0.011
 Labrador+0.011
Neg logit contributions
 anx-0.014
xc-0.014
und-0.014
Acc-0.014
unct-0.014
Tokenles
Token ID829
Feature activation+4.69e-02

Pos logit contributions
 anx+0.064
 accident+0.064
und+0.064
 maj+0.064
 aer+0.063
Neg logit contributions
chard-0.087
 Labrador-0.080
suit-0.079
 stall-0.077
 opener-0.077
Token on
Token ID319
Feature activation-8.69e-02

Pos logit contributions
chard+0.072
suit+0.072
 Labrador+0.070
mit+0.067
 opener+0.067
Neg logit contributions
 accident-0.040
 anx-0.039
und-0.039
 tenth-0.039
 maj-0.039
Token around
Token ID1088
Feature activation+7.77e-02

Pos logit contributions
 anx+0.064
 maj+0.063
und+0.063
 accident+0.063
 tenth+0.063
Neg logit contributions
suit-0.087
chard-0.085
 Labrador-0.083
mit-0.080
 opener-0.079
Token her
Token ID607
Feature activation+1.24e-02

Pos logit contributions
suit+0.090
chard+0.088
 Labrador+0.083
mit+0.082
 opener+0.081
Neg logit contributions
 tenth-0.096
 anx-0.094
 accident-0.093
und-0.093
 maj-0.093
Token,
Token ID11
Feature activation+4.65e-03

Pos logit contributions
chard+0.017
suit+0.017
 Labrador+0.017
 opener+0.016
mit+0.015
Neg logit contributions
 anx-0.013
 maj-0.013
und-0.013
 accident-0.013
 mo-0.013
Token the
Token ID262
Feature activation-1.35e-03

Pos logit contributions
chard+0.005
suit+0.005
 as+0.005
 Labrador+0.004
 Coco+0.004
Neg logit contributions
xc-0.006
und-0.006
 tenth-0.006
 anx-0.006
 maj-0.006
Token sky
Token ID6766
Feature activation+3.46e-02

Pos logit contributions
und+0.002
xc+0.002
 anx+0.002
 tenth+0.002
 maj+0.002
Neg logit contributions
chard-0.001
suit-0.001
mit-0.001
 stall-0.001
 as-0.001
Token was
Token ID373
Feature activation+9.61e-03

Pos logit contributions
chard+0.056
suit+0.056
 Labrador+0.054
mit+0.052
 stall+0.052
Neg logit contributions
 accident-0.030
 mo-0.029
 anx-0.029
und-0.029
 aer-0.028
Token turning
Token ID6225
Feature activation-5.12e-02

Pos logit contributions
suit+0.014
chard+0.013
 Labrador+0.013
mit+0.013
 opener+0.012
Neg logit contributions
 accident-0.010
 anx-0.010
 mo-0.010
 maj-0.010
 tenth-0.010
Token black
Token ID2042
Feature activation-2.06e-02

Pos logit contributions
 anx+0.055
 maj+0.054
 tenth+0.054
und+0.054
 accident+0.053
Neg logit contributions
suit-0.068
chard-0.068
 Labrador-0.066
mit-0.064
 opener-0.063
Token.
Token ID13
Feature activation-5.75e-03

Pos logit contributions
 tenth+0.026
 anx+0.026
 maj+0.025
und+0.025
xc+0.025
Neg logit contributions
suit-0.024
chard-0.023
 Labrador-0.022
 as-0.022
 opener-0.021
Token She
Token ID1375
Feature activation-3.52e-02

Pos logit contributions
 anx+0.006
 tenth+0.006
 accident+0.006
und+0.006
 maj+0.006
Neg logit contributions
suit-0.008
chard-0.008
 Labrador-0.007
mit-0.007
 opener-0.007
Token was
Token ID373
Feature activation+1.01e-02

Pos logit contributions
 tenth+0.041
und+0.040
 maj+0.039
 anx+0.039
 accident+0.039
Neg logit contributions
suit-0.044
chard-0.044
mit-0.041
 Labrador-0.041
 as-0.040
Token.
Token ID13
Feature activation-4.66e-03

Pos logit contributions
suit+0.042
chard+0.041
 Labrador+0.040
mit+0.039
 opener+0.038
Neg logit contributions
 anx-0.038
 tenth-0.038
 accident-0.037
und-0.037
 maj-0.037
Token It
Token ID632
Feature activation+2.86e-02

Pos logit contributions
 tenth+0.005
 anx+0.005
xc+0.005
 maj+0.005
unct+0.005
Neg logit contributions
suit-0.006
chard-0.006
 as-0.006
 opener-0.006
 Labrador-0.006
Token tried
Token ID3088
Feature activation+3.03e-02

Pos logit contributions
suit+0.036
chard+0.036
 Labrador+0.034
mit+0.033
 opener+0.033
Neg logit contributions
 tenth-0.033
 anx-0.033
 maj-0.032
und-0.032
 accident-0.032
Token to
Token ID284
Feature activation-2.99e-02

Pos logit contributions
suit+0.040
chard+0.040
 Labrador+0.038
mit+0.037
 opener+0.037
Neg logit contributions
 tenth-0.033
 anx-0.032
und-0.032
 maj-0.032
 accident-0.032
Token get
Token ID651
Feature activation+9.10e-02

Pos logit contributions
 maj+0.038
und+0.038
 anx+0.038
 tenth+0.038
xc+0.037
Neg logit contributions
chard-0.033
suit-0.032
 Labrador-0.032
 opener-0.031
 as-0.031
Token the
Token ID262
Feature activation-4.28e-03

Pos logit contributions
suit+0.138
 Labrador+0.133
mit+0.127
chard+0.123
 stall+0.122
Neg logit contributions
 accident-0.088
 anx-0.087
 mo-0.087
 maj-0.087
 bru-0.084
Token poison
Token ID8764
Feature activation+1.04e-01

Pos logit contributions
 accident+0.006
 tenth+0.005
 maj+0.005
 anx+0.005
 bru+0.005
Neg logit contributions
suit-0.005
chard-0.005
 Labrador-0.005
mit-0.005
 opener-0.004
Token out
Token ID503
Feature activation+1.40e-02

Pos logit contributions
chard+0.159
 Labrador+0.155
mit+0.152
suit+0.151
Same+0.143
Neg logit contributions
 accident-0.097
und-0.089
 anx-0.088
 aer-0.084
 mo-0.082
Token of
Token ID286
Feature activation+2.27e-03

Pos logit contributions
 as+0.015
suit+0.014
chard+0.013
 opener+0.013
mit+0.012
Neg logit contributions
 tenth-0.020
xc-0.020
 maj-0.020
 anx-0.019
Acc-0.019
Token the
Token ID262
Feature activation-4.02e-03

Pos logit contributions
suit+0.004
mit+0.004
 Labrador+0.004
chard+0.004
 opener+0.003
Neg logit contributions
 accident-0.002
und-0.002
 anx-0.002
 maj-0.002
 bru-0.002
Token lock
Token ID5793
Feature activation+3.03e-02

Pos logit contributions
 anx+0.005
 tenth+0.005
und+0.005
 maj+0.005
 accident+0.005
Neg logit contributions
suit-0.005
chard-0.005
 Labrador-0.004
mit-0.004
 opener-0.004
TokenO
Token ID46
Feature activation-5.01e-02

Pos logit contributions
suit+0.159
chard+0.156
 opener+0.154
 Labrador+0.150
 Cookie+0.141
Neg logit contributions
 anx-0.153
 maj-0.150
 aer-0.146
und-0.146
 tenth-0.143
Tokenw
Token ID86
Feature activation-1.31e-01

Pos logit contributions
 anx+0.055
 maj+0.050
 aer+0.049
xc+0.049
 mo+0.048
Neg logit contributions
suit-0.071
 opener-0.066
pper-0.065
chard-0.063
mit-0.062
Token!
Token ID0
Feature activation+7.42e-03

Pos logit contributions
 tenth+0.147
 anx+0.147
 maj+0.144
und+0.142
 accident+0.142
Neg logit contributions
suit-0.170
chard-0.163
 Labrador-0.159
mit-0.157
 as-0.152
Token Ow
Token ID11960
Feature activation-1.88e-01

Pos logit contributions
 opener+0.010
suit+0.010
 Labrador+0.010
 Cookie+0.010
mit+0.010
Neg logit contributions
 anx-0.008
 tenth-0.007
unct-0.007
 aer-0.007
xc-0.007
Token!
Token ID0
Feature activation+7.36e-03

Pos logit contributions
 anx+0.212
 tenth+0.210
 maj+0.208
 accident+0.202
Acc+0.201
Neg logit contributions
suit-0.241
chard-0.228
 Labrador-0.221
mit-0.220
 opener-0.217
Token Ow
Token ID11960
Feature activation-1.88e-01

Pos logit contributions
 opener+0.010
 Cookie+0.010
suit+0.010
 Labrador+0.009
 Coco+0.009
Neg logit contributions
 anx-0.008
unct-0.007
itive-0.007
 tenth-0.007
 aer-0.007
Token!"
Token ID2474
Feature activation-8.03e-02

Pos logit contributions
 anx+0.205
 tenth+0.202
 maj+0.200
 accident+0.194
Acc+0.194
Neg logit contributions
suit-0.249
chard-0.234
 Labrador-0.227
mit-0.227
 as-0.225
Token he
Token ID339
Feature activation-1.03e-02

Pos logit contributions
xc+0.114
Acc+0.110
und+0.109
 maj+0.109
 tenth+0.108
Neg logit contributions
 as-0.081
 opener-0.078
chard-0.076
suit-0.075
 Coco-0.072
Token cries
Token ID24691
Feature activation-1.34e-01

Pos logit contributions
 anx+0.008
 tenth+0.008
 accident+0.008
 maj+0.008
und+0.008
Neg logit contributions
suit-0.017
chard-0.017
 Labrador-0.016
mit-0.016
 opener-0.015
Token.
Token ID13
Feature activation+9.96e-03

Pos logit contributions
und+0.216
 tenth+0.209
 maj+0.205
xc+0.201
Acc+0.197
Neg logit contributions
 as-0.128
 opener-0.113
suit-0.113
 Labrador-0.101
chard-0.099
Token He
Token ID679
Feature activation-1.23e-02

Pos logit contributions
suit+0.009
chard+0.009
 opener+0.009
 as+0.008
 Labrador+0.008
Neg logit contributions
xc-0.015
 tenth-0.015
unct-0.015
 aer-0.015
und-0.015
Token had
Token ID550
Feature activation+8.35e-03

Pos logit contributions
 accident+0.122
 anx+0.122
 mo+0.120
und+0.119
 maj+0.117
Neg logit contributions
suit-0.153
chard-0.149
 Labrador-0.148
mit-0.141
 opener-0.138
Token changed
Token ID3421
Feature activation+6.20e-02

Pos logit contributions
suit+0.011
chard+0.010
 Labrador+0.010
mit+0.010
 opener+0.010
Neg logit contributions
 anx-0.010
 tenth-0.009
 accident-0.009
 maj-0.009
und-0.009
Token.
Token ID13
Feature activation-5.26e-03

Pos logit contributions
suit+0.080
chard+0.079
 Labrador+0.076
mit+0.073
 opener+0.072
Neg logit contributions
 anx-0.070
und-0.068
 accident-0.068
 maj-0.068
 tenth-0.068
Token It
Token ID632
Feature activation+2.96e-02

Pos logit contributions
 tenth+0.006
und+0.006
 anx+0.006
 aer+0.006
xc+0.006
Neg logit contributions
suit-0.006
chard-0.006
 Labrador-0.006
 opener-0.006
mit-0.006
Token was
Token ID373
Feature activation+9.09e-03

Pos logit contributions
suit+0.042
chard+0.041
 Labrador+0.040
mit+0.039
 stall+0.038
Neg logit contributions
 anx-0.031
 accident-0.030
 mo-0.029
und-0.029
 maj-0.029
Token no
Token ID645
Feature activation-1.78e-01

Pos logit contributions
suit+0.015
 Labrador+0.014
mit+0.014
manship+0.013
chard+0.013
Neg logit contributions
 accident-0.009
 anx-0.008
 mo-0.008
 maj-0.008
 aer-0.007
Token longer
Token ID2392
Feature activation-8.22e-02

Pos logit contributions
xc+0.254
und+0.250
Acc+0.239
 tenth+0.233
 anx+0.230
Neg logit contributions
 as-0.194
chard-0.173
mit-0.171
suit-0.171
 opener-0.170
Token orange
Token ID10912
Feature activation+2.95e-02

Pos logit contributions
 anx+0.098
 tenth+0.098
und+0.097
 maj+0.096
 accident+0.095
Neg logit contributions
suit-0.099
chard-0.098
 Labrador-0.092
mit-0.091
 opener-0.089
Token but
Token ID475
Feature activation-7.76e-02

Pos logit contributions
suit+0.037
chard+0.036
 Labrador+0.035
mit+0.034
 opener+0.033
Neg logit contributions
 anx-0.034
 tenth-0.034
 maj-0.034
und-0.034
 accident-0.034
Token gray
Token ID12768
Feature activation-4.16e-03

Pos logit contributions
 anx+0.073
 accident+0.071
 maj+0.066
 mo+0.065
 tenth+0.060
Neg logit contributions
suit-0.125
 Labrador-0.120
chard-0.116
 stall-0.113
pper-0.111
Token.
Token ID13
Feature activation-4.63e-03

Pos logit contributions
 accident+0.005
 anx+0.005
und+0.004
 maj+0.004
 tenth+0.004
Neg logit contributions
suit-0.005
chard-0.005
 Labrador-0.005
mit-0.005
 opener-0.005
Token had
Token ID550
Feature activation+1.05e-02

Pos logit contributions
suit+0.011
chard+0.011
 Labrador+0.010
mit+0.010
 opener+0.010
Neg logit contributions
 anx-0.010
 tenth-0.010
 accident-0.010
und-0.010
 maj-0.010
Token to
Token ID284
Feature activation-2.78e-02

Pos logit contributions
suit+0.014
chard+0.014
 Labrador+0.014
mit+0.013
 opener+0.013
Neg logit contributions
 anx-0.011
 tenth-0.011
und-0.011
 maj-0.011
 accident-0.011
Token help
Token ID1037
Feature activation-3.78e-02

Pos logit contributions
 maj+0.036
 tenth+0.035
 anx+0.035
und+0.035
Acc+0.035
Neg logit contributions
 Labrador-0.029
suit-0.029
chard-0.029
mit-0.029
 as-0.028
Token.
Token ID13
Feature activation-2.75e-03

Pos logit contributions
 anx+0.040
 accident+0.039
und+0.038
 tenth+0.037
 mo+0.037
Neg logit contributions
suit-0.054
chard-0.052
 Labrador-0.050
mit-0.049
 opener-0.047
Token They
Token ID1119
Feature activation-1.80e-02

Pos logit contributions
und+0.003
 anx+0.003
 maj+0.003
 tenth+0.003
 accident+0.003
Neg logit contributions
chard-0.004
suit-0.004
 Labrador-0.004
mit-0.004
 opener-0.004
Token sw
Token ID1509
Feature activation-2.19e-01

Pos logit contributions
 tenth+0.020
 maj+0.020
und+0.020
 anx+0.020
 accident+0.019
Neg logit contributions
suit-0.023
chard-0.023
 Labrador-0.022
mit-0.022
 opener-0.021
Tokenam
Token ID321
Feature activation-3.00e-02

Pos logit contributions
 anx+0.219
 tenth+0.216
und+0.216
 maj+0.215
 accident+0.214
Neg logit contributions
suit-0.310
chard-0.307
 Labrador-0.293
mit-0.287
 opener-0.285
Token over
Token ID625
Feature activation+1.03e-01

Pos logit contributions
 anx+0.038
 tenth+0.037
 maj+0.037
und+0.037
 accident+0.037
Neg logit contributions
suit-0.035
chard-0.034
 Labrador-0.033
mit-0.032
 opener-0.031
Token and
Token ID290
Feature activation-2.02e-03

Pos logit contributions
suit+0.120
chard+0.118
 as+0.113
 Labrador+0.112
 opener+0.112
Neg logit contributions
 tenth-0.130
 maj-0.126
 anx-0.124
 accident-0.124
und-0.124
Token pushed
Token ID7121
Feature activation-6.98e-03

Pos logit contributions
 tenth+0.002
 anx+0.002
 maj+0.002
und+0.002
 accident+0.002
Neg logit contributions
suit-0.003
chard-0.003
 Labrador-0.002
mit-0.002
 opener-0.002
Token the
Token ID262
Feature activation-5.24e-04

Pos logit contributions
 tenth+0.009
 anx+0.008
 maj+0.008
 accident+0.008
und+0.008
Neg logit contributions
suit-0.008
chard-0.008
 Labrador-0.008
mit-0.008
 as-0.008
Token more
Token ID517
Feature activation-1.97e-03

Pos logit contributions
 accident+0.088
 anx+0.088
 aer+0.088
 mo+0.087
 tenth+0.087
Neg logit contributions
suit-0.107
chard-0.103
 Labrador-0.102
mit-0.098
 opener-0.091
Token apples
Token ID22514
Feature activation+6.76e-02

Pos logit contributions
 accident+0.002
 maj+0.002
und+0.002
 mo+0.002
 anx+0.002
Neg logit contributions
suit-0.003
chard-0.003
 Labrador-0.003
mit-0.002
 Cookie-0.002
Token to
Token ID284
Feature activation-2.23e-02

Pos logit contributions
 Labrador+0.122
suit+0.112
mit+0.108
chard+0.108
Same+0.107
Neg logit contributions
 accident-0.047
,"-0.046
 tenth-0.046
und-0.045
 aer-0.044
Token the
Token ID262
Feature activation+3.03e-03

Pos logit contributions
 tenth+0.024
 anx+0.024
 accident+0.024
und+0.024
 maj+0.023
Neg logit contributions
suit-0.030
chard-0.029
 Labrador-0.028
mit-0.028
 opener-0.027
Token barrel
Token ID9036
Feature activation+2.03e-02

Pos logit contributions
chard+0.003
suit+0.003
 stall+0.003
 Labrador+0.003
mit+0.003
Neg logit contributions
und-0.004
xc-0.004
 maj-0.004
 anx-0.004
 tenth-0.004
Token,"
Token ID553
Feature activation-1.91e-01

Pos logit contributions
suit+0.030
chard+0.030
 Labrador+0.028
 opener+0.028
mit+0.028
Neg logit contributions
 anx-0.019
 tenth-0.018
 maj-0.018
und-0.018
 accident-0.018
Token dad
Token ID9955
Feature activation+8.57e-02

Pos logit contributions
und+0.328
xc+0.303
unct+0.294
 maj+0.292
Acc+0.287
Neg logit contributions
chard-0.172
 stall-0.161
 Coco-0.156
 as-0.155
 Cookie-0.152
Token said
Token ID531
Feature activation+1.22e-02

Pos logit contributions
suit+0.117
chard+0.116
 Labrador+0.111
mit+0.108
 stall+0.103
Neg logit contributions
 anx-0.092
 accident-0.090
 tenth-0.089
und-0.086
 maj-0.086
Token.
Token ID13
Feature activation+7.36e-03

Pos logit contributions
suit+0.012
chard+0.012
 as+0.012
 opener+0.012
mit+0.011
Neg logit contributions
und-0.017
 tenth-0.017
 maj-0.017
 aer-0.016
 mo-0.016
Token
Token ID198
Feature activation+2.77e-02

Pos logit contributions
chard+0.011
suit+0.011
mit+0.011
 Labrador+0.010
 opener+0.009
Neg logit contributions
 anx-0.007
und-0.007
 maj-0.007
 tenth-0.007
 accident-0.007
Token
Token ID198
Feature activation+2.78e-02

Pos logit contributions
suit+0.029
chard+0.028
 Labrador+0.027
 opener+0.027
 stall+0.026
Neg logit contributions
 anx-0.037
 tenth-0.036
 accident-0.036
 maj-0.036
und-0.036
Token the
Token ID262
Feature activation-5.59e-03

Pos logit contributions
chard+0.085
suit+0.084
 as+0.083
 stall+0.078
 Labrador+0.077
Neg logit contributions
 tenth-0.096
xc-0.095
und-0.092
 bru-0.092
 maj-0.091
Token forest
Token ID8222
Feature activation-4.28e-02

Pos logit contributions
 maj+0.006
 accident+0.006
 anx+0.005
 tenth+0.005
 aer+0.005
Neg logit contributions
suit-0.008
chard-0.008
mit-0.008
 opener-0.008
 Labrador-0.007
Token,
Token ID11
Feature activation+2.18e-03

Pos logit contributions
 tenth+0.051
xc+0.051
 anx+0.051
 maj+0.050
und+0.050
Neg logit contributions
suit-0.051
chard-0.050
 as-0.047
mit-0.047
 Labrador-0.046
Token and
Token ID290
Feature activation-2.03e-03

Pos logit contributions
suit+0.003
chard+0.003
 Labrador+0.003
mit+0.003
 opener+0.003
Neg logit contributions
 tenth-0.002
 anx-0.002
und-0.002
 maj-0.002
 accident-0.002
Token they
Token ID484
Feature activation+5.67e-03

Pos logit contributions
 anx+0.002
 tenth+0.002
 maj+0.002
und+0.002
 accident+0.002
Neg logit contributions
suit-0.003
chard-0.003
 Labrador-0.003
mit-0.002
 opener-0.002
Token never
Token ID1239
Feature activation-1.62e-01

Pos logit contributions
suit+0.007
chard+0.007
mit+0.007
 Labrador+0.007
 opener+0.007
Neg logit contributions
 tenth-0.006
 anx-0.006
 maj-0.006
und-0.006
 accident-0.006
Token felt
Token ID2936
Feature activation-6.87e-02

Pos logit contributions
 anx+0.182
 accident+0.181
 tenth+0.180
und+0.180
 maj+0.179
Neg logit contributions
suit-0.207
chard-0.203
 Labrador-0.197
mit-0.189
 opener-0.189
Token lonely
Token ID21757
Feature activation-6.02e-03

Pos logit contributions
 accident+0.067
 anx+0.066
 bru+0.062
 mo+0.062
 maj+0.061
Neg logit contributions
 Labrador-0.101
suit-0.098
 opener-0.094
 stall-0.092
chard-0.092
Token again
Token ID757
Feature activation-1.70e-01

Pos logit contributions
 anx+0.004
 accident+0.004
 tenth+0.004
und+0.004
 maj+0.004
Neg logit contributions
suit-0.010
chard-0.010
 Labrador-0.010
mit-0.009
 opener-0.009
Token.
Token ID13
Feature activation-7.82e-03

Pos logit contributions
 accident+0.169
und+0.168
 anx+0.168
 tenth+0.158
 aer+0.158
Neg logit contributions
suit-0.248
chard-0.240
 Labrador-0.236
mit-0.228
 opener-0.227
Token
Token ID198
Feature activation+1.64e-02

Pos logit contributions
und+0.007
 anx+0.007
 maj+0.007
 tenth+0.007
 accident+0.007
Neg logit contributions
suit-0.012
chard-0.012
mit-0.011
 Labrador-0.011
 opener-0.011