TOP ACTIVATIONS
MAX = 0.434

cried , dropping the magn ifying glass . "
examine it with the magn ifying glass ?" "
She took out a magn ifying glass and looked very carefully
. Lily had a magn ifying glass that she found in
borrowed her mom 's magn ifying glass to take a closer
and handed him the magn ifying glass . He
. They had a magn ifying glass and a small box
the necklace and the magn ifying glass and told him that
He pointed his magn ifying glass at different classmates and
She had a big magn ifying glass that made things look
said , holding the magn ifying glass over a black ant
hat and grabbed her magn ifying glass . She was ready
a flashlight and a magn ifying glass to examine their finds
When Daisy put the magn ifying glass down , the cater
flashlight , and a magn ifying glass . As
She takes out a magn ifying glass . She looks at
He took out a magn ifying glass and a notebook and
it she had a magn ifying glass , a tiny torch
, handing him the magn ifying glass . She looked around
the watch with the magn ifying glass . He saw the

MIN ACTIVATIONS
MIN = -0.450

grandma 's house for safe keeping . When
took the bird to the vet and they fixed its wing
to the vet . The vet gave the puppy a license
took the puppy to the vet . The vet gave the
visit . The vet said that Spot was fine
and took it to the vet . The vet helped the
to the vet . The vet helped the bird and it
to the vet . The vet was a nice person who
She took Max to the vet to get an x -
had to go to the vet . The vet was a
and carried it to the vet . The vet was able
news . They thanked the vet and took Matt home .
to the vet . The vet was able to fix the
take Mitt ens to the vet . The vet can manage
take the bird to the vet and help it get better
owner took him to the vet for an operation . Max
, took him to the vet . The vet said that
who helped animals . The vet looked at Luna 's cut
The end <|endoftext|> The vet was a nice lady .
took Mitt ens to the vet and the vet gave Mitt

INTERVAL 0.213 - 0.434
CONTAINS 1.329%

the park allowed him to watch the graceful bird while he
silly , running around and laughing . Suddenly , one of
Mom my has some pink sprink les in the kitchen .
asked . " Yes , of course !" she
Sam 's arm and pulling him back . "

INTERVAL -0.008 - 0.213
CONTAINS 54.336%

how fast he could run . The old dog tried his
ily and Sam are twins . They like to play outside
. She taught them the language for many weeks until they
R over was so happy he had found the path
taught him how to play . Tim my was very happy

INTERVAL -0.229 - -0.008
CONTAINS 43.266%

. Suddenly he noticed a key lying on the ground nearby
useful , so she said , " Yes , I can
brave little boy . He lived near a mighty lake .
p outed and crossed her arms . She did not like
, her mom my had a special treat for her ;

INTERVAL -0.450 - -0.229
CONTAINS 1.070%

night when he came back home , he would still find
surprised ! He was so used to being angry , but
's mom figured out a way to get the net open
day , the telephone was used again . This time it
They also hoped to return home soon and end their holiday
Token cried
Token ID16896
Feature activation-1.65e-01

Pos logit contributions
 traps+0.127
 vet+0.127
keeping+0.126
 dam+0.125
ooks+0.116
Neg logit contributions
ifying-0.267
ay-0.244
 la-0.243
ksh-0.242
ahoo-0.241
Token,
Token ID11
Feature activation+9.03e-04

Pos logit contributions
 vet+0.231
 traps+0.222
 dam+0.220
keeping+0.209
ooks+0.207
Neg logit contributions
ifying-0.293
ahoo-0.261
 la-0.261
ay-0.255
very-0.253
Token dropping
Token ID12047
Feature activation+6.47e-02

Pos logit contributions
ifying+0.001
ay+0.001
ahoo+0.001
ksh+0.001
very+0.001
Neg logit contributions
 vet-0.001
 traps-0.001
 dam-0.001
keeping-0.001
 hospital-0.001
Token the
Token ID262
Feature activation+2.24e-02

Pos logit contributions
ifying+0.096
ay+0.085
 g+0.084
 tr+0.084
 la+0.083
Neg logit contributions
keeping-0.103
 dam-0.103
 vet-0.102
 traps-0.102
ooks-0.099
Token magn
Token ID7842
Feature activation+6.19e-02

Pos logit contributions
ifying+0.043
ksh+0.039
 la+0.039
ay+0.039
ahoo+0.039
Neg logit contributions
 traps-0.027
 vet-0.026
 dam-0.026
keeping-0.026
ooks-0.024
Tokenifying
Token ID4035
Feature activation+4.34e-01

Pos logit contributions
 g+0.021
very+0.020
 la+0.019
 Ryan+0.019
ahoo+0.018
Neg logit contributions
keeping-0.158
inger-0.156
 traps-0.156
 hole-0.155
orns-0.154
Token glass
Token ID5405
Feature activation-1.00e-01

Pos logit contributions
ifying+0.671
ay+0.618
 g+0.535
 tr+0.501
 screen+0.474
Neg logit contributions
igi-0.787
 conflicts-0.771
aches-0.749
keeping-0.747
ooks-0.730
Token.
Token ID13
Feature activation+3.71e-02

Pos logit contributions
 vet+0.158
 traps+0.157
 dam+0.156
ooks+0.156
keeping+0.155
Neg logit contributions
ifying-0.154
ay-0.137
 tr-0.134
 g-0.128
 la-0.127
Token
Token ID198
Feature activation-1.33e-02

Pos logit contributions
ifying+0.054
ksh+0.047
ay+0.046
ahoo+0.046
 la+0.046
Neg logit contributions
 vet-0.062
 traps-0.060
keeping-0.060
 dam-0.060
ooks-0.058
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
 traps+0.021
keeping+0.021
 dam+0.021
 vet+0.020
orns+0.020
Neg logit contributions
ifying-0.020
ay-0.018
 la-0.018
 tr-0.018
ahoo-0.018
Token"
Token ID1
Feature activation-3.70e-02

Pos logit contributions
 vet+0.018
 traps+0.017
keeping+0.017
 dam+0.017
ooks+0.017
Neg logit contributions
ifying-0.024
ksh-0.022
 la-0.022
ay-0.022
ahoo-0.022
Token examine
Token ID10716
Feature activation+3.11e-02

Pos logit contributions
ifying+0.107
ay+0.097
 g+0.095
 tr+0.094
 la+0.094
Neg logit contributions
keeping-0.125
 traps-0.123
 dam-0.117
orns-0.117
 conflicts-0.116
Token it
Token ID340
Feature activation-4.66e-02

Pos logit contributions
ifying+0.052
ay+0.047
 la+0.047
 tr+0.046
ksh+0.046
Neg logit contributions
keeping-0.043
 vet-0.043
 dam-0.043
 traps-0.043
ooks-0.041
Token with
Token ID351
Feature activation-2.31e-02

Pos logit contributions
keeping+0.073
 traps+0.067
ooks+0.067
 dam+0.066
 vet+0.065
Neg logit contributions
ifying-0.074
ay-0.070
 yes-0.069
 tr-0.069
 la-0.069
Token the
Token ID262
Feature activation+2.01e-02

Pos logit contributions
 traps+0.037
 vet+0.037
keeping+0.037
 dam+0.037
ooks+0.035
Neg logit contributions
ifying-0.034
ay-0.030
ksh-0.030
 la-0.030
ahoo-0.029
Token magn
Token ID7842
Feature activation+5.88e-02

Pos logit contributions
ifying+0.033
ksh+0.030
ahoo+0.029
ay+0.028
very+0.028
Neg logit contributions
 vet-0.031
 traps-0.031
 dam-0.030
 antidote-0.029
 hole-0.029
Tokenifying
Token ID4035
Feature activation+4.31e-01

Pos logit contributions
 g+0.017
ahoo+0.013
 tr+0.013
Jake+0.013
Nice+0.012
Neg logit contributions
keeping-0.152
orns-0.150
 vet-0.148
inger-0.148
 traps-0.148
Token glass
Token ID5405
Feature activation-1.04e-01

Pos logit contributions
ifying+0.694
ay+0.637
 g+0.592
 tr+0.563
 ding+0.526
Neg logit contributions
keeping-0.733
igi-0.719
ooks-0.697
 conflicts-0.695
 traps-0.689
Token?"
Token ID1701
Feature activation-1.22e-01

Pos logit contributions
ooks+0.157
 traps+0.155
 dam+0.151
 vet+0.149
keeping+0.148
Neg logit contributions
ifying-0.168
ay-0.163
 tr-0.151
very-0.147
 la-0.146
Token
Token ID198
Feature activation-2.02e-02

Pos logit contributions
keeping+0.243
 traps+0.236
 antidote+0.224
inger+0.223
 dam+0.221
Neg logit contributions
 �-0.137
 Ryan-0.133
ay-0.132
 g-0.131
 tr-0.130
Token
Token ID198
Feature activation-2.03e-02

Pos logit contributions
keeping+0.037
 traps+0.036
 dam+0.035
 vet+0.034
orns+0.034
Neg logit contributions
ifying-0.026
ay-0.026
 �-0.023
 tr-0.023
very-0.023
Token"
Token ID1
Feature activation-4.30e-02

Pos logit contributions
 traps+0.033
keeping+0.033
 dam+0.031
 vet+0.031
orns+0.031
Neg logit contributions
ifying-0.031
ay-0.028
 tr-0.028
ksh-0.027
 la-0.027
Token She
Token ID1375
Feature activation-5.18e-02

Pos logit contributions
 �+0.034
ifying+0.033
ay+0.032
 Ryan+0.031
 la+0.030
Neg logit contributions
 traps-0.051
 conflicts-0.050
 killed-0.050
 dam-0.050
aches-0.049
Token took
Token ID1718
Feature activation+2.70e-02

Pos logit contributions
keeping+0.088
ooks+0.083
 traps+0.083
igi+0.081
 die+0.081
Neg logit contributions
 tr-0.076
 �-0.066
ifying-0.066
 jugg-0.066
ahoo-0.066
Token out
Token ID503
Feature activation+1.60e-02

Pos logit contributions
ifying+0.039
ay+0.036
 g+0.036
 tr+0.035
 la+0.035
Neg logit contributions
keeping-0.043
 vet-0.043
 dam-0.043
 traps-0.042
ooks-0.042
Token a
Token ID257
Feature activation-1.26e-03

Pos logit contributions
ifying+0.025
ay+0.022
 tr+0.022
 la+0.022
 g+0.021
Neg logit contributions
keeping-0.025
 dam-0.025
 vet-0.025
 traps-0.024
 bidder-0.024
Token magn
Token ID7842
Feature activation+6.00e-02

Pos logit contributions
 dam+0.002
 vet+0.002
 traps+0.002
 hole+0.002
 jail+0.002
Neg logit contributions
ifying-0.002
ksh-0.002
ahoo-0.002
 la-0.002
ay-0.002
Tokenifying
Token ID4035
Feature activation+4.31e-01

Pos logit contributions
 la+0.028
very+0.025
ksh+0.025
 g+0.025
 Ple+0.023
Neg logit contributions
inger-0.143
keeping-0.142
ooks-0.140
 hole-0.138
 traps-0.137
Token glass
Token ID5405
Feature activation-1.05e-01

Pos logit contributions
ifying+0.703
ay+0.638
 g+0.579
 tr+0.571
very+0.523
Neg logit contributions
keeping-0.716
igi-0.703
 conflicts-0.679
ooks-0.667
 traps-0.663
Token and
Token ID290
Feature activation-2.14e-02

Pos logit contributions
keeping+0.157
 traps+0.157
 vet+0.155
 dam+0.154
ooks+0.152
Neg logit contributions
ifying-0.165
ay-0.153
 tr-0.148
 g-0.143
 �-0.141
Token looked
Token ID3114
Feature activation+1.01e-01

Pos logit contributions
igi+0.039
orns+0.037
ooks+0.037
inger+0.037
keeping+0.037
Neg logit contributions
 tr-0.028
 g-0.026
ifying-0.026
 �-0.025
 yes-0.024
Token very
Token ID845
Feature activation+6.98e-02

Pos logit contributions
ifying+0.142
ksh+0.125
ay+0.125
 la+0.124
ahoo+0.124
Neg logit contributions
 traps-0.168
 dam-0.166
 vet-0.165
keeping-0.164
ooks-0.159
Token carefully
Token ID7773
Feature activation+1.92e-02

Pos logit contributions
ifying+0.094
ay+0.082
 la+0.082
 tr+0.081
 g+0.081
Neg logit contributions
keeping-0.122
 vet-0.121
 traps-0.120
 dam-0.118
orns-0.115
Token.
Token ID13
Feature activation+2.99e-02

Pos logit contributions
 traps+0.049
keeping+0.049
 vet+0.048
 dam+0.048
orns+0.046
Neg logit contributions
ifying-0.065
ksh-0.060
 la-0.060
ahoo-0.060
ay-0.058
Token Lily
Token ID20037
Feature activation-1.35e-01

Pos logit contributions
ifying+0.039
 �+0.037
ksh+0.036
ay+0.036
 Ryan+0.034
Neg logit contributions
keeping-0.055
 traps-0.053
 killed-0.053
 conflicts-0.050
 vet-0.050
Token had
Token ID550
Feature activation+3.91e-02

Pos logit contributions
keeping+0.242
 traps+0.237
 dam+0.237
 vet+0.233
ooks+0.229
Neg logit contributions
 tr-0.173
ifying-0.160
 la-0.158
 g-0.157
 Ryan-0.156
Token a
Token ID257
Feature activation-1.46e-03

Pos logit contributions
ifying+0.051
 tr+0.047
 Eric+0.046
 Ryan+0.045
 g+0.045
Neg logit contributions
 dam-0.069
keeping-0.067
igi-0.066
ooks-0.065
 traps-0.065
Token magn
Token ID7842
Feature activation+5.66e-02

Pos logit contributions
 vet+0.002
 traps+0.002
 dam+0.002
keeping+0.002
ooks+0.002
Neg logit contributions
ifying-0.002
ksh-0.002
ay-0.002
ahoo-0.002
 la-0.002
Tokenifying
Token ID4035
Feature activation+4.30e-01

Pos logit contributions
 g+0.024
ksh+0.022
very+0.021
 la+0.020
ahoo+0.020
Neg logit contributions
inger-0.137
keeping-0.136
orns-0.136
ooks-0.134
gy-0.134
Token glass
Token ID5405
Feature activation-1.07e-01

Pos logit contributions
ifying+0.701
ay+0.646
 g+0.612
 tr+0.588
 la+0.548
Neg logit contributions
keeping-0.729
igi-0.679
 conflicts-0.673
 traps-0.662
ooks-0.654
Token that
Token ID326
Feature activation+4.25e-02

Pos logit contributions
 traps+0.155
keeping+0.155
 vet+0.154
 dam+0.154
ooks+0.147
Neg logit contributions
ifying-0.173
ay-0.155
 tr-0.152
 la-0.152
ksh-0.150
Token she
Token ID673
Feature activation-6.66e-02

Pos logit contributions
ifying+0.062
 tr+0.056
ay+0.056
 g+0.055
 la+0.055
Neg logit contributions
keeping-0.067
 vet-0.067
 dam-0.067
 traps-0.067
ooks-0.064
Token found
Token ID1043
Feature activation-1.92e-01

Pos logit contributions
 dam+0.111
keeping+0.109
 vet+0.109
 traps+0.108
ooks+0.106
Neg logit contributions
ifying-0.091
 tr-0.087
 la-0.085
 g-0.083
ay-0.082
Token in
Token ID287
Feature activation+1.37e-01

Pos logit contributions
 vet+0.307
 traps+0.307
keeping+0.306
 dam+0.306
ooks+0.292
Neg logit contributions
ifying-0.281
ay-0.248
 la-0.246
ksh-0.244
 tr-0.244
Token borrowed
Token ID22546
Feature activation-6.20e-02

Pos logit contributions
 traps+0.074
 vet+0.074
keeping+0.074
 dam+0.074
ooks+0.070
Neg logit contributions
ifying-0.085
ay-0.075
 la-0.075
ksh-0.075
ahoo-0.074
Token her
Token ID607
Feature activation-6.64e-03

Pos logit contributions
keeping+0.109
 dam+0.107
ooks+0.104
 vet+0.104
 bidder+0.104
Neg logit contributions
ifying-0.082
 g-0.078
ay-0.076
 �-0.074
 tr-0.074
Token mom
Token ID1995
Feature activation+2.80e-02

Pos logit contributions
keeping+0.011
ooks+0.011
 vet+0.010
 bidder+0.010
 dam+0.010
Neg logit contributions
ifying-0.010
 tr-0.010
 g-0.009
ay-0.009
ahoo-0.009
Token's
Token ID338
Feature activation+6.71e-02

Pos logit contributions
ifying+0.044
 la+0.039
ksh+0.039
ay+0.039
ahoo+0.039
Neg logit contributions
 traps-0.042
keeping-0.042
 vet-0.042
 dam-0.041
orns-0.040
Token magn
Token ID7842
Feature activation+5.98e-02

Pos logit contributions
ifying+0.110
 tr+0.102
 g+0.100
ay+0.099
ahoo+0.097
Neg logit contributions
keeping-0.093
 dam-0.092
ooks-0.092
 vet-0.092
igi-0.091
Tokenifying
Token ID4035
Feature activation+4.30e-01

Pos logit contributions
very+0.024
 ding+0.024
 parade+0.024
 la+0.024
 movies+0.023
Neg logit contributions
keeping-0.147
inger-0.145
orns-0.144
 hole-0.141
ooks-0.141
Token glass
Token ID5405
Feature activation-1.04e-01

Pos logit contributions
ifying+0.690
ay+0.626
 g+0.584
 tr+0.569
 tut+0.517
Neg logit contributions
keeping-0.730
igi-0.715
 conflicts-0.690
limits-0.685
 traps-0.664
Token to
Token ID284
Feature activation+2.10e-02

Pos logit contributions
 vet+0.166
 dam+0.164
 traps+0.164
ooks+0.162
keeping+0.161
Neg logit contributions
ifying-0.156
ay-0.144
 tr-0.141
 g-0.135
 �-0.130
Token take
Token ID1011
Feature activation+6.19e-02

Pos logit contributions
ifying+0.030
 g+0.027
 la+0.027
ay+0.027
 tr+0.026
Neg logit contributions
keeping-0.035
 dam-0.033
 vet-0.033
ooks-0.033
 traps-0.033
Token a
Token ID257
Feature activation-2.54e-03

Pos logit contributions
ifying+0.081
ay+0.079
 g+0.077
 �+0.073
 la+0.072
Neg logit contributions
 dam-0.105
 vet-0.104
ooks-0.102
keeping-0.101
igi-0.101
Token closer
Token ID5699
Feature activation+1.18e-02

Pos logit contributions
 dam+0.004
 vet+0.004
 traps+0.004
 hole+0.004
 jail+0.003
Neg logit contributions
ifying-0.004
ksh-0.004
ahoo-0.004
 Ryan-0.004
ay-0.004
Token and
Token ID290
Feature activation-8.42e-03

Pos logit contributions
ifying+0.699
ksh+0.616
 la+0.602
ahoo+0.597
ay+0.591
Neg logit contributions
 vet-0.455
 traps-0.443
keeping-0.435
 dam-0.430
 hospital-0.412
Token handed
Token ID10158
Feature activation+4.67e-02

Pos logit contributions
 dam+0.015
keeping+0.015
orns+0.015
 traps+0.015
ooks+0.015
Neg logit contributions
 Ryan-0.010
 yes-0.010
 tr-0.010
 g-0.010
 �-0.010
Token him
Token ID683
Feature activation+2.53e-01

Pos logit contributions
ifying+0.073
ksh+0.065
ahoo+0.064
 la+0.063
ay+0.061
Neg logit contributions
 traps-0.074
 vet-0.073
keeping-0.072
orns-0.070
 dam-0.068
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
ifying+0.412
ay+0.367
 la+0.365
 tr+0.364
ksh+0.363
Neg logit contributions
keeping-0.366
 vet-0.365
 traps-0.364
 dam-0.364
ooks-0.347
Token magn
Token ID7842
Feature activation+5.76e-02

Pos logit contributions
ifying+0.033
ay+0.031
 tr+0.030
 la+0.030
 g+0.029
Neg logit contributions
keeping-0.025
 dam-0.025
ooks-0.025
 vet-0.025
 traps-0.024
Tokenifying
Token ID4035
Feature activation+4.29e-01

Pos logit contributions
 g+0.022
ahoo+0.021
ksh+0.021
 la+0.020
 tr+0.020
Neg logit contributions
keeping-0.145
orns-0.142
 traps-0.141
inger-0.141
 dam-0.140
Token glass
Token ID5405
Feature activation-1.07e-01

Pos logit contributions
ifying+0.685
ay+0.624
 g+0.539
 tr+0.514
 ding+0.482
Neg logit contributions
igi-0.750
keeping-0.728
limits-0.707
ooks-0.705
 traps-0.701
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
 traps+0.180
 vet+0.180
 dam+0.179
ooks+0.177
keeping+0.174
Neg logit contributions
ifying-0.149
ay-0.136
 tr-0.133
 g-0.128
 �-0.124
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
ifying+0.057
ay+0.050
 la+0.050
ksh+0.050
 tr+0.049
Neg logit contributions
 traps-0.067
keeping-0.067
 vet-0.066
 dam-0.066
ooks-0.063
Token
Token ID198
Feature activation-1.32e-02

Pos logit contributions
 traps+0.020
 dam+0.020
keeping+0.020
 vet+0.020
 bidder+0.019
Neg logit contributions
ifying-0.018
ay-0.017
 la-0.016
 Ryan-0.016
 tr-0.016
TokenHe
Token ID1544
Feature activation+2.97e-02

Pos logit contributions
 vet+0.025
 traps+0.024
keeping+0.024
 dam+0.024
ooks+0.024
Neg logit contributions
ifying-0.016
ksh-0.014
ahoo-0.014
ay-0.014
 la-0.014
Token.
Token ID13
Feature activation+2.48e-02

Pos logit contributions
 vet+0.207
 traps+0.207
keeping+0.206
 dam+0.206
ooks+0.200
Neg logit contributions
ifying-0.186
 la-0.163
ay-0.163
 tr-0.163
ksh-0.160
Token They
Token ID1119
Feature activation-8.38e-02

Pos logit contributions
ifying+0.032
 �+0.032
 tr+0.030
ksh+0.029
ay+0.029
Neg logit contributions
keeping-0.045
 traps-0.045
inger-0.042
 killed-0.042
 vet-0.042
Token had
Token ID550
Feature activation+3.39e-02

Pos logit contributions
 vet+0.119
 traps+0.116
 dam+0.115
keeping+0.113
 killed+0.110
Neg logit contributions
ifying-0.142
ay-0.129
ksh-0.129
ahoo-0.125
 la-0.124
Token a
Token ID257
Feature activation-6.08e-03

Pos logit contributions
ifying+0.051
 tr+0.046
 la+0.045
 g+0.045
ksh+0.045
Neg logit contributions
keeping-0.053
 dam-0.052
 vet-0.052
 traps-0.052
ooks-0.050
Token magn
Token ID7842
Feature activation+5.57e-02

Pos logit contributions
 vet+0.009
 dam+0.009
 hole+0.009
 traps+0.009
 shelter+0.008
Neg logit contributions
ifying-0.010
ksh-0.010
ahoo-0.009
ay-0.009
 la-0.009
Tokenifying
Token ID4035
Feature activation+4.28e-01

Pos logit contributions
very+0.021
 g+0.020
ksh+0.020
 ding+0.018
 �+0.017
Neg logit contributions
inger-0.137
orns-0.136
ooks-0.135
igi-0.134
 dam-0.134
Token glass
Token ID5405
Feature activation-1.07e-01

Pos logit contributions
ifying+0.702
ay+0.632
 g+0.632
 tr+0.603
 la+0.559
Neg logit contributions
keeping-0.708
igi-0.656
 traps-0.642
ooks-0.635
 conflicts-0.629
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
 traps+0.160
keeping+0.159
 vet+0.157
 dam+0.156
ooks+0.152
Neg logit contributions
ifying-0.172
ay-0.155
 tr-0.153
 la-0.149
 g-0.147
Token a
Token ID257
Feature activation-5.63e-03

Pos logit contributions
igi+0.045
inger+0.045
 dam+0.044
orns+0.044
 bidder+0.043
Neg logit contributions
ifying-0.033
 tr-0.030
 g-0.029
 yes-0.029
 Ryan-0.027
Token small
Token ID1402
Feature activation+2.18e-02

Pos logit contributions
 vet+0.008
 dam+0.008
 hole+0.008
 traps+0.008
 nest+0.007
Neg logit contributions
ifying-0.009
ksh-0.009
ahoo-0.009
ay-0.008
 Ryan-0.008
Token box
Token ID3091
Feature activation-8.02e-03

Pos logit contributions
ifying+0.037
ay+0.033
 la+0.033
ksh+0.033
 tr+0.033
Neg logit contributions
 traps-0.030
keeping-0.030
 vet-0.030
 dam-0.030
ooks-0.028
Token the
Token ID262
Feature activation+1.93e-02

Pos logit contributions
ifying+0.356
 tr+0.316
 la+0.312
ay+0.312
 g+0.310
Neg logit contributions
keeping-0.415
 dam-0.411
 vet-0.408
 traps-0.407
 bidder-0.395
Token necklace
Token ID38987
Feature activation-1.56e-01

Pos logit contributions
ifying+0.030
 tr+0.028
ay+0.028
 yes+0.028
 la+0.028
Neg logit contributions
ooks-0.029
keeping-0.029
 dam-0.028
igi-0.028
 traps-0.027
Token and
Token ID290
Feature activation-8.74e-03

Pos logit contributions
 dam+0.258
ooks+0.252
 vet+0.252
keeping+0.252
 traps+0.252
Neg logit contributions
ifying-0.217
ay-0.202
 tr-0.200
 g-0.193
 �-0.191
Token the
Token ID262
Feature activation+1.86e-02

Pos logit contributions
 dam+0.017
ooks+0.017
inger+0.017
keeping+0.017
igi+0.016
Neg logit contributions
 Ryan-0.010
 tr-0.010
 g-0.009
 Eric-0.009
 Jake-0.009
Token magn
Token ID7842
Feature activation+5.48e-02

Pos logit contributions
ifying+0.032
 la+0.029
 tr+0.029
ay+0.029
 g+0.029
Neg logit contributions
keeping-0.025
 dam-0.025
 traps-0.024
ooks-0.024
 vet-0.024
Tokenifying
Token ID4035
Feature activation+4.26e-01

Pos logit contributions
 g+0.022
ksh+0.021
 tr+0.021
ahoo+0.020
very+0.019
Neg logit contributions
keeping-0.138
orns-0.135
inger-0.134
 traps-0.134
 vet-0.132
Token glass
Token ID5405
Feature activation-1.09e-01

Pos logit contributions
ifying+0.684
ay+0.595
 g+0.531
 tr+0.507
 ding+0.485
Neg logit contributions
igi-0.741
keeping-0.721
ooks-0.707
 conflicts-0.705
 traps-0.696
Token and
Token ID290
Feature activation-8.98e-03

Pos logit contributions
 vet+0.181
 traps+0.180
 dam+0.179
keeping+0.177
ooks+0.177
Neg logit contributions
ifying-0.155
ay-0.141
 tr-0.137
 g-0.133
 la-0.130
Token told
Token ID1297
Feature activation-1.64e-01

Pos logit contributions
igi+0.017
ooks+0.017
inger+0.017
 dam+0.017
age+0.016
Neg logit contributions
 tr-0.011
 g-0.010
 Ryan-0.010
ifying-0.010
 �-0.010
Token him
Token ID683
Feature activation+2.51e-01

Pos logit contributions
 traps+0.267
 vet+0.256
 hospital+0.249
keeping+0.239
ooks+0.234
Neg logit contributions
ifying-0.275
ksh-0.239
ahoo-0.230
ay-0.223
ding-0.216
Token that
Token ID326
Feature activation+3.52e-02

Pos logit contributions
ifying+0.350
 tr+0.315
 g+0.313
 la+0.309
 Ryan+0.306
Neg logit contributions
 dam-0.415
 vet-0.411
keeping-0.411
 traps-0.404
 bidder-0.397
Token
Token ID198
Feature activation-1.55e-02

Pos logit contributions
 dam+0.027
keeping+0.027
 traps+0.027
 vet+0.026
 antidote+0.026
Neg logit contributions
ifying-0.020
ay-0.020
Jake-0.019
 �-0.019
 la-0.019
TokenHe
Token ID1544
Feature activation+2.82e-02

Pos logit contributions
 dam+0.031
 traps+0.030
keeping+0.030
 vet+0.030
orns+0.029
Neg logit contributions
ifying-0.017
 la-0.015
ay-0.015
 tr-0.014
ahoo-0.014
Token pointed
Token ID6235
Feature activation+1.70e-01

Pos logit contributions
ifying+0.046
ay+0.041
ksh+0.041
 la+0.040
ahoo+0.040
Neg logit contributions
 vet-0.042
 traps-0.041
 dam-0.040
keeping-0.040
 killed-0.039
Token his
Token ID465
Feature activation+7.54e-03

Pos logit contributions
ifying+0.296
ksh+0.280
 la+0.266
ahoo+0.266
very+0.257
Neg logit contributions
 traps-0.231
 vet-0.228
 hole-0.213
ooks-0.212
 dam-0.212
Token magn
Token ID7842
Feature activation+5.38e-02

Pos logit contributions
ksh+0.012
ifying+0.012
ahoo+0.012
Nice+0.011
very+0.011
Neg logit contributions
 traps-0.012
 hole-0.011
 dam-0.010
 antidote-0.010
 vet-0.010
Tokenifying
Token ID4035
Feature activation+4.26e-01

Pos logit contributions
 la+0.025
 parade+0.023
 g+0.022
Nice+0.022
very+0.022
Neg logit contributions
keeping-0.128
inger-0.127
 hole-0.125
 infected-0.125
gy-0.124
Token glass
Token ID5405
Feature activation-1.09e-01

Pos logit contributions
ifying+0.696
ay+0.645
 g+0.587
 tr+0.565
AY+0.515
Neg logit contributions
igi-0.701
keeping-0.691
 conflicts-0.674
 traps-0.671
 dam-0.657
Token at
Token ID379
Feature activation+1.93e-01

Pos logit contributions
 traps+0.152
 vet+0.151
keeping+0.151
 dam+0.150
ooks+0.142
Neg logit contributions
ifying-0.184
 la-0.165
ay-0.165
ksh-0.164
ahoo-0.163
Token different
Token ID1180
Feature activation+3.67e-02

Pos logit contributions
ifying+0.275
ay+0.245
 la+0.239
 tr+0.239
ksh+0.236
Neg logit contributions
 vet-0.318
 dam-0.317
 traps-0.316
keeping-0.314
ooks-0.304
Token classmates
Token ID28999
Feature activation+6.63e-03

Pos logit contributions
ifying+0.057
 tr+0.053
ay+0.052
 g+0.052
 la+0.051
Neg logit contributions
 vet-0.056
 dam-0.055
keeping-0.054
 traps-0.053
igi-0.053
Token and
Token ID290
Feature activation-1.55e-02

Pos logit contributions
ifying+0.013
ksh+0.012
ahoo+0.012
ay+0.012
 la+0.011
Neg logit contributions
 traps-0.008
keeping-0.008
 vet-0.007
 dam-0.007
 jail-0.007
Token She
Token ID1375
Feature activation-4.87e-02

Pos logit contributions
ifying+0.035
ksh+0.031
ay+0.031
 la+0.031
 tr+0.031
Neg logit contributions
keeping-0.036
 traps-0.036
 dam-0.035
 vet-0.034
 killed-0.033
Token had
Token ID550
Feature activation+3.33e-02

Pos logit contributions
keeping+0.086
 traps+0.085
 dam+0.083
 vet+0.083
ooks+0.080
Neg logit contributions
ifying-0.064
 tr-0.058
 la-0.058
 g-0.056
ahoo-0.056
Token a
Token ID257
Feature activation-7.78e-03

Pos logit contributions
ifying+0.049
 tr+0.045
 la+0.044
 g+0.044
ahoo+0.043
Neg logit contributions
 dam-0.052
keeping-0.052
 traps-0.052
 vet-0.052
ooks-0.050
Token big
Token ID1263
Feature activation+1.18e-01

Pos logit contributions
 vet+0.013
 dam+0.012
 hole+0.012
 traps+0.012
 hospital+0.011
Neg logit contributions
ifying-0.012
ksh-0.012
ahoo-0.011
ay-0.011
 la-0.010
Token magn
Token ID7842
Feature activation+5.27e-02

Pos logit contributions
ifying+0.183
ksh+0.162
ay+0.156
ahoo+0.155
 la+0.154
Neg logit contributions
 dam-0.186
 vet-0.186
 traps-0.185
keeping-0.179
 bidder-0.175
Tokenifying
Token ID4035
Feature activation+4.24e-01

Pos logit contributions
ksh+0.026
 g+0.025
very+0.025
Jake+0.025
 Ple+0.024
Neg logit contributions
inger-0.124
orns-0.120
ooks-0.118
igi-0.117
gy-0.116
Token glass
Token ID5405
Feature activation-1.11e-01

Pos logit contributions
ifying+0.718
ay+0.654
 g+0.632
 tr+0.608
 la+0.584
Neg logit contributions
keeping-0.664
 traps-0.619
igi-0.617
ooks-0.606
 conflicts-0.604
Token that
Token ID326
Feature activation+3.42e-02

Pos logit contributions
 traps+0.166
keeping+0.165
 vet+0.164
 dam+0.163
ooks+0.158
Neg logit contributions
ifying-0.176
ay-0.159
 tr-0.156
 la-0.153
 g-0.151
Token made
Token ID925
Feature activation+8.36e-03

Pos logit contributions
ifying+0.047
 tr+0.043
 g+0.043
ay+0.042
 la+0.042
Neg logit contributions
keeping-0.057
 dam-0.056
 vet-0.056
 traps-0.056
ooks-0.054
Token things
Token ID1243
Feature activation-1.52e-01

Pos logit contributions
ifying+0.013
 la+0.012
ksh+0.012
ay+0.012
ahoo+0.012
Neg logit contributions
 traps-0.012
 vet-0.012
keeping-0.012
 dam-0.012
ooks-0.012
Token look
Token ID804
Feature activation+7.82e-02

Pos logit contributions
 traps+0.233
 dam+0.230
 vet+0.229
keeping+0.228
ooks+0.217
Neg logit contributions
ifying-0.233
ksh-0.211
 la-0.209
ahoo-0.207
ay-0.206
Token said
Token ID531
Feature activation-5.44e-03

Pos logit contributions
 traps+0.239
keeping+0.238
 dam+0.236
 vet+0.234
orns+0.229
Neg logit contributions
ifying-0.191
 tr-0.185
 g-0.182
 la-0.179
ksh-0.177
Token,
Token ID11
Feature activation-1.27e-02

Pos logit contributions
keeping+0.006
 vet+0.006
 dam+0.006
 traps+0.006
ooks+0.006
Neg logit contributions
ifying-0.011
ahoo-0.010
ksh-0.009
ay-0.009
 la-0.009
Token holding
Token ID4769
Feature activation+1.25e-02

Pos logit contributions
 vet+0.019
 dam+0.019
 traps+0.018
 wound+0.018
keeping+0.018
Neg logit contributions
ifying-0.021
ay-0.021
ahoo-0.019
ksh-0.019
rav-0.018
Token the
Token ID262
Feature activation+1.21e-02

Pos logit contributions
ifying+0.024
ahoo+0.022
 la+0.022
ksh+0.022
very+0.022
Neg logit contributions
 traps-0.015
 vet-0.014
 hole-0.013
 dam-0.013
keeping-0.013
Token magn
Token ID7842
Feature activation+5.21e-02

Pos logit contributions
ifying+0.023
ksh+0.022
 la+0.021
ay+0.021
ahoo+0.021
Neg logit contributions
 traps-0.015
 vet-0.014
 dam-0.014
keeping-0.014
 hole-0.014
Tokenifying
Token ID4035
Feature activation+4.24e-01

Pos logit contributions
very+0.019
 Ryan+0.019
 g+0.018
 la+0.018
 apr+0.018
Neg logit contributions
inger-0.128
 hole-0.127
keeping-0.127
orns-0.125
 traps-0.125
Token glass
Token ID5405
Feature activation-1.11e-01

Pos logit contributions
ifying+0.684
ay+0.618
 g+0.566
 tr+0.536
 ding+0.500
Neg logit contributions
keeping-0.720
igi-0.714
 conflicts-0.697
limits-0.669
ooks-0.669
Token over
Token ID625
Feature activation+2.52e-02

Pos logit contributions
 vet+0.172
 traps+0.172
keeping+0.172
 dam+0.171
ooks+0.165
Neg logit contributions
ifying-0.170
ay-0.150
 tr-0.148
 la-0.145
ksh-0.145
Token a
Token ID257
Feature activation-7.87e-03

Pos logit contributions
ifying+0.048
ksh+0.047
ahoo+0.046
 la+0.045
ay+0.044
Neg logit contributions
keeping-0.031
 vet-0.030
 traps-0.029
ooks-0.028
 pits-0.027
Token black
Token ID2042
Feature activation+1.90e-01

Pos logit contributions
 vet+0.009
 hole+0.009
 traps+0.009
 dam+0.009
 bidder+0.008
Neg logit contributions
ifying-0.015
ksh-0.014
ahoo-0.014
 Ryan-0.014
ay-0.014
Token ant
Token ID1885
Feature activation-8.17e-02

Pos logit contributions
ifying+0.343
ksh+0.328
ahoo+0.315
 Ryan+0.314
 la+0.313
Neg logit contributions
 vet-0.248
keeping-0.240
 hole-0.236
 traps-0.233
 dam-0.231
Token hat
Token ID6877
Feature activation+6.45e-02

Pos logit contributions
 traps+0.014
 vet+0.014
 dam+0.013
 hole+0.013
 hospital+0.013
Neg logit contributions
ifying-0.013
ahoo-0.013
ksh-0.012
ay-0.012
 Eric-0.012
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
ifying+0.104
ay+0.094
 la+0.092
 tr+0.092
ksh+0.091
Neg logit contributions
 traps-0.095
 dam-0.094
keeping-0.094
 vet-0.093
ooks-0.091
Token grabbed
Token ID13176
Feature activation+4.64e-02

Pos logit contributions
 dam+0.033
ooks+0.033
keeping+0.032
inger+0.032
igi+0.031
Neg logit contributions
 tr-0.022
ifying-0.022
 g-0.021
 la-0.020
 Ryan-0.020
Token her
Token ID607
Feature activation-8.09e-03

Pos logit contributions
ifying+0.081
ahoo+0.074
ksh+0.073
 la+0.073
ay+0.072
Neg logit contributions
 traps-0.062
 vet-0.062
keeping-0.060
 dam-0.059
orns-0.057
Token magn
Token ID7842
Feature activation+4.96e-02

Pos logit contributions
 traps+0.012
 vet+0.012
 dam+0.011
 hole+0.011
 antidote+0.011
Neg logit contributions
ifying-0.013
ahoo-0.012
ksh-0.012
very-0.012
 la-0.011
Tokenifying
Token ID4035
Feature activation+4.21e-01

Pos logit contributions
 la+0.018
very+0.017
 g+0.017
 parade+0.015
ahoo+0.015
Neg logit contributions
keeping-0.127
inger-0.124
 infected-0.120
orns-0.120
gy-0.120
Token glass
Token ID5405
Feature activation-1.13e-01

Pos logit contributions
ifying+0.647
ay+0.604
 g+0.548
 tr+0.514
AY+0.476
Neg logit contributions
igi-0.725
keeping-0.720
 conflicts-0.717
 traps-0.707
orns-0.688
Token.
Token ID13
Feature activation+3.48e-02

Pos logit contributions
 traps+0.187
 dam+0.186
 vet+0.185
ooks+0.182
keeping+0.181
Neg logit contributions
ifying-0.165
ay-0.149
 tr-0.144
 g-0.139
ksh-0.137
Token She
Token ID1375
Feature activation-4.87e-02

Pos logit contributions
ifying+0.046
ay+0.043
 �+0.043
 la+0.042
 Ryan+0.042
Neg logit contributions
 dam-0.062
 traps-0.062
keeping-0.061
 conflicts-0.058
 killed-0.058
Token was
Token ID373
Feature activation-3.86e-02

Pos logit contributions
keeping+0.091
ooks+0.089
 dam+0.089
 traps+0.087
orns+0.086
Neg logit contributions
 tr-0.062
 g-0.059
ifying-0.057
 jugg-0.055
 yes-0.055
Token ready
Token ID3492
Feature activation+1.15e-01

Pos logit contributions
keeping+0.072
 dam+0.071
 traps+0.070
orns+0.070
 vet+0.070
Neg logit contributions
ifying-0.048
 g-0.044
 la-0.043
 tr-0.041
 yes-0.041
Token a
Token ID257
Feature activation-1.30e-02

Pos logit contributions
 tr+0.033
ifying+0.033
 g+0.032
 yes+0.030
 Eric+0.029
Neg logit contributions
 dam-0.048
 vet-0.047
keeping-0.047
igi-0.047
inger-0.047
Token flashlight
Token ID38371
Feature activation+1.79e-02

Pos logit contributions
 vet+0.021
 traps+0.021
 dam+0.021
keeping+0.021
ooks+0.020
Neg logit contributions
ifying-0.019
ksh-0.017
ay-0.016
ahoo-0.016
 la-0.016
Token and
Token ID290
Feature activation-3.13e-02

Pos logit contributions
ifying+0.031
ksh+0.029
ahoo+0.028
 la+0.028
ay+0.027
Neg logit contributions
 vet-0.024
 traps-0.024
 dam-0.024
keeping-0.023
 hospital-0.022
Token a
Token ID257
Feature activation-1.27e-02

Pos logit contributions
igi+0.060
inger+0.059
 dam+0.058
orns+0.058
ooks+0.058
Neg logit contributions
ifying-0.038
 tr-0.034
 yes-0.034
 g-0.034
 Ryan-0.032
Token magn
Token ID7842
Feature activation+5.08e-02

Pos logit contributions
 vet+0.019
 dam+0.019
 traps+0.019
 hole+0.018
keeping+0.017
Neg logit contributions
ifying-0.021
ksh-0.019
ahoo-0.019
ay-0.019
 la-0.018
Tokenifying
Token ID4035
Feature activation+4.21e-01

Pos logit contributions
 g+0.021
very+0.021
ksh+0.020
ahoo+0.018
 la+0.018
Neg logit contributions
inger-0.124
orns-0.124
keeping-0.122
ooks-0.121
 dam-0.121
Token glass
Token ID5405
Feature activation-1.14e-01

Pos logit contributions
ifying+0.718
ay+0.638
 g+0.614
 tr+0.608
 la+0.561
Neg logit contributions
keeping-0.673
igi-0.622
ooks-0.618
 conflicts-0.617
 traps-0.616
Token to
Token ID284
Feature activation+1.14e-02

Pos logit contributions
 traps+0.176
 vet+0.175
keeping+0.175
 dam+0.174
ooks+0.168
Neg logit contributions
ifying-0.175
ay-0.156
 tr-0.154
 la-0.152
ksh-0.150
Token examine
Token ID10716
Feature activation+2.01e-02

Pos logit contributions
ifying+0.016
 g+0.015
 tr+0.015
ay+0.015
 la+0.014
Neg logit contributions
keeping-0.019
 vet-0.018
 dam-0.018
 traps-0.018
ooks-0.018
Token their
Token ID511
Feature activation+6.80e-03

Pos logit contributions
ifying+0.030
ay+0.027
 tr+0.026
ksh+0.026
 la+0.026
Neg logit contributions
 vet-0.032
 dam-0.032
keeping-0.031
 traps-0.031
ooks-0.030
Token finds
Token ID7228
Feature activation-2.71e-01

Pos logit contributions
ifying+0.013
ksh+0.012
 la+0.012
ahoo+0.012
ay+0.012
Neg logit contributions
 traps-0.008
keeping-0.008
 dam-0.008
 vet-0.008
 hole-0.007
Token When
Token ID1649
Feature activation-1.60e-02

Pos logit contributions
ifying+0.095
 la+0.084
ay+0.084
ksh+0.083
 tr+0.082
Neg logit contributions
 traps-0.093
keeping-0.092
 vet-0.092
 dam-0.092
ooks-0.088
Token Daisy
Token ID40355
Feature activation-5.68e-02

Pos logit contributions
 traps+0.024
 vet+0.024
keeping+0.024
 dam+0.023
ooks+0.023
Neg logit contributions
ifying-0.025
 la-0.023
ay-0.023
 tr-0.022
ahoo-0.022
Token put
Token ID1234
Feature activation-6.44e-02

Pos logit contributions
keeping+0.088
 traps+0.088
 dam+0.088
 vet+0.087
ooks+0.085
Neg logit contributions
ifying-0.086
 la-0.077
 tr-0.077
ay-0.076
 g-0.075
Token the
Token ID262
Feature activation-9.66e-04

Pos logit contributions
 vet+0.102
keeping+0.102
 dam+0.102
 traps+0.102
ooks+0.098
Neg logit contributions
ifying-0.097
ay-0.086
 la-0.085
 tr-0.084
ksh-0.083
Token magn
Token ID7842
Feature activation+4.99e-02

Pos logit contributions
 vet+0.001
 traps+0.001
 hole+0.001
 dam+0.001
 hospital+0.001
Neg logit contributions
ifying-0.002
ksh-0.002
ahoo-0.002
Nice-0.002
 la-0.001
Tokenifying
Token ID4035
Feature activation+4.20e-01

Pos logit contributions
ahoo+0.018
ksh+0.018
 la+0.017
 g+0.017
Nice+0.017
Neg logit contributions
keeping-0.122
inger-0.118
 traps-0.118
orns-0.118
 hole-0.117
Token glass
Token ID5405
Feature activation-1.21e-01

Pos logit contributions
ifying+0.724
ay+0.635
 g+0.590
 tr+0.589
ksh+0.565
Neg logit contributions
 traps-0.646
keeping-0.639
 conflicts-0.630
 vet-0.620
 dam-0.615
Token down
Token ID866
Feature activation+5.61e-02

Pos logit contributions
 vet+0.204
 traps+0.201
 dam+0.199
ooks+0.198
keeping+0.195
Neg logit contributions
ifying-0.177
ay-0.163
 tr-0.158
 g-0.154
very-0.148
Token,
Token ID11
Feature activation-2.71e-02

Pos logit contributions
ifying+0.078
ay+0.071
 g+0.069
 tr+0.068
 �+0.067
Neg logit contributions
 vet-0.097
 dam-0.097
keeping-0.096
 traps-0.095
 conflicts-0.092
Token the
Token ID262
Feature activation-1.53e-03

Pos logit contributions
 dam+0.045
ooks+0.044
 vet+0.044
keeping+0.043
inger+0.043
Neg logit contributions
ifying-0.038
 g-0.036
 Ryan-0.035
 la-0.035
 tr-0.035
Token cater
Token ID20825
Feature activation+4.77e-02

Pos logit contributions
 vet+0.002
 traps+0.002
keeping+0.002
 dam+0.002
 hole+0.002
Neg logit contributions
ifying-0.003
ksh-0.002
ahoo-0.002
ay-0.002
 la-0.002
Token flashlight
Token ID38371
Feature activation+1.68e-02

Pos logit contributions
 vet+0.012
 dam+0.011
 traps+0.011
 hole+0.011
 hospital+0.011
Neg logit contributions
ifying-0.011
ksh-0.011
ahoo-0.010
ay-0.010
Nice-0.010
Token,
Token ID11
Feature activation-1.15e-02

Pos logit contributions
ifying+0.027
ksh+0.024
 la+0.024
ahoo+0.024
ay+0.024
Neg logit contributions
keeping-0.025
 traps-0.025
 vet-0.024
 dam-0.024
 bidder-0.023
Token and
Token ID290
Feature activation-2.59e-02

Pos logit contributions
 dam+0.019
keeping+0.019
ooks+0.018
 vet+0.018
 traps+0.018
Neg logit contributions
ifying-0.017
 la-0.015
 tr-0.015
 yes-0.015
 g-0.015
Token a
Token ID257
Feature activation-1.06e-02

Pos logit contributions
 dam+0.045
inger+0.044
ooks+0.043
igi+0.043
keeping+0.043
Neg logit contributions
ifying-0.036
 tr-0.033
 g-0.033
 yes-0.033
 la-0.033
Token magn
Token ID7842
Feature activation+4.93e-02

Pos logit contributions
 vet+0.016
 dam+0.016
 traps+0.016
 hole+0.015
keeping+0.015
Neg logit contributions
ifying-0.016
ksh-0.015
ahoo-0.015
ay-0.015
 la-0.014
Tokenifying
Token ID4035
Feature activation+4.20e-01

Pos logit contributions
 g+0.024
ksh+0.023
very+0.022
ahoo+0.021
 tr+0.021
Neg logit contributions
keeping-0.119
inger-0.115
 traps-0.114
orns-0.114
ooks-0.113
Token glass
Token ID5405
Feature activation-1.11e-01

Pos logit contributions
ifying+0.737
ay+0.651
 tr+0.616
 g+0.612
ahoo+0.584
Neg logit contributions
keeping-0.619
 traps-0.617
 conflicts-0.593
 vet-0.592
igi-0.590
Token.
Token ID13
Feature activation+1.91e-02

Pos logit contributions
 vet+0.181
 traps+0.179
 dam+0.178
keeping+0.175
ooks+0.174
Neg logit contributions
ifying-0.164
ay-0.145
 tr-0.143
 g-0.138
 la-0.138
Token
Token ID198
Feature activation-2.65e-02

Pos logit contributions
ifying+0.028
 �+0.026
 la+0.025
ay+0.025
 Ryan+0.025
Neg logit contributions
 traps-0.032
 dam-0.032
keeping-0.031
 killed-0.031
 conflicts-0.030
Token
Token ID198
Feature activation-2.64e-02

Pos logit contributions
 traps+0.052
keeping+0.051
 dam+0.051
 killed+0.049
 conflicts+0.048
Neg logit contributions
ay-0.030
 �-0.030
ifying-0.029
Jake-0.028
very-0.027
TokenAs
Token ID1722
Feature activation-9.28e-02

Pos logit contributions
 traps+0.040
keeping+0.040
 dam+0.040
 vet+0.039
orns+0.037
Neg logit contributions
ifying-0.042
 la-0.037
ay-0.037
ahoo-0.036
ksh-0.036
Token She
Token ID1375
Feature activation-3.66e-02

Pos logit contributions
ifying+0.057
ay+0.051
 la+0.050
ksh+0.050
 tr+0.049
Neg logit contributions
 traps-0.076
keeping-0.075
 dam-0.075
 vet-0.074
ooks-0.071
Token takes
Token ID2753
Feature activation-5.70e-02

Pos logit contributions
keeping+0.055
 dam+0.052
 vet+0.051
 traps+0.051
igi+0.050
Neg logit contributions
ifying-0.059
 tr-0.056
 la-0.055
ay-0.055
ahoo-0.054
Token out
Token ID503
Feature activation+9.87e-04

Pos logit contributions
 traps+0.083
 vet+0.082
keeping+0.081
 dam+0.080
ooks+0.076
Neg logit contributions
ifying-0.095
 la-0.084
ksh-0.084
ahoo-0.083
ay-0.082
Token a
Token ID257
Feature activation-7.49e-03

Pos logit contributions
ifying+0.002
ay+0.001
 tr+0.001
 la+0.001
 g+0.001
Neg logit contributions
keeping-0.002
 dam-0.002
 vet-0.002
 traps-0.002
 bidder-0.001
Token magn
Token ID7842
Feature activation+4.54e-02

Pos logit contributions
 traps+0.011
 vet+0.011
 dam+0.011
 hole+0.010
keeping+0.010
Neg logit contributions
ifying-0.012
ksh-0.012
ahoo-0.011
 la-0.011
ay-0.011
Tokenifying
Token ID4035
Feature activation+4.20e-01

Pos logit contributions
very+0.019
 la+0.018
ksh+0.017
 g+0.017
Nice+0.017
Neg logit contributions
inger-0.109
keeping-0.108
 traps-0.107
 hole-0.107
ooks-0.106
Token glass
Token ID5405
Feature activation-1.19e-01

Pos logit contributions
ifying+0.722
ay+0.645
 g+0.597
 tr+0.596
 ding+0.562
Neg logit contributions
keeping-0.642
igi-0.627
 traps-0.625
 conflicts-0.621
ooks-0.607
Token.
Token ID13
Feature activation+3.77e-02

Pos logit contributions
 traps+0.181
keeping+0.180
 dam+0.180
 vet+0.180
ooks+0.177
Neg logit contributions
ifying-0.187
ay-0.175
 tr-0.168
 g-0.165
 �-0.159
Token She
Token ID1375
Feature activation-4.15e-02

Pos logit contributions
ifying+0.056
ay+0.050
 la+0.049
ksh+0.049
 tr+0.048
Neg logit contributions
 traps-0.061
keeping-0.060
 dam-0.060
 vet-0.059
orns-0.057
Token looks
Token ID3073
Feature activation+1.09e-01

Pos logit contributions
keeping+0.068
 dam+0.062
igi+0.062
 vet+0.061
ooks+0.060
Neg logit contributions
ifying-0.063
 tr-0.062
 la-0.060
 g-0.059
ay-0.058
Token at
Token ID379
Feature activation+1.83e-01

Pos logit contributions
ifying+0.146
ksh+0.127
ahoo+0.127
ay+0.127
 la+0.123
Neg logit contributions
 traps-0.192
 dam-0.188
 vet-0.187
ooks-0.181
keeping-0.179
Token He
Token ID679
Feature activation-2.70e-02

Pos logit contributions
 traps+0.368
 vet+0.366
 dam+0.366
keeping+0.365
ooks+0.350
Neg logit contributions
ifying-0.311
ay-0.276
 la-0.274
ksh-0.271
 tr-0.270
Token took
Token ID1718
Feature activation+1.81e-02

Pos logit contributions
keeping+0.046
 traps+0.045
 dam+0.044
orns+0.044
ooks+0.043
Neg logit contributions
 tr-0.037
ifying-0.036
 g-0.034
 la-0.034
ay-0.034
Token out
Token ID503
Feature activation+2.88e-03

Pos logit contributions
ifying+0.025
ay+0.024
 g+0.023
 tr+0.022
 la+0.022
Neg logit contributions
 dam-0.031
keeping-0.031
 vet-0.031
igi-0.030
 traps-0.029
Token a
Token ID257
Feature activation-1.11e-02

Pos logit contributions
ifying+0.005
ksh+0.004
 la+0.004
ahoo+0.004
ay+0.004
Neg logit contributions
 traps-0.004
 vet-0.004
keeping-0.004
 dam-0.004
ooks-0.004
Token magn
Token ID7842
Feature activation+4.78e-02

Pos logit contributions
 traps+0.016
 vet+0.016
 dam+0.016
 hole+0.015
 jail+0.015
Neg logit contributions
ifying-0.019
ksh-0.018
ahoo-0.017
ay-0.016
Nice-0.016
Tokenifying
Token ID4035
Feature activation+4.19e-01

Pos logit contributions
 g+0.020
ksh+0.019
very+0.017
Nice+0.017
 Ple+0.016
Neg logit contributions
keeping-0.115
inger-0.113
 hole-0.111
orns-0.111
 traps-0.110
Token glass
Token ID5405
Feature activation-1.17e-01

Pos logit contributions
ifying+0.725
ay+0.656
 tr+0.614
 g+0.608
 la+0.581
Neg logit contributions
keeping-0.627
 traps-0.606
igi-0.605
 conflicts-0.596
ooks-0.594
Token and
Token ID290
Feature activation-3.30e-02

Pos logit contributions
 traps+0.169
keeping+0.168
 vet+0.168
 dam+0.167
ooks+0.163
Neg logit contributions
ifying-0.190
ay-0.173
 tr-0.170
 la-0.166
 g-0.165
Token a
Token ID257
Feature activation-1.04e-02

Pos logit contributions
igi+0.059
inger+0.058
orns+0.058
ooks+0.057
 dam+0.057
Neg logit contributions
 tr-0.043
 g-0.040
 Ryan-0.039
ifying-0.039
 Jake-0.039
Token notebook
Token ID20922
Feature activation+7.21e-02

Pos logit contributions
 vet+0.016
 dam+0.015
 traps+0.015
 hole+0.015
 jail+0.014
Neg logit contributions
ifying-0.016
ksh-0.015
ahoo-0.015
ay-0.015
 la-0.014
Token and
Token ID290
Feature activation-3.09e-02

Pos logit contributions
ifying+0.115
ksh+0.102
 la+0.102
ay+0.102
ahoo+0.101
Neg logit contributions
 traps-0.107
 vet-0.107
keeping-0.106
 dam-0.105
ooks-0.100
Token it
Token ID340
Feature activation-6.84e-02

Pos logit contributions
ifying+0.040
ay+0.037
 Ryan+0.036
 g+0.035
 tr+0.035
Neg logit contributions
 dam-0.059
keeping-0.057
 traps-0.057
ooks-0.057
 vet-0.057
Token she
Token ID673
Feature activation-8.94e-02

Pos logit contributions
keeping+0.145
orns+0.130
igi+0.129
 conflicts+0.126
 traps+0.126
Neg logit contributions
 tr-0.073
 g-0.071
 la-0.069
 �-0.068
 yes-0.067
Token had
Token ID550
Feature activation+2.31e-02

Pos logit contributions
keeping+0.175
orns+0.169
igi+0.166
 traps+0.162
 pits+0.161
Neg logit contributions
 tr-0.103
 Ryan-0.095
 cl-0.093
 Eric-0.093
 tut-0.093
Token a
Token ID257
Feature activation-1.59e-02

Pos logit contributions
ifying+0.030
 tr+0.029
 g+0.029
 Eric+0.027
 wo+0.027
Neg logit contributions
 dam-0.041
ooks-0.039
 traps-0.038
igi-0.037
 conflicts-0.037
Token magn
Token ID7842
Feature activation+4.83e-02

Pos logit contributions
 vet+0.026
 dam+0.025
 traps+0.025
 hole+0.024
 jail+0.024
Neg logit contributions
ifying-0.024
ksh-0.022
ahoo-0.021
ay-0.021
 la-0.021
Tokenifying
Token ID4035
Feature activation+4.18e-01

Pos logit contributions
ksh+0.022
 apr+0.020
 g+0.019
very+0.018
 la+0.018
Neg logit contributions
inger-0.117
keeping-0.115
orns-0.114
age-0.113
ooks-0.113
Token glass
Token ID5405
Feature activation-1.17e-01

Pos logit contributions
ifying+0.707
ay+0.637
 g+0.609
 tr+0.590
 la+0.556
Neg logit contributions
keeping-0.667
 traps-0.626
igi-0.623
 conflicts-0.619
ooks-0.615
Token,
Token ID11
Feature activation-1.98e-02

Pos logit contributions
keeping+0.180
 traps+0.180
 dam+0.178
 vet+0.178
ooks+0.175
Neg logit contributions
ifying-0.180
ay-0.163
 tr-0.160
 g-0.152
 la-0.151
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
 dam+0.034
keeping+0.033
 vet+0.033
 traps+0.032
igi+0.032
Neg logit contributions
ifying-0.027
 g-0.025
 tr-0.025
 yes-0.024
 la-0.024
Token tiny
Token ID7009
Feature activation+3.11e-02

Pos logit contributions
 vet+0.022
 hole+0.021
 dam+0.021
 traps+0.021
 nest+0.020
Neg logit contributions
ifying-0.022
ksh-0.022
ahoo-0.021
ay-0.020
Nice-0.020
Token torch
Token ID28034
Feature activation-6.77e-02

Pos logit contributions
ifying+0.051
 tr+0.046
ay+0.046
 la+0.045
 g+0.045
Neg logit contributions
keeping-0.046
 traps-0.044
 vet-0.043
 dam-0.043
ooks-0.043
Token,
Token ID11
Feature activation-9.66e-03

Pos logit contributions
 vet+0.003
 traps+0.003
ooks+0.003
keeping+0.003
 die+0.003
Neg logit contributions
ifying-0.007
ksh-0.007
ay-0.007
ahoo-0.007
 la-0.006
Token handing
Token ID22786
Feature activation+2.47e-02

Pos logit contributions
 vet+0.014
 dam+0.012
 wound+0.012
 traps+0.012
 Tweet+0.012
Neg logit contributions
ifying-0.018
ay-0.017
ahoo-0.015
ee-0.015
ksh-0.015
Token him
Token ID683
Feature activation+2.41e-01

Pos logit contributions
ifying+0.035
 la+0.030
ahoo+0.030
ksh+0.029
very+0.029
Neg logit contributions
 vet-0.044
 traps-0.043
keeping-0.043
orns-0.041
 dam-0.040
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
ifying+0.420
 la+0.379
ay+0.378
ksh+0.378
ahoo+0.376
Neg logit contributions
 traps-0.336
 vet-0.330
keeping-0.318
 dam-0.314
orns-0.308
Token magn
Token ID7842
Feature activation+4.64e-02

Pos logit contributions
ifying+0.019
ksh+0.018
ahoo+0.017
ay+0.017
very+0.017
Neg logit contributions
 vet-0.018
 traps-0.017
 dam-0.017
 hole-0.016
 hospital-0.016
Tokenifying
Token ID4035
Feature activation+4.18e-01

Pos logit contributions
 g+0.018
ahoo+0.017
very+0.016
 apr+0.016
ksh+0.016
Neg logit contributions
keeping-0.114
orns-0.110
inger-0.110
 traps-0.109
 hole-0.109
Token glass
Token ID5405
Feature activation-1.17e-01

Pos logit contributions
ifying+0.724
ay+0.659
 g+0.606
 tr+0.598
 la+0.568
Neg logit contributions
keeping-0.633
igi-0.612
 traps-0.611
ooks-0.604
 conflicts-0.598
Token.
Token ID13
Feature activation+3.18e-02

Pos logit contributions
 traps+0.191
 dam+0.191
ooks+0.189
 vet+0.188
keeping+0.186
Neg logit contributions
ifying-0.169
ay-0.152
 tr-0.150
 g-0.147
 la-0.142
Token She
Token ID1375
Feature activation-5.62e-02

Pos logit contributions
ifying+0.054
ksh+0.048
ahoo+0.047
ay+0.047
 la+0.047
Neg logit contributions
 vet-0.045
 traps-0.044
 dam-0.044
keeping-0.044
ooks-0.042
Token looked
Token ID3114
Feature activation+9.02e-02

Pos logit contributions
keeping+0.108
 dam+0.104
 traps+0.103
ooks+0.103
orns+0.102
Neg logit contributions
 tr-0.069
 g-0.064
 yes-0.062
 jugg-0.062
ifying-0.062
Token around
Token ID1088
Feature activation+8.66e-02

Pos logit contributions
ifying+0.125
ay+0.113
ahoo+0.112
ksh+0.111
 la+0.103
Neg logit contributions
 traps-0.156
 dam-0.148
 hole-0.148
 vet-0.147
 killed-0.146
Token the
Token ID262
Feature activation+9.14e-03

Pos logit contributions
 Ryan+0.171
 Jake+0.168
 tr+0.165
 g+0.165
 football+0.160
Neg logit contributions
ooks-0.371
 bidder-0.370
age-0.366
 dam-0.360
 antidote-0.352
Token watch
Token ID2342
Feature activation+2.28e-01

Pos logit contributions
ifying+0.016
ksh+0.016
ahoo+0.015
 Ryan+0.014
ay+0.014
Neg logit contributions
 vet-0.013
 traps-0.013
 hole-0.013
 dam-0.012
 antidote-0.012
Token with
Token ID351
Feature activation-4.17e-02

Pos logit contributions
ifying+0.406
ksh+0.388
ahoo+0.381
Nice+0.374
 la+0.366
Neg logit contributions
 vet-0.290
 traps-0.283
keeping-0.277
 bidder-0.272
 dam-0.272
Token the
Token ID262
Feature activation+7.78e-03

Pos logit contributions
keeping+0.068
 dam+0.067
 vet+0.067
 traps+0.066
ooks+0.065
Neg logit contributions
ifying-0.061
 g-0.055
ay-0.054
 tr-0.053
 Ryan-0.052
Token magn
Token ID7842
Feature activation+4.62e-02

Pos logit contributions
ifying+0.013
ksh+0.012
ahoo+0.011
 Ryan+0.011
ay+0.011
Neg logit contributions
 vet-0.012
 traps-0.012
 dam-0.011
 hole-0.011
 antidote-0.011
Tokenifying
Token ID4035
Feature activation+4.18e-01

Pos logit contributions
very+0.024
 apr+0.024
 g+0.024
ahoo+0.022
 sports+0.022
Neg logit contributions
inger-0.106
keeping-0.103
house-0.103
age-0.103
orns-0.103
Token glass
Token ID5405
Feature activation-1.17e-01

Pos logit contributions
ifying+0.705
ay+0.641
 g+0.596
 tr+0.574
 ding+0.544
Neg logit contributions
keeping-0.654
igi-0.645
 conflicts-0.632
ooks-0.631
 traps-0.631
Token.
Token ID13
Feature activation+3.02e-02

Pos logit contributions
 vet+0.178
 traps+0.177
keeping+0.176
 dam+0.176
ooks+0.171
Neg logit contributions
ifying-0.182
ay-0.162
 tr-0.161
 la-0.159
ksh-0.157
Token He
Token ID679
Feature activation-2.25e-02

Pos logit contributions
ifying+0.038
 �+0.036
 tr+0.036
ay+0.035
ksh+0.034
Neg logit contributions
keeping-0.053
igi-0.052
 traps-0.052
 conflicts-0.051
 vet-0.051
Token saw
Token ID2497
Feature activation+2.36e-02

Pos logit contributions
keeping+0.036
 traps+0.036
 vet+0.036
 dam+0.036
ooks+0.034
Neg logit contributions
ifying-0.033
 la-0.029
ay-0.029
 tr-0.029
ksh-0.029
Token the
Token ID262
Feature activation+9.54e-03

Pos logit contributions
 g+0.026
 tr+0.025
ifying+0.025
 �+0.024
 football+0.024
Neg logit contributions
keeping-0.046
inger-0.046
 bidder-0.045
igi-0.044
ooks-0.043
Token grandma
Token ID49890
Feature activation-2.40e-02

Pos logit contributions
keeping+0.016
 traps+0.016
 vet+0.015
 dam+0.015
orns+0.015
Neg logit contributions
ifying-0.020
 tr-0.019
ay-0.019
 la-0.018
ksh-0.018
Token's
Token ID338
Feature activation+4.99e-02

Pos logit contributions
 traps+0.035
 vet+0.035
keeping+0.035
 dam+0.035
 bidder+0.033
Neg logit contributions
ifying-0.039
ay-0.035
ahoo-0.034
 la-0.034
ksh-0.034
Token house
Token ID2156
Feature activation-1.46e-01

Pos logit contributions
ifying+0.094
ay+0.085
 la+0.085
 tr+0.085
ksh+0.084
Neg logit contributions
 traps-0.059
keeping-0.059
 vet-0.059
 dam-0.058
ooks-0.056
Token for
Token ID329
Feature activation+2.98e-02

Pos logit contributions
 traps+0.220
 vet+0.219
 dam+0.218
keeping+0.217
ooks+0.209
Neg logit contributions
ifying-0.228
ay-0.203
 tr-0.201
 la-0.201
ksh-0.200
Token safe
Token ID3338
Feature activation-1.65e-01

Pos logit contributions
ifying+0.039
 Ryan+0.035
ay+0.035
 tr+0.035
 g+0.035
Neg logit contributions
 dam-0.051
 vet-0.050
ooks-0.050
 traps-0.050
keeping-0.049
Tokenkeeping
Token ID19934
Feature activation-4.50e-01

Pos logit contributions
 vet+0.224
 dam+0.219
ooks+0.217
 bidder+0.213
igi+0.213
Neg logit contributions
ifying-0.276
 tr-0.263
 Ryan-0.257
 g-0.253
ay-0.249
Token.
Token ID13
Feature activation+9.66e-03

Pos logit contributions
 traps+0.783
 vet+0.778
 dam+0.777
ooks+0.743
orns+0.735
Neg logit contributions
ifying-0.607
 tr-0.539
ay-0.530
 la-0.524
 g-0.520
Token
Token ID220
Feature activation+4.22e-02

Pos logit contributions
ifying+0.011
 �+0.011
ay+0.010
 Ryan+0.010
 tr+0.010
Neg logit contributions
 traps-0.019
 dam-0.018
 conflicts-0.018
keeping-0.018
 antidote-0.018
Token
Token ID198
Feature activation-4.07e-02

Pos logit contributions
ifying+0.055
ay+0.053
 �+0.049
 Ryan+0.049
ahoo+0.048
Neg logit contributions
 traps-0.075
 dam-0.073
keeping-0.072
 vet-0.072
 bidder-0.070
Token
Token ID198
Feature activation-4.02e-02

Pos logit contributions
 traps+0.072
 dam+0.070
 vet+0.069
keeping+0.069
 conflicts+0.067
Neg logit contributions
ifying-0.053
ay-0.051
 �-0.048
very-0.048
 Ryan-0.048
TokenWhen
Token ID2215
Feature activation-2.23e-02

Pos logit contributions
 vet+0.061
keeping+0.060
 traps+0.060
 dam+0.059
ooks+0.057
Neg logit contributions
ifying-0.064
 la-0.057
ksh-0.057
ay-0.057
ahoo-0.056
Token took
Token ID1718
Feature activation+8.51e-03

Pos logit contributions
keeping+0.113
 traps+0.104
ooks+0.104
orns+0.102
 dam+0.102
Neg logit contributions
 tr-0.078
 �-0.072
ifying-0.071
 wo-0.071
 g-0.071
Token the
Token ID262
Feature activation+3.10e-03

Pos logit contributions
ay+0.009
ifying+0.009
 g+0.008
 Ryan+0.008
 Jake+0.008
Neg logit contributions
 dam-0.017
igi-0.017
ooks-0.017
keeping-0.017
 bidder-0.016
Token bird
Token ID6512
Feature activation-7.97e-02

Pos logit contributions
ifying+0.005
ay+0.005
 la+0.005
 tr+0.005
ksh+0.005
Neg logit contributions
keeping-0.004
 traps-0.004
 dam-0.004
 vet-0.004
ooks-0.004
Token to
Token ID284
Feature activation-1.49e-03

Pos logit contributions
ooks+0.144
 bidder+0.143
 dies+0.140
 conflicts+0.140
 killed+0.138
Neg logit contributions
ifying-0.092
 g-0.088
ay-0.088
 tr-0.085
 int-0.078
Token the
Token ID262
Feature activation+2.38e-03

Pos logit contributions
orns+0.003
ooks+0.003
igi+0.003
 conflicts+0.003
keeping+0.003
Neg logit contributions
 g-0.002
 Ryan-0.001
 la-0.001
 �-0.001
 cl-0.001
Token vet
Token ID20202
Feature activation-4.46e-01

Pos logit contributions
ifying+0.005
 tr+0.004
ay+0.004
 la+0.004
 g+0.004
Neg logit contributions
keeping-0.003
ooks-0.003
 traps-0.003
 dam-0.002
orns-0.002
Token and
Token ID290
Feature activation-2.92e-02

Pos logit contributions
 traps+0.793
 dam+0.790
 conflicts+0.789
orns+0.788
keeping+0.777
Neg logit contributions
ifying-0.545
 �-0.508
 yes-0.495
 la-0.487
 tr-0.484
Token they
Token ID484
Feature activation-3.47e-02

Pos logit contributions
ooks+0.061
orns+0.059
inger+0.058
 conflicts+0.058
keeping+0.057
Neg logit contributions
 yes-0.030
 g-0.029
 tr-0.029
 Daddy-0.028
 cl-0.028
Token fixed
Token ID5969
Feature activation-1.41e-01

Pos logit contributions
keeping+0.056
 traps+0.056
 dam+0.055
ooks+0.055
 vet+0.055
Neg logit contributions
ifying-0.051
 tr-0.045
ay-0.045
 g-0.044
ksh-0.044
Token its
Token ID663
Feature activation-1.65e-01

Pos logit contributions
 dam+0.235
 vet+0.233
keeping+0.229
ooks+0.228
 traps+0.225
Neg logit contributions
ifying-0.199
ay-0.178
 tr-0.176
 g-0.175
ksh-0.172
Token wing
Token ID8539
Feature activation-3.55e-01

Pos logit contributions
ooks+0.168
orns+0.166
keeping+0.162
igi+0.155
limits+0.151
Neg logit contributions
ifying-0.338
 tr-0.337
 g-0.316
 trumpet-0.309
ay-0.309
Token to
Token ID284
Feature activation-1.75e-03

Pos logit contributions
ifying+0.123
 g+0.112
 tr+0.112
ksh+0.106
ay+0.104
Neg logit contributions
ooks-0.120
igi-0.116
 dam-0.116
 bidder-0.116
 vet-0.115
Token the
Token ID262
Feature activation+1.17e-03

Pos logit contributions
orns+0.003
ooks+0.003
keeping+0.003
 traps+0.003
 conflicts+0.003
Neg logit contributions
 g-0.002
ifying-0.002
 Ryan-0.002
 �-0.002
 la-0.002
Token vet
Token ID20202
Feature activation-4.46e-01

Pos logit contributions
ifying+0.002
 tr+0.002
ay+0.002
 la+0.002
ksh+0.002
Neg logit contributions
keeping-0.001
 traps-0.001
 dam-0.001
ooks-0.001
orns-0.001
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
 traps+0.825
orns+0.822
 dam+0.815
ooks+0.813
igi+0.810
Neg logit contributions
ifying-0.518
 �-0.466
 yes-0.455
 tr-0.452
 g-0.450
Token The
Token ID383
Feature activation+1.79e-02

Pos logit contributions
ifying+0.018
 �+0.017
 Ryan+0.016
ay+0.016
ksh+0.016
Neg logit contributions
 traps-0.034
keeping-0.033
 conflicts-0.033
 dam-0.033
 antidote-0.032
Token vet
Token ID20202
Feature activation-4.46e-01

Pos logit contributions
ifying+0.030
 tr+0.028
ay+0.028
 la+0.027
 g+0.027
Neg logit contributions
keeping-0.025
ooks-0.024
 dam-0.023
orns-0.023
 traps-0.023
Token gave
Token ID2921
Feature activation-7.89e-02

Pos logit contributions
keeping+0.834
orns+0.804
 traps+0.803
 dam+0.795
ooks+0.791
Neg logit contributions
 yes-0.532
 tr-0.506
 �-0.505
ifying-0.504
 Ryan-0.497
Token the
Token ID262
Feature activation+3.95e-03

Pos logit contributions
 dam+0.148
ooks+0.145
keeping+0.143
 vet+0.142
 traps+0.141
Neg logit contributions
ifying-0.095
ay-0.086
 Ryan-0.082
 g-0.081
ksh-0.081
Token puppy
Token ID26188
Feature activation+7.79e-02

Pos logit contributions
ifying+0.005
 tr+0.004
ay+0.004
 la+0.004
 g+0.004
Neg logit contributions
ooks-0.008
keeping-0.007
 dam-0.007
 traps-0.007
orns-0.007
Token a
Token ID257
Feature activation-1.89e-02

Pos logit contributions
ifying+0.116
 g+0.104
 tr+0.103
ay+0.101
ksh+0.100
Neg logit contributions
 dam-0.125
keeping-0.123
ooks-0.121
 bidder-0.120
 vet-0.119
Token license
Token ID5964
Feature activation+7.01e-02

Pos logit contributions
keeping+0.034
 traps+0.033
 dam+0.033
ooks+0.032
orns+0.032
Neg logit contributions
ifying-0.025
ay-0.022
 tr-0.022
 g-0.021
 la-0.021
Token took
Token ID1718
Feature activation+1.01e-02

Pos logit contributions
inger+0.061
ooks+0.061
keeping+0.061
orns+0.058
 bidder+0.056
Neg logit contributions
 g-0.039
 �-0.037
 tr-0.037
 cl-0.036
 wo-0.035
Token the
Token ID262
Feature activation+2.55e-03

Pos logit contributions
ifying+0.013
ay+0.013
 g+0.012
 Ryan+0.011
 tr+0.011
Neg logit contributions
 dam-0.018
ooks-0.018
keeping-0.017
igi-0.017
 vet-0.017
Token puppy
Token ID26188
Feature activation+7.94e-02

Pos logit contributions
ifying+0.004
ksh+0.004
 la+0.004
ahoo+0.004
ay+0.004
Neg logit contributions
 vet-0.004
 traps-0.004
keeping-0.003
 dam-0.003
 killed-0.003
Token to
Token ID284
Feature activation-1.75e-03

Pos logit contributions
ifying+0.123
 g+0.112
 tr+0.112
ksh+0.106
ay+0.104
Neg logit contributions
ooks-0.120
igi-0.116
 dam-0.116
 bidder-0.116
 vet-0.115
Token the
Token ID262
Feature activation+1.17e-03

Pos logit contributions
orns+0.003
ooks+0.003
keeping+0.003
 traps+0.003
 conflicts+0.003
Neg logit contributions
 g-0.002
ifying-0.002
 Ryan-0.002
 �-0.002
 la-0.002
Token vet
Token ID20202
Feature activation-4.46e-01

Pos logit contributions
ifying+0.002
 tr+0.002
ay+0.002
 la+0.002
ksh+0.002
Neg logit contributions
keeping-0.001
 traps-0.001
 dam-0.001
ooks-0.001
orns-0.001
Token.
Token ID13
Feature activation+1.68e-02

Pos logit contributions
 traps+0.825
orns+0.822
 dam+0.815
ooks+0.813
igi+0.810
Neg logit contributions
ifying-0.518
 �-0.466
 yes-0.455
 tr-0.452
 g-0.450
Token The
Token ID383
Feature activation+1.79e-02

Pos logit contributions
ifying+0.018
 �+0.017
 Ryan+0.016
ay+0.016
ksh+0.016
Neg logit contributions
 traps-0.034
keeping-0.033
 conflicts-0.033
 dam-0.033
 antidote-0.032
Token vet
Token ID20202
Feature activation-4.46e-01

Pos logit contributions
ifying+0.030
 tr+0.028
ay+0.028
 la+0.027
 g+0.027
Neg logit contributions
keeping-0.025
ooks-0.024
 dam-0.023
orns-0.023
 traps-0.023
Token gave
Token ID2921
Feature activation-7.89e-02

Pos logit contributions
keeping+0.834
orns+0.804
 traps+0.803
 dam+0.795
ooks+0.791
Neg logit contributions
 yes-0.532
 tr-0.506
 �-0.505
ifying-0.504
 Ryan-0.497
Token the
Token ID262
Feature activation+3.95e-03

Pos logit contributions
 dam+0.148
ooks+0.145
keeping+0.143
 vet+0.142
 traps+0.141
Neg logit contributions
ifying-0.095
ay-0.086
 Ryan-0.082
 g-0.081
ksh-0.081
Token visit
Token ID3187
Feature activation+2.27e-02

Pos logit contributions
ifying+0.007
ay+0.007
 la+0.007
ksh+0.007
ahoo+0.007
Neg logit contributions
 vet-0.005
 traps-0.005
 dam-0.005
keeping-0.005
 bidder-0.004
Token.
Token ID13
Feature activation+2.24e-02

Pos logit contributions
ifying+0.031
ay+0.028
 tr+0.027
 la+0.027
ksh+0.027
Neg logit contributions
 dam-0.038
 vet-0.038
 traps-0.037
keeping-0.037
ooks-0.037
Token
Token ID198
Feature activation-2.89e-02

Pos logit contributions
 �+0.026
ifying+0.025
 Ryan+0.024
ay+0.024
 Eric+0.023
Neg logit contributions
 traps-0.043
 dam-0.043
 conflicts-0.042
 antidote-0.042
keeping-0.041
Token
Token ID198
Feature activation-2.82e-02

Pos logit contributions
 traps+0.050
 dam+0.049
keeping+0.049
 conflicts+0.047
 vet+0.047
Neg logit contributions
ifying-0.038
ay-0.037
 �-0.035
 la-0.035
very-0.034
TokenThe
Token ID464
Feature activation+5.27e-02

Pos logit contributions
 traps+0.047
keeping+0.047
 dam+0.047
 vet+0.046
ooks+0.045
Neg logit contributions
ifying-0.040
ay-0.035
 la-0.035
ksh-0.035
 tr-0.035
Token vet
Token ID20202
Feature activation-4.45e-01

Pos logit contributions
ifying+0.116
 tr+0.115
 yes+0.114
 la+0.114
 �+0.113
Neg logit contributions
keeping-0.047
ooks-0.039
orns-0.034
 conflicts-0.033
 dam-0.033
Token said
Token ID531
Feature activation-9.92e-03

Pos logit contributions
keeping+0.789
 traps+0.764
orns+0.760
 dam+0.755
ooks+0.751
Neg logit contributions
 yes-0.543
 tr-0.540
ifying-0.535
 �-0.533
 la-0.530
Token that
Token ID326
Feature activation+1.94e-02

Pos logit contributions
 vet+0.015
 traps+0.015
keeping+0.014
 dam+0.014
 killed+0.014
Neg logit contributions
ifying-0.016
ksh-0.015
ay-0.014
ahoo-0.014
 la-0.014
Token Spot
Token ID15899
Feature activation-6.15e-03

Pos logit contributions
ifying+0.025
 tr+0.024
 Ryan+0.024
 �+0.024
 g+0.023
Neg logit contributions
 dam-0.033
keeping-0.033
ooks-0.032
inger-0.032
igi-0.031
Token was
Token ID373
Feature activation-5.24e-02

Pos logit contributions
 dam+0.010
ooks+0.010
keeping+0.010
 vet+0.010
 traps+0.010
Neg logit contributions
ifying-0.008
 tr-0.007
ay-0.007
ksh-0.007
 Ryan-0.007
Token fine
Token ID3734
Feature activation+3.07e-02

Pos logit contributions
keeping+0.095
orns+0.089
 dam+0.088
ooks+0.088
 bidder+0.086
Neg logit contributions
ifying-0.068
 �-0.063
 yes-0.062
 Ryan-0.061
 g-0.059
Token and
Token ID290
Feature activation-3.20e-02

Pos logit contributions
ooks+0.150
keeping+0.148
 dam+0.145
 vet+0.144
 conflicts+0.144
Neg logit contributions
 g-0.086
ifying-0.084
 tr-0.082
 �-0.078
ay-0.078
Token took
Token ID1718
Feature activation+8.36e-03

Pos logit contributions
ooks+0.065
inger+0.064
keeping+0.064
 bidder+0.062
alk+0.061
Neg logit contributions
 tr-0.036
 cl-0.034
 g-0.034
 Ryan-0.032
 str-0.032
Token it
Token ID340
Feature activation-7.02e-02

Pos logit contributions
ifying+0.012
ay+0.011
 g+0.011
 la+0.010
 Ryan+0.010
Neg logit contributions
 dam-0.014
keeping-0.014
ooks-0.014
 vet-0.014
 traps-0.014
Token to
Token ID284
Feature activation-1.15e-03

Pos logit contributions
 bidder+0.124
 vet+0.121
 conflicts+0.121
ooks+0.120
igi+0.120
Neg logit contributions
ay-0.085
ifying-0.084
 g-0.082
 tr-0.082
 �-0.076
Token the
Token ID262
Feature activation+1.70e-03

Pos logit contributions
orns+0.002
ooks+0.002
 conflicts+0.002
keeping+0.002
igi+0.002
Neg logit contributions
 g-0.001
 la-0.001
 Ryan-0.001
ifying-0.001
 �-0.001
Token vet
Token ID20202
Feature activation-4.45e-01

Pos logit contributions
ifying+0.003
 tr+0.003
 g+0.003
 la+0.003
ay+0.003
Neg logit contributions
ooks-0.002
keeping-0.002
orns-0.002
 conflicts-0.002
 traps-0.002
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
orns+0.923
 conflicts+0.921
igi+0.909
 traps+0.909
 dam+0.900
Neg logit contributions
ifying-0.398
 �-0.389
 yes-0.383
 tr-0.355
 g-0.355
Token The
Token ID383
Feature activation+2.03e-02

Pos logit contributions
 �+0.019
ifying+0.016
 Ryan+0.016
 Daddy+0.016
 Wow+0.016
Neg logit contributions
 traps-0.044
 conflicts-0.044
keeping-0.043
 dam-0.042
inger-0.042
Token vet
Token ID20202
Feature activation-4.44e-01

Pos logit contributions
ifying+0.029
 tr+0.028
 la+0.028
 yes+0.028
 trumpet+0.027
Neg logit contributions
keeping-0.033
ooks-0.031
orns-0.029
 conflicts-0.029
igi-0.028
Token helped
Token ID4193
Feature activation-3.01e-02

Pos logit contributions
keeping+0.858
orns+0.825
 traps+0.814
 conflicts+0.809
 dam+0.805
Neg logit contributions
 yes-0.530
 tr-0.493
 �-0.478
 Ryan-0.476
 g-0.476
Token the
Token ID262
Feature activation+5.64e-03

Pos logit contributions
 dam+0.053
keeping+0.051
 traps+0.050
orns+0.050
ooks+0.050
Neg logit contributions
ay-0.035
 g-0.035
 la-0.035
ifying-0.034
 Ryan-0.034
Token to
Token ID284
Feature activation-1.15e-03

Pos logit contributions
 bidder+0.124
 vet+0.121
 conflicts+0.121
ooks+0.120
igi+0.120
Neg logit contributions
ay-0.085
ifying-0.084
 g-0.082
 tr-0.082
 �-0.076
Token the
Token ID262
Feature activation+1.70e-03

Pos logit contributions
orns+0.002
ooks+0.002
 conflicts+0.002
keeping+0.002
igi+0.002
Neg logit contributions
 g-0.001
 la-0.001
 Ryan-0.001
ifying-0.001
 �-0.001
Token vet
Token ID20202
Feature activation-4.45e-01

Pos logit contributions
ifying+0.003
 tr+0.003
 g+0.003
 la+0.003
ay+0.003
Neg logit contributions
ooks-0.002
keeping-0.002
orns-0.002
 conflicts-0.002
 traps-0.002
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
orns+0.923
 conflicts+0.921
igi+0.909
 traps+0.909
 dam+0.900
Neg logit contributions
ifying-0.398
 �-0.389
 yes-0.383
 tr-0.355
 g-0.355
Token The
Token ID383
Feature activation+2.03e-02

Pos logit contributions
 �+0.019
ifying+0.016
 Ryan+0.016
 Daddy+0.016
 Wow+0.016
Neg logit contributions
 traps-0.044
 conflicts-0.044
keeping-0.043
 dam-0.042
inger-0.042
Token vet
Token ID20202
Feature activation-4.44e-01

Pos logit contributions
ifying+0.029
 tr+0.028
 la+0.028
 yes+0.028
 trumpet+0.027
Neg logit contributions
keeping-0.033
ooks-0.031
orns-0.029
 conflicts-0.029
igi-0.028
Token helped
Token ID4193
Feature activation-3.01e-02

Pos logit contributions
keeping+0.858
orns+0.825
 traps+0.814
 conflicts+0.809
 dam+0.805
Neg logit contributions
 yes-0.530
 tr-0.493
 �-0.478
 Ryan-0.476
 g-0.476
Token the
Token ID262
Feature activation+5.64e-03

Pos logit contributions
 dam+0.053
keeping+0.051
 traps+0.050
orns+0.050
ooks+0.050
Neg logit contributions
ay-0.035
 g-0.035
 la-0.035
ifying-0.034
 Ryan-0.034
Token bird
Token ID6512
Feature activation-7.70e-02

Pos logit contributions
ifying+0.007
 tr+0.007
 la+0.007
 trumpet+0.006
 g+0.006
Neg logit contributions
ooks-0.010
keeping-0.010
orns-0.009
igi-0.009
inger-0.009
Token and
Token ID290
Feature activation-3.08e-02

Pos logit contributions
keeping+0.150
ooks+0.143
igi+0.143
 conflicts+0.139
orns+0.137
Neg logit contributions
 g-0.095
 yes-0.079
 int-0.077
 tr-0.075
 rum-0.074
Token it
Token ID340
Feature activation-6.78e-02

Pos logit contributions
inger+0.068
ooks+0.066
orns+0.066
keeping+0.064
alk+0.063
Neg logit contributions
 yes-0.028
 cl-0.027
 g-0.027
 tr-0.026
 Daddy-0.026
Token to
Token ID284
Feature activation+3.61e-03

Pos logit contributions
ifying+0.078
 tr+0.074
 j+0.071
 g+0.071
ding+0.070
Neg logit contributions
orns-0.239
 bidder-0.238
ooks-0.238
 conflicts-0.235
 traps-0.234
Token the
Token ID262
Feature activation+9.87e-03

Pos logit contributions
 g+0.003
 Ryan+0.003
 Jake+0.003
 Eric+0.003
 Daddy+0.003
Neg logit contributions
ooks-0.008
orns-0.008
igi-0.008
age-0.008
inger-0.007
Token vet
Token ID20202
Feature activation-4.43e-01

Pos logit contributions
ifying+0.019
 tr+0.018
 la+0.017
ay+0.017
 g+0.017
Neg logit contributions
keeping-0.011
ooks-0.011
 traps-0.011
orns-0.011
 dam-0.011
Token.
Token ID13
Feature activation+2.56e-02

Pos logit contributions
 dam+0.825
ooks+0.823
 traps+0.819
orns+0.819
 conflicts+0.808
Neg logit contributions
ifying-0.496
 yes-0.467
 �-0.448
 tr-0.447
 la-0.446
Token The
Token ID383
Feature activation+2.77e-02

Pos logit contributions
ifying+0.033
 la+0.029
 �+0.029
 g+0.029
 Eric+0.029
Neg logit contributions
keeping-0.049
orns-0.048
 dam-0.047
igi-0.047
inger-0.046
Token vet
Token ID20202
Feature activation-4.43e-01

Pos logit contributions
 tr+0.055
 yes+0.055
ifying+0.055
 trumpet+0.054
 guitar+0.054
Neg logit contributions
keeping-0.030
ooks-0.025
alk-0.022
orns-0.022
 conflicts-0.022
Token was
Token ID373
Feature activation-4.53e-02

Pos logit contributions
keeping+0.830
orns+0.812
 dam+0.806
 traps+0.805
ooks+0.800
Neg logit contributions
 yes-0.539
 tr-0.506
ifying-0.493
 la-0.481
 Ryan-0.479
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
keeping+0.083
orns+0.078
 dam+0.074
ooks+0.074
 conflicts+0.074
Neg logit contributions
ifying-0.060
 �-0.053
 yes-0.053
 ding-0.053
 g-0.053
Token nice
Token ID3621
Feature activation+1.21e-01

Pos logit contributions
keeping+0.019
orns+0.019
ooks+0.018
 traps+0.018
 conflicts+0.017
Neg logit contributions
ifying-0.019
 tr-0.018
 g-0.018
ay-0.018
 la-0.018
Token person
Token ID1048
Feature activation-1.77e-01

Pos logit contributions
 �+0.153
ay+0.152
 g+0.149
 tr+0.147
 wo+0.143
Neg logit contributions
keeping-0.211
orns-0.200
 conflicts-0.197
 traps-0.191
ooks-0.189
Token who
Token ID508
Feature activation+6.80e-02

Pos logit contributions
 dam+0.308
 traps+0.306
keeping+0.306
 vet+0.302
ooks+0.299
Neg logit contributions
ifying-0.237
 la-0.208
ay-0.204
 tr-0.203
ksh-0.199
Token She
Token ID1375
Feature activation-7.00e-02

Pos logit contributions
ifying+0.002
ay+0.002
 la+0.002
ksh+0.002
 tr+0.002
Neg logit contributions
 traps-0.002
keeping-0.002
 dam-0.002
 vet-0.002
ooks-0.002
Token took
Token ID1718
Feature activation+8.06e-03

Pos logit contributions
keeping+0.127
ooks+0.121
orns+0.120
 traps+0.119
 dam+0.117
Neg logit contributions
 tr-0.095
ifying-0.090
 �-0.089
 g-0.088
 yes-0.086
Token Max
Token ID5436
Feature activation+1.15e-01

Pos logit contributions
ifying+0.011
 la+0.009
ksh+0.009
ay+0.009
ahoo+0.009
Neg logit contributions
 traps-0.014
 vet-0.014
keeping-0.014
 dam-0.014
ooks-0.013
Token to
Token ID284
Feature activation+1.37e-03

Pos logit contributions
ifying+0.169
 tr+0.149
ay+0.149
 la+0.146
 g+0.146
Neg logit contributions
 dam-0.185
 traps-0.181
keeping-0.181
 vet-0.181
ooks-0.180
Token the
Token ID262
Feature activation-5.04e-03

Pos logit contributions
ifying+0.002
 g+0.002
 Ryan+0.002
 tr+0.002
 la+0.002
Neg logit contributions
ooks-0.002
 dam-0.002
orns-0.002
keeping-0.002
igi-0.002
Token vet
Token ID20202
Feature activation-4.43e-01

Pos logit contributions
keeping+0.004
ooks+0.003
orns+0.003
igi+0.003
 conflicts+0.003
Neg logit contributions
ifying-0.012
 tr-0.012
 g-0.011
 �-0.011
 la-0.011
Token to
Token ID284
Feature activation+1.86e-03

Pos logit contributions
 dam+0.728
 traps+0.725
keeping+0.716
orns+0.714
ooks+0.713
Neg logit contributions
ifying-0.599
 �-0.562
 tr-0.551
 la-0.545
 yes-0.541
Token get
Token ID651
Feature activation-1.08e-01

Pos logit contributions
ifying+0.003
 g+0.003
 tr+0.002
 Ryan+0.002
 la+0.002
Neg logit contributions
ooks-0.003
keeping-0.003
orns-0.003
 dam-0.003
 traps-0.003
Token an
Token ID281
Feature activation-6.25e-02

Pos logit contributions
 dam+0.202
keeping+0.200
ooks+0.200
 traps+0.198
 bidder+0.197
Neg logit contributions
ifying-0.131
 g-0.113
 Ryan-0.111
 tr-0.109
ay-0.109
Token x
Token ID2124
Feature activation-5.78e-02

Pos logit contributions
keeping+0.110
ooks+0.101
igi+0.099
 conflicts+0.097
orns+0.096
Neg logit contributions
ay-0.082
 g-0.081
 yes-0.080
 tr-0.079
 �-0.078
Token-
Token ID12
Feature activation-1.09e-02

Pos logit contributions
ooks+0.105
 dam+0.104
keeping+0.101
 conflicts+0.100
 traps+0.099
Neg logit contributions
ifying-0.068
ay-0.067
 tr-0.064
 la-0.063
 g-0.062
Token had
Token ID550
Feature activation+2.89e-02

Pos logit contributions
 traps+0.044
keeping+0.044
 dam+0.044
 vet+0.044
ooks+0.042
Neg logit contributions
ifying-0.033
ay-0.029
 la-0.029
 tr-0.028
ksh-0.028
Token to
Token ID284
Feature activation+4.15e-03

Pos logit contributions
ifying+0.045
ay+0.040
 la+0.040
ksh+0.040
ahoo+0.040
Neg logit contributions
 traps-0.044
 vet-0.043
keeping-0.043
 dam-0.043
ooks-0.041
Token go
Token ID467
Feature activation+1.10e-01

Pos logit contributions
ifying+0.005
 g+0.005
 tr+0.004
 la+0.004
ay+0.004
Neg logit contributions
 traps-0.008
orns-0.008
keeping-0.007
ooks-0.007
 dam-0.007
Token to
Token ID284
Feature activation+3.61e-03

Pos logit contributions
ifying+0.078
 tr+0.074
 j+0.071
 g+0.071
ding+0.070
Neg logit contributions
orns-0.239
 bidder-0.238
ooks-0.238
 conflicts-0.235
 traps-0.234
Token the
Token ID262
Feature activation+9.87e-03

Pos logit contributions
 g+0.003
 Ryan+0.003
 Jake+0.003
 Eric+0.003
 Daddy+0.003
Neg logit contributions
ooks-0.008
orns-0.008
igi-0.008
age-0.008
inger-0.007
Token vet
Token ID20202
Feature activation-4.43e-01

Pos logit contributions
ifying+0.019
 tr+0.018
 la+0.017
ay+0.017
 g+0.017
Neg logit contributions
keeping-0.011
ooks-0.011
 traps-0.011
orns-0.011
 dam-0.011
Token.
Token ID13
Feature activation+2.56e-02

Pos logit contributions
 dam+0.825
ooks+0.823
 traps+0.819
orns+0.819
 conflicts+0.808
Neg logit contributions
ifying-0.496
 yes-0.467
 �-0.448
 tr-0.447
 la-0.446
Token The
Token ID383
Feature activation+2.77e-02

Pos logit contributions
ifying+0.033
 la+0.029
 �+0.029
 g+0.029
 Eric+0.029
Neg logit contributions
keeping-0.049
orns-0.048
 dam-0.047
igi-0.047
inger-0.046
Token vet
Token ID20202
Feature activation-4.43e-01

Pos logit contributions
 tr+0.055
 yes+0.055
ifying+0.055
 trumpet+0.054
 guitar+0.054
Neg logit contributions
keeping-0.030
ooks-0.025
alk-0.022
orns-0.022
 conflicts-0.022
Token was
Token ID373
Feature activation-4.53e-02

Pos logit contributions
keeping+0.830
orns+0.812
 dam+0.806
 traps+0.805
ooks+0.800
Neg logit contributions
 yes-0.539
 tr-0.506
ifying-0.493
 la-0.481
 Ryan-0.479
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
keeping+0.083
orns+0.078
 dam+0.074
ooks+0.074
 conflicts+0.074
Neg logit contributions
ifying-0.060
 �-0.053
 yes-0.053
 ding-0.053
 g-0.053
Token and
Token ID290
Feature activation-3.20e-02

Pos logit contributions
 traps+0.157
keeping+0.155
 dam+0.154
 vet+0.154
 conflicts+0.147
Neg logit contributions
ifying-0.168
ay-0.151
 tr-0.149
ksh-0.147
ahoo-0.147
Token carried
Token ID5281
Feature activation-1.15e-01

Pos logit contributions
ooks+0.064
inger+0.062
igi+0.061
keeping+0.060
orns+0.060
Neg logit contributions
 tr-0.038
 cl-0.036
 g-0.035
 str-0.035
 laughed-0.033
Token it
Token ID340
Feature activation-6.79e-02

Pos logit contributions
 traps+0.147
keeping+0.147
 vet+0.146
 dam+0.144
ooks+0.136
Neg logit contributions
ifying-0.210
 la-0.189
ksh-0.189
ay-0.188
ahoo-0.187
Token to
Token ID284
Feature activation+2.16e-03

Pos logit contributions
 vet+0.120
 bidder+0.117
keeping+0.115
 dam+0.115
ooks+0.114
Neg logit contributions
ifying-0.084
 tr-0.081
 g-0.078
ay-0.077
 yes-0.074
Token the
Token ID262
Feature activation+2.65e-03

Pos logit contributions
 g+0.002
ifying+0.002
 Ryan+0.002
 tr+0.002
 la+0.002
Neg logit contributions
keeping-0.004
igi-0.004
orns-0.004
ooks-0.004
inger-0.004
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ifying+0.005
 tr+0.004
 g+0.004
ay+0.004
 la+0.004
Neg logit contributions
ooks-0.004
keeping-0.004
 traps-0.003
orns-0.003
 dam-0.003
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
 traps+0.789
 dam+0.786
 vet+0.784
orns+0.782
igi+0.776
Neg logit contributions
ifying-0.541
 �-0.482
 tr-0.478
 g-0.465
 yes-0.464
Token The
Token ID383
Feature activation+2.14e-02

Pos logit contributions
ifying+0.026
ay+0.023
ksh+0.023
 la+0.022
 Ryan+0.022
Neg logit contributions
keeping-0.035
 traps-0.035
 dam-0.035
 vet-0.034
igi-0.034
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
 yes+0.032
 trumpet+0.032
 tr+0.031
 g+0.031
ifying+0.030
Neg logit contributions
keeping-0.035
ooks-0.032
orns-0.028
 conflicts-0.028
igi-0.028
Token was
Token ID373
Feature activation-5.02e-02

Pos logit contributions
keeping+0.781
 traps+0.748
orns+0.747
 dam+0.735
ooks+0.733
Neg logit contributions
 yes-0.556
 tr-0.551
ifying-0.547
 Ryan-0.531
 �-0.530
Token able
Token ID1498
Feature activation-2.39e-01

Pos logit contributions
keeping+0.095
orns+0.093
 traps+0.091
 conflicts+0.090
 dam+0.089
Neg logit contributions
ifying-0.059
 g-0.055
 yes-0.055
 �-0.054
 ding-0.052
Token news
Token ID1705
Feature activation+1.40e-02

Pos logit contributions
keeping+0.072
ooks+0.069
 dam+0.069
igi+0.068
 traps+0.068
Neg logit contributions
ifying-0.089
 g-0.086
 tr-0.085
ay-0.083
 la-0.083
Token.
Token ID13
Feature activation+9.87e-03

Pos logit contributions
ifying+0.026
 la+0.023
ksh+0.023
ahoo+0.023
ay+0.023
Neg logit contributions
 traps-0.018
 vet-0.018
keeping-0.018
 dam-0.018
 hospital-0.017
Token They
Token ID1119
Feature activation-9.71e-02

Pos logit contributions
ifying+0.016
ay+0.014
 la+0.014
ksh+0.014
ahoo+0.014
Neg logit contributions
 traps-0.014
 vet-0.014
keeping-0.014
 dam-0.014
ooks-0.014
Token thanked
Token ID26280
Feature activation-4.11e-02

Pos logit contributions
 vet+0.146
 traps+0.146
keeping+0.145
 dam+0.145
ooks+0.137
Neg logit contributions
ifying-0.152
ay-0.136
ksh-0.135
 la-0.135
ahoo-0.133
Token the
Token ID262
Feature activation+3.06e-04

Pos logit contributions
keeping+0.062
 traps+0.060
 jail+0.060
 vet+0.060
 hospital+0.060
Neg logit contributions
ifying-0.070
ksh-0.064
opus-0.056
 la-0.056
ahoo-0.056
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ifying+0.001
ksh+0.001
ay+0.001
 la+0.001
ahoo+0.001
Neg logit contributions
 vet-0.000
 traps-0.000
 dam-0.000
keeping-0.000
ooks-0.000
Token and
Token ID290
Feature activation-3.81e-02

Pos logit contributions
 vet+0.600
 traps+0.596
keeping+0.592
 dam+0.583
ooks+0.551
Neg logit contributions
ifying-0.772
ay-0.685
 la-0.682
ksh-0.681
ahoo-0.680
Token took
Token ID1718
Feature activation+7.69e-03

Pos logit contributions
 dam+0.060
 traps+0.059
keeping+0.059
 vet+0.059
ooks+0.058
Neg logit contributions
ifying-0.057
 tr-0.052
 g-0.050
 Ryan-0.050
 la-0.050
Token Matt
Token ID4705
Feature activation+1.88e-01

Pos logit contributions
ifying+0.011
ksh+0.010
 la+0.010
ahoo+0.009
 tr+0.009
Neg logit contributions
 traps-0.013
keeping-0.013
 vet-0.012
 dam-0.012
ooks-0.012
Token home
Token ID1363
Feature activation-2.57e-01

Pos logit contributions
ifying+0.294
 la+0.262
ay+0.262
ksh+0.261
ahoo+0.259
Neg logit contributions
 traps-0.283
 vet-0.282
keeping-0.281
 dam-0.276
ooks-0.263
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
 traps+0.385
 vet+0.383
 dam+0.382
keeping+0.380
orns+0.364
Neg logit contributions
ifying-0.407
ay-0.361
 tr-0.358
ksh-0.358
 la-0.357
Token to
Token ID284
Feature activation+2.16e-03

Pos logit contributions
 vet+0.120
 bidder+0.117
keeping+0.115
 dam+0.115
ooks+0.114
Neg logit contributions
ifying-0.084
 tr-0.081
 g-0.078
ay-0.077
 yes-0.074
Token the
Token ID262
Feature activation+2.65e-03

Pos logit contributions
 g+0.002
ifying+0.002
 Ryan+0.002
 tr+0.002
 la+0.002
Neg logit contributions
keeping-0.004
igi-0.004
orns-0.004
ooks-0.004
inger-0.004
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ifying+0.005
 tr+0.004
 g+0.004
ay+0.004
 la+0.004
Neg logit contributions
ooks-0.004
keeping-0.004
 traps-0.003
orns-0.003
 dam-0.003
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
 traps+0.789
 dam+0.786
 vet+0.784
orns+0.782
igi+0.776
Neg logit contributions
ifying-0.541
 �-0.482
 tr-0.478
 g-0.465
 yes-0.464
Token The
Token ID383
Feature activation+2.14e-02

Pos logit contributions
ifying+0.026
ay+0.023
ksh+0.023
 la+0.022
 Ryan+0.022
Neg logit contributions
keeping-0.035
 traps-0.035
 dam-0.035
 vet-0.034
igi-0.034
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
 yes+0.032
 trumpet+0.032
 tr+0.031
 g+0.031
ifying+0.030
Neg logit contributions
keeping-0.035
ooks-0.032
orns-0.028
 conflicts-0.028
igi-0.028
Token was
Token ID373
Feature activation-5.02e-02

Pos logit contributions
keeping+0.781
 traps+0.748
orns+0.747
 dam+0.735
ooks+0.733
Neg logit contributions
 yes-0.556
 tr-0.551
ifying-0.547
 Ryan-0.531
 �-0.530
Token able
Token ID1498
Feature activation-2.39e-01

Pos logit contributions
keeping+0.095
orns+0.093
 traps+0.091
 conflicts+0.090
 dam+0.089
Neg logit contributions
ifying-0.059
 g-0.055
 yes-0.055
 �-0.054
 ding-0.052
Token to
Token ID284
Feature activation+3.68e-03

Pos logit contributions
 vet+0.386
keeping+0.385
 dam+0.383
 traps+0.381
ooks+0.369
Neg logit contributions
ifying-0.345
 tr-0.304
ay-0.303
 la-0.301
ahoo-0.299
Token fix
Token ID4259
Feature activation-9.30e-02

Pos logit contributions
 g+0.005
ifying+0.004
 yes+0.004
 tr+0.004
 cl+0.004
Neg logit contributions
ooks-0.007
keeping-0.007
 conflicts-0.006
orns-0.006
 traps-0.006
Token the
Token ID262
Feature activation+5.69e-03

Pos logit contributions
keeping+0.159
 vet+0.157
 dam+0.157
ooks+0.154
igi+0.153
Neg logit contributions
ifying-0.125
ay-0.114
 tr-0.112
 Ryan-0.112
 g-0.110
Token take
Token ID1011
Feature activation+4.87e-02

Pos logit contributions
ifying+0.119
ksh+0.105
 la+0.105
ay+0.105
ahoo+0.103
Neg logit contributions
 vet-0.118
 traps-0.117
 dam-0.117
keeping-0.117
ooks-0.111
Token Mitt
Token ID16627
Feature activation-1.15e-01

Pos logit contributions
ifying+0.074
ay+0.069
 g+0.068
 la+0.067
 Ryan+0.067
Neg logit contributions
 dam-0.075
ooks-0.075
keeping-0.073
 vet-0.073
 bidder-0.072
Tokenens
Token ID641
Feature activation-8.25e-02

Pos logit contributions
 vet+0.144
 traps+0.144
keeping+0.142
 dam+0.140
orns+0.135
Neg logit contributions
ifying-0.213
ay-0.190
 la-0.189
ksh-0.188
ahoo-0.188
Token to
Token ID284
Feature activation+2.60e-03

Pos logit contributions
 dam+0.131
 vet+0.131
keeping+0.131
 traps+0.129
ooks+0.128
Neg logit contributions
ifying-0.122
 la-0.107
ay-0.107
 tr-0.106
ksh-0.105
Token the
Token ID262
Feature activation-2.54e-03

Pos logit contributions
 g+0.003
ifying+0.003
 Ryan+0.003
 �+0.003
 tr+0.003
Neg logit contributions
orns-0.005
ooks-0.005
inger-0.005
keeping-0.005
igi-0.005
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
keeping+0.002
 traps+0.002
 dam+0.002
ooks+0.002
 vet+0.002
Neg logit contributions
ifying-0.006
 tr-0.006
 la-0.006
ksh-0.005
ay-0.005
Token.
Token ID13
Feature activation+8.25e-03

Pos logit contributions
 traps+0.722
 dam+0.721
ooks+0.715
orns+0.709
keeping+0.708
Neg logit contributions
ifying-0.619
 la-0.565
 tr-0.546
 yes-0.545
ksh-0.543
Token The
Token ID383
Feature activation+1.88e-02

Pos logit contributions
ifying+0.012
ksh+0.011
 la+0.011
ay+0.010
 �+0.010
Neg logit contributions
keeping-0.014
 traps-0.014
 dam-0.014
 conflicts-0.013
orns-0.013
Token vet
Token ID20202
Feature activation-4.41e-01

Pos logit contributions
ifying+0.041
 tr+0.041
 �+0.040
 yes+0.040
 wo+0.039
Neg logit contributions
keeping-0.016
ooks-0.015
 conflicts-0.012
alk-0.012
ubs-0.012
Token can
Token ID460
Feature activation+7.49e-02

Pos logit contributions
keeping+0.792
 dam+0.753
 traps+0.748
ooks+0.747
orns+0.729
Neg logit contributions
 la-0.556
ifying-0.555
 yes-0.536
ay-0.536
 tr-0.530
Token manage
Token ID6687
Feature activation-1.10e-01

Pos logit contributions
ifying+0.103
 g+0.094
 tr+0.092
ay+0.092
 la+0.091
Neg logit contributions
keeping-0.129
ooks-0.126
 traps-0.125
 dam-0.124
 conflicts-0.122
Token take
Token ID1011
Feature activation+4.69e-02

Pos logit contributions
ifying+0.009
 tr+0.008
 g+0.008
ay+0.008
 la+0.008
Neg logit contributions
 traps-0.012
keeping-0.012
 dam-0.012
 vet-0.012
ooks-0.012
Token the
Token ID262
Feature activation+3.41e-03

Pos logit contributions
ifying+0.071
 la+0.063
ksh+0.063
ay+0.063
ahoo+0.062
Neg logit contributions
 traps-0.074
 vet-0.073
keeping-0.073
 dam-0.072
orns-0.069
Token bird
Token ID6512
Feature activation-7.33e-02

Pos logit contributions
ifying+0.006
ksh+0.005
very+0.005
ahoo+0.005
ay+0.005
Neg logit contributions
 vet-0.005
 traps-0.005
 dam-0.005
 hospital-0.005
 jail-0.005
Token to
Token ID284
Feature activation+4.00e-03

Pos logit contributions
ooks+0.128
 bidder+0.120
keeping+0.119
 dam+0.118
igi+0.118
Neg logit contributions
ifying-0.096
 tr-0.088
ay-0.086
 la-0.085
 g-0.085
Token the
Token ID262
Feature activation+2.99e-03

Pos logit contributions
ifying+0.005
 g+0.005
 Ryan+0.005
 la+0.005
 tr+0.005
Neg logit contributions
keeping-0.007
orns-0.007
ooks-0.007
 dam-0.007
 traps-0.007
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ifying+0.006
ay+0.005
 la+0.005
ksh+0.005
ahoo+0.005
Neg logit contributions
 traps-0.004
 vet-0.003
keeping-0.003
 dam-0.003
ooks-0.003
Token and
Token ID290
Feature activation-2.69e-02

Pos logit contributions
 dam+0.758
 traps+0.756
ooks+0.751
keeping+0.747
orns+0.745
Neg logit contributions
ifying-0.573
 la-0.515
 yes-0.514
 tr-0.510
 �-0.510
Token help
Token ID1037
Feature activation+2.75e-02

Pos logit contributions
ooks+0.052
orns+0.051
keeping+0.048
igi+0.048
 conflicts+0.048
Neg logit contributions
ifying-0.032
 g-0.031
 yes-0.031
 tr-0.030
 Ryan-0.029
Token it
Token ID340
Feature activation-6.71e-02

Pos logit contributions
ifying+0.031
ay+0.027
 la+0.027
 tr+0.026
ksh+0.026
Neg logit contributions
keeping-0.053
 dam-0.053
 traps-0.052
 vet-0.052
ooks-0.051
Token get
Token ID651
Feature activation-1.09e-01

Pos logit contributions
ooks+0.122
keeping+0.120
igi+0.119
orns+0.118
inger+0.111
Neg logit contributions
 g-0.082
 tr-0.078
ifying-0.077
 �-0.076
 yes-0.074
Token better
Token ID1365
Feature activation-1.87e-01

Pos logit contributions
ooks+0.167
keeping+0.166
 bidder+0.164
 dam+0.163
 vet+0.162
Neg logit contributions
ifying-0.169
 g-0.150
ksh-0.148
 tr-0.145
ay-0.143
Token owner
Token ID4870
Feature activation-1.84e-01

Pos logit contributions
ifying+0.144
ksh+0.140
very+0.136
 la+0.132
ahoo+0.129
Neg logit contributions
 vet-0.076
 dam-0.063
 hole-0.063
 traps-0.062
 nest-0.061
Token took
Token ID1718
Feature activation+2.13e-02

Pos logit contributions
keeping+0.339
 dam+0.315
 traps+0.314
ooks+0.310
 conflicts+0.310
Neg logit contributions
 tr-0.228
 la-0.226
 Ryan-0.222
ifying-0.221
 wo-0.216
Token him
Token ID683
Feature activation+2.41e-01

Pos logit contributions
ifying+0.022
 la+0.018
ksh+0.018
ay+0.018
ahoo+0.018
Neg logit contributions
 traps-0.045
 vet-0.044
keeping-0.044
 dam-0.044
orns-0.042
Token to
Token ID284
Feature activation+6.85e-03

Pos logit contributions
ifying+0.347
ay+0.304
 tr+0.302
 la+0.297
ahoo+0.294
Neg logit contributions
 dam-0.393
 vet-0.392
keeping-0.390
 traps-0.386
ooks-0.384
Token the
Token ID262
Feature activation+6.09e-03

Pos logit contributions
ifying+0.007
 Ryan+0.007
 g+0.007
 tr+0.007
 la+0.007
Neg logit contributions
ooks-0.013
igi-0.013
keeping-0.013
orns-0.013
inger-0.013
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ifying+0.011
 la+0.010
ay+0.010
ksh+0.010
ahoo+0.010
Neg logit contributions
 vet-0.007
 traps-0.007
 dam-0.007
keeping-0.007
ooks-0.007
Token for
Token ID329
Feature activation+2.94e-02

Pos logit contributions
 traps+0.755
 dam+0.747
keeping+0.740
ooks+0.734
 conflicts+0.733
Neg logit contributions
ifying-0.587
 tr-0.521
 �-0.514
 la-0.514
ksh-0.510
Token an
Token ID281
Feature activation-6.16e-02

Pos logit contributions
ifying+0.037
 g+0.033
 Ryan+0.033
 tr+0.032
ay+0.032
Neg logit contributions
ooks-0.052
 dam-0.050
keeping-0.050
igi-0.049
inger-0.049
Token operation
Token ID4905
Feature activation-1.46e-01

Pos logit contributions
keeping+0.092
ooks+0.086
 traps+0.083
orns+0.082
 conflicts+0.082
Neg logit contributions
ay-0.092
 tr-0.091
ahoo-0.091
ifying-0.090
 g-0.089
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
 traps+0.251
 dam+0.249
ooks+0.249
 vet+0.245
keeping+0.241
Neg logit contributions
ifying-0.189
 tr-0.173
ay-0.169
 la-0.166
 �-0.164
Token Max
Token ID5436
Feature activation+1.23e-01

Pos logit contributions
ifying+0.024
ay+0.021
 la+0.021
ksh+0.021
 tr+0.020
Neg logit contributions
keeping-0.039
 traps-0.038
 dam-0.038
 vet-0.037
ooks-0.037
Token,
Token ID11
Feature activation-2.31e-02

Pos logit contributions
ifying+0.048
ay+0.043
ksh+0.042
 la+0.042
ahoo+0.042
Neg logit contributions
 vet-0.043
 traps-0.043
keeping-0.043
 dam-0.042
orns-0.040
Token took
Token ID1718
Feature activation+1.22e-02

Pos logit contributions
 dam+0.035
keeping+0.035
 traps+0.035
 vet+0.034
ooks+0.033
Neg logit contributions
ifying-0.036
 la-0.033
 tr-0.032
ay-0.032
 Ryan-0.032
Token him
Token ID683
Feature activation+2.43e-01

Pos logit contributions
ifying+0.018
ay+0.016
 la+0.016
 tr+0.015
 g+0.015
Neg logit contributions
 dam-0.020
 vet-0.020
 traps-0.020
keeping-0.020
ooks-0.019
Token to
Token ID284
Feature activation+4.43e-03

Pos logit contributions
ifying+0.372
ay+0.329
 tr+0.329
 la+0.319
 g+0.318
Neg logit contributions
 dam-0.374
 vet-0.373
ooks-0.369
 bidder-0.366
keeping-0.364
Token the
Token ID262
Feature activation+1.07e-04

Pos logit contributions
ifying+0.006
 g+0.005
 la+0.005
 Ryan+0.005
 tr+0.005
Neg logit contributions
 dam-0.007
 traps-0.007
keeping-0.007
orns-0.007
ooks-0.007
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ifying+0.000
ay+0.000
ksh+0.000
 la+0.000
ahoo+0.000
Neg logit contributions
 vet-0.000
 traps-0.000
 dam-0.000
keeping-0.000
ooks-0.000
Token.
Token ID13
Feature activation+6.78e-03

Pos logit contributions
 traps+0.759
 dam+0.754
ooks+0.746
orns+0.743
keeping+0.742
Neg logit contributions
ifying-0.579
 la-0.510
 tr-0.510
 yes-0.502
 �-0.501
Token The
Token ID383
Feature activation+1.58e-02

Pos logit contributions
ifying+0.009
ay+0.008
 la+0.008
ksh+0.008
 Ryan+0.008
Neg logit contributions
 traps-0.012
 dam-0.012
keeping-0.012
 vet-0.011
orns-0.011
Token vet
Token ID20202
Feature activation-4.41e-01

Pos logit contributions
ifying+0.037
 tr+0.035
 �+0.034
 g+0.034
ay+0.034
Neg logit contributions
keeping-0.012
ooks-0.011
orns-0.010
 conflicts-0.010
igi-0.009
Token said
Token ID531
Feature activation-1.63e-02

Pos logit contributions
keeping+0.771
 traps+0.753
orns+0.750
 dam+0.749
ooks+0.736
Neg logit contributions
ifying-0.561
 yes-0.553
 tr-0.536
 Ryan-0.532
 la-0.532
Token that
Token ID326
Feature activation+2.44e-02

Pos logit contributions
 traps+0.023
 vet+0.023
keeping+0.023
 dam+0.023
 killed+0.022
Neg logit contributions
ifying-0.027
ksh-0.025
ay-0.024
ahoo-0.024
 la-0.024
Token who
Token ID508
Feature activation+6.80e-02

Pos logit contributions
 dam+0.308
 traps+0.306
keeping+0.306
 vet+0.302
ooks+0.299
Neg logit contributions
ifying-0.237
 la-0.208
ay-0.204
 tr-0.203
ksh-0.199
Token helped
Token ID4193
Feature activation-2.60e-02

Pos logit contributions
ifying+0.098
 la+0.087
 tr+0.086
 g+0.086
ay+0.085
Neg logit contributions
 dam-0.110
keeping-0.110
ooks-0.108
 traps-0.107
 vet-0.107
Token animals
Token ID4695
Feature activation-1.46e-01

Pos logit contributions
 dam+0.043
keeping+0.042
 traps+0.041
 vet+0.041
orns+0.040
Neg logit contributions
ifying-0.036
 la-0.033
 g-0.033
ay-0.033
 tr-0.032
Token.
Token ID13
Feature activation+2.36e-02

Pos logit contributions
keeping+0.248
ooks+0.246
 vet+0.242
 dam+0.242
 traps+0.240
Neg logit contributions
ifying-0.195
 g-0.171
 tr-0.171
 la-0.166
 yes-0.164
Token The
Token ID383
Feature activation+2.70e-02

Pos logit contributions
ifying+0.031
ksh+0.026
 la+0.026
 �+0.026
 Eric+0.026
Neg logit contributions
keeping-0.046
 dam-0.044
inger-0.044
igi-0.044
orns-0.044
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ifying+0.056
 tr+0.055
 yes+0.054
 trumpet+0.053
 la+0.053
Neg logit contributions
keeping-0.027
ooks-0.021
orns-0.021
igi-0.020
 conflicts-0.019
Token looked
Token ID3114
Feature activation+9.28e-02

Pos logit contributions
keeping+0.808
orns+0.779
 traps+0.774
 dam+0.772
ooks+0.769
Neg logit contributions
 yes-0.543
 tr-0.521
ifying-0.517
 la-0.505
 Ryan-0.499
Token at
Token ID379
Feature activation+1.79e-01

Pos logit contributions
ifying+0.126
ahoo+0.111
ay+0.111
 la+0.111
ksh+0.110
Neg logit contributions
 traps-0.160
 vet-0.157
 dam-0.154
keeping-0.151
 killed-0.149
Token Luna
Token ID23694
Feature activation-1.63e-01

Pos logit contributions
ifying+0.275
 la+0.244
ay+0.243
ksh+0.241
 tr+0.239
Neg logit contributions
 dam-0.277
keeping-0.276
 traps-0.275
 vet-0.275
ooks-0.264
Token's
Token ID338
Feature activation+5.38e-02

Pos logit contributions
keeping+0.238
 vet+0.237
 traps+0.237
 dam+0.236
ooks+0.228
Neg logit contributions
ifying-0.263
 la-0.235
ay-0.233
 tr-0.233
ksh-0.233
Token cut
Token ID2005
Feature activation-1.82e-02

Pos logit contributions
ifying+0.091
 tr+0.084
 g+0.082
ay+0.080
 la+0.080
Neg logit contributions
ooks-0.071
orns-0.070
igi-0.069
keeping-0.069
 dam-0.068
Token
Token ID198
Feature activation-4.12e-02

Pos logit contributions
 vet+0.058
keeping+0.057
 traps+0.055
 dam+0.055
ooks+0.054
Neg logit contributions
ifying-0.073
ksh-0.064
 la-0.063
 tr-0.063
ahoo-0.063
TokenThe
Token ID464
Feature activation+4.53e-02

Pos logit contributions
 vet+0.065
keeping+0.061
 dam+0.060
ooks+0.060
 traps+0.060
Neg logit contributions
ifying-0.065
ksh-0.058
 tr-0.057
 la-0.056
ay-0.056
Token end
Token ID886
Feature activation+1.58e-02

Pos logit contributions
ifying+0.084
ksh+0.081
ahoo+0.076
ay+0.074
 la+0.073
Neg logit contributions
 vet-0.064
 traps-0.058
 hospital-0.058
 dam-0.056
 hole-0.054
Token<|endoftext|>
Token ID50256
Feature activation+3.49e-02

Pos logit contributions
ifying+0.031
ksh+0.028
 la+0.028
 trump+0.027
ahoo+0.027
Neg logit contributions
 vet-0.018
 traps-0.018
 shelter-0.018
 hole-0.017
 jail-0.017
TokenThe
Token ID464
Feature activation+4.59e-02

Pos logit contributions
ifying+0.061
ksh+0.055
 la+0.055
ahoo+0.054
 tr+0.054
Neg logit contributions
keeping-0.046
 vet-0.046
 traps-0.046
 dam-0.045
orns-0.044
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ksh+0.091
ifying+0.088
rav+0.081
ahoo+0.079
Nice+0.077
Neg logit contributions
 vet-0.059
 hole-0.054
 nest-0.054
 shelter-0.054
 hospital-0.054
Token was
Token ID373
Feature activation-5.74e-02

Pos logit contributions
 dam+0.714
 traps+0.711
keeping+0.710
ooks+0.694
 vet+0.686
Neg logit contributions
ifying-0.612
 la-0.587
 Ryan-0.573
very-0.566
 tr-0.564
Token a
Token ID257
Feature activation-2.04e-02

Pos logit contributions
keeping+0.099
 traps+0.096
 dam+0.092
orns+0.092
 vet+0.091
Neg logit contributions
ifying-0.080
 la-0.071
ay-0.071
very-0.071
ahoo-0.070
Token nice
Token ID3621
Feature activation+1.17e-01

Pos logit contributions
 vet+0.031
 dam+0.028
 hospital+0.028
 hole+0.028
 jail+0.027
Neg logit contributions
ifying-0.035
ahoo-0.032
ay-0.032
ksh-0.032
 Eric-0.031
Token lady
Token ID10846
Feature activation-1.42e-01

Pos logit contributions
ifying+0.184
ay+0.166
 la+0.165
 tr+0.163
ksh+0.163
Neg logit contributions
 traps-0.174
keeping-0.173
 dam-0.171
 vet-0.171
ooks-0.164
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 traps+0.224
 vet+0.223
keeping+0.223
 dam+0.222
ooks+0.212
Neg logit contributions
ifying-0.213
ay-0.188
 la-0.187
ksh-0.186
ahoo-0.185
Token took
Token ID1718
Feature activation+1.10e-02

Pos logit contributions
ifying+0.028
 la+0.026
ay+0.024
ksh+0.024
ahoo+0.024
Neg logit contributions
 vet-0.023
 dam-0.022
keeping-0.022
 traps-0.021
house-0.021
Token Mitt
Token ID16627
Feature activation-1.16e-01

Pos logit contributions
ifying+0.017
ay+0.016
ksh+0.015
 g+0.015
 la+0.015
Neg logit contributions
 dam-0.016
keeping-0.016
 traps-0.016
 vet-0.016
ooks-0.016
Tokenens
Token ID641
Feature activation-8.18e-02

Pos logit contributions
 vet+0.130
 traps+0.124
keeping+0.121
orns+0.120
 dam+0.116
Neg logit contributions
ifying-0.244
 la-0.213
ahoo-0.212
ay-0.212
 tr-0.211
Token to
Token ID284
Feature activation+3.43e-03

Pos logit contributions
 dam+0.134
 vet+0.133
keeping+0.133
 traps+0.131
ooks+0.131
Neg logit contributions
ifying-0.115
ay-0.102
 tr-0.101
ksh-0.099
 la-0.098
Token the
Token ID262
Feature activation+2.15e-03

Pos logit contributions
 g+0.003
 �+0.003
 Ryan+0.003
ifying+0.003
 tr+0.003
Neg logit contributions
orns-0.007
ooks-0.007
igi-0.007
keeping-0.007
inger-0.007
Token vet
Token ID20202
Feature activation-4.42e-01

Pos logit contributions
ifying+0.005
 tr+0.005
 �+0.004
 ding+0.004
 motorcycle+0.004
Neg logit contributions
keeping-0.002
ooks-0.002
orns-0.002
igi-0.002
alk-0.002
Token and
Token ID290
Feature activation-4.23e-02

Pos logit contributions
orns+0.776
 traps+0.773
 dam+0.771
 conflicts+0.770
igi+0.767
Neg logit contributions
ifying-0.536
 �-0.508
 yes-0.491
 tr-0.482
ksh-0.472
Token the
Token ID262
Feature activation+3.02e-03

Pos logit contributions
ooks+0.078
orns+0.077
inger+0.075
keeping+0.074
 conflicts+0.074
Neg logit contributions
ifying-0.053
 yes-0.050
 tr-0.050
 Ryan-0.050
 g-0.049
Token vet
Token ID20202
Feature activation-4.41e-01

Pos logit contributions
ifying+0.007
 tr+0.007
 yes+0.007
 �+0.007
 wo+0.006
Neg logit contributions
keeping-0.002
ooks-0.002
alk-0.002
 conflicts-0.002
orns-0.002
Token gave
Token ID2921
Feature activation-7.53e-02

Pos logit contributions
keeping+0.718
 traps+0.711
 dam+0.705
orns+0.702
ooks+0.701
Neg logit contributions
ifying-0.616
 yes-0.573
 tr-0.565
 g-0.557
 �-0.554
Token Mitt
Token ID16627
Feature activation-1.16e-01

Pos logit contributions
 dam+0.137
 traps+0.136
keeping+0.136
 vet+0.135
ooks+0.133
Neg logit contributions
ifying-0.094
ay-0.082
ksh-0.081
 la-0.079
 g-0.079
Token the
Token ID262
Feature activation+1.77e-02

Pos logit contributions
ifying+0.061
ksh+0.055
ay+0.054
 la+0.054
ahoo+0.054
Neg logit contributions
 traps-0.049
 vet-0.048
keeping-0.047
 dam-0.046
 conflicts-0.045
Token park
Token ID3952
Feature activation-3.19e-02

Pos logit contributions
ifying+0.029
ay+0.027
 la+0.027
 tr+0.027
 g+0.026
Neg logit contributions
keeping-0.025
 traps-0.025
 dam-0.024
ooks-0.024
 vet-0.024
Token allowed
Token ID3142
Feature activation-1.54e-01

Pos logit contributions
 vet+0.044
 traps+0.044
keeping+0.044
 dam+0.043
 killed+0.041
Neg logit contributions
ifying-0.055
 la-0.049
ksh-0.049
ay-0.048
ahoo-0.048
Token him
Token ID683
Feature activation+2.52e-01

Pos logit contributions
 vet+0.269
keeping+0.269
 traps+0.265
 dam+0.253
 hospital+0.251
Neg logit contributions
ifying-0.210
ksh-0.186
 la-0.184
ahoo-0.181
 tr-0.175
Token to
Token ID284
Feature activation+1.97e-02

Pos logit contributions
ifying+0.324
 tr+0.290
ay+0.281
 la+0.279
ahoo+0.277
Neg logit contributions
 dam-0.452
 vet-0.444
 traps-0.441
ooks-0.436
 bidder-0.435
Token watch
Token ID2342
Feature activation+2.37e-01

Pos logit contributions
ifying+0.024
ay+0.021
 tr+0.021
 la+0.021
 g+0.021
Neg logit contributions
 traps-0.036
keeping-0.036
 vet-0.035
 dam-0.035
ooks-0.035
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
ifying+0.437
ksh+0.399
ahoo+0.390
ay+0.380
very+0.377
Neg logit contributions
 traps-0.308
 vet-0.296
 dam-0.286
keeping-0.282
 hospital-0.280
Token graceful
Token ID44363
Feature activation+7.34e-02

Pos logit contributions
ifying+0.036
ksh+0.032
 Ryan+0.031
ahoo+0.031
very+0.031
Neg logit contributions
 vet-0.027
 traps-0.026
 dam-0.025
 nest-0.024
 hole-0.024
Token bird
Token ID6512
Feature activation-6.11e-02

Pos logit contributions
ifying+0.133
ksh+0.119
ay+0.118
 la+0.117
ahoo+0.117
Neg logit contributions
 vet-0.096
 traps-0.095
 dam-0.095
keeping-0.094
 bidder-0.088
Token while
Token ID981
Feature activation+2.80e-02

Pos logit contributions
keeping+0.092
 vet+0.092
 traps+0.092
 dam+0.091
ooks+0.088
Neg logit contributions
ifying-0.095
 la-0.085
ay-0.084
ksh-0.084
 tr-0.084
Token he
Token ID339
Feature activation+1.21e-02

Pos logit contributions
ifying+0.048
ksh+0.045
ay+0.044
ahoo+0.044
 la+0.043
Neg logit contributions
 traps-0.039
keeping-0.038
 vet-0.038
 dam-0.037
 killed-0.036
Token silly
Token ID14397
Feature activation+1.66e-01

Pos logit contributions
ifying+0.102
ay+0.090
 la+0.090
ksh+0.089
ahoo+0.089
Neg logit contributions
 traps-0.098
 vet-0.097
 dam-0.097
keeping-0.097
ooks-0.093
Token,
Token ID11
Feature activation-7.99e-03

Pos logit contributions
ifying+0.294
 la+0.261
ksh+0.261
ay+0.259
ahoo+0.256
Neg logit contributions
keeping-0.223
 traps-0.223
 vet-0.218
 dam-0.218
ooks-0.205
Token running
Token ID2491
Feature activation+9.96e-02

Pos logit contributions
 traps+0.011
 vet+0.011
 dam+0.010
keeping+0.010
 hospital+0.010
Neg logit contributions
ifying-0.014
ay-0.013
ksh-0.013
ahoo-0.013
 la-0.012
Token around
Token ID1088
Feature activation+1.03e-01

Pos logit contributions
ifying+0.112
 tr+0.107
 g+0.100
ay+0.099
 la+0.097
Neg logit contributions
 vet-0.195
igi-0.187
 bidder-0.186
ooks-0.185
 antidote-0.185
Token and
Token ID290
Feature activation-2.05e-02

Pos logit contributions
ifying+0.221
ksh+0.203
ay+0.200
ahoo+0.198
 la+0.195
Neg logit contributions
 traps-0.116
 dam-0.113
 hole-0.105
keeping-0.102
 vet-0.098
Token laughing
Token ID14376
Feature activation+2.25e-01

Pos logit contributions
 traps+0.031
 dam+0.030
keeping+0.030
 vet+0.029
 killed+0.028
Neg logit contributions
ifying-0.034
ay-0.032
ksh-0.030
ahoo-0.029
 la-0.028
Token.
Token ID13
Feature activation+3.01e-02

Pos logit contributions
ifying+0.462
ahoo+0.417
 la+0.414
very+0.412
ay+0.407
Neg logit contributions
 traps-0.254
 dam-0.230
keeping-0.230
 hole-0.217
 shelter-0.215
Token Suddenly
Token ID24975
Feature activation-5.70e-02

Pos logit contributions
ifying+0.061
 la+0.055
ahoo+0.054
ksh+0.053
ay+0.053
Neg logit contributions
 vet-0.033
 dam-0.033
ooks-0.033
 traps-0.030
 hole-0.029
Token,
Token ID11
Feature activation-8.00e-03

Pos logit contributions
keeping+0.087
 vet+0.086
 traps+0.085
 dam+0.085
orns+0.081
Neg logit contributions
ifying-0.089
ay-0.080
 la-0.080
 tr-0.079
ahoo-0.078
Token one
Token ID530
Feature activation-7.10e-02

Pos logit contributions
 traps+0.010
 vet+0.010
 dam+0.010
 killed+0.009
keeping+0.009
Neg logit contributions
ifying-0.015
ksh-0.014
ay-0.014
ahoo-0.014
 la-0.013
Token of
Token ID286
Feature activation-5.35e-02

Pos logit contributions
 vet+0.075
 traps+0.072
 dam+0.070
 hole+0.067
 hospital+0.066
Neg logit contributions
ifying-0.151
ksh-0.143
ahoo-0.137
 la-0.131
 Ryan-0.130
Token Mom
Token ID11254
Feature activation+1.44e-01

Pos logit contributions
 traps+0.155
 vet+0.154
 dam+0.151
keeping+0.150
ooks+0.144
Neg logit contributions
ifying-0.175
ksh-0.156
ay-0.156
ahoo-0.156
 la-0.155
Tokenmy
Token ID1820
Feature activation+2.64e-02

Pos logit contributions
ifying+0.208
 la+0.186
ay+0.186
ksh+0.184
 tr+0.182
Neg logit contributions
 dam-0.234
 traps-0.233
keeping-0.233
 vet-0.230
ooks-0.225
Token has
Token ID468
Feature activation-4.58e-02

Pos logit contributions
ifying+0.040
 la+0.036
ay+0.036
 tr+0.036
ksh+0.036
Neg logit contributions
 dam-0.041
keeping-0.041
 traps-0.040
 vet-0.040
ooks-0.040
Token some
Token ID617
Feature activation-1.55e-01

Pos logit contributions
 dam+0.071
keeping+0.069
ooks+0.068
 vet+0.068
 traps+0.068
Neg logit contributions
ifying-0.072
 la-0.063
 tr-0.062
 g-0.062
ksh-0.062
Token pink
Token ID11398
Feature activation+1.93e-01

Pos logit contributions
keeping+0.218
 traps+0.216
 dam+0.215
 vet+0.214
ooks+0.205
Neg logit contributions
ifying-0.261
ay-0.236
 la-0.233
 tr-0.231
ksh-0.229
Token sprink
Token ID27562
Feature activation+2.43e-01

Pos logit contributions
ifying+0.308
ay+0.275
 la+0.273
 tr+0.271
ksh+0.265
Neg logit contributions
 traps-0.293
keeping-0.291
 dam-0.290
 vet-0.284
ooks-0.284
Tokenles
Token ID829
Feature activation+3.27e-02

Pos logit contributions
ifying+0.403
ahoo+0.394
 �+0.387
 la+0.382
 tr+0.382
Neg logit contributions
orns-0.325
 traps-0.304
keeping-0.299
 conflicts-0.297
 killed-0.289
Token in
Token ID287
Feature activation+1.29e-01

Pos logit contributions
ifying+0.052
ay+0.046
 la+0.046
ksh+0.046
 tr+0.046
Neg logit contributions
 traps-0.048
 vet-0.048
 dam-0.048
keeping-0.048
ooks-0.046
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
ifying+0.195
ay+0.168
 g+0.165
 la+0.160
 tr+0.157
Neg logit contributions
 dam-0.218
 traps-0.217
 vet-0.212
keeping-0.211
ooks-0.205
Token kitchen
Token ID9592
Feature activation-4.48e-02

Pos logit contributions
ifying+0.025
ahoo+0.025
 Ryan+0.024
 Eric+0.024
ksh+0.024
Neg logit contributions
 vet-0.021
 traps-0.020
 hole-0.020
 hospital-0.020
 dam-0.020
Token.
Token ID13
Feature activation+3.59e-02

Pos logit contributions
 traps+0.066
 dam+0.065
 vet+0.065
keeping+0.064
ooks+0.062
Neg logit contributions
ifying-0.072
ay-0.064
 tr-0.064
ksh-0.064
 la-0.064
Token asked
Token ID1965
Feature activation-1.57e-01

Pos logit contributions
ifying+0.097
ay+0.089
ahoo+0.087
 la+0.087
ksh+0.086
Neg logit contributions
 vet-0.040
keeping-0.038
 dam-0.038
 traps-0.038
 bidder-0.035
Token.
Token ID13
Feature activation+2.66e-02

Pos logit contributions
 vet+0.177
keeping+0.173
 traps+0.168
ooks+0.163
 dam+0.158
Neg logit contributions
ifying-0.320
ksh-0.292
 la-0.289
ay-0.283
ahoo-0.282
Token
Token ID198
Feature activation-2.31e-02

Pos logit contributions
ifying+0.051
ksh+0.047
 la+0.045
ahoo+0.045
ay+0.045
Neg logit contributions
 vet-0.034
 dam-0.030
ooks-0.029
 hospital-0.029
keeping-0.029
Token
Token ID198
Feature activation-2.37e-02

Pos logit contributions
 vet+0.029
 dam+0.025
ooks+0.025
 traps+0.025
keeping+0.025
Neg logit contributions
ifying-0.045
ksh-0.042
 la-0.041
ahoo-0.040
 tr-0.039
Token"
Token ID1
Feature activation-5.01e-02

Pos logit contributions
 vet+0.032
 dam+0.028
age+0.027
ooks+0.027
 traps+0.026
Neg logit contributions
ifying-0.044
ksh-0.042
 la-0.040
ahoo-0.039
ay-0.039
TokenYes
Token ID5297
Feature activation+2.31e-01

Pos logit contributions
 vet+0.086
igi+0.074
ifi+0.069
aches+0.065
 known+0.064
Neg logit contributions
ifying-0.072
ay-0.069
ahoo-0.063
ksh-0.063
 la-0.062
Token,
Token ID11
Feature activation-1.04e-02

Pos logit contributions
ifying+0.456
ksh+0.408
Nice+0.400
ahoo+0.399
 la+0.392
Neg logit contributions
 vet-0.262
 traps-0.256
 dam-0.253
igi-0.246
ooks-0.241
Token of
Token ID286
Feature activation-5.66e-02

Pos logit contributions
 traps+0.014
 vet+0.014
 dam+0.013
igi+0.013
 hole+0.013
Neg logit contributions
ifying-0.019
ksh-0.017
ahoo-0.016
ay-0.016
ding-0.016
Token course
Token ID1781
Feature activation+1.72e-01

Pos logit contributions
 traps+0.106
 vet+0.102
 dam+0.102
keeping+0.101
orns+0.099
Neg logit contributions
ifying-0.073
ksh-0.064
ay-0.060
ahoo-0.059
 la-0.058
Token!"
Token ID2474
Feature activation-6.59e-02

Pos logit contributions
ifying+0.303
ksh+0.271
ay+0.268
ahoo+0.265
 la+0.265
Neg logit contributions
 traps-0.233
 vet-0.227
keeping-0.226
 dam-0.223
ooks-0.211
Token she
Token ID673
Feature activation-7.79e-02

Pos logit contributions
 vet+0.101
 traps+0.095
 dam+0.094
keeping+0.094
 hospital+0.092
Neg logit contributions
ifying-0.107
ksh-0.098
ahoo-0.095
 la-0.094
ay-0.094
Token Sam
Token ID3409
Feature activation+2.74e-02

Pos logit contributions
ifying+0.038
ay+0.034
 la+0.034
ksh+0.033
 tr+0.033
Neg logit contributions
keeping-0.039
 vet-0.039
 dam-0.039
 traps-0.039
ooks-0.038
Token's
Token ID338
Feature activation+6.51e-02

Pos logit contributions
ifying+0.046
ay+0.041
ahoo+0.040
 la+0.040
ksh+0.040
Neg logit contributions
 traps-0.040
keeping-0.039
 vet-0.039
 dam-0.038
 killed-0.036
Token arm
Token ID3211
Feature activation-2.12e-02

Pos logit contributions
ifying+0.112
very+0.104
ahoo+0.102
ksh+0.102
 la+0.100
Neg logit contributions
 vet-0.089
 traps-0.088
keeping-0.088
 dam-0.085
 hole-0.083
Token and
Token ID290
Feature activation-1.04e-02

Pos logit contributions
 traps+0.033
keeping+0.033
 dam+0.033
 vet+0.033
ooks+0.031
Neg logit contributions
ifying-0.033
ay-0.029
 tr-0.029
ksh-0.029
 la-0.029
Token pulling
Token ID10427
Feature activation+2.26e-02

Pos logit contributions
 traps+0.016
 vet+0.016
 dam+0.016
keeping+0.015
 killed+0.015
Neg logit contributions
ifying-0.016
ay-0.015
ksh-0.014
ahoo-0.014
very-0.014
Token him
Token ID683
Feature activation+2.53e-01

Pos logit contributions
ifying+0.039
ahoo+0.035
ksh+0.035
 la+0.034
very+0.034
Neg logit contributions
 traps-0.033
 vet-0.031
keeping-0.031
 dam-0.030
 hole-0.029
Token back
Token ID736
Feature activation+5.37e-02

Pos logit contributions
ifying+0.411
 tr+0.379
 g+0.377
ay+0.363
 la+0.360
Neg logit contributions
 dam-0.364
 vet-0.363
keeping-0.355
ooks-0.355
 traps-0.350
Token.
Token ID13
Feature activation+4.13e-02

Pos logit contributions
ifying+0.058
 tr+0.054
 g+0.052
ay+0.051
 �+0.046
Neg logit contributions
 dam-0.106
 vet-0.106
ooks-0.103
 antidote-0.102
 killed-0.102
Token
Token ID198
Feature activation-1.01e-02

Pos logit contributions
ifying+0.055
 �+0.051
 la+0.050
ay+0.050
 tr+0.049
Neg logit contributions
keeping-0.072
 traps-0.070
 dam-0.069
 conflicts-0.067
 antidote-0.067
Token
Token ID198
Feature activation-9.96e-03

Pos logit contributions
keeping+0.021
 antidote+0.021
 traps+0.021
 dam+0.021
 conflicts+0.021
Neg logit contributions
ay-0.010
 �-0.009
Jake-0.009
 Jake-0.008
 Eric-0.008
Token"
Token ID1
Feature activation-3.82e-02

Pos logit contributions
keeping+0.018
 dam+0.017
 antidote+0.017
 bidder+0.017
 conflicts+0.017
Neg logit contributions
ifying-0.012
Jake-0.011
 �-0.011
 la-0.011
ay-0.011
Token how
Token ID703
Feature activation+4.10e-02

Pos logit contributions
ifying+0.213
ksh+0.191
ay+0.190
ahoo+0.188
 la+0.188
Neg logit contributions
 traps-0.178
 vet-0.174
 dam-0.171
keeping-0.171
 conflicts-0.163
Token fast
Token ID3049
Feature activation-3.26e-02

Pos logit contributions
ifying+0.076
ksh+0.073
 la+0.071
ahoo+0.069
ay+0.069
Neg logit contributions
 traps-0.051
 vet-0.049
keeping-0.049
 conflicts-0.048
 killed-0.048
Token he
Token ID339
Feature activation+1.74e-03

Pos logit contributions
 traps+0.048
 vet+0.048
keeping+0.047
 dam+0.047
orns+0.044
Neg logit contributions
ifying-0.053
ksh-0.047
ahoo-0.047
ay-0.047
 la-0.047
Token could
Token ID714
Feature activation+4.57e-02

Pos logit contributions
ifying+0.002
 tr+0.002
ay+0.002
 la+0.002
 g+0.002
Neg logit contributions
keeping-0.003
ooks-0.003
 dam-0.003
 vet-0.003
 traps-0.003
Token run
Token ID1057
Feature activation+1.30e-01

Pos logit contributions
ifying+0.059
ay+0.055
 tr+0.055
 g+0.054
 la+0.054
Neg logit contributions
 traps-0.080
keeping-0.080
 dam-0.078
ooks-0.077
 vet-0.077
Token.
Token ID13
Feature activation+1.99e-02

Pos logit contributions
ifying+0.191
ay+0.171
 tr+0.171
 la+0.170
ksh+0.166
Neg logit contributions
 vet-0.208
keeping-0.207
 dam-0.207
 traps-0.207
ooks-0.203
Token The
Token ID383
Feature activation+2.36e-02

Pos logit contributions
ifying+0.033
 la+0.029
ay+0.029
ksh+0.029
ahoo+0.029
Neg logit contributions
 vet-0.029
 traps-0.029
keeping-0.029
 dam-0.028
ooks-0.027
Token old
Token ID1468
Feature activation-1.12e-01

Pos logit contributions
ifying+0.050
ksh+0.046
ahoo+0.044
 la+0.043
very+0.043
Neg logit contributions
 vet-0.033
 hospital-0.027
 traps-0.027
 dam-0.026
 hole-0.026
Token dog
Token ID3290
Feature activation+5.17e-02

Pos logit contributions
keeping+0.183
 traps+0.181
 dam+0.181
ooks+0.180
 conflicts+0.174
Neg logit contributions
ifying-0.150
 tr-0.149
ay-0.146
 la-0.146
 Ryan-0.144
Token tried
Token ID3088
Feature activation-1.03e-01

Pos logit contributions
ifying+0.079
 tr+0.071
 la+0.071
ay+0.070
ksh+0.070
Neg logit contributions
keeping-0.080
 traps-0.080
 vet-0.079
 dam-0.079
ooks-0.076
Token his
Token ID465
Feature activation-2.23e-03

Pos logit contributions
keeping+0.171
 dam+0.168
 vet+0.168
 traps+0.167
ooks+0.164
Neg logit contributions
ifying-0.143
 tr-0.134
 g-0.130
 la-0.129
ahoo-0.128
Tokenily
Token ID813
Feature activation-1.83e-01

Pos logit contributions
 dam+0.002
 traps+0.002
 vet+0.002
keeping+0.002
 killed+0.002
Neg logit contributions
ifying-0.002
ay-0.002
 Ryan-0.002
 la-0.002
 Eric-0.002
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
 dam+0.304
 traps+0.301
keeping+0.296
 vet+0.295
ooks+0.289
Neg logit contributions
ifying-0.251
 tr-0.236
 la-0.234
ay-0.229
 Ryan-0.227
Token Sam
Token ID3409
Feature activation+2.45e-02

Pos logit contributions
 vet+0.032
keeping+0.031
 traps+0.030
 killed+0.029
 dam+0.029
Neg logit contributions
ifying-0.034
ksh-0.031
ay-0.030
ahoo-0.030
 tr-0.029
Token are
Token ID389
Feature activation+7.71e-02

Pos logit contributions
ifying+0.037
ay+0.033
ksh+0.033
 la+0.033
ahoo+0.032
Neg logit contributions
 traps-0.038
 vet-0.038
keeping-0.038
 dam-0.037
orns-0.036
Token twins
Token ID20345
Feature activation-6.17e-02

Pos logit contributions
ifying+0.163
ksh+0.153
 tr+0.145
ahoo+0.143
ay+0.140
Neg logit contributions
ooks-0.080
 vet-0.078
 traps-0.075
 dam-0.075
 hospital-0.072
Token.
Token ID13
Feature activation+3.33e-02

Pos logit contributions
 vet+0.069
 hospital+0.062
 clinic+0.061
 jail+0.061
 killed+0.060
Neg logit contributions
ifying-0.124
ahoo-0.120
ksh-0.117
ay-0.116
 la-0.111
Token They
Token ID1119
Feature activation-7.86e-02

Pos logit contributions
ifying+0.061
ay+0.055
 la+0.054
ksh+0.054
 tr+0.054
Neg logit contributions
 traps-0.043
keeping-0.042
 vet-0.042
 dam-0.042
ooks-0.039
Token like
Token ID588
Feature activation+8.45e-02

Pos logit contributions
 vet+0.107
 killed+0.103
 hospital+0.101
keeping+0.101
ooks+0.100
Neg logit contributions
ifying-0.134
ksh-0.127
ahoo-0.119
ay-0.116
 int-0.114
Token to
Token ID284
Feature activation+2.37e-02

Pos logit contributions
ifying+0.181
ksh+0.162
ay+0.160
ahoo+0.158
 la+0.152
Neg logit contributions
 vet-0.097
keeping-0.091
 traps-0.090
 jail-0.084
 dam-0.082
Token play
Token ID711
Feature activation+1.51e-01

Pos logit contributions
ifying+0.034
ksh+0.031
ahoo+0.028
ay+0.028
very+0.028
Neg logit contributions
 vet-0.041
 dam-0.040
keeping-0.039
 traps-0.039
 bidder-0.038
Token outside
Token ID2354
Feature activation+1.50e-01

Pos logit contributions
ifying+0.286
ahoo+0.252
ksh+0.252
ay+0.241
 la+0.236
Neg logit contributions
 dam-0.212
 vet-0.211
 traps-0.209
 shelter-0.202
 hospital-0.199
Token.
Token ID13
Feature activation+2.99e-02

Pos logit contributions
ifying+0.089
ay+0.079
 la+0.078
ksh+0.077
 tr+0.077
Neg logit contributions
 traps-0.093
 dam-0.093
 vet-0.093
keeping-0.092
ooks-0.089
Token She
Token ID1375
Feature activation-4.89e-02

Pos logit contributions
ifying+0.050
 la+0.045
ay+0.045
ksh+0.045
ahoo+0.044
Neg logit contributions
 vet-0.042
 traps-0.042
keeping-0.042
 dam-0.041
ooks-0.039
Token taught
Token ID7817
Feature activation-3.18e-02

Pos logit contributions
keeping+0.080
 traps+0.077
 dam+0.077
orns+0.075
ooks+0.075
Neg logit contributions
ifying-0.070
 tr-0.068
 la-0.066
 Ryan-0.066
 g-0.064
Token them
Token ID606
Feature activation+2.12e-02

Pos logit contributions
 traps+0.046
keeping+0.046
 vet+0.046
 dam+0.045
 conflicts+0.043
Neg logit contributions
ifying-0.054
ksh-0.047
ahoo-0.046
 la-0.046
ay-0.046
Token the
Token ID262
Feature activation+1.58e-02

Pos logit contributions
ifying+0.034
ay+0.031
 la+0.031
ksh+0.030
ahoo+0.030
Neg logit contributions
 traps-0.031
 vet-0.031
keeping-0.031
 dam-0.031
ooks-0.029
Token language
Token ID3303
Feature activation+2.26e-02

Pos logit contributions
ifying+0.027
ksh+0.024
ay+0.023
ahoo+0.023
 la+0.023
Neg logit contributions
 vet-0.023
keeping-0.022
 traps-0.022
 dam-0.022
 bidder-0.021
Token for
Token ID329
Feature activation+4.80e-02

Pos logit contributions
ifying+0.032
ay+0.030
 tr+0.029
 la+0.029
 g+0.028
Neg logit contributions
 dam-0.037
keeping-0.037
 traps-0.036
 vet-0.036
igi-0.035
Token many
Token ID867
Feature activation-1.68e-01

Pos logit contributions
ifying+0.078
ay+0.069
 la+0.069
ksh+0.069
ahoo+0.068
Neg logit contributions
 traps-0.070
 vet-0.070
keeping-0.070
 dam-0.070
ooks-0.066
Token weeks
Token ID2745
Feature activation-1.58e-01

Pos logit contributions
 dam+0.230
 vet+0.229
keeping+0.229
 traps+0.223
ooks+0.218
Neg logit contributions
ifying-0.281
ay-0.263
 la-0.259
 tr-0.258
 g-0.255
Token until
Token ID1566
Feature activation-1.14e-01

Pos logit contributions
 traps+0.256
 dam+0.256
 vet+0.255
keeping+0.249
ooks+0.243
Neg logit contributions
ifying-0.226
ay-0.207
 la-0.201
 tr-0.197
ahoo-0.195
Token they
Token ID484
Feature activation-2.56e-02

Pos logit contributions
keeping+0.159
 traps+0.159
 dam+0.159
 vet+0.158
ooks+0.152
Neg logit contributions
ifying-0.190
ay-0.170
 la-0.169
 tr-0.168
ahoo-0.166
Token
Token ID198
Feature activation-2.44e-02

Pos logit contributions
keeping+0.042
 traps+0.042
 dam+0.042
 vet+0.041
 conflicts+0.040
Neg logit contributions
ifying-0.033
ay-0.029
 la-0.028
 tr-0.028
ahoo-0.028
TokenR
Token ID49
Feature activation+1.23e-01

Pos logit contributions
 vet+0.042
 traps+0.041
 dam+0.040
keeping+0.040
orns+0.039
Neg logit contributions
ifying-0.034
ksh-0.030
 la-0.030
ay-0.029
ahoo-0.029
Tokenover
Token ID2502
Feature activation-1.72e-02

Pos logit contributions
ifying+0.205
 la+0.182
ay+0.182
ksh+0.181
 tr+0.181
Neg logit contributions
 vet-0.177
 traps-0.177
keeping-0.175
 dam-0.175
ooks-0.167
Token was
Token ID373
Feature activation-4.19e-02

Pos logit contributions
 traps+0.030
 conflicts+0.030
orns+0.030
 dam+0.029
 vet+0.029
Neg logit contributions
 tr-0.022
ifying-0.022
 g-0.021
 �-0.021
very-0.021
Token so
Token ID523
Feature activation+9.15e-03

Pos logit contributions
orns+0.082
 traps+0.081
 conflicts+0.080
keeping+0.079
 dam+0.078
Neg logit contributions
ifying-0.050
 g-0.046
 �-0.044
 yes-0.042
ay-0.042
Token happy
Token ID3772
Feature activation+2.53e-02

Pos logit contributions
ifying+0.012
 g+0.011
ay+0.011
 la+0.010
ahoo+0.010
Neg logit contributions
 traps-0.016
keeping-0.016
 vet-0.016
 dam-0.016
orns-0.016
Token he
Token ID339
Feature activation+1.01e-02

Pos logit contributions
ifying+0.040
ay+0.035
 la+0.035
ksh+0.035
ahoo+0.035
Neg logit contributions
 traps-0.038
keeping-0.038
 vet-0.038
 dam-0.038
ooks-0.036
Token had
Token ID550
Feature activation+3.10e-02

Pos logit contributions
ifying+0.016
ay+0.014
ksh+0.014
 la+0.014
ahoo+0.014
Neg logit contributions
keeping-0.015
 traps-0.015
 vet-0.015
 dam-0.015
 killed-0.014
Token found
Token ID1043
Feature activation-1.95e-01

Pos logit contributions
ifying+0.054
 tr+0.052
 Ryan+0.051
ahoo+0.051
ay+0.051
Neg logit contributions
 dam-0.039
 vet-0.038
 traps-0.038
keeping-0.037
 bidder-0.037
Token the
Token ID262
Feature activation+1.38e-02

Pos logit contributions
 dam+0.352
keeping+0.351
 bidder+0.350
ooks+0.346
 vet+0.340
Neg logit contributions
ifying-0.227
 g-0.211
 tr-0.211
ay-0.211
 Eric-0.210
Token path
Token ID3108
Feature activation-1.82e-01

Pos logit contributions
 tr+0.020
ifying+0.019
ay+0.019
 g+0.019
 yes+0.018
Neg logit contributions
keeping-0.023
ooks-0.022
 conflicts-0.020
orns-0.020
 bidder-0.020
Token taught
Token ID7817
Feature activation-5.15e-02

Pos logit contributions
keeping+0.057
 traps+0.056
 dam+0.055
 vet+0.055
orns+0.054
Neg logit contributions
ifying-0.061
 tr-0.055
 la-0.055
 g-0.054
ay-0.054
Token him
Token ID683
Feature activation+2.37e-01

Pos logit contributions
 traps+0.088
keeping+0.088
 vet+0.087
 dam+0.086
 jail+0.084
Neg logit contributions
ifying-0.077
 la-0.066
ksh-0.065
ahoo-0.062
very-0.061
Token how
Token ID703
Feature activation+3.35e-02

Pos logit contributions
ifying+0.298
 tr+0.266
 g+0.265
 la+0.261
ay+0.259
Neg logit contributions
keeping-0.426
 vet-0.418
 dam-0.417
 traps-0.413
 bidder-0.411
Token to
Token ID284
Feature activation+1.11e-03

Pos logit contributions
ifying+0.068
ksh+0.063
ay+0.061
 la+0.061
ahoo+0.059
Neg logit contributions
 traps-0.039
 vet-0.037
 dam-0.037
keeping-0.036
 conflicts-0.036
Token play
Token ID711
Feature activation+1.33e-01

Pos logit contributions
ifying+0.002
ksh+0.002
ay+0.001
 la+0.001
ahoo+0.001
Neg logit contributions
 vet-0.002
 dam-0.002
 traps-0.002
keeping-0.002
 killed-0.002
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
ifying+0.208
ay+0.184
 la+0.184
ksh+0.183
ahoo+0.181
Neg logit contributions
 traps-0.204
 vet-0.203
keeping-0.203
 dam-0.202
ooks-0.193
Token Tim
Token ID5045
Feature activation+2.60e-02

Pos logit contributions
ifying+0.017
ay+0.015
 �+0.015
ksh+0.015
 Ryan+0.015
Neg logit contributions
 traps-0.022
keeping-0.021
 vet-0.021
 dam-0.021
 antidote-0.020
Tokenmy
Token ID1820
Feature activation+1.21e-02

Pos logit contributions
ifying+0.032
 tr+0.030
ay+0.029
 la+0.029
ahoo+0.029
Neg logit contributions
 traps-0.047
keeping-0.046
 vet-0.046
 dam-0.046
ooks-0.045
Token was
Token ID373
Feature activation-5.34e-02

Pos logit contributions
 tr+0.016
 g+0.015
 Ryan+0.015
ifying+0.015
 la+0.015
Neg logit contributions
keeping-0.021
 traps-0.020
ooks-0.020
 dam-0.020
igi-0.020
Token very
Token ID845
Feature activation+5.18e-02

Pos logit contributions
keeping+0.113
orns+0.110
igi+0.107
 traps+0.106
 bidder+0.105
Neg logit contributions
 g-0.055
 �-0.053
 Ryan-0.049
 yes-0.049
ifying-0.048
Token happy
Token ID3772
Feature activation+1.17e-02

Pos logit contributions
 g+0.056
ifying+0.051
 la+0.050
 �+0.048
very+0.048
Neg logit contributions
keeping-0.108
orns-0.107
 traps-0.103
 dam-0.099
 bidder-0.099
Token.
Token ID13
Feature activation+1.95e-02

Pos logit contributions
ifying+0.203
ksh+0.191
ahoo+0.182
ay+0.182
 la+0.181
Neg logit contributions
 traps-0.101
 dam-0.096
 vet-0.095
 hole-0.094
 jail-0.094
Token Suddenly
Token ID24975
Feature activation-6.50e-02

Pos logit contributions
ifying+0.030
ay+0.027
ksh+0.026
 la+0.026
 tr+0.026
Neg logit contributions
 traps-0.030
keeping-0.030
 vet-0.030
 dam-0.029
orns-0.028
Token he
Token ID339
Feature activation+4.72e-03

Pos logit contributions
keeping+0.108
 vet+0.105
 dam+0.103
 traps+0.103
orns+0.101
Neg logit contributions
ifying-0.093
 tr-0.084
ay-0.082
 la-0.082
 �-0.082
Token noticed
Token ID6810
Feature activation-2.03e-03

Pos logit contributions
ifying+0.008
ksh+0.007
 la+0.007
ay+0.007
ahoo+0.007
Neg logit contributions
 traps-0.007
 vet-0.006
 dam-0.006
keeping-0.006
 killed-0.006
Token a
Token ID257
Feature activation-9.30e-03

Pos logit contributions
keeping+0.003
 vet+0.003
 traps+0.003
 dam+0.003
 bidder+0.003
Neg logit contributions
ifying-0.003
 tr-0.003
ay-0.003
ahoo-0.003
 g-0.003
Token key
Token ID1994
Feature activation-1.54e-01

Pos logit contributions
 vet+0.014
 hole+0.014
 dam+0.013
 traps+0.013
 jail+0.013
Neg logit contributions
ifying-0.015
ksh-0.015
ahoo-0.014
 la-0.014
ay-0.014
Token lying
Token ID9105
Feature activation+1.34e-01

Pos logit contributions
 traps+0.201
 vet+0.198
 hole+0.196
keeping+0.195
 dam+0.194
Neg logit contributions
ksh-0.265
ifying-0.258
 la-0.249
ahoo-0.246
very-0.243
Token on
Token ID319
Feature activation+4.26e-02

Pos logit contributions
ksh+0.230
 la+0.222
ahoo+0.222
ifying+0.216
 g+0.212
Neg logit contributions
 traps-0.178
 hole-0.177
 dam-0.166
keeping-0.165
 vet-0.163
Token the
Token ID262
Feature activation+5.60e-03

Pos logit contributions
ifying+0.062
ay+0.054
 tr+0.053
 g+0.053
 la+0.052
Neg logit contributions
 traps-0.069
keeping-0.069
 vet-0.069
 dam-0.069
ooks-0.067
Token ground
Token ID2323
Feature activation-1.40e-01

Pos logit contributions
ifying+0.012
ksh+0.011
 la+0.011
ay+0.011
ahoo+0.011
Neg logit contributions
 vet-0.006
 traps-0.006
keeping-0.005
 dam-0.005
 hole-0.005
Token nearby
Token ID6716
Feature activation+1.08e-01

Pos logit contributions
 traps+0.206
keeping+0.206
 dam+0.205
 vet+0.205
ooks+0.193
Neg logit contributions
ifying-0.225
 la-0.205
ksh-0.204
ay-0.201
ahoo-0.200
Token useful
Token ID4465
Feature activation-9.18e-02

Pos logit contributions
keeping+0.074
 bidder+0.070
 dies+0.070
orns+0.067
 traps+0.066
Neg logit contributions
 g-0.044
ifying-0.040
 la-0.040
 Ryan-0.039
 football-0.039
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
 vet+0.140
 dam+0.140
 traps+0.139
keeping+0.139
ooks+0.134
Neg logit contributions
ifying-0.140
ay-0.126
 tr-0.126
 la-0.125
ksh-0.124
Token so
Token ID523
Feature activation+8.27e-03

Pos logit contributions
keeping+0.024
 dam+0.023
ooks+0.023
 traps+0.023
igi+0.023
Neg logit contributions
ifying-0.016
 g-0.015
 la-0.015
 �-0.015
 tr-0.015
Token she
Token ID673
Feature activation-8.13e-02

Pos logit contributions
ifying+0.012
 g+0.011
 la+0.011
 �+0.011
ay+0.011
Neg logit contributions
keeping-0.014
 dam-0.013
 vet-0.013
ooks-0.013
 traps-0.013
Token said
Token ID531
Feature activation-2.55e-03

Pos logit contributions
keeping+0.136
ooks+0.133
 traps+0.132
 dam+0.128
 conflicts+0.128
Neg logit contributions
 tr-0.111
ifying-0.108
 la-0.107
 yes-0.106
very-0.104
Token,
Token ID11
Feature activation-1.33e-02

Pos logit contributions
 vet+0.004
 traps+0.004
 Tweet+0.004
 shelter+0.004
 dam+0.004
Neg logit contributions
ifying-0.004
ksh-0.004
ay-0.004
ahoo-0.003
Jake-0.003
Token "
Token ID366
Feature activation+1.40e-01

Pos logit contributions
 vet+0.019
 traps+0.017
 dam+0.016
 bidder+0.016
 Tweet+0.016
Neg logit contributions
ifying-0.024
ay-0.021
ahoo-0.020
ee-0.020
ksh-0.020
TokenYes
Token ID5297
Feature activation+2.26e-01

Pos logit contributions
ifying+0.185
ay+0.150
 tr+0.149
ahoo+0.148
 Ryan+0.143
Neg logit contributions
 vet-0.278
keeping-0.253
orns-0.243
igi-0.239
 bidder-0.239
Token,
Token ID11
Feature activation-1.27e-02

Pos logit contributions
ifying+0.492
ksh+0.467
Nice+0.456
ahoo+0.433
 trump+0.430
Neg logit contributions
 vet-0.219
 traps-0.193
 dam-0.192
 shelter-0.192
 jail-0.190
Token I
Token ID314
Feature activation+6.43e-02

Pos logit contributions
 vet+0.021
 traps+0.019
 dam+0.018
 shelter+0.018
 hospital+0.018
Neg logit contributions
ifying-0.021
ksh-0.019
ahoo-0.018
Nice-0.017
ding-0.017
Token can
Token ID460
Feature activation+8.46e-02

Pos logit contributions
ifying+0.078
ay+0.071
 la+0.069
 tr+0.069
 g+0.069
Neg logit contributions
keeping-0.124
 traps-0.119
 dam-0.116
orns-0.115
ooks-0.115
Token brave
Token ID14802
Feature activation+4.33e-02

Pos logit contributions
 vet+0.006
 dam+0.006
 hole+0.006
 hospital+0.005
 nest+0.005
Neg logit contributions
ifying-0.009
ahoo-0.008
ksh-0.008
AY-0.008
ay-0.008
Token little
Token ID1310
Feature activation+1.09e-01

Pos logit contributions
ifying+0.078
ksh+0.068
ahoo+0.068
 int+0.064
ay+0.064
Neg logit contributions
 vet-0.063
 dam-0.061
 traps-0.059
 jail-0.057
 hole-0.057
Token boy
Token ID2933
Feature activation+1.80e-03

Pos logit contributions
ifying+0.234
ahoo+0.211
 int+0.210
ksh+0.208
 YES+0.195
Neg logit contributions
 vet-0.127
 dam-0.119
 hole-0.116
 traps-0.115
 shelter-0.111
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
ifying+0.004
ay+0.004
ksh+0.004
 la+0.004
ahoo+0.004
Neg logit contributions
 traps-0.002
 vet-0.002
keeping-0.002
 dam-0.002
ooks-0.001
Token He
Token ID679
Feature activation-2.04e-02

Pos logit contributions
ifying+0.057
 la+0.051
ksh+0.050
ahoo+0.050
 trump+0.049
Neg logit contributions
 vet-0.025
 jail-0.025
 dam-0.025
ooks-0.025
 traps-0.025
Token lived
Token ID5615
Feature activation-4.40e-02

Pos logit contributions
 vet+0.032
 traps+0.032
 dam+0.032
 killed+0.031
keeping+0.031
Neg logit contributions
ifying-0.032
ksh-0.028
ay-0.028
ahoo-0.027
 la-0.026
Token near
Token ID1474
Feature activation-3.76e-02

Pos logit contributions
 traps+0.080
 vet+0.080
keeping+0.079
 dam+0.079
ooks+0.076
Neg logit contributions
ifying-0.056
ay-0.048
 la-0.048
ksh-0.048
ahoo-0.047
Token a
Token ID257
Feature activation-2.91e-03

Pos logit contributions
 traps+0.051
 vet+0.050
 dam+0.048
keeping+0.048
 jail+0.046
Neg logit contributions
ifying-0.068
 la-0.062
ksh-0.061
ahoo-0.061
ay-0.060
Token mighty
Token ID18680
Feature activation+5.36e-02

Pos logit contributions
 vet+0.004
 hole+0.004
 dam+0.004
 hospital+0.004
 jail+0.004
Neg logit contributions
ifying-0.005
ksh-0.005
ahoo-0.005
 la-0.005
 echoing-0.004
Token lake
Token ID13546
Feature activation+6.26e-02

Pos logit contributions
ifying+0.097
ksh+0.090
ahoo+0.086
 la+0.085
ay+0.084
Neg logit contributions
 vet-0.073
 dam-0.073
 traps-0.071
 jail-0.068
 hole-0.068
Token.
Token ID13
Feature activation+2.74e-02

Pos logit contributions
ifying+0.111
ksh+0.104
 la+0.103
ahoo+0.101
ay+0.100
Neg logit contributions
 dam-0.082
 traps-0.081
 vet-0.081
keeping-0.080
ooks-0.078
Token p
Token ID279
Feature activation-2.79e-02

Pos logit contributions
 traps+0.294
keeping+0.293
 vet+0.293
 dam+0.292
ooks+0.279
Neg logit contributions
ifying-0.283
ay-0.250
 la-0.250
ksh-0.249
 tr-0.248
Tokenouted
Token ID18534
Feature activation+2.31e-02

Pos logit contributions
keeping+0.042
 vet+0.042
 dam+0.042
 traps+0.042
ooks+0.041
Neg logit contributions
ifying-0.043
 la-0.039
ksh-0.038
ay-0.038
 tr-0.038
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
ifying+0.042
ahoo+0.037
ksh+0.037
ay+0.037
 la+0.037
Neg logit contributions
 traps-0.031
 vet-0.031
keeping-0.030
 dam-0.029
 hole-0.028
Token crossed
Token ID12606
Feature activation-3.12e-02

Pos logit contributions
 vet+0.015
 traps+0.015
 dam+0.015
 killed+0.015
keeping+0.014
Neg logit contributions
ifying-0.022
ay-0.020
ahoo-0.019
ksh-0.019
 la-0.019
Token her
Token ID607
Feature activation-3.80e-03

Pos logit contributions
 traps+0.047
 vet+0.047
keeping+0.047
 dam+0.047
ooks+0.045
Neg logit contributions
ifying-0.049
ay-0.044
 la-0.043
ksh-0.043
ahoo-0.043
Token arms
Token ID5101
Feature activation-5.43e-02

Pos logit contributions
ooks+0.006
 traps+0.006
keeping+0.006
igi+0.006
 dam+0.006
Neg logit contributions
ifying-0.006
 tr-0.006
 g-0.005
ay-0.005
 la-0.005
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
 vet+0.072
 traps+0.071
keeping+0.069
 dam+0.069
ooks+0.066
Neg logit contributions
ifying-0.094
 la-0.086
ksh-0.085
ahoo-0.085
very-0.084
Token She
Token ID1375
Feature activation-4.32e-02

Pos logit contributions
ifying+0.062
ay+0.056
 la+0.056
 tr+0.055
 Ryan+0.055
Neg logit contributions
keeping-0.062
 traps-0.062
 dam-0.062
 vet-0.060
igi-0.059
Token did
Token ID750
Feature activation+1.23e-01

Pos logit contributions
keeping+0.080
 dam+0.079
 traps+0.078
ooks+0.077
orns+0.076
Neg logit contributions
ifying-0.053
 tr-0.050
 g-0.049
 la-0.048
ksh-0.046
Token not
Token ID407
Feature activation-1.97e-01

Pos logit contributions
ifying+0.239
 la+0.231
 g+0.230
 tr+0.225
 yes+0.224
Neg logit contributions
keeping-0.146
 dam-0.140
orns-0.135
 vet-0.131
igi-0.130
Token like
Token ID588
Feature activation+7.74e-02

Pos logit contributions
 traps+0.286
 vet+0.285
keeping+0.284
 dam+0.283
ooks+0.269
Neg logit contributions
ifying-0.324
ay-0.289
ksh-0.287
 la-0.287
ahoo-0.285
Token,
Token ID11
Feature activation-1.88e-02

Pos logit contributions
 traps+0.272
 vet+0.271
keeping+0.270
 dam+0.270
ooks+0.259
Neg logit contributions
ifying-0.274
 la-0.242
ay-0.242
 tr-0.239
ksh-0.238
Token her
Token ID607
Feature activation-1.79e-02

Pos logit contributions
 dam+0.034
keeping+0.034
inger+0.034
 traps+0.033
orns+0.033
Neg logit contributions
ifying-0.024
 g-0.023
 Ryan-0.022
 tr-0.022
 yes-0.021
Token mom
Token ID1995
Feature activation+1.85e-02

Pos logit contributions
 vet+0.025
 traps+0.024
 dam+0.024
 hole+0.024
 hospital+0.024
Neg logit contributions
ifying-0.031
ksh-0.028
 la-0.028
ahoo-0.028
ay-0.027
Tokenmy
Token ID1820
Feature activation+1.65e-02

Pos logit contributions
ifying+0.025
ay+0.022
 la+0.022
ksh+0.022
 tr+0.022
Neg logit contributions
 traps-0.032
keeping-0.032
 dam-0.032
 vet-0.031
ooks-0.030
Token had
Token ID550
Feature activation+2.35e-02

Pos logit contributions
ifying+0.026
 la+0.024
 tr+0.024
ay+0.024
ksh+0.024
Neg logit contributions
 traps-0.024
keeping-0.024
 dam-0.024
 vet-0.023
ooks-0.023
Token a
Token ID257
Feature activation-1.12e-02

Pos logit contributions
ifying+0.037
 tr+0.033
 g+0.033
 Ryan+0.032
ksh+0.032
Neg logit contributions
keeping-0.036
 traps-0.036
 dam-0.036
 vet-0.034
ooks-0.034
Token special
Token ID2041
Feature activation+5.84e-02

Pos logit contributions
 vet+0.018
 hospital+0.017
 dam+0.017
 shelter+0.016
 hole+0.016
Neg logit contributions
ifying-0.019
ksh-0.018
ahoo-0.016
ay-0.016
 la-0.016
Token treat
Token ID2190
Feature activation+5.68e-02

Pos logit contributions
ifying+0.090
ksh+0.080
 la+0.078
ay+0.077
ahoo+0.076
Neg logit contributions
 vet-0.094
 dam-0.091
 traps-0.090
keeping-0.089
 antidote-0.086
Token for
Token ID329
Feature activation+3.50e-02

Pos logit contributions
ifying+0.095
ksh+0.087
 la+0.087
ahoo+0.086
ay+0.084
Neg logit contributions
 vet-0.080
keeping-0.079
 traps-0.078
 dam-0.076
 hospital-0.073
Token her
Token ID607
Feature activation-1.71e-02

Pos logit contributions
ifying+0.054
ay+0.048
 la+0.048
ksh+0.047
 tr+0.047
Neg logit contributions
 traps-0.053
 vet-0.053
keeping-0.053
 dam-0.053
ooks-0.051
Token;
Token ID26
Feature activation+8.68e-02

Pos logit contributions
 vet+0.024
keeping+0.023
 traps+0.023
 dam+0.022
 hospital+0.022
Neg logit contributions
ifying-0.030
ay-0.027
 la-0.027
ksh-0.027
ahoo-0.027
Token night
Token ID1755
Feature activation-7.03e-03

Pos logit contributions
ifying+0.035
ksh+0.031
 la+0.031
ay+0.030
ahoo+0.030
Neg logit contributions
 vet-0.036
 traps-0.035
 dam-0.034
keeping-0.034
 hospital-0.033
Token when
Token ID618
Feature activation-6.11e-02

Pos logit contributions
keeping+0.011
 traps+0.011
 vet+0.011
 dam+0.011
orns+0.010
Neg logit contributions
ifying-0.011
ay-0.010
 la-0.009
 tr-0.009
ksh-0.009
Token he
Token ID339
Feature activation-3.64e-04

Pos logit contributions
keeping+0.106
 vet+0.103
 dam+0.103
inger+0.103
ooks+0.103
Neg logit contributions
ifying-0.082
 Ryan-0.074
 g-0.073
 tr-0.073
 Eric-0.073
Token came
Token ID1625
Feature activation+1.58e-02

Pos logit contributions
 traps+0.001
 vet+0.001
 dam+0.001
keeping+0.001
 killed+0.001
Neg logit contributions
ifying-0.001
ay-0.001
ksh-0.001
 la-0.000
ahoo-0.000
Token back
Token ID736
Feature activation+4.21e-02

Pos logit contributions
ifying+0.025
ksh+0.023
ay+0.023
 la+0.022
ahoo+0.022
Neg logit contributions
 vet-0.024
 traps-0.023
 dam-0.023
keeping-0.023
ooks-0.022
Token home
Token ID1363
Feature activation-2.54e-01

Pos logit contributions
ifying+0.067
ay+0.060
 la+0.060
 tr+0.059
ksh+0.059
Neg logit contributions
 traps-0.062
keeping-0.062
 vet-0.062
 dam-0.062
ooks-0.059
Token,
Token ID11
Feature activation-2.31e-02

Pos logit contributions
 traps+0.409
keeping+0.409
 vet+0.407
 dam+0.404
orns+0.401
Neg logit contributions
ifying-0.367
 tr-0.331
 g-0.329
 Ryan-0.324
ay-0.324
Token he
Token ID339
Feature activation+4.45e-04

Pos logit contributions
keeping+0.036
 traps+0.036
 vet+0.036
 dam+0.036
ooks+0.035
Neg logit contributions
ifying-0.034
 la-0.030
 tr-0.030
ay-0.030
 g-0.030
Token would
Token ID561
Feature activation+1.17e-01

Pos logit contributions
ifying+0.001
 la+0.001
 tr+0.001
 g+0.001
ksh+0.001
Neg logit contributions
keeping-0.001
 traps-0.001
 dam-0.001
 vet-0.001
ooks-0.001
Token still
Token ID991
Feature activation-4.77e-02

Pos logit contributions
ifying+0.176
ksh+0.157
ay+0.156
 la+0.153
ahoo+0.152
Neg logit contributions
 vet-0.186
 traps-0.182
keeping-0.182
 dam-0.182
 bidder-0.173
Token find
Token ID1064
Feature activation-2.49e-01

Pos logit contributions
 traps+0.075
keeping+0.074
 dam+0.074
 vet+0.073
ooks+0.072
Neg logit contributions
ifying-0.072
 la-0.064
 g-0.064
 tr-0.064
ay-0.063
Token surprised
Token ID6655
Feature activation-4.85e-02

Pos logit contributions
 traps+0.000
 dam+0.000
 vet+0.000
keeping+0.000
ooks+0.000
Neg logit contributions
ifying-0.000
 la-0.000
ksh-0.000
ay-0.000
ahoo-0.000
Token!
Token ID0
Feature activation+1.61e-01

Pos logit contributions
 traps+0.066
 vet+0.065
 dam+0.064
keeping+0.064
ooks+0.061
Neg logit contributions
ifying-0.087
ksh-0.078
 la-0.077
ay-0.076
ahoo-0.075
Token He
Token ID679
Feature activation-3.02e-02

Pos logit contributions
ifying+0.245
ay+0.217
 la+0.216
ksh+0.215
 tr+0.213
Neg logit contributions
 traps-0.250
keeping-0.249
 vet-0.249
 dam-0.247
ooks-0.237
Token was
Token ID373
Feature activation-5.63e-02

Pos logit contributions
 dam+0.045
 traps+0.044
 vet+0.044
keeping+0.043
 killed+0.043
Neg logit contributions
ifying-0.049
ay-0.044
ksh-0.043
 la-0.043
ahoo-0.043
Token so
Token ID523
Feature activation+7.54e-04

Pos logit contributions
keeping+0.094
 traps+0.094
 vet+0.093
orns+0.092
 dam+0.091
Neg logit contributions
ifying-0.080
 g-0.071
 tr-0.069
 la-0.069
ahoo-0.068
Token used
Token ID973
Feature activation-2.42e-01

Pos logit contributions
ifying+0.001
ay+0.001
 la+0.001
ksh+0.001
 tr+0.001
Neg logit contributions
 traps-0.001
 vet-0.001
keeping-0.001
 dam-0.001
ooks-0.001
Token to
Token ID284
Feature activation+5.63e-03

Pos logit contributions
 traps+0.363
 vet+0.360
keeping+0.360
 dam+0.359
ooks+0.341
Neg logit contributions
ifying-0.384
 la-0.342
ay-0.341
ksh-0.341
ahoo-0.336
Token being
Token ID852
Feature activation+5.52e-02

Pos logit contributions
ifying+0.008
 la+0.007
ay+0.007
 tr+0.007
ksh+0.007
Neg logit contributions
 traps-0.009
 vet-0.009
keeping-0.009
 dam-0.009
ooks-0.009
Token angry
Token ID7954
Feature activation+1.44e-02

Pos logit contributions
ifying+0.094
 tr+0.084
 la+0.084
ay+0.084
ksh+0.084
Neg logit contributions
keeping-0.076
 vet-0.075
 traps-0.075
 dam-0.074
 bidder-0.071
Token,
Token ID11
Feature activation-2.13e-02

Pos logit contributions
ifying+0.024
ksh+0.022
 la+0.022
ay+0.022
ahoo+0.021
Neg logit contributions
 traps-0.020
 vet-0.020
keeping-0.020
 dam-0.020
ooks-0.019
Token but
Token ID475
Feature activation-5.19e-03

Pos logit contributions
 traps+0.032
 vet+0.032
keeping+0.032
 dam+0.031
ooks+0.030
Neg logit contributions
ifying-0.034
ay-0.030
 la-0.030
ksh-0.030
 tr-0.030
Token's
Token ID338
Feature activation+5.29e-02

Pos logit contributions
ifying+0.443
 la+0.396
 tr+0.394
ay+0.392
ksh+0.387
Neg logit contributions
 traps-0.438
keeping-0.436
 dam-0.436
 vet-0.433
ooks-0.423
Token mom
Token ID1995
Feature activation+1.69e-02

Pos logit contributions
ifying+0.094
ksh+0.088
 la+0.084
very+0.084
Nice+0.082
Neg logit contributions
 vet-0.075
 traps-0.073
 dam-0.072
 hole-0.070
 hospital-0.069
Token figured
Token ID11638
Feature activation-8.89e-02

Pos logit contributions
ifying+0.028
ksh+0.025
ay+0.025
 la+0.025
ahoo+0.025
Neg logit contributions
 vet-0.024
keeping-0.023
 traps-0.023
 dam-0.023
 killed-0.022
Token out
Token ID503
Feature activation-3.90e-03

Pos logit contributions
 traps+0.131
 vet+0.130
keeping+0.130
 dam+0.129
ooks+0.123
Neg logit contributions
ifying-0.143
ksh-0.127
 la-0.127
ay-0.127
ahoo-0.126
Token a
Token ID257
Feature activation-1.96e-02

Pos logit contributions
 traps+0.006
 vet+0.006
keeping+0.006
 dam+0.006
ooks+0.005
Neg logit contributions
ifying-0.006
ksh-0.006
ay-0.006
 la-0.006
ahoo-0.006
Token way
Token ID835
Feature activation-2.46e-01

Pos logit contributions
 vet+0.029
 dam+0.029
 traps+0.028
 hole+0.027
 jail+0.027
Neg logit contributions
ifying-0.032
ksh-0.030
ay-0.029
 la-0.028
ahoo-0.028
Token to
Token ID284
Feature activation+2.16e-03

Pos logit contributions
 traps+0.371
 vet+0.370
keeping+0.370
 dam+0.368
ooks+0.351
Neg logit contributions
ifying-0.386
ay-0.342
 la-0.342
ksh-0.341
ahoo-0.337
Token get
Token ID651
Feature activation-1.09e-01

Pos logit contributions
ifying+0.003
 g+0.003
 tr+0.003
ay+0.003
 la+0.003
Neg logit contributions
 traps-0.003
keeping-0.003
 dam-0.003
 vet-0.003
ooks-0.003
Token the
Token ID262
Feature activation-5.88e-03

Pos logit contributions
 vet+0.163
 traps+0.163
 dam+0.163
keeping+0.163
ooks+0.157
Neg logit contributions
ifying-0.173
ay-0.151
 tr-0.151
ksh-0.151
 la-0.150
Token net
Token ID2010
Feature activation-6.86e-02

Pos logit contributions
 vet+0.009
 traps+0.009
 dam+0.008
 hospital+0.008
 hole+0.008
Neg logit contributions
ifying-0.010
ksh-0.009
ay-0.009
ahoo-0.009
 la-0.009
Token open
Token ID1280
Feature activation-4.91e-03

Pos logit contributions
 vet+0.107
 traps+0.107
keeping+0.106
 dam+0.106
ooks+0.103
Neg logit contributions
ifying-0.103
ay-0.092
 tr-0.091
 la-0.090
ksh-0.089
Token day
Token ID1110
Feature activation+2.58e-02

Pos logit contributions
ifying+0.133
ksh+0.118
ahoo+0.111
 la+0.111
 Ryan+0.109
Neg logit contributions
 vet-0.106
 dam-0.103
 traps-0.102
keeping-0.098
 hospital-0.098
Token,
Token ID11
Feature activation-1.25e-02

Pos logit contributions
ifying+0.041
ay+0.037
 la+0.036
 tr+0.036
ahoo+0.036
Neg logit contributions
keeping-0.039
 traps-0.038
 vet-0.038
 dam-0.038
ooks-0.036
Token the
Token ID262
Feature activation+7.29e-03

Pos logit contributions
keeping+0.021
orns+0.020
igi+0.020
 dam+0.020
 traps+0.020
Neg logit contributions
ifying-0.018
 la-0.016
 tr-0.016
 g-0.016
ay-0.016
Token telephone
Token ID11426
Feature activation+1.15e-01

Pos logit contributions
ifying+0.012
ksh+0.011
ahoo+0.010
 la+0.010
 Ple+0.010
Neg logit contributions
 vet-0.012
 hospital-0.011
 jail-0.011
 traps-0.011
 killed-0.011
Token was
Token ID373
Feature activation-4.46e-02

Pos logit contributions
ifying+0.179
ay+0.160
 la+0.159
 tr+0.159
ahoo+0.156
Neg logit contributions
 traps-0.173
keeping-0.173
 dam-0.171
 vet-0.171
ooks-0.166
Token used
Token ID973
Feature activation-2.32e-01

Pos logit contributions
keeping+0.065
 traps+0.062
 dam+0.061
 vet+0.061
orns+0.061
Neg logit contributions
ifying-0.076
 tr-0.067
ay-0.067
 la-0.066
 g-0.066
Token again
Token ID757
Feature activation-9.74e-02

Pos logit contributions
 traps+0.341
keeping+0.336
 dam+0.335
 vet+0.334
ooks+0.317
Neg logit contributions
ifying-0.374
ksh-0.338
 la-0.337
ay-0.333
ahoo-0.329
Token.
Token ID13
Feature activation+2.56e-02

Pos logit contributions
 vet+0.164
 dam+0.162
 traps+0.160
keeping+0.160
igi+0.157
Neg logit contributions
ifying-0.136
 tr-0.120
ay-0.118
 g-0.117
 la-0.117
Token This
Token ID770
Feature activation+1.64e-01

Pos logit contributions
ifying+0.035
ay+0.031
 la+0.031
ksh+0.031
ahoo+0.030
Neg logit contributions
 traps-0.044
keeping-0.043
 vet-0.043
 dam-0.043
ooks-0.041
Token time
Token ID640
Feature activation+5.15e-02

Pos logit contributions
ifying+0.210
ay+0.198
 Ryan+0.190
 tr+0.189
very+0.187
Neg logit contributions
keeping-0.298
 dam-0.293
 traps-0.292
ooks-0.289
igi-0.285
Token it
Token ID340
Feature activation-5.89e-02

Pos logit contributions
ifying+0.074
ay+0.067
 tr+0.066
 la+0.065
 �+0.065
Neg logit contributions
keeping-0.086
 vet-0.084
 dam-0.083
 traps-0.082
orns-0.081
Token They
Token ID1119
Feature activation-7.04e-02

Pos logit contributions
ifying+0.050
 �+0.047
 Ryan+0.045
 Eric+0.045
 la+0.044
Neg logit contributions
 traps-0.065
 dam-0.063
aches-0.062
orns-0.062
inger-0.061
Token also
Token ID635
Feature activation-1.17e-01

Pos logit contributions
keeping+0.117
ooks+0.116
 dam+0.113
 traps+0.112
orns+0.112
Neg logit contributions
ifying-0.100
 g-0.096
 tr-0.095
 yes-0.092
 la-0.090
Token hoped
Token ID10719
Feature activation-2.18e-01

Pos logit contributions
orns+0.181
ooks+0.180
igi+0.178
keeping+0.177
inger+0.171
Neg logit contributions
 g-0.175
ifying-0.172
 yes-0.170
 tr-0.167
 la-0.162
Token to
Token ID284
Feature activation+1.07e-02

Pos logit contributions
 dam+0.370
keeping+0.368
orns+0.367
inger+0.362
 traps+0.356
Neg logit contributions
ifying-0.291
 tr-0.260
 Ryan-0.259
 yes-0.259
 g-0.254
Token return
Token ID1441
Feature activation-2.05e-01

Pos logit contributions
ifying+0.014
 g+0.013
 yes+0.012
 tr+0.012
 jugg+0.012
Neg logit contributions
orns-0.019
ooks-0.019
 conflicts-0.018
 traps-0.018
keeping-0.018
Token home
Token ID1363
Feature activation-2.58e-01

Pos logit contributions
 dam+0.409
ooks+0.399
inger+0.388
 bidder+0.384
 antidote+0.381
Neg logit contributions
ifying-0.211
 tr-0.191
 Ryan-0.191
 yes-0.188
 g-0.184
Token soon
Token ID2582
Feature activation-2.37e-01

Pos logit contributions
 dam+0.449
 vet+0.430
orns+0.429
 bidder+0.425
 traps+0.420
Neg logit contributions
ifying-0.360
 yes-0.315
 tr-0.310
ay-0.302
 Ryan-0.301
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
 dam+0.394
 vet+0.390
 traps+0.385
ooks+0.384
keeping+0.382
Neg logit contributions
ifying-0.335
ay-0.293
 la-0.290
 tr-0.287
 Ryan-0.279
Token end
Token ID886
Feature activation+2.23e-02

Pos logit contributions
orns+0.026
ooks+0.026
inger+0.025
igi+0.025
igs+0.024
Neg logit contributions
 yes-0.019
ifying-0.019
 g-0.018
 �-0.017
 tr-0.017
Token their
Token ID511
Feature activation+1.38e-02

Pos logit contributions
ifying+0.031
 tr+0.028
 g+0.027
ay+0.027
 la+0.027
Neg logit contributions
keeping-0.036
 dam-0.036
 traps-0.036
 vet-0.036
orns-0.035
Token holiday
Token ID9912
Feature activation+1.20e-01

Pos logit contributions
ifying+0.023
ksh+0.020
ay+0.020
 la+0.020
ahoo+0.020
Neg logit contributions
 traps-0.020
 vet-0.020
keeping-0.020
 dam-0.020
 bidder-0.019