TOP ACTIVATIONS
MAX = 4.027

the side . He leans too much and loses his balance
happy . When faced with lack , we must not be
pieces . Tom fell down too . He was cold and
not careful . He swung too hard and missed the branch
The thread was too strong and Jay couldn â
down . Tom falls down too . He hurts his knee
She was too fast and too high . They
every day , he was too weak and fell straight back
said , holding up the large knot . Her
and interesting plants . One time she noticed something strange in
down . Ben fell down too . He hit his head
You have to respect the rules and stay away from places
One day , she bounced too hard and fell off the
" I am thinking about wisdom ," said the owl .
want you to respect their rules and their things . If
to climb down but was too dizz y from the height
slid . But she slid too fast and lost her balance
jet ! But she was too young . One day ,
day , the bird flew too high and hit a tree
replied . He took a long piece of string and tied

MIN ACTIVATIONS
MIN = -4.128

fire work . <|endoftext|> E dd ie was a three year
shouting " Ah oy , mate y !" and " Sh
was mean . He said sun fl owers were alive .
peaceful . The sw an looked around and saw
the shore and watches the sw an . She thinks it
O ll ie . O ll ie loved to play all
said , " Hi O ll ie ". <|endoftext|> Once ,
the pond and watch the sw ans swim . She loved
resist ! I will try the o at !"
walked until they finally reached the o asis . They drank
we say goodbye to our snow man first ?"
around the lake . The sw an wanted to join the
When Twe ety arrived at the o asis , she saw
the park to study the sw an and other animals .
by the lake . The sw an was so graceful that
sun came up , the sw an would wake up again
One morning , the sw an discovered something very unique
O ll ie . O ll ie was very adventurous and
a and Billy stayed by the o asis until they were
the park , watching the sw an and talking about all

INTERVAL 1.989 - 4.027
CONTAINS 0.410%

asked the daughter , " What do you want for your
floor . Lily picks up a shoe and tries to scan
. They have a ball and a net . They throw
sat in the yard all day , feeling sad and alone
that she would always love ponies . <|endoftext|> Once upon a

INTERVAL -0.050 - 1.989
CONTAINS 51.398%

. " Mom my , I want to go
she walked , she heard something spl ashing .
he had never gone swimming before . He looked around and
amazed and thought Mary was really smart for asking for opinions
, just like in the movies . To make

INTERVAL -2.089 - -0.050
CONTAINS 47.983%

But remember , it 's okay to make mistakes and things
to work together to fix the drain . They used sticks
She hugged Anna and said she was sorry . She said
her home . Every day , m ama oy
the doctor ." At the doctor 's office ,

INTERVAL -4.128 - -2.089
CONTAINS 0.209%

" Let 's go home now ," Ben said
chase the car - nos aur around the room , having
They were excited to see Grand ma . They hoped she
Mia wanted to make a snow man . As she picked
fire truck and notice the fire men . The fire men
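
The four intervals above look like equal-width bins spanning MIN to MAX. That is an inference from the printed numbers, not something the dashboard states. A minimal Python sketch, assuming that reading, recomputes the bin edges from the rounded MIN and MAX. The helper name equal_width_edges is hypothetical. The recomputed edges agree with the printed boundaries to within about 0.001, which is presumably rounding in the displayed values.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as printed above (already rounded to three decimals).
edges = equal_width_edges(-4.128, 4.027, 4)

# Share of activations per interval as printed above, lowest interval first.
shares = [0.209, 47.983, 51.398, 0.410]

# Pair each (left, right) edge with its reported share and print one row per bin.
for (left, right), share in zip(zip(edges, edges[1:]), shares):
    print(f"{left:+.3f} to {right:+.3f}: {share:.3f}%")
```

The two central bins together hold over 99% of the activations, so the top and bottom examples listed earlier come from thin tails of the distribution.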
Token the
Token ID262
Feature activation-3.21e-02

Pos logit contributions
bs+0.058
Scar+0.051
 cane+0.051
powers+0.051
able+0.049
Neg logit contributions
ator-0.051
icles-0.051
urg-0.048
icle-0.045
zie-0.042
Token side
Token ID1735
Feature activation+8.62e-01

Pos logit contributions
ator+0.001
urg+0.001
icles+0.001
icle+0.001
rim+0.001
Neg logit contributions
bs-0.001
 cane-0.001
Scar-0.001
 arrows-0.001
 confident-0.001
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
bs+0.032
Scar+0.032
 cane+0.031
 arrows+0.029
Har+0.029
Neg logit contributions
ator-0.041
icles-0.040
urg-0.038
icle-0.035
zie-0.034
Token He
Token ID679
Feature activation+8.74e-01

Pos logit contributions
bs+0.005
 cane+0.004
Scar+0.004
 arrows+0.004
 stamp+0.004
Neg logit contributions
urg-0.006
ator-0.006
icles-0.005
icle-0.005
zie-0.005
Token leans
Token ID39416
Feature activation-5.55e-01

Pos logit contributions
bs+0.032
 cane+0.031
Scar+0.029
Michelle+0.029
Har+0.028
Neg logit contributions
urg-0.040
ator-0.039
icles-0.038
zie-0.033
ric-0.033
Token too
Token ID1165
Feature activation+3.93e+00

Pos logit contributions
urg+0.026
ator+0.026
icles+0.024
rim+0.022
zie+0.021
Neg logit contributions
bs-0.022
 cane-0.021
Scar-0.021
 arrows-0.019
 confident-0.019
Token much
Token ID881
Feature activation+3.12e+00

Pos logit contributions
nam+0.183
Aud+0.178
pill+0.178
 stamps+0.177
 diary+0.175
Neg logit contributions
 now-0.111
ator-0.110
 next-0.105
icles-0.105
 first-0.103
Token and
Token ID290
Feature activation+1.71e+00

Pos logit contributions
nam+0.175
Jennifer+0.157
Scar+0.154
Terry+0.153
Michelle+0.153
Neg logit contributions
icles-0.089
icle-0.082
ator-0.075
 side-0.069
 them-0.068
Token loses
Token ID14754
Feature activation+5.54e-01

Pos logit contributions
Mic+0.072
Michelle+0.067
bs+0.066
able+0.065
 cane+0.065
Neg logit contributions
icles-0.066
urg-0.065
ator-0.059
icle-0.056
ric-0.055
Token his
Token ID465
Feature activation+1.61e+00

Pos logit contributions
bs+0.028
 cane+0.025
Scar+0.025
able+0.024
Aud+0.024
Neg logit contributions
ator-0.020
urg-0.020
icles-0.020
icle-0.017
 Esk-0.016
Token balance
Token ID5236
Feature activation+1.57e+00

Pos logit contributions
bs+0.062
Scar+0.060
Michelle+0.057
Charl+0.057
Aud+0.057
Neg logit contributions
ator-0.072
urg-0.070
icles-0.068
icle-0.062
 ig-0.058
Token happy
Token ID3772
Feature activation-5.47e-01

Pos logit contributions
 cane+0.041
Scar+0.040
bs+0.040
Har+0.038
Mic+0.038
Neg logit contributions
ator-0.056
urg-0.054
icles-0.050
rim-0.047
ric-0.045
Token.
Token ID13
Feature activation+5.66e-02

Pos logit contributions
urg+0.028
ator+0.028
icles+0.027
rim+0.025
icle+0.024
Neg logit contributions
bs-0.018
 cane-0.018
Scar-0.017
 arrows-0.016
 arrow-0.016
Token When
Token ID1649
Feature activation-6.58e-01

Pos logit contributions
bs+0.002
 cane+0.002
Scar+0.002
 arrows+0.002
 arrow+0.002
Neg logit contributions
ator-0.003
urg-0.003
icles-0.003
rim-0.002
icle-0.002
Token faced
Token ID7452
Feature activation+5.43e-01

Pos logit contributions
urg+0.035
ator+0.035
icles+0.033
rim+0.030
icle+0.029
Neg logit contributions
bs-0.021
 cane-0.021
Scar-0.020
 arrows-0.019
 arrow-0.018
Token with
Token ID351
Feature activation+1.12e+00

Pos logit contributions
bs+0.020
 cane+0.019
Scar+0.019
 confident+0.017
Aud+0.017
Neg logit contributions
ator-0.027
urg-0.026
icles-0.025
rim-0.022
icle-0.022
Token lack
Token ID3092
Feature activation+3.80e+00

Pos logit contributions
Scar+0.042
bs+0.042
 cane+0.040
powers+0.038
Har+0.038
Neg logit contributions
ator-0.052
urg-0.049
icles-0.049
icle-0.043
zie-0.042
Token,
Token ID11
Feature activation+1.50e+00

Pos logit contributions
Scar+0.182
nam+0.177
 cane+0.176
Har+0.173
Mic+0.170
Neg logit contributions
 them-0.110
icles-0.104
ator-0.102
 us-0.091
icle-0.089
Token we
Token ID356
Feature activation+1.73e-01

Pos logit contributions
bs+0.057
 cane+0.056
Scar+0.055
Aud+0.052
powers+0.050
Neg logit contributions
ator-0.068
icles-0.065
urg-0.064
icle-0.054
zie-0.053
Token must
Token ID1276
Feature activation-4.84e-01

Pos logit contributions
bs+0.007
 cane+0.006
Scar+0.006
 arrows+0.006
powers+0.006
Neg logit contributions
ator-0.008
urg-0.008
icles-0.007
rim-0.007
zie-0.007
Token not
Token ID407
Feature activation+8.63e-01

Pos logit contributions
urg+0.022
ator+0.022
icles+0.021
rim+0.018
icle+0.018
Neg logit contributions
bs-0.020
 cane-0.019
Scar-0.018
 arrows-0.017
 arrow-0.017
Token be
Token ID307
Feature activation+1.29e+00

Pos logit contributions
Scar+0.037
bs+0.037
 cane+0.036
Aud+0.033
 arrows+0.033
Neg logit contributions
ator-0.035
urg-0.034
icles-0.032
rim-0.029
icle-0.028
Token pieces
Token ID5207
Feature activation+1.68e+00

Pos logit contributions
bs+0.002
 cane+0.002
Scar+0.002
 arrows+0.001
 confident+0.001
Neg logit contributions
urg-0.003
ator-0.003
icles-0.003
rim-0.003
icle-0.003
Token.
Token ID13
Feature activation+3.52e-01

Pos logit contributions
Scar+0.071
Har+0.066
Aud+0.065
Michelle+0.064
 cane+0.063
Neg logit contributions
ator-0.072
icles-0.070
urg-0.068
icle-0.060
rim-0.059
Token Tom
Token ID4186
Feature activation+5.99e-01

Pos logit contributions
 cane+0.014
bs+0.014
Scar+0.013
 arrows+0.013
 arrow+0.013
Neg logit contributions
urg-0.016
ator-0.015
icles-0.015
icle-0.013
rim-0.013
Token fell
Token ID3214
Feature activation-2.96e-01

Pos logit contributions
bs+0.021
Scar+0.020
 cane+0.020
Har+0.018
Aud+0.018
Neg logit contributions
urg-0.029
ator-0.029
icles-0.028
rim-0.025
icle-0.024
Token down
Token ID866
Feature activation+1.80e-01

Pos logit contributions
ator+0.014
urg+0.014
icles+0.013
rim+0.012
icle+0.011
Neg logit contributions
bs-0.012
 cane-0.011
Scar-0.011
 arrows-0.010
 arrow-0.010
Token too
Token ID1165
Feature activation+3.75e+00

Pos logit contributions
bs+0.006
 cane+0.005
Scar+0.005
 arrows+0.005
 arrow+0.005
Neg logit contributions
ator-0.010
urg-0.010
icles-0.009
icle-0.008
rim-0.008
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
Aud+0.206
Scar+0.201
pill+0.195
Jennifer+0.190
 stamps+0.187
Neg logit contributions
ator-0.080
urg-0.078
 now-0.078
icles-0.077
 side-0.072
Token He
Token ID679
Feature activation+1.77e-01

Pos logit contributions
ator+0.010
urg+0.010
icles+0.010
rim+0.009
icle+0.009
Neg logit contributions
bs-0.007
 cane-0.007
Scar-0.007
 arrows-0.006
 arrow-0.006
Token was
Token ID373
Feature activation+4.32e-02

Pos logit contributions
bs+0.005
 cane+0.005
Scar+0.005
 arrows+0.004
Aud+0.004
Neg logit contributions
urg-0.010
ator-0.009
icles-0.009
rim-0.008
icle-0.008
Token cold
Token ID4692
Feature activation+6.12e-01

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
 arrows+0.001
Har+0.001
Neg logit contributions
urg-0.002
ator-0.002
icles-0.002
rim-0.002
icle-0.002
Token and
Token ID290
Feature activation-5.73e-01

Pos logit contributions
bs+0.021
Scar+0.020
 cane+0.019
 arrows+0.018
Har+0.017
Neg logit contributions
ator-0.032
urg-0.032
icles-0.032
icle-0.028
rim-0.028
Token not
Token ID407
Feature activation+1.60e+00

Pos logit contributions
bs+0.039
 cane+0.038
Scar+0.035
 arrows+0.035
powers+0.034
Neg logit contributions
ator-0.030
urg-0.029
icles-0.027
 Esk-0.024
zie-0.024
Token careful
Token ID8161
Feature activation+8.16e-02

Pos logit contributions
bs+0.063
 cane+0.060
pill+0.058
Scar+0.057
Har+0.056
Neg logit contributions
urg-0.070
ator-0.067
 Esk-0.063
icles-0.060
zie-0.054
Token.
Token ID13
Feature activation-5.05e-01

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.003
 arrows+0.002
 arrow+0.002
Neg logit contributions
ator-0.004
urg-0.004
icles-0.004
rim-0.004
icle-0.004
Token He
Token ID679
Feature activation+6.39e-01

Pos logit contributions
ator+0.026
urg+0.026
icles+0.025
rim+0.023
zie+0.022
Neg logit contributions
bs-0.017
 cane-0.016
Scar-0.016
 arrows-0.015
 arrow-0.015
Token swung
Token ID28507
Feature activation-2.22e-01

Pos logit contributions
bs+0.025
 cane+0.025
Scar+0.025
Har+0.023
Aud+0.022
Neg logit contributions
urg-0.027
ator-0.027
icles-0.026
rim-0.023
icle-0.023
Token too
Token ID1165
Feature activation+3.73e+00

Pos logit contributions
ator+0.011
urg+0.011
icles+0.011
rim+0.010
icle+0.009
Neg logit contributions
bs-0.008
 cane-0.007
Scar-0.007
 arrows-0.007
 arrow-0.007
Token hard
Token ID1327
Feature activation+1.87e+00

Pos logit contributions
nam+0.178
Scar+0.170
Aud+0.168
pill+0.166
Jennifer+0.165
Neg logit contributions
 now-0.097
ator-0.093
 them-0.090
 next-0.087
icles-0.087
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
bs+0.077
Scar+0.076
nam+0.073
 cane+0.072
Aud+0.072
Neg logit contributions
ator-0.078
urg-0.076
icles-0.075
icle-0.064
zie-0.059
Token missed
Token ID6825
Feature activation-7.73e-01

Pos logit contributions
 cane+0.049
Scar+0.046
Har+0.046
bs+0.045
Michelle+0.045
Neg logit contributions
icles-0.052
urg-0.051
ator-0.049
icle-0.045
 overfl-0.045
Token the
Token ID262
Feature activation+8.08e-01

Pos logit contributions
urg+0.038
ator+0.037
icles+0.035
rim+0.032
icle+0.031
Neg logit contributions
bs-0.029
 cane-0.028
Scar-0.027
 arrows-0.026
 arrow-0.025
Token branch
Token ID8478
Feature activation+8.40e-01

Pos logit contributions
bs+0.025
Scar+0.022
Har+0.021
 cane+0.020
nam+0.019
Neg logit contributions
ator-0.043
icles-0.042
urg-0.042
icle-0.039
zie-0.037
Token
Token ID220
Feature activation+2.33e-01

Pos logit contributions
ator+0.080
urg+0.079
icles+0.074
rim+0.074
icle+0.066
Neg logit contributions
bs-0.038
 cane-0.038
 arrows-0.035
Scar-0.035
 arrow-0.034
Token
Token ID198
Feature activation-5.98e-01

Pos logit contributions
bs+0.007
 cane+0.007
Scar+0.007
 arrows+0.006
 arrow+0.006
Neg logit contributions
ator-0.013
urg-0.013
icles-0.012
rim-0.011
icle-0.011
TokenThe
Token ID464
Feature activation+6.39e-01

Pos logit contributions
urg+0.037
ator+0.036
icles+0.035
rim+0.032
icle+0.032
Neg logit contributions
bs-0.015
 cane-0.014
Scar-0.014
 arrows-0.012
Har-0.012
Token thread
Token ID4704
Feature activation+1.06e+00

Pos logit contributions
bs+0.029
Scar+0.026
 cane+0.026
Har+0.025
Aud+0.025
Neg logit contributions
ator-0.026
urg-0.024
icles-0.024
icle-0.021
zie-0.021
Token was
Token ID373
Feature activation+4.57e-01

Pos logit contributions
Scar+0.047
bs+0.045
 cane+0.045
Har+0.045
Aud+0.043
Neg logit contributions
ator-0.042
urg-0.041
icles-0.040
rim-0.034
icle-0.034
Token too
Token ID1165
Feature activation+3.71e+00

Pos logit contributions
bs+0.015
 cane+0.014
Scar+0.014
 arrows+0.013
Har+0.013
Neg logit contributions
urg-0.024
ator-0.023
icles-0.022
icle-0.020
rim-0.020
Token strong
Token ID1913
Feature activation+1.85e+00

Pos logit contributions
 jury+0.159
 Bow+0.159
Jennifer+0.157
 Heidi+0.155
Aud+0.154
Neg logit contributions
urg-0.111
ie-0.105
 setting-0.095
ator-0.094
ail-0.091
Token and
Token ID290
Feature activation-8.90e-02

Pos logit contributions
Scar+0.085
bs+0.084
nam+0.083
Har+0.080
 cane+0.078
Neg logit contributions
urg-0.069
icles-0.067
ator-0.066
icle-0.057
 setting-0.056
Token Jay
Token ID9180
Feature activation+8.19e-01

Pos logit contributions
urg+0.004
ator+0.004
icles+0.004
rim+0.004
icle+0.004
Neg logit contributions
bs-0.003
 cane-0.003
Scar-0.003
 arrows-0.003
Har-0.003
Token couldn
Token ID3521
Feature activation+1.19e+00

Pos logit contributions
 cane+0.035
bs+0.035
Scar+0.035
 arrows+0.033
Aud+0.033
Neg logit contributions
urg-0.032
ator-0.032
icles-0.030
rim-0.026
zie-0.025
Tokenâ
Token ID22940
Feature activation+2.56e-01

Pos logit contributions
bs+0.067
 cane+0.063
nam+0.062
 arrows+0.061
Scar+0.060
Neg logit contributions
ator-0.038
icles-0.036
urg-0.035
icle-0.030
zie-0.027
Token down
Token ID866
Feature activation+1.59e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
 arrows+0.007
Har+0.007
Neg logit contributions
urg-0.008
ator-0.008
icles-0.008
icle-0.007
zie-0.007
Token.
Token ID13
Feature activation+4.46e-01

Pos logit contributions
bs+0.006
 cane+0.005
Scar+0.005
 arrows+0.005
 arrow+0.005
Neg logit contributions
ator-0.008
urg-0.008
icles-0.008
icle-0.007
rim-0.007
Token Tom
Token ID4186
Feature activation+2.68e-01

Pos logit contributions
bs+0.018
 cane+0.018
Scar+0.017
 arrows+0.016
able+0.016
Neg logit contributions
ator-0.020
urg-0.020
icles-0.019
icle-0.017
zie-0.016
Token falls
Token ID8953
Feature activation+2.30e-01

Pos logit contributions
bs+0.010
 cane+0.010
Scar+0.010
 arrows+0.009
nam+0.009
Neg logit contributions
urg-0.013
ator-0.012
icles-0.012
zie-0.011
rim-0.011
Token down
Token ID866
Feature activation+5.60e-01

Pos logit contributions
bs+0.010
 cane+0.009
Scar+0.009
 arrows+0.009
 stamp+0.009
Neg logit contributions
ator-0.010
urg-0.010
icles-0.009
icle-0.008
zie-0.008
Token too
Token ID1165
Feature activation+3.68e+00

Pos logit contributions
bs+0.020
 cane+0.018
Scar+0.018
 arrows+0.017
Aud+0.016
Neg logit contributions
ator-0.029
urg-0.028
icles-0.028
icle-0.025
rim-0.024
Token.
Token ID13
Feature activation-4.83e-02

Pos logit contributions
nam+0.188
Aud+0.188
 stamps+0.184
Scar+0.181
bs+0.176
Neg logit contributions
ator-0.095
 now-0.091
urg-0.090
icles-0.089
 himself-0.082
Token He
Token ID679
Feature activation+3.55e-01

Pos logit contributions
ator+0.002
urg+0.002
icles+0.002
icle+0.002
rim+0.002
Neg logit contributions
bs-0.002
 cane-0.002
Scar-0.002
 arrows-0.002
 arrow-0.002
Token hurts
Token ID20406
Feature activation-2.76e-01

Pos logit contributions
bs+0.011
 cane+0.011
Scar+0.011
Aud+0.009
Har+0.009
Neg logit contributions
urg-0.018
ator-0.018
icles-0.018
rim-0.016
zie-0.016
Token his
Token ID465
Feature activation+4.89e-01

Pos logit contributions
ator+0.012
urg+0.012
icles+0.011
rim+0.010
icle+0.010
Neg logit contributions
bs-0.012
 cane-0.012
Scar-0.011
 arrows-0.011
 arrow-0.010
Token knee
Token ID10329
Feature activation+1.47e+00

Pos logit contributions
bs+0.024
Scar+0.022
Aud+0.021
 cane+0.020
Har+0.020
Neg logit contributions
urg-0.020
ator-0.020
icles-0.018
icle-0.016
 ig-0.015
Token She
Token ID1375
Feature activation+5.43e-01

Pos logit contributions
ator+0.003
urg+0.003
icles+0.003
rim+0.003
icle+0.003
Neg logit contributions
bs-0.002
 cane-0.002
Scar-0.002
 arrows-0.002
 arrow-0.002
Token was
Token ID373
Feature activation-6.45e-01

Pos logit contributions
bs+0.020
 cane+0.020
Scar+0.019
 arrows+0.018
powers+0.017
Neg logit contributions
urg-0.025
ator-0.025
icles-0.024
rim-0.021
zie-0.021
Token too
Token ID1165
Feature activation+2.70e+00

Pos logit contributions
ator+0.035
urg+0.035
icles+0.033
rim+0.030
icle+0.029
Neg logit contributions
bs-0.020
 cane-0.020
Scar-0.019
 confident-0.017
 arrows-0.017
Token fast
Token ID3049
Feature activation+1.28e+00

Pos logit contributions
 arrows+0.112
 arrow+0.099
 cane+0.099
Scar+0.097
Aud+0.096
Neg logit contributions
urg-0.105
ator-0.099
rim-0.097
 Esk-0.092
allows-0.089
Token and
Token ID290
Feature activation+1.38e+00

Pos logit contributions
bs+0.048
Scar+0.047
nam+0.047
 cane+0.046
Har+0.044
Neg logit contributions
urg-0.057
ator-0.055
icles-0.055
icle-0.047
zie-0.047
Token too
Token ID1165
Feature activation+3.63e+00

Pos logit contributions
 cane+0.043
Har+0.042
Scar+0.042
bs+0.041
Mic+0.038
Neg logit contributions
icles-0.064
urg-0.064
ator-0.064
zie-0.059
icle-0.056
Token high
Token ID1029
Feature activation+2.64e-01

Pos logit contributions
Mic+0.131
nam+0.127
 arrows+0.126
Scar+0.123
Har+0.121
Neg logit contributions
urg-0.120
ie-0.117
ator-0.116
icles-0.114
zie-0.112
Token.
Token ID13
Feature activation+9.06e-01

Pos logit contributions
bs+0.009
 cane+0.009
Scar+0.009
 arrows+0.008
 arrow+0.008
Neg logit contributions
ator-0.013
urg-0.013
icles-0.013
icle-0.011
zie-0.011
Token
Token ID198
Feature activation+8.47e-01

Pos logit contributions
 cane+0.031
bs+0.031
 arrows+0.028
 stamp+0.028
Scar+0.028
Neg logit contributions
icles-0.043
ator-0.043
urg-0.043
zie-0.038
icle-0.038
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
 cane+0.031
bs+0.030
 arrows+0.028
Scar+0.028
 stamp+0.027
Neg logit contributions
ator-0.040
urg-0.039
icles-0.038
rim-0.034
zie-0.034
TokenThey
Token ID2990
Feature activation+4.02e-01

Pos logit contributions
 cane+0.043
bs+0.042
 arrows+0.039
 stamps+0.039
powers+0.038
Neg logit contributions
ator-0.048
urg-0.045
icles-0.044
rim-0.042
zie-0.040
Token every
Token ID790
Feature activation+8.34e-01

Pos logit contributions
bs+0.003
Scar+0.003
 cane+0.002
powers+0.002
Har+0.002
Neg logit contributions
urg-0.004
ator-0.004
icles-0.003
icle-0.003
zie-0.003
Token day
Token ID1110
Feature activation+7.18e-01

Pos logit contributions
Scar+0.044
bs+0.044
nam+0.042
 cane+0.041
 arrows+0.041
Neg logit contributions
ator-0.025
urg-0.024
icles-0.023
icle-0.020
zie-0.020
Token,
Token ID11
Feature activation+2.03e-01

Pos logit contributions
Scar+0.025
bs+0.024
 cane+0.023
Aud+0.023
Jennifer+0.022
Neg logit contributions
icles-0.035
urg-0.034
ator-0.033
icle-0.031
 Esk-0.030
Token he
Token ID339
Feature activation+8.53e-01

Pos logit contributions
bs+0.007
Scar+0.006
 cane+0.006
Aud+0.006
Har+0.006
Neg logit contributions
ator-0.011
urg-0.011
icles-0.011
icle-0.009
 Esk-0.009
Token was
Token ID373
Feature activation+1.33e+00

Pos logit contributions
bs+0.037
Scar+0.037
 cane+0.037
Aud+0.035
Har+0.035
Neg logit contributions
urg-0.032
ator-0.031
icles-0.029
rim-0.026
zie-0.025
Token too
Token ID1165
Feature activation+3.60e+00

Pos logit contributions
bs+0.050
 cane+0.048
Scar+0.047
 arrows+0.045
able+0.044
Neg logit contributions
urg-0.059
ator-0.057
icles-0.052
ric-0.050
 setting-0.049
Token weak
Token ID4939
Feature activation+1.27e+00

Pos logit contributions
ird+0.144
 arrows+0.143
bench+0.142
Scar+0.141
 Faith+0.140
Neg logit contributions
urg-0.130
ipples-0.109
ie-0.108
 hyd-0.106
ric-0.105
Token and
Token ID290
Feature activation+2.06e-01

Pos logit contributions
Scar+0.057
bs+0.057
nam+0.054
 cane+0.053
Har+0.053
Neg logit contributions
urg-0.048
icles-0.047
ator-0.046
icle-0.040
zie-0.039
Token fell
Token ID3214
Feature activation+5.65e-01

Pos logit contributions
bs+0.007
Scar+0.007
 cane+0.007
Har+0.006
 arrows+0.006
Neg logit contributions
urg-0.010
ator-0.010
icles-0.010
icle-0.009
zie-0.009
Token straight
Token ID3892
Feature activation-6.26e-01

Pos logit contributions
bs+0.024
Scar+0.023
 cane+0.023
Jennifer+0.022
Har+0.022
Neg logit contributions
urg-0.024
icles-0.023
ator-0.023
icle-0.020
zie-0.019
Token back
Token ID736
Feature activation+6.32e-01

Pos logit contributions
ator+0.032
urg+0.032
icles+0.031
rim+0.028
zie+0.027
Neg logit contributions
bs-0.021
 cane-0.021
Scar-0.020
 arrows-0.019
 arrow-0.018
Token said
Token ID531
Feature activation-1.10e-01

Pos logit contributions
bs+0.032
 cane+0.031
Scar+0.030
 arrows+0.029
nam+0.028
Neg logit contributions
ator-0.035
urg-0.035
icles-0.033
zie-0.030
rim-0.030
Token,
Token ID11
Feature activation-6.06e-01

Pos logit contributions
urg+0.006
ator+0.006
icles+0.006
rim+0.005
icle+0.005
Neg logit contributions
bs-0.003
 cane-0.003
Scar-0.003
 arrows-0.003
 arrow-0.003
Token holding
Token ID4769
Feature activation-1.58e-01

Pos logit contributions
ator+0.030
urg+0.029
icles+0.028
rim+0.025
icle+0.024
Neg logit contributions
bs-0.022
 cane-0.022
Scar-0.021
 arrows-0.019
 arrow-0.019
Token up
Token ID510
Feature activation-1.19e-01

Pos logit contributions
ator+0.008
urg+0.008
icles+0.008
rim+0.007
icle+0.007
Neg logit contributions
bs-0.006
 cane-0.005
Scar-0.005
 arrows-0.005
 arrow-0.005
Token the
Token ID262
Feature activation+3.01e+00

Pos logit contributions
urg+0.006
ator+0.006
icles+0.006
rim+0.005
icle+0.005
Neg logit contributions
bs-0.004
 cane-0.004
Scar-0.003
 arrows-0.003
 arrow-0.003
Token large
Token ID1588
Feature activation+3.60e+00

Pos logit contributions
bs+0.088
 Jerry+0.085
Har+0.081
old+0.081
Terry+0.079
Neg logit contributions
ator-0.130
 side-0.121
 edge-0.119
icles-0.118
 surface-0.116
Token knot
Token ID29654
Feature activation+1.05e+00

Pos logit contributions
Har+0.116
bs+0.116
Terry+0.114
Mic+0.109
ver+0.108
Neg logit contributions
ator-0.145
 side-0.142
icles-0.138
 size-0.136
 surface-0.134
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
bs+0.037
Scar+0.035
 cane+0.035
Har+0.033
 arrows+0.032
Neg logit contributions
ator-0.052
urg-0.050
icles-0.049
icle-0.043
zie-0.043
Token
Token ID198
Feature activation-1.37e-01

Pos logit contributions
ator+0.011
urg+0.011
icles+0.010
rim+0.009
icle+0.009
Neg logit contributions
bs-0.007
 cane-0.007
Scar-0.007
 arrows-0.006
 arrow-0.006
Token
Token ID198
Feature activation-5.82e-02

Pos logit contributions
ator+0.008
urg+0.008
icles+0.007
rim+0.007
icle+0.007
Neg logit contributions
bs-0.004
 cane-0.004
Scar-0.004
 arrows-0.003
 arrow-0.003
TokenHer
Token ID9360
Feature activation-7.41e-02

Pos logit contributions
ator+0.003
urg+0.003
icles+0.003
rim+0.003
icle+0.003
Neg logit contributions
bs-0.002
 cane-0.002
Scar-0.002
 arrows-0.002
 arrow-0.002
Token and
Token ID290
Feature activation+3.45e-01

Pos logit contributions
Scar+0.051
bs+0.050
 cane+0.048
Har+0.046
powers+0.045
Neg logit contributions
ator-0.055
urg-0.053
icles-0.051
rim-0.047
fl-0.046
Token interesting
Token ID3499
Feature activation-8.40e-01

Pos logit contributions
bs+0.013
Scar+0.013
 cane+0.013
Har+0.011
Aud+0.011
Neg logit contributions
urg-0.016
ator-0.015
icles-0.015
rim-0.013
zie-0.013
Token plants
Token ID6134
Feature activation+4.59e-01

Pos logit contributions
ator+0.038
urg+0.038
icles+0.035
rim+0.031
icle+0.031
Neg logit contributions
bs-0.034
 cane-0.034
 arrows-0.032
Scar-0.031
 arrow-0.030
Token.
Token ID13
Feature activation+2.22e+00

Pos logit contributions
bs+0.016
Scar+0.015
 cane+0.015
Har+0.014
 arrows+0.014
Neg logit contributions
ator-0.023
urg-0.022
icles-0.022
rim-0.020
zie-0.019
Token One
Token ID1881
Feature activation-2.65e-01

Pos logit contributions
bs+0.096
 tangled+0.083
br+0.082
able+0.082
 pencil+0.081
Neg logit contributions
icles-0.083
zie-0.077
urg-0.076
ator-0.074
 Later-0.072
Token time
Token ID640
Feature activation+3.58e+00

Pos logit contributions
ator+0.010
urg+0.010
icles+0.010
rim+0.008
icle+0.008
Neg logit contributions
bs-0.012
 cane-0.012
Scar-0.012
 arrows-0.011
 arrow-0.011
Token she
Token ID673
Feature activation-4.76e-01

Pos logit contributions
powers+0.150
pill+0.137
Mic+0.134
bs+0.132
azz+0.131
Neg logit contributions
 them-0.123
icles-0.122
 setting-0.112
ator-0.098
fl-0.097
Token noticed
Token ID6810
Feature activation+1.37e+00

Pos logit contributions
ator+0.022
urg+0.022
icles+0.021
rim+0.019
icle+0.019
Neg logit contributions
bs-0.018
 cane-0.018
Scar-0.017
 arrows-0.016
 arrow-0.016
Token something
Token ID1223
Feature activation+6.68e-01

Pos logit contributions
bs+0.051
Scar+0.048
 cane+0.046
powers+0.045
 confident+0.043
Neg logit contributions
icles-0.065
ator-0.064
urg-0.063
icle-0.055
zie-0.054
Token strange
Token ID6283
Feature activation-7.14e-02

Pos logit contributions
bs+0.024
Scar+0.023
 cane+0.023
Har+0.021
powers+0.021
Neg logit contributions
urg-0.031
ator-0.031
icles-0.030
rim-0.027
icle-0.026
Token in
Token ID287
Feature activation+1.04e+00

Pos logit contributions
urg+0.004
ator+0.004
icles+0.003
rim+0.003
icle+0.003
Neg logit contributions
bs-0.002
 cane-0.002
Scar-0.002
 arrows-0.002
Har-0.002
Token down
Token ID866
Feature activation-2.94e-01

Pos logit contributions
urg+0.005
ator+0.005
icles+0.004
icle+0.004
rim+0.004
Neg logit contributions
bs-0.004
 cane-0.004
Scar-0.004
 arrows-0.003
 confident-0.003
Token.
Token ID13
Feature activation+5.23e-01

Pos logit contributions
ator+0.016
urg+0.016
icles+0.016
rim+0.014
icle+0.014
Neg logit contributions
bs-0.009
 cane-0.008
Scar-0.008
 arrows-0.007
 arrow-0.007
Token Ben
Token ID3932
Feature activation+1.55e+00

Pos logit contributions
 cane+0.019
bs+0.018
Scar+0.018
 arrows+0.017
 arrow+0.017
Neg logit contributions
urg-0.025
icles-0.024
ator-0.024
icle-0.022
rim-0.021
Token fell
Token ID3214
Feature activation-1.48e-01

Pos logit contributions
Scar+0.056
bs+0.055
nam+0.055
 cane+0.054
Har+0.053
Neg logit contributions
urg-0.065
icles-0.060
ator-0.059
 overfl-0.054
rim-0.054
Token down
Token ID866
Feature activation-3.02e-02

Pos logit contributions
urg+0.007
ator+0.007
icles+0.007
rim+0.006
icle+0.006
Neg logit contributions
bs-0.006
 cane-0.006
Scar-0.005
 arrows-0.005
 arrow-0.005
Token too
Token ID1165
Feature activation+3.56e+00

Pos logit contributions
ator+0.002
urg+0.002
icles+0.002
rim+0.001
icle+0.001
Neg logit contributions
bs-0.001
 cane-0.001
Scar-0.001
 arrows-0.001
 arrow-0.001
Token.
Token ID13
Feature activation-1.79e-01

Pos logit contributions
Aud+0.212
nam+0.210
Scar+0.208
Jennifer+0.200
 stamps+0.198
Neg logit contributions
ator-0.056
urg-0.056
icles-0.055
 now-0.053
 Esk-0.052
Token He
Token ID679
Feature activation+5.14e-01

Pos logit contributions
urg+0.010
ator+0.010
icles+0.009
rim+0.008
icle+0.008
Neg logit contributions
bs-0.006
 cane-0.005
Scar-0.005
 arrows-0.005
 arrow-0.005
Token hit
Token ID2277
Feature activation-2.78e-01

Pos logit contributions
bs+0.016
Scar+0.016
 cane+0.016
Aud+0.014
 confident+0.014
Neg logit contributions
urg-0.026
icles-0.025
ator-0.025
rim-0.022
icle-0.022
Token his
Token ID465
Feature activation-8.31e-01

Pos logit contributions
ator+0.015
urg+0.015
icles+0.015
rim+0.013
icle+0.013
Neg logit contributions
bs-0.009
 cane-0.008
Scar-0.008
 arrows-0.007
 arrow-0.007
Token head
Token ID1182
Feature activation+4.13e-01

Pos logit contributions
ator+0.033
urg+0.033
icles+0.031
rim+0.028
zie+0.026
Neg logit contributions
 cane-0.038
bs-0.037
Scar-0.035
 arrow-0.035
 string-0.034
Token You
Token ID921
Feature activation+7.19e-01

Pos logit contributions
 cane+0.040
bs+0.040
Scar+0.038
 arrows+0.037
 confident+0.037
Neg logit contributions
urg-0.029
ator-0.029
icles-0.029
icle-0.025
rim-0.024
Token have
Token ID423
Feature activation+5.40e-01

Pos logit contributions
Scar+0.028
 cane+0.028
bs+0.027
Aud+0.026
 confident+0.026
Neg logit contributions
ator-0.031
urg-0.030
icles-0.030
icle-0.026
rim-0.025
Token to
Token ID284
Feature activation+3.63e-01

Pos logit contributions
bs+0.022
 cane+0.021
Scar+0.021
 arrows+0.020
 confident+0.020
Neg logit contributions
urg-0.022
icles-0.022
ator-0.022
icle-0.019
rim-0.018
Token respect
Token ID2461
Feature activation+5.15e-01

Pos logit contributions
bs+0.017
 cane+0.016
Scar+0.016
 arrows+0.015
powers+0.015
Neg logit contributions
urg-0.014
ator-0.014
icles-0.013
rim-0.012
zie-0.011
Token the
Token ID262
Feature activation-5.41e-01

Pos logit contributions
bs+0.021
Scar+0.020
 cane+0.020
powers+0.019
 confident+0.019
Neg logit contributions
ator-0.022
urg-0.022
icles-0.022
rim-0.019
icle-0.019
Token rules
Token ID3173
Feature activation+3.56e+00

Pos logit contributions
urg+0.026
ator+0.026
icles+0.025
rim+0.022
icle+0.022
Neg logit contributions
bs-0.020
 cane-0.020
Scar-0.019
 arrows-0.018
 arrow-0.018
Token and
Token ID290
Feature activation+3.81e-01

Pos logit contributions
Scar+0.158
 cane+0.154
bs+0.149
Aud+0.148
 arrows+0.147
Neg logit contributions
ator-0.138
icles-0.130
urg-0.108
icle-0.106
 us-0.105
Token stay
Token ID2652
Feature activation-1.71e-01

Pos logit contributions
bs+0.016
Scar+0.015
 cane+0.015
Har+0.014
 confident+0.014
Neg logit contributions
urg-0.016
ator-0.016
icles-0.016
rim-0.013
icle-0.013
Token away
Token ID1497
Feature activation+7.44e-01

Pos logit contributions
ator+0.008
urg+0.008
icles+0.007
rim+0.006
icle+0.006
Neg logit contributions
bs-0.007
 cane-0.007
Scar-0.007
 arrows-0.006
 arrow-0.006
Token from
Token ID422
Feature activation-4.84e-01

Pos logit contributions
bs+0.033
 cane+0.032
Scar+0.032
Aud+0.030
 arrows+0.030
Neg logit contributions
urg-0.031
ator-0.030
icles-0.029
rim-0.025
 Esk-0.024
Token places
Token ID4113
Feature activation+2.17e+00

Pos logit contributions
urg+0.023
ator+0.023
icles+0.022
rim+0.020
icle+0.019
Neg logit contributions
bs-0.018
 cane-0.018
Scar-0.017
 arrows-0.016
 arrow-0.016
TokenOne
Token ID3198
Feature activation+1.99e-01

Pos logit contributions
ator+0.012
urg+0.012
icles+0.011
rim+0.010
icle+0.010
Neg logit contributions
bs-0.008
 cane-0.007
Scar-0.007
 arrows-0.007
 arrow-0.006
Token day
Token ID1110
Feature activation+1.01e+00

Pos logit contributions
Scar+0.010
bs+0.010
 cane+0.010
 arrows+0.010
Har+0.010
Neg logit contributions
urg-0.006
ator-0.006
icles-0.006
rim-0.005
icle-0.005
Token,
Token ID11
Feature activation+2.38e-02

Pos logit contributions
 cane+0.035
Scar+0.033
bs+0.032
powers+0.032
 confident+0.031
Neg logit contributions
urg-0.047
icles-0.046
ator-0.045
icle-0.040
zie-0.039
Token she
Token ID673
Feature activation+2.45e-01

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
 arrows+0.001
powers+0.001
Neg logit contributions
ator-0.001
urg-0.001
icles-0.001
rim-0.001
icle-0.001
Token bounced
Token ID31318
Feature activation+1.33e-01

Pos logit contributions
bs+0.010
 cane+0.010
Scar+0.009
 arrows+0.009
powers+0.009
Neg logit contributions
urg-0.011
ator-0.011
icles-0.010
rim-0.009
zie-0.009
Token too
Token ID1165
Feature activation+3.54e+00

Pos logit contributions
bs+0.005
 cane+0.005
Scar+0.005
Har+0.005
 arrows+0.005
Neg logit contributions
ator-0.006
urg-0.006
icles-0.006
icle-0.005
zie-0.005
Token hard
Token ID1327
Feature activation+1.79e+00

Pos logit contributions
nam+0.161
 stamps+0.157
Scar+0.157
 jury+0.155
Jennifer+0.150
Neg logit contributions
 now-0.105
ator-0.091
icles-0.090
ie-0.089
 side-0.087
Token and
Token ID290
Feature activation+6.68e-01

Pos logit contributions
Scar+0.080
nam+0.079
bs+0.078
Jennifer+0.077
 cane+0.076
Neg logit contributions
urg-0.066
ator-0.062
icles-0.062
fl-0.053
icle-0.052
Token fell
Token ID3214
Feature activation+1.64e-01

Pos logit contributions
 cane+0.024
bs+0.024
Scar+0.023
Har+0.022
powers+0.021
Neg logit contributions
urg-0.030
icles-0.030
ator-0.030
zie-0.027
icle-0.026
Token off
Token ID572
Feature activation+4.61e-01

Pos logit contributions
bs+0.007
 cane+0.007
Scar+0.007
 arrows+0.006
Har+0.006
Neg logit contributions
urg-0.007
ator-0.007
icles-0.007
icle-0.006
zie-0.006
Token the
Token ID262
Feature activation+5.69e-01

Pos logit contributions
bs+0.017
 cane+0.016
Scar+0.016
 arrows+0.015
Har+0.015
Neg logit contributions
ator-0.022
urg-0.022
icles-0.021
icle-0.019
zie-0.019
Token"
Token ID1
Feature activation+6.64e-01

Pos logit contributions
bs+0.021
 cane+0.020
Scar+0.018
 confident+0.018
nam+0.018
Neg logit contributions
ator-0.029
urg-0.027
icles-0.026
rim-0.024
zie-0.023
TokenI
Token ID40
Feature activation+3.20e-01

Pos logit contributions
bs+0.031
 cane+0.029
Scar+0.028
 arrows+0.026
 confident+0.026
Neg logit contributions
ator-0.026
icles-0.024
urg-0.023
rim-0.021
icle-0.021
Token am
Token ID716
Feature activation+6.64e-01

Pos logit contributions
 cane+0.014
bs+0.014
Scar+0.014
 arrows+0.013
nam+0.013
Neg logit contributions
urg-0.012
ator-0.012
icles-0.011
rim-0.010
zie-0.010
Token thinking
Token ID3612
Feature activation+1.24e+00

Pos logit contributions
bs+0.026
 cane+0.024
Scar+0.023
able+0.022
powers+0.022
Neg logit contributions
urg-0.032
ator-0.031
icles-0.028
rim-0.026
 Esk-0.026
Token about
Token ID546
Feature activation+6.56e-01

Pos logit contributions
Scar+0.052
bs+0.051
 cane+0.050
Har+0.048
Aud+0.047
Neg logit contributions
icles-0.053
ator-0.053
urg-0.052
icle-0.044
zie-0.042
Token wisdom
Token ID11501
Feature activation+3.53e+00

Pos logit contributions
bs+0.027
Scar+0.025
 cane+0.025
Aud+0.023
able+0.023
Neg logit contributions
urg-0.030
ator-0.029
icles-0.028
 Esk-0.024
rim-0.024
Token,"
Token ID553
Feature activation+9.83e-01

Pos logit contributions
Scar+0.155
Aud+0.144
Har+0.143
Jennifer+0.141
Mic+0.133
Neg logit contributions
icles-0.137
ator-0.132
ie-0.124
urg-0.118
fl-0.114
Token said
Token ID531
Feature activation+1.21e+00

Pos logit contributions
bs+0.036
Scar+0.035
powers+0.034
 cane+0.034
Aud+0.033
Neg logit contributions
urg-0.045
ator-0.045
icles-0.044
rim-0.038
icle-0.038
Token the
Token ID262
Feature activation+1.58e+00

Pos logit contributions
bs+0.051
powers+0.050
Scar+0.049
 cane+0.048
Aud+0.048
Neg logit contributions
ator-0.049
urg-0.048
icles-0.047
icle-0.042
ie-0.039
Token owl
Token ID39610
Feature activation+9.30e-01

Pos logit contributions
bs+0.068
Aud+0.060
Scar+0.058
powers+0.053
Jennifer+0.051
Neg logit contributions
ator-0.072
urg-0.065
icles-0.061
 Esk-0.058
fl-0.058
Token.
Token ID13
Feature activation-5.08e-02

Pos logit contributions
Scar+0.035
 cane+0.034
bs+0.034
Aud+0.032
Har+0.031
Neg logit contributions
ator-0.043
urg-0.043
icles-0.041
zie-0.036
icle-0.036
Token want
Token ID765
Feature activation-2.08e-01

Pos logit contributions
 cane+0.039
Scar+0.038
bs+0.038
 arrows+0.037
Aud+0.036
Neg logit contributions
urg-0.048
ator-0.046
icles-0.043
rim-0.042
ric-0.041
Token you
Token ID345
Feature activation+7.31e-01

Pos logit contributions
urg+0.010
ator+0.010
icles+0.010
rim+0.009
icle+0.009
Neg logit contributions
bs-0.007
 cane-0.007
Scar-0.007
 arrows-0.007
 arrow-0.006
Token to
Token ID284
Feature activation+5.36e-01

Pos logit contributions
Scar+0.029
 cane+0.028
bs+0.028
Har+0.026
 arrows+0.026
Neg logit contributions
urg-0.031
icles-0.030
ator-0.030
icle-0.026
rim-0.026
Token respect
Token ID2461
Feature activation+5.26e-02

Pos logit contributions
bs+0.025
 cane+0.024
Scar+0.024
 arrows+0.023
able+0.023
Neg logit contributions
urg-0.020
ator-0.020
icles-0.019
rim-0.016
zie-0.016
Token their
Token ID511
Feature activation-6.36e-01

Pos logit contributions
bs+0.002
 cane+0.002
Scar+0.002
 arrows+0.002
 confident+0.002
Neg logit contributions
ator-0.002
urg-0.002
icles-0.002
rim-0.002
icle-0.002
Token rules
Token ID3173
Feature activation+3.53e+00

Pos logit contributions
ator+0.029
urg+0.029
icles+0.027
rim+0.024
icle+0.024
Neg logit contributions
bs-0.026
 cane-0.026
Scar-0.024
 arrows-0.024
 arrow-0.023
Token and
Token ID290
Feature activation+7.37e-01

Pos logit contributions
Scar+0.167
bs+0.161
Aud+0.158
 cane+0.158
 arrows+0.157
Neg logit contributions
ator-0.121
icles-0.115
urg-0.096
 us-0.092
 them-0.090
Token their
Token ID511
Feature activation-6.39e-01

Pos logit contributions
bs+0.031
Scar+0.030
 cane+0.029
Har+0.028
Aud+0.027
Neg logit contributions
urg-0.030
ator-0.029
icles-0.028
rim-0.025
zie-0.024
Token things
Token ID1243
Feature activation+1.61e+00

Pos logit contributions
urg+0.027
ator+0.027
icles+0.026
rim+0.022
icle+0.022
Neg logit contributions
bs-0.028
 cane-0.028
Scar-0.026
 arrows-0.025
 arrow-0.025
Token.
Token ID13
Feature activation+3.30e-02

Pos logit contributions
Scar+0.068
 cane+0.065
bs+0.065
Har+0.065
Jennifer+0.064
Neg logit contributions
ator-0.064
icles-0.060
urg-0.059
rim-0.049
icle-0.048
Token If
Token ID1002
Feature activation-2.20e-01

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
 arrows+0.001
 arrow+0.001
Neg logit contributions
ator-0.001
urg-0.001
icles-0.001
rim-0.001
icle-0.001
Token to
Token ID284
Feature activation+7.75e-01

Pos logit contributions
bs+0.023
Scar+0.022
 cane+0.021
able+0.020
 arrows+0.020
Neg logit contributions
urg-0.023
ator-0.022
icles-0.022
icle-0.019
zie-0.018
Token climb
Token ID12080
Feature activation+1.77e-02

Pos logit contributions
bs+0.032
Scar+0.030
 cane+0.029
able+0.028
Aud+0.027
Neg logit contributions
ator-0.035
urg-0.035
icles-0.033
zie-0.030
icle-0.030
Token down
Token ID866
Feature activation+3.73e-01

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
 arrows+0.001
 confident+0.001
Neg logit contributions
ator-0.001
urg-0.001
icles-0.001
icle-0.001
zie-0.001
Token but
Token ID475
Feature activation+2.62e-01

Pos logit contributions
bs+0.013
Scar+0.012
 cane+0.012
Aud+0.011
Har+0.011
Neg logit contributions
ator-0.020
icles-0.019
urg-0.019
icle-0.017
zie-0.016
Token was
Token ID373
Feature activation+7.82e-01

Pos logit contributions
bs+0.009
 cane+0.009
Scar+0.009
 arrows+0.008
Har+0.008
Neg logit contributions
ator-0.013
icles-0.012
urg-0.012
zie-0.011
icle-0.011
Token too
Token ID1165
Feature activation+3.53e+00

Pos logit contributions
bs+0.036
Scar+0.035
 cane+0.035
Har+0.033
 arrows+0.033
Neg logit contributions
ator-0.029
urg-0.029
icles-0.028
icle-0.024
zie-0.024
Token dizz
Token ID29527
Feature activation+5.54e-01

Pos logit contributions
 brace+0.133
 arrows+0.133
 stamps+0.130
Aud+0.128
 Faith+0.126
Neg logit contributions
urg-0.127
 Esk-0.121
ator-0.121
zie-0.117
icles-0.114
Tokeny
Token ID88
Feature activation+2.99e-01

Pos logit contributions
bs+0.027
 cane+0.027
Scar+0.026
 arrows+0.026
 arrow+0.025
Neg logit contributions
urg-0.020
ator-0.020
icles-0.020
rim-0.016
icle-0.016
Token from
Token ID422
Feature activation-2.53e-01

Pos logit contributions
bs+0.011
 cane+0.010
Scar+0.010
 arrows+0.009
Har+0.009
Neg logit contributions
ator-0.015
urg-0.015
icles-0.014
rim-0.013
icle-0.013
Token the
Token ID262
Feature activation+5.58e-01

Pos logit contributions
ator+0.012
urg+0.012
icles+0.012
rim+0.010
icle+0.010
Neg logit contributions
bs-0.009
 cane-0.009
Scar-0.009
 arrows-0.008
 arrow-0.008
Token height
Token ID6001
Feature activation+1.13e+00

Pos logit contributions
bs+0.020
Scar+0.018
 cane+0.017
Har+0.017
Aud+0.016
Neg logit contributions
ator-0.028
urg-0.027
icles-0.027
zie-0.024
icle-0.023
Token slid
Token ID27803
Feature activation-8.21e-01

Pos logit contributions
Mic+0.075
powers+0.072
pill+0.070
 cane+0.070
able+0.070
Neg logit contributions
urg-0.067
ator-0.060
icles-0.060
zie-0.054
 Esk-0.052
Token.
Token ID13
Feature activation+5.54e-01

Pos logit contributions
urg+0.042
ator+0.041
icles+0.038
rim+0.037
icle+0.033
Neg logit contributions
bs-0.030
 cane-0.029
Scar-0.028
 arrows-0.027
 arrow-0.026
Token But
Token ID887
Feature activation+6.80e-01

Pos logit contributions
bs+0.018
 cane+0.017
 arrows+0.016
able+0.016
 arrow+0.016
Neg logit contributions
urg-0.028
icles-0.028
ator-0.027
icle-0.024
zie-0.024
Token she
Token ID673
Feature activation-3.00e-01

Pos logit contributions
bs+0.021
 cane+0.021
powers+0.021
Scar+0.020
Har+0.019
Neg logit contributions
ator-0.035
icles-0.034
urg-0.034
zie-0.031
icle-0.030
Token slid
Token ID27803
Feature activation-5.60e-01

Pos logit contributions
ator+0.014
urg+0.014
icles+0.013
rim+0.012
icle+0.011
Neg logit contributions
bs-0.012
 cane-0.011
Scar-0.011
 arrows-0.010
 arrow-0.010
Token too
Token ID1165
Feature activation+3.52e+00

Pos logit contributions
urg+0.027
ator+0.027
icles+0.025
rim+0.023
icle+0.022
Neg logit contributions
bs-0.021
 cane-0.021
Scar-0.020
 arrows-0.019
 arrow-0.019
Token fast
Token ID3049
Feature activation+9.59e-01

Pos logit contributions
Scar+0.154
pill+0.154
 stamps+0.151
Aud+0.149
Jennifer+0.147
Neg logit contributions
icles-0.103
 now-0.103
ator-0.101
 them-0.093
 Esk-0.091
Token and
Token ID290
Feature activation+7.39e-01

Pos logit contributions
bs+0.037
Scar+0.036
 cane+0.036
nam+0.034
Har+0.034
Neg logit contributions
ator-0.042
urg-0.042
icles-0.041
icle-0.036
zie-0.034
Token lost
Token ID2626
Feature activation-2.71e-01

Pos logit contributions
 cane+0.026
Scar+0.024
bs+0.024
Har+0.023
Michelle+0.022
Neg logit contributions
urg-0.034
icles-0.034
ator-0.033
zie-0.030
icle-0.030
Token her
Token ID607
Feature activation+2.59e-01

Pos logit contributions
ator+0.015
urg+0.014
icles+0.014
rim+0.012
icle+0.012
Neg logit contributions
bs-0.009
 cane-0.008
Scar-0.008
 arrows-0.007
 arrow-0.007
Token balance
Token ID5236
Feature activation+7.10e-01

Pos logit contributions
bs+0.010
Scar+0.010
 cane+0.009
Har+0.009
Aud+0.009
Neg logit contributions
ator-0.012
urg-0.012
icles-0.011
icle-0.010
rim-0.010
Token jet
Token ID12644
Feature activation+1.34e+00

Pos logit contributions
bs+0.067
Scar+0.061
Aud+0.055
 Jerry+0.054
Har+0.053
Neg logit contributions
ator-0.078
urg-0.070
icles-0.068
 ig-0.067
 waterfall-0.067
Token!
Token ID0
Feature activation+2.58e-01

Pos logit contributions
bs+0.053
Scar+0.051
 cane+0.048
powers+0.047
Har+0.047
Neg logit contributions
ator-0.060
urg-0.055
icles-0.055
zie-0.048
icle-0.048
Token But
Token ID887
Feature activation+1.98e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.007
 arrows+0.007
 stamp+0.007
Neg logit contributions
ator-0.014
icles-0.014
urg-0.014
zie-0.012
icle-0.012
Token she
Token ID673
Feature activation+2.56e-01

Pos logit contributions
bs+0.007
 cane+0.007
Scar+0.006
powers+0.006
 arrows+0.006
Neg logit contributions
ator-0.010
urg-0.010
icles-0.010
zie-0.008
icle-0.008
Token was
Token ID373
Feature activation+1.32e+00

Pos logit contributions
bs+0.012
 cane+0.012
Scar+0.011
 arrows+0.011
powers+0.011
Neg logit contributions
ator-0.010
urg-0.010
icles-0.009
rim-0.008
zie-0.008
Token too
Token ID1165
Feature activation+3.51e+00

Pos logit contributions
bs+0.059
 cane+0.054
 arrows+0.052
powers+0.051
Scar+0.050
Neg logit contributions
ator-0.052
urg-0.050
icles-0.044
zie-0.043
 Esk-0.043
Token young
Token ID1862
Feature activation+1.26e+00

Pos logit contributions
 arrows+0.147
 brace+0.141
pill+0.140
Mic+0.136
 Bow+0.133
Neg logit contributions
ator-0.119
urg-0.117
ie-0.103
 Esk-0.101
 hyd-0.100
Token.
Token ID13
Feature activation+6.20e-01

Pos logit contributions
bs+0.054
Scar+0.050
 cane+0.049
 arrows+0.048
Mic+0.048
Neg logit contributions
ator-0.050
urg-0.049
icles-0.045
zie-0.040
fl-0.039
Token One
Token ID1881
Feature activation+1.07e+00

Pos logit contributions
bs+0.021
 cane+0.020
Scar+0.019
 arrows+0.018
 stamp+0.018
Neg logit contributions
ator-0.032
icles-0.031
urg-0.030
zie-0.028
icle-0.027
Token day
Token ID1110
Feature activation+1.30e+00

Pos logit contributions
Scar+0.067
Aud+0.063
Har+0.063
bs+0.062
Charl+0.061
Neg logit contributions
ator-0.024
urg-0.021
icles-0.021
icle-0.018
 ig-0.017
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
powers+0.047
 cane+0.046
Scar+0.045
Aud+0.043
bs+0.043
Neg logit contributions
icles-0.057
ator-0.056
urg-0.056
zie-0.049
 setting-0.048
Token day
Token ID1110
Feature activation+1.07e+00

Pos logit contributions
bs+0.011
Scar+0.011
 cane+0.011
 arrows+0.010
Har+0.010
Neg logit contributions
ator-0.007
urg-0.007
icles-0.007
rim-0.006
icle-0.006
Token,
Token ID11
Feature activation+1.28e-01

Pos logit contributions
Scar+0.042
Aud+0.039
Jennifer+0.038
 cane+0.038
 confident+0.038
Neg logit contributions
icles-0.048
urg-0.048
ator-0.044
icle-0.043
 Esk-0.038
Token the
Token ID262
Feature activation-4.26e-01

Pos logit contributions
bs+0.004
Scar+0.004
 cane+0.004
Aud+0.004
powers+0.004
Neg logit contributions
urg-0.006
ator-0.006
icles-0.006
icle-0.006
 Esk-0.005
Token bird
Token ID6512
Feature activation+4.27e-02

Pos logit contributions
urg+0.023
ator+0.022
icles+0.022
rim+0.019
zie+0.019
Neg logit contributions
bs-0.014
 cane-0.014
Scar-0.013
 arrows-0.012
 arrow-0.012
Token flew
Token ID13112
Feature activation+1.59e-01

Pos logit contributions
bs+0.002
 cane+0.002
Scar+0.002
 arrows+0.001
Har+0.001
Neg logit contributions
urg-0.002
ator-0.002
icles-0.002
rim-0.002
zie-0.002
Token too
Token ID1165
Feature activation+3.51e+00

Pos logit contributions
bs+0.005
 cane+0.005
Scar+0.005
 arrows+0.004
Har+0.004
Neg logit contributions
urg-0.008
ator-0.008
icles-0.008
icle-0.007
zie-0.007
Token high
Token ID1029
Feature activation+1.20e-01

Pos logit contributions
Scar+0.157
 Jennifer+0.153
Jennifer+0.151
Aud+0.151
 Alison+0.148
Neg logit contributions
ie-0.111
ator-0.101
pl-0.101
 edge-0.099
 side-0.096
Token and
Token ID290
Feature activation+2.56e-01

Pos logit contributions
bs+0.004
 cane+0.004
Scar+0.004
 arrows+0.004
 arrow+0.004
Neg logit contributions
ator-0.006
urg-0.006
icles-0.006
icle-0.005
rim-0.005
Token hit
Token ID2277
Feature activation+2.67e-01

Pos logit contributions
bs+0.007
 cane+0.007
Scar+0.007
 arrows+0.006
Har+0.006
Neg logit contributions
urg-0.014
ator-0.014
icles-0.013
zie-0.012
icle-0.012
Token a
Token ID257
Feature activation+7.43e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
Aud+0.007
powers+0.007
Neg logit contributions
urg-0.015
icles-0.015
ator-0.015
icle-0.013
zie-0.013
Token tree
Token ID5509
Feature activation+8.81e-01

Pos logit contributions
bs+0.024
Scar+0.022
 cane+0.020
Aud+0.019
able+0.019
Neg logit contributions
urg-0.039
ator-0.039
icles-0.037
rim-0.033
icle-0.033
Token replied
Token ID8712
Feature activation+2.37e-01

Pos logit contributions
 cane+0.040
Scar+0.038
bs+0.038
nam+0.036
Har+0.036
Neg logit contributions
urg-0.055
ator-0.052
icles-0.050
icle-0.044
man-0.044
Token.
Token ID13
Feature activation-4.02e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
 arrows+0.008
Aud+0.007
Neg logit contributions
ator-0.012
urg-0.012
icles-0.011
rim-0.010
icle-0.010
Token He
Token ID679
Feature activation+9.90e-01

Pos logit contributions
urg+0.021
ator+0.021
icles+0.020
rim+0.018
icle+0.017
Neg logit contributions
bs-0.014
 cane-0.013
Scar-0.013
 arrows-0.012
 arrow-0.011
Token took
Token ID1718
Feature activation+9.66e-02

Pos logit contributions
bs+0.039
 cane+0.038
Scar+0.037
nam+0.036
Har+0.035
Neg logit contributions
ator-0.042
urg-0.042
icles-0.037
zie-0.033
io-0.033
Token a
Token ID257
Feature activation+3.35e+00

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.003
 arrows+0.003
 arrow+0.003
Neg logit contributions
ator-0.005
urg-0.005
icles-0.005
icle-0.004
zie-0.004
Token long
Token ID890
Feature activation+3.50e+00

Pos logit contributions
aces+0.126
 cheated+0.124
pill+0.122
uts+0.122
nam+0.119
Neg logit contributions
 waterfall-0.098
ator-0.098
ail-0.097
 setting-0.097
ie-0.096
Token piece
Token ID3704
Feature activation+1.09e+00

Pos logit contributions
nam+0.150
 cheated+0.131
 reckless+0.130
powers+0.127
bs+0.125
Neg logit contributions
ator-0.130
urg-0.129
zie-0.125
icle-0.122
icles-0.121
Token of
Token ID286
Feature activation+2.31e+00

Pos logit contributions
Scar+0.043
bs+0.042
 cane+0.040
nam+0.039
Har+0.039
Neg logit contributions
ator-0.049
urg-0.048
icles-0.048
icle-0.044
rim-0.041
Token string
Token ID4731
Feature activation+1.40e+00

Pos logit contributions
Scar+0.074
Har+0.073
ver+0.071
uts+0.067
 cheated+0.067
Neg logit contributions
icles-0.105
icle-0.099
urg-0.095
ator-0.094
 setting-0.090
Token and
Token ID290
Feature activation+2.12e+00

Pos logit contributions
Scar+0.053
bs+0.052
Har+0.049
Aud+0.048
Jennifer+0.047
Neg logit contributions
ator-0.066
icles-0.065
urg-0.063
icle-0.058
zie-0.052
Token tied
Token ID8165
Feature activation+1.50e-01

Pos logit contributions
Har+0.082
Scar+0.081
 cane+0.078
able+0.076
Mic+0.074
Neg logit contributions
urg-0.082
icles-0.079
ator-0.076
 setting-0.070
bled-0.070
Token fire
Token ID2046
Feature activation-2.99e+00

Pos logit contributions
urg+0.101
icles+0.097
zie+0.090
ator+0.090
io+0.090
Neg logit contributions
 cane-0.067
 long-0.065
 string-0.065
 stamp-0.063
 arrow-0.061
Tokenwork
Token ID1818
Feature activation-1.31e+00

Pos logit contributions
urg+0.099
icles+0.091
rim+0.091
ator+0.088
zie+0.086
Neg logit contributions
bs-0.131
 cane-0.122
 stamp-0.121
 arrow-0.114
 short-0.114
Token.
Token ID13
Feature activation-6.24e-01

Pos logit contributions
ator+0.068
urg+0.067
icles+0.066
rim+0.059
icle+0.058
Neg logit contributions
bs-0.044
 cane-0.042
Scar-0.038
 stamp-0.038
 string-0.038
Token<|endoftext|>
Token ID50256
Feature activation+1.01e-01

Pos logit contributions
urg+0.038
ator+0.037
icles+0.036
rim+0.033
icle+0.032
Neg logit contributions
bs-0.016
 cane-0.015
Scar-0.015
 confident-0.013
 arrow-0.013
TokenE
Token ID36
Feature activation-7.09e-01

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.003
 arrows+0.003
 arrow+0.003
Neg logit contributions
ator-0.005
urg-0.005
icles-0.005
rim-0.005
icle-0.005
Tokendd
Token ID1860
Feature activation-4.13e+00

Pos logit contributions
ator+0.034
urg+0.034
icles+0.033
rim+0.029
icle+0.029
Neg logit contributions
bs-0.026
 cane-0.025
Scar-0.024
Har-0.022
 stamp-0.022
Tokenie
Token ID494
Feature activation+5.98e-01

Pos logit contributions
umm+0.163
ator+0.140
urg+0.140
icles+0.133
omm+0.124
Neg logit contributions
bs-0.159
 tied-0.157
able-0.155
 cane-0.153
 arrow-0.148
Token was
Token ID373
Feature activation+4.72e-01

Pos logit contributions
bs+0.022
 cane+0.021
Scar+0.021
 arrows+0.020
nam+0.020
Neg logit contributions
urg-0.028
ator-0.028
icles-0.026
zie-0.024
rim-0.023
Token a
Token ID257
Feature activation+7.65e-01

Pos logit contributions
bs+0.016
 cane+0.015
Scar+0.015
 arrows+0.014
powers+0.014
Neg logit contributions
urg-0.025
ator-0.024
icles-0.023
rim-0.021
icle-0.021
Token three
Token ID1115
Feature activation-4.72e-02

Pos logit contributions
bs+0.027
Scar+0.024
 arrows+0.024
 cane+0.024
Aud+0.023
Neg logit contributions
ator-0.038
urg-0.037
icles-0.034
 Esk-0.032
 ig-0.031
Token year
Token ID614
Feature activation+7.33e-01

Pos logit contributions
ator+0.002
urg+0.002
icles+0.002
rim+0.002
icle+0.002
Neg logit contributions
bs-0.002
 cane-0.002
Scar-0.002
 arrows-0.002
 arrow-0.002
Token shouting
Token ID19642
Feature activation-1.16e+00

Pos logit contributions
ator+0.021
urg+0.021
icles+0.020
rim+0.018
icle+0.017
Neg logit contributions
bs-0.018
 cane-0.017
Scar-0.017
 arrows-0.016
 arrow-0.016
Token "
Token ID366
Feature activation-1.25e+00

Pos logit contributions
ator+0.071
urg+0.068
icles+0.067
rim+0.064
zie+0.062
Neg logit contributions
bs-0.030
 cane-0.029
Scar-0.026
 arrows-0.026
 arrow-0.024
TokenAh
Token ID10910
Feature activation-1.77e-01

Pos logit contributions
urg+0.060
ator+0.059
icles+0.054
rim+0.052
zie+0.049
Neg logit contributions
bs-0.046
 cane-0.045
Scar-0.044
 arrows-0.042
 string-0.042
Tokenoy
Token ID726
Feature activation-7.82e-01

Pos logit contributions
urg+0.008
ator+0.008
icles+0.008
rim+0.007
icle+0.007
Neg logit contributions
bs-0.007
 cane-0.007
Scar-0.007
 arrows-0.006
 arrow-0.006
Token,
Token ID11
Feature activation-1.47e+00

Pos logit contributions
ator+0.044
urg+0.043
icles+0.042
rim+0.039
zie+0.037
Neg logit contributions
bs-0.023
 cane-0.022
Scar-0.021
 arrows-0.019
 arrow-0.019
Token mate
Token ID16133
Feature activation-3.92e+00

Pos logit contributions
ator+0.071
urg+0.070
icles+0.066
rim+0.065
zie+0.059
Neg logit contributions
bs-0.053
 cane-0.052
 arrow-0.051
 arrows-0.050
 string-0.048
Tokeny
Token ID88
Feature activation-1.13e+00

Pos logit contributions
urg+0.195
ator+0.192
icles+0.188
rim+0.188
umm+0.176
Neg logit contributions
bs-0.092
 cane-0.089
able-0.087
ver-0.084
,-0.080
Token!"
Token ID2474
Feature activation-1.65e+00

Pos logit contributions
ator+0.056
urg+0.055
icles+0.053
rim+0.049
zie+0.047
Neg logit contributions
 cane-0.039
bs-0.039
Scar-0.036
 arrows-0.035
 arrow-0.035
Token and
Token ID290
Feature activation-2.60e+00

Pos logit contributions
ator+0.092
urg+0.092
icles+0.082
rim+0.081
ators+0.076
Neg logit contributions
bs-0.052
 Jerry-0.044
powers-0.041
 cane-0.041
 she-0.041
Token "
Token ID366
Feature activation-9.40e-01

Pos logit contributions
ator+0.162
urg+0.150
icles+0.145
rim+0.143
umm+0.143
Neg logit contributions
bs-0.070
 she-0.059
 long-0.057
 arrow-0.056
 arrows-0.056
TokenSh
Token ID2484
Feature activation-1.09e+00

Pos logit contributions
urg+0.047
ator+0.047
icles+0.044
rim+0.040
zie+0.039
Neg logit contributions
bs-0.034
 cane-0.033
Scar-0.032
 arrows-0.030
 arrow-0.030
Token was
Token ID373
Feature activation-2.08e-01

Pos logit contributions
ator+0.009
urg+0.009
icles+0.009
rim+0.008
icle+0.008
Neg logit contributions
bs-0.008
 cane-0.008
Scar-0.008
 arrows-0.007
 arrow-0.007
Token mean
Token ID1612
Feature activation-8.18e-01

Pos logit contributions
ator+0.011
urg+0.011
icles+0.011
rim+0.010
icle+0.009
Neg logit contributions
bs-0.007
 cane-0.007
Scar-0.006
 arrows-0.006
 arrow-0.006
Token.
Token ID13
Feature activation-2.74e-01

Pos logit contributions
ator+0.050
urg+0.050
icles+0.047
rim+0.044
zie+0.043
Neg logit contributions
bs-0.021
 cane-0.020
Scar-0.018
 arrows-0.017
 arrow-0.017
Token He
Token ID679
Feature activation-2.41e-02

Pos logit contributions
ator+0.014
urg+0.014
icles+0.013
rim+0.011
icle+0.011
Neg logit contributions
bs-0.010
 cane-0.010
Scar-0.009
 arrows-0.009
 arrow-0.009
Token said
Token ID531
Feature activation-1.76e-01

Pos logit contributions
urg+0.001
ator+0.001
icles+0.001
rim+0.001
icle+0.001
Neg logit contributions
bs-0.001
 cane-0.001
Scar-0.001
 arrows-0.001
powers-0.001
Token sun
Token ID4252
Feature activation-3.85e+00

Pos logit contributions
urg+0.010
ator+0.010
icles+0.010
rim+0.009
icle+0.009
Neg logit contributions
bs-0.005
 cane-0.005
Scar-0.004
 arrows-0.004
 arrow-0.004
Tokenfl
Token ID2704
Feature activation-1.70e+00

Pos logit contributions
urg+0.146
ator+0.142
rim+0.132
 Esk+0.116
umm+0.116
Neg logit contributions
bs-0.157
 she-0.150
 cane-0.146
ite-0.143
 stamp-0.138
Tokenowers
Token ID3618
Feature activation-3.63e-01

Pos logit contributions
ator+0.098
urg+0.098
icles+0.087
rim+0.085
zie+0.084
Neg logit contributions
bs-0.047
 cane-0.037
ite-0.035
able-0.035
 arrows-0.033
Token were
Token ID547
Feature activation-1.01e+00

Pos logit contributions
ator+0.018
urg+0.018
icles+0.017
rim+0.015
icle+0.015
Neg logit contributions
bs-0.013
 cane-0.013
Scar-0.012
 arrows-0.011
 arrow-0.011
Token alive
Token ID6776
Feature activation-1.70e-02

Pos logit contributions
ator+0.054
urg+0.053
icles+0.051
rim+0.046
zie+0.045
Neg logit contributions
bs-0.033
 cane-0.032
Scar-0.029
 arrows-0.029
 arrow-0.028
Token.
Token ID13
Feature activation-3.16e-01

Pos logit contributions
ator+0.001
urg+0.001
icles+0.001
rim+0.001
icle+0.001
Neg logit contributions
bs-0.001
 cane-0.001
Scar-0.001
 arrows-0.000
Har-0.000
Token peaceful
Token ID12309
Feature activation-1.36e+00

Pos logit contributions
urg+0.044
ator+0.044
icles+0.042
rim+0.039
icle+0.038
Neg logit contributions
bs-0.021
 cane-0.020
Scar-0.019
 arrows-0.018
 confident-0.018
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
urg+0.087
ator+0.086
icles+0.082
rim+0.078
zie+0.076
Neg logit contributions
 cane-0.029
bs-0.028
 arrows-0.024
 confident-0.023
 arrow-0.023
Token\n
Token ID198
Feature activation+9.98e-02

Pos logit contributions
bs+0.000
 cane+0.000
Scar+0.000
 arrows+0.000
 stamp+0.000
Neg logit contributions
ator-0.001
urg-0.001
icles-0.001
icle-0.001
rim-0.001
Token\n
Token ID198
Feature activation-2.98e-01

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.003
 arrows+0.003
 arrow+0.003
Neg logit contributions
ator-0.005
urg-0.005
icles-0.005
rim-0.005
icle-0.005
TokenThe
Token ID464
Feature activation-2.51e+00

Pos logit contributions
urg+0.016
ator+0.016
icles+0.015
rim+0.014
icle+0.014
Neg logit contributions
bs-0.009
 cane-0.009
Scar-0.009
 arrows-0.008
 arrow-0.008
Token sw
Token ID1509
Feature activation-3.85e+00

Pos logit contributions
urg+0.141
icles+0.139
ator+0.135
ators+0.122
rol+0.121
Neg logit contributions
 arrow-0.080
 pigeon-0.079
 long-0.078
 arrows-0.076
 string-0.075
Tokenan
Token ID272
Feature activation-3.49e-01

Pos logit contributions
icles+0.154
riage+0.135
 Manny+0.133
icle+0.132
ator+0.128
Neg logit contributions
 arrows-0.142
 arrow-0.137
 ar-0.137
powers-0.134
 pigeon-0.133
Token looked
Token ID3114
Feature activation-1.63e-01

Pos logit contributions
ator+0.018
urg+0.018
icles+0.017
rim+0.015
icle+0.015
Neg logit contributions
bs-0.012
 cane-0.012
Scar-0.011
 arrows-0.011
 arrow-0.010
Token around
Token ID1088
Feature activation-1.47e+00

Pos logit contributions
urg+0.007
ator+0.007
icles+0.007
rim+0.006
icle+0.006
Neg logit contributions
bs-0.007
 cane-0.007
Scar-0.007
 arrows-0.006
 arrow-0.006
Token and
Token ID290
Feature activation-6.74e-01

Pos logit contributions
ator+0.098
urg+0.094
icles+0.094
rim+0.087
icle+0.084
Neg logit contributions
bs-0.031
 cane-0.030
 long-0.028
 confident-0.027
 short-0.027
Token saw
Token ID2497
Feature activation-4.19e-01

Pos logit contributions
ator+0.037
urg+0.037
icles+0.035
rim+0.032
icle+0.031
Neg logit contributions
bs-0.022
 cane-0.020
Scar-0.019
 arrows-0.018
 confident-0.018
Token the
Token ID262
Feature activation-1.77e+00

Pos logit contributions
bs+0.031
 cane+0.027
powers+0.027
Scar+0.027
 confident+0.025
Neg logit contributions
ator-0.037
urg-0.036
icles-0.036
icle-0.031
zie-0.030
Token shore
Token ID15191
Feature activation-3.88e-01

Pos logit contributions
ator+0.083
urg+0.083
icles+0.081
rol+0.072
icle+0.069
Neg logit contributions
 cane-0.074
 arrow-0.071
 string-0.070
 arrows-0.067
 long-0.065
Token and
Token ID290
Feature activation+3.88e-01

Pos logit contributions
urg+0.023
ator+0.023
icles+0.022
rim+0.020
icle+0.020
Neg logit contributions
bs-0.010
 cane-0.010
Scar-0.009
 arrows-0.008
 arrow-0.008
Token watches
Token ID16860
Feature activation-5.41e-01

Pos logit contributions
bs+0.015
 cane+0.015
Scar+0.014
Har+0.013
 arrows+0.013
Neg logit contributions
urg-0.017
ator-0.017
icles-0.016
zie-0.014
rim-0.014
Token the
Token ID262
Feature activation-2.46e+00

Pos logit contributions
ator+0.027
urg+0.027
icles+0.026
rim+0.024
zie+0.023
Neg logit contributions
bs-0.019
 cane-0.018
Scar-0.018
 arrows-0.017
 arrow-0.016
Token sw
Token ID1509
Feature activation-3.83e+00

Pos logit contributions
ator+0.125
icles+0.123
urg+0.121
zie+0.111
ators+0.108
Neg logit contributions
 arrow-0.090
 arrows-0.089
 cane-0.086
 string-0.086
 pigeon-0.084
Tokenan
Token ID272
Feature activation-2.44e+00

Pos logit contributions
icles+0.143
ator+0.139
 Manny+0.134
ators+0.126
why+0.124
Neg logit contributions
 arrows-0.148
 arrow-0.147
bs-0.141
 ar-0.140
ak-0.137
Token.
Token ID13
Feature activation+6.92e-01

Pos logit contributions
ator+0.155
urg+0.145
icles+0.143
zie+0.134
rim+0.133
Neg logit contributions
bs-0.063
 cane-0.053
 arrows-0.051
 arrow-0.048
 string-0.047
Token She
Token ID1375
Feature activation+8.52e-02

Pos logit contributions
bs+0.023
 cane+0.022
 arrows+0.021
Scar+0.021
 stamp+0.021
Neg logit contributions
urg-0.034
icles-0.034
ator-0.033
icle-0.030
zie-0.029
Token thinks
Token ID6834
Feature activation+4.29e-01

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.003
 arrows+0.003
 arrow+0.003
Neg logit contributions
urg-0.004
ator-0.004
icles-0.004
rim-0.003
zie-0.003
Token it
Token ID340
Feature activation-1.30e+00

Pos logit contributions
 cane+0.016
Scar+0.016
bs+0.016
Har+0.015
Mic+0.015
Neg logit contributions
urg-0.020
ator-0.019
icles-0.019
icle-0.016
rim-0.016
Token O
Token ID440
Feature activation-1.14e+00

Pos logit contributions
ator+0.067
urg+0.066
rim+0.058
icles+0.058
ators+0.053
Neg logit contributions
 Jerry-0.047
bs-0.047
 cane-0.045
 arrow-0.044
 long-0.044
Tokenll
Token ID297
Feature activation-3.49e+00

Pos logit contributions
ator+0.038
urg+0.038
icles+0.037
icle+0.031
zie+0.028
Neg logit contributions
bs-0.057
 cane-0.055
Scar-0.055
Har-0.051
 stamp-0.051
Tokenie
Token ID494
Feature activation-1.19e+00

Pos logit contributions
urg+0.116
ator+0.113
umm+0.110
omm+0.104
 department+0.102
Neg logit contributions
able-0.151
 arrow-0.148
 string-0.141
 cane-0.141
bs-0.140
Token.
Token ID13
Feature activation-2.50e+00

Pos logit contributions
urg+0.076
ator+0.075
icles+0.073
rim+0.066
icle+0.066
Neg logit contributions
bs-0.027
 cane-0.026
 arrow-0.022
Scar-0.021
 arrows-0.021
Token O
Token ID440
Feature activation-1.40e+00

Pos logit contributions
ator+0.123
urg+0.107
ators+0.098
rina+0.094
aby+0.092
Neg logit contributions
 long-0.100
bs-0.097
 short-0.095
 Jerry-0.093
 high-0.091
Tokenll
Token ID297
Feature activation-3.82e+00

Pos logit contributions
icles+0.048
urg+0.048
ator+0.047
icle+0.041
umm+0.037
Neg logit contributions
bs-0.066
Scar-0.063
 cane-0.063
able-0.061
Har-0.059
Tokenie
Token ID494
Feature activation-4.67e-01

Pos logit contributions
urg+0.126
ator+0.117
umm+0.115
 department+0.113
icles+0.105
Neg logit contributions
 arrow-0.165
able-0.165
 cane-0.157
 string-0.155
 tied-0.150
Token loved
Token ID6151
Feature activation-1.02e+00

Pos logit contributions
urg+0.022
ator+0.022
icles+0.021
rim+0.018
icle+0.018
Neg logit contributions
bs-0.018
 cane-0.018
Scar-0.017
 arrows-0.016
 arrow-0.016
Token to
Token ID284
Feature activation-8.54e-01

Pos logit contributions
ator+0.061
urg+0.058
icles+0.056
zie+0.052
rim+0.051
Neg logit contributions
 cane-0.030
bs-0.028
 arrows-0.025
 arrow-0.025
 string-0.025
Token play
Token ID711
Feature activation-1.21e+00

Pos logit contributions
ator+0.039
urg+0.037
icles+0.036
rim+0.031
zie+0.031
Neg logit contributions
bs-0.035
 cane-0.035
 arrow-0.032
 string-0.032
Scar-0.032
Token all
Token ID477
Feature activation-7.85e-01

Pos logit contributions
urg+0.072
ator+0.071
icles+0.070
zie+0.065
ric+0.063
Neg logit contributions
 cane-0.031
bs-0.031
 arrows-0.030
 string-0.028
 arrow-0.028
Token said
Token ID531
Feature activation+3.62e-01

Pos logit contributions
bs+0.020
 cane+0.019
Scar+0.019
powers+0.018
able+0.017
Neg logit contributions
urg-0.031
ator-0.030
icles-0.029
zie-0.026
rim-0.026
Token,
Token ID11
Feature activation-8.72e-01

Pos logit contributions
bs+0.010
 cane+0.010
Scar+0.010
 arrows+0.009
 arrow+0.009
Neg logit contributions
urg-0.020
ator-0.020
icles-0.019
icle-0.017
rim-0.017
Token "
Token ID366
Feature activation-6.81e-01

Pos logit contributions
ator+0.044
urg+0.043
icles+0.041
rim+0.038
zie+0.036
Neg logit contributions
bs-0.031
 cane-0.030
Scar-0.029
 confident-0.027
 arrows-0.027
TokenHi
Token ID17250
Feature activation-2.00e+00

Pos logit contributions
urg+0.034
ator+0.033
icles+0.031
rim+0.028
zie+0.027
Neg logit contributions
bs-0.025
 cane-0.024
Scar-0.024
 arrows-0.022
 confident-0.022
Token O
Token ID440
Feature activation-1.92e+00

Pos logit contributions
ator+0.103
urg+0.096
icles+0.093
rim+0.089
zie+0.085
Neg logit contributions
bs-0.066
 cane-0.066
 short-0.062
 long-0.062
 arrows-0.061
Tokenll
Token ID297
Feature activation-3.82e+00

Pos logit contributions
ator+0.066
urg+0.065
icles+0.062
icle+0.052
ators+0.050
Neg logit contributions
bs-0.094
able-0.086
 cane-0.084
Scar-0.084
ite-0.082
Tokenie
Token ID494
Feature activation-1.56e+00

Pos logit contributions
urg+0.142
 overfl+0.137
 department+0.136
umm+0.132
ator+0.127
Neg logit contributions
 arrow-0.141
 stamp-0.133
able-0.128
 cane-0.127
,"-0.125
Token".
Token ID1911
Feature activation-7.05e-01

Pos logit contributions
urg+0.094
ator+0.093
icles+0.090
rim+0.084
icle+0.081
Neg logit contributions
 cane-0.038
bs-0.037
Scar-0.033
 stamp-0.031
 arrow-0.031
Token<|endoftext|>
Token ID50256
Feature activation+2.12e-01

Pos logit contributions
ator+0.040
urg+0.040
icles+0.037
rim+0.035
zie+0.033
Neg logit contributions
bs-0.021
 cane-0.020
Scar-0.018
 arrows-0.017
 arrow-0.017
TokenOnce
Token ID7454
Feature activation-4.44e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.007
 arrows+0.007
 arrow+0.007
Neg logit contributions
ator-0.010
urg-0.010
icles-0.010
rim-0.009
icle-0.009
Token,
Token ID11
Feature activation-1.62e+00

Pos logit contributions
urg+0.016
ator+0.016
icles+0.015
rim+0.013
icle+0.013
Neg logit contributions
bs-0.022
 cane-0.021
Scar-0.020
 arrows-0.019
 arrow-0.019
Token the
Token ID262
Feature activation-2.20e+00

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.003
 arrows+0.003
powers+0.003
Neg logit contributions
ator-0.004
urg-0.004
icles-0.004
rim-0.004
zie-0.004
Token pond
Token ID16723
Feature activation-1.42e-01

Pos logit contributions
urg+0.128
icles+0.125
ator+0.122
icle+0.110
zie+0.109
Neg logit contributions
 cane-0.065
 pigeon-0.065
 arrow-0.063
 string-0.062
 arrows-0.062
Token and
Token ID290
Feature activation-2.71e-01

Pos logit contributions
ator+0.007
urg+0.007
icles+0.007
rim+0.006
icle+0.006
Neg logit contributions
bs-0.005
 cane-0.005
Scar-0.004
 arrows-0.004
 arrow-0.004
Token watch
Token ID2342
Feature activation-1.19e+00

Pos logit contributions
ator+0.013
urg+0.013
icles+0.012
rim+0.011
icle+0.011
Neg logit contributions
bs-0.010
 cane-0.010
Scar-0.009
 arrows-0.009
 arrow-0.009
Token the
Token ID262
Feature activation-2.38e+00

Pos logit contributions
ator+0.066
urg+0.064
icles+0.060
rim+0.057
zie+0.055
Neg logit contributions
bs-0.038
 cane-0.036
Scar-0.033
 arrows-0.033
 arrow-0.032
Token sw
Token ID1509
Feature activation-3.81e+00

Pos logit contributions
ator+0.129
urg+0.129
icles+0.126
zie+0.117
icle+0.113
Neg logit contributions
 pigeon-0.078
 arrows-0.076
 long-0.073
 string-0.073
 arrow-0.073
Tokenans
Token ID504
Feature activation-9.43e-01

Pos logit contributions
icles+0.147
 Manny+0.143
ator+0.142
ators+0.128
icle+0.125
Neg logit contributions
 arrows-0.146
bs-0.139
 ar-0.137
 arrow-0.136
 pigeon-0.136
Token swim
Token ID9422
Feature activation-1.20e+00

Pos logit contributions
ator+0.049
urg+0.048
icles+0.047
rim+0.042
icle+0.041
Neg logit contributions
bs-0.032
 cane-0.031
Scar-0.029
 arrows-0.028
 arrow-0.027
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
ator+0.070
urg+0.070
icles+0.067
zie+0.062
icle+0.060
Neg logit contributions
bs-0.035
 cane-0.032
 arrows-0.030
able-0.028
Scar-0.028
Token She
Token ID1375
Feature activation-5.79e-01

Pos logit contributions
urg+0.013
ator+0.013
icles+0.013
rim+0.011
icle+0.011
Neg logit contributions
bs-0.009
 cane-0.009
Scar-0.009
 arrows-0.008
 arrow-0.008
Token loved
Token ID6151
Feature activation-1.14e+00

Pos logit contributions
ator+0.029
urg+0.028
icles+0.028
rim+0.024
icle+0.024
Neg logit contributions
bs-0.021
 cane-0.020
Scar-0.020
 arrows-0.018
 arrow-0.018
Token resist
Token ID4180
Feature activation-1.29e+00

Pos logit contributions
bs+0.010
Scar+0.010
 cane+0.009
 arrows+0.009
Aud+0.009
Neg logit contributions
urg-0.009
ator-0.009
icles-0.009
rim-0.008
icle-0.008
Token!
Token ID0
Feature activation-8.05e-01

Pos logit contributions
ator+0.067
urg+0.065
icles+0.063
rim+0.056
zie+0.056
Neg logit contributions
bs-0.043
 cane-0.042
Scar-0.038
 arrows-0.037
 string-0.037
Token I
Token ID314
Feature activation-8.24e-01

Pos logit contributions
urg+0.040
ator+0.040
icles+0.038
rim+0.035
zie+0.033
Neg logit contributions
bs-0.029
 cane-0.028
Scar-0.026
 arrows-0.025
 arrow-0.025
Token will
Token ID481
Feature activation-1.04e+00

Pos logit contributions
ator+0.043
urg+0.042
icles+0.041
rim+0.035
icle+0.035
Neg logit contributions
bs-0.030
 cane-0.028
Scar-0.027
 confident-0.025
 arrows-0.024
Token try
Token ID1949
Feature activation-2.30e-01

Pos logit contributions
ator+0.051
urg+0.051
icles+0.049
rim+0.043
icle+0.042
Neg logit contributions
bs-0.038
 cane-0.038
Scar-0.035
 arrow-0.033
 string-0.033
Token the
Token ID262
Feature activation-3.80e+00

Pos logit contributions
urg+0.011
ator+0.011
icles+0.011
rim+0.010
icle+0.009
Neg logit contributions
bs-0.008
 cane-0.008
Scar-0.008
 arrows-0.007
 arrow-0.007
Token o
Token ID267
Feature activation-2.77e+00

Pos logit contributions
ator+0.199
urg+0.182
bled+0.171
aby+0.165
icles+0.159
Neg logit contributions
 long-0.147
 short-0.144
 cane-0.123
 small-0.121
 high-0.120
Tokenat
Token ID265
Feature activation-2.03e+00

Pos logit contributions
ator+0.092
urg+0.091
icles+0.078
maid+0.074
opher+0.072
Neg logit contributions
bs-0.135
br-0.123
 cane-0.119
 short-0.115
 stamp-0.114
Token!"
Token ID2474
Feature activation+4.58e-01

Pos logit contributions
ator+0.100
urg+0.099
icles+0.090
rim+0.088
zie+0.083
Neg logit contributions
bs-0.071
 cane-0.069
Scar-0.060
 stamp-0.060
 string-0.059
Token\n
Token ID198
Feature activation+2.61e-01

Pos logit contributions
bs+0.017
 cane+0.016
Scar+0.016
Aud+0.015
 arrows+0.015
Neg logit contributions
urg-0.022
ator-0.022
icles-0.022
icle-0.019
rim-0.018
Token\n
Token ID198
Feature activation-2.13e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
 arrows+0.007
 arrow+0.007
Neg logit contributions
ator-0.014
urg-0.014
icles-0.013
rim-0.012
icle-0.012
Token walked
Token ID6807
Feature activation-9.33e-01

Pos logit contributions
ator+0.057
urg+0.056
icles+0.053
rim+0.050
icle+0.047
Neg logit contributions
bs-0.033
 cane-0.029
Scar-0.027
 arrows-0.026
 arrow-0.026
Token until
Token ID1566
Feature activation-1.70e-01

Pos logit contributions
urg+0.049
ator+0.048
icles+0.046
rim+0.043
icle+0.041
Neg logit contributions
bs-0.033
 cane-0.031
 arrows-0.029
Scar-0.029
 confident-0.028
Token they
Token ID484
Feature activation-1.26e+00

Pos logit contributions
ator+0.009
urg+0.009
icles+0.009
rim+0.008
icle+0.008
Neg logit contributions
bs-0.005
 cane-0.005
Scar-0.005
 arrows-0.004
 arrow-0.004
Token finally
Token ID3443
Feature activation-1.30e-01

Pos logit contributions
ator+0.075
urg+0.073
icles+0.071
rim+0.066
icle+0.064
Neg logit contributions
bs-0.039
Scar-0.034
 cane-0.032
 short-0.029
Har-0.029
Token reached
Token ID4251
Feature activation-1.05e+00

Pos logit contributions
urg+0.006
ator+0.006
icles+0.006
rim+0.005
zie+0.005
Neg logit contributions
bs-0.005
 cane-0.005
Scar-0.004
 arrows-0.004
 arrow-0.004
Token the
Token ID262
Feature activation-3.78e+00

Pos logit contributions
ator+0.067
urg+0.066
icles+0.063
rim+0.061
icle+0.057
Neg logit contributions
bs-0.023
 cane-0.023
Scar-0.021
 arrows-0.019
 string-0.018
Token o
Token ID267
Feature activation-3.26e+00

Pos logit contributions
urg+0.198
ator+0.190
abies+0.179
icles+0.176
rim+0.166
Neg logit contributions
 long-0.137
 far-0.132
 small-0.128
 big-0.126
 short-0.126
Tokenasis
Token ID17765
Feature activation-1.64e+00

Pos logit contributions
urg+0.113
ator+0.109
icles+0.103
opher+0.093
icle+0.091
Neg logit contributions
bs-0.142
br-0.133
ite-0.129
ak-0.121
 cane-0.118
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
urg+0.101
ator+0.099
icles+0.094
rim+0.091
icle+0.087
Neg logit contributions
bs-0.035
 cane-0.033
 confident-0.029
 short-0.028
Scar-0.028
Token They
Token ID1119
Feature activation-4.47e-01

Pos logit contributions
ator+0.021
urg+0.021
icles+0.020
rim+0.018
icle+0.018
Neg logit contributions
bs-0.015
 cane-0.014
Scar-0.014
 arrows-0.013
 arrow-0.013
Token drank
Token ID24070
Feature activation-4.76e-01

Pos logit contributions
ator+0.022
urg+0.022
icles+0.021
rim+0.019
icle+0.019
Neg logit contributions
bs-0.016
 cane-0.016
Scar-0.015
 arrows-0.014
 arrow-0.014
Token we
Token ID356
Feature activation-1.60e+00

Pos logit contributions
ator+0.005
urg+0.005
icles+0.005
rim+0.004
icle+0.004
Neg logit contributions
bs-0.004
 cane-0.004
Scar-0.004
 arrows-0.003
 confident-0.003
Token say
Token ID910
Feature activation-2.16e+00

Pos logit contributions
ator+0.090
urg+0.090
icles+0.084
rim+0.077
io+0.074
Neg logit contributions
bs-0.052
 cane-0.048
Scar-0.043
๏ฟฝ-0.042
 string-0.041
Token goodbye
Token ID24829
Feature activation-6.56e-01

Pos logit contributions
ator+0.134
urg+0.122
rim+0.116
icles+0.114
zie+0.111
Neg logit contributions
bs-0.066
 cane-0.058
 long-0.056
 short-0.055
 string-0.050
Token to
Token ID284
Feature activation-8.50e-01

Pos logit contributions
ator+0.035
urg+0.034
icles+0.033
rim+0.030
zie+0.029
Neg logit contributions
bs-0.022
 cane-0.021
Scar-0.020
 arrows-0.018
 arrow-0.018
Token our
Token ID674
Feature activation-3.12e+00

Pos logit contributions
ator+0.046
urg+0.045
icles+0.043
rim+0.039
icle+0.038
Neg logit contributions
bs-0.028
 cane-0.027
Scar-0.025
 arrow-0.024
 string-0.024
Token snow
Token ID6729
Feature activation-3.77e+00

Pos logit contributions
ator+0.160
icles+0.152
urg+0.149
rol+0.139
icle+0.138
Neg logit contributions
 cane-0.117
 long-0.099
 string-0.095
 newspaper-0.094
bs-0.093
Tokenman
Token ID805
Feature activation-7.81e-01

Pos logit contributions
urg+0.117
ator+0.108
rol+0.107
icles+0.100
add+0.100
Neg logit contributions
 cane-0.176
 stamp-0.165
 bike-0.156
 guitar-0.156
 rider-0.155
Token first
Token ID717
Feature activation-6.27e-01

Pos logit contributions
urg+0.039
ator+0.039
icles+0.037
rim+0.033
zie+0.032
Neg logit contributions
bs-0.028
 cane-0.027
Scar-0.026
 arrows-0.024
 arrow-0.024
Token?"
Token ID1701
Feature activation+1.48e+00

Pos logit contributions
urg+0.031
ator+0.031
icles+0.029
rim+0.026
zie+0.025
Neg logit contributions
bs-0.023
 cane-0.022
Scar-0.021
 arrows-0.020
 arrow-0.020
Token\n
Token ID198
Feature activation+7.09e-01

Pos logit contributions
bs+0.071
able+0.070
 cane+0.069
powers+0.068
 stamp+0.067
Neg logit contributions
icles-0.047
ator-0.043
urg-0.040
 them-0.040
zie-0.036
Token\n
Token ID198
Feature activation+1.14e+00

Pos logit contributions
bs+0.026
 cane+0.026
Scar+0.024
nam+0.023
 arrows+0.023
Neg logit contributions
ator-0.034
urg-0.034
icles-0.032
rim-0.029
icle-0.028
Token around
Token ID1088
Feature activation-8.75e-01

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
Har+0.001
 arrows+0.001
Neg logit contributions
ator-0.002
urg-0.002
icles-0.002
rim-0.001
icle-0.001
Token the
Token ID262
Feature activation-1.58e+00

Pos logit contributions
urg+0.051
ator+0.050
icles+0.048
rim+0.044
icle+0.043
Neg logit contributions
bs-0.024
 cane-0.024
 arrows-0.021
Scar-0.021
 string-0.020
Token lake
Token ID13546
Feature activation-8.49e-01

Pos logit contributions
urg+0.074
icles+0.070
ator+0.067
icle+0.062
omm+0.059
Neg logit contributions
 cane-0.062
 string-0.060
 arrows-0.059
 arrow-0.059
 long-0.056
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
urg+0.050
ator+0.049
icles+0.047
rim+0.043
icle+0.043
Neg logit contributions
bs-0.023
 cane-0.022
Scar-0.019
 arrows-0.019
 arrow-0.019
Token The
Token ID383
Feature activation-2.45e+00

Pos logit contributions
urg+0.008
ator+0.008
icles+0.008
rim+0.007
icle+0.007
Neg logit contributions
bs-0.006
 cane-0.006
Scar-0.006
 arrows-0.006
 arrow-0.006
Token sw
Token ID1509
Feature activation-3.75e+00

Pos logit contributions
urg+0.143
icles+0.140
ator+0.138
icle+0.125
ators+0.123
Neg logit contributions
 pigeon-0.073
 long-0.073
 arrows-0.073
 arrow-0.073
 string-0.069
Tokenan
Token ID272
Feature activation-1.57e-01

Pos logit contributions
icles+0.145
riage+0.129
 Manny+0.127
icle+0.127
ator+0.127
Neg logit contributions
 arrows-0.149
 arrow-0.141
 ar-0.139
powers-0.134
bs-0.132
Token wanted
Token ID2227
Feature activation-5.58e-01

Pos logit contributions
ator+0.008
urg+0.008
icles+0.007
rim+0.006
zie+0.006
Neg logit contributions
bs-0.006
 cane-0.006
Scar-0.005
 arrows-0.005
 arrow-0.005
Token to
Token ID284
Feature activation-5.90e-01

Pos logit contributions
ator+0.030
urg+0.030
icles+0.028
rim+0.026
zie+0.025
Neg logit contributions
bs-0.018
 cane-0.017
Scar-0.016
 arrows-0.016
 arrow-0.015
Token join
Token ID4654
Feature activation-8.97e-01

Pos logit contributions
ator+0.027
urg+0.027
icles+0.026
rim+0.022
zie+0.022
Neg logit contributions
bs-0.023
 cane-0.023
Scar-0.022
 arrows-0.021
 arrow-0.021
Token the
Token ID262
Feature activation-1.65e+00

Pos logit contributions
ator+0.037
urg+0.036
icles+0.034
rim+0.030
zie+0.030
Neg logit contributions
bs-0.040
 cane-0.039
 arrows-0.038
 arrow-0.037
Scar-0.037
TokenWhen
Token ID2215
Feature activation-2.25e-01

Pos logit contributions
bs+0.040
 cane+0.038
Scar+0.035
 arrows+0.034
 stamps+0.034
Neg logit contributions
ator-0.043
urg-0.042
icles-0.042
rim-0.039
 Esk-0.036
Token Twe
Token ID24205
Feature activation+3.36e-01

Pos logit contributions
ator+0.013
urg+0.013
icles+0.013
rim+0.012
icle+0.011
Neg logit contributions
bs-0.006
 cane-0.006
Scar-0.005
 arrows-0.005
 arrow-0.005
Tokenety
Token ID2963
Feature activation-7.18e-01

Pos logit contributions
 cane+0.014
bs+0.014
Scar+0.013
 arrows+0.013
 confident+0.013
Neg logit contributions
urg-0.014
ator-0.014
icles-0.014
rim-0.012
zie-0.012
Token arrived
Token ID5284
Feature activation-1.86e-01

Pos logit contributions
ator+0.037
urg+0.037
icles+0.036
rim+0.032
icle+0.031
Neg logit contributions
bs-0.025
 cane-0.024
Scar-0.022
 arrow-0.021
 arrows-0.021
Token at
Token ID379
Feature activation-1.02e+00

Pos logit contributions
ator+0.009
urg+0.009
icles+0.008
rim+0.007
icle+0.007
Neg logit contributions
bs-0.007
 cane-0.007
Scar-0.007
 arrows-0.006
 arrow-0.006
Token the
Token ID262
Feature activation-3.74e+00

Pos logit contributions
urg+0.063
ator+0.061
icles+0.059
rim+0.056
icle+0.052
Neg logit contributions
 cane-0.025
bs-0.024
Scar-0.022
 arrow-0.021
 string-0.021
Token o
Token ID267
Feature activation-3.14e+00

Pos logit contributions
urg+0.189
ator+0.183
icles+0.180
abies+0.173
ators+0.173
Neg logit contributions
 far-0.135
 long-0.132
 big-0.131
 small-0.128
 high-0.122
Tokenasis
Token ID17765
Feature activation-9.97e-01

Pos logit contributions
urg+0.115
ator+0.106
icles+0.105
icle+0.093
opher+0.092
Neg logit contributions
bs-0.136
ite-0.126
br-0.122
able-0.117
 she-0.117
Token,
Token ID11
Feature activation+7.83e-01

Pos logit contributions
urg+0.067
ator+0.066
icles+0.063
rim+0.060
icle+0.058
Neg logit contributions
bs-0.018
 cane-0.017
Scar-0.014
 arrows-0.013
 arrow-0.013
Token she
Token ID673
Feature activation-7.27e-01

Pos logit contributions
bs+0.020
Scar+0.018
 cane+0.017
Aud+0.017
powers+0.017
Neg logit contributions
icles-0.046
ator-0.045
urg-0.044
 Esk-0.041
icle-0.040
Token saw
Token ID2497
Feature activation-3.10e-01

Pos logit contributions
ator+0.039
urg+0.039
icles+0.038
icle+0.034
rim+0.034
Neg logit contributions
bs-0.024
 cane-0.023
Scar-0.022
 arrows-0.020
 arrow-0.020
Token the
Token ID262
Feature activation-2.22e+00

Pos logit contributions
bs+0.040
 cane+0.036
able+0.035
powers+0.034
Scar+0.034
Neg logit contributions
urg-0.043
ator-0.042
icles-0.038
 Esk-0.035
zie-0.034
Token park
Token ID3952
Feature activation+1.73e-01

Pos logit contributions
urg+0.125
icles+0.122
ator+0.118
icle+0.107
zie+0.103
Neg logit contributions
 cane-0.072
 pigeon-0.069
 arrow-0.068
 string-0.068
 long-0.067
Token to
Token ID284
Feature activation+1.39e-01

Pos logit contributions
bs+0.006
 cane+0.006
Scar+0.006
 arrows+0.005
 arrow+0.005
Neg logit contributions
ator-0.009
urg-0.009
icles-0.008
rim-0.007
zie-0.007
Token study
Token ID2050
Feature activation+5.63e-01

Pos logit contributions
bs+0.005
 cane+0.005
Scar+0.005
 arrows+0.005
powers+0.005
Neg logit contributions
ator-0.006
urg-0.006
icles-0.006
rim-0.005
zie-0.005
Token the
Token ID262
Feature activation-2.14e+00

Pos logit contributions
bs+0.019
 cane+0.019
Scar+0.019
Har+0.017
powers+0.017
Neg logit contributions
ator-0.028
urg-0.028
icles-0.027
rim-0.024
icle-0.023
Token sw
Token ID1509
Feature activation-3.73e+00

Pos logit contributions
urg+0.113
icles+0.109
ator+0.107
icle+0.097
zie+0.093
Neg logit contributions
 cane-0.075
 arrows-0.075
 string-0.074
 arrow-0.073
 long-0.072
Tokenan
Token ID272
Feature activation-1.21e+00

Pos logit contributions
icles+0.140
 Manny+0.126
ator+0.125
icle+0.118
urg+0.116
Neg logit contributions
 arrows-0.153
 arrow-0.147
 ar-0.146
bs-0.140
ak-0.139
Token and
Token ID290
Feature activation+3.02e-01

Pos logit contributions
urg+0.067
ator+0.067
icles+0.064
rim+0.059
icle+0.058
Neg logit contributions
bs-0.037
 cane-0.035
 arrows-0.032
Scar-0.032
 arrow-0.031
Token other
Token ID584
Feature activation-2.06e+00

Pos logit contributions
bs+0.012
 cane+0.012
Scar+0.012
powers+0.011
able+0.011
Neg logit contributions
ator-0.013
urg-0.013
icles-0.012
rim-0.011
zie-0.011
Token animals
Token ID4695
Feature activation+5.90e-02

Pos logit contributions
ator+0.105
urg+0.100
icles+0.099
icle+0.092
zie+0.088
Neg logit contributions
 arrows-0.075
bs-0.073
 cane-0.072
able-0.070
 arrow-0.069
Token.
Token ID13
Feature activation-7.13e-01

Pos logit contributions
bs+0.002
 cane+0.002
Scar+0.002
 arrows+0.002
 arrow+0.002
Neg logit contributions
ator-0.003
urg-0.003
icles-0.003
rim-0.003
icle-0.003
Token by
Token ID416
Feature activation-6.65e-01

Pos logit contributions
Scar+0.074
 cane+0.071
bs+0.070
Jennifer+0.068
Michelle+0.067
Neg logit contributions
ator-0.059
urg-0.057
rim-0.052
icles-0.051
 Esk-0.045
Token the
Token ID262
Feature activation-2.34e+00

Pos logit contributions
ator+0.034
urg+0.033
icles+0.032
rim+0.029
icle+0.028
Neg logit contributions
bs-0.023
 cane-0.023
Scar-0.021
 arrows-0.021
 arrow-0.021
Token lake
Token ID13546
Feature activation-1.01e+00

Pos logit contributions
ator+0.117
icles+0.117
urg+0.114
icle+0.102
zie+0.101
Neg logit contributions
 cane-0.085
 long-0.085
 arrows-0.084
 arrow-0.084
 string-0.082
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
ator+0.060
urg+0.059
icles+0.057
rim+0.052
zie+0.051
Neg logit contributions
bs-0.028
 cane-0.027
 arrows-0.023
 confident-0.022
 arrow-0.022
Token The
Token ID383
Feature activation-2.39e+00

Pos logit contributions
bs+0.007
 cane+0.007
Scar+0.007
 arrows+0.006
 confident+0.006
Neg logit contributions
ator-0.009
urg-0.009
icles-0.009
icle-0.007
zie-0.007
Token sw
Token ID1509
Feature activation-3.73e+00

Pos logit contributions
urg+0.128
ator+0.126
icles+0.123
ators+0.113
zie+0.107
Neg logit contributions
 arrow-0.082
 string-0.079
 long-0.077
 arrows-0.075
 pigeon-0.075
Tokenan
Token ID272
Feature activation-8.21e-01

Pos logit contributions
icles+0.142
riage+0.126
icle+0.121
 department+0.119
 Manny+0.119
Neg logit contributions
 arrows-0.144
 arrow-0.139
 ar-0.135
powers-0.134
bs-0.131
Token was
Token ID373
Feature activation-7.39e-01

Pos logit contributions
urg+0.042
ator+0.041
icles+0.040
icle+0.035
rim+0.035
Neg logit contributions
bs-0.030
 cane-0.028
Scar-0.027
 arrows-0.026
 arrow-0.026
Token so
Token ID523
Feature activation+4.52e-02

Pos logit contributions
ator+0.043
urg+0.042
icles+0.041
rim+0.037
icle+0.036
Neg logit contributions
bs-0.020
 cane-0.020
Scar-0.019
 confident-0.018
 arrows-0.018
Token graceful
Token ID44363
Feature activation-7.74e-03

Pos logit contributions
 cane+0.001
bs+0.001
Scar+0.001
 arrows+0.001
 arrow+0.001
Neg logit contributions
urg-0.003
ator-0.003
icles-0.002
rim-0.002
icle-0.002
Token that
Token ID326
Feature activation-8.91e-01

Pos logit contributions
urg+0.000
ator+0.000
icles+0.000
rim+0.000
icle+0.000
Neg logit contributions
bs-0.000
 cane-0.000
Scar-0.000
 arrows-0.000
Har-0.000
Token sun
Token ID4252
Feature activation-2.00e+00

Pos logit contributions
ator+0.114
urg+0.114
icles+0.109
ators+0.099
zie+0.095
Neg logit contributions
 arrow-0.072
 pigeon-0.069
 cane-0.069
 string-0.069
 long-0.068
Token came
Token ID1625
Feature activation-1.16e+00

Pos logit contributions
ator+0.105
urg+0.103
icles+0.096
rim+0.090
zie+0.086
Neg logit contributions
bs-0.068
 cane-0.066
Scar-0.057
 string-0.057
 arrow-0.056
Token up
Token ID510
Feature activation+7.16e-02

Pos logit contributions
ator+0.061
urg+0.060
icles+0.056
rim+0.055
zie+0.051
Neg logit contributions
 cane-0.037
bs-0.037
 arrows-0.034
Scar-0.034
 arrow-0.033
Token,
Token ID11
Feature activation-3.97e-02

Pos logit contributions
bs+0.002
 cane+0.002
Scar+0.002
 arrows+0.002
powers+0.002
Neg logit contributions
urg-0.004
ator-0.004
icles-0.004
rim-0.003
icle-0.003
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
urg+0.002
ator+0.002
icles+0.002
rim+0.002
icle+0.002
Neg logit contributions
bs-0.001
 cane-0.001
Scar-0.001
 arrows-0.001
powers-0.001
Token sw
Token ID1509
Feature activation-3.72e+00

Pos logit contributions
urg+0.074
ator+0.074
icles+0.071
zie+0.061
ators+0.060
Neg logit contributions
 cane-0.059
 arrow-0.058
 string-0.056
 arrows-0.056
bs-0.055
Tokenan
Token ID272
Feature activation-1.31e+00

Pos logit contributions
icles+0.139
ator+0.132
 Manny+0.126
ators+0.121
icle+0.114
Neg logit contributions
 arrows-0.151
 arrow-0.147
bs-0.142
 pigeon-0.141
 ar-0.140
Token would
Token ID561
Feature activation-1.58e+00

Pos logit contributions
ator+0.066
urg+0.065
icles+0.063
rim+0.055
icle+0.055
Neg logit contributions
bs-0.049
 cane-0.047
Scar-0.044
 arrows-0.042
 arrow-0.042
Token wake
Token ID7765
Feature activation-1.89e+00

Pos logit contributions
ator+0.077
urg+0.076
icles+0.074
rim+0.065
io+0.064
Neg logit contributions
 cane-0.057
bs-0.057
Scar-0.051
 string-0.051
 arrow-0.051
Token up
Token ID510
Feature activation-3.14e-01

Pos logit contributions
ator+0.087
urg+0.084
icles+0.080
rim+0.077
umm+0.074
Neg logit contributions
 cane-0.065
bs-0.064
 arrows-0.061
 arrow-0.060
Scar-0.058
Token again
Token ID757
Feature activation+5.81e-02

Pos logit contributions
ator+0.017
urg+0.017
icles+0.016
rim+0.015
icle+0.014
Neg logit contributions
bs-0.010
 cane-0.009
Scar-0.009
 arrows-0.008
 arrow-0.008
Token\n
Token ID198
Feature activation-4.17e-03

Pos logit contributions
bs+0.014
 cane+0.014
Scar+0.013
 arrows+0.013
 arrow+0.013
Neg logit contributions
ator-0.023
urg-0.022
icles-0.022
rim-0.020
zie-0.019
TokenOne
Token ID3198
Feature activation-2.09e-01

Pos logit contributions
ator+0.000
urg+0.000
icles+0.000
rim+0.000
zie+0.000
Neg logit contributions
bs-0.000
 cane-0.000
Scar-0.000
 arrows-0.000
 arrow-0.000
Token morning
Token ID3329
Feature activation+5.32e-01

Pos logit contributions
urg+0.008
ator+0.008
icles+0.007
rim+0.006
icle+0.006
Neg logit contributions
bs-0.010
 cane-0.010
Scar-0.009
 arrows-0.009
 arrow-0.009
Token,
Token ID11
Feature activation+2.64e-01

Pos logit contributions
 cane+0.016
bs+0.016
Scar+0.016
 arrows+0.015
powers+0.015
Neg logit contributions
urg-0.028
ator-0.027
icles-0.027
icle-0.024
rim-0.024
Token the
Token ID262
Feature activation-1.90e+00

Pos logit contributions
bs+0.009
 cane+0.008
Scar+0.008
powers+0.008
 arrows+0.008
Neg logit contributions
urg-0.013
ator-0.013
icles-0.013
icle-0.011
rim-0.011
Token sw
Token ID1509
Feature activation-3.71e+00

Pos logit contributions
urg+0.099
ator+0.097
icles+0.097
zie+0.084
ators+0.083
Neg logit contributions
 arrow-0.067
 string-0.066
 pigeon-0.066
 long-0.064
 cane-0.064
Tokenan
Token ID272
Feature activation-1.41e+00

Pos logit contributions
icles+0.153
 Manny+0.133
riage+0.130
icle+0.128
ator+0.127
Neg logit contributions
 arrows-0.133
 arrow-0.129
 ar-0.128
 pigeon-0.128
powers-0.125
Token discovered
Token ID5071
Feature activation+5.41e-03

Pos logit contributions
ator+0.080
urg+0.079
icles+0.078
icle+0.068
rim+0.068
Neg logit contributions
bs-0.044
 cane-0.042
Scar-0.038
 arrow-0.037
 arrows-0.037
Token something
Token ID1223
Feature activation+6.31e-01

Pos logit contributions
bs+0.000
 cane+0.000
Scar+0.000
 arrows+0.000
powers+0.000
Neg logit contributions
ator-0.000
urg-0.000
icles-0.000
rim-0.000
icle-0.000
Token very
Token ID845
Feature activation+3.82e-01

Pos logit contributions
bs+0.024
Scar+0.023
 cane+0.022
Har+0.020
powers+0.020
Neg logit contributions
ator-0.029
urg-0.029
icles-0.028
rim-0.025
icle-0.024
Token unique
Token ID3748
Feature activation+1.76e-01

Pos logit contributions
bs+0.013
Scar+0.013
 cane+0.013
Aud+0.012
Har+0.012
Neg logit contributions
urg-0.019
ator-0.019
icles-0.017
rim-0.016
 Esk-0.015
Token O
Token ID440
Feature activation-9.39e-01

Pos logit contributions
ator+0.108
urg+0.106
rim+0.094
umm+0.091
ators+0.090
Neg logit contributions
 Jerry-0.079
 long-0.075
 arrow-0.071
 string-0.071
bs-0.070
Tokenll
Token ID297
Feature activation-3.39e+00

Pos logit contributions
urg+0.030
ator+0.030
icles+0.029
icle+0.023
zie+0.022
Neg logit contributions
bs-0.049
 cane-0.048
Scar-0.047
Har-0.044
 stamp-0.044
Tokenie
Token ID494
Feature activation-8.11e-01

Pos logit contributions
urg+0.110
ator+0.101
umm+0.097
 department+0.095
 overfl+0.090
Neg logit contributions
able-0.150
 arrow-0.149
 cane-0.144
 string-0.144
 arrows-0.139
Token.
Token ID13
Feature activation-2.81e+00

Pos logit contributions
urg+0.048
ator+0.048
icles+0.047
rim+0.042
icle+0.041
Neg logit contributions
bs-0.021
 cane-0.021
Scar-0.018
 arrow-0.018
 arrows-0.018
Token O
Token ID440
Feature activation-1.33e+00

Pos logit contributions
ator+0.139
urg+0.121
ators+0.110
aby+0.108
rim+0.106
Neg logit contributions
 long-0.109
 Jerry-0.102
 short-0.101
bs-0.099
 high-0.097
Tokenll
Token ID297
Feature activation-3.70e+00

Pos logit contributions
urg+0.045
icles+0.044
ator+0.044
icle+0.037
zie+0.034
Neg logit contributions
bs-0.063
 cane-0.061
Scar-0.061
able-0.058
Har-0.058
Tokenie
Token ID494
Feature activation-5.94e-01

Pos logit contributions
urg+0.119
 department+0.106
ator+0.105
umm+0.102
 overfl+0.099
Neg logit contributions
 arrow-0.167
able-0.162
 cane-0.158
 string-0.158
 tied-0.151
Token was
Token ID373
Feature activation-4.24e-01

Pos logit contributions
urg+0.028
ator+0.028
icles+0.027
icle+0.023
rim+0.023
Neg logit contributions
bs-0.023
 cane-0.023
Scar-0.021
 arrows-0.021
 arrow-0.021
Token very
Token ID845
Feature activation-3.26e-01

Pos logit contributions
ator+0.024
urg+0.024
icles+0.023
rim+0.021
icle+0.020
Neg logit contributions
bs-0.012
 cane-0.012
Scar-0.011
 arrows-0.011
 arrow-0.010
Token adventurous
Token ID39770
Feature activation+2.46e-01

Pos logit contributions
ator+0.019
urg+0.019
icles+0.019
rim+0.017
icle+0.017
Neg logit contributions
bs-0.008
 cane-0.008
Scar-0.008
 arrows-0.007
 arrow-0.007
Token and
Token ID290
Feature activation-3.56e-01

Pos logit contributions
bs+0.008
Scar+0.008
 cane+0.007
Aud+0.007
Har+0.007
Neg logit contributions
ator-0.013
urg-0.013
icles-0.013
rim-0.011
zie-0.011
Tokena
Token ID64
Feature activation-2.82e-01

Pos logit contributions
ator+0.121
urg+0.119
icles+0.119
zie+0.110
icle+0.106
Neg logit contributions
bs-0.036
able-0.025
 cane-0.025
 stamp-0.025
old-0.023
Token and
Token ID290
Feature activation-6.94e-01

Pos logit contributions
ator+0.015
urg+0.015
icles+0.014
rim+0.013
icle+0.013
Neg logit contributions
bs-0.009
 cane-0.009
Scar-0.008
 arrows-0.008
 arrow-0.008
Token Billy
Token ID15890
Feature activation-2.78e-01

Pos logit contributions
ator+0.032
urg+0.032
icles+0.030
rim+0.027
zie+0.026
Neg logit contributions
bs-0.027
 cane-0.027
Scar-0.025
 arrow-0.024
 arrows-0.024
Token stayed
Token ID9658
Feature activation-1.37e+00

Pos logit contributions
urg+0.013
ator+0.013
icles+0.013
rim+0.011
icle+0.011
Neg logit contributions
bs-0.011
 cane-0.010
Scar-0.010
 arrows-0.009
 arrow-0.009
Token by
Token ID416
Feature activation-1.10e+00

Pos logit contributions
icles+0.082
ator+0.080
urg+0.080
icle+0.074
zie+0.071
Neg logit contributions
bs-0.036
 cane-0.035
 confident-0.033
Scar-0.032
 arrows-0.030
Token the
Token ID262
Feature activation-3.70e+00

Pos logit contributions
ator+0.067
urg+0.065
icles+0.064
rim+0.059
icle+0.057
Neg logit contributions
bs-0.026
 cane-0.025
 arrow-0.023
 string-0.023
Scar-0.022
Token o
Token ID267
Feature activation-3.07e+00

Pos logit contributions
ator+0.187
icles+0.181
urg+0.177
ators+0.169
icle+0.167
Neg logit contributions
 long-0.133
 small-0.128
 far-0.124
 high-0.120
 big-0.118
Tokenasis
Token ID17765
Feature activation-1.32e+00

Pos logit contributions
urg+0.102
ator+0.100
icles+0.097
opher+0.089
maid+0.086
Neg logit contributions
bs-0.138
ite-0.122
able-0.118
br-0.114
ak-0.113
Token until
Token ID1566
Feature activation-1.16e+00

Pos logit contributions
urg+0.069
ator+0.068
icles+0.066
rim+0.060
icle+0.059
Neg logit contributions
bs-0.042
 cane-0.041
Scar-0.036
 confident-0.036
 string-0.035
Token they
Token ID484
Feature activation-1.44e+00

Pos logit contributions
ator+0.070
urg+0.069
icles+0.065
rim+0.064
zie+0.059
Neg logit contributions
bs-0.032
 cane-0.030
Scar-0.027
 arrow-0.026
 Jerry-0.026
Token were
Token ID547
Feature activation-1.10e+00

Pos logit contributions
ator+0.083
urg+0.081
icles+0.079
rim+0.071
icle+0.070
Neg logit contributions
bs-0.048
Scar-0.041
 cane-0.040
Har-0.036
 confident-0.035
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
bs+0.013
 cane+0.012
Scar+0.011
 arrows+0.011
powers+0.011
Neg logit contributions
ator-0.016
urg-0.016
icles-0.015
zie-0.013
rim-0.013
Token park
Token ID3952
Feature activation+9.80e-02

Pos logit contributions
urg+0.072
icles+0.070
ator+0.070
icle+0.061
rim+0.058
Neg logit contributions
 cane-0.039
bs-0.036
Scar-0.036
 arrow-0.035
 arrows-0.035
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.003
 arrows+0.003
 arrow+0.003
Neg logit contributions
ator-0.005
urg-0.005
icles-0.005
rim-0.004
zie-0.004
Token watching
Token ID4964
Feature activation-1.04e+00

Pos logit contributions
ator+0.006
urg+0.006
icles+0.005
rim+0.005
icle+0.005
Neg logit contributions
bs-0.005
 cane-0.005
Scar-0.005
 arrows-0.005
 arrow-0.004
Token the
Token ID262
Feature activation-2.30e+00

Pos logit contributions
ator+0.057
urg+0.055
icles+0.053
rim+0.049
zie+0.048
Neg logit contributions
bs-0.033
 cane-0.031
 arrows-0.030
Scar-0.029
 arrow-0.029
Token sw
Token ID1509
Feature activation-3.69e+00

Pos logit contributions
ator+0.116
urg+0.115
icles+0.114
icle+0.102
zie+0.100
Neg logit contributions
 arrows-0.082
 arrow-0.082
 pigeon-0.081
 string-0.080
 long-0.079
Tokenan
Token ID272
Feature activation-1.52e+00

Pos logit contributions
icles+0.143
ator+0.130
 Manny+0.128
icle+0.120
riage+0.120
Neg logit contributions
 arrows-0.148
 arrow-0.143
 ar-0.140
ak-0.136
bs-0.135
Token and
Token ID290
Feature activation-9.61e-01

Pos logit contributions
ator+0.085
urg+0.082
icles+0.080
rim+0.073
icle+0.072
Neg logit contributions
bs-0.047
 cane-0.043
 arrows-0.041
 arrow-0.040
 confident-0.039
Token talking
Token ID3375
Feature activation-1.14e+00

Pos logit contributions
ator+0.047
urg+0.047
icles+0.045
rim+0.040
icle+0.039
Neg logit contributions
bs-0.035
 cane-0.034
Scar-0.032
 arrows-0.032
 arrow-0.031
Token about
Token ID546
Feature activation-5.91e-01

Pos logit contributions
ator+0.063
urg+0.062
icles+0.059
rim+0.055
zie+0.054
Neg logit contributions
bs-0.036
 cane-0.034
 arrows-0.032
Scar-0.032
 arrow-0.030
Token all
Token ID477
Feature activation-1.66e-01

Pos logit contributions
ator+0.030
urg+0.030
icles+0.029
rim+0.026
icle+0.025
Neg logit contributions
bs-0.020
 cane-0.020
Scar-0.019
 arrows-0.018
 arrow-0.018
Token asked
Token ID1965
Feature activation-9.44e-01

Pos logit contributions
ator+0.030
urg+0.030
icles+0.029
rim+0.026
icle+0.026
Neg logit contributions
bs-0.016
 cane-0.016
Scar-0.015
 arrows-0.014
 arrow-0.014
Token the
Token ID262
Feature activation-5.64e-01

Pos logit contributions
ator+0.057
urg+0.056
icles+0.054
rim+0.051
icle+0.048
Neg logit contributions
bs-0.026
 cane-0.023
Scar-0.022
 arrows-0.019
 short-0.019
Token daughter
Token ID4957
Feature activation-1.82e-02

Pos logit contributions
urg+0.029
ator+0.029
icles+0.028
rim+0.025
icle+0.025
Neg logit contributions
bs-0.019
 cane-0.019
Scar-0.018
 arrows-0.017
 arrow-0.017
Token,
Token ID11
Feature activation-3.15e-01

Pos logit contributions
ator+0.001
urg+0.001
icles+0.001
rim+0.001
icle+0.001
Neg logit contributions
bs-0.001
 cane-0.001
Scar-0.001
 arrows-0.001
 arrow-0.001
Token "
Token ID366
Feature activation-1.12e+00

Pos logit contributions
ator+0.015
urg+0.015
icles+0.014
rim+0.012
icle+0.012
Neg logit contributions
bs-0.012
 cane-0.012
Scar-0.012
 arrows-0.011
 arrow-0.011
TokenWhat
Token ID2061
Feature activation+2.49e+00

Pos logit contributions
urg+0.049
ator+0.046
icles+0.043
rim+0.038
zie+0.037
Neg logit contributions
Scar-0.049
 cane-0.049
bs-0.048
Har-0.046
 stamp-0.045
Token do
Token ID466
Feature activation+8.68e-01

Pos logit contributions
bs+0.120
Mic+0.118
Har+0.115
 confident+0.114
Scar+0.113
Neg logit contributions
urg-0.070
icles-0.065
ator-0.063
 Esk-0.056
men-0.056
Token you
Token ID345
Feature activation+1.21e-01

Pos logit contributions
 cane+0.037
bs+0.036
Scar+0.036
 arrows+0.034
 confident+0.033
Neg logit contributions
ator-0.035
urg-0.034
icles-0.033
rim-0.029
icle-0.028
Token want
Token ID765
Feature activation-6.29e-02

Pos logit contributions
bs+0.004
 cane+0.004
Scar+0.004
 arrows+0.003
powers+0.003
Neg logit contributions
ator-0.006
urg-0.006
icles-0.006
rim-0.006
zie-0.006
Token for
Token ID329
Feature activation-9.18e-01

Pos logit contributions
ator+0.003
urg+0.003
icles+0.003
rim+0.003
icle+0.003
Neg logit contributions
bs-0.002
 cane-0.002
Scar-0.002
 arrows-0.002
 arrow-0.002
Token your
Token ID534
Feature activation-1.18e+00

Pos logit contributions
ator+0.044
urg+0.043
icles+0.042
rim+0.038
zie+0.037
Neg logit contributions
 cane-0.033
bs-0.033
Scar-0.032
 string-0.030
 arrows-0.030
Token floor
Token ID4314
Feature activation+5.51e-01

Pos logit contributions
bs+0.023
Scar+0.022
 cane+0.021
 confident+0.020
 arrows+0.020
Neg logit contributions
ator-0.018
urg-0.017
icles-0.017
icle-0.014
zie-0.014
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
bs+0.019
 cane+0.018
Scar+0.018
 arrows+0.017
Har+0.016
Neg logit contributions
ator-0.028
urg-0.028
icles-0.027
icle-0.024
rim-0.024
Token Lily
Token ID20037
Feature activation+7.84e-01

Pos logit contributions
 cane+0.016
bs+0.016
Scar+0.015
 arrows+0.014
 arrow+0.014
Neg logit contributions
urg-0.018
ator-0.018
icles-0.018
icle-0.016
rim-0.015
Token picks
Token ID11103
Feature activation-1.09e-01

Pos logit contributions
bs+0.031
 cane+0.031
Scar+0.030
nam+0.029
 arrows+0.029
Neg logit contributions
urg-0.033
ator-0.032
icles-0.031
zie-0.028
ric-0.027
Token up
Token ID510
Feature activation+9.19e-01

Pos logit contributions
urg+0.004
ator+0.004
icles+0.004
rim+0.003
icle+0.003
Neg logit contributions
bs-0.005
 cane-0.005
Scar-0.005
 arrows-0.005
 confident-0.004
Token a
Token ID257
Feature activation+2.00e+00

Pos logit contributions
bs+0.035
powers+0.034
Scar+0.034
 cane+0.033
 confident+0.032
Neg logit contributions
icles-0.039
urg-0.038
ator-0.037
icle-0.033
io-0.032
Token shoe
Token ID17292
Feature activation+8.17e-01

Pos logit contributions
bs+0.076
Har+0.071
Scar+0.071
 Jerry+0.069
powers+0.069
Neg logit contributions
ator-0.076
icles-0.075
urg-0.073
icle-0.072
maid-0.067
Token and
Token ID290
Feature activation+1.68e+00

Pos logit contributions
bs+0.029
Scar+0.027
 cane+0.026
Har+0.026
Jennifer+0.025
Neg logit contributions
ator-0.040
icles-0.039
urg-0.038
icle-0.035
zie-0.033
Token tries
Token ID8404
Feature activation+6.39e-01

Pos logit contributions
bs+0.062
Mic+0.061
Har+0.060
Michelle+0.059
able+0.059
Neg logit contributions
urg-0.070
icles-0.066
ator-0.063
zie-0.057
 Esk-0.056
Token to
Token ID284
Feature activation+8.46e-01

Pos logit contributions
bs+0.028
Scar+0.027
 cane+0.027
Har+0.025
 arrows+0.025
Neg logit contributions
icles-0.024
urg-0.024
ator-0.024
icle-0.021
zie-0.020
Token scan
Token ID9367
Feature activation-4.10e-02

Pos logit contributions
bs+0.033
Scar+0.030
 cane+0.030
able+0.029
powers+0.029
Neg logit contributions
ator-0.039
urg-0.038
icles-0.037
icle-0.034
zie-0.033
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
bs+0.013
Scar+0.013
 cane+0.012
 arrows+0.011
Har+0.011
Neg logit contributions
ator-0.017
urg-0.017
icles-0.016
rim-0.014
icle-0.014
Token They
Token ID1119
Feature activation+8.26e-01

Pos logit contributions
bs+0.018
 cane+0.017
 arrows+0.016
Scar+0.016
 arrow+0.016
Neg logit contributions
icles-0.020
urg-0.020
ator-0.019
zie-0.017
icle-0.017
Token have
Token ID423
Feature activation+3.00e-01

Pos logit contributions
 cane+0.031
powers+0.030
Scar+0.030
bs+0.030
 arrows+0.028
Neg logit contributions
ator-0.035
urg-0.034
icles-0.033
zie-0.032
 Esk-0.029
Token a
Token ID257
Feature activation+5.92e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.007
 arrows+0.007
 confident+0.007
Neg logit contributions
urg-0.017
ator-0.017
icles-0.016
zie-0.015
rim-0.015
Token ball
Token ID2613
Feature activation+1.01e+00

Pos logit contributions
bs+0.020
Scar+0.019
 cane+0.018
 confident+0.017
Har+0.017
Neg logit contributions
ator-0.029
urg-0.029
icles-0.027
rim-0.025
 ig-0.024
Token and
Token ID290
Feature activation+2.12e+00

Pos logit contributions
Scar+0.040
Har+0.036
bs+0.036
 cane+0.036
nam+0.034
Neg logit contributions
ator-0.046
icles-0.045
urg-0.044
icle-0.040
 ig-0.039
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
Scar+0.072
Har+0.072
 confident+0.070
Charl+0.068
 cane+0.067
Neg logit contributions
icles-0.090
urg-0.086
icle-0.078
 M-0.075
asted-0.073
Token net
Token ID2010
Feature activation+4.80e-01

Pos logit contributions
bs+0.044
Scar+0.040
Har+0.039
 confident+0.038
old+0.037
Neg logit contributions
ator-0.054
urg-0.053
icles-0.051
icle-0.046
 ig-0.046
Token.
Token ID13
Feature activation+3.11e-01

Pos logit contributions
Scar+0.017
 cane+0.017
bs+0.017
Har+0.016
 arrows+0.015
Neg logit contributions
ator-0.023
urg-0.023
icles-0.022
rim-0.020
icle-0.019
Token They
Token ID1119
Feature activation+1.17e+00

Pos logit contributions
bs+0.012
 cane+0.012
Scar+0.011
 arrows+0.011
 arrow+0.011
Neg logit contributions
icles-0.014
urg-0.014
ator-0.014
icle-0.012
zie-0.012
Token throw
Token ID3714
Feature activation+9.65e-01

Pos logit contributions
Scar+0.044
 cane+0.044
powers+0.044
Har+0.043
bs+0.043
Neg logit contributions
urg-0.045
ator-0.044
icles-0.043
zie-0.040
 Esk-0.037
Token sat
Token ID3332
Feature activation-7.34e-01

Pos logit contributions
urg+0.083
ator+0.080
icles+0.078
rim+0.070
icle+0.068
Neg logit contributions
bs-0.037
 cane-0.035
 arrow-0.032
 confident-0.031
Scar-0.031
Token in
Token ID287
Feature activation-3.59e-01

Pos logit contributions
urg+0.039
ator+0.038
icles+0.037
rim+0.034
icle+0.033
Neg logit contributions
 cane-0.023
bs-0.023
Scar-0.022
 arrows-0.021
 arrow-0.020
Token the
Token ID262
Feature activation-6.98e-01

Pos logit contributions
urg+0.020
ator+0.020
icles+0.020
rim+0.018
icle+0.017
Neg logit contributions
bs-0.010
 cane-0.010
Scar-0.009
 arrows-0.009
 arrow-0.009
Token yard
Token ID12699
Feature activation-1.01e-01

Pos logit contributions
urg+0.035
ator+0.033
icles+0.032
rim+0.029
icle+0.027
Neg logit contributions
 cane-0.027
bs-0.026
Scar-0.024
 string-0.023
 arrow-0.023
Token all
Token ID477
Feature activation-1.32e+00

Pos logit contributions
ator+0.005
urg+0.005
icles+0.005
rim+0.005
icle+0.005
Neg logit contributions
bs-0.003
 cane-0.003
Scar-0.003
 arrows-0.003
 arrow-0.003
Token day
Token ID1110
Feature activation+2.45e+00

Pos logit contributions
urg+0.068
ator+0.062
icles+0.060
rim+0.054
urger+0.053
Neg logit contributions
bs-0.048
able-0.045
 cane-0.045
 confident-0.043
 tangled-0.042
Token,
Token ID11
Feature activation-7.73e-01

Pos logit contributions
Scar+0.103
Aud+0.102
bs+0.096
 arrows+0.096
able+0.093
Neg logit contributions
urg-0.097
ator-0.097
 setting-0.088
icles-0.086
aby-0.075
Token feeling
Token ID4203
Feature activation+3.36e-02

Pos logit contributions
urg+0.041
ator+0.041
icles+0.038
rim+0.035
icle+0.034
Neg logit contributions
 cane-0.026
bs-0.025
Scar-0.023
 confident-0.023
 arrows-0.022
Token sad
Token ID6507
Feature activation-2.50e-01

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
 arrows+0.001
 arrow+0.001
Neg logit contributions
urg-0.002
ator-0.002
icles-0.002
rim-0.002
icle-0.002
Token and
Token ID290
Feature activation-9.46e-01

Pos logit contributions
ator+0.014
urg+0.014
icles+0.013
rim+0.012
icle+0.012
Neg logit contributions
bs-0.008
 cane-0.007
Scar-0.007
 arrows-0.007
 arrow-0.006
Token alone
Token ID3436
Feature activation-5.54e-01

Pos logit contributions
ator+0.060
urg+0.059
icles+0.057
rim+0.052
icle+0.052
Neg logit contributions
bs-0.023
 cane-0.021
 confident-0.018
Scar-0.018
 arrow-0.018
Token that
Token ID326
Feature activation+9.42e-01

Pos logit contributions
bs+0.019
 cane+0.018
Scar+0.018
powers+0.017
 arrows+0.016
Neg logit contributions
ator-0.021
urg-0.021
icles-0.020
icle-0.018
rim-0.017
Token she
Token ID673
Feature activation+1.35e-01

Pos logit contributions
bs+0.039
Scar+0.036
powers+0.035
 cane+0.035
 arrows+0.033
Neg logit contributions
ator-0.039
icles-0.039
urg-0.039
icle-0.034
rim-0.033
Token would
Token ID561
Feature activation-5.60e-01

Pos logit contributions
bs+0.006
 cane+0.006
Scar+0.006
 arrows+0.005
 arrow+0.005
Neg logit contributions
ator-0.005
urg-0.005
icles-0.005
rim-0.004
zie-0.004
Token always
Token ID1464
Feature activation+1.25e+00

Pos logit contributions
urg+0.024
ator+0.024
icles+0.023
rim+0.020
icle+0.019
Neg logit contributions
bs-0.024
 cane-0.023
Scar-0.022
 arrows-0.021
 arrow-0.021
Token love
Token ID1842
Feature activation+8.49e-01

Pos logit contributions
bs+0.066
Scar+0.061
powers+0.059
pill+0.058
 cane+0.058
Neg logit contributions
ator-0.040
urg-0.037
icles-0.034
rim-0.030
zie-0.029
Token ponies
Token ID46370
Feature activation+2.10e+00

Pos logit contributions
bs+0.036
Scar+0.033
powers+0.032
 cane+0.031
 arrows+0.030
Neg logit contributions
ator-0.035
urg-0.034
icles-0.034
fl-0.030
rim-0.029
Token.
Token ID13
Feature activation+3.76e-01

Pos logit contributions
bs+0.083
Scar+0.082
powers+0.077
Har+0.077
 cane+0.077
Neg logit contributions
ator-0.086
icles-0.078
urg-0.077
fl-0.068
ie-0.066
Token<|endoftext|>
Token ID50256
Feature activation+5.39e-01

Pos logit contributions
bs+0.012
 cane+0.012
Scar+0.011
 arrows+0.011
 arrow+0.011
Neg logit contributions
ator-0.019
urg-0.019
icles-0.019
zie-0.017
icle-0.016
TokenOnce
Token ID7454
Feature activation+1.64e-01

Pos logit contributions
bs+0.019
 cane+0.018
Scar+0.017
 arrows+0.017
 arrow+0.016
Neg logit contributions
ator-0.028
urg-0.026
icles-0.026
icle-0.023
rim-0.023
Token upon
Token ID2402
Feature activation+1.41e+00

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
 arrows+0.007
 arrow+0.007
Neg logit contributions
ator-0.006
urg-0.006
icles-0.005
rim-0.005
zie-0.005
Token a
Token ID257
Feature activation+1.08e+00

Pos logit contributions
bs+0.047
 cane+0.042
Scar+0.041
 arrows+0.040
Aud+0.038
Neg logit contributions
urg-0.070
icles-0.069
ator-0.068
 Esk-0.064
icle-0.061
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
ator+0.021
urg+0.021
icles+0.020
rim+0.018
icle+0.018
Neg logit contributions
bs-0.012
 cane-0.012
Scar-0.011
 arrows-0.010
 arrow-0.010
Token
Token ID198
Feature activation+5.86e-01

Pos logit contributions
bs+0.008
 cane+0.007
Scar+0.007
 arrows+0.007
able+0.007
Neg logit contributions
urg-0.008
ator-0.008
icles-0.008
zie-0.007
icle-0.007
Token
Token ID198
Feature activation+3.15e-01

Pos logit contributions
bs+0.020
 cane+0.020
Scar+0.018
nam+0.018
 arrows+0.018
Neg logit contributions
ator-0.029
urg-0.028
icles-0.027
rim-0.025
zie-0.024
Token"
Token ID1
Feature activation-6.97e-01

Pos logit contributions
bs+0.011
 cane+0.010
Scar+0.010
 arrows+0.009
nam+0.009
Neg logit contributions
ator-0.016
urg-0.016
icles-0.015
rim-0.014
zie-0.013
TokenMom
Token ID29252
Feature activation-1.01e+00

Pos logit contributions
urg+0.035
ator+0.034
icles+0.032
rim+0.028
zie+0.028
Neg logit contributions
bs-0.025
 cane-0.025
Scar-0.025
 arrows-0.023
Har-0.023
Tokenmy
Token ID1820
Feature activation+3.68e-02

Pos logit contributions
ator+0.057
urg+0.057
icles+0.054
rim+0.050
icle+0.048
Neg logit contributions
bs-0.029
 cane-0.028
Scar-0.026
 arrow-0.024
 arrows-0.024
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
 arrows+0.001
 arrow+0.001
Neg logit contributions
urg-0.002
ator-0.002
icles-0.002
rim-0.002
zie-0.002
Token I
Token ID314
Feature activation-4.40e-02

Pos logit contributions
urg+0.053
ator+0.053
icles+0.050
rim+0.047
io+0.043
Neg logit contributions
 cane-0.037
bs-0.037
Scar-0.033
 arrow-0.033
 arrows-0.032
Token want
Token ID765
Feature activation-2.14e-01

Pos logit contributions
urg+0.002
ator+0.002
icles+0.002
rim+0.002
zie+0.002
Neg logit contributions
bs-0.002
 cane-0.002
Scar-0.002
 arrows-0.002
 arrow-0.002
Token to
Token ID284
Feature activation-2.66e-02

Pos logit contributions
urg+0.011
ator+0.011
icles+0.010
rim+0.009
icle+0.009
Neg logit contributions
bs-0.008
 cane-0.007
Scar-0.007
 arrows-0.007
 confident-0.006
Token go
Token ID467
Feature activation-9.30e-01

Pos logit contributions
ator+0.001
urg+0.001
icles+0.001
rim+0.001
icle+0.001
Neg logit contributions
bs-0.001
 cane-0.001
Scar-0.001
 arrows-0.001
 confident-0.001
Token she
Token ID673
Feature activation-5.37e-01

Pos logit contributions
 cane+0.034
bs+0.033
 arrows+0.033
powers+0.033
 arrow+0.033
Neg logit contributions
ator-0.068
icles-0.066
urg-0.065
 setting-0.060
fl-0.060
Token walked
Token ID6807
Feature activation-3.08e-01

Pos logit contributions
ator+0.027
urg+0.027
icles+0.025
rim+0.023
icle+0.022
Neg logit contributions
bs-0.020
 cane-0.019
Scar-0.018
 arrows-0.017
 arrow-0.017
Token,
Token ID11
Feature activation+1.95e+00

Pos logit contributions
urg+0.018
ator+0.018
icles+0.017
rim+0.016
icle+0.015
Neg logit contributions
bs-0.008
 cane-0.008
Scar-0.008
 arrows-0.007
 arrow-0.007
Token she
Token ID673
Feature activation-9.70e-02

Pos logit contributions
able+0.051
Mic+0.050
bs+0.049
powers+0.048
 cane+0.047
Neg logit contributions
ator-0.098
icles-0.098
urg-0.097
fl-0.089
aby-0.086
Token heard
Token ID2982
Feature activation+1.00e+00

Pos logit contributions
urg+0.005
ator+0.005
icles+0.004
rim+0.004
zie+0.004
Neg logit contributions
bs-0.003
 cane-0.003
Scar-0.003
 arrows-0.003
 arrow-0.003
Token something
Token ID1223
Feature activation+6.11e-01

Pos logit contributions
bs+0.030
 cane+0.029
Scar+0.028
Mic+0.025
able+0.025
Neg logit contributions
urg-0.054
icles-0.053
ator-0.052
icle-0.046
zie-0.045
Token spl
Token ID4328
Feature activation-8.17e-01

Pos logit contributions
bs+0.022
Scar+0.022
 cane+0.021
Har+0.020
powers+0.019
Neg logit contributions
urg-0.029
ator-0.028
icles-0.027
rim-0.025
icle-0.024
Tokenashing
Token ID2140
Feature activation-6.92e-01

Pos logit contributions
ator+0.038
urg+0.037
icles+0.034
rim+0.030
icle+0.029
Neg logit contributions
bs-0.035
 cane-0.031
Scar-0.030
 arrows-0.029
 confident-0.029
Token.
Token ID13
Feature activation+2.15e+00

Pos logit contributions
ator+0.035
urg+0.035
icles+0.033
rim+0.030
icle+0.029
Neg logit contributions
bs-0.024
 cane-0.024
Scar-0.022
 arrows-0.022
 arrow-0.021
Token
Token ID220
Feature activation+1.05e+00

Pos logit contributions
able+0.105
bs+0.099
 arrows+0.093
 cane+0.092
Mic+0.091
Neg logit contributions
icles-0.069
urg-0.064
ator-0.061
ric-0.056
fl-0.054
Token
Token ID220
Feature activation+1.01e+00

Pos logit contributions
 cane+0.038
bs+0.036
 arrows+0.036
 arrow+0.035
Scar+0.035
Neg logit contributions
ator-0.051
urg-0.049
icles-0.049
zie-0.042
icle-0.041
Token he
Token ID339
Feature activation-1.32e+00

Pos logit contributions
bs+0.016
 cane+0.015
Scar+0.015
Aud+0.014
 stamp+0.013
Neg logit contributions
ator-0.024
urg-0.024
icles-0.023
 Esk-0.021
zie-0.020
Token had
Token ID550
Feature activation-1.35e-01

Pos logit contributions
ator+0.075
urg+0.072
icles+0.070
rim+0.063
icle+0.061
Neg logit contributions
bs-0.045
 cane-0.042
 arrows-0.039
 arrow-0.038
Scar-0.038
Token never
Token ID1239
Feature activation+1.78e-01

Pos logit contributions
urg+0.007
ator+0.007
icles+0.007
rim+0.006
icle+0.006
Neg logit contributions
bs-0.005
 cane-0.004
Scar-0.004
 arrows-0.004
 arrow-0.004
Token gone
Token ID3750
Feature activation-5.94e-01

Pos logit contributions
bs+0.006
 cane+0.006
Scar+0.006
 arrows+0.006
 string+0.005
Neg logit contributions
urg-0.009
ator-0.008
icles-0.008
rim-0.007
zie-0.007
Token swimming
Token ID14899
Feature activation-7.94e-01

Pos logit contributions
ator+0.031
urg+0.031
icles+0.029
rim+0.026
icle+0.026
Neg logit contributions
bs-0.020
 cane-0.020
Scar-0.019
 arrows-0.018
 arrow-0.017
Token before
Token ID878
Feature activation+3.54e-01

Pos logit contributions
urg+0.040
ator+0.040
icles+0.038
rim+0.034
zie+0.033
Neg logit contributions
bs-0.028
 cane-0.027
Scar-0.026
 arrows-0.025
 arrow-0.024
Token.
Token ID13
Feature activation-5.81e-01

Pos logit contributions
bs+0.013
Scar+0.012
 cane+0.012
Aud+0.011
 arrows+0.011
Neg logit contributions
urg-0.018
ator-0.017
icles-0.017
rim-0.015
icle-0.015
Token He
Token ID679
Feature activation-9.20e-01

Pos logit contributions
ator+0.030
urg+0.030
icles+0.028
rim+0.026
icle+0.025
Neg logit contributions
bs-0.020
 cane-0.020
Scar-0.019
 arrows-0.018
 arrow-0.017
Token looked
Token ID3114
Feature activation-3.11e-01

Pos logit contributions
ator+0.047
urg+0.046
icles+0.045
rim+0.040
icle+0.039
Neg logit contributions
bs-0.033
 cane-0.032
Scar-0.030
 arrows-0.029
 arrow-0.029
Token around
Token ID1088
Feature activation-9.64e-01

Pos logit contributions
ator+0.014
urg+0.014
icles+0.013
rim+0.012
icle+0.011
Neg logit contributions
bs-0.013
 cane-0.012
Scar-0.012
 arrows-0.011
 arrow-0.011
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
ator+0.061
urg+0.061
icles+0.059
rim+0.055
icle+0.053
Neg logit contributions
 cane-0.022
bs-0.022
Scar-0.019
 arrows-0.019
 confident-0.018
Token amazed
Token ID24562
Feature activation-7.21e-01

Pos logit contributions
bs+0.010
 cane+0.010
Scar+0.009
 arrows+0.009
 arrow+0.009
Neg logit contributions
urg-0.014
ator-0.014
icles-0.013
rim-0.012
zie-0.012
Token and
Token ID290
Feature activation+3.23e-02

Pos logit contributions
urg+0.040
ator+0.039
icles+0.038
rim+0.034
icle+0.034
Neg logit contributions
bs-0.022
 cane-0.022
Scar-0.020
 arrows-0.019
 arrow-0.018
Token thought
Token ID1807
Feature activation+1.69e-01

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
 arrows+0.001
powers+0.001
Neg logit contributions
urg-0.002
ator-0.002
icles-0.002
rim-0.002
zie-0.002
Token Mary
Token ID5335
Feature activation-1.46e-01

Pos logit contributions
bs+0.005
 cane+0.005
Scar+0.005
 arrows+0.005
 arrow+0.005
Neg logit contributions
ator-0.009
urg-0.009
icles-0.008
rim-0.007
icle-0.007
Token was
Token ID373
Feature activation-7.21e-01

Pos logit contributions
ator+0.006
urg+0.006
icles+0.006
rim+0.005
zie+0.005
Neg logit contributions
bs-0.006
 cane-0.006
Scar-0.006
 arrows-0.005
 arrow-0.005
Token really
Token ID1107
Feature activation+5.56e-01

Pos logit contributions
urg+0.038
ator+0.037
icles+0.036
rim+0.032
icle+0.032
Neg logit contributions
bs-0.023
 cane-0.023
Scar-0.023
 confident-0.021
 arrows-0.021
Token smart
Token ID4451
Feature activation-3.43e-01

Pos logit contributions
bs+0.020
 cane+0.019
Scar+0.018
 arrows+0.017
 string+0.017
Neg logit contributions
ator-0.028
urg-0.027
icles-0.025
rim-0.024
zie-0.023
Token for
Token ID329
Feature activation-5.59e-01

Pos logit contributions
ator+0.019
urg+0.019
icles+0.018
rim+0.016
icle+0.016
Neg logit contributions
bs-0.011
 cane-0.010
Scar-0.010
 arrows-0.009
 arrow-0.009
Token asking
Token ID4737
Feature activation-6.98e-01

Pos logit contributions
ator+0.027
urg+0.027
icles+0.026
rim+0.023
icle+0.023
Neg logit contributions
bs-0.020
 cane-0.020
Scar-0.019
 arrows-0.018
 arrow-0.018
Token for
Token ID329
Feature activation+6.78e-02

Pos logit contributions
urg+0.035
ator+0.035
icles+0.033
rim+0.030
icle+0.029
Neg logit contributions
bs-0.025
 cane-0.024
Scar-0.023
 arrows-0.021
 arrow-0.021
Token opinions
Token ID9317
Feature activation+9.35e-01

Pos logit contributions
bs+0.003
 cane+0.002
Scar+0.002
 arrows+0.002
powers+0.002
Neg logit contributions
ator-0.003
urg-0.003
icles-0.003
rim-0.003
icle-0.003
Token,
Token ID11
Feature activation+7.26e-01

Pos logit contributions
bs+0.005
 cane+0.005
Scar+0.005
Har+0.004
 arrows+0.004
Neg logit contributions
ator-0.006
urg-0.006
icles-0.006
rim-0.005
icle-0.005
Token just
Token ID655
Feature activation+5.69e-01

Pos logit contributions
bs+0.029
Scar+0.027
 cane+0.027
Har+0.026
powers+0.024
Neg logit contributions
urg-0.032
icles-0.032
ator-0.031
icle-0.027
rim-0.027
Token like
Token ID588
Feature activation-4.31e-01

Pos logit contributions
Scar+0.020
bs+0.020
 cane+0.020
Har+0.019
 arrows+0.018
Neg logit contributions
urg-0.026
ator-0.026
icles-0.026
icle-0.023
zie-0.022
Token in
Token ID287
Feature activation+7.04e-01

Pos logit contributions
ator+0.022
urg+0.022
icles+0.021
rim+0.019
icle+0.019
Neg logit contributions
bs-0.014
 cane-0.014
Scar-0.013
 arrows-0.012
 arrow-0.012
Token the
Token ID262
Feature activation-7.38e-02

Pos logit contributions
bs+0.026
 cane+0.024
Scar+0.024
powers+0.023
 arrows+0.022
Neg logit contributions
ator-0.034
urg-0.032
icles-0.032
zie-0.028
rim-0.028
Token movies
Token ID6918
Feature activation+1.99e-01

Pos logit contributions
ator+0.003
urg+0.003
icles+0.003
rim+0.003
zie+0.003
Neg logit contributions
bs-0.003
 cane-0.003
Scar-0.003
 arrows-0.003
 arrow-0.003
Token.
Token ID13
Feature activation+6.31e-01

Pos logit contributions
bs+0.007
 cane+0.007
Scar+0.006
 arrows+0.006
 arrow+0.006
Neg logit contributions
ator-0.010
urg-0.010
icles-0.010
rim-0.009
icle-0.008
Token
Token ID198
Feature activation+2.55e-01

Pos logit contributions
 cane+0.021
bs+0.021
Scar+0.020
 arrows+0.020
 stamp+0.019
Neg logit contributions
ator-0.031
icles-0.030
urg-0.030
zie-0.027
icle-0.026
Token
Token ID198
Feature activation+3.97e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
 arrows+0.007
 arrow+0.007
Neg logit contributions
ator-0.013
urg-0.013
icles-0.013
rim-0.012
zie-0.011
TokenTo
Token ID2514
Feature activation-6.53e-01

Pos logit contributions
 cane+0.014
bs+0.014
 arrows+0.012
Scar+0.012
 arrow+0.012
Neg logit contributions
ator-0.020
urg-0.019
icles-0.018
rim-0.017
zie-0.016
Token make
Token ID787
Feature activation-3.77e-01

Pos logit contributions
urg+0.033
ator+0.033
icles+0.032
rim+0.028
icle+0.028
Neg logit contributions
bs-0.023
 cane-0.022
Scar-0.021
 arrows-0.020
 arrow-0.019
Token But
Token ID887
Feature activation-1.33e+00

Pos logit contributions
urg+0.038
ator+0.038
icles+0.036
rim+0.033
zie+0.031
Neg logit contributions
bs-0.032
 cane-0.029
Scar-0.029
 arrows-0.027
 arrow-0.027
Token remember
Token ID3505
Feature activation-4.48e-01

Pos logit contributions
urg+0.070
ator+0.068
icles+0.065
rim+0.063
zie+0.058
Neg logit contributions
bs-0.043
 cane-0.041
 arrow-0.040
 arrows-0.039
 string-0.039
Token,
Token ID11
Feature activation-5.06e-01

Pos logit contributions
ator+0.026
urg+0.026
icles+0.025
rim+0.023
icle+0.022
Neg logit contributions
bs-0.012
 cane-0.012
Scar-0.011
 arrows-0.010
 arrow-0.010
Token it
Token ID340
Feature activation+3.42e-01

Pos logit contributions
ator+0.026
urg+0.026
icles+0.024
rim+0.022
icle+0.021
Neg logit contributions
bs-0.018
 cane-0.017
Scar-0.016
 arrows-0.016
 arrow-0.015
Token's
Token ID338
Feature activation-8.09e-02

Pos logit contributions
 cane+0.011
bs+0.011
Scar+0.010
Har+0.010
 arrows+0.010
Neg logit contributions
ator-0.018
urg-0.018
icles-0.017
zie-0.015
rim-0.015
Token okay
Token ID8788
Feature activation-7.48e-01

Pos logit contributions
ator+0.004
urg+0.004
icles+0.004
rim+0.003
icle+0.003
Neg logit contributions
bs-0.003
 cane-0.003
Scar-0.003
 arrows-0.003
powers-0.003
Token to
Token ID284
Feature activation-1.06e-01

Pos logit contributions
ator+0.042
urg+0.041
icles+0.039
rim+0.036
icle+0.035
Neg logit contributions
bs-0.023
 cane-0.022
Scar-0.020
 arrow-0.019
 arrows-0.019
Token make
Token ID787
Feature activation+1.91e-01

Pos logit contributions
ator+0.005
urg+0.005
icles+0.005
rim+0.004
icle+0.004
Neg logit contributions
bs-0.004
 cane-0.004
Scar-0.004
 arrows-0.004
 arrow-0.004
Token mistakes
Token ID10135
Feature activation+1.89e+00

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.007
Har+0.007
Aud+0.007
Neg logit contributions
urg-0.009
ator-0.008
icles-0.008
rim-0.007
icle-0.007
Token and
Token ID290
Feature activation+3.19e-01

Pos logit contributions
Scar+0.076
 cane+0.075
bs+0.075
Har+0.075
Terry+0.070
Neg logit contributions
ator-0.081
icles-0.078
urg-0.078
ators-0.063
icle-0.063
Token things
Token ID1243
Feature activation+7.42e-01

Pos logit contributions
bs+0.013
 cane+0.013
Scar+0.013
Har+0.012
Aud+0.012
Neg logit contributions
urg-0.014
ator-0.014
icles-0.013
rim-0.011
icle-0.011
Token to
Token ID284
Feature activation-4.92e-01

Pos logit contributions
urg+0.006
ator+0.006
icles+0.006
rim+0.005
icle+0.005
Neg logit contributions
bs-0.005
 cane-0.004
Scar-0.004
 arrows-0.004
 arrow-0.004
Token work
Token ID670
Feature activation+9.05e-01

Pos logit contributions
urg+0.023
ator+0.023
icles+0.022
rim+0.019
icle+0.018
Neg logit contributions
bs-0.019
 cane-0.019
Scar-0.018
 arrows-0.017
 arrow-0.017
Token together
Token ID1978
Feature activation+4.70e-01

Pos logit contributions
bs+0.037
 cane+0.036
Scar+0.035
Har+0.032
powers+0.032
Neg logit contributions
ator-0.040
urg-0.040
icles-0.038
icle-0.033
rim-0.032
Token to
Token ID284
Feature activation+3.11e-01

Pos logit contributions
bs+0.018
Scar+0.017
 cane+0.017
Har+0.015
powers+0.015
Neg logit contributions
ator-0.022
urg-0.022
icles-0.021
icle-0.018
rim-0.018
Token fix
Token ID4259
Feature activation+6.97e-01

Pos logit contributions
bs+0.014
Scar+0.013
 cane+0.013
powers+0.012
Aud+0.012
Neg logit contributions
ator-0.013
urg-0.013
icles-0.012
rim-0.011
icle-0.010
Token the
Token ID262
Feature activation-8.58e-01

Pos logit contributions
bs+0.027
Scar+0.026
 cane+0.026
powers+0.024
 confident+0.024
Neg logit contributions
ator-0.033
urg-0.032
icles-0.032
rim-0.027
icle-0.027
Token drain
Token ID14782
Feature activation+1.67e-01

Pos logit contributions
ator+0.048
urg+0.048
icles+0.047
zie+0.042
rim+0.042
Neg logit contributions
 cane-0.026
bs-0.025
 string-0.023
 arrows-0.023
 arrow-0.023
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
bs+0.006
 cane+0.005
Scar+0.005
 arrows+0.005
Har+0.005
Neg logit contributions
ator-0.009
urg-0.009
icles-0.008
rim-0.007
icle-0.007
Token They
Token ID1119
Feature activation+1.51e-01

Pos logit contributions
ator+0.014
urg+0.014
icles+0.013
rim+0.012
icle+0.012
Neg logit contributions
bs-0.011
 cane-0.011
Scar-0.011
 arrows-0.010
 arrow-0.010
Token used
Token ID973
Feature activation+8.89e-01

Pos logit contributions
bs+0.005
 cane+0.005
Scar+0.005
powers+0.004
 arrows+0.004
Neg logit contributions
ator-0.008
urg-0.008
icles-0.007
rim-0.007
zie-0.007
Token sticks
Token ID16461
Feature activation+6.34e-01

Pos logit contributions
Scar+0.033
bs+0.032
 cane+0.031
Har+0.030
Aud+0.029
Neg logit contributions
urg-0.041
icles-0.041
ator-0.040
icle-0.036
zie-0.033
Token She
Token ID1375
Feature activation+1.20e-01

Pos logit contributions
bs+0.017
 cane+0.016
 arrows+0.016
Scar+0.016
 arrow+0.016
Neg logit contributions
ator-0.026
urg-0.026
icles-0.025
zie-0.023
 Esk-0.022
Token hugged
Token ID41130
Feature activation-3.06e-01

Pos logit contributions
bs+0.004
 cane+0.004
Scar+0.004
 arrows+0.004
powers+0.003
Neg logit contributions
urg-0.006
ator-0.006
icles-0.006
rim-0.005
zie-0.005
Token Anna
Token ID11735
Feature activation+1.04e+00

Pos logit contributions
ator+0.016
urg+0.016
icles+0.016
rim+0.014
icle+0.014
Neg logit contributions
bs-0.010
 cane-0.010
Scar-0.009
 arrows-0.009
 arrow-0.008
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
bs+0.038
Scar+0.036
powers+0.035
 cane+0.035
 arrows+0.034
Neg logit contributions
ator-0.045
urg-0.044
icles-0.044
zie-0.039
ie-0.037
Token said
Token ID531
Feature activation+8.62e-01

Pos logit contributions
powers+0.083
ner+0.079
bs+0.075
 cane+0.075
ite+0.075
Neg logit contributions
urg-0.075
icles-0.067
ator-0.064
zie-0.061
 M-0.060
Token she
Token ID673
Feature activation-6.60e-01

Pos logit contributions
 cane+0.026
Scar+0.026
Har+0.025
bs+0.025
 arrow+0.024
Neg logit contributions
urg-0.045
icles-0.043
ator-0.043
icle-0.037
 Esk-0.036
Token was
Token ID373
Feature activation-2.95e-01

Pos logit contributions
urg+0.031
ator+0.031
icles+0.029
rim+0.026
icle+0.026
Neg logit contributions
bs-0.026
 cane-0.026
Scar-0.024
 arrows-0.023
 confident-0.023
Token sorry
Token ID7926
Feature activation-8.73e-01

Pos logit contributions
urg+0.016
ator+0.016
icles+0.015
rim+0.014
icle+0.013
Neg logit contributions
bs-0.009
 cane-0.009
Scar-0.009
 arrows-0.008
 arrow-0.008
Token.
Token ID13
Feature activation+7.17e-01

Pos logit contributions
ator+0.051
urg+0.051
icles+0.048
rim+0.045
icle+0.043
Neg logit contributions
bs-0.026
 cane-0.024
Scar-0.022
 arrows-0.021
 confident-0.020
Token She
Token ID1375
Feature activation+2.90e-01

Pos logit contributions
bs+0.024
 cane+0.024
 arrows+0.023
 arrow+0.023
 stamp+0.023
Neg logit contributions
ator-0.033
urg-0.033
icles-0.032
zie-0.028
icle-0.028
Token said
Token ID531
Feature activation+8.08e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
 arrows+0.007
powers+0.007
Neg logit contributions
urg-0.016
ator-0.016
icles-0.015
rim-0.014
zie-0.013
Token her
Token ID607
Feature activation-8.65e-01

Pos logit contributions
ator+0.065
urg+0.063
icles+0.061
rim+0.057
icle+0.054
Neg logit contributions
bs-0.037
 cane-0.036
 arrows-0.033
 arrow-0.033
 confident-0.032
Token home
Token ID1363
Feature activation-6.77e-01

Pos logit contributions
ator+0.043
urg+0.042
icles+0.041
rim+0.037
icle+0.035
Neg logit contributions
 cane-0.032
bs-0.031
 arrows-0.029
 arrow-0.029
Scar-0.028
Token.
Token ID13
Feature activation+2.35e-01

Pos logit contributions
ator+0.038
urg+0.038
icles+0.036
rim+0.034
icle+0.032
Neg logit contributions
bs-0.020
 cane-0.019
Scar-0.017
 confident-0.016
 arrows-0.016
Token
Token ID198
Feature activation+4.45e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
 arrows+0.007
 arrow+0.007
Neg logit contributions
ator-0.012
urg-0.012
icles-0.012
zie-0.010
icle-0.010
Token
Token ID198
Feature activation+3.37e-01

Pos logit contributions
bs+0.015
 cane+0.015
Scar+0.014
 arrows+0.014
 arrow+0.013
Neg logit contributions
ator-0.023
urg-0.022
icles-0.022
rim-0.019
zie-0.019
TokenEvery
Token ID6109
Feature activation-1.43e+00

Pos logit contributions
bs+0.013
 cane+0.012
Scar+0.011
 arrows+0.011
powers+0.011
Neg logit contributions
ator-0.016
urg-0.015
icles-0.015
rim-0.013
zie-0.013
Token day
Token ID1110
Feature activation+2.80e-01

Pos logit contributions
urg+0.073
ator+0.073
icles+0.071
rim+0.063
zie+0.058
Neg logit contributions
 cane-0.054
bs-0.049
 short-0.047
 string-0.046
 long-0.046
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
 cane+0.008
Scar+0.008
bs+0.008
powers+0.007
 arrows+0.007
Neg logit contributions
urg-0.015
icles-0.015
ator-0.015
icle-0.013
zie-0.013
Token m
Token ID285
Feature activation-1.47e+00

Pos logit contributions
urg+0.009
ator+0.009
icles+0.009
rim+0.008
icle+0.008
Neg logit contributions
bs-0.004
 cane-0.004
Scar-0.004
 arrows-0.003
 arrow-0.003
Tokenama
Token ID1689
Feature activation-1.08e+00

Pos logit contributions
ator+0.060
urg+0.057
icles+0.054
zie+0.050
ators+0.049
Neg logit contributions
bs-0.062
 cane-0.058
 stamp-0.053
 arrow-0.052
powers-0.051
Token oy
Token ID35104
Feature activation-1.72e+00

Pos logit contributions
ator+0.062
urg+0.062
icles+0.059
rim+0.054
icle+0.052
Neg logit contributions
bs-0.030
 cane-0.030
Scar-0.027
 arrow-0.025
 arrows-0.025
Token the
Token ID262
Feature activation-6.02e-01

Pos logit contributions
bs+0.006
 cane+0.006
Scar+0.005
 arrows+0.005
able+0.005
Neg logit contributions
urg-0.008
ator-0.008
icles-0.008
rim-0.007
icle-0.007
Token doctor
Token ID6253
Feature activation+8.36e-02

Pos logit contributions
urg+0.028
ator+0.028
icles+0.027
rim+0.024
icle+0.023
Neg logit contributions
 cane-0.023
bs-0.023
Scar-0.022
 arrows-0.021
 arrow-0.021
Token."
Token ID526
Feature activation+3.10e-01

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.002
 arrows+0.002
 arrow+0.002
Neg logit contributions
ator-0.004
urg-0.004
icles-0.004
rim-0.004
zie-0.004
Token
Token ID198
Feature activation+8.37e-02

Pos logit contributions
bs+0.011
 cane+0.011
Scar+0.010
 arrows+0.010
 arrow+0.009
Neg logit contributions
ator-0.015
urg-0.015
icles-0.015
rim-0.013
icle-0.013
Token
Token ID198
Feature activation+3.08e-01

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.002
 arrows+0.002
 arrow+0.002
Neg logit contributions
ator-0.005
urg-0.005
icles-0.004
rim-0.004
icle-0.004
TokenAt
Token ID2953
Feature activation-5.54e-01

Pos logit contributions
bs+0.011
 cane+0.010
Scar+0.009
 arrows+0.009
 arrow+0.009
Neg logit contributions
ator-0.016
urg-0.015
icles-0.015
rim-0.014
icle-0.013
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
urg+0.028
ator+0.028
icles+0.027
rim+0.025
icle+0.023
Neg logit contributions
bs-0.019
 cane-0.019
Scar-0.018
 arrows-0.016
 arrow-0.016
Token doctor
Token ID6253
Feature activation+9.69e-02

Pos logit contributions
urg+0.061
ator+0.059
icles+0.057
rim+0.052
zie+0.049
Neg logit contributions
 cane-0.039
Scar-0.035
bs-0.035
 string-0.034
 stamp-0.033
Token's
Token ID338
Feature activation-8.92e-01

Pos logit contributions
bs+0.003
 cane+0.003
Scar+0.003
 arrows+0.002
 arrow+0.002
Neg logit contributions
urg-0.005
ator-0.005
icles-0.005
rim-0.005
zie-0.005
Token office
Token ID2607
Feature activation+2.89e-01

Pos logit contributions
urg+0.040
ator+0.040
icles+0.038
rim+0.034
zie+0.032
Neg logit contributions
 cane-0.037
bs-0.036
Scar-0.034
 string-0.032
 confident-0.032
Token,
Token ID11
Feature activation+1.13e-01

Pos logit contributions
bs+0.008
 cane+0.008
Scar+0.008
 arrows+0.007
 arrow+0.007
Neg logit contributions
ator-0.016
urg-0.016
icles-0.016
icle-0.014
zie-0.014
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
able+0.083
nam+0.081
bs+0.081
 stamp+0.080
 cane+0.079
Neg logit contributions
urg-0.058
icles-0.057
ator-0.055
 them-0.050
aby-0.046
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
 cane+0.042
bs+0.042
Scar+0.039
nam+0.039
 stamp+0.038
Neg logit contributions
ator-0.051
urg-0.051
icles-0.047
rim-0.042
zie-0.041
Token"
Token ID1
Feature activation-4.26e-01

Pos logit contributions
bs+0.052
 cane+0.048
nam+0.045
powers+0.044
 stamp+0.044
Neg logit contributions
ator-0.049
urg-0.046
icles-0.042
rim-0.038
aby-0.038
TokenLet
Token ID5756
Feature activation-6.98e-01

Pos logit contributions
urg+0.020
ator+0.020
icles+0.019
rim+0.017
zie+0.017
Neg logit contributions
bs-0.016
 cane-0.016
Scar-0.016
 arrows-0.015
 arrow-0.014
Token's
Token ID338
Feature activation-7.88e-01

Pos logit contributions
ator+0.041
urg+0.041
icles+0.039
rim+0.036
icle+0.035
Neg logit contributions
bs-0.019
 cane-0.019
Scar-0.018
 arrows-0.016
 arrow-0.016
Token go
Token ID467
Feature activation-2.23e+00

Pos logit contributions
urg+0.038
ator+0.038
icles+0.036
rim+0.033
icle+0.031
Neg logit contributions
 cane-0.029
bs-0.029
 arrow-0.027
Scar-0.027
 arrows-0.027
Token home
Token ID1363
Feature activation-2.75e-01

Pos logit contributions
urg+0.133
icles+0.122
ator+0.122
rim+0.112
umm+0.111
Neg logit contributions
bs-0.062
 cane-0.058
 arrow-0.054
 Jerry-0.053
 long-0.053
Token now
Token ID783
Feature activation+2.11e-01

Pos logit contributions
ator+0.012
urg+0.012
icles+0.012
rim+0.010
icle+0.010
Neg logit contributions
bs-0.011
 cane-0.011
Scar-0.011
 arrows-0.010
 arrow-0.010
Token,"
Token ID553
Feature activation+1.64e+00

Pos logit contributions
bs+0.009
 cane+0.009
Scar+0.008
 arrows+0.008
nam+0.008
Neg logit contributions
ator-0.009
urg-0.009
icles-0.009
rim-0.007
icle-0.007
Token Ben
Token ID3932
Feature activation+1.45e+00

Pos logit contributions
bs+0.073
powers+0.070
br+0.070
able+0.069
Scar+0.068
Neg logit contributions
icles-0.055
urg-0.054
 M-0.052
ator-0.051
 Manny-0.050
Token said
Token ID531
Feature activation+1.17e+00

Pos logit contributions
bs+0.056
 cane+0.054
nam+0.054
Scar+0.052
powers+0.048
Neg logit contributions
urg-0.064
icles-0.060
ator-0.059
io-0.054
rim-0.053
Token chase
Token ID15505
Feature activation+1.32e-01

Pos logit contributions
bs+0.025
 cane+0.025
Scar+0.024
powers+0.023
able+0.023
Neg logit contributions
ator-0.028
urg-0.027
icles-0.026
zie-0.023
icle-0.022
Token the
Token ID262
Feature activation-9.19e-01

Pos logit contributions
bs+0.005
 cane+0.005
Scar+0.005
 confident+0.005
Har+0.004
Neg logit contributions
urg-0.006
ator-0.006
icles-0.006
icle-0.005
rim-0.005
Token car
Token ID1097
Feature activation+2.35e-01

Pos logit contributions
urg+0.050
ator+0.049
icles+0.047
rim+0.043
zie+0.043
Neg logit contributions
 cane-0.030
bs-0.028
 string-0.028
 arrows-0.027
 arrow-0.027
Token-
Token ID12
Feature activation-8.48e-01

Pos logit contributions
bs+0.008
Scar+0.008
 cane+0.008
Har+0.007
 arrows+0.007
Neg logit contributions
ator-0.011
urg-0.011
icles-0.011
icle-0.009
zie-0.009
Tokennos
Token ID39369
Feature activation-1.77e+00

Pos logit contributions
urg+0.042
ator+0.042
icles+0.038
rim+0.036
zie+0.034
Neg logit contributions
bs-0.033
 cane-0.030
Scar-0.029
powers-0.028
able-0.028
Tokenaur
Token ID2899
Feature activation-2.47e+00

Pos logit contributions
urg+0.080
ator+0.077
icles+0.070
rim+0.067
 Esk+0.063
Neg logit contributions
bs-0.069
able-0.067
 cane-0.063
powers-0.059
Scar-0.059
Token around
Token ID1088
Feature activation-1.01e+00

Pos logit contributions
urg+0.098
ator+0.098
umm+0.095
icles+0.089
rim+0.087
Neg logit contributions
bs-0.105
 cane-0.096
 arrows-0.093
 arrow-0.091
 string-0.091
Token the
Token ID262
Feature activation-3.25e-02

Pos logit contributions
urg+0.057
ator+0.057
icles+0.054
rim+0.051
zie+0.048
Neg logit contributions
bs-0.030
 cane-0.029
Scar-0.026
 arrows-0.026
 string-0.026
Token room
Token ID2119
Feature activation-1.27e+00

Pos logit contributions
ator+0.001
urg+0.001
icles+0.001
rim+0.001
icle+0.001
Neg logit contributions
bs-0.001
 cane-0.001
Scar-0.001
 arrows-0.001
 confident-0.001
Token,
Token ID11
Feature activation-1.33e+00

Pos logit contributions
urg+0.078
ator+0.077
icles+0.075
rim+0.070
icle+0.068
Neg logit contributions
bs-0.032
 cane-0.029
 confident-0.026
Scar-0.025
 arrows-0.025
Token having
Token ID1719
Feature activation+1.33e-01

Pos logit contributions
ator+0.073
urg+0.073
icles+0.069
rim+0.062
icle+0.060
Neg logit contributions
bs-0.043
 cane-0.043
 confident-0.039
Scar-0.039
 arrow-0.038
Token They
Token ID1119
Feature activation+4.77e-01

Pos logit contributions
ator+0.016
urg+0.016
icles+0.016
rim+0.014
icle+0.014
Neg logit contributions
bs-0.010
 cane-0.009
Scar-0.009
 arrows-0.008
 arrow-0.008
Token were
Token ID547
Feature activation-8.26e-01

Pos logit contributions
bs+0.018
 cane+0.018
Scar+0.017
powers+0.017
 arrows+0.017
Neg logit contributions
urg-0.021
ator-0.021
icles-0.020
 Esk-0.018
zie-0.017
Token excited
Token ID6568
Feature activation-9.27e-01

Pos logit contributions
urg+0.048
ator+0.048
icles+0.047
rim+0.043
icle+0.042
Neg logit contributions
bs-0.022
 cane-0.021
Scar-0.021
 confident-0.020
 arrows-0.019
Token to
Token ID284
Feature activation-7.62e-01

Pos logit contributions
ator+0.056
urg+0.055
icles+0.052
rim+0.049
icle+0.048
Neg logit contributions
bs-0.025
 cane-0.024
Scar-0.020
 confident-0.020
 short-0.020
Token see
Token ID766
Feature activation-3.42e-01

Pos logit contributions
ator+0.044
urg+0.043
icles+0.042
rim+0.039
icle+0.037
Neg logit contributions
bs-0.022
 cane-0.022
Scar-0.020
 arrow-0.019
 arrows-0.018
Token Grand
Token ID5675
Feature activation-2.39e+00

Pos logit contributions
ator+0.016
urg+0.016
icles+0.015
rim+0.013
icle+0.013
Neg logit contributions
bs-0.014
 cane-0.013
Scar-0.013
 arrows-0.012
 arrow-0.012
Tokenma
Token ID2611
Feature activation-9.82e-01

Pos logit contributions
urg+0.112
icles+0.103
ator+0.101
opher+0.101
stack+0.099
Neg logit contributions
able-0.067
old-0.065
 cane-0.063
Scar-0.062
 stamp-0.062
Token.
Token ID13
Feature activation-6.26e-01

Pos logit contributions
ator+0.058
urg+0.058
icles+0.056
rim+0.051
icle+0.050
Neg logit contributions
bs-0.026
 cane-0.025
Scar-0.023
 stamp-0.021
 arrows-0.021
Token They
Token ID1119
Feature activation+2.86e-02

Pos logit contributions
ator+0.035
urg+0.035
icles+0.034
rim+0.031
icle+0.030
Neg logit contributions
bs-0.019
 cane-0.018
Scar-0.017
 arrows-0.015
Har-0.015
Token hoped
Token ID10719
Feature activation-1.02e+00

Pos logit contributions
bs+0.001
 cane+0.001
Scar+0.001
 arrows+0.001
powers+0.001
Neg logit contributions
ator-0.001
urg-0.001
icles-0.001
rim-0.001
zie-0.001
Token she
Token ID673
Feature activation-1.20e+00

Pos logit contributions
ator+0.060
urg+0.060
icles+0.056
rim+0.054
zie+0.051
Neg logit contributions
bs-0.029
 cane-0.026
Scar-0.025
 short-0.022
 arrows-0.022
Token Mia
Token ID32189
Feature activation+5.47e-01

Pos logit contributions
ator+0.032
urg+0.032
icles+0.031
rim+0.028
icle+0.027
Neg logit contributions
bs-0.016
 cane-0.015
Scar-0.015
 arrows-0.013
powers-0.013
Token wanted
Token ID2227
Feature activation-4.06e-01

Pos logit contributions
 cane+0.020
bs+0.020
Scar+0.020
 arrows+0.019
Har+0.019
Neg logit contributions
urg-0.025
ator-0.024
icles-0.023
zie-0.021
rim-0.021
Token to
Token ID284
Feature activation-3.98e-01

Pos logit contributions
ator+0.021
urg+0.021
icles+0.020
rim+0.018
icle+0.018
Neg logit contributions
bs-0.014
 cane-0.013
Scar-0.013
 arrows-0.012
 arrow-0.012
Token make
Token ID787
Feature activation-8.81e-01

Pos logit contributions
ator+0.020
urg+0.020
icles+0.019
rim+0.017
icle+0.016
Neg logit contributions
bs-0.015
 cane-0.014
Scar-0.013
 arrows-0.013
 arrow-0.013
Token a
Token ID257
Feature activation-1.58e+00

Pos logit contributions
ator+0.054
urg+0.052
icles+0.051
rim+0.047
zie+0.046
Neg logit contributions
bs-0.022
 cane-0.021
Scar-0.019
 arrows-0.019
 arrow-0.018
Token snow
Token ID6729
Feature activation-2.25e+00

Pos logit contributions
ator+0.082
icles+0.078
urg+0.078
zie+0.069
rim+0.069
Neg logit contributions
 cane-0.059
bs-0.054
 string-0.054
 arrow-0.054
 stamp-0.053
Tokenman
Token ID805
Feature activation-3.67e-01

Pos logit contributions
ator+0.095
urg+0.095
icles+0.082
rim+0.081
zie+0.080
Neg logit contributions
 cane-0.090
bs-0.084
 stamp-0.084
 arrow-0.082
 string-0.080
Token.
Token ID13
Feature activation-8.12e-01

Pos logit contributions
ator+0.019
urg+0.019
icles+0.019
rim+0.017
icle+0.016
Neg logit contributions
bs-0.012
 cane-0.012
Scar-0.011
 arrows-0.010
 arrow-0.010
Token As
Token ID1081
Feature activation-4.47e-01

Pos logit contributions
ator+0.051
urg+0.051
icles+0.048
rim+0.045
icle+0.043
Neg logit contributions
bs-0.021
 cane-0.019
Scar-0.018
 arrows-0.016
powers-0.015
Token she
Token ID673
Feature activation-9.12e-01

Pos logit contributions
ator+0.032
urg+0.032
icles+0.031
rim+0.029
icle+0.028
Neg logit contributions
bs-0.007
 cane-0.006
Scar-0.005
 arrows-0.005
 arrow-0.004
Token picked
Token ID6497
Feature activation-9.07e-01

Pos logit contributions
ator+0.049
urg+0.049
icles+0.046
rim+0.042
zie+0.041
Neg logit contributions
bs-0.030
 cane-0.028
 string-0.026
Scar-0.026
 arrow-0.025
Token fire
Token ID2046
Feature activation-1.90e+00

Pos logit contributions
urg+0.074
icles+0.067
ator+0.066
rim+0.064
zie+0.060
Neg logit contributions
 cane-0.057
 string-0.052
 long-0.049
 arrow-0.048
bs-0.048
Token truck
Token ID7779
Feature activation-3.31e-01

Pos logit contributions
urg+0.084
ator+0.075
rim+0.073
icles+0.071
zie+0.068
Neg logit contributions
 cane-0.078
bs-0.072
 stamp-0.069
 string-0.069
 arrow-0.066
Token and
Token ID290
Feature activation+7.90e-01

Pos logit contributions
urg+0.019
ator+0.019
icles+0.018
rim+0.017
icle+0.016
Neg logit contributions
bs-0.009
 cane-0.009
Scar-0.008
 arrows-0.008
 arrow-0.007
Token notice
Token ID4003
Feature activation-1.86e-01

Pos logit contributions
Scar+0.032
bs+0.031
 cane+0.030
able+0.029
powers+0.029
Neg logit contributions
urg-0.033
icles-0.033
ator-0.033
icle-0.028
zie-0.027
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
urg+0.010
ator+0.010
icles+0.010
rim+0.009
icle+0.009
Neg logit contributions
bs-0.006
 cane-0.006
Scar-0.005
 arrows-0.005
 arrow-0.005
Token fire
Token ID2046
Feature activation-2.47e+00

Pos logit contributions
urg+0.071
ator+0.068
icles+0.065
rim+0.061
zie+0.061
Neg logit contributions
 cane-0.057
 string-0.052
bs-0.051
 long-0.051
 arrow-0.050
Tokenmen
Token ID3653
Feature activation-1.19e+00

Pos logit contributions
urg+0.108
ator+0.105
rim+0.099
zie+0.099
icles+0.096
Neg logit contributions
 cane-0.087
bs-0.086
 string-0.078
 stamp-0.078
 arrow-0.075
Token.
Token ID13
Feature activation-8.47e-01

Pos logit contributions
ator+0.061
urg+0.060
icles+0.057
rim+0.052
zie+0.051
Neg logit contributions
bs-0.042
 cane-0.040
Scar-0.037
 arrows-0.035
 stamp-0.035
Token The
Token ID383
Feature activation-1.87e+00

Pos logit contributions
ator+0.047
urg+0.046
icles+0.043
rim+0.041
zie+0.039
Neg logit contributions
bs-0.028
 cane-0.026
Scar-0.024
 arrows-0.022
 string-0.022
Token fire
Token ID2046
Feature activation-2.58e+00

Pos logit contributions
urg+0.097
ator+0.091
icles+0.089
rim+0.086
zie+0.084
Neg logit contributions
 cane-0.065
 string-0.061
 long-0.059
bs-0.058
 arrow-0.058
Tokenmen
Token ID3653
Feature activation-7.08e-01

Pos logit contributions
urg+0.094
ator+0.091
icles+0.085
rim+0.083
zie+0.081
Neg logit contributions
 cane-0.112
bs-0.111
 string-0.105
 arrow-0.103
 arrows-0.101