TOP ACTIVATIONS
MAX = 7.214

The town spe ople had to find a
things â col ouring books , stickers and
she bowed , the town spe ople stopped to admire her
so excited and when the mer maid saw him she gave
suddenly , a brightly - col oured belt caught her eye
! The town spe ople were sad and scared
so much that she quickly tid ied up her room .
you do ?!" Sad ie quickly looked up ,
her goal ! <|endoftext|> J imm ie was scared . He
took my brush and I tid ied my room . Is
?" Step han ie said , " I
be happy and well - beh aved . The end .
proud that he had finished tid ying up the bathroom -
date , a rainbow - col oured butterfly flew in through
gifts . Dis obe ying his mom , Bobby
time to fix the f auc et . So , Lily
demands , crab ," the mer maid said . " I
Tim grabbed the hook and anx iously put it in the
dried his hands and followed Jol ene . When Jol ene
ever and he thanked the mer maid for being so kind

MIN ACTIVATIONS
MIN = -5.733

, cats , birds , ham sters , and more .
, cats , birds , fish , and rabbits . But
see shells , rocks , seaw eed , crabs and fish
, vegetables , bread , cheese , and toys . Lily
. They were cheese and ham and lettuce . They were
saw fruits , flowers , bread , and cheese . L
saw trees , birds , fish and lots of animals .
, blue fish , yellow fish and green fish . They
their gear : helmets , pads and gloves . They knew
kept rocks , leaves , grass , and so much more
shells , rocks , seaw eed , crabs and fish .
see birds , flowers , trees and a big pond .
: fruits , vegetables , bread , cheese , and toys
. They saw ducks , fish , frogs and turtles in
, milk , eggs , cheese , and more . They
new words , like " ban ana " and " flower
see apples , bananas , bread , milk , eggs ,
saw birds , flowers , swings , and slides . They
. It is cheese and ham . Do you like cheese
. He sees flowers , grass , and a swing .

INTERVAL 3.977 - 7.214
CONTAINS 0.002%

, there lived a strong mer maid . She had beautiful
says , " Thank you , Daddy . I love you
, " I 'm sorry , grandma . I didn 't
The town spe ople had to find a
Joe was scared and he slammed the jar shut .

INTERVAL 0.740 - 3.977
CONTAINS 18.044%

picked it up and looked inside , but it was empty
John smiled and said " Yes please !". The
. " Hello , can we have ice -
a little girl named Lily . One day , she went
boy named Danny was feeling very worried . He had been

INTERVAL -2.496 - 0.740
CONTAINS 81.632%

. He looked up and saw a group of birds .
As they looked closer , they saw that it was a
saw something shiny in the trash . Tim flew
<|endoftext|> L ily and Ben are playing in the park .
old and loved to explore . Today , she was in

INTERVAL -5.733 - -2.496
CONTAINS 0.322%

They saw many animals , like lions , ze br as
puts his ball on the grass and opens his sandwich .
. I forgive you . Do you want to play with
hole . They use some grass to tie it . They
But the pony loved it ! It ne ighed
Token
Token ID220
Feature activation+2.32e-01

Pos logit contributions
taker+0.011
lic+0.011
eway+0.011
 mant+0.011
 Sus+0.011
Neg logit contributions
Ben-0.009
avi-0.009
Le-0.009
 might-0.009
 will-0.008
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
bel+0.008
imm+0.008
lic+0.008
spe+0.007
rated+0.007
Neg logit contributions
Ben-0.003
 might-0.003
Le-0.003
avi-0.003
 will-0.003
Token
Token ID198
Feature activation-3.30e-01

Pos logit contributions
 might+0.002
avi+0.002
 will+0.001
Ben+0.001
Le+0.001
Neg logit contributions
bel-0.003
lic-0.003
shadow-0.003
spe-0.003
eway-0.003
TokenThe
Token ID464
Feature activation+1.84e-01

Pos logit contributions
 might+0.007
 will+0.006
avi+0.006
Ben+0.006
 cones+0.006
Neg logit contributions
bel-0.009
spe-0.009
shadow-0.009
lic-0.009
eway-0.009
Token town
Token ID3240
Feature activation+1.93e+00

Pos logit contributions
bel+0.005
 mant+0.005
bul+0.004
spe+0.004
rated+0.004
Neg logit contributions
Ben-0.004
 might-0.004
Le-0.004
avi-0.004
 Leo-0.003
Tokenspe
Token ID4125
Feature activation+5.93e+00

Pos logit contributions
bel+0.043
imm+0.042
rated+0.042
 mant+0.042
lic+0.042
Neg logit contributions
Ben-0.046
 might-0.045
Le-0.043
avi-0.043
 will-0.043
Tokenople
Token ID643
Feature activation+1.04e+00

Pos logit contributions
lic+0.133
 mant+0.124
bel+0.124
eway+0.123
taker+0.120
Neg logit contributions
 might-0.141
Ben-0.141
avi-0.140
 will-0.136
Le-0.135
Token had
Token ID550
Feature activation+4.03e-01

Pos logit contributions
bel+0.031
imm+0.031
 mant+0.031
rated+0.031
bul+0.030
Neg logit contributions
 might-0.017
Ben-0.017
avi-0.016
 will-0.016
Le-0.016
Token to
Token ID284
Feature activation+8.65e-01

Pos logit contributions
bel+0.011
imm+0.011
rated+0.011
 mant+0.010
bul+0.010
Neg logit contributions
Ben-0.008
 might-0.008
Le-0.008
avi-0.008
 will-0.007
Token find
Token ID1064
Feature activation+3.71e-01

Pos logit contributions
bel+0.025
imm+0.024
rated+0.024
 mant+0.024
bul+0.024
Neg logit contributions
Ben-0.015
 might-0.015
Le-0.015
avi-0.014
 Rex-0.014
Token a
Token ID257
Feature activation-2.92e-01

Pos logit contributions
bel+0.010
rated+0.010
imm+0.010
lic+0.010
 mant+0.010
Neg logit contributions
Ben-0.007
 might-0.007
Le-0.007
avi-0.007
 Rex-0.007
Token things
Token ID1243
Feature activation+3.45e-01

Pos logit contributions
Ben+0.008
Le+0.008
avi+0.007
 might+0.007
 Rex+0.007
Neg logit contributions
bel-0.008
imm-0.008
lic-0.008
 mant-0.008
bul-0.008
Tokenâ
Token ID22940
Feature activation+1.32e+00

Pos logit contributions
rated+0.011
bel+0.010
imm+0.010
 mant+0.010
lic+0.010
Neg logit contributions
 might-0.006
Ben-0.005
 will-0.005
Le-0.005
 Rex-0.005
Token
Token ID26391
Feature activation-9.11e-01

Pos logit contributions
bel+0.023
imm+0.023
rated+0.023
 mant+0.022
lic+0.022
Neg logit contributions
Ben-0.038
 might-0.037
Le-0.037
avi-0.036
 Rex-0.036
Token
Token ID447
Feature activation-1.37e+00

Pos logit contributions
Ben+0.019
Le+0.019
 Remy+0.018
 might+0.018
avi+0.017
Neg logit contributions
bel-0.024
lic-0.023
spe-0.022
imm-0.022
hr-0.022
Token
Token ID251
Feature activation-4.71e-01

Pos logit contributions
Le+0.039
 Rex+0.038
 Leo+0.038
avi+0.037
Ben+0.037
Neg logit contributions
imm-0.026
lic-0.024
 Sus-0.023
bel-0.023
bul-0.023
Tokencol
Token ID4033
Feature activation+5.22e+00

Pos logit contributions
Ben+0.008
 might+0.008
Le+0.008
avi+0.008
 Rex+0.007
Neg logit contributions
bel-0.014
imm-0.013
rated-0.013
 mant-0.013
lic-0.013
Tokenouring
Token ID21823
Feature activation-9.74e-03

Pos logit contributions
bel+0.066
rated+0.059
spe+0.058
taker+0.057
 mant+0.056
Neg logit contributions
Ben-0.179
avi-0.175
 might-0.174
Le-0.173
itch-0.170
Token books
Token ID3835
Feature activation+6.55e-02

Pos logit contributions
Ben+0.000
 might+0.000
Le+0.000
avi+0.000
 Rex+0.000
Neg logit contributions
bel-0.000
rated-0.000
imm-0.000
 mant-0.000
lic-0.000
Token,
Token ID11
Feature activation+3.24e-01

Pos logit contributions
bel+0.002
rated+0.002
imm+0.002
 mant+0.002
lic+0.002
Neg logit contributions
Ben-0.001
 might-0.001
Le-0.001
avi-0.001
 will-0.001
Token stickers
Token ID28568
Feature activation-1.48e+00

Pos logit contributions
bel+0.008
imm+0.008
 mant+0.008
bul+0.008
lic+0.008
Neg logit contributions
Ben-0.007
Le-0.007
 might-0.007
avi-0.006
 will-0.006
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
Ben+0.020
 might+0.019
Le+0.019
avi+0.018
 Rex+0.017
Neg logit contributions
bel-0.049
imm-0.048
rated-0.048
 mant-0.048
lic-0.048
Token she
Token ID673
Feature activation-4.54e-01

Pos logit contributions
 Sus+0.010
spe+0.009
bul+0.009
bel+0.009
hr+0.009
Neg logit contributions
Ben-0.008
avi-0.008
ars-0.007
Le-0.007
oxy-0.007
Token bowed
Token ID35737
Feature activation-9.97e-01

Pos logit contributions
Le+0.010
Ben+0.010
avi+0.010
 Rex+0.009
 Remy+0.009
Neg logit contributions
bel-0.011
bul-0.011
imm-0.011
rated-0.010
lled-0.010
Token,
Token ID11
Feature activation+5.02e-01

Pos logit contributions
Ben+0.018
 might+0.017
Le+0.017
avi+0.017
 Rex+0.016
Neg logit contributions
bel-0.028
imm-0.028
rated-0.027
bul-0.027
lic-0.027
Token the
Token ID262
Feature activation+2.92e-01

Pos logit contributions
bel+0.015
bul+0.014
spe+0.014
 Sus+0.014
imm+0.014
Neg logit contributions
Ben-0.009
Le-0.009
avi-0.008
 might-0.008
 Rex-0.008
Token town
Token ID3240
Feature activation+1.44e+00

Pos logit contributions
bel+0.008
 mant+0.008
bul+0.008
rated+0.007
spe+0.007
Neg logit contributions
Ben-0.006
Le-0.006
avi-0.005
 might-0.005
 Remy-0.005
Tokenspe
Token ID4125
Feature activation+5.20e+00

Pos logit contributions
bel+0.041
spe+0.040
bul+0.039
rated+0.039
imm+0.039
Neg logit contributions
Ben-0.028
Le-0.027
avi-0.026
 Remy-0.025
 might-0.025
Tokenople
Token ID643
Feature activation+4.23e-01

Pos logit contributions
lic+0.124
eway+0.114
bel+0.112
taker+0.112
 mant+0.110
Neg logit contributions
 might-0.121
avi-0.121
Ben-0.119
 will-0.117
Le-0.114
Token stopped
Token ID5025
Feature activation-1.47e+00

Pos logit contributions
bel+0.012
imm+0.011
rated+0.011
bul+0.011
spe+0.011
Neg logit contributions
Ben-0.008
Le-0.008
 might-0.007
avi-0.007
 Rex-0.007
Token to
Token ID284
Feature activation+2.01e-01

Pos logit contributions
Ben+0.032
Le+0.031
 might+0.030
avi+0.030
 Rex+0.028
Neg logit contributions
bel-0.037
bul-0.036
imm-0.035
spe-0.035
 mant-0.035
Token admire
Token ID21099
Feature activation-1.41e-01

Pos logit contributions
bel+0.006
imm+0.006
rated+0.006
 mant+0.006
bul+0.006
Neg logit contributions
Ben-0.004
 might-0.003
Le-0.003
avi-0.003
 Rex-0.003
Token her
Token ID607
Feature activation-8.90e-02

Pos logit contributions
Ben+0.003
Le+0.003
 might+0.003
avi+0.003
 Rex+0.003
Neg logit contributions
bel-0.004
bul-0.004
imm-0.004
 mant-0.004
spe-0.004
Token so
Token ID523
Feature activation-8.78e-01

Pos logit contributions
rated+0.003
bel+0.003
imm+0.002
lic+0.002
ounded+0.002
Neg logit contributions
Ben-0.002
 might-0.002
 Leo-0.002
 will-0.002
Le-0.002
Token excited
Token ID6568
Feature activation-4.69e-01

Pos logit contributions
Ben+0.020
 might+0.020
Le+0.019
avi+0.019
 will+0.019
Neg logit contributions
rated-0.021
bel-0.020
imm-0.020
 mant-0.020
bul-0.020
Token and
Token ID290
Feature activation+3.44e-02

Pos logit contributions
Ben+0.008
 might+0.008
Le+0.007
avi+0.007
 Rex+0.007
Neg logit contributions
bel-0.014
imm-0.014
rated-0.014
 mant-0.014
bul-0.014
Token when
Token ID618
Feature activation-3.35e-01

Pos logit contributions
imm+0.001
bel+0.001
rated+0.001
bul+0.001
 mant+0.001
Neg logit contributions
Ben-0.001
 might-0.001
Le-0.001
avi-0.001
 will-0.001
Token the
Token ID262
Feature activation+3.08e-01

Pos logit contributions
Ben+0.006
avi+0.006
 might+0.006
Le+0.006
ars+0.006
Neg logit contributions
 Sus-0.009
bel-0.009
spe-0.009
eway-0.009
lled-0.009
Token mer
Token ID4017
Feature activation+4.89e+00

Pos logit contributions
imm+0.008
bel+0.008
rated+0.008
shadow+0.008
lic+0.008
Neg logit contributions
Ben-0.006
 might-0.006
Le-0.006
avi-0.006
 will-0.006
Tokenmaid
Token ID23151
Feature activation+6.41e-01

Pos logit contributions
shadow+0.101
 mant+0.092
rated+0.090
 causing+0.090
bul+0.089
Neg logit contributions
avi-0.125
 might-0.124
 will-0.119
Ben-0.118
 dove-0.113
Token saw
Token ID2497
Feature activation+9.26e-02

Pos logit contributions
bel+0.016
imm+0.015
lic+0.015
rated+0.015
spe+0.015
Neg logit contributions
Ben-0.014
Le-0.013
avi-0.013
 might-0.013
 Rex-0.013
Token him
Token ID683
Feature activation+1.06e-01

Pos logit contributions
bel+0.003
imm+0.003
rated+0.003
lic+0.002
 mant+0.002
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
avi-0.002
 will-0.002
Token she
Token ID673
Feature activation+4.15e-01

Pos logit contributions
bel+0.003
imm+0.003
rated+0.003
 mant+0.003
lic+0.003
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
avi-0.002
 Rex-0.002
Token gave
Token ID2921
Feature activation-1.74e+00

Pos logit contributions
bel+0.011
imm+0.011
rated+0.011
 mant+0.011
lic+0.011
Neg logit contributions
Ben-0.008
 might-0.008
Le-0.008
avi-0.008
 Rex-0.007
Token suddenly
Token ID6451
Feature activation+1.53e+00

Pos logit contributions
ars+0.008
avi+0.008
Ben+0.007
 might+0.007
athy+0.007
Neg logit contributions
 Sus-0.008
bel-0.008
 Jos-0.008
spe-0.007
hr-0.007
Token,
Token ID11
Feature activation+2.20e+00

Pos logit contributions
bel+0.044
lic+0.043
imm+0.043
rated+0.043
bul+0.043
Neg logit contributions
Ben-0.027
 might-0.025
Le-0.025
avi-0.025
 will-0.024
Token a
Token ID257
Feature activation+6.23e-01

Pos logit contributions
bel+0.064
imm+0.062
rated+0.062
lic+0.062
 mant+0.062
Neg logit contributions
Ben-0.038
 might-0.036
Le-0.036
avi-0.035
 will-0.034
Token brightly
Token ID35254
Feature activation+1.42e+00

Pos logit contributions
bel+0.017
lic+0.016
bul+0.016
imm+0.016
 mant+0.016
Neg logit contributions
Ben-0.012
Le-0.012
avi-0.012
 might-0.012
 Rex-0.011
Token-
Token ID12
Feature activation+2.29e+00

Pos logit contributions
shadow+0.038
rol+0.037
 mant+0.035
ounded+0.035
ount+0.035
Neg logit contributions
Ben-0.030
 dog-0.030
 might-0.029
 dove-0.028
ira-0.027
Tokencol
Token ID4033
Feature activation+4.88e+00

Pos logit contributions
rated+0.036
bel+0.036
imm+0.036
 mant+0.035
shadow+0.034
Neg logit contributions
Ben-0.071
Le-0.069
 might-0.068
avi-0.067
 Rex-0.066
Tokenoured
Token ID8167
Feature activation+1.30e-01

Pos logit contributions
bel+0.049
rated+0.046
lled+0.044
eway+0.044
spe+0.043
Neg logit contributions
Ben-0.178
 might-0.176
avi-0.174
Le-0.172
 Rex-0.170
Token belt
Token ID10999
Feature activation+1.12e+00

Pos logit contributions
bel+0.004
bul+0.004
imm+0.004
 mant+0.004
lic+0.004
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
avi-0.002
 Rex-0.002
Token caught
Token ID4978
Feature activation+8.60e-01

Pos logit contributions
imm+0.030
bel+0.030
rated+0.030
 mant+0.030
shadow+0.029
Neg logit contributions
 might-0.022
Ben-0.022
 will-0.021
 Rex-0.020
Le-0.020
Token her
Token ID607
Feature activation+3.78e-01

Pos logit contributions
lled+0.017
lling+0.017
bel+0.017
bul+0.017
imm+0.017
Neg logit contributions
Ben-0.022
 might-0.022
avi-0.021
Le-0.021
 will-0.020
Token eye
Token ID4151
Feature activation-8.98e-02

Pos logit contributions
bel+0.011
lic+0.011
imm+0.010
rated+0.010
lled+0.010
Neg logit contributions
Ben-0.007
 might-0.007
Le-0.006
avi-0.006
 will-0.006
Token!
Token ID0
Feature activation+4.25e-01

Pos logit contributions
 might+0.005
Ben+0.005
 will+0.005
Le+0.004
avi+0.004
Neg logit contributions
bel-0.011
lic-0.011
imm-0.011
 mant-0.011
ounded-0.011
Token
Token ID198
Feature activation+2.28e-01

Pos logit contributions
bel+0.013
lic+0.013
imm+0.013
 mant+0.013
 Sus+0.013
Neg logit contributions
Ben-0.006
 might-0.006
avi-0.006
Le-0.006
 will-0.006
Token
Token ID198
Feature activation-1.46e-01

Pos logit contributions
lic+0.007
bel+0.007
 Sus+0.007
shadow+0.007
Sus+0.007
Neg logit contributions
 might-0.004
avi-0.004
 will-0.003
 cones-0.003
Ben-0.003
TokenThe
Token ID464
Feature activation+1.47e-01

Pos logit contributions
 might+0.003
avi+0.003
 will+0.003
 Rex+0.003
 Leo+0.003
Neg logit contributions
bel-0.004
lic-0.004
shadow-0.004
spe-0.004
Sus-0.004
Token town
Token ID3240
Feature activation+1.93e+00

Pos logit contributions
bel+0.004
 mant+0.004
rated+0.004
bul+0.003
taker+0.003
Neg logit contributions
Ben-0.003
Le-0.003
avi-0.003
 might-0.003
 Rex-0.003
Tokenspe
Token ID4125
Feature activation+4.87e+00

Pos logit contributions
bel+0.049
imm+0.048
rated+0.048
 mant+0.047
lic+0.047
Neg logit contributions
Ben-0.041
 might-0.040
Le-0.039
avi-0.038
 will-0.037
Tokenople
Token ID643
Feature activation+1.46e+00

Pos logit contributions
lic+0.104
bel+0.103
eway+0.098
 mant+0.097
taker+0.097
Neg logit contributions
Ben-0.126
 might-0.122
Le-0.120
avi-0.120
 will-0.116
Token were
Token ID547
Feature activation-4.79e-01

Pos logit contributions
bel+0.040
imm+0.040
rated+0.039
 mant+0.039
lic+0.039
Neg logit contributions
Ben-0.028
 might-0.026
Le-0.026
avi-0.026
 Rex-0.025
Token sad
Token ID6507
Feature activation+2.89e-01

Pos logit contributions
 might+0.011
Ben+0.011
 Rex+0.011
 Leo+0.010
Le+0.010
Neg logit contributions
rated-0.012
bel-0.011
imm-0.011
bul-0.011
 mant-0.011
Token and
Token ID290
Feature activation+3.60e-01

Pos logit contributions
bel+0.009
imm+0.009
rated+0.009
bul+0.009
lic+0.009
Neg logit contributions
Ben-0.005
Le-0.004
 might-0.004
avi-0.004
 Rex-0.004
Token scared
Token ID12008
Feature activation+8.64e-01

Pos logit contributions
imm+0.009
bel+0.009
 mant+0.009
lic+0.009
rated+0.009
Neg logit contributions
 might-0.008
Ben-0.007
 Rex-0.007
 will-0.007
 Leo-0.007
Token so
Token ID523
Feature activation-5.32e-02

Pos logit contributions
bel+0.029
imm+0.028
rated+0.028
 mant+0.028
lic+0.028
Neg logit contributions
Ben-0.019
 might-0.018
Le-0.018
avi-0.017
 Rex-0.017
Token much
Token ID881
Feature activation+2.78e-01

Pos logit contributions
Ben+0.001
 might+0.001
Le+0.001
avi+0.001
 will+0.001
Neg logit contributions
bel-0.002
imm-0.002
rated-0.002
bul-0.002
 mant-0.002
Token that
Token ID326
Feature activation+1.24e+00

Pos logit contributions
bel+0.008
imm+0.008
rated+0.008
 mant+0.008
lic+0.008
Neg logit contributions
Ben-0.005
 might-0.005
Le-0.005
avi-0.005
 Rex-0.004
Token she
Token ID673
Feature activation+1.62e+00

Pos logit contributions
bel+0.038
imm+0.037
rated+0.037
 mant+0.036
bul+0.036
Neg logit contributions
Ben-0.020
 might-0.019
Le-0.019
avi-0.019
 Rex-0.017
Token quickly
Token ID2952
Feature activation+2.07e+00

Pos logit contributions
bel+0.042
rated+0.041
spe+0.040
imm+0.040
bul+0.040
Neg logit contributions
Ben-0.034
Le-0.034
avi-0.032
 Rex-0.031
 Leo-0.031
Token tid
Token ID29770
Feature activation+4.73e+00

Pos logit contributions
bel+0.054
imm+0.054
rated+0.053
lic+0.053
 mant+0.052
Neg logit contributions
Ben-0.042
 might-0.041
Le-0.039
avi-0.039
 will-0.039
Tokenied
Token ID798
Feature activation+1.70e+00

Pos logit contributions
 mant+0.074
lic+0.070
 glimpse+0.070
hr+0.067
imm+0.066
Neg logit contributions
 will-0.143
Ben-0.140
 might-0.138
avi-0.134
itch-0.133
Token up
Token ID510
Feature activation+2.72e-01

Pos logit contributions
rated+0.038
imm+0.038
lic+0.038
bel+0.037
bul+0.037
Neg logit contributions
Ben-0.040
 might-0.040
Le-0.038
 Rex-0.038
 will-0.038
Token her
Token ID607
Feature activation+1.30e+00

Pos logit contributions
bel+0.007
imm+0.007
rated+0.007
lic+0.007
 mant+0.007
Neg logit contributions
Ben-0.005
 might-0.005
Le-0.005
avi-0.005
 Rex-0.005
Token room
Token ID2119
Feature activation+1.36e+00

Pos logit contributions
lic+0.035
rated+0.034
imm+0.034
lled+0.033
bel+0.033
Neg logit contributions
 might-0.026
Ben-0.025
 will-0.024
avi-0.024
Le-0.023
Token.
Token ID13
Feature activation+7.26e-01

Pos logit contributions
rated+0.044
imm+0.044
lic+0.043
lled+0.043
shadow+0.043
Neg logit contributions
 might-0.019
Ben-0.018
 will-0.017
avi-0.016
 Leo-0.016
Token you
Token ID345
Feature activation+1.46e+00

Pos logit contributions
bel+0.003
 Sus+0.003
hr+0.003
imm+0.003
 mant+0.003
Neg logit contributions
Ben-0.001
avi-0.001
Le-0.001
 might-0.001
 will-0.001
Token do
Token ID466
Feature activation+1.05e+00

Pos logit contributions
bel+0.052
imm+0.051
rated+0.051
 mant+0.051
lic+0.051
Neg logit contributions
Ben-0.016
Le-0.015
 might-0.015
avi-0.014
 Rex-0.013
Token?!"
Token ID30823
Feature activation+2.62e-01

Pos logit contributions
bel+0.031
spe+0.031
rated+0.031
bul+0.031
 mant+0.030
Neg logit contributions
Ben-0.019
 might-0.018
 will-0.018
 cannot-0.017
 Rex-0.016
Token
Token ID198
Feature activation+1.17e+00

Pos logit contributions
bel+0.005
imm+0.004
rated+0.004
 mant+0.004
lic+0.004
Neg logit contributions
Ben-0.008
 might-0.008
Le-0.007
avi-0.007
 Rex-0.007
Token
Token ID198
Feature activation+8.02e-01

Pos logit contributions
bel+0.041
imm+0.040
rated+0.040
lic+0.040
 mant+0.040
Neg logit contributions
Ben-0.014
 might-0.013
Le-0.013
avi-0.012
 Rex-0.011
TokenSad
Token ID26699
Feature activation+4.70e+00

Pos logit contributions
bel+0.016
imm+0.015
rated+0.015
 mant+0.015
spe+0.015
Neg logit contributions
Ben-0.022
 might-0.021
Le-0.021
avi-0.020
 will-0.020
Tokenie
Token ID494
Feature activation+1.24e+00

Pos logit contributions
bel+0.145
ount+0.136
 unatt+0.136
ounded+0.135
 mant+0.134
Neg logit contributions
 and-0.072
 might-0.071
ira-0.069
 Leo-0.069
 Rex-0.067
Token quickly
Token ID2952
Feature activation+1.03e+00

Pos logit contributions
bel+0.031
lic+0.030
imm+0.030
rated+0.030
spe+0.030
Neg logit contributions
Ben-0.026
Le-0.025
 might-0.025
avi-0.025
 will-0.024
Token looked
Token ID3114
Feature activation+2.54e+00

Pos logit contributions
bel+0.028
rated+0.027
imm+0.027
 mant+0.027
bul+0.027
Neg logit contributions
Ben-0.021
 might-0.020
Le-0.019
avi-0.019
 Rex-0.019
Token up
Token ID510
Feature activation+4.11e-03

Pos logit contributions
imm+0.064
bel+0.062
spe+0.062
eway+0.060
bul+0.059
Neg logit contributions
 might-0.060
 will-0.057
 cannot-0.055
 Rex-0.055
Ben-0.054
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
bel+0.000
imm+0.000
rated+0.000
lic+0.000
 mant+0.000
Neg logit contributions
Ben-0.000
 might-0.000
Le-0.000
avi-0.000
 Rex-0.000
Token her
Token ID607
Feature activation+5.51e-01

Pos logit contributions
bel+0.033
imm+0.032
rated+0.032
 mant+0.032
lic+0.031
Neg logit contributions
Ben-0.026
 might-0.025
Le-0.025
avi-0.024
 Rex-0.023
Token goal
Token ID3061
Feature activation-5.14e-01

Pos logit contributions
bel+0.018
lic+0.018
imm+0.018
rated+0.017
bul+0.017
Neg logit contributions
Ben-0.008
 might-0.008
Le-0.007
 will-0.007
avi-0.007
Token!
Token ID0
Feature activation+7.68e-01

Pos logit contributions
Ben+0.010
Le+0.010
avi+0.009
 might+0.009
 Rex+0.009
Neg logit contributions
bel-0.014
imm-0.014
 mant-0.013
bul-0.013
rated-0.013
Token<|endoftext|>
Token ID50256
Feature activation+2.07e+00

Pos logit contributions
lic+0.024
bel+0.024
imm+0.024
taker+0.024
 Sus+0.024
Neg logit contributions
Ben-0.010
 might-0.010
avi-0.010
Le-0.010
 will-0.009
TokenJ
Token ID41
Feature activation+1.25e+00

Pos logit contributions
lic+0.053
bel+0.052
imm+0.052
 Sus+0.051
eway+0.051
Neg logit contributions
avi-0.041
Ben-0.040
 might-0.039
Le-0.039
 Rex-0.036
Tokenimm
Token ID8608
Feature activation+4.68e+00

Pos logit contributions
bel+0.022
imm+0.021
lic+0.021
rated+0.021
spe+0.020
Neg logit contributions
Ben-0.036
 might-0.036
Le-0.035
avi-0.035
 Rex-0.034
Tokenie
Token ID494
Feature activation+1.58e+00

Pos logit contributions
 mant+0.099
bul+0.099
shadow+0.098
bel+0.098
lic+0.097
Neg logit contributions
Ben-0.123
 might-0.121
 will-0.114
avi-0.113
Le-0.112
Token was
Token ID373
Feature activation+7.12e-01

Pos logit contributions
bel+0.043
imm+0.042
rated+0.042
 mant+0.042
spe+0.041
Neg logit contributions
Ben-0.031
Le-0.030
avi-0.029
 might-0.028
 Rex-0.027
Token scared
Token ID12008
Feature activation+1.27e+00

Pos logit contributions
rated+0.022
bel+0.022
ounded+0.022
shadow+0.021
imm+0.021
Neg logit contributions
Ben-0.012
 might-0.010
Le-0.010
 Leo-0.010
 Rex-0.010
Token.
Token ID13
Feature activation+2.38e+00

Pos logit contributions
bel+0.039
imm+0.038
rated+0.037
 mant+0.037
lic+0.037
Neg logit contributions
Ben-0.021
 might-0.020
Le-0.019
avi-0.019
 will-0.019
Token He
Token ID679
Feature activation+3.33e-02

Pos logit contributions
bel+0.078
imm+0.077
rated+0.077
 mant+0.076
lic+0.076
Neg logit contributions
Ben-0.032
 might-0.030
Le-0.030
avi-0.029
 will-0.028
Token took
Token ID1718
Feature activation+1.41e+00

Pos logit contributions
 mant+0.031
bel+0.030
imm+0.030
bul+0.030
spe+0.029
Neg logit contributions
 might-0.017
 will-0.016
Ben-0.016
 cannot-0.015
avi-0.013
Token my
Token ID616
Feature activation+1.05e+00

Pos logit contributions
bel+0.046
imm+0.046
 mant+0.045
rated+0.045
bul+0.045
Neg logit contributions
Ben-0.019
 might-0.018
Le-0.018
avi-0.017
 Rex-0.016
Token brush
Token ID14093
Feature activation+6.99e-01

Pos logit contributions
bel+0.028
imm+0.028
rated+0.028
lic+0.027
spe+0.027
Neg logit contributions
Ben-0.021
 might-0.021
Le-0.020
 will-0.020
avi-0.019
Token and
Token ID290
Feature activation+9.75e-01

Pos logit contributions
lic+0.020
imm+0.020
rated+0.020
bel+0.020
lled+0.020
Neg logit contributions
Ben-0.012
 might-0.011
 will-0.011
Le-0.010
 Rex-0.010
Token I
Token ID314
Feature activation+9.76e-01

Pos logit contributions
bel+0.028
imm+0.027
rated+0.027
lic+0.027
 mant+0.027
Neg logit contributions
Ben-0.018
 might-0.017
Le-0.017
avi-0.016
 will-0.016
Token tid
Token ID29770
Feature activation+4.67e+00

Pos logit contributions
 mant+0.029
bel+0.029
imm+0.029
bul+0.028
lling+0.028
Neg logit contributions
 might-0.017
Ben-0.016
 will-0.016
 cannot-0.014
avi-0.014
Tokenied
Token ID798
Feature activation+2.36e+00

Pos logit contributions
 glimpse+0.075
hr+0.072
shadow+0.072
 mant+0.072
lic+0.072
Neg logit contributions
 will-0.140
Ben-0.132
 might-0.131
 him-0.128
itch-0.127
Token my
Token ID616
Feature activation+7.00e-01

Pos logit contributions
bel+0.068
imm+0.068
rated+0.068
lic+0.068
spe+0.066
Neg logit contributions
Ben-0.042
 might-0.041
 will-0.038
 Rex-0.038
Le-0.038
Token room
Token ID2119
Feature activation+1.71e+00

Pos logit contributions
bel+0.017
 mant+0.016
imm+0.016
rated+0.016
bul+0.016
Neg logit contributions
Ben-0.016
Le-0.015
 might-0.015
avi-0.015
 Rex-0.014
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
imm+0.049
lic+0.048
lled+0.048
rated+0.047
bul+0.047
Neg logit contributions
 will-0.031
 might-0.031
Ben-0.029
 cannot-0.027
 cones-0.025
Token Is
Token ID1148
Feature activation+4.74e-01

Pos logit contributions
bel+0.041
imm+0.040
rated+0.040
 mant+0.040
lic+0.040
Neg logit contributions
Ben-0.026
 might-0.024
Le-0.024
avi-0.024
 Rex-0.023
Token?"
Token ID1701
Feature activation-1.33e+00

Pos logit contributions
Ben+0.002
 might+0.002
 will+0.002
avi+0.002
Le+0.002
Neg logit contributions
bel-0.003
 mant-0.003
imm-0.003
lic-0.003
spe-0.003
Token
Token ID220
Feature activation+1.57e-01

Pos logit contributions
Ben+0.030
Le+0.028
itch+0.027
avi+0.027
 will+0.027
Neg logit contributions
 Sus-0.034
bul-0.030
 handing-0.029
any-0.029
shadow-0.029
Token
Token ID198
Feature activation-3.77e-01

Pos logit contributions
bel+0.005
imm+0.005
lic+0.005
rated+0.005
 mant+0.005
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
avi-0.002
 will-0.002
Token
Token ID198
Feature activation-7.98e-01

Pos logit contributions
 might+0.006
 will+0.006
avi+0.006
Ben+0.006
 cones+0.005
Neg logit contributions
 Sus-0.011
shadow-0.011
bel-0.011
lic-0.011
Sus-0.011
TokenStep
Token ID8600
Feature activation+1.10e-01

Pos logit contributions
 might+0.017
 Leo+0.016
itch+0.016
 will+0.015
 Rex+0.015
Neg logit contributions
bel-0.020
shadow-0.020
Sus-0.020
spe-0.020
lic-0.019
Tokenhan
Token ID7637
Feature activation+4.65e+00

Pos logit contributions
bel+0.002
rated+0.002
 mant+0.002
imm+0.002
bul+0.002
Neg logit contributions
Ben-0.003
 might-0.003
Le-0.003
avi-0.003
 will-0.003
Tokenie
Token ID494
Feature activation+9.02e-01

Pos logit contributions
bel+0.128
lic+0.119
shadow+0.118
 energ+0.116
 mant+0.114
Neg logit contributions
 might-0.096
Ben-0.095
 will-0.090
avi-0.087
Le-0.082
Token said
Token ID531
Feature activation+4.17e-01

Pos logit contributions
bel+0.021
spe+0.021
imm+0.021
rated+0.021
lic+0.021
Neg logit contributions
Ben-0.020
Le-0.020
avi-0.019
 might-0.019
 Rex-0.018
Token,
Token ID11
Feature activation+1.24e+00

Pos logit contributions
rated+0.015
imm+0.015
bel+0.015
lled+0.015
 mant+0.015
Neg logit contributions
Ben-0.004
 might-0.004
 will-0.004
Le-0.004
 Leo-0.004
Token "
Token ID366
Feature activation-2.89e-02

Pos logit contributions
imm+0.034
rated+0.033
bel+0.033
lled+0.032
lling+0.032
Neg logit contributions
Ben-0.025
 might-0.024
Le-0.024
 Leo-0.023
 Rex-0.023
TokenI
Token ID40
Feature activation+6.39e-02

Pos logit contributions
Ben+0.000
 might+0.000
avi+0.000
 Rex+0.000
Le+0.000
Neg logit contributions
bel-0.001
spe-0.001
imm-0.001
lic-0.001
bul-0.001
Token be
Token ID307
Feature activation+1.35e+00

Pos logit contributions
bel+0.035
 mant+0.034
rated+0.034
imm+0.034
bul+0.034
Neg logit contributions
Le-0.029
Ben-0.028
 Rex-0.026
avi-0.026
 Remy-0.026
Token happy
Token ID3772
Feature activation+4.80e-01

Pos logit contributions
rated+0.036
eway+0.035
 mant+0.035
ounded+0.035
imm+0.034
Neg logit contributions
 will-0.028
Ben-0.026
 might-0.026
 Rex-0.025
 Leo-0.024
Token and
Token ID290
Feature activation+2.03e+00

Pos logit contributions
bel+0.014
rated+0.014
imm+0.014
lic+0.013
 mant+0.013
Neg logit contributions
Ben-0.009
 might-0.008
Le-0.008
 will-0.008
avi-0.008
Token well
Token ID880
Feature activation+5.21e-01

Pos logit contributions
imm+0.053
bel+0.052
rated+0.051
lic+0.051
bul+0.051
Neg logit contributions
Ben-0.041
 might-0.041
 will-0.039
Le-0.039
avi-0.038
Token-
Token ID12
Feature activation+2.48e+00

Pos logit contributions
bel+0.016
imm+0.015
rated+0.015
 mant+0.015
lic+0.015
Neg logit contributions
Ben-0.009
 might-0.008
Le-0.008
avi-0.008
 will-0.008
Tokenbeh
Token ID20709
Feature activation+4.58e+00

Pos logit contributions
 mant+0.041
imm+0.040
bul+0.040
rated+0.039
eway+0.039
Neg logit contributions
Ben-0.073
 might-0.072
Le-0.071
avi-0.070
 will-0.070
Tokenaved
Token ID9586
Feature activation+2.11e+00

Pos logit contributions
bel+0.112
 mant+0.108
eway+0.106
hr+0.105
imm+0.105
Neg logit contributions
 might-0.099
 will-0.096
Ben-0.095
ars-0.092
 cannot-0.092
Token.
Token ID13
Feature activation+2.17e-01

Pos logit contributions
bel+0.060
shadow+0.060
imm+0.060
 mant+0.060
lic+0.060
Neg logit contributions
 might-0.036
Ben-0.036
 will-0.034
Le-0.032
 Rex-0.032
Token The
Token ID383
Feature activation+6.53e-01

Pos logit contributions
 Sus+0.006
taker+0.006
spe+0.006
bel+0.006
bul+0.006
Neg logit contributions
Ben-0.003
avi-0.003
 might-0.003
Le-0.003
 will-0.003
Token end
Token ID886
Feature activation-2.22e-01

Pos logit contributions
bel+0.019
imm+0.019
rated+0.019
lic+0.019
 mant+0.019
Neg logit contributions
Ben-0.012
 might-0.011
Le-0.011
avi-0.011
 will-0.010
Token.
Token ID13
Feature activation-3.83e-02

Pos logit contributions
Ben+0.004
 might+0.003
Le+0.003
avi+0.003
 will+0.003
Neg logit contributions
bel-0.007
rated-0.007
imm-0.007
 mant-0.007
lic-0.007
Token proud
Token ID6613
Feature activation-7.50e-02

Pos logit contributions
Ben+0.017
 might+0.016
avi+0.016
Le+0.015
 will+0.015
Neg logit contributions
imm-0.017
bul-0.017
rated-0.017
 mant-0.017
bel-0.017
Token that
Token ID326
Feature activation+1.33e+00

Pos logit contributions
Ben+0.001
 might+0.001
 Rex+0.001
 Leo+0.001
 will+0.001
Neg logit contributions
rated-0.002
 mant-0.002
imm-0.002
bel-0.002
shadow-0.002
Token he
Token ID339
Feature activation+1.47e+00

Pos logit contributions
bel+0.040
spe+0.040
bul+0.039
taker+0.039
 mant+0.039
Neg logit contributions
Ben-0.022
Le-0.020
 might-0.020
avi-0.020
 Rex-0.018
Token had
Token ID550
Feature activation+1.48e+00

Pos logit contributions
bel+0.043
imm+0.043
 mant+0.043
rated+0.042
bul+0.042
Neg logit contributions
Ben-0.026
 might-0.025
 will-0.024
Le-0.023
avi-0.023
Token finished
Token ID5201
Feature activation+8.82e-01

Pos logit contributions
imm+0.039
rated+0.039
bel+0.039
bul+0.038
 mant+0.038
Neg logit contributions
Ben-0.031
 might-0.030
 will-0.029
Le-0.028
avi-0.027
Token tid
Token ID29770
Feature activation+4.54e+00

Pos logit contributions
rated+0.023
bel+0.023
imm+0.023
lic+0.023
ounded+0.022
Neg logit contributions
Ben-0.018
 might-0.018
avi-0.017
 will-0.017
 Rex-0.017
Tokenying
Token ID1112
Feature activation+2.58e+00

Pos logit contributions
 glimpse+0.073
 mant+0.071
lic+0.068
shadow+0.067
rated+0.063
Neg logit contributions
 will-0.134
 might-0.131
 him-0.129
Ben-0.129
avi-0.128
Token up
Token ID510
Feature activation+2.11e-01

Pos logit contributions
bel+0.053
imm+0.052
rated+0.052
lic+0.051
 mant+0.051
Neg logit contributions
Ben-0.067
 might-0.065
Le-0.064
avi-0.063
 Rex-0.062
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
imm+0.006
rated+0.006
bel+0.006
lic+0.006
 mant+0.006
Neg logit contributions
Ben-0.004
 might-0.004
Le-0.004
avi-0.004
 will-0.004
Token bathroom
Token ID12436
Feature activation+2.49e+00

Pos logit contributions
imm+0.036
lic+0.035
bel+0.035
shadow+0.034
lled+0.034
Neg logit contributions
Ben-0.028
 might-0.028
 will-0.027
Le-0.026
avi-0.025
Token -
Token ID532
Feature activation+9.18e-01

Pos logit contributions
rated+0.078
imm+0.078
lled+0.078
shadow+0.078
bel+0.077
Neg logit contributions
 might-0.036
Ben-0.035
 will-0.034
avi-0.031
 Rex-0.030
Token date
Token ID3128
Feature activation+1.68e-01

Pos logit contributions
imm+0.010
bel+0.010
lic+0.010
shadow+0.010
rated+0.010
Neg logit contributions
 might-0.010
Ben-0.010
 will-0.010
Le-0.009
avi-0.009
Token,
Token ID11
Feature activation+9.07e-01

Pos logit contributions
bel+0.005
imm+0.005
bul+0.005
lic+0.005
 mant+0.005
Neg logit contributions
Ben-0.003
Le-0.003
avi-0.003
 might-0.003
 Remy-0.003
Token a
Token ID257
Feature activation+4.92e-01

Pos logit contributions
imm+0.029
rated+0.028
bel+0.028
lic+0.028
 mant+0.028
Neg logit contributions
 might-0.014
Ben-0.014
Le-0.012
 will-0.012
 Rex-0.012
Token rainbow
Token ID27223
Feature activation+1.20e+00

Pos logit contributions
bel+0.012
imm+0.011
rated+0.011
 mant+0.011
lic+0.011
Neg logit contributions
Ben-0.011
 might-0.011
Le-0.011
avi-0.011
 Rex-0.010
Token-
Token ID12
Feature activation+1.61e+00

Pos logit contributions
shadow+0.033
rated+0.032
 mant+0.032
ounded+0.031
bel+0.030
Neg logit contributions
 might-0.024
 will-0.023
Ben-0.023
 dove-0.021
Le-0.021
Tokencol
Token ID4033
Feature activation+4.54e+00

Pos logit contributions
rated+0.028
imm+0.027
bel+0.027
 mant+0.027
shadow+0.026
Neg logit contributions
Ben-0.048
Le-0.046
 might-0.046
avi-0.045
 will-0.044
Tokenoured
Token ID8167
Feature activation-3.05e-01

Pos logit contributions
bel+0.052
rated+0.049
taker+0.047
eway+0.047
spe+0.047
Neg logit contributions
Ben-0.159
 might-0.157
avi-0.155
Le-0.152
 will-0.151
Token butterfly
Token ID35113
Feature activation+1.80e-01

Pos logit contributions
Ben+0.006
 might+0.006
Le+0.006
avi+0.005
 will+0.005
Neg logit contributions
bel-0.008
imm-0.008
rated-0.008
lic-0.008
 mant-0.008
Token flew
Token ID13112
Feature activation-6.76e-01

Pos logit contributions
bel+0.005
rated+0.005
imm+0.005
 mant+0.005
spe+0.005
Neg logit contributions
Ben-0.003
 might-0.003
Le-0.003
 will-0.003
avi-0.003
Token in
Token ID287
Feature activation+2.98e-01

Pos logit contributions
 might+0.019
 will+0.018
Ben+0.018
Le+0.018
 Rex+0.017
Neg logit contributions
bel-0.013
rated-0.013
 mant-0.013
imm-0.012
shadow-0.012
Token through
Token ID832
Feature activation+1.30e+00

Pos logit contributions
bel+0.008
bul+0.008
 mant+0.008
imm+0.008
rated+0.008
Neg logit contributions
Ben-0.006
avi-0.006
Le-0.006
 might-0.006
 Rex-0.005
Token gifts
Token ID13201
Feature activation+8.17e-01

Pos logit contributions
shadow+0.036
imm+0.035
bel+0.034
rated+0.034
lic+0.034
Neg logit contributions
 might-0.026
 will-0.025
Ben-0.024
 dog-0.024
 cones-0.023
Token.
Token ID13
Feature activation+9.82e-02

Pos logit contributions
bel+0.022
imm+0.022
rated+0.022
lic+0.022
bul+0.022
Neg logit contributions
Ben-0.016
Le-0.015
 might-0.015
avi-0.015
 Rex-0.014
Token
Token ID198
Feature activation+6.45e-01

Pos logit contributions
bel+0.003
rated+0.003
imm+0.003
lled+0.003
spe+0.003
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
 Rex-0.002
avi-0.002
Token
Token ID198
Feature activation+6.69e-01

Pos logit contributions
lic+0.021
bel+0.021
imm+0.020
shadow+0.020
 Sus+0.020
Neg logit contributions
Ben-0.009
avi-0.009
 might-0.008
Le-0.008
 will-0.008
TokenDis
Token ID7279
Feature activation+5.12e-01

Pos logit contributions
bel+0.020
imm+0.020
rated+0.020
bul+0.019
 mant+0.019
Neg logit contributions
Ben-0.012
 might-0.011
Le-0.011
 will-0.010
 Rex-0.010
Tokenobe
Token ID5910
Feature activation+4.51e+00

Pos logit contributions
bel+0.010
imm+0.010
rated+0.010
 mant+0.009
lic+0.009
Neg logit contributions
Ben-0.014
 might-0.014
Le-0.013
avi-0.013
 Rex-0.013
Tokenying
Token ID1112
Feature activation+1.55e+00

Pos logit contributions
bel+0.058
imm+0.054
shadow+0.052
 mant+0.051
lic+0.050
Neg logit contributions
 might-0.153
Ben-0.149
avi-0.146
 will-0.145
Le-0.143
Token his
Token ID465
Feature activation-1.27e+00

Pos logit contributions
bel+0.036
imm+0.035
rated+0.035
 mant+0.035
lic+0.035
Neg logit contributions
Ben-0.037
Le-0.035
 might-0.034
avi-0.034
 will-0.033
Token mom
Token ID1995
Feature activation+8.90e-02

Pos logit contributions
Ben+0.023
Le+0.023
 might+0.021
avi+0.020
 Remy+0.020
Neg logit contributions
bel-0.037
 mant-0.036
spe-0.036
imm-0.035
 Sus-0.035
Token,
Token ID11
Feature activation+8.80e-01

Pos logit contributions
bel+0.003
imm+0.003
rated+0.003
lic+0.003
bul+0.003
Neg logit contributions
Ben-0.002
Le-0.001
 might-0.001
avi-0.001
 Rex-0.001
Token Bobby
Token ID17155
Feature activation+7.94e-01

Pos logit contributions
bel+0.021
imm+0.021
rated+0.021
 mant+0.020
lic+0.020
Neg logit contributions
Ben-0.020
 might-0.019
Le-0.019
avi-0.019
 Rex-0.018
Token time
Token ID640
Feature activation+2.76e-01

Pos logit contributions
bel+0.039
imm+0.039
rated+0.038
lic+0.038
 mant+0.038
Neg logit contributions
 might-0.025
Ben-0.025
 will-0.023
Le-0.023
 Rex-0.023
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
bel+0.008
imm+0.008
rated+0.007
 mant+0.007
lic+0.007
Neg logit contributions
Ben-0.005
 might-0.005
Le-0.005
avi-0.005
 will-0.005
Token fix
Token ID4259
Feature activation+1.54e+00

Pos logit contributions
bel+0.044
 mant+0.044
rated+0.043
 Sus+0.043
bul+0.042
Neg logit contributions
Ben-0.030
Le-0.029
 might-0.029
avi-0.028
 will-0.026
Token the
Token ID262
Feature activation+1.72e+00

Pos logit contributions
bel+0.041
bul+0.041
 Sus+0.040
 mant+0.040
imm+0.040
Neg logit contributions
Ben-0.030
Le-0.029
avi-0.028
 might-0.027
 Leo-0.025
Token f
Token ID277
Feature activation+3.85e+00

Pos logit contributions
bel+0.047
imm+0.046
rated+0.046
 mant+0.046
bul+0.046
Neg logit contributions
Ben-0.033
 might-0.031
Le-0.031
avi-0.031
 Rex-0.029
Tokenauc
Token ID14272
Feature activation+4.50e+00

Pos logit contributions
 mant+0.076
shadow+0.075
lled+0.074
rated+0.073
eway+0.066
Neg logit contributions
 will-0.098
ars-0.097
Ben-0.094
avi-0.094
 might-0.094
Tokenet
Token ID316
Feature activation+1.57e+00

Pos logit contributions
shadow+0.121
 mant+0.114
eway+0.111
spe+0.110
hr+0.110
Neg logit contributions
 will-0.097
 might-0.093
ars-0.086
Ben-0.085
avi-0.084
Token.
Token ID13
Feature activation+8.10e-02

Pos logit contributions
bel+0.048
rated+0.047
imm+0.047
 mant+0.047
lic+0.047
Neg logit contributions
 might-0.025
Ben-0.025
 will-0.023
avi-0.022
 Rex-0.022
Token So
Token ID1406
Feature activation-1.89e-01

Pos logit contributions
 Sus+0.002
spe+0.002
taker+0.002
shadow+0.002
lic+0.002
Neg logit contributions
 might-0.001
avi-0.001
Le-0.001
Ben-0.001
 will-0.001
Token,
Token ID11
Feature activation+9.50e-01

Pos logit contributions
Ben+0.004
Le+0.004
avi+0.004
 might+0.003
ars+0.003
Neg logit contributions
 Sus-0.005
bel-0.005
spe-0.005
 mant-0.005
shadow-0.005
Token Lily
Token ID20037
Feature activation-2.17e-01

Pos logit contributions
bel+0.027
 Sus+0.026
bul+0.026
spe+0.026
 mant+0.026
Neg logit contributions
Ben-0.017
avi-0.017
Le-0.017
 might-0.017
 will-0.015
Token demands
Token ID8665
Feature activation+1.99e-01

Pos logit contributions
bel+0.018
 mant+0.017
imm+0.017
rated+0.017
spe+0.017
Neg logit contributions
Ben-0.014
Le-0.013
 might-0.013
avi-0.013
 will-0.012
Token,
Token ID11
Feature activation+1.86e+00

Pos logit contributions
bel+0.005
rated+0.005
imm+0.005
 mant+0.005
lic+0.005
Neg logit contributions
Ben-0.004
Le-0.004
 might-0.004
avi-0.004
 Rex-0.004
Token crab
Token ID32202
Feature activation-4.02e-01

Pos logit contributions
bel+0.050
imm+0.049
rated+0.049
 mant+0.048
lic+0.048
Neg logit contributions
Ben-0.037
 might-0.035
Le-0.035
avi-0.034
 Rex-0.033
Token,"
Token ID553
Feature activation-7.85e-01

Pos logit contributions
 might+0.007
Ben+0.007
Le+0.007
 will+0.007
avi+0.007
Neg logit contributions
bel-0.011
imm-0.011
 mant-0.011
bul-0.011
rated-0.011
Token the
Token ID262
Feature activation+4.17e-01

Pos logit contributions
Ben+0.015
avi+0.014
Le+0.014
 might+0.014
 will+0.014
Neg logit contributions
bel-0.021
taker-0.021
 Sus-0.021
bul-0.020
spe-0.020
Token mer
Token ID4017
Feature activation+4.48e+00

Pos logit contributions
imm+0.011
rated+0.011
bul+0.011
shadow+0.011
bel+0.011
Neg logit contributions
Ben-0.009
 might-0.009
Le-0.009
 dog-0.008
 will-0.008
Tokenmaid
Token ID23151
Feature activation+1.55e+00

Pos logit contributions
shadow+0.087
 causing+0.086
bul+0.086
 mant+0.083
 unatt+0.082
Neg logit contributions
avi-0.117
Ben-0.108
 might-0.107
itch-0.106
Le-0.102
Token said
Token ID531
Feature activation+8.38e-01

Pos logit contributions
bel+0.050
rated+0.049
 mant+0.049
imm+0.048
bul+0.048
Neg logit contributions
Ben-0.023
 might-0.022
Le-0.021
avi-0.021
 Leo-0.020
Token.
Token ID13
Feature activation+5.37e-01

Pos logit contributions
rated+0.027
imm+0.027
bel+0.026
lic+0.026
 mant+0.026
Neg logit contributions
 might-0.012
Ben-0.012
Le-0.011
 Leo-0.011
avi-0.011
Token "
Token ID366
Feature activation-2.03e-01

Pos logit contributions
 Sus+0.014
lic+0.014
bel+0.014
 mant+0.014
taker+0.013
Neg logit contributions
Ben-0.011
 might-0.010
avi-0.010
Le-0.010
 will-0.010
TokenI
Token ID40
Feature activation-3.11e-01

Pos logit contributions
Ben+0.004
Le+0.004
 might+0.004
avi+0.004
 Rex+0.003
Neg logit contributions
bel-0.006
imm-0.006
rated-0.006
 mant-0.006
lic-0.005
TokenTim
Token ID14967
Feature activation+5.09e-01

Pos logit contributions
Ben+0.002
Tom+0.002
 might+0.002
Le+0.002
They+0.001
Neg logit contributions
rated-0.005
imm-0.005
bul-0.005
 mant-0.005
taker-0.005
Token grabbed
Token ID13176
Feature activation+8.50e-01

Pos logit contributions
rated+0.012
imm+0.012
bel+0.012
spe+0.012
lic+0.011
Neg logit contributions
Ben-0.012
Le-0.011
 might-0.011
avi-0.011
 Rex-0.011
Token the
Token ID262
Feature activation-6.69e-01

Pos logit contributions
rated+0.031
spe+0.031
imm+0.031
lic+0.031
bel+0.030
Neg logit contributions
 Ben-0.011
 Rex-0.010
 Leo-0.010
 Remy-0.009
Ben-0.009
Token hook
Token ID8011
Feature activation-6.20e-01

Pos logit contributions
avi+0.009
Le+0.009
Ben+0.008
tail+0.008
Sun+0.008
Neg logit contributions
ounded-0.021
bel-0.021
 glimpse-0.021
 mant-0.020
lled-0.020
Token and
Token ID290
Feature activation+2.45e-01

Pos logit contributions
Ben+0.010
Le+0.009
avi+0.009
 might+0.009
 Rex+0.008
Neg logit contributions
bel-0.019
imm-0.019
bul-0.019
 mant-0.018
rated-0.018
Token anx
Token ID7296
Feature activation+4.47e+00

Pos logit contributions
bel+0.006
imm+0.006
rated+0.006
 mant+0.006
lic+0.006
Neg logit contributions
Ben-0.005
 might-0.005
Le-0.005
avi-0.005
 Rex-0.005
Tokeniously
Token ID6819
Feature activation+2.21e-01

Pos logit contributions
lic+0.073
bel+0.072
imm+0.065
taker+0.064
bul+0.064
Neg logit contributions
avi-0.136
Ben-0.135
 might-0.133
Le-0.130
 Leo-0.129
Token put
Token ID1234
Feature activation+4.13e-01

Pos logit contributions
bel+0.006
imm+0.006
rated+0.006
 mant+0.006
lic+0.006
Neg logit contributions
Ben-0.004
 might-0.004
Le-0.004
avi-0.004
 Rex-0.004
Token it
Token ID340
Feature activation+5.20e-01

Pos logit contributions
bel+0.014
imm+0.014
rated+0.014
 mant+0.014
bul+0.014
Neg logit contributions
Ben-0.005
Le-0.005
avi-0.005
 might-0.005
 Rex-0.004
Token in
Token ID287
Feature activation+2.49e-01

Pos logit contributions
rated+0.013
hr+0.013
lic+0.013
spe+0.013
bel+0.013
Neg logit contributions
 might-0.011
Ben-0.011
 will-0.011
Le-0.011
 Rex-0.011
Token the
Token ID262
Feature activation-6.18e-01

Pos logit contributions
bel+0.007
imm+0.007
rated+0.007
lic+0.007
 mant+0.007
Neg logit contributions
Ben-0.005
 might-0.005
Le-0.004
 Rex-0.004
avi-0.004
Token dried
Token ID16577
Feature activation+9.36e-01

Pos logit contributions
bel+0.056
imm+0.055
rated+0.055
 mant+0.054
bul+0.054
Neg logit contributions
Ben-0.040
 might-0.038
Le-0.037
avi-0.037
 Rex-0.036
Token his
Token ID465
Feature activation+5.36e-01

Pos logit contributions
lic+0.024
bel+0.024
imm+0.024
rated+0.024
ounded+0.024
Neg logit contributions
Ben-0.019
 Rex-0.019
 Leo-0.018
avi-0.018
 Remy-0.018
Token hands
Token ID2832
Feature activation-2.19e-01

Pos logit contributions
imm+0.015
lic+0.015
rated+0.015
shadow+0.015
bel+0.015
Neg logit contributions
 might-0.010
Ben-0.010
 Rex-0.009
 will-0.009
avi-0.009
Token and
Token ID290
Feature activation-3.56e-02

Pos logit contributions
Ben+0.003
Le+0.003
avi+0.003
 might+0.003
 Rex+0.003
Neg logit contributions
bel-0.007
imm-0.007
 mant-0.007
rated-0.007
bul-0.007
Token followed
Token ID3940
Feature activation+8.95e-01

Pos logit contributions
Ben+0.001
 might+0.001
Le+0.001
avi+0.001
 Rex+0.001
Neg logit contributions
bel-0.001
imm-0.001
rated-0.001
 mant-0.001
lic-0.001
Token Jol
Token ID39329
Feature activation+4.45e+00

Pos logit contributions
bel+0.025
imm+0.024
rated+0.024
 mant+0.024
bul+0.024
Neg logit contributions
Ben-0.017
 might-0.016
Le-0.016
avi-0.016
 will-0.015
Tokenene
Token ID1734
Feature activation+1.96e+00

Pos logit contributions
bel+0.095
lic+0.095
 mant+0.093
rated+0.091
eway+0.090
Neg logit contributions
Ben-0.109
avi-0.107
 might-0.104
itch-0.103
 Rex-0.100
Token.
Token ID13
Feature activation+2.59e-01

Pos logit contributions
bel+0.051
imm+0.051
rated+0.050
bul+0.049
lic+0.049
Neg logit contributions
Ben-0.040
 might-0.039
Le-0.038
avi-0.037
 Rex-0.037
Token When
Token ID1649
Feature activation+1.41e-01

Pos logit contributions
bel+0.008
imm+0.008
rated+0.008
lic+0.008
 mant+0.008
Neg logit contributions
Ben-0.004
 might-0.003
Le-0.003
avi-0.003
 will-0.003
Token Jol
Token ID39329
Feature activation+3.31e+00

Pos logit contributions
bel+0.004
 Sus+0.004
bul+0.004
spe+0.004
lic+0.004
Neg logit contributions
Ben-0.003
Le-0.003
avi-0.003
 might-0.003
 Leo-0.003
Tokenene
Token ID1734
Feature activation+7.63e-01

Pos logit contributions
bel+0.074
 mant+0.073
lic+0.072
rated+0.070
eway+0.070
Neg logit contributions
Ben-0.082
 might-0.077
avi-0.076
 Rex-0.075
itch-0.074
Token ever
Token ID1683
Feature activation+6.28e-01

Pos logit contributions
bel+0.003
 Sus+0.003
spe+0.003
imm+0.003
taker+0.003
Neg logit contributions
Ben-0.003
avi-0.003
Le-0.003
 might-0.003
 Rex-0.002
Token and
Token ID290
Feature activation+5.66e-01

Pos logit contributions
bel+0.018
imm+0.018
 mant+0.018
bul+0.017
lic+0.017
Neg logit contributions
Ben-0.011
 might-0.011
Le-0.011
avi-0.011
 will-0.010
Token he
Token ID339
Feature activation+3.35e-01

Pos logit contributions
bel+0.017
imm+0.017
rated+0.017
 mant+0.016
lic+0.016
Neg logit contributions
Ben-0.009
 might-0.009
Le-0.009
avi-0.009
 Rex-0.008
Token thanked
Token ID26280
Feature activation+1.02e+00

Pos logit contributions
bel+0.008
spe+0.008
imm+0.008
rated+0.008
lled+0.008
Neg logit contributions
Ben-0.007
Le-0.007
avi-0.007
 might-0.007
 Rex-0.007
Token the
Token ID262
Feature activation+4.35e-01

Pos logit contributions
 Sus+0.026
 mant+0.024
bul+0.023
bel+0.023
taker+0.023
Neg logit contributions
Ben-0.023
 might-0.022
Le-0.022
ars-0.021
avi-0.021
Token mer
Token ID4017
Feature activation+4.44e+00

Pos logit contributions
imm+0.008
rated+0.008
bel+0.007
shadow+0.007
bul+0.007
Neg logit contributions
Ben-0.013
 might-0.013
Le-0.013
 will-0.012
avi-0.012
Tokenmaid
Token ID23151
Feature activation+5.10e-01

Pos logit contributions
shadow+0.094
 causing+0.092
rated+0.090
 mant+0.088
bul+0.087
Neg logit contributions
avi-0.110
Ben-0.102
 might-0.102
 will-0.100
itch-0.098
Token for
Token ID329
Feature activation-5.97e-01

Pos logit contributions
bel+0.013
imm+0.013
rated+0.013
 mant+0.013
bul+0.013
Neg logit contributions
Ben-0.011
 might-0.010
Le-0.010
avi-0.010
 Rex-0.009
Token being
Token ID852
Feature activation+1.53e+00

Pos logit contributions
Ben+0.015
 might+0.014
Le+0.014
avi+0.014
 Rex+0.014
Neg logit contributions
bel-0.013
imm-0.013
rated-0.013
lic-0.012
 mant-0.012
Token so
Token ID523
Feature activation+2.27e-02

Pos logit contributions
rated+0.038
bel+0.036
 mant+0.035
eway+0.035
ounded+0.035
Neg logit contributions
Ben-0.036
 might-0.035
 Rex-0.035
 Leo-0.035
 Rover-0.034
Token kind
Token ID1611
Feature activation-8.87e-01

Pos logit contributions
bel+0.001
spe+0.001
lic+0.001
taker+0.001
 mant+0.001
Neg logit contributions
Ben-0.001
Le-0.001
avi-0.001
 might-0.001
 Rex-0.000
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
Le+0.034
Ben+0.034
avi+0.034
itch+0.030
oxy+0.030
Neg logit contributions
bel-0.044
 happening-0.044
lling-0.043
 handing-0.043
lic-0.043
Token cats
Token ID11875
Feature activation-1.34e+00

Pos logit contributions
Ben+0.019
avi+0.018
Le+0.018
itch+0.017
 might+0.017
Neg logit contributions
bel-0.028
lic-0.027
spe-0.026
taker-0.026
 mant-0.026
Token,
Token ID11
Feature activation-1.51e+00

Pos logit contributions
Ben+0.027
Le+0.026
avi+0.025
itch+0.024
oxy+0.023
Neg logit contributions
 handing-0.034
 happening-0.034
lic-0.033
 causing-0.033
bel-0.033
Token birds
Token ID10087
Feature activation-2.79e+00

Pos logit contributions
Ben+0.026
itch+0.025
Le+0.025
avi+0.025
oxy+0.022
Neg logit contributions
bel-0.044
 causing-0.041
lic-0.041
lled-0.040
bul-0.040
Token,
Token ID11
Feature activation-1.89e+00

Pos logit contributions
Ben+0.060
Le+0.058
avi+0.055
itch+0.052
 might+0.050
Neg logit contributions
mill-0.070
lic-0.068
imm-0.068
 handing-0.068
bel-0.068
Token ham
Token ID8891
Feature activation-5.73e+00

Pos logit contributions
Ben+0.033
itch+0.033
Le+0.031
avi+0.031
oxy+0.027
Neg logit contributions
bel-0.055
lled-0.051
lic-0.051
 causing-0.051
bul-0.051
Tokensters
Token ID5937
Feature activation-2.99e+00

Pos logit contributions
Ben+0.115
Le+0.111
 might+0.109
athy+0.104
tail+0.101
Neg logit contributions
bel-0.145
imm-0.140
lic-0.140
mill-0.134
 handing-0.133
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
Ben+0.059
Le+0.057
avi+0.053
 might+0.049
 Rex+0.049
Neg logit contributions
bel-0.080
lic-0.080
imm-0.079
 handing-0.077
lling-0.077
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
Ben+0.038
Le+0.036
itch+0.034
avi+0.034
 might+0.032
Neg logit contributions
bel-0.067
 mant-0.062
bul-0.062
 handing-0.062
lic-0.062
Token more
Token ID517
Feature activation-1.35e+00

Pos logit contributions
Ben+0.023
avi+0.022
Le+0.022
 might+0.021
itch+0.020
Neg logit contributions
bel-0.032
bul-0.031
 mant-0.031
spe-0.030
lic-0.030
Token.
Token ID13
Feature activation-9.79e-01

Pos logit contributions
Ben+0.025
Le+0.024
 might+0.023
avi+0.023
 will+0.022
Neg logit contributions
bel-0.039
lic-0.037
rated-0.037
imm-0.037
 mant-0.037
Token,
Token ID11
Feature activation-5.50e-01

Pos logit contributions
Le+0.034
Ben+0.033
avi+0.033
oxy+0.030
itch+0.030
Neg logit contributions
 handing-0.039
 happening-0.038
 Sus-0.038
mill-0.038
 causing-0.037
Token cats
Token ID11875
Feature activation-1.72e+00

Pos logit contributions
Ben+0.009
avi+0.009
Le+0.009
 might+0.009
 will+0.008
Neg logit contributions
bel-0.016
lic-0.015
 Sus-0.015
spe-0.015
taker-0.015
Token,
Token ID11
Feature activation-1.31e+00

Pos logit contributions
Ben+0.038
Le+0.038
avi+0.037
itch+0.035
oxy+0.034
Neg logit contributions
 handing-0.040
 Sus-0.039
 happening-0.039
mill-0.038
 causing-0.038
Token birds
Token ID10087
Feature activation-3.19e+00

Pos logit contributions
avi+0.023
Ben+0.022
itch+0.022
Le+0.022
ars+0.019
Neg logit contributions
bel-0.037
lic-0.035
 Sus-0.034
ount-0.034
lled-0.034
Token,
Token ID11
Feature activation-1.66e+00

Pos logit contributions
Ben+0.078
Le+0.077
avi+0.073
itch+0.070
 cannot+0.066
Neg logit contributions
mill-0.069
 handing-0.068
lic-0.066
 Sus-0.065
bul-0.064
Token fish
Token ID5916
Feature activation-5.16e+00

Pos logit contributions
itch+0.028
Ben+0.028
avi+0.028
Le+0.027
 might+0.024
Neg logit contributions
bel-0.047
lic-0.045
lled-0.044
lling-0.043
bul-0.043
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
Ben+0.117
Le+0.117
avi+0.115
itch+0.106
Hug+0.105
Neg logit contributions
lic-0.114
 handing-0.107
imm-0.107
ount-0.106
bel-0.105
Token and
Token ID290
Feature activation-5.95e-01

Pos logit contributions
itch+0.031
Ben+0.031
Le+0.031
avi+0.030
 might+0.027
Neg logit contributions
bel-0.050
lic-0.046
lled-0.045
 handing-0.045
bul-0.045
Token rabbits
Token ID33043
Feature activation-1.66e+00

Pos logit contributions
Ben+0.011
avi+0.011
Le+0.010
 might+0.010
itch+0.010
Neg logit contributions
bel-0.017
bul-0.016
lic-0.016
spe-0.016
 mant-0.016
Token.
Token ID13
Feature activation-2.86e-01

Pos logit contributions
Ben+0.038
Le+0.036
avi+0.036
 might+0.034
itch+0.032
Neg logit contributions
bel-0.037
spe-0.037
taker-0.037
bul-0.036
mill-0.036
Token But
Token ID887
Feature activation+1.91e-01

Pos logit contributions
Le+0.003
avi+0.003
Ben+0.003
 might+0.003
 Remy+0.003
Neg logit contributions
bel-0.010
lic-0.009
 Sus-0.009
spe-0.009
 mant-0.009
Token see
Token ID766
Feature activation-4.16e-01

Pos logit contributions
Le+0.030
oxy+0.029
Ben+0.028
avi+0.027
 Remy+0.027
Neg logit contributions
bel-0.044
 mant-0.043
lled-0.042
taker-0.042
lling-0.042
Token shells
Token ID19679
Feature activation-2.89e+00

Pos logit contributions
Ben+0.008
Le+0.008
 might+0.008
avi+0.008
 will+0.007
Neg logit contributions
bel-0.011
 mant-0.011
bul-0.011
lled-0.011
spe-0.010
Token,
Token ID11
Feature activation-6.16e-01

Pos logit contributions
Ben+0.054
Le+0.053
avi+0.051
 might+0.047
oxy+0.046
Neg logit contributions
bel-0.081
imm-0.079
bul-0.078
lled-0.077
lic-0.077
Token rocks
Token ID12586
Feature activation-3.49e+00

Pos logit contributions
Ben+0.014
 might+0.013
Le+0.013
avi+0.013
 will+0.012
Neg logit contributions
bel-0.015
 mant-0.014
bul-0.014
imm-0.014
lic-0.014
Token,
Token ID11
Feature activation-1.56e+00

Pos logit contributions
Ben+0.067
Le+0.066
avi+0.063
 might+0.058
oxy+0.057
Neg logit contributions
bel-0.095
imm-0.093
bul-0.093
mill-0.092
lled-0.091
Token seaw
Token ID32085
Feature activation-4.85e+00

Pos logit contributions
Ben+0.032
Le+0.032
 might+0.031
avi+0.031
 Rex+0.029
Neg logit contributions
bel-0.040
 mant-0.038
bul-0.038
lled-0.038
lic-0.037
Tokeneed
Token ID2308
Feature activation-4.43e+00

Pos logit contributions
 might+0.097
 Rex+0.093
Le+0.093
Ben+0.090
 Leo+0.090
Neg logit contributions
lic-0.128
imm-0.127
bel-0.120
 mant-0.119
ount-0.116
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
Ben+0.086
Le+0.085
avi+0.078
 might+0.075
 vet+0.074
Neg logit contributions
bel-0.115
imm-0.115
lic-0.114
mill-0.113
bul-0.112
Token crabs
Token ID49172
Feature activation-4.26e+00

Pos logit contributions
Ben+0.038
Le+0.037
 might+0.035
avi+0.035
 Rex+0.033
Neg logit contributions
bel-0.068
 mant-0.065
bul-0.064
lled-0.063
imm-0.063
Token and
Token ID290
Feature activation-4.53e-01

Pos logit contributions
Ben+0.091
Le+0.091
avi+0.086
 vet+0.083
 might+0.083
Neg logit contributions
 handing-0.095
mill-0.094
lic-0.093
imm-0.092
bel-0.091
Token fish
Token ID5916
Feature activation-3.99e+00

Pos logit contributions
Ben+0.009
Le+0.009
 might+0.009
avi+0.009
 Rex+0.008
Neg logit contributions
bel-0.012
 mant-0.011
bul-0.011
rated-0.011
lic-0.011
Token,
Token ID11
Feature activation-9.49e-01

Pos logit contributions
Le+0.070
Ben+0.068
avi+0.064
itch+0.059
 Rex+0.058
Neg logit contributions
imm-0.087
bel-0.085
 Sus-0.083
mill-0.083
bul-0.083
Token vegetables
Token ID13701
Feature activation-3.44e+00

Pos logit contributions
Ben+0.020
Le+0.020
avi+0.019
 might+0.018
 Rex+0.018
Neg logit contributions
bel-0.024
 mant-0.023
 Sus-0.022
bul-0.022
rated-0.022
Token,
Token ID11
Feature activation-1.56e+00

Pos logit contributions
Le+0.071
Ben+0.070
avi+0.064
 Rex+0.061
 Leo+0.058
Neg logit contributions
bel-0.088
imm-0.087
 handing-0.086
 mant-0.085
 happening-0.085
Token bread
Token ID8509
Feature activation-4.42e+00

Pos logit contributions
Ben+0.030
Le+0.030
avi+0.028
itch+0.027
 Rex+0.027
Neg logit contributions
bel-0.043
 mant-0.040
lled-0.039
 Sus-0.039
bul-0.039
Token,
Token ID11
Feature activation-2.21e+00

Pos logit contributions
Ben+0.084
Le+0.083
avi+0.077
 Rex+0.071
 might+0.071
Neg logit contributions
bel-0.121
imm-0.120
lic-0.118
 mant-0.115
 handing-0.114
Token cheese
Token ID9891
Feature activation-4.79e+00

Pos logit contributions
Le+0.045
Ben+0.045
itch+0.042
 Rex+0.042
avi+0.041
Neg logit contributions
bel-0.057
 mant-0.054
 handing-0.052
lled-0.051
bul-0.050
Token,
Token ID11
Feature activation-2.36e+00

Pos logit contributions
Ben+0.095
Le+0.093
avi+0.089
 might+0.083
 Rex+0.082
Neg logit contributions
bel-0.119
 handing-0.119
imm-0.117
 mant-0.116
lic-0.115
Token and
Token ID290
Feature activation-2.10e-01

Pos logit contributions
Le+0.046
Ben+0.045
itch+0.043
 Rex+0.042
avi+0.042
Neg logit contributions
bel-0.062
 mant-0.059
 handing-0.058
lled-0.056
rated-0.055
Token toys
Token ID14958
Feature activation-1.07e+00

Pos logit contributions
Ben+0.005
Le+0.004
avi+0.004
 might+0.004
 Rex+0.004
Neg logit contributions
bel-0.005
 mant-0.005
bul-0.005
spe-0.005
 Sus-0.005
Token.
Token ID13
Feature activation-5.60e-01

Pos logit contributions
Ben+0.022
Le+0.021
avi+0.020
 might+0.020
 Rex+0.019
Neg logit contributions
bel-0.028
 mant-0.027
bul-0.027
imm-0.027
spe-0.027
Token Lily
Token ID20037
Feature activation-7.66e-01

Pos logit contributions
Ben+0.008
avi+0.008
Le+0.008
 might+0.008
 will+0.007
Neg logit contributions
bel-0.017
lic-0.017
imm-0.017
 mant-0.017
 Sus-0.017
Token.
Token ID13
Feature activation-4.74e-01

Pos logit contributions
Ben+0.019
avi+0.019
Le+0.019
 might+0.018
 Rex+0.016
Neg logit contributions
bel-0.032
imm-0.032
taker-0.031
bul-0.031
 mant-0.031
Token They
Token ID1119
Feature activation-4.42e-01

Pos logit contributions
avi+0.008
Le+0.008
 might+0.007
ars+0.007
 will+0.007
Neg logit contributions
 Sus-0.014
lic-0.014
bel-0.014
taker-0.013
 handing-0.013
Token were
Token ID547
Feature activation-6.75e-01

Pos logit contributions
Le+0.010
Ben+0.010
avi+0.010
 might+0.010
 Remy+0.009
Neg logit contributions
bel-0.010
lled-0.010
lic-0.009
taker-0.009
bul-0.009
Token cheese
Token ID9891
Feature activation-3.28e+00

Pos logit contributions
Ben+0.014
 might+0.014
Le+0.013
 Rex+0.013
avi+0.013
Neg logit contributions
bel-0.018
rated-0.017
imm-0.017
bul-0.017
spe-0.017
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
Ben+0.068
avi+0.067
Le+0.066
 might+0.064
Hug+0.062
Neg logit contributions
imm-0.076
 handing-0.075
bel-0.075
mill-0.074
lic-0.072
Token ham
Token ID8891
Feature activation-4.79e+00

Pos logit contributions
avi+0.020
Ben+0.019
Le+0.018
itch+0.018
 might+0.018
Neg logit contributions
bel-0.030
 mant-0.029
bul-0.028
 handing-0.028
lic-0.028
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
Ben+0.104
athy+0.100
Le+0.099
 might+0.097
avi+0.090
Neg logit contributions
bel-0.110
imm-0.108
 handing-0.105
lic-0.103
mill-0.101
Token lettuce
Token ID39406
Feature activation-3.02e+00

Pos logit contributions
avi+0.020
Ben+0.020
Le+0.019
 might+0.018
itch+0.018
Neg logit contributions
bel-0.034
bul-0.033
 mant-0.033
 handing-0.033
 causing-0.031
Token.
Token ID13
Feature activation-5.64e-01

Pos logit contributions
Ben+0.063
Le+0.061
avi+0.058
 Rex+0.053
athy+0.053
Neg logit contributions
bel-0.078
imm-0.075
 handing-0.074
lled-0.073
mill-0.073
Token They
Token ID1119
Feature activation-8.59e-01

Pos logit contributions
avi+0.009
 might+0.009
Le+0.009
ars+0.008
 will+0.008
Neg logit contributions
 Sus-0.017
bel-0.016
lic-0.016
taker-0.016
 handing-0.015
Token were
Token ID547
Feature activation-1.11e+00

Pos logit contributions
Ben+0.021
avi+0.020
Le+0.020
 might+0.020
 Rex+0.019
Neg logit contributions
bel-0.019
lled-0.018
lic-0.018
bul-0.017
taker-0.017
Token saw
Token ID2497
Feature activation-1.73e-01

Pos logit contributions
Le+0.029
itch+0.029
Ben+0.028
avi+0.028
 Rex+0.027
Neg logit contributions
bel-0.031
spe-0.029
 Sus-0.029
lled-0.028
lic-0.028
Token fruits
Token ID15921
Feature activation-3.46e+00

Pos logit contributions
Ben+0.003
Le+0.003
avi+0.003
 might+0.003
 Remy+0.003
Neg logit contributions
bel-0.005
lled-0.004
lic-0.004
spe-0.004
 Sus-0.004
Token,
Token ID11
Feature activation-5.95e-01

Pos logit contributions
Le+0.073
Ben+0.072
avi+0.070
itch+0.063
oxy+0.062
Neg logit contributions
imm-0.089
bel-0.089
lic-0.086
 handing-0.084
bul-0.084
Token flowers
Token ID12734
Feature activation-3.45e+00

Pos logit contributions
Ben+0.013
Le+0.012
avi+0.012
 might+0.012
 Rex+0.012
Neg logit contributions
bel-0.015
lic-0.014
 mant-0.014
lled-0.014
rated-0.014
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
Ben+0.071
Le+0.070
avi+0.068
itch+0.064
 Rex+0.062
Neg logit contributions
bel-0.088
imm-0.087
lic-0.087
 handing-0.086
bul-0.085
Token bread
Token ID8509
Feature activation-4.76e+00

Pos logit contributions
Ben+0.027
Le+0.026
avi+0.025
itch+0.025
 might+0.025
Neg logit contributions
bel-0.033
lled-0.030
 mant-0.030
lic-0.030
lling-0.030
Token,
Token ID11
Feature activation-2.08e+00

Pos logit contributions
Ben+0.098
Le+0.098
avi+0.092
itch+0.085
 Rex+0.083
Neg logit contributions
bel-0.120
imm-0.119
lic-0.119
 handing-0.114
 mant-0.112
Token and
Token ID290
Feature activation-1.73e-01

Pos logit contributions
Ben+0.040
Le+0.040
itch+0.039
avi+0.037
 Rex+0.037
Neg logit contributions
bel-0.055
 mant-0.051
lled-0.050
 handing-0.050
lic-0.050
Token cheese
Token ID9891
Feature activation-3.30e+00

Pos logit contributions
Ben+0.004
Le+0.004
avi+0.003
 might+0.003
 Rex+0.003
Neg logit contributions
bel-0.004
 mant-0.004
lic-0.004
bul-0.004
spe-0.004
Token.
Token ID13
Feature activation-4.72e-02

Pos logit contributions
Ben+0.073
Le+0.072
avi+0.069
 might+0.067
 Rex+0.064
Neg logit contributions
bel-0.076
imm-0.074
lic-0.073
 handing-0.072
mill-0.072
Token L
Token ID406
Feature activation-1.36e+00

Pos logit contributions
Ben+0.001
Le+0.001
 might+0.001
avi+0.001
 will+0.001
Neg logit contributions
bel-0.001
lic-0.001
imm-0.001
 mant-0.001
rated-0.001
Token saw
Token ID2497
Feature activation+4.76e-02

Pos logit contributions
Ben+0.018
Le+0.017
avi+0.017
itch+0.016
 Remy+0.016
Neg logit contributions
bel-0.019
spe-0.017
imm-0.017
lic-0.017
lled-0.017
Token trees
Token ID7150
Feature activation-2.35e+00

Pos logit contributions
bel+0.001
imm+0.001
rated+0.001
 mant+0.001
bul+0.001
Neg logit contributions
Ben-0.001
 might-0.001
Le-0.001
avi-0.001
 Rex-0.001
Token,
Token ID11
Feature activation-7.72e-01

Pos logit contributions
Ben+0.046
avi+0.042
Le+0.042
itch+0.040
oxy+0.039
Neg logit contributions
bel-0.066
bul-0.065
imm-0.064
lic-0.064
mill-0.064
Token birds
Token ID10087
Feature activation-2.45e+00

Pos logit contributions
Ben+0.013
 might+0.012
Le+0.012
avi+0.012
 will+0.011
Neg logit contributions
bel-0.024
imm-0.023
rated-0.023
lic-0.023
 mant-0.023
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
Ben+0.054
Le+0.050
avi+0.049
itch+0.048
 might+0.046
Neg logit contributions
mill-0.062
bul-0.060
lling-0.060
lic-0.060
imm-0.059
Token fish
Token ID5916
Feature activation-4.67e+00

Pos logit contributions
Ben+0.026
itch+0.025
avi+0.025
Le+0.024
 might+0.024
Neg logit contributions
bel-0.050
lled-0.048
lic-0.048
bul-0.048
lling-0.048
Token and
Token ID290
Feature activation-4.85e-01

Pos logit contributions
Ben+0.105
avi+0.098
Le+0.096
itch+0.094
 vet+0.092
Neg logit contributions
lic-0.111
imm-0.107
bel-0.105
mill-0.104
rol-0.103
Token lots
Token ID6041
Feature activation-9.17e-01

Pos logit contributions
Ben+0.009
avi+0.008
 might+0.008
Le+0.008
 will+0.007
Neg logit contributions
bel-0.014
bul-0.013
rated-0.013
lic-0.013
 mant-0.013
Token of
Token ID286
Feature activation-1.31e-01

Pos logit contributions
Ben+0.021
 might+0.021
Le+0.021
avi+0.020
 Rex+0.020
Neg logit contributions
bel-0.021
rated-0.021
imm-0.021
 mant-0.021
bul-0.020
Token animals
Token ID4695
Feature activation-8.09e-01

Pos logit contributions
Ben+0.003
avi+0.003
Le+0.003
itch+0.002
 might+0.002
Neg logit contributions
bel-0.003
lic-0.003
rated-0.003
lled-0.003
lling-0.003
Token.
Token ID13
Feature activation-3.58e-01

Pos logit contributions
Ben+0.018
Le+0.017
avi+0.017
 might+0.016
itch+0.015
Neg logit contributions
bel-0.020
lling-0.019
lic-0.019
spe-0.019
taker-0.019
Token,
Token ID11
Feature activation-3.46e-01

Pos logit contributions
Ben+0.058
Le+0.056
avi+0.056
 might+0.053
oxy+0.052
Neg logit contributions
bel-0.068
mill-0.066
lling-0.065
lic-0.065
imm-0.065
Token blue
Token ID4171
Feature activation-3.58e+00

Pos logit contributions
Ben+0.006
 might+0.006
Le+0.006
avi+0.006
 will+0.006
Neg logit contributions
bel-0.010
imm-0.010
rated-0.010
 mant-0.010
lic-0.010
Token fish
Token ID5916
Feature activation-4.05e+00

Pos logit contributions
Ben+0.060
Le+0.057
avi+0.055
 might+0.055
 will+0.051
Neg logit contributions
bel-0.104
imm-0.103
bul-0.102
 mant-0.102
lic-0.101
Token,
Token ID11
Feature activation-1.18e+00

Pos logit contributions
Ben+0.089
Le+0.087
avi+0.087
athy+0.082
oxy+0.081
Neg logit contributions
mill-0.093
 handing-0.090
imm-0.090
lic-0.090
bel-0.087
Token yellow
Token ID7872
Feature activation-3.73e+00

Pos logit contributions
Ben+0.020
avi+0.019
Le+0.019
 might+0.019
 will+0.018
Neg logit contributions
bel-0.034
bul-0.033
 mant-0.033
lic-0.033
imm-0.033
Token fish
Token ID5916
Feature activation-4.59e+00

Pos logit contributions
Ben+0.065
Le+0.061
avi+0.060
 might+0.059
 will+0.056
Neg logit contributions
bel-0.106
 mant-0.105
bul-0.104
imm-0.104
lic-0.102
Token and
Token ID290
Feature activation-3.98e-01

Pos logit contributions
Ben+0.103
avi+0.100
Le+0.099
athy+0.095
 might+0.092
Neg logit contributions
mill-0.100
 handing-0.099
lic-0.099
imm-0.098
 among-0.096
Token green
Token ID4077
Feature activation-2.15e+00

Pos logit contributions
Ben+0.007
Le+0.007
avi+0.007
 might+0.007
 will+0.006
Neg logit contributions
bel-0.011
 mant-0.011
lic-0.011
bul-0.011
rated-0.011
Token fish
Token ID5916
Feature activation-3.13e+00

Pos logit contributions
Ben+0.036
Le+0.034
avi+0.033
 might+0.033
 will+0.031
Neg logit contributions
bel-0.064
imm-0.062
bul-0.062
 mant-0.062
lic-0.061
Token.
Token ID13
Feature activation+3.99e-01

Pos logit contributions
Ben+0.071
avi+0.067
Le+0.066
 might+0.065
athy+0.063
Neg logit contributions
bel-0.072
lic-0.071
taker-0.069
imm-0.069
mill-0.069
Token They
Token ID1119
Feature activation-6.13e-01

Pos logit contributions
 Sus+0.011
bul+0.011
 mant+0.010
bel+0.010
 handing+0.010
Neg logit contributions
avi-0.008
Le-0.007
ars-0.007
 might-0.007
Ben-0.007
Token their
Token ID511
Feature activation+1.47e-01

Pos logit contributions
Ben+0.012
Le+0.011
avi+0.011
 might+0.011
 will+0.010
Neg logit contributions
bel-0.024
imm-0.023
lled-0.023
lic-0.023
bul-0.023
Token gear
Token ID7733
Feature activation-4.79e-01

Pos logit contributions
bel+0.005
imm+0.005
 mant+0.005
spe+0.004
bul+0.004
Neg logit contributions
Ben-0.002
Le-0.002
avi-0.002
 Leo-0.002
 Remy-0.002
Token:
Token ID25
Feature activation-3.87e-01

Pos logit contributions
Ben+0.008
 might+0.008
Le+0.008
avi+0.008
 Rex+0.007
Neg logit contributions
bel-0.014
imm-0.014
rated-0.014
 mant-0.014
lic-0.014
Token helmets
Token ID28359
Feature activation-1.74e+00

Pos logit contributions
avi+0.008
Ben+0.008
itch+0.008
Le+0.008
oxy+0.008
Neg logit contributions
spe-0.009
bel-0.009
 Sus-0.009
mill-0.009
lic-0.008
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
Ben+0.029
Le+0.028
avi+0.026
 Remy+0.025
 Leo+0.024
Neg logit contributions
bel-0.054
imm-0.052
bul-0.052
rated-0.052
spe-0.052
Token pads
Token ID21226
Feature activation-4.57e+00

Pos logit contributions
Ben+0.029
Le+0.029
avi+0.028
itch+0.027
 might+0.025
Neg logit contributions
bel-0.042
bul-0.039
 causing-0.037
 mant-0.036
lic-0.036
Token and
Token ID290
Feature activation-7.61e-01

Pos logit contributions
Le+0.090
Ben+0.088
avi+0.087
 Leo+0.081
 Remy+0.080
Neg logit contributions
bel-0.126
taker-0.116
mill-0.115
bul-0.114
rol-0.114
Token gloves
Token ID18051
Feature activation-2.28e+00

Pos logit contributions
Ben+0.014
Le+0.014
avi+0.013
 might+0.012
itch+0.012
Neg logit contributions
bel-0.022
bul-0.021
spe-0.020
 mant-0.020
rated-0.020
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
Ben+0.045
Le+0.044
avi+0.042
 Leo+0.040
 Remy+0.040
Neg logit contributions
bel-0.064
bul-0.060
imm-0.060
taker-0.060
spe-0.060
Token They
Token ID1119
Feature activation-8.68e-01

Pos logit contributions
avi+0.002
 might+0.002
Le+0.002
Ben+0.002
 will+0.002
Neg logit contributions
 Sus-0.005
lic-0.004
ount-0.004
bel-0.004
 mant-0.004
Token knew
Token ID2993
Feature activation-1.36e-01

Pos logit contributions
Le+0.020
avi+0.020
Ben+0.019
 Remy+0.019
 Leo+0.019
Neg logit contributions
bel-0.021
spe-0.020
lled-0.020
taker-0.019
ounded-0.019
Token kept
Token ID4030
Feature activation-1.33e+00

Pos logit contributions
Ben+0.021
avi+0.021
Le+0.019
itch+0.019
 Rex+0.019
Neg logit contributions
bel-0.021
imm-0.021
spe-0.020
rated-0.020
lled-0.020
Token rocks
Token ID12586
Feature activation-2.49e+00

Pos logit contributions
Ben+0.036
avi+0.034
 might+0.034
Le+0.034
 will+0.032
Neg logit contributions
bel-0.027
 mant-0.026
bul-0.026
lled-0.025
lling-0.025
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
Ben+0.052
avi+0.049
Le+0.048
 might+0.045
 Rex+0.045
Neg logit contributions
bel-0.069
imm-0.067
bul-0.065
taker-0.065
lic-0.063
Token leaves
Token ID5667
Feature activation-3.30e+00

Pos logit contributions
Ben+0.019
avi+0.018
Le+0.018
 might+0.018
 Rex+0.017
Neg logit contributions
bel-0.030
imm-0.029
rated-0.029
 mant-0.029
lled-0.029
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
Ben+0.060
avi+0.055
Le+0.053
 Rex+0.050
 might+0.049
Neg logit contributions
bel-0.099
imm-0.098
bul-0.095
taker-0.093
lic-0.093
Token grass
Token ID8701
Feature activation-4.45e+00

Pos logit contributions
Ben+0.019
avi+0.018
Le+0.017
 might+0.017
 Rex+0.016
Neg logit contributions
bel-0.045
imm-0.044
lled-0.043
 mant-0.043
bul-0.043
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
Ben+0.089
avi+0.081
Le+0.079
oxy+0.074
 vet+0.073
Neg logit contributions
bel-0.125
imm-0.123
bul-0.116
mill-0.116
lic-0.115
Token and
Token ID290
Feature activation-1.26e-01

Pos logit contributions
Ben+0.022
avi+0.020
Le+0.020
 might+0.019
 Rex+0.018
Neg logit contributions
bel-0.051
imm-0.050
 mant-0.049
bul-0.049
rated-0.049
Token so
Token ID523
Feature activation-1.05e-01

Pos logit contributions
Ben+0.003
avi+0.002
Le+0.002
 might+0.002
 Rex+0.002
Neg logit contributions
bel-0.003
 mant-0.003
rated-0.003
bul-0.003
imm-0.003
Token much
Token ID881
Feature activation-9.88e-01

Pos logit contributions
Ben+0.002
avi+0.002
Le+0.002
oxy+0.002
 might+0.002
Neg logit contributions
bel-0.003
spe-0.003
taker-0.003
 Sus-0.003
rated-0.003
Token more
Token ID517
Feature activation-9.82e-01

Pos logit contributions
Ben+0.021
avi+0.020
Le+0.019
 might+0.019
 Rex+0.018
Neg logit contributions
bel-0.026
bul-0.025
imm-0.025
lic-0.025
rated-0.025
Token shells
Token ID19679
Feature activation-2.89e+00

Pos logit contributions
Ben+0.008
Le+0.008
 might+0.008
avi+0.008
 will+0.007
Neg logit contributions
bel-0.011
 mant-0.011
bul-0.011
lled-0.011
spe-0.010
Token,
Token ID11
Feature activation-6.16e-01

Pos logit contributions
Ben+0.054
Le+0.053
avi+0.051
 might+0.047
oxy+0.046
Neg logit contributions
bel-0.081
imm-0.079
bul-0.078
lled-0.077
lic-0.077
Token rocks
Token ID12586
Feature activation-3.49e+00

Pos logit contributions
Ben+0.014
 might+0.013
Le+0.013
avi+0.013
 will+0.012
Neg logit contributions
bel-0.015
 mant-0.014
bul-0.014
imm-0.014
lic-0.014
Token,
Token ID11
Feature activation-1.56e+00

Pos logit contributions
Ben+0.067
Le+0.066
avi+0.063
 might+0.058
oxy+0.057
Neg logit contributions
bel-0.095
imm-0.093
bul-0.093
mill-0.092
lled-0.091
Token seaw
Token ID32085
Feature activation-4.85e+00

Pos logit contributions
Ben+0.032
Le+0.032
 might+0.031
avi+0.031
 Rex+0.029
Neg logit contributions
bel-0.040
 mant-0.038
bul-0.038
lled-0.038
lic-0.037
Tokeneed
Token ID2308
Feature activation-4.43e+00

Pos logit contributions
 might+0.097
 Rex+0.093
Le+0.093
Ben+0.090
 Leo+0.090
Neg logit contributions
lic-0.128
imm-0.127
bel-0.120
 mant-0.119
ount-0.116
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
Ben+0.086
Le+0.085
avi+0.078
 might+0.075
 vet+0.074
Neg logit contributions
bel-0.115
imm-0.115
lic-0.114
mill-0.113
bul-0.112
Token crabs
Token ID49172
Feature activation-4.26e+00

Pos logit contributions
Ben+0.038
Le+0.037
 might+0.035
avi+0.035
 Rex+0.033
Neg logit contributions
bel-0.068
 mant-0.065
bul-0.064
lled-0.063
imm-0.063
Token and
Token ID290
Feature activation-4.53e-01

Pos logit contributions
Ben+0.091
Le+0.091
avi+0.086
 vet+0.083
 might+0.083
Neg logit contributions
 handing-0.095
mill-0.094
lic-0.093
imm-0.092
bel-0.091
Token fish
Token ID5916
Feature activation-3.99e+00

Pos logit contributions
Ben+0.009
Le+0.009
 might+0.009
avi+0.009
 Rex+0.008
Neg logit contributions
bel-0.012
 mant-0.011
bul-0.011
rated-0.011
lic-0.011
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
Ben+0.087
Le+0.084
avi+0.083
 might+0.079
 Rover+0.078
Neg logit contributions
bel-0.096
lic-0.092
imm-0.090
ount-0.089
mill-0.089
Token see
Token ID766
Feature activation-9.74e-01

Pos logit contributions
oxy+0.042
avi+0.036
 vet+0.036
itch+0.036
Le+0.035
Neg logit contributions
bel-0.039
 contain-0.039
ounded-0.037
 Sus-0.037
rated-0.037
Token birds
Token ID10087
Feature activation-2.55e+00

Pos logit contributions
avi+0.023
Ben+0.022
Le+0.022
itch+0.021
oxy+0.021
Neg logit contributions
bel-0.021
lled-0.020
 causing-0.020
 Sus-0.020
lic-0.020
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
Ben+0.055
Le+0.053
avi+0.053
itch+0.051
oxy+0.049
Neg logit contributions
mill-0.062
 happening-0.060
lic-0.060
lling-0.059
 handing-0.059
Token flowers
Token ID12734
Feature activation-3.69e+00

Pos logit contributions
avi+0.018
Ben+0.017
itch+0.017
 might+0.017
Le+0.016
Neg logit contributions
bel-0.032
lic-0.031
lled-0.029
rated-0.029
lling-0.029
Token,
Token ID11
Feature activation-1.47e+00

Pos logit contributions
Ben+0.083
avi+0.082
itch+0.081
Le+0.080
oxy+0.079
Neg logit contributions
lic-0.081
 handing-0.080
mill-0.080
imm-0.078
bel-0.078
Token trees
Token ID7150
Feature activation-4.42e+00

Pos logit contributions
itch+0.026
Ben+0.026
avi+0.025
Le+0.025
 might+0.024
Neg logit contributions
bel-0.041
lic-0.039
lled-0.038
rated-0.038
lling-0.038
Token and
Token ID290
Feature activation-7.31e-01

Pos logit contributions
itch+0.107
Ben+0.104
avi+0.103
Le+0.098
oxy+0.097
Neg logit contributions
mill-0.093
 handing-0.090
lic-0.087
 among-0.086
bul-0.085
Token a
Token ID257
Feature activation-2.08e+00

Pos logit contributions
Ben+0.013
avi+0.012
Le+0.012
 might+0.012
itch+0.011
Neg logit contributions
bel-0.021
rated-0.020
lic-0.020
lled-0.020
 mant-0.020
Token big
Token ID1263
Feature activation-1.07e+00

Pos logit contributions
avi+0.049
itch+0.046
Le+0.045
 might+0.045
 cannot+0.045
Neg logit contributions
 glimpse-0.046
lic-0.045
bel-0.044
spe-0.043
 mant-0.042
Token pond
Token ID16723
Feature activation-2.18e+00

Pos logit contributions
Le+0.023
avi+0.022
Ben+0.022
oxy+0.021
itch+0.020
Neg logit contributions
 Sus-0.026
 glimpse-0.026
 mant-0.026
rated-0.025
bul-0.025
Token.
Token ID13
Feature activation-3.88e-01

Pos logit contributions
Ben+0.041
Le+0.040
avi+0.039
 might+0.039
 Rex+0.037
Neg logit contributions
bel-0.060
imm-0.058
rated-0.057
lic-0.057
spe-0.057
Token:
Token ID25
Feature activation-6.93e-01

Pos logit contributions
Ben+0.025
Le+0.024
avi+0.023
 might+0.022
 Remy+0.022
Neg logit contributions
bel-0.036
spe-0.035
imm-0.034
lic-0.034
 mant-0.034
Token fruits
Token ID15921
Feature activation-3.34e+00

Pos logit contributions
Ben+0.015
Le+0.015
avi+0.014
itch+0.014
ars+0.014
Neg logit contributions
bel-0.017
spe-0.017
 Sus-0.016
lic-0.016
ounded-0.015
Token,
Token ID11
Feature activation-9.49e-01

Pos logit contributions
Le+0.070
Ben+0.068
avi+0.064
itch+0.059
 Rex+0.058
Neg logit contributions
imm-0.087
bel-0.085
 Sus-0.083
mill-0.083
bul-0.083
Token vegetables
Token ID13701
Feature activation-3.44e+00

Pos logit contributions
Ben+0.020
Le+0.020
avi+0.019
 might+0.018
 Rex+0.018
Neg logit contributions
bel-0.024
 mant-0.023
 Sus-0.022
bul-0.022
rated-0.022
Token,
Token ID11
Feature activation-1.56e+00

Pos logit contributions
Le+0.071
Ben+0.070
avi+0.064
 Rex+0.061
 Leo+0.058
Neg logit contributions
bel-0.088
imm-0.087
 handing-0.086
 mant-0.085
 happening-0.085
Token bread
Token ID8509
Feature activation-4.42e+00

Pos logit contributions
Ben+0.030
Le+0.030
avi+0.028
itch+0.027
 Rex+0.027
Neg logit contributions
bel-0.043
 mant-0.040
lled-0.039
 Sus-0.039
bul-0.039
Token,
Token ID11
Feature activation-2.21e+00

Pos logit contributions
Ben+0.084
Le+0.083
avi+0.077
 Rex+0.071
 might+0.071
Neg logit contributions
bel-0.121
imm-0.120
lic-0.118
 mant-0.115
 handing-0.114
Token cheese
Token ID9891
Feature activation-4.79e+00

Pos logit contributions
Le+0.045
Ben+0.045
itch+0.042
 Rex+0.042
avi+0.041
Neg logit contributions
bel-0.057
 mant-0.054
 handing-0.052
lled-0.051
bul-0.050
Token,
Token ID11
Feature activation-2.36e+00

Pos logit contributions
Ben+0.095
Le+0.093
avi+0.089
 might+0.083
 Rex+0.082
Neg logit contributions
bel-0.119
 handing-0.119
imm-0.117
 mant-0.116
lic-0.115
Token and
Token ID290
Feature activation-2.10e-01

Pos logit contributions
Le+0.046
Ben+0.045
itch+0.043
 Rex+0.042
avi+0.042
Neg logit contributions
bel-0.062
 mant-0.059
 handing-0.058
lled-0.056
rated-0.055
Token toys
Token ID14958
Feature activation-1.07e+00

Pos logit contributions
Ben+0.005
Le+0.004
avi+0.004
 might+0.004
 Rex+0.004
Neg logit contributions
bel-0.005
 mant-0.005
bul-0.005
spe-0.005
 Sus-0.005
Token.
Token ID13
Feature activation-1.49e-01

Pos logit contributions
Ben+0.019
 might+0.018
Le+0.017
 will+0.017
 Rex+0.017
Neg logit contributions
rated-0.033
imm-0.033
bel-0.033
lled-0.032
bul-0.032
Token They
Token ID1119
Feature activation-3.18e-01

Pos logit contributions
Ben+0.002
 might+0.001
Le+0.001
avi+0.001
 Rex+0.001
Neg logit contributions
bel-0.005
imm-0.005
rated-0.005
bul-0.005
 mant-0.005
Token saw
Token ID2497
Feature activation-1.75e-01

Pos logit contributions
Ben+0.007
avi+0.007
Le+0.007
 might+0.007
itch+0.007
Neg logit contributions
bel-0.007
spe-0.007
lic-0.007
lled-0.007
 Sus-0.007
Token ducks
Token ID39694
Feature activation-2.06e+00

Pos logit contributions
Ben+0.003
 might+0.003
avi+0.003
Le+0.003
 will+0.003
Neg logit contributions
bel-0.005
lic-0.005
lled-0.005
imm-0.005
rated-0.005
Token,
Token ID11
Feature activation-5.02e-01

Pos logit contributions
Ben+0.037
Le+0.036
avi+0.035
 might+0.033
itch+0.032
Neg logit contributions
lling-0.058
bel-0.058
mill-0.057
lled-0.057
lic-0.057
Token fish
Token ID5916
Feature activation-4.41e+00

Pos logit contributions
Ben+0.008
 might+0.007
Le+0.007
avi+0.007
 will+0.007
Neg logit contributions
bel-0.016
imm-0.015
rated-0.015
lic-0.015
 mant-0.015
Token,
Token ID11
Feature activation-1.12e+00

Pos logit contributions
Ben+0.097
avi+0.094
Le+0.094
itch+0.090
oxy+0.087
Neg logit contributions
lic-0.104
mill-0.100
imm-0.099
bel-0.099
 handing-0.096
Token frogs
Token ID37475
Feature activation-3.39e+00

Pos logit contributions
Ben+0.015
Le+0.014
avi+0.014
 might+0.014
itch+0.012
Neg logit contributions
bel-0.037
lic-0.036
rated-0.036
lled-0.036
 mant-0.035
Token and
Token ID290
Feature activation+1.42e-01

Pos logit contributions
Ben+0.071
Le+0.069
avi+0.068
itch+0.067
oxy+0.064
Neg logit contributions
mill-0.081
 handing-0.081
lic-0.080
lling-0.078
 happening-0.077
Token turtles
Token ID36288
Feature activation-1.50e+00

Pos logit contributions
bel+0.004
imm+0.004
rated+0.004
 mant+0.004
lic+0.004
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
avi-0.002
 Rex-0.002
Token in
Token ID287
Feature activation-1.76e-01

Pos logit contributions
Ben+0.034
Le+0.032
avi+0.032
 might+0.030
itch+0.029
Neg logit contributions
lling-0.035
bel-0.035
lic-0.034
taker-0.034
 handing-0.034
Token,
Token ID11
Feature activation-2.04e+00

Pos logit contributions
Ben+0.087
Le+0.086
avi+0.082
 might+0.074
itch+0.073
Neg logit contributions
bel-0.116
lic-0.115
imm-0.114
 handing-0.111
bul-0.109
Token milk
Token ID7545
Feature activation-3.95e+00

Pos logit contributions
Ben+0.036
Le+0.035
avi+0.033
itch+0.033
 Rex+0.031
Neg logit contributions
bel-0.058
 mant-0.055
 handing-0.054
bul-0.053
lic-0.053
Token,
Token ID11
Feature activation-1.91e+00

Pos logit contributions
Ben+0.078
Le+0.077
avi+0.072
 might+0.066
Hug+0.065
Neg logit contributions
bel-0.106
imm-0.104
 handing-0.102
bul-0.102
 mant-0.102
Token eggs
Token ID9653
Feature activation-3.72e+00

Pos logit contributions
Ben+0.032
Le+0.031
avi+0.029
itch+0.028
 Rex+0.028
Neg logit contributions
bel-0.057
 mant-0.054
 handing-0.053
bul-0.053
lic-0.053
Token,
Token ID11
Feature activation-1.94e+00

Pos logit contributions
Ben+0.070
Le+0.068
avi+0.062
 might+0.060
 Rex+0.059
Neg logit contributions
imm-0.106
bel-0.105
lic-0.102
bul-0.102
 mant-0.101
Token cheese
Token ID9891
Feature activation-4.40e+00

Pos logit contributions
Ben+0.033
Le+0.032
avi+0.030
itch+0.029
 Rex+0.029
Neg logit contributions
bel-0.057
 mant-0.054
 handing-0.053
bul-0.053
lic-0.053
Token,
Token ID11
Feature activation-2.07e+00

Pos logit contributions
Ben+0.088
Le+0.087
avi+0.084
 might+0.077
Hug+0.076
Neg logit contributions
 handing-0.110
bel-0.109
imm-0.108
lic-0.107
 mant-0.106
Token and
Token ID290
Feature activation-1.15e-01

Pos logit contributions
Ben+0.035
Le+0.035
avi+0.032
 Rex+0.032
itch+0.031
Neg logit contributions
bel-0.059
 mant-0.057
 handing-0.056
bul-0.056
lic-0.056
Token more
Token ID517
Feature activation-7.87e-01

Pos logit contributions
Ben+0.002
Le+0.002
avi+0.002
 might+0.002
 Rex+0.002
Neg logit contributions
bel-0.003
 mant-0.003
bul-0.003
spe-0.003
lic-0.003
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
Ben+0.016
Le+0.015
avi+0.015
 might+0.014
 Rex+0.014
Neg logit contributions
bel-0.021
 mant-0.020
rated-0.020
bul-0.020
imm-0.020
Token They
Token ID1119
Feature activation-1.06e+00

Pos logit contributions
Ben+0.005
 might+0.004
Le+0.004
avi+0.004
 Rex+0.004
Neg logit contributions
bel-0.008
imm-0.008
rated-0.008
 mant-0.008
lic-0.008
Token new
Token ID649
Feature activation-2.89e-01

Pos logit contributions
Ben+0.020
Le+0.019
avi+0.019
 might+0.018
 Leo+0.017
Neg logit contributions
bel-0.019
bul-0.018
lic-0.018
imm-0.018
taker-0.017
Token words
Token ID2456
Feature activation-1.01e+00

Pos logit contributions
Ben+0.006
Le+0.005
 might+0.005
avi+0.005
 Rex+0.005
Neg logit contributions
bel-0.008
imm-0.008
bul-0.008
lic-0.008
rated-0.008
Token,
Token ID11
Feature activation-5.32e-01

Pos logit contributions
Ben+0.020
Le+0.019
avi+0.019
 might+0.017
 Remy+0.016
Neg logit contributions
bul-0.028
bel-0.027
taker-0.027
mill-0.027
imm-0.027
Token like
Token ID588
Feature activation-2.03e+00

Pos logit contributions
Ben+0.011
Le+0.010
avi+0.010
 might+0.010
itch+0.009
Neg logit contributions
bel-0.014
bul-0.014
 mant-0.014
 handing-0.013
imm-0.013
Token "
Token ID366
Feature activation-2.54e-01

Pos logit contributions
Ben+0.041
avi+0.039
Le+0.038
 might+0.036
ars+0.036
Neg logit contributions
bel-0.051
 handing-0.050
bul-0.050
 mant-0.049
spe-0.049
Tokenban
Token ID3820
Feature activation-4.39e+00

Pos logit contributions
 might+0.005
avi+0.005
Ben+0.005
 Leo+0.005
 Rex+0.005
Neg logit contributions
bel-0.007
imm-0.006
bul-0.006
lic-0.006
spe-0.006
Tokenana
Token ID2271
Feature activation-1.29e+00

Pos logit contributions
 might+0.098
Ben+0.094
Le+0.093
 will+0.087
 Rex+0.086
Neg logit contributions
lic-0.106
imm-0.101
bel-0.100
shadow-0.097
 handing-0.096
Token"
Token ID1
Feature activation+2.25e-02

Pos logit contributions
Ben+0.025
Le+0.024
 might+0.024
avi+0.023
 Rex+0.022
Neg logit contributions
bel-0.034
lic-0.033
imm-0.033
bul-0.033
taker-0.033
Token and
Token ID290
Feature activation-1.43e+00

Pos logit contributions
bel+0.001
lic+0.001
imm+0.001
bul+0.001
rated+0.001
Neg logit contributions
Ben-0.000
Le-0.000
avi-0.000
 might-0.000
 Remy-0.000
Token "
Token ID366
Feature activation-8.41e-02

Pos logit contributions
Ben+0.035
 might+0.034
Le+0.034
avi+0.033
 will+0.032
Neg logit contributions
bel-0.030
 mant-0.029
bul-0.029
lic-0.029
imm-0.028
Tokenflower
Token ID25547
Feature activation-1.63e+00

Pos logit contributions
 might+0.002
 Rex+0.002
avi+0.002
 Leo+0.002
Ben+0.002
Neg logit contributions
bel-0.002
imm-0.002
lic-0.002
bul-0.002
lled-0.002
Token see
Token ID766
Feature activation-2.24e-01

Pos logit contributions
Le+0.028
oxy+0.025
Ben+0.025
 Rex+0.025
 Remy+0.025
Neg logit contributions
 mant-0.028
bel-0.027
 Sus-0.027
rated-0.026
ounded-0.026
Token apples
Token ID22514
Feature activation-2.91e+00

Pos logit contributions
Le+0.005
Ben+0.005
avi+0.004
 might+0.004
 Remy+0.004
Neg logit contributions
bel-0.006
 Sus-0.005
spe-0.005
 mant-0.005
taker-0.005
Token,
Token ID11
Feature activation-6.74e-01

Pos logit contributions
Ben+0.058
Le+0.058
avi+0.056
 might+0.049
itch+0.049
Neg logit contributions
bel-0.076
imm-0.075
bul-0.074
mill-0.074
 handing-0.074
Token bananas
Token ID35484
Feature activation-3.71e+00

Pos logit contributions
Ben+0.014
Le+0.014
avi+0.013
itch+0.013
 might+0.013
Neg logit contributions
bel-0.017
lic-0.016
 mant-0.016
bul-0.016
lled-0.016
Token,
Token ID11
Feature activation-1.43e+00

Pos logit contributions
Ben+0.079
Le+0.079
avi+0.075
itch+0.068
oxy+0.067
Neg logit contributions
 handing-0.090
bel-0.090
lic-0.088
imm-0.087
ount-0.086
Token bread
Token ID8509
Feature activation-4.39e+00

Pos logit contributions
Ben+0.026
Le+0.026
avi+0.025
itch+0.024
 might+0.023
Neg logit contributions
bel-0.040
 mant-0.037
bul-0.037
lic-0.037
 handing-0.037
Token,
Token ID11
Feature activation-2.04e+00

Pos logit contributions
Ben+0.087
Le+0.086
avi+0.082
 might+0.074
itch+0.073
Neg logit contributions
bel-0.116
lic-0.115
imm-0.114
 handing-0.111
bul-0.109
Token milk
Token ID7545
Feature activation-3.95e+00

Pos logit contributions
Ben+0.036
Le+0.035
avi+0.033
itch+0.033
 Rex+0.031
Neg logit contributions
bel-0.058
 mant-0.055
 handing-0.054
bul-0.053
lic-0.053
Token,
Token ID11
Feature activation-1.91e+00

Pos logit contributions
Ben+0.078
Le+0.077
avi+0.072
 might+0.066
Hug+0.065
Neg logit contributions
bel-0.106
imm-0.104
 handing-0.102
bul-0.102
 mant-0.102
Token eggs
Token ID9653
Feature activation-3.72e+00

Pos logit contributions
Ben+0.032
Le+0.031
avi+0.029
itch+0.028
 Rex+0.028
Neg logit contributions
bel-0.057
 mant-0.054
 handing-0.053
bul-0.053
lic-0.053
Token,
Token ID11
Feature activation-1.94e+00

Pos logit contributions
Ben+0.070
Le+0.068
avi+0.062
 might+0.060
 Rex+0.059
Neg logit contributions
imm-0.106
bel-0.105
lic-0.102
bul-0.102
 mant-0.101
Token saw
Token ID2497
Feature activation-6.51e-01

Pos logit contributions
itch+0.026
avi+0.026
Le+0.025
Ben+0.025
ars+0.025
Neg logit contributions
bel-0.027
 Sus-0.027
spe-0.026
lled-0.025
lic-0.025
Token birds
Token ID10087
Feature activation-2.15e+00

Pos logit contributions
Ben+0.012
Le+0.012
avi+0.012
 might+0.012
 will+0.011
Neg logit contributions
bel-0.018
lic-0.017
lled-0.017
spe-0.017
imm-0.017
Token,
Token ID11
Feature activation-9.12e-01

Pos logit contributions
Ben+0.047
Le+0.045
avi+0.044
itch+0.042
 might+0.040
Neg logit contributions
mill-0.054
lling-0.053
bul-0.052
 happening-0.052
lic-0.052
Token flowers
Token ID12734
Feature activation-3.40e+00

Pos logit contributions
Ben+0.016
avi+0.015
 might+0.015
Le+0.015
 will+0.014
Neg logit contributions
bel-0.026
lic-0.025
rated-0.025
lled-0.025
bul-0.025
Token,
Token ID11
Feature activation-1.33e+00

Pos logit contributions
Ben+0.075
Le+0.073
avi+0.073
itch+0.073
oxy+0.067
Neg logit contributions
lic-0.079
imm-0.078
mill-0.078
 handing-0.078
bel-0.077
Token swings
Token ID26728
Feature activation-4.37e+00

Pos logit contributions
Ben+0.023
Le+0.022
avi+0.022
 might+0.022
itch+0.022
Neg logit contributions
bel-0.038
lled-0.036
lic-0.036
bul-0.036
lling-0.036
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
avi+0.092
itch+0.090
Ben+0.090
Le+0.086
oxy+0.082
Neg logit contributions
bel-0.110
imm-0.104
lic-0.103
ount-0.101
mill-0.100
Token and
Token ID290
Feature activation-2.38e-02

Pos logit contributions
Ben+0.016
 might+0.015
avi+0.015
Le+0.015
 Rex+0.014
Neg logit contributions
bel-0.031
rated-0.030
lic-0.030
bul-0.030
lled-0.030
Token slides
Token ID19392
Feature activation-2.48e+00

Pos logit contributions
Ben+0.000
 might+0.000
Le+0.000
avi+0.000
 Rex+0.000
Neg logit contributions
bel-0.001
rated-0.001
imm-0.001
lic-0.001
 mant-0.001
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
Ben+0.049
avi+0.046
Le+0.046
 might+0.045
 Leo+0.042
Neg logit contributions
bel-0.068
imm-0.065
lic-0.065
rated-0.065
spe-0.065
Token They
Token ID1119
Feature activation-8.09e-01

Pos logit contributions
Ben+0.002
 might+0.002
Le+0.002
avi+0.002
 will+0.002
Neg logit contributions
bel-0.004
imm-0.004
rated-0.004
 mant-0.004
lic-0.004
Token.
Token ID13
Feature activation-7.30e-01

Pos logit contributions
Ben+0.030
Le+0.029
avi+0.029
 might+0.028
 Rex+0.026
Neg logit contributions
bel-0.043
imm-0.043
lic-0.041
bul-0.041
rated-0.041
Token It
Token ID632
Feature activation-1.43e+00

Pos logit contributions
 might+0.018
avi+0.018
Ben+0.018
Le+0.017
ars+0.016
Neg logit contributions
 Sus-0.015
 handing-0.014
 mant-0.014
bul-0.014
ount-0.013
Token is
Token ID318
Feature activation-9.11e-01

Pos logit contributions
Ben+0.021
avi+0.020
Le+0.020
 Rover+0.017
 Rex+0.016
Neg logit contributions
imm-0.045
bul-0.045
rated-0.044
bel-0.044
 mant-0.043
Token cheese
Token ID9891
Feature activation-2.63e+00

Pos logit contributions
Ben+0.021
Le+0.020
avi+0.020
 might+0.019
 Rex+0.018
Neg logit contributions
 mant-0.021
imm-0.021
bel-0.021
rated-0.021
bul-0.020
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
avi+0.059
Le+0.058
Ben+0.058
 might+0.052
oxy+0.051
Neg logit contributions
imm-0.060
bel-0.059
taker-0.059
bul-0.057
 handing-0.057
Token ham
Token ID8891
Feature activation-4.34e+00

Pos logit contributions
avi+0.022
Ben+0.022
Le+0.021
itch+0.019
athy+0.019
Neg logit contributions
bel-0.031
 mant-0.031
bul-0.031
 handing-0.030
 causing-0.029
Token.
Token ID13
Feature activation-6.95e-01

Pos logit contributions
Ben+0.103
Le+0.102
athy+0.098
 might+0.097
avi+0.096
Neg logit contributions
bel-0.089
imm-0.086
 handing-0.085
mill-0.082
ter-0.082
Token Do
Token ID2141
Feature activation-3.33e+00

Pos logit contributions
 might+0.017
Ben+0.016
avi+0.016
Le+0.016
 cane+0.015
Neg logit contributions
 Sus-0.014
 mant-0.014
 handing-0.014
ount-0.014
bul-0.014
Token you
Token ID345
Feature activation+9.25e-01

Pos logit contributions
avi+0.057
unts+0.054
 cane+0.054
Ben+0.052
 tractor+0.051
Neg logit contributions
shadow-0.081
 how-0.079
 Sus-0.078
hr-0.078
bel-0.076
Token like
Token ID588
Feature activation-1.32e+00

Pos logit contributions
rated+0.020
bel+0.019
taker+0.019
shadow+0.019
imm+0.019
Neg logit contributions
Le-0.023
 Rex-0.022
avi-0.022
Ben-0.021
 Rover-0.021
Token cheese
Token ID9891
Feature activation-1.64e+00

Pos logit contributions
avi+0.029
Ben+0.027
ars+0.027
Le+0.026
 might+0.025
Neg logit contributions
 handing-0.031
 mant-0.029
bul-0.028
 Sus-0.028
 how-0.028
Token.
Token ID13
Feature activation-1.36e-01

Pos logit contributions
Ben+0.002
Le+0.002
avi+0.002
 might+0.002
 Remy+0.002
Neg logit contributions
bel-0.003
spe-0.002
rated-0.002
 mant-0.002
imm-0.002
Token He
Token ID679
Feature activation-2.19e+00

Pos logit contributions
avi+0.002
 might+0.002
Ben+0.002
 will+0.002
Le+0.002
Neg logit contributions
lic-0.004
bel-0.004
taker-0.004
eway-0.004
imm-0.004
Token sees
Token ID7224
Feature activation-2.25e-01

Pos logit contributions
oxy+0.043
Le+0.043
Ben+0.042
avi+0.041
 Remy+0.039
Neg logit contributions
bel-0.055
 mant-0.052
spe-0.052
lled-0.051
ounded-0.051
Token flowers
Token ID12734
Feature activation-2.05e+00

Pos logit contributions
Ben+0.004
 might+0.004
avi+0.004
Le+0.004
 will+0.004
Neg logit contributions
bel-0.006
lic-0.006
imm-0.006
bul-0.006
taker-0.006
Token,
Token ID11
Feature activation-7.24e-01

Pos logit contributions
Ben+0.040
Le+0.039
avi+0.038
oxy+0.035
itch+0.035
Neg logit contributions
bel-0.055
imm-0.055
bul-0.054
lic-0.054
lling-0.053
Token grass
Token ID8701
Feature activation-4.32e+00

Pos logit contributions
Ben+0.013
 might+0.013
avi+0.013
Le+0.013
 will+0.012
Neg logit contributions
bel-0.020
rated-0.019
lic-0.019
 mant-0.019
imm-0.019
Token,
Token ID11
Feature activation-1.70e+00

Pos logit contributions
Ben+0.088
avi+0.084
Le+0.084
oxy+0.078
itch+0.078
Neg logit contributions
mill-0.111
imm-0.111
bel-0.110
 happening-0.108
lic-0.107
Token and
Token ID290
Feature activation-3.44e-01

Pos logit contributions
Ben+0.029
avi+0.028
Le+0.027
 might+0.027
itch+0.026
Neg logit contributions
bel-0.050
 mant-0.048
lled-0.047
bul-0.047
lling-0.047
Token a
Token ID257
Feature activation-2.05e+00

Pos logit contributions
Ben+0.006
 might+0.006
Le+0.006
avi+0.006
 Rex+0.006
Neg logit contributions
bel-0.010
 mant-0.009
rated-0.009
bul-0.009
imm-0.009
Token swing
Token ID9628
Feature activation-1.32e+00

Pos logit contributions
avi+0.042
Ben+0.041
Le+0.040
 might+0.040
 Rex+0.037
Neg logit contributions
bel-0.052
lic-0.050
 mant-0.050
spe-0.050
bul-0.049
Token.
Token ID13
Feature activation-4.40e-01

Pos logit contributions
Ben+0.027
avi+0.026
Le+0.026
 might+0.024
 Remy+0.023
Neg logit contributions
bel-0.035
imm-0.034
spe-0.034
taker-0.034
bul-0.033
Token,
Token ID11
Feature activation+5.12e-01

Pos logit contributions
rated+0.003
 mant+0.003
bel+0.003
lling+0.003
imm+0.003
Neg logit contributions
Ben-0.001
 might-0.001
 Leo-0.001
 Rex-0.001
Le-0.001
Token there
Token ID612
Feature activation+7.53e-01

Pos logit contributions
 mant+0.013
bel+0.013
imm+0.013
rated+0.013
bul+0.012
Neg logit contributions
Ben-0.012
 might-0.011
 Leo-0.011
Le-0.011
 Rex-0.011
Token lived
Token ID5615
Feature activation-1.15e+00

Pos logit contributions
imm+0.018
bel+0.018
 Sus+0.018
lled+0.017
spe+0.017
Neg logit contributions
Ben-0.016
avi-0.016
Le-0.016
 might-0.015
 Remy-0.015
Token a
Token ID257
Feature activation-2.04e-01

Pos logit contributions
Ben+0.021
 might+0.020
Le+0.019
avi+0.019
 Rex+0.018
Neg logit contributions
bel-0.033
imm-0.033
rated-0.033
 mant-0.032
lic-0.032
Token strong
Token ID1913
Feature activation+8.81e-01

Pos logit contributions
 might+0.006
avi+0.006
Le+0.006
ars+0.006
 cones+0.006
Neg logit contributions
lic-0.003
bel-0.003
 Sus-0.003
bul-0.003
 mant-0.003
Token mer
Token ID4017
Feature activation+4.29e+00

Pos logit contributions
bul+0.021
taker+0.021
 mant+0.021
 Sus+0.021
spe+0.021
Neg logit contributions
Le-0.018
Ben-0.018
 might-0.018
avi-0.017
 will-0.017
Tokenmaid
Token ID23151
Feature activation+9.23e-02

Pos logit contributions
shadow+0.079
lled+0.076
rated+0.076
 causing+0.073
 mant+0.073
Neg logit contributions
avi-0.118
Ben-0.117
 might-0.115
 will-0.111
 Leo-0.110
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
bel+0.003
rated+0.003
imm+0.003
 mant+0.003
lic+0.003
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
avi-0.002
 Rex-0.001
Token She
Token ID1375
Feature activation-1.59e-01

Pos logit contributions
bel+0.044
imm+0.044
 mant+0.043
lic+0.043
rated+0.043
Neg logit contributions
Ben-0.020
 might-0.019
Le-0.018
avi-0.018
 will-0.017
Token had
Token ID550
Feature activation+6.63e-01

Pos logit contributions
Ben+0.003
Le+0.003
avi+0.003
 might+0.003
 Rex+0.003
Neg logit contributions
bel-0.004
imm-0.004
lic-0.004
rated-0.004
lled-0.004
Token beautiful
Token ID4950
Feature activation+2.29e-01

Pos logit contributions
bel+0.018
lic+0.017
 mant+0.017
imm+0.017
lled+0.017
Neg logit contributions
Ben-0.013
avi-0.013
 might-0.012
Le-0.012
 Rex-0.012
Token says
Token ID1139
Feature activation-9.95e-01

Pos logit contributions
Le+0.006
Ben+0.005
avi+0.005
oxy+0.005
 might+0.005
Neg logit contributions
 mant-0.009
bul-0.009
lic-0.009
bel-0.009
taker-0.009
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
Ben+0.020
Le+0.019
 might+0.019
avi+0.019
oxy+0.018
Neg logit contributions
taker-0.026
 Sus-0.026
bel-0.025
lic-0.025
bul-0.024
Token "
Token ID366
Feature activation+9.80e-01

Pos logit contributions
rated+0.028
imm+0.028
bel+0.028
lled+0.027
spe+0.027
Neg logit contributions
Ben-0.023
 might-0.021
Le-0.021
 Rex-0.021
 Leo-0.021
TokenThank
Token ID10449
Feature activation-1.17e+00

Pos logit contributions
bel+0.026
imm+0.025
lic+0.025
bul+0.025
rated+0.025
Neg logit contributions
 might-0.019
Ben-0.019
avi-0.018
Le-0.018
 Rex-0.018
Token you
Token ID345
Feature activation+1.03e+00

Pos logit contributions
 might+0.019
avi+0.018
 vet+0.017
Ben+0.016
 Remy+0.016
Neg logit contributions
taker-0.033
 Sus-0.033
bel-0.033
lic-0.033
hr-0.033
Token,
Token ID11
Feature activation+4.27e+00

Pos logit contributions
lic+0.029
taker+0.029
 Sus+0.028
bel+0.028
 mant+0.028
Neg logit contributions
Ben-0.019
Le-0.018
avi-0.018
 might-0.018
 Remy-0.016
Token Daddy
Token ID26926
Feature activation-7.14e-01

Pos logit contributions
bel+0.108
imm+0.106
rated+0.105
 mant+0.104
lic+0.104
Neg logit contributions
Ben-0.092
 might-0.088
Le-0.087
avi-0.086
 Rex-0.083
Token.
Token ID13
Feature activation+8.16e-01

Pos logit contributions
Ben+0.015
Le+0.015
avi+0.014
 might+0.013
 Remy+0.013
Neg logit contributions
rated-0.018
imm-0.018
bul-0.018
bel-0.018
lic-0.018
Token I
Token ID314
Feature activation+2.16e-01

Pos logit contributions
bel+0.022
 mant+0.022
imm+0.022
rated+0.022
bul+0.022
Neg logit contributions
Ben-0.016
 might-0.015
Le-0.015
avi-0.015
 will-0.014
Token love
Token ID1842
Feature activation-5.78e-01

Pos logit contributions
bel+0.007
imm+0.007
rated+0.007
lic+0.007
 mant+0.007
Neg logit contributions
Ben-0.003
Le-0.003
 might-0.003
avi-0.003
 Rex-0.002
Token you
Token ID345
Feature activation+1.34e+00

Pos logit contributions
Ben+0.006
 might+0.006
Le+0.006
avi+0.005
 will+0.005
Neg logit contributions
bel-0.021
 mant-0.020
bul-0.020
 Sus-0.020
lic-0.020
Token,
Token ID11
Feature activation+2.24e+00

Pos logit contributions
rated+0.011
lling+0.011
imm+0.011
lled+0.011
 mant+0.010
Neg logit contributions
Ben-0.002
 might-0.002
 will-0.002
 Leo-0.002
 Rex-0.002
Token "
Token ID366
Feature activation+8.26e-01

Pos logit contributions
imm+0.059
rated+0.058
bel+0.058
lled+0.057
spe+0.056
Neg logit contributions
Ben-0.047
 might-0.046
Le-0.044
 Rex-0.044
 Leo-0.044
TokenI
Token ID40
Feature activation+2.84e-01

Pos logit contributions
bel+0.025
rated+0.024
bul+0.024
imm+0.024
 mant+0.024
Neg logit contributions
 might-0.013
Ben-0.013
avi-0.013
Le-0.013
 Leo-0.013
Token'm
Token ID1101
Feature activation+5.52e-01

Pos logit contributions
bel+0.008
rated+0.007
lic+0.007
bul+0.007
imm+0.007
Neg logit contributions
Ben-0.006
Le-0.005
avi-0.005
 might-0.005
 Leo-0.005
Token sorry
Token ID7926
Feature activation+6.52e-01

Pos logit contributions
rated+0.019
bel+0.019
spe+0.018
imm+0.018
eway+0.018
Neg logit contributions
 Ben-0.009
 cannot-0.009
 Rex-0.009
 Leo-0.008
 might-0.008
Token,
Token ID11
Feature activation+4.09e+00

Pos logit contributions
bel+0.020
rated+0.019
imm+0.019
bul+0.019
 mant+0.019
Neg logit contributions
Ben-0.011
Le-0.010
avi-0.010
 might-0.010
 Leo-0.009
Token grandma
Token ID49890
Feature activation+1.61e-01

Pos logit contributions
bel+0.130
bul+0.128
 mant+0.128
rated+0.127
imm+0.126
Neg logit contributions
Ben-0.060
Le-0.058
 might-0.056
avi-0.056
 Leo-0.050
Token.
Token ID13
Feature activation+7.58e-01

Pos logit contributions
bel+0.005
imm+0.005
rated+0.005
bul+0.005
lic+0.005
Neg logit contributions
Ben-0.003
Le-0.003
 might-0.003
avi-0.003
 Rex-0.003
Token I
Token ID314
Feature activation+5.15e-01

Pos logit contributions
bel+0.024
imm+0.023
rated+0.023
lled+0.023
lling+0.023
Neg logit contributions
Ben-0.012
 might-0.011
Le-0.011
avi-0.010
 Rex-0.010
Token didn
Token ID1422
Feature activation+4.05e-01

Pos logit contributions
bel+0.014
imm+0.014
rated+0.014
lic+0.014
 mant+0.013
Neg logit contributions
Ben-0.010
Le-0.010
 might-0.010
avi-0.009
 Rex-0.009
Token't
Token ID470
Feature activation+5.99e-01

Pos logit contributions
bel+0.012
bul+0.012
lic+0.012
 mant+0.012
imm+0.011
Neg logit contributions
Ben-0.007
 might-0.007
avi-0.007
Le-0.007
 will-0.006
Token
Token ID220
Feature activation+2.32e-01

Pos logit contributions
taker+0.011
lic+0.011
eway+0.011
 mant+0.011
 Sus+0.011
Neg logit contributions
Ben-0.009
avi-0.009
Le-0.009
 might-0.009
 will-0.008
Token
Token ID198
Feature activation-1.04e-01

Pos logit contributions
bel+0.008
imm+0.008
lic+0.008
spe+0.007
rated+0.007
Neg logit contributions
Ben-0.003
 might-0.003
Le-0.003
avi-0.003
 will-0.003
Token
Token ID198
Feature activation-3.30e-01

Pos logit contributions
 might+0.002
avi+0.002
 will+0.001
Ben+0.001
Le+0.001
Neg logit contributions
bel-0.003
lic-0.003
shadow-0.003
spe-0.003
eway-0.003
TokenThe
Token ID464
Feature activation+1.84e-01

Pos logit contributions
 might+0.007
 will+0.006
avi+0.006
Ben+0.006
 cones+0.006
Neg logit contributions
bel-0.009
spe-0.009
shadow-0.009
lic-0.009
eway-0.009
Token town
Token ID3240
Feature activation+1.93e+00

Pos logit contributions
bel+0.005
 mant+0.005
bul+0.004
spe+0.004
rated+0.004
Neg logit contributions
Ben-0.004
 might-0.004
Le-0.004
avi-0.004
 Leo-0.003
Tokenspe
Token ID4125
Feature activation+5.93e+00

Pos logit contributions
bel+0.043
imm+0.042
rated+0.042
 mant+0.042
lic+0.042
Neg logit contributions
Ben-0.046
 might-0.045
Le-0.043
avi-0.043
 will-0.043
Tokenople
Token ID643
Feature activation+1.04e+00

Pos logit contributions
lic+0.133
 mant+0.124
bel+0.124
eway+0.123
taker+0.120
Neg logit contributions
 might-0.141
Ben-0.141
avi-0.140
 will-0.136
Le-0.135
Token had
Token ID550
Feature activation+4.03e-01

Pos logit contributions
bel+0.031
imm+0.031
 mant+0.031
rated+0.031
bul+0.030
Neg logit contributions
 might-0.017
Ben-0.017
avi-0.016
 will-0.016
Le-0.016
Token to
Token ID284
Feature activation+8.65e-01

Pos logit contributions
bel+0.011
imm+0.011
rated+0.011
 mant+0.010
bul+0.010
Neg logit contributions
Ben-0.008
 might-0.008
Le-0.008
avi-0.008
 will-0.007
Token find
Token ID1064
Feature activation+3.71e-01

Pos logit contributions
bel+0.025
imm+0.024
rated+0.024
 mant+0.024
bul+0.024
Neg logit contributions
Ben-0.015
 might-0.015
Le-0.015
avi-0.014
 Rex-0.014
Token a
Token ID257
Feature activation-2.92e-01

Pos logit contributions
bel+0.010
rated+0.010
imm+0.010
lic+0.010
 mant+0.010
Neg logit contributions
Ben-0.007
 might-0.007
Le-0.007
avi-0.007
 Rex-0.007
Token Joe
Token ID5689
Feature activation+9.99e-01

Pos logit contributions
lic+0.005
bel+0.005
imm+0.005
 Sus+0.005
 mant+0.005
Neg logit contributions
Ben-0.003
Le-0.003
 might-0.003
avi-0.003
 will-0.003
Token was
Token ID373
Feature activation+7.30e-02

Pos logit contributions
spe+0.024
bel+0.024
lled+0.023
lling+0.023
lic+0.023
Neg logit contributions
Ben-0.023
Le-0.022
avi-0.021
 might-0.020
 Rex-0.020
Token scared
Token ID12008
Feature activation-1.66e-01

Pos logit contributions
rated+0.002
imm+0.002
bel+0.002
bul+0.002
ounded+0.002
Neg logit contributions
Ben-0.002
 might-0.002
 Rex-0.002
 Leo-0.002
avi-0.001
Token and
Token ID290
Feature activation+3.54e-01

Pos logit contributions
Ben+0.003
Le+0.003
avi+0.003
 might+0.003
 will+0.002
Neg logit contributions
bel-0.005
bul-0.005
imm-0.005
rated-0.005
 Sus-0.005
Token he
Token ID339
Feature activation+3.87e-01

Pos logit contributions
bel+0.010
imm+0.009
rated+0.009
 mant+0.009
bul+0.009
Neg logit contributions
Ben-0.007
 might-0.007
Le-0.007
avi-0.006
 Rex-0.006
Token slammed
Token ID21376
Feature activation+4.13e+00

Pos logit contributions
bel+0.009
lic+0.008
spe+0.008
rated+0.008
taker+0.008
Neg logit contributions
Ben-0.010
Le-0.009
avi-0.009
 Leo-0.008
 Remy-0.008
Token the
Token ID262
Feature activation+1.49e+00

Pos logit contributions
rated+0.124
imm+0.124
bel+0.124
lic+0.122
 mant+0.121
Neg logit contributions
 might-0.068
Ben-0.065
 Rex-0.065
Le-0.063
 Leo-0.062
Token jar
Token ID17379
Feature activation+5.56e-01

Pos logit contributions
imm+0.040
bel+0.039
rated+0.039
lled+0.038
spe+0.038
Neg logit contributions
Ben-0.031
 might-0.030
Le-0.029
 will-0.028
avi-0.028
Token shut
Token ID4423
Feature activation+1.92e+00

Pos logit contributions
bel+0.014
rated+0.014
imm+0.014
 mant+0.014
bul+0.014
Neg logit contributions
 might-0.012
Ben-0.011
 will-0.011
Le-0.011
 Rex-0.010
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
bel+0.052
imm+0.051
rated+0.051
lic+0.051
 mant+0.050
Neg logit contributions
Ben-0.037
 might-0.037
Le-0.035
 Rex-0.035
avi-0.035
Token
Token ID198
Feature activation+8.91e-01

Pos logit contributions
bel+0.016
imm+0.016
rated+0.016
lic+0.016
 mant+0.016
Neg logit contributions
Ben-0.009
 might-0.008
Le-0.008
avi-0.008
 will-0.008
Token picked
Token ID6497
Feature activation+4.28e-01

Pos logit contributions
bel+0.009
imm+0.009
rated+0.009
 mant+0.008
lic+0.008
Neg logit contributions
Ben-0.007
 might-0.007
Le-0.007
avi-0.007
 Rex-0.006
Token it
Token ID340
Feature activation-2.83e-02

Pos logit contributions
bel+0.014
imm+0.014
rated+0.014
 mant+0.014
lic+0.014
Neg logit contributions
Ben-0.006
 might-0.005
Le-0.005
avi-0.005
 Rex-0.005
Token up
Token ID510
Feature activation-1.38e+00

Pos logit contributions
Ben+0.001
 might+0.001
Le+0.001
 will+0.001
 Rex+0.001
Neg logit contributions
bel-0.000
spe-0.000
rated-0.000
imm-0.000
lic-0.000
Token and
Token ID290
Feature activation+7.73e-03

Pos logit contributions
Ben+0.014
 might+0.013
Le+0.013
avi+0.012
 Rex+0.011
Neg logit contributions
bel-0.051
rated-0.050
imm-0.050
lic-0.049
 mant-0.049
Token looked
Token ID3114
Feature activation+2.74e+00

Pos logit contributions
imm+0.000
bel+0.000
rated+0.000
spe+0.000
lic+0.000
Neg logit contributions
Ben-0.000
 might-0.000
 will-0.000
 Rex-0.000
 Leo-0.000
Token inside
Token ID2641
Feature activation+1.58e+00

Pos logit contributions
spe+0.093
eway+0.092
bel+0.091
imm+0.091
bul+0.089
Neg logit contributions
 might-0.047
 will-0.042
 cannot-0.039
 can-0.038
 Ben-0.036
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
bel+0.056
imm+0.056
rated+0.055
lic+0.055
spe+0.054
Neg logit contributions
 might-0.020
Ben-0.019
 will-0.017
 Leo-0.017
 Rex-0.016
Token but
Token ID475
Feature activation+8.70e-01

Pos logit contributions
imm+0.034
igsaw+0.033
spe+0.033
lic+0.033
bul+0.032
Neg logit contributions
 might-0.017
 Ben-0.016
Ben-0.016
 Leo-0.015
 Rex-0.015
Token it
Token ID340
Feature activation-3.62e-01

Pos logit contributions
bel+0.030
imm+0.029
rated+0.029
lic+0.029
 mant+0.029
Neg logit contributions
Ben-0.011
 might-0.010
Le-0.010
avi-0.010
 Rex-0.009
Token was
Token ID373
Feature activation-5.64e-01

Pos logit contributions
Ben+0.007
Le+0.007
avi+0.007
 might+0.007
 Rex+0.007
Neg logit contributions
bel-0.010
imm-0.009
rated-0.009
lic-0.009
bul-0.009
Token empty
Token ID6565
Feature activation+1.07e+00

Pos logit contributions
 might+0.012
Ben+0.011
 will+0.010
Le+0.010
 Rex+0.010
Neg logit contributions
bel-0.016
imm-0.015
rated-0.015
lic-0.015
spe-0.015
Token John
Token ID1757
Feature activation+4.91e-01

Pos logit contributions
bel+0.021
spe+0.021
 Sus+0.021
lic+0.020
 mant+0.020
Neg logit contributions
Ben-0.013
Le-0.012
 might-0.012
avi-0.012
 will-0.011
Token smiled
Token ID13541
Feature activation+5.54e-01

Pos logit contributions
spe+0.012
bel+0.012
imm+0.012
rated+0.011
lic+0.011
Neg logit contributions
Ben-0.011
Le-0.010
avi-0.010
 might-0.010
 Rex-0.010
Token and
Token ID290
Feature activation-1.74e-01

Pos logit contributions
bel+0.020
rated+0.020
imm+0.020
 mant+0.020
lic+0.020
Neg logit contributions
Ben-0.006
 might-0.006
Le-0.005
avi-0.005
 Rex-0.005
Token said
Token ID531
Feature activation+2.68e-01

Pos logit contributions
Ben+0.002
 might+0.002
Le+0.002
avi+0.002
 Rex+0.002
Neg logit contributions
bel-0.006
rated-0.006
imm-0.006
 mant-0.006
bul-0.006
Token "
Token ID366
Feature activation+8.91e-01

Pos logit contributions
rated+0.008
imm+0.008
bel+0.008
 mant+0.008
lled+0.008
Neg logit contributions
Ben-0.004
 might-0.004
Le-0.004
 will-0.004
 Leo-0.004
TokenYes
Token ID5297
Feature activation+1.12e+00

Pos logit contributions
bel+0.022
imm+0.022
rated+0.022
lic+0.021
bul+0.021
Neg logit contributions
Ben-0.019
 might-0.019
avi-0.018
Le-0.018
 Rex-0.018
Token please
Token ID3387
Feature activation-1.23e+00

Pos logit contributions
rated+0.030
imm+0.030
bel+0.030
lic+0.030
taker+0.030
Neg logit contributions
Ben-0.021
Le-0.021
avi-0.020
 might-0.020
 Rex-0.019
Token!".
Token ID48220
Feature activation-5.94e-01

Pos logit contributions
Ben+0.029
Le+0.029
avi+0.027
 might+0.027
 Rex+0.026
Neg logit contributions
bel-0.028
lic-0.027
rated-0.027
imm-0.027
 mant-0.027
Token
Token ID198
Feature activation+2.51e-01

Pos logit contributions
Ben+0.009
Le+0.009
 might+0.008
avi+0.008
 will+0.008
Neg logit contributions
bel-0.019
spe-0.018
imm-0.018
 mant-0.018
bul-0.018
Token
Token ID198
Feature activation-2.15e-02

Pos logit contributions
lic+0.008
bel+0.007
spe+0.007
shadow+0.007
Sus+0.007
Neg logit contributions
 might-0.004
Ben-0.004
 will-0.004
avi-0.004
 cannot-0.004
TokenThe
Token ID464
Feature activation-1.40e-01

Pos logit contributions
 might+0.000
Ben+0.000
avi+0.000
 will+0.000
 Remy+0.000
Neg logit contributions
bel-0.001
spe-0.001
lic-0.001
shadow-0.001
lled-0.000
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
Ben+0.024
Le+0.023
avi+0.022
 might+0.020
 Remy+0.020
Neg logit contributions
bel-0.040
spe-0.039
bul-0.038
imm-0.038
 mant-0.038
Token
Token ID198
Feature activation+4.72e-02

Pos logit contributions
Ben+0.002
avi+0.002
Le+0.002
 might+0.002
 will+0.002
Neg logit contributions
bel-0.004
lic-0.004
imm-0.004
 Sus-0.004
 mant-0.004
Token
Token ID198
Feature activation+1.93e-03

Pos logit contributions
lic+0.001
bel+0.001
eway+0.001
 Sus+0.001
Sus+0.001
Neg logit contributions
avi-0.001
 might-0.001
Ben-0.001
tail-0.001
 cones-0.001
Token"
Token ID1
Feature activation+9.73e-01

Pos logit contributions
bel+0.000
imm+0.000
rated+0.000
 mant+0.000
bul+0.000
Neg logit contributions
Ben-0.000
Le-0.000
 might-0.000
avi-0.000
 Rex-0.000
TokenHello
Token ID15496
Feature activation+1.87e-01

Pos logit contributions
bel+0.026
lic+0.026
rated+0.025
bul+0.025
 mant+0.025
Neg logit contributions
 might-0.018
Ben-0.018
avi-0.018
Le-0.017
 Leo-0.017
Token,
Token ID11
Feature activation+1.77e+00

Pos logit contributions
bel+0.006
imm+0.006
rated+0.006
lic+0.006
 mant+0.006
Neg logit contributions
Ben-0.003
Le-0.003
 might-0.003
avi-0.003
 Rex-0.002
Token can
Token ID460
Feature activation-5.68e-01

Pos logit contributions
bel+0.049
lic+0.049
 mant+0.048
rated+0.048
spe+0.047
Neg logit contributions
Ben-0.033
Le-0.032
avi-0.031
 might-0.030
 Remy-0.028
Token we
Token ID356
Feature activation-6.09e-01

Pos logit contributions
Ben+0.009
avi+0.009
Le+0.008
 might+0.008
 will+0.008
Neg logit contributions
bel-0.017
shadow-0.017
 Sus-0.017
 mant-0.017
bul-0.017
Token have
Token ID423
Feature activation-6.26e-01

Pos logit contributions
Le+0.013
Ben+0.012
avi+0.012
 Leo+0.012
 Patch+0.011
Neg logit contributions
bel-0.016
 Sus-0.016
bul-0.015
 mant-0.015
lic-0.015
Token ice
Token ID4771
Feature activation-2.41e-01

Pos logit contributions
Ben+0.012
Le+0.011
avi+0.011
 might+0.011
 Leo+0.010
Neg logit contributions
bel-0.017
 mant-0.017
bul-0.017
lic-0.016
imm-0.016
Token-
Token ID12
Feature activation+1.08e+00

Pos logit contributions
Ben+0.006
 might+0.005
Le+0.005
avi+0.005
 will+0.005
Neg logit contributions
bel-0.006
rated-0.006
 mant-0.006
imm-0.006
lic-0.005
Token a
Token ID257
Feature activation-7.48e-01

Pos logit contributions
 might+0.013
avi+0.013
Ben+0.013
Le+0.012
 will+0.012
Neg logit contributions
bel-0.015
lled-0.015
 Sus-0.015
imm-0.015
lic-0.015
Token little
Token ID1310
Feature activation-2.78e-01

Pos logit contributions
 might+0.022
avi+0.021
Le+0.021
ars+0.020
 will+0.020
Neg logit contributions
lic-0.012
bel-0.012
 mant-0.011
bul-0.011
 glimpse-0.011
Token girl
Token ID2576
Feature activation-8.65e-01

Pos logit contributions
avi+0.008
Le+0.008
 might+0.008
 cannot+0.008
ars+0.008
Neg logit contributions
bul-0.005
 Sus-0.004
taker-0.004
ter-0.004
 glimpse-0.004
Token named
Token ID3706
Feature activation+8.50e-02

Pos logit contributions
Ben+0.018
 might+0.018
Le+0.017
avi+0.017
 will+0.016
Neg logit contributions
bel-0.022
imm-0.022
bul-0.021
 mant-0.021
lic-0.021
Token Lily
Token ID20037
Feature activation-6.79e-01

Pos logit contributions
bel+0.003
imm+0.003
bul+0.002
spe+0.002
lled+0.002
Neg logit contributions
Ben-0.002
 might-0.002
 Ben-0.002
Le-0.001
 Leo-0.001
Token.
Token ID13
Feature activation+8.25e-01

Pos logit contributions
Ben+0.012
 might+0.011
Le+0.011
avi+0.011
 Rex+0.010
Neg logit contributions
bel-0.020
imm-0.020
rated-0.019
bul-0.019
 mant-0.019
Token One
Token ID1881
Feature activation-1.39e+00

Pos logit contributions
bel+0.028
imm+0.028
rated+0.028
lic+0.027
 mant+0.027
Neg logit contributions
Ben-0.010
avi-0.010
Le-0.010
 might-0.010
 will-0.009
Token day
Token ID1110
Feature activation+5.50e-01

Pos logit contributions
Ben+0.033
 might+0.032
Le+0.032
avi+0.031
 will+0.030
Neg logit contributions
bel-0.032
rated-0.031
imm-0.031
 mant-0.031
lic-0.031
Token,
Token ID11
Feature activation+1.43e+00

Pos logit contributions
rated+0.021
imm+0.021
 mant+0.021
bel+0.021
lling+0.021
Neg logit contributions
Ben-0.006
 might-0.005
Le-0.005
 Rex-0.005
 will-0.004
Token she
Token ID673
Feature activation-8.92e-01

Pos logit contributions
imm+0.051
rated+0.051
spe+0.050
lling+0.050
 mant+0.050
Neg logit contributions
Ben-0.021
 might-0.019
 Ben-0.018
 Rex-0.018
 Leo-0.017
Token went
Token ID1816
Feature activation+5.30e-01

Pos logit contributions
Ben+0.017
Le+0.017
avi+0.016
 might+0.016
 Leo+0.016
Neg logit contributions
bel-0.024
rated-0.024
imm-0.024
lic-0.023
 mant-0.023
Token boy
Token ID2933
Feature activation-1.04e+00

Pos logit contributions
Le+0.009
 might+0.009
avi+0.009
Ben+0.009
ars+0.008
Neg logit contributions
bul-0.010
 Sus-0.009
mill-0.009
spe-0.009
taker-0.009
Token named
Token ID3706
Feature activation+1.92e-01

Pos logit contributions
Ben+0.022
Le+0.021
avi+0.021
 might+0.021
 Leo+0.020
Neg logit contributions
bel-0.026
imm-0.025
bul-0.025
 Sus-0.025
lic-0.025
Token Danny
Token ID15105
Feature activation-7.72e-01

Pos logit contributions
bel+0.006
imm+0.005
bul+0.005
rated+0.005
spe+0.005
Neg logit contributions
Ben-0.004
 might-0.004
 Rex-0.003
 Leo-0.003
Le-0.003
Token was
Token ID373
Feature activation-1.60e-01

Pos logit contributions
Le+0.018
Ben+0.017
avi+0.017
 Remy+0.016
 Leo+0.016
Neg logit contributions
bel-0.018
 Sus-0.017
lled-0.017
imm-0.017
rated-0.017
Token feeling
Token ID4203
Feature activation+1.59e+00

Pos logit contributions
Ben+0.003
 might+0.003
Le+0.003
 Rex+0.003
avi+0.003
Neg logit contributions
rated-0.004
imm-0.004
bel-0.004
lic-0.004
bul-0.004
Token very
Token ID845
Feature activation+8.84e-01

Pos logit contributions
imm+0.046
rated+0.045
bel+0.044
eway+0.044
lic+0.044
Neg logit contributions
Ben-0.029
 might-0.029
 will-0.027
 Rex-0.026
 cannot-0.026
Token worried
Token ID7960
Feature activation+2.17e+00

Pos logit contributions
bel+0.021
imm+0.021
bul+0.021
rated+0.021
lic+0.021
Neg logit contributions
Ben-0.020
 might-0.019
Le-0.019
avi-0.019
 will-0.018
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
bel+0.070
imm+0.069
lic+0.068
ounded+0.068
shadow+0.068
Neg logit contributions
 might-0.032
Ben-0.030
 Rex-0.029
 will-0.028
 Leo-0.027
Token He
Token ID679
Feature activation-4.89e-02

Pos logit contributions
bel+0.037
lic+0.037
rated+0.037
imm+0.036
 mant+0.036
Neg logit contributions
Ben-0.016
Le-0.016
avi-0.015
 might-0.015
 will-0.014
Token had
Token ID550
Feature activation+8.38e-01

Pos logit contributions
Le+0.001
avi+0.001
 cones+0.001
Ben+0.001
 Leo+0.001
Neg logit contributions
rated-0.001
lic-0.001
any-0.001
bel-0.001
lled-0.001
Token been
Token ID587
Feature activation+1.70e+00

Pos logit contributions
imm+0.025
bel+0.024
rated+0.024
lic+0.024
 mant+0.024
Neg logit contributions
Ben-0.015
 might-0.014
Le-0.014
avi-0.013
 will-0.013
Token.
Token ID13
Feature activation+1.96e-01

Pos logit contributions
Ben+0.014
Le+0.014
avi+0.013
itch+0.012
 Remy+0.012
Neg logit contributions
bel-0.016
lling-0.015
bul-0.015
rated-0.015
spe-0.015
Token He
Token ID679
Feature activation-7.64e-01

Pos logit contributions
ounded+0.006
lic+0.006
rated+0.005
 mant+0.005
bel+0.005
Neg logit contributions
Le-0.003
Ben-0.003
avi-0.003
 might-0.003
 will-0.003
Token looked
Token ID3114
Feature activation+2.28e+00

Pos logit contributions
avi+0.019
Ben+0.019
 cones+0.019
Le+0.018
itch+0.018
Neg logit contributions
lic-0.016
rated-0.015
bul-0.014
bel-0.014
lled-0.014
Token up
Token ID510
Feature activation-3.64e-01

Pos logit contributions
bel+0.049
imm+0.049
spe+0.048
lic+0.047
bul+0.046
Neg logit contributions
 might-0.060
Ben-0.057
 will-0.057
 Rex-0.055
 Leo-0.055
Token and
Token ID290
Feature activation-7.20e-03

Pos logit contributions
Ben+0.005
 might+0.005
Le+0.005
avi+0.004
 Rex+0.004
Neg logit contributions
bel-0.012
imm-0.012
rated-0.012
 mant-0.012
bul-0.012
Token saw
Token ID2497
Feature activation+2.84e-01

Pos logit contributions
Ben+0.000
Le+0.000
 might+0.000
avi+0.000
 Rex+0.000
Neg logit contributions
bel-0.000
rated-0.000
imm-0.000
 mant-0.000
lic-0.000
Token a
Token ID257
Feature activation-8.61e-01

Pos logit contributions
bel+0.009
rated+0.009
imm+0.009
 mant+0.008
spe+0.008
Neg logit contributions
Ben-0.005
 might-0.005
Le-0.004
 Rex-0.004
 Leo-0.004
Token group
Token ID1448
Feature activation+1.33e+00

Pos logit contributions
Ben+0.017
avi+0.017
Le+0.017
 might+0.016
 Leo+0.016
Neg logit contributions
lic-0.022
bel-0.022
bul-0.022
 mant-0.021
 glimpse-0.021
Token of
Token ID286
Feature activation+4.30e-01

Pos logit contributions
bel+0.030
imm+0.029
rated+0.029
 mant+0.029
lic+0.029
Neg logit contributions
Ben-0.032
 might-0.031
Le-0.031
avi-0.030
 Rex-0.030
Token birds
Token ID10087
Feature activation+1.34e-01

Pos logit contributions
bel+0.012
lic+0.012
bul+0.012
 mant+0.012
taker+0.012
Neg logit contributions
Ben-0.008
Le-0.007
 might-0.007
avi-0.007
 will-0.007
Token.
Token ID13
Feature activation-1.49e-01

Pos logit contributions
bel+0.003
bul+0.003
imm+0.003
lling+0.003
lic+0.003
Neg logit contributions
Ben-0.003
Le-0.003
avi-0.003
 might-0.003
 Rex-0.003
Token As
Token ID1081
Feature activation-1.15e+00

Pos logit contributions
lic+0.004
 Sus+0.004
 mant+0.004
ounded+0.004
 handing+0.004
Neg logit contributions
 might-0.003
avi-0.002
Le-0.002
 will-0.002
Ben-0.002
Token they
Token ID484
Feature activation+1.84e-01

Pos logit contributions
avi+0.026
 might+0.024
tail+0.024
ars+0.024
Le+0.023
Neg logit contributions
 Sus-0.027
bel-0.026
lic-0.026
shadow-0.025
lling-0.025
Token looked
Token ID3114
Feature activation+1.92e+00

Pos logit contributions
bel+0.005
imm+0.005
rated+0.005
lic+0.005
 mant+0.005
Neg logit contributions
Ben-0.004
 might-0.003
Le-0.003
avi-0.003
 Rex-0.003
Token closer
Token ID5699
Feature activation+1.30e-01

Pos logit contributions
imm+0.053
spe+0.053
bel+0.052
any+0.052
rated+0.051
Neg logit contributions
 might-0.040
 will-0.038
Ben-0.037
 Rex-0.035
 Leo-0.035
Token,
Token ID11
Feature activation+8.43e-01

Pos logit contributions
bel+0.003
 Sus+0.003
imm+0.003
ounded+0.003
lic+0.003
Neg logit contributions
Ben-0.003
Le-0.002
avi-0.002
itch-0.002
 might-0.002
Token they
Token ID484
Feature activation-4.08e-01

Pos logit contributions
 Sus+0.025
bel+0.025
lic+0.024
shadow+0.024
ounded+0.023
Neg logit contributions
 might-0.013
avi-0.013
Le-0.013
Ben-0.013
ars-0.012
Token saw
Token ID2497
Feature activation+3.25e-01

Pos logit contributions
Ben+0.008
Le+0.008
avi+0.007
 might+0.007
 Rex+0.007
Neg logit contributions
bel-0.011
lic-0.011
lled-0.011
rated-0.010
ounded-0.010
Token that
Token ID326
Feature activation-8.04e-01

Pos logit contributions
bel+0.008
lled+0.008
bul+0.008
lic+0.008
imm+0.008
Neg logit contributions
Ben-0.007
 might-0.006
Le-0.006
avi-0.006
 will-0.006
Token it
Token ID340
Feature activation-4.51e-02

Pos logit contributions
Ben+0.015
 might+0.014
Le+0.014
avi+0.014
 will+0.013
Neg logit contributions
bel-0.023
rated-0.022
imm-0.022
 mant-0.022
bul-0.022
Token was
Token ID373
Feature activation-6.15e-01

Pos logit contributions
Ben+0.001
 might+0.001
Le+0.001
avi+0.001
 will+0.001
Neg logit contributions
bel-0.001
imm-0.001
rated-0.001
 mant-0.001
lic-0.001
Token a
Token ID257
Feature activation-5.91e-02

Pos logit contributions
Ben+0.010
 might+0.010
 Rex+0.009
 Leo+0.009
Le+0.009
Neg logit contributions
bel-0.019
rated-0.019
imm-0.019
ounded-0.019
lic-0.018
Token saw
Token ID2497
Feature activation+4.46e-01

Pos logit contributions
Ben+0.029
Le+0.028
avi+0.028
 Rover+0.027
tail+0.027
Neg logit contributions
imm-0.026
lled-0.026
spe-0.025
bel-0.025
lic-0.025
Token something
Token ID1223
Feature activation-1.50e-01

Pos logit contributions
rated+0.015
imm+0.014
shadow+0.014
 mant+0.014
bel+0.014
Neg logit contributions
 might-0.007
 Rex-0.007
Ben-0.007
 Leo-0.006
 Remy-0.006
Token shiny
Token ID22441
Feature activation-6.77e-02

Pos logit contributions
 might+0.003
Ben+0.003
 will+0.003
 Rex+0.003
Le+0.003
Neg logit contributions
bel-0.004
rated-0.004
imm-0.004
 mant-0.004
shadow-0.004
Token in
Token ID287
Feature activation+1.14e-01

Pos logit contributions
Ben+0.001
 might+0.001
Le+0.001
avi+0.001
 will+0.001
Neg logit contributions
bel-0.002
imm-0.002
rated-0.002
 mant-0.002
lic-0.002
Token the
Token ID262
Feature activation-3.54e-01

Pos logit contributions
bel+0.003
imm+0.003
lic+0.003
rated+0.003
ounded+0.003
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
 Rex-0.002
 Leo-0.002
Token trash
Token ID13913
Feature activation-1.73e-01

Pos logit contributions
 might+0.007
Ben+0.007
 will+0.007
 cones+0.006
 dog+0.006
Neg logit contributions
bel-0.010
imm-0.010
shadow-0.010
lic-0.009
bul-0.009
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
avi+0.003
Ben+0.003
Le+0.003
oxy+0.003
 Remy+0.003
Neg logit contributions
bul-0.005
lic-0.005
taker-0.005
 mant-0.005
lling-0.004
Token
Token ID198
Feature activation+4.37e-01

Pos logit contributions
Ben+0.003
 might+0.003
Le+0.002
avi+0.002
 Rex+0.002
Neg logit contributions
bel-0.007
imm-0.007
rated-0.007
 mant-0.007
lic-0.007
Token
Token ID198
Feature activation+2.29e-01

Pos logit contributions
lic+0.014
bel+0.014
imm+0.013
eway+0.013
shadow+0.013
Neg logit contributions
avi-0.006
Ben-0.006
 might-0.006
Le-0.006
 will-0.005
TokenTim
Token ID14967
Feature activation+5.28e-02

Pos logit contributions
bul+0.008
imm+0.008
 mant+0.008
bel+0.008
rated+0.008
Neg logit contributions
Ben-0.003
Le-0.003
 might-0.003
Tom-0.003
 Rex-0.003
Token flew
Token ID13112
Feature activation-9.70e-01

Pos logit contributions
lic+0.001
bel+0.001
lled+0.001
imm+0.001
rated+0.001
Neg logit contributions
Ben-0.001
Le-0.001
avi-0.001
 Rover-0.001
 Rex-0.001
Token<|endoftext|>
Token ID50256
Feature activation+8.91e-01

Pos logit contributions
ars+0.018
 might+0.017
avi+0.017
 dove+0.017
 will+0.016
Neg logit contributions
 Sus-0.021
 Jos-0.019
what-0.019
ount-0.018
 handing-0.018
TokenL
Token ID43
Feature activation-7.26e-01

Pos logit contributions
lic+0.020
bel+0.020
eway+0.019
bul+0.019
imm+0.019
Neg logit contributions
avi-0.020
Ben-0.019
 might-0.019
Le-0.018
 Remy-0.018
Tokenily
Token ID813
Feature activation-2.88e-02

Pos logit contributions
 might+0.016
Le+0.016
Ben+0.016
avi+0.016
 Remy+0.015
Neg logit contributions
lled-0.016
eway-0.015
lic-0.015
 Sus-0.015
lling-0.015
Token and
Token ID290
Feature activation-1.70e+00

Pos logit contributions
Le+0.001
avi+0.001
Ben+0.001
 Remy+0.001
itch+0.001
Neg logit contributions
lled-0.001
any-0.001
imm-0.001
rated-0.001
 Sus-0.001
Token Ben
Token ID3932
Feature activation-3.83e-01

Pos logit contributions
ars+0.040
 might+0.039
Le+0.038
avi+0.038
Ben+0.037
Neg logit contributions
 Sus-0.040
bel-0.036
taker-0.033
lled-0.033
bul-0.032
Token are
Token ID389
Feature activation-1.07e+00

Pos logit contributions
Le+0.009
avi+0.008
 Remy+0.008
 Rex+0.008
 Leo+0.008
Neg logit contributions
lled-0.009
rated-0.009
 Sus-0.009
bel-0.008
imm-0.008
Token playing
Token ID2712
Feature activation+1.34e+00

Pos logit contributions
Ben+0.026
Le+0.026
 might+0.026
avi+0.025
 Remy+0.024
Neg logit contributions
bel-0.024
 Sus-0.022
 mant-0.021
spe-0.021
lic-0.021
Token in
Token ID287
Feature activation+5.08e-01

Pos logit contributions
bel+0.028
rated+0.028
imm+0.028
lic+0.027
 mant+0.027
Neg logit contributions
Ben-0.035
 might-0.033
Le-0.033
avi-0.032
 Rex-0.032
Token the
Token ID262
Feature activation-8.25e-01

Pos logit contributions
bel+0.013
 mant+0.012
imm+0.012
bul+0.012
spe+0.012
Neg logit contributions
Ben-0.011
 might-0.011
avi-0.010
Le-0.010
 will-0.010
Token park
Token ID3952
Feature activation-8.66e-01

Pos logit contributions
avi+0.015
 might+0.015
Ben+0.014
Le+0.014
 will+0.013
Neg logit contributions
bel-0.023
 mant-0.023
bul-0.022
imm-0.022
spe-0.022
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
Ben+0.017
Le+0.016
 might+0.016
avi+0.016
 Rex+0.015
Neg logit contributions
bel-0.024
imm-0.023
rated-0.023
spe-0.023
 mant-0.023
Token old
Token ID1468
Feature activation+5.79e-01

Pos logit contributions
bel+0.002
imm+0.002
rated+0.002
 mant+0.002
lic+0.002
Neg logit contributions
Ben-0.003
 might-0.003
Le-0.003
avi-0.003
 Rex-0.003
Token and
Token ID290
Feature activation-6.16e-01

Pos logit contributions
 Sus+0.016
bul+0.016
bel+0.016
spe+0.015
imm+0.015
Neg logit contributions
Ben-0.011
Le-0.011
avi-0.010
ars-0.010
 Leo-0.009
Token loved
Token ID6151
Feature activation-8.86e-01

Pos logit contributions
Ben+0.013
Le+0.013
avi+0.013
 might+0.012
 cones+0.012
Neg logit contributions
bel-0.016
taker-0.015
imm-0.015
 Sus-0.015
lic-0.015
Token to
Token ID284
Feature activation+1.28e+00

Pos logit contributions
Ben+0.021
Le+0.019
avi+0.019
 might+0.019
 will+0.018
Neg logit contributions
bel-0.022
bul-0.021
 mant-0.020
 Sus-0.020
spe-0.020
Token explore
Token ID7301
Feature activation+1.46e+00

Pos logit contributions
bel+0.034
 Sus+0.033
 mant+0.033
bul+0.033
imm+0.032
Neg logit contributions
Ben-0.025
Le-0.025
avi-0.023
 might-0.023
 Rex-0.022
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
bel+0.039
imm+0.038
 mant+0.038
bul+0.038
rated+0.037
Neg logit contributions
Ben-0.029
Le-0.028
avi-0.028
 might-0.028
 Rex-0.026
Token Today
Token ID6288
Feature activation+4.22e-01

Pos logit contributions
Ben+0.002
avi+0.002
Le+0.002
ars+0.002
 dove+0.002
Neg logit contributions
bul-0.003
 Sus-0.003
any-0.002
 Grac-0.002
imm-0.002
Token,
Token ID11
Feature activation+8.84e-01

Pos logit contributions
 Sus+0.012
imm+0.012
bel+0.012
bul+0.012
lic+0.011
Neg logit contributions
Ben-0.008
avi-0.007
Le-0.007
 might-0.007
ars-0.007
Token she
Token ID673
Feature activation-4.16e-01

Pos logit contributions
bel+0.023
bul+0.023
lic+0.022
 causing+0.022
taker+0.022
Neg logit contributions
Ben-0.019
avi-0.018
Le-0.016
itch-0.016
oco-0.015
Token was
Token ID373
Feature activation+5.88e-01

Pos logit contributions
Le+0.009
Ben+0.009
avi+0.009
 Rex+0.009
 Leo+0.009
Neg logit contributions
bel-0.010
rated-0.010
spe-0.010
lic-0.010
imm-0.010
Token in
Token ID287
Feature activation+5.66e-01

Pos logit contributions
rated+0.017
imm+0.017
bel+0.016
spe+0.016
bul+0.016
Neg logit contributions
 might-0.011
 Leo-0.011
 Rex-0.011
 will-0.010
 Ben-0.010
Token They
Token ID1119
Feature activation-1.24e+00

Pos logit contributions
bel+0.012
 mant+0.012
rated+0.012
 Sus+0.012
lic+0.012
Neg logit contributions
Ben-0.005
avi-0.005
Le-0.005
 might-0.005
ars-0.004
Token saw
Token ID2497
Feature activation-6.16e-01

Pos logit contributions
Le+0.030
Ben+0.030
avi+0.028
itch+0.028
oxy+0.028
Neg logit contributions
bel-0.027
 Sus-0.027
spe-0.027
lic-0.026
taker-0.025
Token many
Token ID867
Feature activation-4.43e-01

Pos logit contributions
Ben+0.014
avi+0.013
Le+0.012
 might+0.012
itch+0.012
Neg logit contributions
bel-0.015
lic-0.014
 Sus-0.014
lled-0.014
bul-0.014
Token animals
Token ID4695
Feature activation-9.46e-01

Pos logit contributions
Ben+0.009
Le+0.009
avi+0.009
itch+0.008
 Leo+0.008
Neg logit contributions
bel-0.011
lic-0.011
spe-0.011
lling-0.011
lled-0.011
Token,
Token ID11
Feature activation-6.36e-01

Pos logit contributions
Ben+0.022
Le+0.021
avi+0.020
itch+0.020
oxy+0.019
Neg logit contributions
bel-0.023
lic-0.022
ount-0.022
imm-0.022
 happening-0.021
Token like
Token ID588
Feature activation-2.78e+00

Pos logit contributions
Ben+0.015
Le+0.015
avi+0.015
 might+0.014
itch+0.014
Neg logit contributions
bel-0.015
lic-0.014
rated-0.013
spe-0.013
imm-0.013
Token lions
Token ID31420
Feature activation-2.74e+00

Pos logit contributions
Ben+0.059
avi+0.055
itch+0.055
Le+0.053
 might+0.050
Neg logit contributions
bel-0.071
lic-0.070
spe-0.068
 Sus-0.067
 causing-0.066
Token,
Token ID11
Feature activation-1.82e+00

Pos logit contributions
Ben+0.067
itch+0.063
Le+0.063
avi+0.061
oxy+0.059
Neg logit contributions
lic-0.058
imm-0.057
 happening-0.057
bel-0.055
 causing-0.054
Token ze
Token ID41271
Feature activation-3.31e+00

Pos logit contributions
itch+0.042
Ben+0.042
avi+0.040
Le+0.039
 might+0.037
Neg logit contributions
bel-0.042
lic-0.042
ount-0.038
 causing-0.037
spe-0.037
Tokenbr
Token ID1671
Feature activation-1.93e+00

Pos logit contributions
Ben+0.104
 will+0.101
 might+0.099
 vet+0.098
avi+0.098
Neg logit contributions
lic-0.057
ount-0.045
taker-0.042
imm-0.041
bel-0.038
Tokenas
Token ID292
Feature activation-2.72e+00

Pos logit contributions
Ben+0.052
 Rover+0.050
 might+0.050
Le+0.048
 vet+0.048
Neg logit contributions
bel-0.041
imm-0.040
lic-0.039
ount-0.036
rol-0.035
Token puts
Token ID7584
Feature activation+8.00e-02

Pos logit contributions
Ben+0.017
Le+0.017
avi+0.017
 might+0.016
oxy+0.015
Neg logit contributions
bel-0.025
rated-0.025
lled-0.025
lic-0.025
 mant-0.025
Token his
Token ID465
Feature activation+1.89e-01

Pos logit contributions
bel+0.002
imm+0.002
rated+0.002
lic+0.002
 mant+0.002
Neg logit contributions
Ben-0.002
 might-0.001
Le-0.001
avi-0.001
 Rex-0.001
Token ball
Token ID2613
Feature activation-8.96e-01

Pos logit contributions
bel+0.005
 mant+0.005
rated+0.005
imm+0.005
shadow+0.005
Neg logit contributions
Ben-0.003
avi-0.003
Le-0.003
 might-0.003
 will-0.003
Token on
Token ID319
Feature activation+6.21e-01

Pos logit contributions
Ben+0.021
 might+0.020
Le+0.020
avi+0.019
 Rex+0.019
Neg logit contributions
bel-0.021
imm-0.021
rated-0.020
lic-0.020
 mant-0.020
Token the
Token ID262
Feature activation-1.94e-01

Pos logit contributions
 Sus+0.012
 handing+0.011
 mant+0.011
bel+0.011
bul+0.010
Neg logit contributions
 might-0.016
ars-0.016
avi-0.016
 will-0.015
Le-0.015
Token grass
Token ID8701
Feature activation-2.51e+00

Pos logit contributions
Ben+0.004
avi+0.004
Le+0.004
 might+0.004
 will+0.003
Neg logit contributions
 mant-0.005
bel-0.005
rated-0.005
imm-0.005
bul-0.005
Token and
Token ID290
Feature activation-2.14e-01

Pos logit contributions
Ben+0.045
avi+0.044
Le+0.042
 might+0.040
 Remy+0.038
Neg logit contributions
bel-0.073
spe-0.070
bul-0.069
imm-0.069
rated-0.068
Token opens
Token ID9808
Feature activation+4.29e-01

Pos logit contributions
Ben+0.004
Le+0.004
avi+0.004
 might+0.004
ars+0.004
Neg logit contributions
bel-0.006
 handing-0.005
 mant-0.005
rated-0.005
lled-0.005
Token his
Token ID465
Feature activation-9.22e-01

Pos logit contributions
bel+0.012
imm+0.011
 mant+0.011
rated+0.011
bul+0.011
Neg logit contributions
Ben-0.008
 might-0.008
Le-0.008
avi-0.008
 will-0.007
Token sandwich
Token ID20433
Feature activation-1.88e+00

Pos logit contributions
Ben+0.017
Le+0.016
avi+0.016
 might+0.015
 Remy+0.013
Neg logit contributions
 mant-0.027
bel-0.026
imm-0.025
bul-0.025
rated-0.025
Token.
Token ID13
Feature activation-3.78e-01

Pos logit contributions
avi+0.038
Ben+0.038
Le+0.037
 might+0.035
 Remy+0.033
Neg logit contributions
bel-0.048
imm-0.047
 mant-0.047
lic-0.047
taker-0.046
Token.
Token ID13
Feature activation-5.81e-02

Pos logit contributions
Le+0.047
Ben+0.045
avi+0.044
 might+0.041
 Rex+0.040
Neg logit contributions
rated-0.048
imm-0.047
taker-0.047
lic-0.046
bul-0.045
Token I
Token ID314
Feature activation-2.20e-01

Pos logit contributions
Ben+0.001
avi+0.001
Le+0.001
 might+0.001
 will+0.001
Neg logit contributions
 mant-0.002
bel-0.002
taker-0.001
 Sus-0.001
bul-0.001
Token forgive
Token ID20927
Feature activation-7.56e-01

Pos logit contributions
Le+0.004
Ben+0.004
avi+0.004
 Leo+0.003
 might+0.003
Neg logit contributions
bel-0.006
rated-0.006
imm-0.006
lic-0.006
lled-0.006
Token you
Token ID345
Feature activation+8.32e-01

Pos logit contributions
ars+0.014
avi+0.013
 might+0.013
 vet+0.012
ira+0.012
Neg logit contributions
 handing-0.019
what-0.018
 Sus-0.018
 before-0.018
why-0.018
Token.
Token ID13
Feature activation-3.13e-02

Pos logit contributions
bel+0.021
rated+0.021
taker+0.021
 mant+0.021
shadow+0.021
Neg logit contributions
Ben-0.017
Le-0.017
avi-0.016
 might-0.016
 Remy-0.015
Token Do
Token ID2141
Feature activation-3.12e+00

Pos logit contributions
avi+0.001
Ben+0.001
Le+0.001
 might+0.001
 cones+0.001
Neg logit contributions
 mant-0.001
 Sus-0.001
taker-0.001
bul-0.001
bel-0.001
Token you
Token ID345
Feature activation+1.47e+00

Pos logit contributions
avi+0.048
 vet+0.043
Le+0.042
tail+0.042
 cones+0.042
Neg logit contributions
bel-0.084
 happening-0.084
hr-0.084
shadow-0.083
 Sus-0.082
Token want
Token ID765
Feature activation-7.64e-01

Pos logit contributions
bel+0.044
imm+0.043
rated+0.043
 mant+0.042
lic+0.042
Neg logit contributions
Ben-0.025
 might-0.024
Le-0.024
avi-0.023
 Rex-0.022
Token to
Token ID284
Feature activation+1.87e+00

Pos logit contributions
avi+0.019
Ben+0.018
Le+0.018
 might+0.017
ars+0.017
Neg logit contributions
 Sus-0.016
bul-0.016
bel-0.016
 mant-0.016
 handing-0.015
Token play
Token ID711
Feature activation+9.60e-01

Pos logit contributions
bel+0.050
 Sus+0.049
 mant+0.048
bul+0.048
hr+0.047
Neg logit contributions
Le-0.035
avi-0.035
Ben-0.034
 might-0.033
ars-0.031
Token with
Token ID351
Feature activation-1.38e-01

Pos logit contributions
bel+0.022
rated+0.021
imm+0.021
 mant+0.021
lic+0.021
Neg logit contributions
Ben-0.023
 might-0.023
Le-0.022
avi-0.022
 Rex-0.021
Token hole
Token ID7604
Feature activation+2.27e-01

Pos logit contributions
avi+0.010
Ben+0.010
Le+0.010
 might+0.009
 Leo+0.009
Neg logit contributions
 mant-0.012
bul-0.011
 causing-0.010
rated-0.010
taker-0.010
Token.
Token ID13
Feature activation-3.76e-01

Pos logit contributions
bel+0.007
imm+0.007
rated+0.007
 mant+0.007
bul+0.007
Neg logit contributions
Ben-0.004
Le-0.004
 might-0.004
avi-0.004
 Rex-0.003
Token They
Token ID1119
Feature activation-6.47e-01

Pos logit contributions
avi+0.006
Le+0.006
 might+0.006
Ben+0.005
itch+0.005
Neg logit contributions
bel-0.011
 Sus-0.011
lic-0.011
 mant-0.011
ounded-0.010
Token use
Token ID779
Feature activation-1.03e+00

Pos logit contributions
Le+0.016
emy+0.015
Ben+0.015
 Rover+0.015
 Remy+0.015
Neg logit contributions
 mant-0.014
bel-0.014
taker-0.013
bul-0.013
lled-0.013
Token some
Token ID617
Feature activation-1.06e+00

Pos logit contributions
Ben+0.025
avi+0.025
Le+0.024
oxy+0.022
ars+0.022
Neg logit contributions
 mant-0.023
bul-0.023
lled-0.023
 Sus-0.023
bel-0.022
Token grass
Token ID8701
Feature activation-2.96e+00

Pos logit contributions
Ben+0.021
avi+0.021
Le+0.020
 Rex+0.019
 Rover+0.019
Neg logit contributions
bul-0.029
bel-0.028
 mant-0.028
imm-0.027
rated-0.027
Token to
Token ID284
Feature activation-6.87e-01

Pos logit contributions
Ben+0.063
Le+0.059
avi+0.058
 might+0.054
 Rover+0.053
Neg logit contributions
bel-0.079
bul-0.076
imm-0.075
spe-0.074
mill-0.073
Token tie
Token ID9839
Feature activation-1.04e+00

Pos logit contributions
Ben+0.017
avi+0.017
Le+0.017
 Rover+0.016
itch+0.016
Neg logit contributions
 mant-0.015
bel-0.015
bul-0.014
imm-0.013
 Sus-0.013
Token it
Token ID340
Feature activation-2.33e-01

Pos logit contributions
Ben+0.023
avi+0.023
Le+0.022
oxy+0.021
 might+0.020
Neg logit contributions
bul-0.025
bel-0.025
lled-0.024
 Sus-0.024
 mant-0.024
Token.
Token ID13
Feature activation-5.69e-01

Pos logit contributions
Ben+0.005
 might+0.005
Le+0.005
avi+0.005
 Rex+0.004
Neg logit contributions
bel-0.006
imm-0.006
rated-0.006
 mant-0.006
bul-0.006
Token They
Token ID1119
Feature activation-7.80e-01

Pos logit contributions
avi+0.009
Le+0.009
 might+0.009
itch+0.008
Ben+0.008
Neg logit contributions
bel-0.017
 Sus-0.016
lic-0.016
 mant-0.016
ount-0.015
Token
Token ID198
Feature activation+2.38e-01

Pos logit contributions
 Sus+0.018
lic+0.018
bel+0.017
taker+0.017
 mant+0.017
Neg logit contributions
avi-0.011
Ben-0.011
 might-0.011
Le-0.011
 will-0.010
Token
Token ID198
Feature activation+3.05e-02

Pos logit contributions
lic+0.007
eway+0.007
 Sus+0.007
shadow+0.007
Sus+0.007
Neg logit contributions
 might-0.004
avi-0.004
 cones-0.004
 will-0.004
tail-0.004
TokenBut
Token ID1537
Feature activation+9.89e-01

Pos logit contributions
shadow+0.001
bel+0.001
lic+0.001
Sus+0.001
spe+0.001
Neg logit contributions
 might-0.001
avi-0.001
 will-0.001
 cones-0.001
 Leo-0.001
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
bel+0.027
imm+0.027
rated+0.027
 mant+0.026
bul+0.026
Neg logit contributions
Ben-0.019
 might-0.018
Le-0.018
avi-0.017
 will-0.017
Token pony
Token ID26902
Feature activation+1.72e-01

Pos logit contributions
Ben+0.004
Le+0.004
 might+0.004
avi+0.004
 Rex+0.003
Neg logit contributions
bel-0.005
rated-0.005
 mant-0.005
bul-0.005
spe-0.005
Token loved
Token ID6151
Feature activation-2.76e+00

Pos logit contributions
imm+0.004
rated+0.004
spe+0.004
bel+0.004
lled+0.004
Neg logit contributions
Ben-0.004
avi-0.004
Le-0.004
itch-0.004
 Rex-0.004
Token it
Token ID340
Feature activation+7.82e-01

Pos logit contributions
Ben+0.054
avi+0.051
Le+0.050
 might+0.049
 will+0.047
Neg logit contributions
bel-0.075
bul-0.072
imm-0.072
lic-0.072
 Sus-0.071
Token!
Token ID0
Feature activation+4.38e-01

Pos logit contributions
bel+0.022
imm+0.021
lic+0.021
rated+0.021
 mant+0.021
Neg logit contributions
 might-0.015
Ben-0.015
Le-0.014
 will-0.014
avi-0.013
Token It
Token ID632
Feature activation-3.20e-01

Pos logit contributions
lic+0.013
bel+0.013
 Sus+0.013
taker+0.013
imm+0.013
Neg logit contributions
avi-0.007
Ben-0.007
Le-0.007
 might-0.007
 will-0.006
Token ne
Token ID497
Feature activation+1.16e-01

Pos logit contributions
 might+0.006
Ben+0.006
Le+0.006
 will+0.005
 Rex+0.005
Neg logit contributions
bel-0.009
 mant-0.009
imm-0.009
rated-0.009
lic-0.009
Tokenighed
Token ID12570
Feature activation-3.97e-01

Pos logit contributions
imm+0.003
bel+0.003
rated+0.003
lic+0.003
spe+0.003
Neg logit contributions
Ben-0.002
 might-0.002
Le-0.002
avi-0.002
 Rex-0.002