TOP ACTIVATIONS
MAX = 4.290

friends . He would kick the ball very hard and try
and strong and could kick the ball very far . She
called soccer . They kick the ball with their feet and
the story : Respect the decisions of others and be open
. She would keep throwing the hoop as far as she
. One day as Meg was playing outside ,
play together . She shook the pitch and it flew in
<|endoftext|> The two players covered each other in the game .
, she saw that all the soccer balls were gone .
around and around , spinning the hoop until she was dizz
were two teams playing against each other . One team had
friends . They would kick the ball around and try to
players . They can kick the ball far and fast .
it . She kept rolling the hoop and got better and
. Lucy liked to throw the hoop and watch Sky fly
day the boy would hide the football so he could find
can run fast and kick the ball really far ."
, " I admire your soccer skills , Lily . You
excited . She wrapped the string around her body and held
two goals . They kick the ball and try to score

MIN ACTIVATIONS
MIN = -4.433

a very special toy for L ott ie . It was
One day , L ott ie 's mom went
a little pig gy named Bub bles . Bub bles loved
was a little boy named T ane . T ane liked
a pink pig gy named P ipp a . P ipp
a big bird named O str ich wanted to swim in
was a little girl named Sab rina . She had a
to come by and asked N ell ie if he could
scared , but she asked the gen ie to take her
to serve you ," said the gen ie .
was a cool day when Ma is ie and Benny were
fall . One day , L izz ie noticed something was
a small fire near the snow man . They see the
?" The voice belonged to G iz mo 's friend ,
girl called N ell . N ell was very special because
was a little spider named Sp ide y . Sp ide
hungry . One morning , N ell ie was very excited
mom my if she could have one of the eggs .
was a p anda named P andy . P andy was
there was a spider named Sp ide y . Sp ide

INTERVAL 2.109 - 4.290
CONTAINS 0.250%

Joe . The dog was always happy to help Farmer Joe
want to be independent and lead too ." Max
They held their k ites and their hands . They
recording . They threw the balls up and down , laughing
walking through the park one sunny day . He saw a

INTERVAL -0.071 - 2.109
CONTAINS 53.763%

. That is very creative . I am proud of you
: be wise . Bob thought of a plan
They are careful not to hurt them . One
the curtains , and in the clos ets . He could
cup . He spills the coffee on the table and on

INTERVAL -2.252 - -0.071
CONTAINS 45.818%

and ran around playing catch until it was time for G
she suddenly heard a sound that made her stop . It
, Mill ie 's mum put the little girl down on
. It does not like to be in Sue 's pocket
wanted to show it to her mom and dad . She

INTERVAL -4.433 - -2.252
CONTAINS 0.169%

cold st eth oscope and a loud voice . Lily was
" Can I have some ice , please ?"
the forest and decided to climb it . He grabbed a
was a little bear named Ted . Ted loved to go
nest and soon the rain stopped . Jack and his mom
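
NOTE ON THE INTERVALS
The four interval boundaries above are consistent with splitting the range from MIN (-4.433) to MAX (4.290) into four equal-width bins, and the four CONTAINS percentages sum to 100%. The sketch below checks that reading. It is an illustration only: the helper names and the toy activation list are hypothetical, and equal-width binning is an assumption inferred from the printed numbers, not a documented property of the tool that produced this dump.

```python
from bisect import bisect_right


def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 edges splitting [lo, hi] into equal-width bins."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def bin_shares(values, edges):
    """Percentage of values in each bin; out-of-range values are clamped."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        idx = bisect_right(edges, v) - 1
        idx = max(0, min(idx, len(counts) - 1))  # keep lo and hi inside the end bins
        counts[idx] += 1
    return [100.0 * c / len(values) for c in counts]


# MIN and MAX as printed in this dump.
edges = equal_width_edges(-4.433, 4.290, 4)

# CONTAINS values as printed, listed from the top interval down.
reported_shares = [0.250, 53.763, 45.818, 0.169]

# Hypothetical activations, used only to exercise bin_shares.
toy_activations = [-4.433, -1.0, -0.5, 0.3, 1.2, 4.290]
toy_shares = bin_shares(toy_activations, edges)
```

The computed inner edges agree with the printed boundaries (-2.252, -0.071, 2.109) to within rounding, which is why almost all tokens fall in the two middle intervals and only a fraction of a percent reach either extreme.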

TOKEN DETAILS

Token friends
Token ID2460
Feature activation+8.69e-01

Pos logit contributions
liness+0.072
 directions+0.068
 zone+0.066
 receiver+0.062
shirt+0.062
Neg logit contributions
add-0.062
 int-0.048
opp-0.047
 once-0.046
utter-0.044
Token.
Token ID13
Feature activation-9.90e-01

Pos logit contributions
 directions+0.026
liness+0.026
 mats+0.025
 lap+0.024
 seek+0.024
Neg logit contributions
add-0.023
 int-0.017
opp-0.016
utt-0.016
 rum-0.015
Token He
Token ID679
Feature activation+1.79e+00

Pos logit contributions
add+0.027
opp+0.019
 int+0.019
 ven+0.019
utt+0.019
Neg logit contributions
 directions-0.029
liness-0.029
 lap-0.028
 mats-0.028
 zone-0.027
Token would
Token ID561
Feature activation+2.46e+00

Pos logit contributions
liness+0.050
 directions+0.049
 zone+0.047
 lap+0.045
 yourself+0.045
Neg logit contributions
add-0.050
 int-0.039
 rum-0.037
opp-0.037
utt-0.037
Token kick
Token ID4829
Feature activation+2.31e+00

Pos logit contributions
liness+0.064
 directions+0.061
 zone+0.061
 receiver+0.057
 newspapers+0.054
Neg logit contributions
add-0.072
 int-0.063
 rum-0.059
utt-0.054
 attent-0.054
Token the
Token ID262
Feature activation+4.29e+00

Pos logit contributions
liness+0.060
 zone+0.058
 directions+0.057
 lap+0.056
 seek+0.053
Neg logit contributions
add-0.067
 int-0.056
 rum-0.051
 attent-0.049
opp-0.049
Token ball
Token ID2613
Feature activation+2.40e+00

Pos logit contributions
liness+0.118
rd+0.106
 directions+0.104
Kim+0.102
 zone+0.102
Neg logit contributions
add-0.106
 int-0.091
 once-0.087
 rum-0.080
 sunset-0.077
Token very
Token ID845
Feature activation+2.26e+00

Pos logit contributions
liness+0.070
 directions+0.068
 zone+0.064
 lap+0.063
 mats+0.062
Neg logit contributions
add-0.065
 int-0.050
utt-0.048
opp-0.046
 rum-0.045
Token hard
Token ID1327
Feature activation+6.00e-01

Pos logit contributions
 directions+0.046
 mats+0.044
liness+0.043
 lap+0.042
 seek+0.041
Neg logit contributions
add-0.081
utt-0.063
opp-0.063
 int-0.063
 ven-0.061
Token and
Token ID290
Feature activation+8.56e-01

Pos logit contributions
 directions+0.013
liness+0.013
 mats+0.012
 lap+0.012
 zone+0.012
Neg logit contributions
add-0.021
 int-0.016
opp-0.016
utt-0.016
 ven-0.016
Token try
Token ID1949
Feature activation+3.57e-01

Pos logit contributions
liness+0.022
 zone+0.021
 directions+0.021
 newspapers+0.020
 lap+0.019
Neg logit contributions
add-0.023
 int-0.021
 rum-0.020
utt-0.019
 erupt-0.019
Token and
Token ID290
Feature activation+2.24e+00

Pos logit contributions
 directions+0.032
liness+0.031
 lap+0.029
 zone+0.028
 seek+0.027
Neg logit contributions
add-0.044
 int-0.034
opp-0.034
utt-0.033
 rum-0.033
Token strong
Token ID1913
Feature activation+9.70e-01

Pos logit contributions
liness+0.077
 directions+0.070
 zone+0.070
 lap+0.066
 newspapers+0.063
Neg logit contributions
add-0.043
 only-0.037
 rum-0.036
 int-0.035
 ven-0.035
Token and
Token ID290
Feature activation+2.22e+00

Pos logit contributions
 directions+0.026
liness+0.025
 lap+0.024
 mats+0.024
 zone+0.024
Neg logit contributions
add-0.028
 int-0.021
opp-0.021
utt-0.020
 rum-0.020
Token could
Token ID714
Feature activation+2.00e+00

Pos logit contributions
liness+0.068
 directions+0.064
 zone+0.061
 lap+0.059
 sandbox+0.057
Neg logit contributions
add-0.053
 ven-0.043
 int-0.042
 rum-0.042
ores-0.041
Token kick
Token ID4829
Feature activation+2.30e+00

Pos logit contributions
liness+0.042
 directions+0.041
 zone+0.038
 mats+0.037
 lap+0.037
Neg logit contributions
add-0.071
 int-0.056
utt-0.055
opp-0.055
 rum-0.054
Token the
Token ID262
Feature activation+4.22e+00

Pos logit contributions
liness+0.059
 lap+0.057
 zone+0.057
 directions+0.056
 receiver+0.054
Neg logit contributions
add-0.065
 int-0.053
 rum-0.049
opp-0.047
 attent-0.047
Token ball
Token ID2613
Feature activation+1.62e+00

Pos logit contributions
liness+0.101
 directions+0.088
rd+0.086
Kim+0.086
 zone+0.085
Neg logit contributions
add-0.123
 int-0.106
 once-0.096
 rum-0.092
der-0.091
Token very
Token ID845
Feature activation+2.10e+00

Pos logit contributions
liness+0.045
 directions+0.045
 lap+0.043
 mats+0.043
 zone+0.042
Neg logit contributions
add-0.045
 int-0.034
utt-0.033
opp-0.033
 rum-0.031
Token far
Token ID1290
Feature activation+5.40e-01

Pos logit contributions
 directions+0.049
liness+0.047
 mats+0.046
 lap+0.045
 seek+0.044
Neg logit contributions
add-0.070
utt-0.053
 int-0.053
opp-0.052
 ven-0.051
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
liness+0.015
 directions+0.015
 lap+0.015
 sandbox+0.015
 mats+0.014
Neg logit contributions
add-0.014
 int-0.011
opp-0.010
 rum-0.010
 attent-0.010
Token She
Token ID1375
Feature activation+1.36e+00

Pos logit contributions
add+0.007
utt+0.005
 ven+0.005
opp+0.005
 int+0.005
Neg logit contributions
 mats-0.006
 lap-0.006
liness-0.006
 directions-0.006
 sandbox-0.006
Token called
Token ID1444
Feature activation+1.03e+00

Pos logit contributions
 directions+0.084
liness+0.084
 receiver+0.076
 lap+0.075
 zone+0.073
Neg logit contributions
add-0.072
 int-0.056
 once-0.055
 ven-0.054
 attent-0.051
Token soccer
Token ID11783
Feature activation+2.28e+00

Pos logit contributions
 directions+0.028
liness+0.027
 lap+0.025
 zone+0.025
 sandbox+0.025
Neg logit contributions
add-0.030
 int-0.023
utt-0.022
opp-0.022
 rum-0.021
Token.
Token ID13
Feature activation+6.24e-01

Pos logit contributions
liness+0.059
 directions+0.058
 lap+0.053
 receiver+0.052
 zone+0.051
Neg logit contributions
add-0.065
 int-0.051
opp-0.048
utt-0.047
 ven-0.047
Token They
Token ID1119
Feature activation+1.01e+00

Pos logit contributions
 directions+0.015
liness+0.014
 mats+0.014
 lap+0.014
 zone+0.013
Neg logit contributions
add-0.020
 int-0.015
opp-0.015
utt-0.015
 ven-0.015
Token kick
Token ID4829
Feature activation+2.16e+00

Pos logit contributions
liness+0.024
 directions+0.024
 zone+0.022
 lap+0.022
 mats+0.021
Neg logit contributions
add-0.033
 int-0.026
 rum-0.025
utt-0.025
opp-0.025
Token the
Token ID262
Feature activation+4.10e+00

Pos logit contributions
liness+0.062
 zone+0.059
 seek+0.058
 directions+0.058
 receiver+0.057
Neg logit contributions
add-0.052
 int-0.046
 rum-0.041
 once-0.040
 attent-0.039
Token ball
Token ID2613
Feature activation+2.04e+00

Pos logit contributions
liness+0.100
 directions+0.089
 zone+0.088
rd+0.086
Kim+0.085
Neg logit contributions
add-0.114
 int-0.099
 rum-0.090
 once-0.089
 attent-0.085
Token with
Token ID351
Feature activation+2.19e+00

Pos logit contributions
liness+0.054
 directions+0.053
 zone+0.049
 lap+0.049
 mats+0.049
Neg logit contributions
add-0.061
 int-0.047
utt-0.046
opp-0.045
 rum-0.043
Token their
Token ID511
Feature activation+2.59e+00

Pos logit contributions
liness+0.065
 zone+0.062
 directions+0.062
 lap+0.061
 seek+0.060
Neg logit contributions
add-0.059
 int-0.043
opp-0.041
utt-0.040
 rum-0.040
Token feet
Token ID3625
Feature activation+1.70e+00

Pos logit contributions
liness+0.059
 zone+0.053
 directions+0.050
 receiver+0.048
 yourself+0.048
Neg logit contributions
add-0.085
 int-0.069
opp-0.066
 rum-0.065
utt-0.064
Token and
Token ID290
Feature activation+1.27e+00

Pos logit contributions
liness+0.049
 directions+0.047
 zone+0.045
 mats+0.045
 seek+0.044
Neg logit contributions
add-0.046
 int-0.037
utt-0.034
opp-0.033
 rum-0.033
Token the
Token ID262
Feature activation+8.20e-01

Pos logit contributions
liness+0.008
 mats+0.008
 directions+0.008
 sandbox+0.008
 lap+0.008
Neg logit contributions
add-0.011
 int-0.009
opp-0.009
utt-0.009
 ven-0.008
Token story
Token ID1621
Feature activation+8.94e-01

Pos logit contributions
 directions+0.019
liness+0.019
 mats+0.017
 lap+0.017
 zone+0.017
Neg logit contributions
add-0.027
opp-0.021
utt-0.021
 int-0.021
 ven-0.020
Token:
Token ID25
Feature activation+4.96e-01

Pos logit contributions
 directions+0.027
liness+0.027
 mats+0.026
 lap+0.026
 seek+0.025
Neg logit contributions
add-0.023
 int-0.016
utt-0.016
opp-0.016
 ven-0.015
Token Respect
Token ID42153
Feature activation+2.59e+00

Pos logit contributions
liness+0.011
 directions+0.011
 lap+0.010
 yourself+0.010
 zone+0.010
Neg logit contributions
add-0.017
opp-0.013
utt-0.013
 ven-0.013
 int-0.013
Token the
Token ID262
Feature activation+1.55e+00

Pos logit contributions
 sandbox+0.058
 lap+0.058
 mats+0.058
 seek+0.057
 Seek+0.054
Neg logit contributions
add-0.076
opp-0.061
 ven-0.061
 int-0.061
 attent-0.058
Token decisions
Token ID5370
Feature activation+4.09e+00

Pos logit contributions
liness+0.033
 directions+0.032
 mats+0.032
 lap+0.032
 zone+0.030
Neg logit contributions
add-0.052
 int-0.041
opp-0.040
 rum-0.039
utt-0.039
Token of
Token ID286
Feature activation+5.46e-01

Pos logit contributions
 directions+0.110
liness+0.109
 sandbox+0.109
 lap+0.108
 mats+0.106
Neg logit contributions
add-0.096
 int-0.088
 once-0.077
 rum-0.077
der-0.077
Token others
Token ID1854
Feature activation+2.90e+00

Pos logit contributions
liness+0.011
 directions+0.010
 mats+0.010
 lap+0.010
 zone+0.010
Neg logit contributions
add-0.020
 int-0.016
opp-0.015
 ven-0.015
utt-0.015
Token and
Token ID290
Feature activation+3.96e-01

Pos logit contributions
 directions+0.068
 sandbox+0.067
 lap+0.067
liness+0.067
 mats+0.067
Neg logit contributions
add-0.082
 int-0.071
opp-0.068
 rum-0.065
 attent-0.064
Token be
Token ID307
Feature activation+5.12e-01

Pos logit contributions
liness+0.009
 mats+0.009
 lap+0.009
 directions+0.008
 zone+0.008
Neg logit contributions
add-0.012
 int-0.010
 ven-0.010
 rum-0.010
opp-0.010
Token open
Token ID1280
Feature activation+1.38e+00

Pos logit contributions
 mats+0.013
liness+0.011
 lap+0.011
 zone+0.011
 mat+0.011
Neg logit contributions
add-0.016
 int-0.013
aze-0.013
 ven-0.012
opp-0.012
Token.
Token ID13
Feature activation-5.82e-01

Pos logit contributions
add+0.000
 int+0.000
 once+0.000
 ven+0.000
 attent+0.000
Neg logit contributions
 directions-0.001
liness-0.001
 zone-0.000
 lap-0.000
 pads-0.000
Token She
Token ID1375
Feature activation+9.84e-02

Pos logit contributions
add+0.020
 int+0.015
utt+0.015
opp+0.014
 ven+0.014
Neg logit contributions
 lap-0.014
liness-0.013
 directions-0.013
 mats-0.013
 zone-0.013
Token would
Token ID561
Feature activation+8.59e-01

Pos logit contributions
 directions+0.002
liness+0.002
 zone+0.002
 lap+0.002
 mats+0.002
Neg logit contributions
add-0.003
 int-0.002
opp-0.002
utt-0.002
 rum-0.002
Token keep
Token ID1394
Feature activation+1.66e+00

Pos logit contributions
liness+0.019
 directions+0.019
 zone+0.018
 mats+0.017
 lap+0.017
Neg logit contributions
add-0.029
 int-0.023
 rum-0.022
utt-0.022
opp-0.022
Token throwing
Token ID9644
Feature activation+2.67e+00

Pos logit contributions
liness+0.046
 zone+0.045
 directions+0.045
 lap+0.044
 mats+0.043
Neg logit contributions
add-0.045
 int-0.036
opp-0.032
utt-0.032
 rum-0.031
Token the
Token ID262
Feature activation+4.08e+00

Pos logit contributions
 zone+0.080
liness+0.079
 lap+0.078
rd+0.076
 Have+0.074
Neg logit contributions
add-0.059
 int-0.049
 once-0.042
 attent-0.041
 rum-0.039
Token hoop
Token ID44792
Feature activation+2.14e+00

Pos logit contributions
rum+0.083
Josh+0.083
rd+0.081
 lap+0.079
William+0.078
Neg logit contributions
add-0.115
 int-0.092
 once-0.089
 work-0.088
 attent-0.086
Token as
Token ID355
Feature activation+1.04e+00

Pos logit contributions
liness+0.060
 directions+0.059
 zone+0.056
 lap+0.054
 mats+0.054
Neg logit contributions
add-0.058
 int-0.047
utt-0.043
opp-0.042
 rum-0.041
Token far
Token ID1290
Feature activation-6.57e-01

Pos logit contributions
liness+0.027
 directions+0.027
 lap+0.025
 zone+0.025
 mats+0.024
Neg logit contributions
add-0.032
 int-0.024
opp-0.024
utt-0.024
 ven-0.023
Token as
Token ID355
Feature activation+9.12e-01

Pos logit contributions
add+0.021
opp+0.015
utt+0.015
 int+0.015
 ven+0.015
Neg logit contributions
 directions-0.017
liness-0.016
 mats-0.016
 lap-0.015
 zone-0.015
Token she
Token ID673
Feature activation-3.99e-01

Pos logit contributions
liness+0.018
 directions+0.018
 lap+0.017
 mats+0.016
 zone+0.016
Neg logit contributions
add-0.033
 int-0.026
utt-0.026
opp-0.026
 ven-0.025
Token.
Token ID13
Feature activation+2.04e+00

Pos logit contributions
 directions+0.016
liness+0.016
 mats+0.015
 lap+0.015
 zone+0.015
Neg logit contributions
add-0.020
opp-0.015
 int-0.015
utt-0.015
 ven-0.014
Token
Token ID198
Feature activation+5.68e-01

Pos logit contributions
liness+0.066
 directions+0.065
 zone+0.064
 seek+0.063
 Seek+0.063
Neg logit contributions
add-0.046
 int-0.035
utt-0.031
 attent-0.030
opp-0.028
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
 directions+0.014
liness+0.014
 lap+0.014
 mats+0.013
 seek+0.013
Neg logit contributions
add-0.017
opp-0.013
utt-0.013
 int-0.013
 rum-0.013
TokenOne
Token ID3198
Feature activation+8.86e-01

Pos logit contributions
liness+0.049
 zone+0.047
 directions+0.046
 lap+0.045
 mats+0.045
Neg logit contributions
add-0.031
 int-0.023
 attent-0.019
utt-0.019
aze-0.018
Token day
Token ID1110
Feature activation+3.07e+00

Pos logit contributions
liness+0.022
 directions+0.021
 mats+0.020
 lap+0.019
 zone+0.019
Neg logit contributions
add-0.029
 int-0.022
opp-0.022
utt-0.021
 rum-0.021
Token as
Token ID355
Feature activation+4.04e+00

Pos logit contributions
liness+0.095
 directions+0.091
 lap+0.089
shirt+0.086
 keyboard+0.086
Neg logit contributions
add-0.078
 int-0.056
 once-0.053
aze-0.053
opp-0.053
Token Meg
Token ID8336
Feature activation-3.25e-01

Pos logit contributions
liness+0.133
 lap+0.120
 zone+0.118
shirt+0.114
robat+0.114
Neg logit contributions
add-0.099
 int-0.077
ores-0.074
 ven-0.071
 once-0.070
Token was
Token ID373
Feature activation+1.64e-01

Pos logit contributions
add+0.011
 int+0.009
opp+0.009
utt+0.009
 rum+0.008
Neg logit contributions
 directions-0.007
liness-0.007
 mats-0.007
 lap-0.007
 zone-0.006
Token playing
Token ID2712
Feature activation+1.79e-01

Pos logit contributions
 mats+0.003
 lap+0.003
 directions+0.003
 tag+0.002
liness+0.002
Neg logit contributions
add-0.006
opp-0.005
 int-0.005
utt-0.005
 ven-0.005
Token outside
Token ID2354
Feature activation+2.94e-01

Pos logit contributions
 directions+0.006
liness+0.006
 pads+0.006
 zone+0.005
 lap+0.005
Neg logit contributions
add-0.004
 once-0.003
 ven-0.003
 rum-0.003
 int-0.003
Token,
Token ID11
Feature activation+1.55e+00

Pos logit contributions
 directions+0.007
liness+0.007
 mats+0.007
 Seek+0.007
 keyboard+0.007
Neg logit contributions
add-0.009
 rum-0.007
 int-0.007
 once-0.007
opp-0.006
Token play
Token ID711
Feature activation-4.76e-02

Pos logit contributions
liness+0.014
 directions+0.014
 zone+0.013
 mats+0.013
 lap+0.013
Neg logit contributions
add-0.021
 int-0.017
opp-0.016
utt-0.016
 rum-0.016
Token together
Token ID1978
Feature activation+1.59e-01

Pos logit contributions
add+0.001
 once+0.001
 int+0.001
 happily+0.001
 attent+0.001
Neg logit contributions
liness-0.002
 directions-0.002
 receiver-0.002
 zone-0.002
 lap-0.001
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
 directions+0.004
liness+0.004
 mats+0.004
 lap+0.004
 zone+0.004
Neg logit contributions
add-0.005
utt-0.004
opp-0.004
 int-0.004
 ven-0.003
Token She
Token ID1375
Feature activation+9.19e-01

Pos logit contributions
 directions+0.023
liness+0.023
 mats+0.022
 lap+0.022
 zone+0.021
Neg logit contributions
add-0.038
opp-0.030
utt-0.029
 int-0.028
 ven-0.028
Token shook
Token ID14682
Feature activation+1.79e+00

Pos logit contributions
liness+0.021
 directions+0.021
 zone+0.020
 lap+0.020
 mats+0.019
Neg logit contributions
add-0.031
 int-0.024
utt-0.023
opp-0.023
 rum-0.022
Token the
Token ID262
Feature activation+3.97e+00

Pos logit contributions
 zone+0.051
 Seek+0.050
liness+0.049
 seek+0.049
 lap+0.048
Neg logit contributions
add-0.042
 int-0.042
 rum-0.037
 attent-0.035
 once-0.034
Token pitch
Token ID7078
Feature activation+3.31e+00

Pos logit contributions
liness+0.098
 zone+0.090
shirt+0.089
robat+0.088
rd+0.086
Neg logit contributions
add-0.116
 int-0.092
 attent-0.087
 once-0.084
 ven-0.084
Token and
Token ID290
Feature activation+4.48e-01

Pos logit contributions
liness+0.079
 directions+0.078
 zone+0.076
 Seek+0.075
 yourself+0.075
Neg logit contributions
add-0.093
 int-0.083
 rum-0.076
 attent-0.073
 once-0.072
Token it
Token ID340
Feature activation+6.00e-01

Pos logit contributions
liness+0.012
 directions+0.011
 zone+0.011
 lap+0.011
 yourself+0.011
Neg logit contributions
add-0.012
 int-0.010
 rum-0.010
 attent-0.009
utt-0.009
Token flew
Token ID13112
Feature activation+1.61e-01

Pos logit contributions
liness+0.014
 directions+0.014
 lap+0.013
 zone+0.013
 mats+0.013
Neg logit contributions
add-0.019
 int-0.015
utt-0.015
opp-0.014
 rum-0.014
Token in
Token ID287
Feature activation+8.03e-01

Pos logit contributions
liness+0.004
 lap+0.004
 directions+0.004
 zone+0.004
 mats+0.004
Neg logit contributions
add-0.005
 int-0.004
utt-0.004
 rum-0.004
opp-0.004
Token<|endoftext|>
Token ID50256
Feature activation-2.66e-01

Pos logit contributions
add+0.002
 int+0.002
 rum+0.002
opp+0.002
 ven+0.002
Neg logit contributions
 directions-0.002
liness-0.002
 lap-0.002
 zone-0.001
 mats-0.001
TokenThe
Token ID464
Feature activation+3.26e-01

Pos logit contributions
add+0.007
 int+0.006
utt+0.005
opp+0.005
 ven+0.005
Neg logit contributions
liness-0.007
 directions-0.007
 zone-0.007
 mats-0.007
 lap-0.007
Token two
Token ID734
Feature activation+7.12e-01

Pos logit contributions
 directions+0.008
liness+0.007
 mats+0.007
 lap+0.007
 sandbox+0.007
Neg logit contributions
add-0.011
opp-0.009
utt-0.008
 ven-0.008
 int-0.008
Token players
Token ID1938
Feature activation+7.43e-01

Pos logit contributions
 directions+0.016
liness+0.015
 mats+0.014
 lap+0.014
 sandbox+0.014
Neg logit contributions
add-0.025
 int-0.019
utt-0.019
opp-0.019
 ven-0.019
Token covered
Token ID5017
Feature activation+2.07e+00

Pos logit contributions
 directions+0.016
liness+0.016
 mats+0.015
 zone+0.015
 lap+0.014
Neg logit contributions
add-0.026
 int-0.020
opp-0.020
utt-0.020
 ven-0.019
Token each
Token ID1123
Feature activation+3.97e+00

Pos logit contributions
 zone+0.058
 Seek+0.057
 pads+0.057
 lap+0.056
 seek+0.056
Neg logit contributions
add-0.054
 int-0.042
 once-0.040
 attent-0.037
 rum-0.037
Token other
Token ID584
Feature activation+2.06e+00

Pos logit contributions
 zone+0.100
rob+0.097
liness+0.096
 elbows+0.096
robat+0.093
Neg logit contributions
add-0.100
 once-0.084
 int-0.081
 three-0.080
 attent-0.075
Token in
Token ID287
Feature activation+1.67e+00

Pos logit contributions
Found+0.052
liness+0.052
 zone+0.052
 directions+0.052
 sandbox+0.051
Neg logit contributions
add-0.051
 int-0.047
 rum-0.042
opp-0.042
 attent-0.042
Token the
Token ID262
Feature activation+2.28e+00

Pos logit contributions
 zone+0.049
robat+0.047
liness+0.046
rob+0.044
fs+0.043
Neg logit contributions
add-0.038
 rum-0.033
 int-0.032
 twenty-0.031
 sense-0.029
Token game
Token ID983
Feature activation+1.63e+00

Pos logit contributions
liness+0.047
 zone+0.046
fs+0.045
robat+0.044
 directions+0.043
Neg logit contributions
add-0.070
 int-0.060
 rum-0.058
 attent-0.054
 once-0.054
Token.
Token ID13
Feature activation+7.57e-01

Pos logit contributions
liness+0.046
 directions+0.045
 mats+0.044
 lap+0.043
 Seek+0.043
Neg logit contributions
add-0.040
 int-0.031
opp-0.029
utt-0.029
 rum-0.028
Token,
Token ID11
Feature activation+9.53e-02

Pos logit contributions
add+0.013
opp+0.010
 int+0.010
utt+0.010
 rum+0.010
Neg logit contributions
 directions-0.007
liness-0.007
 lap-0.007
 mats-0.007
 zone-0.007
Token she
Token ID673
Feature activation+1.93e-01

Pos logit contributions
 directions+0.002
liness+0.002
 lap+0.002
 zone+0.002
 pads+0.002
Neg logit contributions
add-0.003
 rum-0.002
 int-0.002
opp-0.002
 ven-0.002
Token saw
Token ID2497
Feature activation+7.45e-01

Pos logit contributions
liness+0.005
 directions+0.005
 mats+0.004
 seek+0.004
 lap+0.004
Neg logit contributions
add-0.006
 int-0.005
utt-0.005
opp-0.005
 ven-0.005
Token that
Token ID326
Feature activation+8.82e-01

Pos logit contributions
liness+0.025
 lap+0.024
 directions+0.024
 zone+0.024
 seek+0.023
Neg logit contributions
add-0.017
 int-0.011
 ven-0.011
 rum-0.011
 once-0.010
Token all
Token ID477
Feature activation+1.77e+00

Pos logit contributions
 directions+0.025
liness+0.025
 lap+0.025
 zone+0.023
 mats+0.022
Neg logit contributions
add-0.022
 int-0.018
 once-0.017
 rum-0.017
 ven-0.016
Token the
Token ID262
Feature activation+3.96e+00

Pos logit contributions
 lap+0.058
 keyboard+0.056
 elbows+0.055
 goggles+0.055
 seek+0.055
Neg logit contributions
add-0.032
 rum-0.025
 int-0.024
 once-0.023
 all-0.023
Token soccer
Token ID11783
Feature activation+2.61e+00

Pos logit contributions
liness+0.097
 zone+0.096
 lap+0.091
shirt+0.083
rd+0.083
Neg logit contributions
add-0.114
 once-0.083
 rum-0.081
 sense-0.081
 int-0.080
Token balls
Token ID11333
Feature activation+1.13e+00

Pos logit contributions
 directions+0.063
liness+0.062
 lap+0.060
 yourself+0.055
 receiver+0.054
Neg logit contributions
add-0.073
 once-0.057
 ven-0.057
 rum-0.056
utt-0.055
Token were
Token ID547
Feature activation+1.19e+00

Pos logit contributions
 directions+0.028
liness+0.028
 lap+0.027
 mats+0.026
 zone+0.025
Neg logit contributions
add-0.035
utt-0.026
 rum-0.026
 int-0.026
 erupt-0.024
Token gone
Token ID3750
Feature activation-7.42e-01

Pos logit contributions
liness+0.034
 directions+0.034
 zone+0.032
 lap+0.032
 mats+0.032
Neg logit contributions
add-0.033
 int-0.024
utt-0.024
opp-0.023
 rum-0.023
Token.
Token ID13
Feature activation-1.76e-01

Pos logit contributions
add+0.020
 int+0.015
opp+0.015
utt+0.014
 rum+0.014
Neg logit contributions
 directions-0.021
liness-0.021
 mats-0.020
 lap-0.020
 zone-0.019
Token around
Token ID1088
Feature activation+4.35e-01

Pos logit contributions
add+0.004
 int+0.003
utt+0.003
opp+0.003
 rum+0.003
Neg logit contributions
liness-0.004
 directions-0.004
 lap-0.004
 zone-0.004
 mats-0.004
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
liness+0.010
 directions+0.010
 zone+0.009
 lap+0.009
 mats+0.009
Neg logit contributions
add-0.015
 int-0.012
opp-0.011
utt-0.011
 rum-0.011
Token around
Token ID1088
Feature activation-4.03e-01

Pos logit contributions
liness+0.031
 zone+0.030
 newspapers+0.030
 directions+0.030
 yourself+0.028
Neg logit contributions
add-0.031
 int-0.030
 rum-0.028
 attent-0.026
utt-0.025
Token,
Token ID11
Feature activation+4.25e-01

Pos logit contributions
add+0.014
 int+0.011
opp+0.011
utt+0.011
 rum+0.010
Neg logit contributions
 directions-0.009
liness-0.009
 mats-0.008
 lap-0.008
 zone-0.008
Token spinning
Token ID19493
Feature activation+4.51e-01

Pos logit contributions
 directions+0.011
liness+0.010
 zone+0.010
 newspapers+0.010
 mats+0.010
Neg logit contributions
add-0.012
 int-0.011
 rum-0.010
utt-0.010
 attent-0.010
Token the
Token ID262
Feature activation+3.93e+00

Pos logit contributions
liness+0.012
 zone+0.011
 lap+0.011
 directions+0.011
 receiver+0.011
Neg logit contributions
add-0.013
 int-0.011
 rum-0.010
 attent-0.010
opp-0.009
Token hoop
Token ID44792
Feature activation+1.20e+00

Pos logit contributions
rum+0.080
His+0.074
ability+0.070
trained+0.068
liness+0.068
Neg logit contributions
add-0.109
 int-0.105
 attent-0.097
 once-0.092
 rum-0.090
Token until
Token ID1566
Feature activation-1.30e+00

Pos logit contributions
liness+0.032
 directions+0.032
 zone+0.030
 newspapers+0.030
 lap+0.030
Neg logit contributions
add-0.033
 int-0.027
 rum-0.025
utt-0.025
opp-0.024
Token she
Token ID673
Feature activation-3.69e-01

Pos logit contributions
add+0.056
utt+0.046
opp+0.046
 ven+0.044
omm+0.044
Neg logit contributions
 directions-0.020
 sandbox-0.017
 mats-0.017
 seek-0.017
liness-0.016
Token was
Token ID373
Feature activation+9.59e-01

Pos logit contributions
add+0.014
utt+0.011
 int+0.011
opp+0.011
 ven+0.011
Neg logit contributions
 directions-0.007
liness-0.006
 mats-0.006
 seek-0.006
 sandbox-0.006
Token dizz
Token ID29527
Feature activation-5.64e-02

Pos logit contributions
 directions+0.023
liness+0.022
 mats+0.022
 seek+0.022
 lap+0.022
Neg logit contributions
add-0.031
utt-0.024
opp-0.023
 int-0.023
 ven-0.023
Token were
Token ID547
Feature activation+2.71e+00

Pos logit contributions
 directions+0.024
 lap+0.023
liness+0.023
 mats+0.022
 Seek+0.022
Neg logit contributions
add-0.027
opp-0.021
 rum-0.021
 once-0.020
utt-0.020
Token two
Token ID734
Feature activation+1.97e+00

Pos logit contributions
liness+0.080
 zone+0.077
 directions+0.074
 lap+0.072
ability+0.068
Neg logit contributions
add-0.078
utt-0.056
 int-0.055
opp-0.052
 rum-0.052
Token teams
Token ID3466
Feature activation+1.88e+00

Pos logit contributions
liness+0.049
 zone+0.048
 directions+0.047
 lap+0.046
 seek+0.044
Neg logit contributions
add-0.065
opp-0.045
 rum-0.045
utt-0.044
 int-0.044
Token playing
Token ID2712
Feature activation+1.95e+00

Pos logit contributions
liness+0.051
 directions+0.050
 lap+0.047
 mats+0.046
 yourself+0.045
Neg logit contributions
add-0.054
utt-0.040
 int-0.039
opp-0.039
 rum-0.038
Token against
Token ID1028
Feature activation+1.93e+00

Pos logit contributions
liness+0.044
 directions+0.043
 zone+0.039
 lap+0.039
 sandbox+0.037
Neg logit contributions
add-0.065
 int-0.051
opp-0.049
utt-0.048
 attent-0.048
Token each
Token ID1123
Feature activation+3.92e+00

Pos logit contributions
 directions+0.040
liness+0.039
 mats+0.037
 lap+0.036
 zone+0.036
Neg logit contributions
add-0.070
 int-0.055
utt-0.054
opp-0.054
 ven-0.052
Token other
Token ID584
Feature activation+1.40e+00

Pos logit contributions
liness+0.085
 zone+0.073
 yourself+0.072
 sandbox+0.072
 elbows+0.072
Neg logit contributions
add-0.127
 int-0.103
 once-0.103
 attent-0.094
 happily-0.094
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
liness+0.043
 directions+0.042
Found+0.042
 sandbox+0.041
 seek+0.040
Neg logit contributions
add-0.029
 int-0.025
opp-0.022
 once-0.022
 rum-0.021
Token One
Token ID1881
Feature activation+2.22e+00

Pos logit contributions
add+0.005
 ven+0.004
 int+0.004
opp+0.004
utt+0.004
Neg logit contributions
 mats-0.003
 lap-0.003
 directions-0.003
liness-0.003
 newspapers-0.003
Token team
Token ID1074
Feature activation+2.36e+00

Pos logit contributions
liness+0.067
 yourself+0.064
 directions+0.064
 newspapers+0.059
 mats+0.058
Neg logit contributions
add-0.057
 rum-0.042
 once-0.042
 int-0.041
utt-0.040
Token had
Token ID550
Feature activation+1.48e+00

Pos logit contributions
 directions+0.056
liness+0.056
 lap+0.054
 mats+0.053
 yourself+0.050
Neg logit contributions
add-0.075
utt-0.057
opp-0.056
 int-0.055
 rum-0.054
Token friends
Token ID2460
Feature activation+7.46e-01

Pos logit contributions
liness+0.044
 directions+0.042
 zone+0.041
 mats+0.039
 lap+0.039
Neg logit contributions
add-0.054
 int-0.041
opp-0.041
 ven-0.039
utt-0.039
Token.
Token ID13
Feature activation-6.21e-01

Pos logit contributions
 directions+0.021
liness+0.021
 mats+0.020
 lap+0.019
 zone+0.019
Neg logit contributions
add-0.021
 int-0.016
opp-0.015
utt-0.015
 rum-0.014
Token They
Token ID1119
Feature activation-7.02e-02

Pos logit contributions
add+0.017
opp+0.012
 int+0.012
 ven+0.012
utt+0.011
Neg logit contributions
liness-0.018
 directions-0.018
 lap-0.018
 mats-0.017
 zone-0.017
Token would
Token ID561
Feature activation+1.40e+00

Pos logit contributions
add+0.002
 int+0.002
opp+0.002
utt+0.002
 rum+0.002
Neg logit contributions
liness-0.002
 directions-0.002
 lap-0.001
 zone-0.001
 mats-0.001
Token kick
Token ID4829
Feature activation+1.55e+00

Pos logit contributions
liness+0.032
 directions+0.031
 zone+0.030
 mats+0.028
 lap+0.028
Neg logit contributions
add-0.046
 int-0.037
 rum-0.036
opp-0.036
utt-0.036
Token the
Token ID262
Feature activation+3.76e+00

Pos logit contributions
liness+0.039
 directions+0.037
 zone+0.037
 lap+0.036
 seek+0.034
Neg logit contributions
add-0.047
 int-0.038
 rum-0.035
opp-0.035
 attent-0.034
Token ball
Token ID2613
Feature activation+1.89e+00

Pos logit contributions
liness+0.092
 directions+0.082
 zone+0.079
rd+0.078
Kim+0.074
Neg logit contributions
add-0.109
 int-0.092
 rum-0.085
 once-0.084
 attent-0.079
Token around
Token ID1088
Feature activation+1.59e-01

Pos logit contributions
liness+0.051
 directions+0.051
 zone+0.047
 lap+0.047
 mats+0.047
Neg logit contributions
add-0.055
 int-0.042
utt-0.041
opp-0.039
 rum-0.038
Token and
Token ID290
Feature activation+6.64e-01

Pos logit contributions
liness+0.004
 directions+0.004
 lap+0.004
 mats+0.004
 zone+0.003
Neg logit contributions
add-0.005
 int-0.004
opp-0.004
utt-0.004
 rum-0.004
Token try
Token ID1949
Feature activation-4.64e-02

Pos logit contributions
liness+0.016
 directions+0.016
 zone+0.015
 yourself+0.014
 mats+0.014
Neg logit contributions
add-0.019
 int-0.017
 rum-0.017
utt-0.016
 erupt-0.015
Token to
Token ID284
Feature activation+2.04e+00

Pos logit contributions
add+0.001
 int+0.001
opp+0.001
utt+0.001
 rum+0.001
Neg logit contributions
liness-0.001
 directions-0.001
 zone-0.001
 lap-0.001
 mats-0.001
Token players
Token ID1938
Feature activation+1.59e+00

Pos logit contributions
 directions+0.016
liness+0.015
 mats+0.015
 zone+0.014
 lap+0.014
Neg logit contributions
add-0.028
opp-0.023
utt-0.022
 int-0.022
 ven-0.022
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
 directions+0.048
liness+0.048
 lap+0.044
 mats+0.044
 zone+0.044
Neg logit contributions
add-0.040
 int-0.029
opp-0.028
utt-0.028
 rum-0.026
Token They
Token ID1119
Feature activation+1.39e+00

Pos logit contributions
add+0.011
 int+0.009
 ven+0.008
opp+0.008
utt+0.008
Neg logit contributions
 directions-0.012
liness-0.011
 lap-0.011
 mats-0.011
 sandbox-0.011
Token can
Token ID460
Feature activation+2.42e+00

Pos logit contributions
liness+0.037
 directions+0.035
 zone+0.033
 lap+0.033
 mats+0.031
Neg logit contributions
add-0.042
 int-0.032
 rum-0.032
opp-0.031
utt-0.031
Token kick
Token ID4829
Feature activation+2.44e+00

Pos logit contributions
liness+0.056
 directions+0.054
 zone+0.052
 lap+0.050
 mats+0.049
Neg logit contributions
add-0.081
 int-0.064
utt-0.062
opp-0.062
 rum-0.061
Token the
Token ID262
Feature activation+3.76e+00

Pos logit contributions
liness+0.066
 zone+0.063
 lap+0.062
 receiver+0.062
 directions+0.061
Neg logit contributions
add-0.064
 int-0.055
 once-0.049
 rum-0.049
opp-0.047
Token ball
Token ID2613
Feature activation+1.81e+00

Pos logit contributions
liness+0.090
 directions+0.079
 zone+0.077
rd+0.076
Kim+0.074
Neg logit contributions
add-0.109
 int-0.093
 once-0.088
 rum-0.085
der-0.081
Token far
Token ID1290
Feature activation+1.14e+00

Pos logit contributions
liness+0.047
 directions+0.047
 mats+0.044
 lap+0.044
 zone+0.044
Neg logit contributions
add-0.054
 int-0.041
utt-0.040
opp-0.040
 rum-0.038
Token and
Token ID290
Feature activation+1.82e+00

Pos logit contributions
liness+0.026
 lap+0.025
 directions+0.025
 sandbox+0.024
 mats+0.024
Neg logit contributions
add-0.036
 int-0.030
opp-0.028
 rum-0.028
 attent-0.027
Token fast
Token ID3049
Feature activation+9.54e-01

Pos logit contributions
liness+0.048
 zone+0.044
 directions+0.044
 receiver+0.041
 newspapers+0.040
Neg logit contributions
add-0.048
 rum-0.043
 int-0.043
utt-0.040
 erupt-0.040
Token.
Token ID13
Feature activation-4.07e-01

Pos logit contributions
liness+0.028
 directions+0.028
 lap+0.027
 seek+0.026
 mats+0.026
Neg logit contributions
add-0.024
 int-0.018
opp-0.017
 rum-0.017
utt-0.017
Token it
Token ID340
Feature activation+7.64e-03

Pos logit contributions
liness+0.037
 zone+0.037
 lap+0.035
 seek+0.034
 sandbox+0.034
Neg logit contributions
add-0.033
 int-0.027
opp-0.023
utt-0.023
 attent-0.023
Token.
Token ID13
Feature activation+4.73e-01

Pos logit contributions
liness+0.000
 directions+0.000
 mats+0.000
 lap+0.000
 zone+0.000
Neg logit contributions
add-0.000
 int-0.000
utt-0.000
opp-0.000
 ven-0.000
Token She
Token ID1375
Feature activation+4.79e-01

Pos logit contributions
liness+0.009
 directions+0.009
 lap+0.009
 mats+0.008
 zone+0.008
Neg logit contributions
add-0.017
 erupt-0.014
 ven-0.013
utt-0.013
 rum-0.013
Token kept
Token ID4030
Feature activation+1.14e+00

Pos logit contributions
 directions+0.010
liness+0.009
 mats+0.009
 seek+0.009
 lap+0.009
Neg logit contributions
add-0.017
opp-0.013
 int-0.013
utt-0.013
 ven-0.013
Token rolling
Token ID10708
Feature activation+1.40e+00

Pos logit contributions
 directions+0.022
liness+0.022
 mats+0.020
 lap+0.020
 seek+0.020
Neg logit contributions
add-0.043
utt-0.033
opp-0.033
 int-0.032
 ven-0.032
Token the
Token ID262
Feature activation+3.76e+00

Pos logit contributions
liness+0.039
 lap+0.037
 zone+0.037
 receiver+0.037
 directions+0.035
Neg logit contributions
add-0.037
 int-0.030
 once-0.027
 attent-0.026
utt-0.025
Token hoop
Token ID44792
Feature activation+2.39e+00

Pos logit contributions
rd+0.079
rum+0.078
 zone+0.075
cher+0.074
liness+0.073
Neg logit contributions
add-0.102
 int-0.090
 work-0.084
 attent-0.083
igg-0.081
Token and
Token ID290
Feature activation+5.61e-01

Pos logit contributions
liness+0.062
 directions+0.061
 zone+0.058
 mats+0.057
 lap+0.056
Neg logit contributions
add-0.069
 int-0.058
utt-0.053
opp-0.052
 rum-0.051
Token got
Token ID1392
Feature activation+3.05e-01

Pos logit contributions
liness+0.014
 zone+0.014
 directions+0.013
 lap+0.013
 mats+0.013
Neg logit contributions
add-0.016
 int-0.014
 rum-0.013
utt-0.013
opp-0.012
Token better
Token ID1365
Feature activation+1.41e-01

Pos logit contributions
 directions+0.007
 mats+0.006
liness+0.006
 tag+0.006
 seek+0.006
Neg logit contributions
add-0.011
opp-0.008
 ven-0.008
utt-0.008
 int-0.008
Token and
Token ID290
Feature activation+2.68e-01

Pos logit contributions
liness+0.004
 directions+0.004
 lap+0.003
 mats+0.003
 zone+0.003
Neg logit contributions
add-0.004
 int-0.003
opp-0.003
utt-0.003
 rum-0.003
Token.
Token ID13
Feature activation+1.41e-01

Pos logit contributions
add+0.005
 once+0.003
utt+0.003
 int+0.003
ores+0.003
Neg logit contributions
 directions-0.009
liness-0.008
 zone-0.008
 keyboard-0.008
 receiver-0.008
Token Lucy
Token ID22162
Feature activation+8.15e-01

Pos logit contributions
 directions+0.003
 mats+0.003
liness+0.003
 lap+0.003
 zone+0.003
Neg logit contributions
add-0.005
opp-0.004
 int-0.004
 ven-0.004
utt-0.004
Token liked
Token ID8288
Feature activation+9.13e-01

Pos logit contributions
 directions+0.021
liness+0.021
 mats+0.019
 zone+0.019
 lap+0.019
Neg logit contributions
add-0.025
opp-0.019
utt-0.018
 int-0.018
 rum-0.018
Token to
Token ID284
Feature activation+9.74e-01

Pos logit contributions
liness+0.027
 directions+0.026
 zone+0.026
 lap+0.025
 seek+0.024
Neg logit contributions
add-0.024
 int-0.018
utt-0.017
 rum-0.017
 ven-0.016
Token throw
Token ID3714
Feature activation+1.71e+00

Pos logit contributions
liness+0.028
 directions+0.026
 zone+0.026
 receiver+0.025
boxes+0.024
Neg logit contributions
add-0.027
 rum-0.020
utt-0.020
 erupt-0.019
 int-0.019
Token the
Token ID262
Feature activation+3.74e+00

Pos logit contributions
liness+0.046
 zone+0.045
 lap+0.044
 directions+0.043
 keyboard+0.042
Neg logit contributions
add-0.047
 int-0.034
 rum-0.033
 O-0.032
 once-0.032
Token hoop
Token ID44792
Feature activation+1.91e+00

Pos logit contributions
rd+0.103
liness+0.098
rum+0.094
keeping+0.090
roads+0.087
Neg logit contributions
add-0.091
 dot-0.070
une-0.069
 all-0.068
 once-0.068
Token and
Token ID290
Feature activation+7.11e-02

Pos logit contributions
liness+0.055
 directions+0.054
 zone+0.051
 lap+0.050
 mats+0.049
Neg logit contributions
add-0.051
 int-0.040
utt-0.038
opp-0.036
 rum-0.035
Token watch
Token ID2342
Feature activation-1.92e-01

Pos logit contributions
liness+0.002
 zone+0.002
 directions+0.002
rd+0.002
 yourself+0.002
Neg logit contributions
add-0.002
 rum-0.002
 int-0.001
 erupt-0.001
 knocked-0.001
Token Sky
Token ID5274
Feature activation-1.07e+00

Pos logit contributions
add+0.007
opp+0.005
utt+0.005
 int+0.005
 ven+0.005
Neg logit contributions
 directions-0.004
liness-0.004
 mats-0.004
 lap-0.004
 zone-0.004
Token fly
Token ID6129
Feature activation-3.03e-01

Pos logit contributions
add+0.038
utt+0.029
opp+0.029
 ven+0.028
omm+0.028
Neg logit contributions
 directions-0.023
liness-0.023
 mats-0.022
 lap-0.022
 zone-0.021
Token day
Token ID1110
Feature activation+1.35e+00

Pos logit contributions
 directions+0.027
liness+0.027
 mats+0.026
 lap+0.025
 zone+0.025
Neg logit contributions
add-0.036
 int-0.027
opp-0.027
utt-0.027
 rum-0.026
Token the
Token ID262
Feature activation+1.75e+00

Pos logit contributions
liness+0.032
 directions+0.031
 lap+0.029
 mats+0.028
 zone+0.028
Neg logit contributions
add-0.044
opp-0.035
 rum-0.035
 int-0.035
utt-0.034
Token boy
Token ID2933
Feature activation-1.26e-01

Pos logit contributions
liness+0.044
 directions+0.043
 zone+0.041
 mats+0.041
 yourself+0.040
Neg logit contributions
add-0.052
 int-0.041
 rum-0.040
 once-0.038
 attent-0.036
Token would
Token ID561
Feature activation+1.12e+00

Pos logit contributions
add+0.004
 int+0.003
opp+0.003
utt+0.003
 ven+0.003
Neg logit contributions
 directions-0.003
liness-0.003
 mats-0.002
 lap-0.002
 zone-0.002
Token hide
Token ID7808
Feature activation+7.56e-01

Pos logit contributions
liness+0.026
 directions+0.026
 zone+0.024
 mats+0.024
 newspapers+0.023
Neg logit contributions
add-0.036
 int-0.029
 rum-0.028
utt-0.028
opp-0.028
Token the
Token ID262
Feature activation+3.73e+00

Pos logit contributions
liness+0.023
 directions+0.022
 lap+0.022
 zone+0.021
 sandbox+0.021
Neg logit contributions
add-0.019
 int-0.015
 attent-0.013
opp-0.013
 rum-0.013
Token football
Token ID4346
Feature activation+1.94e+00

Pos logit contributions
liness+0.084
 zone+0.079
 directions+0.078
 mats+0.074
 lap+0.074
Neg logit contributions
add-0.111
 int-0.096
 rum-0.091
 once-0.089
 attent-0.086
Token so
Token ID523
Feature activation+1.72e+00

Pos logit contributions
 directions+0.054
liness+0.054
 mats+0.050
 zone+0.049
 lap+0.049
Neg logit contributions
add-0.054
 int-0.042
opp-0.040
utt-0.039
 rum-0.038
Token he
Token ID339
Feature activation-1.67e-01

Pos logit contributions
 directions+0.039
liness+0.037
 mats+0.036
 lap+0.036
 seek+0.035
Neg logit contributions
add-0.059
opp-0.045
 int-0.045
utt-0.044
 ven-0.043
Token could
Token ID714
Feature activation+1.51e+00

Pos logit contributions
add+0.006
 int+0.005
opp+0.005
utt+0.005
omm+0.005
Neg logit contributions
liness-0.003
 mats-0.003
 directions-0.003
 seek-0.003
 lap-0.003
Token find
Token ID1064
Feature activation+5.45e-01

Pos logit contributions
liness+0.033
 directions+0.033
 mats+0.030
 zone+0.030
 lap+0.030
Neg logit contributions
add-0.052
 int-0.041
utt-0.040
opp-0.040
 rum-0.039
Token can
Token ID460
Feature activation+1.73e+00

Pos logit contributions
 directions+0.006
liness+0.006
 mats+0.006
 lap+0.006
 zone+0.006
Neg logit contributions
add-0.010
 int-0.008
opp-0.008
utt-0.008
 ven-0.008
Token run
Token ID1057
Feature activation+1.22e+00

Pos logit contributions
 directions+0.037
liness+0.037
 mats+0.034
 lap+0.034
 zone+0.034
Neg logit contributions
add-0.061
 int-0.047
opp-0.047
utt-0.047
 rum-0.045
Token fast
Token ID3049
Feature activation+1.43e+00

Pos logit contributions
liness+0.030
 directions+0.029
 zone+0.027
 lap+0.027
 sandbox+0.026
Neg logit contributions
add-0.038
 int-0.030
utt-0.029
opp-0.029
 rum-0.028
Token and
Token ID290
Feature activation+2.38e+00

Pos logit contributions
 directions+0.035
liness+0.035
 lap+0.033
 zone+0.031
 sandbox+0.031
Neg logit contributions
add-0.044
 int-0.034
opp-0.032
utt-0.032
 rum-0.032
Token kick
Token ID4829
Feature activation+2.34e+00

Pos logit contributions
liness+0.064
 zone+0.062
 directions+0.059
 newspapers+0.056
ability+0.056
Neg logit contributions
add-0.062
 int-0.054
 rum-0.053
 erupt-0.050
utt-0.050
Token the
Token ID262
Feature activation+3.68e+00

Pos logit contributions
liness+0.065
 directions+0.062
 zone+0.061
 lap+0.061
 sandbox+0.060
Neg logit contributions
add-0.061
 int-0.050
opp-0.044
 rum-0.044
 once-0.043
Token ball
Token ID2613
Feature activation+1.35e+00

Pos logit contributions
liness+0.082
 directions+0.073
 zone+0.073
rd+0.071
Kim+0.070
Neg logit contributions
add-0.115
 int-0.093
 once-0.087
 rum-0.084
 sense-0.083
Token really
Token ID1107
Feature activation+1.60e+00

Pos logit contributions
liness+0.037
 directions+0.037
 mats+0.035
 lap+0.035
 zone+0.034
Neg logit contributions
add-0.039
 int-0.029
utt-0.028
opp-0.028
 ven-0.027
Token far
Token ID1290
Feature activation+4.48e-01

Pos logit contributions
 directions+0.037
liness+0.037
 mats+0.035
 lap+0.035
 zone+0.034
Neg logit contributions
add-0.053
 int-0.040
opp-0.040
utt-0.040
 ven-0.038
Token."
Token ID526
Feature activation-1.68e+00

Pos logit contributions
liness+0.011
 directions+0.011
 mats+0.010
 lap+0.010
 sandbox+0.010
Neg logit contributions
add-0.014
 int-0.011
opp-0.011
utt-0.011
 rum-0.010
Token\n
Token ID198
Feature activation-9.07e-01

Pos logit contributions
add+0.050
 int+0.038
opp+0.038
 ven+0.037
utt+0.036
Neg logit contributions
liness-0.044
 directions-0.044
 lap-0.043
 mats-0.041
 sandbox-0.041
Token,
Token ID11
Feature activation+4.21e-01

Pos logit contributions
add+0.016
 int+0.013
opp+0.013
utt+0.013
 rum+0.012
Neg logit contributions
 directions-0.010
liness-0.010
 lap-0.009
 zone-0.009
 mats-0.009
Token "
Token ID366
Feature activation+8.05e-02

Pos logit contributions
 lap+0.006
liness+0.006
 mats+0.006
 directions+0.006
 tag+0.006
Neg logit contributions
add-0.018
opp-0.014
utt-0.014
 ven-0.013
 former-0.013
TokenI
Token ID40
Feature activation-4.95e-02

Pos logit contributions
 lap+0.002
 directions+0.002
liness+0.002
 mats+0.002
 goggles+0.002
Neg logit contributions
add-0.003
 int-0.002
 rum-0.002
 ven-0.002
 attent-0.002
Token admire
Token ID21099
Feature activation+1.91e+00

Pos logit contributions
add+0.002
 int+0.001
utt+0.001
opp+0.001
omm+0.001
Neg logit contributions
liness-0.001
 seek-0.001
 zone-0.001
 directions-0.001
 tag-0.001
Token your
Token ID534
Feature activation+2.39e+00

Pos logit contributions
liness+0.031
 directions+0.031
 lap+0.028
 zone+0.027
 mats+0.027
Neg logit contributions
add-0.076
 int-0.063
opp-0.061
utt-0.061
 rum-0.059
Token soccer
Token ID11783
Feature activation+3.68e+00

Pos logit contributions
liness+0.060
 directions+0.058
 zone+0.055
 lap+0.052
 sandbox+0.052
Neg logit contributions
add-0.072
 int-0.059
opp-0.054
 attent-0.053
 rum-0.053
Token skills
Token ID4678
Feature activation+2.36e+00

Pos logit contributions
 directions+0.083
liness+0.080
 receiver+0.074
 lap+0.070
 zone+0.069
Neg logit contributions
add-0.108
 int-0.092
opp-0.089
 once-0.086
 ven-0.086
Token,
Token ID11
Feature activation+7.14e-01

Pos logit contributions
 directions+0.064
liness+0.063
 sandbox+0.061
 mats+0.060
 lap+0.058
Neg logit contributions
add-0.062
 int-0.053
opp-0.050
 rum-0.047
utt-0.047
Token Lily
Token ID20037
Feature activation-5.04e-01

Pos logit contributions
 directions+0.018
 zone+0.017
liness+0.017
 lap+0.016
 mats+0.016
Neg logit contributions
add-0.022
 int-0.018
 rum-0.017
utt-0.016
opp-0.016
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
add+0.012
opp+0.009
 int+0.009
 rum+0.009
 ven+0.008
Neg logit contributions
 directions-0.016
 mats-0.015
liness-0.014
 lap-0.014
 sandbox-0.014
Token You
Token ID921
Feature activation+5.11e-01

Pos logit contributions
 directions+0.025
liness+0.025
 mats+0.024
 lap+0.023
 zone+0.023
Neg logit contributions
add-0.036
 int-0.028
utt-0.028
opp-0.028
 rum-0.026
Token excited
Token ID6568
Feature activation-3.31e-01

Pos logit contributions
add+0.007
 int+0.005
opp+0.005
utt+0.005
 ven+0.005
Neg logit contributions
 directions-0.005
liness-0.005
 mats-0.005
 lap-0.005
 zone-0.005
Token.
Token ID13
Feature activation+6.96e-01

Pos logit contributions
add+0.010
 int+0.007
opp+0.007
utt+0.007
 rum+0.007
Neg logit contributions
 directions-0.009
liness-0.009
 mats-0.008
 lap-0.008
 zone-0.008
Token She
Token ID1375
Feature activation+1.07e+00

Pos logit contributions
 directions+0.015
 lap+0.015
liness+0.014
 mats+0.014
 zone+0.013
Neg logit contributions
add-0.025
 int-0.019
utt-0.019
opp-0.019
 ven-0.018
Token wrapped
Token ID12908
Feature activation+2.30e+00

Pos logit contributions
 directions+0.023
liness+0.022
 mats+0.021
 lap+0.021
 seek+0.021
Neg logit contributions
add-0.038
opp-0.029
utt-0.029
 int-0.029
 ven-0.028
Token the
Token ID262
Feature activation+2.62e+00

Pos logit contributions
liness+0.054
 zone+0.054
 sandbox+0.053
 directions+0.051
 lap+0.051
Neg logit contributions
add-0.070
 int-0.058
 attent-0.053
 once-0.052
opp-0.052
Token string
Token ID4731
Feature activation+3.68e+00

Pos logit contributions
liness+0.070
 mats+0.065
 directions+0.065
 seek+0.063
 zone+0.063
Neg logit contributions
add-0.064
 int-0.056
 once-0.053
 rum-0.052
 sense-0.052
Token around
Token ID1088
Feature activation+1.11e+00

Pos logit contributions
liness+0.112
 directions+0.108
 zone+0.107
 mats+0.107
 sandbox+0.106
Neg logit contributions
add-0.082
 int-0.072
 once-0.062
 rum-0.061
 attent-0.060
Token her
Token ID607
Feature activation+5.66e-01

Pos logit contributions
liness+0.022
 directions+0.022
 zone+0.020
 mats+0.020
 lap+0.020
Neg logit contributions
add-0.040
 int-0.032
opp-0.031
utt-0.031
 rum-0.031
Token body
Token ID1767
Feature activation+1.74e+00

Pos logit contributions
liness+0.010
 directions+0.010
 zone+0.009
 mats+0.009
 Seek+0.009
Neg logit contributions
add-0.020
 int-0.017
opp-0.017
 rum-0.016
 attent-0.016
Token and
Token ID290
Feature activation+1.02e+00

Pos logit contributions
 directions+0.042
liness+0.042
 zone+0.039
 mats+0.039
 newspapers+0.038
Neg logit contributions
add-0.052
 int-0.045
opp-0.041
utt-0.040
 attent-0.040
Token held
Token ID2714
Feature activation+1.35e+00

Pos logit contributions
liness+0.027
 zone+0.027
 directions+0.027
 mats+0.025
 newspapers+0.025
Neg logit contributions
add-0.025
 int-0.022
 rum-0.021
 attent-0.020
utt-0.019
Token two
Token ID734
Feature activation+1.44e+00

Pos logit contributions
liness+0.032
 zone+0.029
 directions+0.029
 lap+0.027
robat+0.027
Neg logit contributions
add-0.013
 twenty-0.009
 only-0.008
 three-0.008
 rum-0.008
Token goals
Token ID4661
Feature activation+1.52e+00

Pos logit contributions
liness+0.041
 zone+0.038
 seek+0.036
 receiver+0.036
 lap+0.035
Neg logit contributions
add-0.041
 int-0.031
 rum-0.030
utt-0.028
der-0.027
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
liness+0.046
 directions+0.045
 lap+0.043
 mats+0.042
 zone+0.042
Neg logit contributions
add-0.039
 int-0.028
utt-0.028
opp-0.027
 ven-0.026
Token They
Token ID1119
Feature activation+1.21e+00

Pos logit contributions
 directions+0.000
liness+0.000
 mats+0.000
 lap+0.000
 sandbox+0.000
Neg logit contributions
add-0.000
 int-0.000
opp-0.000
 ven-0.000
utt-0.000
Token kick
Token ID4829
Feature activation+2.08e+00

Pos logit contributions
liness+0.030
 directions+0.029
 zone+0.027
 lap+0.027
 mats+0.026
Neg logit contributions
add-0.038
 int-0.030
 rum-0.029
opp-0.029
utt-0.029
Token the
Token ID262
Feature activation+3.68e+00

Pos logit contributions
liness+0.059
 zone+0.056
 seek+0.055
 lap+0.054
 receiver+0.054
Neg logit contributions
add-0.055
 int-0.045
 rum-0.041
 once-0.040
opp-0.039
Token ball
Token ID2613
Feature activation+1.97e+00

Pos logit contributions
liness+0.093
rd+0.084
 zone+0.081
 directions+0.080
 seek+0.077
Neg logit contributions
add-0.101
 int-0.084
 once-0.081
 rum-0.080
der-0.074
Token and
Token ID290
Feature activation+1.65e+00

Pos logit contributions
 directions+0.049
liness+0.049
 mats+0.047
 lap+0.046
 zone+0.046
Neg logit contributions
add-0.061
 int-0.046
utt-0.046
opp-0.046
 ven-0.043
Token try
Token ID1949
Feature activation+6.06e-01

Pos logit contributions
liness+0.046
 zone+0.043
 directions+0.042
rd+0.041
 receiver+0.040
Neg logit contributions
 rum-0.039
add-0.038
 int-0.036
 erupt-0.032
 twenty-0.031
Token to
Token ID284
Feature activation+2.70e+00

Pos logit contributions
liness+0.017
 directions+0.017
 zone+0.016
 lap+0.016
 mats+0.016
Neg logit contributions
add-0.017
 int-0.013
opp-0.012
utt-0.012
 rum-0.012
Token score
Token ID4776
Feature activation+1.07e+00

Pos logit contributions
liness+0.068
 directions+0.064
 zone+0.064
 mats+0.061
 newspapers+0.061
Neg logit contributions
add-0.082
 int-0.065
 rum-0.063
opp-0.062
utt-0.061
Token a
Token ID257
Feature activation-3.87e-01

Pos logit contributions
add+0.025
utt+0.021
 erupt+0.021
raper+0.020
 int+0.020
Neg logit contributions
 directions-0.010
liness-0.010
 mats-0.010
 newspapers-0.009
 tag-0.009
Token very
Token ID845
Feature activation+1.13e+00

Pos logit contributions
add+0.013
utt+0.010
 int+0.010
opp+0.010
 ven+0.010
Neg logit contributions
 directions-0.008
liness-0.008
 lap-0.008
 mats-0.008
 zone-0.008
Token special
Token ID2041
Feature activation-6.83e-01

Pos logit contributions
 directions+0.024
 lap+0.022
 mats+0.022
 seek+0.021
liness+0.021
Neg logit contributions
add-0.039
utt-0.031
opp-0.031
 ven-0.031
 int-0.030
Token toy
Token ID13373
Feature activation+9.03e-02

Pos logit contributions
add+0.023
 int+0.018
opp+0.018
utt+0.018
 rum+0.017
Neg logit contributions
 directions-0.015
liness-0.015
 mats-0.014
 zone-0.014
 lap-0.014
Token for
Token ID329
Feature activation-9.52e-01

Pos logit contributions
liness+0.002
 directions+0.002
 mats+0.002
 lap+0.002
 zone+0.002
Neg logit contributions
add-0.003
 int-0.002
opp-0.002
utt-0.002
 ven-0.002
Token L
Token ID406
Feature activation-4.43e+00

Pos logit contributions
add+0.039
 int+0.035
utt+0.034
 ven+0.033
une+0.033
Neg logit contributions
 directions-0.011
 yourself-0.011
 mats-0.011
 your-0.010
 tag-0.010
Tokenott
Token ID1252
Feature activation-2.14e+00

Pos logit contributions
add+0.103
 rum+0.091
 int+0.090
 Hipp+0.088
 erupt+0.086
Neg logit contributions
 yourself-0.123
liness-0.120
 zone-0.114
shirt-0.111
 skirts-0.108
Tokenie
Token ID494
Feature activation-6.51e-01

Pos logit contributions
add+0.062
 int+0.046
utt+0.046
opp+0.046
 rum+0.044
Neg logit contributions
liness-0.059
 directions-0.058
 zone-0.054
 lap-0.054
 mats-0.053
Token.
Token ID13
Feature activation-2.49e-01

Pos logit contributions
add+0.017
 int+0.012
opp+0.012
utt+0.012
 rum+0.012
Neg logit contributions
 directions-0.019
liness-0.019
 mats-0.018
 lap-0.018
 zone-0.018
Token It
Token ID632
Feature activation-4.58e-02

Pos logit contributions
add+0.008
opp+0.006
 int+0.006
utt+0.006
 ven+0.006
Neg logit contributions
liness-0.006
 directions-0.006
 mats-0.006
 lap-0.005
 zone-0.005
Token was
Token ID373
Feature activation+1.01e-01

Pos logit contributions
add+0.002
 int+0.001
opp+0.001
utt+0.001
 ven+0.001
Neg logit contributions
liness-0.001
 directions-0.001
 mats-0.001
 lap-0.001
 zone-0.001
Token\n
Token ID198
Feature activation-2.10e-01

Pos logit contributions
add+0.025
 ven+0.021
 eruption+0.021
 erupt+0.020
 rum+0.020
Neg logit contributions
 lap-0.014
liness-0.014
 directions-0.014
 mats-0.013
 sandbox-0.012
Token\n
Token ID198
Feature activation-8.78e-01

Pos logit contributions
add+0.007
 rum+0.006
 ven+0.006
utt+0.006
une+0.006
Neg logit contributions
 directions-0.004
 lap-0.004
liness-0.004
 mats-0.004
Your-0.004
TokenOne
Token ID3198
Feature activation+5.29e-01

Pos logit contributions
add+0.027
utt+0.020
opp+0.020
 int+0.020
 ven+0.020
Neg logit contributions
 directions-0.022
liness-0.022
 mats-0.021
 lap-0.021
 seek-0.020
Token day
Token ID1110
Feature activation+5.83e-01

Pos logit contributions
liness+0.013
 directions+0.013
 mats+0.012
 lap+0.012
 zone+0.012
Neg logit contributions
add-0.017
 int-0.013
opp-0.013
utt-0.013
 rum-0.013
Token,
Token ID11
Feature activation+4.64e-01

Pos logit contributions
 directions+0.011
liness+0.011
 mats+0.010
 lap+0.010
 zone+0.010
Neg logit contributions
add-0.022
 int-0.018
opp-0.018
utt-0.017
 rum-0.017
Token L
Token ID406
Feature activation-4.37e+00

Pos logit contributions
 directions+0.013
liness+0.013
 mats+0.013
 lap+0.012
 zone+0.012
Neg logit contributions
add-0.013
 int-0.009
utt-0.009
opp-0.009
 ven-0.009
Tokenott
Token ID1252
Feature activation-2.52e+00

Pos logit contributions
add+0.091
 int+0.068
 rum+0.065
 erupt+0.063
 attent+0.063
Neg logit contributions
liness-0.141
 yourself-0.138
 zone-0.132
 directions-0.131
 sandbox-0.125
Tokenie
Token ID494
Feature activation+9.39e-02

Pos logit contributions
add+0.073
utt+0.055
opp+0.055
 int+0.054
 rum+0.052
Neg logit contributions
liness-0.069
 directions-0.068
 zone-0.063
 mats-0.061
 lap-0.061
Token's
Token ID338
Feature activation-5.32e-01

Pos logit contributions
 directions+0.002
liness+0.002
 mats+0.002
 lap+0.002
 zone+0.002
Neg logit contributions
add-0.003
 int-0.002
opp-0.002
utt-0.002
 rum-0.002
Token mom
Token ID1995
Feature activation+1.00e+00

Pos logit contributions
add+0.016
utt+0.012
 int+0.012
opp+0.012
 ven+0.011
Neg logit contributions
 directions-0.014
 lap-0.014
liness-0.014
 mats-0.014
 sandbox-0.013
Token went
Token ID1816
Feature activation+5.20e-01

Pos logit contributions
liness+0.024
 directions+0.023
 seek+0.022
 lap+0.022
 zone+0.022
Neg logit contributions
add-0.033
 int-0.025
opp-0.025
utt-0.025
 ven-0.024
Token a
Token ID257
Feature activation-1.70e-01

Pos logit contributions
add+0.016
utt+0.013
 int+0.013
opp+0.013
 ven+0.012
Neg logit contributions
 mats-0.010
 directions-0.010
liness-0.009
 lap-0.009
 tag-0.009
Token little
Token ID1310
Feature activation+6.92e-01

Pos logit contributions
add+0.007
utt+0.006
 int+0.005
uned+0.005
opp+0.005
Neg logit contributions
 sandbox-0.003
 seek-0.003
 tag-0.003
 lap-0.003
 leash-0.003
Token pig
Token ID12967
Feature activation-4.17e-01

Pos logit contributions
liness+0.010
 directions+0.010
 mats+0.010
 sandbox+0.009
 lap+0.009
Neg logit contributions
add-0.029
 int-0.024
utt-0.024
opp-0.023
 attent-0.022
Tokengy
Token ID1360
Feature activation-1.75e+00

Pos logit contributions
add+0.014
 int+0.011
utt+0.011
opp+0.011
 ven+0.010
Neg logit contributions
liness-0.009
 directions-0.009
 lap-0.009
 mats-0.009
 zone-0.009
Token named
Token ID3706
Feature activation-1.33e+00

Pos logit contributions
add+0.062
 int+0.049
utt+0.048
opp+0.047
 ven+0.046
Neg logit contributions
 directions-0.037
liness-0.036
 lap-0.036
 mats-0.035
 sandbox-0.034
Token Bub
Token ID19645
Feature activation-4.17e+00

Pos logit contributions
add+0.043
 int+0.037
der+0.036
icker+0.036
 former+0.035
Neg logit contributions
 your-0.032
 my-0.029
 directions-0.026
 yourself-0.026
Spot-0.026
Tokenbles
Token ID7689
Feature activation-1.03e+00

Pos logit contributions
 int+0.134
 rum+0.130
add+0.126
 eruption+0.124
ores+0.124
Neg logit contributions
 yourself-0.070
liness-0.069
 zone-0.063
 pads-0.062
Your-0.060
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
add+0.035
 int+0.028
utt+0.027
 ven+0.027
omm+0.027
Neg logit contributions
 directions-0.021
liness-0.021
 zone-0.020
 lap-0.020
 sandbox-0.020
Token Bub
Token ID19645
Feature activation-2.49e+00

Pos logit contributions
add+0.028
 int+0.020
 ven+0.019
opp+0.019
 attent+0.018
Neg logit contributions
 directions-0.037
liness-0.037
 mats-0.035
 seek-0.035
 zone-0.035
Tokenbles
Token ID7689
Feature activation-5.48e-01

Pos logit contributions
add+0.081
 int+0.076
 rum+0.075
 attent+0.070
ores+0.069
Neg logit contributions
liness-0.047
 directions-0.044
 yourself-0.044
 zone-0.043
 sandbox-0.042
Token loved
Token ID6151
Feature activation-7.06e-01

Pos logit contributions
add+0.019
 int+0.015
opp+0.015
utt+0.015
 ven+0.014
Neg logit contributions
 directions-0.012
liness-0.012
 mats-0.011
 lap-0.011
 zone-0.011
Token was
Token ID373
Feature activation+4.57e-01

Pos logit contributions
 directions+0.028
 mats+0.027
 lap+0.027
liness+0.027
 newspapers+0.026
Neg logit contributions
add-0.033
opp-0.024
 rum-0.024
 once-0.024
 int-0.023
Token a
Token ID257
Feature activation+1.34e-02

Pos logit contributions
 directions+0.011
liness+0.011
 mats+0.010
 lap+0.010
 seek+0.010
Neg logit contributions
add-0.015
utt-0.011
opp-0.011
 int-0.011
 ven-0.011
Token little
Token ID1310
Feature activation+7.50e-01

Pos logit contributions
 sandbox+0.000
 seek+0.000
 tag+0.000
 hoop+0.000
 mats+0.000
Neg logit contributions
add-0.001
utt-0.000
 int-0.000
uned-0.000
opp-0.000
Token boy
Token ID2933
Feature activation-3.59e-01

Pos logit contributions
liness+0.011
 directions+0.011
 mats+0.010
 zone+0.010
 sandbox+0.010
Neg logit contributions
add-0.031
 int-0.026
utt-0.026
opp-0.025
 ven-0.024
Token named
Token ID3706
Feature activation+9.48e-02

Pos logit contributions
add+0.013
 int+0.010
utt+0.010
opp+0.010
 ven+0.010
Neg logit contributions
liness-0.007
 directions-0.007
 lap-0.007
 zone-0.007
 seek-0.007
Token T
Token ID309
Feature activation-4.14e+00

Pos logit contributions
 directions+0.002
liness+0.002
 mats+0.002
 seek+0.002
 Seek+0.002
Neg logit contributions
add-0.003
 int-0.002
 ven-0.002
opp-0.002
 attent-0.002
Tokenane
Token ID1531
Feature activation-6.77e-01

Pos logit contributions
add+0.125
 spe+0.111
 int+0.108
 rum+0.105
utt+0.104
Neg logit contributions
liness-0.094
 zone-0.088
 yourself-0.083
 newspapers-0.077
robat-0.077
Token.
Token ID13
Feature activation-5.26e-01

Pos logit contributions
add+0.026
opp+0.020
 ven+0.020
 int+0.020
utt+0.020
Neg logit contributions
liness-0.012
 lap-0.012
 zone-0.012
 directions-0.012
 tag-0.012
Token T
Token ID309
Feature activation-2.89e+00

Pos logit contributions
add+0.017
 int+0.013
utt+0.013
 ven+0.012
opp+0.012
Neg logit contributions
liness-0.013
 directions-0.012
 zone-0.012
 lap-0.012
 mats-0.012
Tokenane
Token ID1531
Feature activation+6.28e-01

Pos logit contributions
add+0.084
 int+0.075
 ven+0.073
 rum+0.072
 former+0.072
Neg logit contributions
 zone-0.063
 yourself-0.063
liness-0.061
 equipment-0.058
 newspapers-0.056
Token liked
Token ID8288
Feature activation-3.52e-01

Pos logit contributions
 directions+0.011
liness+0.011
 lap+0.010
 zone+0.010
 mats+0.010
Neg logit contributions
add-0.024
opp-0.019
 int-0.019
utt-0.019
 ven-0.019
Token a
Token ID257
Feature activation-1.47e-01

Pos logit contributions
add+0.014
utt+0.012
 int+0.011
opp+0.011
 ven+0.011
Neg logit contributions
 directions-0.009
 mats-0.009
liness-0.009
 lap-0.009
 seek-0.009
Token pink
Token ID11398
Feature activation-2.67e-01

Pos logit contributions
add+0.006
utt+0.005
 int+0.005
uned+0.004
opp+0.004
Neg logit contributions
 sandbox-0.003
 seek-0.003
 tag-0.003
 lap-0.003
 leash-0.003
Token pig
Token ID12967
Feature activation-1.84e-01

Pos logit contributions
add+0.010
utt+0.008
 int+0.008
opp+0.008
 ven+0.007
Neg logit contributions
 lap-0.005
liness-0.005
 mats-0.005
 sandbox-0.005
 zone-0.005
Tokengy
Token ID1360
Feature activation-1.48e+00

Pos logit contributions
add+0.006
 int+0.005
utt+0.005
opp+0.005
 ven+0.004
Neg logit contributions
liness-0.004
 directions-0.004
 mats-0.004
 lap-0.004
 zone-0.004
Token named
Token ID3706
Feature activation-1.26e+00

Pos logit contributions
add+0.050
 int+0.039
utt+0.038
opp+0.038
 ven+0.037
Neg logit contributions
liness-0.033
 directions-0.033
 lap-0.032
 mats-0.032
 zone-0.031
Token P
Token ID350
Feature activation-4.09e+00

Pos logit contributions
add+0.043
 int+0.036
icker+0.034
der+0.034
 ven+0.034
Neg logit contributions
 your-0.027
 my-0.025
 lap-0.024
 tag-0.023
 directions-0.023
Tokenipp
Token ID3974
Feature activation-1.28e+00

Pos logit contributions
 int+0.097
 eruption+0.095
 erupt+0.092
add+0.091
 rum+0.090
Neg logit contributions
liness-0.108
 yourself-0.102
 pads-0.096
 zone-0.095
 lap-0.091
Tokena
Token ID64
Feature activation-1.53e+00

Pos logit contributions
add+0.041
 int+0.038
 rum+0.035
 erupt+0.035
utt+0.034
Neg logit contributions
able-0.025
 zone-0.024
Your-0.024
ability-0.024
 directions-0.024
Token.
Token ID13
Feature activation+3.78e-01

Pos logit contributions
add+0.053
 int+0.042
utt+0.042
 rum+0.039
 ven+0.038
Neg logit contributions
liness-0.033
 directions-0.033
 lap-0.032
 mats-0.031
 zone-0.031
Token P
Token ID350
Feature activation-2.51e+00

Pos logit contributions
 directions+0.009
liness+0.009
 mats+0.009
 lap+0.009
 zone+0.009
Neg logit contributions
add-0.012
 int-0.009
opp-0.009
utt-0.009
 ven-0.008
Tokenipp
Token ID3974
Feature activation-6.58e-02

Pos logit contributions
 eruption+0.061
 erupt+0.058
 int+0.057
add+0.056
 rum+0.055
Neg logit contributions
 yourself-0.068
liness-0.066
Your-0.062
 sandbox-0.061
 mats-0.061
Token a
Token ID257
Feature activation-3.12e-01

Pos logit contributions
 directions+0.001
liness+0.001
 seek+0.001
 mats+0.001
 lap+0.001
Neg logit contributions
add-0.002
 int-0.001
utt-0.001
opp-0.001
 ven-0.001
Token big
Token ID1263
Feature activation-1.61e-01

Pos logit contributions
add+0.012
utt+0.011
 int+0.010
uned+0.010
urg+0.009
Neg logit contributions
 sandbox-0.005
 seek-0.005
 hoop-0.005
 tag-0.005
 mats-0.005
Token bird
Token ID6512
Feature activation-1.34e+00

Pos logit contributions
add+0.007
utt+0.006
 int+0.006
opp+0.006
urd+0.005
Neg logit contributions
 sandbox-0.002
 mats-0.002
liness-0.002
 lap-0.002
 seek-0.002
Token named
Token ID3706
Feature activation-7.16e-01

Pos logit contributions
add+0.042
 int+0.031
utt+0.031
opp+0.031
 rum+0.030
Neg logit contributions
 directions-0.033
liness-0.033
 mats-0.032
 lap-0.031
 zone-0.031
Token O
Token ID440
Feature activation-2.54e+00

Pos logit contributions
add+0.027
 int+0.022
 attent+0.021
 ven+0.021
icker+0.020
Neg logit contributions
 your-0.014
 tag-0.013
 my-0.013
 newspapers-0.012
 directions-0.012
Tokenstr
Token ID2536
Feature activation-4.00e+00

Pos logit contributions
add+0.088
 int+0.079
 attent+0.075
 rum+0.072
der+0.072
Neg logit contributions
liness-0.045
 sandbox-0.044
 directions-0.044
 yourself-0.042
 seek-0.042
Tokenich
Token ID488
Feature activation-1.04e+00

Pos logit contributions
add+0.159
 int+0.150
 attent+0.144
ores+0.143
 rum+0.138
Neg logit contributions
 yourself-0.048
 sandbox-0.046
liness-0.044
 directions-0.041
 pads-0.039
Token wanted
Token ID2227
Feature activation-3.60e-01

Pos logit contributions
add+0.042
 int+0.035
opp+0.033
utt+0.033
 ven+0.032
Neg logit contributions
liness-0.016
 directions-0.016
 lap-0.015
 seek-0.015
 sandbox-0.014
Token to
Token ID284
Feature activation+5.02e-02

Pos logit contributions
add+0.011
 int+0.009
opp+0.008
utt+0.008
 ven+0.008
Neg logit contributions
 directions-0.009
liness-0.009
 mats-0.009
 lap-0.009
 zone-0.008
Token swim
Token ID9422
Feature activation-6.13e-01

Pos logit contributions
 directions+0.001
 lap+0.001
 seek+0.001
liness+0.001
 mats+0.001
Neg logit contributions
add-0.002
 int-0.001
opp-0.001
utt-0.001
 attent-0.001
Token in
Token ID287
Feature activation+4.12e-01

Pos logit contributions
add+0.018
 int+0.013
utt+0.013
opp+0.013
 rum+0.012
Neg logit contributions
 directions-0.017
liness-0.017
 mats-0.016
 lap-0.016
 zone-0.016
Token was
Token ID373
Feature activation-1.73e-01

Pos logit contributions
 directions+0.013
liness+0.013
 mats+0.013
 lap+0.012
 newspapers+0.012
Neg logit contributions
add-0.018
opp-0.014
 rum-0.014
 int-0.014
utt-0.014
Token a
Token ID257
Feature activation-3.90e-02

Pos logit contributions
add+0.006
utt+0.005
 int+0.005
opp+0.004
 ven+0.004
Neg logit contributions
 directions-0.004
 mats-0.004
liness-0.004
 lap-0.004
 seek-0.004
Token little
Token ID1310
Feature activation+8.75e-01

Pos logit contributions
add+0.001
utt+0.001
 int+0.001
uned+0.001
opp+0.001
Neg logit contributions
 sandbox-0.001
 seek-0.001
 tag-0.001
 lap-0.001
 leash-0.001
Token girl
Token ID2576
Feature activation-4.25e-02

Pos logit contributions
liness+0.014
 directions+0.014
 mats+0.014
 lap+0.013
 zone+0.013
Neg logit contributions
add-0.035
 int-0.028
utt-0.028
opp-0.027
 ven-0.027
Token named
Token ID3706
Feature activation+1.66e-01

Pos logit contributions
add+0.001
opp+0.001
 int+0.001
utt+0.001
 ven+0.001
Neg logit contributions
 directions-0.001
liness-0.001
 mats-0.001
 zone-0.001
 lap-0.001
Token Sab
Token ID9910
Feature activation-3.96e+00

Pos logit contributions
 directions+0.004
liness+0.004
 mats+0.004
 lap+0.004
 seek+0.004
Neg logit contributions
add-0.006
 int-0.004
utt-0.004
opp-0.004
 rum-0.004
Tokenrina
Token ID22267
Feature activation-2.35e+00

Pos logit contributions
add+0.120
utt+0.100
 int+0.096
ores+0.095
 rum+0.095
Neg logit contributions
liness-0.092
 directions-0.091
 yourself-0.084
 zone-0.081
 book-0.081
Token.
Token ID13
Feature activation-8.94e-01

Pos logit contributions
add+0.088
utt+0.071
 int+0.069
opp+0.066
urd+0.066
Neg logit contributions
liness-0.047
 directions-0.046
 zone-0.043
 lap-0.042
 sandbox-0.040
Token She
Token ID1375
Feature activation+3.62e-01

Pos logit contributions
add+0.032
 int+0.025
utt+0.025
opp+0.025
 ven+0.024
Neg logit contributions
 directions-0.018
liness-0.018
 lap-0.017
 mats-0.017
 zone-0.017
Token had
Token ID550
Feature activation+1.17e-01

Pos logit contributions
 directions+0.008
liness+0.008
 mats+0.007
 zone+0.007
 lap+0.007
Neg logit contributions
add-0.012
opp-0.010
 int-0.010
utt-0.009
 rum-0.009
Token a
Token ID257
Feature activation-2.66e-01

Pos logit contributions
liness+0.003
 directions+0.003
 mats+0.003
 lap+0.003
 zone+0.003
Neg logit contributions
add-0.004
 int-0.003
opp-0.003
utt-0.003
 ven-0.002
Token to
Token ID284
Feature activation-1.36e+00

Pos logit contributions
add+0.025
 int+0.019
utt+0.018
opp+0.018
 rum+0.017
Neg logit contributions
 directions-0.029
liness-0.028
 mats-0.028
 lap-0.027
 zone-0.027
Token come
Token ID1282
Feature activation-2.56e-01

Pos logit contributions
add+0.057
utt+0.048
 int+0.047
icker+0.047
omm+0.046
Neg logit contributions
 your-0.019
 directions-0.019
 yourself-0.018
 tag-0.017
liness-0.017
Token by
Token ID416
Feature activation-1.23e+00

Pos logit contributions
add+0.008
 int+0.006
utt+0.006
opp+0.006
 rum+0.006
Neg logit contributions
 directions-0.006
liness-0.006
 mats-0.006
 lap-0.006
 zone-0.006
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
add+0.042
utt+0.032
 int+0.032
opp+0.032
 ven+0.031
Neg logit contributions
liness-0.027
 directions-0.027
 mats-0.026
 lap-0.025
 seek-0.025
Token asked
Token ID1965
Feature activation-5.74e-01

Pos logit contributions
add+0.052
opp+0.039
 int+0.039
utt+0.038
omm+0.037
Neg logit contributions
 directions-0.035
liness-0.034
 seek-0.033
 mats-0.032
 lap-0.031
Token N
Token ID399
Feature activation-3.93e+00

Pos logit contributions
add+0.021
 erupt+0.017
icker+0.017
utt+0.017
urd+0.017
Neg logit contributions
 directions-0.011
liness-0.010
 your-0.010
 yourself-0.009
 mats-0.009
Tokenell
Token ID695
Feature activation-5.74e-01

Pos logit contributions
 rum+0.094
 explosion+0.087
add+0.086
 int+0.085
 til+0.084
Neg logit contributions
 zone-0.101
 yourself-0.099
oys-0.098
liness-0.096
able-0.088
Tokenie
Token ID494
Feature activation-4.04e-01

Pos logit contributions
add+0.015
 int+0.011
opp+0.011
utt+0.011
 ven+0.010
Neg logit contributions
 directions-0.017
liness-0.017
 mats-0.016
 lap-0.016
 zone-0.016
Token if
Token ID611
Feature activation-7.41e-01

Pos logit contributions
add+0.012
 int+0.009
opp+0.009
utt+0.009
 ven+0.008
Neg logit contributions
 directions-0.011
liness-0.010
 mats-0.010
 lap-0.010
 zone-0.010
Token he
Token ID339
Feature activation-2.75e-01

Pos logit contributions
add+0.031
 int+0.026
utt+0.025
opp+0.025
 ven+0.025
Neg logit contributions
 directions-0.010
liness-0.010
 mats-0.009
 seek-0.009
 zone-0.008
Token could
Token ID714
Feature activation-2.23e-01

Pos logit contributions
add+0.011
 int+0.009
icker+0.009
opp+0.009
urd+0.009
Neg logit contributions
 seek-0.004
liness-0.004
 follows-0.004
 tag-0.004
 directions-0.004
Token scared
Token ID12008
Feature activation-1.94e-02

Pos logit contributions
add+0.034
opp+0.026
utt+0.025
omm+0.025
 int+0.024
Neg logit contributions
 directions-0.018
liness-0.017
 tag-0.017
 lap-0.017
 seek-0.016
Token,
Token ID11
Feature activation-3.49e-01

Pos logit contributions
add+0.001
opp+0.001
urd+0.001
utt+0.001
 int+0.001
Neg logit contributions
liness-0.000
 directions-0.000
 zone-0.000
 tag-0.000
 lap-0.000
Token but
Token ID475
Feature activation-8.10e-01

Pos logit contributions
add+0.014
 int+0.012
utt+0.012
opp+0.011
 rum+0.011
Neg logit contributions
liness-0.005
 directions-0.005
 mats-0.005
 zone-0.005
 lap-0.005
Token she
Token ID673
Feature activation-2.19e-01

Pos logit contributions
add+0.033
opp+0.027
 int+0.026
utt+0.026
 ven+0.025
Neg logit contributions
 directions-0.013
liness-0.012
 seek-0.012
 yourself-0.011
 follows-0.011
Token asked
Token ID1965
Feature activation-8.04e-01

Pos logit contributions
add+0.009
 int+0.007
utt+0.007
opp+0.007
andel+0.007
Neg logit contributions
 seek-0.004
 tag-0.003
liness-0.003
 directions-0.003
 lap-0.003
Token the
Token ID262
Feature activation-3.87e+00

Pos logit contributions
add+0.038
utt+0.031
urd+0.030
opp+0.030
 former+0.029
Neg logit contributions
 directions-0.009
liness-0.009
 your-0.008
 yourself-0.008
 newspapers-0.007
Token gen
Token ID2429
Feature activation-1.04e+00

Pos logit contributions
add+0.142
ores+0.128
utt+0.124
 int+0.119
inn+0.116
Neg logit contributions
 directions-0.082
 book-0.080
 other-0.078
 sandbox-0.076
 yourself-0.072
Tokenie
Token ID494
Feature activation-1.98e+00

Pos logit contributions
add+0.026
 int+0.018
opp+0.018
utt+0.017
 ven+0.016
Neg logit contributions
 directions-0.032
liness-0.032
 mats-0.032
 lap-0.031
 zone-0.030
Token to
Token ID284
Feature activation-1.69e+00

Pos logit contributions
add+0.066
opp+0.049
 int+0.049
utt+0.048
 ven+0.047
Neg logit contributions
liness-0.048
 directions-0.046
 zone-0.043
 mats-0.042
 lap-0.042
Token take
Token ID1011
Feature activation-1.46e-01

Pos logit contributions
add+0.071
utt+0.059
 int+0.057
opp+0.056
omm+0.056
Neg logit contributions
 yourself-0.026
 directions-0.026
 seek-0.023
liness-0.023
 tag-0.023
Token her
Token ID607
Feature activation-1.42e+00

Pos logit contributions
add+0.007
utt+0.006
 int+0.005
raper+0.005
der+0.005
Neg logit contributions
 directions-0.002
liness-0.002
 newspapers-0.001
 yourself-0.001
 your-0.001
Token to
Token ID284
Feature activation-1.77e+00

Pos logit contributions
 directions+0.016
 seek+0.016
liness+0.016
 lap+0.016
 mats+0.015
Neg logit contributions
add-0.026
utt-0.020
opp-0.020
 int-0.019
 ven-0.019
Token serve
Token ID4691
Feature activation+9.55e-01

Pos logit contributions
add+0.081
opp+0.068
 int+0.067
omm+0.066
utt+0.065
Neg logit contributions
 yourself-0.026
 your-0.026
 seek-0.025
 my-0.022
 directions-0.021
Token you
Token ID345
Feature activation-1.27e-01

Pos logit contributions
 directions+0.017
liness+0.016
 yourself+0.015
 mats+0.015
 lap+0.015
Neg logit contributions
add-0.039
utt-0.031
 int-0.030
opp-0.030
 ven-0.029
Token,"
Token ID553
Feature activation-2.58e+00

Pos logit contributions
add+0.004
 int+0.003
opp+0.003
utt+0.003
 rum+0.003
Neg logit contributions
 directions-0.003
liness-0.003
 mats-0.003
 lap-0.003
 zone-0.003
Token said
Token ID531
Feature activation-2.52e+00

Pos logit contributions
add+0.107
opp+0.084
utt+0.083
 int+0.080
omm+0.079
Neg logit contributions
 directions-0.045
liness-0.043
 seek-0.040
Your-0.039
 yourself-0.039
Token the
Token ID262
Feature activation-3.85e+00

Pos logit contributions
add+0.133
igg+0.108
utt+0.107
opp+0.106
ISE+0.104
Neg logit contributions
 your-0.018
 directions-0.016
 lap-0.013
 tag-0.013
 seek-0.010
Token gen
Token ID2429
Feature activation-9.61e-01

Pos logit contributions
add+0.138
ores+0.125
utt+0.118
 int+0.113
inn+0.111
Neg logit contributions
 directions-0.091
 other-0.082
 book-0.081
 receiver-0.079
 yourself-0.078
Tokenie
Token ID494
Feature activation-1.75e+00

Pos logit contributions
add+0.025
 int+0.017
opp+0.017
utt+0.017
 ven+0.016
Neg logit contributions
 directions-0.029
liness-0.029
 mats-0.028
 lap-0.028
 zone-0.027
Token.
Token ID13
Feature activation-2.54e+00

Pos logit contributions
add+0.064
opp+0.047
 ven+0.046
utt+0.046
 int+0.045
Neg logit contributions
liness-0.039
 directions-0.038
 lap-0.037
 zone-0.034
 seek-0.034
Token\n
Token ID198
Feature activation-1.13e+00

Pos logit contributions
add+0.118
opp+0.094
icker+0.092
urg+0.091
andel+0.090
Neg logit contributions
 lap-0.035
 Let-0.028
 seek-0.028
 tag-0.027
liness-0.026
Token\n
Token ID198
Feature activation-1.18e+00

Pos logit contributions
add+0.037
utt+0.028
 int+0.028
opp+0.028
 rum+0.027
Neg logit contributions
 directions-0.027
liness-0.026
 lap-0.025
 seek-0.025
 mats-0.024
Token was
Token ID373
Feature activation+3.48e-02

Pos logit contributions
 directions+0.010
liness+0.010
 seek+0.009
 mats+0.009
 lap+0.009
Neg logit contributions
add-0.020
opp-0.017
utt-0.016
 int-0.016
 ven-0.016
Token a
Token ID257
Feature activation-1.77e-01

Pos logit contributions
 directions+0.001
liness+0.001
 mats+0.001
 lap+0.001
 seek+0.001
Neg logit contributions
add-0.001
opp-0.001
utt-0.001
 int-0.001
 ven-0.001
Token cool
Token ID3608
Feature activation+1.36e+00

Pos logit contributions
add+0.006
utt+0.005
 int+0.005
opp+0.005
omm+0.004
Neg logit contributions
 seek-0.004
 directions-0.004
 sandbox-0.004
 mats-0.004
liness-0.004
Token day
Token ID1110
Feature activation+9.90e-01

Pos logit contributions
liness+0.027
 directions+0.027
 mats+0.025
 lap+0.025
 zone+0.025
Neg logit contributions
add-0.049
 int-0.039
opp-0.039
utt-0.039
 ven-0.037
Token when
Token ID618
Feature activation-2.82e-01

Pos logit contributions
 directions+0.030
liness+0.029
 mats+0.028
 sandbox+0.027
 lap+0.027
Neg logit contributions
add-0.025
 int-0.019
utt-0.019
 rum-0.018
opp-0.018
Token Ma
Token ID6669
Feature activation-3.83e+00

Pos logit contributions
add+0.010
 int+0.007
utt+0.007
opp+0.007
 ven+0.007
Neg logit contributions
 directions-0.006
liness-0.006
 mats-0.006
 seek-0.006
 lap-0.006
Tokenis
Token ID271
Feature activation-1.55e+00

Pos logit contributions
add+0.129
 int+0.109
opp+0.109
urd+0.109
 ven+0.107
Neg logit contributions
 mats-0.068
liness-0.067
 directions-0.063
 seek-0.062
 yourself-0.062
Tokenie
Token ID494
Feature activation-8.14e-01

Pos logit contributions
add+0.042
 int+0.030
opp+0.030
utt+0.029
 ven+0.028
Neg logit contributions
 directions-0.045
liness-0.045
 mats-0.043
 lap-0.043
 zone-0.042
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
add+0.028
 int+0.022
opp+0.022
utt+0.022
 ven+0.021
Neg logit contributions
liness-0.018
 directions-0.017
 lap-0.016
 mats-0.016
 zone-0.016
Token Benny
Token ID44275
Feature activation-3.11e-01

Pos logit contributions
add+0.044
utt+0.034
 int+0.032
opp+0.032
une+0.031
Neg logit contributions
liness-0.027
 directions-0.027
 seek-0.027
 mats-0.025
 sandbox-0.025
Token were
Token ID547
Feature activation+7.27e-01

Pos logit contributions
add+0.010
 int+0.008
utt+0.008
opp+0.008
 ven+0.008
Neg logit contributions
liness-0.007
 directions-0.007
 mats-0.007
 zone-0.007
 lap-0.007
Token fall
Token ID2121
Feature activation-2.24e+00

Pos logit contributions
add+0.057
 ven+0.046
omm+0.046
opp+0.046
utt+0.044
Neg logit contributions
 directions-0.032
liness-0.029
 lap-0.028
 mats-0.028
 tag-0.028
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
add+0.088
opp+0.071
utt+0.070
 ven+0.068
omm+0.066
Neg logit contributions
 directions-0.043
liness-0.041
 mats-0.038
 zone-0.037
 lap-0.036
Token One
Token ID1881
Feature activation+4.06e-01

Pos logit contributions
add+0.035
 eruption+0.029
 erupt+0.029
urg+0.027
 ven+0.027
Neg logit contributions
 lap-0.023
liness-0.022
 mats-0.022
 directions-0.021
 pads-0.020
Token day
Token ID1110
Feature activation+5.33e-01

Pos logit contributions
liness+0.013
 directions+0.013
 yourself+0.012
 mats+0.012
 Seek+0.012
Neg logit contributions
add-0.010
 rum-0.007
 int-0.007
 once-0.007
opp-0.006
Token,
Token ID11
Feature activation-9.44e-03

Pos logit contributions
 directions+0.011
liness+0.011
 lap+0.010
 mats+0.010
 zone+0.010
Neg logit contributions
add-0.019
opp-0.015
 int-0.015
utt-0.015
 rum-0.015
Token L
Token ID406
Feature activation-3.82e+00

Pos logit contributions
add+0.000
 int+0.000
utt+0.000
opp+0.000
 ven+0.000
Neg logit contributions
 directions-0.000
liness-0.000
 mats-0.000
 lap-0.000
 zone-0.000
Tokenizz
Token ID6457
Feature activation-1.19e+00

Pos logit contributions
 int+0.082
add+0.082
 eruption+0.077
 erupt+0.076
 former+0.076
Neg logit contributions
 yourself-0.099
liness-0.098
 directions-0.095
 zone-0.092
 skirts-0.092
Tokenie
Token ID494
Feature activation-6.56e-01

Pos logit contributions
add+0.029
 int+0.018
opp+0.018
utt+0.018
 rum+0.017
Neg logit contributions
 directions-0.039
 lap-0.038
liness-0.038
 mats-0.038
 seek-0.037
Token noticed
Token ID6810
Feature activation-2.73e-01

Pos logit contributions
add+0.020
 int+0.015
 rum+0.015
opp+0.015
utt+0.015
Neg logit contributions
 directions-0.016
liness-0.016
 lap-0.015
 mats-0.015
 zone-0.015
Token something
Token ID1223
Feature activation-8.06e-01

Pos logit contributions
add+0.009
 int+0.007
opp+0.007
utt+0.007
 ven+0.006
Neg logit contributions
 directions-0.007
liness-0.007
 mats-0.006
 lap-0.006
 zone-0.006
Token was
Token ID373
Feature activation-1.29e+00

Pos logit contributions
add+0.026
opp+0.020
utt+0.019
 int+0.019
 ven+0.018
Neg logit contributions
 directions-0.020
liness-0.019
 mats-0.019
 lap-0.018
 seek-0.018
Token a
Token ID257
Feature activation-1.32e+00

Pos logit contributions
add+0.088
omm+0.078
opp+0.076
utt+0.075
 ven+0.072
Neg logit contributions
 directions-0.039
 tag-0.032
 mats-0.031
 games-0.030
 equipment-0.030
Token small
Token ID1402
Feature activation-1.69e+00

Pos logit contributions
add+0.049
utt+0.039
opp+0.038
 int+0.037
omm+0.036
Neg logit contributions
 directions-0.028
 sandbox-0.026
 lap-0.026
liness-0.025
 tag-0.024
Token fire
Token ID2046
Feature activation-2.61e+00

Pos logit contributions
add+0.062
utt+0.048
opp+0.047
 int+0.046
 ven+0.045
Neg logit contributions
 directions-0.037
liness-0.036
 lap-0.035
 sandbox-0.034
 mats-0.034
Token near
Token ID1474
Feature activation-1.06e+00

Pos logit contributions
add+0.105
utt+0.082
opp+0.082
 int+0.077
 ven+0.076
Neg logit contributions
 directions-0.049
liness-0.048
 lap-0.048
 zone-0.046
 tag-0.045
Token the
Token ID262
Feature activation-1.62e+00

Pos logit contributions
add+0.046
utt+0.038
opp+0.036
 int+0.036
omm+0.036
Neg logit contributions
 directions-0.018
liness-0.015
 mats-0.014
 newspapers-0.014
 tag-0.014
Token snow
Token ID6729
Feature activation-3.82e+00

Pos logit contributions
add+0.054
 int+0.041
opp+0.041
utt+0.041
 ven+0.039
Neg logit contributions
 directions-0.037
liness-0.037
 mats-0.035
 lap-0.035
 zone-0.034
Tokenman
Token ID805
Feature activation-1.78e+00

Pos logit contributions
add+0.143
utt+0.113
 int+0.110
opp+0.109
 attent+0.102
Neg logit contributions
liness-0.078
 directions-0.076
 lap-0.075
 zone-0.072
 mats-0.072
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
add+0.069
opp+0.053
utt+0.053
 int+0.052
 ven+0.052
Neg logit contributions
 directions-0.034
liness-0.034
 lap-0.033
 mats-0.031
 zone-0.031
Token They
Token ID1119
Feature activation-1.06e+00

Pos logit contributions
add+0.050
 ven+0.042
opp+0.041
 rum+0.041
omm+0.040
Neg logit contributions
 mats-0.016
 lap-0.015
 directions-0.014
liness-0.013
 goggles-0.012
Token see
Token ID766
Feature activation-1.84e+00

Pos logit contributions
add+0.042
 int+0.033
utt+0.033
opp+0.033
 ven+0.032
Neg logit contributions
 directions-0.018
liness-0.018
 mats-0.017
 lap-0.017
 sandbox-0.016
Token the
Token ID262
Feature activation-1.66e+00

Pos logit contributions
add+0.066
opp+0.052
utt+0.052
 int+0.050
 ven+0.049
Neg logit contributions
 directions-0.040
liness-0.038
 mats-0.037
 sandbox-0.035
 newspapers-0.035
Token?"
Token ID1701
Feature activation-2.32e+00

Pos logit contributions
add+0.067
 int+0.054
 attent+0.054
utt+0.053
 ven+0.053
Neg logit contributions
 seek-0.026
liness-0.026
 yourself-0.025
 your-0.023
 tag-0.022
Token The
Token ID383
Feature activation-1.19e+00

Pos logit contributions
add+0.081
opp+0.060
utt+0.059
omm+0.058
 ven+0.057
Neg logit contributions
 directions-0.054
liness-0.049
 lap-0.047
 yourself-0.047
 zone-0.047
Token voice
Token ID3809
Feature activation-4.69e-01

Pos logit contributions
add+0.048
opp+0.043
utt+0.042
 int+0.040
imet+0.040
Neg logit contributions
 directions-0.023
liness-0.017
 receiver-0.017
 yourself-0.016
 tag-0.015
Token belonged
Token ID19611
Feature activation-5.22e-01

Pos logit contributions
add+0.018
opp+0.016
 ven+0.015
omm+0.014
 former+0.014
Neg logit contributions
 directions-0.008
liness-0.007
 receiver-0.007
 zone-0.007
 tag-0.007
Token to
Token ID284
Feature activation-1.86e+00

Pos logit contributions
add+0.017
 int+0.013
opp+0.013
utt+0.012
 ven+0.012
Neg logit contributions
 directions-0.013
liness-0.013
 mats-0.012
 lap-0.012
 zone-0.012
Token G
Token ID402
Feature activation-3.81e+00

Pos logit contributions
add+0.062
 int+0.050
omm+0.049
 ven+0.049
opp+0.049
Neg logit contributions
 directions-0.040
 yourself-0.038
liness-0.038
 zone-0.036
 tag-0.034
Tokeniz
Token ID528
Feature activation-2.29e+00

Pos logit contributions
add+0.129
 married+0.116
 ven+0.116
 rum+0.113
 int+0.113
Neg logit contributions
 directions-0.068
liness-0.064
 yourself-0.058
rum-0.058
 zone-0.057
Tokenmo
Token ID5908
Feature activation-9.31e-01

Pos logit contributions
add+0.059
opp+0.043
 ven+0.043
 int+0.040
utt+0.039
Neg logit contributions
 directions-0.071
liness-0.069
 zone-0.065
 lap-0.063
 receiver-0.062
Token's
Token ID338
Feature activation-8.25e-01

Pos logit contributions
add+0.035
 ven+0.027
opp+0.026
omm+0.026
utt+0.025
Neg logit contributions
 directions-0.019
liness-0.019
 lap-0.018
 zone-0.016
 receiver-0.016
Token friend
Token ID1545
Feature activation-1.65e-01

Pos logit contributions
add+0.030
opp+0.023
ores+0.023
utt+0.023
imet+0.023
Neg logit contributions
 directions-0.019
liness-0.016
 lap-0.016
 sandbox-0.014
 tag-0.014
Token,
Token ID11
Feature activation-1.14e-01

Pos logit contributions
add+0.006
opp+0.004
 ven+0.004
 int+0.004
utt+0.004
Neg logit contributions
liness-0.004
 directions-0.004
 lap-0.004
 zone-0.004
 mats-0.003
Token girl
Token ID2576
Feature activation+8.51e-02

Pos logit contributions
add+0.002
utt+0.001
 int+0.001
uned+0.001
opp+0.001
Neg logit contributions
 sandbox-0.001
 seek-0.001
 tag-0.001
 lap-0.001
 hoop-0.001
Token called
Token ID1444
Feature activation+5.00e-01

Pos logit contributions
 directions+0.002
liness+0.002
 mats+0.002
 lap+0.002
 zone+0.002
Neg logit contributions
add-0.003
 int-0.002
opp-0.002
utt-0.002
 ven-0.002
Token N
Token ID399
Feature activation-2.34e+00

Pos logit contributions
 directions+0.011
liness+0.011
 mats+0.010
 lap+0.010
 zone+0.010
Neg logit contributions
add-0.017
 int-0.013
opp-0.013
utt-0.013
 ven-0.013
Tokenell
Token ID695
Feature activation-1.07e+00

Pos logit contributions
add+0.080
 rum+0.069
 erupt+0.068
 eruption+0.068
 int+0.067
Neg logit contributions
 zone-0.050
 lap-0.044
 mats-0.044
liness-0.044
 skirts-0.041
Token.
Token ID13
Feature activation+4.28e-03

Pos logit contributions
add+0.031
 int+0.023
utt+0.023
opp+0.023
 ven+0.022
Neg logit contributions
 directions-0.029
liness-0.028
 mats-0.027
 lap-0.027
 zone-0.027
Token N
Token ID399
Feature activation-3.80e+00

Pos logit contributions
liness+0.000
 directions+0.000
 mats+0.000
 lap+0.000
 zone+0.000
Neg logit contributions
add-0.000
utt-0.000
 int-0.000
 ven-0.000
 erupt-0.000
Tokenell
Token ID695
Feature activation+9.05e-02

Pos logit contributions
 int+0.088
 rum+0.087
add+0.084
 eruption+0.081
 erupt+0.080
Neg logit contributions
 zone-0.109
 yourself-0.103
oys-0.103
liness-0.098
 skirts-0.097
Token was
Token ID373
Feature activation+6.17e-01

Pos logit contributions
 directions+0.002
liness+0.002
 mats+0.002
 zone+0.002
 lap+0.002
Neg logit contributions
add-0.003
 int-0.003
utt-0.003
opp-0.003
 ven-0.003
Token very
Token ID845
Feature activation+1.93e+00

Pos logit contributions
 seek+0.010
 lap+0.010
 tag+0.010
 your+0.010
 directions+0.010
Neg logit contributions
add-0.023
 int-0.019
utt-0.019
der-0.018
imet-0.018
Token special
Token ID2041
Feature activation+7.85e-01

Pos logit contributions
 directions+0.038
 seek+0.037
liness+0.037
 lap+0.037
 mats+0.037
Neg logit contributions
add-0.068
 int-0.054
utt-0.053
opp-0.051
 ven-0.051
Token because
Token ID780
Feature activation+2.01e-01

Pos logit contributions
 directions+0.023
liness+0.022
 mats+0.021
 lap+0.021
 zone+0.021
Neg logit contributions
add-0.021
 int-0.015
opp-0.015
 rum-0.015
utt-0.015
Token was
Token ID373
Feature activation-3.38e-01

Pos logit contributions
 directions+0.012
liness+0.012
 mats+0.012
 lap+0.012
 zone+0.011
Neg logit contributions
add-0.019
opp-0.015
 int-0.015
utt-0.015
 rum-0.014
Token a
Token ID257
Feature activation-4.27e-02

Pos logit contributions
add+0.012
utt+0.009
 int+0.009
opp+0.009
 ven+0.009
Neg logit contributions
 mats-0.007
 directions-0.007
 seek-0.006
 lap-0.006
liness-0.006
Token little
Token ID1310
Feature activation+8.54e-01

Pos logit contributions
add+0.002
utt+0.001
 int+0.001
uned+0.001
opp+0.001
Neg logit contributions
 sandbox-0.001
 seek-0.001
 tag-0.001
 leash-0.001
 hoop-0.001
Token spider
Token ID19230
Feature activation-7.52e-01

Pos logit contributions
liness+0.014
 directions+0.014
 mats+0.013
 lap+0.013
 zone+0.013
Neg logit contributions
add-0.034
 int-0.028
utt-0.027
opp-0.027
 ven-0.026
Token named
Token ID3706
Feature activation-2.91e-01

Pos logit contributions
add+0.026
 int+0.020
utt+0.020
opp+0.020
 ven+0.019
Neg logit contributions
liness-0.017
 directions-0.016
 lap-0.016
 mats-0.016
 zone-0.015
Token Sp
Token ID1338
Feature activation-3.78e+00

Pos logit contributions
add+0.011
 int+0.009
utt+0.008
opp+0.008
 rum+0.008
Neg logit contributions
 mats-0.006
 directions-0.005
liness-0.005
 lap-0.005
 seek-0.005
Tokenide
Token ID485
Feature activation-1.60e+00

Pos logit contributions
add+0.118
utt+0.102
 int+0.101
 rum+0.099
imet+0.093
Neg logit contributions
liness-0.078
 mats-0.077
 zone-0.074
robat-0.073
 mat-0.072
Tokeny
Token ID88
Feature activation-5.05e-01

Pos logit contributions
add+0.056
 int+0.045
utt+0.045
opp+0.043
 ven+0.043
Neg logit contributions
liness-0.033
 directions-0.032
 zone-0.032
 mats-0.031
 seek-0.029
Token.
Token ID13
Feature activation-9.09e-01

Pos logit contributions
add+0.019
utt+0.015
 int+0.015
opp+0.014
 ven+0.014
Neg logit contributions
liness-0.010
 directions-0.009
 lap-0.009
 mats-0.009
 zone-0.009
Token Sp
Token ID1338
Feature activation-2.98e+00

Pos logit contributions
add+0.025
 int+0.018
 ven+0.017
opp+0.017
utt+0.016
Neg logit contributions
liness-0.026
 directions-0.026
 mats-0.026
 lap-0.025
 seek-0.025
Tokenide
Token ID485
Feature activation-7.42e-01

Pos logit contributions
add+0.089
 int+0.077
utt+0.077
 rum+0.075
 twenty+0.070
Neg logit contributions
liness-0.066
 mats-0.064
 zone-0.063
 yourself-0.061
robat-0.061
Token hungry
Token ID14720
Feature activation+1.05e+00

Pos logit contributions
 directions+0.034
 seek+0.034
 mats+0.032
 lap+0.032
liness+0.032
Neg logit contributions
add-0.063
 int-0.051
utt-0.050
opp-0.049
imet-0.048
Token.
Token ID13
Feature activation+9.74e-02

Pos logit contributions
 directions+0.031
liness+0.030
ability+0.028
 receiver+0.028
 lap+0.027
Neg logit contributions
add-0.025
 rum-0.021
 int-0.020
 once-0.019
opp-0.019
Token One
Token ID1881
Feature activation+1.78e-01

Pos logit contributions
liness+0.002
 lap+0.002
 mats+0.002
 directions+0.002
 zone+0.002
Neg logit contributions
add-0.003
 erupt-0.003
 eruption-0.003
utt-0.003
 ven-0.003
Token morning
Token ID3329
Feature activation+1.42e-01

Pos logit contributions
 directions+0.005
liness+0.005
 mats+0.004
 lap+0.004
 zone+0.004
Neg logit contributions
add-0.006
 int-0.004
opp-0.004
utt-0.004
 rum-0.004
Token,
Token ID11
Feature activation+4.76e-01

Pos logit contributions
 directions+0.003
liness+0.003
 mats+0.003
 lap+0.003
 zone+0.003
Neg logit contributions
add-0.005
opp-0.004
 int-0.004
utt-0.004
 rum-0.004
Token N
Token ID399
Feature activation-3.77e+00

Pos logit contributions
liness+0.007
 directions+0.007
 mats+0.007
 lap+0.007
 seek+0.007
Neg logit contributions
add-0.019
utt-0.016
 int-0.015
 erupt-0.015
 ven-0.015
Tokenell
Token ID695
Feature activation-4.39e-01

Pos logit contributions
 int+0.086
add+0.085
 rum+0.085
 eruption+0.075
 erupt+0.072
Neg logit contributions
 yourself-0.109
 zone-0.108
oys-0.105
liness-0.098
 mats-0.095
Tokenie
Token ID494
Feature activation-6.05e-01

Pos logit contributions
add+0.012
 int+0.008
opp+0.008
utt+0.008
 ven+0.008
Neg logit contributions
 directions-0.013
liness-0.013
 mats-0.012
 lap-0.012
 zone-0.012
Token was
Token ID373
Feature activation-7.74e-02

Pos logit contributions
add+0.020
 int+0.016
opp+0.016
utt+0.016
 ven+0.015
Neg logit contributions
liness-0.014
 directions-0.014
 mats-0.013
 lap-0.013
 zone-0.012
Token very
Token ID845
Feature activation+6.00e-01

Pos logit contributions
add+0.003
utt+0.002
opp+0.002
 int+0.002
imet+0.002
Neg logit contributions
 lap-0.001
liness-0.001
 seek-0.001
 your-0.001
 directions-0.001
Token excited
Token ID6568
Feature activation-1.40e-01

Pos logit contributions
 directions+0.013
 lap+0.013
 seek+0.013
liness+0.012
 mats+0.012
Neg logit contributions
add-0.020
utt-0.015
 int-0.015
opp-0.015
imet-0.015
Token mom
Token ID1995
Feature activation-8.53e-02

Pos logit contributions
add+0.068
utt+0.057
urd+0.052
opp+0.051
 int+0.050
Neg logit contributions
 directions-0.037
 book-0.036
liness-0.035
 lap-0.034
 your-0.033
Tokenmy
Token ID1820
Feature activation-3.26e-01

Pos logit contributions
add+0.003
 int+0.003
opp+0.003
utt+0.003
 rum+0.003
Neg logit contributions
 directions-0.002
liness-0.001
 mats-0.001
 lap-0.001
 zone-0.001
Token if
Token ID611
Feature activation-1.38e+00

Pos logit contributions
add+0.011
 int+0.009
utt+0.009
opp+0.009
 ven+0.008
Neg logit contributions
liness-0.007
 directions-0.007
 mats-0.007
 lap-0.007
 zone-0.007
Token she
Token ID673
Feature activation-7.60e-01

Pos logit contributions
add+0.050
 int+0.039
opp+0.039
utt+0.039
 rum+0.038
Neg logit contributions
 directions-0.028
liness-0.028
 mats-0.026
 lap-0.026
 zone-0.026
Token could
Token ID714
Feature activation-7.44e-01

Pos logit contributions
add+0.032
 int+0.028
 attent+0.026
andel+0.026
urd+0.026
Neg logit contributions
 seek-0.010
 follows-0.009
 tag-0.009
 zone-0.009
 directions-0.009
Token have
Token ID423
Feature activation-3.76e+00

Pos logit contributions
add+0.029
 int+0.025
utt+0.024
 attent+0.023
 ven+0.023
Neg logit contributions
 lap-0.012
 tag-0.011
 seek-0.011
liness-0.010
 leash-0.010
Token one
Token ID530
Feature activation-1.56e+00

Pos logit contributions
add+0.151
utt+0.138
 int+0.138
icker+0.127
utter+0.125
Neg logit contributions
 your-0.059
 directions-0.054
 newspapers-0.052
 yourself-0.052
 tag-0.052
Token of
Token ID286
Feature activation-1.38e+00

Pos logit contributions
add+0.056
utt+0.044
 int+0.043
urd+0.041
opp+0.041
Neg logit contributions
 lap-0.033
 directions-0.033
liness-0.031
 leash-0.030
 zone-0.030
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
add+0.058
utt+0.049
 int+0.047
opp+0.046
urd+0.044
Neg logit contributions
 directions-0.021
liness-0.019
 newspapers-0.018
 your-0.018
 lap-0.018
Token eggs
Token ID9653
Feature activation-3.57e-01

Pos logit contributions
add+0.084
utt+0.075
 int+0.068
opp+0.065
raper+0.065
Neg logit contributions
 directions-0.044
 newspapers-0.039
 lap-0.039
liness-0.037
 other-0.037
Token.
Token ID13
Feature activation-9.01e-01

Pos logit contributions
add+0.010
opp+0.007
 int+0.007
utt+0.007
 rum+0.007
Neg logit contributions
 directions-0.010
liness-0.010
 mats-0.010
 lap-0.009
 zone-0.009
Token was
Token ID373
Feature activation-4.43e-01

Pos logit contributions
 directions+0.009
liness+0.009
 mats+0.008
 lap+0.008
 zone+0.008
Neg logit contributions
add-0.016
 int-0.013
utt-0.013
opp-0.013
 ven-0.012
Token a
Token ID257
Feature activation-1.25e-01

Pos logit contributions
add+0.015
utt+0.012
 int+0.012
opp+0.012
 ven+0.012
Neg logit contributions
 directions-0.009
 mats-0.009
liness-0.009
 lap-0.009
 seek-0.009
Token p
Token ID279
Feature activation+1.14e-04

Pos logit contributions
add+0.005
utt+0.004
 int+0.004
uned+0.004
opp+0.004
Neg logit contributions
 sandbox-0.002
 seek-0.002
 tag-0.002
 lap-0.002
 leash-0.002
Tokenanda
Token ID5282
Feature activation-8.20e-01

Pos logit contributions
liness+0.000
 directions+0.000
 sandbox+0.000
 mats+0.000
 lap+0.000
Neg logit contributions
add-0.000
 int-0.000
 rum-0.000
 ven-0.000
opp-0.000
Token named
Token ID3706
Feature activation-2.27e-01

Pos logit contributions
add+0.030
 int+0.023
utt+0.023
opp+0.023
 ven+0.022
Neg logit contributions
liness-0.017
 directions-0.016
 lap-0.016
 zone-0.015
 mats-0.015
Token P
Token ID350
Feature activation-3.75e+00

Pos logit contributions
add+0.007
 int+0.005
 ven+0.005
opp+0.005
utt+0.005
Neg logit contributions
 directions-0.006
liness-0.006
 seek-0.006
 mats-0.006
 lap-0.006
Tokenandy
Token ID10757
Feature activation-2.43e+00

Pos logit contributions
 int+0.099
add+0.093
 spe+0.092
 rum+0.090
 eruption+0.090
Neg logit contributions
liness-0.092
 sandbox-0.086
 yourself-0.085
 directions-0.081
 zone-0.080
Token.
Token ID13
Feature activation+3.48e-01

Pos logit contributions
add+0.080
 int+0.061
utt+0.059
opp+0.059
 ven+0.057
Neg logit contributions
 directions-0.057
liness-0.057
 lap-0.055
 mats-0.054
 zone-0.053
Token P
Token ID350
Feature activation-2.42e+00

Pos logit contributions
 directions+0.009
liness+0.009
 mats+0.008
 lap+0.008
 zone+0.008
Neg logit contributions
add-0.011
 int-0.008
opp-0.008
utt-0.008
 ven-0.008
Tokenandy
Token ID10757
Feature activation-7.05e-01

Pos logit contributions
add+0.061
 int+0.059
 rum+0.056
 attent+0.054
 spe+0.053
Neg logit contributions
liness-0.063
 yourself-0.057
 directions-0.057
 sandbox-0.057
 mats-0.055
Token was
Token ID373
Feature activation+2.92e-01

Pos logit contributions
add+0.025
 int+0.020
opp+0.020
utt+0.020
 ven+0.019
Neg logit contributions
 directions-0.014
liness-0.014
 mats-0.013
 lap-0.013
 zone-0.013
Token there
Token ID612
Feature activation+3.65e-01

Pos logit contributions
add+0.002
 int+0.001
utt+0.001
opp+0.001
 ven+0.001
Neg logit contributions
 directions-0.001
liness-0.001
 mats-0.001
 seek-0.001
 lap-0.001
Token was
Token ID373
Feature activation-4.97e-01

Pos logit contributions
 directions+0.008
liness+0.008
 mats+0.007
 lap+0.007
 zone+0.007
Neg logit contributions
add-0.013
opp-0.010
 int-0.010
utt-0.010
 rum-0.009
Token a
Token ID257
Feature activation-1.77e-01

Pos logit contributions
add+0.017
utt+0.013
 int+0.013
opp+0.013
 ven+0.013
Neg logit contributions
 directions-0.011
 mats-0.011
liness-0.010
 lap-0.010
 seek-0.010
Token spider
Token ID19230
Feature activation-6.99e-01

Pos logit contributions
add+0.007
utt+0.005
 int+0.005
uned+0.005
opp+0.005
Neg logit contributions
 sandbox-0.004
 seek-0.003
 lap-0.003
 tag-0.003
 mats-0.003
Token named
Token ID3706
Feature activation-3.28e-01

Pos logit contributions
add+0.024
 int+0.018
opp+0.018
utt+0.018
 ven+0.017
Neg logit contributions
liness-0.016
 directions-0.015
 lap-0.015
 mats-0.015
 zone-0.014
Token Sp
Token ID1338
Feature activation-3.73e+00

Pos logit contributions
add+0.013
utt+0.010
 int+0.010
opp+0.010
 ven+0.009
Neg logit contributions
 mats-0.006
 seek-0.006
 directions-0.006
 lap-0.006
liness-0.006
Tokenide
Token ID485
Feature activation-1.67e+00

Pos logit contributions
add+0.116
utt+0.100
 int+0.100
 rum+0.098
imet+0.093
Neg logit contributions
liness-0.077
 mats-0.077
 zone-0.073
 seek-0.073
 mat-0.071
Tokeny
Token ID88
Feature activation-6.04e-01

Pos logit contributions
add+0.061
 int+0.049
utt+0.049
opp+0.047
 ven+0.047
Neg logit contributions
liness-0.032
 zone-0.032
 directions-0.031
 mats-0.030
 seek-0.028
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
add+0.024
utt+0.019
 int+0.018
opp+0.018
imet+0.017
Neg logit contributions
 lap-0.011
liness-0.011
 tag-0.010
 directions-0.010
 mats-0.010
Token Sp
Token ID1338
Feature activation-2.99e+00

Pos logit contributions
add+0.030
 int+0.021
opp+0.021
 ven+0.021
utt+0.020
Neg logit contributions
 directions-0.034
liness-0.034
 mats-0.034
 lap-0.033
 seek-0.032
Tokenide
Token ID485
Feature activation-9.18e-01

Pos logit contributions
add+0.089
utt+0.076
 int+0.075
 rum+0.073
opp+0.070
Neg logit contributions
liness-0.069
 mats-0.066
 zone-0.064
 seek-0.064
 directions-0.062
Token Joe
Token ID5689
Feature activation-7.31e-01

Pos logit contributions
add+0.019
 int+0.014
opp+0.014
utt+0.014
 ven+0.013
Neg logit contributions
liness-0.018
 directions-0.018
 mats-0.017
 lap-0.017
 zone-0.017
Token.
Token ID13
Feature activation+9.15e-02

Pos logit contributions
add+0.020
 int+0.015
opp+0.015
utt+0.014
 rum+0.014
Neg logit contributions
 directions-0.021
liness-0.021
 mats-0.020
 zone-0.019
 lap-0.019
Token The
Token ID383
Feature activation+1.01e+00

Pos logit contributions
 directions+0.002
liness+0.002
 mats+0.002
 lap+0.002
 zone+0.002
Neg logit contributions
add-0.003
 int-0.002
opp-0.002
utt-0.002
 ven-0.002
Token dog
Token ID3290
Feature activation+1.36e+00

Pos logit contributions
 directions+0.022
liness+0.021
 zone+0.019
 mats+0.019
 lap+0.019
Neg logit contributions
add-0.036
 int-0.027
 rum-0.027
 ven-0.027
utt-0.026
Token was
Token ID373
Feature activation+1.02e+00

Pos logit contributions
 directions+0.028
liness+0.028
 mats+0.026
 zone+0.026
 lap+0.025
Neg logit contributions
add-0.048
 int-0.038
opp-0.038
utt-0.038
 rum-0.037
Token always
Token ID1464
Feature activation+2.12e+00

Pos logit contributions
 directions+0.023
liness+0.023
 lap+0.021
 seek+0.021
 mats+0.021
Neg logit contributions
add-0.035
 int-0.027
utt-0.027
opp-0.027
 ven-0.026
Token happy
Token ID3772
Feature activation-5.69e-02

Pos logit contributions
 mats+0.055
liness+0.053
 directions+0.052
 lap+0.051
 zone+0.050
Neg logit contributions
add-0.061
opp-0.047
 rum-0.047
 int-0.045
 ven-0.044
Token to
Token ID284
Feature activation+1.26e+00

Pos logit contributions
add+0.002
 int+0.001
utt+0.001
opp+0.001
 rum+0.001
Neg logit contributions
 directions-0.001
liness-0.001
 mats-0.001
 lap-0.001
 zone-0.001
Token help
Token ID1037
Feature activation+1.52e+00

Pos logit contributions
liness+0.028
 directions+0.028
 mats+0.027
 zone+0.026
 lap+0.026
Neg logit contributions
add-0.043
opp-0.033
utt-0.033
 int-0.032
 ven-0.032
Token Farmer
Token ID37755
Feature activation-4.18e-01

Pos logit contributions
liness+0.031
 mats+0.031
 sandbox+0.030
 lap+0.030
 directions+0.030
Neg logit contributions
add-0.053
 int-0.042
opp-0.041
 rum-0.040
utt-0.040
Token Joe
Token ID5689
Feature activation-5.92e-01

Pos logit contributions
add+0.012
 int+0.009
utt+0.009
opp+0.009
 rum+0.009
Neg logit contributions
 directions-0.011
liness-0.011
 mats-0.010
 lap-0.010
 zone-0.010
Token want
Token ID765
Feature activation+8.91e-01

Pos logit contributions
add+0.008
 int+0.006
utt+0.006
opp+0.006
 ven+0.006
Neg logit contributions
liness-0.004
 directions-0.004
 zone-0.004
 lap-0.004
 mats-0.004
Token to
Token ID284
Feature activation+6.95e-01

Pos logit contributions
liness+0.030
 zone+0.028
 directions+0.028
 lap+0.028
 sandbox+0.027
Neg logit contributions
add-0.020
 int-0.014
utt-0.014
opp-0.013
 rum-0.012
Token be
Token ID307
Feature activation+1.25e+00

Pos logit contributions
liness+0.017
 directions+0.017
 mats+0.016
 zone+0.015
 lap+0.015
Neg logit contributions
add-0.022
opp-0.017
 int-0.017
utt-0.017
 rum-0.016
Token independent
Token ID4795
Feature activation+1.67e+00

Pos logit contributions
liness+0.031
 directions+0.030
 mats+0.029
 lap+0.029
 zone+0.029
Neg logit contributions
add-0.037
aze-0.029
opp-0.028
ores-0.028
 twenty-0.027
Token and
Token ID290
Feature activation+1.34e+00

Pos logit contributions
 directions+0.043
liness+0.042
 lap+0.041
 mats+0.040
 sandbox+0.040
Neg logit contributions
add-0.048
 int-0.038
 rum-0.036
opp-0.036
utt-0.035
Token lead
Token ID1085
Feature activation+2.63e+00

Pos logit contributions
liness+0.035
 directions+0.033
 mats+0.031
 zone+0.031
 lap+0.031
Neg logit contributions
add-0.039
 int-0.031
utt-0.031
 rum-0.031
 ven-0.030
Token too
Token ID1165
Feature activation+1.40e+00

Pos logit contributions
liness+0.060
 sandbox+0.059
 zone+0.057
 lap+0.055
 directions+0.054
Neg logit contributions
add-0.083
 int-0.066
opp-0.063
utt-0.063
 attent-0.060
Token."
Token ID526
Feature activation+8.82e-03

Pos logit contributions
 directions+0.031
liness+0.031
 mats+0.029
 lap+0.029
 zone+0.029
Neg logit contributions
add-0.048
 int-0.037
opp-0.037
utt-0.037
 ven-0.035
Token\n
Token ID198
Feature activation+1.81e-01

Pos logit contributions
 directions+0.000
liness+0.000
 zone+0.000
 seek+0.000
 mats+0.000
Neg logit contributions
add-0.000
 int-0.000
opp-0.000
utt-0.000
 rum-0.000
Token\n
Token ID198
Feature activation+5.30e-01

Pos logit contributions
 directions+0.005
liness+0.005
 mats+0.005
 lap+0.005
 zone+0.005
Neg logit contributions
add-0.005
 int-0.003
opp-0.003
utt-0.003
 ven-0.003
TokenMax
Token ID11518
Feature activation+7.16e-01

Pos logit contributions
iest+0.016
 zone+0.015
 goggles+0.014
liness+0.014
 directions+0.013
Neg logit contributions
add-0.011
 int-0.011
Hours-0.011
 attent-0.011
ths-0.010
Token\n
Token ID198
Feature activation+2.90e-01

Pos logit contributions
 directions+0.005
liness+0.005
 lap+0.004
 mats+0.004
 yourself+0.004
Neg logit contributions
add-0.005
 int-0.004
opp-0.004
utt-0.004
 rum-0.004
TokenThey
Token ID2990
Feature activation+1.32e-01

Pos logit contributions
liness+0.006
 zone+0.006
 directions+0.006
 lap+0.005
shirt+0.005
Neg logit contributions
add-0.010
opp-0.008
 int-0.008
 ven-0.008
icker-0.008
Token held
Token ID2714
Feature activation+1.18e+00

Pos logit contributions
 directions+0.003
liness+0.003
 mats+0.003
 lap+0.003
 zone+0.003
Neg logit contributions
add-0.004
 int-0.003
opp-0.003
utt-0.003
 ven-0.003
Token their
Token ID511
Feature activation+7.36e-01

Pos logit contributions
liness+0.023
 directions+0.023
 seek+0.021
 zone+0.021
 mats+0.021
Neg logit contributions
add-0.044
 int-0.035
opp-0.034
utt-0.034
 rum-0.033
Token k
Token ID479
Feature activation+2.13e+00

Pos logit contributions
 directions+0.018
liness+0.017
 mats+0.017
 lap+0.017
 zone+0.016
Neg logit contributions
add-0.024
utt-0.018
 int-0.018
opp-0.018
 ven-0.017
Tokenites
Token ID2737
Feature activation+2.28e+00

Pos logit contributions
liness+0.039
 directions+0.038
 mats+0.035
 lap+0.035
 zone+0.034
Neg logit contributions
add-0.082
opp-0.065
utt-0.065
 int-0.065
 ven-0.062
Token and
Token ID290
Feature activation+3.72e-02

Pos logit contributions
liness+0.063
Found+0.060
 mats+0.060
 Seek+0.060
 yourself+0.059
Neg logit contributions
add-0.060
 int-0.050
 once-0.045
opp-0.044
utt-0.043
Token their
Token ID511
Feature activation+3.57e-01

Pos logit contributions
liness+0.001
 zone+0.001
 yourself+0.001
 directions+0.001
rd+0.001
Neg logit contributions
add-0.001
 rum-0.001
 ven-0.001
utt-0.001
 int-0.001
Token hands
Token ID2832
Feature activation+2.51e-01

Pos logit contributions
 directions+0.009
liness+0.009
 mats+0.009
 lap+0.008
 zone+0.008
Neg logit contributions
add-0.011
 int-0.008
utt-0.008
opp-0.008
 ven-0.008
Token.
Token ID13
Feature activation+6.17e-01

Pos logit contributions
 directions+0.006
liness+0.006
 mats+0.006
 lap+0.006
 zone+0.006
Neg logit contributions
add-0.008
opp-0.006
 int-0.006
utt-0.006
 ven-0.006
Token They
Token ID1119
Feature activation-6.36e-01

Pos logit contributions
 directions+0.012
 mats+0.012
liness+0.012
 lap+0.012
 zone+0.011
Neg logit contributions
add-0.023
opp-0.018
 int-0.018
utt-0.017
 rum-0.017
Token recording
Token ID8296
Feature activation+1.30e+00

Pos logit contributions
 directions+0.014
liness+0.014
 mats+0.013
 lap+0.013
 zone+0.013
Neg logit contributions
add-0.022
opp-0.016
utt-0.016
 ven-0.016
 int-0.016
Token.
Token ID13
Feature activation-3.41e-02

Pos logit contributions
liness+0.030
 directions+0.030
 lap+0.028
 mats+0.028
 zone+0.028
Neg logit contributions
add-0.042
 int-0.033
opp-0.032
utt-0.032
 rum-0.031
Token They
Token ID1119
Feature activation-5.75e-01

Pos logit contributions
add+0.001
 int+0.001
opp+0.001
 ven+0.001
utt+0.001
Neg logit contributions
 directions-0.001
liness-0.001
 mats-0.001
 lap-0.001
 newspapers-0.001
Token threw
Token ID9617
Feature activation+9.41e-01

Pos logit contributions
add+0.020
omm+0.016
utt+0.016
opp+0.016
 int+0.015
Neg logit contributions
 directions-0.013
liness-0.012
 tag-0.012
 mats-0.011
 seek-0.011
Token the
Token ID262
Feature activation+1.99e+00

Pos logit contributions
liness+0.021
 directions+0.021
 lap+0.020
 zone+0.020
 mats+0.020
Neg logit contributions
add-0.031
 int-0.024
opp-0.024
utt-0.023
 rum-0.023
Token balls
Token ID11333
Feature activation+2.39e+00

Pos logit contributions
liness+0.048
 yourself+0.047
 zone+0.047
 directions+0.046
ability+0.045
Neg logit contributions
add-0.053
 rum-0.043
 int-0.042
 once-0.040
 leak-0.040
Token up
Token ID510
Feature activation+1.03e+00

Pos logit contributions
liness+0.069
 directions+0.065
 zone+0.063
 mats+0.062
 lap+0.062
Neg logit contributions
add-0.067
 int-0.053
utt-0.049
opp-0.047
 rum-0.047
Token and
Token ID290
Feature activation+2.63e-01

Pos logit contributions
 directions+0.021
 mats+0.019
liness+0.019
 lap+0.018
 seek+0.018
Neg logit contributions
add-0.039
opp-0.031
utt-0.031
 int-0.030
 ven-0.029
Token down
Token ID866
Feature activation-2.77e-01

Pos logit contributions
liness+0.005
 directions+0.005
 zone+0.005
 lap+0.005
 mats+0.004
Neg logit contributions
add-0.010
 int-0.008
 rum-0.008
utt-0.008
opp-0.007
Token,
Token ID11
Feature activation+3.33e-01

Pos logit contributions
add+0.011
utt+0.008
opp+0.008
 int+0.008
 ven+0.008
Neg logit contributions
 directions-0.005
liness-0.005
 mats-0.005
 lap-0.005
 tag-0.005
Token laughing
Token ID14376
Feature activation+9.66e-02

Pos logit contributions
liness+0.008
 directions+0.008
 zone+0.008
 yourself+0.007
 mats+0.007
Neg logit contributions
add-0.010
 int-0.009
 rum-0.008
 attent-0.008
utt-0.008
Token walking
Token ID6155
Feature activation+1.20e+00

Pos logit contributions
 lap+0.015
 your+0.014
liness+0.014
 seek+0.014
 directions+0.013
Neg logit contributions
add-0.032
utt-0.026
opp-0.026
 ven-0.026
aff-0.025
Token through
Token ID832
Feature activation+1.58e+00

Pos logit contributions
liness+0.034
 directions+0.033
 mats+0.032
 lap+0.031
 zone+0.031
Neg logit contributions
add-0.033
 int-0.026
utt-0.023
 attent-0.023
 rum-0.023
Token the
Token ID262
Feature activation+2.10e+00

Pos logit contributions
liness+0.038
 directions+0.036
 zone+0.036
 lap+0.035
 mats+0.035
Neg logit contributions
add-0.049
 int-0.039
 rum-0.037
opp-0.037
utt-0.037
Token park
Token ID3952
Feature activation+1.48e+00

Pos logit contributions
liness+0.044
 directions+0.043
 lap+0.042
 mats+0.041
 zone+0.041
Neg logit contributions
add-0.073
 int-0.058
opp-0.056
utt-0.056
 rum-0.055
Token one
Token ID530
Feature activation+2.08e+00

Pos logit contributions
liness+0.043
 directions+0.042
 mats+0.041
 Seek+0.040
 zone+0.040
Neg logit contributions
add-0.038
 int-0.031
utt-0.028
 attent-0.027
opp-0.027
Token sunny
Token ID27737
Feature activation+2.33e+00

Pos logit contributions
liness+0.053
 mats+0.052
 keyboard+0.050
 newspapers+0.049
 directions+0.049
Neg logit contributions
add-0.058
 int-0.047
 rum-0.043
utt-0.043
opp-0.042
Token day
Token ID1110
Feature activation+1.84e+00

Pos logit contributions
liness+0.068
 mats+0.064
 directions+0.064
 keyboard+0.064
 Seek+0.063
Neg logit contributions
add-0.058
 once-0.043
 int-0.043
utt-0.039
 rum-0.039
Token.
Token ID13
Feature activation+8.19e-02

Pos logit contributions
liness+0.046
 directions+0.046
 mats+0.043
 lap+0.042
 zone+0.042
Neg logit contributions
add-0.057
 int-0.044
opp-0.043
utt-0.043
 rum-0.041
Token He
Token ID679
Feature activation+1.74e+00

Pos logit contributions
 directions+0.002
liness+0.002
 lap+0.002
 mats+0.002
 zone+0.002
Neg logit contributions
add-0.003
opp-0.002
utt-0.002
 int-0.002
 ven-0.002
Token saw
Token ID2497
Feature activation+1.18e+00

Pos logit contributions
 directions+0.042
liness+0.042
 mats+0.040
 lap+0.039
 zone+0.039
Neg logit contributions
add-0.056
 int-0.042
opp-0.042
utt-0.042
 rum-0.040
Token a
Token ID257
Feature activation+9.25e-01

Pos logit contributions
liness+0.039
 zone+0.038
 lap+0.038
 directions+0.037
 mats+0.036
Neg logit contributions
add-0.028
 int-0.017
utt-0.017
 rum-0.017
 once-0.016
Token.
Token ID13
Feature activation+4.95e-01

Pos logit contributions
 directions+0.003
liness+0.003
 mats+0.002
 lap+0.002
 zone+0.002
Neg logit contributions
add-0.003
 int-0.002
opp-0.002
utt-0.002
 rum-0.002
Token That
Token ID1320
Feature activation+7.36e-01

Pos logit contributions
 directions+0.010
liness+0.010
 mats+0.010
 lap+0.009
 zone+0.009
Neg logit contributions
add-0.017
 int-0.014
utt-0.014
opp-0.014
 ven-0.013
Token is
Token ID318
Feature activation-5.61e-01

Pos logit contributions
 directions+0.014
 mats+0.013
 lap+0.013
liness+0.013
 seek+0.012
Neg logit contributions
add-0.025
 int-0.022
 rum-0.022
opp-0.021
 ven-0.021
Token very
Token ID845
Feature activation+1.23e+00

Pos logit contributions
add+0.022
utt+0.017
opp+0.017
 int+0.016
icker+0.016
Neg logit contributions
 directions-0.011
 yourself-0.011
liness-0.010
 your-0.010
 seek-0.010
Token creative
Token ID7325
Feature activation+1.29e+00

Pos logit contributions
 directions+0.022
 yourself+0.020
 seek+0.020
liness+0.020
 Seek+0.019
Neg logit contributions
add-0.047
utt-0.037
 int-0.037
opp-0.037
 rum-0.036
Token.
Token ID13
Feature activation+6.23e-01

Pos logit contributions
 directions+0.039
 sandbox+0.037
 lap+0.037
liness+0.037
Back+0.036
Neg logit contributions
add-0.029
 int-0.022
 once-0.022
der-0.021
 rum-0.021
Token I
Token ID314
Feature activation-8.79e-01

Pos logit contributions
 directions+0.012
liness+0.012
 mats+0.011
 lap+0.011
 zone+0.011
Neg logit contributions
add-0.023
 int-0.018
utt-0.018
opp-0.018
 ven-0.017
Token am
Token ID716
Feature activation-4.54e-01

Pos logit contributions
add+0.037
utt+0.031
 int+0.030
omm+0.030
opp+0.029
Neg logit contributions
 seek-0.012
liness-0.012
 tag-0.012
 zone-0.011
 directions-0.011
Token proud
Token ID6613
Feature activation+5.35e-01

Pos logit contributions
add+0.019
utt+0.016
opp+0.015
 int+0.014
imet+0.014
Neg logit contributions
 your-0.008
 yourself-0.007
 directions-0.007
 my-0.006
 welcome-0.006
Token of
Token ID286
Feature activation+4.61e-01

Pos logit contributions
 directions+0.011
liness+0.011
 mats+0.010
 lap+0.010
 zone+0.010
Neg logit contributions
add-0.019
 int-0.015
opp-0.015
utt-0.015
 ven-0.014
Token you
Token ID345
Feature activation+4.62e-01

Pos logit contributions
 directions+0.006
 yourself+0.006
liness+0.006
 mats+0.005
 lap+0.005
Neg logit contributions
add-0.020
 int-0.016
utt-0.016
opp-0.016
 ven-0.016
Token:
Token ID25
Feature activation+4.31e-01

Pos logit contributions
add+0.005
 int+0.004
 rum+0.003
utt+0.003
 once+0.003
Neg logit contributions
 sandbox-0.008
 directions-0.008
 seek-0.008
liness-0.008
 Seek-0.008
Token be
Token ID307
Feature activation-1.25e-01

Pos logit contributions
 directions+0.008
liness+0.008
 lap+0.008
 mats+0.008
 tag+0.008
Neg logit contributions
add-0.016
opp-0.012
utt-0.012
 int-0.012
 ven-0.012
Token wise
Token ID10787
Feature activation+8.47e-01

Pos logit contributions
add+0.004
aze+0.003
ores+0.003
 twenty+0.003
 ven+0.003
Neg logit contributions
 zone-0.003
 mats-0.003
liness-0.002
 keyboard-0.002
 lap-0.002
Token.
Token ID13
Feature activation+3.39e-01

Pos logit contributions
 directions+0.023
 sandbox+0.022
liness+0.022
 seek+0.022
 mats+0.022
Neg logit contributions
add-0.022
 int-0.017
utt-0.016
 rum-0.016
 once-0.016
Token\n
Token ID198
Feature activation+3.40e-01

Pos logit contributions
 directions+0.006
liness+0.006
 lap+0.006
 mats+0.006
 zone+0.005
Neg logit contributions
add-0.013
 int-0.010
opp-0.010
 ven-0.010
 rum-0.010
Token\n
Token ID198
Feature activation-8.24e-03

Pos logit contributions
 directions+0.009
liness+0.009
 lap+0.008
 mats+0.008
 seek+0.008
Neg logit contributions
add-0.010
 int-0.008
opp-0.007
 rum-0.007
utt-0.007
TokenBob
Token ID18861
Feature activation+4.53e-02

Pos logit contributions
add+0.000
utt+0.000
 int+0.000
opp+0.000
 rum+0.000
Neg logit contributions
liness-0.000
 zone-0.000
 directions-0.000
 mats-0.000
 lap-0.000
Token thought
Token ID1807
Feature activation-1.98e-01

Pos logit contributions
 directions+0.001
liness+0.001
 seek+0.001
 lap+0.001
 follows+0.001
Neg logit contributions
add-0.002
 ven-0.001
opp-0.001
 int-0.001
utt-0.001
Token of
Token ID286
Feature activation-7.77e-01

Pos logit contributions
add+0.005
 int+0.004
opp+0.004
utt+0.004
 ven+0.004
Neg logit contributions
 directions-0.006
liness-0.006
 mats-0.006
 lap-0.006
 zone-0.005
Token a
Token ID257
Feature activation-5.61e-01

Pos logit contributions
add+0.021
 int+0.016
opp+0.016
utt+0.015
 rum+0.015
Neg logit contributions
liness-0.022
 directions-0.021
 zone-0.021
 mats-0.021
 lap-0.021
Token plan
Token ID1410
Feature activation+7.76e-01

Pos logit contributions
add+0.018
 int+0.014
opp+0.014
utt+0.013
 ven+0.013
Neg logit contributions
liness-0.014
 directions-0.013
 mats-0.013
 zone-0.013
 lap-0.013
Token They
Token ID1119
Feature activation-8.74e-01

Pos logit contributions
add+0.003
opp+0.002
 ven+0.002
 int+0.002
utt+0.002
Neg logit contributions
 lap-0.002
 mats-0.002
 directions-0.002
liness-0.002
 sandbox-0.001
Token are
Token ID389
Feature activation-9.38e-02

Pos logit contributions
add+0.032
 int+0.026
utt+0.025
opp+0.025
omm+0.025
Neg logit contributions
 directions-0.017
liness-0.016
 mats-0.016
 lap-0.015
 sandbox-0.015
Token careful
Token ID8161
Feature activation+1.12e+00

Pos logit contributions
add+0.003
 int+0.002
opp+0.002
utt+0.002
 rum+0.002
Neg logit contributions
liness-0.002
 directions-0.002
 mats-0.002
 lap-0.002
 zone-0.002
Token not
Token ID407
Feature activation+7.58e-02

Pos logit contributions
liness+0.033
Found+0.033
 Seek+0.031
 sandbox+0.031
 mats+0.031
Neg logit contributions
add-0.028
 int-0.024
 erupt-0.022
 rum-0.021
utt-0.021
Token to
Token ID284
Feature activation+6.81e-02

Pos logit contributions
 directions+0.002
liness+0.002
 mats+0.002
 lap+0.002
 zone+0.002
Neg logit contributions
add-0.002
 int-0.002
opp-0.002
utt-0.002
 ven-0.002
Token hurt
Token ID5938
Feature activation+1.12e+00

Pos logit contributions
 directions+0.001
 seek+0.001
 tag+0.001
 mats+0.001
 lap+0.001
Neg logit contributions
add-0.003
 int-0.002
omm-0.002
utt-0.002
opp-0.002
Token them
Token ID606
Feature activation+3.84e-01

Pos logit contributions
 directions+0.029
liness+0.029
 mats+0.028
 lap+0.027
 zone+0.027
Neg logit contributions
add-0.034
 int-0.025
opp-0.025
utt-0.025
 ven-0.024
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
 directions+0.011
liness+0.011
 mats+0.011
 lap+0.011
 zone+0.011
Neg logit contributions
add-0.010
 int-0.008
opp-0.007
 rum-0.007
utt-0.007
Token\n
Token ID198
Feature activation-9.14e-02

Pos logit contributions
 lap+0.002
 directions+0.002
 mats+0.002
liness+0.002
 sandbox+0.002
Neg logit contributions
add-0.004
 ven-0.003
 int-0.003
utt-0.003
opp-0.003
Token\n
Token ID198
Feature activation-1.48e-02

Pos logit contributions
add+0.003
 rum+0.002
 ven+0.002
opp+0.002
utt+0.002
Neg logit contributions
 directions-0.002
 lap-0.002
 mats-0.002
liness-0.002
 tag-0.002
TokenOne
Token ID3198
Feature activation+8.36e-01

Pos logit contributions
add+0.000
 int+0.000
opp+0.000
 attent+0.000
 rum+0.000
Neg logit contributions
liness-0.001
 zone-0.001
 directions-0.001
 lap-0.001
 receiver-0.000
Token the
Token ID262
Feature activation+4.48e-01

Pos logit contributions
liness+0.020
 directions+0.019
 zone+0.019
 seek+0.019
 sandbox+0.019
Neg logit contributions
add-0.025
 int-0.019
 rum-0.018
opp-0.018
 once-0.018
Token curtains
Token ID41160
Feature activation+8.34e-02

Pos logit contributions
liness+0.010
ability+0.010
 zone+0.010
 pads+0.010
 yourself+0.010
Neg logit contributions
add-0.013
 rum-0.011
 int-0.011
 erupt-0.010
opp-0.010
Token,
Token ID11
Feature activation-1.97e-01

Pos logit contributions
 Seek+0.002
 seek+0.002
 directions+0.002
 yourself+0.002
liness+0.002
Neg logit contributions
add-0.002
 int-0.002
 rum-0.002
opp-0.001
utt-0.001
Token and
Token ID290
Feature activation+4.74e-02

Pos logit contributions
add+0.004
 rum+0.004
 int+0.004
 once+0.003
 attent+0.003
Neg logit contributions
liness-0.006
 directions-0.006
 Seek-0.006
ability-0.006
 zone-0.006
Token in
Token ID287
Feature activation+7.98e-01

Pos logit contributions
liness+0.001
 directions+0.001
 zone+0.001
 mats+0.001
 lap+0.001
Neg logit contributions
add-0.001
 int-0.001
utt-0.001
opp-0.001
 rum-0.001
Token the
Token ID262
Feature activation+3.07e-01

Pos logit contributions
liness+0.019
 zone+0.018
 directions+0.018
 pads+0.018
 mats+0.018
Neg logit contributions
add-0.023
 int-0.019
 rum-0.018
opp-0.018
utt-0.017
Token clos
Token ID3542
Feature activation+1.61e-01

Pos logit contributions
liness+0.006
 zone+0.006
 directions+0.006
 yourself+0.006
 pads+0.006
Neg logit contributions
add-0.010
 rum-0.008
 int-0.008
 ven-0.008
 attent-0.008
Tokenets
Token ID1039
Feature activation+1.78e-01

Pos logit contributions
 seek+0.004
liness+0.004
 Seek+0.004
Your+0.004
 directions+0.004
Neg logit contributions
add-0.005
utt-0.004
 int-0.004
 rum-0.004
une-0.004
Token.
Token ID13
Feature activation+4.02e-01

Pos logit contributions
liness+0.005
 directions+0.005
 mats+0.005
 zone+0.005
 newspapers+0.005
Neg logit contributions
add-0.004
 int-0.003
utt-0.003
 rum-0.003
opp-0.003
Token He
Token ID679
Feature activation+2.98e-01

Pos logit contributions
 mats+0.008
 lap+0.007
 directions+0.007
liness+0.007
 follows+0.006
Neg logit contributions
add-0.015
opp-0.012
utt-0.012
 ven-0.012
 int-0.012
Token could
Token ID714
Feature activation+8.47e-01

Pos logit contributions
 directions+0.005
 seek+0.005
 tag+0.005
 mats+0.005
liness+0.005
Neg logit contributions
add-0.012
 int-0.009
opp-0.009
utt-0.009
 ven-0.009
Token cup
Token ID6508
Feature activation+1.28e-01

Pos logit contributions
liness+0.036
 directions+0.035
 zone+0.034
 Seek+0.033
 seek+0.033
Neg logit contributions
add-0.052
 int-0.043
 rum-0.042
 ven-0.040
 attent-0.040
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
 directions+0.003
liness+0.003
 mats+0.003
 lap+0.003
 zone+0.003
Neg logit contributions
add-0.004
 int-0.003
opp-0.003
utt-0.003
 rum-0.003
Token He
Token ID679
Feature activation+5.93e-01

Pos logit contributions
add+0.005
 int+0.004
opp+0.004
 rum+0.004
utt+0.004
Neg logit contributions
 lap-0.003
 mats-0.003
 directions-0.003
liness-0.003
 tag-0.002
Token spills
Token ID42123
Feature activation-7.59e-01

Pos logit contributions
liness+0.009
 mats+0.009
 directions+0.009
 lap+0.008
 follows+0.008
Neg logit contributions
add-0.025
opp-0.020
utt-0.020
 int-0.019
 ven-0.019
Token the
Token ID262
Feature activation+3.61e-01

Pos logit contributions
add+0.028
opp+0.023
utt+0.023
 int+0.022
 ven+0.022
Neg logit contributions
 directions-0.015
 mats-0.014
liness-0.014
 lap-0.013
 tag-0.013
Token coffee
Token ID6891
Feature activation+7.57e-01

Pos logit contributions
 directions+0.007
liness+0.007
 zone+0.007
 mats+0.007
 lap+0.007
Neg logit contributions
add-0.013
 int-0.010
opp-0.010
utt-0.010
 rum-0.010
Token on
Token ID319
Feature activation+2.47e-01

Pos logit contributions
 directions+0.019
liness+0.018
 seek+0.017
 zone+0.017
 Seek+0.017
Neg logit contributions
add-0.023
 int-0.018
 rum-0.018
opp-0.017
 ven-0.017
Token the
Token ID262
Feature activation+2.42e-01

Pos logit contributions
 directions+0.005
liness+0.005
 mats+0.005
 lap+0.004
 zone+0.004
Neg logit contributions
add-0.009
 int-0.007
opp-0.007
utt-0.007
 ven-0.007
Token table
Token ID3084
Feature activation+2.25e-01

Pos logit contributions
liness+0.006
 directions+0.005
 zone+0.005
 Seek+0.005
ability+0.005
Neg logit contributions
add-0.007
 int-0.006
 rum-0.006
 attent-0.006
 ven-0.006
Token and
Token ID290
Feature activation-5.23e-01

Pos logit contributions
 directions+0.007
liness+0.007
 Seek+0.006
 zone+0.006
 receiver+0.006
Neg logit contributions
add-0.005
 int-0.004
 rum-0.004
 erupt-0.004
opp-0.004
Token on
Token ID319
Feature activation+6.48e-02

Pos logit contributions
add+0.018
 int+0.014
utt+0.014
opp+0.014
 rum+0.013
Neg logit contributions
 directions-0.011
liness-0.011
 zone-0.011
 lap-0.011
 mats-0.010
Token and
Token ID290
Feature activation+2.41e-01

Pos logit contributions
add+0.005
opp+0.004
 int+0.004
utt+0.004
 ven+0.004
Neg logit contributions
 directions-0.003
liness-0.003
 mats-0.003
 lap-0.003
 zone-0.003
Token ran
Token ID4966
Feature activation+2.09e-01

Pos logit contributions
liness+0.005
 directions+0.005
 mats+0.005
 zone+0.005
 lap+0.005
Neg logit contributions
add-0.008
utt-0.006
 int-0.006
opp-0.006
 rum-0.006
Token around
Token ID1088
Feature activation-5.08e-01

Pos logit contributions
liness+0.005
 directions+0.005
 mats+0.005
 lap+0.005
 zone+0.005
Neg logit contributions
add-0.007
 int-0.005
utt-0.005
opp-0.005
 rum-0.005
Token playing
Token ID2712
Feature activation+4.02e-01

Pos logit contributions
add+0.019
utt+0.015
opp+0.014
 int+0.014
omm+0.014
Neg logit contributions
 directions-0.010
liness-0.010
 mats-0.010
 lap-0.009
 tag-0.009
Token catch
Token ID4929
Feature activation+5.10e-01

Pos logit contributions
 directions+0.011
liness+0.011
 zone+0.011
 lap+0.010
 receiver+0.010
Neg logit contributions
add-0.011
 int-0.008
 attent-0.008
opp-0.007
 ven-0.007
Token until
Token ID1566
Feature activation-1.55e+00

Pos logit contributions
liness+0.013
 directions+0.013
 lap+0.012
 zone+0.012
 mats+0.012
Neg logit contributions
add-0.015
 int-0.012
opp-0.011
utt-0.011
 rum-0.011
Token it
Token ID340
Feature activation-8.88e-01

Pos logit contributions
add+0.058
 ven+0.048
omm+0.048
 int+0.047
opp+0.047
Neg logit contributions
 directions-0.027
 mats-0.024
liness-0.023
 tag-0.023
 seek-0.023
Token was
Token ID373
Feature activation-2.66e-01

Pos logit contributions
add+0.036
opp+0.030
 int+0.029
utt+0.029
 ven+0.029
Neg logit contributions
 directions-0.014
liness-0.014
 seek-0.012
 tag-0.012
 mats-0.012
Token time
Token ID640
Feature activation+5.04e-01

Pos logit contributions
add+0.009
opp+0.007
utt+0.007
 ven+0.007
engu+0.007
Neg logit contributions
 directions-0.006
 seek-0.006
liness-0.006
 tag-0.006
 your-0.005
Token for
Token ID329
Feature activation-6.55e-01

Pos logit contributions
 directions+0.014
liness+0.013
 mats+0.013
 lap+0.013
 zone+0.013
Neg logit contributions
add-0.015
 int-0.011
opp-0.011
utt-0.011
 ven-0.010
Token G
Token ID402
Feature activation-1.87e+00

Pos logit contributions
add+0.023
 int+0.019
utt+0.018
une+0.018
 rum+0.018
Neg logit contributions
 directions-0.014
 mats-0.012
 tag-0.012
 yourself-0.012
liness-0.012
Token she
Token ID673
Feature activation-6.67e-01

Pos logit contributions
add+0.015
opp+0.012
 int+0.012
utt+0.012
 ven+0.012
Neg logit contributions
 directions-0.006
liness-0.005
 mats-0.005
 lap-0.005
 zone-0.005
Token suddenly
Token ID6451
Feature activation-4.49e-01

Pos logit contributions
add+0.024
opp+0.019
utt+0.019
 int+0.019
omm+0.018
Neg logit contributions
 directions-0.013
liness-0.013
 mats-0.012
 seek-0.012
 zone-0.012
Token heard
Token ID2982
Feature activation-2.05e-01

Pos logit contributions
add+0.016
 int+0.012
utt+0.012
opp+0.012
 ven+0.012
Neg logit contributions
 directions-0.010
liness-0.010
 mats-0.009
 zone-0.009
 lap-0.009
Token a
Token ID257
Feature activation-1.46e+00

Pos logit contributions
add+0.007
opp+0.006
utt+0.005
 ven+0.005
 int+0.005
Neg logit contributions
 directions-0.005
liness-0.004
 mats-0.004
 lap-0.004
 tag-0.004
Token sound
Token ID2128
Feature activation-1.59e+00

Pos logit contributions
add+0.052
utt+0.040
opp+0.040
 int+0.039
omm+0.037
Neg logit contributions
 directions-0.032
liness-0.030
 lap-0.029
 mats-0.029
 seek-0.028
Token that
Token ID326
Feature activation-7.64e-01

Pos logit contributions
add+0.057
opp+0.047
utt+0.044
omm+0.043
 ven+0.042
Neg logit contributions
 directions-0.033
liness-0.033
 zone-0.031
 lap-0.031
 tag-0.030
Token made
Token ID925
Feature activation-9.81e-01

Pos logit contributions
add+0.030
opp+0.024
utt+0.023
 int+0.022
omm+0.022
Neg logit contributions
 directions-0.015
liness-0.015
 seek-0.013
 zone-0.013
 mats-0.013
Token her
Token ID607
Feature activation-3.97e-01

Pos logit contributions
add+0.045
opp+0.041
utt+0.040
raper+0.039
 ven+0.039
Neg logit contributions
 directions-0.011
liness-0.008
 tag-0.007
 yourself-0.005
 mats-0.005
Token stop
Token ID2245
Feature activation-6.46e-01

Pos logit contributions
add+0.013
opp+0.010
utt+0.010
 ven+0.010
 int+0.010
Neg logit contributions
 directions-0.010
liness-0.009
 lap-0.009
 zone-0.009
 mats-0.009
Token.
Token ID13
Feature activation-2.99e-01

Pos logit contributions
add+0.023
 ven+0.018
opp+0.018
omm+0.017
utt+0.017
Neg logit contributions
 directions-0.014
liness-0.014
 zone-0.013
 lap-0.012
 yourself-0.012
Token It
Token ID632
Feature activation-4.98e-01

Pos logit contributions
add+0.011
opp+0.009
utt+0.009
 int+0.009
 ven+0.008
Neg logit contributions
liness-0.006
 directions-0.006
 lap-0.006
 mats-0.005
 zone-0.005
Token,
Token ID11
Feature activation-8.69e-01

Pos logit contributions
add+0.016
 int+0.013
opp+0.013
utt+0.012
 rum+0.012
Neg logit contributions
 directions-0.009
liness-0.009
 mats-0.008
 zone-0.008
 lap-0.008
Token Mill
Token ID9212
Feature activation-1.12e+00

Pos logit contributions
add+0.022
 int+0.015
opp+0.015
utt+0.015
 ven+0.014
Neg logit contributions
 directions-0.027
liness-0.027
 mats-0.026
 lap-0.025
 zone-0.025
Tokenie
Token ID494
Feature activation-1.05e-01

Pos logit contributions
add+0.029
 int+0.020
opp+0.020
utt+0.020
 rum+0.018
Neg logit contributions
 directions-0.034
liness-0.034
 mats-0.034
 lap-0.033
 zone-0.033
Token's
Token ID338
Feature activation+3.42e-01

Pos logit contributions
add+0.004
 int+0.003
utt+0.003
opp+0.003
 ven+0.003
Neg logit contributions
liness-0.002
 directions-0.002
 lap-0.002
 mats-0.002
 seek-0.002
Token mum
Token ID25682
Feature activation+1.90e-01

Pos logit contributions
liness+0.011
 directions+0.011
 mats+0.011
 zone+0.010
 lap+0.010
Neg logit contributions
add-0.008
 int-0.006
opp-0.005
utt-0.005
 rum-0.005
Token put
Token ID1234
Feature activation-6.59e-01

Pos logit contributions
liness+0.004
 directions+0.004
 lap+0.004
 seek+0.003
 tag+0.003
Neg logit contributions
add-0.007
 int-0.005
utt-0.005
opp-0.005
 ven-0.005
Token the
Token ID262
Feature activation+7.43e-01

Pos logit contributions
add+0.025
 int+0.020
utt+0.020
opp+0.020
 ven+0.019
Neg logit contributions
 directions-0.013
liness-0.012
 lap-0.011
 mats-0.011
 zone-0.011
Token little
Token ID1310
Feature activation+7.69e-01

Pos logit contributions
liness+0.018
 zone+0.016
 pads+0.016
 Seek+0.016
 directions+0.016
Neg logit contributions
add-0.025
 int-0.018
 rum-0.018
 once-0.017
opp-0.017
Token girl
Token ID2576
Feature activation+7.85e-01

Pos logit contributions
liness+0.017
 directions+0.017
 mats+0.016
 lap+0.016
 Seek+0.016
Neg logit contributions
add-0.024
 rum-0.019
 int-0.019
opp-0.018
utt-0.018
Token down
Token ID866
Feature activation-3.37e-01

Pos logit contributions
 directions+0.021
liness+0.021
 mats+0.021
 zone+0.020
 newspapers+0.020
Neg logit contributions
add-0.022
 int-0.016
opp-0.016
utt-0.016
 rum-0.016
Token on
Token ID319
Feature activation-1.43e-03

Pos logit contributions
add+0.011
 int+0.008
utt+0.008
opp+0.008
 rum+0.008
Neg logit contributions
liness-0.008
 directions-0.008
 mats-0.008
 zone-0.008
 sandbox-0.008
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
add+0.000
opp+0.000
 int+0.000
utt+0.000
 rum+0.000
Neg logit contributions
 directions-0.000
liness-0.000
 mats-0.000
 lap-0.000
 zone-0.000
Token It
Token ID632
Feature activation-2.84e-01

Pos logit contributions
liness+0.009
 directions+0.009
 lap+0.009
 mats+0.008
 zone+0.008
Neg logit contributions
add-0.012
 int-0.009
opp-0.009
utt-0.009
 rum-0.009
Token does
Token ID857
Feature activation-5.04e-01

Pos logit contributions
add+0.012
opp+0.010
utt+0.009
 int+0.009
omm+0.009
Neg logit contributions
 directions-0.004
 mats-0.004
liness-0.004
 follows-0.004
 tag-0.004
Token not
Token ID407
Feature activation-1.27e-02

Pos logit contributions
add+0.019
opp+0.015
utt+0.015
omm+0.014
 int+0.014
Neg logit contributions
 directions-0.010
 mats-0.009
 yourself-0.009
 tag-0.009
liness-0.009
Token like
Token ID588
Feature activation-8.96e-01

Pos logit contributions
add+0.000
 int+0.000
opp+0.000
omm+0.000
utt+0.000
Neg logit contributions
 directions-0.000
 seek-0.000
 tag-0.000
liness-0.000
 mats-0.000
Token to
Token ID284
Feature activation-2.60e-01

Pos logit contributions
add+0.033
opp+0.027
utt+0.026
urd+0.026
 int+0.026
Neg logit contributions
 directions-0.018
 mats-0.018
 your-0.016
 yourself-0.016
 tag-0.016
Token be
Token ID307
Feature activation-9.31e-01

Pos logit contributions
add+0.010
 int+0.008
utt+0.008
omm+0.008
opp+0.008
Neg logit contributions
 seek-0.005
 directions-0.005
 yourself-0.005
 tag-0.005
 lap-0.005
Token in
Token ID287
Feature activation-1.01e-02

Pos logit contributions
add+0.030
 int+0.023
opp+0.023
utt+0.022
omm+0.022
Neg logit contributions
 directions-0.023
liness-0.023
 mats-0.021
 lap-0.021
 seek-0.021
Token Sue
Token ID26089
Feature activation-4.55e-01

Pos logit contributions
add+0.000
 int+0.000
utt+0.000
opp+0.000
 ven+0.000
Neg logit contributions
 directions-0.000
liness-0.000
 mats-0.000
 lap-0.000
 zone-0.000
Token's
Token ID338
Feature activation-2.24e-01

Pos logit contributions
add+0.017
 int+0.015
opp+0.015
 rum+0.014
utt+0.014
Neg logit contributions
 directions-0.007
liness-0.007
 mats-0.007
 seek-0.006
 sandbox-0.006
Token pocket
Token ID10000
Feature activation-2.70e-01

Pos logit contributions
add+0.008
opp+0.006
 int+0.006
utt+0.006
 ven+0.006
Neg logit contributions
liness-0.005
 directions-0.004
 mats-0.004
 zone-0.004
 seek-0.004
Token wanted
Token ID2227
Feature activation-2.27e-01

Pos logit contributions
 tag+0.003
 mats+0.003
 seek+0.003
 sandbox+0.003
 directions+0.003
Neg logit contributions
add-0.009
utt-0.008
opp-0.007
 int-0.007
une-0.007
Token to
Token ID284
Feature activation+3.02e-01

Pos logit contributions
add+0.008
opp+0.006
utt+0.006
 int+0.006
 ven+0.006
Neg logit contributions
 directions-0.005
 mats-0.005
 newspapers-0.005
liness-0.004
 tag-0.004
Token show
Token ID905
Feature activation+1.19e+00

Pos logit contributions
 directions+0.005
 lap+0.005
 seek+0.005
 tag+0.005
 yourself+0.004
Neg logit contributions
add-0.013
utt-0.010
opp-0.010
 int-0.010
omm-0.009
Token it
Token ID340
Feature activation+3.17e-01

Pos logit contributions
 directions+0.026
liness+0.026
 mats+0.024
 lap+0.024
 zone+0.023
Neg logit contributions
add-0.042
utt-0.033
opp-0.032
 int-0.032
 ven-0.031
Token to
Token ID284
Feature activation+1.15e+00

Pos logit contributions
 directions+0.008
liness+0.008
 mats+0.007
 lap+0.007
 zone+0.007
Neg logit contributions
add-0.010
opp-0.008
 int-0.007
utt-0.007
 ven-0.007
Token her
Token ID607
Feature activation-1.43e+00

Pos logit contributions
 directions+0.022
liness+0.021
 mats+0.020
 lap+0.020
 zone+0.019
Neg logit contributions
add-0.044
utt-0.035
opp-0.035
 int-0.035
 ven-0.033
Token mom
Token ID1995
Feature activation+7.58e-02

Pos logit contributions
add+0.054
utt+0.044
opp+0.041
 int+0.039
 rum+0.039
Neg logit contributions
 directions-0.029
 newspapers-0.029
liness-0.029
 hoop-0.029
 book-0.029
Token and
Token ID290
Feature activation-5.50e-01

Pos logit contributions
 directions+0.002
 mats+0.002
liness+0.002
 seek+0.002
 lap+0.002
Neg logit contributions
add-0.002
 int-0.002
 rum-0.001
opp-0.001
utt-0.001
Token dad
Token ID9955
Feature activation+9.18e-01

Pos logit contributions
add+0.008
 int+0.005
 rum+0.005
 ven+0.005
 attent+0.004
Neg logit contributions
liness-0.021
 directions-0.021
 zone-0.020
 mats-0.020
 lap-0.020
Token.
Token ID13
Feature activation+9.25e-01

Pos logit contributions
 directions+0.028
liness+0.028
 mats+0.027
 seek+0.027
 lap+0.027
Neg logit contributions
add-0.022
 int-0.017
opp-0.016
 rum-0.015
utt-0.015
Token She
Token ID1375
Feature activation+3.34e-01

Pos logit contributions
liness+0.024
 directions+0.024
 seek+0.023
 zone+0.022
robat+0.022
Neg logit contributions
add-0.026
 int-0.021
opp-0.020
omm-0.020
 rum-0.020
Token cold
Token ID4692
Feature activation-6.28e-01

Pos logit contributions
add+0.056
 int+0.044
utt+0.043
opp+0.043
 ven+0.041
Neg logit contributions
 directions-0.033
liness-0.033
 mats-0.031
 lap-0.031
 zone-0.030
Token st
Token ID336
Feature activation-4.31e-01

Pos logit contributions
add+0.020
 rum+0.018
 ven+0.017
une+0.016
omm+0.016
Neg logit contributions
 directions-0.014
liness-0.012
 Seek-0.012
 yourself-0.011
 mats-0.011
Tokeneth
Token ID2788
Feature activation-9.33e-01

Pos logit contributions
add+0.013
 int+0.010
utt+0.009
 rum+0.009
opp+0.009
Neg logit contributions
 lap-0.011
liness-0.011
 directions-0.011
 mats-0.011
 zone-0.011
Tokenoscope
Token ID40326
Feature activation+2.67e-01

Pos logit contributions
add+0.024
 rum+0.020
 int+0.019
 once+0.018
opp+0.018
Neg logit contributions
 directions-0.026
 Seek-0.025
liness-0.025
 mats-0.023
 sandbox-0.023
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
 directions+0.006
liness+0.006
 mats+0.006
 lap+0.006
 sandbox+0.006
Neg logit contributions
add-0.009
 int-0.007
opp-0.007
utt-0.007
 rum-0.007
Token a
Token ID257
Feature activation-2.30e+00

Pos logit contributions
add+0.038
 int+0.029
 rum+0.028
utt+0.028
opp+0.027
Neg logit contributions
 directions-0.036
liness-0.036
 zone-0.034
 lap-0.034
 mats-0.034
Token loud
Token ID7812
Feature activation-1.71e+00

Pos logit contributions
add+0.079
utt+0.060
 int+0.060
opp+0.060
 ven+0.057
Neg logit contributions
 directions-0.051
liness-0.051
 lap-0.048
 mats-0.048
 zone-0.047
Token voice
Token ID3809
Feature activation-1.41e+00

Pos logit contributions
add+0.057
opp+0.043
utt+0.043
 int+0.043
 ven+0.041
Neg logit contributions
liness-0.040
 directions-0.040
 lap-0.038
 mats-0.038
 zone-0.037
Token.
Token ID13
Feature activation-5.66e-01

Pos logit contributions
add+0.047
opp+0.036
utt+0.035
 int+0.034
 ven+0.033
Neg logit contributions
liness-0.033
 directions-0.033
 zone-0.032
 lap-0.032
 mats-0.031
Token Lily
Token ID20037
Feature activation-4.30e-01

Pos logit contributions
add+0.022
opp+0.017
utt+0.017
 int+0.017
 ven+0.016
Neg logit contributions
 lap-0.011
liness-0.011
 directions-0.011
 mats-0.010
 Seek-0.009
Token was
Token ID373
Feature activation-5.68e-01

Pos logit contributions
add+0.016
utt+0.013
 int+0.013
opp+0.013
 ven+0.012
Neg logit contributions
liness-0.008
 directions-0.008
 lap-0.007
 mats-0.007
 seek-0.007
Token \n
Token ID198
Feature activation-3.87e-01

Pos logit contributions
add+0.029
 int+0.023
opp+0.022
 ven+0.022
 rum+0.021
Neg logit contributions
 lap-0.020
 directions-0.019
liness-0.019
 mats-0.018
 sandbox-0.017
Token \n
Token ID198
Feature activation-7.90e-01

Pos logit contributions
add+0.012
 int+0.009
utt+0.009
opp+0.009
 ven+0.009
Neg logit contributions
 directions-0.009
 lap-0.009
liness-0.009
 mats-0.009
 seek-0.008
Token"
Token ID1
Feature activation-7.37e-02

Pos logit contributions
add+0.025
 rum+0.019
utt+0.018
opp+0.018
aze+0.018
Neg logit contributions
liness-0.021
 zone-0.020
 directions-0.019
 mats-0.018
 lap-0.017
TokenCan
Token ID6090
Feature activation-1.21e+00

Pos logit contributions
add+0.003
 rum+0.002
opp+0.002
 ven+0.002
 int+0.002
Neg logit contributions
 directions-0.002
liness-0.002
 mats-0.002
 lap-0.001
 zone-0.001
Token I
Token ID314
Feature activation-7.46e-01

Pos logit contributions
add+0.048
 int+0.039
opp+0.038
utt+0.038
 ven+0.037
Neg logit contributions
 directions-0.020
liness-0.020
 mats-0.019
 lap-0.019
 zone-0.018
Token have
Token ID423
Feature activation-2.95e+00

Pos logit contributions
add+0.029
 int+0.024
 attent+0.023
utt+0.023
 ven+0.022
Neg logit contributions
 lap-0.013
 seek-0.013
 directions-0.012
liness-0.012
 tag-0.012
Token some
Token ID617
Feature activation-1.26e+00

Pos logit contributions
add+0.109
 int+0.091
 ven+0.089
icker+0.088
utt+0.088
Neg logit contributions
 directions-0.056
 your-0.055
 yourself-0.054
 mats-0.051
 newspapers-0.050
Token ice
Token ID4771
Feature activation-1.17e+00

Pos logit contributions
add+0.041
 int+0.035
 ven+0.034
utt+0.033
opp+0.033
Neg logit contributions
 directions-0.028
 yourself-0.026
 mats-0.026
 equipment-0.026
 lap-0.025
Token,
Token ID11
Feature activation-9.22e-01

Pos logit contributions
add+0.050
 int+0.042
utt+0.041
opp+0.041
 ven+0.040
Neg logit contributions
 directions-0.015
liness-0.015
 mats-0.014
 lap-0.014
 seek-0.014
Token please
Token ID3387
Feature activation-1.27e+00

Pos logit contributions
add+0.026
opp+0.019
 int+0.019
utt+0.019
 ven+0.018
Neg logit contributions
 directions-0.025
liness-0.025
 mats-0.024
 lap-0.024
 seek-0.024
Token?"
Token ID1701
Feature activation-1.89e+00

Pos logit contributions
add+0.051
 int+0.040
utt+0.039
 attent+0.038
opp+0.038
Neg logit contributions
 seek-0.024
 your-0.023
 yourself-0.022
liness-0.021
 lap-0.021
Token the
Token ID262
Feature activation+4.15e-01

Pos logit contributions
liness+0.003
 zone+0.002
 mats+0.002
 directions+0.002
 lap+0.002
Neg logit contributions
add-0.004
 int-0.003
opp-0.003
 rum-0.003
utt-0.003
Token forest
Token ID8222
Feature activation-2.26e-01

Pos logit contributions
liness+0.008
 directions+0.008
 mats+0.008
 zone+0.008
fs+0.008
Neg logit contributions
add-0.014
 rum-0.012
 ven-0.011
une-0.011
 int-0.011
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
add+0.005
 int+0.004
 rum+0.004
utt+0.004
 erupt+0.004
Neg logit contributions
liness-0.007
 Seek-0.007
 mats-0.006
 directions-0.006
 zone-0.006
Token decided
Token ID3066
Feature activation-8.87e-01

Pos logit contributions
add+0.045
 int+0.036
utt+0.035
 rum+0.035
opp+0.035
Neg logit contributions
liness-0.031
 directions-0.030
 zone-0.028
 lap-0.028
 mats-0.028
Token to
Token ID284
Feature activation+1.61e-01

Pos logit contributions
add+0.022
 int+0.017
utt+0.016
 rum+0.015
opp+0.015
Neg logit contributions
liness-0.028
 directions-0.027
 zone-0.026
 lap-0.026
 mats-0.026
Token climb
Token ID12080
Feature activation-2.41e+00

Pos logit contributions
 directions+0.004
liness+0.004
 mats+0.003
 lap+0.003
 zone+0.003
Neg logit contributions
add-0.005
 int-0.004
opp-0.004
utt-0.004
 rum-0.004
Token it
Token ID340
Feature activation-1.70e+00

Pos logit contributions
add+0.083
opp+0.064
 ven+0.062
utt+0.062
 int+0.062
Neg logit contributions
 directions-0.055
liness-0.052
 mats-0.051
 lap-0.049
 seek-0.049
Token.
Token ID13
Feature activation-9.77e-01

Pos logit contributions
add+0.050
 int+0.037
opp+0.037
utt+0.036
 ven+0.035
Neg logit contributions
 directions-0.046
liness-0.046
 mats-0.043
 lap-0.043
 zone-0.043
Token He
Token ID679
Feature activation-4.16e-01

Pos logit contributions
add+0.037
 int+0.029
 ven+0.029
opp+0.029
utt+0.028
Neg logit contributions
 directions-0.019
 lap-0.018
liness-0.018
 mats-0.017
 zone-0.016
Token grabbed
Token ID13176
Feature activation+2.19e-01

Pos logit contributions
add+0.015
opp+0.012
 int+0.012
 ven+0.012
omm+0.012
Neg logit contributions
 directions-0.008
 seek-0.008
 mats-0.008
 tag-0.007
 pads-0.007
Token a
Token ID257
Feature activation+6.58e-02

Pos logit contributions
 directions+0.004
 mats+0.003
 newspapers+0.003
 tag+0.003
 equipment+0.003
Neg logit contributions
add-0.009
opp-0.007
 ven-0.007
omm-0.007
utt-0.007
Token was
Token ID373
Feature activation-4.71e-01

Pos logit contributions
 directions+0.007
liness+0.007
 mats+0.007
 lap+0.007
 zone+0.006
Neg logit contributions
add-0.010
opp-0.008
 int-0.008
utt-0.008
 rum-0.008
Token a
Token ID257
Feature activation-2.03e-01

Pos logit contributions
add+0.016
utt+0.013
 int+0.013
opp+0.013
 ven+0.013
Neg logit contributions
 mats-0.009
 directions-0.009
liness-0.009
 seek-0.009
 tag-0.009
Token little
Token ID1310
Feature activation+6.36e-01

Pos logit contributions
add+0.008
utt+0.007
 int+0.006
uned+0.006
opp+0.006
Neg logit contributions
 sandbox-0.004
 seek-0.004
 tag-0.004
 lap-0.003
 leash-0.003
Token bear
Token ID6842
Feature activation-1.03e+00

Pos logit contributions
liness+0.010
 directions+0.010
 mats+0.009
 sandbox+0.009
 lap+0.009
Neg logit contributions
add-0.026
 int-0.021
utt-0.021
opp-0.020
 attent-0.020
Token named
Token ID3706
Feature activation-3.77e-01

Pos logit contributions
add+0.043
 int+0.035
utt+0.034
opp+0.033
omm+0.032
Neg logit contributions
liness-0.016
 lap-0.015
 zone-0.015
 seek-0.015
 tag-0.014
Token Ted
Token ID11396
Feature activation-2.57e+00

Pos logit contributions
add+0.013
 int+0.010
 ven+0.010
 attent+0.010
icker+0.010
Neg logit contributions
 directions-0.008
 seek-0.008
liness-0.008
 your-0.008
 lap-0.008
Token.
Token ID13
Feature activation+9.89e-01

Pos logit contributions
add+0.095
utt+0.076
 int+0.075
omm+0.073
opp+0.072
Neg logit contributions
liness-0.051
 directions-0.049
 zone-0.047
 lap-0.046
 seek-0.045
Token Ted
Token ID11396
Feature activation-1.20e+00

Pos logit contributions
 directions+0.026
liness+0.026
 zone+0.025
 mats+0.024
 lap+0.024
Neg logit contributions
add-0.028
 int-0.022
 rum-0.021
utt-0.021
opp-0.021
Token loved
Token ID6151
Feature activation-3.24e-01

Pos logit contributions
add+0.042
 int+0.033
opp+0.032
utt+0.032
 rum+0.031
Neg logit contributions
 directions-0.026
liness-0.025
 mats-0.024
 lap-0.024
 zone-0.023
Token to
Token ID284
Feature activation-4.94e-01

Pos logit contributions
add+0.011
opp+0.009
utt+0.009
 int+0.009
urd+0.008
Neg logit contributions
 directions-0.007
 mats-0.007
 newspapers-0.006
liness-0.006
 lap-0.006
Token go
Token ID467
Feature activation+2.29e-01

Pos logit contributions
add+0.017
 int+0.013
opp+0.013
utt+0.013
 rum+0.012
Neg logit contributions
 directions-0.011
liness-0.011
 mats-0.010
 zone-0.010
 lap-0.010
Token nest
Token ID16343
Feature activation-2.25e-01

Pos logit contributions
add+0.019
 int+0.014
opp+0.014
utt+0.014
 rum+0.014
Neg logit contributions
liness-0.018
 zone-0.018
 mats-0.017
 keyboard-0.016
 directions-0.016
Token and
Token ID290
Feature activation-1.36e+00

Pos logit contributions
add+0.006
 int+0.005
 once+0.005
utt+0.005
opp+0.005
Neg logit contributions
 Seek-0.006
Found-0.006
 mats-0.006
liness-0.006
 zone-0.006
Token soon
Token ID2582
Feature activation-1.26e+00

Pos logit contributions
add+0.048
opp+0.036
utt+0.036
omm+0.035
 int+0.035
Neg logit contributions
 directions-0.031
liness-0.030
 seek-0.028
 tag-0.028
 lap-0.028
Token the
Token ID262
Feature activation-9.23e-01

Pos logit contributions
add+0.041
 int+0.032
opp+0.031
utt+0.031
 ven+0.030
Neg logit contributions
 directions-0.030
liness-0.029
 mats-0.027
 zone-0.027
 lap-0.027
Token rain
Token ID6290
Feature activation-1.90e+00

Pos logit contributions
add+0.034
opp+0.028
 int+0.027
utt+0.027
ores+0.027
Neg logit contributions
 directions-0.020
 sandbox-0.019
 seek-0.018
 tag-0.017
 lap-0.017
Token stopped
Token ID5025
Feature activation-2.42e+00

Pos logit contributions
add+0.064
opp+0.051
utt+0.048
 ven+0.048
 int+0.048
Neg logit contributions
 directions-0.045
liness-0.045
 lap-0.041
 mats-0.041
 tag-0.041
Token.
Token ID13
Feature activation+6.40e-01

Pos logit contributions
add+0.095
omm+0.078
utt+0.075
 ven+0.074
opp+0.074
Neg logit contributions
 directions-0.045
liness-0.040
 yourself-0.038
 your-0.035
 tag-0.034
Token Jack
Token ID3619
Feature activation+1.83e-01

Pos logit contributions
 directions+0.015
liness+0.015
 lap+0.014
 mats+0.014
 zone+0.013
Neg logit contributions
add-0.021
opp-0.016
utt-0.016
 ven-0.016
 int-0.016
Token and
Token ID290
Feature activation-8.74e-01

Pos logit contributions
 directions+0.004
liness+0.003
 mats+0.003
 lap+0.003
 zone+0.003
Neg logit contributions
add-0.007
 int-0.005
opp-0.005
utt-0.005
 ven-0.005
Token his
Token ID465
Feature activation-1.09e+00

Pos logit contributions
add+0.034
 int+0.028
utt+0.027
opp+0.027
 rum+0.027
Neg logit contributions
liness-0.015
 directions-0.014
 mats-0.014
 zone-0.013
 lap-0.013
Token mom
Token ID1995
Feature activation-1.48e-01

Pos logit contributions
add+0.034
utt+0.026
 int+0.026
opp+0.025
 ven+0.024
Neg logit contributions
 directions-0.029
 lap-0.029
 sandbox-0.027
liness-0.027
 mats-0.027