TOP ACTIVATIONS
MAX = 0.085

's hands and falls on the floor . It makes a
hands . It fell on the floor and made a loud
hand . It falls on the floor and breaks into many
an ordinary marble lying on the ground . She picked it
a rock and fell on the ground . She scrap ed
noticed a penny lying on the ground near the edge of
and the water falls on the ground . L ila and
blue , yellow , green , and more .
see a big paper on the table . It has a
She opened the long drawer and there was her
saw a marble laying on the ground and asked his Dad
hand . It fell on the floor and broke into pieces
falls down and lands on the ground . It is wet
saw the shell lying on the sand . He picked it
, and it fell on the floor . It broke into
microscope and it fell on the floor . It broke into
. The coin falls on the floor . It rolls under
the stones fell back on the ground , or on the
back . She fell on the ground and got wet .
saw a yellow block on the floor . He wanted it

MIN ACTIVATIONS
MIN = -0.083

warm and cozy in their swe aters . They fall asleep
!" To her su prise , the thunder went
candy and ride in his toy je ep . One day
ear . Ben cried and bl ed . Lily screamed and
takes his ball and his k ite and his bike .
can we go in the toy store ?" Her mom my
head . Ben cried and bl ed . Sam screamed and
dog looks at him and bl inks . Ben thinks the
Looking back , he was su pr ised to see a
them . They cried and bl ed and hurt .
back to their ball and k ite . They are happy
hurt her knee and it bl ed . L
little helicopter and the little t eddy bear mom my ,
Now , you have three swe <|endoftext|> L ila was a
mom took him to a k ara oke party . There
got off the couch and tr ud ged out of the
might be ruined . Daisy tr ud ged along , carrying
her sc ooter . She tr und led along the sidewalk
?" Tom asked the mother k ang aroo .
The little girl p outed and said she wanted

INTERVAL 0.043 - 0.085
CONTAINS 0.218%

the bird c age on the scale . The scale showed
They took their own hats and ran outside . They played
. She would slide down the slide and swing on the
to ten . He couldn 't find anyone !
so she sat down on the grass to eat her snack

INTERVAL 0.001 - 0.043
CONTAINS 46.875%

can play with something else . How about the blocks ?"
. The end . <|endoftext|> Once upon a time , there
Her dad had to use his special scissors to cut the
the journal . They show each other their pictures . They
. It was next to theirs , but it was bigger

INTERVAL -0.041 - 0.001
CONTAINS 52.714%

maybe it is looking for food ," mom said .
little girl . Her name was Daisy . One day Daisy
" No , I haven 't seen it . But I
longer bitter , but sweet like candy . She took a
. She wished she had not played with the ball .

INTERVAL -0.083 - -0.041
CONTAINS 0.193%

Fred . He was very boun cy and loved to hop
she went , she heard of a lovely land called Slip
of the younger children , who was only three years old
. L ila and Max cl apped and laughed . They
Lucy a lime and a big hug . Lu
Token's
Token ID338
Feature activation+2.12e-02

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.005
idd-0.005
rous-0.005
rou-0.005
iw-0.004
Token hands
Token ID2832
Feature activation-1.99e-02

Pos logit contributions
 before+0.028
idi+0.028
 grant+0.027
ers+0.025
etta+0.024
Neg logit contributions
und-0.038
idd-0.035
ays-0.033
rous-0.033
rou-0.033
Token and
Token ID290
Feature activation-5.85e-03

Pos logit contributions
und+0.050
rous+0.048
idd+0.047
rou+0.046
iw+0.046
Neg logit contributions
 before-0.016
 grant-0.011
ers-0.010
 had-0.010
 floor-0.010
Token falls
Token ID8953
Feature activation-1.51e-02

Pos logit contributions
und+0.011
idd+0.010
rous+0.010
rou+0.010
iw+0.009
Neg logit contributions
 before-0.008
 grant-0.007
idi-0.007
ers-0.007
ace-0.007
Token on
Token ID319
Feature activation+6.43e-03

Pos logit contributions
und+0.035
rous+0.032
idd+0.032
rou+0.031
iw+0.031
Neg logit contributions
 before-0.015
 grant-0.012
ers-0.011
ace-0.011
 floor-0.011
Token the
Token ID262
Feature activation+8.55e-02

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.004
ace+0.004
ers+0.004
Neg logit contributions
und-0.016
idd-0.014
rous-0.014
iw-0.014
rou-0.014
Token floor
Token ID4314
Feature activation-3.01e-02

Pos logit contributions
idi+0.110
etta+0.105
ace+0.102
In+0.100
 counted+0.094
Neg logit contributions
und-0.115
idd-0.107
rou-0.104
ays-0.103
 batch-0.102
Token.
Token ID13
Feature activation-5.63e-03

Pos logit contributions
und+0.076
rous+0.074
idd+0.071
iw+0.069
 Enjoy+0.069
Neg logit contributions
 before-0.025
ers-0.017
 floor-0.016
 grant-0.016
 had-0.015
Token It
Token ID632
Feature activation-7.64e-03

Pos logit contributions
und+0.012
idd+0.011
rous+0.011
rou+0.011
iw+0.010
Neg logit contributions
 before-0.006
 grant-0.006
idi-0.005
ers-0.005
ace-0.005
Token makes
Token ID1838
Feature activation-2.33e-02

Pos logit contributions
und+0.014
idd+0.012
rous+0.012
rou+0.012
iw+0.012
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.010
ers-0.009
ace-0.009
Token a
Token ID257
Feature activation-1.31e-02

Pos logit contributions
und+0.058
idd+0.053
rous+0.052
iw+0.052
rou+0.051
Neg logit contributions
 before-0.020
 grant-0.016
 floor-0.015
ers-0.015
idi-0.015
Token hands
Token ID2832
Feature activation-1.09e-02

Pos logit contributions
 before+0.038
 grant+0.037
idi+0.035
etta+0.034
ers+0.033
Neg logit contributions
und-0.043
rous-0.039
idd-0.039
iw-0.038
 batch-0.038
Token.
Token ID13
Feature activation+8.39e-03

Pos logit contributions
und+0.026
idd+0.024
rous+0.023
rou+0.023
iw+0.023
Neg logit contributions
 before-0.010
 grant-0.009
ers-0.008
idi-0.008
ace-0.008
Token It
Token ID632
Feature activation+1.03e-02

Pos logit contributions
 before+0.011
 grant+0.010
ers+0.010
idi+0.010
ace+0.010
Neg logit contributions
und-0.016
idd-0.014
rous-0.014
iw-0.014
uck-0.014
Token fell
Token ID3214
Feature activation-8.67e-03

Pos logit contributions
 before+0.014
 grant+0.013
idi+0.012
ace+0.012
ers+0.011
Neg logit contributions
und-0.019
idd-0.017
rous-0.017
rou-0.016
iw-0.016
Token on
Token ID319
Feature activation+1.48e-02

Pos logit contributions
und+0.019
idd+0.017
rous+0.017
rou+0.016
iw+0.016
Neg logit contributions
 before-0.010
 grant-0.009
idi-0.008
ers-0.008
ace-0.008
Token the
Token ID262
Feature activation+8.20e-02

Pos logit contributions
 grant+0.013
 before+0.013
idi+0.011
ace+0.011
etta+0.010
Neg logit contributions
und-0.034
idd-0.030
rous-0.030
 batch-0.029
iw-0.029
Token floor
Token ID4314
Feature activation-1.15e-02

Pos logit contributions
idi+0.103
etta+0.099
 grant+0.093
 granted+0.086
In+0.086
Neg logit contributions
und-0.123
 batch-0.118
iw-0.113
idd-0.108
rou-0.103
Token and
Token ID290
Feature activation-4.61e-03

Pos logit contributions
und+0.027
idd+0.025
rous+0.025
rou+0.024
iw+0.024
Neg logit contributions
 before-0.011
 grant-0.009
ers-0.008
idi-0.008
 floor-0.008
Token made
Token ID925
Feature activation-1.83e-02

Pos logit contributions
und+0.009
idd+0.008
rous+0.008
rou+0.008
iw+0.008
Neg logit contributions
 before-0.006
 grant-0.005
idi-0.005
ers-0.005
ace-0.005
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
und+0.044
idd+0.041
rous+0.040
iw+0.040
rou+0.039
Neg logit contributions
 before-0.016
 grant-0.014
ers-0.013
idi-0.013
 floor-0.012
Token loud
Token ID7812
Feature activation-2.98e-02

Pos logit contributions
und+0.026
idd+0.023
rous+0.023
rou+0.022
iw+0.022
Neg logit contributions
 before-0.014
 grant-0.013
idi-0.012
 floor-0.012
ers-0.011
Token hand
Token ID1021
Feature activation-8.81e-03

Pos logit contributions
idi+0.038
 before+0.037
 grant+0.035
ers+0.034
etta+0.032
Neg logit contributions
und-0.051
idd-0.045
ays-0.044
iw-0.044
rou-0.044
Token.
Token ID13
Feature activation+3.05e-04

Pos logit contributions
und+0.021
idd+0.019
rous+0.019
rou+0.018
iw+0.018
Neg logit contributions
 before-0.008
 grant-0.007
ers-0.006
idi-0.006
ace-0.006
Token It
Token ID632
Feature activation-7.04e-03

Pos logit contributions
 before+0.000
 grant+0.000
idi+0.000
ers+0.000
ace+0.000
Neg logit contributions
und-0.001
idd-0.001
rous-0.001
rou-0.001
iw-0.001
Token falls
Token ID8953
Feature activation-1.25e-02

Pos logit contributions
und+0.013
idd+0.012
rous+0.011
rou+0.011
iw+0.011
Neg logit contributions
 before-0.010
 grant-0.009
idi-0.009
ers-0.009
ace-0.008
Token on
Token ID319
Feature activation+1.49e-02

Pos logit contributions
und+0.027
rous+0.025
idd+0.025
rou+0.024
iw+0.024
Neg logit contributions
 before-0.014
 grant-0.013
ace-0.011
ers-0.011
idi-0.011
Token the
Token ID262
Feature activation+8.13e-02

Pos logit contributions
 grant+0.012
 before+0.012
idi+0.011
ace+0.011
ers+0.010
Neg logit contributions
und-0.035
idd-0.030
rous-0.030
 batch-0.030
iw-0.029
Token floor
Token ID4314
Feature activation-1.81e-02

Pos logit contributions
idi+0.106
etta+0.096
mit+0.086
 grant+0.086
kins+0.086
Neg logit contributions
und-0.123
 batch-0.111
rou-0.105
ays-0.101
iw-0.101
Token and
Token ID290
Feature activation-1.93e-02

Pos logit contributions
und+0.044
rous+0.041
idd+0.040
rou+0.039
iw+0.039
Neg logit contributions
 before-0.016
 grant-0.013
ers-0.012
 floor-0.012
idi-0.011
Token breaks
Token ID9457
Feature activation+5.06e-03

Pos logit contributions
und+0.039
idd+0.036
rous+0.035
rou+0.034
unch+0.034
Neg logit contributions
 before-0.025
 grant-0.023
 floor-0.020
 had-0.020
idi-0.020
Token into
Token ID656
Feature activation-1.73e-02

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.005
ers+0.004
ace+0.004
Neg logit contributions
und-0.011
idd-0.010
rou-0.010
rous-0.010
ays-0.009
Token many
Token ID867
Feature activation-4.64e-03

Pos logit contributions
und+0.039
rous+0.036
idd+0.035
rou+0.035
iw+0.034
Neg logit contributions
 before-0.018
 grant-0.015
 floor-0.015
ers-0.014
idi-0.014
Token an
Token ID281
Feature activation-7.66e-04

Pos logit contributions
und+0.037
idd+0.033
rous+0.032
rou+0.032
iw+0.032
Neg logit contributions
 before-0.014
 grant-0.012
idi-0.011
ers-0.011
 floor-0.010
Token ordinary
Token ID8850
Feature activation-1.87e-02

Pos logit contributions
und+0.002
idd+0.002
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token marble
Token ID30623
Feature activation+7.27e-05

Pos logit contributions
und+0.039
rous+0.035
idd+0.035
rou+0.034
rol+0.033
Neg logit contributions
 before-0.023
 grant-0.020
idi-0.019
 floor-0.019
ers-0.018
Token lying
Token ID9105
Feature activation-1.63e-02

Pos logit contributions
 before+0.000
 grant+0.000
idi+0.000
ace+0.000
ers+0.000
Neg logit contributions
und-0.000
idd-0.000
rous-0.000
rou-0.000
iw-0.000
Token on
Token ID319
Feature activation+1.36e-02

Pos logit contributions
und+0.039
rous+0.035
idd+0.035
rou+0.035
unch+0.034
Neg logit contributions
 before-0.015
 grant-0.012
ers-0.011
idi-0.011
 had-0.011
Token the
Token ID262
Feature activation+8.08e-02

Pos logit contributions
 before+0.011
 grant+0.011
ace+0.010
idi+0.010
ers+0.010
Neg logit contributions
und-0.032
rous-0.028
idd-0.028
 batch-0.028
iw-0.027
Token ground
Token ID2323
Feature activation-3.22e-03

Pos logit contributions
idi+0.106
etta+0.093
 grant+0.089
ce+0.085
ves+0.083
Neg logit contributions
und-0.116
iw-0.109
 bur-0.106
ho-0.105
 batch-0.101
Token.
Token ID13
Feature activation-8.45e-03

Pos logit contributions
und+0.007
idd+0.007
rous+0.007
rou+0.006
iw+0.006
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token She
Token ID1375
Feature activation-3.31e-04

Pos logit contributions
und+0.016
idd+0.015
rous+0.014
rou+0.014
iw+0.014
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.010
ers-0.009
ace-0.009
Token picked
Token ID6497
Feature activation-1.82e-02

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token it
Token ID340
Feature activation-2.07e-03

Pos logit contributions
und+0.036
idd+0.033
rous+0.031
rou+0.031
unch+0.031
Neg logit contributions
 before-0.025
 grant-0.022
idi-0.020
ers-0.020
 had-0.019
Token a
Token ID257
Feature activation+1.76e-02

Pos logit contributions
und+0.055
idd+0.051
rous+0.051
rou+0.050
rol+0.049
Neg logit contributions
 before-0.017
 grant-0.013
ers-0.012
idi-0.011
ace-0.011
Token rock
Token ID3881
Feature activation-3.03e-02

Pos logit contributions
 before+0.022
idi+0.020
ace+0.020
 grant+0.020
ers+0.019
Neg logit contributions
und-0.034
idd-0.030
rou-0.029
iw-0.029
rous-0.029
Token and
Token ID290
Feature activation-4.41e-03

Pos logit contributions
und+0.079
rous+0.078
idd+0.077
rou+0.075
 Enjoy+0.073
Neg logit contributions
 before-0.021
 grant-0.013
ers-0.012
 floor-0.012
 had-0.012
Token fell
Token ID3214
Feature activation-1.04e-02

Pos logit contributions
und+0.009
idd+0.008
rous+0.008
rou+0.007
iw+0.007
Neg logit contributions
 before-0.006
 grant-0.005
idi-0.005
ers-0.005
ace-0.005
Token on
Token ID319
Feature activation+2.24e-03

Pos logit contributions
und+0.023
rous+0.021
idd+0.021
rou+0.020
iw+0.020
Neg logit contributions
 before-0.011
 grant-0.010
ers-0.009
ace-0.009
idi-0.009
Token the
Token ID262
Feature activation+8.02e-02

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.001
ace+0.001
ers+0.001
Neg logit contributions
und-0.005
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token ground
Token ID2323
Feature activation-1.35e-02

Pos logit contributions
idi+0.121
etta+0.114
In+0.108
atha+0.103
keys+0.103
Neg logit contributions
und-0.100
 bur-0.090
rou-0.088
iw-0.081
udd-0.076
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
und+0.032
idd+0.030
rous+0.030
rou+0.029
iw+0.028
Neg logit contributions
 before-0.013
 grant-0.010
ers-0.009
idi-0.009
 floor-0.009
Token She
Token ID1375
Feature activation+1.33e-02

Pos logit contributions
und+0.021
idd+0.019
rous+0.018
rou+0.018
iw+0.017
Neg logit contributions
 before-0.015
 grant-0.014
idi-0.013
ace-0.012
ers-0.012
Token scrap
Token ID15881
Feature activation-9.31e-03

Pos logit contributions
 before+0.021
 grant+0.019
idi+0.019
ace+0.019
ers+0.018
Neg logit contributions
und-0.022
idd-0.019
rou-0.018
rous-0.018
iw-0.018
Tokened
Token ID276
Feature activation-1.33e-02

Pos logit contributions
und+0.014
rous+0.012
idd+0.012
rou+0.012
iw+0.011
Neg logit contributions
 before-0.017
 grant-0.015
ers-0.015
idi-0.015
ace-0.015
Token noticed
Token ID6810
Feature activation-1.54e-02

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token a
Token ID257
Feature activation-3.32e-03

Pos logit contributions
und+0.032
idd+0.029
rous+0.029
iw+0.028
rou+0.028
Neg logit contributions
 before-0.019
 grant-0.016
idi-0.016
ers-0.015
 had-0.015
Token penny
Token ID26369
Feature activation+5.24e-03

Pos logit contributions
und+0.007
idd+0.007
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.004
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token lying
Token ID9105
Feature activation-1.24e-02

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.006
ace+0.005
ers+0.005
Neg logit contributions
und-0.011
idd-0.010
rou-0.009
rous-0.009
iw-0.009
Token on
Token ID319
Feature activation+1.41e-02

Pos logit contributions
und+0.029
idd+0.027
rous+0.026
rou+0.026
iw+0.025
Neg logit contributions
 before-0.012
 grant-0.010
ers-0.009
idi-0.009
ace-0.009
Token the
Token ID262
Feature activation+8.02e-02

Pos logit contributions
 before+0.012
 grant+0.012
ace+0.011
idi+0.011
ers+0.010
Neg logit contributions
und-0.033
rous-0.029
idd-0.028
 batch-0.028
iw-0.028
Token ground
Token ID2323
Feature activation-3.38e-03

Pos logit contributions
idi+0.097
etta+0.090
 grant+0.087
ace+0.084
ia+0.083
Neg logit contributions
und-0.120
 bur-0.111
iw-0.103
idd-0.103
ho-0.102
Token near
Token ID1474
Feature activation-2.12e-03

Pos logit contributions
und+0.008
idd+0.007
rous+0.007
rou+0.007
iw+0.007
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token the
Token ID262
Feature activation+5.73e-02

Pos logit contributions
und+0.004
idd+0.004
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.002
Token edge
Token ID5743
Feature activation+1.61e-02

Pos logit contributions
idi+0.084
ace+0.081
etta+0.080
 grant+0.075
 before+0.075
Neg logit contributions
und-0.086
idd-0.079
iw-0.076
 batch-0.071
rou-0.070
Token of
Token ID286
Feature activation-9.03e-03

Pos logit contributions
 grant+0.019
 before+0.018
idi+0.018
ace+0.016
etta+0.016
Neg logit contributions
und-0.033
idd-0.029
rous-0.028
rou-0.028
uck-0.028
Token and
Token ID290
Feature activation+2.59e-03

Pos logit contributions
und+0.033
idd+0.031
rous+0.031
rou+0.030
iw+0.030
Neg logit contributions
 before-0.011
 grant-0.008
ers-0.008
 floor-0.008
idi-0.008
Token the
Token ID262
Feature activation+2.42e-02

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ers+0.002
ace+0.002
Neg logit contributions
und-0.005
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token water
Token ID1660
Feature activation+3.19e-03

Pos logit contributions
 before+0.027
 grant+0.027
idi+0.027
ers+0.024
ace+0.024
Neg logit contributions
und-0.048
idd-0.043
rous-0.042
rou-0.042
iw-0.041
Token falls
Token ID8953
Feature activation-1.91e-03

Pos logit contributions
 before+0.005
 grant+0.004
idi+0.004
ace+0.004
ers+0.004
Neg logit contributions
und-0.006
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token on
Token ID319
Feature activation+3.22e-03

Pos logit contributions
und+0.004
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.003
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token the
Token ID262
Feature activation+7.92e-02

Pos logit contributions
 before+0.003
 grant+0.002
idi+0.002
ace+0.002
ers+0.002
Neg logit contributions
und-0.008
idd-0.007
rous-0.007
rou-0.007
 batch-0.007
Token ground
Token ID2323
Feature activation-2.76e-03

Pos logit contributions
idi+0.103
 grant+0.088
etta+0.085
lot+0.077
ace+0.077
Neg logit contributions
und-0.132
idd-0.113
 batch-0.111
ays-0.109
rou-0.109
Token.
Token ID13
Feature activation-1.02e-03

Pos logit contributions
und+0.006
idd+0.006
rous+0.006
rou+0.006
iw+0.005
Neg logit contributions
 before-0.003
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token L
Token ID406
Feature activation-2.46e-03

Pos logit contributions
und+0.002
idd+0.002
rous+0.002
rou+0.002
iw+0.002
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Tokenila
Token ID10102
Feature activation-5.97e-04

Pos logit contributions
und+0.005
idd+0.004
rous+0.004
rou+0.004
iw+0.004
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token and
Token ID290
Feature activation-2.09e-02

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.000
ers-0.000
ace-0.000
Token blue
Token ID4171
Feature activation+2.92e-02

Pos logit contributions
ace+0.112
 grant+0.108
idi+0.106
 before+0.105
oster+0.103
Neg logit contributions
und-0.080
 batch-0.071
iw-0.069
idd-0.068
 together-0.064
Token,
Token ID11
Feature activation+7.08e-02

Pos logit contributions
idi+0.040
 grant+0.039
 before+0.037
etta+0.035
ace+0.035
Neg logit contributions
und-0.055
idd-0.047
ays-0.047
iw-0.046
uck-0.044
Token yellow
Token ID7872
Feature activation+3.53e-02

Pos logit contributions
ace+0.120
 grant+0.118
idi+0.116
 before+0.113
 contain+0.111
Neg logit contributions
und-0.084
 batch-0.070
iw-0.070
 together-0.069
idd-0.068
Token,
Token ID11
Feature activation+7.66e-02

Pos logit contributions
idi+0.051
 grant+0.049
etta+0.046
 before+0.046
ace+0.044
Neg logit contributions
und-0.063
idd-0.054
iw-0.053
ays-0.052
 batch-0.052
Token green
Token ID4077
Feature activation+4.94e-02

Pos logit contributions
ace+0.130
 grant+0.129
idi+0.128
 before+0.124
 contain+0.123
Neg logit contributions
und-0.087
 together-0.073
iw-0.072
 batch-0.071
idd-0.068
Token,
Token ID11
Feature activation+7.91e-02

Pos logit contributions
idi+0.075
 grant+0.074
etta+0.070
 before+0.066
ace+0.066
Neg logit contributions
und-0.083
idd-0.069
iw-0.069
 batch-0.067
ays-0.066
Token and
Token ID290
Feature activation+3.18e-02

Pos logit contributions
ace+0.134
 grant+0.134
idi+0.133
 contain+0.127
 employer+0.126
Neg logit contributions
und-0.090
 together-0.076
iw-0.074
 batch-0.072
idd-0.068
Token more
Token ID517
Feature activation+3.33e-02

Pos logit contributions
 grant+0.044
 before+0.044
idi+0.043
ace+0.042
ers+0.039
Neg logit contributions
und-0.051
idd-0.046
iw-0.045
 batch-0.045
rous-0.045
Token.
Token ID13
Feature activation+2.74e-03

Pos logit contributions
 grant+0.052
idi+0.050
ace+0.044
 before+0.043
etta+0.043
Neg logit contributions
und-0.055
idd-0.048
iw-0.046
rou-0.045
 batch-0.044
Token
Token ID198
Feature activation-1.37e-02

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ers+0.003
ace+0.003
Neg logit contributions
und-0.005
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token
Token ID198
Feature activation-1.00e-02

Pos logit contributions
und+0.032
idd+0.029
rous+0.029
rou+0.028
iw+0.028
Neg logit contributions
 before-0.013
 grant-0.011
idi-0.011
ers-0.010
 had-0.010
Token see
Token ID766
Feature activation+8.80e-04

Pos logit contributions
 before+0.007
 grant+0.006
idi+0.006
ers+0.006
ace+0.005
Neg logit contributions
und-0.013
idd-0.012
rous-0.012
rou-0.011
iw-0.011
Token a
Token ID257
Feature activation+3.76e-02

Pos logit contributions
 before+0.001
 grant+0.001
idi+0.001
ace+0.001
ers+0.001
Neg logit contributions
und-0.002
idd-0.002
rous-0.002
rou-0.002
iw-0.002
Token big
Token ID1263
Feature activation+1.88e-02

Pos logit contributions
ace+0.050
 before+0.047
ers+0.046
idi+0.044
 grant+0.043
Neg logit contributions
und-0.065
idd-0.064
rou-0.062
iw-0.062
 batch-0.062
Token paper
Token ID3348
Feature activation+2.49e-02

Pos logit contributions
 before+0.024
 grant+0.023
idi+0.023
ace+0.022
ers+0.022
Neg logit contributions
und-0.035
idd-0.032
rous-0.032
 batch-0.032
iw-0.031
Token on
Token ID319
Feature activation+3.03e-02

Pos logit contributions
idi+0.030
 before+0.030
 grant+0.029
ace+0.029
etta+0.026
Neg logit contributions
und-0.047
idd-0.043
ays-0.040
rou-0.040
 batch-0.040
Token the
Token ID262
Feature activation+7.91e-02

Pos logit contributions
 grant+0.028
ace+0.028
etta+0.027
idi+0.027
 before+0.026
Neg logit contributions
und-0.063
rous-0.058
 batch-0.057
ays-0.054
idd-0.053
Token table
Token ID3084
Feature activation-2.33e-03

Pos logit contributions
idi+0.120
etta+0.109
ace+0.099
 forgot+0.096
In+0.095
Neg logit contributions
 batch-0.099
iw-0.095
rous-0.089
und-0.086
 course-0.084
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
und+0.005
idd+0.005
rous+0.005
rou+0.005
iw+0.005
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token It
Token ID632
Feature activation+1.05e-02

Pos logit contributions
 before+0.016
 grant+0.016
ace+0.015
idi+0.014
ers+0.014
Neg logit contributions
und-0.021
idd-0.018
rous-0.018
ays-0.017
iw-0.017
Token has
Token ID468
Feature activation+3.21e-02

Pos logit contributions
 before+0.016
 grant+0.014
idi+0.014
ace+0.014
ers+0.013
Neg logit contributions
und-0.018
idd-0.016
rous-0.016
iw-0.015
ays-0.015
Token a
Token ID257
Feature activation+1.60e-02

Pos logit contributions
 before+0.046
idi+0.043
 grant+0.043
etta+0.040
ace+0.039
Neg logit contributions
und-0.055
idd-0.048
rous-0.046
rou-0.046
iw-0.045
Token
Token ID198
Feature activation+5.25e-03

Pos logit contributions
und+0.037
idd+0.034
rous+0.033
rou+0.032
iw+0.031
Neg logit contributions
 before-0.023
 grant-0.020
idi-0.020
 floor-0.019
 had-0.019
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.005
ace+0.005
ers+0.005
Neg logit contributions
und-0.011
idd-0.010
rous-0.010
rou-0.009
unch-0.009
TokenShe
Token ID3347
Feature activation-7.33e-03

Pos logit contributions
 before+0.017
 grant+0.016
idi+0.015
ace+0.015
ers+0.015
Neg logit contributions
und-0.020
rous-0.017
rou-0.017
idd-0.017
uck-0.017
Token opened
Token ID4721
Feature activation-1.01e-02

Pos logit contributions
und+0.015
idd+0.014
rous+0.014
rou+0.013
iw+0.013
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.008
ers-0.007
ace-0.007
Token the
Token ID262
Feature activation+2.71e-02

Pos logit contributions
und+0.025
idd+0.023
rou+0.023
rous+0.022
unch+0.022
Neg logit contributions
 before-0.008
 grant-0.007
idi-0.006
ers-0.006
ace-0.006
Token long
Token ID890
Feature activation+7.89e-02

Pos logit contributions
idi+0.043
 before+0.043
 grant+0.040
ace+0.040
etta+0.040
Neg logit contributions
und-0.039
iw-0.036
rou-0.036
rous-0.036
 batch-0.035
Token drawer
Token ID33451
Feature activation-1.78e-02

Pos logit contributions
 grant+0.122
etta+0.121
idi+0.120
 contain+0.119
oster+0.117
Neg logit contributions
 batch-0.110
und-0.100
iw-0.094
idd-0.094
uck-0.084
Token and
Token ID290
Feature activation-8.24e-03

Pos logit contributions
und+0.045
idd+0.042
rous+0.041
rou+0.040
iw+0.039
Neg logit contributions
 before-0.015
 grant-0.011
ers-0.010
 floor-0.010
idi-0.010
Token there
Token ID612
Feature activation-1.30e-02

Pos logit contributions
und+0.016
idd+0.015
rous+0.014
rou+0.014
iw+0.014
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.009
ers-0.009
ace-0.009
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
und+0.030
idd+0.027
rous+0.027
rou+0.027
iw+0.026
Neg logit contributions
 before-0.013
 grant-0.011
idi-0.011
ers-0.010
 had-0.010
Token her
Token ID607
Feature activation+1.11e-02

Pos logit contributions
 before+0.017
 grant+0.016
idi+0.015
ace+0.015
ers+0.014
Neg logit contributions
und-0.025
idd-0.023
rou-0.022
rous-0.022
 batch-0.022
Token saw
Token ID2497
Feature activation-1.85e-02

Pos logit contributions
 before+0.009
 grant+0.008
ace+0.008
ers+0.008
idi+0.008
Neg logit contributions
und-0.012
idd-0.011
rous-0.010
rou-0.010
iw-0.010
Token a
Token ID257
Feature activation-8.75e-03

Pos logit contributions
und+0.045
idd+0.041
rou+0.040
rous+0.040
iw+0.040
Neg logit contributions
 before-0.016
 grant-0.013
idi-0.013
ers-0.013
 floor-0.012
Token marble
Token ID30623
Feature activation+1.23e-03

Pos logit contributions
und+0.019
idd+0.017
rous+0.017
rou+0.016
iw+0.016
Neg logit contributions
 before-0.010
 grant-0.009
idi-0.008
ers-0.008
 floor-0.008
Token laying
Token ID16299
Feature activation-1.25e-02

Pos logit contributions
 before+0.001
 grant+0.001
idi+0.001
ace+0.001
ers+0.001
Neg logit contributions
und-0.003
idd-0.002
rous-0.002
rou-0.002
iw-0.002
Token on
Token ID319
Feature activation+2.41e-02

Pos logit contributions
und+0.030
idd+0.027
rous+0.027
rou+0.026
unch+0.026
Neg logit contributions
 before-0.012
 grant-0.009
ers-0.009
idi-0.009
ace-0.008
Token the
Token ID262
Feature activation+7.87e-02

Pos logit contributions
 grant+0.024
ace+0.022
etta+0.021
 before+0.021
idi+0.020
Neg logit contributions
und-0.053
rous-0.047
 batch-0.046
idd-0.045
iw-0.044
Token ground
Token ID2323
Feature activation+4.55e-03

Pos logit contributions
idi+0.101
etta+0.093
 grant+0.090
ves+0.084
 granted+0.084
Neg logit contributions
und-0.120
iw-0.117
idd-0.109
 batch-0.109
 bur-0.109
Token and
Token ID290
Feature activation+3.94e-03

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.004
ace+0.004
ers+0.004
Neg logit contributions
und-0.010
idd-0.009
rous-0.009
rou-0.009
iw-0.008
Token asked
Token ID1965
Feature activation-3.03e-03

Pos logit contributions
 before+0.005
 grant+0.005
ace+0.005
ers+0.005
idi+0.005
Neg logit contributions
und-0.008
idd-0.007
rous-0.007
rou-0.006
iw-0.006
Token his
Token ID465
Feature activation-7.07e-04

Pos logit contributions
und+0.006
idd+0.006
rous+0.006
rou+0.006
iw+0.005
Neg logit contributions
 before-0.004
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token Dad
Token ID17415
Feature activation+9.02e-03

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token hand
Token ID1021
Feature activation-8.93e-03

Pos logit contributions
idi+0.037
 before+0.035
ers+0.034
 grant+0.034
etta+0.032
Neg logit contributions
und-0.055
iw-0.048
rou-0.048
idd-0.048
rous-0.047
Token.
Token ID13
Feature activation-3.18e-03

Pos logit contributions
und+0.021
idd+0.019
rous+0.019
rou+0.018
iw+0.018
Neg logit contributions
 before-0.009
 grant-0.007
idi-0.007
ers-0.007
ace-0.007
Token It
Token ID632
Feature activation+6.26e-03

Pos logit contributions
und+0.006
idd+0.006
rous+0.005
rou+0.005
iw+0.005
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.004
ers-0.004
ace-0.003
Token fell
Token ID3214
Feature activation-2.54e-03

Pos logit contributions
 before+0.008
 grant+0.007
idi+0.007
ace+0.007
ers+0.007
Neg logit contributions
und-0.012
idd-0.011
rou-0.011
rous-0.011
iw-0.010
Token on
Token ID319
Feature activation+1.54e-02

Pos logit contributions
und+0.005
idd+0.005
rous+0.005
rou+0.004
iw+0.004
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token the
Token ID262
Feature activation+7.87e-02

Pos logit contributions
 grant+0.013
 before+0.012
idi+0.011
ace+0.011
ers+0.010
Neg logit contributions
und-0.036
idd-0.032
rous-0.032
iw-0.031
 batch-0.031
Token floor
Token ID4314
Feature activation-1.60e-02

Pos logit contributions
idi+0.093
etta+0.092
ace+0.087
In+0.078
 grant+0.078
Neg logit contributions
 batch-0.123
und-0.120
iw-0.117
rou-0.110
uck-0.103
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
und+0.039
rous+0.036
idd+0.036
rou+0.035
iw+0.034
Neg logit contributions
 before-0.014
 grant-0.011
ers-0.010
idi-0.010
 floor-0.010
Token broke
Token ID6265
Feature activation+5.72e-03

Pos logit contributions
und+0.022
idd+0.020
rous+0.020
rou+0.019
unch+0.019
Neg logit contributions
 before-0.013
 grant-0.012
idi-0.011
ers-0.011
 had-0.011
Token into
Token ID656
Feature activation-2.42e-02

Pos logit contributions
 grant+0.006
 before+0.006
idi+0.005
ers+0.005
ace+0.005
Neg logit contributions
und-0.013
idd-0.011
rou-0.011
rous-0.011
iw-0.011
Token pieces
Token ID5207
Feature activation-7.72e-03

Pos logit contributions
und+0.056
rous+0.053
idd+0.052
rou+0.051
rol+0.050
Neg logit contributions
 before-0.023
 floor-0.019
 grant-0.019
ers-0.018
 sides-0.017
Token falls
Token ID8953
Feature activation-2.26e-02

Pos logit contributions
und+0.030
idd+0.027
rous+0.027
rou+0.026
unch+0.025
Neg logit contributions
 before-0.022
 grant-0.020
idi-0.019
ers-0.019
 had-0.018
Token down
Token ID866
Feature activation-1.74e-02

Pos logit contributions
und+0.052
rous+0.051
idd+0.049
rou+0.048
 batch+0.048
Neg logit contributions
 before-0.022
 grant-0.017
ace-0.016
ers-0.016
 had-0.016
Token and
Token ID290
Feature activation-4.12e-02

Pos logit contributions
und+0.044
rous+0.041
idd+0.041
rou+0.040
unch+0.039
Neg logit contributions
 before-0.014
 grant-0.011
ers-0.009
 had-0.009
idi-0.009
Token lands
Token ID8604
Feature activation-2.93e-02

Pos logit contributions
und+0.086
unch+0.081
idd+0.080
rous+0.080
aman+0.078
Neg logit contributions
 before-0.050
 grant-0.046
 had-0.042
 floor-0.040
 finish-0.038
Token on
Token ID319
Feature activation+1.24e-02

Pos logit contributions
und+0.074
rous+0.071
aman+0.069
 batch+0.068
idd+0.067
Neg logit contributions
 before-0.024
ers-0.019
 grant-0.017
 had-0.016
ace-0.014
Token the
Token ID262
Feature activation+7.87e-02

Pos logit contributions
 before+0.011
 grant+0.011
idi+0.010
ace+0.010
ers+0.009
Neg logit contributions
und-0.029
idd-0.025
rous-0.025
iw-0.025
uck-0.024
Token ground
Token ID2323
Feature activation-1.96e-02

Pos logit contributions
idi+0.110
 grant+0.093
 before+0.090
etta+0.090
In+0.087
Neg logit contributions
und-0.111
idd-0.101
 batch-0.096
rous-0.095
iw-0.094
Token.
Token ID13
Feature activation-9.22e-03

Pos logit contributions
und+0.048
rous+0.045
idd+0.045
rou+0.043
iw+0.042
Neg logit contributions
 before-0.017
 grant-0.013
ers-0.012
 floor-0.012
ace-0.011
Token It
Token ID632
Feature activation-1.38e-02

Pos logit contributions
und+0.021
idd+0.019
rous+0.019
rou+0.019
iw+0.018
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.007
ers-0.007
ace-0.007
Token is
Token ID318
Feature activation+1.47e-02

Pos logit contributions
und+0.026
idd+0.023
rous+0.023
rou+0.022
unch+0.022
Neg logit contributions
 before-0.019
 grant-0.018
idi-0.017
ers-0.016
 floor-0.016
Token wet
Token ID9583
Feature activation-2.08e-02

Pos logit contributions
 grant+0.020
 before+0.020
idi+0.019
ers+0.019
ace+0.018
Neg logit contributions
und-0.026
rou-0.024
rous-0.023
idd-0.023
iw-0.021
Token saw
Token ID2497
Feature activation-7.33e-03

Pos logit contributions
 before+0.010
 grant+0.009
ace+0.009
ers+0.009
idi+0.009
Neg logit contributions
und-0.011
idd-0.010
rou-0.010
rous-0.009
iw-0.009
Token the
Token ID262
Feature activation+1.07e-02

Pos logit contributions
und+0.016
idd+0.014
rous+0.014
rou+0.014
iw+0.014
Neg logit contributions
 before-0.008
 grant-0.007
idi-0.007
ers-0.007
ace-0.007
Token shell
Token ID7582
Feature activation+1.98e-02

Pos logit contributions
 before+0.015
 grant+0.014
idi+0.013
ace+0.013
ers+0.013
Neg logit contributions
und-0.019
idd-0.018
rou-0.018
rous-0.018
iw-0.017
Token lying
Token ID9105
Feature activation-2.27e-02

Pos logit contributions
 grant+0.021
idi+0.021
 before+0.021
ace+0.020
etta+0.018
Neg logit contributions
und-0.041
rou-0.036
idd-0.036
rous-0.035
unch-0.035
Token on
Token ID319
Feature activation+2.10e-02

Pos logit contributions
und+0.054
rous+0.050
idd+0.050
rou+0.049
iw+0.048
Neg logit contributions
 before-0.021
ers-0.016
 grant-0.015
 had-0.015
idi-0.014
Token the
Token ID262
Feature activation+7.83e-02

Pos logit contributions
 grant+0.022
 before+0.020
etta+0.019
ace+0.019
idi+0.018
Neg logit contributions
und-0.046
rous-0.039
idd-0.038
 batch-0.038
iw-0.038
Token sand
Token ID6450
Feature activation-1.67e-02

Pos logit contributions
idi+0.093
ace+0.088
etta+0.088
 counted+0.081
In+0.080
Neg logit contributions
und-0.119
iw-0.104
 bur-0.104
ho-0.103
rou-0.103
Token.
Token ID13
Feature activation-8.62e-03

Pos logit contributions
und+0.040
rous+0.038
idd+0.038
rou+0.037
iw+0.036
Neg logit contributions
 before-0.015
 grant-0.011
ers-0.011
 floor-0.011
idi-0.010
Token He
Token ID679
Feature activation+9.52e-03

Pos logit contributions
und+0.020
idd+0.018
rous+0.018
rou+0.017
iw+0.017
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.007
ers-0.007
ace-0.007
Token picked
Token ID6497
Feature activation-2.16e-02

Pos logit contributions
 before+0.013
 grant+0.012
ace+0.012
ers+0.011
idi+0.011
Neg logit contributions
und-0.018
idd-0.016
rou-0.016
rous-0.015
iw-0.015
Token it
Token ID340
Feature activation-7.31e-03

Pos logit contributions
und+0.045
idd+0.041
rous+0.040
rou+0.039
unch+0.039
Neg logit contributions
 before-0.028
 grant-0.023
ers-0.023
idi-0.022
 had-0.021
Token,
Token ID11
Feature activation+1.21e-02

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.004
ace+0.004
ers+0.003
Neg logit contributions
und-0.008
idd-0.008
rous-0.007
rou-0.007
iw-0.007
Token and
Token ID290
Feature activation-3.79e-04

Pos logit contributions
 grant+0.014
 before+0.013
idi+0.012
ace+0.012
ers+0.012
Neg logit contributions
und-0.025
idd-0.022
rou-0.022
rous-0.022
iw-0.021
Token it
Token ID340
Feature activation-3.12e-03

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ace-0.000
ers-0.000
Token fell
Token ID3214
Feature activation-2.00e-02

Pos logit contributions
und+0.007
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token on
Token ID319
Feature activation+9.52e-03

Pos logit contributions
und+0.044
rous+0.042
idd+0.042
iw+0.040
rou+0.040
Neg logit contributions
 before-0.022
 grant-0.018
ers-0.017
ace-0.017
 had-0.016
Token the
Token ID262
Feature activation+7.82e-02

Pos logit contributions
 before+0.008
 grant+0.008
idi+0.007
ace+0.007
ers+0.006
Neg logit contributions
und-0.022
idd-0.020
rous-0.020
 batch-0.019
iw-0.019
Token floor
Token ID4314
Feature activation-2.30e-02

Pos logit contributions
idi+0.104
etta+0.092
ace+0.082
In+0.080
lot+0.080
Neg logit contributions
 batch-0.118
und-0.117
idd-0.101
 racket-0.100
iw-0.098
Token.
Token ID13
Feature activation-2.46e-03

Pos logit contributions
und+0.056
rous+0.053
idd+0.053
rou+0.051
iw+0.050
Neg logit contributions
 before-0.021
ers-0.015
 grant-0.015
 floor-0.014
 had-0.013
Token It
Token ID632
Feature activation+1.89e-03

Pos logit contributions
und+0.005
idd+0.005
rous+0.005
rou+0.005
iw+0.005
Neg logit contributions
 before-0.003
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token broke
Token ID6265
Feature activation+5.21e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ace+0.002
ers+0.002
Neg logit contributions
und-0.004
idd-0.003
rous-0.003
rou-0.003
iw-0.003
Token into
Token ID656
Feature activation-2.02e-02

Pos logit contributions
 grant+0.005
 before+0.005
idi+0.004
ers+0.004
ace+0.004
Neg logit contributions
und-0.012
idd-0.011
rou-0.011
rous-0.011
ays-0.010
Token microscope
Token ID36396
Feature activation-2.27e-02

Pos logit contributions
idi+0.060
 before+0.060
ace+0.057
 grant+0.057
etta+0.054
Neg logit contributions
und-0.069
idd-0.063
rou-0.060
iw-0.059
 batch-0.058
Token and
Token ID290
Feature activation+1.62e-02

Pos logit contributions
und+0.056
idd+0.053
rous+0.053
rou+0.052
iw+0.051
Neg logit contributions
 before-0.019
 grant-0.014
ers-0.013
 floor-0.013
idi-0.012
Token it
Token ID340
Feature activation+6.43e-03

Pos logit contributions
 grant+0.020
 before+0.019
ace+0.019
idi+0.018
 entire+0.018
Neg logit contributions
und-0.031
idd-0.027
rous-0.027
iw-0.026
rou-0.026
Token fell
Token ID3214
Feature activation-1.39e-02

Pos logit contributions
 before+0.009
 grant+0.009
idi+0.008
ace+0.008
ers+0.008
Neg logit contributions
und-0.011
idd-0.010
rous-0.010
rou-0.010
uck-0.009
Token on
Token ID319
Feature activation+5.83e-03

Pos logit contributions
und+0.030
rous+0.028
idd+0.028
rou+0.027
iw+0.027
Neg logit contributions
 before-0.015
 grant-0.013
ers-0.012
ace-0.012
 floor-0.012
Token the
Token ID262
Feature activation+7.82e-02

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.004
ace+0.004
ers+0.003
Neg logit contributions
und-0.014
idd-0.013
rous-0.013
rou-0.013
 batch-0.012
Token floor
Token ID4314
Feature activation-1.74e-02

Pos logit contributions
idi+0.086
etta+0.077
In+0.077
 grant+0.077
 before+0.076
Neg logit contributions
und-0.136
 batch-0.117
rou-0.110
idd-0.110
iw-0.110
Token.
Token ID13
Feature activation+2.68e-03

Pos logit contributions
und+0.042
rous+0.039
idd+0.039
rou+0.037
iw+0.037
Neg logit contributions
 before-0.016
 grant-0.012
ers-0.012
 floor-0.011
idi-0.011
Token It
Token ID632
Feature activation+1.30e-02

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ers+0.003
ace+0.003
Neg logit contributions
und-0.006
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token broke
Token ID6265
Feature activation+6.62e-03

Pos logit contributions
 before+0.017
 grant+0.016
idi+0.015
ace+0.015
ers+0.015
Neg logit contributions
und-0.024
idd-0.021
rou-0.021
rous-0.021
uck-0.020
Token into
Token ID656
Feature activation-1.47e-02

Pos logit contributions
 grant+0.006
 before+0.006
idi+0.006
ers+0.005
ace+0.005
Neg logit contributions
und-0.015
idd-0.014
rou-0.013
rous-0.013
ays-0.013
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
und+0.012
idd+0.011
rous+0.011
rou+0.011
iw+0.011
Neg logit contributions
 before-0.006
 grant-0.005
idi-0.005
ers-0.005
ace-0.005
Token The
Token ID383
Feature activation+1.07e-02

Pos logit contributions
 before+0.017
 grant+0.017
ers+0.016
ace+0.016
idi+0.016
Neg logit contributions
und-0.024
idd-0.020
rous-0.020
ays-0.019
uck-0.019
Token coin
Token ID10752
Feature activation+4.03e-03

Pos logit contributions
 before+0.015
idi+0.014
 grant+0.014
ace+0.013
ers+0.012
Neg logit contributions
und-0.019
idd-0.018
iw-0.017
rous-0.017
rou-0.017
Token falls
Token ID8953
Feature activation-1.80e-02

Pos logit contributions
 before+0.006
 grant+0.005
idi+0.005
ace+0.005
ers+0.005
Neg logit contributions
und-0.007
idd-0.006
rous-0.006
rou-0.006
iw-0.006
Token on
Token ID319
Feature activation+1.35e-02

Pos logit contributions
und+0.040
rous+0.038
idd+0.038
rou+0.036
iw+0.036
Neg logit contributions
 before-0.019
 grant-0.016
ers-0.015
ace-0.015
 floor-0.014
Token the
Token ID262
Feature activation+7.81e-02

Pos logit contributions
 grant+0.012
 before+0.012
idi+0.010
ace+0.010
ers+0.009
Neg logit contributions
und-0.031
idd-0.028
rous-0.027
 batch-0.027
uck-0.026
Token floor
Token ID4314
Feature activation-1.67e-02

Pos logit contributions
idi+0.096
etta+0.089
In+0.086
 grant+0.080
 before+0.078
Neg logit contributions
und-0.121
 batch-0.110
iw-0.109
idd-0.104
rou-0.096
Token.
Token ID13
Feature activation+2.79e-03

Pos logit contributions
und+0.040
rous+0.038
idd+0.037
rou+0.036
iw+0.036
Neg logit contributions
 before-0.015
 grant-0.012
ers-0.011
 floor-0.010
idi-0.010
Token It
Token ID632
Feature activation-1.01e-02

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ace+0.003
ers+0.003
Neg logit contributions
und-0.006
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token rolls
Token ID14088
Feature activation-2.16e-02

Pos logit contributions
und+0.019
idd+0.017
rous+0.017
rou+0.017
iw+0.016
Neg logit contributions
 before-0.014
 grant-0.013
idi-0.012
ers-0.012
ace-0.011
Token under
Token ID739
Feature activation+1.97e-02

Pos logit contributions
und+0.049
rous+0.047
idd+0.046
unch+0.046
rou+0.045
Neg logit contributions
 before-0.023
ers-0.017
 grant-0.017
 had-0.017
 floor-0.016
Token the
Token ID262
Feature activation+3.46e-02

Pos logit contributions
und+0.007
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.002
ers-0.002
ace-0.002
Token stones
Token ID14966
Feature activation-6.32e-05

Pos logit contributions
 before+0.043
idi+0.043
 grant+0.040
etta+0.038
ers+0.036
Neg logit contributions
und-0.065
rous-0.060
iw-0.058
rou-0.057
idd-0.057
Token fell
Token ID3214
Feature activation+6.90e-03

Pos logit contributions
und+0.000
idd+0.000
rous+0.000
rou+0.000
iw+0.000
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token back
Token ID736
Feature activation-4.26e-03

Pos logit contributions
 before+0.008
 grant+0.008
idi+0.008
ers+0.007
ace+0.007
Neg logit contributions
und-0.014
idd-0.012
rou-0.012
rous-0.012
ays-0.012
Token on
Token ID319
Feature activation+1.05e-02

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.009
iw+0.008
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.003
ers-0.003
ace-0.003
Token the
Token ID262
Feature activation+7.80e-02

Pos logit contributions
 grant+0.008
 before+0.008
idi+0.007
ace+0.007
ers+0.006
Neg logit contributions
und-0.026
rous-0.023
idd-0.023
rou-0.022
 batch-0.022
Token ground
Token ID2323
Feature activation+1.98e-03

Pos logit contributions
idi+0.119
etta+0.100
 grant+0.094
 before+0.093
ce+0.090
Neg logit contributions
und-0.110
ho-0.094
iw-0.092
 bur-0.092
udd-0.090
Token,
Token ID11
Feature activation+3.16e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ace+0.002
ers+0.002
Neg logit contributions
und-0.004
idd-0.004
rous-0.004
rou-0.004
iw-0.004
Token or
Token ID393
Feature activation+2.09e-02

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ace+0.003
ers+0.003
Neg logit contributions
und-0.007
idd-0.006
rous-0.006
rou-0.006
iw-0.006
Token on
Token ID319
Feature activation+1.07e-02

Pos logit contributions
 grant+0.022
ers+0.022
ace+0.022
idi+0.021
 before+0.021
Neg logit contributions
und-0.041
rous-0.039
idd-0.037
rou-0.037
iw-0.036
Token the
Token ID262
Feature activation+6.52e-02

Pos logit contributions
 grant+0.008
 before+0.008
idi+0.007
ace+0.007
ers+0.006
Neg logit contributions
und-0.026
rous-0.024
idd-0.024
rou-0.023
 batch-0.023
Token back
Token ID736
Feature activation+4.71e-03

Pos logit contributions
idi+0.033
 grant+0.029
ers+0.028
 before+0.028
 had+0.025
Neg logit contributions
und-0.049
uck-0.043
rou-0.042
idd-0.041
ays-0.041
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.005
ers+0.004
ace+0.004
Neg logit contributions
und-0.010
idd-0.009
rous-0.009
rou-0.009
iw-0.009
Token She
Token ID1375
Feature activation-1.88e-03

Pos logit contributions
und+0.003
idd+0.002
rous+0.002
rou+0.002
iw+0.002
Neg logit contributions
 before-0.002
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token fell
Token ID3214
Feature activation-6.38e-03

Pos logit contributions
und+0.004
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token on
Token ID319
Feature activation+1.20e-02

Pos logit contributions
und+0.014
idd+0.013
rous+0.013
rou+0.012
iw+0.012
Neg logit contributions
 before-0.007
 grant-0.006
idi-0.006
ers-0.006
ace-0.006
Token the
Token ID262
Feature activation+7.77e-02

Pos logit contributions
 grant+0.010
 before+0.009
idi+0.009
ace+0.009
ers+0.008
Neg logit contributions
und-0.028
rous-0.025
idd-0.025
rou-0.024
iw-0.024
Token ground
Token ID2323
Feature activation-1.45e-02

Pos logit contributions
idi+0.112
etta+0.102
 grant+0.101
ace+0.100
 before+0.096
Neg logit contributions
und-0.110
idd-0.104
 batch-0.089
iw-0.088
rou-0.087
Token and
Token ID290
Feature activation-8.01e-03

Pos logit contributions
und+0.036
idd+0.033
rous+0.033
rou+0.032
iw+0.031
Neg logit contributions
 before-0.012
 grant-0.010
ers-0.009
idi-0.009
 floor-0.008
Token got
Token ID1392
Feature activation-1.78e-02

Pos logit contributions
und+0.016
idd+0.015
rous+0.015
rou+0.014
iw+0.014
Neg logit contributions
 before-0.010
 grant-0.009
idi-0.008
ers-0.008
ace-0.008
Token wet
Token ID9583
Feature activation-5.99e-03

Pos logit contributions
und+0.038
idd+0.033
rous+0.032
rou+0.032
unch+0.031
Neg logit contributions
 before-0.022
 grant-0.019
idi-0.018
 had-0.018
ers-0.018
Token.
Token ID13
Feature activation+3.23e-03

Pos logit contributions
und+0.014
idd+0.013
rous+0.012
rou+0.012
iw+0.012
Neg logit contributions
 before-0.006
 grant-0.005
idi-0.005
ers-0.005
ace-0.004
Token saw
Token ID2497
Feature activation-1.47e-03

Pos logit contributions
ace+0.064
 grant+0.063
 before+0.062
 contain+0.061
idi+0.056
Neg logit contributions
und-0.065
idd-0.061
uck-0.056
rou-0.055
awed-0.053
Token a
Token ID257
Feature activation+2.27e-03

Pos logit contributions
und+0.003
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.001
ace-0.001
ers-0.001
Token yellow
Token ID7872
Feature activation-8.17e-03

Pos logit contributions
 before+0.003
 grant+0.002
idi+0.002
ace+0.002
ers+0.002
Neg logit contributions
und-0.005
idd-0.004
rous-0.004
rou-0.004
iw-0.004
Token block
Token ID2512
Feature activation-6.97e-04

Pos logit contributions
und+0.016
idd+0.014
rous+0.014
rou+0.014
iw+0.014
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.009
ers-0.009
ace-0.009
Token on
Token ID319
Feature activation+1.53e-02

Pos logit contributions
und+0.002
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ace-0.001
ers-0.001
Token the
Token ID262
Feature activation+7.77e-02

Pos logit contributions
 grant+0.013
 before+0.013
ace+0.012
idi+0.011
etta+0.011
Neg logit contributions
und-0.035
rous-0.031
idd-0.031
 batch-0.030
iw-0.030
Token floor
Token ID4314
Feature activation-5.12e-03

Pos logit contributions
idi+0.105
etta+0.101
 grant+0.092
 forgot+0.088
 granted+0.088
Neg logit contributions
 batch-0.108
iw-0.103
und-0.102
idd-0.095
 course-0.092
Token.
Token ID13
Feature activation-9.64e-03

Pos logit contributions
und+0.012
idd+0.011
rous+0.011
rou+0.010
iw+0.010
Neg logit contributions
 before-0.005
 grant-0.004
idi-0.004
ers-0.004
ace-0.004
Token He
Token ID679
Feature activation+8.88e-03

Pos logit contributions
und+0.020
idd+0.019
rous+0.018
rou+0.018
iw+0.017
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.009
ers-0.009
ace-0.009
Token wanted
Token ID2227
Feature activation+9.12e-03

Pos logit contributions
 before+0.013
 grant+0.012
ace+0.012
ers+0.011
idi+0.011
Neg logit contributions
und-0.016
idd-0.014
rou-0.013
rous-0.013
iw-0.013
Token it
Token ID340
Feature activation-1.45e-02

Pos logit contributions
 grant+0.012
 before+0.012
ace+0.011
idi+0.011
ers+0.011
Neg logit contributions
und-0.016
idd-0.015
rous-0.015
iw-0.014
rou-0.014
Token warm
Token ID5814
Feature activation-6.20e-03

Pos logit contributions
 before+0.010
 grant+0.010
idi+0.009
ace+0.009
ers+0.009
Neg logit contributions
und-0.014
rous-0.013
idd-0.013
rou-0.013
iw-0.012
Token and
Token ID290
Feature activation-3.19e-03

Pos logit contributions
und+0.015
idd+0.014
rous+0.013
rou+0.013
iw+0.013
Neg logit contributions
 before-0.005
 grant-0.005
idi-0.004
ers-0.004
ace-0.004
Token cozy
Token ID37438
Feature activation+1.75e-03

Pos logit contributions
und+0.007
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.003
ers-0.003
ace-0.003
Token in
Token ID287
Feature activation-2.27e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.001
ace+0.001
Neg logit contributions
und-0.004
idd-0.003
rous-0.003
rou-0.003
iw-0.003
Token their
Token ID511
Feature activation-4.77e-03

Pos logit contributions
und+0.005
idd+0.004
rous+0.004
rou+0.004
iw+0.004
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token swe
Token ID3490
Feature activation-8.32e-02

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.008
iw+0.008
Neg logit contributions
 before-0.006
 grant-0.005
idi-0.005
ers-0.005
ace-0.005
Tokenaters
Token ID8605
Feature activation-4.19e-03

Pos logit contributions
und+0.131
idd+0.128
rol+0.124
rou+0.121
haw+0.119
Neg logit contributions
ace-0.107
 be-0.095
 eye-0.095
 hinges-0.094
 before-0.094
Token.
Token ID13
Feature activation+8.06e-03

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.008
iw+0.008
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.003
ers-0.003
ace-0.003
Token They
Token ID1119
Feature activation+6.80e-03

Pos logit contributions
 before+0.011
 grant+0.010
ers+0.010
ace+0.009
idi+0.009
Neg logit contributions
und-0.015
idd-0.013
rous-0.013
rou-0.013
 Enjoy-0.013
Token fall
Token ID2121
Feature activation-1.36e-02

Pos logit contributions
 before+0.009
 grant+0.008
idi+0.008
ers+0.008
ace+0.008
Neg logit contributions
und-0.013
rous-0.011
idd-0.011
rou-0.011
iw-0.011
Token asleep
Token ID16039
Feature activation+1.34e-03

Pos logit contributions
und+0.024
idd+0.021
rous+0.021
rou+0.020
iw+0.020
Neg logit contributions
 before-0.021
 grant-0.019
ers-0.018
ace-0.018
idi-0.018
Token!"
Token ID2474
Feature activation+2.33e-03

Pos logit contributions
und+0.036
rous+0.033
idd+0.033
rou+0.031
iw+0.031
Neg logit contributions
 before-0.034
 grant-0.030
ers-0.029
idi-0.028
 floor-0.028
Token
Token ID198
Feature activation+4.10e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.005
idd-0.005
rous-0.004
rou-0.004
iw-0.004
Token
Token ID198
Feature activation+4.47e-03

Pos logit contributions
 before+0.005
 grant+0.004
idi+0.004
ers+0.004
ace+0.004
Neg logit contributions
und-0.009
idd-0.008
rous-0.007
rou-0.007
unch-0.007
TokenTo
Token ID2514
Feature activation+8.99e-03

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.003
ers+0.003
ace+0.003
Neg logit contributions
und-0.010
idd-0.009
rous-0.009
rou-0.009
unch-0.009
Token her
Token ID607
Feature activation-2.16e-03

Pos logit contributions
 before+0.011
 grant+0.010
idi+0.010
ers+0.009
 floor+0.009
Neg logit contributions
und-0.018
rous-0.016
idd-0.016
rou-0.015
uck-0.015
Token su
Token ID424
Feature activation-8.17e-02

Pos logit contributions
und+0.004
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Tokenprise
Token ID7919
Feature activation-2.23e-02

Pos logit contributions
und+0.108
ays+0.104
rou+0.098
 Sorry+0.095
 afar+0.095
Neg logit contributions
In-0.119
ers-0.119
The-0.118
 before-0.109
idi-0.109
Token,
Token ID11
Feature activation+3.53e-03

Pos logit contributions
und+0.044
idd+0.041
rous+0.040
rou+0.039
iw+0.039
Neg logit contributions
 before-0.029
 grant-0.026
idi-0.025
ers-0.025
 floor-0.023
Token the
Token ID262
Feature activation+4.76e-03

Pos logit contributions
 before+0.003
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
iw-0.008
Token thunder
Token ID18355
Feature activation+1.43e-02

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.005
ace+0.005
ers+0.005
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
 batch-0.008
Token went
Token ID1816
Feature activation-1.70e-02

Pos logit contributions
 before+0.018
 grant+0.018
ace+0.017
idi+0.016
ers+0.015
Neg logit contributions
und-0.028
idd-0.025
rou-0.023
rous-0.023
 batch-0.022
Token candy
Token ID18550
Feature activation-2.06e-02

Pos logit contributions
und+0.032
idd+0.029
rou+0.029
rous+0.029
rol+0.027
Neg logit contributions
 before-0.022
 grant-0.019
ers-0.018
idi-0.018
 floor-0.018
Token and
Token ID290
Feature activation+4.25e-03

Pos logit contributions
und+0.049
idd+0.046
rous+0.045
rou+0.045
rol+0.042
Neg logit contributions
 before-0.021
ers-0.016
 grant-0.015
 floor-0.014
idi-0.014
Token ride
Token ID6594
Feature activation+1.08e-02

Pos logit contributions
 before+0.006
 grant+0.005
ace+0.005
ers+0.005
idi+0.005
Neg logit contributions
und-0.008
idd-0.007
rous-0.007
rou-0.007
iw-0.007
Token in
Token ID287
Feature activation-1.08e-02

Pos logit contributions
 grant+0.014
 before+0.014
idi+0.012
ace+0.012
 floor+0.012
Neg logit contributions
und-0.021
rous-0.018
idd-0.018
iw-0.017
 batch-0.017
Token his
Token ID465
Feature activation-3.13e-02

Pos logit contributions
und+0.029
idd+0.027
rou+0.027
rous+0.027
iw+0.026
Neg logit contributions
 before-0.006
 grant-0.005
idi-0.004
ers-0.004
 floor-0.004
Token toy
Token ID13373
Feature activation-7.88e-02

Pos logit contributions
und+0.072
rol+0.067
rou+0.065
idd+0.064
unch+0.062
Neg logit contributions
 before-0.027
idi-0.027
 floor-0.026
 finish-0.025
 had-0.024
Token je
Token ID11223
Feature activation-3.92e-02

Pos logit contributions
rou+0.182
und+0.167
rol+0.158
unch+0.156
rous+0.156
Neg logit contributions
 other-0.076
 shelf-0.075
 floor-0.075
 before-0.071
ers-0.067
Tokenep
Token ID538
Feature activation+1.10e-03

Pos logit contributions
und+0.073
 Enjoy+0.065
rous+0.065
 Sorry+0.064
idd+0.064
Neg logit contributions
ers-0.050
 before-0.046
idi-0.042
 floor-0.042
 entire-0.041
Token.
Token ID13
Feature activation-6.91e-03

Pos logit contributions
 before+0.001
 grant+0.001
idi+0.001
ace+0.001
ers+0.001
Neg logit contributions
und-0.002
idd-0.002
rous-0.002
rou-0.002
iw-0.002
Token One
Token ID1881
Feature activation+1.36e-02

Pos logit contributions
und+0.014
idd+0.013
rous+0.012
rou+0.012
iw+0.012
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.008
ers-0.007
ace-0.007
Token day
Token ID1110
Feature activation-7.58e-03

Pos logit contributions
 grant+0.019
 before+0.017
ers+0.017
idi+0.016
ace+0.016
Neg logit contributions
und-0.024
rous-0.023
idd-0.022
rou-0.022
 batch-0.021
Token ear
Token ID1027
Feature activation-3.30e-05

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ers+0.002
ace+0.002
Neg logit contributions
und-0.005
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token.
Token ID13
Feature activation+1.53e-02

Pos logit contributions
und+0.000
idd+0.000
rous+0.000
rou+0.000
iw+0.000
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token Ben
Token ID3932
Feature activation+8.84e-03

Pos logit contributions
 before+0.019
 grant+0.019
ace+0.018
idi+0.017
ers+0.017
Neg logit contributions
und-0.030
idd-0.026
rous-0.025
uck-0.025
ays-0.025
Token cried
Token ID16896
Feature activation-1.28e-02

Pos logit contributions
 before+0.011
 grant+0.011
idi+0.010
ace+0.010
ers+0.010
Neg logit contributions
und-0.017
idd-0.015
rou-0.014
rous-0.014
iw-0.014
Token and
Token ID290
Feature activation-2.40e-02

Pos logit contributions
und+0.030
idd+0.028
rous+0.028
iw+0.027
rou+0.027
Neg logit contributions
 before-0.012
 grant-0.010
ace-0.009
ers-0.009
idi-0.009
Token bl
Token ID698
Feature activation-7.75e-02

Pos logit contributions
und+0.051
rous+0.048
idd+0.046
rou+0.046
iw+0.044
Neg logit contributions
 before-0.029
 grant-0.026
 had-0.025
idi-0.024
ers-0.023
Tokened
Token ID276
Feature activation-3.11e-02

Pos logit contributions
rous+0.157
aman+0.156
 batch+0.155
 Enjoy+0.147
und+0.142
Neg logit contributions
 had-0.078
ace-0.073
ers-0.070
piece-0.069
 forgot-0.069
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
und+0.083
idd+0.081
rous+0.080
unch+0.079
iw+0.078
Neg logit contributions
 before-0.024
 had-0.014
ers-0.014
ace-0.012
 grant-0.011
Token Lily
Token ID20037
Feature activation+1.71e-02

Pos logit contributions
 grant+0.032
 before+0.031
ace+0.030
ers+0.028
idi+0.027
Neg logit contributions
und-0.041
idd-0.036
uck-0.034
iw-0.033
rous-0.033
Token screamed
Token ID25421
Feature activation-2.52e-02

Pos logit contributions
 before+0.020
 grant+0.020
idi+0.018
ace+0.018
ers+0.017
Neg logit contributions
und-0.034
idd-0.030
rou-0.029
uck-0.028
iw-0.028
Token and
Token ID290
Feature activation-2.37e-02

Pos logit contributions
und+0.066
rous+0.063
idd+0.063
iw+0.061
rou+0.059
Neg logit contributions
 before-0.021
 grant-0.014
 had-0.012
ace-0.012
 floor-0.012
Token takes
Token ID2753
Feature activation-6.01e-03

Pos logit contributions
und+0.021
idd+0.018
rous+0.018
rou+0.017
unch+0.017
Neg logit contributions
 before-0.018
 grant-0.017
idi-0.016
ers-0.016
ace-0.016
Token his
Token ID465
Feature activation+3.86e-03

Pos logit contributions
und+0.013
idd+0.012
rous+0.012
rou+0.011
iw+0.011
Neg logit contributions
 before-0.007
 grant-0.006
idi-0.006
ers-0.006
ace-0.005
Token ball
Token ID2613
Feature activation-1.52e-02

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.005
ers+0.005
ace+0.005
Neg logit contributions
und-0.007
idd-0.006
rous-0.006
iw-0.006
rou-0.006
Token and
Token ID290
Feature activation-9.81e-03

Pos logit contributions
und+0.038
idd+0.035
rous+0.035
rou+0.034
iw+0.033
Neg logit contributions
 before-0.013
 grant-0.010
ers-0.009
ace-0.008
idi-0.008
Token his
Token ID465
Feature activation-8.28e-03

Pos logit contributions
und+0.018
idd+0.016
rous+0.016
rou+0.016
unch+0.015
Neg logit contributions
 before-0.014
 grant-0.013
idi-0.012
ers-0.012
ace-0.012
Token k
Token ID479
Feature activation-7.75e-02

Pos logit contributions
und+0.016
idd+0.015
rous+0.014
rou+0.014
iw+0.014
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.009
ace-0.009
ers-0.009
Tokenite
Token ID578
Feature activation-8.52e-03

Pos logit contributions
aman+0.126
und+0.117
rous+0.109
arre+0.105
 batch+0.105
Neg logit contributions
ace-0.098
ers-0.089
 before-0.088
 be-0.087
 corner-0.085
Token and
Token ID290
Feature activation-5.28e-03

Pos logit contributions
und+0.020
idd+0.018
rous+0.018
rou+0.018
iw+0.017
Neg logit contributions
 before-0.008
 grant-0.007
idi-0.006
ers-0.006
ace-0.006
Token his
Token ID465
Feature activation-2.13e-03

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.009
iw+0.008
Neg logit contributions
 before-0.007
 grant-0.007
idi-0.006
ers-0.006
ace-0.006
Token bike
Token ID7161
Feature activation-2.17e-02

Pos logit contributions
und+0.004
idd+0.004
rous+0.004
rou+0.003
iw+0.003
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.002
Token.
Token ID13
Feature activation+1.93e-03

Pos logit contributions
und+0.055
rous+0.051
idd+0.051
rou+0.049
unch+0.048
Neg logit contributions
 before-0.019
 grant-0.014
ers-0.014
 had-0.012
 finish-0.012
Token can
Token ID460
Feature activation+7.87e-03

Pos logit contributions
 before+0.008
idi+0.007
 grant+0.007
ers+0.007
ace+0.007
Neg logit contributions
und-0.010
idd-0.009
rous-0.009
rou-0.009
iw-0.009
Token we
Token ID356
Feature activation-7.06e-03

Pos logit contributions
 before+0.012
idi+0.011
ers+0.011
 grant+0.011
ace+0.011
Neg logit contributions
und-0.013
idd-0.012
rous-0.011
uck-0.011
iw-0.011
Token go
Token ID467
Feature activation-9.35e-03

Pos logit contributions
und+0.016
idd+0.014
rous+0.014
rou+0.014
iw+0.014
Neg logit contributions
 before-0.007
 grant-0.007
idi-0.006
ace-0.006
ers-0.006
Token in
Token ID287
Feature activation-2.00e-02

Pos logit contributions
und+0.019
idd+0.017
rous+0.017
rou+0.017
iw+0.016
Neg logit contributions
 before-0.012
 grant-0.010
idi-0.010
ers-0.010
ace-0.009
Token the
Token ID262
Feature activation-3.66e-02

Pos logit contributions
und+0.049
idd+0.046
rou+0.044
unch+0.044
rous+0.044
Neg logit contributions
 before-0.018
 grant-0.013
idi-0.012
ers-0.012
 floor-0.012
Token toy
Token ID13373
Feature activation-7.68e-02

Pos logit contributions
und+0.071
unch+0.067
rou+0.065
idd+0.063
iw+0.060
Neg logit contributions
 before-0.046
 finish-0.046
 floor-0.046
 corner-0.044
 ground-0.044
Token store
Token ID3650
Feature activation-2.36e-02

Pos logit contributions
und+0.172
unch+0.171
rou+0.168
haw+0.165
beh+0.164
Neg logit contributions
 before-0.085
 finish-0.072
 floor-0.071
 end-0.069
 corner-0.065
Token?"
Token ID1701
Feature activation-3.43e-03

Pos logit contributions
und+0.047
idd+0.043
rous+0.041
rou+0.041
unch+0.040
Neg logit contributions
 before-0.033
 grant-0.027
 finish-0.026
ers-0.025
idi-0.025
Token Her
Token ID2332
Feature activation-8.06e-04

Pos logit contributions
und+0.007
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.004
ers-0.003
ace-0.003
Token mom
Token ID1995
Feature activation-1.33e-02

Pos logit contributions
und+0.002
idd+0.002
rous+0.002
rou+0.002
iw+0.002
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Tokenmy
Token ID1820
Feature activation-6.33e-03

Pos logit contributions
und+0.024
rous+0.022
idd+0.021
rou+0.020
iw+0.020
Neg logit contributions
 before-0.020
 grant-0.017
idi-0.017
ers-0.017
 had-0.017
Token head
Token ID1182
Feature activation-7.50e-03

Pos logit contributions
idi+0.068
 before+0.057
etta+0.056
 grant+0.054
In+0.054
Neg logit contributions
und-0.106
 batch-0.097
idd-0.096
rou-0.095
ays-0.087
Token.
Token ID13
Feature activation+8.52e-03

Pos logit contributions
und+0.017
idd+0.016
rous+0.016
rou+0.016
iw+0.015
Neg logit contributions
 before-0.007
 grant-0.006
idi-0.006
ers-0.005
ace-0.005
Token Ben
Token ID3932
Feature activation+9.13e-03

Pos logit contributions
 before+0.010
 grant+0.010
idi+0.009
ace+0.009
ers+0.009
Neg logit contributions
und-0.017
idd-0.015
rous-0.015
iw-0.015
rou-0.015
Token cried
Token ID16896
Feature activation-1.08e-02

Pos logit contributions
 before+0.013
 grant+0.012
idi+0.012
ace+0.011
ers+0.011
Neg logit contributions
und-0.017
idd-0.015
rou-0.014
rous-0.014
uck-0.013
Token and
Token ID290
Feature activation-1.76e-02

Pos logit contributions
und+0.026
idd+0.024
rous+0.024
rou+0.023
iw+0.023
Neg logit contributions
 before-0.010
 grant-0.009
ace-0.008
idi-0.008
ers-0.008
Token bl
Token ID698
Feature activation-7.66e-02

Pos logit contributions
und+0.034
rous+0.031
idd+0.030
rou+0.030
unch+0.029
Neg logit contributions
 before-0.024
 grant-0.023
 had-0.021
idi-0.021
ers-0.020
Tokened
Token ID276
Feature activation-3.36e-02

Pos logit contributions
aman+0.153
 batch+0.150
rous+0.149
 course+0.142
 Enjoy+0.141
Neg logit contributions
 had-0.080
piece-0.078
ace-0.075
 forgot-0.073
ers-0.066
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
und+0.090
idd+0.089
rous+0.088
unch+0.086
iw+0.085
Neg logit contributions
 before-0.025
 had-0.013
ace-0.012
ers-0.012
 floor-0.012
Token Sam
Token ID3409
Feature activation+1.06e-02

Pos logit contributions
 before+0.020
 grant+0.019
ace+0.018
ers+0.018
idi+0.018
Neg logit contributions
und-0.030
idd-0.026
rous-0.025
uck-0.025
iw-0.024
Token screamed
Token ID25421
Feature activation-2.74e-02

Pos logit contributions
 before+0.014
 grant+0.013
idi+0.012
ace+0.012
ers+0.011
Neg logit contributions
und-0.021
idd-0.018
rou-0.018
rous-0.017
uck-0.017
Token and
Token ID290
Feature activation-1.60e-02

Pos logit contributions
und+0.075
rous+0.072
idd+0.071
iw+0.070
rou+0.067
Neg logit contributions
 before-0.020
 grant-0.012
 had-0.010
 floor-0.010
ace-0.010
Token dog
Token ID3290
Feature activation-1.86e-02

Pos logit contributions
und+0.043
idd+0.038
rous+0.038
rou+0.037
rol+0.037
Neg logit contributions
 before-0.027
 grant-0.024
 floor-0.024
idi-0.023
 had-0.023
Token looks
Token ID3073
Feature activation-3.03e-02

Pos logit contributions
und+0.035
idd+0.032
rous+0.031
rou+0.030
unch+0.029
Neg logit contributions
 before-0.026
 grant-0.024
idi-0.023
ers-0.022
 had-0.022
Token at
Token ID379
Feature activation-6.13e-03

Pos logit contributions
und+0.056
idd+0.050
unch+0.046
rou+0.046
iw+0.045
Neg logit contributions
 before-0.047
idi-0.041
 had-0.040
 grant-0.040
ers-0.037
Token him
Token ID683
Feature activation-7.49e-03

Pos logit contributions
und+0.013
idd+0.012
rous+0.012
rou+0.012
iw+0.011
Neg logit contributions
 before-0.007
 grant-0.006
idi-0.006
ers-0.006
ace-0.006
Token and
Token ID290
Feature activation-2.61e-02

Pos logit contributions
und+0.017
idd+0.016
rous+0.016
rou+0.015
iw+0.015
Neg logit contributions
 before-0.007
 grant-0.006
idi-0.006
ers-0.006
ace-0.006
Token bl
Token ID698
Feature activation-7.62e-02

Pos logit contributions
und+0.052
idd+0.048
rous+0.046
unch+0.045
rou+0.045
Neg logit contributions
 before-0.036
 grant-0.032
idi-0.031
 had-0.030
 floor-0.028
Tokeninks
Token ID2973
Feature activation-1.91e-02

Pos logit contributions
und+0.131
 batch+0.130
aman+0.129
 course+0.121
rous+0.117
Neg logit contributions
piece-0.094
ace-0.090
ia-0.087
 before-0.084
 head-0.081
Token.
Token ID13
Feature activation+9.39e-03

Pos logit contributions
und+0.044
idd+0.041
rous+0.041
iw+0.039
rou+0.039
Neg logit contributions
 before-0.019
 grant-0.016
idi-0.015
ers-0.015
ace-0.014
Token Ben
Token ID3932
Feature activation+2.05e-03

Pos logit contributions
 before+0.011
 grant+0.011
ace+0.010
ers+0.010
idi+0.010
Neg logit contributions
und-0.019
idd-0.017
rous-0.016
iw-0.016
rou-0.016
Token thinks
Token ID6834
Feature activation+1.31e-02

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.004
idd-0.003
rous-0.003
rou-0.003
iw-0.003
Token the
Token ID262
Feature activation-2.85e-02

Pos logit contributions
 grant+0.017
 before+0.016
idi+0.016
ace+0.015
etta+0.015
Neg logit contributions
und-0.024
idd-0.021
rou-0.021
rous-0.021
iw-0.020
Token Looking
Token ID15616
Feature activation-1.18e-02

Pos logit contributions
 before+0.012
 grant+0.011
idi+0.011
ace+0.011
ers+0.011
Neg logit contributions
und-0.018
idd-0.016
rous-0.015
rou-0.015
uck-0.015
Token back
Token ID736
Feature activation-1.31e-02

Pos logit contributions
und+0.024
idd+0.022
rous+0.021
rou+0.021
unch+0.021
Neg logit contributions
 before-0.015
 grant-0.013
idi-0.012
ers-0.012
ace-0.012
Token,
Token ID11
Feature activation+1.04e-03

Pos logit contributions
und+0.027
idd+0.025
rous+0.024
rou+0.024
 batch+0.024
Neg logit contributions
 before-0.016
 grant-0.013
idi-0.013
ace-0.012
ers-0.012
Token he
Token ID339
Feature activation+7.55e-03

Pos logit contributions
 before+0.001
 grant+0.001
idi+0.001
ace+0.001
ers+0.001
Neg logit contributions
und-0.002
idd-0.002
rous-0.002
rou-0.002
iw-0.002
Token was
Token ID373
Feature activation+1.24e-02

Pos logit contributions
 before+0.012
 grant+0.011
ace+0.011
ers+0.011
idi+0.010
Neg logit contributions
und-0.012
idd-0.011
rous-0.011
rou-0.011
iw-0.010
Token su
Token ID424
Feature activation-7.62e-02

Pos logit contributions
 grant+0.018
 before+0.018
idi+0.016
ers+0.016
ace+0.016
Neg logit contributions
und-0.021
idd-0.019
rou-0.019
rous-0.019
unch-0.018
Tokenpr
Token ID1050
Feature activation-3.03e-02

Pos logit contributions
und+0.102
 Enjoy+0.090
ays+0.089
 Sorry+0.087
rous+0.082
Neg logit contributions
The-0.117
 before-0.116
ers-0.116
 grant-0.113
In-0.113
Tokenised
Token ID1417
Feature activation-3.61e-03

Pos logit contributions
und+0.042
idd+0.039
 Enjoy+0.039
iw+0.038
 Sorry+0.038
Neg logit contributions
 before-0.050
ace-0.047
 grant-0.046
idi-0.045
 entire-0.045
Token to
Token ID284
Feature activation+1.18e-02

Pos logit contributions
und+0.008
idd+0.007
rous+0.007
rou+0.007
iw+0.007
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.003
ers-0.003
ace-0.003
Token see
Token ID766
Feature activation-2.21e-02

Pos logit contributions
ers+0.012
 before+0.012
ace+0.011
idi+0.011
 grant+0.011
Neg logit contributions
und-0.024
rous-0.022
idd-0.022
 Enjoy-0.021
rou-0.021
Token a
Token ID257
Feature activation-1.30e-02

Pos logit contributions
und+0.047
idd+0.044
rous+0.042
rou+0.042
 batch+0.041
Neg logit contributions
 before-0.027
 grant-0.023
ers-0.020
idi-0.020
 had-0.020
Token them
Token ID606
Feature activation-2.98e-02

Pos logit contributions
 grant+0.007
 before+0.007
idi+0.007
ers+0.006
ace+0.006
Neg logit contributions
und-0.010
idd-0.009
rous-0.008
rou-0.008
ays-0.008
Token.
Token ID13
Feature activation+6.82e-03

Pos logit contributions
und+0.073
rous+0.071
idd+0.071
iw+0.069
rou+0.068
Neg logit contributions
 before-0.027
ace-0.018
 grant-0.018
ers-0.018
 had-0.018
Token They
Token ID1119
Feature activation+6.53e-03

Pos logit contributions
 before+0.008
 grant+0.008
idi+0.007
ers+0.007
ace+0.007
Neg logit contributions
und-0.014
idd-0.012
rous-0.012
iw-0.012
rou-0.012
Token cried
Token ID16896
Feature activation-7.31e-03

Pos logit contributions
 before+0.009
 grant+0.009
idi+0.009
ers+0.008
ace+0.008
Neg logit contributions
und-0.012
idd-0.010
rous-0.010
rou-0.010
iw-0.010
Token and
Token ID290
Feature activation-1.57e-02

Pos logit contributions
und+0.017
idd+0.016
rous+0.016
rou+0.015
iw+0.015
Neg logit contributions
 before-0.007
 grant-0.006
idi-0.005
ace-0.005
ers-0.005
Token bl
Token ID698
Feature activation-7.56e-02

Pos logit contributions
und+0.030
rous+0.028
idd+0.028
rou+0.027
iw+0.026
Neg logit contributions
 before-0.021
 grant-0.019
idi-0.018
 had-0.018
ers-0.017
Tokened
Token ID276
Feature activation-2.84e-02

Pos logit contributions
rous+0.149
aman+0.148
 batch+0.147
 course+0.139
 Enjoy+0.137
Neg logit contributions
 had-0.076
ace-0.075
piece-0.074
 forgot-0.071
ers-0.071
Token and
Token ID290
Feature activation-1.81e-02

Pos logit contributions
und+0.073
idd+0.070
rous+0.070
iw+0.068
unch+0.068
Neg logit contributions
 before-0.023
 had-0.015
ers-0.015
ace-0.014
 grant-0.014
Token hurt
Token ID5938
Feature activation-9.85e-04

Pos logit contributions
und+0.037
idd+0.034
rous+0.033
rou+0.032
unch+0.032
Neg logit contributions
 before-0.024
 grant-0.021
 had-0.020
idi-0.019
 floor-0.019
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
und+0.002
idd+0.002
rous+0.002
rou+0.002
iw+0.002
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token
Token ID198
Feature activation+1.38e-02

Pos logit contributions
 before+0.027
 grant+0.026
ers+0.025
idi+0.025
ace+0.024
Neg logit contributions
und-0.036
idd-0.031
rous-0.029
ays-0.028
iw-0.028
Token back
Token ID736
Feature activation-8.18e-03

Pos logit contributions
und+0.018
idd+0.016
rous+0.016
rou+0.016
iw+0.015
Neg logit contributions
 before-0.012
 grant-0.011
idi-0.011
ers-0.010
ace-0.010
Token to
Token ID284
Feature activation-2.97e-02

Pos logit contributions
und+0.018
idd+0.016
rous+0.016
rou+0.016
iw+0.015
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.007
ers-0.007
ace-0.007
Token their
Token ID511
Feature activation-1.80e-02

Pos logit contributions
und+0.075
idd+0.070
rol+0.069
rou+0.068
rous+0.067
Neg logit contributions
 before-0.026
 grant-0.020
idi-0.018
 finish-0.018
 the-0.017
Token ball
Token ID2613
Feature activation-2.28e-02

Pos logit contributions
und+0.038
idd+0.034
rol+0.033
rou+0.032
rous+0.032
Neg logit contributions
 before-0.022
 grant-0.019
idi-0.019
 floor-0.019
 finish-0.018
Token and
Token ID290
Feature activation+1.28e-02

Pos logit contributions
und+0.057
rous+0.053
idd+0.053
rou+0.051
iw+0.050
Neg logit contributions
 before-0.020
ers-0.014
 grant-0.014
ace-0.013
 floor-0.013
Token k
Token ID479
Feature activation-7.52e-02

Pos logit contributions
 before+0.016
 grant+0.016
ers+0.015
ace+0.015
idi+0.015
Neg logit contributions
und-0.024
rous-0.021
idd-0.021
iw-0.020
rou-0.020
Tokenite
Token ID578
Feature activation-1.65e-02

Pos logit contributions
aman+0.110
und+0.109
 batch+0.104
 Enjoy+0.097
rou+0.096
Neg logit contributions
The-0.098
ers-0.097
 before-0.089
ars-0.086
 corner-0.081
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
und+0.039
idd+0.036
rous+0.036
rou+0.035
iw+0.034
Neg logit contributions
 before-0.016
 grant-0.012
ers-0.012
idi-0.011
ace-0.011
Token They
Token ID1119
Feature activation+6.81e-03

Pos logit contributions
 before+0.014
 grant+0.013
ers+0.013
idi+0.012
ace+0.012
Neg logit contributions
und-0.019
idd-0.017
rous-0.016
 Enjoy-0.016
iw-0.016
Token are
Token ID389
Feature activation+8.03e-03

Pos logit contributions
 before+0.008
 grant+0.008
idi+0.008
ers+0.007
ace+0.007
Neg logit contributions
und-0.013
idd-0.012
rous-0.012
rou-0.012
iw-0.012
Token happy
Token ID3772
Feature activation-6.94e-03

Pos logit contributions
 before+0.011
 grant+0.011
idi+0.010
ers+0.010
 floor+0.010
Neg logit contributions
und-0.014
idd-0.013
rou-0.013
rous-0.013
iw-0.012
Token hurt
Token ID5938
Feature activation-1.66e-03

Pos logit contributions
 before+0.013
 grant+0.012
idi+0.011
ace+0.011
ers+0.011
Neg logit contributions
und-0.015
idd-0.013
rous-0.013
rou-0.012
iw-0.012
Token her
Token ID607
Feature activation+8.46e-03

Pos logit contributions
und+0.003
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token knee
Token ID10329
Feature activation-1.37e-02

Pos logit contributions
 before+0.011
 grant+0.011
idi+0.011
ers+0.010
ace+0.009
Neg logit contributions
und-0.016
idd-0.014
rou-0.014
rous-0.014
uck-0.013
Token and
Token ID290
Feature activation-9.73e-03

Pos logit contributions
und+0.033
idd+0.031
rous+0.031
iw+0.030
rou+0.030
Neg logit contributions
 before-0.012
 grant-0.010
ers-0.008
 floor-0.008
ace-0.008
Token it
Token ID340
Feature activation-1.10e-02

Pos logit contributions
und+0.020
idd+0.018
rous+0.018
rou+0.017
iw+0.017
Neg logit contributions
 before-0.012
 grant-0.011
idi-0.010
ers-0.010
 floor-0.010
Token bl
Token ID698
Feature activation-7.33e-02

Pos logit contributions
und+0.024
idd+0.022
rous+0.022
rou+0.022
iw+0.021
Neg logit contributions
 before-0.012
 grant-0.010
idi-0.010
ers-0.009
 had-0.009
Tokened
Token ID276
Feature activation-4.44e-02

Pos logit contributions
 batch+0.149
aman+0.145
rous+0.144
und+0.140
 sorts+0.134
Neg logit contributions
ace-0.075
 had-0.070
piece-0.069
 before-0.067
kin-0.066
Token.
Token ID13
Feature activation+1.06e-03

Pos logit contributions
und+0.113
idd+0.112
rous+0.109
unch+0.108
iw+0.107
Neg logit contributions
 before-0.040
 had-0.028
ers-0.024
ace-0.024
 end-0.021
Token
Token ID198
Feature activation+1.27e-02

Pos logit contributions
 before+0.001
 grant+0.001
idi+0.001
ers+0.001
ace+0.001
Neg logit contributions
und-0.002
idd-0.002
rous-0.002
rou-0.002
iw-0.002
Token
Token ID198
Feature activation+1.45e-02

Pos logit contributions
 before+0.016
 grant+0.015
ers+0.014
ace+0.014
idi+0.014
Neg logit contributions
und-0.025
idd-0.022
rous-0.022
unch-0.021
rou-0.021
TokenL
Token ID43
Feature activation+7.39e-03

Pos logit contributions
 before+0.019
 grant+0.017
ace+0.017
idi+0.016
ers+0.016
Neg logit contributions
und-0.029
idd-0.026
rous-0.025
uck-0.024
rou-0.024
Token little
Token ID1310
Feature activation-1.67e-02

Pos logit contributions
und+0.014
idd+0.013
rous+0.013
rou+0.013
iw+0.012
Neg logit contributions
 before-0.008
 grant-0.007
idi-0.007
ers-0.006
ace-0.006
Token helicopter
Token ID15392
Feature activation-1.49e-02

Pos logit contributions
und+0.036
idd+0.032
rou+0.032
rous+0.032
unch+0.031
Neg logit contributions
 before-0.020
 grant-0.017
idi-0.016
ers-0.016
 had-0.015
Token and
Token ID290
Feature activation-8.76e-03

Pos logit contributions
und+0.029
idd+0.026
rous+0.026
rou+0.025
iw+0.024
Neg logit contributions
 before-0.020
 grant-0.018
ers-0.017
idi-0.017
ace-0.017
Token the
Token ID262
Feature activation+1.20e-02

Pos logit contributions
und+0.018
idd+0.016
rous+0.016
rou+0.016
iw+0.016
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.009
ers-0.009
ace-0.008
Token little
Token ID1310
Feature activation-2.15e-02

Pos logit contributions
 before+0.013
 grant+0.012
idi+0.011
ers+0.011
ace+0.011
Neg logit contributions
und-0.025
idd-0.023
rous-0.023
rou-0.022
iw-0.021
Token t
Token ID256
Feature activation-7.22e-02

Pos logit contributions
und+0.048
rou+0.043
idd+0.043
rous+0.043
unch+0.042
Neg logit contributions
 before-0.023
 grant-0.020
 had-0.019
idi-0.018
ace-0.018
Tokeneddy
Token ID21874
Feature activation-9.26e-03

Pos logit contributions
rous+0.134
und+0.127
 Enjoy+0.126
 batch+0.125
iw+0.124
Neg logit contributions
ace-0.083
ers-0.074
 entire-0.069
 grant-0.068
 before-0.066
Token bear
Token ID6842
Feature activation-2.34e-02

Pos logit contributions
und+0.021
idd+0.019
rous+0.019
rou+0.019
iw+0.018
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.008
ers-0.008
ace-0.007
Token mom
Token ID1995
Feature activation-2.02e-02

Pos logit contributions
und+0.046
rous+0.042
idd+0.042
rou+0.040
iw+0.039
Neg logit contributions
 before-0.031
 grant-0.028
ers-0.026
idi-0.025
 had-0.025
Tokenmy
Token ID1820
Feature activation-2.75e-02

Pos logit contributions
und+0.039
rous+0.036
idd+0.035
rou+0.034
iw+0.033
Neg logit contributions
 before-0.027
 grant-0.025
 had-0.023
ers-0.023
ace-0.022
Token,
Token ID11
Feature activation-1.22e-02

Pos logit contributions
und+0.053
rous+0.049
idd+0.048
rou+0.046
iw+0.046
Neg logit contributions
 before-0.037
 grant-0.034
 had-0.031
ers-0.031
ace-0.031
Token Now
Token ID2735
Feature activation+1.48e-02

Pos logit contributions
 before+0.013
 grant+0.012
idi+0.012
ers+0.012
 floor+0.012
Neg logit contributions
und-0.011
idd-0.010
rous-0.010
rou-0.010
iw-0.009
Token,
Token ID11
Feature activation+2.19e-03

Pos logit contributions
 before+0.017
idi+0.017
 grant+0.017
ers+0.017
ace+0.016
Neg logit contributions
und-0.028
rous-0.025
uck-0.024
rou-0.024
idd-0.024
Token you
Token ID345
Feature activation-4.71e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.005
idd-0.005
rous-0.005
rou-0.004
iw-0.004
Token have
Token ID423
Feature activation-4.00e-03

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.009
iw+0.008
Neg logit contributions
 before-0.006
 grant-0.005
idi-0.005
ers-0.005
ace-0.005
Token three
Token ID1115
Feature activation-7.20e-03

Pos logit contributions
und+0.008
idd+0.008
rous+0.008
rou+0.007
iw+0.007
Neg logit contributions
 before-0.005
 grant-0.004
idi-0.004
ers-0.004
ace-0.004
Token swe
Token ID3490
Feature activation-7.20e-02

Pos logit contributions
und+0.015
idd+0.014
rous+0.013
rou+0.013
iw+0.013
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.007
ers-0.007
ace-0.007
Token<|endoftext|>
Token ID50256
Feature activation+1.38e-02

Pos logit contributions
und+0.123
rol+0.111
idd+0.108
unch+0.107
rous+0.106
Neg logit contributions
ace-0.093
 before-0.075
 hinges-0.073
piece-0.069
 be-0.068
TokenL
Token ID43
Feature activation+1.55e-03

Pos logit contributions
 before+0.025
 grant+0.024
idi+0.023
ace+0.022
 floor+0.022
Neg logit contributions
und-0.020
idd-0.018
rous-0.016
iw-0.016
uck-0.016
Tokenila
Token ID10102
Feature activation+1.70e-02

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ace+0.002
ers+0.002
Neg logit contributions
und-0.003
idd-0.002
rous-0.002
rou-0.002
iw-0.002
Token was
Token ID373
Feature activation-2.27e-02

Pos logit contributions
 before+0.018
 grant+0.018
idi+0.017
ace+0.016
ers+0.015
Neg logit contributions
und-0.036
idd-0.033
rous-0.031
rou-0.031
uck-0.031
Token a
Token ID257
Feature activation+6.51e-05

Pos logit contributions
und+0.038
idd+0.032
rou+0.032
iw+0.032
rous+0.031
Neg logit contributions
 before-0.036
idi-0.033
 grant-0.032
ace-0.032
ers-0.031
Token mom
Token ID1995
Feature activation-6.44e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.004
idd-0.004
rous-0.004
rou-0.004
uck-0.004
Token took
Token ID1718
Feature activation-3.37e-02

Pos logit contributions
und+0.012
idd+0.011
rous+0.011
rou+0.010
iw+0.010
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.008
ers-0.008
ace-0.008
Token him
Token ID683
Feature activation-3.34e-03

Pos logit contributions
und+0.053
rou+0.051
idd+0.048
unch+0.047
iw+0.047
Neg logit contributions
 before-0.057
idi-0.051
ers-0.051
ace-0.050
 finish-0.049
Token to
Token ID284
Feature activation+4.13e-03

Pos logit contributions
und+0.007
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.003
ers-0.003
ace-0.003
Token a
Token ID257
Feature activation-3.82e-02

Pos logit contributions
 before+0.003
 grant+0.003
ers+0.003
idi+0.003
ace+0.003
Neg logit contributions
und-0.010
rous-0.009
idd-0.009
rou-0.009
iw-0.009
Token k
Token ID479
Feature activation-7.20e-02

Pos logit contributions
und+0.088
idd+0.079
ays+0.077
rol+0.076
unch+0.075
Neg logit contributions
 before-0.043
idi-0.037
 grant-0.036
 finish-0.036
 floor-0.035
Tokenara
Token ID3301
Feature activation-1.52e-02

Pos logit contributions
und+0.103
haw+0.088
aman+0.087
ahoo+0.085
 Enjoy+0.083
Neg logit contributions
 before-0.109
 corner-0.107
The-0.107
ers-0.095
 shelf-0.095
Tokenoke
Token ID2088
Feature activation-2.70e-02

Pos logit contributions
und+0.034
idd+0.031
rous+0.030
rou+0.030
iw+0.029
Neg logit contributions
 before-0.017
 grant-0.014
idi-0.013
ers-0.013
 floor-0.013
Token party
Token ID2151
Feature activation-3.46e-03

Pos logit contributions
und+0.050
rous+0.044
rou+0.044
iw+0.042
unch+0.042
Neg logit contributions
 before-0.039
ers-0.036
ace-0.035
 grant-0.034
idi-0.033
Token.
Token ID13
Feature activation-5.92e-03

Pos logit contributions
und+0.007
idd+0.007
rous+0.007
rou+0.006
iw+0.006
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.003
ers-0.003
ace-0.003
Token There
Token ID1318
Feature activation-3.21e-02

Pos logit contributions
und+0.011
idd+0.009
rous+0.009
rou+0.009
iw+0.009
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.008
ers-0.008
ace-0.007
Token got
Token ID1392
Feature activation-1.37e-02

Pos logit contributions
und+0.022
idd+0.020
rous+0.020
rou+0.019
iw+0.019
Neg logit contributions
 before-0.014
 grant-0.013
idi-0.012
ers-0.012
ace-0.012
Token off
Token ID572
Feature activation+8.72e-03

Pos logit contributions
und+0.030
idd+0.027
rou+0.026
rous+0.026
unch+0.026
Neg logit contributions
 before-0.016
 grant-0.013
idi-0.013
ers-0.012
 had-0.012
Token the
Token ID262
Feature activation+6.92e-03

Pos logit contributions
 before+0.008
 grant+0.008
idi+0.007
ace+0.007
ers+0.007
Neg logit contributions
und-0.019
idd-0.017
rous-0.017
rou-0.017
iw-0.017
Token couch
Token ID18507
Feature activation-3.10e-02

Pos logit contributions
 before+0.008
 grant+0.008
idi+0.007
ers+0.007
ace+0.007
Neg logit contributions
und-0.014
rous-0.012
idd-0.012
rou-0.012
 batch-0.012
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
und+0.078
rous+0.075
idd+0.075
rou+0.072
iw+0.071
Neg logit contributions
 before-0.026
 grant-0.017
ers-0.017
 floor-0.017
idi-0.016
Token tr
Token ID491
Feature activation-7.17e-02

Pos logit contributions
und+0.045
idd+0.041
rous+0.041
rou+0.040
iw+0.039
Neg logit contributions
 before-0.023
 grant-0.019
idi-0.019
 had-0.018
ers-0.018
Tokenud
Token ID463
Feature activation-1.01e-02

Pos logit contributions
 Sorry+0.090
 Enjoy+0.086
haw+0.083
cient+0.078
iw+0.078
Neg logit contributions
ace-0.108
idi-0.107
piece-0.099
ers-0.098
 before-0.098
Tokenged
Token ID2004
Feature activation-2.37e-02

Pos logit contributions
und+0.015
rous+0.013
idd+0.013
rou+0.012
iw+0.012
Neg logit contributions
 before-0.018
 grant-0.017
idi-0.016
ers-0.016
ace-0.016
Token out
Token ID503
Feature activation+5.21e-03

Pos logit contributions
und+0.049
idd+0.046
unch+0.044
rou+0.044
rous+0.044
Neg logit contributions
 before-0.030
idi-0.025
ers-0.024
ace-0.024
 had-0.023
Token of
Token ID286
Feature activation-2.22e-02

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.005
ace+0.005
ers+0.005
Neg logit contributions
und-0.011
idd-0.010
rous-0.010
rou-0.010
iw-0.009
Token the
Token ID262
Feature activation-3.23e-03

Pos logit contributions
und+0.059
idd+0.056
rou+0.055
unch+0.055
rous+0.053
Neg logit contributions
 before-0.015
idi-0.010
 grant-0.009
 floor-0.009
ers-0.009
Token might
Token ID1244
Feature activation+1.56e-02

Pos logit contributions
 before+0.022
 grant+0.022
idi+0.021
ace+0.019
 floor+0.019
Neg logit contributions
und-0.043
idd-0.038
rou-0.038
rous-0.037
uck-0.036
Token be
Token ID307
Feature activation+8.91e-03

Pos logit contributions
 before+0.016
idi+0.015
ers+0.015
 grant+0.015
ace+0.014
Neg logit contributions
und-0.033
rou-0.029
rous-0.028
idd-0.028
unch-0.028
Token ruined
Token ID20484
Feature activation+2.08e-03

Pos logit contributions
 grant+0.012
 before+0.012
idi+0.011
ace+0.010
ers+0.010
Neg logit contributions
und-0.017
idd-0.014
rous-0.014
rou-0.014
iw-0.013
Token.
Token ID13
Feature activation-7.44e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.004
idd-0.004
rous-0.004
rou-0.004
iw-0.004
Token Daisy
Token ID40355
Feature activation+1.93e-02

Pos logit contributions
und+0.015
idd+0.014
rous+0.013
rou+0.013
iw+0.013
Neg logit contributions
 before-0.009
 grant-0.009
idi-0.008
ers-0.008
ace-0.008
Token tr
Token ID491
Feature activation-7.16e-02

Pos logit contributions
 before+0.022
 grant+0.021
ace+0.019
ers+0.019
idi+0.019
Neg logit contributions
und-0.039
idd-0.035
rou-0.034
rous-0.033
uck-0.033
Tokenud
Token ID463
Feature activation-1.60e-02

Pos logit contributions
 Sorry+0.089
 Enjoy+0.089
 Good+0.081
haw+0.077
cient+0.073
Neg logit contributions
ace-0.114
idi-0.105
ers-0.100
kin-0.098
 sides-0.097
Tokenged
Token ID2004
Feature activation-1.81e-02

Pos logit contributions
und+0.020
rous+0.018
idd+0.017
rou+0.017
iw+0.016
Neg logit contributions
 before-0.032
 grant-0.029
idi-0.029
ers-0.028
ace-0.028
Token along
Token ID1863
Feature activation+1.88e-02

Pos logit contributions
und+0.039
idd+0.037
rou+0.035
rous+0.035
unch+0.035
Neg logit contributions
 before-0.021
 grant-0.017
idi-0.017
ace-0.016
 had-0.016
Token,
Token ID11
Feature activation-1.43e-03

Pos logit contributions
 grant+0.018
 before+0.016
ace+0.016
idi+0.016
ers+0.016
Neg logit contributions
und-0.041
rous-0.036
idd-0.035
rou-0.035
 batch-0.035
Token carrying
Token ID6872
Feature activation-1.92e-03

Pos logit contributions
und+0.003
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.001
ers-0.001
ace-0.001
Token her
Token ID607
Feature activation+1.84e-02

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.009
iw+0.009
Neg logit contributions
 before-0.007
 grant-0.007
idi-0.007
ers-0.006
ace-0.006
Token sc
Token ID629
Feature activation-3.65e-02

Pos logit contributions
 before+0.023
ers+0.022
 grant+0.021
idi+0.021
ace+0.019
Neg logit contributions
und-0.035
idd-0.032
rous-0.031
rou-0.031
iw-0.030
Tokenooter
Token ID25141
Feature activation-8.18e-03

Pos logit contributions
und+0.065
rous+0.056
idd+0.054
 batch+0.052
ays+0.052
Neg logit contributions
 before-0.053
 grant-0.052
ace-0.051
 finish-0.048
 contain-0.046
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
und+0.017
idd+0.016
rous+0.016
rou+0.015
iw+0.015
Neg logit contributions
 before-0.009
 grant-0.009
idi-0.008
ers-0.008
ace-0.008
Token She
Token ID1375
Feature activation+1.10e-02

Pos logit contributions
 grant+0.033
 before+0.033
ers+0.031
idi+0.030
kin+0.030
Neg logit contributions
und-0.033
rous-0.028
idd-0.027
 batch-0.027
uck-0.026
Token tr
Token ID491
Feature activation-7.13e-02

Pos logit contributions
 before+0.014
 grant+0.013
idi+0.012
ace+0.012
ers+0.012
Neg logit contributions
und-0.022
idd-0.020
rou-0.019
rous-0.019
iw-0.018
Tokenund
Token ID917
Feature activation+2.78e-02

Pos logit contributions
 Sorry+0.077
 Enjoy+0.076
prise+0.069
iw+0.065
 Good+0.065
Neg logit contributions
ace-0.120
idi-0.118
ers-0.108
 sides-0.107
 had-0.105
Tokenled
Token ID992
Feature activation-2.09e-03

Pos logit contributions
 before+0.047
 grant+0.047
 finish+0.041
 entire+0.040
ace+0.040
Neg logit contributions
und-0.044
rous-0.037
idd-0.035
uck-0.034
ays-0.034
Token along
Token ID1863
Feature activation+5.88e-04

Pos logit contributions
und+0.004
idd+0.004
rous+0.004
rou+0.003
iw+0.003
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.002
ers-0.002
ace-0.002
Token the
Token ID262
Feature activation+3.49e-02

Pos logit contributions
 before+0.000
 grant+0.000
idi+0.000
ace+0.000
ers+0.000
Neg logit contributions
und-0.001
idd-0.001
rous-0.001
rou-0.001
iw-0.001
Token sidewalk
Token ID24058
Feature activation-1.48e-02

Pos logit contributions
 grant+0.043
 before+0.039
ace+0.038
idi+0.038
ers+0.037
Neg logit contributions
und-0.066
rou-0.060
rous-0.059
idd-0.057
 batch-0.057
Token?"
Token ID1701
Feature activation+1.55e-02

Pos logit contributions
 before+0.000
 grant+0.000
idi+0.000
ers+0.000
ace+0.000
Neg logit contributions
und-0.000
idd-0.000
rous-0.000
rou-0.000
iw-0.000
Token Tom
Token ID4186
Feature activation-3.95e-05

Pos logit contributions
 grant+0.021
 before+0.020
ace+0.019
ers+0.019
idi+0.018
Neg logit contributions
und-0.028
rous-0.025
idd-0.024
 batch-0.024
iw-0.023
Token asked
Token ID1965
Feature activation+2.94e-02

Pos logit contributions
und+0.000
idd+0.000
rous+0.000
rou+0.000
iw+0.000
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token the
Token ID262
Feature activation-2.16e-02

Pos logit contributions
 grant+0.039
 floor+0.034
 before+0.033
 ground+0.033
etta+0.033
Neg logit contributions
und-0.051
uck-0.046
idd-0.046
 batch-0.044
rous-0.044
Token mother
Token ID2802
Feature activation+3.15e-02

Pos logit contributions
und+0.048
idd+0.042
rous+0.041
rou+0.041
iw+0.040
Neg logit contributions
 before-0.025
 grant-0.022
idi-0.022
ace-0.021
 third-0.021
Token k
Token ID479
Feature activation-7.13e-02

Pos logit contributions
 grant+0.041
idi+0.036
 before+0.036
 floor+0.035
ers+0.035
Neg logit contributions
und-0.055
idd-0.052
rou-0.051
iw-0.050
uck-0.050
Tokenang
Token ID648
Feature activation-7.69e-03

Pos logit contributions
und+0.097
 Enjoy+0.088
 batch+0.087
aman+0.086
arre+0.084
Neg logit contributions
ace-0.105
The-0.103
 before-0.100
 grant-0.095
 corner-0.093
Tokenaroo
Token ID38049
Feature activation-4.52e-03

Pos logit contributions
und+0.015
idd+0.013
rous+0.013
rou+0.013
iw+0.013
Neg logit contributions
 before-0.010
 grant-0.009
idi-0.009
ers-0.009
ace-0.009
Token.
Token ID13
Feature activation+9.74e-03

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.009
iw+0.009
Neg logit contributions
 before-0.005
 grant-0.004
idi-0.004
ers-0.004
ace-0.004
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
 before+0.012
 grant+0.011
ers+0.011
ace+0.011
idi+0.010
Neg logit contributions
und-0.019
idd-0.017
rous-0.017
uck-0.016
iw-0.016
Token
Token ID198
Feature activation+1.01e-02

Pos logit contributions
 before+0.013
 grant+0.012
ace+0.011
 floor+0.011
ers+0.011
Neg logit contributions
und-0.021
idd-0.018
rous-0.018
uck-0.018
iw-0.017
Token
Token ID198
Feature activation+1.19e-02

Pos logit contributions
und+0.039
rous+0.036
idd+0.035
rou+0.034
iw+0.033
Neg logit contributions
 before-0.029
 grant-0.026
idi-0.025
 had-0.025
ers-0.024
Token
Token ID198
Feature activation-1.47e-03

Pos logit contributions
 before+0.015
 grant+0.014
ace+0.013
ers+0.013
idi+0.012
Neg logit contributions
und-0.024
idd-0.021
rous-0.020
unch-0.020
rou-0.020
TokenThe
Token ID464
Feature activation+9.56e-03

Pos logit contributions
und+0.003
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.001
ers-0.001
ace-0.001
Token little
Token ID1310
Feature activation-7.77e-03

Pos logit contributions
 before+0.008
 grant+0.007
ace+0.006
idi+0.006
ers+0.006
Neg logit contributions
und-0.022
idd-0.021
rou-0.020
rous-0.020
 batch-0.019
Token girl
Token ID2576
Feature activation-2.52e-03

Pos logit contributions
und+0.017
idd+0.015
rous+0.015
rou+0.015
unch+0.014
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.007
ers-0.007
ace-0.007
Token p
Token ID279
Feature activation-7.09e-02

Pos logit contributions
und+0.005
idd+0.004
rous+0.004
rou+0.004
iw+0.004
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Tokenouted
Token ID18534
Feature activation+1.32e-02

Pos logit contributions
 Enjoy+0.105
und+0.097
 Good+0.097
 Sorry+0.097
 sorts+0.094
Neg logit contributions
ers-0.099
ia-0.089
idi-0.083
ace-0.082
ail-0.082
Token and
Token ID290
Feature activation-3.13e-02

Pos logit contributions
 grant+0.015
 before+0.014
idi+0.013
ers+0.012
ace+0.011
Neg logit contributions
und-0.028
rous-0.024
rou-0.024
idd-0.024
uck-0.024
Token said
Token ID531
Feature activation-1.20e-02

Pos logit contributions
und+0.064
idd+0.060
rous+0.058
rou+0.057
iw+0.056
Neg logit contributions
 before-0.041
idi-0.036
 had-0.035
 grant-0.035
 finish-0.033
Token she
Token ID673
Feature activation-2.99e-02

Pos logit contributions
und+0.027
idd+0.025
rous+0.025
rou+0.024
iw+0.024
Neg logit contributions
 before-0.012
 grant-0.011
idi-0.010
ers-0.009
ace-0.009
Token wanted
Token ID2227
Feature activation+4.98e-03

Pos logit contributions
und+0.071
rous+0.066
idd+0.063
iw+0.063
unch+0.062
Neg logit contributions
 before-0.027
 had-0.026
idi-0.023
 grant-0.023
ers-0.021
Token the
Token ID262
Feature activation+2.28e-02

Pos logit contributions
und+0.024
idd+0.022
rous+0.022
rou+0.021
iw+0.021
Neg logit contributions
 before-0.014
 grant-0.013
ers-0.012
idi-0.012
ace-0.011
Token bird
Token ID6512
Feature activation-1.58e-02

Pos logit contributions
 before+0.029
 grant+0.028
idi+0.026
ace+0.025
ers+0.025
Neg logit contributions
und-0.043
idd-0.039
rous-0.038
rou-0.038
iw-0.037
Tokenc
Token ID66
Feature activation+1.13e-02

Pos logit contributions
und+0.033
idd+0.030
rous+0.030
rou+0.029
iw+0.028
Neg logit contributions
 before-0.018
 grant-0.017
idi-0.015
ers-0.015
 had-0.015
Tokenage
Token ID496
Feature activation-1.60e-03

Pos logit contributions
 before+0.012
 grant+0.011
idi+0.011
ers+0.009
ace+0.009
Neg logit contributions
und-0.025
idd-0.023
rou-0.022
rous-0.022
uck-0.021
Token on
Token ID319
Feature activation+4.08e-03

Pos logit contributions
und+0.003
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.001
ers-0.001
ace-0.001
Token the
Token ID262
Feature activation+4.79e-02

Pos logit contributions
 before+0.004
 grant+0.003
idi+0.003
ers+0.003
ace+0.003
Neg logit contributions
und-0.010
idd-0.009
rous-0.009
rou-0.008
iw-0.008
Token scale
Token ID5046
Feature activation-2.88e-02

Pos logit contributions
 before+0.055
idi+0.053
 grant+0.053
ace+0.050
ers+0.049
Neg logit contributions
und-0.087
idd-0.079
rou-0.079
 batch-0.077
rous-0.077
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
und+0.066
idd+0.062
rous+0.062
rou+0.060
iw+0.059
Neg logit contributions
 before-0.028
 grant-0.024
ers-0.021
 had-0.020
 floor-0.020
Token The
Token ID383
Feature activation+1.59e-02

Pos logit contributions
und+0.022
idd+0.020
rous+0.020
rou+0.019
iw+0.019
Neg logit contributions
 before-0.012
 grant-0.010
idi-0.010
ers-0.009
ace-0.009
Token scale
Token ID5046
Feature activation-1.71e-02

Pos logit contributions
 before+0.020
 grant+0.019
idi+0.018
ace+0.018
ers+0.017
Neg logit contributions
und-0.030
idd-0.028
rou-0.027
rous-0.026
iw-0.026
Token showed
Token ID3751
Feature activation-2.05e-02

Pos logit contributions
und+0.037
idd+0.033
rous+0.033
rou+0.032
iw+0.032
Neg logit contributions
 before-0.019
 grant-0.017
idi-0.016
ers-0.016
 had-0.015
Token They
Token ID1119
Feature activation-8.28e-03

Pos logit contributions
 before+0.008
 grant+0.008
ers+0.007
idi+0.007
ace+0.007
Neg logit contributions
und-0.010
idd-0.009
rous-0.009
rou-0.008
ays-0.008
Token took
Token ID1718
Feature activation+7.40e-03

Pos logit contributions
und+0.016
idd+0.015
rous+0.014
rou+0.014
iw+0.014
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.009
ers-0.009
ace-0.009
Token their
Token ID511
Feature activation+2.33e-02

Pos logit contributions
 before+0.010
 grant+0.010
idi+0.009
 floor+0.008
ers+0.008
Neg logit contributions
und-0.014
rous-0.012
idd-0.012
uck-0.012
ays-0.011
Token own
Token ID898
Feature activation+2.59e-02

Pos logit contributions
 grant+0.036
 before+0.034
ace+0.033
idi+0.032
ers+0.030
Neg logit contributions
rous-0.036
und-0.036
idd-0.034
rou-0.033
 batch-0.032
Token hats
Token ID23910
Feature activation+3.92e-03

Pos logit contributions
 grant+0.037
 before+0.036
idi+0.034
ace+0.033
etta+0.032
Neg logit contributions
und-0.042
rous-0.040
idd-0.039
rou-0.038
 batch-0.037
Token and
Token ID290
Feature activation+4.55e-02

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.003
ace+0.003
ers+0.003
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
iw-0.008
Token ran
Token ID4966
Feature activation+8.53e-03

Pos logit contributions
 grant+0.077
 before+0.073
idi+0.070
ace+0.070
 entire+0.070
Neg logit contributions
und-0.057
rous-0.052
idd-0.050
 batch-0.048
rou-0.047
Token outside
Token ID2354
Feature activation+3.62e-03

Pos logit contributions
 grant+0.012
 before+0.011
idi+0.010
ace+0.010
ers+0.010
Neg logit contributions
und-0.016
idd-0.014
rous-0.014
rou-0.013
uck-0.013
Token.
Token ID13
Feature activation+7.54e-03

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.004
ace+0.003
ers+0.003
Neg logit contributions
und-0.008
idd-0.007
rous-0.007
rou-0.007
iw-0.007
Token They
Token ID1119
Feature activation-9.85e-03

Pos logit contributions
 before+0.011
 grant+0.011
ers+0.010
idi+0.010
ace+0.009
Neg logit contributions
und-0.014
idd-0.012
rous-0.012
rou-0.011
ays-0.011
Token played
Token ID2826
Feature activation+7.21e-03

Pos logit contributions
und+0.021
idd+0.019
rous+0.019
rou+0.019
iw+0.018
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.009
ers-0.009
 had-0.009
Token.
Token ID13
Feature activation-4.26e-03

Pos logit contributions
und+0.047
rou+0.043
idd+0.043
rous+0.043
iw+0.042
Neg logit contributions
 before-0.023
 grant-0.019
ers-0.019
idi-0.018
 floor-0.017
Token She
Token ID1375
Feature activation-7.23e-03

Pos logit contributions
und+0.009
idd+0.008
rous+0.008
rou+0.007
iw+0.007
Neg logit contributions
 before-0.005
 grant-0.005
idi-0.005
ers-0.004
ace-0.004
Token would
Token ID561
Feature activation+4.42e-03

Pos logit contributions
und+0.015
idd+0.013
rous+0.013
rou+0.013
iw+0.013
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.007
ers-0.007
ace-0.007
Token slide
Token ID10649
Feature activation-9.65e-03

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.004
ers+0.004
ace+0.004
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
iw-0.008
Token down
Token ID866
Feature activation-1.27e-02

Pos logit contributions
und+0.020
idd+0.018
rous+0.018
rou+0.018
unch+0.017
Neg logit contributions
 before-0.012
 grant-0.010
idi-0.009
ers-0.009
ace-0.009
Token the
Token ID262
Feature activation+4.86e-02

Pos logit contributions
und+0.033
idd+0.031
rous+0.030
rou+0.030
unch+0.029
Neg logit contributions
 before-0.009
 grant-0.007
ers-0.006
idi-0.006
ace-0.005
Token slide
Token ID10649
Feature activation-2.05e-02

Pos logit contributions
 grant+0.079
 before+0.074
idi+0.071
ace+0.070
etta+0.064
Neg logit contributions
und-0.073
idd-0.064
 batch-0.063
iw-0.062
rous-0.062
Token and
Token ID290
Feature activation+1.83e-02

Pos logit contributions
und+0.048
idd+0.044
rous+0.044
rou+0.043
unch+0.042
Neg logit contributions
 before-0.020
 grant-0.016
ers-0.015
idi-0.014
 floor-0.014
Token swing
Token ID9628
Feature activation-1.44e-02

Pos logit contributions
 before+0.022
 grant+0.021
ers+0.020
idi+0.020
ace+0.020
Neg logit contributions
und-0.036
rous-0.032
idd-0.031
rou-0.030
iw-0.030
Token on
Token ID319
Feature activation-5.07e-04

Pos logit contributions
und+0.032
idd+0.029
rous+0.029
rou+0.028
iw+0.028
Neg logit contributions
 before-0.015
 grant-0.013
ers-0.012
idi-0.012
ace-0.012
Token the
Token ID262
Feature activation+3.70e-02

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token to
Token ID284
Feature activation+1.81e-02

Pos logit contributions
 grant+0.008
 before+0.007
idi+0.007
ace+0.006
ers+0.006
Neg logit contributions
und-0.013
idd-0.011
rou-0.011
rous-0.011
iw-0.011
Token ten
Token ID3478
Feature activation+2.20e-02

Pos logit contributions
 before+0.024
 grant+0.024
idi+0.023
ers+0.022
ace+0.021
Neg logit contributions
und-0.032
rous-0.028
idd-0.027
rou-0.027
iw-0.026
Token.
Token ID13
Feature activation+3.34e-03

Pos logit contributions
 grant+0.027
idi+0.024
 before+0.024
ace+0.023
etta+0.023
Neg logit contributions
und-0.044
idd-0.038
rous-0.038
iw-0.037
rou-0.037
Token He
Token ID679
Feature activation+1.37e-02

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.004
ers+0.004
ace+0.004
Neg logit contributions
und-0.007
idd-0.006
rous-0.006
rou-0.006
iw-0.006
Token couldn
Token ID3521
Feature activation+7.35e-03

Pos logit contributions
 before+0.017
 grant+0.017
ace+0.014
ers+0.014
idi+0.014
Neg logit contributions
und-0.027
idd-0.024
rou-0.023
rous-0.023
iw-0.022
Token't
Token ID470
Feature activation+4.50e-02

Pos logit contributions
 before+0.010
 grant+0.009
idi+0.009
ace+0.009
ers+0.009
Neg logit contributions
und-0.014
idd-0.012
rous-0.012
uck-0.012
rou-0.011
Token find
Token ID1064
Feature activation+1.82e-04

Pos logit contributions
ers+0.064
idi+0.061
 before+0.057
etta+0.055
stals+0.055
Neg logit contributions
und-0.065
prise-0.058
rous-0.056
idd-0.054
 Enjoy-0.054
Token anyone
Token ID2687
Feature activation+1.38e-02

Pos logit contributions
 before+0.000
 grant+0.000
idi+0.000
ers+0.000
ace+0.000
Neg logit contributions
und-0.000
idd-0.000
rous-0.000
rou-0.000
iw-0.000
Token!
Token ID0
Feature activation-3.22e-03

Pos logit contributions
 grant+0.018
 before+0.017
idi+0.017
ers+0.016
ace+0.016
Neg logit contributions
und-0.026
rous-0.023
idd-0.022
rou-0.022
iw-0.022
Token
Token ID198
Feature activation+4.54e-03

Pos logit contributions
und+0.006
idd+0.006
rous+0.006
rou+0.006
iw+0.005
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.004
ers-0.003
ace-0.003
Token
Token ID198
Feature activation+4.46e-03

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.004
ace+0.004
ers+0.004
Neg logit contributions
und-0.010
idd-0.008
rous-0.008
rou-0.008
unch-0.008
Token so
Token ID523
Feature activation-1.23e-02

Pos logit contributions
und+0.002
idd+0.002
rous+0.002
rou+0.002
iw+0.002
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token she
Token ID673
Feature activation+5.31e-03

Pos logit contributions
und+0.024
idd+0.022
rous+0.022
rou+0.021
iw+0.021
Neg logit contributions
 before-0.016
 grant-0.014
idi-0.014
ace-0.013
ers-0.013
Token sat
Token ID3332
Feature activation-2.63e-02

Pos logit contributions
 before+0.007
 grant+0.006
idi+0.006
ace+0.006
ers+0.006
Neg logit contributions
und-0.011
idd-0.009
rous-0.009
rou-0.009
iw-0.009
Token down
Token ID866
Feature activation-1.34e-02

Pos logit contributions
und+0.054
idd+0.049
rous+0.047
unch+0.047
 batch+0.047
Neg logit contributions
 before-0.035
ers-0.028
idi-0.027
 grant-0.026
 had-0.026
Token on
Token ID319
Feature activation-6.36e-03

Pos logit contributions
und+0.032
idd+0.029
rous+0.029
rou+0.028
unch+0.028
Neg logit contributions
 before-0.013
 grant-0.010
idi-0.009
ers-0.009
ace-0.008
Token the
Token ID262
Feature activation+4.54e-02

Pos logit contributions
und+0.015
idd+0.014
rous+0.014
rou+0.013
iw+0.013
Neg logit contributions
 before-0.006
 grant-0.005
idi-0.005
ers-0.004
ace-0.004
Token grass
Token ID8701
Feature activation-3.04e-02

Pos logit contributions
 grant+0.056
 before+0.055
idi+0.052
ace+0.050
etta+0.048
Neg logit contributions
und-0.074
 batch-0.072
rous-0.067
iw-0.064
idd-0.064
Token to
Token ID284
Feature activation+6.55e-03

Pos logit contributions
und+0.075
rous+0.072
idd+0.071
rou+0.068
prise+0.068
Neg logit contributions
 before-0.027
ers-0.018
 grant-0.017
idi-0.016
 floor-0.016
Token eat
Token ID4483
Feature activation-2.05e-02

Pos logit contributions
 before+0.008
 grant+0.007
ers+0.007
ace+0.007
idi+0.007
Neg logit contributions
und-0.013
rous-0.012
idd-0.011
iw-0.011
rou-0.011
Token her
Token ID607
Feature activation-8.92e-04

Pos logit contributions
und+0.048
idd+0.045
rous+0.043
rou+0.043
iw+0.041
Neg logit contributions
 before-0.021
 grant-0.017
ers-0.016
idi-0.015
 floor-0.014
Token snack
Token ID26906
Feature activation-8.97e-03

Pos logit contributions
und+0.002
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token can
Token ID460
Feature activation+5.54e-03

Pos logit contributions
und+0.005
idd+0.005
rous+0.005
rou+0.005
iw+0.004
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token play
Token ID711
Feature activation-2.23e-02

Pos logit contributions
 before+0.007
 grant+0.006
idi+0.006
ers+0.006
ace+0.006
Neg logit contributions
und-0.011
idd-0.010
rous-0.010
rou-0.010
uck-0.010
Token with
Token ID351
Feature activation-1.09e-02

Pos logit contributions
und+0.050
idd+0.046
rou+0.046
rous+0.045
rol+0.044
Neg logit contributions
 before-0.025
 grant-0.021
ers-0.020
idi-0.019
 finish-0.018
Token something
Token ID1223
Feature activation+1.31e-02

Pos logit contributions
und+0.022
idd+0.019
rous+0.019
rou+0.019
iw+0.018
Neg logit contributions
 before-0.014
 grant-0.013
idi-0.012
ers-0.012
 floor-0.012
Token else
Token ID2073
Feature activation-4.12e-03

Pos logit contributions
 grant+0.020
 before+0.019
idi+0.019
ace+0.019
ers+0.018
Neg logit contributions
und-0.021
rous-0.019
idd-0.018
rou-0.018
iw-0.017
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
und+0.009
idd+0.008
rous+0.008
rou+0.008
iw+0.008
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.004
ers-0.003
ace-0.003
Token How
Token ID1374
Feature activation+1.65e-03

Pos logit contributions
 before+0.028
ers+0.026
idi+0.026
 grant+0.025
 floor+0.025
Neg logit contributions
und-0.028
idd-0.025
 Enjoy-0.024
iw-0.024
rous-0.023
Token about
Token ID546
Feature activation-3.59e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.003
idd-0.003
rous-0.003
rou-0.003
iw-0.003
Token the
Token ID262
Feature activation+2.50e-03

Pos logit contributions
und+0.007
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.005
 grant-0.004
idi-0.004
ers-0.004
ace-0.004
Token blocks
Token ID7021
Feature activation+3.32e-03

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ace+0.003
ers+0.003
Neg logit contributions
und-0.005
idd-0.004
rous-0.004
rou-0.004
iw-0.004
Token?"
Token ID1701
Feature activation+9.10e-03

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.004
ace+0.004
ers+0.004
Neg logit contributions
und-0.006
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
 grant+0.006
 before+0.006
idi+0.006
ers+0.005
ace+0.005
Neg logit contributions
und-0.013
idd-0.011
rous-0.011
rou-0.011
iw-0.011
Token The
Token ID383
Feature activation+1.56e-02

Pos logit contributions
 before+0.017
 grant+0.016
idi+0.015
ers+0.015
ace+0.015
Neg logit contributions
und-0.027
idd-0.024
rous-0.023
rou-0.023
iw-0.023
Token end
Token ID886
Feature activation+1.59e-02

Pos logit contributions
 before+0.011
 grant+0.011
idi+0.010
ers+0.010
ace+0.010
Neg logit contributions
und-0.036
rous-0.036
rou-0.035
idd-0.035
iw-0.034
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
 grant+0.020
 before+0.019
idi+0.019
ace+0.018
ers+0.017
Neg logit contributions
und-0.030
rou-0.028
rous-0.027
idd-0.027
uck-0.026
Token<|endoftext|>
Token ID50256
Feature activation+2.83e-02

Pos logit contributions
 before+0.015
 grant+0.014
idi+0.013
ace+0.013
ers+0.013
Neg logit contributions
und-0.023
idd-0.020
rous-0.020
rou-0.020
iw-0.020
TokenOnce
Token ID7454
Feature activation+7.79e-03

Pos logit contributions
 before+0.039
 grant+0.039
ace+0.036
idi+0.036
 floor+0.033
Neg logit contributions
und-0.052
idd-0.045
unch-0.042
uck-0.042
rous-0.042
Token upon
Token ID2402
Feature activation+4.17e-03

Pos logit contributions
 before+0.012
 grant+0.012
idi+0.011
ers+0.011
ace+0.011
Neg logit contributions
und-0.013
idd-0.011
rous-0.011
rou-0.011
iw-0.010
Token a
Token ID257
Feature activation-1.93e-02

Pos logit contributions
 before+0.005
 grant+0.004
idi+0.004
ers+0.004
ace+0.004
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
unch-0.008
Token time
Token ID640
Feature activation-2.09e-02

Pos logit contributions
und+0.057
idd+0.054
rous+0.052
rou+0.052
iw+0.052
Neg logit contributions
 before-0.006
idi-0.003
 grant-0.003
ers-0.002
 had-0.002
Token,
Token ID11
Feature activation+1.04e-02

Pos logit contributions
und+0.051
idd+0.047
rous+0.047
rou+0.046
iw+0.045
Neg logit contributions
 before-0.018
 grant-0.014
 had-0.013
idi-0.013
ace-0.012
Token there
Token ID612
Feature activation-4.97e-03

Pos logit contributions
 before+0.018
 grant+0.018
idi+0.017
ers+0.017
ace+0.016
Neg logit contributions
und-0.015
idd-0.013
rous-0.013
rou-0.012
uck-0.012
Token Her
Token ID2332
Feature activation+5.62e-03

Pos logit contributions
 before+0.000
 grant+0.000
idi+0.000
ers+0.000
ace+0.000
Neg logit contributions
und-0.000
idd-0.000
rous-0.000
rou-0.000
iw-0.000
Token dad
Token ID9955
Feature activation-1.22e-02

Pos logit contributions
 before+0.007
 grant+0.007
ers+0.006
idi+0.006
ace+0.006
Neg logit contributions
und-0.011
idd-0.010
rous-0.010
rou-0.010
uck-0.009
Token had
Token ID550
Feature activation-2.76e-02

Pos logit contributions
und+0.025
rous+0.022
idd+0.022
rou+0.021
iw+0.021
Neg logit contributions
 before-0.015
 grant-0.014
idi-0.013
ers-0.013
 had-0.013
Token to
Token ID284
Feature activation-2.44e-04

Pos logit contributions
und+0.058
idd+0.052
rous+0.052
ays+0.050
iw+0.050
Neg logit contributions
 before-0.034
 grant-0.031
 had-0.030
idi-0.028
ers-0.028
Token use
Token ID779
Feature activation-1.43e-02

Pos logit contributions
und+0.001
idd+0.000
rous+0.000
rou+0.000
iw+0.000
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token his
Token ID465
Feature activation+7.46e-03

Pos logit contributions
und+0.036
idd+0.033
rous+0.032
rou+0.032
iw+0.032
Neg logit contributions
 before-0.012
 grant-0.009
idi-0.008
ers-0.008
ace-0.008
Token special
Token ID2041
Feature activation+4.79e-03

Pos logit contributions
 before+0.009
 grant+0.009
idi+0.008
ers+0.008
ace+0.008
Neg logit contributions
und-0.015
idd-0.013
rous-0.013
rou-0.013
iw-0.012
Token scissors
Token ID41786
Feature activation+2.59e-03

Pos logit contributions
 before+0.006
 grant+0.005
idi+0.005
ers+0.005
ace+0.005
Neg logit contributions
und-0.009
idd-0.009
rous-0.008
rou-0.008
iw-0.008
Token to
Token ID284
Feature activation-4.63e-03

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.006
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token cut
Token ID2005
Feature activation+7.62e-03

Pos logit contributions
und+0.011
idd+0.010
rous+0.010
rou+0.010
iw+0.009
Neg logit contributions
 before-0.004
 grant-0.004
idi-0.003
ers-0.003
ace-0.003
Token the
Token ID262
Feature activation+2.06e-02

Pos logit contributions
 before+0.008
 grant+0.008
idi+0.007
ace+0.007
ers+0.007
Neg logit contributions
und-0.016
idd-0.014
rous-0.014
rou-0.014
iw-0.014
Token the
Token ID262
Feature activation+8.17e-03

Pos logit contributions
und+0.000
idd+0.000
rous+0.000
rou+0.000
iw+0.000
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token journal
Token ID3989
Feature activation-1.39e-02

Pos logit contributions
 before+0.013
 grant+0.012
idi+0.011
ace+0.011
ers+0.011
Neg logit contributions
und-0.014
rous-0.013
idd-0.012
iw-0.012
rou-0.012
Token.
Token ID13
Feature activation-3.16e-04

Pos logit contributions
und+0.033
idd+0.030
rous+0.030
rou+0.029
iw+0.029
Neg logit contributions
 before-0.013
 grant-0.011
ers-0.010
idi-0.010
ace-0.010
Token They
Token ID1119
Feature activation+2.12e-02

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token show
Token ID905
Feature activation-1.58e-02

Pos logit contributions
 before+0.023
idi+0.022
ers+0.021
 grant+0.018
ace+0.018
Neg logit contributions
und-0.043
rous-0.039
rou-0.038
idd-0.038
iw-0.038
Token each
Token ID1123
Feature activation+2.43e-02

Pos logit contributions
und+0.033
idd+0.030
rou+0.029
iw+0.029
rous+0.029
Neg logit contributions
 before-0.020
 grant-0.018
idi-0.017
ers-0.017
ace-0.016
Token other
Token ID584
Feature activation+7.24e-03

Pos logit contributions
 grant+0.020
 before+0.019
idi+0.018
ers+0.018
etta+0.018
Neg logit contributions
und-0.051
rous-0.048
 batch-0.048
 sorts-0.047
idd-0.047
Token their
Token ID511
Feature activation+1.69e-03

Pos logit contributions
 before+0.009
 grant+0.009
idi+0.008
ace+0.008
ers+0.007
Neg logit contributions
und-0.014
rous-0.013
idd-0.013
rou-0.013
 batch-0.012
Token pictures
Token ID5986
Feature activation+1.92e-02

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.004
idd-0.003
rous-0.003
rou-0.003
iw-0.003
Token.
Token ID13
Feature activation+9.65e-03

Pos logit contributions
 grant+0.022
idi+0.022
 before+0.021
ace+0.019
ers+0.018
Neg logit contributions
und-0.038
idd-0.034
rous-0.033
rou-0.033
ays-0.033
Token They
Token ID1119
Feature activation+1.61e-02

Pos logit contributions
 before+0.014
 grant+0.013
ers+0.012
ace+0.012
idi+0.012
Neg logit contributions
und-0.017
idd-0.015
rous-0.015
 Enjoy-0.015
iw-0.015
Token.
Token ID13
Feature activation-6.93e-03

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.006
ace+0.005
ers+0.005
Neg logit contributions
und-0.013
idd-0.011
rous-0.011
rou-0.011
iw-0.011
Token It
Token ID632
Feature activation+1.22e-02

Pos logit contributions
und+0.015
idd+0.014
rous+0.013
rou+0.013
iw+0.013
Neg logit contributions
 before-0.008
 grant-0.007
idi-0.006
ers-0.006
ace-0.006
Token was
Token ID373
Feature activation+2.81e-02

Pos logit contributions
 before+0.014
 grant+0.013
ace+0.012
idi+0.012
ers+0.011
Neg logit contributions
und-0.025
idd-0.023
rous-0.022
rou-0.022
iw-0.021
Token next
Token ID1306
Feature activation+3.37e-03

Pos logit contributions
 grant+0.039
ace+0.035
 before+0.035
etta+0.034
idi+0.034
Neg logit contributions
und-0.048
rou-0.045
rous-0.044
idd-0.044
 batch-0.043
Token to
Token ID284
Feature activation-4.77e-04

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.004
ers+0.003
ace+0.003
Neg logit contributions
und-0.007
idd-0.006
rous-0.006
rou-0.006
iw-0.006
Token theirs
Token ID22021
Feature activation+3.01e-03

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token,
Token ID11
Feature activation+6.40e-03

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ace+0.003
ers+0.003
Neg logit contributions
und-0.007
idd-0.006
rous-0.006
rou-0.006
iw-0.006
Token but
Token ID475
Feature activation-8.68e-03

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.005
ace+0.005
ers+0.005
Neg logit contributions
und-0.014
idd-0.013
rous-0.013
rou-0.013
iw-0.012
Token it
Token ID340
Feature activation+1.05e-02

Pos logit contributions
und+0.018
idd+0.016
rous+0.016
rou+0.015
iw+0.015
Neg logit contributions
 before-0.011
 grant-0.010
idi-0.009
ers-0.009
ace-0.009
Token was
Token ID373
Feature activation+2.40e-02

Pos logit contributions
 before+0.011
 grant+0.011
ace+0.010
idi+0.010
ers+0.009
Neg logit contributions
und-0.022
idd-0.020
rou-0.020
rous-0.019
iw-0.019
Token bigger
Token ID5749
Feature activation-4.25e-03

Pos logit contributions
 grant+0.038
ace+0.034
idi+0.033
ers+0.033
 before+0.033
Neg logit contributions
und-0.036
rou-0.035
rous-0.034
idd-0.033
iw-0.032
Token maybe
Token ID3863
Feature activation+6.35e-03

Pos logit contributions
 before+0.001
 grant+0.001
idi+0.001
ers+0.001
ace+0.001
Neg logit contributions
und-0.001
idd-0.001
rous-0.001
rou-0.001
iw-0.001
Token it
Token ID340
Feature activation-6.87e-03

Pos logit contributions
 before+0.010
 grant+0.009
idi+0.009
ers+0.009
ace+0.009
Neg logit contributions
und-0.010
idd-0.009
rous-0.009
rou-0.009
iw-0.009
Token is
Token ID318
Feature activation+1.36e-02

Pos logit contributions
und+0.014
idd+0.012
rous+0.012
rou+0.012
iw+0.011
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.008
ers-0.007
ace-0.007
Token looking
Token ID2045
Feature activation-7.29e-03

Pos logit contributions
 before+0.017
 grant+0.016
idi+0.016
ers+0.015
ace+0.015
Neg logit contributions
und-0.025
rous-0.023
idd-0.023
rou-0.022
iw-0.021
Token for
Token ID329
Feature activation+2.54e-03

Pos logit contributions
und+0.014
idd+0.013
rous+0.013
rou+0.012
iw+0.012
Neg logit contributions
 before-0.010
 grant-0.009
idi-0.008
ers-0.008
ace-0.008
Token food
Token ID2057
Feature activation-5.00e-04

Pos logit contributions
 before+0.003
 grant+0.003
idi+0.003
ers+0.003
ace+0.003
Neg logit contributions
und-0.005
idd-0.004
rous-0.004
rou-0.004
iw-0.004
Token,"
Token ID553
Feature activation+3.61e-03

Pos logit contributions
und+0.001
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token mom
Token ID1995
Feature activation-1.03e-02

Pos logit contributions
 before+0.004
 grant+0.003
ers+0.003
idi+0.003
ace+0.003
Neg logit contributions
und-0.008
idd-0.007
rous-0.007
rou-0.007
iw-0.007
Token said
Token ID531
Feature activation-1.61e-03

Pos logit contributions
und+0.018
idd+0.016
rous+0.016
rou+0.015
iw+0.015
Neg logit contributions
 before-0.016
 grant-0.015
idi-0.014
ers-0.014
ace-0.013
Token.
Token ID13
Feature activation-3.24e-03

Pos logit contributions
und+0.004
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.002
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token
Token ID198
Feature activation-5.17e-03

Pos logit contributions
und+0.007
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.004
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token little
Token ID1310
Feature activation-1.87e-02

Pos logit contributions
 before+0.037
 grant+0.037
ers+0.033
idi+0.032
piece+0.032
Neg logit contributions
und-0.059
idd-0.057
rou-0.054
iw-0.053
 batch-0.053
Token girl
Token ID2576
Feature activation+8.73e-03

Pos logit contributions
und+0.042
rous+0.037
idd+0.037
rou+0.036
rol+0.036
Neg logit contributions
 before-0.020
 grant-0.017
idi-0.016
 had-0.015
 floor-0.015
Token.
Token ID13
Feature activation+3.74e-03

Pos logit contributions
 before+0.014
 grant+0.014
idi+0.014
ers+0.013
ace+0.013
Neg logit contributions
und-0.013
idd-0.012
rous-0.011
rou-0.011
iw-0.011
Token Her
Token ID2332
Feature activation-3.48e-03

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.006
ers+0.006
ace+0.006
Neg logit contributions
und-0.006
idd-0.005
rous-0.005
iw-0.005
rou-0.005
Token name
Token ID1438
Feature activation+1.73e-02

Pos logit contributions
und+0.008
idd+0.007
rous+0.007
rou+0.007
iw+0.007
Neg logit contributions
 before-0.004
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token was
Token ID373
Feature activation-1.27e-04

Pos logit contributions
 before+0.021
 grant+0.020
ace+0.019
idi+0.018
 finish+0.017
Neg logit contributions
und-0.035
idd-0.030
rous-0.029
uck-0.029
 batch-0.029
Token Daisy
Token ID40355
Feature activation+3.52e-02

Pos logit contributions
und+0.000
idd+0.000
rous+0.000
rou+0.000
iw+0.000
Neg logit contributions
 before-0.000
 grant-0.000
idi-0.000
ers-0.000
ace-0.000
Token.
Token ID13
Feature activation-6.70e-03

Pos logit contributions
 grant+0.051
idi+0.047
 before+0.044
ers+0.043
 contain+0.043
Neg logit contributions
und-0.055
rou-0.054
 batch-0.051
iw-0.049
idd-0.048
Token One
Token ID1881
Feature activation+1.20e-02

Pos logit contributions
und+0.012
idd+0.011
rous+0.010
rou+0.010
iw+0.010
Neg logit contributions
 before-0.010
 grant-0.009
idi-0.009
ace-0.009
ers-0.009
Token day
Token ID1110
Feature activation+6.58e-03

Pos logit contributions
 grant+0.016
 before+0.015
ers+0.014
idi+0.014
ace+0.014
Neg logit contributions
und-0.022
rous-0.020
idd-0.020
uck-0.019
rou-0.019
Token Daisy
Token ID40355
Feature activation-1.09e-03

Pos logit contributions
 before+0.008
 grant+0.007
idi+0.007
ers+0.007
ace+0.007
Neg logit contributions
und-0.013
rous-0.012
idd-0.012
uck-0.011
iw-0.011
Token "
Token ID366
Feature activation-2.70e-02

Pos logit contributions
und+0.007
idd+0.007
rous+0.007
rou+0.007
iw+0.006
Neg logit contributions
 before-0.004
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
TokenNo
Token ID2949
Feature activation-1.42e-02

Pos logit contributions
und+0.050
idd+0.044
rou+0.042
ays+0.042
rous+0.042
Neg logit contributions
 before-0.040
 grant-0.038
The-0.035
idi-0.034
 had-0.033
Token,
Token ID11
Feature activation-8.00e-03

Pos logit contributions
und+0.033
idd+0.031
rous+0.031
rou+0.030
iw+0.029
Neg logit contributions
 before-0.015
 grant-0.012
idi-0.011
ace-0.011
ers-0.011
Token I
Token ID314
Feature activation-2.07e-02

Pos logit contributions
und+0.016
idd+0.014
rous+0.014
rou+0.014
iw+0.014
Neg logit contributions
 before-0.010
 grant-0.010
idi-0.009
ers-0.008
ace-0.008
Token haven
Token ID4398
Feature activation+1.01e-02

Pos logit contributions
und+0.040
rous+0.036
idd+0.036
ays+0.034
 batch+0.034
Neg logit contributions
 before-0.029
 grant-0.027
 had-0.026
idi-0.024
 finish-0.023
Token't
Token ID470
Feature activation+2.47e-05

Pos logit contributions
 before+0.014
idi+0.013
 grant+0.013
 floor+0.012
ers+0.012
Neg logit contributions
und-0.018
idd-0.017
uck-0.015
rou-0.015
rous-0.015
Token seen
Token ID1775
Feature activation-1.20e-03

Pos logit contributions
 before+0.000
 grant+0.000
idi+0.000
ers+0.000
ace+0.000
Neg logit contributions
und-0.000
idd-0.000
rous-0.000
rou-0.000
iw-0.000
Token it
Token ID340
Feature activation-7.62e-03

Pos logit contributions
und+0.002
idd+0.002
rous+0.002
rou+0.002
iw+0.002
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.001
ers-0.001
ace-0.001
Token.
Token ID13
Feature activation-4.22e-03

Pos logit contributions
und+0.016
idd+0.015
rous+0.015
rou+0.014
iw+0.014
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.007
ers-0.007
ace-0.007
Token But
Token ID887
Feature activation-2.74e-03

Pos logit contributions
und+0.008
idd+0.007
rous+0.007
rou+0.007
iw+0.007
Neg logit contributions
 before-0.006
 grant-0.006
idi-0.005
ers-0.005
ace-0.005
Token I
Token ID314
Feature activation-1.55e-02

Pos logit contributions
und+0.005
idd+0.005
rous+0.005
rou+0.004
iw+0.004
Neg logit contributions
 before-0.004
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token longer
Token ID2392
Feature activation+1.62e-02

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.005
ers+0.005
ace+0.005
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
iw-0.008
Token bitter
Token ID12922
Feature activation+4.69e-03

Pos logit contributions
 before+0.020
 grant+0.019
idi+0.018
ers+0.017
ace+0.017
Neg logit contributions
und-0.032
idd-0.028
rous-0.028
rou-0.028
iw-0.027
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
 before+0.005
 grant+0.005
idi+0.005
ace+0.005
ers+0.005
Neg logit contributions
und-0.010
idd-0.009
rous-0.009
rou-0.008
iw-0.008
Token but
Token ID475
Feature activation-2.17e-03

Pos logit contributions
 before+0.015
 grant+0.015
idi+0.014
ers+0.014
ace+0.014
Neg logit contributions
und-0.028
idd-0.025
rous-0.025
rou-0.024
iw-0.024
Token sweet
Token ID6029
Feature activation-2.96e-03

Pos logit contributions
und+0.004
idd+0.004
rous+0.004
rou+0.004
iw+0.004
Neg logit contributions
 before-0.003
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token like
Token ID588
Feature activation-1.07e-02

Pos logit contributions
und+0.004
idd+0.004
rous+0.004
rou+0.004
iw+0.003
Neg logit contributions
 before-0.005
 grant-0.005
idi-0.005
ers-0.005
ace-0.005
Token candy
Token ID18550
Feature activation-1.08e-02

Pos logit contributions
und+0.022
idd+0.020
rous+0.019
rou+0.019
iw+0.019
Neg logit contributions
 before-0.013
 grant-0.012
idi-0.011
ers-0.011
ace-0.011
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
und+0.024
idd+0.022
rous+0.021
rou+0.021
iw+0.020
Neg logit contributions
 before-0.012
 grant-0.010
ers-0.010
idi-0.010
ace-0.009
Token She
Token ID1375
Feature activation-8.05e-04

Pos logit contributions
und+0.028
idd+0.026
rous+0.026
rou+0.025
iw+0.024
Neg logit contributions
 before-0.017
 grant-0.016
idi-0.015
ace-0.014
ers-0.014
Token took
Token ID1718
Feature activation-6.58e-03

Pos logit contributions
und+0.002
idd+0.001
rous+0.001
rou+0.001
iw+0.001
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token a
Token ID257
Feature activation+1.09e-03

Pos logit contributions
und+0.014
idd+0.013
rous+0.013
rou+0.013
iw+0.012
Neg logit contributions
 before-0.007
 grant-0.006
idi-0.006
ers-0.006
ace-0.006
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
 before+0.009
 grant+0.009
idi+0.009
ace+0.008
ers+0.008
Neg logit contributions
und-0.018
idd-0.016
rou-0.016
rous-0.016
 batch-0.016
Token She
Token ID1375
Feature activation+1.10e-02

Pos logit contributions
und+0.028
idd+0.026
rous+0.025
rou+0.025
 batch+0.024
Neg logit contributions
 before-0.018
 grant-0.016
idi-0.015
ace-0.015
 had-0.014
Token wished
Token ID16555
Feature activation+1.93e-02

Pos logit contributions
 before+0.020
 grant+0.019
ers+0.018
ace+0.018
idi+0.018
Neg logit contributions
und-0.015
idd-0.013
rou-0.013
rous-0.012
iw-0.012
Token she
Token ID673
Feature activation+2.46e-02

Pos logit contributions
idi+0.031
 grant+0.030
ers+0.030
ace+0.029
 entire+0.029
Neg logit contributions
und-0.027
rou-0.025
iw-0.023
idd-0.023
unch-0.023
Token had
Token ID550
Feature activation+1.03e-02

Pos logit contributions
 grant+0.020
 before+0.020
 sides+0.018
idi+0.017
ace+0.017
Neg logit contributions
und-0.053
idd-0.050
rou-0.049
rous-0.047
uck-0.046
Token not
Token ID407
Feature activation-4.95e-03

Pos logit contributions
 before+0.012
 grant+0.012
idi+0.011
ace+0.011
ers+0.011
Neg logit contributions
und-0.020
rous-0.018
idd-0.018
rou-0.018
uck-0.017
Token played
Token ID2826
Feature activation+8.21e-03

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.008
iw+0.008
Neg logit contributions
 before-0.006
 grant-0.006
idi-0.006
ers-0.005
ace-0.005
Token with
Token ID351
Feature activation+1.22e-03

Pos logit contributions
 before+0.010
 grant+0.010
idi+0.009
ace+0.009
ers+0.009
Neg logit contributions
und-0.016
idd-0.014
rous-0.014
rou-0.014
uck-0.013
Token the
Token ID262
Feature activation+5.00e-03

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.001
ers+0.001
ace+0.001
Neg logit contributions
und-0.002
idd-0.002
rous-0.002
rou-0.002
iw-0.002
Token ball
Token ID2613
Feature activation-1.18e-02

Pos logit contributions
 before+0.007
 grant+0.007
idi+0.006
ace+0.006
ers+0.006
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
iw-0.008
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
und+0.027
idd+0.025
rous+0.024
rou+0.024
iw+0.023
Neg logit contributions
 before-0.013
 grant-0.011
ers-0.010
idi-0.010
ace-0.009
Token Fred
Token ID8559
Feature activation+1.05e-02

Pos logit contributions
 before+0.010
 grant+0.009
idi+0.009
ers+0.009
ace+0.008
Neg logit contributions
und-0.011
idd-0.010
rous-0.010
rou-0.009
iw-0.009
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
 before+0.012
 grant+0.012
idi+0.011
 floor+0.010
ers+0.010
Neg logit contributions
und-0.022
idd-0.019
rou-0.019
rous-0.019
iw-0.018
Token He
Token ID679
Feature activation+2.15e-02

Pos logit contributions
und+0.030
idd+0.028
rou+0.027
rous+0.027
iw+0.026
Neg logit contributions
 before-0.017
 grant-0.015
idi-0.015
ers-0.014
 had-0.014
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
 before+0.031
 grant+0.029
ace+0.028
 sides+0.026
idi+0.026
Neg logit contributions
und-0.038
idd-0.033
rou-0.032
rous-0.031
iw-0.031
Token very
Token ID845
Feature activation-1.05e-02

Pos logit contributions
 before+0.019
 grant+0.018
ers+0.017
idi+0.017
 floor+0.016
Neg logit contributions
und-0.024
idd-0.022
rous-0.021
rou-0.021
iw-0.020
Token boun
Token ID31283
Feature activation-6.17e-02

Pos logit contributions
und+0.021
idd+0.019
rous+0.018
rou+0.018
iw+0.018
Neg logit contributions
 before-0.013
 grant-0.012
idi-0.012
ace-0.011
ers-0.011
Tokency
Token ID948
Feature activation+3.92e-03

Pos logit contributions
aman+0.100
und+0.100
unch+0.096
 batch+0.094
idd+0.090
Neg logit contributions
 before-0.079
ace-0.075
 had-0.072
 entire-0.071
ers-0.071
Token and
Token ID290
Feature activation-9.11e-04

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.003
ers+0.003
ace+0.003
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
iw-0.008
Token loved
Token ID6151
Feature activation-2.35e-02

Pos logit contributions
und+0.002
idd+0.002
rous+0.002
rou+0.002
iw+0.002
Neg logit contributions
 before-0.001
 grant-0.001
idi-0.001
ers-0.001
ace-0.001
Token to
Token ID284
Feature activation-1.52e-03

Pos logit contributions
und+0.057
idd+0.053
rous+0.052
ays+0.050
aman+0.050
Neg logit contributions
 before-0.023
 had-0.016
 grant-0.016
idi-0.015
ers-0.015
Token hop
Token ID1725
Feature activation+1.21e-03

Pos logit contributions
und+0.003
idd+0.003
rous+0.003
rou+0.003
iw+0.003
Neg logit contributions
 before-0.002
 grant-0.002
idi-0.002
ers-0.002
ace-0.002
Token she
Token ID673
Feature activation+2.20e-02

Pos logit contributions
 before+0.014
 grant+0.014
ace+0.013
idi+0.013
ers+0.012
Neg logit contributions
und-0.015
rous-0.013
idd-0.013
iw-0.012
rou-0.012
Token went
Token ID1816
Feature activation+1.76e-02

Pos logit contributions
 before+0.032
 grant+0.029
ace+0.028
ers+0.027
etta+0.026
Neg logit contributions
und-0.039
rous-0.035
idd-0.034
iw-0.032
uck-0.032
Token,
Token ID11
Feature activation+4.28e-03

Pos logit contributions
 grant+0.024
 before+0.024
idi+0.023
ace+0.022
 floor+0.022
Neg logit contributions
und-0.032
uck-0.028
idd-0.028
rous-0.028
iw-0.027
Token she
Token ID673
Feature activation-6.31e-03

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.006
ers+0.006
ace+0.006
Neg logit contributions
und-0.007
rous-0.007
idd-0.007
iw-0.006
rou-0.006
Token heard
Token ID2982
Feature activation-1.47e-02

Pos logit contributions
und+0.013
idd+0.012
rous+0.011
rou+0.011
iw+0.011
Neg logit contributions
 before-0.008
 grant-0.007
idi-0.007
ers-0.006
ace-0.006
Token of
Token ID286
Feature activation-5.15e-02

Pos logit contributions
und+0.033
idd+0.031
rous+0.030
rou+0.029
iw+0.029
Neg logit contributions
 before-0.015
 grant-0.013
idi-0.012
ers-0.012
 floor-0.011
Token a
Token ID257
Feature activation-2.66e-02

Pos logit contributions
und+0.131
idd+0.126
rou+0.119
rol+0.119
uck+0.117
Neg logit contributions
 before-0.044
 other-0.037
 the-0.037
 grant-0.033
 floor-0.033
Token lovely
Token ID14081
Feature activation-1.35e-02

Pos logit contributions
und+0.064
idd+0.057
ays+0.057
rol+0.056
rou+0.055
Neg logit contributions
 before-0.025
 grant-0.022
 floor-0.020
idi-0.020
 finish-0.020
Token land
Token ID1956
Feature activation+3.85e-04

Pos logit contributions
und+0.028
idd+0.026
rou+0.025
rous+0.025
iw+0.024
Neg logit contributions
 before-0.016
 grant-0.015
idi-0.014
 floor-0.013
ers-0.013
Token called
Token ID1444
Feature activation-1.08e-02

Pos logit contributions
 before+0.000
 grant+0.000
idi+0.000
ace+0.000
ers+0.000
Neg logit contributions
und-0.001
idd-0.001
rous-0.001
rou-0.001
iw-0.001
Token Slip
Token ID49988
Feature activation-5.06e-03

Pos logit contributions
und+0.022
idd+0.020
rous+0.020
rou+0.019
iw+0.019
Neg logit contributions
 before-0.013
 grant-0.012
idi-0.011
ers-0.011
ace-0.011
Token of
Token ID286
Feature activation+9.71e-03

Pos logit contributions
 grant+0.010
 before+0.010
ers+0.009
ace+0.009
idi+0.008
Neg logit contributions
und-0.017
rous-0.016
idd-0.016
rou-0.015
 batch-0.015
Token the
Token ID262
Feature activation+1.79e-03

Pos logit contributions
 before+0.010
 grant+0.010
ace+0.009
ers+0.008
idi+0.008
Neg logit contributions
und-0.021
rous-0.019
idd-0.019
rou-0.018
 batch-0.018
Token younger
Token ID7099
Feature activation+1.74e-02

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ers+0.002
ace+0.002
Neg logit contributions
und-0.004
idd-0.003
rous-0.003
rou-0.003
iw-0.003
Token children
Token ID1751
Feature activation-1.89e-02

Pos logit contributions
 grant+0.025
 before+0.025
idi+0.022
ers+0.022
 finish+0.021
Neg logit contributions
und-0.030
idd-0.028
rous-0.026
 batch-0.025
rou-0.025
Token,
Token ID11
Feature activation-3.13e-03

Pos logit contributions
und+0.039
rous+0.035
idd+0.035
rou+0.035
iw+0.034
Neg logit contributions
 before-0.023
 grant-0.020
idi-0.019
 had-0.019
ers-0.019
Token who
Token ID508
Feature activation-4.25e-02

Pos logit contributions
und+0.007
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.004
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token was
Token ID373
Feature activation+2.94e-03

Pos logit contributions
und+0.103
rous+0.098
unch+0.098
rou+0.098
iw+0.097
Neg logit contributions
 had-0.034
 before-0.031
idi-0.026
 grant-0.026
ers-0.023
Token only
Token ID691
Feature activation+8.39e-03

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.003
ace+0.003
ers+0.003
Neg logit contributions
und-0.006
idd-0.005
rous-0.005
rou-0.005
iw-0.005
Token three
Token ID1115
Feature activation-2.81e-03

Pos logit contributions
 before+0.009
 grant+0.009
idi+0.008
ers+0.008
ace+0.008
Neg logit contributions
und-0.017
idd-0.016
rou-0.016
rous-0.016
iw-0.015
Token years
Token ID812
Feature activation+1.65e-02

Pos logit contributions
und+0.006
idd+0.005
rous+0.005
rou+0.005
iw+0.005
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token old
Token ID1468
Feature activation+1.52e-02

Pos logit contributions
 grant+0.014
 before+0.013
idi+0.012
ace+0.011
ers+0.011
Neg logit contributions
und-0.038
idd-0.036
rou-0.036
rous-0.036
 batch-0.034
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
 before+0.008
 grant+0.007
idi+0.007
ace+0.007
ers+0.006
Neg logit contributions
und-0.016
idd-0.014
rou-0.014
rous-0.014
ays-0.013
Token L
Token ID406
Feature activation+1.33e-03

Pos logit contributions
 before+0.013
 grant+0.012
idi+0.012
ers+0.012
ace+0.012
Neg logit contributions
und-0.019
idd-0.017
rou-0.016
rous-0.016
ays-0.016
Tokenila
Token ID10102
Feature activation+2.06e-02

Pos logit contributions
 before+0.002
 grant+0.002
idi+0.002
ace+0.001
ers+0.001
Neg logit contributions
und-0.003
idd-0.002
rous-0.002
rou-0.002
iw-0.002
Token and
Token ID290
Feature activation-6.07e-03

Pos logit contributions
 grant+0.022
 before+0.022
idi+0.021
ace+0.020
ers+0.019
Neg logit contributions
und-0.043
idd-0.039
rou-0.038
rous-0.036
uck-0.036
Token Max
Token ID5436
Feature activation+5.07e-03

Pos logit contributions
und+0.012
idd+0.011
rous+0.011
rou+0.010
iw+0.010
Neg logit contributions
 before-0.008
 grant-0.007
idi-0.007
ers-0.007
ace-0.007
Token cl
Token ID537
Feature activation-5.27e-02

Pos logit contributions
 before+0.007
 grant+0.007
idi+0.006
ers+0.006
ace+0.006
Neg logit contributions
und-0.009
idd-0.008
rous-0.008
rou-0.008
iw-0.008
Tokenapped
Token ID6320
Feature activation+1.03e-02

Pos logit contributions
aman+0.079
 batch+0.078
rous+0.078
iw+0.077
und+0.076
Neg logit contributions
ace-0.078
 had-0.068
 grant-0.066
piece-0.066
ers-0.065
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
 grant+0.012
 before+0.012
idi+0.012
ers+0.011
ace+0.011
Neg logit contributions
und-0.020
rou-0.018
rous-0.018
idd-0.017
uck-0.016
Token laughed
Token ID13818
Feature activation-2.85e-03

Pos logit contributions
 before+0.018
 grant+0.018
idi+0.017
ace+0.017
ers+0.017
Neg logit contributions
und-0.017
idd-0.015
rous-0.014
rou-0.014
iw-0.013
Token.
Token ID13
Feature activation+2.75e-02

Pos logit contributions
und+0.006
idd+0.006
rous+0.006
rou+0.006
iw+0.006
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.002
ers-0.002
ace-0.002
Token They
Token ID1119
Feature activation+5.07e-03

Pos logit contributions
 before+0.044
 grant+0.044
ace+0.042
ers+0.042
isy+0.042
Neg logit contributions
und-0.043
idd-0.038
ays-0.035
ucks-0.035
uck-0.034
Token Lucy
Token ID22162
Feature activation+2.42e-02

Pos logit contributions
 before+0.006
 grant+0.006
idi+0.005
ace+0.005
ers+0.005
Neg logit contributions
und-0.009
rous-0.008
idd-0.008
rou-0.008
iw-0.008
Token a
Token ID257
Feature activation-1.89e-02

Pos logit contributions
 grant+0.031
 before+0.030
idi+0.029
ace+0.028
 floor+0.028
Neg logit contributions
und-0.042
rous-0.039
rou-0.038
 batch-0.038
uck-0.038
Token lime
Token ID28738
Feature activation-4.83e-03

Pos logit contributions
und+0.041
idd+0.037
rous+0.035
ays+0.035
rou+0.034
Neg logit contributions
 before-0.022
 grant-0.021
idi-0.019
 finish-0.017
 had-0.017
Token and
Token ID290
Feature activation-2.40e-03

Pos logit contributions
und+0.010
idd+0.009
rous+0.009
rou+0.009
iw+0.009
Neg logit contributions
 before-0.005
 grant-0.005
idi-0.005
ers-0.004
ace-0.004
Token a
Token ID257
Feature activation-1.82e-02

Pos logit contributions
und+0.005
idd+0.004
rous+0.004
rou+0.004
iw+0.004
Neg logit contributions
 before-0.003
 grant-0.003
idi-0.003
ers-0.003
ace-0.003
Token big
Token ID1263
Feature activation-4.68e-02

Pos logit contributions
und+0.039
idd+0.035
rous+0.034
ays+0.033
rou+0.033
Neg logit contributions
 before-0.021
 grant-0.020
idi-0.018
 had-0.017
 finish-0.017
Token hug
Token ID16225
Feature activation-9.07e-03

Pos logit contributions
und+0.123
rol+0.111
idd+0.109
ays+0.108
rous+0.108
Neg logit contributions
 before-0.036
 grant-0.033
 finish-0.029
 be-0.028
 had-0.028
Token.
Token ID13
Feature activation+3.18e-03

Pos logit contributions
und+0.021
idd+0.019
rous+0.019
rou+0.019
iw+0.018
Neg logit contributions
 before-0.009
 grant-0.008
idi-0.007
ers-0.007
ace-0.007
Token
Token ID198
Feature activation+1.30e-02

Pos logit contributions
 before+0.004
 grant+0.004
idi+0.003
ace+0.003
ers+0.003
Neg logit contributions
und-0.006
idd-0.006
rous-0.005
rou-0.005
iw-0.005
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
 before+0.016
 grant+0.016
ace+0.014
ers+0.014
idi+0.014
Neg logit contributions
und-0.026
idd-0.022
rous-0.022
unch-0.021
uck-0.021
TokenLu
Token ID25596
Feature activation-1.94e-02

Pos logit contributions
 before+0.014
 grant+0.014
ace+0.012
ers+0.012
idi+0.012
Neg logit contributions
und-0.020
idd-0.017
rous-0.017
 batch-0.016
rou-0.016