TOP ACTIVATIONS
MAX = 0.062

Lily ? Where is the o asis ? I made a
" This is Mr . Tw ig ," said Ben proudly
park . They see many pige ons on the grass .
, " Thank you , Rib bit . You are very
day , they saw a kay ak on the shore .
listened to Lily . The o asis was important ."
You can play with the kay ak first and then I
her a box of ch ocol ates . Lily opened the
fox said : " This o asis really is an amazing
should not have fed the ge ese . They are wild
" Hello , caul iflower . I am Lily
, " That 's a kay ak . It 's used
here . This is my o asis . Go away !"
. I live in the re eds . I would love
a big box of ch ocol ates !
find a box of ch ocol ates . "
to a part of the o asis where there were no
L ily , these ch ocol ates are bitter ," he
He tried to find the o asis , but he could
, " This is an o asis . It is a

MIN ACTIVATIONS
MIN = -0.055

But Tom was not as tall as Lily , and he
anyway . As night came , Tim went home
and it was just as tall . The little
chickens very safe . As night began to fall , she
end in sight . As night fell , he started to
explore ! As evening came , Amy entered the
her hands and stretched as tall as she could . Somehow
Soon , it was as tall and wide as a big
to an end . As night started to fall , the
enough to make him as tall as Maria . He chose
an adventure anyway . As night fall came , he was
would take out a j igsaw puzzle and try to solve
from the rain . When morning came , Sarah looked in
time until it was as tall as they were ! They
and watched as the sun lit up the sky . The
a stack of blocks as tall as her . She was
this crib is just as wide as my old one ,"
here ." As Farmer Brown left , Jack looked
in the evening . As night time came , the sun
was fast asleep . As morning came , the little girl

INTERVAL 0.033 - 0.062
CONTAINS 0.101%

The crab asked the chicken , ‘
ear . It is a fle a . A fle a
" That is a re ind eer , Tim .
on the door . He says : " Open the door
say she is a perfect chicken . One day

INTERVAL 0.003 - 0.033
CONTAINS 35.251%

with dolls is silly . It 's fun and creative .
, can you say ' please ' and ' thank you
was very pleased . She said , " Thank you for
park . They were so excited ! On their way ,
saw a pond in the grass . It looked so cool

INTERVAL -0.026 - 0.003
CONTAINS 64.083%

Daddy was proud . They went to the park
! He drove as fast as he could and soon he
children looked around in wonder . The theater was so big
" No , it is mine , I saw it first
to his dad , who fixed it up good as new

INTERVAL -0.055 - -0.026
CONTAINS 0.564%

stepped forward . But as soon he did , he heard
her nose at how it smelled . But then , she
was happy and it was tall . Everyone enjoyed walking in
happy and it thanked the little child . From now on
door . She crawled as fast as she could , but
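
The four INTERVAL ranges above look like equal-width bins between MIN (-0.055) and MAX (0.062), with CONTAINS giving the share of token activations that fall in each bin. The sketch below shows one way such a breakdown could be computed. It is an assumption about how the dashboard was produced, not its actual code, and the names `interval_edges` and `interval_shares` are hypothetical.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced bin edges from lo to hi."""
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins)]
    edges.append(hi)  # use hi itself to avoid float drift on the last edge
    return edges


def interval_shares(activations, edges):
    """Percentage of activations per bin, listed from lowest bin to highest.

    Bins are half-open [edge_i, edge_{i+1}), except the last, which also
    includes its upper edge. Values outside [edges[0], edges[-1]] are not
    counted in any bin but still count toward the total.
    """
    n_bins = len(edges) - 1
    counts = [0] * n_bins
    for a in activations:
        for i in range(n_bins):
            is_last = i == n_bins - 1
            if edges[i] <= a < edges[i + 1] or (is_last and a == edges[-1]):
                counts[i] += 1
                break
    total = len(activations)
    return [100.0 * c / total for c in counts]


# Edges implied by the MIN and MAX values reported above (assumed inputs).
edges = interval_edges(-0.055, 0.062)

# A few activations copied from the per-token records, for illustration only.
sample = [0.0618, 0.0118, -0.0054, -0.016]
shares = interval_shares(sample, edges)
```

With the reported MIN and MAX, the computed edges land within rounding distance of the listed boundaries (-0.026, 0.003, 0.033). The dashboard lists intervals from highest to lowest, whereas this sketch returns shares from lowest to highest.
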
Token Lily
Token ID20037
Feature activation+1.18e-02

Pos logit contributions
ese+0.025
 mir+0.024
reetings+0.023
rot+0.023
 hawk+0.022
Neg logit contributions
 work-0.014
 First-0.014
 On-0.012
 While-0.011
 Min-0.011
Token?
Token ID30
Feature activation+7.58e-03

Pos logit contributions
 mir+0.023
ese+0.023
 hawk+0.022
rot+0.021
reetings+0.021
Neg logit contributions
 work-0.020
 First-0.015
 On-0.015
 pull-0.014
 as-0.014
Token Where
Token ID6350
Feature activation-5.40e-03

Pos logit contributions
ese+0.015
 mir+0.015
 hawk+0.014
reetings+0.014
rot+0.014
Neg logit contributions
 First-0.011
 work-0.011
 On-0.010
 While-0.010
 When-0.009
Token is
Token ID318
Feature activation+9.85e-03

Pos logit contributions
 work+0.009
 First+0.009
 On+0.008
 While+0.008
 Min+0.007
Neg logit contributions
ese-0.010
 mir-0.009
 hawk-0.009
rot-0.009
reetings-0.009
Token the
Token ID262
Feature activation+2.80e-02

Pos logit contributions
ese+0.021
 mir+0.020
reetings+0.020
rot+0.019
 hawk+0.019
Neg logit contributions
 work-0.014
 First-0.013
 On-0.011
 While-0.011
 Min-0.010
Token o
Token ID267
Feature activation+6.18e-02

Pos logit contributions
lor+0.056
reetings+0.056
rot+0.056
ivering+0.056
ese+0.055
Neg logit contributions
 work-0.046
 turn-0.029
 all-0.029
 First-0.029
 pretend-0.029
Tokenasis
Token ID17765
Feature activation+2.29e-02

Pos logit contributions
 foe+0.065
 hawk+0.064
Fear+0.062
rot+0.059
Did+0.059
Neg logit contributions
 work-0.143
 On-0.122
 First-0.122
 stack-0.119
 drive-0.118
Token?
Token ID30
Feature activation+6.84e-03

Pos logit contributions
ese+0.047
 mir+0.047
 hawk+0.046
rot+0.046
Fear+0.044
Neg logit contributions
 work-0.035
 On-0.026
 First-0.026
 as-0.024
.-0.023
Token I
Token ID314
Feature activation+9.26e-03

Pos logit contributions
ese+0.014
 mir+0.013
 hawk+0.012
reetings+0.012
rot+0.012
Neg logit contributions
 First-0.011
 work-0.010
 On-0.009
 While-0.009
 When-0.009
Token made
Token ID925
Feature activation-5.22e-04

Pos logit contributions
ese+0.019
 mir+0.017
rot+0.016
reetings+0.016
 hawk+0.016
Neg logit contributions
 work-0.015
 First-0.014
 On-0.012
 While-0.012
 pull-0.011
Token a
Token ID257
Feature activation+1.94e-02

Pos logit contributions
 First+0.001
 work+0.001
 On+0.001
 While+0.001
 treadmill+0.001
Neg logit contributions
ese-0.001
 mir-0.001
 crow-0.001
 hawk-0.001
rot-0.001
Token"
Token ID1
Feature activation+1.03e-02

Pos logit contributions
 First+0.000
 work+0.000
 On+0.000
 While+0.000
 When+0.000
Neg logit contributions
 mir-0.000
ese-0.000
 hawk-0.000
reetings-0.000
 foe-0.000
TokenThis
Token ID1212
Feature activation+2.47e-02

Pos logit contributions
ese+0.018
 mir+0.017
reetings+0.015
asis+0.015
 crow+0.015
Neg logit contributions
 First-0.018
 While-0.017
 On-0.017
 work-0.017
 When-0.016
Token is
Token ID318
Feature activation-6.42e-03

Pos logit contributions
ese+0.055
rim+0.050
ivering+0.050
 mir+0.048
reetings+0.048
Neg logit contributions
 work-0.037
 First-0.029
 On-0.028
 Min-0.026
 pretend-0.025
Token Mr
Token ID1770
Feature activation+1.56e-02

Pos logit contributions
 work+0.008
 First+0.008
 Max+0.007
 On+0.007
 Tim+0.007
Neg logit contributions
ese-0.014
 mir-0.013
ivering-0.013
reetings-0.013
rot-0.013
Token.
Token ID13
Feature activation+4.24e-03

Pos logit contributions
ese+0.042
 mir+0.041
reetings+0.039
 hawk+0.039
rot+0.039
Neg logit contributions
 work-0.013
 First-0.012
 On-0.009
 While-0.008
 Min-0.007
Token Tw
Token ID1815
Feature activation+6.14e-02

Pos logit contributions
ese+0.010
 mir+0.009
ile+0.008
rim+0.008
 crow+0.008
Neg logit contributions
 First-0.005
 work-0.005
 On-0.005
 Min-0.005
 While-0.005
Tokenig
Token ID328
Feature activation+2.92e-02

Pos logit contributions
ese+0.096
 mir+0.087
 foe+0.080
Did+0.079
 hawk+0.078
Neg logit contributions
 work-0.125
 First-0.108
 On-0.106
igsaw-0.101
 Min-0.100
Token,"
Token ID553
Feature activation-1.60e-02

Pos logit contributions
 mir+0.069
ese+0.068
Did+0.064
Was+0.063
 hawk+0.063
Neg logit contributions
 work-0.035
 First-0.026
 On-0.024
 pull-0.022
 as-0.022
Token said
Token ID531
Feature activation-1.36e-02

Pos logit contributions
 work+0.024
 First+0.023
 While+0.021
 On+0.021
 treadmill+0.019
Neg logit contributions
ese-0.032
 mir-0.030
 crow-0.029
 hawk-0.028
rot-0.028
Token Ben
Token ID3932
Feature activation+3.87e-03

Pos logit contributions
 work+0.015
 First+0.015
 While+0.012
 On+0.012
 treadmill+0.012
Neg logit contributions
ese-0.032
 crow-0.030
 mir-0.030
 hawk-0.030
rot-0.029
Token proudly
Token ID23017
Feature activation-7.50e-03

Pos logit contributions
ese+0.009
 mir+0.009
rot+0.009
 crow+0.009
reetings+0.009
Neg logit contributions
 First-0.004
 work-0.004
 While-0.003
 On-0.003
 treadmill-0.003
Token park
Token ID3952
Feature activation-2.36e-03

Pos logit contributions
ese+0.030
 mir+0.029
reetings+0.027
 hawk+0.026
rot+0.026
Neg logit contributions
 work-0.033
 First-0.032
 On-0.029
 While-0.029
 Min-0.027
Token.
Token ID13
Feature activation-1.17e-03

Pos logit contributions
 work+0.002
 First+0.002
 While+0.002
 On+0.002
 Min+0.002
Neg logit contributions
ese-0.006
 mir-0.006
rot-0.005
 crow-0.005
 hawk-0.005
Token They
Token ID1119
Feature activation+1.13e-02

Pos logit contributions
 work+0.002
 Min+0.001
 First+0.001
ilda+0.001
igsaw+0.001
Neg logit contributions
 mir-0.002
ese-0.002
reetings-0.002
 crow-0.002
Did-0.002
Token see
Token ID766
Feature activation+2.16e-02

Pos logit contributions
ese+0.023
 mir+0.022
 hawk+0.021
reetings+0.021
 crow+0.021
Neg logit contributions
 work-0.016
 First-0.016
 On-0.014
 While-0.014
 Min-0.013
Token many
Token ID867
Feature activation+2.87e-02

Pos logit contributions
ese+0.061
 mir+0.060
reetings+0.059
rot+0.058
lor+0.057
Neg logit contributions
 work-0.014
 First-0.013
 On-0.009
 While-0.009
 treadmill-0.007
Token pige
Token ID32372
Feature activation+5.82e-02

Pos logit contributions
ese+0.059
lor+0.058
 mir+0.058
reetings+0.057
rot+0.056
Neg logit contributions
 work-0.042
 First-0.037
 On-0.032
 While-0.031
 treadmill-0.031
Tokenons
Token ID684
Feature activation+1.10e-02

Pos logit contributions
 mir+0.145
 foe+0.135
 hawk+0.133
reetings+0.133
 pearl+0.132
Neg logit contributions
 work-0.056
 On-0.039
he-0.037
 First-0.037
 Max-0.036
Token on
Token ID319
Feature activation+3.04e-02

Pos logit contributions
ese+0.026
 mir+0.026
reetings+0.025
 hawk+0.024
 crow+0.024
Neg logit contributions
 work-0.013
 First-0.011
 On-0.009
 While-0.008
 as-0.008
Token the
Token ID262
Feature activation+4.32e-03

Pos logit contributions
ese+0.079
 mir+0.076
 hawk+0.073
 crow+0.073
rot+0.072
Neg logit contributions
 work-0.028
 First-0.027
 On-0.021
 While-0.021
 Min-0.018
Token grass
Token ID8701
Feature activation+1.52e-02

Pos logit contributions
ese+0.008
 mir+0.008
reetings+0.008
rot+0.008
 hawk+0.008
Neg logit contributions
 work-0.007
 First-0.006
 On-0.006
 While-0.006
 treadmill-0.005
Token.
Token ID13
Feature activation+1.60e-03

Pos logit contributions
 mir+0.039
ese+0.039
reetings+0.037
 hawk+0.037
Fear+0.036
Neg logit contributions
 work-0.013
 First-0.012
 On-0.010
 While-0.009
 When-0.008
Token,
Token ID11
Feature activation+7.79e-03

Pos logit contributions
ese+0.036
 mir+0.035
reetings+0.033
 hawk+0.033
rot+0.033
Neg logit contributions
 work-0.012
 First-0.011
 While-0.008
 On-0.008
 When-0.007
Token "
Token ID366
Feature activation+7.87e-03

Pos logit contributions
ese+0.019
 mir+0.018
reetings+0.017
 hawk+0.017
rot+0.017
Neg logit contributions
 work-0.009
 First-0.009
 On-0.007
 While-0.007
 When-0.006
TokenThank
Token ID10449
Feature activation+1.07e-02

Pos logit contributions
 mir+0.014
ese+0.013
reetings+0.013
 hawk+0.012
 crow+0.012
Neg logit contributions
 First-0.013
 While-0.013
 work-0.012
 On-0.012
 When-0.011
Token you
Token ID345
Feature activation+1.04e-02

Pos logit contributions
 mir+0.018
ese+0.017
 hawk+0.016
reetings+0.016
 crow+0.016
Neg logit contributions
 work-0.019
 First-0.018
 On-0.016
 Min-0.016
 While-0.016
Token,
Token ID11
Feature activation-5.30e-03

Pos logit contributions
ese+0.026
 mir+0.025
rot+0.023
reetings+0.023
 hawk+0.023
Neg logit contributions
 work-0.011
 First-0.011
 On-0.009
 While-0.009
 Min-0.007
Token Rib
Token ID23133
Feature activation+5.77e-02

Pos logit contributions
 work+0.007
 First+0.006
 Lily+0.006
 On+0.006
 Tim+0.005
Neg logit contributions
ese-0.012
reetings-0.011
lor-0.011
 mir-0.011
ak-0.011
Tokenbit
Token ID2545
Feature activation+2.36e-02

Pos logit contributions
 hawk+0.103
 mir+0.101
Was+0.099
Did+0.098
 crow+0.093
Neg logit contributions
 work-0.090
 First-0.077
 While-0.068
 as-0.068
 On-0.067
Token.
Token ID13
Feature activation+6.32e-03

Pos logit contributions
ese+0.058
 mir+0.056
reetings+0.054
 hawk+0.054
lor+0.053
Neg logit contributions
 work-0.025
 First-0.021
 On-0.017
 While-0.017
 as-0.015
Token You
Token ID921
Feature activation-1.17e-02

Pos logit contributions
ese+0.012
 mir+0.011
 crow+0.010
 hawk+0.010
reetings+0.010
Neg logit contributions
 First-0.011
 work-0.010
 On-0.010
 While-0.010
 When-0.009
Token are
Token ID389
Feature activation+5.35e-03

Pos logit contributions
 work+0.017
 First+0.015
 On+0.013
 While+0.013
 Min+0.012
Neg logit contributions
ese-0.025
 mir-0.024
ich-0.023
 crow-0.023
 hawk-0.023
Token very
Token ID845
Feature activation+1.53e-02

Pos logit contributions
ese+0.013
 mir+0.012
lor+0.012
reetings+0.012
 hawk+0.012
Neg logit contributions
 work-0.006
 First-0.005
 Min-0.005
 Max-0.005
 as-0.004
Token day
Token ID1110
Feature activation-1.77e-02

Pos logit contributions
ese+0.068
 mir+0.064
lor+0.063
rot+0.063
reetings+0.062
Neg logit contributions
 work-0.031
 First-0.028
 On-0.025
 While-0.024
 When-0.021
Token,
Token ID11
Feature activation-9.50e-03

Pos logit contributions
 work+0.027
 First+0.025
 treadmill+0.024
igsaw+0.022
 On+0.022
Neg logit contributions
 mir-0.031
reetings-0.030
ese-0.029
rot-0.027
 crow-0.027
Token they
Token ID484
Feature activation+1.28e-02

Pos logit contributions
 work+0.015
 First+0.014
igsaw+0.014
 treadmill+0.013
 On+0.012
Neg logit contributions
 mir-0.015
 ge-0.014
Did-0.014
 pearl-0.014
 crow-0.014
Token saw
Token ID2497
Feature activation+1.04e-02

Pos logit contributions
ese+0.025
lor+0.025
 mir+0.024
rot+0.023
reetings+0.023
Neg logit contributions
 work-0.022
 First-0.020
 While-0.019
 On-0.019
 sort-0.017
Token a
Token ID257
Feature activation+1.25e-02

Pos logit contributions
ese+0.026
 mir+0.026
 hawk+0.025
 crow+0.025
rot+0.024
Neg logit contributions
 work-0.010
 First-0.010
 On-0.008
 While-0.008
 Min-0.007
Token kay
Token ID34681
Feature activation+5.71e-02

Pos logit contributions
ese+0.027
 mir+0.025
reetings+0.024
lor+0.023
rot+0.023
Neg logit contributions
 work-0.018
 First-0.016
 On-0.015
 While-0.014
 treadmill-0.013
Tokenak
Token ID461
Feature activation+7.45e-03

Pos logit contributions
 mir+0.043
ese+0.042
 hawk+0.038
 crow+0.038
 foe+0.038
Neg logit contributions
 work-0.148
 First-0.145
 On-0.141
 While-0.139
 treadmill-0.132
Token on
Token ID319
Feature activation+2.45e-02

Pos logit contributions
ese+0.016
 mir+0.016
reetings+0.015
 hawk+0.015
rot+0.015
Neg logit contributions
 work-0.010
 First-0.010
 On-0.008
 While-0.008
 When-0.007
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
 crow+0.047
 hawk+0.047
 mir+0.043
ese+0.042
 eagle+0.041
Neg logit contributions
 First-0.037
cases-0.033
 work-0.031
rie-0.031
 treadmill-0.031
Token shore
Token ID15191
Feature activation+9.77e-03

Pos logit contributions
ese+0.019
 mir+0.019
reetings+0.017
rot+0.017
 hawk+0.017
Neg logit contributions
 work-0.020
 First-0.019
 On-0.017
 While-0.017
 Min-0.016
Token.
Token ID13
Feature activation+2.91e-03

Pos logit contributions
ese+0.029
 mir+0.029
reetings+0.028
 hawk+0.028
rot+0.027
Neg logit contributions
 work-0.005
 First-0.004
 On-0.002
 While-0.002
 as-0.002
Token listened
Token ID16399
Feature activation+2.80e-02

Pos logit contributions
rot+0.001
reetings+0.001
ese+0.001
 mir+0.001
 eagle+0.001
Neg logit contributions
 work-0.001
 First-0.000
 pull-0.000
 drive-0.000
 Tim-0.000
Token to
Token ID284
Feature activation+4.42e-03

Pos logit contributions
ese+0.084
 hawk+0.080
 mir+0.080
rot+0.079
 crow+0.076
Neg logit contributions
 work-0.021
 First-0.016
 On-0.012
 as-0.011
 When-0.010
Token Lily
Token ID20037
Feature activation+5.74e-03

Pos logit contributions
ese+0.008
 mir+0.008
 crow+0.007
 hawk+0.007
reetings+0.007
Neg logit contributions
 First-0.007
 work-0.007
 On-0.006
 While-0.006
 treadmill-0.006
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
ese+0.014
 mir+0.014
 hawk+0.013
reetings+0.013
rot+0.013
Neg logit contributions
 work-0.006
 First-0.005
 On-0.004
 While-0.004
 Min-0.004
Token The
Token ID383
Feature activation+2.46e-02

Pos logit contributions
ese+0.022
 mir+0.021
 hawk+0.020
rot+0.020
 crow+0.020
Neg logit contributions
 First-0.016
 work-0.016
 On-0.014
 While-0.014
 When-0.013
Token o
Token ID267
Feature activation+5.70e-02

Pos logit contributions
rot+0.050
ese+0.049
lor+0.048
reetings+0.047
ivering+0.047
Neg logit contributions
 work-0.041
 First-0.029
 all-0.026
 turn-0.025
 stack-0.025
Tokenasis
Token ID17765
Feature activation+1.92e-02

Pos logit contributions
 foe+0.065
 hawk+0.064
Fear+0.063
Did+0.059
Was+0.058
Neg logit contributions
 work-0.121
 First-0.104
 On-0.102
 When-0.101
 stack-0.101
Token was
Token ID373
Feature activation+2.00e-03

Pos logit contributions
 mir+0.038
ese+0.038
reetings+0.036
 hawk+0.036
rot+0.035
Neg logit contributions
 work-0.030
 First-0.021
.-0.021
 as-0.021
 pull-0.021
Token important
Token ID1593
Feature activation+5.21e-03

Pos logit contributions
ese+0.004
rot+0.004
reetings+0.004
lor+0.004
 mir+0.004
Neg logit contributions
 work-0.003
 as-0.003
 Lily-0.003
 Max-0.003
 Grand-0.002
Token."
Token ID526
Feature activation+4.51e-03

Pos logit contributions
ese+0.014
 mir+0.014
reetings+0.014
 hawk+0.014
rot+0.014
Neg logit contributions
 work-0.004
 First-0.003
 On-0.002
 as-0.002
 While-0.002
Token
Token ID198
Feature activation+5.16e-03

Pos logit contributions
 mir+0.008
ese+0.008
 crow+0.008
 hawk+0.007
reetings+0.007
Neg logit contributions
 work-0.007
 First-0.007
 treadmill-0.006
 While-0.006
 pull-0.005
Token You
Token ID921
Feature activation-1.20e-02

Pos logit contributions
ese+0.000
 mir+0.000
reetings+0.000
 hawk+0.000
 crow+0.000
Neg logit contributions
 work-0.000
 First-0.000
 On-0.000
 While-0.000
 When-0.000
Token can
Token ID460
Feature activation+5.35e-03

Pos logit contributions
 work+0.020
 First+0.019
 On+0.017
 While+0.017
 pull+0.016
Neg logit contributions
ese-0.023
 mir-0.023
reetings-0.021
 crow-0.021
lor-0.021
Token play
Token ID711
Feature activation+1.26e-02

Pos logit contributions
ese+0.012
 mir+0.011
rot+0.011
 hawk+0.011
 crow+0.011
Neg logit contributions
 First-0.007
 work-0.006
 On-0.006
 While-0.006
igsaw-0.005
Token with
Token ID351
Feature activation-2.35e-03

Pos logit contributions
 mir+0.030
ese+0.030
reetings+0.029
 crow+0.028
rot+0.028
Neg logit contributions
 work-0.014
 First-0.012
 While-0.010
 On-0.010
 as-0.010
Token the
Token ID262
Feature activation+1.77e-02

Pos logit contributions
 work+0.002
 First+0.002
 While+0.001
 On+0.001
 Min+0.001
Neg logit contributions
ese-0.006
 mir-0.006
reetings-0.006
rot-0.006
lor-0.006
Token kay
Token ID34681
Feature activation+5.70e-02

Pos logit contributions
lor+0.038
ese+0.038
reetings+0.037
rot+0.035
ivering+0.035
Neg logit contributions
 work-0.028
 First-0.019
 pull-0.018
 drive-0.018
 While-0.017
Tokenak
Token ID461
Feature activation+8.88e-04

Pos logit contributions
 hawk+0.049
 foe+0.049
 mir+0.048
 crow+0.047
ese+0.045
Neg logit contributions
 work-0.138
 First-0.135
 On-0.129
 While-0.129
 treadmill-0.124
Token first
Token ID717
Feature activation+1.68e-02

Pos logit contributions
 mir+0.002
ese+0.002
 hawk+0.002
reetings+0.002
 crow+0.002
Neg logit contributions
 work-0.001
 First-0.001
 On-0.001
 While-0.001
 as-0.001
Token and
Token ID290
Feature activation-8.75e-03

Pos logit contributions
ese+0.038
 mir+0.037
 hawk+0.035
reetings+0.035
 crow+0.034
Neg logit contributions
 work-0.022
 First-0.020
 On-0.017
 While-0.017
 as-0.016
Token then
Token ID788
Feature activation+5.13e-03

Pos logit contributions
 work+0.011
 First+0.010
 While+0.008
 On+0.008
 pull+0.007
Neg logit contributions
ese-0.021
 mir-0.020
reetings-0.020
rot-0.019
lor-0.019
Token I
Token ID314
Feature activation-7.31e-03

Pos logit contributions
ese+0.010
 mir+0.010
reetings+0.009
rot+0.009
 hawk+0.009
Neg logit contributions
 work-0.008
 First-0.008
 On-0.007
 While-0.007
 pull-0.006
Token her
Token ID607
Feature activation-2.04e-03

Pos logit contributions
ese+0.006
 mir+0.006
 crow+0.006
 hawk+0.006
rot+0.006
Neg logit contributions
 First-0.003
 work-0.003
 On-0.003
 treadmill-0.003
 While-0.003
Token a
Token ID257
Feature activation-2.89e-03

Pos logit contributions
 First+0.002
 work+0.002
 On+0.002
 While+0.002
 treadmill+0.002
Neg logit contributions
ese-0.004
 mir-0.004
 crow-0.004
ak-0.004
 pearl-0.004
Token box
Token ID3091
Feature activation-4.36e-03

Pos logit contributions
 work+0.004
 First+0.004
 On+0.003
 While+0.003
 Min+0.003
Neg logit contributions
ese-0.006
 mir-0.006
reetings-0.005
 hawk-0.005
rot-0.005
Token of
Token ID286
Feature activation+1.81e-02

Pos logit contributions
 First+0.007
 work+0.007
 While+0.006
 Min+0.006
 treadmill+0.006
Neg logit contributions
ese-0.008
 mir-0.007
rot-0.007
ak-0.007
 crow-0.007
Token ch
Token ID442
Feature activation+2.97e-02

Pos logit contributions
ese+0.036
 mir+0.034
reetings+0.033
rot+0.033
lor+0.032
Neg logit contributions
 work-0.029
 First-0.027
 On-0.023
 While-0.023
 Min-0.022
Tokenocol
Token ID4668
Feature activation+5.66e-02

Pos logit contributions
ese+0.046
reetings+0.044
 hawk+0.044
Did+0.043
 mir+0.043
Neg logit contributions
 work-0.059
 First-0.052
 On-0.046
igsaw-0.045
 While-0.044
Tokenates
Token ID689
Feature activation+6.87e-03

Pos logit contributions
 mir+0.086
 hawk+0.080
 foe+0.078
 crow+0.076
ese+0.075
Neg logit contributions
 work-0.106
 First-0.103
 On-0.089
 While-0.085
 Min-0.085
Token.
Token ID13
Feature activation-1.89e-02

Pos logit contributions
ese+0.017
 mir+0.017
reetings+0.016
 hawk+0.016
rot+0.016
Neg logit contributions
 work-0.007
 First-0.006
 On-0.005
 While-0.005
 Min-0.004
Token Lily
Token ID20037
Feature activation+4.51e-03

Pos logit contributions
 work+0.026
 First+0.020
igsaw+0.020
ritz+0.019
working+0.019
Neg logit contributions
 mir-0.032
Was-0.030
Want-0.030
Did-0.029
Fear-0.029
Token opened
Token ID4721
Feature activation-8.21e-03

Pos logit contributions
ese+0.008
 mir+0.008
 crow+0.007
rot+0.007
 hawk+0.007
Neg logit contributions
 work-0.008
 First-0.008
 On-0.007
 While-0.007
 treadmill-0.006
Token the
Token ID262
Feature activation+1.31e-02

Pos logit contributions
 First+0.012
 work+0.012
 treadmill+0.011
igsaw+0.011
 While+0.011
Neg logit contributions
ese-0.014
 hawk-0.014
 crow-0.014
 pearl-0.013
 mir-0.013
Token fox
Token ID21831
Feature activation+1.12e-02

Pos logit contributions
ese+0.008
reetings+0.008
lor+0.008
ivering+0.007
Did+0.007
Neg logit contributions
 work-0.010
 First-0.007
 pull-0.007
 as-0.007
 Min-0.007
Token said
Token ID531
Feature activation+1.95e-02

Pos logit contributions
ese+0.024
 mir+0.023
reetings+0.022
 hawk+0.022
ich+0.021
Neg logit contributions
 work-0.016
 as-0.012
 First-0.012
 drive-0.012
 pull-0.012
Token:
Token ID25
Feature activation+4.71e-03

Pos logit contributions
ese+0.047
 mir+0.043
ich+0.041
 hawk+0.041
rim+0.040
Neg logit contributions
 work-0.024
 First-0.023
 On-0.020
 When-0.020
 as-0.019
Token "
Token ID366
Feature activation+9.84e-03

Pos logit contributions
ese+0.013
 mir+0.012
 hawk+0.012
ile+0.012
rot+0.012
Neg logit contributions
 work-0.004
 First-0.004
 On-0.004
 While-0.003
 When-0.003
TokenThis
Token ID1212
Feature activation+3.01e-02

Pos logit contributions
 mir+0.019
ese+0.017
 hawk+0.016
 crow+0.016
 foe+0.016
Neg logit contributions
 First-0.016
 On-0.015
 work-0.014
 While-0.014
 When-0.014
Token o
Token ID267
Feature activation+5.65e-02

Pos logit contributions
ivering+0.070
Did+0.065
ese+0.063
 unatt+0.062
ode+0.061
Neg logit contributions
 work-0.038
.-0.028
 pretend-0.027
 look-0.027
 a-0.026
Tokenasis
Token ID17765
Feature activation+2.01e-02

Pos logit contributions
 hawk+0.066
 foe+0.065
 mir+0.063
Did+0.062
 wing+0.061
Neg logit contributions
 work-0.115
 First-0.106
 On-0.105
 When-0.102
f-0.096
Token really
Token ID1107
Feature activation+5.73e-03

Pos logit contributions
 mir+0.039
ese+0.039
 hawk+0.038
 crow+0.036
rot+0.035
Neg logit contributions
 work-0.033
 First-0.027
 On-0.025
 as-0.024
 When-0.023
Token is
Token ID318
Feature activation-5.49e-03

Pos logit contributions
ese+0.013
rot+0.012
ich+0.012
 hawk+0.012
lor+0.011
Neg logit contributions
 work-0.009
 First-0.007
 On-0.007
 When-0.006
 While-0.006
Token an
Token ID281
Feature activation+5.46e-03

Pos logit contributions
 work+0.007
 as+0.005
 First+0.005
 On+0.005
 Tim+0.005
Neg logit contributions
ese-0.014
rot-0.013
 mir-0.012
lor-0.012
 hawk-0.012
Token amazing
Token ID4998
Feature activation+1.36e-02

Pos logit contributions
ese+0.011
rot+0.010
lor+0.010
Did+0.010
 mir+0.010
Neg logit contributions
 work-0.009
 First-0.008
 On-0.007
 While-0.007
 When-0.006
Token should
Token ID815
Feature activation+9.08e-03

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 pull+0.001
Neg logit contributions
ese-0.002
 mir-0.002
reetings-0.002
ich-0.002
rot-0.002
Token not
Token ID407
Feature activation+1.98e-04

Pos logit contributions
ese+0.016
 mir+0.016
reetings+0.015
rot+0.015
 hawk+0.015
Neg logit contributions
 work-0.016
 First-0.015
 On-0.013
 While-0.013
 Min-0.012
Token have
Token ID423
Feature activation-7.92e-03

Pos logit contributions
ese+0.000
 mir+0.000
reetings+0.000
rot+0.000
 hawk+0.000
Neg logit contributions
 work-0.000
 First-0.000
 On-0.000
 While-0.000
 Min-0.000
Token fed
Token ID11672
Feature activation+9.91e-03

Pos logit contributions
 work+0.011
 First+0.009
 as+0.008
 pull+0.008
 While+0.008
Neg logit contributions
reetings-0.017
rot-0.017
ese-0.017
 mir-0.017
lor-0.016
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
 mir+0.022
ese+0.021
reetings+0.021
rot+0.021
lor+0.020
Neg logit contributions
 work-0.013
 First-0.012
 On-0.010
 While-0.009
 as-0.009
Token ge
Token ID4903
Feature activation+5.62e-02

Pos logit contributions
reetings+0.036
lor+0.035
ese+0.033
ivering+0.032
rot+0.031
Neg logit contributions
 work-0.035
 First-0.025
 pull-0.025
 shelf-0.024
 stack-0.024
Tokenese
Token ID2771
Feature activation+1.71e-02

Pos logit contributions
reetings+0.039
 mir+0.037
Fear+0.036
lor+0.035
 pearl+0.035
Neg logit contributions
 work-0.143
 pull-0.129
 First-0.128
 as-0.126
 stack-0.123
Token.
Token ID13
Feature activation+6.42e-03

Pos logit contributions
 mir+0.045
ese+0.045
 hawk+0.044
reetings+0.044
rot+0.043
Neg logit contributions
 work-0.015
 First-0.012
 as-0.009
 On-0.009
 While-0.008
Token They
Token ID1119
Feature activation+8.18e-03

Pos logit contributions
ese+0.016
 mir+0.016
reetings+0.015
rot+0.015
 hawk+0.015
Neg logit contributions
 First-0.006
 work-0.006
 On-0.006
 While-0.005
 When-0.005
Token are
Token ID389
Feature activation+1.15e-02

Pos logit contributions
reetings+0.019
 mir+0.018
ese+0.018
lor+0.018
rot+0.018
Neg logit contributions
 work-0.012
 First-0.009
 pull-0.008
 On-0.007
 use-0.007
Token wild
Token ID4295
Feature activation+1.52e-02

Pos logit contributions
lor+0.028
reetings+0.027
rot+0.026
ese+0.026
 mir+0.024
Neg logit contributions
 work-0.017
 as-0.012
 Max-0.011
 First-0.011
 Lily-0.011
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
ese+0.015
 mir+0.015
reetings+0.015
 crow+0.015
rot+0.015
Neg logit contributions
 work-0.010
 First-0.009
 While-0.007
 treadmill-0.007
 On-0.007
Token
Token ID198
Feature activation+5.81e-03

Pos logit contributions
ese+0.018
 mir+0.018
reetings+0.018
 crow+0.017
ak+0.017
Neg logit contributions
 work-0.017
 First-0.016
 treadmill-0.014
igsaw-0.013
 While-0.013
Token"
Token ID1
Feature activation+8.41e-04

Pos logit contributions
ese+0.008
 mir+0.008
 hawk+0.007
reetings+0.007
rot+0.007
Neg logit contributions
 work-0.012
 First-0.012
 On-0.011
 While-0.011
 Min-0.010
TokenHello
Token ID15496
Feature activation-2.70e-03

Pos logit contributions
ese+0.001
 mir+0.001
reetings+0.001
lor+0.001
 hawk+0.001
Neg logit contributions
 First-0.002
 work-0.001
 On-0.001
 While-0.001
 brushes-0.001
Token,
Token ID11
Feature activation-1.31e-02

Pos logit contributions
 work+0.003
 First+0.003
 On+0.002
 While+0.002
 treadmill+0.002
Neg logit contributions
ese-0.007
 mir-0.006
reetings-0.006
rot-0.006
 hawk-0.006
Token caul
Token ID46087
Feature activation+5.58e-02

Pos logit contributions
 work+0.018
 First+0.017
 On+0.015
 While+0.014
 Min+0.013
Neg logit contributions
ese-0.028
 mir-0.027
reetings-0.026
 hawk-0.026
rot-0.026
Tokeniflower
Token ID42642
Feature activation+3.05e-02

Pos logit contributions
ese+0.086
 mir+0.085
 foe+0.077
 hawk+0.077
 brav+0.076
Neg logit contributions
 work-0.107
 Min-0.086
 First-0.085
 On-0.084
 shelf-0.078
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
ese+0.079
 mir+0.077
reetings+0.074
 hawk+0.073
lor+0.072
Neg logit contributions
 work-0.029
 First-0.022
 On-0.018
 Min-0.017
 While-0.016
Token I
Token ID314
Feature activation+2.36e-03

Pos logit contributions
ese+0.025
 mir+0.023
 hawk+0.022
rot+0.022
reetings+0.022
Neg logit contributions
 First-0.022
 work-0.021
 On-0.020
 While-0.020
 When-0.019
Token am
Token ID716
Feature activation+6.53e-03

Pos logit contributions
ese+0.006
ich+0.005
 hawk+0.005
lor+0.005
 mir+0.005
Neg logit contributions
 work-0.004
 First-0.003
 On-0.003
 While-0.002
 pull-0.002
Token Lily
Token ID20037
Feature activation+1.34e-02

Pos logit contributions
ese+0.016
lor+0.015
reetings+0.015
rot+0.015
 mir+0.015
Neg logit contributions
 work-0.008
 First-0.006
 Min-0.006
 On-0.005
 as-0.005
Token,
Token ID11
Feature activation+1.54e-03

Pos logit contributions
ese+0.013
 mir+0.013
rot+0.013
 crow+0.013
 hawk+0.012
Neg logit contributions
 First-0.009
 work-0.009
 On-0.007
 While-0.007
 Min-0.006
Token "
Token ID366
Feature activation+9.77e-03

Pos logit contributions
ese+0.003
 mir+0.003
 hawk+0.003
rot+0.003
reetings+0.003
Neg logit contributions
 work-0.002
 First-0.002
 On-0.002
 While-0.002
 Min-0.001
TokenThat
Token ID2504
Feature activation+1.01e-02

Pos logit contributions
ese+0.017
 mir+0.017
reetings+0.016
 hawk+0.015
ile+0.015
Neg logit contributions
 First-0.017
 While-0.016
 On-0.016
 work-0.016
 When-0.015
Token's
Token ID338
Feature activation-8.98e-03

Pos logit contributions
reetings+0.019
ese+0.019
ivering+0.018
 mir+0.018
ile+0.018
Neg logit contributions
 work-0.016
 First-0.015
 On-0.013
 While-0.013
 drive-0.012
Token a
Token ID257
Feature activation+1.95e-03

Pos logit contributions
 work+0.009
 First+0.009
 On+0.007
 While+0.007
 Min+0.006
Neg logit contributions
ese-0.023
 mir-0.022
reetings-0.021
rot-0.021
 hawk-0.021
Token kay
Token ID34681
Feature activation+5.57e-02

Pos logit contributions
ese+0.005
lor+0.005
reetings+0.005
rot+0.004
ivering+0.004
Neg logit contributions
 work-0.002
 drive-0.002
 While-0.002
 First-0.002
 On-0.002
Tokenak
Token ID461
Feature activation+5.06e-03

Pos logit contributions
 mir+0.048
 hawk+0.047
 foe+0.046
ese+0.046
 crow+0.045
Neg logit contributions
 work-0.134
 First-0.131
 On-0.127
 While-0.125
 treadmill-0.121
Token.
Token ID13
Feature activation+8.22e-03

Pos logit contributions
ese+0.013
 mir+0.013
 hawk+0.013
reetings+0.012
rot+0.012
Neg logit contributions
 work-0.005
 First-0.004
 On-0.003
 While-0.003
 When-0.003
Token It
Token ID632
Feature activation+5.42e-03

Pos logit contributions
ese+0.016
 mir+0.015
 hawk+0.014
 crow+0.014
reetings+0.014
Neg logit contributions
 First-0.014
 work-0.012
 On-0.012
 While-0.012
 When-0.011
Token's
Token ID338
Feature activation-4.06e-03

Pos logit contributions
ese+0.011
 mir+0.011
reetings+0.011
ivering+0.010
rot+0.010
Neg logit contributions
 work-0.008
 First-0.007
 pull-0.006
 On-0.006
 Min-0.006
Token used
Token ID973
Feature activation+8.29e-03

Pos logit contributions
 work+0.005
 First+0.004
 On+0.004
 While+0.003
 as+0.003
Neg logit contributions
ese-0.010
rot-0.009
lor-0.009
 mir-0.009
reetings-0.009
Token here
Token ID994
Feature activation+7.41e-03

Pos logit contributions
 mir+0.005
lor+0.005
 eagle+0.005
ivering+0.005
ese+0.005
Neg logit contributions
 work-0.001
 as-0.001
.-0.001
 First-0.001
 Lily-0.001
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
 mir+0.021
ese+0.020
 hawk+0.020
 crow+0.020
reetings+0.019
Neg logit contributions
 work-0.006
 First-0.005
 as-0.004
 On-0.004
.-0.003
Token This
Token ID770
Feature activation+2.82e-02

Pos logit contributions
ese+0.027
 mir+0.026
 hawk+0.025
 pearl+0.024
 crow+0.024
Neg logit contributions
 First-0.025
 On-0.022
 While-0.021
 work-0.021
 When-0.021
Token is
Token ID318
Feature activation-1.55e-03

Pos logit contributions
ese+0.057
ivering+0.055
Was+0.054
Did+0.054
 mir+0.054
Neg logit contributions
 work-0.045
 First-0.034
 pretend-0.032
 On-0.031
.-0.029
Token my
Token ID616
Feature activation+1.21e-02

Pos logit contributions
 work+0.002
 First+0.002
 On+0.002
 Max+0.002
 as+0.002
Neg logit contributions
ese-0.003
lor-0.003
rot-0.003
reetings-0.003
 mir-0.003
Token o
Token ID267
Feature activation+5.56e-02

Pos logit contributions
ese+0.023
reetings+0.022
lor+0.022
Did+0.022
ivering+0.022
Neg logit contributions
 work-0.022
 First-0.017
 pretend-0.015
 shelf-0.014
 On-0.014
Tokenasis
Token ID17765
Feature activation+1.73e-02

Pos logit contributions
Fear+0.063
 mir+0.063
 hawk+0.063
Did+0.061
Was+0.061
Neg logit contributions
 work-0.120
 First-0.105
 drive-0.100
 stack-0.100
 On-0.099
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
 mir+0.049
 hawk+0.048
ese+0.048
reetings+0.047
Fear+0.047
Neg logit contributions
 work-0.012
 First-0.007
 as-0.006
 On-0.005
 shelf-0.005
Token Go
Token ID1514
Feature activation+1.32e-02

Pos logit contributions
ese+0.031
 mir+0.028
 pearl+0.027
 hawk+0.027
rot+0.027
Neg logit contributions
 First-0.025
 On-0.023
 When-0.021
 While-0.021
 work-0.020
Token away
Token ID1497
Feature activation-4.31e-03

Pos logit contributions
ese+0.029
 mir+0.029
 hawk+0.027
 crow+0.027
reetings+0.027
Neg logit contributions
 work-0.018
 First-0.017
 While-0.015
 On-0.015
 When-0.013
Token!"
Token ID2474
Feature activation+4.35e-03

Pos logit contributions
 work+0.005
 First+0.004
 as+0.004
 While+0.003
 On+0.003
Neg logit contributions
ese-0.010
 mir-0.010
lor-0.009
rot-0.009
Fear-0.009
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
ese+0.027
 mir+0.026
 hawk+0.026
lor+0.025
 crow+0.025
Neg logit contributions
 work-0.007
 First-0.005
 On-0.004
 as-0.003
 While-0.003
Token I
Token ID314
Feature activation+2.17e-03

Pos logit contributions
ese+0.025
 mir+0.022
rot+0.021
 hawk+0.021
 crow+0.021
Neg logit contributions
 First-0.024
 On-0.021
 While-0.021
 work-0.021
 When-0.020
Token live
Token ID2107
Feature activation+1.67e-02

Pos logit contributions
ese+0.005
ich+0.004
 hawk+0.004
lor+0.004
 mir+0.004
Neg logit contributions
 work-0.004
 First-0.003
 On-0.003
 pull-0.003
 use-0.003
Token in
Token ID287
Feature activation+2.14e-02

Pos logit contributions
ese+0.043
 mir+0.043
 hawk+0.041
 eagle+0.040
lor+0.040
Neg logit contributions
 work-0.015
 as-0.012
 downstairs-0.012
.-0.011
 a-0.011
Token the
Token ID262
Feature activation+2.72e-02

Pos logit contributions
ese+0.061
 mir+0.058
reetings+0.058
rot+0.057
ich+0.055
Neg logit contributions
 work-0.018
 First-0.013
 On-0.011
 Min-0.010
 Max-0.009
Token re
Token ID302
Feature activation+5.52e-02

Pos logit contributions
ese+0.054
reetings+0.052
rot+0.051
lor+0.050
rors+0.046
Neg logit contributions
 work-0.044
 First-0.038
 While-0.035
 stack-0.034
 On-0.034
Tokeneds
Token ID5379
Feature activation+7.11e-03

Pos logit contributions
reetings+0.069
 brav+0.066
 mir+0.066
 hawk+0.064
Fear+0.064
Neg logit contributions
 work-0.107
 While-0.091
 First-0.090
ools-0.088
 Liz-0.087
Token.
Token ID13
Feature activation+7.53e-03

Pos logit contributions
ese+0.019
 mir+0.018
 hawk+0.017
reetings+0.017
 crow+0.017
Neg logit contributions
 work-0.006
 First-0.006
 On-0.005
 While-0.005
 When-0.004
Token I
Token ID314
Feature activation-1.44e-04

Pos logit contributions
ese+0.015
 mir+0.013
rot+0.013
 hawk+0.013
ich+0.012
Neg logit contributions
 First-0.014
 On-0.013
 While-0.012
 When-0.012
 work-0.012
Token would
Token ID561
Feature activation+1.37e-02

Pos logit contributions
 work+0.000
 First+0.000
 On+0.000
 pull+0.000
 use+0.000
Neg logit contributions
ese-0.000
ich-0.000
 hawk-0.000
lor-0.000
 eagle-0.000
Token love
Token ID1842
Feature activation+6.67e-03

Pos logit contributions
ese+0.035
lor+0.034
rot+0.034
reetings+0.033
ich+0.032
Neg logit contributions
 work-0.018
 pull-0.012
 use-0.011
 drive-0.011
 First-0.011
Token a
Token ID257
Feature activation-9.56e-03

Pos logit contributions
ese+0.004
 mir+0.004
reetings+0.004
rim+0.004
rot+0.004
Neg logit contributions
 work-0.002
 First-0.002
 On-0.001
 While-0.001
 as-0.001
Token big
Token ID1263
Feature activation+4.11e-03

Pos logit contributions
 work+0.013
 First+0.011
 While+0.010
 On+0.010
 pull+0.009
Neg logit contributions
ese-0.022
reetings-0.020
 mir-0.020
lor-0.019
rim-0.019
Token box
Token ID3091
Feature activation+5.40e-04

Pos logit contributions
ese+0.009
 mir+0.008
reetings+0.008
lor+0.008
rot+0.008
Neg logit contributions
 work-0.006
 First-0.005
 On-0.005
 While-0.004
 pull-0.004
Token of
Token ID286
Feature activation+3.98e-03

Pos logit contributions
ese+0.001
 mir+0.001
rot+0.001
reetings+0.001
 hawk+0.001
Neg logit contributions
 work-0.001
 First-0.001
 On-0.001
 While-0.001
 Min-0.001
Token ch
Token ID442
Feature activation+2.09e-02

Pos logit contributions
ese+0.009
 mir+0.008
reetings+0.008
rot+0.008
 hawk+0.008
Neg logit contributions
 work-0.005
 First-0.005
 On-0.004
 While-0.004
 Min-0.004
Tokenocol
Token ID4668
Feature activation+5.52e-02

Pos logit contributions
ese+0.031
 mir+0.029
reetings+0.027
 hawk+0.026
rot+0.026
Neg logit contributions
 work-0.044
 First-0.042
 On-0.038
 While-0.037
 Min-0.036
Tokenates
Token ID689
Feature activation+1.33e-02

Pos logit contributions
 mir+0.087
 hawk+0.078
 foe+0.078
 crow+0.074
ese+0.074
Neg logit contributions
 work-0.103
 First-0.100
 On-0.088
 Min-0.084
igsaw-0.084
Token!
Token ID0
Feature activation-3.29e-03

Pos logit contributions
ese+0.032
 mir+0.031
reetings+0.030
 hawk+0.030
rot+0.029
Neg logit contributions
 work-0.015
 First-0.014
 On-0.011
 While-0.011
 Min-0.010
Token
Token ID220
Feature activation-2.45e-02

Pos logit contributions
 work+0.005
 treadmill+0.004
 cartoons+0.004
 experiments+0.004
 First+0.004
Neg logit contributions
ese-0.005
 mir-0.005
lor-0.005
Fear-0.005
 crow-0.005
Token
Token ID198
Feature activation+2.86e-03

Pos logit contributions
 work+0.043
 First+0.037
 treadmill+0.034
 While+0.034
 cartoons+0.033
Neg logit contributions
ese-0.040
reetings-0.039
lor-0.038
rot-0.037
ak-0.037
Token
Token ID198
Feature activation+1.17e-03

Pos logit contributions
ese+0.005
 mir+0.005
reetings+0.005
lor+0.005
rot+0.005
Neg logit contributions
 work-0.005
 First-0.004
igsaw-0.004
 treadmill-0.004
 Min-0.004
Token find
Token ID1064
Feature activation+4.11e-03

Pos logit contributions
 First+0.006
 While+0.005
 work+0.005
 On+0.005
igsaw+0.005
Neg logit contributions
ese-0.009
 mir-0.009
 crow-0.008
 hawk-0.008
 eagle-0.008
Token a
Token ID257
Feature activation-9.58e-03

Pos logit contributions
ese+0.009
 mir+0.008
 crow+0.008
 hawk+0.008
 brav+0.008
Neg logit contributions
 work-0.005
 First-0.005
 On-0.005
 While-0.005
igsaw-0.005
Token box
Token ID3091
Feature activation-6.02e-03

Pos logit contributions
 work+0.014
 First+0.013
 On+0.011
 While+0.011
 Min+0.010
Neg logit contributions
ese-0.021
 mir-0.019
reetings-0.018
rot-0.018
 hawk-0.018
Token of
Token ID286
Feature activation+2.73e-03

Pos logit contributions
 First+0.009
 work+0.009
 While+0.008
 On+0.007
 Min+0.007
Neg logit contributions
ese-0.012
 mir-0.012
rot-0.011
 crow-0.011
 hawk-0.011
Token ch
Token ID442
Feature activation+2.30e-02

Pos logit contributions
ese+0.006
 mir+0.006
 crow+0.005
 hawk+0.005
rot+0.005
Neg logit contributions
 First-0.004
 work-0.003
 On-0.003
 While-0.003
 When-0.003
Tokenocol
Token ID4668
Feature activation+5.51e-02

Pos logit contributions
ese+0.036
 mir+0.034
reetings+0.031
 hawk+0.031
rot+0.031
Neg logit contributions
 work-0.045
 First-0.045
 On-0.040
 While-0.040
 Min-0.038
Tokenates
Token ID689
Feature activation+1.17e-02

Pos logit contributions
 mir+0.083
 hawk+0.076
 foe+0.074
ese+0.073
 crow+0.072
Neg logit contributions
 work-0.108
 First-0.104
 On-0.091
 While-0.088
 Min-0.087
Token.
Token ID13
Feature activation+6.97e-03

Pos logit contributions
ese+0.028
 mir+0.027
reetings+0.026
 hawk+0.026
rot+0.025
Neg logit contributions
 work-0.013
 First-0.013
 On-0.011
 While-0.010
 Min-0.009
Token
Token ID198
Feature activation+9.12e-03

Pos logit contributions
 mir+0.015
ese+0.014
 crow+0.014
 hawk+0.013
rot+0.013
Neg logit contributions
 work-0.009
 First-0.008
igsaw-0.007
 treadmill-0.007
 Min-0.007
Token
Token ID198
Feature activation+5.40e-03

Pos logit contributions
 mir+0.017
ese+0.017
reetings+0.016
ak+0.015
 crow+0.015
Neg logit contributions
 work-0.014
 First-0.013
igsaw-0.012
 While-0.011
 treadmill-0.011
Token"
Token ID1
Feature activation+4.58e-03

Pos logit contributions
ese+0.008
 mir+0.008
 hawk+0.007
reetings+0.007
rot+0.007
Neg logit contributions
 First-0.011
 work-0.011
 On-0.010
 While-0.010
 When-0.009
Token to
Token ID284
Feature activation+1.17e-02

Pos logit contributions
ese+0.003
 mir+0.003
 hawk+0.003
rot+0.003
 crow+0.003
Neg logit contributions
 work-0.002
 First-0.002
 On-0.002
 While-0.002
 Min-0.002
Token a
Token ID257
Feature activation+2.13e-02

Pos logit contributions
 mir+0.028
ese+0.027
 crow+0.026
 hawk+0.026
 foe+0.025
Neg logit contributions
 First-0.012
 work-0.011
 On-0.010
 While-0.010
igsaw-0.009
Token part
Token ID636
Feature activation+1.80e-02

Pos logit contributions
ese+0.044
reetings+0.039
rot+0.038
lor+0.038
 mir+0.038
Neg logit contributions
 work-0.034
 First-0.030
 While-0.027
 On-0.027
 treadmill-0.026
Token of
Token ID286
Feature activation+2.28e-02

Pos logit contributions
ese+0.042
reetings+0.040
 mir+0.039
lor+0.038
rot+0.037
Neg logit contributions
 work-0.022
 First-0.020
 On-0.018
 While-0.016
 as-0.016
Token the
Token ID262
Feature activation+1.77e-02

Pos logit contributions
ese+0.066
reetings+0.063
 mir+0.062
rot+0.062
lor+0.061
Neg logit contributions
 work-0.017
 First-0.014
 On-0.010
 While-0.009
 Min-0.008
Token o
Token ID267
Feature activation+5.51e-02

Pos logit contributions
lor+0.038
rot+0.037
ese+0.036
reetings+0.035
ivering+0.034
Neg logit contributions
 work-0.030
 First-0.021
 shelves-0.020
 shelf-0.020
 stack-0.020
Tokenasis
Token ID17765
Feature activation+1.42e-02

Pos logit contributions
 foe+0.049
 hawk+0.046
Fear+0.044
 mir+0.043
 pearl+0.042
Neg logit contributions
 work-0.136
 First-0.131
 On-0.126
 While-0.121
 When-0.120
Token where
Token ID810
Feature activation+2.74e-02

Pos logit contributions
ese+0.043
 mir+0.042
 hawk+0.042
reetings+0.041
rot+0.041
Neg logit contributions
 work-0.007
 First-0.005
 On-0.004
 When-0.002
 While-0.002
Token there
Token ID612
Feature activation+1.28e-02

Pos logit contributions
ese+0.081
ivering+0.077
rot+0.075
reetings+0.075
lor+0.074
Neg logit contributions
 work-0.023
 First-0.017
 Tim-0.013
 Lily-0.013
 On-0.013
Token were
Token ID547
Feature activation+6.00e-03

Pos logit contributions
ese+0.031
ivering+0.029
reetings+0.029
 mir+0.029
ich+0.029
Neg logit contributions
 work-0.015
 First-0.013
 On-0.012
 Tim-0.011
 pull-0.010
Token no
Token ID645
Feature activation+1.14e-02

Pos logit contributions
ivering+0.016
rot+0.015
ese+0.015
rim+0.015
reetings+0.015
Neg logit contributions
 work-0.006
 First-0.005
 a-0.005
 shelves-0.004
 as-0.004
TokenL
Token ID43
Feature activation+1.51e-03

Pos logit contributions
ese+0.001
 mir+0.001
ich+0.001
reetings+0.001
 hawk+0.001
Neg logit contributions
 First-0.001
 work-0.001
 On-0.001
 While-0.001
 Lily-0.001
Tokenily
Token ID813
Feature activation+5.43e-03

Pos logit contributions
ese+0.003
 mir+0.003
 hawk+0.002
rot+0.002
reetings+0.002
Neg logit contributions
 work-0.003
 First-0.003
 On-0.002
 While-0.002
 Min-0.002
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
 mir+0.015
ese+0.015
reetings+0.014
 hawk+0.014
lor+0.014
Neg logit contributions
 work-0.004
 First-0.003
 On-0.002
 While-0.002
 Min-0.002
Token these
Token ID777
Feature activation+1.20e-02

Pos logit contributions
 work+0.013
 First+0.011
 Lily+0.010
 Max+0.010
 pull+0.009
Neg logit contributions
ese-0.027
 mir-0.026
rot-0.026
reetings-0.026
lor-0.025
Token ch
Token ID442
Feature activation+3.56e-02

Pos logit contributions
ese+0.024
 mir+0.024
reetings+0.023
rot+0.022
 hawk+0.022
Neg logit contributions
 work-0.018
 First-0.017
 On-0.014
 Min-0.014
 While-0.014
Tokenocol
Token ID4668
Feature activation+5.51e-02

Pos logit contributions
ese+0.052
 mir+0.051
 hawk+0.047
reetings+0.047
 crow+0.045
Neg logit contributions
 work-0.072
 First-0.067
 On-0.061
 While-0.060
 Min-0.059
Tokenates
Token ID689
Feature activation+2.26e-02

Pos logit contributions
 mir+0.086
 hawk+0.079
ese+0.075
 foe+0.073
 crow+0.073
Neg logit contributions
 work-0.103
 First-0.100
 On-0.088
 Min-0.087
 While-0.085
Token are
Token ID389
Feature activation+2.98e-03

Pos logit contributions
 mir+0.052
ese+0.052
 hawk+0.050
reetings+0.049
rot+0.049
Neg logit contributions
 work-0.029
 First-0.022
 On-0.019
 pull-0.019
 Min-0.018
Token bitter
Token ID12922
Feature activation-4.79e-03

Pos logit contributions
ese+0.006
lor+0.006
rot+0.006
reetings+0.006
 mir+0.006
Neg logit contributions
 work-0.005
 First-0.003
 as-0.003
 Lily-0.003
 Min-0.003
Token,"
Token ID553
Feature activation-1.14e-02

Pos logit contributions
 First+0.006
 work+0.006
 On+0.005
 While+0.005
 treadmill+0.005
Neg logit contributions
ese-0.011
 mir-0.010
 crow-0.010
rot-0.010
 hawk-0.010
Token he
Token ID339
Feature activation-7.61e-03

Pos logit contributions
 work+0.014
 First+0.014
 While+0.012
 On+0.012
 treadmill+0.011
Neg logit contributions
ese-0.025
 mir-0.025
 crow-0.023
 hawk-0.023
reetings-0.023
Token He
Token ID679
Feature activation+5.65e-03

Pos logit contributions
 mir+0.013
ese+0.012
 crow+0.012
 foe+0.012
 hawk+0.011
Neg logit contributions
 work-0.012
 First-0.011
 Min-0.009
 treadmill-0.009
igsaw-0.009
Token tried
Token ID3088
Feature activation+2.03e-02

Pos logit contributions
ese+0.012
reetings+0.012
rot+0.011
lor+0.011
 mir+0.011
Neg logit contributions
 work-0.008
 First-0.007
 On-0.007
 While-0.007
 pull-0.006
Token to
Token ID284
Feature activation+8.54e-03

Pos logit contributions
ese+0.049
 mir+0.047
reetings+0.045
rot+0.045
 hawk+0.045
Neg logit contributions
 work-0.023
 First-0.022
 On-0.018
 While-0.018
 Min-0.016
Token find
Token ID1064
Feature activation+4.56e-03

Pos logit contributions
ese+0.019
 mir+0.018
 hawk+0.017
 crow+0.017
reetings+0.017
Neg logit contributions
 work-0.011
 First-0.011
 On-0.009
 While-0.009
 Min-0.008
Token the
Token ID262
Feature activation+2.18e-02

Pos logit contributions
ese+0.009
 mir+0.009
 crow+0.009
 hawk+0.009
ile+0.009
Neg logit contributions
 work-0.006
 First-0.006
 On-0.005
 While-0.005
 treadmill-0.005
Token o
Token ID267
Feature activation+5.50e-02

Pos logit contributions
lor+0.051
ese+0.049
reetings+0.049
ivering+0.048
rot+0.048
Neg logit contributions
 work-0.033
 turn-0.020
 First-0.020
 pretend-0.020
 pull-0.019
Tokenasis
Token ID17765
Feature activation+1.41e-02

Pos logit contributions
 hawk+0.055
 foe+0.051
Fear+0.050
rot+0.049
Did+0.048
Neg logit contributions
 work-0.133
 First-0.120
 On-0.115
 stack-0.113
 When-0.111
Token,
Token ID11
Feature activation+9.65e-03

Pos logit contributions
ese+0.037
 mir+0.036
reetings+0.035
 hawk+0.035
rot+0.035
Neg logit contributions
 work-0.013
 First-0.011
 On-0.009
 While-0.008
 as-0.007
Token but
Token ID475
Feature activation-8.34e-03

Pos logit contributions
ese+0.026
rot+0.026
reetings+0.025
lor+0.025
 mir+0.024
Neg logit contributions
 work-0.009
 First-0.008
 as-0.007
 On-0.007
 While-0.007
Token he
Token ID339
Feature activation+4.98e-04

Pos logit contributions
 work+0.008
 First+0.008
 On+0.006
 Max+0.006
 While+0.006
Neg logit contributions
ese-0.023
rot-0.021
 mir-0.021
reetings-0.021
ivering-0.020
Token could
Token ID714
Feature activation+7.10e-03

Pos logit contributions
ese+0.001
reetings+0.001
 mir+0.001
ivering+0.001
lor+0.001
Neg logit contributions
 work-0.001
 First-0.001
 Max-0.001
 pull-0.001
 Tim-0.001
Token,
Token ID11
Feature activation+1.72e-02

Pos logit contributions
ese+0.081
ich+0.075
 mir+0.075
reetings+0.075
rot+0.074
Neg logit contributions
 work-0.023
 First-0.021
 While-0.014
 On-0.014
 as-0.014
Token "
Token ID366
Feature activation+1.09e-02

Pos logit contributions
ese+0.044
 mir+0.041
reetings+0.040
rot+0.040
 hawk+0.040
Neg logit contributions
 work-0.018
 First-0.017
 On-0.015
 While-0.014
 When-0.013
TokenThis
Token ID1212
Feature activation+3.01e-02

Pos logit contributions
 mir+0.020
ese+0.019
reetings+0.018
 hawk+0.018
 foe+0.018
Neg logit contributions
 First-0.018
 work-0.017
 While-0.016
 On-0.015
 When-0.014
Token is
Token ID318
Feature activation+3.37e-04

Pos logit contributions
ivering+0.064
aky+0.064
ese+0.064
reetings+0.064
rot+0.063
Neg logit contributions
 work-0.047
 First-0.035
 On-0.032
 stack-0.031
 look-0.030
Token an
Token ID281
Feature activation+1.64e-02

Pos logit contributions
ese+0.001
rot+0.001
 mir+0.001
lor+0.001
reetings+0.001
Neg logit contributions
 work-0.000
 First-0.000
 On-0.000
 While-0.000
 When-0.000
Token o
Token ID267
Feature activation+5.49e-02

Pos logit contributions
ese+0.032
 mir+0.030
rot+0.030
reetings+0.029
lor+0.029
Neg logit contributions
 work-0.026
 First-0.025
 On-0.022
 While-0.021
 When-0.020
Tokenasis
Token ID17765
Feature activation+1.95e-02

Pos logit contributions
 mir+0.039
ese+0.035
 foe+0.034
 hawk+0.033
Was+0.030
Neg logit contributions
 work-0.150
 First-0.141
 On-0.135
 While-0.131
 drive-0.128
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
ese+0.043
 mir+0.043
 hawk+0.042
reetings+0.041
rot+0.040
Neg logit contributions
 work-0.025
 First-0.022
 On-0.019
 While-0.018
 as-0.017
Token It
Token ID632
Feature activation+1.31e-02

Pos logit contributions
ese+0.029
 mir+0.028
 hawk+0.027
 crow+0.026
rot+0.026
Neg logit contributions
 First-0.020
 work-0.018
 On-0.017
 While-0.017
 When-0.016
Token is
Token ID318
Feature activation+2.03e-03

Pos logit contributions
reetings+0.028
ese+0.027
rot+0.027
 mir+0.027
ich+0.026
Neg logit contributions
 work-0.020
 First-0.016
 Min-0.014
 On-0.014
 pull-0.013
Token a
Token ID257
Feature activation+7.87e-03

Pos logit contributions
ese+0.005
rot+0.005
lor+0.005
reetings+0.005
ivering+0.005
Neg logit contributions
 work-0.003
 First-0.002
 On-0.002
 as-0.002
 When-0.002
Token But
Token ID887
Feature activation-6.69e-03

Pos logit contributions
 mir+0.005
ese+0.005
reetings+0.005
Did+0.005
 foe+0.005
Neg logit contributions
 work-0.004
 First-0.003
 pull-0.003
 Min-0.003
 treadmill-0.003
Token Tom
Token ID4186
Feature activation-1.09e-03

Pos logit contributions
 work+0.007
 First+0.006
 On+0.005
 While+0.005
 When+0.004
Neg logit contributions
ese-0.017
 mir-0.017
rot-0.016
reetings-0.016
 hawk-0.016
Token was
Token ID373
Feature activation+2.14e-03

Pos logit contributions
 work+0.002
 First+0.001
 On+0.001
 While+0.001
 Max+0.001
Neg logit contributions
 mir-0.002
ese-0.002
reetings-0.002
 hawk-0.002
lor-0.002
Token not
Token ID407
Feature activation-5.20e-03

Pos logit contributions
ese+0.004
 mir+0.004
rot+0.004
reetings+0.004
lor+0.004
Neg logit contributions
 work-0.004
 First-0.003
 On-0.003
 as-0.003
 While-0.003
Token as
Token ID355
Feature activation-9.47e-04

Pos logit contributions
 work+0.009
 First+0.009
 On+0.008
 While+0.008
 Min+0.007
Neg logit contributions
ese-0.009
 mir-0.009
reetings-0.008
 hawk-0.008
rot-0.008
Token tall
Token ID7331
Feature activation-5.48e-02

Pos logit contributions
 work+0.002
 First+0.002
 On+0.001
 While+0.001
 Min+0.001
Neg logit contributions
ese-0.002
 mir-0.002
 hawk-0.002
rot-0.002
reetings-0.002
Token as
Token ID355
Feature activation-4.15e-03

Pos logit contributions
 First+0.063
 On+0.049
 While+0.049
 work+0.046
 Min+0.045
Neg logit contributions
ese-0.123
 mir-0.117
 crow-0.117
 hawk-0.114
 eagle-0.112
Token Lily
Token ID20037
Feature activation-8.25e-03

Pos logit contributions
 work+0.005
 First+0.004
 On+0.003
 While+0.003
 Min+0.003
Neg logit contributions
ese-0.010
 mir-0.010
rot-0.009
 hawk-0.009
reetings-0.009
Token,
Token ID11
Feature activation+3.54e-03

Pos logit contributions
 work+0.014
 First+0.014
 On+0.013
 While+0.013
 Min+0.012
Neg logit contributions
ese-0.014
 mir-0.013
rot-0.013
reetings-0.012
 crow-0.012
Token and
Token ID290
Feature activation+3.78e-03

Pos logit contributions
ese+0.009
reetings+0.009
 mir+0.009
lor+0.009
rot+0.009
Neg logit contributions
 work-0.004
 as-0.003
 First-0.003
 Max-0.003
 On-0.003
Token he
Token ID339
Feature activation-6.09e-04

Pos logit contributions
ese+0.008
 mir+0.008
 crow+0.008
 hawk+0.008
rot+0.008
Neg logit contributions
 work-0.005
 First-0.005
 On-0.004
 While-0.004
 Min-0.004
Token anyway
Token ID6949
Feature activation+1.60e-02

Pos logit contributions
 First+0.009
 work+0.008
 While+0.008
 On+0.007
 Min+0.007
Neg logit contributions
ese-0.017
 mir-0.017
 hawk-0.016
rot-0.016
 crow-0.016
Token.
Token ID13
Feature activation-7.35e-03

Pos logit contributions
ese+0.039
 mir+0.039
reetings+0.037
 crow+0.036
rot+0.036
Neg logit contributions
 First-0.016
 work-0.015
 While-0.013
 On-0.012
 Min-0.011
Token
Token ID198
Feature activation+1.37e-03

Pos logit contributions
 work+0.014
working+0.011
 First+0.011
 pull+0.011
 treadmill+0.011
Neg logit contributions
reetings-0.010
ese-0.010
Fear-0.009
Did-0.009
 mir-0.009
Token
Token ID198
Feature activation-7.93e-03

Pos logit contributions
ese+0.002
 mir+0.002
ak+0.002
reetings+0.002
rot+0.002
Neg logit contributions
 work-0.002
 First-0.002
 Min-0.002
 treadmill-0.002
igsaw-0.002
TokenAs
Token ID1722
Feature activation-3.33e-03

Pos logit contributions
 work+0.014
 First+0.013
 Min+0.012
 On+0.012
 While+0.011
Neg logit contributions
ese-0.013
 mir-0.013
reetings-0.012
rot-0.012
 hawk-0.012
Token night
Token ID1755
Feature activation-5.46e-02

Pos logit contributions
 First+0.004
 work+0.003
 On+0.003
 treadmill+0.003
 While+0.003
Neg logit contributions
 mir-0.007
ese-0.007
 hawk-0.007
 crow-0.007
reetings-0.007
Token came
Token ID1625
Feature activation+6.81e-03

Pos logit contributions
 First+0.108
 work+0.096
igsaw+0.093
 treadmill+0.092
 experiments+0.090
Neg logit contributions
 hawk-0.064
 mir-0.064
 crow-0.063
ull-0.061
rot-0.058
Token,
Token ID11
Feature activation-5.73e-03

Pos logit contributions
ese+0.017
 mir+0.016
lor+0.015
 hawk+0.015
rot+0.015
Neg logit contributions
 work-0.008
 First-0.008
 While-0.006
 On-0.006
 When-0.006
Token Tim
Token ID5045
Feature activation-1.00e-02

Pos logit contributions
 work+0.007
 First+0.007
 treadmill+0.006
igsaw+0.006
 extingu+0.006
Neg logit contributions
 hawk-0.010
Did-0.009
Fear-0.009
 crow-0.009
Nobody-0.009
Token went
Token ID1816
Feature activation+6.56e-04

Pos logit contributions
 work+0.016
 First+0.014
 While+0.013
 On+0.013
 Max+0.013
Neg logit contributions
 mir-0.020
ese-0.019
reetings-0.019
 hawk-0.018
rot-0.018
Token home
Token ID1363
Feature activation-2.29e-02

Pos logit contributions
ese+0.001
 mir+0.001
reetings+0.001
 hawk+0.001
 crow+0.001
Neg logit contributions
 First-0.001
 work-0.001
 On-0.001
 While-0.001
 Min-0.001
Token and
Token ID290
Feature activation+1.91e-03

Pos logit contributions
ese+0.022
 mir+0.021
lor+0.020
reetings+0.020
 hawk+0.020
Neg logit contributions
 work-0.013
 First-0.011
 On-0.010
 as-0.010
 pull-0.009
Token it
Token ID340
Feature activation+2.36e-03

Pos logit contributions
ese+0.005
 mir+0.004
reetings+0.004
lor+0.004
rot+0.004
Neg logit contributions
 work-0.002
 First-0.002
 On-0.002
 Min-0.002
 While-0.002
Token was
Token ID373
Feature activation-1.55e-02

Pos logit contributions
ese+0.005
 mir+0.005
reetings+0.005
lor+0.005
 hawk+0.005
Neg logit contributions
 work-0.003
 First-0.003
 On-0.003
 pull-0.002
 Min-0.002
Token just
Token ID655
Feature activation+7.91e-03

Pos logit contributions
 work+0.023
 First+0.019
 On+0.018
 as+0.017
 Em+0.017
Neg logit contributions
ese-0.034
 mir-0.032
lor-0.030
reetings-0.030
rot-0.030
Token as
Token ID355
Feature activation-6.42e-03

Pos logit contributions
ese+0.019
 mir+0.018
rot+0.018
lor+0.017
reetings+0.017
Neg logit contributions
 work-0.010
 First-0.008
 On-0.007
 as-0.007
 While-0.007
Token tall
Token ID7331
Feature activation-5.42e-02

Pos logit contributions
 work+0.009
 First+0.008
 On+0.007
 While+0.007
 Min+0.007
Neg logit contributions
ese-0.014
 mir-0.013
rot-0.013
lor-0.013
reetings-0.013
Token.
Token ID13
Feature activation-3.11e-03

Pos logit contributions
 First+0.045
 work+0.037
 On+0.034
 While+0.033
 Min+0.028
Neg logit contributions
ese-0.142
 mir-0.139
 crow-0.136
 hawk-0.136
rot-0.132
Token
Token ID198
Feature activation+5.33e-03

Pos logit contributions
 work+0.004
 First+0.003
 While+0.002
 On+0.002
 treadmill+0.002
Neg logit contributions
ese-0.007
 mir-0.007
reetings-0.007
rot-0.007
 hawk-0.007
Token
Token ID198
Feature activation+5.24e-03

Pos logit contributions
ese+0.013
 mir+0.012
 hawk+0.012
rot+0.012
 crow+0.012
Neg logit contributions
 work-0.006
 First-0.006
 On-0.005
 While-0.005
 When-0.004
TokenThe
Token ID464
Feature activation-1.07e-02

Pos logit contributions
ese+0.011
 mir+0.011
 pearl+0.010
 hawk+0.010
 foe+0.010
Neg logit contributions
 First-0.008
 On-0.007
 work-0.007
 While-0.007
 When-0.006
Token little
Token ID1310
Feature activation-2.43e-02

Pos logit contributions
 work+0.015
 First+0.014
 On+0.013
 While+0.012
 Min+0.011
Neg logit contributions
ese-0.023
 mir-0.022
reetings-0.020
 hawk-0.020
rot-0.020
Token chickens
Token ID25972
Feature activation+4.12e-03

Pos logit contributions
reetings+0.007
ese+0.007
 mir+0.007
rot+0.007
lor+0.006
Neg logit contributions
 work-0.008
 First-0.006
 Min-0.005
 While-0.005
 On-0.005
Token very
Token ID845
Feature activation+6.15e-03

Pos logit contributions
ese+0.010
 mir+0.009
reetings+0.009
rot+0.009
 hawk+0.009
Neg logit contributions
 work-0.005
 First-0.005
 On-0.004
 While-0.004
 Min-0.003
Token safe
Token ID3338
Feature activation-6.13e-03

Pos logit contributions
ese+0.009
 mir+0.009
 crow+0.008
 hawk+0.008
reetings+0.008
Neg logit contributions
 First-0.012
 work-0.011
 On-0.011
 While-0.011
 Min-0.010
Token.
Token ID13
Feature activation-1.52e-03

Pos logit contributions
 work+0.005
 First+0.004
 On+0.003
 While+0.003
 Min+0.002
Neg logit contributions
ese-0.017
 mir-0.017
reetings-0.016
 hawk-0.016
rot-0.016
Token As
Token ID1081
Feature activation+1.42e-03

Pos logit contributions
 work+0.002
 First+0.002
igsaw+0.001
 Min+0.001
 treadmill+0.001
Neg logit contributions
ese-0.003
 mir-0.003
 crow-0.003
 foe-0.003
Fear-0.003
Token night
Token ID1755
Feature activation-5.41e-02

Pos logit contributions
 mir+0.003
ese+0.003
 hawk+0.003
 crow+0.003
reetings+0.003
Neg logit contributions
 First-0.002
 work-0.002
 On-0.002
 While-0.002
 treadmill-0.001
Token began
Token ID2540
Feature activation-3.40e-03

Pos logit contributions
 First+0.107
 work+0.092
 treadmill+0.092
igsaw+0.088
 experiments+0.086
Neg logit contributions
 mir-0.070
 crow-0.069
 hawk-0.067
rot-0.061
reetings-0.059
Token to
Token ID284
Feature activation-1.60e-02

Pos logit contributions
 First+0.005
 work+0.004
 While+0.004
 On+0.004
 Em+0.003
Neg logit contributions
 mir-0.007
ese-0.007
 crow-0.006
 hawk-0.006
reetings-0.006
Token fall
Token ID2121
Feature activation-2.68e-02

Pos logit contributions
 First+0.023
 While+0.020
 work+0.020
 On+0.020
 When+0.018
Neg logit contributions
 mir-0.031
ese-0.031
 crow-0.029
 hawk-0.029
 eagle-0.028
Token,
Token ID11
Feature activation+3.68e-03

Pos logit contributions
 First+0.037
 work+0.033
 While+0.030
 On+0.030
igsaw+0.027
Neg logit contributions
 mir-0.053
ese-0.053
 hawk-0.052
 crow-0.049
reetings-0.049
Token she
Token ID673
Feature activation+4.39e-03

Pos logit contributions
 mir+0.008
ese+0.008
 crow+0.007
 hawk+0.007
 foe+0.007
Neg logit contributions
 First-0.005
 work-0.005
 On-0.004
 While-0.004
 treadmill-0.004
Token end
Token ID886
Feature activation-3.46e-04

Pos logit contributions
 work+0.006
 First+0.005
 On+0.005
 While+0.005
 pull+0.004
Neg logit contributions
ese-0.008
 mir-0.007
rot-0.007
reetings-0.007
lor-0.007
Token in
Token ID287
Feature activation+1.07e-02

Pos logit contributions
 work+0.000
 First+0.000
 On+0.000
 While+0.000
 Min+0.000
Neg logit contributions
ese-0.001
 mir-0.001
reetings-0.001
 hawk-0.001
rot-0.001
Token sight
Token ID6504
Feature activation-6.37e-03

Pos logit contributions
ese+0.026
 mir+0.025
reetings+0.024
rot+0.024
 hawk+0.023
Neg logit contributions
 work-0.012
 First-0.011
 On-0.010
 While-0.009
 Min-0.008
Token.
Token ID13
Feature activation-3.55e-03

Pos logit contributions
 work+0.006
 First+0.005
 On+0.004
 While+0.004
 Min+0.004
Neg logit contributions
ese-0.017
 mir-0.016
 hawk-0.015
rot-0.015
reetings-0.015
Token As
Token ID1081
Feature activation-6.35e-04

Pos logit contributions
 work+0.006
 First+0.005
 treadmill+0.004
igsaw+0.004
 pull+0.004
Neg logit contributions
ese-0.007
 mir-0.006
reetings-0.006
lor-0.006
 hawk-0.006
Token night
Token ID1755
Feature activation-5.32e-02

Pos logit contributions
 First+0.001
 work+0.001
 On+0.001
 While+0.001
 treadmill+0.001
Neg logit contributions
ese-0.001
 mir-0.001
 hawk-0.001
 crow-0.001
reetings-0.001
Token fell
Token ID3214
Feature activation-2.58e-02

Pos logit contributions
 First+0.107
 work+0.094
 treadmill+0.091
igsaw+0.089
 brushes+0.087
Neg logit contributions
 mir-0.072
 hawk-0.070
 crow-0.069
rot-0.064
reetings-0.063
Token,
Token ID11
Feature activation+3.70e-03

Pos logit contributions
 First+0.030
 work+0.029
 On+0.025
 While+0.024
igsaw+0.021
Neg logit contributions
ese-0.058
 mir-0.058
 hawk-0.056
reetings-0.054
 crow-0.054
Token he
Token ID339
Feature activation-2.43e-04

Pos logit contributions
 mir+0.008
ese+0.008
 hawk+0.007
 crow+0.007
 eagle+0.007
Neg logit contributions
 First-0.005
 work-0.005
 On-0.004
 While-0.004
 treadmill-0.004
Token started
Token ID2067
Feature activation+4.82e-03

Pos logit contributions
 work+0.000
 While+0.000
 pull+0.000
 First+0.000
 as+0.000
Neg logit contributions
reetings-0.001
ese-0.001
 mir-0.001
rot-0.001
rim-0.001
Token to
Token ID284
Feature activation-8.59e-03

Pos logit contributions
ese+0.011
 mir+0.011
reetings+0.011
rot+0.010
 hawk+0.010
Neg logit contributions
 work-0.006
 First-0.005
 On-0.005
 While-0.004
 Min-0.004
Token explore
Token ID7301
Feature activation+6.10e-03

Pos logit contributions
ese+0.032
 mir+0.031
 hawk+0.029
rot+0.029
 crow+0.029
Neg logit contributions
 work-0.017
 First-0.015
 On-0.014
 While-0.013
 When-0.012
Token!
Token ID0
Feature activation-1.18e-03

Pos logit contributions
ese+0.017
 mir+0.016
reetings+0.016
rot+0.016
 crow+0.015
Neg logit contributions
 work-0.006
 First-0.004
 On-0.004
 as-0.003
 Min-0.003
Token
Token ID220
Feature activation-2.13e-02

Pos logit contributions
 work+0.002
 First+0.001
 Min+0.001
 treadmill+0.001
 cartoons+0.001
Neg logit contributions
ese-0.002
reetings-0.002
Fear-0.002
rot-0.002
 mir-0.002
Token
Token ID198
Feature activation+4.13e-03

Pos logit contributions
 work+0.035
 First+0.031
 While+0.027
 treadmill+0.027
 Min+0.027
Neg logit contributions
ese-0.038
reetings-0.036
 mir-0.035
rot-0.035
lor-0.035
TokenAs
Token ID1722
Feature activation-4.81e-03

Pos logit contributions
ese+0.008
 mir+0.008
reetings+0.007
rot+0.007
 crow+0.007
Neg logit contributions
 work-0.006
 First-0.006
 Min-0.005
 On-0.005
 While-0.005
Token evening
Token ID6180
Feature activation-5.28e-02

Pos logit contributions
 work+0.006
 First+0.006
 On+0.005
 While+0.005
 treadmill+0.005
Neg logit contributions
ese-0.010
 mir-0.010
 hawk-0.009
 crow-0.009
reetings-0.009
Token came
Token ID1625
Feature activation+9.18e-03

Pos logit contributions
 First+0.092
 work+0.086
 treadmill+0.084
igsaw+0.081
resses+0.081
Neg logit contributions
 hawk-0.069
 crow-0.068
 mir-0.067
rot-0.063
ese-0.063
Token,
Token ID11
Feature activation+6.02e-04

Pos logit contributions
ese+0.023
 mir+0.021
 crow+0.020
 hawk+0.020
ile+0.020
Neg logit contributions
 work-0.011
 First-0.010
 On-0.009
 While-0.009
 When-0.008
Token Amy
Token ID14235
Feature activation-9.32e-03

Pos logit contributions
 hawk+0.001
 crow+0.001
ese+0.001
 foe+0.001
Fear+0.001
Neg logit contributions
 work-0.001
 First-0.001
 treadmill-0.001
igsaw-0.001
 On-0.001
Token entered
Token ID5982
Feature activation-2.53e-04

Pos logit contributions
 work+0.014
 First+0.012
 While+0.012
 On+0.011
 Min+0.011
Neg logit contributions
ese-0.019
reetings-0.019
 mir-0.018
 hawk-0.018
ile-0.018
Token the
Token ID262
Feature activation+6.24e-03

Pos logit contributions
 First+0.000
 work+0.000
 While+0.000
 On+0.000
 treadmill+0.000
Neg logit contributions
ese-0.001
 mir-0.001
 crow-0.001
 hawk-0.001
reetings-0.001
Token her
Token ID607
Feature activation+8.69e-03

Pos logit contributions
 First+0.005
 work+0.005
 While+0.004
 On+0.004
 treadmill+0.004
Neg logit contributions
ese-0.010
 mir-0.010
 crow-0.009
 hawk-0.009
rot-0.009
Token hands
Token ID2832
Feature activation-5.28e-04

Pos logit contributions
ese+0.016
 mir+0.016
reetings+0.015
 hawk+0.015
rot+0.015
Neg logit contributions
 work-0.014
 First-0.014
 On-0.012
 While-0.012
 Min-0.011
Token and
Token ID290
Feature activation-2.29e-03

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 Min+0.001
Neg logit contributions
ese-0.001
 mir-0.001
reetings-0.001
 hawk-0.001
rot-0.001
Token stretched
Token ID19110
Feature activation+6.05e-03

Pos logit contributions
 work+0.004
 First+0.004
 On+0.003
 While+0.003
igsaw+0.003
Neg logit contributions
ese-0.004
 mir-0.004
 crow-0.004
 hawk-0.004
rot-0.004
Token as
Token ID355
Feature activation-2.38e-03

Pos logit contributions
ese+0.015
 mir+0.014
 hawk+0.013
rot+0.013
 crow+0.013
Neg logit contributions
 work-0.006
 First-0.006
 While-0.005
 On-0.005
 Min-0.004
Token tall
Token ID7331
Feature activation-5.27e-02

Pos logit contributions
 First+0.004
 work+0.003
 While+0.003
 On+0.003
igsaw+0.003
Neg logit contributions
 mir-0.004
ese-0.004
 hawk-0.004
 crow-0.004
rot-0.004
Token as
Token ID355
Feature activation-9.24e-03

Pos logit contributions
 First+0.059
 On+0.050
 While+0.050
igsaw+0.048
 work+0.046
Neg logit contributions
 crow-0.118
 hawk-0.117
 mir-0.114
ese-0.114
 eagle-0.111
Token she
Token ID673
Feature activation-1.02e-02

Pos logit contributions
 First+0.012
 work+0.012
 While+0.011
 On+0.010
 treadmill+0.009
Neg logit contributions
 mir-0.019
ese-0.019
 hawk-0.018
 crow-0.018
reetings-0.018
Token could
Token ID714
Feature activation-9.32e-03

Pos logit contributions
 work+0.018
 First+0.018
 On+0.016
 While+0.016
 When+0.015
Neg logit contributions
ese-0.018
 mir-0.017
 hawk-0.016
reetings-0.016
rot-0.015
Token.
Token ID13
Feature activation+8.15e-04

Pos logit contributions
 work+0.008
 First+0.008
 On+0.006
 While+0.006
 Min+0.005
Neg logit contributions
ese-0.025
 mir-0.024
reetings-0.023
 hawk-0.023
rot-0.023
Token Somehow
Token ID36651
Feature activation-7.68e-03

Pos logit contributions
ese+0.002
 mir+0.002
reetings+0.002
 crow+0.002
rot+0.002
Neg logit contributions
 work-0.001
 First-0.001
 treadmill-0.001
igsaw-0.001
 While-0.001
Token Soon
Token ID15894
Feature activation-1.74e-02

Pos logit contributions
 work+0.006
 First+0.005
 While+0.004
 treadmill+0.004
 On+0.004
Neg logit contributions
ese-0.011
 mir-0.011
 hawk-0.010
reetings-0.010
 crow-0.010
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 work+0.021
 First+0.020
 On+0.016
 While+0.016
 Min+0.015
Neg logit contributions
ese-0.041
 mir-0.039
reetings-0.037
 hawk-0.037
rot-0.037
Token it
Token ID340
Feature activation+5.45e-03

Pos logit contributions
ese+0.030
 mir+0.029
reetings+0.027
rot+0.027
lor+0.026
Neg logit contributions
 work-0.012
 First-0.011
 On-0.009
 While-0.009
 Min-0.008
Token was
Token ID373
Feature activation-1.13e-02

Pos logit contributions
ese+0.013
 mir+0.012
reetings+0.011
ich+0.011
rot+0.011
Neg logit contributions
 work-0.008
 First-0.007
 On-0.006
 While-0.005
 pull-0.005
Token as
Token ID355
Feature activation+1.62e-03

Pos logit contributions
 work+0.015
 First+0.014
 On+0.012
 While+0.012
 Min+0.011
Neg logit contributions
ese-0.025
 mir-0.024
rot-0.023
reetings-0.023
 hawk-0.023
Token tall
Token ID7331
Feature activation-5.26e-02

Pos logit contributions
ese+0.004
 mir+0.003
rot+0.003
reetings+0.003
lor+0.003
Neg logit contributions
 work-0.002
 First-0.002
 On-0.002
 While-0.002
 Min-0.002
Token and
Token ID290
Feature activation-1.80e-03

Pos logit contributions
 First+0.052
 On+0.041
 work+0.041
 While+0.040
 treadmill+0.037
Neg logit contributions
 hawk-0.126
ese-0.125
 crow-0.125
 mir-0.124
 eagle-0.121
Token wide
Token ID3094
Feature activation-3.40e-02

Pos logit contributions
 work+0.002
 First+0.002
 On+0.002
 While+0.002
 Min+0.002
Neg logit contributions
ese-0.004
 mir-0.004
rot-0.004
lor-0.004
reetings-0.004
Token as
Token ID355
Feature activation-1.16e-02

Pos logit contributions
 First+0.035
 work+0.032
 While+0.028
 On+0.027
 treadmill+0.024
Neg logit contributions
ese-0.082
 mir-0.080
 hawk-0.077
 crow-0.077
reetings-0.076
Token a
Token ID257
Feature activation-1.56e-02

Pos logit contributions
 work+0.014
 First+0.013
 On+0.011
 While+0.011
 Min+0.010
Neg logit contributions
ese-0.027
 mir-0.026
reetings-0.024
rot-0.024
 hawk-0.024
Token big
Token ID1263
Feature activation-1.39e-02

Pos logit contributions
 work+0.024
 First+0.022
 On+0.019
 While+0.019
 Min+0.018
Neg logit contributions
ese-0.032
 mir-0.030
reetings-0.029
rot-0.028
 hawk-0.028
Token to
Token ID284
Feature activation+8.45e-04

Pos logit contributions
ese+0.017
 mir+0.017
 hawk+0.016
 crow+0.016
reetings+0.016
Neg logit contributions
 work-0.007
 First-0.007
 On-0.006
 While-0.006
 When-0.005
Token an
Token ID281
Feature activation+9.29e-03

Pos logit contributions
ese+0.002
 mir+0.002
 crow+0.002
 hawk+0.002
reetings+0.002
Neg logit contributions
 First-0.001
 work-0.001
 On-0.001
 While-0.001
 treadmill-0.001
Token end
Token ID886
Feature activation+4.61e-03

Pos logit contributions
ese+0.019
 mir+0.019
reetings+0.018
 hawk+0.018
rot+0.017
Neg logit contributions
 work-0.013
 First-0.013
 On-0.011
 While-0.011
 Min-0.010
Token.
Token ID13
Feature activation-8.90e-03

Pos logit contributions
ese+0.011
 mir+0.011
 hawk+0.010
reetings+0.010
rot+0.010
Neg logit contributions
 work-0.005
 First-0.005
 On-0.004
 While-0.004
 Min-0.003
Token As
Token ID1081
Feature activation-5.27e-04

Pos logit contributions
 work+0.011
 First+0.009
 treadmill+0.007
 pull+0.007
igsaw+0.007
Neg logit contributions
ese-0.019
 mir-0.019
 crow-0.018
 hawk-0.018
 foe-0.018
Token night
Token ID1755
Feature activation-5.14e-02

Pos logit contributions
 work+0.000
 First+0.000
 On+0.000
 While+0.000
 Min+0.000
Neg logit contributions
ese-0.001
 mir-0.001
rot-0.001
reetings-0.001
lor-0.001
Token started
Token ID2067
Feature activation-5.55e-03

Pos logit contributions
 First+0.104
 work+0.095
 treadmill+0.093
igsaw+0.090
 On+0.085
Neg logit contributions
 mir-0.065
 crow-0.064
 hawk-0.063
rot-0.058
ese-0.058
Token to
Token ID284
Feature activation-1.48e-02

Pos logit contributions
 work+0.007
 First+0.006
 On+0.006
 While+0.005
 Min+0.005
Neg logit contributions
ese-0.013
 mir-0.012
reetings-0.012
rot-0.012
 hawk-0.012
Token fall
Token ID2121
Feature activation-2.44e-02

Pos logit contributions
 work+0.021
 First+0.020
 On+0.017
 While+0.017
 Min+0.015
Neg logit contributions
ese-0.031
 mir-0.030
reetings-0.028
 hawk-0.028
rot-0.028
Token,
Token ID11
Feature activation+3.11e-03

Pos logit contributions
 First+0.034
 work+0.032
 While+0.028
 On+0.028
igsaw+0.026
Neg logit contributions
ese-0.049
 mir-0.048
 hawk-0.046
reetings-0.045
 crow-0.045
Token the
Token ID262
Feature activation-1.24e-02

Pos logit contributions
ese+0.007
 mir+0.007
 hawk+0.007
 crow+0.007
rot+0.006
Neg logit contributions
 work-0.003
 First-0.003
 On-0.003
 While-0.003
igsaw-0.003
Token enough
Token ID1576
Feature activation+2.36e-02

Pos logit contributions
 First+0.005
 work+0.005
 While+0.005
 On+0.005
 When+0.004
Neg logit contributions
ese-0.008
 mir-0.008
 crow-0.007
 hawk-0.007
rot-0.007
Token to
Token ID284
Feature activation-8.83e-03

Pos logit contributions
ese+0.061
 mir+0.059
rot+0.057
reetings+0.056
 hawk+0.056
Neg logit contributions
 work-0.023
 First-0.021
 On-0.017
 While-0.016
 Min-0.014
Token make
Token ID787
Feature activation-1.46e-02

Pos logit contributions
 First+0.009
 work+0.009
 While+0.007
 On+0.007
 Min+0.006
Neg logit contributions
ese-0.022
 mir-0.021
 crow-0.020
reetings-0.020
 hawk-0.020
Token him
Token ID683
Feature activation-4.32e-04

Pos logit contributions
 First+0.017
 work+0.017
 While+0.015
 On+0.015
 treadmill+0.013
Neg logit contributions
ese-0.033
 mir-0.031
 crow-0.030
rot-0.030
 hawk-0.029
Token as
Token ID355
Feature activation+9.97e-04

Pos logit contributions
 work+0.001
 First+0.001
 On+0.000
 While+0.000
 Min+0.000
Neg logit contributions
ese-0.001
 mir-0.001
reetings-0.001
 hawk-0.001
rot-0.001
Token tall
Token ID7331
Feature activation-5.14e-02

Pos logit contributions
ese+0.002
 mir+0.002
 crow+0.002
 hawk+0.002
reetings+0.002
Neg logit contributions
 First-0.001
 work-0.001
 On-0.001
 While-0.001
 Min-0.001
Token as
Token ID355
Feature activation-7.16e-03

Pos logit contributions
 First+0.047
 On+0.039
 work+0.039
 While+0.038
igsaw+0.033
Neg logit contributions
ese-0.129
 mir-0.125
 crow-0.122
 hawk-0.120
 eagle-0.117
Token Maria
Token ID14200
Feature activation-1.57e-02

Pos logit contributions
 work+0.008
 First+0.008
 On+0.007
 While+0.006
 Min+0.006
Neg logit contributions
ese-0.017
 mir-0.016
reetings-0.015
 hawk-0.015
rot-0.015
Token.
Token ID13
Feature activation-2.08e-03

Pos logit contributions
 work+0.025
 First+0.023
 On+0.021
 While+0.020
 Min+0.019
Neg logit contributions
ese-0.030
 mir-0.030
 hawk-0.028
reetings-0.028
rot-0.027
Token He
Token ID679
Feature activation+4.13e-03

Pos logit contributions
 work+0.003
 First+0.002
 treadmill+0.002
igsaw+0.002
 pull+0.002
Neg logit contributions
ese-0.004
reetings-0.004
 mir-0.004
Fear-0.004
Was-0.004
Token chose
Token ID7690
Feature activation+7.29e-03

Pos logit contributions
ese+0.009
 mir+0.008
reetings+0.008
 hawk+0.008
rot+0.008
Neg logit contributions
 work-0.006
 First-0.006
 While-0.005
 On-0.005
 Min-0.005
Token an
Token ID281
Feature activation+5.73e-03

Pos logit contributions
ese+0.003
 mir+0.003
 hawk+0.002
 crow+0.002
reetings+0.002
Neg logit contributions
 First-0.002
 work-0.002
 While-0.001
 On-0.001
 Min-0.001
Token adventure
Token ID8855
Feature activation-1.24e-02

Pos logit contributions
ese+0.010
 mir+0.010
 crow+0.009
 hawk+0.009
 eagle+0.009
Neg logit contributions
 First-0.009
 work-0.009
 While-0.008
 On-0.008
 Min-0.008
Token anyway
Token ID6949
Feature activation+2.06e-02

Pos logit contributions
 First+0.013
 work+0.012
 While+0.010
 On+0.010
 Min+0.009
Neg logit contributions
ese-0.030
 mir-0.029
 crow-0.028
 hawk-0.028
rot-0.027
Token.
Token ID13
Feature activation+3.24e-03

Pos logit contributions
ese+0.053
 mir+0.052
reetings+0.050
 hawk+0.049
 crow+0.049
Neg logit contributions
 First-0.018
 work-0.018
 While-0.014
 On-0.013
 Min-0.012
Token As
Token ID1081
Feature activation+4.56e-03

Pos logit contributions
ese+0.006
 mir+0.006
reetings+0.006
 hawk+0.006
 crow+0.006
Neg logit contributions
 work-0.005
 First-0.004
 Min-0.004
igsaw-0.004
 cartoons-0.004
Token night
Token ID1755
Feature activation-5.06e-02

Pos logit contributions
ese+0.010
 mir+0.010
 hawk+0.009
 crow+0.009
reetings+0.009
Neg logit contributions
 work-0.006
 First-0.006
 On-0.005
 While-0.005
 Min-0.004
Tokenfall
Token ID7207
Feature activation-2.35e-02

Pos logit contributions
 First+0.095
 work+0.087
igsaw+0.083
 treadmill+0.079
 Min+0.078
Neg logit contributions
 mir-0.075
 hawk-0.073
 crow-0.072
ese-0.068
rot-0.065
Token came
Token ID1625
Feature activation+7.97e-03

Pos logit contributions
 work+0.038
 First+0.037
 On+0.033
 While+0.032
 Min+0.030
Neg logit contributions
ese-0.044
 mir-0.043
reetings-0.040
 hawk-0.040
 crow-0.039
Token,
Token ID11
Feature activation+5.03e-03

Pos logit contributions
ese+0.020
 mir+0.019
lor+0.018
rim+0.018
rot+0.018
Neg logit contributions
 work-0.009
 First-0.008
 On-0.007
 While-0.007
 When-0.006
Token he
Token ID339
Feature activation+4.29e-03

Pos logit contributions
ese+0.010
 mir+0.010
 hawk+0.010
 crow+0.009
reetings+0.009
Neg logit contributions
 work-0.007
 First-0.007
 On-0.006
 While-0.006
igsaw-0.006
Token was
Token ID373
Feature activation-7.63e-04

Pos logit contributions
ese+0.010
reetings+0.009
rot+0.009
 hawk+0.009
ile+0.009
Neg logit contributions
 work-0.006
 While-0.005
 First-0.004
 On-0.004
 pull-0.004
Token would
Token ID561
Feature activation-1.57e-03

Pos logit contributions
 work+0.004
 First+0.004
 On+0.004
 While+0.004
 Min+0.004
Neg logit contributions
ese-0.004
 mir-0.004
 crow-0.004
reetings-0.004
 hawk-0.004
Token take
Token ID1011
Feature activation-6.68e-03

Pos logit contributions
 First+0.002
 work+0.002
 On+0.002
 While+0.002
 Em+0.001
Neg logit contributions
ese-0.003
 mir-0.003
 crow-0.003
 hawk-0.003
reetings-0.003
Token out
Token ID503
Feature activation-1.13e-02

Pos logit contributions
 First+0.009
 On+0.008
 While+0.008
 work+0.008
 treadmill+0.008
Neg logit contributions
ese-0.012
 mir-0.011
ak-0.011
 crow-0.011
 brav-0.011
Token a
Token ID257
Feature activation-1.18e-02

Pos logit contributions
 First+0.014
 work+0.013
 While+0.012
 On+0.012
 treadmill+0.011
Neg logit contributions
ese-0.024
 mir-0.023
 crow-0.022
ak-0.021
 hawk-0.021
Token j
Token ID474
Feature activation+2.49e-02

Pos logit contributions
 First+0.019
 work+0.016
 While+0.015
 When+0.015
 Em+0.015
Neg logit contributions
 mir-0.020
 crow-0.019
 hawk-0.019
rot-0.019
ese-0.018
Tokenigsaw
Token ID45636
Feature activation-5.05e-02

Pos logit contributions
 mir+0.036
ese+0.034
ak+0.031
asis+0.030
rim+0.030
Neg logit contributions
 First-0.049
 work-0.046
 Em-0.044
 While-0.043
 Min-0.043
Token puzzle
Token ID15027
Feature activation-8.86e-03

Pos logit contributions
 First+0.076
 While+0.067
 Em+0.064
 When+0.063
 work+0.062
Neg logit contributions
rot-0.091
ese-0.089
ile-0.084
 eagle-0.084
ak-0.083
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
 First+0.012
 work+0.012
 While+0.011
 On+0.010
 Em+0.010
Neg logit contributions
ese-0.018
 mir-0.017
rot-0.016
 crow-0.016
ak-0.016
Token try
Token ID1949
Feature activation+8.20e-03

Pos logit contributions
 First+0.017
 work+0.015
 On+0.014
 While+0.014
 treadmill+0.014
Neg logit contributions
ese-0.032
 mir-0.031
 crow-0.030
 hawk-0.029
 eagle-0.029
Token to
Token ID284
Feature activation-1.78e-02

Pos logit contributions
ese+0.017
 mir+0.017
reetings+0.016
 crow+0.016
rot+0.016
Neg logit contributions
 First-0.011
 work-0.011
 While-0.009
 On-0.009
 Min-0.008
Token solve
Token ID8494
Feature activation-2.17e-02

Pos logit contributions
 First+0.022
 While+0.019
 On+0.019
 work+0.018
 treadmill+0.017
Neg logit contributions
ese-0.037
 yourself-0.034
 mir-0.034
 crow-0.034
ak-0.033
Token from
Token ID422
Feature activation-1.51e-02

Pos logit contributions
 First+0.021
 work+0.020
 On+0.017
 While+0.017
 treadmill+0.015
Neg logit contributions
ese-0.045
 mir-0.043
 hawk-0.041
 crow-0.040
rot-0.040
Token the
Token ID262
Feature activation+3.00e-03

Pos logit contributions
 First+0.018
 work+0.017
 On+0.015
 While+0.015
 treadmill+0.013
Neg logit contributions
ese-0.034
 mir-0.033
 hawk-0.032
 crow-0.032
 foe-0.030
Token rain
Token ID6290
Feature activation-1.21e-02

Pos logit contributions
ese+0.006
reetings+0.006
lor+0.006
rot+0.006
 mir+0.006
Neg logit contributions
 work-0.005
 First-0.004
 While-0.003
 shelves-0.003
 On-0.003
Token.
Token ID13
Feature activation-2.13e-02

Pos logit contributions
 First+0.013
 work+0.012
 On+0.010
 While+0.010
 Min+0.009
Neg logit contributions
ese-0.030
 mir-0.028
 hawk-0.027
 crow-0.027
rot-0.027
Token When
Token ID1649
Feature activation-1.93e-02

Pos logit contributions
 work+0.023
 First+0.021
 Logan+0.018
oun+0.018
 Min+0.017
Neg logit contributions
ese-0.040
Fear-0.039
 mir-0.039
Did-0.039
Was-0.039
Token morning
Token ID3329
Feature activation-5.02e-02

Pos logit contributions
 First+0.025
 While+0.021
 work+0.020
 treadmill+0.020
 On+0.020
Neg logit contributions
 mir-0.037
 hawk-0.036
 crow-0.035
ese-0.035
reetings-0.033
Token came
Token ID1625
Feature activation+4.54e-03

Pos logit contributions
 First+0.100
 treadmill+0.090
 weights+0.086
resses+0.086
 work+0.085
Neg logit contributions
 crow-0.057
 mir-0.057
 hawk-0.056
reetings-0.055
rot-0.053
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
ese+0.012
 mir+0.012
 crow+0.011
lor+0.011
rot+0.011
Neg logit contributions
 work-0.004
 First-0.004
 On-0.003
 While-0.003
 When-0.003
Token Sarah
Token ID10490
Feature activation-1.61e-02

Pos logit contributions
 First+0.013
 work+0.012
 treadmill+0.011
oun+0.011
igsaw+0.011
Neg logit contributions
Fear-0.021
 hawk-0.020
Did-0.020
Nobody-0.020
 crow-0.020
Token looked
Token ID3114
Feature activation-8.48e-03

Pos logit contributions
 work+0.023
 While+0.018
 First+0.018
 Min+0.018
 On+0.018
Neg logit contributions
reetings-0.034
ese-0.034
 mir-0.033
rot-0.031
 hawk-0.031
Token in
Token ID287
Feature activation-7.33e-03

Pos logit contributions
 work+0.007
 First+0.006
 On+0.005
 While+0.005
 When+0.004
Neg logit contributions
ese-0.023
 mir-0.022
reetings-0.021
rot-0.021
 hawk-0.021
Token time
Token ID640
Feature activation-2.81e-03

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 Min+0.001
Neg logit contributions
ese-0.001
 mir-0.001
reetings-0.001
rot-0.001
lor-0.001
Token until
Token ID1566
Feature activation-8.00e-03

Pos logit contributions
 work+0.003
 First+0.003
 On+0.002
 While+0.002
 Min+0.002
Neg logit contributions
ese-0.007
 mir-0.007
reetings-0.006
 hawk-0.006
rot-0.006
Token it
Token ID340
Feature activation-9.57e-03

Pos logit contributions
 First+0.010
 work+0.010
 On+0.009
 While+0.009
 treadmill+0.008
Neg logit contributions
ese-0.017
 mir-0.017
 hawk-0.016
 crow-0.016
reetings-0.015
Token was
Token ID373
Feature activation-1.99e-02

Pos logit contributions
 work+0.015
 First+0.014
 On+0.012
 While+0.012
 When+0.011
Neg logit contributions
ese-0.020
 mir-0.019
 hawk-0.018
reetings-0.018
rot-0.017
Token as
Token ID355
Feature activation-2.69e-03

Pos logit contributions
 First+0.028
 work+0.028
 On+0.024
 While+0.024
 Min+0.022
Neg logit contributions
ese-0.040
 mir-0.039
 hawk-0.037
reetings-0.037
 crow-0.036
Token tall
Token ID7331
Feature activation-5.01e-02

Pos logit contributions
 work+0.004
 First+0.004
 On+0.003
 While+0.003
 Min+0.003
Neg logit contributions
ese-0.006
 mir-0.005
rot-0.005
reetings-0.005
 hawk-0.005
Token as
Token ID355
Feature activation-8.11e-03

Pos logit contributions
 First+0.046
 On+0.036
 While+0.036
 work+0.036
 Min+0.031
Neg logit contributions
ese-0.126
 mir-0.123
 crow-0.122
 hawk-0.121
 eagle-0.118
Token they
Token ID484
Feature activation-1.21e-02

Pos logit contributions
 work+0.008
 First+0.008
 On+0.006
 While+0.006
 Min+0.006
Neg logit contributions
ese-0.021
 mir-0.020
reetings-0.019
rot-0.019
 hawk-0.019
Token were
Token ID547
Feature activation-1.16e-02

Pos logit contributions
 work+0.022
 First+0.020
 On+0.019
 While+0.018
 When+0.017
Neg logit contributions
ese-0.022
 mir-0.021
reetings-0.020
 hawk-0.019
rot-0.019
Token!
Token ID0
Feature activation-4.75e-03

Pos logit contributions
 work+0.015
 First+0.014
 On+0.012
 While+0.011
 Min+0.010
Neg logit contributions
ese-0.026
 mir-0.026
 hawk-0.024
rot-0.024
reetings-0.024
Token They
Token ID1119
Feature activation-5.51e-03

Pos logit contributions
 work+0.006
 First+0.004
 treadmill+0.004
 pull+0.004
 While+0.003
Neg logit contributions
ese-0.011
 mir-0.010
reetings-0.010
rot-0.010
 hawk-0.010
Token and
Token ID290
Feature activation-9.10e-03

Pos logit contributions
 work+0.019
 First+0.019
 On+0.015
 While+0.015
 treadmill+0.013
Neg logit contributions
ese-0.041
 mir-0.040
 crow-0.038
reetings-0.038
 hawk-0.038
Token watched
Token ID7342
Feature activation-1.38e-02

Pos logit contributions
 work+0.016
 First+0.016
 On+0.014
 While+0.014
 treadmill+0.013
Neg logit contributions
ese-0.016
 mir-0.015
 crow-0.014
 hawk-0.014
reetings-0.014
Token as
Token ID355
Feature activation-6.66e-03

Pos logit contributions
 First+0.021
 treadmill+0.019
 While+0.019
igsaw+0.018
 On+0.017
Neg logit contributions
ese-0.024
 mir-0.023
 crow-0.023
 hawk-0.022
ull-0.021
Token the
Token ID262
Feature activation-2.76e-02

Pos logit contributions
 First+0.009
 While+0.008
 On+0.007
 work+0.007
 treadmill+0.007
Neg logit contributions
 mir-0.014
 crow-0.013
 hawk-0.013
ese-0.013
 foe-0.012
Token sun
Token ID4252
Feature activation-2.21e-02

Pos logit contributions
 work+0.044
 First+0.044
 On+0.039
 While+0.038
 Min+0.036
Neg logit contributions
ese-0.052
 mir-0.050
 hawk-0.047
reetings-0.046
rot-0.046
Token lit
Token ID6578
Feature activation-4.99e-02

Pos logit contributions
 work+0.032
 First+0.032
 On+0.027
 While+0.027
 Min+0.025
Neg logit contributions
ese-0.045
 mir-0.043
 hawk-0.041
rot-0.041
reetings-0.041
Token up
Token ID510
Feature activation-8.97e-03

Pos logit contributions
 First+0.070
 treadmill+0.060
igsaw+0.059
 work+0.059
 While+0.055
Neg logit contributions
ese-0.099
 mir-0.093
 hawk-0.091
rot-0.089
 crow-0.089
Token the
Token ID262
Feature activation-3.87e-03

Pos logit contributions
 First+0.009
 work+0.009
 While+0.007
 On+0.007
 treadmill+0.007
Neg logit contributions
ese-0.022
 mir-0.021
 hawk-0.020
rot-0.020
 crow-0.020
Token sky
Token ID6766
Feature activation-1.58e-02

Pos logit contributions
 work+0.008
 First+0.007
 On+0.007
 While+0.006
 Min+0.006
Neg logit contributions
ese-0.006
 mir-0.006
reetings-0.005
rot-0.005
 hawk-0.005
Token.
Token ID13
Feature activation-8.22e-03

Pos logit contributions
 First+0.014
 work+0.014
 While+0.011
 On+0.011
 treadmill+0.010
Neg logit contributions
ese-0.040
 mir-0.039
rot-0.037
 crow-0.037
reetings-0.037
Token The
Token ID383
Feature activation-1.57e-02

Pos logit contributions
 work+0.009
 First+0.007
 treadmill+0.006
igsaw+0.006
 pull+0.006
Neg logit contributions
 mir-0.018
ese-0.018
 crow-0.017
 foe-0.017
 hawk-0.017
Token a
Token ID257
Feature activation+2.19e-04

Pos logit contributions
 First+0.024
 While+0.022
 On+0.022
 work+0.021
igsaw+0.020
Neg logit contributions
ese-0.027
 crow-0.024
 mir-0.024
 yourself-0.023
rot-0.023
Token stack
Token ID8931
Feature activation-1.68e-02

Pos logit contributions
ese+0.000
 mir+0.000
 crow+0.000
 hawk+0.000
rot+0.000
Neg logit contributions
 First-0.000
 work-0.000
 On-0.000
 While-0.000
 Min-0.000
Token of
Token ID286
Feature activation-3.98e-03

Pos logit contributions
 First+0.030
 work+0.029
 While+0.028
 On+0.027
 Min+0.026
Neg logit contributions
ese-0.027
 mir-0.025
rot-0.023
 crow-0.023
ile-0.023
Token blocks
Token ID7021
Feature activation-1.54e-02

Pos logit contributions
 First+0.006
 On+0.005
 While+0.005
 work+0.005
 Min+0.005
Neg logit contributions
ese-0.007
 mir-0.007
 crow-0.007
ak-0.007
 yourself-0.007
Token as
Token ID355
Feature activation-5.40e-03

Pos logit contributions
 First+0.023
 While+0.021
 work+0.019
 On+0.019
 Min+0.018
Neg logit contributions
ese-0.030
 mir-0.028
rot-0.027
 crow-0.027
ak-0.027
Token tall
Token ID7331
Feature activation-4.99e-02

Pos logit contributions
 First+0.007
 work+0.006
 While+0.006
 On+0.006
 treadmill+0.005
Neg logit contributions
 mir-0.011
ese-0.011
 crow-0.011
rot-0.011
 hawk-0.011
Token as
Token ID355
Feature activation-8.07e-03

Pos logit contributions
 First+0.066
 On+0.061
 While+0.059
igsaw+0.056
 Min+0.056
Neg logit contributions
ese-0.099
 crow-0.092
 mir-0.088
 eagle-0.085
 ge-0.085
Token her
Token ID607
Feature activation-1.06e-02

Pos logit contributions
 work+0.009
 First+0.009
 While+0.008
 On+0.007
 treadmill+0.006
Neg logit contributions
ese-0.019
 mir-0.019
 crow-0.018
 hawk-0.017
rot-0.017
Token.
Token ID13
Feature activation+1.62e-03

Pos logit contributions
 First+0.014
 work+0.013
 While+0.013
 On+0.012
 Min+0.012
Neg logit contributions
ese-0.022
 mir-0.020
 crow-0.019
rot-0.019
ich-0.019
Token She
Token ID1375
Feature activation-1.98e-03

Pos logit contributions
 mir+0.003
reetings+0.002
Did+0.002
ese+0.002
 foe+0.002
Neg logit contributions
 work-0.003
 drive-0.002
 pull-0.002
 treadmill-0.002
 tug-0.002
Token was
Token ID373
Feature activation+4.89e-03

Pos logit contributions
 work+0.003
 First+0.003
 On+0.003
 While+0.003
 Min+0.003
Neg logit contributions
ese-0.004
 mir-0.003
reetings-0.003
 hawk-0.003
rot-0.003
Token this
Token ID428
Feature activation-9.83e-03

Pos logit contributions
ese+0.031
rot+0.029
 mir+0.027
ile+0.027
reetings+0.027
Neg logit contributions
 work-0.022
 First-0.019
 as-0.017
 Grand-0.017
 Tim-0.017
Token crib
Token ID48083
Feature activation+5.45e-03

Pos logit contributions
 work+0.017
 First+0.016
 On+0.015
 While+0.014
 Min+0.013
Neg logit contributions
ese-0.019
 mir-0.017
rot-0.016
reetings-0.016
 crow-0.016
Token is
Token ID318
Feature activation-1.69e-02

Pos logit contributions
ese+0.009
 mir+0.009
 hawk+0.008
 crow+0.008
reetings+0.008
Neg logit contributions
 work-0.011
 First-0.010
 On-0.008
 While-0.008
 pull-0.008
Token just
Token ID655
Feature activation+3.90e-03

Pos logit contributions
 work+0.026
 First+0.022
 On+0.019
 as+0.019
 While+0.018
Neg logit contributions
ese-0.038
 mir-0.034
rot-0.034
lor-0.033
 crow-0.033
Token as
Token ID355
Feature activation-5.09e-03

Pos logit contributions
ese+0.010
 mir+0.009
rot+0.009
 hawk+0.009
reetings+0.009
Neg logit contributions
 work-0.004
 First-0.004
 On-0.003
 While-0.003
 When-0.003
Token wide
Token ID3094
Feature activation-4.98e-02

Pos logit contributions
 work+0.007
 First+0.006
 On+0.005
 While+0.005
 Min+0.005
Neg logit contributions
ese-0.012
 mir-0.011
rot-0.011
lor-0.011
 hawk-0.011
Token as
Token ID355
Feature activation-1.27e-02

Pos logit contributions
 First+0.042
 work+0.037
 On+0.035
 While+0.035
 treadmill+0.031
Neg logit contributions
ese-0.125
 mir-0.120
 hawk-0.116
reetings-0.115
 crow-0.114
Token my
Token ID616
Feature activation-1.82e-02

Pos logit contributions
 work+0.018
 First+0.017
 On+0.014
 While+0.014
 Min+0.013
Neg logit contributions
ese-0.028
 mir-0.026
rot-0.026
reetings-0.025
 hawk-0.025
Token old
Token ID1468
Feature activation-2.23e-02

Pos logit contributions
 work+0.026
 First+0.025
 On+0.022
 While+0.022
 Min+0.020
Neg logit contributions
ese-0.038
 mir-0.036
 hawk-0.034
reetings-0.034
rot-0.034
Token one
Token ID530
Feature activation+2.82e-03

Pos logit contributions
 work+0.030
 First+0.028
 On+0.024
 While+0.023
 Min+0.021
Neg logit contributions
ese-0.050
 mir-0.048
reetings-0.046
rot-0.045
 crow-0.045
Token,"
Token ID553
Feature activation-1.35e-02

Pos logit contributions
ese+0.007
 mir+0.007
 hawk+0.007
rot+0.007
reetings+0.007
Neg logit contributions
 work-0.003
 First-0.003
 On-0.002
 While-0.002
 Min-0.002
Token here
Token ID994
Feature activation-3.06e-03

Pos logit contributions
 mir+0.020
ese+0.020
reetings+0.020
rot+0.019
 crow+0.019
Neg logit contributions
 work-0.017
 First-0.014
 On-0.013
 as-0.013
 While-0.012
Token."
Token ID526
Feature activation-1.32e-02

Pos logit contributions
 work+0.004
 First+0.003
 as+0.003
 On+0.003
 While+0.003
Neg logit contributions
ese-0.008
 mir-0.007
 hawk-0.007
 crow-0.007
rot-0.007
Token↵
Token ID198
Feature activation-2.00e-03

Pos logit contributions
 work+0.019
 First+0.017
igsaw+0.015
 treadmill+0.015
 On+0.015
Neg logit contributions
ese-0.027
 mir-0.025
 crow-0.024
 hawk-0.023
reetings-0.023
Token↵
Token ID198
Feature activation-7.99e-03

Pos logit contributions
 work+0.003
 First+0.003
 While+0.002
 treadmill+0.002
 Min+0.002
Neg logit contributions
ese-0.004
 mir-0.004
rot-0.003
reetings-0.003
 crow-0.003
TokenAs
Token ID1722
Feature activation-4.38e-03

Pos logit contributions
 First+0.013
 work+0.013
 While+0.012
 On+0.011
 When+0.010
Neg logit contributions
ese-0.015
 mir-0.015
reetings-0.014
 hawk-0.014
rot-0.014
Token Farmer
Token ID37755
Feature activation-4.93e-02

Pos logit contributions
 work+0.004
 First+0.004
 On+0.003
 While+0.003
 Min+0.003
Neg logit contributions
ese-0.011
 mir-0.011
 hawk-0.010
reetings-0.010
rot-0.010
Token Brown
Token ID4373
Feature activation-2.08e-02

Pos logit contributions
 work+0.057
 First+0.054
 On+0.045
 While+0.044
 Min+0.040
Neg logit contributions
ese-0.116
 mir-0.113
reetings-0.107
 hawk-0.106
rot-0.106
Token left
Token ID1364
Feature activation+5.29e-04

Pos logit contributions
 work+0.033
 First+0.030
 On+0.027
 While+0.027
 Min+0.025
Neg logit contributions
 mir-0.041
ese-0.041
reetings-0.038
 hawk-0.038
rot-0.038
Token,
Token ID11
Feature activation+2.15e-03

Pos logit contributions
ese+0.001
 mir+0.001
reetings+0.001
rot+0.001
 hawk+0.001
Neg logit contributions
 work-0.001
 First-0.001
 On-0.000
 While-0.000
 Min-0.000
Token Jack
Token ID3619
Feature activation-1.05e-02

Pos logit contributions
ese+0.005
 mir+0.005
 hawk+0.005
 crow+0.005
reetings+0.005
Neg logit contributions
 work-0.002
 First-0.002
 On-0.002
 While-0.002
 treadmill-0.002
Token looked
Token ID3114
Feature activation-5.06e-03

Pos logit contributions
 work+0.016
 First+0.014
 On+0.014
 While+0.013
 Min+0.013
Neg logit contributions
ese-0.021
reetings-0.020
 mir-0.020
rot-0.019
 hawk-0.019
Token in
Token ID287
Feature activation+8.57e-03

Pos logit contributions
 First+0.012
 work+0.011
 While+0.010
 On+0.010
 treadmill+0.009
Neg logit contributions
ese-0.021
 mir-0.020
 hawk-0.019
 crow-0.019
rot-0.019
Token the
Token ID262
Feature activation+5.70e-03

Pos logit contributions
ese+0.024
 mir+0.023
reetings+0.022
rot+0.022
 crow+0.021
Neg logit contributions
 work-0.007
 First-0.006
 On-0.005
 While-0.005
 Min-0.004
Token evening
Token ID6180
Feature activation-1.01e-02

Pos logit contributions
reetings+0.011
ese+0.011
rot+0.011
lor+0.011
 mir+0.011
Neg logit contributions
 work-0.009
 First-0.008
 On-0.007
 While-0.007
 When-0.007
Token.
Token ID13
Feature activation-4.68e-03

Pos logit contributions
 work+0.013
 First+0.011
 On+0.011
 While+0.010
 When+0.009
Neg logit contributions
ese-0.024
 mir-0.022
reetings-0.022
 hawk-0.021
rot-0.021
Token As
Token ID1081
Feature activation-2.21e-04

Pos logit contributions
 work+0.005
 First+0.004
 On+0.003
 While+0.003
 treadmill+0.003
Neg logit contributions
ese-0.012
 mir-0.011
reetings-0.011
 hawk-0.011
rot-0.011
Token night
Token ID1755
Feature activation-4.93e-02

Pos logit contributions
 work+0.000
 First+0.000
 On+0.000
 While+0.000
 Min+0.000
Neg logit contributions
ese-0.001
 mir-0.001
rot-0.001
reetings-0.001
lor-0.001
Token time
Token ID640
Feature activation-3.38e-02

Pos logit contributions
 First+0.089
 work+0.081
 treadmill+0.077
 On+0.074
igsaw+0.073
Neg logit contributions
 mir-0.077
 hawk-0.075
ese-0.073
 crow-0.073
rot-0.069
Token came
Token ID1625
Feature activation+4.11e-03

Pos logit contributions
 First+0.056
 work+0.056
 While+0.049
 On+0.049
 treadmill+0.047
Neg logit contributions
ese-0.058
 mir-0.057
 hawk-0.054
reetings-0.053
rot-0.053
Token,
Token ID11
Feature activation+7.28e-03

Pos logit contributions
ese+0.012
rim+0.011
 mir+0.011
ile+0.011
lor+0.011
Neg logit contributions
 work-0.003
 First-0.003
 as-0.002
 On-0.002
 While-0.002
Token the
Token ID262
Feature activation-1.03e-02

Pos logit contributions
ese+0.019
 mir+0.019
reetings+0.018
rot+0.018
 hawk+0.018
Neg logit contributions
 work-0.007
 First-0.006
 On-0.005
 While-0.005
 Min-0.004
Token sun
Token ID4252
Feature activation-3.72e-04

Pos logit contributions
 work+0.018
 First+0.015
 On+0.013
 While+0.013
 When+0.012
Neg logit contributions
lor-0.020
reetings-0.020
 mir-0.020
ese-0.020
rot-0.019
Token was
Token ID373
Feature activation-9.22e-03

Pos logit contributions
 work+0.010
 First+0.010
 On+0.009
 While+0.009
 Min+0.008
Neg logit contributions
ese-0.013
 mir-0.012
 hawk-0.011
reetings-0.011
rot-0.011
Token fast
Token ID3049
Feature activation-8.68e-03

Pos logit contributions
 work+0.013
 First+0.012
 On+0.011
 While+0.011
 Min+0.010
Neg logit contributions
ese-0.020
 mir-0.019
 hawk-0.018
rot-0.018
reetings-0.018
Token asleep
Token ID16039
Feature activation-2.64e-02

Pos logit contributions
 First+0.015
 work+0.013
 While+0.013
 On+0.013
 When+0.012
Neg logit contributions
ese-0.015
 mir-0.015
rot-0.014
 hawk-0.014
 crow-0.014
Token.
Token ID13
Feature activation-8.33e-03

Pos logit contributions
 First+0.025
 work+0.025
 On+0.019
 While+0.019
 treadmill+0.016
Neg logit contributions
ese-0.066
 mir-0.065
rot-0.061
 hawk-0.061
reetings-0.061
Token As
Token ID1081
Feature activation-4.51e-03

Pos logit contributions
 work+0.014
 treadmill+0.011
 First+0.011
 pull+0.010
 experiments+0.010
Neg logit contributions
Fear-0.014
reetings-0.014
lor-0.013
 mir-0.013
ese-0.013
Token morning
Token ID3329
Feature activation-4.90e-02

Pos logit contributions
 First+0.006
 work+0.006
 On+0.005
 While+0.005
 treadmill+0.004
Neg logit contributions
 mir-0.010
ese-0.009
 hawk-0.009
 crow-0.009
reetings-0.009
Token came
Token ID1625
Feature activation-2.43e-03

Pos logit contributions
 First+0.095
 treadmill+0.087
 work+0.085
igsaw+0.083
 experiments+0.079
Neg logit contributions
 mir-0.064
 crow-0.061
 hawk-0.059
lor-0.056
rot-0.056
Token,
Token ID11
Feature activation+1.03e-04

Pos logit contributions
 work+0.003
 First+0.003
 On+0.002
 While+0.002
 When+0.002
Neg logit contributions
ese-0.006
 mir-0.006
 hawk-0.005
 crow-0.005
rot-0.005
Token the
Token ID262
Feature activation-2.63e-02

Pos logit contributions
 mir+0.000
 hawk+0.000
ese+0.000
 crow+0.000
rot+0.000
Neg logit contributions
 First-0.000
 work-0.000
 On-0.000
 While-0.000
 treadmill-0.000
Token little
Token ID1310
Feature activation-3.14e-02

Pos logit contributions
 First+0.040
 work+0.039
 On+0.034
 While+0.033
 treadmill+0.031
Neg logit contributions
ese-0.051
 mir-0.048
rot-0.046
 crow-0.046
 hawk-0.045
Token girl
Token ID2576
Feature activation-5.92e-03

Pos logit contributions
 First+0.059
 work+0.058
 On+0.053
 While+0.052
 treadmill+0.050
Neg logit contributions
ese-0.048
 mir-0.046
 hawk-0.044
rot-0.043
 crow-0.043
Token
Token ID198
Feature activation+6.22e-03

Pos logit contributions
ese+0.017
 mir+0.016
rot+0.015
 hawk+0.015
 crow+0.015
Neg logit contributions
 First-0.008
 work-0.008
 On-0.007
 While-0.007
 When-0.006
TokenThe
Token ID464
Feature activation-2.03e-03

Pos logit contributions
ese+0.012
 mir+0.011
 hawk+0.010
rot+0.010
 foe+0.010
Neg logit contributions
 First-0.010
 work-0.010
 While-0.009
 On-0.009
 When-0.009
Token crab
Token ID32202
Feature activation+1.52e-02

Pos logit contributions
 work+0.004
 First+0.003
 Min+0.003
 pull+0.003
 While+0.003
Neg logit contributions
ese-0.004
reetings-0.004
lor-0.003
 mir-0.003
rot-0.003
Token asked
Token ID1965
Feature activation+4.04e-03

Pos logit contributions
reetings+0.034
 mir+0.033
ese+0.033
 yourself+0.031
lor+0.031
Neg logit contributions
 work-0.020
 as-0.015
 First-0.015
 On-0.014
 While-0.014
Token the
Token ID262
Feature activation+7.98e-03

Pos logit contributions
ese+0.010
 mir+0.010
reetings+0.009
 hawk+0.009
rot+0.009
Neg logit contributions
 work-0.004
 First-0.004
 On-0.003
 While-0.003
 Min-0.003
Token chicken
Token ID9015
Feature activation+3.96e-02

Pos logit contributions
reetings+0.014
ese+0.014
lor+0.014
ivering+0.013
 mir+0.013
Neg logit contributions
 work-0.015
 First-0.012
 pull-0.010
 Min-0.010
 shelf-0.010
Token,
Token ID11
Feature activation+5.89e-03

Pos logit contributions
ese+0.109
 mir+0.105
ivering+0.104
 hawk+0.102
ich+0.100
Neg logit contributions
 work-0.034
 First-0.030
 as-0.022
 On-0.022
 When-0.021
Token �
Token ID6184
Feature activation+1.79e-02

Pos logit contributions
ese+0.017
 mir+0.016
rot+0.015
asis+0.015
 hawk+0.015
Neg logit contributions
 work-0.005
 First-0.005
 On-0.004
 When-0.004
 While-0.004
Token�
Token ID95
Feature activation+1.17e-02

Pos logit contributions
ese+0.030
 mir+0.028
reetings+0.026
 hawk+0.026
rot+0.026
Neg logit contributions
 work-0.033
 First-0.032
 On-0.029
 While-0.028
 Min-0.027
Token€
Token ID26391
Feature activation+1.51e-02

Pos logit contributions
 mir+0.020
ese+0.019
 hawk+0.019
reetings+0.018
 foe+0.018
Neg logit contributions
 work-0.020
 First-0.020
 On-0.018
 While-0.017
 When-0.017
Token˜
Token ID41185
Feature activation+9.44e-03

Pos logit contributions
 mir+0.028
reetings+0.026
 hawk+0.026
ile+0.026
ese+0.025
Neg logit contributions
 work-0.023
 First-0.022
 On-0.021
 While-0.020
 When-0.019
Token ear
Token ID1027
Feature activation+5.69e-03

Pos logit contributions
ese+0.005
 mir+0.005
reetings+0.005
rot+0.005
 hawk+0.005
Neg logit contributions
 work-0.005
 First-0.005
 On-0.004
 While-0.004
 Min-0.004
Token.
Token ID13
Feature activation+9.95e-03

Pos logit contributions
ese+0.015
 mir+0.015
reetings+0.015
 hawk+0.015
rot+0.014
Neg logit contributions
 work-0.005
 First-0.004
 On-0.003
 While-0.003
 pull-0.003
Token It
Token ID632
Feature activation+8.31e-03

Pos logit contributions
ese+0.021
 mir+0.021
 crow+0.020
reetings+0.020
 hawk+0.020
Neg logit contributions
 work-0.013
 First-0.013
 While-0.010
 On-0.010
 Min-0.010
Token is
Token ID318
Feature activation-4.36e-03

Pos logit contributions
ese+0.017
 mir+0.015
reetings+0.015
ich+0.015
rot+0.014
Neg logit contributions
 work-0.014
 First-0.012
 On-0.012
 pull-0.011
 While-0.011
Token a
Token ID257
Feature activation-3.46e-03

Pos logit contributions
 work+0.004
 First+0.003
 On+0.002
 Max+0.002
 Min+0.002
Neg logit contributions
ese-0.012
reetings-0.011
lor-0.011
rot-0.011
 mir-0.011
Token fle
Token ID5104
Feature activation+3.53e-02

Pos logit contributions
 work+0.005
 First+0.004
 pull+0.004
 shelf+0.004
 TV+0.004
Neg logit contributions
ese-0.008
reetings-0.007
lor-0.007
 mir-0.007
rim-0.006
Tokena
Token ID64
Feature activation+2.95e-02

Pos logit contributions
ese+0.054
 mir+0.053
reetings+0.048
 hawk+0.047
Fear+0.046
Neg logit contributions
 work-0.063
 First-0.058
 On-0.055
 While-0.054
 pull-0.053
Token.
Token ID13
Feature activation+8.67e-03

Pos logit contributions
 mir+0.084
reetings+0.084
ese+0.084
 hawk+0.079
Fear+0.079
Neg logit contributions
 work-0.018
 as-0.009
 pull-0.008
.-0.007
 First-0.007
Token A
Token ID317
Feature activation+3.44e-03

Pos logit contributions
ese+0.022
 mir+0.021
 hawk+0.020
reetings+0.020
rot+0.020
Neg logit contributions
 First-0.009
 work-0.008
 On-0.007
 While-0.007
 When-0.006
Token fle
Token ID5104
Feature activation+3.82e-02

Pos logit contributions
ese+0.007
reetings+0.007
lor+0.006
 mir+0.006
ile+0.006
Neg logit contributions
 work-0.005
 First-0.004
 Min-0.004
 pull-0.004
 While-0.004
Tokena
Token ID64
Feature activation+2.71e-02

Pos logit contributions
ese+0.068
 mir+0.065
 eagle+0.059
 hawk+0.058
Fear+0.057
Neg logit contributions
 work-0.056
 First-0.050
 On-0.048
 Liz-0.046
 pull-0.046
Token
Token ID198
Feature activation+7.08e-03

Pos logit contributions
ese+0.020
 mir+0.019
reetings+0.018
rot+0.018
 hawk+0.018
Neg logit contributions
 work-0.012
 First-0.012
 While-0.010
 On-0.010
 treadmill-0.009
Token"
Token ID1
Feature activation+3.47e-03

Pos logit contributions
ese+0.012
 mir+0.011
 hawk+0.010
reetings+0.010
 crow+0.010
Neg logit contributions
 First-0.014
 work-0.013
 On-0.012
 While-0.012
 When-0.011
TokenThat
Token ID2504
Feature activation-1.89e-03

Pos logit contributions
ese+0.006
 mir+0.005
reetings+0.005
lor+0.005
rot+0.005
Neg logit contributions
 First-0.007
 work-0.006
 On-0.006
 While-0.006
 Min-0.006
Token is
Token ID318
Feature activation-1.24e-02

Pos logit contributions
 work+0.003
 First+0.003
 On+0.003
 While+0.003
 Min+0.003
Neg logit contributions
ese-0.003
 mir-0.003
reetings-0.003
ivering-0.003
rim-0.003
Token a
Token ID257
Feature activation-9.85e-03

Pos logit contributions
 work+0.011
 First+0.010
 On+0.008
 While+0.008
 Min+0.007
Neg logit contributions
ese-0.033
 mir-0.032
reetings-0.031
rot-0.030
 hawk-0.030
Token re
Token ID302
Feature activation+4.59e-02

Pos logit contributions
 work+0.016
 First+0.013
 On+0.012
 While+0.012
 shelf+0.012
Neg logit contributions
ese-0.020
reetings-0.018
 mir-0.017
ivering-0.016
ile-0.016
Tokenind
Token ID521
Feature activation+1.87e-02

Pos logit contributions
 mir+0.056
ese+0.056
reetings+0.052
 hawk+0.049
 foe+0.048
Neg logit contributions
 work-0.100
 First-0.088
 On-0.083
 Min-0.083
 While-0.082
Tokeneer
Token ID28153
Feature activation+2.06e-02

Pos logit contributions
ese+0.018
 mir+0.016
 crow+0.015
ile+0.015
rot+0.014
Neg logit contributions
 First-0.049
 work-0.046
 On-0.045
 While-0.045
 treadmill-0.043
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
ese+0.055
 mir+0.054
reetings+0.052
 hawk+0.051
rot+0.051
Neg logit contributions
 work-0.017
 First-0.015
 On-0.011
 While-0.010
 Min-0.010
Token Tim
Token ID5045
Feature activation+5.09e-03

Pos logit contributions
 work+0.009
 First+0.008
 Tim+0.008
 Max+0.007
 Grand+0.007
Neg logit contributions
reetings-0.042
ese-0.041
lor-0.040
rot-0.040
 mir-0.039
Token.
Token ID13
Feature activation+9.44e-03

Pos logit contributions
ese+0.013
 mir+0.012
rot+0.012
reetings+0.012
 crow+0.012
Neg logit contributions
 First-0.005
 work-0.005
 On-0.004
 While-0.004
 treadmill-0.003
Token on
Token ID319
Feature activation-3.32e-03

Pos logit contributions
ese+0.007
 mir+0.007
 hawk+0.006
 crow+0.006
reetings+0.006
Neg logit contributions
 work-0.005
 First-0.005
 On-0.005
 While-0.005
 Min-0.004
Token the
Token ID262
Feature activation+9.09e-03

Pos logit contributions
 First+0.004
 work+0.004
 While+0.004
 On+0.004
 treadmill+0.004
Neg logit contributions
ese-0.006
 mir-0.006
 crow-0.006
 hawk-0.006
ull-0.006
Token door
Token ID3420
Feature activation-2.53e-03

Pos logit contributions
ese+0.015
 mir+0.015
 hawk+0.014
 crow+0.014
rot+0.013
Neg logit contributions
 First-0.016
 work-0.016
 While-0.014
 On-0.014
 Min-0.013
Token.
Token ID13
Feature activation-7.06e-03

Pos logit contributions
 First+0.003
 work+0.003
 While+0.002
 Min+0.002
igsaw+0.002
Neg logit contributions
 mir-0.006
ese-0.005
 crow-0.005
reetings-0.005
rot-0.005
Token He
Token ID679
Feature activation-4.52e-04

Pos logit contributions
 work+0.010
 First+0.010
 Min+0.009
igsaw+0.008
 Em+0.008
Neg logit contributions
 mir-0.013
 crow-0.012
ese-0.011
 hawk-0.011
 foe-0.011
Token says
Token ID1139
Feature activation+3.63e-02

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 Min+0.001
Neg logit contributions
ese-0.001
 mir-0.001
reetings-0.001
rot-0.001
 hawk-0.001
Token:
Token ID25
Feature activation+5.79e-03

Pos logit contributions
ese+0.096
ivering+0.088
 mir+0.085
 hawk+0.085
ode+0.084
Neg logit contributions
 work-0.035
 First-0.029
 On-0.026
 When-0.025
 Lily-0.024
Token "
Token ID366
Feature activation-2.20e-03

Pos logit contributions
ese+0.015
 mir+0.014
 hawk+0.013
rot+0.013
ile+0.013
Neg logit contributions
 work-0.006
 First-0.006
 On-0.005
 While-0.005
 When-0.004
TokenOpen
Token ID11505
Feature activation+4.09e-03

Pos logit contributions
 work+0.004
 First+0.003
 On+0.003
 While+0.003
 pull+0.003
Neg logit contributions
 mir-0.004
ese-0.004
 foe-0.004
rot-0.004
 hawk-0.004
Token the
Token ID262
Feature activation+1.30e-02

Pos logit contributions
ese+0.007
 mir+0.006
reetings+0.006
 hawk+0.006
rot+0.006
Neg logit contributions
 work-0.008
 First-0.007
 On-0.007
 While-0.007
 Min-0.006
Token door
Token ID3420
Feature activation-2.84e-03

Pos logit contributions
ese+0.027
 mir+0.026
reetings+0.025
rot+0.024
 hawk+0.024
Neg logit contributions
 work-0.019
 First-0.018
 On-0.016
 While-0.015
 treadmill-0.014
Token say
Token ID910
Feature activation+3.36e-02

Pos logit contributions
ese+0.007
 mir+0.007
 hawk+0.006
 crow+0.006
rot+0.006
Neg logit contributions
 First-0.005
 work-0.004
 While-0.004
 On-0.004
 treadmill-0.004
Token she
Token ID673
Feature activation+1.73e-02

Pos logit contributions
ese+0.083
 mir+0.078
reetings+0.074
 hawk+0.073
ivering+0.073
Neg logit contributions
 work-0.038
 First-0.036
 On-0.029
 While-0.027
 Min-0.027
Token is
Token ID318
Feature activation+7.70e-03

Pos logit contributions
ese+0.028
 mir+0.028
 hawk+0.026
reetings+0.025
 crow+0.025
Neg logit contributions
 work-0.034
 First-0.031
 On-0.028
 While-0.028
 Min-0.027
Token a
Token ID257
Feature activation-5.72e-03

Pos logit contributions
ese+0.020
 mir+0.019
reetings+0.018
 hawk+0.018
rot+0.018
Neg logit contributions
 work-0.008
 First-0.006
 On-0.005
 Min-0.005
 While-0.005
Token perfect
Token ID2818
Feature activation+9.69e-03

Pos logit contributions
 work+0.008
 First+0.006
 Min+0.006
 drive+0.005
 While+0.005
Neg logit contributions
ese-0.013
reetings-0.012
lor-0.012
ivering-0.011
rim-0.011
Token chicken
Token ID9015
Feature activation+3.53e-02

Pos logit contributions
ese+0.017
reetings+0.016
 mir+0.015
lor+0.015
 hawk+0.014
Neg logit contributions
 work-0.019
 First-0.016
 On-0.014
 While-0.014
 Min-0.014
Token.
Token ID13
Feature activation+7.36e-03

Pos logit contributions
reetings+0.092
 mir+0.091
ese+0.091
Fear+0.088
 hawk+0.086
Neg logit contributions
 work-0.029
 First-0.022
 as-0.019
 Min-0.016
 Max-0.016
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
ese+0.018
 mir+0.017
 hawk+0.016
rot+0.016
reetings+0.016
Neg logit contributions
 work-0.008
 First-0.007
 On-0.006
 While-0.006
 treadmill-0.005
Token
Token ID198
Feature activation+8.63e-03

Pos logit contributions
 mir+0.021
ese+0.021
 crow+0.020
rot+0.019
reetings+0.019
Neg logit contributions
 work-0.021
 First-0.021
 treadmill-0.017
 While-0.017
igsaw-0.017
TokenOne
Token ID3198
Feature activation+2.28e-02

Pos logit contributions
ese+0.016
 mir+0.015
reetings+0.014
 hawk+0.014
 crow+0.014
Neg logit contributions
 First-0.014
 work-0.014
 On-0.013
 While-0.013
 When-0.012
Token day
Token ID1110
Feature activation-1.38e-02

Pos logit contributions
ese+0.055
 mir+0.053
reetings+0.051
 hawk+0.050
lor+0.050
Neg logit contributions
 work-0.026
 First-0.023
 On-0.021
 While-0.019
 When-0.017
Token with
Token ID351
Feature activation-5.25e-03

Pos logit contributions
 work+0.004
 First+0.004
 On+0.003
 While+0.003
 as+0.003
Neg logit contributions
 mir-0.009
ese-0.009
reetings-0.009
 hawk-0.009
 crow-0.009
Token dolls
Token ID36062
Feature activation+1.47e-02

Pos logit contributions
 work+0.009
 First+0.008
 Max+0.006
 Min+0.006
 On+0.006
Neg logit contributions
ese-0.010
 mir-0.010
reetings-0.010
rot-0.010
lor-0.009
Token is
Token ID318
Feature activation-2.55e-04

Pos logit contributions
 mir+0.040
ese+0.040
 hawk+0.039
reetings+0.038
 crow+0.038
Neg logit contributions
 work-0.013
 First-0.008
 as-0.008
.-0.006
 On-0.006
Token silly
Token ID14397
Feature activation+3.90e-03

Pos logit contributions
 work+0.000
 as+0.000
 Max+0.000
 Lily+0.000
 First+0.000
Neg logit contributions
ese-0.001
lor-0.001
 mir-0.000
rot-0.000
reetings-0.000
Token.
Token ID13
Feature activation+4.56e-03

Pos logit contributions
 mir+0.010
ese+0.010
 hawk+0.009
reetings+0.009
lor+0.009
Neg logit contributions
 work-0.004
 First-0.004
 as-0.003
 On-0.003
 cartoons-0.003
Token It
Token ID632
Feature activation+1.17e-02

Pos logit contributions
ese+0.010
 mir+0.010
 hawk+0.009
 crow+0.009
rot+0.009
Neg logit contributions
 First-0.006
 work-0.006
 On-0.005
 While-0.005
 When-0.005
Token's
Token ID338
Feature activation+4.41e-03

Pos logit contributions
ese+0.024
 mir+0.024
reetings+0.023
 pearl+0.023
lor+0.023
Neg logit contributions
 work-0.019
 First-0.014
 pull-0.014
 On-0.013
 Min-0.013
Token fun
Token ID1257
Feature activation+1.30e-02

Pos logit contributions
ese+0.009
 mir+0.009
lor+0.009
rot+0.009
reetings+0.009
Neg logit contributions
 work-0.006
 First-0.005
 as-0.005
 Max-0.005
 On-0.005
Token and
Token ID290
Feature activation+1.59e-03

Pos logit contributions
ese+0.034
 mir+0.033
 hawk+0.032
lor+0.032
rot+0.032
Neg logit contributions
 work-0.015
 First-0.010
 as-0.009
 On-0.008
 While-0.008
Token creative
Token ID7325
Feature activation-3.29e-03

Pos logit contributions
ese+0.004
lor+0.003
rot+0.003
 mir+0.003
 hawk+0.003
Neg logit contributions
 work-0.003
 First-0.002
 Max-0.002
 pull-0.002
 as-0.002
Token.
Token ID13
Feature activation+3.83e-03

Pos logit contributions
 work+0.004
 First+0.003
 On+0.002
 While+0.002
 as+0.002
Neg logit contributions
ese-0.008
 mir-0.008
 hawk-0.008
 crow-0.008
rot-0.008
Token,
Token ID11
Feature activation-2.80e-03

Pos logit contributions
 work+0.020
 First+0.019
 On+0.016
 While+0.015
 Min+0.015
Neg logit contributions
ese-0.025
 mir-0.025
 hawk-0.023
reetings-0.023
 crow-0.023
Token can
Token ID460
Feature activation-6.04e-03

Pos logit contributions
 work+0.004
 First+0.004
 On+0.003
 pull+0.003
 Max+0.003
Neg logit contributions
ese-0.006
 mir-0.006
ivering-0.005
rim-0.005
ile-0.005
Token you
Token ID345
Feature activation-1.67e-02

Pos logit contributions
 work+0.011
 First+0.011
 On+0.009
 While+0.009
 Min+0.009
Neg logit contributions
ese-0.010
 mir-0.010
reetings-0.009
 hawk-0.009
 crow-0.009
Token say
Token ID910
Feature activation+1.20e-02

Pos logit contributions
 work+0.022
 First+0.019
 On+0.017
 While+0.015
 pull+0.015
Neg logit contributions
ese-0.039
 mir-0.039
 hawk-0.036
reetings-0.035
 crow-0.035
Token '
Token ID705
Feature activation-7.47e-03

Pos logit contributions
ese+0.026
 mir+0.025
reetings+0.023
 hawk+0.023
rot+0.023
Neg logit contributions
 work-0.016
 First-0.016
 On-0.013
 While-0.013
 Min-0.012
Tokenplease
Token ID29688
Feature activation+1.34e-02

Pos logit contributions
 work+0.014
 First+0.013
 On+0.012
 While+0.012
 Min+0.011
Neg logit contributions
ese-0.013
 mir-0.012
 hawk-0.011
 crow-0.011
reetings-0.011
Token'
Token ID6
Feature activation+1.61e-02

Pos logit contributions
 mir+0.021
ese+0.021
 hawk+0.019
 foe+0.018
rim+0.018
Neg logit contributions
 work-0.026
 First-0.025
 On-0.023
 While-0.022
 When-0.021
Token and
Token ID290
Feature activation-8.22e-03

Pos logit contributions
ese+0.032
 mir+0.031
 hawk+0.029
reetings+0.028
rot+0.028
Neg logit contributions
 work-0.025
 First-0.024
 On-0.021
 While-0.021
 Min-0.019
Token '
Token ID705
Feature activation-1.16e-02

Pos logit contributions
 work+0.011
 First+0.010
 On+0.009
 While+0.009
 Min+0.008
Neg logit contributions
ese-0.018
 mir-0.018
 hawk-0.016
reetings-0.016
rot-0.016
Tokenthank
Token ID40716
Feature activation-3.74e-03

Pos logit contributions
 work+0.021
 First+0.021
 On+0.019
 While+0.018
 Min+0.017
Neg logit contributions
ese-0.020
 mir-0.019
 hawk-0.017
reetings-0.017
 crow-0.017
Token you
Token ID345
Feature activation+1.66e-02

Pos logit contributions
 work+0.008
 First+0.008
 On+0.007
 While+0.007
 Min+0.007
Neg logit contributions
ese-0.005
 mir-0.005
 hawk-0.005
reetings-0.005
 crow-0.004
Token was
Token ID373
Feature activation-4.96e-03

Pos logit contributions
 work+0.002
 First+0.002
 On+0.002
 Min+0.002
 While+0.002
Neg logit contributions
ese-0.003
 mir-0.003
 hawk-0.003
reetings-0.003
rot-0.003
Token very
Token ID845
Feature activation+1.58e-04

Pos logit contributions
 work+0.007
 First+0.006
 Min+0.005
 Em+0.005
 as+0.005
Neg logit contributions
ese-0.011
 mir-0.011
reetings-0.010
 hawk-0.010
rot-0.010
Token pleased
Token ID10607
Feature activation+5.93e-04

Pos logit contributions
ese+0.000
lor+0.000
 hawk+0.000
rot+0.000
 mir+0.000
Neg logit contributions
 work-0.000
 First-0.000
 On-0.000
 While-0.000
 Min-0.000
Token.
Token ID13
Feature activation-2.36e-03

Pos logit contributions
ese+0.002
 mir+0.002
 hawk+0.001
reetings+0.001
 crow+0.001
Neg logit contributions
 work-0.001
 First-0.001
 On-0.000
 While-0.000
 When-0.000
Token She
Token ID1375
Feature activation-1.16e-03

Pos logit contributions
 work+0.003
 First+0.003
 While+0.002
 On+0.002
 pull+0.002
Neg logit contributions
ese-0.005
 mir-0.005
reetings-0.005
 crow-0.005
rot-0.005
Token said
Token ID531
Feature activation+1.77e-02

Pos logit contributions
 work+0.002
 First+0.001
 On+0.001
 Min+0.001
 While+0.001
Neg logit contributions
ese-0.003
reetings-0.002
 mir-0.002
 hawk-0.002
rot-0.002
Token,
Token ID11
Feature activation+7.22e-03

Pos logit contributions
ese+0.046
 mir+0.044
reetings+0.042
 hawk+0.041
rot+0.041
Neg logit contributions
 work-0.017
 First-0.016
 On-0.013
 While-0.012
 Min-0.012
Token "
Token ID366
Feature activation+2.47e-04

Pos logit contributions
ese+0.018
 mir+0.018
 hawk+0.017
reetings+0.017
rot+0.017
Neg logit contributions
 work-0.007
 First-0.007
 On-0.006
 While-0.006
 When-0.006
TokenThank
Token ID10449
Feature activation+7.67e-03

Pos logit contributions
 mir+0.000
ese+0.000
 hawk+0.000
 foe+0.000
 crow+0.000
Neg logit contributions
 First-0.000
 work-0.000
 While-0.000
 On-0.000
 Em-0.000
Token you
Token ID345
Feature activation+6.11e-03

Pos logit contributions
 mir+0.013
ese+0.012
 hawk+0.012
 crow+0.011
 foe+0.011
Neg logit contributions
 work-0.014
 First-0.014
 On-0.012
 Min-0.012
 When-0.012
Token for
Token ID329
Feature activation+4.26e-03

Pos logit contributions
ese+0.015
 mir+0.015
reetings+0.014
 hawk+0.014
rot+0.014
Neg logit contributions
 work-0.006
 First-0.006
 On-0.005
 While-0.005
 Min-0.004
Token park
Token ID3952
Feature activation-1.25e-02

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 Min+0.001
Neg logit contributions
ese-0.001
 mir-0.001
 crow-0.001
rot-0.001
 hawk-0.001
Token.
Token ID13
Feature activation+3.88e-03

Pos logit contributions
 First+0.014
 work+0.014
 Min+0.012
 While+0.011
 Em+0.011
Neg logit contributions
ese-0.029
 mir-0.027
rot-0.026
 crow-0.026
ich-0.025
Token They
Token ID1119
Feature activation+5.72e-03

Pos logit contributions
 mir+0.007
ese+0.007
 foe+0.007
Did+0.007
 cardinal+0.007
Neg logit contributions
 work-0.005
igsaw-0.004
 Min-0.004
 treadmill-0.004
ilda-0.004
Token were
Token ID547
Feature activation+4.77e-03

Pos logit contributions
ese+0.010
 mir+0.009
 hawk+0.009
reetings+0.009
rot+0.009
Neg logit contributions
 work-0.010
 First-0.010
 On-0.009
 While-0.009
 Min-0.008
Token so
Token ID523
Feature activation+2.02e-02

Pos logit contributions
ese+0.010
 mir+0.009
reetings+0.009
 hawk+0.009
rot+0.009
Neg logit contributions
 work-0.007
 First-0.007
 On-0.006
 While-0.006
 Min-0.005
Token excited
Token ID6568
Feature activation+1.04e-02

Pos logit contributions
ese+0.041
 mir+0.040
reetings+0.038
 hawk+0.038
rot+0.038
Neg logit contributions
 work-0.030
 First-0.029
 On-0.025
 While-0.025
 Min-0.023
Token!
Token ID0
Feature activation+4.16e-03

Pos logit contributions
ese+0.021
 mir+0.020
 crow+0.019
ich+0.018
 hawk+0.018
Neg logit contributions
 First-0.015
 work-0.014
 While-0.013
 On-0.012
 Min-0.012
Token On
Token ID1550
Feature activation+1.26e-02

Pos logit contributions
ese+0.008
 mir+0.008
 foe+0.008
rot+0.008
reetings+0.007
Neg logit contributions
 work-0.006
igsaw-0.004
 treadmill-0.004
 Min-0.004
 Em-0.004
Token their
Token ID511
Feature activation-3.28e-03

Pos logit contributions
ese+0.031
 mir+0.030
 hawk+0.029
 crow+0.029
rot+0.028
Neg logit contributions
 work-0.013
 First-0.012
 While-0.010
 On-0.010
 Min-0.009
Token way
Token ID835
Feature activation-2.73e-03

Pos logit contributions
 work+0.006
 First+0.006
 On+0.005
 While+0.005
 Min+0.005
Neg logit contributions
ese-0.006
 mir-0.005
 hawk-0.005
rot-0.005
 crow-0.005
Token,
Token ID11
Feature activation+1.01e-02

Pos logit contributions
 work+0.003
 First+0.003
 On+0.002
 While+0.002
 When+0.002
Neg logit contributions
ese-0.007
 mir-0.007
reetings-0.006
rot-0.006
 hawk-0.006
Token saw
Token ID2497
Feature activation+5.32e-03

Pos logit contributions
 work+0.008
 While+0.008
 On+0.007
 First+0.007
 When+0.007
Neg logit contributions
ese-0.016
reetings-0.015
ivering-0.015
 hawk-0.014
ich-0.014
Token a
Token ID257
Feature activation+2.08e-03

Pos logit contributions
ese+0.014
 mir+0.014
reetings+0.014
rot+0.013
 hawk+0.013
Neg logit contributions
 work-0.004
 First-0.004
 While-0.003
 On-0.003
 When-0.002
Token pond
Token ID16723
Feature activation+7.96e-03

Pos logit contributions
ese+0.005
reetings+0.004
 mir+0.004
lor+0.004
ile+0.004
Neg logit contributions
 work-0.003
 First-0.002
 On-0.002
 While-0.002
 treadmill-0.002
Token in
Token ID287
Feature activation+9.73e-03

Pos logit contributions
ese+0.023
 mir+0.022
reetings+0.022
 hawk+0.021
rot+0.021
Neg logit contributions
 work-0.005
 First-0.004
 On-0.004
 While-0.003
 When-0.002
Token the
Token ID262
Feature activation+2.12e-02

Pos logit contributions
ese+0.027
 mir+0.026
reetings+0.025
rot+0.025
 hawk+0.025
Neg logit contributions
 work-0.008
 First-0.007
 On-0.006
 While-0.005
 When-0.004
Token grass
Token ID8701
Feature activation+8.21e-03

Pos logit contributions
reetings+0.047
ese+0.045
rot+0.044
lor+0.044
 foe+0.043
Neg logit contributions
 work-0.026
 While-0.025
 First-0.025
 On-0.022
 When-0.021
Token.
Token ID13
Feature activation-7.13e-04

Pos logit contributions
ese+0.025
 mir+0.025
reetings+0.024
 hawk+0.024
Fear+0.023
Neg logit contributions
 work-0.003
 First-0.003
 On-0.002
 While-0.001
 When-0.001
Token It
Token ID632
Feature activation+5.67e-03

Pos logit contributions
 work+0.001
 First+0.001
 Min+0.001
 Em+0.001
igsaw+0.001
Neg logit contributions
 mir-0.001
ese-0.001
 crow-0.001
 hawk-0.001
reetings-0.001
Token looked
Token ID3114
Feature activation+3.27e-04

Pos logit contributions
ese+0.015
reetings+0.014
lor+0.014
rot+0.014
 mir+0.013
Neg logit contributions
 work-0.006
 On-0.005
 While-0.005
 First-0.005
 Tim-0.004
Token so
Token ID523
Feature activation+8.05e-03

Pos logit contributions
ese+0.001
lor+0.001
rot+0.001
reetings+0.001
ich+0.001
Neg logit contributions
 work-0.000
 On-0.000
 While-0.000
 First-0.000
 as-0.000
Token cool
Token ID3608
Feature activation+1.78e-03

Pos logit contributions
ese+0.018
rot+0.016
 mir+0.016
lor+0.016
reetings+0.016
Neg logit contributions
 work-0.011
 First-0.011
 On-0.010
 While-0.009
 When-0.008
Token Daddy
Token ID26926
Feature activation-3.67e-03

Pos logit contributions
 work+0.021
 treadmill+0.018
 First+0.017
 pull+0.017
 drive+0.016
Neg logit contributions
reetings-0.026
 mir-0.026
Fear-0.026
lor-0.025
Was-0.025
Token was
Token ID373
Feature activation-1.62e-03

Pos logit contributions
 work+0.006
 First+0.005
 While+0.004
 On+0.004
 Min+0.004
Neg logit contributions
 mir-0.008
ese-0.008
reetings-0.007
 hawk-0.007
 crow-0.007
Token proud
Token ID6613
Feature activation+9.88e-03

Pos logit contributions
 work+0.002
 First+0.002
 as+0.002
 While+0.002
 On+0.002
Neg logit contributions
ese-0.004
 mir-0.003
 hawk-0.003
reetings-0.003
rot-0.003
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
ese+0.023
 mir+0.021
reetings+0.020
 hawk+0.020
rot+0.020
Neg logit contributions
 work-0.013
 First-0.012
 On-0.011
 While-0.010
 Min-0.010
Token
Token ID198
Feature activation+6.27e-05

Pos logit contributions
 work+0.016
 First+0.014
 treadmill+0.013
 pull+0.012
 On+0.011
Neg logit contributions
 mir-0.020
ese-0.020
reetings-0.020
lor-0.019
Was-0.019
Token
Token ID198
Feature activation-3.59e-03

Pos logit contributions
ese+0.000
 mir+0.000
reetings+0.000
rot+0.000
lor+0.000
Neg logit contributions
 work-0.000
 First-0.000
 While-0.000
 On-0.000
 treadmill-0.000
TokenThey
Token ID2990
Feature activation-1.99e-03

Pos logit contributions
 work+0.006
 First+0.006
 On+0.005
 While+0.005
 When+0.005
Neg logit contributions
ese-0.007
 mir-0.006
 hawk-0.006
 crow-0.006
rot-0.006
Token went
Token ID1816
Feature activation-8.41e-04

Pos logit contributions
 work+0.003
 First+0.003
 On+0.003
 While+0.003
 Min+0.002
Neg logit contributions
ese-0.004
 mir-0.004
reetings-0.004
 hawk-0.004
rot-0.004
Token to
Token ID284
Feature activation-4.83e-04

Pos logit contributions
 First+0.001
 work+0.001
 On+0.001
 While+0.001
 Min+0.001
Neg logit contributions
ese-0.001
 mir-0.001
rot-0.001
 hawk-0.001
reetings-0.001
Token the
Token ID262
Feature activation+5.62e-04

Pos logit contributions
 First+0.001
 While+0.001
 On+0.001
cases+0.001
tered+0.001
Neg logit contributions
 mir-0.001
ese-0.001
 yourself-0.001
 crow-0.001
 foe-0.001
Token park
Token ID3952
Feature activation-1.53e-02

Pos logit contributions
ese+0.001
 mir+0.001
reetings+0.001
 hawk+0.001
rot+0.001
Neg logit contributions
 work-0.001
 First-0.001
 On-0.001
 While-0.001
 treadmill-0.001
Token!
Token ID0
Feature activation-4.15e-03

Pos logit contributions
ese+0.005
 mir+0.005
reetings+0.005
rot+0.005
 hawk+0.005
Neg logit contributions
 work-0.003
 First-0.003
 On-0.003
 While-0.003
 Min-0.002
Token He
Token ID679
Feature activation+1.21e-03

Pos logit contributions
 work+0.006
 First+0.004
 treadmill+0.004
igsaw+0.004
 pull+0.004
Neg logit contributions
ese-0.009
 mir-0.008
reetings-0.008
 hawk-0.008
rot-0.008
Token drove
Token ID10357
Feature activation-2.48e-04

Pos logit contributions
ese+0.003
 mir+0.002
lor+0.002
reetings+0.002
 hawk+0.002
Neg logit contributions
 work-0.002
 First-0.002
 While-0.002
 On-0.002
 drive-0.001
Token as
Token ID355
Feature activation-2.29e-03

Pos logit contributions
 First+0.000
 work+0.000
 On+0.000
 While+0.000
 Min+0.000
Neg logit contributions
ese-0.001
 mir-0.000
 hawk-0.000
rot-0.000
 crow-0.000
Token fast
Token ID3049
Feature activation-3.14e-02

Pos logit contributions
 First+0.003
 work+0.003
 On+0.003
 While+0.003
 Min+0.003
Neg logit contributions
ese-0.004
 mir-0.004
 hawk-0.004
 crow-0.004
reetings-0.004
Token as
Token ID355
Feature activation-5.17e-03

Pos logit contributions
 First+0.038
igsaw+0.035
 On+0.032
 Min+0.031
 brushes+0.030
Neg logit contributions
 hawk-0.066
 eagle-0.061
 crow-0.061
 mir-0.061
ese-0.060
Token he
Token ID339
Feature activation-9.06e-03

Pos logit contributions
 First+0.007
 work+0.007
 On+0.006
 While+0.006
 treadmill+0.005
Neg logit contributions
ese-0.011
 mir-0.011
 hawk-0.010
 crow-0.010
reetings-0.010
Token could
Token ID714
Feature activation-1.06e-02

Pos logit contributions
 work+0.017
 First+0.016
 On+0.015
 While+0.015
 When+0.014
Neg logit contributions
ese-0.016
 mir-0.015
reetings-0.013
 hawk-0.013
 crow-0.013
Token and
Token ID290
Feature activation-4.16e-03

Pos logit contributions
 work+0.012
 First+0.012
 On+0.009
 While+0.009
 Min+0.009
Neg logit contributions
ese-0.025
 mir-0.024
reetings-0.023
 hawk-0.023
rot-0.023
Token soon
Token ID2582
Feature activation-1.44e-02

Pos logit contributions
 work+0.006
 First+0.006
 On+0.006
 While+0.005
 Min+0.005
Neg logit contributions
ese-0.008
 mir-0.008
reetings-0.007
 hawk-0.007
 crow-0.007
Token he
Token ID339
Feature activation-4.46e-04

Pos logit contributions
 First+0.020
 work+0.019
 On+0.017
 While+0.016
 Min+0.015
Neg logit contributions
ese-0.030
 mir-0.030
 hawk-0.028
 crow-0.028
reetings-0.027
Token children
Token ID1751
Feature activation-6.50e-03

Pos logit contributions
 work+0.020
 First+0.019
 On+0.017
 While+0.017
 Min+0.016
Neg logit contributions
ese-0.022
 mir-0.021
reetings-0.020
 hawk-0.020
rot-0.020
Token looked
Token ID3114
Feature activation-7.48e-03

Pos logit contributions
 work+0.010
 First+0.009
 On+0.008
 While+0.008
 Min+0.007
Neg logit contributions
ese-0.013
 mir-0.013
 hawk-0.012
reetings-0.012
rot-0.012
Token around
Token ID1088
Feature activation-1.20e-02

Pos logit contributions
 work+0.010
 First+0.010
 On+0.009
 While+0.009
 Min+0.008
Neg logit contributions
ese-0.016
 mir-0.015
reetings-0.014
 hawk-0.014
rot-0.014
Token in
Token ID287
Feature activation-6.07e-04

Pos logit contributions
 work+0.014
 First+0.014
 On+0.012
 While+0.011
 Min+0.010
Neg logit contributions
ese-0.028
 mir-0.027
reetings-0.026
 hawk-0.026
rot-0.025
Token wonder
Token ID4240
Feature activation+6.77e-03

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 Min+0.000
Neg logit contributions
ese-0.001
 mir-0.001
reetings-0.001
 hawk-0.001
rot-0.001
Token.
Token ID13
Feature activation-9.14e-04

Pos logit contributions
ese+0.018
 mir+0.018
rot+0.017
 hawk+0.017
reetings+0.017
Neg logit contributions
 work-0.006
 First-0.005
 On-0.004
 While-0.004
 When-0.003
Token The
Token ID383
Feature activation-8.03e-03

Pos logit contributions
 work+0.001
 First+0.001
igsaw+0.001
 Min+0.001
 treadmill+0.001
Neg logit contributions
ese-0.002
 mir-0.002
reetings-0.002
 hawk-0.002
 foe-0.002
Token theater
Token ID13766
Feature activation+5.98e-03

Pos logit contributions
 work+0.013
 First+0.012
 On+0.011
 While+0.011
 Min+0.010
Neg logit contributions
ese-0.015
 mir-0.015
reetings-0.014
 hawk-0.014
rot-0.014
Token was
Token ID373
Feature activation-1.31e-02

Pos logit contributions
ese+0.013
 mir+0.012
 hawk+0.012
reetings+0.012
lor+0.012
Neg logit contributions
 work-0.009
 First-0.008
 On-0.007
 While-0.007
 When-0.006
Token so
Token ID523
Feature activation+9.30e-04

Pos logit contributions
 work+0.017
 First+0.015
 On+0.014
 While+0.013
 When+0.012
Neg logit contributions
ese-0.030
 mir-0.028
rot-0.028
lor-0.028
 hawk-0.028
Token big
Token ID1263
Feature activation-1.27e-02

Pos logit contributions
ese+0.002
 mir+0.002
 hawk+0.002
rot+0.002
reetings+0.002
Neg logit contributions
 work-0.001
 First-0.001
 On-0.001
 While-0.001
 Min-0.001
Token"
Token ID1
Feature activation+6.14e-03

Pos logit contributions
ese+0.005
 mir+0.004
rot+0.004
reetings+0.004
 hawk+0.004
Neg logit contributions
 work-0.009
 First-0.009
 On-0.008
 While-0.008
 Min-0.008
TokenNo
Token ID2949
Feature activation+2.93e-03

Pos logit contributions
ese+0.010
 mir+0.010
reetings+0.009
 hawk+0.009
rot+0.009
Neg logit contributions
 work-0.011
 First-0.011
 On-0.010
 While-0.010
 Min-0.009
Token,
Token ID11
Feature activation-8.59e-03

Pos logit contributions
ese+0.006
 mir+0.006
 hawk+0.006
 crow+0.006
 eagle+0.005
Neg logit contributions
 First-0.004
 On-0.004
 work-0.004
 While-0.004
 When-0.003
Token it
Token ID340
Feature activation-8.81e-04

Pos logit contributions
 work+0.010
 First+0.009
 pull+0.008
 Max+0.007
 Lily+0.007
Neg logit contributions
ese-0.021
reetings-0.020
 mir-0.020
rot-0.019
lor-0.019
Token is
Token ID318
Feature activation-2.57e-03

Pos logit contributions
 work+0.002
 First+0.002
 On+0.002
 While+0.002
 Min+0.002
Neg logit contributions
ese-0.001
 mir-0.001
rot-0.001
 hawk-0.001
reetings-0.001
Token mine
Token ID6164
Feature activation-1.58e-03

Pos logit contributions
 work+0.005
 First+0.005
 On+0.004
 While+0.004
 pull+0.004
Neg logit contributions
ese-0.004
 mir-0.004
reetings-0.004
rot-0.004
lor-0.004
Token,
Token ID11
Feature activation-9.13e-03

Pos logit contributions
 First+0.003
 work+0.003
 While+0.002
 On+0.002
 Min+0.002
Neg logit contributions
ese-0.003
 mir-0.003
rot-0.003
reetings-0.002
ich-0.002
Token I
Token ID314
Feature activation-9.21e-03

Pos logit contributions
 work+0.010
 Lily+0.009
 Max+0.008
 Tim+0.008
 as+0.008
Neg logit contributions
ese-0.023
reetings-0.023
 mir-0.021
lor-0.021
asis-0.020
Token saw
Token ID2497
Feature activation-7.70e-03

Pos logit contributions
 work+0.015
 First+0.014
 On+0.013
 While+0.012
 pull+0.011
Neg logit contributions
ese-0.018
 mir-0.017
 hawk-0.016
reetings-0.016
rot-0.016
Token it
Token ID340
Feature activation+3.20e-03

Pos logit contributions
 work+0.010
 First+0.009
 On+0.008
 While+0.008
 When+0.007
Neg logit contributions
ese-0.017
 mir-0.017
reetings-0.016
rot-0.016
 hawk-0.016
Token first
Token ID717
Feature activation+1.72e-03

Pos logit contributions
ese+0.007
 mir+0.006
reetings+0.006
 hawk+0.006
rot+0.006
Neg logit contributions
 work-0.004
 First-0.004
 On-0.004
 While-0.004
 Min-0.003
Token to
Token ID284
Feature activation+4.00e-03

Pos logit contributions
 First+0.035
 work+0.034
 On+0.031
 While+0.030
 Min+0.030
Neg logit contributions
ese-0.038
rot-0.034
 mir-0.034
 hawk-0.033
ich-0.033
Token his
Token ID465
Feature activation-1.77e-03

Pos logit contributions
ese+0.009
 mir+0.009
 crow+0.009
 hawk+0.009
reetings+0.009
Neg logit contributions
 First-0.004
 work-0.004
 On-0.003
 While-0.003
 treadmill-0.003
Token dad
Token ID9955
Feature activation-7.30e-03

Pos logit contributions
 First+0.003
 work+0.003
 On+0.003
 While+0.003
 When+0.002
Neg logit contributions
ese-0.003
 mir-0.003
 hawk-0.003
rot-0.003
 crow-0.003
Token,
Token ID11
Feature activation+3.72e-03

Pos logit contributions
 First+0.011
 work+0.011
 While+0.010
 On+0.009
 treadmill+0.009
Neg logit contributions
ese-0.014
 mir-0.013
rot-0.012
reetings-0.012
 crow-0.012
Token who
Token ID508
Feature activation+2.49e-04

Pos logit contributions
ese+0.008
reetings+0.008
 mir+0.008
rot+0.008
ile+0.008
Neg logit contributions
 work-0.005
 First-0.004
 While-0.004
 On-0.004
 as-0.004
Token fixed
Token ID5969
Feature activation-1.33e-02

Pos logit contributions
ese+0.001
 mir+0.000
reetings+0.000
 hawk+0.000
rot+0.000
Neg logit contributions
 work-0.000
 First-0.000
 On-0.000
 While-0.000
 Min-0.000
Token it
Token ID340
Feature activation-1.02e-02

Pos logit contributions
 work+0.016
 First+0.016
 On+0.014
 While+0.014
 When+0.012
Neg logit contributions
ese-0.030
 mir-0.029
 hawk-0.028
 crow-0.027
rot-0.027
Token up
Token ID510
Feature activation-2.55e-03

Pos logit contributions
 First+0.012
 work+0.011
 While+0.010
 On+0.009
 Min+0.009
Neg logit contributions
ese-0.022
 mir-0.021
reetings-0.021
rot-0.021
 crow-0.020
Token good
Token ID922
Feature activation+6.41e-03

Pos logit contributions
 work+0.003
 First+0.002
 On+0.002
 While+0.002
 Min+0.002
Neg logit contributions
ese-0.006
 mir-0.006
 hawk-0.006
reetings-0.006
rot-0.006
Token as
Token ID355
Feature activation-7.51e-03

Pos logit contributions
ese+0.014
 mir+0.013
reetings+0.012
 hawk+0.012
rot+0.012
Neg logit contributions
 work-0.009
 First-0.009
 On-0.007
 While-0.007
 Min-0.007
Token new
Token ID649
Feature activation-1.56e-02

Pos logit contributions
 work+0.009
 First+0.008
 On+0.007
 While+0.007
 Min+0.006
Neg logit contributions
ese-0.018
 mir-0.017
reetings-0.016
rot-0.016
 hawk-0.016
Token stepped
Token ID10764
Feature activation+4.51e-03

Pos logit contributions
 work+0.000
 First+0.000
 While+0.000
 On+0.000
 pull+0.000
Neg logit contributions
ese-0.000
reetings-0.000
 mir-0.000
rot-0.000
 hawk-0.000
Token forward
Token ID2651
Feature activation+6.35e-03

Pos logit contributions
ese+0.009
 mir+0.008
reetings+0.008
 hawk+0.008
rot+0.008
Neg logit contributions
 work-0.007
 First-0.007
 On-0.006
 While-0.006
 Min-0.006
Token.
Token ID13
Feature activation-1.27e-03

Pos logit contributions
ese+0.013
 mir+0.013
reetings+0.012
 crow+0.012
 hawk+0.012
Neg logit contributions
 First-0.009
 work-0.008
 While-0.007
 On-0.007
 treadmill-0.007
Token But
Token ID887
Feature activation-6.92e-03

Pos logit contributions
 work+0.002
 First+0.001
 cartoons+0.001
 treadmill+0.001
 Min+0.001
Neg logit contributions
 mir-0.003
ese-0.003
reetings-0.002
lor-0.002
 crow-0.002
Token as
Token ID355
Feature activation-5.57e-03

Pos logit contributions
 work+0.007
 First+0.007
 On+0.006
 While+0.006
 When+0.005
Neg logit contributions
ese-0.018
 mir-0.017
rot-0.016
reetings-0.016
 hawk-0.016
Token soon
Token ID2582
Feature activation-3.37e-02

Pos logit contributions
 First+0.008
 work+0.008
 On+0.007
 While+0.006
 treadmill+0.006
Neg logit contributions
ese-0.011
 mir-0.011
reetings-0.011
 crow-0.010
 hawk-0.010
Token he
Token ID339
Feature activation-1.46e-02

Pos logit contributions
 First+0.040
 treadmill+0.038
 shelf+0.034
 brushes+0.033
 work+0.033
Neg logit contributions
 mir-0.070
 crow-0.066
 hawk-0.065
 pearl-0.064
 ge-0.063
Token did
Token ID750
Feature activation-6.17e-03

Pos logit contributions
 work+0.021
 First+0.018
 On+0.017
 While+0.017
 Min+0.016
Neg logit contributions
ese-0.033
 mir-0.030
reetings-0.030
 hawk-0.029
ich-0.029
Token,
Token ID11
Feature activation+2.57e-03

Pos logit contributions
 work+0.006
 First+0.005
 On+0.004
 While+0.004
 When+0.003
Neg logit contributions
ese-0.017
 mir-0.016
rot-0.015
reetings-0.015
 hawk-0.015
Token he
Token ID339
Feature activation+2.06e-03

Pos logit contributions
ese+0.006
 mir+0.005
 hawk+0.005
 crow+0.005
reetings+0.005
Neg logit contributions
 work-0.003
 First-0.003
 On-0.003
 While-0.003
 Min-0.002
Token heard
Token ID2982
Feature activation+2.14e-03

Pos logit contributions
ese+0.004
 mir+0.004
reetings+0.004
rim+0.004
 hawk+0.004
Neg logit contributions
 work-0.003
 First-0.002
 pull-0.002
 On-0.002
 While-0.002
Token her
Token ID607
Feature activation+3.81e-03

Pos logit contributions
 work+0.002
 First+0.002
 On+0.001
 While+0.001
 Min+0.001
Neg logit contributions
ese-0.004
 mir-0.004
reetings-0.004
rot-0.003
 hawk-0.003
Token nose
Token ID9686
Feature activation+3.36e-03

Pos logit contributions
ese+0.008
 mir+0.007
reetings+0.007
 hawk+0.007
rot+0.007
Neg logit contributions
 work-0.006
 First-0.006
 On-0.005
 While-0.005
 Min-0.005
Token at
Token ID379
Feature activation+4.09e-03

Pos logit contributions
ese+0.007
 mir+0.007
reetings+0.007
 hawk+0.007
rot+0.007
Neg logit contributions
 work-0.004
 First-0.004
 On-0.004
 While-0.003
 Min-0.003
Token how
Token ID703
Feature activation-2.17e-03

Pos logit contributions
ese+0.009
 mir+0.009
 crow+0.009
 hawk+0.009
rot+0.008
Neg logit contributions
 First-0.005
 work-0.004
 On-0.004
 While-0.004
 treadmill-0.003
Token it
Token ID340
Feature activation-2.41e-03

Pos logit contributions
 work+0.003
 First+0.003
 On+0.002
 While+0.002
 Min+0.002
Neg logit contributions
ese-0.005
 mir-0.004
rot-0.004
reetings-0.004
 hawk-0.004
Token smelled
Token ID37609
Feature activation-2.80e-02

Pos logit contributions
 work+0.004
 First+0.003
 On+0.003
 Min+0.003
 While+0.003
Neg logit contributions
ese-0.005
reetings-0.005
 mir-0.005
rot-0.005
lor-0.005
Token.
Token ID13
Feature activation-4.34e-04

Pos logit contributions
 work+0.036
 First+0.033
 On+0.028
 While+0.027
 Min+0.025
Neg logit contributions
ese-0.065
 mir-0.061
rot-0.060
reetings-0.059
 hawk-0.059
Token But
Token ID887
Feature activation-9.31e-03

Pos logit contributions
 work+0.001
 First+0.000
 On+0.000
 While+0.000
 treadmill+0.000
Neg logit contributions
ese-0.001
 mir-0.001
reetings-0.001
 crow-0.001
rot-0.001
Token then
Token ID788
Feature activation-8.41e-03

Pos logit contributions
 work+0.009
 First+0.009
 On+0.007
 Min+0.007
 as+0.007
Neg logit contributions
ese-0.025
 mir-0.024
reetings-0.023
rot-0.023
lor-0.022
Token,
Token ID11
Feature activation+3.62e-03

Pos logit contributions
 work+0.009
 First+0.009
 On+0.007
 While+0.007
 Min+0.006
Neg logit contributions
ese-0.021
 mir-0.020
reetings-0.019
rot-0.019
 hawk-0.019
Token she
Token ID673
Feature activation+4.12e-03

Pos logit contributions
ese+0.009
 mir+0.008
 hawk+0.008
reetings+0.008
 crow+0.008
Neg logit contributions
 work-0.004
 First-0.004
 On-0.003
 While-0.003
 Min-0.003
Token was
Token ID373
Feature activation-1.46e-02

Pos logit contributions
 work+0.010
 First+0.009
 On+0.008
 While+0.008
 When+0.007
Neg logit contributions
ese-0.016
 mir-0.015
reetings-0.014
 crow-0.014
 hawk-0.014
Token happy
Token ID3772
Feature activation+2.65e-03

Pos logit contributions
 work+0.022
 First+0.020
 On+0.018
 While+0.018
 When+0.017
Neg logit contributions
ese-0.031
 mir-0.029
rot-0.028
reetings-0.028
lor-0.027
Token and
Token ID290
Feature activation-3.78e-03

Pos logit contributions
ese+0.007
 mir+0.006
reetings+0.006
rot+0.006
 hawk+0.006
Neg logit contributions
 work-0.003
 First-0.003
 On-0.002
 While-0.002
 Min-0.002
Token it
Token ID340
Feature activation+1.46e-03

Pos logit contributions
 work+0.006
 First+0.005
 While+0.004
 On+0.004
 When+0.004
Neg logit contributions
ese-0.008
 mir-0.008
rot-0.008
reetings-0.008
 hawk-0.008
Token was
Token ID373
Feature activation-1.31e-02

Pos logit contributions
ese+0.003
reetings+0.003
 mir+0.003
rot+0.003
 crow+0.003
Neg logit contributions
 work-0.002
 First-0.002
 On-0.001
 While-0.001
 Min-0.001
Token tall
Token ID7331
Feature activation-2.66e-02

Pos logit contributions
 work+0.019
 First+0.017
 On+0.015
 While+0.015
 When+0.014
Neg logit contributions
ese-0.028
 mir-0.027
rot-0.026
reetings-0.026
 hawk-0.026
Token.
Token ID13
Feature activation-8.96e-04

Pos logit contributions
 First+0.028
 work+0.028
 On+0.023
 While+0.023
 Min+0.020
Neg logit contributions
ese-0.064
 mir-0.062
 hawk-0.059
rot-0.059
 crow-0.059
Token Everyone
Token ID11075
Feature activation-1.55e-02

Pos logit contributions
 work+0.001
 First+0.001
 treadmill+0.001
 pull+0.001
 Min+0.001
Neg logit contributions
ese-0.002
 mir-0.002
reetings-0.002
rot-0.002
 hawk-0.002
Token enjoyed
Token ID8359
Feature activation+6.15e-03

Pos logit contributions
 work+0.022
 First+0.020
 While+0.017
 On+0.017
 When+0.016
Neg logit contributions
ese-0.034
 mir-0.033
reetings-0.031
 hawk-0.031
lor-0.031
Token walking
Token ID6155
Feature activation-1.26e-03

Pos logit contributions
ese+0.016
 mir+0.015
reetings+0.014
rot+0.014
 hawk+0.014
Neg logit contributions
 work-0.006
 First-0.006
 On-0.005
 While-0.005
 Min-0.004
Token in
Token ID287
Feature activation+5.32e-03

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 When+0.001
Neg logit contributions
ese-0.003
 mir-0.003
reetings-0.003
 crow-0.003
rot-0.003
Token happy
Token ID3772
Feature activation+3.16e-03

Pos logit contributions
ese+0.007
lor+0.006
rot+0.006
 mir+0.006
 hawk+0.006
Neg logit contributions
 work-0.006
 First-0.005
 On-0.004
 While-0.004
 pull-0.004
Token and
Token ID290
Feature activation-6.05e-03

Pos logit contributions
ese+0.008
 mir+0.008
reetings+0.008
 hawk+0.007
rot+0.007
Neg logit contributions
 work-0.003
 First-0.003
 On-0.003
 as-0.002
 While-0.002
Token it
Token ID340
Feature activation+1.64e-03

Pos logit contributions
 work+0.009
 First+0.008
 On+0.007
 While+0.007
 Min+0.007
Neg logit contributions
ese-0.013
 mir-0.013
reetings-0.012
rot-0.012
 hawk-0.012
Token thanked
Token ID26280
Feature activation-1.51e-02

Pos logit contributions
ese+0.004
 mir+0.003
reetings+0.003
ich+0.003
 hawk+0.003
Neg logit contributions
 work-0.002
 First-0.002
 Min-0.002
 On-0.002
 While-0.002
Token the
Token ID262
Feature activation-1.63e-02

Pos logit contributions
 work+0.014
 First+0.013
 On+0.010
 While+0.010
 Min+0.008
Neg logit contributions
ese-0.040
 mir-0.039
reetings-0.037
 hawk-0.037
rot-0.037
Token little
Token ID1310
Feature activation-2.57e-02

Pos logit contributions
 work+0.026
 First+0.024
 On+0.022
 While+0.021
 Min+0.020
Neg logit contributions
ese-0.032
 mir-0.031
reetings-0.029
 hawk-0.029
rot-0.028
Token child
Token ID1200
Feature activation+1.55e-03

Pos logit contributions
 work+0.042
 First+0.041
 On+0.036
 While+0.036
 Min+0.033
Neg logit contributions
ese-0.048
 mir-0.046
 hawk-0.043
reetings-0.043
rot-0.043
Token.
Token ID13
Feature activation-3.82e-03

Pos logit contributions
ese+0.004
 mir+0.003
rot+0.003
reetings+0.003
 crow+0.003
Neg logit contributions
 First-0.002
 work-0.002
 While-0.001
 On-0.001
 treadmill-0.001
Token From
Token ID3574
Feature activation-1.16e-03

Pos logit contributions
 work+0.005
 First+0.004
 treadmill+0.003
 On+0.003
 While+0.003
Neg logit contributions
ese-0.009
 mir-0.008
reetings-0.008
rot-0.008
 hawk-0.008
Token now
Token ID783
Feature activation+8.77e-03

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 Min+0.001
Neg logit contributions
ese-0.003
 mir-0.003
reetings-0.003
rot-0.003
 hawk-0.003
Token on
Token ID319
Feature activation+7.10e-03

Pos logit contributions
ese+0.018
 mir+0.017
reetings+0.016
 hawk+0.016
rot+0.016
Neg logit contributions
 work-0.013
 First-0.012
 On-0.011
 While-0.011
 Min-0.010
Token door
Token ID3420
Feature activation-7.07e-03

Pos logit contributions
ese+0.020
 mir+0.019
reetings+0.019
rot+0.018
 hawk+0.018
Neg logit contributions
 work-0.018
 First-0.018
 On-0.016
 While-0.016
 Min-0.014
Token.
Token ID13
Feature activation+6.63e-04

Pos logit contributions
 First+0.008
 work+0.008
 While+0.007
igsaw+0.007
 Min+0.007
Neg logit contributions
ese-0.015
 mir-0.014
 crow-0.014
reetings-0.013
rot-0.013
Token She
Token ID1375
Feature activation+3.56e-03

Pos logit contributions
Fear+0.001
reetings+0.001
 mir+0.001
Nobody+0.001
Was+0.001
Neg logit contributions
 work-0.001
igsaw-0.001
 cartoons-0.001
 treadmill-0.001
 experiments-0.001
Token crawled
Token ID45668
Feature activation+9.19e-03

Pos logit contributions
ese+0.007
 mir+0.007
reetings+0.007
rot+0.006
 hawk+0.006
Neg logit contributions
 work-0.006
 First-0.005
 While-0.005
 On-0.005
 Min-0.005
Token as
Token ID355
Feature activation+6.83e-03

Pos logit contributions
 mir+0.016
ese+0.015
 hawk+0.015
reetings+0.014
Fear+0.014
Neg logit contributions
 work-0.013
 First-0.012
igsaw-0.012
 cartoons-0.011
 Min-0.011
Token fast
Token ID3049
Feature activation-2.86e-02

Pos logit contributions
 mir+0.011
 hawk+0.011
ese+0.011
 crow+0.011
ich+0.011
Neg logit contributions
 First-0.011
 work-0.011
 On-0.010
 While-0.010
igsaw-0.009
Token as
Token ID355
Feature activation-5.33e-04

Pos logit contributions
 First+0.034
igsaw+0.034
 On+0.030
 assemble+0.029
 treadmill+0.028
Neg logit contributions
 hawk-0.057
 crow-0.054
 mir-0.054
 eagle-0.051
ese-0.050
Token she
Token ID673
Feature activation-4.71e-03

Pos logit contributions
 work+0.001
 First+0.001
 On+0.001
 While+0.001
 treadmill+0.001
Neg logit contributions
 mir-0.001
ese-0.001
 hawk-0.001
 crow-0.001
reetings-0.001
Token could
Token ID714
Feature activation+5.63e-04

Pos logit contributions
 work+0.009
 First+0.009
 On+0.008
 While+0.008
 Min+0.007
Neg logit contributions
ese-0.008
 mir-0.007
 hawk-0.007
reetings-0.007
rot-0.007
Token,
Token ID11
Feature activation+7.01e-03

Pos logit contributions
ese+0.001
 mir+0.001
reetings+0.001
 hawk+0.001
rot+0.001
Neg logit contributions
 work-0.001
 First-0.001
 On-0.000
 While-0.000
 Min-0.000
Token but
Token ID475
Feature activation-1.27e-02

Pos logit contributions
ese+0.016
 mir+0.015
rot+0.014
reetings+0.014
 hawk+0.014
Neg logit contributions
 work-0.010
 First-0.009
 While-0.008
 On-0.008
 Min-0.007