TOP ACTIVATIONS
MAX = 5.132

and said goodbye to her se ag ull friends . She
enough to scare away the se ag ull s !"
the beach to feed the se ag ull s , but
obedient ly and watched the se ag ull fly away .
my and Tom followed the se ag ull s all around
a plan to distract the se ag ull while Tim my
Be et le watched the se ag ull until she could
okay and the car nodded its head . The truck hooked
s follow the se ag ull s ! â
Tom said goodbye to the se ag ull s and put
running . He chased the se ag ull s as they
quiet compared to the noisy se ag ull s outside .
wave came and knocked the se ag ull off course .
easy ! Just follow the se ag ull s . They
beetle waved back and the se ag ull flew away .
say and you help each other . Do you understand ?"
will try to fix your aer opl ane or get you
the lake and put their kay ak in the water .
But then , a big se ag ull saw the oy
watched the waves and the se ag ull s . They

MIN ACTIVATIONS
MIN = -4.155

I don 't like rac co ons . They are mean
Mom and they loved mac aron i and cheese .
their favorite food : mac aron i and cheese .
broccoli . She wanted mac aron i and cheese instead .
crying . She wanted mac aron i and cheese . It
Dad said . " Gate 12 . Come on , let
had fun playing together at recess . After school
nodded . She liked mac aron i and cheese . She
She jumped on Tom and pushed him away from
ila peek ed from behind Tom and saw the picture again
dog . She hid behind Tom . " Tom
that . It jumped on Tom 's face and cro aked
saw the paper hat on Tom 's head and thought it
They played together at recess . They shared their toys
He jumped at Tom and Sue . He bit
He throws the ball at Tom and Sam and runs away
bone . It jumped on Tom and snatched the fl ute
swung his golf club at Tom and hit him on the
. He runs away from Tom . He does not want
too . She ran after Tom . Tom 's

INTERVAL 2.810 - 5.132
CONTAINS 0.070%

mouth . It was a se ag ull . It had
and everything in the park gl owed with amazing colors .
Tommy laughed at the silly se ag ull and the man
hands to hold the f auc et up and support it
to the biggest , sh ini est fish in the pond

INTERVAL 0.489 - 2.810
CONTAINS 22.835%

for a while . " Let 's bury this smooth stone
could move and stretch in any way she wanted .
ride in the elevator and see if she could find her
I knew you could do it , Lily . You always
. But let 's tie a string to it and make

INTERVAL -1.833 - 0.489
CONTAINS 75.869%

. And the tree continued to grow and grow , making
Jack . Jack loved playing in his garden , and each
day , Mam ma was very busy . She was making
He learned that it was fun and beautiful and full of
not look at the sky . <|endoftext|> Anna liked to cook

INTERVAL -4.155 - -1.833
CONTAINS 1.226%

She sang every day at home , at school , and
<|endoftext|> Once there was a bar ber named Sam . He
not listen . He grabs Sue 's hand and pulls her
plastic wheels . He could count them and play with them
They had a lot of fun together . But as the
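
The four INTERVAL ranges above look like an even four-way split of the span from MIN (-4.155) to MAX (5.132), and the four CONTAINS shares add up to 100%. That reading is inferred from the listed numbers and is not stated anywhere in the dump. A minimal sketch to check it follows; the function name `interval_edges` and the bin count of 4 are assumptions made for illustration.

```python
# Sketch: check that the listed INTERVAL bounds match an equal-width
# split of [MIN, MAX]. The equal-width reading is an assumption inferred
# from the numbers in this dump, not something the dump states.

def interval_edges(lo: float, hi: float, bins: int = 4) -> list[float]:
    """Return bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]

# MIN and MAX as listed under MIN ACTIVATIONS and TOP ACTIVATIONS.
edges = interval_edges(-4.155, 5.132)

# Bounds as printed in the INTERVAL headers (three decimals).
LISTED_EDGES = [-4.155, -1.833, 0.489, 2.810, 5.132]

# The dump rounds to three decimals, so compare with a small tolerance.
edges_match = all(abs(e - l) < 1e-3 for e, l in zip(edges, LISTED_EDGES))
print("edges match listed bounds:", edges_match)

# CONTAINS shares, highest interval first, as listed.
shares = [0.070, 22.835, 75.869, 1.226]
print("total share (%):", round(sum(shares), 3))
```

If the equal-width reading holds, about three quarters of all token positions fall in the bin from -1.833 to 0.489, and the range that holds the strong " se" activations covers only 0.070% of positions.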
Token and
Token ID290
Feature activation+1.32e+00

Pos logit contributions
cient+0.020
 weaker+0.020
 sy+0.019
 boo+0.018
onel+0.018
Neg logit contributions
ris-0.011
dad-0.010
aren-0.009
oys-0.009
Dan-0.009
Token said
Token ID531
Feature activation-1.37e-01

Pos logit contributions
cient+0.047
 sy+0.045
 weaker+0.045
 boo+0.042
 infected+0.042
Neg logit contributions
dad-0.027
ris-0.027
aren-0.024
 ahead-0.023
 Sophia-0.022
Token goodbye
Token ID24829
Feature activation+9.00e-01

Pos logit contributions
dad+0.002
ar+0.002
ae+0.002
 ahead+0.002
 than+0.002
Neg logit contributions
 sy-0.005
cient-0.005
 weaker-0.005
 boo-0.005
 infected-0.005
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
cient+0.019
 wing+0.019
 weaker+0.018
 boo+0.017
 sy+0.016
Neg logit contributions
ris-0.030
dad-0.029
Dan-0.029
atherine-0.029
aren-0.028
Token her
Token ID607
Feature activation+4.05e-01

Pos logit contributions
cient+0.028
 sy+0.028
 weaker+0.028
 boo+0.025
onel+0.025
Neg logit contributions
ris-0.030
dad-0.030
aren-0.027
Dan-0.027
 recess-0.026
Token se
Token ID384
Feature activation+5.13e+00

Pos logit contributions
cient+0.011
 sy+0.011
 weaker+0.010
onel+0.010
 boo+0.010
Neg logit contributions
ris-0.012
dad-0.011
 than-0.010
Dan-0.010
oys-0.010
Tokenag
Token ID363
Feature activation+3.89e-01

Pos logit contributions
cient+0.086
 weaker+0.077
 sy+0.075
 boo+0.074
onel+0.070
Neg logit contributions
ris-0.190
dad-0.186
Dan-0.182
aren-0.180
oys-0.178
Tokenull
Token ID724
Feature activation+1.96e+00

Pos logit contributions
cient+0.010
 weaker+0.009
 sy+0.009
 boo+0.009
onel+0.008
Neg logit contributions
ris-0.012
dad-0.012
aren-0.011
oys-0.011
Dan-0.011
Token friends
Token ID2460
Feature activation+4.75e-01

Pos logit contributions
cient+0.064
 weaker+0.063
 sy+0.062
 boo+0.059
 wing+0.058
Neg logit contributions
ris-0.045
dad-0.042
aren-0.038
oys-0.038
Dan-0.038
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
cient+0.015
 sy+0.015
 weaker+0.015
 boo+0.014
 wing+0.014
Neg logit contributions
ris-0.010
dad-0.010
Dan-0.010
oys-0.009
aren-0.009
Token She
Token ID1375
Feature activation+6.80e-04

Pos logit contributions
cient+0.008
 weaker+0.008
 sy+0.008
 boo+0.007
onel+0.007
Neg logit contributions
ris-0.008
dad-0.007
oys-0.007
aren-0.007
Dan-0.007
Token enough
Token ID1576
Feature activation-1.75e+00

Pos logit contributions
cient+0.004
 weaker+0.004
 sy+0.004
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.002
dad-0.002
 than-0.002
oys-0.002
ure-0.002
Token to
Token ID284
Feature activation-3.81e-01

Pos logit contributions
ris+0.040
dad+0.037
 than+0.035
oys+0.034
Dan+0.033
Neg logit contributions
cient-0.059
 sy-0.056
 weaker-0.056
onel-0.054
 boo-0.053
Token scare
Token ID19437
Feature activation+1.51e-01

Pos logit contributions
ris+0.010
dad+0.009
Dan+0.009
ure+0.008
 than+0.008
Neg logit contributions
cient-0.012
 sy-0.011
 weaker-0.011
onel-0.011
 infected-0.010
Token away
Token ID1497
Feature activation-3.24e-01

Pos logit contributions
cient+0.004
 weaker+0.004
 sy+0.004
 boo+0.004
onel+0.004
Neg logit contributions
ris-0.005
dad-0.004
aren-0.004
oys-0.004
Dan-0.004
Token the
Token ID262
Feature activation+7.02e-01

Pos logit contributions
ris+0.008
dad+0.008
oys+0.007
 than+0.007
Dan+0.007
Neg logit contributions
cient-0.010
 weaker-0.009
 sy-0.009
 boo-0.009
onel-0.009
Token se
Token ID384
Feature activation+4.78e+00

Pos logit contributions
 weaker+0.019
cient+0.019
 sy+0.018
 boo+0.018
 wing+0.017
Neg logit contributions
ris-0.021
dad-0.019
aren-0.018
oys-0.018
ure-0.017
Tokenag
Token ID363
Feature activation-4.62e-03

Pos logit contributions
cient+0.091
 weaker+0.084
 sy+0.083
 boo+0.078
onel+0.075
Neg logit contributions
ris-0.173
dad-0.169
aren-0.162
Dan-0.161
oys-0.159
Tokenull
Token ID724
Feature activation+2.18e+00

Pos logit contributions
ris+0.000
dad+0.000
Dan+0.000
oys+0.000
ae+0.000
Neg logit contributions
 weaker-0.000
 sy-0.000
cient-0.000
 boo-0.000
 infected-0.000
Tokens
Token ID82
Feature activation+1.51e+00

Pos logit contributions
 sy+0.079
 weaker+0.078
 boo+0.076
 increasingly+0.075
cient+0.074
Neg logit contributions
 than-0.040
ris-0.037
 ahead-0.034
!-0.033
oe-0.032
Token!"
Token ID2474
Feature activation-3.36e-02

Pos logit contributions
cient+0.055
 sy+0.054
 weaker+0.052
 boo+0.050
onel+0.049
Neg logit contributions
ris-0.029
 than-0.028
dad-0.026
 ahead-0.024
oe-0.023
Token
Token ID198
Feature activation+6.28e-01

Pos logit contributions
ris+0.001
dad+0.001
aren+0.001
oys+0.001
Dan+0.001
Neg logit contributions
cient-0.001
 weaker-0.001
 sy-0.001
 boo-0.001
onel-0.001
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
aren-0.001
oys-0.001
Dan-0.001
Token beach
Token ID10481
Feature activation-1.07e-01

Pos logit contributions
 weaker+0.006
cient+0.006
 boo+0.006
 sy+0.006
 wing+0.006
Neg logit contributions
ris-0.005
aren-0.004
dad-0.004
oys-0.004
ure-0.004
Token to
Token ID284
Feature activation+2.60e-01

Pos logit contributions
ris+0.003
dad+0.003
aren+0.003
oys+0.002
Dan+0.002
Neg logit contributions
cient-0.003
 weaker-0.003
 sy-0.003
 wing-0.003
 boo-0.003
Token feed
Token ID3745
Feature activation-3.05e-01

Pos logit contributions
cient+0.007
 weaker+0.007
 sy+0.007
 boo+0.007
onel+0.007
Neg logit contributions
ris-0.007
dad-0.007
aren-0.006
oys-0.006
Dan-0.006
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
ris+0.008
dad+0.008
aren+0.007
oys+0.007
Dan+0.007
Neg logit contributions
cient-0.009
 weaker-0.008
 sy-0.008
 boo-0.008
onel-0.008
Token se
Token ID384
Feature activation+4.76e+00

Pos logit contributions
cient+0.025
 weaker+0.024
 sy+0.022
 boo+0.021
 wing+0.021
Neg logit contributions
ris-0.039
aren-0.034
dad-0.034
ae-0.033
ure-0.033
Tokenag
Token ID363
Feature activation+4.17e-01

Pos logit contributions
cient+0.090
 weaker+0.082
 sy+0.080
 boo+0.076
onel+0.073
Neg logit contributions
ris-0.173
dad-0.167
aren-0.162
Dan-0.160
oys-0.158
Tokenull
Token ID724
Feature activation+2.52e+00

Pos logit contributions
cient+0.010
 weaker+0.009
 sy+0.009
 boo+0.009
onel+0.008
Neg logit contributions
ris-0.014
dad-0.013
aren-0.012
oys-0.012
Dan-0.012
Tokens
Token ID82
Feature activation+1.77e+00

Pos logit contributions
cient+0.088
 weaker+0.088
 sy+0.087
 boo+0.084
onel+0.083
Neg logit contributions
ris-0.050
dad-0.047
 than-0.043
oys-0.041
ae-0.040
Token,
Token ID11
Feature activation+2.67e-01

Pos logit contributions
cient+0.068
 sy+0.067
 weaker+0.067
 boo+0.064
onel+0.063
Neg logit contributions
ris-0.030
dad-0.029
 than-0.025
oys-0.024
aren-0.024
Token but
Token ID475
Feature activation-1.40e+00

Pos logit contributions
cient+0.009
 weaker+0.009
 sy+0.009
onel+0.008
 boo+0.008
Neg logit contributions
ris-0.006
dad-0.006
 than-0.005
Dan-0.005
aren-0.005
Token obedient
Token ID41652
Feature activation-2.65e-01

Pos logit contributions
ris+0.047
dad+0.045
aren+0.042
oys+0.042
Dan+0.042
Neg logit contributions
cient-0.029
 weaker-0.027
 sy-0.027
 boo-0.025
onel-0.024
Tokenly
Token ID306
Feature activation-1.97e+00

Pos logit contributions
ris+0.007
Dan+0.006
aren+0.006
dad+0.006
oys+0.006
Neg logit contributions
cient-0.008
 sy-0.008
 weaker-0.007
 boo-0.007
 wing-0.007
Token and
Token ID290
Feature activation+3.36e-01

Pos logit contributions
ris+0.043
dad+0.041
aren+0.036
oys+0.036
Dan+0.035
Neg logit contributions
cient-0.067
 weaker-0.066
 sy-0.065
 boo-0.062
onel-0.061
Token watched
Token ID7342
Feature activation+4.81e-03

Pos logit contributions
cient+0.010
 weaker+0.010
 sy+0.010
onel+0.010
 wing+0.009
Neg logit contributions
ris-0.008
dad-0.008
Dan-0.007
ae-0.007
aren-0.007
Token the
Token ID262
Feature activation+4.48e-01

Pos logit contributions
cient+0.000
 weaker+0.000
 sy+0.000
onel+0.000
 infected+0.000
Neg logit contributions
dad-0.000
ris-0.000
 ahead-0.000
 than-0.000
Dan-0.000
Token se
Token ID384
Feature activation+4.73e+00

Pos logit contributions
cient+0.011
 weaker+0.011
 sy+0.010
 boo+0.010
 wing+0.010
Neg logit contributions
ris-0.015
dad-0.013
aren-0.013
oys-0.013
ae-0.013
Tokenag
Token ID363
Feature activation+3.74e-02

Pos logit contributions
cient+0.090
 weaker+0.084
 sy+0.083
 boo+0.078
onel+0.075
Neg logit contributions
ris-0.171
dad-0.166
aren-0.159
Dan-0.158
oys-0.157
Tokenull
Token ID724
Feature activation+2.14e+00

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
oys-0.001
Dan-0.001
aren-0.001
Token fly
Token ID6129
Feature activation+5.21e-01

Pos logit contributions
cient+0.065
 weaker+0.061
 sy+0.061
onel+0.060
 boo+0.060
Neg logit contributions
ris-0.050
 than-0.046
dad-0.046
ar-0.045
 ahead-0.044
Token away
Token ID1497
Feature activation-4.00e-01

Pos logit contributions
cient+0.014
 sy+0.014
 weaker+0.013
 boo+0.013
onel+0.012
Neg logit contributions
ris-0.015
dad-0.014
aren-0.013
 than-0.013
oys-0.013
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
ris+0.008
dad+0.007
aren+0.007
oys+0.006
Dan+0.006
Neg logit contributions
cient-0.014
 weaker-0.014
 sy-0.014
 boo-0.013
onel-0.013
Tokenmy
Token ID1820
Feature activation-6.59e-02

Pos logit contributions
cient+0.006
 sy+0.005
 weaker+0.005
 boo+0.005
onel+0.005
Neg logit contributions
ris-0.005
dad-0.005
aren-0.005
oys-0.005
Dan-0.004
Token and
Token ID290
Feature activation+7.38e-01

Pos logit contributions
ris+0.002
dad+0.002
aren+0.002
oys+0.002
Dan+0.002
Neg logit contributions
cient-0.002
 sy-0.002
 weaker-0.002
 boo-0.002
onel-0.002
Token Tom
Token ID4186
Feature activation-1.72e+00

Pos logit contributions
 weaker+0.026
cient+0.026
onel+0.024
 sy+0.024
 wing+0.023
Neg logit contributions
dad-0.016
ris-0.016
oys-0.014
aren-0.014
Dan-0.013
Token followed
Token ID3940
Feature activation+1.50e-01

Pos logit contributions
ris+0.048
dad+0.045
aren+0.041
oys+0.040
Dan+0.040
Neg logit contributions
 weaker-0.050
cient-0.049
 sy-0.048
 boo-0.045
 wing-0.044
Token the
Token ID262
Feature activation+1.93e-01

Pos logit contributions
 sy+0.003
cient+0.003
 boo+0.003
 weaker+0.003
 proc+0.003
Neg logit contributions
ris-0.005
dad-0.005
aren-0.004
oys-0.004
ae-0.004
Token se
Token ID384
Feature activation+4.67e+00

Pos logit contributions
cient+0.004
 weaker+0.004
 sy+0.004
 boo+0.003
 wing+0.003
Neg logit contributions
ris-0.007
dad-0.007
aren-0.006
oys-0.006
ae-0.006
Tokenag
Token ID363
Feature activation-3.51e-01

Pos logit contributions
cient+0.090
 sy+0.083
 weaker+0.083
 boo+0.078
onel+0.076
Neg logit contributions
ris-0.166
dad-0.161
Dan-0.154
aren-0.154
oys-0.153
Tokenull
Token ID724
Feature activation+1.73e+00

Pos logit contributions
ris+0.010
dad+0.010
Dan+0.009
oys+0.009
ae+0.009
Neg logit contributions
cient-0.010
 weaker-0.010
 sy-0.009
 boo-0.009
onel-0.008
Tokens
Token ID82
Feature activation+1.25e+00

Pos logit contributions
cient+0.051
 weaker+0.050
 sy+0.050
 boo+0.048
onel+0.046
Neg logit contributions
ris-0.045
dad-0.042
oys-0.039
aren-0.038
Dan-0.038
Token all
Token ID477
Feature activation+3.52e-01

Pos logit contributions
cient+0.044
 weaker+0.043
 sy+0.042
 boo+0.041
 wing+0.040
Neg logit contributions
ris-0.026
dad-0.024
aren-0.022
oys-0.022
Dan-0.022
Token around
Token ID1088
Feature activation-2.50e-01

Pos logit contributions
cient+0.010
 sy+0.010
 weaker+0.009
 boo+0.009
onel+0.009
Neg logit contributions
ris-0.010
dad-0.010
aren-0.009
 than-0.009
oys-0.009
Token a
Token ID257
Feature activation-6.66e-01

Pos logit contributions
ris+0.005
dad+0.005
aren+0.004
oys+0.004
Dan+0.004
Neg logit contributions
cient-0.004
 sy-0.004
 weaker-0.004
onel-0.004
 boo-0.004
Token plan
Token ID1410
Feature activation+4.23e-01

Pos logit contributions
ris+0.018
dad+0.017
Dan+0.016
aren+0.016
oys+0.015
Neg logit contributions
cient-0.019
onel-0.018
 weaker-0.018
 sy-0.017
 infected-0.016
Token to
Token ID284
Feature activation+1.65e-01

Pos logit contributions
 sy+0.010
 weaker+0.010
cient+0.009
 wing+0.009
 worse+0.009
Neg logit contributions
dad-0.012
Dan-0.012
ae-0.012
ris-0.012
atherine-0.012
Token distract
Token ID11786
Feature activation-1.50e+00

Pos logit contributions
cient+0.004
 weaker+0.004
 sy+0.004
onel+0.004
 boo+0.004
Neg logit contributions
ris-0.005
dad-0.005
oys-0.004
Dan-0.004
aren-0.004
Token the
Token ID262
Feature activation+1.93e-01

Pos logit contributions
ris+0.053
dad+0.049
ae+0.047
aren+0.047
oys+0.047
Neg logit contributions
cient-0.032
 weaker-0.031
 sy-0.030
 boo-0.029
 wing-0.027
Token se
Token ID384
Feature activation+4.66e+00

Pos logit contributions
cient+0.004
 weaker+0.004
 sy+0.004
 boo+0.004
 wing+0.004
Neg logit contributions
ris-0.007
aren-0.006
dad-0.006
oys-0.006
ure-0.006
Tokenag
Token ID363
Feature activation-1.44e-01

Pos logit contributions
cient+0.080
 weaker+0.072
 sy+0.071
 boo+0.067
onel+0.065
Neg logit contributions
ris-0.173
dad-0.170
aren-0.166
Dan-0.164
oys-0.160
Tokenull
Token ID724
Feature activation+1.96e+00

Pos logit contributions
ris+0.005
dad+0.005
aren+0.004
oys+0.004
Dan+0.004
Neg logit contributions
cient-0.003
 weaker-0.003
 sy-0.003
 boo-0.003
 wing-0.003
Token while
Token ID981
Feature activation-1.33e+00

Pos logit contributions
cient+0.062
 weaker+0.061
 sy+0.061
 boo+0.059
onel+0.057
Neg logit contributions
ris-0.047
dad-0.042
oys-0.041
 than-0.040
aren-0.038
Token Tim
Token ID5045
Feature activation+6.69e-02

Pos logit contributions
ris+0.034
dad+0.033
oys+0.031
Dan+0.030
aren+0.030
Neg logit contributions
cient-0.040
 sy-0.038
 weaker-0.037
 boo-0.036
onel-0.035
Tokenmy
Token ID1820
Feature activation-6.07e-01

Pos logit contributions
cient+0.002
 weaker+0.002
 sy+0.002
 boo+0.002
onel+0.002
Neg logit contributions
ris-0.002
dad-0.002
aren-0.002
oys-0.002
Dan-0.002
TokenBe
Token ID3856
Feature activation+6.21e-01

Pos logit contributions
ris+0.005
dad+0.005
oys+0.005
aren+0.005
Dan+0.005
Neg logit contributions
cient-0.006
 weaker-0.005
 sy-0.005
 boo-0.005
onel-0.005
Tokenet
Token ID316
Feature activation-1.69e-01

Pos logit contributions
cient+0.017
 sy+0.017
 weaker+0.016
 boo+0.015
 proc+0.015
Neg logit contributions
ris-0.018
dad-0.017
 than-0.016
Dan-0.016
oys-0.016
Tokenle
Token ID293
Feature activation-5.94e-01

Pos logit contributions
ris+0.005
dad+0.004
aren+0.004
Dan+0.004
ar+0.004
Neg logit contributions
 weaker-0.005
 sy-0.005
cient-0.005
 boo-0.005
onel-0.005
Token watched
Token ID7342
Feature activation-1.29e-01

Pos logit contributions
ris+0.013
dad+0.012
ar+0.012
 than+0.012
ure+0.011
Neg logit contributions
 sy-0.019
 weaker-0.019
cient-0.019
 boo-0.018
 infected-0.017
Token the
Token ID262
Feature activation+3.71e-01

Pos logit contributions
dad+0.003
ris+0.003
 than+0.003
Dan+0.003
 ahead+0.003
Neg logit contributions
cient-0.004
 weaker-0.004
 sy-0.004
onel-0.004
 boo-0.004
Token se
Token ID384
Feature activation+4.61e+00

Pos logit contributions
cient+0.009
 weaker+0.009
 sy+0.009
 boo+0.008
 wing+0.008
Neg logit contributions
ris-0.012
dad-0.011
aren-0.011
oys-0.011
Dan-0.010
Tokenag
Token ID363
Feature activation-2.59e-01

Pos logit contributions
cient+0.100
 weaker+0.094
 sy+0.093
 boo+0.088
onel+0.085
Neg logit contributions
ris-0.156
dad-0.151
aren-0.143
oys-0.142
Dan-0.141
Tokenull
Token ID724
Feature activation+1.84e+00

Pos logit contributions
ris+0.008
dad+0.007
Dan+0.007
oys+0.007
ae+0.007
Neg logit contributions
cient-0.007
 sy-0.007
 weaker-0.007
 boo-0.006
 infected-0.006
Token until
Token ID1566
Feature activation-4.76e-01

Pos logit contributions
 sy+0.050
cient+0.049
 weaker+0.047
 boo+0.046
onel+0.045
Neg logit contributions
ris-0.048
 than-0.044
ar-0.044
 ahead-0.043
dad-0.043
Token she
Token ID673
Feature activation+1.14e-01

Pos logit contributions
ris+0.013
dad+0.012
oys+0.011
aren+0.011
Dan+0.011
Neg logit contributions
cient-0.014
 weaker-0.014
 sy-0.014
 boo-0.013
onel-0.013
Token could
Token ID714
Feature activation+2.98e-01

Pos logit contributions
cient+0.003
 weaker+0.003
 sy+0.003
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.003
dad-0.003
Dan-0.003
aren-0.003
oys-0.003
Token okay
Token ID8788
Feature activation+3.75e-02

Pos logit contributions
cient+0.058
 sy+0.050
onel+0.049
xc+0.047
arre+0.046
Neg logit contributions
 than-0.060
ris-0.057
 ahead-0.057
dad-0.056
!-0.054
Token and
Token ID290
Feature activation+9.77e-01

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
oys-0.001
 than-0.001
aren-0.001
Token the
Token ID262
Feature activation+8.45e-01

Pos logit contributions
ivery+0.038
onel+0.037
cient+0.036
arre+0.035
 sy+0.035
Neg logit contributions
 ranger-0.015
ris-0.014
 ahead-0.013
 ever-0.012
 payoff-0.012
Token car
Token ID1097
Feature activation+9.26e-01

Pos logit contributions
cient+0.026
onel+0.024
 sy+0.024
 weaker+0.024
 proc+0.023
Neg logit contributions
ris-0.021
dad-0.019
oys-0.018
aren-0.017
Dan-0.017
Token nodded
Token ID14464
Feature activation+1.32e-01

Pos logit contributions
cient+0.029
 weaker+0.028
 sy+0.027
onel+0.027
 boo+0.025
Neg logit contributions
ris-0.024
dad-0.020
ar-0.019
 than-0.019
oys-0.019
Token its
Token ID663
Feature activation+4.60e+00

Pos logit contributions
cient+0.005
 sy+0.005
 weaker+0.005
onel+0.005
 boo+0.005
Neg logit contributions
ris-0.002
dad-0.002
 than-0.002
oys-0.002
Dan-0.002
Token head
Token ID1182
Feature activation+1.53e+00

Pos logit contributions
cient+0.141
 sy+0.135
 weaker+0.134
onel+0.126
 boo+0.126
Neg logit contributions
ris-0.118
dad-0.109
oys-0.102
 than-0.099
Dan-0.098
Token.
Token ID13
Feature activation+6.21e-01

Pos logit contributions
cient+0.063
 sy+0.060
 weaker+0.060
 boo+0.058
onel+0.057
Neg logit contributions
ris-0.024
dad-0.021
 than-0.018
oys-0.018
aren-0.017
Token The
Token ID383
Feature activation-2.65e-02

Pos logit contributions
 weaker+0.019
 sy+0.018
cient+0.018
onel+0.018
 boo+0.017
Neg logit contributions
ris-0.016
dad-0.015
aren-0.014
ure-0.014
oys-0.014
Token truck
Token ID7779
Feature activation+2.94e-01

Pos logit contributions
ris+0.001
dad+0.001
oys+0.001
aren+0.001
Dan+0.001
Neg logit contributions
cient-0.001
 sy-0.001
 weaker-0.001
onel-0.001
 boo-0.001
Token hooked
Token ID23373
Feature activation-1.96e-01

Pos logit contributions
cient+0.010
 sy+0.010
 weaker+0.009
onel+0.009
 boo+0.009
Neg logit contributions
ris-0.007
dad-0.006
oys-0.005
aren-0.005
Dan-0.005
Token
Token ID26391
Feature activation+1.03e-01

Pos logit contributions
cient+0.019
 boo+0.018
 proc+0.018
 sy+0.018
have+0.017
Neg logit contributions
ris-0.025
 than-0.023
Dan-0.023
oys-0.022
ae-0.022
Token
Token ID8151
Feature activation-1.06e-01

Pos logit contributions
cient+0.003
 sy+0.002
 proc+0.002
 boo+0.002
 Mos+0.002
Neg logit contributions
ris-0.003
dad-0.003
oys-0.003
ae-0.003
ure-0.003
Tokens
Token ID82
Feature activation+1.34e+00

Pos logit contributions
ris+0.003
dad+0.003
oys+0.003
aren+0.003
Dan+0.003
Neg logit contributions
cient-0.003
 weaker-0.003
 sy-0.003
 boo-0.003
onel-0.003
Token follow
Token ID1061
Feature activation+5.26e-02

Pos logit contributions
cient+0.035
 weaker+0.034
 sy+0.034
 boo+0.032
onel+0.031
Neg logit contributions
ris-0.040
dad-0.038
aren-0.035
oys-0.035
Dan-0.035
Token the
Token ID262
Feature activation+5.79e-01

Pos logit contributions
cient+0.001
 sy+0.001
 boo+0.001
 weaker+0.001
 wing+0.001
Neg logit contributions
ris-0.002
dad-0.001
oys-0.001
aren-0.001
ae-0.001
Token se
Token ID384
Feature activation+4.59e+00

Pos logit contributions
cient+0.015
 sy+0.015
 weaker+0.014
 boo+0.013
 wing+0.013
Neg logit contributions
ris-0.018
ae-0.016
dad-0.015
aren-0.015
ar-0.015
Tokenag
Token ID363
Feature activation-3.89e-01

Pos logit contributions
cient+0.087
 sy+0.079
 weaker+0.078
 boo+0.074
onel+0.072
Neg logit contributions
ris-0.164
dad-0.159
Dan-0.153
aren-0.153
oys-0.151
Tokenull
Token ID724
Feature activation+1.78e+00

Pos logit contributions
ris+0.010
dad+0.009
Dan+0.009
oys+0.009
aren+0.008
Neg logit contributions
cient-0.012
 weaker-0.012
 sy-0.012
 boo-0.011
onel-0.011
Tokens
Token ID82
Feature activation+1.66e+00

Pos logit contributions
 weaker+0.052
cient+0.051
 sy+0.050
 boo+0.049
onel+0.047
Neg logit contributions
ris-0.048
dad-0.044
oys-0.041
aren-0.040
 than-0.040
Token!
Token ID0
Feature activation+2.56e-01

Pos logit contributions
cient+0.062
 weaker+0.060
 sy+0.060
 boo+0.058
onel+0.056
Neg logit contributions
ris-0.031
dad-0.029
aren-0.025
oys-0.025
Dan-0.025
Tokenâ
Token ID22940
Feature activation+3.13e-01

Pos logit contributions
cient+0.007
 weaker+0.007
 sy+0.007
 boo+0.007
onel+0.007
Neg logit contributions
ris-0.007
dad-0.006
aren-0.006
oys-0.006
Dan-0.006
Token Tom
Token ID4186
Feature activation-1.82e+00

Pos logit contributions
 weaker+0.015
cient+0.015
 boo+0.014
 sy+0.014
 worse+0.014
Neg logit contributions
ris-0.011
dad-0.010
ure-0.009
oys-0.009
aren-0.009
Token said
Token ID531
Feature activation-5.49e-01

Pos logit contributions
ris+0.046
dad+0.044
aren+0.039
oys+0.039
Dan+0.038
Neg logit contributions
cient-0.057
 weaker-0.056
 sy-0.054
onel-0.052
 boo-0.051
Token goodbye
Token ID24829
Feature activation+5.11e-01

Pos logit contributions
dad+0.009
ar+0.008
ris+0.008
ae+0.008
oe+0.007
Neg logit contributions
cient-0.022
 weaker-0.021
 sy-0.020
 infected-0.020
 boo-0.020
Token to
Token ID284
Feature activation+7.72e-01

Pos logit contributions
 wing+0.009
 boo+0.009
 sy+0.009
cient+0.008
 weaker+0.008
Neg logit contributions
ris-0.019
atherine-0.018
 recess-0.018
Dan-0.018
dad-0.018
Token the
Token ID262
Feature activation+9.48e-02

Pos logit contributions
cient+0.016
 weaker+0.016
 sy+0.016
 boo+0.015
 wing+0.014
Neg logit contributions
ris-0.027
dad-0.025
oys-0.024
ure-0.024
aren-0.024
Token se
Token ID384
Feature activation+4.57e+00

Pos logit contributions
cient+0.002
 weaker+0.002
 sy+0.002
 boo+0.002
 wing+0.002
Neg logit contributions
ris-0.003
dad-0.003
aren-0.003
oys-0.003
Dan-0.003
Tokenag
Token ID363
Feature activation-1.82e-01

Pos logit contributions
cient+0.083
 weaker+0.074
 sy+0.073
 boo+0.070
onel+0.067
Neg logit contributions
ris-0.164
dad-0.160
Dan-0.156
oys-0.155
aren-0.153
Tokenull
Token ID724
Feature activation+1.64e+00

Pos logit contributions
ris+0.006
dad+0.006
aren+0.005
oys+0.005
Dan+0.005
Neg logit contributions
cient-0.004
 weaker-0.004
 sy-0.004
 boo-0.004
onel-0.004
Tokens
Token ID82
Feature activation+1.08e+00

Pos logit contributions
cient+0.053
 weaker+0.051
 sy+0.051
 boo+0.048
 wing+0.047
Neg logit contributions
ris-0.039
dad-0.036
aren-0.033
oys-0.033
Dan-0.033
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
cient+0.039
 weaker+0.038
 sy+0.038
 boo+0.037
onel+0.036
Neg logit contributions
ris-0.021
dad-0.020
aren-0.017
oys-0.017
Dan-0.017
Token put
Token ID1234
Feature activation-5.74e-01

Pos logit contributions
cient+0.035
 weaker+0.034
 sy+0.033
onel+0.032
 boo+0.031
Neg logit contributions
ris-0.031
dad-0.030
Dan-0.027
aren-0.027
oys-0.026
Token running
Token ID2491
Feature activation+5.70e-01

Pos logit contributions
cient+0.025
 sy+0.022
 weaker+0.022
onel+0.022
 boo+0.020
Neg logit contributions
ris-0.023
dad-0.022
Dan-0.020
 than-0.020
ae-0.019
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 weaker+0.015
 worse+0.014
 wing+0.013
 asleep+0.013
onel+0.013
Neg logit contributions
aren-0.017
ris-0.017
dad-0.016
atherine-0.016
andra-0.016
Token He
Token ID679
Feature activation-6.61e-01

Pos logit contributions
cient+0.003
 sy+0.003
 weaker+0.003
onel+0.003
 boo+0.003
Neg logit contributions
ris-0.002
dad-0.002
oys-0.002
aren-0.002
Dan-0.002
Token chased
Token ID26172
Feature activation-2.38e-01

Pos logit contributions
ris+0.019
dad+0.018
Dan+0.016
aren+0.016
ure+0.015
Neg logit contributions
cient-0.019
 sy-0.018
 weaker-0.018
onel-0.017
 boo-0.017
Token the
Token ID262
Feature activation+9.27e-02

Pos logit contributions
ris+0.008
aren+0.008
ae+0.008
cle+0.008
andra+0.008
Neg logit contributions
 boo-0.005
 weaker-0.004
 worse-0.004
 Bub-0.004
 en-0.004
Token se
Token ID384
Feature activation+4.46e+00

Pos logit contributions
 weaker+0.002
 boo+0.002
cient+0.002
 sy+0.002
 wing+0.002
Neg logit contributions
ris-0.003
aren-0.003
dad-0.003
atherine-0.003
ure-0.002
Tokenag
Token ID363
Feature activation-4.82e-01

Pos logit contributions
cient+0.069
 sy+0.062
 weaker+0.061
 boo+0.059
onel+0.058
Neg logit contributions
ris-0.170
dad-0.169
aren-0.165
Dan-0.163
Adam-0.159
Tokenull
Token ID724
Feature activation+1.56e+00

Pos logit contributions
ris+0.016
dad+0.015
aren+0.015
oys+0.014
ure+0.014
Neg logit contributions
cient-0.011
 weaker-0.010
 sy-0.010
 boo-0.009
onel-0.009
Tokens
Token ID82
Feature activation+7.84e-01

Pos logit contributions
cient+0.043
 weaker+0.042
 sy+0.041
 boo+0.039
onel+0.038
Neg logit contributions
ris-0.044
dad-0.042
aren-0.039
oys-0.038
Dan-0.038
Token as
Token ID355
Feature activation-1.06e+00

Pos logit contributions
 weaker+0.022
cient+0.021
 sy+0.020
 boo+0.020
 wing+0.020
Neg logit contributions
ris-0.022
dad-0.021
ae-0.020
aren-0.020
oys-0.020
Token they
Token ID484
Feature activation-9.65e-02

Pos logit contributions
dad+0.028
ris+0.028
aren+0.026
oys+0.025
ure+0.025
Neg logit contributions
 weaker-0.031
cient-0.029
 sy-0.028
 worse-0.028
 boo-0.027
Token quiet
Token ID5897
Feature activation+5.13e-01

Pos logit contributions
ris+0.017
dad+0.016
oys+0.016
aren+0.015
Dan+0.015
Neg logit contributions
cient-0.014
 weaker-0.014
 boo-0.014
 sy-0.014
 wing-0.014
Token compared
Token ID3688
Feature activation-1.07e+00

Pos logit contributions
cient+0.020
 weaker+0.020
 sy+0.019
onel+0.018
 boo+0.018
Neg logit contributions
ris-0.008
dad-0.008
 than-0.007
oys-0.007
aren-0.006
Token to
Token ID284
Feature activation+2.14e-01

Pos logit contributions
ris+0.031
dad+0.030
aren+0.027
oys+0.027
Dan+0.027
Neg logit contributions
cient-0.029
 weaker-0.028
 sy-0.028
 boo-0.026
onel-0.025
Token the
Token ID262
Feature activation-2.25e-01

Pos logit contributions
cient+0.006
 weaker+0.006
 sy+0.006
onel+0.005
 boo+0.005
Neg logit contributions
ris-0.006
dad-0.006
Dan-0.005
 than-0.005
oys-0.005
Token noisy
Token ID31210
Feature activation+4.11e-02

Pos logit contributions
ris+0.006
dad+0.006
oys+0.005
 than+0.005
aren+0.005
Neg logit contributions
cient-0.006
 weaker-0.006
 sy-0.006
onel-0.006
 boo-0.006
Token se
Token ID384
Feature activation+4.41e+00

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
 wing+0.001
Neg logit contributions
ris-0.001
dad-0.001
aren-0.001
oys-0.001
Dan-0.001
Tokenag
Token ID363
Feature activation-7.65e-01

Pos logit contributions
cient+0.120
 weaker+0.117
 sy+0.116
 boo+0.109
onel+0.105
Neg logit contributions
ris-0.128
dad-0.121
oys-0.110
aren-0.110
Dan-0.108
Tokenull
Token ID724
Feature activation+1.94e+00

Pos logit contributions
ris+0.022
dad+0.021
oys+0.019
Dan+0.019
 than+0.019
Neg logit contributions
cient-0.021
 sy-0.021
 weaker-0.021
 boo-0.020
onel-0.018
Tokens
Token ID82
Feature activation+1.34e+00

Pos logit contributions
 weaker+0.062
cient+0.061
 sy+0.060
 boo+0.058
onel+0.057
Neg logit contributions
ris-0.044
dad-0.040
 than-0.040
ar-0.037
oys-0.037
Token outside
Token ID2354
Feature activation+8.31e-01

Pos logit contributions
cient+0.049
 sy+0.048
 weaker+0.047
 boo+0.045
onel+0.044
Neg logit contributions
ris-0.026
dad-0.023
 than-0.023
oys-0.021
aren-0.020
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
cient+0.034
 sy+0.033
 weaker+0.032
 boo+0.032
onel+0.031
Neg logit contributions
ris-0.012
dad-0.012
 than-0.011
oys-0.010
Dan-0.009
Token wave
Token ID6769
Feature activation+1.04e+00

Pos logit contributions
 boo+0.002
 sy+0.002
cient+0.002
 weaker+0.002
 wing+0.002
Neg logit contributions
ris-0.002
dad-0.001
ae-0.001
oys-0.001
aren-0.001
Token came
Token ID1625
Feature activation+7.98e-02

Pos logit contributions
cient+0.033
 weaker+0.032
 sy+0.031
 boo+0.030
onel+0.029
Neg logit contributions
ris-0.025
dad-0.024
oys-0.022
ar-0.021
Dan-0.021
Token and
Token ID290
Feature activation+5.50e-01

Pos logit contributions
cient+0.003
 sy+0.003
 boo+0.003
onel+0.003
 weaker+0.003
Neg logit contributions
ris-0.001
 ahead-0.001
 than-0.001
!-0.001
oys-0.001
Token knocked
Token ID13642
Feature activation-6.36e-01

Pos logit contributions
onel+0.018
 sy+0.017
cient+0.017
 wing+0.017
ivery+0.016
Neg logit contributions
 ahead-0.012
ris-0.011
dad-0.011
 ranger-0.011
oe-0.011
Token the
Token ID262
Feature activation-7.54e-02

Pos logit contributions
ris+0.020
dad+0.019
aren+0.018
oys+0.018
Dan+0.017
Neg logit contributions
cient-0.016
 weaker-0.015
 sy-0.015
 boo-0.014
onel-0.014
Token se
Token ID384
Feature activation+4.40e+00

Pos logit contributions
ris+0.003
aren+0.002
dad+0.002
ae+0.002
oys+0.002
Neg logit contributions
 weaker-0.002
 boo-0.002
cient-0.002
 sy-0.002
 wing-0.002
Tokenag
Token ID363
Feature activation+1.47e-01

Pos logit contributions
cient+0.088
 weaker+0.083
 sy+0.080
 boo+0.076
onel+0.074
Neg logit contributions
ris-0.152
dad-0.148
aren-0.144
Dan-0.142
oys-0.141
Tokenull
Token ID724
Feature activation+2.35e+00

Pos logit contributions
cient+0.003
 weaker+0.003
 sy+0.003
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.005
dad-0.005
aren-0.004
oys-0.004
Dan-0.004
Token off
Token ID572
Feature activation+1.86e+00

Pos logit contributions
cient+0.070
 sy+0.068
 boo+0.066
 weaker+0.064
onel+0.063
Neg logit contributions
ris-0.058
oe-0.053
 ahead-0.052
ar-0.051
 than-0.051
Token course
Token ID1781
Feature activation-9.31e-01

Pos logit contributions
cient+0.050
 weaker+0.049
 sy+0.049
 boo+0.046
 wing+0.045
Neg logit contributions
ris-0.054
dad-0.051
oys-0.047
aren-0.047
Dan-0.047
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
dad+0.021
ris+0.021
ae+0.021
Dan+0.020
Adam+0.019
Neg logit contributions
cient-0.028
 weaker-0.027
 sy-0.026
 wing-0.026
 boo-0.026
Token easy
Token ID2562
Feature activation+1.11e+00

Pos logit contributions
cient+0.080
 sy+0.078
 weaker+0.074
onel+0.073
 boo+0.071
Neg logit contributions
ris-0.061
dad-0.057
 than-0.055
oys-0.053
Dan-0.052
Token!
Token ID0
Feature activation+3.60e-01

Pos logit contributions
cient+0.034
 weaker+0.033
 sy+0.032
 boo+0.031
onel+0.030
Neg logit contributions
ris-0.029
dad-0.027
aren-0.025
Dan-0.024
oys-0.024
Token Just
Token ID2329
Feature activation-6.38e-01

Pos logit contributions
 weaker+0.011
 sy+0.011
cient+0.010
 boo+0.010
onel+0.010
Neg logit contributions
ris-0.009
oys-0.008
dad-0.008
aren-0.008
ure-0.008
Token follow
Token ID1061
Feature activation+3.32e-01

Pos logit contributions
ris+0.015
dad+0.014
 than+0.014
oys+0.013
ure+0.013
Neg logit contributions
cient-0.020
 sy-0.019
 weaker-0.019
onel-0.018
 infected-0.018
Token the
Token ID262
Feature activation+7.29e-01

Pos logit contributions
cient+0.009
 weaker+0.008
 sy+0.008
 boo+0.008
 wing+0.008
Neg logit contributions
ris-0.010
dad-0.009
aren-0.009
oys-0.009
Dan-0.009
Token se
Token ID384
Feature activation+4.38e+00

Pos logit contributions
cient+0.019
 weaker+0.018
 sy+0.017
 boo+0.017
 wing+0.016
Neg logit contributions
ris-0.024
aren-0.022
dad-0.021
atherine-0.020
oys-0.020
Tokenag
Token ID363
Feature activation-2.40e-01

Pos logit contributions
cient+0.072
 weaker+0.064
 sy+0.063
 boo+0.059
onel+0.057
Neg logit contributions
ris-0.167
dad-0.165
aren-0.160
Dan-0.159
oys-0.155
Tokenull
Token ID724
Feature activation+1.51e+00

Pos logit contributions
ris+0.007
dad+0.007
oys+0.006
Dan+0.006
ae+0.006
Neg logit contributions
 sy-0.006
cient-0.006
 weaker-0.006
 boo-0.006
 infected-0.005
Tokens
Token ID82
Feature activation+1.68e+00

Pos logit contributions
 sy+0.041
 weaker+0.040
cient+0.039
 boo+0.039
onel+0.037
Neg logit contributions
ris-0.043
dad-0.039
 than-0.038
oys-0.037
ure-0.036
Token.
Token ID13
Feature activation+2.60e-01

Pos logit contributions
cient+0.054
 weaker+0.051
 sy+0.050
 wing+0.048
 boo+0.048
Neg logit contributions
ris-0.042
dad-0.040
aren-0.036
Dan-0.036
oys-0.035
Token They
Token ID1119
Feature activation-3.76e-01

Pos logit contributions
 weaker+0.008
 sy+0.008
cient+0.007
 boo+0.007
onel+0.007
Neg logit contributions
ris-0.007
oys-0.006
dad-0.006
ure-0.006
aren-0.006
Token beetle
Token ID45489
Feature activation+1.25e+00

Pos logit contributions
ris+0.016
dad+0.016
Dan+0.015
oys+0.014
aren+0.014
Neg logit contributions
cient-0.015
 sy-0.015
 weaker-0.015
onel-0.014
 boo-0.014
Token waved
Token ID26834
Feature activation+1.49e-01

Pos logit contributions
cient+0.038
 weaker+0.037
 sy+0.037
onel+0.034
 boo+0.034
Neg logit contributions
ris-0.033
dad-0.030
ar-0.028
 than-0.027
Dan-0.027
Token back
Token ID736
Feature activation-1.78e-01

Pos logit contributions
cient+0.004
 weaker+0.004
 sy+0.004
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.005
dad-0.004
aren-0.004
oys-0.004
Dan-0.004
Token and
Token ID290
Feature activation+6.96e-01

Pos logit contributions
ris+0.004
dad+0.004
aren+0.004
oys+0.004
Dan+0.004
Neg logit contributions
cient-0.006
 weaker-0.005
 sy-0.005
 boo-0.005
 wing-0.005
Token the
Token ID262
Feature activation+1.79e-01

Pos logit contributions
cient+0.020
 sy+0.019
 weaker+0.019
onel+0.018
 boo+0.018
Neg logit contributions
ris-0.018
dad-0.018
aren-0.017
Dan-0.017
 than-0.016
Token se
Token ID384
Feature activation+4.29e+00

Pos logit contributions
cient+0.005
 weaker+0.004
 sy+0.004
 boo+0.004
onel+0.004
Neg logit contributions
ris-0.005
dad-0.005
aren-0.005
oys-0.005
Dan-0.005
Tokenag
Token ID363
Feature activation-4.61e-01

Pos logit contributions
cient+0.103
 weaker+0.099
 sy+0.098
 boo+0.092
onel+0.090
Neg logit contributions
ris-0.135
dad-0.130
aren-0.122
oys-0.121
Dan-0.121
Tokenull
Token ID724
Feature activation+1.53e+00

Pos logit contributions
ris+0.014
dad+0.014
Dan+0.013
oys+0.013
aren+0.013
Neg logit contributions
cient-0.012
 sy-0.011
 weaker-0.011
 boo-0.010
onel-0.010
Token flew
Token ID13112
Feature activation+1.31e-01

Pos logit contributions
cient+0.040
 weaker+0.039
 sy+0.039
 boo+0.037
 proc+0.036
Neg logit contributions
ris-0.044
dad-0.040
ar-0.039
 than-0.039
 ahead-0.037
Token away
Token ID1497
Feature activation-6.46e-01

Pos logit contributions
cient+0.003
 weaker+0.003
 sy+0.003
 boo+0.003
 wing+0.003
Neg logit contributions
ris-0.004
dad-0.004
aren-0.004
oys-0.004
ae-0.004
Token.
Token ID13
Feature activation+2.81e-01

Pos logit contributions
ris+0.013
dad+0.013
aren+0.011
oys+0.011
Dan+0.011
Neg logit contributions
cient-0.023
 weaker-0.022
 sy-0.022
 boo-0.021
onel-0.021
Token say
Token ID910
Feature activation-3.38e-01

Pos logit contributions
cient+0.016
 weaker+0.016
 sy+0.016
 boo+0.015
onel+0.014
Neg logit contributions
ris-0.019
dad-0.018
aren-0.017
oys-0.017
Dan-0.017
Token and
Token ID290
Feature activation+3.60e-01

Pos logit contributions
ris+0.005
dad+0.005
 than+0.005
oys+0.005
 ahead+0.004
Neg logit contributions
cient-0.013
 weaker-0.013
 sy-0.013
 boo-0.012
onel-0.012
Token you
Token ID345
Feature activation+1.22e+00

Pos logit contributions
cient+0.011
 weaker+0.010
 sy+0.010
onel+0.010
 infected+0.009
Neg logit contributions
ris-0.009
dad-0.009
oys-0.009
 than-0.009
aren-0.008
Token help
Token ID1037
Feature activation+2.08e+00

Pos logit contributions
 weaker+0.029
cient+0.028
 sy+0.028
 boo+0.026
onel+0.026
Neg logit contributions
ris-0.041
dad-0.036
oys-0.035
 than-0.034
ure-0.034
Token each
Token ID1123
Feature activation+1.00e+00

Pos logit contributions
cient+0.054
 weaker+0.052
 sy+0.052
 boo+0.050
 wing+0.048
Neg logit contributions
ris-0.063
dad-0.057
aren-0.055
Dan-0.053
ae-0.053
Token other
Token ID584
Feature activation+4.26e+00

Pos logit contributions
cient+0.027
 sy+0.026
onel+0.025
 weaker+0.025
 boo+0.024
Neg logit contributions
ris-0.029
dad-0.028
oys-0.027
Dan-0.026
 than-0.025
Token.
Token ID13
Feature activation-7.43e-02

Pos logit contributions
cient+0.180
 sy+0.171
onel+0.167
 weaker+0.166
 boo+0.160
Neg logit contributions
ris-0.058
dad-0.053
 than-0.051
oys-0.051
 ahead-0.043
Token Do
Token ID2141
Feature activation+4.71e-01

Pos logit contributions
ris+0.002
dad+0.002
aren+0.002
oys+0.002
Dan+0.002
Neg logit contributions
cient-0.002
 weaker-0.002
 sy-0.002
 boo-0.002
onel-0.002
Token you
Token ID345
Feature activation+1.14e+00

Pos logit contributions
cient+0.014
 weaker+0.014
 sy+0.014
 boo+0.013
onel+0.013
Neg logit contributions
ris-0.012
dad-0.011
oys-0.010
aren-0.010
Dan-0.010
Token understand
Token ID1833
Feature activation+5.56e-01

Pos logit contributions
cient+0.028
 sy+0.027
 weaker+0.027
onel+0.024
 helicopter+0.023
Neg logit contributions
ris-0.037
dad-0.035
oys-0.033
 than-0.033
Dan-0.032
Token?"
Token ID1701
Feature activation-7.83e-01

Pos logit contributions
cient+0.012
 weaker+0.012
onel+0.012
 sy+0.012
 boo+0.011
Neg logit contributions
ris-0.018
dad-0.018
oys-0.017
 than-0.017
Dan-0.017
Token will
Token ID481
Feature activation-6.15e-01

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
onel+0.001
 wing+0.001
Neg logit contributions
ris-0.001
dad-0.001
oys-0.001
aren-0.001
Dan-0.001
Token try
Token ID1949
Feature activation+2.37e-01

Pos logit contributions
ris+0.019
dad+0.019
aren+0.018
ae+0.017
Dan+0.017
Neg logit contributions
cient-0.015
 sy-0.015
 weaker-0.015
 boo-0.014
 wing-0.013
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
cient+0.008
 sy+0.007
onel+0.007
 weaker+0.007
 boo+0.007
Neg logit contributions
ris-0.006
dad-0.005
oys-0.005
 than-0.005
Dan-0.005
Token fix
Token ID4259
Feature activation+3.15e-01

Pos logit contributions
cient+0.028
 weaker+0.026
 sy+0.026
onel+0.024
 boo+0.024
Neg logit contributions
ris-0.032
dad-0.030
oys-0.028
aren-0.028
Dan-0.028
Token your
Token ID534
Feature activation+1.42e+00

Pos logit contributions
cient+0.007
 weaker+0.007
 sy+0.007
 boo+0.006
 worse+0.006
Neg logit contributions
ris-0.011
dad-0.010
oys-0.010
aren-0.010
Dan-0.010
Token aer
Token ID9551
Feature activation+4.25e+00

Pos logit contributions
cient+0.033
 weaker+0.030
 sy+0.029
onel+0.028
 increasingly+0.026
Neg logit contributions
ris-0.045
dad-0.043
 than-0.042
 years-0.040
oys-0.039
Tokenopl
Token ID20106
Feature activation+8.69e-01

Pos logit contributions
cient+0.101
 sy+0.098
 weaker+0.096
 wing+0.092
 boo+0.089
Neg logit contributions
dad-0.128
ris-0.123
oys-0.120
Dan-0.120
aren-0.118
Tokenane
Token ID1531
Feature activation+2.64e+00

Pos logit contributions
cient+0.021
 sy+0.020
 weaker+0.020
 boo+0.018
onel+0.018
Neg logit contributions
ris-0.027
dad-0.026
aren-0.025
oys-0.024
Dan-0.024
Token or
Token ID393
Feature activation-6.28e-01

Pos logit contributions
cient+0.104
 sy+0.100
 weaker+0.098
onel+0.095
 boo+0.094
Neg logit contributions
ris-0.044
 than-0.040
dad-0.035
oys-0.033
oe-0.033
Token get
Token ID651
Feature activation+6.90e-01

Pos logit contributions
 than+0.014
ris+0.013
 ahead+0.012
dad+0.012
Tom+0.012
Neg logit contributions
cient-0.021
onel-0.019
 weaker-0.019
 sy-0.019
add-0.018
Token you
Token ID345
Feature activation+8.26e-01

Pos logit contributions
cient+0.023
 sy+0.020
onel+0.019
 weaker+0.019
ivery+0.019
Neg logit contributions
 ahead-0.015
dad-0.015
ris-0.014
oe-0.014
 than-0.013
Token the
Token ID262
Feature activation-5.61e-01

Pos logit contributions
cient+0.020
 weaker+0.019
 sy+0.019
 boo+0.018
onel+0.018
Neg logit contributions
ris-0.024
dad-0.023
aren-0.021
oys-0.021
Dan-0.021
Token lake
Token ID13546
Feature activation-6.56e-02

Pos logit contributions
ris+0.017
dad+0.016
aren+0.015
oys+0.015
 than+0.015
Neg logit contributions
 weaker-0.015
cient-0.015
 sy-0.014
 boo-0.014
 wing-0.013
Token and
Token ID290
Feature activation+2.96e-01

Pos logit contributions
ris+0.002
dad+0.002
aren+0.001
atherine+0.001
Dan+0.001
Neg logit contributions
 weaker-0.002
cient-0.002
 sy-0.002
 wing-0.002
 worse-0.002
Token put
Token ID1234
Feature activation-5.47e-01

Pos logit contributions
cient+0.009
 sy+0.009
 weaker+0.009
onel+0.008
 boo+0.008
Neg logit contributions
ris-0.008
dad-0.007
Dan-0.007
aren-0.007
oys-0.007
Token their
Token ID511
Feature activation+7.05e-01

Pos logit contributions
ris+0.021
dad+0.019
ae+0.019
ure+0.019
oys+0.018
Neg logit contributions
 weaker-0.010
 sy-0.010
cient-0.010
 boo-0.009
 wing-0.008
Token kay
Token ID34681
Feature activation+4.24e+00

Pos logit contributions
cient+0.020
 weaker+0.019
 sy+0.019
onel+0.018
 boo+0.017
Neg logit contributions
ris-0.019
dad-0.018
oys-0.017
aren-0.016
Dan-0.016
Tokenak
Token ID461
Feature activation+9.37e-01

Pos logit contributions
 weaker+0.092
cient+0.089
 sy+0.087
onel+0.085
 wing+0.083
Neg logit contributions
dad-0.126
 Rebecca-0.125
ris-0.122
oph-0.122
 Sophia-0.121
Token in
Token ID287
Feature activation+1.39e+00

Pos logit contributions
cient+0.027
 sy+0.025
 weaker+0.024
 boo+0.024
onel+0.023
Neg logit contributions
ris-0.027
dad-0.025
oys-0.024
 than-0.024
oe-0.023
Token the
Token ID262
Feature activation+2.50e-01

Pos logit contributions
 weaker+0.021
 sy+0.019
 wing+0.017
 boo+0.017
 infected+0.017
Neg logit contributions
ris-0.056
thel-0.055
dad-0.055
ae-0.055
ager-0.054
Token water
Token ID1660
Feature activation+4.16e-01

Pos logit contributions
cient+0.007
 weaker+0.007
 sy+0.007
 boo+0.007
onel+0.007
Neg logit contributions
ris-0.007
dad-0.006
oys-0.006
aren-0.006
Dan-0.006
Token.
Token ID13
Feature activation+5.55e-02

Pos logit contributions
cient+0.015
 weaker+0.015
 sy+0.015
 boo+0.014
 wing+0.014
Neg logit contributions
ris-0.008
dad-0.007
aren-0.007
Dan-0.007
oys-0.006
Token But
Token ID887
Feature activation-1.36e+00

Pos logit contributions
 weaker+0.008
cient+0.008
 sy+0.008
onel+0.007
 boo+0.007
Neg logit contributions
ris-0.006
dad-0.006
oys-0.006
aren-0.006
Dan-0.005
Token then
Token ID788
Feature activation+6.92e-01

Pos logit contributions
ris+0.034
dad+0.032
oys+0.030
aren+0.030
Dan+0.029
Neg logit contributions
cient-0.042
 sy-0.041
 weaker-0.040
onel-0.038
 boo-0.038
Token,
Token ID11
Feature activation+4.57e-01

Pos logit contributions
cient+0.023
 weaker+0.023
 sy+0.022
 boo+0.022
onel+0.021
Neg logit contributions
ris-0.016
dad-0.015
aren-0.013
oys-0.013
Dan-0.013
Token a
Token ID257
Feature activation+1.63e+00

Pos logit contributions
cient+0.017
 sy+0.017
 weaker+0.016
onel+0.016
 wing+0.015
Neg logit contributions
 recess-0.007
dad-0.007
ris-0.007
 ahead-0.007
Tom-0.007
Token big
Token ID1263
Feature activation+4.34e-01

Pos logit contributions
cient+0.036
 weaker+0.035
 sy+0.035
 boo+0.033
 wing+0.031
Neg logit contributions
ris-0.055
dad-0.052
aren-0.049
oys-0.049
Dan-0.048
Token se
Token ID384
Feature activation+4.21e+00

Pos logit contributions
 boo+0.012
cient+0.011
 weaker+0.011
 sy+0.011
 wing+0.011
Neg logit contributions
ris-0.014
ae-0.012
aren-0.012
dad-0.012
oys-0.011
Tokenag
Token ID363
Feature activation-2.80e-01

Pos logit contributions
cient+0.072
 sy+0.065
 weaker+0.063
 boo+0.063
onel+0.061
Neg logit contributions
dad-0.155
ris-0.154
Dan-0.149
ae-0.148
aren-0.147
Tokenull
Token ID724
Feature activation+1.59e+00

Pos logit contributions
ris+0.010
dad+0.009
aren+0.009
oys+0.009
Dan+0.008
Neg logit contributions
cient-0.006
 weaker-0.006
 sy-0.006
 boo-0.005
 wing-0.005
Token saw
Token ID2497
Feature activation+4.82e-02

Pos logit contributions
cient+0.043
 weaker+0.043
 sy+0.041
 boo+0.040
onel+0.040
Neg logit contributions
ris-0.044
dad-0.039
ar-0.038
 than-0.037
oys-0.037
Token the
Token ID262
Feature activation-3.93e-01

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
oys-0.001
aren-0.001
Dan-0.001
Token oy
Token ID35104
Feature activation+1.78e+00

Pos logit contributions
ris+0.013
aren+0.012
dad+0.012
oys+0.011
ure+0.011
Neg logit contributions
cient-0.009
 weaker-0.009
 boo-0.009
 sy-0.009
 wing-0.008
Token watched
Token ID7342
Feature activation-4.50e-01

Pos logit contributions
ris+0.036
dad+0.034
oys+0.033
aren+0.032
Dan+0.032
Neg logit contributions
cient-0.028
 weaker-0.027
 sy-0.027
 boo-0.025
onel-0.024
Token the
Token ID262
Feature activation+1.70e-01

Pos logit contributions
dad+0.010
ris+0.010
oys+0.009
ae+0.009
 than+0.009
Neg logit contributions
cient-0.015
 weaker-0.014
 sy-0.014
onel-0.013
 infected-0.013
Token waves
Token ID9813
Feature activation+1.50e+00

Pos logit contributions
cient+0.005
 weaker+0.005
 sy+0.005
onel+0.004
 boo+0.004
Neg logit contributions
ris-0.005
dad-0.005
oys-0.004
Dan-0.004
aren-0.004
Token and
Token ID290
Feature activation-5.63e-03

Pos logit contributions
cient+0.052
 weaker+0.050
 sy+0.049
onel+0.048
 infected+0.047
Neg logit contributions
ris-0.031
dad-0.029
 than-0.026
oys-0.025
aren-0.024
Token the
Token ID262
Feature activation+2.95e-01

Pos logit contributions
ris+0.000
dad+0.000
Dan+0.000
aren+0.000
oys+0.000
Neg logit contributions
cient-0.000
 sy-0.000
 weaker-0.000
onel-0.000
 boo-0.000
Token se
Token ID384
Feature activation+4.20e+00

Pos logit contributions
cient+0.008
 weaker+0.007
 sy+0.007
onel+0.007
 boo+0.007
Neg logit contributions
ris-0.008
dad-0.008
Dan-0.008
oys-0.008
aren-0.008
Tokenag
Token ID363
Feature activation-2.25e-01

Pos logit contributions
cient+0.086
 sy+0.081
 weaker+0.081
 boo+0.076
onel+0.073
Neg logit contributions
ris-0.146
dad-0.142
Dan-0.135
aren-0.134
oys-0.133
Tokenull
Token ID724
Feature activation+1.82e+00

Pos logit contributions
ris+0.007
dad+0.007
oys+0.006
Dan+0.006
aren+0.006
Neg logit contributions
cient-0.006
 weaker-0.005
 sy-0.005
 boo-0.005
onel-0.005
Tokens
Token ID82
Feature activation+1.48e+00

Pos logit contributions
cient+0.054
 weaker+0.051
 sy+0.050
onel+0.050
 boo+0.049
Neg logit contributions
ris-0.046
dad-0.041
 than-0.040
ar-0.039
oe-0.037
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
cient+0.061
 sy+0.058
 weaker+0.058
onel+0.055
 boo+0.055
Neg logit contributions
ris-0.021
dad-0.020
 than-0.018
oys-0.016
Dan-0.016
Token They
Token ID1119
Feature activation-1.19e+00

Pos logit contributions
ris+0.011
dad+0.010
aren+0.010
ure+0.009
oys+0.009
Neg logit contributions
cient-0.008
 sy-0.008
 weaker-0.008
 boo-0.007
 wing-0.007
Token I
Token ID314
Feature activation-4.69e-01

Pos logit contributions
ris+0.030
dad+0.029
oys+0.027
 than+0.026
Dan+0.026
Neg logit contributions
cient-0.038
 sy-0.037
 weaker-0.036
onel-0.034
 boo-0.034
Token don
Token ID836
Feature activation-1.84e+00

Pos logit contributions
ris+0.015
dad+0.015
oys+0.014
aren+0.014
ae+0.013
Neg logit contributions
cient-0.010
 sy-0.010
 weaker-0.010
 boo-0.010
 infected-0.009
Token't
Token ID470
Feature activation-4.52e-01

Pos logit contributions
ris+0.038
dad+0.035
oys+0.033
ure+0.032
Dan+0.032
Neg logit contributions
cient-0.064
 weaker-0.062
 sy-0.061
 boo-0.061
onel-0.059
Token like
Token ID588
Feature activation-1.92e+00

Pos logit contributions
ris+0.015
aren+0.014
dad+0.014
oys+0.014
ae+0.014
Neg logit contributions
 boo-0.010
 worse-0.009
cient-0.009
 weaker-0.009
 wing-0.009
Token rac
Token ID3444
Feature activation-4.80e-01

Pos logit contributions
dad+0.041
 than+0.040
ris+0.039
Dan+0.037
 years+0.036
Neg logit contributions
cient-0.063
onel-0.060
 sy-0.058
 weaker-0.058
ivery-0.055
Tokenco
Token ID1073
Feature activation-4.16e+00

Pos logit contributions
ris+0.014
dad+0.014
aren+0.013
Dan+0.012
oys+0.012
Neg logit contributions
cient-0.013
 sy-0.012
 weaker-0.012
 boo-0.011
onel-0.011
Tokenons
Token ID684
Feature activation+3.63e-02

Pos logit contributions
ris+0.096
dad+0.094
oys+0.094
aren+0.089
ure+0.087
Neg logit contributions
cient-0.122
 weaker-0.115
 sy-0.111
 boo-0.107
 proc-0.105
Token.
Token ID13
Feature activation-6.78e-01

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
 than-0.001
oys-0.000
ure-0.000
Token They
Token ID1119
Feature activation-8.09e-01

Pos logit contributions
ris+0.018
dad+0.016
oys+0.016
Dan+0.015
aren+0.015
Neg logit contributions
 weaker-0.020
cient-0.020
 sy-0.019
 boo-0.018
onel-0.018
Token are
Token ID389
Feature activation-9.79e-01

Pos logit contributions
ris+0.026
dad+0.025
 than+0.023
ure+0.023
oys+0.023
Neg logit contributions
cient-0.018
 weaker-0.018
 sy-0.018
 boo-0.016
onel-0.016
Token mean
Token ID1612
Feature activation-2.53e-01

Pos logit contributions
ris+0.028
dad+0.026
 than+0.026
oys+0.025
Dan+0.024
Neg logit contributions
cient-0.026
 sy-0.025
 weaker-0.025
onel-0.023
 boo-0.023
Token Mom
Token ID11254
Feature activation-3.77e-01

Pos logit contributions
ris+0.029
dad+0.026
aren+0.026
ure+0.026
oys+0.025
Neg logit contributions
 weaker-0.017
cient-0.017
 sy-0.016
 boo-0.016
 wing-0.014
Token and
Token ID290
Feature activation-8.53e-01

Pos logit contributions
ris+0.009
dad+0.008
Dan+0.008
oys+0.008
ure+0.008
Neg logit contributions
 weaker-0.012
 sy-0.011
cient-0.011
 boo-0.011
 worse-0.011
Token they
Token ID484
Feature activation-2.07e-03

Pos logit contributions
dad+0.018
ris+0.017
aren+0.016
Dan+0.015
oys+0.015
Neg logit contributions
cient-0.029
 weaker-0.028
 sy-0.028
onel-0.027
 infected-0.026
Token loved
Token ID6151
Feature activation-4.78e-01

Pos logit contributions
ris+0.000
dad+0.000
aren+0.000
oys+0.000
Dan+0.000
Neg logit contributions
cient-0.000
 weaker-0.000
 sy-0.000
 boo-0.000
onel-0.000
Token mac
Token ID8352
Feature activation-4.12e-01

Pos logit contributions
ris+0.018
aren+0.016
dad+0.016
ure+0.016
oys+0.016
Neg logit contributions
 weaker-0.009
cient-0.009
 sy-0.009
 boo-0.009
 worse-0.008
Tokenaron
Token ID8045
Feature activation-4.04e+00

Pos logit contributions
aren+0.011
ris+0.010
Dan+0.010
dad+0.010
atherine+0.009
Neg logit contributions
 sy-0.012
 asleep-0.011
 boo-0.011
 wing-0.011
 weaker-0.011
Tokeni
Token ID72
Feature activation-1.94e-01

Pos logit contributions
ris+0.117
dad+0.101
aren+0.099
oys+0.099
Dan+0.097
Neg logit contributions
cient-0.116
 weaker-0.111
 sy-0.107
 boo-0.103
 worse-0.097
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
ris+0.004
dad+0.004
aren+0.003
Dan+0.003
oys+0.003
Neg logit contributions
cient-0.007
 weaker-0.007
 sy-0.007
 boo-0.006
 wing-0.006
Token cheese
Token ID9891
Feature activation-1.31e+00

Pos logit contributions
ris+0.004
dad+0.004
 than+0.004
Dan+0.003
ae+0.003
Neg logit contributions
cient-0.006
 weaker-0.005
onel-0.005
 sy-0.005
 proc-0.005
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
ris+0.026
dad+0.024
aren+0.023
oys+0.022
Dan+0.022
Neg logit contributions
cient-0.046
 weaker-0.046
 sy-0.046
 boo-0.043
 wing-0.043
Token\n
Token ID198
Feature activation-4.34e-01

Pos logit contributions
ris+0.008
dad+0.008
ure+0.007
ar+0.007
aren+0.007
Neg logit contributions
 sy-0.005
cient-0.005
 weaker-0.005
 boo-0.005
 wing-0.005
Token their
Token ID511
Feature activation-7.05e-01

Pos logit contributions
dad+0.009
ris+0.009
ae+0.008
Dan+0.008
oys+0.008
Neg logit contributions
cient-0.010
 sy-0.010
 weaker-0.009
onel-0.009
 boo-0.009
Token favorite
Token ID4004
Feature activation-4.11e-01

Pos logit contributions
ris+0.020
dad+0.019
aren+0.018
oys+0.018
Dan+0.018
Neg logit contributions
cient-0.019
 weaker-0.019
 sy-0.018
 boo-0.018
 wing-0.017
Token food
Token ID2057
Feature activation-1.79e-01

Pos logit contributions
ris+0.010
dad+0.010
 than+0.009
oys+0.009
ae+0.009
Neg logit contributions
cient-0.013
 weaker-0.012
 sy-0.012
onel-0.012
 boo-0.011
Token:
Token ID25
Feature activation-4.70e-01

Pos logit contributions
ris+0.004
dad+0.004
aren+0.004
Dan+0.004
oys+0.003
Neg logit contributions
 sy-0.006
 weaker-0.006
cient-0.006
 boo-0.005
 wing-0.005
Token mac
Token ID8352
Feature activation-5.26e-01

Pos logit contributions
dad+0.011
ris+0.011
 than+0.011
Dan+0.010
 Sophia+0.010
Neg logit contributions
cient-0.015
onel-0.014
 weaker-0.014
xc-0.013
 sy-0.013
Tokenaron
Token ID8045
Feature activation-4.02e+00

Pos logit contributions
aren+0.014
ris+0.013
dad+0.012
 recess+0.012
Dan+0.012
Neg logit contributions
 sy-0.017
 boo-0.014
 wing-0.014
cient-0.014
 asleep-0.014
Tokeni
Token ID72
Feature activation-3.36e-01

Pos logit contributions
ris+0.113
dad+0.099
aren+0.096
oys+0.095
Dan+0.094
Neg logit contributions
cient-0.118
 sy-0.111
 weaker-0.110
 boo-0.106
 wing-0.100
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
ris+0.008
dad+0.007
aren+0.007
Dan+0.007
oys+0.007
Neg logit contributions
cient-0.011
 sy-0.011
 weaker-0.010
 boo-0.010
 wing-0.010
Token cheese
Token ID9891
Feature activation-1.39e+00

Pos logit contributions
cient+0.000
onel+0.000
 weaker+0.000
 proc+0.000
 sy+0.000
Neg logit contributions
ris-0.000
dad-0.000
 than-0.000
Dan-0.000
ae-0.000
Token.
Token ID13
Feature activation-4.02e-01

Pos logit contributions
ris+0.029
dad+0.027
aren+0.025
Dan+0.024
oys+0.024
Neg logit contributions
cient-0.048
 sy-0.048
 weaker-0.047
 boo-0.045
 wing-0.044
Token\n
Token ID198
Feature activation-6.77e-01

Pos logit contributions
ris+0.010
dad+0.010
oys+0.009
aren+0.009
Dan+0.009
Neg logit contributions
cient-0.012
 weaker-0.012
 sy-0.012
 boo-0.011
onel-0.011
Token broccoli
Token ID44653
Feature activation+7.72e-01

Pos logit contributions
ris+0.004
dad+0.004
oys+0.003
aren+0.003
Dan+0.003
Neg logit contributions
cient-0.003
 sy-0.003
 weaker-0.003
 boo-0.002
onel-0.002
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
cient+0.032
 sy+0.030
 weaker+0.030
 boo+0.029
onel+0.028
Neg logit contributions
ris-0.011
dad-0.011
 than-0.011
oys-0.009
 ahead-0.009
Token She
Token ID1375
Feature activation-5.45e-02

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
aren-0.001
oys-0.001
Dan-0.000
Token wanted
Token ID2227
Feature activation-1.62e-01

Pos logit contributions
ris+0.001
dad+0.001
aren+0.001
ure+0.001
ae+0.001
Neg logit contributions
cient-0.002
 weaker-0.002
 sy-0.002
onel-0.002
 wing-0.001
Token mac
Token ID8352
Feature activation-3.29e-01

Pos logit contributions
ris+0.005
dad+0.004
aren+0.004
oys+0.004
Dan+0.004
Neg logit contributions
cient-0.004
 weaker-0.004
 sy-0.004
 boo-0.004
onel-0.004
Tokenaron
Token ID8045
Feature activation-4.00e+00

Pos logit contributions
ris+0.008
aren+0.007
dad+0.007
Dan+0.007
oe+0.007
Neg logit contributions
 sy-0.010
cient-0.010
 boo-0.009
onel-0.009
 weaker-0.009
Tokeni
Token ID72
Feature activation-2.18e-01

Pos logit contributions
ris+0.092
dad+0.087
aren+0.078
oys+0.078
Dan+0.077
Neg logit contributions
cient-0.131
 weaker-0.128
 sy-0.127
 boo-0.121
onel-0.119
Token and
Token ID290
Feature activation-6.59e-02

Pos logit contributions
ris+0.005
dad+0.004
 than+0.004
oys+0.004
ae+0.004
Neg logit contributions
cient-0.007
 weaker-0.007
 sy-0.007
 boo-0.007
onel-0.007
Token cheese
Token ID9891
Feature activation-1.35e+00

Pos logit contributions
dad+0.001
ris+0.001
ae+0.001
 than+0.001
Dan+0.001
Neg logit contributions
cient-0.002
onel-0.002
 weaker-0.002
 sy-0.002
 proc-0.002
Token instead
Token ID2427
Feature activation-2.00e-01

Pos logit contributions
ris+0.024
dad+0.023
oys+0.020
aren+0.019
Dan+0.019
Neg logit contributions
cient-0.051
 weaker-0.050
 sy-0.049
 boo-0.047
onel-0.046
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
ris+0.003
dad+0.003
aren+0.003
oys+0.003
Dan+0.002
Neg logit contributions
cient-0.008
 weaker-0.008
 sy-0.008
 boo-0.007
onel-0.007
Token crying
Token ID13774
Feature activation+6.43e-01

Pos logit contributions
ris+0.030
dad+0.028
aren+0.027
oys+0.027
Dan+0.026
Neg logit contributions
cient-0.025
 weaker-0.024
 sy-0.024
 boo-0.022
onel-0.022
Token.
Token ID13
Feature activation+9.22e-03

Pos logit contributions
cient+0.025
 weaker+0.024
 sy+0.024
 boo+0.023
onel+0.022
Neg logit contributions
ris-0.011
dad-0.011
oys-0.009
aren-0.009
Dan-0.009
Token She
Token ID1375
Feature activation-1.56e-01

Pos logit contributions
cient+0.000
 sy+0.000
 weaker+0.000
 boo+0.000
onel+0.000
Neg logit contributions
ris-0.000
dad-0.000
ae-0.000
oys-0.000
Dan-0.000
Token wanted
Token ID2227
Feature activation-4.71e-03

Pos logit contributions
ris+0.004
dad+0.004
aren+0.004
Dan+0.004
ae+0.004
Neg logit contributions
cient-0.004
 weaker-0.004
 sy-0.004
onel-0.004
 boo-0.004
Token mac
Token ID8352
Feature activation-3.47e-01

Pos logit contributions
ris+0.000
dad+0.000
oys+0.000
aren+0.000
Dan+0.000
Neg logit contributions
 weaker-0.000
cient-0.000
 sy-0.000
 boo-0.000
 worse-0.000
Tokenaron
Token ID8045
Feature activation-3.99e+00

Pos logit contributions
ris+0.008
aren+0.008
Dan+0.007
dad+0.007
oe+0.007
Neg logit contributions
 sy-0.011
cient-0.010
onel-0.010
 boo-0.010
 weaker-0.010
Tokeni
Token ID72
Feature activation-3.54e-01

Pos logit contributions
ris+0.094
dad+0.088
aren+0.080
oys+0.080
Dan+0.079
Neg logit contributions
cient-0.129
 weaker-0.126
 sy-0.125
 boo-0.119
onel-0.116
Token and
Token ID290
Feature activation-7.94e-02

Pos logit contributions
ris+0.008
dad+0.008
 than+0.007
ae+0.007
oys+0.007
Neg logit contributions
cient-0.011
 weaker-0.011
 sy-0.011
 boo-0.010
onel-0.010
Token cheese
Token ID9891
Feature activation-1.35e+00

Pos logit contributions
dad+0.002
ris+0.002
ae+0.002
 than+0.002
Dan+0.002
Neg logit contributions
cient-0.002
onel-0.002
 weaker-0.002
 sy-0.002
 proc-0.002
Token.
Token ID13
Feature activation+6.58e-02

Pos logit contributions
ris+0.027
dad+0.025
aren+0.022
oys+0.022
Dan+0.022
Neg logit contributions
cient-0.048
 weaker-0.047
 sy-0.047
 boo-0.045
onel-0.044
Token It
Token ID632
Feature activation-1.93e-01

Pos logit contributions
cient+0.002
 weaker+0.002
 sy+0.002
 boo+0.002
onel+0.002
Neg logit contributions
ris-0.002
dad-0.002
ae-0.002
oys-0.002
Dan-0.002
Token Dad
Token ID17415
Feature activation+1.16e+00

Pos logit contributions
ris+0.004
dad+0.004
oys+0.003
aren+0.003
Dan+0.003
Neg logit contributions
cient-0.005
 weaker-0.005
 sy-0.005
 boo-0.005
onel-0.004
Token said
Token ID531
Feature activation-4.07e-02

Pos logit contributions
cient+0.024
 sy+0.024
 weaker+0.022
 boo+0.022
onel+0.020
Neg logit contributions
ris-0.039
dad-0.036
Dan-0.035
aren-0.035
oys-0.035
Token.
Token ID13
Feature activation+1.61e-01

Pos logit contributions
ris+0.001
dad+0.001
aren+0.001
oys+0.001
Dan+0.001
Neg logit contributions
cient-0.002
 weaker-0.001
 sy-0.001
 boo-0.001
onel-0.001
Token "
Token ID366
Feature activation+7.57e-01

Pos logit contributions
cient+0.003
 sy+0.003
 weaker+0.003
 boo+0.003
 wing+0.003
Neg logit contributions
ris-0.006
dad-0.005
aren-0.005
oys-0.005
ure-0.005
TokenGate
Token ID22628
Feature activation-1.70e+00

Pos logit contributions
cient+0.019
 sy+0.019
 weaker+0.019
 boo+0.018
 wing+0.017
Neg logit contributions
ris-0.022
dad-0.022
aren-0.020
ar-0.019
ae-0.019
Token 12
Token ID1105
Feature activation-3.96e+00

Pos logit contributions
dad+0.046
ris+0.045
aren+0.041
oys+0.041
Dan+0.040
Neg logit contributions
cient-0.045
 weaker-0.043
 sy-0.042
 proc-0.040
 worse-0.040
Token.
Token ID13
Feature activation-7.13e-01

Pos logit contributions
dad+0.094
ris+0.094
aren+0.085
oys+0.084
Dan+0.081
Neg logit contributions
cient-0.118
 weaker-0.118
 sy-0.115
 wing-0.109
onel-0.108
Token Come
Token ID7911
Feature activation-8.54e-01

Pos logit contributions
ris+0.022
dad+0.021
aren+0.019
Dan+0.019
ae+0.019
Neg logit contributions
cient-0.018
 sy-0.017
 weaker-0.016
 boo-0.016
onel-0.015
Token on
Token ID319
Feature activation+2.02e+00

Pos logit contributions
dad+0.031
ris+0.031
ae+0.029
aren+0.029
Dan+0.028
Neg logit contributions
cient-0.016
 sy-0.014
 weaker-0.014
 boo-0.013
 wing-0.013
Token,
Token ID11
Feature activation-2.47e-02

Pos logit contributions
cient+0.063
 weaker+0.061
 sy+0.060
 boo+0.058
 wing+0.056
Neg logit contributions
ris-0.050
dad-0.047
aren-0.043
oys-0.042
Dan-0.042
Token let
Token ID1309
Feature activation+1.45e+00

Pos logit contributions
ris+0.001
dad+0.001
oys+0.001
aren+0.001
Dan+0.001
Neg logit contributions
cient-0.001
 weaker-0.001
 sy-0.001
 boo-0.001
onel-0.001
Token had
Token ID550
Feature activation-2.12e-01

Pos logit contributions
ris+0.024
dad+0.023
aren+0.021
Dan+0.021
oys+0.020
Neg logit contributions
cient-0.022
 sy-0.022
 weaker-0.022
 boo-0.020
onel-0.020
Token fun
Token ID1257
Feature activation-1.72e+00

Pos logit contributions
ris+0.005
dad+0.005
Dan+0.004
aren+0.004
 than+0.004
Neg logit contributions
cient-0.007
 sy-0.007
 weaker-0.007
onel-0.006
 boo-0.006
Token playing
Token ID2712
Feature activation+3.88e-01

Pos logit contributions
ris+0.060
dad+0.060
aren+0.056
oys+0.055
ae+0.055
Neg logit contributions
cient-0.034
 weaker-0.033
 sy-0.033
 boo-0.031
onel-0.030
Token together
Token ID1978
Feature activation-1.65e+00

Pos logit contributions
cient+0.009
 weaker+0.009
 sy+0.009
 boo+0.008
onel+0.008
Neg logit contributions
ris-0.013
dad-0.012
aren-0.011
oys-0.011
Dan-0.011
Token at
Token ID379
Feature activation-1.71e+00

Pos logit contributions
ris+0.042
dad+0.041
oys+0.039
aren+0.039
Dan+0.038
Neg logit contributions
 weaker-0.046
cient-0.046
 sy-0.044
onel-0.043
 wing-0.043
Token recess
Token ID28836
Feature activation-3.94e+00

Pos logit contributions
ris+0.051
dad+0.049
oys+0.047
aren+0.046
ure+0.046
Neg logit contributions
 weaker-0.043
cient-0.042
 boo-0.039
 sy-0.039
 wing-0.037
Token.
Token ID13
Feature activation+1.65e-01

Pos logit contributions
ris+0.090
dad+0.087
aren+0.080
Dan+0.079
oys+0.078
Neg logit contributions
cient-0.123
 weaker-0.118
 sy-0.117
onel-0.114
 boo-0.112
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
cient+0.004
 weaker+0.004
 sy+0.004
 boo+0.004
onel+0.004
Neg logit contributions
ris-0.005
dad-0.005
ae-0.004
ure-0.004
Dan-0.004
Token
Token ID198
Feature activation-4.04e-02

Pos logit contributions
cient+0.006
 weaker+0.006
 sy+0.006
 boo+0.006
onel+0.005
Neg logit contributions
ris-0.006
dad-0.005
aren-0.005
oys-0.005
Dan-0.005
TokenAfter
Token ID3260
Feature activation-7.67e-01

Pos logit contributions
ris+0.001
dad+0.001
oys+0.001
aren+0.001
Dan+0.001
Neg logit contributions
cient-0.001
 weaker-0.001
 sy-0.001
 boo-0.001
onel-0.001
Token school
Token ID1524
Feature activation-1.67e+00

Pos logit contributions
ris+0.027
dad+0.026
ae+0.024
aren+0.024
 than+0.024
Neg logit contributions
 weaker-0.015
cient-0.014
 sy-0.014
 boo-0.014
 worse-0.013
Token nodded
Token ID14464
Feature activation-8.50e-01

Pos logit contributions
cient+0.005
 weaker+0.004
 sy+0.004
 boo+0.004
onel+0.004
Neg logit contributions
ris-0.003
dad-0.002
aren-0.002
oys-0.002
Dan-0.002
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
dad+0.012
ris+0.012
 than+0.010
oys+0.010
Dan+0.009
Neg logit contributions
cient-0.035
 weaker-0.034
 sy-0.033
onel-0.032
 boo-0.032
Token She
Token ID1375
Feature activation+1.81e-01

Pos logit contributions
cient+0.003
 sy+0.003
 weaker+0.003
 boo+0.003
 wing+0.003
Neg logit contributions
ris-0.005
dad-0.005
aren-0.004
oys-0.004
Dan-0.004
Token liked
Token ID8288
Feature activation-1.73e+00

Pos logit contributions
cient+0.005
 weaker+0.005
 sy+0.005
 boo+0.004
onel+0.004
Neg logit contributions
ris-0.005
dad-0.005
oys-0.005
aren-0.005
Dan-0.005
Token mac
Token ID8352
Feature activation-6.63e-01

Pos logit contributions
ris+0.049
dad+0.047
aren+0.043
oys+0.043
Dan+0.043
Neg logit contributions
cient-0.047
 weaker-0.046
 sy-0.046
 boo-0.043
onel-0.042
Tokenaron
Token ID8045
Feature activation-3.92e+00

Pos logit contributions
ris+0.017
aren+0.017
dad+0.016
Dan+0.016
 recess+0.015
Neg logit contributions
 sy-0.020
 boo-0.017
cient-0.017
 wing-0.017
 means-0.017
Tokeni
Token ID72
Feature activation-3.92e-01

Pos logit contributions
ris+0.103
dad+0.092
oys+0.088
aren+0.087
Dan+0.086
Neg logit contributions
cient-0.120
 weaker-0.116
 sy-0.114
 boo-0.110
 infected-0.103
Token and
Token ID290
Feature activation-1.85e-01

Pos logit contributions
ris+0.010
dad+0.009
aren+0.009
oys+0.008
Dan+0.008
Neg logit contributions
cient-0.012
 weaker-0.012
 sy-0.012
 boo-0.011
onel-0.011
Token cheese
Token ID9891
Feature activation-1.36e+00

Pos logit contributions
ris+0.004
dad+0.004
 than+0.004
Dan+0.004
oys+0.003
Neg logit contributions
cient-0.006
 weaker-0.006
onel-0.006
 sy-0.005
 proc-0.005
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
ris+0.026
dad+0.024
aren+0.021
oys+0.021
Dan+0.021
Neg logit contributions
cient-0.050
 weaker-0.049
 sy-0.049
 boo-0.047
onel-0.046
Token She
Token ID1375
Feature activation-1.09e-01

Pos logit contributions
ris+0.008
dad+0.008
aren+0.007
oys+0.007
ure+0.007
Neg logit contributions
cient-0.005
 sy-0.005
 weaker-0.005
 boo-0.005
onel-0.005
Token
Token ID198
Feature activation-4.94e-01

Pos logit contributions
ris+0.012
dad+0.012
aren+0.010
oys+0.010
ure+0.010
Neg logit contributions
cient-0.011
 sy-0.011
 weaker-0.011
 boo-0.010
onel-0.010
Token
Token ID198
Feature activation-7.07e-01

Pos logit contributions
ris+0.013
dad+0.012
aren+0.011
oys+0.011
ure+0.011
Neg logit contributions
cient-0.014
 weaker-0.014
 sy-0.014
 boo-0.013
onel-0.013
TokenShe
Token ID3347
Feature activation-7.15e-02

Pos logit contributions
ris+0.021
dad+0.020
aren+0.019
ae+0.019
ure+0.018
Neg logit contributions
 weaker-0.016
 sy-0.016
cient-0.016
 worse-0.015
 boo-0.014
Token jumped
Token ID11687
Feature activation-4.27e-01

Pos logit contributions
ris+0.002
dad+0.002
aren+0.002
oys+0.002
Dan+0.002
Neg logit contributions
cient-0.002
 weaker-0.002
 sy-0.002
 boo-0.002
onel-0.002
Token on
Token ID319
Feature activation+7.36e-01

Pos logit contributions
ris+0.013
dad+0.013
oys+0.012
 than+0.012
Dan+0.012
Neg logit contributions
cient-0.011
 sy-0.010
 weaker-0.010
 boo-0.010
onel-0.009
Token Tom
Token ID4186
Feature activation-3.89e+00

Pos logit contributions
 weaker+0.010
 boo+0.010
 sy+0.010
 wing+0.009
 worse+0.009
Neg logit contributions
ris-0.030
dad-0.028
oys-0.028
ager-0.028
aren-0.027
Token and
Token ID290
Feature activation-4.86e-02

Pos logit contributions
ris+0.113
dad+0.108
aren+0.100
oys+0.100
Dan+0.099
Neg logit contributions
cient-0.103
 weaker-0.100
 sy-0.100
 boo-0.095
onel-0.091
Token pushed
Token ID7121
Feature activation-8.31e-01

Pos logit contributions
ris+0.001
dad+0.001
oys+0.001
aren+0.001
Dan+0.001
Neg logit contributions
cient-0.001
 sy-0.001
 weaker-0.001
 boo-0.001
onel-0.001
Token him
Token ID683
Feature activation-4.46e-01

Pos logit contributions
ris+0.030
aren+0.028
ae+0.027
ager+0.027
dad+0.027
Neg logit contributions
 weaker-0.016
 worse-0.016
 boo-0.015
 sy-0.014
 infected-0.014
Token away
Token ID1497
Feature activation-5.34e-01

Pos logit contributions
ris+0.012
dad+0.011
oys+0.010
aren+0.010
Dan+0.010
Neg logit contributions
cient-0.013
 weaker-0.013
 sy-0.013
 boo-0.012
onel-0.012
Token from
Token ID422
Feature activation+9.97e-01

Pos logit contributions
ris+0.012
dad+0.011
aren+0.011
oys+0.010
Dan+0.010
Neg logit contributions
 weaker-0.018
 sy-0.018
cient-0.017
 boo-0.017
 worse-0.017
Tokenila
Token ID10102
Feature activation-5.50e-01

Pos logit contributions
ris+0.031
dad+0.030
aren+0.028
cle+0.028
ure+0.028
Neg logit contributions
onel-0.023
 sy-0.023
 boo-0.022
 weaker-0.022
 worse-0.022
Token peek
Token ID27185
Feature activation-7.42e-01

Pos logit contributions
ris+0.016
dad+0.015
oys+0.014
aren+0.014
Dan+0.014
Neg logit contributions
cient-0.015
 weaker-0.014
 sy-0.014
 boo-0.014
onel-0.013
Tokened
Token ID276
Feature activation+1.65e-01

Pos logit contributions
ris+0.019
dad+0.018
ar+0.016
oys+0.016
ure+0.016
Neg logit contributions
cient-0.023
 weaker-0.022
 sy-0.022
 boo-0.021
 wing-0.020
Token from
Token ID422
Feature activation+9.44e-01

Pos logit contributions
 weaker+0.003
 sy+0.003
 boo+0.003
 worse+0.003
cient+0.003
Neg logit contributions
ris-0.006
oys-0.006
dad-0.006
aren-0.006
ae-0.006
Token behind
Token ID2157
Feature activation-4.34e-01

Pos logit contributions
 weaker+0.016
 boo+0.015
 helicopter+0.014
 worse+0.013
 sy+0.013
Neg logit contributions
ris-0.035
ae-0.034
oys-0.034
ure-0.034
ager-0.034
Token Tom
Token ID4186
Feature activation-3.88e+00

Pos logit contributions
ris+0.021
ager+0.019
oys+0.019
thel+0.019
ure+0.019
Neg logit contributions
 asleep-0.004
 sy-0.004
 boo-0.004
 weaker-0.003
 worse-0.003
Token and
Token ID290
Feature activation+3.23e-01

Pos logit contributions
ris+0.154
Dan+0.149
dad+0.147
aren+0.146
oys+0.140
Neg logit contributions
 sy-0.063
cient-0.061
 weaker-0.055
 boo-0.054
 wing-0.053
Token saw
Token ID2497
Feature activation-1.22e-01

Pos logit contributions
cient+0.009
 weaker+0.008
 sy+0.008
 boo+0.008
 worse+0.008
Neg logit contributions
ris-0.009
dad-0.009
oys-0.008
aren-0.008
Dan-0.008
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
ris+0.003
dad+0.003
oys+0.003
aren+0.003
Dan+0.003
Neg logit contributions
cient-0.003
 weaker-0.003
 sy-0.003
 boo-0.003
onel-0.003
Token picture
Token ID4286
Feature activation-3.30e-02

Pos logit contributions
ris+0.034
aren+0.033
dad+0.033
ae+0.033
ure+0.031
Neg logit contributions
 sy-0.026
 boo-0.024
 weaker-0.023
cient-0.023
 wing-0.021
Token again
Token ID757
Feature activation+8.06e-02

Pos logit contributions
ris+0.001
dad+0.001
aren+0.001
oys+0.001
Dan+0.001
Neg logit contributions
cient-0.001
 weaker-0.001
 sy-0.001
 boo-0.001
onel-0.001
Token dog
Token ID3290
Feature activation-3.13e-01

Pos logit contributions
ris+0.031
dad+0.028
aren+0.027
ae+0.027
ure+0.026
Neg logit contributions
 boo-0.024
 weaker-0.024
 sy-0.023
cient-0.022
 wing-0.021
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
ris+0.006
dad+0.005
oys+0.005
aren+0.005
Dan+0.004
Neg logit contributions
cient-0.012
 weaker-0.012
 sy-0.011
 boo-0.011
onel-0.011
Token She
Token ID1375
Feature activation-3.13e-01

Pos logit contributions
ris+0.006
dad+0.006
oys+0.006
aren+0.006
Dan+0.006
Neg logit contributions
cient-0.005
 weaker-0.005
 sy-0.005
 boo-0.005
onel-0.004
Token hid
Token ID24519
Feature activation-2.20e+00

Pos logit contributions
ris+0.008
dad+0.008
Dan+0.007
aren+0.007
 than+0.007
Neg logit contributions
cient-0.009
 weaker-0.009
 sy-0.009
onel-0.009
 wing-0.008
Token behind
Token ID2157
Feature activation-4.17e-01

Pos logit contributions
ris+0.081
ae+0.071
ure+0.070
dad+0.070
aren+0.069
Neg logit contributions
 weaker-0.049
 worse-0.043
nam-0.042
 infected-0.040
 sy-0.039
Token Tom
Token ID4186
Feature activation-3.87e+00

Pos logit contributions
ris+0.016
dad+0.014
oys+0.014
aren+0.014
Dan+0.014
Neg logit contributions
cient-0.008
 weaker-0.008
 sy-0.008
 boo-0.007
 wing-0.007
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
ris+0.107
dad+0.102
Dan+0.096
aren+0.096
oys+0.094
Neg logit contributions
cient-0.110
 sy-0.106
 weaker-0.105
 boo-0.100
 wing-0.098
Token
Token ID198
Feature activation-6.54e-01

Pos logit contributions
ris+0.009
dad+0.009
oys+0.008
aren+0.008
ure+0.008
Neg logit contributions
cient-0.010
 sy-0.009
 weaker-0.009
 boo-0.009
 wing-0.008
Token
Token ID198
Feature activation-7.82e-01

Pos logit contributions
ris+0.018
dad+0.018
ure+0.016
oys+0.016
aren+0.016
Neg logit contributions
cient-0.017
 weaker-0.017
 sy-0.017
 boo-0.016
 infected-0.015
Token"
Token ID1
Feature activation-4.98e-01

Pos logit contributions
ris+0.024
dad+0.023
ae+0.021
aren+0.020
oys+0.020
Neg logit contributions
cient-0.018
 weaker-0.018
 sy-0.017
 infected-0.016
 boo-0.016
TokenTom
Token ID13787
Feature activation-1.66e+00

Pos logit contributions
ris+0.013
dad+0.013
Dan+0.012
aren+0.012
oys+0.012
Neg logit contributions
cient-0.015
 weaker-0.014
 sy-0.014
 boo-0.013
onel-0.013
Token that
Token ID326
Feature activation+1.07e+00

Pos logit contributions
ris+0.046
dad+0.043
aren+0.040
oys+0.040
Dan+0.039
Neg logit contributions
cient-0.052
 weaker-0.051
 sy-0.050
 boo-0.048
onel-0.047
Token.
Token ID13
Feature activation-6.75e-01

Pos logit contributions
cient+0.044
 sy+0.042
 weaker+0.042
onel+0.040
 boo+0.040
Neg logit contributions
ris-0.016
dad-0.015
 than-0.013
Dan-0.012
oys-0.012
Token It
Token ID632
Feature activation-3.51e-01

Pos logit contributions
ris+0.021
dad+0.020
aren+0.018
oys+0.018
ure+0.018
Neg logit contributions
cient-0.017
 weaker-0.017
 sy-0.016
 boo-0.016
onel-0.015
Token jumped
Token ID11687
Feature activation-4.14e-01

Pos logit contributions
ris+0.008
 than+0.007
ar+0.007
dad+0.007
Dan+0.007
Neg logit contributions
cient-0.012
 sy-0.011
 weaker-0.010
onel-0.010
 proc-0.010
Token on
Token ID319
Feature activation+4.05e-01

Pos logit contributions
ris+0.010
 than+0.009
 ahead+0.009
dad+0.009
ar+0.009
Neg logit contributions
cient-0.013
 sy-0.012
 weaker-0.012
 boo-0.011
onel-0.011
Token Tom
Token ID4186
Feature activation-3.84e+00

Pos logit contributions
 weaker+0.007
 boo+0.007
 sy+0.007
 wing+0.007
cient+0.006
Neg logit contributions
ris-0.015
dad-0.015
aren-0.014
ae-0.014
ager-0.014
Token's
Token ID338
Feature activation-2.28e-01

Pos logit contributions
ris+0.091
dad+0.086
oys+0.078
aren+0.076
 than+0.076
Neg logit contributions
cient-0.124
 weaker-0.120
 sy-0.118
 boo-0.112
onel-0.111
Token face
Token ID1986
Feature activation-1.60e+00

Pos logit contributions
dad+0.005
ris+0.005
oys+0.005
 than+0.005
ae+0.005
Neg logit contributions
cient-0.008
 weaker-0.007
 sy-0.007
onel-0.007
 infected-0.006
Token and
Token ID290
Feature activation-1.84e-02

Pos logit contributions
ris+0.027
dad+0.024
 than+0.021
oys+0.021
Dan+0.020
Neg logit contributions
cient-0.064
 weaker-0.061
 sy-0.061
 boo-0.059
onel-0.058
Token cro
Token ID6763
Feature activation+2.84e-02

Pos logit contributions
dad+0.000
 than+0.000
ris+0.000
Dan+0.000
 ranger+0.000
Neg logit contributions
cient-0.001
onel-0.001
 weaker-0.001
 sy-0.000
 proc-0.000
Tokenaked
Token ID4335
Feature activation-1.51e+00

Pos logit contributions
cient+0.001
 sy+0.001
 weaker+0.001
 boo+0.001
 wing+0.001
Neg logit contributions
ris-0.001
dad-0.001
aren-0.001
oys-0.001
Dan-0.001
Token saw
Token ID2497
Feature activation-2.45e-01

Pos logit contributions
ris+0.026
dad+0.024
Dan+0.022
aren+0.022
ar+0.021
Neg logit contributions
cient-0.029
 weaker-0.027
 sy-0.027
onel-0.026
 boo-0.025
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
dad+0.005
ris+0.005
Dan+0.005
Tom+0.005
oys+0.005
Neg logit contributions
cient-0.008
onel-0.007
 sy-0.007
 proc-0.007
 weaker-0.007
Token paper
Token ID3348
Feature activation-1.30e+00

Pos logit contributions
ris+0.038
dad+0.036
oys+0.034
aren+0.034
Dan+0.033
Neg logit contributions
cient-0.034
 weaker-0.032
 sy-0.032
 boo-0.030
onel-0.030
Token hat
Token ID6877
Feature activation-7.33e-01

Pos logit contributions
ris+0.028
dad+0.027
 than+0.025
oys+0.024
Dan+0.023
Neg logit contributions
cient-0.044
 weaker-0.041
 sy-0.040
onel-0.039
 boo-0.039
Token on
Token ID319
Feature activation-6.41e-01

Pos logit contributions
ris+0.013
dad+0.012
 than+0.010
oys+0.010
aren+0.010
Neg logit contributions
cient-0.028
 weaker-0.027
 sy-0.027
 boo-0.026
onel-0.026
Token Tom
Token ID4186
Feature activation-3.83e+00

Pos logit contributions
ris+0.024
dad+0.022
aren+0.022
oys+0.022
ae+0.021
Neg logit contributions
 weaker-0.012
cient-0.012
 sy-0.012
 boo-0.011
 wing-0.011
Token's
Token ID338
Feature activation-4.39e-01

Pos logit contributions
ris+0.131
dad+0.125
aren+0.117
oys+0.117
Dan+0.116
Neg logit contributions
cient-0.083
 weaker-0.080
 sy-0.079
 boo-0.074
onel-0.071
Token head
Token ID1182
Feature activation-8.07e-01

Pos logit contributions
ris+0.012
dad+0.011
oys+0.010
aren+0.010
Dan+0.010
Neg logit contributions
cient-0.013
 weaker-0.012
 sy-0.012
 boo-0.011
onel-0.011
Token and
Token ID290
Feature activation+5.69e-02

Pos logit contributions
ris+0.016
dad+0.015
aren+0.013
oys+0.013
Dan+0.013
Neg logit contributions
cient-0.029
 weaker-0.028
 sy-0.028
 boo-0.027
 wing-0.026
Token thought
Token ID1807
Feature activation+4.29e-02

Pos logit contributions
cient+0.002
onel+0.002
 weaker+0.002
ivery+0.002
 proc+0.002
Neg logit contributions
dad-0.001
 than-0.001
ris-0.001
Dan-0.001
 ranger-0.001
Token it
Token ID340
Feature activation+1.40e+00

Pos logit contributions
cient+0.002
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
oys-0.001
Dan-0.001
 than-0.001
Token
Token ID198
Feature activation-6.46e-01

Pos logit contributions
ris+0.013
dad+0.012
oys+0.012
aren+0.012
ure+0.011
Neg logit contributions
cient-0.012
 weaker-0.012
 sy-0.012
 boo-0.011
onel-0.011
TokenThey
Token ID2990
Feature activation-5.87e-01

Pos logit contributions
ris+0.016
dad+0.015
aren+0.015
oys+0.014
ae+0.013
Neg logit contributions
 weaker-0.019
cient-0.019
 sy-0.018
 worse-0.017
 proc-0.017
Token played
Token ID2826
Feature activation+5.79e-01

Pos logit contributions
ris+0.017
dad+0.016
aren+0.015
Dan+0.014
ure+0.014
Neg logit contributions
 weaker-0.016
 sy-0.016
cient-0.016
 boo-0.015
onel-0.014
Token together
Token ID1978
Feature activation-8.49e-01

Pos logit contributions
cient+0.017
 sy+0.017
 weaker+0.016
 boo+0.015
onel+0.015
Neg logit contributions
ris-0.015
dad-0.014
 than-0.014
oys-0.013
Dan-0.013
Token at
Token ID379
Feature activation-1.19e+00

Pos logit contributions
ris+0.022
dad+0.021
aren+0.019
oys+0.019
Dan+0.019
Neg logit contributions
cient-0.026
 weaker-0.025
 sy-0.024
 boo-0.023
onel-0.023
Token recess
Token ID28836
Feature activation-3.83e+00

Pos logit contributions
ris+0.039
dad+0.038
aren+0.036
oys+0.036
ar+0.036
Neg logit contributions
 weaker-0.026
cient-0.025
 sy-0.024
 boo-0.023
 wing-0.022
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
ris+0.086
dad+0.082
aren+0.076
Dan+0.073
oys+0.073
Neg logit contributions
cient-0.123
 weaker-0.119
 sy-0.116
onel-0.113
 boo-0.111
Token They
Token ID1119
Feature activation-9.38e-01

Pos logit contributions
ris+0.005
dad+0.005
aren+0.004
ager+0.004
ure+0.004
Neg logit contributions
cient-0.003
 sy-0.003
 weaker-0.003
 boo-0.003
 en-0.003
Token shared
Token ID4888
Feature activation-8.30e-01

Pos logit contributions
ris+0.026
dad+0.024
aren+0.022
Dan+0.022
oys+0.022
Neg logit contributions
cient-0.027
 weaker-0.026
 sy-0.026
 boo-0.025
onel-0.024
Token their
Token ID511
Feature activation-2.36e-01

Pos logit contributions
ris+0.032
dad+0.030
aren+0.029
ure+0.029
oys+0.029
Neg logit contributions
cient-0.014
 sy-0.014
 weaker-0.014
 boo-0.013
 wing-0.012
Token toys
Token ID14958
Feature activation+4.91e-01

Pos logit contributions
ris+0.008
dad+0.007
aren+0.007
Dan+0.007
ure+0.007
Neg logit contributions
cient-0.005
 sy-0.005
 weaker-0.005
 boo-0.005
 wing-0.005
Token
Token ID198
Feature activation-8.01e-01

Pos logit contributions
ris+0.016
dad+0.015
aren+0.014
oys+0.014
Dan+0.013
Neg logit contributions
cient-0.017
 weaker-0.017
 sy-0.017
 boo-0.016
onel-0.015
Token
Token ID198
Feature activation-9.50e-01

Pos logit contributions
ris+0.023
dad+0.023
ure+0.021
aren+0.021
oys+0.020
Neg logit contributions
 weaker-0.020
cient-0.020
 sy-0.020
 boo-0.019
 infected-0.018
TokenHe
Token ID1544
Feature activation-7.78e-01

Pos logit contributions
ris+0.027
dad+0.026
aren+0.024
ae+0.023
ure+0.023
Neg logit contributions
 weaker-0.024
cient-0.023
 sy-0.023
 infected-0.022
 worse-0.022
Token jumped
Token ID11687
Feature activation-7.44e-01

Pos logit contributions
ris+0.021
dad+0.020
aren+0.019
Dan+0.019
oys+0.018
Neg logit contributions
cient-0.022
 weaker-0.021
 sy-0.021
onel-0.020
 boo-0.020
Token at
Token ID379
Feature activation-1.03e+00

Pos logit contributions
ris+0.022
dad+0.021
oys+0.020
 than+0.020
Dan+0.019
Neg logit contributions
cient-0.020
 sy-0.018
 weaker-0.018
 boo-0.017
onel-0.017
Token Tom
Token ID4186
Feature activation-3.82e+00

Pos logit contributions
ris+0.038
dad+0.035
ure+0.034
oys+0.033
ager+0.033
Neg logit contributions
 weaker-0.020
cient-0.020
 sy-0.019
 boo-0.019
 wing-0.017
Token and
Token ID290
Feature activation+1.57e-01

Pos logit contributions
ris+0.076
dad+0.071
aren+0.063
oys+0.062
Dan+0.062
Neg logit contributions
cient-0.138
 weaker-0.135
 sy-0.134
 boo-0.129
onel-0.126
Token Sue
Token ID26089
Feature activation-1.76e+00

Pos logit contributions
cient+0.004
 sy+0.004
 weaker+0.004
 boo+0.004
 wing+0.004
Neg logit contributions
ris-0.005
dad-0.004
aren-0.004
oys-0.004
Dan-0.004
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
ris+0.029
dad+0.026
 than+0.023
oys+0.022
aren+0.022
Neg logit contributions
cient-0.069
 sy-0.067
 weaker-0.067
 boo-0.063
onel-0.063
Token He
Token ID679
Feature activation-8.20e-01

Pos logit contributions
ris+0.004
dad+0.004
aren+0.003
oys+0.003
ae+0.003
Neg logit contributions
cient-0.003
 weaker-0.003
 sy-0.003
 boo-0.003
 wing-0.003
Token bit
Token ID1643
Feature activation-9.95e-01

Pos logit contributions
ris+0.024
dad+0.022
Dan+0.021
aren+0.021
oys+0.020
Neg logit contributions
cient-0.022
 weaker-0.022
 sy-0.021
 boo-0.020
onel-0.020
Token He
Token ID679
Feature activation-1.40e+00

Pos logit contributions
ris+0.019
dad+0.019
ae+0.018
ure+0.017
aren+0.017
Neg logit contributions
 sy-0.010
 weaker-0.010
cient-0.010
 boo-0.009
 Mos-0.009
Token throws
Token ID12542
Feature activation-1.18e+00

Pos logit contributions
ris+0.039
dad+0.036
 than+0.034
Dan+0.033
ure+0.033
Neg logit contributions
cient-0.040
 weaker-0.038
 sy-0.037
onel-0.035
 wing-0.035
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
ris+0.043
dad+0.040
ae+0.040
aren+0.039
thel+0.039
Neg logit contributions
 weaker-0.023
 boo-0.021
 sy-0.021
 worse-0.020
cient-0.020
Token ball
Token ID2613
Feature activation+3.00e-01

Pos logit contributions
ris+0.030
dad+0.029
oys+0.027
 than+0.027
Tom+0.027
Neg logit contributions
cient-0.045
 sy-0.041
onel-0.039
 weaker-0.039
 infected-0.037
Token at
Token ID379
Feature activation-1.32e+00

Pos logit contributions
cient+0.009
 sy+0.009
 weaker+0.008
 boo+0.008
onel+0.008
Neg logit contributions
ris-0.008
dad-0.007
 than-0.007
oys-0.007
Dan-0.006
Token Tom
Token ID4186
Feature activation-3.79e+00

Pos logit contributions
ris+0.038
ager+0.035
ure+0.034
dad+0.034
thel+0.033
Neg logit contributions
 weaker-0.033
cient-0.031
 worse-0.030
 wing-0.030
 boo-0.030
Token and
Token ID290
Feature activation+1.11e-01

Pos logit contributions
ris+0.068
dad+0.061
 than+0.054
oys+0.053
aren+0.052
Neg logit contributions
cient-0.145
 weaker-0.141
 sy-0.139
 boo-0.133
onel-0.132
Token Sam
Token ID3409
Feature activation-1.76e+00

Pos logit contributions
cient+0.003
 weaker+0.003
 sy+0.003
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.003
dad-0.003
aren-0.003
oys-0.003
Dan-0.003
Token and
Token ID290
Feature activation+7.72e-01

Pos logit contributions
ris+0.030
dad+0.027
 than+0.025
oys+0.023
 ahead+0.022
Neg logit contributions
cient-0.069
 sy-0.066
 weaker-0.065
 boo-0.061
onel-0.061
Token runs
Token ID4539
Feature activation-7.62e-01

Pos logit contributions
cient+0.022
 weaker+0.021
 sy+0.021
 boo+0.020
onel+0.019
Neg logit contributions
ris-0.022
dad-0.020
aren-0.019
oys-0.019
Dan-0.019
Token away
Token ID1497
Feature activation-1.18e+00

Pos logit contributions
ris+0.026
ae+0.026
dad+0.026
aren+0.024
Dan+0.024
Neg logit contributions
 weaker-0.017
 worse-0.015
cient-0.015
 sy-0.015
 boo-0.014
Token bone
Token ID9970
Feature activation-1.14e+00

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
aren-0.001
oys-0.001
Dan-0.001
Token.
Token ID13
Feature activation-2.59e-01

Pos logit contributions
ris+0.026
dad+0.025
aren+0.024
Dan+0.023
atherine+0.022
Neg logit contributions
 sy-0.037
 weaker-0.037
cient-0.036
 boo-0.035
 worse-0.035
Token It
Token ID632
Feature activation-3.71e-01

Pos logit contributions
ris+0.008
dad+0.008
oys+0.007
aren+0.007
ae+0.007
Neg logit contributions
cient-0.006
 sy-0.006
 weaker-0.006
 boo-0.006
 wing-0.005
Token jumped
Token ID11687
Feature activation-3.04e-01

Pos logit contributions
ris+0.009
dad+0.008
Dan+0.008
 than+0.008
ar+0.008
Neg logit contributions
cient-0.012
 weaker-0.011
 sy-0.011
onel-0.011
 wing-0.010
Token on
Token ID319
Feature activation+4.09e-01

Pos logit contributions
ris+0.009
dad+0.009
oys+0.008
 than+0.008
Dan+0.008
Neg logit contributions
cient-0.008
 sy-0.008
 weaker-0.008
 boo-0.007
onel-0.007
Token Tom
Token ID4186
Feature activation-3.77e+00

Pos logit contributions
 boo+0.006
 sy+0.006
 weaker+0.006
 wing+0.006
 worse+0.005
Neg logit contributions
ris-0.017
dad-0.015
ager-0.015
aren-0.015
ae-0.015
Token and
Token ID290
Feature activation+2.13e-02

Pos logit contributions
ris+0.083
dad+0.078
aren+0.071
Dan+0.070
oys+0.070
Neg logit contributions
cient-0.127
 sy-0.124
 weaker-0.124
 boo-0.120
 wing-0.115
Token snatched
Token ID43276
Feature activation-3.37e-01

Pos logit contributions
 sy+0.001
cient+0.001
 weaker+0.001
 boo+0.000
 worse+0.000
Neg logit contributions
ris-0.001
dad-0.001
aren-0.001
oys-0.001
ae-0.001
Token the
Token ID262
Feature activation-9.81e-01

Pos logit contributions
ris+0.013
ae+0.013
aren+0.013
oys+0.012
dad+0.012
Neg logit contributions
 worse-0.005
 boo-0.005
 weaker-0.005
 ang-0.005
 means-0.004
Token fl
Token ID781
Feature activation+1.70e+00

Pos logit contributions
ris+0.026
dad+0.025
oys+0.024
 than+0.024
ure+0.023
Neg logit contributions
cient-0.028
 weaker-0.026
onel-0.025
 proc-0.024
 sy-0.024
Tokenute
Token ID1133
Feature activation+1.37e+00

Pos logit contributions
 sy+0.020
cient+0.018
 wing+0.017
 boo+0.015
 weaker+0.014
Neg logit contributions
ris-0.075
dad-0.074
aren-0.074
Adam-0.070
Dan-0.069
Token swung
Token ID28507
Feature activation-8.05e-01

Pos logit contributions
ris+0.013
dad+0.012
aren+0.011
oys+0.011
Dan+0.011
Neg logit contributions
cient-0.012
 weaker-0.012
 sy-0.012
 boo-0.011
onel-0.011
Token his
Token ID465
Feature activation-7.23e-01

Pos logit contributions
ris+0.025
dad+0.024
oys+0.022
Dan+0.022
aren+0.022
Neg logit contributions
cient-0.020
 sy-0.019
 weaker-0.019
 boo-0.018
onel-0.017
Token golf
Token ID13126
Feature activation-4.99e-01

Pos logit contributions
ris+0.021
dad+0.020
oys+0.019
 than+0.018
aren+0.018
Neg logit contributions
cient-0.019
 weaker-0.019
 sy-0.018
onel-0.017
 boo-0.017
Token club
Token ID3430
Feature activation-4.57e-01

Pos logit contributions
ris+0.016
dad+0.016
aren+0.015
ae+0.015
Dan+0.015
Neg logit contributions
cient-0.011
 weaker-0.011
 sy-0.011
 boo-0.010
 wing-0.010
Token at
Token ID379
Feature activation-1.23e+00

Pos logit contributions
ris+0.011
dad+0.011
aren+0.010
Dan+0.010
oys+0.010
Neg logit contributions
 weaker-0.014
cient-0.014
 sy-0.013
 boo-0.013
 wing-0.013
Token Tom
Token ID4186
Feature activation-3.77e+00

Pos logit contributions
ris+0.042
ure+0.040
ager+0.040
dad+0.039
ar+0.039
Neg logit contributions
 weaker-0.023
cient-0.021
 boo-0.021
 worse-0.021
 sy-0.020
Token and
Token ID290
Feature activation+2.52e-01

Pos logit contributions
ris+0.085
dad+0.080
aren+0.072
oys+0.072
Dan+0.071
Neg logit contributions
cient-0.125
 weaker-0.122
 sy-0.121
 boo-0.116
onel-0.113
Token hit
Token ID2277
Feature activation-7.92e-01

Pos logit contributions
 sy+0.006
 weaker+0.006
cient+0.006
 boo+0.006
 worse+0.006
Neg logit contributions
ris-0.008
dad-0.007
oys-0.007
aren-0.007
ae-0.007
Token him
Token ID683
Feature activation+1.37e-01

Pos logit contributions
ris+0.025
dad+0.022
aren+0.022
ager+0.022
ure+0.022
Neg logit contributions
 weaker-0.020
 boo-0.019
 sy-0.019
 worse-0.018
 ang-0.017
Token on
Token ID319
Feature activation+6.51e-01

Pos logit contributions
cient+0.006
 sy+0.005
 weaker+0.005
onel+0.005
 boo+0.005
Neg logit contributions
ris-0.002
 than-0.002
dad-0.002
 ahead-0.002
oys-0.002
Token the
Token ID262
Feature activation-3.60e-01

Pos logit contributions
 sy+0.009
 weaker+0.009
 boo+0.008
 wing+0.008
 worse+0.008
Neg logit contributions
ris-0.026
aren-0.025
dad-0.025
oys-0.025
ager-0.024
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
cient+0.004
 sy+0.004
 weaker+0.004
onel+0.004
nam+0.004
Neg logit contributions
ris-0.001
oys-0.001
 than-0.001
dad-0.001
 ahead-0.001
Token He
Token ID679
Feature activation-1.34e+00

Pos logit contributions
ris+0.013
dad+0.013
aren+0.012
ure+0.012
ae+0.012
Neg logit contributions
cient-0.011
 weaker-0.011
 sy-0.011
 boo-0.010
 worse-0.010
Token runs
Token ID4539
Feature activation-1.41e+00

Pos logit contributions
ris+0.037
dad+0.035
Dan+0.031
 than+0.031
oys+0.031
Neg logit contributions
cient-0.038
 weaker-0.037
 sy-0.036
onel-0.034
 wing-0.034
Token away
Token ID1497
Feature activation-1.51e+00

Pos logit contributions
ris+0.049
dad+0.047
ae+0.045
aren+0.045
Dan+0.044
Neg logit contributions
 weaker-0.031
 worse-0.029
 sy-0.028
 boo-0.028
cient-0.028
Token from
Token ID422
Feature activation+4.32e-01

Pos logit contributions
ris+0.050
dad+0.050
aren+0.048
thel+0.045
ae+0.045
Neg logit contributions
 weaker-0.035
 sy-0.034
 boo-0.033
 worse-0.031
cient-0.030
Token Tom
Token ID4186
Feature activation-3.77e+00

Pos logit contributions
 weaker+0.009
 boo+0.008
 worse+0.007
 sy+0.007
 ang+0.007
Neg logit contributions
ris-0.017
ure-0.016
aren-0.015
dad-0.015
thel-0.015
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
ris+0.082
dad+0.078
Dan+0.072
aren+0.072
oys+0.069
Neg logit contributions
cient-0.127
 weaker-0.126
 sy-0.125
 boo-0.121
 wing-0.117
Token He
Token ID679
Feature activation-1.08e+00

Pos logit contributions
ris+0.014
dad+0.014
ure+0.013
aren+0.013
ae+0.012
Neg logit contributions
 weaker-0.009
 sy-0.008
cient-0.008
 boo-0.008
 Mos-0.008
Token does
Token ID857
Feature activation-8.53e-01

Pos logit contributions
ris+0.029
dad+0.028
Dan+0.025
 than+0.025
aren+0.025
Neg logit contributions
cient-0.031
 weaker-0.030
 sy-0.029
onel-0.028
 wing-0.027
Token not
Token ID407
Feature activation-1.40e+00

Pos logit contributions
ris+0.027
dad+0.026
aren+0.025
oys+0.024
ure+0.024
Neg logit contributions
 weaker-0.022
 boo-0.021
 worse-0.020
 sy-0.020
cient-0.019
Token want
Token ID765
Feature activation-9.36e-01

Pos logit contributions
ris+0.045
dad+0.043
aren+0.043
ae+0.042
Dan+0.040
Neg logit contributions
 boo-0.033
 weaker-0.032
 sy-0.032
 worse-0.031
cient-0.031
Token too
Token ID1165
Feature activation-2.53e+00

Pos logit contributions
 weaker+0.000
 worse+0.000
 sy+0.000
 boo+0.000
 sad+0.000
Neg logit contributions
ris-0.001
ae-0.000
atherine-0.000
ager-0.000
aren-0.000
Token.
Token ID13
Feature activation-4.11e-01

Pos logit contributions
ris+0.067
dad+0.064
aren+0.063
atherine+0.060
ure+0.059
Neg logit contributions
 weaker-0.078
 worse-0.074
 sy-0.073
 wing-0.071
 infected-0.071
Token She
Token ID1375
Feature activation-6.51e-01

Pos logit contributions
ris+0.014
dad+0.013
ae+0.012
ure+0.012
aren+0.012
Neg logit contributions
cient-0.010
 weaker-0.009
 sy-0.009
 boo-0.009
 wing-0.008
Token ran
Token ID4966
Feature activation-8.65e-01

Pos logit contributions
ris+0.018
dad+0.017
aren+0.016
Dan+0.016
oys+0.015
Neg logit contributions
cient-0.018
 weaker-0.018
 sy-0.017
onel-0.017
 boo-0.016
Token after
Token ID706
Feature activation-1.64e+00

Pos logit contributions
ae+0.038
ris+0.037
dad+0.035
Adam+0.035
Dan+0.035
Neg logit contributions
 weaker-0.012
 worse-0.011
 boo-0.010
 off-0.010
 sad-0.009
Token Tom
Token ID4186
Feature activation-3.76e+00

Pos logit contributions
ris+0.058
dad+0.049
ure+0.048
 than+0.048
oys+0.048
Neg logit contributions
 weaker-0.038
 boo-0.036
 sy-0.036
cient-0.035
 worse-0.035
Token.
Token ID13
Feature activation-5.04e-01

Pos logit contributions
ris+0.102
dad+0.099
Dan+0.093
ae+0.093
aren+0.093
Neg logit contributions
 weaker-0.106
 boo-0.104
cient-0.103
 sy-0.103
 worse-0.097
Token
Token ID198
Feature activation-4.99e-01

Pos logit contributions
ris+0.018
dad+0.016
ae+0.015
ure+0.015
ar+0.015
Neg logit contributions
cient-0.011
 sy-0.011
 weaker-0.011
 boo-0.010
 wing-0.009
Token
Token ID198
Feature activation-6.88e-01

Pos logit contributions
ris+0.014
dad+0.013
aren+0.012
ure+0.012
oys+0.012
Neg logit contributions
cient-0.014
 weaker-0.014
 sy-0.014
 boo-0.013
 wing-0.012
TokenTom
Token ID13787
Feature activation-2.65e+00

Pos logit contributions
ris+0.018
dad+0.016
ae+0.015
joy+0.015
aren+0.015
Neg logit contributions
 sy-0.019
 weaker-0.018
 boo-0.018
cient-0.018
 infected-0.018
Token's
Token ID338
Feature activation+7.70e-01

Pos logit contributions
ris+0.087
dad+0.082
oys+0.078
aren+0.077
ae+0.077
Neg logit contributions
 weaker-0.061
cient-0.061
 sy-0.060
 boo-0.059
 wing-0.054
Token mouth
Token ID5422
Feature activation-4.90e-01

Pos logit contributions
ris+0.033
dad+0.031
aren+0.029
oys+0.029
Dan+0.028
Neg logit contributions
cient-0.029
 sy-0.029
 boo-0.028
 weaker-0.028
 wing-0.028
Token.
Token ID13
Feature activation+8.39e-02

Pos logit contributions
ris+0.011
dad+0.010
aren+0.010
atherine+0.009
oys+0.009
Neg logit contributions
 sy-0.016
 weaker-0.016
cient-0.016
 wing-0.015
 boo-0.015
Token It
Token ID632
Feature activation-4.04e-01

Pos logit contributions
cient+0.003
 weaker+0.003
 sy+0.002
onel+0.002
 boo+0.002
Neg logit contributions
ris-0.002
dad-0.002
Dan-0.002
oys-0.002
aren-0.002
Token was
Token ID373
Feature activation+3.66e-01

Pos logit contributions
ris+0.007
 than+0.006
ar+0.006
oe+0.006
 ahead+0.006
Neg logit contributions
cient-0.015
onel-0.014
 weaker-0.014
 sy-0.014
 proc-0.013
Token a
Token ID257
Feature activation+9.73e-02

Pos logit contributions
cient+0.012
onel+0.011
 sy+0.010
xc+0.010
 weaker+0.010
Neg logit contributions
 than-0.008
 ahead-0.008
dad-0.008
ris-0.007
 sight-0.007
Token se
Token ID384
Feature activation+3.64e+00

Pos logit contributions
cient+0.003
 weaker+0.003
onel+0.002
 sy+0.002
 proc+0.002
Neg logit contributions
ris-0.003
dad-0.003
Dan-0.002
 than-0.002
oys-0.002
Tokenag
Token ID363
Feature activation-9.02e-01

Pos logit contributions
cient+0.054
 sy+0.051
 boo+0.050
onel+0.048
 weaker+0.046
Neg logit contributions
dad-0.140
ris-0.137
Dan-0.136
aren-0.135
ae-0.133
Tokenull
Token ID724
Feature activation+1.49e+00

Pos logit contributions
ris+0.029
dad+0.028
aren+0.026
oys+0.026
Dan+0.026
Neg logit contributions
cient-0.021
 weaker-0.020
 sy-0.020
 boo-0.019
onel-0.018
Token.
Token ID13
Feature activation+1.27e-03

Pos logit contributions
cient+0.041
 sy+0.040
 weaker+0.039
 boo+0.038
 wing+0.037
Neg logit contributions
ris-0.042
dad-0.041
aren-0.038
oys-0.037
Dan-0.037
Token It
Token ID632
Feature activation-4.22e-01

Pos logit contributions
cient+0.000
 weaker+0.000
 sy+0.000
onel+0.000
 boo+0.000
Neg logit contributions
ris-0.000
dad-0.000
Dan-0.000
oys-0.000
aren-0.000
Token had
Token ID550
Feature activation+1.09e-01

Pos logit contributions
ris+0.008
 than+0.007
ar+0.007
Dan+0.007
 ahead+0.007
Neg logit contributions
cient-0.015
onel-0.015
 weaker-0.014
 sy-0.014
 proc-0.014
Token and
Token ID290
Feature activation+1.91e-01

Pos logit contributions
ris+0.006
dad+0.005
oys+0.005
aren+0.005
ure+0.005
Neg logit contributions
 weaker-0.005
cient-0.005
 boo-0.005
 sy-0.005
 worse-0.005
Token everything
Token ID2279
Feature activation-3.46e-01

Pos logit contributions
cient+0.006
 weaker+0.005
 sy+0.005
 boo+0.005
onel+0.005
Neg logit contributions
ris-0.005
dad-0.005
oys-0.005
aren-0.005
Dan-0.004
Token in
Token ID287
Feature activation+5.39e-01

Pos logit contributions
ris+0.010
dad+0.009
aren+0.008
Dan+0.008
oys+0.008
Neg logit contributions
cient-0.010
 sy-0.009
 weaker-0.009
 boo-0.009
 wing-0.009
Token the
Token ID262
Feature activation+2.18e-01

Pos logit contributions
 weaker+0.005
 wing+0.005
 smaller+0.005
 helicopters+0.005
 asleep+0.005
Neg logit contributions
ar-0.023
ager-0.022
thel-0.022
oys-0.022
ae-0.022
Token park
Token ID3952
Feature activation-1.72e+00

Pos logit contributions
cient+0.007
 weaker+0.007
 sy+0.007
 boo+0.007
 wing+0.006
Neg logit contributions
ris-0.005
dad-0.005
oys-0.005
ae-0.005
Dan-0.004
Token gl
Token ID1278
Feature activation+2.84e+00

Pos logit contributions
ris+0.053
dad+0.050
aren+0.048
oys+0.048
atherine+0.047
Neg logit contributions
 weaker-0.041
cient-0.040
 sy-0.038
 wing-0.036
onel-0.036
Tokenowed
Token ID6972
Feature activation+9.89e-01

Pos logit contributions
cient+0.067
 weaker+0.065
 sy+0.064
 boo+0.060
onel+0.058
Neg logit contributions
ris-0.092
dad-0.088
oys-0.082
aren-0.082
Dan-0.081
Token with
Token ID351
Feature activation-7.31e-02

Pos logit contributions
cient+0.033
 weaker+0.032
 sy+0.032
 boo+0.030
onel+0.030
Neg logit contributions
ris-0.022
dad-0.021
aren-0.019
Dan-0.019
oys-0.019
Token amazing
Token ID4998
Feature activation+4.19e-01

Pos logit contributions
ris+0.002
dad+0.002
oys+0.002
aren+0.002
Dan+0.002
Neg logit contributions
cient-0.002
 weaker-0.002
 sy-0.002
 boo-0.002
onel-0.002
Token colors
Token ID7577
Feature activation-2.43e-01

Pos logit contributions
 sy+0.010
cient+0.010
 weaker+0.010
 boo+0.009
 wing+0.009
Neg logit contributions
ris-0.013
dad-0.012
aren-0.012
oys-0.012
ae-0.012
Token.
Token ID13
Feature activation+3.21e-01

Pos logit contributions
ris+0.005
dad+0.004
aren+0.004
Dan+0.004
oys+0.004
Neg logit contributions
cient-0.009
 weaker-0.009
 sy-0.009
 boo-0.008
onel-0.008
Token Tommy
Token ID19919
Feature activation-2.01e+00

Pos logit contributions
cient+0.005
 weaker+0.005
 boo+0.004
 sy+0.004
 proc+0.004
Neg logit contributions
ris-0.004
dad-0.004
oys-0.003
ae-0.003
ure-0.003
Token laughed
Token ID13818
Feature activation+2.31e-01

Pos logit contributions
ris+0.051
dad+0.049
Dan+0.044
aren+0.043
ure+0.043
Neg logit contributions
cient-0.061
 weaker-0.060
 sy-0.059
onel-0.057
 infected-0.053
Token at
Token ID379
Feature activation-1.03e+00

Pos logit contributions
cient+0.010
 sy+0.009
 weaker+0.009
onel+0.009
 boo+0.009
Neg logit contributions
ris-0.003
dad-0.003
 than-0.003
 ahead-0.003
oys-0.003
Token the
Token ID262
Feature activation-8.46e-01

Pos logit contributions
ris+0.042
oys+0.039
ure+0.038
ae+0.038
ar+0.037
Neg logit contributions
 weaker-0.017
 boo-0.015
cient-0.015
 worse-0.014
 wing-0.013
Token silly
Token ID14397
Feature activation-8.67e-01

Pos logit contributions
ris+0.021
dad+0.020
Dan+0.018
aren+0.018
oys+0.018
Neg logit contributions
cient-0.027
 weaker-0.026
 sy-0.026
 boo-0.024
onel-0.024
Token se
Token ID384
Feature activation+3.50e+00

Pos logit contributions
ris+0.025
dad+0.022
aren+0.022
ae+0.021
oys+0.021
Neg logit contributions
 sy-0.024
cient-0.024
 boo-0.024
 weaker-0.023
 wing-0.022
Tokenag
Token ID363
Feature activation-9.82e-01

Pos logit contributions
cient+0.061
 weaker+0.053
 sy+0.053
 boo+0.051
onel+0.049
Neg logit contributions
ris-0.129
dad-0.126
aren-0.123
Dan-0.121
oys-0.119
Tokenull
Token ID724
Feature activation+1.04e+00

Pos logit contributions
ris+0.031
dad+0.030
oys+0.028
Dan+0.028
aren+0.028
Neg logit contributions
cient-0.024
 weaker-0.023
 sy-0.023
 boo-0.021
onel-0.021
Token and
Token ID290
Feature activation+3.43e-01

Pos logit contributions
cient+0.037
 weaker+0.036
 sy+0.035
 boo+0.034
onel+0.033
Neg logit contributions
ris-0.021
dad-0.020
oys-0.017
aren-0.017
 than-0.017
Token the
Token ID262
Feature activation+3.13e-01

Pos logit contributions
cient+0.011
 sy+0.010
 weaker+0.010
onel+0.010
 infected+0.009
Neg logit contributions
ris-0.008
dad-0.008
Dan-0.008
 than-0.008
aren-0.007
Token man
Token ID582
Feature activation-6.19e-01

Pos logit contributions
cient+0.008
 weaker+0.007
 sy+0.007
onel+0.007
 boo+0.007
Neg logit contributions
ris-0.010
dad-0.010
Dan-0.009
oys-0.009
aren-0.009
Token hands
Token ID2832
Feature activation+4.27e-01

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
oys-0.001
Dan-0.001
aren-0.001
Token to
Token ID284
Feature activation-3.58e-02

Pos logit contributions
cient+0.014
 sy+0.013
 weaker+0.013
 boo+0.013
onel+0.012
Neg logit contributions
ris-0.010
dad-0.010
oys-0.009
 than-0.009
aren-0.008
Token hold
Token ID1745
Feature activation-7.82e-01

Pos logit contributions
ris+0.001
dad+0.001
Dan+0.001
oys+0.001
ure+0.001
Neg logit contributions
cient-0.001
 sy-0.001
 weaker-0.001
onel-0.001
 boo-0.001
Token the
Token ID262
Feature activation+1.51e-01

Pos logit contributions
ris+0.023
dad+0.022
aren+0.020
oys+0.020
Dan+0.020
Neg logit contributions
cient-0.021
 weaker-0.020
 sy-0.020
 boo-0.019
onel-0.018
Token f
Token ID277
Feature activation+2.73e+00

Pos logit contributions
 weaker+0.004
cient+0.004
 sy+0.003
 boo+0.003
 wing+0.003
Neg logit contributions
ris-0.005
dad-0.005
aren-0.004
ae-0.004
atherine-0.004
Tokenauc
Token ID14272
Feature activation+4.01e+00

Pos logit contributions
cient+0.033
 weaker+0.031
 sy+0.031
 boo+0.026
onel+0.026
Neg logit contributions
ris-0.120
dad-0.116
aren-0.111
Dan-0.110
oys-0.109
Tokenet
Token ID316
Feature activation-1.65e-01

Pos logit contributions
cient+0.139
 weaker+0.135
 sy+0.134
 boo+0.131
onel+0.125
Neg logit contributions
ris-0.086
dad-0.078
aren-0.076
Dan-0.076
Adam-0.074
Token up
Token ID510
Feature activation+4.36e-02

Pos logit contributions
ris+0.004
dad+0.003
oys+0.003
 than+0.003
aren+0.003
Neg logit contributions
cient-0.006
 sy-0.005
 weaker-0.005
 boo-0.005
onel-0.005
Token and
Token ID290
Feature activation+1.38e+00

Pos logit contributions
 weaker+0.001
cient+0.001
 sy+0.001
 wing+0.001
 boo+0.001
Neg logit contributions
ris-0.001
dad-0.001
aren-0.001
Dan-0.001
oys-0.001
Token support
Token ID1104
Feature activation+1.27e+00

Pos logit contributions
cient+0.045
 sy+0.042
onel+0.042
ivery+0.040
 weaker+0.039
Neg logit contributions
ris-0.031
dad-0.030
 ahead-0.029
 than-0.027
Dan-0.026
Token it
Token ID340
Feature activation+9.59e-01

Pos logit contributions
 weaker+0.033
cient+0.033
 sy+0.032
 boo+0.030
 wing+0.030
Neg logit contributions
ris-0.038
dad-0.035
aren-0.034
Dan-0.033
oys-0.033
Token to
Token ID284
Feature activation+1.47e-02

Pos logit contributions
ris+0.013
dad+0.013
aren+0.012
atherine+0.012
Dan+0.012
Neg logit contributions
 weaker-0.015
cient-0.014
 sy-0.014
 wing-0.014
 worse-0.014
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
cient+0.000
 weaker+0.000
 sy+0.000
 boo+0.000
 wing+0.000
Neg logit contributions
ris-0.000
dad-0.000
aren-0.000
oys-0.000
ure-0.000
Token biggest
Token ID4094
Feature activation+1.04e+00

Pos logit contributions
cient+0.003
 sy+0.003
 weaker+0.003
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.004
dad-0.004
oys-0.003
aren-0.003
Dan-0.003
Token,
Token ID11
Feature activation-4.25e-02

Pos logit contributions
 wing+0.030
 sy+0.029
 weaker+0.029
 boo+0.029
cient+0.028
Neg logit contributions
ris-0.027
dad-0.026
Dan-0.025
ae-0.025
Adam-0.025
Token sh
Token ID427
Feature activation+1.97e+00

Pos logit contributions
ris+0.001
dad+0.001
 than+0.001
aren+0.001
oys+0.001
Neg logit contributions
cient-0.001
 sy-0.001
 weaker-0.001
onel-0.001
 boo-0.001
Tokenini
Token ID5362
Feature activation+2.90e+00

Pos logit contributions
onel+0.033
cient+0.031
 weaker+0.029
 wing+0.029
 boo+0.026
Neg logit contributions
 recess-0.075
ris-0.071
 Rebecca-0.070
atherine-0.070
 Adam-0.069
Tokenest
Token ID395
Feature activation+7.29e-01

Pos logit contributions
cient+0.068
 weaker+0.065
 sy+0.064
 boo+0.061
onel+0.058
Neg logit contributions
ris-0.095
dad-0.090
aren-0.085
oys-0.085
Dan-0.084
Token fish
Token ID5916
Feature activation+1.13e+00

Pos logit contributions
 weaker+0.019
 sy+0.019
cient+0.018
 boo+0.018
 wing+0.018
Neg logit contributions
ris-0.022
dad-0.020
Dan-0.019
oys-0.019
aren-0.019
Token in
Token ID287
Feature activation+2.01e-01

Pos logit contributions
cient+0.037
 weaker+0.036
 sy+0.036
 boo+0.034
onel+0.033
Neg logit contributions
ris-0.026
dad-0.024
oys-0.022
aren-0.022
Dan-0.021
Token the
Token ID262
Feature activation+2.91e-01

Pos logit contributions
 weaker+0.004
cient+0.003
 sy+0.003
 wing+0.003
 boo+0.003
Neg logit contributions
ris-0.008
dad-0.007
oys-0.007
aren-0.007
ure-0.007
Token pond
Token ID16723
Feature activation-6.00e-01

Pos logit contributions
cient+0.009
 weaker+0.008
 sy+0.008
 boo+0.008
 wing+0.008
Neg logit contributions
ris-0.008
dad-0.007
oys-0.007
aren-0.007
ae-0.007
Token for
Token ID329
Feature activation+3.84e-02

Pos logit contributions
cient+0.013
 sy+0.012
 weaker+0.012
 boo+0.012
onel+0.011
Neg logit contributions
ris-0.010
dad-0.009
oys-0.008
aren-0.008
ae-0.008
Token a
Token ID257
Feature activation-1.75e-01

Pos logit contributions
 weaker+0.001
 worse+0.001
cient+0.001
 sy+0.001
 boo+0.001
Neg logit contributions
ris-0.002
aren-0.001
dad-0.001
atherine-0.001
Dan-0.001
Token while
Token ID981
Feature activation-9.62e-01

Pos logit contributions
ris+0.003
dad+0.003
aren+0.002
oys+0.002
Dan+0.002
Neg logit contributions
cient-0.007
 weaker-0.007
 sy-0.007
 boo-0.006
 wing-0.006
Token.
Token ID13
Feature activation-7.43e-02

Pos logit contributions
ris+0.027
atherine+0.025
aren+0.025
dad+0.025
ure+0.024
Neg logit contributions
 weaker-0.025
 sy-0.024
cient-0.023
 wing-0.023
 worse-0.023
Token "
Token ID366
Feature activation+8.55e-01

Pos logit contributions
ris+0.002
dad+0.002
aren+0.002
ae+0.002
ure+0.002
Neg logit contributions
cient-0.002
 weaker-0.002
 sy-0.002
 boo-0.002
onel-0.001
TokenLet
Token ID5756
Feature activation+1.44e+00

Pos logit contributions
 weaker+0.019
cient+0.019
 sy+0.018
 boo+0.017
onel+0.016
Neg logit contributions
ris-0.027
dad-0.027
aren-0.025
ae-0.024
oys-0.024
Token's
Token ID338
Feature activation+1.90e+00

Pos logit contributions
 sy+0.044
 weaker+0.044
nam+0.042
cient+0.041
onel+0.040
Neg logit contributions
dad-0.038
!-0.035
ue-0.033
Tom-0.032
ure-0.032
Token bury
Token ID30033
Feature activation+4.19e-01

Pos logit contributions
 sy+0.053
 weaker+0.053
cient+0.053
onel+0.048
 boo+0.046
Neg logit contributions
ris-0.052
dad-0.050
oys-0.045
ure-0.044
 ahead-0.044
Token this
Token ID428
Feature activation+1.46e+00

Pos logit contributions
cient+0.011
 weaker+0.011
 sy+0.010
 boo+0.010
onel+0.010
Neg logit contributions
ris-0.013
dad-0.012
oys-0.011
aren-0.011
Dan-0.011
Token smooth
Token ID7209
Feature activation+6.67e-01

Pos logit contributions
cient+0.051
 sy+0.047
onel+0.044
 weaker+0.044
 proc+0.043
Neg logit contributions
ris-0.031
 than-0.031
!-0.027
 ahead-0.027
dad-0.027
Token stone
Token ID7815
Feature activation+1.29e+00

Pos logit contributions
cient+0.021
 weaker+0.020
 sy+0.020
onel+0.019
 boo+0.018
Neg logit contributions
ris-0.017
dad-0.016
 than-0.015
oys-0.015
ure-0.014
Token could
Token ID714
Feature activation+2.67e-01

Pos logit contributions
cient+0.014
 weaker+0.014
 sy+0.013
 infected+0.012
 boo+0.012
Neg logit contributions
ris-0.013
dad-0.013
 than-0.012
ure-0.011
ager-0.011
Token move
Token ID1445
Feature activation-2.61e-01

Pos logit contributions
cient+0.008
 weaker+0.008
 sy+0.008
onel+0.007
arre+0.007
Neg logit contributions
ris-0.006
dad-0.006
 than-0.006
ae-0.006
Dan-0.005
Token and
Token ID290
Feature activation+9.51e-02

Pos logit contributions
ris+0.009
oys+0.008
dad+0.008
aren+0.008
ae+0.008
Neg logit contributions
 weaker-0.006
cient-0.006
 sy-0.006
 worse-0.006
 wing-0.005
Token stretch
Token ID7539
Feature activation+1.25e-01

Pos logit contributions
cient+0.003
 sy+0.003
 weaker+0.003
onel+0.003
 infected+0.003
Neg logit contributions
ris-0.002
dad-0.002
 than-0.002
 ahead-0.002
Dan-0.002
Token in
Token ID287
Feature activation+1.30e+00

Pos logit contributions
cient+0.004
 sy+0.003
 weaker+0.003
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.003
dad-0.003
 than-0.003
oys-0.003
Dan-0.003
Token any
Token ID597
Feature activation+1.16e+00

Pos logit contributions
 weaker+0.030
 sy+0.027
cient+0.027
 boo+0.027
 wing+0.026
Neg logit contributions
ris-0.042
dad-0.041
oys-0.040
aren-0.040
ar-0.039
Token way
Token ID835
Feature activation+1.14e-01

Pos logit contributions
cient+0.037
 sy+0.035
 weaker+0.034
 boo+0.033
onel+0.033
Neg logit contributions
ris-0.028
dad-0.027
 than-0.024
Dan-0.024
ure-0.023
Token she
Token ID673
Feature activation+3.19e-01

Pos logit contributions
 weaker+0.003
cient+0.003
 sy+0.003
 wing+0.003
onel+0.003
Neg logit contributions
ris-0.003
dad-0.003
oys-0.003
aren-0.003
Dan-0.003
Token wanted
Token ID2227
Feature activation+2.74e-01

Pos logit contributions
cient+0.008
 sy+0.008
 weaker+0.008
 boo+0.007
 infected+0.007
Neg logit contributions
ris-0.010
dad-0.009
ar-0.008
Dan-0.008
 than-0.008
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
 weaker+0.008
cient+0.008
 sy+0.008
 wing+0.008
 boo+0.008
Neg logit contributions
ris-0.007
dad-0.007
aren-0.006
Dan-0.006
oys-0.006
Token␣
Token ID220
Feature activation+7.00e-01

Pos logit contributions
cient+0.004
 weaker+0.004
 sy+0.004
 boo+0.004
onel+0.004
Neg logit contributions
ris-0.004
dad-0.004
oys-0.004
aren-0.004
Dan-0.004
Token ride
Token ID6594
Feature activation-1.47e-01

Pos logit contributions
ris+0.026
dad+0.024
aren+0.022
oys+0.022
Dan+0.022
Neg logit contributions
cient-0.029
 weaker-0.028
 sy-0.028
 boo-0.027
onel-0.026
Token in
Token ID287
Feature activation+8.61e-01

Pos logit contributions
ris+0.004
dad+0.004
aren+0.004
oys+0.004
Dan+0.004
Neg logit contributions
cient-0.004
 weaker-0.004
 sy-0.004
 boo-0.004
onel-0.004
Token the
Token ID262
Feature activation+2.72e-01

Pos logit contributions
 sy+0.015
 weaker+0.014
cient+0.014
 boo+0.014
 wing+0.013
Neg logit contributions
ris-0.032
dad-0.030
oys-0.030
ar-0.029
ae-0.029
Token elevator
Token ID20932
Feature activation+4.54e-01

Pos logit contributions
cient+0.010
 weaker+0.010
 sy+0.010
onel+0.010
 boo+0.009
Neg logit contributions
ris-0.005
dad-0.005
aren-0.004
Dan-0.004
ure-0.004
Token and
Token ID290
Feature activation+1.86e-01

Pos logit contributions
cient+0.016
 weaker+0.016
 sy+0.016
 boo+0.015
onel+0.015
Neg logit contributions
ris-0.009
dad-0.008
aren-0.007
oys-0.007
Dan-0.007
Token see
Token ID766
Feature activation+7.15e-01

Pos logit contributions
 weaker+0.006
 sy+0.006
cient+0.006
onel+0.006
 infected+0.006
Neg logit contributions
ris-0.004
dad-0.004
aren-0.004
 ahead-0.004
Dan-0.003
Token if
Token ID611
Feature activation+7.06e-01

Pos logit contributions
cient+0.022
 sy+0.021
 boo+0.021
 weaker+0.021
onel+0.020
Neg logit contributions
ris-0.018
dad-0.016
 than-0.016
 ahead-0.016
aren-0.015
Token she
Token ID673
Feature activation+7.66e-01

Pos logit contributions
cient+0.024
 sy+0.023
 weaker+0.022
 boo+0.022
onel+0.022
Neg logit contributions
dad-0.015
ris-0.015
aren-0.014
Dan-0.013
ae-0.013
Token could
Token ID714
Feature activation+7.71e-01

Pos logit contributions
cient+0.018
 weaker+0.017
 sy+0.017
 boo+0.016
onel+0.015
Neg logit contributions
ris-0.025
dad-0.024
oys-0.022
aren-0.022
Dan-0.022
Token find
Token ID1064
Feature activation+1.00e+00

Pos logit contributions
cient+0.022
 weaker+0.021
 sy+0.021
 boo+0.020
onel+0.019
Neg logit contributions
ris-0.022
dad-0.021
Dan-0.019
aren-0.019
 than-0.018
Token her
Token ID607
Feature activation-1.60e-02

Pos logit contributions
cient+0.026
 weaker+0.025
 sy+0.025
 boo+0.024
onel+0.023
Neg logit contributions
ris-0.030
dad-0.029
aren-0.026
oys-0.026
Dan-0.026
TokenI
Token ID40
Feature activation-3.57e-01

Pos logit contributions
 weaker+0.003
 sy+0.003
cient+0.003
 boo+0.002
onel+0.002
Neg logit contributions
ris-0.003
dad-0.003
oys-0.003
ar-0.003
aren-0.003
Token knew
Token ID2993
Feature activation+6.03e-01

Pos logit contributions
ris+0.012
dad+0.012
oys+0.011
aren+0.011
ae+0.011
Neg logit contributions
cient-0.007
 sy-0.007
 boo-0.007
 weaker-0.007
onel-0.006
Token you
Token ID345
Feature activation+5.69e-02

Pos logit contributions
cient+0.018
 weaker+0.018
 sy+0.018
 boo+0.017
onel+0.016
Neg logit contributions
ris-0.016
dad-0.015
aren-0.013
oys-0.013
 than-0.013
Token could
Token ID714
Feature activation+6.40e-01

Pos logit contributions
cient+0.001
 weaker+0.001
 sy+0.001
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.002
dad-0.002
aren-0.002
oys-0.002
Dan-0.002
Token do
Token ID466
Feature activation+4.76e-01

Pos logit contributions
cient+0.015
 weaker+0.014
 sy+0.014
 boo+0.013
onel+0.013
Neg logit contributions
ris-0.021
dad-0.020
aren-0.019
oys-0.019
Dan-0.018
Token it
Token ID340
Feature activation+1.26e+00

Pos logit contributions
cient+0.014
 weaker+0.014
 sy+0.014
 boo+0.013
onel+0.013
Neg logit contributions
ris-0.012
dad-0.012
aren-0.011
oys-0.011
Dan-0.011
Token,
Token ID11
Feature activation+5.02e-01

Pos logit contributions
cient+0.045
 sy+0.044
 weaker+0.043
 boo+0.041
 wing+0.040
Neg logit contributions
ris-0.027
dad-0.025
 than-0.023
ae-0.022
oys-0.022
Token Lily
Token ID20037
Feature activation+2.31e-01

Pos logit contributions
cient+0.015
 weaker+0.014
 sy+0.014
 boo+0.013
onel+0.013
Neg logit contributions
ris-0.013
dad-0.013
oys-0.012
aren-0.012
Dan-0.011
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
cient+0.008
 weaker+0.008
 sy+0.008
 boo+0.008
onel+0.008
Neg logit contributions
ris-0.005
dad-0.004
oys-0.004
aren-0.004
Dan-0.004
Token You
Token ID921
Feature activation-3.06e-01

Pos logit contributions
ris+0.005
dad+0.005
Dan+0.004
aren+0.004
oys+0.004
Neg logit contributions
cient-0.005
 sy-0.004
 weaker-0.004
 boo-0.004
onel-0.004
Token always
Token ID1464
Feature activation+8.72e-02

Pos logit contributions
ris+0.010
dad+0.009
ure+0.008
aren+0.008
oys+0.008
Neg logit contributions
 weaker-0.008
 sy-0.008
cient-0.008
 boo-0.007
 wing-0.007
Token.
Token ID13
Feature activation+6.72e-02

Pos logit contributions
cient+0.050
 sy+0.047
 weaker+0.046
onel+0.045
 increasingly+0.045
Neg logit contributions
 than-0.012
!-0.011
ris-0.011
 ahead-0.008
oe-0.008
Token But
Token ID887
Feature activation-1.22e+00

Pos logit contributions
 weaker+0.002
cient+0.002
 sy+0.002
onel+0.002
 boo+0.002
Neg logit contributions
ris-0.002
oys-0.001
dad-0.001
 than-0.001
aren-0.001
Token let
Token ID1309
Feature activation+1.10e+00

Pos logit contributions
ris+0.024
dad+0.023
oys+0.023
 than+0.023
 ahead+0.021
Neg logit contributions
 sy-0.040
cient-0.040
 weaker-0.038
onel-0.037
 infected-0.036
Token's
Token ID338
Feature activation+1.87e+00

Pos logit contributions
 sy+0.036
onel+0.034
 prope+0.034
 weaker+0.034
nam+0.034
Neg logit contributions
ue-0.024
dad-0.024
Tom-0.023
!-0.023
ris-0.021
Token tie
Token ID9839
Feature activation+6.58e-01

Pos logit contributions
cient+0.053
 weaker+0.052
 sy+0.052
onel+0.049
 infected+0.045
Neg logit contributions
ris-0.052
dad-0.049
oys-0.043
 ahead-0.043
ure-0.042
Token a
Token ID257
Feature activation+8.49e-01

Pos logit contributions
cient+0.024
onel+0.022
 weaker+0.022
 sy+0.022
 proc+0.021
Neg logit contributions
ris-0.012
dad-0.012
 than-0.011
 ahead-0.011
oys-0.011
Token string
Token ID4731
Feature activation+1.02e-01

Pos logit contributions
cient+0.027
onel+0.026
 weaker+0.024
 sy+0.022
 proc+0.022
Neg logit contributions
ris-0.022
 than-0.018
oe-0.018
dad-0.017
ure-0.017
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
cient+0.003
 weaker+0.003
 sy+0.003
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.002
dad-0.002
 than-0.002
oys-0.002
aren-0.002
Token it
Token ID340
Feature activation+9.04e-01

Pos logit contributions
cient+0.033
 weaker+0.032
 sy+0.032
onel+0.031
 proc+0.028
Neg logit contributions
ris-0.034
dad-0.031
oys-0.029
 than-0.029
ure-0.029
Token and
Token ID290
Feature activation+1.10e+00

Pos logit contributions
cient+0.033
 weaker+0.033
 sy+0.032
onel+0.032
 proc+0.031
Neg logit contributions
ris-0.017
 than-0.015
 ahead-0.013
!-0.013
oe-0.012
Token make
Token ID787
Feature activation+2.12e+00

Pos logit contributions
cient+0.035
 weaker+0.034
onel+0.033
 sy+0.033
 increasingly+0.030
Neg logit contributions
ris-0.026
dad-0.023
 ahead-0.022
 than-0.022
oys-0.021
Token.
Token ID13
Feature activation+2.81e-03

Pos logit contributions
ris+0.018
dad+0.017
aren+0.015
oys+0.015
Dan+0.015
Neg logit contributions
cient-0.032
 weaker-0.031
 sy-0.031
 boo-0.030
 wing-0.029
Token And
Token ID843
Feature activation-8.27e-01

Pos logit contributions
cient+0.000
 weaker+0.000
 sy+0.000
 boo+0.000
onel+0.000
Neg logit contributions
ris-0.000
dad-0.000
oys-0.000
aren-0.000
Dan-0.000
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
ris+0.022
dad+0.021
aren+0.019
oys+0.019
Dan+0.019
Neg logit contributions
cient-0.024
 weaker-0.023
 sy-0.023
 boo-0.022
onel-0.021
Token tree
Token ID5509
Feature activation-1.89e-01

Pos logit contributions
cient+0.004
 sy+0.004
 weaker+0.004
onel+0.004
 proc+0.003
Neg logit contributions
dad-0.004
ris-0.004
Dan-0.004
oys-0.003
 than-0.003
Token continued
Token ID3767
Feature activation-3.80e-02

Pos logit contributions
ris+0.005
dad+0.005
aren+0.005
 than+0.005
oys+0.005
Neg logit contributions
cient-0.005
 sy-0.005
 weaker-0.005
 boo-0.005
onel-0.005
Token to
Token ID284
Feature activation-4.51e-01

Pos logit contributions
ris+0.001
dad+0.001
 than+0.001
 ahead+0.001
oys+0.001
Neg logit contributions
cient-0.001
 sy-0.001
 weaker-0.001
onel-0.001
 boo-0.001
Token grow
Token ID1663
Feature activation-4.77e-01

Pos logit contributions
ris+0.014
dad+0.014
Dan+0.013
 than+0.012
ure+0.012
Neg logit contributions
cient-0.011
 weaker-0.010
 sy-0.010
onel-0.009
 boo-0.009
Token and
Token ID290
Feature activation+3.27e-01

Pos logit contributions
ris+0.009
 than+0.008
dad+0.008
!+0.007
ar+0.007
Neg logit contributions
cient-0.018
 boo-0.017
 sy-0.017
 proc-0.016
onel-0.016
Token grow
Token ID1663
Feature activation-7.26e-01

Pos logit contributions
cient+0.010
 sy+0.010
 weaker+0.010
onel+0.009
 boo+0.009
Neg logit contributions
ris-0.008
dad-0.007
 than-0.007
Dan-0.007
oys-0.007
Token,
Token ID11
Feature activation+2.82e-01

Pos logit contributions
ris+0.014
 than+0.014
dad+0.013
ar+0.012
!+0.012
Neg logit contributions
cient-0.026
 boo-0.026
 sy-0.025
 proc-0.023
onel-0.022
Token making
Token ID1642
Feature activation+1.45e+00

Pos logit contributions
cient+0.009
 sy+0.009
 weaker+0.009
 boo+0.008
onel+0.008
Neg logit contributions
ris-0.007
dad-0.006
oys-0.006
aren-0.006
 than-0.006
Token Jack
Token ID3619
Feature activation+5.52e-01

Pos logit contributions
ris+0.001
dad+0.001
oys+0.001
aren+0.001
Dan+0.001
Neg logit contributions
cient-0.001
 weaker-0.001
 sy-0.001
 boo-0.001
onel-0.001
Token.
Token ID13
Feature activation+2.86e-02

Pos logit contributions
cient+0.020
 sy+0.020
 weaker+0.020
 boo+0.019
 wing+0.018
Neg logit contributions
ris-0.010
dad-0.010
aren-0.009
oys-0.009
Dan-0.009
Token Jack
Token ID3619
Feature activation-1.53e-01

Pos logit contributions
cient+0.001
 weaker+0.001
 en+0.001
 younger+0.001
 Mos+0.001
Neg logit contributions
ris-0.001
aren-0.001
dad-0.001
ar-0.001
oys-0.001
Token loved
Token ID6151
Feature activation-7.43e-01

Pos logit contributions
ris+0.004
dad+0.004
 than+0.004
oys+0.004
aren+0.004
Neg logit contributions
cient-0.004
 weaker-0.004
 sy-0.004
onel-0.004
 boo-0.004
Token playing
Token ID2712
Feature activation+1.27e+00

Pos logit contributions
dad+0.013
ris+0.012
 than+0.012
Dan+0.011
 recess+0.010
Neg logit contributions
onel-0.027
cient-0.027
 weaker-0.026
ivery-0.025
 sy-0.025
Token in
Token ID287
Feature activation-4.95e-02

Pos logit contributions
 weaker+0.046
cient+0.045
 sy+0.044
onel+0.042
 boo+0.042
Neg logit contributions
ris-0.024
dad-0.023
 than-0.022
ar-0.021
ure-0.020
Token his
Token ID465
Feature activation-3.95e-01

Pos logit contributions
ris+0.002
dad+0.002
aren+0.002
oys+0.002
ae+0.002
Neg logit contributions
 weaker-0.001
cient-0.001
 sy-0.001
 boo-0.001
 wing-0.001
Token garden
Token ID11376
Feature activation-1.64e-01

Pos logit contributions
ris+0.011
dad+0.011
oys+0.010
 than+0.010
Dan+0.010
Neg logit contributions
cient-0.010
 weaker-0.010
 sy-0.010
onel-0.009
 boo-0.009
Token,
Token ID11
Feature activation-5.85e-01

Pos logit contributions
ris+0.004
dad+0.004
aren+0.004
oys+0.004
Dan+0.004
Neg logit contributions
cient-0.005
 weaker-0.005
 sy-0.005
 boo-0.005
 wing-0.005
Token and
Token ID290
Feature activation+7.93e-02

Pos logit contributions
ris+0.013
dad+0.012
 than+0.011
Dan+0.011
oys+0.011
Neg logit contributions
cient-0.020
 sy-0.019
 weaker-0.018
onel-0.018
 boo-0.018
Token each
Token ID1123
Feature activation+3.46e-01

Pos logit contributions
cient+0.003
 sy+0.003
 weaker+0.003
onel+0.003
 boo+0.002
Neg logit contributions
dad-0.002
ris-0.002
 than-0.001
Dan-0.001
ae-0.001
Token day
Token ID1110
Feature activation+9.19e-01

Pos logit contributions
ris+0.009
dad+0.009
aren+0.007
Dan+0.007
oys+0.007
Neg logit contributions
cient-0.020
 sy-0.020
 weaker-0.020
 boo-0.019
onel-0.019
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
 weaker+0.038
cient+0.037
 sy+0.037
 boo+0.035
onel+0.035
Neg logit contributions
dad-0.013
ris-0.013
oe-0.012
oys-0.012
Dan-0.011
Token Mam
Token ID29926
Feature activation+1.65e-01

Pos logit contributions
cient+0.003
 sy+0.003
 weaker+0.003
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.004
dad-0.004
aren-0.003
oys-0.003
Dan-0.003
Tokenma
Token ID2611
Feature activation-2.10e-01

Pos logit contributions
cient+0.006
 weaker+0.006
 boo+0.006
 sy+0.006
onel+0.005
Neg logit contributions
ris-0.003
 than-0.003
oys-0.003
ure-0.003
dad-0.003
Token was
Token ID373
Feature activation-1.72e-01

Pos logit contributions
ris+0.006
dad+0.006
oys+0.005
aren+0.005
Dan+0.005
Neg logit contributions
cient-0.006
 weaker-0.005
 sy-0.005
 boo-0.005
onel-0.005
Token very
Token ID845
Feature activation-4.64e-01

Pos logit contributions
 ahead+0.004
dad+0.004
ris+0.003
oe+0.003
 years+0.003
Neg logit contributions
cient-0.006
 sy-0.005
 weaker-0.005
 affect-0.005
adoes-0.005
Token busy
Token ID8179
Feature activation+7.60e-02

Pos logit contributions
dad+0.011
oe+0.011
 ahead+0.011
ae+0.010
ris+0.010
Neg logit contributions
 sy-0.014
adoes-0.013
cient-0.013
 weaker-0.013
bul-0.013
Token.
Token ID13
Feature activation-6.79e-02

Pos logit contributions
cient+0.002
 weaker+0.002
 sy+0.002
 boo+0.002
onel+0.002
Neg logit contributions
ris-0.002
dad-0.002
 than-0.002
oys-0.002
aren-0.002
Token She
Token ID1375
Feature activation-2.23e-01

Pos logit contributions
ris+0.002
dad+0.002
aren+0.002
oys+0.002
Dan+0.002
Neg logit contributions
cient-0.002
 sy-0.002
 weaker-0.002
 boo-0.002
onel-0.001
Token was
Token ID373
Feature activation-6.98e-01

Pos logit contributions
ris+0.005
dad+0.005
 than+0.004
aren+0.004
ure+0.004
Neg logit contributions
cient-0.007
 weaker-0.007
 sy-0.007
onel-0.007
 wing-0.007
Token making
Token ID1642
Feature activation+7.07e-01

Pos logit contributions
 ahead+0.015
dad+0.014
 than+0.014
 years+0.013
ris+0.013
Neg logit contributions
cient-0.023
 sy-0.022
 weaker-0.021
onel-0.021
 affect-0.020
Token He
Token ID679
Feature activation-1.08e+00

Pos logit contributions
ris+0.009
dad+0.008
aren+0.008
ure+0.008
ae+0.007
Neg logit contributions
cient-0.006
 sy-0.006
 weaker-0.006
 boo-0.006
onel-0.006
Token learned
Token ID4499
Feature activation+1.08e-02

Pos logit contributions
ris+0.031
dad+0.030
aren+0.027
oys+0.027
Dan+0.027
Neg logit contributions
cient-0.029
 weaker-0.028
 sy-0.028
 boo-0.026
onel-0.026
Token that
Token ID326
Feature activation+1.21e+00

Pos logit contributions
cient+0.000
 weaker+0.000
 sy+0.000
onel+0.000
 boo+0.000
Neg logit contributions
ris-0.000
dad-0.000
oys-0.000
 than-0.000
Dan-0.000
Token it
Token ID340
Feature activation+4.81e-01

Pos logit contributions
cient+0.042
 sy+0.036
onel+0.035
 weaker+0.034
ivery+0.034
Neg logit contributions
dad-0.026
 than-0.026
ris-0.025
Dan-0.024
oys-0.024
Token was
Token ID373
Feature activation+1.50e+00

Pos logit contributions
cient+0.016
 sy+0.015
 weaker+0.014
onel+0.014
 boo+0.014
Neg logit contributions
ris-0.011
 than-0.010
dad-0.010
oys-0.010
ure-0.009
Token fun
Token ID1257
Feature activation-4.67e-01

Pos logit contributions
cient+0.042
 sy+0.038
onel+0.036
 weaker+0.035
 boo+0.035
Neg logit contributions
ris-0.044
dad-0.042
 than-0.041
oys-0.039
Dan-0.038
Token and
Token ID290
Feature activation+2.97e-01

Pos logit contributions
ris+0.014
dad+0.013
aren+0.013
Dan+0.013
ae+0.012
Neg logit contributions
 weaker-0.012
 sy-0.012
cient-0.011
 boo-0.011
 wing-0.011
Token beautiful
Token ID4950
Feature activation+9.60e-01

Pos logit contributions
cient+0.010
 sy+0.009
 weaker+0.009
onel+0.009
 boo+0.009
Neg logit contributions
ris-0.007
dad-0.006
 than-0.006
oys-0.006
 ahead-0.006
Token and
Token ID290
Feature activation+5.00e-01

Pos logit contributions
cient+0.034
 weaker+0.034
 sy+0.034
 boo+0.032
 wing+0.032
Neg logit contributions
ris-0.019
dad-0.018
aren-0.016
Dan-0.016
oys-0.016
Token full
Token ID1336
Feature activation+5.65e-01

Pos logit contributions
cient+0.015
 sy+0.014
 weaker+0.014
 boo+0.013
onel+0.013
Neg logit contributions
ris-0.013
dad-0.012
 than-0.011
oys-0.011
aren-0.011
Token of
Token ID286
Feature activation+1.23e+00

Pos logit contributions
cient+0.017
 weaker+0.016
 sy+0.016
 boo+0.015
onel+0.015
Neg logit contributions
ris-0.015
dad-0.014
oys-0.013
aren-0.013
 than-0.012
Token not
Token ID407
Feature activation-5.16e-01

Pos logit contributions
cient+0.003
 sy+0.002
 weaker+0.002
onel+0.002
 infected+0.002
Neg logit contributions
dad-0.002
ris-0.002
 than-0.002
ae-0.002
Dan-0.002
Token look
Token ID804
Feature activation-7.25e-02

Pos logit contributions
ris+0.012
dad+0.012
 than+0.012
 ahead+0.011
ae+0.011
Neg logit contributions
cient-0.016
 sy-0.015
 weaker-0.014
onel-0.014
 infected-0.014
Token at
Token ID379
Feature activation-3.20e-01

Pos logit contributions
ris+0.002
dad+0.002
 than+0.002
 ahead+0.002
oys+0.002
Neg logit contributions
cient-0.002
 sy-0.002
 weaker-0.002
onel-0.002
 boo-0.002
Token the
Token ID262
Feature activation-8.21e-01

Pos logit contributions
ris+0.009
dad+0.009
aren+0.008
oys+0.008
 than+0.008
Neg logit contributions
cient-0.009
 sy-0.009
 weaker-0.009
 boo-0.008
onel-0.008
Token sky
Token ID6766
Feature activation-4.00e-01

Pos logit contributions
ris+0.022
dad+0.021
 than+0.019
Dan+0.019
oys+0.019
Neg logit contributions
cient-0.023
 sy-0.022
 weaker-0.022
onel-0.021
 boo-0.020
Token.
Token ID13
Feature activation-4.62e-04

Pos logit contributions
ris+0.008
dad+0.007
oys+0.006
aren+0.006
 than+0.006
Neg logit contributions
cient-0.015
 sy-0.014
 weaker-0.014
 boo-0.013
onel-0.013
Token<|endoftext|>
Token ID50256
Feature activation-1.76e-01

Pos logit contributions
ris+0.000
dad+0.000
aren+0.000
oys+0.000
Dan+0.000
Neg logit contributions
cient-0.000
 weaker-0.000
 sy-0.000
 boo-0.000
onel-0.000
TokenAnna
Token ID31160
Feature activation-1.12e-01

Pos logit contributions
ris+0.005
dad+0.005
oys+0.005
aren+0.005
ae+0.005
Neg logit contributions
cient-0.004
 sy-0.004
 weaker-0.004
 boo-0.004
 wing-0.004
Token liked
Token ID8288
Feature activation-1.12e+00

Pos logit contributions
ris+0.003
dad+0.003
oys+0.003
aren+0.003
ae+0.003
Neg logit contributions
 sy-0.003
cient-0.003
 boo-0.003
 weaker-0.003
 wing-0.003
Token to
Token ID284
Feature activation-3.30e-01

Pos logit contributions
dad+0.015
 recess+0.012
 than+0.012
Dan+0.011
Tom+0.011
Neg logit contributions
onel-0.049
cient-0.046
 weaker-0.045
ivery-0.045
 infected-0.044
Token cook
Token ID4255
Feature activation-6.46e-02

Pos logit contributions
dad+0.008
ris+0.008
Dan+0.007
ae+0.007
 recess+0.007
Neg logit contributions
 weaker-0.011
cient-0.010
onel-0.010
 sy-0.010
 infected-0.009
Token She
Token ID1375
Feature activation+5.82e-02

Pos logit contributions
cient+0.008
 weaker+0.008
 sy+0.008
onel+0.007
 boo+0.007
Neg logit contributions
ris-0.008
dad-0.008
oys-0.007
aren-0.007
Dan-0.007
Token sang
Token ID25889
Feature activation+8.85e-01

Pos logit contributions
cient+0.002
 weaker+0.002
 sy+0.002
 boo+0.002
onel+0.002
Neg logit contributions
ris-0.001
dad-0.001
aren-0.001
oys-0.001
Dan-0.001
Token every
Token ID790
Feature activation-4.46e-01

Pos logit contributions
cient+0.028
 weaker+0.028
 sy+0.028
onel+0.027
 boo+0.027
Neg logit contributions
ris-0.019
dad-0.019
 than-0.019
 ahead-0.018
oys-0.018
Token day
Token ID1110
Feature activation-1.08e+00

Pos logit contributions
ris+0.008
dad+0.007
aren+0.006
oys+0.006
Dan+0.006
Neg logit contributions
cient-0.017
 weaker-0.016
 sy-0.016
 boo-0.016
onel-0.016
Token at
Token ID379
Feature activation-9.30e-01

Pos logit contributions
ris+0.026
dad+0.025
aren+0.023
Dan+0.023
oys+0.023
Neg logit contributions
cient-0.034
 sy-0.033
 weaker-0.033
 boo-0.031
 wing-0.031
Token home
Token ID1363
Feature activation-2.02e+00

Pos logit contributions
ris+0.022
dad+0.022
Dan+0.019
 recess+0.019
aren+0.019
Neg logit contributions
cient-0.029
 weaker-0.028
 sy-0.028
onel-0.027
 boo-0.026
Token,
Token ID11
Feature activation-4.97e-01

Pos logit contributions
ris+0.043
dad+0.040
aren+0.037
Dan+0.035
oys+0.035
Neg logit contributions
cient-0.070
 sy-0.069
 weaker-0.069
 boo-0.065
 wing-0.064
Token at
Token ID379
Feature activation-5.29e-01

Pos logit contributions
ris+0.010
dad+0.010
 than+0.009
aren+0.009
Dan+0.008
Neg logit contributions
cient-0.017
 weaker-0.016
 sy-0.016
onel-0.016
 infected-0.016
Token school
Token ID1524
Feature activation-2.03e+00

Pos logit contributions
ris+0.013
dad+0.013
aren+0.011
oys+0.011
Dan+0.011
Neg logit contributions
cient-0.016
 weaker-0.016
 sy-0.016
 boo-0.015
onel-0.015
Token,
Token ID11
Feature activation-2.02e-01

Pos logit contributions
ris+0.049
dad+0.047
Dan+0.044
aren+0.043
oys+0.042
Neg logit contributions
cient-0.064
 sy-0.062
 weaker-0.060
 boo-0.057
onel-0.056
Token and
Token ID290
Feature activation+5.62e-01

Pos logit contributions
ris+0.004
dad+0.004
 than+0.004
aren+0.003
oys+0.003
Neg logit contributions
cient-0.007
 weaker-0.007
 sy-0.007
 infected-0.006
onel-0.006
Token<|endoftext|>
Token ID50256
Feature activation-4.19e-01

Pos logit contributions
ris+0.003
dad+0.003
oys+0.003
ure+0.003
aren+0.003
Neg logit contributions
cient-0.003
 sy-0.002
 weaker-0.002
 boo-0.002
 wing-0.002
TokenOnce
Token ID7454
Feature activation+5.74e-01

Pos logit contributions
ris+0.013
dad+0.013
oys+0.012
ae+0.012
ure+0.012
Neg logit contributions
cient-0.009
 weaker-0.009
 sy-0.009
onel-0.008
 boo-0.008
Token there
Token ID612
Feature activation+1.41e+00

Pos logit contributions
cient+0.012
 weaker+0.012
 sy+0.012
 boo+0.011
onel+0.010
Neg logit contributions
ris-0.020
dad-0.019
aren-0.018
oys-0.018
Dan-0.018
Token was
Token ID373
Feature activation-1.58e-01

Pos logit contributions
cient+0.034
 weaker+0.033
 sy+0.033
 boo+0.031
onel+0.030
Neg logit contributions
ris-0.044
dad-0.042
oys-0.040
aren-0.039
Dan-0.039
Token a
Token ID257
Feature activation-2.89e-01

Pos logit contributions
ris+0.004
dad+0.004
Dan+0.004
 than+0.004
aren+0.003
Neg logit contributions
cient-0.005
 sy-0.005
 weaker-0.005
onel-0.004
 boo-0.004
Token bar
Token ID2318
Feature activation-2.01e+00

Pos logit contributions
ris+0.012
dad+0.012
aren+0.011
oys+0.011
ure+0.011
Neg logit contributions
 boo-0.004
cient-0.004
 sy-0.004
 weaker-0.004
 exc-0.004
Tokenber
Token ID527
Feature activation-7.11e-01

Pos logit contributions
dad+0.054
ris+0.054
oys+0.052
aren+0.052
andra+0.052
Neg logit contributions
cient-0.052
 sy-0.050
 weaker-0.046
onel-0.045
 wing-0.045
Token named
Token ID3706
Feature activation-9.15e-01

Pos logit contributions
ris+0.020
dad+0.020
Dan+0.020
oys+0.020
aren+0.019
Neg logit contributions
cient-0.018
 sy-0.018
 boo-0.016
 wing-0.016
 weaker-0.016
Token Sam
Token ID3409
Feature activation-6.43e-01

Pos logit contributions
ris+0.023
dad+0.022
oys+0.020
aren+0.020
Dan+0.020
Neg logit contributions
cient-0.028
 sy-0.027
 weaker-0.027
 boo-0.026
onel-0.025
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
ris+0.012
dad+0.011
oys+0.010
aren+0.010
Dan+0.009
Neg logit contributions
cient-0.024
 weaker-0.024
 sy-0.023
 boo-0.023
onel-0.022
Token He
Token ID679
Feature activation-5.38e-01

Pos logit contributions
dad+0.003
ris+0.003
aren+0.003
ure+0.003
ae+0.003
Neg logit contributions
cient-0.004
 weaker-0.003
 sy-0.003
 Mos-0.003
 boo-0.003
Token not
Token ID407
Feature activation-1.47e+00

Pos logit contributions
ris+0.005
dad+0.005
aren+0.004
oys+0.004
Dan+0.004
Neg logit contributions
cient-0.006
 weaker-0.006
 sy-0.006
 boo-0.006
onel-0.006
Token listen
Token ID6004
Feature activation-1.59e+00

Pos logit contributions
ris+0.046
aren+0.043
dad+0.043
atherine+0.041
ure+0.040
Neg logit contributions
 weaker-0.035
 boo-0.035
 sy-0.035
 worse-0.034
cient-0.033
Token.
Token ID13
Feature activation-4.04e-01

Pos logit contributions
ris+0.042
atherine+0.037
aren+0.037
dad+0.037
Dan+0.036
Neg logit contributions
 weaker-0.047
 sy-0.045
 worse-0.045
 boo-0.044
cient-0.043
Token He
Token ID679
Feature activation-8.39e-01

Pos logit contributions
ris+0.013
dad+0.013
aren+0.012
ure+0.011
oys+0.011
Neg logit contributions
 weaker-0.009
cient-0.009
 sy-0.009
 boo-0.009
 wing-0.008
Token grabs
Token ID22378
Feature activation-1.65e+00

Pos logit contributions
ris+0.024
dad+0.022
Dan+0.020
aren+0.020
 than+0.020
Neg logit contributions
cient-0.024
 weaker-0.023
 sy-0.022
onel-0.021
 wing-0.021
Token Sue
Token ID26089
Feature activation-1.96e+00

Pos logit contributions
ris+0.062
dad+0.059
oys+0.056
ure+0.056
ae+0.056
Neg logit contributions
 weaker-0.028
 boo-0.027
 worse-0.026
cient-0.026
 sy-0.026
Token's
Token ID338
Feature activation+1.02e-01

Pos logit contributions
ris+0.062
dad+0.059
oys+0.055
aren+0.055
 than+0.054
Neg logit contributions
cient-0.048
 weaker-0.045
 sy-0.045
onel-0.041
 boo-0.041
Token hand
Token ID1021
Feature activation-2.60e-01

Pos logit contributions
cient+0.004
 weaker+0.003
onel+0.003
 sy+0.003
 infected+0.003
Neg logit contributions
ris-0.002
dad-0.002
oys-0.002
 than-0.002
aren-0.002
Token and
Token ID290
Feature activation+3.13e-01

Pos logit contributions
ris+0.005
dad+0.005
aren+0.005
Dan+0.005
oys+0.004
Neg logit contributions
cient-0.009
 weaker-0.009
 sy-0.009
 boo-0.009
 wing-0.008
Token pulls
Token ID16194
Feature activation-8.48e-01

Pos logit contributions
cient+0.009
 weaker+0.008
onel+0.008
 sy+0.008
 wing+0.008
Neg logit contributions
ris-0.008
dad-0.008
 than-0.007
Dan-0.007
oys-0.007
Token her
Token ID607
Feature activation+2.59e-01

Pos logit contributions
ris+0.032
dad+0.029
ae+0.029
aren+0.029
ure+0.028
Neg logit contributions
 weaker-0.016
 sy-0.015
 boo-0.014
 worse-0.014
cient-0.014
Token plastic
Token ID7309
Feature activation+9.13e-02

Pos logit contributions
ris+0.004
dad+0.004
aren+0.004
oys+0.003
Dan+0.003
Neg logit contributions
cient-0.003
 weaker-0.003
 sy-0.003
 boo-0.003
onel-0.003
Token wheels
Token ID13666
Feature activation-6.78e-01

Pos logit contributions
cient+0.003
 weaker+0.003
 sy+0.003
 boo+0.003
onel+0.003
Neg logit contributions
ris-0.002
dad-0.002
aren-0.002
oys-0.002
Dan-0.002
Token.
Token ID13
Feature activation+2.38e-01

Pos logit contributions
ris+0.013
dad+0.012
aren+0.011
Dan+0.011
ae+0.011
Neg logit contributions
cient-0.024
 weaker-0.024
 sy-0.024
 boo-0.023
 wing-0.022
Token He
Token ID679
Feature activation-5.71e-01

Pos logit contributions
cient+0.007
 weaker+0.007
 sy+0.007
 boo+0.006
onel+0.006
Neg logit contributions
ris-0.006
dad-0.006
oys-0.005
aren-0.005
ure-0.005
Token could
Token ID714
Feature activation+2.50e-01

Pos logit contributions
ris+0.016
dad+0.015
aren+0.013
Dan+0.013
oys+0.013
Neg logit contributions
cient-0.017
 weaker-0.016
 sy-0.016
onel-0.015
 boo-0.015
Token count
Token ID954
Feature activation-1.99e+00

Pos logit contributions
cient+0.007
 weaker+0.007
 sy+0.007
 boo+0.006
onel+0.006
Neg logit contributions
ris-0.007
dad-0.007
aren-0.006
oys-0.006
Dan-0.006
Token them
Token ID606
Feature activation-1.76e+00

Pos logit contributions
ris+0.053
dad+0.049
 than+0.046
oys+0.046
Dan+0.045
Neg logit contributions
cient-0.059
 weaker-0.056
 sy-0.056
 boo-0.054
onel-0.054
Token and
Token ID290
Feature activation+4.15e-01

Pos logit contributions
ris+0.041
dad+0.037
 than+0.034
oys+0.033
Dan+0.033
Neg logit contributions
cient-0.058
 sy-0.057
 weaker-0.057
 boo-0.055
onel-0.053
Token play
Token ID711
Feature activation+7.26e-01

Pos logit contributions
cient+0.012
 weaker+0.012
 sy+0.012
 boo+0.011
onel+0.011
Neg logit contributions
ris-0.011
dad-0.011
Dan-0.010
oys-0.010
aren-0.009
Token with
Token ID351
Feature activation-5.24e-01

Pos logit contributions
cient+0.023
 weaker+0.022
 sy+0.022
 boo+0.021
onel+0.021
Neg logit contributions
ris-0.018
dad-0.017
oys-0.015
Dan-0.015
aren-0.015
Token them
Token ID606
Feature activation-1.07e+00

Pos logit contributions
ris+0.017
dad+0.016
oys+0.015
aren+0.015
ae+0.015
Neg logit contributions
cient-0.012
 weaker-0.012
 sy-0.012
 boo-0.011
onel-0.010
Token They
Token ID1119
Feature activation-6.82e-01

Pos logit contributions
cient+0.002
 weaker+0.002
 sy+0.002
 boo+0.001
onel+0.001
Neg logit contributions
ris-0.001
dad-0.001
oys-0.001
Dan-0.001
ae-0.001
Token had
Token ID550
Feature activation-7.85e-03

Pos logit contributions
ris+0.017
dad+0.016
Dan+0.014
aren+0.014
oys+0.014
Neg logit contributions
cient-0.021
 sy-0.021
 weaker-0.021
onel-0.020
 boo-0.019
Token a
Token ID257
Feature activation-6.19e-01

Pos logit contributions
ris+0.000
dad+0.000
aren+0.000
oys+0.000
Dan+0.000
Neg logit contributions
cient-0.000
 weaker-0.000
 sy-0.000
 boo-0.000
onel-0.000
Token lot
Token ID1256
Feature activation-1.98e+00

Pos logit contributions
dad+0.019
ris+0.019
oys+0.018
aren+0.018
ure+0.017
Neg logit contributions
cient-0.015
 boo-0.015
 sy-0.015
 weaker-0.015
 wing-0.013
Token of
Token ID286
Feature activation+8.23e-01

Pos logit contributions
atherine+0.083
dad+0.082
andra+0.081
thel+0.081
aren+0.080
Neg logit contributions
 weaker-0.019
 sy-0.017
 worse-0.017
 wing-0.014
onel-0.014
Token fun
Token ID1257
Feature activation-1.86e+00

Pos logit contributions
cient+0.027
 weaker+0.026
 sy+0.026
 boo+0.025
onel+0.024
Neg logit contributions
ris-0.019
dad-0.018
aren-0.016
oys-0.016
Dan-0.016
Token together
Token ID1978
Feature activation-1.47e+00

Pos logit contributions
dad+0.054
aren+0.054
ris+0.053
atherine+0.050
andra+0.049
Neg logit contributions
cient-0.045
 weaker-0.045
 sy-0.043
 wing-0.043
 exc-0.041
Token.
Token ID13
Feature activation+3.00e-01

Pos logit contributions
dad+0.040
aren+0.040
ris+0.039
oys+0.038
atherine+0.037
Neg logit contributions
 weaker-0.038
cient-0.038
 wing-0.036
 sy-0.035
 worse-0.034
Token But
Token ID887
Feature activation-1.54e+00

Pos logit contributions
cient+0.009
 weaker+0.008
 sy+0.008
 boo+0.008
onel+0.007
Neg logit contributions
ris-0.008
dad-0.008
aren-0.007
oys-0.007
Dan-0.007
Token as
Token ID355
Feature activation+2.73e-01

Pos logit contributions
ris+0.036
dad+0.034
Dan+0.031
oys+0.031
 than+0.031
Neg logit contributions
cient-0.050
 sy-0.049
 weaker-0.048
 boo-0.046
onel-0.045
Token the
Token ID262
Feature activation+3.39e-01

Pos logit contributions
 weaker+0.007
cient+0.007
 worse+0.007
onel+0.006
 sy+0.006
Neg logit contributions
ris-0.008
dad-0.008
aren-0.007
ure-0.007
oys-0.007
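The records above follow a fixed layout: a "Token" line, a "Token ID" line, a "Feature activation" line, then two labelled lists of logit contributions. A minimal sketch of a reader for that layout is given below. The names TokenRecord and parse_records are hypothetical and are not part of whatever tool produced this dump. The sketch assumes each label is fused directly to its value, exactly as the lines appear here, and that no contributing token is itself spelled "Token".

```python
import re
from dataclasses import dataclass, field

# Trailing signed number: fixed-point ("+0.016") or scientific ("-4.67e-01").
_NUM = re.compile(r"([+-]\d+\.\d+(?:e[+-]\d+)?)$")


@dataclass
class TokenRecord:  # hypothetical container, for illustration only
    token: str
    token_id: int = -1
    activation: float = 0.0
    pos: list = field(default_factory=list)  # (token, contribution) pairs
    neg: list = field(default_factory=list)


def parse_records(lines):
    """Group dump lines into one TokenRecord per 'Token...' header."""
    records, current, bucket = [], None, None
    for raw in lines:
        line = raw.rstrip("\n")  # keep leading spaces: they belong to the token
        if line.startswith("Token ID"):
            if current is not None:
                current.token_id = int(line[len("Token ID"):])
        elif line.startswith("Token"):
            current = TokenRecord(token=line[len("Token"):])
            records.append(current)
            bucket = None
        elif line.startswith("Feature activation"):
            if current is not None:
                current.activation = float(line[len("Feature activation"):])
        elif line == "Pos logit contributions":
            bucket = current.pos if current is not None else None
        elif line == "Neg logit contributions":
            bucket = current.neg if current is not None else None
        elif line and bucket is not None:
            match = _NUM.search(line)
            if match:
                bucket.append((line[:match.start()], float(match.group(1))))
    return records
```

Lines that precede the first "Token" header, such as the tail of a record cut off at the start of an excerpt, are skipped rather than attached to any record.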