TOP ACTIVATIONS
MAX = 0.040

of all â if she only follows the
his elephant the name " BI GG EST ". He was
gr umble â It was a cow !
, three â â Andy
One â two â
just yet â but trust me , it
in here â .' <|endoftext|> Once there was
, three â '. The numbers flew back
never expected â <|endoftext|> Once upon a time
her favorite animal , a k itty cat . But she
trash cans , and in al leys . They did not
her favorite animal , a k itty cat . But she
never seen before - a k ite !
The voice said , " cong ratulations , you can have
two â three ! â
day , Tom saw a k ite in the sky .
in the sky but the k ite was gone . His
The girl saw a k ite up in the sky
day , Lemon saw a k ite stuck in the tree
pencil s , and a k ite ! The

MIN ACTIVATIONS
MIN = -0.069

She cried and threw a tant rum . Her mom tried
his feet and threw a tant rum . "
. He kept throwing a tant rum . His mom was
listen . She threw a tant rum in the store .
sad . She threw a tant rum and cried loudly .
and so she threw a tant rum when her parents said
her feet and threw a tant rum . She grabbed the
to cry and threw a tant rum . She didn 't
kept crying and throwing a tant rum . Finally
surprise . He threw a tant rum . He kicked and
and started to throw a tant rum . Suddenly , she
temper and often threw temper tant rums . One day ,
them and never threw a tant rum again . <|endoftext|> Once
. When D ott ie arrived at the lab
stubborn . She throws a tant rum . She kicks and
One day , when Br idget was walking in the me
the elderly man on a stret cher . She thanked Tim
They put Lily on a stret cher and carried her to
The new light bul b was much bigger and
shouted . He threw a snowball at the dog . The

INTERVAL 0.013 - 0.040
CONTAINS 3.611%

on the soup to make it cooler . They tasted the
Its bus ier days were yet to come !
sun was shining and the sky was blue .
They invited them to play again . Anna and
at the pe b ble and smiled . He said ,

INTERVAL -0.014 - 0.013
CONTAINS 93.307%

" Wow !" said Lucy . M
little girl named Ruby . She had a red bike with
so proud of himself for running so quickly around the pit
her knee . She cried and cried until her mom my
So , Tim my and Billy hid behind the

INTERVAL -0.041 - -0.014
CONTAINS 3.081%

He calls it an x - ray . He says it
the mail man put a new nut in the mail just
. He brought along a stuffed animal and some snacks .
Jack tried to pull it out , but it was too
was a kind man who loved to show her his special

INTERVAL -0.069 - -0.041
CONTAINS 0.001%

arm chair . THE <|endoftext|> Once upon a time
them and never threw a tant rum again . <|endoftext|> Once
face . THE <|endoftext|> Once upon a time
. They were blue and spe ck led . L ila
. Despite wearing a pink cast , Sally never went back
Token of
Token ID286
Feature activation-7.46e-03

Pos logit contributions
 be+0.009
 have+0.009
 do+0.009
 find+0.008
 see+0.007
Neg logit contributions
ier-0.014
iest-0.014
rito-0.014
us-0.014
led-0.014
Token all
Token ID477
Feature activation-1.70e-03

Pos logit contributions
iest+0.016
rito+0.016
ier+0.016
rol+0.016
rob+0.016
Neg logit contributions
 be-0.010
 have-0.009
 do-0.009
 find-0.009
 see-0.007
Tokenâ
Token ID22940
Feature activation+8.77e-03

Pos logit contributions
ier+0.004
rito+0.004
iest+0.004
rol+0.004
led+0.004
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 see-0.002
Token
Token ID26391
Feature activation-2.67e-03

Pos logit contributions
 be+0.021
 have+0.020
 do+0.020
 find+0.019
 I+0.018
Neg logit contributions
rito-0.012
rob-0.012
rol-0.012
ier-0.012
us-0.012
Token
Token ID126
Feature activation+1.58e-02

Pos logit contributions
us+0.004
ridge+0.004
rob+0.004
rol+0.004
at+0.004
Neg logit contributions
 be-0.006
 do-0.005
 have-0.005
 find-0.005
 go-0.005
Token
Token ID99
Feature activation+3.98e-02

Pos logit contributions
 be+0.032
 have+0.031
 do+0.031
 find+0.029
 I+0.027
Neg logit contributions
at-0.026
rito-0.026
rob-0.026
us-0.026
ridge-0.025
Tokenif
Token ID361
Feature activation+3.32e-03

Pos logit contributions
 be+0.090
 have+0.089
 do+0.087
 find+0.083
 go+0.077
Neg logit contributions
rol-0.059
us-0.058
rito-0.058
ier-0.058
at-0.057
Token she
Token ID673
Feature activation-8.19e-03

Pos logit contributions
 be+0.007
 have+0.006
 find+0.006
 do+0.006
 see+0.006
Neg logit contributions
us-0.006
ier-0.006
rol-0.006
rito-0.006
iest-0.006
Token only
Token ID691
Feature activation-7.04e-03

Pos logit contributions
led+0.019
ier+0.019
rito+0.019
at+0.019
iest+0.019
Neg logit contributions
 be-0.010
 have-0.009
 do-0.009
 find-0.009
 see-0.007
Token follows
Token ID5679
Feature activation+3.89e-03

Pos logit contributions
led+0.017
ier+0.017
rito+0.017
iest+0.017
at+0.016
Neg logit contributions
 be-0.008
 do-0.007
 have-0.007
 find-0.007
 mom-0.006
Token the
Token ID262
Feature activation+3.35e-04

Pos logit contributions
 be+0.005
 have+0.005
 do+0.005
 find+0.004
 see+0.004
Neg logit contributions
us-0.009
rito-0.009
rily-0.009
rol-0.009
ier-0.009
Token his
Token ID465
Feature activation-1.17e-03

Pos logit contributions
led+0.004
rol+0.004
rito+0.004
rob+0.004
us+0.003
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.002
 I-0.002
Token elephant
Token ID20950
Feature activation+1.08e-02

Pos logit contributions
rito+0.003
ridge+0.002
us+0.002
rob+0.002
at+0.002
Neg logit contributions
 have-0.002
 be-0.001
 do-0.001
 find-0.001
 I-0.001
Token the
Token ID262
Feature activation+1.27e-03

Pos logit contributions
 have+0.015
 be+0.015
 do+0.014
 find+0.014
 I+0.013
Neg logit contributions
led-0.023
ier-0.023
cream-0.022
rol-0.022
at-0.022
Token name
Token ID1438
Feature activation+1.47e-03

Pos logit contributions
 have+0.002
 be+0.002
 do+0.002
 find+0.002
 see+0.002
Neg logit contributions
rob-0.003
rito-0.003
ridge-0.002
iest-0.002
at-0.002
Token "
Token ID366
Feature activation-9.31e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 see+0.001
Neg logit contributions
rob-0.004
rito-0.004
ier-0.004
rol-0.004
led-0.004
TokenBI
Token ID3483
Feature activation+3.91e-02

Pos logit contributions
rito+0.014
ier+0.014
rob+0.014
led+0.013
us+0.013
Neg logit contributions
 be-0.021
 have-0.020
 do-0.020
 find-0.019
 I-0.018
TokenGG
Token ID11190
Feature activation+2.02e-02

Pos logit contributions
 be+0.083
 have+0.082
 do+0.080
 find+0.076
 I+0.069
Neg logit contributions
rito-0.062
rob-0.062
rol-0.061
ier-0.061
us-0.061
TokenEST
Token ID6465
Feature activation-6.11e-03

Pos logit contributions
 be+0.041
 have+0.040
 do+0.039
 find+0.037
 I+0.034
Neg logit contributions
rito-0.034
rob-0.034
rol-0.033
ier-0.033
us-0.033
Token".
Token ID1911
Feature activation+1.95e-03

Pos logit contributions
rito+0.011
rob+0.011
us+0.011
rol+0.011
ier+0.011
Neg logit contributions
 be-0.011
 have-0.011
 do-0.011
 find-0.010
 see-0.009
Token He
Token ID679
Feature activation-2.10e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 I+0.002
Neg logit contributions
rol-0.004
rito-0.004
rob-0.004
us-0.004
led-0.004
Token was
Token ID373
Feature activation-8.18e-04

Pos logit contributions
led+0.005
rob+0.005
rito+0.005
rol+0.005
us+0.004
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 I-0.002
Token gr
Token ID1036
Feature activation+8.23e-03

Pos logit contributions
 have+0.010
 be+0.010
 do+0.010
 find+0.009
 I+0.009
Neg logit contributions
rito-0.017
ier-0.017
led-0.017
at-0.017
us-0.016
Tokenumble
Token ID10344
Feature activation-4.34e-03

Pos logit contributions
 be+0.020
 have+0.020
 do+0.020
 find+0.019
 go+0.017
Neg logit contributions
rob-0.010
rito-0.010
ridge-0.010
rol-0.010
iest-0.010
Tokenâ
Token ID22940
Feature activation+1.73e-02

Pos logit contributions
rito+0.011
ier+0.011
led+0.011
us+0.011
at+0.011
Neg logit contributions
 be-0.005
 have-0.005
 do-0.005
 find-0.004
 I-0.004
Token
Token ID26391
Feature activation-8.84e-04

Pos logit contributions
 be+0.041
 have+0.040
 do+0.039
 find+0.037
 I+0.035
Neg logit contributions
rito-0.024
rob-0.023
ier-0.023
at-0.023
rol-0.023
Token
Token ID126
Feature activation+1.40e-02

Pos logit contributions
us+0.001
ridge+0.001
at+0.001
rito+0.001
led+0.001
Neg logit contributions
 be-0.002
 do-0.002
 have-0.002
 find-0.002
 go-0.002
Token
Token ID99
Feature activation+3.85e-02

Pos logit contributions
 be+0.029
 have+0.029
 do+0.028
 find+0.026
 I+0.025
Neg logit contributions
rito-0.023
at-0.023
us-0.022
rob-0.022
ridge-0.022
Token It
Token ID632
Feature activation+8.48e-04

Pos logit contributions
 be+0.083
 have+0.083
 do+0.080
 find+0.075
 go+0.070
Neg logit contributions
rito-0.060
at-0.059
us-0.059
rol-0.058
ier-0.058
Token was
Token ID373
Feature activation+3.79e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 see+0.001
Neg logit contributions
rito-0.002
ier-0.002
led-0.002
rol-0.002
at-0.002
Token a
Token ID257
Feature activation+6.52e-03

Pos logit contributions
 be+0.005
 have+0.005
 do+0.005
 find+0.004
 go+0.004
Neg logit contributions
rob-0.010
rol-0.009
rito-0.009
led-0.009
us-0.009
Token cow
Token ID9875
Feature activation+1.31e-02

Pos logit contributions
 be+0.010
 have+0.010
 do+0.009
 find+0.009
 see+0.008
Neg logit contributions
rito-0.014
us-0.014
ier-0.013
iest-0.013
rob-0.013
Token!
Token ID0
Feature activation+5.53e-03

Pos logit contributions
 be+0.016
 have+0.015
 do+0.014
 find+0.013
 see+0.012
Neg logit contributions
us-0.032
rito-0.032
ier-0.032
ridge-0.031
at-0.031
Token,
Token ID11
Feature activation+4.75e-03

Pos logit contributions
 find+0.002
 be+0.002
 have+0.002
 do+0.002
 learn+0.002
Neg logit contributions
iest-0.006
ier-0.005
rol-0.005
 kernels-0.005
at-0.005
Token three
Token ID1115
Feature activation-1.04e-02

Pos logit contributions
 be+0.006
 have+0.006
 do+0.005
 find+0.005
 sad+0.005
Neg logit contributions
ier-0.009
 kernels-0.009
iest-0.009
ridge-0.009
rol-0.009
Tokenâ
Token ID22940
Feature activation+1.41e-02

Pos logit contributions
iest+0.024
ier+0.024
led+0.023
us+0.023
 kernels+0.023
Neg logit contributions
 find-0.013
 have-0.013
 be-0.013
 do-0.013
 draw-0.011
Token
Token ID26391
Feature activation-1.94e-03

Pos logit contributions
 be+0.035
 have+0.034
 do+0.034
 find+0.032
 see+0.030
Neg logit contributions
rito-0.018
at-0.018
rob-0.018
ier-0.017
rol-0.017
Token
Token ID126
Feature activation+2.12e-02

Pos logit contributions
ridge+0.003
at+0.003
us+0.003
rob+0.003
rol+0.003
Neg logit contributions
 be-0.004
 do-0.004
 have-0.003
 find-0.003
 end-0.003
Token
Token ID99
Feature activation+3.83e-02

Pos logit contributions
 be+0.042
 have+0.040
 do+0.039
 find+0.038
 I+0.035
Neg logit contributions
at-0.037
rob-0.036
rito-0.036
us-0.035
ridge-0.035
Tokenâ
Token ID22940
Feature activation+1.27e-02

Pos logit contributions
 be+0.082
 have+0.081
 do+0.078
 find+0.076
 go+0.069
Neg logit contributions
at-0.061
rol-0.060
us-0.060
rito-0.059
rob-0.059
Token
Token ID26391
Feature activation-3.08e-03

Pos logit contributions
 be+0.031
 have+0.030
 do+0.029
 find+0.028
 I+0.026
Neg logit contributions
rito-0.017
rob-0.016
rol-0.016
ier-0.016
at-0.016
Token
Token ID198
Feature activation+2.14e-04

Pos logit contributions
us+0.005
rob+0.005
ridge+0.005
at+0.005
rol+0.005
Neg logit contributions
 be-0.006
 do-0.006
 have-0.006
 find-0.006
 end-0.005
Token
Token ID198
Feature activation+2.75e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 go+0.000
Neg logit contributions
us-0.001
rito-0.001
rol-0.001
rob-0.001
at-0.001
TokenAndy
Token ID35314
Feature activation-7.24e-03

Pos logit contributions
 be+0.005
 have+0.005
 do+0.005
 find+0.005
 I+0.004
Neg logit contributions
rito-0.005
rob-0.005
ier-0.005
rol-0.005
led-0.005
Token
Token ID241
Feature activation+8.62e-03

Pos logit contributions
rito+0.003
at+0.003
rob+0.003
rol+0.003
us+0.003
Neg logit contributions
 be-0.005
 have-0.005
 do-0.004
 find-0.004
 I-0.004
TokenOne
Token ID3198
Feature activation+1.62e-02

Pos logit contributions
 be+0.016
 have+0.015
 do+0.015
 find+0.014
 I+0.013
Neg logit contributions
rito-0.016
rob-0.016
rol-0.016
ier-0.016
us-0.016
Tokenâ
Token ID22940
Feature activation+1.05e-02

Pos logit contributions
 have+0.021
 be+0.020
 find+0.020
 do+0.019
 see+0.017
Neg logit contributions
iest-0.037
ier-0.036
at-0.036
rito-0.034
us-0.033
Token
Token ID26391
Feature activation-2.67e-03

Pos logit contributions
 be+0.025
 have+0.024
 do+0.024
 find+0.023
 I+0.021
Neg logit contributions
rito-0.014
rob-0.014
at-0.014
ier-0.014
rol-0.014
Token
Token ID126
Feature activation+1.77e-02

Pos logit contributions
at+0.005
ridge+0.005
rob+0.005
us+0.005
rol+0.005
Neg logit contributions
 be-0.005
 do-0.005
 have-0.005
 find-0.004
 go-0.004
Token
Token ID99
Feature activation+3.82e-02

Pos logit contributions
 be+0.036
 have+0.035
 do+0.034
 find+0.032
 I+0.029
Neg logit contributions
at-0.030
rito-0.030
rob-0.029
ridge-0.029
us-0.029
Token two
Token ID734
Feature activation-3.49e-03

Pos logit contributions
 be+0.072
 have+0.070
 do+0.068
 find+0.065
 go+0.060
Neg logit contributions
at-0.069
rito-0.068
ier-0.068
rob-0.068
us-0.067
Tokenâ
Token ID22940
Feature activation+8.99e-03

Pos logit contributions
ier+0.008
iest+0.008
rol+0.007
led+0.007
us+0.007
Neg logit contributions
 be-0.005
 have-0.004
 find-0.004
 do-0.004
 see-0.004
Token
Token ID26391
Feature activation-2.00e-03

Pos logit contributions
 be+0.021
 have+0.021
 do+0.021
 find+0.020
 I+0.018
Neg logit contributions
rito-0.012
rob-0.012
rol-0.012
ier-0.012
us-0.012
Token
Token ID126
Feature activation+1.45e-02

Pos logit contributions
rob+0.003
ridge+0.003
at+0.003
us+0.003
rol+0.003
Neg logit contributions
 be-0.004
 do-0.004
 have-0.004
 find-0.003
 go-0.003
Token
Token ID99
Feature activation+3.66e-02

Pos logit contributions
 be+0.029
 have+0.029
 do+0.028
 find+0.026
 I+0.024
Neg logit contributions
rito-0.025
at-0.025
rob-0.024
us-0.024
ier-0.023
Token just
Token ID655
Feature activation-1.12e-03

Pos logit contributions
 be+0.010
 have+0.010
 do+0.009
 find+0.009
 see+0.007
Neg logit contributions
ier-0.018
iest-0.018
rol-0.018
rito-0.018
us-0.018
Token yet
Token ID1865
Feature activation+1.49e-02

Pos logit contributions
ier+0.003
iest+0.003
at+0.003
rito+0.003
led+0.003
Neg logit contributions
 be-0.001
 do-0.001
 have-0.001
 mom-0.001
 find-0.001
Tokenâ
Token ID22940
Feature activation+1.19e-02

Pos logit contributions
 have+0.018
 be+0.018
 do+0.017
 find+0.016
 see+0.014
Neg logit contributions
ier-0.034
iest-0.034
us-0.033
rito-0.033
rol-0.033
Token
Token ID26391
Feature activation-5.45e-03

Pos logit contributions
 be+0.029
 have+0.029
 do+0.028
 find+0.027
 I+0.025
Neg logit contributions
rito-0.015
rob-0.015
ier-0.015
rol-0.015
at-0.015
Token
Token ID126
Feature activation+1.73e-02

Pos logit contributions
rob+0.009
us+0.009
rol+0.009
at+0.009
ridge+0.009
Neg logit contributions
 be-0.011
 have-0.010
 do-0.010
 find-0.010
 go-0.009
Token
Token ID99
Feature activation+3.78e-02

Pos logit contributions
 be+0.033
 have+0.032
 do+0.031
 find+0.030
 I+0.027
Neg logit contributions
rito-0.031
rob-0.031
at-0.031
us-0.030
rol-0.029
Tokenbut
Token ID4360
Feature activation-1.01e-03

Pos logit contributions
 be+0.084
 have+0.083
 do+0.080
 find+0.077
 go+0.070
Neg logit contributions
rito-0.057
rob-0.057
us-0.056
rol-0.056
at-0.056
Token trust
Token ID3774
Feature activation+2.49e-03

Pos logit contributions
iest+0.003
ier+0.003
rito+0.003
us+0.003
rol+0.002
Neg logit contributions
 be-0.001
 have-0.001
 find-0.001
 do-0.001
 frame-0.001
Token me
Token ID502
Feature activation+1.00e-02

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 see+0.002
Neg logit contributions
us-0.006
rito-0.006
ier-0.006
iest-0.006
rob-0.006
Token,
Token ID11
Feature activation+6.61e-03

Pos logit contributions
 be+0.013
 have+0.013
 do+0.012
 find+0.011
 see+0.010
Neg logit contributions
rito-0.024
ier-0.023
us-0.023
at-0.023
rol-0.023
Token it
Token ID340
Feature activation+6.09e-03

Pos logit contributions
 be+0.008
 have+0.008
 do+0.007
 find+0.007
 learn+0.006
Neg logit contributions
rol-0.015
rito-0.015
ier-0.015
us-0.015
iest-0.015
Token in
Token ID287
Feature activation+7.79e-03

Pos logit contributions
us+0.006
ier+0.006
rito+0.006
iest+0.006
led+0.006
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.003
 see-0.003
Token here
Token ID994
Feature activation+1.30e-03

Pos logit contributions
 be+0.010
 do+0.010
 have+0.009
 find+0.008
 learn+0.007
Neg logit contributions
iest-0.017
rily-0.017
rito-0.017
 kernels-0.016
us-0.016
Tokenâ
Token ID22940
Feature activation+1.22e-02

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 see+0.002
Neg logit contributions
rily-0.003
ier-0.003
rol-0.003
iest-0.003
rito-0.003
Token
Token ID26391
Feature activation-4.49e-03

Pos logit contributions
 be+0.028
 have+0.028
 do+0.027
 find+0.026
 I+0.024
Neg logit contributions
rito-0.017
at-0.017
rob-0.017
ier-0.016
rol-0.016
Token
Token ID126
Feature activation+1.43e-02

Pos logit contributions
ridge+0.007
at+0.007
us+0.007
rito+0.007
side+0.007
Neg logit contributions
 be-0.009
 do-0.009
 have-0.008
 find-0.008
 go-0.007
Token
Token ID99
Feature activation+3.76e-02

Pos logit contributions
 be+0.028
 have+0.027
 do+0.026
 find+0.025
 I+0.023
Neg logit contributions
rito-0.025
at-0.025
ridge-0.024
us-0.024
rob-0.024
Token.'
Token ID2637
Feature activation+5.91e-03

Pos logit contributions
 be+0.081
 have+0.080
 do+0.077
 find+0.073
 go+0.068
Neg logit contributions
rito-0.059
at-0.058
rol-0.057
us-0.057
ridge-0.057
Token<|endoftext|>
Token ID50256
Feature activation-7.28e-03

Pos logit contributions
 be+0.008
 do+0.007
 have+0.007
 find+0.006
 go+0.006
Neg logit contributions
us-0.014
rol-0.014
rito-0.014
rob-0.014
ridge-0.014
TokenOnce
Token ID7454
Feature activation+7.41e-03

Pos logit contributions
rito+0.012
ier+0.012
rob+0.011
rol+0.011
led+0.011
Neg logit contributions
 be-0.015
 have-0.015
 do-0.015
 find-0.014
 I-0.013
Token there
Token ID612
Feature activation-1.46e-03

Pos logit contributions
 be+0.015
 have+0.015
 do+0.015
 find+0.014
 go+0.013
Neg logit contributions
us-0.012
ier-0.012
iest-0.012
rol-0.012
led-0.011
Token was
Token ID373
Feature activation-6.46e-03

Pos logit contributions
rol+0.003
iest+0.003
led+0.003
rob+0.003
rito+0.003
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 learn-0.001
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
iest+0.004
ier+0.004
 kernels+0.004
rol+0.004
us+0.004
Neg logit contributions
 have-0.003
 be-0.003
 find-0.002
 do-0.002
 see-0.002
Token three
Token ID1115
Feature activation-7.38e-03

Pos logit contributions
 be+0.019
 have+0.019
 do+0.018
 find+0.017
 play+0.015
Neg logit contributions
ier-0.029
 kernels-0.029
ridge-0.029
rily-0.029
rito-0.029
Tokenâ
Token ID22940
Feature activation+1.33e-02

Pos logit contributions
iest+0.016
ier+0.016
us+0.016
led+0.016
 kernels+0.015
Neg logit contributions
 have-0.010
 be-0.010
 find-0.010
 do-0.010
 draw-0.009
Token
Token ID26391
Feature activation+2.32e-03

Pos logit contributions
 be+0.032
 have+0.032
 do+0.031
 find+0.030
 see+0.028
Neg logit contributions
rito-0.017
at-0.017
rob-0.017
ier-0.017
rol-0.016
Token
Token ID126
Feature activation+2.05e-02

Pos logit contributions
 be+0.004
 do+0.004
 have+0.004
 find+0.004
 mean+0.004
Neg logit contributions
ridge-0.004
at-0.004
us-0.004
rob-0.004
rol-0.004
Token
Token ID99
Feature activation+3.75e-02

Pos logit contributions
 be+0.040
 have+0.038
 do+0.038
 find+0.036
 I+0.033
Neg logit contributions
at-0.036
rito-0.036
us-0.035
rob-0.035
ridge-0.035
Token'.
Token ID4458
Feature activation+7.56e-03

Pos logit contributions
 be+0.077
 have+0.076
 do+0.074
 find+0.071
 go+0.064
Neg logit contributions
ier-0.062
rito-0.061
us-0.061
at-0.060
rol-0.060
Token The
Token ID383
Feature activation+6.11e-03

Pos logit contributions
 be+0.011
 do+0.010
 have+0.010
 find+0.010
 go+0.008
Neg logit contributions
rol-0.017
us-0.017
rito-0.017
rob-0.016
ridge-0.016
Token numbers
Token ID3146
Feature activation-2.07e-03

Pos logit contributions
 have+0.009
 be+0.009
 find+0.008
 do+0.008
 see+0.007
Neg logit contributions
iest-0.013
ier-0.013
 kernels-0.012
rob-0.012
at-0.012
Token flew
Token ID13112
Feature activation+3.20e-04

Pos logit contributions
rito+0.004
rob+0.004
rol+0.004
us+0.004
ier+0.004
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 I-0.003
Token back
Token ID736
Feature activation-7.25e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 go+0.001
Neg logit contributions
rob-0.001
rito-0.001
at-0.001
ridge-0.001
us-0.001
Token never
Token ID1239
Feature activation-1.70e-03

Pos logit contributions
led+0.008
rol+0.008
ier+0.008
rito+0.008
us+0.008
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.004
 I-0.003
Token expected
Token ID2938
Feature activation+1.80e-02

Pos logit contributions
rito+0.003
ier+0.003
rol+0.003
rob+0.003
led+0.003
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 I-0.003
Tokenâ
Token ID22940
Feature activation+6.86e-03

Pos logit contributions
 be+0.019
 have+0.018
 do+0.018
 find+0.016
 see+0.013
Neg logit contributions
ier-0.047
iest-0.046
rol-0.046
us-0.046
rito-0.046
Token
Token ID26391
Feature activation-4.90e-03

Pos logit contributions
 be+0.016
 have+0.016
 do+0.015
 find+0.015
 I+0.013
Neg logit contributions
rito-0.010
rob-0.010
rol-0.010
ier-0.010
us-0.010
Token
Token ID126
Feature activation+1.25e-02

Pos logit contributions
us+0.007
rol+0.007
at+0.007
ridge+0.007
rob+0.006
Neg logit contributions
 be-0.011
 do-0.011
 have-0.011
 find-0.010
 go-0.009
Token
Token ID99
Feature activation+3.74e-02

Pos logit contributions
 be+0.024
 have+0.023
 do+0.022
 find+0.021
 I+0.020
Neg logit contributions
rito-0.022
at-0.022
us-0.022
rob-0.022
ridge-0.021
Token<|endoftext|>
Token ID50256
Feature activation-4.41e-03

Pos logit contributions
 be+0.083
 have+0.083
 do+0.079
 find+0.076
 go+0.070
Neg logit contributions
rol-0.056
rito-0.056
at-0.056
us-0.055
ier-0.054
TokenOnce
Token ID7454
Feature activation+1.35e-02

Pos logit contributions
rito+0.007
ier+0.007
rol+0.007
led+0.007
rob+0.007
Neg logit contributions
 have-0.009
 be-0.009
 do-0.009
 find-0.008
 I-0.008
Token upon
Token ID2402
Feature activation+1.06e-02

Pos logit contributions
 be+0.028
 have+0.027
 do+0.027
 find+0.025
 I+0.023
Neg logit contributions
us-0.021
ier-0.020
iest-0.020
rol-0.020
led-0.020
Token a
Token ID257
Feature activation+3.17e-04

Pos logit contributions
 be+0.012
 have+0.011
 do+0.010
 find+0.010
 I+0.008
Neg logit contributions
rito-0.026
us-0.026
led-0.026
rob-0.026
ier-0.026
Token time
Token ID640
Feature activation+1.44e-02

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 I+0.000
Neg logit contributions
rito-0.001
at-0.001
rob-0.001
us-0.001
led-0.001
Token her
Token ID607
Feature activation+3.82e-03

Pos logit contributions
ridge+0.008
rob+0.008
med+0.007
 kernels+0.007
us+0.007
Neg logit contributions
 have-0.006
 be-0.006
 do-0.006
 want-0.005
 find-0.005
Token favorite
Token ID4004
Feature activation-1.55e-03

Pos logit contributions
 have+0.005
 be+0.004
 do+0.004
 find+0.004
 see+0.004
Neg logit contributions
ridge-0.008
iest-0.008
rob-0.008
led-0.008
at-0.008
Token animal
Token ID5044
Feature activation+8.87e-03

Pos logit contributions
iest+0.003
rob+0.003
at+0.003
ridge+0.003
rito+0.003
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 see-0.002
Token,
Token ID11
Feature activation-3.97e-04

Pos logit contributions
 be+0.010
 have+0.010
 find+0.009
 do+0.009
 see+0.009
Neg logit contributions
iest-0.020
ier-0.020
ridge-0.020
at-0.019
led-0.019
Token a
Token ID257
Feature activation+1.27e-02

Pos logit contributions
rito+0.001
rol+0.001
rob+0.001
ridge+0.001
cream+0.001
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 learn-0.000
Token k
Token ID479
Feature activation+3.74e-02

Pos logit contributions
 have+0.017
 be+0.016
 find+0.016
 do+0.015
 learn+0.014
Neg logit contributions
rito-0.026
iest-0.025
us-0.025
ridge-0.025
rob-0.025
Tokenitty
Token ID9760
Feature activation+1.84e-02

Pos logit contributions
 be+0.091
 do+0.087
 find+0.086
 have+0.086
 go+0.081
Neg logit contributions
rob-0.053
us-0.051
rol-0.048
rito-0.047
ted-0.046
Token cat
Token ID3797
Feature activation+1.85e-02

Pos logit contributions
 be+0.020
 have+0.020
 do+0.019
 find+0.019
 learn+0.018
Neg logit contributions
rito-0.042
rob-0.042
ridge-0.041
ted-0.041
ier-0.041
Token.
Token ID13
Feature activation+5.96e-03

Pos logit contributions
 be+0.021
 have+0.019
 do+0.019
 find+0.018
 learn+0.016
Neg logit contributions
rob-0.043
ridge-0.043
rito-0.042
ted-0.042
at-0.042
Token But
Token ID887
Feature activation+5.88e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.007
 find+0.006
 go+0.005
Neg logit contributions
rito-0.014
rol-0.014
rob-0.014
us-0.014
led-0.014
Token she
Token ID673
Feature activation+4.81e-03

Pos logit contributions
 have+0.007
 be+0.007
 do+0.007
 find+0.007
 see+0.006
Neg logit contributions
led-0.014
rito-0.013
ier-0.013
ridge-0.013
rol-0.013
Token trash
Token ID13913
Feature activation-1.67e-03

Pos logit contributions
 do+0.021
 be+0.021
 have+0.021
 find+0.018
 I+0.017
Neg logit contributions
rito-0.036
iest-0.035
rob-0.035
 kernels-0.035
rol-0.035
Token cans
Token ID23916
Feature activation+1.72e-03

Pos logit contributions
rito+0.005
rol+0.005
at+0.005
rob+0.005
us+0.005
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token,
Token ID11
Feature activation-9.19e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.001
 find+0.001
 learn+0.001
Neg logit contributions
ier-0.005
rol-0.005
led-0.005
us-0.005
rito-0.005
Token and
Token ID290
Feature activation+1.01e-04

Pos logit contributions
rito+0.022
rob+0.021
led+0.021
ier+0.021
rol+0.021
Neg logit contributions
 be-0.012
 have-0.012
 do-0.011
 find-0.011
 I-0.009
Token in
Token ID287
Feature activation+1.49e-02

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 I+0.000
Neg logit contributions
led-0.000
 kernels-0.000
ier-0.000
rito-0.000
rob-0.000
Token al
Token ID435
Feature activation+3.69e-02

Pos logit contributions
 be+0.020
 do+0.019
 have+0.019
 find+0.018
 I+0.017
Neg logit contributions
rito-0.031
iest-0.031
 kernels-0.031
ier-0.030
rol-0.030
Tokenleys
Token ID21325
Feature activation-5.26e-03

Pos logit contributions
 be+0.094
 have+0.093
 do+0.091
 find+0.087
 see+0.081
Neg logit contributions
rito-0.043
rob-0.042
ier-0.042
rol-0.041
iest-0.041
Token.
Token ID13
Feature activation+4.88e-03

Pos logit contributions
ier+0.015
rito+0.014
rol+0.014
us+0.014
rob+0.014
Neg logit contributions
 have-0.005
 be-0.005
 do-0.004
 find-0.004
 I-0.003
Token They
Token ID1119
Feature activation+4.67e-03

Pos logit contributions
 be+0.006
 have+0.006
 do+0.005
 find+0.005
 go+0.004
Neg logit contributions
rito-0.012
rol-0.012
us-0.012
led-0.012
ier-0.012
Token did
Token ID750
Feature activation-2.61e-04

Pos logit contributions
 be+0.006
 have+0.006
 do+0.006
 find+0.005
 I+0.005
Neg logit contributions
rito-0.011
led-0.011
rol-0.011
ier-0.011
rob-0.011
Token not
Token ID407
Feature activation+1.19e-02

Pos logit contributions
ier+0.001
led+0.001
rito+0.001
 kernels+0.001
rily+0.001
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 I-0.000
 find-0.000
Token her
Token ID607
Feature activation+5.04e-04

Pos logit contributions
ridge+0.015
rob+0.015
us+0.014
 kernels+0.014
med+0.014
Neg logit contributions
 have-0.010
 be-0.010
 do-0.010
 find-0.009
 want-0.009
Token favorite
Token ID4004
Feature activation-2.57e-03

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 find+0.000
 see+0.000
Neg logit contributions
ridge-0.001
iest-0.001
rob-0.001
at-0.001
us-0.001
Token animal
Token ID5044
Feature activation+9.75e-03

Pos logit contributions
iest+0.006
rob+0.005
at+0.005
ridge+0.005
rito+0.005
Neg logit contributions
 be-0.004
 have-0.004
 do-0.003
 find-0.003
 see-0.003
Token,
Token ID11
Feature activation-3.45e-03

Pos logit contributions
 be+0.011
 have+0.011
 find+0.010
 do+0.010
 see+0.009
Neg logit contributions
iest-0.023
ier-0.023
ridge-0.023
at-0.022
led-0.022
Token a
Token ID257
Feature activation+1.23e-02

Pos logit contributions
rito+0.008
rol+0.008
ridge+0.008
rob+0.007
rily+0.007
Neg logit contributions
 be-0.005
 have-0.004
 do-0.004
 find-0.004
 see-0.003
Token k
Token ID479
Feature activation+3.69e-02

Pos logit contributions
 have+0.017
 be+0.016
 find+0.016
 do+0.016
 learn+0.014
Neg logit contributions
rito-0.024
us-0.024
iest-0.024
ridge-0.024
rob-0.023
Tokenitty
Token ID9760
Feature activation+1.75e-02

Pos logit contributions
 be+0.094
 do+0.090
 have+0.089
 find+0.089
 go+0.085
Neg logit contributions
rob-0.050
us-0.050
rol-0.046
rito-0.044
at-0.043
Token cat
Token ID3797
Feature activation+1.89e-02

Pos logit contributions
 be+0.019
 have+0.018
 do+0.017
 find+0.017
 learn+0.015
Neg logit contributions
rito-0.042
ridge-0.041
ier-0.041
rob-0.041
iest-0.041
Token.
Token ID13
Feature activation-1.20e-03

Pos logit contributions
 be+0.020
 have+0.019
 do+0.018
 find+0.017
 learn+0.015
Neg logit contributions
ridge-0.046
rob-0.046
rito-0.046
at-0.045
ted-0.045
Token But
Token ID887
Feature activation+1.16e-03

Pos logit contributions
rito+0.003
rol+0.003
us+0.003
ridge+0.003
rob+0.003
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 go-0.001
Token she
Token ID673
Feature activation+5.13e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 see+0.001
Neg logit contributions
rito-0.003
led-0.003
ier-0.003
ridge-0.003
rol-0.003
Token never
Token ID1239
Feature activation-2.52e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.006
 find+0.006
 see+0.005
Neg logit contributions
rob-0.013
rito-0.013
rol-0.013
rily-0.013
us-0.013
Token seen
Token ID1775
Feature activation-3.45e-04

Pos logit contributions
rito+0.005
rob+0.005
ier+0.005
rol+0.005
us+0.004
Neg logit contributions
 be-0.005
 have-0.005
 do-0.005
 find-0.004
 I-0.004
Token before
Token ID878
Feature activation+4.88e-04

Pos logit contributions
rob+0.001
rito+0.001
rol+0.001
led+0.001
us+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token -
Token ID532
Feature activation+3.51e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 see+0.000
Neg logit contributions
ier-0.001
rito-0.001
us-0.001
rol-0.001
led-0.001
Token a
Token ID257
Feature activation+1.76e-02

Pos logit contributions
 be+0.005
 have+0.005
 do+0.005
 find+0.004
 learn+0.004
Neg logit contributions
iest-0.007
 kernels-0.007
ier-0.007
cream-0.007
rol-0.007
Token k
Token ID479
Feature activation+3.69e-02

Pos logit contributions
 be+0.025
 have+0.025
 do+0.023
 find+0.023
 learn+0.021
Neg logit contributions
rito-0.037
iest-0.036
 kernels-0.035
ier-0.035
us-0.034
Tokenite
Token ID578
Feature activation+1.83e-02

Pos logit contributions
 be+0.086
 do+0.081
 have+0.081
 find+0.078
 learn+0.073
Neg logit contributions
rob-0.056
us-0.056
rol-0.055
rito-0.055
iest-0.055
Token!
Token ID0
Feature activation+6.69e-03

Pos logit contributions
 have+0.025
 be+0.025
 do+0.024
 find+0.023
 learn+0.021
Neg logit contributions
ier-0.040
iest-0.040
rol-0.040
at-0.039
rito-0.039
Token
Token ID220
Feature activation+8.18e-04

Pos logit contributions
 be+0.010
 do+0.010
 have+0.009
 find+0.009
 learn+0.008
Neg logit contributions
rol-0.014
us-0.014
cream-0.014
rito-0.014
rob-0.014
Token
Token ID198
Feature activation+1.36e-04

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 go+0.001
Neg logit contributions
us-0.002
rol-0.002
rito-0.002
ier-0.002
at-0.002
Token
Token ID198
Feature activation-6.82e-04

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 go+0.000
Neg logit contributions
us-0.000
rito-0.000
rol-0.000
rob-0.000
at-0.000
TokenThe
Token ID464
Feature activation+5.44e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.002
 look+0.002
Neg logit contributions
us-0.003
rol-0.002
rito-0.002
at-0.002
ridge-0.002
Token voice
Token ID3809
Feature activation-5.80e-03

Pos logit contributions
 have+0.008
 be+0.008
 do+0.007
 find+0.007
 see+0.007
Neg logit contributions
rito-0.011
iest-0.011
ier-0.011
ridge-0.011
at-0.011
Token said
Token ID531
Feature activation-6.95e-03

Pos logit contributions
led+0.013
ier+0.013
rito+0.013
rol+0.013
iest+0.012
Neg logit contributions
 be-0.009
 have-0.008
 do-0.008
 find-0.008
 I-0.007
Token,
Token ID11
Feature activation-8.08e-03

Pos logit contributions
ier+0.019
rito+0.019
led+0.018
us+0.018
rol+0.018
Neg logit contributions
 have-0.007
 be-0.007
 do-0.006
 find-0.006
 see-0.004
Token "
Token ID366
Feature activation-4.21e-03

Pos logit contributions
rito+0.019
us+0.019
ier+0.019
rol+0.019
ridge+0.019
Neg logit contributions
 be-0.010
 have-0.010
 do-0.010
 find-0.009
 see-0.008
Tokencong
Token ID36801
Feature activation+3.67e-02

Pos logit contributions
rito+0.008
ier+0.008
rob+0.008
led+0.008
us+0.008
Neg logit contributions
 be-0.007
 have-0.007
 do-0.007
 find-0.006
 I-0.006
Tokenratulations
Token ID30167
Feature activation-8.92e-03

Pos logit contributions
 have+0.078
 be+0.074
 do+0.071
 find+0.066
 see+0.063
Neg logit contributions
oured-0.059
ridge-0.058
cream-0.058
rol-0.057
at-0.057
Token,
Token ID11
Feature activation+4.32e-03

Pos logit contributions
ier+0.022
iest+0.022
us+0.021
rob+0.021
led+0.021
Neg logit contributions
 be-0.012
 have-0.011
 do-0.010
 find-0.010
 see-0.008
Token you
Token ID345
Feature activation+6.97e-03

Pos logit contributions
 be+0.005
 have+0.005
 find+0.004
 see+0.004
 do+0.004
Neg logit contributions
iest-0.009
rol-0.009
ier-0.009
us-0.009
rito-0.009
Token can
Token ID460
Feature activation+1.18e-02

Pos logit contributions
 be+0.007
 find+0.006
 have+0.006
 do+0.006
 draw+0.005
Neg logit contributions
iest-0.017
ier-0.017
rol-0.016
at-0.016
us-0.015
Token have
Token ID423
Feature activation+1.47e-02

Pos logit contributions
 sad+0.009
 face+0.008
 mom+0.007
 birthday+0.007
 network+0.007
Neg logit contributions
at-0.028
iest-0.027
ier-0.027
ridge-0.026
rol-0.026
Token
Token ID99
Feature activation+3.82e-02

Pos logit contributions
 be+0.036
 have+0.035
 do+0.034
 find+0.032
 I+0.029
Neg logit contributions
at-0.030
rito-0.030
rob-0.029
ridge-0.029
us-0.029
Token two
Token ID734
Feature activation-3.49e-03

Pos logit contributions
 be+0.072
 have+0.070
 do+0.068
 find+0.065
 go+0.060
Neg logit contributions
at-0.069
rito-0.068
ier-0.068
rob-0.068
us-0.067
Tokenâ
Token ID22940
Feature activation+8.99e-03

Pos logit contributions
ier+0.008
iest+0.008
rol+0.007
led+0.007
us+0.007
Neg logit contributions
 be-0.005
 have-0.004
 find-0.004
 do-0.004
 see-0.004
Token
Token ID26391
Feature activation-2.00e-03

Pos logit contributions
 be+0.021
 have+0.021
 do+0.021
 find+0.020
 I+0.018
Neg logit contributions
rito-0.012
rob-0.012
rol-0.012
ier-0.012
us-0.012
Token
Token ID126
Feature activation+1.45e-02

Pos logit contributions
rob+0.003
ridge+0.003
at+0.003
us+0.003
rol+0.003
Neg logit contributions
 be-0.004
 do-0.004
 have-0.004
 find-0.003
 go-0.003
Token
Token ID99
Feature activation+3.66e-02

Pos logit contributions
 be+0.029
 have+0.029
 do+0.028
 find+0.026
 I+0.024
Neg logit contributions
rito-0.025
at-0.025
rob-0.024
us-0.024
ier-0.023
Token three
Token ID1115
Feature activation-9.42e-03

Pos logit contributions
 be+0.074
 have+0.072
 do+0.070
 find+0.067
 go+0.062
Neg logit contributions
at-0.061
rol-0.061
rito-0.061
us-0.061
ier-0.061
Token!
Token ID0
Feature activation+6.65e-03

Pos logit contributions
us+0.017
led+0.017
iest+0.017
ier+0.017
at+0.017
Neg logit contributions
 be-0.017
 have-0.017
 find-0.016
 do-0.016
 see-0.014
Tokenâ
Token ID22940
Feature activation+1.15e-02

Pos logit contributions
 be+0.010
 do+0.010
 have+0.009
 find+0.009
 go+0.008
Neg logit contributions
rol-0.014
rito-0.014
us-0.014
rob-0.014
ridge-0.014
Token
Token ID26391
Feature activation+7.06e-05

Pos logit contributions
 be+0.027
 have+0.027
 do+0.026
 find+0.025
 I+0.023
Neg logit contributions
rito-0.016
rol-0.016
rob-0.015
ier-0.015
us-0.015
Token
Token ID198
Feature activation-1.72e-03

Pos logit contributions
 be+0.000
 do+0.000
 find+0.000
 have+0.000
 end+0.000
Neg logit contributions
rob-0.000
ridge-0.000
at-0.000
us-0.000
rol-0.000
Token day
Token ID1110
Feature activation+1.27e-02

Pos logit contributions
 be+0.029
 have+0.028
 find+0.028
 do+0.027
 I+0.025
Neg logit contributions
iest-0.034
at-0.033
rito-0.031
us-0.031
led-0.031
Token,
Token ID11
Feature activation+6.65e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.006
 find+0.005
 I+0.003
Neg logit contributions
rito-0.041
rob-0.041
rol-0.040
ier-0.040
at-0.040
Token Tom
Token ID4186
Feature activation-7.95e-05

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.006
 learn+0.005
Neg logit contributions
led-0.017
us-0.017
rol-0.017
rito-0.016
iest-0.016
Token saw
Token ID2497
Feature activation+2.99e-03

Pos logit contributions
at+0.000
led+0.000
us+0.000
rob+0.000
rito+0.000
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 I-0.000
Token a
Token ID257
Feature activation+7.31e-03

Pos logit contributions
 be+0.004
 have+0.003
 do+0.003
 find+0.003
 I+0.003
Neg logit contributions
rob-0.007
us-0.007
led-0.007
rito-0.007
ier-0.007
Token k
Token ID479
Feature activation+3.60e-02

Pos logit contributions
 have+0.010
 be+0.009
 find+0.009
 do+0.008
 learn+0.008
Neg logit contributions
us-0.016
rob-0.015
rito-0.015
 kernels-0.015
iest-0.015
Tokenite
Token ID578
Feature activation+2.10e-02

Pos logit contributions
 be+0.086
 have+0.082
 do+0.081
 find+0.080
 learn+0.074
Neg logit contributions
rol-0.052
us-0.051
rob-0.051
iest-0.049
ier-0.047
Token in
Token ID287
Feature activation+1.14e-02

Pos logit contributions
 have+0.030
 be+0.029
 do+0.027
 find+0.027
 see+0.026
Neg logit contributions
rol-0.043
at-0.043
led-0.042
ier-0.042
rob-0.042
Token the
Token ID262
Feature activation+2.70e-03

Pos logit contributions
 be+0.013
 have+0.012
 do+0.012
 find+0.011
 I+0.009
Neg logit contributions
rito-0.029
rob-0.029
rol-0.029
us-0.029
ier-0.029
Token sky
Token ID6766
Feature activation+2.04e-02

Pos logit contributions
 do+0.004
 have+0.004
 be+0.004
 want+0.004
 find+0.003
Neg logit contributions
rob-0.005
side-0.005
at-0.005
covered-0.005
rol-0.005
Token.
Token ID13
Feature activation-3.52e-04

Pos logit contributions
 have+0.021
 be+0.020
 do+0.019
 find+0.018
 I+0.015
Neg logit contributions
at-0.053
rol-0.052
ier-0.052
led-0.052
rito-0.052
Token in
Token ID287
Feature activation+8.05e-03

Pos logit contributions
rob+0.028
rito+0.027
rol+0.027
at+0.026
ridge+0.026
Neg logit contributions
 be-0.010
 have-0.009
 do-0.009
 find-0.008
 see-0.007
Token the
Token ID262
Feature activation+6.22e-03

Pos logit contributions
 be+0.012
 have+0.011
 do+0.011
 find+0.010
 I+0.009
Neg logit contributions
rito-0.018
rol-0.018
rob-0.018
us-0.017
iest-0.017
Token sky
Token ID6766
Feature activation+1.84e-02

Pos logit contributions
 be+0.011
 have+0.011
 do+0.010
 find+0.010
 I+0.009
Neg logit contributions
rito-0.012
rol-0.012
rob-0.012
at-0.012
us-0.012
Token but
Token ID475
Feature activation+1.56e-03

Pos logit contributions
 be+0.018
 have+0.018
 do+0.016
 find+0.015
 I+0.012
Neg logit contributions
rito-0.050
ier-0.050
rob-0.049
rol-0.049
led-0.049
Token the
Token ID262
Feature activation+1.55e-02

Pos logit contributions
 be+0.003
 have+0.002
 do+0.002
 find+0.002
 go+0.002
Neg logit contributions
led-0.003
ier-0.003
iest-0.003
us-0.003
rito-0.003
Token k
Token ID479
Feature activation+3.58e-02

Pos logit contributions
 be+0.025
 have+0.025
 do+0.023
 find+0.023
 see+0.020
Neg logit contributions
rito-0.032
ier-0.031
iest-0.031
rob-0.030
rol-0.030
Tokenite
Token ID578
Feature activation+1.65e-02

Pos logit contributions
 be+0.081
 have+0.079
 do+0.078
 find+0.074
 go+0.069
Neg logit contributions
rol-0.055
rob-0.055
us-0.055
rito-0.054
at-0.053
Token was
Token ID373
Feature activation+2.79e-03

Pos logit contributions
 be+0.024
 have+0.024
 do+0.023
 find+0.022
 see+0.019
Neg logit contributions
led-0.035
ier-0.035
rito-0.035
rol-0.035
iest-0.035
Token gone
Token ID3750
Feature activation-5.10e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.003
 I+0.003
Neg logit contributions
rob-0.007
rito-0.007
us-0.006
rol-0.006
at-0.006
Token.
Token ID13
Feature activation+1.64e-03

Pos logit contributions
ier+0.013
rito+0.013
led+0.013
iest+0.012
us+0.012
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.005
 see-0.004
Token His
Token ID2399
Feature activation+2.55e-03

Pos logit contributions
 be+0.003
 do+0.003
 have+0.003
 find+0.002
 go+0.002
Neg logit contributions
rol-0.003
us-0.003
rito-0.003
rob-0.003
cream-0.003
Token
Token ID198
Feature activation-3.25e-03

Pos logit contributions
us+0.006
rol+0.006
rito+0.006
rob+0.006
at+0.006
Neg logit contributions
 be-0.003
 do-0.003
 have-0.003
 find-0.003
 go-0.002
TokenThe
Token ID464
Feature activation+3.97e-03

Pos logit contributions
rol+0.006
us+0.006
rito+0.006
at+0.006
rob+0.006
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.006
 look-0.005
Token girl
Token ID2576
Feature activation+1.03e-03

Pos logit contributions
 have+0.006
 be+0.005
 find+0.005
 do+0.005
 see+0.005
Neg logit contributions
iest-0.008
rito-0.008
 kernels-0.008
led-0.008
ier-0.008
Token saw
Token ID2497
Feature activation+1.42e-03

Pos logit contributions
 have+0.002
 be+0.002
 do+0.002
 find+0.001
 I+0.001
Neg logit contributions
led-0.002
at-0.002
ridge-0.002
rito-0.002
rol-0.002
Token a
Token ID257
Feature activation+1.09e-02

Pos logit contributions
 have+0.002
 be+0.002
 do+0.002
 find+0.002
 I+0.001
Neg logit contributions
led-0.003
ted-0.003
iest-0.003
rol-0.003
ier-0.003
Token k
Token ID479
Feature activation+3.53e-02

Pos logit contributions
 have+0.015
 find+0.014
 be+0.013
 do+0.013
 I+0.013
Neg logit contributions
rito-0.022
 kernels-0.022
iest-0.021
us-0.021
 casc-0.020
Tokenite
Token ID578
Feature activation+1.73e-02

Pos logit contributions
 be+0.081
 do+0.078
 have+0.078
 find+0.076
 go+0.071
Neg logit contributions
rol-0.054
us-0.052
rob-0.052
iest-0.051
ier-0.050
Token up
Token ID510
Feature activation-1.29e-02

Pos logit contributions
 have+0.026
 be+0.024
 find+0.024
 do+0.023
 see+0.022
Neg logit contributions
ier-0.034
iest-0.034
rol-0.033
led-0.033
rily-0.032
Token in
Token ID287
Feature activation+9.64e-03

Pos logit contributions
rob+0.034
ridge+0.033
rito+0.033
us+0.032
at+0.032
Neg logit contributions
 be-0.017
 have-0.015
 do-0.015
 find-0.014
 see-0.012
Token the
Token ID262
Feature activation+5.63e-04

Pos logit contributions
 be+0.011
 have+0.011
 do+0.011
 find+0.010
 I+0.008
Neg logit contributions
rito-0.024
rob-0.024
rol-0.024
iest-0.024
ier-0.023
Token sky
Token ID6766
Feature activation+1.81e-02

Pos logit contributions
 do+0.001
 have+0.001
 be+0.001
 find+0.001
 want+0.001
Neg logit contributions
rito-0.001
led-0.001
 kernels-0.001
rob-0.001
iest-0.001
Token day
Token ID1110
Feature activation+7.33e-03

Pos logit contributions
 be+0.023
 have+0.022
 find+0.021
 do+0.021
 see+0.019
Neg logit contributions
iest-0.027
at-0.027
led-0.026
rito-0.026
us-0.025
Token,
Token ID11
Feature activation+6.26e-03

Pos logit contributions
 be+0.006
 have+0.006
 do+0.005
 find+0.005
 learn+0.003
Neg logit contributions
ier-0.020
us-0.020
ted-0.020
led-0.020
iest-0.020
Token Lemon
Token ID27252
Feature activation-2.17e-03

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.007
 want+0.005
Neg logit contributions
ted-0.015
rob-0.014
ier-0.014
rol-0.014
led-0.014
Token saw
Token ID2497
Feature activation+3.40e-03

Pos logit contributions
rito+0.005
rob+0.005
ier+0.005
at+0.005
led+0.005
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 I-0.002
Token a
Token ID257
Feature activation+1.10e-02

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.004
 want+0.003
Neg logit contributions
ier-0.008
rob-0.008
led-0.008
ted-0.008
us-0.008
Token k
Token ID479
Feature activation+3.49e-02

Pos logit contributions
 have+0.015
 be+0.014
 find+0.013
 do+0.012
 learn+0.012
Neg logit contributions
us-0.022
rito-0.022
iest-0.022
rob-0.022
 kernels-0.022
Tokenite
Token ID578
Feature activation+1.78e-02

Pos logit contributions
 be+0.083
 have+0.079
 do+0.079
 find+0.078
 go+0.072
Neg logit contributions
rol-0.050
rob-0.048
us-0.048
iest-0.046
at-0.046
Token stuck
Token ID7819
Feature activation-9.19e-04

Pos logit contributions
 have+0.026
 be+0.026
 do+0.024
 find+0.024
 see+0.022
Neg logit contributions
ier-0.035
led-0.034
rol-0.034
at-0.034
iest-0.034
Token in
Token ID287
Feature activation+5.01e-03

Pos logit contributions
rito+0.002
rob+0.002
us+0.002
rol+0.002
ier+0.002
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token the
Token ID262
Feature activation+8.66e-05

Pos logit contributions
 be+0.006
 have+0.006
 do+0.005
 find+0.005
 I+0.004
Neg logit contributions
rito-0.012
rob-0.012
us-0.012
rol-0.012
 kernels-0.012
Token tree
Token ID5509
Feature activation+1.65e-02

Pos logit contributions
 have+0.000
 do+0.000
 be+0.000
 find+0.000
 want+0.000
Neg logit contributions
rito-0.000
ridge-0.000
at-0.000
side-0.000
rob-0.000
Token pencil
Token ID21613
Feature activation+5.48e-05

Pos logit contributions
iest+0.007
rob+0.007
 kernels+0.007
led+0.007
rol+0.007
Neg logit contributions
 have-0.007
 be-0.007
 do-0.007
 find-0.007
 see-0.006
Tokens
Token ID82
Feature activation+2.91e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 see+0.000
Neg logit contributions
ier-0.000
us-0.000
led-0.000
rob-0.000
iest-0.000
Token,
Token ID11
Feature activation-4.85e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 I+0.002
Neg logit contributions
rito-0.008
ier-0.008
rol-0.007
led-0.007
iest-0.007
Token and
Token ID290
Feature activation+8.14e-03

Pos logit contributions
rito+0.011
rob+0.011
rol+0.011
led+0.011
 kernels+0.011
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.005
 go-0.004
Token a
Token ID257
Feature activation+1.46e-02

Pos logit contributions
 be+0.011
 have+0.011
 do+0.011
 find+0.010
 learn+0.009
Neg logit contributions
led-0.018
us-0.017
rol-0.017
 kernels-0.017
iest-0.017
Token k
Token ID479
Feature activation+3.49e-02

Pos logit contributions
 have+0.022
 be+0.021
 find+0.020
 do+0.020
 I+0.018
Neg logit contributions
rito-0.030
iest-0.029
us-0.029
 kernels-0.028
ridge-0.028
Tokenite
Token ID578
Feature activation+2.11e-02

Pos logit contributions
 be+0.077
 have+0.075
 do+0.074
 find+0.073
 learn+0.069
Neg logit contributions
rob-0.057
us-0.056
rol-0.055
iest-0.053
rito-0.053
Token!
Token ID0
Feature activation+5.82e-03

Pos logit contributions
 have+0.025
 be+0.024
 do+0.023
 find+0.022
 learn+0.020
Neg logit contributions
rol-0.050
ier-0.050
iest-0.049
led-0.048
at-0.048
Token
Token ID198
Feature activation+3.41e-03

Pos logit contributions
 be+0.008
 have+0.008
 do+0.008
 find+0.007
 go+0.006
Neg logit contributions
rol-0.013
rito-0.013
us-0.013
rob-0.013
ier-0.013
Token
Token ID198
Feature activation+1.99e-03

Pos logit contributions
 be+0.004
 have+0.003
 do+0.003
 find+0.003
 I+0.002
Neg logit contributions
us-0.009
rito-0.009
rol-0.009
rob-0.009
at-0.009
TokenThe
Token ID464
Feature activation+6.15e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.003
 I+0.003
Neg logit contributions
rito-0.004
rob-0.004
rol-0.004
ier-0.004
us-0.004
Token She
Token ID1375
Feature activation-2.59e-03

Pos logit contributions
 have+0.005
 be+0.005
 do+0.005
 find+0.004
 see+0.004
Neg logit contributions
ted-0.008
rol-0.007
led-0.007
ridge-0.007
side-0.007
Token cried
Token ID16896
Feature activation+3.85e-03

Pos logit contributions
led+0.006
rito+0.006
rob+0.006
ier+0.006
us+0.006
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.003
 I-0.003
Token and
Token ID290
Feature activation-9.08e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.003
 I+0.002
Neg logit contributions
rito-0.011
rob-0.011
rol-0.010
us-0.010
ier-0.010
Token threw
Token ID9617
Feature activation+2.56e-03

Pos logit contributions
led+0.020
rob+0.020
rito+0.020
ier+0.019
us+0.019
Neg logit contributions
 have-0.013
 be-0.013
 do-0.012
 find-0.012
 I-0.011
Token a
Token ID257
Feature activation-4.92e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.003
 I+0.003
Neg logit contributions
rito-0.005
led-0.005
rol-0.005
ier-0.005
ted-0.005
Token tant
Token ID24246
Feature activation-6.86e-02

Pos logit contributions
rito+0.008
us+0.007
rob+0.007
iest+0.007
ier+0.007
Neg logit contributions
 be-0.011
 have-0.010
 do-0.010
 find-0.010
 I-0.009
Tokenrum
Token ID6582
Feature activation-5.43e-03

Pos logit contributions
rol+0.072
ridge+0.070
rob+0.069
 kernels+0.067
rito+0.067
Neg logit contributions
 be-0.178
 have-0.174
 do-0.171
 find-0.163
 want-0.153
Token.
Token ID13
Feature activation+4.65e-03

Pos logit contributions
ier+0.014
led+0.014
us+0.014
rol+0.014
rito+0.014
Neg logit contributions
 have-0.006
 be-0.006
 do-0.005
 find-0.005
 see-0.004
Token Her
Token ID2332
Feature activation-1.26e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.007
 find+0.006
 go+0.005
Neg logit contributions
rito-0.010
rol-0.010
led-0.010
us-0.010
ridge-0.010
Token mom
Token ID1995
Feature activation+3.53e-03

Pos logit contributions
led+0.003
rito+0.003
iest+0.003
ier+0.003
us+0.003
Neg logit contributions
 have-0.001
 be-0.001
 do-0.001
 find-0.001
 I-0.001
Token tried
Token ID3088
Feature activation-8.13e-03

Pos logit contributions
 be+0.005
 have+0.005
 do+0.005
 find+0.005
 I+0.004
Neg logit contributions
led-0.008
ier-0.008
us-0.008
rito-0.008
rob-0.008
Token his
Token ID465
Feature activation-1.09e-02

Pos logit contributions
 be+0.017
 have+0.016
 do+0.016
 find+0.015
 see+0.014
Neg logit contributions
rob-0.018
ridge-0.018
rito-0.017
rol-0.017
at-0.017
Token feet
Token ID3625
Feature activation-2.75e-04

Pos logit contributions
rito+0.023
 kernels+0.022
rily+0.022
led+0.022
us+0.021
Neg logit contributions
 have-0.016
 do-0.015
 be-0.015
 I-0.014
 find-0.014
Token and
Token ID290
Feature activation-3.81e-04

Pos logit contributions
rito+0.001
rob+0.001
rol+0.001
ier+0.001
us+0.001
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 I-0.000
Token threw
Token ID9617
Feature activation+8.27e-03

Pos logit contributions
rito+0.001
rob+0.001
led+0.001
rol+0.001
us+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token a
Token ID257
Feature activation-5.25e-03

Pos logit contributions
 be+0.014
 have+0.014
 do+0.013
 find+0.012
 I+0.011
Neg logit contributions
rito-0.017
rob-0.017
rol-0.017
ier-0.017
us-0.016
Token tant
Token ID24246
Feature activation-6.81e-02

Pos logit contributions
rito+0.011
rob+0.011
rol+0.011
ier+0.011
us+0.011
Neg logit contributions
 be-0.008
 have-0.008
 do-0.008
 find-0.007
 I-0.006
Tokenrum
Token ID6582
Feature activation+2.07e-03

Pos logit contributions
rol+0.082
ridge+0.078
 casc+0.077
rito+0.075
side+0.075
Neg logit contributions
 be-0.166
 do-0.156
 have-0.156
 find-0.148
 learn-0.139
Token.
Token ID13
Feature activation-3.11e-04

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 see+0.001
Neg logit contributions
ier-0.006
led-0.006
us-0.006
rito-0.006
rol-0.006
Token
Token ID198
Feature activation+3.17e-03

Pos logit contributions
rito+0.001
rob+0.001
rol+0.001
ier+0.001
us+0.001
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 I-0.000
Token
Token ID198
Feature activation+1.24e-03

Pos logit contributions
 be+0.004
 do+0.003
 have+0.003
 find+0.003
 go+0.003
Neg logit contributions
us-0.008
rito-0.008
rob-0.008
rol-0.008
at-0.008
Token"
Token ID1
Feature activation+4.31e-06

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 look+0.001
Neg logit contributions
us-0.003
rito-0.003
rob-0.003
at-0.003
rol-0.003
Token.
Token ID13
Feature activation-1.65e-03

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.006
 I+0.005
Neg logit contributions
rito-0.021
ier-0.021
us-0.021
rob-0.021
rol-0.021
Token He
Token ID679
Feature activation-1.78e-03

Pos logit contributions
rito+0.003
ier+0.003
us+0.003
rob+0.003
rol+0.003
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 go-0.003
Token kept
Token ID4030
Feature activation+3.64e-04

Pos logit contributions
rol+0.005
rito+0.005
rob+0.005
ier+0.005
iest+0.005
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 see-0.001
Token throwing
Token ID9644
Feature activation+3.99e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 I+0.000
Neg logit contributions
rito-0.001
rob-0.001
rol-0.001
ier-0.001
us-0.001
Token a
Token ID257
Feature activation-4.49e-03

Pos logit contributions
 be+0.006
 have+0.006
 do+0.006
 find+0.005
 I+0.005
Neg logit contributions
led-0.009
rito-0.009
ier-0.008
rob-0.008
 kernels-0.008
Token tant
Token ID24246
Feature activation-6.77e-02

Pos logit contributions
rito+0.010
us+0.010
rob+0.010
ier+0.010
iest+0.010
Neg logit contributions
 be-0.007
 have-0.007
 do-0.006
 find-0.006
 I-0.005
Tokenrum
Token ID6582
Feature activation+6.41e-05

Pos logit contributions
rol+0.069
ridge+0.065
rob+0.063
rito+0.063
 kernels+0.060
Neg logit contributions
 be-0.182
 have-0.178
 do-0.175
 find-0.167
 learn-0.157
Token.
Token ID13
Feature activation+1.99e-03

Pos logit contributions
 have+0.000
 be+0.000
 do+0.000
 find+0.000
 see+0.000
Neg logit contributions
ier-0.000
led-0.000
us-0.000
iest-0.000
rily-0.000
Token His
Token ID2399
Feature activation-3.80e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 go+0.003
Neg logit contributions
rito-0.004
rob-0.004
ier-0.004
rol-0.004
us-0.004
Token mom
Token ID1995
Feature activation+8.75e-03

Pos logit contributions
rito+0.010
 kernels+0.010
led+0.010
us+0.010
ier+0.010
Neg logit contributions
 have-0.003
 be-0.003
 do-0.003
 find-0.003
 I-0.002
Token was
Token ID373
Feature activation+8.14e-03

Pos logit contributions
 be+0.012
 have+0.012
 do+0.012
 find+0.010
 see+0.009
Neg logit contributions
rito-0.021
rob-0.020
rol-0.020
ier-0.020
us-0.020
Token listen
Token ID6004
Feature activation+9.56e-03

Pos logit contributions
 I+0.011
 be+0.011
 we+0.010
 frame+0.010
 house+0.010
Neg logit contributions
ier-0.022
rito-0.021
wy-0.020
at-0.020
iest-0.020
Token.
Token ID13
Feature activation-1.47e-03

Pos logit contributions
 be+0.014
 have+0.014
 do+0.013
 find+0.012
 I+0.011
Neg logit contributions
ier-0.021
rito-0.021
led-0.021
us-0.021
rol-0.021
Token She
Token ID1375
Feature activation-2.91e-03

Pos logit contributions
ted+0.003
ier+0.003
 asleep+0.003
rily+0.003
rito+0.003
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 go-0.002
Token threw
Token ID9617
Feature activation+3.39e-03

Pos logit contributions
rito+0.007
rob+0.007
rol+0.007
ier+0.007
us+0.007
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.003
 I-0.003
Token a
Token ID257
Feature activation-7.39e-03

Pos logit contributions
 be+0.005
 have+0.005
 do+0.004
 find+0.004
 I+0.004
Neg logit contributions
rito-0.008
led-0.008
rol-0.007
ier-0.007
rob-0.007
Token tant
Token ID24246
Feature activation-6.67e-02

Pos logit contributions
rito+0.011
rob+0.011
rol+0.011
ier+0.011
us+0.011
Neg logit contributions
 be-0.016
 have-0.016
 do-0.015
 find-0.015
 I-0.013
Tokenrum
Token ID6582
Feature activation-2.84e-03

Pos logit contributions
rol+0.070
ridge+0.068
rob+0.066
 kernels+0.064
rito+0.064
Neg logit contributions
 be-0.175
 have-0.170
 do-0.167
 find-0.159
 want-0.150
Token in
Token ID287
Feature activation+4.09e-03

Pos logit contributions
ier+0.007
led+0.007
ted+0.006
us+0.006
rily+0.006
Neg logit contributions
 have-0.004
 be-0.004
 do-0.003
 find-0.003
 see-0.003
Token the
Token ID262
Feature activation-4.87e-03

Pos logit contributions
 be+0.005
 do+0.005
 have+0.005
 find+0.005
 I+0.004
Neg logit contributions
iest-0.008
rito-0.008
ted-0.008
med-0.008
ier-0.008
Token store
Token ID3650
Feature activation-3.55e-03

Pos logit contributions
rob+0.009
us+0.009
led+0.009
rito+0.009
iest+0.008
Neg logit contributions
 be-0.008
 have-0.008
 do-0.008
 find-0.008
 I-0.007
Token.
Token ID13
Feature activation+9.56e-04

Pos logit contributions
ier+0.009
led+0.008
side+0.008
rol+0.008
ted+0.008
Neg logit contributions
 have-0.004
 be-0.004
 do-0.004
 find-0.003
 see-0.003
Token sad
Token ID6507
Feature activation-5.26e-03

Pos logit contributions
led+0.004
ted+0.003
ier+0.003
aned+0.003
rol+0.003
Neg logit contributions
 have-0.003
 be-0.003
 do-0.002
 find-0.002
 draw-0.002
Token.
Token ID13
Feature activation+4.66e-03

Pos logit contributions
ier+0.014
led+0.014
rito+0.013
us+0.013
iest+0.013
Neg logit contributions
 have-0.006
 be-0.006
 do-0.005
 find-0.005
 see-0.004
Token She
Token ID1375
Feature activation-3.94e-03

Pos logit contributions
 be+0.006
 do+0.006
 have+0.006
 find+0.006
 go+0.005
Neg logit contributions
ted-0.010
rol-0.010
side-0.009
led-0.009
aned-0.009
Token threw
Token ID9617
Feature activation+3.37e-03

Pos logit contributions
led+0.008
aned+0.008
ted+0.008
rito+0.008
us+0.008
Neg logit contributions
 have-0.006
 be-0.006
 do-0.006
 find-0.005
 I-0.005
Token a
Token ID257
Feature activation-3.10e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.004
 I+0.003
Neg logit contributions
rito-0.008
led-0.008
rob-0.008
rol-0.008
ted-0.008
Token tant
Token ID24246
Feature activation-6.62e-02

Pos logit contributions
rito+0.005
rob+0.005
rol+0.005
ier+0.005
us+0.005
Neg logit contributions
 be-0.007
 have-0.006
 do-0.006
 find-0.006
 I-0.005
Tokenrum
Token ID6582
Feature activation-2.63e-03

Pos logit contributions
rol+0.073
ridge+0.071
rob+0.070
 kernels+0.069
 casc+0.068
Neg logit contributions
 be-0.165
 have-0.161
 do-0.157
 find-0.150
 want-0.142
Token and
Token ID290
Feature activation+2.78e-03

Pos logit contributions
led+0.006
ier+0.006
ted+0.006
us+0.006
rol+0.006
Neg logit contributions
 have-0.004
 be-0.003
 do-0.003
 find-0.003
 see-0.003
Token cried
Token ID16896
Feature activation+8.85e-03

Pos logit contributions
 have+0.004
 be+0.004
 do+0.004
 find+0.004
 I+0.004
Neg logit contributions
led-0.005
ted-0.005
ier-0.005
rob-0.005
us-0.005
Token loudly
Token ID23112
Feature activation-1.18e-02

Pos logit contributions
 be+0.010
 have+0.010
 do+0.009
 find+0.009
 I+0.007
Neg logit contributions
ier-0.023
rito-0.022
led-0.022
us-0.022
rol-0.022
Token.
Token ID13
Feature activation+3.68e-03

Pos logit contributions
rito+0.032
rob+0.032
ier+0.032
rol+0.032
us+0.032
Neg logit contributions
 be-0.012
 have-0.011
 do-0.011
 find-0.009
 I-0.008
Token and
Token ID290
Feature activation-1.46e-02

Pos logit contributions
rito+0.009
rob+0.009
rol+0.009
at+0.009
ier+0.009
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.002
 I-0.002
Token so
Token ID523
Feature activation+1.87e-03

Pos logit contributions
led+0.032
rito+0.030
ted+0.029
at+0.029
ier+0.029
Neg logit contributions
 be-0.021
 have-0.020
 find-0.020
 do-0.019
 see-0.017
Token she
Token ID673
Feature activation-2.58e-03

Pos logit contributions
 find+0.003
 be+0.003
 have+0.003
 do+0.002
 club+0.002
Neg logit contributions
led-0.004
iest-0.004
ted-0.003
ier-0.003
med-0.003
Token threw
Token ID9617
Feature activation-2.99e-03

Pos logit contributions
rol+0.006
ier+0.006
rob+0.006
rito+0.006
iest+0.006
Neg logit contributions
 be-0.004
 have-0.003
 do-0.003
 find-0.003
 go-0.003
Token a
Token ID257
Feature activation-8.50e-03

Pos logit contributions
rito+0.007
led+0.007
rob+0.007
rol+0.007
ier+0.006
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.004
 I-0.003
Token tant
Token ID24246
Feature activation-6.60e-02

Pos logit contributions
rito+0.020
rob+0.020
rol+0.020
ier+0.020
at+0.020
Neg logit contributions
 be-0.011
 have-0.011
 do-0.011
 find-0.010
 go-0.008
Tokenrum
Token ID6582
Feature activation-6.88e-03

Pos logit contributions
rol+0.073
rob+0.071
ridge+0.071
side+0.067
rito+0.067
Neg logit contributions
 be-0.166
 have-0.161
 do-0.159
 find-0.153
 learn-0.143
Token when
Token ID618
Feature activation+2.59e-03

Pos logit contributions
ier+0.018
led+0.018
rol+0.018
rito+0.018
us+0.018
Neg logit contributions
 be-0.008
 have-0.007
 do-0.007
 find-0.006
 see-0.005
Token her
Token ID607
Feature activation-4.06e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.003
 want+0.003
Neg logit contributions
ted-0.005
led-0.005
iest-0.005
ier-0.005
 kernels-0.005
Token parents
Token ID3397
Feature activation-1.26e-02

Pos logit contributions
rito+0.012
ier+0.012
rob+0.012
us+0.012
led+0.012
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.002
 I-0.002
Token said
Token ID531
Feature activation+5.15e-04

Pos logit contributions
rito+0.029
rob+0.029
rol+0.029
ier+0.029
us+0.029
Neg logit contributions
 be-0.018
 have-0.017
 do-0.017
 find-0.015
 I-0.013
Token her
Token ID607
Feature activation-7.51e-03

Pos logit contributions
rito+0.001
rol+0.001
led+0.001
rob+0.001
us+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token feet
Token ID3625
Feature activation-2.12e-03

Pos logit contributions
led+0.016
rito+0.016
 kernels+0.016
rily+0.016
ier+0.016
Neg logit contributions
 have-0.011
 be-0.010
 do-0.010
 find-0.009
 I-0.008
Token and
Token ID290
Feature activation+2.06e-03

Pos logit contributions
ier+0.005
rol+0.005
rito+0.005
led+0.005
us+0.005
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 I-0.002
Token threw
Token ID9617
Feature activation+3.85e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 I+0.003
Neg logit contributions
led-0.004
rob-0.004
rito-0.004
rol-0.004
ier-0.004
Token a
Token ID257
Feature activation-4.72e-03

Pos logit contributions
 be+0.006
 have+0.006
 do+0.005
 find+0.005
 I+0.005
Neg logit contributions
rito-0.008
rol-0.008
led-0.008
rob-0.008
ier-0.008
Token tant
Token ID24246
Feature activation-6.53e-02

Pos logit contributions
rito+0.005
us+0.005
ier+0.005
iest+0.005
rob+0.005
Neg logit contributions
 be-0.013
 have-0.012
 do-0.012
 find-0.012
 I-0.011
Tokenrum
Token ID6582
Feature activation-4.26e-03

Pos logit contributions
rol+0.070
ridge+0.070
rob+0.068
 kernels+0.066
 casc+0.064
Neg logit contributions
 be-0.167
 have-0.161
 do-0.158
 find-0.152
 want-0.142
Token.
Token ID13
Feature activation+4.69e-03

Pos logit contributions
ier+0.011
led+0.011
us+0.011
rol+0.011
rily+0.010
Neg logit contributions
 have-0.005
 be-0.005
 do-0.004
 find-0.004
 see-0.004
Token She
Token ID1375
Feature activation-2.98e-03

Pos logit contributions
 be+0.006
 have+0.006
 do+0.006
 find+0.005
 go+0.004
Neg logit contributions
rol-0.011
rito-0.011
ier-0.011
us-0.011
led-0.011
Token grabbed
Token ID13176
Feature activation+1.55e-03

Pos logit contributions
led+0.006
rob+0.006
rito+0.006
us+0.006
rol+0.006
Neg logit contributions
 be-0.005
 have-0.005
 do-0.005
 find-0.004
 I-0.004
Token the
Token ID262
Feature activation-6.27e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.001
 I+0.001
Neg logit contributions
rito-0.004
rob-0.004
rol-0.004
ier-0.004
us-0.004
Token to
Token ID284
Feature activation+7.25e-03

Pos logit contributions
rito+0.014
rob+0.014
rol+0.014
us+0.014
ier+0.014
Neg logit contributions
 be-0.011
 have-0.010
 do-0.010
 find-0.009
 I-0.008
Token cry
Token ID3960
Feature activation+6.77e-03

Pos logit contributions
 I+0.008
 we+0.006
 be+0.006
 have+0.006
 has+0.006
Neg logit contributions
rito-0.015
ridge-0.014
zzy-0.014
at-0.014
ited-0.014
Token and
Token ID290
Feature activation+7.56e-03

Pos logit contributions
 have+0.009
 be+0.009
 do+0.008
 find+0.008
 I+0.007
Neg logit contributions
ier-0.016
led-0.015
rito-0.014
us-0.014
rily-0.014
Token threw
Token ID9617
Feature activation+1.56e-03

Pos logit contributions
 have+0.012
 do+0.012
 be+0.012
 I+0.010
 find+0.010
Neg logit contributions
led-0.013
ted-0.013
ridge-0.012
 cooled-0.011
aned-0.011
Token a
Token ID257
Feature activation-4.24e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 I+0.002
Neg logit contributions
rito-0.003
led-0.003
ted-0.003
rob-0.003
ier-0.003
Token tant
Token ID24246
Feature activation-6.43e-02

Pos logit contributions
rob+0.008
rito+0.008
rol+0.008
ier+0.008
at+0.008
Neg logit contributions
 be-0.008
 have-0.007
 do-0.007
 find-0.007
 go-0.006
Tokenrum
Token ID6582
Feature activation-4.72e-03

Pos logit contributions
rol+0.071
rob+0.068
ridge+0.067
 kernels+0.065
 casc+0.063
Neg logit contributions
 be-0.164
 have-0.160
 do-0.157
 find-0.151
 learn-0.141
Token.
Token ID13
Feature activation+4.89e-03

Pos logit contributions
ier+0.012
led+0.012
ted+0.011
rily+0.011
us+0.011
Neg logit contributions
 have-0.006
 be-0.006
 do-0.005
 find-0.005
 see-0.004
Token She
Token ID1375
Feature activation-4.60e-03

Pos logit contributions
 have+0.008
 be+0.007
 do+0.007
 find+0.007
 see+0.006
Neg logit contributions
ted-0.009
aned-0.009
side-0.009
ridge-0.009
led-0.009
Token didn
Token ID1422
Feature activation-1.64e-02

Pos logit contributions
led+0.010
rob+0.009
us+0.009
rito+0.009
ier+0.009
Neg logit contributions
 be-0.007
 have-0.007
 do-0.007
 find-0.006
 I-0.006
Token't
Token ID470
Feature activation+1.07e-03

Pos logit contributions
rito+0.035
rob+0.034
rol+0.034
ier+0.034
led+0.034
Neg logit contributions
 be-0.027
 have-0.026
 do-0.025
 find-0.023
 I-0.021
Token kept
Token ID4030
Feature activation+5.37e-03

Pos logit contributions
led+0.008
rito+0.008
us+0.008
rob+0.008
ier+0.008
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.005
 I-0.005
Token crying
Token ID13774
Feature activation+1.44e-03

Pos logit contributions
 be+0.009
 have+0.009
 do+0.008
 find+0.008
 see+0.007
Neg logit contributions
rito-0.012
rol-0.011
rob-0.011
iest-0.011
ier-0.011
Token and
Token ID290
Feature activation+1.07e-02

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.001
 I+0.001
Neg logit contributions
ier-0.004
led-0.004
rito-0.004
us-0.004
iest-0.004
Token throwing
Token ID9644
Feature activation+4.87e-03

Pos logit contributions
 have+0.017
 be+0.017
 do+0.017
 find+0.015
 I+0.014
Neg logit contributions
led-0.021
ier-0.020
ted-0.020
rol-0.020
ridge-0.020
Token a
Token ID257
Feature activation-5.29e-04

Pos logit contributions
 be+0.007
 have+0.006
 do+0.006
 find+0.006
 I+0.005
Neg logit contributions
rito-0.012
rob-0.012
rol-0.011
ier-0.011
us-0.011
Token tant
Token ID24246
Feature activation-6.40e-02

Pos logit contributions
rob+0.001
rol+0.001
rito+0.001
ier+0.001
at+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 go-0.001
Tokenrum
Token ID6582
Feature activation+2.12e-05

Pos logit contributions
rol+0.071
ridge+0.069
rob+0.068
 kernels+0.067
 casc+0.065
Neg logit contributions
 be-0.161
 have-0.157
 do-0.154
 find-0.147
 learn-0.138
Token.
Token ID13
Feature activation+6.99e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 see+0.000
Neg logit contributions
ier-0.000
led-0.000
rol-0.000
iest-0.000
us-0.000
Token
Token ID198
Feature activation+3.96e-04

Pos logit contributions
 be+0.010
 do+0.010
 have+0.010
 find+0.009
 go+0.008
Neg logit contributions
rol-0.015
led-0.015
rito-0.015
ted-0.015
us-0.015
Token
Token ID198
Feature activation-1.59e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 go+0.000
Neg logit contributions
us-0.001
rito-0.001
rol-0.001
rob-0.001
at-0.001
TokenFinally
Token ID11158
Feature activation+8.95e-03

Pos logit contributions
side+0.002
led+0.002
aned+0.002
us+0.002
tto+0.002
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 look-0.003
Token surprise
Token ID5975
Feature activation+1.66e-03

Pos logit contributions
iest+0.005
rito+0.005
ier+0.005
 kernels+0.005
rob+0.005
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.003
 see-0.003
Token.
Token ID13
Feature activation-1.03e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.001
 find+0.001
 see+0.001
Neg logit contributions
ier-0.005
rito-0.005
rob-0.004
iest-0.004
us-0.004
Token He
Token ID679
Feature activation+1.08e-04

Pos logit contributions
rito+0.002
us+0.002
rob+0.002
ier+0.002
rol+0.002
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 go-0.001
Token threw
Token ID9617
Feature activation+7.17e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 I+0.000
Neg logit contributions
rito-0.000
rol-0.000
rob-0.000
ier-0.000
us-0.000
Token a
Token ID257
Feature activation-3.96e-03

Pos logit contributions
 be+0.010
 have+0.010
 do+0.010
 find+0.009
 I+0.008
Neg logit contributions
rito-0.016
led-0.016
ier-0.016
rob-0.016
rol-0.016
Token tant
Token ID24246
Feature activation-6.33e-02

Pos logit contributions
rito+0.008
us+0.008
rob+0.008
ier+0.008
iest+0.008
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.006
 I-0.005
Tokenrum
Token ID6582
Feature activation-9.05e-05

Pos logit contributions
rol+0.068
ridge+0.064
rito+0.061
rob+0.061
 casc+0.059
Neg logit contributions
 be-0.167
 have-0.161
 do-0.159
 find-0.152
 learn-0.143
Token.
Token ID13
Feature activation+1.61e-03

Pos logit contributions
ier+0.000
led+0.000
us+0.000
rito+0.000
rol+0.000
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 see-0.000
Token He
Token ID679
Feature activation+7.17e-04

Pos logit contributions
 be+0.003
 have+0.002
 do+0.002
 find+0.002
 go+0.002
Neg logit contributions
rito-0.003
rob-0.003
rol-0.003
ier-0.003
us-0.003
Token kicked
Token ID12165
Feature activation+7.96e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 I+0.001
Neg logit contributions
rito-0.002
rol-0.002
rob-0.002
ier-0.002
at-0.002
Token and
Token ID290
Feature activation-3.24e-03

Pos logit contributions
 be+0.010
 have+0.010
 do+0.009
 find+0.009
 I+0.007
Neg logit contributions
rito-0.019
ier-0.019
led-0.019
us-0.019
rob-0.019
Token and
Token ID290
Feature activation+1.49e-03

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 find+0.001
 see+0.000
Neg logit contributions
ier-0.001
led-0.001
us-0.001
ted-0.001
iest-0.001
Token started
Token ID2067
Feature activation-1.64e-03

Pos logit contributions
 be+0.003
 have+0.002
 do+0.002
 find+0.002
 I+0.002
Neg logit contributions
rito-0.003
rob-0.003
ier-0.003
rol-0.003
us-0.003
Token to
Token ID284
Feature activation+9.41e-03

Pos logit contributions
rob+0.004
iest+0.004
ier+0.004
rol+0.004
rito+0.004
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 see-0.002
Token throw
Token ID3714
Feature activation+8.06e-03

Pos logit contributions
 be+0.008
 have+0.008
 do+0.007
 I+0.007
 find+0.006
Neg logit contributions
rito-0.024
ridge-0.024
at-0.024
led-0.024
us-0.024
Token a
Token ID257
Feature activation-2.34e-03

Pos logit contributions
 be+0.011
 have+0.010
 do+0.010
 find+0.009
 I+0.008
Neg logit contributions
rito-0.020
rob-0.019
rol-0.019
ier-0.019
us-0.019
Token tant
Token ID24246
Feature activation-6.19e-02

Pos logit contributions
rob+0.005
rol+0.005
rito+0.005
ier+0.005
at+0.004
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.004
 go-0.003
Tokenrum
Token ID6582
Feature activation+1.01e-03

Pos logit contributions
rol+0.069
ridge+0.065
rob+0.065
 kernels+0.062
rito+0.062
Neg logit contributions
 be-0.158
 have-0.153
 do-0.151
 find-0.146
 learn-0.137
Token.
Token ID13
Feature activation+6.48e-03

Pos logit contributions
 have+0.001
 be+0.001
 do+0.001
 find+0.001
 see+0.001
Neg logit contributions
ier-0.003
led-0.003
rily-0.002
us-0.002
rol-0.002
Token Suddenly
Token ID24975
Feature activation+1.03e-02

Pos logit contributions
 have+0.009
 be+0.009
 do+0.009
 find+0.008
 see+0.007
Neg logit contributions
led-0.013
ted-0.013
rito-0.013
rol-0.013
ridge-0.013
Token,
Token ID11
Feature activation+7.87e-03

Pos logit contributions
 be+0.010
 have+0.010
 do+0.010
 find+0.009
 want+0.008
Neg logit contributions
ier-0.026
led-0.026
iest-0.025
ted-0.025
us-0.025
Token she
Token ID673
Feature activation+4.88e-03

Pos logit contributions
 be+0.010
 have+0.010
 do+0.009
 find+0.009
 I+0.007
Neg logit contributions
ier-0.018
led-0.018
rito-0.018
rob-0.018
iest-0.018
Token temper
Token ID4124
Feature activation-1.17e-02

Pos logit contributions
rito+0.045
iest+0.044
led+0.043
ier+0.043
at+0.043
Neg logit contributions
 have-0.031
 be-0.030
 do-0.028
 find-0.026
 I-0.026
Token and
Token ID290
Feature activation+5.17e-03

Pos logit contributions
rito+0.029
ier+0.029
led+0.029
iest+0.029
us+0.028
Neg logit contributions
 be-0.014
 have-0.014
 do-0.013
 find-0.012
 I-0.011
Token often
Token ID1690
Feature activation+4.48e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.007
 find+0.007
 see+0.006
Neg logit contributions
led-0.011
iest-0.011
us-0.011
rito-0.011
ier-0.010
Token threw
Token ID9617
Feature activation+5.00e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.007
 find+0.006
 I+0.006
Neg logit contributions
led-0.009
ier-0.009
rito-0.009
iest-0.009
at-0.009
Token temper
Token ID4124
Feature activation-1.84e-02

Pos logit contributions
 be+0.008
 have+0.008
 do+0.008
 find+0.007
 I+0.007
Neg logit contributions
rito-0.010
led-0.010
us-0.010
rol-0.010
rob-0.010
Token tant
Token ID24246
Feature activation-6.19e-02

Pos logit contributions
rob+0.036
rito+0.035
rol+0.035
at+0.035
ier+0.034
Neg logit contributions
 be-0.033
 have-0.032
 do-0.031
 find-0.029
 go-0.026
Tokenrums
Token ID45241
Feature activation-1.62e-03

Pos logit contributions
rol+0.088
rito+0.085
 casc+0.084
 kernels+0.082
side+0.081
Neg logit contributions
 be-0.133
 have-0.126
 do-0.124
 find-0.119
 learn-0.112
Token.
Token ID13
Feature activation+4.79e-03

Pos logit contributions
ier+0.004
rito+0.004
us+0.004
rol+0.004
led+0.004
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 see-0.002
Token One
Token ID1881
Feature activation+1.66e-02

Pos logit contributions
 be+0.006
 do+0.006
 have+0.006
 find+0.006
 go+0.005
Neg logit contributions
rol-0.011
rito-0.011
rob-0.011
us-0.011
at-0.011
Token day
Token ID1110
Feature activation+1.11e-02

Pos logit contributions
 be+0.026
 have+0.025
 do+0.024
 find+0.024
 see+0.021
Neg logit contributions
iest-0.035
rito-0.034
at-0.034
us-0.034
ier-0.034
Token,
Token ID11
Feature activation+7.41e-03

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.005
 I+0.004
Neg logit contributions
rito-0.036
rob-0.036
rol-0.035
at-0.035
ridge-0.035
Token them
Token ID606
Feature activation+6.04e-03

Pos logit contributions
 have+0.008
 be+0.008
 do+0.007
 find+0.007
 I+0.006
Neg logit contributions
us-0.012
rito-0.012
ier-0.012
ridge-0.012
led-0.012
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.006
 I+0.006
Neg logit contributions
rito-0.015
ier-0.014
rol-0.014
led-0.014
us-0.014
Token never
Token ID1239
Feature activation-7.20e-04

Pos logit contributions
 be+0.011
 have+0.011
 do+0.011
 find+0.009
 I+0.008
Neg logit contributions
ridge-0.023
led-0.023
rito-0.023
cream-0.022
 kernels-0.022
Token threw
Token ID9617
Feature activation+6.45e-03

Pos logit contributions
led+0.002
ier+0.002
rito+0.002
 cooled+0.001
ted+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 mom-0.001
Token a
Token ID257
Feature activation-8.05e-03

Pos logit contributions
 be+0.011
 do+0.011
 have+0.011
 find+0.010
 I+0.009
Neg logit contributions
rito-0.012
led-0.012
ier-0.012
rol-0.012
us-0.012
Token tant
Token ID24246
Feature activation-6.11e-02

Pos logit contributions
rito+0.016
us+0.015
iest+0.015
ier+0.015
at+0.015
Neg logit contributions
 be-0.014
 have-0.014
 do-0.014
 find-0.013
 I-0.012
Tokenrum
Token ID6582
Feature activation+1.50e-03

Pos logit contributions
rol+0.066
rito+0.064
ridge+0.062
 kernels+0.061
rob+0.061
Neg logit contributions
 be-0.157
 have-0.151
 do-0.148
 find-0.143
 learn-0.134
Token again
Token ID757
Feature activation+1.22e-02

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 see+0.001
Neg logit contributions
ier-0.004
led-0.004
rol-0.004
iest-0.004
us-0.004
Token.
Token ID13
Feature activation+2.18e-03

Pos logit contributions
 be+0.013
 have+0.013
 do+0.013
 find+0.011
 go+0.010
Neg logit contributions
ier-0.030
rito-0.030
us-0.029
iest-0.029
led-0.029
Token<|endoftext|>
Token ID50256
Feature activation-1.02e-02

Pos logit contributions
 do+0.003
 be+0.003
 have+0.003
 find+0.003
 go+0.003
Neg logit contributions
rol-0.004
us-0.004
rito-0.004
rily-0.004
ridge-0.004
TokenOnce
Token ID7454
Feature activation+7.15e-03

Pos logit contributions
rito+0.029
ier+0.029
rol+0.029
rob+0.029
led+0.029
Neg logit contributions
 be-0.009
 have-0.009
 do-0.008
 find-0.007
 I-0.005
Token.
Token ID13
Feature activation+4.09e-05

Pos logit contributions
rito+0.013
us+0.013
rob+0.013
at+0.013
rol+0.013
Neg logit contributions
 be-0.007
 have-0.007
 do-0.006
 find-0.005
 I-0.005
Token
Token ID198
Feature activation-7.51e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 I+0.000
Neg logit contributions
rol-0.000
rito-0.000
us-0.000
rob-0.000
at-0.000
Token
Token ID198
Feature activation-9.16e-03

Pos logit contributions
us+0.020
rito+0.020
rob+0.020
rol+0.020
at+0.020
Neg logit contributions
 be-0.008
 have-0.007
 do-0.007
 find-0.006
 I-0.005
TokenWhen
Token ID2215
Feature activation-4.18e-03

Pos logit contributions
rol+0.016
rito+0.016
us+0.016
rob+0.016
at+0.016
Neg logit contributions
 be-0.018
 have-0.018
 do-0.017
 find-0.016
 I-0.015
Token D
Token ID360
Feature activation-7.31e-03

Pos logit contributions
rito+0.010
rob+0.010
ier+0.010
rol+0.010
at+0.010
Neg logit contributions
 be-0.006
 have-0.006
 do-0.005
 find-0.005
 I-0.004
Tokenott
Token ID1252
Feature activation-6.02e-02

Pos logit contributions
rito+0.013
us+0.012
ier+0.012
rol+0.012
at+0.012
Neg logit contributions
 be-0.014
 have-0.014
 do-0.014
 find-0.013
 I-0.012
Tokenie
Token ID494
Feature activation-1.94e-02

Pos logit contributions
rol+0.123
 kernels+0.120
ridge+0.118
rito+0.118
ier+0.116
Neg logit contributions
 be-0.101
 have-0.095
 do-0.094
 find-0.094
 go-0.077
Token arrived
Token ID5284
Feature activation-1.59e-02

Pos logit contributions
rito+0.044
ier+0.044
rob+0.044
rol+0.044
iest+0.043
Neg logit contributions
 be-0.030
 have-0.029
 do-0.028
 find-0.026
 go-0.023
Token at
Token ID379
Feature activation+1.38e-02

Pos logit contributions
rito+0.043
rob+0.042
ier+0.042
rol+0.042
us+0.042
Neg logit contributions
 be-0.016
 have-0.016
 do-0.015
 find-0.013
 I-0.011
Token the
Token ID262
Feature activation-5.33e-03

Pos logit contributions
 be+0.015
 have+0.015
 do+0.014
 find+0.012
 I+0.011
Neg logit contributions
rito-0.036
rob-0.036
rol-0.035
us-0.035
ier-0.035
Token lab
Token ID2248
Feature activation-1.56e-02

Pos logit contributions
rito+0.012
rob+0.012
us+0.012
ier+0.012
rol+0.012
Neg logit contributions
 be-0.007
 have-0.007
 do-0.007
 find-0.006
 I-0.006
Token stubborn
Token ID24444
Feature activation+2.41e-03

Pos logit contributions
 be+0.010
 have+0.010
 do+0.009
 find+0.008
 I+0.007
Neg logit contributions
rito-0.022
ier-0.021
rol-0.021
rob-0.021
led-0.021
Token.
Token ID13
Feature activation+1.68e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.002
 find+0.002
 see+0.002
Neg logit contributions
ier-0.006
led-0.006
iest-0.006
us-0.006
rol-0.006
Token She
Token ID1375
Feature activation+1.55e-03

Pos logit contributions
 be+0.002
 have+0.002
 find+0.002
 do+0.002
 go+0.002
Neg logit contributions
rito-0.003
us-0.003
ted-0.003
rol-0.003
ier-0.003
Token throws
Token ID12542
Feature activation+3.30e-03

Pos logit contributions
 be+0.002
 have+0.002
 find+0.002
 do+0.002
 I+0.002
Neg logit contributions
led-0.003
at-0.003
wy-0.003
rob-0.003
ridge-0.003
Token a
Token ID257
Feature activation-5.99e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.003
 I+0.003
Neg logit contributions
rito-0.008
rob-0.008
rol-0.008
ier-0.008
us-0.008
Token tant
Token ID24246
Feature activation-5.93e-02

Pos logit contributions
rito+0.009
rob+0.009
ier+0.009
rol+0.009
us+0.009
Neg logit contributions
 be-0.013
 have-0.013
 do-0.013
 find-0.012
 I-0.011
Tokenrum
Token ID6582
Feature activation+3.00e-03

Pos logit contributions
rol+0.063
ridge+0.059
rito+0.058
rob+0.057
 kernels+0.055
Neg logit contributions
 be-0.156
 have-0.151
 do-0.150
 find-0.142
 learn-0.135
Token.
Token ID13
Feature activation+4.59e-03

Pos logit contributions
 have+0.004
 be+0.004
 do+0.003
 find+0.003
 see+0.003
Neg logit contributions
ier-0.008
led-0.008
us-0.007
ted-0.007
rol-0.007
Token She
Token ID1375
Feature activation+2.36e-03

Pos logit contributions
 be+0.006
 have+0.005
 do+0.005
 find+0.005
 go+0.004
Neg logit contributions
rito-0.011
rol-0.011
us-0.011
ier-0.010
ted-0.010
Token kicks
Token ID17364
Feature activation+5.21e-03

Pos logit contributions
 be+0.004
 have+0.003
 do+0.003
 find+0.003
 I+0.003
Neg logit contributions
led-0.005
rob-0.005
ted-0.005
ridge-0.005
at-0.005
Token and
Token ID290
Feature activation-4.61e-03

Pos logit contributions
 be+0.006
 have+0.006
 do+0.005
 find+0.005
 I+0.004
Neg logit contributions
rito-0.013
ier-0.013
rob-0.013
led-0.013
rol-0.013
TokenOne
Token ID3198
Feature activation+9.94e-03

Pos logit contributions
rol+0.018
rob+0.018
us+0.018
iest+0.018
rito+0.018
Neg logit contributions
 have-0.018
 be-0.018
 do-0.017
 find-0.017
 face-0.016
Token day
Token ID1110
Feature activation+2.36e-03

Pos logit contributions
 be+0.016
 have+0.016
 do+0.015
 find+0.015
 I+0.013
Neg logit contributions
iest-0.020
rito-0.020
at-0.020
us-0.019
led-0.019
Token,
Token ID11
Feature activation+4.34e-04

Pos logit contributions
 be+0.002
 have+0.001
 do+0.001
 find+0.001
 I+0.001
Neg logit contributions
rito-0.007
rob-0.007
ier-0.007
rol-0.007
us-0.007
Token when
Token ID618
Feature activation+3.05e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 I+0.000
Neg logit contributions
rito-0.001
rol-0.001
ier-0.001
iest-0.001
rob-0.001
Token Br
Token ID1709
Feature activation-1.44e-02

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.004
 learn+0.003
Neg logit contributions
iest-0.006
ted-0.006
rol-0.006
us-0.006
rob-0.006
Tokenidget
Token ID17484
Feature activation-5.24e-02

Pos logit contributions
rob+0.016
rito+0.015
 kernels+0.015
ridge+0.015
iest+0.015
Neg logit contributions
 be-0.038
 do-0.037
 have-0.036
 find-0.035
 go-0.033
Token was
Token ID373
Feature activation-6.79e-03

Pos logit contributions
rito+0.134
rol+0.134
rob+0.134
ier+0.133
us+0.131
Neg logit contributions
 be-0.063
 have-0.061
 do-0.058
 find-0.053
 go-0.044
Token walking
Token ID6155
Feature activation-1.35e-02

Pos logit contributions
rito+0.015
rol+0.015
ier+0.015
rob+0.015
us+0.015
Neg logit contributions
 be-0.010
 have-0.010
 do-0.009
 find-0.009
 I-0.008
Token in
Token ID287
Feature activation+8.31e-03

Pos logit contributions
rito+0.029
ier+0.029
rol+0.029
rob+0.029
iest+0.029
Neg logit contributions
 be-0.021
 have-0.020
 do-0.019
 find-0.018
 I-0.016
Token the
Token ID262
Feature activation-5.13e-03

Pos logit contributions
 be+0.011
 do+0.010
 have+0.010
 find+0.009
 I+0.009
Neg logit contributions
rob-0.017
rito-0.017
iest-0.017
rol-0.017
ted-0.017
Token me
Token ID502
Feature activation-8.32e-03

Pos logit contributions
rob+0.010
at+0.009
rol+0.009
side+0.009
rito+0.009
Neg logit contributions
 have-0.009
 be-0.009
 do-0.009
 find-0.008
 want-0.007
Token the
Token ID262
Feature activation+5.04e-03

Pos logit contributions
 be+0.008
 have+0.008
 do+0.008
 find+0.007
 go+0.007
Neg logit contributions
rito-0.011
rob-0.011
ier-0.011
rol-0.011
at-0.011
Token elderly
Token ID13830
Feature activation+3.64e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.007
 find+0.006
 I+0.006
Neg logit contributions
rito-0.011
ier-0.011
rob-0.011
iest-0.011
us-0.011
Token man
Token ID582
Feature activation+7.18e-03

Pos logit contributions
 be+0.006
 have+0.005
 do+0.005
 find+0.005
 I+0.004
Neg logit contributions
ier-0.008
rob-0.008
iest-0.008
rito-0.008
us-0.008
Token on
Token ID319
Feature activation+9.17e-05

Pos logit contributions
 be+0.012
 have+0.011
 do+0.011
 find+0.010
 I+0.009
Neg logit contributions
rito-0.015
rol-0.015
led-0.015
ier-0.015
at-0.015
Token a
Token ID257
Feature activation-6.00e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 go+0.000
Neg logit contributions
rito-0.000
rob-0.000
ier-0.000
rol-0.000
led-0.000
Token stret
Token ID5759
Feature activation-5.16e-02

Pos logit contributions
rito+0.005
us+0.005
 kernels+0.004
rob+0.004
rily+0.004
Neg logit contributions
 have-0.017
 be-0.016
 do-0.016
 find-0.015
 I-0.015
Tokencher
Token ID2044
Feature activation-7.62e-03

Pos logit contributions
rito+0.041
rob+0.039
rol+0.039
at+0.038
ier+0.037
Neg logit contributions
 be-0.152
 have-0.151
 do-0.147
 find-0.142
 I-0.134
Token.
Token ID13
Feature activation+2.66e-03

Pos logit contributions
rito+0.021
ier+0.021
rol+0.020
us+0.020
at+0.020
Neg logit contributions
 be-0.007
 have-0.007
 do-0.007
 find-0.006
 I-0.005
Token She
Token ID1375
Feature activation+2.30e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 go+0.002
Neg logit contributions
rito-0.006
rol-0.006
rob-0.006
us-0.006
ier-0.006
Token thanked
Token ID26280
Feature activation-6.43e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 I+0.003
Neg logit contributions
rito-0.005
rob-0.005
rol-0.005
ier-0.005
us-0.005
Token Tim
Token ID5045
Feature activation-2.60e-03

Pos logit contributions
rob+0.014
rito+0.014
rol+0.014
us+0.014
ier+0.014
Neg logit contributions
 be-0.010
 have-0.010
 do-0.009
 find-0.009
 I-0.008
Token They
Token ID1119
Feature activation+2.97e-03

Pos logit contributions
 be+0.014
 do+0.013
 have+0.013
 find+0.012
 go+0.011
Neg logit contributions
us-0.025
rob-0.024
led-0.024
rito-0.024
rol-0.024
Token put
Token ID1234
Feature activation+9.07e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.004
 I+0.003
Neg logit contributions
rito-0.007
rob-0.007
rol-0.006
ier-0.006
us-0.006
Token Lily
Token ID20037
Feature activation+1.17e-02

Pos logit contributions
 be+0.014
 have+0.014
 do+0.013
 find+0.012
 I+0.011
Neg logit contributions
rito-0.020
rob-0.020
rol-0.020
ier-0.020
us-0.020
Token on
Token ID319
Feature activation-2.82e-03

Pos logit contributions
 be+0.016
 have+0.015
 do+0.015
 find+0.014
 I+0.012
Neg logit contributions
rito-0.028
rol-0.027
rob-0.027
ier-0.027
led-0.027
Token a
Token ID257
Feature activation-6.88e-03

Pos logit contributions
rito+0.008
rob+0.007
rol+0.007
ier+0.007
at+0.007
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 see-0.002
Token stret
Token ID5759
Feature activation-5.03e-02

Pos logit contributions
rito+0.009
rob+0.008
us+0.008
rily+0.008
 kernels+0.008
Neg logit contributions
 be-0.017
 have-0.017
 do-0.016
 find-0.015
 I-0.015
Tokencher
Token ID2044
Feature activation-5.36e-03

Pos logit contributions
rito+0.040
rob+0.039
rol+0.039
at+0.038
ier+0.037
Neg logit contributions
 be-0.148
 have-0.146
 do-0.143
 find-0.138
 see-0.130
Token and
Token ID290
Feature activation-2.43e-03

Pos logit contributions
rito+0.014
ier+0.014
rol+0.014
us+0.014
led+0.014
Neg logit contributions
 be-0.006
 have-0.005
 do-0.005
 find-0.005
 I-0.004
Token carried
Token ID5281
Feature activation-6.01e-03

Pos logit contributions
rito+0.005
led+0.005
rob+0.005
ier+0.005
us+0.005
Neg logit contributions
 be-0.004
 have-0.003
 do-0.003
 find-0.003
 I-0.003
Token her
Token ID607
Feature activation+4.06e-03

Pos logit contributions
rol+0.013
rito+0.013
led+0.013
us+0.013
ier+0.012
Neg logit contributions
 be-0.009
 do-0.009
 have-0.009
 find-0.008
 I-0.007
Token to
Token ID284
Feature activation+7.80e-04

Pos logit contributions
 be+0.006
 have+0.006
 do+0.006
 find+0.005
 I+0.005
Neg logit contributions
rito-0.009
rob-0.009
rol-0.009
ier-0.009
us-0.009
Token
Token ID198
Feature activation-3.81e-04

Pos logit contributions
 be+0.007
 have+0.007
 do+0.007
 find+0.006
 go+0.005
Neg logit contributions
rol-0.015
rito-0.015
us-0.014
rob-0.014
ier-0.014
Token
Token ID198
Feature activation-1.52e-03

Pos logit contributions
us+0.001
rol+0.001
rito+0.001
rob+0.001
at+0.001
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 I-0.000
TokenThe
Token ID464
Feature activation+3.21e-03

Pos logit contributions
rol+0.003
us+0.003
rito+0.003
ier+0.003
iest+0.003
Neg logit contributions
 be-0.003
 have-0.003
 do-0.002
 find-0.002
 I-0.002
Token new
Token ID649
Feature activation-1.35e-02

Pos logit contributions
 have+0.004
 be+0.004
 do+0.004
 find+0.004
 see+0.004
Neg logit contributions
iest-0.007
ier-0.007
rito-0.007
led-0.006
 kernels-0.006
Token light
Token ID1657
Feature activation-2.11e-03

Pos logit contributions
iest+0.027
ier+0.026
rito+0.026
led+0.024
rol+0.024
Neg logit contributions
 have-0.021
 do-0.020
 be-0.020
 find-0.018
 I-0.017
Tokenbul
Token ID15065
Feature activation-4.92e-02

Pos logit contributions
rito+0.004
ier+0.004
led+0.003
rol+0.003
iest+0.003
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.004
 I-0.004
Tokenb
Token ID65
Feature activation-9.51e-03

Pos logit contributions
rito+0.051
rob+0.050
rol+0.049
ier+0.049
us+0.048
Neg logit contributions
 be-0.133
 have-0.131
 do-0.128
 find-0.123
 I-0.115
Token was
Token ID373
Feature activation-6.93e-03

Pos logit contributions
rito+0.013
rob+0.012
rol+0.012
ier+0.012
us+0.012
Neg logit contributions
 be-0.023
 have-0.023
 do-0.022
 find-0.021
 I-0.020
Token much
Token ID881
Feature activation+5.89e-03

Pos logit contributions
rob+0.018
us+0.017
rito+0.017
rol+0.017
at+0.017
Neg logit contributions
 be-0.009
 have-0.009
 do-0.008
 find-0.008
 go-0.007
Token bigger
Token ID5749
Feature activation-7.33e-03

Pos logit contributions
 be+0.011
 have+0.010
 do+0.010
 find+0.010
 I+0.009
Neg logit contributions
rito-0.011
ier-0.011
rob-0.011
rol-0.011
led-0.011
Token and
Token ID290
Feature activation-2.79e-03

Pos logit contributions
ier+0.017
rito+0.017
rob+0.017
iest+0.016
us+0.016
Neg logit contributions
 be-0.010
 have-0.010
 do-0.010
 find-0.009
 I-0.008
Token shouted
Token ID17293
Feature activation+9.74e-03

Pos logit contributions
led+0.007
rol+0.007
ier+0.007
rob+0.007
rito+0.007
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.004
 I-0.003
Token.
Token ID13
Feature activation-8.75e-03

Pos logit contributions
 have+0.010
 be+0.010
 do+0.010
 find+0.008
 I+0.007
Neg logit contributions
ier-0.025
led-0.025
rito-0.025
rol-0.025
us-0.025
Token He
Token ID679
Feature activation-6.23e-03

Pos logit contributions
rito+0.022
rol+0.022
rob+0.022
us+0.022
ier+0.022
Neg logit contributions
 be-0.010
 have-0.009
 do-0.009
 find-0.008
 go-0.007
Token threw
Token ID9617
Feature activation+4.13e-03

Pos logit contributions
rito+0.014
rob+0.014
rol+0.014
ier+0.014
us+0.014
Neg logit contributions
 be-0.009
 have-0.009
 do-0.008
 find-0.008
 I-0.007
Token a
Token ID257
Feature activation-9.53e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.007
 find+0.007
 I+0.006
Neg logit contributions
rol-0.008
led-0.008
rito-0.008
rob-0.008
ier-0.008
Token snowball
Token ID46275
Feature activation-4.65e-02

Pos logit contributions
rito+0.022
rob+0.022
us+0.022
ier+0.022
iest+0.022
Neg logit contributions
 be-0.013
 have-0.013
 do-0.012
 find-0.012
 I-0.010
Token at
Token ID379
Feature activation+1.53e-02

Pos logit contributions
rito+0.114
rob+0.112
us+0.111
rol+0.111
at+0.111
Neg logit contributions
 be-0.063
 have-0.060
 do-0.058
 find-0.052
 go-0.046
Token the
Token ID262
Feature activation-1.36e-02

Pos logit contributions
 be+0.020
 have+0.019
 do+0.018
 find+0.017
 see+0.014
Neg logit contributions
rito-0.039
rob-0.038
rol-0.038
ier-0.038
ridge-0.038
Token dog
Token ID3290
Feature activation-2.19e-03

Pos logit contributions
rob+0.027
rito+0.027
rol+0.026
 kernels+0.026
ier+0.026
Neg logit contributions
 have-0.021
 be-0.021
 do-0.019
 find-0.019
 see-0.017
Token.
Token ID13
Feature activation-5.19e-03

Pos logit contributions
rol+0.005
us+0.005
at+0.005
rito+0.005
ier+0.005
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 see-0.002
Token The
Token ID383
Feature activation+1.58e-03

Pos logit contributions
rito+0.013
rob+0.013
rol+0.013
us+0.013
ier+0.013
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.005
 go-0.004
Token on
Token ID319
Feature activation+6.93e-03

Pos logit contributions
 be+0.016
 have+0.016
 do+0.015
 find+0.013
 go+0.011
Neg logit contributions
rob-0.035
rito-0.035
rol-0.035
at-0.034
ier-0.034
Token the
Token ID262
Feature activation-1.51e-03

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.006
 go+0.005
Neg logit contributions
rito-0.018
rob-0.018
ier-0.018
rol-0.018
us-0.018
Token soup
Token ID17141
Feature activation-6.71e-05

Pos logit contributions
rob+0.003
rol+0.003
rito+0.003
ier+0.003
us+0.003
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 go-0.003
Token to
Token ID284
Feature activation+1.51e-02

Pos logit contributions
rob+0.000
rol+0.000
rito+0.000
us+0.000
iest+0.000
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 go-0.000
Token make
Token ID787
Feature activation+1.42e-02

Pos logit contributions
 be+0.016
 have+0.015
 do+0.015
 find+0.013
 I+0.013
Neg logit contributions
rito-0.038
at-0.036
ridge-0.036
 kernels-0.036
ier-0.036
Token it
Token ID340
Feature activation+1.70e-02

Pos logit contributions
 be+0.019
 have+0.018
 do+0.017
 find+0.016
 I+0.014
Neg logit contributions
rito-0.034
us-0.033
led-0.033
ier-0.033
rol-0.033
Token cooler
Token ID19346
Feature activation-2.01e-03

Pos logit contributions
 be+0.021
 have+0.020
 do+0.020
 find+0.018
 I+0.017
Neg logit contributions
rito-0.040
ier-0.040
led-0.040
at-0.039
rol-0.039
Token.
Token ID13
Feature activation+2.40e-03

Pos logit contributions
ier+0.005
rito+0.005
led+0.005
us+0.005
rol+0.005
Neg logit contributions
 be-0.002
 have-0.002
 do-0.002
 find-0.002
 I-0.001
Token They
Token ID1119
Feature activation+7.58e-03

Pos logit contributions
 be+0.003
 have+0.002
 do+0.002
 find+0.002
 go+0.002
Neg logit contributions
rito-0.006
us-0.006
rob-0.006
rol-0.006
ier-0.006
Token tasted
Token ID29187
Feature activation+8.54e-04

Pos logit contributions
 be+0.012
 have+0.012
 do+0.012
 find+0.011
 I+0.010
Neg logit contributions
rito-0.016
rob-0.016
ier-0.016
rol-0.016
us-0.016
Token the
Token ID262
Feature activation+7.91e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 I+0.001
Neg logit contributions
rito-0.002
ier-0.002
rob-0.002
rol-0.002
us-0.002
Token Its
Token ID6363
Feature activation-5.91e-03

Pos logit contributions
 be+0.006
 do+0.006
 have+0.006
 find+0.006
 go+0.005
Neg logit contributions
rol-0.010
rito-0.010
us-0.010
ridge-0.010
ier-0.010
Token bus
Token ID1323
Feature activation+6.78e-03

Pos logit contributions
rito+0.013
led+0.013
ier+0.012
iest+0.012
at+0.012
Neg logit contributions
 have-0.009
 be-0.009
 do-0.008
 find-0.008
 I-0.007
Tokenier
Token ID959
Feature activation-7.93e-03

Pos logit contributions
 have+0.011
 be+0.011
 do+0.010
 find+0.010
 I+0.009
Neg logit contributions
ier-0.014
led-0.014
rol-0.014
iest-0.013
rito-0.013
Token days
Token ID1528
Feature activation+1.72e-02

Pos logit contributions
ier+0.016
rito+0.016
led+0.016
rol+0.016
iest+0.016
Neg logit contributions
 be-0.013
 have-0.012
 do-0.012
 find-0.011
 I-0.010
Token were
Token ID547
Feature activation-5.12e-03

Pos logit contributions
 be+0.026
 have+0.026
 do+0.025
 find+0.023
 I+0.020
Neg logit contributions
rito-0.038
rob-0.038
rol-0.037
us-0.037
ier-0.037
Token yet
Token ID1865
Feature activation+1.58e-02

Pos logit contributions
rito+0.012
rob+0.012
rol+0.012
ier+0.012
us+0.012
Neg logit contributions
 be-0.007
 have-0.007
 do-0.007
 find-0.006
 I-0.006
Token to
Token ID284
Feature activation+2.60e-03

Pos logit contributions
 have+0.023
 be+0.022
 do+0.022
 find+0.021
 I+0.019
Neg logit contributions
ier-0.033
iest-0.032
rol-0.032
rito-0.031
led-0.031
Token come
Token ID1282
Feature activation-7.02e-03

Pos logit contributions
 have+0.002
 be+0.002
 do+0.002
 I+0.002
 find+0.002
Neg logit contributions
rito-0.006
ier-0.006
iest-0.006
at-0.006
zzy-0.006
Token!
Token ID0
Feature activation+4.39e-03

Pos logit contributions
ier+0.012
rol+0.012
rito+0.012
iest+0.011
us+0.011
Neg logit contributions
 have-0.012
 do-0.012
 be-0.012
 find-0.011
 I-0.011
Token
Token ID198
Feature activation-7.26e-04

Pos logit contributions
 be+0.006
 do+0.006
 have+0.005
 find+0.005
 go+0.004
Neg logit contributions
rol-0.010
us-0.010
rito-0.010
ted-0.009
rily-0.009
Token
Token ID198
Feature activation-1.92e-03

Pos logit contributions
us+0.002
rito+0.002
rol+0.002
rob+0.002
at+0.002
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 go-0.001
Token sun
Token ID4252
Feature activation+1.05e-02

Pos logit contributions
 have+0.003
 be+0.003
 do+0.003
 find+0.003
 learn+0.002
Neg logit contributions
iest-0.004
rito-0.004
us-0.004
 kernels-0.004
led-0.004
Token was
Token ID373
Feature activation-6.97e-03

Pos logit contributions
 be+0.011
 have+0.010
 do+0.009
 find+0.008
 go+0.007
Neg logit contributions
rob-0.031
rol-0.030
ier-0.030
rito-0.030
us-0.030
Token shining
Token ID22751
Feature activation-9.03e-03

Pos logit contributions
rob+0.019
rito+0.018
ridge+0.018
us+0.018
ier+0.017
Neg logit contributions
 be-0.010
 have-0.009
 do-0.009
 go-0.007
 find-0.007
Token and
Token ID290
Feature activation+3.34e-03

Pos logit contributions
rob+0.029
rito+0.029
rol+0.028
ridge+0.028
us+0.028
Neg logit contributions
 be-0.007
 do-0.006
 have-0.005
 go-0.004
 find-0.004
Token the
Token ID262
Feature activation+7.66e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 I+0.002
Neg logit contributions
rito-0.009
rob-0.009
rol-0.009
ier-0.009
us-0.009
Token sky
Token ID6766
Feature activation+1.82e-02

Pos logit contributions
 be+0.012
 have+0.012
 do+0.012
 find+0.011
 I+0.010
Neg logit contributions
rito-0.016
rob-0.016
iest-0.015
us-0.015
led-0.015
Token was
Token ID373
Feature activation-1.52e-04

Pos logit contributions
 be+0.020
 have+0.019
 do+0.018
 find+0.015
 go+0.013
Neg logit contributions
rob-0.051
rito-0.050
rol-0.050
us-0.050
ier-0.049
Token blue
Token ID4171
Feature activation+1.01e-02

Pos logit contributions
rob+0.000
rito+0.000
ridge+0.000
us+0.000
at+0.000
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 go-0.000
Token.
Token ID13
Feature activation+9.15e-05

Pos logit contributions
 be+0.008
 have+0.007
 do+0.006
 find+0.005
 go+0.004
Neg logit contributions
rito-0.031
rob-0.031
us-0.031
rol-0.031
ridge-0.030
Token
Token ID198
Feature activation-3.16e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 go+0.000
Neg logit contributions
rob-0.000
rito-0.000
rol-0.000
us-0.000
led-0.000
Token
Token ID198
Feature activation-4.66e-03

Pos logit contributions
us+0.008
rito+0.008
rob+0.008
at+0.008
rol+0.008
Neg logit contributions
 be-0.004
 have-0.003
 do-0.003
 find-0.003
 I-0.003
Token They
Token ID1119
Feature activation+4.30e-03

Pos logit contributions
 be+0.003
 do+0.003
 have+0.003
 find+0.002
 go+0.002
Neg logit contributions
us-0.005
rito-0.005
rob-0.005
rol-0.005
led-0.005
Token invited
Token ID9392
Feature activation+4.27e-03

Pos logit contributions
 be+0.006
 have+0.006
 do+0.006
 find+0.005
 I+0.005
Neg logit contributions
led-0.009
ier-0.009
rob-0.009
rito-0.009
rol-0.009
Token them
Token ID606
Feature activation+8.88e-03

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.007
 I+0.006
Neg logit contributions
led-0.008
ier-0.008
rol-0.008
ted-0.008
us-0.008
Token to
Token ID284
Feature activation+3.38e-03

Pos logit contributions
 be+0.012
 have+0.012
 do+0.011
 find+0.011
 go+0.009
Neg logit contributions
rob-0.022
rol-0.021
rito-0.021
iest-0.021
ier-0.021
Token play
Token ID711
Feature activation-1.06e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 I+0.003
 find+0.003
Neg logit contributions
rito-0.008
at-0.008
rol-0.008
ridge-0.007
us-0.007
Token again
Token ID757
Feature activation+1.49e-02

Pos logit contributions
led+0.003
rob+0.002
rito+0.002
ier+0.002
us+0.002
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token.
Token ID13
Feature activation-2.27e-05

Pos logit contributions
 be+0.018
 have+0.017
 do+0.017
 find+0.015
 I+0.014
Neg logit contributions
us-0.035
rito-0.035
ier-0.035
iest-0.034
led-0.034
Token
Token ID198
Feature activation+1.22e-03

Pos logit contributions
rito+0.000
rol+0.000
us+0.000
rob+0.000
ted+0.000
Neg logit contributions
 be-0.000
 do-0.000
 have-0.000
 find-0.000
 go-0.000
Token
Token ID198
Feature activation+4.02e-04

Pos logit contributions
 be+0.002
 do+0.002
 have+0.001
 find+0.001
 I+0.001
Neg logit contributions
us-0.003
rob-0.003
rito-0.003
rol-0.003
at-0.003
TokenAnna
Token ID31160
Feature activation+4.77e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 look+0.001
Neg logit contributions
us-0.001
rob-0.001
rito-0.001
rol-0.001
at-0.001
Token and
Token ID290
Feature activation-2.51e-03

Pos logit contributions
 be+0.005
 have+0.005
 do+0.005
 find+0.004
 I+0.004
Neg logit contributions
rito-0.012
led-0.012
rob-0.012
ier-0.012
us-0.012
Token at
Token ID379
Feature activation+1.25e-02

Pos logit contributions
rito+0.006
led+0.006
ier+0.006
rol+0.006
rob+0.006
Neg logit contributions
 be-0.004
 have-0.004
 do-0.003
 find-0.003
 I-0.003
Token the
Token ID262
Feature activation+4.41e-03

Pos logit contributions
 be+0.015
 have+0.014
 do+0.014
 find+0.013
 I+0.011
Neg logit contributions
rob-0.031
rito-0.030
led-0.030
iest-0.030
us-0.030
Token pe
Token ID613
Feature activation-1.23e-02

Pos logit contributions
 have+0.008
 be+0.008
 do+0.008
 find+0.008
 I+0.007
Neg logit contributions
iest-0.008
rito-0.007
ier-0.007
rob-0.007
us-0.007
Tokenb
Token ID65
Feature activation-1.49e-03

Pos logit contributions
rito+0.011
at+0.011
ier+0.011
us+0.011
rob+0.011
Neg logit contributions
 be-0.035
 have-0.035
 do-0.034
 find-0.033
 I-0.031
Tokenble
Token ID903
Feature activation+4.08e-03

Pos logit contributions
rob+0.002
rito+0.002
rol+0.002
led+0.002
ridge+0.002
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 go-0.003
Token and
Token ID290
Feature activation+1.39e-02

Pos logit contributions
 have+0.005
 be+0.005
 do+0.005
 find+0.005
 see+0.004
Neg logit contributions
ier-0.010
led-0.009
 kernels-0.009
rily-0.009
iest-0.009
Token smiled
Token ID13541
Feature activation-5.47e-04

Pos logit contributions
 have+0.022
 be+0.021
 do+0.020
 find+0.020
 go+0.018
Neg logit contributions
led-0.029
ted-0.027
rob-0.027
us-0.027
ier-0.026
Token.
Token ID13
Feature activation+6.44e-03

Pos logit contributions
ier+0.002
rito+0.002
led+0.001
us+0.001
rol+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.000
 find-0.000
 I-0.000
Token He
Token ID679
Feature activation+2.37e-04

Pos logit contributions
 be+0.011
 have+0.011
 do+0.011
 find+0.010
 see+0.009
Neg logit contributions
rito-0.012
rol-0.012
rily-0.012
us-0.012
ier-0.012
Token said
Token ID531
Feature activation-2.63e-03

Pos logit contributions
 have+0.000
 be+0.000
 do+0.000
 find+0.000
 learn+0.000
Neg logit contributions
led-0.000
ted-0.000
us-0.000
cream-0.000
rob-0.000
Token,
Token ID11
Feature activation-9.01e-03

Pos logit contributions
ier+0.007
iest+0.007
us+0.007
led+0.007
rily+0.007
Neg logit contributions
 have-0.002
 be-0.002
 find-0.002
 do-0.002
 learn-0.002
Token
Token ID198
Feature activation+7.03e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 I+0.001
Neg logit contributions
rito-0.004
rob-0.004
ier-0.004
at-0.004
rol-0.004
Token
Token ID198
Feature activation+5.84e-03

Pos logit contributions
 be+0.008
 have+0.008
 do+0.008
 find+0.007
 go+0.006
Neg logit contributions
us-0.018
rito-0.018
rob-0.018
rol-0.017
at-0.017
Token"
Token ID1
Feature activation+2.37e-03

Pos logit contributions
 be+0.012
 have+0.012
 do+0.012
 find+0.011
 go+0.010
Neg logit contributions
rito-0.009
us-0.009
rob-0.009
rol-0.009
ier-0.009
TokenWow
Token ID22017
Feature activation-2.00e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.003
 I+0.003
Neg logit contributions
rito-0.005
rol-0.005
at-0.005
led-0.005
ier-0.005
Token!"
Token ID2474
Feature activation-1.03e-02

Pos logit contributions
ier+0.005
iest+0.005
led+0.005
at+0.005
rily+0.005
Neg logit contributions
 be-0.002
 have-0.002
 find-0.002
 do-0.002
 learn-0.002
Token said
Token ID531
Feature activation-3.18e-03

Pos logit contributions
rito+0.024
led+0.024
rob+0.024
us+0.023
rol+0.023
Neg logit contributions
 be-0.014
 have-0.013
 do-0.013
 find-0.012
 go-0.010
Token Lucy
Token ID22162
Feature activation-5.41e-03

Pos logit contributions
rob+0.008
rito+0.008
at+0.008
ier+0.008
rol+0.008
Neg logit contributions
 be-0.005
 have-0.004
 do-0.004
 find-0.004
 I-0.004
Token.
Token ID13
Feature activation-1.48e-03

Pos logit contributions
rob+0.016
rito+0.016
rol+0.015
ier+0.015
us+0.015
Neg logit contributions
 be-0.005
 have-0.004
 do-0.004
 find-0.004
 I-0.003
Token
Token ID198
Feature activation+3.47e-03

Pos logit contributions
rito+0.004
rol+0.004
us+0.004
rob+0.004
ier+0.004
Neg logit contributions
 be-0.002
 have-0.001
 do-0.001
 find-0.001
 go-0.001
Token
Token ID198
Feature activation+3.10e-03

Pos logit contributions
 be+0.004
 have+0.004
 do+0.004
 find+0.003
 go+0.003
Neg logit contributions
us-0.009
rito-0.009
rob-0.009
rol-0.009
at-0.008
TokenM
Token ID44
Feature activation+9.54e-03

Pos logit contributions
 have+0.005
 be+0.005
 do+0.005
 find+0.004
 go+0.004
Neg logit contributions
us-0.007
rito-0.007
rob-0.007
rol-0.007
ier-0.006
Token little
Token ID1310
Feature activation+3.10e-03

Pos logit contributions
 kernels+0.003
us+0.003
rito+0.003
iest+0.003
rob+0.003
Neg logit contributions
 have-0.002
 be-0.002
 find-0.002
 learn-0.002
 do-0.002
Token girl
Token ID2576
Feature activation+6.98e-03

Pos logit contributions
 have+0.005
 be+0.005
 find+0.005
 do+0.005
 learn+0.005
Neg logit contributions
 kernels-0.006
iest-0.006
rito-0.006
ier-0.005
us-0.005
Token named
Token ID3706
Feature activation-1.17e-02

Pos logit contributions
 do+0.009
 be+0.009
 have+0.009
 find+0.008
 learn+0.008
Neg logit contributions
us-0.015
led-0.014
med-0.014
rito-0.014
at-0.014
Token Ruby
Token ID10888
Feature activation-1.24e-03

Pos logit contributions
rol+0.025
us+0.024
rito+0.023
ier+0.023
led+0.023
Neg logit contributions
 be-0.017
 do-0.017
 have-0.015
 wish-0.014
 find-0.014
Token.
Token ID13
Feature activation-4.45e-03

Pos logit contributions
us+0.003
rito+0.003
led+0.003
ier+0.003
at+0.003
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token She
Token ID1375
Feature activation-7.58e-03

Pos logit contributions
rily+0.008
rol+0.008
ted+0.008
 asleep+0.008
ridge+0.008
Neg logit contributions
 be-0.007
 do-0.007
 have-0.007
 find-0.007
 go-0.006
Token had
Token ID550
Feature activation-1.99e-03

Pos logit contributions
led+0.015
at+0.014
zzy+0.014
tto+0.014
us+0.014
Neg logit contributions
 have-0.011
 be-0.010
 do-0.010
 find-0.010
 I-0.010
Token a
Token ID257
Feature activation+1.87e-03

Pos logit contributions
rol+0.005
rito+0.005
ridge+0.005
us+0.005
rob+0.005
Neg logit contributions
 be-0.002
 do-0.002
 have-0.002
 find-0.002
 want-0.002
Token red
Token ID2266
Feature activation-1.91e-03

Pos logit contributions
 have+0.003
 be+0.002
 do+0.002
 find+0.002
 I+0.002
Neg logit contributions
rito-0.004
 kernels-0.004
iest-0.004
ridge-0.004
rob-0.004
Token bike
Token ID7161
Feature activation+1.67e-02

Pos logit contributions
 kernels+0.004
iest+0.004
at+0.004
rito+0.004
ted+0.004
Neg logit contributions
 have-0.003
 do-0.002
 find-0.002
 be-0.002
 see-0.002
Token with
Token ID351
Feature activation-4.89e-03

Pos logit contributions
 be+0.022
 have+0.022
 find+0.021
 do+0.021
 wish+0.020
Neg logit contributions
iest-0.034
side-0.034
ier-0.034
rol-0.033
rob-0.033
Token so
Token ID523
Feature activation+2.90e-03

Pos logit contributions
rito+0.000
rob+0.000
rol+0.000
ier+0.000
us+0.000
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 I-0.000
Token proud
Token ID6613
Feature activation+1.38e-02

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 I+0.002
Neg logit contributions
rito-0.008
rol-0.008
ier-0.008
us-0.008
led-0.008
Token of
Token ID286
Feature activation-2.26e-03

Pos logit contributions
 be+0.019
 have+0.019
 do+0.018
 find+0.017
 I+0.014
Neg logit contributions
rito-0.032
rob-0.032
rol-0.032
ier-0.032
us-0.032
Token himself
Token ID2241
Feature activation+5.90e-04

Pos logit contributions
ier+0.003
us+0.003
rob+0.003
iest+0.003
rito+0.003
Neg logit contributions
 be-0.005
 have-0.005
 do-0.005
 find-0.005
 I-0.004
Token for
Token ID329
Feature activation-2.88e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 see+0.001
Neg logit contributions
ier-0.001
us-0.001
iest-0.001
led-0.001
rito-0.001
Token running
Token ID2491
Feature activation-4.93e-03

Pos logit contributions
rob+0.006
ier+0.006
us+0.006
rito+0.006
rol+0.006
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.004
 I-0.003
Token so
Token ID523
Feature activation-4.17e-03

Pos logit contributions
rol+0.011
ier+0.011
rob+0.011
rito+0.011
led+0.011
Neg logit contributions
 be-0.007
 have-0.007
 do-0.007
 find-0.006
 I-0.006
Token quickly
Token ID2952
Feature activation-4.98e-04

Pos logit contributions
ier+0.009
us+0.009
led+0.009
iest+0.009
rito+0.009
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.006
 go-0.005
Token around
Token ID1088
Feature activation+5.15e-03

Pos logit contributions
ier+0.001
led+0.001
iest+0.001
rito+0.001
us+0.001
Neg logit contributions
 have-0.001
 be-0.001
 do-0.001
 find-0.001
 see-0.000
Token the
Token ID262
Feature activation-8.33e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.006
 find+0.006
 I+0.005
Neg logit contributions
rob-0.012
ier-0.012
us-0.012
rol-0.012
iest-0.012
Token pit
Token ID6028
Feature activation-6.79e-03

Pos logit contributions
at+0.013
rito+0.013
rol+0.013
iest+0.013
us+0.013
Neg logit contributions
 be-0.014
 have-0.014
 do-0.014
 I-0.013
 find-0.013
Token her
Token ID607
Feature activation+6.09e-03

Pos logit contributions
ier+0.015
rito+0.014
us+0.014
rol+0.014
led+0.014
Neg logit contributions
 have-0.011
 be-0.011
 do-0.011
 find-0.010
 I-0.009
Token knee
Token ID10329
Feature activation-3.72e-03

Pos logit contributions
 have+0.009
 be+0.009
 do+0.009
 find+0.008
 I+0.008
Neg logit contributions
rito-0.013
led-0.013
ier-0.013
iest-0.012
at-0.012
Token.
Token ID13
Feature activation+3.95e-03

Pos logit contributions
ier+0.010
rito+0.010
led+0.010
rol+0.010
at+0.010
Neg logit contributions
 be-0.004
 have-0.004
 do-0.003
 find-0.003
 I-0.003
Token She
Token ID1375
Feature activation-5.65e-04

Pos logit contributions
 be+0.006
 do+0.006
 have+0.006
 find+0.005
 want+0.005
Neg logit contributions
ted-0.008
rol-0.008
rito-0.007
ridge-0.007
covered-0.007
Token cried
Token ID16896
Feature activation+5.22e-03

Pos logit contributions
led+0.001
rito+0.001
rob+0.001
us+0.001
ier+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token and
Token ID290
Feature activation-6.85e-03

Pos logit contributions
 be+0.005
 have+0.005
 do+0.005
 find+0.004
 go+0.003
Neg logit contributions
rob-0.015
rito-0.015
rol-0.015
at-0.014
us-0.014
Token cried
Token ID16896
Feature activation+7.76e-03

Pos logit contributions
led+0.014
ted+0.013
rito+0.013
rob+0.013
us+0.013
Neg logit contributions
 have-0.011
 be-0.010
 do-0.010
 find-0.010
 I-0.009
Token until
Token ID1566
Feature activation+1.55e-02

Pos logit contributions
 be+0.008
 have+0.008
 do+0.007
 find+0.007
 I+0.005
Neg logit contributions
rito-0.021
ier-0.021
rob-0.020
rol-0.020
us-0.020
Token her
Token ID607
Feature activation+2.19e-03

Pos logit contributions
 be+0.022
 have+0.021
 do+0.021
 find+0.019
 want+0.017
Neg logit contributions
led-0.033
ier-0.033
iest-0.033
ted-0.033
rito-0.033
Token mom
Token ID1995
Feature activation+1.64e-03

Pos logit contributions
 have+0.002
 be+0.002
 do+0.001
 find+0.001
 see+0.001
Neg logit contributions
rito-0.006
ier-0.006
led-0.006
iest-0.006
at-0.006
Tokenmy
Token ID1820
Feature activation+6.24e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.002
 find+0.002
 I+0.002
Neg logit contributions
rito-0.003
us-0.003
ier-0.003
at-0.003
led-0.003
Token
Token ID198
Feature activation+2.84e-03

Pos logit contributions
us+0.006
rol+0.006
rito+0.006
at+0.006
rob+0.006
Neg logit contributions
 be-0.003
 do-0.003
 have-0.003
 find-0.003
 go-0.002
Token
Token ID198
Feature activation+9.04e-04

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 go+0.002
Neg logit contributions
us-0.007
rob-0.007
rito-0.007
rol-0.007
at-0.007
TokenSo
Token ID2396
Feature activation-9.89e-04

Pos logit contributions
 be+0.002
 have+0.001
 do+0.001
 find+0.001
 look+0.001
Neg logit contributions
us-0.002
rito-0.002
rob-0.002
rol-0.002
at-0.002
Token,
Token ID11
Feature activation-2.40e-03

Pos logit contributions
led+0.003
ier+0.003
us+0.003
rol+0.002
iest+0.002
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 see-0.001
Token Tim
Token ID5045
Feature activation-4.64e-04

Pos logit contributions
rito+0.006
rob+0.006
rol+0.006
ier+0.006
us+0.006
Neg logit contributions
 be-0.003
 have-0.003
 do-0.002
 find-0.002
 I-0.002
Tokenmy
Token ID1820
Feature activation+1.18e-02

Pos logit contributions
rob+0.001
rol+0.001
rito+0.001
iest+0.001
ier+0.001
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 go-0.000
Token and
Token ID290
Feature activation-3.14e-03

Pos logit contributions
 be+0.013
 have+0.012
 do+0.012
 find+0.010
 go+0.009
Neg logit contributions
rob-0.032
rito-0.032
rol-0.032
ier-0.032
iest-0.032
Token Billy
Token ID15890
Feature activation-1.15e-02

Pos logit contributions
rito+0.007
rob+0.007
us+0.007
rol+0.007
ier+0.007
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.004
 I-0.003
Token hid
Token ID24519
Feature activation+1.58e-03

Pos logit contributions
rito+0.026
rob+0.026
rol+0.026
ier+0.026
at+0.026
Neg logit contributions
 be-0.016
 have-0.016
 do-0.015
 find-0.014
 I-0.013
Token behind
Token ID2157
Feature activation-6.56e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 see+0.002
Neg logit contributions
rob-0.004
at-0.004
ridge-0.004
rol-0.004
rito-0.004
Token the
Token ID262
Feature activation-9.84e-03

Pos logit contributions
rito+0.017
rob+0.017
rol+0.017
us+0.017
ier+0.017
Neg logit contributions
 be-0.007
 have-0.007
 do-0.006
 find-0.006
 I-0.005
Token He
Token ID679
Feature activation+2.63e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 go+0.001
Neg logit contributions
rob-0.004
rol-0.004
rito-0.004
ier-0.004
us-0.004
Token calls
Token ID3848
Feature activation-4.02e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 I+0.003
Neg logit contributions
led-0.006
at-0.006
rob-0.006
ridge-0.006
rito-0.006
Token it
Token ID340
Feature activation-4.37e-03

Pos logit contributions
rito+0.009
ier+0.009
rol+0.009
us+0.009
rob+0.009
Neg logit contributions
 be-0.006
 have-0.006
 do-0.005
 find-0.005
 I-0.004
Token an
Token ID281
Feature activation+3.94e-03

Pos logit contributions
rob+0.010
rito+0.010
rol+0.010
ier+0.010
us+0.010
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.005
 go-0.005
Token x
Token ID2124
Feature activation-3.30e-02

Pos logit contributions
 have+0.007
 be+0.006
 do+0.006
 find+0.006
 I+0.005
Neg logit contributions
ier-0.008
rito-0.008
at-0.008
iest-0.008
us-0.007
Token-
Token ID12
Feature activation-1.96e-02

Pos logit contributions
rol+0.067
rob+0.065
rito+0.065
us+0.065
ier+0.065
Neg logit contributions
 be-0.057
 have-0.055
 do-0.054
 find-0.050
 see-0.045
Tokenray
Token ID2433
Feature activation-8.09e-03

Pos logit contributions
rol+0.040
ier+0.039
rily+0.039
 casc+0.038
rob+0.038
Neg logit contributions
 be-0.032
 have-0.031
 do-0.031
 find-0.028
 I-0.027
Token.
Token ID13
Feature activation+3.27e-03

Pos logit contributions
rito+0.022
rol+0.022
rob+0.022
at+0.021
us+0.021
Neg logit contributions
 be-0.009
 have-0.008
 do-0.008
 find-0.007
 I-0.006
Token He
Token ID679
Feature activation+1.66e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.002
 I+0.002
Neg logit contributions
rito-0.009
rob-0.009
rol-0.009
ier-0.009
us-0.009
Token says
Token ID1139
Feature activation+2.23e-04

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 I+0.002
Neg logit contributions
led-0.004
rob-0.004
at-0.004
ridge-0.004
rito-0.004
Token it
Token ID340
Feature activation-9.50e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 see+0.000
Neg logit contributions
rito-0.001
ier-0.001
us-0.001
rob-0.001
rol-0.001
Token the
Token ID262
Feature activation+8.89e-03

Pos logit contributions
rito+0.002
rob+0.002
ier+0.002
rol+0.002
us+0.002
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token mail
Token ID6920
Feature activation-5.28e-03

Pos logit contributions
 be+0.010
 have+0.010
 find+0.009
 do+0.009
 I+0.008
Neg logit contributions
rob-0.020
rito-0.020
cream-0.020
iest-0.020
ted-0.020
Tokenman
Token ID805
Feature activation+1.39e-02

Pos logit contributions
rob+0.007
rito+0.007
rol+0.007
us+0.007
at+0.007
Neg logit contributions
 be-0.013
 have-0.013
 do-0.012
 find-0.012
 go-0.011
Token put
Token ID1234
Feature activation+7.52e-03

Pos logit contributions
 be+0.021
 have+0.021
 do+0.020
 find+0.019
 go+0.016
Neg logit contributions
rito-0.031
rob-0.031
rol-0.031
ier-0.030
us-0.030
Token a
Token ID257
Feature activation+5.90e-03

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.006
 go+0.005
Neg logit contributions
rito-0.021
rol-0.021
rob-0.021
ier-0.020
at-0.020
Token new
Token ID649
Feature activation-1.44e-02

Pos logit contributions
 have+0.008
 be+0.008
 do+0.008
 find+0.007
 I+0.007
Neg logit contributions
rito-0.013
us-0.013
iest-0.013
ier-0.013
 kernels-0.013
Token nut
Token ID6701
Feature activation+4.69e-03

Pos logit contributions
rito+0.029
iest+0.028
 kernels+0.026
ier+0.026
rily+0.026
Neg logit contributions
 have-0.024
 do-0.023
 be-0.023
 find-0.021
 I-0.020
Token in
Token ID287
Feature activation+5.98e-03

Pos logit contributions
 be+0.007
 have+0.006
 do+0.006
 find+0.006
 go+0.005
Neg logit contributions
rob-0.011
rito-0.011
rol-0.011
us-0.011
ier-0.011
Token the
Token ID262
Feature activation+1.84e-03

Pos logit contributions
 be+0.009
 have+0.009
 do+0.009
 find+0.008
 I+0.007
Neg logit contributions
rito-0.013
us-0.013
rob-0.013
rol-0.012
ier-0.012
Token mail
Token ID6920
Feature activation-1.38e-03

Pos logit contributions
 have+0.002
 do+0.002
 be+0.002
 find+0.002
 want+0.002
Neg logit contributions
rito-0.004
cream-0.004
ridge-0.004
rob-0.004
ted-0.004
Token just
Token ID655
Feature activation-1.37e-02

Pos logit contributions
rito+0.004
rob+0.004
ier+0.004
rol+0.004
us+0.004
Neg logit contributions
 be-0.002
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token.
Token ID13
Feature activation-6.90e-04

Pos logit contributions
rol+0.031
ier+0.031
led+0.031
iest+0.031
rito+0.030
Neg logit contributions
 be-0.021
 have-0.020
 do-0.020
 find-0.019
 see-0.016
Token He
Token ID679
Feature activation-5.50e-03

Pos logit contributions
rito+0.001
rol+0.001
us+0.001
rob+0.001
at+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 I-0.001
Token brought
Token ID3181
Feature activation-5.58e-03

Pos logit contributions
led+0.012
rob+0.012
at+0.012
rol+0.011
rito+0.011
Neg logit contributions
 be-0.007
 have-0.007
 do-0.007
 find-0.006
 I-0.006
Token along
Token ID1863
Feature activation-6.98e-03

Pos logit contributions
rito+0.014
rob+0.014
ier+0.014
rol+0.014
us+0.014
Neg logit contributions
 be-0.007
 have-0.007
 do-0.006
 find-0.006
 go-0.005
Token a
Token ID257
Feature activation-7.99e-03

Pos logit contributions
rob+0.016
rol+0.016
led+0.016
us+0.016
rito+0.016
Neg logit contributions
 be-0.009
 have-0.008
 do-0.008
 find-0.007
 I-0.007
Token stuffed
Token ID22259
Feature activation-2.64e-02

Pos logit contributions
 kernels+0.016
iest+0.016
rito+0.015
us+0.015
rol+0.015
Neg logit contributions
 have-0.011
 be-0.010
 I-0.010
 want-0.009
 do-0.009
Token animal
Token ID5044
Feature activation-6.77e-03

Pos logit contributions
ier+0.065
rito+0.065
rol+0.064
rob+0.064
us+0.064
Neg logit contributions
 be-0.034
 have-0.032
 do-0.031
 find-0.029
 go-0.024
Token and
Token ID290
Feature activation+3.13e-03

Pos logit contributions
us+0.015
at+0.015
rol+0.015
iest+0.015
rito+0.015
Neg logit contributions
 be-0.009
 have-0.009
 do-0.009
 find-0.008
 I-0.007
Token some
Token ID617
Feature activation-2.66e-03

Pos logit contributions
 be+0.005
 have+0.005
 do+0.004
 find+0.004
 I+0.004
Neg logit contributions
led-0.007
rob-0.007
rito-0.007
us-0.007
ier-0.006
Token snacks
Token ID28487
Feature activation+6.08e-03

Pos logit contributions
rito+0.006
iest+0.006
led+0.005
ier+0.005
rob+0.005
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.004
 I-0.003
Token.
Token ID13
Feature activation+2.66e-03

Pos logit contributions
 be+0.006
 have+0.005
 do+0.005
 find+0.004
 I+0.003
Neg logit contributions
rito-0.017
rob-0.017
rol-0.017
ier-0.017
us-0.017
Token Jack
Token ID3619
Feature activation-1.05e-02

Pos logit contributions
rob+0.014
ted+0.014
us+0.013
led+0.013
ier+0.013
Neg logit contributions
 be-0.009
 have-0.008
 do-0.008
 find-0.008
 learn-0.007
Token tried
Token ID3088
Feature activation-2.69e-03

Pos logit contributions
led+0.023
rol+0.023
at+0.023
ier+0.023
rito+0.023
Neg logit contributions
 be-0.014
 have-0.013
 do-0.013
 find-0.012
 I-0.012
Token to
Token ID284
Feature activation+6.31e-03

Pos logit contributions
rito+0.006
rob+0.006
rol+0.006
ier+0.006
at+0.006
Neg logit contributions
 be-0.004
 have-0.004
 do-0.004
 find-0.003
 see-0.003
Token pull
Token ID2834
Feature activation+1.10e-02

Pos logit contributions
 be+0.007
 have+0.006
 do+0.006
 I+0.005
 find+0.005
Neg logit contributions
rito-0.016
at-0.015
ier-0.015
iest-0.015
rob-0.015
Token it
Token ID340
Feature activation+1.25e-03

Pos logit contributions
 be+0.013
 have+0.012
 do+0.011
 find+0.010
 I+0.009
Neg logit contributions
rito-0.028
us-0.028
ier-0.028
rob-0.028
led-0.028
Token out
Token ID503
Feature activation-1.52e-02

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 go+0.002
Neg logit contributions
rob-0.003
rito-0.003
rol-0.003
us-0.003
at-0.003
Token,
Token ID11
Feature activation+1.29e-03

Pos logit contributions
ier+0.042
rito+0.042
iest+0.041
rol+0.041
led+0.041
Neg logit contributions
 be-0.014
 have-0.014
 do-0.013
 find-0.012
 see-0.009
Token but
Token ID475
Feature activation+6.10e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.001
 go+0.001
Neg logit contributions
rito-0.003
ier-0.003
rol-0.003
rob-0.003
us-0.003
Token it
Token ID340
Feature activation+1.61e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.007
 find+0.006
 see+0.005
Neg logit contributions
led-0.014
ier-0.014
rol-0.014
iest-0.014
us-0.014
Token was
Token ID373
Feature activation+6.38e-05

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 go+0.001
Neg logit contributions
rob-0.004
rito-0.004
us-0.004
rol-0.004
ier-0.004
Token too
Token ID1165
Feature activation+1.21e-02

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 I+0.000
Neg logit contributions
rob-0.000
rol-0.000
rito-0.000
at-0.000
us-0.000
Token was
Token ID373
Feature activation-3.22e-04

Pos logit contributions
 be+0.005
 have+0.005
 do+0.005
 find+0.005
 see+0.004
Neg logit contributions
led-0.010
rob-0.010
us-0.010
at-0.010
iest-0.010
Token a
Token ID257
Feature activation+5.45e-03

Pos logit contributions
rito+0.001
rob+0.001
rol+0.001
ier+0.001
at+0.001
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 I-0.000
Token kind
Token ID1611
Feature activation+1.40e-02

Pos logit contributions
 be+0.006
 have+0.006
 do+0.005
 find+0.005
 I+0.004
Neg logit contributions
rito-0.014
rob-0.014
us-0.014
ier-0.014
iest-0.014
Token man
Token ID582
Feature activation+1.07e-02

Pos logit contributions
 be+0.017
 have+0.017
 do+0.016
 find+0.015
 see+0.012
Neg logit contributions
rito-0.034
ier-0.034
iest-0.034
us-0.034
rob-0.033
Token who
Token ID508
Feature activation-1.00e-02

Pos logit contributions
 be+0.011
 have+0.010
 do+0.010
 find+0.009
 see+0.007
Neg logit contributions
us-0.028
rito-0.028
at-0.028
rob-0.028
ier-0.028
Token loved
Token ID6151
Feature activation-1.48e-02

Pos logit contributions
rob+0.024
rito+0.024
led+0.024
us+0.024
at+0.024
Neg logit contributions
 be-0.013
 have-0.012
 do-0.011
 find-0.011
 I-0.009
Token to
Token ID284
Feature activation-9.87e-03

Pos logit contributions
rito+0.033
ier+0.033
rol+0.033
rob+0.033
led+0.033
Neg logit contributions
 be-0.023
 have-0.022
 do-0.021
 find-0.019
 I-0.017
Token show
Token ID905
Feature activation+5.40e-03

Pos logit contributions
at+0.021
iest+0.021
rito+0.021
ridge+0.020
rob+0.020
Neg logit contributions
 has-0.008
 be-0.008
 I-0.008
 frame-0.008
 have-0.008
Token her
Token ID607
Feature activation+1.06e-02

Pos logit contributions
 be+0.008
 have+0.008
 do+0.008
 find+0.007
 I+0.007
Neg logit contributions
us-0.011
rol-0.011
ier-0.011
rito-0.011
iest-0.011
Token his
Token ID465
Feature activation+5.09e-03

Pos logit contributions
 be+0.014
 have+0.013
 do+0.013
 find+0.012
 I+0.010
Neg logit contributions
rito-0.025
rob-0.025
ier-0.025
rol-0.025
us-0.025
Token special
Token ID2041
Feature activation-3.84e-03

Pos logit contributions
 be+0.007
 have+0.006
 do+0.006
 find+0.006
 I+0.005
Neg logit contributions
us-0.012
rito-0.012
rob-0.012
iest-0.012
at-0.012
Token arm
Token ID3211
Feature activation+9.09e-03

Pos logit contributions
iest+0.016
rito+0.016
rol+0.015
ier+0.015
 kernels+0.015
Neg logit contributions
 be-0.010
 have-0.010
 do-0.009
 find-0.009
 see-0.008
Tokenchair
Token ID16337
Feature activation+3.06e-03

Pos logit contributions
 be+0.011
 have+0.011
 do+0.011
 find+0.010
 see+0.009
Neg logit contributions
ier-0.022
iest-0.022
rito-0.022
rol-0.022
rob-0.022
Token.
Token ID13
Feature activation+1.18e-03

Pos logit contributions
 be+0.003
 have+0.003
 do+0.003
 find+0.003
 see+0.002
Neg logit contributions
ier-0.008
rito-0.008
rol-0.008
rob-0.008
at-0.008
Token
Token ID198
Feature activation-1.89e-03

Pos logit contributions
 be+0.002
 do+0.002
 have+0.001
 find+0.001
 look+0.001
Neg logit contributions
rol-0.003
rito-0.003
us-0.003
ridge-0.003
rily-0.003
Token
Token ID198
Feature activation-2.88e-03

Pos logit contributions
us+0.004
rol+0.004
rito+0.004
zzy+0.004
at+0.004
Neg logit contributions
 be-0.003
 do-0.002
 have-0.002
 find-0.002
 look-0.002
TokenTHE
Token ID10970
Feature activation-4.24e-02

Pos logit contributions
rol+0.005
us+0.005
at+0.005
rito+0.005
zzy+0.005
Neg logit contributions
 be-0.005
 do-0.005
 have-0.005
 find-0.005
 look-0.005
Token<|endoftext|>
Token ID50256
Feature activation-1.21e-02

Pos logit contributions
rob+0.091
rol+0.090
led+0.085
ridge+0.084
at+0.083
Neg logit contributions
 do-0.060
 be-0.060
 have-0.054
 find-0.054
 mean-0.049
TokenOnce
Token ID7454
Feature activation+3.23e-03

Pos logit contributions
rito+0.028
ier+0.027
rol+0.027
rob+0.027
led+0.027
Neg logit contributions
 be-0.018
 have-0.018
 do-0.017
 find-0.016
 I-0.014
Token upon
Token ID2402
Feature activation-2.41e-03

Pos logit contributions
 be+0.007
 do+0.007
 have+0.007
 find+0.006
 learn+0.006
Neg logit contributions
led-0.005
rol-0.005
us-0.005
ier-0.005
iest-0.005
Token a
Token ID257
Feature activation-5.39e-03

Pos logit contributions
rob+0.006
rito+0.006
us+0.006
led+0.006
ier+0.006
Neg logit contributions
 be-0.003
 have-0.002
 do-0.002
 find-0.002
 I-0.002
Token time
Token ID640
Feature activation+4.27e-03

Pos logit contributions
at+0.009
rob+0.009
led+0.009
rito+0.009
us+0.009
Neg logit contributions
 be-0.010
 have-0.010
 do-0.010
 find-0.009
 I-0.009
Token them
Token ID606
Feature activation+6.04e-03

Pos logit contributions
 have+0.008
 be+0.008
 do+0.007
 find+0.007
 I+0.006
Neg logit contributions
us-0.012
rito-0.012
ier-0.012
ridge-0.012
led-0.012
Token and
Token ID290
Feature activation+1.04e-02

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.006
 I+0.006
Neg logit contributions
rito-0.015
ier-0.014
rol-0.014
led-0.014
us-0.014
Token never
Token ID1239
Feature activation-7.20e-04

Pos logit contributions
 be+0.011
 have+0.011
 do+0.011
 find+0.009
 I+0.008
Neg logit contributions
ridge-0.023
led-0.023
rito-0.023
cream-0.022
 kernels-0.022
Token threw
Token ID9617
Feature activation+6.45e-03

Pos logit contributions
led+0.002
ier+0.002
rito+0.002
 cooled+0.001
ted+0.001
Neg logit contributions
 be-0.001
 have-0.001
 do-0.001
 find-0.001
 mom-0.001
Token a
Token ID257
Feature activation-8.05e-03

Pos logit contributions
 be+0.011
 do+0.011
 have+0.011
 find+0.010
 I+0.009
Neg logit contributions
rito-0.012
led-0.012
ier-0.012
rol-0.012
us-0.012
Token tant
Token ID24246
Feature activation-6.11e-02

Pos logit contributions
rito+0.016
us+0.015
iest+0.015
ier+0.015
at+0.015
Neg logit contributions
 be-0.014
 have-0.014
 do-0.014
 find-0.013
 I-0.012
Tokenrum
Token ID6582
Feature activation+1.50e-03

Pos logit contributions
rol+0.066
rito+0.064
ridge+0.062
 kernels+0.061
rob+0.061
Neg logit contributions
 be-0.157
 have-0.151
 do-0.148
 find-0.143
 learn-0.134
Token again
Token ID757
Feature activation+1.22e-02

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 see+0.001
Neg logit contributions
ier-0.004
led-0.004
rol-0.004
iest-0.004
us-0.004
Token.
Token ID13
Feature activation+2.18e-03

Pos logit contributions
 be+0.013
 have+0.013
 do+0.013
 find+0.011
 go+0.010
Neg logit contributions
ier-0.030
rito-0.030
us-0.029
iest-0.029
led-0.029
Token<|endoftext|>
Token ID50256
Feature activation-1.02e-02

Pos logit contributions
 do+0.003
 be+0.003
 have+0.003
 find+0.003
 go+0.003
Neg logit contributions
rol-0.004
us-0.004
rito-0.004
rily-0.004
ridge-0.004
TokenOnce
Token ID7454
Feature activation+7.15e-03

Pos logit contributions
rito+0.029
ier+0.029
rol+0.029
rob+0.029
led+0.029
Neg logit contributions
 be-0.009
 have-0.009
 do-0.008
 find-0.007
 I-0.005
Token face
Token ID1986
Feature activation-4.18e-03

Pos logit contributions
rito+0.047
led+0.047
rily+0.046
ier+0.046
us+0.045
Neg logit contributions
 have-0.015
 do-0.013
 be-0.012
 find-0.011
 I-0.010
Token.
Token ID13
Feature activation-6.32e-03

Pos logit contributions
rob+0.012
rito+0.012
rol+0.012
us+0.012
at+0.012
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.002
 I-0.002
Token
Token ID220
Feature activation-4.83e-03

Pos logit contributions
rito+0.016
rob+0.016
rol+0.016
ier+0.016
us+0.016
Neg logit contributions
 be-0.007
 have-0.007
 do-0.006
 find-0.006
 I-0.005
Token
Token ID198
Feature activation-7.88e-03

Pos logit contributions
rito+0.013
rob+0.013
rol+0.013
ier+0.013
us+0.013
Neg logit contributions
 be-0.005
 have-0.004
 do-0.004
 find-0.004
 I-0.003
Token
Token ID198
Feature activation-8.47e-03

Pos logit contributions
rito+0.023
ier+0.023
rol+0.022
rob+0.022
iest+0.022
Neg logit contributions
 be-0.007
 have-0.007
 do-0.006
 find-0.005
 see-0.004
TokenTHE
Token ID10970
Feature activation-4.39e-02

Pos logit contributions
rito+0.018
ier+0.017
rol+0.017
iest+0.017
led+0.016
Neg logit contributions
 be-0.015
 have-0.015
 do-0.014
 find-0.013
 go-0.012
Token<|endoftext|>
Token ID50256
Feature activation-1.26e-02

Pos logit contributions
rito+0.108
rob+0.107
ier+0.107
rol+0.107
us+0.106
Neg logit contributions
 be-0.056
 have-0.054
 do-0.052
 find-0.047
 I-0.041
TokenOnce
Token ID7454
Feature activation+5.37e-03

Pos logit contributions
rito+0.026
ier+0.025
rol+0.025
led+0.024
at+0.024
Neg logit contributions
 have-0.022
 be-0.022
 do-0.021
 find-0.020
 I-0.018
Token upon
Token ID2402
Feature activation+1.55e-03

Pos logit contributions
 be+0.011
 have+0.011
 do+0.011
 find+0.010
 learn+0.010
Neg logit contributions
iest-0.008
us-0.008
led-0.008
ier-0.008
rol-0.008
Token a
Token ID257
Feature activation-8.75e-03

Pos logit contributions
 be+0.002
 have+0.002
 do+0.001
 find+0.001
 I+0.001
Neg logit contributions
rito-0.004
us-0.004
rob-0.004
led-0.004
ier-0.004
Token time
Token ID640
Feature activation+6.10e-03

Pos logit contributions
at+0.016
rito+0.016
rob+0.016
led+0.015
us+0.015
Neg logit contributions
 be-0.016
 have-0.016
 do-0.015
 find-0.015
 I-0.013
Token.
Token ID13
Feature activation+1.71e-03

Pos logit contributions
 be+0.008
 have+0.008
 do+0.008
 find+0.007
 see+0.006
Neg logit contributions
led-0.020
ier-0.020
rito-0.020
at-0.020
iest-0.020
Token They
Token ID1119
Feature activation+2.53e-04

Pos logit contributions
 be+0.002
 have+0.002
 do+0.002
 find+0.002
 go+0.001
Neg logit contributions
rito-0.004
rol-0.004
rob-0.004
us-0.004
ier-0.004
Token were
Token ID547
Feature activation-4.60e-03

Pos logit contributions
 be+0.000
 have+0.000
 do+0.000
 find+0.000
 I+0.000
Neg logit contributions
led-0.001
 kernels-0.000
rol-0.000
ted-0.000
at-0.000
Token blue
Token ID4171
Feature activation+5.74e-03

Pos logit contributions
rito+0.011
rol+0.010
led+0.010
ier+0.010
 kernels+0.010
Neg logit contributions
 be-0.006
 have-0.006
 do-0.006
 find-0.005
 I-0.005
Token and
Token ID290
Feature activation-7.51e-03

Pos logit contributions
 be+0.007
 have+0.007
 do+0.006
 find+0.006
 I+0.005
Neg logit contributions
rito-0.014
rob-0.014
rol-0.014
ier-0.014
us-0.014
Token spe
Token ID693
Feature activation-4.43e-02

Pos logit contributions
rito+0.017
 kernels+0.016
led+0.016
iest+0.015
ridge+0.015
Neg logit contributions
 be-0.010
 find-0.010
 do-0.010
 have-0.010
 learn-0.008
Tokenck
Token ID694
Feature activation-2.54e-03

Pos logit contributions
ier+0.091
rol+0.089
rito+0.089
rob+0.088
iest+0.088
Neg logit contributions
 be-0.076
 have-0.071
 do-0.069
 find-0.065
 go-0.058
Tokenled
Token ID992
Feature activation-1.43e-02

Pos logit contributions
rob+0.002
rito+0.002
rol+0.002
at+0.002
ridge+0.002
Neg logit contributions
 be-0.007
 have-0.007
 do-0.007
 find-0.007
 go-0.007
Token.
Token ID13
Feature activation+3.12e-03

Pos logit contributions
rito+0.033
ier+0.033
rol+0.033
led+0.033
us+0.033
Neg logit contributions
 be-0.019
 have-0.019
 do-0.018
 find-0.017
 I-0.015
Token L
Token ID406
Feature activation+6.60e-04

Pos logit contributions
 be+0.004
 have+0.004
 do+0.003
 find+0.003
 I+0.003
Neg logit contributions
rito-0.008
rol-0.008
rob-0.008
us-0.008
ier-0.008
Tokenila
Token ID10102
Feature activation+6.46e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 I+0.001
Neg logit contributions
rito-0.001
ridge-0.001
rob-0.001
rol-0.001
led-0.001
Token.
Token ID13
Feature activation+5.67e-03

Pos logit contributions
 be+0.011
 have+0.010
 do+0.010
 find+0.009
 see+0.008
Neg logit contributions
rito-0.027
ier-0.027
rol-0.027
at-0.027
led-0.027
Token Despite
Token ID7945
Feature activation-1.05e-02

Pos logit contributions
 be+0.008
 have+0.007
 do+0.007
 find+0.007
 go+0.006
Neg logit contributions
rol-0.013
rito-0.013
us-0.013
rob-0.013
ier-0.013
Token wearing
Token ID5762
Feature activation-6.99e-03

Pos logit contributions
rol+0.022
rito+0.022
ier+0.022
us+0.022
iest+0.022
Neg logit contributions
 be-0.015
 have-0.014
 do-0.014
 find-0.013
 I-0.012
Token a
Token ID257
Feature activation+8.71e-04

Pos logit contributions
rito+0.018
rob+0.018
rol+0.017
ier+0.017
us+0.017
Neg logit contributions
 be-0.008
 have-0.008
 do-0.008
 find-0.007
 I-0.006
Token pink
Token ID11398
Feature activation+4.31e-03

Pos logit contributions
 be+0.001
 have+0.001
 do+0.001
 find+0.001
 I+0.001
Neg logit contributions
rito-0.002
ier-0.002
us-0.002
iest-0.002
rob-0.002
Token cast
Token ID3350
Feature activation-4.18e-02

Pos logit contributions
 be+0.006
 have+0.006
 do+0.006
 find+0.005
 I+0.005
Neg logit contributions
rito-0.010
rob-0.009
rol-0.009
ier-0.009
at-0.009
Token,
Token ID11
Feature activation-3.45e-03

Pos logit contributions
ier+0.118
rito+0.118
iest+0.117
rol+0.117
led+0.116
Neg logit contributions
 be-0.036
 have-0.035
 do-0.032
 find-0.029
 I-0.022
Token Sally
Token ID25737
Feature activation-2.16e-04

Pos logit contributions
ier+0.008
iest+0.008
led+0.008
rol+0.008
rito+0.008
Neg logit contributions
 be-0.005
 have-0.004
 do-0.004
 find-0.004
 go-0.003
Token never
Token ID1239
Feature activation-1.83e-03

Pos logit contributions
rito+0.000
rob+0.000
ier+0.000
rol+0.000
us+0.000
Neg logit contributions
 be-0.000
 have-0.000
 do-0.000
 find-0.000
 I-0.000
Token went
Token ID1816
Feature activation-7.02e-03

Pos logit contributions
ier+0.004
rito+0.004
led+0.004
us+0.003
at+0.003
Neg logit contributions
 be-0.003
 have-0.003
 do-0.003
 find-0.003
 I-0.002
Token back
Token ID736
Feature activation-5.16e-03

Pos logit contributions
rito+0.015
rob+0.015
rol+0.015
ier+0.015
us+0.015
Neg logit contributions
 be-0.011
 have-0.011
 do-0.010
 find-0.010
 I-0.009