TOP ACTIVATIONS
MAX = 0.049

kept pouring down its bright sun rays , filling the sky
happy and enjoyed eating her pie . <|endoftext|> Once upon a
way it looks ! <|endoftext|> In the far north there was
good . Clare enjoyed her pie while playing outside in the
The end . <|endoftext|> In a jungle , a graceful
Finally , she packed her sun cream . That way she
The end ! <|endoftext|> In the room there was nothing
it all up ! <|endoftext|> In the morning , Adam saw
the night sky . <|endoftext|> In his backyard , Tim my
so proud of his mud pie that he offered it to
was very proud of her ash tr ay . She put
the guitar together . <|endoftext|> In the woods there was a
the night sky . <|endoftext|> In the morning , Mary and
best day ever ! <|endoftext|> In the kitchen two little kids
. The end ! <|endoftext|> In the winter , it was
put on her cozy p aj amas and went to bed
art to come . <|endoftext|> In the park , there was
was very happy with his sun fl owers . He visited
to see it . <|endoftext|> In the morning , Mom my
sun fl owers spread their sunshine colour around the castle and

MIN ACTIVATIONS
MIN = -0.064

listen . She threw a tant rum in the store .
She cried and threw a tant rum . Her mom tried
her feet and threw a tant rum . She grabbed the
. He kept throwing a tant rum . His mom was
to cry and threw a tant rum . She didn 't
kept crying and throwing a tant rum . Finally
sad . She threw a tant rum and cried loudly .
and started to throw a tant rum . Suddenly , she
behind them and says " bo o !" Lily and Ben
stubborn . She throws a tant rum . She kicks and
Are you hurt ? Why did you do that ? Why
surprise . He threw a tant rum . He kicked and
make it whole again . Amen ." Then ,
and so she threw a tant rum when her parents said
you do that ? Why did you not listen to me
to help sort the or nam ents . She picked up
a big box of or nam ents . They were so
to me . The or nam ents were very important to
and play with your or nam ents . Maybe you can
such a mess ? Why did you hide from me ?"

INTERVAL 0.021 - 0.049
CONTAINS 0.976%

!" And from that day on , Tim and his friends
Dad had so much fun with all their silly things .
As he reflected on all he had seen ,
curiosity got the best of him . He couldn 't help
but she did not know what to get . The judge

INTERVAL -0.007 - 0.021
CONTAINS 81.237%

!" From that day on , whenever the m
in his hands . With it , he made a mother
decided to sit in the mud and have some fun .
They slid down fast and laughed . But when they reached
was there to remind her to take a break .

INTERVAL -0.036 - -0.007
CONTAINS 17.760%

say sorry to Tim 's dad and the animals . They
them . She wanted to share them with someone who would
using it to make people laugh ," said Uncle .
and said , " Sure , let 's go to the
, Ellie didn 't want to go because she was too

INTERVAL -0.064 - -0.036
CONTAINS 0.026%

" Tom , where did you hide your shoe ?"
said Mama . The 3 year old boy ran outside
Of course , but why do you want to buy such
surprise . She watched in amaz ement as the yellow knee
to her . " Where did you get that money ?
Token kept
Token ID4030
Feature activation+9.89e-03

Pos logit contributions
is+0.057
andy+0.053
ally+0.051
illy+0.051
rou+0.051
Neg logit contributions
rac-0.045
 passports-0.044
adoes-0.044
 teeth-0.043
ver-0.042
Token pouring
Token ID23147
Feature activation+1.44e-02

Pos logit contributions
is+0.016
andy+0.015
ally+0.015
 gn+0.014
rou+0.014
Neg logit contributions
rac-0.013
adoes-0.013
 there-0.013
 passports-0.013
 teeth-0.012
Token down
Token ID866
Feature activation-1.04e-02

Pos logit contributions
is+0.029
andy+0.027
ally+0.026
illy+0.026
rou+0.026
Neg logit contributions
rac-0.013
 passports-0.013
adoes-0.013
 teeth-0.012
ver-0.012
Token its
Token ID663
Feature activation+3.27e-03

Pos logit contributions
rac+0.011
 passports+0.011
adoes+0.011
 teeth+0.010
ver+0.010
Neg logit contributions
is-0.019
andy-0.018
ally-0.017
illy-0.017
rou-0.017
Token bright
Token ID6016
Feature activation+1.33e-02

Pos logit contributions
is+0.005
illy+0.005
andy+0.005
scar+0.005
rou+0.005
Neg logit contributions
rac-0.004
adoes-0.004
 teeth-0.004
 passports-0.004
 turns-0.004
Token sun
Token ID4252
Feature activation+4.93e-02

Pos logit contributions
is+0.023
rou+0.022
andy+0.021
scar+0.021
illy+0.021
Neg logit contributions
adoes-0.016
rac-0.015
 passports-0.015
 teeth-0.015
 turns-0.014
Token rays
Token ID24823
Feature activation+1.18e-02

Pos logit contributions
is+0.090
andy+0.084
rou+0.083
illy+0.082
ally+0.082
Neg logit contributions
rac-0.055
adoes-0.054
 passports-0.052
 teeth-0.051
ver-0.049
Token,
Token ID11
Feature activation+2.25e-02

Pos logit contributions
is+0.022
andy+0.022
ally+0.021
rou+0.021
illy+0.021
Neg logit contributions
rac-0.012
ver-0.011
 passports-0.011
adoes-0.011
 teeth-0.010
Token filling
Token ID12591
Feature activation-6.45e-03

Pos logit contributions
is+0.039
andy+0.038
ally+0.037
illy+0.037
rou+0.036
Neg logit contributions
rac-0.025
 passports-0.025
adoes-0.024
 teeth-0.023
 there-0.023
Token the
Token ID262
Feature activation+1.12e-02

Pos logit contributions
adoes+0.004
 passports+0.004
rac+0.004
 teeth+0.004
 turns+0.004
Neg logit contributions
is-0.015
andy-0.014
ally-0.014
rou-0.014
 sidelines-0.014
Token sky
Token ID6766
Feature activation+2.39e-02

Pos logit contributions
is+0.019
andy+0.018
illy+0.017
rou+0.017
scar+0.017
Neg logit contributions
rac-0.014
adoes-0.013
 passports-0.013
 turns-0.013
 teeth-0.013
Token happy
Token ID3772
Feature activation-2.87e-03

Pos logit contributions
is+0.009
rou+0.009
andy+0.009
 freshly+0.008
ally+0.008
Neg logit contributions
adoes-0.008
 passports-0.007
rac-0.007
 guys-0.007
 teeth-0.007
Token and
Token ID290
Feature activation+7.95e-03

Pos logit contributions
 there+0.002
 teeth+0.002
 passports+0.002
adoes+0.002
 turns+0.002
Neg logit contributions
is-0.006
andy-0.006
Mic-0.006
ally-0.006
rou-0.006
Token enjoyed
Token ID8359
Feature activation+7.24e-03

Pos logit contributions
is+0.014
andy+0.013
illy+0.013
rou+0.013
ally+0.012
Neg logit contributions
rac-0.009
 teeth-0.009
 passports-0.009
 turns-0.009
adoes-0.009
Token eating
Token ID6600
Feature activation+3.11e-03

Pos logit contributions
is+0.016
andy+0.016
ally+0.016
ini+0.015
ousy+0.015
Neg logit contributions
 there-0.006
 it-0.005
 anything-0.005
 something-0.005
 everything-0.005
Token her
Token ID607
Feature activation+6.08e-03

Pos logit contributions
is+0.006
andy+0.006
rou+0.006
icol+0.006
scar+0.006
Neg logit contributions
 teeth-0.003
adoes-0.003
 there-0.003
 passports-0.003
rac-0.003
Token pie
Token ID2508
Feature activation+4.89e-02

Pos logit contributions
is+0.008
andy+0.007
ally+0.007
illy+0.007
rou+0.007
Neg logit contributions
rac-0.010
 passports-0.010
adoes-0.010
 teeth-0.010
ver-0.009
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
is+0.096
rou+0.095
 sidelines+0.093
icol+0.093
andy+0.093
Neg logit contributions
rac-0.041
 teeth-0.041
ver-0.041
 there-0.040
adoes-0.040
Token<|endoftext|>
Token ID50256
Feature activation+5.77e-03

Pos logit contributions
is+0.023
andy+0.021
illy+0.020
rou+0.020
ally+0.020
Neg logit contributions
rac-0.015
adoes-0.015
 passports-0.014
 teeth-0.014
ver-0.014
TokenOnce
Token ID7454
Feature activation-6.29e-03

Pos logit contributions
is+0.012
andy+0.011
rou+0.011
ally+0.011
illy+0.010
Neg logit contributions
rac-0.005
adoes-0.005
 passports-0.005
 teeth-0.005
ver-0.005
Token upon
Token ID2402
Feature activation-2.02e-02

Pos logit contributions
adoes+0.009
 passports+0.008
rac+0.008
 teeth+0.008
 guys+0.007
Neg logit contributions
is-0.010
andy-0.010
ally-0.010
illy-0.009
rou-0.009
Token a
Token ID257
Feature activation+1.74e-02

Pos logit contributions
rac+0.022
adoes+0.022
 passports+0.021
ver+0.018
 teeth+0.018
Neg logit contributions
is-0.037
 freshly-0.035
illy-0.035
 crawling-0.035
andy-0.034
Token way
Token ID835
Feature activation-7.29e-03

Pos logit contributions
is+0.009
ally+0.009
illy+0.009
andy+0.009
live+0.009
Neg logit contributions
 passports-0.006
 anyone-0.006
 guys-0.005
 teeth-0.005
 turns-0.005
Token it
Token ID340
Feature activation+1.35e-02

Pos logit contributions
 there+0.008
 anyone+0.007
ver+0.007
rac+0.007
adoes+0.007
Neg logit contributions
is-0.014
icol-0.014
innie-0.013
 bisc-0.013
 sav-0.013
Token looks
Token ID3073
Feature activation+5.65e-03

Pos logit contributions
is+0.024
andy+0.023
ally+0.022
illy+0.022
rou+0.022
Neg logit contributions
rac-0.015
 passports-0.015
adoes-0.015
 teeth-0.014
ver-0.013
Token!
Token ID0
Feature activation+7.47e-03

Pos logit contributions
is+0.010
andy+0.009
ally+0.009
illy+0.009
icol+0.009
Neg logit contributions
rac-0.006
adoes-0.006
 passports-0.006
 teeth-0.006
ver-0.006
Token<|endoftext|>
Token ID50256
Feature activation+3.89e-03

Pos logit contributions
is+0.014
andy+0.013
ally+0.013
illy+0.013
rou+0.013
Neg logit contributions
rac-0.008
 passports-0.008
adoes-0.008
 teeth-0.007
ver-0.007
TokenIn
Token ID818
Feature activation+4.78e-02

Pos logit contributions
is+0.008
andy+0.007
rou+0.007
ally+0.007
illy+0.007
Neg logit contributions
rac-0.004
 passports-0.004
adoes-0.004
 teeth-0.003
ver-0.003
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
is+0.124
rou+0.119
ally+0.113
isy+0.112
andy+0.112
Neg logit contributions
 there-0.023
rac-0.023
 passports-0.020
 turns-0.017
 anyone-0.015
Token far
Token ID1290
Feature activation+6.65e-04

Pos logit contributions
is+0.035
rou+0.034
isy+0.034
icol+0.033
con+0.032
Neg logit contributions
 there-0.023
 turns-0.022
 passports-0.020
rac-0.020
 people-0.020
Token north
Token ID5093
Feature activation+1.72e-03

Pos logit contributions
is+0.001
andy+0.001
ally+0.001
illy+0.001
rou+0.001
Neg logit contributions
 passports-0.001
rac-0.001
adoes-0.001
 teeth-0.001
ver-0.001
Token there
Token ID612
Feature activation-5.48e-03

Pos logit contributions
is+0.003
andy+0.003
ally+0.003
illy+0.003
rou+0.003
Neg logit contributions
rac-0.002
 passports-0.002
adoes-0.002
 teeth-0.002
ver-0.002
Token was
Token ID373
Feature activation+2.45e-04

Pos logit contributions
rac+0.005
 passports+0.005
adoes+0.004
ver+0.004
 teeth+0.004
Neg logit contributions
is-0.011
andy-0.010
rou-0.010
icol-0.010
ally-0.010
Token good
Token ID922
Feature activation-6.59e-03

Pos logit contributions
rac+0.003
 teeth+0.003
adoes+0.003
 passports+0.003
 turns+0.003
Neg logit contributions
is-0.005
andy-0.005
icol-0.004
ally-0.004
illy-0.004
Token.
Token ID13
Feature activation+8.11e-03

Pos logit contributions
 teeth+0.009
 turns+0.009
adoes+0.009
 passports+0.009
 there+0.009
Neg logit contributions
is-0.010
andy-0.009
 sidelines-0.009
rou-0.009
 gn-0.009
Token Clare
Token ID22736
Feature activation-1.94e-02

Pos logit contributions
 sav+0.011
icol+0.010
 freshly+0.010
is+0.010
illy+0.010
Neg logit contributions
adoes-0.012
 passports-0.012
rac-0.012
 guys-0.012
ver-0.012
Token enjoyed
Token ID8359
Feature activation+7.00e-03

Pos logit contributions
rac+0.023
 passports+0.021
 teeth+0.021
adoes+0.021
ver+0.021
Neg logit contributions
is-0.034
andy-0.032
illy-0.031
icol-0.031
rou-0.031
Token her
Token ID607
Feature activation+1.52e-02

Pos logit contributions
andy+0.017
icol+0.016
is+0.016
ally+0.016
zie+0.016
Neg logit contributions
 there-0.004
 everything-0.003
 turns-0.003
 it-0.003
rac-0.003
Token pie
Token ID2508
Feature activation+4.72e-02

Pos logit contributions
is+0.021
andy+0.019
rou+0.018
illy+0.018
ally+0.018
Neg logit contributions
rac-0.024
 passports-0.023
adoes-0.023
 teeth-0.023
ver-0.022
Token while
Token ID981
Feature activation+2.04e-02

Pos logit contributions
 sidelines+0.098
is+0.097
icol+0.096
rou+0.095
 grizz+0.095
Neg logit contributions
rac-0.035
 teeth-0.033
ver-0.033
 there-0.032
adoes-0.030
Token playing
Token ID2712
Feature activation+3.09e-03

Pos logit contributions
is+0.034
icol+0.031
rou+0.031
andy+0.031
ally+0.030
Neg logit contributions
rac-0.026
 passports-0.026
 teeth-0.026
 anyone-0.025
 there-0.025
Token outside
Token ID2354
Feature activation+9.88e-03

Pos logit contributions
is+0.006
andy+0.006
rou+0.006
icol+0.006
illy+0.006
Neg logit contributions
adoes-0.003
rac-0.003
 teeth-0.002
 passports-0.002
 turns-0.002
Token in
Token ID287
Feature activation+6.78e-03

Pos logit contributions
is+0.020
icol+0.018
andy+0.018
ally+0.018
rou+0.018
Neg logit contributions
 there-0.009
adoes-0.009
 teeth-0.009
 anyone-0.009
rac-0.009
Token the
Token ID262
Feature activation+9.36e-03

Pos logit contributions
is+0.017
rou+0.016
isy+0.016
icol+0.016
scar+0.016
Neg logit contributions
 there-0.004
 turns-0.003
 anyone-0.003
 passports-0.003
 anything-0.003
Token
Token ID198
Feature activation-3.32e-03

Pos logit contributions
rac+0.001
 there+0.001
ver+0.001
 passports+0.001
 If+0.001
Neg logit contributions
icol-0.003
is-0.003
 freshly-0.003
rou-0.003
illy-0.003
TokenThe
Token ID464
Feature activation+2.27e-04

Pos logit contributions
rac+0.004
ver+0.004
 there+0.004
 passports+0.004
 turns+0.003
Neg logit contributions
icol-0.006
 freshly-0.005
is-0.005
rou-0.005
con-0.005
Token end
Token ID886
Feature activation+1.92e-02

Pos logit contributions
ally+0.000
andy+0.000
icol+0.000
live+0.000
liv+0.000
Neg logit contributions
 people-0.000
 passports-0.000
 guys-0.000
 voices-0.000
 astronauts-0.000
Token.
Token ID13
Feature activation-4.59e-03

Pos logit contributions
is+0.033
andy+0.031
ally+0.031
illy+0.031
rou+0.030
Neg logit contributions
rac-0.023
 teeth-0.022
adoes-0.022
 passports-0.022
 turns-0.022
Token<|endoftext|>
Token ID50256
Feature activation+8.81e-03

Pos logit contributions
adoes+0.005
 passports+0.005
rac+0.005
 teeth+0.005
 guys+0.005
Neg logit contributions
is-0.008
andy-0.007
ally-0.007
illy-0.007
rou-0.007
TokenIn
Token ID818
Feature activation+4.70e-02

Pos logit contributions
is+0.016
andy+0.015
ally+0.015
illy+0.015
rou+0.015
Neg logit contributions
rac-0.009
 passports-0.009
adoes-0.009
 teeth-0.009
ver-0.008
Token a
Token ID257
Feature activation+8.39e-03

Pos logit contributions
is+0.116
rou+0.110
andy+0.105
ally+0.105
isy+0.103
Neg logit contributions
rac-0.029
 there-0.025
 passports-0.025
adoes-0.023
 turns-0.023
Token jungle
Token ID20712
Feature activation-6.11e-03

Pos logit contributions
is+0.017
illy+0.017
rou+0.017
icol+0.016
 lending+0.016
Neg logit contributions
 turns-0.007
rac-0.006
 there-0.006
 passports-0.006
 cannot-0.006
Token,
Token ID11
Feature activation-2.09e-02

Pos logit contributions
rac+0.008
 passports+0.007
adoes+0.007
 teeth+0.007
ver+0.007
Neg logit contributions
is-0.010
andy-0.010
ally-0.009
illy-0.009
rou-0.009
Token a
Token ID257
Feature activation+1.39e-03

Pos logit contributions
 passports+0.014
rac+0.012
adoes+0.012
 teeth+0.012
 guys+0.009
Neg logit contributions
is-0.047
andy-0.046
ally-0.045
 crawling-0.045
illy-0.045
Token graceful
Token ID44363
Feature activation+9.15e-06

Pos logit contributions
zie+0.003
illy+0.003
rou+0.003
is+0.003
icol+0.003
Neg logit contributions
 there-0.001
 turns-0.001
 people-0.001
 astronauts-0.001
 guys-0.001
TokenFinally
Token ID11158
Feature activation+1.85e-02

Pos logit contributions
is+0.005
icol+0.005
rou+0.005
 freshly+0.005
illy+0.005
Neg logit contributions
rac-0.004
 passports-0.004
ver-0.004
adoes-0.004
 turns-0.004
Token,
Token ID11
Feature activation-1.65e-02

Pos logit contributions
is+0.032
andy+0.030
ally+0.029
illy+0.029
rou+0.029
Neg logit contributions
rac-0.022
 passports-0.022
adoes-0.022
 teeth-0.021
ver-0.020
Token she
Token ID673
Feature activation+9.48e-03

Pos logit contributions
adoes+0.020
 passports+0.019
rac+0.019
 teeth+0.018
ver+0.017
Neg logit contributions
andy-0.028
is-0.027
ally-0.027
illy-0.026
 crawling-0.026
Token packed
Token ID11856
Feature activation-1.68e-02

Pos logit contributions
is+0.016
andy+0.015
ally+0.015
illy+0.015
rou+0.015
Neg logit contributions
rac-0.011
 passports-0.011
adoes-0.011
 teeth-0.011
ver-0.010
Token her
Token ID607
Feature activation-1.24e-03

Pos logit contributions
rac+0.020
adoes+0.019
ver+0.019
 passports+0.018
 teeth+0.018
Neg logit contributions
is-0.030
andy-0.027
illy-0.027
ally-0.027
 freshly-0.027
Token sun
Token ID4252
Feature activation+4.70e-02

Pos logit contributions
rac+0.002
adoes+0.002
ver+0.001
 passports+0.001
 teeth+0.001
Neg logit contributions
is-0.002
andy-0.002
 freshly-0.002
ally-0.002
illy-0.002
Token cream
Token ID8566
Feature activation+3.38e-02

Pos logit contributions
is+0.093
rou+0.088
andy+0.087
illy+0.085
 sidelines+0.085
Neg logit contributions
rac-0.044
 passports-0.043
adoes-0.043
 teeth-0.040
ver-0.039
Token.
Token ID13
Feature activation+8.66e-03

Pos logit contributions
is+0.070
andy+0.066
rou+0.066
ally+0.064
illy+0.063
Neg logit contributions
rac-0.028
 passports-0.028
adoes-0.027
ver-0.026
 there-0.025
Token That
Token ID1320
Feature activation-6.63e-03

Pos logit contributions
is+0.016
andy+0.015
rou+0.015
illy+0.015
ally+0.014
Neg logit contributions
rac-0.009
adoes-0.009
 passports-0.009
 teeth-0.009
ver-0.009
Token way
Token ID835
Feature activation-1.94e-02

Pos logit contributions
rac+0.008
 passports+0.008
adoes+0.008
 teeth+0.008
ver+0.008
Neg logit contributions
is-0.011
andy-0.010
ally-0.010
illy-0.010
rou-0.010
Token she
Token ID673
Feature activation+3.61e-03

Pos logit contributions
 passports+0.018
rac+0.018
adoes+0.017
 teeth+0.017
 there+0.017
Neg logit contributions
is-0.039
rou-0.035
icol-0.035
 gn-0.035
andy-0.035
Token
Token ID198
Feature activation+4.00e-03

Pos logit contributions
is+0.013
icol+0.013
rou+0.013
ky+0.013
andy+0.012
Neg logit contributions
rac-0.004
ver-0.004
 there-0.003
 turns-0.003
 If-0.003
TokenThe
Token ID464
Feature activation+1.87e-03

Pos logit contributions
icol+0.008
 freshly+0.007
rou+0.007
andy+0.007
is+0.007
Neg logit contributions
rac-0.004
ver-0.004
 there-0.003
 turns-0.003
adoes-0.003
Token end
Token ID886
Feature activation+1.86e-02

Pos logit contributions
icol+0.003
illy+0.003
is+0.003
rou+0.003
parent+0.003
Neg logit contributions
 turns-0.002
 there-0.002
 teeth-0.002
rac-0.002
 voices-0.002
Token!
Token ID0
Feature activation+4.51e-03

Pos logit contributions
is+0.037
 sidelines+0.036
ally+0.035
illy+0.035
icol+0.035
Neg logit contributions
 teeth-0.018
 turns-0.017
 there-0.016
 happens-0.016
adoes-0.016
Token<|endoftext|>
Token ID50256
Feature activation+7.56e-03

Pos logit contributions
is+0.009
andy+0.009
rou+0.008
illy+0.008
ally+0.008
Neg logit contributions
rac-0.004
 passports-0.004
adoes-0.004
ver-0.004
 teeth-0.004
TokenIn
Token ID818
Feature activation+4.65e-02

Pos logit contributions
is+0.015
andy+0.014
rou+0.014
ally+0.014
illy+0.014
Neg logit contributions
rac-0.007
adoes-0.006
 passports-0.006
 teeth-0.006
ver-0.006
Token the
Token ID262
Feature activation+1.62e-02

Pos logit contributions
is+0.120
rou+0.114
andy+0.108
ally+0.107
isy+0.106
Neg logit contributions
 there-0.024
rac-0.024
 passports-0.020
 turns-0.018
adoes-0.017
Token room
Token ID2119
Feature activation-1.88e-02

Pos logit contributions
is+0.029
rou+0.029
isy+0.028
icol+0.027
con+0.026
Neg logit contributions
 there-0.019
 turns-0.018
 voices-0.018
 people-0.017
rac-0.017
Token there
Token ID612
Feature activation-8.90e-03

Pos logit contributions
rac+0.019
 passports+0.019
adoes+0.018
 teeth+0.018
ver+0.017
Neg logit contributions
is-0.035
rou-0.033
icol-0.033
andy-0.032
 freshly-0.032
Token was
Token ID373
Feature activation-2.73e-03

Pos logit contributions
rac+0.008
 passports+0.008
adoes+0.008
ver+0.008
 teeth+0.008
Neg logit contributions
is-0.018
rou-0.016
andy-0.016
icol-0.016
ally-0.016
Token nothing
Token ID2147
Feature activation+5.66e-03

Pos logit contributions
 there+0.001
 teeth+0.001
 something+0.000
 passports+0.000
 anyone+0.000
Neg logit contributions
is-0.007
andy-0.007
 sidelines-0.007
icol-0.007
ally-0.007
Token it
Token ID340
Feature activation-1.01e-03

Pos logit contributions
is+0.020
andy+0.019
ally+0.019
illy+0.019
rou+0.019
Neg logit contributions
rac-0.004
 passports-0.004
adoes-0.004
 teeth-0.004
 turns-0.003
Token all
Token ID477
Feature activation-1.72e-02

Pos logit contributions
rac+0.001
 teeth+0.001
adoes+0.001
 passports+0.001
ver+0.001
Neg logit contributions
is-0.002
andy-0.002
ally-0.002
rou-0.002
illy-0.002
Token up
Token ID510
Feature activation-6.45e-03

Pos logit contributions
rac+0.016
 turns+0.016
 there+0.015
adoes+0.015
 passports+0.015
Neg logit contributions
is-0.033
andy-0.032
icol-0.032
 gn-0.031
 sidelines-0.031
Token!
Token ID0
Feature activation+9.60e-03

Pos logit contributions
 there+0.007
 teeth+0.006
 passports+0.006
 anyone+0.006
adoes+0.006
Neg logit contributions
is-0.012
andy-0.012
icol-0.011
illy-0.011
ally-0.011
Token<|endoftext|>
Token ID50256
Feature activation+5.15e-03

Pos logit contributions
is+0.020
andy+0.019
illy+0.018
rou+0.018
ally+0.018
Neg logit contributions
rac-0.008
 teeth-0.008
adoes-0.008
 turns-0.008
ver-0.007
TokenIn
Token ID818
Feature activation+4.62e-02

Pos logit contributions
is+0.010
andy+0.009
ally+0.009
rou+0.009
illy+0.009
Neg logit contributions
rac-0.005
 passports-0.005
adoes-0.005
 teeth-0.005
ver-0.005
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
is+0.115
rou+0.110
andy+0.106
ally+0.105
isy+0.105
Neg logit contributions
rac-0.026
 there-0.024
 passports-0.022
 turns-0.021
adoes-0.019
Token morning
Token ID3329
Feature activation+1.59e-02

Pos logit contributions
is+0.027
rou+0.026
isy+0.026
icol+0.026
con+0.025
Neg logit contributions
 turns-0.017
 there-0.017
 everything-0.016
 people-0.016
 passports-0.015
Token,
Token ID11
Feature activation-1.74e-02

Pos logit contributions
is+0.027
andy+0.025
illy+0.024
ally+0.024
rou+0.024
Neg logit contributions
rac-0.020
 passports-0.019
adoes-0.019
 teeth-0.019
ver-0.018
Token Adam
Token ID7244
Feature activation+3.44e-03

Pos logit contributions
 passports+0.013
adoes+0.012
rac+0.011
 teeth+0.010
 microphones+0.009
Neg logit contributions
is-0.038
andy-0.037
ally-0.037
illy-0.036
rou-0.036
Token saw
Token ID2497
Feature activation+1.34e-02

Pos logit contributions
is+0.007
andy+0.006
ally+0.006
 sidelines+0.006
rou+0.006
Neg logit contributions
rac-0.003
 passports-0.003
 teeth-0.003
 turns-0.003
adoes-0.003
Token the
Token ID262
Feature activation+4.38e-03

Pos logit contributions
is+0.004
andy+0.004
ally+0.003
illy+0.003
rou+0.003
Neg logit contributions
rac-0.002
 passports-0.002
adoes-0.002
 teeth-0.001
ver-0.001
Token night
Token ID1755
Feature activation+2.11e-02

Pos logit contributions
is+0.008
ally+0.008
andy+0.007
illy+0.007
live+0.007
Neg logit contributions
 passports-0.005
 turns-0.005
rac-0.005
 voices-0.005
 teeth-0.005
Token sky
Token ID6766
Feature activation+2.34e-02

Pos logit contributions
rou+0.037
scar+0.037
ally+0.037
is+0.036
andy+0.036
Neg logit contributions
rac-0.021
 passports-0.021
 there-0.020
 teeth-0.020
 turns-0.019
Token.
Token ID13
Feature activation+5.01e-03

Pos logit contributions
is+0.044
scar+0.042
icol+0.042
ally+0.042
illy+0.042
Neg logit contributions
rac-0.022
 there-0.022
 teeth-0.022
adoes-0.022
 passports-0.021
Token<|endoftext|>
Token ID50256
Feature activation+3.41e-03

Pos logit contributions
is+0.010
rou+0.009
andy+0.009
ally+0.009
illy+0.009
Neg logit contributions
rac-0.005
adoes-0.004
 teeth-0.004
 passports-0.004
ver-0.004
TokenIn
Token ID818
Feature activation+4.61e-02

Pos logit contributions
is+0.007
andy+0.006
rou+0.006
ally+0.006
illy+0.006
Neg logit contributions
rac-0.003
adoes-0.003
 passports-0.003
 teeth-0.003
ver-0.003
Token his
Token ID465
Feature activation+6.55e-03

Pos logit contributions
is+0.116
rou+0.112
andy+0.106
ally+0.106
isy+0.105
Neg logit contributions
rac-0.025
 there-0.024
 passports-0.022
 turns-0.020
adoes-0.018
Token backyard
Token ID24296
Feature activation-1.65e-02

Pos logit contributions
is+0.012
rou+0.011
ally+0.011
andy+0.011
illy+0.011
Neg logit contributions
 passports-0.007
rac-0.007
 teeth-0.007
 turns-0.007
adoes-0.007
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
rac+0.017
 passports+0.016
adoes+0.016
 teeth+0.016
ver+0.016
Neg logit contributions
is-0.031
icol-0.029
rou-0.029
andy-0.028
illy-0.028
Token Tim
Token ID5045
Feature activation+2.20e-02

Pos logit contributions
 passports+0.010
rac+0.009
adoes+0.009
 teeth+0.008
 guys+0.006
Neg logit contributions
is-0.038
andy-0.037
ally-0.037
rou-0.036
illy-0.036
Tokenmy
Token ID1820
Feature activation+8.51e-03

Pos logit contributions
 sidelines+0.036
is+0.035
icol+0.033
con+0.033
isy+0.030
Neg logit contributions
 there-0.032
?"-0.032
 turns-0.030
 teeth-0.029
 means-0.029
Token so
Token ID523
Feature activation-1.27e-03

Pos logit contributions
is+0.010
andy+0.009
ally+0.009
illy+0.009
rou+0.009
Neg logit contributions
rac-0.005
adoes-0.005
 passports-0.005
 teeth-0.004
ver-0.004
Token proud
Token ID6613
Feature activation+8.96e-03

Pos logit contributions
adoes+0.002
 passports+0.002
rac+0.002
 teeth+0.002
 guys+0.002
Neg logit contributions
is-0.002
rou-0.002
andy-0.002
 freshly-0.002
illy-0.002
Token of
Token ID286
Feature activation+1.79e-02

Pos logit contributions
icol+0.022
 sidelines+0.022
zie+0.022
Mic+0.022
isk+0.022
Neg logit contributions
 there-0.004
 teeth-0.002
 at-0.002
 passports-0.002
 turns-0.002
Token his
Token ID465
Feature activation+1.94e-02

Pos logit contributions
icol+0.042
is+0.042
 bisc+0.041
ally+0.040
ki+0.040
Neg logit contributions
 there-0.012
 anyone-0.010
 anything-0.010
 everything-0.010
 teeth-0.010
Token mud
Token ID17492
Feature activation+3.43e-02

Pos logit contributions
is+0.036
illy+0.034
ky+0.034
icol+0.034
andy+0.034
Neg logit contributions
 teeth-0.022
 turns-0.019
 passports-0.019
adoes-0.019
rac-0.018
Token pie
Token ID2508
Feature activation+4.55e-02

Pos logit contributions
is+0.066
 sidelines+0.062
iley+0.062
Mic+0.062
athy+0.061
Neg logit contributions
adoes-0.037
 teeth-0.037
 turns-0.031
 things-0.030
 there-0.030
Token that
Token ID326
Feature activation+1.50e-02

Pos logit contributions
 sidelines+0.086
is+0.085
rou+0.085
andy+0.084
icol+0.082
Neg logit contributions
 there-0.045
 teeth-0.044
 turns-0.040
 or-0.040
ver-0.040
Token he
Token ID339
Feature activation+1.13e-02

Pos logit contributions
is+0.024
andy+0.022
ally+0.021
illy+0.021
rou+0.021
Neg logit contributions
rac-0.020
adoes-0.020
 passports-0.020
 teeth-0.019
ver-0.018
Token offered
Token ID4438
Feature activation-6.20e-03

Pos logit contributions
is+0.020
andy+0.018
illy+0.018
ally+0.018
rou+0.018
Neg logit contributions
rac-0.013
 passports-0.013
adoes-0.013
 teeth-0.012
ver-0.012
Token it
Token ID340
Feature activation-7.78e-03

Pos logit contributions
rac+0.004
 passports+0.004
adoes+0.004
 teeth+0.004
 turns+0.003
Neg logit contributions
is-0.014
andy-0.014
ally-0.013
rou-0.013
icol-0.013
Token to
Token ID284
Feature activation+5.36e-03

Pos logit contributions
rac+0.006
 passports+0.006
adoes+0.006
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.016
andy-0.015
illy-0.015
ally-0.015
rou-0.015
Token was
Token ID373
Feature activation+1.64e-03

Pos logit contributions
rac+0.007
 turns+0.007
 teeth+0.007
 passports+0.006
adoes+0.006
Neg logit contributions
is-0.013
 sidelines-0.012
illy-0.012
icol-0.012
andy-0.012
Token very
Token ID845
Feature activation+5.24e-03

Pos logit contributions
is+0.003
rou+0.003
 crawling+0.003
 sav+0.003
 freshly+0.003
Neg logit contributions
adoes-0.002
rac-0.002
 passports-0.002
 guys-0.002
 teeth-0.002
Token proud
Token ID6613
Feature activation+1.06e-02

Pos logit contributions
is+0.008
rou+0.007
andy+0.007
 gn+0.007
illy+0.006
Neg logit contributions
adoes-0.009
 passports-0.008
rac-0.008
 teeth-0.008
 guys-0.008
Token of
Token ID286
Feature activation+1.62e-02

Pos logit contributions
icol+0.025
is+0.025
 sidelines+0.024
zie+0.024
ally+0.024
Neg logit contributions
 there-0.005
 passports-0.004
 teeth-0.004
 anyone-0.004
rac-0.004
Token her
Token ID607
Feature activation+1.52e-02

Pos logit contributions
 bisc+0.037
icol+0.036
is+0.035
cone+0.034
ki+0.033
Neg logit contributions
 anyone-0.015
 there-0.014
 anything-0.013
 everything-0.013
adoes-0.012
Token ash
Token ID12530
Feature activation+4.52e-02

Pos logit contributions
is+0.024
 sidelines+0.024
andy+0.024
scar+0.023
rou+0.023
Neg logit contributions
 teeth-0.020
 passports-0.018
 turns-0.018
adoes-0.018
 there-0.018
Tokentr
Token ID2213
Feature activation+4.14e-02

Pos logit contributions
is+0.075
 sidelines+0.075
illy+0.074
andy+0.073
ally+0.070
Neg logit contributions
adoes-0.060
 teeth-0.060
 guys-0.052
rac-0.051
 voices-0.050
Tokenay
Token ID323
Feature activation-1.90e-03

Pos logit contributions
is+0.064
andy+0.059
illy+0.058
ally+0.058
 gn+0.057
Neg logit contributions
rac-0.057
adoes-0.055
 passports-0.055
 teeth-0.053
ver-0.053
Token.
Token ID13
Feature activation+8.79e-03

Pos logit contributions
ver+0.002
adoes+0.002
 teeth+0.002
rac+0.002
 anyone+0.002
Neg logit contributions
icol-0.003
is-0.003
 gn-0.003
andy-0.003
 sidelines-0.003
Token She
Token ID1375
Feature activation+1.25e-02

Pos logit contributions
is+0.013
andy+0.012
ally+0.012
illy+0.012
rou+0.012
Neg logit contributions
rac-0.012
 passports-0.012
adoes-0.012
 teeth-0.012
ver-0.011
Token put
Token ID1234
Feature activation+6.14e-03

Pos logit contributions
is+0.023
 sidelines+0.021
illy+0.021
andy+0.021
ally+0.021
Neg logit contributions
rac-0.014
 turns-0.013
 teeth-0.013
 passports-0.012
ver-0.012
Token the
Token ID262
Feature activation-2.42e-03

Pos logit contributions
andy+0.029
ally+0.029
is+0.028
illy+0.028
ousy+0.027
Neg logit contributions
 passports-0.013
rac-0.012
 teeth-0.012
 turns-0.012
adoes-0.012
Token guitar
Token ID10047
Feature activation+1.41e-03

Pos logit contributions
 passports+0.003
adoes+0.003
 teeth+0.003
 guys+0.003
 turns+0.003
Neg logit contributions
ally-0.004
andy-0.004
illy-0.004
is-0.004
icol-0.004
Token together
Token ID1978
Feature activation+9.79e-03

Pos logit contributions
is+0.003
andy+0.003
illy+0.002
ally+0.002
rou+0.002
Neg logit contributions
rac-0.001
adoes-0.001
 passports-0.001
 teeth-0.001
ver-0.001
Token.
Token ID13
Feature activation+7.02e-03

Pos logit contributions
is+0.018
andy+0.017
ally+0.017
icol+0.017
illy+0.017
Neg logit contributions
rac-0.010
ver-0.010
 passports-0.010
 teeth-0.010
adoes-0.010
Token<|endoftext|>
Token ID50256
Feature activation+5.32e-03

Pos logit contributions
is+0.012
andy+0.012
 freshly+0.011
ally+0.011
 sav+0.011
Neg logit contributions
rac-0.008
 passports-0.008
adoes-0.008
 teeth-0.007
 guys-0.007
TokenIn
Token ID818
Feature activation+4.52e-02

Pos logit contributions
is+0.009
andy+0.009
ally+0.009
illy+0.009
rou+0.009
Neg logit contributions
rac-0.006
 passports-0.006
adoes-0.006
 teeth-0.006
ver-0.005
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
is+0.111
rou+0.105
andy+0.101
ally+0.101
isy+0.100
Neg logit contributions
rac-0.027
 passports-0.025
 there-0.024
 turns-0.022
adoes-0.022
Token woods
Token ID16479
Feature activation-1.14e-02

Pos logit contributions
is+0.019
isy+0.018
rou+0.018
illy+0.017
icol+0.016
Neg logit contributions
 turns-0.014
 there-0.014
 passports-0.013
 voices-0.013
rac-0.013
Token there
Token ID612
Feature activation-7.63e-03

Pos logit contributions
rac+0.012
 passports+0.012
adoes+0.012
 teeth+0.011
ver+0.011
Neg logit contributions
is-0.021
andy-0.020
rou-0.020
icol-0.020
illy-0.019
Token was
Token ID373
Feature activation-3.19e-03

Pos logit contributions
rac+0.006
ver+0.005
 passports+0.005
adoes+0.005
 teeth+0.005
Neg logit contributions
is-0.017
rou-0.015
icol-0.015
andy-0.015
 sidelines-0.015
Token a
Token ID257
Feature activation+5.42e-04

Pos logit contributions
 there+0.001
 something+0.000
 teeth+0.000
ver+0.000
 turns+0.000
Neg logit contributions
is-0.009
andy-0.008
 sidelines-0.008
ally-0.008
rou-0.008
Token the
Token ID262
Feature activation+2.67e-03

Pos logit contributions
 passports+0.009
rac+0.009
adoes+0.009
 teeth+0.009
 anyone+0.008
Neg logit contributions
is-0.028
andy-0.026
rou-0.026
ally-0.026
illy-0.025
Token night
Token ID1755
Feature activation+1.64e-02

Pos logit contributions
is+0.005
andy+0.005
ally+0.005
rou+0.005
illy+0.005
Neg logit contributions
 passports-0.003
rac-0.003
 teeth-0.003
 anyone-0.003
adoes-0.003
Token sky
Token ID6766
Feature activation+2.05e-02

Pos logit contributions
is+0.028
rou+0.027
andy+0.027
ally+0.027
illy+0.027
Neg logit contributions
 there-0.017
 teeth-0.016
adoes-0.016
 anyone-0.016
 passports-0.016
Token.
Token ID13
Feature activation+8.29e-03

Pos logit contributions
is+0.038
icol+0.036
ally+0.036
andy+0.036
illy+0.036
Neg logit contributions
 there-0.020
 teeth-0.020
adoes-0.019
 anyone-0.019
ver-0.019
Token<|endoftext|>
Token ID50256
Feature activation+3.82e-03

Pos logit contributions
is+0.019
isy+0.017
rou+0.017
 sidelines+0.017
 gn+0.016
Neg logit contributions
rac-0.006
 teeth-0.006
ver-0.005
adoes-0.005
 turns-0.005
TokenIn
Token ID818
Feature activation+4.51e-02

Pos logit contributions
is+0.008
rou+0.008
andy+0.008
icol+0.007
ally+0.007
Neg logit contributions
rac-0.003
ver-0.003
 teeth-0.003
 there-0.003
 turns-0.003
Token the
Token ID262
Feature activation+1.44e-02

Pos logit contributions
is+0.116
rou+0.110
andy+0.105
isy+0.104
ally+0.104
Neg logit contributions
 there-0.024
rac-0.023
 passports-0.020
 turns-0.017
 anyone-0.016
Token morning
Token ID3329
Feature activation+1.37e-02

Pos logit contributions
is+0.027
rou+0.026
isy+0.025
icol+0.024
con+0.024
Neg logit contributions
 there-0.017
 turns-0.015
 something-0.015
 people-0.015
 some-0.015
Token,
Token ID11
Feature activation-1.91e-02

Pos logit contributions
is+0.024
andy+0.022
rou+0.022
ally+0.022
illy+0.021
Neg logit contributions
rac-0.016
 passports-0.016
adoes-0.016
 teeth-0.015
ver-0.015
Token Mary
Token ID5335
Feature activation+6.20e-03

Pos logit contributions
rac+0.008
 passports+0.008
adoes+0.008
 teeth+0.007
ver+0.006
Neg logit contributions
is-0.047
andy-0.045
ally-0.045
illy-0.044
rou-0.044
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
is+0.013
 sidelines+0.013
rou+0.012
ally+0.012
andy+0.012
Neg logit contributions
 teeth-0.005
 turns-0.005
rac-0.005
 passports-0.005
 there-0.005
Token best
Token ID1266
Feature activation-9.77e-03

Pos logit contributions
ally+0.002
andy+0.002
is+0.002
gie+0.002
scar+0.002
Neg logit contributions
 passports-0.001
rac-0.001
 anyone-0.001
 people-0.001
 turns-0.001
Token day
Token ID1110
Feature activation+2.00e-02

Pos logit contributions
 passports+0.013
 turns+0.012
 there+0.012
 teeth+0.011
rac+0.011
Neg logit contributions
is-0.015
Mic-0.015
andy-0.015
ousy-0.015
gie-0.015
Token ever
Token ID1683
Feature activation-1.28e-02

Pos logit contributions
is+0.037
andy+0.036
ousy+0.035
icol+0.035
rou+0.035
Neg logit contributions
 passports-0.020
 there-0.020
rac-0.020
 anyone-0.019
ver-0.019
Token!
Token ID0
Feature activation+9.93e-03

Pos logit contributions
 there+0.020
 turns+0.019
 passports+0.019
 anyone+0.019
ver+0.019
Neg logit contributions
 sidelines-0.016
scar-0.016
is-0.015
Mic-0.015
ousy-0.015
Token<|endoftext|>
Token ID50256
Feature activation+4.98e-03

Pos logit contributions
is+0.018
andy+0.017
ally+0.017
illy+0.017
rou+0.017
Neg logit contributions
rac-0.010
 passports-0.010
adoes-0.010
 teeth-0.010
ver-0.009
TokenIn
Token ID818
Feature activation+4.50e-02

Pos logit contributions
is+0.010
andy+0.010
rou+0.010
ally+0.009
illy+0.009
Neg logit contributions
rac-0.004
adoes-0.004
 passports-0.004
ver-0.004
 teeth-0.004
Token the
Token ID262
Feature activation+1.38e-02

Pos logit contributions
is+0.118
rou+0.113
andy+0.106
ally+0.106
isy+0.106
Neg logit contributions
 there-0.023
rac-0.021
 passports-0.018
 turns-0.017
adoes-0.015
Token kitchen
Token ID9592
Feature activation-1.18e-02

Pos logit contributions
is+0.024
rou+0.024
icol+0.023
con+0.023
isy+0.023
Neg logit contributions
 there-0.016
 turns-0.016
 people-0.015
 something-0.015
 passports-0.015
Token two
Token ID734
Feature activation+1.67e-02

Pos logit contributions
rac+0.012
 passports+0.012
 teeth+0.011
adoes+0.011
 there+0.011
Neg logit contributions
is-0.023
rou-0.021
icol-0.021
andy-0.021
ally-0.020
Token little
Token ID1310
Feature activation+1.07e-02

Pos logit contributions
icol+0.035
rou+0.033
is+0.033
scar+0.033
cone+0.033
Neg logit contributions
 teeth-0.015
 turns-0.014
 passports-0.013
 there-0.013
 people-0.013
Token kids
Token ID3988
Feature activation-1.01e-02

Pos logit contributions
is+0.023
rou+0.022
icol+0.022
isy+0.022
 gn+0.022
Neg logit contributions
 passports-0.008
 turns-0.008
 teeth-0.008
rac-0.008
adoes-0.007
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
 there+0.001
 anyone+0.001
 out+0.001
 off+0.001
 or+0.001
Neg logit contributions
is-0.002
innie-0.002
scar-0.002
icol-0.002
live-0.002
Token The
Token ID383
Feature activation+4.37e-03

Pos logit contributions
is+0.020
andy+0.019
ally+0.019
illy+0.019
 sav+0.019
Neg logit contributions
 passports-0.016
adoes-0.015
rac-0.015
 teeth-0.014
ver-0.013
Token end
Token ID886
Feature activation+2.14e-02

Pos logit contributions
icol+0.008
liv+0.008
live+0.008
ally+0.007
illy+0.007
Neg logit contributions
 people-0.005
 voices-0.005
 things-0.005
 teeth-0.005
 guys-0.005
Token!
Token ID0
Feature activation+8.88e-03

Pos logit contributions
is+0.043
 sidelines+0.041
ally+0.041
illy+0.041
andy+0.040
Neg logit contributions
 turns-0.020
 there-0.019
 teeth-0.019
 happens-0.019
adoes-0.018
Token<|endoftext|>
Token ID50256
Feature activation+1.90e-04

Pos logit contributions
is+0.015
andy+0.014
illy+0.014
ally+0.014
 freshly+0.014
Neg logit contributions
adoes-0.011
 passports-0.011
rac-0.010
 teeth-0.010
ver-0.009
TokenIn
Token ID818
Feature activation+4.50e-02

Pos logit contributions
is+0.000
andy+0.000
ally+0.000
illy+0.000
rou+0.000
Neg logit contributions
rac-0.000
 passports-0.000
adoes-0.000
 teeth-0.000
ver-0.000
Token the
Token ID262
Feature activation+1.46e-02

Pos logit contributions
is+0.113
rou+0.107
ally+0.103
andy+0.103
isy+0.101
Neg logit contributions
rac-0.025
 there-0.022
 passports-0.022
 turns-0.020
adoes-0.020
Token winter
Token ID7374
Feature activation+8.31e-03

Pos logit contributions
rou+0.025
is+0.025
isy+0.024
icol+0.024
illy+0.023
Neg logit contributions
 there-0.017
 turns-0.017
 passports-0.016
rac-0.016
 voices-0.016
Token,
Token ID11
Feature activation-1.95e-02

Pos logit contributions
is+0.015
rou+0.015
andy+0.014
icol+0.014
illy+0.014
Neg logit contributions
rac-0.008
 there-0.008
 turns-0.008
adoes-0.008
 passports-0.008
Token it
Token ID340
Feature activation+6.17e-03

Pos logit contributions
 passports+0.015
adoes+0.013
rac+0.013
 teeth+0.012
 tant+0.010
Neg logit contributions
is-0.042
andy-0.041
ally-0.041
illy-0.040
 freshly-0.040
Token was
Token ID373
Feature activation-1.17e-03

Pos logit contributions
is+0.012
icol+0.011
rou+0.011
ally+0.011
illy+0.011
Neg logit contributions
rac-0.006
 turns-0.005
 there-0.005
 teeth-0.005
adoes-0.005
Token put
Token ID1234
Feature activation+1.17e-02

Pos logit contributions
is+0.031
 sidelines+0.030
 freshly+0.028
icol+0.028
rou+0.028
Neg logit contributions
 teeth-0.014
 turns-0.013
rac-0.012
 passports-0.011
 there-0.011
Token on
Token ID319
Feature activation+3.83e-03

Pos logit contributions
is+0.024
andy+0.022
illy+0.022
ally+0.022
rou+0.022
Neg logit contributions
rac-0.011
 passports-0.010
adoes-0.010
 teeth-0.010
ver-0.009
Token her
Token ID607
Feature activation+5.25e-03

Pos logit contributions
is+0.008
andy+0.007
rou+0.007
ally+0.007
isy+0.007
Neg logit contributions
 teeth-0.004
 turns-0.003
rac-0.003
 passports-0.003
adoes-0.003
Token cozy
Token ID37438
Feature activation+9.37e-03

Pos logit contributions
is+0.009
rou+0.008
ally+0.008
andy+0.008
illy+0.008
Neg logit contributions
 teeth-0.007
rac-0.007
 passports-0.007
adoes-0.006
 turns-0.006
Token p
Token ID279
Feature activation+2.68e-02

Pos logit contributions
is+0.015
rou+0.015
illy+0.014
ally+0.014
andy+0.014
Neg logit contributions
 teeth-0.013
adoes-0.012
 passports-0.011
rac-0.011
 guys-0.011
Tokenaj
Token ID1228
Feature activation+4.48e-02

Pos logit contributions
 lending+0.047
is+0.046
 gn+0.044
 sidelines+0.044
 crawling+0.044
Neg logit contributions
rac-0.031
 teeth-0.030
adoes-0.027
 passports-0.027
outed-0.027
Tokenamas
Token ID17485
Feature activation-1.38e-02

Pos logit contributions
is+0.086
rou+0.081
andy+0.081
 gn+0.081
illy+0.079
Neg logit contributions
rac-0.044
adoes-0.043
 teeth-0.041
 passports-0.040
ver-0.036
Token and
Token ID290
Feature activation+6.32e-03

Pos logit contributions
ver+0.010
 there+0.009
cases+0.008
rac+0.008
 passports+0.008
Neg logit contributions
andy-0.030
is-0.030
ally-0.029
ousy-0.028
 sidelines-0.028
Token went
Token ID1816
Feature activation-3.49e-03

Pos logit contributions
is+0.011
andy+0.010
rou+0.010
ally+0.010
illy+0.010
Neg logit contributions
 passports-0.007
rac-0.007
 teeth-0.007
adoes-0.007
 turns-0.007
Token to
Token ID284
Feature activation+1.83e-03

Pos logit contributions
 passports+0.002
rac+0.002
 teeth+0.002
 there+0.002
 turns+0.002
Neg logit contributions
is-0.008
ally-0.008
andy-0.008
illy-0.007
rou-0.007
Token bed
Token ID3996
Feature activation+9.22e-03

Pos logit contributions
is+0.003
ally+0.003
rou+0.003
andy+0.003
isy+0.003
Neg logit contributions
 teeth-0.002
 passports-0.002
 anyone-0.002
 there-0.002
 turns-0.002
Token art
Token ID1242
Feature activation+5.14e-03

Pos logit contributions
is+0.015
andy+0.014
ally+0.014
icol+0.014
illy+0.014
Neg logit contributions
 passports-0.006
adoes-0.006
 anyone-0.006
 teeth-0.006
 there-0.006
Token to
Token ID284
Feature activation-2.16e-03

Pos logit contributions
is+0.008
andy+0.008
ally+0.008
rou+0.008
illy+0.008
Neg logit contributions
rac-0.007
 passports-0.006
adoes-0.006
ver-0.006
 teeth-0.006
Token come
Token ID1282
Feature activation-3.31e-03

Pos logit contributions
rac+0.003
 passports+0.002
adoes+0.002
 teeth+0.002
 anyone+0.002
Neg logit contributions
is-0.004
andy-0.004
ally-0.004
rou-0.004
icol-0.003
Token.
Token ID13
Feature activation+6.33e-03

Pos logit contributions
rac+0.004
 passports+0.004
adoes+0.004
 teeth+0.004
ver+0.004
Neg logit contributions
is-0.006
andy-0.005
rou-0.005
ally-0.005
illy-0.005
Token<|endoftext|>
Token ID50256
Feature activation+3.14e-03

Pos logit contributions
is+0.013
andy+0.012
ally+0.012
illy+0.012
rou+0.012
Neg logit contributions
rac-0.006
adoes-0.005
ver-0.005
 teeth-0.005
 passports-0.005
TokenIn
Token ID818
Feature activation+4.48e-02

Pos logit contributions
is+0.006
andy+0.006
rou+0.006
ally+0.006
illy+0.006
Neg logit contributions
rac-0.003
adoes-0.003
 passports-0.003
 teeth-0.003
ver-0.003
Token the
Token ID262
Feature activation+1.73e-02

Pos logit contributions
is+0.111
rou+0.107
ally+0.102
andy+0.101
isy+0.100
Neg logit contributions
rac-0.026
 there-0.024
 passports-0.023
 turns-0.021
adoes-0.018
Token park
Token ID3952
Feature activation-5.48e-03

Pos logit contributions
rou+0.029
is+0.029
icol+0.028
isy+0.028
illy+0.027
Neg logit contributions
 turns-0.021
 there-0.020
 passports-0.020
rac-0.020
 voices-0.019
Token,
Token ID11
Feature activation-1.71e-02

Pos logit contributions
rac+0.006
 passports+0.006
adoes+0.006
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.010
andy-0.009
rou-0.009
illy-0.009
icol-0.009
Token there
Token ID612
Feature activation-1.18e-02

Pos logit contributions
 passports+0.012
adoes+0.010
rac+0.010
 teeth+0.010
 guys+0.008
Neg logit contributions
is-0.039
andy-0.037
ally-0.037
illy-0.036
rou-0.036
Token was
Token ID373
Feature activation-3.69e-03

Pos logit contributions
rac+0.013
 passports+0.013
adoes+0.013
 teeth+0.013
ver+0.012
Neg logit contributions
is-0.021
andy-0.019
rou-0.019
illy-0.019
ally-0.019
Token was
Token ID373
Feature activation+8.39e-03

Pos logit contributions
rac+0.001
adoes+0.001
 passports+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.002
icol-0.002
illy-0.002
andy-0.002
 sidelines-0.002
Token very
Token ID845
Feature activation+7.74e-03

Pos logit contributions
is+0.017
andy+0.016
ally+0.016
icol+0.016
illy+0.016
Neg logit contributions
rac-0.007
 passports-0.007
 teeth-0.007
adoes-0.007
ver-0.007
Token happy
Token ID3772
Feature activation-9.71e-04

Pos logit contributions
is+0.014
andy+0.013
ally+0.013
illy+0.013
icol+0.013
Neg logit contributions
rac-0.008
 passports-0.008
adoes-0.008
 teeth-0.008
ver-0.007
Token with
Token ID351
Feature activation+2.32e-02

Pos logit contributions
 there+0.001
 teeth+0.001
 passports+0.001
adoes+0.001
rac+0.001
Neg logit contributions
is-0.002
ally-0.002
andy-0.002
 sidelines-0.002
rou-0.002
Token his
Token ID465
Feature activation+1.22e-02

Pos logit contributions
is+0.052
andy+0.048
rou+0.048
icol+0.048
ally+0.048
Neg logit contributions
 passports-0.017
rac-0.016
adoes-0.016
 teeth-0.016
 anyone-0.015
Token sun
Token ID4252
Feature activation+4.47e-02

Pos logit contributions
is+0.020
scar+0.019
ally+0.019
rou+0.019
illy+0.019
Neg logit contributions
 teeth-0.015
 passports-0.015
 turns-0.014
rac-0.014
adoes-0.013
Tokenfl
Token ID2704
Feature activation+2.66e-02

Pos logit contributions
is+0.063
 sidelines+0.060
rou+0.057
icol+0.056
scar+0.056
Neg logit contributions
adoes-0.067
rac-0.066
 teeth-0.065
 there-0.064
 passports-0.060
Tokenowers
Token ID3618
Feature activation+2.44e-02

Pos logit contributions
is+0.052
 gn+0.049
rou+0.049
 lending+0.048
icol+0.048
Neg logit contributions
rac-0.025
adoes-0.025
 teeth-0.023
 passports-0.022
ver-0.021
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
is+0.049
icol+0.049
andy+0.047
 sidelines+0.047
ally+0.046
Neg logit contributions
 there-0.021
ver-0.019
 teeth-0.019
 passports-0.019
rac-0.019
Token He
Token ID679
Feature activation+1.29e-02

Pos logit contributions
is+0.026
rou+0.023
andy+0.023
illy+0.023
isy+0.023
Neg logit contributions
rac-0.010
 teeth-0.009
adoes-0.009
ver-0.009
 passports-0.009
Token visited
Token ID8672
Feature activation-1.54e-02

Pos logit contributions
is+0.023
 sidelines+0.022
icol+0.022
illy+0.021
ally+0.021
Neg logit contributions
rac-0.014
 turns-0.013
 teeth-0.013
 passports-0.013
ver-0.012
Token to
Token ID284
Feature activation-7.74e-03

Pos logit contributions
 there+0.005
 anyone+0.005
 teeth+0.004
 anything+0.004
adoes+0.004
Neg logit contributions
is-0.015
ally-0.014
andy-0.014
live-0.014
scar-0.014
Token see
Token ID766
Feature activation+3.81e-03

Pos logit contributions
 anyone+0.007
rac+0.007
 anything+0.007
 teeth+0.007
 passports+0.006
Neg logit contributions
is-0.016
rou-0.015
andy-0.015
ally-0.015
icol-0.015
Token it
Token ID340
Feature activation+7.77e-04

Pos logit contributions
is+0.008
andy+0.008
ally+0.008
illy+0.007
icol+0.007
Neg logit contributions
rac-0.003
 passports-0.003
adoes-0.003
 teeth-0.003
ver-0.003
Token.
Token ID13
Feature activation+5.63e-03

Pos logit contributions
is+0.001
icol+0.001
illy+0.001
ally+0.001
andy+0.001
Neg logit contributions
 there-0.001
 anyone-0.001
adoes-0.001
 teeth-0.001
rac-0.001
Token<|endoftext|>
Token ID50256
Feature activation+3.95e-03

Pos logit contributions
is+0.010
andy+0.009
ally+0.009
illy+0.009
rou+0.009
Neg logit contributions
rac-0.006
 passports-0.006
adoes-0.006
 teeth-0.006
ver-0.006
TokenIn
Token ID818
Feature activation+4.47e-02

Pos logit contributions
is+0.008
andy+0.007
rou+0.007
ally+0.007
illy+0.007
Neg logit contributions
rac-0.003
adoes-0.003
 passports-0.003
ver-0.003
 teeth-0.003
Token the
Token ID262
Feature activation+1.72e-02

Pos logit contributions
is+0.111
rou+0.106
ally+0.101
andy+0.101
isy+0.100
Neg logit contributions
rac-0.026
 there-0.025
 passports-0.023
 turns-0.020
adoes-0.019
Token morning
Token ID3329
Feature activation+1.48e-02

Pos logit contributions
is+0.031
rou+0.030
isy+0.029
con+0.028
icol+0.028
Neg logit contributions
 there-0.020
 turns-0.019
 passports-0.018
 voices-0.018
 everything-0.018
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
is+0.026
andy+0.024
rou+0.023
ally+0.023
illy+0.023
Neg logit contributions
rac-0.018
 passports-0.017
adoes-0.017
 teeth-0.017
ver-0.016
Token Mom
Token ID11254
Feature activation-1.25e-03

Pos logit contributions
rac+0.010
 passports+0.010
adoes+0.010
 teeth+0.009
ver+0.008
Neg logit contributions
is-0.043
andy-0.041
ally-0.040
illy-0.040
rou-0.040
Tokenmy
Token ID1820
Feature activation+4.28e-03

Pos logit contributions
 there+0.002
 turns+0.002
?"+0.002
 means+0.002
rac+0.002
Neg logit contributions
is-0.002
 sidelines-0.002
icol-0.001
 lookout-0.001
isy-0.001
Token sun
Token ID4252
Feature activation+2.48e-02

Pos logit contributions
liv+0.012
rou+0.012
icol+0.012
scar+0.011
ally+0.011
Neg logit contributions
 people-0.008
 guys-0.008
 passports-0.007
 there-0.007
 anyone-0.007
Tokenfl
Token ID2704
Feature activation+1.09e-02

Pos logit contributions
is+0.034
 sidelines+0.033
icol+0.032
illy+0.031
rou+0.031
Neg logit contributions
rac-0.037
adoes-0.037
 there-0.035
 teeth-0.035
ver-0.034
Tokenowers
Token ID3618
Feature activation+1.78e-02

Pos logit contributions
is+0.020
andy+0.018
rou+0.018
ally+0.018
illy+0.018
Neg logit contributions
rac-0.012
adoes-0.012
 passports-0.012
 teeth-0.011
ver-0.011
Token spread
Token ID4104
Feature activation+7.28e-03

Pos logit contributions
is+0.034
icol+0.032
andy+0.032
rou+0.032
ally+0.032
Neg logit contributions
rac-0.017
 passports-0.016
adoes-0.016
ver-0.016
 teeth-0.015
Token their
Token ID511
Feature activation+7.12e-03

Pos logit contributions
is+0.015
andy+0.015
rou+0.014
ally+0.014
illy+0.014
Neg logit contributions
rac-0.006
adoes-0.006
 passports-0.005
 teeth-0.005
 turns-0.005
Token sunshine
Token ID34488
Feature activation+4.46e-02

Pos logit contributions
is+0.012
rou+0.012
andy+0.011
ally+0.011
illy+0.011
Neg logit contributions
rac-0.008
 passports-0.008
 teeth-0.008
adoes-0.008
 turns-0.008
Token colour
Token ID9568
Feature activation+1.98e-02

Pos logit contributions
is+0.087
andy+0.082
rou+0.082
illy+0.080
ally+0.080
Neg logit contributions
adoes-0.044
rac-0.044
 passports-0.042
 teeth-0.042
 turns-0.039
Token around
Token ID1088
Feature activation-1.86e-05

Pos logit contributions
is+0.037
andy+0.036
ally+0.035
rou+0.035
illy+0.035
Neg logit contributions
rac-0.020
adoes-0.019
 passports-0.019
 teeth-0.018
ver-0.018
Token the
Token ID262
Feature activation+8.20e-03

Pos logit contributions
 anyone+0.000
adoes+0.000
 there+0.000
rac+0.000
 teeth+0.000
Neg logit contributions
is-0.000
rou-0.000
scar-0.000
andy-0.000
icol-0.000
Token castle
Token ID16669
Feature activation+7.12e-03

Pos logit contributions
scar+0.014
liv+0.014
rou+0.013
live+0.013
is+0.013
Neg logit contributions
 there-0.010
 people-0.010
 temples-0.010
 guys-0.010
rac-0.009
Token and
Token ID290
Feature activation-1.16e-03

Pos logit contributions
is+0.013
andy+0.012
illy+0.012
rou+0.012
ally+0.012
Neg logit contributions
rac-0.007
adoes-0.007
 passports-0.007
 teeth-0.007
ver-0.007
Token listen
Token ID6004
Feature activation-3.58e-03

Pos logit contributions
is+0.022
andy+0.021
ally+0.020
rou+0.020
illy+0.020
Neg logit contributions
rac-0.010
 passports-0.009
adoes-0.009
 teeth-0.009
ver-0.009
Token.
Token ID13
Feature activation-7.77e-04

Pos logit contributions
rac+0.003
 passports+0.003
adoes+0.003
 teeth+0.003
ver+0.003
Neg logit contributions
is-0.007
andy-0.007
rou-0.007
illy-0.006
ally-0.006
Token She
Token ID1375
Feature activation+2.39e-03

Pos logit contributions
rac+0.001
adoes+0.001
 teeth+0.001
 passports+0.001
ver+0.001
Neg logit contributions
is-0.001
andy-0.001
ally-0.001
illy-0.001
rou-0.001
Token threw
Token ID9617
Feature activation+1.21e-02

Pos logit contributions
is+0.005
andy+0.004
illy+0.004
ally+0.004
rou+0.004
Neg logit contributions
rac-0.002
 passports-0.002
adoes-0.002
 teeth-0.002
ver-0.002
Token a
Token ID257
Feature activation-1.53e-03

Pos logit contributions
is+0.023
andy+0.021
illy+0.021
rou+0.021
ally+0.021
Neg logit contributions
rac-0.012
 passports-0.012
adoes-0.011
ver-0.011
 teeth-0.011
Token tant
Token ID24246
Feature activation-6.30e-02

Pos logit contributions
rac+0.001
adoes+0.001
 passports+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.003
andy-0.003
ally-0.003
illy-0.003
rou-0.003
Tokenrum
Token ID6582
Feature activation-1.40e-03

Pos logit contributions
 passports+0.098
rac+0.091
adoes+0.090
 turns+0.088
 astronauts+0.088
Neg logit contributions
is-0.095
illy-0.082
andy-0.082
isy-0.081
ally-0.079
Token in
Token ID287
Feature activation+8.32e-03

Pos logit contributions
adoes+0.001
 teeth+0.001
rac+0.001
 passports+0.001
 there+0.001
Neg logit contributions
is-0.003
andy-0.003
ally-0.003
illy-0.003
icol-0.003
Token the
Token ID262
Feature activation+4.72e-03

Pos logit contributions
is+0.017
andy+0.017
icol+0.016
ally+0.016
rou+0.016
Neg logit contributions
rac-0.007
 passports-0.007
 teeth-0.007
adoes-0.007
 turns-0.006
Token store
Token ID3650
Feature activation-1.12e-02

Pos logit contributions
is+0.008
andy+0.008
ally+0.008
illy+0.008
rou+0.008
Neg logit contributions
 passports-0.006
 teeth-0.005
rac-0.005
 turns-0.005
adoes-0.005
Token.
Token ID13
Feature activation+1.07e-03

Pos logit contributions
 passports+0.011
adoes+0.011
rac+0.011
 teeth+0.011
ver+0.010
Neg logit contributions
is-0.021
andy-0.021
illy-0.020
icol-0.020
ally-0.020
Token She
Token ID1375
Feature activation+1.81e-03

Pos logit contributions
is+0.003
andy+0.002
ally+0.002
illy+0.002
rou+0.002
Neg logit contributions
rac-0.002
 passports-0.002
adoes-0.002
 teeth-0.002
ver-0.002
Token cried
Token ID16896
Feature activation+8.50e-03

Pos logit contributions
is+0.003
andy+0.003
ally+0.003
illy+0.003
rou+0.003
Neg logit contributions
rac-0.002
adoes-0.002
 passports-0.002
 teeth-0.002
ver-0.002
Token and
Token ID290
Feature activation+5.92e-03

Pos logit contributions
is+0.013
illy+0.012
andy+0.012
 crawling+0.011
ally+0.011
Neg logit contributions
rac-0.012
 passports-0.012
adoes-0.011
ver-0.011
 teeth-0.010
Token threw
Token ID9617
Feature activation+1.42e-02

Pos logit contributions
is+0.011
andy+0.010
ally+0.010
illy+0.010
rou+0.010
Neg logit contributions
rac-0.007
adoes-0.006
 passports-0.006
ver-0.006
 teeth-0.006
Token a
Token ID257
Feature activation-1.71e-03

Pos logit contributions
is+0.023
andy+0.021
illy+0.021
ally+0.021
rou+0.020
Neg logit contributions
rac-0.019
 passports-0.018
adoes-0.018
ver-0.017
 teeth-0.017
Token tant
Token ID24246
Feature activation-6.29e-02

Pos logit contributions
 passports+0.001
rac+0.001
 teeth+0.001
adoes+0.001
 tant+0.001
Neg logit contributions
is-0.004
andy-0.004
illy-0.004
ally-0.004
rou-0.004
Tokenrum
Token ID6582
Feature activation-2.60e-03

Pos logit contributions
 passports+0.099
rac+0.092
adoes+0.090
 turns+0.090
 astronauts+0.088
Neg logit contributions
is-0.094
illy-0.081
andy-0.081
isy-0.080
ally-0.079
Token.
Token ID13
Feature activation+3.87e-03

Pos logit contributions
adoes+0.003
 teeth+0.003
rac+0.003
 passports+0.003
ver+0.003
Neg logit contributions
is-0.005
andy-0.004
ally-0.004
illy-0.004
icol-0.004
Token Her
Token ID2332
Feature activation-3.42e-03

Pos logit contributions
is+0.006
ally+0.005
andy+0.005
illy+0.005
icol+0.005
Neg logit contributions
 passports-0.006
rac-0.005
adoes-0.005
ver-0.005
 teeth-0.005
Token mom
Token ID1995
Feature activation+4.48e-04

Pos logit contributions
rac+0.003
adoes+0.003
 passports+0.003
ver+0.003
 teeth+0.002
Neg logit contributions
is-0.007
andy-0.007
ally-0.006
rou-0.006
 gn-0.006
Token tried
Token ID3088
Feature activation+1.48e-03

Pos logit contributions
is+0.001
icol+0.001
andy+0.001
 freshly+0.001
illy+0.001
Neg logit contributions
rac-0.000
ver-0.000
 turns-0.000
 teeth-0.000
 passports-0.000
Token her
Token ID607
Feature activation-1.65e-03

Pos logit contributions
rac+0.009
 passports+0.008
adoes+0.008
ver+0.008
 teeth+0.008
Neg logit contributions
is-0.012
andy-0.011
ally-0.011
illy-0.011
rou-0.011
Token feet
Token ID3625
Feature activation-6.10e-03

Pos logit contributions
rac+0.002
adoes+0.002
 passports+0.002
ver+0.001
 teeth+0.001
Neg logit contributions
is-0.003
andy-0.003
ally-0.003
rou-0.003
illy-0.003
Token and
Token ID290
Feature activation+9.54e-03

Pos logit contributions
ver+0.005
rac+0.004
 there+0.004
 teeth+0.004
adoes+0.004
Neg logit contributions
icol-0.013
andy-0.013
is-0.013
 freshly-0.013
illy-0.013
Token threw
Token ID9617
Feature activation+1.48e-02

Pos logit contributions
is+0.017
andy+0.016
ally+0.016
illy+0.016
rou+0.015
Neg logit contributions
rac-0.011
 passports-0.011
adoes-0.011
 teeth-0.010
ver-0.010
Token a
Token ID257
Feature activation-1.25e-03

Pos logit contributions
is+0.029
andy+0.027
ally+0.027
illy+0.027
rou+0.027
Neg logit contributions
rac-0.014
 passports-0.013
adoes-0.013
 teeth-0.013
ver-0.012
Token tant
Token ID24246
Feature activation-6.28e-02

Pos logit contributions
rac+0.000
 passports+0.000
adoes+0.000
 teeth+0.000
ver+0.000
Neg logit contributions
is-0.003
andy-0.003
illy-0.003
ally-0.003
rou-0.003
Tokenrum
Token ID6582
Feature activation+4.23e-05

Pos logit contributions
 passports+0.093
rac+0.089
adoes+0.088
 turns+0.086
ver+0.083
Neg logit contributions
is-0.096
andy-0.085
illy-0.085
isy-0.083
ally-0.082
Token.
Token ID13
Feature activation+3.69e-03

Pos logit contributions
is+0.000
andy+0.000
ally+0.000
icol+0.000
 freshly+0.000
Neg logit contributions
adoes-0.000
 teeth-0.000
rac-0.000
 anyone-0.000
 there-0.000
Token She
Token ID1375
Feature activation+2.84e-03

Pos logit contributions
is+0.006
andy+0.006
ally+0.005
illy+0.005
rou+0.005
Neg logit contributions
 passports-0.005
rac-0.005
adoes-0.005
ver-0.004
 teeth-0.004
Token grabbed
Token ID13176
Feature activation+1.91e-04

Pos logit contributions
is+0.005
andy+0.005
ally+0.004
rou+0.004
illy+0.004
Neg logit contributions
rac-0.003
adoes-0.003
 passports-0.003
ver-0.003
 teeth-0.003
Token the
Token ID262
Feature activation-1.95e-03

Pos logit contributions
is+0.000
andy+0.000
illy+0.000
ally+0.000
rou+0.000
Neg logit contributions
rac-0.000
adoes-0.000
 passports-0.000
 teeth-0.000
ver-0.000
Token.
Token ID13
Feature activation-1.64e-03

Pos logit contributions
is+0.000
andy+0.000
ally+0.000
illy+0.000
rou+0.000
Neg logit contributions
rac-0.000
 passports-0.000
adoes-0.000
 teeth-0.000
ver-0.000
Token He
Token ID679
Feature activation+3.05e-03

Pos logit contributions
 passports+0.002
rac+0.002
adoes+0.002
ver+0.002
 teeth+0.002
Neg logit contributions
is-0.002
andy-0.002
 gn-0.002
 sav-0.002
ally-0.002
Token kept
Token ID4030
Feature activation+5.17e-03

Pos logit contributions
is+0.006
andy+0.005
ally+0.005
rou+0.005
illy+0.005
Neg logit contributions
rac-0.003
adoes-0.003
 passports-0.003
 teeth-0.003
ver-0.003
Token throwing
Token ID9644
Feature activation+1.10e-02

Pos logit contributions
is+0.007
andy+0.007
ally+0.007
illy+0.007
isy+0.007
Neg logit contributions
 passports-0.008
 teeth-0.008
rac-0.008
adoes-0.008
 turns-0.008
Token a
Token ID257
Feature activation+2.67e-04

Pos logit contributions
is+0.020
 sav+0.018
 gn+0.018
 freshly+0.018
andy+0.018
Neg logit contributions
rac-0.012
ver-0.012
 passports-0.011
adoes-0.011
cases-0.010
Token tant
Token ID24246
Feature activation-6.16e-02

Pos logit contributions
is+0.000
andy+0.000
illy+0.000
ally+0.000
rou+0.000
Neg logit contributions
rac-0.000
 passports-0.000
adoes-0.000
 teeth-0.000
ver-0.000
Tokenrum
Token ID6582
Feature activation+3.97e-03

Pos logit contributions
 passports+0.098
adoes+0.091
rac+0.090
 turns+0.089
 astronauts+0.089
Neg logit contributions
is-0.091
isy-0.077
andy-0.077
illy-0.076
ally-0.075
Token.
Token ID13
Feature activation+3.11e-03

Pos logit contributions
is+0.007
andy+0.007
ally+0.007
icol+0.007
illy+0.007
Neg logit contributions
adoes-0.004
 teeth-0.004
rac-0.004
 passports-0.004
ver-0.004
Token His
Token ID2399
Feature activation-1.22e-03

Pos logit contributions
is+0.004
 sav+0.004
 gn+0.004
andy+0.004
ally+0.004
Neg logit contributions
 passports-0.004
rac-0.004
adoes-0.004
ver-0.004
 guys-0.004
Token mom
Token ID1995
Feature activation+3.33e-03

Pos logit contributions
rac+0.001
adoes+0.001
 passports+0.001
ver+0.001
 teeth+0.001
Neg logit contributions
is-0.002
andy-0.002
rou-0.002
ally-0.002
illy-0.002
Token was
Token ID373
Feature activation+1.48e-03

Pos logit contributions
is+0.006
andy+0.006
icol+0.006
illy+0.006
ally+0.006
Neg logit contributions
rac-0.003
 passports-0.003
ver-0.003
 teeth-0.003
adoes-0.003
Token to
Token ID284
Feature activation-4.99e-03

Pos logit contributions
is+0.039
andy+0.037
ally+0.037
illy+0.037
rou+0.036
Neg logit contributions
 passports-0.011
rac-0.011
 teeth-0.011
adoes-0.011
 turns-0.010
Token cry
Token ID3960
Feature activation+1.25e-02

Pos logit contributions
 teeth+0.004
 passports+0.004
adoes+0.004
rac+0.004
 turns+0.004
Neg logit contributions
is-0.011
illy-0.010
rou-0.010
andy-0.010
ally-0.010
Token and
Token ID290
Feature activation+7.55e-03

Pos logit contributions
is+0.022
andy+0.021
ally+0.020
illy+0.020
rou+0.020
Neg logit contributions
rac-0.014
 passports-0.014
adoes-0.014
 teeth-0.013
ver-0.013
Token threw
Token ID9617
Feature activation+1.70e-02

Pos logit contributions
is+0.014
andy+0.013
rou+0.013
illy+0.013
ally+0.013
Neg logit contributions
rac-0.008
 passports-0.008
adoes-0.008
 teeth-0.008
 turns-0.007
Token a
Token ID257
Feature activation-1.70e-03

Pos logit contributions
is+0.028
andy+0.026
illy+0.025
ally+0.025
rou+0.025
Neg logit contributions
rac-0.021
 passports-0.021
adoes-0.021
ver-0.020
 teeth-0.019
Token tant
Token ID24246
Feature activation-6.15e-02

Pos logit contributions
rac+0.002
 passports+0.002
adoes+0.002
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.003
andy-0.003
illy-0.003
ally-0.003
rou-0.003
Tokenrum
Token ID6582
Feature activation-2.69e-03

Pos logit contributions
 passports+0.093
rac+0.087
adoes+0.087
 turns+0.085
 astronauts+0.082
Neg logit contributions
is-0.093
illy-0.083
andy-0.083
isy-0.081
ally-0.080
Token.
Token ID13
Feature activation+3.36e-03

Pos logit contributions
adoes+0.003
 teeth+0.003
rac+0.003
 there+0.003
 anyone+0.003
Neg logit contributions
is-0.005
andy-0.005
ally-0.005
illy-0.005
 freshly-0.005
Token She
Token ID1375
Feature activation+3.44e-03

Pos logit contributions
is+0.005
andy+0.005
ally+0.005
illy+0.005
icol+0.005
Neg logit contributions
 passports-0.005
rac-0.005
adoes-0.004
ver-0.004
 teeth-0.004
Token didn
Token ID1422
Feature activation-1.40e-02

Pos logit contributions
is+0.006
andy+0.006
ally+0.006
rou+0.005
illy+0.005
Neg logit contributions
rac-0.004
adoes-0.004
 passports-0.004
ver-0.004
 teeth-0.004
Token't
Token ID470
Feature activation+3.24e-03

Pos logit contributions
rac+0.012
 passports+0.012
adoes+0.012
 teeth+0.011
ver+0.011
Neg logit contributions
is-0.029
andy-0.027
ally-0.026
illy-0.026
rou-0.026
Token kept
Token ID4030
Feature activation+7.35e-03

Pos logit contributions
is+0.012
andy+0.011
ally+0.011
illy+0.011
rou+0.011
Neg logit contributions
rac-0.007
 passports-0.006
adoes-0.006
 teeth-0.006
ver-0.006
Token crying
Token ID13774
Feature activation+7.65e-03

Pos logit contributions
is+0.010
andy+0.009
ally+0.009
isy+0.009
illy+0.009
Neg logit contributions
 teeth-0.012
 passports-0.012
adoes-0.011
 turns-0.011
 there-0.011
Token and
Token ID290
Feature activation+6.41e-03

Pos logit contributions
is+0.013
andy+0.012
ally+0.012
illy+0.012
rou+0.012
Neg logit contributions
rac-0.009
 passports-0.009
adoes-0.009
 teeth-0.008
ver-0.008
Token throwing
Token ID9644
Feature activation+1.05e-02

Pos logit contributions
is+0.011
andy+0.010
icol+0.010
ally+0.010
illy+0.010
Neg logit contributions
 there-0.008
 teeth-0.008
 anyone-0.008
 neither-0.008
rac-0.008
Token a
Token ID257
Feature activation+1.14e-03

Pos logit contributions
is+0.019
andy+0.017
illy+0.017
ally+0.017
rou+0.017
Neg logit contributions
rac-0.012
 passports-0.012
adoes-0.011
ver-0.011
 teeth-0.011
Token tant
Token ID24246
Feature activation-6.13e-02

Pos logit contributions
is+0.002
andy+0.002
ally+0.002
 freshly+0.002
rou+0.002
Neg logit contributions
rac-0.001
 passports-0.001
adoes-0.001
ver-0.001
 teeth-0.001
Tokenrum
Token ID6582
Feature activation+9.21e-04

Pos logit contributions
 passports+0.094
adoes+0.088
rac+0.088
 turns+0.085
 astronauts+0.084
Neg logit contributions
is-0.092
illy-0.082
andy-0.081
isy-0.080
ally-0.079
Token.
Token ID13
Feature activation+7.76e-03

Pos logit contributions
is+0.002
andy+0.002
ally+0.002
icol+0.002
 gn+0.002
Neg logit contributions
adoes-0.001
 teeth-0.001
rac-0.001
ver-0.001
 anyone-0.001
Token
Token ID198
Feature activation+1.49e-03

Pos logit contributions
is+0.013
andy+0.012
ally+0.012
illy+0.012
rou+0.012
Neg logit contributions
 passports-0.010
rac-0.010
adoes-0.009
 teeth-0.009
ver-0.009
Token
Token ID198
Feature activation-4.96e-04

Pos logit contributions
is+0.003
andy+0.003
ally+0.002
illy+0.002
rou+0.002
Neg logit contributions
 passports-0.002
rac-0.002
adoes-0.002
 teeth-0.002
ver-0.001
TokenFinally
Token ID11158
Feature activation+1.72e-02

Pos logit contributions
rac+0.001
adoes+0.001
 passports+0.001
 turns+0.001
 teeth+0.001
Neg logit contributions
is-0.001
icol-0.001
andy-0.001
 freshly-0.001
rou-0.001
Token sad
Token ID6507
Feature activation-4.92e-03

Pos logit contributions
is+0.010
andy+0.009
illy+0.009
ally+0.009
rou+0.009
Neg logit contributions
rac-0.004
adoes-0.004
 passports-0.004
 teeth-0.004
 turns-0.004
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
adoes+0.006
rac+0.006
 teeth+0.006
 passports+0.006
ver+0.006
Neg logit contributions
is-0.008
andy-0.008
ally-0.008
illy-0.008
rou-0.008
Token She
Token ID1375
Feature activation+8.07e-03

Pos logit contributions
is+0.026
andy+0.024
rou+0.024
illy+0.024
 gn+0.023
Neg logit contributions
rac-0.015
adoes-0.014
 teeth-0.014
 passports-0.014
 turns-0.013
Token threw
Token ID9617
Feature activation+1.76e-02

Pos logit contributions
is+0.017
andy+0.016
illy+0.015
rou+0.015
icol+0.015
Neg logit contributions
rac-0.007
 teeth-0.007
adoes-0.007
 passports-0.006
 turns-0.006
Token a
Token ID257
Feature activation+3.30e-03

Pos logit contributions
is+0.033
andy+0.030
illy+0.030
ally+0.030
 freshly+0.029
Neg logit contributions
rac-0.018
 passports-0.018
adoes-0.017
ver-0.017
 teeth-0.016
Token tant
Token ID24246
Feature activation-6.09e-02

Pos logit contributions
is+0.007
andy+0.007
illy+0.007
ally+0.007
rou+0.007
Neg logit contributions
 passports-0.002
rac-0.002
adoes-0.002
 teeth-0.002
 turns-0.002
Tokenrum
Token ID6582
Feature activation-1.65e-03

Pos logit contributions
 passports+0.093
rac+0.087
adoes+0.087
 turns+0.085
 astronauts+0.083
Neg logit contributions
is-0.091
illy-0.082
andy-0.081
isy-0.080
ally-0.079
Token and
Token ID290
Feature activation+5.34e-03

Pos logit contributions
adoes+0.002
rac+0.001
 teeth+0.001
 passports+0.001
 there+0.001
Neg logit contributions
is-0.003
andy-0.003
ally-0.003
illy-0.003
 gn-0.003
Token cried
Token ID16896
Feature activation+1.38e-02

Pos logit contributions
is+0.011
andy+0.010
illy+0.010
rou+0.010
ally+0.010
Neg logit contributions
 teeth-0.005
 passports-0.005
adoes-0.005
rac-0.005
 turns-0.005
Token loudly
Token ID23112
Feature activation+4.27e-03

Pos logit contributions
is+0.026
andy+0.025
ally+0.024
icol+0.024
rou+0.024
Neg logit contributions
adoes-0.015
 teeth-0.014
rac-0.014
ver-0.014
 passports-0.014
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
is+0.008
andy+0.008
ally+0.008
illy+0.007
rou+0.007
Neg logit contributions
 teeth-0.005
adoes-0.005
 passports-0.004
rac-0.004
 turns-0.004
Token and
Token ID290
Feature activation+5.36e-03

Pos logit contributions
is+0.023
andy+0.021
ally+0.021
 freshly+0.021
icol+0.021
Neg logit contributions
adoes-0.011
 teeth-0.011
rac-0.011
 passports-0.011
 turns-0.010
Token started
Token ID2067
Feature activation+1.96e-02

Pos logit contributions
is+0.009
andy+0.009
ally+0.009
illy+0.009
rou+0.009
Neg logit contributions
rac-0.006
 passports-0.006
adoes-0.006
ver-0.006
 teeth-0.006
Token to
Token ID284
Feature activation+8.12e-04

Pos logit contributions
is+0.046
andy+0.044
ally+0.044
illy+0.044
icol+0.043
Neg logit contributions
 teeth-0.011
 passports-0.010
rac-0.010
adoes-0.010
 turns-0.009
Token throw
Token ID3714
Feature activation+1.75e-02

Pos logit contributions
is+0.002
rou+0.002
illy+0.002
andy+0.002
icol+0.002
Neg logit contributions
 teeth-0.001
 turns-0.001
adoes-0.001
 passports-0.001
rac-0.001
Token a
Token ID257
Feature activation+1.44e-03

Pos logit contributions
is+0.029
 sav+0.026
andy+0.026
illy+0.026
 crawling+0.026
Neg logit contributions
ver-0.023
rac-0.023
 passports-0.022
cases-0.021
adoes-0.021
Token tant
Token ID24246
Feature activation-6.02e-02

Pos logit contributions
is+0.002
andy+0.002
isy+0.002
 bisc+0.002
rou+0.002
Neg logit contributions
rac-0.002
adoes-0.002
ver-0.002
 passports-0.002
 teeth-0.002
Tokenrum
Token ID6582
Feature activation+3.20e-03

Pos logit contributions
 passports+0.090
rac+0.084
adoes+0.084
 turns+0.082
 astronauts+0.080
Neg logit contributions
is-0.092
andy-0.082
illy-0.082
isy-0.080
ally-0.080
Token.
Token ID13
Feature activation+6.01e-03

Pos logit contributions
is+0.007
andy+0.006
icol+0.006
ally+0.006
 freshly+0.006
Neg logit contributions
adoes-0.003
 teeth-0.003
 there-0.003
 anyone-0.003
rac-0.003
Token Suddenly
Token ID24975
Feature activation+1.09e-02

Pos logit contributions
is+0.008
ally+0.007
andy+0.007
icol+0.007
 sav+0.007
Neg logit contributions
 passports-0.010
rac-0.009
ver-0.009
adoes-0.009
 astronauts-0.009
Token,
Token ID11
Feature activation-1.38e-02

Pos logit contributions
is+0.015
andy+0.014
illy+0.014
ally+0.014
 crawling+0.014
Neg logit contributions
rac-0.017
 passports-0.016
adoes-0.016
ver-0.015
 teeth-0.015
Token she
Token ID673
Feature activation+1.88e-03

Pos logit contributions
rac+0.023
 passports+0.021
adoes+0.021
 teeth+0.021
ver+0.021
Neg logit contributions
is-0.017
illy-0.016
 crawling-0.016
andy-0.016
 gn-0.016
Token behind
Token ID2157
Feature activation+1.63e-02

Pos logit contributions
 passports+0.003
rac+0.003
adoes+0.003
ver+0.003
 teeth+0.003
Neg logit contributions
is-0.009
andy-0.009
ally-0.009
illy-0.009
icol-0.009
Token them
Token ID606
Feature activation-3.08e-03

Pos logit contributions
is+0.040
rou+0.039
andy+0.038
ally+0.038
icol+0.038
Neg logit contributions
 passports-0.008
rac-0.007
 teeth-0.007
 turns-0.007
adoes-0.007
Token and
Token ID290
Feature activation+4.58e-03

Pos logit contributions
 passports+0.004
rac+0.004
 teeth+0.003
adoes+0.003
ver+0.003
Neg logit contributions
is-0.005
andy-0.005
ally-0.005
illy-0.005
icol-0.005
Token says
Token ID1139
Feature activation-4.85e-03

Pos logit contributions
is+0.008
 freshly+0.007
 gn+0.007
 sav+0.007
andy+0.007
Neg logit contributions
rac-0.005
adoes-0.005
ver-0.005
 passports-0.005
 teeth-0.005
Token "
Token ID366
Feature activation-6.81e-03

Pos logit contributions
rac+0.007
adoes+0.007
 passports+0.007
 tant+0.006
 teeth+0.006
Neg logit contributions
is-0.007
 gn-0.006
andy-0.006
 crawling-0.006
illy-0.006
Tokenbo
Token ID2127
Feature activation-5.91e-02

Pos logit contributions
rac+0.010
 passports+0.010
adoes+0.010
 teeth+0.010
ver+0.009
Neg logit contributions
is-0.010
andy-0.009
ally-0.009
illy-0.009
rou-0.009
Tokeno
Token ID78
Feature activation-3.19e-02

Pos logit contributions
adoes+0.115
 passports+0.110
rac+0.109
 teeth+0.103
 temples+0.102
Neg logit contributions
is-0.061
andy-0.056
rou-0.055
illy-0.051
ally-0.051
Token!"
Token ID2474
Feature activation-4.62e-03

Pos logit contributions
rac+0.037
adoes+0.035
 passports+0.035
 teeth+0.035
ver+0.034
Neg logit contributions
is-0.056
andy-0.052
icol-0.051
illy-0.051
 freshly-0.051
Token Lily
Token ID20037
Feature activation-5.03e-03

Pos logit contributions
rac+0.006
adoes+0.006
 passports+0.006
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.008
andy-0.007
illy-0.007
ally-0.007
rou-0.007
Token and
Token ID290
Feature activation+1.11e-02

Pos logit contributions
rac+0.007
 passports+0.006
adoes+0.006
ver+0.006
 anyone+0.006
Neg logit contributions
is-0.008
andy-0.008
rou-0.007
 gn-0.007
 sav-0.007
Token Ben
Token ID3932
Feature activation-1.19e-02

Pos logit contributions
is+0.018
andy+0.017
ally+0.017
rou+0.017
illy+0.017
Neg logit contributions
rac-0.014
 passports-0.014
adoes-0.014
 teeth-0.013
ver-0.013
Token stubborn
Token ID24444
Feature activation+2.01e-03

Pos logit contributions
is+0.014
andy+0.013
rou+0.013
illy+0.012
ally+0.012
Neg logit contributions
rac-0.007
 passports-0.007
adoes-0.007
 teeth-0.006
ver-0.006
Token.
Token ID13
Feature activation+1.21e-03

Pos logit contributions
is+0.003
andy+0.003
ally+0.003
icol+0.003
illy+0.003
Neg logit contributions
 teeth-0.002
 passports-0.002
adoes-0.002
rac-0.002
 turns-0.002
Token She
Token ID1375
Feature activation+2.58e-03

Pos logit contributions
is+0.001
andy+0.001
illy+0.001
 freshly+0.001
 sav+0.001
Neg logit contributions
 passports-0.002
rac-0.002
adoes-0.002
ver-0.002
 guys-0.002
Token throws
Token ID12542
Feature activation+1.32e-02

Pos logit contributions
is+0.006
ally+0.005
andy+0.005
illy+0.005
icol+0.005
Neg logit contributions
rac-0.002
 teeth-0.002
 passports-0.002
 turns-0.002
adoes-0.002
Token a
Token ID257
Feature activation+4.93e-05

Pos logit contributions
is+0.023
 sav+0.021
 freshly+0.021
andy+0.021
 gn+0.021
Neg logit contributions
rac-0.015
ver-0.015
adoes-0.015
 passports-0.014
cases-0.014
Token tant
Token ID24246
Feature activation-5.87e-02

Pos logit contributions
is+0.000
andy+0.000
ally+0.000
illy+0.000
rou+0.000
Neg logit contributions
rac-0.000
 passports-0.000
adoes-0.000
 teeth-0.000
 turns-0.000
Tokenrum
Token ID6582
Feature activation+9.52e-04

Pos logit contributions
 passports+0.093
rac+0.086
 astronauts+0.086
adoes+0.086
 turns+0.086
Neg logit contributions
is-0.086
isy-0.074
illy-0.074
andy-0.074
ally-0.072
Token.
Token ID13
Feature activation+5.13e-03

Pos logit contributions
is+0.002
andy+0.002
ally+0.002
illy+0.002
icol+0.002
Neg logit contributions
adoes-0.001
rac-0.001
 teeth-0.001
 passports-0.001
ver-0.001
Token She
Token ID1375
Feature activation+2.31e-03

Pos logit contributions
is+0.006
andy+0.006
 sav+0.006
illy+0.006
 freshly+0.006
Neg logit contributions
 passports-0.009
rac-0.008
adoes-0.008
ver-0.008
 guys-0.008
Token kicks
Token ID17364
Feature activation+1.40e-02

Pos logit contributions
is+0.005
andy+0.004
ally+0.004
illy+0.004
rou+0.004
Neg logit contributions
rac-0.002
 passports-0.002
adoes-0.002
 teeth-0.002
ver-0.002
Token and
Token ID290
Feature activation+7.58e-03

Pos logit contributions
is+0.026
andy+0.023
illy+0.023
ally+0.022
 sav+0.022
Neg logit contributions
 passports-0.016
rac-0.016
adoes-0.015
ver-0.015
cases-0.014
TokenAre
Token ID8491
Feature activation-5.50e-03

Pos logit contributions
rac+0.010
 passports+0.010
adoes+0.010
 teeth+0.010
ver+0.010
Neg logit contributions
is-0.009
andy-0.008
illy-0.008
ally-0.008
rou-0.008
Token you
Token ID345
Feature activation-6.55e-03

Pos logit contributions
 passports+0.007
rac+0.007
adoes+0.007
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.009
andy-0.009
illy-0.008
ally-0.008
rou-0.008
Token hurt
Token ID5938
Feature activation+5.10e-03

Pos logit contributions
rac+0.007
 passports+0.007
adoes+0.006
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.013
rou-0.012
andy-0.012
illy-0.012
ally-0.012
Token?
Token ID30
Feature activation+4.74e-03

Pos logit contributions
is+0.011
andy+0.010
ally+0.010
illy+0.010
rou+0.010
Neg logit contributions
 passports-0.004
rac-0.004
adoes-0.004
 teeth-0.004
ver-0.004
Token Why
Token ID4162
Feature activation-4.21e-03

Pos logit contributions
is+0.008
andy+0.008
rou+0.008
illy+0.008
ally+0.008
Neg logit contributions
rac-0.005
 passports-0.005
adoes-0.005
 teeth-0.005
ver-0.005
Token did
Token ID750
Feature activation-5.84e-02

Pos logit contributions
 passports+0.006
rac+0.006
 Lots+0.006
adoes+0.006
 temples+0.006
Neg logit contributions
is-0.005
rou-0.005
andy-0.005
scar-0.005
illy-0.005
Token you
Token ID345
Feature activation-4.93e-04

Pos logit contributions
ver+0.086
adoes+0.085
rac+0.083
outed+0.083
 passports+0.083
Neg logit contributions
is-0.080
andy-0.070
 gn-0.069
 sav-0.069
 lending-0.069
Token do
Token ID466
Feature activation-3.22e-02

Pos logit contributions
 passports+0.001
adoes+0.001
rac+0.001
 teeth+0.001
 turns+0.001
Neg logit contributions
is-0.001
andy-0.001
ally-0.001
rou-0.001
illy-0.001
Token that
Token ID326
Feature activation-1.08e-02

Pos logit contributions
rac+0.042
 passports+0.041
adoes+0.041
 teeth+0.040
ver+0.038
Neg logit contributions
is-0.052
andy-0.049
ally-0.048
illy-0.047
rou-0.047
Token?
Token ID30
Feature activation+6.63e-03

Pos logit contributions
rac+0.006
 passports+0.005
adoes+0.005
 teeth+0.005
ver+0.005
Neg logit contributions
is-0.026
andy-0.025
illy-0.024
ally-0.024
rou-0.024
Token Why
Token ID4162
Feature activation-5.92e-03

Pos logit contributions
is+0.012
andy+0.011
ally+0.011
illy+0.010
rou+0.010
Neg logit contributions
rac-0.008
adoes-0.008
 passports-0.008
 teeth-0.007
ver-0.007
Token surprise
Token ID5975
Feature activation+5.94e-03

Pos logit contributions
 passports+0.001
 teeth+0.001
 guys+0.001
adoes+0.001
 there+0.001
Neg logit contributions
andy-0.002
ally-0.002
illy-0.002
is-0.002
rou-0.002
Token.
Token ID13
Feature activation-1.25e-03

Pos logit contributions
is+0.011
rou+0.011
icol+0.011
andy+0.011
ally+0.010
Neg logit contributions
 passports-0.006
 teeth-0.006
 anyone-0.006
adoes-0.006
 guys-0.006
Token He
Token ID679
Feature activation+4.90e-03

Pos logit contributions
 passports+0.002
rac+0.002
adoes+0.002
 microphones+0.002
 guys+0.002
Neg logit contributions
 sav-0.001
 gn-0.001
is-0.001
icol-0.001
andy-0.001
Token threw
Token ID9617
Feature activation+1.39e-02

Pos logit contributions
is+0.010
andy+0.009
illy+0.009
ally+0.009
rou+0.009
Neg logit contributions
rac-0.005
 passports-0.005
adoes-0.005
 teeth-0.004
ver-0.004
Token a
Token ID257
Feature activation+3.52e-03

Pos logit contributions
is+0.024
 sav+0.023
 gn+0.022
 crawling+0.022
 freshly+0.022
Neg logit contributions
ver-0.016
rac-0.016
 passports-0.014
adoes-0.014
cases-0.014
Token tant
Token ID24246
Feature activation-5.77e-02

Pos logit contributions
is+0.006
andy+0.006
illy+0.006
ally+0.006
rou+0.006
Neg logit contributions
rac-0.004
 passports-0.004
adoes-0.004
 teeth-0.004
 turns-0.004
Tokenrum
Token ID6582
Feature activation-4.84e-04

Pos logit contributions
 passports+0.089
rac+0.084
adoes+0.083
 turns+0.082
 astronauts+0.081
Neg logit contributions
is-0.086
andy-0.074
illy-0.074
isy-0.073
ally-0.072
Token.
Token ID13
Feature activation+3.90e-03

Pos logit contributions
rac+0.001
adoes+0.001
 passports+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.001
andy-0.001
ally-0.001
illy-0.001
rou-0.001
Token He
Token ID679
Feature activation+5.31e-03

Pos logit contributions
 sav+0.005
is+0.005
 gn+0.005
 chewing+0.005
icol+0.004
Neg logit contributions
 passports-0.006
rac-0.006
adoes-0.006
 guys-0.006
ver-0.005
Token kicked
Token ID12165
Feature activation+1.59e-02

Pos logit contributions
is+0.009
andy+0.008
 sav+0.008
 gn+0.008
rou+0.008
Neg logit contributions
rac-0.007
adoes-0.007
 passports-0.006
ver-0.006
 teeth-0.006
Token and
Token ID290
Feature activation+9.47e-03

Pos logit contributions
is+0.029
 sav+0.027
andy+0.027
 gn+0.026
ally+0.026
Neg logit contributions
rac-0.017
 passports-0.017
ver-0.016
adoes-0.016
cases-0.015
Token make
Token ID787
Feature activation-7.13e-03

Pos logit contributions
rac+0.006
 passports+0.006
adoes+0.006
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.012
andy-0.011
illy-0.011
ally-0.011
rou-0.011
Token it
Token ID340
Feature activation-7.73e-03

Pos logit contributions
rac+0.005
 passports+0.005
adoes+0.005
 teeth+0.005
ver+0.004
Neg logit contributions
is-0.016
andy-0.015
ally-0.015
illy-0.015
rou-0.015
Token whole
Token ID2187
Feature activation-8.85e-03

Pos logit contributions
rac+0.010
 passports+0.010
adoes+0.010
 teeth+0.009
ver+0.009
Neg logit contributions
is-0.013
andy-0.012
rou-0.012
ally-0.011
illy-0.011
Token again
Token ID757
Feature activation+4.41e-03

Pos logit contributions
rac+0.011
 passports+0.010
adoes+0.010
ver+0.010
 teeth+0.009
Neg logit contributions
is-0.016
rou-0.014
andy-0.014
 sav-0.014
ally-0.014
Token.
Token ID13
Feature activation-4.53e-03

Pos logit contributions
is+0.009
andy+0.009
ally+0.009
illy+0.009
rou+0.009
Neg logit contributions
rac-0.003
 passports-0.003
adoes-0.003
 teeth-0.003
ver-0.003
Token Amen
Token ID34717
Feature activation-5.73e-02

Pos logit contributions
 passports+0.006
adoes+0.005
rac+0.005
 teeth+0.005
ver+0.005
Neg logit contributions
is-0.007
 sav-0.007
icol-0.007
 gn-0.007
andy-0.007
Token."
Token ID526
Feature activation-7.22e-03

Pos logit contributions
adoes+0.076
 guys+0.062
 teeth+0.061
 turns+0.061
 passports+0.060
Neg logit contributions
is-0.100
andy-0.090
rou-0.088
illy-0.085
ally-0.084
Token
Token ID198
Feature activation-1.13e-03

Pos logit contributions
rac+0.008
 passports+0.007
adoes+0.007
 teeth+0.007
ver+0.007
Neg logit contributions
is-0.013
andy-0.013
ally-0.012
illy-0.012
rou-0.012
Token
Token ID198
Feature activation-4.94e-03

Pos logit contributions
rac+0.001
 passports+0.001
 teeth+0.001
ver+0.001
 turns+0.001
Neg logit contributions
is-0.002
icol-0.002
rou-0.002
andy-0.002
illy-0.002
TokenThen
Token ID6423
Feature activation+5.60e-03

Pos logit contributions
rac+0.005
 there+0.005
 passports+0.004
 If+0.004
 turns+0.004
Neg logit contributions
icol-0.010
 freshly-0.009
zie-0.009
rou-0.009
con-0.009
Token,
Token ID11
Feature activation-1.54e-02

Pos logit contributions
is+0.009
andy+0.009
 gn+0.009
ally+0.009
illy+0.009
Neg logit contributions
rac-0.007
adoes-0.006
 passports-0.006
ver-0.006
 teeth-0.006
Token and
Token ID290
Feature activation-1.19e-03

Pos logit contributions
is+0.004
andy+0.004
illy+0.003
rou+0.003
ally+0.003
Neg logit contributions
rac-0.003
adoes-0.003
 passports-0.003
 teeth-0.003
ver-0.003
Token so
Token ID523
Feature activation-3.77e-03

Pos logit contributions
rac+0.001
 passports+0.001
adoes+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.002
andy-0.002
ally-0.002
illy-0.002
rou-0.002
Token she
Token ID673
Feature activation+5.12e-03

Pos logit contributions
rac+0.006
 passports+0.006
adoes+0.006
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.005
andy-0.005
rou-0.004
ally-0.004
illy-0.004
Token threw
Token ID9617
Feature activation+1.61e-02

Pos logit contributions
is+0.008
 sav+0.008
andy+0.007
ally+0.007
 gn+0.007
Neg logit contributions
adoes-0.007
rac-0.007
 passports-0.007
ver-0.007
packs-0.006
Token a
Token ID257
Feature activation-2.81e-03

Pos logit contributions
is+0.027
andy+0.025
illy+0.024
ally+0.024
 freshly+0.024
Neg logit contributions
rac-0.020
 passports-0.020
adoes-0.020
ver-0.020
 teeth-0.019
Token tant
Token ID24246
Feature activation-5.72e-02

Pos logit contributions
rac+0.004
adoes+0.003
 passports+0.003
 teeth+0.003
ver+0.003
Neg logit contributions
is-0.005
andy-0.004
ally-0.004
rou-0.004
illy-0.004
Tokenrum
Token ID6582
Feature activation-2.80e-03

Pos logit contributions
 passports+0.090
 turns+0.086
rac+0.085
adoes+0.084
 astronauts+0.081
Neg logit contributions
is-0.082
illy-0.074
isy-0.072
andy-0.072
 crawling-0.070
Token when
Token ID618
Feature activation-3.46e-03

Pos logit contributions
adoes+0.003
rac+0.003
 teeth+0.003
 passports+0.003
ver+0.003
Neg logit contributions
is-0.005
andy-0.005
ally-0.005
illy-0.005
 freshly-0.004
Token her
Token ID607
Feature activation-1.11e-02

Pos logit contributions
rac+0.004
 passports+0.004
adoes+0.004
 teeth+0.004
 anyone+0.004
Neg logit contributions
is-0.006
andy-0.005
rou-0.005
ally-0.005
illy-0.005
Token parents
Token ID3397
Feature activation-2.07e-02

Pos logit contributions
rac+0.009
 passports+0.009
adoes+0.009
 teeth+0.008
ver+0.008
Neg logit contributions
is-0.023
andy-0.022
ally-0.021
illy-0.021
rou-0.021
Token said
Token ID531
Feature activation-8.06e-03

Pos logit contributions
rac+0.020
 passports+0.020
adoes+0.019
ver+0.019
 teeth+0.019
Neg logit contributions
is-0.039
andy-0.037
rou-0.036
ally-0.036
icol-0.036
Token you
Token ID345
Feature activation-4.93e-04

Pos logit contributions
ver+0.086
adoes+0.085
rac+0.083
outed+0.083
 passports+0.083
Neg logit contributions
is-0.080
andy-0.070
 gn-0.069
 sav-0.069
 lending-0.069
Token do
Token ID466
Feature activation-3.22e-02

Pos logit contributions
 passports+0.001
adoes+0.001
rac+0.001
 teeth+0.001
 turns+0.001
Neg logit contributions
is-0.001
andy-0.001
ally-0.001
rou-0.001
illy-0.001
Token that
Token ID326
Feature activation-1.08e-02

Pos logit contributions
rac+0.042
 passports+0.041
adoes+0.041
 teeth+0.040
ver+0.038
Neg logit contributions
is-0.052
andy-0.049
ally-0.048
illy-0.047
rou-0.047
Token?
Token ID30
Feature activation+6.63e-03

Pos logit contributions
rac+0.006
 passports+0.005
adoes+0.005
 teeth+0.005
ver+0.005
Neg logit contributions
is-0.026
andy-0.025
illy-0.024
ally-0.024
rou-0.024
Token Why
Token ID4162
Feature activation-5.92e-03

Pos logit contributions
is+0.012
andy+0.011
ally+0.011
illy+0.010
rou+0.010
Neg logit contributions
rac-0.008
adoes-0.008
 passports-0.008
 teeth-0.007
ver-0.007
Token did
Token ID750
Feature activation-5.70e-02

Pos logit contributions
 Lots+0.008
rac+0.008
 passports+0.008
adoes+0.008
 temples+0.007
Neg logit contributions
is-0.008
rou-0.008
andy-0.008
scar-0.007
isy-0.007
Token you
Token ID345
Feature activation-1.06e-03

Pos logit contributions
adoes+0.079
ver+0.079
rac+0.078
 passports+0.078
outed+0.075
Neg logit contributions
is-0.083
andy-0.074
 crawling-0.073
ally-0.072
 gn-0.072
Token not
Token ID407
Feature activation+7.86e-03

Pos logit contributions
 passports+0.001
adoes+0.001
rac+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.002
andy-0.002
ally-0.002
rou-0.002
illy-0.002
Token listen
Token ID6004
Feature activation-1.36e-02

Pos logit contributions
is+0.014
andy+0.013
illy+0.013
ally+0.013
icol+0.013
Neg logit contributions
rac-0.009
adoes-0.008
 passports-0.008
 teeth-0.008
ver-0.008
Token to
Token ID284
Feature activation-1.08e-03

Pos logit contributions
rac+0.015
 passports+0.015
adoes+0.014
ver+0.014
 teeth+0.013
Neg logit contributions
is-0.025
rou-0.023
andy-0.022
illy-0.022
ally-0.022
Token me
Token ID502
Feature activation-8.15e-03

Pos logit contributions
rac+0.001
 passports+0.001
adoes+0.001
 teeth+0.001
 anyone+0.001
Neg logit contributions
is-0.002
rou-0.002
andy-0.002
ally-0.002
icol-0.002
Token to
Token ID284
Feature activation+1.28e-02

Pos logit contributions
is+0.007
rou+0.006
andy+0.006
scar+0.006
ally+0.006
Neg logit contributions
 there-0.002
 teeth-0.002
 passports-0.002
 anyone-0.002
adoes-0.002
Token help
Token ID1037
Feature activation+4.93e-03

Pos logit contributions
is+0.025
rou+0.024
andy+0.024
illy+0.023
ally+0.023
Neg logit contributions
rac-0.012
 passports-0.012
 teeth-0.011
adoes-0.011
ver-0.011
Token sort
Token ID3297
Feature activation-1.01e-02

Pos logit contributions
icol+0.011
is+0.011
rou+0.011
andy+0.010
 freshly+0.010
Neg logit contributions
 teeth-0.004
 anyone-0.004
 passports-0.003
adoes-0.003
rac-0.003
Token the
Token ID262
Feature activation+3.20e-03

Pos logit contributions
rac+0.011
 passports+0.010
adoes+0.010
 teeth+0.010
ver+0.010
Neg logit contributions
is-0.019
andy-0.017
illy-0.017
ally-0.017
rou-0.017
Token or
Token ID393
Feature activation+1.02e-02

Pos logit contributions
is+0.006
illy+0.006
andy+0.006
ally+0.006
icol+0.006
Neg logit contributions
 passports-0.004
 teeth-0.004
adoes-0.003
 turns-0.003
 voices-0.003
Tokennam
Token ID7402
Feature activation-5.62e-02

Pos logit contributions
is+0.020
andy+0.018
ally+0.018
 freshly+0.018
illy+0.018
Neg logit contributions
rac-0.010
adoes-0.010
 passports-0.009
 teeth-0.009
ver-0.009
Tokenents
Token ID658
Feature activation-1.14e-02

Pos logit contributions
adoes+0.083
 happens+0.081
 passports+0.079
 teeth+0.078
 guys+0.077
Neg logit contributions
is-0.090
ally-0.079
rou-0.079
andy-0.078
illy-0.073
Token.
Token ID13
Feature activation+3.12e-03

Pos logit contributions
rac+0.012
 passports+0.012
adoes+0.012
ver+0.012
 teeth+0.011
Neg logit contributions
is-0.021
andy-0.020
icol-0.019
ally-0.019
illy-0.019
Token She
Token ID1375
Feature activation+9.50e-03

Pos logit contributions
is+0.006
isy+0.006
andy+0.005
illy+0.005
rou+0.005
Neg logit contributions
adoes-0.003
rac-0.003
 turns-0.003
 teeth-0.003
ver-0.003
Token picked
Token ID6497
Feature activation+4.46e-03

Pos logit contributions
is+0.017
illy+0.016
andy+0.016
 sidelines+0.016
icol+0.016
Neg logit contributions
rac-0.010
 passports-0.010
 turns-0.010
 teeth-0.009
ver-0.009
Token up
Token ID510
Feature activation-7.63e-03

Pos logit contributions
is+0.008
 freshly+0.008
illy+0.007
ally+0.007
 gn+0.007
Neg logit contributions
rac-0.005
 passports-0.005
adoes-0.005
ver-0.004
 teeth-0.004
Token a
Token ID257
Feature activation+1.11e-03

Pos logit contributions
adoes+0.007
rac+0.007
 happens+0.006
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.010
 freshly-0.010
illy-0.010
ally-0.009
rou-0.009
Token big
Token ID1263
Feature activation-3.28e-03

Pos logit contributions
is+0.002
andy+0.002
ally+0.002
illy+0.002
rou+0.002
Neg logit contributions
 passports-0.001
rac-0.001
 teeth-0.001
adoes-0.001
ver-0.001
Token box
Token ID3091
Feature activation-1.78e-02

Pos logit contributions
 passports+0.004
 teeth+0.004
rac+0.004
 turns+0.004
adoes+0.004
Neg logit contributions
is-0.006
andy-0.005
ally-0.005
rou-0.005
illy-0.005
Token of
Token ID286
Feature activation-1.03e-02

Pos logit contributions
rac+0.011
 there+0.011
 passports+0.011
ver+0.010
adoes+0.009
Neg logit contributions
andy-0.039
is-0.039
icol-0.038
 sidelines-0.038
illy-0.037
Token or
Token ID393
Feature activation+1.07e-03

Pos logit contributions
rac+0.013
 passports+0.012
adoes+0.012
 teeth+0.012
ver+0.011
Neg logit contributions
is-0.018
andy-0.016
ally-0.016
illy-0.016
rou-0.016
Tokennam
Token ID7402
Feature activation-5.61e-02

Pos logit contributions
is+0.002
 freshly+0.002
 gn+0.002
illy+0.002
 lending+0.002
Neg logit contributions
rac-0.001
adoes-0.001
ver-0.001
 teeth-0.001
 passports-0.001
Tokenents
Token ID658
Feature activation-1.41e-02

Pos logit contributions
adoes+0.084
 happens+0.081
 turns+0.079
 teeth+0.078
 guys+0.078
Neg logit contributions
is-0.087
rou-0.077
ally-0.077
andy-0.077
illy-0.073
Token.
Token ID13
Feature activation-5.59e-03

Pos logit contributions
rac+0.016
 passports+0.016
adoes+0.016
 teeth+0.015
ver+0.015
Neg logit contributions
is-0.025
andy-0.023
ally-0.023
illy-0.023
rou-0.022
Token They
Token ID1119
Feature activation-1.74e-02

Pos logit contributions
 passports+0.009
rac+0.008
adoes+0.008
ver+0.008
 teeth+0.008
Neg logit contributions
is-0.008
ally-0.008
 freshly-0.008
andy-0.007
 sav-0.007
Token were
Token ID547
Feature activation-7.60e-03

Pos logit contributions
adoes+0.025
rac+0.023
 teeth+0.023
 passports+0.022
ver+0.021
Neg logit contributions
is-0.027
ally-0.025
illy-0.024
andy-0.024
 freshly-0.024
Token so
Token ID523
Feature activation-9.29e-03

Pos logit contributions
adoes+0.011
rac+0.011
 happens+0.010
ver+0.010
 passports+0.010
Neg logit contributions
is-0.011
 freshly-0.011
 crawling-0.011
 finishing-0.010
 sav-0.010
Token to
Token ID284
Feature activation+3.97e-03

Pos logit contributions
rac+0.012
 passports+0.011
adoes+0.011
 teeth+0.010
ver+0.010
Neg logit contributions
is-0.021
rou-0.019
andy-0.019
illy-0.018
ally-0.018
Token me
Token ID502
Feature activation-7.29e-03

Pos logit contributions
is+0.008
rou+0.008
icol+0.007
ally+0.007
andy+0.007
Neg logit contributions
rac-0.004
 passports-0.004
 teeth-0.004
 anyone-0.004
adoes-0.004
Token.
Token ID13
Feature activation-3.32e-03

Pos logit contributions
rac+0.009
 passports+0.009
adoes+0.009
 teeth+0.008
ver+0.008
Neg logit contributions
is-0.012
andy-0.011
ally-0.011
illy-0.011
rou-0.011
Token The
Token ID383
Feature activation+2.68e-03

Pos logit contributions
 passports+0.004
rac+0.004
adoes+0.004
ver+0.004
 teeth+0.004
Neg logit contributions
is-0.005
andy-0.005
 sav-0.005
illy-0.005
 freshly-0.005
Token or
Token ID393
Feature activation+4.87e-04

Pos logit contributions
ally+0.005
is+0.005
andy+0.005
illy+0.005
icol+0.004
Neg logit contributions
 passports-0.003
 teeth-0.003
adoes-0.003
rac-0.003
 turns-0.003
Tokennam
Token ID7402
Feature activation-5.59e-02

Pos logit contributions
is+0.001
 freshly+0.001
 crawling+0.001
 gn+0.001
 sav+0.001
Neg logit contributions
ver-0.001
rac-0.001
adoes-0.001
 passports-0.000
 teeth-0.000
Tokenents
Token ID658
Feature activation-6.58e-03

Pos logit contributions
 happens+0.086
adoes+0.086
 passports+0.083
 guys+0.083
 teeth+0.080
Neg logit contributions
is-0.085
rou-0.076
andy-0.072
ally-0.069
ami-0.066
Token were
Token ID547
Feature activation-9.56e-03

Pos logit contributions
 passports+0.006
rac+0.006
adoes+0.006
 teeth+0.006
 Lots+0.005
Neg logit contributions
is-0.013
andy-0.011
rou-0.011
isy-0.011
 crawling-0.011
Token very
Token ID845
Feature activation-9.44e-04

Pos logit contributions
rac+0.011
adoes+0.011
 passports+0.011
ver+0.010
 teeth+0.010
Neg logit contributions
is-0.017
andy-0.016
rou-0.015
 freshly-0.015
illy-0.015
Token important
Token ID1593
Feature activation-1.22e-03

Pos logit contributions
adoes+0.001
rac+0.001
 passports+0.001
ver+0.001
 teeth+0.001
Neg logit contributions
is-0.002
rou-0.001
andy-0.001
 freshly-0.001
 gn-0.001
Token to
Token ID284
Feature activation+1.11e-02

Pos logit contributions
rac+0.002
adoes+0.002
 passports+0.002
ver+0.002
 teeth+0.002
Neg logit contributions
is-0.002
rou-0.002
andy-0.002
illy-0.002
ally-0.002
Token and
Token ID290
Feature activation-1.00e-02

Pos logit contributions
is+0.013
andy+0.012
ally+0.012
illy+0.012
rou+0.012
Neg logit contributions
rac-0.010
 passports-0.009
adoes-0.009
 teeth-0.009
ver-0.009
Token play
Token ID711
Feature activation+1.64e-03

Pos logit contributions
rac+0.013
adoes+0.012
 passports+0.012
ver+0.011
 teeth+0.011
Neg logit contributions
is-0.016
andy-0.015
 gn-0.015
 sav-0.015
 freshly-0.015
Token with
Token ID351
Feature activation+4.23e-03

Pos logit contributions
is+0.003
andy+0.003
 sav+0.003
illy+0.003
 crawling+0.003
Neg logit contributions
rac-0.002
 passports-0.002
adoes-0.002
 teeth-0.002
ver-0.002
Token your
Token ID534
Feature activation-1.18e-03

Pos logit contributions
is+0.008
andy+0.008
ally+0.008
illy+0.008
rou+0.008
Neg logit contributions
rac-0.004
 passports-0.004
adoes-0.004
 teeth-0.004
ver-0.004
Token or
Token ID393
Feature activation+3.65e-03

Pos logit contributions
 passports+0.001
 teeth+0.001
adoes+0.001
rac+0.001
 turns+0.001
Neg logit contributions
is-0.002
ally-0.002
illy-0.002
andy-0.002
rou-0.002
Tokennam
Token ID7402
Feature activation-5.56e-02

Pos logit contributions
is+0.006
 sav+0.006
 gn+0.006
 crawling+0.006
 freshly+0.006
Neg logit contributions
rac-0.004
ver-0.004
adoes-0.004
 passports-0.004
outed-0.004
Tokenents
Token ID658
Feature activation-5.53e-03

Pos logit contributions
adoes+0.082
 happens+0.080
 passports+0.077
 turns+0.075
 guys+0.075
Neg logit contributions
is-0.088
rou-0.080
ally-0.077
andy-0.076
illy-0.072
Token.
Token ID13
Feature activation-8.66e-03

Pos logit contributions
rac+0.007
 passports+0.007
adoes+0.007
 teeth+0.006
ver+0.006
Neg logit contributions
is-0.009
andy-0.009
ally-0.008
illy-0.008
rou-0.008
Token Maybe
Token ID6674
Feature activation+5.64e-03

Pos logit contributions
rac+0.007
adoes+0.007
 passports+0.007
 teeth+0.007
ver+0.007
Neg logit contributions
is-0.018
andy-0.017
rou-0.017
illy-0.016
ally-0.016
Token you
Token ID345
Feature activation+1.83e-03

Pos logit contributions
is+0.011
andy+0.010
ally+0.010
illy+0.010
 gn+0.010
Neg logit contributions
rac-0.006
adoes-0.006
 passports-0.005
 teeth-0.005
ver-0.005
Token can
Token ID460
Feature activation+5.23e-03

Pos logit contributions
is+0.003
rou+0.003
 tre+0.003
 gn+0.003
andy+0.003
Neg logit contributions
adoes-0.002
rac-0.002
 passports-0.002
 teeth-0.002
outed-0.002
Token such
Token ID884
Feature activation-7.01e-03

Pos logit contributions
ver+0.006
rac+0.006
adoes+0.006
 passports+0.005
cases+0.005
Neg logit contributions
is-0.010
andy-0.009
illy-0.009
ally-0.009
 sav-0.009
Token a
Token ID257
Feature activation-2.59e-03

Pos logit contributions
rac+0.006
 passports+0.005
adoes+0.005
 teeth+0.005
ver+0.005
Neg logit contributions
is-0.015
andy-0.014
illy-0.014
ally-0.014
rou-0.014
Token mess
Token ID2085
Feature activation-1.32e-03

Pos logit contributions
 passports+0.003
 turns+0.003
 teeth+0.003
 tant+0.003
?"+0.003
Neg logit contributions
zie-0.005
andy-0.004
ally-0.004
scar-0.004
 lending-0.004
Token?
Token ID30
Feature activation+1.77e-03

Pos logit contributions
 passports+0.001
rac+0.001
adoes+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.003
andy-0.003
illy-0.003
ally-0.003
rou-0.003
Token Why
Token ID4162
Feature activation-6.42e-03

Pos logit contributions
is+0.003
andy+0.003
illy+0.003
rou+0.003
ally+0.003
Neg logit contributions
 passports-0.002
rac-0.002
adoes-0.002
 teeth-0.002
ver-0.002
Token did
Token ID750
Feature activation-5.55e-02

Pos logit contributions
rac+0.008
 Lots+0.008
 passports+0.008
adoes+0.008
 temples+0.007
Neg logit contributions
is-0.010
andy-0.009
rou-0.009
illy-0.009
ally-0.008
Token you
Token ID345
Feature activation+1.83e-03

Pos logit contributions
adoes+0.077
ver+0.077
rac+0.075
 passports+0.073
 tant+0.072
Neg logit contributions
is-0.082
andy-0.073
illy-0.071
ally-0.071
 crawling-0.071
Token hide
Token ID7808
Feature activation-1.28e-03

Pos logit contributions
is+0.003
andy+0.003
ally+0.003
illy+0.003
rou+0.003
Neg logit contributions
rac-0.002
adoes-0.002
 passports-0.002
 teeth-0.002
ver-0.002
Token from
Token ID422
Feature activation-9.82e-03

Pos logit contributions
rac+0.001
ver+0.001
 passports+0.001
adoes+0.001
 teeth+0.001
Neg logit contributions
is-0.002
andy-0.002
illy-0.002
rou-0.002
ally-0.002
Token me
Token ID502
Feature activation-1.67e-02

Pos logit contributions
rac+0.009
 passports+0.009
adoes+0.009
 teeth+0.009
ver+0.008
Neg logit contributions
is-0.019
andy-0.018
ally-0.018
illy-0.018
rou-0.018
Token?"
Token ID1701
Feature activation-3.46e-03

Pos logit contributions
adoes+0.019
 passports+0.019
rac+0.019
 teeth+0.018
 turns+0.018
Neg logit contributions
is-0.029
andy-0.027
illy-0.027
rou-0.026
ally-0.026
Token!"
Token ID2474
Feature activation+3.08e-03

Pos logit contributions
is+0.022
andy+0.020
illy+0.019
rou+0.019
ally+0.019
Neg logit contributions
rac-0.019
 passports-0.019
adoes-0.019
 teeth-0.018
ver-0.017
Token And
Token ID843
Feature activation+2.70e-06

Pos logit contributions
is+0.006
rou+0.006
andy+0.006
ally+0.006
illy+0.006
Neg logit contributions
rac-0.003
 passports-0.003
adoes-0.003
 teeth-0.003
 turns-0.003
Token from
Token ID422
Feature activation+5.55e-03

Feature not active on prev token;
no predictions were affected.
Token that
Token ID326
Feature activation-1.97e-03

Pos logit contributions
is+0.010
andy+0.010
rou+0.010
ally+0.009
illy+0.009
Neg logit contributions
rac-0.006
 passports-0.006
adoes-0.006
 teeth-0.005
 turns-0.005
Token day
Token ID1110
Feature activation+1.05e-02

Pos logit contributions
 there+0.002
 matters+0.002
?"+0.002
 means+0.002
rac+0.002
Neg logit contributions
icol-0.004
rou-0.004
is-0.004
 sidelines-0.003
 gn-0.003
Token on
Token ID319
Feature activation+2.56e-02

Pos logit contributions
is+0.022
andy+0.021
ally+0.020
illy+0.020
rou+0.020
Neg logit contributions
rac-0.009
 passports-0.009
adoes-0.009
 teeth-0.008
ver-0.008
Token,
Token ID11
Feature activation-2.94e-03

Pos logit contributions
is+0.051
rou+0.048
andy+0.047
illy+0.046
icol+0.046
Neg logit contributions
rac-0.025
adoes-0.024
 teeth-0.024
 passports-0.023
 there-0.023
Token Tim
Token ID5045
Feature activation+2.03e-02

Pos logit contributions
rac+0.003
adoes+0.003
 passports+0.003
 teeth+0.003
ver+0.003
Neg logit contributions
is-0.006
andy-0.005
illy-0.005
rou-0.005
ally-0.005
Token and
Token ID290
Feature activation+1.76e-02

Pos logit contributions
is+0.037
 sidelines+0.036
icol+0.036
 freshly+0.035
ally+0.034
Neg logit contributions
 teeth-0.022
rac-0.021
 turns-0.021
ver-0.020
 means-0.020
Token his
Token ID465
Feature activation+7.64e-03

Pos logit contributions
is+0.037
andy+0.034
illy+0.034
rou+0.034
ally+0.034
Neg logit contributions
rac-0.015
 teeth-0.015
adoes-0.015
 passports-0.015
 anyone-0.013
Token friends
Token ID2460
Feature activation-1.33e-03

Pos logit contributions
is+0.016
ally+0.016
icol+0.015
andy+0.015
illy+0.015
Neg logit contributions
 teeth-0.007
 passports-0.006
rac-0.006
 people-0.006
 anyone-0.006
Token Dad
Token ID17415
Feature activation-1.11e-02

Pos logit contributions
is+0.014
andy+0.014
ally+0.013
illy+0.013
rou+0.013
Neg logit contributions
rac-0.005
adoes-0.005
 passports-0.005
 teeth-0.005
ver-0.005
Token had
Token ID550
Feature activation-2.28e-03

Pos logit contributions
ver+0.009
rac+0.009
 passports+0.009
 turns+0.009
 there+0.008
Neg logit contributions
is-0.022
icol-0.022
andy-0.021
 gn-0.021
 sidelines-0.021
Token so
Token ID523
Feature activation-7.82e-03

Pos logit contributions
rac+0.002
adoes+0.002
 passports+0.002
 teeth+0.002
ver+0.002
Neg logit contributions
is-0.004
andy-0.004
ally-0.004
rou-0.004
illy-0.004
Token much
Token ID881
Feature activation+1.24e-02

Pos logit contributions
rac+0.010
adoes+0.010
 passports+0.009
 teeth+0.009
 guys+0.008
Neg logit contributions
is-0.014
rou-0.013
andy-0.012
illy-0.012
 freshly-0.012
Token fun
Token ID1257
Feature activation+2.05e-02

Pos logit contributions
zie+0.023
andy+0.023
ally+0.022
ousy+0.022
is+0.022
Neg logit contributions
 turns-0.014
 passports-0.012
 there-0.012
 happens-0.012
 teeth-0.011
Token with
Token ID351
Feature activation+2.28e-02

Pos logit contributions
Mic+0.033
ousy+0.031
ally+0.031
is+0.031
andy+0.031
Neg logit contributions
 there-0.028
 passports-0.028
 turns-0.027
 teeth-0.025
adoes-0.025
Token all
Token ID477
Feature activation-1.22e-02

Pos logit contributions
is+0.052
andy+0.049
ally+0.049
icol+0.049
rou+0.048
Neg logit contributions
 passports-0.015
 teeth-0.014
rac-0.014
adoes-0.014
 turns-0.013
Token their
Token ID511
Feature activation+1.21e-02

Pos logit contributions
 passports+0.007
 there+0.006
 turns+0.006
adoes+0.006
rac+0.005
Neg logit contributions
is-0.029
icol-0.028
andy-0.027
 gn-0.027
ally-0.027
Token silly
Token ID14397
Feature activation+1.14e-02

Pos logit contributions
 gn+0.023
icol+0.023
zie+0.023
is+0.022
illy+0.022
Neg logit contributions
 turns-0.013
 teeth-0.012
 passports-0.012
 voices-0.011
 tickets-0.010
Token things
Token ID1243
Feature activation+1.37e-02

Pos logit contributions
is+0.025
andy+0.024
rou+0.023
live+0.023
illy+0.023
Neg logit contributions
 teeth-0.010
 turns-0.009
 passports-0.009
 guys-0.009
adoes-0.009
Token.
Token ID13
Feature activation+4.30e-03

Pos logit contributions
is+0.025
andy+0.023
ally+0.023
illy+0.022
icol+0.022
Neg logit contributions
 passports-0.015
rac-0.015
adoes-0.015
ver-0.015
 there-0.014
Token
Token ID198
Feature activation+4.39e-03

Pos logit contributions
is+0.024
andy+0.022
ally+0.022
illy+0.022
rou+0.022
Neg logit contributions
rac-0.015
 passports-0.015
adoes-0.015
 teeth-0.014
ver-0.014
Token
Token ID198
Feature activation+3.30e-03

Pos logit contributions
is+0.009
icol+0.008
andy+0.008
rou+0.008
ally+0.008
Neg logit contributions
rac-0.004
ver-0.003
 passports-0.003
adoes-0.003
 teeth-0.003
TokenAs
Token ID1722
Feature activation+4.64e-03

Pos logit contributions
icol+0.005
is+0.005
andy+0.005
 freshly+0.004
rou+0.004
Neg logit contributions
rac-0.005
ver-0.004
adoes-0.004
 turns-0.004
 there-0.004
Token he
Token ID339
Feature activation+1.73e-02

Pos logit contributions
is+0.010
rou+0.009
icol+0.009
andy+0.009
isy+0.009
Neg logit contributions
 there-0.004
 anyone-0.004
 turns-0.004
 passports-0.004
ver-0.004
Token reflected
Token ID12548
Feature activation-4.30e-03

Pos logit contributions
is+0.028
andy+0.026
ally+0.025
rou+0.025
illy+0.025
Neg logit contributions
adoes-0.023
rac-0.023
 passports-0.022
 teeth-0.022
ver-0.021
Token on
Token ID319
Feature activation+2.57e-02

Pos logit contributions
 passports+0.005
rac+0.005
adoes+0.005
ver+0.005
 teeth+0.005
Neg logit contributions
is-0.007
 crawling-0.007
andy-0.007
illy-0.007
 hopping-0.006
Token all
Token ID477
Feature activation-5.02e-03

Pos logit contributions
is+0.053
andy+0.049
ally+0.048
illy+0.048
rou+0.048
Neg logit contributions
rac-0.023
 passports-0.023
adoes-0.022
 teeth-0.021
ver-0.020
Token he
Token ID339
Feature activation+5.83e-03

Pos logit contributions
rac+0.004
 passports+0.004
adoes+0.004
 teeth+0.004
 turns+0.003
Neg logit contributions
is-0.011
andy-0.010
rou-0.010
ally-0.010
icol-0.010
Token had
Token ID550
Feature activation+1.42e-03

Pos logit contributions
is+0.010
andy+0.009
illy+0.009
ally+0.009
 sav+0.009
Neg logit contributions
adoes-0.008
rac-0.007
 passports-0.007
 teeth-0.007
ver-0.007
Token seen
Token ID1775
Feature activation+5.77e-03

Pos logit contributions
is+0.002
 sav+0.002
andy+0.002
ally+0.002
 gn+0.002
Neg logit contributions
adoes-0.002
rac-0.002
 passports-0.002
ver-0.002
 teeth-0.001
Token,
Token ID11
Feature activation-8.22e-03

Pos logit contributions
is+0.012
andy+0.011
ally+0.011
illy+0.011
rou+0.011
Neg logit contributions
rac-0.005
adoes-0.005
 passports-0.005
 teeth-0.005
ver-0.004
Token curiosity
Token ID20136
Feature activation+1.17e-02

Pos logit contributions
is+0.006
ally+0.006
 freshly+0.006
rou+0.006
 sidelines+0.006
Neg logit contributions
 teeth-0.004
 passports-0.003
rac-0.003
adoes-0.003
 voices-0.003
Token got
Token ID1392
Feature activation+7.94e-03

Pos logit contributions
is+0.021
andy+0.020
illy+0.019
ally+0.019
rou+0.019
Neg logit contributions
rac-0.013
adoes-0.013
 passports-0.013
 teeth-0.012
ver-0.012
Token the
Token ID262
Feature activation+8.89e-03

Pos logit contributions
is+0.015
andy+0.015
ally+0.014
illy+0.014
rou+0.014
Neg logit contributions
rac-0.008
adoes-0.008
 passports-0.008
 teeth-0.007
ver-0.007
Token best
Token ID1266
Feature activation-2.15e-03

Pos logit contributions
is+0.016
andy+0.015
ally+0.015
illy+0.015
icol+0.014
Neg logit contributions
 passports-0.010
adoes-0.010
 teeth-0.010
rac-0.010
 turns-0.010
Token of
Token ID286
Feature activation+8.73e-03

Pos logit contributions
 passports+0.002
 teeth+0.002
adoes+0.002
 turns+0.002
 anyone+0.002
Neg logit contributions
is-0.004
andy-0.004
ally-0.004
illy-0.004
rou-0.004
Token him
Token ID683
Feature activation+2.11e-02

Pos logit contributions
icol+0.017
is+0.017
andy+0.016
ally+0.016
rou+0.016
Neg logit contributions
 passports-0.009
 teeth-0.009
adoes-0.009
 anyone-0.009
 there-0.009
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
is+0.043
andy+0.040
ally+0.040
illy+0.039
icol+0.039
Neg logit contributions
 passports-0.019
rac-0.019
 teeth-0.019
ver-0.019
adoes-0.018
Token He
Token ID679
Feature activation+1.07e-02

Pos logit contributions
is+0.024
andy+0.023
ally+0.022
illy+0.022
rou+0.022
Neg logit contributions
rac-0.016
 passports-0.015
adoes-0.015
 teeth-0.015
ver-0.014
Token couldn
Token ID3521
Feature activation-1.63e-02

Pos logit contributions
is+0.019
andy+0.017
illy+0.017
ally+0.017
rou+0.017
Neg logit contributions
rac-0.013
 passports-0.012
adoes-0.012
 teeth-0.012
ver-0.012
Token't
Token ID470
Feature activation+1.03e-03

Pos logit contributions
rac+0.013
adoes+0.012
 passports+0.012
ver+0.011
 teeth+0.011
Neg logit contributions
is-0.035
andy-0.033
rou-0.032
illy-0.032
ally-0.032
Token help
Token ID1037
Feature activation+2.10e-03

Pos logit contributions
is+0.002
rou+0.002
andy+0.002
icol+0.002
illy+0.002
Neg logit contributions
rac-0.001
 anyone-0.001
ver-0.001
 guys-0.001
 passports-0.001
Token but
Token ID475
Feature activation+6.17e-03

Pos logit contributions
is+0.016
andy+0.015
ally+0.014
illy+0.014
rou+0.014
Neg logit contributions
rac-0.008
 passports-0.008
adoes-0.008
 teeth-0.007
ver-0.007
Token she
Token ID673
Feature activation+6.34e-03

Pos logit contributions
is+0.010
andy+0.009
ally+0.009
illy+0.009
 crawling+0.009
Neg logit contributions
rac-0.008
 passports-0.008
adoes-0.008
 teeth-0.008
ver-0.007
Token did
Token ID750
Feature activation-1.80e-02

Pos logit contributions
is+0.010
andy+0.009
ally+0.009
illy+0.009
 crawling+0.009
Neg logit contributions
adoes-0.009
rac-0.009
 passports-0.009
 teeth-0.008
 tant-0.008
Token not
Token ID407
Feature activation+1.17e-02

Pos logit contributions
rac+0.019
adoes+0.019
 passports+0.019
 teeth+0.017
 tant+0.017
Neg logit contributions
is-0.033
 crawling-0.030
 gn-0.030
andy-0.030
rou-0.030
Token know
Token ID760
Feature activation+7.33e-03

Pos logit contributions
is+0.020
 gn+0.019
illy+0.019
andy+0.019
 crawling+0.019
Neg logit contributions
adoes-0.014
rac-0.014
 passports-0.013
cases-0.013
outed-0.012
Token what
Token ID644
Feature activation+2.11e-02

Pos logit contributions
is+0.015
andy+0.014
ally+0.014
illy+0.014
rou+0.013
Neg logit contributions
rac-0.007
 passports-0.007
adoes-0.006
 teeth-0.006
ver-0.006
Token to
Token ID284
Feature activation-3.75e-03

Pos logit contributions
is+0.040
andy+0.037
illy+0.037
ally+0.037
 freshly+0.036
Neg logit contributions
rac-0.022
adoes-0.021
 passports-0.020
 teeth-0.019
ver-0.019
Token get
Token ID651
Feature activation-7.67e-03

Pos logit contributions
adoes+0.006
rac+0.005
 passports+0.005
ver+0.005
 teeth+0.005
Neg logit contributions
is-0.006
ally-0.005
andy-0.005
illy-0.005
isy-0.005
Token.
Token ID13
Feature activation-5.60e-04

Pos logit contributions
adoes+0.009
rac+0.009
ver+0.008
 passports+0.008
 teeth+0.008
Neg logit contributions
is-0.013
illy-0.012
 crawling-0.012
 lending-0.012
ally-0.012
Token The
Token ID383
Feature activation-7.53e-03

Pos logit contributions
 passports+0.001
adoes+0.001
rac+0.001
ver+0.001
 teeth+0.001
Neg logit contributions
is-0.001
ally-0.001
andy-0.001
 sav-0.001
 gn-0.001
Token judge
Token ID5052
Feature activation-9.34e-03

Pos logit contributions
rac+0.011
 passports+0.011
adoes+0.011
 teeth+0.010
ver+0.010
Neg logit contributions
is-0.011
andy-0.010
rou-0.010
illy-0.010
ally-0.010
Token!"
Token ID2474
Feature activation+3.60e-03

Pos logit contributions
is+0.018
andy+0.017
ally+0.017
illy+0.017
rou+0.016
Neg logit contributions
 passports-0.013
rac-0.013
adoes-0.013
 teeth-0.012
ver-0.012
Token
Token ID198
Feature activation+1.00e-03

Pos logit contributions
is+0.007
andy+0.007
rou+0.006
 gn+0.006
illy+0.006
Neg logit contributions
rac-0.003
adoes-0.003
 teeth-0.003
 turns-0.003
 passports-0.003
Token
Token ID198
Feature activation-1.97e-03

Pos logit contributions
icol+0.002
is+0.002
 gn+0.002
andy+0.002
 sidelines+0.002
Neg logit contributions
rac-0.001
 turns-0.001
 there-0.001
 passports-0.001
 teeth-0.001
TokenFrom
Token ID4863
Feature activation+2.02e-02

Pos logit contributions
rac+0.002
 turns+0.002
 there+0.002
adoes+0.002
 passports+0.002
Neg logit contributions
icol-0.003
 gn-0.003
gie-0.003
 freshly-0.003
 grizz-0.003
Token that
Token ID326
Feature activation-1.82e-04

Pos logit contributions
is+0.038
 freshly+0.037
 gn+0.037
icol+0.036
andy+0.036
Neg logit contributions
rac-0.021
 there-0.020
adoes-0.018
 passports-0.018
?"-0.017
Token day
Token ID1110
Feature activation+6.50e-03

Pos logit contributions
 there+0.000
 matters+0.000
rac+0.000
 turns+0.000
ver+0.000
Neg logit contributions
icol-0.000
 gn-0.000
 freshly-0.000
rou-0.000
is-0.000
Token on
Token ID319
Feature activation+2.47e-02

Pos logit contributions
is+0.013
andy+0.013
illy+0.012
ally+0.012
rou+0.012
Neg logit contributions
rac-0.006
 passports-0.006
adoes-0.005
 teeth-0.005
ver-0.005
Token,
Token ID11
Feature activation-1.15e-02

Pos logit contributions
is+0.049
rou+0.048
andy+0.046
icol+0.045
 gn+0.045
Neg logit contributions
rac-0.023
adoes-0.023
 turns-0.022
 passports-0.021
 teeth-0.021
Token whenever
Token ID8797
Feature activation-3.35e-03

Pos logit contributions
rac+0.013
adoes+0.013
 passports+0.013
 teeth+0.013
ver+0.012
Neg logit contributions
is-0.020
andy-0.019
illy-0.018
rou-0.018
ally-0.018
Token the
Token ID262
Feature activation-2.94e-04

Pos logit contributions
rac+0.004
adoes+0.004
 passports+0.004
 teeth+0.004
ver+0.004
Neg logit contributions
is-0.006
andy-0.005
ally-0.005
illy-0.005
rou-0.005
Token m
Token ID285
Feature activation-3.87e-03

Pos logit contributions
 teeth+0.000
 passports+0.000
 turns+0.000
rac+0.000
adoes+0.000
Neg logit contributions
icol-0.001
ally-0.001
rou-0.001
illy-0.000
is-0.000
Token in
Token ID287
Feature activation+3.46e-04

Pos logit contributions
rac+0.004
adoes+0.004
 teeth+0.004
 passports+0.003
ver+0.003
Neg logit contributions
is-0.010
andy-0.009
illy-0.009
ally-0.009
 freshly-0.009
Token his
Token ID465
Feature activation-1.57e-04

Pos logit contributions
is+0.001
rou+0.001
andy+0.001
isy+0.001
illy+0.001
Neg logit contributions
 teeth-0.000
 passports-0.000
 there-0.000
 turns-0.000
rac-0.000
Token hands
Token ID2832
Feature activation-3.04e-03

Pos logit contributions
rac+0.000
 teeth+0.000
 passports+0.000
adoes+0.000
 turns+0.000
Neg logit contributions
is-0.000
rou-0.000
illy-0.000
andy-0.000
ally-0.000
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
ver+0.003
 there+0.003
rac+0.003
 teeth+0.003
 passports+0.002
Neg logit contributions
is-0.006
icol-0.006
andy-0.006
ally-0.006
 sidelines-0.006
Token With
Token ID2080
Feature activation+2.13e-02

Pos logit contributions
is+0.020
andy+0.018
ally+0.018
illy+0.018
rou+0.018
Neg logit contributions
rac-0.013
 passports-0.012
adoes-0.012
 teeth-0.012
ver-0.011
Token it
Token ID340
Feature activation-4.66e-03

Pos logit contributions
is+0.045
andy+0.042
ally+0.042
rou+0.041
illy+0.041
Neg logit contributions
rac-0.018
 passports-0.017
adoes-0.017
 teeth-0.017
ver-0.015
Token,
Token ID11
Feature activation-9.65e-03

Pos logit contributions
 there+0.004
ver+0.004
 teeth+0.004
rac+0.004
adoes+0.004
Neg logit contributions
is-0.010
icol-0.010
andy-0.009
ally-0.009
rou-0.009
Token he
Token ID339
Feature activation+2.85e-04

Pos logit contributions
rac+0.014
adoes+0.013
 passports+0.013
 teeth+0.013
ver+0.012
Neg logit contributions
is-0.014
 sav-0.013
 freshly-0.012
illy-0.012
ally-0.012
Token made
Token ID925
Feature activation-6.00e-03

Pos logit contributions
is+0.001
andy+0.000
illy+0.000
ally+0.000
rou+0.000
Neg logit contributions
rac-0.000
 passports-0.000
 teeth-0.000
adoes-0.000
ver-0.000
Token a
Token ID257
Feature activation+5.68e-03

Pos logit contributions
rac+0.005
 passports+0.004
adoes+0.004
ver+0.004
 teeth+0.004
Neg logit contributions
is-0.013
andy-0.012
illy-0.012
ally-0.012
rou-0.012
Token mother
Token ID2802
Feature activation-5.59e-03

Pos logit contributions
illy+0.013
scar+0.012
live+0.012
zie+0.012
ally+0.012
Neg logit contributions
 teeth-0.005
 turns-0.005
 voices-0.004
 passports-0.004
 tant-0.004
Token decided
Token ID3066
Feature activation-3.12e-03

Pos logit contributions
rac+0.007
 passports+0.007
 teeth+0.006
adoes+0.006
ver+0.006
Neg logit contributions
is-0.011
andy-0.010
 freshly-0.010
 sidelines-0.010
illy-0.010
Token to
Token ID284
Feature activation+1.21e-02

Pos logit contributions
 passports+0.002
rac+0.002
adoes+0.002
 teeth+0.002
ver+0.001
Neg logit contributions
is-0.007
andy-0.007
rou-0.007
ally-0.007
illy-0.007
Token sit
Token ID1650
Feature activation+1.07e-02

Pos logit contributions
is+0.021
rou+0.019
andy+0.019
illy+0.019
ally+0.019
Neg logit contributions
rac-0.015
adoes-0.015
 passports-0.014
 teeth-0.014
 turns-0.014
Token in
Token ID287
Feature activation+2.17e-03

Pos logit contributions
is+0.020
andy+0.019
rou+0.019
illy+0.018
 crawling+0.018
Neg logit contributions
 passports-0.011
rac-0.011
 teeth-0.010
adoes-0.010
 turns-0.010
Token the
Token ID262
Feature activation+5.43e-03

Pos logit contributions
is+0.005
andy+0.005
rou+0.005
icol+0.005
ally+0.005
Neg logit contributions
 passports-0.001
 there-0.001
rac-0.001
 teeth-0.001
 turns-0.001
Token mud
Token ID17492
Feature activation+1.11e-02

Pos logit contributions
is+0.010
illy+0.009
 lending+0.009
rou+0.009
ami+0.009
Neg logit contributions
adoes-0.006
 teeth-0.006
 turns-0.006
 there-0.006
 temples-0.006
Token and
Token ID290
Feature activation-2.05e-03

Pos logit contributions
is+0.020
andy+0.018
illy+0.018
rou+0.018
ally+0.018
Neg logit contributions
adoes-0.012
rac-0.012
 passports-0.012
 teeth-0.012
ver-0.011
Token have
Token ID423
Feature activation-5.05e-03

Pos logit contributions
rac+0.003
 passports+0.003
adoes+0.003
 teeth+0.003
 turns+0.002
Neg logit contributions
is-0.003
andy-0.003
illy-0.003
ally-0.003
rou-0.003
Token some
Token ID617
Feature activation-8.73e-03

Pos logit contributions
rac+0.004
 passports+0.004
adoes+0.004
 teeth+0.004
ver+0.004
Neg logit contributions
is-0.010
andy-0.010
ally-0.010
illy-0.010
rou-0.010
Token fun
Token ID1257
Feature activation+9.89e-03

Pos logit contributions
 teeth+0.010
 passports+0.010
 turns+0.010
adoes+0.010
rac+0.009
Neg logit contributions
is-0.015
andy-0.014
ally-0.014
icol-0.014
rou-0.014
Token.
Token ID13
Feature activation+7.24e-03

Pos logit contributions
is+0.018
andy+0.017
ally+0.017
illy+0.016
icol+0.016
Neg logit contributions
 teeth-0.011
 passports-0.011
adoes-0.011
 there-0.010
 turns-0.010
Token They
Token ID1119
Feature activation-1.16e-02

Pos logit contributions
rac+0.000
 passports+0.000
adoes+0.000
 teeth+0.000
ver+0.000
Neg logit contributions
is-0.001
andy-0.000
ally-0.000
illy-0.000
rou-0.000
Token slid
Token ID27803
Feature activation+2.00e-02

Pos logit contributions
rac+0.013
adoes+0.013
 passports+0.013
 teeth+0.012
ver+0.012
Neg logit contributions
is-0.021
andy-0.019
ally-0.019
rou-0.019
illy-0.019
Token down
Token ID866
Feature activation-9.09e-03

Pos logit contributions
is+0.041
andy+0.038
rou+0.038
ally+0.037
 sav+0.037
Neg logit contributions
rac-0.018
 passports-0.018
adoes-0.016
ver-0.016
 teeth-0.016
Token fast
Token ID3049
Feature activation+6.34e-03

Pos logit contributions
rac+0.008
 passports+0.008
adoes+0.008
 teeth+0.008
ver+0.007
Neg logit contributions
is-0.018
andy-0.017
illy-0.017
ally-0.017
rou-0.017
Token and
Token ID290
Feature activation-2.66e-03

Pos logit contributions
is+0.012
andy+0.012
ally+0.012
illy+0.012
rou+0.011
Neg logit contributions
adoes-0.006
 turns-0.006
 teeth-0.006
 passports-0.006
rac-0.006
Token laughed
Token ID13818
Feature activation+5.16e-04

Pos logit contributions
rac+0.003
 passports+0.003
adoes+0.003
 teeth+0.003
ver+0.003
Neg logit contributions
is-0.004
andy-0.004
illy-0.004
ally-0.004
rou-0.004
Token.
Token ID13
Feature activation+1.63e-03

Pos logit contributions
is+0.001
andy+0.001
ally+0.001
illy+0.001
rou+0.001
Neg logit contributions
rac-0.001
adoes-0.001
 passports-0.001
 teeth-0.001
ver-0.001
Token But
Token ID887
Feature activation-9.73e-04

Pos logit contributions
is+0.003
andy+0.003
ally+0.003
illy+0.003
rou+0.003
Neg logit contributions
rac-0.002
 passports-0.002
adoes-0.002
 teeth-0.001
ver-0.001
Token when
Token ID618
Feature activation-1.06e-02

Pos logit contributions
rac+0.001
 passports+0.001
adoes+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.001
andy-0.001
ally-0.001
illy-0.001
rou-0.001
Token they
Token ID484
Feature activation+3.90e-03

Pos logit contributions
rac+0.009
 passports+0.009
adoes+0.009
 teeth+0.008
ver+0.008
Neg logit contributions
is-0.022
andy-0.021
ally-0.020
illy-0.020
 gn-0.020
Token reached
Token ID4251
Feature activation+3.87e-03

Pos logit contributions
is+0.006
ally+0.006
andy+0.006
 sav+0.006
illy+0.006
Neg logit contributions
adoes-0.006
 passports-0.005
rac-0.005
 teeth-0.005
 microphones-0.005
Token was
Token ID373
Feature activation+2.25e-03

Pos logit contributions
is+0.014
icol+0.014
 sidelines+0.014
andy+0.014
rou+0.014
Neg logit contributions
ver-0.007
 passports-0.006
 there-0.006
rac-0.006
 teeth-0.006
Token there
Token ID612
Feature activation-1.34e-02

Pos logit contributions
is+0.003
 freshly+0.003
 crawling+0.003
rou+0.003
illy+0.003
Neg logit contributions
adoes-0.004
rac-0.003
 teeth-0.003
 passports-0.003
 turns-0.003
Token to
Token ID284
Feature activation-1.88e-02

Pos logit contributions
 passports+0.012
rac+0.012
ver+0.012
 there+0.012
 teeth+0.012
Neg logit contributions
is-0.027
icol-0.025
andy-0.025
ally-0.025
illy-0.024
Token remind
Token ID7101
Feature activation-6.00e-03

Pos logit contributions
rac+0.020
 passports+0.020
 anyone+0.020
 teeth+0.019
adoes+0.019
Neg logit contributions
is-0.034
rou-0.032
icol-0.032
andy-0.032
ally-0.031
Token her
Token ID607
Feature activation+3.68e-03

Pos logit contributions
rac+0.008
 passports+0.008
adoes+0.007
 teeth+0.007
 anyone+0.007
Neg logit contributions
is-0.010
rou-0.009
andy-0.009
icol-0.009
 gn-0.009
Token to
Token ID284
Feature activation+7.61e-03

Pos logit contributions
is+0.007
andy+0.007
illy+0.007
rou+0.007
ally+0.006
Neg logit contributions
rac-0.004
adoes-0.004
 passports-0.004
 teeth-0.003
ver-0.003
Token take
Token ID1011
Feature activation-7.55e-03

Pos logit contributions
is+0.014
icol+0.014
rou+0.013
andy+0.013
ally+0.013
Neg logit contributions
rac-0.008
 passports-0.008
 anyone-0.007
 there-0.007
adoes-0.007
Token a
Token ID257
Feature activation+1.99e-03

Pos logit contributions
rac+0.007
adoes+0.007
 passports+0.007
ver+0.007
 teeth+0.006
Neg logit contributions
is-0.015
illy-0.014
andy-0.014
ally-0.014
 freshly-0.013
Token break
Token ID2270
Feature activation+1.07e-03

Pos logit contributions
is+0.003
andy+0.003
ally+0.003
illy+0.003
rou+0.003
Neg logit contributions
 passports-0.003
rac-0.003
adoes-0.003
 teeth-0.003
ver-0.003
Token.
Token ID13
Feature activation-1.67e-03

Pos logit contributions
is+0.002
andy+0.002
ally+0.002
illy+0.002
rou+0.002
Neg logit contributions
rac-0.001
 passports-0.001
adoes-0.001
 teeth-0.001
ver-0.001
Token
Token ID220
Feature activation-9.86e-03

Pos logit contributions
adoes+0.002
rac+0.002
 passports+0.002
 teeth+0.002
ver+0.002
Neg logit contributions
is-0.002
ally-0.002
andy-0.002
illy-0.002
 freshly-0.002
Token say
Token ID910
Feature activation+3.43e-03

Pos logit contributions
rac+0.009
adoes+0.009
 passports+0.009
 teeth+0.008
ver+0.008
Neg logit contributions
is-0.015
andy-0.014
ally-0.013
rou-0.013
illy-0.013
Token sorry
Token ID7926
Feature activation-2.07e-02

Pos logit contributions
is+0.005
 gn+0.005
 lending+0.005
illy+0.005
ally+0.005
Neg logit contributions
rac-0.004
 passports-0.004
adoes-0.004
outed-0.004
ver-0.004
Token to
Token ID284
Feature activation+1.12e-03

Pos logit contributions
rac+0.024
 passports+0.023
ver+0.022
adoes+0.022
 tant+0.019
Neg logit contributions
is-0.037
rou-0.035
andy-0.034
illy-0.033
ally-0.033
Token Tim
Token ID5045
Feature activation+5.89e-03

Pos logit contributions
is+0.002
rou+0.002
andy+0.002
illy+0.002
 gn+0.002
Neg logit contributions
rac-0.001
adoes-0.001
 passports-0.001
 anyone-0.001
 teeth-0.001
Token's
Token ID338
Feature activation-8.44e-03

Pos logit contributions
is+0.011
andy+0.010
icol+0.010
ally+0.010
illy+0.010
Neg logit contributions
rac-0.006
 passports-0.006
adoes-0.006
 teeth-0.006
ver-0.006
Token dad
Token ID9955
Feature activation-1.20e-02

Pos logit contributions
 passports+0.006
 teeth+0.006
adoes+0.006
rac+0.006
 turns+0.005
Neg logit contributions
is-0.019
andy-0.018
ally-0.018
illy-0.018
icol-0.017
Token and
Token ID290
Feature activation+3.73e-03

Pos logit contributions
rac+0.014
 passports+0.014
adoes+0.014
 teeth+0.014
ver+0.013
Neg logit contributions
is-0.021
andy-0.019
ally-0.019
illy-0.019
icol-0.019
Token the
Token ID262
Feature activation+4.62e-03

Pos logit contributions
is+0.006
andy+0.006
ally+0.006
 gn+0.006
rou+0.006
Neg logit contributions
rac-0.004
adoes-0.004
 passports-0.004
 teeth-0.004
ver-0.004
Token animals
Token ID4695
Feature activation+7.02e-03

Pos logit contributions
ally+0.010
andy+0.010
illy+0.010
is+0.009
live+0.009
Neg logit contributions
 passports-0.005
 teeth-0.004
 voices-0.004
 people-0.004
 tant-0.004
Token.
Token ID13
Feature activation-3.02e-03

Pos logit contributions
is+0.010
andy+0.009
rou+0.009
illy+0.009
ally+0.009
Neg logit contributions
rac-0.011
adoes-0.010
 passports-0.010
 teeth-0.010
ver-0.009
Token They
Token ID1119
Feature activation-8.59e-03

Pos logit contributions
rac+0.002
adoes+0.002
 passports+0.002
 teeth+0.002
ver+0.002
Neg logit contributions
is-0.007
andy-0.006
illy-0.006
ally-0.006
rou-0.006
Token them
Token ID606
Feature activation-6.58e-03

Pos logit contributions
rac+0.007
ver+0.007
adoes+0.007
 passports+0.006
 teeth+0.006
Neg logit contributions
is-0.013
 freshly-0.012
andy-0.012
 gn-0.011
illy-0.011
Token.
Token ID13
Feature activation-1.38e-03

Pos logit contributions
rac+0.008
adoes+0.008
 passports+0.008
ver+0.008
 turns+0.008
Neg logit contributions
is-0.011
andy-0.010
rou-0.009
illy-0.009
 gn-0.009
Token She
Token ID1375
Feature activation-2.47e-03

Pos logit contributions
rac+0.001
 passports+0.001
adoes+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.003
andy-0.002
ally-0.002
illy-0.002
rou-0.002
Token wanted
Token ID2227
Feature activation-3.15e-03

Pos logit contributions
adoes+0.003
rac+0.003
 passports+0.003
 teeth+0.003
ver+0.003
Neg logit contributions
is-0.004
andy-0.004
 gn-0.004
 sav-0.004
rou-0.004
Token to
Token ID284
Feature activation-4.02e-03

Pos logit contributions
rac+0.003
adoes+0.003
ver+0.003
 passports+0.003
 teeth+0.003
Neg logit contributions
is-0.006
andy-0.006
illy-0.006
 freshly-0.006
ally-0.006
Token share
Token ID2648
Feature activation-1.93e-02

Pos logit contributions
rac+0.005
adoes+0.005
 passports+0.005
 teeth+0.005
ver+0.004
Neg logit contributions
is-0.007
andy-0.006
ally-0.006
illy-0.006
rou-0.006
Token them
Token ID606
Feature activation-8.87e-04

Pos logit contributions
rac+0.021
ver+0.020
adoes+0.019
outed+0.019
 passports+0.018
Neg logit contributions
is-0.038
andy-0.033
 gn-0.033
 sav-0.033
illy-0.032
Token with
Token ID351
Feature activation-2.81e-03

Pos logit contributions
rac+0.001
adoes+0.001
 passports+0.001
 teeth+0.001
 happens+0.001
Neg logit contributions
is-0.002
andy-0.001
 gn-0.001
 sav-0.001
ally-0.001
Token someone
Token ID2130
Feature activation-9.01e-03

Pos logit contributions
rac+0.003
 passports+0.003
adoes+0.003
 teeth+0.003
ver+0.003
Neg logit contributions
is-0.005
andy-0.005
ally-0.005
rou-0.005
illy-0.005
Token who
Token ID508
Feature activation+3.18e-03

Pos logit contributions
rac+0.012
adoes+0.012
 passports+0.011
outed+0.011
ver+0.010
Neg logit contributions
is-0.015
isy-0.013
 gn-0.012
rou-0.012
andy-0.012
Token would
Token ID561
Feature activation-2.22e-02

Pos logit contributions
is+0.006
andy+0.005
illy+0.005
ally+0.005
 freshly+0.005
Neg logit contributions
rac-0.004
adoes-0.004
 passports-0.003
 teeth-0.003
ver-0.003
Token using
Token ID1262
Feature activation+7.88e-03

Pos logit contributions
is+0.005
andy+0.005
ally+0.005
icol+0.005
illy+0.005
Neg logit contributions
 teeth-0.003
rac-0.003
 passports-0.003
adoes-0.003
 there-0.003
Token it
Token ID340
Feature activation-4.17e-03

Pos logit contributions
is+0.016
andy+0.015
illy+0.014
ally+0.014
rou+0.014
Neg logit contributions
rac-0.007
adoes-0.007
 passports-0.007
 teeth-0.007
ver-0.007
Token to
Token ID284
Feature activation-5.72e-03

Pos logit contributions
rac+0.004
 passports+0.004
adoes+0.004
 teeth+0.004
ver+0.004
Neg logit contributions
is-0.008
andy-0.007
illy-0.007
rou-0.007
ally-0.007
Token make
Token ID787
Feature activation-4.52e-03

Pos logit contributions
rac+0.006
 passports+0.006
 teeth+0.006
 anyone+0.006
adoes+0.006
Neg logit contributions
icol-0.010
rou-0.010
is-0.010
andy-0.010
ally-0.010
Token people
Token ID661
Feature activation-1.39e-02

Pos logit contributions
rac+0.004
 passports+0.004
adoes+0.003
 teeth+0.003
ver+0.003
Neg logit contributions
is-0.010
andy-0.009
ally-0.009
illy-0.009
rou-0.009
Token laugh
Token ID6487
Feature activation-7.76e-03

Pos logit contributions
rac+0.018
adoes+0.017
 passports+0.017
ver+0.017
 teeth+0.016
Neg logit contributions
is-0.023
rou-0.021
 freshly-0.020
andy-0.020
 gn-0.020
Token,"
Token ID553
Feature activation+1.04e-02

Pos logit contributions
rac+0.010
 passports+0.010
ver+0.009
adoes+0.008
outed+0.008
Neg logit contributions
is-0.013
rou-0.012
isy-0.011
illy-0.011
andy-0.011
Token said
Token ID531
Feature activation-2.85e-03

Pos logit contributions
is+0.020
rou+0.019
icol+0.018
 sidelines+0.018
andy+0.018
Neg logit contributions
rac-0.012
 passports-0.011
 teeth-0.010
 there-0.010
 means-0.010
Token Uncle
Token ID23169
Feature activation-4.08e-03

Pos logit contributions
rac+0.004
 passports+0.004
adoes+0.004
 teeth+0.004
ver+0.004
Neg logit contributions
is-0.004
andy-0.004
ally-0.004
illy-0.004
rou-0.004
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
ver+0.004
 there+0.004
 teeth+0.004
rac+0.004
 anyone+0.004
Neg logit contributions
is-0.008
 sidelines-0.008
 lookout-0.007
 freshly-0.007
icol-0.007
Token
Token ID198
Feature activation-7.96e-03

Pos logit contributions
rac+0.010
 passports+0.009
 teeth+0.009
adoes+0.009
ver+0.009
Neg logit contributions
is-0.022
andy-0.021
illy-0.021
rou-0.021
ally-0.020
Token and
Token ID290
Feature activation-2.82e-03

Pos logit contributions
 passports+0.001
rac+0.001
adoes+0.001
 teeth+0.001
 turns+0.001
Neg logit contributions
is-0.001
andy-0.001
rou-0.001
illy-0.001
ally-0.001
Token said
Token ID531
Feature activation-1.19e-02

Pos logit contributions
rac+0.003
 passports+0.003
adoes+0.003
 teeth+0.003
ver+0.003
Neg logit contributions
is-0.005
andy-0.005
rou-0.005
illy-0.005
ally-0.005
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
rac+0.016
 passports+0.016
adoes+0.016
 teeth+0.015
 turns+0.014
Neg logit contributions
is-0.019
andy-0.017
illy-0.017
ally-0.017
rou-0.017
Token "
Token ID366
Feature activation-3.20e-03

Pos logit contributions
rac+0.011
 passports+0.011
adoes+0.011
 teeth+0.011
ver+0.010
Neg logit contributions
is-0.018
andy-0.017
ally-0.016
illy-0.016
rou-0.016
TokenSure
Token ID19457
Feature activation-1.80e-02

Pos logit contributions
rac+0.004
ver+0.003
 passports+0.003
 turns+0.003
adoes+0.003
Neg logit contributions
andy-0.005
 crawling-0.005
icol-0.005
is-0.005
illy-0.005
Token,
Token ID11
Feature activation-1.07e-02

Pos logit contributions
 teeth+0.016
rac+0.015
 passports+0.015
 turns+0.015
 there+0.015
Neg logit contributions
is-0.036
andy-0.035
rou-0.035
illy-0.034
 sidelines-0.034
Token let
Token ID1309
Feature activation-5.92e-03

Pos logit contributions
rac+0.012
adoes+0.012
 passports+0.012
 teeth+0.011
ver+0.011
Neg logit contributions
is-0.019
andy-0.018
illy-0.017
rou-0.017
ally-0.017
Token's
Token ID338
Feature activation-1.57e-02

Pos logit contributions
rac+0.006
 passports+0.006
adoes+0.006
 teeth+0.006
ver+0.005
Neg logit contributions
is-0.011
andy-0.011
rou-0.010
illy-0.010
ally-0.010
Token go
Token ID467
Feature activation-8.84e-03

Pos logit contributions
rac+0.018
 passports+0.017
adoes+0.017
 teeth+0.017
ver+0.016
Neg logit contributions
is-0.028
andy-0.026
ally-0.026
rou-0.026
illy-0.026
Token to
Token ID284
Feature activation-5.31e-03

Pos logit contributions
 passports+0.008
rac+0.008
 there+0.007
 turns+0.007
ver+0.007
Neg logit contributions
andy-0.017
is-0.017
ally-0.017
illy-0.017
icol-0.016
Token the
Token ID262
Feature activation-1.74e-03

Pos logit contributions
 passports+0.004
rac+0.004
adoes+0.004
 teeth+0.004
 turns+0.004
Neg logit contributions
is-0.011
andy-0.011
ally-0.011
illy-0.011
rou-0.011
Token,
Token ID11
Feature activation-2.12e-02

Pos logit contributions
rac+0.001
 passports+0.001
outed+0.001
 teeth+0.001
 temples+0.001
Neg logit contributions
is-0.001
ally-0.001
 crawling-0.001
 lending-0.001
 course-0.001
Token Ellie
Token ID44801
Feature activation+1.80e-02

Pos logit contributions
rac+0.049
 passports+0.048
 teeth+0.047
outed+0.045
 tant+0.044
Neg logit contributions
is-0.012
ally-0.011
 lending-0.010
 freshly-0.010
 Mitt-0.010
Token didn
Token ID1422
Feature activation-1.26e-02

Pos logit contributions
is+0.031
andy+0.029
ally+0.028
illy+0.028
rou+0.027
Neg logit contributions
rac-0.022
adoes-0.021
 passports-0.021
 teeth-0.021
ver-0.020
Token't
Token ID470
Feature activation+3.44e-03

Pos logit contributions
 passports+0.012
rac+0.012
adoes+0.012
 teeth+0.012
ver+0.011
Neg logit contributions
is-0.024
andy-0.023
illy-0.022
ally-0.022
rou-0.022
Token want
Token ID765
Feature activation-1.63e-03

Pos logit contributions
is+0.006
 gn+0.006
andy+0.006
illy+0.006
ally+0.005
Neg logit contributions
rac-0.004
adoes-0.004
 passports-0.004
 teeth-0.004
outed-0.003
Token to
Token ID284
Feature activation-9.70e-03

Pos logit contributions
rac+0.002
 passports+0.001
adoes+0.001
ver+0.001
 teeth+0.001
Neg logit contributions
is-0.003
andy-0.003
illy-0.003
ally-0.003
 gn-0.003
Token go
Token ID467
Feature activation-8.04e-03

Pos logit contributions
rac+0.013
adoes+0.013
 passports+0.012
 teeth+0.012
ver+0.012
Neg logit contributions
is-0.016
andy-0.015
illy-0.014
ally-0.014
 gn-0.014
Token because
Token ID780
Feature activation-9.49e-03

Pos logit contributions
 passports+0.007
rac+0.007
adoes+0.007
 there+0.006
 teeth+0.006
Neg logit contributions
is-0.016
ally-0.016
andy-0.015
illy-0.015
icol-0.015
Token she
Token ID673
Feature activation-1.55e-03

Pos logit contributions
rac+0.015
adoes+0.015
 passports+0.015
 teeth+0.015
ver+0.014
Neg logit contributions
is-0.012
andy-0.011
illy-0.011
ally-0.011
rou-0.011
Token was
Token ID373
Feature activation-2.36e-03

Pos logit contributions
rac+0.002
adoes+0.002
 passports+0.002
 teeth+0.002
ver+0.002
Neg logit contributions
is-0.003
andy-0.002
ally-0.002
illy-0.002
rou-0.002
Token too
Token ID1165
Feature activation-1.11e-02

Pos logit contributions
rac+0.003
adoes+0.003
 passports+0.003
cases+0.003
ports+0.003
Neg logit contributions
 crawling-0.004
is-0.004
 hopping-0.004
rou-0.004
 lending-0.004
Token
Token ID198
Feature activation-7.37e-03

Pos logit contributions
rac+0.005
 passports+0.005
adoes+0.005
 teeth+0.004
ver+0.004
Neg logit contributions
is-0.010
andy-0.009
illy-0.009
ally-0.009
rou-0.009
Token"
Token ID1
Feature activation+3.51e-03

Pos logit contributions
rac+0.008
 there+0.008
 turns+0.008
 passports+0.008
ver+0.007
Neg logit contributions
icol-0.013
andy-0.013
is-0.013
 freshly-0.013
illy-0.013
TokenTom
Token ID13787
Feature activation+4.78e-03

Pos logit contributions
is+0.004
ally+0.004
andy+0.004
illy+0.004
 sav+0.004
Neg logit contributions
rac-0.006
 teeth-0.006
adoes-0.006
 passports-0.006
ver-0.006
Token,
Token ID11
Feature activation-5.78e-04

Pos logit contributions
is+0.010
 sidelines+0.010
 freshly+0.010
illy+0.009
andy+0.009
Neg logit contributions
 teeth-0.004
 there-0.004
 turns-0.004
ver-0.004
rac-0.004
Token where
Token ID810
Feature activation-1.56e-02

Pos logit contributions
rac+0.001
adoes+0.001
 passports+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.001
andy-0.001
illy-0.001
ally-0.001
rou-0.001
Token did
Token ID750
Feature activation-3.58e-02

Pos logit contributions
rac+0.022
adoes+0.021
 tant+0.020
rier+0.019
agged+0.019
Neg logit contributions
illy-0.020
andy-0.019
rou-0.019
 living-0.019
 crawling-0.019
Token you
Token ID345
Feature activation-3.05e-03

Pos logit contributions
ver+0.049
 tant+0.049
rac+0.048
adoes+0.048
outed+0.047
Neg logit contributions
is-0.050
ally-0.047
andy-0.046
 lending-0.046
 sav-0.045
Token hide
Token ID7808
Feature activation-1.05e-03

Pos logit contributions
adoes+0.004
 passports+0.004
rac+0.004
 teeth+0.004
 turns+0.004
Neg logit contributions
is-0.005
andy-0.004
ally-0.004
illy-0.004
 gn-0.004
Token your
Token ID534
Feature activation+5.90e-03

Pos logit contributions
rac+0.001
 passports+0.001
adoes+0.001
 teeth+0.001
ver+0.001
Neg logit contributions
is-0.002
andy-0.002
ally-0.002
illy-0.002
rou-0.002
Token shoe
Token ID17292
Feature activation+8.94e-03

Pos logit contributions
is+0.009
icol+0.008
ally+0.008
rou+0.008
andy+0.008
Neg logit contributions
 teeth-0.009
 passports-0.008
rac-0.008
 anyone-0.008
 there-0.008
Token?"
Token ID1701
Feature activation+7.72e-04

Pos logit contributions
is+0.019
andy+0.018
illy+0.018
ally+0.018
rou+0.018
Neg logit contributions
rac-0.007
adoes-0.006
 passports-0.006
 teeth-0.006
ver-0.006
Token said
Token ID531
Feature activation+2.20e-03

Pos logit contributions
is+0.009
andy+0.008
ally+0.008
illy+0.008
rou+0.008
Neg logit contributions
rac-0.005
 passports-0.005
adoes-0.005
 teeth-0.004
ver-0.004
Token Mama
Token ID40084
Feature activation-1.09e-02

Pos logit contributions
is+0.004
 gn+0.004
andy+0.004
isy+0.003
 crawling+0.003
Neg logit contributions
 passports-0.002
adoes-0.002
rac-0.002
outed-0.002
 teeth-0.002
Token.
Token ID13
Feature activation-2.23e-03

Pos logit contributions
ver+0.010
 there+0.010
 teeth+0.010
adoes+0.009
 passports+0.009
Neg logit contributions
is-0.022
 sidelines-0.022
icol-0.021
 sav-0.020
andy-0.020
Token
Token ID198
Feature activation+6.27e-03

Pos logit contributions
rac+0.002
adoes+0.002
 teeth+0.002
ver+0.002
 passports+0.002
Neg logit contributions
is-0.004
rou-0.004
andy-0.004
illy-0.004
ally-0.004
TokenThe
Token ID464
Feature activation-2.70e-03

Pos logit contributions
is+0.012
icol+0.011
rou+0.011
 sidelines+0.011
 freshly+0.011
Neg logit contributions
rac-0.007
ver-0.006
 passports-0.006
 there-0.006
 turns-0.005
Token 3
Token ID513
Feature activation-3.74e-02

Pos logit contributions
 passports+0.003
 teeth+0.003
rac+0.002
 turns+0.002
 there+0.002
Neg logit contributions
is-0.005
icol-0.005
andy-0.005
illy-0.005
rou-0.005
Token year
Token ID614
Feature activation-1.31e-02

Pos logit contributions
adoes+0.053
rac+0.053
 passports+0.052
ver+0.050
 teeth+0.049
Neg logit contributions
is-0.059
andy-0.053
ally-0.052
illy-0.051
isy-0.051
Token old
Token ID1468
Feature activation-4.81e-03

Pos logit contributions
rac+0.017
 passports+0.017
adoes+0.017
 teeth+0.016
 turns+0.016
Neg logit contributions
is-0.021
andy-0.020
ally-0.019
illy-0.019
rou-0.019
Token boy
Token ID2933
Feature activation-2.06e-03

Pos logit contributions
rac+0.005
 passports+0.005
 teeth+0.005
 turns+0.004
adoes+0.004
Neg logit contributions
is-0.009
 sidelines-0.009
 freshly-0.009
rou-0.009
icol-0.009
Token ran
Token ID4966
Feature activation+1.02e-02

Pos logit contributions
rac+0.002
 passports+0.002
 teeth+0.002
ver+0.002
adoes+0.002
Neg logit contributions
is-0.004
andy-0.003
illy-0.003
icol-0.003
 freshly-0.003
Token outside
Token ID2354
Feature activation+7.20e-03

Pos logit contributions
is+0.022
andy+0.022
rou+0.021
illy+0.021
ally+0.021
Neg logit contributions
 turns-0.007
adoes-0.007
 passports-0.007
rac-0.006
 there-0.006
TokenOf
Token ID5189
Feature activation-1.16e-02

Pos logit contributions
rac+0.006
 passports+0.006
adoes+0.005
ver+0.005
 turns+0.005
Neg logit contributions
is-0.007
andy-0.007
icol-0.007
illy-0.007
ally-0.007
Token course
Token ID1781
Feature activation+7.77e-03

Pos logit contributions
rac+0.026
 turns+0.025
 passports+0.024
 anyone+0.024
adoes+0.023
Neg logit contributions
is-0.008
rou-0.008
icol-0.008
andy-0.008
 gn-0.007
Token,
Token ID11
Feature activation-8.87e-03

Pos logit contributions
is+0.014
andy+0.013
rou+0.013
icol+0.013
illy+0.012
Neg logit contributions
rac-0.009
 passports-0.009
 teeth-0.009
adoes-0.009
 anyone-0.008
Token but
Token ID475
Feature activation+3.70e-03

Pos logit contributions
rac+0.011
adoes+0.011
 passports+0.010
 teeth+0.010
 turns+0.010
Neg logit contributions
is-0.015
andy-0.014
icol-0.014
rou-0.014
illy-0.014
Token why
Token ID1521
Feature activation-1.27e-03

Pos logit contributions
is+0.007
andy+0.006
ally+0.006
illy+0.006
rou+0.006
Neg logit contributions
rac-0.004
 passports-0.004
adoes-0.004
 teeth-0.004
ver-0.004
Token do
Token ID466
Feature activation-3.81e-02

Pos logit contributions
rac+0.001
 passports+0.001
adoes+0.001
 teeth+0.001
 turns+0.001
Neg logit contributions
is-0.002
andy-0.002
rou-0.002
illy-0.002
ally-0.002
Token you
Token ID345
Feature activation+9.81e-03

Pos logit contributions
 tant+0.059
adoes+0.058
outed+0.056
ver+0.056
rac+0.055
Neg logit contributions
is-0.052
ally-0.045
andy-0.045
illy-0.045
isy-0.044
Token want
Token ID765
Feature activation+4.93e-03

Pos logit contributions
is+0.018
andy+0.016
ally+0.016
illy+0.016
rou+0.016
Neg logit contributions
adoes-0.011
rac-0.011
 passports-0.011
 teeth-0.010
ver-0.010
Token to
Token ID284
Feature activation+1.66e-03

Pos logit contributions
is+0.009
 sav+0.008
 lending+0.008
illy+0.008
 freshly+0.008
Neg logit contributions
ver-0.006
adoes-0.006
rac-0.006
outed-0.005
cases-0.005
Token buy
Token ID2822
Feature activation+9.51e-04

Pos logit contributions
is+0.003
andy+0.003
rou+0.003
illy+0.002
ally+0.002
Neg logit contributions
rac-0.002
 passports-0.002
adoes-0.002
 teeth-0.002
ver-0.002
Token such
Token ID884
Feature activation-1.97e-03

Pos logit contributions
is+0.002
 course+0.001
 freshly+0.001
illy+0.001
 sav+0.001
Neg logit contributions
ver-0.001
adoes-0.001
outed-0.001
rac-0.001
cases-0.001
Token surprise
Token ID5975
Feature activation-3.24e-03

Pos logit contributions
is+0.022
rou+0.021
andy+0.021
scar+0.021
illy+0.020
Neg logit contributions
 turns-0.008
 there-0.008
 teeth-0.008
 passports-0.008
 voices-0.007
Token.
Token ID13
Feature activation+9.76e-03

Pos logit contributions
 passports+0.003
 teeth+0.003
rac+0.003
adoes+0.003
 there+0.003
Neg logit contributions
is-0.006
rou-0.006
icol-0.006
andy-0.006
 gn-0.006
Token She
Token ID1375
Feature activation+4.23e-03

Pos logit contributions
is+0.017
andy+0.016
ally+0.016
illy+0.016
rou+0.016
Neg logit contributions
rac-0.012
adoes-0.011
 passports-0.011
 teeth-0.011
ver-0.010
Token watched
Token ID7342
Feature activation+1.84e-03

Pos logit contributions
is+0.007
andy+0.007
illy+0.007
ally+0.007
rou+0.007
Neg logit contributions
rac-0.005
 passports-0.005
adoes-0.005
 teeth-0.005
ver-0.005
Token in
Token ID287
Feature activation+5.11e-03

Pos logit contributions
is+0.004
andy+0.004
ally+0.004
scar+0.004
rou+0.004
Neg logit contributions
rac-0.001
adoes-0.001
 teeth-0.001
 anyone-0.001
 passports-0.001
Token amaz
Token ID40642
Feature activation-3.65e-02

Pos logit contributions
is+0.011
scar+0.010
ally+0.010
live+0.010
illy+0.010
Neg logit contributions
 turns-0.005
 teeth-0.005
 there-0.005
 passports-0.005
 voices-0.005
Tokenement
Token ID972
Feature activation+1.22e-03

Pos logit contributions
 passports+0.062
ver+0.061
rac+0.060
 teeth+0.060
adoes+0.060
Neg logit contributions
is-0.048
andy-0.043
illy-0.042
ally-0.041
rou-0.038
Token as
Token ID355
Feature activation-7.43e-03

Pos logit contributions
is+0.003
ally+0.002
andy+0.002
illy+0.002
 freshly+0.002
Neg logit contributions
rac-0.001
adoes-0.001
 teeth-0.001
 passports-0.001
ver-0.001
Token the
Token ID262
Feature activation-6.11e-03

Pos logit contributions
 anyone+0.004
 there+0.004
 passports+0.004
 turns+0.004
adoes+0.004
Neg logit contributions
is-0.018
icol-0.017
ally-0.017
andy-0.017
 bisc-0.016
Token yellow
Token ID7872
Feature activation-4.56e-03

Pos logit contributions
rac+0.008
 passports+0.008
adoes+0.008
ver+0.007
 teeth+0.007
Neg logit contributions
is-0.010
andy-0.009
ally-0.009
illy-0.009
rou-0.009
Token knee
Token ID10329
Feature activation+1.11e-02

Pos logit contributions
rac+0.006
 passports+0.006
ver+0.006
adoes+0.006
 teeth+0.006
Neg logit contributions
is-0.007
andy-0.006
illy-0.006
ally-0.006
 crawling-0.006
Token to
Token ID284
Feature activation-1.01e-02

Pos logit contributions
is+0.005
andy+0.005
isy+0.005
icol+0.005
ally+0.005
Neg logit contributions
 there-0.001
 passports-0.001
rac-0.001
 turns-0.001
ver-0.001
Token her
Token ID607
Feature activation-1.22e-03

Pos logit contributions
 there+0.007
 passports+0.007
 anyone+0.006
 turns+0.006
rac+0.006
Neg logit contributions
rou-0.022
is-0.022
icol-0.021
andy-0.021
ousy-0.021
Token.
Token ID13
Feature activation-5.06e-03

Pos logit contributions
 teeth+0.001
 passports+0.001
 there+0.001
 bags+0.001
adoes+0.001
Neg logit contributions
is-0.003
scar-0.002
andy-0.002
illy-0.002
ally-0.002
Token "
Token ID366
Feature activation-8.53e-03

Pos logit contributions
rac+0.005
 teeth+0.005
 passports+0.005
adoes+0.004
 turns+0.004
Neg logit contributions
is-0.010
andy-0.009
rou-0.009
illy-0.009
ally-0.009
TokenWhere
Token ID8496
Feature activation-9.41e-04

Pos logit contributions
rac+0.011
 passports+0.011
 turns+0.010
ver+0.010
adoes+0.010
Neg logit contributions
is-0.013
andy-0.013
icol-0.013
rou-0.012
ally-0.012
Token did
Token ID750
Feature activation-3.76e-02

Pos logit contributions
adoes+0.001
rac+0.001
 microphones+0.001
 passports+0.001
 tant+0.001
Neg logit contributions
is-0.002
rou-0.001
andy-0.001
illy-0.001
ally-0.001
Token you
Token ID345
Feature activation+4.68e-03

Pos logit contributions
rac+0.048
adoes+0.048
ver+0.046
 passports+0.046
 teeth+0.046
Neg logit contributions
is-0.060
illy-0.054
andy-0.054
ally-0.054
 gn-0.053
Token get
Token ID651
Feature activation+5.71e-03

Pos logit contributions
is+0.007
andy+0.007
illy+0.007
ally+0.007
rou+0.007
Neg logit contributions
adoes-0.006
rac-0.006
 passports-0.006
 teeth-0.006
ver-0.006
Token that
Token ID326
Feature activation-4.54e-03

Pos logit contributions
is+0.010
illy+0.009
 lending+0.009
 freshly+0.009
ally+0.009
Neg logit contributions
adoes-0.007
ver-0.007
rac-0.007
 teeth-0.007
 passports-0.006
Token money
Token ID1637
Feature activation+1.90e-03

Pos logit contributions
rac+0.005
adoes+0.005
ver+0.004
 passports+0.004
 teeth+0.004
Neg logit contributions
is-0.009
illy-0.008
 freshly-0.008
ally-0.008
 gn-0.008
Token?
Token ID30
Feature activation+7.93e-03

Pos logit contributions
is+0.004
illy+0.003
ally+0.003
andy+0.003
 freshly+0.003
Neg logit contributions
adoes-0.002
 passports-0.002
 teeth-0.002
rac-0.002
ver-0.002