TOP ACTIVATIONS
MAX = 3.945

asked : " What does he record ?" "
" What does he know ?" Lily asks .
" Sam , what did he do ?" Tom asks .
amazing adventure ! <|endoftext|> The other day , Sarah and her
there !" The other child said , " Let
. The other children cheered her on ,
often ignore him . The other day , he decided he
up . The other children noticed she was wearing
the hero . What did he do ?" His
" Why does he need to check our lungs
the sand box . The other child shouted : " Help
A ki and M aki were very sad .
sand . The other child said : " Stop
child . The other child took a deep breath
over a cable . The other child saw this and smiled
" What did she say ?" Lily asks .
march back home . The other old men said goodbye and
that man ? Why does he have a whistle ?"
A ki and M aki apologized , but the
dad , " What is he doing ?" Her

MIN ACTIVATIONS
MIN = -3.593

We are sorry , Tim / S ue . We will
, " It is his / her fault . He /
/ her fault . He / she broke the car ."
, thirteen , fourteen , fifteen , sixteen , seventeen ,
. " She / he broke my worm !"
. " Anna / Ben did it !" they
tall and straight , never leaning or sway ing in the
she saw a long ladder leaning against a house . She
, thirteen , fourteen , fifteen , sixteen , seventeen ,
when she saw a ladder leaning against the fence . She
modest house with a ladder leaning against the wall . They
All the kids put their feet on the floor and
let me help you carry your bucket . I think I
and bumped against the books hel ves . He smiled as
I can 't reach the cher ries . Can you help
" It is a skysc raper . It is a
they saw a new ladder leaning on the wall of their
Lia 's rocket is red and Ben 's rocket is blue
her not to investigate books hel ves alone , but she
They see a big ladder leaning on the wall . They

INTERVAL 2.060 - 3.945
CONTAINS 0.054%

. He put on his coat and put his things in
making lights ! I step back and it makes a cool
classroom . M olly looked around the room
and c uddled together . Mitt ens was a good cat
good for you . But if you walk , I â

INTERVAL 0.176 - 2.060
CONTAINS 36.060%

and excited . Ben and Lily held hands and
don 't belong . She promised to always be kind to
Billy said sorry , but Tim my didn 't
ot ter started to move again and it sw am away
eb was happy because his laser was very reliable . He

INTERVAL -1.708 - 0.176
CONTAINS 63.724%

at the zoo , they saw all kinds of animals .
the pill and a glass of juice . Lily made a
and Ben said . " Max is here and he is
and the family lived happily ever after . <|endoftext|> Once there
?" he said . L ily and Ben nodded

INTERVAL -3.593 - -1.708
CONTAINS 0.161%

let 's count the apples together . One , two ,
for the man and asked her mom my if they could
, but it is too late . The bottle lands in
The spider told a story about a fragile flower .
Jack asked her , Mom
Token asked
Token ID1965
Feature activation+7.90e-01

Pos logit contributions
ups+0.016
+0.016
spe+0.015
J+0.015
edient+0.015
Neg logit contributions
ags-0.017
 Whe-0.016
rob-0.016
Whe-0.015
 de-0.015
Token:
Token ID25
Feature activation-6.33e-01

Pos logit contributions
ags+0.030
rob+0.030
 Whe+0.027
Whe+0.027
 Ow+0.026
Neg logit contributions
-0.019
ups-0.017
spe-0.015
J-0.014
Jud-0.014
Token "
Token ID366
Feature activation-4.32e-01

Pos logit contributions
ups+0.023
+0.022
spe+0.021
Jud+0.020
J+0.019
Neg logit contributions
ags-0.015
 Whe-0.014
rob-0.014
Whe-0.012
 Ow-0.012
TokenWhat
Token ID2061
Feature activation-2.51e-01

Pos logit contributions
+0.013
ups+0.012
spe+0.012
Jud+0.011
edient+0.010
Neg logit contributions
rob-0.013
ags-0.013
Whe-0.012
 Whe-0.012
 Ow-0.011
Token does
Token ID857
Feature activation+3.49e-01

Pos logit contributions
+0.008
ups+0.007
"+0.006
spe+0.006
J+0.006
Neg logit contributions
ags-0.008
rob-0.008
Whe-0.007
 Whe-0.007
 retri-0.007
Token he
Token ID339
Feature activation+3.94e+00

Pos logit contributions
ags+0.012
rob+0.011
Whe+0.010
 Whe+0.010
 Ow+0.010
Neg logit contributions
-0.010
ups-0.009
spe-0.009
"-0.009
J-0.008
Token record
Token ID1700
Feature activation+5.60e-01

Pos logit contributions
rob+0.147
ags+0.134
 Whe+0.123
Whe+0.122
 Ow+0.119
Neg logit contributions
-0.125
ups-0.097
"-0.093
 soon-0.093
oh-0.086
Token?"
Token ID1701
Feature activation-2.15e-01

Pos logit contributions
rob+0.019
ags+0.018
Whe+0.017
 Whe+0.016
 Ow+0.016
Neg logit contributions
-0.018
ups-0.015
"-0.015
 soon-0.014
washer-0.013
Token
Token ID198
Feature activation-8.29e-01

Pos logit contributions
+0.005
"+0.004
Mom+0.004
J+0.004
Jud+0.004
Neg logit contributions
rob-0.009
ags-0.009
Whe-0.008
 luggage-0.008
 retri-0.008
Token
Token ID198
Feature activation-8.43e-01

Pos logit contributions
+0.026
ups+0.025
spe+0.024
Jud+0.022
J+0.022
Neg logit contributions
ags-0.024
rob-0.024
 Whe-0.022
Whe-0.021
 Ow-0.020
Token"
Token ID1
Feature activation-1.84e-01

Pos logit contributions
spe+0.017
ups+0.017
+0.016
bug+0.015
edient+0.015
Neg logit contributions
ags-0.029
 Whe-0.028
Whe-0.026
rob-0.026
 Ow-0.026
Token
Token ID198
Feature activation-6.48e-01

Pos logit contributions
+0.021
ups+0.019
"+0.017
J+0.017
spe+0.017
Neg logit contributions
rob-0.025
ags-0.024
Whe-0.022
 Whe-0.021
 Ow-0.020
Token
Token ID198
Feature activation-6.53e-01

Pos logit contributions
+0.022
ups+0.020
spe+0.020
Jud+0.018
J+0.018
Neg logit contributions
ags-0.018
rob-0.018
 Whe-0.016
 Ow-0.015
Whe-0.015
Token"
Token ID1
Feature activation-6.92e-01

Pos logit contributions
spe+0.018
bell+0.018
ho+0.018
 quake+0.017
washer+0.017
Neg logit contributions
ags-0.018
rob-0.018
Whe-0.015
 Whe-0.015
 retri-0.015
TokenWhat
Token ID2061
Feature activation-3.25e-01

Pos logit contributions
+0.017
ups+0.015
spe+0.014
J+0.013
Jud+0.013
Neg logit contributions
ags-0.026
rob-0.026
 Whe-0.023
Whe-0.023
 Ow-0.022
Token does
Token ID857
Feature activation+1.31e-01

Pos logit contributions
+0.011
ups+0.010
"+0.009
spe+0.009
J+0.009
Neg logit contributions
ags-0.010
rob-0.009
Whe-0.008
 Whe-0.008
 retri-0.008
Token he
Token ID339
Feature activation+3.63e+00

Pos logit contributions
ags+0.005
rob+0.005
 Whe+0.004
Whe+0.004
 Ow+0.004
Neg logit contributions
-0.004
ups-0.003
spe-0.003
"-0.003
J-0.003
Token know
Token ID760
Feature activation+8.64e-01

Pos logit contributions
rob+0.135
ags+0.126
Whe+0.119
 Whe+0.117
 Ow+0.112
Neg logit contributions
-0.107
"-0.085
ups-0.085
 soon-0.082
oh-0.076
Token?"
Token ID1701
Feature activation-6.35e-01

Pos logit contributions
ags+0.029
rob+0.028
Whe+0.027
 Whe+0.026
 Ow+0.025
Neg logit contributions
-0.027
ups-0.024
"-0.022
 else-0.021
 soon-0.021
Token Lily
Token ID20037
Feature activation+5.12e-02

Pos logit contributions
+0.021
ups+0.020
spe+0.019
Jud+0.018
oh+0.017
Neg logit contributions
ags-0.018
rob-0.017
 Whe-0.016
Whe-0.015
 Ow-0.015
Token asks
Token ID7893
Feature activation-2.80e-01

Pos logit contributions
ags+0.002
rob+0.002
 Whe+0.001
Whe+0.001
 Ow+0.001
Neg logit contributions
-0.002
ups-0.002
"-0.001
spe-0.001
Jud-0.001
Token.
Token ID13
Feature activation-9.05e-01

Pos logit contributions
+0.007
spe+0.007
ups+0.007
Jud+0.006
J+0.006
Neg logit contributions
ags-0.010
rob-0.009
 Whe-0.009
 Ow-0.008
Whe-0.008
Token"
Token ID1
Feature activation-6.43e-02

Pos logit contributions
 Whe+0.008
rob+0.008
ags+0.007
Be+0.007
Whe+0.006
Neg logit contributions
spe-0.014
bell-0.013
bug-0.012
ho-0.012
anne-0.012
TokenSam
Token ID16305
Feature activation+8.68e-01

Pos logit contributions
+0.002
ups+0.002
spe+0.002
Jud+0.002
J+0.002
Neg logit contributions
ags-0.002
rob-0.002
 Whe-0.002
Whe-0.002
 Ow-0.002
Token,
Token ID11
Feature activation+4.23e-01

Pos logit contributions
ags+0.032
rob+0.031
 Whe+0.029
Whe+0.028
 Ow+0.027
Neg logit contributions
-0.022
ups-0.020
spe-0.018
"-0.017
J-0.017
Token what
Token ID644
Feature activation-7.33e-01

Pos logit contributions
ags+0.014
rob+0.014
 Whe+0.013
Whe+0.012
 Ow+0.012
Neg logit contributions
-0.012
ups-0.011
spe-0.010
Jud-0.010
J-0.010
Token did
Token ID750
Feature activation+3.80e-01

Pos logit contributions
+0.025
ups+0.025
spe+0.024
Jud+0.022
J+0.022
Neg logit contributions
ags-0.018
rob-0.018
 Whe-0.016
Whe-0.016
 Ow-0.016
Token he
Token ID339
Feature activation+3.55e+00

Pos logit contributions
ags+0.014
rob+0.014
 Whe+0.013
Whe+0.012
 Ow+0.012
Neg logit contributions
-0.010
ups-0.009
spe-0.008
J-0.008
Jud-0.008
Token do
Token ID466
Feature activation+1.56e+00

Pos logit contributions
ags+0.117
rob+0.112
Whe+0.105
 Whe+0.105
 luggage+0.096
Neg logit contributions
-0.112
ups-0.093
"-0.090
oh-0.089
 soon-0.087
Token?"
Token ID1701
Feature activation+1.80e-01

Pos logit contributions
ags+0.043
rob+0.043
Whe+0.037
 Whe+0.037
 Ow+0.035
Neg logit contributions
-0.054
ups-0.050
spe-0.046
"-0.046
J-0.045
Token Tom
Token ID4186
Feature activation-2.47e-01

Pos logit contributions
ags+0.005
rob+0.005
 Whe+0.005
Whe+0.005
 Ow+0.005
Neg logit contributions
-0.005
ups-0.005
spe-0.005
oh-0.005
Jud-0.005
Token asks
Token ID7893
Feature activation+3.36e-01

Pos logit contributions
+0.008
ups+0.008
spe+0.007
"+0.007
Jud+0.007
Neg logit contributions
ags-0.007
rob-0.007
 Whe-0.006
Whe-0.006
 Ow-0.006
Token.
Token ID13
Feature activation-5.99e-01

Pos logit contributions
ags+0.009
rob+0.008
 Ow+0.007
 Whe+0.007
 retri+0.007
Neg logit contributions
-0.011
spe-0.010
ups-0.010
oh-0.010
Jud-0.010
Token amazing
Token ID4998
Feature activation+2.39e-01

Pos logit contributions
+0.012
ups+0.012
spe+0.011
J+0.011
Jud+0.011
Neg logit contributions
ags-0.009
rob-0.009
 Whe-0.008
Whe-0.008
 Ow-0.007
Token adventure
Token ID8855
Feature activation-6.05e-02

Pos logit contributions
rob+0.007
ags+0.007
Whe+0.006
 Whe+0.006
 Ow+0.006
Neg logit contributions
-0.008
ups-0.007
spe-0.007
"-0.007
 soon-0.007
Token!
Token ID0
Feature activation+3.71e-01

Pos logit contributions
+0.001
ups+0.001
Jud+0.001
J+0.001
spe+0.001
Neg logit contributions
rob-0.002
ags-0.002
 Whe-0.002
Whe-0.002
 retri-0.002
Token<|endoftext|>
Token ID50256
Feature activation-2.95e-01

Pos logit contributions
ags+0.013
rob+0.012
Whe+0.011
 Whe+0.011
 Ow+0.010
Neg logit contributions
-0.011
ups-0.009
spe-0.009
"-0.009
J-0.009
TokenThe
Token ID464
Feature activation+6.88e-01

Pos logit contributions
+0.011
ups+0.009
"+0.009
J+0.009
spe+0.009
Neg logit contributions
ags-0.008
rob-0.008
 Whe-0.007
Whe-0.007
 Ow-0.006
Token other
Token ID584
Feature activation+3.52e+00

Pos logit contributions
ags+0.017
rob+0.016
 Whe+0.014
Whe+0.014
 luggage+0.013
Neg logit contributions
ups-0.024
-0.024
spe-0.022
Jud-0.022
J-0.022
Token day
Token ID1110
Feature activation+1.79e-01

Pos logit contributions
ags+0.095
rob+0.093
 Whe+0.081
Whe+0.079
 Ow+0.075
Neg logit contributions
-0.124
ups-0.116
spe-0.109
J-0.105
"-0.105
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
ags+0.006
rob+0.006
 Whe+0.005
Whe+0.005
 Ow+0.005
Neg logit contributions
ups-0.005
-0.005
spe-0.004
Jud-0.004
edient-0.004
Token Sarah
Token ID10490
Feature activation-4.74e-02

Pos logit contributions
+0.003
ups+0.003
spe+0.002
Jud+0.002
J+0.002
Neg logit contributions
ags-0.004
rob-0.004
 Whe-0.004
Whe-0.004
 Ow-0.004
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
+0.001
ups+0.001
spe+0.001
"+0.001
J+0.001
Neg logit contributions
ags-0.002
rob-0.002
 Whe-0.001
Whe-0.001
 Ow-0.001
Token her
Token ID607
Feature activation+1.36e+00

Pos logit contributions
ags+0.035
rob+0.034
 Whe+0.029
Whe+0.029
 Ow+0.028
Neg logit contributions
-0.040
ups-0.035
spe-0.033
Jud-0.033
"-0.033
Token there
Token ID612
Feature activation-3.85e-01

Pos logit contributions
+0.003
ups+0.003
spe+0.003
Jud+0.003
J+0.003
Neg logit contributions
ags-0.004
rob-0.004
 Whe-0.003
Whe-0.003
 Ow-0.003
Token!"
Token ID2474
Feature activation-1.64e-01

Pos logit contributions
+0.010
ups+0.009
"+0.008
spe+0.008
 soon+0.008
Neg logit contributions
rob-0.014
ags-0.014
Whe-0.013
 Whe-0.013
 Ow-0.012
Token
Token ID220
Feature activation-6.46e-01

Pos logit contributions
+0.004
ups+0.003
"+0.003
J+0.003
Jud+0.003
Neg logit contributions
rob-0.007
ags-0.006
Whe-0.006
 retri-0.006
 luggage-0.005
Token
Token ID198
Feature activation-6.36e-01

Pos logit contributions
+0.020
ups+0.018
spe+0.017
J+0.016
Jud+0.016
Neg logit contributions
ags-0.020
rob-0.020
 Whe-0.018
Whe-0.018
 Ow-0.017
TokenThe
Token ID464
Feature activation+7.87e-01

Pos logit contributions
+0.021
ups+0.020
spe+0.019
Jud+0.018
bug+0.017
Neg logit contributions
ags-0.017
rob-0.017
 Whe-0.016
Whe-0.015
 Ow-0.015
Token other
Token ID584
Feature activation+3.51e+00

Pos logit contributions
ags+0.013
rob+0.013
 Whe+0.011
Whe+0.011
 luggage+0.009
Neg logit contributions
-0.035
ups-0.032
spe-0.031
Jud-0.031
CR-0.031
Token child
Token ID1200
Feature activation+6.10e-01

Pos logit contributions
ags+0.084
rob+0.084
 Whe+0.073
Whe+0.071
 Ow+0.067
Neg logit contributions
-0.132
ups-0.124
spe-0.117
Jud-0.113
J-0.112
Token said
Token ID531
Feature activation+1.82e-01

Pos logit contributions
rob+0.018
ags+0.017
 Whe+0.017
 retri+0.016
Whe+0.016
Neg logit contributions
-0.016
ups-0.016
spe-0.015
Jud-0.015
J-0.015
Token,
Token ID11
Feature activation+4.57e-01

Pos logit contributions
 Whe+0.004
ags+0.004
 de+0.004
 Ow+0.004
 retri+0.004
Neg logit contributions
ups-0.006
-0.006
Jud-0.006
J-0.006
saw-0.006
Token "
Token ID366
Feature activation-2.15e-01

Pos logit contributions
ags+0.010
rob+0.010
 Whe+0.010
Whe+0.009
 Ow+0.008
Neg logit contributions
ups-0.017
-0.017
spe-0.016
Jud-0.015
J-0.015
TokenLet
Token ID5756
Feature activation+7.52e-01

Pos logit contributions
ups+0.008
+0.007
spe+0.007
edient+0.007
ry+0.007
Neg logit contributions
Whe-0.005
 Whe-0.005
rob-0.005
ags-0.004
 Ow-0.004
Token.
Token ID13
Feature activation-5.13e-01

Pos logit contributions
+0.003
ups+0.003
spe+0.003
J+0.003
Jud+0.003
Neg logit contributions
ags-0.006
rob-0.006
 Whe-0.005
Whe-0.005
 Ow-0.005
Token
Token ID220
Feature activation-1.89e-01

Pos logit contributions
+0.018
ups+0.016
spe+0.015
"+0.015
Jud+0.015
Neg logit contributions
rob-0.015
ags-0.014
Whe-0.012
 Whe-0.012
 retri-0.012
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
+0.005
ups+0.004
"+0.004
J+0.004
Jud+0.004
Neg logit contributions
rob-0.007
ags-0.007
 luggage-0.006
 retri-0.006
Whe-0.006
Token
Token ID198
Feature activation-2.26e-01

Pos logit contributions
+0.005
ups+0.005
spe+0.005
Jud+0.004
J+0.004
Neg logit contributions
ags-0.005
rob-0.005
 Whe-0.005
Whe-0.005
 Ow-0.004
TokenThe
Token ID464
Feature activation+9.81e-01

Pos logit contributions
spe+0.007
ups+0.007
washer+0.007
bug+0.007
edient+0.007
Neg logit contributions
 Whe-0.005
Whe-0.005
ags-0.005
rob-0.004
 Ow-0.004
Token other
Token ID584
Feature activation+3.46e+00

Pos logit contributions
ags+0.021
rob+0.020
Whe+0.018
 Whe+0.018
 luggage+0.015
Neg logit contributions
-0.039
ups-0.036
Jud-0.035
spe-0.035
CR-0.034
Token children
Token ID1751
Feature activation+5.99e-01

Pos logit contributions
ags+0.094
rob+0.092
 Whe+0.080
 Ow+0.078
Whe+0.077
Neg logit contributions
-0.121
ups-0.117
spe-0.108
J-0.106
"-0.103
Token cheered
Token ID34767
Feature activation-1.13e-01

Pos logit contributions
rob+0.019
ags+0.019
 Whe+0.017
Whe+0.017
 Ow+0.016
Neg logit contributions
-0.017
ups-0.016
Jud-0.014
spe-0.014
J-0.014
Token her
Token ID607
Feature activation+6.27e-01

Pos logit contributions
+0.004
ups+0.004
spe+0.003
J+0.003
Jud+0.003
Neg logit contributions
ags-0.003
rob-0.003
 Whe-0.003
Whe-0.003
 Ow-0.002
Token on
Token ID319
Feature activation+4.57e-01

Pos logit contributions
ags+0.018
rob+0.017
 Whe+0.015
Whe+0.015
 retri+0.014
Neg logit contributions
-0.021
ups-0.020
Jud-0.018
J-0.018
spe-0.018
Token,
Token ID11
Feature activation-3.50e-02

Pos logit contributions
rob+0.017
ags+0.016
Whe+0.015
 Whe+0.014
 Ow+0.014
Neg logit contributions
-0.013
ups-0.011
"-0.010
spe-0.010
J-0.010
Token often
Token ID1690
Feature activation+5.24e-01

Pos logit contributions
rob+0.024
ags+0.024
 Whe+0.021
Whe+0.021
 Ow+0.020
Neg logit contributions
-0.021
ups-0.019
spe-0.018
Jud-0.017
J-0.017
Token ignore
Token ID8856
Feature activation+9.46e-01

Pos logit contributions
ags+0.016
rob+0.016
 Whe+0.014
Whe+0.014
 Ow+0.014
Neg logit contributions
-0.016
ups-0.015
spe-0.014
J-0.013
Jud-0.013
Token him
Token ID683
Feature activation+1.05e+00

Pos logit contributions
rob+0.023
ags+0.023
Whe+0.019
 Whe+0.019
 Ow+0.018
Neg logit contributions
-0.036
ups-0.034
spe-0.032
J-0.031
"-0.031
Token.
Token ID13
Feature activation-9.10e-03

Pos logit contributions
rob+0.038
ags+0.038
 Whe+0.034
Whe+0.034
 Ow+0.032
Neg logit contributions
-0.028
ups-0.025
spe-0.023
"-0.022
J-0.021
Token The
Token ID383
Feature activation+1.06e+00

Pos logit contributions
+0.000
ups+0.000
spe+0.000
J+0.000
Jud+0.000
Neg logit contributions
ags-0.000
rob-0.000
 Whe-0.000
Whe-0.000
 Ow-0.000
Token other
Token ID584
Feature activation+3.44e+00

Pos logit contributions
rob+0.024
ags+0.024
 Whe+0.021
Whe+0.020
 Ow+0.019
Neg logit contributions
-0.041
ups-0.039
spe-0.036
Jud-0.036
J-0.035
Token day
Token ID1110
Feature activation+8.37e-02

Pos logit contributions
ags+0.091
rob+0.087
 Whe+0.077
Whe+0.075
 Ow+0.072
Neg logit contributions
-0.124
ups-0.118
spe-0.112
"-0.106
 soon-0.105
Token,
Token ID11
Feature activation+5.34e-01

Pos logit contributions
rob+0.002
 Whe+0.002
ags+0.002
Whe+0.002
 luggage+0.002
Neg logit contributions
-0.002
Jud-0.002
ups-0.002
spe-0.002
edient-0.002
Token he
Token ID339
Feature activation+4.89e-01

Pos logit contributions
ags+0.019
rob+0.019
Whe+0.016
 Whe+0.016
 Ow+0.016
Neg logit contributions
-0.015
ups-0.013
spe-0.012
J-0.012
"-0.012
Token decided
Token ID3066
Feature activation+1.35e-01

Pos logit contributions
rob+0.016
ags+0.016
 Whe+0.015
Whe+0.015
 retri+0.014
Neg logit contributions
-0.012
ups-0.011
spe-0.010
J-0.010
Jud-0.010
Token he
Token ID339
Feature activation+4.71e-01

Pos logit contributions
rob+0.004
ags+0.004
 Whe+0.004
Whe+0.004
 retri+0.003
Neg logit contributions
-0.004
ups-0.004
spe-0.003
Jud-0.003
J-0.003
Token up
Token ID510
Feature activation+1.23e+00

Pos logit contributions
ags+0.002
rob+0.002
 Whe+0.002
Whe+0.002
 luggage+0.002
Neg logit contributions
-0.002
ups-0.002
Jud-0.002
spe-0.002
ho-0.001
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
rob+0.043
ags+0.043
 Whe+0.038
Whe+0.038
 Ow+0.037
Neg logit contributions
-0.033
ups-0.030
spe-0.028
J-0.027
"-0.026
Token
Token ID198
Feature activation+6.99e-01

Pos logit contributions
+0.007
ups+0.006
J+0.006
spe+0.006
"+0.006
Neg logit contributions
rob-0.008
ags-0.007
 retri-0.006
Whe-0.006
 Whe-0.006
Token
Token ID198
Feature activation+3.68e-01

Pos logit contributions
ags+0.021
rob+0.021
 Whe+0.018
Whe+0.018
 Ow+0.017
Neg logit contributions
-0.022
ups-0.021
spe-0.019
Jud-0.018
J-0.018
TokenThe
Token ID464
Feature activation+9.35e-01

Pos logit contributions
Whe+0.010
 Whe+0.009
rob+0.009
ags+0.008
 Ow+0.008
Neg logit contributions
ho-0.010
spe-0.010
ups-0.010
edient-0.010
washer-0.010
Token other
Token ID584
Feature activation+3.41e+00

Pos logit contributions
rob+0.017
 luggage+0.017
 Whe+0.017
 de+0.016
Whe+0.016
Neg logit contributions
ups-0.035
-0.034
Jud-0.034
CR-0.033
bugs-0.031
Token children
Token ID1751
Feature activation+5.23e-01

Pos logit contributions
ags+0.106
rob+0.105
 Whe+0.093
Whe+0.092
 Ow+0.088
Neg logit contributions
-0.105
ups-0.098
spe-0.091
J-0.087
Jud-0.087
Token noticed
Token ID6810
Feature activation+2.58e-01

Pos logit contributions
rob+0.016
ags+0.016
 Whe+0.015
Whe+0.015
 luggage+0.014
Neg logit contributions
ups-0.014
-0.013
Jud-0.012
J-0.012
spe-0.012
Token she
Token ID673
Feature activation+2.87e-01

Pos logit contributions
ags+0.008
rob+0.008
 Whe+0.007
Whe+0.007
 Ow+0.007
Neg logit contributions
-0.008
ups-0.007
spe-0.007
J-0.007
Jud-0.007
Token was
Token ID373
Feature activation+2.07e-01

Pos logit contributions
ags+0.010
rob+0.010
 Whe+0.009
Whe+0.009
 retri+0.009
Neg logit contributions
-0.007
ups-0.007
spe-0.006
Jud-0.006
J-0.006
Token wearing
Token ID5762
Feature activation+4.58e-01

Pos logit contributions
ags+0.006
rob+0.006
 Whe+0.005
Whe+0.005
 retri+0.005
Neg logit contributions
-0.007
ups-0.006
spe-0.006
Jud-0.006
"-0.006
Token the
Token ID262
Feature activation-4.56e-01

Pos logit contributions
rob+0.023
ags+0.023
Whe+0.020
 Whe+0.020
 Ow+0.019
Neg logit contributions
-0.021
ups-0.019
"-0.017
spe-0.017
 soon-0.017
Token hero
Token ID4293
Feature activation-4.27e-02

Pos logit contributions
+0.013
ups+0.011
spe+0.011
J+0.010
"+0.010
Neg logit contributions
ags-0.016
rob-0.015
 Whe-0.014
Whe-0.014
 Ow-0.013
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
+0.001
ups+0.001
Jud+0.001
J+0.001
spe+0.001
Neg logit contributions
rob-0.001
ags-0.001
 Whe-0.001
 retri-0.001
 luggage-0.001
Token What
Token ID1867
Feature activation+9.91e-02

Pos logit contributions
ags+0.004
rob+0.004
 Whe+0.004
Whe+0.004
 Ow+0.003
Neg logit contributions
-0.004
ups-0.004
spe-0.004
J-0.004
Jud-0.004
Token did
Token ID750
Feature activation+3.01e-01

Pos logit contributions
ags+0.003
rob+0.003
Whe+0.003
 Whe+0.003
 retri+0.002
Neg logit contributions
-0.003
ups-0.003
"-0.003
spe-0.003
J-0.003
Token he
Token ID339
Feature activation+3.40e+00

Pos logit contributions
ags+0.011
rob+0.011
 Whe+0.010
Whe+0.010
 Ow+0.009
Neg logit contributions
-0.008
ups-0.007
spe-0.006
Jud-0.006
J-0.006
Token do
Token ID466
Feature activation+1.49e+00

Pos logit contributions
rob+0.125
ags+0.122
 Whe+0.115
Whe+0.114
 Ow+0.107
Neg logit contributions
-0.102
"-0.079
 soon-0.078
ups-0.078
 else-0.072
Token?"
Token ID1701
Feature activation-3.11e-01

Pos logit contributions
rob+0.051
ags+0.049
Whe+0.047
 Ow+0.045
 Whe+0.045
Neg logit contributions
-0.044
"-0.039
ups-0.039
 soon-0.036
J-0.034
Token
Token ID198
Feature activation-3.89e-01

Pos logit contributions
+0.010
ups+0.009
spe+0.009
Jud+0.008
J+0.008
Neg logit contributions
ags-0.009
rob-0.009
 Whe-0.008
Whe-0.008
 Ow-0.008
Token
Token ID198
Feature activation-4.77e-01

Pos logit contributions
+0.014
ups+0.013
spe+0.013
Jud+0.012
bug+0.012
Neg logit contributions
ags-0.010
rob-0.009
 Whe-0.009
 Ow-0.008
Whe-0.008
TokenHis
Token ID6653
Feature activation-9.77e-02

Pos logit contributions
bugs+0.018
spe+0.018
bug+0.018
 pod+0.018
ups+0.018
Neg logit contributions
 Whe-0.006
ags-0.006
Whe-0.006
 Ow-0.005
 brid-0.005
Token
Token ID198
Feature activation+1.78e-01

Pos logit contributions
+0.003
ups+0.003
"+0.003
Jud+0.003
J+0.003
Neg logit contributions
rob-0.004
ags-0.004
Whe-0.003
 retri-0.003
 luggage-0.003
Token
Token ID198
Feature activation+6.66e-02

Pos logit contributions
ags+0.006
rob+0.006
Whe+0.005
 Whe+0.005
 retri+0.005
Neg logit contributions
-0.005
ups-0.005
"-0.004
spe-0.004
J-0.004
Token"
Token ID1
Feature activation+8.88e-02

Pos logit contributions
ags+0.002
 Whe+0.002
 Ow+0.002
Whe+0.002
rob+0.001
Neg logit contributions
spe-0.002
washer-0.002
keeper-0.002
bell-0.002
bug-0.002
TokenWhy
Token ID5195
Feature activation+1.11e-02

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.003
 luggage+0.003
 Ow+0.003
Neg logit contributions
-0.002
ups-0.002
J-0.002
Mom-0.002
"-0.002
Token does
Token ID857
Feature activation-4.38e-01

Pos logit contributions
rob+0.000
ags+0.000
Whe+0.000
 retri+0.000
 Whe+0.000
Neg logit contributions
-0.000
"-0.000
ups-0.000
 soon-0.000
spe-0.000
Token he
Token ID339
Feature activation+3.40e+00

Pos logit contributions
+0.013
ups+0.013
spe+0.012
J+0.011
Jud+0.011
Neg logit contributions
ags-0.013
rob-0.013
 Whe-0.012
Whe-0.012
 Ow-0.011
Token need
Token ID761
Feature activation+4.98e-01

Pos logit contributions
rob+0.131
ags+0.123
 Whe+0.115
Whe+0.111
 Ow+0.106
Neg logit contributions
-0.101
ups-0.076
 soon-0.075
"-0.070
bug-0.069
Token to
Token ID284
Feature activation+5.50e-01

Pos logit contributions
rob+0.018
ags+0.018
Whe+0.017
 Whe+0.015
 retri+0.015
Neg logit contributions
-0.015
ups-0.013
 soon-0.012
"-0.012
 else-0.011
Token check
Token ID2198
Feature activation+5.08e-01

Pos logit contributions
rob+0.020
ags+0.019
 Whe+0.018
Whe+0.017
 retri+0.016
Neg logit contributions
-0.016
ups-0.014
"-0.013
 soon-0.013
Mom-0.012
Token our
Token ID674
Feature activation-1.07e+00

Pos logit contributions
rob+0.019
ags+0.017
Whe+0.017
 Whe+0.016
ses+0.016
Neg logit contributions
-0.014
ups-0.013
"-0.012
 soon-0.012
ful-0.010
Token lungs
Token ID21726
Feature activation+1.53e+00

Pos logit contributions
ups+0.037
+0.035
 soon+0.031
spe+0.031
"+0.030
Neg logit contributions
rob-0.032
ags-0.028
Whe-0.027
 Whe-0.027
 retri-0.024
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
ags+0.010
rob+0.010
 Whe+0.009
 Ow+0.008
Whe+0.008
Neg logit contributions
-0.011
ups-0.011
spe-0.010
J-0.009
Jud-0.009
Token sand
Token ID6450
Feature activation+8.58e-01

Pos logit contributions
ags+0.000
rob+0.000
 luggage+0.000
 Whe+0.000
 freshly+0.000
Neg logit contributions
ups-0.001
-0.001
spe-0.001
Jud-0.001
J-0.001
Token box
Token ID3091
Feature activation+1.97e-01

Pos logit contributions
rob+0.020
ags+0.019
 Whe+0.016
 luggage+0.016
 de+0.016
Neg logit contributions
-0.031
Jud-0.028
ups-0.028
J-0.027
spe-0.027
Token.
Token ID13
Feature activation+8.61e-02

Pos logit contributions
ags+0.005
rob+0.005
 Whe+0.004
 retri+0.004
 luggage+0.004
Neg logit contributions
-0.006
ups-0.006
Jud-0.006
J-0.006
spe-0.006
Token The
Token ID383
Feature activation+1.01e+00

Pos logit contributions
rob+0.003
ags+0.003
Whe+0.002
 retri+0.002
 luggage+0.002
Neg logit contributions
-0.003
ups-0.002
"-0.002
spe-0.002
J-0.002
Token other
Token ID584
Feature activation+3.38e+00

Pos logit contributions
ags+0.025
rob+0.025
 Whe+0.022
 luggage+0.021
Whe+0.021
Neg logit contributions
-0.037
ups-0.034
spe-0.032
Jud-0.032
J-0.032
Token child
Token ID1200
Feature activation+4.91e-01

Pos logit contributions
ags+0.087
rob+0.086
 Whe+0.075
Whe+0.074
 Ow+0.070
Neg logit contributions
-0.123
ups-0.114
spe-0.108
Jud-0.104
J-0.104
Token shouted
Token ID17293
Feature activation+2.74e-01

Pos logit contributions
rob+0.012
 Ow+0.012
 Whe+0.012
 de+0.011
ags+0.011
Neg logit contributions
ups-0.013
J-0.013
Jud-0.013
-0.012
spe-0.012
Token:
Token ID25
Feature activation-2.04e-01

Pos logit contributions
ags+0.009
 Whe+0.008
rob+0.008
 Ow+0.007
Whe+0.007
Neg logit contributions
-0.008
ups-0.008
spe-0.007
Jud-0.007
J-0.007
Token "
Token ID366
Feature activation+1.09e-02

Pos logit contributions
ups+0.007
+0.007
spe+0.007
Jud+0.006
J+0.006
Neg logit contributions
ags-0.005
 Whe-0.005
rob-0.005
 Ow-0.004
Whe-0.004
TokenHelp
Token ID22087
Feature activation+7.15e-01

Pos logit contributions
rob+0.000
ags+0.000
Whe+0.000
 Whe+0.000
 Ow+0.000
Neg logit contributions
-0.000
ups-0.000
spe-0.000
Jud-0.000
"-0.000
Token
Token ID198
Feature activation+6.68e-01

Pos logit contributions
rob+0.018
ags+0.018
Whe+0.014
 luggage+0.014
 retri+0.014
Neg logit contributions
-0.015
ups-0.013
J-0.012
"-0.012
spe-0.011
Token
Token ID198
Feature activation+5.36e-01

Pos logit contributions
ags+0.020
rob+0.020
 Whe+0.018
Whe+0.017
 Ow+0.017
Neg logit contributions
-0.021
ups-0.020
spe-0.018
Jud-0.017
J-0.017
TokenA
Token ID32
Feature activation-3.88e-02

Pos logit contributions
rob+0.017
Whe+0.016
 Whe+0.016
ags+0.016
 retri+0.014
Neg logit contributions
spe-0.013
ups-0.013
-0.013
bug-0.013
ho-0.012
Tokenki
Token ID4106
Feature activation+1.24e-01

Pos logit contributions
+0.001
ups+0.001
spe+0.001
bug+0.001
CR+0.001
Neg logit contributions
rob-0.001
 Whe-0.001
ags-0.001
 retri-0.001
Whe-0.001
Token and
Token ID290
Feature activation+5.65e-01

Pos logit contributions
rob+0.005
ags+0.004
Whe+0.004
 Whe+0.004
 retri+0.004
Neg logit contributions
-0.003
ups-0.003
spe-0.003
Jud-0.002
J-0.002
Token M
Token ID337
Feature activation+3.30e+00

Pos logit contributions
ags+0.017
rob+0.016
 Whe+0.014
Whe+0.014
 Ow+0.014
Neg logit contributions
-0.019
ups-0.017
spe-0.016
J-0.016
"-0.015
Tokenaki
Token ID8182
Feature activation+4.31e-01

Pos logit contributions
rob+0.100
ags+0.099
 Whe+0.095
Whe+0.088
 Ow+0.088
Neg logit contributions
ups-0.102
-0.098
J-0.087
oh-0.087
"-0.080
Token were
Token ID547
Feature activation-2.96e-02

Pos logit contributions
rob+0.016
ags+0.015
Whe+0.014
 Whe+0.014
 retri+0.014
Neg logit contributions
-0.010
ups-0.009
Jud-0.008
spe-0.008
J-0.008
Token very
Token ID845
Feature activation-3.98e-01

Pos logit contributions
+0.001
ups+0.001
Jud+0.001
J+0.001
spe+0.001
Neg logit contributions
ags-0.001
rob-0.001
 Whe-0.001
 Robbie-0.001
Whe-0.001
Token sad
Token ID6507
Feature activation-9.49e-03

Pos logit contributions
ups+0.013
Jud+0.012
+0.012
J+0.012
oh+0.012
Neg logit contributions
ags-0.009
 de-0.008
 Whe-0.008
 freshly-0.008
 retri-0.007
Token.
Token ID13
Feature activation+5.63e-01

Pos logit contributions
+0.000
ups+0.000
spe+0.000
 soon+0.000
"+0.000
Neg logit contributions
rob-0.000
ags-0.000
Whe-0.000
 Whe-0.000
 Ow-0.000
Token sand
Token ID6450
Feature activation+8.66e-02

Pos logit contributions
ags+0.001
rob+0.001
 Whe+0.000
Whe+0.000
 Ow+0.000
Neg logit contributions
-0.001
ups-0.001
spe-0.001
Jud-0.001
J-0.001
Token.
Token ID13
Feature activation-9.99e-01

Pos logit contributions
ags+0.002
rob+0.002
 de+0.002
 luggage+0.002
 fur+0.002
Neg logit contributions
-0.003
Jud-0.003
J-0.003
Mom-0.003
spe-0.002
Token
Token ID198
Feature activation+2.19e-02

Pos logit contributions
+0.033
ups+0.031
spe+0.029
J+0.028
Jud+0.028
Neg logit contributions
ags-0.028
rob-0.028
 Whe-0.025
Whe-0.024
 Ow-0.023
Token
Token ID198
Feature activation-2.27e-01

Pos logit contributions
ags+0.001
rob+0.001
 Whe+0.001
Whe+0.001
 Ow+0.001
Neg logit contributions
-0.001
ups-0.001
spe-0.001
Jud-0.001
J-0.001
TokenThe
Token ID464
Feature activation+8.89e-01

Pos logit contributions
ho+0.007
bug+0.007
ups+0.007
Jud+0.006
 pod+0.006
Neg logit contributions
 Whe-0.005
Whe-0.005
ags-0.005
rob-0.005
 Ow-0.005
Token other
Token ID584
Feature activation+3.29e+00

Pos logit contributions
rob+0.013
ags+0.013
 luggage+0.012
 Whe+0.010
 favorites+0.010
Neg logit contributions
-0.039
Jud-0.036
ups-0.036
spe-0.035
bug-0.035
Token child
Token ID1200
Feature activation+3.50e-01

Pos logit contributions
ags+0.080
rob+0.079
 Whe+0.068
Whe+0.067
 Ow+0.063
Neg logit contributions
-0.124
ups-0.116
spe-0.110
Jud-0.106
J-0.106
Token said
Token ID531
Feature activation-2.36e-01

Pos logit contributions
rob+0.009
 Whe+0.008
ags+0.008
 Ow+0.008
 de+0.008
Neg logit contributions
J-0.009
ups-0.009
Jud-0.009
-0.009
saw-0.009
Token:
Token ID25
Feature activation-3.90e-01

Pos logit contributions
ups+0.007
+0.007
Jud+0.007
spe+0.007
J+0.007
Neg logit contributions
 Whe-0.006
ags-0.006
rob-0.006
 Ow-0.006
 Robbie-0.005
Token "
Token ID366
Feature activation+8.59e-02

Pos logit contributions
ups+0.015
+0.014
spe+0.013
Jud+0.013
edient+0.013
Neg logit contributions
 Whe-0.008
ags-0.008
rob-0.008
 Ow-0.007
Whe-0.007
TokenStop
Token ID19485
Feature activation+1.81e-01

Pos logit contributions
rob+0.003
ags+0.003
Whe+0.003
 Whe+0.003
 Ow+0.002
Neg logit contributions
-0.002
ups-0.002
spe-0.002
Jud-0.002
"-0.002
Token child
Token ID1200
Feature activation-6.06e-01

Pos logit contributions
rob+0.038
ags+0.037
 Whe+0.031
Whe+0.031
 luggage+0.029
Neg logit contributions
-0.081
ups-0.076
Jud-0.073
spe-0.072
CR-0.070
Token.
Token ID13
Feature activation-8.79e-01

Pos logit contributions
Jud+0.018
J+0.018
ups+0.018
CR+0.018
Mom+0.018
Neg logit contributions
ags-0.015
 retri-0.013
 Whe-0.013
 de-0.013
rob-0.012
Token
Token ID198
Feature activation-6.79e-01

Pos logit contributions
+0.026
ups+0.023
"+0.022
spe+0.021
J+0.021
Neg logit contributions
rob-0.030
ags-0.029
Whe-0.026
 luggage-0.024
 Whe-0.024
Token
Token ID198
Feature activation-6.55e-01

Pos logit contributions
+0.022
ups+0.021
spe+0.021
Jud+0.019
bug+0.019
Neg logit contributions
ags-0.018
rob-0.018
 Whe-0.016
Whe-0.015
 Ow-0.015
TokenThe
Token ID464
Feature activation+9.59e-01

Pos logit contributions
ho+0.018
bug+0.017
 pod+0.017
washer+0.016
spe+0.016
Neg logit contributions
ags-0.017
Whe-0.016
 Whe-0.015
rob-0.014
IT-0.014
Token other
Token ID584
Feature activation+3.29e+00

Pos logit contributions
rob+0.018
ags+0.017
 Whe+0.015
Whe+0.014
 luggage+0.013
Neg logit contributions
-0.041
ups-0.038
Jud-0.037
CR-0.036
spe-0.036
Token child
Token ID1200
Feature activation+1.95e-01

Pos logit contributions
ags+0.088
rob+0.085
 Whe+0.075
Whe+0.073
 Ow+0.069
Neg logit contributions
-0.119
ups-0.113
spe-0.105
J-0.103
"-0.100
Token took
Token ID1718
Feature activation+1.43e-01

Pos logit contributions
rob+0.006
ags+0.006
 Whe+0.005
Whe+0.005
 retri+0.005
Neg logit contributions
-0.005
ups-0.005
Jud-0.005
spe-0.005
J-0.005
Token a
Token ID257
Feature activation-8.40e-01

Pos logit contributions
ags+0.004
rob+0.004
 Whe+0.004
Whe+0.004
 Ow+0.003
Neg logit contributions
-0.005
ups-0.004
spe-0.004
J-0.004
Jud-0.004
Token deep
Token ID2769
Feature activation-7.63e-01

Pos logit contributions
ups+0.025
+0.024
spe+0.022
J+0.021
edient+0.021
Neg logit contributions
rob-0.024
 Whe-0.024
ags-0.024
 Ow-0.022
Whe-0.022
Token breath
Token ID8033
Feature activation+8.37e-01

Pos logit contributions
+0.027
ups+0.025
spe+0.023
J+0.022
Jud+0.022
Neg logit contributions
ags-0.021
rob-0.020
 Whe-0.018
Whe-0.017
 Ow-0.017
Token over
Token ID625
Feature activation+1.28e-01

Pos logit contributions
+0.027
ups+0.027
spe+0.027
Jud+0.025
edient+0.024
Neg logit contributions
ags-0.019
rob-0.018
 Whe-0.016
 Ow-0.015
Whe-0.015
Token a
Token ID257
Feature activation-6.35e-01

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.003
Whe+0.003
 Ow+0.003
Neg logit contributions
-0.005
ups-0.004
spe-0.004
J-0.004
Jud-0.004
Token cable
Token ID7862
Feature activation-9.32e-01

Pos logit contributions
ups+0.020
+0.020
spe+0.018
Jud+0.018
J+0.018
Neg logit contributions
rob-0.017
 Whe-0.016
ags-0.016
Whe-0.015
 luggage-0.015
Token.
Token ID13
Feature activation-3.81e-01

Pos logit contributions
ups+0.030
+0.028
Jud+0.027
J+0.027
spe+0.027
Neg logit contributions
rob-0.024
ags-0.024
 Whe-0.022
 retri-0.022
 luggage-0.020
Token The
Token ID383
Feature activation+9.63e-01

Pos logit contributions
+0.014
ups+0.012
spe+0.012
"+0.012
J+0.011
Neg logit contributions
rob-0.010
ags-0.010
Whe-0.009
 Whe-0.008
 retri-0.008
Token other
Token ID584
Feature activation+3.27e+00

Pos logit contributions
ags+0.022
rob+0.021
 Whe+0.018
Whe+0.017
 luggage+0.017
Neg logit contributions
-0.038
ups-0.035
spe-0.033
Jud-0.033
J-0.033
Token child
Token ID1200
Feature activation+2.25e-01

Pos logit contributions
ags+0.080
rob+0.080
 Whe+0.069
Whe+0.067
 Ow+0.064
Neg logit contributions
-0.122
ups-0.115
spe-0.108
J-0.104
Jud-0.104
Token saw
Token ID2497
Feature activation-5.38e-01

Pos logit contributions
rob+0.006
 Whe+0.006
ags+0.006
 Ow+0.005
 retri+0.005
Neg logit contributions
ups-0.006
Jud-0.006
-0.006
J-0.006
spe-0.006
Token this
Token ID428
Feature activation+2.03e-01

Pos logit contributions
+0.017
ups+0.016
spe+0.014
J+0.014
"+0.014
Neg logit contributions
ags-0.017
rob-0.016
Whe-0.015
 Whe-0.014
 retri-0.014
Token and
Token ID290
Feature activation-5.18e-01

Pos logit contributions
ags+0.007
rob+0.007
Whe+0.006
 Whe+0.006
 Ow+0.006
Neg logit contributions
-0.007
"-0.005
ups-0.005
spe-0.005
 soon-0.005
Token smiled
Token ID13541
Feature activation-7.31e-01

Pos logit contributions
ups+0.015
+0.014
spe+0.013
edient+0.013
Jud+0.013
Neg logit contributions
rob-0.014
 Whe-0.013
ags-0.012
 Robbie-0.012
 Bend-0.012
Token
Token ID198
Feature activation-7.88e-01

Pos logit contributions
+0.021
ups+0.019
spe+0.018
"+0.017
J+0.017
Neg logit contributions
rob-0.020
ags-0.020
Whe-0.018
 Whe-0.018
 Ow-0.017
Token
Token ID198
Feature activation-6.77e-01

Pos logit contributions
+0.028
spe+0.026
ups+0.026
Jud+0.024
bug+0.023
Neg logit contributions
ags-0.020
rob-0.019
 Whe-0.018
 Ow-0.016
Whe-0.016
Token"
Token ID1
Feature activation-7.64e-01

Pos logit contributions
orse+0.021
spe+0.020
antis+0.020
oles+0.020
bell+0.020
Neg logit contributions
ags-0.015
rob-0.015
Whe-0.013
 brid-0.012
 Whe-0.012
TokenWhat
Token ID2061
Feature activation-3.18e-01

Pos logit contributions
+0.023
ups+0.021
spe+0.020
J+0.019
"+0.019
Neg logit contributions
ags-0.024
rob-0.024
 Whe-0.021
Whe-0.020
 Ow-0.020
Token did
Token ID750
Feature activation+5.54e-01

Pos logit contributions
+0.011
ups+0.009
"+0.009
J+0.008
spe+0.008
Neg logit contributions
ags-0.010
rob-0.010
Whe-0.008
 Whe-0.008
 retri-0.008
Token she
Token ID673
Feature activation+3.26e+00

Pos logit contributions
ags+0.023
rob+0.023
Whe+0.021
 Whe+0.020
 Ow+0.020
Neg logit contributions
-0.012
ups-0.010
"-0.009
spe-0.008
J-0.008
Token say
Token ID910
Feature activation+2.71e-01

Pos logit contributions
rob+0.123
ags+0.118
Whe+0.111
 Whe+0.110
 Ow+0.103
Neg logit contributions
-0.091
 soon-0.071
ups-0.070
"-0.068
oh-0.064
Token?"
Token ID1701
Feature activation-3.62e-01

Pos logit contributions
rob+0.008
ags+0.008
Whe+0.007
 Whe+0.007
 Ow+0.007
Neg logit contributions
-0.009
ups-0.008
"-0.008
 soon-0.007
J-0.007
Token Lily
Token ID20037
Feature activation-1.62e-01

Pos logit contributions
+0.013
ups+0.013
spe+0.012
edient+0.012
oh+0.012
Neg logit contributions
ags-0.008
 Whe-0.008
rob-0.008
Whe-0.008
 retri-0.007
Token asks
Token ID7893
Feature activation+1.54e-02

Pos logit contributions
+0.005
ups+0.005
spe+0.004
"+0.004
Jud+0.004
Neg logit contributions
ags-0.005
rob-0.005
 Whe-0.004
Whe-0.004
 Ow-0.004
Token.
Token ID13
Feature activation-7.63e-01

Pos logit contributions
ags+0.001
rob+0.000
 Whe+0.000
 Ow+0.000
Whe+0.000
Neg logit contributions
-0.000
spe-0.000
ups-0.000
Jud-0.000
edient-0.000
Token march
Token ID9960
Feature activation+5.50e-01

Pos logit contributions
rob+0.030
ags+0.029
 Whe+0.027
Whe+0.026
 Ow+0.025
Neg logit contributions
-0.019
ups-0.016
spe-0.015
"-0.014
J-0.014
Token back
Token ID736
Feature activation+1.56e+00

Pos logit contributions
rob+0.022
ags+0.021
Whe+0.019
 Ow+0.019
 Whe+0.019
Neg logit contributions
-0.013
ups-0.012
spe-0.010
 soon-0.010
"-0.010
Token home
Token ID1363
Feature activation+1.57e+00

Pos logit contributions
rob+0.057
ags+0.056
Whe+0.051
 Whe+0.050
 Ow+0.049
Neg logit contributions
-0.041
ups-0.037
spe-0.034
 soon-0.033
"-0.032
Token.
Token ID13
Feature activation+2.89e-01

Pos logit contributions
ags+0.054
rob+0.052
 Whe+0.048
Whe+0.047
 retri+0.045
Neg logit contributions
-0.041
ups-0.038
Jud-0.034
spe-0.034
J-0.033
Token The
Token ID383
Feature activation+9.68e-01

Pos logit contributions
rob+0.010
ags+0.010
Whe+0.008
 retri+0.008
 luggage+0.008
Neg logit contributions
-0.009
spe-0.007
ups-0.007
"-0.007
J-0.007
Token other
Token ID584
Feature activation+3.24e+00

Pos logit contributions
ags+0.029
rob+0.029
 Whe+0.025
Whe+0.025
 Ow+0.024
Neg logit contributions
-0.032
ups-0.029
spe-0.027
J-0.026
Jud-0.026
Token old
Token ID1468
Feature activation+2.17e+00

Pos logit contributions
ags+0.093
rob+0.090
 Whe+0.082
Whe+0.078
 Ow+0.078
Neg logit contributions
-0.109
ups-0.107
"-0.095
spe-0.095
J-0.094
Token men
Token ID1450
Feature activation+8.61e-01

Pos logit contributions
rob+0.071
ags+0.071
 Whe+0.063
Whe+0.062
 Ow+0.060
Neg logit contributions
-0.064
ups-0.058
spe-0.054
Jud-0.052
J-0.051
Token said
Token ID531
Feature activation+3.07e-01

Pos logit contributions
ags+0.030
rob+0.030
 Whe+0.027
Whe+0.027
 Ow+0.026
Neg logit contributions
-0.023
ups-0.021
spe-0.019
J-0.018
Jud-0.018
Token goodbye
Token ID24829
Feature activation+8.99e-02

Pos logit contributions
ags+0.008
rob+0.008
 Whe+0.008
Whe+0.007
 Ow+0.007
Neg logit contributions
-0.010
ups-0.010
spe-0.009
Jud-0.009
J-0.009
Token and
Token ID290
Feature activation+9.51e-01

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.002
Whe+0.002
 Ow+0.002
Neg logit contributions
-0.003
ups-0.003
spe-0.003
J-0.002
"-0.002
Token that
Token ID326
Feature activation+3.82e-01

Pos logit contributions
+0.001
ups+0.001
spe+0.001
J+0.001
Jud+0.001
Neg logit contributions
ags-0.001
rob-0.001
 Whe-0.001
Whe-0.001
 Ow-0.001
Token man
Token ID582
Feature activation-3.24e-01

Pos logit contributions
ags+0.013
rob+0.012
Whe+0.011
 Whe+0.010
 Ow+0.010
Neg logit contributions
-0.012
"-0.010
ups-0.010
 soon-0.009
oh-0.009
Token?
Token ID30
Feature activation-3.60e-01

Pos logit contributions
+0.011
ups+0.010
spe+0.009
J+0.009
Jud+0.009
Neg logit contributions
ags-0.009
rob-0.009
 Whe-0.008
Whe-0.008
 Ow-0.007
Token Why
Token ID4162
Feature activation-1.68e-01

Pos logit contributions
ups+0.011
+0.011
spe+0.010
Jud+0.010
J+0.010
Neg logit contributions
rob-0.010
ags-0.010
 Whe-0.010
Whe-0.009
 Ow-0.009
Token does
Token ID857
Feature activation-7.17e-01

Pos logit contributions
+0.005
ups+0.005
"+0.005
spe+0.004
 else+0.004
Neg logit contributions
ags-0.006
rob-0.005
Whe-0.005
 Whe-0.005
 retri-0.004
Token he
Token ID339
Feature activation+3.23e+00

Pos logit contributions
+0.022
ups+0.021
spe+0.020
Jud+0.020
J+0.020
Neg logit contributions
rob-0.020
ags-0.020
 Whe-0.019
Whe-0.018
 Ow-0.017
Token have
Token ID423
Feature activation+7.83e-01

Pos logit contributions
rob+0.119
ags+0.118
 Whe+0.106
Whe+0.105
 Ow+0.103
Neg logit contributions
-0.095
"-0.071
ups-0.071
 soon-0.070
 else-0.067
Token a
Token ID257
Feature activation+2.56e-01

Pos logit contributions
ags+0.026
rob+0.025
Whe+0.022
 Whe+0.021
 retri+0.020
Neg logit contributions
-0.025
ups-0.022
"-0.020
 soon-0.020
 else-0.019
Token whistle
Token ID16121
Feature activation+8.26e-01

Pos logit contributions
ags+0.008
rob+0.008
Whe+0.007
 Whe+0.006
 Ow+0.006
Neg logit contributions
-0.008
"-0.007
ups-0.007
spe-0.007
 soon-0.006
Token?"
Token ID1701
Feature activation-7.93e-02

Pos logit contributions
rob+0.026
ags+0.026
Whe+0.023
 Whe+0.023
 Ow+0.022
Neg logit contributions
-0.025
ups-0.023
"-0.022
 soon-0.021
spe-0.021
Token
Token ID198
Feature activation-2.26e-01

Pos logit contributions
+0.003
ups+0.002
spe+0.002
J+0.002
Jud+0.002
Neg logit contributions
ags-0.002
rob-0.002
 Whe-0.002
Whe-0.002
 Ow-0.002
Token
Token ID198
Feature activation+6.09e-01

Pos logit contributions
rob+0.020
ags+0.019
Whe+0.016
 Whe+0.016
 luggage+0.015
Neg logit contributions
-0.022
ups-0.019
"-0.018
J-0.018
spe-0.018
Token
Token ID198
Feature activation+5.15e-01

Pos logit contributions
ags+0.018
rob+0.017
 Whe+0.016
Whe+0.015
 Ow+0.015
Neg logit contributions
-0.019
ups-0.018
spe-0.017
Jud-0.016
J-0.016
TokenA
Token ID32
Feature activation-7.43e-02

Pos logit contributions
rob+0.016
Whe+0.016
 Whe+0.016
ags+0.016
 Ow+0.014
Neg logit contributions
ups-0.013
spe-0.013
-0.012
bug-0.012
ho-0.012
Tokenki
Token ID4106
Feature activation-8.06e-02

Pos logit contributions
+0.002
ups+0.002
Jud+0.002
bug+0.002
spe+0.002
Neg logit contributions
rob-0.002
 Whe-0.002
 retri-0.002
 luggage-0.002
ags-0.002
Token and
Token ID290
Feature activation+7.34e-01

Pos logit contributions
+0.002
ups+0.002
spe+0.002
Jud+0.002
J+0.002
Neg logit contributions
rob-0.003
ags-0.003
Whe-0.003
 Whe-0.002
 retri-0.002
Token M
Token ID337
Feature activation+3.22e+00

Pos logit contributions
ags+0.020
rob+0.020
 Whe+0.018
Whe+0.017
 Ow+0.016
Neg logit contributions
-0.025
ups-0.024
spe-0.022
Jud-0.021
J-0.021
Tokenaki
Token ID8182
Feature activation+4.81e-01

Pos logit contributions
rob+0.097
ags+0.096
 Whe+0.091
Whe+0.085
 Ow+0.082
Neg logit contributions
ups-0.101
-0.098
oh-0.086
J-0.086
"-0.081
Token apologized
Token ID22893
Feature activation+6.42e-01

Pos logit contributions
rob+0.017
ags+0.017
 Whe+0.015
Whe+0.015
 Ow+0.015
Neg logit contributions
-0.011
ups-0.011
Jud-0.009
J-0.009
spe-0.009
Token,
Token ID11
Feature activation+4.76e-01

Pos logit contributions
ags+0.021
rob+0.021
 Whe+0.019
Whe+0.019
 Ow+0.018
Neg logit contributions
-0.019
ups-0.017
spe-0.016
"-0.015
J-0.015
Token but
Token ID475
Feature activation+1.40e+00

Pos logit contributions
ags+0.015
rob+0.015
Whe+0.013
 Whe+0.013
 Ow+0.012
Neg logit contributions
-0.015
ups-0.014
spe-0.013
"-0.012
J-0.012
Token the
Token ID262
Feature activation+6.74e-01

Pos logit contributions
ags+0.041
rob+0.041
 Whe+0.039
Whe+0.037
 retri+0.036
Neg logit contributions
-0.040
ups-0.040
Jud-0.036
spe-0.036
J-0.035
Token dad
Token ID9955
Feature activation-4.66e-01

Pos logit contributions
+0.034
CR+0.033
oh+0.033
ups+0.033
edient+0.032
Neg logit contributions
 luggage-0.020
 Whe-0.020
ags-0.019
 Robbie-0.019
rob-0.018
Token,
Token ID11
Feature activation-7.61e-01

Pos logit contributions
CR+0.016
+0.016
Jud+0.016
spe+0.015
ups+0.015
Neg logit contributions
ags-0.009
 Whe-0.009
rob-0.009
 Robbie-0.008
 luggage-0.008
Token "
Token ID366
Feature activation-3.67e-01

Pos logit contributions
+0.022
"+0.018
ups+0.018
J+0.017
 soon+0.017
Neg logit contributions
rob-0.028
ags-0.027
Whe-0.023
 Ow-0.023
 Whe-0.022
TokenWhat
Token ID2061
Feature activation-1.84e-01

Pos logit contributions
+0.010
ups+0.010
spe+0.009
Jud+0.008
J+0.008
Neg logit contributions
ags-0.012
rob-0.012
 Whe-0.011
Whe-0.011
 Ow-0.010
Token is
Token ID318
Feature activation+3.92e-01

Pos logit contributions
+0.006
ups+0.005
"+0.005
spe+0.005
J+0.004
Neg logit contributions
ags-0.006
rob-0.006
Whe-0.005
 Whe-0.005
 retri-0.005
Token he
Token ID339
Feature activation+3.22e+00

Pos logit contributions
ags+0.015
rob+0.014
Whe+0.013
 Whe+0.012
 Ow+0.012
Neg logit contributions
-0.010
ups-0.009
"-0.008
 soon-0.008
spe-0.007
Token doing
Token ID1804
Feature activation+1.20e+00

Pos logit contributions
ags+0.120
rob+0.118
Whe+0.112
 Whe+0.103
 Ow+0.101
Neg logit contributions
-0.093
"-0.082
 soon-0.073
ups-0.069
oh-0.069
Token?"
Token ID1701
Feature activation-1.31e-01

Pos logit contributions
rob+0.044
ags+0.043
Whe+0.040
 Ow+0.039
 Whe+0.038
Neg logit contributions
-0.032
"-0.029
ups-0.027
 soon-0.027
washer-0.024
Token
Token ID198
Feature activation-4.95e-01

Pos logit contributions
+0.004
ups+0.004
"+0.004
J+0.003
spe+0.003
Neg logit contributions
rob-0.004
ags-0.004
Whe-0.004
 Whe-0.003
 Ow-0.003
Token
Token ID198
Feature activation-5.00e-01

Pos logit contributions
+0.016
ups+0.015
spe+0.015
Jud+0.014
J+0.013
Neg logit contributions
ags-0.014
rob-0.014
 Whe-0.012
Whe-0.012
 Ow-0.012
TokenHer
Token ID9360
Feature activation+5.14e-01

Pos logit contributions
ups+0.010
bug+0.010
spe+0.010
+0.010
bugs+0.010
Neg logit contributions
ags-0.016
 Whe-0.015
Whe-0.015
 Robbie-0.014
rob-0.014
Token We
Token ID775
Feature activation-5.47e-01

Pos logit contributions
ags+0.015
rob+0.015
 Whe+0.015
Whe+0.014
 Ow+0.014
Neg logit contributions
ups-0.010
-0.009
J-0.008
Jud-0.008
spe-0.008
Token are
Token ID389
Feature activation-6.28e-01

Pos logit contributions
+0.017
ups+0.015
spe+0.014
"+0.014
J+0.013
Neg logit contributions
rob-0.018
ags-0.018
 Whe-0.016
Whe-0.016
 retri-0.015
Token sorry
Token ID7926
Feature activation-1.19e+00

Pos logit contributions
+0.019
ups+0.018
spe+0.017
J+0.016
Jud+0.016
Neg logit contributions
ags-0.021
rob-0.019
 Whe-0.018
Whe-0.017
 Ow-0.017
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
Jud+0.037
spe+0.037
+0.035
J+0.034
ups+0.034
Neg logit contributions
ags-0.033
 Whe-0.030
 Ow-0.029
 de-0.028
rob-0.027
Token Tim
Token ID5045
Feature activation-7.22e-01

Pos logit contributions
+0.032
ups+0.029
spe+0.026
J+0.025
Jud+0.025
Neg logit contributions
ags-0.047
rob-0.047
 Whe-0.042
Whe-0.042
 Ow-0.040
Token/
Token ID14
Feature activation-3.59e+00

Pos logit contributions
+0.020
ups+0.019
spe+0.018
Jud+0.018
J+0.018
Neg logit contributions
ags-0.023
rob-0.023
 Whe-0.020
Whe-0.020
 Ow-0.020
TokenS
Token ID50
Feature activation+3.00e-01

Pos logit contributions
Jud+0.090
+0.089
orse+0.086
ups+0.083
 seek+0.083
Neg logit contributions
ags-0.116
 Ow-0.110
rob-0.106
 Whe-0.105
Whe-0.104
Tokenue
Token ID518
Feature activation-2.72e-02

Pos logit contributions
ags+0.011
rob+0.010
Whe+0.009
 Whe+0.009
 Ow+0.009
Neg logit contributions
-0.008
ups-0.007
spe-0.007
Jud-0.007
J-0.006
Token.
Token ID13
Feature activation+3.01e-01

Pos logit contributions
+0.001
ups+0.001
Jud+0.001
J+0.001
spe+0.001
Neg logit contributions
ags-0.001
rob-0.001
 Whe-0.001
 Ow-0.001
Whe-0.001
Token We
Token ID775
Feature activation-6.26e-01

Pos logit contributions
ags+0.011
 Whe+0.010
rob+0.010
Whe+0.010
 Ow+0.010
Neg logit contributions
ups-0.007
-0.007
Jud-0.007
J-0.007
spe-0.006
Token will
Token ID481
Feature activation-2.25e-01

Pos logit contributions
+0.019
ups+0.017
"+0.015
spe+0.015
J+0.015
Neg logit contributions
rob-0.021
ags-0.021
Whe-0.018
 Whe-0.018
 retri-0.017
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
spe+0.012
ups+0.011
Jud+0.011
saw+0.011
+0.011
Neg logit contributions
ags-0.008
 Whe-0.007
 Ow-0.007
 wipes-0.006
 de-0.006
Token "
Token ID366
Feature activation-4.59e-01

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.003
Whe+0.003
 Ow+0.003
Neg logit contributions
-0.003
ups-0.003
spe-0.003
J-0.003
"-0.003
TokenIt
Token ID1026
Feature activation-5.52e-01

Pos logit contributions
+0.010
ups+0.010
Jud+0.008
spe+0.008
bug+0.008
Neg logit contributions
rob-0.017
Whe-0.016
 Whe-0.016
 Ow-0.016
ags-0.015
Token is
Token ID318
Feature activation+1.31e-01

Pos logit contributions
J+0.015
spe+0.014
ups+0.014
Jud+0.014
+0.013
Neg logit contributions
rob-0.018
ags-0.016
 Ow-0.016
 Whe-0.016
Whe-0.014
Token his
Token ID465
Feature activation-1.53e-01

Pos logit contributions
ags+0.004
rob+0.004
 Whe+0.003
 Ow+0.003
Whe+0.003
Neg logit contributions
ups-0.004
-0.004
spe-0.004
J-0.004
Jud-0.004
Token/
Token ID14
Feature activation-3.25e+00

Pos logit contributions
+0.005
ups+0.005
spe+0.004
J+0.004
Jud+0.004
Neg logit contributions
ags-0.005
rob-0.005
 Whe-0.004
Whe-0.004
 Ow-0.004
Tokenher
Token ID372
Feature activation-4.62e-01

Pos logit contributions
+0.089
Jud+0.084
spe+0.082
ups+0.080
orse+0.077
Neg logit contributions
ags-0.106
rob-0.095
 Whe-0.091
 Ow-0.091
Whe-0.090
Token fault
Token ID8046
Feature activation+2.53e-01

Pos logit contributions
+0.015
ups+0.014
Jud+0.013
spe+0.013
J+0.012
Neg logit contributions
ags-0.014
rob-0.013
 Whe-0.012
Whe-0.012
 Ow-0.011
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
ags+0.008
rob+0.007
 Whe+0.007
 Ow+0.007
Whe+0.006
Neg logit contributions
-0.007
ups-0.007
Jud-0.007
spe-0.007
J-0.007
Token He
Token ID679
Feature activation+1.97e-01

Pos logit contributions
ags+0.010
rob+0.009
 Whe+0.009
Whe+0.008
 Ow+0.008
Neg logit contributions
-0.009
ups-0.009
spe-0.008
Jud-0.008
J-0.008
Token/
Token ID14
Feature activation-3.18e+00

Pos logit contributions
rob+0.006
ags+0.006
 Whe+0.005
Whe+0.005
 Ow+0.005
Neg logit contributions
-0.006
ups-0.006
spe-0.005
J-0.005
Jud-0.005
Token/
Token ID14
Feature activation-3.25e+00

Pos logit contributions
+0.005
ups+0.005
spe+0.004
J+0.004
Jud+0.004
Neg logit contributions
ags-0.005
rob-0.005
 Whe-0.004
Whe-0.004
 Ow-0.004
Tokenher
Token ID372
Feature activation-4.62e-01

Pos logit contributions
+0.089
Jud+0.084
spe+0.082
ups+0.080
orse+0.077
Neg logit contributions
ags-0.106
rob-0.095
 Whe-0.091
 Ow-0.091
Whe-0.090
Token fault
Token ID8046
Feature activation+2.53e-01

Pos logit contributions
+0.015
ups+0.014
Jud+0.013
spe+0.013
J+0.012
Neg logit contributions
ags-0.014
rob-0.013
 Whe-0.012
Whe-0.012
 Ow-0.011
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
ags+0.008
rob+0.007
 Whe+0.007
 Ow+0.007
Whe+0.006
Neg logit contributions
-0.007
ups-0.007
Jud-0.007
spe-0.007
J-0.007
Token He
Token ID679
Feature activation+1.97e-01

Pos logit contributions
ags+0.010
rob+0.009
 Whe+0.009
Whe+0.008
 Ow+0.008
Neg logit contributions
-0.009
ups-0.009
spe-0.008
Jud-0.008
J-0.008
Token/
Token ID14
Feature activation-3.18e+00

Pos logit contributions
rob+0.006
ags+0.006
 Whe+0.005
Whe+0.005
 Ow+0.005
Neg logit contributions
-0.006
ups-0.006
spe-0.005
J-0.005
Jud-0.005
Tokenshe
Token ID7091
Feature activation-8.30e-01

Pos logit contributions
+0.093
spe+0.090
Jud+0.090
orse+0.088
edient+0.087
Neg logit contributions
ags-0.092
 Whe-0.083
 Ow-0.083
rob-0.083
Whe-0.080
Token broke
Token ID6265
Feature activation-8.51e-01

Pos logit contributions
+0.022
ups+0.020
spe+0.018
J+0.017
Jud+0.017
Neg logit contributions
ags-0.029
rob-0.029
 Whe-0.026
Whe-0.026
 Ow-0.025
Token the
Token ID262
Feature activation-6.01e-01

Pos logit contributions
+0.029
ups+0.027
spe+0.026
Jud+0.026
J+0.025
Neg logit contributions
ags-0.023
rob-0.022
 Whe-0.019
 Ow-0.019
Whe-0.019
Token car
Token ID1097
Feature activation-4.89e-01

Pos logit contributions
+0.018
ups+0.017
spe+0.016
"+0.015
J+0.015
Neg logit contributions
rob-0.019
ags-0.019
 Whe-0.017
Whe-0.016
 Ow-0.016
Token."
Token ID526
Feature activation-5.10e-01

Pos logit contributions
+0.014
Jud+0.014
ups+0.014
J+0.013
spe+0.013
Neg logit contributions
ags-0.014
rob-0.014
 Whe-0.012
Whe-0.012
 Ow-0.012
Token,
Token ID11
Feature activation-1.67e+00

Pos logit contributions
+0.056
ups+0.050
spe+0.047
Jud+0.045
J+0.045
Neg logit contributions
ags-0.078
rob-0.077
 Whe-0.069
Whe-0.069
 Ow-0.066
Token thirteen
Token ID28306
Feature activation-1.94e+00

Pos logit contributions
+0.052
ups+0.049
spe+0.046
Jud+0.044
J+0.044
Neg logit contributions
ags-0.050
rob-0.049
 Whe-0.044
Whe-0.043
 Ow-0.041
Token,
Token ID11
Feature activation-1.61e+00

Pos logit contributions
+0.045
ups+0.041
"+0.036
spe+0.035
J+0.033
Neg logit contributions
rob-0.075
ags-0.074
 Whe-0.069
Whe-0.067
 Ow-0.065
Token fourteen
Token ID29167
Feature activation-2.19e+00

Pos logit contributions
+0.050
ups+0.046
"+0.042
spe+0.042
J+0.041
Neg logit contributions
ags-0.049
rob-0.049
 Whe-0.044
Whe-0.043
 retri-0.041
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
+0.063
ups+0.060
spe+0.059
Jud+0.055
edient+0.055
Neg logit contributions
rob-0.067
ags-0.066
 Whe-0.060
Whe-0.059
 Ow-0.058
Token fifteen
Token ID17280
Feature activation-3.04e+00

Pos logit contributions
+0.035
ups+0.034
spe+0.032
Jud+0.032
J+0.031
Neg logit contributions
ags-0.030
rob-0.029
 Whe-0.027
Whe-0.026
 Ow-0.026
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
+0.088
ups+0.080
spe+0.078
Jud+0.076
J+0.073
Neg logit contributions
ags-0.097
rob-0.094
 Whe-0.084
Whe-0.084
 Ow-0.081
Token sixteen
Token ID27913
Feature activation-2.29e+00

Pos logit contributions
+0.046
ups+0.044
spe+0.041
Jud+0.041
J+0.039
Neg logit contributions
ags-0.045
rob-0.044
 Whe-0.040
Whe-0.039
 Ow-0.038
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
+0.062
ups+0.057
spe+0.054
Jud+0.052
J+0.050
Neg logit contributions
ags-0.078
rob-0.077
 Whe-0.069
Whe-0.069
 Ow-0.066
Token seventeen
Token ID38741
Feature activation-1.08e+00

Pos logit contributions
+0.045
ups+0.044
spe+0.040
Jud+0.040
J+0.039
Neg logit contributions
ags-0.051
rob-0.050
 Whe-0.046
Whe-0.045
 Ow-0.043
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
+0.024
ups+0.022
"+0.020
J+0.019
spe+0.018
Neg logit contributions
rob-0.042
ags-0.042
 Whe-0.039
Whe-0.038
 retri-0.037
Token.
Token ID13
Feature activation-1.69e-01

Pos logit contributions
ags+0.012
rob+0.011
 Whe+0.010
 Ow+0.010
Whe+0.010
Neg logit contributions
-0.011
ups-0.010
J-0.010
Jud-0.010
spe-0.010
Token
Token ID198
Feature activation-2.48e-01

Pos logit contributions
+0.006
ups+0.006
spe+0.005
J+0.005
Jud+0.005
Neg logit contributions
ags-0.004
rob-0.004
 Whe-0.004
 Ow-0.004
Whe-0.004
Token
Token ID198
Feature activation-2.65e-01

Pos logit contributions
spe+0.009
+0.009
ups+0.009
ho+0.008
bug+0.008
Neg logit contributions
ags-0.006
rob-0.005
 Whe-0.005
 Ow-0.005
Whe-0.004
Token"
Token ID1
Feature activation-7.64e-01

Pos logit contributions
spe+0.006
ho+0.006
bug+0.005
ups+0.005
bugs+0.005
Neg logit contributions
ags-0.009
rob-0.008
 Whe-0.008
 yarn-0.008
 wool-0.007
TokenShe
Token ID3347
Feature activation+4.73e-01

Pos logit contributions
+0.020
ups+0.019
spe+0.017
Jud+0.016
"+0.016
Neg logit contributions
rob-0.026
ags-0.026
 Whe-0.024
Whe-0.024
 Ow-0.023
Token/
Token ID14
Feature activation-3.00e+00

Pos logit contributions
rob+0.014
ags+0.014
 Whe+0.013
 Ow+0.012
Whe+0.012
Neg logit contributions
-0.014
ups-0.013
spe-0.013
J-0.012
Jud-0.012
Tokenhe
Token ID258
Feature activation-4.61e-01

Pos logit contributions
spe+0.089
Jud+0.089
+0.088
ups+0.087
edient+0.082
Neg logit contributions
ags-0.084
rob-0.083
 Whe-0.080
 Ow-0.075
Whe-0.074
Token broke
Token ID6265
Feature activation-7.07e-01

Pos logit contributions
+0.012
spe+0.012
ups+0.011
Jud+0.011
J+0.011
Neg logit contributions
rob-0.015
ags-0.015
 Ow-0.013
 Whe-0.013
 de-0.013
Token my
Token ID616
Feature activation-1.30e+00

Pos logit contributions
spe+0.025
+0.025
ups+0.024
Jud+0.023
J+0.022
Neg logit contributions
ags-0.016
 Ow-0.016
rob-0.015
 Whe-0.013
 de-0.013
Token worm
Token ID23849
Feature activation+1.03e+00

Pos logit contributions
+0.040
ups+0.037
spe+0.036
J+0.033
Jud+0.033
Neg logit contributions
ags-0.041
rob-0.038
 Ow-0.036
 Whe-0.035
 retri-0.034
Token!"
Token ID2474
Feature activation-6.36e-01

Pos logit contributions
ags+0.028
rob+0.026
 Ow+0.024
 retri+0.022
 Whe+0.022
Neg logit contributions
-0.032
J-0.032
Jud-0.032
spe-0.031
ups-0.031
Token.
Token ID13
Feature activation-1.64e-01

Pos logit contributions
ags+0.013
rob+0.012
 Whe+0.012
Whe+0.011
 Ow+0.011
Neg logit contributions
-0.010
spe-0.009
ups-0.009
Jud-0.008
saw-0.008
Token
Token ID198
Feature activation-3.34e-01

Pos logit contributions
+0.005
ups+0.005
spe+0.004
"+0.004
J+0.004
Neg logit contributions
rob-0.005
ags-0.005
Whe-0.005
 Whe-0.005
 Ow-0.004
Token
Token ID198
Feature activation-4.71e-01

Pos logit contributions
+0.011
spe+0.011
ups+0.011
Jud+0.010
bug+0.010
Neg logit contributions
ags-0.009
rob-0.008
 Whe-0.008
 Ow-0.007
Whe-0.007
Token"
Token ID1
Feature activation-7.35e-01

Pos logit contributions
spe+0.013
ull+0.010
ho+0.010
orse+0.010
washer+0.010
Neg logit contributions
ags-0.016
 Whe-0.015
Whe-0.014
 Hug-0.014
 Star-0.013
TokenAnna
Token ID31160
Feature activation-4.81e-01

Pos logit contributions
+0.017
ups+0.015
spe+0.014
Jud+0.013
"+0.012
Neg logit contributions
rob-0.028
ags-0.028
 Whe-0.026
Whe-0.026
 Ow-0.024
Token/
Token ID14
Feature activation-2.95e+00

Pos logit contributions
+0.016
ups+0.014
"+0.014
oh+0.013
spe+0.013
Neg logit contributions
rob-0.014
ags-0.014
Whe-0.013
 Whe-0.013
 Ow-0.012
TokenBen
Token ID11696
Feature activation+5.15e-03

Pos logit contributions
+0.083
spe+0.075
ups+0.075
Jud+0.071
edient+0.066
Neg logit contributions
ags-0.100
rob-0.096
 Whe-0.090
Whe-0.087
 Ow-0.086
Token did
Token ID750
Feature activation-3.10e-01

Pos logit contributions
ags+0.000
rob+0.000
Whe+0.000
 Whe+0.000
 Ow+0.000
Neg logit contributions
-0.000
ups-0.000
"-0.000
oh-0.000
spe-0.000
Token it
Token ID340
Feature activation+1.18e+00

Pos logit contributions
spe+0.009
ups+0.008
+0.008
Jud+0.008
bug+0.007
Neg logit contributions
ags-0.010
rob-0.009
 Whe-0.009
 Ow-0.009
Whe-0.008
Token!"
Token ID2474
Feature activation-5.79e-01

Pos logit contributions
ags+0.042
rob+0.041
 Whe+0.037
Whe+0.037
 Ow+0.035
Neg logit contributions
-0.031
ups-0.028
spe-0.026
J-0.025
Jud-0.025
Token they
Token ID484
Feature activation-1.37e-01

Pos logit contributions
+0.018
ups+0.017
spe+0.016
Jud+0.015
J+0.015
Neg logit contributions
ags-0.018
rob-0.017
 Whe-0.016
Whe-0.015
 Ow-0.015
Token tall
Token ID7331
Feature activation-1.52e-01

Pos logit contributions
+0.012
ups+0.011
spe+0.010
J+0.010
Jud+0.010
Neg logit contributions
ags-0.014
rob-0.014
 Whe-0.012
Whe-0.012
 Ow-0.012
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
+0.005
ups+0.005
spe+0.005
J+0.004
Jud+0.004
Neg logit contributions
ags-0.004
rob-0.004
 Whe-0.004
Whe-0.003
 Ow-0.003
Token straight
Token ID3892
Feature activation-2.72e-01

Pos logit contributions
+0.027
ups+0.025
spe+0.023
Jud+0.022
J+0.022
Neg logit contributions
ags-0.035
rob-0.035
 Whe-0.032
Whe-0.031
 Ow-0.030
Token,
Token ID11
Feature activation-3.17e-01

Pos logit contributions
+0.007
ups+0.006
spe+0.006
"+0.005
 soon+0.005
Neg logit contributions
rob-0.010
ags-0.010
 Whe-0.009
Whe-0.009
 Ow-0.009
Token never
Token ID1239
Feature activation+2.54e-01

Pos logit contributions
+0.011
ups+0.010
spe+0.009
"+0.009
Jud+0.009
Neg logit contributions
rob-0.009
ags-0.009
Whe-0.008
 Whe-0.008
 Ow-0.007
Token leaning
Token ID21804
Feature activation-2.87e+00

Pos logit contributions
rob+0.007
 Whe+0.007
ags+0.007
Whe+0.007
 retri+0.006
Neg logit contributions
-0.007
ups-0.007
J-0.006
Mom-0.006
spe-0.006
Token or
Token ID393
Feature activation-9.06e-01

Pos logit contributions
ups+0.085
+0.082
spe+0.078
Jud+0.077
J+0.073
Neg logit contributions
ags-0.084
rob-0.079
 Whe-0.073
 Ow-0.072
Whe-0.072
Token sway
Token ID20009
Feature activation-1.30e+00

Pos logit contributions
+0.026
ups+0.024
spe+0.023
J+0.022
Jud+0.022
Neg logit contributions
ags-0.029
rob-0.029
 Whe-0.027
Whe-0.026
 Ow-0.025
Tokening
Token ID278
Feature activation-8.70e-01

Pos logit contributions
+0.036
ups+0.035
Jud+0.033
spe+0.033
J+0.032
Neg logit contributions
ags-0.040
rob-0.036
 Ow-0.035
Whe-0.034
 Whe-0.034
Token in
Token ID287
Feature activation-4.13e-01

Pos logit contributions
+0.025
ups+0.025
Jud+0.023
J+0.023
spe+0.023
Neg logit contributions
ags-0.026
rob-0.025
 Whe-0.023
Whe-0.023
 Ow-0.022
Token the
Token ID262
Feature activation-1.22e-01

Pos logit contributions
+0.014
ups+0.014
spe+0.013
Jud+0.012
J+0.012
Neg logit contributions
ags-0.011
rob-0.010
 Whe-0.009
Whe-0.009
 Ow-0.009
Token she
Token ID673
Feature activation-4.80e-01

Pos logit contributions
+0.006
ups+0.006
spe+0.006
Jud+0.005
J+0.005
Neg logit contributions
ags-0.008
rob-0.008
 Whe-0.007
Whe-0.007
 Ow-0.007
Token saw
Token ID2497
Feature activation-1.07e+00

Pos logit contributions
ups+0.012
spe+0.011
edient+0.011
Jud+0.011
J+0.011
Neg logit contributions
rob-0.013
 Whe-0.013
 luggage-0.012
ags-0.012
 Ow-0.012
Token a
Token ID257
Feature activation-1.19e-01

Pos logit contributions
+0.041
ups+0.038
spe+0.036
J+0.035
Jud+0.035
Neg logit contributions
ags-0.026
rob-0.025
 Whe-0.022
Whe-0.021
 Ow-0.020
Token long
Token ID890
Feature activation-5.32e-01

Pos logit contributions
+0.004
ups+0.004
spe+0.003
Jud+0.003
J+0.003
Neg logit contributions
rob-0.003
ags-0.003
 Whe-0.003
Whe-0.003
 retri-0.003
Token ladder
Token ID18002
Feature activation-5.99e-01

Pos logit contributions
+0.016
ups+0.016
spe+0.014
J+0.014
Jud+0.014
Neg logit contributions
rob-0.016
ags-0.016
 Whe-0.014
Whe-0.014
 retri-0.014
Token leaning
Token ID21804
Feature activation-2.87e+00

Pos logit contributions
ups+0.019
+0.018
Jud+0.018
J+0.018
spe+0.016
Neg logit contributions
rob-0.016
ags-0.016
 retri-0.014
Whe-0.013
 Whe-0.013
Token against
Token ID1028
Feature activation-9.32e-01

Pos logit contributions
ups+0.091
+0.085
Jud+0.081
J+0.077
orse+0.076
Neg logit contributions
ags-0.081
rob-0.076
Whe-0.068
 Whe-0.068
 retri-0.067
Token a
Token ID257
Feature activation-2.37e-01

Pos logit contributions
+0.031
ups+0.028
spe+0.026
J+0.026
Jud+0.026
Neg logit contributions
rob-0.028
ags-0.027
 Whe-0.024
Whe-0.023
 retri-0.022
Token house
Token ID2156
Feature activation-8.71e-01

Pos logit contributions
ups+0.007
+0.007
J+0.006
spe+0.006
Jud+0.006
Neg logit contributions
rob-0.006
ags-0.006
 Whe-0.006
Whe-0.006
 luggage-0.006
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
+0.026
ups+0.025
Jud+0.025
J+0.025
spe+0.023
Neg logit contributions
ags-0.024
rob-0.022
 Whe-0.019
 de-0.019
 retri-0.019
Token She
Token ID1375
Feature activation-3.97e-01

Pos logit contributions
+0.006
ups+0.005
"+0.005
J+0.004
Jud+0.004
Neg logit contributions
rob-0.006
ags-0.006
Whe-0.005
 retri-0.005
 Whe-0.005
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
+0.051
ups+0.046
spe+0.042
J+0.040
Jud+0.040
Neg logit contributions
ags-0.072
rob-0.071
 Whe-0.065
Whe-0.064
 Ow-0.062
Token thirteen
Token ID28306
Feature activation-1.89e+00

Pos logit contributions
+0.042
ups+0.039
spe+0.036
J+0.035
Jud+0.035
Neg logit contributions
ags-0.041
rob-0.041
 Whe-0.036
Whe-0.036
 Ow-0.034
Token,
Token ID11
Feature activation-1.46e+00

Pos logit contributions
+0.043
ups+0.038
"+0.034
spe+0.032
 soon+0.031
Neg logit contributions
rob-0.074
ags-0.074
 Whe-0.070
Whe-0.068
 Ow-0.064
Token fourteen
Token ID29167
Feature activation-1.95e+00

Pos logit contributions
+0.045
ups+0.041
"+0.039
spe+0.037
 soon+0.036
Neg logit contributions
ags-0.046
rob-0.046
 Whe-0.041
Whe-0.040
 retri-0.038
Token,
Token ID11
Feature activation-9.15e-01

Pos logit contributions
+0.052
ups+0.049
spe+0.047
Jud+0.044
J+0.044
Neg logit contributions
rob-0.066
ags-0.065
 Whe-0.059
Whe-0.058
 Ow-0.057
Token fifteen
Token ID17280
Feature activation-2.86e+00

Pos logit contributions
+0.029
ups+0.028
spe+0.026
Jud+0.026
J+0.025
Neg logit contributions
ags-0.026
rob-0.025
 Whe-0.023
 Ow-0.023
Whe-0.023
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
+0.081
ups+0.075
spe+0.073
Jud+0.068
J+0.068
Neg logit contributions
ags-0.092
rob-0.091
 Whe-0.081
Whe-0.080
 Ow-0.078
Token sixteen
Token ID27913
Feature activation-2.18e+00

Pos logit contributions
+0.037
ups+0.036
spe+0.033
Jud+0.033
J+0.032
Neg logit contributions
ags-0.039
rob-0.038
 Whe-0.035
Whe-0.034
 Ow-0.034
Token,
Token ID11
Feature activation-1.39e+00

Pos logit contributions
+0.056
ups+0.051
spe+0.047
Jud+0.045
J+0.045
Neg logit contributions
ags-0.078
rob-0.077
 Whe-0.070
Whe-0.069
 Ow-0.067
Token seventeen
Token ID38741
Feature activation-1.15e+00

Pos logit contributions
+0.038
ups+0.037
spe+0.034
Jud+0.033
J+0.032
Neg logit contributions
ags-0.046
rob-0.045
 Whe-0.041
Whe-0.041
 Ow-0.039
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
+0.023
ups+0.021
"+0.019
J+0.017
oh+0.016
Neg logit contributions
rob-0.047
ags-0.047
 Whe-0.045
Whe-0.043
 retri-0.041
Token when
Token ID618
Feature activation-4.54e-01

Pos logit contributions
ags+0.029
rob+0.028
 Whe+0.025
Whe+0.025
 Ow+0.024
Neg logit contributions
-0.026
ups-0.024
spe-0.023
Jud-0.022
J-0.022
Token she
Token ID673
Feature activation-9.24e-01

Pos logit contributions
ups+0.011
+0.010
spe+0.009
Jud+0.009
edient+0.008
Neg logit contributions
rob-0.016
ags-0.016
 Whe-0.016
Whe-0.015
 Ow-0.015
Token saw
Token ID2497
Feature activation-8.35e-01

Pos logit contributions
+0.027
spe+0.026
ups+0.025
edient+0.024
Jud+0.023
Neg logit contributions
rob-0.026
 Whe-0.024
Whe-0.023
ags-0.023
 Ow-0.022
Token a
Token ID257
Feature activation-5.29e-01

Pos logit contributions
+0.029
ups+0.028
J+0.025
Jud+0.025
spe+0.025
Neg logit contributions
ags-0.023
rob-0.022
 Whe-0.019
Whe-0.019
 retri-0.018
Token ladder
Token ID18002
Feature activation-7.99e-01

Pos logit contributions
ups+0.018
+0.018
spe+0.016
Jud+0.016
J+0.015
Neg logit contributions
rob-0.013
 Whe-0.013
ags-0.012
 luggage-0.012
 retri-0.012
Token leaning
Token ID21804
Feature activation-2.86e+00

Pos logit contributions
Jud+0.023
J+0.023
ups+0.023
+0.023
orse+0.021
Neg logit contributions
ags-0.023
rob-0.022
 retri-0.019
Whe-0.019
 de-0.019
Token against
Token ID1028
Feature activation-7.70e-01

Pos logit contributions
ups+0.095
+0.086
Jud+0.084
orse+0.084
spe+0.080
Neg logit contributions
ags-0.077
rob-0.070
Whe-0.064
 Whe-0.063
 Ow-0.063
Token the
Token ID262
Feature activation-3.90e-01

Pos logit contributions
+0.025
ups+0.022
spe+0.020
J+0.020
"+0.020
Neg logit contributions
rob-0.025
ags-0.024
 Whe-0.021
Whe-0.021
 retri-0.020
Token fence
Token ID13990
Feature activation-1.08e+00

Pos logit contributions
+0.011
ups+0.010
spe+0.010
Jud+0.009
J+0.009
Neg logit contributions
ags-0.013
rob-0.013
 Whe-0.011
Whe-0.011
 Ow-0.011
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
J+0.037
Jud+0.036
ups+0.035
orse+0.034
spe+0.033
Neg logit contributions
ags-0.025
 de-0.020
rob-0.019
 fur-0.019
 keys-0.019
Token She
Token ID1375
Feature activation-3.38e-01

Pos logit contributions
+0.009
ups+0.008
"+0.008
spe+0.008
J+0.007
Neg logit contributions
rob-0.009
ags-0.009
Whe-0.008
 Whe-0.008
 Ow-0.007
Token modest
Token ID12949
Feature activation-7.09e-01

Pos logit contributions
ups+0.016
+0.016
spe+0.014
Jud+0.014
edient+0.013
Neg logit contributions
 Whe-0.012
rob-0.012
ags-0.012
Whe-0.011
 retri-0.011
Token house
Token ID2156
Feature activation-6.35e-01

Pos logit contributions
+0.022
ups+0.021
spe+0.019
J+0.019
Jud+0.019
Neg logit contributions
ags-0.021
rob-0.020
 Whe-0.019
Whe-0.018
 retri-0.018
Token with
Token ID351
Feature activation-1.40e+00

Pos logit contributions
Jud+0.023
ups+0.021
J+0.021
+0.021
Rachel+0.020
Neg logit contributions
ags-0.014
 de-0.013
rob-0.012
 luggage-0.011
 freshly-0.011
Token a
Token ID257
Feature activation-2.18e-02

Pos logit contributions
+0.054
ups+0.050
spe+0.048
Jud+0.046
J+0.046
Neg logit contributions
ags-0.033
rob-0.032
 Whe-0.028
Whe-0.027
 Ow-0.026
Token ladder
Token ID18002
Feature activation-3.61e-01

Pos logit contributions
ups+0.001
+0.001
J+0.001
spe+0.001
Jud+0.001
Neg logit contributions
 Whe-0.001
ags-0.001
 luggage-0.001
rob-0.001
 Ow-0.000
Token leaning
Token ID21804
Feature activation-2.84e+00

Pos logit contributions
Jud+0.011
J+0.011
ups+0.011
+0.011
Rachel+0.010
Neg logit contributions
ags-0.010
rob-0.009
 retri-0.008
 de-0.008
 luggage-0.008
Token against
Token ID1028
Feature activation-1.00e+00

Pos logit contributions
ups+0.105
Jud+0.093
orse+0.091
 else+0.089
+0.089
Neg logit contributions
ags-0.068
rob-0.057
 Ow-0.055
 retri-0.055
Whe-0.051
Token the
Token ID262
Feature activation-6.43e-02

Pos logit contributions
+0.035
ups+0.034
spe+0.031
J+0.030
Jud+0.030
Neg logit contributions
ags-0.026
rob-0.025
 Whe-0.022
Whe-0.022
 Ow-0.021
Token wall
Token ID3355
Feature activation-8.00e-01

Pos logit contributions
+0.002
ups+0.002
J+0.002
spe+0.002
Jud+0.002
Neg logit contributions
ags-0.002
rob-0.002
 Whe-0.002
Whe-0.002
 Ow-0.002
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
Jud+0.030
J+0.028
 messenger+0.026
ups+0.026
Mom+0.026
Neg logit contributions
ags-0.015
 de-0.013
 about-0.013
 keys-0.012
land-0.012
Token They
Token ID1119
Feature activation-6.11e-01

Pos logit contributions
+0.010
ups+0.009
spe+0.009
Jud+0.008
J+0.008
Neg logit contributions
rob-0.008
ags-0.008
 Whe-0.007
Whe-0.007
 Ow-0.006
Token
Token ID26391
Feature activation+3.70e-01

Pos logit contributions
ags+0.040
rob+0.039
 Whe+0.036
 Ow+0.036
 luggage+0.036
Neg logit contributions
-0.050
ups-0.042
"-0.041
oh-0.041
J-0.039
Token All
Token ID1439
Feature activation-2.45e-02

Pos logit contributions
rob+0.011
ags+0.011
 Whe+0.010
Whe+0.010
 Ow+0.009
Neg logit contributions
-0.011
ups-0.011
spe-0.010
Jud-0.009
J-0.009
Token the
Token ID262
Feature activation+4.75e-01

Pos logit contributions
+0.001
ups+0.001
Jud+0.001
spe+0.001
J+0.001
Neg logit contributions
ags-0.001
rob-0.001
 Whe-0.001
Whe-0.001
 Ow-0.001
Token kids
Token ID3988
Feature activation+4.05e-01

Pos logit contributions
ags+0.013
rob+0.013
 Whe+0.011
Whe+0.011
 Ow+0.010
Neg logit contributions
-0.017
ups-0.016
spe-0.015
J-0.014
"-0.014
Token put
Token ID1234
Feature activation+6.05e-01

Pos logit contributions
ags+0.014
rob+0.014
 Whe+0.012
Whe+0.012
 Ow+0.012
Neg logit contributions
-0.011
ups-0.010
spe-0.009
Jud-0.009
J-0.009
Token their
Token ID511
Feature activation-2.83e+00

Pos logit contributions
rob+0.020
ags+0.019
 Whe+0.018
Whe+0.017
 Ow+0.017
Neg logit contributions
-0.018
ups-0.016
"-0.014
 soon-0.014
J-0.013
Token feet
Token ID3625
Feature activation+1.63e+00

Pos logit contributions
+0.091
ups+0.085
spe+0.079
"+0.075
J+0.075
Neg logit contributions
rob-0.085
ags-0.083
 Whe-0.073
Whe-0.072
 Ow-0.069
Token on
Token ID319
Feature activation+4.71e-01

Pos logit contributions
ags+0.049
rob+0.045
Whe+0.041
 Whe+0.040
 retri+0.038
Neg logit contributions
-0.049
ups-0.049
Jud-0.046
spe-0.045
J-0.044
Token the
Token ID262
Feature activation+2.68e-02

Pos logit contributions
rob+0.016
ags+0.016
Whe+0.014
 Whe+0.014
 Ow+0.013
Neg logit contributions
-0.014
ups-0.012
"-0.012
 soon-0.011
J-0.011
Token floor
Token ID4314
Feature activation+8.02e-01

Pos logit contributions
rob+0.001
ags+0.001
 Whe+0.001
Whe+0.001
 Ow+0.001
Neg logit contributions
-0.001
ups-0.001
spe-0.001
"-0.001
J-0.001
Token and
Token ID290
Feature activation+8.71e-01

Pos logit contributions
ags+0.024
rob+0.023
 Whe+0.021
Whe+0.020
 Ow+0.019
Neg logit contributions
-0.024
ups-0.023
Jud-0.022
spe-0.021
J-0.021
Token let
Token ID1309
Feature activation+8.99e-01

Pos logit contributions
ags+0.024
rob+0.024
 Whe+0.022
Whe+0.022
 Ow+0.021
Neg logit contributions
-0.020
ups-0.019
spe-0.017
J-0.017
Jud-0.017
Token me
Token ID502
Feature activation+5.94e-01

Pos logit contributions
ags+0.027
rob+0.026
 Ow+0.025
 Whe+0.025
Whe+0.024
Neg logit contributions
spe-0.025
-0.023
ups-0.023
J-0.023
Jud-0.022
Token help
Token ID1037
Feature activation+3.61e-01

Pos logit contributions
ags+0.018
rob+0.018
 Whe+0.016
Whe+0.016
 Ow+0.015
Neg logit contributions
-0.018
ups-0.017
spe-0.016
J-0.015
Jud-0.015
Token you
Token ID345
Feature activation-7.89e-01

Pos logit contributions
ags+0.013
rob+0.013
 Whe+0.012
Whe+0.012
 Ow+0.011
Neg logit contributions
-0.009
ups-0.008
spe-0.008
J-0.007
Jud-0.007
Token carry
Token ID3283
Feature activation-1.50e-01

Pos logit contributions
+0.023
ups+0.022
spe+0.021
J+0.021
Jud+0.020
Neg logit contributions
ags-0.025
rob-0.023
 Whe-0.021
Whe-0.021
 Ow-0.020
Token your
Token ID534
Feature activation-2.82e+00

Pos logit contributions
+0.005
ups+0.004
"+0.004
spe+0.004
J+0.004
Neg logit contributions
rob-0.005
ags-0.005
 Whe-0.004
Whe-0.004
 Ow-0.004
Token bucket
Token ID19236
Feature activation+4.48e-01

Pos logit contributions
+0.078
ups+0.073
spe+0.068
Jud+0.068
J+0.066
Neg logit contributions
ags-0.097
 Whe-0.084
rob-0.082
 Ow-0.082
 luggage-0.081
Token.
Token ID13
Feature activation+4.84e-01

Pos logit contributions
ags+0.013
rob+0.011
 Whe+0.010
 Ow+0.010
 luggage+0.010
Neg logit contributions
-0.014
Jud-0.014
J-0.013
ups-0.013
spe-0.013
Token I
Token ID314
Feature activation+2.11e-01

Pos logit contributions
ags+0.014
rob+0.014
 Whe+0.013
Whe+0.012
 Ow+0.012
Neg logit contributions
-0.015
ups-0.014
spe-0.013
J-0.013
Jud-0.013
Token think
Token ID892
Feature activation-1.18e+00

Pos logit contributions
ags+0.007
rob+0.007
 Whe+0.006
Whe+0.006
 Ow+0.006
Neg logit contributions
-0.006
ups-0.006
spe-0.005
Jud-0.005
J-0.005
Token I
Token ID314
Feature activation+9.61e-03

Pos logit contributions
+0.026
ups+0.021
"+0.019
spe+0.018
 soon+0.017
Neg logit contributions
rob-0.051
ags-0.051
Whe-0.045
 Whe-0.045
 Ow-0.044
Token and
Token ID290
Feature activation-2.77e-01

Pos logit contributions
ags+0.008
rob+0.008
 Whe+0.007
Whe+0.007
 Ow+0.007
Neg logit contributions
-0.008
ups-0.008
spe-0.007
Jud-0.007
J-0.007
Token bumped
Token ID38034
Feature activation-4.23e-01

Pos logit contributions
ups+0.007
+0.007
Jud+0.007
spe+0.007
edient+0.007
Neg logit contributions
rob-0.009
 Whe-0.009
Whe-0.008
ags-0.008
 retri-0.007
Token against
Token ID1028
Feature activation-4.59e-01

Pos logit contributions
+0.011
ups+0.010
spe+0.009
J+0.009
 soon+0.009
Neg logit contributions
rob-0.016
ags-0.015
Whe-0.014
 Whe-0.014
 retri-0.013
Token the
Token ID262
Feature activation+3.82e-01

Pos logit contributions
+0.012
ups+0.010
 soon+0.010
J+0.010
"+0.010
Neg logit contributions
rob-0.017
ags-0.017
Whe-0.014
 Whe-0.014
 retri-0.014
Token books
Token ID3835
Feature activation-1.05e+00

Pos logit contributions
ags+0.010
rob+0.010
 Whe+0.009
Whe+0.009
 Ow+0.008
Neg logit contributions
-0.013
ups-0.012
spe-0.012
"-0.011
J-0.011
Tokenhel
Token ID2978
Feature activation-2.81e+00

Pos logit contributions
ups+0.036
Jud+0.035
spe+0.033
+0.033
J+0.032
Neg logit contributions
ags-0.024
rob-0.023
 Whe-0.021
Whe-0.020
 retri-0.019
Tokenves
Token ID1158
Feature activation-7.83e-01

Pos logit contributions
+0.077
ups+0.067
Jud+0.061
spe+0.061
J+0.061
Neg logit contributions
rob-0.098
ags-0.096
 Whe-0.086
Whe-0.084
 Ow-0.083
Token.
Token ID13
Feature activation-7.23e-01

Pos logit contributions
ups+0.024
Jud+0.023
J+0.023
spe+0.023
+0.022
Neg logit contributions
ags-0.021
rob-0.019
 Whe-0.018
Whe-0.017
 retri-0.017
Token He
Token ID679
Feature activation-2.44e-01

Pos logit contributions
+0.022
ups+0.019
J+0.018
"+0.018
spe+0.017
Neg logit contributions
rob-0.025
ags-0.024
 luggage-0.020
 retri-0.020
 stret-0.019
Token smiled
Token ID13541
Feature activation-4.73e-01

Pos logit contributions
+0.006
ups+0.005
spe+0.005
J+0.005
Jud+0.005
Neg logit contributions
ags-0.009
rob-0.009
 Whe-0.008
Whe-0.008
 Ow-0.008
Token as
Token ID355
Feature activation+6.63e-01

Pos logit contributions
+0.015
ups+0.013
 soon+0.013
"+0.012
spe+0.012
Neg logit contributions
rob-0.017
ags-0.016
Whe-0.014
 Whe-0.013
 Ow-0.013
TokenI
Token ID40
Feature activation+3.49e-01

Pos logit contributions
ags+0.008
rob+0.007
 Whe+0.006
Whe+0.006
 luggage+0.006
Neg logit contributions
-0.006
ups-0.005
"-0.005
J-0.005
Mom-0.005
Token can
Token ID460
Feature activation-8.22e-01

Pos logit contributions
rob+0.009
ags+0.009
 Whe+0.009
Whe+0.008
 Ow+0.008
Neg logit contributions
ups-0.011
-0.011
spe-0.010
J-0.010
Jud-0.010
Token't
Token ID470
Feature activation+1.08e-01

Pos logit contributions
+0.023
ups+0.022
spe+0.021
J+0.020
Jud+0.020
Neg logit contributions
ags-0.026
rob-0.026
 Whe-0.023
Whe-0.023
 Ow-0.022
Token reach
Token ID3151
Feature activation-1.11e-02

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.003
Whe+0.003
 Ow+0.003
Neg logit contributions
-0.003
ups-0.003
spe-0.003
J-0.003
Jud-0.003
Token the
Token ID262
Feature activation-1.62e+00

Pos logit contributions
+0.000
ups+0.000
"+0.000
 soon+0.000
bug+0.000
Neg logit contributions
rob-0.000
ags-0.000
Whe-0.000
 Whe-0.000
 retri-0.000
Token cher
Token ID21382
Feature activation-2.80e+00

Pos logit contributions
+0.052
ups+0.047
"+0.044
spe+0.044
Mom+0.042
Neg logit contributions
rob-0.050
ags-0.048
Whe-0.042
 Whe-0.042
 retri-0.038
Tokenries
Token ID1678
Feature activation-9.00e-01

Pos logit contributions
+0.109
ups+0.100
spe+0.099
J+0.095
Jud+0.095
Neg logit contributions
rob-0.065
ags-0.064
Whe-0.052
 Whe-0.051
 Ow-0.049
Token.
Token ID13
Feature activation+7.22e-02

Pos logit contributions
+0.026
ups+0.025
spe+0.024
Jud+0.023
J+0.023
Neg logit contributions
ags-0.029
rob-0.028
 Whe-0.025
Whe-0.024
 Ow-0.024
Token Can
Token ID1680
Feature activation-7.36e-01

Pos logit contributions
ags+0.002
rob+0.002
 Whe+0.002
Whe+0.002
 Ow+0.002
Neg logit contributions
-0.002
ups-0.002
spe-0.002
J-0.002
Jud-0.002
Token you
Token ID345
Feature activation+1.46e-01

Pos logit contributions
ups+0.021
spe+0.020
Jud+0.019
+0.019
J+0.019
Neg logit contributions
rob-0.022
ags-0.022
 Whe-0.021
 Ow-0.021
Whe-0.020
Token help
Token ID1037
Feature activation-1.13e-01

Pos logit contributions
rob+0.005
ags+0.005
 Whe+0.004
Whe+0.004
 Ow+0.004
Neg logit contributions
-0.004
ups-0.004
spe-0.004
Jud-0.004
"-0.004
Token
Token ID198
Feature activation-6.40e-01

Pos logit contributions
+0.025
ups+0.025
spe+0.025
bug+0.023
 pod+0.022
Neg logit contributions
ags-0.015
rob-0.014
 Whe-0.014
 Ow-0.013
Whe-0.012
Token"
Token ID1
Feature activation-6.61e-01

Pos logit contributions
 pod+0.015
melon+0.015
bug+0.015
bugs+0.014
ups+0.014
Neg logit contributions
ags-0.018
 Whe-0.018
rob-0.017
Whe-0.016
 Robbie-0.016
TokenIt
Token ID1026
Feature activation-6.03e-01

Pos logit contributions
+0.019
ups+0.017
spe+0.016
Jud+0.016
bug+0.015
Neg logit contributions
rob-0.021
ags-0.021
 Whe-0.020
Whe-0.020
 Ow-0.018
Token is
Token ID318
Feature activation+2.77e-01

Pos logit contributions
+0.019
ups+0.019
spe+0.018
J+0.018
Jud+0.018
Neg logit contributions
rob-0.016
ags-0.016
 Whe-0.014
 Ow-0.014
Whe-0.014
Token a
Token ID257
Feature activation+8.59e-01

Pos logit contributions
ags+0.007
rob+0.007
 Whe+0.006
Whe+0.006
 Ow+0.006
Neg logit contributions
-0.010
ups-0.009
spe-0.009
J-0.008
Jud-0.008
Token skysc
Token ID37885
Feature activation-2.77e+00

Pos logit contributions
rob+0.024
ags+0.024
 Whe+0.023
Whe+0.021
 retri+0.020
Neg logit contributions
-0.028
ups-0.027
spe-0.024
Jud-0.024
J-0.023
Tokenraper
Token ID38545
Feature activation-1.22e+00

Pos logit contributions
+0.070
ups+0.063
spe+0.056
Jud+0.055
J+0.052
Neg logit contributions
ags-0.104
rob-0.098
 Whe-0.089
Whe-0.089
 Ow-0.087
Token.
Token ID13
Feature activation+2.64e-01

Pos logit contributions
+0.032
ups+0.029
spe+0.027
Jud+0.026
J+0.025
Neg logit contributions
ags-0.043
rob-0.043
 Whe-0.039
Whe-0.038
 Ow-0.037
Token It
Token ID632
Feature activation-3.60e-01

Pos logit contributions
rob+0.007
ags+0.007
 Whe+0.006
Whe+0.006
 Ow+0.005
Neg logit contributions
-0.010
ups-0.009
spe-0.008
J-0.008
"-0.008
Token is
Token ID318
Feature activation+1.44e-01

Pos logit contributions
+0.013
ups+0.012
spe+0.011
J+0.011
Jud+0.011
Neg logit contributions
rob-0.009
ags-0.009
 Whe-0.008
Whe-0.008
 Ow-0.007
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
ags+0.004
rob+0.004
 Whe+0.003
Whe+0.003
 Ow+0.003
Neg logit contributions
-0.005
ups-0.005
spe-0.004
J-0.004
Jud-0.004
Token they
Token ID484
Feature activation-7.37e-01

Pos logit contributions
+0.007
ups+0.005
"+0.004
 soon+0.004
J+0.004
Neg logit contributions
ags-0.034
rob-0.034
Whe-0.030
 Whe-0.029
 retri-0.029
Token saw
Token ID2497
Feature activation-1.41e+00

Pos logit contributions
ups+0.022
edient+0.020
+0.020
spe+0.019
orse+0.018
Neg logit contributions
 Ow-0.018
ags-0.018
 Whe-0.018
 luggage-0.017
rob-0.017
Token a
Token ID257
Feature activation-3.32e-01

Pos logit contributions
+0.051
ups+0.048
J+0.044
spe+0.043
Jud+0.043
Neg logit contributions
rob-0.037
ags-0.037
Whe-0.031
 Whe-0.031
 retri-0.029
Token new
Token ID649
Feature activation-2.90e-01

Pos logit contributions
ups+0.011
+0.011
spe+0.010
Jud+0.010
J+0.009
Neg logit contributions
rob-0.009
 Whe-0.008
ags-0.008
Whe-0.008
 retri-0.008
Token ladder
Token ID18002
Feature activation-4.89e-01

Pos logit contributions
+0.008
ups+0.008
spe+0.007
J+0.007
Jud+0.007
Neg logit contributions
ags-0.009
rob-0.009
 Whe-0.008
Whe-0.008
 Ow-0.008
Token leaning
Token ID21804
Feature activation-2.76e+00

Pos logit contributions
Jud+0.016
J+0.016
ups+0.016
+0.016
Rachel+0.014
Neg logit contributions
rob-0.012
ags-0.012
 retri-0.010
 de-0.010
 Whe-0.009
Token on
Token ID319
Feature activation-1.03e+00

Pos logit contributions
ups+0.094
+0.087
Jud+0.086
orse+0.081
J+0.080
Neg logit contributions
ags-0.072
rob-0.067
Whe-0.060
 retri-0.060
 Whe-0.059
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
+0.029
"+0.024
 soon+0.024
ups+0.023
Mom+0.022
Neg logit contributions
ags-0.038
rob-0.037
Whe-0.033
 Whe-0.033
 retri-0.032
Token wall
Token ID3355
Feature activation-6.24e-01

Pos logit contributions
rob+0.006
ags+0.005
 Whe+0.005
Whe+0.005
 Ow+0.005
Neg logit contributions
-0.004
ups-0.004
spe-0.004
"-0.003
Jud-0.003
Token of
Token ID286
Feature activation-8.96e-01

Pos logit contributions
Jud+0.019
+0.019
ups+0.018
J+0.018
 messenger+0.017
Neg logit contributions
ags-0.016
rob-0.015
 de-0.013
 Whe-0.013
Whe-0.012
Token their
Token ID511
Feature activation-6.63e-02

Pos logit contributions
+0.023
 soon+0.020
"+0.020
Mom+0.019
ups+0.019
Neg logit contributions
rob-0.034
ags-0.032
Whe-0.029
 retri-0.028
agged-0.028
Token Lia
Token ID49520
Feature activation-7.74e-01

Pos logit contributions
+0.007
ups+0.006
spe+0.006
"+0.005
Mom+0.005
Neg logit contributions
rob-0.008
ags-0.007
Whe-0.007
 Whe-0.007
 retri-0.006
Token's
Token ID338
Feature activation-3.97e-01

Pos logit contributions
+0.021
ups+0.018
"+0.016
spe+0.016
 soon+0.016
Neg logit contributions
ags-0.028
rob-0.027
 Whe-0.025
Whe-0.024
 Ow-0.023
Token rocket
Token ID10701
Feature activation-6.64e-01

Pos logit contributions
+0.014
ups+0.013
spe+0.012
"+0.012
Jud+0.011
Neg logit contributions
rob-0.011
ags-0.010
Whe-0.009
 Whe-0.009
 Ow-0.008
Token is
Token ID318
Feature activation-1.22e+00

Pos logit contributions
+0.018
ups+0.017
Jud+0.016
J+0.016
spe+0.015
Neg logit contributions
rob-0.022
ags-0.022
 Whe-0.019
Whe-0.019
 retri-0.018
Token red
Token ID2266
Feature activation-4.12e-01

Pos logit contributions
+0.036
ups+0.035
Jud+0.033
J+0.033
spe+0.032
Neg logit contributions
ags-0.037
rob-0.035
 Whe-0.032
Whe-0.031
 retri-0.030
Token and
Token ID290
Feature activation-2.76e+00

Pos logit contributions
+0.011
ups+0.009
spe+0.009
"+0.009
 soon+0.008
Neg logit contributions
rob-0.015
ags-0.015
 Whe-0.013
Whe-0.013
 Ow-0.013
Token Ben
Token ID3932
Feature activation-5.86e-01

Pos logit contributions
ups+0.086
Jud+0.085
J+0.082
+0.081
spe+0.078
Neg logit contributions
rob-0.078
 Whe-0.072
 de-0.069
ags-0.068
Whe-0.067
Token's
Token ID338
Feature activation-2.07e-01

Pos logit contributions
+0.014
ups+0.012
spe+0.011
"+0.010
J+0.010
Neg logit contributions
ags-0.023
rob-0.022
 Whe-0.020
Whe-0.020
 Ow-0.020
Token rocket
Token ID10701
Feature activation-1.05e+00

Pos logit contributions
+0.007
ups+0.006
spe+0.006
"+0.006
 soon+0.005
Neg logit contributions
rob-0.006
ags-0.006
Whe-0.005
 Whe-0.005
 Ow-0.005
Token is
Token ID318
Feature activation-1.09e+00

Pos logit contributions
+0.028
Jud+0.028
ups+0.027
J+0.027
spe+0.024
Neg logit contributions
rob-0.033
ags-0.033
 Whe-0.030
Whe-0.029
 retri-0.028
Token blue
Token ID4171
Feature activation-4.63e-01

Pos logit contributions
+0.033
ups+0.032
J+0.030
Jud+0.030
spe+0.029
Neg logit contributions
ags-0.033
rob-0.031
 Whe-0.028
Whe-0.027
 retri-0.026
Token her
Token ID607
Feature activation+5.10e-01

Pos logit contributions
rob+0.003
ags+0.003
Whe+0.003
 Whe+0.003
 Ow+0.002
Neg logit contributions
-0.004
ups-0.003
J-0.003
"-0.003
Jud-0.003
Token not
Token ID407
Feature activation-6.02e-01

Pos logit contributions
ags+0.012
 luggage+0.011
 Whe+0.011
 Ow+0.010
 fault+0.010
Neg logit contributions
J-0.016
-0.016
Jud-0.015
CR-0.015
ups-0.015
Token to
Token ID284
Feature activation-6.02e-01

Pos logit contributions
+0.021
spe+0.020
oh+0.019
J+0.019
ups+0.019
Neg logit contributions
ags-0.014
 Whe-0.012
 luggage-0.012
 such-0.011
 fur-0.011
Token investigate
Token ID9161
Feature activation+6.43e-01

Pos logit contributions
+0.018
ups+0.017
spe+0.017
Jud+0.016
J+0.015
Neg logit contributions
ags-0.018
 Whe-0.015
 Ow-0.015
Whe-0.015
rob-0.014
Token books
Token ID3835
Feature activation-1.10e+00

Pos logit contributions
rob+0.026
ags+0.025
Whe+0.024
 Whe+0.023
 Ow+0.022
Neg logit contributions
-0.016
ups-0.013
"-0.012
 soon-0.012
spe-0.012
Tokenhel
Token ID2978
Feature activation-2.74e+00

Pos logit contributions
+0.029
ups+0.026
spe+0.024
"+0.023
J+0.023
Neg logit contributions
rob-0.040
ags-0.039
 Whe-0.036
Whe-0.036
 Ow-0.034
Tokenves
Token ID1158
Feature activation-8.19e-01

Pos logit contributions
+0.079
ups+0.068
spe+0.065
Jud+0.064
J+0.063
Neg logit contributions
rob-0.091
ags-0.090
 Whe-0.077
 Ow-0.076
Whe-0.076
Token alone
Token ID3436
Feature activation-1.01e-01

Pos logit contributions
+0.023
ups+0.021
spe+0.019
J+0.018
Jud+0.018
Neg logit contributions
ags-0.028
rob-0.027
 Whe-0.025
Whe-0.024
 Ow-0.023
Token,
Token ID11
Feature activation-2.81e-01

Pos logit contributions
+0.003
ups+0.002
spe+0.002
Jud+0.002
J+0.002
Neg logit contributions
ags-0.004
rob-0.003
 Whe-0.003
Whe-0.003
 Ow-0.003
Token but
Token ID475
Feature activation+1.28e+00

Pos logit contributions
+0.011
ups+0.010
spe+0.009
 soon+0.009
"+0.009
Neg logit contributions
rob-0.008
ags-0.007
Whe-0.006
 Whe-0.006
 Ow-0.006
Token she
Token ID673
Feature activation-9.81e-02

Pos logit contributions
ags+0.047
rob+0.047
 Whe+0.043
Whe+0.042
 Ow+0.041
Neg logit contributions
-0.031
ups-0.029
spe-0.026
J-0.025
Jud-0.025
Token They
Token ID1119
Feature activation+2.39e-01

Pos logit contributions
ags+0.016
rob+0.016
 Whe+0.015
Whe+0.014
 Ow+0.014
Neg logit contributions
-0.020
ups-0.019
spe-0.018
Jud-0.017
J-0.017
Token see
Token ID766
Feature activation-8.64e-01

Pos logit contributions
ags+0.007
rob+0.006
 Whe+0.006
Whe+0.006
 Ow+0.005
Neg logit contributions
ups-0.008
-0.008
spe-0.007
Jud-0.007
J-0.007
Token a
Token ID257
Feature activation-5.03e-01

Pos logit contributions
+0.028
ups+0.026
J+0.024
"+0.024
Mom+0.023
Neg logit contributions
rob-0.026
ags-0.025
 retri-0.022
Whe-0.022
 Whe-0.021
Token big
Token ID1263
Feature activation-1.05e+00

Pos logit contributions
ups+0.018
+0.018
spe+0.017
edient+0.016
Jud+0.016
Neg logit contributions
rob-0.012
 Whe-0.011
 luggage-0.010
ags-0.010
 retri-0.010
Token ladder
Token ID18002
Feature activation-5.05e-01

Pos logit contributions
+0.031
ups+0.029
spe+0.027
J+0.026
Jud+0.026
Neg logit contributions
ags-0.033
rob-0.033
 Whe-0.030
Whe-0.029
 Ow-0.028
Token leaning
Token ID21804
Feature activation-2.73e+00

Pos logit contributions
Jud+0.017
+0.017
ups+0.017
J+0.016
orse+0.015
Neg logit contributions
ags-0.013
rob-0.012
 retri-0.010
 de-0.010
 Whe-0.010
Token on
Token ID319
Feature activation-9.35e-01

Pos logit contributions
ups+0.094
+0.087
Jud+0.086
orse+0.080
J+0.079
Neg logit contributions
ags-0.071
rob-0.064
 Whe-0.059
Whe-0.059
 retri-0.057
Token the
Token ID262
Feature activation+9.24e-02

Pos logit contributions
+0.027
ups+0.024
"+0.022
spe+0.022
 soon+0.022
Neg logit contributions
ags-0.032
rob-0.032
 Whe-0.028
Whe-0.028
 retri-0.027
Token wall
Token ID3355
Feature activation-5.34e-01

Pos logit contributions
rob+0.003
ags+0.003
 Whe+0.003
Whe+0.003
 Ow+0.003
Neg logit contributions
-0.003
ups-0.002
spe-0.002
"-0.002
Jud-0.002
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
Jud+0.017
+0.017
ups+0.016
J+0.016
 messenger+0.015
Neg logit contributions
ags-0.014
rob-0.012
 de-0.011
 Whe-0.010
 about-0.010
Token They
Token ID1119
Feature activation+5.65e-02

Pos logit contributions
+0.000
ups+0.000
spe+0.000
J+0.000
"+0.000
Neg logit contributions
rob-0.000
ags-0.000
 Whe-0.000
Whe-0.000
 Ow-0.000
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
ups+0.011
+0.011
spe+0.010
Jud+0.010
J+0.009
Neg logit contributions
rob-0.012
ags-0.012
 Whe-0.012
Whe-0.011
 retri-0.011
Token He
Token ID679
Feature activation+3.34e-01

Pos logit contributions
rob+0.000
ags+0.000
Whe+0.000
 Whe+0.000
 Ow+0.000
Neg logit contributions
-0.000
ups-0.000
spe-0.000
"-0.000
J-0.000
Token put
Token ID1234
Feature activation+5.88e-01

Pos logit contributions
ags+0.014
rob+0.014
 Whe+0.012
Whe+0.012
 Ow+0.012
Neg logit contributions
-0.007
ups-0.006
spe-0.006
"-0.005
J-0.005
Token on
Token ID319
Feature activation+1.17e+00

Pos logit contributions
ses+0.026
 fault+0.025
rob+0.024
ags+0.024
Whe+0.024
Neg logit contributions
-0.008
 soon-0.007
"-0.006
Mom-0.006
J-0.005
Token his
Token ID465
Feature activation+4.06e-01

Pos logit contributions
rob+0.038
ags+0.038
Whe+0.033
 Whe+0.033
 retri+0.032
Neg logit contributions
-0.037
ups-0.033
"-0.029
J-0.029
spe-0.029
Token coat
Token ID13209
Feature activation+2.33e+00

Pos logit contributions
rob+0.013
ags+0.011
 Ow+0.011
Whe+0.010
 Whe+0.010
Neg logit contributions
-0.013
ups-0.011
"-0.010
Mom-0.010
washer-0.010
Token and
Token ID290
Feature activation+7.74e-01

Pos logit contributions
ags+0.077
rob+0.075
Whe+0.069
 Whe+0.067
 Ow+0.066
Neg logit contributions
-0.073
ups-0.065
"-0.060
 soon-0.060
spe-0.059
Token put
Token ID1234
Feature activation+6.05e-01

Pos logit contributions
rob+0.023
ags+0.023
 Whe+0.020
Whe+0.020
 Ow+0.019
Neg logit contributions
-0.024
ups-0.023
spe-0.022
Jud-0.021
J-0.021
Token his
Token ID465
Feature activation-1.52e-01

Pos logit contributions
ags+0.023
rob+0.023
ses+0.021
Whe+0.021
 Ow+0.021
Neg logit contributions
-0.013
ups-0.011
 soon-0.011
"-0.011
Mom-0.010
Token things
Token ID1243
Feature activation+9.95e-02

Pos logit contributions
+0.005
ups+0.004
 soon+0.004
"+0.004
Mom+0.004
Neg logit contributions
rob-0.004
ags-0.004
Whe-0.004
 Whe-0.004
 Ow-0.004
Token in
Token ID287
Feature activation+1.90e-01

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.003
Whe+0.003
 luggage+0.002
Neg logit contributions
-0.003
ups-0.003
Jud-0.003
spe-0.003
J-0.003
Token making
Token ID1642
Feature activation-4.53e-01

Pos logit contributions
+0.022
ups+0.020
spe+0.018
"+0.018
J+0.018
Neg logit contributions
rob-0.022
ags-0.022
 Whe-0.019
Whe-0.019
 Ow-0.018
Token lights
Token ID7588
Feature activation-6.88e-01

Pos logit contributions
+0.016
ups+0.015
"+0.013
J+0.013
 soon+0.013
Neg logit contributions
rob-0.012
ags-0.012
 Whe-0.011
Whe-0.011
 retri-0.010
Token!
Token ID0
Feature activation+4.84e-01

Pos logit contributions
+0.022
ups+0.021
Jud+0.020
J+0.020
spe+0.020
Neg logit contributions
ags-0.020
rob-0.019
 Whe-0.017
Whe-0.017
 Ow-0.016
Token I
Token ID314
Feature activation+6.92e-01

Pos logit contributions
rob+0.018
ags+0.017
Whe+0.014
 retri+0.014
 luggage+0.014
Neg logit contributions
-0.015
"-0.011
ups-0.011
Mom-0.010
spe-0.010
Token step
Token ID2239
Feature activation+5.78e-01

Pos logit contributions
ags+0.023
rob+0.023
 Whe+0.020
Whe+0.020
 Ow+0.020
Neg logit contributions
-0.021
ups-0.018
spe-0.017
"-0.016
J-0.016
Token back
Token ID736
Feature activation+2.20e+00

Pos logit contributions
ags+0.019
rob+0.018
 Whe+0.017
Whe+0.016
 Ow+0.016
Neg logit contributions
-0.017
ups-0.016
spe-0.015
Jud-0.014
J-0.014
Token and
Token ID290
Feature activation+5.62e-01

Pos logit contributions
rob+0.068
ags+0.068
 Whe+0.060
Whe+0.060
 Ow+0.057
Neg logit contributions
-0.067
ups-0.062
spe-0.058
"-0.055
J-0.055
Token it
Token ID340
Feature activation+1.09e-01

Pos logit contributions
ags+0.016
rob+0.016
 Whe+0.015
Whe+0.015
 Ow+0.013
Neg logit contributions
-0.016
ups-0.016
J-0.015
spe-0.015
Jud-0.015
Token makes
Token ID1838
Feature activation-4.52e-01

Pos logit contributions
rob+0.003
ags+0.003
 Whe+0.003
Whe+0.003
 Ow+0.003
Neg logit contributions
-0.003
ups-0.003
J-0.003
spe-0.003
Jud-0.003
Token a
Token ID257
Feature activation+4.61e-01

Pos logit contributions
+0.015
ups+0.013
"+0.012
spe+0.012
J+0.011
Neg logit contributions
ags-0.014
rob-0.014
 Whe-0.012
Whe-0.012
 retri-0.012
Token cool
Token ID3608
Feature activation-5.98e-01

Pos logit contributions
ags+0.012
rob+0.011
 Whe+0.011
Whe+0.010
 Ow+0.010
Neg logit contributions
-0.016
ups-0.016
Jud-0.014
spe-0.014
J-0.014
Token classroom
Token ID15806
Feature activation-3.66e-01

Pos logit contributions
ags+0.015
 Whe+0.014
Whe+0.013
rob+0.013
 luggage+0.012
Neg logit contributions
-0.020
ups-0.019
Jud-0.019
spe-0.018
edient-0.017
Token.
Token ID13
Feature activation-4.06e-01

Pos logit contributions
Jud+0.011
+0.010
Mom+0.010
ups+0.010
J+0.010
Neg logit contributions
ags-0.009
rob-0.009
 de-0.009
 Whe-0.008
 keys-0.008
Token
Token ID220
Feature activation+7.55e-02

Pos logit contributions
+0.012
ups+0.011
spe+0.010
J+0.010
"+0.010
Neg logit contributions
rob-0.013
ags-0.013
 Whe-0.011
Whe-0.011
 Ow-0.011
Token
Token ID198
Feature activation+9.45e-03

Pos logit contributions
ags+0.002
rob+0.002
 Whe+0.002
Whe+0.002
 Ow+0.002
Neg logit contributions
-0.002
ups-0.002
spe-0.002
Jud-0.002
J-0.002
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
ags+0.000
rob+0.000
 Whe+0.000
Whe+0.000
 Ow+0.000
Neg logit contributions
-0.000
ups-0.000
spe-0.000
edient-0.000
ho-0.000
TokenM
Token ID44
Feature activation+2.16e+00

Pos logit contributions
ho+0.003
spe+0.003
edient+0.003
ups+0.002
washer+0.002
Neg logit contributions
Whe-0.003
ags-0.003
 Whe-0.003
rob-0.003
 Ow-0.002
Tokenolly
Token ID5098
Feature activation+3.35e-01

Pos logit contributions
ags+0.076
rob+0.072
 retri+0.065
 Whe+0.064
 Ow+0.064
Neg logit contributions
-0.055
ups-0.049
spe-0.046
Jud-0.044
edient-0.043
Token looked
Token ID3114
Feature activation+5.47e-01

Pos logit contributions
rob+0.011
ags+0.011
 Whe+0.010
Whe+0.010
 retri+0.010
Neg logit contributions
-0.008
ups-0.008
spe-0.007
Jud-0.007
J-0.007
Token around
Token ID1088
Feature activation+7.62e-01

Pos logit contributions
ags+0.018
rob+0.018
 Whe+0.016
Whe+0.016
 Ow+0.015
Neg logit contributions
-0.016
ups-0.015
spe-0.013
Jud-0.013
J-0.013
Token the
Token ID262
Feature activation+4.43e-01

Pos logit contributions
rob+0.028
ags+0.026
Whe+0.024
 Ow+0.023
 Whe+0.022
Neg logit contributions
-0.023
 soon-0.019
"-0.018
J-0.018
ups-0.018
Token room
Token ID2119
Feature activation+7.16e-02

Pos logit contributions
ags+0.012
 Whe+0.012
Whe+0.011
rob+0.010
 de+0.009
Neg logit contributions
-0.014
Jud-0.013
spe-0.013
ups-0.013
edient-0.013
Token and
Token ID290
Feature activation+7.95e-01

Pos logit contributions
rob+0.019
ags+0.019
 Whe+0.017
Whe+0.017
 Ow+0.016
Neg logit contributions
-0.016
ups-0.015
spe-0.014
Jud-0.013
J-0.013
Token c
Token ID269
Feature activation+1.26e-01

Pos logit contributions
rob+0.025
ags+0.025
 Whe+0.023
Whe+0.023
 retri+0.021
Neg logit contributions
-0.022
ups-0.021
J-0.019
spe-0.019
Jud-0.019
Tokenuddled
Token ID32745
Feature activation+7.86e-01

Pos logit contributions
rob+0.004
ags+0.003
 Whe+0.003
Whe+0.003
 luggage+0.003
Neg logit contributions
ups-0.004
-0.004
spe-0.004
"-0.004
oh-0.004
Token together
Token ID1978
Feature activation-1.26e+00

Pos logit contributions
ags+0.027
rob+0.027
Whe+0.024
 Whe+0.024
 Ow+0.024
Neg logit contributions
-0.021
ups-0.019
spe-0.018
 soon-0.017
"-0.017
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
+0.034
ups+0.032
spe+0.029
Jud+0.029
J+0.028
Neg logit contributions
ags-0.042
rob-0.041
 Whe-0.038
Whe-0.037
 retri-0.035
Token Mitt
Token ID16627
Feature activation+2.15e+00

Pos logit contributions
ags+0.008
rob+0.008
 Whe+0.007
Whe+0.007
 Ow+0.007
Neg logit contributions
-0.010
ups-0.009
spe-0.009
J-0.008
Jud-0.008
Tokenens
Token ID641
Feature activation+2.89e-01

Pos logit contributions
ags+0.072
rob+0.065
 Whe+0.061
 luggage+0.060
Whe+0.059
Neg logit contributions
-0.070
ups-0.062
 soon-0.056
oh-0.055
"-0.055
Token was
Token ID373
Feature activation+3.20e-01

Pos logit contributions
ags+0.011
rob+0.010
 Whe+0.009
Whe+0.009
 Ow+0.009
Neg logit contributions
-0.008
ups-0.007
spe-0.006
"-0.006
 soon-0.006
Token a
Token ID257
Feature activation+8.74e-01

Pos logit contributions
ags+0.009
rob+0.009
 Whe+0.008
Whe+0.008
 retri+0.007
Neg logit contributions
-0.010
ups-0.010
spe-0.009
Jud-0.009
J-0.009
Token good
Token ID922
Feature activation-1.70e-01

Pos logit contributions
rob+0.025
ags+0.025
 Whe+0.025
Whe+0.024
 retri+0.023
Neg logit contributions
ups-0.026
-0.025
Jud-0.022
spe-0.022
J-0.022
Token cat
Token ID3797
Feature activation-2.00e-01

Pos logit contributions
+0.006
ups+0.005
spe+0.005
J+0.005
Jud+0.005
Neg logit contributions
ags-0.005
rob-0.005
 Whe-0.004
Whe-0.004
 Ow-0.004
Token good
Token ID922
Feature activation+4.54e-01

Pos logit contributions
rob+0.001
ags+0.001
Whe+0.001
 Whe+0.001
 Ow+0.001
Neg logit contributions
-0.001
"-0.001
ups-0.001
 soon-0.001
spe-0.000
Token for
Token ID329
Feature activation-5.18e-01

Pos logit contributions
rob+0.017
ags+0.017
Whe+0.015
 Whe+0.015
 Ow+0.014
Neg logit contributions
-0.011
ups-0.011
"-0.009
spe-0.009
 soon-0.009
Token you
Token ID345
Feature activation-1.54e-01

Pos logit contributions
+0.014
ups+0.012
"+0.012
 soon+0.011
J+0.010
Neg logit contributions
rob-0.020
ags-0.018
Whe-0.017
 Whe-0.017
 retri-0.016
Token.
Token ID13
Feature activation+4.10e-01

Pos logit contributions
+0.005
ups+0.004
Jud+0.004
spe+0.004
J+0.004
Neg logit contributions
ags-0.005
rob-0.004
 Whe-0.004
Whe-0.004
 Ow-0.004
Token But
Token ID887
Feature activation+1.12e+00

Pos logit contributions
ags+0.011
rob+0.011
 Whe+0.010
Whe+0.010
 Ow+0.009
Neg logit contributions
-0.014
ups-0.013
spe-0.012
J-0.012
Jud-0.012
Token if
Token ID611
Feature activation+2.62e+00

Pos logit contributions
rob+0.044
ags+0.043
Whe+0.038
 Whe+0.038
 Ow+0.036
Neg logit contributions
-0.029
ups-0.024
"-0.022
spe-0.022
 soon-0.022
Token you
Token ID345
Feature activation-4.16e-03

Pos logit contributions
ags+0.096
rob+0.096
 Whe+0.087
Whe+0.086
 Ow+0.083
Neg logit contributions
-0.065
ups-0.059
spe-0.054
Jud-0.051
J-0.051
Token walk
Token ID2513
Feature activation+2.37e-01

Pos logit contributions
+0.000
 soon+0.000
ups+0.000
"+0.000
spe+0.000
Neg logit contributions
rob-0.000
ags-0.000
 Whe-0.000
Whe-0.000
 retri-0.000
Token,
Token ID11
Feature activation+9.81e-01

Pos logit contributions
rob+0.010
ags+0.009
Whe+0.009
 Ow+0.008
 Whe+0.008
Neg logit contributions
-0.005
 soon-0.005
ups-0.004
"-0.004
oe-0.004
Token I
Token ID314
Feature activation+4.92e-01

Pos logit contributions
ags+0.033
rob+0.033
 Whe+0.031
Whe+0.030
 Ow+0.029
Neg logit contributions
-0.025
ups-0.024
spe-0.022
Jud-0.021
J-0.021
Tokenâ
Token ID22940
Feature activation+3.66e-01

Pos logit contributions
rob+0.018
ags+0.018
Whe+0.016
 Whe+0.016
 Ow+0.015
Neg logit contributions
-0.013
ups-0.011
spe-0.010
"-0.010
 soon-0.010
Token and
Token ID290
Feature activation-4.85e-02

Pos logit contributions
rob+0.005
ags+0.005
 Whe+0.004
Whe+0.004
 Ow+0.004
Neg logit contributions
-0.004
ups-0.003
spe-0.003
J-0.003
"-0.003
Token excited
Token ID6568
Feature activation-2.74e-01

Pos logit contributions
+0.001
ups+0.001
spe+0.001
Jud+0.001
J+0.001
Neg logit contributions
ags-0.002
rob-0.001
 Whe-0.001
Whe-0.001
 de-0.001
Token.
Token ID13
Feature activation+7.73e-02

Pos logit contributions
+0.006
ups+0.006
spe+0.005
J+0.005
Jud+0.005
Neg logit contributions
ags-0.011
rob-0.011
 Whe-0.010
Whe-0.010
 Ow-0.009
Token
Token ID198
Feature activation+3.09e-01

Pos logit contributions
rob+0.002
ags+0.002
Whe+0.002
 Whe+0.002
 Ow+0.002
Neg logit contributions
-0.002
ups-0.002
spe-0.002
J-0.002
"-0.002
Token
Token ID198
Feature activation+6.34e-02

Pos logit contributions
ags+0.008
rob+0.008
 Whe+0.008
Whe+0.007
 Ow+0.007
Neg logit contributions
-0.010
ups-0.010
spe-0.009
Jud-0.009
J-0.009
TokenBen
Token ID11696
Feature activation+6.09e-01

Pos logit contributions
 Whe+0.001
ags+0.001
 Das+0.001
rob+0.001
Whe+0.001
Neg logit contributions
spe-0.002
bell-0.002
bug-0.002
ho-0.002
edient-0.002
Token and
Token ID290
Feature activation+5.64e-01

Pos logit contributions
ags+0.015
rob+0.015
 Whe+0.013
Whe+0.012
 Ow+0.012
Neg logit contributions
-0.022
ups-0.021
spe-0.020
Jud-0.019
J-0.019
Token Lily
Token ID20037
Feature activation+9.55e-01

Pos logit contributions
ags+0.013
rob+0.013
 Whe+0.013
Whe+0.011
 retri+0.011
Neg logit contributions
-0.021
ups-0.020
spe-0.018
Jud-0.018
oh-0.018
Token held
Token ID2714
Feature activation+6.52e-01

Pos logit contributions
ags+0.034
rob+0.034
 Whe+0.031
Whe+0.030
 Ow+0.029
Neg logit contributions
-0.024
ups-0.022
spe-0.020
Jud-0.019
J-0.019
Token hands
Token ID2832
Feature activation+1.69e+00

Pos logit contributions
rob+0.024
ags+0.021
Whe+0.020
 Whe+0.020
 Ow+0.020
Neg logit contributions
-0.018
ups-0.015
"-0.014
spe-0.013
bug-0.013
Token and
Token ID290
Feature activation+7.18e-01

Pos logit contributions
ags+0.045
rob+0.041
 Whe+0.038
 de+0.036
 Ow+0.036
Neg logit contributions
-0.055
ups-0.054
Jud-0.052
spe-0.050
J-0.050
Token don
Token ID836
Feature activation-7.05e-01

Pos logit contributions
+0.010
ups+0.009
spe+0.008
 soon+0.008
"+0.007
Neg logit contributions
rob-0.012
ags-0.012
Whe-0.011
 Whe-0.011
 retri-0.010
Token't
Token ID470
Feature activation-1.71e-01

Pos logit contributions
+0.017
ups+0.015
spe+0.014
Jud+0.013
J+0.013
Neg logit contributions
ags-0.026
rob-0.026
 Whe-0.024
Whe-0.023
 Ow-0.023
Token belong
Token ID5594
Feature activation+1.62e-01

Pos logit contributions
+0.005
ups+0.005
spe+0.005
J+0.004
Jud+0.004
Neg logit contributions
ags-0.005
rob-0.005
 Whe-0.005
Whe-0.005
 Ow-0.004
Token.
Token ID13
Feature activation-3.79e-01

Pos logit contributions
ags+0.005
rob+0.005
 Whe+0.004
 Ow+0.004
 retri+0.004
Neg logit contributions
-0.005
spe-0.004
ups-0.004
Jud-0.004
saw-0.004
Token She
Token ID1375
Feature activation-6.99e-01

Pos logit contributions
+0.012
ups+0.012
spe+0.011
J+0.011
Jud+0.011
Neg logit contributions
ags-0.011
rob-0.010
 Whe-0.009
Whe-0.009
 Ow-0.009
Token promised
Token ID8072
Feature activation+4.07e-01

Pos logit contributions
+0.017
ups+0.017
spe+0.016
Jud+0.015
J+0.015
Neg logit contributions
rob-0.024
ags-0.023
 Whe-0.021
Whe-0.021
 Ow-0.020
Token to
Token ID284
Feature activation+2.00e-01

Pos logit contributions
ags+0.014
rob+0.014
 Whe+0.013
Whe+0.012
 Ow+0.012
Neg logit contributions
-0.011
ups-0.010
spe-0.009
J-0.009
Jud-0.009
Token always
Token ID1464
Feature activation+6.80e-01

Pos logit contributions
ags+0.006
rob+0.006
 Whe+0.005
Whe+0.005
 Ow+0.005
Neg logit contributions
-0.006
ups-0.006
spe-0.005
J-0.005
saw-0.005
Token be
Token ID307
Feature activation-6.10e-01

Pos logit contributions
ags+0.022
rob+0.022
 Whe+0.019
Whe+0.019
 Ow+0.018
Neg logit contributions
-0.020
ups-0.019
spe-0.017
J-0.017
Jud-0.016
Token kind
Token ID1611
Feature activation-5.34e-01

Pos logit contributions
+0.017
spe+0.016
ups+0.016
ho+0.015
oh+0.015
Neg logit contributions
ags-0.021
 Whe-0.017
rob-0.017
Whe-0.017
 retri-0.016
Token to
Token ID284
Feature activation-4.12e-01

Pos logit contributions
+0.022
Jud+0.020
spe+0.020
J+0.020
saw+0.020
Neg logit contributions
ags-0.010
rob-0.008
 Whe-0.008
 Ow-0.007
 de-0.007
Token
Token ID198
Feature activation+7.25e-02

Pos logit contributions
ags+0.006
rob+0.006
Whe+0.005
 Whe+0.005
 luggage+0.005
Neg logit contributions
-0.004
ups-0.003
"-0.003
Jud-0.003
J-0.003
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
ags+0.002
rob+0.002
 Whe+0.002
Whe+0.002
 Ow+0.002
Neg logit contributions
-0.002
ups-0.002
spe-0.002
Jud-0.002
J-0.002
TokenBilly
Token ID49640
Feature activation-2.17e-01

Pos logit contributions
rob+0.003
ags+0.003
 Whe+0.003
Whe+0.003
 retri+0.003
Neg logit contributions
ups-0.004
bug-0.003
spe-0.003
-0.003
ho-0.003
Token said
Token ID531
Feature activation+1.45e-01

Pos logit contributions
+0.006
ups+0.006
spe+0.005
J+0.005
Jud+0.005
Neg logit contributions
rob-0.007
ags-0.007
 Whe-0.006
Whe-0.006
 Ow-0.006
Token sorry
Token ID7926
Feature activation-3.02e-01

Pos logit contributions
ags+0.004
 Whe+0.004
 retri+0.003
 Ow+0.003
rob+0.003
Neg logit contributions
ups-0.005
-0.005
Jud-0.005
saw-0.004
spe-0.004
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
+0.011
ups+0.010
spe+0.010
Jud+0.009
J+0.009
Neg logit contributions
ags-0.008
rob-0.007
 Whe-0.007
Whe-0.006
 Ow-0.006
Token but
Token ID475
Feature activation+8.71e-01

Pos logit contributions
rob+0.007
ags+0.007
Whe+0.006
 Ow+0.006
 Whe+0.005
Neg logit contributions
-0.006
"-0.005
ups-0.005
 soon-0.005
spe-0.005
Token Tim
Token ID5045
Feature activation-4.29e-01

Pos logit contributions
 Whe+0.023
 retri+0.023
rob+0.022
ags+0.022
 Robbie+0.022
Neg logit contributions
ups-0.028
-0.024
Jud-0.023
saw-0.023
J-0.023
Tokenmy
Token ID1820
Feature activation-6.47e-01

Pos logit contributions
+0.010
ups+0.009
spe+0.009
Jud+0.008
J+0.008
Neg logit contributions
ags-0.016
rob-0.016
 Whe-0.014
Whe-0.014
 Ow-0.014
Token didn
Token ID1422
Feature activation+2.02e-01

Pos logit contributions
+0.018
ups+0.017
spe+0.016
Jud+0.015
J+0.015
Neg logit contributions
rob-0.020
ags-0.020
 Whe-0.018
Whe-0.018
 retri-0.018
Token't
Token ID470
Feature activation+2.95e-01

Pos logit contributions
ags+0.006
rob+0.006
 Whe+0.006
 Ow+0.005
Whe+0.005
Neg logit contributions
ups-0.006
spe-0.006
Jud-0.005
-0.005
bug-0.005
Token ot
Token ID30972
Feature activation+1.24e-02

Pos logit contributions
+0.004
ups+0.004
Jud+0.004
J+0.004
spe+0.004
Neg logit contributions
rob-0.002
ags-0.002
 retri-0.002
 Ow-0.002
 luggage-0.002
Tokenter
Token ID353
Feature activation+2.58e-01

Pos logit contributions
rob+0.000
ags+0.000
 Whe+0.000
Whe+0.000
 retri+0.000
Neg logit contributions
-0.001
ups-0.000
spe-0.000
Jud-0.000
J-0.000
Token started
Token ID2067
Feature activation-1.27e+00

Pos logit contributions
rob+0.006
 retri+0.006
 fur+0.005
 re+0.005
 fle+0.005
Neg logit contributions
J-0.008
-0.007
ups-0.007
Jud-0.007
Mom-0.007
Token to
Token ID284
Feature activation-4.13e-02

Pos logit contributions
+0.037
ups+0.034
spe+0.032
J+0.032
Jud+0.031
Neg logit contributions
ags-0.039
rob-0.038
 Whe-0.036
Whe-0.034
 Ow-0.034
Token move
Token ID1445
Feature activation-4.67e-01

Pos logit contributions
+0.001
ups+0.001
spe+0.001
Jud+0.001
 soon+0.001
Neg logit contributions
ags-0.001
rob-0.001
 Whe-0.001
Whe-0.001
 Ow-0.001
Token again
Token ID757
Feature activation+2.68e-01

Pos logit contributions
+0.014
ups+0.014
Jud+0.013
spe+0.013
orse+0.013
Neg logit contributions
ags-0.013
rob-0.012
 Whe-0.011
Whe-0.011
 Ow-0.011
Token and
Token ID290
Feature activation-3.65e-02

Pos logit contributions
ags+0.009
rob+0.008
 Whe+0.008
Whe+0.007
 Ow+0.007
Neg logit contributions
ups-0.007
-0.007
spe-0.007
J-0.006
Jud-0.006
Token it
Token ID340
Feature activation-5.66e-01

Pos logit contributions
+0.001
ups+0.001
J+0.001
"+0.001
Jud+0.001
Neg logit contributions
rob-0.001
 Whe-0.001
Whe-0.001
ags-0.001
 Ow-0.001
Token sw
Token ID1509
Feature activation-1.50e-01

Pos logit contributions
+0.015
ups+0.014
Jud+0.013
spe+0.013
J+0.013
Neg logit contributions
rob-0.018
ags-0.017
 Whe-0.016
Whe-0.016
 retri-0.015
Tokenam
Token ID321
Feature activation+9.35e-02

Pos logit contributions
+0.004
ups+0.003
spe+0.003
J+0.003
Jud+0.003
Neg logit contributions
ags-0.006
rob-0.005
 Ow-0.005
Whe-0.005
 Whe-0.005
Token away
Token ID1497
Feature activation+7.30e-01

Pos logit contributions
ags+0.002
 retri+0.002
 de+0.002
 Whe+0.002
 Ow+0.002
Neg logit contributions
-0.003
orse-0.003
ups-0.003
Jud-0.003
bug-0.003
Tokeneb
Token ID1765
Feature activation-2.65e-01

Pos logit contributions
+0.013
ups+0.012
J+0.011
"+0.011
oh+0.011
Neg logit contributions
ags-0.019
rob-0.018
 Whe-0.017
Whe-0.016
 stret-0.015
Token was
Token ID373
Feature activation-5.62e-02

Pos logit contributions
+0.008
ups+0.007
Jud+0.006
saw+0.006
spe+0.006
Neg logit contributions
ags-0.008
rob-0.008
 Ow-0.007
 retri-0.007
 de-0.006
Token happy
Token ID3772
Feature activation-7.37e-01

Pos logit contributions
+0.002
ups+0.002
spe+0.001
Jud+0.001
J+0.001
Neg logit contributions
ags-0.002
rob-0.002
 Whe-0.002
Whe-0.002
 retri-0.001
Token because
Token ID780
Feature activation-1.13e+00

Pos logit contributions
+0.022
ups+0.021
spe+0.020
Jud+0.019
J+0.018
Neg logit contributions
ags-0.022
rob-0.022
 Whe-0.021
Whe-0.019
 Ow-0.019
Token his
Token ID465
Feature activation+3.81e-01

Pos logit contributions
ups+0.028
+0.027
Jud+0.025
spe+0.023
J+0.022
Neg logit contributions
rob-0.040
ags-0.039
 Whe-0.038
Whe-0.036
 retri-0.035
Token laser
Token ID12855
Feature activation+2.44e-01

Pos logit contributions
rob+0.011
ags+0.011
Whe+0.010
 Whe+0.010
 Ow+0.009
Neg logit contributions
-0.012
ups-0.011
spe-0.010
"-0.010
J-0.010
Token was
Token ID373
Feature activation+1.83e-01

Pos logit contributions
ags+0.009
rob+0.009
 Whe+0.008
Whe+0.008
 Ow+0.008
Neg logit contributions
-0.006
ups-0.006
spe-0.005
J-0.005
Jud-0.005
Token very
Token ID845
Feature activation-2.48e-01

Pos logit contributions
ags+0.006
rob+0.005
 Whe+0.005
 retri+0.005
 luggage+0.005
Neg logit contributions
-0.006
ups-0.005
spe-0.005
Jud-0.005
J-0.005
Token reliable
Token ID9314
Feature activation-2.71e-01

Pos logit contributions
+0.010
ups+0.010
Jud+0.010
bug+0.009
bugs+0.009
Neg logit contributions
ags-0.005
 de-0.004
 retri-0.003
 favorite-0.003
 pleasure-0.003
Token.
Token ID13
Feature activation-7.68e-01

Pos logit contributions
+0.008
Jud+0.007
ups+0.007
J+0.007
spe+0.007
Neg logit contributions
ags-0.008
rob-0.008
 Whe-0.007
 luggage-0.007
 de-0.007
Token He
Token ID679
Feature activation-3.52e-01

Pos logit contributions
+0.023
ups+0.020
spe+0.019
"+0.019
J+0.018
Neg logit contributions
ags-0.025
rob-0.024
Whe-0.021
 Ow-0.021
 Whe-0.020
Token at
Token ID379
Feature activation+8.77e-02

Pos logit contributions
ags+0.009
rob+0.008
 Whe+0.008
 luggage+0.007
 Ow+0.007
Neg logit contributions
-0.012
ups-0.011
Jud-0.011
spe-0.011
J-0.010
Token the
Token ID262
Feature activation+9.65e-02

Pos logit contributions
ags+0.004
rob+0.003
Whe+0.003
 retri+0.003
agged+0.003
Neg logit contributions
-0.002
 soon-0.002
J-0.001
spe-0.001
Jud-0.001
Token zoo
Token ID26626
Feature activation+2.70e-01

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.002
Whe+0.002
 Ow+0.002
Neg logit contributions
-0.003
ups-0.003
spe-0.003
J-0.003
Jud-0.003
Token,
Token ID11
Feature activation-1.14e-01

Pos logit contributions
rob+0.008
ags+0.007
 Whe+0.007
Whe+0.007
 luggage+0.006
Neg logit contributions
-0.008
Jud-0.007
ups-0.007
spe-0.007
J-0.007
Token they
Token ID484
Feature activation-2.99e-01

Pos logit contributions
ups+0.004
+0.004
spe+0.003
edient+0.003
Jud+0.003
Neg logit contributions
ags-0.003
 Whe-0.003
rob-0.003
 Ow-0.003
Whe-0.003
Token saw
Token ID2497
Feature activation-3.97e-01

Pos logit contributions
+0.008
ups+0.008
spe+0.007
Jud+0.006
J+0.006
Neg logit contributions
rob-0.010
ags-0.010
 Whe-0.009
Whe-0.009
 Ow-0.009
Token all
Token ID477
Feature activation-5.53e-02

Pos logit contributions
+0.014
ups+0.013
spe+0.012
J+0.012
Jud+0.012
Neg logit contributions
ags-0.011
rob-0.011
Whe-0.009
 Whe-0.009
 retri-0.009
Token kinds
Token ID6982
Feature activation-8.07e-01

Pos logit contributions
+0.002
ups+0.002
Jud+0.002
J+0.002
oh+0.002
Neg logit contributions
rob-0.001
ags-0.001
 Whe-0.001
Whe-0.001
 de-0.001
Token of
Token ID286
Feature activation-4.54e-01

Pos logit contributions
+0.023
Jud+0.021
spe+0.020
oh+0.020
ups+0.019
Neg logit contributions
ags-0.025
rob-0.024
 Whe-0.022
 de-0.022
 retri-0.021
Token animals
Token ID4695
Feature activation+6.87e-02

Pos logit contributions
+0.014
ups+0.014
 else+0.012
J+0.012
"+0.012
Neg logit contributions
ags-0.015
rob-0.014
Whe-0.013
 Whe-0.012
ses-0.012
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
rob+0.002
ags+0.002
 Whe+0.002
 freshly+0.001
 de+0.001
Neg logit contributions
ups-0.002
spe-0.002
J-0.002
Mom-0.002
Jud-0.002
Token the
Token ID262
Feature activation-6.68e-01

Pos logit contributions
+0.019
ups+0.017
spe+0.016
"+0.016
J+0.016
Neg logit contributions
rob-0.015
ags-0.014
 Whe-0.013
Whe-0.012
 Ow-0.011
Token pill
Token ID9582
Feature activation+1.20e-01

Pos logit contributions
+0.022
ups+0.019
"+0.018
Mom+0.017
 soon+0.017
Neg logit contributions
rob-0.021
ags-0.020
Whe-0.018
 Whe-0.016
 Robbie-0.016
Token and
Token ID290
Feature activation+6.05e-01

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.002
 de+0.002
 luggage+0.002
Neg logit contributions
-0.004
ups-0.004
Jud-0.004
spe-0.004
J-0.004
Token a
Token ID257
Feature activation+6.87e-03

Pos logit contributions
ags+0.020
rob+0.020
 Whe+0.017
Whe+0.017
 Ow+0.017
Neg logit contributions
-0.018
ups-0.016
spe-0.015
J-0.014
Jud-0.014
Token glass
Token ID5405
Feature activation-9.02e-01

Pos logit contributions
rob+0.000
ags+0.000
Whe+0.000
 Whe+0.000
 Ow+0.000
Neg logit contributions
-0.000
ups-0.000
spe-0.000
"-0.000
J-0.000
Token of
Token ID286
Feature activation-8.72e-01

Pos logit contributions
+0.023
Jud+0.022
spe+0.022
ups+0.022
J+0.020
Neg logit contributions
ags-0.032
rob-0.028
 de-0.027
 Whe-0.027
 retri-0.026
Token juice
Token ID13135
Feature activation-7.61e-01

Pos logit contributions
+0.026
ups+0.022
 soon+0.021
"+0.020
Mom+0.020
Neg logit contributions
rob-0.029
ags-0.027
Whe-0.026
 Whe-0.024
 retri-0.023
Token.
Token ID13
Feature activation+1.93e-01

Pos logit contributions
+0.020
ups+0.019
spe+0.018
Jud+0.018
J+0.017
Neg logit contributions
ags-0.026
rob-0.025
 Whe-0.023
Whe-0.022
 Ow-0.022
Token Lily
Token ID20037
Feature activation-5.77e-01

Pos logit contributions
rob+0.007
ags+0.006
Whe+0.006
 retri+0.005
 Whe+0.005
Neg logit contributions
-0.006
ups-0.005
"-0.005
J-0.005
Jud-0.005
Token made
Token ID925
Feature activation-2.96e-01

Pos logit contributions
+0.015
ups+0.014
"+0.012
 soon+0.012
spe+0.012
Neg logit contributions
ags-0.022
rob-0.021
 Whe-0.019
Whe-0.019
 Ow-0.018
Token a
Token ID257
Feature activation-2.02e-01

Pos logit contributions
+0.011
ups+0.010
spe+0.010
J+0.009
Jud+0.009
Neg logit contributions
rob-0.007
ags-0.007
 Whe-0.006
Whe-0.006
 Ow-0.006
Token and
Token ID290
Feature activation-5.22e-01

Pos logit contributions
+0.013
spe+0.012
ups+0.012
Jud+0.012
saw+0.011
Neg logit contributions
rob-0.009
ags-0.008
 Whe-0.008
 Ow-0.007
Whe-0.007
Token Ben
Token ID3932
Feature activation+5.72e-01

Pos logit contributions
+0.017
ups+0.016
spe+0.014
J+0.014
Jud+0.014
Neg logit contributions
rob-0.016
ags-0.016
 Whe-0.014
Whe-0.014
 Ow-0.013
Token said
Token ID531
Feature activation-8.90e-02

Pos logit contributions
ags+0.018
rob+0.018
 Whe+0.016
 Ow+0.016
Whe+0.016
Neg logit contributions
-0.016
ups-0.015
spe-0.014
Jud-0.013
saw-0.013
Token.
Token ID13
Feature activation-6.97e-01

Pos logit contributions
+0.003
spe+0.003
Jud+0.003
ups+0.003
saw+0.003
Neg logit contributions
ags-0.003
 Whe-0.002
 de-0.002
 Bend-0.002
 retri-0.002
Token "
Token ID366
Feature activation-7.91e-01

Pos logit contributions
+0.025
ups+0.023
spe+0.022
Jud+0.021
bug+0.021
Neg logit contributions
ags-0.018
rob-0.017
 Whe-0.017
 Ow-0.015
Whe-0.015
TokenMax
Token ID11518
Feature activation-4.83e-01

Pos logit contributions
+0.019
ups+0.019
spe+0.017
Jud+0.015
bug+0.015
Neg logit contributions
ags-0.028
rob-0.028
Whe-0.027
 Whe-0.027
 Ow-0.025
Token is
Token ID318
Feature activation-8.21e-01

Pos logit contributions
ups+0.014
+0.014
spe+0.014
J+0.013
Jud+0.013
Neg logit contributions
rob-0.014
ags-0.013
 Whe-0.013
 Ow-0.012
 de-0.012
Token here
Token ID994
Feature activation+1.53e-01

Pos logit contributions
spe+0.028
ups+0.027
+0.027
J+0.026
Jud+0.025
Neg logit contributions
ags-0.022
rob-0.019
 Whe-0.019
 Ow-0.018
 such-0.017
Token and
Token ID290
Feature activation-6.09e-01

Pos logit contributions
ags+0.004
 Whe+0.004
rob+0.004
 Ow+0.004
Whe+0.004
Neg logit contributions
ups-0.004
-0.004
spe-0.004
Jud-0.004
J-0.004
Token he
Token ID339
Feature activation+9.60e-01

Pos logit contributions
ups+0.017
Jud+0.017
J+0.017
+0.016
spe+0.015
Neg logit contributions
 Whe-0.018
rob-0.017
ags-0.016
 de-0.016
 Robbie-0.016
Token is
Token ID318
Feature activation-7.12e-01

Pos logit contributions
rob+0.029
ags+0.028
 Whe+0.025
Whe+0.025
 Ow+0.024
Neg logit contributions
-0.029
ups-0.027
spe-0.026
J-0.026
Jud-0.025
Token and
Token ID290
Feature activation+2.03e-01

Pos logit contributions
+0.029
ups+0.027
Jud+0.025
J+0.024
spe+0.024
Neg logit contributions
ags-0.030
rob-0.030
 Whe-0.027
Whe-0.026
 retri-0.025
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
rob+0.007
ags+0.007
 Whe+0.007
Whe+0.006
 Ow+0.006
Neg logit contributions
-0.005
ups-0.005
spe-0.004
J-0.004
Jud-0.004
Token family
Token ID1641
Feature activation-4.07e-03

Pos logit contributions
ags+0.032
rob+0.031
 Whe+0.026
Whe+0.026
 Ow+0.025
Neg logit contributions
-0.047
ups-0.043
spe-0.041
J-0.039
"-0.039
Token lived
Token ID5615
Feature activation+2.71e-02

Pos logit contributions
+0.000
ups+0.000
Jud+0.000
spe+0.000
J+0.000
Neg logit contributions
rob-0.000
ags-0.000
 Whe-0.000
Whe-0.000
 retri-0.000
Token happily
Token ID18177
Feature activation-4.03e-01

Pos logit contributions
ags+0.001
rob+0.001
 Whe+0.001
Whe+0.001
 retri+0.001
Neg logit contributions
-0.001
ups-0.001
Jud-0.001
oh-0.001
saw-0.001
Token ever
Token ID1683
Feature activation-7.97e-01

Pos logit contributions
ups+0.008
+0.008
Jud+0.008
saw+0.008
J+0.007
Neg logit contributions
rob-0.015
ags-0.015
 Whe-0.013
 retri-0.013
Whe-0.013
Token after
Token ID706
Feature activation-4.55e-02

Pos logit contributions
ups+0.026
saw+0.025
spe+0.025
+0.025
Jud+0.024
Neg logit contributions
ags-0.019
rob-0.018
 Whe-0.018
 retri-0.017
 de-0.016
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
+0.001
ups+0.001
spe+0.001
J+0.001
"+0.001
Neg logit contributions
ags-0.002
rob-0.002
 Whe-0.002
Whe-0.002
 Ow-0.002
Token<|endoftext|>
Token ID50256
Feature activation-2.82e-01

Pos logit contributions
rob+0.007
ags+0.007
Whe+0.006
 Whe+0.006
 retri+0.006
Neg logit contributions
-0.007
ups-0.006
spe-0.006
"-0.006
J-0.006
TokenOnce
Token ID7454
Feature activation-9.74e-02

Pos logit contributions
+0.009
ups+0.008
"+0.008
J+0.008
spe+0.008
Neg logit contributions
ags-0.008
rob-0.008
 Whe-0.007
Whe-0.007
 retri-0.007
Token there
Token ID612
Feature activation+1.55e-01

Pos logit contributions
ups+0.003
+0.003
Jud+0.003
spe+0.003
edient+0.003
Neg logit contributions
ags-0.002
 Whe-0.002
rob-0.002
 Ow-0.002
 de-0.002
Token?"
Token ID1701
Feature activation+1.43e-02

Pos logit contributions
+0.011
ups+0.011
spe+0.010
J+0.010
Jud+0.010
Neg logit contributions
ags-0.007
rob-0.007
 Whe-0.006
Whe-0.006
 Ow-0.006
Token he
Token ID339
Feature activation+3.18e-01

Pos logit contributions
rob+0.000
 Whe+0.000
ags+0.000
 Ow+0.000
 retri+0.000
Neg logit contributions
ups-0.000
-0.000
spe-0.000
Jud-0.000
edient-0.000
Token said
Token ID531
Feature activation+1.67e-01

Pos logit contributions
rob+0.010
ags+0.010
 Whe+0.009
Whe+0.009
 retri+0.009
Neg logit contributions
-0.009
ups-0.009
spe-0.008
Jud-0.008
J-0.007
Token.
Token ID13
Feature activation-4.23e-01

Pos logit contributions
ags+0.004
 de+0.004
 Whe+0.004
 retri+0.004
 asleep+0.004
Neg logit contributions
-0.005
ups-0.005
Jud-0.005
spe-0.005
oh-0.005
Token
Token ID198
Feature activation-3.96e-01

Pos logit contributions
+0.012
ups+0.010
"+0.010
J+0.010
spe+0.009
Neg logit contributions
rob-0.015
ags-0.015
Whe-0.013
 Whe-0.013
 Ow-0.012
Token
Token ID198
Feature activation-2.39e-01

Pos logit contributions
+0.013
ups+0.013
spe+0.012
Jud+0.011
J+0.011
Neg logit contributions
ags-0.011
rob-0.011
 Whe-0.010
Whe-0.009
 Ow-0.009
TokenL
Token ID43
Feature activation+6.84e-01

Pos logit contributions
ho+0.008
spe+0.007
oles+0.007
 erupt+0.007
 erupted+0.007
Neg logit contributions
rob-0.005
ags-0.005
Whe-0.005
 brid-0.005
aron-0.005
Tokenily
Token ID813
Feature activation-8.67e-02

Pos logit contributions
rob+0.030
 Whe+0.029
ags+0.029
Whe+0.028
 luggage+0.027
Neg logit contributions
-0.012
ups-0.011
"-0.010
J-0.010
CR-0.009
Token and
Token ID290
Feature activation+3.27e-01

Pos logit contributions
+0.003
ups+0.003
spe+0.003
Jud+0.002
J+0.002
Neg logit contributions
ags-0.002
rob-0.002
 Whe-0.002
Whe-0.002
 Ow-0.002
Token Ben
Token ID3932
Feature activation+7.10e-01

Pos logit contributions
rob+0.011
ags+0.011
 Whe+0.009
Whe+0.009
 Ow+0.009
Neg logit contributions
-0.010
ups-0.009
J-0.008
"-0.008
Jud-0.008
Token nodded
Token ID14464
Feature activation+1.56e-01

Pos logit contributions
ags+0.026
rob+0.026
 Whe+0.023
Whe+0.023
 Ow+0.022
Neg logit contributions
-0.020
ups-0.017
spe-0.015
J-0.015
"-0.015
Token let
Token ID1309
Feature activation+5.46e-01

Pos logit contributions
rob+0.016
ags+0.016
Whe+0.014
 Whe+0.014
 Ow+0.013
Neg logit contributions
-0.013
ups-0.011
"-0.010
spe-0.010
 soon-0.009
Token's
Token ID338
Feature activation-9.26e-01

Pos logit contributions
ags+0.020
rob+0.020
 Whe+0.018
Whe+0.018
 Ow+0.018
Neg logit contributions
spe-0.012
-0.012
ups-0.011
J-0.011
Jud-0.010
Token count
Token ID954
Feature activation-8.77e-01

Pos logit contributions
ups+0.029
+0.028
spe+0.027
J+0.027
Jud+0.026
Neg logit contributions
ags-0.027
rob-0.024
 Whe-0.023
Whe-0.023
 Ow-0.022
Token the
Token ID262
Feature activation-7.24e-01

Pos logit contributions
+0.027
ups+0.025
"+0.024
spe+0.022
 soon+0.022
Neg logit contributions
rob-0.028
ags-0.027
 Whe-0.025
Whe-0.025
 retri-0.023
Token apples
Token ID22514
Feature activation-9.99e-01

Pos logit contributions
+0.028
ups+0.026
spe+0.025
J+0.024
Jud+0.024
Neg logit contributions
ags-0.017
rob-0.016
 Whe-0.014
Whe-0.014
 Ow-0.013
Token together
Token ID1978
Feature activation-2.00e+00

Pos logit contributions
+0.030
ups+0.029
spe+0.027
J+0.027
Jud+0.027
Neg logit contributions
ags-0.030
rob-0.028
 Whe-0.026
Whe-0.025
 Ow-0.024
Token.
Token ID13
Feature activation+6.41e-02

Pos logit contributions
+0.057
ups+0.052
spe+0.048
J+0.046
Jud+0.046
Neg logit contributions
ags-0.066
rob-0.066
 Whe-0.059
Whe-0.059
 Ow-0.056
Token One
Token ID1881
Feature activation+1.22e-01

Pos logit contributions
ags+0.002
rob+0.002
 Whe+0.002
Whe+0.002
 Ow+0.001
Neg logit contributions
-0.002
ups-0.002
spe-0.002
J-0.002
Jud-0.002
Token,
Token ID11
Feature activation-4.15e-01

Pos logit contributions
ags+0.004
rob+0.004
 Whe+0.004
Whe+0.004
 Ow+0.003
Neg logit contributions
-0.003
ups-0.003
spe-0.003
J-0.003
Jud-0.003
Token two
Token ID734
Feature activation-2.22e-01

Pos logit contributions
+0.013
ups+0.012
spe+0.012
Jud+0.011
J+0.011
Neg logit contributions
ags-0.012
rob-0.012
 Whe-0.011
Whe-0.011
 Ow-0.010
Token,
Token ID11
Feature activation-3.47e-01

Pos logit contributions
+0.007
ups+0.006
J+0.006
spe+0.006
Jud+0.006
Neg logit contributions
ags-0.006
rob-0.006
 Whe-0.006
Whe-0.006
 Ow-0.005
Token for
Token ID329
Feature activation-3.09e-01

Pos logit contributions
rob+0.006
Whe+0.005
ags+0.005
 Whe+0.004
 Ow+0.004
Neg logit contributions
-0.003
 soon-0.003
"-0.003
ups-0.002
ful-0.002
Token the
Token ID262
Feature activation+5.13e-01

Pos logit contributions
+0.007
 soon+0.007
ups+0.006
"+0.006
J+0.006
Neg logit contributions
rob-0.013
ags-0.012
Whe-0.011
 Whe-0.011
 Ow-0.010
Token man
Token ID582
Feature activation-9.43e-01

Pos logit contributions
ags+0.020
rob+0.020
Whe+0.019
 Ow+0.018
 Whe+0.018
Neg logit contributions
-0.011
ups-0.011
J-0.009
 keeper-0.009
"-0.009
Token and
Token ID290
Feature activation-5.53e-01

Pos logit contributions
ups+0.029
+0.029
J+0.028
spe+0.027
Jud+0.027
Neg logit contributions
ags-0.024
rob-0.024
 Whe-0.022
 retri-0.021
 luggage-0.020
Token asked
Token ID1965
Feature activation+3.11e-01

Pos logit contributions
+0.014
ups+0.013
spe+0.013
J+0.013
Jud+0.012
Neg logit contributions
ags-0.017
rob-0.016
 Whe-0.016
 luggage-0.015
 Ow-0.015
Token her
Token ID607
Feature activation-1.80e+00

Pos logit contributions
rob+0.008
ags+0.008
Whe+0.007
 Whe+0.007
 Ow+0.006
Neg logit contributions
-0.012
ups-0.011
spe-0.010
"-0.010
J-0.010
Token mom
Token ID1995
Feature activation-1.07e+00

Pos logit contributions
CR+0.061
+0.061
oh+0.061
ups+0.060
Jud+0.059
Neg logit contributions
 luggage-0.038
ags-0.038
 Whe-0.036
 breakfast-0.035
 de-0.034
Tokenmy
Token ID1820
Feature activation-1.08e+00

Pos logit contributions
CR+0.034
spe+0.034
Jud+0.034
+0.033
ups+0.032
Neg logit contributions
ags-0.025
rob-0.022
 Whe-0.021
 favorites-0.021
 Ow-0.021
Token if
Token ID611
Feature activation+7.18e-01

Pos logit contributions
+0.037
ups+0.037
Jud+0.037
spe+0.036
CR+0.036
Neg logit contributions
ags-0.023
 Whe-0.021
rob-0.021
 Ow-0.019
 de-0.019
Token they
Token ID484
Feature activation-1.24e-01

Pos logit contributions
rob+0.025
ags+0.023
 Whe+0.023
 luggage+0.023
 Robbie+0.023
Neg logit contributions
-0.016
ups-0.015
spe-0.015
edient-0.014
saw-0.014
Token could
Token ID714
Feature activation+2.72e-01

Pos logit contributions
ups+0.002
+0.002
spe+0.002
Jud+0.002
J+0.002
Neg logit contributions
ags-0.005
rob-0.005
 Whe-0.004
 Ow-0.004
 luggage-0.004
Token,
Token ID11
Feature activation-1.84e-01

Pos logit contributions
spe+0.005
ups+0.005
+0.005
Jud+0.005
J+0.004
Neg logit contributions
ags-0.005
 Whe-0.004
 Ow-0.004
 de-0.004
 retri-0.004
Token but
Token ID475
Feature activation+6.68e-01

Pos logit contributions
+0.006
ups+0.005
spe+0.005
J+0.005
Jud+0.005
Neg logit contributions
ags-0.006
rob-0.006
 Whe-0.005
Whe-0.005
 Ow-0.005
Token it
Token ID340
Feature activation-3.26e-01

Pos logit contributions
 Whe+0.019
 Robbie+0.018
Whe+0.017
rob+0.017
ags+0.017
Neg logit contributions
ups-0.020
-0.018
Jud-0.017
spe-0.017
edient-0.017
Token is
Token ID318
Feature activation-8.38e-01

Pos logit contributions
+0.012
spe+0.012
J+0.012
ups+0.012
Jud+0.012
Neg logit contributions
ags-0.007
rob-0.006
 Ow-0.005
 Whe-0.005
Whe-0.005
Token too
Token ID1165
Feature activation-2.07e-01

Pos logit contributions
ups+0.028
spe+0.028
+0.028
saw+0.025
Jud+0.025
Neg logit contributions
ags-0.024
 Whe-0.019
 Ow-0.018
rob-0.018
Whe-0.018
Token late
Token ID2739
Feature activation-1.71e+00

Pos logit contributions
+0.007
spe+0.007
ups+0.007
bug+0.007
bugs+0.007
Neg logit contributions
ags-0.004
 de-0.004
 Ow-0.004
rob-0.004
 beef-0.004
Token.
Token ID13
Feature activation-2.45e-01

Pos logit contributions
+0.047
ups+0.045
spe+0.044
Jud+0.043
J+0.041
Neg logit contributions
ags-0.055
rob-0.053
 Whe-0.048
 de-0.047
 retri-0.045
Token The
Token ID383
Feature activation-7.18e-01

Pos logit contributions
+0.008
ups+0.007
"+0.007
J+0.007
Jud+0.006
Neg logit contributions
rob-0.008
ags-0.008
Whe-0.006
 retri-0.006
 luggage-0.006
Token bottle
Token ID9294
Feature activation+1.82e-01

Pos logit contributions
+0.023
ups+0.022
spe+0.021
Jud+0.020
J+0.020
Neg logit contributions
ags-0.021
rob-0.020
 Whe-0.018
 Ow-0.017
Whe-0.017
Token lands
Token ID8604
Feature activation+2.25e-01

Pos logit contributions
ags+0.005
 Ow+0.003
rob+0.003
ses+0.003
 de+0.003
Neg logit contributions
Jud-0.006
J-0.006
-0.006
spe-0.006
ups-0.006
Token in
Token ID287
Feature activation+2.54e-01

Pos logit contributions
ags+0.006
rob+0.005
 de+0.005
 Whe+0.005
 Ow+0.004
Neg logit contributions
-0.008
Jud-0.007
spe-0.007
ups-0.007
J-0.007
Token
Token ID198
Feature activation+5.54e-03

Pos logit contributions
ags+0.009
rob+0.008
Whe+0.008
 luggage+0.007
 retri+0.007
Neg logit contributions
-0.006
"-0.005
ups-0.005
J-0.005
Jud-0.004
TokenThe
Token ID464
Feature activation-5.91e-01

Pos logit contributions
rob+0.000
ags+0.000
 Whe+0.000
Whe+0.000
 Ow+0.000
Neg logit contributions
-0.000
ups-0.000
spe-0.000
bug-0.000
Jud-0.000
Token spider
Token ID19230
Feature activation+7.14e-01

Pos logit contributions
ups+0.019
+0.019
spe+0.019
Jud+0.017
J+0.016
Neg logit contributions
ags-0.016
rob-0.015
 Whe-0.015
Whe-0.015
 Ow-0.014
Token told
Token ID1297
Feature activation+3.49e-01

Pos logit contributions
rob+0.023
ags+0.022
 Whe+0.020
Whe+0.020
 retri+0.019
Neg logit contributions
-0.019
ups-0.018
spe-0.017
J-0.017
Jud-0.017
Token a
Token ID257
Feature activation-1.96e-02

Pos logit contributions
rob+0.015
ags+0.014
Whe+0.013
 Whe+0.013
 retri+0.012
Neg logit contributions
-0.007
 soon-0.006
ups-0.006
"-0.006
 keeper-0.005
Token story
Token ID1621
Feature activation-1.75e+00

Pos logit contributions
ups+0.001
+0.001
J+0.001
Jud+0.001
spe+0.001
Neg logit contributions
 Whe-0.000
rob-0.000
ags-0.000
Whe-0.000
 Ow-0.000
Token about
Token ID546
Feature activation+3.08e-01

Pos logit contributions
+0.076
ups+0.074
Jud+0.071
spe+0.070
J+0.069
Neg logit contributions
rob-0.026
ags-0.025
 Whe-0.022
 retri-0.018
Whe-0.018
Token a
Token ID257
Feature activation+7.88e-01

Pos logit contributions
rob+0.010
ags+0.010
Whe+0.009
 Whe+0.009
 Ow+0.008
Neg logit contributions
-0.009
ups-0.008
 soon-0.008
spe-0.007
"-0.007
Token fragile
Token ID21049
Feature activation+5.64e-01

Pos logit contributions
rob+0.018
 Whe+0.018
ags+0.017
 luggage+0.016
 retri+0.016
Neg logit contributions
ups-0.028
-0.026
J-0.024
CR-0.024
Jud-0.024
Token flower
Token ID15061
Feature activation+6.75e-01

Pos logit contributions
ags+0.019
rob+0.019
Whe+0.017
 Whe+0.016
 Ow+0.015
Neg logit contributions
-0.017
ups-0.015
spe-0.014
"-0.014
J-0.013
Token.
Token ID13
Feature activation+5.47e-02

Pos logit contributions
ags+0.016
rob+0.016
 Whe+0.014
 retri+0.014
 luggage+0.013
Neg logit contributions
-0.023
ups-0.022
spe-0.021
J-0.021
Jud-0.021
Token
Token ID198
Feature activation-2.96e-01

Pos logit contributions
rob+0.002
ags+0.002
 Whe+0.001
Whe+0.001
 Ow+0.001
Neg logit contributions
-0.002
ups-0.002
spe-0.002
Jud-0.001
J-0.001
TokenJack
Token ID14295
Feature activation-5.11e-01

Pos logit contributions
ups+0.010
spe+0.010
edient+0.009
ho+0.009
bug+0.009
Neg logit contributions
 Whe-0.007
ags-0.006
Whe-0.006
rob-0.006
 Ow-0.005
Token asked
Token ID1965
Feature activation-1.18e-01

Pos logit contributions
ups+0.013
saw+0.012
spe+0.012
J+0.012
+0.012
Neg logit contributions
rob-0.015
ags-0.014
 Ow-0.013
 retri-0.013
 Robbie-0.013
Token her
Token ID607
Feature activation-1.88e+00

Pos logit contributions
+0.004
ups+0.004
spe+0.004
Jud+0.004
J+0.004
Neg logit contributions
ags-0.003
rob-0.003
 Whe-0.003
Whe-0.003
 Ow-0.003
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
ups+0.066
Jud+0.065
CR+0.063
spe+0.063
edient+0.062
Neg logit contributions
 Whe-0.038
 about-0.037
 breakfast-0.037
 luggage-0.035
 favorite-0.035
Token
Token ID6184
Feature activation-2.15e+00

Pos logit contributions
+0.031
"+0.026
ups+0.025
J+0.025
 soon+0.024
Neg logit contributions
rob-0.040
ags-0.039
Whe-0.034
 Ow-0.032
 Whe-0.032
Token
Token ID95
Feature activation-8.65e-01

Pos logit contributions
+0.069
ups+0.065
spe+0.060
Jud+0.058
J+0.058
Neg logit contributions
ags-0.064
rob-0.063
 Whe-0.056
Whe-0.055
 Ow-0.052
Token
Token ID26391
Feature activation-5.69e-01

Pos logit contributions
+0.029
ups+0.026
spe+0.024
J+0.024
"+0.024
Neg logit contributions
ags-0.025
rob-0.025
 Whe-0.022
Whe-0.021
 Ow-0.021
Token
Token ID129
Feature activation-1.24e+00

Pos logit contributions
+0.016
ups+0.016
spe+0.015
saw+0.014
edient+0.014
Neg logit contributions
rob-0.017
 Whe-0.016
ags-0.015
Whe-0.015
 Ow-0.014
Token
Token ID241
Feature activation+1.06e-01

Pos logit contributions
+0.040
ups+0.036
spe+0.033
"+0.033
Jud+0.032
Neg logit contributions
rob-0.037
ags-0.037
 Whe-0.033
Whe-0.033
 Ow-0.031
TokenMom
Token ID29252
Feature activation-3.37e-01

Pos logit contributions
ags+0.003
rob+0.003
 Whe+0.003
Whe+0.003
 Ow+0.003
Neg logit contributions
-0.003
ups-0.003
spe-0.003
J-0.003
Jud-0.003