TOP ACTIVATIONS
MAX = 5.660

do to be a good doctor ?" The old
, " What is a university ?" The
you want to see another quiz ?" Tom nodded
want to try doing some math ? â The bunny
do you think the laundry weighs ?" L ily
Can I ask you some questions ?" Tom and
" Are you an important sailor ?" Jack
you help me answer this question ?" The large
Want to try ringing them ? â
" What are you doing sir ?" The man
" Can I take the quiz too ?" Sarah smiled and
you need to find a lawyer ?" The
" Do you really do maths in here ?" asked Amanda
" Who can answer my question ?". Nobody answered . He
Can I ask you a question ?" The fire
Can I try a new spell ?" Tom asked Lily .
" Do you need a tutor for your school work ?"
please find me a new tutor ?" Mom and
" Can I borrow your axe ?" the bear asked .
I asked you a few questions ?" Joe shook

MIN ACTIVATIONS
MIN = -6.319

They played until the sun started to set .
kept playing until the sun started to set . O ll
to set and the air started to freeze . They ran
time , until the sun started to come up . It
face . Soon the sun started to set and the little
. Eventually , the sun started to set and Sue could
they walked , the sun started to shine and the snow
cleared up and the sun started to shine again .
As the sun started to set , Lily 's
the park until the sun started to go down . She
they played , the sun started to get very hot .
As the sun started to set , it was
As the sun started to set , the band
and laughed until the sun started to set . Then ,
As the sun started to set , Tim my
the field until the sun started to set . It was
and forth , the sun began to set and the air
As the sun started to set , it was
even notice when the sun started to go down .
they walked , the sun started to shine . Then ,

INTERVAL 2.665 - 5.660
CONTAINS 0.323%

caught . But the fisherman shook his head and said he
but when she tried to open it , it was too
Can I have some ice cream now ?" Her mom said
" What are you do in ' ?" Benny asked .
a show for mom and dad ? We can sing and

INTERVAL -0.330 - 2.665
CONTAINS 65.093%

amy picked the grape and put it right in
playing with his toys when his mom told him it was
them . Finally , he saw the box stuck in a
It was calling her in from the cold . She ran
anymore . They want to play with their toys and books

INTERVAL -3.325 - -0.330
CONTAINS 34.333%

fish . He likes to eat fish and he likes to
Mix ing Treasure would mix all the colors , sounds and
sorry for knocking over her toy . Lily forg ave him
, Rex ," Tom said . " He likes you .
because he didn 't want to lose his favorite toy .

INTERVAL -6.319 - -3.325
CONTAINS 0.250%

day , a little girl named Sus ie was walking in
it . Inside , there are two shiny costumes . They
day , the arm chair started to applaud . Tom was
to go down and it got cold outside . Jake was
was a happy little boy named Tim . He was so
Token do
Token ID466
Feature activation+3.25e+00

Pos logit contributions
 �+0.005
pieces+0.004
 impact+0.004
 agreement+0.004
 hesitation+0.004
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.006
unny-0.005
imi-0.005
Token to
Token ID284
Feature activation+1.03e+00

Pos logit contributions
 �+0.069
pieces+0.068
 hesitation+0.067
 agreement+0.067
 impact+0.066
Neg logit contributions
uffy-0.058
ugs-0.057
ears-0.057
 stairs-0.055
loo-0.053
Token be
Token ID307
Feature activation+1.30e+00

Pos logit contributions
 �+0.019
pieces+0.019
 impact+0.018
 int+0.018
ieved+0.018
Neg logit contributions
uffy-0.021
ears-0.021
ugs-0.021
imi-0.020
 stairs-0.019
Token a
Token ID257
Feature activation+6.72e-02

Pos logit contributions
ement+0.032
 �+0.032
pieces+0.031
 int+0.031
 unison+0.030
Neg logit contributions
 cob-0.018
ears-0.018
 dirty-0.018
 toys-0.017
uffy-0.017
Token good
Token ID922
Feature activation-1.05e+00

Pos logit contributions
pieces+0.001
 �+0.001
ieved+0.001
 int+0.001
 impact+0.001
Neg logit contributions
uffy-0.001
ears-0.001
ugs-0.001
 stairs-0.001
 cob-0.001
Token doctor
Token ID6253
Feature activation+5.66e+00

Pos logit contributions
uffy+0.018
ears+0.018
ugs+0.017
imi+0.016
unny+0.016
Neg logit contributions
 �-0.024
pieces-0.023
 impact-0.023
ieved-0.023
 int-0.023
Token?"
Token ID1701
Feature activation+9.30e-01

Pos logit contributions
 unison+0.153
ement+0.150
 �+0.149
pieces+0.144
 agreement+0.144
Neg logit contributions
uffy-0.067
o-0.064
ugs-0.060
 toys-0.060
 babies-0.059
Token
Token ID198
Feature activation+6.27e-01

Pos logit contributions
 narrative+0.023
 hesitation+0.022
 impact+0.022
pieces+0.022
ieved+0.022
Neg logit contributions
 Fl-0.014
 Chick-0.013
 stairs-0.012
 Tit-0.012
Tim-0.012
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
 impact+0.015
pieces+0.015
 hesitation+0.015
 �+0.014
 agreement+0.014
Neg logit contributions
uffy-0.010
Jen-0.009
ugs-0.009
unny-0.009
ears-0.009
TokenThe
Token ID464
Feature activation+1.89e+00

Pos logit contributions
 �+0.004
pieces+0.004
 impact+0.004
 hesitation+0.004
 int+0.004
Neg logit contributions
uffy-0.004
ears-0.004
ugs-0.004
unny-0.004
imi-0.004
Token old
Token ID1468
Feature activation+5.15e-01

Pos logit contributions
pieces+0.057
ieved+0.055
 �+0.054
 unison+0.053
 int+0.051
Neg logit contributions
 stairs-0.022
 dirty-0.019
 toys-0.019
ears-0.017
 glass-0.017
Token,
Token ID11
Feature activation+1.64e+00

Pos logit contributions
ears+0.006
uffy+0.006
 Fl+0.006
 Tit+0.006
 cob+0.006
Neg logit contributions
pieces-0.009
 unison-0.009
ieved-0.009
 impact-0.009
alling-0.009
Token "
Token ID366
Feature activation+9.99e-01

Pos logit contributions
 impact+0.048
 narrative+0.047
ieved+0.047
pieces+0.047
ement+0.045
Neg logit contributions
 Chick-0.019
 Fl-0.019
 Bee-0.017
 Tit-0.017
unny-0.017
TokenWhat
Token ID2061
Feature activation+2.71e+00

Pos logit contributions
pieces+0.020
 impact+0.020
 �+0.020
 hesitation+0.020
 int+0.019
Neg logit contributions
Bee-0.019
Snow-0.019
Jen-0.019
Su-0.018
Tim-0.018
Token is
Token ID318
Feature activation-1.50e-01

Pos logit contributions
 �+0.061
pieces+0.059
 impact+0.059
 hesitation+0.057
 agreement+0.056
Neg logit contributions
uffy-0.047
ears-0.047
ugs-0.046
unny-0.043
imi-0.042
Token a
Token ID257
Feature activation+3.01e-01

Pos logit contributions
 cob+0.003
 stairs+0.003
uffy+0.003
ears+0.002
 Chick+0.002
Neg logit contributions
 agreement-0.003
 impact-0.003
ieved-0.003
pieces-0.003
 unison-0.003
Token university
Token ID6403
Feature activation+5.44e+00

Pos logit contributions
 �+0.006
pieces+0.006
ieved+0.005
 hesitation+0.005
 impact+0.005
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.006
 stairs-0.006
 cob-0.006
Token?"
Token ID1701
Feature activation+1.03e+00

Pos logit contributions
 agreement+0.139
 unison+0.137
 hesitation+0.134
pieces+0.134
 �+0.133
Neg logit contributions
 stairs-0.075
ears-0.070
loo-0.066
uffy-0.065
 toys-0.065
Token
Token ID220
Feature activation-9.36e-01

Pos logit contributions
 narrative+0.024
 impact+0.024
 unison+0.023
 forward+0.023
 hesitation+0.023
Neg logit contributions
 Fl-0.017
 Kitty-0.016
 Chick-0.016
 Wh-0.016
 Tit-0.015
Token
Token ID198
Feature activation+7.11e-01

Pos logit contributions
uffy+0.018
ears+0.018
ugs+0.017
imi+0.016
izzy+0.016
Neg logit contributions
pieces-0.019
 impact-0.019
 �-0.019
 hesitation-0.019
 agreement-0.019
Token
Token ID198
Feature activation+4.93e-01

Pos logit contributions
 impact+0.017
 hesitation+0.016
pieces+0.016
 agreement+0.016
 �+0.016
Neg logit contributions
uffy-0.012
ears-0.012
ugs-0.011
Jen-0.011
unny-0.011
TokenThe
Token ID464
Feature activation+1.50e+00

Pos logit contributions
 �+0.011
 impact+0.011
pieces+0.011
 hesitation+0.011
 agreement+0.011
Neg logit contributions
ears-0.008
uffy-0.008
ugs-0.008
unny-0.008
imi-0.008
Token you
Token ID345
Feature activation+1.41e+00

Pos logit contributions
uffy+0.012
ears+0.012
ugs+0.012
unny+0.010
imi+0.010
Neg logit contributions
 �-0.026
pieces-0.026
 impact-0.025
 int-0.025
 hesitation-0.025
Token want
Token ID765
Feature activation+1.42e+00

Pos logit contributions
ement+0.036
 �+0.035
pieces+0.034
 int+0.034
 ''+0.032
Neg logit contributions
izzy-0.021
ubs-0.019
Jen-0.018
ears-0.018
un-0.018
Token to
Token ID284
Feature activation+9.92e-01

Pos logit contributions
 �+0.029
pieces+0.028
 impact+0.027
 int+0.027
 hesitation+0.027
Neg logit contributions
uffy-0.029
ears-0.028
ugs-0.027
unny-0.026
izzy-0.025
Token see
Token ID766
Feature activation+2.46e+00

Pos logit contributions
 �+0.017
pieces+0.017
 impact+0.017
 int+0.016
 agreement+0.016
Neg logit contributions
uffy-0.022
ears-0.022
ugs-0.022
imi-0.020
unny-0.020
Token another
Token ID1194
Feature activation+1.56e+00

Pos logit contributions
 impact+0.057
 �+0.056
 agreement+0.055
 hesitation+0.055
pieces+0.054
Neg logit contributions
ears-0.040
uffy-0.039
imi-0.038
ugs-0.037
 cob-0.036
Token quiz
Token ID38964
Feature activation+5.31e+00

Pos logit contributions
 �+0.039
ieved+0.038
 impact+0.037
 hesitation+0.037
 int+0.037
Neg logit contributions
ears-0.023
 stairs-0.022
uffy-0.022
ugs-0.022
 cob-0.022
Token?"
Token ID1701
Feature activation+1.51e+00

Pos logit contributions
 impact+0.126
 hesitation+0.125
 agreement+0.124
pieces+0.124
 int+0.117
Neg logit contributions
ears-0.082
 stairs-0.081
uffy-0.080
loo-0.078
ugs-0.077
Token
Token ID198
Feature activation+7.12e-01

Pos logit contributions
 impact+0.036
pieces+0.036
ieved+0.035
 agreement+0.034
 hesitation+0.034
Neg logit contributions
ears-0.025
uffy-0.024
ugs-0.023
 stairs-0.023
Jen-0.022
Token
Token ID198
Feature activation+5.00e-01

Pos logit contributions
 �+0.012
pieces+0.011
 int+0.011
 agreement+0.011
 impact+0.011
Neg logit contributions
ears-0.016
uffy-0.016
ugs-0.016
unny-0.015
izzy-0.015
TokenTom
Token ID13787
Feature activation+8.71e-01

Pos logit contributions
 �+0.012
 then+0.011
 agreement+0.011
ieved+0.010
pieces+0.010
Neg logit contributions
uffy-0.009
ugs-0.009
ears-0.009
izzy-0.008
loo-0.008
Token nodded
Token ID14464
Feature activation+2.21e+00

Pos logit contributions
 impact+0.015
pieces+0.014
 hesitation+0.013
 �+0.013
 agreement+0.013
Neg logit contributions
ears-0.021
uffy-0.021
ugs-0.019
imi-0.019
izzy-0.019
Token want
Token ID765
Feature activation+1.42e+00

Pos logit contributions
pieces+0.030
 �+0.029
ement+0.028
ieved+0.027
 int+0.027
Neg logit contributions
izzy-0.020
ears-0.019
uffy-0.019
imi-0.019
un-0.019
Token to
Token ID284
Feature activation+5.43e-01

Pos logit contributions
 �+0.034
pieces+0.033
 impact+0.033
 int+0.032
ieved+0.032
Neg logit contributions
ears-0.022
uffy-0.022
ugs-0.022
imi-0.020
 stairs-0.020
Token try
Token ID1949
Feature activation+2.06e+00

Pos logit contributions
 �+0.010
pieces+0.009
 impact+0.009
ieved+0.009
 int+0.009
Neg logit contributions
uffy-0.011
ears-0.011
ugs-0.011
imi-0.011
unny-0.011
Token doing
Token ID1804
Feature activation+4.04e+00

Pos logit contributions
 narrative+0.054
 impact+0.053
 unison+0.051
pieces+0.051
 agreement+0.051
Neg logit contributions
imi-0.028
ears-0.027
uffy-0.026
 Chick-0.025
 Fl-0.025
Token some
Token ID617
Feature activation+1.49e+00

Pos logit contributions
 narrative+0.099
 impact+0.097
 agreement+0.095
 �+0.094
ieved+0.094
Neg logit contributions
 cob-0.054
 toys-0.054
 powder-0.051
ears-0.051
imi-0.050
Token math
Token ID10688
Feature activation+5.30e+00

Pos logit contributions
 �+0.034
pieces+0.034
ieved+0.034
 Amen+0.033
 agreement+0.032
Neg logit contributions
ears-0.025
 cob-0.024
imi-0.023
 stairs-0.023
ugs-0.022
Token?
Token ID30
Feature activation+1.92e+00

Pos logit contributions
 agreement+0.125
 impact+0.124
 hesitation+0.124
 Amen+0.118
 unison+0.118
Neg logit contributions
 stairs-0.079
ears-0.078
 toys-0.078
uffy-0.078
 powder-0.071
Tokenâ
Token ID22940
Feature activation+2.08e+00

Pos logit contributions
 impact+0.042
 hesitation+0.040
pieces+0.040
ieved+0.039
 agreement+0.039
Neg logit contributions
unny-0.034
uffy-0.033
ugs-0.033
ears-0.033
izzy-0.032
Token
Token ID26391
Feature activation+7.34e-01

Pos logit contributions
pieces+0.037
 hesitation+0.035
 int+0.033
 �+0.033
 agreement+0.032
Neg logit contributions
uffy-0.046
Jen-0.046
izzy-0.045
ugs-0.045
w-0.045
Token The
Token ID383
Feature activation+1.76e+00

Pos logit contributions
 �+0.011
pieces+0.011
 impact+0.011
 agreement+0.010
 int+0.010
Neg logit contributions
uffy-0.018
ears-0.018
ugs-0.018
imi-0.017
unny-0.017
Token bunny
Token ID44915
Feature activation-3.30e-01

Pos logit contributions
 �+0.045
pieces+0.045
ieved+0.044
 impact+0.043
 int+0.043
Neg logit contributions
uffy-0.023
 cob-0.023
imi-0.023
ears-0.022
 stairs-0.022
Token do
Token ID466
Feature activation+1.44e+00

Pos logit contributions
pieces+0.028
 �+0.027
ieved+0.027
 agreement+0.027
 impact+0.026
Neg logit contributions
uffy-0.023
ears-0.023
ugs-0.022
 stairs-0.022
imi-0.022
Token you
Token ID345
Feature activation+1.48e+00

Pos logit contributions
 �+0.041
 impact+0.040
pieces+0.040
 agreement+0.039
 int+0.039
Neg logit contributions
uffy-0.017
ears-0.016
ugs-0.016
imi-0.015
 stairs-0.014
Token think
Token ID892
Feature activation+3.08e+00

Pos logit contributions
 �+0.031
pieces+0.030
 int+0.029
ement+0.029
 impact+0.028
Neg logit contributions
uffy-0.027
ears-0.027
ugs-0.026
izzy-0.026
unny-0.025
Token the
Token ID262
Feature activation+5.91e-01

Pos logit contributions
 �+0.060
pieces+0.059
 impact+0.059
 agreement+0.058
 hesitation+0.057
Neg logit contributions
uffy-0.061
ears-0.061
ugs-0.060
imi-0.057
 stairs-0.056
Token laundry
Token ID25724
Feature activation+1.06e+00

Pos logit contributions
 �+0.014
 impact+0.013
pieces+0.013
 agreement+0.013
 int+0.013
Neg logit contributions
uffy-0.010
ears-0.010
ugs-0.010
imi-0.009
unny-0.009
Token weighs
Token ID24034
Feature activation+5.27e+00

Pos logit contributions
 �+0.025
pieces+0.025
 hesitation+0.025
 impact+0.025
 agreement+0.024
Neg logit contributions
uffy-0.017
ears-0.016
ugs-0.016
 stairs-0.015
loo-0.015
Token?"
Token ID1701
Feature activation+6.00e-01

Pos logit contributions
pieces+0.113
 �+0.111
 impact+0.111
 agreement+0.110
 int+0.108
Neg logit contributions
ears-0.091
ugs-0.091
uffy-0.090
 stairs-0.086
izzy-0.084
Token
Token ID198
Feature activation-2.74e-01

Pos logit contributions
 impact+0.015
ieved+0.015
pieces+0.015
 hesitation+0.014
 agreement+0.014
Neg logit contributions
uffy-0.009
ears-0.009
imi-0.009
ugs-0.009
izzy-0.008
Token
Token ID198
Feature activation-5.43e-01

Pos logit contributions
ears+0.006
uffy+0.006
ugs+0.006
unny+0.006
loo+0.006
Neg logit contributions
 �-0.005
pieces-0.004
 impact-0.004
 agreement-0.004
 int-0.004
TokenL
Token ID43
Feature activation-7.29e-01

Pos logit contributions
ears+0.009
ugs+0.009
uffy+0.009
loo+0.008
imi+0.008
Neg logit contributions
 �-0.013
 agreement-0.012
 then-0.012
pieces-0.012
 int-0.011
Tokenily
Token ID813
Feature activation-7.72e-01

Pos logit contributions
uffy+0.012
ugs+0.012
ears+0.011
unny+0.011
 stairs+0.011
Neg logit contributions
 �-0.017
pieces-0.016
ieved-0.016
 impact-0.016
 agreement-0.016
Token Can
Token ID1680
Feature activation-9.65e-01

Pos logit contributions
uffy+0.008
ears+0.008
ugs+0.008
unny+0.007
izzy+0.007
Neg logit contributions
 impact-0.009
 hesitation-0.009
pieces-0.009
ieved-0.009
 int-0.009
Token I
Token ID314
Feature activation+1.82e+00

Pos logit contributions
uffy+0.009
ears+0.008
imi+0.008
ugs+0.007
 stairs+0.007
Neg logit contributions
 impact-0.029
 �-0.029
pieces-0.029
 hesitation-0.029
 agreement-0.028
Token ask
Token ID1265
Feature activation+2.08e+00

Pos logit contributions
 �+0.036
pieces+0.036
 int+0.035
 hesitation+0.034
 impact+0.034
Neg logit contributions
uffy-0.036
ears-0.035
izzy-0.034
unny-0.034
ugs-0.033
Token you
Token ID345
Feature activation+2.44e+00

Pos logit contributions
 impact+0.052
 �+0.051
pieces+0.051
ieved+0.049
 hesitation+0.049
Neg logit contributions
uffy-0.028
ears-0.027
ugs-0.027
imi-0.027
izzy-0.025
Token some
Token ID617
Feature activation+1.54e+00

Pos logit contributions
 impact+0.053
pieces+0.053
ement+0.052
ieved+0.052
 �+0.050
Neg logit contributions
ears-0.041
uffy-0.041
 stairs-0.039
imi-0.039
unny-0.038
Token questions
Token ID2683
Feature activation+5.23e+00

Pos logit contributions
ieved+0.033
 �+0.033
pieces+0.033
 int+0.031
 impact+0.031
Neg logit contributions
ears-0.027
ugs-0.026
 stairs-0.026
uffy-0.026
 cob-0.025
Token?"
Token ID1701
Feature activation+1.16e+00

Pos logit contributions
pieces+0.120
 impact+0.118
 hesitation+0.114
 agreement+0.113
ieved+0.111
Neg logit contributions
ears-0.087
ugs-0.087
uffy-0.085
 stairs-0.082
loo-0.080
Token
Token ID198
Feature activation+3.25e-01

Pos logit contributions
 impact+0.028
pieces+0.027
ieved+0.027
 hesitation+0.026
 int+0.026
Neg logit contributions
uffy-0.019
ears-0.018
Jen-0.018
ugs-0.017
 stairs-0.017
Token
Token ID198
Feature activation+1.33e-02

Pos logit contributions
 �+0.006
pieces+0.006
 impact+0.006
 hesitation+0.006
 int+0.006
Neg logit contributions
uffy-0.007
ears-0.007
ugs-0.007
imi-0.006
unny-0.006
TokenTom
Token ID13787
Feature activation+7.90e-01

Pos logit contributions
 �+0.000
pieces+0.000
 agreement+0.000
 impact+0.000
ieved+0.000
Neg logit contributions
uffy-0.000
ears-0.000
ugs-0.000
imi-0.000
unny-0.000
Token and
Token ID290
Feature activation+7.16e-01

Pos logit contributions
pieces+0.016
 impact+0.015
 hesitation+0.015
 �+0.015
 int+0.014
Neg logit contributions
ears-0.016
uffy-0.016
imi-0.015
ugs-0.015
izzy-0.014
Token "
Token ID366
Feature activation+6.30e-01

Pos logit contributions
 impact+0.045
pieces+0.043
 narrative+0.041
ieved+0.041
alling+0.039
Neg logit contributions
 Fl-0.022
 Chick-0.022
uffy-0.021
ears-0.021
ugs-0.020
TokenAre
Token ID8491
Feature activation-4.05e-01

Pos logit contributions
 impact+0.015
 hesitation+0.015
pieces+0.014
ement+0.014
 int+0.014
Neg logit contributions
Tim-0.010
Jen-0.010
Fl-0.010
Snow-0.010
Su-0.010
Token you
Token ID345
Feature activation+2.15e+00

Pos logit contributions
uffy+0.004
ears+0.004
imi+0.003
ugs+0.003
izzy+0.003
Neg logit contributions
 impact-0.012
 agreement-0.012
 �-0.012
pieces-0.012
 hesitation-0.012
Token an
Token ID281
Feature activation+5.39e-01

Pos logit contributions
 �+0.050
 int+0.049
ement+0.048
 narrative+0.047
pieces+0.047
Neg logit contributions
 dirty-0.033
 toys-0.032
 babies-0.031
 wet-0.031
ugs-0.030
Token important
Token ID1593
Feature activation-1.34e-01

Pos logit contributions
 �+0.012
pieces+0.012
ieved+0.011
 hesitation+0.011
 unison+0.011
Neg logit contributions
loo-0.009
 cob-0.009
imi-0.008
 ig-0.008
ears-0.008
Token sailor
Token ID43272
Feature activation+5.21e+00

Pos logit contributions
uffy+0.002
loo+0.002
ears+0.002
 stairs+0.002
ugs+0.002
Neg logit contributions
ieved-0.003
pieces-0.003
 �-0.003
 int-0.003
 hesitation-0.003
Token?"
Token ID1701
Feature activation+5.48e-01

Pos logit contributions
ement+0.143
 int+0.142
 unison+0.140
 �+0.139
pieces+0.134
Neg logit contributions
uffy-0.069
o-0.067
 toys-0.060
 dirty-0.059
 babies-0.057
Token
Token ID220
Feature activation-1.41e+00

Pos logit contributions
 impact+0.014
 narrative+0.013
ieve+0.013
 hesitation+0.013
 unison+0.013
Neg logit contributions
izzy-0.008
 Liz-0.008
 Fl-0.008
Tim-0.008
Su-0.008
Token
Token ID198
Feature activation+4.76e-01

Pos logit contributions
uffy+0.027
ears+0.027
ugs+0.026
imi+0.025
izzy+0.024
Neg logit contributions
 impact-0.029
pieces-0.029
 �-0.029
 hesitation-0.028
 agreement-0.028
Token
Token ID198
Feature activation+3.62e-01

Pos logit contributions
 impact+0.010
pieces+0.009
 �+0.009
 hesitation+0.009
 agreement+0.009
Neg logit contributions
uffy-0.009
ears-0.009
ugs-0.009
imi-0.009
izzy-0.008
TokenJack
Token ID14295
Feature activation+1.52e+00

Pos logit contributions
 �+0.008
pieces+0.008
 impact+0.008
 agreement+0.008
 int+0.008
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.006
imi-0.005
unny-0.005
Token you
Token ID345
Feature activation+1.33e+00

Pos logit contributions
uffy+0.003
ears+0.003
imi+0.003
unny+0.003
 stairs+0.003
Neg logit contributions
pieces-0.009
 impact-0.009
 agreement-0.009
 �-0.009
 int-0.009
Token help
Token ID1037
Feature activation+1.91e-01

Pos logit contributions
 �+0.030
pieces+0.030
ement+0.030
 int+0.029
ieved+0.028
Neg logit contributions
un-0.019
 babies-0.018
unny-0.018
uffy-0.018
imi-0.018
Token me
Token ID502
Feature activation+2.09e+00

Pos logit contributions
 �+0.005
pieces+0.005
 impact+0.005
 int+0.004
 hesitation+0.004
Neg logit contributions
uffy-0.003
ears-0.003
ugs-0.003
imi-0.003
unny-0.003
Token answer
Token ID3280
Feature activation+4.21e+00

Pos logit contributions
 �+0.047
 narrative+0.045
ement+0.044
pieces+0.044
 int+0.043
Neg logit contributions
uffy-0.035
ears-0.034
imi-0.033
ugs-0.032
 cob-0.032
Token this
Token ID428
Feature activation+1.72e+00

Pos logit contributions
 impact+0.099
pieces+0.097
 agreement+0.095
 �+0.094
 narrative+0.094
Neg logit contributions
uffy-0.062
imi-0.061
ears-0.061
ugs-0.060
 cob-0.057
Token question
Token ID1808
Feature activation+5.20e+00

Pos logit contributions
pieces+0.039
ieved+0.039
 �+0.038
 impact+0.037
 agreement+0.036
Neg logit contributions
 stairs-0.028
uffy-0.027
 cob-0.027
ears-0.026
imi-0.026
Token?"
Token ID1701
Feature activation+8.88e-01

Pos logit contributions
 impact+0.107
pieces+0.106
 agreement+0.105
 hesitation+0.104
 int+0.101
Neg logit contributions
uffy-0.098
ears-0.096
ugs-0.095
imi-0.091
loo-0.088
Token
Token ID198
Feature activation+5.41e-01

Pos logit contributions
 impact+0.023
 hesitation+0.023
ieved+0.022
pieces+0.022
 int+0.022
Neg logit contributions
Jen-0.012
 Fl-0.012
uffy-0.012
Tim-0.012
 stairs-0.011
Token
Token ID198
Feature activation+3.36e-01

Pos logit contributions
 impact+0.013
pieces+0.012
 hesitation+0.012
 �+0.012
ieved+0.012
Neg logit contributions
uffy-0.009
Jen-0.009
ears-0.009
ugs-0.009
unny-0.009
TokenThe
Token ID464
Feature activation+7.71e-01

Pos logit contributions
 impact+0.007
pieces+0.007
 �+0.007
 hesitation+0.007
 int+0.007
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.006
Jen-0.005
unny-0.005
Token large
Token ID1588
Feature activation+3.31e-01

Pos logit contributions
pieces+0.022
 �+0.022
 impact+0.022
ieved+0.021
 hesitation+0.021
Neg logit contributions
 stairs-0.009
uffy-0.008
ears-0.008
ugs-0.008
imi-0.007
Token
Token ID129
Feature activation+1.62e+00

Pos logit contributions
 �+0.010
pieces+0.010
 impact+0.010
ieved+0.009
 int+0.009
Neg logit contributions
ears-0.011
uffy-0.011
ugs-0.011
imi-0.010
izzy-0.010
Token
Token ID241
Feature activation-1.13e+00

Pos logit contributions
 impact+0.027
pieces+0.026
 hesitation+0.026
 �+0.026
 agreement+0.024
Neg logit contributions
uffy-0.037
ugs-0.036
ears-0.036
imi-0.034
Jen-0.034
TokenWant
Token ID19633
Feature activation+6.84e-01

Pos logit contributions
ears+0.019
ugs+0.019
uffy+0.019
Jen+0.018
Su+0.018
Neg logit contributions
 impact-0.025
pieces-0.025
 �-0.024
 hesitation-0.024
ieved-0.024
Token to
Token ID284
Feature activation+4.25e-01

Pos logit contributions
 �+0.016
 impact+0.016
pieces+0.016
 agreement+0.015
 hesitation+0.015
Neg logit contributions
ears-0.011
uffy-0.011
ugs-0.011
imi-0.010
 stairs-0.010
Token try
Token ID1949
Feature activation+1.90e+00

Pos logit contributions
 �+0.009
ieved+0.008
pieces+0.008
 int+0.008
 impact+0.008
Neg logit contributions
ears-0.008
uffy-0.007
ugs-0.007
 stairs-0.007
imi-0.007
Token ringing
Token ID32333
Feature activation+5.19e+00

Pos logit contributions
 impact+0.043
 agreement+0.041
ieved+0.041
pieces+0.040
 narrative+0.040
Neg logit contributions
ears-0.034
uffy-0.032
ugs-0.031
imi-0.031
 stairs-0.030
Token them
Token ID606
Feature activation+1.94e+00

Pos logit contributions
 impact+0.117
 agreement+0.113
 hesitation+0.111
ieved+0.111
pieces+0.110
Neg logit contributions
ears-0.086
uffy-0.084
ugs-0.083
imi-0.081
 stairs-0.081
Token?
Token ID30
Feature activation+1.86e+00

Pos logit contributions
 impact+0.035
pieces+0.035
 �+0.034
 agreement+0.034
 hesitation+0.034
Neg logit contributions
uffy-0.041
ears-0.040
ugs-0.039
imi-0.038
 stairs-0.037
Tokenâ
Token ID22940
Feature activation+1.74e+00

Pos logit contributions
 impact+0.039
 hesitation+0.038
ieved+0.037
 int+0.037
 agreement+0.037
Neg logit contributions
ears-0.034
uffy-0.034
ugs-0.034
unny-0.033
izzy-0.033
Token
Token ID26391
Feature activation+8.21e-01

Pos logit contributions
pieces+0.030
 hesitation+0.030
 int+0.027
 impact+0.027
 �+0.027
Neg logit contributions
uffy-0.039
ugs-0.039
Jen-0.039
ears-0.038
izzy-0.038
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
 �+0.015
pieces+0.014
 impact+0.014
 int+0.014
 agreement+0.014
Neg logit contributions
uffy-0.018
ears-0.018
ugs-0.017
imi-0.016
unny-0.016
Token "
Token ID366
Feature activation+5.30e-01

Pos logit contributions
 impact+0.026
pieces+0.025
 narrative+0.025
ieved+0.024
ement+0.022
Neg logit contributions
 Fl-0.016
unny-0.015
 Chick-0.015
uffy-0.014
 stairs-0.014
TokenWhat
Token ID2061
Feature activation+1.83e+00

Pos logit contributions
 impact+0.012
 int+0.011
pieces+0.011
ement+0.010
 hesitation+0.010
Neg logit contributions
Jen-0.010
Su-0.010
Fl-0.010
Tim-0.009
ears-0.009
Token are
Token ID389
Feature activation-2.12e+00

Pos logit contributions
 �+0.035
pieces+0.033
 impact+0.033
ieved+0.031
 hesitation+0.031
Neg logit contributions
ugs-0.039
uffy-0.038
ears-0.038
unny-0.036
imi-0.035
Token you
Token ID345
Feature activation+1.67e+00

Pos logit contributions
ugs+0.037
uffy+0.037
ears+0.036
unny+0.036
imi+0.033
Neg logit contributions
 �-0.048
pieces-0.046
 then-0.045
 hesitation-0.044
 int-0.043
Token doing
Token ID1804
Feature activation+2.03e+00

Pos logit contributions
 �+0.034
pieces+0.034
 impact+0.033
 agreement+0.032
 int+0.032
Neg logit contributions
uffy-0.032
ears-0.031
ugs-0.031
imi-0.029
izzy-0.028
Token sir
Token ID15967
Feature activation+5.16e+00

Pos logit contributions
pieces+0.047
 agreement+0.045
 �+0.045
ieved+0.045
 hesitation+0.045
Neg logit contributions
uffy-0.034
ears-0.032
ugs-0.032
imi-0.031
 stairs-0.030
Token?"
Token ID1701
Feature activation+1.05e+00

Pos logit contributions
 �+0.114
 unison+0.113
 agreement+0.112
ement+0.110
 int+0.107
Neg logit contributions
ears-0.089
uffy-0.089
ugs-0.085
o-0.081
imi-0.077
Token
Token ID198
Feature activation+7.38e-01

Pos logit contributions
 impact+0.023
pieces+0.023
ieved+0.022
 hesitation+0.022
 agreement+0.022
Neg logit contributions
uffy-0.019
ears-0.018
ugs-0.018
Su-0.017
Jen-0.017
Token
Token ID198
Feature activation+2.73e-01

Pos logit contributions
 �+0.015
 impact+0.015
pieces+0.015
 hesitation+0.014
 agreement+0.014
Neg logit contributions
uffy-0.014
ears-0.014
ugs-0.014
imi-0.013
unny-0.013
TokenThe
Token ID464
Feature activation+1.54e+00

Pos logit contributions
 �+0.006
pieces+0.006
 agreement+0.005
ieved+0.005
 int+0.005
Neg logit contributions
uffy-0.005
ears-0.005
ugs-0.005
imi-0.005
loo-0.005
Token man
Token ID582
Feature activation+2.00e+00

Pos logit contributions
pieces+0.037
ieved+0.037
 �+0.036
 impact+0.036
 int+0.035
Neg logit contributions
uffy-0.024
ears-0.024
 stairs-0.023
ugs-0.022
loo-0.021
Token "
Token ID366
Feature activation+5.25e-01

Pos logit contributions
 impact+0.027
pieces+0.027
ieved+0.026
 narrative+0.024
alling+0.024
Neg logit contributions
uffy-0.018
 Fl-0.017
 Chick-0.017
Su-0.017
izzy-0.017
TokenCan
Token ID6090
Feature activation-1.87e-01

Pos logit contributions
 impact+0.009
 �+0.009
pieces+0.009
 hesitation+0.009
 int+0.009
Neg logit contributions
ears-0.012
uffy-0.011
ugs-0.011
Jen-0.011
Su-0.011
Token I
Token ID314
Feature activation+1.82e+00

Pos logit contributions
uffy+0.003
ears+0.002
ugs+0.002
imi+0.002
unny+0.002
Neg logit contributions
 �-0.005
pieces-0.005
 impact-0.005
 agreement-0.005
 hesitation-0.005
Token take
Token ID1011
Feature activation+2.42e+00

Pos logit contributions
 �+0.036
pieces+0.035
 impact+0.035
 int+0.034
 hesitation+0.034
Neg logit contributions
uffy-0.036
ears-0.036
ugs-0.035
izzy-0.034
imi-0.033
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
 �+0.054
pieces+0.053
 impact+0.052
 int+0.051
 agreement+0.051
Neg logit contributions
uffy-0.043
ears-0.042
ugs-0.041
unny-0.038
imi-0.038
Token quiz
Token ID38964
Feature activation+5.16e+00

Pos logit contributions
 �+0.019
 impact+0.019
ieved+0.018
pieces+0.018
 agreement+0.018
Neg logit contributions
ears-0.022
uffy-0.022
ugs-0.022
 stairs-0.021
imi-0.020
Token too
Token ID1165
Feature activation+1.24e+00

Pos logit contributions
 hesitation+0.122
 agreement+0.120
 impact+0.120
pieces+0.118
 unison+0.116
Neg logit contributions
ears-0.082
uffy-0.082
ugs-0.080
 stairs-0.078
loo-0.074
Token?"
Token ID1701
Feature activation+8.42e-01

Pos logit contributions
 �+0.021
pieces+0.020
 impact+0.020
 agreement+0.019
 int+0.019
Neg logit contributions
uffy-0.029
ears-0.028
ugs-0.028
imi-0.027
unny-0.026
Token Sarah
Token ID10490
Feature activation+4.90e-01

Pos logit contributions
 �+0.019
 impact+0.018
pieces+0.018
 hesitation+0.018
 agreement+0.018
Neg logit contributions
uffy-0.015
ears-0.015
ugs-0.014
imi-0.013
izzy-0.013
Token smiled
Token ID13541
Feature activation+2.47e+00

Pos logit contributions
 impact+0.011
pieces+0.011
 hesitation+0.010
 forward+0.010
 agreement+0.010
Neg logit contributions
uffy-0.009
ears-0.008
imi-0.008
izzy-0.008
 cob-0.008
Token and
Token ID290
Feature activation+1.27e+00

Pos logit contributions
pieces+0.078
 impact+0.069
ieved+0.068
alling+0.067
icult+0.066
Neg logit contributions
eddy-0.027
uffy-0.026
 dirty-0.025
ears-0.025
ies-0.024
Token you
Token ID345
Feature activation+7.14e-01

Pos logit contributions
 �+0.033
pieces+0.033
 impact+0.032
 hesitation+0.032
 agreement+0.032
Neg logit contributions
uffy-0.015
ears-0.015
ugs-0.015
imi-0.013
unny-0.013
Token need
Token ID761
Feature activation+1.39e+00

Pos logit contributions
 �+0.014
pieces+0.013
 int+0.013
 hesitation+0.013
 impact+0.013
Neg logit contributions
uffy-0.015
ears-0.014
ugs-0.014
unny-0.013
imi-0.013
Token to
Token ID284
Feature activation+1.21e+00

Pos logit contributions
 �+0.027
 hesitation+0.025
 impact+0.025
pieces+0.025
 agreement+0.024
Neg logit contributions
uffy-0.030
ears-0.029
ugs-0.029
izzy-0.027
unny-0.027
Token find
Token ID1064
Feature activation+2.08e+00

Pos logit contributions
 �+0.024
pieces+0.024
 impact+0.023
 int+0.023
ieved+0.023
Neg logit contributions
uffy-0.024
ears-0.023
ugs-0.023
imi-0.022
unny-0.022
Token a
Token ID257
Feature activation+8.72e-01

Pos logit contributions
 �+0.043
pieces+0.041
 impact+0.041
 int+0.040
ieved+0.040
Neg logit contributions
uffy-0.040
ears-0.039
ugs-0.038
imi-0.036
 stairs-0.036
Token lawyer
Token ID6853
Feature activation+5.16e+00

Pos logit contributions
 �+0.020
 impact+0.020
pieces+0.019
 hesitation+0.019
 agreement+0.019
Neg logit contributions
ears-0.015
ugs-0.015
uffy-0.014
imi-0.013
izzy-0.013
Token?"
Token ID1701
Feature activation+8.89e-01

Pos logit contributions
 agreement+0.125
 int+0.124
 �+0.124
ement+0.122
 impact+0.121
Neg logit contributions
uffy-0.082
ears-0.075
 toys-0.074
ugs-0.071
o-0.068
Token
Token ID220
Feature activation-1.14e+00

Pos logit contributions
 impact+0.017
 hesitation+0.017
ieved+0.016
pieces+0.016
 narrative+0.016
Neg logit contributions
uffy-0.018
ears-0.017
 Fl-0.017
 Chick-0.017
ugs-0.016
Token
Token ID198
Feature activation+4.15e-01

Pos logit contributions
uffy+0.023
ears+0.022
ugs+0.022
imi+0.021
unny+0.020
Neg logit contributions
 �-0.023
pieces-0.022
 impact-0.022
 hesitation-0.022
 agreement-0.022
Token
Token ID198
Feature activation+7.99e-02

Pos logit contributions
 �+0.007
pieces+0.007
 impact+0.007
 agreement+0.007
 int+0.007
Neg logit contributions
ears-0.009
uffy-0.009
ugs-0.009
loo-0.009
imi-0.009
TokenThe
Token ID464
Feature activation+1.63e+00

Pos logit contributions
 �+0.002
pieces+0.002
 agreement+0.002
ieved+0.002
 then+0.002
Neg logit contributions
ugs-0.001
ears-0.001
uffy-0.001
loo-0.001
izzy-0.001
Token"
Token ID1
Feature activation-7.16e-01

Pos logit contributions
uffy+0.006
ears+0.006
ugs+0.006
imi+0.005
unny+0.005
Neg logit contributions
 �-0.008
pieces-0.008
 impact-0.008
 agreement-0.008
 int-0.008
TokenDo
Token ID5211
Feature activation-1.34e+00

Pos logit contributions
ears+0.013
uffy+0.013
ugs+0.013
imi+0.012
izzy+0.012
Neg logit contributions
 �-0.015
 impact-0.015
pieces-0.015
 hesitation-0.015
 int-0.015
Token you
Token ID345
Feature activation+6.86e-01

Pos logit contributions
uffy+0.017
ears+0.017
ugs+0.016
unny+0.014
imi+0.014
Neg logit contributions
 �-0.037
pieces-0.036
 impact-0.036
 int-0.035
 agreement-0.035
Token really
Token ID1107
Feature activation+1.41e-01

Pos logit contributions
 �+0.014
ement+0.014
 int+0.013
pieces+0.013
ieved+0.013
Neg logit contributions
ears-0.013
izzy-0.012
ubs-0.012
ugs-0.011
 cob-0.011
Token do
Token ID466
Feature activation+2.82e+00

Pos logit contributions
 �+0.003
pieces+0.003
 impact+0.002
 int+0.002
 agreement+0.002
Neg logit contributions
ears-0.003
uffy-0.003
ugs-0.003
izzy-0.003
 stairs-0.003
Token maths
Token ID47761
Feature activation+5.15e+00

Pos logit contributions
 impact+0.063
 agreement+0.062
ieved+0.062
 narrative+0.060
 unison+0.059
Neg logit contributions
ears-0.044
 stairs-0.043
uffy-0.043
imi-0.042
 cob-0.041
Token in
Token ID287
Feature activation+3.03e-01

Pos logit contributions
 agreement+0.122
 unison+0.121
 impact+0.119
 hesitation+0.114
 �+0.113
Neg logit contributions
ears-0.080
 stairs-0.076
 toys-0.074
ugs-0.072
uffy-0.071
Token here
Token ID994
Feature activation+1.94e+00

Pos logit contributions
 �+0.007
pieces+0.006
 impact+0.006
ieved+0.006
 int+0.006
Neg logit contributions
uffy-0.005
ears-0.005
ugs-0.005
imi-0.005
 stairs-0.005
Token?"
Token ID1701
Feature activation+1.21e+00

Pos logit contributions
pieces+0.043
 �+0.041
 impact+0.041
 hesitation+0.040
 int+0.040
Neg logit contributions
uffy-0.032
ears-0.032
imi-0.030
ugs-0.030
 stairs-0.030
Token asked
Token ID1965
Feature activation-8.46e-01

Pos logit contributions
 impact+0.023
pieces+0.022
ieved+0.021
 �+0.021
 agreement+0.021
Neg logit contributions
uffy-0.025
ears-0.024
ugs-0.024
 stairs-0.024
imi-0.023
Token Amanda
Token ID23040
Feature activation-2.97e-01

Pos logit contributions
uffy+0.014
ears+0.014
ugs+0.014
imi+0.013
unny+0.013
Neg logit contributions
 �-0.019
 impact-0.019
pieces-0.019
ieved-0.018
 agreement-0.018
Token "
Token ID366
Feature activation+3.82e-01

Pos logit contributions
pieces+0.019
 impact+0.019
 �+0.018
ieved+0.018
 agreement+0.018
Neg logit contributions
ears-0.022
uffy-0.022
ugs-0.021
 stairs-0.021
imi-0.020
TokenWho
Token ID8241
Feature activation+1.83e+00

Pos logit contributions
 �+0.006
pieces+0.006
ieved+0.006
 agreement+0.006
 int+0.006
Neg logit contributions
uffy-0.009
ugs-0.009
ears-0.009
imi-0.008
unny-0.008
Token can
Token ID460
Feature activation-9.99e-01

Pos logit contributions
 �+0.033
pieces+0.033
 impact+0.033
 then+0.032
ieved+0.032
Neg logit contributions
ugs-0.038
ears-0.037
uffy-0.037
Su-0.036
imi-0.035
Token answer
Token ID3280
Feature activation+3.79e+00

Pos logit contributions
ears+0.023
uffy+0.023
ugs+0.023
imi+0.020
unny+0.020
Neg logit contributions
 �-0.018
 impact-0.017
 then-0.017
pieces-0.017
 hesitation-0.016
Token my
Token ID616
Feature activation+9.19e-01

Pos logit contributions
pieces+0.080
 agreement+0.079
 impact+0.078
 �+0.078
 hesitation+0.077
Neg logit contributions
uffy-0.063
ears-0.063
ugs-0.061
imi-0.060
loo-0.060
Token question
Token ID1808
Feature activation+5.11e+00

Pos logit contributions
 �+0.020
 impact+0.019
pieces+0.019
 hesitation+0.018
 agreement+0.018
Neg logit contributions
uffy-0.017
ears-0.017
ugs-0.017
imi-0.016
izzy-0.016
Token?".
Token ID43634
Feature activation+7.33e-01

Pos logit contributions
pieces+0.105
 impact+0.104
 agreement+0.104
 hesitation+0.103
 �+0.102
Neg logit contributions
uffy-0.095
ears-0.094
ugs-0.092
loo-0.088
unny-0.085
Token Nobody
Token ID15658
Feature activation+5.51e-01

Pos logit contributions
 hesitation+0.016
pieces+0.016
ieved+0.016
 impact+0.016
 int+0.016
Neg logit contributions
 stairs-0.012
Jen-0.012
ears-0.012
uffy-0.012
ugs-0.012
Token answered
Token ID9373
Feature activation+3.75e-01

Pos logit contributions
pieces+0.015
 impact+0.014
 unison+0.014
 agreement+0.014
 int+0.013
Neg logit contributions
 cob-0.007
izzy-0.006
uffy-0.006
loo-0.006
Jen-0.006
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 impact+0.008
pieces+0.008
ieved+0.007
 agreement+0.007
 �+0.007
Neg logit contributions
uffy-0.007
ears-0.007
ugs-0.007
imi-0.007
 cob-0.006
Token He
Token ID679
Feature activation+8.24e-01

Pos logit contributions
pieces+0.004
 impact+0.004
 hesitation+0.004
ieved+0.004
 agreement+0.004
Neg logit contributions
uffy-0.004
ears-0.004
ugs-0.004
 stairs-0.004
unny-0.004
TokenCan
Token ID6090
Feature activation-4.08e-01

Pos logit contributions
uffy+0.012
ears+0.012
ugs+0.012
unny+0.011
imi+0.011
Neg logit contributions
 �-0.012
 impact-0.012
pieces-0.012
 hesitation-0.012
ieved-0.012
Token I
Token ID314
Feature activation+1.65e+00

Pos logit contributions
uffy+0.005
ears+0.005
ugs+0.005
imi+0.005
unny+0.005
Neg logit contributions
 �-0.011
 impact-0.011
pieces-0.011
 agreement-0.011
 hesitation-0.011
Token ask
Token ID1265
Feature activation+1.74e+00

Pos logit contributions
 �+0.034
pieces+0.033
 int+0.033
 impact+0.033
 hesitation+0.032
Neg logit contributions
uffy-0.032
ears-0.031
ugs-0.030
unny-0.029
izzy-0.029
Token you
Token ID345
Feature activation+2.38e+00

Pos logit contributions
 impact+0.039
pieces+0.038
 �+0.038
 int+0.037
ieved+0.037
Neg logit contributions
uffy-0.029
ugs-0.028
ears-0.028
imi-0.027
unny-0.027
Token a
Token ID257
Feature activation+1.02e+00

Pos logit contributions
 impact+0.050
pieces+0.049
ement+0.048
ieved+0.047
 �+0.046
Neg logit contributions
uffy-0.044
ears-0.043
unny-0.042
imi-0.041
ugs-0.041
Token question
Token ID1808
Feature activation+5.11e+00

Pos logit contributions
pieces+0.024
ieved+0.024
 �+0.024
 int+0.024
 impact+0.024
Neg logit contributions
uffy-0.016
ears-0.015
ugs-0.015
 stairs-0.014
unny-0.014
Token?"
Token ID1701
Feature activation+6.07e-01

Pos logit contributions
 impact+0.115
pieces+0.110
 hesitation+0.108
 agreement+0.107
 int+0.106
Neg logit contributions
uffy-0.089
ugs-0.089
ears-0.089
imi-0.084
loo-0.083
Token
Token ID198
Feature activation-3.54e-01

Pos logit contributions
ieved+0.018
 impact+0.018
pieces+0.018
 agreement+0.017
alling+0.017
Neg logit contributions
Jen-0.008
 Jen-0.007
uffy-0.007
 Tit-0.007
 Fl-0.007
Token
Token ID198
Feature activation-4.13e-01

Pos logit contributions
uffy+0.006
Jen+0.006
ugs+0.006
unny+0.006
ears+0.006
Neg logit contributions
 impact-0.008
pieces-0.008
 hesitation-0.008
ieved-0.008
 �-0.008
TokenThe
Token ID464
Feature activation+1.14e+00

Pos logit contributions
uffy+0.006
Jen+0.006
ears+0.006
ugs+0.005
unny+0.005
Neg logit contributions
 impact-0.011
pieces-0.011
 hesitation-0.010
 int-0.010
 �-0.010
Token fire
Token ID2046
Feature activation-1.22e+00

Pos logit contributions
pieces+0.032
ieved+0.032
 �+0.032
 impact+0.031
 int+0.030
Neg logit contributions
uffy-0.013
ears-0.013
 stairs-0.013
ugs-0.012
loo-0.012
TokenCan
Token ID6090
Feature activation-1.75e-01

Pos logit contributions
Jen+0.012
ears+0.012
Su+0.011
uffy+0.011
ugs+0.011
Neg logit contributions
 impact-0.019
pieces-0.018
 �-0.018
 hesitation-0.018
 int-0.018
Token I
Token ID314
Feature activation+1.47e+00

Pos logit contributions
uffy+0.002
ears+0.002
ugs+0.002
imi+0.002
unny+0.002
Neg logit contributions
 impact-0.005
 �-0.005
pieces-0.005
 agreement-0.005
 hesitation-0.005
Token try
Token ID1949
Feature activation+1.63e+00

Pos logit contributions
 �+0.032
 int+0.032
pieces+0.031
 hesitation+0.031
 impact+0.031
Neg logit contributions
uffy-0.026
ears-0.026
ugs-0.025
izzy-0.024
unny-0.024
Token a
Token ID257
Feature activation+2.32e-01

Pos logit contributions
 agreement+0.037
 impact+0.037
 unison+0.036
 narrative+0.036
pieces+0.036
Neg logit contributions
ears-0.028
uffy-0.027
imi-0.026
ugs-0.025
 stairs-0.025
Token new
Token ID649
Feature activation-4.38e-02

Pos logit contributions
 �+0.005
pieces+0.005
ieved+0.005
 impact+0.005
 hesitation+0.005
Neg logit contributions
uffy-0.004
ears-0.004
ugs-0.003
 stairs-0.003
imi-0.003
Token spell
Token ID4822
Feature activation+5.10e+00

Pos logit contributions
 cob+0.001
 stairs+0.001
ears+0.001
 powder+0.001
uffy+0.001
Neg logit contributions
ieved-0.001
pieces-0.001
 �-0.001
 int-0.001
 forward-0.001
Token?"
Token ID1701
Feature activation+8.21e-01

Pos logit contributions
 agreement+0.121
 impact+0.118
 hesitation+0.118
 unison+0.117
pieces+0.115
Neg logit contributions
uffy-0.079
ears-0.077
ugs-0.073
loo-0.070
imi-0.070
Token Tom
Token ID4186
Feature activation+1.17e+00

Pos logit contributions
 impact+0.022
pieces+0.021
ieved+0.021
 �+0.021
 agreement+0.021
Neg logit contributions
uffy-0.011
ears-0.010
ugs-0.010
imi-0.010
izzy-0.010
Token asked
Token ID1965
Feature activation-6.44e-01

Pos logit contributions
 impact+0.025
pieces+0.024
 �+0.022
 narrative+0.022
 agreement+0.021
Neg logit contributions
ears-0.022
uffy-0.022
ugs-0.021
izzy-0.021
imi-0.021
Token Lily
Token ID20037
Feature activation-7.52e-02

Pos logit contributions
uffy+0.010
ugs+0.010
izzy+0.009
ears+0.009
 Fl+0.009
Neg logit contributions
 impact-0.016
pieces-0.015
ieved-0.015
 narrative-0.014
alling-0.014
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
ears+0.001
o+0.001
uffy+0.001
ugs+0.001
izzy+0.001
Neg logit contributions
 narrative-0.002
 impact-0.002
 unison-0.002
pieces-0.002
ieved-0.002
Token "
Token ID366
Feature activation+1.15e+00

Pos logit contributions
pieces+0.053
 impact+0.052
ieved+0.050
 narrative+0.050
 unison+0.049
Neg logit contributions
 Fl-0.026
 Chick-0.026
uffy-0.026
unny-0.025
ears-0.025
TokenDo
Token ID5211
Feature activation-8.89e-01

Pos logit contributions
 impact+0.021
pieces+0.020
 �+0.020
 int+0.020
 hesitation+0.020
Neg logit contributions
ears-0.025
uffy-0.024
Jen-0.023
Su-0.023
 stairs-0.023
Token you
Token ID345
Feature activation+1.04e+00

Pos logit contributions
ugs+0.013
ears+0.012
uffy+0.012
loo+0.011
unny+0.011
Neg logit contributions
 �-0.023
pieces-0.022
 impact-0.022
ieved-0.022
 hesitation-0.021
Token need
Token ID761
Feature activation+1.06e+00

Pos logit contributions
pieces+0.024
 �+0.024
ement+0.024
 int+0.023
 ''+0.022
Neg logit contributions
ears-0.016
izzy-0.016
 dirty-0.015
unny-0.015
uffy-0.015
Token a
Token ID257
Feature activation+1.89e-01

Pos logit contributions
 �+0.018
 impact+0.017
pieces+0.017
 hesitation+0.016
 agreement+0.016
Neg logit contributions
uffy-0.025
ears-0.025
ugs-0.024
imi-0.023
unny-0.023
Token tutor
Token ID48680
Feature activation+5.09e+00

Pos logit contributions
 �+0.004
pieces+0.004
 impact+0.004
 agreement+0.004
 int+0.004
Neg logit contributions
uffy-0.003
ears-0.003
ugs-0.003
imi-0.003
izzy-0.003
Token for
Token ID329
Feature activation+7.70e-01

Pos logit contributions
ement+0.152
 int+0.152
 agreement+0.146
 �+0.145
 narrative+0.143
Neg logit contributions
uffy-0.053
ears-0.053
 toys-0.047
ugs-0.045
o-0.045
Token your
Token ID534
Feature activation+1.87e-01

Pos logit contributions
 �+0.016
pieces+0.015
 impact+0.015
 hesitation+0.015
 agreement+0.015
Neg logit contributions
uffy-0.015
ears-0.015
ugs-0.014
unny-0.014
izzy-0.013
Token school
Token ID1524
Feature activation+3.35e+00

Pos logit contributions
 �+0.004
pieces+0.004
ieved+0.004
 int+0.004
 impact+0.004
Neg logit contributions
uffy-0.003
ears-0.003
ugs-0.003
 stairs-0.003
imi-0.003
Token work
Token ID670
Feature activation+3.11e+00

Pos logit contributions
 int+0.079
 hesitation+0.078
 �+0.077
pieces+0.077
 agreement+0.076
Neg logit contributions
ears-0.056
uffy-0.055
ugs-0.053
 stairs-0.049
 toys-0.049
Token?"
Token ID1701
Feature activation+1.15e+00

Pos logit contributions
pieces+0.070
 �+0.067
 agreement+0.066
 hesitation+0.065
 unison+0.065
Neg logit contributions
uffy-0.057
ears-0.057
ugs-0.056
 stairs-0.052
imi-0.051
Token please
Token ID3387
Feature activation+1.16e+00

Pos logit contributions
pieces+0.049
ement+0.047
 �+0.045
 int+0.044
ieved+0.044
Neg logit contributions
uffy-0.034
imi-0.033
un-0.032
izzy-0.030
 dirty-0.030
Token find
Token ID1064
Feature activation+2.56e+00

Pos logit contributions
pieces+0.024
 �+0.024
 int+0.023
 hesitation+0.023
ieved+0.023
Neg logit contributions
uffy-0.022
ears-0.021
imi-0.020
ugs-0.020
izzy-0.019
Token me
Token ID502
Feature activation+3.83e+00

Pos logit contributions
 �+0.060
 impact+0.059
pieces+0.057
 agreement+0.057
 int+0.057
Neg logit contributions
uffy-0.040
ears-0.040
ugs-0.039
imi-0.038
izzy-0.036
Token a
Token ID257
Feature activation+9.82e-01

Pos logit contributions
 �+0.084
 impact+0.081
pieces+0.081
 int+0.080
 agreement+0.080
Neg logit contributions
uffy-0.070
ears-0.068
ugs-0.064
imi-0.061
 toys-0.061
Token new
Token ID649
Feature activation+7.31e-01

Pos logit contributions
pieces+0.023
 �+0.023
ieved+0.023
 int+0.023
 impact+0.022
Neg logit contributions
uffy-0.016
ears-0.015
ugs-0.014
 stairs-0.014
 cob-0.014
Token tutor
Token ID48680
Feature activation+5.07e+00

Pos logit contributions
ieved+0.020
pieces+0.020
 �+0.019
alling+0.019
 int+0.019
Neg logit contributions
 stairs-0.010
uffy-0.009
ears-0.009
 cob-0.009
 toys-0.009
Token?"
Token ID1701
Feature activation+1.44e+00

Pos logit contributions
 int+0.132
 �+0.126
ement+0.125
 hesitation+0.125
 agreement+0.125
Neg logit contributions
uffy-0.068
ears-0.065
 toys-0.063
o-0.060
loo-0.060
Token
Token ID198
Feature activation+1.94e-01

Pos logit contributions
pieces+0.036
 impact+0.036
ieved+0.035
alling+0.033
 hesitation+0.033
Neg logit contributions
uffy-0.022
Jen-0.021
imi-0.020
 stairs-0.020
Su-0.020
Token
Token ID198
Feature activation+1.86e-02

Pos logit contributions
 �+0.004
pieces+0.004
 impact+0.004
 hesitation+0.004
 int+0.004
Neg logit contributions
uffy-0.004
ears-0.004
ugs-0.004
imi-0.004
unny-0.004
TokenMom
Token ID29252
Feature activation-6.40e-01

Pos logit contributions
 �+0.000
pieces+0.000
 agreement+0.000
 impact+0.000
 int+0.000
Neg logit contributions
ears-0.000
uffy-0.000
ugs-0.000
imi-0.000
unny-0.000
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
ears+0.013
uffy+0.012
imi+0.012
ugs+0.012
 stairs+0.011
Neg logit contributions
pieces-0.014
 impact-0.013
 hesitation-0.013
ieved-0.012
 �-0.012
Token "
Token ID366
Feature activation+5.72e-01

Pos logit contributions
 �+0.004
pieces+0.004
 impact+0.004
 hesitation+0.004
 agreement+0.004
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.005
imi-0.005
unny-0.005
TokenCan
Token ID6090
Feature activation-1.35e-02

Pos logit contributions
 �+0.012
pieces+0.012
 impact+0.012
 hesitation+0.012
 int+0.012
Neg logit contributions
ears-0.010
uffy-0.010
ugs-0.010
unny-0.010
Jen-0.010
Token I
Token ID314
Feature activation+1.40e+00

Pos logit contributions
uffy+0.000
ears+0.000
unny+0.000
ugs+0.000
imi+0.000
Neg logit contributions
 �-0.000
pieces-0.000
 impact-0.000
 agreement-0.000
 int-0.000
Token borrow
Token ID8804
Feature activation+1.82e+00

Pos logit contributions
 �+0.025
pieces+0.024
 impact+0.024
 int+0.024
 hesitation+0.023
Neg logit contributions
uffy-0.031
ears-0.030
ugs-0.029
unny-0.028
imi-0.028
Token your
Token ID534
Feature activation+3.75e-02

Pos logit contributions
 �+0.047
 impact+0.046
pieces+0.046
ieved+0.046
 hesitation+0.045
Neg logit contributions
ears-0.024
uffy-0.024
ugs-0.022
imi-0.022
 cob-0.021
Token axe
Token ID23272
Feature activation+5.06e+00

Pos logit contributions
 �+0.001
pieces+0.001
ieved+0.001
 impact+0.001
 agreement+0.001
Neg logit contributions
uffy-0.001
ears-0.001
ugs-0.001
unny-0.001
imi-0.001
Token?"
Token ID1701
Feature activation+1.14e+00

Pos logit contributions
pieces+0.115
 unison+0.114
 agreement+0.112
 �+0.111
 hesitation+0.111
Neg logit contributions
uffy-0.091
ears-0.090
ugs-0.081
loo-0.079
unny-0.077
Token the
Token ID262
Feature activation+2.92e-01

Pos logit contributions
 impact+0.024
pieces+0.024
ieved+0.023
 agreement+0.023
 �+0.023
Neg logit contributions
uffy-0.020
ears-0.020
 stairs-0.020
 Chick-0.020
Jen-0.019
Token bear
Token ID6842
Feature activation+8.20e-01

Pos logit contributions
 �+0.007
pieces+0.007
 impact+0.007
 int+0.007
ieved+0.007
Neg logit contributions
uffy-0.004
ears-0.004
ugs-0.004
imi-0.004
unny-0.004
Token asked
Token ID1965
Feature activation-9.08e-01

Pos logit contributions
 impact+0.018
pieces+0.018
 �+0.017
 agreement+0.017
 hesitation+0.017
Neg logit contributions
uffy-0.014
ears-0.014
ugs-0.013
unny-0.013
izzy-0.013
Token.
Token ID13
Feature activation-3.73e-02

Pos logit contributions
uffy+0.015
 Fl+0.014
ears+0.014
ugs+0.014
 stairs+0.014
Neg logit contributions
pieces-0.021
 impact-0.021
ieved-0.020
 int-0.019
 unison-0.019
Token I
Token ID314
Feature activation+1.81e+00

Pos logit contributions
 �+0.056
pieces+0.054
 impact+0.054
 agreement+0.053
 int+0.053
Neg logit contributions
uffy-0.026
ears-0.026
ugs-0.025
imi-0.023
unny-0.022
Token asked
Token ID1965
Feature activation+1.17e+00

Pos logit contributions
pieces+0.040
 �+0.039
 int+0.038
ieved+0.038
 impact+0.037
Neg logit contributions
uffy-0.033
ears-0.031
unny-0.030
ugs-0.030
izzy-0.030
Token you
Token ID345
Feature activation+2.29e+00

Pos logit contributions
 �+0.027
pieces+0.027
 impact+0.027
 agreement+0.026
ieved+0.026
Neg logit contributions
uffy-0.019
ears-0.019
ugs-0.018
imi-0.017
unny-0.017
Token a
Token ID257
Feature activation+9.41e-01

Pos logit contributions
pieces+0.055
ement+0.052
 impact+0.051
ieved+0.051
 agreement+0.051
Neg logit contributions
uffy-0.036
ears-0.035
unny-0.033
imi-0.033
 stairs-0.033
Token few
Token ID1178
Feature activation+1.85e+00

Pos logit contributions
 �+0.021
pieces+0.021
ieved+0.020
 impact+0.020
 int+0.020
Neg logit contributions
uffy-0.016
ears-0.016
ugs-0.015
unny-0.015
imi-0.015
Token questions
Token ID2683
Feature activation+5.05e+00

Pos logit contributions
 �+0.039
ieved+0.039
pieces+0.038
 impact+0.037
 int+0.036
Neg logit contributions
ears-0.033
ugs-0.033
 cob-0.032
 stairs-0.032
imi-0.031
Token?"
Token ID1701
Feature activation+1.87e+00

Pos logit contributions
pieces+0.115
 impact+0.111
 hesitation+0.108
 agreement+0.107
ieved+0.106
Neg logit contributions
ugs-0.085
ears-0.085
uffy-0.084
loo-0.078
imi-0.078
Token
Token ID198
Feature activation+1.37e+00

Pos logit contributions
pieces+0.048
 impact+0.047
 forward+0.046
ieved+0.046
 agreement+0.045
Neg logit contributions
Jen-0.026
Su-0.026
ears-0.025
 Tit-0.025
uffy-0.024
Token
Token ID198
Feature activation+8.30e-01

Pos logit contributions
pieces+0.029
 impact+0.028
 �+0.028
 hesitation+0.027
 agreement+0.027
Neg logit contributions
uffy-0.026
ears-0.025
ugs-0.025
Jen-0.024
imi-0.024
TokenJoe
Token ID19585
Feature activation+5.32e-01

Pos logit contributions
 �+0.021
pieces+0.020
 impact+0.020
 int+0.019
 agreement+0.019
Neg logit contributions
uffy-0.013
ears-0.013
ugs-0.013
imi-0.011
izzy-0.011
Token shook
Token ID14682
Feature activation+3.22e+00

Pos logit contributions
pieces+0.011
 impact+0.011
 �+0.011
 hesitation+0.010
 agreement+0.010
Neg logit contributions
uffy-0.010
ears-0.010
ugs-0.010
unny-0.009
imi-0.009
Token They
Token ID1119
Feature activation-8.28e-01

Pos logit contributions
 impact+0.010
pieces+0.010
 hesitation+0.010
 �+0.009
 agreement+0.009
Neg logit contributions
ears-0.007
uffy-0.007
ugs-0.006
 stairs-0.006
imi-0.006
Token played
Token ID2826
Feature activation-9.54e-01

Pos logit contributions
uffy+0.015
ears+0.015
ugs+0.015
imi+0.014
izzy+0.014
Neg logit contributions
 �-0.018
 impact-0.017
pieces-0.017
 int-0.017
 agreement-0.017
Token until
Token ID1566
Feature activation-6.94e-01

Pos logit contributions
uffy+0.014
 stairs+0.013
 cob+0.013
 toys+0.013
ears+0.013
Neg logit contributions
 hesitation-0.023
 impact-0.023
ieved-0.022
 Amen-0.022
 �-0.022
Token the
Token ID262
Feature activation+9.08e-02

Pos logit contributions
uffy+0.011
ears+0.011
 stairs+0.011
ugs+0.011
Jen+0.010
Neg logit contributions
 impact-0.016
 �-0.015
pieces-0.015
 agreement-0.015
ieved-0.015
Token sun
Token ID4252
Feature activation-1.94e+00

Pos logit contributions
 �+0.002
pieces+0.002
 impact+0.002
 agreement+0.002
 hesitation+0.002
Neg logit contributions
uffy-0.002
ears-0.002
ugs-0.002
unny-0.001
imi-0.001
Token started
Token ID2067
Feature activation-6.32e+00

Pos logit contributions
ears+0.046
ugs+0.044
uffy+0.044
imi+0.043
unny+0.043
Neg logit contributions
 �-0.033
 then-0.031
 int-0.030
pieces-0.029
 forward-0.028
Token to
Token ID284
Feature activation-3.88e+00

Pos logit contributions
uffy+0.143
ears+0.140
ugs+0.136
unny+0.136
imi+0.132
Neg logit contributions
 �-0.123
 int-0.114
 forward-0.108
 impact-0.105
 then-0.104
Token set
Token ID900
Feature activation-2.09e+00

Pos logit contributions
uffy+0.068
ears+0.067
ugs+0.066
 stairs+0.064
imi+0.063
Neg logit contributions
 �-0.083
pieces-0.082
 impact-0.080
 hesitation-0.079
ieved-0.078
Token.
Token ID13
Feature activation+2.57e-01

Pos logit contributions
uffy+0.042
ugs+0.041
ears+0.040
 stairs+0.039
imi+0.038
Neg logit contributions
pieces-0.040
 impact-0.039
 hesitation-0.039
 �-0.039
ieved-0.039
Token
Token ID220
Feature activation-1.99e+00

Pos logit contributions
 impact+0.005
pieces+0.005
 hesitation+0.005
 �+0.005
 agreement+0.005
Neg logit contributions
uffy-0.005
ears-0.005
ugs-0.005
 stairs-0.005
Su-0.005
Token
Token ID198
Feature activation-5.90e-02

Pos logit contributions
ears+0.043
uffy+0.043
ugs+0.042
imi+0.040
unny+0.040
Neg logit contributions
 �-0.036
pieces-0.034
 impact-0.033
 int-0.033
ieved-0.033
Token kept
Token ID4030
Feature activation+1.87e-01

Pos logit contributions
ears+0.018
ugs+0.018
uffy+0.018
 stairs+0.017
imi+0.017
Neg logit contributions
 �-0.014
 then-0.013
pieces-0.013
 agreement-0.013
 impact-0.012
Token playing
Token ID2712
Feature activation-2.87e-01

Pos logit contributions
ieved+0.004
 impact+0.004
 �+0.004
pieces+0.004
 narrative+0.004
Neg logit contributions
uffy-0.003
 Chick-0.003
Su-0.003
ugs-0.003
 Fl-0.002
Token until
Token ID1566
Feature activation-8.59e-01

Pos logit contributions
uffy+0.004
 cob+0.004
ears+0.004
 stairs+0.004
ugs+0.004
Neg logit contributions
 hesitation-0.007
ieved-0.007
 impact-0.006
 int-0.006
pieces-0.006
Token the
Token ID262
Feature activation+4.44e-02

Pos logit contributions
uffy+0.017
ears+0.017
ugs+0.017
imi+0.016
unny+0.016
Neg logit contributions
 �-0.017
pieces-0.016
 impact-0.016
 int-0.016
 hesitation-0.016
Token sun
Token ID4252
Feature activation-2.33e+00

Pos logit contributions
 �+0.001
 agreement+0.001
 impact+0.001
 then+0.001
pieces+0.001
Neg logit contributions
ears-0.001
ugs-0.001
uffy-0.001
izzy-0.001
unny-0.001
Token started
Token ID2067
Feature activation-6.15e+00

Pos logit contributions
ears+0.055
ugs+0.052
uffy+0.051
imi+0.050
unny+0.049
Neg logit contributions
 �-0.041
 then-0.039
 int-0.037
 forward-0.035
ement-0.035
Token to
Token ID284
Feature activation-3.67e+00

Pos logit contributions
uffy+0.123
ears+0.123
ugs+0.119
unny+0.113
izzy+0.113
Neg logit contributions
 �-0.129
 int-0.120
 impact-0.118
pieces-0.115
 hesitation-0.115
Token set
Token ID900
Feature activation-2.05e+00

Pos logit contributions
uffy+0.062
ears+0.062
ugs+0.060
 stairs+0.058
imi+0.058
Neg logit contributions
pieces-0.080
 �-0.080
 impact-0.077
 hesitation-0.076
ieved-0.075
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
uffy+0.042
ugs+0.041
ears+0.041
 stairs+0.038
imi+0.038
Neg logit contributions
pieces-0.038
 �-0.038
 impact-0.037
 hesitation-0.037
ieved-0.037
Token O
Token ID440
Feature activation-2.94e-01

Pos logit contributions
uffy+0.007
ears+0.006
ugs+0.006
imi+0.006
loo+0.006
Neg logit contributions
 �-0.006
 int-0.005
pieces-0.005
 then-0.005
 agreement-0.005
Tokenll
Token ID297
Feature activation+1.74e-01

Pos logit contributions
 stairs+0.004
 cob+0.003
 stains+0.003
ugs+0.003
loo+0.003
Neg logit contributions
ieved-0.008
ement-0.008
 �-0.007
 impact-0.007
pieces-0.007
Token to
Token ID284
Feature activation-3.59e+00

Pos logit contributions
uffy+0.094
ears+0.093
ugs+0.091
 stairs+0.084
imi+0.083
Neg logit contributions
pieces-0.139
 �-0.136
 impact-0.135
ieved-0.134
 agreement-0.133
Token set
Token ID900
Feature activation-2.24e+00

Pos logit contributions
Jen+0.052
Su+0.050
 stairs+0.050
uffy+0.050
ears+0.049
Neg logit contributions
pieces-0.089
 hesitation-0.083
 �-0.082
ieved-0.080
 impact-0.080
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
uffy+0.043
ugs+0.042
ears+0.041
 stairs+0.039
Su+0.039
Neg logit contributions
pieces-0.044
 hesitation-0.043
ieved-0.043
 impact-0.043
 �-0.042
Token the
Token ID262
Feature activation+7.62e-01

Pos logit contributions
uffy+0.021
ears+0.021
ugs+0.020
 stairs+0.020
 cob+0.019
Neg logit contributions
 �-0.027
pieces-0.027
 impact-0.027
 agreement-0.026
 hesitation-0.026
Token air
Token ID1633
Feature activation-1.96e+00

Pos logit contributions
 �+0.020
pieces+0.020
 impact+0.019
 int+0.019
ieved+0.019
Neg logit contributions
ears-0.010
uffy-0.010
ugs-0.009
 stairs-0.009
imi-0.009
Token started
Token ID2067
Feature activation-6.12e+00

Pos logit contributions
uffy+0.035
ears+0.034
ugs+0.034
imi+0.031
unny+0.031
Neg logit contributions
 �-0.043
pieces-0.042
 impact-0.041
 agreement-0.040
 int-0.040
Token to
Token ID284
Feature activation-3.64e+00

Pos logit contributions
uffy+0.104
ears+0.103
ugs+0.101
imi+0.093
 stairs+0.093
Neg logit contributions
 �-0.137
pieces-0.137
 impact-0.134
ieved-0.132
 agreement-0.132
Token freeze
Token ID16611
Feature activation-1.20e+00

Pos logit contributions
uffy+0.055
ears+0.055
 stairs+0.054
Jen+0.054
Su+0.053
Neg logit contributions
pieces-0.086
 �-0.082
 hesitation-0.081
ieved-0.079
 impact-0.079
Token.
Token ID13
Feature activation-2.88e-01

Pos logit contributions
uffy+0.027
ears+0.027
ugs+0.026
 stairs+0.025
imi+0.025
Neg logit contributions
 �-0.020
pieces-0.020
 impact-0.020
ieved-0.019
 hesitation-0.019
Token They
Token ID1119
Feature activation-7.07e-01

Pos logit contributions
uffy+0.007
ears+0.007
ugs+0.007
imi+0.006
unny+0.006
Neg logit contributions
 �-0.005
pieces-0.005
 impact-0.004
 int-0.004
 agreement-0.004
Token ran
Token ID4966
Feature activation-4.94e-01

Pos logit contributions
uffy+0.014
ears+0.014
ugs+0.014
imi+0.013
unny+0.013
Neg logit contributions
 �-0.014
pieces-0.013
 impact-0.013
 agreement-0.013
 int-0.013
Token time
Token ID640
Feature activation-4.06e-01

Pos logit contributions
uffy+0.002
ears+0.002
ugs+0.002
imi+0.002
unny+0.002
Neg logit contributions
 �-0.001
pieces-0.001
 impact-0.001
ieved-0.001
 int-0.001
Token,
Token ID11
Feature activation+8.09e-01

Pos logit contributions
uffy+0.008
ugs+0.008
ears+0.008
 stairs+0.007
 cob+0.007
Neg logit contributions
pieces-0.008
 �-0.008
ieved-0.008
 impact-0.008
 agreement-0.007
Token until
Token ID1566
Feature activation+9.36e-02

Pos logit contributions
pieces+0.015
 �+0.015
 impact+0.014
ieved+0.014
 agreement+0.014
Neg logit contributions
uffy-0.016
ears-0.016
ugs-0.016
 stairs-0.015
 cob-0.015
Token the
Token ID262
Feature activation-1.99e-01

Pos logit contributions
pieces+0.002
 impact+0.002
 �+0.002
ieved+0.002
 agreement+0.002
Neg logit contributions
 stairs-0.001
uffy-0.001
loo-0.001
ugs-0.001
Jen-0.001
Token sun
Token ID4252
Feature activation-2.32e+00

Pos logit contributions
ears+0.004
ugs+0.004
uffy+0.004
unny+0.004
izzy+0.004
Neg logit contributions
 agreement-0.004
 �-0.004
 impact-0.004
 then-0.004
 hesitation-0.004
Token started
Token ID2067
Feature activation-6.12e+00

Pos logit contributions
ears+0.054
ugs+0.051
uffy+0.050
imi+0.048
unny+0.047
Neg logit contributions
 �-0.043
 then-0.040
 int-0.039
 forward-0.037
 agreement-0.037
Token to
Token ID284
Feature activation-3.75e+00

Pos logit contributions
ears+0.138
uffy+0.137
ugs+0.133
unny+0.128
izzy+0.127
Neg logit contributions
 �-0.119
 int-0.112
 forward-0.106
 impact-0.103
 hesitation-0.102
Token come
Token ID1282
Feature activation-2.30e+00

Pos logit contributions
uffy+0.070
ears+0.070
ugs+0.068
imi+0.065
unny+0.064
Neg logit contributions
 �-0.078
pieces-0.076
 impact-0.075
 hesitation-0.073
ieved-0.073
Token up
Token ID510
Feature activation-1.50e+00

Pos logit contributions
uffy+0.053
ears+0.053
ugs+0.052
imi+0.049
unny+0.049
Neg logit contributions
 �-0.039
pieces-0.037
 impact-0.037
 agreement-0.035
 int-0.035
Token.
Token ID13
Feature activation+3.07e-01

Pos logit contributions
 stairs+0.025
uffy+0.024
ears+0.023
ugs+0.023
 dirty+0.022
Neg logit contributions
pieces-0.035
 impact-0.032
ieved-0.032
 hesitation-0.032
 agreement-0.031
Token It
Token ID632
Feature activation-1.76e+00

Pos logit contributions
pieces+0.006
 impact+0.006
 �+0.006
 hesitation+0.006
ieved+0.006
Neg logit contributions
ears-0.006
uffy-0.006
ugs-0.006
 stairs-0.006
loo-0.006
Token face
Token ID1986
Feature activation-6.07e-01

Pos logit contributions
uffy+0.000
ugs+0.000
ears+0.000
 cob+0.000
imi+0.000
Neg logit contributions
ieved-0.000
pieces-0.000
 impact-0.000
 �-0.000
 hesitation-0.000
Token.
Token ID13
Feature activation+5.00e-01

Pos logit contributions
uffy+0.013
ears+0.013
ugs+0.013
imi+0.012
izzy+0.012
Neg logit contributions
 impact-0.010
 �-0.010
pieces-0.010
ieved-0.010
 agreement-0.010
Token Soon
Token ID15894
Feature activation+1.62e-01

Pos logit contributions
 �+0.010
pieces+0.009
 impact+0.009
 int+0.009
 agreement+0.009
Neg logit contributions
ears-0.010
uffy-0.010
ugs-0.010
imi-0.009
loo-0.009
Token the
Token ID262
Feature activation+5.59e-01

Pos logit contributions
 �+0.003
 then+0.002
 hesitation+0.002
pieces+0.002
 forward+0.002
Neg logit contributions
ugs-0.004
ears-0.004
uffy-0.004
unny-0.004
loo-0.004
Token sun
Token ID4252
Feature activation-2.19e+00

Pos logit contributions
 �+0.014
pieces+0.014
 impact+0.013
ieved+0.013
 int+0.013
Neg logit contributions
uffy-0.008
ears-0.008
ugs-0.008
 stairs-0.007
imi-0.007
Token started
Token ID2067
Feature activation-6.07e+00

Pos logit contributions
ears+0.048
ugs+0.045
uffy+0.045
unny+0.043
imi+0.042
Neg logit contributions
 �-0.043
 then-0.040
 int-0.039
pieces-0.038
 agreement-0.037
Token to
Token ID284
Feature activation-3.81e+00

Pos logit contributions
ears+0.146
uffy+0.146
ugs+0.140
unny+0.138
izzy+0.136
Neg logit contributions
 �-0.111
 int-0.102
 forward-0.097
 then-0.095
 impact-0.092
Token set
Token ID900
Feature activation-2.14e+00

Pos logit contributions
uffy+0.065
ears+0.064
ugs+0.062
imi+0.060
 stairs+0.060
Neg logit contributions
pieces-0.083
 �-0.083
 impact-0.080
 hesitation-0.079
ieved-0.079
Token and
Token ID290
Feature activation-7.71e-01

Pos logit contributions
uffy+0.048
ears+0.048
ugs+0.047
imi+0.044
unny+0.044
Neg logit contributions
 �-0.038
pieces-0.036
 impact-0.036
 int-0.035
 agreement-0.035
Token the
Token ID262
Feature activation+7.41e-01

Pos logit contributions
ugs+0.018
uffy+0.018
ears+0.018
unny+0.017
imi+0.017
Neg logit contributions
 �-0.014
 then-0.012
pieces-0.012
 int-0.012
 impact-0.012
Token little
Token ID1310
Feature activation+1.53e-01

Pos logit contributions
 �+0.019
pieces+0.018
 impact+0.018
 agreement+0.018
 hesitation+0.018
Neg logit contributions
uffy-0.011
ears-0.011
ugs-0.011
imi-0.010
unny-0.010
Token.
Token ID13
Feature activation+4.64e-01

Pos logit contributions
uffy+0.010
ears+0.010
ugs+0.010
imi+0.010
unny+0.010
Neg logit contributions
 �-0.008
pieces-0.008
 impact-0.008
 int-0.008
ieved-0.008
Token Eventually
Token ID16178
Feature activation-2.28e-01

Pos logit contributions
 impact+0.010
pieces+0.010
 hesitation+0.010
 �+0.010
 int+0.010
Neg logit contributions
uffy-0.008
ears-0.008
ugs-0.008
Jen-0.007
unny-0.007
Token,
Token ID11
Feature activation-7.90e-01

Pos logit contributions
ears+0.005
uffy+0.005
ugs+0.005
unny+0.005
izzy+0.005
Neg logit contributions
 �-0.004
pieces-0.004
 impact-0.004
 hesitation-0.004
 agreement-0.003
Token the
Token ID262
Feature activation+4.80e-01

Pos logit contributions
uffy+0.012
ears+0.012
ugs+0.012
 stairs+0.011
Jen+0.011
Neg logit contributions
 �-0.019
 impact-0.018
pieces-0.018
 agreement-0.018
 int-0.018
Token sun
Token ID4252
Feature activation-1.99e+00

Pos logit contributions
 �+0.012
pieces+0.012
 impact+0.012
ieved+0.011
 int+0.011
Neg logit contributions
uffy-0.007
ears-0.007
ugs-0.007
imi-0.006
unny-0.006
Token started
Token ID2067
Feature activation-6.05e+00

Pos logit contributions
ears+0.047
ugs+0.045
uffy+0.043
imi+0.041
izzy+0.041
Neg logit contributions
 �-0.036
 then-0.034
 int-0.032
 forward-0.031
 agreement-0.031
Token to
Token ID284
Feature activation-3.38e+00

Pos logit contributions
ears+0.146
uffy+0.145
ugs+0.142
izzy+0.139
unny+0.137
Neg logit contributions
 �-0.110
 int-0.100
 forward-0.096
 then-0.092
 hesitation-0.091
Token set
Token ID900
Feature activation-2.04e+00

Pos logit contributions
uffy+0.059
ears+0.059
ugs+0.057
imi+0.055
 stairs+0.055
Neg logit contributions
pieces-0.072
 �-0.072
 impact-0.070
ieved-0.069
 int-0.068
Token and
Token ID290
Feature activation-6.74e-01

Pos logit contributions
uffy+0.045
ears+0.044
ugs+0.044
imi+0.041
unny+0.041
Neg logit contributions
 �-0.037
pieces-0.035
 impact-0.035
 agreement-0.034
 hesitation-0.034
Token Sue
Token ID26089
Feature activation-1.05e+00

Pos logit contributions
uffy+0.011
ears+0.011
ugs+0.011
imi+0.010
unny+0.010
Neg logit contributions
 �-0.016
pieces-0.015
 impact-0.015
 agreement-0.015
 hesitation-0.015
Token could
Token ID714
Feature activation-2.53e+00

Pos logit contributions
ugs+0.024
ears+0.023
uffy+0.022
loo+0.022
imi+0.021
Neg logit contributions
 �-0.019
 then-0.018
ieved-0.017
pieces-0.017
 int-0.017
Token they
Token ID484
Feature activation-5.02e-01

Pos logit contributions
ears+0.020
ugs+0.020
uffy+0.018
unny+0.018
loo+0.017
Neg logit contributions
 �-0.012
 then-0.012
 hesitation-0.010
 forward-0.009
 int-0.009
Token walked
Token ID6807
Feature activation+1.74e+00

Pos logit contributions
uffy+0.011
ears+0.011
ugs+0.011
imi+0.010
unny+0.010
Neg logit contributions
 �-0.009
pieces-0.009
 hesitation-0.008
 agreement-0.008
 int-0.008
Token,
Token ID11
Feature activation-6.70e-01

Pos logit contributions
pieces+0.036
 impact+0.035
ieved+0.034
 �+0.034
 agreement+0.033
Neg logit contributions
uffy-0.034
ears-0.034
ugs-0.032
unny-0.032
 stairs-0.032
Token the
Token ID262
Feature activation+9.49e-01

Pos logit contributions
uffy+0.010
ears+0.010
ugs+0.010
imi+0.009
 stairs+0.009
Neg logit contributions
 �-0.016
 impact-0.016
pieces-0.016
 agreement-0.016
ieved-0.016
Token sun
Token ID4252
Feature activation-2.41e+00

Pos logit contributions
 agreement+0.020
 �+0.019
 impact+0.019
 hesitation+0.019
 then+0.019
Neg logit contributions
ears-0.019
ugs-0.019
uffy-0.019
izzy-0.017
unny-0.016
Token started
Token ID2067
Feature activation-6.01e+00

Pos logit contributions
ears+0.058
ugs+0.055
uffy+0.054
imi+0.051
izzy+0.051
Neg logit contributions
 �-0.043
 then-0.040
 int-0.040
 forward-0.037
 agreement-0.037
Token to
Token ID284
Feature activation-3.31e+00

Pos logit contributions
uffy+0.130
ears+0.129
ugs+0.125
izzy+0.120
imi+0.119
Neg logit contributions
 �-0.120
 int-0.114
 forward-0.107
 hesitation-0.105
 impact-0.104
Token shine
Token ID18340
Feature activation-5.44e-01

Pos logit contributions
uffy+0.051
ears+0.050
Jen+0.049
 stairs+0.049
imi+0.048
Neg logit contributions
pieces-0.078
 �-0.074
 impact-0.072
ieved-0.072
 hesitation-0.072
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
uffy+0.012
ears+0.012
ugs+0.011
imi+0.011
izzy+0.011
Neg logit contributions
 �-0.010
pieces-0.010
 impact-0.009
 hesitation-0.009
 agreement-0.009
Token the
Token ID262
Feature activation+4.93e-01

Pos logit contributions
uffy+0.019
ears+0.019
ugs+0.018
 stairs+0.018
imi+0.017
Neg logit contributions
 �-0.021
pieces-0.020
 impact-0.020
ieved-0.020
 agreement-0.020
Token snow
Token ID6729
Feature activation-2.75e+00

Pos logit contributions
 agreement+0.010
 then+0.009
 impact+0.009
 hesitation+0.009
 �+0.009
Neg logit contributions
ugs-0.011
ears-0.011
uffy-0.010
izzy-0.010
uts-0.009
Token cleared
Token ID12539
Feature activation-4.57e-01

Pos logit contributions
uffy+0.009
ears+0.009
ugs+0.009
imi+0.008
unny+0.008
Neg logit contributions
 �-0.011
pieces-0.010
 impact-0.010
 agreement-0.010
 int-0.010
Token up
Token ID510
Feature activation-8.95e-01

Pos logit contributions
ears+0.008
 stairs+0.008
uffy+0.008
 cob+0.008
ugs+0.008
Neg logit contributions
pieces-0.010
 hesitation-0.009
 impact-0.009
ieved-0.009
 agreement-0.009
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
uffy+0.019
ears+0.019
ugs+0.019
 stairs+0.018
imi+0.017
Neg logit contributions
pieces-0.016
 �-0.016
 impact-0.016
 agreement-0.015
ieved-0.015
Token the
Token ID262
Feature activation+2.16e-01

Pos logit contributions
uffy+0.022
ears+0.022
 stairs+0.021
ugs+0.021
 cob+0.021
Neg logit contributions
pieces-0.021
 impact-0.021
 �-0.021
 agreement-0.021
 hesitation-0.021
Token sun
Token ID4252
Feature activation-2.48e+00

Pos logit contributions
 �+0.005
pieces+0.005
 impact+0.005
ieved+0.005
 int+0.005
Neg logit contributions
uffy-0.003
ears-0.003
ugs-0.003
 stairs-0.003
imi-0.003
Token started
Token ID2067
Feature activation-6.00e+00

Pos logit contributions
ears+0.052
uffy+0.050
ugs+0.049
unny+0.047
imi+0.047
Neg logit contributions
 �-0.050
 int-0.046
pieces-0.046
 then-0.045
 impact-0.045
Token to
Token ID284
Feature activation-3.74e+00

Pos logit contributions
uffy+0.126
ears+0.125
ugs+0.121
unny+0.117
imi+0.116
Neg logit contributions
 �-0.123
 int-0.115
 impact-0.111
pieces-0.108
 hesitation-0.108
Token shine
Token ID18340
Feature activation-5.28e-01

Pos logit contributions
uffy+0.056
ears+0.055
ugs+0.054
 stairs+0.054
 cob+0.052
Neg logit contributions
pieces-0.088
 �-0.087
 hesitation-0.084
ieved-0.084
 impact-0.084
Token again
Token ID757
Feature activation-1.00e-01

Pos logit contributions
uffy+0.012
ears+0.012
ugs+0.012
 stairs+0.011
 cob+0.011
Neg logit contributions
 �-0.009
pieces-0.009
 impact-0.009
 hesitation-0.009
ieved-0.008
Token.
Token ID13
Feature activation-4.75e-02

Pos logit contributions
uffy+0.002
ugs+0.002
ears+0.002
 stairs+0.002
imi+0.002
Neg logit contributions
 �-0.002
pieces-0.002
 impact-0.002
ieved-0.002
 hesitation-0.002
Token
Token ID198
Feature activation-4.16e-01

Pos logit contributions
uffy+0.001
ears+0.001
ugs+0.001
imi+0.001
unny+0.001
Neg logit contributions
 �-0.001
pieces-0.001
 impact-0.001
 int-0.001
 agreement-0.001
Token
Token ID198
Feature activation-1.74e-01

Pos logit contributions
uffy+0.002
ears+0.002
ugs+0.002
imi+0.002
unny+0.002
Neg logit contributions
 �-0.002
pieces-0.002
 impact-0.002
 agreement-0.002
 int-0.002
Token
Token ID198
Feature activation-7.66e-01

Pos logit contributions
ears+0.005
uffy+0.004
ugs+0.004
unny+0.004
imi+0.004
Neg logit contributions
 �-0.003
pieces-0.002
 int-0.002
 agreement-0.002
 impact-0.002
TokenAs
Token ID1722
Feature activation-1.01e+00

Pos logit contributions
ears+0.017
ugs+0.017
uffy+0.017
loo+0.016
imi+0.015
Neg logit contributions
 �-0.015
pieces-0.014
ieved-0.014
 agreement-0.013
 int-0.013
Token the
Token ID262
Feature activation-3.76e-01

Pos logit contributions
ears+0.023
ugs+0.023
uffy+0.022
unny+0.021
loo+0.021
Neg logit contributions
 �-0.019
 then-0.018
pieces-0.017
 hesitation-0.016
 impact-0.016
Token sun
Token ID4252
Feature activation-1.84e+00

Pos logit contributions
ears+0.007
uffy+0.007
ugs+0.007
unny+0.006
izzy+0.006
Neg logit contributions
 �-0.008
 impact-0.008
 agreement-0.008
pieces-0.008
 hesitation-0.008
Token started
Token ID2067
Feature activation-6.00e+00

Pos logit contributions
ears+0.043
ugs+0.041
uffy+0.040
imi+0.039
unny+0.038
Neg logit contributions
 �-0.032
 then-0.031
 int-0.029
 forward-0.028
pieces-0.028
Token to
Token ID284
Feature activation-3.33e+00

Pos logit contributions
ears+0.151
uffy+0.148
ugs+0.143
unny+0.142
izzy+0.142
Neg logit contributions
 �-0.101
 int-0.090
 then-0.089
 forward-0.085
 hesitation-0.081
Token set
Token ID900
Feature activation-1.20e+00

Pos logit contributions
uffy+0.053
ears+0.052
 stairs+0.050
Jen+0.050
ugs+0.049
Neg logit contributions
pieces-0.078
 �-0.075
 impact-0.073
 hesitation-0.072
ieved-0.072
Token,
Token ID11
Feature activation-9.03e-01

Pos logit contributions
uffy+0.025
ears+0.025
ugs+0.025
imi+0.023
unny+0.023
Neg logit contributions
 �-0.022
pieces-0.022
 impact-0.021
ieved-0.021
 agreement-0.021
Token Lily
Token ID20037
Feature activation-8.70e-01

Pos logit contributions
ears+0.012
uffy+0.012
ugs+0.012
imi+0.011
unny+0.011
Neg logit contributions
 �-0.024
pieces-0.023
 impact-0.023
 agreement-0.023
 int-0.023
Token's
Token ID338
Feature activation+8.59e-01

Pos logit contributions
uffy+0.019
ears+0.019
ugs+0.019
imi+0.018
unny+0.018
Neg logit contributions
 �-0.015
pieces-0.015
 impact-0.015
 agreement-0.014
 int-0.014
Token the
Token ID262
Feature activation+4.81e-01

Pos logit contributions
uffy+0.007
ears+0.006
 stairs+0.006
 cob+0.006
ugs+0.006
Neg logit contributions
pieces-0.007
ieved-0.006
 impact-0.006
 �-0.006
 agreement-0.006
Token park
Token ID3952
Feature activation-3.30e-01

Pos logit contributions
 �+0.011
 impact+0.011
pieces+0.010
 agreement+0.010
 hesitation+0.010
Neg logit contributions
ugs-0.009
uffy-0.008
ears-0.008
izzy-0.008
imi-0.008
Token until
Token ID1566
Feature activation-3.76e-01

Pos logit contributions
uffy+0.007
ears+0.007
ugs+0.007
 stairs+0.006
 cob+0.006
Neg logit contributions
 impact-0.006
pieces-0.006
ieved-0.006
 �-0.006
 hesitation-0.006
Token the
Token ID262
Feature activation+4.54e-01

Pos logit contributions
uffy+0.005
ears+0.004
 stairs+0.004
ugs+0.004
Jen+0.004
Neg logit contributions
 impact-0.010
pieces-0.010
 �-0.010
 agreement-0.010
ieved-0.010
Token sun
Token ID4252
Feature activation-2.32e+00

Pos logit contributions
 �+0.010
 impact+0.010
pieces+0.010
 agreement+0.010
 hesitation+0.010
Neg logit contributions
uffy-0.008
ugs-0.008
ears-0.008
unny-0.007
izzy-0.007
Token started
Token ID2067
Feature activation-5.95e+00

Pos logit contributions
ears+0.049
uffy+0.048
ugs+0.048
imi+0.045
unny+0.045
Neg logit contributions
 �-0.046
 int-0.042
 then-0.041
pieces-0.041
 agreement-0.040
Token to
Token ID284
Feature activation-3.77e+00

Pos logit contributions
uffy+0.121
ears+0.119
ugs+0.117
unny+0.112
izzy+0.110
Neg logit contributions
 �-0.124
 int-0.115
 impact-0.112
 hesitation-0.111
 forward-0.110
Token go
Token ID467
Feature activation-2.92e+00

Pos logit contributions
uffy+0.056
ears+0.056
Jen+0.055
 stairs+0.055
Su+0.055
Neg logit contributions
pieces-0.088
 �-0.084
 hesitation-0.081
 impact-0.081
ieved-0.080
Token down
Token ID866
Feature activation-2.44e+00

Pos logit contributions
uffy+0.053
ears+0.053
 stairs+0.051
ugs+0.050
Su+0.048
Neg logit contributions
pieces-0.063
 impact-0.060
 hesitation-0.060
 agreement-0.057
 unison-0.057
Token.
Token ID13
Feature activation+6.12e-01

Pos logit contributions
uffy+0.054
ears+0.054
ugs+0.053
 stairs+0.050
imi+0.049
Neg logit contributions
 �-0.042
pieces-0.042
 impact-0.041
ieved-0.040
 hesitation-0.039
Token She
Token ID1375
Feature activation-4.62e-01

Pos logit contributions
pieces+0.015
 impact+0.015
 �+0.015
 hesitation+0.014
 agreement+0.014
Neg logit contributions
ears-0.009
uffy-0.009
ugs-0.009
imi-0.008
unny-0.008
Token they
Token ID484
Feature activation-6.04e-01

Pos logit contributions
ears+0.014
ugs+0.014
uffy+0.013
unny+0.013
loo+0.013
Neg logit contributions
 then-0.009
 �-0.009
 forward-0.007
pieces-0.007
 End-0.007
Token played
Token ID2826
Feature activation+6.23e-02

Pos logit contributions
uffy+0.011
ears+0.011
ugs+0.011
imi+0.010
unny+0.010
Neg logit contributions
 �-0.013
pieces-0.013
 agreement-0.013
 impact-0.013
 hesitation-0.013
Token,
Token ID11
Feature activation-6.72e-01

Pos logit contributions
 �+0.001
pieces+0.001
 impact+0.001
ieved+0.001
 int+0.001
Neg logit contributions
uffy-0.001
ears-0.001
ugs-0.001
unny-0.001
imi-0.001
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
ears+0.012
ugs+0.012
uffy+0.011
imi+0.011
unny+0.011
Neg logit contributions
 �-0.016
pieces-0.015
 then-0.015
 agreement-0.014
 int-0.014
Token sun
Token ID4252
Feature activation-2.31e+00

Pos logit contributions
 �+0.024
 then+0.024
 agreement+0.023
 hesitation+0.023
 impact+0.022
Neg logit contributions
ugs-0.033
ears-0.032
uffy-0.030
uts-0.028
Su-0.028
Token started
Token ID2067
Feature activation-5.94e+00

Pos logit contributions
ears+0.059
ugs+0.056
uffy+0.056
unny+0.053
imi+0.052
Neg logit contributions
 �-0.038
 then-0.037
 int-0.035
pieces-0.031
 forward-0.031
Token to
Token ID284
Feature activation-3.12e+00

Pos logit contributions
uffy+0.141
ears+0.138
ugs+0.133
imi+0.128
unny+0.128
Neg logit contributions
 �-0.111
 int-0.103
 forward-0.099
 then-0.095
 impact-0.090
Token get
Token ID651
Feature activation-3.42e+00

Pos logit contributions
uffy+0.053
ears+0.053
ugs+0.051
imi+0.049
 stairs+0.049
Neg logit contributions
 �-0.069
pieces-0.069
 impact-0.066
ieved-0.065
 hesitation-0.065
Token very
Token ID845
Feature activation-2.09e+00

Pos logit contributions
uffy+0.055
ears+0.055
ugs+0.054
imi+0.050
unny+0.049
Neg logit contributions
 �-0.080
pieces-0.079
 impact-0.078
 agreement-0.076
 hesitation-0.076
Token hot
Token ID3024
Feature activation-1.36e+00

Pos logit contributions
ugs+0.038
uffy+0.038
ears+0.036
unny+0.034
imi+0.034
Neg logit contributions
 int-0.048
 �-0.047
 then-0.045
 forward-0.045
 agreement-0.043
Token.
Token ID13
Feature activation-1.69e-01

Pos logit contributions
ears+0.030
uffy+0.030
ugs+0.029
imi+0.028
 stairs+0.027
Neg logit contributions
 �-0.024
pieces-0.024
 impact-0.023
 agreement-0.023
ieved-0.022
Token
Token ID198
Feature activation-5.10e-01

Pos logit contributions
ears+0.053
uffy+0.052
ugs+0.052
 cob+0.051
imi+0.050
Neg logit contributions
 �-0.037
pieces-0.033
ieved-0.032
 int-0.032
 impact-0.031
Token
Token ID198
Feature activation-6.93e-01

Pos logit contributions
ears+0.012
uffy+0.012
ugs+0.012
unny+0.011
imi+0.011
Neg logit contributions
 �-0.009
pieces-0.008
 int-0.008
 agreement-0.007
ieved-0.007
TokenAs
Token ID1722
Feature activation-7.92e-01

Pos logit contributions
uffy+0.015
ears+0.015
ugs+0.015
unny+0.014
imi+0.014
Neg logit contributions
 �-0.014
pieces-0.013
ieved-0.012
 int-0.012
 agreement-0.012
Token the
Token ID262
Feature activation-1.76e-01

Pos logit contributions
ears+0.017
ugs+0.017
uffy+0.016
unny+0.016
izzy+0.015
Neg logit contributions
 �-0.016
 then-0.015
pieces-0.014
 impact-0.014
 hesitation-0.014
Token sun
Token ID4252
Feature activation-1.83e+00

Pos logit contributions
unny+0.004
ears+0.004
uffy+0.004
ubs+0.004
ugs+0.004
Neg logit contributions
 then-0.003
 End-0.003
 �-0.003
 hesitation-0.003
 agreement-0.003
Token started
Token ID2067
Feature activation-5.93e+00

Pos logit contributions
ears+0.046
ugs+0.043
uffy+0.043
imi+0.043
ubs+0.042
Neg logit contributions
 �-0.028
 then-0.028
 int-0.025
 forward-0.024
ement-0.023
Token to
Token ID284
Feature activation-3.54e+00

Pos logit contributions
ears+0.154
uffy+0.153
izzy+0.146
unny+0.145
ugs+0.144
Neg logit contributions
 �-0.098
 int-0.089
 then-0.083
 forward-0.083
 hesitation-0.075
Token set
Token ID900
Feature activation-1.50e+00

Pos logit contributions
uffy+0.061
ears+0.061
ugs+0.059
 stairs+0.057
imi+0.057
Neg logit contributions
pieces-0.078
 �-0.076
 impact-0.074
ieved-0.072
 hesitation-0.072
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
ears+0.035
uffy+0.035
ugs+0.035
imi+0.033
izzy+0.033
Neg logit contributions
 �-0.025
 impact-0.023
pieces-0.023
 int-0.023
 agreement-0.022
Token it
Token ID340
Feature activation-2.23e+00

Pos logit contributions
ears+0.024
uffy+0.024
ugs+0.023
unny+0.022
imi+0.021
Neg logit contributions
 �-0.039
pieces-0.037
 then-0.036
 impact-0.036
 int-0.036
Token was
Token ID373
Feature activation-1.30e+00

Pos logit contributions
ears+0.047
ugs+0.047
uffy+0.046
unny+0.043
imi+0.043
Neg logit contributions
 �-0.043
 int-0.041
pieces-0.040
 then-0.039
 impact-0.039
Token
Token ID198
Feature activation-1.67e-01

Pos logit contributions
 impact+0.003
 �+0.003
pieces+0.003
 hesitation+0.003
 agreement+0.003
Neg logit contributions
uffy-0.003
ears-0.003
ugs-0.003
imi-0.003
unny-0.003
Token
Token ID198
Feature activation-5.44e-01

Pos logit contributions
ears+0.004
uffy+0.004
ugs+0.004
imi+0.004
loo+0.004
Neg logit contributions
 �-0.003
pieces-0.003
 int-0.003
 agreement-0.003
ieved-0.003
TokenAs
Token ID1722
Feature activation-6.04e-01

Pos logit contributions
ears+0.012
uffy+0.012
ugs+0.012
loo+0.011
imi+0.011
Neg logit contributions
 �-0.010
pieces-0.009
ieved-0.009
 agreement-0.009
 then-0.009
Token the
Token ID262
Feature activation-9.44e-02

Pos logit contributions
ears+0.014
ugs+0.013
uffy+0.013
unny+0.013
imi+0.012
Neg logit contributions
 �-0.011
pieces-0.010
 then-0.010
 hesitation-0.010
 impact-0.010
Token sun
Token ID4252
Feature activation-1.60e+00

Pos logit contributions
uffy+0.002
ears+0.002
ugs+0.002
unny+0.001
izzy+0.001
Neg logit contributions
pieces-0.002
 �-0.002
 agreement-0.002
 impact-0.002
 hesitation-0.002
Token started
Token ID2067
Feature activation-5.93e+00

Pos logit contributions
ears+0.036
uffy+0.035
ugs+0.035
imi+0.034
unny+0.033
Neg logit contributions
 �-0.029
 then-0.027
 int-0.026
pieces-0.026
ieved-0.025
Token to
Token ID284
Feature activation-3.43e+00

Pos logit contributions
uffy+0.131
ears+0.129
ugs+0.126
imi+0.122
unny+0.121
Neg logit contributions
 �-0.117
 int-0.105
 then-0.102
 forward-0.101
 hesitation-0.100
Token set
Token ID900
Feature activation-1.19e+00

Pos logit contributions
uffy+0.050
Jen+0.050
 stairs+0.050
ears+0.050
Su+0.048
Neg logit contributions
pieces-0.083
 �-0.077
 hesitation-0.076
 impact-0.076
ieved-0.075
Token,
Token ID11
Feature activation-3.41e-01

Pos logit contributions
uffy+0.023
ears+0.022
ugs+0.022
 stairs+0.021
imi+0.021
Neg logit contributions
pieces-0.024
 impact-0.023
 �-0.023
ieved-0.023
 hesitation-0.023
Token the
Token ID262
Feature activation+1.76e+00

Pos logit contributions
ears+0.007
uffy+0.007
ugs+0.007
imi+0.006
unny+0.006
Neg logit contributions
 �-0.007
pieces-0.006
 impact-0.006
 hesitation-0.006
 agreement-0.006
Token band
Token ID4097
Feature activation-8.84e-01

Pos logit contributions
 �+0.047
pieces+0.046
 impact+0.046
ieved+0.045
 int+0.045
Neg logit contributions
uffy-0.023
ears-0.023
ugs-0.022
 stairs-0.021
imi-0.020
Token and
Token ID290
Feature activation+5.46e-01

Pos logit contributions
pieces+0.034
 impact+0.033
 hesitation+0.031
ieved+0.031
 narrative+0.031
Neg logit contributions
uffy-0.019
 Fl-0.019
ears-0.019
 cob-0.018
ugs-0.018
Token laughed
Token ID13818
Feature activation+9.98e-01

Pos logit contributions
 �+0.011
pieces+0.011
 impact+0.011
 agreement+0.010
 int+0.010
Neg logit contributions
uffy-0.010
ears-0.010
ugs-0.010
imi-0.010
izzy-0.009
Token until
Token ID1566
Feature activation-8.87e-02

Pos logit contributions
pieces+0.022
 impact+0.022
 narrative+0.021
ieved+0.021
 hesitation+0.021
Neg logit contributions
 Fl-0.016
uffy-0.015
 Chick-0.015
 cob-0.015
ugs-0.015
Token the
Token ID262
Feature activation+6.82e-01

Pos logit contributions
uffy+0.002
ears+0.002
ugs+0.002
 stairs+0.001
loo+0.001
Neg logit contributions
pieces-0.002
 �-0.002
 impact-0.002
ieved-0.002
 agreement-0.002
Token sun
Token ID4252
Feature activation-1.94e+00

Pos logit contributions
 �+0.018
pieces+0.018
 impact+0.017
ieved+0.017
 int+0.017
Neg logit contributions
ears-0.009
uffy-0.009
ugs-0.008
 stairs-0.008
imi-0.008
Token started
Token ID2067
Feature activation-5.93e+00

Pos logit contributions
ears+0.041
uffy+0.041
ugs+0.040
imi+0.038
unny+0.037
Neg logit contributions
 �-0.037
 int-0.035
pieces-0.034
 then-0.034
 impact-0.034
Token to
Token ID284
Feature activation-3.40e+00

Pos logit contributions
uffy+0.117
ears+0.115
ugs+0.113
imi+0.106
unny+0.106
Neg logit contributions
 �-0.127
 int-0.119
 impact-0.117
 hesitation-0.116
pieces-0.115
Token set
Token ID900
Feature activation-1.86e+00

Pos logit contributions
uffy+0.048
ears+0.048
Jen+0.047
 stairs+0.046
imi+0.046
Neg logit contributions
pieces-0.083
 �-0.078
ieved-0.077
 impact-0.076
 hesitation-0.076
Token.
Token ID13
Feature activation+4.63e-01

Pos logit contributions
uffy+0.034
ugs+0.033
ears+0.032
 stairs+0.032
 cob+0.031
Neg logit contributions
pieces-0.038
ieved-0.037
 impact-0.036
 hesitation-0.036
 �-0.035
Token Then
Token ID3244
Feature activation+2.07e-01

Pos logit contributions
pieces+0.009
 impact+0.008
 hesitation+0.008
 agreement+0.008
 �+0.008
Neg logit contributions
uffy-0.010
ears-0.010
ugs-0.010
unny-0.009
 stairs-0.009
Token,
Token ID11
Feature activation+3.22e-01

Pos logit contributions
 �+0.004
pieces+0.004
 impact+0.004
 agreement+0.004
ieved+0.004
Neg logit contributions
uffy-0.004
ears-0.004
ugs-0.004
imi-0.004
unny-0.004
Token
Token ID198
Feature activation-3.76e-01

Pos logit contributions
uffy+0.004
ears+0.004
ugs+0.004
izzy+0.003
imi+0.003
Neg logit contributions
 impact-0.005
 hesitation-0.005
pieces-0.005
 �-0.005
 agreement-0.005
Token
Token ID198
Feature activation-1.03e+00

Pos logit contributions
ears+0.008
uffy+0.008
ugs+0.008
imi+0.008
unny+0.008
Neg logit contributions
 �-0.007
pieces-0.006
 impact-0.006
 int-0.006
ieved-0.006
TokenAs
Token ID1722
Feature activation-9.53e-01

Pos logit contributions
uffy+0.024
ears+0.023
ugs+0.023
imi+0.022
loo+0.021
Neg logit contributions
 �-0.020
ieved-0.018
pieces-0.018
 then-0.017
 int-0.017
Token the
Token ID262
Feature activation-4.87e-01

Pos logit contributions
ears+0.019
ugs+0.019
uffy+0.019
unny+0.018
izzy+0.017
Neg logit contributions
 �-0.020
 then-0.019
pieces-0.018
 impact-0.018
 hesitation-0.018
Token sun
Token ID4252
Feature activation-1.66e+00

Pos logit contributions
ears+0.011
uffy+0.011
ugs+0.011
unny+0.011
ubs+0.011
Neg logit contributions
 then-0.008
 End-0.008
 hesitation-0.008
 �-0.008
 agreement-0.008
Token started
Token ID2067
Feature activation-5.91e+00

Pos logit contributions
ears+0.044
ugs+0.042
uffy+0.042
imi+0.040
unny+0.039
Neg logit contributions
 �-0.024
 then-0.024
 int-0.022
 forward-0.020
pieces-0.019
Token to
Token ID284
Feature activation-3.21e+00

Pos logit contributions
uffy+0.159
ears+0.157
unny+0.152
ugs+0.151
imi+0.149
Neg logit contributions
 �-0.093
 int-0.081
 then-0.078
 forward-0.075
 hesitation-0.069
Token set
Token ID900
Feature activation-1.33e+00

Pos logit contributions
uffy+0.058
ears+0.058
ugs+0.056
imi+0.053
 stairs+0.053
Neg logit contributions
 �-0.068
pieces-0.067
 impact-0.066
ieved-0.064
 hesitation-0.064
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
ears+0.030
uffy+0.030
ugs+0.029
imi+0.028
unny+0.028
Neg logit contributions
 �-0.023
pieces-0.022
 impact-0.022
 int-0.021
 agreement-0.021
Token Tim
Token ID5045
Feature activation-6.07e-02

Pos logit contributions
ears+0.017
uffy+0.017
ugs+0.017
unny+0.015
imi+0.015
Neg logit contributions
 �-0.029
pieces-0.028
 then-0.027
 impact-0.027
 int-0.027
Tokenmy
Token ID1820
Feature activation-1.11e+00

Pos logit contributions
ugs+0.001
ears+0.001
uffy+0.001
loo+0.001
 stairs+0.001
Neg logit contributions
 �-0.001
 then-0.001
ieved-0.001
 agreement-0.001
pieces-0.001
Token the
Token ID262
Feature activation-3.34e-01

Pos logit contributions
uffy+0.013
ears+0.013
ugs+0.013
 stairs+0.012
imi+0.012
Neg logit contributions
 �-0.013
 impact-0.012
pieces-0.012
 int-0.012
ieved-0.012
Token field
Token ID2214
Feature activation-1.10e+00

Pos logit contributions
uffy+0.006
ears+0.006
ugs+0.006
imi+0.006
izzy+0.006
Neg logit contributions
 �-0.007
pieces-0.007
 impact-0.007
 agreement-0.006
 int-0.006
Token until
Token ID1566
Feature activation-5.92e-01

Pos logit contributions
uffy+0.023
ears+0.023
ugs+0.023
 stairs+0.021
imi+0.021
Neg logit contributions
 �-0.020
pieces-0.019
 impact-0.019
ieved-0.019
 hesitation-0.019
Token the
Token ID262
Feature activation+5.64e-01

Pos logit contributions
uffy+0.009
ears+0.009
 stairs+0.009
ugs+0.008
imi+0.008
Neg logit contributions
 impact-0.014
 �-0.014
pieces-0.014
 agreement-0.013
 int-0.013
Token sun
Token ID4252
Feature activation-1.86e+00

Pos logit contributions
 �+0.014
pieces+0.014
 impact+0.013
ieved+0.013
 int+0.013
Neg logit contributions
uffy-0.009
ears-0.009
ugs-0.008
 stairs-0.008
imi-0.008
Token started
Token ID2067
Feature activation-5.90e+00

Pos logit contributions
ears+0.041
ugs+0.039
uffy+0.039
imi+0.038
unny+0.038
Neg logit contributions
 �-0.035
 then-0.032
 int-0.031
pieces-0.031
 forward-0.030
Token to
Token ID284
Feature activation-3.50e+00

Pos logit contributions
uffy+0.115
ears+0.113
ugs+0.111
unny+0.106
imi+0.105
Neg logit contributions
 �-0.126
 int-0.117
 impact-0.116
pieces-0.115
 hesitation-0.113
Token set
Token ID900
Feature activation-2.06e+00

Pos logit contributions
ears+0.052
uffy+0.052
 stairs+0.052
Jen+0.051
Su+0.050
Neg logit contributions
pieces-0.082
 �-0.079
 hesitation-0.078
 impact-0.077
ieved-0.076
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
uffy+0.040
ugs+0.040
ears+0.039
 stairs+0.038
Su+0.037
Neg logit contributions
pieces-0.040
 hesitation-0.040
 impact-0.039
 �-0.039
ieved-0.039
Token It
Token ID632
Feature activation-1.59e+00

Pos logit contributions
ears+0.005
uffy+0.005
ugs+0.005
imi+0.004
unny+0.004
Neg logit contributions
 �-0.004
pieces-0.004
 impact-0.004
 int-0.004
 agreement-0.004
Token was
Token ID373
Feature activation-4.94e-01

Pos logit contributions
ears+0.027
ugs+0.027
uffy+0.027
unny+0.024
imi+0.024
Neg logit contributions
 �-0.037
pieces-0.035
 impact-0.035
ieved-0.035
 int-0.035
Token and
Token ID290
Feature activation+1.18e-01

Pos logit contributions
uffy+0.019
ears+0.019
ugs+0.018
imi+0.018
unny+0.018
Neg logit contributions
 �-0.015
pieces-0.013
 int-0.013
 impact-0.013
 hesitation-0.013
Token forth
Token ID6071
Feature activation+1.47e+00

Pos logit contributions
 impact+0.002
pieces+0.002
 �+0.001
 int+0.001
 hesitation+0.001
Neg logit contributions
uffy-0.003
ears-0.003
 stairs-0.003
ugs-0.003
izzy-0.003
Token,
Token ID11
Feature activation-6.64e-01

Pos logit contributions
 �+0.027
pieces+0.026
 impact+0.026
ieved+0.025
 agreement+0.025
Neg logit contributions
uffy-0.031
ears-0.031
ugs-0.030
imi-0.029
unny-0.029
Token the
Token ID262
Feature activation+8.17e-01

Pos logit contributions
ears+0.010
uffy+0.010
ugs+0.010
imi+0.009
unny+0.009
Neg logit contributions
 �-0.016
pieces-0.016
 impact-0.016
 int-0.015
 agreement-0.015
Token sun
Token ID4252
Feature activation-2.39e+00

Pos logit contributions
 �+0.022
pieces+0.022
 int+0.021
 impact+0.021
ieved+0.021
Neg logit contributions
ears-0.011
uffy-0.010
 stairs-0.010
ugs-0.010
imi-0.009
Token began
Token ID2540
Feature activation-5.89e+00

Pos logit contributions
ears+0.052
ugs+0.050
uffy+0.048
unny+0.047
imi+0.046
Neg logit contributions
 �-0.047
 then-0.044
 int-0.043
 impact-0.042
 forward-0.041
Token to
Token ID284
Feature activation-3.59e+00

Pos logit contributions
uffy+0.094
ears+0.093
ugs+0.091
 stairs+0.084
imi+0.083
Neg logit contributions
pieces-0.139
 �-0.136
 impact-0.135
ieved-0.134
 agreement-0.133
Token set
Token ID900
Feature activation-2.24e+00

Pos logit contributions
Jen+0.052
Su+0.050
 stairs+0.050
uffy+0.050
ears+0.049
Neg logit contributions
pieces-0.089
 hesitation-0.083
 �-0.082
ieved-0.080
 impact-0.080
Token and
Token ID290
Feature activation-1.24e+00

Pos logit contributions
uffy+0.043
ugs+0.042
ears+0.041
 stairs+0.039
Su+0.039
Neg logit contributions
pieces-0.044
 hesitation-0.043
ieved-0.043
 impact-0.043
 �-0.042
Token the
Token ID262
Feature activation+7.62e-01

Pos logit contributions
uffy+0.021
ears+0.021
ugs+0.020
 stairs+0.020
 cob+0.019
Neg logit contributions
 �-0.027
pieces-0.027
 impact-0.027
 agreement-0.026
 hesitation-0.026
Token air
Token ID1633
Feature activation-1.96e+00

Pos logit contributions
 �+0.020
pieces+0.020
 impact+0.019
 int+0.019
ieved+0.019
Neg logit contributions
ears-0.010
uffy-0.010
ugs-0.009
 stairs-0.009
imi-0.009
Token
Token ID198
Feature activation+2.05e-01

Pos logit contributions
 �+0.007
 impact+0.007
pieces+0.007
 hesitation+0.007
ieved+0.006
Neg logit contributions
uffy-0.010
ears-0.010
ugs-0.010
imi-0.009
izzy-0.009
Token
Token ID198
Feature activation-3.61e-01

Pos logit contributions
 �+0.003
pieces+0.003
 int+0.003
 impact+0.003
 agreement+0.003
Neg logit contributions
ears-0.005
uffy-0.005
ugs-0.005
unny-0.005
imi-0.005
TokenAs
Token ID1722
Feature activation-7.25e-01

Pos logit contributions
uffy+0.007
ears+0.007
ugs+0.007
imi+0.006
unny+0.006
Neg logit contributions
 �-0.008
pieces-0.007
 int-0.007
ieved-0.007
 agreement-0.007
Token the
Token ID262
Feature activation-3.95e-01

Pos logit contributions
ears+0.016
ugs+0.015
uffy+0.015
unny+0.014
imi+0.014
Neg logit contributions
 �-0.014
pieces-0.013
 impact-0.013
 then-0.013
 int-0.013
Token sun
Token ID4252
Feature activation-1.81e+00

Pos logit contributions
uffy+0.007
ears+0.007
ugs+0.007
unny+0.007
izzy+0.007
Neg logit contributions
 �-0.008
 impact-0.008
 agreement-0.008
pieces-0.008
 hesitation-0.008
Token started
Token ID2067
Feature activation-5.89e+00

Pos logit contributions
ears+0.041
uffy+0.040
ugs+0.039
imi+0.038
unny+0.037
Neg logit contributions
 �-0.032
 then-0.030
 int-0.029
pieces-0.029
 impact-0.028
Token to
Token ID284
Feature activation-3.39e+00

Pos logit contributions
uffy+0.144
ears+0.142
unny+0.136
ugs+0.136
izzy+0.135
Neg logit contributions
 �-0.104
 int-0.095
 then-0.090
 forward-0.088
 impact-0.084
Token set
Token ID900
Feature activation-1.27e+00

Pos logit contributions
uffy+0.052
ears+0.052
 stairs+0.049
Jen+0.049
ugs+0.049
Neg logit contributions
pieces-0.080
 �-0.077
 hesitation-0.074
 impact-0.074
ieved-0.073
Token,
Token ID11
Feature activation-8.39e-01

Pos logit contributions
uffy+0.026
ears+0.025
ugs+0.025
imi+0.024
 stairs+0.023
Neg logit contributions
pieces-0.024
 �-0.024
 impact-0.024
ieved-0.023
 agreement-0.023
Token it
Token ID340
Feature activation-2.10e+00

Pos logit contributions
uffy+0.014
ears+0.014
ugs+0.014
imi+0.013
izzy+0.013
Neg logit contributions
 �-0.019
pieces-0.018
 impact-0.018
 agreement-0.018
 hesitation-0.018
Token was
Token ID373
Feature activation-1.13e+00

Pos logit contributions
uffy+0.040
ears+0.039
ugs+0.039
imi+0.036
unny+0.036
Neg logit contributions
 �-0.044
pieces-0.043
 impact-0.042
 hesitation-0.041
 agreement-0.041
Token even
Token ID772
Feature activation+4.19e-01

Pos logit contributions
ears+0.004
uffy+0.004
ugs+0.004
izzy+0.004
unny+0.003
Neg logit contributions
 �-0.003
ement-0.003
ieved-0.003
pieces-0.003
 int-0.003
Token notice
Token ID4003
Feature activation+5.29e-01

Pos logit contributions
ement+0.010
 �+0.009
ieved+0.009
 ''+0.009
pieces+0.009
Neg logit contributions
 spill-0.007
 cob-0.007
 dirty-0.006
 stains-0.006
 stairs-0.006
Token when
Token ID618
Feature activation-4.45e-01

Pos logit contributions
 �+0.010
pieces+0.009
 impact+0.009
 agreement+0.009
ieved+0.009
Neg logit contributions
uffy-0.011
ears-0.011
ugs-0.011
imi-0.010
izzy-0.010
Token the
Token ID262
Feature activation+4.65e-02

Pos logit contributions
ears+0.007
uffy+0.007
ugs+0.007
unny+0.007
imi+0.007
Neg logit contributions
 �-0.011
pieces-0.010
 hesitation-0.010
 impact-0.010
 int-0.010
Token sun
Token ID4252
Feature activation-1.54e+00

Pos logit contributions
 �+0.001
 impact+0.001
 End+0.001
 hesitation+0.001
 then+0.001
Neg logit contributions
ugs-0.001
ears-0.001
uffy-0.001
unny-0.001
uts-0.001
Token started
Token ID2067
Feature activation-5.89e+00

Pos logit contributions
ears+0.034
ugs+0.033
uffy+0.033
unny+0.031
imi+0.030
Neg logit contributions
 �-0.029
 int-0.027
pieces-0.026
 then-0.026
 impact-0.025
Token to
Token ID284
Feature activation-3.78e+00

Pos logit contributions
uffy+0.124
ears+0.122
ugs+0.120
unny+0.117
izzy+0.114
Neg logit contributions
 �-0.120
 int-0.111
 forward-0.105
 impact-0.105
 hesitation-0.105
Token go
Token ID467
Feature activation-2.68e+00

Pos logit contributions
uffy+0.062
ears+0.062
ugs+0.060
 stairs+0.060
imi+0.059
Neg logit contributions
pieces-0.084
 �-0.081
 impact-0.079
 hesitation-0.078
ieved-0.078
Token down
Token ID866
Feature activation-2.04e+00

Pos logit contributions
uffy+0.052
ears+0.051
 stairs+0.050
ugs+0.049
imi+0.047
Neg logit contributions
pieces-0.055
 impact-0.052
 hesitation-0.052
ieved-0.051
 agreement-0.051
Token.
Token ID13
Feature activation+8.99e-01

Pos logit contributions
uffy+0.047
ears+0.046
ugs+0.046
imi+0.043
unny+0.043
Neg logit contributions
 �-0.035
pieces-0.033
 impact-0.033
 int-0.032
 agreement-0.032
Token
Token ID220
Feature activation-1.60e+00

Pos logit contributions
pieces+0.018
 impact+0.018
 hesitation+0.017
 �+0.017
ieved+0.017
Neg logit contributions
uffy-0.018
ears-0.017
ugs-0.017
izzy-0.016
imi-0.016
Token they
Token ID484
Feature activation-5.87e-01

Pos logit contributions
ears+0.010
uffy+0.010
ugs+0.010
imi+0.009
unny+0.009
Neg logit contributions
 �-0.009
pieces-0.008
 hesitation-0.008
 int-0.008
 then-0.008
Token walked
Token ID6807
Feature activation+1.69e+00

Pos logit contributions
uffy+0.012
ears+0.012
ugs+0.011
imi+0.011
unny+0.011
Neg logit contributions
 �-0.012
pieces-0.011
 impact-0.011
 agreement-0.011
 int-0.011
Token,
Token ID11
Feature activation-5.43e-01

Pos logit contributions
 impact+0.035
pieces+0.035
ieved+0.033
 agreement+0.032
 unison+0.032
Neg logit contributions
uffy-0.032
ears-0.032
 stairs-0.031
ugs-0.031
unny-0.030
Token the
Token ID262
Feature activation+5.87e-01

Pos logit contributions
ears+0.012
uffy+0.011
ugs+0.011
imi+0.011
unny+0.010
Neg logit contributions
 �-0.010
pieces-0.010
 impact-0.010
 hesitation-0.009
 int-0.009
Token sun
Token ID4252
Feature activation-2.26e+00

Pos logit contributions
 �+0.013
pieces+0.013
 impact+0.013
 agreement+0.013
 hesitation+0.012
Neg logit contributions
ears-0.011
ugs-0.011
uffy-0.010
unny-0.009
imi-0.009
Token started
Token ID2067
Feature activation-5.86e+00

Pos logit contributions
ears+0.050
uffy+0.049
ugs+0.048
imi+0.046
izzy+0.045
Neg logit contributions
 �-0.043
 int-0.039
pieces-0.039
 then-0.038
 impact-0.037
Token to
Token ID284
Feature activation-2.99e+00

Pos logit contributions
uffy+0.127
ears+0.125
ugs+0.122
imi+0.116
izzy+0.116
Neg logit contributions
 �-0.120
 int-0.111
 forward-0.104
 hesitation-0.104
 impact-0.103
Token shine
Token ID18340
Feature activation-3.40e-01

Pos logit contributions
uffy+0.046
ears+0.045
 stairs+0.044
Jen+0.044
 cob+0.043
Neg logit contributions
pieces-0.069
 �-0.067
 impact-0.066
ieved-0.065
 hesitation-0.064
Token.
Token ID13
Feature activation+4.59e-02

Pos logit contributions
uffy+0.007
ears+0.007
ugs+0.007
imi+0.007
 stairs+0.007
Neg logit contributions
pieces-0.006
 �-0.006
 impact-0.006
 hesitation-0.006
ieved-0.006
Token Then
Token ID3244
Feature activation-3.63e-03

Pos logit contributions
 �+0.001
 impact+0.001
pieces+0.001
 hesitation+0.001
ieved+0.001
Neg logit contributions
uffy-0.001
ears-0.001
ugs-0.001
unny-0.001
izzy-0.001
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
uffy+0.000
ears+0.000
ugs+0.000
imi+0.000
unny+0.000
Neg logit contributions
 �-0.000
pieces-0.000
 impact-0.000
 agreement-0.000
 int-0.000
Token caught
Token ID4978
Feature activation+6.82e-01

Pos logit contributions
 �+0.017
pieces+0.016
 impact+0.016
 agreement+0.016
 int+0.015
Neg logit contributions
ears-0.013
uffy-0.012
ugs-0.012
 stairs-0.011
loo-0.011
Token.
Token ID13
Feature activation+3.80e-01

Pos logit contributions
pieces+0.014
 �+0.014
 impact+0.014
ieved+0.014
 agreement+0.014
Neg logit contributions
ears-0.012
uffy-0.012
ugs-0.011
 stairs-0.011
 cob-0.011
Token But
Token ID887
Feature activation+2.17e+00

Pos logit contributions
pieces+0.009
 impact+0.009
ieved+0.009
 hesitation+0.009
 narrative+0.009
Neg logit contributions
 stairs-0.005
unny-0.005
 Fl-0.005
 Tit-0.005
 Chick-0.005
Token the
Token ID262
Feature activation+1.81e+00

Pos logit contributions
pieces+0.064
 impact+0.060
 narrative+0.056
ement+0.056
 unison+0.055
Neg logit contributions
 Fl-0.023
 Tit-0.021
 toys-0.020
 dirty-0.020
 Bee-0.019
Token fisherman
Token ID48750
Feature activation+1.85e+00

Pos logit contributions
pieces+0.053
ieved+0.049
 �+0.046
ement+0.046
alling+0.046
Neg logit contributions
 stairs-0.022
 dirty-0.022
 toys-0.020
unny-0.020
 water-0.019
Token shook
Token ID14682
Feature activation+2.97e+00

Pos logit contributions
pieces+0.044
 impact+0.043
 �+0.043
 agreement+0.042
 int+0.041
Neg logit contributions
uffy-0.030
ears-0.027
unny-0.026
imi-0.025
izzy-0.025
Token his
Token ID465
Feature activation+9.67e-01

Pos logit contributions
pieces+0.100
 impact+0.092
 narrative+0.091
ieved+0.087
alling+0.084
Neg logit contributions
 Fl-0.026
 dirty-0.022
 Jen-0.022
 Tit-0.021
 wet-0.020
Token head
Token ID1182
Feature activation+9.84e-01

Pos logit contributions
pieces+0.029
ieved+0.026
 �+0.025
 impact+0.025
alling+0.025
Neg logit contributions
unny-0.012
uffy-0.012
ears-0.010
 Tit-0.010
ys-0.010
Token and
Token ID290
Feature activation+7.83e-01

Pos logit contributions
 impact+0.021
pieces+0.021
 agreement+0.020
ieved+0.020
 hesitation+0.019
Neg logit contributions
uffy-0.017
ears-0.017
ugs-0.017
unny-0.016
uts-0.016
Token said
Token ID531
Feature activation-1.70e+00

Pos logit contributions
pieces+0.018
 impact+0.018
 �+0.017
 agreement+0.017
ieved+0.016
Neg logit contributions
uffy-0.013
ears-0.013
 Chick-0.012
unny-0.012
 cob-0.012
Token he
Token ID339
Feature activation-2.48e-02

Pos logit contributions
uffy+0.028
ears+0.027
ugs+0.025
 cob+0.025
 Tit+0.025
Neg logit contributions
pieces-0.040
ieved-0.039
 impact-0.039
 agreement-0.039
 unison-0.037
Token but
Token ID475
Feature activation+1.76e+00

Pos logit contributions
pieces+0.015
 impact+0.015
ieved+0.014
 �+0.014
 agreement+0.013
Neg logit contributions
ears-0.022
uffy-0.021
ugs-0.021
 stairs-0.021
imi-0.020
Token when
Token ID618
Feature activation+7.68e-01

Pos logit contributions
 �+0.038
 then+0.035
 int+0.034
pieces+0.034
 hesitation+0.034
Neg logit contributions
ears-0.035
ugs-0.035
uffy-0.034
imi-0.032
unny-0.031
Token she
Token ID673
Feature activation-5.36e-02

Pos logit contributions
 �+0.017
 then+0.016
 hesitation+0.016
 forward+0.015
 End+0.015
Neg logit contributions
ugs-0.015
ears-0.015
uffy-0.014
unny-0.014
izzy-0.014
Token tried
Token ID3088
Feature activation-3.36e-01

Pos logit contributions
ugs+0.001
ears+0.001
uffy+0.001
imi+0.001
unny+0.001
Neg logit contributions
 �-0.001
ieved-0.001
 then-0.001
pieces-0.001
 hesitation-0.001
Token to
Token ID284
Feature activation-1.53e-01

Pos logit contributions
uffy+0.007
ugs+0.007
ears+0.007
unny+0.006
imi+0.006
Neg logit contributions
 �-0.007
 hesitation-0.006
 int-0.006
pieces-0.006
 impact-0.006
Token open
Token ID1280
Feature activation+2.68e+00

Pos logit contributions
ugs+0.004
uffy+0.004
ears+0.004
loo+0.004
unn+0.004
Neg logit contributions
 then-0.002
 agreement-0.002
 agree-0.002
 decide-0.002
 �-0.002
Token it
Token ID340
Feature activation+1.10e-01

Pos logit contributions
 �+0.060
 impact+0.060
pieces+0.060
ieved+0.058
 agreement+0.058
Neg logit contributions
uffy-0.046
ears-0.045
ugs-0.044
 stairs-0.041
imi-0.041
Token,
Token ID11
Feature activation-2.03e-02

Pos logit contributions
 �+0.002
pieces+0.002
 impact+0.002
 int+0.002
 agreement+0.002
Neg logit contributions
uffy-0.002
ears-0.002
ugs-0.002
imi-0.002
unny-0.002
Token it
Token ID340
Feature activation-1.67e+00

Pos logit contributions
ears+0.000
uffy+0.000
ugs+0.000
imi+0.000
unny+0.000
Neg logit contributions
 �-0.001
pieces-0.000
 impact-0.000
 hesitation-0.000
 int-0.000
Token was
Token ID373
Feature activation-6.75e-01

Pos logit contributions
ugs+0.036
ears+0.036
ubs+0.033
unny+0.032
uffy+0.032
Neg logit contributions
 �-0.033
 then-0.031
 int-0.030
ieved-0.028
 unsure-0.028
Token too
Token ID1165
Feature activation-1.91e+00

Pos logit contributions
uffy+0.013
ears+0.013
ugs+0.013
unny+0.012
imi+0.012
Neg logit contributions
 �-0.014
 hesitation-0.013
 int-0.013
 impact-0.013
ieved-0.013
TokenCan
Token ID6090
Feature activation-1.50e-01

Pos logit contributions
 �+0.009
pieces+0.009
 impact+0.009
 int+0.008
 hesitation+0.008
Neg logit contributions
uffy-0.011
ears-0.011
ugs-0.011
imi-0.011
unny-0.010
Token I
Token ID314
Feature activation+1.29e+00

Pos logit contributions
ears+0.002
ugs+0.002
uffy+0.002
imi+0.002
izzy+0.002
Neg logit contributions
 �-0.004
pieces-0.004
 impact-0.004
 int-0.003
ieved-0.003
Token have
Token ID423
Feature activation+1.77e+00

Pos logit contributions
 �+0.019
 then+0.018
 impact+0.018
 agreement+0.017
 forward+0.017
Neg logit contributions
ears-0.032
ugs-0.032
uffy-0.031
imi-0.030
 stairs-0.030
Token some
Token ID617
Feature activation+8.28e-01

Pos logit contributions
 �+0.030
pieces+0.029
 impact+0.029
 hesitation+0.028
 then+0.028
Neg logit contributions
uffy-0.042
ugs-0.041
ears-0.040
imi-0.038
unny-0.038
Token ice
Token ID4771
Feature activation+9.56e-01

Pos logit contributions
 �+0.021
 impact+0.021
pieces+0.020
 hesitation+0.020
 agreement+0.020
Neg logit contributions
uffy-0.012
ugs-0.011
ears-0.011
unny-0.011
imi-0.010
Token cream
Token ID8566
Feature activation+2.91e+00

Pos logit contributions
 �+0.027
 impact+0.027
pieces+0.027
ieved+0.026
 int+0.026
Neg logit contributions
uffy-0.010
ugs-0.009
ears-0.009
imi-0.009
unny-0.008
Token now
Token ID783
Feature activation+1.45e+00

Pos logit contributions
pieces+0.059
 �+0.059
 impact+0.058
 hesitation+0.057
 agreement+0.057
Neg logit contributions
ears-0.058
uffy-0.057
ugs-0.055
loo-0.052
unny-0.050
Token?"
Token ID1701
Feature activation+4.05e-01

Pos logit contributions
pieces+0.029
 �+0.029
 impact+0.028
 agreement+0.028
 hesitation+0.028
Neg logit contributions
uffy-0.029
ears-0.029
ugs-0.028
imi-0.026
loo-0.026
Token Her
Token ID2332
Feature activation-1.06e-01

Pos logit contributions
 �+0.009
pieces+0.008
 int+0.008
 agreement+0.008
 impact+0.008
Neg logit contributions
ears-0.008
uffy-0.008
ugs-0.007
unny-0.007
izzy-0.007
Token mom
Token ID1995
Feature activation+4.26e-01

Pos logit contributions
ears+0.002
ugs+0.002
uffy+0.002
unny+0.002
izzy+0.002
Neg logit contributions
 �-0.002
 agreement-0.002
 impact-0.002
 hesitation-0.002
pieces-0.002
Token said
Token ID531
Feature activation-1.66e+00

Pos logit contributions
 �+0.007
pieces+0.007
 impact+0.007
 hesitation+0.006
 agreement+0.006
Neg logit contributions
uffy-0.010
ears-0.010
ugs-0.010
imi-0.009
izzy-0.009
Token"
Token ID1
Feature activation-4.87e-01

Pos logit contributions
 �+0.003
pieces+0.002
 impact+0.002
 agreement+0.002
 int+0.002
Neg logit contributions
uffy-0.003
ears-0.003
ugs-0.003
imi-0.003
unny-0.003
TokenWhat
Token ID2061
Feature activation+2.16e+00

Pos logit contributions
ears+0.010
uffy+0.010
ugs+0.009
Jen+0.009
unny+0.009
Neg logit contributions
 impact-0.009
pieces-0.009
 �-0.009
 hesitation-0.009
 int-0.009
Token are
Token ID389
Feature activation-2.65e+00

Pos logit contributions
 �+0.045
pieces+0.043
 impact+0.043
 hesitation+0.041
 int+0.041
Neg logit contributions
uffy-0.041
ears-0.041
ugs-0.041
unny-0.038
imi-0.038
Token you
Token ID345
Feature activation+1.88e+00

Pos logit contributions
uffy+0.038
ears+0.037
ugs+0.037
unny+0.033
imi+0.033
Neg logit contributions
 �-0.069
pieces-0.067
 impact-0.065
 int-0.065
 hesitation-0.064
Token do
Token ID466
Feature activation+1.78e+00

Pos logit contributions
 �+0.037
pieces+0.037
 impact+0.036
 agreement+0.035
ieved+0.035
Neg logit contributions
uffy-0.037
ears-0.036
ugs-0.036
imi-0.034
izzy-0.033
Tokenin
Token ID259
Feature activation+3.23e+00

Pos logit contributions
pieces+0.040
 �+0.040
 agreement+0.039
ieved+0.039
 impact+0.039
Neg logit contributions
uffy-0.030
ugs-0.029
ears-0.029
imi-0.027
 stairs-0.027
Token'
Token ID6
Feature activation+5.13e-02

Pos logit contributions
 �+0.067
 agreement+0.063
pieces+0.063
 impact+0.062
ieved+0.062
Neg logit contributions
ears-0.061
ugs-0.061
uffy-0.061
loo-0.057
imi-0.056
Token?"
Token ID1701
Feature activation+6.41e-01

Pos logit contributions
pieces+0.001
 �+0.001
 impact+0.001
 agreement+0.001
ieved+0.001
Neg logit contributions
uffy-0.001
ears-0.001
ugs-0.001
imi-0.001
 stairs-0.001
Token Benny
Token ID44275
Feature activation+7.48e-01

Pos logit contributions
 impact+0.016
pieces+0.015
ieved+0.015
 narrative+0.014
 unison+0.014
Neg logit contributions
 Chick-0.010
uffy-0.009
 Fl-0.009
Su-0.009
Tim-0.009
Token asked
Token ID1965
Feature activation-1.08e+00

Pos logit contributions
 impact+0.014
pieces+0.014
 �+0.013
 agreement+0.013
ieved+0.013
Neg logit contributions
uffy-0.016
ears-0.015
ugs-0.015
imi-0.015
izzy-0.015
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
uffy+0.017
ugs+0.016
un+0.016
 Fl+0.016
unny+0.016
Neg logit contributions
 impact-0.027
pieces-0.026
ieved-0.025
 narrative-0.025
alling-0.024
Token a
Token ID257
Feature activation+1.59e-01

Pos logit contributions
 �+0.054
 impact+0.053
pieces+0.052
 hesitation+0.051
 agreement+0.051
Neg logit contributions
uffy-0.041
ears-0.041
ugs-0.040
imi-0.038
 stairs-0.037
Token show
Token ID905
Feature activation+3.10e+00

Pos logit contributions
 impact+0.004
 �+0.004
ieved+0.004
pieces+0.004
 int+0.004
Neg logit contributions
uffy-0.002
ears-0.002
ugs-0.002
 stairs-0.002
imi-0.002
Token for
Token ID329
Feature activation+9.28e-01

Pos logit contributions
 impact+0.074
 hesitation+0.073
 �+0.070
 int+0.070
ement+0.068
Neg logit contributions
ugs-0.049
uffy-0.047
izzy-0.047
ears-0.046
loo-0.045
Token mom
Token ID1995
Feature activation+2.45e+00

Pos logit contributions
 �+0.021
pieces+0.020
 impact+0.020
 agreement+0.019
 int+0.019
Neg logit contributions
uffy-0.016
ears-0.016
ugs-0.016
imi-0.015
unny-0.015
Token and
Token ID290
Feature activation-1.99e-01

Pos logit contributions
 hesitation+0.059
 �+0.058
 impact+0.057
 int+0.056
 agreement+0.054
Neg logit contributions
ears-0.042
uffy-0.040
ugs-0.040
o-0.037
imi-0.037
Token dad
Token ID9955
Feature activation+3.20e+00

Pos logit contributions
uffy+0.003
ears+0.003
ugs+0.003
imi+0.003
 stairs+0.003
Neg logit contributions
 �-0.005
 impact-0.005
pieces-0.004
ieved-0.004
 int-0.004
Token?
Token ID30
Feature activation+5.84e-01

Pos logit contributions
 impact+0.080
ement+0.080
 hesitation+0.080
 �+0.077
 int+0.076
Neg logit contributions
ears-0.047
uffy-0.046
ugs-0.045
o-0.042
 toys-0.042
Token We
Token ID775
Feature activation+1.54e+00

Pos logit contributions
 impact+0.012
 hesitation+0.012
ieved+0.012
pieces+0.012
 agreement+0.012
Neg logit contributions
uffy-0.011
ears-0.011
ugs-0.010
unny-0.010
izzy-0.010
Token can
Token ID460
Feature activation-1.36e+00

Pos logit contributions
 �+0.034
pieces+0.033
 int+0.032
 impact+0.032
ieved+0.032
Neg logit contributions
uffy-0.028
ears-0.027
ugs-0.026
izzy-0.025
unny-0.025
Token sing
Token ID1702
Feature activation+1.29e+00

Pos logit contributions
uffy+0.028
ears+0.028
ugs+0.028
imi+0.026
unny+0.026
Neg logit contributions
 �-0.026
pieces-0.025
 impact-0.025
 agreement-0.024
 int-0.024
Token and
Token ID290
Feature activation-9.53e-01

Pos logit contributions
 impact+0.027
 hesitation+0.026
 �+0.026
ieved+0.026
pieces+0.026
Neg logit contributions
uffy-0.024
ears-0.023
ugs-0.023
imi-0.022
 stairs-0.021
Token
Token ID198
Feature activation+3.31e-01

Pos logit contributions
uffy+0.035
ears+0.035
ugs+0.034
imi+0.032
unny+0.032
Neg logit contributions
 �-0.034
pieces-0.033
 impact-0.033
 hesitation-0.032
 agreement-0.032
Token
Token ID198
Feature activation+3.46e-01

Pos logit contributions
 �+0.006
pieces+0.005
 int+0.005
ieved+0.005
 agreement+0.005
Neg logit contributions
uffy-0.008
ears-0.008
ugs-0.008
loo-0.007
imi-0.007
Tokenamy
Token ID14814
Feature activation-4.48e-03

Pos logit contributions
 �+0.008
pieces+0.008
ieved+0.008
 agreement+0.008
 int+0.008
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.006
loo-0.005
imi-0.005
Token picked
Token ID6497
Feature activation+4.56e-01

Pos logit contributions
ugs+0.000
ears+0.000
uffy+0.000
loo+0.000
izzy+0.000
Neg logit contributions
 �-0.000
 then-0.000
 int-0.000
 agreement-0.000
ieved-0.000
Token the
Token ID262
Feature activation+1.67e-01

Pos logit contributions
 �+0.007
pieces+0.006
 int+0.006
 impact+0.006
 agreement+0.006
Neg logit contributions
uffy-0.012
ears-0.012
ugs-0.011
imi-0.011
unny-0.011
Token grape
Token ID30777
Feature activation+2.24e-01

Pos logit contributions
 �+0.004
pieces+0.004
 impact+0.004
 int+0.004
 hesitation+0.004
Neg logit contributions
uffy-0.003
ears-0.003
ugs-0.003
imi-0.003
unny-0.003
Token and
Token ID290
Feature activation+2.09e-01

Pos logit contributions
 �+0.004
 int+0.003
 then+0.003
pieces+0.003
 agreement+0.003
Neg logit contributions
uffy-0.006
ugs-0.005
ears-0.005
imi-0.005
izzy-0.005
Token put
Token ID1234
Feature activation+8.39e-01

Pos logit contributions
 �+0.004
 then+0.004
pieces+0.004
ieved+0.003
 int+0.003
Neg logit contributions
uffy-0.005
ugs-0.005
ears-0.005
unny-0.004
imi-0.004
Token it
Token ID340
Feature activation-1.68e+00

Pos logit contributions
 impact+0.020
 �+0.020
pieces+0.020
 hesitation+0.020
ieved+0.019
Neg logit contributions
ears-0.012
uffy-0.012
ugs-0.012
izzy-0.011
 stairs-0.011
Token right
Token ID826
Feature activation+1.54e+00

Pos logit contributions
uffy+0.042
ugs+0.042
ears+0.041
imi+0.040
unny+0.039
Neg logit contributions
 �-0.025
ieved-0.023
 int-0.023
pieces-0.022
 forward-0.022
Token in
Token ID287
Feature activation-2.41e-01

Pos logit contributions
pieces+0.031
 impact+0.031
 �+0.030
 hesitation+0.029
 agreement+0.028
Neg logit contributions
uffy-0.031
ears-0.031
ugs-0.030
izzy-0.029
 stairs-0.028
Token playing
Token ID2712
Feature activation-5.20e-01

Pos logit contributions
 �+0.012
pieces+0.012
 impact+0.012
 hesitation+0.011
ieved+0.011
Neg logit contributions
uffy-0.011
ears-0.010
ugs-0.010
imi-0.010
 stairs-0.010
Token with
Token ID351
Feature activation-1.27e+00

Pos logit contributions
uffy+0.007
 stairs+0.007
 cob+0.007
ears+0.007
 toys+0.007
Neg logit contributions
 hesitation-0.013
pieces-0.012
 impact-0.012
ieved-0.012
 �-0.012
Token his
Token ID465
Feature activation+5.54e-01

Pos logit contributions
uffy+0.037
unny+0.035
ugs+0.035
ears+0.034
izzy+0.033
Neg logit contributions
 �-0.016
 agreement-0.015
 then-0.015
pieces-0.015
 hesitation-0.014
Token toys
Token ID14958
Feature activation-1.20e-01

Pos logit contributions
 �+0.015
pieces+0.015
 impact+0.015
 agreement+0.014
 hesitation+0.014
Neg logit contributions
uffy-0.007
ears-0.007
ugs-0.007
imi-0.006
unny-0.006
Token when
Token ID618
Feature activation-8.56e-01

Pos logit contributions
uffy+0.002
ears+0.002
 stairs+0.002
ugs+0.002
 cob+0.002
Neg logit contributions
 impact-0.003
pieces-0.003
 Amen-0.003
 �-0.003
 agreement-0.002
Token his
Token ID465
Feature activation+9.55e-01

Pos logit contributions
uffy+0.018
ears+0.018
ugs+0.017
imi+0.016
unny+0.016
Neg logit contributions
 �-0.016
pieces-0.016
 impact-0.016
 int-0.015
 hesitation-0.015
Token mom
Token ID1995
Feature activation-2.94e-01

Pos logit contributions
 �+0.021
pieces+0.020
 impact+0.019
 int+0.019
 hesitation+0.019
Neg logit contributions
ears-0.017
ugs-0.017
uffy-0.017
unny-0.016
izzy-0.016
Token told
Token ID1297
Feature activation-1.50e+00

Pos logit contributions
uffy+0.005
ears+0.005
ugs+0.005
imi+0.005
izzy+0.005
Neg logit contributions
 �-0.006
 impact-0.006
pieces-0.006
 agreement-0.006
 int-0.006
Token him
Token ID683
Feature activation+6.50e-01

Pos logit contributions
ears+0.030
uffy+0.030
ugs+0.029
imi+0.028
 stairs+0.027
Neg logit contributions
 �-0.029
pieces-0.028
 impact-0.028
ieved-0.027
 hesitation-0.027
Token it
Token ID340
Feature activation-3.75e-01

Pos logit contributions
 �+0.011
pieces+0.011
 impact+0.011
 int+0.010
 hesitation+0.010
Neg logit contributions
uffy-0.015
ears-0.015
ugs-0.014
imi-0.014
unny-0.014
Token was
Token ID373
Feature activation-2.29e-01

Pos logit contributions
ears+0.007
uffy+0.007
ugs+0.007
unny+0.006
imi+0.006
Neg logit contributions
 �-0.009
 int-0.008
 impact-0.008
pieces-0.008
 hesitation-0.008
Token them
Token ID606
Feature activation+2.16e-01

Pos logit contributions
 �+0.011
pieces+0.011
 int+0.011
 impact+0.011
ieved+0.011
Neg logit contributions
uffy-0.011
ears-0.011
ugs-0.010
loo-0.010
 stairs-0.010
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
 �+0.003
pieces+0.003
 impact+0.003
 agreement+0.003
 int+0.003
Neg logit contributions
ears-0.005
uffy-0.005
ugs-0.005
 stairs-0.005
izzy-0.005
Token Finally
Token ID9461
Feature activation+1.33e-01

Pos logit contributions
 �+0.000
 impact+0.000
pieces+0.000
 int+0.000
ieved+0.000
Neg logit contributions
uffy-0.000
ears-0.000
ugs-0.000
imi-0.000
unny-0.000
Token,
Token ID11
Feature activation-2.99e-01

Pos logit contributions
pieces+0.003
 �+0.003
 impact+0.003
 int+0.003
ieved+0.003
Neg logit contributions
uffy-0.003
ears-0.002
ugs-0.002
loo-0.002
imi-0.002
Token he
Token ID339
Feature activation-2.20e-01

Pos logit contributions
uffy+0.005
ears+0.005
ugs+0.004
Jen+0.004
 stairs+0.004
Neg logit contributions
pieces-0.007
 impact-0.007
 �-0.007
 agreement-0.007
 hesitation-0.007
Token saw
Token ID2497
Feature activation-2.76e-02

Pos logit contributions
ugs+0.005
ears+0.005
uffy+0.005
imi+0.005
 stairs+0.005
Neg logit contributions
 �-0.004
 then-0.004
ieved-0.004
 impact-0.003
 agreement-0.003
Token the
Token ID262
Feature activation-1.99e-01

Pos logit contributions
uffy+0.001
ears+0.001
ugs+0.001
imi+0.001
unny+0.001
Neg logit contributions
 �-0.001
pieces-0.000
 impact-0.000
 agreement-0.000
 int-0.000
Token box
Token ID3091
Feature activation-9.12e-01

Pos logit contributions
ugs+0.003
uffy+0.003
ears+0.003
unny+0.003
imi+0.003
Neg logit contributions
 �-0.005
 impact-0.005
pieces-0.005
 agreement-0.005
 hesitation-0.005
Token stuck
Token ID7819
Feature activation-1.10e+00

Pos logit contributions
uffy+0.017
ears+0.017
ugs+0.016
imi+0.015
 stairs+0.015
Neg logit contributions
 �-0.019
pieces-0.019
 impact-0.019
 agreement-0.018
 int-0.018
Token in
Token ID287
Feature activation-6.91e-01

Pos logit contributions
uffy+0.024
ears+0.024
ugs+0.023
imi+0.022
 stairs+0.022
Neg logit contributions
 �-0.020
pieces-0.019
 impact-0.019
 agreement-0.018
 int-0.018
Token a
Token ID257
Feature activation-1.22e+00

Pos logit contributions
uffy+0.016
ears+0.016
ugs+0.016
unny+0.015
imi+0.015
Neg logit contributions
 �-0.012
 agreement-0.011
 impact-0.011
 hesitation-0.011
pieces-0.011
Token It
Token ID632
Feature activation-1.69e+00

Pos logit contributions
 �+0.006
 int+0.005
 agreement+0.005
pieces+0.005
 ''+0.005
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.006
loo-0.006
imi-0.006
Token was
Token ID373
Feature activation-6.80e-01

Pos logit contributions
ugs+0.037
ears+0.037
uffy+0.035
unny+0.032
loo+0.031
Neg logit contributions
 �-0.035
 then-0.032
 int-0.031
 agreement-0.030
ieved-0.030
Token calling
Token ID4585
Feature activation-5.74e-01

Pos logit contributions
ears+0.018
ugs+0.017
uffy+0.017
unny+0.017
unn+0.016
Neg logit contributions
 �-0.012
 then-0.011
 hesitation-0.010
 agreement-0.010
 int-0.010
Token her
Token ID607
Feature activation+1.08e+00

Pos logit contributions
ears+0.015
uffy+0.015
ugs+0.014
loo+0.014
unn+0.013
Neg logit contributions
 �-0.010
 forward-0.009
 int-0.009
 agreement-0.008
 hesitation-0.008
Token in
Token ID287
Feature activation+1.22e+00

Pos logit contributions
 �+0.015
 int+0.013
 forward+0.012
 hesitation+0.012
 then+0.012
Neg logit contributions
ears-0.030
ugs-0.029
Jen-0.028
Su-0.028
loo-0.028
Token from
Token ID422
Feature activation+2.07e+00

Pos logit contributions
pieces+0.027
ieved+0.026
 impact+0.025
 �+0.025
 int+0.025
Neg logit contributions
uffy-0.021
 stairs-0.020
ugs-0.020
 cob-0.020
ears-0.020
Token the
Token ID262
Feature activation+1.79e-01

Pos logit contributions
pieces+0.049
 �+0.045
 impact+0.045
ieved+0.044
ement+0.044
Neg logit contributions
 cob-0.035
 stairs-0.034
uffy-0.033
ears-0.033
imi-0.032
Token cold
Token ID4692
Feature activation-2.34e+00

Pos logit contributions
 �+0.005
pieces+0.005
 impact+0.005
ieved+0.005
 int+0.005
Neg logit contributions
uffy-0.002
ears-0.002
ugs-0.002
imi-0.002
 stairs-0.002
Token.
Token ID13
Feature activation+9.44e-02

Pos logit contributions
uffy+0.052
ears+0.051
unny+0.049
ugs+0.048
uts+0.046
Neg logit contributions
 impact-0.041
 �-0.039
 then-0.038
 agreement-0.037
 hesitation-0.036
Token She
Token ID1375
Feature activation-2.78e-02

Pos logit contributions
 �+0.002
 int+0.001
pieces+0.001
 agreement+0.001
 then+0.001
Neg logit contributions
ears-0.002
uffy-0.002
ugs-0.002
loo-0.002
imi-0.002
Token ran
Token ID4966
Feature activation+4.83e-01

Pos logit contributions
ugs+0.001
ears+0.001
uffy+0.001
loo+0.001
 stairs+0.001
Neg logit contributions
 �-0.001
 then-0.001
 agreement-0.001
 int-0.001
ieved-0.001
Token anymore
Token ID7471
Feature activation-7.54e-01

Pos logit contributions
ears+0.001
uffy+0.001
ugs+0.001
 stairs+0.001
loo+0.001
Neg logit contributions
 �-0.001
pieces-0.001
 impact-0.001
ieved-0.001
 int-0.001
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
uffy+0.016
ears+0.016
ugs+0.015
imi+0.015
 stairs+0.014
Neg logit contributions
 �-0.014
 impact-0.014
pieces-0.014
ieved-0.014
 agreement-0.013
Token They
Token ID1119
Feature activation-4.12e-01

Pos logit contributions
 �+0.008
pieces+0.007
 impact+0.007
 hesitation+0.007
ieved+0.007
Neg logit contributions
uffy-0.005
ears-0.005
ugs-0.005
imi-0.005
unny-0.005
Token want
Token ID765
Feature activation+5.82e-01

Pos logit contributions
ears+0.007
uffy+0.007
izzy+0.007
 cob+0.007
 dirty+0.007
Neg logit contributions
pieces-0.008
ement-0.008
 �-0.008
ieved-0.008
 int-0.008
Token to
Token ID284
Feature activation+1.51e-01

Pos logit contributions
 �+0.011
pieces+0.011
 hesitation+0.010
 int+0.010
 impact+0.010
Neg logit contributions
uffy-0.013
ugs-0.012
ears-0.012
unny-0.012
imi-0.012
Token play
Token ID711
Feature activation+1.64e-02

Pos logit contributions
 �+0.003
pieces+0.003
 impact+0.003
 agreement+0.003
 int+0.003
Neg logit contributions
uffy-0.003
ears-0.003
ugs-0.003
imi-0.003
unny-0.003
Token with
Token ID351
Feature activation-9.66e-01

Pos logit contributions
 �+0.000
 hesitation+0.000
 impact+0.000
 int+0.000
ieved+0.000
Neg logit contributions
uffy-0.000
 toys-0.000
ears-0.000
 stairs-0.000
 cob-0.000
Token their
Token ID511
Feature activation-1.24e+00

Pos logit contributions
ears+0.024
ugs+0.024
uffy+0.023
unny+0.023
izzy+0.022
Neg logit contributions
pieces-0.015
 �-0.014
 agreement-0.014
 then-0.014
 hesitation-0.013
Token toys
Token ID14958
Feature activation+3.70e-01

Pos logit contributions
ugs+0.019
ears+0.018
Su+0.018
izzy+0.017
unny+0.017
Neg logit contributions
 agreement-0.031
 impact-0.030
 �-0.029
pieces-0.029
 hesitation-0.029
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
pieces+0.007
 �+0.007
 impact+0.007
 int+0.007
ieved+0.007
Neg logit contributions
uffy-0.007
ears-0.007
ugs-0.007
 stairs-0.007
loo-0.007
Token books
Token ID3835
Feature activation+7.69e-01

Pos logit contributions
ears+0.016
 stairs+0.016
uffy+0.016
 cob+0.016
ugs+0.015
Neg logit contributions
 �-0.028
ement-0.028
ieved-0.028
 int-0.028
 impact-0.027
Token fish
Token ID5916
Feature activation+8.31e-01

Pos logit contributions
uffy+0.021
ugs+0.020
unny+0.019
ears+0.019
izzy+0.019
Neg logit contributions
 �-0.019
 int-0.017
 hesitation-0.017
 impact-0.016
pieces-0.016
Token.
Token ID13
Feature activation+3.02e-01

Pos logit contributions
 �+0.014
pieces+0.013
 impact+0.013
 agreement+0.013
 int+0.013
Neg logit contributions
uffy-0.019
ears-0.019
ugs-0.019
imi-0.018
unny-0.018
Token He
Token ID679
Feature activation-3.72e-01

Pos logit contributions
 �+0.006
 impact+0.006
pieces+0.005
 agreement+0.005
 int+0.005
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.006
imi-0.006
unny-0.006
Token likes
Token ID7832
Feature activation-7.10e-01

Pos logit contributions
uffy+0.007
ears+0.007
ugs+0.007
imi+0.007
izzy+0.007
Neg logit contributions
 �-0.008
pieces-0.007
 impact-0.007
ieved-0.007
 hesitation-0.007
Token to
Token ID284
Feature activation-2.63e-01

Pos logit contributions
uffy+0.014
ugs+0.013
ears+0.013
izzy+0.012
imi+0.012
Neg logit contributions
 �-0.015
pieces-0.014
 impact-0.014
 int-0.014
 hesitation-0.014
Token eat
Token ID4483
Feature activation-7.02e-01

Pos logit contributions
ears+0.004
uffy+0.004
ugs+0.004
Jen+0.004
imi+0.004
Neg logit contributions
 �-0.006
ieved-0.006
ement-0.006
 int-0.006
pieces-0.006
Token fish
Token ID5916
Feature activation+1.34e+00

Pos logit contributions
ears+0.010
uffy+0.010
ugs+0.010
imi+0.009
 stairs+0.009
Neg logit contributions
pieces-0.018
 �-0.018
 impact-0.017
ieved-0.017
 hesitation-0.017
Token and
Token ID290
Feature activation-9.18e-01

Pos logit contributions
 �+0.022
pieces+0.021
 impact+0.020
 int+0.020
 agreement+0.020
Neg logit contributions
uffy-0.031
ears-0.031
ugs-0.031
 stairs-0.029
imi-0.029
Token he
Token ID339
Feature activation-7.54e-01

Pos logit contributions
ears+0.013
uffy+0.013
 stairs+0.012
 cob+0.012
ugs+0.012
Neg logit contributions
ement-0.023
ieved-0.023
 �-0.022
pieces-0.022
 int-0.022
Token likes
Token ID7832
Feature activation-5.98e-01

Pos logit contributions
uffy+0.015
ears+0.014
ugs+0.014
imi+0.013
unny+0.013
Neg logit contributions
 �-0.015
pieces-0.015
 impact-0.015
 agreement-0.014
 int-0.014
Token to
Token ID284
Feature activation-2.74e-01

Pos logit contributions
uffy+0.010
ears+0.010
ugs+0.010
imi+0.009
unny+0.009
Neg logit contributions
 �-0.013
pieces-0.013
 impact-0.013
 agreement-0.013
ieved-0.013
Token Mix
Token ID15561
Feature activation-6.05e-01

Pos logit contributions
 �+0.033
 impact+0.032
 hesitation+0.031
 agreement+0.031
pieces+0.030
Neg logit contributions
uffy-0.037
ears-0.037
ugs-0.037
unny-0.034
imi-0.033
Tokening
Token ID278
Feature activation+7.24e-01

Pos logit contributions
uffy+0.011
ears+0.011
ugs+0.011
unny+0.011
 stairs+0.011
Neg logit contributions
 �-0.012
pieces-0.011
 then-0.011
 int-0.011
 impact-0.011
Token Treasure
Token ID20215
Feature activation-2.00e-01

Pos logit contributions
 �+0.016
pieces+0.015
 impact+0.015
 agreement+0.015
 int+0.015
Neg logit contributions
uffy-0.013
ears-0.013
ugs-0.013
imi-0.012
unny-0.012
Token would
Token ID561
Feature activation-1.26e+00

Pos logit contributions
ears+0.004
ugs+0.004
uffy+0.004
imi+0.004
unny+0.003
Neg logit contributions
 �-0.004
pieces-0.004
 impact-0.004
 int-0.004
 hesitation-0.004
Token mix
Token ID5022
Feature activation+1.37e+00

Pos logit contributions
uffy+0.029
ugs+0.029
ears+0.029
imi+0.026
unny+0.026
Neg logit contributions
 �-0.022
 impact-0.022
 agreement-0.021
 hesitation-0.021
pieces-0.021
Token all
Token ID477
Feature activation-1.12e+00

Pos logit contributions
 �+0.028
pieces+0.027
 impact+0.026
 int+0.026
 agreement+0.026
Neg logit contributions
uffy-0.027
ears-0.027
ugs-0.026
imi-0.025
unny-0.025
Token the
Token ID262
Feature activation+3.46e-01

Pos logit contributions
uffy+0.028
ears+0.026
ugs+0.026
unny+0.026
imi+0.025
Neg logit contributions
 �-0.018
pieces-0.017
 impact-0.017
 int-0.017
 forward-0.016
Token colors
Token ID7577
Feature activation-1.11e-01

Pos logit contributions
 �+0.008
 impact+0.008
pieces+0.008
 hesitation+0.008
 agreement+0.008
Neg logit contributions
uffy-0.005
ugs-0.005
ears-0.005
imi-0.005
unny-0.005
Token,
Token ID11
Feature activation+7.57e-01

Pos logit contributions
uffy+0.002
ears+0.002
ugs+0.002
imi+0.002
unny+0.002
Neg logit contributions
 �-0.002
pieces-0.002
 impact-0.002
 int-0.002
 agreement-0.002
Token sounds
Token ID5238
Feature activation-1.15e-01

Pos logit contributions
 �+0.013
pieces+0.013
 impact+0.013
ieved+0.013
 agreement+0.013
Neg logit contributions
ears-0.016
uffy-0.016
ugs-0.016
 stairs-0.015
imi-0.015
Token and
Token ID290
Feature activation+4.45e-01

Pos logit contributions
ears+0.002
ugs+0.002
uffy+0.002
 stairs+0.002
izzy+0.002
Neg logit contributions
pieces-0.002
 �-0.002
ieved-0.002
 impact-0.002
 agreement-0.002
Token sorry
Token ID7926
Feature activation-9.92e-01

Pos logit contributions
uffy+0.048
ears+0.048
ugs+0.047
imi+0.044
unny+0.044
Neg logit contributions
 �-0.040
pieces-0.038
 impact-0.038
 int-0.037
 agreement-0.037
Token for
Token ID329
Feature activation-8.51e-01

Pos logit contributions
ears+0.024
ugs+0.024
uffy+0.024
Jen+0.023
loo+0.023
Neg logit contributions
 �-0.017
 int-0.015
 impact-0.014
 then-0.014
pieces-0.014
Token knocking
Token ID21500
Feature activation+2.34e-01

Pos logit contributions
ears+0.018
uffy+0.018
ugs+0.018
unny+0.017
loo+0.017
Neg logit contributions
 �-0.017
 hesitation-0.016
 agreement-0.016
pieces-0.016
 impact-0.016
Token over
Token ID625
Feature activation+6.94e-01

Pos logit contributions
 �+0.004
pieces+0.003
 forward+0.003
 int+0.003
 impact+0.003
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.005
unny-0.005
imi-0.005
Token her
Token ID607
Feature activation+5.58e-01

Pos logit contributions
 �+0.014
pieces+0.013
 impact+0.013
 agreement+0.013
 hesitation+0.013
Neg logit contributions
uffy-0.014
ears-0.014
ugs-0.014
unny-0.013
imi-0.013
Token toy
Token ID13373
Feature activation-1.23e+00

Pos logit contributions
 �+0.015
pieces+0.014
ieved+0.014
 impact+0.014
 int+0.014
Neg logit contributions
uffy-0.008
ears-0.008
ugs-0.007
 stairs-0.007
imi-0.007
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
ears+0.027
ugs+0.026
uffy+0.026
unny+0.025
imi+0.025
Neg logit contributions
 �-0.022
pieces-0.021
 impact-0.021
 int-0.020
 then-0.020
Token Lily
Token ID20037
Feature activation-6.83e-01

Pos logit contributions
ears+0.005
ugs+0.005
uffy+0.005
unny+0.005
imi+0.005
Neg logit contributions
 �-0.008
 int-0.008
pieces-0.007
 agreement-0.007
 impact-0.007
Token forg
Token ID5784
Feature activation+6.76e-01

Pos logit contributions
ears+0.015
uffy+0.015
ugs+0.015
imi+0.014
unny+0.014
Neg logit contributions
 �-0.012
pieces-0.012
 impact-0.011
ieved-0.011
 agreement-0.011
Tokenave
Token ID1015
Feature activation-1.35e+00

Pos logit contributions
 �+0.013
pieces+0.012
 impact+0.012
 hesitation+0.012
 int+0.012
Neg logit contributions
uffy-0.014
ears-0.014
ugs-0.014
izzy-0.013
unny-0.013
Token him
Token ID683
Feature activation-1.15e-01

Pos logit contributions
uffy+0.019
ears+0.018
ugs+0.018
imi+0.018
izzy+0.017
Neg logit contributions
 �-0.034
pieces-0.034
 impact-0.034
ieved-0.033
 hesitation-0.033
Token,
Token ID11
Feature activation-5.89e-01

Pos logit contributions
 �+0.017
pieces+0.017
 impact+0.017
 int+0.016
 agreement+0.016
Neg logit contributions
uffy-0.017
ears-0.016
ugs-0.016
imi-0.015
unny-0.015
Token Rex
Token ID17853
Feature activation-3.96e-01

Pos logit contributions
uffy+0.008
ears+0.008
ugs+0.008
unny+0.007
izzy+0.007
Neg logit contributions
 �-0.016
pieces-0.015
 agreement-0.015
 impact-0.015
 hesitation-0.015
Token,"
Token ID553
Feature activation-3.93e-01

Pos logit contributions
ears+0.008
ugs+0.007
uffy+0.007
 stairs+0.007
izzy+0.007
Neg logit contributions
 �-0.008
 impact-0.008
pieces-0.008
 agreement-0.008
ieved-0.008
Token Tom
Token ID4186
Feature activation-2.49e-02

Pos logit contributions
uffy+0.004
ugs+0.004
ears+0.004
unny+0.004
imi+0.003
Neg logit contributions
 �-0.012
 then-0.011
pieces-0.011
 agreement-0.011
 hesitation-0.011
Token said
Token ID531
Feature activation-1.03e+00

Pos logit contributions
uffy+0.001
ears+0.001
ugs+0.001
imi+0.001
loo+0.001
Neg logit contributions
 �-0.000
pieces-0.000
 int-0.000
 impact-0.000
 agreement-0.000
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
uffy+0.018
ugs+0.017
ears+0.017
unny+0.016
 Fl+0.016
Neg logit contributions
 impact-0.024
pieces-0.024
ieved-0.023
 narrative-0.022
 unison-0.021
Token "
Token ID366
Feature activation-1.38e+00

Pos logit contributions
uffy+0.026
ears+0.025
ugs+0.025
unny+0.024
 stairs+0.024
Neg logit contributions
pieces-0.028
 impact-0.027
ieved-0.026
 hesitation-0.025
 agreement-0.025
TokenHe
Token ID1544
Feature activation-8.46e-01

Pos logit contributions
ears+0.026
uffy+0.026
ugs+0.026
unny+0.024
imi+0.024
Neg logit contributions
 �-0.029
pieces-0.028
 impact-0.028
 hesitation-0.027
 int-0.027
Token likes
Token ID7832
Feature activation-8.02e-01

Pos logit contributions
ugs+0.019
ears+0.018
Su+0.018
 stairs+0.017
uffy+0.017
Neg logit contributions
 then-0.015
 �-0.015
ieved-0.014
 impact-0.014
 int-0.014
Token you
Token ID345
Feature activation+1.81e-01

Pos logit contributions
uffy+0.016
ugs+0.015
ears+0.015
izzy+0.014
unny+0.014
Neg logit contributions
 �-0.017
 impact-0.016
 hesitation-0.016
 then-0.016
pieces-0.016
Token.
Token ID13
Feature activation-8.34e-01

Pos logit contributions
 �+0.003
 agreement+0.003
pieces+0.003
 then+0.003
 impact+0.003
Neg logit contributions
ears-0.004
uffy-0.004
ugs-0.004
 stairs-0.004
izzy-0.004
Token because
Token ID780
Feature activation-9.79e-01

Pos logit contributions
ears+0.011
ugs+0.011
uffy+0.011
unny+0.011
imi+0.010
Neg logit contributions
 �-0.008
 impact-0.007
 int-0.007
 hesitation-0.007
 then-0.007
Token he
Token ID339
Feature activation-4.21e-01

Pos logit contributions
uffy+0.025
ears+0.024
ugs+0.024
unny+0.023
imi+0.022
Neg logit contributions
 �-0.016
 then-0.015
pieces-0.014
 int-0.014
 hesitation-0.014
Token didn
Token ID1422
Feature activation-1.15e+00

Pos logit contributions
ugs+0.011
ears+0.011
uffy+0.010
loo+0.010
unny+0.010
Neg logit contributions
 then-0.006
 �-0.006
ieved-0.006
 int-0.005
 unsure-0.005
Token't
Token ID470
Feature activation-3.54e-01

Pos logit contributions
ears+0.018
uffy+0.017
ugs+0.017
unny+0.015
 cob+0.014
Neg logit contributions
 �-0.030
 int-0.028
pieces-0.027
 then-0.027
ieved-0.027
Token want
Token ID765
Feature activation+3.24e-01

Pos logit contributions
uffy+0.008
ugs+0.008
ears+0.008
imi+0.007
unny+0.007
Neg logit contributions
 �-0.006
 impact-0.006
pieces-0.006
 agreement-0.006
 hesitation-0.006
Token to
Token ID284
Feature activation-4.83e-01

Pos logit contributions
 �+0.007
pieces+0.006
 impact+0.006
 hesitation+0.006
 int+0.006
Neg logit contributions
uffy-0.006
ears-0.006
ugs-0.006
unny-0.006
imi-0.006
Token lose
Token ID4425
Feature activation-5.48e-01

Pos logit contributions
ugs+0.012
ears+0.012
uffy+0.012
unny+0.011
loo+0.011
Neg logit contributions
 impact-0.008
 agreement-0.007
 then-0.007
 �-0.007
 agree-0.007
Token his
Token ID465
Feature activation-3.88e-02

Pos logit contributions
uffy+0.009
ears+0.009
 stairs+0.009
ugs+0.009
 cob+0.008
Neg logit contributions
 �-0.012
ieved-0.012
pieces-0.012
 int-0.012
 impact-0.012
Token favorite
Token ID4004
Feature activation-1.10e+00

Pos logit contributions
uffy+0.000
ears+0.000
ugs+0.000
 stairs+0.000
 cob+0.000
Neg logit contributions
 �-0.001
ieved-0.001
pieces-0.001
 hesitation-0.001
 int-0.001
Token toy
Token ID13373
Feature activation-1.82e+00

Pos logit contributions
uffy+0.010
ears+0.010
ugs+0.009
 stairs+0.009
 cob+0.008
Neg logit contributions
 �-0.034
ieved-0.033
 hesitation-0.032
pieces-0.032
 int-0.032
Token.
Token ID13
Feature activation-7.93e-02

Pos logit contributions
uffy+0.036
ears+0.034
ugs+0.033
 stairs+0.033
loo+0.032
Neg logit contributions
 �-0.036
ieved-0.036
 hesitation-0.036
pieces-0.036
 impact-0.035
Token day
Token ID1110
Feature activation-3.22e-01

Pos logit contributions
uffy+0.004
ears+0.004
ugs+0.004
unny+0.004
imi+0.004
Neg logit contributions
 �-0.005
pieces-0.005
 impact-0.005
 agreement-0.005
 int-0.005
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
ears+0.008
ugs+0.008
uffy+0.008
imi+0.008
izzy+0.008
Neg logit contributions
 �-0.005
 hesitation-0.004
 int-0.004
 then-0.004
 forward-0.004
Token a
Token ID257
Feature activation+1.67e+00

Pos logit contributions
ears+0.028
ugs+0.028
uffy+0.028
imi+0.026
loo+0.025
Neg logit contributions
 �-0.025
 then-0.022
 agreement-0.021
 int-0.021
pieces-0.021
Token little
Token ID1310
Feature activation-8.16e-01

Pos logit contributions
 �+0.044
pieces+0.042
 impact+0.042
 agreement+0.041
 int+0.041
Neg logit contributions
ears-0.023
uffy-0.023
ugs-0.022
imi-0.020
unny-0.020
Token girl
Token ID2576
Feature activation-2.21e+00

Pos logit contributions
ears+0.016
ugs+0.016
uffy+0.015
uts+0.015
imi+0.014
Neg logit contributions
 �-0.017
 impact-0.016
 agreement-0.016
 forward-0.015
 then-0.015
Token named
Token ID3706
Feature activation-3.65e+00

Pos logit contributions
ears+0.063
uffy+0.062
ugs+0.061
unny+0.059
imi+0.058
Neg logit contributions
 �-0.026
 impact-0.023
pieces-0.023
 then-0.023
 agreement-0.022
Token Sus
Token ID8932
Feature activation-1.41e+00

Pos logit contributions
ears+0.084
loo+0.084
ugs+0.082
uffy+0.080
 flakes+0.079
Neg logit contributions
 �-0.066
 then-0.058
 agreement-0.057
pieces-0.055
 forward-0.053
Tokenie
Token ID494
Feature activation-2.00e+00

Pos logit contributions
unny+0.025
uffy+0.025
ugs+0.025
ears+0.024
uts+0.024
Neg logit contributions
ieved-0.028
pieces-0.028
 then-0.028
 agreement-0.027
 �-0.027
Token was
Token ID373
Feature activation+1.16e+00

Pos logit contributions
ears+0.046
ugs+0.046
uffy+0.044
unny+0.043
imi+0.042
Neg logit contributions
 �-0.034
 then-0.032
 impact-0.032
pieces-0.031
 agreement-0.030
Token walking
Token ID6155
Feature activation+9.84e-01

Pos logit contributions
 �+0.023
pieces+0.022
 impact+0.022
 int+0.022
 agreement+0.022
Neg logit contributions
ears-0.023
uffy-0.023
ugs-0.023
unny-0.021
imi-0.021
Token in
Token ID287
Feature activation+9.72e-01

Pos logit contributions
 �+0.019
pieces+0.018
 impact+0.018
 agreement+0.017
 int+0.017
Neg logit contributions
uffy-0.021
ears-0.021
ugs-0.020
imi-0.019
unny-0.019
Token it
Token ID340
Feature activation-1.17e+00

Pos logit contributions
 �+0.020
pieces+0.020
 impact+0.020
ieved+0.019
 hesitation+0.019
Neg logit contributions
uffy-0.019
ears-0.019
ugs-0.019
imi-0.018
 stairs-0.017
Token.
Token ID13
Feature activation-8.88e-01

Pos logit contributions
ears+0.025
uffy+0.025
ugs+0.024
 stairs+0.023
imi+0.023
Neg logit contributions
 impact-0.021
pieces-0.021
 �-0.021
 hesitation-0.020
ieved-0.020
Token Inside
Token ID14384
Feature activation+5.66e-01

Pos logit contributions
uffy+0.019
ears+0.019
ugs+0.018
imi+0.017
unny+0.017
Neg logit contributions
 �-0.017
pieces-0.016
 impact-0.016
 int-0.016
 agreement-0.016
Token,
Token ID11
Feature activation-3.78e-01

Pos logit contributions
 �+0.009
ement+0.008
 then+0.008
 forward+0.008
 hesitation+0.008
Neg logit contributions
ears-0.014
ugs-0.014
uffy-0.013
unny-0.013
Su-0.013
Token there
Token ID612
Feature activation+9.36e-02

Pos logit contributions
uffy+0.008
ears+0.007
ugs+0.007
imi+0.007
unny+0.007
Neg logit contributions
 �-0.008
pieces-0.007
 int-0.007
 impact-0.007
 hesitation-0.007
Token are
Token ID389
Feature activation-3.40e+00

Pos logit contributions
 �+0.002
 impact+0.002
 agreement+0.002
 int+0.002
pieces+0.002
Neg logit contributions
ears-0.002
ugs-0.002
uffy-0.002
unny-0.002
Su-0.002
Token two
Token ID734
Feature activation+1.19e-01

Pos logit contributions
uffy+0.075
ugs+0.074
unny+0.069
ears+0.069
imi+0.068
Neg logit contributions
 �-0.065
 then-0.064
pieces-0.059
 hesitation-0.057
 unison-0.055
Token shiny
Token ID22441
Feature activation-1.11e+00

Pos logit contributions
 �+0.003
pieces+0.003
 impact+0.003
 hesitation+0.003
 agreement+0.003
Neg logit contributions
uffy-0.002
ears-0.002
ugs-0.002
unny-0.002
izzy-0.002
Token costumes
Token ID24138
Feature activation+3.10e-01

Pos logit contributions
uffy+0.017
ugs+0.016
ears+0.016
unny+0.016
izzy+0.015
Neg logit contributions
 �-0.027
pieces-0.027
 impact-0.026
 hesitation-0.026
 int-0.026
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
 �+0.005
pieces+0.005
 impact+0.005
 hesitation+0.005
 int+0.005
Neg logit contributions
uffy-0.007
ears-0.007
ugs-0.007
imi-0.007
unny-0.007
Token They
Token ID1119
Feature activation-1.38e+00

Pos logit contributions
uffy+0.004
ears+0.004
ugs+0.004
izzy+0.004
imi+0.004
Neg logit contributions
 �-0.005
pieces-0.005
 impact-0.005
ieved-0.005
 hesitation-0.005
Token day
Token ID1110
Feature activation+9.96e-01

Pos logit contributions
 �+0.004
 then+0.004
 agreement+0.004
 impact+0.004
 int+0.004
Neg logit contributions
ears-0.006
ugs-0.006
uffy-0.006
uts-0.006
unny-0.005
Token,
Token ID11
Feature activation+9.43e-01

Pos logit contributions
 �+0.017
pieces+0.015
 hesitation+0.015
 impact+0.015
 int+0.015
Neg logit contributions
ears-0.024
uffy-0.024
ugs-0.023
imi-0.022
unny-0.022
Token the
Token ID262
Feature activation+4.72e-01

Pos logit contributions
 �+0.022
 impact+0.022
pieces+0.022
 agreement+0.021
ieved+0.021
Neg logit contributions
uffy-0.015
ears-0.015
ugs-0.014
 stairs-0.013
imi-0.013
Token arm
Token ID3211
Feature activation-1.34e+00

Pos logit contributions
 �+0.009
 then+0.009
 agreement+0.009
 End+0.009
 hesitation+0.008
Neg logit contributions
ears-0.010
ugs-0.010
uffy-0.010
unny-0.009
ubs-0.009
Tokenchair
Token ID16337
Feature activation-5.09e-02

Pos logit contributions
ears+0.029
ugs+0.028
uffy+0.028
unny+0.027
imi+0.026
Neg logit contributions
 �-0.025
 then-0.024
 int-0.022
pieces-0.022
ieved-0.021
Token started
Token ID2067
Feature activation-3.70e+00

Pos logit contributions
ears+0.001
ugs+0.001
uffy+0.001
imi+0.001
unny+0.001
Neg logit contributions
 �-0.001
pieces-0.001
 impact-0.001
 int-0.001
ieved-0.001
Token to
Token ID284
Feature activation-1.29e+00

Pos logit contributions
uffy+0.084
ears+0.082
ugs+0.080
unny+0.080
izzy+0.078
Neg logit contributions
 �-0.070
 int-0.065
 hesitation-0.063
 impact-0.060
 then-0.060
Token applaud
Token ID22517
Feature activation+8.67e-01

Pos logit contributions
uffy+0.026
ugs+0.026
ears+0.026
unny+0.024
izzy+0.023
Neg logit contributions
 �-0.026
 impact-0.025
 agreement-0.025
 hesitation-0.024
 int-0.024
Token.
Token ID13
Feature activation+9.65e-01

Pos logit contributions
 �+0.015
pieces+0.014
 impact+0.014
 int+0.014
 agreement+0.014
Neg logit contributions
uffy-0.020
ears-0.020
ugs-0.019
imi-0.018
unny-0.018
Token Tom
Token ID4186
Feature activation+2.69e-01

Pos logit contributions
pieces+0.023
 impact+0.022
 �+0.022
 hesitation+0.022
 agreement+0.022
Neg logit contributions
ears-0.015
uffy-0.015
ugs-0.015
 stairs-0.014
imi-0.014
Token was
Token ID373
Feature activation+2.08e+00

Pos logit contributions
 �+0.005
 then+0.005
ieved+0.005
 int+0.005
pieces+0.005
Neg logit contributions
ugs-0.006
uffy-0.006
ears-0.006
loo-0.005
unny-0.005
Token to
Token ID284
Feature activation-3.33e+00

Pos logit contributions
uffy+0.122
ears+0.121
ugs+0.117
izzy+0.113
imi+0.113
Neg logit contributions
 �-0.115
 int-0.106
 forward-0.100
 hesitation-0.100
 impact-0.099
Token go
Token ID467
Feature activation-2.39e+00

Pos logit contributions
uffy+0.057
ears+0.056
ugs+0.055
imi+0.053
 stairs+0.053
Neg logit contributions
pieces-0.073
 �-0.072
 impact-0.070
ieved-0.068
 hesitation-0.068
Token down
Token ID866
Feature activation-2.20e+00

Pos logit contributions
uffy+0.053
ears+0.052
ugs+0.051
 stairs+0.049
imi+0.048
Neg logit contributions
pieces-0.042
 �-0.041
 impact-0.041
 hesitation-0.040
 agreement-0.039
Token and
Token ID290
Feature activation-3.89e-01

Pos logit contributions
uffy+0.051
ears+0.051
ugs+0.050
imi+0.048
unny+0.047
Neg logit contributions
 �-0.038
 impact-0.035
pieces-0.035
 int-0.034
 agreement-0.034
Token it
Token ID340
Feature activation-2.61e+00

Pos logit contributions
uffy+0.007
ears+0.007
ugs+0.007
 stairs+0.007
imi+0.007
Neg logit contributions
 �-0.008
pieces-0.008
 impact-0.008
ieved-0.007
 hesitation-0.007
Token got
Token ID1392
Feature activation-3.93e+00

Pos logit contributions
ears+0.054
ugs+0.054
uffy+0.052
unny+0.049
imi+0.048
Neg logit contributions
 �-0.053
 int-0.050
 then-0.049
ieved-0.049
pieces-0.048
Token cold
Token ID4692
Feature activation-3.03e+00

Pos logit contributions
uffy+0.075
ears+0.074
ugs+0.073
imi+0.068
unny+0.068
Neg logit contributions
 �-0.082
pieces-0.079
 impact-0.079
 int-0.077
 agreement-0.077
Token outside
Token ID2354
Feature activation-1.98e+00

Pos logit contributions
uffy+0.075
ears+0.073
ugs+0.072
unny+0.070
imi+0.069
Neg logit contributions
 �-0.047
 impact-0.044
 int-0.043
 hesitation-0.042
 then-0.042
Token.
Token ID13
Feature activation+7.80e-01

Pos logit contributions
ears+0.049
uffy+0.049
ugs+0.048
unny+0.046
imi+0.046
Neg logit contributions
 �-0.031
 impact-0.029
pieces-0.028
 int-0.028
 agreement-0.028
Token Jake
Token ID14757
Feature activation+4.86e-01

Pos logit contributions
pieces+0.016
 impact+0.015
 �+0.015
 hesitation+0.015
ieved+0.015
Neg logit contributions
uffy-0.015
ugs-0.015
ears-0.015
 stairs-0.014
imi-0.014
Token was
Token ID373
Feature activation+1.13e+00

Pos logit contributions
 �+0.010
 then+0.009
ieved+0.009
 int+0.009
pieces+0.009
Neg logit contributions
ears-0.010
ugs-0.010
uffy-0.010
imi-0.009
loo-0.009
Token was
Token ID373
Feature activation-1.13e+00

Pos logit contributions
 �+0.005
 then+0.004
ement+0.004
 End+0.004
 agreement+0.004
Neg logit contributions
ears-0.007
ugs-0.007
unny-0.006
unn-0.006
uffy-0.006
Token a
Token ID257
Feature activation+9.06e-01

Pos logit contributions
uffy+0.025
ears+0.025
ugs+0.025
unny+0.024
loo+0.023
Neg logit contributions
 �-0.021
pieces-0.019
 agreement-0.019
 impact-0.018
 int-0.018
Token happy
Token ID3772
Feature activation-1.21e+00

Pos logit contributions
 �+0.022
 impact+0.021
 agreement+0.021
pieces+0.021
 int+0.021
Neg logit contributions
ugs-0.014
ears-0.014
uffy-0.013
unny-0.012
izzy-0.012
Token little
Token ID1310
Feature activation-1.01e+00

Pos logit contributions
ears+0.024
ugs+0.024
uffy+0.022
izzy+0.022
uts+0.021
Neg logit contributions
 �-0.026
 agreement-0.024
 impact-0.024
 then-0.023
 int-0.022
Token boy
Token ID2933
Feature activation-5.60e-01

Pos logit contributions
ears+0.020
ugs+0.020
uffy+0.019
ubs+0.019
uts+0.019
Neg logit contributions
 �-0.020
 impact-0.019
 agreement-0.019
 then-0.018
 forward-0.018
Token named
Token ID3706
Feature activation-3.46e+00

Pos logit contributions
uffy+0.013
ears+0.013
ugs+0.013
imi+0.012
izzy+0.012
Neg logit contributions
 �-0.009
pieces-0.009
 impact-0.009
ieved-0.009
 int-0.009
Token Tim
Token ID5045
Feature activation-3.96e-01

Pos logit contributions
ugs+0.070
ears+0.068
uffy+0.067
 cob+0.063
unny+0.062
Neg logit contributions
 �-0.074
 Sir-0.065
 then-0.064
 int-0.063
 agreement-0.061
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
uffy+0.007
ears+0.007
ugs+0.007
imi+0.006
izzy+0.006
Neg logit contributions
 �-0.009
pieces-0.009
 impact-0.009
 int-0.009
 hesitation-0.009
Token He
Token ID679
Feature activation-5.71e-01

Pos logit contributions
uffy+0.004
ugs+0.004
ears+0.004
unny+0.004
imi+0.003
Neg logit contributions
 �-0.005
 impact-0.004
 agreement-0.004
 int-0.004
pieces-0.004
Token was
Token ID373
Feature activation+1.58e+00

Pos logit contributions
ears+0.015
ugs+0.015
uffy+0.014
unny+0.014
 stairs+0.013
Neg logit contributions
 �-0.009
 then-0.008
 int-0.007
 Amen-0.007
 agreement-0.007
Token so
Token ID523
Feature activation-5.39e-01

Pos logit contributions
 �+0.029
pieces+0.027
 impact+0.027
 agreement+0.027
 int+0.027
Neg logit contributions
ears-0.035
uffy-0.034
ugs-0.034
unny-0.032
imi-0.032