TOP ACTIVATIONS
MAX = 0.098

power to do anything can 't always make things work out
. Even clumsy accidents couldn 't ruin their day ! <|endoftext|>
always remember how smooth it felt on her toy car .
the most reliable things can 't always be trusted . <|endoftext|>
guess how high the tree was . His mom said it
Lily loved how soft it felt and wrapped it around her
to dream about what life could be like outside his small
to see how heavy it was . So , he found
play with things she shouldn 't touch . <|endoftext|> Once upon
't believe how soft it felt . All the other cats
did not know what oxygen was , but he liked the
him get things he couldn 't reach . <|endoftext|> Once there
see how tough the string was . People in
noticed how smooth the grass felt beneath her feet .
running away from problems won 't solve them . So ,
't believe how cold it felt ! She couldn 't believe
spray paint anything that shouldn 't be painted ."
noticed how smooth the grass felt on his feet . He
didn 't like how it felt when it rubbed against her
even the biggest storm couldn 't knock it down . Tim

MIN ACTIVATIONS
MIN = -0.085

George asked the bar ber for a new hairst
the counter and asked the bar ista if they could print
shop . He thanked the bar ber for the great job
goodbye . He left the bar ber feeling happy and ended
now . She thanked the bar ber , who was still
go ! Everyone watched in amaz ement as it took off
all . He thanked the bar ber for his new style
excited . She held the bar of chocolate and skipped all
, they arrived at the bar ber shop . The bar
handsome . He thanked the bar ber and ran out of
. At the bar ber , the bar ber
Sam thanked the bar ber and he gave him
vious . It asked the bar ber , " Can I
her mom my watched in amaz ement as the machine flew
egg ! He watched in amaz ement as the egg began
twenty - eight , twenty - nine , thirty , thirty
little girl looked up in amaz ement and said , "
. He thanked the clumsy bar ber and went home with
, and he thanked the bar ber for the expensive haircut
Lucy walked out of the bar bers hop , everyone admired

INTERVAL 0.053 - 0.098
CONTAINS 0.215%

" S ara , what did you do ?" Mom asked
Ben and Lily were hungry . They wanted to eat pizza
much easier to get things done . That 's why it
a safe place where nobody could find it .
no matter how fearful he felt . The End . <|endoftext|>

INTERVAL 0.007 - 0.053
CONTAINS 35.513%

one final hug before he went home . <|endoftext|> Once upon
, something bad happened to it . All of a sudden
thought he could eat it later when no one was looking
who was crying because he lost his toy . Lily felt
It was trying to steal the oats ! Mark

INTERVAL -0.039 - 0.007
CONTAINS 63.190%

are strong too , Danny . That is why it is
day on , the smart fox and the little girl were
hands and ran to their ship . They sailed away ,
Can you please help me ?" The clever fish
Mom my and Daddy were so happy and

INTERVAL -0.085 - -0.039
CONTAINS 1.081%

One day , the zoo keeper noticed that some of
?" Jack said , " Yes , we can . I
!" she says . " You two worked very well together
, Mom my , why are
, I can â
Token power
Token ID1176
Feature activation-6.32e-03

Pos logit contributions
umb+0.005
 need+0.005
py+0.005
umper+0.005
hing+0.004
Neg logit contributions
ber-0.005
Just-0.005
 belts-0.005
ager-0.005
rd-0.005
Token to
Token ID284
Feature activation+1.99e-02

Pos logit contributions
Just+0.011
ber+0.011
 belts+0.011
rd+0.011
ager+0.010
Neg logit contributions
 need-0.009
umb-0.009
py-0.009
umper-0.008
 apart-0.008
Token do
Token ID466
Feature activation+2.07e-02

Pos logit contributions
 need+0.026
umb+0.024
py+0.024
umper+0.023
hing+0.022
Neg logit contributions
ber-0.039
Just-0.038
 belts-0.037
rd-0.037
edly-0.037
Token anything
Token ID1997
Feature activation+8.95e-03

Pos logit contributions
 need+0.035
umb+0.032
py+0.031
 apart+0.031
umper+0.030
Neg logit contributions
ber-0.033
Just-0.033
 belts-0.031
edly-0.031
rd-0.030
Token can
Token ID460
Feature activation+5.89e-02

Pos logit contributions
 need+0.015
umb+0.015
py+0.014
umper+0.014
 apart+0.014
Neg logit contributions
ber-0.014
Just-0.013
 belts-0.013
rd-0.013
ager-0.013
Token't
Token ID470
Feature activation+9.85e-02

Pos logit contributions
umb+0.097
umper+0.094
ained+0.088
py+0.086
 apart+0.080
Neg logit contributions
Just-0.092
ves-0.088
ager-0.084
ber-0.082
 nicely-0.081
Token always
Token ID1464
Feature activation+6.69e-02

Pos logit contributions
umb+0.165
umper+0.157
py+0.147
ained+0.143
hing+0.133
Neg logit contributions
Just-0.152
ager-0.145
 nicely-0.144
ves-0.142
ber-0.142
Token make
Token ID787
Feature activation+4.43e-02

Pos logit contributions
umb+0.109
umper+0.101
py+0.099
hing+0.091
 need+0.091
Neg logit contributions
Just-0.108
ber-0.106
ager-0.103
rd-0.102
 belts-0.101
Token things
Token ID1243
Feature activation+2.26e-02

Pos logit contributions
umb+0.080
py+0.075
umper+0.075
 need+0.074
hing+0.070
Neg logit contributions
ber-0.060
Just-0.060
ves-0.059
 belts-0.058
ager-0.058
Token work
Token ID670
Feature activation+3.55e-02

Pos logit contributions
umb+0.033
py+0.031
umper+0.031
 need+0.029
emaker+0.026
Neg logit contributions
ber-0.039
Just-0.039
 belts-0.039
ves-0.038
ager-0.037
Token out
Token ID503
Feature activation-4.91e-03

Pos logit contributions
umb+0.068
py+0.066
umper+0.066
 need+0.063
hing+0.059
Neg logit contributions
Just-0.044
ber-0.044
 belts-0.044
ves-0.042
rd-0.042
Token.
Token ID13
Feature activation-2.13e-03

Pos logit contributions
umb+0.075
umper+0.073
py+0.071
 need+0.069
hing+0.066
Neg logit contributions
ber-0.050
rd-0.049
 belts-0.048
Just-0.048
ves-0.046
Token Even
Token ID3412
Feature activation+1.83e-02

Pos logit contributions
Just+0.004
ber+0.003
 belts+0.003
rd+0.003
ival+0.003
Neg logit contributions
umb-0.003
 need-0.003
py-0.003
umper-0.003
hing-0.003
Token clumsy
Token ID39120
Feature activation-4.46e-03

Pos logit contributions
umb+0.034
py+0.030
 need+0.030
umper+0.030
hing+0.029
Neg logit contributions
ber-0.027
 belts-0.025
 nicely-0.024
Just-0.024
 else-0.024
Token accidents
Token ID17390
Feature activation+2.97e-02

Pos logit contributions
ber+0.007
 belts+0.007
ves+0.007
Just+0.007
rd+0.006
Neg logit contributions
umb-0.008
 need-0.007
umper-0.007
py-0.007
hing-0.006
Token couldn
Token ID3521
Feature activation+5.99e-02

Pos logit contributions
umb+0.049
umper+0.049
py+0.045
 need+0.043
 strongest+0.040
Neg logit contributions
ves-0.046
Just-0.044
 belts-0.044
uddled-0.043
ival-0.043
Token't
Token ID470
Feature activation+9.08e-02

Pos logit contributions
umper+0.089
umb+0.081
py+0.077
 strongest+0.076
 seeker+0.073
Neg logit contributions
ves-0.110
Just-0.106
ork-0.099
--0.098
rd-0.095
Token ruin
Token ID16866
Feature activation+4.55e-02

Pos logit contributions
umb+0.139
umper+0.133
py+0.125
 need+0.122
 apart+0.116
Neg logit contributions
Just-0.153
ves-0.148
ber-0.146
 belts-0.142
ager-0.142
Token their
Token ID511
Feature activation+9.75e-05

Pos logit contributions
umb+0.077
umper+0.072
 need+0.071
py+0.071
hing+0.069
Neg logit contributions
ber-0.067
 belts-0.066
ves-0.066
Just-0.065
rd-0.065
Token day
Token ID1110
Feature activation-8.81e-03

Pos logit contributions
 need+0.000
 apart+0.000
 crash+0.000
 earthquake+0.000
 strongest+0.000
Neg logit contributions
Just-0.000
ber-0.000
edly-0.000
ager-0.000
ves-0.000
Token!
Token ID0
Feature activation-1.02e-02

Pos logit contributions
ber+0.013
Just+0.013
 belts+0.013
rd+0.013
ager+0.012
Neg logit contributions
umb-0.015
 need-0.015
py-0.014
umper-0.014
hing-0.013
Token<|endoftext|>
Token ID50256
Feature activation-2.02e-02

Pos logit contributions
Just+0.010
ber+0.009
rd+0.009
 belts+0.008
ves+0.008
Neg logit contributions
umb-0.022
 need-0.022
py-0.022
hing-0.022
umper-0.022
Token always
Token ID1464
Feature activation+2.53e-02

Pos logit contributions
umb+0.011
umper+0.011
py+0.010
 apart+0.010
ained+0.010
Neg logit contributions
Just-0.011
 belts-0.011
ber-0.011
ves-0.010
 nicely-0.010
Token remember
Token ID3505
Feature activation-3.01e-04

Pos logit contributions
umb+0.039
umper+0.036
py+0.035
 need+0.034
ained+0.033
Neg logit contributions
Just-0.043
ber-0.043
 belts-0.042
ager-0.041
rd-0.041
Token how
Token ID703
Feature activation-2.52e-02

Pos logit contributions
ber+0.000
Just+0.000
rd+0.000
ager+0.000
edly+0.000
Neg logit contributions
 need-0.000
py-0.000
umb-0.000
umper-0.000
 apart-0.000
Token smooth
Token ID7209
Feature activation+3.75e-02

Pos logit contributions
ber+0.040
Just+0.040
rd+0.039
edly+0.039
 belts+0.039
Neg logit contributions
 need-0.041
umb-0.039
py-0.037
umper-0.037
 apart-0.035
Token it
Token ID340
Feature activation+3.54e-02

Pos logit contributions
umb+0.078
 need+0.075
py+0.074
umper+0.073
hing+0.069
Neg logit contributions
ves-0.042
 belts-0.041
ber-0.039
Just-0.038
edly-0.037
Token felt
Token ID2936
Feature activation+9.00e-02

Pos logit contributions
umb+0.069
umper+0.067
py+0.063
 need+0.062
hing+0.060
Neg logit contributions
Just-0.043
ves-0.042
 belts-0.042
 else-0.041
ber-0.041
Token on
Token ID319
Feature activation+3.30e-02

Pos logit contributions
umb+0.171
py+0.170
umper+0.167
 need+0.166
hing+0.159
Neg logit contributions
Just-0.115
ber-0.111
 belts-0.105
edly-0.104
rd-0.104
Token her
Token ID607
Feature activation+1.77e-02

Pos logit contributions
umb+0.053
 need+0.052
py+0.050
umper+0.050
hing+0.048
Neg logit contributions
Just-0.053
ber-0.051
ves-0.050
 belts-0.050
edly-0.049
Token toy
Token ID13373
Feature activation-8.64e-03

Pos logit contributions
 need+0.026
umb+0.026
py+0.025
 apart+0.023
 strongest+0.023
Neg logit contributions
ber-0.030
ival-0.030
ager-0.030
Just-0.029
rd-0.028
Token car
Token ID1097
Feature activation+3.29e-03

Pos logit contributions
ber+0.013
Just+0.013
 belts+0.012
ves+0.012
edly+0.012
Neg logit contributions
umb-0.015
 need-0.015
py-0.014
umper-0.014
hing-0.013
Token.
Token ID13
Feature activation-6.67e-03

Pos logit contributions
umb+0.006
 need+0.006
py+0.006
umper+0.006
hing+0.005
Neg logit contributions
ber-0.004
Just-0.004
 belts-0.004
ves-0.004
rd-0.004
Token the
Token ID262
Feature activation+4.23e-03

Pos logit contributions
umb+0.008
umper+0.007
py+0.007
hing+0.007
 need+0.007
Neg logit contributions
Just-0.006
ber-0.006
 belts-0.006
 nicely-0.005
edly-0.005
Token most
Token ID749
Feature activation-7.67e-03

Pos logit contributions
 need+0.006
umb+0.006
py+0.006
umper+0.005
 apart+0.005
Neg logit contributions
ber-0.008
Just-0.007
rd-0.007
 belts-0.007
ager-0.007
Token reliable
Token ID9314
Feature activation+8.17e-03

Pos logit contributions
ber+0.011
Just+0.011
 belts+0.011
rd+0.011
edly+0.011
Neg logit contributions
 need-0.014
umb-0.013
py-0.013
 strongest-0.012
umper-0.012
Token things
Token ID1243
Feature activation-5.14e-03

Pos logit contributions
 need+0.012
umb+0.011
py+0.010
 strongest+0.010
 scatter+0.010
Neg logit contributions
Just-0.015
ber-0.015
ival-0.015
 belts-0.014
edly-0.014
Token can
Token ID460
Feature activation+5.06e-02

Pos logit contributions
ber+0.010
Just+0.009
 belts+0.009
ager+0.009
ves+0.009
Neg logit contributions
umb-0.007
py-0.006
umper-0.006
 need-0.006
hing-0.006
Token't
Token ID470
Feature activation+8.90e-02

Pos logit contributions
umb+0.083
umper+0.082
py+0.075
 apart+0.074
ained+0.073
Neg logit contributions
Just-0.078
ves-0.074
ager-0.073
rd-0.071
ber-0.070
Token always
Token ID1464
Feature activation+5.74e-02

Pos logit contributions
umb+0.136
umper+0.131
py+0.125
 need+0.121
hing+0.117
Neg logit contributions
Just-0.149
ves-0.142
ber-0.142
ager-0.141
 belts-0.138
Token be
Token ID307
Feature activation+3.39e-02

Pos logit contributions
umb+0.089
umper+0.084
py+0.083
 need+0.082
hing+0.078
Neg logit contributions
Just-0.095
ber-0.093
ager-0.091
rd-0.090
 belts-0.090
Token trusted
Token ID13467
Feature activation+2.43e-02

Pos logit contributions
umb+0.051
umper+0.047
 need+0.047
py+0.047
hing+0.046
Neg logit contributions
ber-0.057
Just-0.057
 belts-0.054
ager-0.054
rd-0.053
Token.
Token ID13
Feature activation-7.86e-03

Pos logit contributions
 need+0.041
py+0.038
umb+0.038
umper+0.037
 apart+0.036
Neg logit contributions
ber-0.039
Just-0.038
ager-0.037
 belts-0.037
rd-0.037
Token<|endoftext|>
Token ID50256
Feature activation-2.28e-02

Pos logit contributions
Just+0.011
ber+0.010
ager+0.009
rd+0.009
edly+0.009
Neg logit contributions
umb-0.014
py-0.014
 need-0.014
hing-0.014
umper-0.014
Token guess
Token ID4724
Feature activation-9.51e-03

Pos logit contributions
ber+0.011
Just+0.011
 belts+0.010
rd+0.010
ager+0.010
Neg logit contributions
umb-0.008
 need-0.008
py-0.008
umper-0.008
hing-0.007
Token how
Token ID703
Feature activation-2.26e-02

Pos logit contributions
Just+0.015
ager+0.015
ber+0.014
uddled+0.014
edly+0.014
Neg logit contributions
 need-0.016
py-0.015
umb-0.015
 apart-0.014
umper-0.014
Token high
Token ID1029
Feature activation+2.15e-02

Pos logit contributions
ber+0.030
edly+0.029
Just+0.029
uddled+0.029
 belts+0.029
Neg logit contributions
 need-0.045
py-0.041
umb-0.040
 strongest-0.039
 apart-0.039
Token the
Token ID262
Feature activation+2.88e-02

Pos logit contributions
umb+0.038
 need+0.037
umper+0.036
py+0.035
hing+0.034
Neg logit contributions
ber-0.031
Just-0.031
 belts-0.030
edly-0.029
ves-0.029
Token tree
Token ID5509
Feature activation+1.97e-02

Pos logit contributions
umb+0.041
 need+0.040
py+0.039
umper+0.038
hing+0.036
Neg logit contributions
ber-0.051
Just-0.051
 belts-0.049
rd-0.049
ager-0.048
Token was
Token ID373
Feature activation+8.74e-02

Pos logit contributions
umb+0.033
umper+0.032
 need+0.032
py+0.031
hing+0.029
Neg logit contributions
ber-0.030
Just-0.029
 belts-0.028
ves-0.028
rd-0.028
Token.
Token ID13
Feature activation-8.57e-03

Pos logit contributions
 need+0.154
umb+0.154
py+0.149
umper+0.145
 apart+0.140
Neg logit contributions
ber-0.125
Just-0.123
 belts-0.119
rd-0.117
ager-0.117
Token His
Token ID2399
Feature activation-1.51e-02

Pos logit contributions
ber+0.016
 belts+0.016
ager+0.016
uddled+0.016
ves+0.016
Neg logit contributions
 need-0.011
umb-0.011
py-0.010
umper-0.010
 apart-0.010
Token mom
Token ID1995
Feature activation+7.22e-03

Pos logit contributions
Just+0.025
uddled+0.024
rd+0.024
ves+0.023
ager+0.023
Neg logit contributions
 need-0.024
py-0.022
 strongest-0.021
 earthquake-0.020
 tallest-0.020
Token said
Token ID531
Feature activation-1.69e-02

Pos logit contributions
 need+0.011
umb+0.010
py+0.010
umper+0.010
hing+0.009
Neg logit contributions
ber-0.013
Just-0.012
 belts-0.012
rd-0.012
ager-0.012
Token it
Token ID340
Feature activation-2.09e-02

Pos logit contributions
Just+0.032
ager+0.031
aj+0.031
rd+0.031
 belts+0.031
Neg logit contributions
 need-0.022
py-0.020
 crash-0.018
 scatter-0.018
 apart-0.017
Token Lily
Token ID20037
Feature activation+8.98e-03

Pos logit contributions
 need+0.019
umb+0.018
 apart+0.016
py+0.016
umper+0.016
Neg logit contributions
ber-0.031
ager-0.030
Just-0.030
ves-0.029
 belts-0.029
Token loved
Token ID6151
Feature activation+3.56e-03

Pos logit contributions
 need+0.013
umb+0.011
 scatter+0.011
py+0.010
umper+0.010
Neg logit contributions
ival-0.016
ber-0.016
rd-0.016
five-0.015
edly-0.015
Token how
Token ID703
Feature activation-2.43e-02

Pos logit contributions
 need+0.005
py+0.004
umb+0.004
umper+0.004
 crash+0.004
Neg logit contributions
ber-0.007
Just-0.007
ager-0.007
ival-0.007
rd-0.007
Token soft
Token ID2705
Feature activation+4.14e-02

Pos logit contributions
ber+0.043
Just+0.043
ager+0.041
edly+0.041
ival+0.040
Neg logit contributions
 need-0.037
umb-0.033
py-0.032
umper-0.030
 strongest-0.030
Token it
Token ID340
Feature activation+3.70e-02

Pos logit contributions
umb+0.083
 need+0.081
umper+0.079
py+0.078
hing+0.073
Neg logit contributions
 belts-0.050
Just-0.050
edly-0.048
ves-0.048
ber-0.048
Token felt
Token ID2936
Feature activation+8.73e-02

Pos logit contributions
umper+0.071
umb+0.071
py+0.068
 need+0.065
hing+0.063
Neg logit contributions
 belts-0.046
Just-0.046
uddled-0.045
ves-0.044
ber-0.043
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
umb+0.163
py+0.162
umper+0.160
 need+0.158
hing+0.152
Neg logit contributions
Just-0.114
ber-0.110
 belts-0.106
edly-0.105
rd-0.104
Token wrapped
Token ID12908
Feature activation+4.44e-03

Pos logit contributions
ber+0.031
Just+0.029
ival+0.029
ager+0.029
rd+0.029
Neg logit contributions
 need-0.028
py-0.026
umb-0.026
 apart-0.024
umper-0.024
Token it
Token ID340
Feature activation-2.45e-03

Pos logit contributions
umb+0.006
 need+0.006
py+0.006
 apart+0.006
umper+0.006
Neg logit contributions
ber-0.008
Just-0.008
ager-0.008
rd-0.008
ival-0.008
Token around
Token ID1088
Feature activation-1.86e-02

Pos logit contributions
ber+0.005
rd+0.005
ager+0.005
 belts+0.005
ival+0.005
Neg logit contributions
 need-0.003
 apart-0.003
umb-0.003
py-0.003
chy-0.002
Token her
Token ID607
Feature activation+1.34e-02

Pos logit contributions
ber+0.040
ival+0.038
ager+0.038
Just+0.038
 belts+0.038
Neg logit contributions
 need-0.022
umb-0.020
py-0.020
umper-0.019
 apart-0.019
Token to
Token ID284
Feature activation+3.70e-03

Pos logit contributions
ber+0.005
Just+0.005
ager+0.004
rd+0.004
ves+0.004
Neg logit contributions
 need-0.007
py-0.006
umb-0.006
umper-0.006
 crash-0.006
Token dream
Token ID4320
Feature activation+4.65e-03

Pos logit contributions
umb+0.006
py+0.005
 need+0.005
umper+0.005
 apart+0.005
Neg logit contributions
Just-0.006
ber-0.006
 belts-0.006
rd-0.006
ager-0.006
Token about
Token ID546
Feature activation+5.30e-03

Pos logit contributions
 need+0.008
umb+0.008
py+0.008
umper+0.007
 apart+0.007
Neg logit contributions
ber-0.007
Just-0.007
ager-0.007
rd-0.006
edly-0.006
Token what
Token ID644
Feature activation+7.14e-03

Pos logit contributions
 need+0.008
umb+0.008
py+0.007
umper+0.007
 apart+0.007
Neg logit contributions
ber-0.009
Just-0.009
 belts-0.009
ager-0.009
rd-0.009
Token life
Token ID1204
Feature activation+2.89e-02

Pos logit contributions
 need+0.011
umb+0.010
py+0.010
 strongest+0.010
umper+0.010
Neg logit contributions
Just-0.012
ber-0.012
edly-0.012
uddled-0.012
ager-0.012
Token could
Token ID714
Feature activation+8.68e-02

Pos logit contributions
umb+0.048
 need+0.046
py+0.046
umper+0.045
 apart+0.043
Neg logit contributions
ber-0.044
Just-0.043
 belts-0.043
ves-0.041
ager-0.041
Token be
Token ID307
Feature activation+5.25e-02

Pos logit contributions
umb+0.126
umper+0.122
py+0.115
 need+0.112
 apart+0.109
Neg logit contributions
Just-0.149
ber-0.145
 belts-0.144
ves-0.143
rd-0.142
Token like
Token ID588
Feature activation+1.30e-02

Pos logit contributions
umb+0.090
 need+0.087
py+0.085
umper+0.085
hing+0.080
Neg logit contributions
ber-0.078
Just-0.077
 belts-0.075
rd-0.074
ager-0.073
Token outside
Token ID2354
Feature activation-1.85e-02

Pos logit contributions
 need+0.021
umb+0.020
py+0.020
umper+0.019
 apart+0.019
Neg logit contributions
ber-0.021
Just-0.021
ager-0.020
rd-0.020
 belts-0.020
Token his
Token ID465
Feature activation+2.91e-03

Pos logit contributions
ber+0.024
 belts+0.023
Just+0.023
rd+0.023
edly+0.022
Neg logit contributions
umb-0.035
py-0.034
 need-0.034
umper-0.034
hing-0.032
Token small
Token ID1402
Feature activation+2.30e-02

Pos logit contributions
 need+0.004
umb+0.004
py+0.004
 apart+0.003
 strongest+0.003
Neg logit contributions
Just-0.006
ves-0.005
ager-0.005
ival-0.005
uddled-0.005
Token to
Token ID284
Feature activation+1.96e-02

Pos logit contributions
umb+0.015
 need+0.015
py+0.014
umper+0.014
hing+0.014
Neg logit contributions
ber-0.013
Just-0.013
 belts-0.012
ager-0.012
rd-0.012
Token see
Token ID766
Feature activation+1.06e-02

Pos logit contributions
umb+0.026
 need+0.026
py+0.025
umper+0.025
hing+0.023
Neg logit contributions
ber-0.036
Just-0.036
 belts-0.035
ager-0.035
rd-0.035
Token how
Token ID703
Feature activation-3.29e-02

Pos logit contributions
 need+0.020
umb+0.020
py+0.019
umper+0.019
 apart+0.018
Neg logit contributions
ber-0.014
Just-0.014
 belts-0.013
rd-0.013
ager-0.013
Token heavy
Token ID4334
Feature activation+3.76e-02

Pos logit contributions
ber+0.057
Just+0.057
rd+0.056
 belts+0.056
edly+0.055
Neg logit contributions
 need-0.050
umb-0.047
umper-0.044
py-0.043
 apart-0.042
Token it
Token ID340
Feature activation+4.62e-02

Pos logit contributions
umb+0.083
py+0.077
umper+0.077
 need+0.076
hing+0.075
Neg logit contributions
ves-0.038
ber-0.037
 belts-0.037
Just-0.035
 nicely-0.034
Token was
Token ID373
Feature activation+8.68e-02

Pos logit contributions
umb+0.099
umper+0.097
py+0.094
 need+0.089
hing+0.088
Neg logit contributions
Just-0.047
ves-0.046
 belts-0.045
ber-0.044
 else-0.044
Token.
Token ID13
Feature activation-5.91e-03

Pos logit contributions
umb+0.162
 need+0.160
py+0.158
umper+0.156
hing+0.150
Neg logit contributions
ber-0.114
Just-0.113
 belts-0.107
ager-0.106
rd-0.105
Token So
Token ID1406
Feature activation-3.55e-03

Pos logit contributions
ber+0.010
Just+0.010
 belts+0.010
rd+0.010
ager+0.010
Neg logit contributions
umb-0.009
 need-0.009
py-0.008
umper-0.008
 apart-0.008
Token,
Token ID11
Feature activation+2.26e-03

Pos logit contributions
Just+0.005
ber+0.005
 belts+0.005
ves+0.004
rd+0.004
Neg logit contributions
umb-0.007
umper-0.007
 need-0.006
py-0.006
hing-0.006
Token he
Token ID339
Feature activation-5.79e-03

Pos logit contributions
umb+0.005
py+0.004
umper+0.004
 need+0.004
hing+0.004
Neg logit contributions
Just-0.003
ber-0.003
ves-0.003
 belts-0.003
ager-0.002
Token found
Token ID1043
Feature activation-5.53e-03

Pos logit contributions
rd+0.010
ber+0.010
Just+0.010
 belts+0.010
edly+0.010
Neg logit contributions
 need-0.008
umb-0.008
py-0.008
umper-0.008
 apart-0.007
Token play
Token ID711
Feature activation+5.59e-03

Pos logit contributions
ber+0.004
Just+0.004
 belts+0.004
rd+0.004
ager+0.004
Neg logit contributions
 need-0.003
umb-0.003
py-0.003
umper-0.003
 apart-0.003
Token with
Token ID351
Feature activation+2.86e-03

Pos logit contributions
 need+0.009
 apart+0.009
umb+0.008
hing+0.008
py+0.008
Neg logit contributions
ber-0.009
Just-0.009
 belts-0.009
ager-0.008
becca-0.008
Token things
Token ID1243
Feature activation+1.69e-02

Pos logit contributions
 need+0.004
umb+0.004
py+0.003
 apart+0.003
umper+0.003
Neg logit contributions
Just-0.005
ber-0.005
ager-0.005
rd-0.005
ival-0.005
Token she
Token ID673
Feature activation+1.05e-02

Pos logit contributions
 need+0.027
 apart+0.025
umb+0.024
umper+0.023
py+0.023
Neg logit contributions
Just-0.028
ber-0.028
rd-0.027
ager-0.027
 belts-0.027
Token shouldn
Token ID6584
Feature activation+5.50e-02

Pos logit contributions
 need+0.015
umper+0.012
umb+0.012
py+0.011
 strongest+0.011
Neg logit contributions
ves-0.020
ber-0.020
 belts-0.020
edly-0.020
Just-0.020
Token't
Token ID470
Feature activation+8.67e-02

Pos logit contributions
umb+0.055
py+0.052
 need+0.052
umper+0.051
 apart+0.048
Neg logit contributions
Just-0.120
ber-0.118
ves-0.115
ager-0.115
rd-0.114
Token touch
Token ID3638
Feature activation+8.39e-03

Pos logit contributions
umb+0.127
 need+0.125
py+0.122
umper+0.120
hing+0.113
Neg logit contributions
ber-0.150
Just-0.150
 belts-0.144
rd-0.143
ager-0.142
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
umb+0.015
py+0.015
umper+0.015
 need+0.014
hing+0.014
Neg logit contributions
ber-0.011
 belts-0.011
rd-0.010
ves-0.010
Just-0.010
Token<|endoftext|>
Token ID50256
Feature activation-3.27e-02

Pos logit contributions
ber+0.016
 belts+0.016
Just+0.016
ager+0.015
edly+0.015
Neg logit contributions
 need-0.024
umb-0.023
py-0.022
umper-0.022
 apart-0.021
TokenOnce
Token ID7454
Feature activation+1.38e-02

Pos logit contributions
ber+0.039
Just+0.038
 belts+0.036
ager+0.035
rd+0.035
Neg logit contributions
 need-0.066
umb-0.066
py-0.064
umper-0.063
 apart-0.061
Token upon
Token ID2402
Feature activation-1.46e-02

Pos logit contributions
umb+0.026
py+0.025
 need+0.025
umper+0.025
hing+0.025
Neg logit contributions
ber-0.018
Just-0.017
rd-0.017
ves-0.017
 belts-0.017
Token't
Token ID470
Feature activation+5.28e-02

Pos logit contributions
umb+0.037
py+0.037
umper+0.036
 need+0.035
 apart+0.033
Neg logit contributions
Just-0.066
ves-0.064
ber-0.063
rd-0.062
ival-0.061
Token believe
Token ID1975
Feature activation-2.49e-02

Pos logit contributions
umb+0.094
 need+0.092
py+0.091
umper+0.090
 apart+0.085
Neg logit contributions
Just-0.074
ber-0.073
ager-0.069
 belts-0.069
rd-0.069
Token how
Token ID703
Feature activation-2.92e-02

Pos logit contributions
ber+0.038
Just+0.038
 belts+0.037
ager+0.036
rd+0.036
Neg logit contributions
 need-0.041
umb-0.040
py-0.040
umper-0.039
 apart-0.038
Token soft
Token ID2705
Feature activation+4.23e-02

Pos logit contributions
ber+0.055
 belts+0.054
edly+0.053
uddled+0.051
Just+0.051
Neg logit contributions
 need-0.044
umb-0.037
py-0.036
 apart-0.036
 strongest-0.035
Token it
Token ID340
Feature activation+2.79e-02

Pos logit contributions
umb+0.080
 need+0.077
py+0.076
umper+0.076
hing+0.070
Neg logit contributions
ber-0.055
Just-0.054
 belts-0.054
edly-0.053
ves-0.052
Token felt
Token ID2936
Feature activation+8.63e-02

Pos logit contributions
umb+0.058
umper+0.057
 need+0.055
py+0.055
hing+0.051
Neg logit contributions
Just-0.031
ber-0.030
 belts-0.030
ves-0.030
ager-0.028
Token.
Token ID13
Feature activation-8.15e-03

Pos logit contributions
umb+0.172
py+0.169
umper+0.168
 need+0.165
hing+0.159
Neg logit contributions
Just-0.101
ber-0.099
rd-0.095
edly-0.094
ager-0.093
Token All
Token ID1439
Feature activation+4.60e-04

Pos logit contributions
ber+0.017
Just+0.016
 belts+0.016
ager+0.016
rd+0.016
Neg logit contributions
 need-0.010
umb-0.010
umper-0.009
py-0.009
 apart-0.008
Token the
Token ID262
Feature activation-2.08e-03

Pos logit contributions
umb+0.001
py+0.001
umper+0.001
 need+0.001
hing+0.001
Neg logit contributions
ber-0.001
rd-0.001
Just-0.001
 belts-0.001
ager-0.001
Token other
Token ID584
Feature activation+2.68e-03

Pos logit contributions
ber+0.004
Just+0.004
 belts+0.003
rd+0.003
ager+0.003
Neg logit contributions
 need-0.003
umb-0.003
py-0.003
umper-0.003
 apart-0.003
Token cats
Token ID11875
Feature activation+1.50e-02

Pos logit contributions
umb+0.004
 need+0.004
py+0.004
umper+0.004
hing+0.004
Neg logit contributions
ber-0.004
Just-0.004
 belts-0.004
rd-0.004
ager-0.004
Token did
Token ID750
Feature activation+2.22e-02

Pos logit contributions
rd+0.004
hand+0.004
ival+0.004
ber+0.004
edly+0.004
Neg logit contributions
 need-0.003
py-0.003
 scatter-0.002
umb-0.002
 strongest-0.002
Token not
Token ID407
Feature activation+2.95e-02

Pos logit contributions
 need+0.034
umb+0.030
py+0.030
 crash+0.029
 apart+0.029
Neg logit contributions
ber-0.040
edly-0.037
rd-0.037
ival-0.037
Just-0.037
Token know
Token ID760
Feature activation+3.68e-03

Pos logit contributions
umb+0.043
umper+0.042
py+0.040
ained+0.038
neath+0.037
Neg logit contributions
five-0.048
Just-0.048
ager-0.047
ber-0.046
 nicely-0.044
Token what
Token ID644
Feature activation-7.08e-03

Pos logit contributions
umb+0.008
 need+0.007
py+0.007
umper+0.007
hing+0.007
Neg logit contributions
ber-0.004
Just-0.004
 belts-0.004
rd-0.004
ager-0.004
Token oxygen
Token ID11863
Feature activation+3.76e-02

Pos logit contributions
edly+0.011
Just+0.011
becca+0.011
uddled+0.011
rd+0.011
Neg logit contributions
 need-0.013
py-0.011
 strongest-0.011
 scatter-0.010
 crash-0.010
Token was
Token ID373
Feature activation+8.53e-02

Pos logit contributions
umb+0.072
umper+0.070
py+0.068
 need+0.067
hing+0.064
Neg logit contributions
ber-0.049
ves-0.046
 belts-0.045
ager-0.044
Just-0.044
Token,
Token ID11
Feature activation+9.99e-03

Pos logit contributions
umb+0.157
 need+0.155
py+0.151
umper+0.149
hing+0.143
Neg logit contributions
ber-0.116
Just-0.115
 belts-0.110
rd-0.108
ager-0.108
Token but
Token ID475
Feature activation+1.41e-02

Pos logit contributions
umb+0.019
umper+0.018
py+0.017
emaker+0.016
 need+0.016
Neg logit contributions
Just-0.015
 belts-0.014
ber-0.013
edly-0.013
ves-0.013
Token he
Token ID339
Feature activation+1.22e-03

Pos logit contributions
umb+0.023
 need+0.023
py+0.022
umper+0.022
hing+0.021
Neg logit contributions
ber-0.022
Just-0.022
 belts-0.021
ager-0.021
rd-0.020
Token liked
Token ID8288
Feature activation+6.39e-04

Pos logit contributions
 need+0.002
py+0.001
umb+0.001
umper+0.001
 strongest+0.001
Neg logit contributions
rd-0.002
ival-0.002
 belts-0.002
five-0.002
hand-0.002
Token the
Token ID262
Feature activation+1.31e-02

Pos logit contributions
 need+0.001
py+0.001
umb+0.001
 apart+0.001
 crash+0.001
Neg logit contributions
Just-0.001
ber-0.001
rd-0.001
ager-0.001
ival-0.001
Token him
Token ID683
Feature activation+2.81e-03

Pos logit contributions
umb+0.018
 need+0.018
py+0.017
umper+0.017
hing+0.016
Neg logit contributions
ber-0.020
Just-0.020
 belts-0.019
rd-0.019
ager-0.019
Token get
Token ID651
Feature activation+1.95e-02

Pos logit contributions
 need+0.004
umb+0.004
py+0.004
umper+0.004
 apart+0.004
Neg logit contributions
ber-0.005
Just-0.005
rd-0.005
 belts-0.005
ager-0.005
Token things
Token ID1243
Feature activation+1.06e-02

Pos logit contributions
umb+0.031
 need+0.030
py+0.030
umper+0.029
hing+0.028
Neg logit contributions
ber-0.031
Just-0.031
 belts-0.030
rd-0.029
ager-0.029
Token he
Token ID339
Feature activation+3.20e-03

Pos logit contributions
umb+0.018
py+0.018
 need+0.018
umper+0.017
hing+0.016
Neg logit contributions
ber-0.015
Just-0.015
 belts-0.015
ager-0.014
rd-0.014
Token couldn
Token ID3521
Feature activation+6.12e-02

Pos logit contributions
 need+0.004
umb+0.004
umper+0.004
py+0.004
 strongest+0.004
Neg logit contributions
ber-0.006
Just-0.006
rd-0.006
edly-0.006
 belts-0.006
Token't
Token ID470
Feature activation+8.52e-02

Pos logit contributions
py+0.081
umper+0.081
umb+0.076
 apart+0.075
 strongest+0.073
Neg logit contributions
ves-0.110
Just-0.109
rd-0.101
ork-0.098
--0.098
Token reach
Token ID3151
Feature activation+3.19e-02

Pos logit contributions
umb+0.139
py+0.136
umper+0.133
 need+0.124
 apart+0.123
Neg logit contributions
Just-0.132
ber-0.127
ager-0.125
ves-0.123
edly-0.120
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
umb+0.063
py+0.060
umper+0.060
 need+0.059
hing+0.058
Neg logit contributions
ber-0.036
 belts-0.036
Just-0.035
ves-0.035
 nicely-0.034
Token<|endoftext|>
Token ID50256
Feature activation-2.46e-02

Pos logit contributions
ber+0.022
Just+0.022
rd+0.020
ager+0.020
 belts+0.020
Neg logit contributions
umb-0.042
 need-0.041
py-0.041
umper-0.040
 apart-0.039
TokenOnce
Token ID7454
Feature activation+1.29e-02

Pos logit contributions
Just+0.027
ber+0.027
rd+0.026
 belts+0.025
ager+0.025
Neg logit contributions
umb-0.050
 need-0.050
py-0.049
umper-0.048
hing-0.047
Token there
Token ID612
Feature activation+9.91e-03

Pos logit contributions
umb+0.024
 need+0.023
py+0.023
umper+0.023
hing+0.022
Neg logit contributions
ber-0.017
Just-0.017
 belts-0.016
rd-0.016
ves-0.016
Token see
Token ID766
Feature activation+6.43e-03

Pos logit contributions
umb+0.020
 need+0.019
py+0.019
umper+0.018
 apart+0.017
Neg logit contributions
Just-0.023
ber-0.023
 belts-0.022
rd-0.022
ves-0.022
Token how
Token ID703
Feature activation-3.63e-02

Pos logit contributions
 need+0.009
py+0.009
umb+0.009
umper+0.009
 apart+0.008
Neg logit contributions
Just-0.012
ber-0.011
ager-0.011
edly-0.011
rd-0.011
Token tough
Token ID5802
Feature activation+2.89e-02

Pos logit contributions
ber+0.063
Just+0.061
rd+0.061
edly+0.061
 belts+0.060
Neg logit contributions
 need-0.055
umb-0.051
py-0.049
umper-0.048
 apart-0.048
Token the
Token ID262
Feature activation+3.38e-02

Pos logit contributions
umb+0.050
 need+0.048
py+0.047
umper+0.047
hing+0.044
Neg logit contributions
ber-0.043
Just-0.042
 belts-0.041
ves-0.040
rd-0.040
Token string
Token ID4731
Feature activation+3.20e-02

Pos logit contributions
umb+0.054
 need+0.054
py+0.051
umper+0.050
 apart+0.048
Neg logit contributions
ber-0.054
Just-0.054
 belts-0.052
rd-0.052
ager-0.051
Token was
Token ID373
Feature activation+8.52e-02

Pos logit contributions
umper+0.059
umb+0.058
py+0.056
 need+0.054
hing+0.050
Neg logit contributions
ber-0.042
ves-0.042
 belts-0.042
Just-0.041
uddled-0.039
Token.
Token ID13
Feature activation-5.41e-03

Pos logit contributions
umb+0.165
 need+0.161
py+0.160
umper+0.159
hing+0.151
Neg logit contributions
Just-0.106
ber-0.106
 belts-0.101
ager-0.098
rd-0.098
Token
Token ID198
Feature activation-8.63e-03

Pos logit contributions
ber+0.012
ves+0.012
 belts+0.012
rd+0.011
ager+0.011
Neg logit contributions
 need-0.005
umb-0.005
 apart-0.005
umper-0.005
py-0.004
Token
Token ID198
Feature activation+1.09e-02

Pos logit contributions
 belts+0.015
ber+0.015
edly+0.014
uddled+0.014
ager+0.013
Neg logit contributions
 need-0.013
umb-0.013
py-0.012
umper-0.012
hing-0.011
TokenPeople
Token ID8061
Feature activation+8.96e-03

Pos logit contributions
 need+0.019
umb+0.019
hing+0.018
py+0.017
umper+0.017
Neg logit contributions
ber-0.016
 belts-0.015
uddled-0.015
aj-0.014
edly-0.014
Token in
Token ID287
Feature activation+8.07e-03

Pos logit contributions
 need+0.012
umb+0.012
py+0.012
umper+0.012
hing+0.011
Neg logit contributions
Just-0.016
ber-0.016
rd-0.016
 belts-0.016
ager-0.016
Token noticed
Token ID6810
Feature activation-1.69e-02

Pos logit contributions
ber+0.038
Just+0.038
rd+0.037
 belts+0.037
ager+0.036
Neg logit contributions
 need-0.036
umb-0.035
py-0.034
umper-0.033
 apart-0.033
Token how
Token ID703
Feature activation-4.21e-02

Pos logit contributions
ber+0.026
Just+0.026
rd+0.025
ager+0.025
 belts+0.025
Neg logit contributions
 need-0.027
py-0.027
umb-0.026
umper-0.026
 apart-0.025
Token smooth
Token ID7209
Feature activation+3.47e-02

Pos logit contributions
ber+0.076
Just+0.073
rd+0.072
edly+0.072
 belts+0.072
Neg logit contributions
 need-0.062
umb-0.057
py-0.055
umper-0.054
 apart-0.053
Token the
Token ID262
Feature activation+2.60e-02

Pos logit contributions
umb+0.071
 need+0.069
umper+0.068
py+0.067
hing+0.063
Neg logit contributions
 belts-0.040
ber-0.039
Just-0.039
ves-0.039
edly-0.038
Token grass
Token ID8701
Feature activation+8.27e-03

Pos logit contributions
 need+0.036
umb+0.036
py+0.034
umper+0.033
 apart+0.032
Neg logit contributions
ber-0.047
Just-0.047
rd-0.045
ves-0.045
ager-0.045
Token felt
Token ID2936
Feature activation+8.49e-02

Pos logit contributions
umb+0.016
 need+0.016
umper+0.016
py+0.016
hing+0.015
Neg logit contributions
ber-0.010
Just-0.009
 belts-0.009
ves-0.009
uddled-0.009
Token beneath
Token ID11061
Feature activation+9.01e-03

Pos logit contributions
umb+0.145
py+0.144
umper+0.143
 need+0.136
hing+0.133
Neg logit contributions
Just-0.126
ber-0.119
edly-0.115
 belts-0.115
ager-0.113
Token her
Token ID607
Feature activation+5.50e-03

Pos logit contributions
umb+0.014
 need+0.014
umper+0.014
py+0.014
hing+0.013
Neg logit contributions
Just-0.014
ber-0.014
 belts-0.014
edly-0.014
ves-0.013
Token feet
Token ID3625
Feature activation+6.83e-03

Pos logit contributions
 need+0.009
umb+0.009
py+0.009
umper+0.009
 apart+0.009
Neg logit contributions
ber-0.008
Just-0.008
ager-0.008
 belts-0.008
rd-0.008
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
umb+0.013
 need+0.013
py+0.012
umper+0.012
hing+0.011
Neg logit contributions
ber-0.009
Just-0.009
 belts-0.009
rd-0.009
ager-0.008
Token
Token ID220
Feature activation-1.67e-02

Pos logit contributions
ber+0.029
ves+0.028
 belts+0.028
ager+0.028
rd+0.028
Neg logit contributions
 need-0.008
umb-0.008
 apart-0.006
py-0.006
 crash-0.006
Token running
Token ID2491
Feature activation-8.25e-03

Pos logit contributions
umb+0.004
 need+0.004
py+0.004
umper+0.004
hing+0.003
Neg logit contributions
ber-0.004
Just-0.004
 belts-0.004
rd-0.004
ves-0.004
Token away
Token ID1497
Feature activation+5.76e-03

Pos logit contributions
Just+0.012
ber+0.012
 belts+0.012
rd+0.011
ager+0.011
Neg logit contributions
umb-0.014
 need-0.014
umper-0.014
py-0.014
hing-0.013
Token from
Token ID422
Feature activation+1.14e-02

Pos logit contributions
umb+0.011
umper+0.011
 need+0.011
py+0.011
hing+0.011
Neg logit contributions
ber-0.007
Just-0.007
 belts-0.007
edly-0.007
rd-0.007
Token problems
Token ID2761
Feature activation+8.71e-04

Pos logit contributions
umb+0.016
 need+0.015
py+0.015
umper+0.014
hing+0.014
Neg logit contributions
ber-0.021
Just-0.021
 belts-0.020
rd-0.020
ager-0.020
Token won
Token ID1839
Feature activation+2.94e-02

Pos logit contributions
 need+0.001
umb+0.001
py+0.001
 apart+0.001
umper+0.001
Neg logit contributions
Just-0.001
ber-0.001
ager-0.001
rd-0.001
edly-0.001
Token't
Token ID470
Feature activation+8.48e-02

Pos logit contributions
umper+0.048
umb+0.047
 strongest+0.043
py+0.042
hing+0.041
Neg logit contributions
ves-0.047
Just-0.046
--0.043
ork-0.042
 belts-0.042
Token solve
Token ID8494
Feature activation+3.64e-02

Pos logit contributions
umper+0.140
umb+0.138
py+0.123
 strongest+0.121
hing+0.121
Neg logit contributions
Just-0.135
ves-0.122
ager-0.121
 else-0.118
 nicely-0.117
Token them
Token ID606
Feature activation-5.53e-03

Pos logit contributions
umb+0.055
 need+0.054
py+0.052
umper+0.051
hing+0.049
Neg logit contributions
ber-0.062
Just-0.061
 belts-0.059
rd-0.058
ager-0.058
Token.
Token ID13
Feature activation+5.03e-03

Pos logit contributions
ber+0.006
Just+0.006
rd+0.006
 belts+0.006
edly+0.006
Neg logit contributions
umb-0.011
umper-0.011
py-0.011
 need-0.011
hing-0.011
Token So
Token ID1406
Feature activation-1.26e-04

Pos logit contributions
 need+0.005
umb+0.004
 apart+0.004
py+0.004
umper+0.004
Neg logit contributions
ber-0.012
ves-0.011
ager-0.011
rd-0.011
 belts-0.011
Token,
Token ID11
Feature activation+6.67e-03

Pos logit contributions
Just+0.000
ber+0.000
 belts+0.000
rd+0.000
ager+0.000
Neg logit contributions
umb-0.000
 need-0.000
py-0.000
umper-0.000
hing-0.000
Token't
Token ID470
Feature activation+4.84e-02

Pos logit contributions
umper+0.037
py+0.036
 strongest+0.034
umb+0.034
 need+0.033
Neg logit contributions
Just-0.052
ves-0.050
ork-0.047
ival-0.045
rd-0.045
Token believe
Token ID1975
Feature activation-3.41e-02

Pos logit contributions
 need+0.075
umb+0.074
py+0.071
umper+0.069
hing+0.067
Neg logit contributions
ber-0.081
Just-0.079
 belts-0.078
rd-0.077
ager-0.076
Token how
Token ID703
Feature activation-2.96e-02

Pos logit contributions
ber+0.050
Just+0.050
 belts+0.048
rd+0.047
edly+0.047
Neg logit contributions
umb-0.060
 need-0.059
py-0.056
umper-0.056
hing-0.054
Token cold
Token ID4692
Feature activation+3.74e-02

Pos logit contributions
ber+0.057
 belts+0.056
edly+0.055
uddled+0.054
rd+0.053
Neg logit contributions
 need-0.043
umb-0.037
 apart-0.036
 scatter-0.035
py-0.035
Token it
Token ID340
Feature activation+2.82e-02

Pos logit contributions
umb+0.069
 need+0.067
umper+0.066
py+0.065
hing+0.062
Neg logit contributions
 belts-0.049
ber-0.049
ves-0.048
Just-0.048
edly-0.047
Token felt
Token ID2936
Feature activation+8.46e-02

Pos logit contributions
umb+0.065
umper+0.065
py+0.062
 need+0.061
hing+0.059
Neg logit contributions
Just-0.024
ves-0.024
uddled-0.022
 belts-0.022
 else-0.021
Token!
Token ID0
Feature activation-8.50e-03

Pos logit contributions
umb+0.166
py+0.163
umper+0.162
 need+0.161
hing+0.154
Neg logit contributions
Just-0.103
ber-0.101
 belts-0.095
rd-0.095
ager-0.094
Token She
Token ID1375
Feature activation+6.65e-04

Pos logit contributions
ber+0.020
ager+0.019
 belts+0.019
Just+0.019
rd+0.019
Neg logit contributions
 need-0.008
umb-0.007
py-0.007
umper-0.006
 apart-0.006
Token couldn
Token ID3521
Feature activation+1.55e-02

Pos logit contributions
 need+0.001
 scatter+0.001
py+0.001
umb+0.001
umper+0.001
Neg logit contributions
rd-0.001
ber-0.001
ival-0.001
 belts-0.001
five-0.001
Token't
Token ID470
Feature activation+3.67e-02

Pos logit contributions
umper+0.021
py+0.021
 strongest+0.020
umb+0.020
 need+0.019
Neg logit contributions
Just-0.029
ves-0.028
ork-0.026
ival-0.026
uddled-0.025
Token believe
Token ID1975
Feature activation-3.94e-02

Pos logit contributions
 need+0.073
umb+0.072
py+0.070
umper+0.068
hing+0.067
Neg logit contributions
ber-0.045
Just-0.044
 belts-0.043
rd-0.042
uddled-0.041
Token spray
Token ID11662
Feature activation+2.23e-02

Pos logit contributions
umb+0.022
umper+0.022
py+0.022
hing+0.019
 need+0.019
Neg logit contributions
Just-0.026
ber-0.024
 nicely-0.023
ves-0.023
 belts-0.023
Token paint
Token ID7521
Feature activation+2.53e-02

Pos logit contributions
umb+0.034
umper+0.033
 need+0.033
py+0.033
hing+0.030
Neg logit contributions
Just-0.035
ber-0.035
 belts-0.035
ves-0.034
rd-0.033
Token anything
Token ID1997
Feature activation+1.05e-02

Pos logit contributions
umb+0.040
 need+0.039
umper+0.038
py+0.038
hing+0.035
Neg logit contributions
ber-0.040
 belts-0.040
Just-0.040
ves-0.038
rd-0.038
Token that
Token ID326
Feature activation+4.64e-02

Pos logit contributions
umb+0.017
py+0.017
umper+0.016
 need+0.016
hing+0.015
Neg logit contributions
ber-0.016
Just-0.016
 belts-0.016
ves-0.015
rd-0.015
Token shouldn
Token ID6584
Feature activation+4.99e-02

Pos logit contributions
umb+0.073
umper+0.070
py+0.069
hing+0.063
 need+0.062
Neg logit contributions
ber-0.073
 belts-0.070
Just-0.070
 else-0.069
ves-0.069
Token't
Token ID470
Feature activation+8.46e-02

Pos logit contributions
umper+0.069
 strongest+0.066
umb+0.066
 apart+0.065
py+0.064
Neg logit contributions
ves-0.086
Just-0.084
ork-0.079
sh-0.074
ber-0.074
Token be
Token ID307
Feature activation+4.05e-02

Pos logit contributions
umb+0.126
umper+0.124
py+0.122
 need+0.116
hing+0.113
Neg logit contributions
Just-0.139
ves-0.137
ber-0.135
ager-0.130
five-0.128
Token painted
Token ID13055
Feature activation+4.22e-02

Pos logit contributions
umb+0.056
py+0.054
umper+0.053
 need+0.052
hing+0.051
Neg logit contributions
ber-0.072
Just-0.072
 belts-0.068
ager-0.067
ves-0.067
Token."
Token ID526
Feature activation-1.91e-02

Pos logit contributions
umb+0.075
 need+0.074
umper+0.074
py+0.070
hing+0.066
Neg logit contributions
ber-0.055
 belts-0.054
rd-0.054
Just-0.053
ves-0.053
Token
Token ID220
Feature activation-1.24e-02

Pos logit contributions
ber+0.032
 belts+0.030
ager+0.030
Just+0.030
ves+0.029
Neg logit contributions
 need-0.031
umb-0.029
py-0.028
umper-0.028
 apart-0.028
Token
Token ID198
Feature activation-7.95e-03

Pos logit contributions
ber+0.014
Just+0.014
 belts+0.014
rd+0.013
ager+0.013
Neg logit contributions
umb-0.025
 need-0.025
py-0.024
umper-0.024
 apart-0.023
Token noticed
Token ID6810
Feature activation-1.39e-02

Pos logit contributions
 need+0.003
umb+0.003
py+0.003
umper+0.003
 apart+0.003
Neg logit contributions
ber-0.003
Just-0.003
rd-0.003
 belts-0.003
edly-0.003
Token how
Token ID703
Feature activation-4.59e-02

Pos logit contributions
rd+0.024
ber+0.023
ager+0.023
Just+0.023
 belts+0.022
Neg logit contributions
 need-0.020
py-0.020
umb-0.020
umper-0.020
 apart-0.018
Token smooth
Token ID7209
Feature activation+3.20e-02

Pos logit contributions
ber+0.082
Just+0.081
 belts+0.080
rd+0.080
ager+0.079
Neg logit contributions
 need-0.068
umb-0.064
py-0.061
umper-0.060
hing-0.057
Token the
Token ID262
Feature activation+1.96e-02

Pos logit contributions
umb+0.067
 need+0.064
py+0.061
umper+0.061
 earthquake+0.059
Neg logit contributions
 belts-0.038
ves-0.038
ber-0.038
Just-0.037
edly-0.036
Token grass
Token ID8701
Feature activation+3.60e-03

Pos logit contributions
umb+0.029
 need+0.029
py+0.028
umper+0.028
hing+0.026
Neg logit contributions
Just-0.033
ber-0.033
 belts-0.032
ager-0.032
rd-0.031
Token felt
Token ID2936
Feature activation+8.45e-02

Pos logit contributions
umb+0.007
umper+0.007
 need+0.007
py+0.007
 apart+0.006
Neg logit contributions
ber-0.004
Just-0.004
 belts-0.004
ves-0.004
uddled-0.004
Token on
Token ID319
Feature activation+3.40e-02

Pos logit contributions
umb+0.133
umper+0.132
py+0.131
 need+0.123
hing+0.121
Neg logit contributions
Just-0.138
ber-0.129
edly-0.126
 belts-0.126
ves-0.124
Token his
Token ID465
Feature activation+7.25e-03

Pos logit contributions
umb+0.067
umper+0.066
hing+0.062
py+0.062
 need+0.062
Neg logit contributions
Just-0.043
edly-0.040
 belts-0.040
ves-0.040
ber-0.039
Token feet
Token ID3625
Feature activation+1.03e-02

Pos logit contributions
py+0.010
 need+0.010
umb+0.010
 apart+0.009
 strongest+0.009
Neg logit contributions
ival-0.014
ber-0.013
ager-0.013
rd-0.013
Just-0.013
Token.
Token ID13
Feature activation+3.09e-03

Pos logit contributions
umb+0.019
 need+0.019
py+0.019
umper+0.019
hing+0.018
Neg logit contributions
Just-0.013
ber-0.013
 belts-0.013
rd-0.012
ves-0.012
Token He
Token ID679
Feature activation+1.45e-03

Pos logit contributions
umb+0.004
 need+0.004
py+0.004
umper+0.004
 apart+0.004
Neg logit contributions
rd-0.006
ber-0.006
Just-0.006
ager-0.006
 belts-0.006
Token didn
Token ID1422
Feature activation+3.23e-02

Pos logit contributions
umb+0.001
 need+0.001
umper+0.001
py+0.001
 apart+0.001
Neg logit contributions
ber-0.001
Just-0.001
 belts-0.001
ager-0.001
rd-0.001
Token't
Token ID470
Feature activation+3.65e-02

Pos logit contributions
umper+0.044
py+0.044
umb+0.042
 strongest+0.042
 need+0.040
Neg logit contributions
Just-0.060
ves-0.059
ork-0.055
ggy-0.054
ber-0.053
Token like
Token ID588
Feature activation+9.94e-04

Pos logit contributions
umb+0.056
umper+0.052
ained+0.052
neath+0.051
py+0.049
Neg logit contributions
five-0.058
Just-0.055
ager-0.054
 else-0.054
 nicely-0.053
Token how
Token ID703
Feature activation-1.80e-02

Pos logit contributions
umb+0.002
 need+0.002
py+0.002
umper+0.002
hing+0.002
Neg logit contributions
ber-0.001
Just-0.001
 belts-0.001
rd-0.001
ves-0.001
Token it
Token ID340
Feature activation+4.32e-02

Pos logit contributions
ber+0.030
Just+0.029
 belts+0.029
ager+0.028
edly+0.028
Neg logit contributions
 need-0.029
umb-0.027
py-0.027
umper-0.026
 apart-0.025
Token felt
Token ID2936
Feature activation+8.42e-02

Pos logit contributions
umb+0.078
umper+0.076
py+0.071
 need+0.071
 apart+0.068
Neg logit contributions
ber-0.059
Just-0.058
ager-0.056
ves-0.056
 belts-0.055
Token when
Token ID618
Feature activation+4.56e-02

Pos logit contributions
umb+0.168
umper+0.166
py+0.161
hing+0.152
 need+0.151
Neg logit contributions
ber-0.097
Just-0.097
rd-0.093
edly-0.090
 nicely-0.090
Token it
Token ID340
Feature activation+2.52e-02

Pos logit contributions
umb+0.079
 need+0.075
umper+0.074
py+0.074
hing+0.071
Neg logit contributions
ber-0.068
Just-0.067
 belts-0.065
rd-0.064
ves-0.063
Token rubbed
Token ID31862
Feature activation+2.79e-02

Pos logit contributions
umb+0.040
umper+0.040
 need+0.038
py+0.037
hing+0.035
Neg logit contributions
Just-0.039
ber-0.039
uddled-0.037
ves-0.037
 belts-0.037
Token against
Token ID1028
Feature activation+7.24e-03

Pos logit contributions
umb+0.048
 need+0.047
umper+0.046
py+0.045
hing+0.043
Neg logit contributions
ber-0.040
Just-0.040
 belts-0.040
edly-0.039
ves-0.038
Token her
Token ID607
Feature activation+8.93e-03

Pos logit contributions
umb+0.011
 need+0.011
umper+0.010
py+0.010
hing+0.010
Neg logit contributions
ber-0.012
Just-0.012
 belts-0.012
edly-0.011
rd-0.011
Token even
Token ID772
Feature activation+1.66e-02

Pos logit contributions
umb+0.055
umper+0.053
py+0.052
 need+0.051
hing+0.049
Neg logit contributions
ber-0.036
Just-0.036
 belts-0.035
ves-0.034
rd-0.033
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
umb+0.031
py+0.028
umper+0.028
hing+0.027
 need+0.027
Neg logit contributions
ber-0.023
 belts-0.023
Just-0.023
 nicely-0.022
 else-0.022
Token biggest
Token ID4094
Feature activation-8.40e-04

Pos logit contributions
umb+0.015
py+0.014
umper+0.014
 need+0.014
hing+0.013
Neg logit contributions
ber-0.020
Just-0.020
 belts-0.020
ager-0.019
rd-0.019
Token storm
Token ID6388
Feature activation+2.45e-02

Pos logit contributions
ber+0.001
Just+0.001
 belts+0.001
rd+0.001
ager+0.001
Neg logit contributions
umb-0.001
umper-0.001
py-0.001
 need-0.001
hing-0.001
Token couldn
Token ID3521
Feature activation+5.71e-02

Pos logit contributions
umb+0.039
 need+0.039
py+0.037
umper+0.037
 apart+0.035
Neg logit contributions
ber-0.039
Just-0.038
 belts-0.037
ager-0.037
rd-0.037
Token't
Token ID470
Feature activation+8.41e-02

Pos logit contributions
umper+0.076
py+0.070
umb+0.070
 strongest+0.067
 need+0.063
Neg logit contributions
ves-0.108
Just-0.108
ork-0.101
--0.097
rd-0.096
Token knock
Token ID10643
Feature activation+8.77e-03

Pos logit contributions
umb+0.122
umper+0.118
 need+0.115
py+0.115
 apart+0.105
Neg logit contributions
Just-0.146
ber-0.143
ves-0.140
 belts-0.139
ager-0.137
Token it
Token ID340
Feature activation+6.93e-03

Pos logit contributions
umb+0.014
py+0.014
 need+0.014
umper+0.013
hing+0.012
Neg logit contributions
ber-0.014
 belts-0.013
Just-0.013
ves-0.013
ival-0.012
Token down
Token ID866
Feature activation-9.18e-03

Pos logit contributions
umper+0.012
umb+0.011
py+0.011
 need+0.011
emaker+0.010
Neg logit contributions
Just-0.010
 belts-0.010
 nicely-0.009
ber-0.009
edly-0.009
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
ber+0.010
Just+0.010
 belts+0.010
edly+0.010
rd+0.009
Neg logit contributions
umb-0.019
umper-0.018
py-0.018
 need-0.018
hing-0.017
Token Tim
Token ID5045
Feature activation+9.62e-03

Pos logit contributions
ber+0.016
 belts+0.016
Just+0.015
ager+0.015
ves+0.015
Neg logit contributions
 need-0.017
umb-0.017
umper-0.016
py-0.016
 apart-0.015
Token
Token ID198
Feature activation-2.21e-03

Pos logit contributions
 need+0.010
umb+0.009
py+0.009
 apart+0.008
umper+0.008
Neg logit contributions
ber-0.015
 belts-0.015
Just-0.015
ager-0.014
rd-0.014
Token
Token ID198
Feature activation+1.56e-02

Pos logit contributions
ber+0.004
 belts+0.003
edly+0.003
uddled+0.003
rd+0.003
Neg logit contributions
 need-0.004
umb-0.004
py-0.003
umper-0.003
hing-0.003
TokenGeorge
Token ID20191
Feature activation-2.58e-02

Pos logit contributions
 need+0.027
hing+0.025
py+0.025
umb+0.024
 apart+0.023
Neg logit contributions
ber-0.025
uddled-0.024
 belts-0.024
edly-0.023
aj-0.022
Token asked
Token ID1965
Feature activation-2.94e-02

Pos logit contributions
 belts+0.046
rd+0.045
ves+0.045
ber+0.045
ival+0.045
Neg logit contributions
 need-0.037
umb-0.033
py-0.032
umper-0.032
 apart-0.030
Token the
Token ID262
Feature activation-6.28e-03

Pos logit contributions
Just+0.063
ber+0.060
ager+0.060
uddled+0.060
rd+0.059
Neg logit contributions
 need-0.035
py-0.033
umb-0.031
 apart-0.031
umper-0.029
Token bar
Token ID2318
Feature activation-8.43e-02

Pos logit contributions
uddled+0.011
rd+0.010
ves+0.010
edly+0.010
Just+0.010
Neg logit contributions
 need-0.010
 strongest-0.010
 crash-0.009
umb-0.009
 fastest-0.008
Tokenber
Token ID527
Feature activation-5.38e-02

Pos logit contributions
uddled+0.069
Just+0.068
rd+0.067
ves+0.061
aj+0.060
Neg logit contributions
 need-0.193
umb-0.187
 strongest-0.178
py-0.176
emaker-0.174
Token for
Token ID329
Feature activation-4.36e-02

Pos logit contributions
Just+0.117
uddled+0.112
ager+0.110
ves+0.110
edly+0.109
Neg logit contributions
 need-0.066
 apart-0.056
py-0.056
umper-0.052
umb-0.052
Token a
Token ID257
Feature activation-1.55e-02

Pos logit contributions
uddled+0.099
ber+0.096
ager+0.094
Just+0.094
rd+0.092
Neg logit contributions
 need-0.056
 crash-0.040
py-0.039
 apart-0.036
 scatter-0.036
Token new
Token ID649
Feature activation-9.79e-03

Pos logit contributions
edly+0.025
uddled+0.024
Just+0.022
ves+0.022
 belts+0.022
Neg logit contributions
 need-0.029
 crash-0.026
 slip-0.025
 strongest-0.025
 scatter-0.025
Token hairst
Token ID42658
Feature activation-4.73e-02

Pos logit contributions
uddled+0.016
ival+0.016
Just+0.015
ves+0.015
edly+0.015
Neg logit contributions
 need-0.017
umb-0.014
 strongest-0.014
 crash-0.014
 urge-0.014
Token the
Token ID262
Feature activation-4.32e-03

Pos logit contributions
ber+0.027
Just+0.027
rd+0.026
edly+0.026
 belts+0.026
Neg logit contributions
 need-0.016
umb-0.015
umper-0.014
py-0.014
hing-0.013
Token counter
Token ID3753
Feature activation-3.55e-02

Pos logit contributions
Just+0.008
edly+0.008
ves+0.007
ber+0.007
ager+0.007
Neg logit contributions
 need-0.006
umb-0.005
 scatter-0.005
 drain-0.005
 strongest-0.005
Token and
Token ID290
Feature activation-2.72e-02

Pos logit contributions
ves+0.066
Just+0.064
rd+0.063
ager+0.063
ber+0.062
Neg logit contributions
 need-0.048
umb-0.048
 apart-0.045
umper-0.040
 crash-0.038
Token asked
Token ID1965
Feature activation-3.86e-02

Pos logit contributions
rd+0.054
ves+0.052
Just+0.050
ber+0.050
ager+0.050
Neg logit contributions
 need-0.034
 apart-0.034
 Sammy-0.032
 scatter-0.031
umb-0.030
Token the
Token ID262
Feature activation-1.31e-02

Pos logit contributions
ager+0.081
Just+0.081
edly+0.079
uddled+0.078
rd+0.078
Neg logit contributions
 need-0.046
py-0.041
 apart-0.040
umper-0.038
 Sammy-0.038
Token bar
Token ID2318
Feature activation-8.35e-02

Pos logit contributions
ves+0.021
Just+0.021
ber+0.021
edly+0.021
ager+0.021
Neg logit contributions
 need-0.022
umb-0.019
 strongest-0.019
 urge-0.017
 crash-0.017
Tokenista
Token ID12523
Feature activation-4.17e-02

Pos logit contributions
uddled+0.075
Just+0.074
Aud+0.073
rd+0.069
ves+0.067
Neg logit contributions
 need-0.177
umb-0.173
 strongest-0.159
igator-0.158
emaker-0.156
Token if
Token ID611
Feature activation-4.02e-03

Pos logit contributions
Just+0.082
uddled+0.080
ves+0.079
ager+0.078
rd+0.076
Neg logit contributions
 need-0.054
 apart-0.047
umper-0.046
umb-0.044
py-0.043
Token they
Token ID484
Feature activation+2.65e-03

Pos logit contributions
Just+0.007
rd+0.007
edly+0.007
ager+0.007
ber+0.007
Neg logit contributions
 need-0.006
py-0.005
umper-0.005
umb-0.005
 apart-0.005
Token could
Token ID714
Feature activation+1.99e-02

Pos logit contributions
 need+0.004
umb+0.004
umper+0.003
py+0.003
 apart+0.003
Neg logit contributions
Just-0.005
ber-0.005
rd-0.005
 belts-0.005
ager-0.005
Token print
Token ID3601
Feature activation+6.58e-03

Pos logit contributions
umb+0.028
py+0.028
umper+0.027
 need+0.027
 apart+0.025
Neg logit contributions
Just-0.035
ber-0.034
 belts-0.033
ager-0.033
ves-0.033
Token shop
Token ID6128
Feature activation-2.74e-02

Pos logit contributions
 need+0.008
 strongest+0.007
 earthquake+0.006
 urge+0.006
 tallest+0.006
Neg logit contributions
ves-0.009
ival-0.008
Just-0.008
uddled-0.008
ager-0.008
Token.
Token ID13
Feature activation-4.27e-03

Pos logit contributions
Just+0.045
ves+0.043
ager+0.042
rd+0.041
uddled+0.041
Neg logit contributions
 need-0.044
umb-0.042
 apart-0.040
 strongest-0.039
py-0.037
Token He
Token ID679
Feature activation-1.03e-02

Pos logit contributions
ber+0.009
ves+0.009
 belts+0.009
Just+0.009
uddled+0.009
Neg logit contributions
 need-0.004
umb-0.004
umper-0.004
py-0.004
 apart-0.003
Token thanked
Token ID26280
Feature activation+6.62e-03

Pos logit contributions
rd+0.017
ber+0.017
Just+0.016
 belts+0.016
ves+0.016
Neg logit contributions
 need-0.016
umb-0.015
py-0.014
umper-0.014
 apart-0.013
Token the
Token ID262
Feature activation-1.18e-03

Pos logit contributions
 need+0.008
umb+0.008
py+0.008
umper+0.007
 apart+0.007
Neg logit contributions
ber-0.013
Just-0.013
 belts-0.013
rd-0.013
ager-0.013
Token bar
Token ID2318
Feature activation-8.24e-02

Pos logit contributions
uddled+0.002
ves+0.002
ues+0.002
edly+0.002
rd+0.002
Neg logit contributions
 need-0.002
 strongest-0.002
 crash-0.002
 urge-0.002
 smartest-0.002
Tokenber
Token ID527
Feature activation-4.16e-02

Pos logit contributions
rd+0.064
ves+0.062
Just+0.061
uddled+0.060
Aud+0.058
Neg logit contributions
 need-0.183
umb-0.177
 strongest-0.170
emaker-0.168
 crash-0.163
Token for
Token ID329
Feature activation-2.37e-02

Pos logit contributions
ves+0.084
Just+0.084
rd+0.080
uddled+0.080
ager+0.077
Neg logit contributions
 need-0.047
 apart-0.044
umb-0.040
py-0.038
umper-0.037
Token the
Token ID262
Feature activation+9.70e-03

Pos logit contributions
ber+0.048
Just+0.047
rd+0.046
 belts+0.046
uddled+0.045
Neg logit contributions
 need-0.030
py-0.028
umb-0.027
umper-0.026
hing-0.025
Token great
Token ID1049
Feature activation-6.85e-03

Pos logit contributions
 need+0.016
 strongest+0.015
 crash+0.013
 fastest+0.013
 urge+0.013
Neg logit contributions
uddled-0.017
ves-0.017
Just-0.017
edly-0.016
ues-0.016
Token job
Token ID1693
Feature activation-3.34e-02

Pos logit contributions
Just+0.010
uddled+0.010
edly+0.010
ves+0.010
 belts+0.009
Neg logit contributions
 need-0.013
umb-0.012
 strongest-0.011
 crash-0.011
 apart-0.011
Token goodbye
Token ID24829
Feature activation-2.39e-02

Pos logit contributions
Just+0.041
 belts+0.040
rd+0.040
ber+0.040
becca+0.040
Neg logit contributions
 need-0.033
 crash-0.031
py-0.030
 apart-0.030
hing-0.029
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
ival+0.048
ves+0.048
becca+0.045
Re+0.045
rd+0.045
Neg logit contributions
 apart-0.032
 need-0.029
umb-0.025
 lose-0.025
 feel-0.024
Token He
Token ID679
Feature activation-1.23e-02

Pos logit contributions
ber+0.020
Just+0.020
 belts+0.019
ager+0.019
rd+0.019
Neg logit contributions
umb-0.017
 need-0.017
py-0.017
umper-0.016
hing-0.015
Token left
Token ID1364
Feature activation-3.89e-03

Pos logit contributions
ber+0.020
rd+0.020
 belts+0.020
Just+0.020
ival+0.019
Neg logit contributions
 need-0.019
umb-0.018
py-0.017
umper-0.017
 apart-0.016
Token the
Token ID262
Feature activation-8.11e-03

Pos logit contributions
Just+0.008
ber+0.008
rd+0.008
ager+0.007
 belts+0.007
Neg logit contributions
 need-0.005
umb-0.005
 apart-0.004
py-0.004
umper-0.004
Token bar
Token ID2318
Feature activation-8.20e-02

Pos logit contributions
ves+0.013
uddled+0.012
Just+0.012
edly+0.012
ues+0.012
Neg logit contributions
 need-0.014
 strongest-0.013
umb-0.012
 lose-0.012
 crash-0.012
Tokenber
Token ID527
Feature activation-4.12e-02

Pos logit contributions
rd+0.051
ves+0.048
Just+0.044
becca+0.043
 findings+0.043
Neg logit contributions
 need-0.204
umb-0.192
 strongest-0.188
 lose-0.187
 apart-0.185
Token feeling
Token ID4203
Feature activation+1.55e-02

Pos logit contributions
ves+0.076
Just+0.075
becca+0.075
rd+0.073
uddled+0.072
Neg logit contributions
 need-0.057
 apart-0.055
umb-0.050
 crash-0.049
 strongest-0.046
Token happy
Token ID3772
Feature activation-1.59e-02

Pos logit contributions
umb+0.030
py+0.029
 need+0.029
umper+0.029
hing+0.027
Neg logit contributions
Just-0.020
ber-0.020
rd-0.018
ager-0.018
 belts-0.018
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
Just+0.026
ves+0.026
ber+0.026
rd+0.026
 belts+0.025
Neg logit contributions
 need-0.025
umb-0.023
py-0.023
 apart-0.023
umper-0.022
Token ended
Token ID4444
Feature activation+2.32e-02

Pos logit contributions
ber+0.022
Just+0.022
 belts+0.021
ager+0.021
rd+0.021
Neg logit contributions
umb-0.034
 need-0.033
py-0.033
umper-0.032
hing-0.031
Token now
Token ID783
Feature activation+2.74e-02

Pos logit contributions
 need+0.009
umb+0.009
 apart+0.009
umper+0.008
py+0.008
Neg logit contributions
ves-0.010
ival-0.010
 belts-0.010
rd-0.010
Just-0.010
Token.
Token ID13
Feature activation+1.19e-03

Pos logit contributions
 need+0.045
umb+0.042
py+0.040
 apart+0.040
umper+0.039
Neg logit contributions
ber-0.044
 belts-0.043
Just-0.043
ival-0.042
ager-0.041
Token She
Token ID1375
Feature activation+6.42e-03

Pos logit contributions
umb+0.001
 need+0.001
py+0.001
umper+0.001
 apart+0.001
Neg logit contributions
ber-0.003
Just-0.003
 belts-0.003
rd-0.002
ager-0.002
Token thanked
Token ID26280
Feature activation+4.64e-03

Pos logit contributions
 need+0.010
umb+0.009
py+0.009
umper+0.009
 apart+0.008
Neg logit contributions
ber-0.011
 belts-0.011
rd-0.011
Just-0.010
ival-0.010
Token the
Token ID262
Feature activation-2.77e-03

Pos logit contributions
umb+0.006
 need+0.006
py+0.006
umper+0.006
 apart+0.006
Neg logit contributions
ber-0.009
Just-0.008
 belts-0.008
rd-0.008
ager-0.008
Token bar
Token ID2318
Feature activation-8.02e-02

Pos logit contributions
ves+0.005
rd+0.005
Just+0.005
uddled+0.004
edly+0.004
Neg logit contributions
 need-0.005
 strongest-0.004
umb-0.004
py-0.004
 earthquake-0.004
Tokenber
Token ID527
Feature activation-3.68e-02

Pos logit contributions
Just+0.063
ves+0.061
rd+0.059
uddled+0.056
aj+0.052
Neg logit contributions
umb-0.179
 need-0.176
py-0.171
emaker-0.169
 strongest-0.165
Token,
Token ID11
Feature activation-3.30e-02

Pos logit contributions
Just+0.073
ves+0.071
uddled+0.069
ager+0.067
rd+0.067
Neg logit contributions
 need-0.046
 apart-0.042
py-0.041
umb-0.040
umper-0.038
Token who
Token ID508
Feature activation-3.28e-02

Pos logit contributions
rd+0.059
ves+0.056
ber+0.056
ager+0.055
ival+0.055
Neg logit contributions
 need-0.051
umb-0.046
 apart-0.045
umper-0.045
py-0.045
Token was
Token ID373
Feature activation+1.97e-02

Pos logit contributions
ber+0.050
Just+0.049
 belts+0.047
ager+0.047
rd+0.046
Neg logit contributions
umb-0.055
 need-0.055
py-0.054
umper-0.052
 apart-0.050
Token still
Token ID991
Feature activation+3.88e-03

Pos logit contributions
 need+0.029
umb+0.028
py+0.027
umper+0.027
 apart+0.026
Neg logit contributions
Just-0.034
ber-0.034
 belts-0.033
ager-0.033
ves-0.033
Token go
Token ID467
Feature activation+8.30e-03

Pos logit contributions
ber+0.011
Just+0.011
 belts+0.010
rd+0.010
ager+0.010
Neg logit contributions
umb-0.008
 need-0.008
py-0.008
umper-0.008
hing-0.007
Token!
Token ID0
Feature activation-1.80e-02

Pos logit contributions
 need+0.015
umb+0.014
py+0.014
umper+0.014
 apart+0.013
Neg logit contributions
ber-0.012
Just-0.012
ager-0.012
rd-0.012
 belts-0.012
Token Everyone
Token ID11075
Feature activation-1.29e-02

Pos logit contributions
ber+0.030
Just+0.030
 belts+0.030
ager+0.029
ves+0.029
Neg logit contributions
 need-0.027
umb-0.027
py-0.025
umper-0.025
 apart-0.024
Token watched
Token ID7342
Feature activation-2.17e-02

Pos logit contributions
Just+0.020
ber+0.020
 belts+0.019
rd+0.019
edly+0.019
Neg logit contributions
 need-0.021
umb-0.020
py-0.020
umper-0.019
hing-0.019
Token in
Token ID287
Feature activation-1.19e-02

Pos logit contributions
ber+0.038
Just+0.037
ager+0.036
rd+0.035
 belts+0.035
Neg logit contributions
 need-0.033
umb-0.031
py-0.031
 apart-0.030
hing-0.029
Token amaz
Token ID40642
Feature activation-8.02e-02

Pos logit contributions
ber+0.022
Just+0.022
uddled+0.021
ager+0.021
ves+0.021
Neg logit contributions
 need-0.018
umb-0.016
 apart-0.015
py-0.015
 crash-0.015
Tokenement
Token ID972
Feature activation-4.29e-02

Pos logit contributions
ber+0.124
Just+0.118
cious+0.111
ager+0.110
 belts+0.110
Neg logit contributions
py-0.126
hing-0.123
 need-0.121
umb-0.120
 apart-0.117
Token as
Token ID355
Feature activation-7.54e-03

Pos logit contributions
Just+0.071
ager+0.071
ber+0.070
ival+0.069
rd+0.068
Neg logit contributions
 need-0.063
py-0.062
 apart-0.060
umb-0.060
hing-0.057
Token it
Token ID340
Feature activation+1.51e-02

Pos logit contributions
Just+0.012
ber+0.012
 belts+0.012
ager+0.012
rd+0.012
Neg logit contributions
 need-0.012
umb-0.011
py-0.011
umper-0.011
 apart-0.010
Token took
Token ID1718
Feature activation+2.80e-02

Pos logit contributions
 need+0.022
umb+0.021
py+0.021
hing+0.020
 apart+0.020
Neg logit contributions
Just-0.026
ber-0.026
rd-0.025
ager-0.025
 belts-0.025
Token off
Token ID572
Feature activation+3.66e-04

Pos logit contributions
 need+0.037
umb+0.034
 apart+0.034
py+0.033
hing+0.031
Neg logit contributions
ber-0.056
Just-0.055
uddled-0.054
 belts-0.052
edly-0.052
Token all
Token ID477
Feature activation+2.61e-02

Pos logit contributions
 need+0.024
umb+0.023
py+0.022
umper+0.022
 apart+0.021
Neg logit contributions
ber-0.036
Just-0.036
 belts-0.035
rd-0.035
ager-0.034
Token.
Token ID13
Feature activation-6.21e-03

Pos logit contributions
umb+0.048
 need+0.046
umper+0.046
py+0.046
hing+0.043
Neg logit contributions
ber-0.036
Just-0.035
rd-0.034
ager-0.033
 belts-0.033
Token He
Token ID679
Feature activation-4.16e-03

Pos logit contributions
ber+0.010
Just+0.010
 belts+0.010
ager+0.010
rd+0.010
Neg logit contributions
 need-0.010
umb-0.009
py-0.009
umper-0.009
 apart-0.008
Token thanked
Token ID26280
Feature activation+8.63e-03

Pos logit contributions
rd+0.007
ber+0.007
Just+0.007
edly+0.007
 belts+0.007
Neg logit contributions
 need-0.006
umb-0.006
py-0.006
umper-0.006
hing-0.005
Token the
Token ID262
Feature activation+3.55e-03

Pos logit contributions
 need+0.010
py+0.009
umb+0.009
umper+0.009
 apart+0.008
Neg logit contributions
ber-0.018
Just-0.018
rd-0.018
ager-0.018
 belts-0.018
Token bar
Token ID2318
Feature activation-7.94e-02

Pos logit contributions
 need+0.005
 strongest+0.005
 crash+0.005
 fastest+0.005
 tallest+0.005
Neg logit contributions
uddled-0.006
ves-0.006
rd-0.006
edly-0.006
ues-0.006
Tokenber
Token ID527
Feature activation-3.77e-02

Pos logit contributions
rd+0.059
Just+0.057
uddled+0.054
ves+0.054
aj+0.053
Neg logit contributions
 need-0.182
umb-0.179
 strongest-0.172
emaker-0.171
py-0.170
Token for
Token ID329
Feature activation-2.29e-02

Pos logit contributions
Just+0.075
ves+0.074
rd+0.073
uddled+0.072
ager+0.070
Neg logit contributions
 need-0.045
 apart-0.041
py-0.039
umb-0.039
umper-0.038
Token his
Token ID465
Feature activation-1.17e-02

Pos logit contributions
ber+0.047
Just+0.046
rd+0.045
 belts+0.044
uddled+0.044
Neg logit contributions
 need-0.029
py-0.027
umb-0.025
umper-0.025
 crash-0.025
Token new
Token ID649
Feature activation-1.64e-02

Pos logit contributions
Just+0.023
uddled+0.023
edly+0.022
ves+0.022
ber+0.022
Neg logit contributions
 need-0.017
 strongest-0.014
 crash-0.014
 earthquake-0.013
py-0.013
Token style
Token ID3918
Feature activation-2.90e-02

Pos logit contributions
Just+0.029
uddled+0.029
ival+0.028
edly+0.028
ber+0.027
Neg logit contributions
 need-0.026
umb-0.022
 strongest-0.022
 crash-0.022
 scatter-0.021
Token excited
Token ID6568
Feature activation-2.88e-02

Pos logit contributions
Just+0.053
 belts+0.052
ves+0.051
edly+0.050
ager+0.050
Neg logit contributions
 need-0.058
umb-0.053
 apart-0.049
 strongest-0.049
 lose-0.049
Token.
Token ID13
Feature activation+1.24e-03

Pos logit contributions
ber+0.051
Just+0.049
ival+0.048
ves+0.048
 belts+0.048
Neg logit contributions
 need-0.043
umb-0.041
 apart-0.039
py-0.039
umper-0.036
Token She
Token ID1375
Feature activation+1.08e-04

Pos logit contributions
umb+0.001
 need+0.001
py+0.001
umper+0.001
hing+0.001
Neg logit contributions
ber-0.003
Just-0.003
 belts-0.003
rd-0.003
ager-0.003
Token held
Token ID2714
Feature activation+2.22e-03

Pos logit contributions
 need+0.000
umb+0.000
 scatter+0.000
umper+0.000
py+0.000
Neg logit contributions
rd-0.000
ber-0.000
 belts-0.000
edly-0.000
ves-0.000
Token the
Token ID262
Feature activation-7.59e-03

Pos logit contributions
 need+0.003
umb+0.003
umper+0.003
py+0.003
 apart+0.003
Neg logit contributions
Just-0.004
ber-0.004
rd-0.004
ager-0.004
 belts-0.004
Token bar
Token ID2318
Feature activation-7.93e-02

Pos logit contributions
Just+0.013
ves+0.013
ber+0.012
edly+0.012
ager+0.012
Neg logit contributions
 need-0.012
umb-0.011
 strongest-0.011
 crash-0.011
 tallest-0.010
Token of
Token ID286
Feature activation+3.11e-03

Pos logit contributions
rd+0.053
ves+0.051
Just+0.051
uddled+0.050
aj+0.047
Neg logit contributions
umb-0.199
 need-0.190
 apart-0.182
 strongest-0.174
emaker-0.174
Token chocolate
Token ID11311
Feature activation-4.47e-02

Pos logit contributions
 need+0.004
umb+0.004
py+0.003
umper+0.003
 apart+0.003
Neg logit contributions
Just-0.007
ber-0.006
ager-0.006
rd-0.006
uddled-0.006
Token and
Token ID290
Feature activation-2.49e-02

Pos logit contributions
Just+0.075
rd+0.073
ager+0.071
ber+0.071
ival+0.069
Neg logit contributions
umb-0.069
 need-0.066
 apart-0.066
hing-0.060
 crash-0.060
Token skipped
Token ID26684
Feature activation+1.58e-02

Pos logit contributions
rd+0.046
Just+0.045
ber+0.045
ager+0.044
ival+0.043
Neg logit contributions
 need-0.034
 apart-0.031
 scatter-0.030
umb-0.030
 crash-0.029
Token all
Token ID477
Feature activation+2.19e-02

Pos logit contributions
 apart+0.019
 need+0.018
umb+0.017
 crash+0.016
 scatter+0.015
Neg logit contributions
ber-0.034
uddled-0.032
ves-0.032
rd-0.032
ager-0.031
Token,
Token ID11
Feature activation+7.38e-03

Pos logit contributions
 need+0.034
umb+0.034
py+0.033
umper+0.032
 apart+0.031
Neg logit contributions
ber-0.034
Just-0.034
 belts-0.033
rd-0.033
ager-0.033
Token they
Token ID484
Feature activation+6.13e-03

Pos logit contributions
 need+0.012
umb+0.012
py+0.011
umper+0.011
 apart+0.011
Neg logit contributions
ber-0.012
Just-0.012
 belts-0.011
rd-0.011
ager-0.011
Token arrived
Token ID5284
Feature activation-1.82e-02

Pos logit contributions
umb+0.010
 need+0.010
py+0.009
umper+0.009
hing+0.009
Neg logit contributions
ber-0.010
Just-0.010
 belts-0.010
rd-0.009
ager-0.009
Token at
Token ID379
Feature activation-3.65e-02

Pos logit contributions
ves+0.036
rd+0.035
Just+0.035
ager+0.034
 belts+0.034
Neg logit contributions
 apart-0.024
 need-0.023
umb-0.020
ained-0.019
 lose-0.018
Token the
Token ID262
Feature activation-5.39e-04

Pos logit contributions
ber+0.065
Just+0.064
 belts+0.063
ager+0.062
rd+0.061
Neg logit contributions
umb-0.052
 need-0.051
py-0.050
umper-0.048
hing-0.046
Token bar
Token ID2318
Feature activation-7.90e-02

Pos logit contributions
Just+0.001
edly+0.001
ves+0.001
rd+0.001
uddled+0.001
Neg logit contributions
 need-0.001
 strongest-0.001
umb-0.001
 tallest-0.001
 lose-0.001
Tokenber
Token ID527
Feature activation-3.83e-02

Pos logit contributions
Just+0.058
Aud+0.053
rd+0.051
ves+0.050
uddled+0.049
Neg logit contributions
 need-0.186
umb-0.181
 apart-0.166
 lose-0.165
 strongest-0.164
Token shop
Token ID6128
Feature activation-3.43e-02

Pos logit contributions
Just+0.067
edly+0.064
ves+0.064
uddled+0.062
becca+0.060
Neg logit contributions
 need-0.060
 apart-0.054
umb-0.053
 crash-0.047
 scatter-0.047
Token.
Token ID13
Feature activation+2.86e-03

Pos logit contributions
Just+0.060
ves+0.058
rd+0.057
ager+0.057
edly+0.055
Neg logit contributions
 need-0.050
 apart-0.049
umb-0.047
 lose-0.042
 crash-0.041
Token The
Token ID383
Feature activation+1.06e-02

Pos logit contributions
 need+0.004
umb+0.003
py+0.003
umper+0.003
 apart+0.003
Neg logit contributions
ber-0.006
Just-0.006
 belts-0.005
ager-0.005
rd-0.005
Token bar
Token ID2318
Feature activation-6.93e-02

Pos logit contributions
 need+0.018
 strongest+0.017
 crash+0.015
 tallest+0.015
 fastest+0.015
Neg logit contributions
ves-0.018
Just-0.017
uddled-0.017
rd-0.017
edly-0.017
Token handsome
Token ID22665
Feature activation-1.20e-02

Pos logit contributions
Just+0.050
ber+0.050
 belts+0.048
edly+0.048
ager+0.047
Neg logit contributions
 need-0.041
umb-0.039
py-0.037
umper-0.037
 apart-0.036
Token.
Token ID13
Feature activation+5.21e-03

Pos logit contributions
Just+0.020
ival+0.019
uddled+0.019
ves+0.019
ager+0.019
Neg logit contributions
 need-0.019
umb-0.018
 apart-0.017
py-0.017
umper-0.017
Token He
Token ID679
Feature activation-2.51e-03

Pos logit contributions
 need+0.005
umb+0.005
py+0.005
umper+0.005
 apart+0.004
Neg logit contributions
ber-0.012
Just-0.011
ves-0.011
rd-0.011
 belts-0.011
Token thanked
Token ID26280
Feature activation+8.47e-03

Pos logit contributions
rd+0.005
edly+0.004
 belts+0.004
ber+0.004
ves+0.004
Neg logit contributions
 need-0.003
umper-0.003
py-0.003
umb-0.003
 scatter-0.003
Token the
Token ID262
Feature activation-7.22e-03

Pos logit contributions
 need+0.009
py+0.008
umb+0.008
umper+0.008
 crash+0.008
Neg logit contributions
ber-0.018
Just-0.018
rd-0.018
ager-0.018
 belts-0.018
Token bar
Token ID2318
Feature activation-7.87e-02

Pos logit contributions
rd+0.012
uddled+0.012
edly+0.012
ves+0.012
Just+0.012
Neg logit contributions
 need-0.011
 strongest-0.011
 crash-0.010
umb-0.010
 lose-0.009
Tokenber
Token ID527
Feature activation-3.95e-02

Pos logit contributions
rd+0.064
Just+0.060
ves+0.056
aj+0.056
uddled+0.054
Neg logit contributions
 need-0.178
umb-0.174
 strongest-0.167
emaker-0.167
 crash-0.163
Token and
Token ID290
Feature activation-3.09e-02

Pos logit contributions
Just+0.083
ves+0.083
rd+0.081
uddled+0.079
becca+0.078
Neg logit contributions
 need-0.044
 apart-0.041
umb-0.036
py-0.036
umper-0.035
Token ran
Token ID4966
Feature activation+1.77e-03

Pos logit contributions
rd+0.054
Just+0.053
ber+0.051
ves+0.050
ager+0.050
Neg logit contributions
 need-0.046
 apart-0.042
py-0.042
umb-0.042
umper-0.042
Token out
Token ID503
Feature activation-4.15e-02

Pos logit contributions
 apart+0.002
 need+0.002
 crash+0.002
umb+0.002
 lose+0.002
Neg logit contributions
ber-0.004
ves-0.004
uddled-0.004
Just-0.003
Really-0.003
Token of
Token ID286
Feature activation+1.50e-03

Pos logit contributions
ber+0.063
Just+0.062
ager+0.061
becca+0.060
 belts+0.060
Neg logit contributions
 need-0.073
umb-0.069
 apart-0.068
py-0.067
 crash-0.064
Token.
Token ID13
Feature activation-4.33e-03

Pos logit contributions
 need+0.040
 apart+0.037
umb+0.035
 crash+0.035
umper+0.034
Neg logit contributions
ber-0.044
ager-0.043
Just-0.043
becca-0.042
uddled-0.042
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
Just+0.006
ber+0.006
 belts+0.005
rd+0.005
ager+0.005
Neg logit contributions
umb-0.008
 need-0.008
py-0.008
umper-0.008
hing-0.007
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
ber+0.014
Just+0.013
 belts+0.013
edly+0.012
rd+0.012
Neg logit contributions
umb-0.023
 need-0.023
py-0.022
umper-0.022
hing-0.021
TokenAt
Token ID2953
Feature activation-1.44e-02

Pos logit contributions
 need+0.026
umb+0.026
py+0.025
umper+0.025
 apart+0.024
Neg logit contributions
Just-0.012
ber-0.011
 belts-0.010
ager-0.010
rd-0.010
Token the
Token ID262
Feature activation-1.35e-03

Pos logit contributions
Just+0.028
ber+0.028
rd+0.027
ager+0.027
 belts+0.027
Neg logit contributions
 need-0.018
umb-0.018
py-0.017
umper-0.016
 apart-0.016
Token bar
Token ID2318
Feature activation-7.87e-02

Pos logit contributions
ves+0.002
uddled+0.002
edly+0.002
Just+0.002
ager+0.002
Neg logit contributions
 need-0.002
 strongest-0.002
umb-0.002
 crash-0.002
 tallest-0.002
Tokenber
Token ID527
Feature activation-4.55e-02

Pos logit contributions
Just+0.056
uddled+0.051
ves+0.050
Aud+0.050
rd+0.049
Neg logit contributions
 need-0.188
umb-0.174
 strongest-0.166
 lose-0.165
 crash-0.159
Token,
Token ID11
Feature activation-7.20e-04

Pos logit contributions
Just+0.077
edly+0.076
uddled+0.071
ves+0.071
rd+0.070
Neg logit contributions
 need-0.074
umb-0.066
 apart-0.062
 strongest-0.061
umper-0.061
Token the
Token ID262
Feature activation+1.11e-02

Pos logit contributions
ber+0.001
Just+0.001
 belts+0.001
rd+0.001
ager+0.001
Neg logit contributions
umb-0.001
 need-0.001
py-0.001
umper-0.001
 apart-0.001
Token bar
Token ID2318
Feature activation-6.58e-02

Pos logit contributions
 need+0.018
 strongest+0.017
 crash+0.016
 fastest+0.015
umb+0.015
Neg logit contributions
ves-0.018
uddled-0.018
rd-0.017
ues-0.017
Just-0.017
Tokenber
Token ID527
Feature activation-3.68e-02

Pos logit contributions
Just+0.051
Aud+0.048
 findings+0.045
rd+0.045
ves+0.040
Neg logit contributions
 need-0.149
umb-0.141
 strongest-0.135
 meant-0.130
emaker-0.129
Token
Token ID198
Feature activation-1.38e-02

Pos logit contributions
 belts+0.033
ber+0.033
edly+0.032
ager+0.032
rd+0.032
Neg logit contributions
umb-0.037
 need-0.037
umper-0.035
py-0.034
 apart-0.032
Token
Token ID198
Feature activation-5.55e-04

Pos logit contributions
 belts+0.024
ber+0.023
edly+0.022
uddled+0.022
ager+0.022
Neg logit contributions
umb-0.021
umper-0.021
 need-0.020
py-0.019
ickets-0.018
TokenSam
Token ID16305
Feature activation+2.01e-02

Pos logit contributions
 belts+0.001
uddled+0.001
ber+0.001
edly+0.001
rd+0.001
Neg logit contributions
umper-0.001
umb-0.001
 need-0.001
py-0.001
hing-0.001
Token thanked
Token ID26280
Feature activation+1.38e-02

Pos logit contributions
 need+0.031
umb+0.031
py+0.030
umper+0.030
hing+0.028
Neg logit contributions
ber-0.033
 belts-0.032
ves-0.032
Just-0.031
rd-0.031
Token the
Token ID262
Feature activation+9.93e-04

Pos logit contributions
umb+0.019
 need+0.019
py+0.018
umper+0.017
hing+0.017
Neg logit contributions
ber-0.025
Just-0.025
 belts-0.024
edly-0.023
ves-0.023
Token bar
Token ID2318
Feature activation-7.83e-02

Pos logit contributions
 need+0.001
py+0.001
umb+0.001
 crash+0.001
 strongest+0.001
Neg logit contributions
Just-0.002
rd-0.002
uddled-0.002
ves-0.002
 belts-0.002
Tokenber
Token ID527
Feature activation-3.39e-02

Pos logit contributions
rd+0.064
Aud+0.062
uddled+0.061
aj+0.060
Just+0.060
Neg logit contributions
umb-0.172
 need-0.169
igator-0.159
py-0.159
emaker-0.157
Token and
Token ID290
Feature activation-3.13e-02

Pos logit contributions
Just+0.069
ves+0.067
uddled+0.067
becca+0.064
ager+0.064
Neg logit contributions
 need-0.041
 apart-0.038
umb-0.037
py-0.035
umper-0.035
Token he
Token ID339
Feature activation-1.18e-02

Pos logit contributions
Just+0.055
rd+0.055
ber+0.054
ager+0.052
 belts+0.052
Neg logit contributions
 need-0.046
umb-0.042
 apart-0.041
umper-0.041
py-0.041
Token gave
Token ID2921
Feature activation+7.21e-03

Pos logit contributions
Just+0.020
rd+0.020
ber+0.020
 belts+0.020
edly+0.019
Neg logit contributions
 need-0.017
umb-0.017
umper-0.016
py-0.016
 apart-0.015
Token him
Token ID683
Feature activation-2.07e-02

Pos logit contributions
 need+0.009
umb+0.008
umper+0.008
py+0.008
 apart+0.008
Neg logit contributions
Just-0.015
ber-0.015
rd-0.014
 belts-0.014
ager-0.014
Tokenvious
Token ID1442
Feature activation-7.85e-03

Pos logit contributions
py+0.011
umb+0.010
 crash+0.009
 need+0.009
hing+0.008
Neg logit contributions
Just-0.023
ber-0.022
ival-0.022
ager-0.022
five-0.022
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
ber+0.012
Just+0.012
 belts+0.011
rd+0.011
edly+0.011
Neg logit contributions
umb-0.013
py-0.013
umper-0.013
 need-0.013
hing-0.012
Token It
Token ID632
Feature activation-3.15e-04

Pos logit contributions
umb+0.023
 need+0.023
py+0.022
umper+0.022
hing+0.021
Neg logit contributions
Just-0.027
ber-0.027
 belts-0.026
rd-0.025
ager-0.025
Token asked
Token ID1965
Feature activation-2.43e-02

Pos logit contributions
ber+0.001
rd+0.001
ival+0.001
 belts+0.001
Just+0.001
Neg logit contributions
 need-0.000
py-0.000
umb-0.000
 scatter-0.000
 apart-0.000
Token the
Token ID262
Feature activation-4.11e-03

Pos logit contributions
Just+0.049
ber+0.049
rd+0.048
ager+0.048
 belts+0.047
Neg logit contributions
 need-0.030
py-0.029
umb-0.028
umper-0.027
 apart-0.026
Token bar
Token ID2318
Feature activation-7.83e-02

Pos logit contributions
Just+0.006
rd+0.006
edly+0.006
uddled+0.006
ves+0.006
Neg logit contributions
 need-0.007
umb-0.007
py-0.006
 strongest-0.006
 crash-0.006
Tokenber
Token ID527
Feature activation-4.95e-02

Pos logit contributions
uddled+0.062
Just+0.057
rd+0.056
becca+0.050
aj+0.050
Neg logit contributions
 need-0.187
umb-0.184
py-0.178
 strongest-0.173
emaker-0.169
Token,
Token ID11
Feature activation-2.55e-02

Pos logit contributions
Just+0.106
uddled+0.104
ager+0.102
edly+0.101
rd+0.101
Neg logit contributions
 need-0.060
py-0.051
 apart-0.050
umb-0.048
umper-0.047
Token "
Token ID366
Feature activation-1.38e-02

Pos logit contributions
rd+0.039
ager+0.038
ber+0.036
ves+0.036
 belts+0.036
Neg logit contributions
 need-0.047
py-0.043
umb-0.042
 apart-0.042
umper-0.042
TokenCan
Token ID6090
Feature activation-2.88e-02

Pos logit contributions
rd+0.024
ves+0.023
ber+0.023
 belts+0.023
uddled+0.022
Neg logit contributions
umper-0.021
umb-0.020
 need-0.020
hing-0.019
py-0.019
Token I
Token ID314
Feature activation-1.55e-03

Pos logit contributions
rd+0.061
edly+0.060
ager+0.059
ber+0.059
Just+0.058
Neg logit contributions
 need-0.035
py-0.029
umper-0.028
 strongest-0.028
 crash-0.027
Token her
Token ID607
Feature activation-2.28e-02

Pos logit contributions
ber+0.009
Just+0.009
 belts+0.008
ager+0.008
edly+0.008
Neg logit contributions
umb-0.008
 need-0.007
py-0.007
umper-0.007
hing-0.007
Token mom
Token ID1995
Feature activation+6.81e-03

Pos logit contributions
Just+0.035
ber+0.033
ager+0.033
ival+0.033
ves+0.032
Neg logit contributions
 need-0.040
umb-0.037
py-0.037
 apart-0.034
umper-0.034
Tokenmy
Token ID1820
Feature activation+9.42e-03

Pos logit contributions
umb+0.011
 need+0.011
py+0.010
umper+0.010
 apart+0.009
Neg logit contributions
ber-0.011
Just-0.011
 belts-0.011
rd-0.011
ager-0.011
Token watched
Token ID7342
Feature activation-9.24e-03

Pos logit contributions
umb+0.016
 need+0.016
py+0.015
umper+0.015
hing+0.014
Neg logit contributions
ber-0.014
Just-0.014
 belts-0.013
ager-0.013
rd-0.013
Token in
Token ID287
Feature activation-8.69e-03

Pos logit contributions
ber+0.017
Just+0.017
ager+0.016
ival+0.016
 belts+0.016
Neg logit contributions
 need-0.013
umb-0.012
py-0.012
 apart-0.012
hing-0.012
Token amaz
Token ID40642
Feature activation-7.83e-02

Pos logit contributions
ber+0.018
Just+0.017
ves+0.017
ager+0.017
uddled+0.017
Neg logit contributions
 need-0.012
 apart-0.010
umb-0.010
 crash-0.009
py-0.009
Tokenement
Token ID972
Feature activation-3.43e-02

Pos logit contributions
ber+0.125
Just+0.120
ager+0.114
cious+0.113
 belts+0.112
Neg logit contributions
py-0.118
 need-0.116
hing-0.115
umb-0.115
umper-0.110
Token as
Token ID355
Feature activation+4.01e-03

Pos logit contributions
Just+0.058
ager+0.058
ber+0.056
ival+0.056
rd+0.054
Neg logit contributions
 need-0.050
py-0.048
 apart-0.047
umb-0.047
hing-0.044
Token the
Token ID262
Feature activation+2.33e-02

Pos logit contributions
 need+0.006
umb+0.006
py+0.005
 apart+0.005
umper+0.005
Neg logit contributions
Just-0.007
ber-0.007
ager-0.007
rd-0.007
 belts-0.007
Token machine
Token ID4572
Feature activation-1.33e-02

Pos logit contributions
 need+0.036
py+0.033
 earthquake+0.032
umb+0.032
 strongest+0.031
Neg logit contributions
Just-0.041
ber-0.040
edly-0.038
uddled-0.038
ves-0.037
Token flew
Token ID13112
Feature activation+4.13e-02

Pos logit contributions
Just+0.023
ber+0.022
rd+0.022
ager+0.022
edly+0.021
Neg logit contributions
 need-0.020
umb-0.019
py-0.019
hing-0.018
 apart-0.018
Token egg
Token ID5935
Feature activation-4.52e-03

Pos logit contributions
 need+0.002
umb+0.002
py+0.002
umper+0.002
hing+0.002
Neg logit contributions
Just-0.002
ber-0.002
 belts-0.002
rd-0.002
ager-0.002
Token!
Token ID0
Feature activation-1.17e-02

Pos logit contributions
ber+0.007
Just+0.007
 belts+0.006
rd+0.006
ager+0.006
Neg logit contributions
umb-0.008
 need-0.008
umper-0.007
py-0.007
hing-0.007
Token He
Token ID679
Feature activation-3.54e-03

Pos logit contributions
ber+0.021
Just+0.021
 belts+0.021
ager+0.020
ves+0.020
Neg logit contributions
 need-0.016
umb-0.016
py-0.015
umper-0.015
 apart-0.014
Token watched
Token ID7342
Feature activation-1.84e-02

Pos logit contributions
ber+0.006
Just+0.005
 belts+0.005
rd+0.005
ival+0.005
Neg logit contributions
 need-0.006
umb-0.005
py-0.005
umper-0.005
 apart-0.005
Token in
Token ID287
Feature activation-1.25e-02

Pos logit contributions
ber+0.032
Just+0.032
 belts+0.031
rd+0.031
ager+0.031
Neg logit contributions
 need-0.027
umb-0.026
py-0.025
umper-0.025
 apart-0.024
Token amaz
Token ID40642
Feature activation-7.72e-02

Pos logit contributions
ber+0.023
Just+0.022
uddled+0.022
ves+0.022
ival+0.022
Neg logit contributions
 need-0.019
umb-0.017
 apart-0.017
py-0.017
 crash-0.016
Tokenement
Token ID972
Feature activation-4.22e-02

Pos logit contributions
ber+0.122
Just+0.114
 belts+0.110
ager+0.108
ves+0.108
Neg logit contributions
py-0.122
umb-0.119
hing-0.117
 need-0.116
 apart-0.113
Token as
Token ID355
Feature activation-8.66e-03

Pos logit contributions
ager+0.073
ival+0.073
ber+0.072
Just+0.072
rd+0.071
Neg logit contributions
 need-0.058
py-0.057
 apart-0.056
umb-0.056
 crash-0.052
Token the
Token ID262
Feature activation+2.11e-02

Pos logit contributions
ber+0.014
Just+0.014
 belts+0.014
rd+0.014
ager+0.013
Neg logit contributions
umb-0.013
 need-0.013
py-0.013
umper-0.013
 apart-0.012
Token egg
Token ID5935
Feature activation+6.84e-04

Pos logit contributions
 need+0.029
umb+0.027
py+0.027
 strongest+0.025
 earthquake+0.025
Neg logit contributions
ber-0.040
Just-0.040
uddled-0.039
 belts-0.039
ival-0.038
Token began
Token ID2540
Feature activation+2.74e-02

Pos logit contributions
 need+0.001
umb+0.001
py+0.001
umper+0.001
 apart+0.001
Neg logit contributions
ber-0.001
Just-0.001
rd-0.001
 belts-0.001
ager-0.001
Token twenty
Token ID8208
Feature activation-4.12e-02

Pos logit contributions
uddled+0.069
ager+0.068
ber+0.068
ival+0.066
rd+0.065
Neg logit contributions
 need-0.053
py-0.045
 scatter-0.041
 strongest-0.041
 crash-0.039
Token-
Token ID12
Feature activation-7.90e-02

Pos logit contributions
uddled+0.052
ival+0.052
hand+0.050
 belts+0.050
ber+0.049
Neg logit contributions
 need-0.078
umb-0.070
py-0.070
 meant-0.067
 strongest-0.065
Tokeneight
Token ID26022
Feature activation-5.21e-02

Pos logit contributions
uddled+0.126
aj+0.124
 belts+0.117
ber+0.111
ival+0.108
Neg logit contributions
py-0.128
umb-0.125
 need-0.118
umper-0.117
hing-0.113
Token,
Token ID11
Feature activation-3.51e-02

Pos logit contributions
Just+0.098
ager+0.097
ber+0.095
ival+0.095
uddled+0.095
Neg logit contributions
 need-0.066
py-0.059
umb-0.058
 strongest-0.053
umper-0.052
Token twenty
Token ID8208
Feature activation-4.58e-02

Pos logit contributions
uddled+0.065
ager+0.065
ber+0.064
rd+0.064
ival+0.062
Neg logit contributions
 need-0.050
py-0.044
umb-0.041
 crash-0.040
umper-0.040
Token-
Token ID12
Feature activation-7.70e-02

Pos logit contributions
uddled+0.053
ival+0.051
 belts+0.049
ber+0.049
Of+0.049
Neg logit contributions
 need-0.093
umb-0.086
py-0.086
 meant-0.079
 strongest-0.079
Tokennine
Token ID30888
Feature activation-2.72e-02

Pos logit contributions
uddled+0.125
aj+0.116
 belts+0.112
ber+0.112
ival+0.106
Neg logit contributions
py-0.128
umb-0.126
 need-0.119
umper-0.118
hing-0.113
Token,
Token ID11
Feature activation-2.96e-02

Pos logit contributions
Just+0.051
ber+0.049
rd+0.048
edly+0.048
 belts+0.047
Neg logit contributions
 need-0.038
umb-0.035
py-0.033
umper-0.032
hing-0.032
Token thirty
Token ID12277
Feature activation-4.91e-02

Pos logit contributions
ber+0.051
rd+0.050
ager+0.050
uddled+0.050
Just+0.049
Neg logit contributions
 need-0.045
py-0.041
umb-0.041
umper-0.040
 crash-0.038
Token,
Token ID11
Feature activation-2.97e-02

Pos logit contributions
uddled+0.049
ival+0.046
Of+0.044
Just+0.043
 belts+0.042
Neg logit contributions
 need-0.108
umb-0.102
py-0.102
 strongest-0.094
 meant-0.091
Token thirty
Token ID12277
Feature activation-4.83e-02

Pos logit contributions
ber+0.052
rd+0.051
ager+0.051
uddled+0.050
Just+0.050
Neg logit contributions
 need-0.045
py-0.041
umb-0.041
umper-0.040
 crash-0.038
Token little
Token ID1310
Feature activation+2.12e-02

Pos logit contributions
 need+0.019
py+0.018
umb+0.018
umper+0.017
 strongest+0.017
Neg logit contributions
Just-0.024
ber-0.023
rd-0.023
ager-0.023
ves-0.023
Token girl
Token ID2576
Feature activation+3.15e-03

Pos logit contributions
umb+0.030
 need+0.030
py+0.028
umper+0.028
hing+0.026
Neg logit contributions
ber-0.038
Just-0.038
 belts-0.037
rd-0.036
ager-0.036
Token looked
Token ID3114
Feature activation+1.69e-02

Pos logit contributions
 need+0.005
py+0.005
umb+0.004
umper+0.004
 scatter+0.004
Neg logit contributions
ber-0.005
 belts-0.005
Just-0.005
ival-0.005
rd-0.005
Token up
Token ID510
Feature activation-2.31e-02

Pos logit contributions
 need+0.024
umb+0.023
py+0.022
umper+0.022
 apart+0.022
Neg logit contributions
ber-0.031
Just-0.030
 belts-0.029
rd-0.029
ager-0.029
Token in
Token ID287
Feature activation+6.37e-03

Pos logit contributions
ber+0.042
ves+0.042
rd+0.041
ival+0.040
ager+0.040
Neg logit contributions
 need-0.034
 apart-0.032
umb-0.032
 scatter-0.029
py-0.029
Token amaz
Token ID40642
Feature activation-7.63e-02

Pos logit contributions
 need+0.009
umb+0.008
py+0.007
 apart+0.007
umper+0.007
Neg logit contributions
ber-0.013
Just-0.013
ves-0.012
uddled-0.012
 belts-0.012
Tokenement
Token ID972
Feature activation-3.69e-02

Pos logit contributions
ber+0.128
Just+0.115
 belts+0.113
ager+0.112
cious+0.112
Neg logit contributions
py-0.113
umb-0.111
hing-0.107
el-0.105
 need-0.104
Token and
Token ID290
Feature activation-2.01e-02

Pos logit contributions
ival+0.071
ager+0.069
ber+0.067
ves+0.066
becca+0.066
Neg logit contributions
 apart-0.046
 need-0.045
py-0.044
umb-0.044
 earthquake-0.038
Token said
Token ID531
Feature activation-1.24e-02

Pos logit contributions
ves+0.038
ber+0.038
ager+0.037
rd+0.037
ival+0.036
Neg logit contributions
 need-0.028
 apart-0.027
 scatter-0.024
 strongest-0.024
 trem-0.023
Token,
Token ID11
Feature activation-2.45e-02

Pos logit contributions
ager+0.027
 belts+0.027
becca+0.027
Just+0.027
rd+0.026
Neg logit contributions
 apart-0.010
py-0.010
 need-0.010
 scatter-0.009
 crash-0.009
Token "
Token ID366
Feature activation-1.58e-02

Pos logit contributions
rd+0.041
ager+0.040
ber+0.038
ival+0.037
ves+0.037
Neg logit contributions
 need-0.039
 crash-0.037
 apart-0.037
py-0.036
umb-0.035
Token.
Token ID13
Feature activation+7.60e-03

Pos logit contributions
Just+0.069
ager+0.064
rd+0.063
ber+0.063
ival+0.063
Neg logit contributions
 need-0.070
umb-0.068
py-0.064
 apart-0.063
hing-0.061
Token He
Token ID679
Feature activation-4.17e-03

Pos logit contributions
umb+0.008
 need+0.008
py+0.008
umper+0.008
 apart+0.007
Neg logit contributions
ber-0.016
Just-0.016
 belts-0.016
rd-0.015
ager-0.015
Token thanked
Token ID26280
Feature activation+9.88e-03

Pos logit contributions
rd+0.007
ber+0.007
Just+0.007
 belts+0.007
edly+0.006
Neg logit contributions
 need-0.007
umb-0.006
py-0.006
umper-0.006
hing-0.006
Token the
Token ID262
Feature activation+6.69e-03

Pos logit contributions
 need+0.011
py+0.010
umb+0.010
umper+0.009
 apart+0.009
Neg logit contributions
ber-0.021
Just-0.021
rd-0.021
ager-0.021
uddled-0.021
Token clumsy
Token ID39120
Feature activation+7.04e-03

Pos logit contributions
 need+0.010
 strongest+0.010
 crash+0.009
 tallest+0.008
 urge+0.008
Neg logit contributions
ves-0.012
uddled-0.012
rd-0.012
ues-0.011
Just-0.011
Token bar
Token ID2318
Feature activation-7.62e-02

Pos logit contributions
 need+0.012
umb+0.011
 crash+0.010
 strongest+0.010
 lose+0.010
Neg logit contributions
Just-0.012
uddled-0.011
ves-0.011
rd-0.011
ival-0.011
Tokenber
Token ID527
Feature activation-3.46e-02

Pos logit contributions
Just+0.060
rd+0.060
aj+0.055
uddled+0.055
Aud+0.054
Neg logit contributions
 need-0.170
umb-0.165
 strongest-0.158
emaker-0.156
py-0.155
Token and
Token ID290
Feature activation-2.99e-02

Pos logit contributions
Just+0.069
uddled+0.067
ves+0.066
rd+0.065
ager+0.064
Neg logit contributions
 need-0.042
 apart-0.038
py-0.037
umb-0.035
 crash-0.034
Token went
Token ID1816
Feature activation+1.26e-02

Pos logit contributions
Just+0.052
rd+0.051
ber+0.050
ager+0.049
 belts+0.049
Neg logit contributions
 need-0.045
umb-0.041
py-0.040
 apart-0.040
umper-0.039
Token home
Token ID1363
Feature activation-3.76e-02

Pos logit contributions
 apart+0.015
 need+0.015
 crash+0.014
umper+0.014
umb+0.013
Neg logit contributions
Really-0.026
ager-0.025
Just-0.025
ber-0.025
rd-0.024
Token with
Token ID351
Feature activation-9.60e-03

Pos logit contributions
ival+0.072
ager+0.070
Just+0.070
becca+0.070
Really+0.070
Neg logit contributions
 need-0.052
 apart-0.050
 crash-0.045
 lose-0.044
 feel-0.040
Token,
Token ID11
Feature activation+5.45e-03

Pos logit contributions
Just+0.022
ber+0.022
ival+0.022
ves+0.022
 belts+0.022
Neg logit contributions
 need-0.022
py-0.021
umb-0.020
 apart-0.020
umper-0.019
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
umb+0.009
 need+0.009
py+0.008
umper+0.008
hing+0.008
Neg logit contributions
ber-0.009
Just-0.009
 belts-0.008
rd-0.008
ager-0.008
Token he
Token ID339
Feature activation-5.57e-03

Pos logit contributions
Just+0.031
ber+0.031
rd+0.030
 belts+0.030
ager+0.030
Neg logit contributions
 need-0.030
umb-0.029
py-0.028
umper-0.027
 apart-0.027
Token thanked
Token ID26280
Feature activation+1.20e-02

Pos logit contributions
ber+0.009
Just+0.009
 belts+0.009
rd+0.009
edly+0.008
Neg logit contributions
 need-0.009
umb-0.009
py-0.008
umper-0.008
 apart-0.008
Token the
Token ID262
Feature activation-6.94e-06

Pos logit contributions
 need+0.015
umb+0.015
py+0.015
umper+0.014
 apart+0.013
Neg logit contributions
ber-0.023
Just-0.023
 belts-0.022
rd-0.022
ager-0.022
Token bar
Token ID2318
Feature activation-7.62e-02

Feature not active on prev token;
no predictions were affected.
Tokenber
Token ID527
Feature activation-3.34e-02

Pos logit contributions
Just+0.061
Aud+0.059
aj+0.056
uddled+0.056
rd+0.055
Neg logit contributions
 need-0.170
umb-0.168
py-0.162
 strongest-0.157
igator-0.156
Token for
Token ID329
Feature activation-2.19e-02

Pos logit contributions
Just+0.068
ves+0.065
uddled+0.065
becca+0.062
ager+0.062
Neg logit contributions
 need-0.042
 apart-0.038
py-0.035
umb-0.035
umper-0.034
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
ber+0.046
Just+0.045
rd+0.044
 belts+0.044
ager+0.044
Neg logit contributions
 need-0.027
py-0.025
umper-0.023
umb-0.023
 crash-0.022
Token expensive
Token ID5789
Feature activation+9.79e-04

Pos logit contributions
 need+0.022
 strongest+0.020
 crash+0.019
 earthquake+0.019
umb+0.018
Neg logit contributions
Just-0.023
uddled-0.022
edly-0.022
ves-0.021
rd-0.021
Token haircut
Token ID45548
Feature activation-5.43e-02

Pos logit contributions
 need+0.002
 strongest+0.001
 crash+0.001
umb+0.001
 urge+0.001
Neg logit contributions
Just-0.002
edly-0.001
ves-0.001
uddled-0.001
rd-0.001
Token Lucy
Token ID22162
Feature activation-1.19e-02

Pos logit contributions
umb+0.019
 need+0.019
py+0.018
umper+0.018
hing+0.017
Neg logit contributions
ber-0.023
Just-0.022
 belts-0.022
rd-0.021
ager-0.021
Token walked
Token ID6807
Feature activation+1.66e-02

Pos logit contributions
ber+0.019
 belts+0.019
Just+0.019
rd+0.019
ves+0.019
Neg logit contributions
 need-0.019
umb-0.018
umper-0.017
py-0.017
 apart-0.016
Token out
Token ID503
Feature activation-3.49e-02

Pos logit contributions
 apart+0.019
py+0.017
 need+0.017
umb+0.016
 crash+0.016
Neg logit contributions
ager-0.035
ves-0.034
rd-0.034
ber-0.034
anas-0.033
Token of
Token ID286
Feature activation+2.30e-03

Pos logit contributions
ber+0.057
Just+0.056
 belts+0.054
ager+0.054
rd+0.053
Neg logit contributions
 need-0.055
umb-0.055
py-0.053
umper-0.051
 apart-0.050
Token the
Token ID262
Feature activation+2.96e-03

Pos logit contributions
umb+0.004
umper+0.003
py+0.003
 need+0.003
hing+0.003
Neg logit contributions
 belts-0.004
ber-0.004
Just-0.003
edly-0.003
rd-0.003
Token bar
Token ID2318
Feature activation-7.61e-02

Pos logit contributions
 need+0.005
umb+0.004
 crash+0.004
 earthquake+0.004
 strongest+0.004
Neg logit contributions
ves-0.005
Just-0.005
edly-0.005
uddled-0.005
ager-0.005
Tokenbers
Token ID1213
Feature activation-4.46e-02

Pos logit contributions
Just+0.053
rd+0.050
ves+0.046
uddled+0.042
 findings+0.040
Neg logit contributions
 need-0.179
umb-0.179
 apart-0.163
 strongest-0.162
emaker-0.162
Tokenhop
Token ID8548
Feature activation-1.98e-02

Pos logit contributions
uddled+0.064
Really+0.060
Just+0.060
Of+0.059
 belts+0.057
Neg logit contributions
umb-0.079
 need-0.075
umper-0.069
hing-0.069
 crash-0.069
Token,
Token ID11
Feature activation+3.98e-03

Pos logit contributions
Just+0.037
ber+0.036
ager+0.036
 belts+0.036
edly+0.036
Neg logit contributions
 need-0.026
umb-0.025
 apart-0.023
py-0.023
 strongest-0.022
Token everyone
Token ID2506
Feature activation-8.70e-03

Pos logit contributions
 need+0.005
 apart+0.005
umb+0.005
py+0.005
 strongest+0.004
Neg logit contributions
ber-0.007
Just-0.007
ager-0.007
rd-0.007
 belts-0.007
Token admired
Token ID29382
Feature activation-5.67e-04

Pos logit contributions
 belts+0.015
Just+0.014
rd+0.014
ber+0.014
ves+0.014
Neg logit contributions
 need-0.013
umb-0.013
py-0.012
hing-0.012
 apart-0.012
Token"
Token ID1
Feature activation-7.45e-03

Pos logit contributions
 need+0.031
umb+0.028
umper+0.028
 crash+0.026
 scatter+0.026
Neg logit contributions
edly-0.027
uddled-0.026
land-0.025
 belts-0.025
rd-0.024
TokenS
Token ID50
Feature activation-9.37e-03

Pos logit contributions
ber+0.012
rd+0.012
edly+0.012
 belts+0.012
ves+0.012
Neg logit contributions
 need-0.011
umper-0.011
umb-0.011
 apart-0.010
py-0.010
Tokenara
Token ID3301
Feature activation+3.50e-03

Pos logit contributions
edly+0.011
ber+0.011
Just+0.011
 belts+0.011
ves+0.010
Neg logit contributions
umb-0.019
 need-0.019
umper-0.018
py-0.017
 apart-0.017
Token,
Token ID11
Feature activation-2.36e-02

Pos logit contributions
umb+0.006
 need+0.006
py+0.006
umper+0.006
hing+0.006
Neg logit contributions
ber-0.005
Just-0.005
ager-0.005
 belts-0.005
rd-0.005
Token what
Token ID644
Feature activation-8.89e-03

Pos logit contributions
ber+0.030
Just+0.029
 belts+0.028
ager+0.027
rd+0.027
Neg logit contributions
umb-0.047
py-0.045
 need-0.045
umper-0.044
hing-0.043
Token did
Token ID750
Feature activation+5.53e-02

Pos logit contributions
ber+0.017
rd+0.016
ager+0.016
Just+0.016
ves+0.016
Neg logit contributions
umb-0.011
py-0.011
umper-0.011
 need-0.010
hing-0.010
Token you
Token ID345
Feature activation+2.01e-03

Pos logit contributions
 need+0.072
py+0.061
 earthquake+0.059
umb+0.058
 crash+0.056
Neg logit contributions
Just-0.109
edly-0.108
ber-0.108
uddled-0.105
rd-0.102
Token do
Token ID466
Feature activation+3.66e-02

Pos logit contributions
py+0.004
umper+0.004
umb+0.004
emaker+0.004
rell+0.004
Neg logit contributions
Just-0.002
ager-0.002
rd-0.002
 nicely-0.002
becca-0.002
Token?"
Token ID1701
Feature activation-1.09e-02

Pos logit contributions
 need+0.036
umb+0.035
py+0.033
umper+0.032
 apart+0.029
Neg logit contributions
ber-0.082
Just-0.081
 belts-0.079
rd-0.078
ager-0.078
Token Mom
Token ID11254
Feature activation-5.21e-04

Pos logit contributions
ber+0.015
Just+0.015
ager+0.015
 belts+0.015
rd+0.015
Neg logit contributions
 need-0.020
umb-0.019
py-0.018
umper-0.018
 apart-0.018
Token asked
Token ID1965
Feature activation-1.60e-02

Pos logit contributions
ber+0.001
Just+0.001
 belts+0.001
rd+0.001
ager+0.001
Neg logit contributions
 need-0.001
umb-0.001
py-0.001
umper-0.001
 apart-0.001
TokenBen
Token ID11696
Feature activation+1.70e-02

Pos logit contributions
ber+0.024
Just+0.023
 belts+0.023
rd+0.022
ager+0.022
Neg logit contributions
umb-0.021
 need-0.021
py-0.020
umper-0.020
hing-0.019
Token and
Token ID290
Feature activation-2.81e-03

Pos logit contributions
umb+0.030
 need+0.029
py+0.029
umper+0.028
hing+0.027
Neg logit contributions
ber-0.025
Just-0.024
 belts-0.024
ager-0.023
rd-0.023
Token Lily
Token ID20037
Feature activation+3.70e-02

Pos logit contributions
ber+0.004
Just+0.004
 belts+0.004
ager+0.004
five+0.004
Neg logit contributions
umb-0.005
py-0.005
umper-0.005
hing-0.005
 apart-0.004
Token were
Token ID547
Feature activation-6.14e-03

Pos logit contributions
umb+0.055
py+0.055
umper+0.051
 need+0.050
 apart+0.047
Neg logit contributions
ber-0.063
ager-0.062
Just-0.061
 belts-0.061
five-0.059
Token hungry
Token ID14720
Feature activation-2.19e-02

Pos logit contributions
ber+0.010
Just+0.010
 belts+0.009
rd+0.009
ager+0.009
Neg logit contributions
umb-0.010
 need-0.010
py-0.010
umper-0.009
hing-0.009
Token.
Token ID13
Feature activation+5.64e-02

Pos logit contributions
Just+0.033
ber+0.032
ager+0.031
 belts+0.031
rd+0.030
Neg logit contributions
 need-0.038
umb-0.038
py-0.036
umper-0.035
 apart-0.034
Token They
Token ID1119
Feature activation+2.53e-02

Pos logit contributions
umb+0.074
py+0.072
 need+0.070
umper+0.069
hing+0.067
Neg logit contributions
ber-0.107
Just-0.107
 belts-0.103
rd-0.103
edly-0.101
Token wanted
Token ID2227
Feature activation-6.94e-03

Pos logit contributions
py+0.048
umb+0.047
umper+0.047
emaker+0.045
hing+0.044
Neg logit contributions
ager-0.033
Just-0.033
ber-0.032
rd-0.031
 belts-0.030
Token to
Token ID284
Feature activation-1.06e-02

Pos logit contributions
Just+0.010
ber+0.010
ager+0.010
ival+0.010
rd+0.010
Neg logit contributions
 need-0.013
umb-0.011
 apart-0.011
 scatter-0.011
py-0.011
Token eat
Token ID4483
Feature activation-4.99e-03

Pos logit contributions
rd+0.016
Just+0.015
ves+0.015
ber+0.015
ager+0.015
Neg logit contributions
py-0.018
umper-0.017
umb-0.017
ained-0.016
emaker-0.016
Token pizza
Token ID14256
Feature activation-1.17e-03

Pos logit contributions
Just+0.008
ber+0.008
ager+0.008
rd+0.008
ival+0.008
Neg logit contributions
 need-0.008
umb-0.008
py-0.007
 apart-0.007
umper-0.007
Token much
Token ID881
Feature activation+1.70e-02

Pos logit contributions
umb+0.015
umper+0.014
py+0.014
 need+0.013
hing+0.013
Neg logit contributions
Just-0.015
ber-0.015
 belts-0.014
rd-0.014
ves-0.014
Token easier
Token ID4577
Feature activation+1.79e-02

Pos logit contributions
umb+0.024
 need+0.023
py+0.022
umper+0.022
hing+0.020
Neg logit contributions
Just-0.031
ber-0.031
 belts-0.030
rd-0.030
ager-0.029
Token to
Token ID284
Feature activation+2.91e-02

Pos logit contributions
 need+0.032
umb+0.030
py+0.030
 apart+0.029
umper+0.029
Neg logit contributions
ber-0.026
Just-0.026
rd-0.025
 belts-0.025
ager-0.025
Token get
Token ID651
Feature activation+2.16e-02

Pos logit contributions
umb+0.044
py+0.040
umper+0.040
 need+0.036
ained+0.035
Neg logit contributions
Just-0.050
ber-0.048
ager-0.047
 belts-0.047
ves-0.047
Token things
Token ID1243
Feature activation+1.57e-02

Pos logit contributions
umb+0.033
py+0.031
umper+0.030
 need+0.030
hing+0.029
Neg logit contributions
Just-0.036
ber-0.036
 belts-0.035
ager-0.034
rd-0.034
Token done
Token ID1760
Feature activation+5.33e-02

Pos logit contributions
umb+0.027
 need+0.027
py+0.026
umper+0.025
hing+0.024
Neg logit contributions
ber-0.023
Just-0.023
 belts-0.022
rd-0.022
ager-0.022
Token.
Token ID13
Feature activation-1.90e-02

Pos logit contributions
 need+0.093
umb+0.092
py+0.088
umper+0.086
 apart+0.086
Neg logit contributions
ber-0.078
Just-0.077
 belts-0.074
ager-0.073
rd-0.072
Token That
Token ID1320
Feature activation-3.26e-02

Pos logit contributions
ber+0.023
Just+0.023
 belts+0.022
rd+0.022
ager+0.022
Neg logit contributions
umb-0.037
 need-0.037
py-0.036
umper-0.036
 apart-0.034
Token's
Token ID338
Feature activation-1.44e-02

Pos logit contributions
ager+0.053
ber+0.051
Just+0.050
rd+0.050
ves+0.049
Neg logit contributions
umb-0.051
py-0.050
umper-0.050
hing-0.046
 apart-0.045
Token why
Token ID1521
Feature activation-1.84e-02

Pos logit contributions
ber+0.021
Just+0.021
 belts+0.020
rd+0.020
ager+0.020
Neg logit contributions
 need-0.025
umb-0.025
py-0.024
umper-0.024
 apart-0.023
Token it
Token ID340
Feature activation+8.65e-03

Pos logit contributions
ber+0.028
Just+0.027
 belts+0.027
ves+0.026
ager+0.026
Neg logit contributions
umb-0.032
py-0.030
umper-0.030
 need-0.029
hing-0.028
Token a
Token ID257
Feature activation-6.36e-03

Pos logit contributions
umb+0.042
py+0.041
 need+0.041
umper+0.040
hing+0.038
Neg logit contributions
Just-0.044
ber-0.044
 belts-0.043
rd-0.042
ager-0.042
Token safe
Token ID3338
Feature activation+2.20e-02

Pos logit contributions
edly+0.013
ber+0.013
uddled+0.013
Just+0.012
becca+0.012
Neg logit contributions
 need-0.009
 crash-0.008
 apart-0.008
 flood-0.007
 scatter-0.007
Token place
Token ID1295
Feature activation-2.32e-02

Pos logit contributions
 need+0.031
umb+0.031
py+0.029
umper+0.028
 apart+0.028
Neg logit contributions
ber-0.040
Just-0.039
 belts-0.038
ager-0.037
rd-0.037
Token where
Token ID810
Feature activation-2.40e-02

Pos logit contributions
Just+0.039
ber+0.038
becca+0.037
ager+0.037
ival+0.037
Neg logit contributions
 need-0.037
 apart-0.035
umb-0.035
py-0.032
 crash-0.030
Token nobody
Token ID8168
Feature activation+1.80e-02

Pos logit contributions
Just+0.037
ber+0.037
 belts+0.036
ager+0.035
rd+0.035
Neg logit contributions
 need-0.040
umb-0.039
py-0.038
umper-0.037
 apart-0.036
Token could
Token ID714
Feature activation+7.11e-02

Pos logit contributions
 need+0.030
umb+0.027
py+0.027
umper+0.026
 apart+0.025
Neg logit contributions
Just-0.029
 belts-0.029
ber-0.029
edly-0.028
ves-0.028
Token find
Token ID1064
Feature activation+3.77e-02

Pos logit contributions
umb+0.104
 need+0.101
py+0.099
umper+0.098
 apart+0.092
Neg logit contributions
Just-0.122
ber-0.122
rd-0.117
 belts-0.116
ager-0.116
Token it
Token ID340
Feature activation+1.41e-02

Pos logit contributions
 need+0.057
umb+0.056
py+0.054
umper+0.053
 apart+0.051
Neg logit contributions
Just-0.065
ber-0.065
 belts-0.062
rd-0.061
ager-0.061
Token.
Token ID13
Feature activation-1.89e-02

Pos logit contributions
 need+0.025
umb+0.025
py+0.024
umper+0.023
 apart+0.023
Neg logit contributions
ber-0.020
Just-0.020
 belts-0.019
ager-0.019
rd-0.019
Token
Token ID220
Feature activation-1.85e-02

Pos logit contributions
ber+0.031
 belts+0.030
Just+0.029
ager+0.029
ves+0.029
Neg logit contributions
 need-0.031
umb-0.030
umper-0.028
py-0.028
 apart-0.027
Token
Token ID198
Feature activation-1.62e-02

Pos logit contributions
ber+0.024
 belts+0.023
Just+0.023
ager+0.022
rd+0.022
Neg logit contributions
umb-0.036
 need-0.036
py-0.034
umper-0.034
 apart-0.033
Token no
Token ID645
Feature activation+5.34e-03

Pos logit contributions
ber+0.007
Just+0.007
 belts+0.007
rd+0.007
ager+0.007
Neg logit contributions
umb-0.006
 need-0.006
py-0.005
umper-0.005
hing-0.005
Token matter
Token ID2300
Feature activation+7.42e-03

Pos logit contributions
umb+0.005
 need+0.005
py+0.005
umper+0.005
hing+0.004
Neg logit contributions
ber-0.012
Just-0.012
 belts-0.012
rd-0.012
ager-0.012
Token how
Token ID703
Feature activation-1.89e-02

Pos logit contributions
umb+0.017
 need+0.016
umper+0.016
py+0.015
hing+0.015
Neg logit contributions
ber-0.007
Just-0.007
 belts-0.007
rd-0.007
ves-0.006
Token fearful
Token ID26844
Feature activation+3.22e-02

Pos logit contributions
ber+0.032
Just+0.031
 belts+0.030
rd+0.030
ager+0.030
Neg logit contributions
 need-0.029
umb-0.029
py-0.027
umper-0.027
 apart-0.026
Token he
Token ID339
Feature activation+3.75e-02

Pos logit contributions
umb+0.066
umper+0.061
py+0.060
 need+0.057
ained+0.056
Neg logit contributions
ber-0.038
ves-0.037
 belts-0.035
 else-0.034
Just-0.034
Token felt
Token ID2936
Feature activation+7.27e-02

Pos logit contributions
umb+0.060
 need+0.059
py+0.058
umper+0.057
 apart+0.054
Neg logit contributions
ber-0.060
Just-0.059
 belts-0.057
ager-0.057
rd-0.057
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
umb+0.135
py+0.133
umper+0.128
 need+0.128
hing+0.122
Neg logit contributions
Just-0.097
ber-0.097
ager-0.092
rd-0.091
 belts-0.090
Token The
Token ID383
Feature activation+8.03e-03

Pos logit contributions
 belts+0.031
ber+0.031
uddled+0.030
ves+0.029
edly+0.029
Neg logit contributions
 need-0.034
umb-0.033
umper-0.031
py-0.031
 apart-0.029
Token End
Token ID5268
Feature activation+1.52e-02

Pos logit contributions
 need+0.011
umb+0.010
py+0.009
umper+0.009
 strongest+0.009
Neg logit contributions
ber-0.016
Just-0.016
edly-0.015
rd-0.015
 belts-0.015
Token.
Token ID13
Feature activation-1.82e-02

Pos logit contributions
 need+0.020
umb+0.017
py+0.015
 meant+0.015
 apart+0.014
Neg logit contributions
uddled-0.029
ber-0.029
 belts-0.028
edly-0.028
becca-0.027
Token<|endoftext|>
Token ID50256
Feature activation-2.05e-02

Pos logit contributions
ber+0.024
 belts+0.023
Just+0.023
edly+0.022
uddled+0.022
Neg logit contributions
 need-0.035
umb-0.035
py-0.033
umper-0.033
 apart-0.031
Token one
Token ID530
Feature activation-9.66e-03

Pos logit contributions
Just+0.034
becca+0.033
ves+0.033
rd+0.032
Really+0.031
Neg logit contributions
 need-0.031
umb-0.028
 apart-0.028
py-0.025
 strongest-0.025
Token final
Token ID2457
Feature activation-1.84e-03

Pos logit contributions
Just+0.019
ves+0.018
rd+0.018
becca+0.017
 belts+0.017
Neg logit contributions
 need-0.014
umb-0.013
 strongest-0.012
 apart-0.011
py-0.011
Token hug
Token ID16225
Feature activation+2.21e-02

Pos logit contributions
ber+0.003
becca+0.003
uddled+0.003
ival+0.003
 belts+0.003
Neg logit contributions
 need-0.003
umb-0.003
 crash-0.002
 scatter-0.002
 apart-0.002
Token before
Token ID878
Feature activation+3.54e-02

Pos logit contributions
 need+0.034
umb+0.034
py+0.032
 apart+0.032
umper+0.031
Neg logit contributions
Just-0.037
ager-0.035
ber-0.035
rd-0.035
becca-0.034
Token he
Token ID339
Feature activation+5.98e-03

Pos logit contributions
umb+0.055
 need+0.055
py+0.053
umper+0.052
hing+0.049
Neg logit contributions
ber-0.058
Just-0.058
 belts-0.056
rd-0.055
ager-0.055
Token went
Token ID1816
Feature activation+3.52e-02

Pos logit contributions
 need+0.009
umb+0.008
py+0.008
umper+0.008
 apart+0.007
Neg logit contributions
ber-0.011
Just-0.011
 belts-0.010
rd-0.010
ager-0.010
Token home
Token ID1363
Feature activation-3.03e-02

Pos logit contributions
 need+0.059
umb+0.056
py+0.056
 apart+0.055
umper+0.054
Neg logit contributions
ber-0.056
Just-0.055
ager-0.055
becca-0.053
 belts-0.053
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
Just+0.056
ager+0.056
ival+0.056
becca+0.055
ber+0.054
Neg logit contributions
 need-0.044
 apart-0.040
umb-0.038
 crash-0.036
 lose-0.036
Token<|endoftext|>
Token ID50256
Feature activation-2.28e-02

Pos logit contributions
ber+0.030
Just+0.029
 belts+0.029
rd+0.028
ager+0.028
Neg logit contributions
umb-0.020
 need-0.020
py-0.019
umper-0.018
 apart-0.017
TokenOnce
Token ID7454
Feature activation+1.51e-02

Pos logit contributions
ber+0.033
Just+0.033
 belts+0.031
rd+0.031
ager+0.031
Neg logit contributions
umb-0.040
 need-0.039
py-0.038
umper-0.038
 apart-0.036
Token upon
Token ID2402
Feature activation-1.42e-02

Pos logit contributions
umb+0.028
 need+0.027
py+0.027
umper+0.027
hing+0.026
Neg logit contributions
ber-0.021
Just-0.020
rd-0.019
 belts-0.019
ves-0.019
Token,
Token ID11
Feature activation+1.75e-02

Pos logit contributions
ber+0.009
Just+0.009
rd+0.009
 belts+0.009
edly+0.009
Neg logit contributions
 need-0.007
umb-0.006
py-0.006
 apart-0.006
umper-0.006
Token something
Token ID1223
Feature activation-5.88e-03

Pos logit contributions
umb+0.028
 need+0.027
py+0.027
umper+0.026
hing+0.025
Neg logit contributions
Just-0.029
ber-0.029
 belts-0.028
ager-0.027
ves-0.027
Token bad
Token ID2089
Feature activation+3.47e-02

Pos logit contributions
ber+0.012
 belts+0.011
rd+0.011
edly+0.011
aj+0.011
Neg logit contributions
 need-0.008
py-0.007
 scatter-0.007
 strongest-0.006
 earthquake-0.006
Token happened
Token ID3022
Feature activation+2.74e-02

Pos logit contributions
umb+0.065
umper+0.062
 need+0.061
py+0.060
 apart+0.058
Neg logit contributions
ber-0.046
ves-0.045
 belts-0.045
ager-0.044
Just-0.044
Token to
Token ID284
Feature activation+1.24e-02

Pos logit contributions
 need+0.038
umb+0.034
 apart+0.033
py+0.033
ained+0.031
Neg logit contributions
ival-0.052
ber-0.052
becca-0.051
 belts-0.049
rd-0.049
Token it
Token ID340
Feature activation+1.33e-02

Pos logit contributions
umb+0.021
umper+0.020
py+0.019
 strongest+0.018
ained+0.018
Neg logit contributions
ves-0.018
Just-0.018
 belts-0.017
ber-0.017
 nicely-0.017
Token.
Token ID13
Feature activation+4.63e-03

Pos logit contributions
umb+0.029
umper+0.029
py+0.028
 need+0.027
emaker+0.026
Neg logit contributions
Just-0.013
ber-0.013
ves-0.012
 belts-0.012
 nicely-0.012
Token All
Token ID1439
Feature activation+1.70e-03

Pos logit contributions
 need+0.006
umb+0.006
py+0.006
umper+0.006
 apart+0.005
Neg logit contributions
ber-0.009
 belts-0.009
Just-0.008
ager-0.008
rd-0.008
Token of
Token ID286
Feature activation+1.31e-02

Pos logit contributions
umb+0.003
umper+0.003
py+0.003
 affect+0.003
 seeker+0.003
Neg logit contributions
 else-0.002
ber-0.002
--0.002
rd-0.002
ves-0.002
Token a
Token ID257
Feature activation-1.19e-02

Pos logit contributions
umb+0.021
 need+0.020
umper+0.019
py+0.019
hing+0.018
Neg logit contributions
Just-0.021
ves-0.020
ber-0.020
 belts-0.020
edly-0.019
Token sudden
Token ID4802
Feature activation+1.06e-02

Pos logit contributions
ber+0.016
Just+0.016
 belts+0.015
rd+0.015
edly+0.015
Neg logit contributions
 need-0.022
umb-0.022
py-0.021
umper-0.021
 crash-0.020
Token thought
Token ID1807
Feature activation-7.97e-04

Pos logit contributions
ber+0.011
rd+0.011
Just+0.011
ival+0.010
 belts+0.010
Neg logit contributions
 need-0.010
umb-0.009
py-0.009
umper-0.009
 apart-0.008
Token he
Token ID339
Feature activation-1.23e-02

Pos logit contributions
Just+0.001
ber+0.001
ager+0.001
rd+0.001
 belts+0.001
Neg logit contributions
 need-0.001
umb-0.001
py-0.001
umper-0.001
 apart-0.001
Token could
Token ID714
Feature activation+1.91e-02

Pos logit contributions
ber+0.018
Just+0.018
 belts+0.017
ager+0.017
rd+0.017
Neg logit contributions
umb-0.021
 need-0.021
py-0.020
umper-0.020
 apart-0.019
Token eat
Token ID4483
Feature activation+7.89e-03

Pos logit contributions
umb+0.029
umper+0.028
py+0.028
hing+0.027
 apart+0.026
Neg logit contributions
Just-0.032
ber-0.032
ves-0.030
ager-0.030
 belts-0.029
Token it
Token ID340
Feature activation+1.22e-02

Pos logit contributions
 need+0.012
umb+0.011
py+0.011
 apart+0.011
umper+0.010
Neg logit contributions
Just-0.014
ber-0.014
ager-0.013
rd-0.013
 belts-0.013
Token later
Token ID1568
Feature activation+1.40e-02

Pos logit contributions
umb+0.022
 need+0.022
py+0.021
umper+0.021
hing+0.020
Neg logit contributions
ber-0.017
Just-0.016
 belts-0.016
rd-0.016
ager-0.015
Token when
Token ID618
Feature activation+4.60e-02

Pos logit contributions
 need+0.023
umb+0.023
py+0.022
 apart+0.022
umper+0.021
Neg logit contributions
Just-0.021
ber-0.021
ager-0.020
 belts-0.020
rd-0.020
Token no
Token ID645
Feature activation+1.81e-02

Pos logit contributions
 need+0.071
umb+0.068
py+0.066
umper+0.064
 apart+0.062
Neg logit contributions
Just-0.077
ber-0.076
ager-0.074
 belts-0.074
rd-0.073
Token one
Token ID530
Feature activation+5.85e-03

Pos logit contributions
umb+0.034
umper+0.033
py+0.033
hing+0.032
 need+0.032
Neg logit contributions
ber-0.025
 belts-0.023
Just-0.023
rd-0.022
ves-0.022
Token was
Token ID373
Feature activation+3.83e-02

Pos logit contributions
 need+0.009
umb+0.008
py+0.008
umper+0.007
 apart+0.007
Neg logit contributions
Just-0.011
 belts-0.010
ber-0.010
uddled-0.010
rd-0.010
Token looking
Token ID2045
Feature activation+8.58e-03

Pos logit contributions
 need+0.054
umb+0.053
py+0.051
umper+0.050
 apart+0.049
Neg logit contributions
Just-0.069
ber-0.068
 belts-0.067
rd-0.066
ager-0.066
Token who
Token ID508
Feature activation-3.14e-02

Pos logit contributions
 need+0.033
umb+0.031
py+0.030
umper+0.030
 apart+0.029
Neg logit contributions
 belts-0.028
Just-0.028
ber-0.027
edly-0.027
ves-0.027
Token was
Token ID373
Feature activation+2.26e-02

Pos logit contributions
ber+0.042
Just+0.040
ager+0.039
 belts+0.039
ves+0.038
Neg logit contributions
umb-0.059
py-0.057
 need-0.056
umper-0.055
 apart-0.053
Token crying
Token ID13774
Feature activation+2.68e-02

Pos logit contributions
umb+0.037
 need+0.035
hing+0.035
umper+0.034
py+0.034
Neg logit contributions
ber-0.036
Just-0.035
rd-0.032
 belts-0.032
ager-0.032
Token because
Token ID780
Feature activation+1.75e-03

Pos logit contributions
umb+0.062
umper+0.060
 need+0.058
emaker+0.057
py+0.056
Neg logit contributions
edly-0.023
 belts-0.023
Just-0.023
--0.023
ber-0.023
Token he
Token ID339
Feature activation+8.39e-04

Pos logit contributions
umb+0.003
 need+0.003
py+0.003
umper+0.003
hing+0.003
Neg logit contributions
ber-0.003
Just-0.003
 belts-0.002
rd-0.002
ves-0.002
Token lost
Token ID2626
Feature activation+3.07e-02

Pos logit contributions
umb+0.001
 need+0.001
py+0.001
umper+0.001
hing+0.001
Neg logit contributions
Just-0.001
ber-0.001
ager-0.001
 belts-0.001
rd-0.001
Token his
Token ID465
Feature activation-7.83e-04

Pos logit contributions
umb+0.047
 need+0.046
py+0.045
umper+0.044
hing+0.043
Neg logit contributions
ber-0.050
Just-0.050
 belts-0.049
rd-0.048
edly-0.048
Token toy
Token ID13373
Feature activation-1.01e-02

Pos logit contributions
Just+0.001
ber+0.001
 belts+0.001
rd+0.001
ager+0.001
Neg logit contributions
 need-0.001
umb-0.001
py-0.001
umper-0.001
 apart-0.001
Token.
Token ID13
Feature activation+9.78e-03

Pos logit contributions
ber+0.011
 belts+0.011
edly+0.010
Just+0.010
uddled+0.009
Neg logit contributions
umb-0.021
umper-0.021
hing-0.020
 need-0.020
py-0.020
Token Lily
Token ID20037
Feature activation+1.82e-02

Pos logit contributions
 need+0.012
umb+0.012
py+0.011
umper+0.011
 apart+0.010
Neg logit contributions
ber-0.019
 belts-0.019
Just-0.019
ager-0.019
ves-0.019
Token felt
Token ID2936
Feature activation+4.17e-02

Pos logit contributions
umb+0.035
 need+0.032
hing+0.032
py+0.032
umper+0.031
Neg logit contributions
Just-0.026
ber-0.025
ager-0.024
uddled-0.023
 belts-0.023
Token It
Token ID632
Feature activation-7.63e-03

Pos logit contributions
ber+0.009
uddled+0.009
rd+0.009
 belts+0.009
ager+0.009
Neg logit contributions
 need-0.007
umb-0.007
py-0.007
 apart-0.006
 crash-0.006
Token was
Token ID373
Feature activation+1.09e-02

Pos logit contributions
ber+0.013
Just+0.013
 belts+0.012
rd+0.012
ager+0.012
Neg logit contributions
 need-0.011
umb-0.011
py-0.011
umper-0.011
 apart-0.010
Token trying
Token ID2111
Feature activation-1.65e-02

Pos logit contributions
umb+0.016
umper+0.016
 need+0.016
py+0.015
hing+0.015
Neg logit contributions
Just-0.019
ber-0.018
 belts-0.018
edly-0.017
rd-0.017
Token to
Token ID284
Feature activation+1.03e-02

Pos logit contributions
ber+0.021
ival+0.020
rd+0.020
ager+0.019
 belts+0.019
Neg logit contributions
 need-0.034
 apart-0.032
umb-0.031
py-0.031
 crash-0.030
Token steal
Token ID8711
Feature activation+6.69e-03

Pos logit contributions
umb+0.016
umper+0.015
py+0.015
 need+0.014
ained+0.014
Neg logit contributions
Just-0.017
ves-0.015
ber-0.015
ager-0.015
edly-0.015
Token the
Token ID262
Feature activation+1.47e-02

Pos logit contributions
 need+0.009
 apart+0.009
umb+0.009
py+0.009
umper+0.008
Neg logit contributions
ber-0.013
Just-0.013
uddled-0.012
ager-0.012
rd-0.012
Token oats
Token ID47009
Feature activation-1.59e-02

Pos logit contributions
umb+0.025
 need+0.025
py+0.025
umper+0.024
hing+0.024
Neg logit contributions
ber-0.021
Just-0.021
 belts-0.021
rd-0.020
ager-0.020
Token!
Token ID0
Feature activation-1.21e-02

Pos logit contributions
ber+0.023
Just+0.023
 belts+0.022
ager+0.022
rd+0.022
Neg logit contributions
umb-0.027
 need-0.027
py-0.027
umper-0.026
hing-0.025
Token
Token ID198
Feature activation-9.62e-03

Pos logit contributions
ber+0.019
Just+0.019
 belts+0.019
rd+0.018
uddled+0.018
Neg logit contributions
 need-0.020
umb-0.019
py-0.018
umper-0.018
 apart-0.018
Token
Token ID198
Feature activation+5.84e-03

Pos logit contributions
uddled+0.016
 belts+0.016
ber+0.015
edly+0.015
rd+0.015
Neg logit contributions
 need-0.015
umb-0.015
py-0.014
 crash-0.014
umper-0.013
TokenMark
Token ID9704
Feature activation-2.93e-02

Pos logit contributions
 need+0.009
 crash+0.008
umb+0.008
hing+0.008
py+0.008
Neg logit contributions
uddled-0.010
aj-0.010
edly-0.010
 belts-0.009
ber-0.009
Token are
Token ID389
Feature activation+1.61e-02

Pos logit contributions
ber+0.053
Just+0.052
 belts+0.052
ager+0.052
ves+0.050
Neg logit contributions
umb-0.031
py-0.030
umper-0.029
 need-0.028
 apart-0.026
Token strong
Token ID1913
Feature activation+1.16e-03

Pos logit contributions
umb+0.024
 need+0.024
py+0.023
umper+0.022
hing+0.021
Neg logit contributions
ber-0.028
Just-0.027
 belts-0.026
rd-0.026
ager-0.026
Token too
Token ID1165
Feature activation-8.64e-03

Pos logit contributions
umb+0.002
py+0.002
umper+0.002
hing+0.002
 need+0.002
Neg logit contributions
ber-0.002
 belts-0.002
Just-0.002
edly-0.002
rd-0.002
Token,
Token ID11
Feature activation-1.61e-03

Pos logit contributions
ber+0.014
Just+0.013
 belts+0.013
ager+0.013
ves+0.013
Neg logit contributions
umb-0.014
py-0.014
umper-0.014
 need-0.014
hing-0.013
Token Danny
Token ID15105
Feature activation-8.59e-03

Pos logit contributions
ber+0.003
Just+0.003
ager+0.002
 belts+0.002
rd+0.002
Neg logit contributions
 need-0.003
umb-0.003
py-0.002
umper-0.002
 apart-0.002
Token.
Token ID13
Feature activation-2.33e-03

Pos logit contributions
ber+0.013
Just+0.013
 belts+0.012
ager+0.012
rd+0.012
Neg logit contributions
umb-0.015
 need-0.014
py-0.014
umper-0.014
hing-0.013
Token That
Token ID1320
Feature activation-3.30e-02

Pos logit contributions
ber+0.004
Just+0.004
 belts+0.004
rd+0.004
ager+0.004
Neg logit contributions
umb-0.003
 need-0.003
py-0.003
umper-0.003
hing-0.003
Token is
Token ID318
Feature activation+4.62e-03

Pos logit contributions
ber+0.052
Just+0.051
 belts+0.050
ager+0.049
rd+0.049
Neg logit contributions
umb-0.054
 need-0.052
py-0.052
umper-0.051
hing-0.048
Token why
Token ID1521
Feature activation-9.03e-03

Pos logit contributions
 need+0.007
umb+0.007
py+0.007
umper+0.007
 apart+0.006
Neg logit contributions
ber-0.008
Just-0.008
rd-0.008
ager-0.007
 belts-0.007
Token it
Token ID340
Feature activation+1.95e-02

Pos logit contributions
ber+0.014
Just+0.014
 belts+0.013
rd+0.013
ager+0.013
Neg logit contributions
umb-0.015
 need-0.015
py-0.014
umper-0.014
hing-0.013
Token is
Token ID318
Feature activation+2.75e-02

Pos logit contributions
umb+0.036
umper+0.034
py+0.034
 need+0.032
hing+0.032
Neg logit contributions
ber-0.026
Just-0.026
ager-0.025
 belts-0.024
ves-0.024
Token day
Token ID1110
Feature activation-1.83e-02

Pos logit contributions
umper+0.014
umb+0.014
py+0.014
hing+0.013
 need+0.013
Neg logit contributions
Just-0.012
ager-0.012
ber-0.012
rd-0.012
ves-0.011
Token on
Token ID319
Feature activation-2.14e-03

Pos logit contributions
Just+0.026
ber+0.024
ves+0.024
ager+0.022
 belts+0.022
Neg logit contributions
umb-0.032
umper-0.032
py-0.031
 need-0.031
 affect-0.030
Token,
Token ID11
Feature activation+6.13e-04

Pos logit contributions
ves+0.002
ber+0.002
Just+0.002
rd+0.002
 belts+0.002
Neg logit contributions
umb-0.004
umper-0.004
hing-0.004
 need-0.004
py-0.004
Token the
Token ID262
Feature activation+2.25e-04

Pos logit contributions
umb+0.001
umper+0.001
py+0.001
hing+0.001
ained+0.001
Neg logit contributions
 nicely-0.001
ber-0.001
Just-0.001
 belts-0.001
ves-0.001
Token smart
Token ID4451
Feature activation-4.58e-03

Pos logit contributions
umb+0.000
 need+0.000
umper+0.000
py+0.000
 apart+0.000
Neg logit contributions
Just-0.000
ber-0.000
ager-0.000
 belts-0.000
ves-0.000
Token fox
Token ID21831
Feature activation-1.02e-02

Pos logit contributions
ber+0.007
Just+0.007
ager+0.007
edly+0.007
ves+0.007
Neg logit contributions
umb-0.007
 need-0.007
 apart-0.007
hing-0.007
umper-0.007
Token and
Token ID290
Feature activation-6.65e-03

Pos logit contributions
ber+0.014
Just+0.014
ager+0.013
 belts+0.013
rd+0.013
Neg logit contributions
umb-0.019
 need-0.018
py-0.018
umper-0.018
 apart-0.017
Token the
Token ID262
Feature activation+8.43e-03

Pos logit contributions
ber+0.010
 belts+0.009
Just+0.009
edly+0.009
ager+0.009
Neg logit contributions
umb-0.012
umper-0.011
hing-0.010
ained-0.010
py-0.010
Token little
Token ID1310
Feature activation+4.33e-02

Pos logit contributions
 need+0.014
umb+0.014
py+0.013
umper+0.013
 apart+0.012
Neg logit contributions
ber-0.013
Just-0.013
 belts-0.013
rd-0.013
ager-0.012
Token girl
Token ID2576
Feature activation+1.47e-02

Pos logit contributions
umb+0.083
py+0.079
hing+0.079
umper+0.078
 need+0.076
Neg logit contributions
ber-0.056
Just-0.053
ager-0.051
 belts-0.051
edly-0.050
Token were
Token ID547
Feature activation+5.84e-03

Pos logit contributions
umb+0.023
umper+0.023
py+0.022
 need+0.022
 apart+0.021
Neg logit contributions
ber-0.024
ager-0.023
Just-0.023
 belts-0.022
ves-0.022
Token hands
Token ID2832
Feature activation+5.52e-04

Pos logit contributions
 need+0.006
umb+0.005
 apart+0.005
py+0.005
umper+0.005
Neg logit contributions
Just-0.007
ber-0.006
rd-0.006
ager-0.006
edly-0.006
Token and
Token ID290
Feature activation-2.89e-02

Pos logit contributions
 apart+0.001
 need+0.001
umb+0.001
 scatter+0.000
hing+0.000
Neg logit contributions
ager-0.001
rd-0.001
Just-0.001
Really-0.001
ival-0.001
Token ran
Token ID4966
Feature activation+8.74e-03

Pos logit contributions
Just+0.051
ber+0.051
ival+0.050
rd+0.049
ager+0.048
Neg logit contributions
 need-0.044
 apart-0.040
 scatter-0.039
py-0.038
umb-0.037
Token to
Token ID284
Feature activation-1.67e-02

Pos logit contributions
 apart+0.014
 need+0.012
 crash+0.012
 scatter+0.011
 lose+0.011
Neg logit contributions
becca-0.016
ber-0.016
rd-0.016
Just-0.016
Really-0.016
Token their
Token ID511
Feature activation-2.12e-02

Pos logit contributions
Just+0.035
ber+0.035
rd+0.034
ager+0.034
uddled+0.034
Neg logit contributions
 need-0.021
umb-0.018
 apart-0.018
py-0.017
 crash-0.017
Token ship
Token ID4074
Feature activation+2.29e-03

Pos logit contributions
Just+0.044
ager+0.040
ves+0.040
ival+0.040
rd+0.039
Neg logit contributions
 need-0.028
 apart-0.026
 lose-0.025
 scatter-0.025
 crash-0.024
Token.
Token ID13
Feature activation+2.24e-02

Pos logit contributions
 apart+0.003
 need+0.003
umb+0.003
py+0.003
 crash+0.003
Neg logit contributions
Really-0.005
becca-0.005
Just-0.004
Re-0.004
rd-0.004
Token They
Token ID1119
Feature activation+1.15e-03

Pos logit contributions
 need+0.019
umb+0.017
 apart+0.016
py+0.014
 crash+0.014
Neg logit contributions
ber-0.054
ager-0.054
Just-0.053
rd-0.053
ves-0.051
Token sailed
Token ID33835
Feature activation-4.16e-03

Pos logit contributions
umb+0.002
 need+0.002
py+0.002
umper+0.002
 apart+0.002
Neg logit contributions
ber-0.002
Just-0.002
 belts-0.002
rd-0.002
ager-0.002
Token away
Token ID1497
Feature activation+8.12e-03

Pos logit contributions
rd+0.008
Just+0.008
becca+0.008
ager+0.008
uddled+0.008
Neg logit contributions
 apart-0.006
 need-0.005
 crash-0.005
umb-0.005
py-0.005
Token,
Token ID11
Feature activation-1.38e-02

Pos logit contributions
 need+0.013
umb+0.012
py+0.012
 apart+0.012
umper+0.011
Neg logit contributions
Just-0.014
ber-0.013
rd-0.013
ager-0.013
 belts-0.013
Token Can
Token ID1680
Feature activation-1.83e-02

Pos logit contributions
 need+0.018
umb+0.018
py+0.017
umper+0.017
 apart+0.015
Neg logit contributions
ber-0.027
Just-0.026
 belts-0.026
ves-0.026
ager-0.026
Token you
Token ID345
Feature activation-7.96e-03

Pos logit contributions
ber+0.034
Just+0.033
ager+0.033
rd+0.033
edly+0.033
Neg logit contributions
 need-0.025
py-0.023
umb-0.023
umper-0.022
 crash-0.021
Token please
Token ID3387
Feature activation-1.79e-02

Pos logit contributions
Just+0.011
ager+0.011
ber+0.010
 nicely+0.010
rd+0.010
Neg logit contributions
umb-0.014
py-0.014
umper-0.013
ained-0.013
hing-0.013
Token help
Token ID1037
Feature activation+1.09e-02

Pos logit contributions
Just+0.030
ber+0.030
 belts+0.028
ager+0.028
rd+0.028
Neg logit contributions
umb-0.027
py-0.026
 need-0.025
umper-0.025
hing-0.024
Token me
Token ID502
Feature activation-2.55e-02

Pos logit contributions
umb+0.024
 need+0.022
py+0.022
ained+0.022
hing+0.022
Neg logit contributions
Just-0.011
ber-0.011
ves-0.011
becca-0.010
 belts-0.010
Token?"
Token ID1701
Feature activation-1.88e-02

Pos logit contributions
Just+0.048
ber+0.048
 belts+0.045
ager+0.045
 nicely+0.045
Neg logit contributions
umb-0.032
py-0.031
 need-0.030
umper-0.029
hing-0.028
Token
Token ID198
Feature activation-8.78e-03

Pos logit contributions
 belts+0.029
ber+0.029
Just+0.028
ager+0.028
rd+0.028
Neg logit contributions
 need-0.031
umb-0.029
umper-0.029
py-0.029
 apart-0.027
Token
Token ID198
Feature activation+8.04e-03

Pos logit contributions
 belts+0.015
uddled+0.014
edly+0.014
ber+0.014
rd+0.014
Neg logit contributions
 need-0.014
umper-0.013
umb-0.013
py-0.013
 crash-0.011
TokenThe
Token ID464
Feature activation+1.07e-02

Pos logit contributions
umper+0.013
 need+0.013
umb+0.012
py+0.012
hing+0.011
Neg logit contributions
uddled-0.013
edly-0.012
 belts-0.012
rd-0.011
 findings-0.011
Token clever
Token ID14169
Feature activation+1.24e-02

Pos logit contributions
 need+0.016
umb+0.016
py+0.016
umper+0.015
 strongest+0.014
Neg logit contributions
Just-0.018
rd-0.017
ber-0.017
ves-0.017
 belts-0.017
Token fish
Token ID5916
Feature activation+3.32e-03

Pos logit contributions
 need+0.019
umb+0.018
py+0.017
umper+0.017
 apart+0.016
Neg logit contributions
Just-0.022
 belts-0.021
rd-0.021
ber-0.021
uddled-0.020
Token
Token ID220
Feature activation-1.94e-02

Pos logit contributions
ber+0.019
 belts+0.019
ager+0.019
ves+0.019
Just+0.018
Neg logit contributions
 need-0.017
umb-0.017
umper-0.016
py-0.016
 apart-0.015
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
ber+0.028
ager+0.028
 belts+0.028
rd+0.027
edly+0.027
Neg logit contributions
umb-0.034
 need-0.034
umper-0.032
py-0.031
 apart-0.031
Token
Token ID198
Feature activation+3.42e-03

Pos logit contributions
 belts+0.022
ber+0.021
uddled+0.020
edly+0.019
ager+0.019
Neg logit contributions
umb-0.017
umper-0.017
 need-0.016
py-0.016
ickets-0.015
TokenMom
Token ID29252
Feature activation-4.97e-03

Pos logit contributions
umb+0.005
umper+0.005
 need+0.005
py+0.005
ickets+0.005
Neg logit contributions
 belts-0.006
ber-0.005
uddled-0.005
edly-0.005
aj-0.005
Tokenmy
Token ID1820
Feature activation-3.48e-03

Pos logit contributions
ber+0.012
Just+0.012
 belts+0.012
rd+0.012
ager+0.012
Neg logit contributions
 need-0.004
umb-0.004
py-0.004
umper-0.003
hing-0.003
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
 belts+0.005
rd+0.005
ber+0.005
Just+0.005
edly+0.005
Neg logit contributions
 need-0.006
py-0.006
umb-0.006
umper-0.006
hing-0.005
Token Daddy
Token ID26926
Feature activation+1.76e-02

Pos logit contributions
Just+0.025
ber+0.024
ves+0.024
rd+0.024
ager+0.024
Neg logit contributions
 need-0.024
umb-0.023
py-0.023
umper-0.023
 apart-0.021
Token were
Token ID547
Feature activation+1.07e-04

Pos logit contributions
umb+0.026
 need+0.026
py+0.025
umper+0.025
 apart+0.023
Neg logit contributions
ber-0.030
Just-0.030
 belts-0.029
rd-0.028
ager-0.028
Token so
Token ID523
Feature activation-3.87e-02

Pos logit contributions
 need+0.000
umb+0.000
py+0.000
umper+0.000
 apart+0.000
Neg logit contributions
ber-0.000
Just-0.000
 belts-0.000
ager-0.000
rd-0.000
Token happy
Token ID3772
Feature activation-1.07e-02

Pos logit contributions
Just+0.056
 belts+0.056
ber+0.056
ager+0.054
edly+0.054
Neg logit contributions
 need-0.069
umb-0.065
py-0.061
 apart-0.059
hing-0.059
Token and
Token ID290
Feature activation-1.99e-02

Pos logit contributions
ival+0.018
ber+0.017
Just+0.017
ves+0.017
 belts+0.017
Neg logit contributions
 need-0.016
py-0.015
 apart-0.015
umb-0.015
 scatter-0.014
Token
Token ID198
Feature activation+1.08e-02

Pos logit contributions
 belts+0.019
ber+0.019
edly+0.019
ager+0.018
uddled+0.018
Neg logit contributions
 need-0.021
umb-0.021
py-0.020
umper-0.019
hing-0.018
TokenOne
Token ID3198
Feature activation-1.04e-02

Pos logit contributions
 need+0.017
umb+0.016
py+0.016
hing+0.015
umper+0.015
Neg logit contributions
ber-0.018
uddled-0.017
edly-0.017
 belts-0.017
aj-0.016
Token day
Token ID1110
Feature activation-9.77e-03

Pos logit contributions
ber+0.020
Just+0.020
 belts+0.019
rd+0.019
ager+0.019
Neg logit contributions
 need-0.013
umb-0.013
py-0.013
umper-0.012
hing-0.012
Token,
Token ID11
Feature activation+9.97e-03

Pos logit contributions
Just+0.017
ber+0.017
 belts+0.017
rd+0.017
ager+0.017
Neg logit contributions
 need-0.014
umb-0.013
py-0.013
 apart-0.012
umper-0.012
Token the
Token ID262
Feature activation+1.66e-02

Pos logit contributions
 need+0.014
umb+0.014
py+0.013
umper+0.013
 apart+0.012
Neg logit contributions
ber-0.018
Just-0.018
rd-0.017
 belts-0.017
ager-0.017
Token zoo
Token ID26626
Feature activation-4.10e-02

Pos logit contributions
 need+0.021
py+0.020
umb+0.018
 strongest+0.018
 crash+0.018
Neg logit contributions
Just-0.033
ber-0.033
edly-0.032
uddled-0.031
ager-0.031
Token keeper
Token ID28356
Feature activation-1.12e-02

Pos logit contributions
Just+0.067
 belts+0.064
ber+0.064
rd+0.063
edly+0.063
Neg logit contributions
 need-0.065
umb-0.063
py-0.061
 apart-0.058
umper-0.057
Token noticed
Token ID6810
Feature activation-1.32e-02

Pos logit contributions
ber+0.016
Just+0.015
ager+0.015
ival+0.014
ves+0.014
Neg logit contributions
umb-0.021
py-0.020
 need-0.020
umper-0.019
hing-0.019
Token that
Token ID326
Feature activation+1.10e-03

Pos logit contributions
ber+0.023
Just+0.023
rd+0.022
ager+0.022
 belts+0.021
Neg logit contributions
 need-0.019
py-0.019
umb-0.019
umper-0.018
 apart-0.017
Token some
Token ID617
Feature activation-7.41e-03

Pos logit contributions
umb+0.002
 need+0.002
py+0.001
umper+0.001
hing+0.001
Neg logit contributions
ber-0.002
Just-0.002
 belts-0.002
rd-0.002
ager-0.002
Token of
Token ID286
Feature activation+7.08e-03

Pos logit contributions
ber+0.013
Just+0.013
 belts+0.013
ager+0.013
rd+0.013
Neg logit contributions
 need-0.010
umb-0.010
py-0.010
umper-0.009
hing-0.009
Token?"
Token ID1701
Feature activation-8.81e-03

Pos logit contributions
 need+0.067
umb+0.065
py+0.064
umper+0.062
 apart+0.060
Neg logit contributions
Just-0.061
ber-0.060
 belts-0.059
rd-0.058
ager-0.058
Token Jack
Token ID3619
Feature activation+1.64e-02

Pos logit contributions
ber+0.014
 belts+0.014
ager+0.014
Just+0.014
uddled+0.014
Neg logit contributions
 need-0.014
py-0.013
umb-0.013
umper-0.013
 apart-0.012
Token said
Token ID531
Feature activation+2.64e-03

Pos logit contributions
 need+0.028
umb+0.027
py+0.027
umper+0.026
 apart+0.025
Neg logit contributions
ber-0.025
Just-0.024
 belts-0.024
rd-0.023
ves-0.023
Token,
Token ID11
Feature activation-2.65e-02

Pos logit contributions
 need+0.003
 apart+0.003
py+0.002
umb+0.002
 crash+0.002
Neg logit contributions
becca-0.006
ival-0.006
 belts-0.006
ager-0.005
Just-0.005
Token "
Token ID366
Feature activation-2.60e-02

Pos logit contributions
 belts+0.039
rd+0.039
ival+0.038
ves+0.038
ager+0.038
Neg logit contributions
 need-0.046
py-0.043
umb-0.043
 apart-0.043
 crash-0.042
TokenYes
Token ID5297
Feature activation-4.44e-02

Pos logit contributions
ves+0.045
uddled+0.045
 belts+0.043
aj+0.042
ork+0.041
Neg logit contributions
 need-0.036
umb-0.036
umper-0.035
hing-0.034
 strongest-0.034
Token,
Token ID11
Feature activation-1.55e-02

Pos logit contributions
ber+0.071
Just+0.070
 belts+0.068
rd+0.067
ager+0.067
Neg logit contributions
umb-0.072
 need-0.071
py-0.069
umper-0.068
hing-0.064
Token we
Token ID356
Feature activation-2.20e-02

Pos logit contributions
ber+0.030
Just+0.030
ager+0.029
edly+0.029
rd+0.029
Neg logit contributions
 need-0.020
py-0.019
umb-0.019
umper-0.017
 apart-0.017
Token can
Token ID460
Feature activation-3.43e-03

Pos logit contributions
ager+0.038
ber+0.038
Just+0.037
 belts+0.037
ival+0.036
Neg logit contributions
umb-0.031
py-0.030
umper-0.030
 apart-0.026
emaker-0.026
Token.
Token ID13
Feature activation-4.31e-03

Pos logit contributions
ber+0.006
edly+0.006
Just+0.006
 belts+0.006
rd+0.006
Neg logit contributions
 need-0.005
umb-0.004
py-0.004
hing-0.004
 scatter-0.004
Token I
Token ID314
Feature activation-1.76e-02

Pos logit contributions
ber+0.008
ager+0.008
uddled+0.008
ves+0.008
 belts+0.008
Neg logit contributions
 need-0.006
umb-0.005
py-0.005
umper-0.005
hing-0.005
Token!"
Token ID2474
Feature activation-8.80e-03

Pos logit contributions
umb+0.044
 need+0.043
py+0.042
umper+0.041
 apart+0.039
Neg logit contributions
ber-0.050
Just-0.049
 belts-0.048
ager-0.047
rd-0.047
Token she
Token ID673
Feature activation+1.20e-03

Pos logit contributions
ager+0.016
ber+0.016
Just+0.016
rd+0.016
edly+0.015
Neg logit contributions
 need-0.013
umb-0.011
py-0.011
 crash-0.010
 apart-0.010
Token says
Token ID1139
Feature activation-2.13e-02

Pos logit contributions
 need+0.002
umb+0.002
py+0.002
umper+0.002
 apart+0.002
Neg logit contributions
ber-0.002
Just-0.002
 belts-0.002
rd-0.002
ager-0.002
Token.
Token ID13
Feature activation+6.06e-03

Pos logit contributions
becca+0.042
ival+0.042
ager+0.041
Re+0.040
rd+0.040
Neg logit contributions
 apart-0.027
 need-0.027
 crash-0.023
umb-0.023
py-0.022
Token "
Token ID366
Feature activation-3.54e-02

Pos logit contributions
 need+0.010
umb+0.010
 apart+0.009
umper+0.009
py+0.009
Neg logit contributions
rd-0.010
ager-0.009
 belts-0.009
ber-0.009
ival-0.009
TokenYou
Token ID1639
Feature activation-4.06e-02

Pos logit contributions
ber+0.048
Just+0.047
 belts+0.046
rd+0.045
ager+0.045
Neg logit contributions
umb-0.065
 need-0.065
py-0.062
umper-0.062
hing-0.060
Token two
Token ID734
Feature activation+1.72e-02

Pos logit contributions
ager+0.072
ival+0.068
Just+0.065
five+0.064
 else+0.063
Neg logit contributions
py-0.058
umb-0.057
umper-0.055
emaker-0.053
ained-0.052
Token worked
Token ID3111
Feature activation+5.73e-03

Pos logit contributions
umb+0.022
umper+0.022
py+0.021
ained+0.019
emaker+0.018
Neg logit contributions
Just-0.032
 belts-0.032
ber-0.032
ves-0.032
rd-0.031
Token very
Token ID845
Feature activation-3.55e-02

Pos logit contributions
 need+0.008
 apart+0.007
umb+0.007
py+0.007
umper+0.007
Neg logit contributions
ber-0.011
rd-0.011
Just-0.011
edly-0.010
ager-0.010
Token well
Token ID880
Feature activation-2.04e-02

Pos logit contributions
rd+0.068
ber+0.068
Just+0.066
ager+0.065
uddled+0.065
Neg logit contributions
 need-0.046
umb-0.042
py-0.040
 apart-0.040
 strongest-0.038
Token together
Token ID1978
Feature activation-7.64e-03

Pos logit contributions
Just+0.033
ber+0.033
rd+0.032
ager+0.031
 belts+0.031
Neg logit contributions
 need-0.032
umb-0.031
py-0.030
 apart-0.029
umper-0.029
Token,
Token ID11
Feature activation-2.91e-02

Pos logit contributions
 need+0.001
umb+0.001
py+0.001
umper+0.001
 apart+0.001
Neg logit contributions
Just-0.002
ager-0.002
edly-0.002
 belts-0.001
ber-0.001
Token
Token ID6184
Feature activation-1.72e-02

Pos logit contributions
rd+0.049
ager+0.049
ves+0.048
 belts+0.048
uddled+0.047
Neg logit contributions
 need-0.048
 apart-0.042
 Sammy-0.042
py-0.042
umper-0.041
Token
Token ID95
Feature activation+1.89e-03

Pos logit contributions
ber+0.026
 belts+0.025
ves+0.025
rd+0.024
uddled+0.024
Neg logit contributions
 need-0.028
umb-0.027
hing-0.027
umper-0.026
 earthquake-0.026
Token
Token ID26391
Feature activation-5.23e-02

Pos logit contributions
umb+0.003
 need+0.003
py+0.003
hing+0.003
umper+0.003
Neg logit contributions
ber-0.003
uddled-0.003
 belts-0.003
ves-0.003
edly-0.003
Token
Token ID129
Feature activation+1.01e-02

Pos logit contributions
edly+0.066
ber+0.062
five+0.059
 belts+0.059
Just+0.059
Neg logit contributions
 need-0.102
umb-0.098
py-0.092
umper-0.091
 apart-0.089
Token
Token ID241
Feature activation-4.61e-02

Pos logit contributions
umb+0.018
py+0.018
 need+0.018
umper+0.017
hing+0.017
Neg logit contributions
ber-0.015
ival-0.013
 belts-0.013
uddled-0.013
edly-0.013
TokenMom
Token ID29252
Feature activation+5.72e-03

Pos logit contributions
uddled+0.063
ber+0.062
ival+0.062
 belts+0.061
rd+0.061
Neg logit contributions
 need-0.082
umb-0.081
umper-0.074
py-0.073
hing-0.071
Tokenmy
Token ID1820
Feature activation+1.13e-02

Pos logit contributions
umb+0.009
 need+0.009
py+0.009
umper+0.009
hing+0.008
Neg logit contributions
ber-0.009
Just-0.009
 belts-0.009
rd-0.009
ager-0.009
Token,
Token ID11
Feature activation-2.71e-02

Pos logit contributions
 need+0.018
umb+0.017
py+0.017
umper+0.016
 apart+0.016
Neg logit contributions
Just-0.018
ber-0.018
 belts-0.018
rd-0.017
edly-0.017
Token why
Token ID1521
Feature activation-2.71e-02

Pos logit contributions
Just+0.045
ber+0.044
ager+0.043
 belts+0.043
rd+0.043
Neg logit contributions
 need-0.042
umb-0.041
py-0.040
umper-0.039
hing-0.037
Token are
Token ID389
Feature activation+7.04e-03

Pos logit contributions
ber+0.045
 belts+0.044
ves+0.043
ager+0.042
Just+0.042
Neg logit contributions
umb-0.043
py-0.041
umper-0.040
 apart-0.037
 need-0.037
Token,
Token ID11
Feature activation-3.45e-02

Pos logit contributions
Just+0.050
 belts+0.050
ager+0.050
ber+0.049
becca+0.048
Neg logit contributions
 need-0.023
 apart-0.020
 crash-0.019
 scatter-0.019
py-0.019
Token
Token ID6184
Feature activation-2.68e-02

Pos logit contributions
rd+0.065
ager+0.064
ber+0.061
ival+0.060
 belts+0.058
Neg logit contributions
 crash-0.045
 need-0.044
 apart-0.042
 Sammy-0.041
py-0.039
Token
Token ID95
Feature activation-4.25e-03

Pos logit contributions
ber+0.036
uddled+0.034
ves+0.034
and+0.034
rd+0.034
Neg logit contributions
 need-0.044
 earthquake-0.041
umb-0.041
hing-0.040
 lose-0.038
Token
Token ID26391
Feature activation-6.08e-02

Pos logit contributions
ber+0.007
uddled+0.007
ves+0.007
 belts+0.007
 else+0.006
Neg logit contributions
umb-0.006
 need-0.006
hing-0.005
 earthquake-0.005
umper-0.005
Token
Token ID129
Feature activation+6.25e-03

Pos logit contributions
edly+0.084
ber+0.079
Just+0.077
 belts+0.074
five+0.073
Neg logit contributions
 need-0.111
umb-0.107
 crash-0.098
 apart-0.096
umper-0.095
Token
Token ID241
Feature activation-4.42e-02

Pos logit contributions
umb+0.010
 need+0.010
py+0.010
umper+0.009
hing+0.009
Neg logit contributions
ber-0.010
ival-0.009
uddled-0.009
Of-0.009
 belts-0.009
TokenI
Token ID40
Feature activation-3.29e-02

Pos logit contributions
uddled+0.062
ber+0.061
ves+0.060
 belts+0.060
ival+0.060
Neg logit contributions
 need-0.078
umb-0.075
umper-0.067
ained-0.066
py-0.066
Token can
Token ID460
Feature activation-9.81e-03

Pos logit contributions
ber+0.065
 belts+0.064
Just+0.063
edly+0.063
rd+0.061
Neg logit contributions
 need-0.044
umb-0.041
umper-0.036
py-0.036
hing-0.035
Tokenâ
Token ID22940
Feature activation+1.22e-02

Pos logit contributions
ber+0.022
Just+0.021
 belts+0.021
rd+0.021
ager+0.021
Neg logit contributions
 need-0.010
umb-0.010
py-0.009
umper-0.009
hing-0.008
Token
Token ID26391
Feature activation-5.33e-02

Pos logit contributions
 need+0.020
umb+0.020
umper+0.019
hing+0.019
py+0.018
Neg logit contributions
ber-0.020
 belts-0.019
uddled-0.018
Just-0.018
edly-0.018
Token
Token ID8151
Feature activation-3.39e-02

Pos logit contributions
edly+0.082
Just+0.080
ber+0.078
 belts+0.077
uddled+0.077
Neg logit contributions
 need-0.089
umb-0.087
umper-0.078
 apart-0.075
 crash-0.075